Query lcl|NC_021535.1_cdsid_YP_008126260.1 [gene=22] [protein=hypothetical protein] [protein_id=YP_008126260.1] [location=13732..14091] Match_columns 119 No_of_seqs 21 out of 24 Neff 3.7 Searched_HMMs 1612 Date Thu Nov 7 16:54:41 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_19 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_19_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:7776 Length: 119 # 100.0 8.7E-63 5.4E-66 360.8 9.7 119 1-119 1-119 (119) 2 protein:vir:78298 Length: 131 100.0 1.5E-59 9.3E-63 343.1 9.4 117 1-119 1-117 (131) 3 protein:vir:78503 Length: 131 100.0 1.5E-59 9.3E-63 343.1 9.4 117 1-119 1-117 (131) 4 protein:vir:2347 Length: 131 # 100.0 1.5E-59 9.3E-63 343.1 9.4 117 1-119 1-117 (131) 5 protein:vir:104091 Length: 111 100.0 1.5E-48 9.1E-52 282.8 7.7 107 1-119 1-108 (111) 6 protein:vir:2435 Length: 111 # 100.0 1.4E-47 8.5E-51 277.5 7.9 107 1-119 1-108 (111) 7 protein:vir:4230 Length: 111 # 100.0 3.3E-47 2.1E-50 275.4 8.3 107 1-119 1-108 (111) 8 protein:vir:96829 Length: 135 83.8 0.05 3.1E-05 27.7 8.5 112 1-119 1-115 (135) 9 protein:vir:96121 Length: 137 83.5 0.053 3.3E-05 27.6 8.4 112 1-119 1-117 (137) 10 protein:vir:94796 Length: 137 81.4 0.068 4.2E-05 27.0 8.2 112 1-119 1-117 (137) 11 protein:vir:95894 Length: 137 78.2 0.11 6.8E-05 25.9 8.3 112 1-119 1-117 (137) 12 protein:vir:93738 Length: 137 78.1 0.11 6.8E-05 25.8 8.2 112 1-119 1-117 (137) 13 protein:vir:94490 Length: 137 78.1 0.11 6.8E-05 25.8 8.2 112 1-119 1-117 (137) 14 protein:vir:97427 Length: 137 78.1 0.11 6.8E-05 25.8 8.2 112 1-119 1-117 (137) 15 protein:vir:94108 Length: 149 77.5 0.087 5.4E-05 26.4 7.5 111 1-119 13-141 (149) 16 protein:vir:105467 Length: 144 77.4 0.036 2.2E-05 28.5 5.4 112 1-119 1-125 (144) 17 protein:vir:105916 Length: 149 76.7 0.094 5.8E-05 26.2 7.5 109 1-119 13-129 (149) 18 protein:vir:107099 Length: 137 76.6 0.13 8.3E-05 25.4 8.4 111 1-119 1-117 (137) 19 protein:vir:105330 Length: 137 76.3 0.13 8.1E-05 25.4 8.2 112 1-119 1-117 (137) 20 protein:vir:5978 Length: 144 # 66.1 0.27 0.00017 23.7 8.3 107 1-119 4-120 (144) 21 protein:vir:966 Length: 123 # 58.2 0.21 0.00013 24.3 5.4 99 1-119 1-110 (123) 22 protein:vir:79034 Length: 141 53.2 0.22 0.00013 24.2 4.6 105 1-119 1-120 (141) 23 protein:vir:102963 Length: 163 53.0 0.19 0.00012 24.5 4.3 104 1-119 1-139 (163) 24 protein:vir:107545 Length: 140 52.8 0.55 0.00034 22.0 7.4 110 1-119 1-116 (140) 25 protein:vir:97982 Length: 140 52.8 0.55 0.00034 22.0 7.4 110 1-119 1-116 (140) 26 protein:vir:95062 Length: 116 43.3 0.47 0.00029 22.4 4.8 92 16-119 1-96 (116) 27 protein:vir:102338 Length: 116 42.4 0.39 0.00024 22.9 4.1 92 20-119 1-100 (116) 28 protein:vir:94654 Length: 142 36.0 1.2 0.00074 20.2 7.6 114 1-119 1-131 (142) 29 protein:vir:106041 Length: 137 26.3 1.7 0.001 19.4 4.9 104 1-119 4-113 (137) 30 protein:vir:97327 Length: 116 25.4 1.4 0.00087 19.8 4.3 94 10-119 1-108 (116) 31 protein:vir:1243 Length: 116 # 25.4 1.4 0.00087 19.8 4.3 94 10-119 1-108 (116) 32 protein:vir:95789 Length: 114 22.1 1.5 0.00095 19.6 3.8 92 1-119 1-98 (114) 33 protein:vir:2507 Length: 83 # 21.9 1.4 0.00085 19.8 3.5 82 5-106 1-83 (83) 34 protein:vir:99528 Length: 92 # 20.6 2.7 0.0017 18.2 5.7 79 1-89 1-92 (92) No 1 >protein:vir:7776 Length: 119 # NCBI annotation: gp21 # Family: family:all:2819 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817610;genbank:gi:29566040;genbank:GeneID:1259234 Probab=100.00 E-value=8.7e-63 Score=360.83 Aligned_cols=119 Identities=100% Similarity=1.472 Sum_probs=118.9 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHHHHHHHHHHhhcCCcccccccccceeeecccCCcceEEEecCCCcce Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHLTRVTVTRGDVDSFINLEAPNAMA 80 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki~gp~~~a~I~~~~G~vD~~V~L~d~~ala 80 (119) ||++|++++||++|||||+||++|++|+|+|.+|||+||++||+||+|+||+||+|+++||+++|||||||+|||||||| T Consensus 1 Ma~~y~~~~ln~vvA~l~~v~~~v~~E~~~v~~RA~~NLa~Arast~~~k~~gp~~~~~Id~a~gdvD~~v~l~apna~a 80 (119) T protein:vir:77 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHLTRVTVTRGDVDSFINLEAPNAMA 80 (119) T ss_pred CcccccccchhhhhhhchhHHHHHHHHHHhhhHHHHHHHHHHhhcccccceecCCCcceeccccCCcceEEeecCCCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccccccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 81 IEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 81 IEfGH~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) |||||.|||+|+||++|+++|||+|+||||||+|||||| T Consensus 81 IEfGhapsgvf~pG~~yg~vdtkapeglYILTrAA~l~g 119 (119) T protein:vir:77 81 IEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) T ss_pred hcccccccceecccccccccCCCCCCCeEeeecccccCC Confidence 999999999999999999999999999999999999999 No 2 >protein:vir:78298 Length: 131 # NCBI annotation: gp18 # Family: family:all:2819 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491670;genbank:gi:157786494;genbank:GeneID:5625776 Probab=100.00 E-value=1.5e-59 Score=343.08 Aligned_cols=117 Identities=55% Similarity=0.938 Sum_probs=116.1 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHHHHHHHHHHhhcCCcccccccccceeeecccCCcceEEEecCCCcce Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHLTRVTVTRGDVDSFINLEAPNAMA 80 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki~gp~~~a~I~~~~G~vD~~V~L~d~~ala 80 (119) ||+||++++||++|||||+||++|++|+|+|++|||+||++||+||+|+||+||+|+++||+++|||||||+|||||||| T Consensus 1 ma~~~~~~~ln~vvA~l~~vk~~v~~E~~~v~~RA~~NLa~Arast~~~k~~gp~hla~I~~~~gdvD~~v~Ldapna~a 80 (131) T protein:vir:78 1 MPLYYGRSGLNKVVSHLPGVVHEMRSEADEVADRAKANLAAARASTQWEKIHGPDHLTKITRTNGSVDAYVNMEAPSPES 80 (131) T ss_pred CcccccchhhhhhhhhchhHHHHHHHHHHhhhHHHHHHHHHHhhcCcccccccCCCcceeeeeeCCcceEEeecCCCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccccccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 81 IEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 81 IEfGH~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) |||||.|||+|+| +||+.+ +|+|+||||||+|||||| T Consensus 81 IEfGhapsgvf~p-~k~G~~-tkapeglYILTrAA~lgg 117 (131) T protein:vir:78 81 IEYGHYPSGVFDP-EKYGRV-TKAPQGLYILTGAAGFGG 117 (131) T ss_pred heeccccccccCC-cccCcc-cCCCCcceeeeccccccc Confidence 9999999999999 899997 999999999999999999 No 3 >protein:vir:78503 Length: 131 # NCBI annotation: gp18 # Family: family:all:2819 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491589;genbank:gi:157786412;genbank:GeneID:5625655 Probab=100.00 E-value=1.5e-59 Score=343.08 Aligned_cols=117 Identities=55% Similarity=0.938 Sum_probs=116.1 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHHHHHHHHHHhhcCCcccccccccceeeecccCCcceEEEecCCCcce Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHLTRVTVTRGDVDSFINLEAPNAMA 80 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki~gp~~~a~I~~~~G~vD~~V~L~d~~ala 80 (119) ||+||++++||++|||||+||++|++|+|+|++|||+||++||+||+|+||+||+|+++||+++|||||||+|||||||| T Consensus 1 ma~~~~~~~ln~vvA~l~~vk~~v~~E~~~v~~RA~~NLa~Arast~~~k~~gp~hla~I~~~~gdvD~~v~Ldapna~a 80 (131) T protein:vir:78 1 MPLYYGRSGLNKVVSHLPGVVHEMRSEADEVADRAKANLAAARASTQWEKIHGPDHLTKITRTNGSVDAYVNMEAPSPES 80 (131) T ss_pred CcccccchhhhhhhhhchhHHHHHHHHHHhhhHHHHHHHHHHhhcCcccccccCCCcceeeeeeCCcceEEeecCCCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccccccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 81 IEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 81 IEfGH~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) |||||.|||+|+| +||+.+ +|+|+||||||+|||||| T Consensus 81 IEfGhapsgvf~p-~k~G~~-tkapeglYILTrAA~lgg 117 (131) T protein:vir:78 81 IEYGHYPSGVFDP-EKYGRV-TKAPQGLYILTGAAGFGG 117 (131) T ss_pred heeccccccccCC-cccCcc-cCCCCcceeeeccccccc Confidence 9999999999999 899997 999999999999999999 No 4 >protein:vir:2347 Length: 131 # NCBI annotation: gp17 # Family: family:all:2819 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075284;genbank:gi:12657871;genbank:GeneID:920131 Probab=100.00 E-value=1.5e-59 Score=343.08 Aligned_cols=117 Identities=55% Similarity=0.938 Sum_probs=116.1 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHHHHHHHHHHhhcCCcccccccccceeeecccCCcceEEEecCCCcce Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHLTRVTVTRGDVDSFINLEAPNAMA 80 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki~gp~~~a~I~~~~G~vD~~V~L~d~~ala 80 (119) ||+||++++||++|||||+||++|++|+|+|++|||+||++||+||+|+||+||+|+++||+++|||||||+|||||||| T Consensus 1 ma~~~~~~~ln~vvA~l~~vk~~v~~E~~~v~~RA~~NLa~Arast~~~k~~gp~hla~I~~~~gdvD~~v~Ldapna~a 80 (131) T protein:vir:23 1 MPLYYGRSGLNKVVSHLPGVVHEMRSEADEVADRAKANLAAARASTQWEKIHGPDHLTKITRTNGSVDAYVNMEAPSPES 80 (131) T ss_pred CcccccchhhhhhhhhchhHHHHHHHHHHhhhHHHHHHHHHHhhcCcccccccCCCcceeeeeeCCcceEEeecCCCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccccccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 81 IEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 81 IEfGH~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) |||||.|||+|+| +||+.+ +|+|+||||||+|||||| T Consensus 81 IEfGhapsgvf~p-~k~G~~-tkapeglYILTrAA~lgg 117 (131) T protein:vir:23 81 IEYGHYPSGVFDP-EKYGRV-TKAPQGLYILTGAAGFGG 117 (131) T ss_pred heeccccccccCC-cccCcc-cCCCCcceeeeccccccc Confidence 9999999999999 899997 999999999999999999 No 5 >protein:vir:104091 Length: 111 # NCBI annotation: gp23 # Family: family:all:2819 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655602;genbank:gi:109392473;genbank:GeneID:4156959 Probab=100.00 E-value=1.5e-48 Score=282.82 Aligned_cols=107 Identities=41% Similarity=0.623 Sum_probs=104.3 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHHHHHHHHHHhhcCCcccc-cccccceeeecccCCcceEEEecCCCcc Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRKAEANLAQARASTRWHKI-FGPDHLTRVTVTRGDVDSFINLEAPNAM 79 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki-~gp~~~a~I~~~~G~vD~~V~L~d~~al 79 (119) ||++|.+. |+++|||++||++|++|++++.+|||+||+|||+|++|+|+ ++| ++||+++||||+||+|++|||| T Consensus 1 ma~vy~~~--n~vva~l~~vk~avr~e~~~v~~RAr~nLa~Arastr~~~~G~~p---t~I~~a~gDVD~~v~l~apnam 75 (111) T protein:vir:10 1 MAKVYANA--NEAAARHVDTKRAVRRVNRDVEGRARSNLAQANSTTRVTPTGYFP---AEIDSSEHDVDCYTTLHAPNAM 75 (111) T ss_pred Cccccccc--CcEEeechhhHHHHHHHHhhhhhHHHHHHHHhhhcccccccCccc---ceeeeecCCcceEEEecCCCch Confidence 99999999 89999999999999999999999999999999999999999 999 9999999999999999999999 Q ss_pred eeeccccccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 80 AIEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 80 aIEfGH~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) ||||||.|||+| .+.+||+|+||||||+|| .|| T Consensus 76 aiEfGh~psG~f------~~~~tkap~glyILTrAA-~gg 108 (111) T protein:vir:10 76 ALEFGHEPSGVF------AGTDTKSPDPQYILTRAA-YGG 108 (111) T ss_pred hhhhccCcccee------cccccCCCCCceeeeecc-ccc Confidence 999999999998 467799999999999999 999 No 6 >protein:vir:2435 Length: 111 # NCBI annotation: gp21 # Family: family:all:2819 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046837;genbank:gi:9630405;genbank:GeneID:1261628 Probab=100.00 E-value=1.4e-47 Score=277.50 Aligned_cols=107 Identities=43% Similarity=0.622 Sum_probs=102.3 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHHHHHHHHHHhhcCCcccc-cccccceeeecccCCcceEEEecCCCcc Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRKAEANLAQARASTRWHKI-FGPDHLTRVTVTRGDVDSFINLEAPNAM 79 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki-~gp~~~a~I~~~~G~vD~~V~L~d~~al 79 (119) ||++|.++ |++++||++|+++|++|++++.+||++||+++|+|++|+|+ +||++++++ .||||+||+|++|||| T Consensus 1 makvyana--N~v~ahl~~vk~avr~Ea~ev~~RAr~NLA~arastri~k~g~~P~~I~~~---~gdvD~~~~l~APnam 75 (111) T protein:vir:24 1 MAKVYANA--NKVAARHVDVRKRVKEERDGVTRRARTNLARANKTTRITKEGYFPASIEEV---DGDVDFHTVLHAPNAF 75 (111) T ss_pred Ccccccch--hhHhhhchhHHHHHHHHHhhhhhhHHHhHHHhhhcceecccccCccccccc---cCCcceEEEecCCChh Confidence 99999999 99999999999999999999999999999999999999999 999985554 5999999999999999 Q ss_pred eeeccccccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 80 AIEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 80 aIEfGH~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) ||||||+|||+| ++.+||+|+||||||+|| +|| T Consensus 76 AiEfGH~PSG~F------~g~dTKaP~glYILt~AA-~~g 108 (111) T protein:vir:24 76 ALEFGHAPSGFF------AGTDTKPPDPEYILTRAA-IGG 108 (111) T ss_pred hhhccCCCccee------cccccCCCCCceeeeccc-ccc Confidence 999999999999 477899999999999999 999 No 7 >protein:vir:4230 Length: 111 # NCBI annotation: predicted 12.0Kd protein # Family: family:all:2819 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039685;swissprot:sw:q05227;genbank:gi:9625451;uniprot:Q05227;genbank:GeneID:2942925 Probab=100.00 E-value=3.3e-47 Score=275.40 Aligned_cols=107 Identities=44% Similarity=0.642 Sum_probs=103.1 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHHHHHHHHHHhhcCC-cccccccccceeeecccCCcceEEEecCCCcc Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRKAEANLAQARASTR-WHKIFGPDHLTRVTVTRGDVDSFINLEAPNAM 79 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~RA~anLa~aR~s~~-~~ki~gp~~~a~I~~~~G~vD~~V~L~d~~al 79 (119) ||++|.++ |++++||++|+++|++|++++.+||++||+++|+|++ |.|+++|++ |+.++||||+||+|++|||| T Consensus 1 makvyana--N~v~a~~~~~k~avr~E~~~v~~RAraNLA~a~astri~~~g~~p~~---it~~~gdvD~~~~l~APnam 75 (111) T protein:vir:42 1 MAKVYANA--NKVAARYVETRDAVRDERNKVTRRAKANLARQNSTTRITDEGYFPAT---ITEQDGDVDFHTILNAPNAL 75 (111) T ss_pred Ccceecch--hhhhhhchhHHHHHHHHHhhhhhhHHHhHHHhhhccccccccccCce---eecccCCcceEEEecCCChh Confidence 99999999 9999999999999999999999999999999999999 999999988 56677999999999999999 Q ss_pred eeeccccccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 80 AIEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 80 aIEfGH~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) ||||||+|||+| ++.+||+|+||||||+|| +|| T Consensus 76 AiEfGH~PSG~F------~g~dTKaPe~~YILt~AA-igg 108 (111) T protein:vir:42 76 ALEFGHAPSGFF------AGTDTKPPEATYILTRAA-IGG 108 (111) T ss_pred hhhcccCCccee------cccccCCCCceeeeeccc-ccc Confidence 999999999999 477899999999999999 999 No 8 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=83.83 E-value=0.05 Score=27.70 Aligned_cols=112 Identities=15% Similarity=0.098 Sum_probs=52.5 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHH-HHHHHHHHhhcCCcccccccccceeee--cccCCcceEEEecCCC Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRK-AEANLAQARASTRWHKIFGPDHLTRVT--VTRGDVDSFINLEAPN 77 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~R-A~anLa~aR~s~~~~ki~gp~~~a~I~--~~~G~vD~~V~L~d~~ 77 (119) ||++. .-|+++.+.+......+++.......+ |+....+++...++ ++|- ...+|+ ++.+..-.-|.=+.+- T Consensus 1 Ma~~~--~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apv--dTG~-Lr~SI~~~~~~~g~~~~V~~~~~Y 75 (135) T protein:vir:96 1 MAKVK--YGADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPV--DTGF-LRQSTTVDFENGGFTGVVKIGSNY 75 (135) T ss_pred Cchhh--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--cchh-hhcceeEEeecCcEEEEEecCCCc Confidence 99973 244566666555444444333322222 22222333333321 2221 223443 3344444556656778 Q ss_pred cceeeccccccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 78 AMAIEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 78 alaIEfGH~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) |.-+||||.+.....+.++-.....+-+.|.++-|+ |.-+ T Consensus 76 A~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~--~~~a 115 (135) T protein:vir:96 76 AVYVNYGTGIYATKGSRAHKIPWTYKDPNGKWHTTY--GQMP 115 (135) T ss_pred cchhhcccccccCCCccccccccccccCCcceeecC--CcCC Confidence 999999997765443322212211233445555554 3333 No 9 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=83.53 E-value=0.053 Score=27.59 Aligned_cols=112 Identities=23% Similarity=0.195 Sum_probs=53.6 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHH-HHHHHHHHhhcCCcccccccccceee--ecccCCcceEEEecCCC Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRK-AEANLAQARASTRWHKIFGPDHLTRV--TVTRGDVDSFINLEAPN 77 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~R-A~anLa~aR~s~~~~ki~gp~~~a~I--~~~~G~vD~~V~L~d~~ 77 (119) ||++. ..+++++..|......+++...+...+ |......++...+. ++|- ...+| .+..+..-..|.-+.+- T Consensus 1 Ma~~~--~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~pv--dTG~-L~~Si~~~~~~~g~~~~V~~~~~Y 75 (137) T protein:vir:96 1 MAKVK--YGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAPV--DLGF-LKESIDFKVTDGGFSSVISVGAEY 75 (137) T ss_pred CchhH--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--Cccc-hhcCceeEeecCceEEEEecCCCc Confidence 99997 245566666655555554444433333 23333344443331 2332 11333 33455666778888888 Q ss_pred cceeeccccccceecCcccccCC--CCCCCCceeeeecccccCC Q lcl|NC_021535. 78 AMAIEFGHQPSGVFGPGGMFGHL--DTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 78 alaIEfGH~~sg~f~~g~k~~~~--~tka~~GLyILT~AAgl~~ 119 (119) |.-+||||-+...-..+.++... --+.+.|-++-|+ |.-+ T Consensus 76 A~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--g~~a 117 (137) T protein:vir:96 76 AIYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTY--GQQA 117 (137) T ss_pred ccccccCccccccCCCccccccccceeeccCcceeecC--CCCC Confidence 99999998544322221111000 0112234444443 2222 No 10 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=81.39 E-value=0.068 Score=26.97 Aligned_cols=112 Identities=16% Similarity=0.117 Sum_probs=49.7 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHH-HHHHHHHHhhcCCcccccccccceeee--cccCCcceEEEecCCC Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRK-AEANLAQARASTRWHKIFGPDHLTRVT--VTRGDVDSFINLEAPN 77 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~R-A~anLa~aR~s~~~~ki~gp~~~a~I~--~~~G~vD~~V~L~d~~ 77 (119) ||++. ..++++...|......+++.......+ |......++...++ +.|- ...+|+ +..+.+-..|.-+.+- T Consensus 1 Ma~~~--~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPv--dTG~-Lr~SI~~~~~~~~~~~~V~~~~~Y 75 (137) T protein:vir:94 1 MAKVK--YGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMPV--DTGY-LRESVTMDFKDGGFTGVINIGSEY 75 (137) T ss_pred CchhH--HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--Ccch-hhcCceeEeecCcEEEEEecCCCc Confidence 99994 233455555544444443332222222 22222333332221 2221 224443 3445566677777888 Q ss_pred cceeeccccccceecCcccccCCC--CCCCCceeeeecccccCC Q lcl|NC_021535. 78 AMAIEFGHQPSGVFGPGGMFGHLD--TKAPEGLYIITSAAGLRG 119 (119) Q Consensus 78 alaIEfGH~~sg~f~~g~k~~~~~--tka~~GLyILT~AAgl~~ 119 (119) |.-+||||.+......+.++-... -..+.|-++-|+ |.-. T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--g~~a 117 (137) T protein:vir:94 76 AIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTK--GQHA 117 (137) T ss_pred ccccccCccccccCCCcccccccccceeccCCceeecC--CcCC Confidence 999999987655443322211110 011233333333 1122 No 11 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=78.19 E-value=0.11 Score=25.85 Aligned_cols=112 Identities=16% Similarity=0.119 Sum_probs=51.9 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHHH-HHHHHHHhhcCCcccccccccceeee--cccCCcceEEEecCCC Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRKA-EANLAQARASTRWHKIFGPDHLTRVT--VTRGDVDSFINLEAPN 77 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~RA-~anLa~aR~s~~~~ki~gp~~~a~I~--~~~G~vD~~V~L~d~~ 77 (119) ||++. ..++++...|......+++.......++ ......++...++ +.|- ...+|+ +..+..-..|.-+.+- T Consensus 1 Ma~~~--~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv--~TG~-L~~Si~~~~~~~~~~~~V~~~~~Y 75 (137) T protein:vir:95 1 MAKVK--YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPV--DTGY-LRESVTMDFKDGGFTGVINIGSEY 75 (137) T ss_pred CchhH--HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--cchh-hhcCeeeEeeCCceEEEEecCCCc Confidence 99987 2345555555544444443333332222 2222333333332 2222 223443 3445555566666778 Q ss_pred cceeeccccccceecCcccccCCC--CCCCCceeeeecccccCC Q lcl|NC_021535. 78 AMAIEFGHQPSGVFGPGGMFGHLD--TKAPEGLYIITSAAGLRG 119 (119) Q Consensus 78 alaIEfGH~~sg~f~~g~k~~~~~--tka~~GLyILT~AAgl~~ 119 (119) |.-+||||.+......+.++-... -..+.|-++-|+ |.-. T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--g~~a 117 (137) T protein:vir:95 76 AIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTK--GQHA 117 (137) T ss_pred ccccccCccccccCCCcccccccccceeccCcceeecC--CCCC Confidence 999999997765554432211110 012334444443 2222 No 12 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=78.05 E-value=0.11 Score=25.83 Aligned_cols=112 Identities=15% Similarity=0.100 Sum_probs=53.3 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHH-HHHHHHHHhhcCCcccccccccceeee--cccCCcceEEEecCCC Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRK-AEANLAQARASTRWHKIFGPDHLTRVT--VTRGDVDSFINLEAPN 77 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~R-A~anLa~aR~s~~~~ki~gp~~~a~I~--~~~G~vD~~V~L~d~~ 77 (119) ||++. ..+++++..|......+.+.......+ |+....+++...++ +.|- ...+|+ +..+.+-..|.-..+- T Consensus 1 Ma~~~--~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPv--dTG~-Lr~SI~~~~~~~~~~~~V~~~~~Y 75 (137) T protein:vir:93 1 MAKVK--YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPV--DTGY-LRESVTMDFKDSGFTGVINIGSEY 75 (137) T ss_pred CchhH--HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--cccc-hhccceeEeecCceEEEEecCCCc Confidence 99987 234455555554444444433333322 22233344443332 2222 223443 3445555667777788 Q ss_pred cceeeccccccceecCcccccCCC--CCCCCceeeeecccccCC Q lcl|NC_021535. 78 AMAIEFGHQPSGVFGPGGMFGHLD--TKAPEGLYIITSAAGLRG 119 (119) Q Consensus 78 alaIEfGH~~sg~f~~g~k~~~~~--tka~~GLyILT~AAgl~~ 119 (119) |.-+||||.+......+.++-... -..+.|-++-|+ |.-. T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--g~~a 117 (137) T protein:vir:93 76 AIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTK--GQHA 117 (137) T ss_pred ccccccCccccccCCCcccccccccceeccCcceeecC--CCCC Confidence 999999997765444332211110 012344444443 2222 No 13 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=78.05 E-value=0.11 Score=25.83 Aligned_cols=112 Identities=15% Similarity=0.100 Sum_probs=53.3 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHH-HHHHHHHHhhcCCcccccccccceeee--cccCCcceEEEecCCC Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRK-AEANLAQARASTRWHKIFGPDHLTRVT--VTRGDVDSFINLEAPN 77 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~R-A~anLa~aR~s~~~~ki~gp~~~a~I~--~~~G~vD~~V~L~d~~ 77 (119) ||++. ..+++++..|......+.+.......+ |+....+++...++ +.|- ...+|+ +..+.+-..|.-..+- T Consensus 1 Ma~~~--~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPv--dTG~-Lr~SI~~~~~~~~~~~~V~~~~~Y 75 (137) T protein:vir:94 1 MAKVK--YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPV--DTGY-LRESVTMDFKDSGFTGVINIGSEY 75 (137) T ss_pred CchhH--HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--cccc-hhccceeEeecCceEEEEecCCCc Confidence 99987 234455555554444444433333322 22233344443332 2222 223443 3445555667777788 Q ss_pred cceeeccccccceecCcccccCCC--CCCCCceeeeecccccCC Q lcl|NC_021535. 78 AMAIEFGHQPSGVFGPGGMFGHLD--TKAPEGLYIITSAAGLRG 119 (119) Q Consensus 78 alaIEfGH~~sg~f~~g~k~~~~~--tka~~GLyILT~AAgl~~ 119 (119) |.-+||||.+......+.++-... -..+.|-++-|+ |.-. T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--g~~a 117 (137) T protein:vir:94 76 AIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTK--GQHA 117 (137) T ss_pred ccccccCccccccCCCcccccccccceeccCcceeecC--CCCC Confidence 999999997765444332211110 012344444443 2222 No 14 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=78.05 E-value=0.11 Score=25.83 Aligned_cols=112 Identities=15% Similarity=0.100 Sum_probs=53.3 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHH-HHHHHHHHhhcCCcccccccccceeee--cccCCcceEEEecCCC Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRK-AEANLAQARASTRWHKIFGPDHLTRVT--VTRGDVDSFINLEAPN 77 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~R-A~anLa~aR~s~~~~ki~gp~~~a~I~--~~~G~vD~~V~L~d~~ 77 (119) ||++. ..+++++..|......+.+.......+ |+....+++...++ +.|- ...+|+ +..+.+-..|.-..+- T Consensus 1 Ma~~~--~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPv--dTG~-Lr~SI~~~~~~~~~~~~V~~~~~Y 75 (137) T protein:vir:97 1 MAKVK--YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPV--DTGY-LRESVTMDFKDSGFTGVINIGSEY 75 (137) T ss_pred CchhH--HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--cccc-hhccceeEeecCceEEEEecCCCc Confidence 99987 234455555554444444433333322 22233344443332 2222 223443 3445555667777788 Q ss_pred cceeeccccccceecCcccccCCC--CCCCCceeeeecccccCC Q lcl|NC_021535. 78 AMAIEFGHQPSGVFGPGGMFGHLD--TKAPEGLYIITSAAGLRG 119 (119) Q Consensus 78 alaIEfGH~~sg~f~~g~k~~~~~--tka~~GLyILT~AAgl~~ 119 (119) |.-+||||.+......+.++-... -..+.|-++-|+ |.-. T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--g~~a 117 (137) T protein:vir:97 76 AIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTK--GQHA 117 (137) T ss_pred ccccccCccccccCCCcccccccccceeccCcceeecC--CCCC Confidence 999999997765444332211110 012344444443 2222 No 15 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=77.53 E-value=0.087 Score=26.39 Aligned_cols=111 Identities=18% Similarity=0.153 Sum_probs=46.9 Q ss_pred CcccccchhhhhhhhcChh----HHHHHHHHHHhhhHHHHHHHHHHhhcCCcccccccccceeee--cccCCcceEEEec Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDG----VKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHLTRVT--VTRGDVDSFINLE 74 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~g----V~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki~gp~~~a~I~--~~~G~vD~~V~L~ 74 (119) ||++. .-|+++...|.. ++..+++...+.+.+. ...++...+ .+.|- ...+|+ ++.+.+-..|.-+ T Consensus 13 Ma~~~--~Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v---~~~ak~~aP--vdTG~-Lr~SI~~~~~~~g~~~~V~~~ 84 (149) T protein:vir:94 13 MAKVK--YGADSMVVELDKFDKKIEEWVKKGIAKTTTKI---YNTAVALAP--VDLGF-LEESIDFKYFDGGLSSVISVG 84 (149) T ss_pred HHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHhCC--cccch-hhcCeeEEeeCCcEEEEEecC Confidence 99875 233444444433 3333333333332222 222333222 12222 223343 3444566667777 Q ss_pred CCCcceeeccccccceecCcccccCCCCCC--CCceeeeec----------ccccCC Q lcl|NC_021535. 75 APNAMAIEFGHQPSGVFGPGGMFGHLDTKA--PEGLYIITS----------AAGLRG 119 (119) Q Consensus 75 d~~alaIEfGH~~sg~f~~g~k~~~~~tka--~~GLyILT~----------AAgl~~ 119 (119) .+-|.-+||||.+.+.-..+.+.....-++ +.|-++-|+ |....= T Consensus 85 ~~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~ 141 (149) T protein:vir:94 85 ADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGR 141 (149) T ss_pred CCcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHH Confidence 788999999996655433333222111100 122222222 211110 No 16 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=77.37 E-value=0.036 Score=28.48 Aligned_cols=112 Identities=18% Similarity=0.117 Sum_probs=60.3 Q ss_pred Cccc-ccchhhhhhhhcChh------HHHHHHHHHHhhhHHHHHHHHHHhhcCCcccccccccceee-----ecccCCcc Q lcl|NC_021535. 1 MARL-IGQKAMNHVISHLDG------VKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHLTRV-----TVTRGDVD 68 (119) Q Consensus 1 MA~~-yg~~~l~~vvA~~~g------V~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki~gp~~~a~I-----~~~~G~vD 68 (119) |+.- +--+.|++++..+.. ++..+++...+++.+..+. +.+.++. ++|- ..-++ ..+.+..- T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~---vk~~tPV--dTG~-Lr~S~~~~~~~~~~~~~~ 74 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRI---LEANTPV--KQGN-LRRSWTAEGPTYGCGGWT 74 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHH---HHHhCCC--Ccch-hccceeecceeeecCeeE Confidence 6642 122444555554433 4445555555555555444 3333331 2221 11222 23334444 Q ss_pred eEEEecCCCcceeeccc-cccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 69 SFINLEAPNAMAIEFGH-QPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 69 ~~V~L~d~~alaIEfGH-~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) .-|.=..+=|--+|||| +..|-|-|. .+.+...+|++|.|.|++|...-- T Consensus 75 ~~V~n~~~YA~~VE~Ghr~~~G~~v~~-~~~~~~~g~V~G~~~~~~a~~~~~ 125 (144) T protein:vir:10 75 IKLINNAEYASYVESGHRQTPGRYVPV-LKKRLVRDWVPGQFYMKKSIPQIQ 125 (144) T ss_pred EEEecCCCcccccccceeecCCccccc-CCCccccceecCccchHHHHHHHH Confidence 44444556699999999 333433343 345556789999999999988766 No 17 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=76.69 E-value=0.094 Score=26.21 Aligned_cols=109 Identities=19% Similarity=0.179 Sum_probs=46.7 Q ss_pred CcccccchhhhhhhhcChh----HHHHHHHHHHhhhHHHHHHHHHHhhcCCcccccccccceeee--cccCCcceEEEec Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDG----VKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHLTRVT--VTRGDVDSFINLE 74 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~g----V~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki~gp~~~a~I~--~~~G~vD~~V~L~ 74 (119) ||++. .-|+++...+.. ++.++++...+.+.+.+. +++...+ .++|- ...+|+ +..+.+-..|.-+ T Consensus 13 Ma~v~--~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~---~ak~~aP--vdTG~-L~~SI~~~~~~~g~~~~V~~~ 84 (149) T protein:vir:10 13 MAKVK--YGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYN---TAVALAP--VDLGF-LEESIDFKYFDGGLSSVISVG 84 (149) T ss_pred hHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHhCC--cccch-hhccceEEecCCcEEEEEecC Confidence 99983 234444444433 333333333333222222 2222222 12222 223343 3445566677777 Q ss_pred CCCcceeeccccccceecCcccccCCCC--CCCCceeeeecccccCC Q lcl|NC_021535. 75 APNAMAIEFGHQPSGVFGPGGMFGHLDT--KAPEGLYIITSAAGLRG 119 (119) Q Consensus 75 d~~alaIEfGH~~sg~f~~g~k~~~~~t--ka~~GLyILT~AAgl~~ 119 (119) .+-|.-+||||.+.+.-..+.+.....- +-+.|-++-|+ |.-. T Consensus 85 ~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--g~~a 129 (149) T protein:vir:10 85 ADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTY--GQAP 129 (149) T ss_pred CCcccccccCccccccCCcccccccccceeeccccceecCC--CCCC Confidence 8889999999965443322222111110 01122223222 1222 No 18 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=76.63 E-value=0.13 Score=25.38 Aligned_cols=111 Identities=18% Similarity=0.203 Sum_probs=49.8 Q ss_pred Ccccc-cchhhhhhhhcChhHHHHHHHHHHhhhHHHHH-HHHHHhhcCCcccccccccceeee--cccCCcceEEEecCC Q lcl|NC_021535. 1 MARLI-GQKAMNHVISHLDGVKDAVYAEAKERGRKAEA-NLAQARASTRWHKIFGPDHLTRVT--VTRGDVDSFINLEAP 76 (119) Q Consensus 1 MA~~y-g~~~l~~vvA~~~gV~~~v~~e~~~i~~RA~a-nLa~aR~s~~~~ki~gp~~~a~I~--~~~G~vD~~V~L~d~ 76 (119) ||++. |-. +++..+.....++.+.......++-. ...+++...++ ++|- ...+|+ +..+.+-..|.-..+ T Consensus 1 Ma~~~~Gl~---~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv--dTG~-Lr~SI~~~~~~~~~~~~V~~~~~ 74 (137) T protein:vir:10 1 MAKVKYGNW---ELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPV--DTGY-LRESVSMDFKKGGLTGVINIGSE 74 (137) T ss_pred CchhHhhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--Ccch-hhcCeeEEeeCCcEEEEEecCCC Confidence 99995 444 44444444333333333322222211 12222222221 2222 223443 455667778888888 Q ss_pred CcceeeccccccceecCcccccCCC--CCCCCceeeeecccccCC Q lcl|NC_021535. 77 NAMAIEFGHQPSGVFGPGGMFGHLD--TKAPEGLYIITSAAGLRG 119 (119) Q Consensus 77 ~alaIEfGH~~sg~f~~g~k~~~~~--tka~~GLyILT~AAgl~~ 119 (119) -|.-+|||+.+...-..+.++.... -..+.|-++-|+ |.-. T Consensus 75 Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--g~~a 117 (137) T protein:vir:10 75 YAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTK--GQHA 117 (137) T ss_pred cccccccCccccccCCCccccccccceeeccccceeccC--CCCC Confidence 8999999975544332222111110 122334444442 2222 No 19 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=76.33 E-value=0.13 Score=25.42 Aligned_cols=112 Identities=17% Similarity=0.178 Sum_probs=53.8 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHHHHHHH-HHHhhcCCcccccccccceee--ecccCCcceEEEecCCC Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRKAEANL-AQARASTRWHKIFGPDHLTRV--TVTRGDVDSFINLEAPN 77 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~RA~anL-a~aR~s~~~~ki~gp~~~a~I--~~~~G~vD~~V~L~d~~ 77 (119) ||++. ..++++...|.....++++.......++-..+ .+++...+ .++|- ...+| ++..+.+-..|.-+.+- T Consensus 1 Ma~~~--~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aP--v~TG~-Lr~SI~~~~~~~~~~~~V~~~~~Y 75 (137) T protein:vir:10 1 MAKVK--YGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMP--VDTGY-LRESVSMDFKKGGLTGVINIGSEY 75 (137) T ss_pred Cccch--hCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCcch-hhcCeeeEecCCcEEEEEecCCcc Confidence 99995 23445666665555555555544444433322 23333222 12332 22344 33445566777777888 Q ss_pred cceeeccccccceecCcccccCCC--CCCCCceeeeecccccCC Q lcl|NC_021535. 78 AMAIEFGHQPSGVFGPGGMFGHLD--TKAPEGLYIITSAAGLRG 119 (119) Q Consensus 78 alaIEfGH~~sg~f~~g~k~~~~~--tka~~GLyILT~AAgl~~ 119 (119) |.-+|||+.+.+....+.+..... -..+.|-|+-|+ |.-. T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--g~~a 117 (137) T protein:vir:10 76 AVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTK--GQHA 117 (137) T ss_pred ccccccCccccccCCCcccccccceeeeccccccccCC--CCCC Confidence 999999986654433322211110 011233333332 1222 No 20 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=66.07 E-value=0.27 Score=23.67 Aligned_cols=107 Identities=18% Similarity=0.185 Sum_probs=46.3 Q ss_pred Ccccc---cchhhhhhhhcC-----hhHHHHHHHHHHhhhHHHHHHHHHHhhcCCcccccccccceeeec--ccCCcceE Q lcl|NC_021535. 1 MARLI---GQKAMNHVISHL-----DGVKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHLTRVTV--TRGDVDSF 70 (119) Q Consensus 1 MA~~y---g~~~l~~vvA~~-----~gV~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki~gp~~~a~I~~--~~G~vD~~ 70 (119) |+--+ |.+.|.+.+... ..|+.+|.+-+.++...|+ ...++ ++| +...+|+. ..+.+... T Consensus 4 ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak-------~~apv--~TG-~Lr~SI~~~~~~~g~~~~ 73 (144) T protein:vir:59 4 MSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAA-------SLAPV--DEG-NLKNSIQIDYKNNGLTAE 73 (144) T ss_pred ceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HhCCc--cch-hhhcCeeEEeecCcEEEE Confidence 44434 333333222222 2233333333333333333 32221 122 22244543 34556667 Q ss_pred EEecCCCcceeeccccccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 71 INLEAPNAMAIEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 71 V~L~d~~alaIEfGH~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) |.-..+-|.-+|||+.+.+....+.+....---...|.|+-|+ |.-+ T Consensus 74 V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~--g~~a 120 (144) T protein:vir:59 74 ITVGAEYAIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQ--GAPA 120 (144) T ss_pred EecCCCccchhhcCccccccCCCccccccccccccccceecCC--CCCC Confidence 7777888999999997766544332211110011234444443 2222 No 21 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=58.19 E-value=0.21 Score=24.27 Aligned_cols=99 Identities=15% Similarity=0.188 Sum_probs=49.0 Q ss_pred Ccccccchhhhhhh----hcC-hhHHHHHHHHHHhhhHHHHHHHHHH--hhcCCcccccccccceeeecccCCcceEEEe Q lcl|NC_021535. 1 MARLIGQKAMNHVI----SHL-DGVKDAVYAEAKERGRKAEANLAQA--RASTRWHKIFGPDHLTRVTVTRGDVDSFINL 73 (119) Q Consensus 1 MA~~yg~~~l~~vv----A~~-~gV~~~v~~e~~~i~~RA~anLa~a--R~s~~~~ki~gp~~~a~I~~~~G~vD~~V~L 73 (119) ||+-+-=..|...| ... ..|...|.+..++++..+...|.+- .+||++.|-. +|+++... ..++. T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW------~~k~~~~~--~~~v~ 72 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPKRTGDYAKNW------TSQKLKNG--DQVIY 72 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccccce------eeeecCCe--eEEEE Confidence 88866544443333 333 3466668888888888888888763 3445444432 34443322 33444 Q ss_pred cC--CCcce--eeccccccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 74 EA--PNAMA--IEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 74 ~d--~~ala--IEfGH~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) .. .-.|+ +||||.-.+ |++. +|-.-|.-|--... T Consensus 73 ~~~~~y~l~HLLE~GHa~r~----GGrV--------~a~phI~paee~~~ 110 (123) T protein:vir:96 73 QKAPTYRLTHLLENGHAKRN----GGRV--------SPKVHIAPVEEELV 110 (123) T ss_pred EecCCcceEEeeecceeecC----Ccee--------CcchhhhHHHHHHH Confidence 43 33455 499997421 3222 22222211111111 No 22 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=53.20 E-value=0.22 Score=24.21 Aligned_cols=105 Identities=15% Similarity=0.092 Sum_probs=50.0 Q ss_pred Ccccccc--hhhhhhhhcChh-----HHHHHHHHHHhhhHHHHHHHHHHhhcCC-----ccccc--c-cccceeeecccC Q lcl|NC_021535. 1 MARLIGQ--KAMNHVISHLDG-----VKDAVYAEAKERGRKAEANLAQARASTR-----WHKIF--G-PDHLTRVTVTRG 65 (119) Q Consensus 1 MA~~yg~--~~l~~vvA~~~g-----V~~~v~~e~~~i~~RA~anLa~aR~s~~-----~~ki~--g-p~~~a~I~~~~G 65 (119) ||+--+- +.|+++...+.. ++..+++-.++++.+. |..+...++ ..+-. | .....+++.+.+ T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l---~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~g~ 77 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARL---LGKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQGN 77 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHH---HHHHHHhCCCcchhhcccccccccccccceeecCC Confidence 8885432 455666655543 3334444444444333 333333332 11110 0 111122333333 Q ss_pred CcceEEEecCCCcceeeccccccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 66 DVDSFINLEAPNAMAIEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 66 ~vD~~V~L~d~~alaIEfGH~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) ..-.-|.=..+=|--+||||.. .+| .++++|.|+|+++-.--= T Consensus 78 ~~~v~v~n~~~YA~~VE~Ghr~----~~~-------~gfV~G~fml~~s~~~~~ 120 (141) T protein:vir:79 78 NYIIEVVNPTEYASYVNFGHRT----KDG-------KGWVKGQHFLTISEMELQ 120 (141) T ss_pred eeEEEEecCCcchhhhhcceee----cCC-------cceeCCchhHHHHHHHHH Confidence 3222233334558889999953 111 258899999999865332 No 23 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=53.01 E-value=0.19 Score=24.48 Aligned_cols=104 Identities=12% Similarity=0.135 Sum_probs=53.5 Q ss_pred CcccccchhhhhhhhcChh------HHHHHHHHHHhhhHHHHHHHHHHhhcCCccc------------------------ Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDG------VKDAVYAEAKERGRKAEANLAQARASTRWHK------------------------ 50 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~g------V~~~v~~e~~~i~~RA~anLa~aR~s~~~~k------------------------ 50 (119) |.-=+.-+.+.++...+.. ++..+++.+.+++.+. |+.+.+-|++.. T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~l---l~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~ 77 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTEL---KSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHG 77 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHH---HHHHHHhCCcccchhhhhhhhhcccchhhhhccccc Confidence 8877777777776665543 3334555555554443 444444344311 Q ss_pred -cccc--cc--ceeeecccCCcceEEEecCCCcceeeccccccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 51 -IFGP--DH--LTRVTVTRGDVDSFINLEAPNAMAIEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 51 -i~gp--~~--~a~I~~~~G~vD~~V~L~d~~alaIEfGH~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) ..|- .+ ..++..+.+..-.-|.=..+=|--|||||-.. +.++++|.|+|+++-..-= T Consensus 78 k~tG~lr~swk~~~~~k~~~~~~v~v~N~~~YA~~VE~GHR~~------------~gGfV~G~fml~~s~~~~~ 139 (163) T protein:vir:10 78 KQGGTLQKGWSKSRIEVSGRTYKQKVYNKVYYAPHVEYGHKTV------------NGGFVPGQFFLHKTVEDTK 139 (163) T ss_pred cccchhhccceecceeecCCceEEEEEecCCccchhhcceeec------------CCceeccchhhHHHHHHHH Confidence 1111 00 01222221211111222234488899999431 1358899999999876554 No 24 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=52.75 E-value=0.55 Score=22.02 Aligned_cols=110 Identities=12% Similarity=0.047 Sum_probs=56.0 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHHHHHHH-HHHhhcCCcccccccccceeeeccc--CCcceEEEec--- Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRKAEANL-AQARASTRWHKIFGPDHLTRVTVTR--GDVDSFINLE--- 74 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~RA~anL-a~aR~s~~~~ki~gp~~~a~I~~~~--G~vD~~V~L~--- 74 (119) |+++-.+..+. .....+...+....+.+..++-.++ ++++...++ +.| ....+|+... +.--.++... T Consensus 1 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPv--dtG-~Lr~SI~~~~~~~~~~~~~~~v~~~ 74 (140) T protein:vir:10 1 MATIRARARIE---IDEAALERESGEHLRAFHRSLTRRIANQSRVAVPV--RTG-NLGRTIGELPQVYTPFRVRGGVEAT 74 (140) T ss_pred Ceeeeeeeeee---eCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc--cch-hhhccceeeeeeCCCceEEEEecCC Confidence 88887765433 2222333333333333333333333 333333332 111 1225565432 2212233333 Q ss_pred CCCcceeeccccccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 75 APNAMAIEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 75 d~~alaIEfGH~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) .+-|.-+|||.-|.-+...+.+.... ...|-++..+--+--| T Consensus 75 a~YA~~Ve~GT~ph~I~pk~~k~L~~---~~~G~~~~~k~V~hpG 116 (140) T protein:vir:10 75 ADYAAPVHEGSRPHAIRARNAQYLHF---WWHGREMFRKSVWHPG 116 (140) T ss_pred ccchhhhccCCCCceeecCCCcccee---ecCCCEEEeeeeecCC Confidence 34599999999887666555443332 3578888888777777 No 25 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=52.75 E-value=0.55 Score=22.02 Aligned_cols=110 Identities=12% Similarity=0.047 Sum_probs=56.0 Q ss_pred CcccccchhhhhhhhcChhHHHHHHHHHHhhhHHHHHHH-HHHhhcCCcccccccccceeeeccc--CCcceEEEec--- Q lcl|NC_021535. 1 MARLIGQKAMNHVISHLDGVKDAVYAEAKERGRKAEANL-AQARASTRWHKIFGPDHLTRVTVTR--GDVDSFINLE--- 74 (119) Q Consensus 1 MA~~yg~~~l~~vvA~~~gV~~~v~~e~~~i~~RA~anL-a~aR~s~~~~ki~gp~~~a~I~~~~--G~vD~~V~L~--- 74 (119) |+++-.+..+. .....+...+....+.+..++-.++ ++++...++ +.| ....+|+... +.--.++... T Consensus 1 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPv--dtG-~Lr~SI~~~~~~~~~~~~~~~v~~~ 74 (140) T protein:vir:97 1 MATIRARARIE---IDEAALERESGEHLRAFHRSLTRRIANQSRVAVPV--RTG-NLGRTIGELPQVYTPFRVRGGVEAT 74 (140) T ss_pred Ceeeeeeeeee---eCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc--cch-hhhccceeeeeeCCCceEEEEecCC Confidence 88887765433 2222333333333333333333333 333333332 111 1225565432 2212233333 Q ss_pred CCCcceeeccccccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 75 APNAMAIEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 75 d~~alaIEfGH~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) .+-|.-+|||.-|.-+...+.+.... ...|-++..+--+--| T Consensus 75 a~YA~~Ve~GT~ph~I~pk~~k~L~~---~~~G~~~~~k~V~hpG 116 (140) T protein:vir:97 75 ADYAAPVHEGSRPHAIRARNAQYLHF---WWHGREMFRKSVWHPG 116 (140) T ss_pred ccchhhhccCCCCceeecCCCcccee---ecCCCEEEeeeeecCC Confidence 34599999999887666555443332 3578888888777777 No 26 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=43.28 E-value=0.47 Score=22.39 Aligned_cols=92 Identities=15% Similarity=0.130 Sum_probs=43.0 Q ss_pred cChhHHHHHHHHHHhhhHHHHHHHHHHhhcCCcccccccccceeee--cccCCcceEEEecCCCcceeeccccccceecC Q lcl|NC_021535. 16 HLDGVKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHLTRVT--VTRGDVDSFINLEAPNAMAIEFGHQPSGVFGP 93 (119) Q Consensus 16 ~~~gV~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki~gp~~~a~I~--~~~G~vD~~V~L~d~~alaIEfGH~~sg~f~~ 93 (119) -..-|+.+|.+-+.++...|++ .+..+||. ...+|+ ...+.+-..|.-+.+-|.-+|||+.+.+.-.. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~--~apv~TG~--------Lr~SI~~~~~~~~~~~~V~~~~~Ya~yvE~GTg~~~~~~~ 70 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIIS--LMPVDTGY--------LRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAG 70 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHh--hCCccccc--------cccceeEEeecCcEEEEEecCCCccceeecCccccccCCC Confidence 1122334444444444333333 12233332 224443 34566777888888899999999877665443 Q ss_pred cccccCCC--CCCCCceeeeecccccCC Q lcl|NC_021535. 94 GGMFGHLD--TKAPEGLYIITSAAGLRG 119 (119) Q Consensus 94 g~k~~~~~--tka~~GLyILT~AAgl~~ 119 (119) +.+....+ -+.+.|.++-|+ |... T Consensus 71 ~~~~~~~~~~~~~~~g~~~~t~--g~~a 96 (116) T protein:vir:95 71 GSRAKNIPWSYKDANGKWHTTK--GQHA 96 (116) T ss_pred ccccccccceeecCccceeeCC--CCCC Confidence 32211110 011233444333 2222 No 27 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=42.37 E-value=0.39 Score=22.85 Aligned_cols=92 Identities=12% Similarity=0.136 Sum_probs=47.3 Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHhhcCCccc-cccccc----ceeeecccCCcceEEEecCCCcceeeccc-cccce--e Q lcl|NC_021535. 20 VKDAVYAEAKERGRKAEANLAQARASTRWHK-IFGPDH----LTRVTVTRGDVDSFINLEAPNAMAIEFGH-QPSGV--F 91 (119) Q Consensus 20 V~~~v~~e~~~i~~RA~anLa~aR~s~~~~k-i~gp~~----~a~I~~~~G~vD~~V~L~d~~alaIEfGH-~~sg~--f 91 (119) +...+++-+.+++.+..+ .+-+-|+.-+ +.|-=. ..+++...+. |.=..+=|--+|||| +..|. + T Consensus 1 l~~~~~~~~~~~a~~l~~---~vk~rTPv~~~d~G~LR~sW~~g~v~k~~~~----v~N~~eYA~~VE~GHRq~~g~g~~ 73 (116) T protein:vir:10 1 MSKNLRRAKNNIGNKLLR---KVKPKTPVAKIDGGTARKSWKYKELNLFDGV----VSNNVEYIHHLEYGHRTRQGTGTS 73 (116) T ss_pred CchHHHHHHHHHHHHHHH---HHHhhCCCCcCCCcccccCceeeeeeccCce----eecCCcccccccCCceeeCCccee Confidence 667777777777655444 3333444322 122100 0133332221 333344588899999 43342 2 Q ss_pred cCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 92 GPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 92 ~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) .|..- .....+|++|.|.|+++-..-= T Consensus 74 ~~~~g-krlk~~~V~G~fml~~s~~e~~ 100 (116) T protein:vir:10 74 ENYRP-KPNGISFVPGVFMLARSVDEMS 100 (116) T ss_pred ccccc-ccccCCccCceehHHHHHHHHH Confidence 22100 1223479999999999865433 No 28 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=36.00 E-value=1.2 Score=20.15 Aligned_cols=114 Identities=13% Similarity=0.058 Sum_probs=43.0 Q ss_pred Ccccc---cchhhhhhhhcCh-hHHHHHHHHHHhhhHHHHHHHHHHhhcCCccccccc--ccc-eeeecccCCcceEEEe Q lcl|NC_021535. 1 MARLI---GQKAMNHVISHLD-GVKDAVYAEAKERGRKAEANLAQARASTRWHKIFGP--DHL-TRVTVTRGDVDSFINL 73 (119) Q Consensus 1 MA~~y---g~~~l~~vvA~~~-gV~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki~gp--~~~-a~I~~~~G~vD~~V~L 73 (119) ||.+= +-..+...+..++ .++.++.+...+. |+....+++.-.++ ++|- .+. .++..+...+-..|.- T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~---a~~i~~~ak~~aPv--~TG~Lr~SI~~~~~~~g~~~~~~v~~ 75 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTGAAREATEAA---ANDMVNMAKGLCPV--DTGRLRSSIQAVPSGGRFSFSVTIGT 75 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHhCCc--cchhhhccceeeeccCCceEEEEEec Confidence 99883 4443343333322 2333333322222 22222333333332 2222 222 2233333334455666 Q ss_pred cCCCcceeeccccccceecCcccccCCCCCC----------CCceeeeecccccCC Q lcl|NC_021535. 74 EAPNAMAIEFGHQPSGVFGPGGMFGHLDTKA----------PEGLYIITSAAGLRG 119 (119) Q Consensus 74 ~d~~alaIEfGH~~sg~f~~g~k~~~~~tka----------~~GLyILT~AAgl~~ 119 (119) ..+-|.-+||||-|..+...+.+....+.+. .++.-.|..|.--.= T Consensus 76 ~~~YA~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~ 131 (142) T protein:vir:94 76 NVTYAADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAAS 131 (142) T ss_pred CcccchhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHH Confidence 6778999999997765443222211110000 011111111110000 No 29 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=26.32 E-value=1.7 Score=19.35 Aligned_cols=104 Identities=13% Similarity=0.066 Sum_probs=49.9 Q ss_pred Ccccccc-hhhhhhhhcChhHHHHHHHHHHhhhHHHHHHHHHHhhcCCcccccccccceeeeccc---C--CcceEEEec Q lcl|NC_021535. 1 MARLIGQ-KAMNHVISHLDGVKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHLTRVTVTR---G--DVDSFINLE 74 (119) Q Consensus 1 MA~~yg~-~~l~~vvA~~~gV~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki~gp~~~a~I~~~~---G--~vD~~V~L~ 74 (119) +++|=-+ ..+ +..|+..+++....++.+-++. ++...++ ..| ....+|+..- + .+-..|.-+ T Consensus 4 s~~i~i~~~~l------~~~v~~~~k~~l~~~a~~i~~~---ak~~aPv--~tG-~Lr~SI~~~~~~~~~~~~~~~v~~~ 71 (137) T protein:vir:10 4 TARIHINEPEL------ERQTGAIFRGKHRSITRRIATQ---ARADVPV--RTG-NLGRGIQEMPQTYRPFHVGGGVEDN 71 (137) T ss_pred eEEEeeCHHHH------HHHHHHHHHHHHHHHHHHHHHH---HHHhCCc--ccc-hhhcCceeeeeccccceEEEEEecC Confidence 4555322 222 2344444444444443332221 2221221 111 1224554321 1 223345555 Q ss_pred CCCcceeeccccccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 75 APNAMAIEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 75 d~~alaIEfGH~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) ++-|.-+|||.-|.-+..-++++... ...|-++.++.-+--| T Consensus 72 ~~YA~~ve~GT~ph~I~pk~~k~l~f---~~~G~~v~~k~v~hpG 113 (137) T protein:vir:10 72 VDYAAPVHEGSRPHRITARHANALHF---FWHGREVFRKSVWHPG 113 (137) T ss_pred CCceeeeeecCCCceeecccCceeee---eeCCceEEeeeeecCC Confidence 67799999998766555444443322 2458888888776666 No 30 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=25.43 E-value=1.4 Score=19.77 Aligned_cols=94 Identities=15% Similarity=0.096 Sum_probs=39.7 Q ss_pred hhhhhhcChhHHHHHHHHHHhhhHHHHHHHHHHhhcCCcccccccccceeee--cccCCcceEEEecCCCcceeeccccc Q lcl|NC_021535. 10 MNHVISHLDGVKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHLTRVT--VTRGDVDSFINLEAPNAMAIEFGHQP 87 (119) Q Consensus 10 l~~vvA~~~gV~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki~gp~~~a~I~--~~~G~vD~~V~L~d~~alaIEfGH~~ 87 (119) |+ .-|+.+|.+-+.++...|+++ +..+||. ...+|+ ...+.+-..|.-+.+-|.-+|||.-+ T Consensus 1 v~------~~v~~~~~~~~~~i~~~ak~~--aPv~TG~--------Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~ 64 (116) T protein:vir:97 1 ME------RWVKRGIAKTTAKIHNTIISL--MPVDTGY--------LRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGI 64 (116) T ss_pred Ch------HHHHHHHHHHHHHHHHHHHHh--CCcCccc--------ccccceEEeecCcEEEEEecCCCcccccccCCcc Confidence 22 223344444444444444331 2233332 224443 33455667777778889999999766 Q ss_pred cceecCcccccCC------------CCCCCCceeeeecccccCC Q lcl|NC_021535. 88 SGVFGPGGMFGHL------------DTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 88 sg~f~~g~k~~~~------------~tka~~GLyILT~AAgl~~ 119 (119) .+.-..+.+.-.. .|+-+++.-.|..|....= T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~ 108 (116) T protein:vir:97 65 YATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR 108 (116) T ss_pred cccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHH Confidence 5433322111000 0111222222222211111 No 31 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=25.43 E-value=1.4 Score=19.77 Aligned_cols=94 Identities=15% Similarity=0.096 Sum_probs=39.7 Q ss_pred hhhhhhcChhHHHHHHHHHHhhhHHHHHHHHHHhhcCCcccccccccceeee--cccCCcceEEEecCCCcceeeccccc Q lcl|NC_021535. 10 MNHVISHLDGVKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHLTRVT--VTRGDVDSFINLEAPNAMAIEFGHQP 87 (119) Q Consensus 10 l~~vvA~~~gV~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki~gp~~~a~I~--~~~G~vD~~V~L~d~~alaIEfGH~~ 87 (119) |+ .-|+.+|.+-+.++...|+++ +..+||. ...+|+ ...+.+-..|.-+.+-|.-+|||.-+ T Consensus 1 v~------~~v~~~~~~~~~~i~~~ak~~--aPv~TG~--------Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~ 64 (116) T protein:vir:12 1 ME------RWVKRGIAKTTAKIHNTIISL--MPVDTGY--------LRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGI 64 (116) T ss_pred Ch------HHHHHHHHHHHHHHHHHHHHh--CCcCccc--------ccccceEEeecCcEEEEEecCCCcccccccCCcc Confidence 22 223344444444444444331 2233332 224443 33455667777778889999999766 Q ss_pred cceecCcccccCC------------CCCCCCceeeeecccccCC Q lcl|NC_021535. 88 SGVFGPGGMFGHL------------DTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 88 sg~f~~g~k~~~~------------~tka~~GLyILT~AAgl~~ 119 (119) .+.-..+.+.-.. .|+-+++.-.|..|....= T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~ 108 (116) T protein:vir:12 65 YATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGR 108 (116) T ss_pred cccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHH Confidence 5433322111000 0111222222222211111 No 32 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=22.05 E-value=1.5 Score=19.57 Aligned_cols=92 Identities=16% Similarity=0.087 Sum_probs=44.8 Q ss_pred Cc-ccccchhhhhhhhcCh-----hHHHHHHHHHHhhhHHHHHHHHHHhhcCCcccccccccceeeecccCCcceEEEec Q lcl|NC_021535. 1 MA-RLIGQKAMNHVISHLD-----GVKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHLTRVTVTRGDVDSFINLE 74 (119) Q Consensus 1 MA-~~yg~~~l~~vvA~~~-----gV~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki~gp~~~a~I~~~~G~vD~~V~L~ 74 (119) |. ++=|=..|.+-+..+. .|+.+|.+-+..+.. +++...+ ++.| ....+|+...+..+..|.-. T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~-------~ak~~aP--v~TG-~Lr~sI~~~~~g~~~~V~~~ 70 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAVEQSLQVLKNNGEKGKR-------IAKQLAP--KDTE-FLKDHITTSYPGMEAHIHGE 70 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHhCC--cCch-hhhhceeeecCceEEEeecC Confidence 66 4444333333332222 233344443444333 3333222 1222 23367888777778888777 Q ss_pred CCCcceeeccccccceecCcccccCCCCCCCCceeeeecccccCC Q lcl|NC_021535. 75 APNAMAIEFGHQPSGVFGPGGMFGHLDTKAPEGLYIITSAAGLRG 119 (119) Q Consensus 75 d~~alaIEfGH~~sg~f~~g~k~~~~~tka~~GLyILT~AAgl~~ 119 (119) ..=+.-+||||- .+++.+.|..|-.-.- T Consensus 71 ~~Ya~yvE~GT~-----------------~~~aqPfl~pa~~~~~ 98 (114) T protein:vir:95 71 AGYDGYQEYGTR-----------------FQPGTPHFRPMMEQIQ 98 (114) T ss_pred CCccceeecCcc-----------------ccCCCccchhhHHHHH Confidence 777778899872 1233344444443333 No 33 >protein:vir:2507 Length: 83 # NCBI annotation: hypothetical protein # Family: family:all:1171 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569748;genbank:gi:18496898;genbank:GeneID:932258 Probab=21.87 E-value=1.4 Score=19.83 Aligned_cols=82 Identities=18% Similarity=0.203 Sum_probs=54.6 Q ss_pred ccchhhhhhhhcChhHHHHHHHHHHhhhHHHHHHHHHHhhcCCccccccccc-ceeeecccCCcceEEEecCCCcceeec Q lcl|NC_021535. 5 IGQKAMNHVISHLDGVKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDH-LTRVTVTRGDVDSFINLEAPNAMAIEF 83 (119) Q Consensus 5 yg~~~l~~vvA~~~gV~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki~gp~~-~a~I~~~~G~vD~~V~L~d~~alaIEf 83 (119) ..... ++.|-++|+|++++..-.-+|+++|-+--.+ ||+ .++.++.+..+-.+|--.-.++|-.|. T Consensus 1 ~P~Se-hrKIR~lpev~kacq~lga~ia~~Ag~iA~a------------~DgYgte~~vGrdR~Rv~V~a~~~aaikaE~ 67 (83) T protein:vir:25 1 MPNSE-HRKIRKLPEVQAELQRLAAEVARRAGGIADA------------PDGYGTDLEVGRTRARAHVWPKSSAAIKAEI 67 (83) T ss_pred CCCcc-cchhccchhHHHHHHHhhHHHhhhhccccCC------------CCCcccceeecCceeeEEEeecCCceeeecc Confidence 34443 7889999999999999999998887543222 233 478888888888888888788888888 Q ss_pred cccccceecCcccccCCCCCCCC Q lcl|NC_021535. 84 GHQPSGVFGPGGMFGHLDTKAPE 106 (119) Q Consensus 84 GH~~sg~f~~g~k~~~~~tka~~ 106 (119) |-.|---..- +.++-+ T Consensus 68 ~~aPlmq~aA-------e~gp~~ 83 (83) T protein:vir:25 68 KTAPLMTIAA-------EQGPQQ 83 (83) T ss_pred cCCcchhhhh-------hcCCCC Confidence 8754221100 000111 No 34 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=20.61 E-value=2.7 Score=18.18 Aligned_cols=79 Identities=16% Similarity=0.251 Sum_probs=44.6 Q ss_pred Cccc-c---cchhhhhhh---hcChhHHHHHHHHHHhhhHHHHHHHHHHhhcCCcccccccccceee--ecccCCcceEE Q lcl|NC_021535. 1 MARL-I---GQKAMNHVI---SHLDGVKDAVYAEAKERGRKAEANLAQARASTRWHKIFGPDHLTRV--TVTRGDVDSFI 71 (119) Q Consensus 1 MA~~-y---g~~~l~~vv---A~~~gV~~~v~~e~~~i~~RA~anLa~aR~s~~~~ki~gp~~~a~I--~~~~G~vD~~V 71 (119) |||+ | |-.+|.+-+ +..+.|+..|..-+.++..+|+.|- =-+|| - -.-+| +...|..=..| T Consensus 1 Ma~~~i~~~Gld~L~~~L~~~~~~~~v~~vv~~~~~~l~~~ak~~a--p~dTG-------~-lrrSI~~~~~~~g~~~~v 70 (92) T protein:vir:99 1 MADYSISWDGLDALDEALANQQNMNTVKKVVKKHTANLMTATQQAV--PVDTG-------H-LKQSAQIQISRDGFTGSV 70 (92) T ss_pred CCceeeEeehHHHHHHHHHhhccHHHHHHHHHHHHHHHHHHHHHhC--CCCcc-------c-cceeeeEEeecCCeeEEE Confidence 9994 3 444443333 3347799999999999988888752 11222 1 11333 33445555555 Q ss_pred EecCC---Ccceeecccc-ccc Q lcl|NC_021535. 72 NLEAP---NAMAIEFGHQ-PSG 89 (119) Q Consensus 72 ~L~d~---~alaIEfGH~-~sg 89 (119) ...+| =+.-+|||+- -+- T Consensus 71 ~~~gp~a~Ya~YvE~GTR~M~A 92 (92) T protein:vir:99 71 TYGGGLVNYAAYVEFGTRFMDS 92 (92) T ss_pred EeccCccccccccccceeecCC Confidence 54433 3777899982 111 Done!