Query         lcl|NC_019411.1_cdsid_YP_006989816.1 [gene=D870_gp261] [protein=hypothetical protein] [protein_id=YP_006989816.1] [location=54121..54642]
Match_columns 173
No_of_seqs    87 out of 91
Neff          6.1
Searched_HMMs 1612
Date          Thu Nov 7 18:15:46 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_83 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_83_vs_rec_db.hhr

 No Hit                             Prob E-value P-value Score   SS Cols Query HMM Template HMM
  1 protein:vir:95176 Length: 172  100.0 2.7E-61 1.7E-64 352.6 16.5  167     1-173 1-172 (172)
  2 protein:vir:94955 Length: 170  100.0 3.5E-59 2.1E-62 341.1 15.7  168     4-173 1-170 (170)
  3 protein:vir:80389 Length: 172  100.0 3.3E-58 2.1E-61 335.7 16.2  161     3-173 1-172 (172)
  4 protein:vir:78383 Length: 169  100.0 3.7E-58 2.3E-61 335.4 16.0  164     3-173 1-169 (169)
  5 protein:vir:95004 Length: 169  100.0 4.5E-58 2.8E-61 335.0 15.9  164     3-173 1-169 (169)
  6 protein:vir:97267 Length: 172  100.0 1.2E-55 7.4E-59 321.7 15.0  161     3-173 1-172 (172)
  7 protein:vir:43 Length: 131 # N  98.1 3.9E-08 2.4E-11  61.2  9.4  125    17-169 1-131 (131)
  8 protein:vir:80967 Length: 131   98.1 3.9E-08 2.4E-11  61.2  9.3  125    17-169 1-131 (131)
  9 protein:vir:98900 Length: 132   97.7 1.1E-06 6.7E-10  53.3 10.1  124    17-168 1-132 (132)
 10 protein:vir:79701 Length: 144   94.3  0.0021 1.3E-06  35.2 10.2  132    16-168 1-144 (144)
 11 protein:vir:9821 Length: 138 #  93.9  0.0013 7.8E-07  36.5  8.0  126     1-169 1-138 (138)
 12 protein:vir:9576 Length: 131 #  93.5  0.0056 3.5E-06  32.9 10.9  123    16-173 1-130 (131)
 13 protein:vir:94761 Length: 132   92.8  0.0094 5.9E-06  31.7 11.1  125    16-173 1-131 (132)
 14 protein:vir:2505 Length: 128 #  92.1  0.0022 1.4E-06  35.2  6.7  122     6-173 1-124 (128)
 15 protein:vir:4788 Length: 130 #  90.3   0.011   7E-06  31.3  8.8  124    17-169 1-130 (130)
 16 protein:vir:80320 Length: 188   90.0   0.019 1.2E-05  30.0  9.9  128     3-167 1-188 (188)
 17 protein:vir:9761 Length: 140 #  89.4   0.024 1.5E-05  29.5  9.8  123    16-173 1-134 (140)
 18 protein:vir:107756 Length: 147  89.1   0.016 9.9E-06  30.4  8.7  126     1-173 1-140 (147)
 19 protein:vir:100103 Length: 120  87.9   0.027 1.7E-05  29.2  9.1  120    13-167 1-120 (120)
 20 protein:vir:1435 Length: 188 #  87.6   0.036 2.2E-05  28.5  9.6  127     1-167 1-188 (188)
 21 protein:vir:101652 Length: 188  87.3    0.02 1.2E-05  30.0  8.0  121    22-162 1-188 (188)
 22 protein:vir:7857 Length: 188 #  87.3    0.02 1.2E-05  30.0  8.0  121    22-162 1-188 (188)
 23 protein:vir:99002 Length: 158   86.7    0.03 1.9E-05  28.9  8.7  123    16-173 1-124 (158)
 24 protein:vir:1887 Length: 108 #  85.7   0.028 1.8E-05  29.1  8.0  105     1-167 1-108 (108)
 25 protein:vir:192 Length: 108 #   85.7   0.028 1.8E-05  29.1  8.0  105     1-167 1-108 (108)
 26 protein:vir:100245 Length: 113  85.2    0.04 2.5E-05  28.3  8.5  113    17-167 1-113 (113)
 27 protein:vir:1640 Length: 132 #  84.6   0.059 3.7E-05  27.3 11.4  125    16-173 1-131 (132)
 28 protein:vir:93592 Length: 108   84.3   0.056 3.4E-05  27.5  8.9  108    16-168 1-108 (108)
 29 protein:vir:1993 Length: 141 #  83.4   0.017   1E-05  30.3  5.6  119    17-157 1-141 (141)
 30 protein:vir:5256 Length: 119 #  79.9     0.1 6.2E-05  26.1  9.7  106    19-169 1-119 (119)
 31 protein:vir:80036 Length: 111   75.0   0.036 2.2E-05  28.5  4.7  109    16-171 1-111 (111)
 32 protein:vir:79074 Length: 150   74.6    0.13 8.3E-05  25.4  7.7  118    17-156 1-150 (150)
 33 protein:vir:103846 Length: 138  73.0    0.18 0.00011  24.7  8.3  123    17-173 1-137 (138)
 34 protein:vir:107864 Length: 150  72.8    0.18 0.00011  24.7  8.1  118    17-156 1-150 (150)
 35 protein:vir:99570 Length: 153   72.1    0.19 0.00012  24.6 10.9  128     3-173 1-147 (153)
 36 protein:vir:10365 Length: 115   71.2     0.2 0.00012  24.4  8.4  114    19-169 1-115 (115)
 37 protein:vir:486 Length: 107 #   71.0     0.2 0.00013  24.4  8.3  104    18-167 1-107 (107)
 38 protein:vir:99848 Length: 172   69.8   0.081   5E-05  26.6  5.4  134     1-156 1-172 (172)
 39 protein:vir:79640 Length: 134   66.0    0.27 0.00017  23.7  8.8  117    16-173 1-130 (134)
 40 protein:vir:99222 Length: 138   63.6   0.046 2.8E-05  27.9  2.7  123    17-173 1-137 (138)
 41 protein:vir:79253 Length: 138   63.6   0.046 2.8E-05  27.9  2.7  123    17-173 1-137 (138)
 42 protein:vir:107702 Length: 136  60.7    0.37 0.00023  23.0 10.1  120    14-173 1-133 (136)
 43 protein:vir:4512 Length: 107 #  58.6    0.41 0.00025  22.7  8.2  107    18-167 1-107 (107)
 44 protein:vir:102961 Length: 131  55.5    0.44 0.00027  22.5  6.6  118    21-171 1-131 (131)
 45 protein:vir:81069 Length: 115   55.4    0.48  0.0003  22.3  8.7  113    19-169 1-115 (115)
 46 protein:vir:96108 Length: 155   53.1    0.54 0.00033  22.1 10.4  126    13-173 1-149 (155)
 47 protein:vir:8104 Length: 170 #  50.1    0.62 0.00038  21.7  9.9  116    31-165 1-170 (170)
 48 protein:vir:103283 Length: 125  49.8    0.63 0.00039  21.7  8.9  109    31-173 1-121 (125)
 49 protein:vir:97069 Length: 115   48.6    0.67 0.00041  21.6  9.0  113    19-169 1-115 (115)
 50 protein:vir:98481 Length: 136   46.6    0.73 0.00045  21.3  8.2  116    16-173 1-119 (136)
 51 protein:vir:104344 Length: 132  39.4       1 0.00063  20.5  8.5  116    16-173 1-127 (132)
 52 protein:vir:4458 Length: 107 #  38.7     1.1 0.00065  20.5  8.6  107    18-161 1-107 (107)
 53 protein:vir:5742 Length: 110 #  36.2     1.2 0.00074  20.2  8.7  108    17-161 1-110 (110)
 54 protein:vir:94507 Length: 113   35.6     1.2 0.00076  20.1  8.8  113    19-171 1-113 (113)
 55 protein:vir:3034 Length: 111 #  30.1     1.1 0.00066  20.4  4.5  100    50-169 1-111 (111)
 56 protein:vir:4702 Length: 113 #  21.1     2.7  0.0016  18.3  7.7  106     3-167 1-113 (113)
 57 protein:vir:78849 Length: 110   20.2     2.8  0.0017  18.1 10.1  110    19-171 1-110 (110)
 58 protein:vir:96390 Length: 110   20.2     2.8  0.0017  18.1 10.1  110    19-171 1-110 (110)
 59 protein:vir:103957 Length: 110  20.2     2.8  0.0017  18.1 10.1  110    19-171 1-110 (110)
 60 protein:vir:97145 Length: 110   20.2     2.8  0.0017  18.1 10.1  110    19-171 1-110 (110)
 61 protein:vir:9311 Length: 110 #  20.2     2.8  0.0017  18.1 10.1  110    19-171 1-110 (110)
 62 protein:vir:99796 Length: 110   20.2     2.8  0.0017  18.1 10.1  110    19-171 1-110 (110)
 63 protein:vir:96221 Length: 110   20.2     2.8  0.0017  18.1 10.1  110    19-171 1-110 (110)
 64 protein:vir:9928 Length: 118 #  20.0     2.8  0.0018  18.1 10.9  117    20-171 1-118 (118)

No 1
>protein:vir:95176 Length: 172 # NCBI annotation: hypothetical protein ORF013 #
Family: family:all:703 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293420;genbank:gi:148912841;genbank:GeneID:5228251 Probab=100.00 E-value=2.7e-61 Score=352.62 Aligned_cols=167 Identities=28% Similarity=0.386 Sum_probs=150.2 Q ss_pred CeeEEEEeCCCCCCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhh-hcccccccCCccccccCCc Q lcl|NC_019411. 1 MAFTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDR-SIAWAGEKVDEDSGLRWPR 79 (173) Q Consensus 1 m~M~live~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~-~~~f~G~r~~~~Q~lawPR 79 (173) |+|+||||||+|+|+||||+|+++|++||++|++ |.++++++||++|++|++|||+ .++|+|+|++++|+|+||| T Consensus 1 ~~Malive~~~g~~~anSYvtv~ea~aY~~~rg~----~~~~~~~~ke~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR 76 (172) T protein:vir:95 1 MAITIVVEDGSGVTNANSYVSVADARIYASNRGV----ELPLDDDELAAMLIRSTDYLEAQACRFQGKPTSTTQALQWPR 76 (172) T ss_pred CceeEEEeCCCCCCcccccccHHHHHHHHHhcCC----cCCCChHHHHHHHHHHHHHhhccCCceeeeecCCcccccCCc Confidence 9999999999999999999999999999999965 7778999999999999999996 4799999999999999999 Q ss_pred CCCcccCCeeeccccchHHHHHHHHHHHHHHHcCCCCCCc--cccceeEEecceeEEeecCCCCcc--cchHHHHHHHhh Q lcl|NC_019411. 80 TGVYDVDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQ--TTRGMKEIQVDVIELKFDSEIQRG--SMPDIVMSILEG 155 (173) Q Consensus 80 ~gv~~~dg~~~~~d~IP~~V~~A~~elA~~~~~~~~~~~~--~~~~v~~ekVG~i~veY~~~~~~~--~~~~~v~~lL~~ 155 (173) +|+. .+|..+++|.||++||+||||||++++++++..+. ..+.||+||||+|+|||+.+.+.+ +.|++|++||+| T Consensus 77 ~g~~-~~~~~v~~~~IP~~V~~A~~elA~~~~~~~~~~~~~~~~~~vk~~kVG~I~veY~~~~~~~~~~~~~~v~~LL~p 155 (172) T protein:vir:95 77 TGVF-LNEDEVPSNVIPKSLIAAQVQLTMAINAGFDLQPNVSPQDYVTREKVGPIETEYADPLSVGIMPTFTAANALLAP 155 (172) T ss_pred CCcc-cCcccccccchhHHHHHHHHHHHHHHHcCccccccCCcccceeEEeccceEEeeccCCCCCCcccHHHHHHHHhh Confidence 9987 79999999999999999999999999998765443 456799999999999998766544 679999999999 Q ss_pred hhhcccCCCcceeeeecC Q lcl|NC_019411. 
156 LGVVKTGTRPAFKKIIRH 173 (173) Q Consensus 156 ll~~~~g~~~~~~~~~R~ 173 (173) |++..+|+++ .+|++|- T Consensus 156 ~l~~~~~~~~-~~r~~r~ 172 (172) T protein:vir:95 156 LFGECASNKF-ALRTIRV 172 (172) T ss_pred hhcccCCcce-eeEEEeC Confidence 9986655544 5999999 No 2 >protein:vir:94955 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239280;genbank:gi:66392062;genbank:GeneID:5076600 Probab=100.00 E-value=3.5e-59 Score=341.09 Aligned_cols=168 Identities=23% Similarity=0.452 Sum_probs=157.0 Q ss_pred EEEEeCCCCCCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCc Q lcl|NC_019411. 4 TFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVY 83 (173) Q Consensus 4 ~live~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~ 83 (173) -||||||+|+|+||||+|++||++||+.|+. ...|.++|+++||++|++|+||||+.|+|+|+|++++|+|+|||+|+. T Consensus 1 m~~i~~~~g~~~AnSYvtv~ea~aY~~~r~~-~~~w~~~~~~~~e~aL~~A~dyId~~~~f~G~r~~~~Q~l~wPR~g~~ 79 (170) T protein:vir:94 1 MPTVDATPGSITANSYVTVAEANSYFDGSYG-RPLWTSASEDEKASLVISASRYLDQMMAWIGAPTNPEQSMWWPCKNAV 79 (170) T ss_pred CceeecCCCCCcccceecHHHHHHHHHhhcc-ccccCCCCHHHHHHHHHHHHHHhccccccccccCCcchhhcccccCcc Confidence 2556999999999999999999999999974 668999999999999999999999889999999999999999999986 Q ss_pred ccCCeeeccccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhh--hccc Q lcl|NC_019411. 
84 DVDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLG--VVKT 161 (173) Q Consensus 84 ~~dg~~~~~d~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll--~~~~ 161 (173) .||..+++|.||++||+||||||++++++++..+..++.||+||||+|||||+.+++...+++.|++||+||+ +..+ T Consensus 80 -~dg~~~~~~~IP~~V~~Aq~elA~~~~~~~~~~~~~~~~v~~~kVG~i~veY~~~~~~~~~~~~v~~LL~p~l~~~~~g 158 (170) T protein:vir:94 80 -IGGMTLSQVSIPVKVKIAVFELAYFMLESGAALSFADQTIDSVKVGTIRVEFTKNSTDAGLPTFVEAMLSGFGSPVLYG 158 (170) T ss_pred -cCccccccchhhHHHHHHHHHHHHHHHhCcccCcccccceeeEecceeEEEecCCCCCCccHHHHHHHhhhhhcccccc Confidence 9999999999999999999999999999988888888899999999999999988888889999999999999 4467 Q ss_pred CCCcceeeeecC Q lcl|NC_019411. 162 GTRPAFKKIIRH 173 (173) Q Consensus 162 g~~~~~~~~~R~ 173 (173) +++..+++|+|- T Consensus 159 ~~~~~~~~~~r~ 170 (170) T protein:vir:94 159 SNAARSIDLVRA 170 (170) T ss_pred ccccceeeeecC Confidence 778899999999 No 3 >protein:vir:80389 Length: 172 # NCBI annotation: BcepGomrgp09 # Family: family:all:703 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210229;genbank:gi:146329921;genbank:GeneID:5123498 Probab=100.00 E-value=3.3e-58 Score=335.69 Aligned_cols=161 Identities=32% Similarity=0.527 Sum_probs=142.8 Q ss_pred eEEEEeCCCCCCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhh-cccccccCCccccccCCcCC Q lcl|NC_019411. 3 FTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRS-IAWAGEKVDEDSGLRWPRTG 81 (173) Q Consensus 3 M~live~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~-~~f~G~r~~~~Q~lawPR~g 81 (173) |+||||||+|+|+||||+|+++|++||++|++ .+++++||++|++|+||||+. 
++|+|+|++++|+|+|||+| T Consensus 1 Malived~~g~~~anSYvt~~~a~aY~~~rg~------~~~~d~~e~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g 74 (172) T protein:vir:80 1 MALIVEDGTGKPDANTYAGADFVIAYAQARGV------TVDADEAERLILEAMDYIESFRRRWKGERNTREQGLTWPRHD 74 (172) T ss_pred CeeEeeCCCCCccccccccHHHHHHHHHHcCC------CcCHHHHHHHHHHHHHHHhhccCccccccCCccccccccccC Confidence 99999999999999999999999999999854 456778999999999999983 37999999999999999999 Q ss_pred CcccCCeeeccccchHHHHHHHHHHHHHHHcCCC-CCCccccceeEEecceeEEeecCCCCc---------ccchHHHHH Q lcl|NC_019411. 82 VYDVDGFLIPSDAIPQQLMEATAEMAAALMNNDW-TSPQTTRGMKEIQVDVIELKFDSEIQR---------GSMPDIVMS 151 (173) Q Consensus 82 v~~~dg~~~~~d~IP~~V~~A~~elA~~~~~~~~-~~~~~~~~v~~ekVG~i~veY~~~~~~---------~~~~~~v~~ 151 (173) ++ .||..+|+|.||++||+||||||++++++.. .+......||+||||+||+||+.+.+. .++|++|++ T Consensus 75 ~~-~~g~~~~~~~IP~~v~~A~~elA~~~~~g~~~~~~~~~~~v~~ekVG~i~~eY~~~~~~~~~~~~~~~~~~~~~v~~ 153 (172) T protein:vir:80 75 AV-VDGFVIPSDVIPKELQSAVAAAVIEQVNGFELQQSQDQWAVRIEKVDVIEVQYAAGGGGQSASANAPMKPTFPKIDA 153 (172) T ss_pred cc-cCcccccccchhHHHHHHHHHHHHHHhcCCccCcCCCCceeeEEeccceEEeeecccCccccccccCCccchHHHHH Confidence 86 8999999999999999999999999999854 444556679999999999999855432 356899999 Q ss_pred HHhhhhhcccCCCcceeeeecC Q lcl|NC_019411. 152 ILEGLGVVKTGTRPAFKKIIRH 173 (173) Q Consensus 152 lL~~ll~~~~g~~~~~~~~~R~ 173 (173) ||+|||+ |+++.+++++|- T Consensus 154 LL~p~l~---~~gg~~~~~vrg 172 (172) T protein:vir:80 154 LLNPLLV---GDGGLFLVAVRG 172 (172) T ss_pred HHhhhhc---CCCCeeeeeecC Confidence 9999986 557778999999 No 4 >protein:vir:78383 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110841;genbank:gi:134288602;genbank:GeneID:5179646 Probab=100.00 E-value=3.7e-58 Score=335.44 Aligned_cols=164 Identities=21% Similarity=0.260 Sum_probs=147.7 Q ss_pred eEEEEeCCCCCCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhh-cccccccCCccccccCCcCC Q lcl|NC_019411. 
3 FTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRS-IAWAGEKVDEDSGLRWPRTG 81 (173) Q Consensus 3 M~live~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~-~~f~G~r~~~~Q~lawPR~g 81 (173) |+||||||+|+|+||||+|+++|++||++|++ |..+|+++||++|++|++|||+. ++|+|+|++++|+|+|||+| T Consensus 1 MaliV~~~~g~~~anSYvtv~~a~aY~~~rg~----~~~~d~~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg 76 (169) T protein:vir:78 1 MPLIVETGQGIPNADSYVSLEDGRALAAKYGL----ELPEDDTAAEAALRNGAVYVGLFESQMCGRRVSANQALAFPRTG 76 (169) T ss_pred CeeEeeCCCCCccccccccHHHHHHHHHHcCC----cCCCChHHHHHHHHHHHHHhhhccccceeeeCCcccccccccCC Confidence 99999999999999999999999999999976 66779999999999999999972 38999999999999999999 Q ss_pred CcccCCeeeccccchHHHHHHHHHHHHHHHcCCCCC-CccccceeEEec-ceeEEeecCCCCcc--cchHHHHHHHhhhh Q lcl|NC_019411. 82 VYDVDGFLIPSDAIPQQLMEATAEMAAALMNNDWTS-PQTTRGMKEIQV-DVIELKFDSEIQRG--SMPDIVMSILEGLG 157 (173) Q Consensus 82 v~~~dg~~~~~d~IP~~V~~A~~elA~~~~~~~~~~-~~~~~~v~~ekV-G~i~veY~~~~~~~--~~~~~v~~lL~~ll 157 (173) +. .||..+|+|.||++||+||||||++++++++.+ +...+.|++|+| |+|+|||+.+++.+ +.++++++||+||| T Consensus 77 ~~-~~g~~~~~~~IP~~v~~A~~elA~~~~~g~~~~~~~~~~~v~~e~v~G~i~veY~~~~~~~~~~~~~~~~~LL~p~l 155 (169) T protein:vir:78 77 VT-LHGFPQPSNVIPPLVIQAQVMAAVEYGAGTDVRGSTDGREVQTERVEGAVTVSYFKNGYSGGTVSITTADDALRPLL 155 (169) T ss_pred ce-ecccccccccchHHHHHHHHHHHHHHhcCcccCCCCCcceeEEEEecCceeEeecCCCCCCCcccHHHHHHHhhhhc Confidence 86 999999999999999999999999999986555 556677988887 99999998776554 57899999999999 Q ss_pred hcccCCCcceeeeecC Q lcl|NC_019411. 
158 VVKTGTRPAFKKIIRH 173 (173) Q Consensus 158 ~~~~g~~~~~~~~~R~ 173 (173) + +|+|+..++++|- T Consensus 156 ~--~~~g~~~i~~~rg 169 (169) T protein:vir:78 156 C--GSNNAYSFNVFRG 169 (169) T ss_pred c--cCCCcceeeeecC Confidence 6 4466677999999 No 5 >protein:vir:95004 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224025;genbank:gi:62327312;genbank:GeneID:5176832 Probab=100.00 E-value=4.5e-58 Score=334.96 Aligned_cols=164 Identities=21% Similarity=0.264 Sum_probs=146.5 Q ss_pred eEEEEeCCCCCCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhh-cccccccCCccccccCCcCC Q lcl|NC_019411. 3 FTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRS-IAWAGEKVDEDSGLRWPRTG 81 (173) Q Consensus 3 M~live~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~-~~f~G~r~~~~Q~lawPR~g 81 (173) |+||||||+|+|+||||+|++||++||++|++ |.++|+.+||++|++|++|||+. ++|+|+|++++|+|+|||+| T Consensus 1 M~liv~~~~g~~~anSYvt~~ea~aY~~~rg~----~~~~dd~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg 76 (169) T protein:vir:95 1 MPLIVETGQGLPNADSYVSLEDGRALAAKYGL----ELPEDDIAAEASLRNGAVYVGLFESQMCGRRVSANQALAFPRTG 76 (169) T ss_pred CeeEEeCCCCCCcccccccHHHHHHHHHHcCC----cCCCCHHHHHHHHHHHHHHhhccccccccccCCcchhhccccCC Confidence 99999999999999999999999999999975 66779999999999999999983 38999999999999999999 Q ss_pred CcccCCeeeccccchHHHHHHHHHHHHHHHcCCCCC-CccccceeEEec-ceeEEeecCCCCcc--cchHHHHHHHhhhh Q lcl|NC_019411. 82 VYDVDGFLIPSDAIPQQLMEATAEMAAALMNNDWTS-PQTTRGMKEIQV-DVIELKFDSEIQRG--SMPDIVMSILEGLG 157 (173) Q Consensus 82 v~~~dg~~~~~d~IP~~V~~A~~elA~~~~~~~~~~-~~~~~~v~~ekV-G~i~veY~~~~~~~--~~~~~v~~lL~~ll 157 (173) + +.||..++++.||++||+||||||+++++++... 
+...+.|++||+ |+||+||+.+++.+ +.|+++++||+||| T Consensus 77 ~-~~~g~~~~~~~IP~~V~~A~~elA~~~~~g~~~~~~~~~~~v~~e~v~G~i~veY~~~~~~~~~~~~~a~~~LL~p~l 155 (169) T protein:vir:95 77 I-DLHGFPQPSNVIPSLVIQAQVMAAVEYGAGTDVRGSTDGREVQTERVEGAVTVSYFKNGYSGGTVSITAADDALRPLL 155 (169) T ss_pred c-eecccccccccchHHHHHHHHHHHHHHHcCccccCCCCccceeeeeeccceeEeecCCCCcCccccHHHHHHhhhhhc Confidence 7 4999999999999999999999999999986544 455667888766 99999998877654 57899999999999 Q ss_pred hcccCCCcceeeeecC Q lcl|NC_019411. 158 VVKTGTRPAFKKIIRH 173 (173) Q Consensus 158 ~~~~g~~~~~~~~~R~ 173 (173) + +|+|+..++++|- T Consensus 156 ~--g~~g~~~i~~~rg 169 (169) T protein:vir:95 156 C--GSNNAYSFNVFRG 169 (169) T ss_pred c--cCCCcceeeeecC Confidence 6 4466667999999 No 6 >protein:vir:97267 Length: 172 # NCBI annotation: hypothetical protein ORF024 # Family: family:all:703 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294532;genbank:gi:149408253;genbank:GeneID:5237132 Probab=100.00 E-value=1.2e-55 Score=321.70 Aligned_cols=161 Identities=22% Similarity=0.294 Sum_probs=138.5 Q ss_pred eEEEEeCCCC-CCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhccccccc-CCccccccCCcC Q lcl|NC_019411. 3 FTFVVETGAG-DPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEK-VDEDSGLRWPRT 80 (173) Q Consensus 3 M~live~g~g-~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r-~~~~Q~lawPR~ 80 (173) |+||||||+| +|+||||+|+++|++||+.|++ .|.+.++++||++|++|++|||+.|+|+|+| ++++|+|+|||+ T Consensus 1 m~liveD~t~~~~~AnSYvtv~~a~aY~~~rg~---~~~a~~~~~ke~aLi~A~~yiD~~~~f~G~r~~~~~Q~l~WPRt 77 (172) T protein:vir:97 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGN---SFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRT 77 (172) T ss_pred CceEeeCCCCCCCCccccccHHHHHHHHHhcCc---ccCCCCcHHHHHHHHHHHHHHhhhhhcccCCCCCcchhhhcccC Confidence 8999999998 8999999999999999999976 3878899999999999999999989999987 589999999999 Q ss_pred CCcccCCeeeccccchHHHHHHHHHHHHHHHcCCCCCCc---c---ccceeEEecceeEEeecCCCCc---ccchHHHHH Q lcl|NC_019411. 
81 GVYDVDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQ---T---TRGMKEIQVDVIELKFDSEIQR---GSMPDIVMS 151 (173) Q Consensus 81 gv~~~dg~~~~~d~IP~~V~~A~~elA~~~~~~~~~~~~---~---~~~v~~ekVG~i~veY~~~~~~---~~~~~~v~~ 151 (173) |++ ||..+|+|.||++||+||||||++++++++.+.. . ...+||+|||+|+++|+..++. .+.|++|++ T Consensus 78 g~~--d~~~~~~~~IP~~v~~A~~elA~~al~~~l~~d~~~~~~~~~v~~kr~kvg~i~~~y~~~~~~~~~~p~~~~v~a 155 (172) T protein:vir:97 78 DAW--DRDRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQ 155 (172) T ss_pred CCC--CCcccccccccHHHHHHHHHHHHHHHhcccccccccccccccceeeeeeecceeeEeeccCCCCCccccHHHHHH Confidence 985 7899999999999999999999999999875432 2 1248999999999999764443 468999999 Q ss_pred HHhhhhhcccCCCcceeeeecC Q lcl|NC_019411. 152 ILEGLGVVKTGTRPAFKKIIRH 173 (173) Q Consensus 152 lL~~ll~~~~g~~~~~~~~~R~ 173 (173) ||+|++...+| + +++|. T Consensus 156 LL~p~gl~~~~--~---~~~r~ 172 (172) T protein:vir:97 156 KLVRAGLVRSG--G---TLLRG 172 (172) T ss_pred HHhhhccccCc--c---eeccC Confidence 99998643322 2 56677 No 7 >protein:vir:43 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463469;swissprot:trembl:q9t1b5;genbank:gi:16798791;uniprot:Q9T1B5;genbank:GeneID:922374 Probab=98.15 E-value=3.9e-08 Score=61.18 Aligned_cols=125 Identities=18% Similarity=0.199 Sum_probs=78.4 Q ss_pred cccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccch Q lcl|NC_019411. 17 NSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIP 96 (173) Q Consensus 17 nSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP 96 (173) =+|+|.++..... + + ...++++-+++|.+|+++||. +.| -|.. 
..+..-..+.+| T Consensus 1 M~Y~d~~~Y~~~y---~-g----~~i~e~~F~~l~~rAs~~ID~-~T~-------------~ri~---~~~~~~~~~~~~ 55 (131) T protein:vir:43 1 MPYTTLEFYNDEY---A-G----EHLEQDEFDKLLKHAERKIDS-VTF-------------YRIR---KGGIESFSEFIQ 55 (131) T ss_pred CCCCCHHHHHHhh---C-C----CCCCHhHHHHHHHHHHHHHHH-Hhc-------------cccc---ccCccccchhhH Confidence 6799998875432 1 1 245678889999999999997 332 2211 011111124689 Q ss_pred HHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCccc------chHHHHHHHhhhhhcccCCCcceee Q lcl|NC_019411. 97 QQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGS------MPDIVMSILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 97 ~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~------~~~~v~~lL~~ll~~~~g~~~~~~~ 169 (173) .+||.|+|+.|-.+...+.......+++++++||..||+|...+.... ....+..+|.+-+.- -+|-+ .| T Consensus 56 ~~vk~A~c~q~e~~~~~g~~s~~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~TGLl--yrGV~-~~ 131 (131) T protein:vir:43 56 HQIQLATCNQIEYFKEAGGTSELAVSKPDNVSIGRTSISDSNFASTATSLNSGLIGSDVRSYLAHTGLL--YNGVG-VR 131 (131) T ss_pred HHHHHHHHHHHHHHHHhHHHhhhhccccCeeecCceEEeecccccchhhhchhhhHHHHHHHHhccCCe--ecCCC-CC Confidence 999999999997766654333334456899999999999975443221 345677777663321 13333 22 No 8 >protein:vir:80967 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468394;genbank:gi:157324968;genbank:GeneID:5601402 Probab=98.14 E-value=3.9e-08 Score=61.18 Aligned_cols=125 Identities=17% Similarity=0.165 Sum_probs=78.8 Q ss_pred cccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccch Q lcl|NC_019411. 17 NSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIP 96 (173) Q Consensus 17 nSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP 96 (173) =+|+|.++...... | ...++++-+++|.+|+++||. +.| -|.. 
..+..=..+.+| T Consensus 1 M~Y~d~~~Y~~~y~--G------~~i~e~~F~~l~~rAs~~ID~-~T~-------------~ri~---~~~~d~~~~~~~ 55 (131) T protein:vir:80 1 MPYTTLEFYTNEYA--G------EHLEQDEFAKLLKHAERKIDS-VTF-------------YRIR---KSGIEAFSEFIQ 55 (131) T ss_pred CCCCCHHHHHHhhC--C------CCCchhHHHHHHHHHHHHHHH-Hhc-------------cccc---ccccccCchhHH Confidence 67999988754321 1 234677888999999999997 332 2211 011111124689 Q ss_pred HHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCccc------chHHHHHHHhhhhhcccCCCcceee Q lcl|NC_019411. 97 QQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGS------MPDIVMSILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 97 ~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~------~~~~v~~lL~~ll~~~~g~~~~~~~ 169 (173) .+||.|+|+.|-.+...+.......+++++++||..||+|...+..+. ....+..+|.+-+.- -+|-+ .| T Consensus 56 ~~vk~A~c~q~e~~~~~g~~~~~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~TGLl--yrGV~-~~ 131 (131) T protein:vir:80 56 HQIQLATCNQIEYFKEAGGTSELAVSKPDNVSIGRTSISDSNFASTATSLNSGLVGSDVRSYLAHTGLL--YNGVG-VR 131 (131) T ss_pred HHHHHHHHHHHHHHHHhhhhhhhcccccCeeeeCceEEeeccccchhhhhhhhhhHHHHHHHHhccCCe--ecCCC-CC Confidence 999999999997766655444334567999999999999976443221 345577777663321 13333 22 No 9 >protein:vir:98900 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:3882 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164420;genbank:gi:56694910;genbank:GeneID:3197290 Probab=97.69 E-value=1.1e-06 Score=53.30 Aligned_cols=124 Identities=15% Similarity=0.041 Sum_probs=77.5 Q ss_pred cccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccch Q lcl|NC_019411. 17 NSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIP 96 (173) Q Consensus 17 nSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP 96 (173) =+|+|.++.+.|. | +..++++-+++|.+|+++||. +.| + |. 
+.++..-....++ T Consensus 1 M~Y~t~~~Y~~~~---G------~~i~e~~F~~l~~rAs~~ID~-iT~-~------------ri---~~~~~~~d~~~~~ 54 (132) T protein:vir:98 1 MPYLTYEEFMDLN---G------RDIDDKKFEKLLPKASAIIDG-VTG-H------------FY---QKVDMEKDNAWRV 54 (132) T ss_pred CCCCCHHHHHhhc---C------CCCCHHHHHHHHHHHHHHHHH-Hhc-c------------cc---cCCCccccChHHH Confidence 6899999987762 1 234677889999999999997 333 1 10 0111111124577 Q ss_pred HHHHHHHHHHHHHHHcCCCCCC-ccccceeEEecceeEEeecCCCC-cc---c---chHHHHHHHhhhhhcccCCCccee Q lcl|NC_019411. 97 QQLMEATAEMAAALMNNDWTSP-QTTRGMKEIQVDVIELKFDSEIQ-RG---S---MPDIVMSILEGLGVVKTGTRPAFK 168 (173) Q Consensus 97 ~~V~~A~~elA~~~~~~~~~~~-~~~~~v~~ekVG~i~veY~~~~~-~~---~---~~~~v~~lL~~ll~~~~g~~~~~~ 168 (173) .+||.|+|..+-.+...+.... ...+.+++++||..+|+|..+.+ .+ . ....+..+|.+.+.-. +|.+.- T Consensus 55 ~~vk~A~c~qiey~~~~G~~sae~~~~~~~S~svG~~Svs~~s~~~~~~~~~~~~~~~~~a~~~L~~tGLLy--rGV~~~ 132 (132) T protein:vir:98 55 NQFKLALCAQIEYFDALGATTFEEINNSPQTFQAGRTSVSNASRYNPSGANESKPLVAEDVYIYLQGTGLLF--QGVKTW 132 (132) T ss_pred HHHHHHHHHHHHHHHhccchhhhhccCccceeeeCcEEEEeeccCCcccccccccchHHHHHHHHhhcCCcc--ccCCCC Confidence 8999999999977666554333 23556999999999999964332 11 1 2355777886643322 222222 No 10 >protein:vir:79701 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285884;genbank:gi:148750841;genbank:GeneID:5220403 Probab=94.32 E-value=0.0021 Score=35.21 Aligned_cols=132 Identities=11% Similarity=0.140 Sum_probs=76.2 Q ss_pred ccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhccc-ccccCCccccccCCcCCCcccCCeeecccc Q lcl|NC_019411. 16 ANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAW-AGEKVDEDSGLRWPRTGVYDVDGFLIPSDA 94 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f-~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~ 94 (173) --+|.|-+|.+.+ +. ...++++-+++|.+|.+-||...++ .+.- =+.. .+.|+..=.+.. 
T Consensus 1 ~~pYLTy~ef~~l----g~-----~~~~~d~F~kllk~A~~~ID~~T~y~~~~y---------~~~~-i~~d~~~d~~~~ 61 (144) T protein:vir:79 1 MKPYLTTSDFEKL----GY-----ELKKPDNFGKLLKSATVLINQICSYYDPAF---------AYHD-LEADSQADPDSY 61 (144) T ss_pred CCcccchhhhhhh----CC-----CCcchhhhhhHHHHHHHHhhhhhhhhcccc---------cccc-ccccccccchhh Confidence 5789998887644 21 1235677899999999999984432 1100 0000 111111112234 Q ss_pred ch---HHHHHHHHHHHHHHHcCCCCCC-c-cccceeEEecceeEEeecCCCCccc------chHHHHHHHhhhhhcccCC Q lcl|NC_019411. 95 IP---QQLMEATAEMAAALMNNDWTSP-Q-TTRGMKEIQVDVIELKFDSEIQRGS------MPDIVMSILEGLGVVKTGT 163 (173) Q Consensus 95 IP---~~V~~A~~elA~~~~~~~~~~~-~-~~~~v~~ekVG~i~veY~~~~~~~~------~~~~v~~lL~~ll~~~~g~ 163 (173) || .+||.|.|.-...+-..+.... + ..+.+++.+||-.+|+|...+..+. ..+-+..+|.+.+.-- + T Consensus 62 ~~~r~~~vKkA~a~QIeY~~~~G~~sa~e~~~~~~~S~svGrtsvs~~~~~~~s~t~~~~~v~~~a~~yL~~tGLLY--r 139 (144) T protein:vir:79 62 LFRQAMAFKKAVALEMLFLEDSGYSSAYDVAQGALNSFTVGHTSMSLNPSAGQNLTVGSTGVVKSAYDLLGRYGLLF--S 139 (144) T ss_pred hhHHHHHHHHHHHHHHHHHHHcCCcchhhhhcCccceeEecceEEeecCCCccccccccccccHHHHHHHhhcCccc--c Confidence 56 4567777776655544444333 2 3567999999999999965443321 2367777887754322 3 Q ss_pred Cccee Q lcl|NC_019411. 164 RPAFK 168 (173) Q Consensus 164 ~~~~~ 168 (173) |.+.+ T Consensus 140 GV~s~ 144 (144) T protein:vir:79 140 GVASL 144 (144) T ss_pred ccccC Confidence 44433 No 11 >protein:vir:9821 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795583;genbank:gi:28876338;genbank:GeneID:1257921 Probab=93.86 E-value=0.0013 Score=36.49 Aligned_cols=126 Identities=15% Similarity=0.131 Sum_probs=72.5 Q ss_pred CeeEEEEeCCCCCCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcC Q lcl|NC_019411. 1 MAFTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRT 80 (173) Q Consensus 1 m~M~live~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~ 80 (173) |-|+.| +|.|.+|.+.+ +. 
+ +++.-+++|.+|++-||..+++.= T Consensus 1 ~~~~~M-----------~YlT~eey~~l----~~-----~--~~~dF~kllk~As~~ID~~t~~~y-------------- 44 (138) T protein:vir:98 1 MEVVII-----------AFLTQKEFEDL----GF-----D--DVEDFEKMEKRASHAVNLYCRNRY-------------- 44 (138) T ss_pred Cccccc-----------cccchHHHhcc----CC-----C--ChhhHHHHHHHHHHHhhhhhcccc-------------- Confidence 666554 79999987654 22 1 334588999999999998443221 Q ss_pred CCcccCCeeeccc--cchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCccc----------chHH Q lcl|NC_019411. 81 GVYDVDGFLIPSD--AIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGS----------MPDI 148 (173) Q Consensus 81 gv~~~dg~~~~~d--~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~----------~~~~ 148 (173) ++.-|.++ .+=.+||.|.|.--..+...+.......+..++.+||-.++.|+....+++ ...- T Consensus 45 -----~~~d~e~d~~~r~~~vKkA~a~QIeY~~~~G~ts~~d~~~~~s~svGrTSiS~~~~~~~~s~~~~~~~~~~~s~~ 119 (138) T protein:vir:98 45 -----DYKDLKKEIALVQKAVKRAIAYQIAYLNDSGVMTAEDKQSFAGISLGRTSISYTVGHGQGSQQKTLADRFNLCLD 119 (138) T ss_pred -----ccccccchhHHHHHHHHHHHHHHHHHHHHcCCcchhhccCcCceEeeeeEeecccccccccccccccccccccHH Confidence 11112221 123456666665554444444444444677899999999999954443221 1224 Q ss_pred HHHHHhhhhhcccCCCcceee Q lcl|NC_019411. 149 VMSILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 149 v~~lL~~ll~~~~g~~~~~~~ 169 (173) +..+|.+.+.- =+|..--| T Consensus 120 A~~~L~~tGLL--Y~GV~yd~ 138 (138) T protein:vir:98 120 AENELLVVGLG--YTGISYDR 138 (138) T ss_pred HHHHHhhcCcc--cccCcccC Confidence 45577664431 23444444 No 12 >protein:vir:9576 Length: 131 # NCBI annotation: gp42 # Family: family:all:1271 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862881;genbank:gi:32469473;genbank:GeneID:1461318 Probab=93.50 E-value=0.0056 Score=32.92 Aligned_cols=123 Identities=17% Similarity=0.129 Sum_probs=73.6 Q ss_pred ccccccHHHHHHHHHhccCccccccCCCHH---HHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeecc Q lcl|NC_019411. 
16 ANSYCDVQFADDYIYANVYANTAWDALDQD---GKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPS 92 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~---~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~ 92 (173) -+.|+|+++..+-| | .++++ ..+.+|-.|+++|..++ |+.|. +.+....++ T Consensus 1 m~~fAtv~D~~~rw--r--------~Lt~~E~~ra~~LL~~As~~ir~~~---------------p~~~~-~l~~~~~~~ 54 (131) T protein:vir:95 1 MENFATVEDLKKLW--R--------ALKFDEEKRAEALLEVVSHSLRVEA---------------KKVGK-DLDGLVATD 54 (131) T ss_pred CCccCCHHHHHHHh--c--------CCCHHHHHHHHHHHHHHHHHHHHhh---------------hhccC-CccccccCC Confidence 68999999998665 3 22333 44567789999998764 44432 344555555 Q ss_pred ccchHHHHHHHHHHHHHHHcCCCCCCccccce--eEEeccee--EEeecCCCCcccchHHHHHHHhhhhhcccCCCccee Q lcl|NC_019411. 93 DAIPQQLMEATAEMAAALMNNDWTSPQTTRGM--KEIQVDVI--ELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFK 168 (173) Q Consensus 93 d~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v--~~ekVG~i--~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~ 168 (173) +..+.-++..+|+.....+..+- +.+ ++ .++..|+. +.+|..++..-..-+.-..+| ++ +|++.+-+ T Consensus 55 ~~~~~~~~~V~~~~V~Ral~~~~---~~~-G~tq~S~TaG~ys~S~t~~~p~g~lylt~~e~~~L-Gl----~~~r~~~i 125 (131) T protein:vir:95 55 PSFTMVVKSVTVDVVARTLMTST---DQE-PMTQVAESALGYSFSGSYLVPGGGLFIKDSELKRL-GL----KKQRYGVI 125 (131) T ss_pred ccchHHHHHHHHHHHHHHhcCCC---CCC-CceeeeeecccceeeeeeecCCCCceeChHHHHHh-CC----CCCceeEE Confidence 66778899999999988765442 111 32 46888988 555654433323334444555 22 24555444 Q ss_pred eeecC Q lcl|NC_019411. 169 KIIRH 173 (173) Q Consensus 169 ~~~R~ 173 (173) .+-=. 
T Consensus 126 ~~~~~ 130 (131) T protein:vir:95 126 DIYGT 130 (131) T ss_pred eeccC Confidence 44333 No 13 >protein:vir:94761 Length: 132 # NCBI annotation: unknown # Family: family:all:1271 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996708;genbank:gi:45597423;genbank:GeneID:2769029 Probab=92.80 E-value=0.0094 Score=31.69 Aligned_cols=125 Identities=13% Similarity=0.005 Sum_probs=65.2 Q ss_pred ccccccHHHHHHHHHhccCccccccCCCHH---HHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeecc Q lcl|NC_019411. 16 ANSYCDVQFADDYIYANVYANTAWDALDQD---GKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPS 92 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~---~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~ 92 (173) -+.|+|+++..+-| | .++++ ..+.+|-.|+++|..++ |+.+.-......... T Consensus 1 m~~fAtv~Dl~~r~--r--------~L~~dE~~ra~~LL~dAs~~iR~~~---------------~~~~~~~~~~~~~~~ 55 (132) T protein:vir:94 1 MNPFATVDDLTMLW--R--------PLKGDEKERAEKLLEIVSDTLREEA---------------DKVGRDLDVMISEKP 55 (132) T ss_pred CCCcCCHHHHHHHh--c--------cCChhHHHHHHHHHHHHHHHHHHHH---------------hhhccccccccCCCC Confidence 68999999998654 2 23333 34557789999998653 333321111111111 Q ss_pred ccchHHHHHHHHHHHHHHHcCCCCCCcccc-ceeEEeccee--EEeecCCCCcccchHHHHHHHhhhhhcccCCCcceee Q lcl|NC_019411. 93 DAIPQQLMEATAEMAAALMNNDWTSPQTTR-GMKEIQVDVI--ELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 93 d~IP~~V~~A~~elA~~~~~~~~~~~~~~~-~v~~ekVG~i--~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~~ 169 (173) |..|.-++.-+|.....++..+. +.++ .-.++..|+. +.+|..+...-..-..-..+| ++ +|++.+-+. T Consensus 56 d~~~~~~k~V~~~~V~Ral~~~~---~~~g~tq~S~TaG~ys~S~T~~np~G~lylt~~e~~~L-Gl----~~~r~~~i~ 127 (132) T protein:vir:94 56 SYFSSVVKSVTVDIVARTLMTST---DQEPMTQTTESALGYSVSGSYLVPGGGLFIKNSELSRL-GL----KKQRFGVID 127 (132) T ss_pred ccchhHHHHHHHHHHHHHhcCCC---CCCCceeeeeecccceeeeeeecCCCCceeChHHHHhh-CC----CCCceEEEe Confidence 22344466777888877766532 1111 1246788987 666754433323334445555 22 234444333 Q ss_pred eecC Q lcl|NC_019411. 
170 IIRH 173 (173) Q Consensus 170 ~~R~ 173 (173) +-=. T Consensus 128 ~~~~ 131 (132) T protein:vir:94 128 FYGN 131 (132) T ss_pred ecCC Confidence 3333 No 14 >protein:vir:2505 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:28222 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569746;genbank:gi:18496896;genbank:GeneID:932265 Probab=92.10 E-value=0.0022 Score=35.17 Aligned_cols=122 Identities=11% Similarity=0.104 Sum_probs=75.1 Q ss_pred EEeCCCCCCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCccc Q lcl|NC_019411. 6 VVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDV 85 (173) Q Consensus 6 ive~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~ 85 (173) +.|- .+++|+++..+-+... . +..+..+.+.+|-.|+|.|.+.+ |.| T Consensus 1 ~~~~-------~alAtvdDv~~~lrr~-L-----t~dE~~~a~~Ll~eAsdlI~g~l-~~~------------------- 47 (128) T protein:vir:25 1 MTEC-------KALATSQDVKRALRRD-L-----TEAEQTDLSELLAEATDLVVGYL-HPY------------------- 47 (128) T ss_pred Cccc-------hhccCHHHHHHHhcCC-C-----CHHHHHHHHHHHhcchheeeeec-CCC------------------- Confidence 3343 7899999988665321 1 11122333445679999998742 222 Q ss_pred CCeeeccccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccc--hHHHHHHHhhhhhcccCC Q lcl|NC_019411. 86 DGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSM--PDIVMSILEGLGVVKTGT 163 (173) Q Consensus 86 dg~~~~~d~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~--~~~v~~lL~~ll~~~~g~ 163 (173) ++| |.+|.-|+.-+|..+..+++-++.... .-++.+-|+.+++|..+++++.. -..-+.+|+|+= T Consensus 48 ---~vp-~~~p~~v~rVvA~ivarAltr~~~~~p---e~~S~TAgpfs~~ft~~~~~~g~yLTaa~k~~Lrp~R------ 114 (128) T protein:vir:25 48 ---PVP-TPTPGPIKRVVASMVAAVLTRPTQILP---ETQSLTADGFGVTFTPGGNSPGPYLSAALKQRLRPYR------ 114 (128) T ss_pred ---CCC-CCCCchHHHHHHHHHHHHhhCCCccCC---CceeeecccccccccCCCCCCCceEcHHHHhhccccc------ Confidence 122 568888999999999888776654433 22555779999888766666654 467788999982 Q ss_pred CcceeeeecC Q lcl|NC_019411. 
164 RPAFKKIIRH 173 (173) Q Consensus 164 ~~~~~~~~R~ 173 (173) .+.|-.=.=| T Consensus 115 ~~~~sV~l~s 124 (128) T protein:vir:25 115 TGMVAVEMGS 124 (128) T ss_pred ceeeEeeccc Confidence 2222211222 No 15 >protein:vir:4788 Length: 130 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150168;swissprot:trembl:q94m43;genbank:gi:15088779;uniprot:Q94M43;genbank:GeneID:955972 Probab=90.27 E-value=0.011 Score=31.27 Aligned_cols=124 Identities=13% Similarity=0.050 Sum_probs=73.2 Q ss_pred cccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcc-cccccCCccccccCCcCCCcccCCeeeccccc Q lcl|NC_019411. 17 NSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIA-WAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAI 95 (173) Q Consensus 17 nSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~-f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~I 95 (173) =.|.|.+|.+.| +. . +++.-+++|.+|++-||...+ |.=...+- .=+...+ T Consensus 1 M~YlT~eey~el----~~-----~--~~~~F~kl~k~A~~~ID~~t~~~y~~~~~~-----------------~~~~~~r 52 (130) T protein:vir:47 1 MTYLTQEEFDEL----DF-----D--EVTDFEKLAKRAKIAIDLYTNGIYQKDIDF-----------------EKEIAYR 52 (130) T ss_pred CCCCchhhHhhc----CC-----C--ChhhHHHHHHHHHHHHHHHhcccccccCCc-----------------cCcchHH Confidence 679999998866 22 1 345688999999999997443 22111111 1112234 Q ss_pred hHHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcc--cchH---HHHHHHhhhhhcccCCCcceee Q lcl|NC_019411. 96 PQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRG--SMPD---IVMSILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 96 P~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~--~~~~---~v~~lL~~ll~~~~g~~~~~~~ 169 (173) =.+||.|.|.-...+-..+.......+.+.+.+||-.+++|....... ..+. -+..+|.+.+.. 
.=+|..--| T Consensus 53 ~~~vK~A~a~QieY~~~~G~~s~~~~~~~~S~svGrtSis~~~~~~~~~~~~~~vs~da~~~L~~tGL~-Ly~GV~yd~ 130 (130) T protein:vir:47 53 KSAVKLAMAFQIAYLDASGIMSADDKQLANSVSIGRTSISYSTSQSTLAGQRFNLSMDAENALRQAGFS-LVVGVAYDR 130 (130) T ss_pred HHHHHHHHHHHHHHHHHhccccchhccCcceeeecceeeecCcCccccccCCccccHHHHHHHHhcccc-cccCCCccC Confidence 467888887777665555544444577799999999999997644322 2232 344466554320 013333344 No 16 >protein:vir:80320 Length: 188 # NCBI annotation: gp8, conserved hypothetical protein # Family: family:all:501 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111087;genbank:gi:134288682;genbank:GeneID:4960567 Probab=90.04 E-value=0.019 Score=29.97 Aligned_cols=128 Identities=16% Similarity=0.064 Sum_probs=54.4 Q ss_pred eEEEEeCCCCCCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHH-HHHHHHHhhhcccccccC---Cccccc-cC Q lcl|NC_019411. 3 FTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFL-VRASKYLDRSIAWAGEKV---DEDSGL-RW 77 (173) Q Consensus 3 M~live~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL-~~As~~id~~~~f~G~r~---~~~Q~l-aw 77 (173) |..|+-+.+ | +..=+|++|++++.... ..-+|+..+..| ..|.+++|+. .|+.- ...+.+ .| T Consensus 1 M~~~~~~~p--p-a~ePVtL~e~K~hLRid-------~~~eD~~l~~~lI~aA~~~~E~~---~gr~l~~qt~~~~~~~~ 67 (188) T protein:vir:80 1 MAAVLVEYL--D-DAEPLTFEEVAFQCRID-------DDDERDFVERIVIPGARQAAESK---SGAAIRKARYVERLSGF 67 (188) T ss_pred CCceeeccC--C-CCcccCHHHHHHHcCCC-------CchhhHHHHHHHHHHHHHHHHHH---hCCeeeeeeEEEEecCC Confidence 444443322 2 33348999999985322 122456665555 5788899873 33221 111111 23 Q ss_pred CcCCCc---------------ccCCeeec---------------------------------------cccchHHHHHHH Q lcl|NC_019411. 78 PRTGVY---------------DVDGFLIP---------------------------------------SDAIPQQLMEAT 103 (173) Q Consensus 78 PR~gv~---------------~~dg~~~~---------------------------------------~d~IP~~V~~A~ 103 (173) |+.+.. +.+|.... .+.+|..||+|. 
T Consensus 68 ~~~~i~Lp~~PV~sV~sV~~~d~~G~~~~l~~~~y~l~~~~~~~~l~~~~~~~~p~~~~V~V~~~AG~~~~vP~~ik~ai 147 (188) T protein:vir:80 68 PLAEISLSVGQVIRVDSIEIRDASGATTTLDADAFELVQLGREALLVPEGQARWPFARAVTITYQAGVDLARYPSVRTWM 147 (188) T ss_pred CCCceEecccccceeeEEEEEcCCCcEEeecccceEEeecCCCcEEEEecCCCCCCCceEEEEEEecccccChHHHHHHH Confidence 332111 11111100 023455555555 Q ss_pred HHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccc-hHHHHHHHhhhhhcccCCCcce Q lcl|NC_019411. 104 AEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSM-PDIVMSILEGLGVVKTGTRPAF 167 (173) Q Consensus 104 ~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~-~~~v~~lL~~ll~~~~g~~~~~ 167 (173) ..++..+.++-... . .+...+.. +..+++||+||=+ -+|| T Consensus 148 ll~va~~Ye~Re~~---------------~----~g~~~~~~P~~~v~~Ll~pyRv-----p~~~ 188 (188) T protein:vir:80 148 LLAAAWAYDHRELF---------------S----EGQPIGEMPGGYADVLLNPITV-----PPRF 188 (188) T ss_pred HHHHHHHHhccccc---------------c----cccccccccHHHHHHHhhccCC-----CCCC Confidence 55554443321100 0 00001112 2346777777744 2233 No 17 >protein:vir:9761 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795523;genbank:gi:28876281;genbank:GeneID:1257822 Probab=89.36 E-value=0.024 Score=29.47 Aligned_cols=123 Identities=12% Similarity=0.044 Sum_probs=64.3 Q ss_pred ccccccHHHHHHHHHhccCccccccCCCH---HHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeecc Q lcl|NC_019411. 16 ANSYCDVQFADDYIYANVYANTAWDALDQ---DGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPS 92 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~---~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~ 92 (173) -+.|+|+++..+-| | .+++ +..+.+|-.|+++|...+ |+.|. +.+-..... 
T Consensus 1 m~~fATv~Dv~~rw--r--------~Lt~dE~~ra~~LL~dAS~~iR~~~---------------p~~g~-~~~~~~~~~ 54 (140) T protein:vir:97 1 MGNFATTDDVILLW--R--------PLSVDELKRANALLKVVSDTLRMEA---------------DKVGK-DLDKTMVDK 54 (140) T ss_pred CCcCCCHHHHHHHh--c--------CCCHhHHHHHHHHHHHHHHHHHHhh---------------hhccC-CcchhcccC Confidence 68999999998765 3 2233 344567789999998754 44331 111111112 Q ss_pred ccchHHHHHHHHHHHHHHHc-CCCCCCcccc-ceeEEeccee--EEeecCCCCcccchHHHHHHHhhhhhcccCCCccee Q lcl|NC_019411. 93 DAIPQQLMEATAEMAAALMN-NDWTSPQTTR-GMKEIQVDVI--ELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFK 168 (173) Q Consensus 93 d~IP~~V~~A~~elA~~~~~-~~~~~~~~~~-~v~~ekVG~i--~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~ 168 (173) +.-+.-++..+|......+. +++. ++ .-.++..|+. +.+|..+...-..-+.-..+| ++ +|.+.+-+ T Consensus 55 ~~~~~~~k~V~~~mV~Ral~~~~d~----~G~tq~S~TaG~ys~S~T~~np~G~lylt~~e~~~L-Gl----~~~r~~~i 125 (140) T protein:vir:97 55 PYFVNVIKSVTVDIVARTLMTSTQG----EPMSQESQSALGYTWSGTYLVPGGGLFIKDNELKRL-GL----KKQRYGGI 125 (140) T ss_pred ccchhHHHHHHHHHHHHHhcCCCCC----CcceeeeeeccchhheeeeecCCCCceeChHHHHHh-CC----CCCceeee Confidence 22344456677777766443 3221 11 1246788988 666754433333334444555 22 24444433 Q ss_pred eee----cC Q lcl|NC_019411. 169 KII----RH 173 (173) Q Consensus 169 ~~~----R~ 173 (173) .+. |. T Consensus 126 ~~~g~~~~~ 134 (140) T protein:vir:97 126 ELYGEIKRD 134 (140) T ss_pred cccCccccC Confidence 332 22 No 18 >protein:vir:107756 Length: 147 # NCBI annotation: gp21 # Family: family:all:664 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024869;genbank:gi:48697511;genbank:GeneID:2948377 Probab=89.09 E-value=0.016 Score=30.43 Aligned_cols=126 Identities=13% Similarity=0.052 Sum_probs=62.3 Q ss_pred CeeEEEEeCCCCCCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcC Q lcl|NC_019411. 
1 MAFTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRT 80 (173) Q Consensus 1 m~M~live~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~ 80 (173) |+.++- ++.+.+.+ -........+|+..+..|..|..+|+.. +| +. T Consensus 1 m~v~fd---------------~~~Fr~~f----PeFad~~~~pd~~i~~~l~~A~~~l~~~-~~-------------~~- 46 (147) T protein:vir:10 1 MDHTLD---------------ITKFRALF----PEFNNDVKYPDALLEQWYAVAGEYLGLT-DY-------------AC- 46 (147) T ss_pred CceecC---------------HHHHHHhc----ccccCCccCCHHHHHHHHHHHHHhhccc-cC-------------Cc- Confidence 433332 33343322 2222233568999999999999999963 32 11 Q ss_pred CCcccCCeeeccccchHHHHHHHHHHHHHHHcCC--CCC-CccccceeEEecceeEEeecCCCCcccc--------h-HH Q lcl|NC_019411. 81 GVYDVDGFLIPSDAIPQQLMEATAEMAAALMNND--WTS-PQTTRGMKEIQVDVIELKFDSEIQRGSM--------P-DI 148 (173) Q Consensus 81 gv~~~dg~~~~~d~IP~~V~~A~~elA~~~~~~~--~~~-~~~~~~v~~ekVG~i~veY~~~~~~~~~--------~-~~ 148 (173) +.+ ...-.++.+.++.+++.-. ... ....+.++++++|+|||+|+........ + .- T Consensus 47 ---~~~---------g~~~~~~l~Ll~AHll~l~~~~~~g~g~~G~v~Sas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~ 114 (147) T protein:vir:10 47 ---GLN---------GNTLDLALMQLTAHLMKSATILSSNKGAPMVMTSATIDKVSISTLAPPIKNGWQYWLSTTPYGQM 114 (147) T ss_pred ---ccC---------hhhHHHHHHHHHHHHHHHHHhhccCCCcccceeeeeecceeeeeecCCCCCcchhhhhcCHHHHH Confidence 011 1334555666665444321 111 1234568999999999999855332221 1 12 Q ss_pred HHHHHhhhhhc--ccCCCcceeeeecC Q lcl|NC_019411. 149 VMSILEGLGVV--KTGTRPAFKKIIRH 173 (173) Q Consensus 149 v~~lL~~ll~~--~~g~~~~~~~~~R~ 173 (173) .-+|++.+... -. +|...-.-+|. 
T Consensus 115 y~~l~~~~~~Gg~vv-gG~p~r~a~r~ 140 (147) T protein:vir:10 115 LWALLSMRSSGGFVY-GGSPELSGYRR 140 (147) T ss_pred HHHHHHhhCccceec-CCCCccccccc Confidence 33455554320 00 11122223333 No 19 >protein:vir:100103 Length: 120 # NCBI annotation: gp7 # Family: family:all:363 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945037;genbank:gi:38707897;genbank:GeneID:2744150 Probab=87.93 E-value=0.027 Score=29.18 Aligned_cols=120 Identities=8% Similarity=-0.038 Sum_probs=68.3 Q ss_pred CCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeecc Q lcl|NC_019411. 13 DPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPS 92 (173) Q Consensus 13 ~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~ 92 (173) .++=-+.+|+++++.++.-. ..-||+-.+..+..|.+|+.. |.|++...++...=+-.... ..-... T Consensus 1 ~~~~m~~vtL~e~K~hLRvd-------~d~DD~lI~~~i~AA~~~v~~---~~~r~l~~~~~~~~~~~~~~---~~~~~~ 67 (120) T protein:vir:10 1 MADQTPIVSLEVALAHLRED-------AGVADDLIKIYIGAATQSASD---YVDRKLYANDAEMQAAVADA---TAGADP 67 (120) T ss_pred CCCCCCccCHHHHHHHcCCC-------CCcchHHHHHHHHHHHHHHHH---HhCCcccccccccchhhhcc---cccccc Confidence 55667899999999985432 234778888877888888874 56666543322210000000 001122 Q ss_pred ccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcce Q lcl|NC_019411. 93 DAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAF 167 (173) Q Consensus 93 d~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~ 167 (173) ..||+.++.|++.|.-....+-.... +|. ..+....+..++.||.||=. ..|. 
T Consensus 68 ~~~~~~i~~AvLllvg~~YenRe~~~----------~~~-------~~~~~~lP~~v~~Ll~~yR~-----~~gv 120 (120) T protein:vir:10 68 IVANDAIRAAILLTIGKLYAFREDVV----------SGA-------SASVTELPSGAKSLLFPYRV-----GLGV 120 (120) T ss_pred ccCCHHHHHHHHHHHHHHHhchhhhh----------hcc-------cccccccCHHHHHHHHHhhh-----ccCC Confidence 45899999999999866555432110 010 11222345568889988732 3332 No 20 >protein:vir:1435 Length: 188 # NCBI annotation: hypothetical protein # Family: family:all:501 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536364;genbank:gi:17975169;genbank:GeneID:929149 Probab=87.55 E-value=0.036 Score=28.51 Aligned_cols=127 Identities=20% Similarity=0.104 Sum_probs=55.7 Q ss_pred CeeEEEEeCCCCCCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHH-HHHHHHHhhhcccccccCC---cc-ccc Q lcl|NC_019411. 1 MAFTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFL-VRASKYLDRSIAWAGEKVD---ED-SGL 75 (173) Q Consensus 1 m~M~live~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL-~~As~~id~~~~f~G~r~~---~~-Q~l 75 (173) |.+- +|++ + .+..=+|++|++++..-. ..-+|+..+..| ..|.+++|+.. |+.-- .. .-- T Consensus 1 m~~~-~~~~-p---pa~epVtLae~K~~lrid-------~~~eD~~l~~~li~aA~~~~E~~t---gr~l~~qt~~~~~~ 65 (188) T protein:vir:14 1 MAAV-LVEY-L---DDAEPLTFEEVAFQCRID-------DDDERDFVERVVIPGARQAAESKA---GAAIRKARYVEHLS 65 (188) T ss_pred CCce-eeec-C---CCCCccCHHHHHHHcCCC-------CchhHHHHHHHHHHHHHHHHHHHh---CCeeeeeeEEEEec Confidence 4432 3333 2 234568999999985322 222456665544 57788999632 32110 00 111 Q ss_pred cCCcCCC---------------cccCCee----------------------------------------eccccchHHHH Q lcl|NC_019411. 76 RWPRTGV---------------YDVDGFL----------------------------------------IPSDAIPQQLM 100 (173) Q Consensus 76 awPR~gv---------------~~~dg~~----------------------------------------~~~d~IP~~V~ 100 (173) .||+.+. ++.+|.. 
++ +.||..|| T Consensus 66 ~~~~~~~~Lp~~Pv~sV~sV~~~d~~g~~~~l~~~~y~l~~~~~~~~l~~~~~~~~p~~~~V~V~~~AG~~-~~vP~~ik 144 (188) T protein:vir:14 66 GFPPAEVPLSVGQVISVDSIEIRDASGATTTLDAGAFELVQLGRETLLVPAGQARWPYARAVTIKYQAGID-LARYPSVR 144 (188) T ss_pred CcCCCceEecccCcceeeEEEEEcCCCceEeecccceEEeecCCCcEEEEecCCCCCCCceEEEEEEecCc-cCchHHHH Confidence 1222110 0011110 11 24666666 Q ss_pred HHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccch-HHHHHHHhhhhhcccCCCcce Q lcl|NC_019411. 101 EATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMP-DIVMSILEGLGVVKTGTRPAF 167 (173) Q Consensus 101 ~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~-~~v~~lL~~ll~~~~g~~~~~ 167 (173) +|...++..+.++-... . .+...+..+ ..++.||+||=. -+|| T Consensus 145 ~Aill~va~~Y~~Re~~-----------------~--~g~~~~~lP~~~v~~Ll~pyRv-----P~~~ 188 (188) T protein:vir:14 145 SWMLLAAAWAYDHRELY-----------------S--DGQPMGEMPGGYSDVLLNPITV-----PPRF 188 (188) T ss_pred HHHHHHHHHHHhccccc-----------------c--cccccccccHHHHHHHhhccCC-----CCCC Confidence 66666665544431100 0 001111122 346777777754 2233 No 21 >protein:vir:101652 Length: 188 # NCBI annotation: gp15 # Family: family:all:11114 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654770;genbank:gi:109302768;genbank:GeneID:4156086 Probab=87.32 E-value=0.02 Score=29.96 Aligned_cols=121 Identities=20% Similarity=0.226 Sum_probs=63.0 Q ss_pred HHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccc---------------cccCCc--------------- Q lcl|NC_019411. 22 VQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWA---------------GEKVDE--------------- 71 (173) Q Consensus 22 v~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~---------------G~r~~~--------------- 71 (173) ..+|+..+.. .+ +|...-+-||..|+.-+.+.++|. 
|++.-+ T Consensus 1 ~~~~~~la~~------~~--~da~~v~lAL~~As~~VR~~~g~~i~~V~~dt~~id~~Gg~~~LP~~Pvv~i~~Ve~~~~ 72 (188) T protein:vir:10 1 MTFAQQLADA------FP--EDADDAATALSWAKSQVEGYCGRKFDLVEDDVAIVDPYCGSSLLPSIPVVEISKVEGYLP 72 (188) T ss_pred CchhhhHHHh------cC--CCcchHHHHHHHHHHHHHHHhCCcceeeeeeeeeeecCCCceeeccCcceeeeEEEEEee Confidence 1112222111 11 122223337888888777644321 110000 Q ss_pred --------------------------ccc-------ccCCcC----CCcccCCeeeccccchHHHHHHHHHHHHHHHcCC Q lcl|NC_019411. 72 --------------------------DSG-------LRWPRT----GVYDVDGFLIPSDAIPQQLMEATAEMAAALMNND 114 (173) Q Consensus 72 --------------------------~Q~-------lawPR~----gv~~~dg~~~~~d~IP~~V~~A~~elA~~~~~~~ 114 (173) .|+ -.||+. -|...+ =-++||.+|+...|++|-+++.+| T Consensus 73 ~G~~~~~v~~r~y~~~g~~~~l~~trg~pg~~~~r~~~WPw~p~~VtVTytH----Gy~evP~eiv~lv~d~A~~~~~np 148 (188) T protein:vir:10 73 TGNGMDWVELTNYWFKRDTGLIFDTTGLPGSEWSTGHTWPWLPGSLRVTYTH----GYNPVPDELIDVAIRLAREYQSNP 148 (188) T ss_pred CCcccccccccccccccceeeecccccCcccccccccccccCcceEEEEEec----CCCcccHHHHHHHHHHHHHHhcCc Confidence 011 124421 111111 124799999999999999888775 Q ss_pred CCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccC Q lcl|NC_019411. 115 WTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTG 162 (173) Q Consensus 115 ~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g 162 (173) .. ...++||++|++|+. 
.+..+.-+.=..+|++|-....- T Consensus 149 ~~-------L~q~~vG~~S~tfa~-~~~~sl~~~~~~il~ry~l~~~~ 188 (188) T protein:vir:10 149 EL-------LVSKQVGEIERRFGS-VAGTSLSKADQAILDRYVIATLA 188 (188) T ss_pred cc-------ceeeecCceeeeccc-ccCCcccchhHHhhccccccccC Confidence 43 378999999999984 33223344556677776543222 No 22 >protein:vir:7857 Length: 188 # NCBI annotation: gp14 # Family: family:all:11114 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817464;genbank:gi:29565893;genbank:GeneID:1259086 Probab=87.32 E-value=0.02 Score=29.96 Aligned_cols=121 Identities=20% Similarity=0.226 Sum_probs=63.0 Q ss_pred HHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccc---------------cccCCc--------------- Q lcl|NC_019411. 22 VQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWA---------------GEKVDE--------------- 71 (173) Q Consensus 22 v~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~---------------G~r~~~--------------- 71 (173) ..+|+..+.. .+ +|...-+-||..|+.-+.+.++|. |++.-+ T Consensus 1 ~~~~~~la~~------~~--~da~~v~lAL~~As~~VR~~~g~~i~~V~~dt~~id~~Gg~~~LP~~Pvv~i~~Ve~~~~ 72 (188) T protein:vir:78 1 MTFAQQLADA------FP--EDADDAATALSWAKSQVEGYCGRKFDLVEDDVAIVDPYCGSSLLPSIPVVEISKVEGYLP 72 (188) T ss_pred CchhhhHHHh------cC--CCcchHHHHHHHHHHHHHHHhCCcceeeeeeeeeeecCCCceeeccCcceeeeEEEEEee Confidence 1112222111 11 122223337888888777644321 110000 Q ss_pred --------------------------ccc-------ccCCcC----CCcccCCeeeccccchHHHHHHHHHHHHHHHcCC Q lcl|NC_019411. 72 --------------------------DSG-------LRWPRT----GVYDVDGFLIPSDAIPQQLMEATAEMAAALMNND 114 (173) Q Consensus 72 --------------------------~Q~-------lawPR~----gv~~~dg~~~~~d~IP~~V~~A~~elA~~~~~~~ 114 (173) .|+ -.||+. 
-|...+ =-++||.+|+...|++|-+++.+| T Consensus 73 ~G~~~~~v~~r~y~~~g~~~~l~~trg~pg~~~~r~~~WPw~p~~VtVTytH----Gy~evP~eiv~lv~d~A~~~~~np 148 (188) T protein:vir:78 73 TGNGMDWVELTNYWFKRDTGLIFDTTGLPGSEWSTGHTWPWLPGSLRVTYTH----GYNPVPDELIDVAIRLAREYQSNP 148 (188) T ss_pred CCcccccccccccccccceeeecccccCcccccccccccccCcceEEEEEec----CCCcccHHHHHHHHHHHHHHhcCc Confidence 011 124421 111111 124799999999999999888775 Q ss_pred CCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccC Q lcl|NC_019411. 115 WTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTG 162 (173) Q Consensus 115 ~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g 162 (173) .. ...++||++|++|+. .+..+.-+.=..+|++|-....- T Consensus 149 ~~-------L~q~~vG~~S~tfa~-~~~~sl~~~~~~il~ry~l~~~~ 188 (188) T protein:vir:78 149 EL-------LVSKQVGEIERRFGS-VAGTSLSKADQAILDRYVIATLA 188 (188) T ss_pred cc-------ceeeecCceeeeccc-ccCCcccchhHHhhccccccccC Confidence 43 378999999999984 33223344556677776543222 No 23 >protein:vir:99002 Length: 158 # NCBI annotation: gp31 # Family: family:all:32652 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655896;genbank:gi:109521468;genbank:GeneID:4157967 Probab=86.70 E-value=0.03 Score=28.90 Aligned_cols=123 Identities=11% Similarity=0.055 Sum_probs=74.3 Q ss_pred ccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccc Q lcl|NC_019411. 16 ANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAI 95 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~I 95 (173) --+|+||+|++.+++ |.-..+..+. ..+|=.+||-.=.| +-..|...||-. 
+.+ T Consensus 1 ~~alasvee~~trl~--------~~lp~~~~r~--~a~a~~vLd~~S~~----ar~~~gr~W~~~------------~da 54 (158) T protein:vir:99 1 MAALVSVEEFTTFLR--------VPLPEEGSEK--YTQMEFLLTLASDW----ARELSCKPWLLP------------ADA 54 (158) T ss_pred CcceeeHhhhhhhhc--------ccCChhhhHH--HHHHHHHHHHHHHH----HHHhcCccCCCC------------Ccc Confidence 568999999998863 2211122222 22333455531011 112356778821 357 Q ss_pred hHHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcc-cchHHHHHHHhhhhhcccCCCcceeeeecC Q lcl|NC_019411. 96 PQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRG-SMPDIVMSILEGLGVVKTGTRPAFKKIIRH 173 (173) Q Consensus 96 P~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~-~~~~~v~~lL~~ll~~~~g~~~~~~~~~R~ 173 (173) |.-|+.-|...|-..++++. ++..+.+|+-++.|....... ..-+.=..+|+-|...+ +|...+-+-|. T Consensus 55 P~~vr~ivL~aa~R~~~NP~-------g~~~~~~G~~~~~~~~~g~~~~ffT~~E~~~L~r~~~s~--GG~~~~~ttR~ 124 (158) T protein:vir:99 55 PVTARGIILAASRREWNNPK-------RVSYVVKGPQSATFMQSAYPPGFFTDAEEAKLRSYGRST--GNWGVIETYRD 124 (158) T ss_pred hhHHHHHHHHHHHHHHhcCC-------ceEEeeecchhhhcccccCCCcccCHHHHHHHHHhhccc--CceeEEEeecC Confidence 88888888888888877764 568889999999997666443 33344456777776433 34455566666 No 24 >protein:vir:1887 Length: 108 # NCBI annotation: gp6 # Family: family:all:6764 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037667;genbank:gi:9634125;genbank:GeneID:1262472 Probab=85.66 E-value=0.028 Score=29.06 Aligned_cols=105 Identities=12% Similarity=0.056 Sum_probs=62.1 Q ss_pred CeeEEEEeCCCCCCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcC Q lcl|NC_019411. 1 MAFTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRT 80 (173) Q Consensus 1 m~M~live~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~ 80 (173) |+|.. =+++|+++++.|..-. ..-+|+-.+..+..|.+|+.. |.|++... 
T Consensus 1 ~~~~~-----------M~~vtLee~K~hLRid-------~dddD~lI~~~i~AA~~~v~~---~~~~~~~~--------- 50 (108) T protein:vir:18 1 MAIDV-----------LDVISLSLFKQQIEFE-------EDDRDELITLYAQAAFDYCMR---WCDEPAWK--------- 50 (108) T ss_pred CCCCc-----------ccccCHHHHHHHcCCC-------CCcchHHHHHHHHHHHHHHHH---HhCCcccc--------- Confidence 66654 3689999999995432 123777777777788888864 45544211 Q ss_pred CCcccCCeeeccccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcc Q lcl|NC_019411. 81 GVYDVDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVK 160 (173) Q Consensus 81 gv~~~dg~~~~~d~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~ 160 (173) ....+|..|+.|+..|+-.+.++- |.++.. ....++.++.||.||=. - T Consensus 51 ----------~~~~~p~~ik~AiLllv~~~YenR------------E~~~~~---------~~~~~~~~~~LL~pYR~-~ 98 (108) T protein:vir:18 51 ----------VAADIPAAVKGAVLLVFADMFEHR------------TAQSEV---------QLYENAAAERMMFIHRN-W 98 (108) T ss_pred ----------cccccchHHHHHHHHHHHHHHhcc------------cccccc---------hhhhhHHHHHHHHHHHh-c Confidence 124589999999999886655442 111111 11224678899988621 1 Q ss_pred cCCC---cce Q lcl|NC_019411. 161 TGTR---PAF 167 (173) Q Consensus 161 ~g~~---~~~ 167 (173) -|.. -|. T Consensus 99 ~g~~~~~~~~ 108 (108) T protein:vir:18 99 RGKAESEEGS 108 (108) T ss_pred CCCCCcccCC Confidence 1111 122 No 25 >protein:vir:192 Length: 108 # NCBI annotation: Gp6 # Family: family:all:6764 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037702;genbank:gi:9634167;genbank:GeneID:1262532 Probab=85.66 E-value=0.028 Score=29.06 Aligned_cols=105 Identities=12% Similarity=0.056 Sum_probs=62.1 Q ss_pred CeeEEEEeCCCCCCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcC Q lcl|NC_019411. 1 MAFTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRT 80 (173) Q Consensus 1 m~M~live~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~ 80 (173) |+|.. =+++|+++++.|..-. ..-+|+-.+..+..|.+|+.. |.|++... 
T Consensus 1 ~~~~~-----------M~~vtLee~K~hLRid-------~dddD~lI~~~i~AA~~~v~~---~~~~~~~~--------- 50 (108) T protein:vir:19 1 MAIDV-----------LDVISLSLFKQQIEFE-------EDDRDELITLYAQAAFDYCMR---WCDEPAWK--------- 50 (108) T ss_pred CCCCc-----------ccccCHHHHHHHcCCC-------CCcchHHHHHHHHHHHHHHHH---HhCCcccc--------- Confidence 66654 3689999999995432 123777777777788888864 45544211 Q ss_pred CCcccCCeeeccccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcc Q lcl|NC_019411. 81 GVYDVDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVK 160 (173) Q Consensus 81 gv~~~dg~~~~~d~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~ 160 (173) ....+|..|+.|+..|+-.+.++- |.++.. ....++.++.||.||=. - T Consensus 51 ----------~~~~~p~~ik~AiLllv~~~YenR------------E~~~~~---------~~~~~~~~~~LL~pYR~-~ 98 (108) T protein:vir:19 51 ----------VAADIPAAVKGAVLLVFADMFEHR------------TAQSEV---------QLYENAAAERMMFIHRN-W 98 (108) T ss_pred ----------cccccchHHHHHHHHHHHHHHhcc------------cccccc---------hhhhhHHHHHHHHHHHh-c Confidence 124589999999999886655442 111111 11224678899988621 1 Q ss_pred cCCC---cce Q lcl|NC_019411. 161 TGTR---PAF 167 (173) Q Consensus 161 ~g~~---~~~ 167 (173) -|.. -|. T Consensus 99 ~g~~~~~~~~ 108 (108) T protein:vir:19 99 RGKAESEEGS 108 (108) T ss_pred CCCCCcccCC Confidence 1111 122 No 26 >protein:vir:100245 Length: 113 # NCBI annotation: gp74 # Family: family:all:363 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355410;genbank:gi:77864700;genbank:GeneID:3725967 Probab=85.24 E-value=0.04 Score=28.27 Aligned_cols=113 Identities=13% Similarity=0.088 Sum_probs=65.5 Q ss_pred cccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccch Q lcl|NC_019411. 17 NSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIP 96 (173) Q Consensus 17 nSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP 96 (173) =+++|+++++.+.... ..-||+-.+..+..|.+++.. |.|++...+|.....-..+.+. 
......|| T Consensus 1 M~~vtLee~K~hLRvd-------~d~dD~lI~~li~AA~~~ve~---~l~r~l~~~~~~~~~~~~~~~~---~~~~~~~p 67 (113) T protein:vir:10 1 MALVELKLALGFVRAN-------AGVEDDVVQMLLDAATQSAVD---YLNRQVFETEDAMTTAIEAGTA---GQNPMVVN 67 (113) T ss_pred CCCCCHHHHHHHcCCC-------CCcchHHHHHHHHHHHHHHHH---HhCccccccccccccccccccc---cccccccC Confidence 5689999999885432 223777788777788888874 5676655544433222111110 11224589 Q ss_pred HHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcce Q lcl|NC_019411. 97 QQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAF 167 (173) Q Consensus 97 ~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~ 167 (173) ..|+.|+..+.-....+- |.+. ..+....+..++.||.||=. -+|. T Consensus 68 ~~i~~AvLllv~~~Y~nR------------e~~~--------~~~~~~lP~~v~~Ll~~yR~-----~~g~ 113 (113) T protein:vir:10 68 AAIRAAILKITAELYANR------------EDTA--------FGPITELPLNARALLRPHRI-----IPGV 113 (113) T ss_pred hHHHHHHHHHHHHHHhhh------------hhhc--------hhhhhccCHHHHHHHHHhhh-----hcCC Confidence 999999999986655441 1110 00112345668999999732 2221 No 27 >protein:vir:1640 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695061;genbank:gi:23455752;genbank:GeneID:955488 Probab=84.64 E-value=0.059 Score=27.33 Aligned_cols=125 Identities=13% Similarity=0.074 Sum_probs=67.1 Q ss_pred ccccccHHHHHHHHHhccCccccccCCCH---HHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeecc Q lcl|NC_019411. 16 ANSYCDVQFADDYIYANVYANTAWDALDQ---DGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPS 92 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~---~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~ 92 (173) -+.|+|+++..+-| | .+++ +..+.+|-.|+++|-.++ |+.+. 
..+...-++ T Consensus 1 m~~fAtv~Dv~~r~--r--------~L~~~E~~ra~~lL~dAs~~ir~~~---------------p~~~~-~l~a~~~e~ 54 (132) T protein:vir:16 1 MNPFATVDDLTMLW--R--------PLKGDEKERAEKLLEIVSDSLREEA---------------DKVGR-DLYAMIAEK 54 (132) T ss_pred CCccCCHHHHHHHh--c--------CCCHhHHHHHHHHHHHHHHHHHHhh---------------hhhcc-ccccccccc Confidence 68999999997665 3 2233 345667789999997653 33321 122222222 Q ss_pred cc-chHHHHHHHHHHHHHHHcCCCCCCccccceeEEeccee--EEeecCCCCcccchHHHHHHHhhhhhcccCCCcceee Q lcl|NC_019411. 93 DA-IPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVI--ELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 93 d~-IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i--~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~~ 169 (173) .. .+.-++.-+|+.....+..+.... +..-.++..|+. +.+|..++..-..-+.-..+| ++ ++++.+.+. T Consensus 55 ~~~~~~~~~~V~~~~V~Ral~~~~~~~--G~tq~S~TaG~ys~S~t~~~p~G~lylt~~e~~~L-G~----~~~r~~~i~ 127 (132) T protein:vir:16 55 PSYFASVVKSVTVDIVARTLMTSTDQE--PMTQTTESALGYSVSGSYLVPGGGLFIKNSELSRL-GL----KKQRFGVID 127 (132) T ss_pred cccchhHHHHHHHHHHHHHhcCCCCCC--CceeeeeeccchheeeeeecCCCcceeChHHHHhh-CC----CCCceEEEe Confidence 22 344467788888887776542211 112246788988 566654432222223334444 21 345555444 Q ss_pred eecC Q lcl|NC_019411. 170 IIRH 173 (173) Q Consensus 170 ~~R~ 173 (173) +-=. T Consensus 128 ~~~~ 131 (132) T protein:vir:16 128 FYGN 131 (132) T ss_pred ecCC Confidence 4333 No 28 >protein:vir:93592 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:1879 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449297;genbank:gi:157166045;uniprot:Q6H9U4;genbank:GeneID:5580414 Probab=84.30 E-value=0.056 Score=27.46 Aligned_cols=108 Identities=18% Similarity=0.169 Sum_probs=62.9 Q ss_pred ccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccc Q lcl|NC_019411. 
16 ANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAI 95 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~I 95 (173) --.++|+++++++..-. + .-+|+..+..+..|.+|+.. |.+.+.+. ..+.++.+. ...+ T Consensus 1 mm~~vtLeevK~hLRId--~-----d~dD~li~~~i~aA~~~v~~---~l~~~~~~----------~~~~~~~~~-~~~~ 59 (108) T protein:vir:93 1 MTALLTLEEIKAHLRVD--H-----DADDDMLMDKVRQATAVLLA---YIQGSRDK----------VIREDGELI-PGEA 59 (108) T ss_pred CCcCCCHHHHHHHcCCC--C-----CcChHHHHHHHHHHHHHHHH---Hhcccccc----------ccccccccc-cccC Confidence 24468999999985432 1 23677777777788788764 23322211 122233333 2457 Q ss_pred hHHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCccee Q lcl|NC_019411. 96 PQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFK 168 (173) Q Consensus 96 P~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~ 168 (173) |..|+.|++.|.-....+-.. ++.. ..+.+..|..|..||.|| +.+... T Consensus 60 ~~~i~~AvLlLv~~~YenRe~------------~~~~------~~~~~elP~~v~~Ll~~~------R~p~~~ 108 (108) T protein:vir:93 60 LTRMKGAAMRLTGMLYRNPDL------------AERE------ELLQGELPFSVSVLIYDL------RCPTVL 108 (108) T ss_pred ChHHHHHHHHHHHHHHhcccc------------cccc------ccccccCCHHHHHHHHHc------cccccC Confidence 899999999999766554321 1110 112334566788888887 345444 No 29 >protein:vir:1993 Length: 141 # NCBI annotation: Hypothetical protein # Family: family:all:348 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050640;genbank:gi:9633527;genbank:GeneID:2636296 Probab=83.35 E-value=0.017 Score=30.32 Aligned_cols=119 Identities=18% Similarity=0.195 Sum_probs=63.4 Q ss_pred cccccHHHHHHHHHhccC---c--cccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeec Q lcl|NC_019411. 17 NSYCDVQFADDYIYANVY---A--NTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIP 91 (173) Q Consensus 17 nSY~tv~~aday~~~r~~---~--~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~ 91 (173) =+|+|.++..+.+..+-. 
+ .......+++..+++|..|+..||+- .+.| +.+| T Consensus 1 M~Y~T~~Dl~~~~ge~~l~~Lt~d~~~~g~~d~~~i~~Al~dA~~eIdgy---L~~R-------------------Y~lP 58 (141) T protein:vir:19 1 MNYATVNDLCARYTRTRLDILTRPKTADGQPDDAVAEQALADASAFIDGY---LAAR-------------------FVLP 58 (141) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcCCCCccccCHHHHHHHHHHHHHHHHHH---Hhhc-------------------ccCC Confidence 579999999887654322 1 11233567888899999999999983 2222 1244 Q ss_pred cccchHHHHHHHHHHHHHHHcCCCCCCccccc----e---eEEecceeEEeecCCCCcc------cc----hHHHHHHHh Q lcl|NC_019411. 92 SDAIPQQLMEATAEMAAALMNNDWTSPQTTRG----M---KEIQVDVIELKFDSEIQRG------SM----PDIVMSILE 154 (173) Q Consensus 92 ~d~IP~~V~~A~~elA~~~~~~~~~~~~~~~~----v---~~ekVG~i~veY~~~~~~~------~~----~~~v~~lL~ 154 (173) -..+|.-|+..||-+|.+.+.+...+...... + +...-|.+++--....... .. ......=++ T Consensus 59 l~~~P~~L~~~a~dIA~Y~L~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~~~~~~~~r~f~r~~~ 138 (141) T protein:vir:19 59 LTVVPSLLKRQCCVVAWFYLNESQPTEQITATYRDTVRWLEQVRDGKTDPGVESRTAASPEGEDLVQVQSDPPVFSRKQK 138 (141) T ss_pred ccccchHHHHHHHHHHHHHHhcCCCChHHHHHHHHHHHHHHHHhcCccccCCCCCCCCCCCCCceeEeecCCcccCcccc Confidence 56789999999999997666554322111111 1 2222255555322111100 00 000000023 Q ss_pred hhh Q lcl|NC_019411. 155 GLG 157 (173) Q Consensus 155 ~ll 157 (173) ||+ T Consensus 139 G~~ 141 (141) T protein:vir:19 139 GFI 141 (141) T ss_pred cCC Confidence 333 No 30 >protein:vir:5256 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852761;genbank:gi:31544036;uniprot:Q7Y5T9;genbank:GeneID:2753553 Probab=79.89 E-value=0.1 Score=26.06 Aligned_cols=106 Identities=11% Similarity=0.119 Sum_probs=60.3 Q ss_pred cccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccchHH Q lcl|NC_019411. 
19 YCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIPQQ 98 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP~~ 98 (173) -.|++++.+-+= .-...+|+..+..|-.|..+|+. -+| | .. T Consensus 1 m~t~~~Fr~~~P-------eF~~~pd~~i~~~l~~A~~~l~~-~~~-g------------------------------~~ 41 (119) T protein:vir:52 1 MPLTEDFLLRYT-------EFGKTDAKRIGLFLSDAQAEVSK-VQW-G------------------------------KL 41 (119) T ss_pred CCcHHHHHHhhh-------hccCCCHHHHHHHHHHHHHhhCC-cCC-c------------------------------hH Confidence 566776665542 22457899999999999999985 344 1 11 Q ss_pred HHHHHHHHHHHHHcC--CCCC--CccccceeEEecceeEEeecCCCCcccch---------HHHHHHHhhhhhcccCCCc Q lcl|NC_019411. 99 LMEATAEMAAALMNN--DWTS--PQTTRGMKEIQVDVIELKFDSEIQRGSMP---------DIVMSILEGLGVVKTGTRP 165 (173) Q Consensus 99 V~~A~~elA~~~~~~--~~~~--~~~~~~v~~ekVG~i~veY~~~~~~~~~~---------~~v~~lL~~ll~~~~g~~~ 165 (173) -.++.+.++.+++.= .... ....+.++++++|.|+|+|+........- .-.-+|++.+. .+ T Consensus 42 ~~~~~~L~~AH~l~l~~~~~~~~g~~~g~v~S~s~G~vSvS~~~~~~~~~~~~w~~~T~YG~~y~~L~r~~g------~G 115 (119) T protein:vir:52 42 YDRGVMALTAHLLKLSADAEISGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIG------VG 115 (119) T ss_pred HHHHHHHHHHHHHHhhhhhhccccccccceeeeeecceeeeeeccccCCcchhhhhcCHHHHHHHHHHHHhc------CC Confidence 233555666554431 1111 12345699999999999997544322111 22344555553 22 Q ss_pred ceee Q lcl|NC_019411. 166 AFKK 169 (173) Q Consensus 166 ~~~~ 169 (173) +++- T Consensus 116 g~Va 119 (119) T protein:vir:52 116 VMVA 119 (119) T ss_pred CcCC Confidence 2222 No 31 >protein:vir:80036 Length: 111 # NCBI annotation: gp10 # Family: family:all:3186 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468714;genbank:gi:157325294;genbank:GeneID:5601727 Probab=74.99 E-value=0.036 Score=28.51 Aligned_cols=109 Identities=12% Similarity=0.069 Sum_probs=56.4 Q ss_pred ccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccc Q lcl|NC_019411. 
16 ANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAI 95 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~I 95 (173) -|+= ++.... .+.. -++++|+..+.++-.| |++. |.+.||- T Consensus 1 m~tt--v~~vkl-~a~~------L~~~sDDsl~~~I~dA--~~e~------------~a~gFp~---------------- 41 (111) T protein:vir:80 1 MKTD--VSKLKL-TASS------LASVSDDSLQVHIDDS--YLEV------------QEKGFPE---------------- 41 (111) T ss_pred Cchh--HHHHHH-hhHh------hcCCChHHHHHHHHHH--HHHh------------hcCCCCh---------------- Confidence 1221 222221 1111 1256777777766555 3332 3444553 Q ss_pred hHHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcc--cchHHHHHHHhhhhhcccCCCcceeeee Q lcl|NC_019411. 96 PQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRG--SMPDIVMSILEGLGVVKTGTRPAFKKII 171 (173) Q Consensus 96 P~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~--~~~~~v~~lL~~ll~~~~g~~~~~~~~~ 171 (173) +--.+||--||++++.-+ .++|++||||.++-+|++.+... .+-+|-...+ -|+....|++.-...|+ T Consensus 42 -~~~e~a~rYLa~HLat~~------~~~v~sE~V~~Lk~~Y~~~~~~~~l~~s~wGq~Y~-rL~k~~~~gs~~~~vVv 111 (111) T protein:vir:80 42 -KFEERANRYLAAHLATLA------NKNVKSEAVGSLKREYYEVKGDSGLLSTEYGQEYA-RLLKEANGGSGISMVVV 111 (111) T ss_pred -hHHHHHHHHHHHHHHHhc------CCCCchhhhhhHHHHhhhcccccccccchhHHHHH-HHHHHhcCCccceeeeC Confidence 223567777888776553 56799999999999998655432 2334433333 22222233333333444 No 32 >protein:vir:79074 Length: 150 # NCBI annotation: gp10, conserved hypothetical protein # Family: family:all:348 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111210;genbank:gi:134288797;genbank:GeneID:4960748 Probab=74.58 E-value=0.13 Score=25.37 Aligned_cols=118 Identities=14% Similarity=0.201 Sum_probs=61.6 Q ss_pred cccccHHHHHHHHHhccC-----------ccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCccc Q lcl|NC_019411. 
17 NSYCDVQFADDYIYANVY-----------ANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDV 85 (173) Q Consensus 17 nSY~tv~~aday~~~r~~-----------~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~ 85 (173) =+|+|++++.+.+..+-. ........+++..+++|..|+..||+- .+.| T Consensus 1 M~Y~T~~Dl~~r~ge~~l~~Ltd~~~~~~~~~~~~~~d~~~i~~Al~dA~~~IDgy---L~~R----------------- 60 (150) T protein:vir:79 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESAVRQAEEIVDAH---LRGR----------------- 60 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccccccccccccCHHHHHHHHHHHHHHHHHH---Hhhh----------------- Confidence 689999999887653211 011223467788899999999999983 2222 Q ss_pred CCeeeccccchHHHHHHHHHHHHHHHcCC----CCCCc-ccc----ce---eEEecceeEEeecC----CCCcccch--- Q lcl|NC_019411. 86 DGFLIPSDAIPQQLMEATAEMAAALMNND----WTSPQ-TTR----GM---KEIQVDVIELKFDS----EIQRGSMP--- 146 (173) Q Consensus 86 dg~~~~~d~IP~~V~~A~~elA~~~~~~~----~~~~~-~~~----~v---~~ekVG~i~veY~~----~~~~~~~~--- 146 (173) +.+|-..+|..|+..||-+|.+.+-.. ...++ ... .+ +...=|.++.--.. +++++..+ T Consensus 61 --Y~lPl~~vP~~L~~~a~dIA~Y~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~v~~~ 138 (150) T protein:vir:79 61 --YNLPLSPVPTVIKDVTVNLARHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPSGPATPEPGEMKVRAR 138 (150) T ss_pred --ccCCcccccHHHHHHHHHHHHHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCccCCCCCCceeeecC Confidence 123445799999999999996554321 11111 111 11 11122555442211 11111100 Q ss_pred --HHHHHHHhhh Q lcl|NC_019411. 
147 --DIVMSILEGL 156 (173) Q Consensus 147 --~~v~~lL~~l 156 (173) .+=..-|++| T Consensus 139 ~r~f~r~~l~g~ 150 (150) T protein:vir:79 139 RRQFDADLLERF 150 (150) T ss_pred CCccChhhccCC Confidence 0112223333 No 33 >protein:vir:103846 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938245;genbank:gi:38229150;genbank:GeneID:2648161 Probab=73.01 E-value=0.18 Score=24.73 Aligned_cols=123 Identities=12% Similarity=0.128 Sum_probs=62.0 Q ss_pred cccccHHHHHHHHHhccC---c---cccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeee Q lcl|NC_019411. 17 NSYCDVQFADDYIYANVY---A---NTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLI 90 (173) Q Consensus 17 nSY~tv~~aday~~~r~~---~---~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~ 90 (173) =+|+|.++..+.+..+-. + .......+++..+++|..|+..||+- .+.| +.+ T Consensus 1 M~Y~T~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~~eIdgy---L~~R-------------------Y~l 58 (138) T protein:vir:10 1 MSYCTQADLVEQYGEASIRQLSDRVNKPATTIDPAVVAQAIADADAEIDLH---LHAR-------------------YQL 58 (138) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccCCCcCccCHHHHHHHHHHHHHHHHHH---Hhhc-------------------ccC Confidence 579999999987544321 0 11223568888999999999999983 2222 124 Q ss_pred ccccchHHHHHHHHHHHHHHHcCCCCCCc-cccc----e---eEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccC Q lcl|NC_019411. 91 PSDAIPQQLMEATAEMAAALMNNDWTSPQ-TTRG----M---KEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTG 162 (173) Q Consensus 91 ~~d~IP~~V~~A~~elA~~~~~~~~~~~~-~~~~----v---~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g 162 (173) |-..+|.-|+..||-+|.+.+.+...... .... + +...=|.++.--.......+...- ....+.. 
T Consensus 59 Pl~~vP~~L~~~a~dIA~Y~L~~~~~~~e~~~~rY~~Ai~~L~~Ia~G~~~Lg~~~~~~~~~~~~~-------~~~~s~~ 131 (138) T protein:vir:10 59 PLAQVPVVLKRVACVLAFANLHTQVKDDHPAILDAERKRKLLGGISSGKLSLALTSSGTPAPIANT-------VQISSQR 131 (138) T ss_pred CccccchHHHHHHHHHHHHHHhcCCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCcccCCCCCc-------eeeecCC Confidence 45679999999999999765554322111 1111 1 112225555432211111100000 0000111 Q ss_pred CCcceeeeecC Q lcl|NC_019411. 163 TRPAFKKIIRH 173 (173) Q Consensus 163 ~~~~~~~~~R~ 173 (173) +-|| |. T Consensus 132 r~Fg-----~d 137 (138) T protein:vir:10 132 NDFG-----GT 137 (138) T ss_pred ccCC-----CC Confidence 1111 11 No 34 >protein:vir:107864 Length: 150 # NCBI annotation: gp36 # Family: family:all:348 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024709;genbank:gi:48696946;genbank:GeneID:2845952 Probab=72.85 E-value=0.18 Score=24.70 Aligned_cols=118 Identities=14% Similarity=0.204 Sum_probs=61.5 Q ss_pred cccccHHHHHHHHHhccC---------c--cccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCccc Q lcl|NC_019411. 17 NSYCDVQFADDYIYANVY---------A--NTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDV 85 (173) Q Consensus 17 nSY~tv~~aday~~~r~~---------~--~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~ 85 (173) =+|+|++++.+.|..+-. + .......+++..+++|..|+..||+-+ +.| T Consensus 1 M~Y~T~~Dl~~r~ge~el~~Ltd~~~~g~~~~~~~~~d~~~i~~Al~dA~~eIDgYL---~~R----------------- 60 (150) T protein:vir:10 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESSVRQAEEIVDAHL---RGR----------------- 60 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccCccccchhhcCHHHHHHHHHHHHHHHHHHH---hhh----------------- Confidence 689999999887653211 0 011234677888999999999999832 222 Q ss_pred CCeeeccccchHHHHHHHHHHHHHHHcCC----CCCCc-cccc----e---eEEecceeEEeecCC----CCcccch--- Q lcl|NC_019411. 86 DGFLIPSDAIPQQLMEATAEMAAALMNND----WTSPQ-TTRG----M---KEIQVDVIELKFDSE----IQRGSMP--- 146 (173) Q Consensus 86 dg~~~~~d~IP~~V~~A~~elA~~~~~~~----~~~~~-~~~~----v---~~ekVG~i~veY~~~----~~~~~~~--- 146 (173) +.+|-..+|..|+..||-+|.+.+-.. ...++ .... 
+ +...=|.++..-... ++++..+ T Consensus 61 --Y~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~v~~~ 138 (150) T protein:vir:10 61 --YNLPLSPVPTVIKDVTVNLARHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPSGPATPEPGEMKVRAR 138 (150) T ss_pred --ccCCcccccHHHHHHHHHHHHHHHHhcccccCCCCHHHHHHHHHHHHHHHHHhcCcccCCCCCCCCCCCCceeeeecC Confidence 123445799999999999996554321 11111 1111 1 111225554432111 1111100 Q ss_pred --HHHHHHHhhh Q lcl|NC_019411. 147 --DIVMSILEGL 156 (173) Q Consensus 147 --~~v~~lL~~l 156 (173) .+=..-|++| T Consensus 139 ~r~f~r~~l~gf 150 (150) T protein:vir:10 139 RRQFDADLLERF 150 (150) T ss_pred CCccChhhccCC Confidence 0112233333 No 35 >protein:vir:99570 Length: 153 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039799;genbank:gi:126011049;genbank:GeneID:4818265 Probab=72.13 E-value=0.19 Score=24.58 Aligned_cols=128 Identities=13% Similarity=0.063 Sum_probs=64.8 Q ss_pred eEEEEeCCCCCCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCC Q lcl|NC_019411. 3 FTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGV 82 (173) Q Consensus 3 M~live~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv 82 (173) |+.+|=| ++.+.+ +.--...-+..+|+..+..|-.|.-+|+.. + |+.... T Consensus 1 m~~~~fd------------~~~Fr~----~fPeFad~~~~Pd~~i~~~l~~A~~~l~~~-~-------------~~~~~~ 50 (153) T protein:vir:99 1 MADPVYN------------DGLFRI----MYPEFADQEKYPPEVIEIYYDTATLFITGS-M-------------FPCAAL 50 (153) T ss_pred CCcccCC------------hHHHHH----hcccccCccccCHHHHHHHHHHHHHhhcCc-c-------------cccccc Confidence 4444433 333332 211222233568999999999999999852 1 232111 Q ss_pred cccCCeeeccccchHHHHHHHHHHHHHHHc-------C-CCCCCccccceeEEecceeEEeecCCCCcccc--------h Q lcl|NC_019411. 
83 YDVDGFLIPSDAIPQQLMEATAEMAAALMN-------N-DWTSPQTTRGMKEIQVDVIELKFDSEIQRGSM--------P 146 (173) Q Consensus 83 ~~~dg~~~~~d~IP~~V~~A~~elA~~~~~-------~-~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~--------~ 146 (173) -++..+++.+.++.+++. + ........+.++++++|.|||.|+.+...... + T Consensus 51 ------------~g~~~~~~l~Ll~AH~l~L~~~~~~~~~~a~~~~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~Y 118 (153) T protein:vir:99 51 ------------SGKQLVGALNMLTAHLMSLSMQRSQTALGATNDQGGYTLSATIGEVSVSKMAPPAKDGWEFWLAQTPY 118 (153) T ss_pred ------------ChHHHHHHHHHHHHHHHHHHhhhhcccccCCCccccceeeeeecceeeeeecCCCCCchhHhhhcCHH Confidence 135566777777766542 1 11122334568999999999999755433221 1 Q ss_pred -HHHHHHHhhhhhc--ccCCCcceeeeecC Q lcl|NC_019411. 147 -DIVMSILEGLGVV--KTGTRPAFKKIIRH 173 (173) Q Consensus 147 -~~v~~lL~~ll~~--~~g~~~~~~~~~R~ 173 (173) .-.-+|++.+... -.|+.+ --.-+|. T Consensus 119 Gq~fw~l~~~~~~Gg~v~gg~p-e~~~~r~ 147 (153) T protein:vir:99 119 GQALWALLKMLSVGGFAIGGLP-ERTGFRK 147 (153) T ss_pred HHHHHHHHHHhcccccccCCCC-ccccccc Confidence 1233455554320 011111 1222344 No 36 >protein:vir:10365 Length: 115 # NCBI annotation: conserved phage protein # Family: family:all:363 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858957;genbank:gi:32128422;genbank:GeneID:2648387 Probab=71.21 E-value=0.2 Score=24.43 Aligned_cols=114 Identities=9% Similarity=-0.002 Sum_probs=63.2 Q ss_pred cccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCcccc-ccCCcCCCcccCCeeeccccchH Q lcl|NC_019411. 19 YCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSG-LRWPRTGVYDVDGFLIPSDAIPQ 97 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~-lawPR~gv~~~dg~~~~~d~IP~ 97 (173) =+|+++++.+. |.-.. =+..+|+-.+..+-.|.+|+.. |.|++.-..|. +..+..+.. . ......||. 
T Consensus 1 mvtLe~~K~hL--Rid~~--d~d~dD~li~~~i~AA~~~v~~---~~~r~l~~~~~~~~~~~~~~~-~---~~~~~~~p~ 69 (115) T protein:vir:10 1 MITLAMVQRHL--QAELY--EDDERDYVMQQLLPAARESAEL---FINRKLYDTQADMLADQAAGV-D---PAGQLLITR 69 (115) T ss_pred CCCHHHHHHHc--CCCCC--CCchhhHHHHHHHHHHHHHHHH---HhCCccccccccccccccccc-C---CcccccCCh Confidence 89999999885 42110 0123566677777788888874 56666543221 222222111 0 011224899 Q ss_pred HHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcceee Q lcl|NC_019411. 98 QLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 98 ~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~~ 169 (173) .|+.|...+.-+...+-.. ..+| +....+..++.||.||=. -+| +| T Consensus 70 ~i~~AiLLlvg~~Y~nRe~----------~~~~----------~~~elP~~v~~LL~pyR~-----~~g-v~ 115 (115) T protein:vir:10 70 TVEQAILLTVGEWYANREQ----------VWVK----------GVGLVTSSAQNLLHPYRK-----FAG-VR 115 (115) T ss_pred HHHHHHHHHHHHHHhcchh----------cccc----------hhhhcCHHHHHHHHHHHh-----cCC-CC Confidence 9999999999665553210 0111 112345568999999842 222 22 No 37 >protein:vir:486 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543093;swissprot:trembl:q8w626;genbank:gi:18249905;uniprot:Q8W626;genbank:GeneID:929692 Probab=70.95 E-value=0.2 Score=24.39 Aligned_cols=104 Identities=8% Similarity=0.015 Sum_probs=61.3 Q ss_pred ccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCcccc---ccCCcCCCcccCCeeecccc Q lcl|NC_019411. 18 SYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSG---LRWPRTGVYDVDGFLIPSDA 94 (173) Q Consensus 18 SY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~---lawPR~gv~~~dg~~~~~d~ 94 (173) =++|+++++.|..-. +. . .-+|+-.+..+..|.+||.. |.|++....+. ..||.. -. 
T Consensus 1 M~vtL~e~K~hLRid--~D--~-~ddD~li~~~i~aA~~~i~~---~~~r~l~~~~~~~~~~~~~~------------~~ 60 (107) T protein:vir:48 1 MLLKEEEIKSHLRLD--DG--L-YSDGDFLKLLAQAVQKRTET---YLNRKLYAPEETIPEDDPDG------------MH 60 (107) T ss_pred CCCCHHHHHHHcCCC--CC--C-chhHHHHHHHHHHHHHHHHH---HhccccccccccccccCccc------------cc Confidence 688999999995432 11 1 12556677777788888874 56776544332 233321 24 Q ss_pred chHHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcce Q lcl|NC_019411. 95 IPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAF 167 (173) Q Consensus 95 IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~ 167 (173) ||..++.|+..|+-....+-. .+. ..+....+..++.||.||=. -+. T Consensus 61 ~~~~ik~Avlllv~~~Y~NRe------------~v~--------~~~~~~iP~~v~~LL~~yR~------~~l 107 (107) T protein:vir:48 61 LTDDVRLAMLMLVSHFYENRS------------TIT--------DVEKLETPMSFRWLAGPYRI------VPL 107 (107) T ss_pred cchhHHHHHHHHHHHHHhhhh------------hhc--------cccccccCHHHHHHHHHhhc------cCC Confidence 789999999998866554321 110 01112344568889988832 222 No 38 >protein:vir:99848 Length: 172 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164077;genbank:gi:56692609;genbank:GeneID:3192576 Probab=69.80 E-value=0.081 Score=26.56 Aligned_cols=134 Identities=14% Similarity=0.093 Sum_probs=58.6 Q ss_pred CeeEEEEeCCCCC----------CCccccccHHHHHHHHHhccCccccc-------cCCCHHHHHHHHHHHHHHHhhhcc Q lcl|NC_019411. 1 MAFTFVVETGAGD----------PAANSYCDVQFADDYIYANVYANTAW-------DALDQDGKERFLVRASKYLDRSIA 63 (173) Q Consensus 1 m~M~live~g~g~----------~~AnSY~tv~~aday~~~r~~~~~~w-------~~~~~~~~e~aL~~As~~id~~~~ 63 (173) |+|=.+++|=.-. -.-+.|.+-+++.+.+-. +.-...| ...++...+.+|..|+..||+-+. 
T Consensus 1 ~~mYaT~~dl~~r~g~~el~qLt~~~~~~~~~~~l~d~~~~-~~~~~~~~~~~~~~g~~d~~~i~~Al~dA~aeIDgYL~ 79 (172) T protein:vir:99 1 MAVYITLPELAERPGAEELSQAATPRPLQAVDSELLDALLR-GLPVDRWTPEEIEVGHATVEVINSAVSDAQGYIDGFLQ 79 (172) T ss_pred CcccccHHHHHhhcCHHHHHHHhccccccCCHHHHHHHhhc-chhhhhcccccccccccCHHHHHHHHHHHHHHHHHHHh Confidence 6664333331100 011222222222221111 1111222 245778899999999999998432 Q ss_pred cccccCCccccccCCcCCCcccCCeeeccccchHHHHHHHHHHHHHHHcCC----CCCCc-cccc----e---eEEecce Q lcl|NC_019411. 64 WAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIPQQLMEATAEMAAALMNND----WTSPQ-TTRG----M---KEIQVDV 131 (173) Q Consensus 64 f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP~~V~~A~~elA~~~~~~~----~~~~~-~~~~----v---~~ekVG~ 131 (173) |+ ++.+|-..+|.-|+..||-+|.+.+... ...++ ..+. + +...-|. T Consensus 80 --~R-------------------~Y~lPL~~vP~~L~~~a~dIArY~L~~~r~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk 138 (172) T protein:vir:99 80 --RR-------------------GYSLPLAKRYGVVTGWTRAIARYLLHQDRLGPGAEKDPIVRDYRDALKFLQLIAEGK 138 (172) T ss_pred --cc-------------------cccCCCcccchHHHHHHHHHHHHHHHhccCCcccCCHHHHHHHHHHHHHHHHHhcCc Confidence 11 1234556799999999999997544421 11111 1111 1 2222255 Q ss_pred eEEeecCCC----Ccccch-----HHHHHHHhhh Q lcl|NC_019411. 132 IELKFDSEI----QRGSMP-----DIVMSILEGL 156 (173) Q Consensus 132 i~veY~~~~----~~~~~~-----~~v~~lL~~l 156 (173) ++.--.... +++..+ .+=..-|++| T Consensus 139 ~~Lg~~~~~~~~~~~~~~v~~~~r~F~rd~L~gf 172 (172) T protein:vir:99 139 FSLGPDDPLTPPGGGVPQVLAPARTFSHDTLKDY 172 (172) T ss_pred cccCCCCCCCCCCCCceeeecCCCccChhhccCC Confidence 444211111 111100 1112233333 No 39 >protein:vir:79640 Length: 134 # NCBI annotation: gp38 # Family: family:all:5122 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285527;genbank:gi:148734510;genbank:GeneID:5219998 Probab=66.01 E-value=0.27 Score=23.66 Aligned_cols=117 Identities=14% Similarity=0.111 Sum_probs=61.4 Q ss_pred ccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccc Q lcl|NC_019411. 
16 ANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAI 95 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~I 95 (173) -|==.+|+.+ ..+ | ..-...+|+..+..+-.|..||+.. .| T Consensus 1 m~d~~~ve~F----r~l-~--PeF~~vpde~l~~~~~~A~~~i~~~-~~------------------------------- 41 (134) T protein:vir:79 1 MNDIEILEQI----YKI-A--PAFKKVDPELIQAWIELAKDFVCEK-HF------------------------------- 41 (134) T ss_pred CchHHHHHHH----HHh-c--cccccCCHHHHHHHHHHhhhhhcCC-CC------------------------------- Confidence 1221223333 233 1 2335678899998888898888642 11 Q ss_pred hHHHHHHHHHHHHHHHcC------CCCCCc-cccceeE-EecceeEEeecCCCCcccc-----hHHHHHHHhhhhhcccC Q lcl|NC_019411. 96 PQQLMEATAEMAAALMNN------DWTSPQ-TTRGMKE-IQVDVIELKFDSEIQRGSM-----PDIVMSILEGLGVVKTG 162 (173) Q Consensus 96 P~~V~~A~~elA~~~~~~------~~~~~~-~~~~v~~-ekVG~i~veY~~~~~~~~~-----~~~v~~lL~~ll~~~~g 162 (173) .+....|...++++++.- +..... ..++|.+ ...|+++|+|+..+..+.. -|+= +++.-|.. ..+ T Consensus 42 g~~~~~al~lltAHLl~l~~~~~~~g~~~~~~~grv~ssst~G~vSvS~a~ps~~~~~~Wl~~TpYG-q~y~~L~k-~~~ 119 (134) T protein:vir:79 42 KDKYFRAVALYTLHLMTLDGAMKQESESVESYSHRIASFSLTGEFSQTFSKVSDDTSGNTLRQTPWG-KMYEVLNK-KKG 119 (134) T ss_pred ChHHHHHHHHHHHHHHhhcccccccccccccccchhhhhhhhcceeeeccCcccchhHHHHhcCHHH-HHHHHHHH-hhc Confidence 134566777777776642 222221 2234544 5689999999865543321 1221 23323332 445 Q ss_pred CCcceeeeecC Q lcl|NC_019411. 
163 TRPAFKKIIRH 173 (173) Q Consensus 163 ~~~~~~~~~R~ 173 (173) +|+|..--.|+ T Consensus 120 GGf~~~t~~~~ 130 (134) T protein:vir:79 120 GGFGLTTAFHR 130 (134) T ss_pred cchHhhhhccc Confidence 66665544444 No 40 >protein:vir:99222 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950460;genbank:gi:119953661;genbank:GeneID:4643082 Probab=63.60 E-value=0.046 Score=27.93 Aligned_cols=123 Identities=14% Similarity=0.195 Sum_probs=61.4 Q ss_pred cccccHHHHHHHHHhccC---c---cccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeee Q lcl|NC_019411. 17 NSYCDVQFADDYIYANVY---A---NTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLI 90 (173) Q Consensus 17 nSY~tv~~aday~~~r~~---~---~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~ 90 (173) =+|+|.+++.+.+..+-. + .......+++-.+++|..|+..||+-+ +.| +.+ T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL---~~R-------------------Y~l 58 (138) T protein:vir:99 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHL---HGR-------------------YQL 58 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHH---hhc-------------------ccC Confidence 579999999876544321 0 112235678889999999999999832 222 124 Q ss_pred ccccchHHHHHHHHHHHHHHHcCCCCCCc-cccc----e---eEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccC Q lcl|NC_019411. 91 PSDAIPQQLMEATAEMAAALMNNDWTSPQ-TTRG----M---KEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTG 162 (173) Q Consensus 91 ~~d~IP~~V~~A~~elA~~~~~~~~~~~~-~~~~----v---~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g 162 (173) |-..+|..|+..||-+|.+.+.+...... .... + +...-|.++.--...........- ... 
.++ T Consensus 59 Pl~~vP~~L~~~a~dIA~Y~L~~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~-------~~~-~~~ 130 (138) T protein:vir:99 59 PLASVPTALKRIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANT-------VQI-SEG 130 (138) T ss_pred CccccchHHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcCCCCCCc-------eee-ecC Confidence 45679999999999999766554332221 1111 1 111224444422111100000000 000 000 Q ss_pred CCcceeeeecC Q lcl|NC_019411. 163 TRPAFKKIIRH 173 (173) Q Consensus 163 ~~~~~~~~~R~ 173 (173) . ..+.|. T Consensus 131 ~----r~F~Rd 137 (138) T protein:vir:99 131 R----NDWGAD 137 (138) T ss_pred C----CCCCCC Confidence 0 001122 No 41 >protein:vir:79253 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469165;genbank:gi:157835007;genbank:GeneID:5648834 Probab=63.60 E-value=0.046 Score=27.93 Aligned_cols=123 Identities=14% Similarity=0.195 Sum_probs=61.4 Q ss_pred cccccHHHHHHHHHhccC---c---cccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeee Q lcl|NC_019411. 17 NSYCDVQFADDYIYANVY---A---NTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLI 90 (173) Q Consensus 17 nSY~tv~~aday~~~r~~---~---~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~ 90 (173) =+|+|.+++.+.+..+-. + .......+++-.+++|..|+..||+-+ +.| +.+ T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL---~~R-------------------Y~l 58 (138) T protein:vir:79 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHL---HGR-------------------YQL 58 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHH---hhc-------------------ccC Confidence 579999999876544321 0 112235678889999999999999832 222 124 Q ss_pred ccccchHHHHHHHHHHHHHHHcCCCCCCc-cccc----e---eEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccC Q lcl|NC_019411. 
91 PSDAIPQQLMEATAEMAAALMNNDWTSPQ-TTRG----M---KEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTG 162 (173) Q Consensus 91 ~~d~IP~~V~~A~~elA~~~~~~~~~~~~-~~~~----v---~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g 162 (173) |-..+|..|+..||-+|.+.+.+...... .... + +...-|.++.--...........- ... .++ T Consensus 59 Pl~~vP~~L~~~a~dIA~Y~L~~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~-------~~~-~~~ 130 (138) T protein:vir:79 59 PLASVPTALKRIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANT-------VQI-SEG 130 (138) T ss_pred CccccchHHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcCCCCCCc-------eee-ecC Confidence 45679999999999999766554332221 1111 1 111224444422111100000000 000 000 Q ss_pred CCcceeeeecC Q lcl|NC_019411. 163 TRPAFKKIIRH 173 (173) Q Consensus 163 ~~~~~~~~~R~ 173 (173) . ..+.|. T Consensus 131 ~----r~F~Rd 137 (138) T protein:vir:79 131 R----NDWGAD 137 (138) T ss_pred C----CCCCCC Confidence 0 001122 No 42 >protein:vir:107702 Length: 136 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003900;genbank:gi:45686316;genbank:GeneID:2773042 Probab=60.72 E-value=0.37 Score=22.97 Aligned_cols=120 Identities=13% Similarity=0.060 Sum_probs=64.1 Q ss_pred CCccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccc Q lcl|NC_019411. 14 PAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSD 93 (173) Q Consensus 14 ~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d 93 (173) -+-...+++ -+-|..+ | ..-+..+|+..+..+--|..||... 
.| T Consensus 1 ~~~~~~~~~---ve~fR~l-~--PeF~dvPde~i~~~~d~A~~~v~~~-~~----------------------------- 44 (136) T protein:vir:10 1 MNQETLIAV---VEQMRKL-V--PALRKVPDETLYAWVEMAELFVCQK-TF----------------------------- 44 (136) T ss_pred CCchHHHHH---HHHHHHh-c--cccccCCHHHHHHHHHHHHHhhcCC-CC----------------------------- Confidence 011122222 2223333 2 2345668888888888888888641 11 Q ss_pred cchHHHHHHHHHHHHHHHcCCCC------CC-ccccceeE-EecceeEEeecCCCCcccc-----hHHHHHHHhhhhhcc Q lcl|NC_019411. 94 AIPQQLMEATAEMAAALMNNDWT------SP-QTTRGMKE-IQVDVIELKFDSEIQRGSM-----PDIVMSILEGLGVVK 160 (173) Q Consensus 94 ~IP~~V~~A~~elA~~~~~~~~~------~~-~~~~~v~~-ekVG~i~veY~~~~~~~~~-----~~~v~~lL~~ll~~~ 160 (173) .+...+|...++++++.-+.. .. ...++|++ ..+|+++|+|+..+.++.. -|+ =+++.-|+. . T Consensus 45 --Gk~y~~al~lltAHLl~l~~~~~~~~~~~~~~s~rv~ssat~GevSVS~a~~s~~~s~~WL~~Tpy-Gq~y~aL~k-~ 120 (136) T protein:vir:10 45 --KDAYVKALALYALHLAFLDGALKGEDEDLESYSRRVTSFSLSGEFSQTFGEVTKNQSGDMMLSTPW-GKMFEQLKA-R 120 (136) T ss_pred --hhHHHHHHHHHHHHHHhcccccccccccccccccceehheeccceeEeeccccCchhhHhhhcCHH-HHHHHHHHh-h Confidence 234667777788877733221 11 22344555 6689999999865544431 122 223334554 3 Q ss_pred cCCCcceeeeecC Q lcl|NC_019411. 161 TGTRPAFKKIIRH 173 (173) Q Consensus 161 ~g~~~~~~~~~R~ 173 (173) .|+||+..-=++. T Consensus 121 ~~gGf~l~t~~~~ 133 (136) T protein:vir:10 121 RRGRFALMTGLRG 133 (136) T ss_pred cccchhhhhcccc Confidence 5667766655554 No 43 >protein:vir:4512 Length: 107 # NCBI annotation: unknown # Family: family:all:363 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599038;genbank:gi:19548996;genbank:GeneID:935218 Probab=58.59 E-value=0.41 Score=22.71 Aligned_cols=107 Identities=11% Similarity=0.102 Sum_probs=61.6 Q ss_pred ccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccchH Q lcl|NC_019411. 
18 SYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIPQ 97 (173) Q Consensus 18 SY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP~ 97 (173) =++|+++++.|.... +. . .-+|+-.+..+..|..||.. |.|++....+.. ||-.. .+| -.||+ T Consensus 1 M~vtL~e~K~hLRId--~D--~-~ddD~lI~~~i~AA~~~i~~---~~~r~~~~~~~~-~~~~~---~~~-----~~~~~ 63 (107) T protein:vir:45 1 MLLKMEEIKLQLRLD--DD--F-SDEDELLELLGKAAQSRTEN---FLNRKLYATADD-RPADD---PDG-----LVISD 63 (107) T ss_pred CCCCHHHHHHHcCCC--CC--C-chhHHHHHHHHHHHHHHHHH---Hhcccccccccc-ccccc---ccc-----ccCCh Confidence 689999999995432 11 1 12455577777788889874 678776554433 44321 121 23689 Q ss_pred HHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcce Q lcl|NC_019411. 98 QLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAF 167 (173) Q Consensus 98 ~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~ 167 (173) .++.|+..+.-....+-.. +. ..+....+..++.||.||=. -+. T Consensus 64 ~~~~AvLllv~~~Y~NRe~------------~~--------~~~~~~lp~~v~~Ll~~~R~------~~~ 107 (107) T protein:vir:45 64 DVKLALLLLVSHFYENRST------------VT--------DVEKMELPMSFNWLVAPYRL------IPL 107 (107) T ss_pred hHHHHHHHHHHHHHhhhhh------------cc--------ccchhccchHHHHHHHHHhh------cCC Confidence 9999999888655443211 10 00111245568889988722 111 No 44 >protein:vir:102961 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:26777 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945287;genbank:gi:39653722;uniprot:Q708M5;genbank:GeneID:2672875 Probab=55.53 E-value=0.44 Score=22.53 Aligned_cols=118 Identities=17% Similarity=0.132 Sum_probs=64.8 Q ss_pred cHHHHHH----HHHh--ccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeecccc Q lcl|NC_019411. 21 DVQFADD----YIYA--NVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDA 94 (173) Q Consensus 21 tv~~ada----y~~~--r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~ 94 (173) -.++++. |... ++..... 
+..|+.-.+-+|.++-.+|=..++ -.+ T Consensus 1 ~~~~lkq~~~~~~~~~~l~~~~d~-~~kD~~vl~faie~v~~~IlnycN----------------------------ike 51 (131) T protein:vir:10 1 MIQELKQDNTMYLISCVRKMRQDN-YFKDMEVLHYALTQAENEILNYIH----------------------------QDS 51 (131) T ss_pred Chhhhhhhhhhhhhhhhhcccccc-ccchHHHHHHHHHHHHHHHhhhcC----------------------------Ccc Confidence 3444443 2221 1111100 011333456667777666543221 136 Q ss_pred chHHHHHHHHHHHHHHHcCCCCCC-------ccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcce Q lcl|NC_019411. 95 IPQQLMEATAEMAAALMNNDWTSP-------QTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAF 167 (173) Q Consensus 95 IP~~V~~A~~elA~~~~~~~~~~~-------~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~ 167 (173) ||.++++-....|.-++.++...+ ...+.|++.|-|.-+|+|..++..-.+...+..+|..|-. .=-.| T Consensus 52 iP~~Le~v~~~maiDll~~e~~~~~k~~~i~~~~g~VsSI~eGDTsIsf~s~t~~~qrl~~~~s~l~~Y~~----qL~~y 127 (131) T protein:vir:10 52 VPGRLENVWIDMTNDLLDKVKEQSVLAEKAGADDFSVKSIKMGDTTIEKVSPYEMIQRMKQVPSSLERYKR----QLNRF 127 (131) T ss_pred cchhhHHHHHHHHHHHHhhhcccccccccccccccceeeeeecceeeeccCCccHHHHHHHHHHHHhhhHH----HHhhh Confidence 899999999999998888775433 2345699999999999997666544444444455544421 11123 Q ss_pred eeee Q lcl|NC_019411. 168 KKII 171 (173) Q Consensus 168 ~~~~ 171 (173) -|++ T Consensus 128 RRL~ 131 (131) T protein:vir:10 128 RKLL 131 (131) T ss_pred cccC Confidence 3444 No 45 >protein:vir:81069 Length: 115 # NCBI annotation: p10 # Family: family:all:363 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285680;genbank:gi:148727188;genbank:GeneID:5247114 Probab=55.37 E-value=0.48 Score=22.33 Aligned_cols=113 Identities=10% Similarity=0.005 Sum_probs=60.7 Q ss_pred cccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccc-cCCcCCC-cccCCeeeccccch Q lcl|NC_019411. 19 YCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGL-RWPRTGV-YDVDGFLIPSDAIP 96 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~l-awPR~gv-~~~dg~~~~~d~IP 96 (173) -+|+++++++.... .. 
-+..||.-.+..+-.|.+++. +|.|++.-.+|.. .++.... .+.+|. .|| T Consensus 1 ivtLee~K~HlRid--~d--d~deDD~li~~~i~AA~~~v~---~~l~r~l~~~~~~~~~~~~~~~~~~~~~-----~~p 68 (115) T protein:vir:81 1 MITLAMVQRHLQAE--LY--EDDERDYVMQQLLPAARESAE---LFINRKLYDTQADMLADQAAGVDPAGQL-----LIT 68 (115) T ss_pred CCCHHHHHHHcCCC--CC--CCccchHHHHHHHHHHHHHHH---HHhCCccccccccccccccccCCCCccc-----ccC Confidence 89999999885322 11 012345556666666666665 3566665443332 2222211 111211 378 Q ss_pred HHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcceee Q lcl|NC_019411. 97 QQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 97 ~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~~ 169 (173) +.|+.|+..+.-.+..+- |.|. .++....+..++.||.||=. -+| +| T Consensus 69 ~~i~~AiLllvg~~Y~NR------------E~v~--------~~~~~elP~~~~~LL~pyR~-----~~g-~~ 115 (115) T protein:vir:81 69 RTVEQAILLTLGEWYSSR------------EQVW--------TKGAGLVTSSAQNLLHPYRK-----FAG-VR 115 (115) T ss_pred HHHHHHHHHHHHHHHhcc------------chhc--------chhhhhcCHHHHHHHHHHHh-----hcC-CC Confidence 999999999986655542 1110 00122345678999999843 222 22 No 46 >protein:vir:96108 Length: 155 # NCBI annotation: hypothetical protein ORF025 # Family: family:all:664 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294442;genbank:gi:149408339;genbank:GeneID:5237227 Probab=53.14 E-value=0.54 Score=22.07 Aligned_cols=126 Identities=9% Similarity=-0.044 Sum_probs=60.3 Q ss_pred CCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeecc Q lcl|NC_019411. 13 DPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPS 92 (173) Q Consensus 13 ~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~ 92 (173) .+.+ +++. 
|- ++.--...-+..+|+..+..|-.|.-+|+.+ +.+.+ .|- T Consensus 1 ~v~f----d~~~---FR-~~fPeFad~~~~Pd~~i~~~l~~A~~~l~~~--~~~s~---------~~~------------ 49 (155) T protein:vir:96 1 MVIF----DEQK---FR-TLFPEFADPASYPAVRLQLYFDIACEFISDR--DSPYR---------ILN------------ 49 (155) T ss_pred Cccc----CHHH---HH-HhCccccCcccCCHHHHHHHHHHHHHhhcCC--Ccccc---------ccC------------ Confidence 2222 1222 32 2322222234568999999999999999742 11110 111 Q ss_pred ccchHHHHHHHHHHHHHHHc-------CCC-----CCCccccceeEEecceeEEeecCCCCcccc--------h-HHHHH Q lcl|NC_019411. 93 DAIPQQLMEATAEMAAALMN-------NDW-----TSPQTTRGMKEIQVDVIELKFDSEIQRGSM--------P-DIVMS 151 (173) Q Consensus 93 d~IP~~V~~A~~elA~~~~~-------~~~-----~~~~~~~~v~~ekVG~i~veY~~~~~~~~~--------~-~~v~~ 151 (173) ...-+++.+.++.+++. +.. ......+.++++++|+|||.|+.+...... + .-.-+ T Consensus 50 ---g~~~~~~l~Ll~AH~l~L~~~~~~gaa~~g~~~~g~~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~f~~ 126 (155) T protein:vir:96 50 ---GKALEACLYLLTAHLLSLSTMQVQGAAGGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWA 126 (155) T ss_pred ---hHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccccceeeceecceeeeeecCCCCCchhHHhhcCHHHHHHHH Confidence 23345555666655442 111 111234568999999999999865432221 1 12344 Q ss_pred HHhhhhhc--ccCCCcceeeeecC Q lcl|NC_019411. 152 ILEGLGVV--KTGTRPAFKKIIRH 173 (173) Q Consensus 152 lL~~ll~~--~~g~~~~~~~~~R~ 173 (173) |++.+... -.| |-..-.-+|. T Consensus 127 l~~~~~~Gg~~vg-G~per~~~r~ 149 (155) T protein:vir:96 127 LLSVKAVGGFYIG-GLPERRGFRK 149 (155) T ss_pred HHHHhcccccccC-CCCccccccc Confidence 55554320 001 1112233444 No 47 >protein:vir:8104 Length: 170 # NCBI annotation: gp8 # Family: family:all:3238 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817685;genbank:gi:29566116;genbank:GeneID:1259310 Probab=50.13 E-value=0.62 Score=21.73 Aligned_cols=116 Identities=13% Similarity=0.133 Sum_probs=65.0 Q ss_pred hccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCc---cc--cccC--------Cc---CCC--cccCCee--- Q lcl|NC_019411. 
31 ANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDE---DS--GLRW--------PR---TGV--YDVDGFL--- 89 (173) Q Consensus 31 ~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~---~Q--~law--------PR---~gv--~~~dg~~--- 89 (173) .||+ -+++.+.+.+|..|+.-+.+ |+|.+..| ++ .+.+ |- ..+ ...||.. T Consensus 1 ~~~~------~a~~~~~q~~l~aA~a~vR~---~cGwhv~P~v~d~t~~ldg~G~~vl~LPt~pvvsV~sV~~~G~~l~~ 71 (170) T protein:vir:81 1 MRGQ------FADNTEAQAAIDAVLAAARR---WCGWHVSPVIIDDVMEVDGPGGRVLSLPTLNLVSVKSVVELGYALDV 71 (170) T ss_pred Cccc------ccCchHHHHHHHHHHHHHHH---HhCCcccceecccEEEEeCCCCeeEECCCCcceeeEEEEECCeeecC Confidence 6665 34677777788888888775 34444321 22 1111 11 000 0122222 Q ss_pred ---------------------------------eccccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEee Q lcl|NC_019411. 90 ---------------------------------IPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKF 136 (173) Q Consensus 90 ---------------------------------~~~d~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY 136 (173) +.++++|..+..-.|.+|-++..+ .....+++.+++.++.+| T Consensus 72 ~~~~~~~~~glL~r~~G~~~~~~~~V~VT~tHGy~~~~apd~~~~vi~~~a~r~~~s-----~~~~~l~~~~~~~vs~~~ 146 (170) T protein:vir:81 72 STLDRSRRKGTLTKPYGRWTARDGAIVVTATHGFTETEAADWRRAVVQLVGRRAQTS-----RPSADLKRKKVDDVEYEW 146 (170) T ss_pred ccceeecCCceEEecCCccccccceEEEEEEeCCCCCccchHHHHHHHHHHHHhhcc-----CCcccceeeeccceeeee Confidence 344578998888888888765442 223357899999999999 Q ss_pred cCCCCcccchHHHHHHHhhhhhcccCCCc Q lcl|NC_019411. 137 DSEIQRGSMPDIVMSILEGLGVVKTGTRP 165 (173) Q Consensus 137 ~~~~~~~~~~~~v~~lL~~ll~~~~g~~~ 165 (173) .... . 
+.-+.-..+|.+|= .|..+ T Consensus 147 ~~~~-~-s~~~~~~~iL~~Yr---l~~~p 170 (170) T protein:vir:81 147 FETA-V-SVDAELSAVFSPFR---ILPSP 170 (170) T ss_pred cccc-c-ccCHHHHHhhhhcc---cCCCC Confidence 6322 1 12233455787773 34555 No 48 >protein:vir:103283 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277463;genbank:gi:71834105;genbank:GeneID:3562394 Probab=49.81 E-value=0.63 Score=21.69 Aligned_cols=109 Identities=11% Similarity=0.052 Sum_probs=56.1 Q ss_pred hccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccchHHHHHHHHHHHHHH Q lcl|NC_019411. 31 ANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIPQQLMEATAEMAAAL 110 (173) Q Consensus 31 ~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP~~V~~A~~elA~~~ 110 (173) .|.. +.+-...+|+..++-+--|..||=.. .-=++...|.-.+|+++ T Consensus 1 mR~l-~P~f~~vpdevi~~wid~A~lFVC~~--------------------------------~fg~~~~~Al~lytlHL 47 (125) T protein:vir:10 1 MRTL-YPPLKSQPDDVLNAWIEVAKLFICLD--------------------------------KFGDKQVQALAFYTLHL 47 (125) T ss_pred Cccc-cchhhccCHHHHHHHHHHHHHHHHHh--------------------------------hhhhHHHHHHHHHHHHH Confidence 3432 22233446666665555555555210 01133455666666666 Q ss_pred HcCCCC-------CCccccceeEEec-ceeEEeecCCCCcccc----hHHHHHHHhhhhhcccCCCcceeeeecC Q lcl|NC_019411. 111 MNNDWT-------SPQTTRGMKEIQV-DVIELKFDSEIQRGSM----PDIVMSILEGLGVVKTGTRPAFKKIIRH 173 (173) Q Consensus 111 ~~~~~~-------~~~~~~~v~~ekV-G~i~veY~~~~~~~~~----~~~v~~lL~~ll~~~~g~~~~~~~~~R~ 173 (173) +.-+.. ..+..+++++-+. |+++++|+..+..+.- -...-.|+.-|+. +.|+|++..--.+. 
T Consensus 48 m~~dga~k~e~~~~~~~s~r~~s~slsGE~Sit~~~~s~d~s~~~L~~T~wGk~~~~L~k-~~~GgFaL~T~~~~ 121 (125) T protein:vir:10 48 LSQDIALKTENDSSQTSSERVKSYSLSGEYTISYDTSTAAASSSNLEESSWGKLYIDLMR-LKVGRWGLITSGGS 121 (125) T ss_pred HhcccccccccccccccccceeeeeeccceEeecccccccccccccccCchHHHHHHHHH-hcCCceeeeccccc Confidence 654431 1123356888885 9999999876654431 1223334444444 44666665443333 No 49 >protein:vir:97069 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453566;genbank:gi:84662601;genbank:GeneID:5142484 Probab=48.55 E-value=0.67 Score=21.55 Aligned_cols=113 Identities=11% Similarity=0.026 Sum_probs=59.9 Q ss_pred cccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCcccccc-CC-cCCCcccCCeeeccccch Q lcl|NC_019411. 19 YCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLR-WP-RTGVYDVDGFLIPSDAIP 96 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~la-wP-R~gv~~~dg~~~~~d~IP 96 (173) =+|+++++.+.... ... +.-||.-.+..+-.|.+++.. |.|++....|... ++ ..+..+.+| ..|| T Consensus 1 mvtLee~K~hLRid--~d~--~d~DDali~~~i~AA~~~v~~---~l~r~l~~~~~~~~~~~~~~~~~~~~-----~~~p 68 (115) T protein:vir:97 1 MITLAMMQRHLQAE--LYE--DDERDYVMQQLLPAARESAEL---FLNRKLYDVQADMLADQVLGVDPSDQ-----LLIT 68 (115) T ss_pred CCCHHHHHHHcCCC--CCC--CchhhHHHHHHHHHHHHHHHH---HhCCcccchhhcccccccccCCCccc-----ccCC Confidence 89999999885332 110 111344566666677777763 5666654433321 11 111111111 2379 Q ss_pred HHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcceee Q lcl|NC_019411. 97 QQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 97 ~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~~ 169 (173) +.|+.|...|.-.+..+-.. +..|+ ....+..++.||.||=. 
-+| +| T Consensus 69 ~~i~~AiLllvg~~Y~NRE~----------v~~~~----------~~elP~~~~~LL~pyR~-----~~G-v~ 115 (115) T protein:vir:97 69 RTVEQAILLTVGEWYSSREQ----------VWIKG----------AGLVTSSAQNLLHPYRK-----FAG-VR 115 (115) T ss_pred HHHHHHHHHHHHHHHhcccc----------ccccc----------ccccCHHHHHHHHHHHh-----hcC-CC Confidence 99999999988665544210 01111 22345678999999832 222 22 No 50 >protein:vir:98481 Length: 136 # NCBI annotation: ORF3 # Family: family:all:32440 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958281;genbank:gi:41057255;uniprot:Q38596;genbank:GeneID:2732865 Probab=46.59 E-value=0.73 Score=21.33 Aligned_cols=116 Identities=16% Similarity=0.056 Sum_probs=54.1 Q ss_pred ccccccHHHHHHHHHhccCccccccCCCHH---HHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeecc Q lcl|NC_019411. 16 ANSYCDVQFADDYIYANVYANTAWDALDQD---GKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPS 92 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~---~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~ 92 (173) -.+|+|++|..+.|..- ..+ +++ +.+++|-.|++.|-..+ |.-+. T Consensus 1 M~~fAtv~Dl~~rw~~~-----~~d--ee~~ra~~~~lL~dAS~~ir~~~---------------p~~~~---------- 48 (136) T protein:vir:98 1 MAAYATVEDYQARAAVT-----LPD--GSPRRAQVEAYLDDASALMARHI---------------PTGHT---------- 48 (136) T ss_pred CCccCCHHHHHHHhccC-----CCC--chhHHHHHHHHHHHHHHHHHHhC---------------CCCCC---------- Confidence 68999999998765421 111 222 34667889999997643 33211 Q ss_pred ccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcceeeeec Q lcl|NC_019411. 93 DAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKIIR 172 (173) Q Consensus 93 d~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~~~~R 172 (173) .-|.-++.-+|......+.++. +..+++.|.-+-.+..++.--.+...++. 
| ++-...-|...+-+-|-+ T Consensus 49 -~~~~~~~~V~~~~V~R~~~np~-------G~~s~TaG~ys~s~t~~G~Lylt~~E~~~-L-g~~rqr~~~~d~a~si~~ 118 (136) T protein:vir:98 49 -PDPGTLRAICVAVVRRVMANPG-------GYRQRTIGQYAETLGEDGGLYLTEDEKGQ-L-QPPDQTAPDADAAYSLDL 118 (136) T ss_pred -CChhHHHHHHHHHHHHHhhCCC-------CcccccchhHHHhhhcCCCcccChHHHHH-h-CCCCCcccccccceeccc Confidence 1144456666666666665433 34456677644333221110012233333 3 221111111112222222 Q ss_pred C Q lcl|NC_019411. 173 H 173 (173) Q Consensus 173 ~ 173 (173) . T Consensus 119 ~ 119 (136) T protein:vir:98 119 D 119 (136) T ss_pred C Confidence 2 No 51 >protein:vir:104344 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398973;genbank:gi:81343957;genbank:GeneID:3778876 Probab=39.44 E-value=1 Score=20.54 Aligned_cols=116 Identities=14% Similarity=0.079 Sum_probs=60.6 Q ss_pred ccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccc Q lcl|NC_019411. 16 ANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAI 95 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~I 95 (173) -| ++.-+.+.. . +..-...||+..+.-+--|-.||=. +.. T Consensus 1 ~~-----~~~~e~~R~--l-~P~f~kvpdevI~~wielA~lfVc~--------------------------------~~~ 40 (132) T protein:vir:10 1 MN-----DAILAFMRS--L-VPALKAVDDESINVWIDLARLYVCA--------------------------------DKF 40 (132) T ss_pred Cc-----hHHHHHHHH--h-cchhhcCChHHHHHHHHHHHHHHHh--------------------------------hcC Confidence 11 122223221 1 2233566777777655555555421 234 Q ss_pred hHHHHHHHHHHHHHHHcCCCCC--CccccceeEEec------ceeEEeecCCCCccc---chHHHHHHHhhhhhcccCCC Q lcl|NC_019411. 96 PQQLMEATAEMAAALMNNDWTS--PQTTRGMKEIQV------DVIELKFDSEIQRGS---MPDIVMSILEGLGVVKTGTR 164 (173) Q Consensus 96 P~~V~~A~~elA~~~~~~~~~~--~~~~~~v~~ekV------G~i~veY~~~~~~~~---~~~~v~~lL~~ll~~~~g~~ 164 (173) +++...|.-..|++++.-|... .+..+..-+++| |+++++|+..++.+. .-||= .|+.-|+. 
..|+| T Consensus 41 g~~~~~AlaL~taHLm~~dga~k~en~~~~t~S~rvaS~Sl~Ge~Sisf~~~sa~~s~L~~tp~G-kl~~~L~k-~~~Gg 118 (132) T protein:vir:10 41 GNDADRAVGLYALHLMLSDGAFKGENEGLETYSRRMASYSLSGEFSITYDNQSAIQGDLSSSSWG-RMYKALLR-KKGGG 118 (132) T ss_pred chhHHHHHHHHHHHHhhccccccccccchhhhhhhhhhhcccCceeeecccccccccccccCcHH-HHHHHHHH-hccCc Confidence 5667778888888887765432 223333334444 999999987665432 12333 55555554 45666 Q ss_pred cceeeeecC Q lcl|NC_019411. 165 PAFKKIIRH 173 (173) Q Consensus 165 ~~~~~~~R~ 173 (173) +|..--.+. T Consensus 119 fgL~t~~~~ 127 (132) T protein:vir:10 119 FGLITSAAG 127 (132) T ss_pred cccccccCc Confidence 654433333 No 52 >protein:vir:4458 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700381;genbank:gi:23505453;genbank:GeneID:955660 Probab=38.71 E-value=1.1 Score=20.46 Aligned_cols=107 Identities=10% Similarity=0.111 Sum_probs=60.4 Q ss_pred ccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccchH Q lcl|NC_019411. 18 SYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIPQ 97 (173) Q Consensus 18 SY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP~ 97 (173) =++|+++++.+.... +. . .-||.-.+..+-.|.+||.. |.|++....+.. +|... .+| -.+|. T Consensus 1 M~vtLee~K~hLRId--~D--~-~dDD~lI~~~i~AA~~~i~~---~~~r~l~~~~~~-~~~~~---~~~-----~~~~~ 63 (107) T protein:vir:44 1 MLLSVEEIKAQLRLD--ED--F-EADERYLQLLARAVQKRTET---YLNRKLYAPDET-IPDSD---PDG-----LLLQD 63 (107) T ss_pred CCCCHHHHHHHcCCC--CC--C-chhHHHHHHHHHHHHHHHHH---hhcCcccccccc-ccccc---ccc-----ccchh Confidence 789999999985432 11 1 11455677677788899874 677776544332 33321 122 24688 Q ss_pred HHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhccc Q lcl|NC_019411. 98 QLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKT 161 (173) Q Consensus 98 ~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~ 161 (173) .++.|++.|+-....+-.... 
..+....+-.+..||.||=.--. T Consensus 64 ~~~~AiLllv~~~Y~NRe~~~--------------------~~~~~~lP~~v~~Ll~~yR~~p~ 107 (107) T protein:vir:44 64 DIRLGMLMLISHFYENRSSVT--------------------EVEKLDMPQSFGWLVGPYRYFPQ 107 (107) T ss_pred hHHHHHHHHHHHHHhhhhhhc--------------------cccccccCHHHHHHHHHhhhcCC Confidence 899999999866554321110 00111234457788877622111 No 53 >protein:vir:5742 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892053;genbank:gi:33770516;uniprot:Q7Y407;genbank:GeneID:2637465 Probab=36.17 E-value=1.2 Score=20.17 Aligned_cols=108 Identities=15% Similarity=0.039 Sum_probs=61.8 Q ss_pred cccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccc--cCCcCCCcccCCeeecccc Q lcl|NC_019411. 17 NSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGL--RWPRTGVYDVDGFLIPSDA 94 (173) Q Consensus 17 nSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~l--awPR~gv~~~dg~~~~~d~ 94 (173) =...|+++.++...-. .. - .-+|+-.+..+..|-.+++ +|.|+|.-.++.. +-|- +.+|..+ T Consensus 1 m~mitLeeiK~hlRid--~D--~-~~eD~lL~~y~~AA~~~~e---~~~~rkLy~~~~~~~~~p~----~~~gl~~---- 64 (110) T protein:vir:57 1 MGMTSLSNVKTQLRLE--ED--F-TEHDDFIESLIDAAQRSIE---RTYYCVLVDSQEALEKLPE----GVRGFLI---- 64 (110) T ss_pred CCCCCHHHHHHHcCCC--CC--C-ChhHHHHHHHHHHHHHHHH---HHhCCcccCCccccccCCC----CCCcccc---- Confidence 3457999999875332 11 0 1245556666667777776 4678876543321 2231 2355555 Q ss_pred chHHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhccc Q lcl|NC_019411. 95 IPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKT 161 (173) Q Consensus 95 IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~ 161 (173) ++.|+.|+..|.-+...+ ||.|++. 
.....+-.++.||.|+..-.- T Consensus 65 -~~di~~A~Lllv~hwYeN------------REav~~~--------~~~~~P~~v~~Ll~P~~~~~~ 110 (110) T protein:vir:57 65 -EPDTQLAARMMVAQWYLN------------PKGTSPD--------GDTPAQLGVEYLLFPLMEHTV 110 (110) T ss_pred -CHHHHHHHHHHHHHHHhc------------ccccccc--------cccchhHHHHHHHHHHHhhcC Confidence 466999999988665544 2222221 122335678999999875332 No 54 >protein:vir:94507 Length: 113 # NCBI annotation: putative DNA packaging protein # Family: family:all:372 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223891;genbank:gi:62327103;genbank:GeneID:5075523 Probab=35.59 E-value=1.2 Score=20.11 Aligned_cols=113 Identities=11% Similarity=0.078 Sum_probs=68.5 Q ss_pred cccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccchHH Q lcl|NC_019411. 19 YCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIPQQ 98 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP~~ 98 (173) -..++..+.-. + .++...++.++-.+.+|.+.|=..+ |. ++ +..+.||.+ T Consensus 1 M~~L~~vK~~l-----g--i~d~~~D~lL~~iI~~a~~~i~~~l---~~------------------~~--~~~~~iP~~ 50 (113) T protein:vir:94 1 MALLDSIKLRI-----G--IEDTKQDDLLTDIISDVQARVLAYV---NQ------------------DG--LVQSELPNG 50 (113) T ss_pred CchHHHHHHHh-----C--CCCCchhhHHHHHHHHHHHHHHHHh---CC------------------cc--chhhhhhhH Confidence 22344444321 2 2345556777778888888886532 11 11 123689999 Q ss_pred HHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcceeeee Q lcl|NC_019411. 99 LMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 171 (173) Q Consensus 99 V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~~~~ 171 (173) +..-.++.|+...+-- ...|+++++++-.|++|.... 
-|...+..|.-+.....+++.| +|.+ T Consensus 51 l~~Iv~evavkryNR~-----g~EG~~S~SeeG~S~sf~~~~----df~~y~~~l~~~~~~~~~~~~g-~rF~ 113 (113) T protein:vir:94 51 LDFVIKDVTIRIYNKI-----GDEGKESSSEGNVSNTWDTPA----DLSEYSDVLDVYRKSYKRRSAG-MRFI 113 (113) T ss_pred HHHHHHHHHHHHhccc-----CCccceeeecCceeeeecCcc----chhhHHHHHHHHHhhccCCCCC-ceeC Confidence 9999999999866542 334789999999999996422 1444445555555444445555 3666 No 55 >protein:vir:3034 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438147;genbank:gi:16271810;genbank:GeneID:929268 Probab=30.09 E-value=1.1 Score=20.44 Aligned_cols=100 Identities=13% Similarity=0.115 Sum_probs=49.5 Q ss_pred HHHHHHHHHhhhcc-cccccCCccccccCCcCCCcccCCeeeccccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEe Q lcl|NC_019411. 50 FLVRASKYLDRSIA-WAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQ 128 (173) Q Consensus 50 aL~~As~~id~~~~-f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ek 128 (173) ++.+|..-||...+ |. .+.+-+-...| | + ..+|.|.|.--..+-+.+-......+.+++.+ T Consensus 1 L~k~A~~~Id~~t~~fY-~~~dle~D~~~-R---------------~-~~fK~Aia~QI~Yld~~G~~t~~d~~s~~Sis 62 (111) T protein:vir:30 1 MEKRASHAVNLYCRNRY-DYKDLKKEIAL-V---------------Q-KAVKRAIAYQIAYLNDSGVMTAEDKQSFAGIS 62 (111) T ss_pred CchhhHHHHhHhhchhh-hhhhHHHHHHH-H---------------H-HHHHHHHHHHHHHHHhcCCCChhhccCcceee Confidence 66789999997442 22 11111111222 2 1 34566655443334333434433466799999 Q ss_pred cceeEEeecCCCCcc-------cchHHHHH---HHhhhhhcccCCCcceee Q lcl|NC_019411. 129 VDVIELKFDSEIQRG-------SMPDIVMS---ILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 129 VG~i~veY~~~~~~~-------~~~~~v~~---lL~~ll~~~~g~~~~~~~ 169 (173) ||-.++.|+...+.+ ..|..... +|...+. 
.-+|...-| T Consensus 63 vGrTsiS~~~~~~~~~~~~~t~~~~~l~~da~n~L~~~Gl--ly~GV~yd~ 111 (111) T protein:vir:30 63 LGRTSISYTVGHGQGSQQKTLADRFNLCLDAENELLVVGL--GYTGISYDR 111 (111) T ss_pred ecceeeeccCccCCCCccccccccccchHHHHHHHHhhcc--ccccccccC Confidence 999999996433321 23444333 4433221 113334344 No 56 >protein:vir:4702 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061634;genbank:gi:9635721;genbank:GeneID:1263015 Probab=21.11 E-value=2.7 Score=18.26 Aligned_cols=106 Identities=9% Similarity=0.049 Sum_probs=56.4 Q ss_pred eEEEEeCCCCCCCccccccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCC Q lcl|NC_019411. 3 FTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGV 82 (173) Q Consensus 3 M~live~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv 82 (173) |+++.++ +++++.|+.-. . .-+|+-.+..+-.|-.||.. +.|..... .| T Consensus 1 M~vt~~d------------LeeiK~~LRID--~-----d~DD~li~~~i~AA~~~I~~---ai~~~~~~-----~~---- 49 (113) T protein:vir:47 1 MQLTAEE------------LKLLKKHCKID--H-----NSEDDLLEIYYSWAFHEIAS---AVTDEPSK-----YI---- 49 (113) T ss_pred CcccHHH------------HHHHHHHhCCC--C-----CcchHHHHHHHHHHHHHHHh---hccccccc-----cc---- Confidence 6666555 89999996432 1 22667777777788899864 34433211 01 Q ss_pred cccCCeeeccccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhh------- Q lcl|NC_019411. 83 YDVDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEG------- 155 (173) Q Consensus 83 ~~~dg~~~~~d~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~------- 155 (173) +....|+.++.|++.|+-....+-....+. 
.....+-.|..||.+ T Consensus 50 --------~~~~~~~~~~~AvllLv~~~YeNR~a~~~~--------------------~~~~vp~~v~sli~qlR~~y~~ 101 (113) T protein:vir:47 50 --------DWFKSHPLFARAIYPLASYYFENRIAYLDR--------------------DLSLAPHMVLSTVHKLRGSFEQ 101 (113) T ss_pred --------cccCCchHHHHHHHHHHHHHHhhhhhcccc--------------------ccccccHHHHHHHHHHHHHHHH Confidence 112346789999999986654443211100 001122234444433 Q ss_pred hhhcccCCCcce Q lcl|NC_019411. 156 LGVVKTGTRPAF 167 (173) Q Consensus 156 ll~~~~g~~~~~ 167 (173) .+....|+..|. T Consensus 102 ~~~~~~~~~~~~ 113 (113) T protein:vir:47 102 FLESENDEESGT 113 (113) T ss_pred HhhhcCCCCCCC Confidence 345555555555 No 57 >protein:vir:78849 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285363;genbank:gi:148717891;genbank:GeneID:5246980 Probab=20.23 E-value=2.8 Score=18.12 Aligned_cols=110 Identities=15% Similarity=0.099 Sum_probs=66.6 Q ss_pred cccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccchHH Q lcl|NC_019411. 19 YCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIPQQ 98 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP~~ 98 (173) -..++..+.-. +.. +...|+.++-.+.+|.+.|-..+ |. -.+.||.+ T Consensus 1 M~~L~~vK~~l-----gI~--d~~~D~lL~~ii~~a~~~i~~~l---~~-----------------------~~~~iP~~ 47 (110) T protein:vir:78 1 MTTLADVKKRI-----GLK--DEKQDEQLEEIIKSCESQLLSML---PI-----------------------EVEQIPER 47 (110) T ss_pred CchHHHHHHHh-----CCC--CCchhHHHHHHHHHHHHHHHHHh---cc-----------------------chhhhhhH Confidence 23344444332 221 34456677777788888776422 21 12579999 Q ss_pred HHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcceeeee Q lcl|NC_019411. 
99 LMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 171 (173) Q Consensus 99 V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~~~~ 171 (173) +..-++++|+...+-- ...|+++++++-.|++|.. .-|.-....|.-+.-.....+-|.+|++ T Consensus 48 l~~iv~ev~vkryNR~-----g~EG~~S~S~eG~S~sf~d-----~d~~~y~~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:78 48 FSYMIKEVAVKRYNRI-----GAEGMTSEAVDGRSNAYEL-----NDFKEYEAIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred HHHHHHHHHHHHhccc-----CccccceeecCceeeeecc-----cccchHHHHHHHHHhhcCCCCCceeeeC Confidence 9999999999866542 2236899999999999953 2233344444444433334445666666 No 58 >protein:vir:96390 Length: 110 # NCBI annotation: ORF048 # Family: family:all:372 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239650;genbank:gi:66395410;genbank:GeneID:5132866 Probab=20.23 E-value=2.8 Score=18.12 Aligned_cols=110 Identities=15% Similarity=0.099 Sum_probs=66.6 Q ss_pred cccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccchHH Q lcl|NC_019411. 19 YCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIPQQ 98 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP~~ 98 (173) -..++..+.-. +.. +...|+.++-.+.+|.+.|-..+ |. -.+.||.+ T Consensus 1 M~~L~~vK~~l-----gI~--d~~~D~lL~~ii~~a~~~i~~~l---~~-----------------------~~~~iP~~ 47 (110) T protein:vir:96 1 MTTLADVKKRI-----GLK--DEKQDEQLEEIIKSCESQLLSML---PI-----------------------EVEQIPER 47 (110) T ss_pred CchHHHHHHHh-----CCC--CCchhHHHHHHHHHHHHHHHHHh---cc-----------------------chhhhhhH Confidence 23344444332 221 34456677777788888776422 21 12579999 Q ss_pred HHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcceeeee Q lcl|NC_019411. 99 LMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 171 (173) Q Consensus 99 V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~~~~ 171 (173) +..-++++|+...+-- ...|+++++++-.|++|.. 
.-|.-....|.-+.-.....+-|.+|++ T Consensus 48 l~~iv~ev~vkryNR~-----g~EG~~S~S~eG~S~sf~d-----~d~~~y~~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:96 48 FSYMIKEVAVKRYNRI-----GAEGMTSEAVDGRSNAYEL-----NDFKEYEAIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred HHHHHHHHHHHHhccc-----CccccceeecCceeeeecc-----cccchHHHHHHHHHhhcCCCCCceeeeC Confidence 9999999999866542 2236899999999999953 2233344444444433334445666666 No 59 >protein:vir:103957 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873994;genbank:gi:118430769;genbank:GeneID:4525451 Probab=20.23 E-value=2.8 Score=18.12 Aligned_cols=110 Identities=15% Similarity=0.099 Sum_probs=66.6 Q ss_pred cccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccchHH Q lcl|NC_019411. 19 YCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIPQQ 98 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP~~ 98 (173) -..++..+.-. +.. +...|+.++-.+.+|.+.|-..+ |. -.+.||.+ T Consensus 1 M~~L~~vK~~l-----gI~--d~~~D~lL~~ii~~a~~~i~~~l---~~-----------------------~~~~iP~~ 47 (110) T protein:vir:10 1 MTTLADVKKRI-----GLK--DEKQDEQLEEIIKSCESQLLSML---PI-----------------------EVEQIPER 47 (110) T ss_pred CchHHHHHHHh-----CCC--CCchhHHHHHHHHHHHHHHHHHh---cc-----------------------chhhhhhH Confidence 23344444332 221 34456677777788888776422 21 12579999 Q ss_pred HHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcceeeee Q lcl|NC_019411. 99 LMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 171 (173) Q Consensus 99 V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~~~~ 171 (173) +..-++++|+...+-- ...|+++++++-.|++|.. 
.-|.-....|.-+.-.....+-|.+|++ T Consensus 48 l~~iv~ev~vkryNR~-----g~EG~~S~S~eG~S~sf~d-----~d~~~y~~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:10 48 FSYMIKEVAVKRYNRI-----GAEGMTSEAVDGRSNAYEL-----NDFKEYEAIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred HHHHHHHHHHHHhccc-----CccccceeecCceeeeecc-----cccchHHHHHHHHHhhcCCCCCceeeeC Confidence 9999999999866542 2236899999999999953 2233344444444433334445666666 No 60 >protein:vir:97145 Length: 110 # NCBI annotation: ORF049 # Family: family:all:372 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239728;genbank:gi:66394913;genbank:GeneID:5130878 Probab=20.23 E-value=2.8 Score=18.12 Aligned_cols=110 Identities=15% Similarity=0.099 Sum_probs=66.6 Q ss_pred cccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccchHH Q lcl|NC_019411. 19 YCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIPQQ 98 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP~~ 98 (173) -..++..+.-. +.. +...|+.++-.+.+|.+.|-..+ |. -.+.||.+ T Consensus 1 M~~L~~vK~~l-----gI~--d~~~D~lL~~ii~~a~~~i~~~l---~~-----------------------~~~~iP~~ 47 (110) T protein:vir:97 1 MTTLADVKKRI-----GLK--DEKQDEQLEEIIKSCESQLLSML---PI-----------------------EVEQIPER 47 (110) T ss_pred CchHHHHHHHh-----CCC--CCchhHHHHHHHHHHHHHHHHHh---cc-----------------------chhhhhhH Confidence 23344444332 221 34456677777788888776422 21 12579999 Q ss_pred HHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcceeeee Q lcl|NC_019411. 99 LMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 171 (173) Q Consensus 99 V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~~~~ 171 (173) +..-++++|+...+-- ...|+++++++-.|++|.. 
.-|.-....|.-+.-.....+-|.+|++ T Consensus 48 l~~iv~ev~vkryNR~-----g~EG~~S~S~eG~S~sf~d-----~d~~~y~~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:97 48 FSYMIKEVAVKRYNRI-----GAEGMTSEAVDGRSNAYEL-----NDFKEYEAIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred HHHHHHHHHHHHhccc-----CccccceeecCceeeeecc-----cccchHHHHHHHHHhhcCCCCCceeeeC Confidence 9999999999866542 2236899999999999953 2233344444444433334445666666 No 61 >protein:vir:9311 Length: 110 # NCBI annotation: phi Mu50B-like protein # Family: family:all:372 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803289;genbank:gi:29028599;genbank:GeneID:1258047 Probab=20.23 E-value=2.8 Score=18.12 Aligned_cols=110 Identities=15% Similarity=0.099 Sum_probs=66.6 Q ss_pred cccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccchHH Q lcl|NC_019411. 19 YCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIPQQ 98 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP~~ 98 (173) -..++..+.-. +.. +...|+.++-.+.+|.+.|-..+ |. -.+.||.+ T Consensus 1 M~~L~~vK~~l-----gI~--d~~~D~lL~~ii~~a~~~i~~~l---~~-----------------------~~~~iP~~ 47 (110) T protein:vir:93 1 MTTLADVKKRI-----GLK--DEKQDEQLEEIIKSCESQLLSML---PI-----------------------EVEQIPER 47 (110) T ss_pred CchHHHHHHHh-----CCC--CCchhHHHHHHHHHHHHHHHHHh---cc-----------------------chhhhhhH Confidence 23344444332 221 34456677777788888776422 21 12579999 Q ss_pred HHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcceeeee Q lcl|NC_019411. 99 LMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 171 (173) Q Consensus 99 V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~~~~ 171 (173) +..-++++|+...+-- ...|+++++++-.|++|.. 
.-|.-....|.-+.-.....+-|.+|++ T Consensus 48 l~~iv~ev~vkryNR~-----g~EG~~S~S~eG~S~sf~d-----~d~~~y~~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:93 48 FSYMIKEVAVKRYNRI-----GAEGMTSEAVDGRSNAYEL-----NDFKEYEAIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred HHHHHHHHHHHHhccc-----CccccceeecCceeeeecc-----cccchHHHHHHHHHhhcCCCCCceeeeC Confidence 9999999999866542 2236899999999999953 2233344444444433334445666666 No 62 >protein:vir:99796 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004309;genbank:gi:122891763;genbank:GeneID:4712351 Probab=20.23 E-value=2.8 Score=18.12 Aligned_cols=110 Identities=15% Similarity=0.099 Sum_probs=66.6 Q ss_pred cccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccchHH Q lcl|NC_019411. 19 YCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIPQQ 98 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP~~ 98 (173) -..++..+.-. +.. +...|+.++-.+.+|.+.|-..+ |. -.+.||.+ T Consensus 1 M~~L~~vK~~l-----gI~--d~~~D~lL~~ii~~a~~~i~~~l---~~-----------------------~~~~iP~~ 47 (110) T protein:vir:99 1 MTTLADVKKRI-----GLK--DEKQDEQLEEIIKSCESQLLSML---PI-----------------------EVEQIPER 47 (110) T ss_pred CchHHHHHHHh-----CCC--CCchhHHHHHHHHHHHHHHHHHh---cc-----------------------chhhhhhH Confidence 23344444332 221 34456677777788888776422 21 12579999 Q ss_pred HHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcceeeee Q lcl|NC_019411. 99 LMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 171 (173) Q Consensus 99 V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~~~~ 171 (173) +..-++++|+...+-- ...|+++++++-.|++|.. 
.-|.-....|.-+.-.....+-|.+|++ T Consensus 48 l~~iv~ev~vkryNR~-----g~EG~~S~S~eG~S~sf~d-----~d~~~y~~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:99 48 FSYMIKEVAVKRYNRI-----GAEGMTSEAVDGRSNAYEL-----NDFKEYEAIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred HHHHHHHHHHHHhccc-----CccccceeecCceeeeecc-----cccchHHHHHHHHHhhcCCCCCceeeeC Confidence 9999999999866542 2236899999999999953 2233344444444433334445666666 No 63 >protein:vir:96221 Length: 110 # NCBI annotation: ORF044 # Family: family:all:372 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239573;genbank:gi:66395333;genbank:GeneID:5132767 Probab=20.23 E-value=2.8 Score=18.12 Aligned_cols=110 Identities=15% Similarity=0.099 Sum_probs=66.6 Q ss_pred cccHHHHHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccchHH Q lcl|NC_019411. 19 YCDVQFADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIPQQ 98 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP~~ 98 (173) -..++..+.-. +.. +...|+.++-.+.+|.+.|-..+ |. -.+.||.+ T Consensus 1 M~~L~~vK~~l-----gI~--d~~~D~lL~~ii~~a~~~i~~~l---~~-----------------------~~~~iP~~ 47 (110) T protein:vir:96 1 MTTLADVKKRI-----GLK--DEKQDEQLEEIIKSCESQLLSML---PI-----------------------EVEQIPER 47 (110) T ss_pred CchHHHHHHHh-----CCC--CCchhHHHHHHHHHHHHHHHHHh---cc-----------------------chhhhhhH Confidence 23344444332 221 34456677777788888776422 21 12579999 Q ss_pred HHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcceeeee Q lcl|NC_019411. 99 LMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 171 (173) Q Consensus 99 V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~~~~ 171 (173) +..-++++|+...+-- ...|+++++++-.|++|.. 
.-|.-....|.-+.-.....+-|.+|++ T Consensus 48 l~~iv~ev~vkryNR~-----g~EG~~S~S~eG~S~sf~d-----~d~~~y~~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:96 48 FSYMIKEVAVKRYNRI-----GAEGMTSEAVDGRSNAYEL-----NDFKEYEAIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred HHHHHHHHHHHHhccc-----CccccceeecCceeeeecc-----cccchHHHHHHHHHhhcCCCCCceeeeC Confidence 9999999999866542 2236899999999999953 2233344444444433334445666666 No 64 >protein:vir:9928 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795690;genbank:gi:28876458;genbank:GeneID:1258013 Probab=20.02 E-value=2.8 Score=18.09 Aligned_cols=117 Identities=15% Similarity=0.074 Sum_probs=65.8 Q ss_pred ccHHH-HHHHHHhccCccccccCCCHHHHHHHHHHHHHHHhhhcccccccCCccccccCCcCCCcccCCeeeccccchHH Q lcl|NC_019411. 20 CDVQF-ADDYIYANVYANTAWDALDQDGKERFLVRASKYLDRSIAWAGEKVDEDSGLRWPRTGVYDVDGFLIPSDAIPQQ 98 (173) Q Consensus 20 ~tv~~-aday~~~r~~~~~~w~~~~~~~~e~aL~~As~~id~~~~f~G~r~~~~Q~lawPR~gv~~~dg~~~~~d~IP~~ 98 (173) -+-++ ++.. ..+ .+...=+...++..+-.|.+|.+.|=..+ |... . .-.++||.+ T Consensus 1 md~~~~L~~v-K~~-lgI~~~D~~~D~lL~~~i~~a~~~i~~~l---~~~~------------~-------~~~~eiP~~ 56 (118) T protein:vir:99 1 MGDKQLIDDI-KLF-IGISKGDGAQDELITLAIYESKERVLAKL---NEYS------------E-------TEITKIPDR 56 (118) T ss_pred CchhhHHHHH-HHH-hCCCCCchhhHHHHHHHHHHHHHHHHHHh---cccc------------c-------cchhhhhHH Confidence 11111 3322 122 12211122335566667778887775422 2111 0 012579999 Q ss_pred HHHHHHHHHHHHHcCCCCCCccccceeEEecceeEEeecCCCCcccchHHHHHHHhhhhhcccCCCcceeeee Q lcl|NC_019411. 99 LMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 171 (173) Q Consensus 99 V~~A~~elA~~~~~~~~~~~~~~~~v~~ekVG~i~veY~~~~~~~~~~~~v~~lL~~ll~~~~g~~~~~~~~~ 171 (173) +...++++|+...+-- ...|+++++++-.|++|.. 
-|+-.+..|.-+.......+-|.+|++ T Consensus 57 l~~iv~evav~ryNR~-----g~EG~~S~SeeG~S~sf~~------d~~ey~~~l~~~~~~~~~~~~g~v~Fi 118 (118) T protein:vir:99 57 LRFIVRDVAIKRFNRI-----NSEGAVEDSEEGKTFKWDS------YLKEYESTLRSAAIGKVYSGKGVARFI 118 (118) T ss_pred HHHHHHHHHHHHhcCc-----CCcccceeecCCeeeeecc------CchhHHHHHHHHhhhcccCcCcceeeC Confidence 9999999999866531 2236899999999999952 134455555555544444555666777 Done!