Query         lcl|NC_019407.1_cdsid_YP_006988766.1 [gene=D868_gp264] [protein=hypothetical protein] [protein_id=YP_006988766.1] [location=53564..54085]
Match_columns 173
No_of_seqs    88 out of 92
Neff          6.0
Searched_HMMs 1612
Date          Thu Nov 7 18:12:56 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_84 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_84_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:95176 Length: 172  100.0 2.5E-61 1.6E-64  352.8  16.1  167 1-173      1-172 (172)
  2 protein:vir:94955 Length: 170  100.0   7E-59 4.3E-62  339.4  15.4  168 4-173      1-170 (170)
  3 protein:vir:80389 Length: 172  100.0 2.7E-58 1.7E-61  336.2  15.9  161 3-173      1-172 (172)
  4 protein:vir:78383 Length: 169  100.0 1.2E-57 7.3E-61  332.7  15.6  164 3-173      1-169 (169)
  5 protein:vir:95004 Length: 169  100.0 1.8E-57 1.1E-60  331.7  15.5  164 3-173      1-169 (169)
  6 protein:vir:97267 Length: 172  100.0 1.4E-55 8.4E-59  321.4  14.9  161 3-173      1-172 (172)
  7 protein:vir:80967 Length: 131   97.9 2.1E-07 1.3E-10   57.2   9.3  125 17-172     1-131 (131)
  8 protein:vir:43 Length: 131 # N  97.9 2.2E-07 1.4E-10   57.1   9.0  125 17-172     1-131 (131)
  9 protein:vir:98900 Length: 132   97.6 1.6E-06 9.6E-10   52.4   9.8  124 17-173     1-132 (132)
 10 protein:vir:9576 Length: 131 #  93.9  0.0032   2E-06   34.3  10.4  124 16-173     1-130 (131)
 11 protein:vir:4788 Length: 130 #  93.0  0.0038 2.3E-06   33.9   9.2  124 17-169     1-130 (130)
 12 protein:vir:5256 Length: 119 #  91.8  0.0094 5.9E-06   31.7   9.8  106 19-169     1-119 (119)
 13 protein:vir:2505 Length: 128 #  91.6  0.0031 1.9E-06   34.4   6.9  122 13-173     1-124 (128)
 14 protein:vir:9821 Length: 138 #  91.4  0.0053 3.3E-06   33.0   8.0  126 1-169      1-138 (138)
 15 protein:vir:107756 Length: 147  91.3  0.0087 5.4E-06   31.9   9.1  126 3-173      1-140 (147)
 16 protein:vir:79701 Length: 144   91.2   0.013 8.2E-06   30.9  10.0  132 16-168     1-144 (144)
 17 protein:vir:9761 Length: 140 #  89.5   0.023 1.4E-05   29.6   9.8  126 16-173     1-134 (140)
 18 protein:vir:1887 Length: 108 #  89.3   0.013 8.3E-06   30.9   8.4  105 1-167      1-108 (108)
 19 protein:vir:192 Length: 108 #   89.3   0.013 8.3E-06   30.9   8.4  105 1-167      1-108 (108)
 20 protein:vir:94761 Length: 132   89.1   0.023 1.5E-05   29.5   9.6  125 16-173     1-131 (132)
 21 protein:vir:99002 Length: 158   88.4   0.032   2E-05   28.8  10.2  123 16-173     1-124 (158)
 22 protein:vir:107702 Length: 136  87.6   0.037 2.3E-05   28.5   9.6  121 14-173     1-133 (136)
 23 protein:vir:79640 Length: 134   86.3    0.04 2.5E-05   28.2   9.1  117 16-173     1-130 (134)
 24 protein:vir:1435 Length: 188 #  85.8    0.05 3.1E-05   27.7   9.9  127 1-167      1-188 (188)
 25 protein:vir:79074 Length: 150   85.7   0.027 1.7E-05   29.2   7.9  118 17-156     1-150 (150)
 26 protein:vir:100245 Length: 113  85.2   0.048 2.9E-05   27.8   9.0  113 17-167     1-113 (113)
 27 protein:vir:1993 Length: 141 #  84.7  0.0045 2.8E-06   33.4   3.1  119 17-157     1-141 (141)
 28 protein:vir:80320 Length: 188   84.6   0.059 3.7E-05   27.3  10.1  128 1-167      1-188 (188)
 29 protein:vir:100103 Length: 120  84.0   0.064   4E-05   27.1   9.7  120 13-167     1-120 (120)
 30 protein:vir:107864 Length: 150  83.9   0.036 2.2E-05   28.5   7.7  118 17-156     1-150 (150)
 31 protein:vir:10365 Length: 115   83.5   0.062 3.8E-05   27.2   8.8  114 19-169     1-115 (115)
 32 protein:vir:7857 Length: 188 #  82.8   0.042 2.6E-05   28.1   7.6  121 22-162     1-188 (188)
 33 protein:vir:101652 Length: 188  82.8   0.042 2.6E-05   28.1   7.6  121 22-162     1-188 (188)
 34 protein:vir:93592 Length: 108   81.5   0.085 5.3E-05   26.5   9.5  108 16-168     1-108 (108)
 35 protein:vir:486 Length: 107 #   81.4   0.076 4.7E-05   26.7   8.5  104 18-161     1-107 (107)
 36 protein:vir:1640 Length: 132 #  79.6     0.1 6.4E-05   26.0   9.5  128 16-173     1-131 (132)
 37 protein:vir:103846 Length: 138  78.8   0.022 1.4E-05   29.6   4.7  123 17-173     1-137 (138)
 38 protein:vir:99570 Length: 153   76.6    0.13 8.3E-05   25.4  10.8  128 3-173      1-147 (153)
 39 protein:vir:97069 Length: 115   75.7    0.14 8.9E-05   25.2   8.3  113 19-169     1-115 (115)
 40 protein:vir:4512 Length: 107 #  75.4    0.14 8.9E-05   25.2   8.1  107 18-161     1-107 (107)
 41 protein:vir:81069 Length: 115   72.3    0.18 0.00011   24.6   8.2  113 19-169     1-115 (115)
 42 protein:vir:103283 Length: 125  72.0    0.19 0.00012   24.6   9.0  109 31-173     1-122 (125)
 43 protein:vir:99848 Length: 172   65.5   0.079 4.9E-05   26.6   4.4  134 1-156      1-172 (172)
 44 protein:vir:79253 Length: 138   64.6   0.041 2.6E-05   28.2   2.7  119 17-173     1-137 (138)
 45 protein:vir:99222 Length: 138   64.6   0.041 2.6E-05   28.2   2.7  119 17-173     1-137 (138)
 46 protein:vir:4458 Length: 107 #  60.0    0.38 0.00024   22.9   8.8  107 18-161     1-107 (107)
 47 protein:vir:80036 Length: 111   58.1    0.15 9.4E-05   25.1   4.5  109 16-171     1-111 (111)
 48 protein:vir:96108 Length: 155   50.6     0.6 0.00037   21.8  10.5  126 13-173     1-149 (155)
 49 protein:vir:104344 Length: 132  44.7     0.8 0.00049   21.1   8.9  116 16-173     1-129 (132)
 50 protein:vir:102961 Length: 131  44.1    0.82 0.00051   21.1   6.8  118 21-171     1-131 (131)
 51 protein:vir:5742 Length: 110 #  43.8    0.83 0.00052   21.0   9.0  108 17-161     1-110 (110)
 52 protein:vir:3034 Length: 111 #  43.6    0.48  0.0003   22.3   4.9   97 50-169     1-111 (111)
 53 protein:vir:94507 Length: 113   41.5    0.92 0.00057   20.8   9.2  113 19-171     1-113 (113)
 54 protein:vir:98481 Length: 136   38.9       1 0.00065   20.5   7.6  113 16-173     1-127 (136)
 55 protein:vir:1384 Length: 92 #   35.6     1.2 0.00076   20.1   8.9   92 20-164     1-92 (92)
 56 protein:vir:8104 Length: 170 #  32.2     1.4 0.00089   19.7   9.8  116 31-165     1-170 (170)
 57 protein:vir:4702 Length: 113 #  22.3     2.5  0.0015   18.4   8.5  106 3-167      1-113 (113)
 58 protein:vir:94064 Length: 167   21.8     2.5  0.0016   18.4   9.6  124 3-173      1-145 (167)
 59 protein:vir:2432 Length: 124 #  21.6     2.6  0.0016   18.3   6.5  119 17-173     1-124 (124)

No 1
>protein:vir:95176 Length: 172 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:703 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293420;genbank:gi:148912841;genbank:GeneID:5228251
Probab=100.00 E-value=2.5e-61 Score=352.82 Aligned_cols=167 Identities=28% Similarity=0.401 Sum_probs=148.9

Q ss_pred CeeeEEeeCCCCCCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhh-hhccccccCCccccccCCc
Q lcl|NC_019407.
1 MAFTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDR-TIAWAGEKVDEDSGLRWPR 79 (173) Q Consensus 1 ~~M~liVe~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~-~~~~~G~r~~~~Q~lawPR 79 (173) |||+||||||+|+|+||||+|+++|++||++|+. |.++++++||++|++|++|||+ .++|+|+|++++|+|+||| T Consensus 1 ~~Malive~~~g~~~anSYvtv~ea~aY~~~rg~----~~~~~~~~ke~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR 76 (172) T protein:vir:95 1 MAITIVVEDGSGVTNANSYVSVADARIYASNRGV----ELPLDDDELAAMLIRSTDYLEAQACRFQGKPTSTTQALQWPR 76 (172) T ss_pred CceeEEEeCCCCCCcccccccHHHHHHHHHhcCC----cCCCChHHHHHHHHHHHHHhhccCCceeeeecCCcccccCCc Confidence 9999999999999999999999999999999854 7777999999999999999996 4799999999999999999 Q ss_pred CCCcccCCeeeccccchHHHHHHHHHHHHHHHcCCCCCCc--cccceeEEecCeeEEeecCCCCCc--cchHHHHHHHHh Q lcl|NC_019407. 80 AGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQ--TTRGMKEIQVDVIELKFDSEIQRG--SMPDIVMSILEG 155 (173) Q Consensus 80 ~gv~~~dg~~~~~~~IP~~V~~A~~elA~~~~~~~~~~~~--~~~~v~~~kVG~isveY~~~~~~~--~~~~~v~~lL~~ 155 (173) +|+. .+|..+++|.||++||+||||||+++++++++.+. ..+.||+||||+|+|||+.+.+.+ +.|++|++||+| T Consensus 77 ~g~~-~~~~~v~~~~IP~~V~~A~~elA~~~~~~~~~~~~~~~~~~vk~~kVG~I~veY~~~~~~~~~~~~~~v~~LL~p 155 (172) T protein:vir:95 77 TGVF-LNEDEVPSNVIPKSLIAAQVQLTMAINAGFDLQPNVSPQDYVTREKVGPIETEYADPLSVGIMPTFTAANALLAP 155 (172) T ss_pred CCcc-cCcccccccchhHHHHHHHHHHHHHHHcCccccccCCcccceeEEeccceEEeeccCCCCCCcccHHHHHHHHhh Confidence 9987 69999999999999999999999999999765544 445699999999999998776554 679999999999 Q ss_pred hhhhccCCcccccceecC Q lcl|NC_019407. 
156 LGVVKTGTRPAFKKIIRH 173 (173) Q Consensus 156 ll~~~~G~~~~~~rv~R~ 173 (173) |++..+|++++ +||+|- T Consensus 156 ~l~~~~~~~~~-~r~~r~ 172 (172) T protein:vir:95 156 LFGECASNKFA-LRTIRV 172 (172) T ss_pred hhcccCCccee-eEEEeC Confidence 99755444443 699999 No 2 >protein:vir:94955 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239280;genbank:gi:66392062;genbank:GeneID:5076600 Probab=100.00 E-value=7e-59 Score=339.41 Aligned_cols=168 Identities=23% Similarity=0.457 Sum_probs=154.1 Q ss_pred eEEeeCCCCCCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCc Q lcl|NC_019407. 4 TFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVY 83 (173) Q Consensus 4 ~liVe~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~ 83 (173) -||||||+|+|+||||+|++||++||+.|+ ....|.++++++||++|++|++|||++|+|+|+|++++|+|+|||+|+. T Consensus 1 m~~i~~~~g~~~AnSYvtv~ea~aY~~~r~-~~~~w~~~~~~~~e~aL~~A~dyId~~~~f~G~r~~~~Q~l~wPR~g~~ 79 (170) T protein:vir:94 1 MPTVDATPGSITANSYVTVAEANSYFDGSY-GRPLWTSASEDEKASLVISASRYLDQMMAWIGAPTNPEQSMWWPCKNAV 79 (170) T ss_pred CceeecCCCCCcccceecHHHHHHHHHhhc-cccccCCCCHHHHHHHHHHHHHHhccccccccccCCcchhhcccccCcc Confidence 255699999999999999999999999996 4667999999999999999999999889999999999999999999986 Q ss_pred ccCCeeeccccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhc--c Q lcl|NC_019407. 84 DIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVK--T 161 (173) Q Consensus 84 ~~dg~~~~~~~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~--~ 161 (173) .||..++++.||++||+||||||++++++++..+..++.||+||||+|+|||+.+++..++|+.|++||+||+... . 
T Consensus 80 -~dg~~~~~~~IP~~V~~Aq~elA~~~~~~~~~~~~~~~~v~~~kVG~i~veY~~~~~~~~~~~~v~~LL~p~l~~~~~g 158 (170) T protein:vir:94 80 -IGGMTLSQVSIPVKVKIAVFELAYFMLESGAALSFADQTIDSVKVGTIRVEFTKNSTDAGLPTFVEAMLSGFGSPVLYG 158 (170) T ss_pred -cCccccccchhhHHHHHHHHHHHHHHHhCcccCcccccceeeEecceeEEEecCCCCCCccHHHHHHHhhhhhcccccc Confidence 8999999999999999999999999999988888888899999999999999988888889999999999999652 3 Q ss_pred CCcccccceecC Q lcl|NC_019407. 162 GTRPAFKKIIRH 173 (173) Q Consensus 162 G~~~~~~rv~R~ 173 (173) +++...++|+|. T Consensus 159 ~~~~~~~~~~r~ 170 (170) T protein:vir:94 159 SNAARSIDLVRA 170 (170) T ss_pred ccccceeeeecC Confidence 345556799999 No 3 >protein:vir:80389 Length: 172 # NCBI annotation: BcepGomrgp09 # Family: family:all:703 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210229;genbank:gi:146329921;genbank:GeneID:5123498 Probab=100.00 E-value=2.7e-58 Score=336.19 Aligned_cols=161 Identities=32% Similarity=0.530 Sum_probs=141.7 Q ss_pred eeEEeeCCCCCCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhh-hccccccCCccccccCCcCC Q lcl|NC_019407. 3 FTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRT-IAWAGEKVDEDSGLRWPRAG 81 (173) Q Consensus 3 M~liVe~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~-~~~~G~r~~~~Q~lawPR~g 81 (173) |+||||||+|+|+||||+|+++|++||++| |..+++++||++|++|+||||+. ++|+|+|++++|+|+|||+| T Consensus 1 Malived~~g~~~anSYvt~~~a~aY~~~r------g~~~~~d~~e~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g 74 (172) T protein:vir:80 1 MALIVEDGTGKPDANTYAGADFVIAYAQAR------GVTVDADEAERLILEAMDYIESFRRRWKGERNTREQGLTWPRHD 74 (172) T ss_pred CeeEeeCCCCCccccccccHHHHHHHHHHc------CCCcCHHHHHHHHHHHHHHHhhccCccccccCCccccccccccC Confidence 999999999999999999999999999998 34556778999999999999983 37999999999999999999 Q ss_pred CcccCCeeeccccchHHHHHHHHHHHHHHHcCCC-CCCccccceeEEecCeeEEeecCCCCC---------ccchHHHHH Q lcl|NC_019407. 
82 VYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDW-TSPQTTRGMKEIQVDVIELKFDSEIQR---------GSMPDIVMS 151 (173) Q Consensus 82 v~~~dg~~~~~~~IP~~V~~A~~elA~~~~~~~~-~~~~~~~~v~~~kVG~isveY~~~~~~---------~~~~~~v~~ 151 (173) ++ .||..+|++.||++||+||||||++++++.. .+......||+||||+||+||+.+.+. .++|++|++ T Consensus 75 ~~-~~g~~~~~~~IP~~v~~A~~elA~~~~~g~~~~~~~~~~~v~~ekVG~i~~eY~~~~~~~~~~~~~~~~~~~~~v~~ 153 (172) T protein:vir:80 75 AV-VDGFVIPSDVIPKELQSAVAAAVIEQVNGFELQQSQDQWAVRIEKVDVIEVQYAAGGGGQSASANAPMKPTFPKIDA 153 (172) T ss_pred cc-cCcccccccchhHHHHHHHHHHHHHHhcCCccCcCCCCceeeEEeccceEEeeecccCccccccccCCccchHHHHH Confidence 86 8999999999999999999999999999854 444556679999999999999855433 356899999 Q ss_pred HHHhhhhhccCCcccccceecC Q lcl|NC_019407. 152 ILEGLGVVKTGTRPAFKKIIRH 173 (173) Q Consensus 152 lL~~ll~~~~G~~~~~~rv~R~ 173 (173) ||+||++ |+++.++++||- T Consensus 154 LL~p~l~---~~gg~~~~~vrg 172 (172) T protein:vir:80 154 LLNPLLV---GDGGLFLVAVRG 172 (172) T ss_pred HHhhhhc---CCCCeeeeeecC Confidence 9999986 445567799999 No 4 >protein:vir:78383 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110841;genbank:gi:134288602;genbank:GeneID:5179646 Probab=100.00 E-value=1.2e-57 Score=332.70 Aligned_cols=164 Identities=21% Similarity=0.259 Sum_probs=145.3 Q ss_pred eeEEeeCCCCCCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhh-hccccccCCccccccCCcCC Q lcl|NC_019407. 3 FTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRT-IAWAGEKVDEDSGLRWPRAG 81 (173) Q Consensus 3 M~liVe~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~-~~~~G~r~~~~Q~lawPR~g 81 (173) |+||||||+|+|+||||+|+++|++||++|++ |..+++++||++|++|++|||+. 
++|+|+|++++|+|+|||+| T Consensus 1 MaliV~~~~g~~~anSYvtv~~a~aY~~~rg~----~~~~d~~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg 76 (169) T protein:vir:78 1 MPLIVETGQGIPNADSYVSLEDGRALAAKYGL----ELPEDDTAAEAALRNGAVYVGLFESQMCGRRVSANQALAFPRTG 76 (169) T ss_pred CeeEeeCCCCCccccccccHHHHHHHHHHcCC----cCCCChHHHHHHHHHHHHHhhhccccceeeeCCcccccccccCC Confidence 99999999999999999999999999999964 55669999999999999999972 38999999999999999999 Q ss_pred CcccCCeeeccccchHHHHHHHHHHHHHHHcCCCCC-CccccceeEEec-CeeEEeecCCCCCc--cchHHHHHHHHhhh Q lcl|NC_019407. 82 VYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTS-PQTTRGMKEIQV-DVIELKFDSEIQRG--SMPDIVMSILEGLG 157 (173) Q Consensus 82 v~~~dg~~~~~~~IP~~V~~A~~elA~~~~~~~~~~-~~~~~~v~~~kV-G~isveY~~~~~~~--~~~~~v~~lL~~ll 157 (173) +. .||..+|++.||++||+||||||++++++++.. +...+.|++||| |+|++||+.+++.+ +.|++|++||+||+ T Consensus 77 ~~-~~g~~~~~~~IP~~v~~A~~elA~~~~~g~~~~~~~~~~~v~~e~v~G~i~veY~~~~~~~~~~~~~~~~~LL~p~l 155 (169) T protein:vir:78 77 VT-LHGFPQPSNVIPPLVIQAQVMAAVEYGAGTDVRGSTDGREVQTERVEGAVTVSYFKNGYSGGTVSITTADDALRPLL 155 (169) T ss_pred ce-ecccccccccchHHHHHHHHHHHHHHhcCcccCCCCCcceeEEEEecCceeEeecCCCCCCCcccHHHHHHHhhhhc Confidence 86 899999999999999999999999999987655 456667888887 99999999877654 56899999999999 Q ss_pred hhccCCcccccceecC Q lcl|NC_019407. 158 VVKTGTRPAFKKIIRH 173 (173) Q Consensus 158 ~~~~G~~~~~~rv~R~ 173 (173) + .|+|+..+||+|- T Consensus 156 ~--~~~g~~~i~~~rg 169 (169) T protein:vir:78 156 C--GSNNAYSFNVFRG 169 (169) T ss_pred c--cCCCcceeeeecC Confidence 6 3344445699999 No 5 >protein:vir:95004 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224025;genbank:gi:62327312;genbank:GeneID:5176832 Probab=100.00 E-value=1.8e-57 Score=331.67 Aligned_cols=164 Identities=20% Similarity=0.244 Sum_probs=144.1 Q ss_pred eeEEeeCCCCCCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhh-hccccccCCccccccCCcCC Q lcl|NC_019407. 
3 FTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRT-IAWAGEKVDEDSGLRWPRAG 81 (173) Q Consensus 3 M~liVe~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~-~~~~G~r~~~~Q~lawPR~g 81 (173) |+||||||+|+|+||||+|++||++||++|++ |..+|+++||++|++|++|||+. ++|+|+|++++|+|+|||+| T Consensus 1 M~liv~~~~g~~~anSYvt~~ea~aY~~~rg~----~~~~dd~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg 76 (169) T protein:vir:95 1 MPLIVETGQGLPNADSYVSLEDGRALAAKYGL----ELPEDDIAAEASLRNGAVYVGLFESQMCGRRVSANQALAFPRTG 76 (169) T ss_pred CeeEEeCCCCCCcccccccHHHHHHHHHHcCC----cCCCCHHHHHHHHHHHHHHhhccccccccccCCcchhhccccCC Confidence 99999999999999999999999999999964 55569999999999999999983 38999999999999999999 Q ss_pred CcccCCeeeccccchHHHHHHHHHHHHHHHcCCCCCC-ccccceeEEec-CeeEEeecCCCCCc--cchHHHHHHHHhhh Q lcl|NC_019407. 82 VYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSP-QTTRGMKEIQV-DVIELKFDSEIQRG--SMPDIVMSILEGLG 157 (173) Q Consensus 82 v~~~dg~~~~~~~IP~~V~~A~~elA~~~~~~~~~~~-~~~~~v~~~kV-G~isveY~~~~~~~--~~~~~v~~lL~~ll 157 (173) +. .||..++++.||++||+||||||++++++++..+ ...+.|+++|+ |+|++||+.+++.+ +.|+++++||+||+ T Consensus 77 ~~-~~g~~~~~~~IP~~V~~A~~elA~~~~~g~~~~~~~~~~~v~~e~v~G~i~veY~~~~~~~~~~~~~a~~~LL~p~l 155 (169) T protein:vir:95 77 ID-LHGFPQPSNVIPSLVIQAQVMAAVEYGAGTDVRGSTDGREVQTERVEGAVTVSYFKNGYSGGTVSITAADDALRPLL 155 (169) T ss_pred ce-ecccccccccchHHHHHHHHHHHHHHHcCccccCCCCccceeeeeeccceeEeecCCCCcCccccHHHHHHhhhhhc Confidence 64 8999999999999999999999999999876444 45566887766 99999999877665 56899999999999 Q ss_pred hhccCCcccccceecC Q lcl|NC_019407. 
158 VVKTGTRPAFKKIIRH 173 (173) Q Consensus 158 ~~~~G~~~~~~rv~R~ 173 (173) + +|+|...+||+|- T Consensus 156 ~--g~~g~~~i~~~rg 169 (169) T protein:vir:95 156 C--GSNNAYSFNVFRG 169 (169) T ss_pred c--cCCCcceeeeecC Confidence 6 3344445699999 No 6 >protein:vir:97267 Length: 172 # NCBI annotation: hypothetical protein ORF024 # Family: family:all:703 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294532;genbank:gi:149408253;genbank:GeneID:5237132 Probab=100.00 E-value=1.4e-55 Score=321.38 Aligned_cols=161 Identities=21% Similarity=0.278 Sum_probs=138.0 Q ss_pred eeEEeeCCCC-CCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhcccccc-CCccccccCCcC Q lcl|NC_019407. 3 FTFVVETGAG-DPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEK-VDEDSGLRWPRA 80 (173) Q Consensus 3 M~liVe~g~g-~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r-~~~~Q~lawPR~ 80 (173) |+||||||+| +|+||||+|+++|++||+.|++ .|.+.++++||++|++|++|||+.|+|+|+| ++++|+|+|||+ T Consensus 1 m~liveD~t~~~~~AnSYvtv~~a~aY~~~rg~---~~~a~~~~~ke~aLi~A~~yiD~~~~f~G~r~~~~~Q~l~WPRt 77 (172) T protein:vir:97 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGN---SFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRT 77 (172) T ss_pred CceEeeCCCCCCCCccccccHHHHHHHHHhcCc---ccCCCCcHHHHHHHHHHHHHHhhhhhcccCCCCCcchhhhcccC Confidence 8899999998 7999999999999999999975 3777799999999999999999989999987 589999999999 Q ss_pred CCcccCCeeeccccchHHHHHHHHHHHHHHHcCCCCCCcc------ccceeEEecCeeEEeecCCCCC---ccchHHHHH Q lcl|NC_019407. 81 GVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQT------TRGMKEIQVDVIELKFDSEIQR---GSMPDIVMS 151 (173) Q Consensus 81 gv~~~dg~~~~~~~IP~~V~~A~~elA~~~~~~~~~~~~~------~~~v~~~kVG~isveY~~~~~~---~~~~~~v~~ 151 (173) |++ ||..+++|.||++||+||||||++++++++.+... ...+||+|||+|+++|+..++. 
.+.|++|++ T Consensus 78 g~~--d~~~~~~~~IP~~v~~A~~elA~~al~~~l~~d~~~~~~~~~v~~kr~kvg~i~~~y~~~~~~~~~~p~~~~v~a 155 (172) T protein:vir:97 78 DAW--DRDRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQ 155 (172) T ss_pred CCC--CCcccccccccHHHHHHHHHHHHHHHhcccccccccccccccceeeeeeecceeeEeeccCCCCCccccHHHHHH Confidence 985 68999999999999999999999999998754321 2248999999999999765443 467999999 Q ss_pred HHHhhhhhccCCcccccceecC Q lcl|NC_019407. 152 ILEGLGVVKTGTRPAFKKIIRH 173 (173) Q Consensus 152 lL~~ll~~~~G~~~~~~rv~R~ 173 (173) ||+|++... ++| +++|. T Consensus 156 LL~p~gl~~---~~~--~~~r~ 172 (172) T protein:vir:97 156 KLVRAGLVR---SGG--TLLRG 172 (172) T ss_pred HHhhhcccc---Ccc--eeccC Confidence 999975432 222 66777 No 7 >protein:vir:80967 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468394;genbank:gi:157324968;genbank:GeneID:5601402 Probab=97.90 E-value=2.1e-07 Score=57.21 Aligned_cols=125 Identities=18% Similarity=0.203 Sum_probs=76.3 Q ss_pred cccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccch Q lcl|NC_019407. 17 NSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIP 96 (173) Q Consensus 17 nSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~IP 96 (173) =+|+|.++..... .+. .+++++-+++|.+|+++||. +.| -|.. ..+..=..+.+| T Consensus 1 M~Y~d~~~Y~~~y----~G~----~i~e~~F~~l~~rAs~~ID~-~T~-------------~ri~---~~~~d~~~~~~~ 55 (131) T protein:vir:80 1 MPYTTLEFYTNEY----AGE----HLEQDEFAKLLKHAERKIDS-VTF-------------YRIR---KSGIEAFSEFIQ 55 (131) T ss_pred CCCCCHHHHHHhh----CCC----CCchhHHHHHHHHHHHHHHH-Hhc-------------cccc---ccccccCchhHH Confidence 5799998875432 122 24678889999999999998 332 2210 011111124689 Q ss_pred HHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCcc------chHHHHHHHHhhhhhccCCcccccce Q lcl|NC_019407. 
97 QQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGS------MPDIVMSILEGLGVVKTGTRPAFKKI 170 (173) Q Consensus 97 ~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~------~~~~v~~lL~~ll~~~~G~~~~~~rv 170 (173) .+||.|+|+.|-.+...+.......+++++++||..+|+|...+..+. ....+..+|.+-+-=. +|-+ T Consensus 56 ~~vk~A~c~q~e~~~~~g~~~~~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~TGLly--rGV~---- 129 (131) T protein:vir:80 56 HQIQLATCNQIEYFKEAGGTSELAVSKPDNVSIGRTSISDSNFASTATSLNSGLVGSDVRSYLAHTGLLY--NGVG---- 129 (131) T ss_pred HHHHHHHHHHHHHHHHhhhhhhhcccccCeeeeCceEEeeccccchhhhhhhhhhHHHHHHHHhccCCee--cCCC---- Confidence 999999999997666644333334567999999999999976443321 2334555565432111 2333 Q ss_pred ec Q lcl|NC_019407. 171 IR 172 (173) Q Consensus 171 ~R 172 (173) +| T Consensus 130 ~~ 131 (131) T protein:vir:80 130 VR 131 (131) T ss_pred CC Confidence 12 No 8 >protein:vir:43 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463469;swissprot:trembl:q9t1b5;genbank:gi:16798791;uniprot:Q9T1B5;genbank:GeneID:922374 Probab=97.88 E-value=2.2e-07 Score=57.07 Aligned_cols=125 Identities=19% Similarity=0.232 Sum_probs=75.9 Q ss_pred cccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccch Q lcl|NC_019407. 17 NSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIP 96 (173) Q Consensus 17 nSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~IP 96 (173) =+|+|.++..... + + ..+++++-+++|.+|+++||. +.| -|.. 
..+..-..+.+| T Consensus 1 M~Y~d~~~Y~~~y---~-g----~~i~e~~F~~l~~rAs~~ID~-~T~-------------~ri~---~~~~~~~~~~~~ 55 (131) T protein:vir:43 1 MPYTTLEFYNDEY---A-G----EHLEQDEFDKLLKHAERKIDS-VTF-------------YRIR---KGGIESFSEFIQ 55 (131) T ss_pred CCCCCHHHHHHhh---C-C----CCCCHhHHHHHHHHHHHHHHH-Hhc-------------cccc---ccCccccchhhH Confidence 5799998875432 1 2 234778889999999999997 332 1211 001111124689 Q ss_pred HHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCcc------chHHHHHHHHhhhhhccCCcccccce Q lcl|NC_019407. 97 QQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGS------MPDIVMSILEGLGVVKTGTRPAFKKI 170 (173) Q Consensus 97 ~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~------~~~~v~~lL~~ll~~~~G~~~~~~rv 170 (173) .+||.|+|+.|-.+...+.......+++++++||..+|+|...+.... ....+..+|.+-+-=. +|-+ T Consensus 56 ~~vk~A~c~q~e~~~~~g~~s~~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~TGLly--rGV~---- 129 (131) T protein:vir:43 56 HQIQLATCNQIEYFKEAGGTSELAVSKPDNVSIGRTSISDSNFASTATSLNSGLIGSDVRSYLAHTGLLY--NGVG---- 129 (131) T ss_pred HHHHHHHHHHHHHHHHhHHHhhhhccccCeeecCceEEeecccccchhhhchhhhHHHHHHHHhccCCee--cCCC---- Confidence 999999999997666544333334456999999999999976443221 2344555565432111 2333 Q ss_pred ec Q lcl|NC_019407. 171 IR 172 (173) Q Consensus 171 ~R 172 (173) +| T Consensus 130 ~~ 131 (131) T protein:vir:43 130 VR 131 (131) T ss_pred CC Confidence 12 No 9 >protein:vir:98900 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:3882 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164420;genbank:gi:56694910;genbank:GeneID:3197290 Probab=97.60 E-value=1.6e-06 Score=52.43 Aligned_cols=124 Identities=15% Similarity=0.046 Sum_probs=74.9 Q ss_pred cccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccch Q lcl|NC_019407. 17 NSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIP 96 (173) Q Consensus 17 nSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~IP 96 (173) =+|+|.++.+.|. +. .+++++-+++|.+|+++||. 
+.| ++ . +.++..-....++ T Consensus 1 M~Y~t~~~Y~~~~-----G~----~i~e~~F~~l~~rAs~~ID~-iT~-~r------------i---~~~~~~~d~~~~~ 54 (132) T protein:vir:98 1 MPYLTYEEFMDLN-----GR----DIDDKKFEKLLPKASAIIDG-VTG-HF------------Y---QKVDMEKDNAWRV 54 (132) T ss_pred CCCCCHHHHHhhc-----CC----CCCHHHHHHHHHHHHHHHHH-Hhc-cc------------c---cCCCccccChHHH Confidence 6799999887762 22 24778899999999999997 433 11 0 0111111234577 Q ss_pred HHHHHHHHHHHHHHHcCCCCCC-ccccceeEEecCeeEEeecCCCCC----cc---chHHHHHHHHhhhhhccCCccccc Q lcl|NC_019407. 97 QQLMEATAEMAAALMNNDWTSP-QTTRGMKEIQVDVIELKFDSEIQR----GS---MPDIVMSILEGLGVVKTGTRPAFK 168 (173) Q Consensus 97 ~~V~~A~~elA~~~~~~~~~~~-~~~~~v~~~kVG~isveY~~~~~~----~~---~~~~v~~lL~~ll~~~~G~~~~~~ 168 (173) .+||.|+|..+-.+.+.+.... ...+.+++++||..+|+|..+.+. .. ...-+..+|.+.+-=. +|.+ T Consensus 55 ~~vk~A~c~qiey~~~~G~~sae~~~~~~~S~svG~~Svs~~s~~~~~~~~~~~~~~~~~a~~~L~~tGLLy--rGV~-- 130 (132) T protein:vir:98 55 NQFKLALCAQIEYFDALGATTFEEINNSPQTFQAGRTSVSNASRYNPSGANESKPLVAEDVYIYLQGTGLLF--QGVK-- 130 (132) T ss_pred HHHHHHHHHHHHHHHhccchhhhhccCccceeeeCcEEEEeeccCCcccccccccchHHHHHHHHhhcCCcc--ccCC-- Confidence 8999999999976665443332 235569999999999999754321 11 1234555665532211 1211 Q ss_pred ceecC Q lcl|NC_019407. 169 KIIRH 173 (173) Q Consensus 169 rv~R~ 173 (173) |- T Consensus 131 ---~~ 132 (132) T protein:vir:98 131 ---TW 132 (132) T ss_pred ---CC Confidence 11 No 10 >protein:vir:9576 Length: 131 # NCBI annotation: gp42 # Family: family:all:1271 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862881;genbank:gi:32469473;genbank:GeneID:1461318 Probab=93.95 E-value=0.0032 Score=34.28 Aligned_cols=124 Identities=15% Similarity=0.185 Sum_probs=69.7 Q ss_pred ccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccc Q lcl|NC_019407. 16 ANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAI 95 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~I 95 (173) -+.|+|+++..+-| |.... 
+ .....+.+|-.|+++|..++ |+.+. +.+....+++.. T Consensus 1 m~~fAtv~D~~~rw--r~Lt~---~--E~~ra~~LL~~As~~ir~~~---------------p~~~~-~l~~~~~~~~~~ 57 (131) T protein:vir:95 1 MENFATVEDLKKLW--RALKF---D--EEKRAEALLEVVSHSLRVEA---------------KKVGK-DLDGLVATDPSF 57 (131) T ss_pred CCccCCHHHHHHHh--cCCCH---H--HHHHHHHHHHHHHHHHHHhh---------------hhccC-CccccccCCccc Confidence 68999999998665 32111 0 22355778899999998764 43331 244444455566 Q ss_pred hHHHHHHHHHHHHHHHcCCCCCCccccce--eEEecCee--EEeecCCCCCccc--hHHHHHHHHhhhhhccCCcccccc Q lcl|NC_019407. 96 PQQLMEATAEMAAALMNNDWTSPQTTRGM--KEIQVDVI--ELKFDSEIQRGSM--PDIVMSILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 96 P~~V~~A~~elA~~~~~~~~~~~~~~~~v--~~~kVG~i--sveY~~~~~~~~~--~~~v~~lL~~ll~~~~G~~~~~~r 169 (173) +.-++..+|+.....+..+. ...++ .++..|+. +.+|. .+.+.. -..-..+| +. .|.+.+-+- T Consensus 58 ~~~~~~V~~~~V~Ral~~~~----~~~G~tq~S~TaG~ys~S~t~~--~p~g~lylt~~e~~~L---Gl--~~~r~~~i~ 126 (131) T protein:vir:95 58 TMVVKSVTVDVVARTLMTST----DQEPMTQVAESALGYSFSGSYL--VPGGGLFIKDSELKRL---GL--KKQRYGVID 126 (131) T ss_pred hHHHHHHHHHHHHHHhcCCC----CCCCceeeeeecccceeeeeee--cCCCCceeChHHHHHh---CC--CCCceeEEe Confidence 78899999999988775431 11233 46888988 45554 334432 23333334 21 233333222 Q ss_pred eecC Q lcl|NC_019407. 170 IIRH 173 (173) Q Consensus 170 v~R~ 173 (173) +-=. T Consensus 127 ~~~~ 130 (131) T protein:vir:95 127 IYGT 130 (131) T ss_pred eccC Confidence 2211 No 11 >protein:vir:4788 Length: 130 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150168;swissprot:trembl:q94m43;genbank:gi:15088779;uniprot:Q94M43;genbank:GeneID:955972 Probab=93.01 E-value=0.0038 Score=33.87 Aligned_cols=124 Identities=14% Similarity=0.105 Sum_probs=71.7 Q ss_pred cccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhc-cccccCCccccccCCcCCCcccCCeeeccccc Q lcl|NC_019407. 
17 NSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIA-WAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAI 95 (173) Q Consensus 17 nSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~-~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~I 95 (173) =.|.|.+|.+.| +. . ++++-+++|.+|++-||...+ |.=...+ ..=+...+ T Consensus 1 M~YlT~eey~el----~~--~-----~~~~F~kl~k~A~~~ID~~t~~~y~~~~~-----------------~~~~~~~r 52 (130) T protein:vir:47 1 MTYLTQEEFDEL----DF--D-----EVTDFEKLAKRAKIAIDLYTNGIYQKDID-----------------FEKEIAYR 52 (130) T ss_pred CCCCchhhHhhc----CC--C-----ChhhHHHHHHHHHHHHHHHhcccccccCC-----------------ccCcchHH Confidence 679999998866 21 1 345699999999999997543 2211111 01112334 Q ss_pred hHHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCc--cchHHHH---HHHHhhhhhccCCcccccc Q lcl|NC_019407. 96 PQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRG--SMPDIVM---SILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 96 P~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~--~~~~~v~---~lL~~ll~~~~G~~~~~~r 169 (173) =.+||.|.|.-...+-..+.......+.+++.+||-.+++|....... ..+.... .+|.+.+-. +=+|..+=| T Consensus 53 ~~~vK~A~a~QieY~~~~G~~s~~~~~~~~S~svGrtSis~~~~~~~~~~~~~~vs~da~~~L~~tGL~-Ly~GV~yd~ 130 (130) T protein:vir:47 53 KSAVKLAMAFQIAYLDASGIMSADDKQLANSVSIGRTSISYSTSQSTLAGQRFNLSMDAENALRQAGFS-LVVGVAYDR 130 (130) T ss_pred HHHHHHHHHHHHHHHHHhccccchhccCcceeeecceeeecCcCccccccCCccccHHHHHHHHhcccc-cccCCCccC Confidence 467788877766555544444444567799999999999997644332 2233222 244443220 003444445 No 12 >protein:vir:5256 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852761;genbank:gi:31544036;uniprot:Q7Y5T9;genbank:GeneID:2753553 Probab=91.79 E-value=0.0094 Score=31.69 Aligned_cols=106 Identities=12% Similarity=0.139 Sum_probs=60.9 Q ss_pred cccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccchHH Q lcl|NC_019407. 
19 YCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQQ 98 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~IP~~ 98 (173) -.|++++.+-+ +..+..+|+..+..|-.|..+|+. -+| | .. T Consensus 1 m~t~~~Fr~~~-------PeF~~~pd~~i~~~l~~A~~~l~~-~~~-g------------------------------~~ 41 (119) T protein:vir:52 1 MPLTEDFLLRY-------TEFGKTDAKRIGLFLSDAQAEVSK-VQW-G------------------------------KL 41 (119) T ss_pred CCcHHHHHHhh-------hhccCCCHHHHHHHHHHHHHhhCC-cCC-c------------------------------hH Confidence 56676665443 445668999999999999999985 344 1 11 Q ss_pred HHHHHHHHHHHHHc--CCCCC--CccccceeEEecCeeEEeecCCCCCccc--------h-HHHHHHHHhhhhhccCCcc Q lcl|NC_019407. 99 LMEATAEMAAALMN--NDWTS--PQTTRGMKEIQVDVIELKFDSEIQRGSM--------P-DIVMSILEGLGVVKTGTRP 165 (173) Q Consensus 99 V~~A~~elA~~~~~--~~~~~--~~~~~~v~~~kVG~isveY~~~~~~~~~--------~-~~v~~lL~~ll~~~~G~~~ 165 (173) -.++.+.++.+++. +.... ....+.++++++|+|+|+|+........ + .-.-+|++.++ .+ T Consensus 42 ~~~~~~L~~AH~l~l~~~~~~~~g~~~g~v~S~s~G~vSvS~~~~~~~~~~~~w~~~T~YG~~y~~L~r~~g------~G 115 (119) T protein:vir:52 42 YDRGVMALTAHLLKLSADAEISGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIG------VG 115 (119) T ss_pred HHHHHHHHHHHHHHhhhhhhccccccccceeeeeecceeeeeeccccCCcchhhhhcCHHHHHHHHHHHHhc------CC Confidence 23455666666553 11111 2234568999999999999755432211 1 12233444442 22 Q ss_pred cccc Q lcl|NC_019407. 166 AFKK 169 (173) Q Consensus 166 ~~~r 169 (173) +++- T Consensus 116 g~Va 119 (119) T protein:vir:52 116 VMVA 119 (119) T ss_pred CcCC Confidence 2222 No 13 >protein:vir:2505 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:28222 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569746;genbank:gi:18496896;genbank:GeneID:932265 Probab=91.56 E-value=0.0031 Score=34.37 Aligned_cols=122 Identities=11% Similarity=0.074 Sum_probs=75.9 Q ss_pred CCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeecc Q lcl|NC_019407. 
13 DPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPS 92 (173) Q Consensus 13 ~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~ 92 (173) --.-.+++|+++..+-+..- . +..++.+...+|-.|+|-|.+.+ |.| ++| T Consensus 1 ~~~~~alAtvdDv~~~lrr~---L---t~dE~~~a~~Ll~eAsdlI~g~l-~~~----------------------~vp- 50 (128) T protein:vir:25 1 MTECKALATSQDVKRALRRD---L---TEAEQTDLSELLAEATDLVVGYL-HPY----------------------PVP- 50 (128) T ss_pred CccchhccCHHHHHHHhcCC---C---CHHHHHHHHHHHhcchheeeeec-CCC----------------------CCC- Confidence 11347899999988765321 1 11133444556788999998742 222 122 Q ss_pred ccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccch--HHHHHHHHhhhhhccCCcccccce Q lcl|NC_019407. 93 DAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMP--DIVMSILEGLGVVKTGTRPAFKKI 170 (173) Q Consensus 93 ~~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~--~~v~~lL~~ll~~~~G~~~~~~rv 170 (173) |.+|.-|+.-+|..+..+++-+..... .-++.+-|+.+++|..+++++..| ..-+.+|+|+- .+.|+-= T Consensus 51 ~~~p~~v~rVvA~ivarAltr~~~~~p---e~~S~TAgpfs~~ft~~~~~~g~yLTaa~k~~Lrp~R------~~~~sV~ 121 (128) T protein:vir:25 51 TPTPGPIKRVVASMVAAVLTRPTQILP---ETQSLTADGFGVTFTPGGNSPGPYLSAALKQRLRPYR------TGMVAVE 121 (128) T ss_pred CCCCchHHHHHHHHHHHHhhCCCccCC---CceeeecccccccccCCCCCCCceEcHHHHhhccccc------ceeeEee Confidence 568888999999999888775543333 234556799998887777777654 66778899983 2222222 Q ss_pred ecC Q lcl|NC_019407. 171 IRH 173 (173) Q Consensus 171 ~R~ 173 (173) .=| T Consensus 122 l~s 124 (128) T protein:vir:25 122 MGS 124 (128) T ss_pred ccc Confidence 222 No 14 >protein:vir:9821 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795583;genbank:gi:28876338;genbank:GeneID:1257921 Probab=91.38 E-value=0.0053 Score=33.05 Aligned_cols=126 Identities=13% Similarity=0.123 Sum_probs=69.9 Q ss_pred CeeeEEeeCCCCCCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcC Q lcl|NC_019407. 
1 MAFTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRA 80 (173) Q Consensus 1 ~~M~liVe~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~ 80 (173) |-|+. =+|.|.+|.+.+ ++ . +.++-+++|.+|++-||..+++. T Consensus 1 ~~~~~-----------M~YlT~eey~~l----~~--~-----~~~dF~kllk~As~~ID~~t~~~--------------- 43 (138) T protein:vir:98 1 MEVVI-----------IAFLTQKEFEDL----GF--D-----DVEDFEKMEKRASHAVNLYCRNR--------------- 43 (138) T ss_pred Ccccc-----------ccccchHHHhcc----CC--C-----ChhhHHHHHHHHHHHhhhhhccc--------------- Confidence 55543 479999987654 22 1 33459999999999999854332 Q ss_pred CCcccCCeeeccc--cchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCc-------cchH---H Q lcl|NC_019407. 81 GVYDIDGFLIPSD--AIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRG-------SMPD---I 148 (173) Q Consensus 81 gv~~~dg~~~~~~--~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~-------~~~~---~ 148 (173) .++.-+.++ .+=.+||.|.|.-...+-..+.......+..++.+||-.+++|+.....+ .+|. - T Consensus 44 ----y~~~d~e~d~~~r~~~vKkA~a~QIeY~~~~G~ts~~d~~~~~s~svGrTSiS~~~~~~~~s~~~~~~~~~~~s~~ 119 (138) T protein:vir:98 44 ----YDYKDLKKEIALVQKAVKRAIAYQIAYLNDSGVMTAEDKQSFAGISLGRTSISYTVGHGQGSQQKTLADRFNLCLD 119 (138) T ss_pred ----cccccccchhHHHHHHHHHHHHHHHHHHHHcCCcchhhccCcCceEeeeeEeecccccccccccccccccccccHH Confidence 111222222 12345666666555444443444444466789999999999985443222 1222 2 Q ss_pred HHHHHHhhhhhccCCcccccc Q lcl|NC_019407. 
149 VMSILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 149 v~~lL~~ll~~~~G~~~~~~r 169 (173) +..+|.+.+- +=+|..+=| T Consensus 120 A~~~L~~tGL--LY~GV~yd~ 138 (138) T protein:vir:98 120 AENELLVVGL--GYTGISYDR 138 (138) T ss_pred HHHHHhhcCc--ccccCcccC Confidence 2235655432 114445555 No 15 >protein:vir:107756 Length: 147 # NCBI annotation: gp21 # Family: family:all:664 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024869;genbank:gi:48697511;genbank:GeneID:2948377 Probab=91.31 E-value=0.0087 Score=31.88 Aligned_cols=126 Identities=14% Similarity=0.106 Sum_probs=63.1 Q ss_pred eeEEeeCCCCCCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCC Q lcl|NC_019407. 3 FTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGV 82 (173) Q Consensus 3 M~liVe~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv 82 (173) |+.+. +++.+.+.+ -........+|+..+..|..|..+|+.. +| +. T Consensus 1 m~v~f-------------d~~~Fr~~f----PeFad~~~~pd~~i~~~l~~A~~~l~~~-~~-------------~~--- 46 (147) T protein:vir:10 1 MDHTL-------------DITKFRALF----PEFNNDVKYPDALLEQWYAVAGEYLGLT-DY-------------AC--- 46 (147) T ss_pred Cceec-------------CHHHHHHhc----ccccCCccCCHHHHHHHHHHHHHhhccc-cC-------------Cc--- Confidence 44433 334444332 2222223568999999999999999963 32 11 Q ss_pred cccCCeeeccccchHHHHHHHHHHHHHHHcCC--CCC-CccccceeEEecCeeEEeecCCCCCccc--------h-HHHH Q lcl|NC_019407. 83 YDIDGFLIPSDAIPQQLMEATAEMAAALMNND--WTS-PQTTRGMKEIQVDVIELKFDSEIQRGSM--------P-DIVM 150 (173) Q Consensus 83 ~~~dg~~~~~~~IP~~V~~A~~elA~~~~~~~--~~~-~~~~~~v~~~kVG~isveY~~~~~~~~~--------~-~~v~ 150 (173) +.+ ...-.++.+.++.+++.=. ... ....+.++++++|+|||+|+........ 
+ .-.- T Consensus 47 -~~~---------g~~~~~~l~Ll~AHll~l~~~~~~g~g~~G~v~Sas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~y~ 116 (147) T protein:vir:10 47 -GLN---------GNTLDLALMQLTAHLMKSATILSSNKGAPMVMTSATIDKVSISTLAPPIKNGWQYWLSTTPYGQMLW 116 (147) T ss_pred -ccC---------hhhHHHHHHHHHHHHHHHHHhhccCCCcccceeeeeecceeeeeecCCCCCcchhhhhcCHHHHHHH Confidence 011 1234466666665544321 111 1234558999999999999855332221 1 1223 Q ss_pred HHHHhhhhh--ccCCcccccceecC Q lcl|NC_019407. 151 SILEGLGVV--KTGTRPAFKKIIRH 173 (173) Q Consensus 151 ~lL~~ll~~--~~G~~~~~~rv~R~ 173 (173) +|++.+... -.|+.| --..+|. T Consensus 117 ~l~~~~~~Gg~vvgG~p-~r~a~r~ 140 (147) T protein:vir:10 117 ALLSMRSSGGFVYGGSP-ELSGYRR 140 (147) T ss_pred HHHHhhCccceecCCCC-ccccccc Confidence 355555320 112222 1233444 No 16 >protein:vir:79701 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285884;genbank:gi:148750841;genbank:GeneID:5220403 Probab=91.19 E-value=0.013 Score=30.89 Aligned_cols=132 Identities=11% Similarity=0.155 Sum_probs=71.4 Q ss_pred ccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhcc-ccccCCccccccCCcCCCcccCCeeecccc Q lcl|NC_019407. 16 ANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAW-AGEKVDEDSGLRWPRAGVYDIDGFLIPSDA 94 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~-~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~ 94 (173) --+|.|-+|.+.+ +. ...++++-+++|.+|.+-||...++ .+. +=+.. .+.|+..=.+.. T Consensus 1 ~~pYLTy~ef~~l----g~-----~~~~~d~F~kllk~A~~~ID~~T~y~~~~---------y~~~~-i~~d~~~d~~~~ 61 (144) T protein:vir:79 1 MKPYLTTSDFEKL----GY-----ELKKPDNFGKLLKSATVLINQICSYYDPA---------FAYHD-LEADSQADPDSY 61 (144) T ss_pred CCcccchhhhhhh----CC-----CCcchhhhhhHHHHHHHHhhhhhhhhccc---------ccccc-ccccccccchhh Confidence 5789998887644 21 1235677999999999999985432 111 00000 011111112234 Q ss_pred ch---HHHHHHHHHHHHHHHcCCCCCC-c-cccceeEEecCeeEEeecCCCCCcc---ch---HHHHHHHHhhhhhccCC Q lcl|NC_019407. 
95 IP---QQLMEATAEMAAALMNNDWTSP-Q-TTRGMKEIQVDVIELKFDSEIQRGS---MP---DIVMSILEGLGVVKTGT 163 (173) Q Consensus 95 IP---~~V~~A~~elA~~~~~~~~~~~-~-~~~~v~~~kVG~isveY~~~~~~~~---~~---~~v~~lL~~ll~~~~G~ 163 (173) || .+||.|.|.-...+-..+.... . ..+.+++.+||-.+++|...+..+. .+ .-+-.+|.+.+--- + T Consensus 62 ~~~r~~~vKkA~a~QIeY~~~~G~~sa~e~~~~~~~S~svGrtsvs~~~~~~~s~t~~~~~v~~~a~~yL~~tGLLY--r 139 (144) T protein:vir:79 62 LFRQAMAFKKAVALEMLFLEDSGYSSAYDVAQGALNSFTVGHTSMSLNPSAGQNLTVGSTGVVKSAYDLLGRYGLLF--S 139 (144) T ss_pred hhHHHHHHHHHHHHHHHHHHHcCCcchhhhhcCccceeEecceEEeecCCCccccccccccccHHHHHHHhhcCccc--c Confidence 56 4467777766654444333332 2 3566999999999999965543321 22 44455555543211 2 Q ss_pred ccccc Q lcl|NC_019407. 164 RPAFK 168 (173) Q Consensus 164 ~~~~~ 168 (173) |.+-. T Consensus 140 GV~s~ 144 (144) T protein:vir:79 140 GVASL 144 (144) T ss_pred ccccC Confidence 22222 No 17 >protein:vir:9761 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795523;genbank:gi:28876281;genbank:GeneID:1257822 Probab=89.50 E-value=0.023 Score=29.61 Aligned_cols=126 Identities=12% Similarity=0.047 Sum_probs=61.9 Q ss_pred ccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccc Q lcl|NC_019407. 16 ANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAI 95 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~I 95 (173) -+.|+|+++..+-| |-... + ..+..+.+|-.|+++|..++ |+.|. 
+.+-.....+.- T Consensus 1 m~~fATv~Dv~~rw--r~Lt~---d--E~~ra~~LL~dAS~~iR~~~---------------p~~g~-~~~~~~~~~~~~ 57 (140) T protein:vir:97 1 MGNFATTDDVILLW--RPLSV---D--ELKRANALLKVVSDTLRMEA---------------DKVGK-DLDKTMVDKPYF 57 (140) T ss_pred CCcCCCHHHHHHHh--cCCCH---h--HHHHHHHHHHHHHHHHHHhh---------------hhccC-CcchhcccCccc Confidence 68999999998765 32111 0 12455778899999998753 43331 111111111223 Q ss_pred hHHHHHHHHHHHHHHHcCCCCCCccccc--eeEEecCee--EEeecCCCCCccchHHHHHHHHhhhhhccCCccccc--- Q lcl|NC_019407. 96 PQQLMEATAEMAAALMNNDWTSPQTTRG--MKEIQVDVI--ELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFK--- 168 (173) Q Consensus 96 P~~V~~A~~elA~~~~~~~~~~~~~~~~--v~~~kVG~i--sveY~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~--- 168 (173) +.-++..+|....+.+.- +....+ -.++..|+. +.+|..+...-..-+.-..+| +. .|.+.+-+ T Consensus 58 ~~~~k~V~~~mV~Ral~~----~~d~~G~tq~S~TaG~ys~S~T~~np~G~lylt~~e~~~L---Gl--~~~r~~~i~~~ 128 (140) T protein:vir:97 58 VNVIKSVTVDIVARTLMT----STQGEPMSQESQSALGYTWSGTYLVPGGGLFIKDNELKRL---GL--KKQRYGGIELY 128 (140) T ss_pred hhHHHHHHHHHHHHHhcC----CCCCCcceeeeeeccchhheeeeecCCCCceeChHHHHHh---CC--CCCceeeeccc Confidence 444567777777664421 112123 346788987 555653322222223333333 22 23333311 Q ss_pred -ceecC Q lcl|NC_019407. 169 -KIIRH 173 (173) Q Consensus 169 -rv~R~ 173 (173) ..-|. T Consensus 129 g~~~~~ 134 (140) T protein:vir:97 129 GEIKRD 134 (140) T ss_pred CccccC Confidence 22222 No 18 >protein:vir:1887 Length: 108 # NCBI annotation: gp6 # Family: family:all:6764 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037667;genbank:gi:9634125;genbank:GeneID:1262472 Probab=89.28 E-value=0.013 Score=30.85 Aligned_cols=105 Identities=12% Similarity=0.060 Sum_probs=61.6 Q ss_pred CeeeEEeeCCCCCCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcC Q lcl|NC_019407. 1 MAFTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRA 80 (173) Q Consensus 1 ~~M~liVe~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~ 80 (173) |+|.. =+++|+++++.|. 
|.-+ .-+|+..+..+..|.+|+.. |.|++... T Consensus 1 ~~~~~-----------M~~vtLee~K~hL--Rid~-----dddD~lI~~~i~AA~~~v~~---~~~~~~~~--------- 50 (108) T protein:vir:18 1 MAIDV-----------LDVISLSLFKQQI--EFEE-----DDRDELITLYAQAAFDYCMR---WCDEPAWK--------- 50 (108) T ss_pred CCCCc-----------ccccCHHHHHHHc--CCCC-----CcchHHHHHHHHHHHHHHHH---HhCCcccc--------- Confidence 55432 5799999999994 4322 23778888888888888874 44543211 Q ss_pred CCcccCCeeeccccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhc Q lcl|NC_019407. 81 GVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVK 160 (173) Q Consensus 81 gv~~~dg~~~~~~~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~ 160 (173) ....+|..|+.|+..|+-.+.++- +.++.. ....++.+..||.|+-. - T Consensus 51 ----------~~~~~p~~ik~AiLllv~~~YenR------------E~~~~~---------~~~~~~~~~~LL~pYR~-~ 98 (108) T protein:vir:18 51 ----------VAADIPAAVKGAVLLVFADMFEHR------------TAQSEV---------QLYENAAAERMMFIHRN-W 98 (108) T ss_pred ----------cccccchHHHHHHHHHHHHHHhcc------------cccccc---------hhhhhHHHHHHHHHHHh-c Confidence 124589999999999987665432 111110 11223578999988722 1 Q ss_pred cCCcc---cc Q lcl|NC_019407. 161 TGTRP---AF 167 (173) Q Consensus 161 ~G~~~---~~ 167 (173) -|..- |. T Consensus 99 ~g~~~~~~~~ 108 (108) T protein:vir:18 99 RGKAESEEGS 108 (108) T ss_pred CCCCCcccCC Confidence 11000 11 No 19 >protein:vir:192 Length: 108 # NCBI annotation: Gp6 # Family: family:all:6764 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037702;genbank:gi:9634167;genbank:GeneID:1262532 Probab=89.28 E-value=0.013 Score=30.85 Aligned_cols=105 Identities=12% Similarity=0.060 Sum_probs=61.6 Q ss_pred CeeeEEeeCCCCCCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcC Q lcl|NC_019407. 1 MAFTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRA 80 (173) Q Consensus 1 ~~M~liVe~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~ 80 (173) |+|.. =+++|+++++.|. 
|.-+ .-+|+..+..+..|.+|+.. |.|++... T Consensus 1 ~~~~~-----------M~~vtLee~K~hL--Rid~-----dddD~lI~~~i~AA~~~v~~---~~~~~~~~--------- 50 (108) T protein:vir:19 1 MAIDV-----------LDVISLSLFKQQI--EFEE-----DDRDELITLYAQAAFDYCMR---WCDEPAWK--------- 50 (108) T ss_pred CCCCc-----------ccccCHHHHHHHc--CCCC-----CcchHHHHHHHHHHHHHHHH---HhCCcccc--------- Confidence 55432 5799999999994 4322 23778888888888888874 44543211 Q ss_pred CCcccCCeeeccccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhc Q lcl|NC_019407. 81 GVYDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVK 160 (173) Q Consensus 81 gv~~~dg~~~~~~~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~ 160 (173) ....+|..|+.|+..|+-.+.++- +.++.. ....++.+..||.|+-. - T Consensus 51 ----------~~~~~p~~ik~AiLllv~~~YenR------------E~~~~~---------~~~~~~~~~~LL~pYR~-~ 98 (108) T protein:vir:19 51 ----------VAADIPAAVKGAVLLVFADMFEHR------------TAQSEV---------QLYENAAAERMMFIHRN-W 98 (108) T ss_pred ----------cccccchHHHHHHHHHHHHHHhcc------------cccccc---------hhhhhHHHHHHHHHHHh-c Confidence 124589999999999987665432 111110 11223578999988722 1 Q ss_pred cCCcc---cc Q lcl|NC_019407. 161 TGTRP---AF 167 (173) Q Consensus 161 ~G~~~---~~ 167 (173) -|..- |. T Consensus 99 ~g~~~~~~~~ 108 (108) T protein:vir:19 99 RGKAESEEGS 108 (108) T ss_pred CCCCCcccCC Confidence 11000 11 No 20 >protein:vir:94761 Length: 132 # NCBI annotation: unknown # Family: family:all:1271 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996708;genbank:gi:45597423;genbank:GeneID:2769029 Probab=89.06 E-value=0.023 Score=29.52 Aligned_cols=125 Identities=11% Similarity=-0.005 Sum_probs=62.1 Q ss_pred ccccccHHHHHHHHHhccCCcccCCCCCH---HHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeecc Q lcl|NC_019407. 
16 ANSYCDVQFADDYIYANVYANTAWDALDQ---DEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPS 92 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~---~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~ 92 (173) -+.|+|+++..+-| | .+++ +..+.+|-.|+++|..++ |+.+.-......... T Consensus 1 m~~fAtv~Dl~~r~--r--------~L~~dE~~ra~~LL~dAs~~iR~~~---------------~~~~~~~~~~~~~~~ 55 (132) T protein:vir:94 1 MNPFATVDDLTMLW--R--------PLKGDEKERAEKLLEIVSDTLREEA---------------DKVGRDLDVMISEKP 55 (132) T ss_pred CCCcCCHHHHHHHh--c--------cCChhHHHHHHHHHHHHHHHHHHHH---------------hhhccccccccCCCC Confidence 68999999998654 2 2233 445667899999998653 333221111111111 Q ss_pred ccchHHHHHHHHHHHHHHHcCCCCCCcccc-ceeEEecCee--EEeecCCCCCccchHHHHHHHHhhhhhccCCcccccc Q lcl|NC_019407. 93 DAIPQQLMEATAEMAAALMNNDWTSPQTTR-GMKEIQVDVI--ELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 93 ~~IP~~V~~A~~elA~~~~~~~~~~~~~~~-~v~~~kVG~i--sveY~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~r 169 (173) +..|.-++.-+|.....++..+. +..+ .-.++..|+. +.+|..+...-.....-..+| +. .|++.+-+- T Consensus 56 d~~~~~~k~V~~~~V~Ral~~~~---~~~g~tq~S~TaG~ys~S~T~~np~G~lylt~~e~~~L---Gl--~~~r~~~i~ 127 (132) T protein:vir:94 56 SYFSSVVKSVTVDIVARTLMTST---DQEPMTQTTESALGYSVSGSYLVPGGGLFIKNSELSRL---GL--KKQRFGVID 127 (132) T ss_pred ccchhHHHHHHHHHHHHHhcCCC---CCCCceeeeeecccceeeeeeecCCCCceeChHHHHhh---CC--CCCceEEEe Confidence 22344466777888887775432 1122 2346788987 555643322222223333333 22 123332111 Q ss_pred eecC Q lcl|NC_019407. 170 IIRH 173 (173) Q Consensus 170 v~R~ 173 (173) +-=. 
T Consensus 128 ~~~~ 131 (132) T protein:vir:94 128 FYGN 131 (132) T ss_pred ecCC Confidence 1111 No 21 >protein:vir:99002 Length: 158 # NCBI annotation: gp31 # Family: family:all:32652 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655896;genbank:gi:109521468;genbank:GeneID:4157967 Probab=88.44 E-value=0.032 Score=28.76 Aligned_cols=123 Identities=11% Similarity=0.035 Sum_probs=73.2 Q ss_pred ccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccc Q lcl|NC_019407. 16 ANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAI 95 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~I 95 (173) --+|+||+|++.+++ .+.++ |..+ ...+|=.+||-.=.| +-..|...||-. +.+ T Consensus 1 ~~alasvee~~trl~-----~~lp~---~~~r--~~a~a~~vLd~~S~~----ar~~~gr~W~~~------------~da 54 (158) T protein:vir:99 1 MAALVSVEEFTTFLR-----VPLPE---EGSE--KYTQMEFLLTLASDW----ARELSCKPWLLP------------ADA 54 (158) T ss_pred CcceeeHhhhhhhhc-----ccCCh---hhhH--HHHHHHHHHHHHHHH----HHHhcCccCCCC------------Ccc Confidence 468999999998762 22221 2222 223444455531111 112356678821 347 Q ss_pred hHHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccch-HHHHHHHHhhhhhccCCcccccceecC Q lcl|NC_019407. 96 PQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMP-DIVMSILEGLGVVKTGTRPAFKKIIRH 173 (173) Q Consensus 96 P~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~-~~v~~lL~~ll~~~~G~~~~~~rv~R~ 173 (173) |.-|+.-|...|-..++++. ++..+++|+-++.|........-| +.=..+|+-|...+ +|..-.-+-|. 
T Consensus 55 P~~vr~ivL~aa~R~~~NP~-------g~~~~~~G~~~~~~~~~g~~~~ffT~~E~~~L~r~~~s~--GG~~~~~ttR~ 124 (158) T protein:vir:99 55 PVTARGIILAASRREWNNPK-------RVSYVVKGPQSATFMQSAYPPGFFTDAEEAKLRSYGRST--GNWGVIETYRD 124 (158) T ss_pred hhHHHHHHHHHHHHHHhcCC-------ceEEeeecchhhhcccccCCCcccCHHHHHHHHHhhccc--CceeEEEeecC Confidence 88888888888888877764 788899999999997766443333 44455677776433 23333344555 No 22 >protein:vir:107702 Length: 136 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003900;genbank:gi:45686316;genbank:GeneID:2773042 Probab=87.57 E-value=0.037 Score=28.47 Aligned_cols=121 Identities=13% Similarity=0.057 Sum_probs=67.3 Q ss_pred CCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccc Q lcl|NC_019407. 14 PAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSD 93 (173) Q Consensus 14 ~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~ 93 (173) -+-...+++= +-|..+ .+..+..+|+..+..+--|..||... .| T Consensus 1 ~~~~~~~~~v---e~fR~l---~PeF~dvPde~i~~~~d~A~~~v~~~-~~----------------------------- 44 (136) T protein:vir:10 1 MNQETLIAVV---EQMRKL---VPALRKVPDETLYAWVEMAELFVCQK-TF----------------------------- 44 (136) T ss_pred CCchHHHHHH---HHHHHh---ccccccCCHHHHHHHHHHHHHhhcCC-CC----------------------------- Confidence 1112222332 223333 35567789999999999999998742 21 Q ss_pred cchHHHHHHHHHHHHHHHcCCC------CC-CccccceeE-EecCeeEEeecCCCCCccch----HHHHHHHHhhhhhcc Q lcl|NC_019407. 94 AIPQQLMEATAEMAAALMNNDW------TS-PQTTRGMKE-IQVDVIELKFDSEIQRGSMP----DIVMSILEGLGVVKT 161 (173) Q Consensus 94 ~IP~~V~~A~~elA~~~~~~~~------~~-~~~~~~v~~-~kVG~isveY~~~~~~~~~~----~~v~~lL~~ll~~~~ 161 (173) .+...+|...++++++.-+. .. ....++|++ ..+|+++|+|+..+.++..+ .-.=+++.-|.. .. 
T Consensus 45 --Gk~y~~al~lltAHLl~l~~~~~~~~~~~~~~s~rv~ssat~GevSVS~a~~s~~~s~~WL~~TpyGq~y~aL~k-~~ 121 (136) T protein:vir:10 45 --KDAYVKALALYALHLAFLDGALKGEDEDLESYSRRVTSFSLSGEFSQTFGEVTKNQSGDMMLSTPWGKMFEQLKA-RR 121 (136) T ss_pred --hhHHHHHHHHHHHHHHhcccccccccccccccccceehheeccceeEeeccccCchhhHhhhcCHHHHHHHHHHh-hc Confidence 13455777788887773221 11 122344554 66899999998665444321 112234555655 35 Q ss_pred CCcccccceecC Q lcl|NC_019407. 162 GTRPAFKKIIRH 173 (173) Q Consensus 162 G~~~~~~rv~R~ 173 (173) |+||+.+--++. T Consensus 122 ~gGf~l~t~~~~ 133 (136) T protein:vir:10 122 RGRFALMTGLRG 133 (136) T ss_pred ccchhhhhcccc Confidence 667776644444 No 23 >protein:vir:79640 Length: 134 # NCBI annotation: gp38 # Family: family:all:5122 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285527;genbank:gi:148734510;genbank:GeneID:5219998 Probab=86.33 E-value=0.04 Score=28.24 Aligned_cols=117 Identities=15% Similarity=0.132 Sum_probs=64.2 Q ss_pred ccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccc Q lcl|NC_019407. 16 ANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAI 95 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~I 95 (173) -|=-.+|+. |..+ .+.....+|+..+..+-.|..||+.. .| T Consensus 1 m~d~~~ve~----Fr~l---~PeF~~vpde~l~~~~~~A~~~i~~~-~~------------------------------- 41 (134) T protein:vir:79 1 MNDIEILEQ----IYKI---APAFKKVDPELIQAWIELAKDFVCEK-HF------------------------------- 41 (134) T ss_pred CchHHHHHH----HHHh---ccccccCCHHHHHHHHHHhhhhhcCC-CC------------------------------- Confidence 121122332 2233 35567789999999999999999742 11 Q ss_pred hHHHHHHHHHHHHHHHcC------CCCCCc-cccceeE-EecCeeEEeecCCCCCccch-----HHHHHHHHhhhhhccC Q lcl|NC_019407. 96 PQQLMEATAEMAAALMNN------DWTSPQ-TTRGMKE-IQVDVIELKFDSEIQRGSMP-----DIVMSILEGLGVVKTG 162 (173) Q Consensus 96 P~~V~~A~~elA~~~~~~------~~~~~~-~~~~v~~-~kVG~isveY~~~~~~~~~~-----~~v~~lL~~ll~~~~G 162 (173) .+....|...++++++.- +..... 
..++|.+ ...|+++|+|+..+..+..+ |+= +++.-|.. ..+ T Consensus 42 g~~~~~al~lltAHLl~l~~~~~~~g~~~~~~~grv~ssst~G~vSvS~a~ps~~~~~~Wl~~TpYG-q~y~~L~k-~~~ 119 (134) T protein:vir:79 42 KDKYFRAVALYTLHLMTLDGAMKQESESVESYSHRIASFSLTGEFSQTFSKVSDDTSGNTLRQTPWG-KMYEVLNK-KKG 119 (134) T ss_pred ChHHHHHHHHHHHHHHhhcccccccccccccccchhhhhhhhcceeeeccCcccchhHHHHhcCHHH-HHHHHHHH-hhc Confidence 134557777777777742 222211 2234554 55899999998755443211 221 34444444 345 Q ss_pred CcccccceecC Q lcl|NC_019407. 163 TRPAFKKIIRH 173 (173) Q Consensus 163 ~~~~~~rv~R~ 173 (173) +|+|..--.|+ T Consensus 120 GGf~~~t~~~~ 130 (134) T protein:vir:79 120 GGFGLTTAFHR 130 (134) T ss_pred cchHhhhhccc Confidence 67765544444 No 24 >protein:vir:1435 Length: 188 # NCBI annotation: hypothetical protein # Family: family:all:501 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536364;genbank:gi:17975169;genbank:GeneID:929149 Probab=85.83 E-value=0.05 Score=27.72 Aligned_cols=127 Identities=19% Similarity=0.098 Sum_probs=52.6 Q ss_pred CeeeEEeeCCCCCCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHH-HHHHHHHhhhhhccccccCCc----cccc Q lcl|NC_019407. 1 MAFTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERF-LVRASKYLDRTIAWAGEKVDE----DSGL 75 (173) Q Consensus 1 ~~M~liVe~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~a-L~~As~~id~~~~~~G~r~~~----~Q~l 75 (173) |++-|+. + |.+-.=+|++|++++. |.- ...+|+..+.. +..|.+++++.+ |+.--. ..-- T Consensus 1 m~~~~~~-~----ppa~epVtLae~K~~l--rid-----~~~eD~~l~~~li~aA~~~~E~~t---gr~l~~qt~~~~~~ 65 (188) T protein:vir:14 1 MAAVLVE-Y----LDDAEPLTFEEVAFQC--RID-----DDDERDFVERVVIPGARQAAESKA---GAAIRKARYVEHLS 65 (188) T ss_pred CCceeee-c----CCCCCccCHHHHHHHc--CCC-----CchhHHHHHHHHHHHHHHHHHHHh---CCeeeeeeEEEEec Confidence 5544433 2 2345568999999994 431 11245555554 457788999632 321110 0111 Q ss_pred cCCcCCCc---------------ccCCee----------------------------------------eccccchHHHH Q lcl|NC_019407. 
76 RWPRAGVY---------------DIDGFL----------------------------------------IPSDAIPQQLM 100 (173) Q Consensus 76 awPR~gv~---------------~~dg~~----------------------------------------~~~~~IP~~V~ 100 (173) .||+.+.. +.+|.. ++ +.||+.|| T Consensus 66 ~~~~~~~~Lp~~Pv~sV~sV~~~d~~g~~~~l~~~~y~l~~~~~~~~l~~~~~~~~p~~~~V~V~~~AG~~-~~vP~~ik 144 (188) T protein:vir:14 66 GFPPAEVPLSVGQVISVDSIEIRDASGATTTLDAGAFELVQLGRETLLVPAGQARWPYARAVTIKYQAGID-LARYPSVR 144 (188) T ss_pred CcCCCceEecccCcceeeEEEEEcCCCceEeecccceEEeecCCCcEEEEecCCCCCCCceEEEEEEecCc-cCchHHHH Confidence 12221100 001110 11 23555555 Q ss_pred HHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccc-hHHHHHHHHhhhhhccCCcccc Q lcl|NC_019407. 101 EATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSM-PDIVMSILEGLGVVKTGTRPAF 167 (173) Q Consensus 101 ~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~-~~~v~~lL~~ll~~~~G~~~~~ 167 (173) +|...++..+.++-.. +. .....+.. +.++++||+||-.- +|| T Consensus 145 ~Aill~va~~Y~~Re~-----------------~~--~g~~~~~lP~~~v~~Ll~pyRvP-----~~~ 188 (188) T protein:vir:14 145 SWMLLAAAWAYDHREL-----------------YS--DGQPMGEMPGGYSDVLLNPITVP-----PRF 188 (188) T ss_pred HHHHHHHHHHHhcccc-----------------cc--cccccccccHHHHHHHhhccCCC-----CCC Confidence 5555555444332100 00 00001112 23466677666432 122 No 25 >protein:vir:79074 Length: 150 # NCBI annotation: gp10, conserved hypothetical protein # Family: family:all:348 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111210;genbank:gi:134288797;genbank:GeneID:4960748 Probab=85.68 E-value=0.027 Score=29.16 Aligned_cols=118 Identities=14% Similarity=0.192 Sum_probs=61.7 Q ss_pred cccccHHHHHHHHHhccC---------C--cccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCccc Q lcl|NC_019407. 17 NSYCDVQFADDYIYANVY---------A--NTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDI 85 (173) Q Consensus 17 nSY~tv~~aday~~~r~~---------~--~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~ 85 (173) =+|+|+++..+.|..+-. + .+.....+++..+++|..|+..||+.+. 
.| T Consensus 1 M~Y~T~~Dl~~r~ge~~l~~Ltd~~~~~~~~~~~~~~d~~~i~~Al~dA~~~IDgyL~---~R----------------- 60 (150) T protein:vir:79 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESAVRQAEEIVDAHLR---GR----------------- 60 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccccccccccccCHHHHHHHHHHHHHHHHHHHh---hh----------------- Confidence 689999999887653211 1 1223456788899999999999998432 21 Q ss_pred CCeeeccccchHHHHHHHHHHHHHHHcCC----C-CCCcccc----ce---eEEecCeeEEeecC----CCCCccch--- Q lcl|NC_019407. 86 DGFLIPSDAIPQQLMEATAEMAAALMNND----W-TSPQTTR----GM---KEIQVDVIELKFDS----EIQRGSMP--- 146 (173) Q Consensus 86 dg~~~~~~~IP~~V~~A~~elA~~~~~~~----~-~~~~~~~----~v---~~~kVG~isveY~~----~~~~~~~~--- 146 (173) +.+|-..+|..|+..||-+|.+.+-.. . .+..... .+ +.+.-|.++.--.. +.+++..+ T Consensus 61 --Y~lPl~~vP~~L~~~a~dIA~Y~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~v~~~ 138 (150) T protein:vir:79 61 --YNLPLSPVPTVIKDVTVNLARHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPSGPATPEPGEMKVRAR 138 (150) T ss_pred --ccCCcccccHHHHHHHHHHHHHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCccCCCCCCceeeecC Confidence 234456799999999999996555321 1 1111111 11 12222554442211 11111000 Q ss_pred --HHHHHHHHhh Q lcl|NC_019407. 147 --DIVMSILEGL 156 (173) Q Consensus 147 --~~v~~lL~~l 156 (173) .+=..-|++| T Consensus 139 ~r~f~r~~l~g~ 150 (150) T protein:vir:79 139 RRQFDADLLERF 150 (150) T ss_pred CCccChhhccCC Confidence 0001122222 No 26 >protein:vir:100245 Length: 113 # NCBI annotation: gp74 # Family: family:all:363 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355410;genbank:gi:77864700;genbank:GeneID:3725967 Probab=85.23 E-value=0.048 Score=27.84 Aligned_cols=113 Identities=12% Similarity=0.093 Sum_probs=65.1 Q ss_pred cccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccch Q lcl|NC_019407. 17 NSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIP 96 (173) Q Consensus 17 nSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~IP 96 (173) =+++|+++++.+. 
|.-+ .-+|+..+..+.-|.+++.. |.|++....|.....-..+.+. ......|| T Consensus 1 M~~vtLee~K~hL--Rvd~-----d~dD~lI~~li~AA~~~ve~---~l~r~l~~~~~~~~~~~~~~~~---~~~~~~~p 67 (113) T protein:vir:10 1 MALVELKLALGFV--RANA-----GVEDDVVQMLLDAATQSAVD---YLNRQVFETEDAMTTAIEAGTA---GQNPMVVN 67 (113) T ss_pred CCCCCHHHHHHHc--CCCC-----CcchHHHHHHHHHHHHHHHH---HhCccccccccccccccccccc---cccccccC Confidence 5678999999884 4322 23677788777788888874 5666655554433222111110 11224589 Q ss_pred HHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhccCCcccc Q lcl|NC_019407. 97 QQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAF 167 (173) Q Consensus 97 ~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~ 167 (173) +.|+.|+..|.-....+- |.+.. .+....|..++.||.|+-. -+|. T Consensus 68 ~~i~~AvLllv~~~Y~nR------------e~~~~--------~~~~~lP~~v~~Ll~~yR~-----~~g~ 113 (113) T protein:vir:10 68 AAIRAAILKITAELYANR------------EDTAF--------GPITELPLNARALLRPHRI-----IPGV 113 (113) T ss_pred hHHHHHHHHHHHHHHhhh------------hhhch--------hhhhccCHHHHHHHHHhhh-----hcCC Confidence 999999999986665431 11100 1112345567899999843 1221 No 27 >protein:vir:1993 Length: 141 # NCBI annotation: Hypothetical protein # Family: family:all:348 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050640;genbank:gi:9633527;genbank:GeneID:2636296 Probab=84.69 E-value=0.0045 Score=33.43 Aligned_cols=119 Identities=18% Similarity=0.205 Sum_probs=63.3 Q ss_pred cccccHHHHHHHHHhccC---C--cccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeec Q lcl|NC_019407. 17 NSYCDVQFADDYIYANVY---A--NTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIP 91 (173) Q Consensus 17 nSY~tv~~aday~~~r~~---~--~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~ 91 (173) =+|+|.++..+.|..+-. 
+ .......+++..+++|..|+..||+.+ +.| +.+| T Consensus 1 M~Y~T~~Dl~~~~ge~~l~~Lt~d~~~~g~~d~~~i~~Al~dA~~eIdgyL---~~R-------------------Y~lP 58 (141) T protein:vir:19 1 MNYATVNDLCARYTRTRLDILTRPKTADGQPDDAVAEQALADASAFIDGYL---AAR-------------------FVLP 58 (141) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcCCCCccccCHHHHHHHHHHHHHHHHHHH---hhc-------------------ccCC Confidence 579999999887654322 1 112334678888999999999999843 221 2345 Q ss_pred cccchHHHHHHHHHHHHHHHcCCCCCCccccc----e---eEEecCeeEEeecCCCCC---c-c--ch----HHHHHHHH Q lcl|NC_019407. 92 SDAIPQQLMEATAEMAAALMNNDWTSPQTTRG----M---KEIQVDVIELKFDSEIQR---G-S--MP----DIVMSILE 154 (173) Q Consensus 92 ~~~IP~~V~~A~~elA~~~~~~~~~~~~~~~~----v---~~~kVG~isveY~~~~~~---~-~--~~----~~v~~lL~ 154 (173) -..+|.-|+..||-+|.+.+.+...+...... + +.+.-|.+++--...... + . .+ .....=++ T Consensus 59 l~~~P~~L~~~a~dIA~Y~L~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~~~~~~~~r~f~r~~~ 138 (141) T protein:vir:19 59 LTVVPSLLKRQCCVVAWFYLNESQPTEQITATYRDTVRWLEQVRDGKTDPGVESRTAASPEGEDLVQVQSDPPVFSRKQK 138 (141) T ss_pred ccccchHHHHHHHHHHHHHHhcCCCChHHHHHHHHHHHHHHHHhcCccccCCCCCCCCCCCCCceeEeecCCcccCcccc Confidence 56789999999999997666554322111111 1 222225555532111100 0 0 00 00000012 Q ss_pred hhh Q lcl|NC_019407. 155 GLG 157 (173) Q Consensus 155 ~ll 157 (173) ||+ T Consensus 139 G~~ 141 (141) T protein:vir:19 139 GFI 141 (141) T ss_pred cCC Confidence 222 No 28 >protein:vir:80320 Length: 188 # NCBI annotation: gp8, conserved hypothetical protein # Family: family:all:501 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111087;genbank:gi:134288682;genbank:GeneID:4960567 Probab=84.56 E-value=0.059 Score=27.30 Aligned_cols=128 Identities=18% Similarity=0.096 Sum_probs=50.4 Q ss_pred CeeeEEeeCCCCCCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHH-HHHHHHhhhhhccccccCCc---ccc-c Q lcl|NC_019407. 
1 MAFTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFL-VRASKYLDRTIAWAGEKVDE---DSG-L 75 (173) Q Consensus 1 ~~M~liVe~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL-~~As~~id~~~~~~G~r~~~---~Q~-l 75 (173) |.+- +|++. .+..=+|++|++++. |.- ..-+|+..+..| ..|.+++++. .|+.--. .+. - T Consensus 1 M~~~-~~~~p----pa~ePVtL~e~K~hL--Rid-----~~~eD~~l~~~lI~aA~~~~E~~---~gr~l~~qt~~~~~~ 65 (188) T protein:vir:80 1 MAAV-LVEYL----DDAEPLTFEEVAFQC--RID-----DDDERDFVERIVIPGARQAAESK---SGAAIRKARYVERLS 65 (188) T ss_pred CCce-eeccC----CCCcccCHHHHHHHc--CCC-----CchhhHHHHHHHHHHHHHHHHHH---hCCeeeeeeEEEEec Confidence 4433 33332 233448999999994 431 112455565544 5688899863 2322110 111 1 Q ss_pred cCCcCCCc---------------ccCCeeec---------------------------------------cccchHHHHH Q lcl|NC_019407. 76 RWPRAGVY---------------DIDGFLIP---------------------------------------SDAIPQQLME 101 (173) Q Consensus 76 awPR~gv~---------------~~dg~~~~---------------------------------------~~~IP~~V~~ 101 (173) .||+.++. +.+|.... .+.+|..||+ T Consensus 66 ~~~~~~i~Lp~~PV~sV~sV~~~d~~G~~~~l~~~~y~l~~~~~~~~l~~~~~~~~p~~~~V~V~~~AG~~~~vP~~ik~ 145 (188) T protein:vir:80 66 GFPLAEISLSVGQVIRVDSIEIRDASGATTTLDADAFELVQLGREALLVPEGQARWPFARAVTITYQAGVDLARYPSVRT 145 (188) T ss_pred CCCCCceEecccccceeeEEEEEcCCCcEEeecccceEEeecCCCcEEEEecCCCCCCCceEEEEEEecccccChHHHHH Confidence 23332111 11111100 0124444444 Q ss_pred HHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccc-hHHHHHHHHhhhhhccCCcccc Q lcl|NC_019407. 102 ATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSM-PDIVMSILEGLGVVKTGTRPAF 167 (173) Q Consensus 102 A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~-~~~v~~lL~~ll~~~~G~~~~~ 167 (173) |...++..+.++-.. + . .....+.. 
+.++++||+||-.- +|| T Consensus 146 aill~va~~Ye~Re~------------~---~----~g~~~~~~P~~~v~~Ll~pyRvp-----~~~ 188 (188) T protein:vir:80 146 WMLLAAAWAYDHREL------------F---S----EGQPIGEMPGGYADVLLNPITVP-----PRF 188 (188) T ss_pred HHHHHHHHHHhcccc------------c---c----cccccccccHHHHHHHhhccCCC-----CCC Confidence 444444433321100 0 0 00000111 23456666666431 122 No 29 >protein:vir:100103 Length: 120 # NCBI annotation: gp7 # Family: family:all:363 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945037;genbank:gi:38707897;genbank:GeneID:2744150 Probab=84.01 E-value=0.064 Score=27.14 Aligned_cols=120 Identities=9% Similarity=-0.036 Sum_probs=67.8 Q ss_pred CCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeecc Q lcl|NC_019407. 13 DPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPS 92 (173) Q Consensus 13 ~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~ 92 (173) .++=-+.+|+++++.+. |.-+ ..||+..+..+.-|.+|+.. |.|++...++...=+-.... ..-... T Consensus 1 ~~~~m~~vtL~e~K~hL--Rvd~-----d~DD~lI~~~i~AA~~~v~~---~~~r~l~~~~~~~~~~~~~~---~~~~~~ 67 (120) T protein:vir:10 1 MADQTPIVSLEVALAHL--REDA-----GVADDLIKIYIGAATQSASD---YVDRKLYANDAEMQAAVADA---TAGADP 67 (120) T ss_pred CCCCCCccCHHHHHHHc--CCCC-----CcchHHHHHHHHHHHHHHHH---HhCCcccccccccchhhhcc---cccccc Confidence 45667899999999984 4322 23777888888888888874 56666543322210000000 000122 Q ss_pred ccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhccCCcccc Q lcl|NC_019407. 93 DAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAF 167 (173) Q Consensus 93 ~~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~ 167 (173) ..||+.++.|++.|.-....+-.... +| ...+....+..++.||.|+-. ..|. 
T Consensus 68 ~~~~~~i~~AvLllvg~~YenRe~~~----------~~-------~~~~~~~lP~~v~~Ll~~yR~-----~~gv 120 (120) T protein:vir:10 68 IVANDAIRAAILLTIGKLYAFREDVV----------SG-------ASASVTELPSGAKSLLFPYRV-----GLGV 120 (120) T ss_pred ccCCHHHHHHHHHHHHHHHhchhhhh----------hc-------ccccccccCHHHHHHHHHhhh-----ccCC Confidence 45899999999999976654422100 01 011223345568889988832 2332 No 30 >protein:vir:107864 Length: 150 # NCBI annotation: gp36 # Family: family:all:348 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024709;genbank:gi:48696946;genbank:GeneID:2845952 Probab=83.86 E-value=0.036 Score=28.51 Aligned_cols=118 Identities=14% Similarity=0.189 Sum_probs=61.6 Q ss_pred cccccHHHHHHHHHhccC---------C--cccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCccc Q lcl|NC_019407. 17 NSYCDVQFADDYIYANVY---------A--NTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDI 85 (173) Q Consensus 17 nSY~tv~~aday~~~r~~---------~--~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~ 85 (173) =+|+|+++..+.|..+-. + .+.....+++..+++|..|+..||+.+. .| T Consensus 1 M~Y~T~~Dl~~r~ge~el~~Ltd~~~~g~~~~~~~~~d~~~i~~Al~dA~~eIDgYL~---~R----------------- 60 (150) T protein:vir:10 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESSVRQAEEIVDAHLR---GR----------------- 60 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccCccccchhhcCHHHHHHHHHHHHHHHHHHHh---hh----------------- Confidence 689999999887653211 1 1223356788899999999999998432 21 Q ss_pred CCeeeccccchHHHHHHHHHHHHHHHcCC----C-CCCcccc----ce---eEEecCeeEEeecCC----CCCccch--- Q lcl|NC_019407. 86 DGFLIPSDAIPQQLMEATAEMAAALMNND----W-TSPQTTR----GM---KEIQVDVIELKFDSE----IQRGSMP--- 146 (173) Q Consensus 86 dg~~~~~~~IP~~V~~A~~elA~~~~~~~----~-~~~~~~~----~v---~~~kVG~isveY~~~----~~~~~~~--- 146 (173) +.+|-..+|..|+..||-+|.+.|... . .+..... .+ +.+.-|.++..-... 
++++..+ T Consensus 61 --Y~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~v~~~ 138 (150) T protein:vir:10 61 --YNLPLSPVPTVIKDVTVNLARHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPSGPATPEPGEMKVRAR 138 (150) T ss_pred --ccCCcccccHHHHHHHHHHHHHHHHhcccccCCCCHHHHHHHHHHHHHHHHHhcCcccCCCCCCCCCCCCceeeeecC Confidence 223446799999999999996555321 1 1111111 11 111225554422111 1111000 Q ss_pred --HHHHHHHHhh Q lcl|NC_019407. 147 --DIVMSILEGL 156 (173) Q Consensus 147 --~~v~~lL~~l 156 (173) .+=..-|++| T Consensus 139 ~r~f~r~~l~gf 150 (150) T protein:vir:10 139 RRQFDADLLERF 150 (150) T ss_pred CCccChhhccCC Confidence 0001122233 No 31 >protein:vir:10365 Length: 115 # NCBI annotation: conserved phage protein # Family: family:all:363 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858957;genbank:gi:32128422;genbank:GeneID:2648387 Probab=83.48 E-value=0.062 Score=27.21 Aligned_cols=114 Identities=11% Similarity=0.029 Sum_probs=63.3 Q ss_pred cccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCcccc-ccCCcCCCcccCCeeeccccchH Q lcl|NC_019407. 19 YCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSG-LRWPRAGVYDIDGFLIPSDAIPQ 97 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~-lawPR~gv~~~dg~~~~~~~IP~ 97 (173) =+|+++++.+. |.-... ...+|+-.+..+-.|.+|+.. |.|++.-..|. +..+..+... ......||+ T Consensus 1 mvtLe~~K~hL--Rid~~d--~d~dD~li~~~i~AA~~~v~~---~~~r~l~~~~~~~~~~~~~~~~----~~~~~~~p~ 69 (115) T protein:vir:10 1 MITLAMVQRHL--QAELYE--DDERDYVMQQLLPAARESAEL---FINRKLYDTQADMLADQAAGVD----PAGQLLITR 69 (115) T ss_pred CCCHHHHHHHc--CCCCCC--CchhhHHHHHHHHHHHHHHHH---HhCCcccccccccccccccccC----CcccccCCh Confidence 78999999885 431100 112466678888888888874 56666543222 2222221110 011234899 Q ss_pred HHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhccCCcccccc Q lcl|NC_019407. 
98 QLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 98 ~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~r 169 (173) .|+.|...|.-+...+-.. ..+| +....|..++.||.|+-. .+-+| T Consensus 70 ~i~~AiLLlvg~~Y~nRe~----------~~~~----------~~~elP~~v~~LL~pyR~------~~gv~ 115 (115) T protein:vir:10 70 TVEQAILLTVGEWYANREQ----------VWVK----------GVGLVTSSAQNLLHPYRK------FAGVR 115 (115) T ss_pred HHHHHHHHHHHHHHhcchh----------cccc----------hhhhcCHHHHHHHHHHHh------cCCCC Confidence 9999999999766543110 0011 112345568999999843 22233 No 32 >protein:vir:7857 Length: 188 # NCBI annotation: gp14 # Family: family:all:11114 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817464;genbank:gi:29565893;genbank:GeneID:1259086 Probab=82.83 E-value=0.042 Score=28.13 Aligned_cols=121 Identities=20% Similarity=0.235 Sum_probs=63.2 Q ss_pred HHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccc--------------c-ccCCc--------------- Q lcl|NC_019407. 22 VQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWA--------------G-EKVDE--------------- 71 (173) Q Consensus 22 v~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~--------------G-~r~~~--------------- 71 (173) ..+|+..+.. ..++...-+-||..|+.-+.+.++|. | ++.-+ T Consensus 1 ~~~~~~la~~--------~~~da~~v~lAL~~As~~VR~~~g~~i~~V~~dt~~id~~Gg~~~LP~~Pvv~i~~Ve~~~~ 72 (188) T protein:vir:78 1 MTFAQQLADA--------FPEDADDAATALSWAKSQVEGYCGRKFDLVEDDVAIVDPYCGSSLLPSIPVVEISKVEGYLP 72 (188) T ss_pred CchhhhHHHh--------cCCCcchHHHHHHHHHHHHHHHhCCcceeeeeeeeeeecCCCceeeccCcceeeeEEEEEee Confidence 1112222111 11234444557888888887654321 1 10000 Q ss_pred --------------------------cc-------cccCCc----CCCcccCCeeeccccchHHHHHHHHHHHHHHHcCC Q lcl|NC_019407. 
72 --------------------------DS-------GLRWPR----AGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNND 114 (173) Q Consensus 72 --------------------------~Q-------~lawPR----~gv~~~dg~~~~~~~IP~~V~~A~~elA~~~~~~~ 114 (173) .| .-.||+ .-|....| -++||.+|+...|++|-+++.+| T Consensus 73 ~G~~~~~v~~r~y~~~g~~~~l~~trg~pg~~~~r~~~WPw~p~~VtVTytHG----y~evP~eiv~lv~d~A~~~~~np 148 (188) T protein:vir:78 73 TGNGMDWVELTNYWFKRDTGLIFDTTGLPGSEWSTGHTWPWLPGSLRVTYTHG----YNPVPDELIDVAIRLAREYQSNP 148 (188) T ss_pred CCcccccccccccccccceeeecccccCcccccccccccccCcceEEEEEecC----CCcccHHHHHHHHHHHHHHhcCc Confidence 01 112442 11111111 24799999999999999888764 Q ss_pred CCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhccC Q lcl|NC_019407. 115 WTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTG 162 (173) Q Consensus 115 ~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~G 162 (173) . ....++||++|++|+.....+ .-.+=..+|+++---.+- T Consensus 149 ~-------~L~q~~vG~~S~tfa~~~~~s-l~~~~~~il~ry~l~~~~ 188 (188) T protein:vir:78 149 E-------LLVSKQVGEIERRFGSVAGTS-LSKADQAILDRYVIATLA 188 (188) T ss_pred c-------cceeeecCceeeecccccCCc-ccchhHHhhccccccccC Confidence 3 467899999999998533332 223445566665322211 No 33 >protein:vir:101652 Length: 188 # NCBI annotation: gp15 # Family: family:all:11114 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654770;genbank:gi:109302768;genbank:GeneID:4156086 Probab=82.83 E-value=0.042 Score=28.13 Aligned_cols=121 Identities=20% Similarity=0.235 Sum_probs=63.2 Q ss_pred HHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccc--------------c-ccCCc--------------- Q lcl|NC_019407. 22 VQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWA--------------G-EKVDE--------------- 71 (173) Q Consensus 22 v~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~--------------G-~r~~~--------------- 71 (173) ..+|+..+.. ..++...-+-||..|+.-+.+.++|. 
| ++.-+ T Consensus 1 ~~~~~~la~~--------~~~da~~v~lAL~~As~~VR~~~g~~i~~V~~dt~~id~~Gg~~~LP~~Pvv~i~~Ve~~~~ 72 (188) T protein:vir:10 1 MTFAQQLADA--------FPEDADDAATALSWAKSQVEGYCGRKFDLVEDDVAIVDPYCGSSLLPSIPVVEISKVEGYLP 72 (188) T ss_pred CchhhhHHHh--------cCCCcchHHHHHHHHHHHHHHHhCCcceeeeeeeeeeecCCCceeeccCcceeeeEEEEEee Confidence 1112222111 11234444557888888887654321 1 10000 Q ss_pred --------------------------cc-------cccCCc----CCCcccCCeeeccccchHHHHHHHHHHHHHHHcCC Q lcl|NC_019407. 72 --------------------------DS-------GLRWPR----AGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNND 114 (173) Q Consensus 72 --------------------------~Q-------~lawPR----~gv~~~dg~~~~~~~IP~~V~~A~~elA~~~~~~~ 114 (173) .| .-.||+ .-|....| -++||.+|+...|++|-+++.+| T Consensus 73 ~G~~~~~v~~r~y~~~g~~~~l~~trg~pg~~~~r~~~WPw~p~~VtVTytHG----y~evP~eiv~lv~d~A~~~~~np 148 (188) T protein:vir:10 73 TGNGMDWVELTNYWFKRDTGLIFDTTGLPGSEWSTGHTWPWLPGSLRVTYTHG----YNPVPDELIDVAIRLAREYQSNP 148 (188) T ss_pred CCcccccccccccccccceeeecccccCcccccccccccccCcceEEEEEecC----CCcccHHHHHHHHHHHHHHhcCc Confidence 01 112442 11111111 24799999999999999888764 Q ss_pred CCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhccC Q lcl|NC_019407. 115 WTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTG 162 (173) Q Consensus 115 ~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~G 162 (173) . 
....++||++|++|+.....+ .-.+=..+|+++---.+- T Consensus 149 ~-------~L~q~~vG~~S~tfa~~~~~s-l~~~~~~il~ry~l~~~~ 188 (188) T protein:vir:10 149 E-------LLVSKQVGEIERRFGSVAGTS-LSKADQAILDRYVIATLA 188 (188) T ss_pred c-------cceeeecCceeeecccccCCc-ccchhHHhhccccccccC Confidence 3 467899999999998533332 223445566665322211 No 34 >protein:vir:93592 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:1879 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449297;genbank:gi:157166045;uniprot:Q6H9U4;genbank:GeneID:5580414 Probab=81.51 E-value=0.085 Score=26.45 Aligned_cols=108 Identities=19% Similarity=0.180 Sum_probs=62.5 Q ss_pred ccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccc Q lcl|NC_019407. 16 ANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAI 95 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~I 95 (173) --.++|+++++++. |.-+. .+|+..+..+..|.+|+.. |.+.+.+. ..+.++.+. ...+ T Consensus 1 mm~~vtLeevK~hL--RId~d-----~dD~li~~~i~aA~~~v~~---~l~~~~~~----------~~~~~~~~~-~~~~ 59 (108) T protein:vir:93 1 MTALLTLEEIKAHL--RVDHD-----ADDDMLMDKVRQATAVLLA---YIQGSRDK----------VIREDGELI-PGEA 59 (108) T ss_pred CCcCCCHHHHHHHc--CCCCC-----cChHHHHHHHHHHHHHHHH---Hhcccccc----------ccccccccc-cccC Confidence 23467899999984 54222 2677788777888788864 23322211 122223333 3457 Q ss_pred hHHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhccCCccccc Q lcl|NC_019407. 96 PQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFK 168 (173) Q Consensus 96 P~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~ 168 (173) |..|+.|++.|.-...++-.. ++.. +.+.+..|..|..||.|+ +.+... 
T Consensus 60 ~~~i~~AvLlLv~~~YenRe~------------~~~~------~~~~~elP~~v~~Ll~~~------R~p~~~ 108 (108) T protein:vir:93 60 LTRMKGAAMRLTGMLYRNPDL------------AERE------ELLQGELPFSVSVLIYDL------RCPTVL 108 (108) T ss_pred ChHHHHHHHHHHHHHHhcccc------------cccc------ccccccCCHHHHHHHHHc------cccccC Confidence 899999999999766544221 1110 122334566788888888 344444 No 35 >protein:vir:486 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543093;swissprot:trembl:q8w626;genbank:gi:18249905;uniprot:Q8W626;genbank:GeneID:929692 Probab=81.40 E-value=0.076 Score=26.71 Aligned_cols=104 Identities=9% Similarity=0.008 Sum_probs=62.5 Q ss_pred ccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCcccc---ccCCcCCCcccCCeeecccc Q lcl|NC_019407. 18 SYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSG---LRWPRAGVYDIDGFLIPSDA 94 (173) Q Consensus 18 SY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~---lawPR~gv~~~dg~~~~~~~ 94 (173) =++|+++++.|. |.-+. ++ -+|+..+..+..|.+||.. |.|++....+. ..+|. .-. T Consensus 1 M~vtL~e~K~hL--Rid~D--~~-ddD~li~~~i~aA~~~i~~---~~~r~l~~~~~~~~~~~~~------------~~~ 60 (107) T protein:vir:48 1 MLLKEEEIKSHL--RLDDG--LY-SDGDFLKLLAQAVQKRTET---YLNRKLYAPEETIPEDDPD------------GMH 60 (107) T ss_pred CCCCHHHHHHHc--CCCCC--Cc-hhHHHHHHHHHHHHHHHHH---HhccccccccccccccCcc------------ccc Confidence 678999999994 43222 11 1556677777788888874 56776544332 23332 124 Q ss_pred chHHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhcc Q lcl|NC_019407. 95 IPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKT 161 (173) Q Consensus 95 IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~ 161 (173) ||+.++.|+..|+-....+-. .+.. 
.+....|..++.||.|+-...+ T Consensus 61 ~~~~ik~Avlllv~~~Y~NRe------------~v~~--------~~~~~iP~~v~~LL~~yR~~~l 107 (107) T protein:vir:48 61 LTDDVRLAMLMLVSHFYENRS------------TITD--------VEKLETPMSFRWLAGPYRIVPL 107 (107) T ss_pred cchhHHHHHHHHHHHHHhhhh------------hhcc--------ccccccCHHHHHHHHHhhccCC Confidence 789999999999866554321 1100 1112344568899999855444 No 36 >protein:vir:1640 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695061;genbank:gi:23455752;genbank:GeneID:955488 Probab=79.56 E-value=0.1 Score=25.99 Aligned_cols=128 Identities=14% Similarity=0.122 Sum_probs=63.4 Q ss_pred ccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccc-c Q lcl|NC_019407. 16 ANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSD-A 94 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~-~ 94 (173) -+.|+|+++..+-| |.... + ..+..+.+|-.|+++|-.++ |+.+. .++...-++. . T Consensus 1 m~~fAtv~Dv~~r~--r~L~~---~--E~~ra~~lL~dAs~~ir~~~---------------p~~~~-~l~a~~~e~~~~ 57 (132) T protein:vir:16 1 MNPFATVDDLTMLW--RPLKG---D--EKERAEKLLEIVSDSLREEA---------------DKVGR-DLYAMIAEKPSY 57 (132) T ss_pred CCccCCHHHHHHHh--cCCCH---h--HHHHHHHHHHHHHHHHHHhh---------------hhhcc-cccccccccccc Confidence 68999999998665 31110 0 12356778899999998653 33221 1222221221 2 Q ss_pred chHHHHHHHHHHHHHHHcCCCCCCccccceeEEecCee--EEeecCCCCCccchHHHHHHHHhhhhhccCCcccccceec Q lcl|NC_019407. 95 IPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVI--ELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKIIR 172 (173) Q Consensus 95 IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~i--sveY~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~rv~R 172 (173) .+.-++.-+|+....++..+... .+..-.++..|+. +.+|.. +.|..+ +-+..++-|+. 
.|++++-.-+-= T Consensus 58 ~~~~~~~V~~~~V~Ral~~~~~~--~G~tq~S~TaG~ys~S~t~~~--p~G~ly-lt~~e~~~LG~--~~~r~~~i~~~~ 130 (132) T protein:vir:16 58 FASVVKSVTVDIVARTLMTSTDQ--EPMTQTTESALGYSVSGSYLV--PGGGLF-IKNSELSRLGL--KKQRFGVIDFYG 130 (132) T ss_pred chhHHHHHHHHHHHHHhcCCCCC--CCceeeeeeccchheeeeeec--CCCcce-eChHHHHhhCC--CCCceEEEeecC Confidence 34446778888888777554211 1112356788988 555653 334332 11222222222 233333222211 Q ss_pred C Q lcl|NC_019407. 173 H 173 (173) Q Consensus 173 ~ 173 (173) . T Consensus 131 ~ 131 (132) T protein:vir:16 131 N 131 (132) T ss_pred C Confidence 1 No 37 >protein:vir:103846 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938245;genbank:gi:38229150;genbank:GeneID:2648161 Probab=78.84 E-value=0.022 Score=29.64 Aligned_cols=123 Identities=12% Similarity=0.138 Sum_probs=62.0 Q ss_pred cccccHHHHHHHHHhccC------CcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeee Q lcl|NC_019407. 17 NSYCDVQFADDYIYANVY------ANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLI 90 (173) Q Consensus 17 nSY~tv~~aday~~~r~~------~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~ 90 (173) =+|+|.++..+.+..+-. ........+++..+++|..|+..||+.+ +.| +.+ T Consensus 1 M~Y~T~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~~eIdgyL---~~R-------------------Y~l 58 (138) T protein:vir:10 1 MSYCTQADLVEQYGEASIRQLSDRVNKPATTIDPAVVAQAIADADAEIDLHL---HAR-------------------YQL 58 (138) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccCCCcCccCHHHHHHHHHHHHHHHHHHH---hhc-------------------ccC Confidence 579999999987544321 1112335688889999999999999843 221 224 Q ss_pred ccccchHHHHHHHHHHHHHHHcCCCCCCc-cccc----e---eEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhccC Q lcl|NC_019407. 91 PSDAIPQQLMEATAEMAAALMNNDWTSPQ-TTRG----M---KEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTG 162 (173) Q Consensus 91 ~~~~IP~~V~~A~~elA~~~~~~~~~~~~-~~~~----v---~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~G 162 (173) |-..+|.-|+..||-+|.+.+.+...... .... + +.+.-|.++.--.......+... 
.....+.. T Consensus 59 Pl~~vP~~L~~~a~dIA~Y~L~~~~~~~e~~~~rY~~Ai~~L~~Ia~G~~~Lg~~~~~~~~~~~~-------~~~~~s~~ 131 (138) T protein:vir:10 59 PLAQVPVVLKRVACVLAFANLHTQVKDDHPAILDAERKRKLLGGISSGKLSLALTSSGTPAPIAN-------TVQISSQR 131 (138) T ss_pred CccccchHHHHHHHHHHHHHHhcCCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCcccCCCCC-------ceeeecCC Confidence 55679999999999999766653322111 1111 1 12222555543221111100000 00000000 Q ss_pred CcccccceecC Q lcl|NC_019407. 163 TRPAFKKIIRH 173 (173) Q Consensus 163 ~~~~~~rv~R~ 173 (173) +-|| |. T Consensus 132 r~Fg-----~d 137 (138) T protein:vir:10 132 NDFG-----GT 137 (138) T ss_pred ccCC-----CC Confidence 0000 11 No 38 >protein:vir:99570 Length: 153 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039799;genbank:gi:126011049;genbank:GeneID:4818265 Probab=76.62 E-value=0.13 Score=25.38 Aligned_cols=128 Identities=13% Similarity=0.074 Sum_probs=65.8 Q ss_pred eeEEeeCCCCCCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCC Q lcl|NC_019407. 3 FTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGV 82 (173) Q Consensus 3 M~liVe~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv 82 (173) |+.+|= +++.+.+ +.--...-+..+|+..+..|..|.-+|+.. + |+... T Consensus 1 m~~~~f------------d~~~Fr~----~fPeFad~~~~Pd~~i~~~l~~A~~~l~~~-~-------------~~~~~- 49 (153) T protein:vir:99 1 MADPVY------------NDGLFRI----MYPEFADQEKYPPEVIEIYYDTATLFITGS-M-------------FPCAA- 49 (153) T ss_pred CCcccC------------ChHHHHH----hcccccCccccCHHHHHHHHHHHHHhhcCc-c-------------ccccc- Confidence 555552 2343333 221222223568999999999999999852 1 22111 Q ss_pred cccCCeeeccccchHHHHHHHHHHHHHHHc-------C-CCCCCccccceeEEecCeeEEeecCCCCCccc--------h Q lcl|NC_019407. 
83 YDIDGFLIPSDAIPQQLMEATAEMAAALMN-------N-DWTSPQTTRGMKEIQVDVIELKFDSEIQRGSM--------P 146 (173) Q Consensus 83 ~~~dg~~~~~~~IP~~V~~A~~elA~~~~~-------~-~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~--------~ 146 (173) .-++..+++.+.++.+++. + ........+.++++++|+|||.|+.+...... + T Consensus 50 -----------~~g~~~~~~l~Ll~AH~l~L~~~~~~~~~~a~~~~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~Y 118 (153) T protein:vir:99 50 -----------LSGKQLVGALNMLTAHLMSLSMQRSQTALGATNDQGGYTLSATIGEVSVSKMAPPAKDGWEFWLAQTPY 118 (153) T ss_pred -----------cChHHHHHHHHHHHHHHHHHHhhhhcccccCCCccccceeeeeecceeeeeecCCCCCchhHhhhcCHH Confidence 1134566777777766542 1 11122234568999999999999755433221 1 Q ss_pred H-HHHHHHHhhhhh--ccCCcccccceecC Q lcl|NC_019407. 147 D-IVMSILEGLGVV--KTGTRPAFKKIIRH 173 (173) Q Consensus 147 ~-~v~~lL~~ll~~--~~G~~~~~~rv~R~ 173 (173) - -.=+|++.+... -.|+.|- -..+|. T Consensus 119 Gq~fw~l~~~~~~Gg~v~gg~pe-~~~~r~ 147 (153) T protein:vir:99 119 GQALWALLKMLSVGGFAIGGLPE-RTGFRK 147 (153) T ss_pred HHHHHHHHHHhcccccccCCCCc-cccccc Confidence 1 223355554321 1121221 133444 No 39 >protein:vir:97069 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453566;genbank:gi:84662601;genbank:GeneID:5142484 Probab=75.72 E-value=0.14 Score=25.21 Aligned_cols=113 Identities=12% Similarity=0.051 Sum_probs=60.3 Q ss_pred cccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCcccccc-CC-cCCCcccCCeeeccccch Q lcl|NC_019407. 19 YCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLR-WP-RAGVYDIDGFLIPSDAIP 96 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~la-wP-R~gv~~~dg~~~~~~~IP 96 (173) -+|+++++.+. |.-... ..-+|...+..+-.|.+++.. |.|++....|... 
++ ..+....+| -.|| T Consensus 1 mvtLee~K~hL--Rid~d~--~d~DDali~~~i~AA~~~v~~---~l~r~l~~~~~~~~~~~~~~~~~~~~-----~~~p 68 (115) T protein:vir:97 1 MITLAMMQRHL--QAELYE--DDERDYVMQQLLPAARESAEL---FLNRKLYDVQADMLADQVLGVDPSDQ-----LLIT 68 (115) T ss_pred CCCHHHHHHHc--CCCCCC--CchhhHHHHHHHHHHHHHHHH---HhCCcccchhhcccccccccCCCccc-----ccCC Confidence 89999999885 432111 111344566666677777763 5666654443322 11 111111111 2379 Q ss_pred HHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhccCCcccccc Q lcl|NC_019407. 97 QQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 97 ~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~r 169 (173) +.|+.|...|.-.+..+- |.|.. .+....|-.++.||.|+-.. .| +| T Consensus 69 ~~i~~AiLllvg~~Y~NR------------E~v~~--------~~~~elP~~~~~LL~pyR~~-~G-----v~ 115 (115) T protein:vir:97 69 RTVEQAILLTVGEWYSSR------------EQVWI--------KGAGLVTSSAQNLLHPYRKF-AG-----VR 115 (115) T ss_pred HHHHHHHHHHHHHHHhcc------------ccccc--------ccccccCHHHHHHHHHHHhh-cC-----CC Confidence 999999999986655431 11100 01223456789999998431 22 22 No 40 >protein:vir:4512 Length: 107 # NCBI annotation: unknown # Family: family:all:363 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599038;genbank:gi:19548996;genbank:GeneID:935218 Probab=75.39 E-value=0.14 Score=25.20 Aligned_cols=107 Identities=11% Similarity=0.106 Sum_probs=62.4 Q ss_pred ccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccchH Q lcl|NC_019407. 18 SYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQ 97 (173) Q Consensus 18 SY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~IP~ 97 (173) =++|+++++.|. |.-+. .+ -+|+-.+..+..|..||.. |.|++....+.. ||-.. 
.++ -.||+ T Consensus 1 M~vtL~e~K~hL--RId~D--~~-ddD~lI~~~i~AA~~~i~~---~~~r~~~~~~~~-~~~~~---~~~-----~~~~~ 63 (107) T protein:vir:45 1 MLLKMEEIKLQL--RLDDD--FS-DEDELLELLGKAAQSRTEN---FLNRKLYATADD-RPADD---PDG-----LVISD 63 (107) T ss_pred CCCCHHHHHHHc--CCCCC--Cc-hhHHHHHHHHHHHHHHHHH---Hhcccccccccc-ccccc---ccc-----ccCCh Confidence 688999999994 43222 11 1455677777888899874 677776554433 44321 121 23689 Q ss_pred HHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhcc Q lcl|NC_019407. 98 QLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKT 161 (173) Q Consensus 98 ~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~ 161 (173) .++.|+..+.-....+-.. +. ..+....+..++.||.|+-.... T Consensus 64 ~~~~AvLllv~~~Y~NRe~------------~~--------~~~~~~lp~~v~~Ll~~~R~~~~ 107 (107) T protein:vir:45 64 DVKLALLLLVSHFYENRST------------VT--------DVEKMELPMSFNWLVAPYRLIPL 107 (107) T ss_pred hHHHHHHHHHHHHHhhhhh------------cc--------ccchhccchHHHHHHHHHhhcCC Confidence 9999999888655433211 10 01111334567889988743222 No 41 >protein:vir:81069 Length: 115 # NCBI annotation: p10 # Family: family:all:363 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285680;genbank:gi:148727188;genbank:GeneID:5247114 Probab=72.34 E-value=0.18 Score=24.61 Aligned_cols=113 Identities=11% Similarity=0.025 Sum_probs=60.6 Q ss_pred cccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccc-cCCcCCC-cccCCeeeccccch Q lcl|NC_019407. 19 YCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGL-RWPRAGV-YDIDGFLIPSDAIP 96 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~l-awPR~gv-~~~dg~~~~~~~IP 96 (173) -+|+++++++. |.-... ...+|+-.+..+-.|.+++. +|.|++.-.+|.. .++.... 
.+.+| -.|| T Consensus 1 ivtLee~K~Hl--Rid~dd--~deDD~li~~~i~AA~~~v~---~~l~r~l~~~~~~~~~~~~~~~~~~~~-----~~~p 68 (115) T protein:vir:81 1 MITLAMVQRHL--QAELYE--DDERDYVMQQLLPAARESAE---LFINRKLYDTQADMLADQAAGVDPAGQ-----LLIT 68 (115) T ss_pred CCCHHHHHHHc--CCCCCC--CccchHHHHHHHHHHHHHHH---HHhCCccccccccccccccccCCCCcc-----cccC Confidence 89999999885 431111 11245556666666666665 3566665443332 2222211 11111 1378 Q ss_pred HHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhccCCcccccc Q lcl|NC_019407. 97 QQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 97 ~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~r 169 (173) +.|+.|+..|.-.+..+- |.|.. ++....+..++.||.|+-.. .| +| T Consensus 69 ~~i~~AiLllvg~~Y~NR------------E~v~~--------~~~~elP~~~~~LL~pyR~~-~g-----~~ 115 (115) T protein:vir:81 69 RTVEQAILLTLGEWYSSR------------EQVWT--------KGAGLVTSSAQNLLHPYRKF-AG-----VR 115 (115) T ss_pred HHHHHHHHHHHHHHHhcc------------chhcc--------hhhhhcCHHHHHHHHHHHhh-cC-----CC Confidence 999999999986665431 11100 11223456689999998432 23 22 No 42 >protein:vir:103283 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277463;genbank:gi:71834105;genbank:GeneID:3562394 Probab=72.05 E-value=0.19 Score=24.57 Aligned_cols=109 Identities=11% Similarity=0.048 Sum_probs=61.1 Q ss_pred hccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccchHHHHHHHHHHHHHH Q lcl|NC_019407. 31 ANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAAL 110 (173) Q Consensus 31 ~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~IP~~V~~A~~elA~~~ 110 (173) +|.. .++....+++..++-+--|..||=.. . 
-=++...|...+|+++ T Consensus 1 mR~l-~P~f~~vpdevi~~wid~A~lFVC~~-~-------------------------------fg~~~~~Al~lytlHL 47 (125) T protein:vir:10 1 MRTL-YPPLKSQPDDVLNAWIEVAKLFICLD-K-------------------------------FGDKQVQALAFYTLHL 47 (125) T ss_pred Cccc-cchhhccCHHHHHHHHHHHHHHHHHh-h-------------------------------hhhHHHHHHHHHHHHH Confidence 5542 44556667777777666676666321 1 1133446777777776 Q ss_pred HcCCC-------CCCccccceeEEe-cCeeEEeecCCCCCccch----HHHHHHHHhhhhhccCCccccc-ceecC Q lcl|NC_019407. 111 MNNDW-------TSPQTTRGMKEIQ-VDVIELKFDSEIQRGSMP----DIVMSILEGLGVVKTGTRPAFK-KIIRH 173 (173) Q Consensus 111 ~~~~~-------~~~~~~~~v~~~k-VG~isveY~~~~~~~~~~----~~v~~lL~~ll~~~~G~~~~~~-rv~R~ 173 (173) +.-+. ...+..+++++-+ .|+++++|+..+..+.-+ .-.-.|+.-|+. ..|+|++.. +..+. T Consensus 48 m~~dga~k~e~~~~~~~s~r~~s~slsGE~Sit~~~~s~d~s~~~L~~T~wGk~~~~L~k-~~~GgFaL~T~~~~~ 122 (125) T protein:vir:10 48 LSQDIALKTENDSSQTSSERVKSYSLSGEYTISYDTSTAAASSSNLEESSWGKLYIDLMR-LKVGRWGLITSGGSR 122 (125) T ss_pred HhcccccccccccccccccceeeeeeccceEeecccccccccccccccCchHHHHHHHHH-hcCCceeeecccccc Confidence 65432 1123345688888 499999998776554321 112334555555 446677655 22222 No 43 >protein:vir:99848 Length: 172 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164077;genbank:gi:56692609;genbank:GeneID:3192576 Probab=65.54 E-value=0.079 Score=26.62 Aligned_cols=134 Identities=16% Similarity=0.139 Sum_probs=60.0 Q ss_pred CeeeEEeeCCCCCC----------CccccccHHHHHHHHHhccCCcccC-------CCCCHHHHHHHHHHHHHHhhhhhc Q lcl|NC_019407. 1 MAFTFVVETGAGDP----------AANSYCDVQFADDYIYANVYANTAW-------DALDQDEKERFLVRASKYLDRTIA 63 (173) Q Consensus 1 ~~M~liVe~g~g~~----------~AnSY~tv~~aday~~~r~~~~~~w-------~~~~~~~ke~aL~~As~~id~~~~ 63 (173) |+|=.+++|=.-.. ..+.|.+-++..+.+.. +.-...| ...+++..+.+|..|+..||+.+. 
T Consensus 1 ~~mYaT~~dl~~r~g~~el~qLt~~~~~~~~~~~l~d~~~~-~~~~~~~~~~~~~~g~~d~~~i~~Al~dA~aeIDgYL~ 79 (172) T protein:vir:99 1 MAVYITLPELAERPGAEELSQAATPRPLQAVDSELLDALLR-GLPVDRWTPEEIEVGHATVEVINSAVSDAQGYIDGFLQ 79 (172) T ss_pred CcccccHHHHHhhcCHHHHHHHhccccccCCHHHHHHHhhc-chhhhhcccccccccccCHHHHHHHHHHHHHHHHHHHh Confidence 77755544411111 12233333332222111 1111222 235788999999999999999543 Q ss_pred cccccCCccccccCCcCCCcccCCeeeccccchHHHHHHHHHHHHHHHcCC--C--C-CCccccc----e---eEEecCe Q lcl|NC_019407. 64 WAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQQLMEATAEMAAALMNND--W--T-SPQTTRG----M---KEIQVDV 131 (173) Q Consensus 64 ~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~IP~~V~~A~~elA~~~~~~~--~--~-~~~~~~~----v---~~~kVG~ 131 (173) |+ ++.+|-..+|.-|+..||-+|.+.+... . . +...... + +.+.-|. T Consensus 80 --~R-------------------~Y~lPL~~vP~~L~~~a~dIArY~L~~~r~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk 138 (172) T protein:vir:99 80 --RR-------------------GYSLPLAKRYGVVTGWTRAIARYLLHQDRLGPGAEKDPIVRDYRDALKFLQLIAEGK 138 (172) T ss_pred --cc-------------------cccCCCcccchHHHHHHHHHHHHHHHhccCCcccCCHHHHHHHHHHHHHHHHHhcCc Confidence 11 1345556799999999999996555421 1 1 1111111 1 2222255 Q ss_pred eEEeecCCC--CC-c-cch-----HHHHHHHHhh Q lcl|NC_019407. 132 IELKFDSEI--QR-G-SMP-----DIVMSILEGL 156 (173) Q Consensus 132 isveY~~~~--~~-~-~~~-----~~v~~lL~~l 156 (173) ++.-=.... ++ + ..+ .+=..-|++| T Consensus 139 ~~Lg~~~~~~~~~~~~~~v~~~~r~F~rd~L~gf 172 (172) T protein:vir:99 139 FSLGPDDPLTPPGGGVPQVLAPARTFSHDTLKDY 172 (172) T ss_pred cccCCCCCCCCCCCCceeeecCCCccChhhccCC Confidence 544211111 11 1 000 0111122233 No 44 >protein:vir:79253 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469165;genbank:gi:157835007;genbank:GeneID:5648834 Probab=64.64 E-value=0.041 Score=28.18 Aligned_cols=119 Identities=13% Similarity=0.174 Sum_probs=62.2 Q ss_pred cccccHHHHHHHHHhccC------CcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeee Q lcl|NC_019407. 
17 NSYCDVQFADDYIYANVY------ANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLI 90 (173) Q Consensus 17 nSY~tv~~aday~~~r~~------~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~ 90 (173) =+|+|.++..+.|..+-. ........+++..+++|..|+..||+.+. .| +.+ T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL~---~R-------------------Y~l 58 (138) T protein:vir:79 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLH---GR-------------------YQL 58 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHHh---hc-------------------ccC Confidence 579999999876554321 11122356788899999999999998432 21 234 Q ss_pred ccccchHHHHHHHHHHHHHHHcCCCCCCc-cccc----e---eEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhccC Q lcl|NC_019407. 91 PSDAIPQQLMEATAEMAAALMNNDWTSPQ-TTRG----M---KEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTG 162 (173) Q Consensus 91 ~~~~IP~~V~~A~~elA~~~~~~~~~~~~-~~~~----v---~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~G 162 (173) |-..+|.-|+..||-+|.+.+.+...... .... + +.+.-|.++.--.......+ ++ T Consensus 59 Pl~~vP~~L~~~a~dIA~Y~L~~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~----------------~~ 122 (138) T protein:vir:79 59 PLASVPTALKRIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAP----------------VA 122 (138) T ss_pred CccccchHHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcCCC----------------CC Confidence 55679999999999999766654332211 1111 1 12222544442111110000 00 Q ss_pred Ccccc---ccee-cC Q lcl|NC_019407. 163 TRPAF---KKII-RH 173 (173) Q Consensus 163 ~~~~~---~rv~-R~ 173 (173) ++.-+ -|++ |. 
T Consensus 123 ~~~~~~~~~r~F~Rd 137 (138) T protein:vir:79 123 NTVQISEGRNDWGAD 137 (138) T ss_pred CceeeecCCCCCCCC Confidence 00000 0111 22 No 45 >protein:vir:99222 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950460;genbank:gi:119953661;genbank:GeneID:4643082 Probab=64.64 E-value=0.041 Score=28.18 Aligned_cols=119 Identities=13% Similarity=0.174 Sum_probs=62.2 Q ss_pred cccccHHHHHHHHHhccC------CcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeee Q lcl|NC_019407. 17 NSYCDVQFADDYIYANVY------ANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLI 90 (173) Q Consensus 17 nSY~tv~~aday~~~r~~------~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~ 90 (173) =+|+|.++..+.|..+-. ........+++..+++|..|+..||+.+. .| +.+ T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL~---~R-------------------Y~l 58 (138) T protein:vir:99 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLH---GR-------------------YQL 58 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHHh---hc-------------------ccC Confidence 579999999876554321 11122356788899999999999998432 21 234 Q ss_pred ccccchHHHHHHHHHHHHHHHcCCCCCCc-cccc----e---eEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhccC Q lcl|NC_019407. 91 PSDAIPQQLMEATAEMAAALMNNDWTSPQ-TTRG----M---KEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTG 162 (173) Q Consensus 91 ~~~~IP~~V~~A~~elA~~~~~~~~~~~~-~~~~----v---~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~G 162 (173) |-..+|.-|+..||-+|.+.+.+...... .... + +.+.-|.++.--.......+ ++ T Consensus 59 Pl~~vP~~L~~~a~dIA~Y~L~~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~----------------~~ 122 (138) T protein:vir:99 59 PLASVPTALKRIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAP----------------VA 122 (138) T ss_pred CccccchHHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcCCC----------------CC Confidence 55679999999999999766654332211 1111 1 12222544442111110000 00 Q ss_pred Ccccc---ccee-cC Q lcl|NC_019407. 
163 TRPAF---KKII-RH 173 (173) Q Consensus 163 ~~~~~---~rv~-R~ 173 (173) ++.-+ -|++ |. T Consensus 123 ~~~~~~~~~r~F~Rd 137 (138) T protein:vir:99 123 NTVQISEGRNDWGAD 137 (138) T ss_pred CceeeecCCCCCCCC Confidence 00000 0111 22 No 46 >protein:vir:4458 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700381;genbank:gi:23505453;genbank:GeneID:955660 Probab=59.98 E-value=0.38 Score=22.88 Aligned_cols=107 Identities=10% Similarity=0.128 Sum_probs=61.3 Q ss_pred ccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccchH Q lcl|NC_019407. 18 SYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQ 97 (173) Q Consensus 18 SY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~IP~ 97 (173) =++|+++++.+. |.-+. ++ -||.-.+..+..|.+||.. |.|++....+.. +|... .+| -.+|. T Consensus 1 M~vtLee~K~hL--RId~D--~~-dDD~lI~~~i~AA~~~i~~---~~~r~l~~~~~~-~~~~~---~~~-----~~~~~ 63 (107) T protein:vir:44 1 MLLSVEEIKAQL--RLDED--FE-ADERYLQLLARAVQKRTET---YLNRKLYAPDET-IPDSD---PDG-----LLLQD 63 (107) T ss_pred CCCCHHHHHHHc--CCCCC--Cc-hhHHHHHHHHHHHHHHHHH---hhcCcccccccc-ccccc---ccc-----ccchh Confidence 688999999994 43221 11 1455677777788899874 677776544432 33321 112 24688 Q ss_pred HHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhcc Q lcl|NC_019407. 98 QLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKT 161 (173) Q Consensus 98 ~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~ 161 (173) .++.|++.|+-....+-.. +.. .+....+-.+..||.|+--.-. 
T Consensus 64 ~~~~AiLllv~~~Y~NRe~-------~~~-------------~~~~~lP~~v~~Ll~~yR~~p~ 107 (107) T protein:vir:44 64 DIRLGMLMLISHFYENRSS-------VTE-------------VEKLDMPQSFGWLVGPYRYFPQ 107 (107) T ss_pred hHHHHHHHHHHHHHhhhhh-------hcc-------------ccccccCHHHHHHHHHhhhcCC Confidence 8999999998665543211 000 1112244457788888733222 No 47 >protein:vir:80036 Length: 111 # NCBI annotation: gp10 # Family: family:all:3186 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468714;genbank:gi:157325294;genbank:GeneID:5601727 Probab=58.11 E-value=0.15 Score=25.08 Aligned_cols=109 Identities=10% Similarity=0.011 Sum_probs=53.4 Q ss_pred ccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccc Q lcl|NC_019407. 16 ANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAI 95 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~I 95 (173) -|+= ++....- +.. =++++|+..+.++-.|..-. |.+.||- T Consensus 1 m~tt--v~~vkl~-a~~------L~~~sDDsl~~~I~dA~~e~--------------~a~gFp~---------------- 41 (111) T protein:vir:80 1 MKTD--VSKLKLT-ASS------LASVSDDSLQVHIDDSYLEV--------------QEKGFPE---------------- 41 (111) T ss_pred Cchh--HHHHHHh-hHh------hcCCChHHHHHHHHHHHHHh--------------hcCCCCh---------------- Confidence 1221 2222211 111 12467777776655553333 3344453 Q ss_pred hHHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCc--cchHHHHHHHHhhhhhccCCccccccee Q lcl|NC_019407. 96 PQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRG--SMPDIVMSILEGLGVVKTGTRPAFKKII 171 (173) Q Consensus 96 P~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~--~~~~~v~~lL~~ll~~~~G~~~~~~rv~ 171 (173) +--.+||--||++++.=+ ..+|++||||.++-+|++.+... 
.+-+|=.-.+ .|+....|++....-|+ T Consensus 42 -~~~e~a~rYLa~HLat~~------~~~v~sE~V~~Lk~~Y~~~~~~~~l~~s~wGq~Y~-rL~k~~~~gs~~~~vVv 111 (111) T protein:vir:80 42 -KFEERANRYLAAHLATLA------NKNVKSEAVGSLKREYYEVKGDSGLLSTEYGQEYA-RLLKEANGGSGISMVVV 111 (111) T ss_pred -hHHHHHHHHHHHHHHHhc------CCCCchhhhhhHHHHhhhcccccccccchhHHHHH-HHHHHhcCCccceeeeC Confidence 123367777888766432 55799999999999998644332 1223322222 22222223333333444 No 48 >protein:vir:96108 Length: 155 # NCBI annotation: hypothetical protein ORF025 # Family: family:all:664 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294442;genbank:gi:149408339;genbank:GeneID:5237227 Probab=50.64 E-value=0.6 Score=21.79 Aligned_cols=126 Identities=10% Similarity=-0.047 Sum_probs=60.5 Q ss_pred CCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeecc Q lcl|NC_019407. 13 DPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPS 92 (173) Q Consensus 13 ~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~ 92 (173) .+.. +++.+. ++.--...-+..+|+..+..|..|.-+|+.+ +.+. ..| + T Consensus 1 ~v~f----d~~~FR----~~fPeFad~~~~Pd~~i~~~l~~A~~~l~~~--~~~s---------~~~------~------ 49 (155) T protein:vir:96 1 MVIF----DEQKFR----TLFPEFADPASYPAVRLQLYFDIACEFISDR--DSPY---------RIL------N------ 49 (155) T ss_pred Cccc----CHHHHH----HhCccccCcccCCHHHHHHHHHHHHHhhcCC--Cccc---------ccc------C------ Confidence 2222 223332 3322222223568999999999999999742 1111 001 1 Q ss_pred ccchHHHHHHHHHHHHHHHc-------CCC-----CCCccccceeEEecCeeEEeecCCCCCcc--------chH-HHHH Q lcl|NC_019407. 93 DAIPQQLMEATAEMAAALMN-------NDW-----TSPQTTRGMKEIQVDVIELKFDSEIQRGS--------MPD-IVMS 151 (173) Q Consensus 93 ~~IP~~V~~A~~elA~~~~~-------~~~-----~~~~~~~~v~~~kVG~isveY~~~~~~~~--------~~~-~v~~ 151 (173) ...-+++.+.++.+++. |.. ......+.++++++|+|||.|+.+..... 
.+- -.=+ T Consensus 50 ---g~~~~~~l~Ll~AH~l~L~~~~~~gaa~~g~~~~g~~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~f~~ 126 (155) T protein:vir:96 50 ---GKALEACLYLLTAHLLSLSTMQVQGAAGGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWA 126 (155) T ss_pred ---hHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccccceeeceecceeeeeecCCCCCchhHHhhcCHHHHHHHH Confidence 23344566666655543 111 11123455899999999999986543322 111 2233 Q ss_pred HHHhhhhh--ccCCcccccceecC Q lcl|NC_019407. 152 ILEGLGVV--KTGTRPAFKKIIRH 173 (173) Q Consensus 152 lL~~ll~~--~~G~~~~~~rv~R~ 173 (173) |++.+... -.|+.| --..+|. T Consensus 127 l~~~~~~Gg~~vgG~p-er~~~r~ 149 (155) T protein:vir:96 127 LLSVKAVGGFYIGGLP-ERRGFRK 149 (155) T ss_pred HHHHhcccccccCCCC-ccccccc Confidence 55555320 011111 1134455 No 49 >protein:vir:104344 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398973;genbank:gi:81343957;genbank:GeneID:3778876 Probab=44.69 E-value=0.8 Score=21.12 Aligned_cols=116 Identities=15% Similarity=0.081 Sum_probs=64.1 Q ss_pred ccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccc Q lcl|NC_019407. 16 ANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAI 95 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~I 95 (173) -| ++.-+.+ |. -.+....+||+..+.-+--|-.||=. +.. T Consensus 1 ~~-----~~~~e~~--R~-l~P~f~kvpdevI~~wielA~lfVc~--------------------------------~~~ 40 (132) T protein:vir:10 1 MN-----DAILAFM--RS-LVPALKAVDDESINVWIDLARLYVCA--------------------------------DKF 40 (132) T ss_pred Cc-----hHHHHHH--HH-hcchhhcCChHHHHHHHHHHHHHHHh--------------------------------hcC Confidence 11 1222232 22 13556677888888766666666632 223 Q ss_pred hHHHHHHHHHHHHHHHcCCCCC--Cccccc-----eeEEec-CeeEEeecCCCCCccc---hHHHHHHHHhhhhhccCCc Q lcl|NC_019407. 
96 PQQLMEATAEMAAALMNNDWTS--PQTTRG-----MKEIQV-DVIELKFDSEIQRGSM---PDIVMSILEGLGVVKTGTR 164 (173) Q Consensus 96 P~~V~~A~~elA~~~~~~~~~~--~~~~~~-----v~~~kV-G~isveY~~~~~~~~~---~~~v~~lL~~ll~~~~G~~ 164 (173) +++...|....|++++.-|... .+..+. |++-++ |+++++|+..++.+.- -||= .|+.-|+. ..|+| T Consensus 41 g~~~~~AlaL~taHLm~~dga~k~en~~~~t~S~rvaS~Sl~Ge~Sisf~~~sa~~s~L~~tp~G-kl~~~L~k-~~~Gg 118 (132) T protein:vir:10 41 GNDADRAVGLYALHLMLSDGAFKGENEGLETYSRRMASYSLSGEFSITYDNQSAIQGDLSSSSWG-RMYKALLR-KKGGG 118 (132) T ss_pred chhHHHHHHHHHHHHhhccccccccccchhhhhhhhhhhcccCceeeecccccccccccccCcHH-HHHHHHHH-hccCc Confidence 5666788888888888754322 222223 344443 9999999876654321 1333 55655555 44567 Q ss_pred cccc--ceecC Q lcl|NC_019407. 165 PAFK--KIIRH 173 (173) Q Consensus 165 ~~~~--rv~R~ 173 (173) +|.. ..+|- T Consensus 119 fgL~t~~~~~~ 129 (132) T protein:vir:10 119 FGLITSAAGGG 129 (132) T ss_pred cccccccCcCC Confidence 7655 33333 No 50 >protein:vir:102961 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:26777 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945287;genbank:gi:39653722;uniprot:Q708M5;genbank:GeneID:2672875 Probab=44.14 E-value=0.82 Score=21.06 Aligned_cols=118 Identities=17% Similarity=0.132 Sum_probs=62.7 Q ss_pred cHHHHHH----HHHh--ccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeecccc Q lcl|NC_019407. 21 DVQFADD----YIYA--NVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDA 94 (173) Q Consensus 21 tv~~ada----y~~~--r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~ 94 (173) -.++++. |... ++..... 
...|+.-.+-+|.++-.+|=-.++ -.+ T Consensus 1 ~~~~lkq~~~~~~~~~~l~~~~d~-~~kD~~vl~faie~v~~~IlnycN----------------------------ike 51 (131) T protein:vir:10 1 MIQELKQDNTMYLISCVRKMRQDN-YFKDMEVLHYALTQAENEILNYIH----------------------------QDS 51 (131) T ss_pred Chhhhhhhhhhhhhhhhhcccccc-ccchHHHHHHHHHHHHHHHhhhcC----------------------------Ccc Confidence 4444443 2211 1111100 001233456666666666542111 136 Q ss_pred chHHHHHHHHHHHHHHHcCCCCCC-------ccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhccCCcccc Q lcl|NC_019407. 95 IPQQLMEATAEMAAALMNNDWTSP-------QTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAF 167 (173) Q Consensus 95 IP~~V~~A~~elA~~~~~~~~~~~-------~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~ 167 (173) ||.+++.-....|.-++.++.+.+ ...+.|++.|-|.-+|+|..+.....+...+..+|..+... =-.| T Consensus 52 iP~~Le~v~~~maiDll~~e~~~~~k~~~i~~~~g~VsSI~eGDTsIsf~s~t~~~qrl~~~~s~l~~Y~~q----L~~y 127 (131) T protein:vir:10 52 VPGRLENVWIDMTNDLLDKVKEQSVLAEKAGADDFSVKSIKMGDTTIEKVSPYEMIQRMKQVPSSLERYKRQ----LNRF 127 (131) T ss_pred cchhhHHHHHHHHHHHHhhhcccccccccccccccceeeeeecceeeeccCCccHHHHHHHHHHHHhhhHHH----Hhhh Confidence 899999999999998888765433 23456999999999999976654433333333344333111 1112 Q ss_pred ccee Q lcl|NC_019407. 168 KKII 171 (173) Q Consensus 168 ~rv~ 171 (173) -|++ T Consensus 128 RRL~ 131 (131) T protein:vir:10 128 RKLL 131 (131) T ss_pred cccC Confidence 3555 No 51 >protein:vir:5742 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892053;genbank:gi:33770516;uniprot:Q7Y407;genbank:GeneID:2637465 Probab=43.79 E-value=0.83 Score=21.02 Aligned_cols=108 Identities=14% Similarity=0.045 Sum_probs=62.5 Q ss_pred cccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccc--cCCcCCCcccCCeeecccc Q lcl|NC_019407. 17 NSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGL--RWPRAGVYDIDGFLIPSDA 94 (173) Q Consensus 17 nSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~l--awPR~gv~~~dg~~~~~~~ 94 (173) =..+|+++.++.. 
|.-.. .+ -+|+-.+..+..|-.+++ +|.|+|.-.++.. +-|- +.+|..+ T Consensus 1 m~mitLeeiK~hl--Rid~D--~~-~eD~lL~~y~~AA~~~~e---~~~~rkLy~~~~~~~~~p~----~~~gl~~---- 64 (110) T protein:vir:57 1 MGMTSLSNVKTQL--RLEED--FT-EHDDFIESLIDAAQRSIE---RTYYCVLVDSQEALEKLPE----GVRGFLI---- 64 (110) T ss_pred CCCCCHHHHHHHc--CCCCC--CC-hhHHHHHHHHHHHHHHHH---HHhCCcccCCccccccCCC----CCCcccc---- Confidence 3457899999874 43111 11 145556666666677776 4677776543321 2231 2345444 Q ss_pred chHHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhcc Q lcl|NC_019407. 95 IPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKT 161 (173) Q Consensus 95 IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~ 161 (173) ++.|+.|+..|.-+...+ ||.|++. .....+-.++.||.|+....- T Consensus 65 -~~di~~A~Lllv~hwYeN------------REav~~~--------~~~~~P~~v~~Ll~P~~~~~~ 110 (110) T protein:vir:57 65 -EPDTQLAARMMVAQWYLN------------PKGTSPD--------GDTPAQLGVEYLLFPLMEHTV 110 (110) T ss_pred -CHHHHHHHHHHHHHHHhc------------ccccccc--------cccchhHHHHHHHHHHHhhcC Confidence 567999999988665543 2222221 122335678999999976543 No 52 >protein:vir:3034 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438147;genbank:gi:16271810;genbank:GeneID:929268 Probab=43.58 E-value=0.48 Score=22.32 Aligned_cols=97 Identities=12% Similarity=0.117 Sum_probs=48.3 Q ss_pred HHHHHHHHhhhhhc-cccccCCccccccCCcCCCcccCCeeeccccch---HHHHHHHHHHHHHHHcCCCCCCcccccee Q lcl|NC_019407. 50 FLVRASKYLDRTIA-WAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIP---QQLMEATAEMAAALMNNDWTSPQTTRGMK 125 (173) Q Consensus 50 aL~~As~~id~~~~-~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~IP---~~V~~A~~elA~~~~~~~~~~~~~~~~v~ 125 (173) ++.+|..-||..++ |. .+. 
-+..| +- ..+|.|.|.--..+-+.+-......+.++ T Consensus 1 L~k~A~~~Id~~t~~fY-~~~-------------------dle~D-~~~R~~~fK~Aia~QI~Yld~~G~~t~~d~~s~~ 59 (111) T protein:vir:30 1 MEKRASHAVNLYCRNRY-DYK-------------------DLKKE-IALVQKAVKRAIAYQIAYLNDSGVMTAEDKQSFA 59 (111) T ss_pred CchhhHHHHhHhhchhh-hhh-------------------hHHHH-HHHHHHHHHHHHHHHHHHHHhcCCCChhhccCcc Confidence 67789999997442 22 111 12222 21 34555555443333333333333466699 Q ss_pred EEecCeeEEeecCCCCC-------ccchHHHHH---HHHhhhhhccCCcccccc Q lcl|NC_019407. 126 EIQVDVIELKFDSEIQR-------GSMPDIVMS---ILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 126 ~~kVG~isveY~~~~~~-------~~~~~~v~~---lL~~ll~~~~G~~~~~~r 169 (173) +.+||-.+++|+..... ..+|..... +|...+--. +|..+=| T Consensus 60 SisvGrTsiS~~~~~~~~~~~~~t~~~~~l~~da~n~L~~~Glly--~GV~yd~ 111 (111) T protein:vir:30 60 GISLGRTSISYTVGHGQGSQQKTLADRFNLCLDAENELLVVGLGY--TGISYDR 111 (111) T ss_pred eeeecceeeeccCccCCCCccccccccccchHHHHHHHHhhcccc--ccccccC Confidence 99999999998643321 133443333 332221101 3444445 No 53 >protein:vir:94507 Length: 113 # NCBI annotation: putative DNA packaging protein # Family: family:all:372 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223891;genbank:gi:62327103;genbank:GeneID:5075523 Probab=41.48 E-value=0.92 Score=20.77 Aligned_cols=113 Identities=11% Similarity=0.081 Sum_probs=68.1 Q ss_pred cccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccchHH Q lcl|NC_019407. 19 YCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQQ 98 (173) Q Consensus 19 Y~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~IP~~ 98 (173) -..++..+.-. +. ++...|+..+..|.+|.+.|=..+. . 
++ +..+.||.+ T Consensus 1 M~~L~~vK~~l-----gi--~d~~~D~lL~~iI~~a~~~i~~~l~---~------------------~~--~~~~~iP~~ 50 (113) T protein:vir:94 1 MALLDSIKLRI-----GI--EDTKQDDLLTDIISDVQARVLAYVN---Q------------------DG--LVQSELPNG 50 (113) T ss_pred CchHHHHHHHh-----CC--CCCchhhHHHHHHHHHHHHHHHHhC---C------------------cc--chhhhhhhH Confidence 22234443221 22 3444577788888888888865321 1 11 223689999 Q ss_pred HHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhccCCccccccee Q lcl|NC_019407. 99 LMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTRPAFKKII 171 (173) Q Consensus 99 V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~G~~~~~~rv~ 171 (173) +..-.++.|+...|- -...++++.+++-.|++|.... -|.-.+..|.-+.....+++.| .|.+ T Consensus 51 l~~Iv~evavkryNR-----~g~EG~~S~SeeG~S~sf~~~~----df~~y~~~l~~~~~~~~~~~~g-~rF~ 113 (113) T protein:vir:94 51 LDFVIKDVTIRIYNK-----IGDEGKESSSEGNVSNTWDTPA----DLSEYSDVLDVYRKSYKRRSAG-MRFI 113 (113) T ss_pred HHHHHHHHHHHHhcc-----cCCccceeeecCceeeeecCcc----chhhHHHHHHHHHhhccCCCCC-ceeC Confidence 999999999987653 2344799999999999996422 1333444454554443444444 2666 No 54 >protein:vir:98481 Length: 136 # NCBI annotation: ORF3 # Family: family:all:32440 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958281;genbank:gi:41057255;uniprot:Q38596;genbank:GeneID:2732865 Probab=38.91 E-value=1 Score=20.48 Aligned_cols=113 Identities=18% Similarity=0.146 Sum_probs=52.9 Q ss_pred ccccccHHHHHHHHHhccCCcccCCCCCHH---HHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeecc Q lcl|NC_019407. 16 ANSYCDVQFADDYIYANVYANTAWDALDQD---EKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPS 92 (173) Q Consensus 16 AnSY~tv~~aday~~~r~~~~~~w~~~~~~---~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~ 92 (173) -.+|+|+++..+.|.. ...+ +++ ..+++|-.|++.|-.++ |.-+. 
T Consensus 1 M~~fAtv~Dl~~rw~~-----~~~d--ee~~ra~~~~lL~dAS~~ir~~~---------------p~~~~---------- 48 (136) T protein:vir:98 1 MAAYATVEDYQARAAV-----TLPD--GSPRRAQVEAYLDDASALMARHI---------------PTGHT---------- 48 (136) T ss_pred CCccCCHHHHHHHhcc-----CCCC--chhHHHHHHHHHHHHHHHHHHhC---------------CCCCC---------- Confidence 6899999999876532 1111 222 34667999999998642 33111 Q ss_pred ccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccch---HHHHHHHHhhhhhccCCcccccc Q lcl|NC_019407. 93 DAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMP---DIVMSILEGLGVVKTGTRPAFKK 169 (173) Q Consensus 93 ~~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~---~~v~~lL~~ll~~~~G~~~~~~r 169 (173) .-|.-++.-+|......+.++. +..+++.|.-+-.+..+ |..| .-++. | ++.....|...+-.- T Consensus 49 -~~~~~~~~V~~~~V~R~~~np~-------G~~s~TaG~ys~s~t~~---G~Lylt~~E~~~-L-g~~rqr~~~~d~a~s 115 (136) T protein:vir:98 49 -PDPGTLRAICVAVVRRVMANPG-------GYRQRTIGQYAETLGED---GGLYLTEDEKGQ-L-QPPDQTAPDADAAYS 115 (136) T ss_pred -CChhHHHHHHHHHHHHHhhCCC-------CcccccchhHHHhhhcC---CCcccChHHHHH-h-CCCCCccccccccee Confidence 1144455666666666665432 34456677543332221 2211 22222 2 111111111111111 Q ss_pred e--------ecC Q lcl|NC_019407. 170 I--------IRH 173 (173) Q Consensus 170 v--------~R~ 173 (173) | .|. T Consensus 116 i~~~~~~~~~~~ 127 (136) T protein:vir:98 116 LDLDPGTRAWVD 127 (136) T ss_pred cccCCCcCCcCC Confidence 1 121 No 55 >protein:vir:1384 Length: 92 # NCBI annotation: Gp7 protein # Family: family:all:316 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612836;genbank:gi:20065970;genbank:GeneID:935785 Probab=35.57 E-value=1.2 Score=20.10 Aligned_cols=92 Identities=14% Similarity=0.063 Sum_probs=55.6 Q ss_pred ccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccchHHH Q lcl|NC_019407. 20 CDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIPQQL 99 (173) Q Consensus 20 ~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~IP~~V 99 (173) +|+++++.|. |.-+. 
-+|+..+..+-.|-.||.. +.|+ .+..|+.+ T Consensus 1 vtLeevK~~L--RID~d-----dDD~lI~~~i~aA~~~i~~---~~~~------------------------~~~~~~~~ 46 (92) T protein:vir:13 1 MDLRELKEYL--RIDFE-----EDDILLRSLLLAAEEYLYN---AGIK------------------------RDYKKSLY 46 (92) T ss_pred CCHHHHHHHc--CCCCC-----cchHHHHHHHHHHHHHHHh---hccc------------------------cccchhHH Confidence 9999999995 43222 2777888888888899964 2221 12356788 Q ss_pred HHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHhhhhhccCCc Q lcl|NC_019407. 100 MEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEGLGVVKTGTR 164 (173) Q Consensus 100 ~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~ll~~~~G~~ 164 (173) +.|++.|+-+...+-... . ........+-.|..||.+|-.....+| T Consensus 47 ~~Avlllv~~~YenR~~~----------~---------~~~~~~~ip~~v~sll~~lR~~~~~~~ 92 (92) T protein:vir:13 47 SLAIKILVKHWYDNRDCV----------V---------AGNVNNKLEYSLNAILTQLRYCGDDNG 92 (92) T ss_pred HHHHHHHHHHhHhccccc----------c---------ccchhhhhhHHHHHHHHHhhhccCCCC Confidence 888888886554332110 0 001112345678888888754333333 No 56 >protein:vir:8104 Length: 170 # NCBI annotation: gp8 # Family: family:all:3238 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817685;genbank:gi:29566116;genbank:GeneID:1259310 Probab=32.22 E-value=1.4 Score=19.71 Aligned_cols=116 Identities=12% Similarity=0.084 Sum_probs=64.2 Q ss_pred hccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCc---cc--cccC--------Cc---CCC--cccCCee--- Q lcl|NC_019407. 31 ANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDE---DS--GLRW--------PR---AGV--YDIDGFL--- 89 (173) Q Consensus 31 ~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~---~Q--~law--------PR---~gv--~~~dg~~--- 89 (173) .|++ - +++.+.+.+|.-|+.-+.+. +|.+..| ++ .+.+ |- ..+ ...||.. 
T Consensus 1 ~~~~--~----a~~~~~q~~l~aA~a~vR~~---cGwhv~P~v~d~t~~ldg~G~~vl~LPt~pvvsV~sV~~~G~~l~~ 71 (170) T protein:vir:81 1 MRGQ--F----ADNTEAQAAIDAVLAAARRW---CGWHVSPVIIDDVMEVDGPGGRVLSLPTLNLVSVKSVVELGYALDV 71 (170) T ss_pred Cccc--c----cCchHHHHHHHHHHHHHHHH---hCCcccceecccEEEEeCCCCeeEECCCCcceeeEEEEECCeeecC Confidence 5643 2 26677778888888888753 4433221 22 1111 11 000 0112222 Q ss_pred ---------------------------------eccccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEee Q lcl|NC_019407. 90 ---------------------------------IPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKF 136 (173) Q Consensus 90 ---------------------------------~~~~~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY 136 (173) +.++++|..+..-.|.+|-++.. ++....+++.++++++.+| T Consensus 72 ~~~~~~~~~glL~r~~G~~~~~~~~V~VT~tHGy~~~~apd~~~~vi~~~a~r~~~-----s~~~~~l~~~~~~~vs~~~ 146 (170) T protein:vir:81 72 STLDRSRRKGTLTKPYGRWTARDGAIVVTATHGFTETEAADWRRAVVQLVGRRAQT-----SRPSADLKRKKVDDVEYEW 146 (170) T ss_pred ccceeecCCceEEecCCccccccceEEEEEEeCCCCCccchHHHHHHHHHHHHhhc-----cCCcccceeeeccceeeee Confidence 34457899898888888876543 2333468899999999999 Q ss_pred cCCCCCccchHHHHHHHHhhhhhccCCcc Q lcl|NC_019407. 137 DSEIQRGSMPDIVMSILEGLGVVKTGTRP 165 (173) Q Consensus 137 ~~~~~~~~~~~~v~~lL~~ll~~~~G~~~ 165 (173) .... .... +.-..+|.+| .+|..+ T Consensus 147 ~~~~-~s~~-~~~~~iL~~Y---rl~~~p 170 (170) T protein:vir:81 147 FETA-VSVD-AELSAVFSPF---RILPSP 170 (170) T ss_pred cccc-cccC-HHHHHhhhhc---ccCCCC Confidence 7433 1122 3334467776 344455 No 57 >protein:vir:4702 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061634;genbank:gi:9635721;genbank:GeneID:1263015 Probab=22.28 E-value=2.5 Score=18.43 Aligned_cols=106 Identities=10% Similarity=0.051 Sum_probs=55.7 Q ss_pred eeEEeeCCCCCCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCC Q lcl|NC_019407. 
3 FTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGV 82 (173) Q Consensus 3 M~liVe~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv 82 (173) |+++.|+ +++++.|+ |.-+. -+|+-.+..+-.|-.||.. +.|.+... .| T Consensus 1 M~vt~~d------------LeeiK~~L--RID~d-----~DD~li~~~i~AA~~~I~~---ai~~~~~~-----~~---- 49 (113) T protein:vir:47 1 MQLTAEE------------LKLLKKHC--KIDHN-----SEDDLLEIYYSWAFHEIAS---AVTDEPSK-----YI---- 49 (113) T ss_pred CcccHHH------------HHHHHHHh--CCCCC-----cchHHHHHHHHHHHHHHHh---hccccccc-----cc---- Confidence 6666544 89999996 54222 2677777777788899964 34432211 01 Q ss_pred cccCCeeeccccchHHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccchHHHHHHHHh------- Q lcl|NC_019407. 83 YDIDGFLIPSDAIPQQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMPDIVMSILEG------- 155 (173) Q Consensus 83 ~~~dg~~~~~~~IP~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~~v~~lL~~------- 155 (173) +....|+.++.|++.|+-....+-.. +... .....|-.|..||.+ T Consensus 50 --------~~~~~~~~~~~AvllLv~~~YeNR~a------------~~~~--------~~~~vp~~v~sli~qlR~~y~~ 101 (113) T protein:vir:47 50 --------DWFKSHPLFARAIYPLASYYFENRIA------------YLDR--------DLSLAPHMVLSTVHKLRGSFEQ 101 (113) T ss_pred --------cccCCchHHHHHHHHHHHHHHhhhhh------------cccc--------ccccccHHHHHHHHHHHHHHHH Confidence 11234678999999998665543211 1000 111222234444433 Q ss_pred hhhhccCCcccc Q lcl|NC_019407. 156 LGVVKTGTRPAF 167 (173) Q Consensus 156 ll~~~~G~~~~~ 167 (173) ++....|+..|. 
T Consensus 102 ~~~~~~~~~~~~ 113 (113) T protein:vir:47 102 FLESENDEESGT 113 (113) T ss_pred HhhhcCCCCCCC Confidence 344444544444 No 58 >protein:vir:94064 Length: 167 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453623;genbank:gi:84662659;genbank:GeneID:5142574 Probab=21.77 E-value=2.5 Score=18.36 Aligned_cols=124 Identities=10% Similarity=0.048 Sum_probs=56.3 Q ss_pred eeEEeeCCCCCCCccccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHH-HhhhhhccccccCCccccccCCcCC Q lcl|NC_019407. 3 FTFVVETGAGDPAANSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASK-YLDRTIAWAGEKVDEDSGLRWPRAG 81 (173) Q Consensus 3 M~liVe~g~g~~~AnSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~-~id~~~~~~G~r~~~~Q~lawPR~g 81 (173) |..+|= +++.+.+ + .+..+..+|+..+..|..|.. ++|.. +|.- T Consensus 1 M~~~~F------------d~~~FR~----~---fPeFa~~Pd~~i~~~l~~A~~~~l~~~-~~s~--------------- 45 (167) T protein:vir:94 1 MAVVVF------------DPTAFKL----V---YPEFVAVPDARLTALFNTVGYTILDNT-DASV--------------- 45 (167) T ss_pred CCcccC------------ChHHHHH----h---chhcccCCHHHHHHHHHHHHHhhcCCC-Cccc--------------- Confidence 555552 3444332 2 244556789999998888854 55531 2110 Q ss_pred CcccCCeeeccccchHHHHHHHHHHHHHHHc--C----C-C--CCCccccceeEEecCeeEEeecCCCCCccchH----- Q lcl|NC_019407. 82 VYDIDGFLIPSDAIPQQLMEATAEMAAALMN--N----D-W--TSPQTTRGMKEIQVDVIELKFDSEIQRGSMPD----- 147 (173) Q Consensus 82 v~~~dg~~~~~~~IP~~V~~A~~elA~~~~~--~----~-~--~~~~~~~~v~~~kVG~isveY~~~~~~~~~~~----- 147 (173) + .| ...-+++...++.+++. + + . ......+.|+++++|+|||.|+........-. T Consensus 46 -~-~~---------~~~~~~~l~LltAHll~L~~~~~a~~~~~~~~g~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T 114 (167) T protein:vir:94 46 -I-VD---------PLRRAPLLDLLVAHMLALFGYVNADGSITPGTGTVGRVANASEGSVSTSLAYSTPTGAGEAWFTQT 114 (167) T ss_pred -c-cc---------hhhHHHHHHHHHHHHHHHhhhhhhhcccccccccchheeeccccceeeeeecCCCCCchhhhhhcC Confidence 0 01 01122333344443321 1 1 1 11112355899999999999987654432211 Q ss_pred ----HHHHHHHhhhh--hccCCcccccceecC Q lcl|NC_019407. 
148 ----IVMSILEGLGV--VKTGTRPAFKKIIRH 173 (173) Q Consensus 148 ----~v~~lL~~ll~--~~~G~~~~~~rv~R~ 173 (173) -.-+|++.+.. .-.|+.++ +.--|+ T Consensus 115 ~YGq~fwaL~~~~g~Gg~v~gG~~~-~~~~~~ 145 (167) T protein:vir:94 115 PYGAMYWAMSAPFRSFHYVAAGLSG-VGYSQD 145 (167) T ss_pred HHHHHHHHHHHHhcccccccCCCCC-CCCCcc Confidence 22334554432 11111110 111222 No 59 >protein:vir:2432 Length: 124 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046834;genbank:gi:9630402;genbank:GeneID:1261576 Probab=21.63 E-value=2.6 Score=18.33 Aligned_cols=119 Identities=14% Similarity=0.138 Sum_probs=59.9 Q ss_pred cccccHHHHHHHHHhccCCcccCCCCCHHHHHHHHHHHHHHhhhhhccccccCCccccccCCcCCCcccCCeeeccccch Q lcl|NC_019407. 17 NSYCDVQFADDYIYANVYANTAWDALDQDEKERFLVRASKYLDRTIAWAGEKVDEDSGLRWPRAGVYDIDGFLIPSDAIP 96 (173) Q Consensus 17 nSY~tv~~aday~~~r~~~~~~w~~~~~~~ke~aL~~As~~id~~~~~~G~r~~~~Q~lawPR~gv~~~dg~~~~~~~IP 96 (173) =+|+|+++..+.|. |..+. -.....+..|-.|++.|-++ ||.-... +..+.-| T Consensus 1 ~~~At~~Dv~~rw~-r~Lt~-----~E~~~ve~lL~dAs~~ir~r---------------~P~l~~~------~~~~~~~ 53 (124) T protein:vir:24 1 MAYATADDVVTLWA-KEPEP-----EVMALIERRLEQVERMIRRR---------------IPDLDAR------VSSDIFR 53 (124) T ss_pred CCCCCHHHHHHHhC-CCCCH-----HHHHHHHHHHHHHHHHHHhc---------------CCCcchh------cCCCCCh Confidence 68999999988764 32111 12344678888998888754 3432211 1112335 Q ss_pred HHHHHHHHHHHHHHHcCCCCCCccccceeEEecCeeEEeecCCCCCccch--HHHHHHHHhhhhhccCCccc-cc--cee Q lcl|NC_019407. 97 QQLMEATAEMAAALMNNDWTSPQTTRGMKEIQVDVIELKFDSEIQRGSMP--DIVMSILEGLGVVKTGTRPA-FK--KII 171 (173) Q Consensus 97 ~~V~~A~~elA~~~~~~~~~~~~~~~~v~~~kVG~isveY~~~~~~~~~~--~~v~~lL~~ll~~~~G~~~~-~~--rv~ 171 (173) ..|+.-+|..-..++.++. ++++++.|.-+-.-....+.|..+ +-=-.+|.|- . +.+.+ +. -+. 
T Consensus 54 ~~v~~V~a~~V~R~~rnP~-------G~~s~T~G~Ys~sl~~~~~~g~Lylt~~E~~~Lg~~---r-~~~~~~i~p~~~~ 122 (124) T protein:vir:24 54 ADLIDIEADAVLRLVRNPE-------GYLSETDGAYTYQLQADLSQGKLVILDEEWTTLGVN---R-LSRMSTLVPNIVM 122 (124) T ss_pred hhHHHHHHHHHHHHhhCCC-------CceecccchhHHhhhhcccCCceeeCHHHHHhhCcc---c-ccceeEeecceee Confidence 6677888888888776543 466777787654433333334332 2211223221 1 11111 11 011 Q ss_pred cC Q lcl|NC_019407. 172 RH 173 (173) Q Consensus 172 R~ 173 (173) =+ T Consensus 123 ~~ 124 (124) T protein:vir:24 123 PT 124 (124) T ss_pred CC Confidence 11 Done!