Query lcl|Aclame:protein:vir:97267|NCBI_annot:hypothetical protein ORF024|genbank:acc:YP_001294532;genbank:gi:149408253;genbank:GeneID:5237132 Match_columns 172 No_of_seqs 92 out of 98 Neff 5.9 Searched_HMMs 1612 Date Mon Dec 2 01:56:44 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_81 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_81_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:97267 Length: 172 100.0 1.5E-67 9.6E-71 386.9 16.4 172 1-172 1-172 (172) 2 protein:vir:95004 Length: 169 100.0 4.6E-55 2.9E-58 318.5 15.4 163 1-172 1-169 (169) 3 protein:vir:78383 Length: 169 100.0 6.3E-55 3.9E-58 317.7 15.3 163 1-172 1-169 (169) 4 protein:vir:80389 Length: 172 100.0 9.4E-55 5.9E-58 316.8 15.5 161 1-172 1-172 (172) 5 protein:vir:95176 Length: 172 100.0 3.7E-54 2.3E-57 313.5 14.9 164 1-172 3-169 (172) 6 protein:vir:94955 Length: 170 100.0 2.9E-52 1.8E-55 303.1 15.1 160 1-172 1-163 (170) 7 protein:vir:43 Length: 131 # N 97.8 7.4E-07 4.6E-10 54.2 10.2 124 16-172 1-127 (131) 8 protein:vir:80967 Length: 131 97.7 9.1E-07 5.7E-10 53.7 10.1 122 16-172 1-127 (131) 9 protein:vir:98900 Length: 132 97.3 6.5E-06 4.1E-09 49.0 10.0 122 16-172 1-128 (132) 10 protein:vir:79701 Length: 144 94.3 0.0022 1.4E-06 35.1 10.3 136 15-170 1-144 (144) 11 protein:vir:107756 Length: 147 93.3 0.003 1.9E-06 34.4 9.1 128 1-172 1-137 (147) 12 protein:vir:7857 Length: 188 # 93.2 0.0012 7.6E-07 36.6 6.8 122 1-166 1-188 (188) 13 protein:vir:101652 Length: 188 93.2 0.0012 7.6E-07 36.6 6.8 122 1-166 1-188 (188) 14 protein:vir:9576 Length: 131 # 92.7 0.007 4.3E-06 32.4 10.2 129 15-172 1-129 (131) 15 protein:vir:99570 Length: 153 92.5 0.0083 5.2E-06 32.0 10.4 131 1-172 1-144 (153) 16 protein:vir:5256 Length: 119 # 91.4 0.01 6.2E-06 31.6 9.6 113 18-170 1-119 (119) 17 protein:vir:4788 Length: 130 # 91.4 0.01 6.4E-06 31.5 9.6 126 16-171 1-130 (130) 18 protein:vir:1435 Length: 188 # 90.5 0.018 1.1E-05 30.1 10.2 129 1-166 1-188 (188) 19 protein:vir:9761 Length: 140 # 89.2 0.028 1.7E-05 29.1 10.1 129 15-172 1-129 (140) 20 protein:vir:9821 Length: 138 # 89.1 0.025 1.5E-05 29.4 9.7 131 1-171 1-138 (138) 21 protein:vir:94761 Length: 132 86.4 0.046 2.9E-05 27.9 10.4 130 15-172 1-130 (132) 22 protein:vir:99002 Length: 158 86.0 0.034 2.1E-05 28.7 8.5 120 15-172 1-120 (158) 23 protein:vir:100103 Length: 120 84.3 0.048 3E-05 27.8 8.6 119 12-168 1-120 (120) 24 protein:vir:80320 Length: 188 78.6 0.11 7E-05 25.8 10.0 127 1-166 1-188 (188) 25 protein:vir:79074 Length: 150 76.6 0.066 4.1E-05 27.1 6.6 128 16-172 1-149 (150) 26 protein:vir:2505 Length: 128 # 76.4 0.075 4.6E-05 26.8 6.8 125 12-172 1-127 (128) 27 protein:vir:103846 Length: 138 76.3 0.019 1.2E-05 30.1 3.5 126 16-172 1-137 (138) 28 protein:vir:1640 Length: 132 # 75.7 0.14 8.9E-05 25.2 9.8 129 15-172 1-130 (132) 29 protein:vir:107864 Length: 150 74.3 0.084 5.2E-05 26.5 6.5 128 16-172 1-149 (150) 30 protein:vir:1887 Length: 108 # 72.2 0.19 0.00012 24.6 8.3 107 1-172 1-107 (108) 31 protein:vir:192 Length: 108 # 72.2 0.19 0.00012 24.6 8.3 107 1-172 1-107 (108) 32 protein:vir:1993 Length: 141 # 71.2 0.013 8.1E-06 30.9 1.3 127 16-172 1-139 (141) 33 protein:vir:94064 Length: 167 66.4 0.27 0.00017 23.7 8.0 129 1-172 1-157 (167) 34 protein:vir:100245 Length: 113 66.0 0.27 0.00017 23.7 8.0 113 16-168 1-113 (113) 35 protein:vir:99222 Length: 138 65.6 0.26 0.00016 23.8 7.3 125 16-166 1-138 (138) 36 protein:vir:79253 Length: 138 65.6 0.26 0.00016 23.8 7.3 125 16-166 1-138 (138) 37 protein:vir:10365 Length: 115 57.4 0.43 0.00027 22.6 8.8 112 18-164 1-115 (115) 38 protein:vir:96108 Length: 155 56.5 0.45 0.00028 22.5 9.0 129 12-172 1-146 (155) 39 protein:vir:3639 Length: 158 # 43.5 0.84 0.00052 21.0 8.7 131 1-172 1-143 (158) 40 protein:vir:101559 Length: 158 43.5 0.84 0.00052 21.0 8.7 131 1-172 1-143 (158) 41 protein:vir:3034 Length: 111 # 41.8 0.7 0.00043 21.4 5.5 103 46-171 1-111 (111) 42 protein:vir:97069 Length: 115 40.0 0.99 0.00061 20.6 8.8 111 18-164 1-115 (115) 43 protein:vir:8104 Length: 170 # 32.8 1.4 0.00087 19.8 8.8 118 30-166 1-170 (170) 44 protein:vir:104344 Length: 132 32.2 1.2 0.00075 20.1 5.2 96 54-172 1-131 (132) 45 protein:vir:8430 Length: 189 # 28.9 1.7 0.0011 19.3 7.6 138 1-166 1-189 (189) 46 protein:vir:93592 Length: 108 28.2 1.8 0.0011 19.2 9.0 107 15-169 1-108 (108) 47 protein:vir:4512 Length: 107 # 24.0 2.2 0.0014 18.7 7.7 104 17-162 1-107 (107) 48 protein:vir:486 Length: 107 # 23.8 2.3 0.0014 18.6 7.6 101 17-162 1-107 (107) 49 protein:vir:78595 Length: 158 23.6 2.3 0.0014 18.6 8.9 131 1-172 1-143 (158) 50 protein:vir:106739 Length: 158 23.6 2.3 0.0014 18.6 8.9 131 1-172 1-143 (158) 51 protein:vir:81069 Length: 115 22.9 2.4 0.0015 18.5 8.8 110 18-164 1-115 (115) 52 protein:vir:99848 Length: 172 21.3 2.6 0.0016 18.3 6.6 126 15-160 1-172 (172) 53 protein:vir:103283 Length: 125 21.1 2.6 0.0016 18.3 6.6 90 81-172 1-119 (125) No 1 >protein:vir:97267 Length: 172 # NCBI annotation: hypothetical protein ORF024 # Family: family:all:703 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294532;genbank:gi:149408253;genbank:GeneID:5237132 Probab=100.00 E-value=1.5e-67 Score=386.87 Aligned_cols=172 Identities=100% Similarity=1.458 Sum_probs=165.3 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCC Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAW 80 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~ 80 (172) |+||||||||++|+||||+|+++|++||+.||++|++.++++||++|++|++|||+.|+|+|+|+++++|+|+|||+|++ T Consensus 1 m~liveD~t~~~~~AnSYvtv~~a~aY~~~rg~~~~a~~~~~ke~aLi~A~~yiD~~~~f~G~r~~~~~Q~l~WPRtg~~ 80 (172) T protein:vir:97 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAW 80 (172) T ss_pred CceEeeCCCCCCCCccccccHHHHHHHHHhcCcccCCCCcHHHHHHHHHHHHHHhhhhhcccCCCCCcchhhhcccCCCC Confidence 99999999999999999999999999999999999999999999999999999999889999987789999999999999 Q ss_pred CCCccccccchHHHHHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhh Q lcl|Aclame:pro 81 DRDRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRA 160 (172) Q Consensus 81 ~~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~ 160 (172) +++.+++|.||++||+||||||+++++++++++.+..+..+.+.+||+|||+|+++|+..+++.+..|+|++|++||.|+ T Consensus 81 d~~~~~~~~IP~~v~~A~~elA~~al~~~l~~d~~~~~~~~~v~~kr~kvg~i~~~y~~~~~~~~~~p~~~~v~aLL~p~ 160 (172) T protein:vir:97 81 DRDRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRA 160 (172) T ss_pred CCcccccccccHHHHHHHHHHHHHHHhcccccccccccccccceeeeeeecceeeEeeccCCCCCccccHHHHHHHHhhh Confidence 99999999999999999999999999999999999888888889999999999999987666667789999999999999 Q ss_pred ccccCCcccccC Q lcl|Aclame:pro 161 GLVRSGGTLLRG 172 (172) Q Consensus 161 g~~~~~g~~~r~ 172 (172) |++++||.|+|| T Consensus 161 gl~~~~~~~~r~ 172 (172) T protein:vir:97 161 GLVRSGGTLLRG 172 (172) T ss_pred ccccCcceeccC Confidence 999999999999 No 2 >protein:vir:95004 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224025;genbank:gi:62327312;genbank:GeneID:5176832 Probab=100.00 E-value=4.6e-55 Score=318.48 Aligned_cols=163 Identities=23% Similarity=0.231 Sum_probs=142.1 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhh-hccCccccCccccccccccCC Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRF-NFVGKKRLGRDQTTEWPRTDA 79 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~-~~~G~r~~~~~Q~lawPR~g~ 79 (172) |+||||||+| +|+||||+|+++|++||++||+.+++ |+++||++|++|++|||+++ +|+|+| .+++|+|+|||+|+ T Consensus 1 M~liv~~~~g-~~~anSYvt~~ea~aY~~~rg~~~~~-dd~~~e~aL~~A~~yid~~~~~f~G~r-~~~~Q~l~wPRtg~ 77 (169) T protein:vir:95 1 MPLIVETGQG-LPNADSYVSLEDGRALAAKYGLELPE-DDIAAEASLRNGAVYVGLFESQMCGRR-VSANQALAFPRTGI 77 (169) T ss_pred CeeEEeCCCC-CCcccccccHHHHHHHHHHcCCcCCC-CHHHHHHHHHHHHHHhhcccccccccc-CCcchhhccccCCc Confidence 9999999999 79999999999999999999997775 89999999999999999964 799999 68999999999998 Q ss_pred -CCCCccccccchHHHHHHHHHHHHHHHhccCCCCcccccccccceeeEEee-CceEEEeeccCCCCCCCCcHHHHHHHh Q lcl|Aclame:pro 80 -WDRDRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAV-GPISESVTFVGGAVFQMPKYPAADQKL 157 (172) Q Consensus 80 -~~~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kv-G~i~~~y~~~~~~~~~~p~~~~v~~lL 157 (172) ++++.++++.||++||+||||||+++++++..++.+. ...++++|+ |+|+++|+. ++..+..|+|+++++|| T Consensus 78 ~~~g~~~~~~~IP~~V~~A~~elA~~~~~g~~~~~~~~-----~~~v~~e~v~G~i~veY~~-~~~~~~~~~~~a~~~LL 151 (169) T protein:vir:95 78 DLHGFPQPSNVIPSLVIQAQVMAAVEYGAGTDVRGSTD-----GREVQTERVEGAVTVSYFK-NGYSGGTVSITAADDAL 151 (169) T ss_pred eecccccccccchHHHHHHHHHHHHHHHcCccccCCCC-----ccceeeeeeccceeEeecC-CCCcCccccHHHHHHhh Confidence 6999999999999999999999999999876555432 234677666 999999985 45556679999999999 Q ss_pred hhhccccCCc---ccccC Q lcl|Aclame:pro 158 VRAGLVRSGG---TLLRG 172 (172) Q Consensus 158 ~~~g~~~~~g---~~~r~ 172 (172) .|+.+..+|+ .++|| T Consensus 152 ~p~l~g~~g~~~i~~~rg 169 (169) T protein:vir:95 152 RPLLCGSNNAYSFNVFRG 169 (169) T ss_pred hhhcccCCCcceeeeecC Confidence 9987766554 47788 No 3 >protein:vir:78383 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110841;genbank:gi:134288602;genbank:GeneID:5179646 Probab=100.00 E-value=6.3e-55 Score=317.74 Aligned_cols=163 Identities=23% Similarity=0.250 Sum_probs=141.5 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhh-hccCccccCccccccccccCC Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRF-NFVGKKRLGRDQTTEWPRTDA 79 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~-~~~G~r~~~~~Q~lawPR~g~ 79 (172) |+||||||+| +|+||||+|+++|++||++||+.+++ |+++||++|++|++|||+++ +|+|+| .+++|+|+|||+|+ T Consensus 1 MaliV~~~~g-~~~anSYvtv~~a~aY~~~rg~~~~~-d~~~~e~aL~~A~~yid~~~~~f~G~r-~~~~Q~l~wPRtg~ 77 (169) T protein:vir:78 1 MPLIVETGQG-IPNADSYVSLEDGRALAAKYGLELPE-DDTAAEAALRNGAVYVGLFESQMCGRR-VSANQALAFPRTGV 77 (169) T ss_pred CeeEeeCCCC-CccccccccHHHHHHHHHHcCCcCCC-ChHHHHHHHHHHHHHhhhccccceeee-CCcccccccccCCc Confidence 9999999999 79999999999999999999997775 89999999999999999853 899999 68999999999998 Q ss_pred -CCCCccccccchHHHHHHHHHHHHHHHhccCCCCcccccccccceeeEEee-CceEEEeeccCCCCCCCCcHHHHHHHh Q lcl|Aclame:pro 80 -WDRDRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAV-GPISESVTFVGGAVFQMPKYPAADQKL 157 (172) Q Consensus 80 -~~~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kv-G~i~~~y~~~~~~~~~~p~~~~v~~lL 157 (172) ++++.+|++.||++||+||||||++++++++.++... ...+++||| |+|++||+. ++..+..|+|+++++|| T Consensus 78 ~~~g~~~~~~~IP~~v~~A~~elA~~~~~g~~~~~~~~-----~~~v~~e~v~G~i~veY~~-~~~~~~~~~~~~~~~LL 151 (169) T protein:vir:78 78 TLHGFPQPSNVIPPLVIQAQVMAAVEYGAGTDVRGSTD-----GREVQTERVEGAVTVSYFK-NGYSGGTVSITTADDAL 151 (169) T ss_pred eecccccccccchHHHHHHHHHHHHHHhcCcccCCCCC-----cceeEEEEecCceeEeecC-CCCCCCcccHHHHHHHh Confidence 6999999999999999999999999999876655432 234777777 999999975 44556679999999999 Q ss_pred hhhccccCCcc---cccC Q lcl|Aclame:pro 158 VRAGLVRSGGT---LLRG 172 (172) Q Consensus 158 ~~~g~~~~~g~---~~r~ 172 (172) +|+.+..+|+. ++|| T Consensus 152 ~p~l~~~~g~~~i~~~rg 169 (169) T protein:vir:78 152 RPLLCGSNNAYSFNVFRG 169 (169) T ss_pred hhhcccCCCcceeeeecC Confidence 99877555543 5688 No 4 >protein:vir:80389 Length: 172 # NCBI annotation: BcepGomrgp09 # Family: family:all:703 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210229;genbank:gi:146329921;genbank:GeneID:5123498 Probab=100.00 E-value=9.4e-55 Score=316.77 Aligned_cols=161 Identities=29% Similarity=0.388 Sum_probs=136.8 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhh-hhccCccccCccccccccccCC Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQR-FNFVGKKRLGRDQTTEWPRTDA 79 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~-~~~~G~r~~~~~Q~lawPR~g~ 79 (172) |+||||||+| +|+||||+|+++|++||++||++++ +++||++|++|++|||++ ++|+|+| .+++|+|+|||+|+ T Consensus 1 Malived~~g-~~~anSYvt~~~a~aY~~~rg~~~~---~d~~e~aL~~A~dyid~~~~~f~G~r-~~~~Q~l~wPR~g~ 75 (172) T protein:vir:80 1 MALIVEDGTG-KPDANTYAGADFVIAYAQARGVTVD---ADEAERLILEAMDYIESFRRRWKGER-NTREQGLTWPRHDA 75 (172) T ss_pred CeeEeeCCCC-CccccccccHHHHHHHHHHcCCCcC---HHHHHHHHHHHHHHHhhccCcccccc-CCccccccccccCc Confidence 9999999998 7999999999999999999977655 568999999999999996 3799999 68999999999998 Q ss_pred -CCCCccccccchHHHHHHHHHHHHHHHhc-cCCCCcccccccccceeeEEeeCceEEEeeccCCC------CCCCCcHH Q lcl|Aclame:pro 80 -WDRDRYYINDIPPEVKEACAEYALRALAA-ELNPDPERNASGVAVLSKSEAVGPISESVTFVGGA------VFQMPKYP 151 (172) Q Consensus 80 -~~~~~~~~d~IP~~V~~A~~elA~~~l~~-~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~------~~~~p~~~ 151 (172) ++++.++++.||++||+||||||++++++ ++.++. ....+||||||+|++||+..... ++..|+|+ T Consensus 76 ~~~g~~~~~~~IP~~v~~A~~elA~~~~~g~~~~~~~------~~~~v~~ekVG~i~~eY~~~~~~~~~~~~~~~~~~~~ 149 (172) T protein:vir:80 76 VVDGFVIPSDVIPKELQSAVAAAVIEQVNGFELQQSQ------DQWAVRIEKVDVIEVQYAAGGGGQSASANAPMKPTFP 149 (172) T ss_pred ccCcccccccchhHHHHHHHHHHHHHHhcCCccCcCC------CCceeeEEeccceEEeeecccCccccccccCCccchH Confidence 69999999999999999999999988887 344332 23458999999999999854221 23478999 Q ss_pred HHHHHhhhhccccCC--cccccC Q lcl|Aclame:pro 152 AADQKLVRAGLVRSG--GTLLRG 172 (172) Q Consensus 152 ~v~~lL~~~g~~~~~--g~~~r~ 172 (172) +|++||+|+-+..+| -.|+|| T Consensus 150 ~v~~LL~p~l~~~gg~~~~~vrg 172 (172) T protein:vir:80 150 KIDALLNPLLVGDGGLFLVAVRG 172 (172) T ss_pred HHHHHHhhhhcCCCCeeeeeecC Confidence 999999998555444 257899 No 5 >protein:vir:95176 Length: 172 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:703 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293420;genbank:gi:148912841;genbank:GeneID:5228251 Probab=100.00 E-value=3.7e-54 Score=313.52 Aligned_cols=164 Identities=29% Similarity=0.394 Sum_probs=138.0 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhh-hhccCccccCccccccccccCC Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQR-FNFVGKKRLGRDQTTEWPRTDA 79 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~-~~~~G~r~~~~~Q~lawPR~g~ 79 (172) |+||||||+| +|+||||+|+++|++||++||..++ .++++||++|++|++|||++ ++|+|+| ++++|+|+|||+|+ T Consensus 3 Malive~~~g-~~~anSYvtv~ea~aY~~~rg~~~~-~~~~~ke~aL~~A~dyid~~~~~f~G~r-~~~~Q~l~wPR~g~ 79 (172) T protein:vir:95 3 ITIVVEDGSG-VTNANSYVSVADARIYASNRGVELP-LDDDELAAMLIRSTDYLEAQACRFQGKP-TSTTQALQWPRTGV 79 (172) T ss_pred eeEEEeCCCC-CCcccccccHHHHHHHHHhcCCcCC-CChHHHHHHHHHHHHHhhccCCceeeee-cCCcccccCCcCCc Confidence 9999999998 6999999999999999999988555 48899999999999999973 6999999 68999999999998 Q ss_pred -CCCCccccccchHHHHHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhh Q lcl|Aclame:pro 80 -WDRDRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLV 158 (172) Q Consensus 80 -~~~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~ 158 (172) ++++.++++.||++||+||||||++++++++... .......+||+|||+|+++|+. ++..+..|+|+++++||. T Consensus 80 ~~~~~~v~~~~IP~~V~~A~~elA~~~~~~~~~~~----~~~~~~~vk~~kVG~I~veY~~-~~~~~~~~~~~~v~~LL~ 154 (172) T protein:vir:95 80 FLNEDEVPSNVIPKSLIAAQVQLTMAINAGFDLQP----NVSPQDYVTREKVGPIETEYAD-PLSVGIMPTFTAANALLA 154 (172) T ss_pred ccCcccccccchhHHHHHHHHHHHHHHHcCccccc----cCCcccceeEEeccceEEeecc-CCCCCCcccHHHHHHHHh Confidence 5899999999999999999999998888754221 2223456899999999999975 344566799999999998 Q ss_pred hhccccC-CcccccC Q lcl|Aclame:pro 159 RAGLVRS-GGTLLRG 172 (172) Q Consensus 159 ~~g~~~~-~g~~~r~ 172 (172) |+...-+ ++.+.|- T Consensus 155 p~l~~~~~~~~~~r~ 169 (172) T protein:vir:95 155 PLFGECASNKFALRT 169 (172) T ss_pred hhhcccCCcceeeEE Confidence 8744333 3334444 No 6 >protein:vir:94955 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239280;genbank:gi:66392062;genbank:GeneID:5076600 Probab=100.00 E-value=2.9e-52 Score=303.15 Aligned_cols=160 Identities=23% Similarity=0.231 Sum_probs=139.1 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCc--cccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccC Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGN--SFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTD 78 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~--~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g 78 (172) |.+| |||+| +|+||||+|++||++||+.|+. .|...|+++||++|++|++|||+.|+|+|+| .+++|+|+|||+| T Consensus 1 m~~i-~~~~g-~~~AnSYvtv~ea~aY~~~r~~~~~w~~~~~~~~e~aL~~A~dyId~~~~f~G~r-~~~~Q~l~wPR~g 77 (170) T protein:vir:94 1 MPTV-DATPG-SITANSYVTVAEANSYFDGSYGRPLWTSASEDEKASLVISASRYLDQMMAWIGAP-TNPEQSMWWPCKN 77 (170) T ss_pred Ccee-ecCCC-CCcccceecHHHHHHHHHhhccccccCCCCHHHHHHHHHHHHHHhcccccccccc-CCcchhhcccccC Confidence 7555 89998 6999999999999999999986 5999999999999999999999988999999 6899999999999 Q ss_pred C-CCCCccccccchHHHHHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHh Q lcl|Aclame:pro 79 A-WDRDRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKL 157 (172) Q Consensus 79 ~-~~~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL 157 (172) + +++..++++.||++||+||||||++++++++..+. ....+||||||+|+++|+.++ +..+++++|++|| T Consensus 78 ~~~dg~~~~~~~IP~~V~~Aq~elA~~~~~~~~~~~~------~~~~v~~~kVG~i~veY~~~~---~~~~~~~~v~~LL 148 (170) T protein:vir:94 78 AVIGGMTLSQVSIPVKVKIAVFELAYFMLESGAALSF------ADQTIDSVKVGTIRVEFTKNS---TDAGLPTFVEAML 148 (170) T ss_pred cccCccccccchhhHHHHHHHHHHHHHHHhCcccCcc------cccceeeEecceeEEEecCCC---CCCccHHHHHHHh Confidence 7 69999999999999999999999999988765432 123589999999999997333 3457899999999 Q ss_pred hhhccccCCcccccC Q lcl|Aclame:pro 158 VRAGLVRSGGTLLRG 172 (172) Q Consensus 158 ~~~g~~~~~g~~~r~ 172 (172) .|+....++|.+-.. T Consensus 149 ~p~l~~~~~g~~~~~ 163 (170) T protein:vir:94 149 SGFGSPVLYGSNAAR 163 (170) T ss_pred hhhhccccccccccc Confidence 998877777776544 No 7 >protein:vir:43 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463469;swissprot:trembl:q9t1b5;genbank:gi:16798791;uniprot:Q9T1B5;genbank:GeneID:922374 Probab=97.76 E-value=7.4e-07 Score=54.19 Aligned_cols=124 Identities=15% Similarity=0.129 Sum_probs=74.2 Q ss_pred cccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCccccccchHHHH Q lcl|Aclame:pro 16 NAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYINDIPPEVK 95 (172) Q Consensus 16 nSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~~d~IP~~V~ 95 (172) =.|+|.++..... -|...+ +++-++++.+|+++||.. .|- |. +..+ ++. ..+.+|.+|| T Consensus 1 M~Y~d~~~Y~~~y--~g~~i~---e~~F~~l~~rAs~~ID~~-T~~--ri---------~~~~-~~~---~~~~~~~~vk 59 (131) T protein:vir:43 1 MPYTTLEFYNDEY--AGEHLE---QDEFDKLLKHAERKIDSV-TFY--RI---------RKGG-IES---FSEFIQHQIQ 59 (131) T ss_pred CCCCCHHHHHHhh--CCCCCC---HhHHHHHHHHHHHHHHHH-hcc--cc---------cccC-ccc---cchhhHHHHH Confidence 5677777654311 244454 566789999999999986 431 10 0000 111 1256899999 Q ss_pred HHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCC---CCCcHHHHHHHhhhhccccCCcccccC Q lcl|Aclame:pro 96 EACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVF---QMPKYPAADQKLVRAGLVRSGGTLLRG 172 (172) Q Consensus 96 ~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~---~~p~~~~v~~lL~~~g~~~~~g~~~r~ 172 (172) .|+|+.|-........ .+ .....+++++||..|++|...+.... .......+..+|.+-|| |-|| T Consensus 60 ~A~c~q~e~~~~~g~~--s~----~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~TGL------lyrG 127 (131) T protein:vir:43 60 LATCNQIEYFKEAGGT--SE----LAVSKPDNVSIGRTSISDSNFASTATSLNSGLIGSDVRSYLAHTGL------LYNG 127 (131) T ss_pred HHHHHHHHHHHHhHHH--hh----hhccccCeeecCceEEeecccccchhhhchhhhHHHHHHHHhccCC------eecC Confidence 9999999654432111 11 11223789999999999975333221 12235677778876655 4555 No 8 >protein:vir:80967 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468394;genbank:gi:157324968;genbank:GeneID:5601402 Probab=97.72 E-value=9.1e-07 Score=53.70 Aligned_cols=122 Identities=14% Similarity=0.123 Sum_probs=74.1 Q ss_pred cccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCcc--ccccchHH Q lcl|Aclame:pro 16 NAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRY--YINDIPPE 93 (172) Q Consensus 16 nSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~--~~d~IP~~ 93 (172) =.|+|.++..... -|...+ +++-++++.+|+++||.. .|- |. +..++ ..+.+|.+ T Consensus 1 M~Y~d~~~Y~~~y--~G~~i~---e~~F~~l~~rAs~~ID~~-T~~--ri---------------~~~~~d~~~~~~~~~ 57 (131) T protein:vir:80 1 MPYTTLEFYTNEY--AGEHLE---QDEFAKLLKHAERKIDSV-TFY--RI---------------RKSGIEAFSEFIQHQ 57 (131) T ss_pred CCCCCHHHHHHhh--CCCCCc---hhHHHHHHHHHHHHHHHH-hcc--cc---------------cccccccCchhHHHH Confidence 5677777654321 344455 456789999999999986 431 11 01111 12478999 Q ss_pred HHHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCC--CCC-cHHHHHHHhhhhccccCCcccc Q lcl|Aclame:pro 94 VKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVF--QMP-KYPAADQKLVRAGLVRSGGTLL 170 (172) Q Consensus 94 V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~--~~p-~~~~v~~lL~~~g~~~~~g~~~ 170 (172) ||.|+|+.|-......... ......+++++||..|++|...+.... ..+ ..+.+..+|.+-|+ |- T Consensus 58 vk~A~c~q~e~~~~~g~~~------~~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~TGL------ly 125 (131) T protein:vir:80 58 IQLATCNQIEYFKEAGGTS------ELAVSKPDNVSIGRTSISDSNFASTATSLNSGLVGSDVRSYLAHTGL------LY 125 (131) T ss_pred HHHHHHHHHHHHHHhhhhh------hhcccccCeeeeCceEEeeccccchhhhhhhhhhHHHHHHHHhccCC------ee Confidence 9999999996544422111 111234789999999999975333221 112 34557778876655 45 Q ss_pred cC Q lcl|Aclame:pro 171 RG 172 (172) Q Consensus 171 r~ 172 (172) || T Consensus 126 rG 127 (131) T protein:vir:80 126 NG 127 (131) T ss_pred cC Confidence 55 No 9 >protein:vir:98900 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:3882 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164420;genbank:gi:56694910;genbank:GeneID:3197290 Probab=97.31 E-value=6.5e-06 Score=49.00 Aligned_cols=122 Identities=18% Similarity=0.065 Sum_probs=74.8 Q ss_pred cccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCccc--cccchHH Q lcl|Aclame:pro 16 NAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYY--INDIPPE 93 (172) Q Consensus 16 nSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~--~d~IP~~ 93 (172) =.|+|.++.+.| .|...+ +++-+++|.+|+++||.. .|. |. +..++. ...++.+ T Consensus 1 M~Y~t~~~Y~~~---~G~~i~---e~~F~~l~~rAs~~ID~i-T~~--ri---------------~~~~~~~d~~~~~~~ 56 (132) T protein:vir:98 1 MPYLTYEEFMDL---NGRDID---DKKFEKLLPKASAIIDGV-TGH--FY---------------QKVDMEKDNAWRVNQ 56 (132) T ss_pred CCCCCHHHHHhh---cCCCCC---HHHHHHHHHHHHHHHHHH-hcc--cc---------------cCCCccccChHHHHH Confidence 678888877654 344444 566899999999999976 441 21 111122 2457788 Q ss_pred HHHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCC---CCCCc-HHHHHHHhhhhccccCCccc Q lcl|Aclame:pro 94 VKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAV---FQMPK-YPAADQKLVRAGLVRSGGTL 169 (172) Q Consensus 94 V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~---~~~p~-~~~v~~lL~~~g~~~~~g~~ 169 (172) ||.|+|..+-+....... +.+. ....++++++|..+++|..+.+.. ...+. .+-+..+|.+.|+ | T Consensus 57 vk~A~c~qiey~~~~G~~-sae~----~~~~~~S~svG~~Svs~~s~~~~~~~~~~~~~~~~~a~~~L~~tGL------L 125 (132) T protein:vir:98 57 FKLALCAQIEYFDALGAT-TFEE----INNSPQTFQAGRTSVSNASRYNPSGANESKPLVAEDVYIYLQGTGL------L 125 (132) T ss_pred HHHHHHHHHHHHHhccch-hhhh----ccCccceeeeCcEEEEeeccCCcccccccccchHHHHHHHHhhcCC------c Confidence 999999998644332211 1111 123378999999999997433322 22233 2456778876655 5 Q ss_pred ccC Q lcl|Aclame:pro 170 LRG 172 (172) Q Consensus 170 ~r~ 172 (172) -|| T Consensus 126 yrG 128 (132) T protein:vir:98 126 FQG 128 (132) T ss_pred ccc Confidence 677 No 10 >protein:vir:79701 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285884;genbank:gi:148750841;genbank:GeneID:5220403 Probab=94.33 E-value=0.0022 Score=35.13 Aligned_cols=136 Identities=20% Similarity=0.081 Sum_probs=75.5 Q ss_pred ccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhc-cCccccCccccccccccCCC-CCCccccccch- Q lcl|Aclame:pro 15 ANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNF-VGKKRLGRDQTTEWPRTDAW-DRDRYYINDIP- 91 (172) Q Consensus 15 AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~-~G~r~~~~~Q~lawPR~g~~-~~~~~~~d~IP- 91 (172) --.|.|-+|.+. .|.... +++.-+++|.+|++-||....+ .+.-. +..+- |.+-=....|| T Consensus 1 ~~pYLTy~ef~~----lg~~~~--~~d~F~kllk~A~~~ID~~T~y~~~~y~----------~~~i~~d~~~d~~~~~~~ 64 (144) T protein:vir:79 1 MKPYLTTSDFEK----LGYELK--KPDNFGKLLKSATVLINQICSYYDPAFA----------YHDLEADSQADPDSYLFR 64 (144) T ss_pred CCcccchhhhhh----hCCCCc--chhhhhhHHHHHHHHhhhhhhhhccccc----------cccccccccccchhhhhH Confidence 567888776643 333333 4566899999999999997432 22100 00010 00001122355 Q ss_pred --HHHHHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCC---CCcHHHHHHHhhhhccccCC Q lcl|Aclame:pro 92 --PEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQ---MPKYPAADQKLVRAGLVRSG 166 (172) Q Consensus 92 --~~V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~---~p~~~~v~~lL~~~g~~~~~ 166 (172) .+||.|.|.-..+.-.. +..+..... ...+++..||-.+++|.+.+..+.. ....+-+..+|.+.|+.-.| T Consensus 65 r~~~vKkA~a~QIeY~~~~-G~~sa~e~~---~~~~~S~svGrtsvs~~~~~~~s~t~~~~~v~~~a~~yL~~tGLLYrG 140 (144) T protein:vir:79 65 QAMAFKKAVALEMLFLEDS-GYSSAYDVA---QGALNSFTVGHTSMSLNPSAGQNLTVGSTGVVKSAYDLLGRYGLLFSG 140 (144) T ss_pred HHHHHHHHHHHHHHHHHHc-CCcchhhhh---cCccceeEecceEEeecCCCccccccccccccHHHHHHHhhcCccccc Confidence 45688888776433222 222221111 2347899999999999765544333 33446778888888775433 Q ss_pred cccc Q lcl|Aclame:pro 167 GTLL 170 (172) Q Consensus 167 g~~~ 170 (172) =..+ T Consensus 141 V~s~ 144 (144) T protein:vir:79 141 VASL 144 (144) T ss_pred cccC Confidence 2222 No 11 >protein:vir:107756 Length: 147 # NCBI annotation: gp21 # Family: family:all:664 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024869;genbank:gi:48697511;genbank:GeneID:2948377 Probab=93.27 E-value=0.003 Score=34.40 Aligned_cols=128 Identities=16% Similarity=0.151 Sum_probs=67.7 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCC Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAW 80 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~ 80 (172) |+.+. +++++.+-+=+....- ..+|+..+..|..|..+|+.. +|.-.. T Consensus 1 m~v~f--------------d~~~Fr~~fPeFad~~-~~pd~~i~~~l~~A~~~l~~~-~~~~~~---------------- 48 (147) T protein:vir:10 1 MDHTL--------------DITKFRALFPEFNNDV-KYPDALLEQWYAVAGEYLGLT-DYACGL---------------- 48 (147) T ss_pred Cceec--------------CHHHHHHhcccccCCc-cCCHHHHHHHHHHHHHhhccc-cCCccc---------------- Confidence 55443 4666666554443321 124788899999999999975 442100 Q ss_pred CCCccccccchHHHHHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCC-----CCcHH-HHH Q lcl|Aclame:pro 81 DRDRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQ-----MPKYP-AAD 154 (172) Q Consensus 81 ~~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~-----~p~~~-~v~ 154 (172) + .....++.+.++.+.+.-...... .......+.|.++|.+||+|......+.. ...|- ... T Consensus 49 ~---------g~~~~~~l~Ll~AHll~l~~~~~~---g~g~~G~v~Sas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~y~ 116 (147) T protein:vir:10 49 N---------GNTLDLALMQLTAHLMKSATILSS---NKGAPMVMTSATIDKVSISTLAPPIKNGWQYWLSTTPYGQMLW 116 (147) T ss_pred C---------hhhHHHHHHHHHHHHHHHHHhhcc---CCCcccceeeeeecceeeeeecCCCCCcchhhhhcCHHHHHHH Confidence 0 122235555555444433211111 22334568999999999999854333221 22232 223 Q ss_pred HHhh---hhccccCCcccccC Q lcl|Aclame:pro 155 QKLV---RAGLVRSGGTLLRG 172 (172) Q Consensus 155 ~lL~---~~g~~~~~g~~~r~ 172 (172) +|+. ..|.+.||-+...+ T Consensus 117 ~l~~~~~~Gg~vvgG~p~r~a 137 (147) T protein:vir:10 117 ALLSMRSSGGFVYGGSPELSG 137 (147) T ss_pred HHHHhhCccceecCCCCcccc Confidence 3333 33455566665555 No 12 >protein:vir:7857 Length: 188 # NCBI annotation: gp14 # Family: family:all:11114 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817464;genbank:gi:29565893;genbank:GeneID:1259086 Probab=93.21 E-value=0.0012 Score=36.56 Aligned_cols=122 Identities=17% Similarity=0.179 Sum_probs=60.4 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhcc--------------Ccccc Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFV--------------GKKRL 66 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~--------------G~r~~ 66 (172) |+ |+..-....+ .+++..+.||..|+.-+-+.++|. |.+.. T Consensus 1 ~~------------------------~~~~la~~~~-~da~~v~lAL~~As~~VR~~~g~~i~~V~~dt~~id~~Gg~~~ 55 (188) T protein:vir:78 1 MT------------------------FAQQLADAFP-EDADDAATALSWAKSQVEGYCGRKFDLVEDDVAIVDPYCGSSL 55 (188) T ss_pred Cc------------------------hhhhHHHhcC-CCcchHHHHHHHHHHHHHHHhCCcceeeeeeeeeeecCCCcee Confidence 22 2222211222 233444557888887777665432 21100 Q ss_pred C-----------------------------------------cccc-------ccccccC----CCCCCccccccchHHH Q lcl|Aclame:pro 67 G-----------------------------------------RDQT-------TEWPRTD----AWDRDRYYINDIPPEV 94 (172) Q Consensus 67 ~-----------------------------------------~~Q~-------lawPR~g----~~~~~~~~~d~IP~~V 94 (172) = ..|+ -.||+.- +....|+ +.||.+| T Consensus 56 LP~~Pvv~i~~Ve~~~~~G~~~~~v~~r~y~~~g~~~~l~~trg~pg~~~~r~~~WPw~p~~VtVTytHGy--~evP~ei 133 (188) T protein:vir:78 56 LPSIPVVEISKVEGYLPTGNGMDWVELTNYWFKRDTGLIFDTTGLPGSEWSTGHTWPWLPGSLRVTYTHGY--NPVPDEL 133 (188) T ss_pred eccCcceeeeEEEEEeeCCcccccccccccccccceeeecccccCcccccccccccccCcceEEEEEecCC--CcccHHH Confidence 0 0011 1244210 1111222 4799999 Q ss_pred HHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhhccccCC Q lcl|Aclame:pro 95 KEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVRSG 166 (172) Q Consensus 95 ~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~g~~~~~ 166 (172) +...|++|-+++.+|.+ ..+++||++|+.|+..++.+ ...+ =..+|.++-+..-- T Consensus 134 v~lv~d~A~~~~~np~~-------------L~q~~vG~~S~tfa~~~~~s--l~~~--~~~il~ry~l~~~~ 188 (188) T protein:vir:78 134 IDVAIRLAREYQSNPEL-------------LVSKQVGEIERRFGSVAGTS--LSKA--DQAILDRYVIATLA 188 (188) T ss_pred HHHHHHHHHHHhcCccc-------------ceeeecCceeeecccccCCc--ccch--hHHhhccccccccC Confidence 99999999998888643 34699999999998543332 2222 22333322111111 No 13 >protein:vir:101652 Length: 188 # NCBI annotation: gp15 # Family: family:all:11114 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654770;genbank:gi:109302768;genbank:GeneID:4156086 Probab=93.21 E-value=0.0012 Score=36.56 Aligned_cols=122 Identities=17% Similarity=0.179 Sum_probs=60.4 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhcc--------------Ccccc Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFV--------------GKKRL 66 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~--------------G~r~~ 66 (172) |+ |+..-....+ .+++..+.||..|+.-+-+.++|. |.+.. T Consensus 1 ~~------------------------~~~~la~~~~-~da~~v~lAL~~As~~VR~~~g~~i~~V~~dt~~id~~Gg~~~ 55 (188) T protein:vir:10 1 MT------------------------FAQQLADAFP-EDADDAATALSWAKSQVEGYCGRKFDLVEDDVAIVDPYCGSSL 55 (188) T ss_pred Cc------------------------hhhhHHHhcC-CCcchHHHHHHHHHHHHHHHhCCcceeeeeeeeeeecCCCcee Confidence 22 2222211222 233444557888887777665432 21100 Q ss_pred C-----------------------------------------cccc-------ccccccC----CCCCCccccccchHHH Q lcl|Aclame:pro 67 G-----------------------------------------RDQT-------TEWPRTD----AWDRDRYYINDIPPEV 94 (172) Q Consensus 67 ~-----------------------------------------~~Q~-------lawPR~g----~~~~~~~~~d~IP~~V 94 (172) = ..|+ -.||+.- +....|+ +.||.+| T Consensus 56 LP~~Pvv~i~~Ve~~~~~G~~~~~v~~r~y~~~g~~~~l~~trg~pg~~~~r~~~WPw~p~~VtVTytHGy--~evP~ei 133 (188) T protein:vir:10 56 LPSIPVVEISKVEGYLPTGNGMDWVELTNYWFKRDTGLIFDTTGLPGSEWSTGHTWPWLPGSLRVTYTHGY--NPVPDEL 133 (188) T ss_pred eccCcceeeeEEEEEeeCCcccccccccccccccceeeecccccCcccccccccccccCcceEEEEEecCC--CcccHHH Confidence 0 0011 1244210 1111222 4799999 Q ss_pred HHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhhccccCC Q lcl|Aclame:pro 95 KEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVRSG 166 (172) Q Consensus 95 ~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~g~~~~~ 166 (172) +...|++|-+++.+|.+ ..+++||++|+.|+..++.+ ...+ =..+|.++-+..-- T Consensus 134 v~lv~d~A~~~~~np~~-------------L~q~~vG~~S~tfa~~~~~s--l~~~--~~~il~ry~l~~~~ 188 (188) T protein:vir:10 134 IDVAIRLAREYQSNPEL-------------LVSKQVGEIERRFGSVAGTS--LSKA--DQAILDRYVIATLA 188 (188) T ss_pred HHHHHHHHHHHhcCccc-------------ceeeecCceeeecccccCCc--ccch--hHHhhccccccccC Confidence 99999999998888643 34699999999998543332 2222 22333322111111 No 14 >protein:vir:9576 Length: 131 # NCBI annotation: gp42 # Family: family:all:1271 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862881;genbank:gi:32469473;genbank:GeneID:1461318 Probab=92.67 E-value=0.007 Score=32.40 Aligned_cols=129 Identities=16% Similarity=0.069 Sum_probs=74.5 Q ss_pred ccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCccccccchHHH Q lcl|Aclame:pro 15 ANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYINDIPPEV 94 (172) Q Consensus 15 AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~~d~IP~~V 94 (172) -+.|+|++|..+-| | ..........+.+|-.|+++|...++-.|.. ..+|+ .+.+..+.-+ T Consensus 1 m~~fAtv~D~~~rw--r--~Lt~~E~~ra~~LL~~As~~ir~~~p~~~~~------l~~~~---------~~~~~~~~~~ 61 (131) T protein:vir:95 1 MENFATVEDLKKLW--R--ALKFDEEKRAEALLEVVSHSLRVEAKKVGKD------LDGLV---------ATDPSFTMVV 61 (131) T ss_pred CCccCCHHHHHHHh--c--CCCHHHHHHHHHHHHHHHHHHHHhhhhccCC------ccccc---------cCCccchHHH Confidence 68999999998755 3 2332222345889999999999877544422 11121 2234567778 Q ss_pred HHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhhccccCCcccccC Q lcl|Aclame:pro 95 KEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVRSGGTLLRG 172 (172) Q Consensus 95 ~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~g~~~~~g~~~r~ 172 (172) +.-+|+...+++..+-. . .+....++..|+.+-+..+.+. ++..-.-..-..+|+. .-.|-|++-.-| T Consensus 62 ~~V~~~~V~Ral~~~~~--~------~G~tq~S~TaG~ys~S~t~~~p-~g~lylt~~e~~~LGl-~~~r~~~i~~~~ 129 (131) T protein:vir:95 62 KSVTVDVVARTLMTSTD--Q------EPMTQVAESALGYSFSGSYLVP-GGGLFIKDSELKRLGL-KKQRYGVIDIYG 129 (131) T ss_pred HHHHHHHHHHHhcCCCC--C------CCceeeeeecccceeeeeeecC-CCCceeChHHHHHhCC-CCCceeEEeecc Confidence 88999999888765421 1 1223457888988555443332 2222222334455543 225555666666 No 15 >protein:vir:99570 Length: 153 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039799;genbank:gi:126011049;genbank:GeneID:4818265 Probab=92.50 E-value=0.0083 Score=31.99 Aligned_cols=131 Identities=16% Similarity=0.071 Sum_probs=70.4 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCc--cccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccC Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGN--SFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTD 78 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~--~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g 78 (172) |..+|-| ++.+.+.+=+... .+| |+..+..|..|..+|+.. +|.... . T Consensus 1 m~~~~fd-------------~~~Fr~~fPeFad~~~~P---d~~i~~~l~~A~~~l~~~-~~~~~~-~------------ 50 (153) T protein:vir:99 1 MADPVYN-------------DGLFRIMYPEFADQEKYP---PEVIEIYYDTATLFITGS-MFPCAA-L------------ 50 (153) T ss_pred CCcccCC-------------hHHHHHhcccccCccccC---HHHHHHHHHHHHHhhcCc-cccccc-c------------ Confidence 7777754 3444444433332 244 788899999999999854 343221 0 Q ss_pred CCCCCccccccchHHHHHHHHHHHHHHHhccC--CCCcccccccccceeeEEeeCceEEEeeccCCCCCC-----CCcHH Q lcl|Aclame:pro 79 AWDRDRYYINDIPPEVKEACAEYALRALAAEL--NPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQ-----MPKYP 151 (172) Q Consensus 79 ~~~~~~~~~d~IP~~V~~A~~elA~~~l~~~l--~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~-----~p~~~ 151 (172) -....+++.+.++.+.|.-.. ........+...+.++|+++|.+||+|......++. ...|- T Consensus 51 -----------~g~~~~~~l~Ll~AH~l~L~~~~~~~~~~a~~~~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YG 119 (153) T protein:vir:99 51 -----------SGKQLVGALNMLTAHLMSLSMQRSQTALGATNDQGGYTLSATIGEVSVSKMAPPAKDGWEFWLAQTPYG 119 (153) T ss_pred -----------ChHHHHHHHHHHHHHHHHHHhhhhcccccCCCccccceeeeeecceeeeeecCCCCCchhHhhhcCHHH Confidence 134455666666655543211 111111233345678999999999999754433221 22232 Q ss_pred ----HHHHHhhhhccccCCcccccC Q lcl|Aclame:pro 152 ----AADQKLVRAGLVRSGGTLLRG 172 (172) Q Consensus 152 ----~v~~lL~~~g~~~~~g~~~r~ 172 (172) .+..+++..|.+.+|-+-..+ T Consensus 120 q~fw~l~~~~~~Gg~v~gg~pe~~~ 144 (153) T protein:vir:99 120 QALWALLKMLSVGGFAIGGLPERTG 144 (153) T ss_pred HHHHHHHHHhcccccccCCCCcccc Confidence 223333444555655554444 No 16 >protein:vir:5256 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852761;genbank:gi:31544036;uniprot:Q7Y5T9;genbank:GeneID:2753553 Probab=91.42 E-value=0.01 Score=31.55 Aligned_cols=113 Identities=16% Similarity=0.088 Sum_probs=64.0 Q ss_pred cccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCccccccchHHHHHH Q lcl|Aclame:pro 18 YISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYINDIPPEVKEA 97 (172) Q Consensus 18 Y~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~~d~IP~~V~~A 97 (172) ..|++++++-+=+... .+|+..+..|..|..||+.. +| |.. -.++ T Consensus 1 m~t~~~Fr~~~PeF~~----~pd~~i~~~l~~A~~~l~~~-~~-g~~-----------------------------~~~~ 45 (119) T protein:vir:52 1 MPLTEDFLLRYTEFGK----TDAKRIGLFLSDAQAEVSKV-QW-GKL-----------------------------YDRG 45 (119) T ss_pred CCcHHHHHHhhhhccC----CCHHHHHHHHHHHHHhhCCc-CC-chH-----------------------------HHHH Confidence 7788888887755533 46888999999999999864 55 211 1234 Q ss_pred HHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCC-----CCcHH-HHHHHhhhhccccCCcccc Q lcl|Aclame:pro 98 CAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQ-----MPKYP-AADQKLVRAGLVRSGGTLL 170 (172) Q Consensus 98 ~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~-----~p~~~-~v~~lL~~~g~~~~~g~~~ 170 (172) .+.++.+.|.-..... ...+.....++|.++|.+||+|+.....+.. ...|- -..+|+.+ .+.||+.- T Consensus 46 ~~L~~AH~l~l~~~~~--~~~g~~~g~v~S~s~G~vSvS~~~~~~~~~~~~w~~~T~YG~~y~~L~r~---~g~Gg~Va 119 (119) T protein:vir:52 46 VMALTAHLLKLSADAE--ISGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRL---IGVGVMVA 119 (119) T ss_pred HHHHHHHHHHhhhhhh--ccccccccceeeeeecceeeeeeccccCCcchhhhhcCHHHHHHHHHHHH---hcCCCcCC Confidence 4555555543221111 1123344678999999999999754443222 12221 12334433 23333333 No 17 >protein:vir:4788 Length: 130 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150168;swissprot:trembl:q94m43;genbank:gi:15088779;uniprot:Q94M43;genbank:GeneID:955972 Probab=91.41 E-value=0.01 Score=31.49 Aligned_cols=126 Identities=21% Similarity=0.116 Sum_probs=71.9 Q ss_pred cccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhh-ccCccccCccccccccccCCCCCCccccccchHHH Q lcl|Aclame:pro 16 NAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFN-FVGKKRLGRDQTTEWPRTDAWDRDRYYINDIPPEV 94 (172) Q Consensus 16 nSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~-~~G~r~~~~~Q~lawPR~g~~~~~~~~~d~IP~~V 94 (172) =.|.|.++.+.| +.. ++++-+++|.+|++-||...+ |.=.. .+ ..=+...+=.+| T Consensus 1 M~YlT~eey~el----~~~----~~~~F~kl~k~A~~~ID~~t~~~y~~~-~~---------------~~~~~~~r~~~v 56 (130) T protein:vir:47 1 MTYLTQEEFDEL----DFD----EVTDFEKLAKRAKIAIDLYTNGIYQKD-ID---------------FEKEIAYRKSAV 56 (130) T ss_pred CCCCchhhHhhc----CCC----ChhhHHHHHHHHHHHHHHHhccccccc-CC---------------ccCcchHHHHHH Confidence 678888888764 332 345689999999999998642 32111 10 011233456788 Q ss_pred HHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCC-CCcH-HHHHHHhhhhcc-ccCCccccc Q lcl|Aclame:pro 95 KEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQ-MPKY-PAADQKLVRAGL-VRSGGTLLR 171 (172) Q Consensus 95 ~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~-~p~~-~~v~~lL~~~g~-~~~~g~~~r 171 (172) |.|.|.-..+.-+... .+. .....+.+..||-.+++|...+++... .+.+ .-+..+|.+.|+ +-.|=.--| T Consensus 57 K~A~a~QieY~~~~G~-~s~-----~~~~~~~S~svGrtSis~~~~~~~~~~~~~~vs~da~~~L~~tGL~Ly~GV~yd~ 130 (130) T protein:vir:47 57 KLAMAFQIAYLDASGI-MSA-----DDKQLANSVSIGRTSISYSTSQSTLAGQRFNLSMDAENALRQAGFSLVVGVAYDR 130 (130) T ss_pred HHHHHHHHHHHHHhcc-ccc-----hhccCcceeeecceeeecCcCccccccCCccccHHHHHHHHhcccccccCCCccC Confidence 8888877654332211 111 113447899999999999764433322 2222 223447777776 333333333 No 18 >protein:vir:1435 Length: 188 # NCBI annotation: hypothetical protein # Family: family:all:501 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536364;genbank:gi:17975169;genbank:GeneID:929149 Probab=90.55 E-value=0.018 Score=30.11 Aligned_cols=129 Identities=19% Similarity=0.113 Sum_probs=64.9 Q ss_pred Cee-EeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHH-HHHHHHHHhhhhh-hccCcccc----------- Q lcl|Aclame:pro 1 MAL-IVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEA-AVIRATDYLDQRF-NFVGKKRL----------- 66 (172) Q Consensus 1 M~l-iVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~-aL~~Asdyid~~~-~~~G~r~~----------- 66 (172) |.. |++. |.+..=+|++|+++|.--- .+.+|+.... ++..|.+++++.. +....++. T Consensus 1 m~~~~~~~-----ppa~epVtLae~K~~lrid----~~~eD~~l~~~li~aA~~~~E~~tgr~l~~qt~~~~~~~~~~~~ 71 (188) T protein:vir:14 1 MAAVLVEY-----LDDAEPLTFEEVAFQCRID----DDDERDFVERVVIPGARQAAESKAGAAIRKARYVEHLSGFPPAE 71 (188) T ss_pred CCceeeec-----CCCCCccCHHHHHHHcCCC----CchhHHHHHHHHHHHHHHHHHHHhCCeeeeeeEEEEecCcCCCc Confidence 655 5542 4456678999999986542 1223455545 4557788998743 11111100 Q ss_pred --------------------Ccccc----------------------ccccccCCC---CCCccccccchHHHHHHHHHH Q lcl|Aclame:pro 67 --------------------GRDQT----------------------TEWPRTDAW---DRDRYYINDIPPEVKEACAEY 101 (172) Q Consensus 67 --------------------~~~Q~----------------------lawPR~g~~---~~~~~~~d~IP~~V~~A~~el 101 (172) +..|. ..||+.+.+ ...|++ +.+|..+|+|...+ T Consensus 72 ~~Lp~~Pv~sV~sV~~~d~~g~~~~l~~~~y~l~~~~~~~~l~~~~~~~~p~~~~V~V~~~AG~~-~~vP~~ik~Aill~ 150 (188) T protein:vir:14 72 VPLSVGQVISVDSIEIRDASGATTTLDAGAFELVQLGRETLLVPAGQARWPYARAVTIKYQAGID-LARYPSVRSWMLLA 150 (188) T ss_pred eEecccCcceeeEEEEEcCCCceEeecccceEEeecCCCcEEEEecCCCCCCCceEEEEEEecCc-cCchHHHHHHHHHH Confidence 00000 113332211 111233 35888899988888 Q ss_pred HHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhhccccCC Q lcl|Aclame:pro 102 ALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVRSG 166 (172) Q Consensus 102 A~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~g~~~~~ 166 (172) +.....+- +.+ .. +......| +.++++||.|+=++.+. T Consensus 151 va~~Y~~R------------------e~~-----~~---g~~~~~lP-~~~v~~Ll~pyRvP~~~ 188 (188) T protein:vir:14 151 AAWAYDHR------------------ELY-----SD---GQPMGEMP-GGYSDVLLNPITVPPRF 188 (188) T ss_pred HHHHHhcc------------------ccc-----cc---cccccccc-HHHHHHHhhccCCCCCC Confidence 86654431 111 00 00001122 34588999988766666 No 19 >protein:vir:9761 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795523;genbank:gi:28876281;genbank:GeneID:1257822 Probab=89.19 E-value=0.028 Score=29.12 Aligned_cols=129 Identities=16% Similarity=0.086 Sum_probs=66.8 Q ss_pred ccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCccccccchHHH Q lcl|Aclame:pro 15 ANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYINDIPPEV 94 (172) Q Consensus 15 AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~~d~IP~~V 94 (172) -+.|+|++|..+-| | ..........+.+|-.|+++|...++=.|. +..+...++ +.-+.-+ T Consensus 1 m~~fATv~Dv~~rw--r--~Lt~dE~~ra~~LL~dAS~~iR~~~p~~g~---~~~~~~~~~------------~~~~~~~ 61 (140) T protein:vir:97 1 MGNFATTDDVILLW--R--PLSVDELKRANALLKVVSDTLRMEADKVGK---DLDKTMVDK------------PYFVNVI 61 (140) T ss_pred CCcCCCHHHHHHHh--c--CCCHhHHHHHHHHHHHHHHHHHHhhhhccC---CcchhcccC------------ccchhHH Confidence 68999999998766 3 333222344688999999999987642231 112222222 1223445 Q ss_pred HHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhhccccCCcccccC Q lcl|Aclame:pro 95 KEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVRSGGTLLRG 172 (172) Q Consensus 95 ~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~g~~~~~g~~~r~ 172 (172) +.-+|....+++..+- +. .+....++..|+.+.+..+.+.. +..-.-..-..+|+. +--|-|++-.-| T Consensus 62 k~V~~~mV~Ral~~~~--d~------~G~tq~S~TaG~ys~S~T~~np~-G~lylt~~e~~~LGl-~~~r~~~i~~~g 129 (140) T protein:vir:97 62 KSVTVDIVARTLMTST--QG------EPMSQESQSALGYTWSGTYLVPG-GGLFIKDNELKRLGL-KKQRYGGIELYG 129 (140) T ss_pred HHHHHHHHHHHhcCCC--CC------CcceeeeeeccchhheeeeecCC-CCceeChHHHHHhCC-CCCceeeecccC Confidence 5667777666554321 11 12224567889884443333222 222222334455543 224445555555 No 20 >protein:vir:9821 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795583;genbank:gi:28876338;genbank:GeneID:1257921 Probab=89.11 E-value=0.025 Score=29.39 Aligned_cols=131 Identities=19% Similarity=0.091 Sum_probs=68.5 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCC Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAW 80 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~ 80 (172) |-+.. =+|.|.+|.+. .+.. +.++-+++|.+|++-||...++.=.+ .+-+- T Consensus 1 ~~~~~----------M~YlT~eey~~----l~~~----~~~dF~kllk~As~~ID~~t~~~y~~-~d~e~---------- 51 (138) T protein:vir:98 1 MEVVI----------IAFLTQKEFED----LGFD----DVEDFEKMEKRASHAVNLYCRNRYDY-KDLKK---------- 51 (138) T ss_pred Ccccc----------ccccchHHHhc----cCCC----ChhhHHHHHHHHHHHhhhhhcccccc-ccccc---------- Confidence 44433 36888887654 2333 34458999999999999864321111 11111 Q ss_pred CCCccccccchHHHHHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCC----CC---CcHHHH Q lcl|Aclame:pro 81 DRDRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVF----QM---PKYPAA 153 (172) Q Consensus 81 ~~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~----~~---p~~~~v 153 (172) ..+.+=.+||.|.|.--.+.-.... .+.. ..+..++..||-.+++|+...++.. .. ..-.-+ T Consensus 52 -----d~~~r~~~vKkA~a~QIeY~~~~G~-ts~~-----d~~~~~s~svGrTSiS~~~~~~~~s~~~~~~~~~~~s~~A 120 (138) T protein:vir:98 52 -----EIALVQKAVKRAIAYQIAYLNDSGV-MTAE-----DKQSFAGISLGRTSISYTVGHGQGSQQKTLADRFNLCLDA 120 (138) T ss_pred -----hhHHHHHHHHHHHHHHHHHHHHcCC-cchh-----hccCcCceEeeeeEeecccccccccccccccccccccHHH Confidence 1122445777777765533322211 1111 1334678999999999843222211 11 122334 Q ss_pred HHHhhhhccccCCccccc Q lcl|Aclame:pro 154 DQKLVRAGLVRSGGTLLR 171 (172) Q Consensus 154 ~~lL~~~g~~~~~g~~~r 171 (172) ..+|.+.|++-.|=.--| T Consensus 121 ~~~L~~tGLLY~GV~yd~ 138 (138) T protein:vir:98 121 ENELLVVGLGYTGISYDR 138 (138) T ss_pred HHHHhhcCcccccCcccC Confidence 558888777543333333 No 21 >protein:vir:94761 Length: 132 # NCBI annotation: unknown # Family: family:all:1271 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996708;genbank:gi:45597423;genbank:GeneID:2769029 Probab=86.38 E-value=0.046 Score=27.92 Aligned_cols=130 Identities=18% Similarity=0.179 Sum_probs=69.8 Q ss_pred ccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCccccccchHHH Q lcl|Aclame:pro 15 ANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYINDIPPEV 94 (172) Q Consensus 15 AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~~d~IP~~V 94 (172) -+.|+|++|..+-| | .......+..+.+|-.|+++|...++=.|.. . +.=...|| +..+.-+ T Consensus 1 m~~fAtv~Dl~~r~--r--~L~~dE~~ra~~LL~dAs~~iR~~~~~~~~~-~-~~~~~~~~------------d~~~~~~ 62 (132) T protein:vir:94 1 MNPFATVDDLTMLW--R--PLKGDEKERAEKLLEIVSDTLREEADKVGRD-L-DVMISEKP------------SYFSSVV 62 (132) T ss_pred CCCcCCHHHHHHHh--c--cCChhHHHHHHHHHHHHHHHHHHHHhhhccc-c-ccccCCCC------------ccchhHH Confidence 68999999998754 2 3333222445788999999998765433322 1 11112222 2234445 Q ss_pred HHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhhccccCCcccccC Q lcl|Aclame:pro 95 KEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVRSGGTLLRG 172 (172) Q Consensus 95 ~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~g~~~~~g~~~r~ 172 (172) +.-+|....+++..+-.. . +....++..|+.+.+..+.+.. +..---..-..+|+. +-.|-|++-.-| T Consensus 63 k~V~~~~V~Ral~~~~~~----~----g~tq~S~TaG~ys~S~T~~np~-G~lylt~~e~~~LGl-~~~r~~~i~~~~ 130 (132) T protein:vir:94 63 KSVTVDIVARTLMTSTDQ----E----PMTQTTESALGYSVSGSYLVPG-GGLFIKNSELSRLGL-KKQRFGVIDFYG 130 (132) T ss_pred HHHHHHHHHHHhcCCCCC----C----CceeeeeecccceeeeeeecCC-CCceeChHHHHhhCC-CCCceEEEeecC Confidence 667788888877653211 1 1223567889885554443322 222222344555543 224445555555 No 22 >protein:vir:99002 Length: 158 # NCBI annotation: gp31 # Family: family:all:32652 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655896;genbank:gi:109521468;genbank:GeneID:4157967 Probab=85.97 E-value=0.034 Score=28.67 Aligned_cols=120 Identities=17% Similarity=0.085 Sum_probs=71.8 Q ss_pred ccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCccccccchHHH Q lcl|Aclame:pro 15 ANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYINDIPPEV 94 (172) Q Consensus 15 AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~~d~IP~~V 94 (172) --+|+||+|++... .|+-.++. ++...+|-.+||-.=.|. | ..|-..||- .+.+|.-| T Consensus 1 ~~alasvee~~trl-----~~~lp~~~--~r~~a~a~~vLd~~S~~a--r---~~~gr~W~~----------~~daP~~v 58 (158) T protein:vir:99 1 MAALVSVEEFTTFL-----RVPLPEEG--SEKYTQMEFLLTLASDWA--R---ELSCKPWLL----------PADAPVTA 58 (158) T ss_pred CcceeeHhhhhhhh-----cccCChhh--hHHHHHHHHHHHHHHHHH--H---HhcCccCCC----------CCcchhHH Confidence 57899999999987 45543333 333344445566432221 1 345667882 35678888 Q ss_pred HHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhhccccCCcccccC Q lcl|Aclame:pro 95 KEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVRSGGTLLRG 172 (172) Q Consensus 95 ~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~g~~~~~g~~~r~ 172 (172) +.-|...|-+.+++|. .+..+.+|+-++.|..+....+ --.+.=...|..++..+ ||.-..+ T Consensus 59 r~ivL~aa~R~~~NP~-------------g~~~~~~G~~~~~~~~~g~~~~--ffT~~E~~~L~r~~~s~-GG~~~~~ 120 (158) T protein:vir:99 59 RGIILAASRREWNNPK-------------RVSYVVKGPQSATFMQSAYPPG--FFTDAEEAKLRSYGRST-GNWGVIE 120 (158) T ss_pred HHHHHHHHHHHHhcCC-------------ceEEeeecchhhhcccccCCCc--ccCHHHHHHHHHhhccc-CceeEEE Confidence 8888888888887743 2567888999999964432221 11134456777665333 4443322 No 23 >protein:vir:100103 Length: 120 # NCBI annotation: gp7 # Family: family:all:363 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945037;genbank:gi:38707897;genbank:GeneID:2744150 Probab=84.33 E-value=0.048 Score=27.81 Aligned_cols=119 Identities=18% Similarity=0.099 Sum_probs=62.6 Q ss_pred CCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCccccccch Q lcl|Aclame:pro 12 VAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYINDIP 91 (172) Q Consensus 12 ~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~~d~IP 91 (172) .++=-+++|+++++.+..-.+. .||+-.+..+..|.+|+.. |.|++- ..+....+-..+....+.....|| T Consensus 1 ~~~~m~~vtL~e~K~hLRvd~d----~DD~lI~~~i~AA~~~v~~---~~~r~l--~~~~~~~~~~~~~~~~~~~~~~~~ 71 (120) T protein:vir:10 1 MADQTPIVSLEVALAHLREDAG----VADDLIKIYIGAATQSASD---YVDRKL--YANDAEMQAAVADATAGADPIVAN 71 (120) T ss_pred CCCCCCccCHHHHHHHcCCCCC----cchHHHHHHHHHHHHHHHH---HhCCcc--cccccccchhhhccccccccccCC Confidence 5777899999999998765422 3566677777778787774 556552 222222222111122234445689 Q ss_pred HHHHHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhhcccc-CCcc Q lcl|Aclame:pro 92 PEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVR-SGGT 168 (172) Q Consensus 92 ~~V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~g~~~-~~g~ 168 (172) +.++.|++.+.-....+ .+. ..+|. .......| ..+..||.++ | ..|. T Consensus 72 ~~i~~AvLllvg~~Yen-----Re~-----------~~~~~--------~~~~~~lP--~~v~~Ll~~y---R~~~gv 120 (120) T protein:vir:10 72 DAIRAAILLTIGKLYAF-----RED-----------VVSGA--------SASVTELP--SGAKSLLFPY---RVGLGV 120 (120) T ss_pred HHHHHHHHHHHHHHHhc-----hhh-----------hhhcc--------cccccccC--HHHHHHHHHh---hhccCC Confidence 99999888877544333 210 00010 00001112 2366777642 2 2233 No 24 >protein:vir:80320 Length: 188 # NCBI annotation: gp8, conserved hypothetical protein # Family: family:all:501 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111087;genbank:gi:134288682;genbank:GeneID:4960567 Probab=78.59 E-value=0.11 Score=25.78 Aligned_cols=127 Identities=15% Similarity=0.010 Sum_probs=59.5 Q ss_pred Cee-EeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHH-HHHHHHHHhhhhhh-ccCccccCccccc-cccc Q lcl|Aclame:pro 1 MAL-IVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEA-AVIRATDYLDQRFN-FVGKKRLGRDQTT-EWPR 76 (172) Q Consensus 1 M~l-iVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~-aL~~Asdyid~~~~-~~G~r~~~~~Q~l-awPR 76 (172) |.. |++. |.+..=+|++|++++.--.. +.+|+..+. ++..|.+++++... ....+ +..+.+ .||+ T Consensus 1 M~~~~~~~-----ppa~ePVtL~e~K~hLRid~----~~eD~~l~~~lI~aA~~~~E~~~gr~l~~q--t~~~~~~~~~~ 69 (188) T protein:vir:80 1 MAAVLVEY-----LDDAEPLTFEEVAFQCRIDD----DDERDFVERIVIPGARQAAESKSGAAIRKA--RYVERLSGFPL 69 (188) T ss_pred CCceeecc-----CCCCcccCHHHHHHHcCCCC----chhhHHHHHHHHHHHHHHHHHHhCCeeeee--eEEEEecCCCC Confidence 665 5542 34445589999999875421 123455555 44578888887531 11111 011111 2332 Q ss_pred cCCC---------------C------------------------------------------CCccccccchHHHHHHHH Q lcl|Aclame:pro 77 TDAW---------------D------------------------------------------RDRYYINDIPPEVKEACA 99 (172) Q Consensus 77 ~g~~---------------~------------------------------------------~~~~~~d~IP~~V~~A~~ 99 (172) .++. | ..|++ +.+|..+|+|.. T Consensus 70 ~~i~Lp~~PV~sV~sV~~~d~~G~~~~l~~~~y~l~~~~~~~~l~~~~~~~~p~~~~V~V~~~AG~~-~~vP~~ik~ail 148 (188) T protein:vir:80 70 AEISLSVGQVIRVDSIEIRDASGATTTLDADAFELVQLGREALLVPEGQARWPFARAVTITYQAGVD-LARYPSVRTWML 148 (188) T ss_pred CceEecccccceeeEEEEEcCCCcEEeecccceEEeecCCCcEEEEecCCCCCCCceEEEEEEeccc-ccChHHHHHHHH Confidence 2110 0 01111 246667777777 Q ss_pred HHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhhccccCC Q lcl|Aclame:pro 100 EYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVRSG 166 (172) Q Consensus 100 elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~g~~~~~ 166 (172) .++.....+- +.+ ..+.+...-...++++||.|+=++-|. T Consensus 149 l~va~~Ye~R------------------e~~---------~~g~~~~~~P~~~v~~Ll~pyRvp~~~ 188 (188) T protein:vir:80 149 LAAAWAYDHR------------------ELF---------SEGQPIGEMPGGYADVLLNPITVPPRF 188 (188) T ss_pred HHHHHHHhcc------------------ccc---------ccccccccccHHHHHHHhhccCCCCCC Confidence 6665443321 111 000000111134578888877555555 No 25 >protein:vir:79074 Length: 150 # NCBI annotation: gp10, conserved hypothetical protein # Family: family:all:348 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111210;genbank:gi:134288797;genbank:GeneID:4960748 Probab=76.58 E-value=0.066 Score=27.06 Aligned_cols=128 Identities=13% Similarity=0.109 Sum_probs=60.0 Q ss_pred cccccHHHHHHHHHHcC---------c-----cccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCC Q lcl|Aclame:pro 16 NAYISVEEFKTYHTDRG---------N-----SFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWD 81 (172) Q Consensus 16 nSY~sva~a~aY~~~rg---------~-----~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~ 81 (172) =+|+|++|..+.+...- . +....+++-.+++|..|+..||+. .+.| . T Consensus 1 M~Y~T~~Dl~~r~ge~~l~~Ltd~~~~~~~~~~~~~~d~~~i~~Al~dA~~~IDgy---L~~R-Y--------------- 61 (150) T protein:vir:79 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESAVRQAEEIVDAH---LRGR-Y--------------- 61 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccccccccccccCHHHHHHHHHHHHHHHHHH---Hhhh-c--------------- Confidence 79999999998875321 1 112335566799999999999984 3333 1 Q ss_pred CCccccccchHHHHHHHHHHHHHHHhccCCCC---ccccccccc---ceeeEEeeCceEEEeeccCCC-CCCCCcHHHHH Q lcl|Aclame:pro 82 RDRYYINDIPPEVKEACAEYALRALAAELNPD---PERNASGVA---VLSKSEAVGPISESVTFVGGA-VFQMPKYPAAD 154 (172) Q Consensus 82 ~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~~---~~~~~~~~~---~~vk~~kvG~i~~~y~~~~~~-~~~~p~~~~v~ 154 (172) .+|...+|..|+..||-+|.+.|....... .+....+=. ...+...-|.++.--...+.. ..+...+ . T Consensus 62 --~lPl~~vP~~L~~~a~dIA~Y~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~v---~ 136 (150) T protein:vir:79 62 --NLPLSPVPTVIKDVTVNLARHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPSGPATPEPGEMKV---R 136 (150) T ss_pred --cCCcccccHHHHHHHHHHHHHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCccCCCCCCceee---e Confidence 135567899999999999988886532110 000000000 000011114333321110000 0000000 0 Q ss_pred HHhhhhccccCCcccccC Q lcl|Aclame:pro 155 QKLVRAGLVRSGGTLLRG 172 (172) Q Consensus 155 ~lL~~~g~~~~~g~~~r~ 172 (172) .-=..|+. -..|| T Consensus 137 ~~~r~f~r-----~~l~g 149 (150) T protein:vir:79 137 ARRRQFDA-----DLLER 149 (150) T ss_pred cCCCccCh-----hhccC Confidence 00000100 01122 No 26 >protein:vir:2505 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:28222 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569746;genbank:gi:18496896;genbank:GeneID:932265 Probab=76.45 E-value=0.075 Score=26.76 Aligned_cols=125 Identities=15% Similarity=0.067 Sum_probs=72.9 Q ss_pred CCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCccccccch Q lcl|Aclame:pro 12 VAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYINDIP 91 (172) Q Consensus 12 ~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~~d~IP 91 (172) -..-.+++|+.+..+-+. ..+...++.+...+|-.|+|-|.+. -|.+. ..+.+| T Consensus 1 ~~~~~alAtvdDv~~~lr---r~Lt~dE~~~a~~Ll~eAsdlI~g~-l~~~~----------------------vp~~~p 54 (128) T protein:vir:25 1 MTECKALATSQDVKRALR---RDLTEAEQTDLSELLAEATDLVVGY-LHPYP----------------------VPTPTP 54 (128) T ss_pred CccchhccCHHHHHHHhc---CCCCHHHHHHHHHHHhcchheeeee-cCCCC----------------------CCCCCC Confidence 245577888888776432 2344333444567788999999875 23331 135688 Q ss_pred HHHHHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhh--ccccCCccc Q lcl|Aclame:pro 92 PEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRA--GLVRSGGTL 169 (172) Q Consensus 92 ~~V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~--g~~~~~g~~ 169 (172) ..|+.-+|..+..+|..+...-+ ..++..-|+.+++|..++.+. ..---..-..+|.|. |.+.-+--. T Consensus 55 ~~v~rVvA~ivarAltr~~~~~p---------e~~S~TAgpfs~~ft~~~~~~-g~yLTaa~k~~Lrp~R~~~~sV~l~s 124 (128) T protein:vir:25 55 GPIKRVVASMVAAVLTRPTQILP---------ETQSLTADGFGVTFTPGGNSP-GPYLSAALKQRLRPYRTGMVAVEMGS 124 (128) T ss_pred chHHHHHHHHHHHHhhCCCccCC---------CceeeecccccccccCCCCCC-CceEcHHHHhhcccccceeeEeeccc Confidence 88999999999888887654332 234667799998886544332 212123345567653 222222222 Q ss_pred ccC Q lcl|Aclame:pro 170 LRG 172 (172) Q Consensus 170 ~r~ 172 (172) +|= T Consensus 125 ery 127 (128) T protein:vir:25 125 ERY 127 (128) T ss_pred ccC Confidence 232 No 27 >protein:vir:103846 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938245;genbank:gi:38229150;genbank:GeneID:2648161 Probab=76.32 E-value=0.019 Score=30.06 Aligned_cols=126 Identities=12% Similarity=0.089 Sum_probs=59.9 Q ss_pred cccccHHHHHHHHHHc--------Cc-cccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCccc Q lcl|Aclame:pro 16 NAYISVEEFKTYHTDR--------GN-SFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYY 86 (172) Q Consensus 16 nSY~sva~a~aY~~~r--------g~-~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~ 86 (172) =+|+|.++..+.+... .. .....+++-.+++|..|+..||+. .+.| . .+| T Consensus 1 M~Y~T~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~~eIdgy---L~~R-Y-----------------~lP 59 (138) T protein:vir:10 1 MSYCTQADLVEQYGEASIRQLSDRVNKPATTIDPAVVAQAIADADAEIDLH---LHAR-Y-----------------QLP 59 (138) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccCCCcCccCHHHHHHHHHHHHHHHHHH---Hhhc-c-----------------cCC Confidence 5799999999876433 11 123446666899999999999983 3334 1 245 Q ss_pred cccchHHHHHHHHHHHHHHHhccCCCCccccc--ccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhhcccc Q lcl|Aclame:pro 87 INDIPPEVKEACAEYALRALAAELNPDPERNA--SGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVR 164 (172) Q Consensus 87 ~d~IP~~V~~A~~elA~~~l~~~l~~~~~~~~--~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~g~~~ 164 (172) ...+|.-|+..||-+|.+.|.....++..... ...-...+...-|.++.--........+...+ .+ .. T Consensus 60 l~~vP~~L~~~a~dIA~Y~L~~~~~~~e~~~~rY~~Ai~~L~~Ia~G~~~Lg~~~~~~~~~~~~~~-~~---------~s 129 (138) T protein:vir:10 60 LAQVPVVLKRVACVLAFANLHTQVKDDHPAILDAERKRKLLGGISSGKLSLALTSSGTPAPIANTV-QI---------SS 129 (138) T ss_pred ccccchHHHHHHHHHHHHHHhcCCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCcccCCCCCce-ee---------ec Confidence 56789999999999999988754322110000 00000000111133333211110000000000 00 00 Q ss_pred CCcccccC Q lcl|Aclame:pro 165 SGGTLLRG 172 (172) Q Consensus 165 ~~g~~~r~ 172 (172) .+-...|. T Consensus 130 ~~r~Fg~d 137 (138) T protein:vir:10 130 QRNDFGGT 137 (138) T ss_pred CCccCCCC Confidence 00000011 No 28 >protein:vir:1640 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695061;genbank:gi:23455752;genbank:GeneID:955488 Probab=75.70 E-value=0.14 Score=25.20 Aligned_cols=129 Identities=16% Similarity=0.132 Sum_probs=68.9 Q ss_pred ccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCcccc-ccchHH Q lcl|Aclame:pro 15 ANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYI-NDIPPE 93 (172) Q Consensus 15 AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~~-d~IP~~ 93 (172) -+.|+|++|..+-| | ..+.......+.+|-.|+++|-..++=.|+. .-+|+. +. +..+.- T Consensus 1 m~~fAtv~Dv~~r~--r--~L~~~E~~ra~~lL~dAs~~ir~~~p~~~~~------l~a~~~---------e~~~~~~~~ 61 (132) T protein:vir:16 1 MNPFATVDDLTMLW--R--PLKGDEKERAEKLLEIVSDSLREEADKVGRD------LYAMIA---------EKPSYFASV 61 (132) T ss_pred CCccCCHHHHHHHh--c--CCCHhHHHHHHHHHHHHHHHHHHhhhhhccc------cccccc---------cccccchhH Confidence 68999999998765 3 3333222356889999999998765322221 111221 11 223344 Q ss_pred HHHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhhccccCCcccccC Q lcl|Aclame:pro 94 VKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVRSGGTLLRG 172 (172) Q Consensus 94 V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~g~~~~~g~~~r~ 172 (172) ++.-+|+...+++..+-... +....++..|+.+.+..+.+.. +..-.-..-..+|+. +.-|-|.+-.-| T Consensus 62 ~~~V~~~~V~Ral~~~~~~~--------G~tq~S~TaG~ys~S~t~~~p~-G~lylt~~e~~~LG~-~~~r~~~i~~~~ 130 (132) T protein:vir:16 62 VKSVTVDIVARTLMTSTDQE--------PMTQTTESALGYSVSGSYLVPG-GGLFIKNSELSRLGL-KKQRFGVIDFYG 130 (132) T ss_pred HHHHHHHHHHHHhcCCCCCC--------CceeeeeeccchheeeeeecCC-CcceeChHHHHhhCC-CCCceEEEeecC Confidence 56777888888877642211 1223567889985554433322 232222334445532 223334444444 No 29 >protein:vir:107864 Length: 150 # NCBI annotation: gp36 # Family: family:all:348 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024709;genbank:gi:48696946;genbank:GeneID:2845952 Probab=74.31 E-value=0.084 Score=26.47 Aligned_cols=128 Identities=13% Similarity=0.111 Sum_probs=59.8 Q ss_pred cccccHHHHHHHHHHcC---------c-----cccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCC Q lcl|Aclame:pro 16 NAYISVEEFKTYHTDRG---------N-----SFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWD 81 (172) Q Consensus 16 nSY~sva~a~aY~~~rg---------~-----~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~ 81 (172) =+|+|++|..+.+...- . +....+++-.+++|..|+..||+. .+.| . T Consensus 1 M~Y~T~~Dl~~r~ge~el~~Ltd~~~~g~~~~~~~~~d~~~i~~Al~dA~~eIDgY---L~~R-Y--------------- 61 (150) T protein:vir:10 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESSVRQAEEIVDAH---LRGR-Y--------------- 61 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccCccccchhhcCHHHHHHHHHHHHHHHHHH---Hhhh-c--------------- Confidence 79999999988775321 1 112345666789999999999984 3333 1 Q ss_pred CCccccccchHHHHHHHHHHHHHHHhccCCCC---cccccccccc---eeeEEeeCceEEEeeccCC-CCCCCCcHHHHH Q lcl|Aclame:pro 82 RDRYYINDIPPEVKEACAEYALRALAAELNPD---PERNASGVAV---LSKSEAVGPISESVTFVGG-AVFQMPKYPAAD 154 (172) Q Consensus 82 ~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~~---~~~~~~~~~~---~vk~~kvG~i~~~y~~~~~-~~~~~p~~~~v~ 154 (172) .+|...+|..|+..||-+|.+.|....... .+....+=.. ..+...-|.++..-...+. ...+...+ . T Consensus 62 --~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~v---~ 136 (150) T protein:vir:10 62 --NLPLSPVPTVIKDVTVNLARHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPSGPATPEPGEMKV---R 136 (150) T ss_pred --cCCcccccHHHHHHHHHHHHHHHHhcccccCCCCHHHHHHHHHHHHHHHHHhcCcccCCCCCCCCCCCCceeee---e Confidence 134567899999999999988886532110 1100000000 0001111433332211000 00000000 0 Q ss_pred HHhhhhccccCCcccccC Q lcl|Aclame:pro 155 QKLVRAGLVRSGGTLLRG 172 (172) Q Consensus 155 ~lL~~~g~~~~~g~~~r~ 172 (172) .-=..|+. -..|| T Consensus 137 ~~~r~f~r-----~~l~g 149 (150) T protein:vir:10 137 ARRRQFDA-----DLLER 149 (150) T ss_pred cCCCccCh-----hhccC Confidence 00000000 01122 No 30 >protein:vir:1887 Length: 108 # NCBI annotation: gp6 # Family: family:all:6764 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037667;genbank:gi:9634125;genbank:GeneID:1262472 Probab=72.21 E-value=0.19 Score=24.59 Aligned_cols=107 Identities=21% Similarity=0.124 Sum_probs=63.2 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCC Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAW 80 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~ 80 (172) |++-+ =+++|+++++.|..--. ..+|+-.+..+..|.+|+. .|.|++.. + T Consensus 1 ~~~~~----------M~~vtLee~K~hLRid~----dddD~lI~~~i~AA~~~v~---~~~~~~~~--------~----- 50 (108) T protein:vir:18 1 MAIDV----------LDVISLSLFKQQIEFEE----DDRDELITLYAQAAFDYCM---RWCDEPAW--------K----- 50 (108) T ss_pred CCCCc----------ccccCHHHHHHHcCCCC----CcchHHHHHHHHHHHHHHH---HHhCCccc--------c----- Confidence 66544 47999999999965321 1356666777777777776 35554421 0 Q ss_pred CCCccccccchHHHHHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhh Q lcl|Aclame:pro 81 DRDRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRA 160 (172) Q Consensus 81 ~~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~ 160 (172) ....+|..|+.|+..++-....+ |+.+.. .. ... .+.++.||.|+ T Consensus 51 -----~~~~~p~~ik~AiLllv~~~Yen------------------RE~~~~---------~~-~~~--~~~~~~LL~pY 95 (108) T protein:vir:18 51 -----VAADIPAAVKGAVLLVFADMFEH------------------RTAQSE---------VQ-LYE--NAAAERMMFIH 95 (108) T ss_pred -----cccccchHHHHHHHHHHHHHHhc------------------cccccc---------ch-hhh--hHHHHHHHHHH Confidence 12457888888877766544332 222210 10 011 24688999877 Q ss_pred ccccCCcccccC Q lcl|Aclame:pro 161 GLVRSGGTLLRG 172 (172) Q Consensus 161 g~~~~~g~~~r~ 172 (172) =-.+|---.+-| T Consensus 96 R~~~g~~~~~~~ 107 (108) T protein:vir:18 96 RNWRGKAESEEG 107 (108) T ss_pred HhcCCCCCcccC Confidence 556665556666 No 31 >protein:vir:192 Length: 108 # NCBI annotation: Gp6 # Family: family:all:6764 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037702;genbank:gi:9634167;genbank:GeneID:1262532 Probab=72.21 E-value=0.19 Score=24.59 Aligned_cols=107 Identities=21% Similarity=0.124 Sum_probs=63.2 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCC Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAW 80 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~ 80 (172) |++-+ =+++|+++++.|..--. ..+|+-.+..+..|.+|+. .|.|++.. + T Consensus 1 ~~~~~----------M~~vtLee~K~hLRid~----dddD~lI~~~i~AA~~~v~---~~~~~~~~--------~----- 50 (108) T protein:vir:19 1 MAIDV----------LDVISLSLFKQQIEFEE----DDRDELITLYAQAAFDYCM---RWCDEPAW--------K----- 50 (108) T ss_pred CCCCc----------ccccCHHHHHHHcCCCC----CcchHHHHHHHHHHHHHHH---HHhCCccc--------c----- Confidence 66544 47999999999965321 1356666777777777776 35554421 0 Q ss_pred CCCccccccchHHHHHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhh Q lcl|Aclame:pro 81 DRDRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRA 160 (172) Q Consensus 81 ~~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~ 160 (172) ....+|..|+.|+..++-....+ |+.+.. .. ... .+.++.||.|+ T Consensus 51 -----~~~~~p~~ik~AiLllv~~~Yen------------------RE~~~~---------~~-~~~--~~~~~~LL~pY 95 (108) T protein:vir:19 51 -----VAADIPAAVKGAVLLVFADMFEH------------------RTAQSE---------VQ-LYE--NAAAERMMFIH 95 (108) T ss_pred -----cccccchHHHHHHHHHHHHHHhc------------------cccccc---------ch-hhh--hHHHHHHHHHH Confidence 12457888888877766544332 222210 10 011 24688999877 Q ss_pred ccccCCcccccC Q lcl|Aclame:pro 161 GLVRSGGTLLRG 172 (172) Q Consensus 161 g~~~~~g~~~r~ 172 (172) =-.+|---.+-| T Consensus 96 R~~~g~~~~~~~ 107 (108) T protein:vir:19 96 RNWRGKAESEEG 107 (108) T ss_pred HhcCCCCCcccC Confidence 556665556666 No 32 >protein:vir:1993 Length: 141 # NCBI annotation: Hypothetical protein # Family: family:all:348 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050640;genbank:gi:9633527;genbank:GeneID:2636296 Probab=71.16 E-value=0.013 Score=30.90 Aligned_cols=127 Identities=14% Similarity=0.109 Sum_probs=62.1 Q ss_pred cccccHHHHHHHHHHcCc--------cccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCcccc Q lcl|Aclame:pro 16 NAYISVEEFKTYHTDRGN--------SFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYI 87 (172) Q Consensus 16 nSY~sva~a~aY~~~rg~--------~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~~ 87 (172) =+|+|.++..+.+...-. .....+++-.+++|..|+..||+. .+.| . .+|. T Consensus 1 M~Y~T~~Dl~~~~ge~~l~~Lt~d~~~~g~~d~~~i~~Al~dA~~eIdgy---L~~R-Y-----------------~lPl 59 (141) T protein:vir:19 1 MNYATVNDLCARYTRTRLDILTRPKTADGQPDDAVAEQALADASAFIDGY---LAAR-F-----------------VLPL 59 (141) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcCCCCccccCHHHHHHHHHHHHHHHHHH---Hhhc-c-----------------cCCc Confidence 679999999987753311 122335566799999999999983 3334 1 2355 Q ss_pred ccchHHHHHHHHHHHHHHHhccCCCCccccccccc-ceeeEEeeCceEEEeeccCCC---CCCCCcHHHHHHHhhhhccc Q lcl|Aclame:pro 88 NDIPPEVKEACAEYALRALAAELNPDPERNASGVA-VLSKSEAVGPISESVTFVGGA---VFQMPKYPAADQKLVRAGLV 163 (172) Q Consensus 88 d~IP~~V~~A~~elA~~~l~~~l~~~~~~~~~~~~-~~vk~~kvG~i~~~y~~~~~~---~~~~p~~~~v~~lL~~~g~~ 163 (172) ..+|.-|+..||-+|.+.|.+...+++...--..+ ...+...-|.++.--...+.. +.....+ .. .+.-|. T Consensus 60 ~~~P~~L~~~a~dIA~Y~L~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~~~~---~~--~~r~f~ 134 (141) T protein:vir:19 60 TVVPSLLKRQCCVVAWFYLNESQPTEQITATYRDTVRWLEQVRDGKTDPGVESRTAASPEGEDLVQV---QS--DPPVFS 134 (141) T ss_pred cccchHHHHHHHHHHHHHHhcCCCChHHHHHHHHHHHHHHHHhcCccccCCCCCCCCCCCCCceeEe---ec--CCcccC Confidence 67899999999999998887654322110000000 000111124444422111110 0000111 00 001111 Q ss_pred cCCcccccC Q lcl|Aclame:pro 164 RSGGTLLRG 172 (172) Q Consensus 164 ~~~g~~~r~ 172 (172) | -.|| T Consensus 135 r----~~~G 139 (141) T protein:vir:19 135 R----KQKG 139 (141) T ss_pred c----cccc Confidence 1 1233 No 33 >protein:vir:94064 Length: 167 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453623;genbank:gi:84662659;genbank:GeneID:5142574 Probab=66.41 E-value=0.27 Score=23.72 Aligned_cols=129 Identities=19% Similarity=0.120 Sum_probs=57.4 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHH-HHhhhhhhccCccccCccccccccccCC Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRAT-DYLDQRFNFVGKKRLGRDQTTEWPRTDA 79 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~As-dyid~~~~~~G~r~~~~~Q~lawPR~g~ 79 (172) |..+|-| ++.+.+-+=+.. +.+|+..+..|..|. -++|.. +|...+ + T Consensus 1 M~~~~Fd-------------~~~FR~~fPeFa----~~Pd~~i~~~l~~A~~~~l~~~-~~s~~~--~------------ 48 (167) T protein:vir:94 1 MAVVVFD-------------PTAFKLVYPEFV----AVPDARLTALFNTVGYTILDNT-DASVIV--D------------ 48 (167) T ss_pred CCcccCC-------------hHHHHHhchhcc----cCCHHHHHHHHHHHHHhhcCCC-Cccccc--c------------ Confidence 7766643 444444443332 235677888888774 456643 222211 0 Q ss_pred CCCCccccccchHHHHHHHHHHHHHHHh--ccCCCCc-ccccccccceeeEEeeCceEEEeeccCCCCCC-----CCcH- Q lcl|Aclame:pro 80 WDRDRYYINDIPPEVKEACAEYALRALA--AELNPDP-ERNASGVAVLSKSEAVGPISESVTFVGGAVFQ-----MPKY- 150 (172) Q Consensus 80 ~~~~~~~~d~IP~~V~~A~~elA~~~l~--~~l~~~~-~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~-----~p~~- 150 (172) ...-+++-..++.+.|. +-...+. ..........++|+++|.+||+|+.....+.. ...| T Consensus 49 -----------~~~~~~~l~LltAHll~L~~~~~a~~~~~~~~g~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YG 117 (167) T protein:vir:94 49 -----------PLRRAPLLDLLVAHMLALFGYVNADGSITPGTGTVGRVANASEGSVSTSLAYSTPTGAGEAWFTQTPYG 117 (167) T ss_pred -----------hhhHHHHHHHHHHHHHHHhhhhhhhcccccccccchheeeccccceeeeeecCCCCCchhhhhhcCHHH Confidence 01111222223322221 1011110 11122234568999999999999765443321 1222 Q ss_pred ---HHHHHHhhhhccccCCcc---------------cccC Q lcl|Aclame:pro 151 ---PAADQKLVRAGLVRSGGT---------------LLRG 172 (172) Q Consensus 151 ---~~v~~lL~~~g~~~~~g~---------------~~r~ 172 (172) =.+..+++..|.+.+|.+ ..|| T Consensus 118 q~fwaL~~~~g~Gg~v~gG~~~~~~~~~~~r~vgg~f~~~ 157 (167) T protein:vir:94 118 AMYWAMSAPFRSFHYVAAGLSGVGYSQDYLSTYAGVETRL 157 (167) T ss_pred HHHHHHHHHhcccccccCCCCCCCCCccccCccceeEeec Confidence 122333333344444433 2222 No 34 >protein:vir:100245 Length: 113 # NCBI annotation: gp74 # Family: family:all:363 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355410;genbank:gi:77864700;genbank:GeneID:3725967 Probab=66.01 E-value=0.27 Score=23.66 Aligned_cols=113 Identities=11% Similarity=0.060 Sum_probs=56.5 Q ss_pred cccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCccccccchHHHH Q lcl|Aclame:pro 16 NAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYINDIPPEVK 95 (172) Q Consensus 16 nSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~~d~IP~~V~ 95 (172) =+++|+++++.+..-... .||+-.+..+..|.+++.. |.|++ .-..|.- |+-.......+.....||+.++ T Consensus 1 M~~vtLee~K~hLRvd~d----~dD~lI~~li~AA~~~ve~---~l~r~-l~~~~~~-~~~~~~~~~~~~~~~~~p~~i~ 71 (113) T protein:vir:10 1 MALVELKLALGFVRANAG----VEDDVVQMLLDAATQSAVD---YLNRQ-VFETEDA-MTTAIEAGTAGQNPMVVNAAIR 71 (113) T ss_pred CCCCCHHHHHHHcCCCCC----cchHHHHHHHHHHHHHHHH---HhCcc-ccccccc-cccccccccccccccccChHHH Confidence 578899999998764432 2566677777777777763 55554 2223322 2211111111223345899999 Q ss_pred HHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhhccccCCcc Q lcl|Aclame:pro 96 EACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVRSGGT 168 (172) Q Consensus 96 ~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~g~~~~~g~ 168 (172) .|+..+.-....+ . +.+. .++....| ..++.||.|+= .-+|. T Consensus 72 ~AvLllv~~~Y~n-----R-------------e~~~---------~~~~~~lP--~~v~~Ll~~yR--~~~g~ 113 (113) T protein:vir:10 72 AAILKITAELYAN-----R-------------EDTA---------FGPITELP--LNARALLRPHR--IIPGV 113 (113) T ss_pred HHHHHHHHHHHhh-----h-------------hhhc---------hhhhhccC--HHHHHHHHHhh--hhcCC Confidence 8888776544332 1 1110 01111122 23566775531 11222 No 35 >protein:vir:99222 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950460;genbank:gi:119953661;genbank:GeneID:4643082 Probab=65.65 E-value=0.26 Score=23.75 Aligned_cols=125 Identities=14% Similarity=0.142 Sum_probs=59.1 Q ss_pred cccccHHHHHHHHHHc--------Cc-cccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCccc Q lcl|Aclame:pro 16 NAYISVEEFKTYHTDR--------GN-SFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYY 86 (172) Q Consensus 16 nSY~sva~a~aY~~~r--------g~-~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~ 86 (172) =+|+|.++..+.+... .. .....+++-.+++|..|+..||+. .+.| . .+| T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgY---L~~R-Y-----------------~lP 59 (138) T protein:vir:99 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLH---LHGR-Y-----------------QLP 59 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHH---Hhhc-c-----------------cCC Confidence 6799999999876432 11 123345666899999999999984 3333 1 245 Q ss_pred cccchHHHHHHHHHHHHHHHhccCCCCccccc--ccccceeeEEeeCceEEEeeccCCC--CCCCCcHHHHHHHhhhhcc Q lcl|Aclame:pro 87 INDIPPEVKEACAEYALRALAAELNPDPERNA--SGVAVLSKSEAVGPISESVTFVGGA--VFQMPKYPAADQKLVRAGL 162 (172) Q Consensus 87 ~d~IP~~V~~A~~elA~~~l~~~l~~~~~~~~--~~~~~~vk~~kvG~i~~~y~~~~~~--~~~~p~~~~v~~lL~~~g~ 162 (172) ...+|..|+..||-+|.+.|.+....+..... ...-...+...-|.++.--...... +.....|..= +.-| T Consensus 60 l~~vP~~L~~~a~dIA~Y~L~~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~~~~~~-----~r~F 134 (138) T protein:vir:99 60 LASVPTALKRIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEG-----RNDW 134 (138) T ss_pred ccccchHHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcCCCCCCceeeecC-----CCCC Confidence 66799999999999999998764322110000 0000000001113222211100000 0000000000 0000 Q ss_pred ccCC Q lcl|Aclame:pro 163 VRSG 166 (172) Q Consensus 163 ~~~~ 166 (172) .|-- T Consensus 135 ~Rd~ 138 (138) T protein:vir:99 135 GADW 138 (138) T ss_pred CCCC Confidence 0000 No 36 >protein:vir:79253 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469165;genbank:gi:157835007;genbank:GeneID:5648834 Probab=65.65 E-value=0.26 Score=23.75 Aligned_cols=125 Identities=14% Similarity=0.142 Sum_probs=59.1 Q ss_pred cccccHHHHHHHHHHc--------Cc-cccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCccc Q lcl|Aclame:pro 16 NAYISVEEFKTYHTDR--------GN-SFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYY 86 (172) Q Consensus 16 nSY~sva~a~aY~~~r--------g~-~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~ 86 (172) =+|+|.++..+.+... .. .....+++-.+++|..|+..||+. .+.| . .+| T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgY---L~~R-Y-----------------~lP 59 (138) T protein:vir:79 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLH---LHGR-Y-----------------QLP 59 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHH---Hhhc-c-----------------cCC Confidence 6799999999876432 11 123345666899999999999984 3333 1 245 Q ss_pred cccchHHHHHHHHHHHHHHHhccCCCCccccc--ccccceeeEEeeCceEEEeeccCCC--CCCCCcHHHHHHHhhhhcc Q lcl|Aclame:pro 87 INDIPPEVKEACAEYALRALAAELNPDPERNA--SGVAVLSKSEAVGPISESVTFVGGA--VFQMPKYPAADQKLVRAGL 162 (172) Q Consensus 87 ~d~IP~~V~~A~~elA~~~l~~~l~~~~~~~~--~~~~~~vk~~kvG~i~~~y~~~~~~--~~~~p~~~~v~~lL~~~g~ 162 (172) ...+|..|+..||-+|.+.|.+....+..... ...-...+...-|.++.--...... +.....|..= +.-| T Consensus 60 l~~vP~~L~~~a~dIA~Y~L~~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~~~~~~-----~r~F 134 (138) T protein:vir:79 60 LASVPTALKRIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEG-----RNDW 134 (138) T ss_pred ccccchHHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcCCCCCCceeeecC-----CCCC Confidence 66799999999999999998764322110000 0000000001113222211100000 0000000000 0000 Q ss_pred ccCC Q lcl|Aclame:pro 163 VRSG 166 (172) Q Consensus 163 ~~~~ 166 (172) .|-- T Consensus 135 ~Rd~ 138 (138) T protein:vir:79 135 GADW 138 (138) T ss_pred CCCC Confidence 0000 No 37 >protein:vir:10365 Length: 115 # NCBI annotation: conserved phage protein # Family: family:all:363 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858957;genbank:gi:32128422;genbank:GeneID:2648387 Probab=57.41 E-value=0.43 Score=22.57 Aligned_cols=112 Identities=15% Similarity=-0.009 Sum_probs=55.1 Q ss_pred cccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCccccccchHHHHHH Q lcl|Aclame:pro 18 YISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYINDIPPEVKEA 97 (172) Q Consensus 18 Y~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~~d~IP~~V~~A 97 (172) -+|+++++.+..--. .-...+|+-.+..+..|.+|+.. |.|++.....-.+..+..+. ..+.....||..++.| T Consensus 1 mvtLe~~K~hLRid~-~d~d~dD~li~~~i~AA~~~v~~---~~~r~l~~~~~~~~~~~~~~--~~~~~~~~~p~~i~~A 74 (115) T protein:vir:10 1 MITLAMVQRHLQAEL-YEDDERDYVMQQLLPAARESAEL---FINRKLYDTQADMLADQAAG--VDPAGQLLITRTVEQA 74 (115) T ss_pred CCCHHHHHHHcCCCC-CCCchhhHHHHHHHHHHHHHHHH---HhCCcccccccccccccccc--cCCcccccCChHHHHH Confidence 899999999873211 12222455567777788787763 55655321111222222111 1111123489999998 Q ss_pred HHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhh---cccc Q lcl|Aclame:pro 98 CAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRA---GLVR 164 (172) Q Consensus 98 ~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~---g~~~ 164 (172) ...+.-....+ .+. ..+|. ....| -.++.||.|+ |-+| T Consensus 75 iLLlvg~~Y~n-----Re~-----------~~~~~-----------~~elP--~~v~~LL~pyR~~~gv~ 115 (115) T protein:vir:10 75 ILLTVGEWYAN-----REQ-----------VWVKG-----------VGLVT--SSAQNLLHPYRKFAGVR 115 (115) T ss_pred HHHHHHHHHhc-----chh-----------cccch-----------hhhcC--HHHHHHHHHHHhcCCCC Confidence 88777544332 110 11111 11122 2256666543 4444 No 38 >protein:vir:96108 Length: 155 # NCBI annotation: hypothetical protein ORF025 # Family: family:all:664 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294442;genbank:gi:149408339;genbank:GeneID:5237227 Probab=56.54 E-value=0.45 Score=22.46 Aligned_cols=129 Identities=13% Similarity=0.053 Sum_probs=60.6 Q ss_pred CCcccccccHHHHHHHHHHcCc--cccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCcccccc Q lcl|Aclame:pro 12 VAGANAYISVEEFKTYHTDRGN--SFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYIND 89 (172) Q Consensus 12 ~~~AnSY~sva~a~aY~~~rg~--~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~~d~ 89 (172) ++.. +++++.+.+=+... .+| |+..+..|..|..+|+.. ++ ..+.. | T Consensus 1 ~v~f----d~~~FR~~fPeFad~~~~P---d~~i~~~l~~A~~~l~~~-~~-~s~~~-------~--------------- 49 (155) T protein:vir:96 1 MVIF----DEQKFRTLFPEFADPASYP---AVRLQLYFDIACEFISDR-DS-PYRIL-------N--------------- 49 (155) T ss_pred Cccc----CHHHHHHhCccccCcccCC---HHHHHHHHHHHHHhhcCC-Cc-ccccc-------C--------------- Confidence 1121 23444443333332 243 788899999999999743 21 11100 1 Q ss_pred chHHHHHHHHHHHHHHHhccCC------CCcccccccccceeeEEeeCceEEEeeccCCCCCC-----CCcHH-HHHHHh Q lcl|Aclame:pro 90 IPPEVKEACAEYALRALAAELN------PDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQ-----MPKYP-AADQKL 157 (172) Q Consensus 90 IP~~V~~A~~elA~~~l~~~l~------~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~-----~p~~~-~v~~lL 157 (172) .....++.+.++.+.|.-... ...-...+...+.++|+++|.+||+|+.....++. ...|- -.-+|+ T Consensus 50 -g~~~~~~l~Ll~AH~l~L~~~~~~gaa~~g~~~~g~~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~f~~l~ 128 (155) T protein:vir:96 50 -GKALEACLYLLTAHLLSLSTMQVQGAAGGGVTAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALL 128 (155) T ss_pred -hHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccccceeeceecceeeeeecCCCCCchhHHhhcCHHHHHHHHHH Confidence 122334555554444321100 00001122345678999999999999865443321 22232 223333 Q ss_pred h---hhccccCCcccccC Q lcl|Aclame:pro 158 V---RAGLVRSGGTLLRG 172 (172) Q Consensus 158 ~---~~g~~~~~g~~~r~ 172 (172) . ..|.+.+|-|-..+ T Consensus 129 ~~~~~Gg~~vgG~per~~ 146 (155) T protein:vir:96 129 SVKAVGGFYIGGLPERRG 146 (155) T ss_pred HHhcccccccCCCCcccc Confidence 3 33444444444444 No 39 >protein:vir:3639 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705634;genbank:gi:23752319;genbank:GeneID:955737 Probab=43.47 E-value=0.84 Score=20.99 Aligned_cols=131 Identities=18% Similarity=0.171 Sum_probs=56.9 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHH-HHhhhhhhccCccccCccccccccccCC Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRAT-DYLDQRFNFVGKKRLGRDQTTEWPRTDA 79 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~As-dyid~~~~~~G~r~~~~~Q~lawPR~g~ 79 (172) |. .-|++=+=|..+|.+.+.- ..+.+|+..+..|..|. -++|.. .|.-.+ + T Consensus 1 ~~------------~~~~~v~Fd~a~FR~~fPe-Fa~~pd~~i~~~~~~A~~~~l~n~-~~s~~~--~------------ 52 (158) T protein:vir:36 1 MS------------TPPYRITFDPAGFIAEYPE-FATVPTPRLQAMFNQAQAALLDNT-GGSPVT--D------------ 52 (158) T ss_pred CC------------CCCceEEcChHHHHHhCcc-cccCCHHHHHHHHHhhhheeeCCc-cccccc--C------------ Confidence 22 1233333333444444421 12245778888888884 466654 221111 0 Q ss_pred CCCCccccccchHHHHHHHHHHHHHHHhccCCCC-cccccccccceeeEEeeCceEEEeeccCC-CCC-----CCCcHH- Q lcl|Aclame:pro 80 WDRDRYYINDIPPEVKEACAEYALRALAAELNPD-PERNASGVAVLSKSEAVGPISESVTFVGG-AVF-----QMPKYP- 151 (172) Q Consensus 80 ~~~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~~-~~~~~~~~~~~vk~~kvG~i~~~y~~~~~-~~~-----~~p~~~- 151 (172) +.+-+..-..++.+.|. |... .....+...+.++|.++|.+||+|+.... .+. ....|- T Consensus 53 -----------~~~r~~ll~LltAHll~--L~~~~~~g~~~g~vG~vsSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~ 119 (158) T protein:vir:36 53 -----------DNVLRELFNMLVAHLLT--LFSAAPTSANSRPPGRLSSATEGTVSSSFEYALPQGSAIAPWYNQTQYGA 119 (158) T ss_pred -----------hHHHHHHHHHHHHHHHH--HhhhhhcccccCcccceeeeeeCceeEEeeeccccCCchhhhhhcCHHHH Confidence 11112222233323222 1111 11111222257899999999999975332 221 122231 Q ss_pred ---HHHHHhhhhccccCCcccccC Q lcl|Aclame:pro 152 ---AADQKLVRAGLVRSGGTLLRG 172 (172) Q Consensus 152 ---~v~~lL~~~g~~~~~g~~~r~ 172 (172) .+..+++..|.+.+|++-..+ T Consensus 120 ~fw~L~~~~~~Gg~v~Gg~pe~~~ 143 (158) T protein:vir:36 120 MFWMATAHYRSARYMVSGGSGIGT 143 (158) T ss_pred HHHHHHHhhCccccccccCCcccc Confidence 223333444555555544333 No 40 >protein:vir:101559 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958112;genbank:gi:41057658;genbank:GeneID:2716816 Probab=43.47 E-value=0.84 Score=20.99 Aligned_cols=131 Identities=18% Similarity=0.171 Sum_probs=56.9 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHH-HHhhhhhhccCccccCccccccccccCC Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRAT-DYLDQRFNFVGKKRLGRDQTTEWPRTDA 79 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~As-dyid~~~~~~G~r~~~~~Q~lawPR~g~ 79 (172) |. .-|++=+=|..+|.+.+.- ..+.+|+..+..|..|. -++|.. .|.-.+ + T Consensus 1 ~~------------~~~~~v~Fd~a~FR~~fPe-Fa~~pd~~i~~~~~~A~~~~l~n~-~~s~~~--~------------ 52 (158) T protein:vir:10 1 MS------------TPPYRITFDPAGFIAEYPE-FATVPTPRLQAMFNQAQAALLDNT-GGSPVT--D------------ 52 (158) T ss_pred CC------------CCCceEEcChHHHHHhCcc-cccCCHHHHHHHHHhhhheeeCCc-cccccc--C------------ Confidence 22 1233333333444444421 12245778888888884 466654 221111 0 Q ss_pred CCCCccccccchHHHHHHHHHHHHHHHhccCCCC-cccccccccceeeEEeeCceEEEeeccCC-CCC-----CCCcHH- Q lcl|Aclame:pro 80 WDRDRYYINDIPPEVKEACAEYALRALAAELNPD-PERNASGVAVLSKSEAVGPISESVTFVGG-AVF-----QMPKYP- 151 (172) Q Consensus 80 ~~~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~~-~~~~~~~~~~~vk~~kvG~i~~~y~~~~~-~~~-----~~p~~~- 151 (172) +.+-+..-..++.+.|. |... .....+...+.++|.++|.+||+|+.... .+. ....|- T Consensus 53 -----------~~~r~~ll~LltAHll~--L~~~~~~g~~~g~vG~vsSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~ 119 (158) T protein:vir:10 53 -----------DNVLRELFNMLVAHLLT--LFSAAPTSANSRPPGRLSSATEGTVSSSFEYALPQGSAIAPWYNQTQYGA 119 (158) T ss_pred -----------hHHHHHHHHHHHHHHHH--HhhhhhcccccCcccceeeeeeCceeEEeeeccccCCchhhhhhcCHHHH Confidence 11112222233323222 1111 11111222257899999999999975332 221 122231 Q ss_pred ---HHHHHhhhhccccCCcccccC Q lcl|Aclame:pro 152 ---AADQKLVRAGLVRSGGTLLRG 172 (172) Q Consensus 152 ---~v~~lL~~~g~~~~~g~~~r~ 172 (172) .+..+++..|.+.+|++-..+ T Consensus 120 ~fw~L~~~~~~Gg~v~Gg~pe~~~ 143 (158) T protein:vir:10 120 MFWMATAHYRSARYMVSGGSGIGT 143 (158) T ss_pred HHHHHHHhhCccccccccCCcccc Confidence 223333444555555544333 No 41 >protein:vir:3034 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438147;genbank:gi:16271810;genbank:GeneID:929268 Probab=41.77 E-value=0.7 Score=21.44 Aligned_cols=103 Identities=20% Similarity=0.079 Sum_probs=51.9 Q ss_pred HHHHHHHHhhhhh-hccCccccCccccccccccCCCCCCccccccchHHHHHHHHHHHHHHHhccCCCCcccccccccce Q lcl|Aclame:pro 46 AVIRATDYLDQRF-NFVGKKRLGRDQTTEWPRTDAWDRDRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVL 124 (172) Q Consensus 46 aL~~Asdyid~~~-~~~G~r~~~~~Q~lawPR~g~~~~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~ 124 (172) ++-+|..-||... .|.-.. +-+-...| | =.++|.|.|.--.+ ++.-+..+... ... T Consensus 1 L~k~A~~~Id~~t~~fY~~~--dle~D~~~-R--------------~~~fK~Aia~QI~Y-ld~~G~~t~~d-----~~s 57 (111) T protein:vir:30 1 MEKRASHAVNLYCRNRYDYK--DLKKEIAL-V--------------QKAVKRAIAYQIAY-LNDSGVMTAED-----KQS 57 (111) T ss_pred CchhhHHHHhHhhchhhhhh--hHHHHHHH-H--------------HHHHHHHHHHHHHH-HHhcCCCChhh-----ccC Confidence 7888999999754 232111 11111222 2 14566665544322 22222222221 234 Q ss_pred eeEEeeCceEEEeeccCCCCCC----CCcHHHH---HHHhhhhccccCCccccc Q lcl|Aclame:pro 125 SKSEAVGPISESVTFVGGAVFQ----MPKYPAA---DQKLVRAGLVRSGGTLLR 171 (172) Q Consensus 125 vk~~kvG~i~~~y~~~~~~~~~----~p~~~~v---~~lL~~~g~~~~~g~~~r 171 (172) .++..||-.+++|....+.++. .-+|-.. ..+|..+|+.-.|=.--| T Consensus 58 ~~SisvGrTsiS~~~~~~~~~~~~~t~~~~~l~~da~n~L~~~Glly~GV~yd~ 111 (111) T protein:vir:30 58 FAGISLGRTSISYTVGHGQGSQQKTLADRFNLCLDAENELLVVGLGYTGISYDR 111 (111) T ss_pred cceeeecceeeeccCccCCCCccccccccccchHHHHHHHHhhccccccccccC Confidence 7899999999999633222211 1233333 347877777665544455 No 42 >protein:vir:97069 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453566;genbank:gi:84662601;genbank:GeneID:5142484 Probab=40.01 E-value=0.99 Score=20.61 Aligned_cols=111 Identities=14% Similarity=-0.033 Sum_probs=50.3 Q ss_pred cccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccc-cccccCCCCCCccccccchHHHHH Q lcl|Aclame:pro 18 YISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTT-EWPRTDAWDRDRYYINDIPPEVKE 96 (172) Q Consensus 18 Y~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~l-awPR~g~~~~~~~~~d~IP~~V~~ 96 (172) -+|+++++.+..--.. -...||.-.+..+..|.+++. +|.|++. -..|.. .++. .....+-....||+.|+. T Consensus 1 mvtLee~K~hLRid~d-~~d~DDali~~~i~AA~~~v~---~~l~r~l-~~~~~~~~~~~--~~~~~~~~~~~~p~~i~~ 73 (115) T protein:vir:97 1 MITLAMMQRHLQAELY-EDDERDYVMQQLLPAARESAE---LFLNRKL-YDVQADMLADQ--VLGVDPSDQLLITRTVEQ 73 (115) T ss_pred CCCHHHHHHHcCCCCC-CCchhhHHHHHHHHHHHHHHH---HHhCCcc-cchhhcccccc--cccCCCcccccCCHHHHH Confidence 8999999998732211 111123345555566666555 3555442 122211 1111 001111112248999988 Q ss_pred HHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhh---cccc Q lcl|Aclame:pro 97 ACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRA---GLVR 164 (172) Q Consensus 97 A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~---g~~~ 164 (172) |...+.-. +..|. |.+.. ++.+..| -.+..||.|+ +-+| T Consensus 74 AiLllvg~-----~Y~NR-------------E~v~~---------~~~~elP--~~~~~LL~pyR~~~Gv~ 115 (115) T protein:vir:97 74 AILLTVGE-----WYSSR-------------EQVWI---------KGAGLVT--SSAQNLLHPYRKFAGVR 115 (115) T ss_pred HHHHHHHH-----HHhcc-------------ccccc---------ccccccC--HHHHHHHHHHHhhcCCC Confidence 88777644 33322 22200 0111122 2356666553 4444 No 43 >protein:vir:8104 Length: 170 # NCBI annotation: gp8 # Family: family:all:3238 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817685;genbank:gi:29566116;genbank:GeneID:1259310 Probab=32.78 E-value=1.4 Score=19.78 Aligned_cols=118 Identities=10% Similarity=0.000 Sum_probs=65.8 Q ss_pred HcCccccCCCHHHHHHHHHHHHHHhhhhhhccCcccc-Cc----------------------------cccc-------- Q lcl|Aclame:pro 30 DRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRL-GR----------------------------DQTT-------- 72 (172) Q Consensus 30 ~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~-~~----------------------------~Q~l-------- 72 (172) -||.-- ++++-+.+|..|+.-+.+.+.|+=.+.+ +. .+++ T Consensus 1 ~~~~~a---~~~~~q~~l~aA~a~vR~~cGwhv~P~v~d~t~~ldg~G~~vl~LPt~pvvsV~sV~~~G~~l~~~~~~~~ 77 (170) T protein:vir:81 1 MRGQFA---DNTEAQAAIDAVLAAARRWCGWHVSPVIIDDVMEVDGPGGRVLSLPTLNLVSVKSVVELGYALDVSTLDRS 77 (170) T ss_pred Cccccc---CchHHHHHHHHHHHHHHHHhCCcccceecccEEEEeCCCCeeEECCCCcceeeEEEEECCeeecCccceee Confidence 455432 3444456777777766665544311100 00 1112 Q ss_pred -----------cccc--cCC--CCCCccccccchHHHHHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEe Q lcl|Aclame:pro 73 -----------EWPR--TDA--WDRDRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESV 137 (172) Q Consensus 73 -----------awPR--~g~--~~~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y 137 (172) .||| .++ ....|+..+.+|..+..-.|.+|-++.... ....+++.+++.++.+| T Consensus 78 ~~~glL~r~~G~~~~~~~~V~VT~tHGy~~~~apd~~~~vi~~~a~r~~~s~-----------~~~~l~~~~~~~vs~~~ 146 (170) T protein:vir:81 78 RRKGTLTKPYGRWTARDGAIVVTATHGFTETEAADWRRAVVQLVGRRAQTSR-----------PSADLKRKKVDDVEYEW 146 (170) T ss_pred cCCceEEecCCccccccceEEEEEEeCCCCCccchHHHHHHHHHHHHhhccC-----------Ccccceeeeccceeeee Confidence 2555 122 144567778899999998888887654421 12236788999999998 Q ss_pred eccCCCCCCCCcHHHHHHHhhhhccccCC Q lcl|Aclame:pro 138 TFVGGAVFQMPKYPAADQKLVRAGLVRSG 166 (172) Q Consensus 138 ~~~~~~~~~~p~~~~v~~lL~~~g~~~~~ 166 (172) .. .+... -+.-.++|.++=+...+ T Consensus 147 ~~--~~~s~---~~~~~~iL~~Yrl~~~p 170 (170) T protein:vir:81 147 FE--TAVSV---DAELSAVFSPFRILPSP 170 (170) T ss_pred cc--ccccc---CHHHHHhhhhcccCCCC Confidence 62 22111 13345577776666666 No 44 >protein:vir:104344 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398973;genbank:gi:81343957;genbank:GeneID:3778876 Probab=32.22 E-value=1.2 Score=20.12 Aligned_cols=96 Identities=14% Similarity=0.035 Sum_probs=38.4 Q ss_pred hhhhhhccCccccCccccccccccCCCCCCccc-cccchHHHHHHHHHHH---------------------HHHHhccCC Q lcl|Aclame:pro 54 LDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYY-INDIPPEVKEACAEYA---------------------LRALAAELN 111 (172) Q Consensus 54 id~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~-~d~IP~~V~~A~~elA---------------------~~~l~~~l~ 111 (172) +| +-.+++=|+= +| -..+|++|+++=+|+| +.++.-|.. T Consensus 1 ~~-------------~~~~e~~R~l------~P~f~kvpdevI~~wielA~lfVc~~~~g~~~~~AlaL~taHLm~~dga 61 (132) T protein:vir:10 1 MN-------------DAILAFMRSL------VPALKAVDDESINVWIDLARLYVCADKFGNDADRAVGLYALHLMLSDGA 61 (132) T ss_pred Cc-------------hHHHHHHHHh------cchhhcCChHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHhhcccc Confidence 00 0111111110 11 1356666666655554 334433333 Q ss_pred CCcccccccccceeeEEee------CceEEEeeccCCCCCC---CCcHHHHHHHhh----hhccccCCcccccC Q lcl|Aclame:pro 112 PDPERNASGVAVLSKSEAV------GPISESVTFVGGAVFQ---MPKYPAADQKLV----RAGLVRSGGTLLRG 172 (172) Q Consensus 112 ~~~~~~~~~~~~~vk~~kv------G~i~~~y~~~~~~~~~---~p~~~~v~~lL~----~~g~~~~~g~~~r~ 172 (172) ... ++.....-+.+| |+++++|...+..++- .|-=-....|+. .||+...++.=-.| T Consensus 62 ~k~----en~~~~t~S~rvaS~Sl~Ge~Sisf~~~sa~~s~L~~tp~Gkl~~~L~k~~~GgfgL~t~~~~~~cg 131 (132) T protein:vir:10 62 FKG----ENEGLETYSRRMASYSLSGEFSITYDNQSAIQGDLSSSSWGRMYKALLRKKGGGFGLITSAAGGGCG 131 (132) T ss_pred ccc----cccchhhhhhhhhhhcccCceeeecccccccccccccCcHHHHHHHHHHhccCccccccccCcCCCC Confidence 322 222223334444 9999999854432221 222233444553 34333333222222 No 45 >protein:vir:8430 Length: 189 # NCBI annotation: gp25 # Family: family:all:3238 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818326;genbank:gi:29566762;genbank:GeneID:1260021 Probab=28.86 E-value=1.7 Score=19.31 Aligned_cols=138 Identities=16% Similarity=0.110 Sum_probs=67.0 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCcccc-------------- Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRL-------------- 66 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~-------------- 66 (172) |+--. |.++.+.|. ||.... ++..-+.+|..|..-+-+.|.|.=.+.. T Consensus 1 ~~~el--------------~p~~~~~~~--~g~l~A--dd~~~q~aLdAA~a~vRr~CGWHV~PV~~dtv~vdg~G~svL 62 (189) T protein:vir:84 1 MAEEL--------------TPEDVDTYT--QGRIDK--DDPETARALAAALSRARRACGWHVTPVVESTVRLHGSGHDFI 62 (189) T ss_pred Ccccc--------------cchhcchhh--cceecc--CChhHHHHHHHHHHHHHHhhCCcccceeeeeEEEcCCCCceE Confidence 43211 112222221 221111 1222233444444333333322111100 Q ss_pred -----------------------------C-----ccccccccccCCC--CCCccccccch-HHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 67 -----------------------------G-----RDQTTEWPRTDAW--DRDRYYINDIP-PEVKEACAEYALRALAAE 109 (172) Q Consensus 67 -----------------------------~-----~~Q~lawPR~g~~--~~~~~~~d~IP-~~V~~A~~elA~~~l~~~ 109 (172) + .-+.-.|||.++. ...|++. +| .++..+.|.+|-++ T Consensus 63 ~LPTlrlvsV~sVt~dG~~~~~~~v~~~~~~~Gll~r~~Gw~~~g~I~VT~tHGy~~--~pa~di~~vv~~mA~rA---- 136 (189) T protein:vir:84 63 VLPTLKPVELLSITEDGEEVDLDEVYFVSREPGVLYKKCGWWCRGPIEVTLTHGFTA--EEAGDFREVVLQAVDVA---- 136 (189) T ss_pred ecCCccceeeeeeeecCeecccccceeccCCcceeEeCCCcccCCeEEEEEEcCCCC--CCchhHHHHHHHHHHhh---- Confidence 0 0122245654331 3344543 35 48899999999554 Q ss_pred CCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhhccccCC Q lcl|Aclame:pro 110 LNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVRSG 166 (172) Q Consensus 110 l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~g~~~~~ 166 (172) ++.. ++......+++|||+|+..+..-.+.+-+.-.-+.++.+|.-|-++..- T Consensus 137 -~~~~---~~~~~g~~~~~~v~dv~~r~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 189 (189) T protein:vir:84 137 -NLMV---GTGATGPITGLEVDDVNMRWSGLVDRSWGIAKNPMLESVLYQYRLVAIA 189 (189) T ss_pred -hccC---CCcccccccceeecceeeehhhhcccccccccchHHHHHHhhhhhhccC Confidence 1111 2224556889999999999875445555556667888888877665544 No 46 >protein:vir:93592 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:1879 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449297;genbank:gi:157166045;uniprot:Q6H9U4;genbank:GeneID:5580414 Probab=28.24 E-value=1.8 Score=19.23 Aligned_cols=107 Identities=16% Similarity=0.103 Sum_probs=54.2 Q ss_pred ccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCc-cccccchHH Q lcl|Aclame:pro 15 ANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDR-YYINDIPPE 93 (172) Q Consensus 15 AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~-~~~d~IP~~ 93 (172) --.++|+++++.+..-.+. .||+..+..+..|.+|+-.. .+.+.. ...+..+ .....+|.. T Consensus 1 mm~~vtLeevK~hLRId~d----~dD~li~~~i~aA~~~v~~~---l~~~~~-----------~~~~~~~~~~~~~~~~~ 62 (108) T protein:vir:93 1 MTALLTLEEIKAHLRVDHD----ADDDMLMDKVRQATAVLLAY---IQGSRD-----------KVIREDGELIPGEALTR 62 (108) T ss_pred CCcCCCHHHHHHHcCCCCC----cChHHHHHHHHHHHHHHHHH---hccccc-----------cccccccccccccCChH Confidence 3456689999998764433 25666777777777777542 222210 0111111 123356788 Q ss_pred HHHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhhccccCCccc Q lcl|Aclame:pro 94 VKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVRSGGTL 169 (172) Q Consensus 94 V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~g~~~~~g~~ 169 (172) ++.|++.|.-....+ . +.++..++ ..+..| -.+..||.+ .|.+-.| T Consensus 63 i~~AvLlLv~~~Yen-----R-------------e~~~~~~~-------~~~elP--~~v~~Ll~~---~R~p~~~ 108 (108) T protein:vir:93 63 MKGAAMRLTGMLYRN-----P-------------DLAEREEL-------LQGELP--FSVSVLIYD---LRCPTVL 108 (108) T ss_pred HHHHHHHHHHHHHhc-----c-------------cccccccc-------ccccCC--HHHHHHHHH---ccccccC Confidence 888888777544333 2 22211100 001112 346667765 4555555 No 47 >protein:vir:4512 Length: 107 # NCBI annotation: unknown # Family: family:all:363 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599038;genbank:gi:19548996;genbank:GeneID:935218 Probab=24.03 E-value=2.2 Score=18.68 Aligned_cols=104 Identities=19% Similarity=0.183 Sum_probs=54.1 Q ss_pred ccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccccccccccCCCCCCccccccchHHHHH Q lcl|Aclame:pro 17 AYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYINDIPPEVKE 96 (172) Q Consensus 17 SY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~lawPR~g~~~~~~~~~d~IP~~V~~ 96 (172) =++|+++++.|..--+. +. .+|+-.+..+..|..|+. +|.|++ . .++.-.||-.+ . ..-.||..++. T Consensus 1 M~vtL~e~K~hLRId~D-~~-ddD~lI~~~i~AA~~~i~---~~~~r~-~-~~~~~~~~~~~---~---~~~~~~~~~~~ 67 (107) T protein:vir:45 1 MLLKMEEIKLQLRLDDD-FS-DEDELLELLGKAAQSRTE---NFLNRK-L-YATADDRPADD---P---DGLVISDDVKL 67 (107) T ss_pred CCCCHHHHHHHcCCCCC-Cc-hhHHHHHHHHHHHHHHHH---HHhccc-c-ccccccccccc---c---ccccCChhHHH Confidence 78999999998754322 22 244456777788888887 466765 2 33444454321 1 11236888888 Q ss_pred HHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhh---hcc Q lcl|Aclame:pro 97 ACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVR---AGL 162 (172) Q Consensus 97 A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~---~g~ 162 (172) |+..+.-....+ . +.+... +. ...| ..+..||.+ .++ T Consensus 68 AvLllv~~~Y~N-----R-------------e~~~~~--------~~-~~lp--~~v~~Ll~~~R~~~~ 107 (107) T protein:vir:45 68 ALLLLVSHFYEN-----R-------------STVTDV--------EK-MELP--MSFNWLVAPYRLIPL 107 (107) T ss_pred HHHHHHHHHHhh-----h-------------hhcccc--------ch-hccc--hHHHHHHHHHhhcCC Confidence 887776543332 1 111000 00 0112 224555543 344 No 48 >protein:vir:486 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543093;swissprot:trembl:q8w626;genbank:gi:18249905;uniprot:Q8W626;genbank:GeneID:929692 Probab=23.79 E-value=2.3 Score=18.64 Aligned_cols=101 Identities=11% Similarity=0.062 Sum_probs=51.5 Q ss_pred ccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCccc---cccccccCCCCCCccccccchHH Q lcl|Aclame:pro 17 AYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQ---TTEWPRTDAWDRDRYYINDIPPE 93 (172) Q Consensus 17 SY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q---~lawPR~g~~~~~~~~~d~IP~~ 93 (172) =++|+++++.|..--+. +. .||+-.+..+..|.+|+.. |.|++ ....+ +..||. .-.||.. T Consensus 1 M~vtL~e~K~hLRid~D-~~-ddD~li~~~i~aA~~~i~~---~~~r~-l~~~~~~~~~~~~~----------~~~~~~~ 64 (107) T protein:vir:48 1 MLLKEEEIKSHLRLDDG-LY-SDGDFLKLLAQAVQKRTET---YLNRK-LYAPEETIPEDDPD----------GMHLTDD 64 (107) T ss_pred CCCCHHHHHHHcCCCCC-Cc-hhHHHHHHHHHHHHHHHHH---Hhccc-cccccccccccCcc----------ccccchh Confidence 78899999998754322 22 2455567777777787763 55544 22211 122332 1247888 Q ss_pred HHHHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhh---cc Q lcl|Aclame:pro 94 VKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRA---GL 162 (172) Q Consensus 94 V~~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~---g~ 162 (172) ++.|+..+.-.. ..|. +.+..-+ ....| ..++.||.|+ ++ T Consensus 65 ik~Avlllv~~~-----Y~NR-------------e~v~~~~---------~~~iP--~~v~~LL~~yR~~~l 107 (107) T protein:vir:48 65 VRLAMLMLVSHF-----YENR-------------STITDVE---------KLETP--MSFRWLAGPYRIVPL 107 (107) T ss_pred HHHHHHHHHHHH-----Hhhh-------------hhhcccc---------ccccC--HHHHHHHHHhhccCC Confidence 888777766543 3322 2111000 01112 2355566543 44 No 49 >protein:vir:78595 Length: 158 # NCBI annotation: BcepNY3gp07 # Family: family:all:664 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294844;genbank:gi:149882907;genbank:GeneID:5291066 Probab=23.55 E-value=2.3 Score=18.61 Aligned_cols=131 Identities=20% Similarity=0.177 Sum_probs=55.7 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHH-HhhhhhhccCccccCccccccccccCC Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATD-YLDQRFNFVGKKRLGRDQTTEWPRTDA 79 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asd-yid~~~~~~G~r~~~~~Q~lawPR~g~ 79 (172) |. .-|++=+=|..+|.+.+. ...+..|+..+..|..|.. ++|.. .+...+ + T Consensus 1 ~~------------~~~~~v~Fd~a~FR~~fP-eFa~~pd~~i~~~~~~A~~~~~~~~-~~s~~~--~------------ 52 (158) T protein:vir:78 1 MS------------TPPYRITFDPAGFIAEYP-EFATVATPRLQAMFNQAQTALLDNT-GGSPVT--D------------ 52 (158) T ss_pred CC------------CCCceEEcChHHHHHhch-hhccCCHHHHHHHHHHhhhhhcCCC-cccccc--C------------ Confidence 22 122332223333433332 1122357788888888855 44532 111100 0 Q ss_pred CCCCccccccchHHHHHHHHHHHHHHHhccCCC-CcccccccccceeeEEeeCceEEEeeccCC-CCC-----CCCcHH- Q lcl|Aclame:pro 80 WDRDRYYINDIPPEVKEACAEYALRALAAELNP-DPERNASGVAVLSKSEAVGPISESVTFVGG-AVF-----QMPKYP- 151 (172) Q Consensus 80 ~~~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~-~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~-~~~-----~~p~~~- 151 (172) ..+-+.+...+..+.|. |.. +.....+.....+.|.++|.+||+|+.... .+. ....|- T Consensus 53 -----------~~~r~~ll~LltAHll~--L~~~~~~~a~~g~~G~isSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~ 119 (158) T protein:vir:78 53 -----------DNVLRELFNMLVAHLLT--LFGATPTSANSRPPGRLSSAAEGTVSSSFEFKLPEGSAIAPWYNQTQYGA 119 (158) T ss_pred -----------hhHHHHHHHHHHHHHHH--HhHhhhccccCCcccceeeeeecceEEEEeecCccccchhHHhhcCHHHH Confidence 11112222233323322 110 001112233457899999999999975432 211 122221 Q ss_pred ---HHHHHhhhhccccCCcccccC Q lcl|Aclame:pro 152 ---AADQKLVRAGLVRSGGTLLRG 172 (172) Q Consensus 152 ---~v~~lL~~~g~~~~~g~~~r~ 172 (172) .+..+++..|.+.+|++-..+ T Consensus 120 ~fwal~~~~~~Ggy~~gg~pe~~~ 143 (158) T protein:vir:78 120 MFWMATARYRSARYMVSGGSGIGT 143 (158) T ss_pred HHHHHHHHhcccccccccCCcccc Confidence 223333444555555544333 No 50 >protein:vir:106739 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944316;genbank:gi:38638615;genbank:GeneID:2657368 Probab=23.55 E-value=2.3 Score=18.61 Aligned_cols=131 Identities=20% Similarity=0.177 Sum_probs=55.7 Q ss_pred CeeEeecCCCCCCcccccccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHH-HhhhhhhccCccccCccccccccccCC Q lcl|Aclame:pro 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATD-YLDQRFNFVGKKRLGRDQTTEWPRTDA 79 (172) Q Consensus 1 M~liVedgtg~~~~AnSY~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asd-yid~~~~~~G~r~~~~~Q~lawPR~g~ 79 (172) |. .-|++=+=|..+|.+.+. ...+..|+..+..|..|.. ++|.. .+...+ + T Consensus 1 ~~------------~~~~~v~Fd~a~FR~~fP-eFa~~pd~~i~~~~~~A~~~~~~~~-~~s~~~--~------------ 52 (158) T protein:vir:10 1 MS------------TPPYRITFDPAGFIAEYP-EFATVATPRLQAMFNQAQTALLDNT-GGSPVT--D------------ 52 (158) T ss_pred CC------------CCCceEEcChHHHHHhch-hhccCCHHHHHHHHHHhhhhhcCCC-cccccc--C------------ Confidence 22 122332223333433332 1122357788888888855 44532 111100 0 Q ss_pred CCCCccccccchHHHHHHHHHHHHHHHhccCCC-CcccccccccceeeEEeeCceEEEeeccCC-CCC-----CCCcHH- Q lcl|Aclame:pro 80 WDRDRYYINDIPPEVKEACAEYALRALAAELNP-DPERNASGVAVLSKSEAVGPISESVTFVGG-AVF-----QMPKYP- 151 (172) Q Consensus 80 ~~~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~-~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~-~~~-----~~p~~~- 151 (172) ..+-+.+...+..+.|. |.. +.....+.....+.|.++|.+||+|+.... .+. ....|- T Consensus 53 -----------~~~r~~ll~LltAHll~--L~~~~~~~a~~g~~G~isSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~ 119 (158) T protein:vir:10 53 -----------DNVLRELFNMLVAHLLT--LFGATPTSANSRPPGRLSSAAEGTVSSSFEFKLPEGSAIAPWYNQTQYGA 119 (158) T ss_pred -----------hhHHHHHHHHHHHHHHH--HhHhhhccccCCcccceeeeeecceEEEEeecCccccchhHHhhcCHHHH Confidence 11112222233323322 110 001112233457899999999999975432 211 122221 Q ss_pred ---HHHHHhhhhccccCCcccccC Q lcl|Aclame:pro 152 ---AADQKLVRAGLVRSGGTLLRG 172 (172) Q Consensus 152 ---~v~~lL~~~g~~~~~g~~~r~ 172 (172) .+..+++..|.+.+|++-..+ T Consensus 120 ~fwal~~~~~~Ggy~~gg~pe~~~ 143 (158) T protein:vir:10 120 MFWMATARYRSARYMVSGGSGIGT 143 (158) T ss_pred HHHHHHHHhcccccccccCCcccc Confidence 223333444555555544333 No 51 >protein:vir:81069 Length: 115 # NCBI annotation: p10 # Family: family:all:363 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285680;genbank:gi:148727188;genbank:GeneID:5247114 Probab=22.92 E-value=2.4 Score=18.52 Aligned_cols=110 Identities=14% Similarity=-0.019 Sum_probs=49.8 Q ss_pred cccHHHHHHHHHHcCccccCCCHHHHHHHHHHHHHHhhhhhhccCccccCcccc-ccccccCC-CCCCccccccchHHHH Q lcl|Aclame:pro 18 YISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQT-TEWPRTDA-WDRDRYYINDIPPEVK 95 (172) Q Consensus 18 Y~sva~a~aY~~~rg~~w~~~~~~~ke~aL~~Asdyid~~~~~~G~r~~~~~Q~-lawPR~g~-~~~~~~~~d~IP~~V~ 95 (172) -+|+++++.+..--.. -...||+-.+..+..|.+++. +|.+++.. ..|. +.++..+. .+..+ -.||+.|+ T Consensus 1 ivtLee~K~HlRid~d-d~deDD~li~~~i~AA~~~v~---~~l~r~l~-~~~~~~~~~~~~~~~~~~~---~~~p~~i~ 72 (115) T protein:vir:81 1 MITLAMVQRHLQAELY-EDDERDYVMQQLLPAARESAE---LFINRKLY-DTQADMLADQAAGVDPAGQ---LLITRTVE 72 (115) T ss_pred CCCHHHHHHHcCCCCC-CCccchHHHHHHHHHHHHHHH---HHhCCccc-cccccccccccccCCCCcc---cccCHHHH Confidence 8999999998732211 111134444555555555444 34444421 2221 22222221 11111 23899999 Q ss_pred HHHHHHHHHHHhccCCCCcccccccccceeeEEeeCceEEEeeccCCCCCCCCcHHHHHHHhhhh---cccc Q lcl|Aclame:pro 96 EACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRA---GLVR 164 (172) Q Consensus 96 ~A~~elA~~~l~~~l~~~~~~~~~~~~~~vk~~kvG~i~~~y~~~~~~~~~~p~~~~v~~lL~~~---g~~~ 164 (172) .|+..+.-.. ..|. |.|- .++....| ..+..||.|+ .-+| T Consensus 73 ~AiLllvg~~-----Y~NR-------------E~v~---------~~~~~elP--~~~~~LL~pyR~~~g~~ 115 (115) T protein:vir:81 73 QAILLTLGEW-----YSSR-------------EQVW---------TKGAGLVT--SSAQNLLHPYRKFAGVR 115 (115) T ss_pred HHHHHHHHHH-----Hhcc-------------chhc---------chhhhhcC--HHHHHHHHHHHhhcCCC Confidence 8887776443 3322 2220 01111123 2357777654 2233 No 52 >protein:vir:99848 Length: 172 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164077;genbank:gi:56692609;genbank:GeneID:3192576 Probab=21.25 E-value=2.6 Score=18.28 Aligned_cols=126 Identities=14% Similarity=0.022 Sum_probs=55.8 Q ss_pred ccccccHHHHHHHHHHc-------------------------C--cccc-------CCCHHHHHHHHHHHHHHhhhhhhc Q lcl|Aclame:pro 15 ANAYISVEEFKTYHTDR-------------------------G--NSFA-------GSTDPQIEAAVIRATDYLDQRFNF 60 (172) Q Consensus 15 AnSY~sva~a~aY~~~r-------------------------g--~~w~-------~~~~~~ke~aL~~Asdyid~~~~~ 60 (172) ---|+|++|..+-+..+ . ..|. ..+++-.+.+|..|+..||+... T Consensus 1 ~~mYaT~~dl~~r~g~~el~qLt~~~~~~~~~~~l~d~~~~~~~~~~~~~~~~~~g~~d~~~i~~Al~dA~aeIDgYL~- 79 (172) T protein:vir:99 1 MAVYITLPELAERPGAEELSQAATPRPLQAVDSELLDALLRGLPVDRWTPEEIEVGHATVEVINSAVSDAQGYIDGFLQ- 79 (172) T ss_pred CcccccHHHHHhhcCHHHHHHHhccccccCCHHHHHHHhhcchhhhhcccccccccccCHHHHHHHHHHHHHHHHHHHh- Confidence 23455666555443211 1 1122 23566689999999999998532 Q ss_pred cCccccCccccccccccCCCCCCccccccchHHHHHHHHHHHHHHHhccCCC---Cccccccccc---ceeeEEeeCceE Q lcl|Aclame:pro 61 VGKKRLGRDQTTEWPRTDAWDRDRYYINDIPPEVKEACAEYALRALAAELNP---DPERNASGVA---VLSKSEAVGPIS 134 (172) Q Consensus 61 ~G~r~~~~~Q~lawPR~g~~~~~~~~~d~IP~~V~~A~~elA~~~l~~~l~~---~~~~~~~~~~---~~vk~~kvG~i~ 134 (172) | | . ..+|...+|.-|+..||-+|.+.|...-.. ..+....+=. ...+...-|.++ T Consensus 80 -~-R-~----------------Y~lPL~~vP~~L~~~a~dIArY~L~~~r~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~ 140 (172) T protein:vir:99 80 -R-R-G----------------YSLPLAKRYGVVTGWTRAIARYLLHQDRLGPGAEKDPIVRDYRDALKFLQLIAEGKFS 140 (172) T ss_pred -c-c-c----------------ccCCCcccchHHHHHHHHHHHHHHHhccCCcccCCHHHHHHHHHHHHHHHHHhcCccc Confidence 2 2 0 124667899999999999999988753210 0110000000 001111114433 Q ss_pred EEeecc-CCCCCCCCcHH-----HHHHHhhhh Q lcl|Aclame:pro 135 ESVTFV-GGAVFQMPKYP-----AADQKLVRA 160 (172) Q Consensus 135 ~~y~~~-~~~~~~~p~~~-----~v~~lL~~~ 160 (172) .--... ...++....+. +-..-|..| T Consensus 141 Lg~~~~~~~~~~~~~~v~~~~r~F~rd~L~gf 172 (172) T protein:vir:99 141 LGPDDPLTPPGGGVPQVLAPARTFSHDTLKDY 172 (172) T ss_pred cCCCCCCCCCCCCceeeecCCCccChhhccCC Confidence 321100 01111111110 111222111 No 53 >protein:vir:103283 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277463;genbank:gi:71834105;genbank:GeneID:3562394 Probab=21.14 E-value=2.6 Score=18.26 Aligned_cols=90 Identities=19% Similarity=0.087 Sum_probs=41.8 Q ss_pred CCCcccc-ccchHHHHHHHHHHH---------------------HHHHhccCCCCc-ccccccccceeeEEe-eCceEEE Q lcl|Aclame:pro 81 DRDRYYI-NDIPPEVKEACAEYA---------------------LRALAAELNPDP-ERNASGVAVLSKSEA-VGPISES 136 (172) Q Consensus 81 ~~~~~~~-d~IP~~V~~A~~elA---------------------~~~l~~~l~~~~-~~~~~~~~~~vk~~k-vG~i~~~ 136 (172) .+.-+|. -.+|++|++|=+|+| +.++.-|..... .....+-..++++-+ .|+++++ T Consensus 1 mR~l~P~f~~vpdevi~~wid~A~lFVC~~~fg~~~~~Al~lytlHLm~~dga~k~e~~~~~~~s~r~~s~slsGE~Sit 80 (125) T protein:vir:10 1 MRTLYPPLKSQPDDVLNAWIEVAKLFICLDKFGDKQVQALAFYTLHLLSQDIALKTENDSSQTSSERVKSYSLSGEYTIS 80 (125) T ss_pred CccccchhhccCHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhcccccccccccccccccceeeeeeccceEee Confidence 2222332 457999998888775 222223331111 111222334577777 5999999 Q ss_pred eeccCCCC--C---CCCcHHHHHHHhhhhccccCCcccccC Q lcl|Aclame:pro 137 VTFVGGAV--F---QMPKYPAADQKLVRAGLVRSGGTLLRG 172 (172) Q Consensus 137 y~~~~~~~--~---~~p~~~~v~~lL~~~g~~~~~g~~~r~ 172 (172) |....... + ..|-=-+...|+.. -+-+.|.|-+| T Consensus 81 ~~~~s~d~s~~~L~~T~wGk~~~~L~k~--~~GgFaL~T~~ 119 (125) T protein:vir:10 81 YDTSTAAASSSNLEESSWGKLYIDLMRL--KVGRWGLITSG 119 (125) T ss_pred cccccccccccccccCchHHHHHHHHHh--cCCceeeeccc Confidence 97543222 1 12322344455531 12222334333 Done!