Query lcl|Aclame:protein:vir:103760|NCBI_annot:hypothetical protein|genbank:acc:YP_024931;genbank:gi:48697201;genbank:GeneID:2846084 Match_columns 207 No_of_seqs 100 out of 105 Neff 6.7 Searched_HMMs 1612 Date Sat Nov 30 23:14:41 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_4 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_4_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:103760 Length: 207 100.0 2.6E-86 1.6E-89 489.8 19.3 207 1-207 1-207 (207) 2 protein:vir:107429 Length: 223 100.0 1.1E-76 6.6E-80 437.1 18.3 206 1-207 1-219 (223) 3 protein:vir:107803 Length: 223 100.0 1.1E-76 6.6E-80 437.1 18.3 206 1-207 1-219 (223) 4 protein:vir:98502 Length: 223 100.0 1.1E-76 6.6E-80 437.1 18.3 206 1-207 1-219 (223) 5 protein:vir:7328 Length: 201 # 100.0 1E-74 6.4E-78 426.2 17.4 193 1-195 1-201 (201) 6 protein:vir:95323 Length: 201 100.0 2.2E-73 1.4E-76 418.9 16.5 193 1-195 1-201 (201) 7 protein:vir:100027 Length: 204 99.7 3.8E-19 2.4E-22 121.6 17.4 190 1-206 1-204 (204) 8 protein:vir:103305 Length: 245 99.7 1.3E-18 7.8E-22 118.7 17.6 194 1-207 13-230 (245) 9 protein:vir:99676 Length: 197 99.7 9.3E-18 5.8E-21 114.0 17.5 178 1-196 12-197 (197) 10 protein:vir:7020 Length: 246 # 99.6 2.7E-17 1.7E-20 111.4 17.4 194 1-207 22-231 (246) 11 protein:vir:80215 Length: 211 99.6 3.6E-17 2.2E-20 110.7 17.9 186 1-207 1-203 (211) 12 protein:vir:94712 Length: 188 99.6 3.5E-17 2.1E-20 110.8 17.0 173 1-194 1-188 (188) 13 protein:vir:97033 Length: 245 99.6 6.7E-17 4.2E-20 109.2 17.3 194 1-207 22-231 (245) 14 protein:vir:105646 Length: 245 99.6 6.9E-17 4.3E-20 109.2 17.3 194 1-207 22-231 (245) 15 protein:vir:6325 Length: 184 # 99.6 4.8E-17 3E-20 110.1 16.3 177 2-198 1-184 (184) 16 protein:vir:3365 Length: 196 # 99.3 2.9E-13 1.8E-16 89.3 16.9 179 1-196 10-196 (196) 17 protein:vir:1542 Length: 196 # 99.3 3.2E-13 2E-16 89.1 17.0 179 1-196 10-196 (196) 18 protein:vir:94565 Length: 196 99.3 6E-13 3.7E-16 87.6 17.1 179 1-201 10-196 (196) 19 protein:vir:8886 Length: 195 # 99.2 1.4E-12 8.9E-16 85.5 17.3 175 1-193 10-195 (195) 20 protein:vir:10451 Length: 196 99.2 1.8E-12 1.1E-15 85.0 16.9 179 1-196 10-196 (196) 21 protein:vir:78929 Length: 184 99.2 1.8E-12 1.1E-15 85.0 16.2 177 2-202 1-184 (184) 22 protein:vir:2202 Length: 196 # 99.2 2.7E-12 1.7E-15 84.0 16.8 178 1-196 10-196 (196) 23 protein:vir:78741 Length: 197 99.2 4.3E-12 2.6E-15 82.9 17.4 184 1-207 1-197 (197) 24 protein:vir:351 Length: 242 # 97.9 1.8E-06 1.1E-09 52.1 13.6 182 1-207 1-231 (242) 25 protein:vir:1780 Length: 67 # 96.9 3.7E-06 2.3E-09 50.4 5.3 61 1-62 1-67 (67) 26 protein:vir:105380 Length: 160 92.3 0.012 7.4E-06 31.1 12.6 145 1-207 4-155 (160) 27 protein:vir:176 Length: 160 # 92.2 0.012 7.7E-06 31.0 12.6 145 1-207 4-155 (160) 28 protein:vir:80185 Length: 221 91.6 0.007 4.3E-06 32.4 9.0 181 1-194 1-221 (221) 29 protein:vir:3130 Length: 250 # 88.9 0.029 1.8E-05 29.0 11.0 181 1-201 1-250 (250) 30 protein:vir:105524 Length: 166 86.4 0.046 2.8E-05 27.9 12.5 145 1-207 4-165 (166) 31 protein:vir:3528 Length: 160 # 84.5 0.06 3.7E-05 27.3 12.4 144 1-205 4-160 (160) 32 protein:vir:9267 Length: 166 # 70.1 0.21 0.00013 24.3 10.7 148 1-207 3-160 (166) 33 protein:vir:100918 Length: 166 66.4 0.27 0.00017 23.7 11.1 149 1-207 3-160 (166) 34 protein:vir:2108 Length: 166 # 62.0 0.34 0.00021 23.1 10.9 149 1-207 3-160 (166) 35 protein:vir:94601 Length: 211 54.9 0.49 0.00031 22.3 11.2 178 1-200 1-211 (211) 36 protein:vir:95463 Length: 267 35.6 1.2 0.00076 20.1 11.3 197 1-199 1-267 (267) 37 protein:vir:95880 Length: 236 31.1 1.5 0.00094 19.6 8.3 194 1-197 1-236 (236) No 1 >protein:vir:103760 Length: 207 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024931;genbank:gi:48697201;genbank:GeneID:2846084 Probab=100.00 E-value=2.6e-86 Score=489.81 Aligned_cols=207 Identities=100% Similarity=1.456 Sum_probs=203.8 Q ss_pred CCCHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCCCCCCccccccCcccc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFSYQYRLPTDF 80 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~p~~~~~yaY~lP~Dc 80 (207) |+|+|+|||+||++||+++|+||+|+|++|++|++||+++||++||+|||+||+||++|++++++|++||.|+|+||.|| T Consensus 1 M~S~v~IcN~AL~~lGa~~I~s~~e~s~~A~~c~~~Y~~~r~~~L~~~pW~FA~~r~~La~~~~~P~~~~~yaY~LP~Dc 80 (207) T protein:vir:10 1 MASQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFSYQYRLPTDF 80 (207) T ss_pred CCCHHHHHHHHHHhhchhhhcccccCCHHHHHHHHhhHHHHHHHHhccChhhHhhhhhhcccccCCCCCCcccccCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEeeecccCccccccccccceEEeCCEEEecCCCceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhccCCHHHHHHH Q lcl|Aclame:pro 81 IRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQAPLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLTQSATKRQGA 160 (207) Q Consensus 81 lrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt~~~~~~~~l 160 (207) |||++|++++..+.....++|+|+|++|++|++++++|+||++|+|++.|||+|++||+|+||++||+|||+|.++++.+ T Consensus 81 lrv~~v~~~~~~~~~~~~~~~~v~g~~ll~~~~~~~~l~Y~~~v~d~~~fd~~F~~ala~~LAa~lA~pLt~~~~~~~~~ 160 (207) T protein:vir:10 81 IRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQAPLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLTQSATKRQGA 160 (207) T ss_pred eEeeeecCCCCccccccccceEecCCeEEecCCCcEEEEEeecCCChhhhhHHHHHHHHHHHHHHhhHhhcCChHHHHHH Confidence 99999999988877788889999999999999889999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccCCCCCCCcccccC Q lcl|Aclame:pro 161 WAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGVAFPGETPIIRS 207 (207) Q Consensus 161 ~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~~~~~~~~~~~~~ 207 (207) +|+|+++|++|+.+|++|++++++++++|+++|+|+.+||+.||||| T Consensus 161 ~q~~~~~l~~A~~~da~e~~~~~~~~~~~l~aR~~~~~~~~~~~~~~ 207 (207) T protein:vir:10 161 WAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGVAFPGETPIIRS 207 (207) T ss_pred HHHHHHHHHHHHhcccccCcccccCCcchhhhcccccccccCCcccC Confidence 99999999999999999999999999999999999999999999999 No 2 >protein:vir:107429 Length: 223 # NCBI annotation: Bbp14 # Family: family:all:1524 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958683;genbank:gi:41179375;genbank:GeneID:2717223 Probab=100.00 E-value=1.1e-76 Score=437.08 Aligned_cols=206 Identities=27% Similarity=0.376 Sum_probs=187.1 Q ss_pred CCCHHHHHHHHHHHhcchhcc---ccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCCCCCCccccccCc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRIT---SLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFSYQYRLP 77 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~---Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~p~~~~~yaY~lP 77 (207) |+|+|+|||+||++||+++|. |++|+|++|++|++||+++||++||+|||+||+||++|+++++ |++||.|+|+|| T Consensus 1 M~S~v~IcN~AL~~lG~~~i~~~~s~~E~s~~A~~C~~~Y~~~r~~~Lr~~pW~FA~~r~~La~~a~-p~~~~~yaY~LP 79 (223) T protein:vir:10 1 MASEVDICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELHTWGFATKCAQLAAMGI-SRPEWRFAYAQP 79 (223) T ss_pred CCCHHHHHHHHHHhcccchhhcccCCCCCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhhhhccc-CCCCcccccccc Confidence 999999999999999998765 6789999999999999999999999999999999999999885 667999999999 Q ss_pred ccceEeeecccCccccc---cccccceEEe----CCEEEecCCCceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhc Q lcl|Aclame:pro 78 TDFIRLLQVGQFDVYPR---TDTRGLFSIE----NGNILTDMQAPLYIRYAKRVTDPNAMDALFREAFACRLAAEACESL 150 (207) Q Consensus 78 ~Dclrv~~v~~~~~~~~---~~~~~~y~v~----g~~l~~~~~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pL 150 (207) .|||||++|++.+.... ......|+++ |+++|++++++++|+||.+|+|++.|||+|++||+|+||++||+|| T Consensus 80 ~Dclrv~~v~~~~~~~~~~~~~~~~~~~~e~~~~g~~~i~td~~~~~l~Y~~~v~d~~~fd~lF~~Ala~~LAa~lA~pL 159 (223) T protein:vir:10 80 ADAIKIVAVLPHDAANIEAGIDNAQPFSCEIDNTGADIILTNQVNAVARYISLVKDTTKFSPLFVQALAWHLASMLAGPL 159 (223) T ss_pred ccceeeeeeccccccccccccccccceEEeeccccceeeeecCCceEEEEeecCCChhcccHHHHHHHHHHHHHHhhHhh Confidence 99999999988765422 2334568886 6677777778999999999999999999999999999999999999 Q ss_pred cCCHHHHH---HHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccCCCCCCCcccccC Q lcl|Aclame:pro 151 TQSATKRQ---GAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGVAFPGETPIIRS 207 (207) Q Consensus 151 t~~~~~~~---~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~~~~~~~~~~~~~ 207 (207) |+|.++.+ .+.++|++.|++|+.+|++|+++++.++++|+++|++|++||++|=.-. T Consensus 160 t~~~~~~q~a~~~~~~y~~~l~~A~~~da~e~~~~~~~~~~~l~aR~~~~~~~~~~~~~~ 219 (223) T protein:vir:10 160 LKGDVGAAESKRCVGAMQAYLSQAMVSDANQRKTKPAHMPEWMRARGAGFVDGNIPGLPN 219 (223) T ss_pred cCCcchHHHHHHHHHHHHHHHHHHHhcccccCcccccccchhhhhcccCCCCCCCCCCCc Confidence 99986655 7789999999999999999999999999999999999999999997766 No 3 >protein:vir:107803 Length: 223 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1524 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996624;genbank:gi:45580758;genbank:GeneID:2767879 Probab=100.00 E-value=1.1e-76 Score=437.08 Aligned_cols=206 Identities=27% Similarity=0.376 Sum_probs=187.1 Q ss_pred CCCHHHHHHHHHHHhcchhcc---ccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCCCCCCccccccCc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRIT---SLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFSYQYRLP 77 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~---Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~p~~~~~yaY~lP 77 (207) |+|+|+|||+||++||+++|. |++|+|++|++|++||+++||++||+|||+||+||++|+++++ |++||.|+|+|| T Consensus 1 M~S~v~IcN~AL~~lG~~~i~~~~s~~E~s~~A~~C~~~Y~~~r~~~Lr~~pW~FA~~r~~La~~a~-p~~~~~yaY~LP 79 (223) T protein:vir:10 1 MASEVDICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELHTWGFATKCAQLAAMGI-SRPEWRFAYAQP 79 (223) T ss_pred CCCHHHHHHHHHHhcccchhhcccCCCCCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhhhhccc-CCCCcccccccc Confidence 999999999999999998765 6789999999999999999999999999999999999999885 667999999999 Q ss_pred ccceEeeecccCccccc---cccccceEEe----CCEEEecCCCceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhc Q lcl|Aclame:pro 78 TDFIRLLQVGQFDVYPR---TDTRGLFSIE----NGNILTDMQAPLYIRYAKRVTDPNAMDALFREAFACRLAAEACESL 150 (207) Q Consensus 78 ~Dclrv~~v~~~~~~~~---~~~~~~y~v~----g~~l~~~~~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pL 150 (207) .|||||++|++.+.... ......|+++ |+++|++++++++|+||.+|+|++.|||+|++||+|+||++||+|| T Consensus 80 ~Dclrv~~v~~~~~~~~~~~~~~~~~~~~e~~~~g~~~i~td~~~~~l~Y~~~v~d~~~fd~lF~~Ala~~LAa~lA~pL 159 (223) T protein:vir:10 80 ADAIKIVAVLPHDAANIEAGIDNAQPFSCEIDNTGADIILTNQVNAVARYISLVKDTTKFSPLFVQALAWHLASMLAGPL 159 (223) T ss_pred ccceeeeeeccccccccccccccccceEEeeccccceeeeecCCceEEEEeecCCChhcccHHHHHHHHHHHHHHhhHhh Confidence 99999999988765422 2334568886 6677777778999999999999999999999999999999999999 Q ss_pred cCCHHHHH---HHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccCCCCCCCcccccC Q lcl|Aclame:pro 151 TQSATKRQ---GAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGVAFPGETPIIRS 207 (207) Q Consensus 151 t~~~~~~~---~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~~~~~~~~~~~~~ 207 (207) |+|.++.+ .+.++|++.|++|+.+|++|+++++.++++|+++|++|++||++|=.-. T Consensus 160 t~~~~~~q~a~~~~~~y~~~l~~A~~~da~e~~~~~~~~~~~l~aR~~~~~~~~~~~~~~ 219 (223) T protein:vir:10 160 LKGDVGAAESKRCVGAMQAYLSQAMVSDANQRKTKPAHMPEWMRARGAGFVDGNIPGLPN 219 (223) T ss_pred cCCcchHHHHHHHHHHHHHHHHHHHhcccccCcccccccchhhhhcccCCCCCCCCCCCc Confidence 99986655 7789999999999999999999999999999999999999999997766 No 4 >protein:vir:98502 Length: 223 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1524 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996576;genbank:gi:45569507;genbank:GeneID:2767830 Probab=100.00 E-value=1.1e-76 Score=437.08 Aligned_cols=206 Identities=27% Similarity=0.376 Sum_probs=187.1 Q ss_pred CCCHHHHHHHHHHHhcchhcc---ccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCCCCCCccccccCc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRIT---SLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFSYQYRLP 77 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~---Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~p~~~~~yaY~lP 77 (207) |+|+|+|||+||++||+++|. |++|+|++|++|++||+++||++||+|||+||+||++|+++++ |++||.|+|+|| T Consensus 1 M~S~v~IcN~AL~~lG~~~i~~~~s~~E~s~~A~~C~~~Y~~~r~~~Lr~~pW~FA~~r~~La~~a~-p~~~~~yaY~LP 79 (223) T protein:vir:98 1 MASEVDICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELHTWGFATKCAQLAAMGI-SRPEWRFAYAQP 79 (223) T ss_pred CCCHHHHHHHHHHhcccchhhcccCCCCCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhhhhccc-CCCCcccccccc Confidence 999999999999999998765 6789999999999999999999999999999999999999885 667999999999 Q ss_pred ccceEeeecccCccccc---cccccceEEe----CCEEEecCCCceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhc Q lcl|Aclame:pro 78 TDFIRLLQVGQFDVYPR---TDTRGLFSIE----NGNILTDMQAPLYIRYAKRVTDPNAMDALFREAFACRLAAEACESL 150 (207) Q Consensus 78 ~Dclrv~~v~~~~~~~~---~~~~~~y~v~----g~~l~~~~~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pL 150 (207) .|||||++|++.+.... ......|+++ |+++|++++++++|+||.+|+|++.|||+|++||+|+||++||+|| T Consensus 80 ~Dclrv~~v~~~~~~~~~~~~~~~~~~~~e~~~~g~~~i~td~~~~~l~Y~~~v~d~~~fd~lF~~Ala~~LAa~lA~pL 159 (223) T protein:vir:98 80 ADAIKIVAVLPHDAANIEAGIDNAQPFSCEIDNTGADIILTNQVNAVARYISLVKDTTKFSPLFVQALAWHLASMLAGPL 159 (223) T ss_pred ccceeeeeeccccccccccccccccceEEeeccccceeeeecCCceEEEEeecCCChhcccHHHHHHHHHHHHHHhhHhh Confidence 99999999988765422 2334568886 6677777778999999999999999999999999999999999999 Q ss_pred cCCHHHHH---HHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccCCCCCCCcccccC Q lcl|Aclame:pro 151 TQSATKRQ---GAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGVAFPGETPIIRS 207 (207) Q Consensus 151 t~~~~~~~---~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~~~~~~~~~~~~~ 207 (207) |+|.++.+ .+.++|++.|++|+.+|++|+++++.++++|+++|++|++||++|=.-. T Consensus 160 t~~~~~~q~a~~~~~~y~~~l~~A~~~da~e~~~~~~~~~~~l~aR~~~~~~~~~~~~~~ 219 (223) T protein:vir:98 160 LKGDVGAAESKRCVGAMQAYLSQAMVSDANQRKTKPAHMPEWMRARGAGFVDGNIPGLPN 219 (223) T ss_pred cCCcchHHHHHHHHHHHHHHHHHHHhcccccCcccccccchhhhhcccCCCCCCCCCCCc Confidence 99986655 7789999999999999999999999999999999999999999997766 No 5 >protein:vir:7328 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848219;genbank:gi:30387390;genbank:GeneID:2641866 Probab=100.00 E-value=1e-74 Score=426.21 Aligned_cols=193 Identities=32% Similarity=0.426 Sum_probs=180.4 Q ss_pred CCCHHHHHHHHHHHhcc-hhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCCCCCCccccccCccc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGD-KRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFSYQYRLPTD 79 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~-~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~p~~~~~yaY~lP~D 79 (207) |+|+|+|||+||++||+ ++|+||+|+|++|++|++||+++||++||+|||+||+||++|+++++ |++||.|+|+||.| T Consensus 1 M~S~v~IcN~AL~~iG~a~~I~s~~e~s~~A~~c~~~Y~~~r~~~Lr~~pW~FA~~r~~La~~a~-~p~~~~yaY~LP~D 79 (201) T protein:vir:73 1 MASVIEICNRALSNIGNSRSINSLIEASKEAGQCSLHFDACRDAALADFDWNFATKRVALADTNN-PPPDWQYAYQYPSD 79 (201) T ss_pred CCCHHHHHHHHHHhhcCcccccccccCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhhhhccc-CCCCCccccccccc Confidence 99999999999999995 79999999999999999999999999999999999999999998886 55899999999999 Q ss_pred ceEeeecccCcccccccc-------ccceEEeCCEEEecCCCceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhccC Q lcl|Aclame:pro 80 FIRLLQVGQFDVYPRTDT-------RGLFSIENGNILTDMQAPLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLTQ 152 (207) Q Consensus 80 clrv~~v~~~~~~~~~~~-------~~~y~v~g~~l~~~~~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt~ 152 (207) ||||++|++++..+.... ...|+++|+.|+||. ++++|+||.+|+|++.|||+|++||+|+||++||+|||+ T Consensus 80 clrv~~v~~~~~~~~~~~~~~~~~~~~~~~ieg~~i~td~-~~~~l~Y~~~v~d~~~fd~lF~~ala~~LAa~lA~plt~ 158 (201) T protein:vir:73 80 CVRITEIMPTGIRNPTAAQRIEYVVGSNEDLTGKLIYTDQ-PKAWLKYMARVTDVNMYDAIFMEALSWRLAAAINMALTG 158 (201) T ss_pred ceeeeeeccccccccccccccchhccccccccCCEeeecC-CceeEEEeecCCCcccccHHHHHHHHHHHHHHhhHhhcC Confidence 999999988775433221 135889999999985 579999999999999999999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccC Q lcl|Aclame:pro 153 SATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNG 195 (207) Q Consensus 153 ~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~ 195 (207) |.++++.+.|+|++++++|+.+|++|++++..+.++|+++|+| T Consensus 159 ~~~~~~~~~q~~~~~~~~A~~~d~~e~~~~~~~~~~~l~aR~~ 201 (201) T protein:vir:73 159 SADLGNNALTMYNRVILSAGSHSQNESQEPQPPVDEFTAARLS 201 (201) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhhccccCCCCCCchHHHhhcC Confidence 9999999999999999999999999999999999999999999 No 6 >protein:vir:95323 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512268;genbank:gi:89152435;genbank:GeneID:3952992 Probab=100.00 E-value=2.2e-73 Score=418.88 Aligned_cols=193 Identities=33% Similarity=0.409 Sum_probs=177.7 Q ss_pred CCCHHHHHHHHHHHhc-chhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCCCCCCccccccCccc Q lcl|Aclame:pro 1 MASQVGICNRALTKIG-DKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFSYQYRLPTD 79 (207) Q Consensus 1 MaS~v~IcN~AL~~lG-~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~p~~~~~yaY~lP~D 79 (207) |+|+|+|||+||++|| +++|+||+|+|++|++|++||+++||++||+|||+||+||++|+++++ |++||.|+|+||.| T Consensus 1 M~S~v~IcN~AL~~iG~a~~I~s~~e~s~~A~~C~~~Y~~~r~~~L~~~pW~FA~~r~~La~~a~-~~~~~~yay~LP~D 79 (201) T protein:vir:95 1 MASVVEICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALADTSN-PPPDWEYAYQYPSD 79 (201) T ss_pred CCCHHHHHHHHHHHhCCcccccccccCCHHHHHHHHhhHHHHHHHHhhcCchhhhhhhhcccccC-CCCCCcccccccch Confidence 9999999999999999 579999999999999999999999999999999999999999998876 66899999999999 Q ss_pred ceEeeecccCcccccc-ccc------cceEEeCCEEEecCCCceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhccC Q lcl|Aclame:pro 80 FIRLLQVGQFDVYPRT-DTR------GLFSIENGNILTDMQAPLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLTQ 152 (207) Q Consensus 80 clrv~~v~~~~~~~~~-~~~------~~y~v~g~~l~~~~~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt~ 152 (207) ||||++|..+|..... ... .+|+++|+.|++|. ++++|+||++|+|++.|||+|++||+|+||++||+|||+ T Consensus 80 clrv~~v~~~g~~~~~~~~~~~f~v~~~~~~~g~~l~td~-~~~~l~Yv~~v~d~~~fd~~F~~ala~~LAa~la~plt~ 158 (201) T protein:vir:95 80 CLRITEIMLPGVRNPTAAMRVQYEVGADTNGTGKLIYTDQ-PQAWLKYVSRVTDVNMFDAIFMEALAWRLAAAINMALTG 158 (201) T ss_pred hhhhhhhccCCccccccccchhhhccccccccCceeeecC-CceEEEEeecCCChhhccHHHHHHHHHHHHHHhhHhhcC Confidence 9999999877653221 112 24566788888775 679999999999999999999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccC Q lcl|Aclame:pro 153 SATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNG 195 (207) Q Consensus 153 ~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~ 195 (207) |.++++.++|+|++.+++|+.+|++|++++.+++++|+++|.. T Consensus 159 ~~~~~~~~~q~~~~~l~~A~~~da~e~~~~~~~~~~~l~aRl~ 201 (201) T protein:vir:95 159 NADLGTFALNMYNRVILSAGSHSQNESQEPQPPVDEFTIARLS 201 (201) T ss_pred ChHHHHHHHHHHHHHHHHHHhcccccCcccCCCcchhhhhhcC Confidence 9999999999999999999999999999999999999999999 No 7 >protein:vir:100027 Length: 204 # NCBI annotation: T7-like tail tubular protein A # Family: family:all:824 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214207;genbank:gi:61806430;genbank:GeneID:3294707 Probab=99.72 E-value=3.8e-19 Score=121.56 Aligned_cols=190 Identities=15% Similarity=0.181 Sum_probs=140.6 Q ss_pred CC-------CHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCcee-eeEecccCCCCCCCCccc Q lcl|Aclame:pro 1 MA-------SQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTK-ARAQLAALAEAPLFGFSY 72 (207) Q Consensus 1 Ma-------S~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~-~r~~La~l~~~p~~~~~y 72 (207) |+ |+++-.|.=|..||+.||+|+|++.+.+..+...++.+++.+|+ +.|.|.+ ++..|.|++. .|+-+ T Consensus 1 ~~~~~~~~~teL~AVN~~L~aIGespV~sld~~npdva~a~~iL~~v~~~vqs-~GW~FNte~~~~ltPd~~---~g~I~ 76 (204) T protein:vir:10 1 MATTTIQPDTELSAVNSILGSIGQSPLTTLNYNNPETAFVYNLLVEANKDVQG-EGWHFNTEDHVLVTPDAT---TKYIN 76 (204) T ss_pred CceecccccchhHHHHHHHHhhCccccccccCCCccHHHHHHHHHHHHHHHhc-CCceeeccCCeeeeeeCC---CCeEE Confidence 55 58899999999999999999999999999999999999999997 9999999 5677766632 34433 Q ss_pred cccCcccceEeeecccCccccccccccceEEeCCEEEecCC------CceEEEEEeecCChhhccHHHHHHHHHHHHHHh Q lcl|Aclame:pro 73 QYRLPTDFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQ------APLYIRYAKRVTDPNAMDALFREAFACRLAAEA 146 (207) Q Consensus 73 aY~lP~Dclrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~------~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~l 146 (207) +|.|+|++...+... +...+|...||+|+-... .++.++-|.. -+.+.+|+.|+.+++.|.|.+. T Consensus 77 ---~P~n~L~v~~~~~~~-----d~~~~~v~Rgg~LYD~~~~t~~f~~~i~v~iv~~-~~FeelPe~~~~~I~~rAa~~f 147 (204) T protein:vir:10 77 ---VPSNYLRYDLHSGHV-----DKSMDLVKRNGRLYDKVGHTDQFDDDLYLDIVTL-YPFEDVPPIFQRYIISKAAVRA 147 (204) T ss_pred ---cCcceeeeeecCCcc-----cccceeEEeCCeEEecccCceeecCcceEEEEee-cChhhccHHHHHHHHHHHHHHH Confidence 899999998765432 112356666776653322 2466664554 3466799999999999999999 Q ss_pred hhhccCCHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccCCCCCCCccccc Q lcl|Aclame:pro 147 CESLTQSATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGVAFPGETPIIR 206 (207) Q Consensus 147 A~pLt~~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~~~~~~~~~~~~ 206 (207) +.++.++.++.+.+.++.+.+....+..+.+|+.-.-+...++-..|-. .|-++ ++| T Consensus 148 ~~~~~g~~~~~q~l~~~e~~ar~~~~~~e~~q~~~N~~~~~~~~~~~~~--~p~~~-l~r 204 (204) T protein:vir:10 148 ATQLVANRELVALLQVQEQSARANVLEYECNQGDHSFMGWPHESSYRPY--QPYKA-LQR 204 (204) T ss_pred HhhcCCchhHHHHHHHHHHHHHHHHHHHhHhhcCcccccCCCCCCccCc--Cchhh-hcC Confidence 9999999999999999999999999999988875333322222211111 11111 122 No 8 >protein:vir:103305 Length: 245 # NCBI annotation: tail-like protein # Family: family:all:824 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039669;genbank:gi:125999998;genbank:GeneID:4818381 Probab=99.69 E-value=1.3e-18 Score=118.73 Aligned_cols=194 Identities=19% Similarity=0.118 Sum_probs=152.1 Q ss_pred CC--------CHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHH--HHhcCCCCceeee-EecccCCCCCCCC Q lcl|Aclame:pro 1 MA--------SQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDA--CLRAHVWSFTKAR-AQLAALAEAPLFG 69 (207) Q Consensus 1 Ma--------S~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~--~L~~~~W~FA~~r-~~La~l~~~p~~~ 69 (207) |+ |+++-.|.=|..||+.||+|+|++.+.+..|++.++.+.+. .|..+.|.|.+-. ..|.|.+. T Consensus 13 ~~~~~~~~~dteLdAVN~~L~aIGEsPV~sld~~npdva~A~~IL~~v~~~vQ~llseGW~FNte~~~~ltPd~~----- 87 (245) T protein:vir:10 13 LASVNLDTVDTRLEAINLCLRAVGYASIESEDSGDLDAADASKILATVGQRVQYNGGKGWWFNVEPNWQMTPDAN----- 87 (245) T ss_pred CcccccccccchHHHHHHHHHhhcccccccccCCchhHHHHHHHHHHHHHHHHhhcCCCeeEeecCCceeccCCC----- Confidence 65 36788999999999999999999999999999999999988 6789999999976 46666532 Q ss_pred ccccccCcccceEeeecccCccccccccccceEEeCCEEEecCCC------c------eEEEEEeecCChhhccHHHHHH Q lcl|Aclame:pro 70 FSYQYRLPTDFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQA------P------LYIRYAKRVTDPNAMDALFREA 137 (207) Q Consensus 70 ~~yaY~lP~Dclrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~~------~------~~l~Y~~~v~d~~~~~~~F~~a 137 (207) +.+.+|.|+|++..-...+ ....+|...|++|+-.... | +.++.|.. -+-+.+|..|+.+ T Consensus 88 --g~i~iP~n~L~v~~~~~~~-----~~~~~~v~RGgkLYD~~n~T~~F~~pv~~~~~~~v~iV~~-~pFedlPe~~q~y 159 (245) T protein:vir:10 88 --GEILIPNNAIAAWQDVRYD-----DKKVLISIRGRKVYNMNTHSTDFSNSLNREGFFRMTFMLN-LPFEHMPVSARQA 159 (245) T ss_pred --CceecCccchhhhcccccC-----CCccceEEcCCeeEecccCceeccCccccccceeEEEEee-CChhhccHHHHHH Confidence 3588999999996532211 1123577777777643321 2 24666666 4467789999999 Q ss_pred HHHHHHHHhhhhccCCHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhcc-CCCCCCCcccccC Q lcl|Aclame:pro 138 FACRLAAEACESLTQSATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRN-GVAFPGETPIIRS 207 (207) Q Consensus 138 la~~LAa~lA~pLt~~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~-~~~~~~~~~~~~~ 207 (207) ++.|.|.+.+.+..++.++.+.+.|+.+.+....+..+.+++.-.-+...++...++ --|+|-.-|-... T Consensus 160 I~~rAA~~f~~~~~G~~~~~q~l~q~e~~a~~~~~~~~~~q~~~Nm~~~~p~~~~~r~~v~~~~~~~~~~~ 230 (245) T protein:vir:10 160 IAYQAAVEFMVSKEFDAQKVQIWQQLAQQMQIDMGQESANQQSLNMFVNNPTQAHFGSMVGGPNANATFSR 230 (245) T ss_pred HHHHHHHHHHhhccCchhHHHHHHHHHHHHHHHHHHHHHhhcCcccccCCchhhhcchhcccccccccccc Confidence 999999999999999999999999999999999999998888777776677777776 4455666665533 No 9 >protein:vir:99676 Length: 197 # NCBI annotation: Tail tubular protein A # Family: family:all:824 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249590;genbank:gi:68299741;genbank:GeneID:3799991 Probab=99.65 E-value=9.3e-18 Score=113.95 Aligned_cols=178 Identities=10% Similarity=0.120 Sum_probs=141.3 Q ss_pred CCCHHHHHHHHHHHhcchhcccccc-CCHHHHHHHHhhHHHHHHHHhcCCCCceeee-EecccCCCCCCCCccccccCcc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDE-DSKAAATLNSMYDDVLDACLRAHVWSFTKAR-AQLAALAEAPLFGFSYQYRLPT 78 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e-~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r-~~La~l~~~p~~~~~yaY~lP~ 78 (207) +.|+++-.|.=|..||+.||+|+|+ +.+.+..+...++.+++.+|+ +.|.|-+-+ ..|.|.+. . .+.+ +|. T Consensus 12 ~~teLdAVN~~L~aIGesPV~sld~~~npdva~a~~iL~~v~~~vqs-~GW~FNte~~~~ltPd~~---~--~~I~-~P~ 84 (197) T protein:vir:99 12 YQAELDAINDILASIGESPVNTLESDANADVVNARRILHKINRQEQS-KGWTFNIEEGATLVPDVY---S--QLIP-YMP 84 (197) T ss_pred ccchhHHHHHHHHhhcccccccccCCCCccHHHHHHHHHHHHHHHhc-CCceeeecCCeeeeecCC---C--CeEE-cCc Confidence 6689999999999999999999998 579999999999999999996 999999987 66766544 2 3344 799 Q ss_pred cceEeeecccCccccccccccceEEeCCEEEecCC------CceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhccC Q lcl|Aclame:pro 79 DFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQ------APLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLTQ 152 (207) Q Consensus 79 Dclrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~------~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt~ 152 (207) |+|++...+.. +|...||+|+-... .|+.++-|.. -+.+..|..|+.+++.|.|.+.+....+ T Consensus 85 n~L~v~~~~~~----------~~v~Rgg~LYD~~n~T~~F~~pi~v~iv~~-~~FeelPe~~~~yI~~rAa~~f~~~~~G 153 (197) T protein:vir:99 85 NYLSVTTTGGT----------PYVNRGGYVYDRINKTDRFTSPITVNLISL-RTFDEMPEQFKSYIVTKASKEFNIRFFG 153 (197) T ss_pred ceeeeecCcCc----------eeEEeCCeeEeccCCcEeeCCceEEEEEEe-cChhhccHHHHHHHHHHHHHHHHhhccC Confidence 99999754332 46677777654332 3567775555 4466789999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccCC Q lcl|Aclame:pro 153 SATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGV 196 (207) Q Consensus 153 ~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~~ 196 (207) +.++.+.+.++.+.+...++..+.+|+.-.-+-.++++..=-++ T Consensus 154 ~~~~~q~l~~~e~~a~~~~~e~e~~qg~~Nml~~~~~~~~~~~r 197 (197) T protein:vir:99 154 APEIDTVLGNELIDLERAVNEYELDYGAFNIFNSDPYVSGAISR 197 (197) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHhhCCcceeeCChhhhccccC Confidence 99999999999999999999999999864444444444322222 No 10 >protein:vir:7020 Length: 246 # NCBI annotation: tail protein # Family: family:all:824 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853593;genbank:gi:31711675;genbank:GeneID:1481801 Probab=99.62 E-value=2.7e-17 Score=111.40 Aligned_cols=194 Identities=14% Similarity=0.152 Sum_probs=151.9 Q ss_pred CCCHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHH--HhcCCCCceeeeE-ecccCCCCCCCCccccccCc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDAC--LRAHVWSFTKARA-QLAALAEAPLFGFSYQYRLP 77 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~--L~~~~W~FA~~r~-~La~l~~~p~~~~~yaY~lP 77 (207) -.|+++-.|.=|..||+.||+|+|++.+++..++..++.+.+.+ |..+.|.|.+-+. .|.|++ ...+.+| T Consensus 22 ~~TeLdAVN~~L~aIGEsPV~sld~~n~d~~~a~~iL~~v~~~vq~~lseGW~FNte~~~~ltPD~-------~g~I~iP 94 (246) T protein:vir:70 22 IDSKLEAVNLCMRAIGREGVDSLDSGDLDAEDASKMLDIVSQRFQYNKGGGWWFNREPNWRIVPDT-------NGEVNLP 94 (246) T ss_pred chhhHHHHHHHHHhhCccccccccCCCccHHHHHHHHHHHHHHHHHhccCCeeEeecCceeeccCC-------CCeEecC Confidence 24688899999999999999999999999999999999998886 5689999998865 576542 2358899 Q ss_pred ccceEeeecccCccccccccccceEEeCCEEEecC------------CCceEEEEEeecCChhhccHHHHHHHHHHHHHH Q lcl|Aclame:pro 78 TDFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDM------------QAPLYIRYAKRVTDPNAMDALFREAFACRLAAE 145 (207) Q Consensus 78 ~Dclrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~------------~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~ 145 (207) .|+|+|...+... ....+|...||+|+-.. +.|+.++.|... +-+..|..|+.+++.|.|.+ T Consensus 95 ~n~L~v~~~~~~~-----~~~~~vv~RGgkLYD~~n~T~~F~~~~~~D~pv~v~IV~~~-~FedLPe~~q~yI~~rAA~~ 168 (246) T protein:vir:70 95 NNCLAVLQCYALG-----ERKVPMTMRAGKLYSTWNHTFDMRSHVNKDGAIRLTLLTYL-PFEHLPTSVMQAIAYQAAVE 168 (246) T ss_pred ccceeeeeccCcc-----cCceeeEEcCCeeEeecccceecccccccCcceEEEEEecC-ChhhhhHHHHHHHHHHHHHH Confidence 9999998775421 12346778888885431 357888888774 47778999999999999999 Q ss_pred hhhhccCCHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhcc-CCCCCCCcccccC Q lcl|Aclame:pro 146 ACESLTQSATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRN-GVAFPGETPIIRS 207 (207) Q Consensus 146 lA~pLt~~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~-~~~~~~~~~~~~~ 207 (207) .+..+.++..+.+.+.++.+.+....+..+.+|+.-.-+...++..-++ --|+|-.-|-... T Consensus 169 f~~~~~gd~~~~~~~~~~e~~a~~~~~~~e~~q~~~Nml~~~~~~r~~r~m~~~~~~~~~~~~ 231 (246) T protein:vir:70 169 FIVSKDADKTKLTTHQQIAAQLFVDVQSEQMSQKRLNMLVHNPTQRQFGIMAGGSQNVPAYSH 231 (246) T ss_pred HHhhccCchHHHHHHHHHHHHHHHHHHHHHHhhCCcceeeCchhhhhhhhhhccccccccccc Confidence 9999999999999999999999999999998888644444455555544 3344555555433 No 11 >protein:vir:80215 Length: 211 # NCBI annotation: putative tail tubular protein A # Family: family:all:824 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522885;genbank:gi:158345178;genbank:GeneID:5687478 Probab=99.62 E-value=3.6e-17 Score=110.72 Aligned_cols=186 Identities=18% Similarity=0.168 Sum_probs=140.7 Q ss_pred CC-CHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeE-ecccCCCCCCCCccccccCcc Q lcl|Aclame:pro 1 MA-SQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARA-QLAALAEAPLFGFSYQYRLPT 78 (207) Q Consensus 1 Ma-S~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~-~La~l~~~p~~~~~yaY~lP~ 78 (207) |. |+++-.|.=|..||+.||+|+|++.+.+..+...++.+++.+|+ +.|.|-+-.. .|.|.++ | +.| +|. T Consensus 1 ~~~teLdAVN~~L~aIGEsPV~sld~~npdva~a~~iL~~v~r~vqs-eGW~FNte~~~~ltPd~~----g--~I~-iP~ 72 (211) T protein:vir:80 1 MQLTFLEAVNLVLRELGETPVTSVDETYPTLAQILPAMEDARRNTLA-EGWWFNSFDDFTASPSPA----G--EVL-LSE 72 (211) T ss_pred CcchHHHHHHHHHHhhCccccccccCCchhHHHHHHHHHHHHHHHcc-CCeeEeecCCceeccCCC----C--eEe-cCc Confidence 87 89999999999999999999999999999999999999999999 9999998765 7777642 3 344 999 Q ss_pred cceEeeecccCccccccccccceEEeCCEEEecCC------CceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhccC Q lcl|Aclame:pro 79 DFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQ------APLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLTQ 152 (207) Q Consensus 79 Dclrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~------~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt~ 152 (207) |+|++...+.. .|...|++|+-... .|++++-|.. -+.+..|..|+.+++.+.|.+.+....+ T Consensus 73 n~L~v~~~~~~----------~~~~Rgg~LYD~~n~T~~F~~pi~v~iv~~-~~FeeLPe~~~~yI~~rAa~~f~~~~~G 141 (211) T protein:vir:80 73 DTLAFYPDDVE----------KFTWAGRYVRVTGTGSKVVGAPVKGRVVLD-IPYDELPEGMRYLVVYRCAYEVYVADFG 141 (211) T ss_pred cceEEeeCCCe----------eeeeeCceEEeccCCcEeeCCceEEEEEee-cChhhccHHHHHHHHHHHHHHHHhhcCC Confidence 99999765422 35666776653322 3567775555 3466789999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhh--hccC-------CCCCCCcccccC Q lcl|Aclame:pro 153 SATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLE--SRNG-------VAFPGETPIIRS 207 (207) Q Consensus 153 ~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~--aR~~-------~~~~~~~~~~~~ 207 (207) +.++.+.+.++.+.+....+..+..++.-. +....... -|+| |-+| -.|+-|. T Consensus 142 ~d~~~q~l~~ee~~a~~~l~~~e~~q~~~N-m~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 203 (211) T protein:vir:80 142 ADSTAQVIANKMSAAYVEVRAVHIRQRKLT-LRKRTPATSGVKRGTTNELLCRIVP-AAPVWRK 203 (211) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHhhcCcc-ccccCcccccccccccchhhhcccc-Ccccccc Confidence 999999999999999888888887776422 22211111 1112 1122 2345554 No 12 >protein:vir:94712 Length: 188 # NCBI annotation: tail tube # Family: family:all:824 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338121;genbank:gi:77118199;genbank:GeneID:3707735 Probab=99.62 E-value=3.5e-17 Score=110.83 Aligned_cols=173 Identities=15% Similarity=0.226 Sum_probs=136.6 Q ss_pred CC--------CHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeee-EecccCCCCCCCCcc Q lcl|Aclame:pro 1 MA--------SQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKAR-AQLAALAEAPLFGFS 71 (207) Q Consensus 1 Ma--------S~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r-~~La~l~~~p~~~~~ 71 (207) |+ |+++-.|.=|..||+.||+|++++.+.+..+...++.+++.+|+ +.|.|-+-+ ..|.|.+.. . T Consensus 1 ~~~~~~~~~~teL~AVN~~L~aIGespV~sld~~npdva~a~~iL~~v~~~vqs-~GW~FNte~~~~ltPd~~~-----g 74 (188) T protein:vir:94 1 MAQYIPLNANDDLDAINDMLAAIGEPAVLQLDEGNADVSNAQRILHRVNRQVQA-KGWNFNINEAAVLTPDVQD-----N 74 (188) T ss_pred CCccccccccchhHHHHHHHHhhCccccccccCCCccHHHHHHHHHHHHHHHhc-CCceeeecCCeeeeeeCCC-----C Confidence 55 48889999999999999999999999999999999999999996 999999876 567766432 3 Q ss_pred ccccCcccceEeeecccCccccccccccceEEeCCEEEecCC------CceEEEEEeecCChhhccHHHHHHHHHHHHHH Q lcl|Aclame:pro 72 YQYRLPTDFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQ------APLYIRYAKRVTDPNAMDALFREAFACRLAAE 145 (207) Q Consensus 72 yaY~lP~Dclrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~------~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~ 145 (207) +.| +|.|+|+|...+.. ..|...||+|+-... .|++++-|.. -+.+..|..|+.+++.|.|.+ T Consensus 75 ~I~-~P~n~L~v~~~~~~---------~~~v~Rgg~LYD~~n~T~~F~~pi~v~iv~~-~~FeelPe~~~~~I~~rAa~~ 143 (188) T protein:vir:94 75 RIR-FLPSYLRVMTAGAT---------SYYSNMGGYLYDLSTQSTTFTDPITVELVEM-KPFSEMPVVFRDYIVTKASRE 143 (188) T ss_pred eEe-cCcceeeeecCCCc---------eeEeecCCeeEeccCCcEeeCCceeEEEEee-cChhhccHHHHHHHHHHHHHH Confidence 344 89999999854321 346666777653322 3566675555 446678999999999999999 Q ss_pred hhhhccCCHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhcc Q lcl|Aclame:pro 146 ACESLTQSATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRN 194 (207) Q Consensus 146 lA~pLt~~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~ 194 (207) .+....++.++.+.+.++.+.+...++..+.+|+.-.-+.+ + .|+ T Consensus 144 f~~~~~G~~~~~q~l~~~e~~a~~~~~e~e~~q~~~Nml~~-~---~~~ 188 (188) T protein:vir:94 144 FNAKFFGSPESELYLREQEAELYQQVMEYEMDTGRYNMMSD-I---GRD 188 (188) T ss_pred HHhhccCchhHHHHHHHHHHHHHHHHHHHHHhhCCcccccc-c---cCC Confidence 99999999999999999999999999999999875322211 1 111 No 13 >protein:vir:97033 Length: 245 # NCBI annotation: 32 # Family: family:all:824 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654133;genbank:gi:108862017;genbank:GeneID:5075982 Probab=99.60 E-value=6.7e-17 Score=109.24 Aligned_cols=194 Identities=16% Similarity=0.162 Sum_probs=149.9 Q ss_pred CCCHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHH--HhcCCCCceeeeE-ecccCCCCCCCCccccccCc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDAC--LRAHVWSFTKARA-QLAALAEAPLFGFSYQYRLP 77 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~--L~~~~W~FA~~r~-~La~l~~~p~~~~~yaY~lP 77 (207) -.|+++-.|.=|..||+.||+|+|+++.++..+...++.+.+.. +..+.|.|.+-.. .|.|++ ...+.+| T Consensus 22 ~~TeLdAVN~~L~aIGEsPV~sld~~~~~~~~va~al~~l~~~~r~vqseGW~FNte~~~~ltPD~-------~g~I~iP 94 (245) T protein:vir:97 22 IDSKLEAVNLCMRAIGREGVDSLDSGDLDAEDASKMIDIVSQRFQYNKGGGWWFNREPNWQLAPDT-------NGEVNLP 94 (245) T ss_pred hhhhHHHHHHHHHhhCccccceecCCCcchHHHHHHHHHHHHHHHHHccCCeeEeecCCeeeccCC-------CCeEecC Confidence 23678889999999999999999999999999999999988764 7789999998865 676643 2358899 Q ss_pred ccceEeeecccCccccccccccceEEeCCEEEecC------------CCceEEEEEeecCChhhccHHHHHHHHHHHHHH Q lcl|Aclame:pro 78 TDFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDM------------QAPLYIRYAKRVTDPNAMDALFREAFACRLAAE 145 (207) Q Consensus 78 ~Dclrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~------------~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~ 145 (207) .|||+|...+..+ ....+|...|++|+-+. +..+++..|... +-+..|..|+.+++.|.|.+ T Consensus 95 ~n~L~v~~~~~~~-----~~~~~~v~RggrLYD~~nhT~~F~~pi~~~~~~~v~Iv~~~-pFEdLPe~~q~yI~~rAA~~ 168 (245) T protein:vir:97 95 NNCLAVLQCYALG-----EKKVPMTMRAGKLYSTWSHTFDMRKHVNANGMIRLTLLTLL-PYEHLPTSVMQAIAYQAAVE 168 (245) T ss_pred ccceeeeccCccc-----cccceeEeccceEEeccccceecccccccCcceEEEEEeeC-ChhhhhHHHHHHHHHHHHHH Confidence 9999998765432 12245777788886443 123567777775 57788999999999999999 Q ss_pred hhhhccCCHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhcc-CCCCCCCcccccC Q lcl|Aclame:pro 146 ACESLTQSATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRN-GVAFPGETPIIRS 207 (207) Q Consensus 146 lA~pLt~~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~-~~~~~~~~~~~~~ 207 (207) .+....++.++.+.+.++.+.+....+..+.+++.-.-+...++..-++ --|+|-.-|-... T Consensus 169 f~~~~~G~~~~~q~l~qee~~a~~~l~e~e~~q~~~Nml~~~~~~r~~r~m~~~~~~~~~~~~ 231 (245) T protein:vir:97 169 FIVSKDADQTKLATAQQIATQLLMDVQSEQMSQKRLNMLVHNPTQRQFGIMAGGSQNVPAYSH 231 (245) T ss_pred HHhhccCchhHHHHHHHHHHHHHHHHHHHHHhhCCcceeeCchhhhhhhhhhccccccccccc Confidence 9999999999999999999999999999999888644444445555444 3345555555433 No 14 >protein:vir:105646 Length: 245 # NCBI annotation: putative tail tubular A protein # Family: family:all:824 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425010;genbank:gi:83571758;uniprot:Q2WC42;genbank:GeneID:3837287 Probab=99.60 E-value=6.9e-17 Score=109.18 Aligned_cols=194 Identities=16% Similarity=0.160 Sum_probs=149.8 Q ss_pred CCCHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHH--HhcCCCCceeeeE-ecccCCCCCCCCccccccCc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDAC--LRAHVWSFTKARA-QLAALAEAPLFGFSYQYRLP 77 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~--L~~~~W~FA~~r~-~La~l~~~p~~~~~yaY~lP 77 (207) -.|+++-.|.=|..||+.||+|+|+++.++..+...++.+.+.. +..+.|.|.+-.. .|.|++ ...+.+| T Consensus 22 ~~TeLdAVN~~L~aIGEsPV~sld~~~~~~~~va~al~~l~~~~r~vqseGW~FNte~~~~ltPD~-------~g~I~iP 94 (245) T protein:vir:10 22 IDSKLEAVNLCMRAIGREGVDSLDSGDLDAEDASKMIDIVSQRFQYNKGGGWWFNREPNWQIAPDT-------NGEVNLP 94 (245) T ss_pred hhhhHHHHHHHHHhhCccccceecCCCcchHHHHHHHHHHHHHHHHHccCCeeEeecCCeeeccCC-------CCeEecC Confidence 23678889999999999999999999999999999999988764 7789999998865 676642 2358899 Q ss_pred ccceEeeecccCccccccccccceEEeCCEEEecC------------CCceEEEEEeecCChhhccHHHHHHHHHHHHHH Q lcl|Aclame:pro 78 TDFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDM------------QAPLYIRYAKRVTDPNAMDALFREAFACRLAAE 145 (207) Q Consensus 78 ~Dclrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~------------~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~ 145 (207) .|||+|...+..+ ....+|...|++|+-+. +..+++..|... +-+..|..|+.+++.|.|.+ T Consensus 95 ~n~L~v~~~~~~~-----~~~~~~v~RggrLYD~~nhT~~F~~pi~~~~~~~v~Iv~~~-pFEdLPe~~q~yI~~rAA~~ 168 (245) T protein:vir:10 95 NNCLAVLQCYALG-----EKKVPMTMRAGKLYSTWSHTFDMRKHVNANGMIRLTLLTLL-PYEHLPTSVMQAIAYQAAVE 168 (245) T ss_pred ccceeeeccCccc-----cccceeEeccceEEeccccceecccccccCcceEEEEEeeC-ChhhhhHHHHHHHHHHHHHH Confidence 9999998765432 12245777788886443 123567777775 57788999999999999999 Q ss_pred hhhhccCCHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhcc-CCCCCCCcccccC Q lcl|Aclame:pro 146 ACESLTQSATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRN-GVAFPGETPIIRS 207 (207) Q Consensus 146 lA~pLt~~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~-~~~~~~~~~~~~~ 207 (207) .+....++.++.+.+.++.+.+....+..+.+++.-.-+...++..-++ --|+|-.-|-... T Consensus 169 f~~~~~G~~~~~q~l~qee~~a~~~l~e~e~~q~~~Nml~~~~~~r~~r~m~~~~~~~~~~~~ 231 (245) T protein:vir:10 169 FIVSKDADQTKLATAQQIATQLLMDVQSEQMSQKRLNMLVHNPTQRQFGIMAGGSQNVPAYSH 231 (245) T ss_pred HHhhccCchhHHHHHHHHHHHHHHHHHHHHHhhCCcceeeCchhhhhhhhhhccccccccccc Confidence 9999999999999999999999999999999888644444445555444 3345555555433 No 15 >protein:vir:6325 Length: 184 # NCBI annotation: tail tubular protein A # Family: family:all:824 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877472;genbank:gi:33300844;uniprot:Q7Y2D2;genbank:GeneID:1482614 Probab=99.60 E-value=4.8e-17 Score=110.06 Aligned_cols=177 Identities=15% Similarity=0.111 Sum_probs=138.5 Q ss_pred CCHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeE-ecccCCCCCCCCccccccCcccc Q lcl|Aclame:pro 2 ASQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARA-QLAALAEAPLFGFSYQYRLPTDF 80 (207) Q Consensus 2 aS~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~-~La~l~~~p~~~~~yaY~lP~Dc 80 (207) +|+++-.|.=|..||+.||+|+|++.+.+..+...++.+++.+|+ +.|.|-+-.. .|.|.++ | +.| +|.|+ T Consensus 1 ~teL~AVN~~L~aIGespV~sld~~npdva~a~~iL~~v~~~vqs-~GW~FNte~~~~ltPd~~----g--~I~-~P~n~ 72 (184) T protein:vir:63 1 MLLLDAVNVILRKIGELPTLSMDETYPTMAIALPELEDQRIQLLT-QGWWFNTWWRHKLTPDPT----G--RIN-LPKGT 72 (184) T ss_pred CchHHHHHHHHHhhCccccceecCCCccHHHHHHHHHHHHHHHhc-CCceEeecCCceeeecCC----C--eEE-cCcce Confidence 999999999999999999999999999999999999999999996 9999999864 7877742 3 344 99999 Q ss_pred eEeeecccCccccccccccceEEeCCEEEecCC------CceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhccCCH Q lcl|Aclame:pro 81 IRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQ------APLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLTQSA 154 (207) Q Consensus 81 lrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~------~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt~~~ 154 (207) |++...+. ++...||+|+-... .|+.+.-|.. -+.+..|..|+.+++.+.|.+.+....|+. T Consensus 73 L~v~~~~~-----------d~~~Rgg~LyD~~n~t~~F~~~i~v~iv~~-~~FeelPe~~~~~I~~rAa~~f~~~~~G~~ 140 (184) T protein:vir:63 73 LAFYPDSP-----------DLQWDGLGVRDANTGDDRIGKPVEGRLVLS-REWDHIPEIAQRVIAHQAALAVYTHEIGPD 140 (184) T ss_pred eeeecCCC-----------ceEEcCCEEEeccCCcEEeCCceEEEEEee-cChhhccHHHHHHHHHHHHHHHHhhccCch Confidence 99975432 34445665543221 3567775555 346678999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccCCCC Q lcl|Aclame:pro 155 TKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGVAF 198 (207) Q Consensus 155 ~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~~~~ 198 (207) ++.+.+.++.+.+...++..+.+|+.-.-....++..-|+.=-- T Consensus 141 ~~~q~l~~~e~~a~~~~~~~e~~q~~~Nm~~~~~~~~~~~~l~~ 184 (184) T protein:vir:63 141 ETAQVIAQELQAYQNELSRMHTRSRPLNTQAKRSFSRWRRSLRT 184 (184) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcCcchhhhhhhHHHHHhhcC Confidence 99999999999999999999998875433322223222221100 No 16 >protein:vir:3365 Length: 196 # NCBI annotation: tail tubular protein A # Family: family:all:824 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523336;swissprot:trembl:q8w5u4;genbank:gi:17570827;uniprot:Q8W5U4;genbank:GeneID:927450 Probab=99.31 E-value=2.9e-13 Score=89.35 Aligned_cols=179 Identities=13% Similarity=0.192 Sum_probs=140.7 Q ss_pred CCCHHHHHHHHHHHhcchhccccccC-CHHHHHHHHhhHHHHHHHHhcCCCCceeeeE-ecccCCCCCCCCccccccCcc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDED-SKAAATLNSMYDDVLDACLRAHVWSFTKARA-QLAALAEAPLFGFSYQYRLPT 78 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e~-s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~-~La~l~~~p~~~~~yaY~lP~ 78 (207) +.|+++-.|.=|..||+.||+|+|+. .+.+..+....+.+++.+| ..-|.|=+-+. .|.|++. .| --.+|. T Consensus 10 ~~teLdAVN~~L~aIGEspV~sld~~~npdva~a~~iL~~v~~~vq-seGW~FNte~~~~ltPD~~---~g---~I~vP~ 82 (196) T protein:vir:33 10 TAEELSAVNDILASIGEPPVSTLEGDANADVANARRVLNKINRQIQ-SRGWTFNIEEGVTLLPDAF---SG---MIPFSS 82 (196) T ss_pred hhhhhHHHHHHHHhcCccccccccCCCCccHHHHHHHHHHHHHHHh-hCCceEeecCceeEeeeCC---CC---eEecCc Confidence 67899999999999999999999986 5999999999999999999 67899998866 7887743 12 234799 Q ss_pred cceEeeecccCccccccccccceEEeCCEEEecCC------CceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhccC Q lcl|Aclame:pro 79 DFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQ------APLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLTQ 152 (207) Q Consensus 79 Dclrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~------~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt~ 152 (207) |+|++..... ..+|...||+|+-... .|+.+.-|.. -+-+..|..|+..++.|-|.+.....-+ T Consensus 83 n~L~v~~~~~---------~~~~v~Rgg~LYD~~n~T~~F~~pi~v~iv~~-~~FedlPe~~~~yI~~rAa~~f~~~~~G 152 (196) T protein:vir:33 83 DYLSVMATSG---------QTQYVNRGGYLYDRSAKTDRFPSGVQVNLIRL-REFDEMPECFRNYIVTKASRQFNNRFFG 152 (196) T ss_pred ceeEEecCCC---------ceeEEEcCCeEEeccCCcEEeCCceEEEEEee-cChhhhhHHHHHHHHHHHHHHHHHhhcC Confidence 9999976432 1457777888764332 3577775555 4577789999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccCC Q lcl|Aclame:pro 153 SATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGV 196 (207) Q Consensus 153 ~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~~ 196 (207) +.++.+.+.++.+.+....+..+.+++.-.-+-..+++..=-++ T Consensus 153 ~~~~~q~l~~ee~~a~~~~~~~e~~q~~~Nml~~~~~~~~~~~r 196 (196) T protein:vir:33 153 APEVDGVLQEEEQEAWSACFEYELDYGNYNMLDGDAFTSGLLNR 196 (196) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHhhCCcceeecCchhhccccC Confidence 99999999999999999999999988754333333433322222 No 17 >protein:vir:1542 Length: 196 # NCBI annotation: tail tubular protein A # Family: family:all:824 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052110;swissprot:trembl:q9t106;genbank:gi:9634036;uniprot:Q9T106;genbank:GeneID:1262371 Probab=99.30 E-value=3.2e-13 Score=89.05 Aligned_cols=179 Identities=12% Similarity=0.177 Sum_probs=140.2 Q ss_pred CCCHHHHHHHHHHHhcchhcccccc-CCHHHHHHHHhhHHHHHHHHhcCCCCceeee-EecccCCCCCCCCccccccCcc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDE-DSKAAATLNSMYDDVLDACLRAHVWSFTKAR-AQLAALAEAPLFGFSYQYRLPT 78 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e-~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r-~~La~l~~~p~~~~~yaY~lP~ 78 (207) +.|+++-.|.=|..||+.||+|+|. +.+.+..+....+.+++.+| ..-|.|=+-+ ..|.|++. .| --.+|. T Consensus 10 ~~teLdAVN~~L~aIGEspV~sld~~~npdva~a~~iL~~v~~~vq-seGW~FNte~~~~ltPD~~---~g---~I~vP~ 82 (196) T protein:vir:15 10 TAEELSAVNDILASIGEPPVSTLEGDANADVANARRVLNKINRQIQ-SRGWTFNIEEGVTLLPDAF---SG---MIPFSS 82 (196) T ss_pred hhhhhHHHHHHHHhcCccccccccCCCCccHHHHHHHHHHHHHHHh-hCCceEeecCCceeeecCC---CC---eEecCc Confidence 6789999999999999999999994 67999999999999999999 6789999887 57877743 12 244799 Q ss_pred cceEeeecccCccccccccccceEEeCCEEEecCC------CceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhccC Q lcl|Aclame:pro 79 DFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQ------APLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLTQ 152 (207) Q Consensus 79 Dclrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~------~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt~ 152 (207) |+|++..... ..+|...||+|+-... .|+++.-|.. -+-+..|..|+..++.+-|.+.....-+ T Consensus 83 n~L~v~~~~~---------~~~~v~Rgg~LYD~~n~T~~F~~pi~v~iv~~-~~FedlPe~~~~yI~~rAa~~f~~~~~G 152 (196) T protein:vir:15 83 DYLSVMATSG---------QTQYINRGGYLYDRSAKTDRFPSGVQVNLIRL-REFDEMPECFRNYIVTKASRQFNNRFFG 152 (196) T ss_pred ceeEEecCCC---------ceeEEEcCCeEEeccCCcEEeCCceEEEEEee-cChhhhhHHHHHHHHHHHHHHHHHhccC Confidence 9999976432 2457778888764332 3577775555 4577789999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccCC Q lcl|Aclame:pro 153 SATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGV 196 (207) Q Consensus 153 ~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~~ 196 (207) +.++.+.+.++.+.+....+..+.+++.-.-+-..+++..=-++ T Consensus 153 ~~~~~q~l~~~e~~a~~~l~~~e~~q~~~Nml~~~~~~~~~~~r 196 (196) T protein:vir:15 153 APEVDGVLQEEEQEAWRACFEYELDYGNYNMLDGDAFTSGLLNR 196 (196) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHhhCCcceeecCchhhccccC Confidence 99999999999999999999999888754333333333321122 No 18 >protein:vir:94565 Length: 196 # NCBI annotation: Tubular tail protein A # Family: family:all:824 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919013;genbank:gi:119637777;genbank:GeneID:5179325 Probab=99.28 E-value=6e-13 Score=87.61 Aligned_cols=179 Identities=13% Similarity=0.196 Sum_probs=139.5 Q ss_pred CCCHHHHHHHHHHHhcchhccccc-cCCHHHHHHHHhhHHHHHHHHhcCCCCceeee-EecccCCCCCCCCccccccCcc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLD-EDSKAAATLNSMYDDVLDACLRAHVWSFTKAR-AQLAALAEAPLFGFSYQYRLPT 78 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~-e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r-~~La~l~~~p~~~~~yaY~lP~ 78 (207) +.|+++-.|.=|..||+.||+|+| ++.+.+..+....+.+++.+| ..-|.|=+-+ ..|.|++.. |+ =.+|. T Consensus 10 ~~teLdAVN~~L~aIGEspV~sld~~~npdva~a~~iL~~v~r~vq-seGW~FNte~~~~ltPD~~~---g~---I~vP~ 82 (196) T protein:vir:94 10 TGEELAAVNDILASIGEPPVSTLEGDTNADVDNARRVLNKINRQIQ-SKGWTFNIEGGQQLLPDVFN---GL---IPYMS 82 (196) T ss_pred hhhhhHHHHHHHHhccccccccccCCCCccHHHHHHHHHHHHHHHh-hCCceeeecCCeeeeeeCCC---Ce---EecCc Confidence 678999999999999999999998 579999999999999999999 6789999887 678776432 22 23799 Q ss_pred cceEeeecccCccccccccccceEEeCCEEEecC------CCceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhccC Q lcl|Aclame:pro 79 DFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDM------QAPLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLTQ 152 (207) Q Consensus 79 Dclrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~------~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt~ 152 (207) |+|++...+. ...|...||+|+-.. ..|+++.-|.. -+-+..|..|+..++.+-|.+.....-+ T Consensus 83 n~L~v~~~~~---------~~~~v~Rgg~lYD~~n~T~~F~~pi~~~iv~~-~~FedlPe~~~~yI~~rAa~~f~~~~~G 152 (196) T protein:vir:94 83 DYLSVLSEGG---------ATAYVNRGGYVFDRTTGTDIFEGPVTVTIIKL-REFYEMPECFRSWIVTKAARQFNNRFFG 152 (196) T ss_pred ceeEEeeCCC---------ceeeEEcCceEEeccCCceEeCCceEEEEEee-cChhhhhHHHHHHHHHHHHHHHHHhhcC Confidence 9999986542 235777787776432 24577776666 4577789999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccCCCCCCC Q lcl|Aclame:pro 153 SATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGVAFPGE 201 (207) Q Consensus 153 ~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~~~~~~~ 201 (207) +.++.+.+.++.+.+....+..+.+|+.-.-+-..++.. +.=+| T Consensus 153 ~~~~~q~l~~~e~~a~~~~~e~e~~q~~~Nm~~~~~~~~-----~~~~r 196 (196) T protein:vir:94 153 APEIDAVLAEEEQEAKMQCHEYELDFGNFNMLDGDAFTG-----GLLSR 196 (196) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCcccc-----hhccC Confidence 999999999999999999999998887533222222221 11112 No 19 >protein:vir:8886 Length: 195 # NCBI annotation: tail tubular protein A # Family: family:all:824 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813775;genbank:gi:29366730;genbank:GeneID:1258838 Probab=99.24 E-value=1.4e-12 Score=85.51 Aligned_cols=175 Identities=14% Similarity=0.194 Sum_probs=137.1 Q ss_pred CCCHHHHHHHHHHHhcchhcccccc-CCHHHHHHHHhhHHHHHHHHhcCCCCceeee-EecccCCCCCCCCccccccCcc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDE-DSKAAATLNSMYDDVLDACLRAHVWSFTKAR-AQLAALAEAPLFGFSYQYRLPT 78 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e-~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r-~~La~l~~~p~~~~~yaY~lP~ 78 (207) -.|+++-.|.=|..||+.||+|+|+ +.+.+..+....+.+++.+| ..-|.|=+-+ ..|.|++. .|+ =.+|. T Consensus 10 ~~teLdAVN~~L~aIGEspV~sld~~~npdva~a~~iL~~v~~~vq-seGW~FNte~~~~ltpD~~---~g~---I~~P~ 82 (195) T protein:vir:88 10 TDDELAAINDMLAAIGESPVSSLEGDPNADVANARRILNQVNREVQ-SRGWTFNIEEGAVLSPDSF---SGL---IEYLS 82 (195) T ss_pred ccchhHHHHHHHHhccccccccccCCCCccHHHHHHHHHHHHHHHh-hCCceEeecCCeeeeeeCC---CCe---EecCc Confidence 3479999999999999999999998 47999999999999999999 6789999887 78877643 132 23799 Q ss_pred cceEeeecccCccccccccccceEEeCCEEEecC------CCceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhccC Q lcl|Aclame:pro 79 DFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDM------QAPLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLTQ 152 (207) Q Consensus 79 Dclrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~------~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt~ 152 (207) |+|++...+.. .|...||+|+-.. ..|+++.-|.. -+-+..|..|+..++.+-|.+.....-+ T Consensus 83 n~L~v~~~~~~----------~~v~Rgg~lYD~~n~T~~F~~pi~~~iv~~-~~FedlPe~~~~yI~~rAa~~f~~~~~G 151 (195) T protein:vir:88 83 DYLRITTSGGT----------VYVNRGGYVYDRSTKTDVYTNDITVDLIRF-KTFSEMPECFRSYIVAKASRRFNIRFFG 151 (195) T ss_pred ceeEEeecCCe----------eEEEeCCEEEeccCCceEeCCceEEEEEee-cChhhhhHHHHHHHHHHHHHHHHHhcCC Confidence 99998866432 4777788876222 24677776666 4577789999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCch---hhhhc Q lcl|Aclame:pro 153 SATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDT---WLESR 193 (207) Q Consensus 153 ~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~---~~~aR 193 (207) +.++.+.+.++.+.+....+..+.+|+.-.-+-..+ -+..| T Consensus 152 ~~~~~~~l~~~e~~A~~~~~e~e~~qg~~Nm~~~~~~~~~~~~r 195 (195) T protein:vir:88 152 AGEIEGSLQEQESEAWQQCQEYELDYGGFNMIDGDSYVGGIASR 195 (195) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhhCCcceeecCcccchhccC Confidence 999999999999999999999998887422211111 12223 No 20 >protein:vir:10451 Length: 196 # NCBI annotation: tail protein # Family: family:all:824 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848298;genbank:gi:30387489;genbank:GeneID:1733943 Probab=99.22 E-value=1.8e-12 Score=84.96 Aligned_cols=179 Identities=13% Similarity=0.181 Sum_probs=138.8 Q ss_pred CCCHHHHHHHHHHHhcchhccccccC-CHHHHHHHHhhHHHHHHHHhcCCCCceeeeE-ecccCCCCCCCCccccccCcc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDED-SKAAATLNSMYDDVLDACLRAHVWSFTKARA-QLAALAEAPLFGFSYQYRLPT 78 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e~-s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~-~La~l~~~p~~~~~yaY~lP~ 78 (207) +.|+++-.|.=|..||+.||+|+|+. .+.+..+....+.+++.+| ..-|.|=+-+. .|.|++. .++ . ..|. T Consensus 10 ~~teLdAVN~~L~aIGEspV~sld~~~npdva~a~~iL~~v~~~vq-seGW~FNtE~~~~ltpD~~---~~~--i-~~p~ 82 (196) T protein:vir:10 10 TAAELSAVNDILASIGEPPVSTLEGDSNADVANARRILNKINRQIQ-SRGWTFNIEEGITLLPDVY---SNL--I-VYSD 82 (196) T ss_pred ccchhHHHHHHHHhccccccccccCCCCccHHHHHHHHHHHHHHHh-hCCceeeecCceeecccCC---CCe--e-eCCc Confidence 67899999999999999999999975 5999999999999999999 67899998654 5766644 222 1 1288 Q ss_pred cceEeeecccCccccccccccceEEeCCEEEecCC------CceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhccC Q lcl|Aclame:pro 79 DFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQ------APLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLTQ 152 (207) Q Consensus 79 Dclrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~------~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt~ 152 (207) |+|++..++.. ..|...||+|+-... .|+++.-|.. -+-+..|..|+..++.|-|.+......+ T Consensus 83 n~L~~~~~~~~---------~~~v~Rgg~LYD~~n~T~~F~~pi~~~iv~~-~~FedlPe~~~~yI~~rAa~~f~~~~~G 152 (196) T protein:vir:10 83 DYLSLMATSGQ---------SIYVNRGGYVYDRTSQSDRFDSGITVNIIRL-RDYDEMPECFRYWIVTKASRQFNNRFFG 152 (196) T ss_pred ceeeeecCCCc---------eeeeeeCCeEEeccCCcEeeCCeeEEEEEee-cChhhhhHHHHHHHHHHHHHHHHHhhcC Confidence 99999866332 346677777653322 3677775655 4577789999999999999999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccCC Q lcl|Aclame:pro 153 SATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGV 196 (207) Q Consensus 153 ~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~~ 196 (207) +.++.+.+.++.+.+....+..+.+|+.-.-+-..+|+..=-++ T Consensus 153 ~~~~~q~l~~~e~~a~~~l~e~e~~q~~~Nml~~~p~~~~~~~r 196 (196) T protein:vir:10 153 APEVEGVLQEEEDEARRLCMEYEVDYGGYNMLDGDAFTSGLLTR 196 (196) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHhhcCcceeecCchhhccccC Confidence 99999999999999999999999998864433334443322222 No 21 >protein:vir:78929 Length: 184 # NCBI annotation: putative tail tubular protein A # Family: family:all:824 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522825;genbank:gi:158345060;genbank:GeneID:5687419 Probab=99.20 E-value=1.8e-12 Score=85.01 Aligned_cols=177 Identities=16% Similarity=0.135 Sum_probs=136.7 Q ss_pred CCHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeee-EecccCCCCCCCCccccccCcccc Q lcl|Aclame:pro 2 ASQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKAR-AQLAALAEAPLFGFSYQYRLPTDF 80 (207) Q Consensus 2 aS~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r-~~La~l~~~p~~~~~yaY~lP~Dc 80 (207) +|+++-.|.=|..||+.||+|+|++.+.+..|....+.+++.+| ..-|.|=+-+ ..|.|+++ |. ..+|.|+ T Consensus 1 ~teLdAVN~~L~aIGEspV~sld~~npdva~a~~iL~~v~~~vq-seGW~FNte~~~~ltPd~~----g~---I~~P~n~ 72 (184) T protein:vir:78 1 MLLLDAVNVILRKIGELPIPSMDETYPTMAIALPELEDQRIQLL-TQGWWFNTWWKHKLTPDPQ----GR---INLPKDT 72 (184) T ss_pred CchHHHHHHHHHhhCCcccccccCCCccHHHHHHHHHHHHHHHh-hCCceEeecCCeeeeecCC----Ce---EEcCccc Confidence 99999999999999999999999999999999999999999999 6789999986 68887753 32 3589999 Q ss_pred eEeeecccCccccccccccceEEeCCEEEecCC------CceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhccCCH Q lcl|Aclame:pro 81 IRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQ------APLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLTQSA 154 (207) Q Consensus 81 lrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~------~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt~~~ 154 (207) |++...+. ++...||+|+-... .|+++.-|.. -+-+..|..|+..++.+-|.+......|+. T Consensus 73 L~i~~~~~-----------d~~~Rgg~lYD~~n~T~~F~~~i~~~iv~~-~~FedlPe~~~~yI~~rAa~~f~~~~~G~~ 140 (184) T protein:vir:78 73 LAFYPDSP-----------DLQWDGLGVRDANTGDDRIGKSVEGRLVLS-REWDRIPEIAQRVIAHQAALAVYTHEIGPD 140 (184) T ss_pred eEeecCCc-----------eeEEcCcEEEeccCCcEEeCCeeEEEEEee-cChhhhhHHHHHHHHHHHHHHHHHhhcCch Confidence 99965332 35556666653322 3577776665 447778999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccCCCCCCCc Q lcl|Aclame:pro 155 TKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGVAFPGET 202 (207) Q Consensus 155 ~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~~~~~~~~ 202 (207) ++.+.+.++.+.+....+..+.+++.-.-.....+..-|++= |+ T Consensus 141 ~~~q~l~~ee~~a~~~~~~~e~~q~~~N~~~~~~~~r~r~~~----~~ 184 (184) T protein:vir:78 141 ETAQVIAQELQGYQNELSRMHTRSRPLNTQAKRSFSRWRRSL----RT 184 (184) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcCcchHHhhhhhHHHhhh----cC Confidence 999999999999999999999888753322111111111110 00 No 22 >protein:vir:2202 Length: 196 # NCBI annotation: tail tubular protein # Family: family:all:824 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041999;swissprot:sw:p03746;genbank:gi:9627471;uniprot:P03746;genbank:GeneID:1261030 Probab=99.19 E-value=2.7e-12 Score=83.98 Aligned_cols=178 Identities=13% Similarity=0.179 Sum_probs=138.5 Q ss_pred CCCHHHHHHHHHHHhcchhcccccc-CCHHHHHHHHhhHHHHHHHHhcCCCCceeeeE-ecccCCCCCCCCccccccC-c Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDE-DSKAAATLNSMYDDVLDACLRAHVWSFTKARA-QLAALAEAPLFGFSYQYRL-P 77 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e-~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~-~La~l~~~p~~~~~yaY~l-P 77 (207) +.|+++-.|.=|..||+.||+|+|+ +.+.+..+....+.+++.+| ..-|.|=+-+. .|.|++. .+ +-+ | T Consensus 10 ~~teLdAVN~~L~aIGEspV~sld~~~npdva~a~~iL~~v~~~vq-seGW~FNtE~~~~ltpD~~---~~----~i~p~ 81 (196) T protein:vir:22 10 TAAELSAVNDILASIGEPPVSTLEGDANADAANARRILNKINRQIQ-SRGWTFNIEEGITLLPDVY---SN----LIVYS 81 (196) T ss_pred hhhhhHHHHHHHHhcCccccccccCCCCccHHHHHHHHHHHHHHHh-hCCceeeecCceeecccCC---CC----eEeCc Confidence 7789999999999999999999998 47999999999999999999 67899998654 5766644 22 223 6 Q ss_pred ccceEeeecccCccccccccccceEEeCCEEEecCC------CceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhcc Q lcl|Aclame:pro 78 TDFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQ------APLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLT 151 (207) Q Consensus 78 ~Dclrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~------~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt 151 (207) .|+|++..++.. .+|...||+|+-... .|+++.-|.. -+-+..|..|+..++.|-|.+...... T Consensus 82 ~~~L~~~~~~~~---------~~~v~Rgg~LYD~~n~T~~F~~pi~~~iv~~-~~FedlPe~~~~yI~~rAa~~f~~~~~ 151 (196) T protein:vir:22 82 DDYLSLMSTSGQ---------SIYVNRGGYVYDRTSQSDRFDSGITVNIIRL-RDYDEMPECFRYWIVTKASRQFNNRFF 151 (196) T ss_pred cceeeeecCCCc---------eeeeeeCCeEEeccCCcEeeCCceEEEEEee-cChhhhhHHHHHHHHHHHHHHHHHhhc Confidence 799999876432 357777777653322 3677775665 457778999999999999999999999 Q ss_pred CCHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccCC Q lcl|Aclame:pro 152 QSATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGV 196 (207) Q Consensus 152 ~~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~~ 196 (207) ++.++.+.+.++.+.+....+..+.+|+.-.-+-..+|+..=-++ T Consensus 152 G~~~~~q~l~~~e~~a~~~l~e~e~~q~~~Nml~~~p~~~~~~~r 196 (196) T protein:vir:22 152 GAPEVEGVLQEEEDEARRLCMEYEMDYGGYNMLDGDAFTSGLLTR 196 (196) T ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhhcCcceeecCchhhccccC Confidence 999999999999999999999999998864433334443322222 No 23 >protein:vir:78741 Length: 197 # NCBI annotation: tail tube A # Family: family:all:824 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285449;genbank:gi:148724483;genbank:GeneID:5220212 Probab=99.18 E-value=4.3e-12 Score=82.92 Aligned_cols=184 Identities=21% Similarity=0.221 Sum_probs=135.3 Q ss_pred CC---CHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeee-EecccCCCCCCCCccccccC Q lcl|Aclame:pro 1 MA---SQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKAR-AQLAALAEAPLFGFSYQYRL 76 (207) Q Consensus 1 Ma---S~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r-~~La~l~~~p~~~~~yaY~l 76 (207) |+ |+++-.|.=|..||+.||+|+|++.+.+..+....+.+++.+| ..-|.|=+-+ ..|.|.++ |. -.+ T Consensus 1 m~~~~teLdAVN~~L~aIGEspV~sld~~npdva~a~~iL~~v~~~vq-seGW~FNte~~~~l~pd~~----g~---I~~ 72 (197) T protein:vir:78 1 MASKLTKLGAVNIVLTNIGMAPVTLIDSNNPMVATAQTILDEVSGSVQ-SEGWSYNTERAYPFIKDNT----GR---IAI 72 (197) T ss_pred CccchhHHHHHHHHHHhhCCcccceeeCCCccHHHHHHHHHHHHHHHh-hCCceEeecCCceecCCCC----Ce---Eec Confidence 77 5889999999999999999999999999999999999999999 6789999876 46665432 32 568 Q ss_pred cccceEeeecccCccccccccccceEEeCCEEEecCC------CceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhc Q lcl|Aclame:pro 77 PTDFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQ------APLYIRYAKRVTDPNAMDALFREAFACRLAAEACESL 150 (207) Q Consensus 77 P~Dclrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~------~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pL 150 (207) |.|+|++--.+.. ..++...||+|+-... .|+++.-|.. -+-+..|..|+..++.+-|.+..... T Consensus 73 P~n~L~vd~~~~~--------~~~~v~Rgg~LYD~~n~T~~F~~pi~~~iv~~-~~FedlPe~~~~yI~~rAa~~f~~~~ 143 (197) T protein:vir:78 73 PSNVLSLDCASTS--------KYDLIIRGGFLYDKAGHTDVFTENLELDVVWC-FEFDDLPEAVKNYITIRAANLFAGRA 143 (197) T ss_pred CccceEEecCCCc--------eeeEEEeCCeEEeccCCcEEeCCceEEEEEee-cChhhhhHHHHHHHHHHHHHHHHHhh Confidence 9999999544321 0135567777753322 3577775665 44777799999999999999999999 Q ss_pred cCCHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccCCCCCCCcc---cccC Q lcl|Aclame:pro 151 TQSATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGVAFPGETP---IIRS 207 (207) Q Consensus 151 t~~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~~~~~~~~~---~~~~ 207 (207) .|+.++.+.+.++.+.+....+..+.+|+.-.-+ +.+.+.++..=-| ++|- T Consensus 144 ~G~~~~~q~l~~~e~~a~~~~~~~e~~q~~~Nml------~~~~~~~~~~yrp~~~l~r~ 197 (197) T protein:vir:78 144 VGSAEAVKYSQREEAAARAAIIEYETQQGDYNML------ESESGRDIYTYRPFDAVYRF 197 (197) T ss_pred cCchhHHHHHHHHHHHHHHHHHHHHHhhcCcCcc------cCccccCcCcccchhhhhcC Confidence 9999999999999999999998888887642211 1111111111111 1111 No 24 >protein:vir:351 Length: 242 # NCBI annotation: hypothetical protein # Family: family:all:3196 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203465;genbank:gi:15320621;genbank:GeneID:921727 Probab=97.86 E-value=1.8e-06 Score=52.13 Aligned_cols=182 Identities=14% Similarity=0.147 Sum_probs=113.2 Q ss_pred CC--CHHHHHHHHHHHhcchhcc---ccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCCCCCCcccccc Q lcl|Aclame:pro 1 MA--SQVGICNRALTKIGDKRIT---SLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFSYQYR 75 (207) Q Consensus 1 Ma--S~v~IcN~AL~~lG~~~I~---Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~p~~~~~yaY~ 75 (207) |+ |.++|.|.+...+|-.... =+.-.-+.......+-...-+++.+.|+|++-++...++-.++. -.|. T Consensus 1 ~~~~t~lsiin~v~~~i~L~~~~~a~v~sstD~~~~~l~ala~~~g~eia~~~dW~~l~~~~~~t~~~~~------~~y~ 74 (242) T protein:vir:35 1 MAWDTAASIINDAAVELGLLATDVADPYASADVNLVQLCRLLKSLGQDMVRDYQWTHLQQQWTFATQVGL------ANYE 74 (242) T ss_pred CchHHHHHHHHHHHHHhcccCCccccccccchHHHHHHHHHHHHHHHHHHHhcCCcchheeeeecccccC------CCCC Confidence 88 8999999999888843222 23334466777777888888888899999999999888654432 2577 Q ss_pred CcccceEeeec----ccC----------cc-------ccccccccceEEeCCEEEecCCC----ceEEEEEeecC----- Q lcl|Aclame:pro 76 LPTDFIRLLQV----GQF----------DV-------YPRTDTRGLFSIENGNILTDMQA----PLYIRYAKRVT----- 125 (207) Q Consensus 76 lP~Dclrv~~v----~~~----------~~-------~~~~~~~~~y~v~g~~l~~~~~~----~~~l~Y~~~v~----- 125 (207) ||.||-|+++- .+. +. ........-|.+.|++|+..-.+ .+...|+++.- T Consensus 75 lP~D~~R~v~~~~w~rt~~~p~~gP~s~~~W~~l~~~~sa~~~~~~~ri~ggqi~~~P~paa~~~~~f~YiSknWv~~~~ 154 (242) T protein:vir:35 75 MPPDYNRFVDQTGWNRTQRMPLLGPLSAQGWQLLQVLTSAGTVDVMYRLVGGEFVLHPTPESVADIAYEYVSSHWVGTGG 154 (242) T ss_pred cchhhHHhhcCcccceeecceecCCcChhhhhhhhhhccCCCCCceEEEEcCEEEeecCcccccceeEeeecCccccCCC Confidence 89999988841 000 00 00011123599999999876553 35789998851 Q ss_pred -------------Chhhcc-HHHHHHHHHHHHHHhhhhccCCHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhh Q lcl|Aclame:pro 126 -------------DPNAMD-ALFREAFACRLAAEACESLTQSATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLE 191 (207) Q Consensus 126 -------------d~~~~~-~~F~~ala~~LAa~lA~pLt~~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~ 191 (207) |+..|| -++.--|.||-=. .+.---+.....|+..|.+|+..|.-.+.-. T Consensus 155 ~~~k~~~t~~ad~Dt~~l~erLL~LGlIWRWkr-------aKGldy~e~l~~YE~aL~~~~~~d~G~~~l~--------- 218 (242) T protein:vir:35 155 SETPNADAPESGGDTLFFDRRLLVCGLKLRWQR-------AKGFDSTACQDDYDKALERAQGGDGAAPVLS--------- 218 (242) T ss_pred CccccccccccCCCceechHHHHhHhHHHHHHh-------hcCCCHHHHHHHHHHHHHHHHhhcCCCceec--------- Confidence 222332 2444445554311 1112234667889999999998875443211 Q ss_pred hccCCCCCCCcccccC Q lcl|Aclame:pro 192 SRNGVAFPGETPIIRS 207 (207) Q Consensus 192 aR~~~~~~~~~~~~~~ 207 (207) =+++|++.|..++ T Consensus 219 ---l~~~~~~~p~~~~ 231 (242) T protein:vir:35 219 ---LNRRPFATNRMLD 231 (242) T ss_pred ---CCCCccCCccccC Confidence 1234556666665 No 25 >protein:vir:1780 Length: 67 # NCBI annotation: tail protein B # Family: family:all:824 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570346;genbank:gi:18640505;genbank:GeneID:932718 Probab=96.92 E-value=3.7e-06 Score=50.36 Aligned_cols=61 Identities=23% Similarity=0.400 Sum_probs=55.0 Q ss_pred CC-----CHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeee-EecccC Q lcl|Aclame:pro 1 MA-----SQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKAR-AQLAAL 62 (207) Q Consensus 1 Ma-----S~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r-~~La~l 62 (207) || |+++-.|.=|..||+.||+|++++.+.+.......+.+++.++ ..-|.|=+-+ ..|.|+ T Consensus 1 ~~~~~~~teLdAVN~~L~aIGesPV~sld~~npdva~a~~iL~~v~~~vq-seGW~FNte~~~~ltPd 67 (67) T protein:vir:17 1 MAPIKRTSELDALNVKMTNIGQQPIVNINNTNPQVALAKTVLNQVTSDVL-TEGWIFNRELDYPLTPQ 67 (67) T ss_pred CCCccccchhhHHHHHHHhhCccccccccCCCccHHHHHHHHHHHHHHHh-hCCceeeccCceeecCC Confidence 43 6999999999999999999999999999999999999999999 6779998765 777777 No 26 >protein:vir:105380 Length: 160 # NCBI annotation: gene 7 protein # Family: family:all:1693 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958183;genbank:gi:41057285;genbank:GeneID:2716627 Probab=92.31 E-value=0.012 Score=31.13 Aligned_cols=145 Identities=17% Similarity=0.194 Sum_probs=90.7 Q ss_pred CCCHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCCCCCCccccccCcccc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFSYQYRLPTDF 80 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~p~~~~~yaY~lP~Dc 80 (207) |.|+=+|.=-||.++|-.+-+++.|--++... +.+++--+-++.|.+ ..-.-||. T Consensus 4 ~~Tkgdiv~~Alrklgvas~at~~Dvepqs~~-----~gindLe~MMaeW~a-----------~gi~lGy~--------- 58 (160) T protein:vir:10 4 VLTKGEIVLFALRKFAIASNASLTDVEPQSIE-----DGVNDLEDMMSEWMI-----------NPGDIGYA--------- 58 (160) T ss_pred chhHHHHHHHHHHHhcccccccccCCChHHHH-----HHHHHHHHHHHHhcc-----------CCcceeee--------- Confidence 77999999999999997666666553333322 223333334455641 00011332 Q ss_pred eEeeecccCccccccccccceEEeCCEEEecCCCceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhcc--CCHHHHH Q lcl|Aclame:pro 81 IRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQAPLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLT--QSATKRQ 158 (207) Q Consensus 81 lrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt--~~~~~~~ 158 (207) +++.+. +.+-.|..--|+.|++++.+.||.+||...- -++.... T Consensus 59 ----------------------------Fa~~~e------~p~pdD~~glp~~~~~aV~~~LAvria~dygie~s~~~~s 104 (160) T protein:vir:10 59 ----------------------------FATGDE------QPLPDDESGLPRKYKHAVGYQLLLRMLSDYSLEPTPQVLS 104 (160) T ss_pred ----------------------------ecccCC------CCCccccccCChhHHHHHHHHHHHHHHHhcCCCcCHHHHH Confidence 222221 2223455667999999999999999999865 4778888 Q ss_pred HHHHHHHHHHHHHHhhhhhcCCCCCC-----CCchhhhhccCCCCCCCcccccC Q lcl|Aclame:pro 159 GAWAEHDQAIAAAIRVNAIERPAQPL-----GDDTWLESRNGVAFPGETPIIRS 207 (207) Q Consensus 159 ~l~~~~~~~l~~A~~~da~e~~~~~~-----~~~~~~~aR~~~~~~~~~~~~~~ 207 (207) .+...|+..+.....+-+.+++..-+ ..+.|...|+. ||+-|-+-+ T Consensus 105 ~A~~ay~~Ll~a~~~~p~~~~~~~mP~gsGn~~~~~~~~~yy---~~~~~~~d~ 155 (160) T protein:vir:10 105 NAQRSYDALMTDTLVVPSMRRRGDFPVGQGNKYDVFTSDRYY---PGDLPLIDG 155 (160) T ss_pred HHHHHHHHHHHHHhhcchhhccCCCCCCCCCccccccccccc---CCCccccCC Confidence 89999999988888887776654321 12456666654 455454444 No 27 >protein:vir:176 Length: 160 # NCBI annotation: DNA stabilization protein # Family: family:all:1693 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112081;genbank:gi:13559871;genbank:GeneID:920968 Probab=92.19 E-value=0.012 Score=31.02 Aligned_cols=145 Identities=17% Similarity=0.197 Sum_probs=90.5 Q ss_pred CCCHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCCCCCCccccccCcccc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFSYQYRLPTDF 80 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~p~~~~~yaY~lP~Dc 80 (207) |.|+=+|.=-||.++|-.+-+++.|--++... +.+++--+-++.|.+ ..-.-||. T Consensus 4 ~~Tkgdiv~~Alrklgvas~at~~Dvepqs~~-----~gindLe~MMaeW~a-----------~gi~lGy~--------- 58 (160) T protein:vir:17 4 VLTKGEIVLFALRKFAIASNASLTDVEPQSIE-----DGVNDLEDMMSEWMI-----------NPGDIGYA--------- 58 (160) T ss_pred chhHHHHHHHHHHHhcccccccccCCChHHHH-----HHHHHHHHHHHHhcc-----------CCcceeee--------- Confidence 77999999999999997666666553333322 223333334455641 00011332 Q ss_pred eEeeecccCccccccccccceEEeCCEEEecCCCceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhcc--CCHHHHH Q lcl|Aclame:pro 81 IRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQAPLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLT--QSATKRQ 158 (207) Q Consensus 81 lrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt--~~~~~~~ 158 (207) +++.+. +.+-.|..--|+.|++++.+.||.+||...- -++.... T Consensus 59 ----------------------------Fa~~~e------~p~pdD~~glp~~~~~aV~~~LAvria~dygie~s~~~~s 104 (160) T protein:vir:17 59 ----------------------------FATGDE------QPLPDDESGLPRKYKHAVGYQLLLRMLSDYSLEPTPQVLS 104 (160) T ss_pred ----------------------------ecccCC------CCCccccccCChhHHHHHHHHHHHHHHHhcCCCcCHHHHH Confidence 222221 2223455667999999999999999999865 4778888 Q ss_pred HHHHHHHHHHHHHHhhhhhcCCCCCC-----CCchhhhhccCCCCCCCcccccC Q lcl|Aclame:pro 159 GAWAEHDQAIAAAIRVNAIERPAQPL-----GDDTWLESRNGVAFPGETPIIRS 207 (207) Q Consensus 159 ~l~~~~~~~l~~A~~~da~e~~~~~~-----~~~~~~~aR~~~~~~~~~~~~~~ 207 (207) .+...|+..+.....+-+.+++..-+ ..+.|...|+. ||+-|-+-+ T Consensus 105 ~A~~ay~~Ll~a~~~~p~~~~~~~mP~gsgn~~~~~~~~~yy---~~~~~~~d~ 155 (160) T protein:vir:17 105 NAQRSYDALMTDTLVVPSIRRRGDFPVGQGNKYDVFTSDRYY---PGDLPLIDG 155 (160) T ss_pred HHHHHHHHHHHHHhhcchhhccCCCCCCCCCccccccccccc---CCCccccCC Confidence 89999999988888877766654321 12456666654 455454444 No 28 >protein:vir:80185 Length: 221 # NCBI annotation: tail tubular protein A # Family: family:all:12085 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285796;genbank:gi:148747830;genbank:GeneID:5220461 Probab=91.65 E-value=0.007 Score=32.41 Aligned_cols=181 Identities=15% Similarity=0.175 Sum_probs=95.7 Q ss_pred CC--CHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCCCCCCccccccCcc Q lcl|Aclame:pro 1 MA--SQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFSYQYRLPT 78 (207) Q Consensus 1 Ma--S~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~p~~~~~yaY~lP~ 78 (207) |. |-+.-||.-|..||+.++..|+- +-....+..|..+.+++-+-|.|.+-.-. -|.-.|.---..=. T Consensus 1 M~~TtLL~A~NEVL~~iGE~~~~~~s~--~~G~r~k~~~~~AlR~V~aiH~W~~L~~~--------i~AiSW~~D~A~L~ 70 (221) T protein:vir:80 1 MSDTTLLQASNEVLRSIGERPLLQLSG--TTGDRLKDVFRQALRDVEAIHTWDWLYNQ--------IPAISWTQDEAYLG 70 (221) T ss_pred CCcchhHhhhHHHHhhccchhhhhhcc--chhHHHHHHHHHHHHHHHHHHHHHHHHhh--------ccceeeccccchhh Confidence 76 57788999999999999998864 44567788999999999999999864321 22233332222223 Q ss_pred cceEeeecccCccccccccc----------------------cceE--EeCCEEEecCCC-----ceEEEE--EeecC-- Q lcl|Aclame:pro 79 DFIRLLQVGQFDVYPRTDTR----------------------GLFS--IENGNILTDMQA-----PLYIRY--AKRVT-- 125 (207) Q Consensus 79 Dclrv~~v~~~~~~~~~~~~----------------------~~y~--v~g~~l~~~~~~-----~~~l~Y--~~~v~-- 125 (207) |.-++..|..++..-+..+. .-|. .++++++.+-.+ .-.|++ .-..+ T Consensus 71 ~IQ~L~tVS~G~kt~G~~ELqwvdftdYdk~~~~s~T~Tddn~m~Y~~~~~~~V~L~P~P~~s~a~~~IrF~VL~~~T~P 150 (221) T protein:vir:80 71 DIQRLFTVSCGDKTTGYRELQWVDFTDYDKQPITSYTGTDDNAMWYTMTSNGRVKLNPYPEDSQAQQRIRFYVLQTLTMP 150 (221) T ss_pred hhhhhhhhcccccccchhhhhccccccccccceeeeecccCcceeeeeccCCeeEeccCccccccccceeEEEeeccccC Confidence 33333333222111111110 1111 123444332111 111222 11111 Q ss_pred --C---hhhccHHHHHHHHHHHHHHhhhhccCCHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhcc Q lcl|Aclame:pro 126 --D---PNAMDALFREAFACRLAAEACESLTQSATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRN 194 (207) Q Consensus 126 --d---~~~~~~~F~~ala~~LAa~lA~pLt~~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~ 194 (207) + -+..|.-|...+.-+--.-++.--+.+.+-++.+..+|+..-.+-+ .+|++.....-+-+-..|+ T Consensus 151 S~~s~~FsvLPe~~~~LV~~~A~~LM~~~H~~D~~~A~~~~~E~Ei~s~~~R---~~erkaptqqlsmyrrrrr 221 (221) T protein:vir:80 151 SQDSSTFSVLPERYMPLVIKRASYLMALRHLDDTSGAAYFNNEYEILSQQYR---NNERKAPTQQLSMYRRRRR 221 (221) T ss_pred CCCCccccccchhhhHHHHHHHHHHHHHhhcchhhHHHHhhhHHHHHHHHHh---hhhhcchhHHHHHHHhhcC Confidence 2 2334677777665555555566667888899999999986544433 2344333222222323333 No 29 >protein:vir:3130 Length: 250 # NCBI annotation: hypothetical protein # Family: family:all:7212 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640312;genbank:gi:21234411;genbank:GeneID:956053 Probab=88.93 E-value=0.029 Score=28.99 Aligned_cols=181 Identities=20% Similarity=0.258 Sum_probs=86.2 Q ss_pred CC--CHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeE-ecccCCCC-----C--C--- Q lcl|Aclame:pro 1 MA--SQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARA-QLAALAEA-----P--L--- 67 (207) Q Consensus 1 Ma--S~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~-~La~l~~~-----p--~--- 67 (207) |. |-++|.-.-|+..|.+.++|+.|+|.|.......--.+...+....+|.|-.-+- .+..-+++ | . T Consensus 1 ~~~~tll~iv~~~~~~~~sdev~s~~~dsie~~~~~~~~~~v~~~~~~~~~w~~lkl~p~~~~A~pth~~ls~P~~vk~i 80 (250) T protein:vir:31 1 MPKRTLLQIVKKMAQKTGSDEVTSLSEDSIEIQDMVDCALEVLEDIIYRNDWEFLKDRPAQLEAGTNAIELSIPDNVRKI 80 (250) T ss_pred CchhhHHHHHHHHHHhcccchhcccchhhhHHHHHHHHHHHHHHHHhhcCCcceeeecccccccccceeeeeccccccce Confidence 88 7889999999999999999999999999999888888888899999997754321 11111111 1 0 Q ss_pred --CCcc------------ccccCcccceEeeecccCccccccccccceEEeCCE-------------------------- Q lcl|Aclame:pro 68 --FGFS------------YQYRLPTDFIRLLQVGQFDVYPRTDTRGLFSIENGN-------------------------- 107 (207) Q Consensus 68 --~~~~------------yaY~lP~Dclrv~~v~~~~~~~~~~~~~~y~v~g~~-------------------------- 107 (207) -.|. -.|..|.++++-++-..+...+. +.-.|.|=. T Consensus 81 ~fl~Y~v~da~~~~~yRtlky~~PdeF~~sl~sn~~~d~D~----~~v~I~gveLlirnd~~P~YyTSFDd~tvvlDSYD 156 (250) T protein:vir:31 81 QTLRYRYEDAGVQNCFRTLRYMYPHEFMERLQNNKPTDPDT----TTVTINGVELYPKTNRHPRYWTSFDEQNVVLDSYD 156 (250) T ss_pred eeeeeeecccccceeeeeeeccChHHHHHHHhhcCCCCccc----ceeEeeeeeeeeecCCCcceecccCCceeeeeccc Confidence 0122 23677887775443221111110 111111111 Q ss_pred -----EEecCCCc--eEEEEEeec-CChhhc-cHHHHHHHHHHHHHHhhhhc-----cCCHHHHHHHHHHHHHHHHH--H Q lcl|Aclame:pro 108 -----ILTDMQAP--LYIRYAKRV-TDPNAM-DALFREAFACRLAAEACESL-----TQSATKRQGAWAEHDQAIAA--A 171 (207) Q Consensus 108 -----l~~~~~~~--~~l~Y~~~v-~d~~~~-~~~F~~ala~~LAa~lA~pL-----t~~~~~~~~l~~~~~~~l~~--A 171 (207) ++.+.... +.+.|+.=. .+.+.| |+.=.+++++.|+-..+... +.++...+...+.|-...+. + T Consensus 157 asvd~tl~~ak~~A~~~v~y~~F~ds~~D~fvp~Ipd~~f~~ll~EAks~Af~~fkq~anpkaEq~arR~~~q~~~k~~~ 236 (250) T protein:vir:31 157 ATQNPTGVDATDSAIIATLYLDFTGSDADSWVAPIPESLFTLWEQEAVAEAFVQFRQTENPRAERRSRRTYVQQIKKEPV 236 (250) T ss_pred cccccccccchheeeeeeecceecCCcccccCCCCcHHHHHHHHHHhhHHHhhhhhcccCchhHHHHHHHHHHHhhhccc Confidence 11111111 122222110 111222 22223344444443333332 23444334444433333332 2 Q ss_pred HhhhhhcCCCCCCCCchhhhhccCCCCCCC Q lcl|Aclame:pro 172 IRVNAIERPAQPLGDDTWLESRNGVAFPGE 201 (207) Q Consensus 172 ~~~da~e~~~~~~~~~~~~~aR~~~~~~~~ 201 (207) ... +++..++-+ || T Consensus 237 ~hk--n~G~~~~NY--------------GR 250 (250) T protein:vir:31 237 THK--DEGSDEVNY--------------GR 250 (250) T ss_pred ccc--ccCcCCCCC--------------CC Confidence 222 222222222 22 No 30 >protein:vir:105524 Length: 166 # NCBI annotation: phage DNA stabilization protein # Family: family:all:1693 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516193;genbank:gi:89885996;genbank:GeneID:3964384 Probab=86.39 E-value=0.046 Score=27.92 Aligned_cols=145 Identities=18% Similarity=0.282 Sum_probs=87.6 Q ss_pred CCCHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCCCCCCccccccCcccc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFSYQYRLPTDF 80 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~p~~~~~yaY~lP~Dc 80 (207) |.|+=+|.=.||.++|--+-+++.+-.++.- =+.+++--+-++.|. + ..-.-||. T Consensus 4 ~~TkgeiV~~Alrklgv~s~at~~Dvepq~~-----~~gv~dLe~MMaeW~--------a---~gi~lGy~--------- 58 (166) T protein:vir:10 4 LLTKGDVVLFALRKCAVASNATLTDVEPQSV-----TDGLDDLEAMMAEWR--------A---RGIDIGYR--------- 58 (166) T ss_pred cchHHHHHHHHHHHhccCcchhhcCCCHHHH-----HHHHHHHHHHHHHhc--------c---Ccceeeee--------- Confidence 7799999999999999765555554333321 123444444556671 0 00011332 Q ss_pred eEeeecccCccccccccccceEEeCCEEEecCCCceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhcc--CCHHHHH Q lcl|Aclame:pro 81 IRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQAPLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLT--QSATKRQ 158 (207) Q Consensus 81 lrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt--~~~~~~~ 158 (207) +++.+.. .+-.|..--|+.|++++.+.||.+||...- -+++... T Consensus 59 ----------------------------Fa~~~e~------p~pdD~~glp~~~~~aV~~~LAvria~dygi~~s~~~~s 104 (166) T protein:vir:10 59 ----------------------------FAQDEQT------VSPDDPTGVLPLFTSAVGYQLMLRVLADYGIAPTPRQEA 104 (166) T ss_pred ----------------------------ecccCCC------CCccccccCchhHHHHHHHHHHHHHHHhcCCCcCHHHHH Confidence 2222221 223455567999999999999999999864 4777888 Q ss_pred HHHHHHHHHHHHHHhhhhhcCCCC-CCCCc----hhhhhccCCCCCCCcccc----------cC Q lcl|Aclame:pro 159 GAWAEHDQAIAAAIRVNAIERPAQ-PLGDD----TWLESRNGVAFPGETPII----------RS 207 (207) Q Consensus 159 ~l~~~~~~~l~~A~~~da~e~~~~-~~~~~----~~~~aR~~~~~~~~~~~~----------~~ 207 (207) .+...|+..+.....+-..+++.. |.+.+ .|...|+ +|++-|.+ || T Consensus 105 ~A~~ay~~Ll~a~~~~p~~~r~~dmP~GsGN~~~~~~~~~y---y~~~~~~~dg~~~~~~~~~~ 165 (166) T protein:vir:10 105 TAHTAYDALLTATLTVPSLKRRGDMPTGQGNQYDHFISGRY---YPDKRENVNGDNAITPAKRS 165 (166) T ss_pred HHHHHHHHHHHHHhhcccccccCCCCcccCccccccccccc---ccCCccccCcccccCccccc Confidence 888899988888777776666543 22222 2333443 34444443 44 No 31 >protein:vir:3528 Length: 160 # NCBI annotation: P27 # Family: family:all:1693 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050988;genbank:gi:9633574;genbank:GeneID:1262321 Probab=84.52 E-value=0.06 Score=27.29 Aligned_cols=144 Identities=13% Similarity=0.092 Sum_probs=90.7 Q ss_pred CCCHHHHHHHHHHHhcchhccccccCCHH--HHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCCCCCCccccccCcc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDEDSKA--AATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFSYQYRLPT 78 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e~s~~--A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~p~~~~~yaY~lP~ 78 (207) |+|+=+|.=-||.++|-.+-.++.|=-++ .+.|+ |--+-++.|. .+ .-.-||. T Consensus 4 ~~Tkgdiv~~Alrk~gvas~at~~DvePq~~e~gin-------dLe~MMaeW~--a~---------gi~lGy~------- 58 (160) T protein:vir:35 4 PLTKGEIVLFALRKAGIASEATNIDVEPQSFEEGIN-------DLEDLMAELQ--IT---------FGDLGYQ------- 58 (160) T ss_pred cchHHHHHHHHHHHhhccccccccCCCHHHHHHHHH-------HHHHHHHhhc--cC---------Cceeeee------- Confidence 77899999999999998777666653332 23333 3333345551 11 0011332 Q ss_pred cceEeeecccCccccccccccceEEeCCEEEecCCCceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhhcc--CCHHH Q lcl|Aclame:pro 79 DFIRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQAPLYIRYAKRVTDPNAMDALFREAFACRLAAEACESLT--QSATK 156 (207) Q Consensus 79 Dclrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~pLt--~~~~~ 156 (207) +.+++.. .+-.|..-.|+.|++++.+.||.+||...- -++.. T Consensus 59 ------------------------------Fa~~~e~------p~pdD~~glp~~~~~aV~~~LAvria~dyG~~~s~~~ 102 (160) T protein:vir:35 59 ------------------------------FSAEEEN------PTSDDASGLPRKYKQVMGYQLMLRMLSDYGIEPTPRQ 102 (160) T ss_pred ------------------------------ecccCCC------CCcccccCCchhHHHHHHHHHHHHHHHhhCCCcCHHH Confidence 2222221 223455667999999999999999999865 46778 Q ss_pred HHHHHHHHHHHHHHHHhhhhhcCCCCCC-CC----chhhhhccC----CCCCCCcccc Q lcl|Aclame:pro 157 RQGAWAEHDQAIAAAIRVNAIERPAQPL-GD----DTWLESRNG----VAFPGETPII 205 (207) Q Consensus 157 ~~~l~~~~~~~l~~A~~~da~e~~~~~~-~~----~~~~~aR~~----~~~~~~~~~~ 205 (207) ...+...|+..+.....+-..+++.+-+ +. ..|...|+. --.|+..||- T Consensus 103 ~a~A~~ay~~Ll~a~~~~p~~~r~~~mP~GsGN~~~a~~g~ryy~~r~~~~~~~~p~~ 160 (160) T protein:vir:35 103 EASAAAAYDALLTDTLSVPSIARRGDMPVGQGNNYTALGTASYYVERGFHAKNTDPVS 160 (160) T ss_pred HHHHHHHHHHHHHHhhccccccccCCCCcccccccccCCcceeccCceeccCCCCcCC Confidence 8888889999998888887777654422 21 234455541 1346777877 No 32 >protein:vir:9267 Length: 166 # NCBI annotation: 4 # Family: family:all:1693 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720331;genbank:gi:24371589;genbank:GeneID:955812 Probab=70.15 E-value=0.21 Score=24.27 Aligned_cols=148 Identities=14% Similarity=0.094 Sum_probs=86.6 Q ss_pred CCCHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCCCCCCccccccCcccc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFSYQYRLPTDF 80 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~p~~~~~yaY~lP~Dc 80 (207) |.|+=+|.=-||.++|-.+-+++.|--++.-. +-+++--+-++.|--+ +..=.-||.|+ T Consensus 3 m~Tkgdiv~~AlrklgVas~atltDvEpqs~~-----dgi~dLe~MMaeW~~~---------a~gI~lGY~fa------- 61 (166) T protein:vir:92 3 IKTKGDLARAALRKLGVASDATLTDIEPQSMQ-----DAVDDLEAMMAEWYQD---------GKGIITGYIFS------- 61 (166) T ss_pred cchHHHHHHHHHHHhccccccccccCChHHHH-----HHHHHHHHHHHHHhhc---------CCCceeccccc------- Confidence 88999999999999998766666654444322 3455555566677310 00001244222 Q ss_pred eEeeecccCccccccccccceEEeCCEEEecCCCceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhh--ccCCHHHHH Q lcl|Aclame:pro 81 IRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQAPLYIRYAKRVTDPNAMDALFREAFACRLAAEACES--LTQSATKRQ 158 (207) Q Consensus 81 lrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~p--Lt~~~~~~~ 158 (207) +.+. +.+-.|..-.|+.|.+|+.+.||.+||.- |--+++... T Consensus 62 ------------------------------~~~e------~p~pdD~~glp~~a~~av~~~LAvria~dygie~t~~v~s 105 (166) T protein:vir:92 62 ------------------------------DDDN------PPAEGDDHGLRSSAVSAVFHNLACRIAPDYALEATAKIIA 105 (166) T ss_pred ------------------------------cccC------CCCCccccCCCHhHHHHHHHHHHHHHHhhcCCCcCHHHHH Confidence 1111 22334566679999999999999999987 445788888 Q ss_pred HHHHHHHHHHHHHHhhhhhcCC-CCCCCC------chhhhhcc-CCCCCCCcccccC Q lcl|Aclame:pro 159 GAWAEHDQAIAAAIRVNAIERP-AQPLGD------DTWLESRN-GVAFPGETPIIRS 207 (207) Q Consensus 159 ~l~~~~~~~l~~A~~~da~e~~-~~~~~~------~~~~~aR~-~~~~~~~~~~~~~ 207 (207) .....|...+.......+.-++ +..++. ..|...|. -+.+. .+-+.. T Consensus 106 ~A~~~y~aLl~~t~~a~~~r~~Y~~dmP~G~Gn~~~~~~~~~y~~~~~~--~~~t~p 160 (166) T protein:vir:92 106 TAKYGKELLYKQTAIARAKRAPYPSRMPTGSGNSFANLNEWHYFPGEQN--ADSTTP 160 (166) T ss_pred HHHHHHHHHHHHHHhhcccccCCCCCCCcCCCccchhcccccccccccc--CCCCCc Confidence 8999999888877776665442 111111 12333332 22221 111111 No 33 >protein:vir:100918 Length: 166 # NCBI annotation: Gp4 # Family: family:all:1693 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006410;genbank:gi:46358702;genbank:GeneID:2777068 Probab=66.42 E-value=0.27 Score=23.72 Aligned_cols=149 Identities=14% Similarity=0.081 Sum_probs=86.7 Q ss_pred CCCHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCCCCCCccccccCcccc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFSYQYRLPTDF 80 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~p~~~~~yaY~lP~Dc 80 (207) |.|+=+|.=-||.++|-.+-+++.|--++.-. +-+++--+-++.|--+. ..=.-||.|+ T Consensus 3 m~TkgdiVl~AlrklgVas~atltDvEpqs~~-----dgi~dLe~MMaeW~~~a---------~gI~lGY~fa------- 61 (166) T protein:vir:10 3 IKTKGDLVRAALRKLGVASDATLTDVEPQSMQ-----DAVDDLEAMMAEWYQDG---------KGIVTGYVFS------- 61 (166) T ss_pred cchHHHHHHHHHHHhccccccccccCChHHHH-----HHHHHHHHHHHHHhhcC---------CCceecceec------- Confidence 88999999999999998766666654444322 34555555566673110 0001244222 Q ss_pred eEeeecccCccccccccccceEEeCCEEEecCCCceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhh--ccCCHHHHH Q lcl|Aclame:pro 81 IRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQAPLYIRYAKRVTDPNAMDALFREAFACRLAAEACES--LTQSATKRQ 158 (207) Q Consensus 81 lrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~p--Lt~~~~~~~ 158 (207) +.+. +.+-.|..-.|+.|.+|+.+.||.+||.- |--+++... T Consensus 62 ------------------------------~~~e------~p~pdD~~glp~~a~~av~~~LAvria~dygie~t~~v~s 105 (166) T protein:vir:10 62 ------------------------------DDDN------PPSEGDDHGLRSSAISAVFHNLACRIAPDYALEATAKIIA 105 (166) T ss_pred ------------------------------ccCC------CCCCccccCCCHhHHHHHHHHHHHHHHhhcCCCcCHHHHH Confidence 1111 22334566679999999999999999987 445788888 Q ss_pred HHHHHHHHHHHHHHhhhhhcCCC-CCCCC------chhhhhccCCCCCCCcccccC Q lcl|Aclame:pro 159 GAWAEHDQAIAAAIRVNAIERPA-QPLGD------DTWLESRNGVAFPGETPIIRS 207 (207) Q Consensus 159 ~l~~~~~~~l~~A~~~da~e~~~-~~~~~------~~~~~aR~~~~~~~~~~~~~~ 207 (207) .....|...++.+....+.-++= ..++. ..|...|..-+.- ..+-+.. T Consensus 106 ~A~~ay~aLl~~t~~a~~~~~~Y~~dmP~G~Gn~~~~~~~~~y~~~~~-~~~~~~p 160 (166) T protein:vir:10 106 TAKYGKELLYKQTAIARAKRAPYPSRMPTGSGNSFANLNEWHYFPGEQ-NADSTTP 160 (166) T ss_pred HHHHHHHHHHHHHHhhhhhcccCCCCCCcCCCccchhccccccccccc-ccCCCCC Confidence 99999999988877776665521 11111 1233333211100 0111111 No 34 >protein:vir:2108 Length: 166 # NCBI annotation: head completion protein # Family: family:all:1693 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059632;genbank:gi:9635540;genbank:GeneID:1262828 Probab=62.05 E-value=0.34 Score=23.14 Aligned_cols=149 Identities=14% Similarity=0.087 Sum_probs=84.8 Q ss_pred CCCHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCCCCCCccccccCcccc Q lcl|Aclame:pro 1 MASQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFSYQYRLPTDF 80 (207) Q Consensus 1 MaS~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~p~~~~~yaY~lP~Dc 80 (207) |.|+=+|.=-||.++|-.+-+++.|--++.-. +-+++--+-++.|--+. ..=.-||.|+ T Consensus 3 m~TkgdiVl~AlrklgVas~atltDvEpqs~~-----dgi~dLe~MMaeW~~~a---------~gI~lGY~fa------- 61 (166) T protein:vir:21 3 IKTKGDLVRAALRKLGVASDATLTDVEPQSMQ-----DAVDDLEAMMAEWYQDG---------KGIITGYVFS------- 61 (166) T ss_pred cchHHHHHHHHHHHhccccccccccCChHHHH-----HHHHHHHHHHHHHhhcC---------CCceecceec------- Confidence 88999999999999998766666654444322 34555555666773100 0001244222 Q ss_pred eEeeecccCccccccccccceEEeCCEEEecCCCceEEEEEeecCChhhccHHHHHHHHHHHHHHhhhh--ccCCHHHHH Q lcl|Aclame:pro 81 IRLLQVGQFDVYPRTDTRGLFSIENGNILTDMQAPLYIRYAKRVTDPNAMDALFREAFACRLAAEACES--LTQSATKRQ 158 (207) Q Consensus 81 lrv~~v~~~~~~~~~~~~~~y~v~g~~l~~~~~~~~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~p--Lt~~~~~~~ 158 (207) +.+. +.+-.|..-.|+.|.+|+.+.||.+||.- |--+++... T Consensus 62 ------------------------------~~~e------~p~pdD~~glp~~a~~av~~~LAvria~dygie~t~~v~~ 105 (166) T protein:vir:21 62 ------------------------------DDEN------PPAEGDDHGLRSSAVSAVFHNLACRIAPDYALEATAKIIA 105 (166) T ss_pred ------------------------------ccCC------CCCCccccCCCHhHHHHHHHHHHHHHHhhcCCCcCHHHHH Confidence 1111 22334566679999999999999999987 445777888 Q ss_pred HHHHHHHHHHHHHHhhhhhcCC-CCCCCC------chhhhhccCCCCCCCcccccC Q lcl|Aclame:pro 159 GAWAEHDQAIAAAIRVNAIERP-AQPLGD------DTWLESRNGVAFPGETPIIRS 207 (207) Q Consensus 159 ~l~~~~~~~l~~A~~~da~e~~-~~~~~~------~~~~~aR~~~~~~~~~~~~~~ 207 (207) .....|...+......-++-++ +..++. ..|...|..-+. +..+-+.. T Consensus 106 ~A~~aya~Lls~~~~a~~kR~~y~~dmP~G~Gn~~~~~~~~~y~~~~-~~~~~t~p 160 (166) T protein:vir:21 106 TAKYGKELLYKQTAISRAKRAPYPSRMPTGSGNSFANLNEWHYFPGE-QNADSTTP 160 (166) T ss_pred HHHHHHHHHHHHHhhhhhhccCCCCCCCcCCCccchhcccccccccc-ccCCCCCc Confidence 8888888888866555555331 111111 123333321111 01121211 No 35 >protein:vir:94601 Length: 211 # NCBI annotation: PfWMP4_36 # Family: family:all:12085 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762666;genbank:gi:115304374;genbank:GeneID:5142301 Probab=54.90 E-value=0.49 Score=22.27 Aligned_cols=178 Identities=15% Similarity=0.211 Sum_probs=93.7 Q ss_pred CC--CHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCCCCCCcc------- Q lcl|Aclame:pro 1 MA--SQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEAPLFGFS------- 71 (207) Q Consensus 1 Ma--S~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~p~~~~~------- 71 (207) |. |-+.-||.-|..+|+.++..|+- +-.+..+..|.++.+++-+-|.|.+-...+ |.-.|. T Consensus 1 M~~TsLL~A~NEVL~~vGE~~~~~~t~--~~G~k~r~~~~~AiR~V~slH~W~~L~~~v--------~AlSW~~~~D~A~ 70 (211) T protein:vir:94 1 MPTTSLLIACNEVLLNVGELEVADFTT--PVGKKARLAYNSAIRAVSSLHAWQHLQATV--------SALSWNVEGDIAT 70 (211) T ss_pred CCcchhHhhhHHHHhhccchhhhhhcc--chhHHHHHHHHHHHHHHHHHHHHHHHHhhc--------chheeccccceeh Confidence 76 57788999999999999998864 445677889999999999999998743221 222222 Q ss_pred -------ccccCcccceEeeec---ccCccccccccccceE--EeCCEEEecCCC----ceEEEE--EeecC----Chhh Q lcl|Aclame:pro 72 -------YQYRLPTDFIRLLQV---GQFDVYPRTDTRGLFS--IENGNILTDMQA----PLYIRY--AKRVT----DPNA 129 (207) Q Consensus 72 -------yaY~lP~Dclrv~~v---~~~~~~~~~~~~~~y~--v~g~~l~~~~~~----~~~l~Y--~~~v~----d~~~ 129 (207) |.-.+=.|.||-..- ...+..-......-|. .+.++++.+-.+ .-.|++ .-..+ +.+. T Consensus 71 L~~IQ~L~sVS~G~~~~rs~G~~ELYer~~~~a~T~T~l~Y~~~~~~~V~L~P~P~~a~~~~IkF~VL~~~T~PS~~t~~ 150 (211) T protein:vir:94 71 LTPIQELYSVSLGTDVLRSVGFDELYERDIRIAATATPLYYARAEQNSVLLYPTPSVADRPNIKFRVLLQPTVPSLPTDN 150 (211) T ss_pred hhhhhhhhhhhccchhhhhcchhhhhhcccceeeccchhheeeccCCeeEeccCcccccccceeEEEeeccccCCCCCCc Confidence 222233333332221 0111000000001121 234444432211 112232 11111 2222 Q ss_pred --ccHHHHHHHHHHHHHHhhhhccCCHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccCCCCCC Q lcl|Aclame:pro 130 --MDALFREAFACRLAAEACESLTQSATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGVAFPG 200 (207) Q Consensus 130 --~~~~F~~ala~~LAa~lA~pLt~~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~~~~~~ 200 (207) .|.-|...+.-+--.-++.--+.+.+-++.+..+|+..-.+-+.....|-- .- -||.|. T Consensus 151 FtLPd~~~~LV~~~A~~LM~~~H~~D~~~A~~~~~E~El~t~~~R~~q~~~~~-----------~~-~~~~~~ 211 (211) T protein:vir:94 151 FTLPDDFYDLVHIYAQMLMHRNHTTDLQAAQACQSEFELRTHMVRTRQTSQVV-----------GN-MGGYPT 211 (211) T ss_pred ccCchhHHHHHHHHHHHHHHHhhcchhhHHHHhhhHHHHHHHHHhcchhhhhh-----------hc-cCCCCC Confidence 477887766655555566666788899999999998765544433222210 01 122222 No 36 >protein:vir:95463 Length: 267 # NCBI annotation: hypothetical protein ORF040 # Family: family:all:7212 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294633;genbank:gi:149408199;genbank:GeneID:5237030 Probab=35.59 E-value=1.2 Score=20.11 Aligned_cols=197 Identities=15% Similarity=0.206 Sum_probs=99.8 Q ss_pred CC-CHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCCCCceeeeEecccCCCC-CCC---------- Q lcl|Aclame:pro 1 MA-SQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQLAALAEA-PLF---------- 68 (207) Q Consensus 1 Ma-S~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~W~FA~~r~~La~l~~~-p~~---------- 68 (207) |- |-++|.-.-|+..|.+.++|+. +|.|.......--.+...+.....|.--.+.-+|.|...+ -+. T Consensus 1 ~~~tll~iv~~~~~~~~sdev~s~~-dtie~~~~~~~~~~v~~~mi~~r~wp~~~~~lkl~p~~~~A~pth~~l~tpVk~ 79 (267) T protein:vir:95 1 MIKTLLDIVQDILSEMSSDEVNSIN-DTIESMQVAQIVKSVYMSMMSNRNWPHQRKLIQLEPSGDDAYPTHMKLQTPIKE 79 (267) T ss_pred ChhHHHHHHHHHHHhcccchhhhhh-hhHHHHHHHHHHHHHHHHHHhhcccchhhhheeeccccccccceeeeeccccce Confidence 76 8999999999999999999999 8888888887777888888888999766666666443221 110 Q ss_pred ----Ccc-------------ccccCcccceEeeecccCccccc--------------cccccceE--EeCCE-------- Q lcl|Aclame:pro 69 ----GFS-------------YQYRLPTDFIRLLQVGQFDVYPR--------------TDTRGLFS--IENGN-------- 107 (207) Q Consensus 69 ----~~~-------------yaY~lP~Dclrv~~v~~~~~~~~--------------~~~~~~y~--v~g~~-------- 107 (207) .|. -.|..|.++++.+.-+++...+. ......|. ..+.. T Consensus 80 i~fl~Y~~~kda~~~~~yRtlky~~PdeF~~~~~~rn~~~dn~~~~~d~sgveLlirnd~~P~YyTSFDd~tvVlDSYDa 159 (267) T protein:vir:95 80 MCFINYDCVKDGETRKRYRTMKWAEPDDFLRSISKRNNDQDNIDVIIDPSGVELLIRNDLAPTYYTSFDDTTLIFDSYDK 159 (267) T ss_pred eeeeeeeeeeccccceeeeeeeccChHHHHHhhhccCCCCCCceeEEccCceEEEeecCCCcceecccCCceeeeecccc Confidence 111 22677888777764322211100 00001111 01111 Q ss_pred ----EEecCCCc--eEEEEEeecCChhhccHHHHHHHHHHHHHHhhhh-----ccCCHHHHHHHHHHHHHHHHHHHhhhh Q lcl|Aclame:pro 108 ----ILTDMQAP--LYIRYAKRVTDPNAMDALFREAFACRLAAEACES-----LTQSATKRQGAWAEHDQAIAAAIRVNA 176 (207) Q Consensus 108 ----l~~~~~~~--~~l~Y~~~v~d~~~~~~~F~~ala~~LAa~lA~p-----Lt~~~~~~~~l~~~~~~~l~~A~~~da 176 (207) ++++.... ++.-|+.- .+-+--|+.=.+++++.|+-..+.. =+.++...+...+.+-...+.+-..|. T Consensus 160 svd~tl~~ak~~A~~~~~p~ff-~~D~fVp~Ipd~~f~~ll~EAks~Af~~fkq~anpkaEq~arRq~v~~~~k~~~vn~ 238 (267) T protein:vir:95 160 AVDDTLQKSKIQAMAYVMPVFF-MDDDFIPEIPDEARAALLEEAKSRAFITIKQMANQKAEQEAQRQQAWLSRKAWRVNG 238 (267) T ss_pred ccccccccccceeEEEEeeeee-cCCccCCCCcHHHHHHHHHHhhHHHhhhhhccCCchhHHHHHHHHHHHHhhhhhhcc Confidence 12111111 12222332 1111112222334444444333333 234566666666666666666655553 Q ss_pred hcCCC------CCCCCchhhhhccCCCCC Q lcl|Aclame:pro 177 IERPA------QPLGDDTWLESRNGVAFP 199 (207) Q Consensus 177 ~e~~~------~~~~~~~~~~aR~~~~~~ 199 (207) --... -....+++.+.-..---| T Consensus 239 G~~~~NYGRn~~~~~~~~~~~~~~~~~~~ 267 (267) T protein:vir:95 239 GIKYPNYGRNSMKYQSSPYFDKHNKKPTP 267 (267) T ss_pred CccccccCccccccccCccccccCCCCCC Confidence 32211 111234455444433333 No 37 >protein:vir:95880 Length: 236 # NCBI annotation: 30 kDa protein # Family: family:all:31944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950545;genbank:gi:119952236;genbank:GeneID:5075707 Probab=31.15 E-value=1.5 Score=19.59 Aligned_cols=194 Identities=16% Similarity=0.123 Sum_probs=87.3 Q ss_pred CC-CHHHHHHHHHHHhcchhccccccCCHHHHHHHHhhHHHHHHHHhcCC-CCcee--eeEecc------cC-------C Q lcl|Aclame:pro 1 MA-SQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHV-WSFTK--ARAQLA------AL-------A 63 (207) Q Consensus 1 Ma-S~v~IcN~AL~~lG~~~I~Sl~e~s~~A~~c~~~Y~~~rd~~L~~~~-W~FA~--~r~~La------~l-------~ 63 (207) |- -+--||-+|+..|.+-.+..-|-+--+-..--.+-..+-+.+|+-|- .+.-. -++.+- +| . T Consensus 1 ~~~lkev~~~La~gqL~N~~~V~~d~g~i~~~~~p~ii~a~N~gl~~Lh~Rf~lk~~~i~vem~eg~~~Y~L~~~y~v~~ 80 (236) T protein:vir:95 1 MYYIEELFCRLANGVLNNTGIVTDDRGDIEDDSKPFIIVAANEALTRLHGRFNMRNNNVVVEMQEGRTNYPLLAKYAVQS 80 (236) T ss_pred CchHHHHHHHHhcceecceeeeecccccccccccchHHHHHhHHHHHHhhhhhhccCcEEEEEeeCceecccchhhhhcc Confidence 44 23346777766555433333222111111112222333445555442 21111 111111 11 0 Q ss_pred CCCCCC------ccccccCcccceEeeecccCcc----cc-ccccccceEEeCCEEEecCCC---ceEEEEEeecC---- Q lcl|Aclame:pro 64 EAPLFG------FSYQYRLPTDFIRLLQVGQFDV----YP-RTDTRGLFSIENGNILTDMQA---PLYIRYAKRVT---- 125 (207) Q Consensus 64 ~~p~~~------~~yaY~lP~Dclrv~~v~~~~~----~~-~~~~~~~y~v~g~~l~~~~~~---~~~l~Y~~~v~---- 125 (207) ..|-.| -..-=.-|.|.|||.+|..... .+ ...+.-.|.=+.+.|-.+... -+.++|-++-+ T Consensus 81 ~~p~~~~~~fI~d~~~~~~~~~ilri~~V~dd~G~~~~Lnd~~~~~sv~~P~~nvLqi~~~~~~~~l~vkyq~~~~~l~~ 160 (236) T protein:vir:95 81 YDPNEVKCPFIMDLAGEKFAEDVIRILEVYDDKGRRRPLNDRNNPCSLFTPRPNVLQNNAPKAWEVLNVMYQAKHPKLST 160 (236) T ss_pred CCCCCcccchhhccccchhHHHHHHHHhhccCCCcccccCCCCCCceeeeCCCcceeeecCCCcceEEEEeecCCCceee Confidence 111110 0011123678899998863211 11 001101111112222222222 25677766532 Q ss_pred -----ChhhccHHHHHHHHHHHHHHhhhhcc--CCHHHHHHHHHHHHHHHHHHHhhhhhcCCCCCCCCchhhhhccCCC Q lcl|Aclame:pro 126 -----DPNAMDALFREAFACRLAAEACESLT--QSATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGVA 197 (207) Q Consensus 126 -----d~~~~~~~F~~ala~~LAa~lA~pLt--~~~~~~~~l~~~~~~~l~~A~~~da~e~~~~~~~~~~~~~aR~~~~ 197 (207) .+-..|-+..+||.+..|++.--|+- +|..+.....+.|+..... .+++..+..+.-+..-.-.|+||- T Consensus 161 ~eD~~~~idlP~t~~~aL~~yVA~r~~T~ig~~EnTAk~~~y~~~Yes~c~~---v~~~~l~s~~~v~~~~~f~r~Gw~ 236 (236) T protein:vir:95 161 AEDGYNEIDIPDTLDPALDAYIAYRYYTSLNTPESSAKAAEYLSFYDSICRE---VVEYDLTSDTEVDTNTLFRKRGWR 236 (236) T ss_pred eeCCcccccCCcchHHHHHHHHHHHhhccCCCcccchhhhhHHHHHHHHHhh---HHhhccccccccccccccccCCCC Confidence 23346889999999999999988886 4667777888888887774 344433333211111111234444 Done!