Query lcl|NC_011107.1_cdsid_YP_002117819.1 [gene=PT2_gp40] [protein=tail tubular protein B] [protein_id=YP_002117819.1] [location=26204..28684] Match_columns 826 No_of_seqs 164 out of 198 Neff 8.2 Searched_HMMs 1612 Date Thu Nov 7 13:01:24 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_40 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_40_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:6326 Length: 826 # 100.0 7E-260 4E-263 1441.5 91.1 826 1-826 1-826 (826) 2 protein:vir:78957 Length: 826 100.0 3E-255 2E-258 1416.2 90.0 823 1-826 1-826 (826) 3 protein:vir:10452 Length: 794 100.0 1E-227 7E-231 1264.8 82.5 764 1-826 1-794 (794) 4 protein:vir:80253 Length: 777 100.0 9E-226 6E-229 1254.3 83.4 774 1-826 1-777 (777) 5 protein:vir:3366 Length: 801 # 100.0 3E-223 2E-226 1240.5 83.1 767 1-826 1-801 (801) 6 protein:vir:1543 Length: 801 # 100.0 2E-223 1E-226 1241.1 82.3 770 1-826 1-801 (801) 7 protein:vir:2203 Length: 794 # 100.0 2E-222 1E-225 1236.1 84.8 771 1-826 1-794 (794) 8 protein:vir:97014 Length: 800 100.0 1E-221 8E-225 1231.5 81.0 770 2-826 1-800 (800) 9 protein:vir:99677 Length: 794 100.0 5E-221 3E-224 1228.5 83.2 763 1-826 1-794 (794) 10 protein:vir:8887 Length: 808 # 100.0 2E-220 1E-223 1225.3 83.5 774 1-826 1-808 (808) 11 protein:vir:94583 Length: 792 100.0 1E-220 7E-224 1226.4 82.2 766 1-826 1-792 (792) 12 protein:vir:94713 Length: 785 100.0 3E-218 2E-221 1212.9 81.5 760 1-826 1-785 (785) 13 protein:vir:105647 Length: 800 100.0 4E-216 3E-219 1201.4 80.1 769 2-826 1-800 (800) 14 protein:vir:7021 Length: 803 # 100.0 5E-215 3E-218 1195.7 82.3 759 2-826 1-803 (803) 15 protein:vir:100022 Length: 976 100.0 1E-212 8E-216 1182.2 82.6 800 1-826 1-976 (976) 16 protein:vir:103341 Length: 806 100.0 1E-212 7E-216 1182.7 80.9 765 2-826 1-806 (806) 17 protein:vir:78703 Length: 905 100.0 2E-212 1E-215 1181.2 79.9 806 1-826 1-905 (905) 18 protein:vir:103790 Length: 768 100.0 9E-169 6E-172 941.8 72.0 723 1-823 1-768 (768) 19 protein:vir:1778 Length: 680 # 100.0 4E-162 2E-165 905.4 57.8 575 1-592 1-680 (680) 20 protein:vir:95324 Length: 823 100.0 3E-154 2E-157 861.9 66.0 707 1-822 1-823 (823) 21 protein:vir:107802 Length: 681 100.0 3E-150 2E-153 840.1 67.2 657 1-821 1-681 (681) 22 protein:vir:107423 Length: 681 100.0 3E-150 2E-153 840.1 67.2 657 1-821 1-681 (681) 23 protein:vir:98487 Length: 681 100.0 3E-150 2E-153 840.1 67.2 657 1-821 1-681 (681) 24 protein:vir:7329 Length: 825 # 100.0 3E-149 2E-152 834.5 65.3 708 1-822 1-825 (825) 25 protein:vir:102644 Length: 594 100.0 3E-131 2E-134 735.9 58.7 554 1-822 1-594 (594) 26 protein:vir:94602 Length: 1012 99.5 3.1E-12 1.9E-15 83.7 36.6 757 1-826 1-1010(1012) 27 protein:vir:80177 Length: 1027 99.3 3.1E-10 1.9E-13 72.7 30.9 741 1-826 1-898 (1027) 28 protein:vir:2625 Length: 715 # 99.1 2.1E-09 1.3E-12 68.2 40.9 631 1-823 1-715 (715) 29 protein:vir:8837 Length: 513 # 98.0 7.1E-06 4.4E-09 48.8 35.6 475 208-825 1-513 (513) 30 protein:vir:95475 Length: 771 97.1 0.00016 9.8E-08 41.4 41.5 660 1-823 1-771 (771) 31 protein:vir:3133 Length: 911 # 95.2 0.0026 1.6E-06 34.8 36.1 691 1-826 1-839 (911) 32 protein:vir:105563 Length: 396 87.4 0.039 2.4E-05 28.3 18.4 368 1-573 1-396 (396) 33 protein:vir:108312 Length: 458 67.9 0.25 0.00015 23.9 36.9 446 149-818 1-458 (458) No 1 >protein:vir:6326 Length: 826 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877473;genbank:gi:33300845;uniprot:Q7Y2D1;genbank:GeneID:1482615 Probab=100.00 E-value=6.6e-260 Score=1441.49 Aligned_cols=826 Identities=100% Similarity=1.514 Sum_probs=798.3 Q ss_pred CCceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCCCceEEEE Q lcl|NC_011107. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~~e~~~~~ 80 (826) ||+|+|+||||++|||||||++|||+||++|+||+|+|+|||+||||++||++++++++..++||+|+++||+.||+|++ T Consensus 1 M~~i~~s~~n~~~GvSqq~d~~r~~~q~~~~~N~~~~~~~G~~rRpg~~~v~~~~~~~~~~~~~~~~~~~r~~~~~~~~~ 80 (826) T protein:vir:63 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) T ss_pred CceeeeecchhhcceeccCchHhhhhhhhhhhcceeeccCCcccCchhHhhhhhccCCccccccEEEEEecCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999999988889999999999999999999 Q ss_pred EecCCeEEEEEcCCCEEEEecCccccccccCCccceEEEEEcCEEEEEeCcccCcccccccCCCCCCccEEEEEcccccC Q lcl|NC_011107. 81 AQHRGELYLFDERDGRLLMGQPLVHDYLKANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAGWLYIKAGQYS 160 (826) Q Consensus 81 ~~~~g~i~v~~~~~g~~~~~~~~~~~y~~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~~~v~~g~y~ 160 (826) ++++|+||||++++|+++.+++..++|++++++++|+|+|+||||||||++++|++..+.+++++++.|+++++++|+|+ T Consensus 81 ~~~~g~irv~~~~~g~~~~~~~~~~~y~~~~~~~~l~~~t~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~v~~g~Y~ 160 (826) T protein:vir:63 81 AQHRGELYLFDERDGRLLMGQPLVHDYLKANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAGWLYIKAGQYS 160 (826) T ss_pred EecCCcEEEEEcCCCeEEEcCCCCCceeeecCccceEEEEeCCEEEEEeCCeeeeeccccccccCCCCcEEEEeeccccC Confidence 99999999999999999988888889999999999999999999999999999999888889999999999999999999 Q ss_pred ceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeeccceEEEeecccccceeeccccc Q lcl|NC_011107. 161 KAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDANA 240 (826) Q Consensus 161 ~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga~~~t~~~~~~~~~~~a~~~~~ 240 (826) ++|+|+|++.+.+.++++..+++|+++++.++..+........+.+|++.++...+.+...|+.....+....++++..+ T Consensus 161 ~~y~vti~~~~~~~gt~~s~t~t~~t~~~~~a~~~~~~~~~~~s~~yia~~l~~~~~a~~~~~~~~~t~~~~~~~~~~~a 240 (826) T protein:vir:63 161 KAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDANA 240 (826) T ss_pred ceEEEEEEeccccCCccccceEEEEeccCCcccccccccceeeeeeeeeeeceeeeeeccccccCCCccccceecCCccc Confidence 99999999999999999999999999999988888888888889999999999998888899999999898999999888 Q ss_pred eeecccccccccccceEEEecCCceEEEEecCCCCcceEEEEEEeecccccccccccCcccccEeeeeeeeeEeccCCCC Q lcl|NC_011107. 241 ATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVMATGSTK 320 (826) Q Consensus 241 ~t~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (826) .+.++..+.....++++++....+..+..+.++|+++.++++.+.++++++||+.+|+.+..++++.+.+++++++++++ T Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~~p~~~~~~~~~~~~~~~~~~~g~~~ 320 (826) T protein:vir:63 241 ATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVMATGSTK 320 (826) T ss_pred ceeecceeEecccccEEEEeeCCcccEEEccCCCCcceEEEEEeeccceeeccccCCCcccceEEEeeEEeEEecCCCcc Confidence 88889999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEEcCCceEEEeecccccccccceeEEEEEecCCCeEEEeccCcCccccCCccccCCccccCCCccEEEEEcceE Q lcl|NC_011107. 321 APVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRL 400 (826) Q Consensus 321 ~~~y~~~~~~~~~w~e~~~~g~~~~~~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~~~q~RL 400 (826) +.||++|++.+++|+||++||+.++++||||.|++++++++|++++++|++|.+||+++||+|+|+|++|++|+|||||| T Consensus 321 d~~y~~~~~~~~~w~e~~~~~~~~~~~tmp~~l~~~~~~~~f~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL 400 (826) T protein:vir:63 321 APVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRL 400 (826) T ss_pred cceEEEEEcCCceEEEEeecCcccccccceEEEEEeccCCeEEEeccccccccccccccCCCccccCCCceEEEEEeceE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEE Q lcl|NC_011107. 401 VLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVIS 480 (826) Q Consensus 401 ~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~ 480 (826) +|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+|+++++|+|||+++||+|+++++|||+|++++ T Consensus 401 ~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~~~q~~ls~~~~lTP~~~~i~ 480 (826) T protein:vir:63 401 VLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVIS 480 (826) T ss_pred EEeeCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEEEEEcCCCCEEEEEEc Q lcl|NC_011107. 481 ITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAASSGYLVFGTS 560 (826) Q Consensus 481 ~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~s~~p~~~v~~~~ 560 (826) ++|+|+|+++++|+.+|++++|++++|+++++||||.|++|+++.|+++|||+|++|||+++|.+|++|++|++++|+++ T Consensus 481 ~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~~~s~v~e~~~~~d~~~~y~~~dlt~~~~~l~~~~v~~~a~s~~~~~v~~~~~ 560 (826) T protein:vir:63 481 ITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAASSGYLVFGTS 560 (826) T ss_pred EEEeecccCCCCceEeCCeEEEEecCCCceeEEEEEEeeeccccceehhHHHHHHHHhcCCCeEEEEEcCCCCEEEEEEc Confidence 99999999999999999999999999988999999999999999999999999999999999999999999999999999 Q ss_pred CCCeEEEEEEeecCCceeeeeeEeeecCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEeecCcccCCCcccccccceEEE Q lcl|NC_011107. 561 TADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEA 640 (826) Q Consensus 561 ~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~~~~~d~~~~~~~ 640 (826) +||+|++|+|||+++||+|+|||||+|+|+|++||+++|+||++|+|++++++|||++|+++.+...+++.+|....+++ T Consensus 561 ~dg~l~~~~y~~~~~e~~v~aW~~~~~~g~v~~~~~i~d~l~~iv~r~~~~~~~r~~~e~~~~~~~~~~~~~d~~~~~d~ 640 (826) T protein:vir:63 561 TADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEA 640 (826) T ss_pred CCCEEEEEEEeeCCCcEEEEeEEEEecCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEEecCCccccccCCccceEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999899999999999999 Q ss_pred eecccceeccceeeccCCcccceeeEEecCceeeeeecccceecCCceEEEecCCCCCceEEEeeeeeEEEEeCCeeEec Q lcl|NC_011107. 641 TVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKRETNTKVFLDVPEAVVGAVYVVGCEFWSKVEFTPPVLRD 720 (826) Q Consensus 641 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~v~vGl~y~~~~~~~~~~i~~ 720 (826) .+.+...+....++.+.++..+.++.+++|+.+....++..++.++..+|++|++..+.+|+|||+|+++++|+||++++ T Consensus 641 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~l~~~~~~~~~~v~VGl~y~s~~~~~~~~~~~ 720 (826) T protein:vir:63 641 TVAGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKRETNTKVFLDVPEAVVGAVYVVGCEFWSKVEFTPPVLRD 720 (826) T ss_pred eeeeeeccCcceeecccCcccccEEEEeeCccccCCccceEEecCCEEEEecCCCccccEEEEeeeeeEEEEecceEEEc Confidence 88888888887777777888889999999999988888888888999999999999999999999999999999999999 Q ss_pred CCCCceeecceEEEEEEEEeeccceEEEEecCCCCccceeeeccCccccccccccCccccceeEEEEEecccCceeEEEE Q lcl|NC_011107. 721 HNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLVDSAVVPLPARVDMATSKFEL 800 (826) Q Consensus 721 ~~g~~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i 800 (826) ++|++++.||+||+|++|+|.+||+|.+.|+++.++..+.+.+++.++++.++..|.|++.++++++|+.+++++.+|+| T Consensus 721 ~~g~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~p~~~t~~~~vP~~~~~~~~~i~i 800 (826) T protein:vir:63 721 HNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLVDSAVVPLPARVDMATSKFEL 800 (826) T ss_pred cCCCcceeccEEEEEEEEEeeccccEEEEecCccccceeEeecCCceecccccccccccccceEEEEEEeeccceEEEEE Confidence 99999999999999999999999999999999999988889999999999999999999999999999999999999999 Q ss_pred EECCCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 801 SCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 801 ~~~~P~P~tvl~i~weg~y~~r~rrv 826 (826) +++.|+||+|++|+|||+||+|+||| T Consensus 801 ~~d~P~p~~il~i~~~~~yn~r~rrv 826 (826) T protein:vir:63 801 SCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) T ss_pred EeCCCCcEEEEEEEEEEEEeceeecC Confidence 99999999999999999999999999 No 2 >protein:vir:78957 Length: 826 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522826;genbank:gi:158345061;genbank:GeneID:5687447 Probab=100.00 E-value=2.8e-255 Score=1416.16 Aligned_cols=823 Identities=94% Similarity=1.445 Sum_probs=779.8 Q ss_pred CCceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCCCceEEEE Q lcl|NC_011107. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~~e~~~~~ 80 (826) ||+|+|+||||+||||||||++|||+||++|+||+|+|+|||+||||++||++++++++..++||+|+++||+.||+|++ T Consensus 1 M~~i~~~~~nl~gGvSqq~d~~r~~~q~~~~~N~~~~~~gG~~rRpgt~~va~~~~~~~~~~~~f~~~~~r~s~e~~~~l 80 (826) T protein:vir:78 1 MSYKQSAYPNLLMGVSQQVAFERLPGQLSEQINMVSDPVSGLRRRSGIELMASLLHTDQPWPRPYLYHTNLGGRSIAMLV 80 (826) T ss_pred CcceeeecchhccceecccchHhhhhhhhhhhcceeccccccccCCchHhhhhhccCCcCCceeEEEEeccCCcceEEEE Confidence 99999999999999999999999999999999999999999999999999999999988889999999999999999999 Q ss_pred EecCCeEEEEEcCCCEEEEecCccccccccCCccceEEEEEcCEEEEEeCcccCcccccccCCCCCCccEEEEEcccccC Q lcl|NC_011107. 81 AQHRGELYLFDERDGRLLMGQPLVHDYLKANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAGWLYIKAGQYS 160 (826) Q Consensus 81 ~~~~g~i~v~~~~~g~~~~~~~~~~~y~~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~~~v~~g~y~ 160 (826) ++++|+||||++.+|+++..++...+|+++.+.++|+|+|+||||||||++++|++..+.....+++.++++++|+|+|+ T Consensus 81 ~~~~g~irv~~~~~g~~~~~~~~~~~y~~~~~~~~l~~~t~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~v~~g~y~ 160 (826) T protein:vir:78 81 AQHRGELYLFDEKDGRLLMGQPLVHDYLKASDYRQLRAATVADDLFIANLEVRPEADKADVLGVDPSKTGWLYIKAGQYS 160 (826) T ss_pred EEcCCcEEEEECCCCEEEEecCcccceeecCCcceeEEEEEcCEEEEEcCcEeeeeccccccCCCCCceEEEEecccccC Confidence 99999999999999999998887788888888889999999999999999999998887777888999999999999999 Q ss_pred ceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeeccceEEEeecccccceeeccccc Q lcl|NC_011107. 161 KAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDANA 240 (826) Q Consensus 161 ~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga~~~t~~~~~~~~~~~a~~~~~ 240 (826) |+|+|+|++.+.+.+++++.+++|++|+++.+......+..+.+..|+++++...+.....|......+....+.+.... T Consensus 161 ~~y~v~i~~~~~~~~~~~s~t~~y~t~~~~~~~~~~~~~~~~~~~~~~a~~l~~~~~~~~~~~~~~~t~~~~~~~~~~~~ 240 (826) T protein:vir:78 161 KAFSLTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLFGKFFGAPEYTLPNSTKKYPKVDPDPAA 240 (826) T ss_pred ceeEEEeccceeecccccceeEEEEeccCCccccccccccceecchhhheecceeeccccceeeeccceeEeeccccccc Confidence 99999999999999999999999999999999888888888999999999999888888899999988888888888888 Q ss_pred eeecccccccccccceEEEecCCceEEEEecCCCCcceEEEEEEeecccccccccccCcccccEeeeeeeeeEeccCCCC Q lcl|NC_011107. 241 ATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVMATGSTK 320 (826) Q Consensus 241 ~t~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (826) ...+++.......++++++.+..+..+..++++|+++.+.++.+.++++++||+.+|+.+.+++.++++++++++++++. T Consensus 241 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~a~~p~~~~~~~~~~~~~~~~~~~g~~~ 320 (826) T protein:vir:78 241 ATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAIMATGSTK 320 (826) T ss_pred eeeccceeecccccceEEEecCCCeEEEeccCCCccceEEEeeEEEecccceeeeecccccceEEEEEEeeeEecCCCcc Confidence 88888888888889999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEEcCCceEEEeecccccccccceeEEEEEecCCCeEEEeccCcCccccCCccccCCccccCCCccEEEEEcceE Q lcl|NC_011107. 321 APVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRL 400 (826) Q Consensus 321 ~~~y~~~~~~~~~w~e~~~~g~~~~~~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~~~q~RL 400 (826) ++||++|++.+++|+||++||+.+++++|||.++.++++++|++++.+|++|.+||+++||+|+|+|++|++|+|||||| T Consensus 321 ~~~y~~~~~~~~~w~e~a~~g~~~~~~tmp~~l~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~i~~v~f~q~RL 400 (826) T protein:vir:78 321 APVYFAWDAANRRWAERAAYGTDWVLKKMPLALRWDESTDTYSLNELEYDRRGSGDEETNPTFNFVKRGITGMTTFQGRL 400 (826) T ss_pred cceeEEEEcCCceEEEeeccCcccccccccEEEEEecCCCeEEEeeccccccccCcccccCcccccCCCceEEEEEeceE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEE Q lcl|NC_011107. 401 VLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVIS 480 (826) Q Consensus 401 ~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~ 480 (826) +|++|++|||||+||||||++++++++.|||||+++++++++|+|+|+|+++++|+|||+++||+|+++++|||+|++++ T Consensus 401 ~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~~s~~~~~i~~~v~~~~~L~l~T~~~e~~l~~~~~lTP~~~~~~ 480 (826) T protein:vir:78 401 VLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVIS 480 (826) T ss_pred EEeeCCeEEEEeccCccccccccccCCCCCCcEEEEEccCcceeEEEEEecCCcEEEEecCcEEEEeCCCcccceeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEEEEEcCCCCEEEEEEc Q lcl|NC_011107. 481 ITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAASSGYLVFGTS 560 (826) Q Consensus 481 ~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~s~~p~~~v~~~~ 560 (826) ++|+|+|+++++|+.+|+++||+++||+.+++||||.|++|++++|+++|||+|++|||+++|.+|++|++|++++|+++ T Consensus 481 ~~s~~~~~~~~~Pv~vG~~v~F~~~r~~~~s~v~e~~~~~~~~~~y~~~dlt~~~~~l~~~~v~~~a~s~~~~~~v~~~~ 560 (826) T protein:vir:78 481 ITTQYDVDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAASSGYLVFGTS 560 (826) T ss_pred EEEeecccCCCCceEeCCeEEEEecCCCceeEEEEEEeeecccCccchHHHHHHHHHhcCCCeEEEEEeCCCCeEEEEEc Confidence 99999999999999999999999999988999999999999999999999999999999999999999999999999999 Q ss_pred CCCeEEEEEEeecCCceeeeeeEeeecCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEeecCcccCCCcccccccceEEE Q lcl|NC_011107. 561 TADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEA 640 (826) Q Consensus 561 ~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~~~~~d~~~~~~~ 640 (826) +||+|++|||||+++||+|+|||||+|+|+|++||+++|+||++|+|++++++|||.+++++.++..+.+.+|+.+..++ T Consensus 561 ~~g~l~~~ty~~~~~e~~v~aW~~~~~~g~v~~v~~i~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~ 640 (826) T protein:vir:78 561 AADEMICHQYLWQGNEKVQNAYHRWTLRHQIIGAYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEA 640 (826) T ss_pred CCCeEEEEEEEecCCcEEEEeEEEEccCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEEecCCCccccccccceeEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999888888 Q ss_pred eecccceeccceeeccCCcccceeeEEecCceeeeeec---ccceecCCceEEEecCCCCCceEEEeeeeeEEEEeCCee Q lcl|NC_011107. 641 TVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHL---GVKRETNTKVFLDVPEAVVGAVYVVGCEFWSKVEFTPPV 717 (826) Q Consensus 641 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~---g~~~~~~~~~~~~~~~~~~~~~v~vGl~y~~~~~~~~~~ 717 (826) .+++..++.+...+...+.. .+.+++|..+..... +...+..+..++.+|++..+++|+|||+|+++++|+||+ T Consensus 641 ~~~~~~~~~~~~~~~~~~~~---~~~~~~g~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~VGl~y~s~~~~~~~~ 717 (826) T protein:vir:78 641 TVDGELELTKQHWDLIKDGA---AVYQLQPQVGAYMERYQLGVKRETSTKVFLDVPEAVVGSVYVVGCEFWSKVEFTPPV 717 (826) T ss_pred EEcceeccccceeEEecCCc---eeeeeccceeeeccccceeccccCCCceEEEeCCCccccEEEEeeceeEEEEeCceE Confidence 88888888887766554433 355666655443333 333455667789999999999999999999999999999 Q ss_pred EecCCCCceeecceEEEEEEEEeeccceEEEEecCCCCccceeeeccCccccccccccCccccceeEEEEEecccCceeE Q lcl|NC_011107. 718 LRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLVDSAVVPLPARVDMATSK 797 (826) Q Consensus 718 i~~~~g~~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~ 797 (826) +++++|++++.+|+||+|++|+|.+||.|.+.|+++.++....+.++++++.++++..+.|+..++++++|+.+++++.+ T Consensus 718 ~~~~~g~~~~~~r~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~t~~v~vp~~~~~~~~~ 797 (826) T protein:vir:78 718 LRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLSSRQLNAGEPLVDSAVVPLPARVDMATSK 797 (826) T ss_pred EecCCCcceeecceEEEEEEEEeeccccEEEEeCCCccCcceeeeecccccccccccCCcccccceEEEEeeeccCceEE Confidence 99999999999999999999999999999999999999888888999999999999999998899999999999999999 Q ss_pred EEEEECCCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 798 FELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 798 v~i~~~~P~P~tvl~i~weg~y~~r~rrv 826 (826) |+|+++.|+||+|++|+|||+||+|+||| T Consensus 798 i~i~~d~P~P~tvlai~~~~~y~~r~rrv 826 (826) T protein:vir:78 798 FELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) T ss_pred EEEEeCCCCcEEEEEEeEEEEecceeecC Confidence 99999999999999999999999999999 No 3 >protein:vir:10452 Length: 794 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848299;genbank:gi:30387490;genbank:GeneID:1733952 Probab=100.00 E-value=1.1e-227 Score=1264.78 Aligned_cols=764 Identities=23% Similarity=0.326 Sum_probs=655.2 Q ss_pred CCceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCCCceEEEE Q lcl|NC_011107. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~~e~~~~~ 80 (826) ||+|+|+||||++|||||||++|||+||++|+||+|+|+|||+||||++||+++++.+.....+++|+++||+.||||++ T Consensus 1 M~~i~~s~~n~~~GvSqq~D~~Ry~~q~~~~~N~~~~~~gG~~rRpGt~fv~~l~~~~~~~~~~~~~~~~rd~~e~~~v~ 80 (794) T protein:vir:10 1 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLKTLGDNGALGQAPYIHLINRDENEQYYAV 80 (794) T ss_pred CcceeeecchhhcccccCCchHHhhhhHhhhhcceeeeccCcccCcchhhheeccCCCccccceeeeEEecCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999999887778899999999999999999 Q ss_pred EecCCeEEEEEcCCCEEEEecCcccccc-ccCCccceEEEEEcCEEEEEeCcccCccccc--ccCCCCCCccEEEEEccc Q lcl|NC_011107. 81 AQHRGELYLFDERDGRLLMGQPLVHDYL-KANDYRQLRAATVADDLFIANLSVKPEADRT--DIKGVDPNKAGWLYIKAG 157 (826) Q Consensus 81 ~~~~g~i~v~~~~~g~~~~~~~~~~~y~-~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~--~~~~~~~~~~a~~~v~~g 157 (826) ++++ +||||+++|.+..+..+...+|+ +++++++|+|+|+||+|||||++++|++..+ ....+++..|+++++|+| T Consensus 81 ~~~~-~irv~~~~G~~~~v~~~~~~~Y~~aa~~~~~l~~~q~aD~~fivn~~~~~~~~~~~~~~~~~~~~~~~~~~v~~g 159 (794) T protein:vir:10 81 FTGT-GIRVFDLAGNEKQVRYPNGSNYIKTANPRSDLRMVTVADYTFIVNRNVVVQKDPNSVNLANYNPKQDGLINIRGG 159 (794) T ss_pred EeCC-eEEEEEcCCcEEEEEcCCCCcceecCCCcceEEEEEEcCEEEEEcCCeeeeeeccccccCCCCCCccEEEEeccc Confidence 8877 59999998877776666667887 4578889999999999999999999998654 344577888999999999 Q ss_pred ccCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeecc-ceEEEeecccccceeec Q lcl|NC_011107. 158 QYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGA-PEYTLPNSTKKYPKVDP 236 (826) Q Consensus 158 ~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga-~~~t~~~~~~~~~~~a~ 236 (826) +|+|+|+++|++.+ .+++++|+++. +......+.++++.+|..++... .+|+. T Consensus 160 ~y~r~y~i~i~~~~---------~at~~tpdgt~-----~~~~~~~s~~~ia~~L~~~l~a~~~g~t~------------ 213 (794) T protein:vir:10 160 QYGRELIVHINGKD---------VATYKIPDGSK-----PEHVNNTDAQWLAERLAKQMRINLSGWTV------------ 213 (794) T ss_pred ccceEEEeccCCcc---------eeEEEecCCCC-----cccceecchhhhhhhhhhhhhcccCCceE------------ Confidence 99999999998764 36788887753 45556777888888887654322 22222 Q ss_pred cccceeecccccccccccceEEEecCCceE--EEEecCCCCcceEEEEEEeecccccccccccCcccccEeeeeeeeeEe Q lcl|NC_011107. 237 DANAATIAGYLNQRGVQDGYIAFRGDADIH--VEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVM 314 (826) Q Consensus 237 ~~~~~t~a~~~~~~~~~~g~~~~~~~~~~~--~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 314 (826) ...++++++.+.+... ...+.+++.+..+.++.+.++.+++||+.+|.+ +.+++.+ T Consensus 214 --------------~~~g~~i~i~a~s~~~~~t~s~~~~~~~~~~~~v~~~~~~~~~lp~~~~~G--------~~v~i~~ 271 (794) T protein:vir:10 214 --------------NVGQGFIHVTAPSGQQIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNG--------YMVKIVG 271 (794) T ss_pred --------------EeCCeEEEEEeccCceeccccccCCcCcceeEEEEeccCcceecccCCCCC--------cEEEEEe Confidence 2234456665544433 234556677889999999999999999988842 4566677 Q ss_pred ccCCCCcceEEEEEcCCceEEEeecccccccc--cceeEEEEEecCCCeEEEeccCcCccccCCccccCCccccCCCccE Q lcl|NC_011107. 315 ATGSTKAPVYFEWDSANRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTRGITG 392 (826) Q Consensus 315 ~~~~~~~~~y~~~~~~~~~w~e~~~~g~~~~~--~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~ 392 (826) ..++..+.||++|+..+++|+||++||+..++ +||||.++ ++++++|+++.++|++|.+||+++||+|+|+|++|++ T Consensus 272 ~~~~~~~~yyv~~~~~~~~w~E~~~~g~~~~~~~~tmP~~l~-r~~~~t~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~ 350 (794) T protein:vir:10 272 DASKSADQYYVRYDAERKVWTETLGWNTENQVLLETMPHALV-RAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSIND 350 (794) T ss_pred CCCCCcceeEEEEEcCCcEEEEecccceeEEEecccceeEEE-EeccceEEeeecccccccccccccCccCcccCCCccE Confidence 77888899999999999999999999986665 69999998 7799999999999999999999999999999999999 Q ss_pred EEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccc Q lcl|NC_011107. 393 MTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIV 472 (826) Q Consensus 393 v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~l 472 (826) |+||||||+|++|++|||||+||||||+++|++++.|||||+++++++++++|+|+|+++++|+|||+++||+|+++++| T Consensus 351 v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~i~T~~~q~~l~~~~~l 430 (794) T protein:vir:10 351 VFFFRNRLGFLSGENIILSRTAKYFNFYPASIANLSNDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTL 430 (794) T ss_pred EEEEcceEEEeeCCeEEEEecCCcccccccccccCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEe-eccccccccchhHHHHHHHHhcCCCeEEEEEc-C Q lcl|NC_011107. 473 TPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMA-PSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAA-A 550 (826) Q Consensus 473 TP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~-~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~s-~ 550 (826) ||+|++++++|+|+|++.++|+.+|++++|++++| ++++++||+ |+ +..+.|+++|||+|++|||++++..++++ + T Consensus 431 TP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~~~r~~~~~-~~~d~y~a~Dlt~~~~hl~~~~v~~~~~~~~ 508 (794) T protein:vir:10 431 TSRSVELNLTTQFDVQDRARPYGIGRNVYFASPRS-SYTSIHRYYAVQ-DVSSVKNSEDITSHVPNYIPNGVFSICGSGT 508 (794) T ss_pred cceeEEEEEEEeecccCCCCceEeCCeEEEEecCC-CeeEEEEEeeec-cccCceehhhHHHHHHHhcCCceEEEEEeCC Confidence 99999999999999999999999999999999998 577776654 55 45556999999999999999998877665 5 Q ss_pred CCCEEEEEEcCCCeEEEEEEeecCCceeeeeeEeeecCCcEEEEEEE--CCeEEEEEEeCCCEEEEEEEEeecCcccCCC Q lcl|NC_011107. 551 SSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFT--GDNLMVLIQKGQEIALGRMHLNSLPAREGLQ 628 (826) Q Consensus 551 ~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~--~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~ 628 (826) +|..++|+++++|+|++|+|||+++||+|+|||||+|+|.|+++|++ +|+||++|+|++++++|||.+.+ ......+ T Consensus 509 ~~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~iv~r~~~~~~~r~~~~~-~~~~~~~ 587 (794) T protein:vir:10 509 ENFCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSDMYVILRNEFNTFLARISFTK-NAIDLQG 587 (794) T ss_pred CCcEEEEEEcCCCEEEEEEEeecCCceEEEeEEEEEcCCcEEEEEEEecCCeEEEEEEeCCCEEEEEEEEee-cCCCCCC Confidence 55677899999999999999999999999999999999999988865 89999999999999999996543 3333333 Q ss_pred cccccccceEEEeecccceec-------cceeeccCCcccceeeEEecCceeeeeecccce-------ecCCceEEEecC Q lcl|NC_011107. 629 YPKYDYWRRIEATVDGELELT-------KQHWDLIKDASAVYQLQPVAGAYMERTHLGVKR-------ETNTKVFLDVPE 694 (826) Q Consensus 629 ~~~~d~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~-------~~~~~~~~~~~~ 694 (826) .++ ..++||..++. .......-....++++.|++|+.+...++|... ..++..++++++ T Consensus 588 ~~~-------~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~g~~~~eg~~v~~~adg~~~~~~~~~~~~~g~~~l~i~~ 660 (794) T protein:vir:10 588 EPY-------RAFMDMKIRYTIPSGTYNDDTFTTSIHIPTIYGANFGRGKITVLEPDGKITVFEQPTSGWQSDPWLRLSG 660 (794) T ss_pred ccc-------eeeeecceEEEecCcccccccccceEEcccccCcccccccEEEEecCCceeeeeeeeeeeecceEEEecC Confidence 332 12233333322 111111112234568899999999998888543 224556889999 Q ss_pred CCCCceEEEeeeeeEEEEeCCeeEecCCCCceee----cceEEEEEEEEeeccceEEEEecCCCCccceeeeccCccccc Q lcl|NC_011107. 695 AVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTS----TRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFS 770 (826) Q Consensus 695 ~~~~~~v~vGl~y~~~~~~~~~~i~~~~g~~~~~----gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~ 770 (826) +.++++|+|||+|+++++|+||++++++|++++. ||+||+|++++|.+||+|.+.|+++.++. .+.+.+.++++ T Consensus 661 ~~~a~~v~vGl~y~s~~~~~~~~i~~~~~~~~~~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~~--~~~~~~~~~~~ 738 (794) T protein:vir:10 661 NLEGREVFIGFNINFVYEFSKFLIKQTTDDGSTSTEDIGRLQLRRAWVNYEDSGTFDIYVENQSSNW--KYTMAGARLGS 738 (794) T ss_pred CCCCceEEEeeeeeEEEEecceEEEccCCCcceeeeccccEEEEEEEEEeeccccEEEEEcCCcccc--ceeeccceecc Confidence 9999999999999999999999999999987754 89999999999999999999999987654 35678889999 Q ss_pred cccccCccccceeEEEEEecccCceeEEEEEECCCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 771 RQLNAGEPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 771 ~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~rrv 826 (826) .++..|.+++.+|+++||+.+|+++.+|+|+|++|+||+|++|+|||+||+|.||| T Consensus 739 ~~~~~g~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~eg~y~~r~~~v 794 (794) T protein:vir:10 739 NTLRAGRLNLGTGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEGNYLRRSSGI 794 (794) T ss_pred ccccccccccccceEEEEecccCceEEEEEEECCCCceEEEEEEEEEEEeccccCC Confidence 99999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:80253 Length: 777 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522886;genbank:gi:158345179;genbank:GeneID:5687516 Probab=100.00 E-value=9.2e-226 Score=1254.30 Aligned_cols=774 Identities=39% Similarity=0.650 Sum_probs=649.5 Q ss_pred CCceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCCCceEEEE Q lcl|NC_011107. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~~e~~~~~ 80 (826) ||+|+|+||||++|||||+|++|||+||++|+||+|+|+|||+||||++||+++++.+.....++ |..++++.||+|++ T Consensus 1 M~~i~~~~~nf~~GvS~q~D~~ry~~q~~~~~N~~~~~~gG~~rRpGt~fv~~l~~~~~~~~~~~-~~~~~~~~e~~~~l 79 (777) T protein:vir:80 1 MSYFAGSYRQLLFGVSQQTAKDRLEGQVESQLNMQSDLVTGPRRRSPVHLIADAMAATDANRLAY-SLATFSGREVLLLV 79 (777) T ss_pred CceeeeecchhhcccccCCchHHhhhHHhhhhcceeeeccCceeCcchHhhhhhcCCCcccceeE-EEEecCCCeeEEEE Confidence 99999999999999999999999999999999999999999999999999999998766544444 66789999999999 Q ss_pred EecCCeEEEEEcCCCEEEEecCccccccccCCccceEEEEEcCEEEEEeCcccCcccccc--cCCCCCCccEEEEEcccc Q lcl|NC_011107. 81 AQHRGELYLFDERDGRLLMGQPLVHDYLKANDYRQLRAATVADDLFIANLSVKPEADRTD--IKGVDPNKAGWLYIKAGQ 158 (826) Q Consensus 81 ~~~~g~i~v~~~~~g~~~~~~~~~~~y~~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~--~~~~~~~~~a~~~v~~g~ 158 (826) ++++|+||||++.+|.++..+. .+|+++++.++|+|+|+||+|||||++++|++..+. ...+.++.++++++++|+ T Consensus 80 ~~g~g~irv~~~~~g~~~~~~~--~~Yl~a~~~~~l~~~q~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~~~v~~~~ 157 (777) T protein:vir:80 80 DTLDGTLTILDDATGEVLFTGT--NSYLTAGTGRSIRFAALDDSVFVANTEVIPQTQLWSGASAYPDPTRAGYLYVVAGA 157 (777) T ss_pred EecCCeEEEEECCCCeEEEecC--CCceeeccccceeEEEEcCEEEEEeCCccceeeecccCCCccCcccceEEEeeccC Confidence 9999999999999999888764 578888899999999999999999999999985433 234667788999999999 Q ss_pred cCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeeccceEEEeecccccceeeccc Q lcl|NC_011107. 159 YSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDA 238 (826) Q Consensus 159 y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga~~~t~~~~~~~~~~~a~~~ 238 (826) |+++|+|+|++.......+ .+..++ .......+.++++.+|...+.....++. T Consensus 158 ~g~~y~i~i~~~~~~~~~t----~~~~t~---------~~~~~~~~~~~ia~~L~~~~~~~~~~~s-------------- 210 (777) T protein:vir:80 158 FSKQYRLSITNQVTGVTTS----VDVTTS---------ATEASQATGEYVITQLRTAAEADATIGT-------------- 210 (777) T ss_pred CCceeeEeecCCcCceeEE----EecCCc---------ccccccccchhhhhhhhhhhccccceee-------------- Confidence 9999999998765542221 121111 1122234556777777644332211110 Q ss_pred cceeecccccccccccceEEEecCCceEEEEecCCCCcceEEEEEEeecccccccccccCcccccEeeeeeeeeEeccCC Q lcl|NC_011107. 239 NAATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVMATGS 318 (826) Q Consensus 239 ~~~t~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 318 (826) .++ ......++++++...... ..+.++|+++.+....+.+++..+||+.+|.. ...++..++. T Consensus 211 ----~~~--~~~~~~g~~~~i~~~~~~--~~t~~~g~~~~~~~~~~~v~~~~~lp~~~~~~---------~~~~~~~~~~ 273 (777) T protein:vir:80 211 ----AAG--FAYYQDGAYLYVTAPEAI--AVSTDSGSNFLRASNAASIRDAAELPAKLPAD---------ADGFIIATGA 273 (777) T ss_pred ----cCc--eEEEeCCcEEEEEecCce--eEecCCcCccceeeeeEEEeeccccccccccc---------cceEEEeCCC Confidence 000 011223455666665544 34667888899999999999999999998753 2445667788 Q ss_pred CCcceEEEEEcCCceEEEeecccccccccceeEEEEEecCCCeEEEeccCcCccccCCccccCCccccCCCccEEEEEcc Q lcl|NC_011107. 319 TKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTRGITGMTTFQG 398 (826) Q Consensus 319 ~~~~~y~~~~~~~~~w~e~~~~g~~~~~~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~~~q~ 398 (826) +++.||++|+..+++|+||++||+.+++++|||.++.. .++|++++++|++|.+||+++||+|||+|++|++|+|||| T Consensus 274 ~~~~~y~~~~~~~~~w~e~~~~~~~~~~~t~p~~l~~~--~~~~~~~~~~w~~r~~gd~~tn~~Psf~g~~i~~v~f~q~ 351 (777) T protein:vir:80 274 AKNKTYFRWVDLERKWDEDASRGAQAELIDMPLRITYS--APNFSLTALNYERRASGDATSNPALKFTEQGISGMTTMQG 351 (777) T ss_pred CCCceEEEEEccCcEEEEeecccccccccccceEEEec--CCceEeeccCCccccccccccCCCceecCCceeEEEEEcc Confidence 88999999999999999999999999999999999753 3689999999999999999999999999999999999999 Q ss_pred eEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceE Q lcl|NC_011107. 399 RLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAV 478 (826) Q Consensus 399 RL~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~ 478 (826) ||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+|+++++|+|||+++||+|+++++|||+|++ T Consensus 352 RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdDpI~~~~ss~~~~~i~~~v~~~~~L~i~T~~~e~~l~~~~~lTP~~~~ 431 (777) T protein:vir:80 352 RLVLLAGEYVCMSASGNPLRWFRASVSTQSDDDPIEVAATAPVASPYEYAVAFNKDLVLFAKTHQGLVPGANLLTSRNAT 431 (777) T ss_pred eeeeecCCeEEEEeccCccccccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCceEEEeCCCcccceeEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEEEEEcCCCCEEEEE Q lcl|NC_011107. 479 ISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAASSGYLVFG 558 (826) Q Consensus 479 ~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~s~~p~~~v~~ 558 (826) ++++|+|+|+++++|+.+|+++||+++|++++++||||+|+++.++.|+++|||+|++|||+++|.+|++|++|++++|+ T Consensus 432 ~~~~s~~~~~~~~~Pv~vG~~v~Fv~~r~g~~s~v~e~~~~~~~~d~y~a~Dlt~~~~hl~~~~v~~~a~s~~p~~v~~~ 511 (777) T protein:vir:80 432 AAVVTEYSFQNSCSPVVAGRTVFFASPRSGPWSAVWEMLPSQYTDAQVEASDSTSHLPKYIAGPVRFLATSSTTSIVVVG 511 (777) T ss_pred EEEEEeeccCCCCCceEeCCeEEEEecCCCceeEEeeeeecccccCceehhHHHHHHHHhcCCceEEEEEcCCCceEEEE Confidence 99999999999999999999999999888788999999999888778999999999999999999999999999999999 Q ss_pred EcCCCeEEEEEEeecCCceeeeeeEeeecCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEeecCcccCCCcccccccceE Q lcl|NC_011107. 559 TSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRI 638 (826) Q Consensus 559 ~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~~~~~d~~~~~ 638 (826) +++||+|++|||||+++||+|+|||||+|+|+|++||+++|+||++|+|+...++|||.+....+....+..++|+.+.. T Consensus 512 ~~~dg~l~~~ty~~~~~e~~v~aW~r~~~~g~v~~v~~i~d~l~~iv~r~~~~~le~~~~~~~~d~~~~~~~~~D~~~~~ 591 (777) T protein:vir:80 512 TSNLRELVVHEYLWQGGEKVHAAWHKWSFPQDITGAYFRGDRLILLFHVAGRVILGELFMQRLGDAQSIPGGFLDLYRVG 591 (777) T ss_pred EcCCCeEEEEEEeecCCceEEEeeEEeccCCcEEEEEEECCEEEEEEEcCCeEEEEEEeeccCCCCcccceeeeeeeeee Confidence 99999999999999999999999999999999999999999999999999999999997766655554555566654433 Q ss_pred EEeecccceeccceeeccCCcccceeeEEecCceeeeee-cccceecCCceEEEecCCCCCceEEEeeeeeEEEEeCCee Q lcl|NC_011107. 639 EATVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTH-LGVKRETNTKVFLDVPEAVVGAVYVVGCEFWSKVEFTPPV 717 (826) Q Consensus 639 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~g~~~~~~~~~~~~~~~~~~~~~v~vGl~y~~~~~~~~~~ 717 (826) ....++..+......+ .-......+..+.|....... .+..........+.++++..+++|+|||+|+++++|+||+ T Consensus 592 ~~~~~~~~~~~~~~~~--~~~~~~~~~v~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~~~~~~v~VGl~y~s~~~~~~~~ 669 (777) T protein:vir:80 592 AANADEEVAIPAFAAD--LYPEDSTFAYKLSGEFQSLGQRCGDRRVDGATVYIKVVGAQAGDQYRIGLRYLSKLGPTRPI 669 (777) T ss_pred eeeeCCccceeEeecc--ccCCcceeEEEecCcccccceeeeeEEeCCceeeEEEcCCCCCCEEEEeeeeEEEEEeCceE Confidence 3333332221111100 000111111222222111111 1233344455678899999999999999999999999999 Q ss_pred EecCCCCceeecceEEEEEEEEeeccceEEEEecCCCCccceeeeccCccccccccccCccccceeEEEEEecccCceeE Q lcl|NC_011107. 718 LRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLVDSAVVPLPARVDMATSK 797 (826) Q Consensus 718 i~~~~g~~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~ 797 (826) +++++|+.++.+|+||+|++|+|++||+|.+.|+++.++. ..+.+++.+++++++..|.|++.++++++|+.+|+++.+ T Consensus 670 ~~~~~g~~~~~~r~~i~r~~~~~~~sg~~~v~v~~~~~~~-~~~~~~~~~~~~~~~~~g~~~~~tg~~~vp~~~~~~~~~ 748 (777) T protein:vir:80 670 LRDPNGVPITTERTQLHRLTWSLDSTGEVTFRVADQARGE-SAYTTTPLRLYSRDLGAGLPLAATATLDTPARVDMQTAQ 748 (777) T ss_pred EeCCCCceeeecCeEEEEEEEEeeccccEEEEEcCCCCcc-eeeeecCceecccccccccccccceEEEEEEeecCcceE Confidence 9999999999999999999999999999999999988754 457888999999999999999999999999999999999 Q ss_pred EEEEECCCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 798 FELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 798 v~i~~~~P~P~tvl~i~weg~y~~r~rrv 826 (826) |+|++++|+||+|++|+|||+||+|+||= T Consensus 749 v~i~~d~P~P~tilsi~~e~~y~~r~~r~ 777 (777) T protein:vir:80 749 FSLETDDYYDMNITSLEYGFRYNQRYRRQ 777 (777) T ss_pred EEEEECCCCceEEEEEEEEEEeecccccC Confidence 99999999999999999999999999966 No 5 >protein:vir:3366 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523337;swissprot:trembl:q8w5u3;genbank:gi:17570828;goa:Q8W5U3;uniprot:Q8W5U3;genbank:GeneID:927453 Probab=100.00 E-value=3.1e-223 Score=1240.45 Aligned_cols=767 Identities=21% Similarity=0.303 Sum_probs=643.2 Q ss_pred CCceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCCCceEEEE Q lcl|NC_011107. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~~e~~~~~ 80 (826) ||+|+|+||||+||||||||++|||+||++|+||+|+|+|||+||||++||++++.+++..++||+|+++|++.|+|+ + T Consensus 1 M~~i~~~~~nl~~GvSqq~d~~r~~~q~~~~~N~~~~~~gG~~rRpGt~~va~~~~~~~~~~~~~~~~~~r~~~~~y~-l 79 (801) T protein:vir:33 1 MALISQSIKNLKGGISQQPDILRFTEQGSVQINGWSSESEGIQKRPPMIHLKTLGTAGYVGAQPYVHLINRDEFEQYF-V 79 (801) T ss_pred CceeEeeccceecceeccchhHhhhhhHhhhhcceeecccCcccCchhHhhhhhcCCCccccceEEEEEEeCCceEEE-E Confidence 999999999999999999999999999999999999999999999999999999999888899999999999887776 5 Q ss_pred EecCCeEEEEEcCCCEEEEecCcccccc-ccCCccceEEEEEcCEEEEEeCcccCccccc--ccCCCCCCccEEEEEccc Q lcl|NC_011107. 81 AQHRGELYLFDERDGRLLMGQPLVHDYL-KANDYRQLRAATVADDLFIANLSVKPEADRT--DIKGVDPNKAGWLYIKAG 157 (826) Q Consensus 81 ~~~~g~i~v~~~~~g~~~~~~~~~~~y~-~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~--~~~~~~~~~~a~~~v~~g 157 (826) ++++++||||+++|++..+... .+|+ ++++.++|+++|+||+|||+|++++|++... +...++++++++++++++ T Consensus 80 ~~~~~~irv~~~~G~~~~v~~~--~~y~~~~~~~~~l~~~t~aD~~fi~nr~~~p~~~~~~~~~~~~~~~~~~li~v~~~ 157 (801) T protein:vir:33 80 VFTGEDIKVFDLDGKEYQVRGD--RSYVRTANPREDLRMVTVADYTFVTNRKVVVQSNDQSVNLPGFKDQGDALINVRGG 157 (801) T ss_pred EEcCCeEEEEccCCcEEEEecC--CcceeecCcchheEEEEEcCEEEEeeCCeeecccCCcccccccCCCcceEEEEeec Confidence 6678999999987766655543 3455 4678899999999999999999999998653 456677888999999999 Q ss_pred ccCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeee----------ccceEEEeec Q lcl|NC_011107. 158 QYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFF----------GAPEYTLPNS 227 (826) Q Consensus 158 ~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~----------ga~~~t~~~~ 227 (826) +|+++|+|+++++. .+.+++|++.. .....+....+++.++...+. +...|+.. T Consensus 158 ~yg~t~~I~i~gs~---------~~~~~~~~gs~-----~~~v~~~s~~~~A~~l~~~~~~~~~~~~~~~~~~~w~~~-- 221 (801) T protein:vir:33 158 QYGRRLSIEFNGAE---------RAAVQLPDGSQ-----PAHVNEVDGQAIAEKLAAQLRNNLGNPNNDQDPNKWRFN-- 221 (801) T ss_pred ccceEEEEEECCcc---------eEEEEeecccc-----ccccccccchhhhhhhhhhhhccCccceeeecCceEEEE-- Confidence 99999999998753 34555555432 122223333344444433221 11222221 Q ss_pred ccccceeeccccceeecccccccccccceEEEecCCce--EEEEecCCCCcceEEEEEEeecccccccccccCcccccEe Q lcl|NC_011107. 228 TKKYPKVDPDANAATIAGYLNQRGVQDGYIAFRGDADI--HVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVG 305 (826) Q Consensus 228 ~~~~~~~a~~~~~~t~a~~~~~~~~~~g~~~~~~~~~~--~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~ 305 (826) ..++++++..+++. ....+.++++++.+.++.++++++++||..+|.+ T Consensus 222 ------------------------~~~g~~~i~~p~~~~~~~itt~~g~~~~~~~~~~~~v~~~~~lp~~~~~g------ 271 (801) T protein:vir:33 222 ------------------------VGPGFIHILAPNNDNVWGLQTKDGYADQLINPVTHYTQSFQKLPINAPDG------ 271 (801) T ss_pred ------------------------ecCeEEEEecCCCcccccccccCCccceeEEEEeecccceeeeeeecCCC------ Confidence 11223333333222 2234667788899999999999999999988753 Q ss_pred eeeeeeeEeccCCCCcceEEEEEcCCceEEEeecccccccc--cceeEEEEEecCCCeEEEeccCcCccccCCccccCCc Q lcl|NC_011107. 306 VQFMDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTF 383 (826) Q Consensus 306 ~~~~~~~~~~~~~~~~~~y~~~~~~~~~w~e~~~~g~~~~~--~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~p 383 (826) +..++...++.+.+.||++|++.+++|+||++||+..++ ++|||+++ ++++++|+++.++|++|.+||+++||+| T Consensus 272 --~~v~v~~~~~~~~~~y~v~~~~~~~~w~e~~~~g~~~~~~~~tmp~~l~-~~~~~tf~~~~~~w~~r~~gd~~tnp~p 348 (801) T protein:vir:33 272 --YIVKIVGDTSKTADQYYVRFDLNRKVWVETIGWNTRTHLHYHTMPWALV-RASDGNFDFKYLEWGARTVGDDTTNPYP 348 (801) T ss_pred --cEEEEEecCCCcccceEEEEEcCCcEEEEeeccccceeeeecccceEEE-EccCceEEecccCccccccCCccccCcc Confidence 345566778888899999999999999999999986666 58999998 7899999999999999999999999999 Q ss_pred cccCCCccEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcE Q lcl|NC_011107. 384 NFVTRGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQ 463 (826) Q Consensus 384 sf~g~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q 463 (826) +|+|++|++|+||||||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+|+++++|+|||+++| T Consensus 349 sf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q 428 (801) T protein:vir:33 349 SFTGQTINDIFFFRNRLGFLSGENIILSRTSKYFNFFPASVSNYSDDDPIDVAVSHDRVSTLKYAVPFSEELLLWSDQAQ 428 (801) T ss_pred cccCCCceEEEEEcceEEEeeCCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeeEEeecCCcEEEEecCcE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeCCccccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCe Q lcl|NC_011107. 464 AVVPGGGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPA 543 (826) Q Consensus 464 ~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v 543 (826) |+|+++++|||+|++++++|+|+|+++++|+.+|+++||++++| ++++++|+++..+..+.|+++|||+|++|||++++ T Consensus 429 ~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~v~r~~~~~~~~d~y~~~Dlt~~~~~~~~~~~ 507 (801) T protein:vir:33 429 FVLTASDILSSRSVGLNLTTQFDVQDRARPHGVGRNVYFSSPRA-SFTSINRYYAVQDVSSVKNAEDMTAHVPNYIPNGV 507 (801) T ss_pred EEEeCCCcccceeEEEEEEEeecccCCCCceEecCeEEEEecCC-CeeEEEEEEeecccccceehhhHHHHHHHhcCCce Confidence 99999999999999999999999999999999999999999988 57777665544455556999999999999999999 Q ss_pred EEEEEcCCCCEE-EEEEcCCCeEEEEEEeecCCceeeeeeEeeecCCcEEEEEE--ECCeEEEEEEeCCCEEEEEEEEee Q lcl|NC_011107. 544 EYIQAAASSGYL-VFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYF--TGDNLMVLIQKGQEIALGRMHLNS 620 (826) Q Consensus 544 ~~~~~s~~p~~~-v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~--~~d~l~~vv~r~~~~~~~r~~~~~ 620 (826) .+|+++++|+++ +|+.+++|+|++|+|||+++||+|+|||||+|+|.|+++|+ ++|+|||+|+|++..++|||++.+ T Consensus 508 ~~~~~~~~~~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~vv~r~~~~~le~~~~~~ 587 (801) T protein:vir:33 508 FSISGTTAENFVAILTSGAPNRVYIYKFLYIDEEIRQQSWSHWDFGDNVTVFAAQVINSTMTVLMSNEHAVWMGRLHFTK 587 (801) T ss_pred EEEEEcCCCCeEEEEEecCCCEEEEEEEecCCCceEEEeeEEEEcCCCEEEEEEecCCCEEEEEEEcCCcEEEEEEEEee Confidence 999999888876 68889999999999999999999999999999999887775 699999999999999999997654 Q ss_pred cCcccCCCcc---cccccceEEEeecccceeccceeeccCCcccceeeEEecCceeeeeecccce-------ecCCceEE Q lcl|NC_011107. 621 LPAREGLQYP---KYDYWRRIEATVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKR-------ETNTKVFL 690 (826) Q Consensus 621 ~~~~~~~~~~---~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~-------~~~~~~~~ 690 (826) ...... ..+ ++|.. ..+...++++.........+...++++.|++|..+...++|... ...+..++ T Consensus 588 ~~~d~~-~~~~~~~lD~~---~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~eg~~v~~~~dG~v~~~~~~~~~~~~~~~l 663 (801) T protein:vir:33 588 DSIDLP-GEPYRLYIDAK---RKYTIPAGTYNDDTYQTSISLSTIYGMNFTKGRVSVVFPDGKIVEIDQPINGWSSDPML 663 (801) T ss_pred ccccCC-CccceEEeecc---eEEEecccceecCccccccccccccCCccccceEEEEEeCCceEeeeeccccccCceeE Confidence 332211 111 23322 22333444554433222223344678999999999999998653 22356688 Q ss_pred EecCCCCCceEEEeeeeeEEEEeCCeeEecCCCC----ceeecceEEEEEEEEeeccceEEEEecCCCCccceeeeccCc Q lcl|NC_011107. 691 DVPEAVVGAVYVVGCEFWSKVEFTPPVLRDHNGL----PMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPL 766 (826) Q Consensus 691 ~~~~~~~~~~v~vGl~y~~~~~~~~~~i~~~~g~----~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~ 766 (826) +++++.++++|+|||+|+++++|+||+++.++|+ ....+|+||+|++|++.+||+|.+.|+++.++ ..+.+.+. T Consensus 664 ~i~~~~~~~~v~vGl~y~s~~~~~~~~~~~~~~~~~~~~~~~~r~~l~r~~~~~~~tg~~~v~v~~~~~~--~~~~~~~~ 741 (801) T protein:vir:33 664 RLDGNQEGQVVYIGFNIPFTYTFSKFLIKKTAEDGSTATEDIGRLQLRRAWVNYEDSGAFIIRVNNLSRE--FIYTMAGA 741 (801) T ss_pred EecCCCCCCEEEEeeeeeEEEEeCceEEeccCCCCceeeeeeccEEEEEEEEEeecCcceEEEECCcccc--eeeeeccc Confidence 9999999999999999999999999999977654 45668999999999999999999999998764 34788899 Q ss_pred cccccccccCccccceeEEEEEecccCceeEEEEEECCCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 767 RLFSRQLNAGEPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 767 ~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~rrv 826 (826) ++++.++..+.|++++|++++|+.+|+++.+|+|++++|+||+||+|+|||+||+|.||| T Consensus 742 ~~~~~~~~~~~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvl~i~~eg~y~~r~~~~ 801 (801) T protein:vir:33 742 RLGSDNLRVGGSNIGTGQYRFPVVGNAQTNTVTIESDASTPLNIIGCGWEGNYLRRSSGI 801 (801) T ss_pred ccccccccccccccccceEEEEeeccCceEEEEEEeCCCCCEEEEEEEEEEEEeccccCC Confidence 999999999999999999999999999999999999999999999999999999999999 No 6 >protein:vir:1543 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052111;swissprot:trembl:q9t105;genbank:gi:9634037;uniprot:Q9T105;genbank:GeneID:1262408 Probab=100.00 E-value=2.4e-223 Score=1241.09 Aligned_cols=770 Identities=22% Similarity=0.306 Sum_probs=646.2 Q ss_pred CCceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCCCceEEEE Q lcl|NC_011107. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~~e~~~~~ 80 (826) ||+|+|+||||++|||||||++|||+||++|+||+|+|+|||+||||++||+++++.+...++||+|+++||+.|+|+ + T Consensus 1 M~~i~~s~~n~~~GvSqq~d~~r~~~q~~~~~N~~~~~~gGl~rRpGt~~va~~~~~~~~~~~~~~~~~~~~~~e~y~-l 79 (801) T protein:vir:15 1 MALISQSIKNLKGGISQQPDILRFAEQGSVQINGWSSESEGLQKRPPMIHLKTLGPAGYVGAQPYVHLINRDEFEQYF-V 79 (801) T ss_pred CceeeeecchhhcceecCcchHhhhhhHhhhhcceeccccCcccCCchheeeeecCCCCcccceeEEEEEeCCceEEE-E Confidence 999999999999999999999999999999999999999999999999999999998888899999999999887776 5 Q ss_pred EecCCeEEEEEcCCCEEEEecCcccccc-ccCCccceEEEEEcCEEEEEeCcccCccccc--ccCCCCCCccEEEEEccc Q lcl|NC_011107. 81 AQHRGELYLFDERDGRLLMGQPLVHDYL-KANDYRQLRAATVADDLFIANLSVKPEADRT--DIKGVDPNKAGWLYIKAG 157 (826) Q Consensus 81 ~~~~g~i~v~~~~~g~~~~~~~~~~~y~-~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~--~~~~~~~~~~a~~~v~~g 157 (826) ++++++||||+++|.+..+.... +|+ +++++++|+++|+||+|||+|++++|++... +...+++..|++++++++ T Consensus 80 ~~~~~~irv~~~~G~~~~v~~~~--~y~~~~~~~~~l~~~~~aD~~fi~nr~~~~~~~~~~~~~~~~~~~~~alv~v~~~ 157 (801) T protein:vir:15 80 VFTGEDIKVFDLDGKEYQVRGDR--SYVRTANPREDLRMITVADYTFVTNRKVVVQSNDQSVNLPGFKDQGDALINVRGG 157 (801) T ss_pred EEcCCeEEEEccCCcEEEEecCC--ccccccCchhheeEEEEcCEEEEeeCCeeeecccCccccCccCCCCceEEEeeec Confidence 77789999999876666555433 444 4578889999999999999999999997643 455677888999999999 Q ss_pred ccCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeee----------ccceEEEeec Q lcl|NC_011107. 158 QYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFF----------GAPEYTLPNS 227 (826) Q Consensus 158 ~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~----------ga~~~t~~~~ 227 (826) +|+++|+|+++++. .+.+++|++... .........+++..+...+. +...|+. T Consensus 158 ~yg~t~~I~i~gs~---------~~~~t~~~gs~~-----~~~~~~s~~~ia~~l~~~~~~~~p~~~~~~~~~~w~~--- 220 (801) T protein:vir:15 158 QYGRRLSIEFNGAE---------RAAVQLPDGSQP-----AHVNEVDGQAIAEKLAAQLRNNLGNPNNDQDPNKWRF--- 220 (801) T ss_pred cCceeEEEEeCCcc---------eEEEEeccCccc-----chhhhcceeechHHHhhhhhhccCccceeccCccEEE--- Confidence 99999999998753 355666665432 22223333444444433221 1111221 Q ss_pred ccccceeeccccceeecccccccccccceEEEecCCce--EEEEecCCCCcceEEEEEEeecccccccccccCcccccEe Q lcl|NC_011107. 228 TKKYPKVDPDANAATIAGYLNQRGVQDGYIAFRGDADI--HVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVG 305 (826) Q Consensus 228 ~~~~~~~a~~~~~~t~a~~~~~~~~~~g~~~~~~~~~~--~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~ 305 (826) ...++++++..+.+. ....+.++++++.+.+..+.++++++||..+|++ T Consensus 221 -----------------------~~~~g~~~i~a~~~~~~~~~~t~dg~~~~~~~~~~~~v~~~~~lp~~~~~G------ 271 (801) T protein:vir:15 221 -----------------------NVGPGFIHILAPNNDNVWGLQTKDGYADQLINPVTHYTQSFQKLPINAPDG------ 271 (801) T ss_pred -----------------------EecCcEEEEeCCCCcccceeeeccccCceeeeEEeecccceeeeeeecCCC------ Confidence 112233444433332 2345678888999999999999999999988753 Q ss_pred eeeeeeeEeccCCCCcceEEEEEcCCceEEEeecccccccc--cceeEEEEEecCCCeEEEeccCcCccccCCccccCCc Q lcl|NC_011107. 306 VQFMDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTF 383 (826) Q Consensus 306 ~~~~~~~~~~~~~~~~~~y~~~~~~~~~w~e~~~~g~~~~~--~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~p 383 (826) +.+++.+.++.+.+.||++|+..+++|+||++||..+++ +||||.++ +.++++|+++.++|++|.+||+++||+| T Consensus 272 --~~v~v~~~~~~~~~~y~v~~~~~~~~w~E~a~~g~~~~~~~~tmp~~lv-~~~~~~~~~~~~~w~~r~~gd~~tnp~p 348 (801) T protein:vir:15 272 --YIVKIVGDTSKTADQYYVRFDLNRKVWVETIGWNTRTHLYYHTMPWALV-RASDGNFDFKVLEWGARTVGDDTTNPYP 348 (801) T ss_pred --cEEEEEecCCCccceEEEEEEcCCeeEEeecccccceeeeccccceEEE-eeccceEEEeccccccccCCccccCCcc Confidence 345567778888899999999999999999999987777 58999998 6789999999999999999999999999 Q ss_pred cccCCCccEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcE Q lcl|NC_011107. 384 NFVTRGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQ 463 (826) Q Consensus 384 sf~g~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q 463 (826) +|+|++|++|+||||||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+|+++++|+|||+++| T Consensus 349 sf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~i~t~~~q 428 (801) T protein:vir:15 349 SFTGQTINDIFFFRNRLGFLSGENIILSRTSKYFNFFPASVSNYSDDDPIDVAVSHNRVSTLKYAVPFSEELLLWSDQAQ 428 (801) T ss_pred cccCCCceEEEEEcceEEEeeCCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeCCccccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCe Q lcl|NC_011107. 464 AVVPGGGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPA 543 (826) Q Consensus 464 ~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v 543 (826) |+|+++++|||+|++++++|+|+|+++++|+.+|+++||++++| ++++++|+++..+..+.|+++|||+|++|||++++ T Consensus 429 ~~ls~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~~~r~~~~~~~~d~y~a~Dlt~~~~hl~~~~v 507 (801) T protein:vir:15 429 FVLTASGILSSRSVELNLTTQFDVQDRARPHGVGRNVYFASPRA-SFTSINRYYAVQDVSSVKNAEDMTAHVPNYIPNGV 507 (801) T ss_pred EEEcCCCcccceeEEEEEEEeeeccCCCCceEeCCeEEEEecCC-CeeEEEEEEeecccccceehhhHHHHHHHhcCCce Confidence 99999999999999999999999999999999999999999998 57777665443455556999999999999999999 Q ss_pred EEEEEcCCCCE-EEEEEcCCCeEEEEEEeecCCceeeeeeEeeecCCcEEEEEE--ECCeEEEEEEeCCCEEEEEEEEee Q lcl|NC_011107. 544 EYIQAAASSGY-LVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYF--TGDNLMVLIQKGQEIALGRMHLNS 620 (826) Q Consensus 544 ~~~~~s~~p~~-~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~--~~d~l~~vv~r~~~~~~~r~~~~~ 620 (826) .+++++++|+. ++|+++++|+|++|||||+++||+|+|||||+|+|.++++|+ .+|+||++|+|+++.+++||++.. T Consensus 508 ~~~~~~~~~~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~~v~r~~~~~~~r~~~~~ 587 (801) T protein:vir:15 508 FSISGTTAENFAAILTSGAPNRVYIYKFLYIDEEIRQQSWSHWDFGDNVTVFAAQVINSTMTVLMGNEHAVWMGRLHFTK 587 (801) T ss_pred EEEEEeCCCCcEEEEEEcCCCEEEEEEEecCCCceEEEeeEEEEcCCCEEEEEEEecCCEEEEEEEecCcEEEEEEEEcc Confidence 99999876665 579999999999999999999999999999999999988876 489999999999999999996554 Q ss_pred cCcccCCCcccccccceEEEeecccceeccceeeccCCcccceeeEEecCceeeeeecccceec----C---CceEEEec Q lcl|NC_011107. 621 LPAREGLQYPKYDYWRRIEATVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKRET----N---TKVFLDVP 693 (826) Q Consensus 621 ~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~~~----~---~~~~~~~~ 693 (826) ... ...+.++.-+.+....+...+.++.........+...++++.|++|..+....+|...+. + ...++.++ T Consensus 588 ~~~-~~~~~~~~~~lD~~~~~~~~~~t~~~~~~~~~~~~~~~~gl~~l~g~~v~v~~dG~~~~~~~~~~g~~~~~~~~i~ 666 (801) T protein:vir:15 588 NSI-DIPGEPYRLYIDAKRKYTIPAGTYNDDTYQTSISLATIYGMNFTKGRVSVVFPDGKIIEVDQPINGWSSDPVLRLD 666 (801) T ss_pred ccc-cCCCcceeeeeeeeeeEeeccceeccCceecccccccccccccccceEEEEEeCCceeeeeeecCcccCcceEEEc Confidence 322 122222211111111223344555554444445566788999999999999999965432 1 23478899 Q ss_pred CCCCCceEEEeeeeeEEEEeCCeeEecCCCC----ceeecceEEEEEEEEeeccceEEEEecCCCCccceeeeccCcccc Q lcl|NC_011107. 694 EAVVGAVYVVGCEFWSKVEFTPPVLRDHNGL----PMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLF 769 (826) Q Consensus 694 ~~~~~~~v~vGl~y~~~~~~~~~~i~~~~g~----~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~ 769 (826) ++.++++|+|||+|+++++|+||+++.++|+ ....+|+||+|++|+|.+||.|.+.|+++.++. .+..++.+++ T Consensus 667 ~~~~~~~v~vGl~y~~~~~~~~~~~~~~~~~~~~~~~~~~rl~l~r~~~~~~~tg~~~~~v~~~~~~~--~~~~~~~~~~ 744 (801) T protein:vir:15 667 GNQEGQVVYIGFNIPFTYTFSKFLIKKTAEDGSTATEDIGRLQLRRAWVNYEDSGAFTIRVNNLSREF--IYTMAGARLG 744 (801) T ss_pred CCCCCcEEEEeeeeeEEEEecceEEeccCCCCCceeeeeccEEEEEEEEEeccCcceEEEECCccccc--ceeecCcccc Confidence 9999999999999999999999999977664 445689999999999999999999999988753 4778899999 Q ss_pred ccccccCccccceeEEEEEecccCceeEEEEEECCCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 770 SRQLNAGEPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 770 ~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~rrv 826 (826) ++++..+.|++++++++||+.+|+++.+|+|++++|+||+||+|+|||+||+|.||| T Consensus 745 ~~~~~~~~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~e~~y~~r~~~~ 801 (801) T protein:vir:15 745 SDNLRVGRSNIGTGQYRFPVVGNAQTNLVTIESDASTPLNIIGCGWEGNYLRRSSGI 801 (801) T ss_pred cccccccccccccceEEEEEeecCceEEEEEEECCCCcEEEEEEEEEEEEeccccCC Confidence 999999999999999999999999999999999999999999999999999999999 No 7 >protein:vir:2203 Length: 794 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_042000;swissprot:sw:p03747;genbank:gi:9627472;uniprot:P03747;genbank:GeneID:1261024 Probab=100.00 E-value=1.9e-222 Score=1236.11 Aligned_cols=771 Identities=23% Similarity=0.319 Sum_probs=650.0 Q ss_pred CCceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCCCceEEEE Q lcl|NC_011107. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~~e~~~~~ 80 (826) ||+|+|+||||++|||||+|++|||+||++|+||+|+|+|||+||||++||++++++......++.|+++|++.|+|++ T Consensus 1 M~~i~~s~~n~~~GvSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~~~~~~~~~y~l- 79 (794) T protein:vir:22 1 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTLGDNGALGQAPYIHLINRDEHEQYYA- 79 (794) T ss_pred CceeeeecchhhcccccCCchHHhhhHHhhhhcceeeccCCceeCCchHhhhhhcccCCCCCccEEEEEEeCCCcEEEE- Confidence 9999999999999999999999999999999999999999999999999999999877666678889999988888876 Q ss_pred EecCCeEEEEEcCCCEEEEecCcccccc-ccCCccceEEEEEcCEEEEEeCcccCccccc--ccCCCCCCccEEEEEccc Q lcl|NC_011107. 81 AQHRGELYLFDERDGRLLMGQPLVHDYL-KANDYRQLRAATVADDLFIANLSVKPEADRT--DIKGVDPNKAGWLYIKAG 157 (826) Q Consensus 81 ~~~~g~i~v~~~~~g~~~~~~~~~~~y~-~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~--~~~~~~~~~~a~~~v~~g 157 (826) ++++++||||+++|..+.+..+...+|+ ++++.++|+|+|+||+|||||++++|++... +...+++.+++++++++| T Consensus 80 ~~~~~~irv~~~~G~~~~v~~~~~~~y~~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~~~g~v~v~~g 159 (794) T protein:vir:22 80 VFTGSGIRVFDLSGNEKQVRYPNGSNYIKTANPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGG 159 (794) T ss_pred EEcCCeEEEEecCCcEEEeecCCCccceecCCCcccEEEEEEcCEEEEEcCCeeeeEeeccccCCCCCCCceEEEEccCC Confidence 5566789999987777666656666776 4688899999999999999999999998543 344567788999999999 Q ss_pred ccCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeecc-ceEEEeecccccceeec Q lcl|NC_011107. 158 QYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGA-PEYTLPNSTKKYPKVDP 236 (826) Q Consensus 158 ~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga-~~~t~~~~~~~~~~~a~ 236 (826) +|+++|+++|++.+. +.+++|++.. ..+....++++++.+|..++.+. .+|+.. T Consensus 160 ~y~~ty~v~I~~~~~---------a~~~~p~gt~-----~~~~~~~~~~~ia~~L~~~l~~~~~~~t~~----------- 214 (794) T protein:vir:22 160 QYGRELIVHINGKDV---------AKYKIPDGSQ-----PEHVNNTDAQWLAEELAKQMRTNLSDWTVN----------- 214 (794) T ss_pred ccceeEEEEeccCcc---------eEEEEcCCCc-----cccceeechhhhhhhhhhhheeccccceEE----------- Confidence 999999999987653 5677777664 34556778889998887654321 222221 Q ss_pred cccceeecccccccccccceEEEecC--CceEEEEecCCCCcceEEEEEEeecccccccccccCcccccEeeeeeeeeEe Q lcl|NC_011107. 237 DANAATIAGYLNQRGVQDGYIAFRGD--ADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVM 314 (826) Q Consensus 237 ~~~~~t~a~~~~~~~~~~g~~~~~~~--~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 314 (826) ..++++++... ..+....+.++++++.+.+..+.++++++||+.+|.+ +..++.+ T Consensus 215 ---------------~~~~~~~i~a~~~~~~~~~t~~~g~~~t~~~~~~~~~~~~~~lp~~~~~G--------~~v~i~~ 271 (794) T protein:vir:22 215 ---------------VGQGFIHVTAPSGQQIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNG--------YMVKIVG 271 (794) T ss_pred ---------------eCCceEEEEEcCCceEEEEeeecccCcceeEEEEeccccceeccccCCCC--------eEEEEEe Confidence 12223333333 2333345567778889999999999999999988843 3455667 Q ss_pred ccCCCCcceEEEEEcCCceEEEeecccccccc--cceeEEEEEecCCCeEEEeccCcCccccCCccccCCccccCCCccE Q lcl|NC_011107. 315 ATGSTKAPVYFEWDSANRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTRGITG 392 (826) Q Consensus 315 ~~~~~~~~~y~~~~~~~~~w~e~~~~g~~~~~--~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~ 392 (826) ..++..+.||++|+..+++|+||++|++..++ .||||.++ ++++++|++++.+|++|.+||+++||+|||+|++|++ T Consensus 272 ~~~~~~~~Y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~lv-~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~i~~ 350 (794) T protein:vir:22 272 DASKSADQYYVRYDAERKVWTETLGWNTEDQVLWETMPHALV-RAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSIND 350 (794) T ss_pred CCCCCcceeEEEEeccceEEEEeeeccceeeecccceeeEee-eccCCcEEEeeccccccccCccccCCcceecCCCcce Confidence 77777899999999999999999999986666 69999998 7899999999999999999999999999999999999 Q ss_pred EEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccc Q lcl|NC_011107. 393 MTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIV 472 (826) Q Consensus 393 v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~l 472 (826) |+||||||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+++++| T Consensus 351 v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~ss~~~~~i~~~v~~~~~L~i~t~~~e~~l~~~~~l 430 (794) T protein:vir:22 351 VFFFRNRLGFLSGENIILSRTAKYFNFYPASIANLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTL 430 (794) T ss_pred EEEEcceEEEecCCeEEEEccCCccccccccCcCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEEEEEcC-C Q lcl|NC_011107. 473 TPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAA-S 551 (826) Q Consensus 473 TP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~s~-~ 551 (826) ||+|++++++|+|+|+++++|+.+|+++||++++| ++++++|+++..+..+.|+++|||+|++|||++++..+++++ + T Consensus 431 TP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~~~r~~~~~~~~d~y~~~Dlt~~~~~~~~~~~~~~~~~~~~ 509 (794) T protein:vir:22 431 TSKSVELNLTTQFDVQDRARPFGIGRNVYFASPRS-SFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTE 509 (794) T ss_pred cceeEEEEEEEEeeccCCCCceEeCCeEEEEecCC-CeeEEEEeEeeecccCceehhhHHHHHHHhcCCceEEEEEeCCC Confidence 99999999999999999999999999999999988 577776655545655569999999999999999988887755 4 Q ss_pred CCEEEEEEcCCCeEEEEEEeecCCceeeeeeEeeecCCcEEEEEEE--CCeEEEEEEeCCCEEEEEEEEeecCcccCCCc Q lcl|NC_011107. 552 SGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFT--GDNLMVLIQKGQEIALGRMHLNSLPAREGLQY 629 (826) Q Consensus 552 p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~--~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~~ 629 (826) |..++|+++++|+|++|+|||+++||+|+|||||+|+|.|+++|+. +|+||++|+|++++++|||.+ ++......+. T Consensus 510 ~~~v~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~iv~r~~~~~~~r~~~-~~~~~~~~~~ 588 (794) T protein:vir:22 510 NFCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSDMYVILRNEFNTFLARISF-TKNAIDLQGE 588 (794) T ss_pred CcEEEEEEcCCCEEEEEEEeecCCceeEEeeEEEEcCCCEEEEEEEecCCEEEEEEEeCCCEEEEEEEE-eeccccCCCc Confidence 5667899999999999999999999999999999999999988865 899999999999999999954 4444444444 Q ss_pred ccccccce-EEEeecccceeccceeeccCCcccceeeEEecCceeeeeecccceec-------CCceEEEecCCCCCceE Q lcl|NC_011107. 630 PKYDYWRR-IEATVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKRET-------NTKVFLDVPEAVVGAVY 701 (826) Q Consensus 630 ~~~d~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~~~-------~~~~~~~~~~~~~~~~v 701 (826) ++..+.+. ...+++++..+.+... ...+...++++.|++|+.+....+|..... ++..+++++++.++++| T Consensus 589 ~~~~~lD~~~~~~~~~g~~~~~~~~-t~~~~~~~~g~~~~~g~~v~~~~dg~~~~~~~~~~~~~~~~~~~v~~~~~~~~v 667 (794) T protein:vir:22 589 PYRAFMDMKIRYTIPSGTYNDDTFT-TSIHIPTIYGANFGRGKITVLEPDGKITVFEQPTAGWNSDPWLRLSGNLEGRMV 667 (794) T ss_pred cceeeeeeeEEEeeccceeecCCcc-eEEEcccccCcccccceEEEEEcCCceeeceeeeeeeeccceEEeCCCCCCcEE Confidence 44322221 2223333332221111 111223356788999999999888854322 33357899999999999 Q ss_pred EEeeeeeEEEEeCCeeEecCCCCcee----ecceEEEEEEEEeeccceEEEEecCCCCccceeeeccCccccccccccCc Q lcl|NC_011107. 702 VVGCEFWSKVEFTPPVLRDHNGLPMT----STRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGE 777 (826) Q Consensus 702 ~vGl~y~~~~~~~~~~i~~~~g~~~~----~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 777 (826) +|||+|+++++|+||++++++|.+.+ .||+||+|++++|.+||+|.+.++++.++. .+.+++.+++++++..|. T Consensus 668 ~VGl~y~s~~~~~~~~~~~~~~~~~~~~~~~grl~l~r~~~~~~~tg~~~v~v~~~~~~~--~~~~~~~~~g~~~~~~g~ 745 (794) T protein:vir:22 668 YIGFNINFVYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWVNYENSGTFDIYVENQSSNW--KYTMAGARLGSNTLRAGR 745 (794) T ss_pred EEeeeeeEEEEecceEEEecCCCccceeeecceEEEEEEEEEeccccceEEEEcCCCccc--ceeecCceecccccccCc Confidence 99999999999999999999986543 489999999999999999999999877653 467888999999999999 Q ss_pred cccceeEEEEEecccCceeEEEEEECCCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 778 PLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 778 ~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~rrv 826 (826) +++.+++++||+++|+++.+|+|+|++|+||+|++|+|||+||+|.||| T Consensus 746 ~~~~tg~~~vp~~~~~~~~~v~i~~d~p~P~tvlsi~~eg~y~~r~~~v 794 (794) T protein:vir:22 746 LNLGTGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEGNYLRRSSGI 794 (794) T ss_pred ccccCceEEEEecccCceEEEEEEECCCCCEEEEEEeEEEEEeccccCC Confidence 9999999999999999999999999999999999999999999999999 No 8 >protein:vir:97014 Length: 800 # NCBI annotation: 33 # Family: family:all:825 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654134;genbank:gi:108862018;genbank:GeneID:5075963 Probab=100.00 E-value=1.3e-221 Score=1231.52 Aligned_cols=770 Identities=21% Similarity=0.293 Sum_probs=652.8 Q ss_pred CceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCCCceEEEEE Q lcl|NC_011107. 2 SYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGGRSIAMLVA 81 (826) Q Consensus 2 ~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~~e~~~~~~ 81 (826) =.|+||||||+||||||||++|||+||++|+||+|+|+|||+||||++||+++++++.. .+++.|.++||+.||||+++ T Consensus 1 ~~v~~s~~n~~~GvSqq~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~fva~l~~~~~~-~~~~~~~~~~d~~eq~~v~~ 79 (800) T protein:vir:97 1 MEVQGSLGRQIQGISQQPPAVRLDGQCTAMVNMIPDVVNGTQSRMGTTHIAKILDAGTD-DMATHHYRRGDGDEEYFFTL 79 (800) T ss_pred CeeEeechhhhcccccCchhHhhhhhhhhhhcceeccccccccCCchhhheeecCCCcc-cceeEEEEEcCCceEEEEEE Confidence 25889999999999999999999999999999999999999999999999999988753 56778889999999999999 Q ss_pred ecCCeEEEEEcCCCEEEEecCcc-cccc--ccCCccceEEEEEcCEEEEEeCcccCcccccccCCCCCCccEEEEEcccc Q lcl|NC_011107. 82 QHRGELYLFDERDGRLLMGQPLV-HDYL--KANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAGWLYIKAGQ 158 (826) Q Consensus 82 ~~~g~i~v~~~~~g~~~~~~~~~-~~y~--~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~~~v~~g~ 158 (826) ++.+++|||+++|+.+.+..... .+|+ ++++.++|+++|+||||||+|++++|++..... .++.+++++++|+|+ T Consensus 80 ~~~~~~rv~~~~G~~~~v~~~~~~~~y~~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~--~~~~~~~~~~v~~g~ 157 (800) T protein:vir:97 80 KKGQVPEIFDKYGRKCNVTSQDAPMTYLSEVVNPREDVQFMTIADVTFMLNRRKVVKASSRKS--PKVGNKAIVFCAYGQ 157 (800) T ss_pred EcCCEEEEEecCCcEEEEecCCcceEEEeccCCCccceeEEEEcCEEEEeeCceecccccccc--cCCCcceEEEEeecc Confidence 99999999999887776655433 2354 467899999999999999999999999754433 467788999999999 Q ss_pred cCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeecc---ceEEEeecccccceee Q lcl|NC_011107. 159 YSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGA---PEYTLPNSTKKYPKVD 235 (826) Q Consensus 159 y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga---~~~t~~~~~~~~~~~a 235 (826) |+|+|+|+|+++. ++.|.+|+++. .......++++++.++...+..+ .+|+.. T Consensus 158 y~~~y~i~I~~~~---------~~~~~t~~~t~-----~~~~~~~~~~~ia~ql~~~~~~~~~~~~~t~~---------- 213 (800) T protein:vir:97 158 YGTSYSIVINGAN---------AASFKTPDGGS-----ADHVEQIRTERITSELYSKLQQWSGVSDYEIQ---------- 213 (800) T ss_pred cceeeeeccCCcc---------eEEEEEcCCCC-----cccceeccHHHHHHHHHHhhhccccccceEEE---------- Confidence 9999999998753 46777877653 34556677788888887655322 112221 Q ss_pred ccccceeecccccccccccceEEEecCCceE-EEEecCCCCcceEEEEEEeecccccccccccCcccccEeeeeeeeeEe Q lcl|NC_011107. 236 PDANAATIAGYLNQRGVQDGYIAFRGDADIH-VEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVM 314 (826) Q Consensus 236 ~~~~~~t~a~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 314 (826) ..++++++....+.. ...+.++++++.+.++.+.++++++||+++|.+ +.+.+.. T Consensus 214 ----------------~~G~~~~i~~~~~~~~~v~t~~g~~~~~~~~~~~~v~~~~~lp~~~~~g--------~~v~i~~ 269 (800) T protein:vir:97 214 ----------------RDGTSIFIERRDGASFTITTTDGAKGKDLVAIKNKVSSTDLLPSRAPAG--------YKVQVWP 269 (800) T ss_pred ----------------eCCcEEEEEEcCCceEEEEecCCcCceeeeEEeeeccchhhchhhCCCC--------cEEEEEc Confidence 112234444333322 244677788889999999999999999999864 2334444 Q ss_pred ccCCCCcceEEEEEcC---CceEEEeecccccccc--cceeEEEEEec---CCCeEEEeccCcCccccCCccccCCcccc Q lcl|NC_011107. 315 ATGSTKAPVYFEWDSA---NRRWAERAAYGTDWVL--KKMPLALRWDE---ATDTYSLNELEYDRRGSGDEDTNPTFNFV 386 (826) Q Consensus 315 ~~~~~~~~~y~~~~~~---~~~w~e~~~~g~~~~~--~t~p~~~~~~~---~~~~f~~~~~~w~~r~~gd~~tnp~psf~ 386 (826) ..+..++.||++|+.. .++|+||++++...++ .+|||.++... .+++|++++.+|++|.+|||++||+|+|+ T Consensus 270 ~~~~~~~~y~~~~~~~~~~~~~w~e~~~~~~~~~~~~~tmp~~~~~~~~~~~~g~~~~~~~~w~~r~~gd~~tnp~p~f~ 349 (800) T protein:vir:97 270 TGSKPESRYWLQAEPKEGNLVSWKETIAADVLLGFDKGTMPYIIERTDIINGIAQFKIRQGDWEDRKVGDDLTNPMPSFI 349 (800) T ss_pred cCCCCCceEEEEEEecccCcceEEEeeccccccceecccceEEEEEeecccccceeEEEeccccccccCccccCcccccc Confidence 5556678899999854 4799999999976655 58999998543 67899999999999999999999999999 Q ss_pred C----CCccEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCc Q lcl|NC_011107. 387 T----RGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKY 462 (826) Q Consensus 387 g----~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~ 462 (826) | ++|++|+||||||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+|+++++|+|||+++ T Consensus 350 ~~~~~~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~v~~i~~~v~~~~~L~i~T~~~ 429 (800) T protein:vir:97 350 DEEVPQTIGGMFMVQNRLCFTAGEAVIASRTSYFFDFFRYTVISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKS 429 (800) T ss_pred CCcCCCCceeEEEEeeeEEEecCCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeeEEeecCCcEEEEecCc Confidence 8 789999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeCCccccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCC Q lcl|NC_011107. 463 QAVVPGGGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGP 542 (826) Q Consensus 463 q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~ 542 (826) ||+|+++++|||+|++++++|+|+|+++++|+.+|++++|++++| ++++||||+|+.++|+ |+++|||+|++|||+++ T Consensus 430 q~~ls~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~fv~~~g-~~s~vre~~~~~~~d~-~~a~DlT~~~~hl~~~~ 507 (800) T protein:vir:97 430 QFILPGDKPLEKSNALLKPVTTFEVNNKVKPVVTGESVMFATNDG-SYSGVREFYTDSYSDT-KKAQAITSHVNKLIEGN 507 (800) T ss_pred EEEEeCCCcccceeEEEEEEEeeeccCCCCcEEeCCeEEEeeCCC-CeeEEEEEeeeecccc-eehhhHHHHHHHhcCCc Confidence 999999999999999999999999999999999999999999998 5789999999977776 99999999999999999 Q ss_pred eEEEEEcCCCCE-EEEEEcCCCeEEEEEEeecCCceeeeeeEeeecCC--cEEEEEEECCeEEEEEEeCCCEEEEEEEEe Q lcl|NC_011107. 543 AEYIQAAASSGY-LVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRH--QIIGAYFTGDNLMVLIQKGQEIALGRMHLN 619 (826) Q Consensus 543 v~~~~~s~~p~~-~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g--~v~~~~~~~d~l~~vv~r~~~~~~~r~~~~ 619 (826) +.+|+++++|+. ++|+++++|+|++||||++++||+|+|||||+++| .+++|++++|+||++|+|+++.++|||.++ T Consensus 508 v~~~~~~~~~~~~v~~~~~~~~~l~~~~y~~~~~e~~~~aW~~~~~~~~~~~~~~~~~~d~l~~vv~r~~~~~ler~~~~ 587 (800) T protein:vir:97 508 ITNMAASTNVNRLLVTTDKYRNIIYCYDWLWQGTDRVQSAWHVWKWPIGTKVRGMFYSGELLYLLLERGDGVYLEKMDMG 587 (800) T ss_pred eEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceEEEeEEEEecCCCeEEEEEEEcCCeEEEEEEcCCcEEEEEEecc Confidence 999999888776 57899999999999999999999999999999976 677888899999999999999999999776 Q ss_pred ecCcccCCCcccccc--------cceEEEeecccceeccceeeccCCcccceeeEEecCceeeeeecccceecCCceEEE Q lcl|NC_011107. 620 SLPAREGLQYPKYDY--------WRRIEATVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKRETNTKVFLD 691 (826) Q Consensus 620 ~~~~~~~~~~~~~d~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~ 691 (826) ........+..++|. ....+.++++.+.+....... .....+.++.|++|+.+.... +.......+..+. T Consensus 588 ~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~-~~~~~v~g~~~~~G~~v~~~~-~~~~~~~~~~~~~ 665 (800) T protein:vir:97 588 DALTYGLNDRIRMDRQAELVFKHFKAEDEWVSEPLPWVPTNPEL-LDCILIEGWDSYIGGSFLFKY-NPSDNTLSTTFDM 665 (800) T ss_pred cCcCcccccceeccccceeeeeeeecccceEeccccccCCCcce-eEEEEecccccccCceEEEEe-cCccCcccccceE Confidence 543322222233332 222234566666665432211 123445688999999987654 3444444555778 Q ss_pred ecCCCCCceEEEeeeeeEEEEeCCeeEecCCCCceeecceEEEEEEEEeeccceEEEEecCCCCccceeeeccCcccccc Q lcl|NC_011107. 692 VPEAVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSR 771 (826) Q Consensus 692 ~~~~~~~~~v~vGl~y~~~~~~~~~~i~~~~g~~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~ 771 (826) +++++.+++|+|||+|+++++|+||++++++|+++..+|+||+|++|+|.+||+|++.|++..++....+.+.+.++++. T Consensus 666 ~~~~~~~~~v~vGl~Y~~~~~~~p~~i~~~~g~~~~~~r~~i~r~~~~~~~sg~~~~~v~~~~~~~~~~~~~~~~~~g~~ 745 (800) T protein:vir:97 666 YDDSHVKAKVIVGQIYPQEFEPTPVVIRDNQDRVSYIDVPVVGLVHLNLDMYPDFSVEVKNVKSGKVRRVLASNRIGGAL 745 (800) T ss_pred EeCCCCCcEEEEeeeeeEEEEecceEEEecCCCceeecceEEEEEEEeecccccEEEEEccccCCceeeeecCccccccc Confidence 89999999999999999999999999999999999999999999999999999999999999988777777888999999 Q ss_pred ccccCccccceeEEEEEecccCceeEEEEEECCCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 772 QLNAGEPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 772 ~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~rrv 826 (826) ++..|.|++++|+++||+.+|+++.+|+|+|++|+||+|++|+|||+||+|+||| T Consensus 746 ~~~~g~~~~~tg~~~vp~~g~~~~~~v~i~~d~PlP~tvlsi~~eg~y~~r~~rv 800 (800) T protein:vir:97 746 NNTVGYVEPREGVFRFPLRAKSTDVVYRIIVESPHTFQLRDIEWEGSYNPTKRRV 800 (800) T ss_pred cccCCccccccceEEEEeecccceeEEEEEECCCCcEEEEEEEEEEEeecccccC Confidence 9999999999999999999999999999999999999999999999999999999 No 9 >protein:vir:99677 Length: 794 # NCBI annotation: Tail tubular protein B # Family: family:all:825 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249591;genbank:gi:68299742;genbank:GeneID:3799992 Probab=100.00 E-value=4.7e-221 Score=1228.51 Aligned_cols=763 Identities=22% Similarity=0.308 Sum_probs=645.9 Q ss_pred CCceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCCCceEEEE Q lcl|NC_011107. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~~e~~~~~ 80 (826) ||+|+|+||||++|||||+|++|||+||++|+||+|+|+|||+||||++||++++++.....+++.|.++|++.|+| ++ T Consensus 1 M~~i~~s~~n~~~GvS~q~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~f~~~~~~~y-~l 79 (794) T protein:vir:99 1 MALISQSIKNLKGGISQQPDILRYSDQGSKQINGFSSEVEGLQKRPPSVHIKRLTDQFGLGQKPYCHIINRDEVERY-AV 79 (794) T ss_pred CceeeeecchhhcceecCCchHHhhhhHhhhhcceeeeccCcccCCccceeeeecCCCCCccccEEEEEEeCCCceE-EE Confidence 99999999999999999999999999999999999999999999999999999999877777888999999876655 55 Q ss_pred EecCCeEEEEEcCCCEEEEec-Ccccccc-ccCCccceEEEEEcCEEEEEeCcccCcccc--cccCCCCCCccEEEEEcc Q lcl|NC_011107. 81 AQHRGELYLFDERDGRLLMGQ-PLVHDYL-KANDYRQLRAATVADDLFIANLSVKPEADR--TDIKGVDPNKAGWLYIKA 156 (826) Q Consensus 81 ~~~~g~i~v~~~~~g~~~~~~-~~~~~y~-~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~--~~~~~~~~~~~a~~~v~~ 156 (826) ++++++||||++.+|....+. +...+|+ +++++.+|+|+|+||+|||+|++++|++.. ++...++++.|+++++++ T Consensus 80 ~f~~~~irv~~~~~g~~~~v~~~~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~~~v~~ 159 (794) T protein:vir:99 80 FFTGSNIRVFDLFTGDEKTVNAPNGLSYVSSSNPRKDLRMVTVADYTFILNRNVATAQGTTNTPSGLAPFGHFGLVVIRG 159 (794) T ss_pred EEcCCeEEEEECCCCeEEEeeccccccccccCCccceeeEEEEccEEEEEcCCeeeeEeeeeccccCcCCCceEEEEecc Confidence 777899999999888776653 3445666 457888999999999999999999999864 355678888999999999 Q ss_pred cccCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeeccceEEEeecccccceeec Q lcl|NC_011107. 157 GQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDP 236 (826) Q Consensus 157 g~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga~~~t~~~~~~~~~~~a~ 236 (826) ++|+++|+++++++. ++++++|++... ......+.++++.++...+..+ +|+. T Consensus 160 g~y~~~y~v~i~gs~---------ta~~~tp~~~~~-----~~~~~~s~~~ia~~l~~~l~~~-g~~v------------ 212 (794) T protein:vir:99 160 GQYGRTYRIKVNGSV---------EASFETPLGDQV-----AHAKQIDIAYIIDQLAAGLINK-GWAV------------ 212 (794) T ss_pred CCCCceEEEEecCCc---------ccceeeccCccc-----ccccccchhhhhhhhHhhhhcc-cceE------------ Confidence 999999999998753 456777766532 2233456677777776543321 1111 Q ss_pred cccceeecccccccccccceEEEecCCce--EEEEecCCCCcceEEEEEEeecccccccccccCcccccEeeeeeeeeEe Q lcl|NC_011107. 237 DANAATIAGYLNQRGVQDGYIAFRGDADI--HVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVM 314 (826) Q Consensus 237 ~~~~~t~a~~~~~~~~~~g~~~~~~~~~~--~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 314 (826) ...++++++...... ....+.++.+++.+....+.++++++||+.+|.+ +.+.+.+ T Consensus 213 --------------~~~~g~~~i~~~~~~~v~t~s~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G--------~~v~v~~ 270 (794) T protein:vir:99 213 --------------TKGSGYFYFSKSGSVIINSLEVEDGYNGQLAWGIINDVQKTTQLPVYAPNN--------YIIRVSG 270 (794) T ss_pred --------------EeCCeEEEEEecCCceeEEEEeecCCCCceeeEEeeeccceeecccCCCCC--------eEEEEec Confidence 122344444444443 3344567778889999999999999999988853 3455666 Q ss_pred ccCCCCcceEEEEEcCCceEEEeecccccccc--cceeEEEEEecCCCeEEEeccCcCccccCCccccCCccccCCCccE Q lcl|NC_011107. 315 ATGSTKAPVYFEWDSANRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTRGITG 392 (826) Q Consensus 315 ~~~~~~~~~y~~~~~~~~~w~e~~~~g~~~~~--~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~ 392 (826) ..+..++.||++|+..+++|+||+++++..++ +||||.++ ++++++|++++.+|++|.+||+++||+|||+|++|++ T Consensus 271 ~~~~~~~~y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~~v-~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~is~ 349 (794) T protein:vir:99 271 DPTLNQDDYYVRFDASRNVWTECPAPNIKADYNKATMPHVLI-READGTFTFKQADWTHRAAGDDETNPYPSFIGNSIND 349 (794) T ss_pred cCCCCCCceEEEEEcCCceEEeeccceeecceeccceEEEEe-ccCCCceeEeeccccccccCCcccCCCccccCcceeE Confidence 77778899999999999999999999976665 69999998 6789999999999999999999999999999999999 Q ss_pred EEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccc Q lcl|NC_011107. 393 MTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIV 472 (826) Q Consensus 393 v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~l 472 (826) |+||||||+|+++++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+++++| T Consensus 350 v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~~~l 429 (794) T protein:vir:99 350 IFFFRNRLGFLSGENVILSGSGNYFNFFPESVAVLTDTDPIDVAVSTNRISILKYAVPFSEELILWSDQAQFVLSSDGGL 429 (794) T ss_pred EEEEeeeEEEecCCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEE-eeccccccccchhHHHHHHHHhcCCCe-EEEEEcC Q lcl|NC_011107. 473 TPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEM-APSPSTDSHYVAEDVTSHIPSYMPGPA-EYIQAAA 550 (826) Q Consensus 473 TP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~-~~~~~~~~~~~~~dls~~~~~~~~~~v-~~~~~s~ 550 (826) ||+|++++++|+|+|+++++|+.+|+++||++++| ++++++|+ .|+.++| .|+++|||+|++|||+|++ ..+++++ T Consensus 430 TP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~v~r~~~~~~~~d-~y~a~Dlt~~~~hl~~~~~~~~~a~~~ 507 (794) T protein:vir:99 430 TPTTIRLDLTTEFEVTEQARPYGIGRGVYFVSPRA-KFSSVRRFYAVQDVTQ-VKNAEDISAHVPYYVENGVFKMSGSST 507 (794) T ss_pred cceeEEEEEEEEeeccCCCCceEeCCeEEEEecCC-CeeEEEEeeeeccccC-ceehhhHHHHHHHhcCCCeEEEEEeCC Confidence 99999999999999999999999999999999998 56777665 5775555 5999999999999999985 5568899 Q ss_pred CCCEEEEEEcCCCeEEEEEEeecCCceeeeeeEeeecCCcEEEEEE--ECCeEEEEEEeCCCEEEEEEEEeecCcccCCC Q lcl|NC_011107. 551 SSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYF--TGDNLMVLIQKGQEIALGRMHLNSLPAREGLQ 628 (826) Q Consensus 551 ~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~--~~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~ 628 (826) +|..++|++++||+|++|||||+++||+|+|||||+|+|.++++|+ .+|+||++|+|++++++|||++++.... ..+ T Consensus 508 ~~~~~v~~~~~~g~l~~~~y~~~~~eq~v~aW~~~~~~g~~~~~~~~~~~d~l~~~v~r~~~~~ler~~~~~~~~~-~~~ 586 (794) T protein:vir:99 508 ENFLTILTEGNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVLCCDMIGAVMHLIIDSPSGVLMEKIEFTQNTKD-YPD 586 (794) T ss_pred CCcEEEEEEcCCCEEEEEEEeecCCceEEEeEEEEEcCCCeEEEEEEEcCCEEEEEEEeCCCEEEEEEEeeeCCCC-CCC Confidence 9999999999999999999999999999999999999999887775 4999999999999999999976543322 112 Q ss_pred cccccccceEEEeecccceec-------cceeeccCCcccceeeEEecCceeeeeeccccee--------cCCceEEEec Q lcl|NC_011107. 629 YPKYDYWRRIEATVDGELELT-------KQHWDLIKDASAVYQLQPVAGAYMERTHLGVKRE--------TNTKVFLDVP 693 (826) Q Consensus 629 ~~~~d~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~~--------~~~~~~~~~~ 693 (826) .++ . ..+|+...+. ........+...++++.|++|+.+...++|.... .....++.+| T Consensus 587 ~~~---~----~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~g~~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~ 659 (794) T protein:vir:99 587 EPY---R----LYVDRKIEYTFPEGSYNDDDFKTRVKLKDIYGSTPANGQYVFISLGGVTFTFDPPAGGWQANDGLIEFD 659 (794) T ss_pred ccc---c----eeeeeeeeeeecccccccCcceeEEeccccccccccCCceEEEEeCCceeeeecccceEecCccEEEec Confidence 211 1 1223333322 1111111233457889999999999999986522 2345578899 Q ss_pred CCCCCceEEEeeeeeEEEEeCCeeEecCCCCceee----cceEEEEEEEEeeccceEEEEecCCCCccceeeeccCcccc Q lcl|NC_011107. 694 EAVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTS----TRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLF 769 (826) Q Consensus 694 ~~~~~~~v~vGl~y~~~~~~~~~~i~~~~g~~~~~----gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~ 769 (826) ++.++++|+|||+|+++++|+||++++++++++.. ||+||+|++|+|.+||+|++.++++.++. .+.+.+.+++ T Consensus 660 ~~~~~~~v~vGl~y~s~~~~~~~~~~~~~~~g~~~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~~--~~~~~~~~~~ 737 (794) T protein:vir:99 660 GDLRGTKFFVGEAYTFLYEFSKFLIKTTDTADGVATEDIGRLQLRRAWVNYDKSGNFRVEVNNQGRTF--TYNMTGNRLS 737 (794) T ss_pred CCCCCcEEEEeeeeeEEEeecceEEeecCCCCceeeeccceEEEEEEEEEeecccceEEEECCCccce--eeeccccccc Confidence 99999999999999999999999999888765543 89999999999999999999999988753 4667889999 Q ss_pred ccccccCccccceeEEEEEecccCceeEEEEEECCCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 770 SRQLNAGEPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 770 ~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~rrv 826 (826) ++++..+.+++.+|+++||+.+|+++.+|+|++++|+||+|++|+|||+||+|.||| T Consensus 738 ~~~~~~g~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~e~~y~~r~~~v 794 (794) T protein:vir:99 738 TNELILGDESLDTGQFRYAVSGNATQVTVSLISDTPNPLSIIGGGWEGYYVRRSSGI 794 (794) T ss_pred cccccccccccccceEEEEecccccceEEEEEECCCCCEEEEEEEEEEEEeccccCC Confidence 999999999999999999999999999999999999999999999999999999999 No 10 >protein:vir:8887 Length: 808 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813776;genbank:gi:29366731;genbank:GeneID:1258831 Probab=100.00 E-value=1.8e-220 Score=1225.29 Aligned_cols=774 Identities=22% Similarity=0.295 Sum_probs=636.1 Q ss_pred CCceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCCCceEEEE Q lcl|NC_011107. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~~e~~~~~ 80 (826) ||+|+|+||||+||||||||++|||+||++|+||+|+|+|||+||||++||+++++++....+++.|+++|++.|+||++ T Consensus 1 M~~v~~s~~n~~~GvSqq~d~~R~~~q~~~~~N~~~~~~gG~~rRpgt~~v~~l~~~~~~~~~~~~~~~~~~~~~~y~v~ 80 (808) T protein:vir:88 1 MGLVSQSVKNLKGGISQQPDILRFSNQGALQINGWSSETQGLQKRPPTTFTKRLQNKGFLGTKPLVHLINRDAQEQYFVG 80 (808) T ss_pred CcceeeecchhccceeccchhHhhhhhhhhhhcceeeeccccccCCchheeeeeeccCCCCCCcEEEEEEeCcCceEEEE Confidence 99999999999999999999999999999999999999999999999999999998877777899999999999999998 Q ss_pred EecCCeEEEEEcCCCEEEEecCcccccc-ccCCccceEEEEEcCEEEEEeCcccCccccc--ccCCCCCCccEEEEEccc Q lcl|NC_011107. 81 AQHRGELYLFDERDGRLLMGQPLVHDYL-KANDYRQLRAATVADDLFIANLSVKPEADRT--DIKGVDPNKAGWLYIKAG 157 (826) Q Consensus 81 ~~~~g~i~v~~~~~g~~~~~~~~~~~y~-~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~--~~~~~~~~~~a~~~v~~g 157 (826) ++++| ||||+++|++..+.+..+ |+ +++++++|+++|+||+|||||++++|++..+ ....+++..|+++++|+| T Consensus 81 ~~~~~-i~v~~~~G~~~~v~~~~~--y~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~~~~~~~~~~~~~~~vr~g 157 (808) T protein:vir:88 81 FSGTG-LAVWDLKGNNYTVRGYNG--YANCANPRTDLRLITVADYTFVVNRNTVCQMGSTLTHAAYPRLDGRAIINVRGG 157 (808) T ss_pred EeCCe-EEEEEcCCceEEEeecCc--ceEecCChhheeEEEEcCEEEEEcCCcceeecccccccCCCCCCccEEEEEccc Confidence 88775 999999888887776544 44 5689999999999999999999999987544 556677888999999999 Q ss_pred ccCceeEEEEeeccccceeeeeeEEEEeecCCCCccc------cccccceEEecceeeechheeeeccceEEEeeccccc Q lcl|NC_011107. 158 QYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNP------NLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKY 231 (826) Q Consensus 158 ~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~------~~~~~~~~~~~~~ia~~l~~~~~ga~~~t~~~~~~~~ 231 (826) +|+++|+|+|++.... ...+.++..+.++.... ........+..++++.++..++.... T Consensus 158 ~y~~~y~i~i~g~~s~----~~~t~t~~~~~~s~~~v~~~~~~~~~~~~~~~~~~~ia~~l~~~~~~~~----------- 222 (808) T protein:vir:88 158 QYGRTLSITINGDGTG----SSPQASIKMPNGSAEKVPAGDPYAGMNQVDMTDASWIAAELARQLTVSL----------- 222 (808) T ss_pred ccCceEEEEEecCCcc----eeeeEeEEEccCcccceeeccceeecccCCccccccchhhheeeeeecc----------- Confidence 9999999999986432 33445666665542211 11112222333444444433222110 Q ss_pred ceeeccccceeecccccccccccceEEEecCCc--eEEEEecCCCCcceEEEEEEeecccccccccccCcccccEeeeee Q lcl|NC_011107. 232 PKVDPDANAATIAGYLNQRGVQDGYIAFRGDAD--IHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFM 309 (826) Q Consensus 232 ~~~a~~~~~~t~a~~~~~~~~~~g~~~~~~~~~--~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~ 309 (826) ...........+++++....+ .....++++++++.+.+..+.|+++++||+.+|.+ +. T Consensus 223 ------------~~~~~~~~~~~~~~~i~~~a~~~~~~~~t~~g~~~~~~~~~~~~v~~~~~lp~~~p~g--------~~ 282 (808) T protein:vir:88 223 ------------GGSGWSFQAGTGWILINAPANDNVRQIATKDGYADTLLSGFIYQVQTFTKLPANAPPG--------YL 282 (808) T ss_pred ------------cccceEEEeccceEEEEeccCceeEEEcccCCcCcceeeeeeeeccceeeccccCCCC--------cE Confidence 001111222234455544433 34455778888999999999999999999999853 33 Q ss_pred eeeEeccCCCCcceEEEEEcCCceEEEeecccccccc--cceeEEEEEecCCCeEEEeccCcCccccCCccccCCccccC Q lcl|NC_011107. 310 DGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVT 387 (826) Q Consensus 310 ~~~~~~~~~~~~~~y~~~~~~~~~w~e~~~~g~~~~~--~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g 387 (826) ..+....+...++||++|+..+++|+||++||+..++ .+|||.++ ++++++|+++.++|++|.+||++|||+|+|+| T Consensus 283 v~i~~~~~~~~~~~yv~~~~~~~~w~e~~~~~~~~~~~~~tmp~~lv-~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g 361 (808) T protein:vir:88 283 VEITGESARSGDNYWVQYDASGKVWKETAKPKIIAGFNNATLPHALV-RAADGQFDWTPLTWDGRNAGDDDTNPMPSFVG 361 (808) T ss_pred EEEEecCCCCCceeEEEEEcCCeEEEEeeeccceeeecccceeEEEE-ecCCceEEEEecccccccccccccCccceecC Confidence 4556666778899999999999999999999987776 58999998 67899999999999999999999999999999 Q ss_pred CCccEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEe Q lcl|NC_011107. 388 RGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVP 467 (826) Q Consensus 388 ~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~ 467 (826) ++|++|+||||||+|++|++|||||+||||||++++++++.|||||+++++++++++|+|+|+++++|+|||+++||+|+ T Consensus 362 ~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~i~T~~~e~~l~ 441 (808) T protein:vir:88 362 ATINDVFFFRNRLGFLSGENVVMSRTSKYFNFFPSSVATLSDDDPIDVAISHNRISILKYAVPFSEQLLLWSDQAQFVLS 441 (808) T ss_pred CceeEEEEEcceEEEeeCCeEEEEeccCcccccCCcccCCCCCccEEEEecCCccceeeEEeecCCcEEEEecCcEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCccccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEE-EeeccccccccchhHHHHHHHHhcCCCeEEE Q lcl|NC_011107. 468 GGGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHE-MAPSPSTDSHYVAEDVTSHIPSYMPGPAEYI 546 (826) Q Consensus 468 ~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e-~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~ 546 (826) ++++|||+|++++++|+|+|++.++|+.+|++++|++++| ++++++| |.|+. ..+.|+++|||+|++|||++++..+ T Consensus 442 ~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~f~~~~g-~~~~v~r~~~~~~-~~d~y~~~dlt~~~~h~~~~~~~~~ 519 (808) T protein:vir:88 442 SKTILSSKTIELDLTTEFDVSDGARPYGIGRGVYFAAPRA-SFTSLKRYYAIQD-VSDVKSAEDVSAHVPSYITNTVHAI 519 (808) T ss_pred CCCcccceeEEEEEEEEecccCCCCceEeCCeEEEEecCC-CeeEEEEEEEeee-ccCceehhhHHHHHHHhcCCCeEEE Confidence 9999999999999999999999999999999999999998 5666655 55664 4445999999999999999988777 Q ss_pred EE-cCCCCEEEEEEcCCCeEEEEEEeecCCceeeeeeEeeecCCcEEEE----EEECCeEEEEEEeCCCEEEEEEEEeec Q lcl|NC_011107. 547 QA-AASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGA----YFTGDNLMVLIQKGQEIALGRMHLNSL 621 (826) Q Consensus 547 ~~-s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~----~~~~d~l~~vv~r~~~~~~~r~~~~~~ 621 (826) ++ +++|..++|++++||+|++|||||+++||+|+|||||+|+|.++++ ++++|+||++|+|+.+.++|||.+++. T Consensus 520 ~~~~~~~~~~v~~~~~~g~l~~~~y~~~~~e~~v~aW~r~~~~g~~~~~~~~~~~~~d~l~~vV~r~~~~~ler~~~~~~ 599 (808) T protein:vir:88 520 HGSGTENFVSILSDGSPNKVFIYKFLYLDEILQQQSFSHWEFGDAATTRVLAASCIGSYCYLMIDRPEGLCLERMEFTQH 599 (808) T ss_pred EEeCCCCeEEEEEEcCCCEEEEEEEeccCCceeEEeeEEEecCCCeeEEEEEEeccCCEEEEEEEcCCcEEEEEEeeccC Confidence 55 5566678999999999999999999999999999999999877654 445999999999999999999976443 Q ss_pred CcccCCCcccccccceEEEeecccceeccceeeccCCccc------ceeeEEecCceeeeeeccccee-----cCCceEE Q lcl|NC_011107. 622 PAREGLQYPKYDYWRRIEATVDGELELTKQHWDLIKDASA------VYQLQPVAGAYMERTHLGVKRE-----TNTKVFL 690 (826) Q Consensus 622 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~g~~~~~~~~g~~~~-----~~~~~~~ 690 (826) . .+....+. .+++||..++.+...+....... -.++.|+++..+....+|.... .....++ T Consensus 600 ~----~~~~~~~~----~~~lD~~~~~~~g~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~dg~~~~~~~~~~~~~~~~ 671 (808) T protein:vir:88 600 T----IDYSIEPY----RTYMDMKKTIVLGAYNIDTNLTSFDVRTAYGGTPGPESTFYTIDQQGVLIEHEARDWATNPYI 671 (808) T ss_pred C----CCCccccc----eeeeeeeeeeccccccCccccceeecccccccccccceeEEEEcCCceEEeeecccccCcceE Confidence 2 22222222 23455555554433222111111 1234466666666666664332 2345578 Q ss_pred EecCCCCCceEEEeeeeeEEEEeCCeeEecCCCCceee----cceEEEEEEEEeeccceEEEEecCCCCccceeeeccCc Q lcl|NC_011107. 691 DVPEAVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTS----TRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPL 766 (826) Q Consensus 691 ~~~~~~~~~~v~vGl~y~~~~~~~~~~i~~~~g~~~~~----gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~ 766 (826) .++++.++++|+|||+|+++++|+||++++++|++++. ||+||+|+++++.+||+|.+.++++.++ ..+.+.+. T Consensus 672 ~~~~~~~~~~v~vGl~y~s~~~~~p~~~~~~~g~~~~~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~--~~~~~~~~ 749 (808) T protein:vir:88 672 SFVGNRAGEQMVIGKQYTFQYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWLNYEESGAFEINVNNGSSE--FVYVMTGG 749 (808) T ss_pred EeCCCccCceEEEeeeeeEEEEecceEEecCCCCcceeecccceEEEEEEEEEeecccceEEEeCCCccc--ceeeccCc Confidence 99999999999999999999999999999999987765 7899999999999999999999986654 45777888 Q ss_pred cccccccccCccccceeEEEEEecccCceeEEEEEECCCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 767 RLFSRQLNAGEPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 767 ~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~rrv 826 (826) +++++++ .+.+++.+|++++|+.+|+++.+|+|++++|+||+||+|+|||+||+|+||| T Consensus 750 ~~~~~~~-~~~~~~~tg~~~vp~~~~~~~~~v~i~~d~P~P~tilsi~~eg~y~~r~~~v 808 (808) T protein:vir:88 750 RLGIQRV-LGELSVGTGQFKFPVTGNAVNQRVTITSSNPNPLNVIGCGWEGNYIRRSSGI 808 (808) T ss_pred ccCcccc-cCccccccceEEEEecccCceeEEEEEECCCCceEEEEEEEEEEEeccccCC Confidence 8887765 7778889999999999999999999999999999999999999999999999 No 11 >protein:vir:94583 Length: 792 # NCBI annotation: Tubular tail protein B # Family: family:all:825 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919014;genbank:gi:119637778;genbank:GeneID:5179343 Probab=100.00 E-value=1.2e-220 Score=1226.36 Aligned_cols=766 Identities=23% Similarity=0.317 Sum_probs=640.9 Q ss_pred CCceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCCCceEEEE Q lcl|NC_011107. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~~e~~~~~ 80 (826) ||+|+|+||||++|||||+|++|||+||++|+||+|+|+|||+||||++||+++++.+.....++.|.++||+.|+|++ T Consensus 1 M~~i~~s~~n~~~GiSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~~~~~~~q~y~l- 79 (792) T protein:vir:94 1 MALISQSVKNLKGGISQQPNILRFPEQGSEQINGWSSETEGLQKRPPFVFTKTIGDQNALGAKPLVHLINRDSAEQYYV- 79 (792) T ss_pred CcceeeecchhhcceecCcchHHhhhhhhhhhcceeeeccccccCChhHHHHhhhcCCCCCcccEEEEEEeCCCceEEE- Confidence 9999999999999999999999999999999999999999999999999999999887776788999999988766655 Q ss_pred EecCCeEEEEEcCCCEEEEecCccccccc-cCCccceEEEEEcCEEEEEeCcccCcccccccCCCCCCccEEEEEccccc Q lcl|NC_011107. 81 AQHRGELYLFDERDGRLLMGQPLVHDYLK-ANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAGWLYIKAGQY 159 (826) Q Consensus 81 ~~~~g~i~v~~~~~g~~~~~~~~~~~y~~-a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~~~v~~g~y 159 (826) ++++++||||+++|++.++... .+|+. +++.++|+++|+||+|||+|++++|++..+.....++.+++++++++|+| T Consensus 80 ~f~~~~~rv~~~~g~~~~~~~~--~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~~~~~~~~~~~~~~~~~~~v~i~~g~y 157 (792) T protein:vir:94 80 VFTGQGVRVFDLNGKEYDVKGD--LSYVKVENPRDDLRMVTVADYTFIVNRNMVVRPDTTPLYTLKENGDCLINIRGGMY 157 (792) T ss_pred EEcCCeEEEEecCCceEEeccc--CceeeecCCcceeEEEEEcCEEEEEeCCccceeEecCcCCCCCCceEEEEccCCCc Confidence 5556779999998887776554 45664 56778899999999999999999999887777777888899999999999 Q ss_pred CceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeecc---ceEEEeecccccceeec Q lcl|NC_011107. 160 SKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGA---PEYTLPNSTKKYPKVDP 236 (826) Q Consensus 160 ~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga---~~~t~~~~~~~~~~~a~ 236 (826) +++|+++|++.. +++.+|.+.. +......++++++.++....... .+|++ T Consensus 158 ~~~y~i~i~~~~----------~~~~~~~~t~-----~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~------------ 210 (792) T protein:vir:94 158 GRTLAFTINNTK----------IAYEIAHGDA-----PEHSKQTDAQWLVKKLAGLARLNVAFKGWTF------------ 210 (792) T ss_pred ceeEEEEecCce----------eeeeeecCcc-----cceecccchhhhhhhhhhhccccccccccEE------------ Confidence 999999998753 3344444432 22333445567776665432211 11211 Q ss_pred cccceeecccccccccccceEEEecCCc--eEEEEecCCCCcceEEEEEEeecccccccccccCcccccEeeeeeeeeEe Q lcl|NC_011107. 237 DANAATIAGYLNQRGVQDGYIAFRGDAD--IHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVM 314 (826) Q Consensus 237 ~~~~~t~a~~~~~~~~~~g~~~~~~~~~--~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 314 (826) ...++++++....+ .....+.++.+++.+.++.+.++++++||+.+|.+ +.+.+.+ T Consensus 211 --------------~~~~~~~~i~~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~lp~~~~~G--------~~v~i~~ 268 (792) T protein:vir:94 211 --------------TEGPGYIHVIAPSNSQINSLSTEDGYADQLMNAVMHTSQSFSRLPVEAPNG--------YTVKIVG 268 (792) T ss_pred --------------EECCeEEEEEecCCceeeeeecccCcCcceeeeeeecccccccccccCCCC--------cEEEEEc Confidence 11233444433332 23345667788899999999999999999998854 3455666 Q ss_pred ccCCCCcceEEEEEcCCceEEEeecccccccc--cceeEEEEEecCCCeEEEeccCcCccccCCccccCCccccCCCccE Q lcl|NC_011107. 315 ATGSTKAPVYFEWDSANRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTRGITG 392 (826) Q Consensus 315 ~~~~~~~~~y~~~~~~~~~w~e~~~~g~~~~~--~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~ 392 (826) ..+++.+.||++|+..+++|+||+++|+..++ ++|||.++ ++++++|++++.+|++|.+||+++||+|+|+|++|++ T Consensus 269 ~~~~~~d~y~v~~~~~~~~w~E~~~~~~~~~~~~~tmp~~lv-~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~g~~i~~ 347 (792) T protein:vir:94 269 DTSKTSDMFYVQYDNMKKVWKEVAGWGVQKGLNGGTMPHALV-RQADGSFQMQVLPWTQRTCGDMDTNPTPSIVDQKIND 347 (792) T ss_pred cCCCCccceEEEEEcCCceEEEecccceeeeecccccCeeEE-EcCCCcEEEEeccccccccCccccCccceeccCCcce Confidence 77778899999999999999999999976665 58999998 7789999999999999999999999999999999999 Q ss_pred EEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccc Q lcl|NC_011107. 393 MTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIV 472 (826) Q Consensus 393 v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~l 472 (826) |+||||||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+|+++++|+|||+++||+|+++++| T Consensus 348 v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~~~q~~l~~~~~l 427 (792) T protein:vir:94 348 VFFFRNRLGFLAGENIVMSRTSKYFSLFPASVANLSDDDPIDVAVSHNRISILKYAVPFSEELLLWSDQAQFVLSAQGIL 427 (792) T ss_pred EEEEcceEEEecCCeEEEEccCCcccCccccccCCCCCccEEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEEE-EEcCC Q lcl|NC_011107. 473 TPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYI-QAAAS 551 (826) Q Consensus 473 TP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~-~~s~~ 551 (826) ||+|++++++|+|+|+++++|+.+|+++||++++| ++++++||++..+.++.|+++|||+|++|||+|++..+ +++++ T Consensus 428 TP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g-~~~~v~r~~~~~~~~d~y~a~DlT~~~~hl~~~~v~~~~a~~~~ 506 (792) T protein:vir:94 428 SPKSVELNLTTEFDVSDRARPFGVGRGVYFASPRA-SYTSLNRYYAVQDVSSVKSAEDMSAHVPNYIPNGVFSIRGSSTE 506 (792) T ss_pred cceeEEEEEEEEeeccCCCCceEeCCeEEEeecCC-CeeEEEeeeeeccccCceehhhHHHHHHHhcCCceEEEEEeCCC Confidence 99999999999999999999999999999999988 57888776554565556999999999999999987655 67778 Q ss_pred CCEEEEEEcCCCeEEEEEEeecCCceeeeeeEeeecCCcEEEEEE--ECCeEEEEEEeCCCEEEEEEEEeecCcccCCCc Q lcl|NC_011107. 552 SGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYF--TGDNLMVLIQKGQEIALGRMHLNSLPAREGLQY 629 (826) Q Consensus 552 p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~--~~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~~ 629 (826) |..++|+++++|+|++|||||+++||+|+|||||+++|.|+++|+ ++|+||++|+|++++++|||.+.. +.....+. T Consensus 507 ~~~vv~~~~~~g~l~~~ty~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~D~l~~~v~r~~~~~~~r~~~~~-~~~d~~~~ 585 (792) T protein:vir:94 507 NFISVLSSNAPSRIFLYKFLYLNEEIAQQSWSHWELGSNVTVLACDSIGSTMYLVLRNQSHTWMCRAHFTK-NSIDFPDE 585 (792) T ss_pred CcEEEEEEcCCCeEEEEEEeecCCceEEEeEEEEEcCCcEEEEEEeecCCEEEEEEEeCCCEEEEEEEEee-cccccCCC Confidence 889999999999999999999999999999999999998887765 589999999999999999996543 32222222 Q ss_pred cc---ccccceEEEeecccceeccceeeccCCcccceeeEEecCceeeeeecccce--------ecCCceEEEecCCCCC Q lcl|NC_011107. 630 PK---YDYWRRIEATVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKR--------ETNTKVFLDVPEAVVG 698 (826) Q Consensus 630 ~~---~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~--------~~~~~~~~~~~~~~~~ 698 (826) ++ +|....+ .....++.............++++.|++|+.+...++|... ......++.++++.++ T Consensus 586 ~~~~~lD~~~~~---~~~~~~~~~~~~~T~~~~~~~~gl~~l~G~~v~v~~dG~~~~~~~~~~~~~~~~~~i~~~g~~~a 662 (792) T protein:vir:94 586 PYRLYIDNKVKY---VIPEGSYNDDTYATTVKPVDVYGMKYWTGKFYIVASDGLVSWFEPPRGGWPNGVPMLTMSGNREG 662 (792) T ss_pred cceeeeeeeeeE---EecCcceecCceeeeeccccccCcccccCcEEEEEecCceeEeecccceecCCccEEEecCCccC Confidence 22 2222111 11112222211111122334678999999999999998542 1234557899999999 Q ss_pred ceEEEeeeeeEEEEeCCeeEecCCCCce----eecceEEEEEEEEeeccceEEEEecCCCCccceeeeccCccccccccc Q lcl|NC_011107. 699 AVYVVGCEFWSKVEFTPPVLRDHNGLPM----TSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLN 774 (826) Q Consensus 699 ~~v~vGl~y~~~~~~~~~~i~~~~g~~~----~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~ 774 (826) ++|+|||+|+++++|+||+++.++|.+. ..||+||+|++++|.+||.|.+++++..++ ..+++.+.+++++++. T Consensus 663 ~~v~VGl~y~~~~~~~~~~~~~~~g~~~~~~~~~gr~rl~r~~~~~~~tg~~~v~~~~~~~~--~~~~~~~~~~~~~~~~ 740 (792) T protein:vir:94 663 ETIYVGLAISFRYVFSKFLIKKTADDGSIATEDIGRLQLRRAWVNYEDSGAFTVEVENTSRL--FSYDMAGARLGSNVLR 740 (792) T ss_pred CeEEEeeeeeEEEEeccceeeccCCCcCccccceeeEEEEEEEEeeeccceeEEEEcCCCcc--eeeeeccceecccccc Confidence 9999999999999999999998887644 348999999999999999999999988765 3567888999999999 Q ss_pred cCccccceeEEEEEecccCceeEEEEEECCCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 775 AGEPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 775 ~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~rrv 826 (826) .|.|+..+++++||+.+|+++.+|+|++++|+||+|++|+|||+||+|.||| T Consensus 741 ~g~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlai~~eg~y~~r~~~v 792 (792) T protein:vir:94 741 AGGLNVGTGQFRFPVTGNAQLNEVRIISEHTTPLNVIGCGWEGNYLRRSSGI 792 (792) T ss_pred ccccccccceEEEEeeccCceEEEEEEECCCCCEEEEEEEEEEEEeccccCC Confidence 9999999999999999999999999999999999999999999999999999 No 12 >protein:vir:94713 Length: 785 # NCBI annotation: tail tube # Family: family:all:825 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338122;genbank:gi:77118200;genbank:GeneID:3707736 Probab=100.00 E-value=3.3e-218 Score=1212.90 Aligned_cols=760 Identities=20% Similarity=0.314 Sum_probs=632.9 Q ss_pred CCceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCCCceEEEE Q lcl|NC_011107. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~~e~~~~~ 80 (826) ||+|+|+||||++|||||+|++|||+||++|+||+|+|+|||+||||++||++++..+. ..++.|.++|++.|+|+ + T Consensus 1 M~~~~~s~~n~~~GvSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~~v~~l~~~~~--~~~~~~~f~~~~~~~y~-l 77 (785) T protein:vir:94 1 MPLITQSIKNLKGGISQQPDILRFSDQGEAQVNCWSSESDGLQKRPPTVFKRRLNIDVG--SNPKFHLINRDEQEQYY-I 77 (785) T ss_pred CcceeeecchhhcceecCCchHHhhhHHhhhhcceeeeccCcccCChhHhhhcccCCCC--cCcEEEEEEeCCCceEE-E Confidence 99999999999999999999999999999999999999999999999999999987654 45567777887776655 4 Q ss_pred EecCCeEEEEEcCCCEEEEecCccccccc-cCCccceEEEEEcCEEEEEeCcccCccccc-ccCCCCCCccEEEEEcccc Q lcl|NC_011107. 81 AQHRGELYLFDERDGRLLMGQPLVHDYLK-ANDYRQLRAATVADDLFIANLSVKPEADRT-DIKGVDPNKAGWLYIKAGQ 158 (826) Q Consensus 81 ~~~~g~i~v~~~~~g~~~~~~~~~~~y~~-a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~-~~~~~~~~~~a~~~v~~g~ 158 (826) ++++|+||||+++|.+..+.+ ..+|+. +++.++|+|+|+||+|||||++++|++... ....+++.+|+++++++|+ T Consensus 78 ~~~~~~irv~~~~G~~~~v~~--~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~~~~~~~~~~~~~~~~~~~~~~i~~g~ 155 (785) T protein:vir:94 78 VFNGSNIQIVDLSGNQYSVSG--SVDYVKSSNPRDDIRVVTVADYTFVVNRKVVVKGGSEKSHSGYNRKARALINLRGGQ 155 (785) T ss_pred EEcCCeEEEEecCCcEEEEec--CCCceeecCchhheeeEeeCCEEEEEcCCcceeeeeccCCcCCCCCCceEEEecccc Confidence 667899999998766666554 345664 467789999999999999999999998654 3445777889999999999 Q ss_pred cCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeecc-ceEEEeecccccceeecc Q lcl|NC_011107. 159 YSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGA-PEYTLPNSTKKYPKVDPD 237 (826) Q Consensus 159 y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga-~~~t~~~~~~~~~~~a~~ 237 (826) |+++|++.+++.. ++++++|++..... .....+.+++..++..++..+ .+|+.. T Consensus 156 y~~~y~i~i~g~~---------~at~~t~~~s~a~~----s~~~~s~~~i~~~l~~~l~a~~t~~t~~------------ 210 (785) T protein:vir:94 156 YGRTLKVGINGGV---------KVSHKLPAGNDAEN----DPPKVDAQAIGAALRDLLVTAYPTFTFD------------ 210 (785) T ss_pred cceeEEEeeCCcc---------eeEEEEccCccccc----cccccchHHHHHHHHHHhhccccceeEE------------ Confidence 9999999998753 46677777654322 233445556666666554332 222221 Q ss_pred ccceeecccccccccccceEEEecCCce--EEEEecCCCCcceEEEEEEeecccccccccccCcccccEeeeeeeeeEec Q lcl|NC_011107. 238 ANAATIAGYLNQRGVQDGYIAFRGDADI--HVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVMA 315 (826) Q Consensus 238 ~~~~t~a~~~~~~~~~~g~~~~~~~~~~--~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 315 (826) ..++++++.+.... ....+.++++++.+.++.+.++++++||..++++ +.+++.+. T Consensus 211 --------------~~g~~i~i~a~s~t~~~~~s~~~~~~~t~~~~~~~~~~~~~~Lp~~~~~G--------~~v~v~~~ 268 (785) T protein:vir:94 211 --------------LGSGFLLITAPSGTDINSVETEDGYANQLISPVLDTVQTISKLPLAAPNG--------YIIKIQGE 268 (785) T ss_pred --------------ecCcEEEEEecCCccccceeeecccCCeEEEEEEeeccceeccccccCCC--------CEEEEEcc Confidence 12334555444332 2345667778889999999999999999888753 44667777 Q ss_pred cCCCCcceEEEEEcCCceEEEeecccccccc--cceeEEEEEecCCCeEEEeccCcCccccCCccccCCccccCCCccEE Q lcl|NC_011107. 316 TGSTKAPVYFEWDSANRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTRGITGM 393 (826) Q Consensus 316 ~~~~~~~~y~~~~~~~~~w~e~~~~g~~~~~--~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v 393 (826) +++.++.||++|+..+++|+||++||+..++ .+|||.++ ++++++|++++.+|++|.+||+++||+|||+|++|++| T Consensus 269 ~~~~~~~y~v~~~~~~g~w~e~~~~g~~~~~~~~tmp~~l~-~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v 347 (785) T protein:vir:94 269 TNSSADEYYVMYDSNTKTWKETVEPGVVTGFDNTTMPHALV-RQSDGSFEFKALDWSKRGAGNDDTNPMPSFVDATINDV 347 (785) T ss_pred CCCCccceEEEEEcCCceEEEecccceeeeeeccccceEEE-eccCCceEEeccccccccCCCcccCCcceecccccceE Confidence 8888899999999999999999999987666 48999997 66899999999999999999999999999999999999 Q ss_pred EEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCcccc Q lcl|NC_011107. 394 TTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVT 473 (826) Q Consensus 394 ~~~q~RL~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lT 473 (826) +||||||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+|+++++|+|||+++||+|+++++|| T Consensus 348 ~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~T~~~e~~l~~~~~lT 427 (785) T protein:vir:94 348 FFYRNRLGFLSGENVIMSRSASYFAFFPKSVATLSDDDPIDVAVSHPRISILKYAVPFSEQLLLWSDEVQFVMTSSGVLT 427 (785) T ss_pred EEEeceEEEecCCeEEEEccCCcccCccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEcCCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCe-EEEEEcCCC Q lcl|NC_011107. 474 PRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPA-EYIQAAASS 552 (826) Q Consensus 474 P~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v-~~~~~s~~p 552 (826) |+|++++++|+|+|+++++|+.+|++++|++++| ++++++|+.+.++..+.|+++|||+|++|||+|++ ..+++|++| T Consensus 428 P~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~v~r~~~~~~~~d~y~~~dlt~~~~~~~~g~~~~~~a~~~~~ 506 (785) T protein:vir:94 428 SKSIQLDVGSEFALGDNARPFAVGRSVFFSAPRG-SFTSIKRYFAVADVSDVKDADDTTGHVLSYIPNGVFDIQGTGTEN 506 (785) T ss_pred ceeEEEEEEEeeeccCCCCceEeCCeEEEEecCC-CeeEEEeeeeecccccceehhhHHHHHHHhcCCCcEEEEEecCCC Confidence 9999999999999999999999999999999998 57878787665566667999999999999999975 577889999 Q ss_pred CEEEEEEcCCCeEEEEEEeecCCceeeeeeEeeecCC--cEEEEEEECCeEEEEEEeCCCEEEEEEEEeecCcccCCCcc Q lcl|NC_011107. 553 GYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRH--QIIGAYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYP 630 (826) Q Consensus 553 ~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g--~v~~~~~~~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~~~ 630 (826) ..++|++++||+|++|||||+++||+|+|||||+|+| +++++|+++|++|++++|..+.+++++++.... .+.+ T Consensus 507 ~~~~~~~~~~g~l~~~~y~~~~~e~~v~aW~r~~~~~~~~~~~~~~~~d~~~~vv~r~~g~~~~~ie~~~~~----~d~~ 582 (785) T protein:vir:94 507 YICVNSTGAYNRIYIYKFLFKDSVQLQASWSHWEFPKDDKILASASIGSTMFIVRQHQGGVDIEHLKFIKEA----TDFP 582 (785) T ss_pred cEEEEEEcCCCEEEEEEEeecCCceEEEEEEEEEeCCCeEEEEEEEeCCEEEEEEEcCCCEEEEEEEeeccc----CCCC Confidence 9999999999999999999999999999999999976 688999999999999999987788777543211 1222 Q ss_pred cccccceEEEeecccceecc--ceeeccC------CcccceeeEEecCceeeeeeccccee----cCCceEEEecCCCCC Q lcl|NC_011107. 631 KYDYWRRIEATVDGELELTK--QHWDLIK------DASAVYQLQPVAGAYMERTHLGVKRE----TNTKVFLDVPEAVVG 698 (826) Q Consensus 631 ~~d~~~~~~~~~~~~~~~~~--~~~~~~~------~~~~~~~~~~~~g~~~~~~~~g~~~~----~~~~~~~~~~~~~~~ 698 (826) .++.. .++||...+.. ..+.... ......++.|++|+.+.+.++|...+ ..+..+++++++.++ T Consensus 583 ~~~~~----~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~~g~~~leg~~v~v~adG~~~~~~~v~~~~~tl~~~g~~~~ 658 (785) T protein:vir:94 583 SEPYR----LHVDSKVSMVIPIGSFNADTYKTTVDIGAAYGGNAPSPGRYYLIDSQGAYLDLGELTSISTVITLNGDWSG 658 (785) T ss_pred Cccee----EEeeeeeEEEecCcceeccccccccccccccccCCccCCeEEEEeeCCcCccCceEcCCCcEEEecCCCCC Confidence 11111 22333333211 1111111 11223478999999999999986643 345678999999999 Q ss_pred ceEEEeeeeeEEEEeCCeeEecCCCCcee---ecceEEEEEEEEeeccceEEEEecCCCCccceeeeccCcccccccccc Q lcl|NC_011107. 699 AVYVVGCEFWSKVEFTPPVLRDHNGLPMT---STRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNA 775 (826) Q Consensus 699 ~~v~vGl~y~~~~~~~~~~i~~~~g~~~~---~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~ 775 (826) ++|+|||+|+++++|+||++++++|.++. .||+||+|++++|.+||+|.++++++.++ ..+...+.+++++ .. T Consensus 659 ~~v~vGl~y~~~~~~~~~~~~~~~~~~~~~~~~gr~~l~r~~~~~~~sg~~~v~v~~~~~~--~~~~~~~~~~g~~--~~ 734 (785) T protein:vir:94 659 RTVFIGRSYLMSYKFSRFLIKIEDDSGTQSEDTGRLQLRRAWVNYRDTGALRLIVRNGERE--FVNTFNGYTLGQQ--TI 734 (785) T ss_pred ceEEEeeeeeEEEeecceeEEecCCCcccccccccEEEEEEEEEeecccceEEEecCCCcc--ceeeecCcccCcc--cc Confidence 99999999999999999999999986544 48999999999999999999999987764 3466777787754 45 Q ss_pred CccccceeEEEEEecccCceeEEEEEECCCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 776 GEPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 776 ~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~rrv 826 (826) +.|++.+|+++||+.+|+++.+|+|+|++|+||+|++|+|||+||+|+||| T Consensus 735 ~~~~~~tg~~~vp~~g~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r~~~v 785 (785) T protein:vir:94 735 GTTNIGDGQYRFAMNGNALTTSLTLESDYPTPVSIVGCGWEASYAKKARSV 785 (785) T ss_pred cccccccceEEEEeecccceEEEEEEECCCCceEEEEEEEEEEEeccccCC Confidence 778899999999999999999999999999999999999999999999999 No 13 >protein:vir:105647 Length: 800 # NCBI annotation: putative tail tubular B protein # Family: family:all:825 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425011;genbank:gi:83571759;uniprot:Q2WC41;genbank:GeneID:3837288 Probab=100.00 E-value=4.1e-216 Score=1201.42 Aligned_cols=769 Identities=21% Similarity=0.287 Sum_probs=621.1 Q ss_pred CceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCCCceEEEEE Q lcl|NC_011107. 2 SYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGGRSIAMLVA 81 (826) Q Consensus 2 ~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~~e~~~~~~ 81 (826) =.|+||||||+||||||||++|||+||++|+||+|+|+|||+||||++||+++++++...+.+|.|.+++++.|++++++ T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~fva~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (800) T protein:vir:10 1 MEVQGSLGRQIQGISQQPPAVRLDGQCTTMVNMVPDVVNGTQSRMGTTHIAKLLDEGTDNMATHHYRRGEGDEEYFFTLK 80 (800) T ss_pred CeEEeecchhcccccccchhHhhhhhhhhhhcceeeeccCcccCCcceEEEeecCCCCCccEEEEEecCCccceEEEEEE Confidence 25889999999999999999999999999999999999999999999999999988877677777777777777777776 Q ss_pred ecCCeEEEEEcCCCEEEEecCcc-cccc--ccCCccceEEEEEcCEEEEEeCcccCcccccccCCCCCCccEEEEEcccc Q lcl|NC_011107. 82 QHRGELYLFDERDGRLLMGQPLV-HDYL--KANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAGWLYIKAGQ 158 (826) Q Consensus 82 ~~~g~i~v~~~~~g~~~~~~~~~-~~y~--~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~~~v~~g~ 158 (826) .+ +++|||+++|..+.+..+.+ ..|+ ++++.++|+++|+||+|||+|++++|++... ...++++++++++|+|+ T Consensus 81 ~g-~~~rv~~~~G~~~~v~~~~~~~~~~~~~~~~~~~l~~~tvaD~tfi~n~~~~~~~~~~--~~~~~~~~~~~~vr~g~ 157 (800) T protein:vir:10 81 KG-QVPEIFDKHGRKCNVISQDAPMTYLSEVVNPREDVQFMTIADVTFMLNRRKVVKVSNR--KSPKVGDKAIVFCAYGQ 157 (800) T ss_pred cC-CeEEEEecCCcEEEeecCCcceeeeeccCCchhhEEEEEEcCEEEEecCccccccccc--CCCCCCceEEEEEeccc Confidence 65 68999998766665554332 2333 4578889999999999999999999997533 33556678999999999 Q ss_pred cCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeeccc---eEEEeecccccceee Q lcl|NC_011107. 159 YSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAP---EYTLPNSTKKYPKVD 235 (826) Q Consensus 159 y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga~---~~t~~~~~~~~~~~a 235 (826) |+++|+|++++.. ++.+++|++.. .....+.++++++.+|...+..+. +++.. T Consensus 158 y~~~y~i~i~g~~---------~~~~~t~~~~~-----~~~~~~~s~~~i~~~L~~~l~~~~~~~~~t~~---------- 213 (800) T protein:vir:10 158 YGTSYSIIINGTT---------AASFKTPDGGS-----AEHVEQIRTERITSELYSKLQQWSGVNDYEIQ---------- 213 (800) T ss_pred cccceeEEeccce---------EEEEEecCCCc-----ccccccccHHHHHHHHHhhhhhcCcccceEEE---------- Confidence 9999999998753 46777777654 345556678888888876543221 11111 Q ss_pred ccccceeecccccccccccceEEEecCCceE-EEEecCCCCcceEEEEEEeecccccccccccCcccccEeeeeeeeeEe Q lcl|NC_011107. 236 PDANAATIAGYLNQRGVQDGYIAFRGDADIH-VEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVM 314 (826) Q Consensus 236 ~~~~~~t~a~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 314 (826) ..++++++....... ...+.+++.++.+.++.+.++++++||+.+|.+ +...+.. T Consensus 214 ----------------~~g~~i~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~Lp~~~~~g--------~~~~i~~ 269 (800) T protein:vir:10 214 ----------------RDGTSIFIERRDGKSFTVTTTDGAKGKDLVAIKNKVSSTDLLPSRAPAG--------YKVQVWP 269 (800) T ss_pred ----------------EcCcEEEEEEecCCceEEEEeecCCcceEEEEEeeccceeeccccCCCC--------ceEEEEc Confidence 112333333322222 334667788899999999999999999999864 2233344 Q ss_pred ccCCCCcceEEEEEc---CCceEEEeecccccccc--cceeEEEEEec---CCCeEEEeccCcCccccCCccccCCcccc Q lcl|NC_011107. 315 ATGSTKAPVYFEWDS---ANRRWAERAAYGTDWVL--KKMPLALRWDE---ATDTYSLNELEYDRRGSGDEDTNPTFNFV 386 (826) Q Consensus 315 ~~~~~~~~~y~~~~~---~~~~w~e~~~~g~~~~~--~t~p~~~~~~~---~~~~f~~~~~~w~~r~~gd~~tnp~psf~ 386 (826) ..+.+.+.||++|+. +.++|+||+++++..++ ++|||+++... .+++|++++.+|++|.+|||++||+|+|+ T Consensus 270 ~~~~~~~~y~~~~~~~~~~~~~w~e~~~~~~~~~~~~~tmp~~lv~~~~~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~ 349 (800) T protein:vir:10 270 TGSKPESRYWLQAEPKEGNLVSWKETIAADVLLGFDKGTMPYIIERTGIIDGIAQFKIRQGDWEDRKVGDDLTNPMPSFI 349 (800) T ss_pred CCCCCCceeEEEEEeccccceEEEeecccCceeeeecccccEEEEEeeeeecceeEEEEeccccccccCCCCCCCCchhc Confidence 445567889999985 44789999999986665 58999998654 47899999999999999999999999999 Q ss_pred C----CCccEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCc Q lcl|NC_011107. 387 T----RGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKY 462 (826) Q Consensus 387 g----~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~ 462 (826) | ++|++|+||||||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+|+++++|+|||+++ T Consensus 350 ~~~~~~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~ 429 (800) T protein:vir:10 350 DEEVPQTIGGMFMVQNRLCFTAGEAVIASRTSYFFDFFRYTVISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKS 429 (800) T ss_pred CCCCCCCceeEEEEeeeEEEeeCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeeeEeecCCcEEEEecCc Confidence 8 579999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeCCccccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCC Q lcl|NC_011107. 463 QAVVPGGGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGP 542 (826) Q Consensus 463 q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~ 542 (826) ||+|+++++|||+|++++++|+|+|+++++|+.+|++++|++++| ++++||||+|+.++|+ |+++|||+|++|||+++ T Consensus 430 q~~l~g~~~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g-~~s~vre~~~~~~~d~-~~a~DlT~~~~hl~~~~ 507 (800) T protein:vir:10 430 QFILPGDKPLEKSNALLKPVTTFEVNNKVKPVVTGESVMFATNDG-SYSGVREFYTDSYSDT-KKAQAITSHVNKLIEGN 507 (800) T ss_pred EEEEeCCCcccceeEEEEEEEeeeccCCCCceEeCCeEEEecCCC-CeeEEEEEeeeecccc-eehhhHHhHHHHhcCCc Confidence 999999999999999999999999999999999999999999998 5789999999987776 99999999999999999 Q ss_pred eEEEEEcCCCCEE-EEEEcCCCeEEEEEEeecCCceeeeeeEeeecC--CcEEEEEEECCeEEEEEEeCCCEEEEEEEEe Q lcl|NC_011107. 543 AEYIQAAASSGYL-VFGTSTADEMICHQYLWQGNEKVQNAFHRWTLR--HQIIGAYFTGDNLMVLIQKGQEIALGRMHLN 619 (826) Q Consensus 543 v~~~~~s~~p~~~-v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~--g~v~~~~~~~d~l~~vv~r~~~~~~~r~~~~ 619 (826) |.+|++|++|+++ +|+.+++|+|++|||||+++||+|+|||||+++ +.+++|++++|+||++|+|+++.++|||++. T Consensus 508 v~~~~~~~~~~~~v~~~~~~~~~l~~~~yl~~~~e~~~~aW~~w~~~~~~~~~~~~~~~d~l~~iv~r~~~~~ier~~~~ 587 (800) T protein:vir:10 508 ITNMAASTNVNRLLVTTDKYRNIIYCYDWLWQGTDRVQSAWHVWEWPMGTKVRGMFYSGELLYLLLERGDGVYLEKMDMG 587 (800) T ss_pred eEEEEEeCCCCeEEEEEEcCCCeEEEEEEeecCCceEEEEEEEEEcCCCcEEEEEEEeCCeEEEEEECCCcEEEEEEecc Confidence 9999999988876 578889999999999999999999999999975 4677888899999999999999999998654 Q ss_pred ecCcccCCCcccccccceEEEeecccceeccceee--ccCCcccceeeEEecCceeeeeecccc----eecCCceEE--- Q lcl|NC_011107. 620 SLPAREGLQYPKYDYWRRIEATVDGELELTKQHWD--LIKDASAVYQLQPVAGAYMERTHLGVK----RETNTKVFL--- 690 (826) Q Consensus 620 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~g~~~~~~~~g~~----~~~~~~~~~--- 690 (826) ...+....+..++|+....... ..++...... .+.......++.++++.......+|.. ....+..++ T Consensus 588 ~~~~~~~~~~~~lD~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~~~~~~~g~~~~~~~ 664 (800) T protein:vir:10 588 DALTYGLNDRIRMDRQAELIFK---HFKAEDEWISEPLPWTPTNPELLDCILIEGWDSYIGGSFLFKYKPSDNTLSTTFD 664 (800) T ss_pred cCccccccceeeeecceeeccc---ccccCcceEEEeccccccCCcceEEeeeccceeecCceeEEEEEecCCceEeeee Confidence 3332221222233322111100 0000000000 000111112333333332222222211 112222222 Q ss_pred EecCCCCCceEEEeeeeeEEEEeCCeeEecCCCCceeecceEEEEEEEEeeccceEEEEecCCCCccceeeeccCccccc Q lcl|NC_011107. 691 DVPEAVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFS 770 (826) Q Consensus 691 ~~~~~~~~~~v~vGl~y~~~~~~~~~~i~~~~g~~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~ 770 (826) ..+++..+.+|+|||+|+++++|+||++++++|+.++.+|+||+|++|+|.+||+|.+++++..++....+.+.+...++ T Consensus 665 ~~~g~~~~~~v~VGl~Y~s~~~~~~~~i~~~~g~~~~~~r~~i~r~~~~~~~sg~~~~~v~~~~~~~~~~~~~~~~~~g~ 744 (800) T protein:vir:10 665 MHDDNHVKAKVIVGQIYPQEFEPTPVVIRDRQDRVSYIDVPVVGLVHLNLDMYPDFSVEVKNVKSGKVRRVLASNRIGGA 744 (800) T ss_pred ecCCCcccceEEEeeeeeEEEeecceEEEcCCCcccccCCeEEEEEEEEeecCceEEEEeccCcccceeEEccCCeeccc Confidence 12678889999999999999999999999999999999999999999999999999999999988877777788888888 Q ss_pred cccccCccccceeEEEEEecccCceeEEEEEECCCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 771 RQLNAGEPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 771 ~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~rrv 826 (826) .+...|.+++.+|+++||+.+|+++.+|+|++++|+||+|++|+|||+||+|+||| T Consensus 745 ~~~~~g~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlai~~eg~y~~r~~rv 800 (800) T protein:vir:10 745 LNNTVGYVEPREGVFRFPLRAKSTDAVYRIIVESPHTFQLRDIEWEGSYNPTKRRV 800 (800) T ss_pred cccccCcccccCceEEEEEeccCceeEEEEEECCCCcEEEEEEEEEEEeecccccC Confidence 88889999999999999999999999999999999999999999999999999999 No 14 >protein:vir:7021 Length: 803 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853594;genbank:gi:31711676;genbank:GeneID:1481802 Probab=100.00 E-value=4.6e-215 Score=1195.68 Aligned_cols=759 Identities=22% Similarity=0.292 Sum_probs=623.8 Q ss_pred CceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcC--CCceEEE Q lcl|NC_011107. 2 SYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLG--GRSIAML 79 (826) Q Consensus 2 ~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd--~~e~~~~ 79 (826) =.|+|+||||+||||||||++|||+||++|+||+|+|+|||+||||++||+++++.+. ..++.|+++|+ +.||||+ T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~~va~l~~~~~--~~~~~~~~~~~~~~~e~~~~ 78 (803) T protein:vir:70 1 MEVQGSLGRQIQGISQQPPAVRLDGQCSEMVNMVPDVVEGTKSRMGTTHIAKLLEYGE--DDMAVHHYRRGGEGEEEYFF 78 (803) T ss_pred CeEEeecchhccccccCchHHhhhhhhhhhhcceeeeccccccCChhhhhhhhcCCCc--ccceeeEEEecCCCceEEEE Confidence 3689999999999999999999999999999999999999999999999999987654 45677777764 5689999 Q ss_pred EEecCCeEEEEEcCCCEEEEecCcc-cccc--ccCCccceEEEEEcCEEEEEeCcccCcccccccCCCCCCccEEEEEcc Q lcl|NC_011107. 80 VAQHRGELYLFDERDGRLLMGQPLV-HDYL--KANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAGWLYIKA 156 (826) Q Consensus 80 ~~~~~g~i~v~~~~~g~~~~~~~~~-~~y~--~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~~~v~~ 156 (826) +++++|+||||+++|++.++.+..+ ..|+ +++++++|+++|+||||||+|++++|++..++. .++++++++++|+ T Consensus 79 ~~~~~~~irv~~~~G~~~~v~~~~~~~~~l~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~--~~~~~~~~~~vr~ 156 (803) T protein:vir:70 79 IMKKGQVPEIFDKQGRKCMVQSQDAPMTYLSEVTNPREDVQFMTIADVTFMLNRKKIVKARPERS--PQVGSTAIVFMAY 156 (803) T ss_pred EEecCCeEEEEEcCCcEEEEecCCceeEEEeecCCChhheeEEEEcCEEEEecCceeeeeccccC--CCCCCceEEEEee Confidence 9999999999999888777665443 2344 457889999999999999999999999865543 4667789999999 Q ss_pred cccCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeecc---ceEEEeecccccce Q lcl|NC_011107. 157 GQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGA---PEYTLPNSTKKYPK 233 (826) Q Consensus 157 g~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga---~~~t~~~~~~~~~~ 233 (826) |+|+++|+|+|++.. ++.|.++++.. .......+.++++.++....... .+|+.. T Consensus 157 g~y~~~y~itIng~~---------~a~~~t~~~~~-----~~~~~~~~~~~ia~~l~~~~~~~~s~a~~~~~-------- 214 (803) T protein:vir:70 157 GQYGTHYKIIIDGVV---------AAGYKTRDGAE-----AHHIEDIRTESIAYNLYQSLQSWDKIADYEIQ-------- 214 (803) T ss_pred cCCcceEEEEeCCcc---------eEEEEeCCCcc-----cccccccchhhhhhhhhhheeccccccceEEE-------- Confidence 999999999998863 35677777653 22334456778888776543221 112221 Q ss_pred eeccccceeecccccccccccceEEEecC--CceEEEEecCCCCcceEEEEEEeecccccccccccCcccccEeeeeeee Q lcl|NC_011107. 234 VDPDANAATIAGYLNQRGVQDGYIAFRGD--ADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDG 311 (826) Q Consensus 234 ~a~~~~~~t~a~~~~~~~~~~g~~~~~~~--~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~~~ 311 (826) ..++++++... .+.....+.++..++.+....+.++++++||+.+|.+ +.. T Consensus 215 ------------------~~g~~~~i~~~~~~~~~~~~t~~g~~~~~~~~~~~~v~~~~~Lp~~~~~g---------~~v 267 (803) T protein:vir:70 215 ------------------LDGTSIYITRRDGSTTFDITTEDGAKGKDLVAIKYKVASTDLLPSRAPEG---------YKV 267 (803) T ss_pred ------------------ECCcEEEEEEcCCCCeeEEEeecCcCCcEEEEEEecccceeeccccCCCC---------ceE Confidence 12233333332 2223334566778899999999999999999999854 233 Q ss_pred eEeccCC-CCcceEEEEEcCC---ceEEEeecccccccc--cceeEEEEEec---CCCeEEEeccCcCccccCCccccCC Q lcl|NC_011107. 312 AVMATGS-TKAPVYFEWDSAN---RRWAERAAYGTDWVL--KKMPLALRWDE---ATDTYSLNELEYDRRGSGDEDTNPT 382 (826) Q Consensus 312 ~~~~~~~-~~~~~y~~~~~~~---~~w~e~~~~g~~~~~--~t~p~~~~~~~---~~~~f~~~~~~w~~r~~gd~~tnp~ 382 (826) .+..+++ ..+.||++|+..+ ++|+||+++|...++ +||||.++... ..++|++++.+|+.|.+|||+|||+ T Consensus 268 ~v~~~g~~~~d~y~v~~~~~~~~~~~w~e~a~~g~~~~~~~~t~p~~~v~~~~~~~~~~~~~~~~~~~~r~~gdd~tnp~ 347 (803) T protein:vir:70 268 QVWPTGSKPESRYWLQAEKQNGNIVSWKETLAADVLIGFDKSTMPYIIERTGFVNGIAQFKIRQGDWEDRKVGDDLTNPM 347 (803) T ss_pred EEEcCCCCCCceeeEEEEeccCCccceEeeeccceeeeeecccccEEEEEEEEeecceeEEEEeeccccccccccccCcc Confidence 3444544 4578999998643 689999999987766 68999997432 3478999999999999999999999 Q ss_pred ccccC----CCccEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEE Q lcl|NC_011107. 383 FNFVT----RGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVF 458 (826) Q Consensus 383 psf~g----~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~ 458 (826) |+|++ ++|++|+||||||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|| T Consensus 348 psf~~~~~~~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~i~ 427 (803) T protein:vir:70 348 PSFIDEEVPQTLGGMFMVQNRLCVTAGEAVIATRTSYFFDFFRYTAVSAVATDPFDVFSDASEVYQLKHAVTLDGSTVLF 427 (803) T ss_pred ccccCccCCCCceeEEEEeceEEEeeCCeEEEEccCCccccccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEE Confidence 99997 67999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCcEEEEeCCccccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHh Q lcl|NC_011107. 459 AKKYQAVVPGGGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSY 538 (826) Q Consensus 459 t~~~q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~ 538 (826) |+++||+|+++++|||+|++++++|+|+|+++++|+.+|++++|++++| ++++||||.|+.++|+ |+++|||+|++|| T Consensus 428 T~~~q~~l~g~~~lTP~~~~i~~~s~~~~~~~~~Pv~vg~~v~fv~~~g-~~s~vre~~~~~~~d~-y~a~Dlt~~a~hl 505 (803) T protein:vir:70 428 ADKSQFILPGDKPLEKSNVLLKPVTTFEVNNNVKPVATGESVMFATSEG-AYSGIREFYTDSYSDT-KKAQAITSHVNKL 505 (803) T ss_pred ecCcEEEEeCCCcccceeEEEEEEEEeeccCCCccEEeCCeEEEeccCC-CeeEEEEEeccccccc-eehhhhhhhhHhh Confidence 9999999999999999999999999999999999999999999999998 5789999999987776 9999999999999 Q ss_pred cCCCeEEEEEcCCCCEEE-EEEcCCCeEEEEEEeecCCceeeeeeEeeecCCcEEEEEEE--CCeEEEEEEeCCCEE-EE Q lcl|NC_011107. 539 MPGPAEYIQAAASSGYLV-FGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFT--GDNLMVLIQKGQEIA-LG 614 (826) Q Consensus 539 ~~~~v~~~~~s~~p~~~v-~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~--~d~l~~vv~r~~~~~-~~ 614 (826) |+++|.+|+++++|++++ |+.+++++|++||||++++||+|+|||||+|+|.|+++|++ +|+|||+|+|+++++ +| T Consensus 506 ~~~~v~~~~~~~~~~~~v~~~~~~~~~l~~~~yl~~~~e~~v~aW~r~~~~g~~~~~~~~~~~d~l~~vv~r~~~g~~ie 585 (803) T protein:vir:70 506 LEGNVIMMSASTNVNRLLVLTDKYRNIIYCYDWLWQGTERVQAAWHKWEWPLGTFIRGMFYSGEHLYLLIERGSTGVYLE 585 (803) T ss_pred cCCceEEEEEeCCCCeEEEEEEcCCCeEEEEEEEecCCcEEEEeEEEEEcCCCEEEEEEEecCCEEEEEEEECCCeEEEE Confidence 999999999999988765 66789999999999999999999999999999999998887 899999999998874 67 Q ss_pred EEEEeecCcccCCCcccccccceEEEeecccceeccceeecc---------CCcccceeeEEec--------Cceeeeee Q lcl|NC_011107. 615 RMHLNSLPAREGLQYPKYDYWRRIEATVDGELELTKQHWDLI---------KDASAVYQLQPVA--------GAYMERTH 677 (826) Q Consensus 615 r~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~--------g~~~~~~~ 677 (826) ||.+....... ...++++|++.++.+...... .....+.++.+.. ++.+.... T Consensus 586 r~~~~~~~~~~----------~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 655 (803) T protein:vir:70 586 RMDMGDALVYN----------LNDRIRMDRQAELIFRHIKAEDVWVSEPLPWQPTDVTLLDCVLIDGWDSYIGGSFLFSY 655 (803) T ss_pred EEecccccccC----------CcceeEeccceeEeeccccCCceeeeecccccCcccceeeEEEeeeeeeecCCeEEEEE Confidence 77543322211 122345555555443322111 1111122233332 22222111 Q ss_pred cccceecCCceEEEecCCCCCceEEEeeeeeEEEEeCCeeEecCCCCceeecceEEEEEEEEeeccceEEEEecCCCCcc Q lcl|NC_011107. 678 LGVKRETNTKVFLDVPEAVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPN 757 (826) Q Consensus 678 ~g~~~~~~~~~~~~~~~~~~~~~v~vGl~y~~~~~~~~~~i~~~~g~~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~ 757 (826) ..... ......+.++++..+++|+|||+|+++++|+||++++++|+.+..+|+||+|++|+|++||+|.+.|++..+.. T Consensus 656 ~~g~~-t~~~~~~~~~~~~~a~~v~VGl~Y~~~~~~~~~~i~~~~~~~~~~~~~rl~r~~~~~~~sg~~~v~v~~~~~~~ 734 (803) T protein:vir:70 656 NPGDN-TLTTTFDMHDDDHVKAKVVVGQLYPQEFEPTQVVIRDNQERVSYIDVPTVGLVHLNLDKYPDFKVEVKNLKSGK 734 (803) T ss_pred cCCCc-cceeeeeEECCCCcccEEEEeeeeeEEEeecceEEEcCCCccccccccEEEEEEEEeecccceEEEEecCCccc Confidence 11111 11233456789999999999999999999999999999999999999999999999999999999999998887 Q ss_pred ceeeeccCccccccccccCccccceeEEEEEecccCceeEEEEEECCCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 758 QPWYDTTPLRLFSRQLNAGEPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 758 ~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~rrv 826 (826) ...+.+++.++++.++.+|.|+..+|+++||+.+++++.+|+|++++|+||+|++|+|||+||+|+||| T Consensus 735 ~~~~~~s~~~~g~~~~~~g~~~~~tg~~~vP~~~~~~~~~v~i~~d~P~P~tvlsi~weg~y~~r~rrv 803 (803) T protein:vir:70 735 VRNVLASNRVGGAINNIVGYVEPREGVFKFPLRSLSTDTVYRVMVESPHTFQLRDIEWEGSYNPTKRRV 803 (803) T ss_pred cceeeccchhccccccccCccccccceEEEEeeccCcceEEEEEECCCCCeEEEEEEEEEEEecccccC Confidence 777788899999999999999999999999999999999999999999999999999999999999999 No 15 >protein:vir:100022 Length: 976 # NCBI annotation: T7-like tail tubular protein B # Family: family:all:825 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214208;genbank:gi:61806431;genbank:GeneID:3294702 Probab=100.00 E-value=1.3e-212 Score=1182.19 Aligned_cols=800 Identities=17% Similarity=0.239 Sum_probs=605.3 Q ss_pred CCceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccc----eeEEEEEcCCCce Q lcl|NC_011107. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPR----PFLYHTNLGGRSI 76 (826) Q Consensus 1 M~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~----~~~~~~~rd~~e~ 76 (826) ||+|+|+||||++|||||||.+|+||||++|+||+|||+.||+||||++||+.|.+.+....+ ..+|+++||+.|+ T Consensus 1 M~~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRp~~~~v~~l~~~~~~~~~~~~~~~~~~~~r~~~e~ 80 (976) T protein:vir:10 1 MASVTQTIPTLTGGLSQQPDELKIPGQVSVANNVIPDVTHGLLKRPGGKLVASISDNGTAALNSQTNGKWFSYYRDETES 80 (976) T ss_pred CcceeecchhhhCcceecchhhcCCchhhhhhccccccccccccCCcceeeeeecCCCcccccccccceEEEEEcCCCcE Confidence 999999999999999999999999999999999999999999999999999999988765544 4569999999999 Q ss_pred EEEEEecCCeEEEEEcCCCEEEEecCc------cccccccCCccceEEEEEcCEEEEEeCcccCcccccccCCCCCCccE Q lcl|NC_011107. 77 AMLVAQHRGELYLFDERDGRLLMGQPL------VHDYLKANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAG 150 (826) Q Consensus 77 ~~~~~~~~g~i~v~~~~~g~~~~~~~~------~~~y~~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a 150 (826) |++++.++|.|+|||+.+|+.+.+... ...|+++++.++|++++|||||||+|+++.|++..+.. ...++.| T Consensus 81 y~~~~~~~g~~~v~~~~~G~~~~v~~~~~~~~~~~~yl~~~~~~~~~~~tv~d~tfi~N~~~~~~~~~~~~--~~~~~~~ 158 (976) T protein:vir:10 81 YIGQVSRSGDINMWRCSDGQAMTVNYDSGTATALTTYLTHTNDEDIQTLTLNDYTFLTNRTKTVAMSSTVE--PVRPPEV 158 (976) T ss_pred EEEEEecCCceEEEEccCCeEEEEEcCCCcccccchhhccCCcceeEEEEEccEEEEecCceEEeeccccc--CCCCceE Confidence 999999999999999988888876432 23488899999999999999999999999999876544 4455679 Q ss_pred EEEEcccccCceeEEEEeeccccceeeeeeEEEE---eecC------CC---------------Cccc------------ Q lcl|NC_011107. 151 WLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATY---VTPD------NA---------------STNP------------ 194 (826) Q Consensus 151 ~~~v~~g~y~~~y~v~i~g~~~s~~tt~~~tasy---ttp~------g~---------------~t~~------------ 194 (826) +++||+|+|+|+|+|+|++...+...++..+... .++. ++ .+.. T Consensus 159 ~~~v~~~~y~~~y~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~s~~~G~~~~~~~v~~ 238 (976) T protein:vir:10 159 FIDLKATAYARQYAVNLFDNTTTTAVSTVTRIDVELIKSSNNYCDSNGAMVARTSRPSNSTRCDDSAGDGRDAYAPNVGT 238 (976) T ss_pred EEEeeeeccceEEEEEEcCCcccceeeeeeeeeeccccCCcccccccccchhhHhHhhhhhcccccccccccccCceeee Confidence 9999999999999999988765543222211000 0000 00 0000 Q ss_pred -------------cccccceEE-------------ecceeeec----------------hh-ee----e-------eccc Q lcl|NC_011107. 195 -------------NLAEAPFQT-------------SVGYIAWQ----------------LY-GK----F-------FGAP 220 (826) Q Consensus 195 -------------~~~~~~~~~-------------~~~~ia~~----------------l~-~~----~-------~ga~ 220 (826) ......... ....+... +. .. + .+.. T Consensus 239 ~~f~~~~G~~~~i~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~Y~~~y~~~~~v~~~~~ 318 (976) T protein:vir:10 239 KVFNVTDGASLTDEANSGSYTYTIDVKDSSNNSVNRGVNLYFRIRTVGQSVPFTTGSGSSATTTYQARYTTTFDLLYGGT 318 (976) T ss_pred eEEEeccCccceEcCCcceEEEEeeccccceEEeecCCceEEEEccccceeecccccccceeeeeeEEEEeEEEEecCCC Confidence 000000000 00000000 00 00 0 0000 Q ss_pred eE--------EEeec----cc----------ccceeec----cc-----cceeecc----cc-------c-ccccccceE Q lcl|NC_011107. 221 EY--------TLPNS----TK----------KYPKVDP----DA-----NAATIAG----YL-------N-QRGVQDGYI 257 (826) Q Consensus 221 ~~--------t~~~~----~~----------~~~~~a~----~~-----~~~t~a~----~~-------~-~~~~~~g~~ 257 (826) +| ++... .. ....+.+ .. ....++. .. + +....++++ T Consensus 319 g~~~~~~~~V~v~g~~Y~it~~~~~~~~~~a~~~~~~~~~t~~d~~~~~~~~~ia~~L~~~l~a~~~~~g~tv~~~g~~~ 398 (976) T protein:vir:10 319 GWQEGDYFYVWMKDGYYKITVEAISTANVQANLGLIRPNPTPFDTETAVTAESIIGDIRTAIIATGNFTSANVQQIGTGL 398 (976) T ss_pred CcccCceEEEEccccceeeEEEEeeceeEEeccccccCcCCcCcccccccHHHHHHHHHHhhcccccccceEEEEcCcEE Confidence 11 00000 00 0000000 00 0000000 00 0 111233455 Q ss_pred EEecCCceEEEEecCCCCcceEEEEEEeecccccccccccCcccccEeeeeeeeeEeccCCCCcceEEEEEcCC-----c Q lcl|NC_011107. 258 AFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVMATGSTKAPVYFEWDSAN-----R 332 (826) Q Consensus 258 ~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~-----~ 332 (826) ++.+++.. +..+.+ .++.++++.++|++++|||+++|++ +.+ .++.+++..|.||++|++.+ + T Consensus 399 ~i~~~~~~-~~~s~~--~~~~~~~~~~~V~~~~~LP~~~~~g----~~v-----~V~~~~~~~d~yyv~~~~~~~~~~~~ 466 (976) T protein:vir:10 399 YVTRPSGT-FNVTAP--SSDLLRVMSGEVANVDDLPSQCKHG----YVV-----KVANSEADADDYYVKFFGHNNRDGDG 466 (976) T ss_pred EEEecCcc-eEecCC--CceeEEEEEeeecchhhhhhhccCC----cEE-----EEecCCCCceeEEEEeeccccccccc Confidence 55554442 222333 3457899999999999999998853 222 25788888899999998755 5 Q ss_pred eEEEeecccccccc--cceeEEEEEecCCCeEEEeccCcCccccCCccccCCccccCCCccEEEEEcceEEEecCCeEEE Q lcl|NC_011107. 333 RWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRLVLLSQEYVCM 410 (826) Q Consensus 333 ~w~e~~~~g~~~~~--~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~~~q~RL~f~~~~~v~~ 410 (826) +|+||++||...++ ++|||.|+ ++++|+|.+++++|+.|.|||++|||+|+|+|++|++|+||||||+|++|++||| T Consensus 467 ~w~E~~~~g~~~g~~~~tmP~~l~-~~~~g~f~~~~~~w~~r~vGd~~tnp~psf~g~~is~v~f~q~RL~f~s~~~v~~ 545 (976) T protein:vir:10 467 VWEECAKPSRNIEFDKGTMPIQLV-RQANGTFTVSQATWQNAEVGDELTNPNPSFVGKTINQLVFFRNRLVFLSDENVIM 545 (976) T ss_pred eEEEeeccccccccccccccEEEE-ecccCeEEeeeccccccccCCcccCcCceecccccceEEEEcceEEEecCCeEEE Confidence 89999999976665 69999997 7899999999999999999999999999999999999999999999999999999 Q ss_pred EecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCC-ccccccceEEEEEEeecccc Q lcl|NC_011107. 411 SASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGG-GIVTPRTAVISITTQYDLDT 489 (826) Q Consensus 411 S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~-~~lTP~~~~~~~~s~~~~~~ 489 (826) ||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+++ ++|||+|++++++|+|+|++ T Consensus 546 Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~e~~lsg~~~~lTP~t~~i~~~s~~~~~~ 625 (976) T protein:vir:10 546 SRPGEFFNFWSKTATTFTPQDVIDLSCSSTYPAIVYDGIQVNAGLLLFTKNQQFMLTTDSDILSPETAKINAVSSYNFNE 625 (976) T ss_pred EecCCccccccccccCCCCCccEEEEecCCcceeeEEEEecCCcEEEEecCceEEEecCCceecceeEEEEEEEeeeccC Confidence 99999999999999999999999999999999999999999999999999999999985 59999999999999999999 Q ss_pred CCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEEEEEcCCCCEEEEEEcCCCeEEEEE Q lcl|NC_011107. 490 RAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAASSGYLVFGTSTADEMICHQ 569 (826) Q Consensus 490 ~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~s~~p~~~v~~~~~~g~l~~~t 569 (826) .++|+.+|++++|++++| ++++++||.+++++++ +.++|||+|++|||++.+..|++|++|.+++|+++++|+|++|| T Consensus 626 ~v~Pv~vG~~v~Fv~~~g-~~~r~~~~~~~~~~~~-~~~~dlt~~~~~l~~g~~~~~a~~~~~~~vv~~~~~~g~l~~~t 703 (976) T protein:vir:10 626 KTHPVSLGTTVAFIDNAN-QFTRFFEMSNVVRQGE-PDVVDQSKVISRLLDKNISLVSVSRENSVVFFSQKDTDKIYCFR 703 (976) T ss_pred CCccEEeCCeEEEEecCC-CeEEEEEEeecccccc-cchhHHHHHhhhhcCCceEEEEEcCCCcEEEEEEcCCCEEEEEE Confidence 999999999999999988 6899999999998886 78899999999999999999999999999999999999999999 Q ss_pred EeecCCceeeeeeEeeecCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEeecCcccCCC-----------c---cccccc Q lcl|NC_011107. 570 YLWQGNEKVQNAFHRWTLRHQIIGAYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQ-----------Y---PKYDYW 635 (826) Q Consensus 570 yl~~~~e~~v~aW~~w~~~g~v~~~~~~~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~-----------~---~~~d~~ 635 (826) |||+++||+|+|||||+|+|+|++||+++|+|||+|+|+++++++||.++..+...... . .++|.. T Consensus 704 y~~~~~eq~v~aWsr~~~~G~v~sv~~i~D~ly~vV~r~~~g~~~r~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~ 783 (976) T protein:vir:10 704 YFTSGEKRLLQAWTTWTITGNIQYHCMLDDALYVVTRNNNKDQIVKYSLKLDDAGHFVTDTQGTTSTDDDSIYRVHLDHS 783 (976) T ss_pred EeecCCceeEEeeEEEecCCcEEEEEEeCCeEEEEEEecCCeEEEEEEEEECCccceeeeccCccccccCCcceeeeccc Confidence 99999999999999999999999999999999999999999999999665433221110 0 122222 Q ss_pred ceEEEeecccceeccceeeccCCcccceeeEEecCceeeeeecccce------ecCCceEEEecCCCCCceEEEeeeeeE Q lcl|NC_011107. 636 RRIEATVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKR------ETNTKVFLDVPEAVVGAVYVVGCEFWS 709 (826) Q Consensus 636 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~------~~~~~~~~~~~~~~~~~~v~vGl~y~~ 709 (826) .. .+..+.++.+.......+. ..+....++.+...+++... ......++.++++..+++|||||+|++ T Consensus 784 ~~---~~~~~~t~~~~t~~t~~~~---~~~~~~~~~~~~~~~d~~~~~~~~~~~~v~g~~i~l~g~~~~~~v~VGl~Y~s 857 (976) T protein:vir:10 784 SS---VTAASNTYNTTTIKTTIPK---PNGYESTKQLVAYDTDAGNDLGRYALVTVSGSNLEIPGNWSNNSFIIGYLYEM 857 (976) T ss_pred eE---EEeccccccCCceeEEeec---CccccCceeEEEEecccCcccccceeeeecCCeeEecCCCCCCeEEEeeeeEE Confidence 11 1122223322221111111 01111122222222222111 112234688899999999999999999 Q ss_pred EEEeCCeeEecCCCCceee---cceEEEEEEEEeeccceEEEEecCCCCccceeeeccCccccccccccCccccc-eeEE Q lcl|NC_011107. 710 KVEFTPPVLRDHNGLPMTS---TRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLVD-SAVV 785 (826) Q Consensus 710 ~~~~~~~~i~~~~g~~~~~---gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-tg~~ 785 (826) +++|+||+|++++|+++.. +||+|+|++|+|.+||.|++.+++..+.... +.+...+ .+....+.+|+. .+.+ T Consensus 858 ~~~~~~~~i~~~~g~~~~~~~~gRl~i~r~~~~~~~tg~~~v~v~~~~~~~~~-~~~~~~~--~~~~~~~~~pl~~~~~~ 934 (976) T protein:vir:10 858 DVQLPTLYVTQQVGDKYRSDAKSSLIVHRIKFSFGPLGVYSTTIQRDGKPDFT-ETKELGL--AGVVGASRLPIVPEVIE 934 (976) T ss_pred EEeecceeEEeCCCCcccccceeeEEEEEEEEEeecccceEEEEcCCCCcccc-ccccccc--cCcccccccceecCcEE Confidence 9999999999998876544 7899999999999999999999886654322 2222111 233344455655 4568 Q ss_pred EEEecccCceeEEEEEECCCCCEEEEEEEEEEEeecccc-cC Q lcl|NC_011107. 786 PLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYR-RV 826 (826) Q Consensus 786 ~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~r-rv 826 (826) +||+.+|+++.+|+|+|++|+||+||+|+|||+||+|+| || T Consensus 935 ~vP~~~~~~~~~v~i~~d~PlP~tilsi~~eg~yn~r~~r~~ 976 (976) T protein:vir:10 935 TVPCYERNTNLKVNVKSEHPAPATLYSLAWEGDFTNRFYKRV 976 (976) T ss_pred EEEeccCCceeEEEEEECCCCceEEEEEEEEEEeccceeecC Confidence 999999999999999999999999999999999999998 77 No 16 >protein:vir:103341 Length: 806 # NCBI annotation: tail tubular protein B-like protein # Family: family:all:825 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039670;genbank:gi:125999999;genbank:GeneID:4818417 Probab=100.00 E-value=1e-212 Score=1182.73 Aligned_cols=765 Identities=20% Similarity=0.275 Sum_probs=616.0 Q ss_pred CceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCCCceEEEEE Q lcl|NC_011107. 2 SYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGGRSIAMLVA 81 (826) Q Consensus 2 ~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~~e~~~~~~ 81 (826) =.|+||||||+||||||||++|||+||++|+||+|+|+|||+||||++||++++.... ..+++|.++|+..++.|+++ T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~~Gl~rRPgt~~va~l~~~~~--~~~~~~~~~~~~~~~~y~v~ 78 (806) T protein:vir:10 1 MEVQGSYGRQLQGVSQQPIAVRLPGQVTSQLNAVPNVVDGLKTRMGSKHLARILNSLD--ANSLIHHYKRGDDAEEYFVI 78 (806) T ss_pred CeeEeecchhccceeccChhHhhhhhhhhhhcceeccccccccCCchhhhhhhcCCCC--ccceEEEEEecCCceEEEEE Confidence 4799999999999999999999999999999999999999999999999999987543 45677888986666667788 Q ss_pred ecCCeEEEEEcCCCEEEEecCcc--cccccc--CCccceEEEEEcCEEEEEeCcccCcccccccCCCCCCccEEEEEccc Q lcl|NC_011107. 82 QHRGELYLFDERDGRLLMGQPLV--HDYLKA--NDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAGWLYIKAG 157 (826) Q Consensus 82 ~~~g~i~v~~~~~g~~~~~~~~~--~~y~~a--~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~~~v~~g 157 (826) +.+|.||||+..+|..+.+...+ ..|+.+ +++++|+++|+||||||+|++++|++...+. .+++.|+++++++| T Consensus 79 ~~~g~i~v~~~~~G~~~~v~~~~~~~~yl~~~~~~~~~l~~~tvaD~tfi~n~~~~~~~~~~~~--~~~~~~~~v~v~~g 156 (806) T protein:vir:10 79 LQPGQVPVIFTVGGLACPVNTQGSAATYLSSSSLPRETTQLMTIGDYTFVLNRKMPVQARGDVT--PSLDNKGLVYVAYA 156 (806) T ss_pred EcCCcEEEEEcCCCcEEEecCCCceEEEeccCCCCcceeeEEEEcCEEEEecCcEeeeeccccc--CCCCcceEEEEeec Confidence 88899999998888877765433 336553 4778999999999999999999999765443 35667899999999 Q ss_pred ccCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeecc----ceEEEeecccccce Q lcl|NC_011107. 158 QYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGA----PEYTLPNSTKKYPK 233 (826) Q Consensus 158 ~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga----~~~t~~~~~~~~~~ 233 (826) +|+++|+++|++.. ++.+++|++... .......+++++.++..++... .+|+... T Consensus 157 ~y~~~y~i~Ing~~---------~a~~~t~~~~~~-----~~~~~~~~~~~a~~l~~~l~~~~~~~~~~~~~~------- 215 (806) T protein:vir:10 157 NFSFTYQILINGQV---------AAEHKTASSEDV-----KNEDLVRTDYVAGKLLENFNSRTASFPGFSMYQ------- 215 (806) T ss_pred ccCceeeEEeccce---------EEEEEeccCCCc-----ccccccchhHHHHHHHhhhcccccccceeEEEE------- Confidence 99999999998753 467777776532 2233445566666665543221 1111111 Q ss_pred eeccccceeecccccccccccceEEEecC-CceEEEEecCCCCcceEEEEEEeecccccccccccCcccccEeeeeeeee Q lcl|NC_011107. 234 VDPDANAATIAGYLNQRGVQDGYIAFRGD-ADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGA 312 (826) Q Consensus 234 ~a~~~~~~t~a~~~~~~~~~~g~~~~~~~-~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~~~~ 312 (826) .+..+++... .......+.++++++.+.++.++++++++||+.+|.+. .+.+ T Consensus 216 -------------------~g~~~~i~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~lp~~~~~g~--------~v~i 268 (806) T protein:vir:10 216 -------------------DGNVLVVDNSNGANYALTTVDGADGQDLVAIRHKVTNLDTLPNRAPVGY--------KVQV 268 (806) T ss_pred -------------------cccEEEEecCCCCccEEEEeeCCCCceeEEeecccCccccCccccCCCc--------EEEE Confidence 1112222222 22233446788999999999999999999999988642 2233 Q ss_pred EeccCCCCcceEEEEEc---CCceEEEeeccccccc--ccceeEEEEEec----CCCeEEEeccCcCccccCCccccCCc Q lcl|NC_011107. 313 VMATGSTKAPVYFEWDS---ANRRWAERAAYGTDWV--LKKMPLALRWDE----ATDTYSLNELEYDRRGSGDEDTNPTF 383 (826) Q Consensus 313 ~~~~~~~~~~~y~~~~~---~~~~w~e~~~~g~~~~--~~t~p~~~~~~~----~~~~f~~~~~~w~~r~~gd~~tnp~p 383 (826) ....+...+.||++|+. +.++|+|+++|+...+ .++|||.++... .+++|.++.++|++|.+|||++||+| T Consensus 269 ~~~~~~~~~~y~v~~~~~~~~~~~w~e~~~~~~~~~~~~~t~p~~~v~~~~~~~~~~~~~~~~~~w~~r~~Gd~~tn~~p 348 (806) T protein:vir:10 269 WPTGSKPESRYWLQAESQDGSKVTWVETIAPGVRKGWNAATMPHVLVRESLNANGSANFTYRPGEWEDRDVGDDLTNDFP 348 (806) T ss_pred eccCCCCCCceEEEEEeeccCceEEEeecccccccceeccccceEEEeeeeeecccceeEEEecccccccccccccCccC Confidence 33334556889999954 5678999999996544 479999998543 37899999999999999999999999 Q ss_pred cccC----CCccEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEe Q lcl|NC_011107. 384 NFVT----RGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFA 459 (826) Q Consensus 384 sf~g----~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t 459 (826) +|+| ++|++|+||||||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+||| T Consensus 349 sF~~~~~~~~it~v~f~q~RL~f~s~~~v~~Srsgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~t 428 (806) T protein:vir:10 349 SLLNDSSPQPISSMLMVQNRLMLTSGEAVVASRTSRFFDFFRYTVLATVDTDPFDVFADIEEVYNIRWSAQMDGDVVLFT 428 (806) T ss_pred cccCCCCCccceEEEEEeeeEEEecCCeEEEEccCCcccCccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEe Confidence 9998 689999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCcEEEEeCCccccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhc Q lcl|NC_011107. 460 KKYQAVVPGGGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYM 539 (826) Q Consensus 460 ~~~q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~ 539 (826) +++||+|+++++|||+|++++++|+|+|+++|+|+.+|++++|++++| ++++||||+|+.++|+ |+++|||+|++||| T Consensus 429 ~~~q~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g-~~s~vre~~y~~~~d~-~~~~DlT~~~~hl~ 506 (806) T protein:vir:10 429 SDQQFTLPGDKPLTPTSAVIRPVTQFKMTPGVKPAPSGDSILFAFDQG-SYSGIREFFTDSYSDT-KKAQPATSHVDKYI 506 (806) T ss_pred cCcEEEEeCCCcccceeEEEEEEEeecccCCCCceEeCCeEEEeeCCC-CeeEEEEEEeeeeccc-eehhhHHHHHHHhc Confidence 999999999999999999999999999999999999999999999998 6789999999987786 89999999999999 Q ss_pred CCCeEE-EEEcCCCCEEEEEEcCCCeEEEEEEeecCCceeeeeeEeeecCCc--EEEEEEECCeEEEEEEeCCC-EEEEE Q lcl|NC_011107. 540 PGPAEY-IQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQ--IIGAYFTGDNLMVLIQKGQE-IALGR 615 (826) Q Consensus 540 ~~~v~~-~~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~--v~~~~~~~d~l~~vv~r~~~-~~~~r 615 (826) +|++.. ++++++|..++|++++||+|++|||||+++||+|+|||||+++|. +++|++++|+||++|+|++. +.+.+ T Consensus 507 ~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~e~~v~aW~rw~~~~~~~~~~~~~~~d~l~~vv~R~~~~~g~~~ 586 (806) T protein:vir:10 507 RGKVLELSASSSFNRAFIITSPDRNILYVYDWLYEGTEKVQNAWHKWSFPAGTVLHAVSYSNEKLYLVLTRTNTSGGVAG 586 (806) T ss_pred CCCeEEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceEEEeEEeeeeCCCeEEEEEEEecCeEEEEEEEcCCcccEEE Confidence 997655 566677778899999999999999999999999999999999765 77889999999999999873 34445 Q ss_pred EEEeecCcccCCCcccccccceEEEeecccceeccceeecc--CCcccceeeEEecCceeeeeecccceecCC------- Q lcl|NC_011107. 616 MHLNSLPAREGLQYPKYDYWRRIEATVDGELELTKQHWDLI--KDASAVYQLQPVAGAYMERTHLGVKRETNT------- 686 (826) Q Consensus 616 ~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~------- 686 (826) +++|+++.....+.... + ...+|+..++.+...... .+...+.++.|++|..+...++|....... T Consensus 587 ~~iE~~~~~~~~~~~~~-~----~~~lD~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~g~~~~~g~~~~~~~~ 661 (806) T protein:vir:10 587 VYIEVMDMGDELEYGLQ-D----RVRMDRRATLSMTYNATTRVWTSSALPWLPQDLSSLDAVLVSGWAGYVGGAFQFSYN 661 (806) T ss_pred EEEEeecCCCCCCcccc-e----eeeccccceEEEeccccccceeeeeeccccccccceeEEEEeeccccCCceEEEEEc Confidence 45888765543332211 1 122333333332110000 011122334445555444444443221110 Q ss_pred ------ceEEEecCCCCCceEEEeeeeeEEEEeCCeeEecCCCCceeecceEEEEEEEEeeccceEEEEecCCCCcccee Q lcl|NC_011107. 687 ------KVFLDVPEAVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPW 760 (826) Q Consensus 687 ------~~~~~~~~~~~~~~v~vGl~y~~~~~~~~~~i~~~~g~~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~ 760 (826) ...+..+ +..+.+|+|||+|+++++|+||++++++++.+..+|+||+|+++++.+|++|.+.|+++.+..+.. T Consensus 662 ~~~~~v~~~~~~~-~~~~~~v~vGl~Y~s~~~~t~p~~~~~~~~~~~~~r~~l~r~~~~~~~s~~~~~~v~~~~~~~~~~ 740 (806) T protein:vir:10 662 ASNNTISTNFDLA-EGNTATIVVGETYWYEVEPTPPLIKDSKDRVSYLDTPTVGNVYLNLDMYPDFSVVVTDKETLQERT 740 (806) T ss_pred CccceEeeeeeec-CCCCcEEEEeeeeeEEEEECCeeEeccCCCccccccEEEEEEEEEeecceeeEEEEcccCCCccee Confidence 1112222 345778999999999999999999999988888899999999999999999999999988877777 Q ss_pred eeccCccccccccccCccccceeEEEEEecccCceeEEEEEECCCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 761 YDTTPLRLFSRQLNAGEPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 761 ~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~rrv 826 (826) +.+.+.++++.++..|.|+..+++++||+.+|+++.+|+|+|++|+||+|++|+|||+||+|+||| T Consensus 741 ~~~~~~~~~~~~~~~g~~~~~tg~~~vp~~~~~~~~~v~i~~d~P~P~tvlai~~eg~y~~r~~rv 806 (806) T protein:vir:10 741 VYLANKTAGSITNVIGYIAPHEGTLRIPLRRKSTDVSFKIRSKSPATFQLRDIEWTGSYNPRKRRV 806 (806) T ss_pred eeccCcccccccccccccccccceEEEEeeecCceeEEEEEECCCCceEEEEEEEEEEeecccccC Confidence 888999999999999999999999999999999999999999999999999999999999999999 No 17 >protein:vir:78703 Length: 905 # NCBI annotation: tail tube B # Family: family:all:825 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285450;genbank:gi:148724484;genbank:GeneID:5220174 Probab=100.00 E-value=2e-212 Score=1181.20 Aligned_cols=806 Identities=18% Similarity=0.224 Sum_probs=619.5 Q ss_pred CCceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCCCceEEEE Q lcl|NC_011107. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~~e~~~~~ 80 (826) ||+|+|+||||++|||||||++|+||||++|+||+|||+.||+||||++||+.|.++.. ..+++|+++||+.|+|+++ T Consensus 1 M~~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRp~~~~i~~l~~~~~--~~~~~~~~~r~~~e~y~~~ 78 (905) T protein:vir:78 1 MGAVLQKIPNLLGGVSQQPDPVKLPGQVREAENVYLDPTFGCRKRPATKFVGELATNLP--SDTRWFPIFRDAGERYAVA 78 (905) T ss_pred CccceecchhhhCceeecchhhcCCcchhhhhccccccccccccCchhhhhhhhcCCCC--CCceEEEEEeCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999987643 5789999999999999999 Q ss_pred EecCC----eEEEEEcCCCEEEEec--CccccccccCCccceEEEEEcCEEEEEeCcccCcccccccCCCCCCccEEEEE Q lcl|NC_011107. 81 AQHRG----ELYLFDERDGRLLMGQ--PLVHDYLKANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAGWLYI 154 (826) Q Consensus 81 ~~~~g----~i~v~~~~~g~~~~~~--~~~~~y~~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~~~v 154 (826) +..+| .|||||+.+|++..+. +....|+++++.++|++++|||||||+|++++|++..+.. ++++++|+++| T Consensus 79 ~~~~g~~~~~i~v~d~~~G~~~~V~~~~~~~~yl~~~~~~~l~~~tv~d~tfi~N~~~~~~~~~~~~--~~~~~~~~~~v 156 (905) T protein:vir:78 79 LYKDGSGNTQVRVWDMQTGAERTVTPDATATAYLATTNLNNLNWLTVADYTLLSNKERIVTMSGASE--VDSNQRALVEI 156 (905) T ss_pred EeeCCCCCcceEEEEccCCcEEEEecCCCccceeecCCCcceEEEEEcCEEEEEcCceeeeecCCCC--cCCCCeEEEEE Confidence 98877 4999999888766664 3457899999999999999999999999999999876544 56777899999 Q ss_pred cccccCceeEEEEeeccccceeeeeeE-EEEeecCCCCccc---cc---------------cccceEEecceee------ Q lcl|NC_011107. 155 KAGQYSKAFSMTIKVKDNATGTTYSHT-ATYVTPDNASTNP---NL---------------AEAPFQTSVGYIA------ 209 (826) Q Consensus 155 ~~g~y~~~y~v~i~g~~~s~~tt~~~t-asyttp~g~~t~~---~~---------------~~~~~~~~~~~ia------ 209 (826) |+|+|+|+|+|+|++...++..++... +....+....... .. .+-....+++..+ T Consensus 157 ~~g~y~r~y~v~I~~~~~~~~~t~~~~~a~~~s~~s~~~~~~g~~~~~~~~~~~~~t~~~~~~l~f~~~~~~~~~~~~~~ 236 (905) T protein:vir:78 157 NAISYNTTYSIDLDRDGASQQVKVYRAKALEISPGSFEVEDGGVCTEHDVQNYTNQTIGSSTGLAFQVRVQCAAYLENNE 236 (905) T ss_pred EeeccceeEEEEEeCCCCceeeeeeccccceeccccccccccccccccceeeeecceeeccCCceeEEeeccccccCCCc Confidence 999999999999999877655443322 2222211100000 00 0000000000000 Q ss_pred -----echheeeeccceE---------------EE-----------eecccccceeeccccc--eee---c--------- Q lcl|NC_011107. 210 -----WQLYGKFFGAPEY---------------TL-----------PNSTKKYPKVDPDANA--ATI---A--------- 244 (826) Q Consensus 210 -----~~l~~~~~ga~~~---------------t~-----------~~~~~~~~~~a~~~~~--~t~---a--------- 244 (826) ..-....++..+| .+ ............+..+ .+. + T Consensus 237 ~~~~~~~~~~l~~g~~~~~~~~~~~v~~~g~~y~i~i~~~~~~~~~~~~~~~~~~t~~d~~a~~~~~~~i~~~l~~~~~~ 316 (905) T protein:vir:78 237 YRSRYNVSVVLQNGGTGFRKGDMITVNLNGRDYNIRVTQEEFVYTYASDGTAAHTTPQDSTAGTLDIGQITAGLVNSVNL 316 (905) T ss_pred ccccccceeeeeccccccccCccEEEeeccceEEEEEecceeEEEecCCCcccccCccCccCccccHHHHHHHHHHhhcc Confidence 0000000111111 00 0000000000000000 000 0 Q ss_pred ccccccccccceEEEecCCceE-EEEecCCCCcceEEEEEEeecccccccccccCcccccEeeeeeeeeEeccCCCCcce Q lcl|NC_011107. 245 GYLNQRGVQDGYIAFRGDADIH-VEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVMATGSTKAPV 323 (826) Q Consensus 245 ~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (826) .........++++++...+... ...+.+++.++.+.++.+.|+++++||+++|.+ +.+.++...+...+.| T Consensus 317 ~~~~~~~~~g~~i~v~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~~~~g--------~~v~v~~~~~~~~d~y 388 (905) T protein:vir:78 317 ISNYSAQAVGNVIEIERTDGRDFNLGVRGGATNRAMTAIKGTANSIVDLPGQCFDG--------FELKVINTENAESDDY 388 (905) T ss_pred cccEEEEecCcEEEEEecCCCccEEEEeccCCcceEEEEeccccccccCccccCCC--------cEEEEEeCCCCCcceE Confidence 0001233456777776665432 335677788889999999999999999999853 2334444444556889 Q ss_pred EEEEEc------CCceEEEeecccccccc--cceeEEEEEecCCCeEEEeccC-------cCccccCCccccCCccccCC Q lcl|NC_011107. 324 YFEWDS------ANRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELE-------YDRRGSGDEDTNPTFNFVTR 388 (826) Q Consensus 324 y~~~~~------~~~~w~e~~~~g~~~~~--~t~p~~~~~~~~~~~f~~~~~~-------w~~r~~gd~~tnp~psf~g~ 388 (826) |++|+. ++++|+||++||+.+++ +||||.++ ++++|+|.++..+ |.+|.+||+++||+|+|+|+ T Consensus 389 yv~~~~~~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~-r~~~g~f~~~~~~~~~~~~~~~~r~~Gd~~Tnp~psf~g~ 467 (905) T protein:vir:78 389 YVVFRSAAEGIPGSGSWEETVAPGIERGFNTSTMPHALI-RQADGNFTLEALNDEGTITGWAQREVGDDDTNPKPSFVGR 467 (905) T ss_pred EEEEEecccCCcCceeEEEecccccccccccccccEEEE-EecCceEEEEEeccccccccccccccCCcccCCCCcccCC Confidence 999964 35699999999987766 69999997 7799999999887 99999999999999999999 Q ss_pred CccEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeC Q lcl|NC_011107. 389 GITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPG 468 (826) Q Consensus 389 ~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~ 468 (826) +|++|+||||||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|++ T Consensus 468 ~is~v~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdDpI~~~~ss~~~~~i~~~v~~~~~L~ifT~g~ef~lsg 547 (905) T protein:vir:78 468 GISDMFFYNNRLGFLSEDAVIMSQPGDYFNFFVTSAITISDSDPIDVTASSTKPAILRAAIGAPKGLILFAENSQFLLAS 547 (905) T ss_pred CcceEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeEEEeecCCcEEEEecCceEEEec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred Cc-cccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEEEE Q lcl|NC_011107. 469 GG-IVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQ 547 (826) Q Consensus 469 ~~-~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~ 547 (826) ++ +|||+|++++++|+|+|+++|+|+.+|+++||++++| ++++||||.|+.++|+ |+++|+|+|++|||++++..+ T Consensus 548 ~~~~lTP~s~~i~~~S~~~~~~~v~Pv~vG~~vlFv~~~g-~~s~vre~~y~~~~d~-y~a~DlT~~a~hl~~g~v~~~- 624 (905) T protein:vir:78 548 QEVVFSTATIKLTEISDYFYRSLAKPVSTGVSIAFVSEAD-TYSKIFEMSIDSVDNR-PQVADITRIVPEYVPTGLTWS- 624 (905) T ss_pred CCccccceeEEEEeEEeecccCCCCcEEeCCeEEEeecCC-CeeEEEEEEeeecccc-eehhHHHHHHHHhcCCceEEE- Confidence 65 7999999999999999999999999999999999997 5789999999988886 899999999999999998755 Q ss_pred EcCCCCEEEEEEcCCCeEEEEEEeecCCceeeeeeEeeecCCcEEEEEEECCeEEEEEEeCCCEEEEEE--EEeecCccc Q lcl|NC_011107. 548 AAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFTGDNLMVLIQKGQEIALGRM--HLNSLPARE 625 (826) Q Consensus 548 ~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~~d~l~~vv~r~~~~~~~r~--~~~~~~~~~ 625 (826) ++++|+.+||+++++|+|++|+|||+++||+|+|||||+|+|.+++||++.|++|++|+|..++...|+ .|..+++.. T Consensus 625 ~~s~~~~~v~~~~~~~~l~~ytyl~~~~eq~v~AWsrw~~~G~~~~~a~i~d~~~~vV~r~~~G~~~~~~~~l~~~~~~~ 704 (905) T protein:vir:78 625 VSTPNNSMMLFGDNSNTAYIFKFFNQGNERQVAGWSKWILPGEQRMCGFFADTGYFVLYDSTTGSYVLSAMELLDDPDSA 704 (905) T ss_pred EecCCCcEEEEEcCCCeEEEEEeecCCCceeEEeEEEEecCCCeEEEEEEcCCEEEEEEEccCCeEEEEEEeeccccCcc Confidence 456788899999999999999999999999999999999999999999999999999999877765554 333444444 Q ss_pred CCCcccccccceEEEe-ecccceeccceeeccCCcccceeeEEecCceeeeeecccceecCCceE--EEecCCCCCceEE Q lcl|NC_011107. 626 GLQYPKYDYWRRIEAT-VDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKRETNTKVF--LDVPEAVVGAVYV 702 (826) Q Consensus 626 ~~~~~~~d~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~--~~~~~~~~~~~v~ 702 (826) ..+....+...+.+++ .++.+++....... ......++..|+++..+....+|.......... .-......+++|+ T Consensus 705 ~~d~~~~~~~~~~d~~~~~~~~t~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~dG~~~~~~~~~~~~~~~~t~~~a~~v~ 783 (905) T protein:vir:78 705 SIDTAFSSFLPRLDNYVVKSDLTVVDNGDGT-LTVDLEAGQAMTGATPVIMFTDGPSEFAFSQPTITAGQFTVDTTDDFV 783 (905) T ss_pred ccccceeeeeeccceeeecccceecccCcce-EeeeccCccccccceeEEEeeCCceeeeEEEEEeeceeeccccCCeEE Confidence 5555444433333333 34444432110000 011122445566666655555554321110000 0011122467899 Q ss_pred EeeeeeEEEEeCCeeEecCCCCceeecceEEEEEEEEeeccceEEEEecCCCCccceeeeccCccccccccccCccccce Q lcl|NC_011107. 703 VGCEFWSKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLVDS 782 (826) Q Consensus 703 vGl~y~~~~~~~~~~i~~~~g~~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t 782 (826) |||+|+++++|+||+++.+++... .+|++|+|++|+|++|++|.+++++.+++... .....++.+..+..+.|+..+ T Consensus 784 VGl~Y~s~v~~~p~~~~~~~~s~~-~~~~rI~rv~lr~~~Sg~~~v~v~~~~~~~~~--~~~~~~~~~~~~~~~~p~~~t 860 (905) T protein:vir:78 784 VGFKYETKITLPGFFTSEENKADR-VYAPIVEFLYLDLYYSGRYQIEVDRIGYDTIN--IDAGSIDANIYLADGAPLKEI 860 (905) T ss_pred EeeeeeEEEeecceEeccCCCccc-ccceEEEEEEEEeecceeEEEEEcCCCcceec--ccccceecCcccCcccccccc Confidence 999999999999999987776544 47889999999999999999999998876533 234556677777778888899 Q ss_pred eEEEEEecccCceeEEEEEECCCCCEEEEEEEEEEEeecccc-cC Q lcl|NC_011107. 783 AVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYR-RV 826 (826) Q Consensus 783 g~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~r-rv 826 (826) |+++||+.+|+++.+|+|+|++|+||+|++|+|||+||+|+| || T Consensus 861 g~~~vP~~g~~~~~~v~I~sd~PlP~tvlsi~weg~Yn~r~~~~~ 905 (905) T protein:vir:78 861 ATENVPLFTPGDQVTVTIKAPDPFPSAITGYSWQGHYNRRGIAFI 905 (905) T ss_pred cEEEEEeeccCceeEEEEEECCCCcEEEEEEEEEEEeccceeecC Confidence 999999999999999999999999999999999999999999 88 No 18 >protein:vir:103790 Length: 768 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024932;genbank:gi:48697202;genbank:GeneID:2846114 Probab=100.00 E-value=9.3e-169 Score=941.80 Aligned_cols=723 Identities=13% Similarity=0.094 Sum_probs=523.5 Q ss_pred CCceeeechhhhcc-----cccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCC-ccccceeEEEEEcCCC Q lcl|NC_011107. 1 MSYKQSAYPNLLMG-----VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTD-QPWPRPFLYHTNLGGR 74 (826) Q Consensus 1 M~~v~~s~~n~~~G-----VSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~-~~~~~~~~~~~~rd~~ 74 (826) |++++++++||.+| +++|+|++||++||++|+||+++|+|||+||||++||+++++++ +.+++||.|. . T Consensus 1 M~~~~~~~~~F~~GelsP~l~~r~Dl~ry~~~~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~lipf~~~-----~ 75 (768) T protein:vir:10 1 MPKAAPQQVSFDAGELSPLLGARVDLAKYPNGCQVMENFIATVQGPAIRRGGKRFVAATKDSTKQSWLLPFIVA-----D 75 (768) T ss_pred CCcceeeeeeccCceechhhcccchHHHHHHHHhhhhcceeeecCCceecCchhhhhhhcCCCCCeeEEEEEec-----C Confidence 99999999999999 89999999999999999999999999999999999999998765 3456677653 4 Q ss_pred ceEEEEEecCCeEEEEEcCCCEEEEe-------cCccccccc-cCCccceEEEEEcCEEEEEeCcccCcccccccCCCCC Q lcl|NC_011107. 75 SIAMLVAQHRGELYLFDERDGRLLMG-------QPLVHDYLK-ANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDP 146 (826) Q Consensus 75 e~~~~~~~~~g~i~v~~~~~g~~~~~-------~~~~~~y~~-a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~ 146 (826) ++.|++++++++||||+. +|..+.. .+....|+. .+++.+|+|+|+||+|||+|++|||++. .+... T Consensus 76 ~~~y~l~fg~~~irv~~~-~g~v~~~~~~~e~~tp~~~~~l~~~~~~~~L~~~q~aD~~~i~~~~~~p~~l----~r~~~ 150 (768) T protein:vir:10 76 GIAYMLEFGDHYIRFFVN-RGQLVNAGAPVEIATPYALADLTTEDGTFAIRATQSADTMYLFHGGYPTQKL----LRTSA 150 (768) T ss_pred ccEEEEEEcCCEEEEEEC-CcEEEecCeeEEEEcCCCcceeecccccceeEEEeecCEEEEEcCCcceeEE----EEecC Confidence 788899999999999975 4443321 222334443 4677889999999999999999999852 23333 Q ss_pred Ccc--EEEEEcccccCc---eeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeeccce Q lcl|NC_011107. 147 NKA--GWLYIKAGQYSK---AFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPE 221 (826) Q Consensus 147 ~~~--a~~~v~~g~y~~---~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga~~ 221 (826) +.| +.+.++.++|.+ +.+++++.++.++..++...+...+|....... ++...... ...+ T Consensus 151 ~~w~l~~~~~~~gp~~~~n~~~~vti~~s~~~~~~T~tasa~~~~~~~v~~~~------------~l~~~~~~---~~~~ 215 (768) T protein:vir:10 151 TTFSLQPVTFVGGPFAAVNSDNNVRVHASAGTGAVTLVASASVFRPSDVGTLF------------YLEQEDNS---FVKP 215 (768) T ss_pred CCceeEEeeecCccccccccceeEEEEecccceeEEEeecCCccchhhcceee------------eeeeeccc---cccc Confidence 444 456678877743 556666666555444433322222222221110 00000000 0011 Q ss_pred EEEeecccccceeeccccceeecccccccccccceEEEecCCceEEEEecCCCCcceEEEEEEeecccccccccccCccc Q lcl|NC_011107. 222 YTLPNSTKKYPKVDPDANAATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGA 301 (826) Q Consensus 222 ~t~~~~~~~~~~~a~~~~~~t~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~ 301 (826) |....... -+++...........+.+++.... +.+..|...+. T Consensus 216 ~~~~~~~g-------------------------~~~~~~~~~~~~~~~~~~~~~~~~---------~~t~~~~~~~~--- 258 (768) T protein:vir:10 216 WVVHQKIG-------------------------PSELRRVGDRVYLCTAVGTATPQV---------TGTETPTHTSG--- 258 (768) T ss_pred cEEEEeee-------------------------eEEEEecCCceEEeeeeccccccc---------cceeccccccC--- Confidence 11100000 011111111111111111111000 00111111111 Q ss_pred ccEeeeeeeeeEecc-CCCCcceEEEEEcCCceEEEeecccccccccceeEEEEEecCCCeEEE--------eccCcCcc Q lcl|NC_011107. 302 PGVGVQFMDGAVMAT-GSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSL--------NELEYDRR 372 (826) Q Consensus 302 ~~~~~~~~~~~~~~~-~~~~~~~y~~~~~~~~~w~e~~~~g~~~~~~t~p~~~~~~~~~~~f~~--------~~~~w~~r 372 (826) +......+..... .......+++|......|.+..+ +...+|++..+.. .++++.+ ....|..+ T Consensus 259 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~----~~~~t~~~~~~~~-~~~~~~~~~~~~~~~~~~t~~~~ 331 (768) T protein:vir:10 259 --SRWDGTGQDESATDEYGSIGAEWEYQHSGYGTVLITG----YTNDQVVTGTVAT-NDPADPGMLPNTVVTLTGTYKWA 331 (768) T ss_pred --ceEEEecCcccccccccccceEEEEEEcCCceEEEEE----ecCCeeEEeeeee-ecCcccccccccccccCCCcccc Confidence 0000000000000 00112234455443333333222 2345677766533 2333333 33345666 Q ss_pred ccCCccccCCccccCCCccEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecC Q lcl|NC_011107. 373 GSGDEDTNPTFNFVTRGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFN 452 (826) Q Consensus 373 ~~gd~~tnp~psf~g~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~ 452 (826) ..+++++||+| ++|+||||||+|++|++|||||+||||||++++++++.|||||+++++++++++|+|++++ T Consensus 332 ~~~~~~~~g~P-------s~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdD~I~~~~ss~~~~~i~~~v~~- 403 (768) T protein:vir:10 332 RSLFNSTDGFP-------QMGTFWRNRLCLMRDRWLAMSVSADFETFKTKDADQQTDDSAIVQQLNARQLNKLAWMVES- 403 (768) T ss_pred cCCCcCCCCCc-------eEEEEEeeeEEEeeCCEEEEEcccccccccccccccccCCccEEEEecCCcceeEEEEeec- Confidence 66667777655 5679999999999999999999999999999999999999999999999999999999999 Q ss_pred CcEEEEecCcEEEEeC---CccccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchh Q lcl|NC_011107. 453 KDLIVFAKKYQAVVPG---GGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAE 529 (826) Q Consensus 453 ~~L~l~t~~~q~~i~~---~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~ 529 (826) ++|+|||+++||+|++ +++|||+|++++++|.|+++ +++|+.+|++++|++++|+ .||||.|+.+.|+ |+++ T Consensus 404 ~~L~i~T~~~q~~l~~~~~~~~lTP~~~~i~~~s~~g~~-~~~Pv~vG~~v~fv~~~g~---~vre~~y~~~~d~-y~a~ 478 (768) T protein:vir:10 404 DSLLIGMTGDEWVIGPANASQPVSAANLNAARRTSYGSK-RIQPVQVGGTIMFVQKAGR---KLRDFKYDFSSDN-YVST 478 (768) T ss_pred CcEEEEecCceEEEecCCCCcccccceEEEEEeehhccc-ccccEEeCCeEEEEcCCCC---EEEEEEeeeecCc-eecc Confidence 5899999999999987 35899999999999999775 6999999999999999884 7999999966665 9999 Q ss_pred HHHHHHHHhcCC------CeEEEEEcCCCCEEEEEEcCCCeEEEEEEeecCCceeeeeeEeeec-CCcEEEEEEE----- Q lcl|NC_011107. 530 DVTSHIPSYMPG------PAEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTL-RHQIIGAYFT----- 597 (826) Q Consensus 530 dls~~~~~~~~~------~v~~~~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~-~g~v~~~~~~----- 597 (826) |+|+|++||+++ +|..||++++|++++||+++||+|++|+|++++++|+|+|||||++ +|.|++||+| T Consensus 479 DlT~~a~hl~~~~~~~~~~i~~~a~~~~p~~v~~~v~~dg~l~~~ty~~e~~~q~v~aW~~~~~~~g~v~~v~~i~~~~g 558 (768) T protein:vir:10 479 DVTKIADHITRGRAGTNSGIMSLCFQQEPHSVVWAARADGQLIGCTYDEEAGRSDVYGWHRHPDANGFVECVASMPAPDG 558 (768) T ss_pred hhhhhhhhhccccCccccceeeEEEeecCCeEEEEEecCCeEEEEEEecCCCceeEEeEEEEEcCCCEEEEEEEEecCCC Confidence 999999999986 3899999999999999999999999999998888899999999975 7999999998 Q ss_pred -CCeEEEEEEeCCCEEEEEEEEeecCcccCCCcccccccceEEEeecccceeccceeeccCCcccceeeEEecCceeeee Q lcl|NC_011107. 598 -GDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEATVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERT 676 (826) Q Consensus 598 -~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 676 (826) +|+||++|+|.++++.+|| +|++......+ .....++++||++++.+. +...+.++.|++|+.+... T Consensus 559 ~~d~l~~~v~r~~~g~~~~~-ie~l~~~~~~~-----~~~~~~~~~D~~~~~~~~------~~~~~~gl~~leg~~v~v~ 626 (768) T protein:vir:10 559 ASDDLWVIVRRQVNGQTVRY-VEYLNPALQDD-----EPQSSAFYVDAGITYNGV------PTSTIAGLGHLEGVTVAVL 626 (768) T ss_pred CccEEEEEEEecCCCeEEEE-EEecCcccccc-----cccccceEeccccccCCc------ceeeecCCCCcccceEEEE Confidence 6999999999999999998 77766532222 222345778888887653 3445778999999999999 Q ss_pred ecccceecCCceEEEecCCCCCceEEEeeeeeEEEEeCCeeEecCCCCceeecceEEEEEEEEeeccceEEEEecCCCCc Q lcl|NC_011107. 677 HLGVKRETNTKVFLDVPEAVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARP 756 (826) Q Consensus 677 ~~g~~~~~~~~~~~~~~~~~~~~~v~vGl~y~~~~~~~~~~i~~~~g~~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~ 756 (826) ++|...+........+.++.++++|+|||+|+++++|+||++++++|..++ +|+||+|++|+|++|+++.++++...+. T Consensus 627 ~dG~~~~~~~v~~g~itl~~~~~~v~vG~~y~s~~~~~p~~~~~~~gs~~~-~~~ri~r~~v~~~~S~~~~~~~~~~~~~ 705 (768) T protein:vir:10 627 TDGAVHPSRTVTAGAITLDWSASIVHIGVPTTCRIQTMQLNAGAANGTAQG-KTKRVTNIATRFSRSLGGVVGPTFDDND 705 (768) T ss_pred ECCEeccCceecCCEEEeCCCCceEEEeEeeeEEEEecceEeecCCccccc-cceEEEEEEEEEecccceEEEecCCCCC Confidence 999877666555566777888999999999999999999999999887644 6889999999999999999987665432 Q ss_pred cceeeeccCccccccccccCccccceeEEEEEec-ccCceeEEEEEECCCCCEEEEEEEEEEEeeccc Q lcl|NC_011107. 757 NQPWYDTTPLRLFSRQLNAGEPLVDSAVVPLPAR-VDMATSKFELSCHSPYDMNVRAVEYNFKSNQTY 823 (826) Q Consensus 757 ~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~-~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~ 823 (826) +...+.+..+..+.. .++++||++++|+. +++++.+|+|+|++|+||+||+|+||+++|.|+ T Consensus 706 ----~~~~~~r~~~~~~~~-~~~l~TG~~~v~~~~~~~~~~~i~i~~d~P~P~tvlsi~~~~~~nd~~ 768 (768) T protein:vir:10 706 ----LEQLSFRKPSNAMDR-AVPLFDGDMESDWRGGYEGQSWICYQNDQPLPVTLLGFFPILDTQDDR 768 (768) T ss_pred ----ceeeeeEecCcccCc-cCCcccCEEEEEecCCCCcceEEEEEECCCCCEEEEEEEEEEEEeecC Confidence 223344544444332 24678999999975 558889999999999999999999999999999 No 19 >protein:vir:1778 Length: 680 # NCBI annotation: tail protein A # Family: family:all:825 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570344;genbank:gi:18640503;genbank:GeneID:932716 Probab=100.00 E-value=4e-162 Score=905.42 Aligned_cols=575 Identities=21% Similarity=0.310 Sum_probs=448.8 Q ss_pred CCceeeechhhhcccccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCCCceEEEE Q lcl|NC_011107. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~~GVSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~~e~~~~~ 80 (826) ||+|+|+||||++|||||||.+|+||||++|+||+|||+.||+||||++||+.|...+ ..+++|+++||+.|+||++ T Consensus 1 M~~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRpg~~~i~~l~~~~---~~~~~~~~~rd~~e~~~~~ 77 (680) T protein:vir:17 1 MAAVEQMVPNLLGGISQQPDPLKLPGQVKQARNVQLDPTFGALKRPGTELIMQVTGIP---KRAKWIPIMRDAREHYYVA 77 (680) T ss_pred CccceecchhhhCcceecchhhcCcchhhhhhccccCcCcccccCccceeeeeccCCC---CCceeEEEecCCCCeEEEE Confidence 9999999999999999999999999999999999999999999999999999986543 6788999999999999999 Q ss_pred EecCC-------eEEEEEcCCCEEEEecCc---ccccc--ccCCccceEEEEEcCEEEEEeCcccCcccccccCCCCCCc Q lcl|NC_011107. 81 AQHRG-------ELYLFDERDGRLLMGQPL---VHDYL--KANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNK 148 (826) Q Consensus 81 ~~~~g-------~i~v~~~~~g~~~~~~~~---~~~y~--~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~ 148 (826) +..+| .|||||+.+|+...+... .+.|+ .+++..+||+++|||||||+||++.|++... .+++++ T Consensus 78 ~~~~g~~~~~~~~i~v~d~~~G~~~~v~~~~~~~~~~~~~~~~~~~~lr~~tv~d~tfi~N~~v~~~~~~~---~~~~~~ 154 (680) T protein:vir:17 78 IYREGANESGDLRIRVFDLKAGVERAVSFVGGEVEEYFPGDETDWEAIRSLTIGDYTFLSNPNVQPTTWSR---SFSRRP 154 (680) T ss_pred EEcCCCcccccceeEEEEccCCeEEEEEcCCCceEEEeecCCCCccceEEEEEcCEEEEECCeEEEeccCC---CCCCCC Confidence 99887 499999988866655322 22333 4678889999999999999999999998754 345667 Q ss_pred cEEEEEcccccCceeEEEEeeccccceeeeeeE-----EEEeecCCCCccccccccceEEecceeeechheee------- Q lcl|NC_011107. 149 AGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHT-----ATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKF------- 216 (826) Q Consensus 149 ~a~~~v~~g~y~~~y~v~i~g~~~s~~tt~~~t-----asyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~------- 216 (826) .|+++|++|+|+|+|+|+|++...+..+....+ +.+..+.+............+..+..++..+.... T Consensus 155 ~g~~~v~~~ayg~ty~v~ing~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~Ag~~t~~~~~~a~la~~l~~~~~~~~~~~ 234 (680) T protein:vir:17 155 EGLVTIGAAGYGTSYIVDFATEDSGQQRRWAVQEMQAPKTKRKKGDGSPDEAGETTVNNWNGTGLSFRVKVEARAFLVDD 234 (680) T ss_pred eeEEEEEEeeeeeEEEEEEeccccceeeeeeeeeeeccccccccccccCCCCcceeeeeeeeeeeeeeeeeccceeeecC Confidence 899999999999999999999866543322211 11111111111111111112222222222111000 Q ss_pred ---------------eccce-------EEEeeccccccee-------------------eccc---cce---eec----- Q lcl|NC_011107. 217 ---------------FGAPE-------YTLPNSTKKYPKV-------------------DPDA---NAA---TIA----- 244 (826) Q Consensus 217 ---------------~ga~~-------~t~~~~~~~~~~~-------------------a~~~---~~~---t~a----- 244 (826) ++..+ .++...+....+. .+.. ... .++ T Consensus 235 g~~~~~~y~~~~~l~~tg~~~~~~~~t~~v~~~G~~y~IsI~~~~~~~~~~~~~s~~~~t~~~~~a~~at~~~Ia~~L~~ 314 (680) T protein:vir:17 235 GEEYGHNYIPYVTLLTPGNNTSPFPDTIRVDVSGEGWDIKVTKQIQSKVYANLGTAQFTTPVDQSGGGASTSDIVTGLSA 314 (680) T ss_pred CCceEEEEeeEEEEecCCccccccCceEEEecccceeEEEEccceeeEeccCccceeeeeccCCcccceeHHHHHHHHHH Confidence 00000 0000000000000 0000 000 000 Q ss_pred ---c-cccccccccceEEEecC---CceEE-EEecCCCCcceEEEEEEeecccccccccccCcccccEeeeeeeeeEecc Q lcl|NC_011107. 245 ---G-YLNQRGVQDGYIAFRGD---ADIHV-EVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVMAT 316 (826) Q Consensus 245 ---~-~~~~~~~~~g~~~~~~~---~~~~~-~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 316 (826) . ..-.....++++++... ....+ ..+.++++++.+.++.+.|+++++||+++|.+ +.++++... T Consensus 315 ~i~~~~~~~~~~~g~~i~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~a~~g--------~~v~v~~~~ 386 (680) T protein:vir:17 315 AINGLGTFTAESIGNVIRVRYSDPTRTDEFTMSARGGTSGTGLESIKYSVDTLAELPTKCWND--------YQVAVRNTQ 386 (680) T ss_pred hhcccCcEEEEECCCEEEEEeccCCCceEEEeeccCCCCceeeeeeeeeeccccccccccCCC--------cEEEEEeCC Confidence 0 00112333456676532 22223 34568888899999999999999999999853 334555666 Q ss_pred CCCCcceEEEEEc--------CCceEEEeecccccccc--cceeEEEEEecCCCeEEEeccC-------cCccccCCccc Q lcl|NC_011107. 317 GSTKAPVYFEWDS--------ANRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELE-------YDRRGSGDEDT 379 (826) Q Consensus 317 ~~~~~~~y~~~~~--------~~~~w~e~~~~g~~~~~--~t~p~~~~~~~~~~~f~~~~~~-------w~~r~~gd~~t 379 (826) +...+.||++|+. ..++|+||++||+.+++ +||||.++ ++++|+|.++..+ |++|.+|||++ T Consensus 387 ~~~~~~Yyv~~~~~~~~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~-r~~~g~f~~~~~~~~~~~~~~~~r~~Gdd~t 465 (680) T protein:vir:17 387 DTEVDDYYVKFETDVEDADVPGSGYWVETVKNGDDGGLVDDTMPHVLV-RNALGDFTFSSLNNSSYGKTWADRSVGSEDT 465 (680) T ss_pred CCcccceEEEEeccCcccCcccccceeecccCcccceeccCcceEEEE-EccCceeEEEeeccccccccccccccCCccc Confidence 6777999999986 44689999999987666 58999998 6689999999886 99999999999 Q ss_pred cCCcccc--CCCccEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEE Q lcl|NC_011107. 380 NPTFNFV--TRGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIV 457 (826) Q Consensus 380 np~psf~--g~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l 457 (826) ||+|+|+ |++|++|+||||||+|++|++|||||+||||||++++++++.|||||+++++++++++|+|+++++++|+| T Consensus 466 np~psF~~~G~~p~~v~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l 545 (680) T protein:vir:17 466 NPHPTFTESGNGIYGMFMYKNRLGFLTQDAVIMSQVGDYFNFYATSGVTISDADPIDMATSDTKPVKLEAAISSTSGAIL 545 (680) T ss_pred CCCcccccCCCCceEEEEEcceEEEeeCCeEEEEccCCcccccccccccCCCCccEEEEEcCCcceeeeEEeecCCcEEE Confidence 9999999 88999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EecCcEEEEeCC-ccccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHH Q lcl|NC_011107. 458 FAKKYQAVVPGG-GIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIP 536 (826) Q Consensus 458 ~t~~~q~~i~~~-~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~ 536 (826) ||+++||+|+++ ++|||+|++++++|+|+|++.|+|+.+|+.+||++++| ++++||||.|+.+.++ |+++|||+|++ T Consensus 546 ~t~g~q~~ls~~~~~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g-~~s~vre~~y~~~~d~-y~a~DlT~~a~ 623 (680) T protein:vir:17 546 FGNQAQFRLSSPDESFGPKTATLDKISNYTYESKADPVQTGVSMIFPTNMG-TYSSVYELSTESAKGT-PVIEDSSRVIP 623 (680) T ss_pred EecCeEEEEecCCceecceeEEEEEEEeecccCCCCceEeCCeEEEeecCC-CcceEEEEeeeeccCc-eehhhHHHHHH Confidence 999999999984 69999999999999999999999999999999999998 5789999999876665 99999999999 Q ss_pred HhcCCCeEEE-EEcCCCCEEEEEEcCCCeEEEEEEeecCCceeeeeeEeeecCCcEE Q lcl|NC_011107. 537 SYMPGPAEYI-QAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQII 592 (826) Q Consensus 537 ~~~~~~v~~~-~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~ 592 (826) |||+|++..+ +++++|..++|++++||+|++|+|||+++||+|+|||||+|++.=. T Consensus 624 hl~~g~v~~~~~~~~~~~~~~~~~~~~~~l~~~~yl~~~~e~~v~aW~rw~~~~~d~ 680 (680) T protein:vir:17 624 RLIPSGLTWSTASMNNDTVFFGNAKKGRNVYVFRFFNEGQERKVAGWTTWYYEDQDH 680 (680) T ss_pred HhcCCceEEEEeeCCCCeEEEEEEcCCCEEEEEEEeeCCCceEEEEEEEEecCCCCC Confidence 9999988776 5556666789999999999999999999999999999999986433 No 20 >protein:vir:95324 Length: 823 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512269;genbank:gi:89152436;genbank:GeneID:3952993 Probab=100.00 E-value=3.4e-154 Score=861.92 Aligned_cols=707 Identities=14% Similarity=0.128 Sum_probs=494.3 Q ss_pred CCceeeechhhhcc-----cccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCC-ccccceeEEEEEcCCC Q lcl|NC_011107. 1 MSYKQSAYPNLLMG-----VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTD-QPWPRPFLYHTNLGGR 74 (826) Q Consensus 1 M~~v~~s~~n~~~G-----VSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~-~~~~~~~~~~~~rd~~ 74 (826) |+ +++.++||.+| +++|+|++||++||++|+||+++|+|||+||||++||+++++++ +.+++||.|. . T Consensus 1 m~-i~~~q~sF~~GElsP~l~gR~Dl~ry~~q~~~~~N~~~~~~GGl~rRpGt~fva~~~~~~g~~rLipf~~s-----~ 74 (823) T protein:vir:95 1 MA-ISWIQPSFAGGEIGPSLYGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPNRKCRLIPFQFS-----T 74 (823) T ss_pred Cc-ceeechhccCceechheeccchHHHHHHHHhhhhCcEeeecCCceecCchhhhhhhcCCCCCeeEEEEEeC-----C Confidence 99 99999999999 99999999999999999999999999999999999999998765 4567788764 3 Q ss_pred ceEEEEEecCCeEEEEEcCCCEEEEecC----ccccccccCCccceEEEEEcCEEEEEeCcccCcccccccCCCCCCccE Q lcl|NC_011107. 75 SIAMLVAQHRGELYLFDERDGRLLMGQP----LVHDYLKANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAG 150 (826) Q Consensus 75 e~~~~~~~~~g~i~v~~~~~g~~~~~~~----~~~~y~~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a 150 (826) +|.|++++++++||||+. ++.++.... ...+|. +....+|+|+|+||+|||+|++|+|++ +.+..++.|. T Consensus 75 ~q~y~Lefg~~~irV~~~-~g~vv~~~~~~~ev~tPy~-~~~l~~Lr~~qsaD~~fivh~~~~p~~----L~r~~~~~w~ 148 (823) T protein:vir:95 75 VQTYALEFGHQYMRVIKD-GALVLNSSNVIYEIATPYT-EADLFRIKFTQSADVLTLVHPAYPPKE----LRRYAHDNWQ 148 (823) T ss_pred CcEEEEEEcCCeEEEEeC-CcEEEecCCceeEEecccc-cccccceeEEEeccEEEEEcCCccceE----EEecCCCCce Confidence 688888999999999964 333322211 223443 345679999999999999999999985 3444555554 Q ss_pred E--EEEcccccCc---eeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeeccceEEEe Q lcl|NC_011107. 151 W--LYIKAGQYSK---AFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLP 225 (826) Q Consensus 151 ~--~~v~~g~y~~---~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga~~~t~~ 225 (826) + +.++.++|.. ++++++.........+...++....++.. ... +++. T Consensus 149 l~~~~~~~gp~~~~~~~~t~~v~~~~~~~~~t~ta~~~~~~~d~v---------------g~~-------------~~l~ 200 (823) T protein:vir:95 149 LVDVVTKNGPFEDINIDESLTVYASASTGTITLTASASIFGAEQV---------------GKL-------------FYLE 200 (823) T ss_pred EEEEEEeccccccccccceeEEeccccCceeEEeecccccchhhc---------------cce-------------EEEe Confidence 4 4456677743 34455544333222111111111111000 000 0000 Q ss_pred ecccccceeeccccceeecccccccccccceEEEecCCceEEEEecCCCCcceEEEEEEeecccccccccccCcccccEe Q lcl|NC_011107. 226 NSTKKYPKVDPDANAATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVG 305 (826) Q Consensus 226 ~~~~~~~~~a~~~~~~t~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~ 305 (826) ......... +....... .+.. ...+.+. +... ..-++....|...+.. +. T Consensus 201 ~~~~~~~~~-~~~~~~~~---------~~~~--~~~~~~~-----------~~~~---~~~~~g~~~~~~~~~~----~~ 250 (823) T protein:vir:95 201 QPAVDSVPV-WETSKSTS---------IGDI--RRADSNY-----------YRAV---TAGKTGTLRPSHTEGT----SW 250 (823) T ss_pred ccccceeee-cceeeeec---------ccce--EEecccc-----------eeee---eccccceeecccCCcc----eE Confidence 000000000 00000000 0000 0000000 0000 0000111112222221 11 Q ss_pred eeeeeeeEeccCCCCcceE---EEEEcCCceEEEeeccccccc---ccceeEEEEEecCCCeEEEeccCcCccccCCccc Q lcl|NC_011107. 306 VQFMDGAVMATGSTKAPVY---FEWDSANRRWAERAAYGTDWV---LKKMPLALRWDEATDTYSLNELEYDRRGSGDEDT 379 (826) Q Consensus 306 ~~~~~~~~~~~~~~~~~~y---~~~~~~~~~w~e~~~~g~~~~---~~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~t 379 (826) + ...++..+++| ..+....+.|++++..+.... ..+||+.+. +..+++|.++...|.. T Consensus 251 ----~---~~~~~~~~~~~~~~~~~~~~~g~~~~t~v~~~~~~~~~~~~~~~~~~-~~~~~t~~~~~~~~~~-------- 314 (823) T protein:vir:95 251 ----D---GWGGSGDDDTGIEWEYLHSGFGIARITAVNGTTATAEVISYIPSQVV-GEDNASYKWAKYAWNS-------- 314 (823) T ss_pred ----E---eceecccccceeEEEEEeCCcceEEEEeecceeeeceEeeeeccccc-cCCcCCccccccccCc-------- Confidence 1 11122333333 334566789999876653221 246888776 5677888877777753 Q ss_pred cCCccccCCCccEEEEEcceEEEec----CCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcE Q lcl|NC_011107. 380 NPTFNFVTRGITGMTTFQGRLVLLS----QEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDL 455 (826) Q Consensus 380 np~psf~g~~~~~v~~~q~RL~f~~----~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L 455 (826) .++||++|+||||||+|++ |++|||||+||||||+++++ ++|||||+++++++++|.|+|+++++ +| T Consensus 315 ------~~g~Ps~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~~--~~DdD~I~~~~s~~~~~~i~~~v~~~-~L 385 (823) T protein:vir:95 315 ------VNGYPGTVVYYQQRLYFAASTAFPQTIWASRTGDYKDFGKSNP--TQDDDRIIYTYAGRQVNEIRHLIDVG-SL 385 (823) T ss_pred ------CCCCccEEEEEeceEEEEEcCCCCcEEEEeccCCccccccccC--CCCCCcEEEEEcCCcceEEEEEeecC-cE Confidence 2357788999999999995 69999999999999999984 57999999999999999999999995 79 Q ss_pred EEEecCcEEEEeCC--ccccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHH Q lcl|NC_011107. 456 IVFAKKYQAVVPGG--GIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTS 533 (826) Q Consensus 456 ~l~t~~~q~~i~~~--~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~ 533 (826) +|||+++||+|+++ ++|||+|+.++++|+|++ ++++|+.+|+.++|++++| ++||||.|+.+.+ .|+++|+|+ T Consensus 386 li~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~~~Fv~~~g---~~vre~~~~~~~d-~~~~~dlT~ 460 (823) T protein:vir:95 386 VALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGS-SNVPPIAVANIALFVQEKG---SVVRDLAYSFDVD-GYQGNDLTI 460 (823) T ss_pred EEEecCcEEEEEcCCCcccceeeEEEEEeecccc-ccccceEeCCeEEEEecCC---CEEEEEEEeeecC-ceecchhhh Confidence 99999999999874 699999999999999965 5799999999999999877 4799999996655 599999999 Q ss_pred HHHHhcCC-CeEEEEEcCCCCEEEEEEcCCCeEEEEEEeecCCceeeeeeEeeecCCcEEEEEEE----CCeEEEEEEeC Q lcl|NC_011107. 534 HIPSYMPG-PAEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFT----GDNLMVLIQKG 608 (826) Q Consensus 534 ~~~~~~~~-~v~~~~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~----~d~l~~vv~r~ 608 (826) |++|++++ +|.+||++++|++++||+++||+|++|+|+ +||+|.|||||+++|+|++||++ +|+|||+|+|+ T Consensus 461 ~a~hl~~~~~i~~~a~~~~p~~~~~~v~~dG~l~~~ty~---~~q~v~aW~~~~~~g~~~~~~~i~~~~~d~l~~~v~R~ 537 (823) T protein:vir:95 461 LANHLFQKHSIVDWCFSIVPYSSAFCIRDDGKLLVMTYL---RDQQVFAWAPQSSTGKYESTCSISEGNEDAVYFVVNRT 537 (823) T ss_pred hhhhhcCCCceEEEEEecCCCeEEEEEecCCcEEEEEEe---cccceeeeEEEecCCcEEEEEEecCCCCCEEEEEEEec Confidence 99999987 799999999999999999999999999998 88999999999999999999998 68999999999 Q ss_pred CCEEEEEEEEeecCcccCC---CcccccccceEE-------------------------Eee-----------cccce-- Q lcl|NC_011107. 609 QEIALGRMHLNSLPAREGL---QYPKYDYWRRIE-------------------------ATV-----------DGELE-- 647 (826) Q Consensus 609 ~~~~~~r~~~~~~~~~~~~---~~~~~d~~~~~~-------------------------~~~-----------~~~~~-- 647 (826) ++++..|| +|++...... +..++|....++ ..+ ++..+ T Consensus 538 i~g~~~~y-iE~~~~~~~~~~~~~~~lD~~~s~~g~~~~~~~~~l~~g~~~l~~l~g~~v~~adg~~~~~~~v~g~i~l~ 616 (823) T protein:vir:95 538 VNGQTVRY-IERLSSRLFTSDEDAFFVDSGLSYDGRNTSDRTMTITGGSGEWDYLAEYTISVSGGAYFTSSDVGAQLQFP 616 (823) T ss_pred cCCeEEEE-EEeeccccCCCccceeEEEEEEEeecCcccceeeEecCCCCcccccCceEEEecCcceECCccceeEEEeC Confidence 99988887 6766543321 122222111000 000 00000 Q ss_pred -----------ecc-------------------------cee-----eccCCcccceeeEEecCceeeeeecccceecCC Q lcl|NC_011107. 648 -----------LTK-------------------------QHW-----DLIKDASAVYQLQPVAGAYMERTHLGVKRETNT 686 (826) Q Consensus 648 -----------~~~-------------------------~~~-----~~~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~ 686 (826) +.. ... ....+...+++|+|++|+++.+.+||.+++... T Consensus 617 ~~~~~~~vGl~~~~~i~~~~~~v~~~~a~~~~~~r~v~a~l~~~~t~~~~~~~~~~~gL~hleg~tv~v~~dg~~~~~~~ 696 (823) T protein:vir:95 617 YTGADPDTGYEVSKELRCDIISVTSNTAVVVRANRNVPPSLRNVATTNWQMARRTFGGLSHLEGQTVNILSDANVEPQKV 696 (823) T ss_pred cCCCccccccceEEEEEEeeceeeCCceEEEccCCcccceeeeeeccccccccceeeeccccccceEEEEEcCeeeCCeE Confidence 000 000 001123456789999999999999998877665 Q ss_pred ceEEEecCCCCCceEEEeeeeeEEEEeCCeeEecCCCCceeecc-eEEEEEEEEeeccceEEEEecCCCCccceeeeccC Q lcl|NC_011107. 687 KVFLDVPEAVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTR-AVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTP 765 (826) Q Consensus 687 ~~~~~~~~~~~~~~v~vGl~y~~~~~~~~~~i~~~~g~~~~~gr-~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~ 765 (826) +....++++..+..|+|||+|+++++|+||++..+ |. .+|| +||+++.++|++|.++.++.+....+.. + T Consensus 697 v~~G~vtl~~~~~~v~vGl~~~~~~~~l~~~~~~~-g~--~~g~~~ri~~~~~~~~~s~~~~~g~~~~~l~~~------~ 767 (823) T protein:vir:95 697 VSGGAVTLESPGAVVHIGLPITAEFETLDININGQ-ET--LLDKKQVIPSVTLVVNASRGIWATTPGGKWYEY------P 767 (823) T ss_pred ecCCEEEecCCCCEEEEeecceeeEEecchhcCCC-cc--cCCceeEEeEEEEEEEeeeeEEEecCCCceeEe------e Confidence 55556666777889999999999999999998753 54 3455 4799999999999999988655432221 2 Q ss_pred ccccccccccCccccceeEEEEEe-cccCceeEEEEEECCCCCEEEEEEEEEEEeecc Q lcl|NC_011107. 766 LRLFSRQLNAGEPLVDSAVVPLPA-RVDMATSKFELSCHSPYDMNVRAVEYNFKSNQT 822 (826) Q Consensus 766 ~~~~~~~~~~~~~~~~tg~~~vp~-~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r 822 (826) .|-. +.....|+++||++++++ .+|+++++|+|+|++|||||||+|..|...+-= T Consensus 768 ~r~~--~~~~~~~~~~tG~~~~~~~~~~~~~~~~~i~q~~plp~tvl~v~~~~~~~g~ 823 (823) T protein:vir:95 768 QREF--EFYDDPVDDATGKVEVKLDSNWGKNGRVKIRQLDPLPLSVLAVIPRLTVGGF 823 (823) T ss_pred ccCC--CcccCCCCcccceEEEecCCCcCCccEEEEEEcCCCceEEEEEEEEEEecCC Confidence 2211 222233468999999987 799999999999999999999999988765443 No 21 >protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878 Probab=100.00 E-value=3.2e-150 Score=840.15 Aligned_cols=657 Identities=14% Similarity=0.132 Sum_probs=480.8 Q ss_pred CCceeeechhhhcc-----cccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCc-cccceeEEEEEcCCC Q lcl|NC_011107. 1 MSYKQSAYPNLLMG-----VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQ-PWPRPFLYHTNLGGR 74 (826) Q Consensus 1 M~~v~~s~~n~~~G-----VSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~-~~~~~~~~~~~rd~~ 74 (826) |+++....+||.+| +..|.|++||+++|++|+||++.|+||++||||++|+++++++.. .+++||.|.. T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~----- 75 (681) T protein:vir:10 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSV----- 75 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCC----- Confidence 99999999999999 668999999999999999999999999999999999999998864 5689999864 Q ss_pred ceEEEEEecCCeEEEEEcCCCEEEEec---CccccccccCCccceEEEEEcCEEEEEeCcccCcccccccCCCCCCccEE Q lcl|NC_011107. 75 SIAMLVAQHRGELYLFDERDGRLLMGQ---PLVHDYLKANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAGW 151 (826) Q Consensus 75 e~~~~~~~~~g~i~v~~~~~g~~~~~~---~~~~~y~~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~ 151 (826) ||+|++|+++++||||. +++.++..+ ....+|.+ ....+|+|+|+||++||||++|||++ +.+..+..|.+ T Consensus 76 ~~~~~l~~g~~~~r~~~-~~~~~~~~~~~~~~~tpy~~-~~l~~l~~~q~aD~~~i~h~~~~p~~----L~r~~~~~W~l 149 (681) T protein:vir:10 76 TQTMVIELGAGYFRFHT-NGGTLLDGAVPYEIANPYAE-ADLFNIHYVQSADVLTLVHPNYAPRE----LRRLGATNWQL 149 (681) T ss_pred CceEEEEEeCCeEEEEe-CCcEEeeCcEeEEecCCCCh-hhhcCceEEEEcCEEEEECCCCcceE----EEEccCCceEE Confidence 89999999999999994 455544321 12345644 45678999999999999999999985 44455566655 Q ss_pred EEEc--ccccCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeeccceEEEeeccc Q lcl|NC_011107. 152 LYIK--AGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTK 229 (826) Q Consensus 152 ~~v~--~g~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga~~~t~~~~~~ 229 (826) ..+. .+.+. .-++ +++. ..+. . .....+.. T Consensus 150 ~~~~f~~~p~~-p~~~---------------~at~--~~~~--------~--~~t~~~~v-------------------- 181 (681) T protein:vir:10 150 ATIAFTSPVAT-PTSV---------------TATS--NNKG--------T--DYTYRYVV-------------------- 181 (681) T ss_pred EEEEecccccc-ceee---------------eeec--cCCc--------c--ceeEeEEE-------------------- Confidence 4321 11110 0000 0000 0000 0 00000000 Q ss_pred ccceeeccccceeecccccccccccceE-EEecCCceEEEEecCCCCcceEEEEEEeecccccccccccCcccccEeeee Q lcl|NC_011107. 230 KYPKVDPDANAATIAGYLNQRGVQDGYI-AFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQF 308 (826) Q Consensus 230 ~~~~~a~~~~~~t~a~~~~~~~~~~g~~-~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~ 308 (826) .+.+. ..+.. ......+......+++ . T Consensus 182 ----~avda--------------~t~~~s~~~~~~tvt~~~~~~~----------------------------------~ 209 (681) T protein:vir:10 182 ----TALDA--------------EGKTESAPSSAGTCTNNLFTNG----------------------------------G 209 (681) T ss_pred ----EEeec--------------ccceeecCCcceEEeeeeecCC----------------------------------c Confidence 00000 00000 0000000000000000 0 Q ss_pred eeeeEeccCCCCcceEEEEEcCCceEEEeecccccccccceeEEEEEecCCCeEEEeccCcCccccCCccccCCccccCC Q lcl|NC_011107. 309 MDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTR 388 (826) Q Consensus 309 ~~~~~~~~~~~~~~~y~~~~~~~~~w~e~~~~g~~~~~~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~ 388 (826) ...+.|..+.... .|-.+....++|... +... + + .+......+......+...+++.++ ++ T Consensus 210 ~~t~~w~a~~g~~-~~~V~~~~~gi~g~i-g~~~--~--~------------~~~~~~~~~~~~~t~~~~~~~~~~~-~g 270 (681) T protein:vir:10 210 ANTIAWSASSGAS-RYNVYKEQGGLYGYI-GQTT--G--T------------SLVDDNIAPDLSVTPPIYDAVFNAA-GD 270 (681) T ss_pred ceeEEEEecCCce-eeeecccceeEEEEe-eccc--e--e------------eeeecccccCccccccccccccccC-CC Confidence 1111222222221 111111222233221 1110 0 0 0111111122222223334555443 46 Q ss_pred CccEEEEEcceEEEe----cCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEE Q lcl|NC_011107. 389 GITGMTTFQGRLVLL----SQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQA 464 (826) Q Consensus 389 ~~~~v~~~q~RL~f~----~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~ 464 (826) ||++|+||||||+|+ +||+|||||+||||||+++++ +.|||||++++++++++.|+|+++++ +|+|||+++|| T Consensus 271 yP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~--~~ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~ 347 (681) T protein:vir:10 271 YPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLP--VRDDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEW 347 (681) T ss_pred ceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccCC--CCCCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEE Confidence 999999999999999 589999999999999999984 58999999999999999999999995 79999999999 Q ss_pred EEeC--CccccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCC- Q lcl|NC_011107. 465 VVPG--GGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPG- 541 (826) Q Consensus 465 ~i~~--~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~- 541 (826) .|++ +++|||+|++++++|.|++ ++++|+.+|++++|++++|+ .||||.|+.+.+ .|+++|+|++++|++++ T Consensus 348 ~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~fv~~~g~---~vre~~y~~~~d-~~~~~dlt~~a~Hl~~~~ 422 (681) T protein:vir:10 348 RVASVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTIYGAARGG---HVRELAYNWQAN-GFVTGDLSLRAAHLFDNL 422 (681) T ss_pred EEecCCCccccceeEEEEEeeeecc-ccccceeeCCeEEEEecCCC---EEEEEEEeeecC-ceeccchhhhhhhhcCCC Confidence 9987 4699999999999999976 57999999999999999884 799999997666 49999999999999997 Q ss_pred CeEEEEEcCCCCEEEEEEcCCCeEEEEEEeecCCceeeeeeEeeecCCcEEEEEEE----CCeEEEEEEeCCCEEEEEEE Q lcl|NC_011107. 542 PAEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFT----GDNLMVLIQKGQEIALGRMH 617 (826) Q Consensus 542 ~v~~~~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~----~d~l~~vv~r~~~~~~~r~~ 617 (826) +|.+|+++++|++++||+++||+|++|+|+ +||+|.|||||+++|+|++||++ +|+||++|+|++++..++| T Consensus 423 ~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~---~eq~v~aW~~~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~y- 498 (681) T protein:vir:10 423 DILDMAYAKAPQPIVWFISSSGKLLGLTYV---PEQQIGAWHQHDTDGVFESCAVVAEGNEDRLYAVVRRTIGGNEVRY- 498 (681) T ss_pred CeEEEEEecCCCEEEEEEecCCcEEEEEEe---cccceeeEEEEecCCcEEEEEEecCCCCcEEEEEEEecCCCCeEEE- Confidence 899999999999999999999999999997 88999999999999999999999 6899999999999988887 Q ss_pred EeecCcccCCCcccccccceEEEeecccceeccceeeccCCcccceeeEEecCceeeeeecccceecCCceEEEecCCCC Q lcl|NC_011107. 618 LNSLPAREGLQYPKYDYWRRIEATVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKRETNTKVFLDVPEAVV 697 (826) Q Consensus 618 ~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~~~~~~ 697 (826) +|+++..... ....++++||++++.+. +...++++.|++|+.+.+.++|............+.++.+ T Consensus 499 ie~~~~~~~~-------~~~~~~~vD~~~t~~~~------~~~~~sgl~~leG~tv~i~aDG~~~~~~~V~~G~itl~~~ 565 (681) T protein:vir:10 499 VERMASRQFD-------AQADAFFVDSGLTYSGE------PVSHISGLEHLEGKTVSILADGAVHPQRVVTDGAIDLDVE 565 (681) T ss_pred EEecCCcccc-------ccccceEeeccccccCc------ceeeeccccCCCCcEEEEEeCCeecCcEeecCcEEEeCcC Confidence 7777654321 11224678898887653 3345678999999999999999877655444456677788 Q ss_pred CceEEEeeeeeEEEEeCCeeEecCCCCceeecceEEEEEEEEeeccceEEEEecCCCCccceeeeccCccccccccccCc Q lcl|NC_011107. 698 GAVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGE 777 (826) Q Consensus 698 ~~~v~vGl~y~~~~~~~~~~i~~~~g~~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 777 (826) +.+|+|||+|+++++|+||+++.++|.+.+ .+++|+|+.|++++|.+++++++.+..+. .....+.++ ... T Consensus 566 ~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g-~~~ri~rv~lr~~~S~g~~~~~~~~~l~~--~~~~~~~~~------g~~ 636 (681) T protein:vir:10 566 AGTVHIGLPITAELQTLPVAMQLDGSFGQG-RVKNINKLWLRVHRSSGIFAGPHADALTE--VKQRTSEPY------GSP 636 (681) T ss_pred CceEEEeeeceeEEEecceeeecCCcccCC-ceEEEEEEEEEEEcccceEEeeCCCceEE--EEEeccccc------ccc Confidence 999999999999999999999999887654 36799999999999999999887654332 112222121 223 Q ss_pred cccceeEEEEEec-ccCceeEEEEEECCCCCEEEEEEEEEEEeec Q lcl|NC_011107. 778 PLVDSAVVPLPAR-VDMATSKFELSCHSPYDMNVRAVEYNFKSNQ 821 (826) Q Consensus 778 ~~~~tg~~~vp~~-~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~ 821 (826) +++.||++++|+. +|+++.+|+|+|++|+||+|++|+||-...- T Consensus 637 ~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~vgg 681 (681) T protein:vir:10 637 PALKSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIAIGA 681 (681) T ss_pred CCccCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEEeeC Confidence 5679999999974 8899999999999999999999999998888 No 22 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=100.00 E-value=3.2e-150 Score=840.15 Aligned_cols=657 Identities=14% Similarity=0.132 Sum_probs=480.8 Q ss_pred CCceeeechhhhcc-----cccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCc-cccceeEEEEEcCCC Q lcl|NC_011107. 1 MSYKQSAYPNLLMG-----VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQ-PWPRPFLYHTNLGGR 74 (826) Q Consensus 1 M~~v~~s~~n~~~G-----VSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~-~~~~~~~~~~~rd~~ 74 (826) |+++....+||.+| +..|.|++||+++|++|+||++.|+||++||||++|+++++++.. .+++||.|.. T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~----- 75 (681) T protein:vir:10 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSV----- 75 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCC----- Confidence 99999999999999 668999999999999999999999999999999999999998864 5689999864 Q ss_pred ceEEEEEecCCeEEEEEcCCCEEEEec---CccccccccCCccceEEEEEcCEEEEEeCcccCcccccccCCCCCCccEE Q lcl|NC_011107. 75 SIAMLVAQHRGELYLFDERDGRLLMGQ---PLVHDYLKANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAGW 151 (826) Q Consensus 75 e~~~~~~~~~g~i~v~~~~~g~~~~~~---~~~~~y~~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~ 151 (826) ||+|++|+++++||||. +++.++..+ ....+|.+ ....+|+|+|+||++||||++|||++ +.+..+..|.+ T Consensus 76 ~~~~~l~~g~~~~r~~~-~~~~~~~~~~~~~~~tpy~~-~~l~~l~~~q~aD~~~i~h~~~~p~~----L~r~~~~~W~l 149 (681) T protein:vir:10 76 TQTMVIELGAGYFRFHT-NGGTLLDGAVPYEIANPYAE-ADLFNIHYVQSADVLTLVHPNYAPRE----LRRLGATNWQL 149 (681) T ss_pred CceEEEEEeCCeEEEEe-CCcEEeeCcEeEEecCCCCh-hhhcCceEEEEcCEEEEECCCCcceE----EEEccCCceEE Confidence 89999999999999994 455544321 12345644 45678999999999999999999985 44455566655 Q ss_pred EEEc--ccccCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeeccceEEEeeccc Q lcl|NC_011107. 152 LYIK--AGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTK 229 (826) Q Consensus 152 ~~v~--~g~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga~~~t~~~~~~ 229 (826) ..+. .+.+. .-++ +++. ..+. . .....+.. T Consensus 150 ~~~~f~~~p~~-p~~~---------------~at~--~~~~--------~--~~t~~~~v-------------------- 181 (681) T protein:vir:10 150 ATIAFTSPVAT-PTSV---------------TATS--NNKG--------T--DYTYRYVV-------------------- 181 (681) T ss_pred EEEEecccccc-ceee---------------eeec--cCCc--------c--ceeEeEEE-------------------- Confidence 4321 11110 0000 0000 0000 0 00000000 Q ss_pred ccceeeccccceeecccccccccccceE-EEecCCceEEEEecCCCCcceEEEEEEeecccccccccccCcccccEeeee Q lcl|NC_011107. 230 KYPKVDPDANAATIAGYLNQRGVQDGYI-AFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQF 308 (826) Q Consensus 230 ~~~~~a~~~~~~t~a~~~~~~~~~~g~~-~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~ 308 (826) .+.+. ..+.. ......+......+++ . T Consensus 182 ----~avda--------------~t~~~s~~~~~~tvt~~~~~~~----------------------------------~ 209 (681) T protein:vir:10 182 ----TALDA--------------EGKTESAPSSAGTCTNNLFTNG----------------------------------G 209 (681) T ss_pred ----EEeec--------------ccceeecCCcceEEeeeeecCC----------------------------------c Confidence 00000 00000 0000000000000000 0 Q ss_pred eeeeEeccCCCCcceEEEEEcCCceEEEeecccccccccceeEEEEEecCCCeEEEeccCcCccccCCccccCCccccCC Q lcl|NC_011107. 309 MDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTR 388 (826) Q Consensus 309 ~~~~~~~~~~~~~~~y~~~~~~~~~w~e~~~~g~~~~~~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~ 388 (826) ...+.|..+.... .|-.+....++|... +... + + .+......+......+...+++.++ ++ T Consensus 210 ~~t~~w~a~~g~~-~~~V~~~~~gi~g~i-g~~~--~--~------------~~~~~~~~~~~~~t~~~~~~~~~~~-~g 270 (681) T protein:vir:10 210 ANTIAWSASSGAS-RYNVYKEQGGLYGYI-GQTT--G--T------------SLVDDNIAPDLSVTPPIYDAVFNAA-GD 270 (681) T ss_pred ceeEEEEecCCce-eeeecccceeEEEEe-eccc--e--e------------eeeecccccCccccccccccccccC-CC Confidence 1111222222221 111111222233221 1110 0 0 0111111122222223334555443 46 Q ss_pred CccEEEEEcceEEEe----cCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEE Q lcl|NC_011107. 389 GITGMTTFQGRLVLL----SQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQA 464 (826) Q Consensus 389 ~~~~v~~~q~RL~f~----~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~ 464 (826) ||++|+||||||+|+ +||+|||||+||||||+++++ +.|||||++++++++++.|+|+++++ +|+|||+++|| T Consensus 271 yP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~--~~ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~ 347 (681) T protein:vir:10 271 YPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLP--VRDDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEW 347 (681) T ss_pred ceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccCC--CCCCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEE Confidence 999999999999999 589999999999999999984 58999999999999999999999995 79999999999 Q ss_pred EEeC--CccccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCC- Q lcl|NC_011107. 465 VVPG--GGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPG- 541 (826) Q Consensus 465 ~i~~--~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~- 541 (826) .|++ +++|||+|++++++|.|++ ++++|+.+|++++|++++|+ .||||.|+.+.+ .|+++|+|++++|++++ T Consensus 348 ~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~fv~~~g~---~vre~~y~~~~d-~~~~~dlt~~a~Hl~~~~ 422 (681) T protein:vir:10 348 RVASVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTIYGAARGG---HVRELAYNWQAN-GFVTGDLSLRAAHLFDNL 422 (681) T ss_pred EEecCCCccccceeEEEEEeeeecc-ccccceeeCCeEEEEecCCC---EEEEEEEeeecC-ceeccchhhhhhhhcCCC Confidence 9987 4699999999999999976 57999999999999999884 799999997666 49999999999999997 Q ss_pred CeEEEEEcCCCCEEEEEEcCCCeEEEEEEeecCCceeeeeeEeeecCCcEEEEEEE----CCeEEEEEEeCCCEEEEEEE Q lcl|NC_011107. 542 PAEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFT----GDNLMVLIQKGQEIALGRMH 617 (826) Q Consensus 542 ~v~~~~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~----~d~l~~vv~r~~~~~~~r~~ 617 (826) +|.+|+++++|++++||+++||+|++|+|+ +||+|.|||||+++|+|++||++ +|+||++|+|++++..++| T Consensus 423 ~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~---~eq~v~aW~~~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~y- 498 (681) T protein:vir:10 423 DILDMAYAKAPQPIVWFISSSGKLLGLTYV---PEQQIGAWHQHDTDGVFESCAVVAEGNEDRLYAVVRRTIGGNEVRY- 498 (681) T ss_pred CeEEEEEecCCCEEEEEEecCCcEEEEEEe---cccceeeEEEEecCCcEEEEEEecCCCCcEEEEEEEecCCCCeEEE- Confidence 899999999999999999999999999997 88999999999999999999999 6899999999999988887 Q ss_pred EeecCcccCCCcccccccceEEEeecccceeccceeeccCCcccceeeEEecCceeeeeecccceecCCceEEEecCCCC Q lcl|NC_011107. 618 LNSLPAREGLQYPKYDYWRRIEATVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKRETNTKVFLDVPEAVV 697 (826) Q Consensus 618 ~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~~~~~~ 697 (826) +|+++..... ....++++||++++.+. +...++++.|++|+.+.+.++|............+.++.+ T Consensus 499 ie~~~~~~~~-------~~~~~~~vD~~~t~~~~------~~~~~sgl~~leG~tv~i~aDG~~~~~~~V~~G~itl~~~ 565 (681) T protein:vir:10 499 VERMASRQFD-------AQADAFFVDSGLTYSGE------PVSHISGLEHLEGKTVSILADGAVHPQRVVTDGAIDLDVE 565 (681) T ss_pred EEecCCcccc-------ccccceEeeccccccCc------ceeeeccccCCCCcEEEEEeCCeecCcEeecCcEEEeCcC Confidence 7777654321 11224678898887653 3345678999999999999999877655444456677788 Q ss_pred CceEEEeeeeeEEEEeCCeeEecCCCCceeecceEEEEEEEEeeccceEEEEecCCCCccceeeeccCccccccccccCc Q lcl|NC_011107. 698 GAVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGE 777 (826) Q Consensus 698 ~~~v~vGl~y~~~~~~~~~~i~~~~g~~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 777 (826) +.+|+|||+|+++++|+||+++.++|.+.+ .+++|+|+.|++++|.+++++++.+..+. .....+.++ ... T Consensus 566 ~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g-~~~ri~rv~lr~~~S~g~~~~~~~~~l~~--~~~~~~~~~------g~~ 636 (681) T protein:vir:10 566 AGTVHIGLPITAELQTLPVAMQLDGSFGQG-RVKNINKLWLRVHRSSGIFAGPHADALTE--VKQRTSEPY------GSP 636 (681) T ss_pred CceEEEeeeceeEEEecceeeecCCcccCC-ceEEEEEEEEEEEcccceEEeeCCCceEE--EEEeccccc------ccc Confidence 999999999999999999999999887654 36799999999999999999887654332 112222121 223 Q ss_pred cccceeEEEEEec-ccCceeEEEEEECCCCCEEEEEEEEEEEeec Q lcl|NC_011107. 778 PLVDSAVVPLPAR-VDMATSKFELSCHSPYDMNVRAVEYNFKSNQ 821 (826) Q Consensus 778 ~~~~tg~~~vp~~-~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~ 821 (826) +++.||++++|+. +|+++.+|+|+|++|+||+|++|+||-...- T Consensus 637 ~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~vgg 681 (681) T protein:vir:10 637 PALKSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIAIGA 681 (681) T ss_pred CCccCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEEeeC Confidence 5679999999974 8899999999999999999999999998888 No 23 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=100.00 E-value=3.2e-150 Score=840.15 Aligned_cols=657 Identities=14% Similarity=0.132 Sum_probs=480.8 Q ss_pred CCceeeechhhhcc-----cccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCc-cccceeEEEEEcCCC Q lcl|NC_011107. 1 MSYKQSAYPNLLMG-----VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQ-PWPRPFLYHTNLGGR 74 (826) Q Consensus 1 M~~v~~s~~n~~~G-----VSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~-~~~~~~~~~~~rd~~ 74 (826) |+++....+||.+| +..|.|++||+++|++|+||++.|+||++||||++|+++++++.. .+++||.|.. T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~----- 75 (681) T protein:vir:98 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSV----- 75 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCC----- Confidence 99999999999999 668999999999999999999999999999999999999998864 5689999864 Q ss_pred ceEEEEEecCCeEEEEEcCCCEEEEec---CccccccccCCccceEEEEEcCEEEEEeCcccCcccccccCCCCCCccEE Q lcl|NC_011107. 75 SIAMLVAQHRGELYLFDERDGRLLMGQ---PLVHDYLKANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAGW 151 (826) Q Consensus 75 e~~~~~~~~~g~i~v~~~~~g~~~~~~---~~~~~y~~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~ 151 (826) ||+|++|+++++||||. +++.++..+ ....+|.+ ....+|+|+|+||++||||++|||++ +.+..+..|.+ T Consensus 76 ~~~~~l~~g~~~~r~~~-~~~~~~~~~~~~~~~tpy~~-~~l~~l~~~q~aD~~~i~h~~~~p~~----L~r~~~~~W~l 149 (681) T protein:vir:98 76 TQTMVIELGAGYFRFHT-NGGTLLDGAVPYEIANPYAE-ADLFNIHYVQSADVLTLVHPNYAPRE----LRRLGATNWQL 149 (681) T ss_pred CceEEEEEeCCeEEEEe-CCcEEeeCcEeEEecCCCCh-hhhcCceEEEEcCEEEEECCCCcceE----EEEccCCceEE Confidence 89999999999999994 455544321 12345644 45678999999999999999999985 44455566655 Q ss_pred EEEc--ccccCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeeccceEEEeeccc Q lcl|NC_011107. 152 LYIK--AGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTK 229 (826) Q Consensus 152 ~~v~--~g~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga~~~t~~~~~~ 229 (826) ..+. .+.+. .-++ +++. ..+. . .....+.. T Consensus 150 ~~~~f~~~p~~-p~~~---------------~at~--~~~~--------~--~~t~~~~v-------------------- 181 (681) T protein:vir:98 150 ATIAFTSPVAT-PTSV---------------TATS--NNKG--------T--DYTYRYVV-------------------- 181 (681) T ss_pred EEEEecccccc-ceee---------------eeec--cCCc--------c--ceeEeEEE-------------------- Confidence 4321 11110 0000 0000 0000 0 00000000 Q ss_pred ccceeeccccceeecccccccccccceE-EEecCCceEEEEecCCCCcceEEEEEEeecccccccccccCcccccEeeee Q lcl|NC_011107. 230 KYPKVDPDANAATIAGYLNQRGVQDGYI-AFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQF 308 (826) Q Consensus 230 ~~~~~a~~~~~~t~a~~~~~~~~~~g~~-~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~ 308 (826) .+.+. ..+.. ......+......+++ . T Consensus 182 ----~avda--------------~t~~~s~~~~~~tvt~~~~~~~----------------------------------~ 209 (681) T protein:vir:98 182 ----TALDA--------------EGKTESAPSSAGTCTNNLFTNG----------------------------------G 209 (681) T ss_pred ----EEeec--------------ccceeecCCcceEEeeeeecCC----------------------------------c Confidence 00000 00000 0000000000000000 0 Q ss_pred eeeeEeccCCCCcceEEEEEcCCceEEEeecccccccccceeEEEEEecCCCeEEEeccCcCccccCCccccCCccccCC Q lcl|NC_011107. 309 MDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTR 388 (826) Q Consensus 309 ~~~~~~~~~~~~~~~y~~~~~~~~~w~e~~~~g~~~~~~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~ 388 (826) ...+.|..+.... .|-.+....++|... +... + + .+......+......+...+++.++ ++ T Consensus 210 ~~t~~w~a~~g~~-~~~V~~~~~gi~g~i-g~~~--~--~------------~~~~~~~~~~~~~t~~~~~~~~~~~-~g 270 (681) T protein:vir:98 210 ANTIAWSASSGAS-RYNVYKEQGGLYGYI-GQTT--G--T------------SLVDDNIAPDLSVTPPIYDAVFNAA-GD 270 (681) T ss_pred ceeEEEEecCCce-eeeecccceeEEEEe-eccc--e--e------------eeeecccccCccccccccccccccC-CC Confidence 1111222222221 111111222233221 1110 0 0 0111111122222223334555443 46 Q ss_pred CccEEEEEcceEEEe----cCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEE Q lcl|NC_011107. 389 GITGMTTFQGRLVLL----SQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQA 464 (826) Q Consensus 389 ~~~~v~~~q~RL~f~----~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~ 464 (826) ||++|+||||||+|+ +||+|||||+||||||+++++ +.|||||++++++++++.|+|+++++ +|+|||+++|| T Consensus 271 yP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~--~~ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~ 347 (681) T protein:vir:98 271 YPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLP--VRDDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEW 347 (681) T ss_pred ceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccCC--CCCCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEE Confidence 999999999999999 589999999999999999984 58999999999999999999999995 79999999999 Q ss_pred EEeC--CccccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCC- Q lcl|NC_011107. 465 VVPG--GGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPG- 541 (826) Q Consensus 465 ~i~~--~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~- 541 (826) .|++ +++|||+|++++++|.|++ ++++|+.+|++++|++++|+ .||||.|+.+.+ .|+++|+|++++|++++ T Consensus 348 ~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~fv~~~g~---~vre~~y~~~~d-~~~~~dlt~~a~Hl~~~~ 422 (681) T protein:vir:98 348 RVASVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTIYGAARGG---HVRELAYNWQAN-GFVTGDLSLRAAHLFDNL 422 (681) T ss_pred EEecCCCccccceeEEEEEeeeecc-ccccceeeCCeEEEEecCCC---EEEEEEEeeecC-ceeccchhhhhhhhcCCC Confidence 9987 4699999999999999976 57999999999999999884 799999997666 49999999999999997 Q ss_pred CeEEEEEcCCCCEEEEEEcCCCeEEEEEEeecCCceeeeeeEeeecCCcEEEEEEE----CCeEEEEEEeCCCEEEEEEE Q lcl|NC_011107. 542 PAEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFT----GDNLMVLIQKGQEIALGRMH 617 (826) Q Consensus 542 ~v~~~~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~----~d~l~~vv~r~~~~~~~r~~ 617 (826) +|.+|+++++|++++||+++||+|++|+|+ +||+|.|||||+++|+|++||++ +|+||++|+|++++..++| T Consensus 423 ~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~---~eq~v~aW~~~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~y- 498 (681) T protein:vir:98 423 DILDMAYAKAPQPIVWFISSSGKLLGLTYV---PEQQIGAWHQHDTDGVFESCAVVAEGNEDRLYAVVRRTIGGNEVRY- 498 (681) T ss_pred CeEEEEEecCCCEEEEEEecCCcEEEEEEe---cccceeeEEEEecCCcEEEEEEecCCCCcEEEEEEEecCCCCeEEE- Confidence 899999999999999999999999999997 88999999999999999999999 6899999999999988887 Q ss_pred EeecCcccCCCcccccccceEEEeecccceeccceeeccCCcccceeeEEecCceeeeeecccceecCCceEEEecCCCC Q lcl|NC_011107. 618 LNSLPAREGLQYPKYDYWRRIEATVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKRETNTKVFLDVPEAVV 697 (826) Q Consensus 618 ~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~~~~~~ 697 (826) +|+++..... ....++++||++++.+. +...++++.|++|+.+.+.++|............+.++.+ T Consensus 499 ie~~~~~~~~-------~~~~~~~vD~~~t~~~~------~~~~~sgl~~leG~tv~i~aDG~~~~~~~V~~G~itl~~~ 565 (681) T protein:vir:98 499 VERMASRQFD-------AQADAFFVDSGLTYSGE------PVSHISGLEHLEGKTVSILADGAVHPQRVVTDGAIDLDVE 565 (681) T ss_pred EEecCCcccc-------ccccceEeeccccccCc------ceeeeccccCCCCcEEEEEeCCeecCcEeecCcEEEeCcC Confidence 7777654321 11224678898887653 3345678999999999999999877655444456677788 Q ss_pred CceEEEeeeeeEEEEeCCeeEecCCCCceeecceEEEEEEEEeeccceEEEEecCCCCccceeeeccCccccccccccCc Q lcl|NC_011107. 698 GAVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGE 777 (826) Q Consensus 698 ~~~v~vGl~y~~~~~~~~~~i~~~~g~~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 777 (826) +.+|+|||+|+++++|+||+++.++|.+.+ .+++|+|+.|++++|.+++++++.+..+. .....+.++ ... T Consensus 566 ~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g-~~~ri~rv~lr~~~S~g~~~~~~~~~l~~--~~~~~~~~~------g~~ 636 (681) T protein:vir:98 566 AGTVHIGLPITAELQTLPVAMQLDGSFGQG-RVKNINKLWLRVHRSSGIFAGPHADALTE--VKQRTSEPY------GSP 636 (681) T ss_pred CceEEEeeeceeEEEecceeeecCCcccCC-ceEEEEEEEEEEEcccceEEeeCCCceEE--EEEeccccc------ccc Confidence 999999999999999999999999887654 36799999999999999999887654332 112222121 223 Q ss_pred cccceeEEEEEec-ccCceeEEEEEECCCCCEEEEEEEEEEEeec Q lcl|NC_011107. 778 PLVDSAVVPLPAR-VDMATSKFELSCHSPYDMNVRAVEYNFKSNQ 821 (826) Q Consensus 778 ~~~~tg~~~vp~~-~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~ 821 (826) +++.||++++|+. +|+++.+|+|+|++|+||+|++|+||-...- T Consensus 637 ~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~vgg 681 (681) T protein:vir:98 637 PALKSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIAIGA 681 (681) T ss_pred CCccCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEEeeC Confidence 5679999999974 8899999999999999999999999998888 No 24 >protein:vir:7329 Length: 825 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848220;genbank:gi:30387391;genbank:GeneID:2641863 Probab=100.00 E-value=3.4e-149 Score=834.53 Aligned_cols=708 Identities=13% Similarity=0.126 Sum_probs=480.4 Q ss_pred CCceeeechhhhcc-----cccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCC-ccccceeEEEEEcCCC Q lcl|NC_011107. 1 MSYKQSAYPNLLMG-----VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTD-QPWPRPFLYHTNLGGR 74 (826) Q Consensus 1 M~~v~~s~~n~~~G-----VSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~-~~~~~~~~~~~~rd~~ 74 (826) |+. +...+||.+| +..|.|++||.++|++|+||+++|+||++||||++||+++++++ +.+++||.|+. T Consensus 1 m~~-~~~q~sF~~GElsP~l~gR~Dl~~y~~g~~~~~N~~~~p~Gg~~rRpGt~fva~~~~~~~~~rLipF~fs~----- 74 (825) T protein:vir:73 1 MAF-SWIQPSFAGGEIGPSLYGRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPAKYPDRKCRLIPFQFST----- 74 (825) T ss_pred Ccc-ceeccccccceechhhcccchHHHHHHHHHHhcCcEEEecCCceecCchHHhHhhcCCCCCEEEEEEEeCC----- Confidence 873 4566899999 77899999999999999999999999999999999999999875 45789998754 Q ss_pred ceEEEEEecCCeEEEEEcCCCEEEEecC----ccccccccCCccceEEEEEcCEEEEEeCcccCcccccccCCCCCCccE Q lcl|NC_011107. 75 SIAMLVAQHRGELYLFDERDGRLLMGQP----LVHDYLKANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAG 150 (826) Q Consensus 75 e~~~~~~~~~g~i~v~~~~~g~~~~~~~----~~~~y~~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a 150 (826) ||+|++++++++||||.. ++.++...+ ...+|. +....+|+++|+||++||+|+++||++ +.+.....|. T Consensus 75 ~q~y~Lefg~~~lrv~~~-gg~v~~~~~~~~e~~TPy~-~~~l~~l~~~QsaD~~~i~h~~~pp~~----L~r~~~~~W~ 148 (825) T protein:vir:73 75 VQTYALEFGHNYMRVIKD-GAYVLTTSNVIYELAMPYA-DTDLFRIKFTQSADVLTLVHPAYPPKE----LRRYAHDNWQ 148 (825) T ss_pred CcEEEEEEeCCeEEEEeC-CceEeccCCceEEEecccc-hhhhhhheeeeecCEEEEEcCCCceeE----EEEecCCCcE Confidence 799999999999999975 444432221 345563 346778999999999999999999985 3334444555 Q ss_pred EEEE--ccccc---CceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeeccceEEEe Q lcl|NC_011107. 151 WLYI--KAGQY---SKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLP 225 (826) Q Consensus 151 ~~~v--~~g~y---~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga~~~t~~ 225 (826) +..+ ..+.+ +.+.++++..++.....+...+++...++... .... +.......+. .|... T Consensus 149 l~~~~f~~gp~~~in~~~sv~v~asg~tg~~TiTaS~a~~~~~~vG---------~~i~---~~~~~v~si~---~~~~~ 213 (825) T protein:vir:73 149 IVDVTTKNGPFEDINVDETVKVYASASTGTITLTASSAIFGAEQVG---------KLFY---LEQPAVDSVP---VWETS 213 (825) T ss_pred EEEEeccCCccccccccccceeeecccCceeEEEeeccccCchhcC---------eEEE---Eecccccccc---eeeee Confidence 5332 22222 11222223222222111111111111111000 0000 0000000000 00000 Q ss_pred ecccccceeeccccceeecccccccccccceEEEecCCceEEEEecCCCCcceEEEEEEeecccccccccccCcccccEe Q lcl|NC_011107. 226 NSTKKYPKVDPDANAATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVG 305 (826) Q Consensus 226 ~~~~~~~~~a~~~~~~t~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~ 305 (826) .. ...+.++.. . +..+... ...+.....|+... +.. T Consensus 214 ~~-----------------------~~~~~v~~~--~-----------~~~~~~~-------~~~~~~t~~~~a~~-g~~ 249 (825) T protein:vir:73 214 KT-----------------------TAINDVRRA--D-----------SNYYRAN-------TSGKTGTLRPSHTE-GMS 249 (825) T ss_pred eE-----------------------EEeeeEEEC--C-----------Cceeeee-------cccccceeeccccC-Cce Confidence 00 000000000 0 0000000 00000001111100 000 Q ss_pred eeeeeeeEeccCCCCc-ceEEEEEcCCceEEEeeccccccc-----ccceeEEEEEecCCCeEEEeccCcCccccCCccc Q lcl|NC_011107. 306 VQFMDGAVMATGSTKA-PVYFEWDSANRRWAERAAYGTDWV-----LKKMPLALRWDEATDTYSLNELEYDRRGSGDEDT 379 (826) Q Consensus 306 ~~~~~~~~~~~~~~~~-~~y~~~~~~~~~w~e~~~~g~~~~-----~~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~t 379 (826) . . .......... ..|..+..+.+.++.+..++.... ...+|+..+ ..++++++++...|.++ T Consensus 250 ~---~-~~~g~~~~~~~~~~~~~~~~~g~~~it~~~~~~~~~~~~~~~~~~~~~~-~~~~~t~~~~~~~~~~~------- 317 (825) T protein:vir:73 250 W---D-GWGGTGSDDTGIQWEYLHSGFGIAKITAVAGDGLTATADVVSFIPSQVV-GSANASYKWAKYAWNSV------- 317 (825) T ss_pred e---E-eeeeecccCCceEEEEEecCCceEEEeeccccceeeccccceecccccc-cCCCCCcccccCCcccC------- Confidence 0 0 0000011111 122233445555655544332111 112444443 34556666666666432 Q ss_pred cCCccccCCCccEEEEEcceEEEe----cCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcE Q lcl|NC_011107. 380 NPTFNFVTRGITGMTTFQGRLVLL----SQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDL 455 (826) Q Consensus 380 np~psf~g~~~~~v~~~q~RL~f~----~~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L 455 (826) | .||++|+||||||+|+ +|++|||||+||||||+++++ ++|||||+++++++++|.|+|+++++ +| T Consensus 318 ~-------gyPs~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~~--~~DdD~I~~~~s~~~~~~i~~~~~~~-~L 387 (825) T protein:vir:73 318 N-------GYPSTVVYYQQRLYFAASTAYPQTIWASRTGDYKDFGKNNP--IQDDDRIIYTYAGRQVNEIRHLIDVG-NL 387 (825) T ss_pred C-------CCccEEEEEcceEEEeecCCCCCEEEEEccCCccccccCCC--CCCCccEEEEEcCCcceeEEEEeecC-cE Confidence 3 4777899999999999 589999999999999999984 68999999999999999999999995 89 Q ss_pred EEEecCcEEEEeCC--ccccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHH Q lcl|NC_011107. 456 IVFAKKYQAVVPGG--GIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTS 533 (826) Q Consensus 456 ~l~t~~~q~~i~~~--~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~ 533 (826) +|||+++||+|+++ ++|||+|++++++|+|+++ +++|+.+|++++|++++|+ +||||.|+.+.+ .|+++|+|+ T Consensus 388 ~~~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~~-~~~Pv~vg~~~~Fv~~~g~---~vre~~~~~~~d-~~~~~dlt~ 462 (825) T protein:vir:73 388 VALTSGGEYTISGDQNKVLTPSAFSFSSQGNNGSS-NVPPIAVANIALFIQEKGS---VVRDLAYSFDVD-GYQGTDLTI 462 (825) T ss_pred EEEecCceEEEecCCCcccceeeEEEEeeeeeccc-cccceEeCCeEEEEeCCCC---eEEEEEEeeecC-ceeccchhh Confidence 99999999999875 6999999999999999765 6999999999999998774 799999997665 599999999 Q ss_pred HHHHhcCC-CeEEEEEcCCCCEEEEEEcCCCeEEEEEEeecCCceeeeeeEeeecCCcEEEEEEE----CCeEEEEEEeC Q lcl|NC_011107. 534 HIPSYMPG-PAEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFT----GDNLMVLIQKG 608 (826) Q Consensus 534 ~~~~~~~~-~v~~~~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~----~d~l~~vv~r~ 608 (826) |++|++++ ++.+|+++++|++++|++++||+|++|+|+ +||+|.|||||+++|+|++||++ +|+||++|+|+ T Consensus 463 ~a~hl~~~~~~~~~a~~~~p~~~~~~v~~dg~l~~~ty~---~~q~v~aW~~~~~~g~v~~~~~i~~~~~D~l~~iV~R~ 539 (825) T protein:vir:73 463 LANHLFQKHSIVDWSFCIVPYSSAFCIRDDGKLLVLTYL---RDQQVFAWAPQSSAGKYESTCSISEGSEDAVYFVVNRT 539 (825) T ss_pred hhHhhccCCceEEEEEcCCCceEEEEEecCCeEEEEEEe---ccccceeeEEEecCCcEEEEEEecCCCccEEEEEEEEe Confidence 99999997 799999999999999999999999999997 89999999999999999999999 68999999999 Q ss_pred CCEEEEEEEEeecCcccCCCcc---cccccce-----------------EEEeec--------c-cce------------ Q lcl|NC_011107. 609 QEIALGRMHLNSLPAREGLQYP---KYDYWRR-----------------IEATVD--------G-ELE------------ 647 (826) Q Consensus 609 ~~~~~~r~~~~~~~~~~~~~~~---~~d~~~~-----------------~~~~~~--------~-~~~------------ 647 (826) ++++..|| +|++....+.+.+ ++|.-.. .+...+ + ..+ T Consensus 540 ~~g~~~~y-iE~~~~~~~~~~~~~~~vD~g~~~~g~~~~~~l~~l~g~tv~~~~~g~~~~~v~~g~itl~~~~~~~i~l~ 618 (825) T protein:vir:73 540 INGQTVRY-IERLSSRLFTNDEDAFFVDCGLSYDGRNTSSRTMTISGGTGDWSYQVDYPVTVSGGAYFVNTDVGAQIQFP 618 (825) T ss_pred eCCceEEE-EEEecccccCCCcceeEEEEEeeecccceeeceeeeCCceEEEEeCCeEEEEEcCCeEEecccceEEEEec Confidence 99888887 6666544332221 2221000 000000 0 000 Q ss_pred eccce------------------------------------e-----eccCCcccceeeEEecCceeeeeecccceecCC Q lcl|NC_011107. 648 LTKQH------------------------------------W-----DLIKDASAVYQLQPVAGAYMERTHLGVKRETNT 686 (826) Q Consensus 648 ~~~~~------------------------------------~-----~~~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~ 686 (826) +.... . ....+...+++|+||||++|.+++||.+.+... T Consensus 619 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~v~~~~~~~a~~~~~~~t~~~~a~~~~~gL~hLeG~~v~v~~Dg~~~~~~~ 698 (825) T protein:vir:73 619 YTGTDPDTNEPVAKELRGDIISVTSNTAVVVRFNRNVPPVLRNVATTNWQMARQTFSGLAHLEGQTVNILSDASVEPQKT 698 (825) T ss_pred ccCcccccccceeceeeEEEccccCceEEEEEecccccceeeeecccCCCcchheeccccccCCceEEEEECCeeeCCeE Confidence 00000 0 001122345789999999999999998887766 Q ss_pred ceEEEecCCCCCceEEEeeeeeEEEEeCCeeEecCCCCceeecc-eEEEEEEEEeeccceEEEEecCCCCccceeeeccC Q lcl|NC_011107. 687 KVFLDVPEAVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTR-AVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTP 765 (826) Q Consensus 687 ~~~~~~~~~~~~~~v~vGl~y~~~~~~~~~~i~~~~g~~~~~gr-~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~ 765 (826) +....+.++.++++|||||+|+++++++||++..+ |. .+|| +||+++.++|++|.++.++.+.+..+.. + T Consensus 699 V~~G~vtl~~~~~~v~vGl~y~~~~~~l~~~~~~~-g~--~~g~~~ri~~~~~~~~~s~~~~~g~~~~~l~~~------~ 769 (825) T protein:vir:73 699 VTGGAVTLESPGAVVHIGLPITAEFETLDININGQ-ET--LLDKKQVIPTVTMVVNASRGIWATTPGGTWYEY------P 769 (825) T ss_pred ecCcEEEecCCceEEEEeeCccceEEecccccCCC-cc--ccCccEEEEEEEEEEEeeeeEEEecCCCcceEe------e Confidence 66667777788999999999999999999998643 54 3454 5899999999999999988655432221 2 Q ss_pred ccccccccccCcc-ccceeEEEEEe-cccCceeEEEEEECCCCCEEEEEEEEEEEeecc Q lcl|NC_011107. 766 LRLFSRQLNAGEP-LVDSAVVPLPA-RVDMATSKFELSCHSPYDMNVRAVEYNFKSNQT 822 (826) Q Consensus 766 ~~~~~~~~~~~~~-~~~tg~~~vp~-~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r 822 (826) .+-. + ..++| +++||++++++ .+|+++.+|+|+|++|||||||+|..|...+-= T Consensus 770 ~r~~--~-~~~~~~~~~tG~~~~~~~~~~~~~~~~~i~q~~PlP~tvlav~~~~~~~g~ 825 (825) T protein:vir:73 770 QREF--E-FYDDPVDDATGKVEVKLDSNWDKNGRVKVRQLDPLPLSVLAVLPRLTVGGF 825 (825) T ss_pred ccCC--C-cccCCCccccCcEEEecCCCCCCccEEEEEEcCCCCEEEEEEEEEEEecCC Confidence 2211 1 13444 68999999987 799999999999999999999999988775544 No 25 >protein:vir:102644 Length: 594 # NCBI annotation: Hypothetical protein # Family: family:all:780 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024422;genbank:gi:48696643;genbank:GeneID:2948111 Probab=100.00 E-value=3.3e-131 Score=735.90 Aligned_cols=554 Identities=12% Similarity=0.070 Sum_probs=436.5 Q ss_pred CCceeeechhhhcc-----cccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCC-ccccceeEEEEEcCCC Q lcl|NC_011107. 1 MSYKQSAYPNLLMG-----VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTD-QPWPRPFLYHTNLGGR 74 (826) Q Consensus 1 M~~v~~s~~n~~~G-----VSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~-~~~~~~~~~~~~rd~~ 74 (826) |+.+ +..||.+| +..|.|++||+++|++|+||++.|+||+.||||++|++++++++ +.++.||.|. . T Consensus 1 m~~~--~~~~F~~GelsP~l~~r~Dl~~y~~~~~~~~n~~~~~~G~~~rR~G~~~~~~~~~~~~~~~lipF~~s-----~ 73 (594) T protein:vir:10 1 MADF--SQTSFKGGVIAPRLQFNEYESAYHHSIEDAVNFVVTEQGSLITRCGSEEVGLCQDGEVRLFRLPAVDA-----P 73 (594) T ss_pred Ccee--eccccCcceecceeccchhHHHHHHHHhhhhceEEEecCCeecCChhHhhhhccCCCCCEEEEEEEeC-----C Confidence 9998 47899999 55799999999999999999999999999999999999999875 4678999986 4 Q ss_pred ceEEEEEecCCeEEEEEcCCCEEEE-ecCcc----ccccc--cCCccceEEEEEcCEEEEEeCcccCcccccccCCCCCC Q lcl|NC_011107. 75 SIAMLVAQHRGELYLFDERDGRLLM-GQPLV----HDYLK--ANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPN 147 (826) Q Consensus 75 e~~~~~~~~~g~i~v~~~~~g~~~~-~~~~~----~~y~~--a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~ 147 (826) +|+|++|++++++|+|. .++..+. ..+.+ .+|.. .....+|+|+|++|+++|+|++++|+. +.+.... T Consensus 74 ~~~~~le~g~~~~r~~~-~~~~~v~~~~~~~~~~~tp~~~t~~~~l~~i~~tqsad~~~~~~~~~~p~~----L~R~~~~ 148 (594) T protein:vir:10 74 SNDVIVEVGNTNIAVWV-NDVRQVVANTPSEWRNTIDRIQTAYDTIGDDAGAANTGRLIMVHPALQPKR----LYRDNNN 148 (594) T ss_pred CCeEEEEEcCCeEEEEe-cCcEEEEccCCCcccccccceeeccCCccceEEEEEeeEEEEEcCCCCceE----EEEccCC Confidence 89999999999999995 4444333 22222 12221 234678999999999999999998862 1000000 Q ss_pred ccEEEEEcccccCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeeccceEEEeec Q lcl|NC_011107. 148 KAGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNS 227 (826) Q Consensus 148 ~~a~~~v~~g~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga~~~t~~~~ 227 (826) .|.+ T Consensus 149 ~w~~---------------------------------------------------------------------------- 152 (594) T protein:vir:10 149 AWQF---------------------------------------------------------------------------- 152 (594) T ss_pred CceE---------------------------------------------------------------------------- Confidence 0000 Q ss_pred ccccceeeccccceeecccccccccccceEEEecCCceEEEEecCCCCcceEEEEEEeecccccccccccCcccccEeee Q lcl|NC_011107. 228 TKKYPKVDPDANAATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQ 307 (826) Q Consensus 228 ~~~~~~~a~~~~~~t~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~ 307 (826) . T Consensus 153 ----------------------------------------------------~--------------------------- 153 (594) T protein:vir:10 153 ----------------------------------------------------V--------------------------- 153 (594) T ss_pred ----------------------------------------------------E--------------------------- Confidence 0 Q ss_pred eeeeeEeccCCCCcceEEEEEcCCceEEEeecccccccccceeEEEEEecCCCeEEEeccCcCccccCCccccCCccccC Q lcl|NC_011107. 308 FMDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVT 387 (826) Q Consensus 308 ~~~~~~~~~~~~~~~~y~~~~~~~~~w~e~~~~g~~~~~~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g 387 (826) ..+|..+..++.+ . T Consensus 154 ----------------------------------------------------------~~~~~~~p~~~~~--------~ 167 (594) T protein:vir:10 154 ----------------------------------------------------------NMHTGAVPAEWSP--------S 167 (594) T ss_pred ----------------------------------------------------------ecccCcccccccC--------C Confidence 0000000000000 2 Q ss_pred CCccEEEEEcceEEEec----CCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcE Q lcl|NC_011107. 388 RGITGMTTFQGRLVLLS----QEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQ 463 (826) Q Consensus 388 ~~~~~v~~~q~RL~f~~----~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q 463 (826) +||++|+||||||+|++ |++|||||+||||||++++++ .|||||++.+ +++.+.| |+++++++|+|||+++| T Consensus 168 ~~p~~v~f~q~RL~f~~~~~~p~~v~~Srtgd~~nF~~~~~~--~ddd~i~~~~-s~~~~~~-~~v~~~~~L~i~t~~~e 243 (594) T protein:vir:10 168 NYPQTVGIFQNRVWYVGSPVHRTYFWATRAGKLEDIAPSTAN--NPNDPISFVG-IMEGTPC-WIIASSDVLTIGTTIND 243 (594) T ss_pred ccceEEEEEeeeEEEEeCCCCCceEEEEecccccccccCCCC--CCCccEEEEE-ecccceE-EEEecCCceEEEecCce Confidence 47899999999999998 578999999999999999854 7999999955 4565555 55777889999999999 Q ss_pred EEEeCC--ccccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcC- Q lcl|NC_011107. 464 AVVPGG--GIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMP- 540 (826) Q Consensus 464 ~~i~~~--~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~- 540 (826) |+|+++ ++|||+|+.++++|.+ +++.++|+.+|+.++|++++|+ .||||.|+.+.+ .|+++|||+|++|+|+ T Consensus 244 ~~l~~~~~~~lTp~~~~~~~~s~~-g~~~~~P~~vg~~~~fv~~~g~---~vre~~y~~~~d-~y~~~dlt~~a~hl~~~ 318 (594) T protein:vir:10 244 YQLAASTGVSVTAATAILRRSSVQ-GTAAVQGIPAEEQVIFCSRNKS---KVYAMNYVREQD-NWIPDEMSSQAQHLFTP 318 (594) T ss_pred EEEecCCCcccccceEEEEEeeee-ccCCCcceeeCCeEEEEcCCCC---EEEEEEEeeccC-ceeccchhhhhhhhcCc Confidence 999875 5899999999999965 6789999999999999998874 799999997665 4999999999999974 Q ss_pred ------CCeEEEEEcCCCCEEEEEEcCCCeEEEEEEeecCCceeeeeeEeee-cCCcEEEEEEE----CCeEEEEEEeC- Q lcl|NC_011107. 541 ------GPAEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWT-LRHQIIGAYFT----GDNLMVLIQKG- 608 (826) Q Consensus 541 ------~~v~~~~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~-~~g~v~~~~~~----~d~l~~vv~r~- 608 (826) ++|.+||++++|++++||+++||.|.+++|+ +||+|.|||||+ ++|+|++||+| +|++|++|+|. T Consensus 319 ~~~~~~~~i~~~a~~~~p~~~~~~v~~dG~l~~~ty~---~eq~v~aWs~~~~t~G~v~~va~i~~~~~d~l~~~V~R~~ 395 (594) T protein:vir:10 319 ISSAKGASVRRVAYISDAAKSLWVVLENGQINYCCFD---RTTDTKAWTQLELSGGKVIDIAAAFNPDSDYAYVAVVRSK 395 (594) T ss_pred cccccCceEEEEEEecCCceEEEEEeCCCeEEEEEEe---cccceeeeEeeccCCCcEEEEEEeecCCCCEEEEEEEECC Confidence 5799999999999999999999999999997 899999999998 58999999998 78999999994 Q ss_pred -CCEEEEEE-EEeecCcccCCCcccccccceEEEeecccceeccceeeccCCcccceeeEEecCceeeeeecccce---- Q lcl|NC_011107. 609 -QEIALGRM-HLNSLPAREGLQYPKYDYWRRIEATVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKR---- 682 (826) Q Consensus 609 -~~~~~~r~-~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~---- 682 (826) +++..+|+ .+|++......+. ..+++++++.++.. .+.++.|++|+.+.+.++|... T Consensus 396 ti~g~~~~y~~lE~~~~~~~~~~-------~~~~~~d~~~~~~~----------~vsgl~hLeg~tv~v~aDG~~~~~~~ 458 (594) T protein:vir:10 396 AINGVQKNYTVLEKISSPRTDWK-------RADGWVVAQVNQNG----------DVLNLDRYIGRTAVIFSKYGLEAEVE 458 (594) T ss_pred ccccceeeEEEeecCCCcccccc-------ccceeeeecccccc----------eeecccccCCceEEEEeCCeecCCeE Confidence 67877776 4676665433322 23566777666532 2457899999999999998753 Q ss_pred ecCCceEEEecCCCCCceEEEeeeeeEEEEeCCeeEecCCCCceeecceEEEEEEEEeeccceEEEEecCCCCccceeee Q lcl|NC_011107. 683 ETNTKVFLDVPEAVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYD 762 (826) Q Consensus 683 ~~~~~~~~~~~~~~~~~~v~vGl~y~~~~~~~~~~i~~~~g~~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~ 762 (826) +..+..+|+..++.++++|||||+|+++++++||++++++|+.++. |+||+|++|+|++|.+++++.+.......... T Consensus 459 V~~g~itL~~~~~~~~~~v~VGl~Y~s~i~~lp~~~~~~~gs~~g~-r~ri~r~~v~~~~S~g~~vg~~~~~~r~~~~~- 536 (594) T protein:vir:10 459 VNNIGLTHRINGYDPNTVYYVGYKMDSYFRTLTPSNGDMKKSMFGS-KIRISKVQLALFDSIEPTVNGEPADDRSTDDI- 536 (594) T ss_pred EcCCeeEeeccCCCCcceEEEeeeeeEEEEeecccccCCcccccCc-cEEEEEEEEEEEcceeeEECCcccccccchhh- Confidence 3456778888888889999999999999999999999998876554 88999999999999999987553221111111 Q ss_pred ccCccccccccccCccccc--eeEEEEEecccCceeEEEEEECCCCCEEEEEEEEEEEeecc Q lcl|NC_011107. 763 TTPLRLFSRQLNAGEPLVD--SAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQT 822 (826) Q Consensus 763 ~~~~~~~~~~~~~~~~~~~--tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r 822 (826) .+.. .....+.|++. ++.+.++..||+++.+|+|+|++|+|||||||.+|...+.= T Consensus 537 ---~~~~-~~~~~g~~~~~tg~~~v~~~~~G~~~~~~i~I~qd~PlPltvlai~~ev~~~~~ 594 (594) T protein:vir:10 537 ---MDAR-LLDFSSNSGSSNGTRLVDYNPLGWENDGKMVIAVEQPFLCEVVGVFSVVQSNKV 594 (594) T ss_pred ---cccc-CCcccCcccccCCceEEEEccCCcCcccEEEEEECCCcCEEEEEEEEEEEeccC Confidence 1111 22334455544 45566777899999999999999999999999999999998 No 26 >protein:vir:94602 Length: 1012 # NCBI annotation: PfWMP4_35 # Family: family:all:12083 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762665;genbank:gi:115304373;genbank:GeneID:5142302 Probab=99.52 E-value=3.1e-12 Score=83.72 Aligned_cols=757 Identities=14% Similarity=0.093 Sum_probs=317.0 Q ss_pred CCc-----eeeechhhhcc--cccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEEEEcCC Q lcl|NC_011107. 1 MSY-----KQSAYPNLLMG--VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYHTNLGG 73 (826) Q Consensus 1 M~~-----v~~s~~n~~~G--VSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~~~rd~ 73 (826) |-. +++-..+=.+| +|.-|-..-|.+ --...||=.+..|-+.||.|+..+..-..... ..+.+.+.+.--= T Consensus 1 mtqQQ~~eiqG~~t~~F~GL~~s~S~~~IP~~~-SP~~~N~DV~~~G~V~rR~GT~l~~~Y~inn~-s~~~~s~~irt~L 78 (1012) T protein:vir:94 1 MTQQQATEIQGPFTREFSGLDISNSVGAIPVSG-SPVFHNCDVSDDGAVVRRRGTALVNTYNINNA-SGRAWSDTIRTKL 78 (1012) T ss_pred CCccccccccccccccccccccccccccccccC-CCceEEeecccCcceeehhhhhhhhhhccccc-Ccceeeeeehhhc Confidence 321 11111222233 333332222222 12457888999999999999999965442221 1223333332111 Q ss_pred CceEEEEEecCCeEEEEEcCCCE---EEEecCccccccccC-CccceEEEEE---cCEEEEEeCcccC-ccc--ccccCC Q lcl|NC_011107. 74 RSIAMLVAQHRGELYLFDERDGR---LLMGQPLVHDYLKAN-DYRQLRAATV---ADDLFIANLSVKP-EAD--RTDIKG 143 (826) Q Consensus 74 ~e~~~~~~~~~g~i~v~~~~~g~---~~~~~~~~~~y~~a~-~~~~l~~~~v---aD~~fi~n~~~~~-~~~--~~~~~~ 143 (826) ...|+|+...-|-+.+.-..+.+ ...+.. ......++ ++++.-|+-+ -|=+.|+-+++|| |.+ ...... T Consensus 79 G~eYfiLs~~~GLL~~~~~~~~AVG~~K~~a~-V~~ss~~~V~Pssm~F~~~S~~~~R~LILT~~~~~VQ~~F~E~T~s~ 157 (1012) T protein:vir:94 79 GSEYFILSNDVGLLISLMRDDEAVGMPKEVAV-VSKSSIWTVPPSSMCFIPVSAPYDRLLILTPEHPIVQLSFLERTLSF 157 (1012) T ss_pred cceeEEEecCCceEEEeeecccccccchhhhh-hhhhhccccCCcceEEEeccCCCCcEEEEcCCCceEEEEEeeeeeee Confidence 12355666655544443322221 111111 11111122 3444444443 3557777777765 210 000000 Q ss_pred CCCCc--cE-------------EEEEcc--------cccCceeEEEEeeccccceeeeeeEE--EEeecCCCCccccccc Q lcl|NC_011107. 144 VDPNK--AG-------------WLYIKA--------GQYSKAFSMTIKVKDNATGTTYSHTA--TYVTPDNASTNPNLAE 198 (826) Q Consensus 144 ~~~~~--~a-------------~~~v~~--------g~y~~~y~v~i~g~~~s~~tt~~~ta--syttp~g~~t~~~~~~ 198 (826) .-.++ .+ ..|... -.-+++|.+++.-.+=+.-.+.+... +|+..--.-++.|-++ T Consensus 158 T~~t~~~~~V~~~~a~~~~~~~~L~~~~N~sS~~~~~~~~T~~AmT~~NP~~S~~ls~~~V~~qtytltirqi~W~WWAE 237 (1012) T protein:vir:94 158 TCTTNHGGGVFSFTAPISVNDTTLWRDTNASSYIVTDAAGTVYAMTQKNPDFSFRLSGSFVVGQTYTLTIRQITWQWWAE 237 (1012) T ss_pred eccCCccceEeecccceeecCeeEEecccccceeEeeccceEEEEEeeCCceeEEEEEEEecCcccceeehhhhhhhhhh Confidence 00000 01 111100 01134555555433222222211111 1111000000000000 Q ss_pred ----------------cceEEe-cceeeechhee--------------------ee--------------------ccce Q lcl|NC_011107. 199 ----------------APFQTS-VGYIAWQLYGK--------------------FF--------------------GAPE 221 (826) Q Consensus 199 ----------------~~~~~~-~~~ia~~l~~~--------------------~~--------------------ga~~ 221 (826) .+...+ ..+|...|... ++ +... T Consensus 238 Sm~~~G~~~~~~~SRFNV~~~DQ~V~IP~~L~tDiD~v~~~~~~~~l~~~~ss~F~~~~~~~~T~~P~~AD~YG~~~G~~ 317 (1012) T protein:vir:94 238 SMYYEGQDMMQNTSRFNVTSIDQNVKIPDRLITDIDPVYKNSQGLGLFVFWSSRFDSNGWAGPTTSPNTADEYGFSGGGR 317 (1012) T ss_pred hHhhhhhHHHhhhhhcccccccccccchhHHhhhhhhhhhccCCccEEEEEeeeecCceeecCCCCCCCcccccccCCce Confidence 000000 00111111000 00 0000 Q ss_pred ---------EEEeeccccc---ceeeccccc-eee---------cccccccccccceEEEec------------------ Q lcl|NC_011107. 222 ---------YTLPNSTKKY---PKVDPDANA-ATI---------AGYLNQRGVQDGYIAFRG------------------ 261 (826) Q Consensus 222 ---------~t~~~~~~~~---~~~a~~~~~-~t~---------a~~~~~~~~~~g~~~~~~------------------ 261 (826) .++..+.|.+ .+..++..- .++ ..+.+..-..+--++..+ T Consensus 318 ~tpp~~~~~A~L~~aPFF~TFG~~~s~TP~P~~~V~iLR~RELRFN~G~GA~~~~L~V~~D~~~~t~Nnvpfspsnfqt~ 397 (1012) T protein:vir:94 318 FTPPSLVPGATLQAAPFFITFGGIYSGTPTPINQVNILRLRELRFNGGTGAKPDDLQVYNDTVEHTWNNVPFSPSNFQTW 397 (1012) T ss_pred eccccccccceeeccceEEEeccccCCCCCChhheeeeeeeeeeeccCCCCCCcceEEEEcceeeeccccccCcccccce Confidence 0111111110 011111000 000 000000000000011100 Q ss_pred -----CCceEEEEecCCCCcceEEEEEEeecccccccccccCcccccEeeeeeeeeEeccCCCCcceEEEEEcCCceEEE Q lcl|NC_011107. 262 -----DADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVMATGSTKAPVYFEWDSANRRWAE 336 (826) Q Consensus 262 -----~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~w~e 336 (826) +.+-.+..-...|+--.-..-..-..+...||+++|-.-+ . - +...|. .....+|.. T Consensus 398 atT~~~T~R~~~L~~A~G~~~~~A~Y~A~~GATnnlpanaPL~IS----~--------~----sA~s~~--~~~R~v~~~ 459 (1012) T protein:vir:94 398 ATTYTATDRVITLMSAVGDRFNNANYFAILGATNNLPANAPLHIS----C--------L----SASSYL--GGSRRVWYR 459 (1012) T ss_pred eeeeeecceeEEEeeeccccccCcceEEEeecccccccCCccccc----c--------c----cceeee--ccceeeeee Confidence 0000000000001000000011112234456666653211 0 0 000111 111123322 Q ss_pred eecccccccccceeEEEEEecCCCeEE-EeccCcCccccCCccccCCccccCCCccEEEEEcceEEEec----CCeEEEE Q lcl|NC_011107. 337 RAAYGTDWVLKKMPLALRWDEATDTYS-LNELEYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRLVLLS----QEYVCMS 411 (826) Q Consensus 337 ~~~~g~~~~~~t~p~~~~~~~~~~~f~-~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~~~q~RL~f~~----~~~v~~S 411 (826) ... ..-+++- -+.|+ .....|.+-. .++.|.--+.||.||++.+ +..+.+| T Consensus 460 ~~~----T~~~~~~--------G~Y~r~YGiG~~~~Y~------------~~~F~~I~TiY~~RLiL~~~s~~~~~~~~S 515 (1012) T protein:vir:94 460 NLP----TTGGTLD--------GCYVRAYGIGKYVDYS------------KRSFHAIGTIYRDRLILVNPSTATDQLLIS 515 (1012) T ss_pred ccc----cCCceEe--------eeEEEEEEeeeeeecC------------CccccceeeeeeeeeEEeccCCCcceEEEe Confidence 110 0000000 00110 0011132211 1346677789999999997 4569999 Q ss_pred ecCC------cccCc-ccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEEEEEe Q lcl|NC_011107. 412 ASNN------PHRWF-KKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVISITTQ 484 (826) Q Consensus 412 ~~gd------~~nF~-~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~~~s~ 484 (826) .+|| |+||+ .+..+...|.||+++.+++.-.+.|.-++...+.|++||..+-|.+.|++.++|..-.++..|+ T Consensus 516 ~~GD~~~~G~~Y~F~QiTD~L~G~~tDPF~L~VtSe~~e~iT~~~~WQ~~LFV~T~~~T~~~~GGe~~~~s~~~VN~vSt 595 (1012) T protein:vir:94 516 EIGDATVPGEFYQFMQITDMLQGVTTDPFTLNVTSEGRERITAVTGWQKRLFVFTGSNTYSIEGGEQFGESSYAVNLVST 595 (1012) T ss_pred ecCCcccCceeeeeeeeehhhccCcCCceeEEEcccccceeeeeeeeceeEEEEeccceEeeccccccchhHHHHHhHHh Confidence 9877 89998 4556677899999999999888899999999999999999999999999999999999999999 Q ss_pred eccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCC-------eEEEEEcCCCCEEEE Q lcl|NC_011107. 485 YDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGP-------AEYIQAAASSGYLVF 557 (826) Q Consensus 485 ~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~-------v~~~~~s~~p~~~v~ 557 (826) |+.-+.---|+..-.|+|.++-| ++++......+ .|.+-+-|..+.++|.+- ...|.|-++.+.+.. T Consensus 596 ~G~~N~~~VV~T~~~V~Ym~~~G-----~F~L~~k~~~~-~Y~A~ErSvKIR~~F~~~~~ss~~~~~Wl~~~e~~~~LYi 669 (1012) T protein:vir:94 596 YGAFNQNCVVVTNLTVLYMNKFG-----LFDLMNKPNTD-SYGAFERSVKIRGLFQNLAGSSGDNLHWLRYNESSNKLYI 669 (1012) T ss_pred hcccCcceEEEeeeEEEEeeccc-----eeeccCCccCC-cchhhhhhhhhhhhhhhhccccccceeeeeeccCCceEEE Confidence 96666555577888999997644 88888876555 599999999999998751 223333333332211 Q ss_pred --EEcCC----CeEEEEEEeecCCceeeeeeEeeecCCcEEEEEEE----CCeEEEEEEeCCCEEEEEE----------- Q lcl|NC_011107. 558 --GTSTA----DEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFT----GDNLMVLIQKGQEIALGRM----------- 616 (826) Q Consensus 558 --~~~~~----g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~----~d~l~~vv~r~~~~~~~r~----------- 616 (826) ...++ ..+++|-+. . .+|+..+..|.|.---.+ .+...+.|......++-+. T Consensus 670 ~L~~~~dT~~~S~~~~~N~~---~----DSWs~~~s~~~Fq~YP~V~~~~~~t~L~~i~~~~TV~ML~~~~~~YiDFati 742 (1012) T protein:vir:94 670 GLAAEGDTRTTSRNLMLNFT---W----DSWSTLSSAAPFQMYPAVQLFKYMTWLTNINAPLTVAMLATEMPFYIDFATI 742 (1012) T ss_pred EecCCCcchhhhhhhhhhhh---h----cchhhhhccCCcccchhhhhhhhhhhhhhhcCchhhhhhhhccceeeeeehh Confidence 11111 123333332 1 478888776654321111 2222222222222221111 Q ss_pred --------------EEeecCcccCC---CcccccccceEEEeecccc-----eeccce-------eeccCC-cccceeeE Q lcl|NC_011107. 617 --------------HLNSLPAREGL---QYPKYDYWRRIEATVDGEL-----ELTKQH-------WDLIKD-ASAVYQLQ 666 (826) Q Consensus 617 --------------~~~~~~~~~~~---~~~~~d~~~~~~~~~~~~~-----~~~~~~-------~~~~~~-~~~~~~~~ 666 (826) +..-+.+.-.+ ..|..+... +..+.+.. .|+-.. ..+.++ ....+.+. T Consensus 743 rthiypF~~CaG~~~~~Vms~~~GIY~~~~P~tP~I~--~~tit~ss~~~~k~Yq~~T~~~GT~tLt~~~~~~~~~~~l~ 820 (1012) T protein:vir:94 743 RTHIYPFTFCAGQRDVSVMSDSRGIYNLPLPVTPGIL--DYTITASSKAGAKTYQRNTASAGTETLTLRNPMMDYADTLE 820 (1012) T ss_pred cccccceeeeccceeeEEEecCCceEEecccccceee--eeEeeccchhhhheeccccccccceeeeecChhhhcCcEEE Confidence 11111110000 011111000 11111100 011000 000111 01111222 Q ss_pred EecC----ceeeeeeccccee--------cCC----------ceEEEecCCCC-CceEEEeeeeeEEEEeCCeeEecCCC Q lcl|NC_011107. 667 PVAG----AYMERTHLGVKRE--------TNT----------KVFLDVPEAVV-GAVYVVGCEFWSKVEFTPPVLRDHNG 723 (826) Q Consensus 667 ~~~g----~~~~~~~~g~~~~--------~~~----------~~~~~~~~~~~-~~~v~vGl~y~~~~~~~~~~i~~~~g 723 (826) .+.| +.+.........+ .++ .-+|....... ...|.+|.-|.+.+.-+-+.+. T Consensus 821 LL~~~~~~~~~a~V~~~~~~~~TT~~TV~~N~~~~lQ~T~~~GS~L~~~~~LsqN~~~~~G~~Y~S~Y~SP~F~L~---- 896 (1012) T protein:vir:94 821 LLGGNVNASQFAMVMSNGFEPYTTYPTVTYNGVAPLQWTVTGGSGLNNRPILSQNNNCIMGMIYPSVYASPIFDLE---- 896 (1012) T ss_pred EecCCCCccEEEEEeecccccccccceEEecceeeeeEEEecCCccccccccccCceEEEeecchhhhcchhhhhh---- Confidence 2222 1221111000000 011 00111111111 3468899999999886655441 Q ss_pred Cceeecce-EEEEEEEEeecc--ceEEEEecCCCCccceeeeccCcc-ccccccccCccccc------------eeEEEE Q lcl|NC_011107. 724 LPMTSTRA-VLHRYNVNFGWT--GEFLWRISDTARPNQPWYDTTPLR-LFSRQLNAGEPLVD------------SAVVPL 787 (826) Q Consensus 724 ~~~~~gr~-~v~r~~~~~~~t--~~~~v~v~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~------------tg~~~v 787 (826) . -+|| ||.++.|-|.-+ ..++..+..++-...... +... .-++....-.|... --...+ T Consensus 897 S---L~~LKr~K~~~L~~Dttvtsqlkynltsgfsqvsvln--tawvavvsnynenivpavvsyqvgnsyeirrvvelsi 971 (1012) T protein:vir:94 897 S---LGRLKRLKKLHLQMDTTVTSQLKYNLTSGFSQVSVLN--TAWVAVVSNYNENIVPAVVSYQVGNSYEIRRVVELSI 971 (1012) T ss_pred h---hhhhhheeeeeEEeeeeeeeeeeeehhcccceeeeec--ceeeeeeeccCccccceeeeeecCCceeeeEEEEEee Confidence 1 2454 577877776554 445544444332211110 1100 00110000111111 123466 Q ss_pred EecccCceeEEEEEECCCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 788 PARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 788 p~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~rrv 826 (826) |+.|..-+.++.|.+-..-.+.+-+.+++.+=-+-.|-| T Consensus 972 plqgygcdyqfyiasvgaeafklaayefdiqpqrdkryv 1010 (1012) T protein:vir:94 972 PLQGYGCDYQFYIASVGAEAFKLAAYEFDIQPQRDKRYV 1010 (1012) T ss_pred cccccccceeEeeeeccccceeeeeeeeccccchhhhhc Confidence 888888889999999999999999999887654433322 No 27 >protein:vir:80177 Length: 1027 # NCBI annotation: tail tubular protein B # Family: family:all:12083 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285795;genbank:gi:148747829;genbank:GeneID:5220453 Probab=99.25 E-value=3.1e-10 Score=72.75 Aligned_cols=741 Identities=13% Similarity=0.091 Sum_probs=269.8 Q ss_pred CC----ceeee------chhhhcc--cccCChhHhhhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccceeEEE Q lcl|NC_011107. 1 MS----YKQSA------YPNLLMG--VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPRPFLYH 68 (826) Q Consensus 1 M~----~v~~s------~~n~~~G--VSqq~d~~R~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~~~~~~ 68 (826) |- +-+|- ..+=.+| +|.-|=..-|.+ --...|+=.+..|-+.||.|++.+..-..+.+.+. |. T Consensus 1 mvnsferrtQQ~~dlG~~s~~F~GL~~t~S~~~IP~~~-SP~~~N~DV~~~G~V~kR~GT~i~~~Y~~t~~~~t----~~ 75 (1027) T protein:vir:80 1 MVNSFERRTQQGDDLGIRSSNFGGLNTTASPLNIPYED-SPNLLNVDVDVSGNVSKRQGTEILLKYANTTPVYT----FP 75 (1027) T ss_pred CCcchhhhhccccccccccccccccccccccccccccC-CCceEEeecccCcceeehhhhhhhhhhccCCceee----ee Confidence 21 11110 1111222 232222222221 12356888999999999999999976555444333 32 Q ss_pred EEcCCCceEEEEEecCCeEEEEEcCCCE---EEEecCccccccccCCccceE-EEEEcCEEEEEeCcccC-ccc--cccc Q lcl|NC_011107. 69 TNLGGRSIAMLVAQHRGELYLFDERDGR---LLMGQPLVHDYLKANDYRQLR-AATVADDLFIANLSVKP-EAD--RTDI 141 (826) Q Consensus 69 ~~rd~~e~~~~~~~~~g~i~v~~~~~g~---~~~~~~~~~~y~~a~~~~~l~-~~~vaD~~fi~n~~~~~-~~~--~~~~ 141 (826) + |---...|++...-|-+.+.-..+.+ ..... .......++.+.=.- ..-+-|=+.|+-+.+|| |.. .... T Consensus 76 v-ks~LG~dYvLt~~~GLL~~~~~~~~AVG~~K~~s-~V~~aa~~~V~P~F~~~S~~~~R~LILT~~~~~VQ~~F~E~T~ 153 (1027) T protein:vir:80 76 V-KSVLGYDYVLTKSGGLLEVAGVIGKAVGAYKSFS-NVFSAAAANVKPYFTLLSDVEPRVLILTGTNTPVQVKFVEQTF 153 (1027) T ss_pred e-hhhccceeeEecCCceEEEeeecccccccchhhh-hhhhhhhcccCceeEEccCCCCcEEEEcCCCceEEEEEeeeee Confidence 2 11123444666655544443322221 11110 001111112111000 11134556777777665 211 0001 Q ss_pred CCCCCCc--cEEEE-Eccccc--------------------CceeEEEEee-ccccceeeeeeEEEEeecCCCCcccc-- Q lcl|NC_011107. 142 KGVDPNK--AGWLY-IKAGQY--------------------SKAFSMTIKV-KDNATGTTYSHTATYVTPDNASTNPN-- 195 (826) Q Consensus 142 ~~~~~~~--~a~~~-v~~g~y--------------------~~~y~v~i~g-~~~s~~tt~~~tasyttp~g~~t~~~-- 195 (826) .....++ .+++. -.-.+| +++|.+++.- .+= +...++..--+...+.. T Consensus 154 t~T~~s~~~~~V~~~~s~~~~~~~~L~~~~N~tS~~~~~~~~T~~AlT~~NlP~~------S~~mt~~~V~~~W~WWAES 227 (1027) T protein:vir:80 154 TTTSGSPTTTVVIPNASRFQYDTPILYMNRNFTSGATYSYNSTTRALTISNLPSW------SGSMTFDLVLPVWSWWAES 227 (1027) T ss_pred eeeccCCccceEeecccceeecCeeEEecccccceeEeeccceEEEEEeccCCcc------eeEEEEeEEecchhhhhhH Confidence 1111111 11111 111122 2223333221 110 11111111111100000 Q ss_pred ---c---------cccceEE-ecceeeechheeee-------ccceEEEeecccccceeec-cccceeeccccccccccc Q lcl|NC_011107. 196 ---L---------AEAPFQT-SVGYIAWQLYGKFF-------GAPEYTLPNSTKKYPKVDP-DANAATIAGYLNQRGVQD 254 (826) Q Consensus 196 ---~---------~~~~~~~-~~~~ia~~l~~~~~-------ga~~~t~~~~~~~~~~~a~-~~~~~t~a~~~~~~~~~~ 254 (826) . .-.+... +..+|...|...+. +.+-.-.-.+.|.....+. +.+-+.. ..-...+ T Consensus 228 l~~~G~~~~~~~SRFNV~~~DQ~V~IP~~L~sDlD~i~~~~~~~~m~~~~ta~F~~~~~~~~T~~P~~A----D~YG~~~ 303 (1027) T protein:vir:80 228 LRWFGDRFYDAVSRFNVNKADQSVAIPAALRSDLDTIQGTYGRYPMLLYKTATFNDTYTFSNTGQPANA----DSYGWGD 303 (1027) T ss_pred HhhhhhHHHhhhhhcccccccccccchhHHhhhhhhhhhccCCccEEEEEeeeecCceeecCCCCCCCc----ccccccC Confidence 0 0000000 01122222222111 1110111111111110000 0000000 0000011 Q ss_pred ceEEEecCC------------------------ceEEEEe----cCCCC---cceEEEEEEeecccccccccccCc---- Q lcl|NC_011107. 255 GYIAFRGDA------------------------DIHVEVS----TDMGN---NYGIASGGMSLNATADLPALLPGV---- 299 (826) Q Consensus 255 g~~~~~~~~------------------------~~~~~~~----~~~g~---~~~~~~~~~~v~~~~~l~~~~~~~---- 299 (826) |-.|-..++ ...+..- =++|. +..+....+ ..++..+ ++. T Consensus 304 G~~~~~~~~A~L~~sPFF~TFG~~~t~TP~P~~~V~lLR~RELRFN~G~GA~~~~L~V~~D----~~~~s~N-~ssT~~~ 378 (1027) T protein:vir:80 304 GSVYNVGASAYLNTSPFFATFGDTRTPTPQPPETVHLLRQRELRFNYGNGATGANLRVTVD----GTALSAN-YSSTVAG 378 (1027) T ss_pred CceEeecccceeeccceEEEeccccCCCCCchhheeeeeeeeeeeccCCCCCCcceEEEEc----ceeeeee-eeeeeee Confidence 111110000 0000000 00000 000000000 0011100 000 Q ss_pred ccccEeeeeeeeeEeccCCCC--cceEEEEEcCCceEEEeeccccccccc-ceeEEEEEecCCCeEE---Ee---ccCcC Q lcl|NC_011107. 300 GAPGVGVQFMDGAVMATGSTK--APVYFEWDSANRRWAERAAYGTDWVLK-KMPLALRWDEATDTYS---LN---ELEYD 370 (826) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~~--~~~y~~~~~~~~~w~e~~~~g~~~~~~-t~p~~~~~~~~~~~f~---~~---~~~w~ 370 (826) ....+..- +..++--+++ -.||..+-+++----.|.+.-...+.+ +.-........+++.. ++ ...|. T Consensus 379 T~R~~~L~---~A~G~~~~~A~dlayY~A~~GATPL~IS~~aA~t~~~~~R~yi~~~~~~T~~~~~~G~Y~k~YGlG~~~ 455 (1027) T protein:vir:80 379 TNRAYALY---KADGTLCTSASDLAYYIAFTGATPLGISPTAAVTITNVDRTYIGSAATQTDNAYVQGGYFKVYGLGLWA 455 (1027) T ss_pred cceeEEEe---eeccccccccccceeeeeeeccccccccccceeeeecCceeeeeeeccccCCceEeeeEEEEEEeeeee Confidence 00000000 0011111111 124444433221000000000000000 0000000001111110 01 01122 Q ss_pred ccccCCccccCCccccCCCccEEEEEcceEEEec----CCeEEEEecCC------cccCc-ccccccCCCCccEEEEEcC Q lcl|NC_011107. 371 RRGSGDEDTNPTFNFVTRGITGMTTFQGRLVLLS----QEYVCMSASNN------PHRWF-KKSAAALNDDDPIEIAAQG 439 (826) Q Consensus 371 ~r~~gd~~tnp~psf~g~~~~~v~~~q~RL~f~~----~~~v~~S~~gd------~~nF~-~~s~~~~~DdD~i~~~~~~ 439 (826) .- -.++.|.--+.||.||++.+ +..+.+|.+|| ++||+ .+..+...|.||+++.+++ T Consensus 456 ~Y------------~~~~F~~I~TvY~~RLvL~~~t~~~~~~~~S~~GD~~~~G~~Y~F~QvTD~L~G~~sDPF~L~VsS 523 (1027) T protein:vir:80 456 NY------------GTGQFPRIATVYQSRLVLGGFTNDPTRVVFSATGDTVEGGVKYNFFQVTDDLDGLDSDPFDLVVSS 523 (1027) T ss_pred ec------------CCccccceeeeeeeeeEEeccCCCcceEEEeecCCcccCceeeeeeeeehhhccCcCCceeEEEec Confidence 21 12456788899999999997 45699999887 89998 4556677899999999987 Q ss_pred C-CceeEEEEeecCCcEEEEecCcEEEEeCCcc-ccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEe Q lcl|NC_011107. 440 S-LTEPYEHAVTFNKDLIVFAKKYQAVVPGGGI-VTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMA 517 (826) Q Consensus 440 ~-~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~-lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~ 517 (826) . -.+.|.-++...+.|++||..+-|.+.|++. ++|..-.++..|+++.-+.-.-|+....|+|.++-| ++.+. T Consensus 524 sq~~d~vT~~~~WQ~~LFV~T~~~T~~~~GGd~t~~~a~~~VN~iSs~G~~N~~~VV~T~~~V~Yl~~~G-----~F~L~ 598 (1027) T protein:vir:80 524 SQADDYVTGLVEWQSSLFVLTRRATFRANGGDATISPARRFVNYISSLGLVNPFSVVRTDTAVFYLSDSG-----VFNLT 598 (1027) T ss_pred ccccceeeeeeeeceeEEEEecceeEEeecCccccchhHHHHHHHHhhcccCcceEEEeeeEEEEeeccc-----eeecc Confidence 4 4566777888899999999999999999775 999999999999996666556677888999997654 88888 Q ss_pred eccccccccchhHHHHHHHHhcCCC-------eEEEEEcCCCCEEEE-EEc-CC----CeEEEEEEeecCCceeeeeeEe Q lcl|NC_011107. 518 PSPSTDSHYVAEDVTSHIPSYMPGP-------AEYIQAAASSGYLVF-GTS-TA----DEMICHQYLWQGNEKVQNAFHR 584 (826) Q Consensus 518 ~~~~~~~~~~~~dls~~~~~~~~~~-------v~~~~~s~~p~~~v~-~~~-~~----g~l~~~tyl~~~~e~~v~aW~~ 584 (826) ...+.+ .|.+-+-|..+.++|.+- ...|.|-++.+.+.. ..+ ++ ..+++|-+. . .+|+. T Consensus 599 ~r~~~~-~Y~A~EkSiKIR~~F~~~~~ta~~~~~Wm~~~q~~~~LYv~L~~~~eT~~~S~~~~~N~~---~----DSWt~ 670 (1027) T protein:vir:80 599 PRVEDG-EYQAIEKSIKIRKVFGKTTSTAVSSAAWMSFDQNRKVLYVALPRGSETTVASALYVYNTF---R----DSWTQ 670 (1027) T ss_pred CCccCC-cchhhhhhhhhhhhhhhhccccccceeeeeeccCCceEEEEecCCCcchhhhhhhhhhhh---h----cchhh Confidence 876555 599999999999998752 222333333222211 111 11 123343332 2 48888 Q ss_pred eecCCcEEEE---EE----ECCeEEEEEEeCCCEEEEEEEEeecCcccCCCcccccccceEEEe--ecccceeccceeec Q lcl|NC_011107. 585 WTLRHQIIGA---YF----TGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEAT--VDGELELTKQHWDL 655 (826) Q Consensus 585 w~~~g~v~~~---~~----~~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~~~~~d~~~~~~~~--~~~~~~~~~~~~~~ 655 (826) .++.|.|.-- -. ..|...+.|......+|-+.++.+-.+ +..++ - .+..... ..+..+++...|.. T Consensus 671 ~~t~~~Fk~YtghP~V~~~~~~s~L~~v~~~~TV~ML~~~~~~YvD--FF~~C-G--~~~~~Vlt~~~GIY~~~~P~wns 745 (1027) T protein:vir:80 671 YDTLGGFKTYTGHPYVDTVLGDSFLLMVAYGGTVCMLKLYGSRYVD--FFNKC-G--SFTGNVLTANSGIYTWTAPFWNS 745 (1027) T ss_pred hhcccCcccccCCchhhhhhhhhhhhhhcCchhhhhhhhhcchhhh--hhhhc-c--cceeeEEecCCceeEeecccccC Confidence 8877765432 22 245554555555555554443322110 00000 0 0001111 11122222211111 Q ss_pred cCCc-ccceeeEEecCceeeeeecccceecCCceEEEecCCCCCceEEEeeeeeEEEEeCCeeEec--------CCCCce Q lcl|NC_011107. 656 IKDA-SAVYQLQPVAGAYMERTHLGVKRETNTKVFLDVPEAVVGAVYVVGCEFWSKVEFTPPVLRD--------HNGLPM 726 (826) Q Consensus 656 ~~~~-~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~v~vGl~y~~~~~~~~~~i~~--------~~g~~~ 726 (826) +.-. -.+.+...++-+..+...+-.+++......+.|-.+ | .++.+-.|.++. .++++. T Consensus 746 P~I~~~svs~tt~~~~q~Ye~~T~~~vvpydnvedlsiyvn--------G----T~Ls~~~~~~~~~~~i~LL~~~~~~~ 813 (1027) T protein:vir:80 746 PVISNISVSGTTTLAVQRYELPTDLQVVPYDNVEDLSIYVN--------G----TRLSFGTDWVKQGKAIYLLSDPGDGK 813 (1027) T ss_pred CeeeEEEeeccchhhhheeccccccccccccccccceeeec--------c----eeEeecCchhhcCCEEEEecCCCCcc Confidence 1000 001111222223333333333344333333333111 2 123333332221 122111 Q ss_pred eecceEEEEEEEEeeccceEEEEecCCCCccceeeec-cCccccccccccCccccceeEEEEEecccCceeEEE--EEEC Q lcl|NC_011107. 727 TSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDT-TPLRLFSRQLNAGEPLVDSAVVPLPARVDMATSKFE--LSCH 803 (826) Q Consensus 727 ~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~--i~~~ 803 (826) . -..|-|+-+++..-+.... +.....-.++.. .+++ +.+....|.-+..+ |+.. +..+.+- -.+- T Consensus 814 ~--~s~Vprcpvnvsy~~~~~~---~~TT~~TV~~N~~~~iQ-~Tdy~~~GS~L~~~-----~~Lt-N~~~~~G~~Y~S~ 881 (1027) T protein:vir:80 814 T--VSIVPRCPVNVSYQGDVTF---DETTAQTVWVNNLLQIQ-GTDYTLSGSTLTFT-----DTLT-NAVVEVGNAYISY 881 (1027) T ss_pred e--EEEEecccccccccccccc---cccccceEEecceeeec-cceeeeccCccccc-----cccc-cceEEEeecchhh Confidence 1 0134555555433221111 000000011110 0000 00000000000000 1111 0111100 0000 Q ss_pred CCCCEEEEEEEEEEEeecccccC Q lcl|NC_011107. 804 SPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 804 ~P~P~tvl~i~weg~y~~r~rrv 826 (826) ---||=+| --..|.+|| T Consensus 882 Y~SP~F~L------~SL~~LKk~ 898 (1027) T protein:vir:80 882 YQSPMFLL------GSLSNLKKV 898 (1027) T ss_pred hcchhhhh------hhhhhhhhe Confidence 01111111 112233333 No 28 >protein:vir:2625 Length: 715 # NCBI annotation: gp27 # Family: family:all:5234 # MgeID: mge:55 # MgeName: SIO1 # Cross-refs: genbank:acc:NP_064766;genbank:gi:9964636;genbank:GeneID:1263056 Probab=99.10 E-value=2.1e-09 Score=68.17 Aligned_cols=631 Identities=15% Similarity=0.134 Sum_probs=283.4 Q ss_pred CCc--eeeechhhhcccccCChhHhhhc-hhhhhhcceeeccCCcccCCchhhhh-----hhcCCCccccceeEEEEE-c Q lcl|NC_011107. 1 MSY--KQSAYPNLLMGVSQQVPFERLPG-QLSEQINMVSDPVSGLRRRSGIELMA-----HLRHTDQPWPRPFLYHTN-L 71 (826) Q Consensus 1 M~~--v~~s~~n~~~GVSqq~d~~R~~~-q~~~~~N~~~~~~~Gl~rRpGt~~v~-----~~~~~~~~~~~~~~~~~~-r 71 (826) |++ -+-....|++|.=--.-++-||. ..-.-+||...-.|--+||-|+-|-- ....+. +...-.+.+. - T Consensus 1 m~~~~~~~~vNtFv~GliTEas~ltfpqnasiDe~N~~l~rdG~r~RR~g~~~E~~~vls~~~vp~--galv~~~~W~na 78 (715) T protein:vir:26 1 MPQSLTQRTVNTFIKGLITEASELTFPENASVDELNCSLGRDGTRRRRKAVTLEDNHVLSDVVVPE--GALVQTLDWYNV 78 (715) T ss_pred CCcccchhHHhhhhhheeeccccccCCccceeeeeeeeecCCCcchhhccceeecceEEEEEeecC--ceeeeeechhhc Confidence 995 33467789999544444555554 44467899998888888888875432 221111 1112222221 1 Q ss_pred -CCCceEEEEEecCCeEEEEEcCC-----CEEEEecCcccccccc--CCcc-ceEEEEEcCEEEEEeCcccCcccccccC Q lcl|NC_011107. 72 -GGRSIAMLVAQHRGELYLFDERD-----GRLLMGQPLVHDYLKA--NDYR-QLRAATVADDLFIANLSVKPEADRTDIK 142 (826) Q Consensus 72 -d~~e~~~~~~~~~g~i~v~~~~~-----g~~~~~~~~~~~y~~a--~~~~-~l~~~~vaD~~fi~n~~~~~~~~~~~~~ 142 (826) ++....+.+..-.--+++|.+.+ ++++......+-+..- .|.+ .++++.+..++.|+||..-+-......+ T Consensus 79 ~G~v~~~~livqvg~~l~f~q~t~~pLs~~n~~~svdl~~~~~~vn~SPsh~~v~v~~~~G~livanp~i~~~~~~~d~~ 158 (715) T protein:vir:26 79 AGQVNLEFLVVQVNNILYFYEKSTDPLSANKYSGSVDLNTHSASNNLSPSEERVQVTSLNGYLIVASPAINTFYLGFNTS 158 (715) T ss_pred ccccCcEEEEEEeccEEEEEeccCCccccCceeEEeeecceecccccccceeEEEEEEeeeEEEEecCCccEEEEEecCC Confidence 22233333333222467775544 2333222222222211 2333 5788888999999999876643211111 Q ss_pred CCCCCccEEEEEcccccCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeeccceE Q lcl|NC_011107. 143 GVDPNKAGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEY 222 (826) Q Consensus 143 ~~~~~~~a~~~v~~g~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga~~~ 222 (826) ...-.+.. +.+| +..+..+| +.+..+..|.+...+.+ .+.+ +.+.++ +| T Consensus 159 t~s~t~~~-ll~r------~r~f~~qg------~d~~~g~~y~~~gt~~t------------n~~i-ynlyN~-----gw 207 (715) T protein:vir:26 159 TEAFTATS-ISFK------ERDFEWQG------SDVDVTSLYFGEGTSVS------------NQRI-YDTYNV-----GW 207 (715) T ss_pred cceeEeeE-EEEE------eeeheeec------cccccccccccCCcccC------------chhh-eecccc-----ee Confidence 11100000 1111 11111122 22222222322111111 1111 112111 11 Q ss_pred EEeecccccceeeccccceeecccccccccccceEEEecCCceEEEEecCCCCcceEEEEEEeecccccccccccCcccc Q lcl|NC_011107. 223 TLPNSTKKYPKVDPDANAATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAP 302 (826) Q Consensus 223 t~~~~~~~~~~~a~~~~~~t~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~ 302 (826) ..+..+ ...+..+-|+ +...... .+.+...+-..+.+ T Consensus 208 ~~p~gt-------------------~~~N~~~~yi-Vypa~s~------------~~~S~kd~n~afsk----------- 244 (715) T protein:vir:26 208 VGPKGS-------------------AALNTYGSYI-VYPALTH------------PWYSGKDANGAFNK----------- 244 (715) T ss_pred ecceeE-------------------EEEcCCCCce-Eeccccc------------ccCCCcccccccCh----------- Confidence 111000 0000111111 0000000 00000000000000 Q ss_pred cEeeeeeeeeEeccCCCCcceEEEEEcCCceEEEeecccccccccceeEEEEEecCCCeEEEeccCcCccccCCccccCC Q lcl|NC_011107. 303 GVGVQFMDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPT 382 (826) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~w~e~~~~g~~~~~~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~ 382 (826) .-|.|. -+|..+- +.|.|.+.. ..+...+-+. .. T Consensus 245 -----------------------------~ad~ei-----~tGt~~~--------~~G~yi~D~--~~~g~~~lee--ev 278 (715) T protein:vir:26 245 -----------------------------ADWLEI-----YTGSSLA--------SNGHYVLDV--FNKARTGLTT--EV 278 (715) T ss_pred -----------------------------hhcccc-----ccccccc--------cCceEEEee--eecCCccchh--hh Confidence 001110 0010000 111111000 0000000000 00 Q ss_pred ccccCCCccEEEEEcceEEEec------CCeEEEEecCC--------cccCcccccc--cCCCCccEEEEEcCCCceeEE Q lcl|NC_011107. 383 FNFVTRGITGMTTFQGRLVLLS------QEYVCMSASNN--------PHRWFKKSAA--ALNDDDPIEIAAQGSLTEPYE 446 (826) Q Consensus 383 psf~g~~~~~v~~~q~RL~f~~------~~~v~~S~~gd--------~~nF~~~s~~--~~~DdD~i~~~~~~~~~~~i~ 446 (826) .-+.+.+++.|.+|.+|++ +..|.+||.=+ |.+=+|++.. .+.|.|..-+.+-+.. .|. T Consensus 279 ---~k~R~rsv~~yaGrV~yagiD~dkng~rilfSqLv~s~~di~nCyQd~DPTsee~~dLidTDGg~iri~gah--~ii 353 (715) T protein:vir:26 279 ---ETGRFRSVAAYAGRVFYAGIDSAKNGGKVYFSRLTERMSDVGNCYQVNDPTSEVLSDLLDTDGGVVRIPDAH--NIR 353 (715) T ss_pred ---hcCCCcceeeecceEEEeecccccCCCeEEEehhhcchhhcccccccCCCchhhhhhhhhcCCCEEEecCCC--Cce Confidence 0245677999999999994 45799998432 5555555432 4678899888886553 356 Q ss_pred EEeecCCcEEEEecCcEEEEeC-CccccccceEEEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccc Q lcl|NC_011107. 447 HAVTFNKDLIVFAKKYQAVVPG-GGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSH 525 (826) Q Consensus 447 ~~v~~~~~L~l~t~~~q~~i~~-~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~ 525 (826) -|+.|+..|+||...+-|+|.| +...|.++..+...++.+|++.=.=+++|+.++|-+++| |..+.-+ +..+- T Consensus 354 ~Lv~f~~sLlvf~~NGVWAi~G~d~g~tATdY~ltKIs~vg~sspnSvVvv~~~i~~WsdtG-----Iyal~~N-d~fn~ 427 (715) T protein:vir:26 354 KLHVLGASLLVFAENGVWAVAGVDNVFRATEYAITRISDVGLSNENSFVVADGIPIWWGKTG-----IYAVQQS-ENLNT 427 (715) T ss_pred eEEEecceEEEEEecceEEEeccCCceeeeeeEEEEeeeeccCCCccEEEecceEEEeeCCc-----EEEEEec-cccCc Confidence 6899999999999999999976 468999999999999999998777799999999998876 7767665 33455 Q ss_pred cchhHHH-HHHHHhcCC----CeEEE--EEcCCCCEEEEEEcCCCeEEEEEEeecCC----ceeeeeeEeeecC---Cc- Q lcl|NC_011107. 526 YVAEDVT-SHIPSYMPG----PAEYI--QAAASSGYLVFGTSTADEMICHQYLWQGN----EKVQNAFHRWTLR---HQ- 590 (826) Q Consensus 526 ~~~~dls-~~~~~~~~~----~v~~~--~~s~~p~~~v~~~~~~g~l~~~tyl~~~~----e~~v~aW~~w~~~---g~- 590 (826) +.++.|| ..+.+|.+. .+... .|-.-++.+.|+..+..++.=|+|- + +-...|+-+|..+ |. T Consensus 428 ~tAqNLTekTIq~~~~~I~~dk~knVtg~fd~~e~rVyW~yPn~dt~vdykyd---~vLV~dLalgaFYp~~v~~~a~~~ 504 (715) T protein:vir:26 428 PTAQNLSLSTIQTLWNNISNAKKAQVTVEYDKINQRVFWFYPDNDESVDYKYN---NILVMDLALQAFYPWRVEDEASST 504 (715) T ss_pred chhhccchHHHHHHHhhcchhhhcceEEEEEccCCEEEEEEcCCceeeceeec---CeEEEEeccccccccccccccccc Confidence 8899999 788887654 22222 4456678899988776677766661 1 1122466666432 22 Q ss_pred ---EEEEE------------------EE---CCeEEEEEEeC-CCEEEEEEEEeecCccc------CCCcccccccceEE Q lcl|NC_011107. 591 ---IIGAY------------------FT---GDNLMVLIQKG-QEIALGRMHLNSLPARE------GLQYPKYDYWRRIE 639 (826) Q Consensus 591 ---v~~~~------------------~~---~d~l~~vv~r~-~~~~~~r~~~~~~~~~~------~~~~~~~d~~~~~~ 639 (826) |.++. +. ++.+..+.-|. ..+-.+-..+++....- +.+..++|+-. T Consensus 505 ~~~ig~~~~~~~~~~~t~~~vv~~~~v~~~g~~~~v~~~~r~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~dw~s--- 581 (715) T protein:vir:26 505 SYIIGTSYYGGLGSTSTETQVVNGADVVVNGSDNVVATLYRDYLEGDSEIKLLVRDGTTGKMTFATFRGDTYLDWGS--- 581 (715) T ss_pred ceeeeeeeeCCcccccchhheeccceEEEeccceEEEEeecccccccceEEEEEEcCCceeEEEecccCceeeeccc--- Confidence 11110 00 11111111221 11112222222211110 11111222110 Q ss_pred EeecccceeccceeeccCCcccceeeEEecCceeeeeecccceecCCceEE--EecCCCCCceEEEeeeeeEEEEeCCee Q lcl|NC_011107. 640 ATVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKRETNTKVFL--DVPEAVVGAVYVVGCEFWSKVEFTPPV 717 (826) Q Consensus 640 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~--~~~~~~~~~~v~vGl~y~~~~~~~~~~ 717 (826) .+....-...-+.++..+.. .....++. +...+ + -|.=|-.| +|+.+ T Consensus 582 -----------------~d~~~~~~~gy~~~gd~~~~------k~~pyvt~~~~~ted--g-~v~~~~g~----~p~n~- 630 (715) T protein:vir:26 582 -----------------ADYKSFAEAGYDFMGDITTF------KNAPYVTTYMRVTED--G-YVASGAGY----EFINP- 630 (715) T ss_pred -----------------cchhhHHHhhhhhcccceee------ecCceEEEEEEEecc--c-ceeccCCc----cccCC- Confidence 00000000000111111000 00000010 10000 0 00000000 11110 Q ss_pred EecCCCCceeecceEEEEEEEEeeccceEEEEecCCCCccceeeeccCccccccccccCc-cccceeEEEEEecccCcee Q lcl|NC_011107. 718 LRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGE-PLVDSAVVPLPARVDMATS 796 (826) Q Consensus 718 i~~~~g~~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~tg~~~vp~~~~~~~~ 796 (826) .--+-.+..+..+++. ..+..|.....++.-++....+ -|-.+-+-+..++|..+-. T Consensus 631 ------------sSclm~~sw~ws~s~s----------t~~eaYk~~~~~~~~p~~~s~~~yp~~~VvTKsriRG~Gr~~ 688 (715) T protein:vir:26 631 ------------SSCLMSVSWNLSKSGS----------TPREIYKLKDVPVVNPNDLSSINYPTDTVVTKSKVRGRGRSM 688 (715) T ss_pred ------------cceEEEEEeeeccCCC----------ChhhhheecceeeeCCCccccccCCcceeEeeeeeeccceEE Confidence 0011112222222221 1112222222222111111110 1112222234577888999 Q ss_pred EEEEEECCCCCEEEEEEEEEEEeeccc Q lcl|NC_011107. 797 KFELSCHSPYDMNVRAVEYNFKSNQTY 823 (826) Q Consensus 797 ~v~i~~~~P~P~tvl~i~weg~y~~r~ 823 (826) +++|.+...-.|+|++.+.-|--|+.+ T Consensus 689 ~~rf~s~~gKdlhl~Gysilg~~~~~~ 715 (715) T protein:vir:26 689 KFRFESVAGKDFHLVGYEVIGAKNNSY 715 (715) T ss_pred EEEEEecCCcceEEEeEEEEecccCCC Confidence 999999999999999999999988888 No 29 >protein:vir:8837 Length: 513 # NCBI annotation: constituent protein # Family: family:all:4957 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775245;genbank:gi:27476043;genbank:GeneID:2700591 Probab=98.01 E-value=7.1e-06 Score=48.83 Aligned_cols=475 Identities=10% Similarity=0.023 Sum_probs=184.8 Q ss_pred eeechheeeeccceEEEeecccccceeeccccceeeccc-ccccccccceEEEecCCceEEEEecCCCCcceEEEEEEee Q lcl|NC_011107. 208 IAWQLYGKFFGAPEYTLPNSTKKYPKVDPDANAATIAGY-LNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSL 286 (826) Q Consensus 208 ia~~l~~~~~ga~~~t~~~~~~~~~~~a~~~~~~t~a~~-~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v 286 (826) ++-.... .-+....+.-.+|...-...... .|.....+.+ ...++-..+. T Consensus 1 ~~~~~~~---------~~~~~g~~~d~~p~~lp~~a~s~~~N~~~~~~~~--~~~~g~~pv~------------------ 51 (513) T protein:vir:88 1 MALERQE---------VKNPTGIVTDIAPADLPLDKWSFGNNVRFKNGKA--QKALGHSPIF------------------ 51 (513) T ss_pred CCcCChh---------hcccccceeccChhhcCCCcceeeeeeeEeccee--eecCccceee------------------ Confidence 1111100 00011111111111000000000 1111111111 1110000000 Q ss_pred cccccccccccCcccccEeeeeeeeeEeccCCCCcceEEEEEcCCceEEEeec------ccccccccceeEEEEEecCCC Q lcl|NC_011107. 287 NATADLPALLPGVGAPGVGVQFMDGAVMATGSTKAPVYFEWDSANRRWAERAA------YGTDWVLKKMPLALRWDEATD 360 (826) Q Consensus 287 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~w~e~~~------~g~~~~~~t~p~~~~~~~~~~ 360 (826) +.|+..+... ..+...+.. +..- .+ ...|.++++.+ |.-..+ ....|.+.++--.++....+. T Consensus 52 ---a~~~~~~~g~--~~~~~~g~~-~~~~--~~-~~~~~~~~~~t--~~dvs~~~~~~~~~~~w~~~~f~~~i~a~ng~~ 120 (513) T protein:vir:88 52 ---DTAQAPILDM--FPFIRNNIP-YWLL--CS-EKRLYLADGTT--IIDVSPGPYSASVTNRWSVGSFNGVIFANDGVN 120 (513) T ss_pred ---ecCCCCceee--eeeecCCCe-EEEE--ee-ceEEEEecCce--eeeccccceeecccCceeeeeecCEEEEEcCCC Confidence 0111110000 000000000 0000 00 11233333222 221111 001111111111111111111 Q ss_pred eEEEeccCcCc-cccCCccccCCccccCCCccEEEEEcceEEEec--------CCeEEEEecCCccc----CcccccccC Q lcl|NC_011107. 361 TYSLNELEYDR-RGSGDEDTNPTFNFVTRGITGMTTFQGRLVLLS--------QEYVCMSASNNPHR----WFKKSAAAL 427 (826) Q Consensus 361 ~f~~~~~~w~~-r~~gd~~tnp~psf~g~~~~~v~~~q~RL~f~~--------~~~v~~S~~gd~~n----F~~~s~~~~ 427 (826) .... |.. ...=.++.+ +|+ ...-..|.+|++||++++ |+.|+.|..+|... |..+. .. T Consensus 121 --~~q~--~~~~s~~f~dl~g-~p~--~~~a~~i~v~~~flv~~~~t~~~~~~PnrV~wS~~~D~~~~P~~W~~t~--~t 191 (513) T protein:vir:88 121 --PPHH--LPPTESVFRVLPN-FPA--NTTFRRLKSFKNFLIGLNVTSNSIEMPQMVWWSTSADAGGVPASWDPTD--PT 191 (513) T ss_pred --cceE--EcCCCceeeeccC-CCc--ccceEEEEEEeeEEEEeecccCcCCCCceEEEecccCCccccccccccc--cc Confidence 1111 111 000112221 111 114567889999999974 67899999999643 42221 11 Q ss_pred CCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEe-CCccccccceEEEEEEeeccc-cCCCcEEeCCeEEEEec Q lcl|NC_011107. 428 NDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVP-GGGIVTPRTAVISITTQYDLD-TRAAPAVTGRSVYFAAE 505 (826) Q Consensus 428 ~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~-~~~~lTP~~~~~~~~s~~~~~-~~~~Pv~vg~~v~f~~~ 505 (826) .+.+=.++ .+....|...++....|+||++.+-|.++ .++ |....++....-.|. +.-.=+.+|+.+||+++ T Consensus 192 ~~a~~~~l---~d~~g~~v~g~~~g~~liif~e~~i~~m~y~g~---~~if~~~~i~~~~G~~~p~SI~~~~~~~ffls~ 265 (513) T protein:vir:88 192 KDAGQNTL---ADTNGAIVDGVKLRDSFIIYKEDSVYSMRYIGG---LYIFQFQQLFNDVGILGPNCAIEFDGNHFVVGH 265 (513) T ss_pred Cccccccc---CCCccceeeeeecccceEEEecccEEEEEecCC---CceEEEEeecccccccCCceeEEECCeEEEEeC Confidence 22222222 34445566678888899999999999996 322 334455444433333 22233679999999988 Q ss_pred CCCceeEEEEEeeccccccccchhHHHHHHHHhcCC-----CeEEEEEcC-CCC-EEEEEEcC-C-------CeEEEEEE Q lcl|NC_011107. 506 RALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPG-----PAEYIQAAA-SSG-YLVFGTST-A-------DEMICHQY 570 (826) Q Consensus 506 ~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~-----~v~~~~~s~-~p~-~~v~~~~~-~-------g~l~~~ty 570 (826) +| ++.+ ...+ .+.- ....+++.|-. ....+...- +.+ .+.|+-.+ + .++++|-| T Consensus 266 ~G-----f~~~--~G~~---~~~I-g~ekVdk~f~~~~n~~~~~~~~~~~d~~~~~v~~~y~s~~~~~~~~~~~~lVYd~ 334 (513) T protein:vir:88 266 GD-----VYVH--NGVQ---KQSV-IDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSEPGKHCDRAIIWNW 334 (513) T ss_pred Cc-----eEEe--cCce---eeec-ccchhhhhhhccCCcccceEEEEEEcCcccEEEEEecCCCCCCCcccceEEEEEc Confidence 75 5422 2211 1110 11234443322 222222222 223 33343111 1 35677777 Q ss_pred eecCCceeeeeeEeeecCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEeecCcccCCCcccccccceEEEeecccceecc Q lcl|NC_011107. 571 LWQGNEKVQNAFHRWTLRHQIIGAYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEATVDGELELTK 650 (826) Q Consensus 571 l~~~~e~~v~aW~~w~~~g~v~~~~~~~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 650 (826) + + +.|+.-+.+..+-.+..+-+.+........... .+...+++ ..+.+-. T Consensus 335 ~----~---~~Ws~~~~p~~~~g~~g~~~~~~~~~~~~~~~~--------------~d~~~~~~--------~~~~~~~- 384 (513) T protein:vir:88 335 K----E---NTWSIRDLPNVLSGAYGIIDPKTSNLWDDDSNP--------------WDTDTSVW--------GEGSYNP- 384 (513) T ss_pred c----C---CeEEEEeccchhhcccccccccccceecccccc--------------cccchhhh--------hcccccc- Confidence 4 2 257655555443322211111111110000000 00000000 0000000 Q ss_pred ceeeccCCcccceeeEEecCceeeeeecccceecCCceEEEecCCCCCceEEEeeeeeEEEEeCCeeEecCCCCceeecc Q lcl|NC_011107. 651 QHWDLIKDASAVYQLQPVAGAYMERTHLGVKRETNTKVFLDVPEAVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTR 730 (826) Q Consensus 651 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~v~vGl~y~~~~~~~~~~i~~~~g~~~~~gr 730 (826) .. ..+. .....+..+. .+... .-.-|-++++.++...+.+. ++ ++ T Consensus 385 -----~~--~sl~-~~~~~~~~~~--------------~fd~~------~~f~G~~lea~~~t~~~~~~--~~-----~~ 429 (513) T protein:vir:88 385 -----AK--SSMI-FTSFQDAKLF--------------LFGET------STFSGQSFTSTLERSDIYLG--DD-----RM 429 (513) T ss_pred -----cc--ceeE-eeeccCCcee--------------eeccc------ccccCCceEEEEEecCcccc--Cc-----hh Confidence 00 0000 0011111111 01100 01458888999888766542 11 23 Q ss_pred -eEEEEEEEEeeccceEEEEecCCCCccceeeeccCccccccccccCccccceeEEEEEecccCceeEEEEEECCCCCEE Q lcl|NC_011107. 731 -AVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLVDSAVVPLPARVDMATSKFELSCHSPYDMN 809 (826) Q Consensus 731 -~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~t 809 (826) ++|+++...+...|.+.+.++........ ....... .....+.-.++.+...+..+++|+...--|++ T Consensus 430 ~~~i~~v~~~~t~~g~~t~~vg~~~~~~~~------~~~s~~~-----~~~~~~~~~~~~r~~gRy~~~ri~i~~~~~w~ 498 (513) T protein:vir:88 430 MKTVSAVIPHITGNGVCNIWVGNAQVQGSG------IRWKGPY-----PYRIGQDYKIDTKHVGRYIALKFDFASAGDWY 498 (513) T ss_pred heeeeeeeeeeecceEEEEEEeeeccCccc------cccccce-----eeecccCceEEeccCCceEEEEEEccCCCceE Confidence 36778777787788777776554332221 1111110 00112234466677778888888888899999 Q ss_pred EEEEEEEEEeeccccc Q lcl|NC_011107. 810 VRAVEYNFKSNQTYRR 825 (826) Q Consensus 810 vl~i~weg~y~~r~rr 825 (826) +.++++|..--. .|| T Consensus 499 ~~G~~ve~~~~~-g~R 513 (513) T protein:vir:88 499 FNGYTLEMAPKA-GMR 513 (513) T ss_pred EeeEEEEEecCC-CCC Confidence 999999887521 333 No 30 >protein:vir:95475 Length: 771 # NCBI annotation: hypothetical protein ORF038 # Family: family:all:5234 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294631;genbank:gi:149408197;genbank:GeneID:5237042 Probab=97.13 E-value=0.00016 Score=41.43 Aligned_cols=660 Identities=13% Similarity=0.093 Sum_probs=269.9 Q ss_pred CCce--eeechhhhcccccCChhHhhhc-hhhhhhcceeeccCCcccCCchhhhh-------hhcCCCccccceeEEEEE Q lcl|NC_011107. 1 MSYK--QSAYPNLLMGVSQQVPFERLPG-QLSEQINMVSDPVSGLRRRSGIELMA-------HLRHTDQPWPRPFLYHTN 70 (826) Q Consensus 1 M~~v--~~s~~n~~~GVSqq~d~~R~~~-q~~~~~N~~~~~~~Gl~rRpGt~~v~-------~~~~~~~~~~~~~~~~~~ 70 (826) |++- +-....|++|.=--.-++-||. ..-.-+||...-.|--+||-|+-|-- ....+.........+.+. T Consensus 1 m~~~~~~~~vNtFv~GliTEas~ltfpqnasiDe~N~~l~rdG~r~RR~g~~~E~~~~~vls~~~vpa~g~~~v~~~~W~ 80 (771) T protein:vir:95 1 MAKTTNAAEFNTFVGGLITEASPLTFPQNASIDEVNFILNRDGSRNRRNGMDFENGATKVVCNTLVPADGTIAVTSHNWE 80 (771) T ss_pred CCcccchhHHhhhhhheeeccccccCCccceeeeeeeeecCCCcchhhceeeeecCCceEEEEEEecccceEEeeeechh Confidence 9953 3467789999444444555554 44467899999888888888875542 111111111111122211 Q ss_pred -c-CCCceEEEEEecCCeEEEEEcCCCEEEEecCcccccc--ccCCccceEEEEEcCEEEEEeCcccCcccccccCCCCC Q lcl|NC_011107. 71 -L-GGRSIAMLVAQHRGELYLFDERDGRLLMGQPLVHDYL--KANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDP 146 (826) Q Consensus 71 -r-d~~e~~~~~~~~~g~i~v~~~~~g~~~~~~~~~~~y~--~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~ 146 (826) - ++....+.+..-.--+++|.+.+-.+. .+.. +-+. .-.|.+.|++..+..++.|+||..-+-......+...- T Consensus 81 na~G~v~~~~livqvg~~l~f~q~t~~pLs-~~n~-~~~a~~nlSPsh~isv~v~~G~livanp~i~~~~~~~d~~t~s~ 158 (771) T protein:vir:95 81 NAGGEVGRWISLVQVGTELKFFQTTGETLS-EGNF-YNYQFVNMSPSHKLSYAVVDGLLVVANGSRDIYVFEYDSGSVSV 158 (771) T ss_pred hcccccCcEEEEEEeccEEEEEecCCCccc-ccce-eeeecceeccceeEEEEEeeeEEEEecCCccEEEEEecCCccee Confidence 1 222333333332224677654332221 1111 1111 11355568888889999999998766422111111110 Q ss_pred CccEEEEEcccccCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeeccceEEEee Q lcl|NC_011107. 147 NKAGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPN 226 (826) Q Consensus 147 ~~~a~~~v~~g~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga~~~t~~~ 226 (826) .+-. +.+|.- +. ..+...++.+..+-.|.+...+.+ .+.+ +.+.++ +|..+. T Consensus 159 t~~~-ll~r~r-f~--------~q~~~~G~d~~~~~~~~~~gt~~t------------n~~i-ynlyN~-----gw~~pk 210 (771) T protein:vir:95 159 TTKR-LLVRDL-FG--------VQDIVNGVDLRQGNDIATRPTVQT------------NAHI-YNLRNQ-----TFGVPR 210 (771) T ss_pred Eeee-eeeeeh-hh--------ccccccccceecccccccCCcccC------------chhh-eecccc-----ceeccc Confidence 0000 111110 00 000011111222222222111111 1111 111111 111111 Q ss_pred cccccceeeccccceeecccccccccccceEEEecCCceEEEEecCCCCcceEEEEEEeec-ccccccccccCcccccEe Q lcl|NC_011107. 227 STKKYPKVDPDANAATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLN-ATADLPALLPGVGAPGVG 305 (826) Q Consensus 227 ~~~~~~~~a~~~~~~t~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~-~~~~l~~~~~~~~~~~~~ 305 (826) .... +... ..... ..|+..... +.+....++ ++.+-....+.. .+.-. T Consensus 211 ~~~~-------snt~-------~~~iV--~~y~a~~g~--------------~pS~sd~~N~a~~k~~~~Ei~t-~~~f~ 259 (771) T protein:vir:95 211 VTWH-------SNEP-------SDPIV--TFRSAASGK--------------FPSNSDSVNLALSKRADVEPST-TDRFR 259 (771) T ss_pred cccc-------cCCc-------cccce--EeeeccCCC--------------CcCCceeeccccchhhccceee-ecccc Confidence 1100 0000 00000 000000000 000000000 000000000000 00000 Q ss_pred eeeeeeeEeccCCCCcceEEE--EEcCCceEEEeecccccccccceeEEEEEecCCCeEEEeccCcCccccCCccccCCc Q lcl|NC_011107. 306 VQFMDGAVMATGSTKAPVYFE--WDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTF 383 (826) Q Consensus 306 ~~~~~~~~~~~~~~~~~~y~~--~~~~~~~w~e~~~~g~~~~~~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~p 383 (826) .....++..-+...+-..|+- |..+.+.-.|++ +.+.|+| T Consensus 260 ~~~~~~~~~Gt~~~~~G~yi~da~~~g~~~Lt~~v--------------------------------------e~~gr~~ 301 (771) T protein:vir:95 260 AEDIVLNPIGTYETARGFFIIDAMARGKSRLEEIV--------------------------------------KLKQRYP 301 (771) T ss_pred hhhhhhcccCcccccCcceeeehhhhcccccceee--------------------------------------eccccch Confidence 000011100111111111110 000000000000 0011222 Q ss_pred ccc-----------CCCccEEEEEcceEEEec------------C---CeEEEEecCC--------cccCcccccc--cC Q lcl|NC_011107. 384 NFV-----------TRGITGMTTFQGRLVLLS------------Q---EYVCMSASNN--------PHRWFKKSAA--AL 427 (826) Q Consensus 384 sf~-----------g~~~~~v~~~q~RL~f~~------------~---~~v~~S~~gd--------~~nF~~~s~~--~~ 427 (826) +-. -+..+.|+=|-.|.|+++ | ..|.+||.=+ |.+=+|++.. .+ T Consensus 302 s~~~~~~~l~~~~t~~~~~~vaeyagRvwYag~~~~~iD~dkng~~~~~~ilfSqLv~s~~di~nCyQd~DPTsee~~dL 381 (771) T protein:vir:95 302 SLSFGVSSLPQDETPGGASVVCEYAGRVWYAGFSGQIIDGDDQSPRLVSYILFSQLVDSPADIVNCYQDGDPTSTEEPEL 381 (771) T ss_pred hhhccccccccccCCCCceeEEeeeeeEEEecceeEEeeccccCCceeeeEeeehhhcchhhcccccccCCCchhhhhhh Confidence 110 023456888889988876 1 1488898432 5555555432 46 Q ss_pred CCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeC--CccccccceEEEEEEeeccccCCCcEEeCCeEEEEec Q lcl|NC_011107. 428 NDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPG--GGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAE 505 (826) Q Consensus 428 ~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~--~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~ 505 (826) .|.|..-+.+-+.. .|.-|+.|+..|+||...+-|+|.+ +...|.++..+...++.+|++.=.=+++|+.++|-++ T Consensus 382 idTDGg~iri~gah--~ii~Lv~f~~sLlvfc~NGVWAi~ggsd~g~tAtdY~ltKIs~vg~sspnSvVvvg~~i~ywsd 459 (771) T protein:vir:95 382 VDTDGGFIRIEGAH--DIINLVNVGSAVMVVAANGIWMIQGGSDYGFTATNYLVTKISEHGCSSPNSVVVVDNSFMYWGD 459 (771) T ss_pred hhcCCCEEEecCCC--CceeEEEecceEEEEEecceEEEEeccCCceeeeeeEEEEeeeeccCCCccEEEecceEEEeeC Confidence 78899988886553 3566899999999999999999954 4589999999999999999987777999999999988 Q ss_pred CCCceeEEEEEeeccccccccchhHHH-HHHHHhcCC----CeEEE--EEcCCCCEEEEEEcC--CCe---E--EEEEEe Q lcl|NC_011107. 506 RALGFMGLHEMAPSPSTDSHYVAEDVT-SHIPSYMPG----PAEYI--QAAASSGYLVFGTST--ADE---M--ICHQYL 571 (826) Q Consensus 506 ~g~~~~~v~e~~~~~~~~~~~~~~dls-~~~~~~~~~----~v~~~--~~s~~p~~~v~~~~~--~g~---l--~~~tyl 571 (826) +| |..+.-++ .+-+.++.|| ..+.+|.+. .+... .|-.-++.+.|+..+ |++ + ++|. T Consensus 460 tg-----Iyal~~Nd--fn~~tAqnLTekTIq~~~~~I~~dk~knVtg~fd~~e~rvyw~yPn~~D~~~e~~t~LV~d-- 530 (771) T protein:vir:95 460 DG-----IYHLTRNQ--YGDYVANNLTEKTIQKYYEKIPSDAILNATGFYDSYDKKVKWLYNTVLDGRTEPVTELVFD-- 530 (771) T ss_pred Cc-----eEEEeecc--cCcchhhccchHHHHHHHhhcchhhhcceEEEEEccCCEEEEEecceecCCCcceeeeeee-- Confidence 76 77676664 4458899999 788887654 22222 344556677776442 111 1 2222 Q ss_pred ecCCceeeeeeEee---e-cCCcE----EEEE---------------------EECCeEE--EEEEeCCCEEEEEEEEee Q lcl|NC_011107. 572 WQGNEKVQNAFHRW---T-LRHQI----IGAY---------------------FTGDNLM--VLIQKGQEIALGRMHLNS 620 (826) Q Consensus 572 ~~~~e~~v~aW~~w---~-~~g~v----~~~~---------------------~~~d~l~--~vv~r~~~~~~~r~~~~~ 620 (826) -...|+-+| + .+|.. .++. .-+|.+- ..+|-..-+-++.+.+++ T Consensus 531 -----LalgaFYp~~i~~~~ag~l~~~vg~~~~p~~~lv~T~~eV~v~~~~v~~tG~~vtV~~~~r~~~~~~~~y~~~~~ 605 (771) T protein:vir:95 531 -----LALGAFYPSKIGSLTAGRLPIPVGSVKIPPYKLVETGEEVTVASEQVTATGELVTVKVSTRSPVIRETKYIIVEK 605 (771) T ss_pred -----ecccccccccccccccCccceeeeeeecCccccccccceEEecceeeEecCCceEEEEEEeeccccceEEEEEEe Confidence 223466666 3 23321 1110 0011111 111111112222222221 Q ss_pred cCcc------cCCCcccccccceEEEeecccceeccceeeccCCcccceeeEEecCceeeeeecccceecCCceEEEecC Q lcl|NC_011107. 621 LPAR------EGLQYPKYDYWRRIEATVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKRETNTKVFLDVPE 694 (826) Q Consensus 621 ~~~~------~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~~~ 694 (826) .-.. ...+..++|+-...-..++ |... .+.| ..++|..- ..+ T Consensus 606 dg~~g~~~Fa~~~~~~f~DW~sv~~~~vd----y~sy---------------~~~g----Y~~~gd~~-------~~k-- 653 (771) T protein:vir:95 606 LSSPMRISFGGYTDEEFVDWKSVDGIGVD----APAY---------------LLTG----YLAGGDYQ-------REK-- 653 (771) T ss_pred cCCCeeEEeccccCcceeecccCCCcccc----hHHH---------------HHhh----hhccchhe-------eee-- Confidence 1110 0111122222100000000 0000 0000 00000000 000 Q ss_pred CCCCceEEEeeeee-EEEE-eCCeeEecCCCCceee-cceEEEEEEEEeeccceEEEEecCCCCccceeeeccCcccccc Q lcl|NC_011107. 695 AVVGAVYVVGCEFW-SKVE-FTPPVLRDHNGLPMTS-TRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSR 771 (826) Q Consensus 695 ~~~~~~v~vGl~y~-~~~~-~~~~~i~~~~g~~~~~-gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~ 771 (826) -+||- ++++ +-.=++.+..|+-... -.--+-.+.++...++. .++-......|.....+ ..+ T Consensus 654 ---------~~PYit~y~~~tedg~v~~~~g~~~p~n~sSclm~~sw~ws~s~~-----t~k~~~~~eaYk~~~~~-~p~ 718 (771) T protein:vir:95 654 ---------FVPYITFHFKKTEDGFVEDAEGDWTPTNQSSCMVQSQWSWTNSPA-----SNKWGRTWQAYRFRRHF-FPD 718 (771) T ss_pred ---------ccceEEEEEEeecccceecccccccccCCcceEEEEEeeeecCCC-----CCccccchheeeeccee-ccC Confidence 01110 0000 0000111111110000 00011122223333321 01100011122222211 111 Q ss_pred ccccCccccceeEE--EEEecccCceeEEEEEECCCCCEEEEEEEEEEEeeccc Q lcl|NC_011107. 772 QLNAGEPLVDSAVV--PLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTY 823 (826) Q Consensus 772 ~~~~~~~~~~tg~~--~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~y~~r~ 823 (826) ++. .+.-.....+ +..++|..+-.+++|.+...-.|+|++.+.--..|-.. T Consensus 719 ~~~-~~~yp~~~VV~TKsriRG~Gr~~~~rf~s~~gKdlhl~Gysil~~~~~~~ 771 (771) T protein:vir:95 719 NID-NQFDDGNSVVETKSRLRGSGKVLSLYITTEPKKNLHIYGWSMLVDVNGTV 771 (771) T ss_pred Ccc-hhcCCccceeeeeheeeecceEEEEEEEecCCcceEEEeEEEEEeecCcC Confidence 111 1110111122 33567888899999999999999999999888887776 No 31 >protein:vir:3133 Length: 911 # NCBI annotation: hypothetical protein # Family: family:all:5234 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640315;genbank:gi:21234408;genbank:GeneID:956056 Probab=95.18 E-value=0.0026 Score=34.76 Aligned_cols=691 Identities=14% Similarity=0.100 Sum_probs=254.2 Q ss_pred CCceeeechhhh--cccccCChhHhhhc-hhhhhhcceeeccCCcccCCch-------hhhhhhcCCCccccceeEEEEE Q lcl|NC_011107. 1 MSYKQSAYPNLL--MGVSQQVPFERLPG-QLSEQINMVSDPVSGLRRRSGI-------ELMAHLRHTDQPWPRPFLYHTN 70 (826) Q Consensus 1 M~~v~~s~~n~~--~GVSqq~d~~R~~~-q~~~~~N~~~~~~~Gl~rRpGt-------~~v~~~~~~~~~~~~~~~~~~~ 70 (826) |+.-.+....|+ +|---.-.+..|.. -.-..+||=...+|=.+||-|. +|+.......+++-...+.-+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (911) T protein:vir:31 1 MAARKGAVNRFTPVRGWVTEGNLANYGQDVALDVENMDIEKTGLTQRRFGLFAETSSEQFLSTFTATARARGLLAVKEWR 80 (911) T ss_pred CccccccccccccceeeeecCchhhcCceeEeeeccccchhcccchhheeeeeccchhhhhhhhhhhhhhcceeehhhHH Confidence 887666665553 33222223444432 2346789988888878888775 3443332222211000110000 Q ss_pred -cCCCceEEEEEecCCe-EEEEEcC-----CCEEEEecCccccccccCCc-cceEEEEEcCEEEEEeCcccCcccccccC Q lcl|NC_011107. 71 -LGGRSIAMLVAQHRGE-LYLFDER-----DGRLLMGQPLVHDYLKANDY-RQLRAATVADDLFIANLSVKPEADRTDIK 142 (826) Q Consensus 71 -rd~~e~~~~~~~~~g~-i~v~~~~-----~g~~~~~~~~~~~y~~a~~~-~~l~~~~vaD~~fi~n~~~~~~~~~~~~~ 142 (826) ..++...-+++|..|+ +.|.... ...++...-.....-..... +-.+..---.+-.|.||..-|-.. ++. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ 158 (911) T protein:vir:31 81 EAWGDKDVNMLIFHAGYKVHVVQDTAPLRDANILLTIDLLEAGIKLDGVIDSPVHISVGVGFAIITNPRIEPVLI--KLD 158 (911) T ss_pred HhhCCCcceEEEEecCcEEEEEecccCccccceEEEeeeeccCceeeeeecCceeEEeeceEEEeecCccceEEE--Eee Confidence 0112222233444443 2222110 01111111001100000000 012222223566777887655311 011 Q ss_pred CCCCCccEEEEEcccccCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechh-eeeeccce Q lcl|NC_011107. 143 GVDPNKAGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLY-GKFFGAPE 221 (826) Q Consensus 143 ~~~~~~~a~~~v~~g~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~-~~~~ga~~ 221 (826) ..+.. ++-.+ .| ..+++.|..-. --.+|++--+-+...-...........|..-.+. ..-.+..+ T Consensus 159 ~~~~~--~~~~~---~~-~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (911) T protein:vir:31 159 DVDDE--GVPTL---SY-EPLTLLIRTRE--------LLTPYTTGTNYGDTLTPEEEWNLYNSGWATITRATKDKSGSGT 224 (911) T ss_pred ccCcc--Ccccc---cc-cceeeEeeehh--------hccccccccccCcccCchhhcccccccceeeeeecccCCccce Confidence 11110 00000 00 00111110000 0011111100000000000001111111000000 00000000 Q ss_pred EEEeecccccceeeccccceeecccccccccccceEEEecCCc---eEE--EEecCCCCcceEEEEEEeecccccccccc Q lcl|NC_011107. 222 YTLPNSTKKYPKVDPDANAATIAGYLNQRGVQDGYIAFRGDAD---IHV--EVSTDMGNNYGIASGGMSLNATADLPALL 296 (826) Q Consensus 222 ~t~~~~~~~~~~~a~~~~~~t~a~~~~~~~~~~g~~~~~~~~~---~~~--~~~~~~g~~~~~~~~~~~v~~~~~l~~~~ 296 (826) ..+ -| -.| |+..... ..+ +...-......... +-+ T Consensus 225 ~~~----------~~-----------------~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~------- 264 (911) T protein:vir:31 225 VYV----------NP-----------------VQY-YFDKRGVYPSHSVLYNSMKQESAKEIVAL-----NVF------- 264 (911) T ss_pred EEE----------ch-----------------hhe-eecccCcCcchhhhhhhhhhhccceeEEE-----eee------- Confidence 000 00 000 0000000 000 00000000000000 000 Q ss_pred cCcccccEeeeeeeeeEeccCCCC------cceEEEEEcCC---------------------c-eEEEeecccccccccc Q lcl|NC_011107. 297 PGVGAPGVGVQFMDGAVMATGSTK------APVYFEWDSAN---------------------R-RWAERAAYGTDWVLKK 348 (826) Q Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~------~~~y~~~~~~~---------------------~-~w~e~~~~g~~~~~~t 348 (826) ++..-.+|.-.+-+.+ +.||. +... + .=.|.-.| .|++. T Consensus 265 --------~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~e~~np---~gl~~ 331 (911) T protein:vir:31 265 --------SPWADEKINFGTTTPPLGRYIHSAYYF--DSAAILSLGIGNLTPPTSDGTTEGSGPAEEEISNP---IGLDN 331 (911) T ss_pred --------ccccccccccccCCCchhhhhhhheee--ccceeeeecccccCCCCCCCccCCCCCchhhhcCC---CCccc Confidence 0000000100000000 11221 1000 0 00000001 11110 Q ss_pred ---ee-EEEEEecCCCeEEEeccCcCccccCCccccCCccccCCCccEEEEEcceEEEec-----CCeEEEEecC----- Q lcl|NC_011107. 349 ---MP-LALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRLVLLS-----QEYVCMSASN----- 414 (826) Q Consensus 349 ---~p-~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~~~q~RL~f~~-----~~~v~~S~~g----- 414 (826) +- ..+. +.|+.. |. +...|+|++||.+|++|+. ...|.+|+.- T Consensus 332 igt~~n~k~~---a~~~~~-----~~---------------~~~r~r~~~~yaGRVfyaD~dkngk~rIlFSqLv~sl~d 388 (911) T protein:vir:31 332 IGTVNNLKLI---AEGTVR-----WT---------------VKDRPRCSGYHNGHVYFGDRDKNGKTRILVSQLVNSLDN 388 (911) T ss_pred ccchhceeee---ecccee-----ee---------------ecccccceeeeccEEEEeeeccCcceeEEEEeecccccc Confidence 00 0000 112211 21 1246899999999999995 3479999853 Q ss_pred ---CcccCcccccc--cCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCC--ccccccceEEEEEEeecc Q lcl|NC_011107. 415 ---NPHRWFKKSAA--ALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGG--GIVTPRTAVISITTQYDL 487 (826) Q Consensus 415 ---d~~nF~~~s~~--~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~--~~lTP~~~~~~~~s~~~~ 487 (826) +|++=++++.. .+.|.|-.-+.+... ..|+-+|.+++.|++|..++.|.|.|. ...|.++..|...+..+| T Consensus 389 i~nCYQdaDPTSeee~DLIdTDGg~vri~ga--h~Ii~LV~~G~sLlVFcaNGVWAI~G~d~~g~TATdy~ItKIsdvGc 466 (911) T protein:vir:31 389 IPKCFQDADPTAEEINDLIATDGFTMYPVGM--GAPITMVEFNKRLLLLCTNGVWAIRGTSGGGATATDFTLDKVASVEF 466 (911) T ss_pred ccccccCCCccccccchhhhcCCcEEecCCC--CCceEEEEecCeEEEEEeCcEEEEeccCCCceeeeeeEEEEEeeeee Confidence 36655555433 244567777776443 557889999999999999999999764 479999999999999999 Q ss_pred ccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHH-HHHHHhcCC----CeEEE--EEcCCCCEEEEEEc Q lcl|NC_011107. 488 DTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVT-SHIPSYMPG----PAEYI--QAAASSGYLVFGTS 560 (826) Q Consensus 488 ~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls-~~~~~~~~~----~v~~~--~~s~~p~~~v~~~~ 560 (826) ++.=.=|++|+.++|-+++| |..+...+..+ +.++.+| ..+..|.+. .|... .|-..++.+.|+.. T Consensus 467 sspNSVVvVgn~i~fWSd~G-----IyaLganqfnD--~tAnNLTesTIQ~y~d~I~~dkIkNVtgtyd~de~rVyW~yP 539 (911) T protein:vir:31 467 NSPQSVVDIGTAIVFWSERG-----IIAIGVNDFGD--LTSNNLTENTIDEYYDSLDRDIIKNVKGTFINDENRVYWVVP 539 (911) T ss_pred CCCCeEEEecCceEEeeCCc-----EEEEeecccCc--cccccccHHHHHHHHhhcChhhhceEEEEEEccCCEEEEEec Confidence 98888899999999999986 66666665444 5788888 567777654 23332 33455667778765 Q ss_pred C---CCeEEEEE----EeecCCceeeeeeEeeecC-CcEEEEEEE----------------------------------- Q lcl|NC_011107. 561 T---ADEMICHQ----YLWQGNEKVQNAFHRWTLR-HQIIGAYFT----------------------------------- 597 (826) Q Consensus 561 ~---~g~l~~~t----yl~~~~e~~v~aW~~w~~~-g~v~~~~~~----------------------------------- 597 (826) + ....+.+. +.+ .-.-.+|-+|... |.++..-.. T Consensus 540 n~lDe~teykt~~~~ILVf---dLatgaFYPwtvs~gpLl~~p~y~Lv~TreEvtvPi~~etgaiIve~gsdPV~~tl~v 616 (911) T protein:vir:31 540 NKQDSNGEYKTDGELVLVL---NLDTGGFYKHTVSGGPLLHAPFRRLVNTRAEVSIPITETDGTVITDTLGDPVTVTRTV 616 (911) T ss_pred CccCCccceeecCceEEEE---EeccCcccceeeecceeecccccccccccccceeeEEeecceEEEecCCCCeEEEEee Confidence 2 22343322 110 1123588888653 444311100 Q ss_pred -----CCeEEEE-EEeCCCEEEEEEEEeecCcccCCCcccccccceE--EEeecccceeccceee-ccCCcccceeeEEe Q lcl|NC_011107. 598 -----GDNLMVL-IQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRI--EATVDGELELTKQHWD-LIKDASAVYQLQPV 668 (826) Q Consensus 598 -----~d~l~~v-v~r~~~~~~~r~~~~~~~~~~~~~~~~~d~~~~~--~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 668 (826) +...||+ +|-+.+|++.-+ -+..+ .-+.|....+...+. ..+++.+..+.+.-.. ...|... ..-+ T Consensus 617 dttGvDg~ayLl~frdg~~g~~~f~-a~~~~-~~~~dw~~~~~~~~~~y~s~~~~~y~~~~~~~~~~~~pyi~---sy~~ 691 (911) T protein:vir:31 617 TTTGVDGLAYFASFDDGVNGQFNFI-AEHQP-WGFADWANVPNMTRVNYSSYVDFAYEYPEVMIGNISLPYIH---SYYL 691 (911) T ss_pred ecccccceeEEEeeccCCcceEEEE-EeecC-CeeeccccCccccccchhHHHHhhhhhhhhhhhcccCceee---eeee Confidence 2334555 333333332211 11111 111122211111111 1112222222111000 0001100 0011 Q ss_pred cCceeeeeecccceecCCceEEEecCCCCCceEEEeeeeeEEEEeCCeeEecCCCCceeecceEEEEEEEEeeccceEEE Q lcl|NC_011107. 669 AGAYMERTHLGVKRETNTKVFLDVPEAVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLW 748 (826) Q Consensus 669 ~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~v~vGl~y~~~~~~~~~~i~~~~g~~~~~gr~~v~r~~~~~~~t~~~~v 748 (826) .|-.++ .++-...+..-...++- +.++-..| .+.+-+..|+...|- ..|+--.=.+.++-...+ T Consensus 692 ~~~rv~--~~~y~~~~a~~~f~~~~---~~~~~~~~-----~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~v 755 (911) T protein:vir:31 692 TGIRVQ--TEQYTTETAHLSFHRVQ---AHQTTALG-----TVTFHKVDMMVSTGM------QVISFHKDDLLRTEAVTL 755 (911) T ss_pred eeeEEe--ccceeeecccceeEeee---cccceeee-----eeeeeeeeehhhccc------eeeeeccccceeeeeeEE Confidence 111111 01111111111111111 11111122 122223333322221 111111112223333332 Q ss_pred EecCCCCccceeeeccCccccccccccCc-cccceeEEEEEecccCceeEEEEEEC-----------CCCCEEEEEEEEE Q lcl|NC_011107. 749 RISDTARPNQPWYDTTPLRLFSRQLNAGE-PLVDSAVVPLPARVDMATSKFELSCH-----------SPYDMNVRAVEYN 816 (826) Q Consensus 749 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~tg~~~vp~~~~~~~~~v~i~~~-----------~P~P~tvl~i~we 816 (826) . +++.+ .-++++.........+.. -|...|++-|-..+ +- .-.+.|+ .-.+.++. +|- T Consensus 756 V--NGDAE---~GtmTGWtvtaG~~d~~Ta~p~~rGSyfFa~~n-n~--n~aL~QDIDSagaaaIDAG~v~ynvS--awl 825 (911) T protein:vir:31 756 V--NPDAE---TGDATGWTVTAGTLDVRTAAPLYQGSYYFWSDS-NA--NFAAYQDIDPVGGGYITAGELANNVI--EAK 825 (911) T ss_pred E--cCCCC---CCCCCcceeeccchhhccCCchhcceEeEcCCC-Cc--chhhheeccccccceeeeccchhhhh--hhh Confidence 1 21111 223444333332222211 13456666664222 22 2222222 23344443 377 Q ss_pred EEeeccc--cc--C Q lcl|NC_011107. 817 FKSNQTY--RR--V 826 (826) Q Consensus 817 g~y~~r~--rr--v 826 (826) +.|..+. -| | T Consensus 826 ~gyAaqnd~Dr~~l 839 (911) T protein:vir:31 826 LSWAARGNTDLGTV 839 (911) T ss_pred hhhccCCCCccceE Confidence 7777764 34 3 No 32 >protein:vir:105563 Length: 396 # NCBI annotation: hypothetical protein # Family: family:all:27455 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164316;genbank:gi:56692963;genbank:GeneID:3197174 Probab=87.42 E-value=0.039 Score=28.32 Aligned_cols=368 Identities=11% Similarity=0.049 Sum_probs=144.0 Q ss_pred CCceeeechhhhcc--cccCChhHh----hhchhhhhhcceeeccCCcccCCchhhhhhhcCCCccccc---eeEEEEEc Q lcl|NC_011107. 1 MSYKQSAYPNLLMG--VSQQVPFER----LPGQLSEQINMVSDPVSGLRRRSGIELMAHLRHTDQPWPR---PFLYHTNL 71 (826) Q Consensus 1 M~~v~~s~~n~~~G--VSqq~d~~R----~~~q~~~~~N~~~~~~~Gl~rRpGt~~v~~~~~~~~~~~~---~~~~~~~r 71 (826) |+-+ +|--|+|= |+.-.++.| -..-+++.+|.=.++.|=.+||-|.+-+...+..+ .+.. -+.|.. T Consensus 1 ~~~~--~~~~~~ginnv~~e~~l~~~~~~~~~~~r~a~nvdi~~~G~~~~r~~~tr~~~g~l~~-~~~~~~~~~~~~~-- 75 (396) T protein:vir:10 1 MATT--SLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQ-LWQSPLHGDAFGA-- 75 (396) T ss_pred Ccce--eeeeeecccccccccccccCCCcccceeeeeeeecccCCCchhhhccCcccCCceecc-cccCccccceeee-- Confidence 8866 33333321 444444443 34478999999999999999999988885443322 1111 122222 Q ss_pred CCCceEEEEEecCCeEEEEEcCCCEEEEecCccccccccCCccceEEEEEcCEEEEEeCcccCcccc---cccCCCCCCc Q lcl|NC_011107. 72 GGRSIAMLVAQHRGELYLFDERDGRLLMGQPLVHDYLKANDYRQLRAATVADDLFIANLSVKPEADR---TDIKGVDPNK 148 (826) Q Consensus 72 d~~e~~~~~~~~~g~i~v~~~~~g~~~~~~~~~~~y~~a~~~~~l~~~~vaD~~fi~n~~~~~~~~~---~~~~~~~~~~ 148 (826) .+..|.... ++..++|-. ..+..++ +.+.+.+|=+|..+-..+=-+.. ..+.-..+.+ T Consensus 76 --~~~tl~~~~-~~~w~~~~~---v~v~~~p-------------va~d~~~~Rvy~t~~~~p~~~~~~~~y~L~vp~P~~ 136 (396) T protein:vir:10 76 --LGDQWGKVD-PHSWTFEPL---AQIGEGD-------------LSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTPAP 136 (396) T ss_pred --CCceEEEEe-CCeEEEEee---eeeccCc-------------hhccccCCeEEEEcCCCceeeeCCcceecCcCCCcc Confidence 133333221 122232211 0011111 11223344455555332211000 0010011111 Q ss_pred cEEEE----EcccccCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeeccceEEE Q lcl|NC_011107. 149 AGWLY----IKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTL 224 (826) Q Consensus 149 ~a~~~----v~~g~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga~~~t~ 224 (826) .-.+. +..++| .+..+|-+-.+-..-+ .++... T Consensus 137 a~~~a~~Gsl~~~~~-------------------~Y~~t~V~~~gEEs~p------------~~~S~~------------ 173 (396) T protein:vir:10 137 PLLVAGAGSLSQGTY-------------------GAAVAWLRGPQESAPS------------LIAFAE------------ 173 (396) T ss_pred cccccccCccCCceE-------------------EEEEEEEecCCCcCcc------------cccccc------------ Confidence 11110 111111 1112221111100000 000000 Q ss_pred eecccccceeeccccceeecccccccccccceEEEecCCceEEEEecCCCCcceEEEEEEeecccccccccccCcccccE Q lcl|NC_011107. 225 PNSTKKYPKVDPDANAATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGV 304 (826) Q Consensus 225 ~~~~~~~~~~a~~~~~~t~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~ 304 (826) ++..++....+. .+...+. . T Consensus 174 --------------------------------------------v~~~gg~~vtl~-----------~~~~~~i-----~ 193 (396) T protein:vir:10 174 --------------------------------------------VTDAGALEVTFP-----------LCLDASV-----T 193 (396) T ss_pred --------------------------------------------cCCCCCcEEEEE-----------cccCCCc-----c Confidence 000000000000 0000000 0 Q ss_pred eeeeeeeeEeccCCCCcceEEEEEcCCceEEEeecccccccccceeEEEEEecCCCeEEEeccCcCccccCCccccCCcc Q lcl|NC_011107. 305 GVQFMDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFN 384 (826) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~y~~~~~~~~~w~e~~~~g~~~~~~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~ps 384 (826) .. . +..++++...+|+.-+ ......+|.+...+|...-.-..-=-|+|. T Consensus 194 --~~--R-iYrS~~~G~~~~l~aE--------------------------~~a~~~s~vlPs~~w~gpP~~~~gL~pmP~ 242 (396) T protein:vir:10 194 --GA--R-LYLTRANGGELLLAGD--------------------------YPLGAATVILPTLPELGRPAQFRHLSPMPT 242 (396) T ss_pred --eE--E-EEEeCCChhhhhheeh--------------------------hccceeeeeeecCCCCCCCccccccccCch Confidence 00 0 0111222222221110 011222333445567543211111123332 Q ss_pred ccCCCccEEEEEcceEEEecCCeEEEEecCCcccCc-ccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcE Q lcl|NC_011107. 385 FVTRGITGMTTFQGRLVLLSQEYVCMSASNNPHRWF-KKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQ 463 (826) Q Consensus 385 f~g~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~-~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q 463 (826) . ..+.||..||+++.++.||+|...-++=+. ++.-+ .-+ ..|.-+.+++.+|+++|+++- T Consensus 243 G-----~~~A~faGRi~~A~Gn~V~FSEp~~Ph~~~~~~~~~--~~~------------~~Iv~lapv~~gL~Vgt~~~~ 303 (396) T protein:vir:10 243 G-----KHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFV--QMP------------QRITFVQPVDGGIWVGQVDHV 303 (396) T ss_pred h-----HhhhhhcceEEEEeCCEEEEecCCCCceecchhccC--CCC------------CceEEEEEecCeEEEEEcCcE Confidence 1 147899999999999999999999874222 22111 111 235667788899999999999 Q ss_pred EEEeCCcc--ccccceEEEEEEeecccc---------CCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHH Q lcl|NC_011107. 464 AVVPGGGI--VTPRTAVISITTQYDLDT---------RAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVT 532 (826) Q Consensus 464 ~~i~~~~~--lTP~~~~~~~~s~~~~~~---------~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls 532 (826) |.+.|.++ ++........ ..-|+. .-..+..|..++|+++.| +. .. ..++ - ...++ T Consensus 304 y~~~G~dP~sms~~~l~~~~--pvp~S~v~~p~~~~s~rs~~~~~~~~lwas~dG-----l~---~g-~~~G-~-v~~l~ 370 (396) T protein:vir:10 304 AFLDGADPASLSVSRRASRA--PVPGSAVLVPAEVVGTNASPDGSPVAVWLAENG-----YV---MG-TSSG-A-IAEVH 370 (396) T ss_pred EEEEcCChhHcceeecccCC--CcccchhcccchhhhcccccccCcEEEEccCCc-----EE---EE-cCCc-e-eeeec Confidence 99998643 3333332110 011222 222345688999999987 22 11 2222 1 11122 Q ss_pred HHHHHhcCCCeEEEEEcCCCCEEEEEEcCCCeEEEEEEeec Q lcl|NC_011107. 533 SHIPSYMPGPAEYIQAAASSGYLVFGTSTADEMICHQYLWQ 573 (826) Q Consensus 533 ~~~~~~~~~~v~~~~~s~~p~~~v~~~~~~g~l~~~tyl~~ 573 (826) . ..+... ...+.+ + +..|++.+++ + + T Consensus 371 ~---~~i~p~-~~~A~~-------~-~~~drRy~~~--~-~ 396 (396) T protein:vir:10 371 A---GVLAGI-TGRAGT-------S-VVFDRRLLTA--V-S 396 (396) T ss_pred c---cccCCC-cccceE-------E-EeecCeEEEE--e-C Confidence 2 122211 111111 1 1122222211 0 0 No 33 >protein:vir:108312 Length: 458 # NCBI annotation: hypothetical protein # Family: family:all:1540 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552278;genbank:gi:160700603;genbank:GeneID:5758828 Probab=67.90 E-value=0.25 Score=23.93 Aligned_cols=446 Identities=11% Similarity=0.067 Sum_probs=163.9 Q ss_pred cEEEEEcccccCceeEEEEeeccccceeeeeeEEEEeecCCCCccccccccceEEecceeeechheeeeccceEEEeecc Q lcl|NC_011107. 149 AGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNST 228 (826) Q Consensus 149 ~a~~~v~~g~y~~~y~v~i~g~~~s~~tt~~~tasyttp~g~~t~~~~~~~~~~~~~~~ia~~l~~~~~ga~~~t~~~~~ 228 (826) -..+-|..|+|.. .... .+ .+..++..+.+.........-+..+... T Consensus 1 m~~~~ip~gsy~a-------------------------~~~~------~d--aq~~VN~yp~~~e~g~ss~~l~~tPGl~ 47 (458) T protein:vir:10 1 MVQRQIPLVATTA-------------------------EGDV------SG--QEILVNVYPRKSDGGKYPFTLRHTPGLA 47 (458) T ss_pred Cceeeeceeeeec-------------------------cccc------cc--ceeeeeeeeecccccccccceEecCCce Confidence 1222222222210 0000 00 1111122221111000000000000000 Q ss_pred cccceeeccccceeecccccccccccceEEEecCCceEEEEecCCCCcceEEEEEEeecccccccccccCcccccEeeee Q lcl|NC_011107. 229 KKYPKVDPDANAATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQF 308 (826) Q Consensus 229 ~~~~~~a~~~~~~t~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~~~~~~ 308 (826) .. .+..+... .......|..|...+...-. ++.++ . ... ..+++..-+. .-... T Consensus 48 ~f-----~~~~~~~~----~g~~~~~g~ly~v~g~~LY~-V~~~~-~-------~~~---iG~i~gsg~V-----sMa~n 101 (458) T protein:vir:10 48 FF-----CELPTFPV----MAMHQNGSRAFAVTPRDMYE-ISKDG-T-------YKR---LGSVDFKGRV-----VMEDN 101 (458) T ss_pred ee-----ecCCCCce----eeEEecCCEEEEeeCceEEE-EeCCc-e-------EEE---EecccCceeE-----EEeeC Confidence 00 00000000 00111122222222211100 11111 0 000 0111100000 00000 Q ss_pred eeeeEeccCCCCcceEEEEEcCCceEEEeecccccccccceeEEEEEecCCCeEEEeccCcCccccCCccccCCccccCC Q lcl|NC_011107. 309 MDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTR 388 (826) Q Consensus 309 ~~~~~~~~~~~~~~~y~~~~~~~~~w~e~~~~g~~~~~~t~p~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~ 388 (826) .+..++..+. ..++|+..+ ..+ .++ |.+ .|. T Consensus 102 g~q~vi~~G~----~gY~yd~at----------------------------~~~--~~i-~d~------------~~~-- 132 (458) T protein:vir:10 102 GKQIVMVDGE----KGYYYDSET----------------------------EIV--QEI-KAE------------GFY-- 132 (458) T ss_pred CcEEEEEECC----eEEEEeecc----------------------------cEE--Eec-cCc------------ccc-- Confidence 0111222121 112222111 111 111 111 111 Q ss_pred CccEEEEEcceEEEec--CCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcE--E Q lcl|NC_011107. 389 GITGMTTFQGRLVLLS--QEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQ--A 464 (826) Q Consensus 389 ~~~~v~~~q~RL~f~~--~~~v~~S~~gd~~nF~~~s~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q--~ 464 (826) .+..|.|..+|++|.. .+.++.|-.+| . -=||++++-+..+++.|.-++.+.+.|++|.+..- | T Consensus 133 ~~~~v~~~dGy~V~~~~g~~~~~is~L~d------~------s~d~l~fa~Ae~~pD~iv~i~~~~~~i~~fG~~TiEvw 200 (458) T protein:vir:10 133 PASTVTYQDGYFIFDRKGTGQFFISELLD------V------AFDPLDFATAEGQPDPLLAVLSDHREVFMFGQETIEVW 200 (458) T ss_pred CcceEEEeCcEEEEEeeCCCEEEEEecCc------c------eeCcceeeeecCCCCceEEEEeeccEEEEEeccceEEE Confidence 3678999999999885 44566675443 1 15699999999999999999999999999987764 7 Q ss_pred EEeCCccccccceE-EEEEEeeccccCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHH-HHHhcCCC Q lcl|NC_011107. 465 VVPGGGIVTPRTAV-ISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSH-IPSYMPGP 542 (826) Q Consensus 465 ~i~~~~~lTP~~~~-~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~-~~~~~~~~ 542 (826) ..+|+..+.=.... ... ..+|++.-.=..+|++++|++..+ .|+.+ . .|+++-||-| +++.|.. T Consensus 201 ~ntG~a~fpy~r~~ga~i--~~Gcaa~~sv~~~~~t~~~l~~d~----~Vy~l--~-----g~~~~rIST~aIE~~i~s- 266 (458) T protein:vir:10 201 YNSGAADFPFERNQGAFI--EKGIGAPYSVAKTNNTVYFIGSDL----MIYQI--T-----GYTPVRISTHAVEQTLKG- 266 (458) T ss_pred EecCCCCcceeeccccee--eecccCcchhhhhCceEEEEcCCe----EEEEe--c-----CceeEEeeCHHHHHHHhc- Confidence 77775332211111 111 346776655578999999998654 45543 2 2333333333 3333322 Q ss_pred eEEEEEcCCCCEEEEEEcCCC-eEEEEEEeecCCceeeeeeEeeecCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEeec Q lcl|NC_011107. 543 AEYIQAAASSGYLVFGTSTAD-EMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFTGDNLMVLIQKGQEIALGRMHLNSL 621 (826) Q Consensus 543 v~~~~~s~~p~~~v~~~~~~g-~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~~d~l~~vv~r~~~~~~~r~~~~~~ 621 (826) | .-.+.+.++-..+| .+|+++| +.. ... |.+|+. -.+|.+-+ .+..+|+ T Consensus 267 -----y-~~~da~a~t~~~eGH~fy~Ltf----P~a-~~T---w~yD~~--------t~~Wher~---Sg~~~~~----- 316 (458) T protein:vir:10 267 -----V-NLSDAFAYTYQSEGHLFYVLTI----PGK-NLT---WCYDIS--------SGSWHVRQ---SYQFDRH----- 316 (458) T ss_pred -----C-ChhheEEEEEEecCeEEEEEEC----CCC-Cce---eEEecc--------cccceeec---cCCCCce----- Confidence 1 12224455555454 3666665 211 011 111110 01121110 0011111 Q ss_pred CcccCCCcccccccceEEEeecccceeccceeeccCCcccceeeEEecCceeeeeecccceecCCceEEEecCCCCCceE Q lcl|NC_011107. 622 PAREGLQYPKYDYWRRIEATVDGELELTKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKRETNTKVFLDVPEAVVGAVY 701 (826) Q Consensus 622 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~v 701 (826) +..++..-.+..+.++. . -..+..+.... ... T Consensus 317 --------------Ra~~~v~~~g~~~vGD~----~----ng~ly~ld~~~---------~td----------------- 348 (458) T protein:vir:10 317 --------------VSNNSIYFDQKTLVGDF----Q----NGRIYIMADNY---------YTD----------------- 348 (458) T ss_pred --------------EEEEEEEeCCeEEEEEc----C----CCeEEEEcccC---------cCC----------------- Confidence 11111111111111100 0 00011111100 000 Q ss_pred EEeeeeeEEEEeCCeeEecCCCCceeecceEEEEEEEEeeccceEEEEecCCCCccce--eeeccCccccccc-c--ccC Q lcl|NC_011107. 702 VVGCEFWSKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQP--WYDTTPLRLFSRQ-L--NAG 776 (826) Q Consensus 702 ~vGl~y~~~~~~~~~~i~~~~g~~~~~gr~~v~r~~~~~~~t~~~~v~v~~~~~~~~~--~~~~~~~~~~~~~-~--~~~ 776 (826) -|-+++..+.++. +. ++ ..|++++++.|.+.---+... +++-+.+. .++.++..-.+.+ . ..| T Consensus 349 -~g~~i~~~~~~p~--~~--~~----~~rl~~~~~el~~~tGvg~~~---~~~~~p~~~l~~S~d~g~~~s~~~~~~~lg 416 (458) T protein:vir:10 349 -DGDPVVREFILPV--VN--NG----REFLTVDSLELDLSSGVGLTV---GQGSDPELRVYFSKDNGNEYSQNFKVGKIG 416 (458) T ss_pred -CCceeeeeeeccc--ee--CC----CCeEEEEEEEEEEecceeeee---CCCCCceEEEEEeeCCCcccchhHHHhhcC Confidence 0222233333322 11 11 124445555554421111110 11111111 1111111111111 1 124 Q ss_pred ccccceeEEEEEecccCceeEEEEEECCCCCEEEEEEEEEEE Q lcl|NC_011107. 777 EPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFK 818 (826) Q Consensus 777 ~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~weg~ 818 (826) ++-.....+++.-.|..++--++|+-..|.|.+|+++..+-+ T Consensus 417 ~~gey~tr~~~~rlG~ar~rvf~v~~s~p~~~~l~ga~~~~r 458 (458) T protein:vir:10 417 RKGEFLTRAKVNRFGCARQFTFKVEISDPIPVDIGGAWVEVR 458 (458) T ss_pred CcchhhhhhhhhhhccCcceEEEEEEecchhhcceeeeEEeC Confidence 443334444554456677766999999999999999999888 Done!