Query         lcl|NC_012418.1_cdsid_YP_002727857.1 [gene=PPphikF77_gp38] [protein=putative tail tubular protein B] [protein_id=YP_002727857.1] [location=26366..28846]
Match_columns 826
No_of_seqs    162 out of 195
Neff          8.2
Searched_HMMs 1612
Date          Thu Nov 7 12:59:05 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_38 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_38_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score   SS Cols Query HMM Template HMM
  1 protein:vir:78957 Length: 826  100.0  1E-255  7E-259 1418.4 92.3  825     1-826 1-826 (826)
  2 protein:vir:6326 Length: 826 # 100.0  2E-255  1E-258 1417.3 91.4  822     1-826 1-826 (826)
  3 protein:vir:10452 Length: 794  100.0  5E-227  3E-230 1261.1 84.6  769     1-826 1-794 (794)
  4 protein:vir:80253 Length: 777  100.0  3E-225  2E-228 1251.4 85.6  774     1-826 1-777 (777)
  5 protein:vir:3366 Length: 801 # 100.0  5E-223  3E-226 1239.2 84.3  775     1-826 1-801 (801)
  6 protein:vir:2203 Length: 794 # 100.0  4E-222  2E-225 1234.5 87.4  770     1-826 1-794 (794)
  7 protein:vir:1543 Length: 801 # 100.0  7E-222  4E-225 1233.1 83.9  775     1-826 1-801 (801)
  8 protein:vir:99677 Length: 794  100.0  1E-221  8E-225 1231.5 84.5  767     1-826 1-794 (794)
  9 protein:vir:94583 Length: 792  100.0  2E-221  1E-224 1231.2 84.7  769     1-826 1-792 (792)
 10 protein:vir:94713 Length: 785  100.0  3E-221  2E-224 1229.9 83.9  765     1-826 1-785 (785)
 11 protein:vir:8887 Length: 808 # 100.0  7E-220  4E-223 1222.1 85.0  778     1-826 1-808 (808)
 12 protein:vir:97014 Length: 800  100.0  4E-217  2E-220 1207.2 83.0  770     2-826 1-800 (800)
 13 protein:vir:105647 Length: 800 100.0  7E-217  4E-220 1205.8 82.3  769     2-826 1-800 (800)
 14 protein:vir:7021 Length: 803 # 100.0  4E-215  3E-218 1195.9 86.0  769     2-826 1-803 (803)
 15 protein:vir:100022 Length: 976 100.0  4E-214  2E-217 1190.5 82.1  796     1-826 1-976 (976)
 16 protein:vir:78703 Length: 905  100.0  7E-213  4E-216 1183.7 79.7  800     1-826 1-905 (905)
 17 protein:vir:103341 Length: 806 100.0  1E-211  8E-215 1176.8 82.9  771     2-826 1-806 (806)
 18 protein:vir:103790 Length: 768 100.0  6E-168  4E-171  937.4 72.3  722     1-823 1-768 (768)
 19 protein:vir:1778 Length: 680 # 100.0  1E-162  7E-166  908.3 54.1  573     1-592 1-680 (680)
 20 protein:vir:95324 Length: 823  100.0  8E-155  5E-158  865.5 66.6  706     1-822 1-823 (823)
 21 protein:vir:107802 Length: 681 100.0  2E-150  1E-153  841.9 67.3  660     1-821 1-681 (681)
 22 protein:vir:98487 Length: 681  100.0  2E-150  1E-153  841.9 67.3  660     1-821 1-681 (681)
 23 protein:vir:107423 Length: 681 100.0  2E-150  1E-153  841.9 67.3  660     1-821 1-681 (681)
 24 protein:vir:7329 Length: 825 # 100.0  4E-150  2E-153  839.8 62.9  708     1-822 1-825 (825)
 25 protein:vir:102644 Length: 594 100.0  2E-133  1E-136  747.8 58.1  558     1-822 1-594 (594)
 26 protein:vir:94602 Length: 1012  99.6 5.2E-14 3.3E-17   93.4 35.0  780     1-825 1-1012(1012)
 27 protein:vir:80177 Length: 1027  99.4   2E-11 1.2E-14   79.3 33.4  762     1-826 1-936 (1027)
 28 protein:vir:2625 Length: 715 #  99.2 8.9E-10 5.5E-13   70.2 41.4  635     1-823 1-715 (715)
 29 protein:vir:95475 Length: 771   98.3 1.2E-06 7.3E-10   53.1 41.6  658     1-823 1-771 (771)
 30 protein:vir:8837 Length: 513 #  98.0 7.1E-06 4.4E-09   48.8 35.8  475   217-825 1-513 (513)
 31 protein:vir:3133 Length: 911 #  95.5   0.002 1.2E-06   35.4 36.2  669     1-826 1-839 (911)
 32 protein:vir:105563 Length: 396  94.6  0.0039 2.4E-06   33.8 18.1  375     1-573 1-396 (396)
 33 protein:vir:108312 Length: 458  85.1   0.055 3.4E-05   27.5 36.6  426   149-818 1-458 (458)
 34 protein:vir:3529 Length: 477 #  64.3     0.3 0.00019   23.4 32.1  437   241-816 1-477 (477)
 35 protein:vir:95324 Length: 823   21.3     2.6  0.0016   18.3 37.6  666    48-826 1-741 (823)

No 1
>protein:vir:78957 Length: 826 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522826;genbank:gi:158345061;genbank:GeneID:5687447
Probab=100.00 E-value=1.1e-255 Score=1418.42 Aligned_cols=825 Identities=95% Similarity=1.469 Sum_probs=780.7

Q ss_pred
CcceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCCCceEEEE Q lcl|NC_012418. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~ 80 (826) ||+|+|+||||+||||||||++|||+||++|+||+|+|+|||+||||++||+++++++..+++||+|.++||++|++|++ T Consensus 1 M~~i~~~~~nl~gGvSqq~d~~r~~~q~~~~~N~~~~~~gG~~rRpgt~~va~~~~~~~~~~~~f~~~~~r~s~e~~~~l 80 (826) T protein:vir:78 1 MSYKQSAYPNLLMGVSQQVAFERLPGQLSEQINMVSDPVSGLRRRSGIELMASLLHTDQPWPRPYLYHTNLGGRSIAMLV 80 (826) T ss_pred CcceeeecchhccceecccchHhhhhhhhhhhcceeccccccccCCchHhhhhhccCCcCCceeEEEEeccCCcceEEEE Confidence 99999999999999999999999999999999999999999999999999999999988889999999999999999999 Q ss_pred EecCCeEEEEECCCCEEEEecCcccccccCCCcccEEEEEecCEEEEeeCCcceeeeecccCCCCCCccEEEEEcccccC Q lcl|NC_012418. 81 AQHRGELYLFDERDGRLLMGQPLVHDYLKAADYRQLRAATVADDLFIANLSVKPEADRTDVKGVDPNKAGWLYIKAGQYS 160 (826) Q Consensus 81 ~~~~g~irv~d~~~g~~~~~~~~~~~yl~~~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~~~vr~g~Y~ 160 (826) ++++|+|||||+.+|.+++.++...+|++++++++|+++||||+|||||++++|++..+.....+++.++++++|+|+|+ T Consensus 81 ~~~~g~irv~~~~~g~~~~~~~~~~~y~~~~~~~~l~~~t~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~v~~g~y~ 160 (826) T protein:vir:78 81 AQHRGELYLFDEKDGRLLMGQPLVHDYLKASDYRQLRAATVADDLFIANLEVRPEADKADVLGVDPSKTGWLYIKAGQYS 160 (826) T ss_pred EEcCCcEEEEECCCCEEEEecCcccceeecCCcceeEEEEEcCEEEEEcCcEeeeeccccccCCCCCceEEEEecccccC Confidence 99999999999999999999888889999999999999999999999999999999888777888999999999999999 Q ss_pred eeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeeeeccceecccccc Q lcl|NC_012418. 161 KAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDTAA 240 (826) Q Consensus 161 r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~~~~~~~~~~ 240 (826) |+|+|+|++.+.+..++...++.|++|++..+.........+.+..|+++++...+...+.|.....+..+.+..+.... 
T Consensus 161 ~~y~v~i~~~~~~~~~~~s~t~~y~t~~~~~~~~~~~~~~~~~~~~~~a~~l~~~~~~~~~~~~~~~t~~~~~~~~~~~~ 240 (826) T protein:vir:78 161 KAFSLTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLFGKFFGAPEYTLPNSTKKYPKVDPDPAA 240 (826) T ss_pred ceeEEEeccceeecccccceeEEEEeccCCccccccccccceecchhhheecceeeccccceeeeccceeEeeccccccc Confidence 99999999999999999999999999999999888888999999999999999888888889888888888888888888 Q ss_pred cccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEecCCCcc Q lcl|NC_012418. 241 ATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVMATGSTK 320 (826) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 320 (826) ....+++......++++++.+.++..+..++++|+++...++.+.|+++++|++.+|+.+.+|+.+++++++++++++.. T Consensus 241 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~a~~p~~~~~~~~~~~~~~~~~~~g~~~ 320 (826) T protein:vir:78 241 ATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAIMATGSTK 320 (826) T ss_pred eeeccceeecccccceEEEecCCCeEEEeccCCCccceEEEeeEEEecccceeeeecccccceEEEEEEeeeEecCCCcc Confidence 88889999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEecCCceEEEeecccccccccceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccCCCceEEEEEcceE Q lcl|NC_012418. 
321 APVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRL 400 (826) Q Consensus 321 ~~~y~~~~~~~~~w~E~~~~g~~~~~~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~~~q~RL 400 (826) ++||++|+..+++|+||++||+.+++.||||.++.+.++++|++++.+|++|.+||+++||+|+|+|++|++|+|||||| T Consensus 321 ~~~y~~~~~~~~~w~e~a~~g~~~~~~tmp~~l~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~i~~v~f~q~RL 400 (826) T protein:vir:78 321 APVYFAWDAANRRWAERAAYGTDWVLKKMPLALRWDESTDTYSLNELEYDRRGSGDEETNPTFNFVKRGITGMTTFQGRL 400 (826) T ss_pred cceeEEEEcCCceEEEeeccCcccccccccEEEEEecCCCeEEEeeccccccccCcccccCcccccCCCceEEEEEeceE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEE Q lcl|NC_012418. 401 VLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVIS 480 (826) Q Consensus 401 ~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~~~~~lTP~~~~~~ 480 (826) +|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+++++|||+|++++ T Consensus 401 ~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~~s~~~~~i~~~v~~~~~L~l~T~~~e~~l~~~~~lTP~~~~~~ 480 (826) T protein:vir:78 401 VLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVIS 480 (826) T ss_pred EEeeCCeEEEEeccCccccccccccCCCCCCcEEEEEccCcceeEEEEEecCCcEEEEecCcEEEEeCCCcccceeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEEEEEcCCCCEEEEEEc Q lcl|NC_012418. 
481 ITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAASSGYLVFGTS 560 (826) Q Consensus 481 ~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~s~~p~~~v~~~~ 560 (826) ++|+|+|+++++|+.+|++++|+++||+.+++||||.|++|++++|+++|||+|++|||+++|.+|++|++|++++|+++ T Consensus 481 ~~s~~~~~~~~~Pv~vG~~v~F~~~r~~~~s~v~e~~~~~~~~~~y~~~dlt~~~~~l~~~~v~~~a~s~~~~~~v~~~~ 560 (826) T protein:vir:78 481 ITTQYDVDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAASSGYLVFGTS 560 (826) T ss_pred EEEeecccCCCCceEeCCeEEEEecCCCceeEEEEEEeeecccCccchHHHHHHHHHhcCCCeEEEEEeCCCCeEEEEEc Confidence 99999999999999999999999999988999999999999999999999999999999999999999999999999999 Q ss_pred CCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEeecCCcCCCCcccccceEEEEe Q lcl|NC_012418. 561 TADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEA 640 (826) Q Consensus 561 ~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~~d~l~~vv~R~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~ 640 (826) +||+|++|||||+++||+|+|||||+|+|+|++||+++|+||++|+|++++++|||.++++++++..+.+.+|+++.+++ T Consensus 561 ~~g~l~~~ty~~~~~e~~v~aW~~~~~~g~v~~v~~i~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~ 640 (826) T protein:vir:78 561 AADEMICHQYLWQGNEKVQNAYHRWTLRHQIIGAYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEA 640 (826) T ss_pred CCCeEEEEEEEecCCcEEEEeEEEEccCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEEecCCCccccccccceeEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999998887 Q ss_pred eccceeeecccccCccccccce-EEeeeeeeEeeccEEcccEecCCCeEEEeecCCcCCceEEEEEeeeeEEEeCCeEEE Q lcl|NC_012418. 641 TVEGELELTKQHWDLIKDAPAV-YQLQPVAGAFMERYQLGVKRETNTKVFLDVPEAVVGSVYVVGCEFWSKVEFTPPVLR 719 (826) Q Consensus 641 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~v~vG~~y~~~~~~~~~~~~ 719 (826) ..++....+....+...+.... ++++. 
.+...++..++...+..+.++|++|++.++++|+|||+|+++++|+||+++ T Consensus 641 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~VGl~y~s~~~~~~~~~~ 719 (826) T protein:vir:78 641 TVDGELELTKQHWDLIKDGAAVYQLQPQ-VGAYMERYQLGVKRETSTKVFLDVPEAVVGSVYVVGCEFWSKVEFTPPVLR 719 (826) T ss_pred EEcceeccccceeEEecCCceeeeeccc-eeeeccccceeccccCCCceEEEeCCCccccEEEEeeceeEEEEeCceEEe Confidence 7777666655555444333333 44443 444555555666778889999999999999999999999999999999999 Q ss_pred CCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccccccccccccCccccccceEEEEeecccceeEEE Q lcl|NC_012418. 720 DHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLVDSAVVPLPARVAMATSKFE 799 (826) Q Consensus 720 ~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~~p~~~~~~~~~v~ 799 (826) +++|+.++.+|+||+|++|+|.+||.|.++|+++.++....+.+.+.++++.++.++.|+..++++++|+.+++++.+|+ T Consensus 720 ~~~g~~~~~~r~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~t~~v~vp~~~~~~~~~i~ 799 (826) T protein:vir:78 720 DHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLSSRQLNAGEPLVDSAVVPLPARVDMATSKFE 799 (826) T ss_pred cCCCcceeecceEEEEEEEEeeccccEEEEeCCCccCcceeeeecccccccccccCCcccccceEEEEeeeccCceEEEE Confidence 99999999999999999999999999999999999988788888999999999999999889999999999999999999 Q ss_pred EEECCCCCEEEEEEEEEEEEecccccC Q lcl|NC_012418. 
800 LSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 800 i~~~~p~P~tvl~i~~eg~y~~r~rrv 826 (826) |+++.|+||+|++|+|||+||+|+||| T Consensus 800 i~~d~P~P~tvlai~~~~~y~~r~rrv 826 (826) T protein:vir:78 800 LSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) T ss_pred EEeCCCCcEEEEEEeEEEEecceeecC Confidence 999999999999999999999999999 No 2 >protein:vir:6326 Length: 826 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877473;genbank:gi:33300845;uniprot:Q7Y2D1;genbank:GeneID:1482615 Probab=100.00 E-value=1.7e-255 Score=1417.31 Aligned_cols=822 Identities=96% Similarity=1.459 Sum_probs=781.5 Q ss_pred CcceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCCCceEEEE Q lcl|NC_012418. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~ 80 (826) ||+|+|+||||++|||||||++|||+||++|+||+|+|+|||+||||++||+++++++...++||+|+++||+.|++|++ T Consensus 1 M~~i~~s~~n~~~GvSqq~d~~r~~~q~~~~~N~~~~~~~G~~rRpg~~~v~~~~~~~~~~~~~~~~~~~r~~~~~~~~~ 80 (826) T protein:vir:63 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) T ss_pred CceeeeecchhhcceeccCchHhhhhhhhhhhcceeeccCCcccCchhHhhhhhccCCccccccEEEEEecCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999999888889999999999999999999 Q ss_pred EecCCeEEEEECCCCEEEEecCcccccccCCCcccEEEEEecCEEEEeeCCcceeeeecccCCCCCCccEEEEEcccccC Q lcl|NC_012418. 
81 AQHRGELYLFDERDGRLLMGQPLVHDYLKAADYRQLRAATVADDLFIANLSVKPEADRTDVKGVDPNKAGWLYIKAGQYS 160 (826) Q Consensus 81 ~~~~g~irv~d~~~g~~~~~~~~~~~yl~~~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~~~vr~g~Y~ 160 (826) ++++|+|||||+++|+++++++..++|+++.+.++|+++||||+|||||++++|++..+.....+++.++++++|+|+|+ T Consensus 81 ~~~~g~irv~~~~~g~~~~~~~~~~~y~~~~~~~~l~~~t~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~v~~g~Y~ 160 (826) T protein:vir:63 81 AQHRGELYLFDERDGRLLMGQPLVHDYLKANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAGWLYIKAGQYS 160 (826) T ss_pred EecCCcEEEEEcCCCeEEEcCCCCCceeeecCccceEEEEeCCEEEEEeCCeeeeeccccccccCCCCcEEEEeeccccC Confidence 99999999999999999999888889999888889999999999999999999999887778888899999999999999 Q ss_pred eeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeeeeccceecccccc Q lcl|NC_012418. 161 KAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDTAA 240 (826) Q Consensus 161 r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~~~~~~~~~~ 240 (826) ++|+|+|++.+.+.++..+.++.|+++++.++..++.....+++..+++.++...+.+...|+....+.....++++..+ T Consensus 161 ~~y~vti~~~~~~~gt~~s~t~t~~t~~~~~a~~~~~~~~~~~s~~yia~~l~~~~~a~~~~~~~~~t~~~~~~~~~~~a 240 (826) T protein:vir:63 161 KAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDANA 240 (826) T ss_pred ceEEEEEEeccccCCccccceEEEEeccCCcccccccccceeeeeeeeeeeceeeeeeccccccCCCccccceecCCccc Confidence 99999999999988899899999999999999988888888899999999999988888889888888888888888888 Q ss_pred cccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEecCCCcc Q lcl|NC_012418. 241 ATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVMATGSTK 320 (826) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 320 (826) ...+++.......++++++...++..+..++++|+++.+.++.+.++++++||+.+|..+..++.+++.+++++.+|+.. 
T Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~~p~~~~~~~~~~~~~~~~~~~g~~~ 320 (826) T protein:vir:63 241 ATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVMATGSTK 320 (826) T ss_pred ceeecceeEecccccEEEEeeCCcccEEEccCCCCcceEEEEEeeccceeeccccCCCcccceEEEeeEEeEEecCCCcc Confidence 88888888888899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEecCCceEEEeecccccccccceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccCCCceEEEEEcceE Q lcl|NC_012418. 321 APVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRL 400 (826) Q Consensus 321 ~~~y~~~~~~~~~w~E~~~~g~~~~~~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~~~q~RL 400 (826) ++||++|+..+++|+||++||+.+++.||||.|+.+.++++|++++.+|++|.+||+++||+|+|+|++|++|+|||||| T Consensus 321 d~~y~~~~~~~~~w~e~~~~~~~~~~~tmp~~l~~~~~~~~f~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL 400 (826) T protein:vir:63 321 APVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRL 400 (826) T ss_pred cceEEEEEcCCceEEEEeecCcccccccceEEEEEeccCCeEEEeccccccccccccccCCCccccCCCceEEEEEeceE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEE Q lcl|NC_012418. 
401 VLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVIS 480 (826) Q Consensus 401 ~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~~~~~lTP~~~~~~ 480 (826) +|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+++++|||+|++++ T Consensus 401 ~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~~~q~~ls~~~~lTP~~~~i~ 480 (826) T protein:vir:63 401 VLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVIS 480 (826) T ss_pred EEeeCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEEEEEcCCCCEEEEEEc Q lcl|NC_012418. 481 ITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAASSGYLVFGTS 560 (826) Q Consensus 481 ~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~s~~p~~~v~~~~ 560 (826) ++|+|+|+++++|+.+|++++|+|++|+++++||||.|++|+++.|+++|||+|++|||+++|.+|++|++|++++|+++ T Consensus 481 ~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~~~s~v~e~~~~~d~~~~y~~~dlt~~~~~l~~~~v~~~a~s~~~~~v~~~~~ 560 (826) T protein:vir:63 481 ITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAASSGYLVFGTS 560 (826) T ss_pred EEEeecccCCCCceEeCCeEEEEecCCCceeEEEEEEeeeccccceehhHHHHHHHHhcCCCeEEEEEcCCCCEEEEEEc Confidence 99999999999999999999999999988999999999999999999999999999999999999999999999999999 Q ss_pred CCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEeecCCcCCCCcccccceEEEEe Q lcl|NC_012418. 
561 TADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEA 640 (826) Q Consensus 561 ~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~~d~l~~vv~R~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~ 640 (826) +||+|++|+|||+++||+|+|||||+|+|+|++||+++|+||++|+|++++++|||.++++.+.+...++.+|++..++| T Consensus 561 ~dg~l~~~~y~~~~~e~~v~aW~~~~~~g~v~~~~~i~d~l~~iv~r~~~~~~~r~~~e~~~~~~~~~~~~~d~~~~~d~ 640 (826) T protein:vir:63 561 TADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEA 640 (826) T ss_pred CCCEEEEEEEeeCCCcEEEEeEEEEecCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEEecCCccccccCCccceEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999988889999999999999 Q ss_pred eccceeeecccccCccccccceEEeeeeeeEeeccEEcc----cEecCCCeEEEeecCCcCCceEEEEEeeeeEEEeCCe Q lcl|NC_012418. 641 TVEGELELTKQHWDLIKDAPAVYQLQPVAGAFMERYQLG----VKRETNTKVFLDVPEAVVGSVYVVGCEFWSKVEFTPP 716 (826) Q Consensus 641 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~g~~~l~~~~~~~~~~v~vG~~y~~~~~~~~~ 716 (826) +..+........++. ..-|+++..+..++|+.+.+ ...+.+|.++|++|++..+.+|+|||+|+++++|+|| T Consensus 641 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~l~~~~~~~~~~v~VGl~y~s~~~~~~~ 716 (826) T protein:vir:63 641 TVAGELELTKQHWDL----IKDASAVYQLQPVAGAYMERTHLGVKRETNTKVFLDVPEAVVGAVYVVGCEFWSKVEFTPP 716 (826) T ss_pred eeeeeeccCcceeec----ccCcccccEEEEeeCccccCCccceEEecCCEEEEecCCCccccEEEEeeeeeEEEEecce Confidence 988777655544432 22367777777888887654 3577789999999999999999999999999999999 Q ss_pred EEECCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccccccccccccCccccccceEEEEeeccccee Q lcl|NC_012418. 717 VLRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLVDSAVVPLPARVAMATS 796 (826) Q Consensus 717 ~~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~~p~~~~~~~~ 796 (826) ++++++|+.++.+|+||+|++|+|.+||+|.++|+++.++....+.++++++++.++.+|.|++.++++++|+.+++++. 
T Consensus 717 ~~~~~~g~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~p~~~t~~~~vP~~~~~~~~ 796 (826) T protein:vir:63 717 VLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLVDSAVVPLPARVDMATS 796 (826) T ss_pred EEEccCCCcceeccEEEEEEEEEeeccccEEEEecCccccceeEeecCCceecccccccccccccceEEEEEEeeccceE Confidence 99999999999999999999999999999999999999988888999999999999999999999999999999999999 Q ss_pred EEEEEECCCCCEEEEEEEEEEEEecccccC Q lcl|NC_012418. 797 KFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 797 ~v~i~~~~p~P~tvl~i~~eg~y~~r~rrv 826 (826) +|+|+|+.|+||+|++|+||++||+|+||| T Consensus 797 ~i~i~~d~P~p~~il~i~~~~~yn~r~rrv 826 (826) T protein:vir:63 797 KFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) T ss_pred EEEEEeCCCCcEEEEEEEEEEEEeceeecC Confidence 999999999999999999999999999999 No 3 >protein:vir:10452 Length: 794 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848299;genbank:gi:30387490;genbank:GeneID:1733952 Probab=100.00 E-value=5.2e-227 Score=1261.14 Aligned_cols=769 Identities=22% Similarity=0.311 Sum_probs=656.5 Q ss_pred CcceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCCCceEEEE Q lcl|NC_012418. 
1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~ 80 (826) ||+|+|+||||++|||||||++|||+||++|+||+|+|+|||+||||++||+++++++.....++.|+++||+.|+||++ T Consensus 1 M~~i~~s~~n~~~GvSqq~D~~Ry~~q~~~~~N~~~~~~gG~~rRpGt~fv~~l~~~~~~~~~~~~~~~~rd~~e~~~v~ 80 (794) T protein:vir:10 1 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLKTLGDNGALGQAPYIHLINRDENEQYYAV 80 (794) T ss_pred CcceeeecchhhcccccCCchHHhhhhHhhhhcceeeeccCcccCcchhhheeccCCCccccceeeeEEecCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999998887777889999999999999999 Q ss_pred EecCCeEEEEECCCCEEEEecCcccccccCCC-cccEEEEEecCEEEEeeCCcceeeeecc--cCCCCCCccEEEEEccc Q lcl|NC_012418. 81 AQHRGELYLFDERDGRLLMGQPLVHDYLKAAD-YRQLRAATVADDLFIANLSVKPEADRTD--VKGVDPNKAGWLYIKAG 157 (826) Q Consensus 81 ~~~~g~irv~d~~~g~~~~~~~~~~~yl~~~~-~~~l~~~~vaD~~fi~n~~~~~~~~~~~--~~~~~~~~~a~~~vr~g 157 (826) +++. +||||+++|+.+.+..+...+|+.+++ ..+|+++|+||+|||||++++|++..+. ...+++..++++++|+| T Consensus 81 ~~~~-~irv~~~~G~~~~v~~~~~~~Y~~aa~~~~~l~~~q~aD~~fivn~~~~~~~~~~~~~~~~~~~~~~~~~~v~~g 159 (794) T protein:vir:10 81 FTGT-GIRVFDLAGNEKQVRYPNGSNYIKTANPRSDLRMVTVADYTFIVNRNVVVQKDPNSVNLANYNPKQDGLINIRGG 159 (794) T ss_pred EeCC-eEEEEEcCCcEEEEEcCCCCcceecCCCcceEEEEEEcCEEEEEcCCeeeeeeccccccCCCCCCccEEEEeccc Confidence 9876 599999988887777667788886654 4589999999999999999999986553 34567778999999999 Q ss_pred ccCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeeeeccceeccc Q lcl|NC_012418. 158 QYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPD 237 (826) Q Consensus 158 ~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~~~~~~~ 237 (826) +|+|+|+++|++.+ .+.+++|++.. +.....++.++++.++...+... 
T Consensus 160 ~y~r~y~i~i~~~~---------~at~~tpdgt~-----~~~~~~~s~~~ia~~L~~~l~a~------------------ 207 (794) T protein:vir:10 160 QYGRELIVHINGKD---------VATYKIPDGSK-----PEHVNNTDAQWLAERLAKQMRIN------------------ 207 (794) T ss_pred ccceEEEeccCCcc---------eeEEEecCCCC-----cccceecchhhhhhhhhhhhhcc------------------ Confidence 99999999998764 35677777643 34556677888888887654321 Q ss_pred ccccccccceEecccCCcEEEEEcCCCeEE--EEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEec Q lcl|NC_012418. 238 TAAATVAGYLNQRGVQDGYIAFRGDGDIVV--EVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVMA 315 (826) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~~ 315 (826) .++++.... ++++++.+..+... ..+.+++.++.+.++.+.++++++||+.+| +|+.+++ .+. T Consensus 208 -----~~g~t~~~~--g~~i~i~a~s~~~~~t~s~~~~~~~~~~~~v~~~~~~~~~lp~~~~----~G~~v~i----~~~ 272 (794) T protein:vir:10 208 -----LSGWTVNVG--QGFIHVTAPSGQQIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAP----NGYMVKI----VGD 272 (794) T ss_pred -----cCCceEEeC--CeEEEEEeccCceeccccccCCcCcceeEEEEeccCcceecccCCC----CCcEEEE----EeC Confidence 112333322 45666666655443 234556667889999999999999888766 4666654 455 Q ss_pred CCCccceEEEEEecCCceEEEeeccccccc--ccceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccCCCceEE Q lcl|NC_012418. 
316 TGSTKAPVYFEWDSANRRWAERAAYGTDWV--LKKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVTRGITGM 393 (826) Q Consensus 316 ~~~~~~~~y~~~~~~~~~w~E~~~~g~~~~--~~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v 393 (826) .++..++||++|+..+++|+||++||+..+ ..||||.++ ++++++|++++.+|++|.+||+++||+|+|+|++|++| T Consensus 273 ~~~~~~~yyv~~~~~~~~w~E~~~~g~~~~~~~~tmP~~l~-r~~~~t~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v 351 (794) T protein:vir:10 273 ASKSADQYYVRYDAERKVWTETLGWNTENQVLLETMPHALV-RAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDV 351 (794) T ss_pred CCCCcceeEEEEEcCCcEEEEecccceeEEEecccceeEEE-EeccceEEeeecccccccccccccCccCcccCCCccEE Confidence 677789999999999999999999997655 469999998 77899999999999999999999999999999999999 Q ss_pred EEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCcccc Q lcl|NC_012418. 394 TTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVT 473 (826) Q Consensus 394 ~~~q~RL~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~~~~~lT 473 (826) +||||||+|++|++|||||+||||||+++|++++.|||||+++++++++++|+|+++++++|+|||+++||+|+++++|| T Consensus 352 ~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~i~T~~~q~~l~~~~~lT 431 (794) T protein:vir:10 352 FFFRNRLGFLSGENIILSRTAKYFNFYPASIANLSNDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLT 431 (794) T ss_pred EEEcceEEEeeCCeEEEEecCCcccccccccccCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEE-eeccccccccchhHHHHHHHHhcCCCeEEEEEc-CC Q lcl|NC_012418. 
474 PRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEM-APSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAA-AS 551 (826) Q Consensus 474 P~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~-~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~s-~~ 551 (826) |+|++++++|+|+|++.++|+.+|++++|++++| ++++++|| .|+.+.| .|+++|||+|++|||++++..++++ ++ T Consensus 432 P~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~~~r~~~~~~~~d-~y~a~Dlt~~~~hl~~~~v~~~~~~~~~ 509 (794) T protein:vir:10 432 SRSVELNLTTQFDVQDRARPYGIGRNVYFASPRS-SYTSIHRYYAVQDVSS-VKNSEDITSHVPNYIPNGVFSICGSGTE 509 (794) T ss_pred ceeEEEEEEEeecccCCCCceEeCCeEEEEecCC-CeeEEEEEeeeccccC-ceehhhHHHHHHHhcCCceEEEEEeCCC Confidence 9999999999999999999999999999999988 57777665 5665555 5999999999999999998887665 45 Q ss_pred CCEEEEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEEEEE--CCeEEEEEEeCCCEEEEEEEEeecCCcCCCCc Q lcl|NC_012418. 552 SGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYFT--GDNLMVLIQKGQEIALGRMHLNSLPAREGLQY 629 (826) Q Consensus 552 p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~--~d~l~~vv~R~~~~~~~r~~~~~~~~~~~~~~ 629 (826) |..++|+++++|+|++|+|||+++||+|+|||||+|+|.|+++|++ +|+||++|+|++++++|||.+.+.. .+. T Consensus 510 ~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~iv~r~~~~~~~r~~~~~~~----~~~ 585 (794) T protein:vir:10 510 NFCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSDMYVILRNEFNTFLARISFTKNA----IDL 585 (794) T ss_pred CcEEEEEEcCCCEEEEEEEeecCCceEEEeEEEEEcCCcEEEEEEEecCCeEEEEEEeCCCEEEEEEEEeecC----CCC Confidence 5677899999999999999999999999999999999999988865 8999999999999999999775543 344 Q ss_pred ccccceEEEEeeccceeeeccccc-------CccccccceEEeeeeeeEeeccEEccc---EecCCCeEEEeecCCcCCc Q lcl|NC_012418. 630 PKYDYWRRIEATVEGELELTKQHW-------DLIKDAPAVYQLQPVAGAFMERYQLGV---KRETNTKVFLDVPEAVVGS 699 (826) Q Consensus 630 ~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~g~~~l~~~~~~~~~ 699 (826) +.+++..++||.......-...+. .........|+++.++...+|+..... 
..+.++..+|.+|++..++ T Consensus 586 ~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~g~~~~eg~~v~~~adg~~~~~~~~~~~~~g~~~l~i~~~~~a~ 665 (794) T protein:vir:10 586 QGEPYRAFMDMKIRYTIPSGTYNDDTFTTSIHIPTIYGANFGRGKITVLEPDGKITVFEQPTSGWQSDPWLRLSGNLEGR 665 (794) T ss_pred CCccceeeeecceEEEecCcccccccccceEEcccccCcccccccEEEEecCCceeeeeeeeeeeecceEEEecCCCCCc Confidence 455556666765533322111110 011112345899999999999965543 3344677899999999999 Q ss_pred eEEEEEeeeeEEEeCCeEEECCCCcee----eecceEEEEEEEEeeccceEEEEecCCCCCceeeeeecccccccccccc Q lcl|NC_012418. 700 VYVVGCEFWSKVEFTPPVLRDHNGLPM----TSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNA 775 (826) Q Consensus 700 ~v~vG~~y~~~~~~~~~~~~~~~g~~~----~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~ 775 (826) +|+|||+|+++++|+||++++++|+.+ ..+|+||+|++++|.+||+|.++|+++.++. .+.+.+.++++..+.+ T Consensus 666 ~v~vGl~y~s~~~~~~~~i~~~~~~~~~~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~~--~~~~~~~~~~~~~~~~ 743 (794) T protein:vir:10 666 EVFIGFNINFVYEFSKFLIKQTTDDGSTSTEDIGRLQLRRAWVNYEDSGTFDIYVENQSSNW--KYTMAGARLGSNTLRA 743 (794) T ss_pred eEEEeeeeeEEEEecceEEEccCCCcceeeeccccEEEEEEEEEeeccccEEEEEcCCcccc--ceeeccceeccccccc Confidence 999999999999999999999988644 4589999999999999999999999987653 4567899999999999 Q ss_pred CccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecccccC Q lcl|NC_012418. 
776 GEPLVDSAVVPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 776 ~~~~~~tg~~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r~rrv 826 (826) +.+++.+|+++||+.+|+++.+|+|+|++|+||+|+||+|||+||+|+||| T Consensus 744 g~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~eg~y~~r~~~v 794 (794) T protein:vir:10 744 GRLNLGTGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEGNYLRRSSGI 794 (794) T ss_pred cccccccceEEEEecccCceEEEEEEECCCCceEEEEEEEEEEEeccccCC Confidence 999999999999999999999999999999999999999999999999999 No 4 >protein:vir:80253 Length: 777 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522886;genbank:gi:158345179;genbank:GeneID:5687516 Probab=100.00 E-value=3.2e-225 Score=1251.35 Aligned_cols=774 Identities=38% Similarity=0.662 Sum_probs=649.2 Q ss_pred CcceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCCCceEEEE Q lcl|NC_012418. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~ 80 (826) ||+|+|+||||++|||||+|++|||+||++|+||+|+|+|||+||||++||+++++.......++ |..+++++|++|++ T Consensus 1 M~~i~~~~~nf~~GvS~q~D~~ry~~q~~~~~N~~~~~~gG~~rRpGt~fv~~l~~~~~~~~~~~-~~~~~~~~e~~~~l 79 (777) T protein:vir:80 1 MSYFAGSYRQLLFGVSQQTAKDRLEGQVESQLNMQSDLVTGPRRRSPVHLIADAMAATDANRLAY-SLATFSGREVLLLV 79 (777) T ss_pred CceeeeecchhhcccccCCchHHhhhHHhhhhcceeeeccCceeCcchHhhhhhcCCCcccceeE-EEEecCCCeeEEEE Confidence 99999999999999999999999999999999999999999999999999999997665433333 55678999999999 Q ss_pred EecCCeEEEEECCCCEEEEecCcccccccCCCcccEEEEEecCEEEEeeCCcceeeeeccc--CCCCCCccEEEEEcccc Q lcl|NC_012418. 81 AQHRGELYLFDERDGRLLMGQPLVHDYLKAADYRQLRAATVADDLFIANLSVKPEADRTDV--KGVDPNKAGWLYIKAGQ 158 (826) Q Consensus 81 ~~~~g~irv~d~~~g~~~~~~~~~~~yl~~~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~--~~~~~~~~a~~~vr~g~ 158 (826) ++++|+|||||+.+|.++..+. .+|+++.++++|+|+|+||+|||||++++|++..+.. 
..++++.++++++++|+ T Consensus 80 ~~g~g~irv~~~~~g~~~~~~~--~~Yl~a~~~~~l~~~q~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~~~v~~~~ 157 (777) T protein:vir:80 80 DTLDGTLTILDDATGEVLFTGT--NSYLTAGTGRSIRFAALDDSVFVANTEVIPQTQLWSGASAYPDPTRAGYLYVVAGA 157 (777) T ss_pred EecCCeEEEEECCCCeEEEecC--CCceeeccccceeEEEEcCEEEEEeCCccceeeecccCCCccCcccceEEEeeccC Confidence 9999999999999998887764 5798888888999999999999999999999865433 34677788999999999 Q ss_pred cCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeeeeccceecccc Q lcl|NC_012418. 159 YSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDT 238 (826) Q Consensus 159 Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~~~~~~~~ 238 (826) |+++|+|.|++...+...+... .++. ....+.+..+++.++...+.....+ T Consensus 158 ~g~~y~i~i~~~~~~~~~t~~~----~t~~---------~~~~~~~~~~ia~~L~~~~~~~~~~---------------- 208 (777) T protein:vir:80 158 FSKQYRLSITNQVTGVTTSVDV----TTSA---------TEASQATGEYVITQLRTAAEADATI---------------- 208 (777) T ss_pred CCceeeEeecCCcCceeEEEec----CCcc---------cccccccchhhhhhhhhhhccccce---------------- Confidence 9999999998776543332211 1111 1223345667777776544322111 Q ss_pred cccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEecCCC Q lcl|NC_012418. 239 AAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVMATGS 318 (826) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~~~~~ 318 (826) .+.++++... .++++++....+.. .+..+|+++++....+.|++..+||+++|... +.++..++. 
T Consensus 209 --~s~~~~~~~~--~g~~~~i~~~~~~~--~t~~~g~~~~~~~~~~~v~~~~~lp~~~~~~~---------~~~~~~~~~ 273 (777) T protein:vir:80 209 --GTAAGFAYYQ--DGAYLYVTAPEAIA--VSTDSGSNFLRASNAASIRDAAELPAKLPADA---------DGFIIATGA 273 (777) T ss_pred --eecCceEEEe--CCcEEEEEecCcee--EecCCcCccceeeeeEEEeecccccccccccc---------ceEEEeCCC Confidence 1122333333 34566666666543 35667788889999999999999999987532 234556777 Q ss_pred ccceEEEEEecCCceEEEeecccccccccceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccCCCceEEEEEcc Q lcl|NC_012418. 319 TKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVTRGITGMTTFQG 398 (826) Q Consensus 319 ~~~~~y~~~~~~~~~w~E~~~~g~~~~~~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~~~q~ 398 (826) ..++||++|+..+++|+||++||+.+++.+|||.++.. +++|.+++.+|++|.+||+++||+|||+|++|++|+|||| T Consensus 274 ~~~~~y~~~~~~~~~w~e~~~~~~~~~~~t~p~~l~~~--~~~~~~~~~~w~~r~~gd~~tn~~Psf~g~~i~~v~f~q~ 351 (777) T protein:vir:80 274 AKNKTYFRWVDLERKWDEDASRGAQAELIDMPLRITYS--APNFSLTALNYERRASGDATSNPALKFTEQGISGMTTMQG 351 (777) T ss_pred CCCceEEEEEccCcEEEEeecccccccccccceEEEec--CCceEeeccCCccccccccccCCCceecCCceeEEEEEcc Confidence 88999999999999999999999999999999999743 3689999999999999999999999999999999999999 Q ss_pred eEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceE Q lcl|NC_012418. 
399 RLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAV 478 (826) Q Consensus 399 RL~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~~~~~lTP~~~~ 478 (826) ||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+++++|||+|++ T Consensus 352 RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdDpI~~~~ss~~~~~i~~~v~~~~~L~i~T~~~e~~l~~~~~lTP~~~~ 431 (777) T protein:vir:80 352 RLVLLAGEYVCMSASGNPLRWFRASVSTQSDDDPIEVAATAPVASPYEYAVAFNKDLVLFAKTHQGLVPGANLLTSRNAT 431 (777) T ss_pred eeeeecCCeEEEEeccCccccccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCceEEEeCCCcccceeEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEEEEEcCCCCEEEEE Q lcl|NC_012418. 479 ISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAASSGYLVFG 558 (826) Q Consensus 479 ~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~s~~p~~~v~~ 558 (826) ++++|+|+|+++++|+.+|++++|+++|++++++||||+|+++.++.|+++|||+|++|||+++|.+|++|++|++++|+ T Consensus 432 ~~~~s~~~~~~~~~Pv~vG~~v~Fv~~r~g~~s~v~e~~~~~~~~d~y~a~Dlt~~~~hl~~~~v~~~a~s~~p~~v~~~ 511 (777) T protein:vir:80 432 AAVVTEYSFQNSCSPVVAGRTVFFASPRSGPWSAVWEMLPSQYTDAQVEASDSTSHLPKYIAGPVRFLATSSTTSIVVVG 511 (777) T ss_pred EEEEEeeccCCCCCceEeCCeEEEEecCCCceeEEeeeeecccccCceehhHHHHHHHHhcCCceEEEEEcCCCceEEEE Confidence 99999999999999999999999999888788999999999877667999999999999999999999999999999999 Q ss_pred EcCCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEeecCCcCCCCcccccceEEE Q lcl|NC_012418. 
559 TSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRI 638 (826) Q Consensus 559 ~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~~d~l~~vv~R~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~ 638 (826) +++||+|++|||||+++||+|+|||||+|+|+|++||+++|+||++|+|+...++|||.++...+....+...+|+...+ T Consensus 512 ~~~dg~l~~~ty~~~~~e~~v~aW~r~~~~g~v~~v~~i~d~l~~iv~r~~~~~le~~~~~~~~d~~~~~~~~~D~~~~~ 591 (777) T protein:vir:80 512 TSNLRELVVHEYLWQGGEKVHAAWHKWSFPQDITGAYFRGDRLILLFHVAGRVILGELFMQRLGDAQSIPGGFLDLYRVG 591 (777) T ss_pred EcCCCeEEEEEEeecCCceEEEeeEEeccCCcEEEEEEECCEEEEEEEcCCeEEEEEEeeccCCCCcccceeeeeeeeee Confidence 99999999999999999999999999999999999999999999999999999999998877776665556666665544 Q ss_pred EeeccceeeecccccC-ccccccceEEeeeeeeEeeccEEcccEecCCCeEEEeecCCcCCceEEEEEeeeeEEEeCCeE Q lcl|NC_012418. 639 EATVEGELELTKQHWD-LIKDAPAVYQLQPVAGAFMERYQLGVKRETNTKVFLDVPEAVVGSVYVVGCEFWSKVEFTPPV 717 (826) Q Consensus 639 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~v~vG~~y~~~~~~~~~~ 717 (826) ....++..+.+....+ ..+.....+..+........ ..+..........+.++++..+++|+|||+|+++++|+||+ T Consensus 592 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~--~~~~v~~~~~~~~~~v~~~~~~~~v~VGl~y~s~~~~~~~~ 669 (777) T protein:vir:80 592 AANADEEVAIPAFAADLYPEDSTFAYKLSGEFQSLGQ--RCGDRRVDGATVYIKVVGAQAGDQYRIGLRYLSKLGPTRPI 669 (777) T ss_pred eeeeCCccceeEeeccccCCcceeEEEecCcccccce--eeeeEEeCCceeeEEEcCCCCCCEEEEeeeeEEEEEeCceE Confidence 4444444433332222 22222222222221111111 11112222222345566778899999999999999999999 Q ss_pred EECCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccccccccccccCccccccceEEEEeecccceeE Q lcl|NC_012418. 718 LRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLVDSAVVPLPARVAMATSK 797 (826) Q Consensus 718 ~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~~p~~~~~~~~~ 797 (826) +++++|++++.+|+||+|++|+|++|++|.++|+++.++. 
..+.+++.++++.++.++.|++.||++++|+.+++++.+ T Consensus 670 ~~~~~g~~~~~~r~~i~r~~~~~~~sg~~~v~v~~~~~~~-~~~~~~~~~~~~~~~~~g~~~~~tg~~~vp~~~~~~~~~ 748 (777) T protein:vir:80 670 LRDPNGVPITTERTQLHRLTWSLDSTGEVTFRVADQARGE-SAYTTTPLRLYSRDLGAGLPLAATATLDTPARVDMQTAQ 748 (777) T ss_pred EeCCCCceeeecCeEEEEEEEEeeccccEEEEEcCCCCcc-eeeeecCceecccccccccccccceEEEEEEeecCcceE Confidence 9999999999999999999999999999999999988764 567789999999999999999999999999999999999 Q ss_pred EEEEECCCCCEEEEEEEEEEEEecccccC Q lcl|NC_012418. 798 FELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 798 v~i~~~~p~P~tvl~i~~eg~y~~r~rrv 826 (826) |+|+|++|+||+|+||+|||+||+|+||= T Consensus 749 v~i~~d~P~P~tilsi~~e~~y~~r~~r~ 777 (777) T protein:vir:80 749 FSLETDDYYDMNITSLEYGFRYNQRYRRQ 777 (777) T ss_pred EEEEECCCCceEEEEEEEEEEeecccccC Confidence 99999999999999999999999995444 No 5 >protein:vir:3366 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523337;swissprot:trembl:q8w5u3;genbank:gi:17570828;goa:Q8W5U3;uniprot:Q8W5U3;genbank:GeneID:927453 Probab=100.00 E-value=5.2e-223 Score=1239.20 Aligned_cols=775 Identities=21% Similarity=0.307 Sum_probs=643.1 Q ss_pred CcceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCCCceEEEE Q lcl|NC_012418. 
1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~ 80 (826) ||+|+|+||||+||||||||++|||+||++|+||+|+|+|||+||||++||++++.+....++||+|+++||+.|+|+++ T Consensus 1 M~~i~~~~~nl~~GvSqq~d~~r~~~q~~~~~N~~~~~~gG~~rRpGt~~va~~~~~~~~~~~~~~~~~~r~~~~~y~l~ 80 (801) T protein:vir:33 1 MALISQSIKNLKGGISQQPDILRFTEQGSVQINGWSSESEGIQKRPPMIHLKTLGTAGYVGAQPYVHLINRDEFEQYFVV 80 (801) T ss_pred CceeEeeccceecceeccchhHhhhhhHhhhhcceeecccCcccCchhHhhhhhcCCCccccceEEEEEEeCCceEEEEE Confidence 99999999999999999999999999999999999999999999999999999999888889999999999999999876 Q ss_pred EecCCeEEEEECCCCEEEEecCcccccccCCCc-ccEEEEEecCEEEEeeCCcceeeeec--ccCCCCCCccEEEEEccc Q lcl|NC_012418. 81 AQHRGELYLFDERDGRLLMGQPLVHDYLKAADY-RQLRAATVADDLFIANLSVKPEADRT--DVKGVDPNKAGWLYIKAG 157 (826) Q Consensus 81 ~~~~g~irv~d~~~g~~~~~~~~~~~yl~~~~~-~~l~~~~vaD~~fi~n~~~~~~~~~~--~~~~~~~~~~a~~~vr~g 157 (826) +. +++|||||++|+.+.+.. ..+|+.+.++ ++|+++|+||+|||+|++++|++... ....++++++++++++++ T Consensus 81 ~~-~~~irv~~~~G~~~~v~~--~~~y~~~~~~~~~l~~~t~aD~~fi~nr~~~p~~~~~~~~~~~~~~~~~~li~v~~~ 157 (801) T protein:vir:33 81 FT-GEDIKVFDLDGKEYQVRG--DRSYVRTANPREDLRMVTVADYTFVTNRKVVVQSNDQSVNLPGFKDQGDALINVRGG 157 (801) T ss_pred Ec-CCeEEEEccCCcEEEEec--CCcceeecCcchheEEEEEcCEEEEeeCCeeecccCCcccccccCCCcceEEEEeec Confidence 65 688999998766555443 2357655554 47999999999999999999998643 345677788999999999 Q ss_pred ccCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccc-eEEeeeeeccceecc Q lcl|NC_012418. 158 QYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPE-YTLPNSTKKYPKVDP 236 (826) Q Consensus 158 ~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~-~~~~~~t~~~~~~~~ 236 (826) +|+++|+|++++.. ++.+++|++... ....+.+..+++.++......... .....+. 
T Consensus 158 ~yg~t~~I~i~gs~---------~~~~~~~~gs~~-----~~v~~~s~~~~A~~l~~~~~~~~~~~~~~~~~-------- 215 (801) T protein:vir:33 158 QYGRRLSIEFNGAE---------RAAVQLPDGSQP-----AHVNEVDGQAIAEKLAAQLRNNLGNPNNDQDP-------- 215 (801) T ss_pred ccceEEEEEECCcc---------eEEEEeeccccc-----cccccccchhhhhhhhhhhhccCccceeeecC-------- Confidence 99999999998753 234455543221 123333444455544433221111 0000000 Q ss_pred cccccccccceEecccCCcEEEEEcCCCeEE--EEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEe Q lcl|NC_012418. 237 DTAAATVAGYLNQRGVQDGYIAFRGDGDIVV--EVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVM 314 (826) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~ 314 (826) ..+.. ...++++++...++... ..+.++++++.+.++.+.|+++++||..+| +|+.++ +.. T Consensus 216 -------~~w~~--~~~~g~~~i~~p~~~~~~~itt~~g~~~~~~~~~~~~v~~~~~lp~~~~----~g~~v~----v~~ 278 (801) T protein:vir:33 216 -------NKWRF--NVGPGFIHILAPNNDNVWGLQTKDGYADQLINPVTHYTQSFQKLPINAP----DGYIVK----IVG 278 (801) T ss_pred -------ceEEE--EecCeEEEEecCCCcccccccccCCccceeEEEEeecccceeeeeeecC----CCcEEE----EEe Confidence 01111 11223444444444322 346677788999999999999999998876 456554 344 Q ss_pred cCCCccceEEEEEecCCceEEEeecccccccc--cceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccCCCceE Q lcl|NC_012418. 
315 ATGSTKAPVYFEWDSANRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVTRGITG 392 (826) Q Consensus 315 ~~~~~~~~~y~~~~~~~~~w~E~~~~g~~~~~--~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~ 392 (826) .++++.+.||++|+..+++|+||++||+..++ .+|||+|+ ++++++|++++.+|++|.+||+++||+|+|+|++|++ T Consensus 279 ~~~~~~~~y~v~~~~~~~~w~e~~~~g~~~~~~~~tmp~~l~-~~~~~tf~~~~~~w~~r~~gd~~tnp~psf~g~~~~~ 357 (801) T protein:vir:33 279 DTSKTADQYYVRFDLNRKVWVETIGWNTRTHLHYHTMPWALV-RASDGNFDFKYLEWGARTVGDDTTNPYPSFTGQTIND 357 (801) T ss_pred cCCCcccceEEEEEcCCcEEEEeeccccceeeeecccceEEE-EccCceEEecccCccccccCCccccCcccccCCCceE Confidence 56778899999999999999999999976665 58999998 7889999999999999999999999999999999999 Q ss_pred EEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccc Q lcl|NC_012418. 393 MTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIV 472 (826) Q Consensus 393 v~~~q~RL~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~~~~~l 472 (826) |+||||||+|++|++|||||+||||||+++|+++++|||||+++++++++|+|+|+++++++|+|||+++||+|+++++| T Consensus 358 v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~~~l 437 (801) T protein:vir:33 358 IFFFRNRLGFLSGENIILSRTSKYFNFFPASVSNYSDDDPIDVAVSHDRVSTLKYAVPFSEELLLWSDQAQFVLTASDIL 437 (801) T ss_pred EEEEcceEEEeeCCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEE-eeccccccccchhHHHHHHHHhcCCCeEEEEEcCC Q lcl|NC_012418. 
473 TPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEM-APSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAAS 551 (826) Q Consensus 473 TP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~-~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~s~~ 551 (826) ||+|++++++|+|+|+++++|+.+|+++||+|++| ++++++|+ .|..+.| .|+++|||+|++|||++++.+|+++++ T Consensus 438 TP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~v~r~~~~~~~~d-~y~~~Dlt~~~~~~~~~~~~~~~~~~~ 515 (801) T protein:vir:33 438 SSRSVGLNLTTQFDVQDRARPHGVGRNVYFSSPRA-SFTSINRYYAVQDVSS-VKNAEDMTAHVPNYIPNGVFSISGTTA 515 (801) T ss_pred cceeEEEEEEEeecccCCCCceEecCeEEEEecCC-CeeEEEEEEeeccccc-ceehhhHHHHHHHhcCCceEEEEEcCC Confidence 99999999999999999999999999999999988 57777665 5554455 599999999999999999999999998 Q ss_pred CCEE-EEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEEEE--ECCeEEEEEEeCCCEEEEEEEEeecCCcCCCC Q lcl|NC_012418. 552 SGYL-VFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYF--TGDNLMVLIQKGQEIALGRMHLNSLPAREGLQ 628 (826) Q Consensus 552 p~~~-v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~--~~d~l~~vv~R~~~~~~~r~~~~~~~~~~~~~ 628 (826) |+++ +|+++++|+|++||||++++||+|+|||||+|+|.|+++|+ ++|+|||+|+|+++.++|||++... ..+ T Consensus 516 ~~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~vv~r~~~~~le~~~~~~~----~~d 591 (801) T protein:vir:33 516 ENFVAILTSGAPNRVYIYKFLYIDEEIRQQSWSHWDFGDNVTVFAAQVINSTMTVLMSNEHAVWMGRLHFTKD----SID 591 (801) T ss_pred CCeEEEEEecCCCEEEEEEEecCCCceEEEeeEEEEcCCCEEEEEEecCCCEEEEEEEcCCcEEEEEEEEeec----ccc Confidence 8876 58889999999999999999999999999999999887775 6999999999999999999976542 344 Q ss_pred cccccceEEEEeeccceeeeccc-------ccCccccccceEEeeeeeeEeeccEEcccE---ecCCCeEEEeecCCcCC Q lcl|NC_012418. 629 YPKYDYWRRIEATVEGELELTKQ-------HWDLIKDAPAVYQLQPVAGAFMERYQLGVK---RETNTKVFLDVPEAVVG 698 (826) Q Consensus 629 ~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~g~~~l~~~~~~~~ 698 (826) .+..+++.++|+........... .++........|+++.++.+++||..+... 
.+..+..+|.+|++..+ T Consensus 592 ~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~eg~~v~~~~dG~v~~~~~~~~~~~~~~~l~i~~~~~~ 671 (801) T protein:vir:33 592 LPGEPYRLYIDAKRKYTIPAGTYNDDTYQTSISLSTIYGMNFTKGRVSVVFPDGKIVEIDQPINGWSSDPMLRLDGNQEG 671 (801) T ss_pred CCCccceEEeecceEEEecccceecCccccccccccccCCccccceEEEEEeCCceEeeeeccccccCceeEEecCCCCC Confidence 45556666676643221111111 111112224558999999999999775433 23346788999999999 Q ss_pred ceEEEEEeeeeEEEeCCeEEECC----CCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccccccccccc Q lcl|NC_012418. 699 SVYVVGCEFWSKVEFTPPVLRDH----NGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLN 774 (826) Q Consensus 699 ~~v~vG~~y~~~~~~~~~~~~~~----~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~ 774 (826) ++|+|||+|+++++|+||+++.+ +.+++..+|+||+|++|++.+||+|++.|+++.++ ..+..+++++++.++. T Consensus 672 ~~v~vGl~y~s~~~~~~~~~~~~~~~~~~~~~~~~r~~l~r~~~~~~~tg~~~v~v~~~~~~--~~~~~~~~~~~~~~~~ 749 (801) T protein:vir:33 672 QVVYIGFNIPFTYTFSKFLIKKTAEDGSTATEDIGRLQLRRAWVNYEDSGAFIIRVNNLSRE--FIYTMAGARLGSDNLR 749 (801) T ss_pred CEEEEeeeeeEEEEeCceEEeccCCCCceeeeeeccEEEEEEEEEeecCcceEEEECCcccc--eeeeeccccccccccc Confidence 99999999999999999999944 44678889999999999999999999999988764 4578899999999999 Q ss_pred cCccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecccccC Q lcl|NC_012418. 
775 AGEPLVDSAVVPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 775 ~~~~~~~tg~~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r~rrv 826 (826) ++.|++++|++++|+.+|+++.+|+|+|++|+||+|++|+|||+||+|+||| T Consensus 750 ~~~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvl~i~~eg~y~~r~~~~ 801 (801) T protein:vir:33 750 VGGSNIGTGQYRFPVVGNAQTNTVTIESDASTPLNIIGCGWEGNYLRRSSGI 801 (801) T ss_pred ccccccccceEEEEeeccCceEEEEEEeCCCCCEEEEEEEEEEEEeccccCC Confidence 9999999999999999999999999999999999999999999999999999 No 6 >protein:vir:2203 Length: 794 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_042000;swissprot:sw:p03747;genbank:gi:9627472;uniprot:P03747;genbank:GeneID:1261024 Probab=100.00 E-value=3.9e-222 Score=1234.45 Aligned_cols=770 Identities=22% Similarity=0.314 Sum_probs=647.7 Q ss_pred CcceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCCCceEEEE Q lcl|NC_012418. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~ 80 (826) ||+|+|+||||++|||||+|++|||+||++|+||+|+|+|||+||||++||++++++......++.|.++|++.|+||++ T Consensus 1 M~~i~~s~~n~~~GvSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~~~~~~~~~y~l~ 80 (794) T protein:vir:22 1 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTLGDNGALGQAPYIHLINRDEHEQYYAV 80 (794) T ss_pred CceeeeecchhhcccccCCchHHhhhHHhhhhcceeeccCCceeCCchHhhhhhcccCCCCCccEEEEEEeCCCcEEEEE Confidence 99999999999999999999999999999999999999999999999999999998776667788899999999999987 Q ss_pred EecCCeEEEEECCCCEEEEecCcccccccCC-CcccEEEEEecCEEEEeeCCcceeeeecc--cCCCCCCccEEEEEccc Q lcl|NC_012418. 81 AQHRGELYLFDERDGRLLMGQPLVHDYLKAA-DYRQLRAATVADDLFIANLSVKPEADRTD--VKGVDPNKAGWLYIKAG 157 (826) Q Consensus 81 ~~~~g~irv~d~~~g~~~~~~~~~~~yl~~~-~~~~l~~~~vaD~~fi~n~~~~~~~~~~~--~~~~~~~~~a~~~vr~g 157 (826) +.+ ++||||+++|..+.+..+...+|+.++ ...+|+++|+||+|||||++++|++.... 
...+++.+++++++|+| T Consensus 81 ~~~-~~irv~~~~G~~~~v~~~~~~~y~~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~~~g~v~v~~g 159 (794) T protein:vir:22 81 FTG-SGIRVFDLSGNEKQVRYPNGSNYIKTANPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGG 159 (794) T ss_pred EcC-CeEEEEecCCcEEEeecCCCccceecCCCcccEEEEEEcCEEEEEcCCeeeeEeeccccCCCCCCCceEEEEccCC Confidence 764 579999987777777666677787554 44589999999999999999999996543 34467778999999999 Q ss_pred ccCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeeeeccceeccc Q lcl|NC_012418. 158 QYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPD 237 (826) Q Consensus 158 ~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~~~~~~~ 237 (826) +|+++|+++|++.+. +.+++|++... ....+++.++++.+|...+.+.. T Consensus 160 ~y~~ty~v~I~~~~~---------a~~~~p~gt~~-----~~~~~~~~~~ia~~L~~~l~~~~----------------- 208 (794) T protein:vir:22 160 QYGRELIVHINGKDV---------AKYKIPDGSQP-----EHVNNTDAQWLAEELAKQMRTNL----------------- 208 (794) T ss_pred ccceeEEEEeccCcc---------eEEEEcCCCcc-----ccceeechhhhhhhhhhhheecc----------------- Confidence 999999999987653 45667766432 34566778889888876543211 Q ss_pred ccccccccceEecccCCcEEEEEcCCC--eEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEec Q lcl|NC_012418. 238 TAAATVAGYLNQRGVQDGYIAFRGDGD--IVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVMA 315 (826) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~~ 315 (826) .+++.... ++++++.+.++ .....+.++++++.+.+..+.++++++||..+| .|+.+++ .+. 
T Consensus 209 ------~~~t~~~~--~~~~~i~a~~~~~~~~~t~~~g~~~t~~~~~~~~~~~~~~lp~~~~----~G~~v~i----~~~ 272 (794) T protein:vir:22 209 ------SDWTVNVG--QGFIHVTAPSGQQIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAP----NGYMVKI----VGD 272 (794) T ss_pred ------ccceEEeC--CceEEEEEcCCceEEEEeeecccCcceeEEEEeccccceeccccCC----CCeEEEE----EeC Confidence 11222221 22344444433 333345666778888999999999999888766 4565553 445 Q ss_pred CCCccceEEEEEecCCceEEEeecccccccc--cceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccCCCceEE Q lcl|NC_012418. 316 TGSTKAPVYFEWDSANRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVTRGITGM 393 (826) Q Consensus 316 ~~~~~~~~y~~~~~~~~~w~E~~~~g~~~~~--~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v 393 (826) .++..+.||++|+..+++|+||++||+..++ .||||.++ ++++++|++++.+|++|.+||+++||+|||+|++|++| T Consensus 273 ~~~~~~~Y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~lv-~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~i~~v 351 (794) T protein:vir:22 273 ASKSADQYYVRYDAERKVWTETLGWNTEDQVLWETMPHALV-RAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDV 351 (794) T ss_pred CCCCcceeEEEEeccceEEEEeeeccceeeecccceeeEee-eccCCcEEEeeccccccccCccccCCcceecCCCcceE Confidence 5667799999999999999999999976655 69999998 78899999999999999999999999999999999999 Q ss_pred EEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCcccc Q lcl|NC_012418. 
394 TTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVT 473 (826) Q Consensus 394 ~~~q~RL~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~~~~~lT 473 (826) +||||||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+++++|| T Consensus 352 ~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~ss~~~~~i~~~v~~~~~L~i~t~~~e~~l~~~~~lT 431 (794) T protein:vir:22 352 FFFRNRLGFLSGENIILSRTAKYFNFYPASIANLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLT 431 (794) T ss_pred EEEcceEEEecCCeEEEEccCCccccccccCcCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEEEEEcC-CC Q lcl|NC_012418. 474 PRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAA-SS 552 (826) Q Consensus 474 P~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~s~-~p 552 (826) |+|++++++|+|+|+++++|+.+|+++||+|++| ++++++|+++..+..+.|+++|||+|++|||++++..+++++ +| T Consensus 432 P~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~~~r~~~~~~~~d~y~~~Dlt~~~~~~~~~~~~~~~~~~~~~ 510 (794) T protein:vir:22 432 SKSVELNLTTQFDVQDRARPFGIGRNVYFASPRS-SFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTEN 510 (794) T ss_pred ceeEEEEEEEEeeccCCCCceEeCCeEEEEecCC-CeeEEEEeEeeecccCceehhhHHHHHHHhcCCceEEEEEeCCCC Confidence 9999999999999999999999999999999988 577776654444544459999999999999999998887755 45 Q ss_pred CEEEEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEEEEE--CCeEEEEEEeCCCEEEEEEEEeecCCcCCCCcc Q lcl|NC_012418. 553 GYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYFT--GDNLMVLIQKGQEIALGRMHLNSLPAREGLQYP 630 (826) Q Consensus 553 ~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~--~d~l~~vv~R~~~~~~~r~~~~~~~~~~~~~~~ 630 (826) ..++|+++++|+|++|+|||+++||+|+|||||+|+|.|+++|+. +|+||++|+|++++++|||.+++.. 
.+.+ T Consensus 511 ~~v~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~iv~r~~~~~~~r~~~~~~~----~~~~ 586 (794) T protein:vir:22 511 FCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSDMYVILRNEFNTFLARISFTKNA----IDLQ 586 (794) T ss_pred cEEEEEEcCCCEEEEEEEeecCCceeEEeeEEEEcCCCEEEEEEEecCCEEEEEEEeCCCEEEEEEEEeecc----ccCC Confidence 677899999999999999999999999999999999999988865 8999999999999999999776532 2334 Q ss_pred cccceEEEEeeccceeeeccc-------ccCccccccceEEeeeeeeEeeccEEcccEecC---CCeEEEeecCCcCCce Q lcl|NC_012418. 631 KYDYWRRIEATVEGELELTKQ-------HWDLIKDAPAVYQLQPVAGAFMERYQLGVKRET---NTKVFLDVPEAVVGSV 700 (826) Q Consensus 631 ~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~g~~~l~~~~~~~~~~ 700 (826) .+++..++|+.......-... ..+........|+++.++...+||.......+. ++..++.++++..+++ T Consensus 587 ~~~~~~~lD~~~~~~~~~g~~~~~~~~t~~~~~~~~g~~~~~g~~v~~~~dg~~~~~~~~~~~~~~~~~~~v~~~~~~~~ 666 (794) T protein:vir:22 587 GEPYRAFMDMKIRYTIPSGTYNDDTFTTSIHIPTIYGANFGRGKITVLEPDGKITVFEQPTAGWNSDPWLRLSGNLEGRM 666 (794) T ss_pred CccceeeeeeeEEEeeccceeecCCcceEEEcccccCcccccceEEEEEcCCceeeceeeeeeeeccceEEeCCCCCCcE Confidence 444555566543322111000 001111123458899999999999776655543 3456778899999999 Q ss_pred EEEEEeeeeEEEeCCeEEECCCC----ceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccccccccccccC Q lcl|NC_012418. 701 YVVGCEFWSKVEFTPPVLRDHNG----LPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAG 776 (826) Q Consensus 701 v~vG~~y~~~~~~~~~~~~~~~g----~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 776 (826) |+|||+|+++++|+||++++++| +++..+|+||+|++++|.+||+|.++|+++.++. 
.+.+++.++++..+.+| T Consensus 667 v~VGl~y~s~~~~~~~~~~~~~~~~~~~~~~~grl~l~r~~~~~~~tg~~~v~v~~~~~~~--~~~~~~~~~g~~~~~~g 744 (794) T protein:vir:22 667 VYIGFNINFVYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWVNYENSGTFDIYVENQSSNW--KYTMAGARLGSNTLRAG 744 (794) T ss_pred EEEeeeeeEEEEecceEEEecCCCccceeeecceEEEEEEEEEeccccceEEEEcCCCccc--ceeecCceecccccccC Confidence 99999999999999999998887 5566789999999999999999999999877653 46788999999999999 Q ss_pred ccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecccccC Q lcl|NC_012418. 777 EPLVDSAVVPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 777 ~~~~~tg~~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r~rrv 826 (826) .+++.+++++||+++++++.+|+|+|++|+||+|++|+|||+||+|+||| T Consensus 745 ~~~~~tg~~~vp~~~~~~~~~v~i~~d~p~P~tvlsi~~eg~y~~r~~~v 794 (794) T protein:vir:22 745 RLNLGTGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEGNYLRRSSGI 794 (794) T ss_pred cccccCceEEEEecccCceEEEEEEECCCCCEEEEEEeEEEEEeccccCC Confidence 99999999999999999999999999999999999999999999999999 No 7 >protein:vir:1543 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052111;swissprot:trembl:q9t105;genbank:gi:9634037;uniprot:Q9T105;genbank:GeneID:1262408 Probab=100.00 E-value=6.9e-222 Score=1233.07 Aligned_cols=775 Identities=22% Similarity=0.310 Sum_probs=643.9 Q ss_pred CcceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCCCceEEEE Q lcl|NC_012418. 
1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~ 80 (826) ||+|+|+||||++|||||||++|||+||++|+||+|+|+|||+||||++||+++++.+...+++|+|+|+||+.|+|+++ T Consensus 1 M~~i~~s~~n~~~GvSqq~d~~r~~~q~~~~~N~~~~~~gGl~rRpGt~~va~~~~~~~~~~~~~~~~~~~~~~e~y~l~ 80 (801) T protein:vir:15 1 MALISQSIKNLKGGISQQPDILRFAEQGSVQINGWSSESEGLQKRPPMIHLKTLGPAGYVGAQPYVHLINRDEFEQYFVV 80 (801) T ss_pred CceeeeecchhhcceecCcchHhhhhhHhhhhcceeccccCcccCCchheeeeecCCCCcccceeEEEEEeCCceEEEEE Confidence 99999999999999999999999999999999999999999999999999999999888888999999999999999876 Q ss_pred EecCCeEEEEECCCCEEEEecCcccccccCCCcc-cEEEEEecCEEEEeeCCcceeeeec--ccCCCCCCccEEEEEccc Q lcl|NC_012418. 81 AQHRGELYLFDERDGRLLMGQPLVHDYLKAADYR-QLRAATVADDLFIANLSVKPEADRT--DVKGVDPNKAGWLYIKAG 157 (826) Q Consensus 81 ~~~~g~irv~d~~~g~~~~~~~~~~~yl~~~~~~-~l~~~~vaD~~fi~n~~~~~~~~~~--~~~~~~~~~~a~~~vr~g 157 (826) +. +++|||||++|+.+.+... .+|+.+.++. +|+++|+||+|||+|++++|++... +...+++..+++++++++ T Consensus 81 ~~-~~~irv~~~~G~~~~v~~~--~~y~~~~~~~~~l~~~~~aD~~fi~nr~~~~~~~~~~~~~~~~~~~~~alv~v~~~ 157 (801) T protein:vir:15 81 FT-GEDIKVFDLDGKEYQVRGD--RSYVRTANPREDLRMITVADYTFVTNRKVVVQSNDQSVNLPGFKDQGDALINVRGG 157 (801) T ss_pred Ec-CCeEEEEccCCcEEEEecC--CccccccCchhheeEEEEcCEEEEeeCCeeeecccCccccCccCCCCceEEEeeec Confidence 64 6899999987655554432 3566666654 8999999999999999999998644 335567777999999999 Q ss_pred ccCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecc-cceEEeeeeeccceecc Q lcl|NC_012418. 158 QYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGA-PEYTLPNSTKKYPKVDP 236 (826) Q Consensus 158 ~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~-~~~~~~~~t~~~~~~~~ 236 (826) +|+++|+|++++.. .+.+++|++... ....+.+..+++.++...+... 
+......+ T Consensus 158 ~yg~t~~I~i~gs~---------~~~~t~~~gs~~-----~~~~~~s~~~ia~~l~~~~~~~~p~~~~~~~--------- 214 (801) T protein:vir:15 158 QYGRRLSIEFNGAE---------RAAVQLPDGSQP-----AHVNEVDGQAIAEKLAAQLRNNLGNPNNDQD--------- 214 (801) T ss_pred cCceeEEEEeCCcc---------eEEEEeccCccc-----chhhhcceeechHHHhhhhhhccCccceecc--------- Confidence 99999999998753 344555554322 1223334444555544433211 11100000 Q ss_pred cccccccccceEecccCCcEEEEEcCCCeE--EEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEe Q lcl|NC_012418. 237 DTAAATVAGYLNQRGVQDGYIAFRGDGDIV--VEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVM 314 (826) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~ 314 (826) ...+.. ...++++++.+.++.. ...+.++++++.+.+..+.|+++++||..+| .|+.+++ .+ T Consensus 215 ------~~~w~~--~~~~g~~~i~a~~~~~~~~~~t~dg~~~~~~~~~~~~v~~~~~lp~~~~----~G~~v~v----~~ 278 (801) T protein:vir:15 215 ------PNKWRF--NVGPGFIHILAPNNDNVWGLQTKDGYADQLINPVTHYTQSFQKLPINAP----DGYIVKI----VG 278 (801) T ss_pred ------CccEEE--EecCcEEEEeCCCCcccceeeeccccCceeeeEEeecccceeeeeeecC----CCcEEEE----Ee Confidence 001111 1223455555555443 2356677788999999999999999998876 4666553 44 Q ss_pred cCCCccceEEEEEecCCceEEEeecccccccc--cceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccCCCceE Q lcl|NC_012418. 
315 ATGSTKAPVYFEWDSANRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVTRGITG 392 (826) Q Consensus 315 ~~~~~~~~~y~~~~~~~~~w~E~~~~g~~~~~--~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~ 392 (826) .++++.++||++|+..+++|+||++||+..++ .||||.++ +.++++|+++..+|++|.+||+++||+|+|+|++|++ T Consensus 279 ~~~~~~~~y~v~~~~~~~~w~E~a~~g~~~~~~~~tmp~~lv-~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~g~~~~~ 357 (801) T protein:vir:15 279 DTSKTADQYYVRFDLNRKVWVETIGWNTRTHLYYHTMPWALV-RASDGNFDFKVLEWGARTVGDDTTNPYPSFTGQTIND 357 (801) T ss_pred cCCCccceEEEEEEcCCeeEEeecccccceeeeccccceEEE-eeccceEEEeccccccccCCccccCCcccccCCCceE Confidence 56788899999999999999999999987765 58999998 6789999999999999999999999999999999999 Q ss_pred EEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccc Q lcl|NC_012418. 393 MTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIV 472 (826) Q Consensus 393 v~~~q~RL~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~~~~~l 472 (826) |+||||||+|++|++|||||+||||||+++|+++++|||||+++++++++|+|+|+++++++|+|||+++||+|+++++| T Consensus 358 v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~i~t~~~q~~ls~~~~l 437 (801) T protein:vir:15 358 IFFFRNRLGFLSGENIILSRTSKYFNFFPASVSNYSDDDPIDVAVSHNRVSTLKYAVPFSEELLLWSDQAQFVLTASGIL 437 (801) T ss_pred EEEEcceEEEeeCCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEcCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEE-EeeccccccccchhHHHHHHHHhcCCCeEEEEEcCC Q lcl|NC_012418. 
473 TPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHE-MAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAAS 551 (826) Q Consensus 473 TP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e-~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~s~~ 551 (826) ||+|++++++|+|+|+++++|+.+|+++||+|++| ++++++| |.|+.+.| .|+++|||+|++|||++++.+|+++++ T Consensus 438 TP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~~~r~~~~~~~~d-~y~a~Dlt~~~~hl~~~~v~~~~~~~~ 515 (801) T protein:vir:15 438 SSRSVELNLTTQFDVQDRARPHGVGRNVYFASPRA-SFTSINRYYAVQDVSS-VKNAEDMTAHVPNYIPNGVFSISGTTA 515 (801) T ss_pred cceeEEEEEEEeeeccCCCCceEeCCeEEEEecCC-CeeEEEEEEeeccccc-ceehhhHHHHHHHhcCCceEEEEEeCC Confidence 99999999999999999999999999999999988 5777766 45554455 599999999999999999999999887 Q ss_pred CCE-EEEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEEEE--ECCeEEEEEEeCCCEEEEEEEEeecCCcCCCC Q lcl|NC_012418. 552 SGY-LVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYF--TGDNLMVLIQKGQEIALGRMHLNSLPAREGLQ 628 (826) Q Consensus 552 p~~-~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~--~~d~l~~vv~R~~~~~~~r~~~~~~~~~~~~~ 628 (826) |+. ++|+++++|+|++|||||+++||+|+|||||+|+|.|+++|+ .+|+||++|+|+++.+++||.+.... .+ T Consensus 516 ~~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~~v~r~~~~~~~r~~~~~~~----~~ 591 (801) T protein:vir:15 516 ENFAAILTSGAPNRVYIYKFLYIDEEIRQQSWSHWDFGDNVTVFAAQVINSTMTVLMGNEHAVWMGRLHFTKNS----ID 591 (801) T ss_pred CCcEEEEEEcCCCEEEEEEEecCCCceEEEeeEEEEcCCCEEEEEEEecCCEEEEEEEecCcEEEEEEEEcccc----cc Confidence 665 579999999999999999999999999999999999988876 48999999999999999999765432 23 Q ss_pred cccccceEEEEeeccceeeeccccc-------CccccccceEEeeeeeeEeeccEEcccEecCCC---eEEEeecCCcCC Q lcl|NC_012418. 629 YPKYDYWRRIEATVEGELELTKQHW-------DLIKDAPAVYQLQPVAGAFMERYQLGVKRETNT---KVFLDVPEAVVG 698 (826) Q Consensus 629 ~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g---~~~l~~~~~~~~ 698 (826) ...+.+..++||.......-.+... 
+........|+++..+.+++||..+....+.+| ..++.++++..+ T Consensus 592 ~~~~~~~~~lD~~~~~~~~~~t~~~~~~~~~~~~~~~~gl~~l~g~~v~v~~dG~~~~~~~~~~g~~~~~~~~i~~~~~~ 671 (801) T protein:vir:15 592 IPGEPYRLYIDAKRKYTIPAGTYNDDTYQTSISLATIYGMNFTKGRVSVVFPDGKIIEVDQPINGWSSDPVLRLDGNQEG 671 (801) T ss_pred CCCcceeeeeeeeeeEeeccceeccCceecccccccccccccccceEEEEEeCCceeeeeeecCcccCcceEEEcCCCCC Confidence 3334444566665433222111110 111122345899999999999998887777776 346677888999 Q ss_pred ceEEEEEeeeeEEEeCCeEEEC----CCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccccccccccc Q lcl|NC_012418. 699 SVYVVGCEFWSKVEFTPPVLRD----HNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLN 774 (826) Q Consensus 699 ~~v~vG~~y~~~~~~~~~~~~~----~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~ 774 (826) ++|+|||+|+++++|+||+++. ++.+++..+|+||+|++|++.+||.|.+.|+++.++ ..+..++.++++.++. T Consensus 672 ~~v~vGl~y~~~~~~~~~~~~~~~~~~~~~~~~~~rl~l~r~~~~~~~tg~~~~~v~~~~~~--~~~~~~~~~~~~~~~~ 749 (801) T protein:vir:15 672 QVVYIGFNIPFTYTFSKFLIKKTAEDGSTATEDIGRLQLRRAWVNYEDSGAFTIRVNNLSRE--FIYTMAGARLGSDNLR 749 (801) T ss_pred cEEEEeeeeeEEEEecceEEeccCCCCCceeeeeccEEEEEEEEEeccCcceEEEECCcccc--cceeecCccccccccc Confidence 9999999999999999999994 444677889999999999999999999999998775 3577899999999999 Q ss_pred cCccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecccccC Q lcl|NC_012418. 
775 AGEPLVDSAVVPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 775 ~~~~~~~tg~~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r~rrv 826 (826) ++.|++++|+++||+.+|+++.+|+|+|++|+||+|++|+|||+||+|+||| T Consensus 750 ~~~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~e~~y~~r~~~~ 801 (801) T protein:vir:15 750 VGRSNIGTGQYRFPVVGNAQTNLVTIESDASTPLNIIGCGWEGNYLRRSSGI 801 (801) T ss_pred ccccccccceEEEEEeecCceEEEEEEECCCCcEEEEEEEEEEEEeccccCC Confidence 9999999999999999999999999999999999999999999999999999 No 8 >protein:vir:99677 Length: 794 # NCBI annotation: Tail tubular protein B # Family: family:all:825 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249591;genbank:gi:68299742;genbank:GeneID:3799992 Probab=100.00 E-value=1.3e-221 Score=1231.50 Aligned_cols=767 Identities=21% Similarity=0.282 Sum_probs=648.9 Q ss_pred CcceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCCCceEEEE Q lcl|NC_012418. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~ 80 (826) ||+|+|+||||++|||||+|++|||+||++|+||+|+|+|||+||||++||++++++..+..+++.|.|+|++.|+|+++ T Consensus 1 M~~i~~s~~n~~~GvS~q~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (794) T protein:vir:99 1 MALISQSIKNLKGGISQQPDILRYSDQGSKQINGFSSEVEGLQKRPPSVHIKRLTDQFGLGQKPYCHIINRDEVERYAVF 80 (794) T ss_pred CceeeeecchhhcceecCCchHHhhhhHhhhhcceeeeccCcccCCccceeeeecCCCCCccccEEEEEEeCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999998877777889999999988888776 Q ss_pred EecCCeEEEEECCCCEEEEe-cCcccccccCCCc-ccEEEEEecCEEEEeeCCcceeeee--cccCCCCCCccEEEEEcc Q lcl|NC_012418. 81 AQHRGELYLFDERDGRLLMG-QPLVHDYLKAADY-RQLRAATVADDLFIANLSVKPEADR--TDVKGVDPNKAGWLYIKA 156 (826) Q Consensus 81 ~~~~g~irv~d~~~g~~~~~-~~~~~~yl~~~~~-~~l~~~~vaD~~fi~n~~~~~~~~~--~~~~~~~~~~~a~~~vr~ 156 (826) + ++++||||++.+|....+ .+...+|+.++++ .+|+++|+||+|||+|++++|++.. 
+....++++.++++++++ T Consensus 81 f-~~~~irv~~~~~g~~~~v~~~~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~~~v~~ 159 (794) T protein:vir:99 81 F-TGSNIRVFDLFTGDEKTVNAPNGLSYVSSSNPRKDLRMVTVADYTFILNRNVATAQGTTNTPSGLAPFGHFGLVVIRG 159 (794) T ss_pred E-cCCeEEEEECCCCeEEEeeccccccccccCCccceeeEEEEccEEEEEcCCeeeeEeeeeccccCcCCCceEEEEecc Confidence 6 568999999988866554 4556678776654 4899999999999999999999864 345667888899999999 Q ss_pred cccCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeeeeccceecc Q lcl|NC_012418. 157 GQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDP 236 (826) Q Consensus 157 g~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~~~~~~ 236 (826) ++|+++|++++++.. ++.+++|++.... .....+.++++.++...+.. .+| T Consensus 160 g~y~~~y~v~i~gs~---------ta~~~tp~~~~~~-----~~~~~s~~~ia~~l~~~l~~-~g~-------------- 210 (794) T protein:vir:99 160 GQYGRTYRIKVNGSV---------EASFETPLGDQVA-----HAKQIDIAYIIDQLAAGLIN-KGW-------------- 210 (794) T ss_pred CCCCceEEEEecCCc---------ccceeeccCcccc-----cccccchhhhhhhhHhhhhc-ccc-------------- Confidence 999999999998753 3566677654432 33345677788777665432 111 Q ss_pred cccccccccceEecccCCcEEEEEcCCCe--EEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEe Q lcl|NC_012418. 237 DTAAATVAGYLNQRGVQDGYIAFRGDGDI--VVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVM 314 (826) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~ 314 (826) ... ..++++++...++. 
....+.++.+++++....+.|+++++||+.+| +|+.+++ .+ T Consensus 211 ----------~v~--~~~g~~~i~~~~~~~v~t~s~~~g~~~t~~~~~~~~v~~~~~Lp~~~~----~G~~v~v----~~ 270 (794) T protein:vir:99 211 ----------AVT--KGSGYFYFSKSGSVIINSLEVEDGYNGQLAWGIINDVQKTTQLPVYAP----NNYIIRV----SG 270 (794) T ss_pred ----------eEE--eCCeEEEEEecCCceeEEEEeecCCCCceeeEEeeeccceeecccCCC----CCeEEEE----ec Confidence 111 12334444444443 33345566678889999999999999988776 4666653 34 Q ss_pred cCCCccceEEEEEecCCceEEEeeccccccc--ccceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccCCCceE Q lcl|NC_012418. 315 ATGSTKAPVYFEWDSANRRWAERAAYGTDWV--LKKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVTRGITG 392 (826) Q Consensus 315 ~~~~~~~~~y~~~~~~~~~w~E~~~~g~~~~--~~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~ 392 (826) ..+..+++||++|+..+++|+||+++++..+ ..||||.++ ++++++|++++.+|++|.+||+++||+|||+|++|++ T Consensus 271 ~~~~~~~~y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~~v-~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~is~ 349 (794) T protein:vir:99 271 DPTLNQDDYYVRFDASRNVWTECPAPNIKADYNKATMPHVLI-READGTFTFKQADWTHRAAGDDETNPYPSFIGNSIND 349 (794) T ss_pred cCCCCCCceEEEEEcCCceEEeeccceeecceeccceEEEEe-ccCCCceeEeeccccccccCCcccCCCccccCcceeE Confidence 4567789999999999999999999997655 469999997 7889999999999999999999999999999999999 Q ss_pred EEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccc Q lcl|NC_012418. 
393 MTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIV 472 (826) Q Consensus 393 v~~~q~RL~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~~~~~l 472 (826) |+||||||+|+++++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+++++| T Consensus 350 v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~~~l 429 (794) T protein:vir:99 350 IFFFRNRLGFLSGENVILSGSGNYFNFFPESVAVLTDTDPIDVAVSTNRISILKYAVPFSEELILWSDQAQFVLSSDGGL 429 (794) T ss_pred EEEEeeeEEEecCCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEE-eeccccccccchhHHHHHHHHhcCCCe-EEEEEcC Q lcl|NC_012418. 473 TPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEM-APSPSTDSHYVAEDVTSHIPSYMPGPA-EYIQAAA 550 (826) Q Consensus 473 TP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~-~~~~~~~~~~~~~dls~~~~~~~~~~v-~~~~~s~ 550 (826) ||+|++++++|+|+|+++++|+.+|+++||+|++| ++++++|+ .|+.++|+ |+++|||+|++|||+|++ ..+++++ T Consensus 430 TP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~v~r~~~~~~~~d~-y~a~Dlt~~~~hl~~~~~~~~~a~~~ 507 (794) T protein:vir:99 430 TPTTIRLDLTTEFEVTEQARPYGIGRGVYFVSPRA-KFSSVRRFYAVQDVTQV-KNAEDISAHVPYYVENGVFKMSGSST 507 (794) T ss_pred cceeEEEEEEEEeeccCCCCceEeCCeEEEEecCC-CeeEEEEeeeeccccCc-eehhhHHHHHHHhcCCCeEEEEEeCC Confidence 99999999999999999999999999999999988 57777655 68766665 999999999999999986 4568999 Q ss_pred CCCEEEEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEEEEE--CCeEEEEEEeCCCEEEEEEEEeecCCcCCCC Q lcl|NC_012418. 551 SSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYFT--GDNLMVLIQKGQEIALGRMHLNSLPAREGLQ 628 (826) Q Consensus 551 ~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~--~d~l~~vv~R~~~~~~~r~~~~~~~~~~~~~ 628 (826) +|..++|++++||+|++||||++++||+|+|||||+|+|.++++|++ +|+||++|+|++++++|||++.+.. 
.+ T Consensus 508 ~~~~~v~~~~~~g~l~~~~y~~~~~eq~v~aW~~~~~~g~~~~~~~~~~~d~l~~~v~r~~~~~ler~~~~~~~----~~ 583 (794) T protein:vir:99 508 ENFLTILTEGNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVLCCDMIGAVMHLIIDSPSGVLMEKIEFTQNT----KD 583 (794) T ss_pred CCcEEEEEEcCCCEEEEEEEeecCCceEEEeEEEEEcCCCeEEEEEEEcCCEEEEEEEeCCCEEEEEEEeeeCC----CC Confidence 99999999999999999999999999999999999999998877764 9999999999999999999765532 34 Q ss_pred cccccceEEEEeeccceeeecccccCc-------cccccceEEeeeeeeEeeccEEcccE----ecCCCeEEEeecCCcC Q lcl|NC_012418. 629 YPKYDYWRRIEATVEGELELTKQHWDL-------IKDAPAVYQLQPVAGAFMERYQLGVK----RETNTKVFLDVPEAVV 697 (826) Q Consensus 629 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~g~~~l~~~~~~~ 697 (826) .+.+++..++||...........+.+. .......|++|.++.+.+||..+... .......++.+|++.. T Consensus 584 ~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~g~~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~~ 663 (794) T protein:vir:99 584 YPDEPYRLYVDRKIEYTFPEGSYNDDDFKTRVKLKDIYGSTPANGQYVFISLGGVTFTFDPPAGGWQANDGLIEFDGDLR 663 (794) T ss_pred CCCcccceeeeeeeeeeecccccccCcceeEEeccccccccccCCceEEEEeCCceeeeecccceEecCccEEEecCCCC Confidence 455566667777666554333322211 11123348899999999998765432 2333445677888899 Q ss_pred CceEEEEEeeeeEEEeCCeEEECCCCc----eeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeecccccccccc Q lcl|NC_012418. 
698 GSVYVVGCEFWSKVEFTPPVLRDHNGL----PMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQL 773 (826) Q Consensus 698 ~~~v~vG~~y~~~~~~~~~~~~~~~g~----~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~ 773 (826) +++|+|||+|+++++|+||++++++++ +...||+||+|++|+|.+||+|++.++++.++ ..+.+.+.++++.++ T Consensus 664 ~~~v~vGl~y~s~~~~~~~~~~~~~~~g~~~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~--~~~~~~~~~~~~~~~ 741 (794) T protein:vir:99 664 GTKFFVGEAYTFLYEFSKFLIKTTDTADGVATEDIGRLQLRRAWVNYDKSGNFRVEVNNQGRT--FTYNMTGNRLSTNEL 741 (794) T ss_pred CcEEEEeeeeeEEEeecceEEeecCCCCceeeeccceEEEEEEEEEeecccceEEEECCCccc--eeeeccccccccccc Confidence 999999999999999999999976643 33458999999999999999999999998875 346678999999999 Q ss_pred ccCccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecccccC Q lcl|NC_012418. 774 NAGEPLVDSAVVPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 774 ~~~~~~~~tg~~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r~rrv 826 (826) .++.++++||+++||+.+|+++.+|+|+|++|+||+|++|+|||+||+|+||| T Consensus 742 ~~g~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~e~~y~~r~~~v 794 (794) T protein:vir:99 742 ILGDESLDTGQFRYAVSGNATQVTVSLISDTPNPLSIIGGGWEGYYVRRSSGI 794 (794) T ss_pred cccccccccceEEEEecccccceEEEEEECCCCCEEEEEEEEEEEEeccccCC Confidence 99999999999999999999999999999999999999999999999999999 No 9 >protein:vir:94583 Length: 792 # NCBI annotation: Tubular tail protein B # Family: family:all:825 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919014;genbank:gi:119637778;genbank:GeneID:5179343 Probab=100.00 E-value=1.5e-221 Score=1231.16 Aligned_cols=769 Identities=23% Similarity=0.312 Sum_probs=646.8 Q ss_pred CcceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCCCceEEEE Q lcl|NC_012418. 
1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~ 80 (826) ||+|+|+||||++|||||||++|||+||++|+||+|+|+|||+||||++||+++++.......++.|.|+||+.|+|+++ T Consensus 1 M~~i~~s~~n~~~GiSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~~~~~~~q~y~l~ 80 (792) T protein:vir:94 1 MALISQSVKNLKGGISQQPNILRFPEQGSEQINGWSSETEGLQKRPPFVFTKTIGDQNALGAKPLVHLINRDSAEQYYVV 80 (792) T ss_pred CcceeeecchhhcceecCcchHHhhhhhhhhhcceeeeccccccCChhHHHHhhhcCCCCCcccEEEEEEeCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999998777777789999999999999887 Q ss_pred EecCCeEEEEECCCCEEEEecCcccccccCCCcc-cEEEEEecCEEEEeeCCcceeeeecccCCCCCCccEEEEEccccc Q lcl|NC_012418. 81 AQHRGELYLFDERDGRLLMGQPLVHDYLKAADYR-QLRAATVADDLFIANLSVKPEADRTDVKGVDPNKAGWLYIKAGQY 159 (826) Q Consensus 81 ~~~~g~irv~d~~~g~~~~~~~~~~~yl~~~~~~-~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~~~vr~g~Y 159 (826) +.+ ++|||||++|+++++... .+|+.+.+++ +|+++|+||+|||+|++++|++..+....+++.+++++++++|+| T Consensus 81 f~~-~~~rv~~~~g~~~~~~~~--~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~~~~~~~~~~~~~~~~~~~v~i~~g~y 157 (792) T protein:vir:94 81 FTG-QGVRVFDLNGKEYDVKGD--LSYVKVENPRDDLRMVTVADYTFIVNRNMVVRPDTTPLYTLKENGDCLINIRGGMY 157 (792) T ss_pred EcC-CeEEEEecCCceEEeccc--CceeeecCCcceeEEEEEcCEEEEEeCCccceeEecCcCCCCCCceEEEEccCCCc Confidence 765 569999998777766543 5777766654 799999999999999999999987777777888899999999999 Q ss_pred CeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeeeeccceeccccc Q lcl|NC_012418. 160 SKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDTA 239 (826) Q Consensus 160 ~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~~~~~~~~~ 239 (826) +++|+++|++.. +.+++|.+... ....+.++.+++.++......... 
T Consensus 158 ~~~y~i~i~~~~----------~~~~~~~~t~~-----~~~~~~~~~~i~~~l~~~~~~~~~------------------ 204 (792) T protein:vir:94 158 GRTLAFTINNTK----------IAYEIAHGDAP-----EHSKQTDAQWLVKKLAGLARLNVA------------------ 204 (792) T ss_pred ceeEEEEecCce----------eeeeeecCccc-----ceecccchhhhhhhhhhhcccccc------------------ Confidence 999999998643 23444443322 234455677888777664332111 Q ss_pred ccccccceEecccCCcEEEEEcCCCeEE--EEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEecCC Q lcl|NC_012418. 240 AATVAGYLNQRGVQDGYIAFRGDGDIVV--EVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVMATG 317 (826) Q Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~~~~ 317 (826) ..+++.. ..++|+++.+..+... ..+.++.+++++.+..+.|+++++||+.+| +|+.+++ .+..+ T Consensus 205 ---~~~~~~~--~~~~~~~i~~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~lp~~~~----~G~~v~i----~~~~~ 271 (792) T protein:vir:94 205 ---FKGWTFT--EGPGYIHVIAPSNSQINSLSTEDGYADQLMNAVMHTSQSFSRLPVEAP----NGYTVKI----VGDTS 271 (792) T ss_pred ---ccccEEE--ECCeEEEEEecCCceeeeeecccCcCcceeeeeeecccccccccccCC----CCcEEEE----EccCC Confidence 0111111 2234555555554433 345667778889999999999999988766 4555554 34566 Q ss_pred CccceEEEEEecCCceEEEeecccccccc--cceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccCCCceEEEE Q lcl|NC_012418. 
318 STKAPVYFEWDSANRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVTRGITGMTT 395 (826) Q Consensus 318 ~~~~~~y~~~~~~~~~w~E~~~~g~~~~~--~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~~ 395 (826) ++.++||++|+..+++|+||+++|+..++ .+|||.++ ++++++|++++.+|++|.+||+++||+|+|+|++|++|+| T Consensus 272 ~~~d~y~v~~~~~~~~w~E~~~~~~~~~~~~~tmp~~lv-~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~g~~i~~v~f 350 (792) T protein:vir:94 272 KTSDMFYVQYDNMKKVWKEVAGWGVQKGLNGGTMPHALV-RQADGSFQMQVLPWTQRTCGDMDTNPTPSIVDQKINDVFF 350 (792) T ss_pred CCccceEEEEEcCCceEEEecccceeeeecccccCeeEE-EcCCCcEEEEeccccccccCccccCccceeccCCcceEEE Confidence 77899999999999999999999977664 58999998 7889999999999999999999999999999999999999 Q ss_pred EcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCcccccc Q lcl|NC_012418. 396 FQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPR 475 (826) Q Consensus 396 ~q~RL~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~~~~~lTP~ 475 (826) |||||+|++|++|||||+||||||+++|+++++|||||+++++++++|+|+|+++++++|+|||+++||+|+++++|||+ T Consensus 351 ~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~~~q~~l~~~~~lTP~ 430 (792) T protein:vir:94 351 FRNRLGFLAGENIVMSRTSKYFSLFPASVANLSDDDPIDVAVSHNRISILKYAVPFSEELLLWSDQAQFVLSAQGILSPK 430 (792) T ss_pred EcceEEEecCCeEEEEccCCcccCccccccCCCCCccEEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeCCCcccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEEE-EEcCCCCE Q lcl|NC_012418. 476 TAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYI-QAAASSGY 554 (826) Q Consensus 476 ~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~-~~s~~p~~ 554 (826) |++++++|+|+|+++++|+.+|++++|++++| ++++++||++..+..+.|+++|||+|++|||+|++..+ ++|++|.. 
T Consensus 431 ~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g-~~~~v~r~~~~~~~~d~y~a~DlT~~~~hl~~~~v~~~~a~~~~~~~ 509 (792) T protein:vir:94 431 SVELNLTTEFDVSDRARPFGVGRGVYFASPRA-SYTSLNRYYAVQDVSSVKSAEDMSAHVPNYIPNGVFSIRGSSTENFI 509 (792) T ss_pred eEEEEEEEEeeccCCCCceEeCCeEEEeecCC-CeeEEEeeeeeccccCceehhhHHHHHHHhcCCceEEEEEeCCCCcE Confidence 99999999999999999999999999999988 57888776544454445999999999999999987655 67788889 Q ss_pred EEEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEEEE--ECCeEEEEEEeCCCEEEEEEEEeecCCcCCCCcccc Q lcl|NC_012418. 555 LVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYF--TGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKY 632 (826) Q Consensus 555 ~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~--~~d~l~~vv~R~~~~~~~r~~~~~~~~~~~~~~~~~ 632 (826) ++|+++++|+|++|||||+++||+|+|||||+|+|.|+++|+ ++|+||++|+|++++++|||.+.+ +..+++.+ T Consensus 510 vv~~~~~~g~l~~~ty~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~D~l~~~v~r~~~~~~~r~~~~~----~~~d~~~~ 585 (792) T protein:vir:94 510 SVLSSNAPSRIFLYKFLYLNEEIAQQSWSHWELGSNVTVLACDSIGSTMYLVLRNQSHTWMCRAHFTK----NSIDFPDE 585 (792) T ss_pred EEEEEcCCCeEEEEEEeecCCceEEEeEEEEEcCCcEEEEEEeecCCEEEEEEEeCCCEEEEEEEEee----cccccCCC Confidence 999999999999999999999999999999999998887765 589999999999999999997764 23344555 Q ss_pred cceEEEEeeccceeeecccccC-------ccccccceEEeeeeeeEeeccEEcc----cEecCCCeEEEeecCCcCCceE Q lcl|NC_012418. 
633 DYWRRIEATVEGELELTKQHWD-------LIKDAPAVYQLQPVAGAFMERYQLG----VKRETNTKVFLDVPEAVVGSVY 701 (826) Q Consensus 633 ~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~g~~~l~~~~~~~~~~v 701 (826) .+..++|+.........+...+ ........|++|..+.+.+||...- ......+..++.+|++..+++| T Consensus 586 ~~~~~lD~~~~~~~~~~~~~~~~~~T~~~~~~~~gl~~l~G~~v~v~~dG~~~~~~~~~~~~~~~~~~i~~~g~~~a~~v 665 (792) T protein:vir:94 586 PYRLYIDNKVKYVIPEGSYNDDTYATTVKPVDVYGMKYWTGKFYIVASDGLVSWFEPPRGGWPNGVPMLTMSGNREGETI 665 (792) T ss_pred cceeeeeeeeeEEecCcceecCceeeeeccccccCcccccCcEEEEEecCceeEeecccceecCCccEEEecCCccCCeE Confidence 6666777765543322211111 1122334588999999999986532 2233344567788899999999 Q ss_pred EEEEeeeeEEEeCCeEEECCCC----ceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccccccccccccCc Q lcl|NC_012418. 702 VVGCEFWSKVEFTPPVLRDHNG----LPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGE 777 (826) Q Consensus 702 ~vG~~y~~~~~~~~~~~~~~~g----~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 777 (826) +|||+|+++++|+||++++++| ++...||+||+|++++|.+||.|.++++++.++ ..+.+.++++++..+.+|. T Consensus 666 ~VGl~y~~~~~~~~~~~~~~~g~~~~~~~~~gr~rl~r~~~~~~~tg~~~v~~~~~~~~--~~~~~~~~~~~~~~~~~g~ 743 (792) T protein:vir:94 666 YVGLAISFRYVFSKFLIKKTADDGSIATEDIGRLQLRRAWVNYEDSGAFTVEVENTSRL--FSYDMAGARLGSNVLRAGG 743 (792) T ss_pred EEeeeeeEEEEeccceeeccCCCcCccccceeeEEEEEEEEeeeccceeEEEEcCCCcc--eeeeeccceeccccccccc Confidence 9999999999999999997765 455678999999999999999999999988765 4567789999999999999 Q ss_pred cccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecccccC Q lcl|NC_012418. 
778 PLVDSAVVPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 778 ~~~~tg~~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r~rrv 826 (826) |++.+++++||+.+|+++.+|+|+|++|+||+|+||+|||+||+|+||| T Consensus 744 ~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlai~~eg~y~~r~~~v 792 (792) T protein:vir:94 744 LNVGTGQFRFPVTGNAQLNEVRIISEHTTPLNVIGCGWEGNYLRRSSGI 792 (792) T ss_pred cccccceEEEEeeccCceEEEEEEECCCCCEEEEEEEEEEEEeccccCC Confidence 9999999999999999999999999999999999999999999999999 No 10 >protein:vir:94713 Length: 785 # NCBI annotation: tail tube # Family: family:all:825 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338122;genbank:gi:77118200;genbank:GeneID:3707736 Probab=100.00 E-value=2.6e-221 Score=1229.88 Aligned_cols=765 Identities=20% Similarity=0.295 Sum_probs=642.7 Q ss_pred CcceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCCCceEEEE Q lcl|NC_012418. 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~ 80 (826) ||+|+|+||||++|||||||++|||+||++|+||+|+|+|||+||||++||++++..+. ..++.|.|+|++.|+|+++ T Consensus 1 M~~~~~s~~n~~~GvSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~~v~~l~~~~~--~~~~~~~f~~~~~~~y~l~ 78 (785) T protein:vir:94 1 MPLITQSIKNLKGGISQQPDILRFSDQGEAQVNCWSSESDGLQKRPPTVFKRRLNIDVG--SNPKFHLINRDEQEQYYIV 78 (785) T ss_pred CcceeeecchhhcceecCCchHHhhhHHhhhhcceeeeccCcccCChhHhhhcccCCCC--cCcEEEEEEeCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999876543 4567888999999988887 Q ss_pred EecCCeEEEEECCCCEEEEecCcccccccCCCcc-cEEEEEecCEEEEeeCCcceeeeec-ccCCCCCCccEEEEEcccc Q lcl|NC_012418. 81 AQHRGELYLFDERDGRLLMGQPLVHDYLKAADYR-QLRAATVADDLFIANLSVKPEADRT-DVKGVDPNKAGWLYIKAGQ 158 (826) Q Consensus 81 ~~~~g~irv~d~~~g~~~~~~~~~~~yl~~~~~~-~l~~~~vaD~~fi~n~~~~~~~~~~-~~~~~~~~~~a~~~vr~g~ 158 (826) + ++|+|||||++|..+.+.+ ..+|+.+.++. +|+++|+||+|||||++++|++... 
....+++..++++++++|+ T Consensus 79 ~-~~~~irv~~~~G~~~~v~~--~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~~~~~~~~~~~~~~~~~~~~~~i~~g~ 155 (785) T protein:vir:94 79 F-NGSNIQIVDLSGNQYSVSG--SVDYVKSSNPRDDIRVVTVADYTFVVNRKVVVKGGSEKSHSGYNRKARALINLRGGQ 155 (785) T ss_pred E-cCCeEEEEecCCcEEEEec--CCCceeecCchhheeeEeeCCEEEEEcCCcceeeeeccCCcCCCCCCceEEEecccc Confidence 7 4689999998655555543 34677666554 7999999999999999999998654 3455778889999999999 Q ss_pred cCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeeeeccceecccc Q lcl|NC_012418. 159 YSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDT 238 (826) Q Consensus 159 Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~~~~~~~~ 238 (826) |+++|++.|++.. ++.+++|++..+. ......+.+++..++.+++..... T Consensus 156 y~~~y~i~i~g~~---------~at~~t~~~s~a~----~s~~~~s~~~i~~~l~~~l~a~~t----------------- 205 (785) T protein:vir:94 156 YGRTLKVGINGGV---------KVSHKLPAGNDAE----NDPPKVDAQAIGAALRDLLVTAYP----------------- 205 (785) T ss_pred cceeEEEeeCCcc---------eeEEEEccCcccc----ccccccchHHHHHHHHHHhhcccc----------------- Confidence 9999999998643 3556666655432 223344566777777665443211 Q ss_pred cccccccceEecccCCcEEEEEcCCCeEE--EEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEecC Q lcl|NC_012418. 239 AAATVAGYLNQRGVQDGYIAFRGDGDIVV--EVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVMAT 316 (826) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~~~ 316 (826) .++.. ..++++++.+..+... 
.++.+++.++.+.+..+.++++++||..++ .|+.++ +.+.+ T Consensus 206 ------~~t~~--~~g~~i~i~a~s~t~~~~~s~~~~~~~t~~~~~~~~~~~~~~Lp~~~~----~G~~v~----v~~~~ 269 (785) T protein:vir:94 206 ------TFTFD--LGSGFLLITAPSGTDINSVETEDGYANQLISPVLDTVQTISKLPLAAP----NGYIIK----IQGET 269 (785) T ss_pred ------ceeEE--ecCcEEEEEecCCccccceeeecccCCeEEEEEEeeccceeccccccC----CCCEEE----EEccC Confidence 12222 1234566655554432 355666778888999999999999887766 455554 34556 Q ss_pred CCccceEEEEEecCCceEEEeecccccccc--cceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccCCCceEEE Q lcl|NC_012418. 317 GSTKAPVYFEWDSANRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVTRGITGMT 394 (826) Q Consensus 317 ~~~~~~~y~~~~~~~~~w~E~~~~g~~~~~--~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~ 394 (826) ++..++||++|+..+++|+||++||+..++ .+|||.++ ++++++|++++.+|++|.+||+++||+|||+|++|++|+ T Consensus 270 ~~~~~~y~v~~~~~~g~w~e~~~~g~~~~~~~~tmp~~l~-~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~ 348 (785) T protein:vir:94 270 NSSADEYYVMYDSNTKTWKETVEPGVVTGFDNTTMPHALV-RQSDGSFEFKALDWSKRGAGNDDTNPMPSFVDATINDVF 348 (785) T ss_pred CCCccceEEEEEcCCceEEEecccceeeeeeccccceEEE-eccCCceEEeccccccccCCCcccCCcceecccccceEE Confidence 778899999999999999999999987665 58999997 678999999999999999999999999999999999999 Q ss_pred EEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccc Q lcl|NC_012418. 
395 TFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTP 474 (826) Q Consensus 395 ~~q~RL~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~~~~~lTP 474 (826) ||||||+|++|++|||||+||||||+++|+++++|||||+++++++++|+|+|+++++++|+|||+++||+|+++++||| T Consensus 349 f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~T~~~e~~l~~~~~lTP 428 (785) T protein:vir:94 349 FYRNRLGFLSGENVIMSRSASYFAFFPKSVATLSDDDPIDVAVSHPRISILKYAVPFSEQLLLWSDEVQFVMTSSGVLTS 428 (785) T ss_pred EEeceEEEecCCeEEEEccCCcccCccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEcCCCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCe-EEEEEcCCCC Q lcl|NC_012418. 475 RTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPA-EYIQAAASSG 553 (826) Q Consensus 475 ~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v-~~~~~s~~p~ 553 (826) +|++++++|+|+|+++++|+.+|++++|++++| ++++++||.+..+..+.|+++|||+|++|||+|++ ..+++|++|+ T Consensus 429 ~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~v~r~~~~~~~~d~y~~~dlt~~~~~~~~g~~~~~~a~~~~~~ 507 (785) T protein:vir:94 429 KSIQLDVGSEFALGDNARPFAVGRSVFFSAPRG-SFTSIKRYFAVADVSDVKDADDTTGHVLSYIPNGVFDIQGTGTENY 507 (785) T ss_pred eeEEEEEEEeeeccCCCCceEeCCeEEEEecCC-CeeEEEeeeeecccccceehhhHHHHHHHhcCCCcEEEEEecCCCc Confidence 999999999999999999999999999999988 57888777655555556999999999999999975 5678999999 Q ss_pred EEEEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCC--cEEEEEEECCeEEEEEEeCCCEEEEEEEEeecCCcCCCCccc Q lcl|NC_012418. 554 YLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRH--QIIGTYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPK 631 (826) Q Consensus 554 ~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g--~v~~~~~~~d~l~~vv~R~~~~~~~r~~~~~~~~~~~~~~~~ 631 (826) .++|++++||+|++|||||+++||+|+|||||+|+| +++++|+++|++|++++|.++.+++++.... ...+... 
T Consensus 508 ~~~~~~~~~g~l~~~~y~~~~~e~~v~aW~r~~~~~~~~~~~~~~~~d~~~~vv~r~~g~~~~~ie~~~----~~~d~~~ 583 (785) T protein:vir:94 508 ICVNSTGAYNRIYIYKFLFKDSVQLQASWSHWEFPKDDKILASASIGSTMFIVRQHQGGVDIEHLKFIK----EATDFPS 583 (785) T ss_pred EEEEEEcCCCEEEEEEEeecCCceEEEEEEEEEeCCCeEEEEEEEeCCEEEEEEEcCCCEEEEEEEeec----ccCCCCC Confidence 999999999999999999999999999999999976 6889999999999999999877777774322 2233445 Q ss_pred ccceEEEEeeccceeeecccccCccc--------cccceEEeeeeeeEeeccEEcccEecCCCeEEEeecCCcCCceEEE Q lcl|NC_012418. 632 YDYWRRIEATVEGELELTKQHWDLIK--------DAPAVYQLQPVAGAFMERYQLGVKRETNTKVFLDVPEAVVGSVYVV 703 (826) Q Consensus 632 ~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~v~v 703 (826) +++..++||.............+.+. -....|+++.++.+.+||..++...+..+..+|.+|++..+.+|+| T Consensus 584 ~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~~g~~~leg~~v~v~adG~~~~~~~v~~~~~tl~~~g~~~~~~v~v 663 (785) T protein:vir:94 584 EPYRLHVDSKVSMVIPIGSFNADTYKTTVDIGAAYGGNAPSPGRYYLIDSQGAYLDLGELTSISTVITLNGDWSGRTVFI 663 (785) T ss_pred cceeEEeeeeeEEEecCcceeccccccccccccccccCCccCCeEEEEeeCCcCccCceEcCCCcEEEecCCCCCceEEE Confidence 55666777765443322222111111 0134588999999999999998888888889999999999999999 Q ss_pred EEeeeeEEEeCCeEEECCCCc---eeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccccccccccccCcccc Q lcl|NC_012418. 704 GCEFWSKVEFTPPVLRDHNGL---PMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLV 780 (826) Q Consensus 704 G~~y~~~~~~~~~~~~~~~g~---~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 780 (826) ||+|+++++|+||++++++|+ ++..+|+||+|++++|.+||+|+++|+++.++ ..+.+.++++++. 
.++.||+ T Consensus 664 Gl~y~~~~~~~~~~~~~~~~~~~~~~~~gr~~l~r~~~~~~~sg~~~v~v~~~~~~--~~~~~~~~~~g~~--~~~~~~~ 739 (785) T protein:vir:94 664 GRSYLMSYKFSRFLIKIEDDSGTQSEDTGRLQLRRAWVNYRDTGALRLIVRNGERE--FVNTFNGYTLGQQ--TIGTTNI 739 (785) T ss_pred eeeeeEEEeecceeEEecCCCcccccccccEEEEEEEEEeecccceEEEecCCCcc--ceeeecCcccCcc--ccccccc Confidence 999999999999999998874 44568999999999999999999999887764 3466788898854 5677899 Q ss_pred ccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecccccC Q lcl|NC_012418. 781 DSAVVPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 781 ~tg~~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r~rrv 826 (826) ++|+++||+.+|+++.+|+|+|++|+||+|++|+|||+||+|+||| T Consensus 740 ~tg~~~vp~~g~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r~~~v 785 (785) T protein:vir:94 740 GDGQYRFAMNGNALTTSLTLESDYPTPVSIVGCGWEASYAKKARSV 785 (785) T ss_pred ccceEEEEeecccceEEEEEEECCCCceEEEEEEEEEEEeccccCC Confidence 9999999999999999999999999999999999999999999999 No 11 >protein:vir:8887 Length: 808 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813776;genbank:gi:29366731;genbank:GeneID:1258831 Probab=100.00 E-value=7e-220 Score=1222.07 Aligned_cols=778 Identities=22% Similarity=0.287 Sum_probs=634.8 Q ss_pred CcceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCCCceEEEE Q lcl|NC_012418. 
1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~ 80 (826) ||+|+|+||||+||||||||++|||+||++|+||+|+|+|||+||||++||++++++.....+++.|.++|++.|+||++ T Consensus 1 M~~v~~s~~n~~~GvSqq~d~~R~~~q~~~~~N~~~~~~gG~~rRpgt~~v~~l~~~~~~~~~~~~~~~~~~~~~~y~v~ 80 (808) T protein:vir:88 1 MGLVSQSVKNLKGGISQQPDILRFSNQGALQINGWSSETQGLQKRPPTTFTKRLQNKGFLGTKPLVHLINRDAQEQYFVG 80 (808) T ss_pred CcceeeecchhccceeccchhHhhhhhhhhhhcceeeeccccccCCchheeeeeeccCCCCCCcEEEEEEeCcCceEEEE Confidence 99999999999999999999999999999999999999999999999999999988877677889999999999999999 Q ss_pred EecCCeEEEEECCCCEEEEecCcccccccCCCcc-cEEEEEecCEEEEeeCCcceeeeec--ccCCCCCCccEEEEEccc Q lcl|NC_012418. 81 AQHRGELYLFDERDGRLLMGQPLVHDYLKAADYR-QLRAATVADDLFIANLSVKPEADRT--DVKGVDPNKAGWLYIKAG 157 (826) Q Consensus 81 ~~~~g~irv~d~~~g~~~~~~~~~~~yl~~~~~~-~l~~~~vaD~~fi~n~~~~~~~~~~--~~~~~~~~~~a~~~vr~g 157 (826) ++++| |||||++|+.+++.+.. +|+.++++. +|+++|+||+|||||++++|++..+ ....+++..++++++|+| T Consensus 81 ~~~~~-i~v~~~~G~~~~v~~~~--~y~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~~~~~~~~~~~~~~~vr~g 157 (808) T protein:vir:88 81 FSGTG-LAVWDLKGNNYTVRGYN--GYANCANPRTDLRLITVADYTFVVNRNTVCQMGSTLTHAAYPRLDGRAIINVRGG 157 (808) T ss_pred EeCCe-EEEEEcCCceEEEeecC--cceEecCChhheeEEEEcCEEEEEcCCcceeecccccccCCCCCCccEEEEEccc Confidence 98764 99999988777776543 566655544 7999999999999999999998554 345567788999999999 Q ss_pred ccCeeEEEEEeccCCcceeeeeeeeeEEeccCcccccc------ccccceeeccchhhhhhhheeecccceEEeeeeecc Q lcl|NC_012418. 158 QYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPN------LAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKY 231 (826) Q Consensus 158 ~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~------~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~ 231 (826) +|+++|+|+|++...+. ..+..+..+.+...... ...........+++.++...+... 
T Consensus 158 ~y~~~y~i~i~g~~s~~----~~t~t~~~~~~s~~~v~~~~~~~~~~~~~~~~~~~ia~~l~~~~~~~------------ 221 (808) T protein:vir:88 158 QYGRTLSITINGDGTGS----SPQASIKMPNGSAEKVPAGDPYAGMNQVDMTDASWIAAELARQLTVS------------ 221 (808) T ss_pred ccCceEEEEEecCCcce----eeeEeEEEccCcccceeeccceeecccCCccccccchhhheeeeeec------------ Confidence 99999999999765432 23455666554322211 111112222333333333222111 Q ss_pred ceecccccccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEee Q lcl|NC_012418. 232 PKVDPDTAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDG 311 (826) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~ 311 (826) ....+++........++...++.+....++.++++++.+.+..+.|+++++||+.+|. |+.++ T Consensus 222 ---------~~~~~~~~~~~~~~~~i~~~a~~~~~~~~t~~g~~~~~~~~~~~~v~~~~~lp~~~p~----g~~v~---- 284 (808) T protein:vir:88 222 ---------LGGSGWSFQAGTGWILINAPANDNVRQIATKDGYADTLLSGFIYQVQTFTKLPANAPP----GYLVE---- 284 (808) T ss_pred ---------ccccceEEEeccceEEEEeccCceeEEEcccCCcCcceeeeeeeeccceeeccccCCC----CcEEE---- Confidence 0111233333322333444444444455667788888999999999999999998874 44443 Q ss_pred eEecCCCccceEEEEEecCCceEEEeecccccccc--cceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccCCC Q lcl|NC_012418. 
312 AVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVTRG 389 (826) Q Consensus 312 ~~~~~~~~~~~~y~~~~~~~~~w~E~~~~g~~~~~--~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~ 389 (826) +....++..++||++|+..+++|+||++||+..++ .||||.++ ++++++|.+++.+|++|.+||++|||+|+|+|++ T Consensus 285 i~~~~~~~~~~~yv~~~~~~~~w~e~~~~~~~~~~~~~tmp~~lv-~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~ 363 (808) T protein:vir:88 285 ITGESARSGDNYWVQYDASGKVWKETAKPKIIAGFNNATLPHALV-RAADGQFDWTPLTWDGRNAGDDDTNPMPSFVGAT 363 (808) T ss_pred EEecCCCCCceeEEEEEcCCeEEEEeeeccceeeecccceeEEEE-ecCCceEEEEecccccccccccccCccceecCCc Confidence 23445678899999999999999999999987665 58999998 7789999999999999999999999999999999 Q ss_pred ceEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCC Q lcl|NC_012418. 390 ITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGG 469 (826) Q Consensus 390 ~~~v~~~q~RL~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~~~ 469 (826) |++|+||||||+|++|++|||||+||||||+++|++++.|||||+++++++++++|+|+|+++++|+|||+++||+|+++ T Consensus 364 ~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~i~T~~~e~~l~~~ 443 (808) T protein:vir:88 364 INDVFFFRNRLGFLSGENVVMSRTSKYFNFFPSSVATLSDDDPIDVAISHNRISILKYAVPFSEQLLLWSDQAQFVLSSK 443 (808) T ss_pred eeEEEEEcceEEEeeCCeEEEEeccCcccccCCcccCCCCCccEEEEecCCccceeeEEeecCCcEEEEecCcEEEEeCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEE-EeeccccccccchhHHHHHHHHhcCCCeEEEEE Q lcl|NC_012418. 
470 GIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHE-MAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQA 548 (826) Q Consensus 470 ~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e-~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~ 548 (826) ++|||+|++++++|+|+|++.++|+.+|++++|++++| ++++++| |.|+.+.|+ |+++|||+|++|||++++..+++ T Consensus 444 ~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~f~~~~g-~~~~v~r~~~~~~~~d~-y~~~dlt~~~~h~~~~~~~~~~~ 521 (808) T protein:vir:88 444 TILSSKTIELDLTTEFDVSDGARPYGIGRGVYFAAPRA-SFTSLKRYYAIQDVSDV-KSAEDVSAHVPSYITNTVHAIHG 521 (808) T ss_pred CcccceeEEEEEEEEecccCCCCceEeCCeEEEEecCC-CeeEEEEEEEeeeccCc-eehhhHHHHHHHhcCCCeEEEEE Confidence 99999999999999999999999999999999999988 5676655 567655554 99999999999999998877754 Q ss_pred -cCCCCEEEEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEE----EEECCeEEEEEEeCCCEEEEEEEEeecCC Q lcl|NC_012418. 549 -AASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGT----YFTGDNLMVLIQKGQEIALGRMHLNSLPA 623 (826) Q Consensus 549 -s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~----~~~~d~l~~vv~R~~~~~~~r~~~~~~~~ 623 (826) +++|..++|++++||+|++|||||+++||+|+|||||+|+|.++++ ++++|+||++|+|+++.++|||.+.+ T Consensus 522 ~~~~~~~~v~~~~~~g~l~~~~y~~~~~e~~v~aW~r~~~~g~~~~~~~~~~~~~d~l~~vV~r~~~~~ler~~~~~--- 598 (808) T protein:vir:88 522 SGTENFVSILSDGSPNKVFIYKFLYLDEILQQQSFSHWEFGDAATTRVLAASCIGSYCYLMIDRPEGLCLERMEFTQ--- 598 (808) T ss_pred eCCCCeEEEEEEcCCCEEEEEEEeccCCceeEEeeEEEecCCCeeEEEEEEeccCCEEEEEEEcCCcEEEEEEeecc--- Confidence 5566678999999999999999999999999999999999877654 44599999999999999999997644 Q ss_pred cCCCCcccccceEEEEeeccceee---e----cc-cccCccccccceEEeeeeeeEeeccEEcccE-ecCCCeEEEeecC Q lcl|NC_012418. 624 REGLQYPKYDYWRRIEATVEGELE---L----TK-QHWDLIKDAPAVYQLQPVAGAFMERYQLGVK-RETNTKVFLDVPE 694 (826) Q Consensus 624 ~~~~~~~~~~~~~~~~~~~~~~~~---~----~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~g~~~l~~~~ 694 (826) +..++..+++..++||....... . +. ...+.+ ....|+++.......++..+... 
....+..++.+|+ T Consensus 599 -~~~~~~~~~~~~~lD~~~~~~~g~~~~~~~~t~~~~~~~~--~~~~~~~~~~~~~~~dg~~~~~~~~~~~~~~~~~~~~ 675 (808) T protein:vir:88 599 -HTIDYSIEPYRTYMDMKKTIVLGAYNIDTNLTSFDVRTAY--GGTPGPESTFYTIDQQGVLIEHEARDWATNPYISFVG 675 (808) T ss_pred -CCCCCccccceeeeeeeeeeccccccCccccceeeccccc--ccccccceeEEEEcCCceEEeeecccccCcceEEeCC Confidence 34455556666667765432211 0 10 011111 12235666666667777655332 3334556788899 Q ss_pred CcCCceEEEEEeeeeEEEeCCeEEECCCCceee----ecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccccccc Q lcl|NC_012418. 695 AVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMT----SARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFS 770 (826) Q Consensus 695 ~~~~~~v~vG~~y~~~~~~~~~~~~~~~g~~~~----~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~ 770 (826) +..+++|+|||+|+++++|+||++++++|+.++ .+|+||+|+++++.+||+|.++|+++.++ ..+.+.++++++ T Consensus 676 ~~~~~~v~vGl~y~s~~~~~p~~~~~~~g~~~~~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~--~~~~~~~~~~~~ 753 (808) T protein:vir:88 676 NRAGEQMVIGKQYTFQYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWLNYEESGAFEINVNNGSSE--FVYVMTGGRLGI 753 (808) T ss_pred CccCceEEEeeeeeEEEEecceEEecCCCCcceeecccceEEEEEEEEEeecccceEEEeCCCccc--ceeeccCcccCc Confidence 999999999999999999999999999887655 47899999999999999999999987664 356778999997 Q ss_pred cccccCccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecccccC Q lcl|NC_012418. 
771 RQLNAGEPLVDSAVVPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 771 ~~~~~~~~~~~tg~~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r~rrv 826 (826) .++ .+.+++++|+++||+.+|+++.+|+|+|++|+||+|+||+|||+||+|+||| T Consensus 754 ~~~-~~~~~~~tg~~~vp~~~~~~~~~v~i~~d~P~P~tilsi~~eg~y~~r~~~v 808 (808) T protein:vir:88 754 QRV-LGELSVGTGQFKFPVTGNAVNQRVTITSSNPNPLNVIGCGWEGNYIRRSSGI 808 (808) T ss_pred ccc-cCccccccceEEEEecccCceeEEEEEECCCCceEEEEEEEEEEEeccccCC Confidence 765 7788999999999999999999999999999999999999999999999999 No 12 >protein:vir:97014 Length: 800 # NCBI annotation: 33 # Family: family:all:825 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654134;genbank:gi:108862018;genbank:GeneID:5075963 Probab=100.00 E-value=3.5e-217 Score=1207.24 Aligned_cols=770 Identities=21% Similarity=0.277 Sum_probs=629.3 Q ss_pred cceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCCCceEEEEE Q lcl|NC_012418. 2 SYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLVA 81 (826) Q Consensus 2 ~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~~ 81 (826) =.|+||||||+||||||||++|||+||++|+||+|+|+|||+||||++||+++++++. ...++.|.++||+.|+||+++ T Consensus 1 ~~v~~s~~n~~~GvSqq~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~fva~l~~~~~-~~~~~~~~~~~d~~eq~~v~~ 79 (800) T protein:vir:97 1 MEVQGSLGRQIQGISQQPPAVRLDGQCTAMVNMIPDVVNGTQSRMGTTHIAKILDAGT-DDMATHHYRRGDGDEEYFFTL 79 (800) T ss_pred CeeEeechhhhcccccCchhHhhhhhhhhhhcceeccccccccCCchhhheeecCCCc-ccceeEEEEEcCCceEEEEEE Confidence 2588999999999999999999999999999999999999999999999999988764 356777889999999999999 Q ss_pred ecCCeEEEEECCCCEEEEecCcc-cccc--cCCCcccEEEEEecCEEEEeeCCcceeeeecccCCCCCCccEEEEEcccc Q lcl|NC_012418. 82 QHRGELYLFDERDGRLLMGQPLV-HDYL--KAADYRQLRAATVADDLFIANLSVKPEADRTDVKGVDPNKAGWLYIKAGQ 158 (826) Q Consensus 82 ~~~g~irv~d~~~g~~~~~~~~~-~~yl--~~~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~~~vr~g~ 158 (826) ++.++||||+++|+.+.+..... ..|+ ++++.++|+++||||||||+|++++|++.... 
...+..++++++|+|+ T Consensus 80 ~~~~~~rv~~~~G~~~~v~~~~~~~~y~~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~--~~~~~~~~~~~v~~g~ 157 (800) T protein:vir:97 80 KKGQVPEIFDKYGRKCNVTSQDAPMTYLSEVVNPREDVQFMTIADVTFMLNRRKVVKASSRK--SPKVGNKAIVFCAYGQ 157 (800) T ss_pred EcCCEEEEEecCCcEEEEecCCcceEEEeccCCCccceeEEEEcCEEEEeeCceeccccccc--ccCCCcceEEEEeecc Confidence 99999999999777766654432 2354 33456699999999999999999999985433 3456788999999999 Q ss_pred cCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeeeeccceecccc Q lcl|NC_012418. 159 YSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDT 238 (826) Q Consensus 159 Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~~~~~~~~ 238 (826) |+|+|+|+|++.. ++.|.+|++.. .+...++++.+++.++.+.+..+.+ T Consensus 158 y~~~y~i~I~~~~---------~~~~~t~~~t~-----~~~~~~~~~~~ia~ql~~~~~~~~~----------------- 206 (800) T protein:vir:97 158 YGTSYSIVINGAN---------AASFKTPDGGS-----ADHVEQIRTERITSELYSKLQQWSG----------------- 206 (800) T ss_pred cceeeeeccCCcc---------eEEEEEcCCCC-----cccceeccHHHHHHHHHHhhhcccc----------------- Confidence 9999999998653 45667776543 3456667788899888776543211 Q ss_pred cccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEecC-C Q lcl|NC_012418. 239 AAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVMAT-G 317 (826) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~~~-~ 317 (826) .++++....+...|++...+.++. ..+.++++++.++++.++|+++++||+++|. |+.+. 
+..+ + T Consensus 207 ----~~~~t~~~~G~~~~i~~~~~~~~~-v~t~~g~~~~~~~~~~~~v~~~~~lp~~~~~----g~~v~-----i~~~~~ 272 (800) T protein:vir:97 207 ----VSDYEIQRDGTSIFIERRDGASFT-ITTTDGAKGKDLVAIKNKVSSTDLLPSRAPA----GYKVQ-----VWPTGS 272 (800) T ss_pred ----ccceEEEeCCcEEEEEEcCCceEE-EEecCCcCceeeeEEeeeccchhhchhhCCC----CcEEE-----EEccCC Confidence 123334334333344444444443 4567777888999999999999999998874 44443 3333 4 Q ss_pred CccceEEEEEecC---CceEEEeeccccccc--ccceeEEEEEec---CCCeeEEeecCCcccccCCcccccCccccC-- Q lcl|NC_012418. 318 STKAPVYFEWDSA---NRRWAERAAYGTDWV--LKKMPLALRWDE---ATDTYSLNELDYDRRGSGDEDTNPTFNFVT-- 387 (826) Q Consensus 318 ~~~~~~y~~~~~~---~~~w~E~~~~g~~~~--~~tmp~~~~~~~---~~~~f~~~~~~w~~r~~gd~~tnp~psf~g-- 387 (826) ...+.||++|+.. .++|+||+++|...+ ..||||+++... ++++|++++.+|++|.+|||++||+|+|+| T Consensus 273 ~~~~~y~~~~~~~~~~~~~w~e~~~~~~~~~~~~~tmp~~~~~~~~~~~~g~~~~~~~~w~~r~~gd~~tnp~p~f~~~~ 352 (800) T protein:vir:97 273 KPESRYWLQAEPKEGNLVSWKETIAADVLLGFDKGTMPYIIERTDIINGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEE 352 (800) T ss_pred CCCceEEEEEEecccCcceEEEeeccccccceecccceEEEEEeecccccceeEEEeccccccccCccccCccccccCCc Confidence 5667889999864 468999999996654 569999998543 678999999999999999999999999998 Q ss_pred --CCceEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEE Q lcl|NC_012418. 
388 --RGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAV 465 (826) Q Consensus 388 --~~~~~v~~~q~RL~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~ 465 (826) ++|++|+||||||+|++|++|||||+||||||+++|+++++|||||+++++++++|+|+|+|+++++|+|||+++||+ T Consensus 353 ~~~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~v~~i~~~v~~~~~L~i~T~~~q~~ 432 (800) T protein:vir:97 353 VPQTIGGMFMVQNRLCFTAGEAVIASRTSYFFDFFRYTVISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFI 432 (800) T ss_pred CCCCceeEEEEeeeEEEecCCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeeEEeecCCcEEEEecCcEEE Confidence 789999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeCCccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEE Q lcl|NC_012418. 466 VPGGGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEY 545 (826) Q Consensus 466 ~~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~ 545 (826) |+++++|||+|++++++|+|+|+++++|+.+|++++|++++| ++++||||+|+.++|+ |+++|||+|++|||+|++.+ T Consensus 433 ls~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~fv~~~g-~~s~vre~~~~~~~d~-~~a~DlT~~~~hl~~~~v~~ 510 (800) T protein:vir:97 433 LPGDKPLEKSNALLKPVTTFEVNNKVKPVVTGESVMFATNDG-SYSGVREFYTDSYSDT-KKAQAITSHVNKLIEGNITN 510 (800) T ss_pred EeCCCcccceeEEEEEEEeeeccCCCCcEEeCCeEEEeeCCC-CeeEEEEEeeeecccc-eehhhHHHHHHHhcCCceEE Confidence 999999999999999999999999999999999999999988 5789999999988887 99999999999999999999 Q ss_pred EEEcCCCCE-EEEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCC--cEEEEEEECCeEEEEEEeCCCEEEEEEEEeecC Q lcl|NC_012418. 546 IQAAASSGY-LVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRH--QIIGTYFTGDNLMVLIQKGQEIALGRMHLNSLP 622 (826) Q Consensus 546 ~~~s~~p~~-~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g--~v~~~~~~~d~l~~vv~R~~~~~~~r~~~~~~~ 622 (826) |+++++|++ ++|+++++|+|++||||++++||+|+|||||+++| .+++|++++|+||++|+|+++.++|||.++... 
T Consensus 511 ~~~~~~~~~~v~~~~~~~~~l~~~~y~~~~~e~~~~aW~~~~~~~~~~~~~~~~~~d~l~~vv~r~~~~~ler~~~~~~~ 590 (800) T protein:vir:97 511 MAASTNVNRLLVTTDKYRNIIYCYDWLWQGTDRVQSAWHVWKWPIGTKVRGMFYSGELLYLLLERGDGVYLEKMDMGDAL 590 (800) T ss_pred EEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceEEEeEEEEecCCCeEEEEEEEcCCeEEEEEEcCCcEEEEEEecccCc Confidence 999988876 56899999999999999999999999999999976 677888899999999999999999999765433 Q ss_pred CcCCCCcccccc--eEEEEeeccceeeecccccCcccc---------ccceEEeeeeeeEeeccEEcccEecCCCeEEEe Q lcl|NC_012418. 623 AREGLQYPKYDY--WRRIEATVEGELELTKQHWDLIKD---------APAVYQLQPVAGAFMERYQLGVKRETNTKVFLD 691 (826) Q Consensus 623 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~ 691 (826) +....+...+|. ..+.+.+......+.........+ ....|++|..+.... +...+...+..+. T Consensus 591 ~~~~~~~~~lD~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~v~g~~~~~G~~v~~~~-~~~~~~~~~~~~~---- 665 (800) T protein:vir:97 591 TYGLNDRIRMDRQAELVFKHFKAEDEWVSEPLPWVPTNPELLDCILIEGWDSYIGGSFLFKY-NPSDNTLSTTFDM---- 665 (800) T ss_pred CcccccceeccccceeeeeeeecccceEeccccccCCCcceeEEEEecccccccCceEEEEe-cCccCcccccceE---- Confidence 221111111221 111111111101110000000000 112355666553322 2222222333333 Q ss_pred ecCCcCCceEEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeecccccccc Q lcl|NC_012418. 692 VPEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSR 771 (826) Q Consensus 692 ~~~~~~~~~v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~ 771 (826) ++++..+++|+|||+|+++++|+||++++++|+++..+|+||+|++|+|.+||+|++.|.++.++....+...+.++++. 
T Consensus 666 ~~~~~~~~~v~vGl~Y~~~~~~~p~~i~~~~g~~~~~~r~~i~r~~~~~~~sg~~~~~v~~~~~~~~~~~~~~~~~~g~~ 745 (800) T protein:vir:97 666 YDDSHVKAKVIVGQIYPQEFEPTPVVIRDNQDRVSYIDVPVVGLVHLNLDMYPDFSVEVKNVKSGKVRRVLASNRIGGAL 745 (800) T ss_pred EeCCCCCcEEEEeeeeeEEEEecceEEEecCCCceeecceEEEEEEEeecccccEEEEEccccCCceeeeecCccccccc Confidence 34556678999999999999999999999999999999999999999999999999999999998777778899999999 Q ss_pred ccccCccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecccccC Q lcl|NC_012418. 772 QLNAGEPLVDSAVVPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 772 ~~~~~~~~~~tg~~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r~rrv 826 (826) .+..+.|++++|+++||+.+|+++.+|+|+|++|+||+|+||+|||+||+|+||| T Consensus 746 ~~~~g~~~~~tg~~~vp~~g~~~~~~v~i~~d~PlP~tvlsi~~eg~y~~r~~rv 800 (800) T protein:vir:97 746 NNTVGYVEPREGVFRFPLRAKSTDVVYRIIVESPHTFQLRDIEWEGSYNPTKRRV 800 (800) T ss_pred cccCCccccccceEEEEeecccceeEEEEEECCCCcEEEEEEEEEEEeecccccC Confidence 9999999999999999999999999999999999999999999999999999999 No 13 >protein:vir:105647 Length: 800 # NCBI annotation: putative tail tubular B protein # Family: family:all:825 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425011;genbank:gi:83571759;uniprot:Q2WC41;genbank:GeneID:3837288 Probab=100.00 E-value=6.5e-217 Score=1205.79 Aligned_cols=769 Identities=22% Similarity=0.307 Sum_probs=633.4 Q ss_pred cceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCCCceEEEEE Q lcl|NC_012418. 
2 SYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLVA 81 (826) Q Consensus 2 ~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~~ 81 (826) =.|+||||||+||||||||++|||+||++|+||+|+|+|||+||||++||++++++....+.+|.|..++++.|++++++ T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~fva~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (800) T protein:vir:10 1 MEVQGSLGRQIQGISQQPPAVRLDGQCTTMVNMVPDVVNGTQSRMGTTHIAKLLDEGTDNMATHHYRRGEGDEEYFFTLK 80 (800) T ss_pred CeEEeecchhcccccccchhHhhhhhhhhhhcceeeeccCcccCCcceEEEeecCCCCCccEEEEEecCCccceEEEEEE Confidence 25889999999999999999999999999999999999999999999999999887766565666666566667777777 Q ss_pred ecCCeEEEEECCCCEEEEecCcc-ccccc-CCCc-ccEEEEEecCEEEEeeCCcceeeeecccCCCCCCccEEEEEcccc Q lcl|NC_012418. 82 QHRGELYLFDERDGRLLMGQPLV-HDYLK-AADY-RQLRAATVADDLFIANLSVKPEADRTDVKGVDPNKAGWLYIKAGQ 158 (826) Q Consensus 82 ~~~g~irv~d~~~g~~~~~~~~~-~~yl~-~~~~-~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~~~vr~g~ 158 (826) .+ +++|||+++|..+.+..+.. ..|+. +.++ ++|+++||||+|||+|++++|++... .......++++++|+|+ T Consensus 81 ~g-~~~rv~~~~G~~~~v~~~~~~~~~~~~~~~~~~~l~~~tvaD~tfi~n~~~~~~~~~~--~~~~~~~~~~~~vr~g~ 157 (800) T protein:vir:10 81 KG-QVPEIFDKHGRKCNVISQDAPMTYLSEVVNPREDVQFMTIADVTFMLNRRKVVKVSNR--KSPKVGDKAIVFCAYGQ 157 (800) T ss_pred cC-CeEEEEecCCcEEEeecCCcceeeeeccCCchhhEEEEEEcCEEEEecCccccccccc--CCCCCCceEEEEEeccc Confidence 65 68999998766665554332 23443 3444 48999999999999999999998533 33455678999999999 Q ss_pred cCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeeeeccceecccc Q lcl|NC_012418. 159 YSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDT 238 (826) Q Consensus 159 Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~~~~~~~~ 238 (826) |+|+|+|+|++.. ++.+++|++.. +++..++++++++++|...+..+. 
T Consensus 158 y~~~y~i~i~g~~---------~~~~~t~~~~~-----~~~~~~~s~~~i~~~L~~~l~~~~------------------ 205 (800) T protein:vir:10 158 YGTSYSIIINGTT---------AASFKTPDGGS-----AEHVEQIRTERITSELYSKLQQWS------------------ 205 (800) T ss_pred cccceeEEeccce---------EEEEEecCCCc-----ccccccccHHHHHHHHHhhhhhcC------------------ Confidence 9999999998653 35667776543 345666788899999877654321 Q ss_pred cccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEecC-C Q lcl|NC_012418. 239 AAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVMAT-G 317 (826) Q Consensus 239 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~~~-~ 317 (826) ...+++....+..-|++...+.+.. ..+.++++++++.++.+.|+++++||..+|. |+.+. ++.+ + T Consensus 206 ---~~~~~t~~~~g~~i~i~~~~~~~~~-~~~~~~~~~~~~~~~~~~v~~~~~Lp~~~~~----g~~~~-----i~~~~~ 272 (800) T protein:vir:10 206 ---GVNDYEIQRDGTSIFIERRDGKSFT-VTTTDGAKGKDLVAIKNKVSSTDLLPSRAPA----GYKVQ-----VWPTGS 272 (800) T ss_pred ---cccceEEEEcCcEEEEEEecCCceE-EEEeecCCcceEEEEEeeccceeeccccCCC----CceEE-----EEcCCC Confidence 1122333334433344444444443 3566777889999999999999999988874 44443 3334 4 Q ss_pred CccceEEEEEecC---CceEEEeecccccccc--cceeEEEEEec---CCCeeEEeecCCcccccCCcccccCccccC-- Q lcl|NC_012418. 318 STKAPVYFEWDSA---NRRWAERAAYGTDWVL--KKMPLALRWDE---ATDTYSLNELDYDRRGSGDEDTNPTFNFVT-- 387 (826) Q Consensus 318 ~~~~~~y~~~~~~---~~~w~E~~~~g~~~~~--~tmp~~~~~~~---~~~~f~~~~~~w~~r~~gd~~tnp~psf~g-- 387 (826) .+.+.||++|+.. .++|+||+++++..++ .+|||+++... 
.+++|++++.+|++|.+|||++||+|+|+| T Consensus 273 ~~~~~y~~~~~~~~~~~~~w~e~~~~~~~~~~~~~tmp~~lv~~~~~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~~~~ 352 (800) T protein:vir:10 273 KPESRYWLQAEPKEGNLVSWKETIAADVLLGFDKGTMPYIIERTGIIDGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEE 352 (800) T ss_pred CCCceeEEEEEeccccceEEEeecccCceeeeecccccEEEEEeeeeecceeEEEEeccccccccCCCCCCCCchhcCCC Confidence 5678899999864 3689999999986654 58999998543 378999999999999999999999999998 Q ss_pred --CCceEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEE Q lcl|NC_012418. 388 --RGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAV 465 (826) Q Consensus 388 --~~~~~v~~~q~RL~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~ 465 (826) ++|++|+||||||+|++|++|||||+||||||+++|+++++|||||+++++++++|+|+|+++++++|+|||+++||+ T Consensus 353 ~~~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~q~~ 432 (800) T protein:vir:10 353 VPQTIGGMFMVQNRLCFTAGEAVIASRTSYFFDFFRYTVISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFI 432 (800) T ss_pred CCCCceeEEEEeeeEEEeeCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeeeEeecCCcEEEEecCcEEE Confidence 579999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeCCccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEE Q lcl|NC_012418. 
466 VPGGGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEY 545 (826) Q Consensus 466 ~~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~ 545 (826) |+++++|||+|++++++|+|+|+++++|+.+|++++|++++| ++++||||+|+.++|+ |+++|||+|++|||+|++.+ T Consensus 433 l~g~~~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g-~~s~vre~~~~~~~d~-~~a~DlT~~~~hl~~~~v~~ 510 (800) T protein:vir:10 433 LPGDKPLEKSNALLKPVTTFEVNNKVKPVVTGESVMFATNDG-SYSGVREFYTDSYSDT-KKAQAITSHVNKLIEGNITN 510 (800) T ss_pred EeCCCcccceeEEEEEEEeeeccCCCCceEeCCeEEEecCCC-CeeEEEEEeeeecccc-eehhhHHhHHHHhcCCceEE Confidence 999999999999999999999999999999999999999988 5789999999988887 99999999999999999999 Q ss_pred EEEcCCCCEE-EEEEcCCCeEEEEEEeeCCCceeeEeeEeeecC--CcEEEEEEECCeEEEEEEeCCCEEEEEEEEeecC Q lcl|NC_012418. 546 IQAAASSGYL-VFGTSTADEMICHQYLWQGNEKVQNAFHRWTLR--HQIIGTYFTGDNLMVLIQKGQEIALGRMHLNSLP 622 (826) Q Consensus 546 ~~~s~~p~~~-v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~--g~v~~~~~~~d~l~~vv~R~~~~~~~r~~~~~~~ 622 (826) |++|++|+++ +|+++++|+|++|||||+++||+|+|||||+++ +.+++|++++|+||++|+|+++.++|||.+.... T Consensus 511 ~~~~~~~~~~v~~~~~~~~~l~~~~yl~~~~e~~~~aW~~w~~~~~~~~~~~~~~~d~l~~iv~r~~~~~ier~~~~~~~ 590 (800) T protein:vir:10 511 MAASTNVNRLLVTTDKYRNIIYCYDWLWQGTDRVQSAWHVWEWPMGTKVRGMFYSGELLYLLLERGDGVYLEKMDMGDAL 590 (800) T ss_pred EEEeCCCCeEEEEEEcCCCeEEEEEEeecCCceEEEEEEEEEcCCCcEEEEEEEeCCeEEEEEECCCcEEEEEEecccCc Confidence 9999988876 578889999999999999999999999999985 4677888899999999999999999998654322 Q ss_pred CcCCCCcccccceEEEEeeccceee--------ecccccCcccc-ccceEEeeeeeeEeeccEEcccEecCCCeEEEee- Q lcl|NC_012418. 623 AREGLQYPKYDYWRRIEATVEGELE--------LTKQHWDLIKD-APAVYQLQPVAGAFMERYQLGVKRETNTKVFLDV- 692 (826) Q Consensus 623 ~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~- 692 (826) +. .+++..++|+....... +.........+ ....++.......+.++.......+.++.++++. 
T Consensus 591 ~~------~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~~~~~~~~g~~~~~~~ 664 (800) T protein:vir:10 591 TY------GLNDRIRMDRQAELIFKHFKAEDEWISEPLPWTPTNPELLDCILIEGWDSYIGGSFLFKYKPSDNTLSTTFD 664 (800) T ss_pred cc------cccceeeeecceeecccccccCcceEEEeccccccCCcceEEeeeccceeecCceeEEEEEecCCceEeeee Confidence 21 12223334432211110 00000000111 1112344444556667777776777788887754 Q ss_pred --cCCcCCceEEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccccccc Q lcl|NC_012418. 693 --PEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFS 770 (826) Q Consensus 693 --~~~~~~~~v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~ 770 (826) +++..+.+|+|||+|+++++|+||++++++|++++.+|+||+|++|+|.+||+|.+++.+..++....+...+++.++ T Consensus 665 ~~~g~~~~~~v~VGl~Y~s~~~~~~~~i~~~~g~~~~~~r~~i~r~~~~~~~sg~~~~~v~~~~~~~~~~~~~~~~~~g~ 744 (800) T protein:vir:10 665 MHDDNHVKAKVIVGQIYPQEFEPTPVVIRDRQDRVSYIDVPVVGLVHLNLDMYPDFSVEVKNVKSGKVRRVLASNRIGGA 744 (800) T ss_pred ecCCCcccceEEEeeeeeEEEeecceEEEcCCCcccccCCeEEEEEEEEeecCceEEEEeccCcccceeEEccCCeeccc Confidence 677788999999999999999999999999999999999999999999999999999999999888888889999999 Q ss_pred cccccCccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecccccC Q lcl|NC_012418. 
771 RQLNAGEPLVDSAVVPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 771 ~~~~~~~~~~~tg~~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r~rrv 826 (826) .++.+|.+|++||+++||+.+|+++.+|+|++++|+||+|++|+|||+||+|+||| T Consensus 745 ~~~~~g~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlai~~eg~y~~r~~rv 800 (800) T protein:vir:10 745 LNNTVGYVEPREGVFRFPLRAKSTDAVYRIIVESPHTFQLRDIEWEGSYNPTKRRV 800 (800) T ss_pred cccccCcccccCceEEEEEeccCceeEEEEEECCCCcEEEEEEEEEEEeecccccC Confidence 99999999999999999999999999999999999999999999999999999999 No 14 >protein:vir:7021 Length: 803 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853594;genbank:gi:31711676;genbank:GeneID:1481802 Probab=100.00 E-value=4.1e-215 Score=1195.90 Aligned_cols=769 Identities=22% Similarity=0.291 Sum_probs=624.4 Q ss_pred cceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeC--CCceEEE Q lcl|NC_012418. 2 SYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLG--GRSIAML 79 (826) Q Consensus 2 ~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd--~~e~~~i 79 (826) =.|+|+||||+||||||||++|||+||++|+||+|+|+|||+||||++||+++++.. ...++.|+++|+ +.|+||+ T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~~va~l~~~~--~~~~~~~~~~~~~~~~e~~~~ 78 (803) T protein:vir:70 1 MEVQGSLGRQIQGISQQPPAVRLDGQCSEMVNMVPDVVEGTKSRMGTTHIAKLLEYG--EDDMAVHHYRRGGEGEEEYFF 78 (803) T ss_pred CeEEeecchhccccccCchHHhhhhhhhhhhcceeeeccccccCChhhhhhhhcCCC--cccceeeEEEecCCCceEEEE Confidence 368999999999999999999999999999999999999999999999999988654 356788888874 4689999 Q ss_pred EEecCCeEEEEECCCCEEEEecCcc-ccccc--CCCcccEEEEEecCEEEEeeCCcceeeeecccCCCCCCccEEEEEcc Q lcl|NC_012418. 80 VAQHRGELYLFDERDGRLLMGQPLV-HDYLK--AADYRQLRAATVADDLFIANLSVKPEADRTDVKGVDPNKAGWLYIKA 156 (826) Q Consensus 80 ~~~~~g~irv~d~~~g~~~~~~~~~-~~yl~--~~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~~~vr~ 156 (826) +++++|+||||+++|+.+++.+... ..|+. +...++|+++||||+|||+|++++|++..+.. 
.+++.++++++|+ T Consensus 79 ~~~~~~~irv~~~~G~~~~v~~~~~~~~~l~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~--~~~~~~~~~~vr~ 156 (803) T protein:vir:70 79 IMKKGQVPEIFDKQGRKCMVQSQDAPMTYLSEVTNPREDVQFMTIADVTFMLNRKKIVKARPERS--PQVGSTAIVFMAY 156 (803) T ss_pred EEecCCeEEEEEcCCcEEEEecCCceeEEEeecCCChhheeEEEEcCEEEEecCceeeeeccccC--CCCCCceEEEEee Confidence 9999999999999887776655432 33553 23345899999999999999999999876543 4566789999999 Q ss_pred cccCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeeeeccceecc Q lcl|NC_012418. 157 GQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDP 236 (826) Q Consensus 157 g~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~~~~~~ 236 (826) |+|+++|+|+|++.. ++.|+++++... .....++.++++.++....... T Consensus 157 g~y~~~y~itIng~~---------~a~~~t~~~~~~-----~~~~~~~~~~ia~~l~~~~~~~----------------- 205 (803) T protein:vir:70 157 GQYGTHYKIIIDGVV---------AAGYKTRDGAEA-----HHIEDIRTESIAYNLYQSLQSW----------------- 205 (803) T ss_pred cCCcceEEEEeCCcc---------eEEEEeCCCccc-----ccccccchhhhhhhhhhheecc----------------- Confidence 999999999998753 345666665432 2334556778888876544321 Q ss_pred cccccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEecC Q lcl|NC_012418. 237 DTAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVMAT 316 (826) Q Consensus 237 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~~~ 316 (826) .+.++++....+...++....+.+.....+.++..++++....+.|+++++||+++|. |+.+. 
+..+ T Consensus 206 ----~s~a~~~~~~~g~~~~i~~~~~~~~~~~~t~~g~~~~~~~~~~~~v~~~~~Lp~~~~~----g~~v~-----v~~~ 272 (803) T protein:vir:70 206 ----DKIADYEIQLDGTSIYITRRDGSTTFDITTEDGAKGKDLVAIKYKVASTDLLPSRAPE----GYKVQ-----VWPT 272 (803) T ss_pred ----ccccceEEEECCcEEEEEEcCCCCeeEEEeecCcCCcEEEEEEecccceeeccccCCC----CceEE-----EEcC Confidence 0112344433333333333333333344566677788999999999999999998874 34433 3344 Q ss_pred C-CccceEEEEEecCC---ceEEEeecccccccc--cceeEEEEEe---cCCCeeEEeecCCcccccCCcccccCccccC Q lcl|NC_012418. 317 G-STKAPVYFEWDSAN---RRWAERAAYGTDWVL--KKMPLALRWD---EATDTYSLNELDYDRRGSGDEDTNPTFNFVT 387 (826) Q Consensus 317 ~-~~~~~~y~~~~~~~---~~w~E~~~~g~~~~~--~tmp~~~~~~---~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g 387 (826) + .+.|.||++|+..+ ++|+||+++|...++ .||||.++.. .+.++|.+++.+|+.|.+|||+|||+|+|+| T Consensus 273 g~~~~d~y~v~~~~~~~~~~~w~e~a~~g~~~~~~~~t~p~~~v~~~~~~~~~~~~~~~~~~~~r~~gdd~tnp~psf~~ 352 (803) T protein:vir:70 273 GSKPESRYWLQAEKQNGNIVSWKETLAADVLIGFDKSTMPYIIERTGFVNGIAQFKIRQGDWEDRKVGDDLTNPMPSFID 352 (803) T ss_pred CCCCCceeeEEEEeccCCccceEeeeccceeeeeecccccEEEEEEEEeecceeEEEEeeccccccccccccCccccccC Confidence 4 45678999998644 589999999987765 5899999742 2457899999999999999999999999997 Q ss_pred ----CCceEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcE Q lcl|NC_012418. 
388 ----RGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQ 463 (826) Q Consensus 388 ----~~~~~v~~~q~RL~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q 463 (826) ++|++|+||||||+|++|++|||||+||||||+++|+++++|||||+++++++++|+|+|+++++++|+|||+++| T Consensus 353 ~~~~~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~i~T~~~q 432 (803) T protein:vir:70 353 EEVPQTLGGMFMVQNRLCVTAGEAVIATRTSYFFDFFRYTAVSAVATDPFDVFSDASEVYQLKHAVTLDGSTVLFADKSQ 432 (803) T ss_pred ccCCCCceeEEEEeceEEEeeCCeEEEEccCCccccccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcE Confidence 5799999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeCCccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCe Q lcl|NC_012418. 464 AVVPGGGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPA 543 (826) Q Consensus 464 ~~~~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v 543 (826) |+|+++++|||+|++++++|+|+|+++++|+.+|++++|++++| ++++||||+|+.++|+ |+++|||+|++|||++++ T Consensus 433 ~~l~g~~~lTP~~~~i~~~s~~~~~~~~~Pv~vg~~v~fv~~~g-~~s~vre~~~~~~~d~-y~a~Dlt~~a~hl~~~~v 510 (803) T protein:vir:70 433 FILPGDKPLEKSNVLLKPVTTFEVNNNVKPVATGESVMFATSEG-AYSGIREFYTDSYSDT-KKAQAITSHVNKLLEGNV 510 (803) T ss_pred EEEeCCCcccceeEEEEEEEEeeccCCCccEEeCCeEEEeccCC-CeeEEEEEeccccccc-eehhhhhhhhHhhcCCce Confidence 99999999999999999999999999999999999999999988 5789999999988887 999999999999999999 Q ss_pred EEEEEcCCCCEEE-EEEcCCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEEEEE--CCeEEEEEEeCCCEE-EEEEEEe Q lcl|NC_012418. 
544 EYIQAAASSGYLV-FGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYFT--GDNLMVLIQKGQEIA-LGRMHLN 619 (826) Q Consensus 544 ~~~~~s~~p~~~v-~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~--~d~l~~vv~R~~~~~-~~r~~~~ 619 (826) .+|++|++|++++ |+.+++++|++||||++++||+|+|||||+|+|.|+++|++ +|+|||+|+|+++++ +|||.++ T Consensus 511 ~~~~~~~~~~~~v~~~~~~~~~l~~~~yl~~~~e~~v~aW~r~~~~g~~~~~~~~~~~d~l~~vv~r~~~g~~ier~~~~ 590 (803) T protein:vir:70 511 IMMSASTNVNRLLVLTDKYRNIIYCYDWLWQGTERVQAAWHKWEWPLGTFIRGMFYSGEHLYLLIERGSTGVYLERMDMG 590 (803) T ss_pred EEEEEeCCCCeEEEEEEcCCCeEEEEEEEecCCcEEEEeEEEEEcCCCEEEEEEEecCCEEEEEEEECCCeEEEEEEecc Confidence 9999999998765 67789999999999999999999999999999999999887 899999999998774 7787553 Q ss_pred ecCCcCCCCcccccceEEEEeeccceee--------ecccccCccccccce-EEeeeeeeEeeccEEcccEecCCCeE-- Q lcl|NC_012418. 620 SLPAREGLQYPKYDYWRRIEATVEGELE--------LTKQHWDLIKDAPAV-YQLQPVAGAFMERYQLGVKRETNTKV-- 688 (826) Q Consensus 620 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~g~~-- 688 (826) ..... .+.+..++|+....... ......+..+....+ +.+........++.+........... T Consensus 591 ~~~~~------~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~g~~t~~~ 664 (803) T protein:vir:70 591 DALVY------NLNDRIRMDRQAELIFRHIKAEDVWVSEPLPWQPTDVTLLDCVLIDGWDSYIGGSFLFSYNPGDNTLTT 664 (803) T ss_pred ccccc------CCcceeEeccceeEeeccccCCceeeeecccccCcccceeeEEEeeeeeeecCCeEEEEEcCCCcccee Confidence 32211 22233455544322211 111122233322322 22233333334443332221111111 Q ss_pred -EEeecCCcCCceEEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeecccc Q lcl|NC_012418. 
689 -FLDVPEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLR 767 (826) Q Consensus 689 -~l~~~~~~~~~~v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~ 767 (826) ...++++..+++|+|||+|+++++|+||++++++|+++..+|+||+|++|+|++||+|.++|+++.+.....+.+++++ T Consensus 665 ~~~~~~~~~~a~~v~VGl~Y~~~~~~~~~~i~~~~~~~~~~~~~rl~r~~~~~~~sg~~~v~v~~~~~~~~~~~~~s~~~ 744 (803) T protein:vir:70 665 TFDMHDDDHVKAKVVVGQLYPQEFEPTQVVIRDNQERVSYIDVPTVGLVHLNLDKYPDFKVEVKNLKSGKVRNVLASNRV 744 (803) T ss_pred eeeEECCCCcccEEEEeeeeeEEEeecceEEEcCCCccccccccEEEEEEEEeecccceEEEEecCCccccceeeccchh Confidence 1335677788999999999999999999999999999999999999999999999999999999888876677889999 Q ss_pred ccccccccCccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecccccC Q lcl|NC_012418. 768 LFSRQLNAGEPLVDSAVVPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 768 ~~~~~~~~~~~~~~tg~~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r~rrv 826 (826) +++.++.+|.+++++|+++||+.+++++.+|+|++++|+||+|++|+|||+||+|+||| T Consensus 745 ~g~~~~~~g~~~~~tg~~~vP~~~~~~~~~v~i~~d~P~P~tvlsi~weg~y~~r~rrv 803 (803) T protein:vir:70 745 GGAINNIVGYVEPREGVFKFPLRSLSTDTVYRVMVESPHTFQLRDIEWEGSYNPTKRRV 803 (803) T ss_pred ccccccccCccccccceEEEEeeccCcceEEEEEECCCCCeEEEEEEEEEEEecccccC Confidence 99999999999999999999999999999999999999999999999999999999999 No 15 >protein:vir:100022 Length: 976 # NCBI annotation: T7-like tail tubular protein B # Family: family:all:825 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214208;genbank:gi:61806431;genbank:GeneID:3294702 Probab=100.00 E-value=4e-214 Score=1190.49 Aligned_cols=796 Identities=17% Similarity=0.251 Sum_probs=608.7 Q ss_pred CcceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccc----cccEEEEEeCCCce Q lcl|NC_012418. 
1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWP----RPFLYHTNLGGRSI 76 (826) Q Consensus 1 M~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~----~~~~~~~~rd~~e~ 76 (826) ||+|+|+||||++|||||||.+|+|+|+++|+||+|||+.||+||||++||+.|.+...... ...+|+++||+.|+ T Consensus 1 M~~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRp~~~~v~~l~~~~~~~~~~~~~~~~~~~~r~~~e~ 80 (976) T protein:vir:10 1 MASVTQTIPTLTGGLSQQPDELKIPGQVSVANNVIPDVTHGLLKRPGGKLVASISDNGTAALNSQTNGKWFSYYRDETES 80 (976) T ss_pred CcceeecchhhhCcceecchhhcCCchhhhhhccccccccccccCCcceeeeeecCCCcccccccccceEEEEEcCCCcE Confidence 99999999999999999999999999999999999999999999999999999987765433 45679999999999 Q ss_pred EEEEEecCCeEEEEECCCCEEEEecCc------ccccccCCCcccEEEEEecCEEEEeeCCcceeeeecccCCCCCCccE Q lcl|NC_012418. 77 AMLVAQHRGELYLFDERDGRLLMGQPL------VHDYLKAADYRQLRAATVADDLFIANLSVKPEADRTDVKGVDPNKAG 150 (826) Q Consensus 77 ~~i~~~~~g~irv~d~~~g~~~~~~~~------~~~yl~~~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a 150 (826) |++++.++|.|+|||+.+|.+..+... ...||++...++||+++|||||||+|++++|++..+.. ....+.| T Consensus 81 y~~~~~~~g~~~v~~~~~G~~~~v~~~~~~~~~~~~yl~~~~~~~~~~~tv~d~tfi~N~~~~~~~~~~~~--~~~~~~~ 158 (976) T protein:vir:10 81 YIGQVSRSGDINMWRCSDGQAMTVNYDSGTATALTTYLTHTNDEDIQTLTLNDYTFLTNRTKTVAMSSTVE--PVRPPEV 158 (976) T ss_pred EEEEEecCCceEEEEccCCeEEEEEcCCCcccccchhhccCCcceeEEEEEccEEEEecCceEEeeccccc--CCCCceE Confidence 999999999999999998877765432 23488877777999999999999999999999876544 3445679 Q ss_pred EEEEcccccCeeEEEEEeccCCcceeeeeeeee----------------------------------------------- Q lcl|NC_012418. 151 WLYIKAGQYSKAFSMTIKVKDNATGTTYSHTAT----------------------------------------------- 183 (826) Q Consensus 151 ~~~vr~g~Y~r~ytv~i~g~~~s~~~t~~~ta~----------------------------------------------- 183 (826) +++||+|+|+|+|+|+|++...+...++..+.. 
T Consensus 159 ~~~v~~~~y~~~y~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~s~~~G~~~~~~~v~~ 238 (976) T protein:vir:10 159 FIDLKATAYARQYAVNLFDNTTTTAVSTVTRIDVELIKSSNNYCDSNGAMVARTSRPSNSTRCDDSAGDGRDAYAPNVGT 238 (976) T ss_pred EEEeeeeccceEEEEEEcCCcccceeeeeeeeeeccccCCcccccccccchhhHhHhhhhhcccccccccccccCceeee Confidence 999999999999999998776543222221100 Q ss_pred --EEeccCccc------------------c-------cc-------c---------cccceeeccc--hh----h----- Q lcl|NC_012418. 184 --YVTPDNAST------------------N-------PN-------L---------AEAPFQTSVG--YI----A----- 209 (826) Q Consensus 184 --y~~p~~~~t------------------~-------~~-------~---------~~~~~~~~~~--~i----~----- 209 (826) +..-++... + .+ . .+..+..++. +. . T Consensus 239 ~~f~~~~G~~~~i~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~Y~~~y~~~~~v~~~~~ 318 (976) T protein:vir:10 239 KVFNVTDGASLTDEANSGSYTYTIDVKDSSNNSVNRGVNLYFRIRTVGQSVPFTTGSGSSATTTYQARYTTTFDLLYGGT 318 (976) T ss_pred eEEEeccCccceEcCCcceEEEEeeccccceEEeecCCceEEEEccccceeecccccccceeeeeeEEEEeEEEEecCCC Confidence 000000000 0 00 0 0000000000 00 0 Q ss_pred -hhh-hheeecc--cceEEee-------eeeccceecc-----c---------c----cc-----cccccceEecccCCc Q lcl|NC_012418. 210 -WQL-YGKFFGA--PEYTLPN-------STKKYPKVDP-----D---------T----AA-----ATVAGYLNQRGVQDG 255 (826) Q Consensus 210 -~~l-~~~~~s~--~~~~~~~-------~t~~~~~~~~-----~---------~----~~-----~~~~~~~~~~~~~~~ 255 (826) ++- ....... ..+.+.. .......+.+ + . .. ....+++... .++ T Consensus 319 g~~~~~~~~V~v~g~~Y~it~~~~~~~~~~a~~~~~~~~~t~~d~~~~~~~~~ia~~L~~~l~a~~~~~g~tv~~--~g~ 396 (976) T protein:vir:10 319 GWQEGDYFYVWMKDGYYKITVEAISTANVQANLGLIRPNPTPFDTETAVTAESIIGDIRTAIIATGNFTSANVQQ--IGT 396 (976) T ss_pred CcccCceEEEEccccceeeEEEEeeceeEEeccccccCcCCcCcccccccHHHHHHHHHHhhcccccccceEEEE--cCc Confidence 000 0000000 0000000 0000000000 0 0 00 1112223332 345 Q ss_pred EEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEecCCCccceEEEEEecCC---- Q lcl|NC_012418. 
256 YIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVMATGSTKAPVYFEWDSAN---- 331 (826) Q Consensus 256 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~y~~~~~~~---- 331 (826) ++++.+.++... .+.++ .+.+.+..++|++++|||+.+| .|+.++ ++.+++..|+||++|+..+ T Consensus 397 ~~~i~~~~~~~~-~s~~~--~~~~~~~~~~V~~~~~LP~~~~----~g~~v~-----V~~~~~~~d~yyv~~~~~~~~~~ 464 (976) T protein:vir:10 397 GLYVTRPSGTFN-VTAPS--SDLLRVMSGEVANVDDLPSQCK----HGYVVK-----VANSEADADDYYVKFFGHNNRDG 464 (976) T ss_pred EEEEEecCcceE-ecCCC--ceeEEEEEeeecchhhhhhhcc----CCcEEE-----EecCCCCceeEEEEeeccccccc Confidence 667766665322 22222 3578899999999999999887 466665 5677788899999998755 Q ss_pred -ceEEEeeccccccc--ccceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccCCCceEEEEEcceEEEecCCeE Q lcl|NC_012418. 332 -RRWAERAAYGTDWV--LKKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRLVLLSQEYV 408 (826) Q Consensus 332 -~~w~E~~~~g~~~~--~~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~~~q~RL~~~~~~~v 408 (826) ++|+||++||...+ ..||||.|+ ++++|+|.+++++|+.|.|||+++||+|+|+|++|++|+||||||+|++|++| T Consensus 465 ~~~w~E~~~~g~~~g~~~~tmP~~l~-~~~~g~f~~~~~~w~~r~vGd~~tnp~psf~g~~is~v~f~q~RL~f~s~~~v 543 (976) T protein:vir:10 465 DGVWEECAKPSRNIEFDKGTMPIQLV-RQANGTFTVSQATWQNAEVGDELTNPNPSFVGKTINQLVFFRNRLVFLSDENV 543 (976) T ss_pred cceEEEeeccccccccccccccEEEE-ecccCeEEeeeccccccccCCcccCcCceecccccceEEEEcceEEEecCCeE Confidence 48999999997655 469999997 78899999999999999999999999999999999999999999999999999 Q ss_pred EEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCC-ccccccceEEEEEEeecc Q lcl|NC_012418. 
409 CMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGG-GIVTPRTAVISITTQYDL 487 (826) Q Consensus 409 ~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~~~-~~lTP~~~~~~~~s~~~~ 487 (826) ||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+++ ++|||+|++++++|+|+| T Consensus 544 ~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~e~~lsg~~~~lTP~t~~i~~~s~~~~ 623 (976) T protein:vir:10 544 IMSRPGEFFNFWSKTATTFTPQDVIDLSCSSTYPAIVYDGIQVNAGLLLFTKNQQFMLTTDSDILSPETAKINAVSSYNF 623 (976) T ss_pred EEEecCCccccccccccCCCCCccEEEEecCCcceeeEEEEecCCcEEEEecCceEEEecCCceecceeEEEEEEEeeec Confidence 9999999999999999999999999999999999999999999999999999999999985 599999999999999999 Q ss_pred cCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEEEEEcCCCCEEEEEEcCCCeEEE Q lcl|NC_012418. 488 DTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAASSGYLVFGTSTADEMIC 567 (826) Q Consensus 488 ~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~s~~p~~~v~~~~~~g~l~~ 567 (826) +++++|+.+|++++|++++| ++++++||.|++++++ +.++|||+|++|||++++..|++|++|.+++|+++++|+|++ T Consensus 624 ~~~v~Pv~vG~~v~Fv~~~g-~~~r~~~~~~~~~~~~-~~~~dlt~~~~~l~~g~~~~~a~~~~~~~vv~~~~~~g~l~~ 701 (976) T protein:vir:10 624 NEKTHPVSLGTTVAFIDNAN-QFTRFFEMSNVVRQGE-PDVVDQSKVISRLLDKNISLVSVSRENSVVFFSQKDTDKIYC 701 (976) T ss_pred cCCCccEEeCCeEEEEecCC-CeEEEEEEeecccccc-cchhHHHHHhhhhcCCceEEEEEcCCCcEEEEEEcCCCEEEE Confidence 99999999999999999888 5899999999999887 788999999999999999999999999999999999999999 Q ss_pred EEEeeCCCceeeEeeEeeecCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEeecCCcCCCC--------cccccceEEEE Q lcl|NC_012418. 568 HQYLWQGNEKVQNAFHRWTLRHQIIGTYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQ--------YPKYDYWRRIE 639 (826) Q Consensus 568 ~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~~d~l~~vv~R~~~~~~~r~~~~~~~~~~~~~--------~~~~~~~~~~~ 639 (826) |||||+++||+|+|||||+|+|+|++||+++|+|||+|+|+++++++||.|+..+...... 
...+.++.++| T Consensus 702 ~ty~~~~~eq~v~aWsr~~~~G~v~sv~~i~D~ly~vV~r~~~g~~~r~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~lD 781 (976) T protein:vir:10 702 FRYFTSGEKRLLQAWTTWTITGNIQYHCMLDDALYVVTRNNNKDQIVKYSLKLDDAGHFVTDTQGTTSTDDDSIYRVHLD 781 (976) T ss_pred EEEeecCCceeEEeeEEEecCCcEEEEEEeCCeEEEEEEecCCeEEEEEEEEECCccceeeeccCccccccCCcceeeec Confidence 9999999999999999999999999999999999999999999999999887654432211 11334444555 Q ss_pred eeccce----------eeecccccCccccccceEEeeeeeeEeeccEE----cccEecCCCeEEEeecCCcCCceEEEEE Q lcl|NC_012418. 640 ATVEGE----------LELTKQHWDLIKDAPAVYQLQPVAGAFMERYQ----LGVKRETNTKVFLDVPEAVVGSVYVVGC 705 (826) Q Consensus 640 ~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~g~~~l~~~~~~~~~~v~vG~ 705 (826) ...... .......++.+... +..+...+++.. .....+.++.++| |++..+++||||| T Consensus 782 ~~~~~~~~~~t~~~~t~~t~~~~~~~~~~~------~~~~~~~~d~~~~~~~~~~~~v~g~~i~l--~g~~~~~~v~VGl 853 (976) T protein:vir:10 782 HSSSVTAASNTYNTTTIKTTIPKPNGYEST------KQLVAYDTDAGNDLGRYALVTVSGSNLEI--PGNWSNNSFIIGY 853 (976) T ss_pred cceEEEeccccccCCceeEEeecCccccCc------eeEEEEecccCcccccceeeeecCCeeEe--cCCCCCCeEEEee Confidence 432211 11112222222221 222222222211 1223455666554 6788889999999 Q ss_pred eeeeEEEeCCeEEECCCCce---eeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccccccccccccCcccccc Q lcl|NC_012418. 706 EFWSKVEFTPPVLRDHNGLP---MTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLVDS 782 (826) Q Consensus 706 ~y~~~~~~~~~~~~~~~g~~---~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t 782 (826) +|+++++|+||++++++|.. ...+||+|+|++|+|.+||.|++.+++..+....... 
...+ ......+.+|+.+ T Consensus 854 ~Y~s~~~~~~~~i~~~~g~~~~~~~~gRl~i~r~~~~~~~tg~~~v~v~~~~~~~~~~~~-~~~~--~~~~~~~~~pl~~ 930 (976) T protein:vir:10 854 LYEMDVQLPTLYVTQQVGDKYRSDAKSSLIVHRIKFSFGPLGVYSTTIQRDGKPDFTETK-ELGL--AGVVGASRLPIVP 930 (976) T ss_pred eeEEEEeecceeEEeCCCCcccccceeeEEEEEEEEEeecccceEEEEcCCCCccccccc-cccc--cCcccccccceec Confidence 99999999999999877643 3457999999999999999999999887655333222 2222 1233344566665 Q ss_pred c-eEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecc-cccC Q lcl|NC_012418. 783 A-VVPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKSNQT-YRRV 826 (826) Q Consensus 783 g-~~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r-~rrv 826 (826) + .+.||+.+|+++.+|+|+|++|+||+|++|+|||+||+| +||| T Consensus 931 ~~~~~vP~~~~~~~~~v~i~~d~PlP~tilsi~~eg~yn~r~~r~~ 976 (976) T protein:vir:10 931 EVIETVPCYERNTNLKVNVKSEHPAPATLYSLAWEGDFTNRFYKRV 976 (976) T ss_pred CcEEEEEeccCCceeEEEEEECCCCceEEEEEEEEEEeccceeecC Confidence 4 588999999999999999999999999999999999888 8888 No 16 >protein:vir:78703 Length: 905 # NCBI annotation: tail tube B # Family: family:all:825 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285450;genbank:gi:148724484;genbank:GeneID:5220174 Probab=100.00 E-value=7e-213 Score=1183.68 Aligned_cols=800 Identities=19% Similarity=0.243 Sum_probs=616.8 Q ss_pred CcceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCCCceEEEE Q lcl|NC_012418. 
1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~ 80 (826) ||+|+|+||||++|||||||++|+|+|+++|+||+|||+.||+||||++||+.|.++ ....+++|+++||+.|+|+++ T Consensus 1 M~~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRp~~~~i~~l~~~--~~~~~~~~~~~r~~~e~y~~~ 78 (905) T protein:vir:78 1 MGAVLQKIPNLLGGVSQQPDPVKLPGQVREAENVYLDPTFGCRKRPATKFVGELATN--LPSDTRWFPIFRDAGERYAVA 78 (905) T ss_pred CccceecchhhhCceeecchhhcCCcchhhhhccccccccccccCchhhhhhhhcCC--CCCCceEEEEEeCCCceEEEE Confidence 999999999999999999999999999999999999999999999999999999865 346789999999999999999 Q ss_pred EecCC----eEEEEECCCCEEEEec--CcccccccCCCcccEEEEEecCEEEEeeCCcceeeeecccCCCCCCccEEEEE Q lcl|NC_012418. 81 AQHRG----ELYLFDERDGRLLMGQ--PLVHDYLKAADYRQLRAATVADDLFIANLSVKPEADRTDVKGVDPNKAGWLYI 154 (826) Q Consensus 81 ~~~~g----~irv~d~~~g~~~~~~--~~~~~yl~~~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~~~v 154 (826) +..+| .|||||+.+|.+..+. +....||++++.++||+++|||||||+|++++|+++.++. ++++++|+++| T Consensus 79 ~~~~g~~~~~i~v~d~~~G~~~~V~~~~~~~~yl~~~~~~~l~~~tv~d~tfi~N~~~~~~~~~~~~--~~~~~~~~~~v 156 (905) T protein:vir:78 79 LYKDGSGNTQVRVWDMQTGAERTVTPDATATAYLATTNLNNLNWLTVADYTLLSNKERIVTMSGASE--VDSNQRALVEI 156 (905) T ss_pred EeeCCCCCcceEEEEccCCcEEEEecCCCccceeecCCCcceEEEEEcCEEEEEcCceeeeecCCCC--cCCCCeEEEEE Confidence 98776 4999999888665553 4567999998878999999999999999999999876554 46677899999 Q ss_pred cccccCeeEEEEEeccCCcceeeeeee-eeEEeccCccccc--ccc----cc------------ceeeccchh------- Q lcl|NC_012418. 155 KAGQYSKAFSMTIKVKDNATGTTYSHT-ATYVTPDNASTNP--NLA----EA------------PFQTSVGYI------- 208 (826) Q Consensus 155 r~g~Y~r~ytv~i~g~~~s~~~t~~~t-a~y~~p~~~~t~~--~~~----~~------------~~~~~~~~i------- 208 (826) |+|+|+|+|+|+|++...+...++... +....+.+.+... ... .. ....+.+.. 
T Consensus 157 ~~g~y~r~y~v~I~~~~~~~~~t~~~~~a~~~s~~s~~~~~~g~~~~~~~~~~~~~t~~~~~~l~f~~~~~~~~~~~~~~ 236 (905) T protein:vir:78 157 NAISYNTTYSIDLDRDGASQQVKVYRAKALEISPGSFEVEDGGVCTEHDVQNYTNQTIGSSTGLAFQVRVQCAAYLENNE 236 (905) T ss_pred EeeccceeEEEEEeCCCCceeeeeeccccceeccccccccccccccccceeeeecceeeccCCceeEEeeccccccCCCc Confidence 999999999999999887765554322 1111111110000 000 00 000000000 Q ss_pred ----hhhhhheeecccceE------Eeeeeecc--------------------ceec--ccccc--------------cc Q lcl|NC_012418. 209 ----AWQLYGKFFGAPEYT------LPNSTKKY--------------------PKVD--PDTAA--------------AT 242 (826) Q Consensus 209 ----~~~l~~~~~s~~~~~------~~~~t~~~--------------------~~~~--~~~~~--------------~~ 242 (826) ...-...+++..+|. +......+ .... ..... .. T Consensus 237 ~~~~~~~~~~l~~g~~~~~~~~~~~v~~~g~~y~i~i~~~~~~~~~~~~~~~~~~t~~d~~a~~~~~~~i~~~l~~~~~~ 316 (905) T protein:vir:78 237 YRSRYNVSVVLQNGGTGFRKGDMITVNLNGRDYNIRVTQEEFVYTYASDGTAAHTTPQDSTAGTLDIGQITAGLVNSVNL 316 (905) T ss_pred ccccccceeeeeccccccccCccEEEeeccceEEEEEecceeEEEecCCCcccccCccCccCccccHHHHHHHHHHhhcc Confidence 000000000000000 00000000 0000 00000 01 Q ss_pred cccceEecccCCcEEEEEcCCCeEE-EEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEecCCCccc Q lcl|NC_012418. 243 VAGYLNQRGVQDGYIAFRGDGDIVV-EVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVMATGSTKA 321 (826) Q Consensus 243 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 321 (826) .+.++.. ..++++++...++..+ ..+.+++.+.++.++.++|+++++||+++|. |+.+++ +...+...| T Consensus 317 ~~~~~~~--~~g~~i~v~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~~~~----g~~v~v----~~~~~~~~d 386 (905) T protein:vir:78 317 ISNYSAQ--AVGNVIEIERTDGRDFNLGVRGGATNRAMTAIKGTANSIVDLPGQCFD----GFELKV----INTENAESD 386 (905) T ss_pred cccEEEE--ecCcEEEEEecCCCccEEEEeccCCcceEEEEeccccccccCccccCC----CcEEEE----EeCCCCCcc Confidence 1223333 3456666666655322 3566777788889999999999999998874 454443 222345668 Q ss_pred eEEEEEecC------CceEEEeecccccccc--cceeEEEEEecCCCeeEEeecC-------CcccccCCcccccCcccc Q lcl|NC_012418. 
322 PVYFEWDSA------NRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELD-------YDRRGSGDEDTNPTFNFV 386 (826) Q Consensus 322 ~~y~~~~~~------~~~w~E~~~~g~~~~~--~tmp~~~~~~~~~~~f~~~~~~-------w~~r~~gd~~tnp~psf~ 386 (826) .|||+|+.. +++|+||++||+.+++ .||||.++ ++++|+|.++..+ |++|.+||+++||+|+|+ T Consensus 387 ~yyv~~~~~~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~-r~~~g~f~~~~~~~~~~~~~~~~r~~Gd~~Tnp~psf~ 465 (905) T protein:vir:78 387 DYYVVFRSAAEGIPGSGSWEETVAPGIERGFNTSTMPHALI-RQADGNFTLEALNDEGTITGWAQREVGDDDTNPKPSFV 465 (905) T ss_pred eEEEEEEecccCCcCceeEEEecccccccccccccccEEEE-EecCceEEEEEeccccccccccccccCCcccCCCCccc Confidence 999999754 4689999999987655 69999997 7889999999886 999999999999999999 Q ss_pred CCCceEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEE Q lcl|NC_012418. 387 TRGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVV 466 (826) Q Consensus 387 g~~~~~v~~~q~RL~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~ 466 (826) |++|++|+||||||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+| T Consensus 466 g~~is~v~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdDpI~~~~ss~~~~~i~~~v~~~~~L~ifT~g~ef~l 545 (905) T protein:vir:78 466 GRGISDMFFYNNRLGFLSEDAVIMSQPGDYFNFFVTSAITISDSDPIDVTASSTKPAILRAAIGAPKGLILFAENSQFLL 545 (905) T ss_pred CCCcceEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeEEEeecCCcEEEEecCceEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eCCc-cccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEE Q lcl|NC_012418. 467 PGGG-IVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEY 545 (826) Q Consensus 467 ~~~~-~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~ 545 (826) ++++ +|||+|++++++|+|+|+++|+|+.+|+++||++++| ++++||||.|+.++|+ |+++|||+|++|||++++.. 
T Consensus 546 sg~~~~lTP~s~~i~~~S~~~~~~~v~Pv~vG~~vlFv~~~g-~~s~vre~~y~~~~d~-y~a~DlT~~a~hl~~g~v~~ 623 (905) T protein:vir:78 546 ASQEVVFSTATIKLTEISDYFYRSLAKPVSTGVSIAFVSEAD-TYSKIFEMSIDSVDNR-PQVADITRIVPEYVPTGLTW 623 (905) T ss_pred ecCCccccceeEEEEeEEeecccCCCCcEEeCCeEEEeecCC-CeeEEEEEEeeecccc-eehhHHHHHHHHhcCCceEE Confidence 9865 7999999999999999999999999999999999987 5789999999998887 89999999999999999865 Q ss_pred EEEcCCCCEEEEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEe--ecCC Q lcl|NC_012418. 546 IQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYFTGDNLMVLIQKGQEIALGRMHLN--SLPA 623 (826) Q Consensus 546 ~~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~~d~l~~vv~R~~~~~~~r~~~~--~~~~ 623 (826) +++++|+.+||+++++|+|++|+||++++||+|+|||||+|+|.+++||++.|++|++|+|..++...++.++ ..++ T Consensus 624 -~~~s~~~~~v~~~~~~~~l~~ytyl~~~~eq~v~AWsrw~~~G~~~~~a~i~d~~~~vV~r~~~G~~~~~~~~l~~~~~ 702 (905) T protein:vir:78 624 -SVSTPNNSMMLFGDNSNTAYIFKFFNQGNERQVAGWSKWILPGEQRMCGFFADTGYFVLYDSTTGSYVLSAMELLDDPD 702 (905) T ss_pred -EEecCCCcEEEEEcCCCeEEEEEeecCCCceeEEeEEEEecCCCeEEEEEEcCCEEEEEEEccCCeEEEEEEeeccccC Confidence 4567888899999999999999999999999999999999999999999999999999999988776665444 3344 Q ss_pred cCCCCcccccceEEEEeecccee-e---ecccccCccccccceEEeeeeeeEeeccEEcccE---ecCCCeEEEeecCCc Q lcl|NC_012418. 624 REGLQYPKYDYWRRIEATVEGEL-E---LTKQHWDLIKDAPAVYQLQPVAGAFMERYQLGVK---RETNTKVFLDVPEAV 696 (826) Q Consensus 624 ~~~~~~~~~~~~~~~~~~~~~~~-~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~g~~~l~~~~~~ 696 (826) ....+....++..+++++..... + ...............|+.+..+....|+...+.. 
.+..+.++++ T Consensus 703 ~~~~d~~~~~~~~~~d~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dG~~~~~~~~~~~~~~~~t~~----- 777 (905) T protein:vir:78 703 SASIDTAFSSFLPRLDNYVVKSDLTVVDNGDGTLTVDLEAGQAMTGATPVIMFTDGPSEFAFSQPTITAGQFTVD----- 777 (905) T ss_pred ccccccceeeeeeccceeeecccceecccCcceEeeeccCccccccceeEEEeeCCceeeeEEEEEeeceeeccc----- Confidence 44455555555555565543221 0 0001111001112224445555555666443322 2233333332 Q ss_pred CCceEEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccccccccccccC Q lcl|NC_012418. 697 VGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAG 776 (826) Q Consensus 697 ~~~~v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 776 (826) .+.+|+|||+|+++++|+||+++.+++... .+|++|+|++|+|++|++|.+++++..++... ...+.++.+..+..+ T Consensus 778 ~a~~v~VGl~Y~s~v~~~p~~~~~~~~s~~-~~~~rI~rv~lr~~~Sg~~~v~v~~~~~~~~~--~~~~~~~~~~~~~~~ 854 (905) T protein:vir:78 778 TTDDFVVGFKYETKITLPGFFTSEENKADR-VYAPIVEFLYLDLYYSGRYQIEVDRIGYDTIN--IDAGSIDANIYLADG 854 (905) T ss_pred cCCeEEEeeeeeEEEeecceEeccCCCccc-ccceEEEEEEEEeecceeEEEEEcCCCcceec--ccccceecCcccCcc Confidence 456799999999999999998887665544 57889999999999999999999988776532 234556667777778 Q ss_pred ccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecc-cccC Q lcl|NC_012418. 
777 EPLVDSAVVPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKSNQT-YRRV 826 (826) Q Consensus 777 ~~~~~tg~~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r-~rrv 826 (826) .|+..+|+++||+.+|+++.+|+|+|++|+||+|++|+|||+||+| ++|| T Consensus 855 ~p~~~tg~~~vP~~g~~~~~~v~I~sd~PlP~tvlsi~weg~Yn~r~~~~~ 905 (905) T protein:vir:78 855 APLKEIATENVPLFTPGDQVTVTIKAPDPFPSAITGYSWQGHYNRRGIAFI 905 (905) T ss_pred cccccccEEEEEeeccCceeEEEEEECCCCcEEEEEEEEEEEeccceeecC Confidence 8888999999999999999999999999999999999999999999 8899 No 17 >protein:vir:103341 Length: 806 # NCBI annotation: tail tubular protein B-like protein # Family: family:all:825 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039670;genbank:gi:125999999;genbank:GeneID:4818417 Probab=100.00 E-value=1.3e-211 Score=1176.75 Aligned_cols=771 Identities=20% Similarity=0.298 Sum_probs=620.5 Q ss_pred cceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCCCceEEEEE Q lcl|NC_012418. 2 SYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLVA 81 (826) Q Consensus 2 ~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~~ 81 (826) =.|+||||||+||||||||++|||+||++|+||+|+|+|||+||||++||+++++.. ...+++|+|+|++.+++|+++ T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~~Gl~rRPgt~~va~l~~~~--~~~~~~~~~~~~~~~~~y~v~ 78 (806) T protein:vir:10 1 MEVQGSYGRQLQGVSQQPIAVRLPGQVTSQLNAVPNVVDGLKTRMGSKHLARILNSL--DANSLIHHYKRGDDAEEYFVI 78 (806) T ss_pred CeeEeecchhccceeccChhHhhhhhhhhhhcceeccccccccCCchhhhhhhcCCC--CccceEEEEEecCCceEEEEE Confidence 479999999999999999999999999999999999999999999999999987643 357888999996665556667 Q ss_pred ecCCeEEEEECCCCEEEEecC--cccccccCC--CcccEEEEEecCEEEEeeCCcceeeeecccCCCCCCccEEEEEccc Q lcl|NC_012418. 82 QHRGELYLFDERDGRLLMGQP--LVHDYLKAA--DYRQLRAATVADDLFIANLSVKPEADRTDVKGVDPNKAGWLYIKAG 157 (826) Q Consensus 82 ~~~g~irv~d~~~g~~~~~~~--~~~~yl~~~--~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~~~vr~g 157 (826) +.+|+||||+..+|..+.+.. ....||++. +..+|+++||||+|||+|++++|++...+.. 
+++.++++++|+| T Consensus 79 ~~~g~i~v~~~~~G~~~~v~~~~~~~~yl~~~~~~~~~l~~~tvaD~tfi~n~~~~~~~~~~~~~--~~~~~~~v~v~~g 156 (806) T protein:vir:10 79 LQPGQVPVIFTVGGLACPVNTQGSAATYLSSSSLPRETTQLMTIGDYTFVLNRKMPVQARGDVTP--SLDNKGLVYVAYA 156 (806) T ss_pred EcCCcEEEEEcCCCcEEEecCCCceEEEeccCCCCcceeeEEEEcCEEEEecCcEeeeecccccC--CCCcceEEEEeec Confidence 778899999988887666543 334576653 3468999999999999999999998765443 4556899999999 Q ss_pred ccCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeeeeccceeccc Q lcl|NC_012418. 158 QYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPD 237 (826) Q Consensus 158 ~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~~~~~~~ 237 (826) +|+++|+++|++.. ++.+++|++... ......++.+++.++++.+..... T Consensus 157 ~y~~~y~i~Ing~~---------~a~~~t~~~~~~-----~~~~~~~~~~~a~~l~~~l~~~~~---------------- 206 (806) T protein:vir:10 157 NFSFTYQILINGQV---------AAEHKTASSEDV-----KNEDLVRTDYVAGKLLENFNSRTA---------------- 206 (806) T ss_pred ccCceeeEEeccce---------EEEEEeccCCCc-----ccccccchhHHHHHHHhhhccccc---------------- Confidence 99999999998653 456677765432 233445677888888776543211 Q ss_pred ccccccccceEecccCCcEEEEEcCCC-eEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEecC Q lcl|NC_012418. 238 TAAATVAGYLNQRGVQDGYIAFRGDGD-IVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVMAT 316 (826) Q Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~~~ 316 (826) ...++.....+. .+++..... .....+.++++++.+.+..++|+++++||.++|. |+.+. 
+..+ T Consensus 207 ----~~~~~~~~~~g~--~~~i~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~lp~~~~~----g~~v~-----i~~~ 271 (806) T protein:vir:10 207 ----SFPGFSMYQDGN--VLVVDNSNGANYALTTVDGADGQDLVAIRHKVTNLDTLPNRAPV----GYKVQ-----VWPT 271 (806) T ss_pred ----ccceeEEEEccc--EEEEecCCCCccEEEEeeCCCCceeEEeecccCccccCccccCC----CcEEE-----Eecc Confidence 111222222222 233322222 1223467788899999999999999999988875 34433 3344 Q ss_pred -CCccceEEEEEec---CCceEEEeeccccccc--ccceeEEEEEec----CCCeeEEeecCCcccccCCcccccCcccc Q lcl|NC_012418. 317 -GSTKAPVYFEWDS---ANRRWAERAAYGTDWV--LKKMPLALRWDE----ATDTYSLNELDYDRRGSGDEDTNPTFNFV 386 (826) Q Consensus 317 -~~~~~~~y~~~~~---~~~~w~E~~~~g~~~~--~~tmp~~~~~~~----~~~~f~~~~~~w~~r~~gd~~tnp~psf~ 386 (826) +...+.||++|+. ++++|+||++++...+ ..+|||.++... .+++|.+++++|++|.+|||++||+|+|+ T Consensus 272 ~~~~~~~y~v~~~~~~~~~~~w~e~~~~~~~~~~~~~t~p~~~v~~~~~~~~~~~~~~~~~~w~~r~~Gd~~tn~~psF~ 351 (806) T protein:vir:10 272 GSKPESRYWLQAESQDGSKVTWVETIAPGVRKGWNAATMPHVLVRESLNANGSANFTYRPGEWEDRDVGDDLTNDFPSLL 351 (806) T ss_pred CCCCCCceEEEEEeeccCceEEEeecccccccceeccccceEEEeeeeeecccceeEEEecccccccccccccCccCccc Confidence 4456788999964 4578999999996554 569999998543 37899999999999999999999999999 Q ss_pred C----CCceEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCc Q lcl|NC_012418. 
387 T----RGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKY 462 (826) Q Consensus 387 g----~~~~~v~~~q~RL~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~ 462 (826) | ++|++|+||||||+|++|++|||||+||||||+++|+++++|||||+++++++++|+|+|+++++++|+|||+++ T Consensus 352 ~~~~~~~it~v~f~q~RL~f~s~~~v~~Srsgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~t~~~ 431 (806) T protein:vir:10 352 NDSSPQPISSMLMVQNRLMLTSGEAVVASRTSRFFDFFRYTVLATVDTDPFDVFADIEEVYNIRWSAQMDGDVVLFTSDQ 431 (806) T ss_pred CCCCCccceEEEEEeeeEEEecCCeEEEEccCCcccCccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCc Confidence 8 789999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeCCccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCC Q lcl|NC_012418. 463 QAVVPGGGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGP 542 (826) Q Consensus 463 q~~~~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~ 542 (826) ||+|+++++|||+|++++++|+|+|+++++|+.+|++++|++++| ++++||||+|+.++|+ |+++|||+|++|||+|+ T Consensus 432 q~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g-~~s~vre~~y~~~~d~-~~~~DlT~~~~hl~~g~ 509 (806) T protein:vir:10 432 QFTLPGDKPLTPTSAVIRPVTQFKMTPGVKPAPSGDSILFAFDQG-SYSGIREFFTDSYSDT-KKAQPATSHVDKYIRGK 509 (806) T ss_pred EEEEeCCCcccceeEEEEEEEeecccCCCCceEeCCeEEEeeCCC-CeeEEEEEEeeeeccc-eehhhHHHHHHHhcCCC Confidence 999999999999999999999999999999999999999999988 5789999999988887 89999999999999997 Q ss_pred eEEE-EEcCCCCEEEEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCCc--EEEEEEECCeEEEEEEeCC--CEEEEEEE Q lcl|NC_012418. 543 AEYI-QAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQ--IIGTYFTGDNLMVLIQKGQ--EIALGRMH 617 (826) Q Consensus 543 v~~~-~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~--v~~~~~~~d~l~~vv~R~~--~~~~~r~~ 617 (826) +..+ ++|++|..++|++++||+|++|||||+++||+|+|||||+++|. 
+++|++++|+||++|+|++ ++...+ + T Consensus 510 ~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~e~~v~aW~rw~~~~~~~~~~~~~~~d~l~~vv~R~~~~~g~~~~-~ 588 (806) T protein:vir:10 510 VLELSASSSFNRAFIITSPDRNILYVYDWLYEGTEKVQNAWHKWSFPAGTVLHAVSYSNEKLYLVLTRTNTSGGVAGV-Y 588 (806) T ss_pred eEEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceEEEeEEeeeeCCCeEEEEEEEecCeEEEEEEEcCCcccEEEE-E Confidence 6555 56677778999999999999999999999999999999999765 7788999999999999987 444333 4 Q ss_pred EeecCCcCCCCcccccceEEEEeeccceeeeccc--ccCccccccceEEeeeeeeEeeccEEcc-------cEecCCCeE Q lcl|NC_012418. 618 LNSLPAREGLQYPKYDYWRRIEATVEGELELTKQ--HWDLIKDAPAVYQLQPVAGAFMERYQLG-------VKRETNTKV 688 (826) Q Consensus 618 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~g~~ 688 (826) +|++......+. .+.+..++|+.....+..... .+.........|+++..+...+++.... .....+..+ T Consensus 589 iE~~~~~~~~~~-~~~~~~~lD~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~g~~~~~g~~~~~~~~~~~~~v 667 (806) T protein:vir:10 589 IEVMDMGDELEY-GLQDRVRMDRRATLSMTYNATTRVWTSSALPWLPQDLSSLDAVLVSGWAGYVGGAFQFSYNASNNTI 667 (806) T ss_pred EEeecCCCCCCc-ccceeeeccccceEEEeccccccceeeeeeccccccccceeEEEEeeccccCCceEEEEEcCccceE Confidence 555544333322 344555667655444322111 1111111112255666666666654321 011111222 Q ss_pred EE--eecCCcCCceEEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccc Q lcl|NC_012418. 689 FL--DVPEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPL 766 (826) Q Consensus 689 ~l--~~~~~~~~~~v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~ 766 (826) +. 
+.+ +..+.+|+|||+|+++++|+||+++++++++...+|+||+|+++++.+|++|.+.|+++.+.....+.+.++ T Consensus 668 ~~~~~~~-~~~~~~v~vGl~Y~s~~~~t~p~~~~~~~~~~~~~r~~l~r~~~~~~~s~~~~~~v~~~~~~~~~~~~~~~~ 746 (806) T protein:vir:10 668 STNFDLA-EGNTATIVVGETYWYEVEPTPPLIKDSKDRVSYLDTPTVGNVYLNLDMYPDFSVVVTDKETLQERTVYLANK 746 (806) T ss_pred eeeeeec-CCCCcEEEEeeeeeEEEEECCeeEeccCCCccccccEEEEEEEEEeecceeeEEEEcccCCCcceeeeccCc Confidence 22 221 235778999999999999999999999998888899999999999999999999999988877778888999 Q ss_pred cccccccccCccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecccccC Q lcl|NC_012418. 767 RLFSRQLNAGEPLVDSAVVPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 767 ~~~~~~~~~~~~~~~tg~~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r~rrv 826 (826) ++++.++.+|.||+++|+++||+.+|+++.+|+|+|++|+||+|+||+|||+||+|+||| T Consensus 747 ~~~~~~~~~g~~~~~tg~~~vp~~~~~~~~~v~i~~d~P~P~tvlai~~eg~y~~r~~rv 806 (806) T protein:vir:10 747 TAGSITNVIGYIAPHEGTLRIPLRRKSTDVSFKIRSKSPATFQLRDIEWTGSYNPRKRRV 806 (806) T ss_pred ccccccccccccccccceEEEEeeecCceeEEEEEECCCCceEEEEEEEEEEeecccccC Confidence 999999999999999999999999999999999999999999999999999999999999 No 18 >protein:vir:103790 Length: 768 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024932;genbank:gi:48697202;genbank:GeneID:2846114 Probab=100.00 E-value=6e-168 Score=937.36 Aligned_cols=722 Identities=13% Similarity=0.082 Sum_probs=509.8 Q ss_pred Ccceeeechhhhcc-----cccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCccc-ccccccEEEEEeCCC Q lcl|NC_012418. 1 MSYKQSAYPNLLMG-----VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQ-PWPRPFLYHTNLGGR 74 (826) Q Consensus 1 M~~v~~s~~n~~gG-----VSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~-~~~~~~~~~~~rd~~ 74 (826) |++++++++||++| +++|+|++||++||++|+||+|+|+|||+|||||+||+++++++. .+++||. |++. 
T Consensus 1 M~~~~~~~~~F~~GelsP~l~~r~Dl~ry~~~~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~lipf~----~~~~ 76 (768) T protein:vir:10 1 MPKAAPQQVSFDAGELSPLLGARVDLAKYPNGCQVMENFIATVQGPAIRRGGKRFVAATKDSTKQSWLLPFI----VADG 76 (768) T ss_pred CCcceeeeeeccCceechhhcccchHHHHHHHHhhhhcceeeecCCceecCchhhhhhhcCCCCCeeEEEEE----ecCc Confidence 99999999999999 899999999999999999999999999999999999999987553 2454444 4444 Q ss_pred ceEEEEEecCCeEEEEECCCCEE-------EEecCcccccccCCCcc-cEEEEEecCEEEEeeCCcceeeeecccCCCCC Q lcl|NC_012418. 75 SIAMLVAQHRGELYLFDERDGRL-------LMGQPLVHDYLKAADYR-QLRAATVADDLFIANLSVKPEADRTDVKGVDP 146 (826) Q Consensus 75 e~~~i~~~~~g~irv~d~~~g~~-------~~~~~~~~~yl~~~~~~-~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~ 146 (826) +.|++++++++||||+. +|.+ .+..+....||+..++. +|+++|+||+|||+|++|||++.... ...++ T Consensus 77 -~~y~l~fg~~~irv~~~-~g~v~~~~~~~e~~tp~~~~~l~~~~~~~~L~~~q~aD~~~i~~~~~~p~~l~r~-~~~~w 153 (768) T protein:vir:10 77 -IAYMLEFGDHYIRFFVN-RGQLVNAGAPVEIATPYALADLTTEDGTFAIRATQSADTMYLFHGGYPTQKLLRT-SATTF 153 (768) T ss_pred -cEEEEEEcCCEEEEEEC-CcEEEecCeeEEEEcCCCcceeecccccceeEEEeecCEEEEEcCCcceeEEEEe-cCCCc Confidence 45555667899999975 4433 34455666677665554 79999999999999999999874322 12222 Q ss_pred CccEEEEEcccccCe---eEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceE Q lcl|NC_012418. 147 NKAGWLYIKAGQYSK---AFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYT 223 (826) Q Consensus 147 ~~~a~~~vr~g~Y~r---~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~ 223 (826) ..+.+.++.|.|.+ +.+++++....++..+...++....|..+........ .. . .....|. 
T Consensus 154 -~l~~~~~~~gp~~~~n~~~~vti~~s~~~~~~T~tasa~~~~~~~v~~~~~l~~---------~~---~---~~~~~~~ 217 (768) T protein:vir:10 154 -SLQPVTFVGGPFAAVNSDNNVRVHASAGTGAVTLVASASVFRPSDVGTLFYLEQ---------ED---N---SFVKPWV 217 (768) T ss_pred -eeEEeeecCccccccccceeEEEEecccceeEEEeecCCccchhhcceeeeeee---------ec---c---ccccccE Confidence 22445567766643 4555555554444333322222222222211100000 00 0 0000111 Q ss_pred EeeeeeccceecccccccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccce Q lcl|NC_012418. 224 LPNSTKKYPKVDPDTAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPG 303 (826) Q Consensus 224 ~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g 303 (826) .... .+.+++....+........+++.... .. ...|.. ..+ T Consensus 218 ~~~~-------------------------~g~~~~~~~~~~~~~~~~~~~~~~~~--~~-------t~~~~~-----~~~ 258 (768) T protein:vir:10 218 VHQK-------------------------IGPSELRRVGDRVYLCTAVGTATPQV--TG-------TETPTH-----TSG 258 (768) T ss_pred EEEe-------------------------eeeEEEEecCCceEEeeeeccccccc--cc-------eecccc-----ccC Confidence 0000 00111111111111111111111110 00 111111 112 Q ss_pred eEEEEEeeeEecC-CCccceEEEEEecCC---ceEEEeecccccccccceeEEEEEecCCCeeEE--------eecCCcc Q lcl|NC_012418. 304 TGVQFMDGAVMAT-GSTKAPVYFEWDSAN---RRWAERAAYGTDWVLKKMPLALRWDEATDTYSL--------NELDYDR 371 (826) Q Consensus 304 ~~~~~~~~~~~~~-~~~~~~~y~~~~~~~---~~w~E~~~~g~~~~~~tmp~~~~~~~~~~~f~~--------~~~~w~~ 371 (826) +......+....+ .......+++|...+ +.|.++ +..+|++..+... .+++.+ ....|.- T Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-------~~~t~~~~~~~~~-~~~~~~~~~~~~~~~~~t~~~ 330 (768) T protein:vir:10 259 SRWDGTGQDESATDEYGSIGAEWEYQHSGYGTVLITGY-------TNDQVVTGTVATN-DPADPGMLPNTVVTLTGTYKW 330 (768) T ss_pred ceEEEecCcccccccccccceEEEEEEcCCceEEEEEe-------cCCeeEEeeeeee-cCcccccccccccccCCCccc Confidence 1111111100000 001111233443333 334433 3446777766432 233333 3334666 Q ss_pred cccCCcccccCccccCCCceEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeec Q lcl|NC_012418. 
372 RGSGDEDTNPTFNFVTRGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTF 451 (826) Q Consensus 372 r~~gd~~tnp~psf~g~~~~~v~~~q~RL~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~ 451 (826) +..+++++||+|+ +|+||||||+|++|++|||||+||||||++++++++.|||||+++++++++++|+|++++ T Consensus 331 ~~~~~~~~~g~Ps-------~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdD~I~~~~ss~~~~~i~~~v~~ 403 (768) T protein:vir:10 331 ARSLFNSTDGFPQ-------MGTFWRNRLCLMRDRWLAMSVSADFETFKTKDADQQTDDSAIVQQLNARQLNKLAWMVES 403 (768) T ss_pred ccCCCcCCCCCce-------EEEEEeeeEEEeeCCEEEEEcccccccccccccccccCCccEEEEecCCcceeEEEEeec Confidence 6666677777655 589999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcEEEEecCcEEEEeC---CccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccch Q lcl|NC_012418. 452 NKDLIVFAKKYQAVVPG---GGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVA 528 (826) Q Consensus 452 ~~~L~l~t~~~q~~~~~---~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~ 528 (826) ++|+|||+++||+|++ +++|||+|++++++|.|+++ +++|+.+|++++|+|++|+ .||||.|+.+.|+ |++ T Consensus 404 -~~L~i~T~~~q~~l~~~~~~~~lTP~~~~i~~~s~~g~~-~~~Pv~vG~~v~fv~~~g~---~vre~~y~~~~d~-y~a 477 (768) T protein:vir:10 404 -DSLLIGMTGDEWVIGPANASQPVSAANLNAARRTSYGSK-RIQPVQVGGTIMFVQKAGR---KLRDFKYDFSSDN-YVS 477 (768) T ss_pred -CcEEEEecCceEEEecCCCCcccccceEEEEEeehhccc-ccccEEeCCeEEEEcCCCC---EEEEEEeeeecCc-eec Confidence 5899999999999987 35899999999999999775 6999999999999999884 7999999977775 999 Q ss_pred hHHHHHHHHhcCC------CeEEEEEcCCCCEEEEEEcCCCeEEEEEEeeCCCceeeEeeEeeec-CCcEEEEEEE---- Q lcl|NC_012418. 
529 EDVTSHIPSYMPG------PAEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTL-RHQIIGTYFT---- 597 (826) Q Consensus 529 ~dls~~~~~~~~~------~v~~~~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~-~g~v~~~~~~---- 597 (826) +|+|+|++||+++ +|.+|+++++|++++||+++||+|++|+|++++++|+|+|||||++ +|.|++||+| T Consensus 478 ~DlT~~a~hl~~~~~~~~~~i~~~a~~~~p~~v~~~v~~dg~l~~~ty~~e~~~q~v~aW~~~~~~~g~v~~v~~i~~~~ 557 (768) T protein:vir:10 478 TDVTKIADHITRGRAGTNSGIMSLCFQQEPHSVVWAARADGQLIGCTYDEEAGRSDVYGWHRHPDANGFVECVASMPAPD 557 (768) T ss_pred chhhhhhhhhccccCccccceeeEEEeecCCeEEEEEecCCeEEEEEEecCCCceeEEeEEEEEcCCCEEEEEEEEecCC Confidence 9999999999986 3899999999999999999999999999998888899999999975 7999999998 Q ss_pred --CCeEEEEEEeCCCEEEEEEEEeecCCcCCCCcccccceEEEEeeccceeeecccccCccccccceEEeeeeeeEeecc Q lcl|NC_012418. 598 --GDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEATVEGELELTKQHWDLIKDAPAVYQLQPVAGAFMER 675 (826) Q Consensus 598 --~d~l~~vv~R~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 675 (826) +|+||++|+|+++++.+||.++ +... ...........++||........ ........|++++++.+++|| T Consensus 558 g~~d~l~~~v~r~~~g~~~~~ie~-l~~~-~~~~~~~~~~~~~D~~~~~~~~~------~~~~~gl~~leg~~v~v~~dG 629 (768) T protein:vir:10 558 GASDDLWVIVRRQVNGQTVRYVEY-LNPA-LQDDEPQSSAFYVDAGITYNGVP------TSTIAGLGHLEGVTVAVLTDG 629 (768) T ss_pred CCccEEEEEEEecCCCeEEEEEEe-cCcc-cccccccccceEeccccccCCcc------eeeecCCCCcccceEEEEECC Confidence 6999999999999999998544 3322 22222222334566654332211 111122347899999999999 Q ss_pred EEcccEecCCCeEEEeecCCcCCceEEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEEEEeeccceEEEEecCCCC Q lcl|NC_012418. 
676 YQLGVKRETNTKVFLDVPEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTAR 755 (826) Q Consensus 676 ~~~~~~~~~~g~~~l~~~~~~~~~~v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~ 755 (826) ..++...+.+|.++|+.+ +.+|+|||+|+++++|+||++++++|..++ +|+||+|++|+|++|++++++++...+ T Consensus 630 ~~~~~~~v~~g~itl~~~----~~~v~vG~~y~s~~~~~p~~~~~~~gs~~~-~~~ri~r~~v~~~~S~~~~~~~~~~~~ 704 (768) T protein:vir:10 630 AVHPSRTVTAGAITLDWS----ASIVHIGVPTTCRIQTMQLNAGAANGTAQG-KTKRVTNIATRFSRSLGGVVGPTFDDN 704 (768) T ss_pred EeccCceecCCEEEeCCC----CceEEEeEeeeEEEEecceEeecCCccccc-cceEEEEEEEEEecccceEEEecCCCC Confidence 999999999999999864 567999999999999999999999987764 688999999999999999998765543 Q ss_pred CceeeeeeccccccccccccCccccccceEEEEee-cccceeEEEEEECCCCCEEEEEEEEEEEEeccc Q lcl|NC_012418. 756 PNQPWYDTTPLRLFSRQLNAGEPLVDSAVVPLPAR-VAMATSKFELSCHSPYDMNVRAVEYNFKSNQTY 823 (826) Q Consensus 756 ~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~~p~~-~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r~ 823 (826) . +...+.+..+..+.. .+|++||++++|+. +++++.+|+|+|++|+||+||||+||+++|.|+ T Consensus 705 ~----~~~~~~r~~~~~~~~-~~~l~TG~~~v~~~~~~~~~~~i~i~~d~P~P~tvlsi~~~~~~nd~~ 768 (768) T protein:vir:10 705 D----LEQLSFRKPSNAMDR-AVPLFDGDMESDWRGGYEGQSWICYQNDQPLPVTLLGFFPILDTQDDR 768 (768) T ss_pred C----ceeeeeEecCcccCc-cCCcccCEEEEEecCCCCcceEEEEEECCCCCEEEEEEEEEEEEeecC Confidence 3 112233444444332 35779999999986 458889999999999999999999999999998 No 19 >protein:vir:1778 Length: 680 # NCBI annotation: tail protein A # Family: family:all:825 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570344;genbank:gi:18640503;genbank:GeneID:932716 Probab=100.00 E-value=1.2e-162 Score=908.30 Aligned_cols=573 Identities=20% Similarity=0.316 Sum_probs=446.0 Q ss_pred CcceeeechhhhcccccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCCCceEEEE Q lcl|NC_012418. 
1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) Q Consensus 1 M~~v~~s~~n~~gGVSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~ 80 (826) ||+|+|+||||++|||||||.+|+|+||++|+||+|||+.||+||||++||+.|... ...+.+|+++||+.|+|+++ T Consensus 1 M~~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRpg~~~i~~l~~~---~~~~~~~~~~rd~~e~~~~~ 77 (680) T protein:vir:17 1 MAAVEQMVPNLLGGISQQPDPLKLPGQVKQARNVQLDPTFGALKRPGTELIMQVTGI---PKRAKWIPIMRDAREHYYVA 77 (680) T ss_pred CccceecchhhhCcceecchhhcCcchhhhhhccccCcCcccccCccceeeeeccCC---CCCceeEEEecCCCCeEEEE Confidence 999999999999999999999999999999999999999999999999999988643 35788999999999999999 Q ss_pred EecCC-------eEEEEECCCCEEEEecCc---ccccc--cCCCcccEEEEEecCEEEEeeCCcceeeeecccCCCCCCc Q lcl|NC_012418. 81 AQHRG-------ELYLFDERDGRLLMGQPL---VHDYL--KAADYRQLRAATVADDLFIANLSVKPEADRTDVKGVDPNK 148 (826) Q Consensus 81 ~~~~g-------~irv~d~~~g~~~~~~~~---~~~yl--~~~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~ 148 (826) +...| .|||||+.+|....+... ...|+ ++.+..+||+++|||||||+|+++.|++..+. .+++. T Consensus 78 ~~~~g~~~~~~~~i~v~d~~~G~~~~v~~~~~~~~~~~~~~~~~~~~lr~~tv~d~tfi~N~~v~~~~~~~~---~~~~~ 154 (680) T protein:vir:17 78 IYREGANESGDLRIRVFDLKAGVERAVSFVGGEVEEYFPGDETDWEAIRSLTIGDYTFLSNPNVQPTTWSRS---FSRRP 154 (680) T ss_pred EEcCCCcccccceeEEEEccCCeEEEEEcCCCceEEEeecCCCCccceEEEEEcCEEEEECCeEEEeccCCC---CCCCC Confidence 98877 499999998865554332 12333 33445689999999999999999999987543 45567 Q ss_pred cEEEEEcccccCeeEEEEEeccCCcceeeeeeee-----eEEeccCccccccccccceeeccchhhhhhh---h------ Q lcl|NC_012418. 149 AGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTA-----TYVTPDNASTNPNLAEAPFQTSVGYIAWQLY---G------ 214 (826) Q Consensus 149 ~a~~~vr~g~Y~r~ytv~i~g~~~s~~~t~~~ta-----~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~---~------ 214 (826) .|+++|++|+|+|+|.|+|++...+..+....+. .+..+.+...........++.....++..+. . 
T Consensus 155 ~g~~~v~~~ayg~ty~v~ing~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~Ag~~t~~~~~~a~la~~l~~~~~~~~~~~ 234 (680) T protein:vir:17 155 EGLVTIGAAGYGTSYIVDFATEDSGQQRRWAVQEMQAPKTKRKKGDGSPDEAGETTVNNWNGTGLSFRVKVEARAFLVDD 234 (680) T ss_pred eeEEEEEEeeeeeEEEEEEeccccceeeeeeeeeeeccccccccccccCCCCcceeeeeeeeeeeeeeeeeccceeeecC Confidence 8999999999999999999998765543322221 1111111111100000111111111111000 0 Q ss_pred -------------eeecc----------------cceEEe--eee----------eccce-------------ecccccc Q lcl|NC_012418. 215 -------------KFFGA----------------PEYTLP--NST----------KKYPK-------------VDPDTAA 240 (826) Q Consensus 215 -------------~~~s~----------------~~~~~~--~~t----------~~~~~-------------~~~~~~~ 240 (826) ..++. ..+.+. ... ..+.. ++..... T Consensus 235 g~~~~~~y~~~~~l~~tg~~~~~~~~t~~v~~~G~~y~IsI~~~~~~~~~~~~~s~~~~t~~~~~a~~at~~~Ia~~L~~ 314 (680) T protein:vir:17 235 GEEYGHNYIPYVTLLTPGNNTSPFPDTIRVDVSGEGWDIKVTKQIQSKVYANLGTAQFTTPVDQSGGGASTSDIVTGLSA 314 (680) T ss_pred CCceEEEEeeEEEEecCCccccccCceEEEecccceeEEEEccceeeEeccCccceeeeeccCCcccceeHHHHHHHHHH Confidence 00000 000000 000 00000 0000000 Q ss_pred --cccccceEecccCCcEEEE--EcCCCe--EEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEeeeEe Q lcl|NC_012418. 241 --ATVAGYLNQRGVQDGYIAF--RGDGDI--VVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAVM 314 (826) Q Consensus 241 --~~~~~~~~~~~~~~~~~~~--~~~~~~--~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~ 314 (826) .....++.... +++|++ .+..+. ...++.++++++++.++.++|+++++||+++| +|+.+++ +. T Consensus 315 ~i~~~~~~~~~~~--g~~i~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~a~----~g~~v~v----~~ 384 (680) T protein:vir:17 315 AINGLGTFTAESI--GNVIRVRYSDPTRTDEFTMSARGGTSGTGLESIKYSVDTLAELPTKCW----NDYQVAV----RN 384 (680) T ss_pred hhcccCcEEEEEC--CCEEEEEeccCCCceEEEeeccCCCCceeeeeeeeeeccccccccccC----CCcEEEE----Ee Confidence 01123333333 344555 433332 23345678888999999999999999999887 4677654 34 Q ss_pred cCCCccceEEEEEecC--------CceEEEeecccccccc--cceeEEEEEecCCCeeEEeecC-------CcccccCCc Q lcl|NC_012418. 
315 ATGSTKAPVYFEWDSA--------NRRWAERAAYGTDWVL--KKMPLALRWDEATDTYSLNELD-------YDRRGSGDE 377 (826) Q Consensus 315 ~~~~~~~~~y~~~~~~--------~~~w~E~~~~g~~~~~--~tmp~~~~~~~~~~~f~~~~~~-------w~~r~~gd~ 377 (826) ..++..+.|||+|+.. .++|+||++||+.+++ .||||.++ ++++|.|.++..+ |++|.+||| T Consensus 385 ~~~~~~~~Yyv~~~~~~~~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~-r~~~g~f~~~~~~~~~~~~~~~~r~~Gdd 463 (680) T protein:vir:17 385 TQDTEVDDYYVKFETDVEDADVPGSGYWVETVKNGDDGGLVDDTMPHVLV-RNALGDFTFSSLNNSSYGKTWADRSVGSE 463 (680) T ss_pred CCCCcccceEEEEeccCcccCcccccceeecccCcccceeccCcceEEEE-EccCceeEEEeeccccccccccccccCCc Confidence 4567779999999863 4589999999977654 58999998 7789999999876 999999999 Q ss_pred ccccCcccc--CCCceEEEEEcceEEEecCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcE Q lcl|NC_012418. 378 DTNPTFNFV--TRGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDL 455 (826) Q Consensus 378 ~tnp~psf~--g~~~~~v~~~q~RL~~~~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L 455 (826) ++||+|+|+ |++|++|+||||||+|++|++|||||+||||||++++++++.|||||+++++++++++|+|+++++++| T Consensus 464 ~tnp~psF~~~G~~p~~v~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L 543 (680) T protein:vir:17 464 DTNPHPTFTESGNGIYGMFMYKNRLGFLTQDAVIMSQVGDYFNFYATSGVTISDADPIDMATSDTKPVKLEAAISSTSGA 543 (680) T ss_pred ccCCCcccccCCCCceEEEEEcceEEEeeCCeEEEEccCCcccccccccccCCCCccEEEEEcCCcceeeeEEeecCCcE Confidence 999999999 889999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecCcEEEEeCC-ccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHH Q lcl|NC_012418. 
456 IVFAKKYQAVVPGG-GIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSH 534 (826) Q Consensus 456 ~l~t~~~q~~~~~~-~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~ 534 (826) +|||+++||+|+++ ++|||+|++++++|+|+|++.|+|+.+|+.+||++++| ++++||||.|+.+.|+ |+++|||+| T Consensus 544 ~l~t~g~q~~ls~~~~~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g-~~s~vre~~y~~~~d~-y~a~DlT~~ 621 (680) T protein:vir:17 544 ILFGNQAQFRLSSPDESFGPKTATLDKISNYTYESKADPVQTGVSMIFPTNMG-TYSSVYELSTESAKGT-PVIEDSSRV 621 (680) T ss_pred EEEecCeEEEEecCCceecceeEEEEEEEeecccCCCCceEeCCeEEEeecCC-CcceEEEEeeeeccCc-eehhhHHHH Confidence 99999999999984 69999999999999999999999999999999999988 5789999999987776 999999999 Q ss_pred HHHhcCCCeEEE-EEcCCCCEEEEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCCcEE Q lcl|NC_012418. 535 IPSYMPGPAEYI-QAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQII 592 (826) Q Consensus 535 ~~~~~~~~v~~~-~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~ 592 (826) ++|||+|++..+ +++++|..++|++++||+|++|||||+++||+|+|||||+|++.=. T Consensus 622 a~hl~~g~v~~~~~~~~~~~~~~~~~~~~~~l~~~~yl~~~~e~~v~aW~rw~~~~~d~ 680 (680) T protein:vir:17 622 IPRLIPSGLTWSTASMNNDTVFFGNAKKGRNVYVFRFFNEGQERKVAGWTTWYYEDQDH 680 (680) T ss_pred HHHhcCCceEEEEeeCCCCeEEEEEEcCCCEEEEEEEeeCCCceEEEEEEEEecCCCCC Confidence 999999988776 5566677789999999999999999999999999999999986433 No 20 >protein:vir:95324 Length: 823 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512269;genbank:gi:89152436;genbank:GeneID:3952993 Probab=100.00 E-value=7.7e-155 Score=865.49 Aligned_cols=706 Identities=14% Similarity=0.106 Sum_probs=488.4 Q ss_pred Ccceeeechhhhcc-----cccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCccc-ccccccEEEEEeCCC Q lcl|NC_012418. 
1 MSYKQSAYPNLLMG-----VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQ-PWPRPFLYHTNLGGR 74 (826) Q Consensus 1 M~~v~~s~~n~~gG-----VSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~-~~~~~~~~~~~rd~~ 74 (826) |+ +++.++||.+| +++|+|++||++||++|+||+|+|+|||+|||||+||+++++++. .+++||. |++. T Consensus 1 m~-i~~~q~sF~~GElsP~l~gR~Dl~ry~~q~~~~~N~~~~~~GGl~rRpGt~fva~~~~~~g~~rLipf~----~s~~ 75 (823) T protein:vir:95 1 MA-ISWIQPSFAGGEIGPSLYGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPNRKCRLIPFQ----FSTV 75 (823) T ss_pred Cc-ceeechhccCceechheeccchHHHHHHHHhhhhCcEeeecCCceecCchhhhhhhcCCCCCeeEEEEE----eCCC Confidence 99 99999999999 999999999999999999999999999999999999999987554 3566655 4444 Q ss_pred ceEEEEEecCCeEEEEECCCCEEEEecC----cccccccCCCcccEEEEEecCEEEEeeCCcceeeeecccCCCCCCccE Q lcl|NC_012418. 75 SIAMLVAQHRGELYLFDERDGRLLMGQP----LVHDYLKAADYRQLRAATVADDLFIANLSVKPEADRTDVKGVDPNKAG 150 (826) Q Consensus 75 e~~~i~~~~~g~irv~d~~~g~~~~~~~----~~~~yl~~~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a 150 (826) |.|+ +++++++||||+ ++|.++.... ...+| ++.+.++|+|+|+||+|||+|++|||++.... ...++... T Consensus 76 q~y~-Lefg~~~irV~~-~~g~vv~~~~~~~ev~tPy-~~~~l~~Lr~~qsaD~~fivh~~~~p~~L~r~-~~~~w~l~- 150 (823) T protein:vir:95 76 QTYA-LEFGHQYMRVIK-DGALVLNSSNVIYEIATPY-TEADLFRIKFTQSADVLTLVHPAYPPKELRRY-AHDNWQLV- 150 (823) T ss_pred cEEE-EEEcCCeEEEEe-CCcEEEecCCceeEEeccc-ccccccceeEEEeccEEEEEcCCccceEEEec-CCCCceEE- Confidence 5555 555688999995 3443322111 12234 56677899999999999999999999875322 12222222 Q ss_pred EEEEcccccCe---eEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeee Q lcl|NC_012418. 151 WLYIKAGQYSK---AFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNS 227 (826) Q Consensus 151 ~~~vr~g~Y~r---~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~ 227 (826) .+..+.|.|.. ++++++....... .++.+.... . ..+.+ ++.. +++... 
T Consensus 151 ~~~~~~gp~~~~~~~~t~~v~~~~~~~--------~~t~ta~~~-------~-------~~~d~-vg~~-----~~l~~~ 202 (823) T protein:vir:95 151 DVVTKNGPFEDINIDESLTVYASASTG--------TITLTASAS-------I-------FGAEQ-VGKL-----FYLEQP 202 (823) T ss_pred EEEEeccccccccccceeEEeccccCc--------eeEEeeccc-------c-------cchhh-ccce-----EEEecc Confidence 23345565543 3334443222211 111111100 0 00000 0000 000000 Q ss_pred eeccceecccccccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEE Q lcl|NC_012418. 228 TKKYPKVDPDTAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQ 307 (826) Q Consensus 228 t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~ 307 (826) ... .... +........+............... + ++. ...|......+.+ T Consensus 203 ~~~--~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~---~------------~~g----~~~~~~~~~~~~~- 251 (823) T protein:vir:95 203 AVD--SVPV---------WETSKSTSIGDIRRADSNYYRAVTA---G------------KTG----TLRPSHTEGTSWD- 251 (823) T ss_pred ccc--eeee---------cceeeeecccceEEecccceeeeec---c------------ccc----eeecccCCcceEE- Confidence 000 0000 0000000000010000000000000 0 000 1111111111111 Q ss_pred EEeeeEecCCCccceEEE---EEecCCceEEEeeccccccc---ccceeEEEEEecCCCeeEEeecCCcccccCCccccc Q lcl|NC_012418. 308 FMDGAVMATGSTKAPVYF---EWDSANRRWAERAAYGTDWV---LKKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNP 381 (826) Q Consensus 308 ~~~~~~~~~~~~~~~~y~---~~~~~~~~w~E~~~~g~~~~---~~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp 381 (826) ...++..+++|. .+....+.|++++..+.... ..+||+.++ ++.+++|.++...|.. T Consensus 252 ------~~~~~~~~~~~~~~~~~~~~~g~~~~t~v~~~~~~~~~~~~~~~~~~-~~~~~t~~~~~~~~~~---------- 314 (823) T protein:vir:95 252 ------GWGGSGDDDTGIEWEYLHSGFGIARITAVNGTTATAEVISYIPSQVV-GEDNASYKWAKYAWNS---------- 314 (823) T ss_pred ------eceecccccceeEEEEEeCCcceEEEEeecceeeeceEeeeeccccc-cCCcCCccccccccCc---------- Confidence 112223333333 23456688999876664322 246888776 6667888888877753 Q ss_pred CccccCCCceEEEEEcceEEEec----CCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEE Q lcl|NC_012418. 
382 TFNFVTRGITGMTTFQGRLVLLS----QEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIV 457 (826) Q Consensus 382 ~psf~g~~~~~v~~~q~RL~~~~----~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l 457 (826) ..+||++|+||||||+|++ |++|||||+||||||+++++ ++|||||+++++++++|.|+|+++++ +|+| T Consensus 315 ----~~g~Ps~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~~--~~DdD~I~~~~s~~~~~~i~~~v~~~-~Lli 387 (823) T protein:vir:95 315 ----VNGYPGTVVYYQQRLYFAASTAFPQTIWASRTGDYKDFGKSNP--TQDDDRIIYTYAGRQVNEIRHLIDVG-SLVA 387 (823) T ss_pred ----CCCCccEEEEEeceEEEEEcCCCCcEEEEeccCCccccccccC--CCCCCcEEEEEcCCcceEEEEEeecC-cEEE Confidence 2356788999999999995 79999999999999999984 57999999999999999999999995 7999 Q ss_pred EecCcEEEEeCC--ccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHH Q lcl|NC_012418. 458 FAKKYQAVVPGG--GIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHI 535 (826) Q Consensus 458 ~t~~~q~~~~~~--~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~ 535 (826) ||+++||+|+++ ++|||+|++++++|.|++ ++++|+.+|+.++|+|++| ++||||.|+.+.++ |+++|+|+|+ T Consensus 388 ~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~~~Fv~~~g---~~vre~~~~~~~d~-~~~~dlT~~a 462 (823) T protein:vir:95 388 LTSGGEYVITGDQNKVLTPSSFAFSSQGSNGS-SNVPPIAVANIALFVQEKG---SVVRDLAYSFDVDG-YQGNDLTILA 462 (823) T ss_pred EecCcEEEEEcCCCcccceeeEEEEEeecccc-ccccceEeCCeEEEEecCC---CEEEEEEEeeecCc-eecchhhhhh Confidence 999999999874 699999999999999965 5799999999999999877 47999999977665 9999999999 Q ss_pred HHhcCC-CeEEEEEcCCCCEEEEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEEEEE----CCeEEEEEEeCCC Q lcl|NC_012418. 
536 PSYMPG-PAEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYFT----GDNLMVLIQKGQE 610 (826) Q Consensus 536 ~~~~~~-~v~~~~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~----~d~l~~vv~R~~~ 610 (826) +|++++ +|.+|+|+++|++++||+++||+|++|+|+ +||+|.|||||+++|+|++||++ +|+||++|+|+++ T Consensus 463 ~hl~~~~~i~~~a~~~~p~~~~~~v~~dG~l~~~ty~---~~q~v~aW~~~~~~g~~~~~~~i~~~~~d~l~~~v~R~i~ 539 (823) T protein:vir:95 463 NHLFQKHSIVDWCFSIVPYSSAFCIRDDGKLLVMTYL---RDQQVFAWAPQSSTGKYESTCSISEGNEDAVYFVVNRTVN 539 (823) T ss_pred hhhcCCCceEEEEEecCCCeEEEEEecCCcEEEEEEe---cccceeeeEEEecCCcEEEEEEecCCCCCEEEEEEEeccC Confidence 999997 799999999999999999999999999998 88999999999999999999998 6899999999999 Q ss_pred EEEEEEEEeecCCcCCCCcccccceEEEEeecc---------------------------------------ceeeec-- Q lcl|NC_012418. 611 IALGRMHLNSLPAREGLQYPKYDYWRRIEATVE---------------------------------------GELELT-- 649 (826) Q Consensus 611 ~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------------------------------~~~~~~-- 649 (826) ++..||.| ++........++..+.++...+.+ +....+ T Consensus 540 g~~~~yiE-~~~~~~~~~~~~~~~lD~~~s~~g~~~~~~~~~l~~g~~~l~~l~g~~v~~adg~~~~~~~v~g~i~l~~~ 618 (823) T protein:vir:95 540 GQTVRYIE-RLSSRLFTSDEDAFFVDSGLSYDGRNTSDRTMTITGGSGEWDYLAEYTISVSGGAYFTSSDVGAQLQFPYT 618 (823) T ss_pred CeEEEEEE-eeccccCCCccceeEEEEEEEeecCcccceeeEecCCCCcccccCceEEEecCcceECCccceeEEEeCcC Confidence 98888744 433333333333333222211110 000000 Q ss_pred ------------------------------------ccccC--------cc-ccccceEEeeeeeeEeeccEEcccEecC Q lcl|NC_012418. 650 ------------------------------------KQHWD--------LI-KDAPAVYQLQPVAGAFMERYQLGVKRET 684 (826) Q Consensus 650 ------------------------------------~~~~~--------~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 684 (826) ..... .. ......|++|+++.+++||..++...+. 
T Consensus 619 ~~~~~vGl~~~~~i~~~~~~v~~~~a~~~~~~r~v~a~l~~~~t~~~~~~~~~~~gL~hleg~tv~v~~dg~~~~~~~v~ 698 (823) T protein:vir:95 619 GADPDTGYEVSKELRCDIISVTSNTAVVVRANRNVPPSLRNVATTNWQMARRTFGGLSHLEGQTVNILSDANVEPQKVVS 698 (823) T ss_pred CCccccccceEEEEEEeeceeeCCceEEEccCCcccceeeeeeccccccccceeeeccccccceEEEEEcCeeeCCeEec Confidence 00000 00 0011238899999999999999999999 Q ss_pred CCeEEEeecCCcCCceEEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeec Q lcl|NC_012418. 685 NTKVFLDVPEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTT 764 (826) Q Consensus 685 ~g~~~l~~~~~~~~~~v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~ 764 (826) +|.++|+.|+ ..|+|||+|+++++|+||++.. +|.++++ ++||++++++|++|.++.++.+.... .+. T Consensus 699 ~G~vtl~~~~----~~v~vGl~~~~~~~~l~~~~~~-~g~~~g~-~~ri~~~~~~~~~s~~~~~g~~~~~l---~~~--- 766 (823) T protein:vir:95 699 GGAVTLESPG----AVVHIGLPITAEFETLDINING-QETLLDK-KQVIPSVTLVVNASRGIWATTPGGKW---YEY--- 766 (823) T ss_pred CCEEEecCCC----CEEEEeecceeeEEecchhcCC-CcccCCc-eeEEeEEEEEEEeeeeEEEecCCCce---eEe--- Confidence 9999999875 4699999999999999999885 4766644 45899999999999999987543321 111 Q ss_pred cccccccccccCccccccceEEEEe-ecccceeEEEEEECCCCCEEEEEEEEEEEEecc Q lcl|NC_012418. 
765 PLRLFSRQLNAGEPLVDSAVVPLPA-RVAMATSKFELSCHSPYDMNVRAVEYNFKSNQT 822 (826) Q Consensus 765 ~~~~~~~~~~~~~~~~~tg~~~~p~-~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r 822 (826) +.|- .+.....|+++||++++++ .+|+++.+|+|+|++|||||||||..|...+=- T Consensus 767 ~~r~--~~~~~~~~~~~tG~~~~~~~~~~~~~~~~~i~q~~plp~tvl~v~~~~~~~g~ 823 (823) T protein:vir:95 767 PQRE--FEFYDDPVDDATGKVEVKLDSNWGKNGRVKIRQLDPLPLSVLAVIPRLTVGGF 823 (823) T ss_pred eccC--CCcccCCCCcccceEEEecCCCcCCccEEEEEEcCCCceEEEEEEEEEEecCC Confidence 1221 1222233578999999998 589999999999999999999999999887655 No 21 >protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878 Probab=100.00 E-value=1.5e-150 Score=841.91 Aligned_cols=660 Identities=14% Similarity=0.127 Sum_probs=469.5 Q ss_pred Ccceeeechhhhcc-----cccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCccc-ccccccEEEEEeCCC Q lcl|NC_012418. 1 MSYKQSAYPNLLMG-----VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQ-PWPRPFLYHTNLGGR 74 (826) Q Consensus 1 M~~v~~s~~n~~gG-----VSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~-~~~~~~~~~~~rd~~ 74 (826) |+++....+||.+| +..|.|++||+++|++|+||++.|+||++|||||+|++++++++. .+++||.|+ . T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~-----~ 75 (681) T protein:vir:10 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYS-----V 75 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeC-----C Confidence 99999999999999 668999999999999999999999999999999999999998764 468888876 4 Q ss_pred ceEEEEEecCCeEEEEECCCCEEEEec-C--cccccccCCCcccEEEEEecCEEEEeeCCcceeeeecccCCCCCCccEE Q lcl|NC_012418. 
75 SIAMLVAQHRGELYLFDERDGRLLMGQ-P--LVHDYLKAADYRQLRAATVADDLFIANLSVKPEADRTDVKGVDPNKAGW 151 (826) Q Consensus 75 e~~~i~~~~~g~irv~d~~~g~~~~~~-~--~~~~yl~~~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~ 151 (826) +++|++++++++|||| .+++.++..+ + ..++| +++++.+|+|+|+||+|||||+++||++... ....+|+-.++ T Consensus 76 ~~~~~l~~g~~~~r~~-~~~~~~~~~~~~~~~~tpy-~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r-~~~~~W~l~~~ 152 (681) T protein:vir:10 76 TQTMVIELGAGYFRFH-TNGGTLLDGAVPYEIANPY-AEADLFNIHYVQSADVLTLVHPNYAPRELRR-LGATNWQLATI 152 (681) T ss_pred CceEEEEEeCCeEEEE-eCCcEEeeCcEeEEecCCC-ChhhhcCceEEEEcCEEEEECCCCcceEEEE-ccCCceEEEEE Confidence 7888899999999999 4566554321 1 12334 5667789999999999999999999988422 22222221111 Q ss_pred EEEcccccCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeeeecc Q lcl|NC_012418. 152 LYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKY 231 (826) Q Consensus 152 ~~vr~g~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~ 231 (826) .+..+ .+. ..+++ ++ ... .+.. .+..+ T Consensus 153 ~f~~~-p~~-p~~~~---------------at--~~~-----------------------------~~~~-----~t~~~ 179 (681) T protein:vir:10 153 AFTSP-VAT-PTSVT---------------AT--SNN-----------------------------KGTD-----YTYRY 179 (681) T ss_pred Eeccc-ccc-ceeee---------------ee--ccC-----------------------------Cccc-----eeEeE Confidence 11110 000 00000 00 000 0000 00000 Q ss_pred ceecccccccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEee Q lcl|NC_012418. 232 PKVDPDTAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDG 311 (826) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~ 311 (826) ...+.+... . ..+. ..... .+....-+. +... . 
T Consensus 180 ~v~avda~t--------------~-----------~~s~---~~~~~--tvt~~~~~~-------------~~~~----t 212 (681) T protein:vir:10 180 VVTALDAEG--------------K-----------TESA---PSSAG--TCTNNLFTN-------------GGAN----T 212 (681) T ss_pred EEEEeeccc--------------c-----------eeec---CCcce--EEeeeeecC-------------Ccce----e Confidence 000000000 0 0000 00000 000000000 0000 0 Q ss_pred eEecCCCccceEEEEEecCCceEEEeecccccccccceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccCCCce Q lcl|NC_012418. 312 AVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVTRGIT 391 (826) Q Consensus 312 ~~~~~~~~~~~~y~~~~~~~~~w~E~~~~g~~~~~~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~ 391 (826) ..+...... ..|-.+....++|... +... +....+..+ .+......+...+|+.++ ++||+ T Consensus 213 ~~w~a~~g~-~~~~V~~~~~gi~g~i-g~~~--~~~~~~~~~--------------~~~~~~t~~~~~~~~~~~-~gyP~ 273 (681) T protein:vir:10 213 IAWSASSGA-SRYNVYKEQGGLYGYI-GQTT--GTSLVDDNI--------------APDLSVTPPIYDAVFNAA-GDYPA 273 (681) T ss_pred EEEEecCCc-eeeeecccceeEEEEe-eccc--eeeeeeccc--------------ccCccccccccccccccC-CCceE Confidence 011111111 1111111222333221 1110 000000000 111111122334666554 56899 Q ss_pred EEEEEcceEEEe----cCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEe Q lcl|NC_012418. 
392 GMTTFQGRLVLL----SQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVP 467 (826) Q Consensus 392 ~v~~~q~RL~~~----~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~ 467 (826) +|+||||||+|+ +||+|||||+||||||+++++ ++|||||++++++++++.|+|+++++ +|+|||+++||.|+ T Consensus 274 ~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~--~~ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~ 350 (681) T protein:vir:10 274 AVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLP--VRDDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVA 350 (681) T ss_pred EEEEEcceEEEeeCCCCCcEEEEEcccCcccccccCC--CCCCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEe Confidence 999999999999 589999999999999999984 57999999999999999999999995 79999999999998 Q ss_pred C--CccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCC-CeE Q lcl|NC_012418. 468 G--GGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPG-PAE 544 (826) Q Consensus 468 ~--~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~-~v~ 544 (826) + +++|||+|++++++|.|++ ++++|+.+|++++|+|++|+ .||||.|+.+.|+ |+++|+|++++|++++ +|. T Consensus 351 ~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~fv~~~g~---~vre~~y~~~~d~-~~~~dlt~~a~Hl~~~~~i~ 425 (681) T protein:vir:10 351 SVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTIYGAARGG---HVRELAYNWQANG-FVTGDLSLRAAHLFDNLDIL 425 (681) T ss_pred cCCCccccceeEEEEEeeeecc-ccccceeeCCeEEEEecCCC---EEEEEEEeeecCc-eeccchhhhhhhhcCCCCeE Confidence 7 4699999999999999976 47999999999999999884 7999999977775 9999999999999997 899 Q ss_pred EEEEcCCCCEEEEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEEEEE----CCeEEEEEEeCCCEEEEEEEEee Q lcl|NC_012418. 
545 YIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYFT----GDNLMVLIQKGQEIALGRMHLNS 620 (826) Q Consensus 545 ~~~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~----~d~l~~vv~R~~~~~~~r~~~~~ 620 (826) +|+++++|++++||+++||+|++|+|+ +||+|.|||||+|+|+|++||++ +|+||++|+|++++..+++ +|+ T Consensus 426 ~~a~~~~p~~~~~~v~~dg~l~~~ty~---~eq~v~aW~~~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~y-ie~ 501 (681) T protein:vir:10 426 DMAYAKAPQPIVWFISSSGKLLGLTYV---PEQQIGAWHQHDTDGVFESCAVVAEGNEDRLYAVVRRTIGGNEVRY-VER 501 (681) T ss_pred EEEEecCCCEEEEEEecCCcEEEEEEe---cccceeeEEEEecCCcEEEEEEecCCCCcEEEEEEEecCCCCeEEE-EEe Confidence 999999999999999999999999997 88999999999999999999999 6899999999999988887 444 Q ss_pred cCCcCCCCcccccceEEEEeeccceeeecccccCccccccceEEeeeeeeEeeccEEcccEecCCCeEEEeecCCcCCce Q lcl|NC_012418. 621 LPAREGLQYPKYDYWRRIEATVEGELELTKQHWDLIKDAPAVYQLQPVAGAFMERYQLGVKRETNTKVFLDVPEAVVGSV 700 (826) Q Consensus 621 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~ 700 (826) +.........+. .++||......... . ......|++|+++.+++||..++...+.+|.++|+.+ +.+ T Consensus 502 ~~~~~~~~~~~~---~~vD~~~t~~~~~~-~-----~~sgl~~leG~tv~i~aDG~~~~~~~V~~G~itl~~~----~~~ 568 (681) T protein:vir:10 502 MASRQFDAQADA---FFVDSGLTYSGEPV-S-----HISGLEHLEGKTVSILADGAVHPQRVVTDGAIDLDVE----AGT 568 (681) T ss_pred cCCccccccccc---eEeeccccccCcce-e-----eeccccCCCCcEEEEEeCCeecCcEeecCcEEEeCcC----Cce Confidence 444333222222 23555443322110 0 1122247899999999999999999999999999865 567 Q ss_pred EEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccccccccccccCcccc Q lcl|NC_012418. 701 YVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLV 780 (826) Q Consensus 701 v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 780 (826) |+|||+|+++++|+||+++.++|...++ ++||+|+.|++.+|.+++++++...... 
...+.+.+++ ..+++ T Consensus 569 v~VGl~Y~s~i~~lp~~~~~~~g~~~g~-~~ri~rv~lr~~~S~g~~~~~~~~~l~~--~~~~~~~~~g------~~~~l 639 (681) T protein:vir:10 569 VHIGLPITAELQTLPVAMQLDGSFGQGR-VKNINKLWLRVHRSSGIFAGPHADALTE--VKQRTSEPYG------SPPAL 639 (681) T ss_pred EEEeeeceeEEEecceeeecCCcccCCc-eEEEEEEEEEEEcccceEEeeCCCceEE--EEEecccccc------ccCCc Confidence 9999999999999999999998877654 6799999999999999999876543322 2222222221 23578 Q ss_pred ccceEEEEee-cccceeEEEEEECCCCCEEEEEEEEEEEEec Q lcl|NC_012418. 781 DSAVVPLPAR-VAMATSKFELSCHSPYDMNVRAVEYNFKSNQ 821 (826) Q Consensus 781 ~tg~~~~p~~-~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~ 821 (826) +||++++|+. +|+++.+|+|+|++|+||+|+||+||....- T Consensus 640 ~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~vgg 681 (681) T protein:vir:10 640 KSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIAIGA 681 (681) T ss_pred cCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEEeeC Confidence 9999999986 7899999999999999999999999999987 No 22 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=100.00 E-value=1.5e-150 Score=841.91 Aligned_cols=660 Identities=14% Similarity=0.127 Sum_probs=469.5 Q ss_pred Ccceeeechhhhcc-----cccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCccc-ccccccEEEEEeCCC Q lcl|NC_012418. 1 MSYKQSAYPNLLMG-----VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQ-PWPRPFLYHTNLGGR 74 (826) Q Consensus 1 M~~v~~s~~n~~gG-----VSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~-~~~~~~~~~~~rd~~ 74 (826) |+++....+||.+| +..|.|++||+++|++|+||++.|+||++|||||+|++++++++. .+++||.|+ . 
T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~-----~ 75 (681) T protein:vir:98 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYS-----V 75 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeC-----C Confidence 99999999999999 668999999999999999999999999999999999999998764 468888876 4 Q ss_pred ceEEEEEecCCeEEEEECCCCEEEEec-C--cccccccCCCcccEEEEEecCEEEEeeCCcceeeeecccCCCCCCccEE Q lcl|NC_012418. 75 SIAMLVAQHRGELYLFDERDGRLLMGQ-P--LVHDYLKAADYRQLRAATVADDLFIANLSVKPEADRTDVKGVDPNKAGW 151 (826) Q Consensus 75 e~~~i~~~~~g~irv~d~~~g~~~~~~-~--~~~~yl~~~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~ 151 (826) +++|++++++++|||| .+++.++..+ + ..++| +++++.+|+|+|+||+|||||+++||++... ....+|+-.++ T Consensus 76 ~~~~~l~~g~~~~r~~-~~~~~~~~~~~~~~~~tpy-~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r-~~~~~W~l~~~ 152 (681) T protein:vir:98 76 TQTMVIELGAGYFRFH-TNGGTLLDGAVPYEIANPY-AEADLFNIHYVQSADVLTLVHPNYAPRELRR-LGATNWQLATI 152 (681) T ss_pred CceEEEEEeCCeEEEE-eCCcEEeeCcEeEEecCCC-ChhhhcCceEEEEcCEEEEECCCCcceEEEE-ccCCceEEEEE Confidence 7888899999999999 4566554321 1 12334 5667789999999999999999999988422 22222221111 Q ss_pred EEEcccccCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeeeecc Q lcl|NC_012418. 152 LYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKY 231 (826) Q Consensus 152 ~~vr~g~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~ 231 (826) .+..+ .+. ..+++ ++ ... .+.. .+..+ T Consensus 153 ~f~~~-p~~-p~~~~---------------at--~~~-----------------------------~~~~-----~t~~~ 179 (681) T protein:vir:98 153 AFTSP-VAT-PTSVT---------------AT--SNN-----------------------------KGTD-----YTYRY 179 (681) T ss_pred Eeccc-ccc-ceeee---------------ee--ccC-----------------------------Cccc-----eeEeE Confidence 11110 000 00000 00 000 0000 00000 Q ss_pred ceecccccccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEee Q lcl|NC_012418. 
232 PKVDPDTAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDG 311 (826) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~ 311 (826) ...+.+... . ..+. ..... .+....-+. +... . T Consensus 180 ~v~avda~t--------------~-----------~~s~---~~~~~--tvt~~~~~~-------------~~~~----t 212 (681) T protein:vir:98 180 VVTALDAEG--------------K-----------TESA---PSSAG--TCTNNLFTN-------------GGAN----T 212 (681) T ss_pred EEEEeeccc--------------c-----------eeec---CCcce--EEeeeeecC-------------Ccce----e Confidence 000000000 0 0000 00000 000000000 0000 0 Q ss_pred eEecCCCccceEEEEEecCCceEEEeecccccccccceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccCCCce Q lcl|NC_012418. 312 AVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVTRGIT 391 (826) Q Consensus 312 ~~~~~~~~~~~~y~~~~~~~~~w~E~~~~g~~~~~~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~ 391 (826) ..+...... ..|-.+....++|... +... +....+..+ .+......+...+|+.++ ++||+ T Consensus 213 ~~w~a~~g~-~~~~V~~~~~gi~g~i-g~~~--~~~~~~~~~--------------~~~~~~t~~~~~~~~~~~-~gyP~ 273 (681) T protein:vir:98 213 IAWSASSGA-SRYNVYKEQGGLYGYI-GQTT--GTSLVDDNI--------------APDLSVTPPIYDAVFNAA-GDYPA 273 (681) T ss_pred EEEEecCCc-eeeeecccceeEEEEe-eccc--eeeeeeccc--------------ccCccccccccccccccC-CCceE Confidence 011111111 1111111222333221 1110 000000000 111111122334666554 56899 Q ss_pred EEEEEcceEEEe----cCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEe Q lcl|NC_012418. 
392 GMTTFQGRLVLL----SQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVP 467 (826) Q Consensus 392 ~v~~~q~RL~~~----~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~ 467 (826) +|+||||||+|+ +||+|||||+||||||+++++ ++|||||++++++++++.|+|+++++ +|+|||+++||.|+ T Consensus 274 ~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~--~~ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~ 350 (681) T protein:vir:98 274 AVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLP--VRDDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVA 350 (681) T ss_pred EEEEEcceEEEeeCCCCCcEEEEEcccCcccccccCC--CCCCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEe Confidence 999999999999 589999999999999999984 57999999999999999999999995 79999999999998 Q ss_pred C--CccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCC-CeE Q lcl|NC_012418. 468 G--GGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPG-PAE 544 (826) Q Consensus 468 ~--~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~-~v~ 544 (826) + +++|||+|++++++|.|++ ++++|+.+|++++|+|++|+ .||||.|+.+.|+ |+++|+|++++|++++ +|. T Consensus 351 ~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~fv~~~g~---~vre~~y~~~~d~-~~~~dlt~~a~Hl~~~~~i~ 425 (681) T protein:vir:98 351 SVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTIYGAARGG---HVRELAYNWQANG-FVTGDLSLRAAHLFDNLDIL 425 (681) T ss_pred cCCCccccceeEEEEEeeeecc-ccccceeeCCeEEEEecCCC---EEEEEEEeeecCc-eeccchhhhhhhhcCCCCeE Confidence 7 4699999999999999976 47999999999999999884 7999999977775 9999999999999997 899 Q ss_pred EEEEcCCCCEEEEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEEEEE----CCeEEEEEEeCCCEEEEEEEEee Q lcl|NC_012418. 
545 YIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYFT----GDNLMVLIQKGQEIALGRMHLNS 620 (826) Q Consensus 545 ~~~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~----~d~l~~vv~R~~~~~~~r~~~~~ 620 (826) +|+++++|++++||+++||+|++|+|+ +||+|.|||||+|+|+|++||++ +|+||++|+|++++..+++ +|+ T Consensus 426 ~~a~~~~p~~~~~~v~~dg~l~~~ty~---~eq~v~aW~~~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~y-ie~ 501 (681) T protein:vir:98 426 DMAYAKAPQPIVWFISSSGKLLGLTYV---PEQQIGAWHQHDTDGVFESCAVVAEGNEDRLYAVVRRTIGGNEVRY-VER 501 (681) T ss_pred EEEEecCCCEEEEEEecCCcEEEEEEe---cccceeeEEEEecCCcEEEEEEecCCCCcEEEEEEEecCCCCeEEE-EEe Confidence 999999999999999999999999997 88999999999999999999999 6899999999999988887 444 Q ss_pred cCCcCCCCcccccceEEEEeeccceeeecccccCccccccceEEeeeeeeEeeccEEcccEecCCCeEEEeecCCcCCce Q lcl|NC_012418. 621 LPAREGLQYPKYDYWRRIEATVEGELELTKQHWDLIKDAPAVYQLQPVAGAFMERYQLGVKRETNTKVFLDVPEAVVGSV 700 (826) Q Consensus 621 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~ 700 (826) +.........+. .++||......... . ......|++|+++.+++||..++...+.+|.++|+.+ +.+ T Consensus 502 ~~~~~~~~~~~~---~~vD~~~t~~~~~~-~-----~~sgl~~leG~tv~i~aDG~~~~~~~V~~G~itl~~~----~~~ 568 (681) T protein:vir:98 502 MASRQFDAQADA---FFVDSGLTYSGEPV-S-----HISGLEHLEGKTVSILADGAVHPQRVVTDGAIDLDVE----AGT 568 (681) T ss_pred cCCccccccccc---eEeeccccccCcce-e-----eeccccCCCCcEEEEEeCCeecCcEeecCcEEEeCcC----Cce Confidence 444333222222 23555443322110 0 1122247899999999999999999999999999865 567 Q ss_pred EEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccccccccccccCcccc Q lcl|NC_012418. 701 YVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLV 780 (826) Q Consensus 701 v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 780 (826) |+|||+|+++++|+||+++.++|...++ ++||+|+.|++.+|.+++++++...... 
...+.+.+++ ..+++ T Consensus 569 v~VGl~Y~s~i~~lp~~~~~~~g~~~g~-~~ri~rv~lr~~~S~g~~~~~~~~~l~~--~~~~~~~~~g------~~~~l 639 (681) T protein:vir:98 569 VHIGLPITAELQTLPVAMQLDGSFGQGR-VKNINKLWLRVHRSSGIFAGPHADALTE--VKQRTSEPYG------SPPAL 639 (681) T ss_pred EEEeeeceeEEEecceeeecCCcccCCc-eEEEEEEEEEEEcccceEEeeCCCceEE--EEEecccccc------ccCCc Confidence 9999999999999999999998877654 6799999999999999999876543322 2222222221 23578 Q ss_pred ccceEEEEee-cccceeEEEEEECCCCCEEEEEEEEEEEEec Q lcl|NC_012418. 781 DSAVVPLPAR-VAMATSKFELSCHSPYDMNVRAVEYNFKSNQ 821 (826) Q Consensus 781 ~tg~~~~p~~-~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~ 821 (826) +||++++|+. +|+++.+|+|+|++|+||+|+||+||....- T Consensus 640 ~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~vgg 681 (681) T protein:vir:98 640 KSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIAIGA 681 (681) T ss_pred cCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEEeeC Confidence 9999999986 7899999999999999999999999999987 No 23 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=100.00 E-value=1.5e-150 Score=841.91 Aligned_cols=660 Identities=14% Similarity=0.127 Sum_probs=469.5 Q ss_pred Ccceeeechhhhcc-----cccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCccc-ccccccEEEEEeCCC Q lcl|NC_012418. 1 MSYKQSAYPNLLMG-----VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQ-PWPRPFLYHTNLGGR 74 (826) Q Consensus 1 M~~v~~s~~n~~gG-----VSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~-~~~~~~~~~~~rd~~ 74 (826) |+++....+||.+| +..|.|++||+++|++|+||++.|+||++|||||+|++++++++. .+++||.|+ . 
T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~-----~ 75 (681) T protein:vir:10 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYS-----V 75 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeC-----C Confidence 99999999999999 668999999999999999999999999999999999999998764 468888876 4 Q ss_pred ceEEEEEecCCeEEEEECCCCEEEEec-C--cccccccCCCcccEEEEEecCEEEEeeCCcceeeeecccCCCCCCccEE Q lcl|NC_012418. 75 SIAMLVAQHRGELYLFDERDGRLLMGQ-P--LVHDYLKAADYRQLRAATVADDLFIANLSVKPEADRTDVKGVDPNKAGW 151 (826) Q Consensus 75 e~~~i~~~~~g~irv~d~~~g~~~~~~-~--~~~~yl~~~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~ 151 (826) +++|++++++++|||| .+++.++..+ + ..++| +++++.+|+|+|+||+|||||+++||++... ....+|+-.++ T Consensus 76 ~~~~~l~~g~~~~r~~-~~~~~~~~~~~~~~~~tpy-~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r-~~~~~W~l~~~ 152 (681) T protein:vir:10 76 TQTMVIELGAGYFRFH-TNGGTLLDGAVPYEIANPY-AEADLFNIHYVQSADVLTLVHPNYAPRELRR-LGATNWQLATI 152 (681) T ss_pred CceEEEEEeCCeEEEE-eCCcEEeeCcEeEEecCCC-ChhhhcCceEEEEcCEEEEECCCCcceEEEE-ccCCceEEEEE Confidence 7888899999999999 4566554321 1 12334 5667789999999999999999999988422 22222221111 Q ss_pred EEEcccccCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeeeecc Q lcl|NC_012418. 152 LYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKY 231 (826) Q Consensus 152 ~~vr~g~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~ 231 (826) .+..+ .+. ..+++ ++ ... .+.. .+..+ T Consensus 153 ~f~~~-p~~-p~~~~---------------at--~~~-----------------------------~~~~-----~t~~~ 179 (681) T protein:vir:10 153 AFTSP-VAT-PTSVT---------------AT--SNN-----------------------------KGTD-----YTYRY 179 (681) T ss_pred Eeccc-ccc-ceeee---------------ee--ccC-----------------------------Cccc-----eeEeE Confidence 11110 000 00000 00 000 0000 00000 Q ss_pred ceecccccccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEee Q lcl|NC_012418. 
232 PKVDPDTAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDG 311 (826) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~ 311 (826) ...+.+... . ..+. ..... .+....-+. +... . T Consensus 180 ~v~avda~t--------------~-----------~~s~---~~~~~--tvt~~~~~~-------------~~~~----t 212 (681) T protein:vir:10 180 VVTALDAEG--------------K-----------TESA---PSSAG--TCTNNLFTN-------------GGAN----T 212 (681) T ss_pred EEEEeeccc--------------c-----------eeec---CCcce--EEeeeeecC-------------Ccce----e Confidence 000000000 0 0000 00000 000000000 0000 0 Q ss_pred eEecCCCccceEEEEEecCCceEEEeecccccccccceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccCCCce Q lcl|NC_012418. 312 AVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVTRGIT 391 (826) Q Consensus 312 ~~~~~~~~~~~~y~~~~~~~~~w~E~~~~g~~~~~~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~ 391 (826) ..+...... ..|-.+....++|... +... +....+..+ .+......+...+|+.++ ++||+ T Consensus 213 ~~w~a~~g~-~~~~V~~~~~gi~g~i-g~~~--~~~~~~~~~--------------~~~~~~t~~~~~~~~~~~-~gyP~ 273 (681) T protein:vir:10 213 IAWSASSGA-SRYNVYKEQGGLYGYI-GQTT--GTSLVDDNI--------------APDLSVTPPIYDAVFNAA-GDYPA 273 (681) T ss_pred EEEEecCCc-eeeeecccceeEEEEe-eccc--eeeeeeccc--------------ccCccccccccccccccC-CCceE Confidence 011111111 1111111222333221 1110 000000000 111111122334666554 56899 Q ss_pred EEEEEcceEEEe----cCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEe Q lcl|NC_012418. 
392 GMTTFQGRLVLL----SQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVP 467 (826) Q Consensus 392 ~v~~~q~RL~~~----~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~ 467 (826) +|+||||||+|+ +||+|||||+||||||+++++ ++|||||++++++++++.|+|+++++ +|+|||+++||.|+ T Consensus 274 ~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~--~~ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~ 350 (681) T protein:vir:10 274 AVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLP--VRDDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVA 350 (681) T ss_pred EEEEEcceEEEeeCCCCCcEEEEEcccCcccccccCC--CCCCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEe Confidence 999999999999 589999999999999999984 57999999999999999999999995 79999999999998 Q ss_pred C--CccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCC-CeE Q lcl|NC_012418. 468 G--GGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPG-PAE 544 (826) Q Consensus 468 ~--~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~-~v~ 544 (826) + +++|||+|++++++|.|++ ++++|+.+|++++|+|++|+ .||||.|+.+.|+ |+++|+|++++|++++ +|. T Consensus 351 ~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~fv~~~g~---~vre~~y~~~~d~-~~~~dlt~~a~Hl~~~~~i~ 425 (681) T protein:vir:10 351 SVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTIYGAARGG---HVRELAYNWQANG-FVTGDLSLRAAHLFDNLDIL 425 (681) T ss_pred cCCCccccceeEEEEEeeeecc-ccccceeeCCeEEEEecCCC---EEEEEEEeeecCc-eeccchhhhhhhhcCCCCeE Confidence 7 4699999999999999976 47999999999999999884 7999999977775 9999999999999997 899 Q ss_pred EEEEcCCCCEEEEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEEEEE----CCeEEEEEEeCCCEEEEEEEEee Q lcl|NC_012418. 
545 YIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYFT----GDNLMVLIQKGQEIALGRMHLNS 620 (826) Q Consensus 545 ~~~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~----~d~l~~vv~R~~~~~~~r~~~~~ 620 (826) +|+++++|++++||+++||+|++|+|+ +||+|.|||||+|+|+|++||++ +|+||++|+|++++..+++ +|+ T Consensus 426 ~~a~~~~p~~~~~~v~~dg~l~~~ty~---~eq~v~aW~~~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~y-ie~ 501 (681) T protein:vir:10 426 DMAYAKAPQPIVWFISSSGKLLGLTYV---PEQQIGAWHQHDTDGVFESCAVVAEGNEDRLYAVVRRTIGGNEVRY-VER 501 (681) T ss_pred EEEEecCCCEEEEEEecCCcEEEEEEe---cccceeeEEEEecCCcEEEEEEecCCCCcEEEEEEEecCCCCeEEE-EEe Confidence 999999999999999999999999997 88999999999999999999999 6899999999999988887 444 Q ss_pred cCCcCCCCcccccceEEEEeeccceeeecccccCccccccceEEeeeeeeEeeccEEcccEecCCCeEEEeecCCcCCce Q lcl|NC_012418. 621 LPAREGLQYPKYDYWRRIEATVEGELELTKQHWDLIKDAPAVYQLQPVAGAFMERYQLGVKRETNTKVFLDVPEAVVGSV 700 (826) Q Consensus 621 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~ 700 (826) +.........+. .++||......... . ......|++|+++.+++||..++...+.+|.++|+.+ +.+ T Consensus 502 ~~~~~~~~~~~~---~~vD~~~t~~~~~~-~-----~~sgl~~leG~tv~i~aDG~~~~~~~V~~G~itl~~~----~~~ 568 (681) T protein:vir:10 502 MASRQFDAQADA---FFVDSGLTYSGEPV-S-----HISGLEHLEGKTVSILADGAVHPQRVVTDGAIDLDVE----AGT 568 (681) T ss_pred cCCccccccccc---eEeeccccccCcce-e-----eeccccCCCCcEEEEEeCCeecCcEeecCcEEEeCcC----Cce Confidence 444333222222 23555443322110 0 1122247899999999999999999999999999865 567 Q ss_pred EEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccccccccccccCcccc Q lcl|NC_012418. 701 YVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLV 780 (826) Q Consensus 701 v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 780 (826) |+|||+|+++++|+||+++.++|...++ ++||+|+.|++.+|.+++++++...... 
...+.+.+++ ..+++ T Consensus 569 v~VGl~Y~s~i~~lp~~~~~~~g~~~g~-~~ri~rv~lr~~~S~g~~~~~~~~~l~~--~~~~~~~~~g------~~~~l 639 (681) T protein:vir:10 569 VHIGLPITAELQTLPVAMQLDGSFGQGR-VKNINKLWLRVHRSSGIFAGPHADALTE--VKQRTSEPYG------SPPAL 639 (681) T ss_pred EEEeeeceeEEEecceeeecCCcccCCc-eEEEEEEEEEEEcccceEEeeCCCceEE--EEEecccccc------ccCCc Confidence 9999999999999999999998877654 6799999999999999999876543322 2222222221 23578 Q ss_pred ccceEEEEee-cccceeEEEEEECCCCCEEEEEEEEEEEEec Q lcl|NC_012418. 781 DSAVVPLPAR-VAMATSKFELSCHSPYDMNVRAVEYNFKSNQ 821 (826) Q Consensus 781 ~tg~~~~p~~-~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~ 821 (826) +||++++|+. +|+++.+|+|+|++|+||+|+||+||....- T Consensus 640 ~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~vgg 681 (681) T protein:vir:10 640 KSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIAIGA 681 (681) T ss_pred cCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEEeeC Confidence 9999999986 7899999999999999999999999999987 No 24 >protein:vir:7329 Length: 825 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848220;genbank:gi:30387391;genbank:GeneID:2641863 Probab=100.00 E-value=3.8e-150 Score=839.76 Aligned_cols=708 Identities=13% Similarity=0.096 Sum_probs=479.9 Q ss_pred Ccceeeechhhhcc-----cccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCccc-ccccccEEEEEeCCC Q lcl|NC_012418. 1 MSYKQSAYPNLLMG-----VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQ-PWPRPFLYHTNLGGR 74 (826) Q Consensus 1 M~~v~~s~~n~~gG-----VSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~-~~~~~~~~~~~rd~~ 74 (826) |+. +...+||.+| +..|.|++||+++|++|+||+++|+||++|||||+||+++++++. .+++||.|+ . 
T Consensus 1 m~~-~~~q~sF~~GElsP~l~gR~Dl~~y~~g~~~~~N~~~~p~Gg~~rRpGt~fva~~~~~~~~~rLipF~fs-----~ 74 (825) T protein:vir:73 1 MAF-SWIQPSFAGGEIGPSLYGRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPAKYPDRKCRLIPFQFS-----T 74 (825) T ss_pred Ccc-ceeccccccceechhhcccchHHHHHHHHHHhcCcEEEecCCceecCchHHhHhhcCCCCCEEEEEEEeC-----C Confidence 873 4567899999 778999999999999999999999999999999999999997654 488888776 4 Q ss_pred ceEEEEEecCCeEEEEECCCCEEEEecC----cccccccCCCcccEEEEEecCEEEEeeCCcceeeeecccCCCCCCccE Q lcl|NC_012418. 75 SIAMLVAQHRGELYLFDERDGRLLMGQP----LVHDYLKAADYRQLRAATVADDLFIANLSVKPEADRTDVKGVDPNKAG 150 (826) Q Consensus 75 e~~~i~~~~~g~irv~d~~~g~~~~~~~----~~~~yl~~~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a 150 (826) +++|++++++++||||.. +|.++...+ ..++| ++++..+|+++|+||+|||+|+++||++.... ...++.-.. T Consensus 75 ~q~y~Lefg~~~lrv~~~-gg~v~~~~~~~~e~~TPy-~~~~l~~l~~~QsaD~~~i~h~~~pp~~L~r~-~~~~W~l~~ 151 (825) T protein:vir:73 75 VQTYALEFGHNYMRVIKD-GAYVLTTSNVIYELAMPY-ADTDLFRIKFTQSADVLTLVHPAYPPKELRRY-AHDNWQIVD 151 (825) T ss_pred CcEEEEEEeCCeEEEEeC-CceEeccCCceEEEeccc-chhhhhhheeeeecCEEEEEcCCCceeEEEEe-cCCCcEEEE Confidence 688888889999999964 554432221 23445 56677899999999999999999999885322 122333232 Q ss_pred EEEEccccc--CeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeee Q lcl|NC_012418. 151 WLYIKAGQY--SKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNST 228 (826) Q Consensus 151 ~~~vr~g~Y--~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t 228 (826) +.+..+..+ +...++++.........+...++....+++.... .. +.......+. .|..... 
T Consensus 152 ~~f~~gp~~~in~~~sv~v~asg~tg~~TiTaS~a~~~~~~vG~~---------i~---~~~~~v~si~---~~~~~~~- 215 (825) T protein:vir:73 152 VTTKNGPFEDINVDETVKVYASASTGTITLTASSAIFGAEQVGKL---------FY---LEQPAVDSVP---VWETSKT- 215 (825) T ss_pred EeccCCccccccccccceeeecccCceeEEEeeccccCchhcCeE---------EE---Eecccccccc---eeeeeeE- Confidence 222211111 1111222222111111111111110000000000 00 0000000000 0000000 Q ss_pred eccceecccccccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEE Q lcl|NC_012418. 229 KKYPKVDPDTAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQF 308 (826) Q Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~ 308 (826) .. .+-+......... .....+.....|... .+... T Consensus 216 -------------------~~----~~~v~~~~~~~~~-------------------~~~~~~~~t~~~~a~-~g~~~-- 250 (825) T protein:vir:73 216 -------------------TA----INDVRRADSNYYR-------------------ANTSGKTGTLRPSHT-EGMSW-- 250 (825) T ss_pred -------------------EE----eeeEEECCCceee-------------------eecccccceeecccc-CCcee-- Confidence 00 0000000000000 000000111111110 11110 Q ss_pred EeeeEecCCCccc-eEEEEEecCCceEEEeecccccc-----cccceeEEEEEecCCCeeEEeecCCcccccCCcccccC Q lcl|NC_012418. 309 MDGAVMATGSTKA-PVYFEWDSANRRWAERAAYGTDW-----VLKKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPT 382 (826) Q Consensus 309 ~~~~~~~~~~~~~-~~y~~~~~~~~~w~E~~~~g~~~-----~~~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~ 382 (826) ........... ..|.....+.+.++.+..++... ....||+.++ +..+++++++...|.. +| T Consensus 251 --~~~~g~~~~~~~~~~~~~~~~~g~~~it~~~~~~~~~~~~~~~~~~~~~~-~~~~~t~~~~~~~~~~-------~~-- 318 (825) T protein:vir:73 251 --DGWGGTGSDDTGIQWEYLHSGFGIAKITAVAGDGLTATADVVSFIPSQVV-GSANASYKWAKYAWNS-------VN-- 318 (825) T ss_pred --EeeeeecccCCceEEEEEecCCceEEEeeccccceeeccccceecccccc-cCCCCCcccccCCccc-------CC-- Confidence 00111111111 11222234455566555443211 1223444444 4556667777766743 23 Q ss_pred ccccCCCceEEEEEcceEEEe----cCCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEE Q lcl|NC_012418. 
383 FNFVTRGITGMTTFQGRLVLL----SQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVF 458 (826) Q Consensus 383 psf~g~~~~~v~~~q~RL~~~----~~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~ 458 (826) .||+.|+||||||+|+ +|++|||||+||||||++++ +++|||||+++++++++|.|+|+++++ +|+|| T Consensus 319 -----gyPs~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~--~~~DdD~I~~~~s~~~~~~i~~~~~~~-~L~~~ 390 (825) T protein:vir:73 319 -----GYPSTVVYYQQRLYFAASTAYPQTIWASRTGDYKDFGKNN--PIQDDDRIIYTYAGRQVNEIRHLIDVG-NLVAL 390 (825) T ss_pred -----CCccEEEEEcceEEEeecCCCCCEEEEEccCCccccccCC--CCCCCccEEEEEcCCcceeEEEEeecC-cEEEE Confidence 4667799999999999 58999999999999999998 467999999999999999999999985 89999 Q ss_pred ecCcEEEEeCC--ccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHH Q lcl|NC_012418. 459 AKKYQAVVPGG--GIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIP 536 (826) Q Consensus 459 t~~~q~~~~~~--~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~ 536 (826) |+++||+|+++ ++|||+|++++++|.|+++ +++|+.+|++++|+|++|+ +||||.|+.+.++ |+++|+|+|++ T Consensus 391 t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~~-~~~Pv~vg~~~~Fv~~~g~---~vre~~~~~~~d~-~~~~dlt~~a~ 465 (825) T protein:vir:73 391 TSGGEYTISGDQNKVLTPSAFSFSSQGNNGSS-NVPPIAVANIALFIQEKGS---VVRDLAYSFDVDG-YQGTDLTILAN 465 (825) T ss_pred ecCceEEEecCCCcccceeeEEEEeeeeeccc-cccceEeCCeEEEEeCCCC---eEEEEEEeeecCc-eeccchhhhhH Confidence 99999999875 6999999999999999775 6999999999999998774 7999999977765 99999999999 Q ss_pred HhcCC-CeEEEEEcCCCCEEEEEEcCCCeEEEEEEeeCCCceeeEeeEeeecCCcEEEEEEE----CCeEEEEEEeCCCE Q lcl|NC_012418. 
537 SYMPG-PAEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYFT----GDNLMVLIQKGQEI 611 (826) Q Consensus 537 ~~~~~-~v~~~~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~----~d~l~~vv~R~~~~ 611 (826) |++++ ++.+|+|+++|++++|++++||+|++|+|+ +||+|.|||||+++|+|++||++ +|+||++|+|++++ T Consensus 466 hl~~~~~~~~~a~~~~p~~~~~~v~~dg~l~~~ty~---~~q~v~aW~~~~~~g~v~~~~~i~~~~~D~l~~iV~R~~~g 542 (825) T protein:vir:73 466 HLFQKHSIVDWSFCIVPYSSAFCIRDDGKLLVLTYL---RDQQVFAWAPQSSAGKYESTCSISEGSEDAVYFVVNRTING 542 (825) T ss_pred hhccCCceEEEEEcCCCceEEEEEecCCeEEEEEEe---ccccceeeEEEecCCcEEEEEEecCCCccEEEEEEEEeeCC Confidence 99997 799999999999999999999999999997 89999999999999999999999 68999999999998 Q ss_pred EEEEEEEeecCCcCCCCcccccceEEEEeec-----------------------------cceeeeccc------c---- Q lcl|NC_012418. 612 ALGRMHLNSLPAREGLQYPKYDYWRRIEATV-----------------------------EGELELTKQ------H---- 652 (826) Q Consensus 612 ~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------------~~~~~~~~~------~---- 652 (826) +.+||.| ++......+.++..|.++...+. ++..+.+.. + T Consensus 543 ~~~~yiE-~~~~~~~~~~~~~~~vD~g~~~~g~~~~~~l~~l~g~tv~~~~~g~~~~~v~~g~itl~~~~~~~i~l~~~~ 621 (825) T protein:vir:73 543 QTVRYIE-RLSSRLFTNDEDAFFVDCGLSYDGRNTSSRTMTISGGTGDWSYQVDYPVTVSGGAYFVNTDVGAQIQFPYTG 621 (825) T ss_pred ceEEEEE-EecccccCCCcceeEEEEEeeecccceeeceeeeCCceEEEEeCCeEEEEEcCCeEEecccceEEEEecccC Confidence 8888744 44443444444433333221110 000000000 0 Q ss_pred ---------------------------------------------cC--ccccccceEEeeeeeeEeeccEEcccEecCC Q lcl|NC_012418. 653 ---------------------------------------------WD--LIKDAPAVYQLQPVAGAFMERYQLGVKRETN 685 (826) Q Consensus 653 ---------------------------------------------~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 685 (826) +. 
...-....|+||+++.+++||..++...+.+ T Consensus 622 ~~~~~~~~~~~~~~~~i~~~~~~~~v~v~~~~~~~a~~~~~~~t~~~~a~~~~~gL~hLeG~~v~v~~Dg~~~~~~~V~~ 701 (825) T protein:vir:73 622 TDPDTNEPVAKELRGDIISVTSNTAVVVRFNRNVPPVLRNVATTNWQMARQTFSGLAHLEGQTVNILSDASVEPQKTVTG 701 (825) T ss_pred cccccccceeceeeEEEccccCceEEEEEecccccceeeeecccCCCcchheeccccccCCceEEEEECCeeeCCeEecC Confidence 00 0000223489999999999999999999999 Q ss_pred CeEEEeecCCcCCceEEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeecc Q lcl|NC_012418. 686 TKVFLDVPEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTP 765 (826) Q Consensus 686 g~~~l~~~~~~~~~~v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~ 765 (826) |.++|+.| +++|||||+|+++++||||++.. +|.++++ ++||+++.++|++|.+++++.+..... ... T Consensus 702 G~vtl~~~----~~~v~vGl~y~~~~~~l~~~~~~-~g~~~g~-~~ri~~~~~~~~~s~~~~~g~~~~~l~---~~~--- 769 (825) T protein:vir:73 702 GAVTLESP----GAVVHIGLPITAEFETLDINING-QETLLDK-KQVIPTVTMVVNASRGIWATTPGGTWY---EYP--- 769 (825) T ss_pred cEEEecCC----ceEEEEeeCccceEEecccccCC-CccccCc-cEEEEEEEEEEEeeeeEEEecCCCcce---Eee--- Confidence 99999976 46799999999999999999875 4777654 458999999999999999875443221 111 Q ss_pred ccccccccccCc-cccccceEEEEe-ecccceeEEEEEECCCCCEEEEEEEEEEEEecc Q lcl|NC_012418. 766 LRLFSRQLNAGE-PLVDSAVVPLPA-RVAMATSKFELSCHSPYDMNVRAVEYNFKSNQT 822 (826) Q Consensus 766 ~~~~~~~~~~~~-~~~~tg~~~~p~-~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r 822 (826) .|- .+. 
.++ |+++||++++++ .+|+++.+|+|+|++|||||||||..|...+=- T Consensus 770 ~r~--~~~-~~~~~~~~tG~~~~~~~~~~~~~~~~~i~q~~PlP~tvlav~~~~~~~g~ 825 (825) T protein:vir:73 770 QRE--FEF-YDDPVDDATGKVEVKLDSNWDKNGRVKVRQLDPLPLSVLAVLPRLTVGGF 825 (825) T ss_pred ccC--CCc-ccCCCccccCcEEEecCCCCCCccEEEEEEcCCCCEEEEEEEEEEEecCC Confidence 121 122 344 578999999998 589999999999999999999999999887665 No 25 >protein:vir:102644 Length: 594 # NCBI annotation: Hypothetical protein # Family: family:all:780 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024422;genbank:gi:48696643;genbank:GeneID:2948111 Probab=100.00 E-value=2.2e-133 Score=747.82 Aligned_cols=558 Identities=11% Similarity=0.052 Sum_probs=437.6 Q ss_pred Ccceeeechhhhcc-----cccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCccc-ccccccEEEEEeCCC Q lcl|NC_012418. 1 MSYKQSAYPNLLMG-----VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQ-PWPRPFLYHTNLGGR 74 (826) Q Consensus 1 M~~v~~s~~n~~gG-----VSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~-~~~~~~~~~~~rd~~ 74 (826) |+.+ +..||.+| +..|.|++||+++|++|+||++.|+||+.|||||+|++++++++. .++.||.|+ . T Consensus 1 m~~~--~~~~F~~GelsP~l~~r~Dl~~y~~~~~~~~n~~~~~~G~~~rR~G~~~~~~~~~~~~~~~lipF~~s-----~ 73 (594) T protein:vir:10 1 MADF--SQTSFKGGVIAPRLQFNEYESAYHHSIEDAVNFVVTEQGSLITRCGSEEVGLCQDGEVRLFRLPAVDA-----P 73 (594) T ss_pred Ccee--eccccCcceecceeccchhHHHHHHHHhhhhceEEEecCCeecCChhHhhhhccCCCCCEEEEEEEeC-----C Confidence 9998 48899999 558999999999999999999999999999999999999997654 578888876 4 Q ss_pred ceEEEEEecCCeEEEEECCCCEEEEecC-----ccccccc--CCCcccEEEEEecCEEEEeeCCcceeeeecccCCCCCC Q lcl|NC_012418. 75 SIAMLVAQHRGELYLFDERDGRLLMGQP-----LVHDYLK--AADYRQLRAATVADDLFIANLSVKPEADRTDVKGVDPN 147 (826) Q Consensus 75 e~~~i~~~~~g~irv~d~~~g~~~~~~~-----~~~~yl~--~~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~ 147 (826) ++.|+++++++++|+| +.++..+.... ..++|.. .....+|+|+|++|+++|+|+.++|++.... 
T Consensus 74 ~~~~~le~g~~~~r~~-~~~~~~v~~~~~~~~~~~tp~~~t~~~~l~~i~~tqsad~~~~~~~~~~p~~L~R~------- 145 (594) T protein:vir:10 74 SNDVIVEVGNTNIAVW-VNDVRQVVANTPSEWRNTIDRIQTAYDTIGDDAGAANTGRLIMVHPALQPKRLYRD------- 145 (594) T ss_pred CCeEEEEEcCCeEEEE-ecCcEEEEccCCCcccccccceeeccCCccceEEEEEeeEEEEEcCCCCceEEEEc------- Confidence 7888889999999999 45665544321 1112211 1245689999999999999999988642100 Q ss_pred ccEEEEEcccccCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeee Q lcl|NC_012418. 148 KAGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNS 227 (826) Q Consensus 148 ~~a~~~vr~g~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~ 227 (826) ..+ .|.+ T Consensus 146 -------~~~---------------------------------------------------------------~w~~--- 152 (594) T protein:vir:10 146 -------NNN---------------------------------------------------------------AWQF--- 152 (594) T ss_pred -------cCC---------------------------------------------------------------CceE--- Confidence 000 0000 Q ss_pred eeccceecccccccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEE Q lcl|NC_012418. 228 TKKYPKVDPDTAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQ 307 (826) Q Consensus 228 t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~ 307 (826) T Consensus 153 -------------------------------------------------------------------------------- 152 (594) T protein:vir:10 153 -------------------------------------------------------------------------------- 152 (594) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EEeeeEecCCCccceEEEEEecCCceEEEeecccccccccceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccC Q lcl|NC_012418. 308 FMDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVT 387 (826) Q Consensus 308 ~~~~~~~~~~~~~~~~y~~~~~~~~~w~E~~~~g~~~~~~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g 387 (826) ...+|..+..++.+ . 
T Consensus 153 ---------------------------------------------------------~~~~~~~~p~~~~~--------~ 167 (594) T protein:vir:10 153 ---------------------------------------------------------VNMHTGAVPAEWSP--------S 167 (594) T ss_pred ---------------------------------------------------------EecccCcccccccC--------C Confidence 00000000000000 2 Q ss_pred CCceEEEEEcceEEEec----CCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcE Q lcl|NC_012418. 388 RGITGMTTFQGRLVLLS----QEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQ 463 (826) Q Consensus 388 ~~~~~v~~~q~RL~~~~----~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q 463 (826) +||++|+||||||+|++ |++|||||+||||||++++++ .|||||++++ +++.+.| |++++.++|+|||+++| T Consensus 168 ~~p~~v~f~q~RL~f~~~~~~p~~v~~Srtgd~~nF~~~~~~--~ddd~i~~~~-s~~~~~~-~~v~~~~~L~i~t~~~e 243 (594) T protein:vir:10 168 NYPQTVGIFQNRVWYVGSPVHRTYFWATRAGKLEDIAPSTAN--NPNDPISFVG-IMEGTPC-WIIASSDVLTIGTTIND 243 (594) T ss_pred ccceEEEEEeeeEEEEeCCCCCceEEEEecccccccccCCCC--CCCccEEEEE-ecccceE-EEEecCCceEEEecCce Confidence 46788999999999998 578999999999999999854 6999999965 4565555 55777889999999999 Q ss_pred EEEeCC--ccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcC- Q lcl|NC_012418. 
464 AVVPGG--GIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMP- 540 (826) Q Consensus 464 ~~~~~~--~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~- 540 (826) |+|+++ ++|||+|+.++++|.+ +++.++|+.+|+.++|+|++|+ +||||.|+.+.++ |+++|||+|++|+++ T Consensus 244 ~~l~~~~~~~lTp~~~~~~~~s~~-g~~~~~P~~vg~~~~fv~~~g~---~vre~~y~~~~d~-y~~~dlt~~a~hl~~~ 318 (594) T protein:vir:10 244 YQLAASTGVSVTAATAILRRSSVQ-GTAAVQGIPAEEQVIFCSRNKS---KVYAMNYVREQDN-WIPDEMSSQAQHLFTP 318 (594) T ss_pred EEEecCCCcccccceEEEEEeeee-ccCCCcceeeCCeEEEEcCCCC---EEEEEEEeeccCc-eeccchhhhhhhhcCc Confidence 999875 5899999999999975 6689999999999999998774 7999999977765 999999999999974 Q ss_pred ------CCeEEEEEcCCCCEEEEEEcCCCeEEEEEEeeCCCceeeEeeEeee-cCCcEEEEEEE----CCeEEEEEEeC- Q lcl|NC_012418. 541 ------GPAEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWT-LRHQIIGTYFT----GDNLMVLIQKG- 608 (826) Q Consensus 541 ------~~v~~~~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~v~aW~~w~-~~g~v~~~~~~----~d~l~~vv~R~- 608 (826) ++|.+|+|+++|++++||+++||.|++++|+ +||+|.|||||+ ++|+|++||+| +|++|++|+|. T Consensus 319 ~~~~~~~~i~~~a~~~~p~~~~~~v~~dG~l~~~ty~---~eq~v~aWs~~~~t~G~v~~va~i~~~~~d~l~~~V~R~~ 395 (594) T protein:vir:10 319 ISSAKGASVRRVAYISDAAKSLWVVLENGQINYCCFD---RTTDTKAWTQLELSGGKVIDIAAAFNPDSDYAYVAVVRSK 395 (594) T ss_pred cccccCceEEEEEEecCCceEEEEEeCCCeEEEEEEe---cccceeeeEeeccCCCcEEEEEEeecCCCCEEEEEEEECC Confidence 5899999999999999999999999999996 899999999998 58999999998 79999999994 Q ss_pred -CCEEEEEE-EEeecCCcCCCCcccccceEEEEeeccceeeecccccCccccccceEEeeeeeeEeeccEEcccEecCCC Q lcl|NC_012418. 609 -QEIALGRM-HLNSLPAREGLQYPKYDYWRRIEATVEGELELTKQHWDLIKDAPAVYQLQPVAGAFMERYQLGVKRETNT 686 (826) Q Consensus 609 -~~~~~~r~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 686 (826) +++..+|+ ++|++......+.....|. 
++......++ ....|++|.++.+++||..++...+.+| T Consensus 396 ti~g~~~~y~~lE~~~~~~~~~~~~~~~~---d~~~~~~~~v----------sgl~hLeg~tv~v~aDG~~~~~~~V~~g 462 (594) T protein:vir:10 396 AINGVQKNYTVLEKISSPRTDWKRADGWV---VAQVNQNGDV----------LNLDRYIGRTAVIFSKYGLEAEVEVNNI 462 (594) T ss_pred ccccceeeEEEeecCCCccccccccceee---eeccccccee----------ecccccCCceEEEEeCCeecCCeEEcCC Confidence 57777776 3555544433333332222 2222111111 1234899999999999999999999999 Q ss_pred eEEEeecCCcCCceEEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccc Q lcl|NC_012418. 687 KVFLDVPEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPL 766 (826) Q Consensus 687 ~~~l~~~~~~~~~~v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~ 766 (826) .++|+..++..+++|||||+|+++++++||++++++|+.+++ |+||+|++|+|++|.+++++.+........... T Consensus 463 ~itL~~~~~~~~~~v~VGl~Y~s~i~~lp~~~~~~~gs~~g~-r~ri~r~~v~~~~S~g~~vg~~~~~~r~~~~~~---- 537 (594) T protein:vir:10 463 GLTHRINGYDPNTVYYVGYKMDSYFRTLTPSNGDMKKSMFGS-KIRISKVQLALFDSIEPTVNGEPADDRSTDDIM---- 537 (594) T ss_pred eeEeeccCCCCcceEEEeeeeeEEEEeecccccCCcccccCc-cEEEEEEEEEEEcceeeEECCcccccccchhhc---- Confidence 999998888889999999999999999999999999987766 889999999999999999876543221111111 Q ss_pred cccccccccCccccccce--EEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecc Q lcl|NC_012418. 767 RLFSRQLNAGEPLVDSAV--VPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKSNQT 822 (826) Q Consensus 767 ~~~~~~~~~~~~~~~tg~--~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r 822 (826) ........+.|++.+|. 
+.++..||+++.+|+|+|++|+|||||||.+|.+.|+= T Consensus 538 -~~~~~~~~g~~~~~tg~~~v~~~~~G~~~~~~i~I~qd~PlPltvlai~~ev~~~~~ 594 (594) T protein:vir:10 538 -DARLLDFSSNSGSSNGTRLVDYNPLGWENDGKMVIAVEQPFLCEVVGVFSVVQSNKV 594 (594) T ss_pred -cccCCcccCcccccCCceEEEEccCCcCcccEEEEEECCCcCEEEEEEEEEEEeccC Confidence 11123445666666665 55566799999999999999999999999999999998 No 26 >protein:vir:94602 Length: 1012 # NCBI annotation: PfWMP4_35 # Family: family:all:12083 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762665;genbank:gi:115304373;genbank:GeneID:5142302 Probab=99.62 E-value=5.2e-14 Score=93.39 Aligned_cols=780 Identities=13% Similarity=0.087 Sum_probs=319.1 Q ss_pred Ccc-----eeeechhhhcc--cccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEEEEeCC Q lcl|NC_012418. 1 MSY-----KQSAYPNLLMG--VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGG 73 (826) Q Consensus 1 M~~-----v~~s~~n~~gG--VSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~~~rd~ 73 (826) |-. +++-..+=.+| +|.-|-..-|.+ --...||=.+..|-+.||.||+.+-.-...... ..-+.+.+.--= T Consensus 1 mtqQQ~~eiqG~~t~~F~GL~~s~S~~~IP~~~-SP~~~N~DV~~~G~V~rR~GT~l~~~Y~inn~s-~~~~s~~irt~L 78 (1012) T protein:vir:94 1 MTQQQATEIQGPFTREFSGLDISNSVGAIPVSG-SPVFHNCDVSDDGAVVRRRGTALVNTYNINNAS-GRAWSDTIRTKL 78 (1012) T ss_pred CCccccccccccccccccccccccccccccccC-CCceEEeecccCcceeehhhhhhhhhhcccccC-cceeeeeehhhc Confidence 332 11112222233 333332222222 124578889999999999999998644311111 111233332111 Q ss_pred CceEEEEEecCCeEEEE---ECCCCEEEEecCcccccccCCCcccEEEEEe---cCEEEEeeCCcceee-e-ecc----c Q lcl|NC_012418. 74 RSIAMLVAQHRGELYLF---DERDGRLLMGQPLVHDYLKAADYRQLRAATV---ADDLFIANLSVKPEA-D-RTD----V 141 (826) Q Consensus 74 ~e~~~i~~~~~g~irv~---d~~~g~~~~~~~~~~~yl~~~~~~~l~~~~v---aD~~fi~n~~~~~~~-~-~~~----~ 141 (826) ...|+|+-..-|-+.+. +++=|........-...+..-.+++..|+-+ -|-+.|+-+++||.. + ... . 
T Consensus 79 G~eYfiLs~~~GLL~~~~~~~~AVG~~K~~a~V~~ss~~~V~Pssm~F~~~S~~~~R~LILT~~~~~VQ~~F~E~T~s~T 158 (1012) T protein:vir:94 79 GSEYFILSNDVGLLISLMRDDEAVGMPKEVAVVSKSSIWTVPPSSMCFIPVSAPYDRLLILTPEHPIVQLSFLERTLSFT 158 (1012) T ss_pred cceeEEEecCCceEEEeeecccccccchhhhhhhhhhccccCCcceEEEeccCCCCcEEEEcCCCceEEEEEeeeeeeee Confidence 12344554444433332 2222211111111111212223446666653 467888888888742 1 111 1 Q ss_pred CCCCCCccEEEEEccccc--------------------CeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccce Q lcl|NC_012418. 142 KGVDPNKAGWLYIKAGQY--------------------SKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPF 201 (826) Q Consensus 142 ~~~~~~~~a~~~vr~g~Y--------------------~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~ 201 (826) ...+.....+-+-.--+| +++|.+++...+-+. ..++...-+. .. T Consensus 159 ~~t~~~~~V~~~~a~~~~~~~~L~~~~N~sS~~~~~~~~T~~AmT~~NP~~S~------~ls~~~V~~q---------ty 223 (1012) T protein:vir:94 159 CTTNHGGGVFSFTAPISVNDTTLWRDTNASSYIVTDAAGTVYAMTQKNPDFSF------RLSGSFVVGQ---------TY 223 (1012) T ss_pred ccCCccceEeecccceeecCeeEEecccccceeEeeccceEEEEEeeCCceeE------EEEEEEecCc---------cc Confidence 111111111111111122 333333333221111 1111110000 00 Q ss_pred eeccchhhhhh-hheeecccceEEeeee--eccceeccc--ccccccccceE---ecccCCcEEEEEcCCCeEE------ Q lcl|NC_012418. 202 QTSVGYIAWQL-YGKFFGAPEYTLPNST--KKYPKVDPD--TAAATVAGYLN---QRGVQDGYIAFRGDGDIVV------ 267 (826) Q Consensus 202 ~~~~~~i~~~l-~~~~~s~~~~~~~~~t--~~~~~~~~~--~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~------ 267 (826) ......|.++. .++.. +.+...-..+ +....+... ......+..-. 
...+-+-++++.+.-+..- T Consensus 224 tltirqi~W~WWAESm~-~~G~~~~~~~SRFNV~~~DQ~V~IP~~L~tDiD~v~~~~~~~~l~~~~ss~F~~~~~~~~T~ 302 (1012) T protein:vir:94 224 TLTIRQITWQWWAESMY-YEGQDMMQNTSRFNVTSIDQNVKIPDRLITDIDPVYKNSQGLGLFVFWSSRFDSNGWAGPTT 302 (1012) T ss_pred ceeehhhhhhhhhhhHh-hhhhHHHhhhhhcccccccccccchhHHhhhhhhhhhccCCccEEEEEeeeecCceeecCCC Confidence 00000111110 00000 0000000000 000000000 00000000000 0001111222222111000 Q ss_pred --EEeecCC--------------CcceE-EEEEEEeeccccccccccCCccceeEEEEE-eeeEecCCCccceEEEEEec Q lcl|NC_012418. 268 --EVSTDMG--------------NNYGI-ASGGMSLNATADLPALLPGAGTPGTGVQFM-DGAVMATGSTKAPVYFEWDS 329 (826) Q Consensus 268 --~~~~~~g--------------~~~~~-~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~-~~~~~~~~~~~~~~y~~~~~ 329 (826) +..+.-| ..++- +.... ++.++-.-.|..-...-..+.+ ...-..+|++.++--|.-++ T Consensus 303 ~P~~AD~YG~~~G~~~tpp~~~~~A~L~~aPFF~---TFG~~~s~TP~P~~~V~iLR~RELRFN~G~GA~~~~L~V~~D~ 379 (1012) T protein:vir:94 303 SPNTADEYGFSGGGRFTPPSLVPGATLQAAPFFI---TFGGIYSGTPTPINQVNILRLRELRFNGGTGAKPDDLQVYNDT 379 (1012) T ss_pred CCCCcccccccCCceeccccccccceeeccceEE---EeccccCCCCCChhheeeeeeeeeeeccCCCCCCcceEEEEcc Confidence 0000000 00000 01111 1111111111110011111110 11223445555554443333 Q ss_pred CCce-------------EEEeec-----------cccccccc--------------ceeEEEEEecCCCeeEEeecCCcc Q lcl|NC_012418. 330 ANRR-------------WAERAA-----------YGTDWVLK--------------KMPLALRWDEATDTYSLNELDYDR 371 (826) Q Consensus 330 ~~~~-------------w~E~~~-----------~g~~~~~~--------------tmp~~~~~~~~~~~f~~~~~~w~~ 371 (826) .+-. |..+.. 
-|..+..+ ..|..+.-..+..-......-|.+ T Consensus 380 ~~~t~Nnvpfspsnfqt~atT~~~T~R~~~L~~A~G~~~~~A~Y~A~~GATnnlpanaPL~IS~~sA~s~~~~~R~v~~~ 459 (1012) T protein:vir:94 380 VEHTWNNVPFSPSNFQTWATTYTATDRVITLMSAVGDRFNNANYFAILGATNNLPANAPLHISCLSASSYLGGSRRVWYR 459 (1012) T ss_pred eeeeccccccCcccccceeeeeeecceeEEEeeeccccccCcceEEEeecccccccCCccccccccceeeeccceeeeee Confidence 2222 332221 11111110 011111100000000000111222 Q ss_pred cccCCccccc-----------Cc-cccCCCceEEEEEcceEEEec----CCeEEEEecCC------cccCcc-cccccCC Q lcl|NC_012418. 372 RGSGDEDTNP-----------TF-NFVTRGITGMTTFQGRLVLLS----QEYVCMSASNN------PHRWFK-KSAAALN 428 (826) Q Consensus 372 r~~gd~~tnp-----------~p-sf~g~~~~~v~~~q~RL~~~~----~~~v~~S~~gd------~~nF~~-~s~~~~~ 428 (826) .+--|--+-. +- .-.++.+.--+.||.||++.+ +..+.+|.+|| |+||+. ...+.-. T Consensus 460 ~~~T~~~~~~G~Y~r~YGiG~~~~Y~~~~F~~I~TiY~~RLiL~~~s~~~~~~~~S~~GD~~~~G~~Y~F~QiTD~L~G~ 539 (1012) T protein:vir:94 460 NLPTTGGTLDGCYVRAYGIGKYVDYSKRSFHAIGTIYRDRLILVNPSTATDQLLISEIGDATVPGEFYQFMQITDMLQGV 539 (1012) T ss_pred ccccCCceEeeeEEEEEEeeeeeecCCccccceeeeeeeeeEEeccCCCcceEEEeecCCcccCceeeeeeeeehhhccC Confidence 1110000000 00 112445667789999999998 45699999877 899984 4445677 Q ss_pred CCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCC Q lcl|NC_012418. 
429 DDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERAL 508 (826) Q Consensus 429 ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~ 508 (826) |.||+++.+++.-.+.|.-++...+.|++||..+-|.+.|++.++|..-.++..|+|+.-+.---|+..-.|+|..+- T Consensus 540 ~tDPF~L~VtSe~~e~iT~~~~WQ~~LFV~T~~~T~~~~GGe~~~~s~~~VN~vSt~G~~N~~~VV~T~~~V~Ym~~~-- 617 (1012) T protein:vir:94 540 TTDPFTLNVTSEGRERITAVTGWQKRLFVFTGSNTYSIEGGEQFGESSYAVNLVSTYGAFNQNCVVVTNLTVLYMNKF-- 617 (1012) T ss_pred cCCceeEEEcccccceeeeeeeeceeEEEEeccceEeeccccccchhHHHHHhHHhhcccCcceEEEeeeEEEEeecc-- Confidence 899999999998888999999999999999999999999999999999999999999665555557788889999764 Q ss_pred ceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEEEEEcCCCCEEEEEEc--CCCeEEEEEEeeCCCcee-------- Q lcl|NC_012418. 509 GFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAASSGYLVFGTS--TADEMICHQYLWQGNEKV-------- 578 (826) Q Consensus 509 ~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~s~~p~~~v~~~~--~~g~l~~~tyl~~~~e~~-------- 578 (826) ++.++....+.+ .|.+-+-|..+.++|.+- .+..-+.+.|+.- +.++||+ =|..+.|.. T Consensus 618 ---G~F~L~~k~~~~-~Y~A~ErSvKIR~~F~~~-----~~ss~~~~~Wl~~~e~~~~LYi--~L~~~~dT~~~S~~~~~ 686 (1012) T protein:vir:94 618 ---GLFDLMNKPNTD-SYGAFERSVKIRGLFQNL-----AGSSGDNLHWLRYNESSNKLYI--GLAAEGDTRTTSRNLML 686 (1012) T ss_pred ---ceeeccCCccCC-cchhhhhhhhhhhhhhhh-----ccccccceeeeeeccCCceEEE--EecCCCcchhhhhhhhh Confidence 488888877655 499999999999998651 1222233333322 2223332 122222211 Q ss_pred ---eEeeEeeecCCcEEEEEEE----CCeEEEEEEeCCCEEEEEEEEeecCCcCCCCc---------------------- Q lcl|NC_012418. 579 ---QNAFHRWTLRHQIIGTYFT----GDNLMVLIQKGQEIALGRMHLNSLPAREGLQY---------------------- 629 (826) Q Consensus 579 ---v~aW~~w~~~g~v~~~~~~----~d~l~~vv~R~~~~~~~r~~~~~~~~~~~~~~---------------------- 629 (826) -.+|+..++.|.|.---.+ .+...+.+......++-+..+.+--+...+.. 
T Consensus 687 N~~~DSWs~~~s~~~Fq~YP~V~~~~~~t~L~~i~~~~TV~ML~~~~~~YiDFatirthiypF~~CaG~~~~~Vms~~~G 766 (1012) T protein:vir:94 687 NFTWDSWSTLSSAAPFQMYPAVQLFKYMTWLTNINAPLTVAMLATEMPFYIDFATIRTHIYPFTFCAGQRDVSVMSDSRG 766 (1012) T ss_pred hhhhcchhhhhccCCcccchhhhhhhhhhhhhhhcCchhhhhhhhccceeeeeehhcccccceeeeccceeeEEEecCCc Confidence 1478887776654322111 22222222222222221111111111100000 Q ss_pred ----------ccccceEEEEeec-----------cceeeecccccCccccccceEEeee-----eeeEe-eccEEcc--- Q lcl|NC_012418. 630 ----------PKYDYWRRIEATV-----------EGELELTKQHWDLIKDAPAVYQLQP-----VAGAF-MERYQLG--- 679 (826) Q Consensus 630 ----------~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~-~~~~~~~--- 679 (826) ..+|+........ .++-+.+..++ .......+++.+. +...+ .++.+.- T Consensus 767 IY~~~~P~tP~I~~~tit~ss~~~~k~Yq~~T~~~GT~tLt~~~~-~~~~~~~l~LL~~~~~~~~~a~V~~~~~~~~TT~ 845 (1012) T protein:vir:94 767 IYNLPLPVTPGILDYTITASSKAGAKTYQRNTASAGTETLTLRNP-MMDYADTLELLGGNVNASQFAMVMSNGFEPYTTY 845 (1012) T ss_pred eEEecccccceeeeeEeeccchhhhheeccccccccceeeeecCh-hhhcCcEEEEecCCCCccEEEEEeeccccccccc Confidence 0111111111100 00000111111 0111222222221 11111 1111110 Q ss_pred -cEe----------cCCCeEEEe-ecCCcCCceEEEEEeeeeEEEeCCeEEECCCCceeeecce-EEEEEEEEeec--cc Q lcl|NC_012418. 680 -VKR----------ETNTKVFLD-VPEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSARA-VLHRYNVNFGW--TG 744 (826) Q Consensus 680 -~~~----------~~~g~~~l~-~~~~~~~~~v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~-~i~~~~~~~~~--t~ 744 (826) ..+ +.+|. +|. .|--.....+.+|..|.+.+.-+-+.+. + -+|| ||.++.|-+.- +. T Consensus 846 ~TV~~N~~~~lQ~T~~~GS-~L~~~~~LsqN~~~~~G~~Y~S~Y~SP~F~L~-----S--L~~LKr~K~~~L~~Dttvts 917 (1012) T protein:vir:94 846 PTVTYNGVAPLQWTVTGGS-GLNNRPILSQNNNCIMGMIYPSVYASPIFDLE-----S--LGRLKRLKKLHLQMDTTVTS 917 (1012) T ss_pred ceEEecceeeeeEEEecCC-ccccccccccCceEEEeecchhhhcchhhhhh-----h--hhhhhheeeeeEEeeeeeee Confidence 011 11111 000 0000113458899999999885444331 1 1344 57777766654 35 Q ss_pred eEEEEecCCCCCceeeeeec-cccccccccccCccccc------------cceEEEEeecccceeEEEEEECCCCCEEEE Q lcl|NC_012418. 
745 EFLWRISDTARPNQPWYDTT-PLRLFSRQLNAGEPLVD------------SAVVPLPARVAMATSKFELSCHSPYDMNVR 811 (826) Q Consensus 745 ~~~v~v~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~------------tg~~~~p~~~~~~~~~v~i~~~~p~P~tvl 811 (826) .++.++..++.........- .....-.++.. |... --...+|+.+..-+.++.|.+-..-.|.+- T Consensus 918 qlkynltsgfsqvsvlntawvavvsnyneniv--pavvsyqvgnsyeirrvvelsiplqgygcdyqfyiasvgaeafkla 995 (1012) T protein:vir:94 918 QLKYNLTSGFSQVSVLNTAWVAVVSNYNENIV--PAVVSYQVGNSYEIRRVVELSIPLQGYGCDYQFYIASVGAEAFKLA 995 (1012) T ss_pred eeeeehhcccceeeeecceeeeeeeccCcccc--ceeeeeecCCceeeeEEEEEeecccccccceeEeeeeccccceeee Confidence 55555544443322111100 00000011111 1111 113456777888888999999999999999 Q ss_pred EEEEEEEEec--c-ccc Q lcl|NC_012418. 812 AVEYNFKSNQ--T-YRR 825 (826) Q Consensus 812 ~i~~eg~y~~--r-~rr 825 (826) +.+++.+=-+ | .|| T Consensus 996 ayefdiqpqrdkryvrr 1012 (1012) T protein:vir:94 996 AYEFDIQPQRDKRYVRR 1012 (1012) T ss_pred eeeeccccchhhhhccC Confidence 9998877533 3 333 No 27 >protein:vir:80177 Length: 1027 # NCBI annotation: tail tubular protein B # Family: family:all:12083 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285795;genbank:gi:148747829;genbank:GeneID:5220453 Probab=99.42 E-value=2e-11 Score=79.25 Aligned_cols=762 Identities=12% Similarity=0.103 Sum_probs=271.9 Q ss_pred Cc----ceeee------chhhhcc--cccCChhHhcccchhhhhcceeeccCCcccCChhHhHhhhcCcccccccccEEE Q lcl|NC_012418. 1 MS----YKQSA------YPNLLMG--VSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYH 68 (826) Q Consensus 1 M~----~v~~s------~~n~~gG--VSqQ~D~~Ry~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~~~~~~ 68 (826) |- +-+|- ..+=.+| +|.-|=..-|.+ --...|+=.+..|-+.||.||+.+-.-..+...+ .|. 
T Consensus 1 mvnsferrtQQ~~dlG~~s~~F~GL~~t~S~~~IP~~~-SP~~~N~DV~~~G~V~kR~GT~i~~~Y~~t~~~~----t~~ 75 (1027) T protein:vir:80 1 MVNSFERRTQQGDDLGIRSSNFGGLNTTASPLNIPYED-SPNLLNVDVDVSGNVSKRQGTEILLKYANTTPVY----TFP 75 (1027) T ss_pred CCcchhhhhccccccccccccccccccccccccccccC-CCceEEeecccCcceeehhhhhhhhhhccCCcee----eee Confidence 21 11110 1111222 333332222221 1235788899999999999999986554433332 233 Q ss_pred EEeCCCceEEEEEecCCeEEEE---ECCCCEEEEecCcccccccCCCcccEEEE---EecCEEEEeeCCcceee-e-e-- Q lcl|NC_012418. 69 TNLGGRSIAMLVAQHRGELYLF---DERDGRLLMGQPLVHDYLKAADYRQLRAA---TVADDLFIANLSVKPEA-D-R-- 138 (826) Q Consensus 69 ~~rd~~e~~~i~~~~~g~irv~---d~~~g~~~~~~~~~~~yl~~~~~~~l~~~---~vaD~~fi~n~~~~~~~-~-~-- 138 (826) +. ---...||+-..-|-+.+. +++=|........ ... ++++.+- .|+ -+-|-+.|+-+++||.. + . T Consensus 76 vk-s~LG~dYvLt~~~GLL~~~~~~~~AVG~~K~~s~V-~~a-a~~~V~P-~F~~~S~~~~R~LILT~~~~~VQ~~F~E~ 151 (1027) T protein:vir:80 76 VK-SVLGYDYVLTKSGGLLEVAGVIGKAVGAYKSFSNV-FSA-AAANVKP-YFTLLSDVEPRVLILTGTNTPVQVKFVEQ 151 (1027) T ss_pred eh-hhccceeeEecCCceEEEeeecccccccchhhhhh-hhh-hhcccCc-eeEEccCCCCcEEEEcCCCceEEEEEeee Confidence 32 1011223444333333332 2222211111100 011 1222211 222 24567788888877632 1 1 Q ss_pred --cccCCCCCCccEEEEEcccccCeeEEEEEeccCCcceeeeeeeeeEEe-----ccCcc---------c-ccccc---- Q lcl|NC_012418. 139 --TDVKGVDPNKAGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVT-----PDNAS---------T-NPNLA---- 197 (826) Q Consensus 139 --~~~~~~~~~~~a~~~vr~g~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~-----p~~~~---------t-~~~~~---- 197 (826) +.....+.....+-+-.--+|+.+ .+=.+....++.++......+.+ |+-+. . 
.++.+ T Consensus 152 T~t~T~~s~~~~~V~~~~s~~~~~~~-~L~~~~N~tS~~~~~~~~T~~AlT~~NlP~~S~~mt~~~V~~~W~WWAESl~~ 230 (1027) T protein:vir:80 152 TFTTTSGSPTTTVVIPNASRFQYDTP-ILYMNRNFTSGATYSYNSTTRALTISNLPSWSGSMTFDLVLPVWSWWAESLRW 230 (1027) T ss_pred eeeeeccCCccceEeecccceeecCe-eEEecccccceeEeeccceEEEEEeccCCcceeEEEEeEEecchhhhhhHHhh Confidence 111111111111111111122211 01111111111111111111111 11000 0 00000 Q ss_pred -----------ccceeec-cchhhhhhhheee-------cccceEEeeeeeccceecccccccccccceEecccC-CcEE Q lcl|NC_012418. 198 -----------EAPFQTS-VGYIAWQLYGKFF-------GAPEYTLPNSTKKYPKVDPDTAAATVAGYLNQRGVQ-DGYI 257 (826) Q Consensus 198 -----------~~~~~~~-~~~i~~~l~~~~~-------s~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 257 (826) -+++.++ .-+|+..+...+. ..+-.....+++....+......... +-.++. +|-+ T Consensus 231 ~G~~~~~~~SRFNV~~~DQ~V~IP~~L~sDlD~i~~~~~~~~m~~~~ta~F~~~~~~~~T~~P~~----AD~YG~~~G~~ 306 (1027) T protein:vir:80 231 FGDRFYDAVSRFNVNKADQSVAIPAALRSDLDTIQGTYGRYPMLLYKTATFNDTYTFSNTGQPAN----ADSYGWGDGSV 306 (1027) T ss_pred hhhHHHhhhhhcccccccccccchhHHhhhhhhhhhccCCccEEEEEeeeecCceeecCCCCCCC----cccccccCCce Confidence 0011111 0012222211110 00111112222222211111000000 000000 0001 Q ss_pred EEEcCCC-----eEEEEeec--CCCcceEEEEEEEeeccccccccccCCccceeEEEEEee------------------- Q lcl|NC_012418. 258 AFRGDGD-----IVVEVSTD--MGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDG------------------- 311 (826) Q Consensus 258 ~~~~~~~-----~~~~~~~~--~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~------------------- 311 (826) +-..... ..+.+-.+ .+.++-+.. .|-+ ..-+|.=+.-++ ..+.-.++.+. 
T Consensus 307 ~~~~~~A~L~~sPFF~TFG~~~t~TP~P~~~-V~lL-R~RELRFN~G~G-A~~~~L~V~~D~~~~s~N~ssT~~~T~R~~ 383 (1027) T protein:vir:80 307 YNVGASAYLNTSPFFATFGDTRTPTPQPPET-VHLL-RQRELRFNYGNG-ATGANLRVTVDGTALSANYSSTVAGTNRAY 383 (1027) T ss_pred EeecccceeeccceEEEeccccCCCCCchhh-eeee-eeeeeeeccCCC-CCCcceEEEEcceeeeeeeeeeeeecceeE Confidence 0000000 00000000 000000000 0000 000000000000 00001111100 Q ss_pred ----eEec--CCCccceEEEEEecCCceEEEeecccccccccceeEEEEE--ecCCCe-----eEEe-ecCCcccccCCc Q lcl|NC_012418. 312 ----AVMA--TGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRW--DEATDT-----YSLN-ELDYDRRGSGDE 377 (826) Q Consensus 312 ----~~~~--~~~~~~~~y~~~~~~~~~w~E~~~~g~~~~~~tmp~~~~~--~~~~~~-----f~~~-~~~w~~r~~gd~ 377 (826) .++. +++..=.||..+.+++----.|.+.-...+-. -..+... ...+++ |+.- -..|.. T Consensus 384 ~L~~A~G~~~~~A~dlayY~A~~GATPL~IS~~aA~t~~~~~-R~yi~~~~~~T~~~~~~G~Y~k~YGlG~~~~------ 456 (1027) T protein:vir:80 384 ALYKADGTLCTSASDLAYYIAFTGATPLGISPTAAVTITNVD-RTYIGSAATQTDNAYVQGGYFKVYGLGLWAN------ 456 (1027) T ss_pred EEeeeccccccccccceeeeeeeccccccccccceeeeecCc-eeeeeeeccccCCceEeeeEEEEEEeeeeee------ Confidence 0000 01111123433333221000000000000000 0000000 000111 1100 011221 Q ss_pred ccccCccccCCCceEEEEEcceEEEec----CCeEEEEecCC------cccCccc-ccccCCCCccEEEEEcCCC-ceeE Q lcl|NC_012418. 
378 DTNPTFNFVTRGITGMTTFQGRLVLLS----QEYVCMSASNN------PHRWFKK-SAAALNDDDPIEIAAQGSL-TEPY 445 (826) Q Consensus 378 ~tnp~psf~g~~~~~v~~~q~RL~~~~----~~~v~~S~~gd------~~nF~~~-s~~~~~ddD~i~~~~~~~~-~~~i 445 (826) .-.|+.|.--+.||.||++.+ +..+.+|.+|| ++||+.- ..+.-.|.||+++.++++| .+.| T Consensus 457 ------Y~~~~F~~I~TvY~~RLvL~~~t~~~~~~~~S~~GD~~~~G~~Y~F~QvTD~L~G~~sDPF~L~VsSsq~~d~v 530 (1027) T protein:vir:80 457 ------YGTGQFPRIATVYQSRLVLGGFTNDPTRVVFSATGDTVEGGVKYNFFQVTDDLDGLDSDPFDLVVSSSQADDYV 530 (1027) T ss_pred ------cCCccccceeeeeeeeeEEeccCCCcceEEEeecCCcccCceeeeeeeeehhhccCcCCceeEEEeccccccee Confidence 112456777899999999998 45799999987 8999844 4456779999999998866 5677 Q ss_pred EEEeecCCcEEEEecCcEEEEeCCcc-ccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeecccccc Q lcl|NC_012418. 446 EHAVTFNKDLIVFAKKYQAVVPGGGI-VTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDS 524 (826) Q Consensus 446 ~~~v~~~~~L~l~t~~~q~~~~~~~~-lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~ 524 (826) .-++...+.|++||..+-|.+.|++. ++|..-.++..|+|+.-+.-.-|+..-.|+|.++- ++..+....+.+ T Consensus 531 T~~~~WQ~~LFV~T~~~T~~~~GGd~t~~~a~~~VN~iSs~G~~N~~~VV~T~~~V~Yl~~~-----G~F~L~~r~~~~- 604 (1027) T protein:vir:80 531 TGLVEWQSSLFVLTRRATFRANGGDATISPARRFVNYISSLGLVNPFSVVRTDTAVFYLSDS-----GVFNLTPRVEDG- 604 (1027) T ss_pred eeeeeeceeEEEEecceeEEeecCccccchhHHHHHHHHhhcccCcceEEEeeeEEEEeecc-----ceeeccCCccCC- Confidence 78889999999999999999999775 99999999999999665555557788899999764 488888877655 Q ss_pred ccchhHHHHHHHHhcCCCeEEEEEcCCCCEEEEEEcC--CCeEEEEEEeeCCCceee-----------EeeEeeecCCcE Q lcl|NC_012418. 
525 HYVAEDVTSHIPSYMPGPAEYIQAAASSGYLVFGTST--ADEMICHQYLWQGNEKVQ-----------NAFHRWTLRHQI 591 (826) Q Consensus 525 ~~~~~dls~~~~~~~~~~v~~~~~s~~p~~~v~~~~~--~g~l~~~tyl~~~~e~~v-----------~aW~~w~~~g~v 591 (826) .|.+-+-|..+.++|.+- .+..-+++.|+.-+ .++||+ =|..+.|..+ .+|+..++.|.| T Consensus 605 ~Y~A~EkSiKIR~~F~~~-----~~ta~~~~~Wm~~~q~~~~LYv--~L~~~~eT~~~S~~~~~N~~~DSWt~~~t~~~F 677 (1027) T protein:vir:80 605 EYQAIEKSIKIRKVFGKT-----TSTAVSSAAWMSFDQNRKVLYV--ALPRGSETTVASALYVYNTFRDSWTQYDTLGGF 677 (1027) T ss_pred cchhhhhhhhhhhhhhhh-----ccccccceeeeeeccCCceEEE--EecCCCcchhhhhhhhhhhhhcchhhhhcccCc Confidence 599999999999998651 12223333343222 222322 2222222211 478888887765 Q ss_pred EEE---EEE----CCeEEEEEEeCCCEEEEEEEEeecCCcCCCCcccccceEEEEeeccceeeecccccCccccccceEE Q lcl|NC_012418. 592 IGT---YFT----GDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEATVEGELELTKQHWDLIKDAPAVYQ 664 (826) Q Consensus 592 ~~~---~~~----~d~l~~vv~R~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 664 (826) .-- -.+ .|...+.|......++-+..+.+-.+... .|..+-..+-....+.-+++.+.|..+.-.. .. T Consensus 678 k~YtghP~V~~~~~~s~L~~v~~~~TV~ML~~~~~~YvDFF~---~CG~~~~~Vlt~~~GIY~~~~P~wnsP~I~~--~s 752 (1027) T protein:vir:80 678 KTYTGHPYVDTVLGDSFLLMVAYGGTVCMLKLYGSRYVDFFN---KCGSFTGNVLTANSGIYTWTAPFWNSPVISN--IS 752 (1027) T ss_pred ccccCCchhhhhhhhhhhhhhcCchhhhhhhhhcchhhhhhh---hcccceeeEEecCCceeEeecccccCCeeeE--EE Confidence 432 222 45555555555555554444333222111 1222333444455566666666554321100 11 Q ss_pred eeeeeeEeeccEE-------cccEecCCCeEEEeecCCcCCceEEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEE Q lcl|NC_012418. 665 LQPVAGAFMERYQ-------LGVKRETNTKVFLDVPEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYN 737 (826) Q Consensus 665 ~~~~~~~~~~~~~-------~~~~~~~~g~~~l~~~~~~~~~~v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~ 737 (826) ...+....+..+. ++.+.+.+-.|.. .+..+..|-+..-+-+.+-..-...+|+.. 
..+-|+- T Consensus 753 vs~tt~~~~q~Ye~~T~~~vvpydnvedlsiyv------nGT~Ls~~~~~~~~~~~i~LL~~~~~~~~~----s~Vprcp 822 (1027) T protein:vir:80 753 VSGTTTLAVQRYELPTDLQVVPYDNVEDLSIYV------NGTRLSFGTDWVKQGKAIYLLSDPGDGKTV----SIVPRCP 822 (1027) T ss_pred eeccchhhhheeccccccccccccccccceeee------cceeEeecCchhhcCCEEEEecCCCCcceE----EEEeccc Confidence 1111111111111 1222221111110 111111121111111110000001122211 1344555 Q ss_pred EEeeccceE--------EEEecCCCCCceeeeeeccccccccccccCccccccceEEE----------Eee-----cc-- Q lcl|NC_012418. 738 VNFGWTGEF--------LWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLVDSAVVPL----------PAR-----VA-- 792 (826) Q Consensus 738 ~~~~~t~~~--------~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~~----------p~~-----~~-- 792 (826) +++..-+.. .|-++....-....|... +..+.. .|-+....+++ |+. ++ T Consensus 823 vnvsy~~~~~~~~TT~~TV~~N~~~~iQ~Tdy~~~-----GS~L~~-~~~LtN~~~~~G~~Y~S~Y~SP~F~L~SL~~LK 896 (1027) T protein:vir:80 823 VNVSYQGDVTFDETTAQTVWVNNLLQIQGTDYTLS-----GSTLTF-TDTLTNAVVEVGNAYISYYQSPMFLLGSLSNLK 896 (1027) T ss_pred ccccccccccccccccceEEecceeeeccceeeec-----cCcccc-ccccccceEEEeecchhhhcchhhhhhhhhhhh Confidence 444322211 111111110000111100 000000 00010111111 100 00 Q ss_pred -cceeEEEEEECCCCCEEEEEEEEEEEEe-----cccccC Q lcl|NC_012418. 
793 -MATSKFELSCHSPYDMNVRAVEYNFKSN-----QTYRRV 826 (826) Q Consensus 793 -~~~~~v~i~~~~p~P~tvl~i~~eg~y~-----~r~rrv 826 (826) ....-+..-.++-||.--++=--.|+=- +-..|- T Consensus 897 k~K~~~L~~Dnedvlpvytigdlasgqdvddlvgkwktra 936 (1027) T protein:vir:80 897 KVKHVYLYFDNEDVLPVYTIGDLASGQDVDDLVGKWKTRA 936 (1027) T ss_pred heeeeEEEEcCCcceeeeeeccccCCCchhHhhhhhcccc Confidence 0011111111222222111100000000 000000 No 28 >protein:vir:2625 Length: 715 # NCBI annotation: gp27 # Family: family:all:5234 # MgeID: mge:55 # MgeName: SIO1 # Cross-refs: genbank:acc:NP_064766;genbank:gi:9964636;genbank:GeneID:1263056 Probab=99.17 E-value=8.9e-10 Score=70.20 Aligned_cols=635 Identities=14% Similarity=0.131 Sum_probs=287.1 Q ss_pred Ccc-e-eeechhhhcccccCChhHhcccc-hhhhhcceeeccCCcccCChhHhHhh-----hcCcccccccccEEEEE-- Q lcl|NC_012418. 1 MSY-K-QSAYPNLLMGVSQQVPFERLPGQ-LSEQINMVSDPVSGLRRRSGIELMAH-----LLHTDQPWPRPFLYHTN-- 70 (826) Q Consensus 1 M~~-v-~~s~~n~~gGVSqQ~D~~Ry~~q-~~~~~N~~~~~~gGl~rRpGt~fv~~-----~~~~~~~~~~~~~~~~~-- 70 (826) |++ . +-....|++|.=--.-++-||.- .-.-+||...-.|--+||-|+-|-.. ...+ .++.--.+.+. T Consensus 1 m~~~~~~~~vNtFv~GliTEas~ltfpqnasiDe~N~~l~rdG~r~RR~g~~~E~~~vls~~~vp--~galv~~~~W~na 78 (715) T protein:vir:26 1 MPQSLTQRTVNTFIKGLITEASELTFPENASVDELNCSLGRDGTRRRRKAVTLEDNHVLSDVVVP--EGALVQTLDWYNV 78 (715) T ss_pred CCcccchhHHhhhhhheeeccccccCCccceeeeeeeeecCCCcchhhccceeecceEEEEEeec--Cceeeeeechhhc Confidence 995 3 35678899994444444555543 44568999988888888888754321 1111 11221222222 Q ss_pred eCCCceEEEEEecCCeEEEEECCCCEEEEecC---cccccc-cCC--Ccc--cEEEEEecCEEEEeeCCcceeeeecccC Q lcl|NC_012418. 71 LGGRSIAMLVAQHRGELYLFDERDGRLLMGQP---LVHDYL-KAA--DYR--QLRAATVADDLFIANLSVKPEADRTDVK 142 (826) Q Consensus 71 rd~~e~~~i~~~~~g~irv~d~~~g~~~~~~~---~~~~yl-~~~--~~~--~l~~~~vaD~~fi~n~~~~~~~~~~~~~ 142 (826) -++....|+++.-.--+++|.+.+-++.-... .+..+- ... .|. 
.++++.+..++.|+||..-+-...-+.+ T Consensus 79 ~G~v~~~~livqvg~~l~f~q~t~~pLs~~n~~~svdl~~~~~~vn~SPsh~~v~v~~~~G~livanp~i~~~~~~~d~~ 158 (715) T protein:vir:26 79 AGQVNLEFLVVQVNNILYFYEKSTDPLSANKYSGSVDLNTHSASNNLSPSEERVQVTSLNGYLIVASPAINTFYLGFNTS 158 (715) T ss_pred ccccCcEEEEEEeccEEEEEeccCCccccCceeEEeeecceecccccccceeEEEEEEeeeEEEEecCCccEEEEEecCC Confidence 23445556555432236666544322211110 011110 111 122 5778889999999999887754321111 Q ss_pred CCCCCccEEEEEcccccCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccce Q lcl|NC_012418. 143 GVDPNKAGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEY 222 (826) Q Consensus 143 ~~~~~~~a~~~vr~g~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~ 222 (826) ...-.+. -+-+|.-.+ ...| +.+..+.-|.+...+-+ .+++ .+.+ +.+| T Consensus 159 t~s~t~~-~ll~r~r~f------~~qg------~d~~~g~~y~~~gt~~t------------n~~i----ynly--N~gw 207 (715) T protein:vir:26 159 TEAFTAT-SISFKERDF------EWQG------SDVDVTSLYFGEGTSVS------------NQRI----YDTY--NVGW 207 (715) T ss_pred cceeEee-EEEEEeeeh------eeec------cccccccccccCCcccC------------chhh----eecc--ccee Confidence 1100000 011111100 0111 11111111212111111 1111 1111 1122 Q ss_pred EEeeeeeccceecccccccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccc Q lcl|NC_012418. 223 TLPNSTKKYPKVDPDTAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTP 302 (826) Q Consensus 223 ~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~ 302 (826) ....++ ........|+...+... ..+++ T Consensus 208 ~~p~gt-------------------~~~N~~~~yiVypa~s~------------------------------~~~S~--- 235 (715) T protein:vir:26 208 VGPKGS-------------------AALNTYGSYIVYPALTH------------------------------PWYSG--- 235 (715) T ss_pred ecceeE-------------------EEEcCCCCceEeccccc------------------------------ccCCC--- Confidence 211111 11111111221111100 00000 Q ss_pred eeEEEEEeeeEecCCCccceEEEEEecCCceEEEeecccccccccceeEEEEEecCCCeeEEeecCCcccccCCcccccC Q lcl|NC_012418. 
303 GTGVQFMDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPT 382 (826) Q Consensus 303 g~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~w~E~~~~g~~~~~~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~ 382 (826) +.++..+ +...|.|.. +|..+. +.|-|-+..-. +++ .|. T Consensus 236 ---------------kd~n~af-----sk~ad~ei~-----tGt~~~--------~~G~yi~D~~~-~g~-------~~l 274 (715) T protein:vir:26 236 ---------------KDANGAF-----NKADWLEIY-----TGSSLA--------SNGHYVLDVFN-KAR-------TGL 274 (715) T ss_pred ---------------ccccccc-----Chhhccccc-----cccccc--------cCceEEEeeee-cCC-------ccc Confidence 0000000 000122210 111110 11111100000 000 000 Q ss_pred ccc--cCCCceEEEEEcceEEEec------CCeEEEEecCC--------cccCcccccc--cCCCCccEEEEEcCCCcee Q lcl|NC_012418. 383 FNF--VTRGITGMTTFQGRLVLLS------QEYVCMSASNN--------PHRWFKKSAA--ALNDDDPIEIAAQGSLTEP 444 (826) Q Consensus 383 psf--~g~~~~~v~~~q~RL~~~~------~~~v~~S~~gd--------~~nF~~~s~~--~~~ddD~i~~~~~~~~~~~ 444 (826) -.- .++ +.+++.|.+|.+|++ +..|.+||.=+ |.+=+|++.. .+.|.|..-+.+-+-. . T Consensus 275 eeev~k~R-~rsv~~yaGrV~yagiD~dkng~rilfSqLv~s~~di~nCyQd~DPTsee~~dLidTDGg~iri~gah--~ 351 (715) T protein:vir:26 275 TTEVETGR-FRSVAAYAGRVFYAGIDSAKNGGKVYFSRLTERMSDVGNCYQVNDPTSEVLSDLLDTDGGVVRIPDAH--N 351 (715) T ss_pred hhhhhcCC-CcceeeecceEEEeecccccCCCeEEEehhhcchhhcccccccCCCchhhhhhhhhcCCCEEEecCCC--C Confidence 000 233 456999999999995 45799998633 5555555533 4678899888886653 3 Q ss_pred EEEEeecCCcEEEEecCcEEEEeC-CccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccc Q lcl|NC_012418. 445 YEHAVTFNKDLIVFAKKYQAVVPG-GGIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTD 523 (826) Q Consensus 445 i~~~v~~~~~L~l~t~~~q~~~~~-~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~ 523 (826) |.-++.|+..|+||...+-|+|.| +...|.++..+...++.+|++.=.=+++|+.++|-+++| |..+.-+ +.. 
T Consensus 352 ii~Lv~f~~sLlvf~~NGVWAi~G~d~g~tATdY~ltKIs~vg~sspnSvVvv~~~i~~WsdtG-----Iyal~~N-d~f 425 (715) T protein:vir:26 352 IRKLHVLGASLLVFAENGVWAVAGVDNVFRATEYAITRISDVGLSNENSFVVADGIPIWWGKTG-----IYAVQQS-ENL 425 (715) T ss_pred ceeEEEecceEEEEEecceEEEeccCCceeeeeeEEEEeeeeccCCCccEEEecceEEEeeCCc-----EEEEEec-ccc Confidence 666899999999999999999977 468999999999999999998777799999999999876 7767665 334 Q ss_pred cccchhHHH-HHHHHhcCC----CeEEE--EEcCCCCEEEEEEcCCCeEEEEEEeeCCCcee-----eEeeEeeecC--- Q lcl|NC_012418. 524 SHYVAEDVT-SHIPSYMPG----PAEYI--QAAASSGYLVFGTSTADEMICHQYLWQGNEKV-----QNAFHRWTLR--- 588 (826) Q Consensus 524 ~~~~~~dls-~~~~~~~~~----~v~~~--~~s~~p~~~v~~~~~~g~l~~~tyl~~~~e~~-----v~aW~~w~~~--- 588 (826) +-+.++.|| ..+.+|.+. .+... .|-.-++.+.|+..+..++.=|+| .+-. ..|+-+|..+ T Consensus 426 n~~tAqNLTekTIq~~~~~I~~dk~knVtg~fd~~e~rVyW~yPn~dt~vdyky----d~vLV~dLalgaFYp~~v~~~a 501 (715) T protein:vir:26 426 NTPTAQNLSLSTIQTLWNNISNAKKAQVTVEYDKINQRVFWFYPDNDESVDYKY----NNILVMDLALQAFYPWRVEDEA 501 (715) T ss_pred CcchhhccchHHHHHHHhhcchhhhcceEEEEEccCCEEEEEEcCCceeeceee----cCeEEEEecccccccccccccc Confidence 558999999 788887654 23332 455667889998877777776666 2211 2455555443 Q ss_pred Cc----EEEEE--------------------EE-CCeEEEEEEeC-CCEEEEEEEEeecCCcCCCCcccccceEEEEeec Q lcl|NC_012418. 589 HQ----IIGTY--------------------FT-GDNLMVLIQKG-QEIALGRMHLNSLPAREGLQYPKYDYWRRIEATV 642 (826) Q Consensus 589 g~----v~~~~--------------------~~-~d~l~~vv~R~-~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~ 642 (826) |. |.++. +- ++.+..+..|. ..+-.+...++++...- ......+. 
T Consensus 502 ~~~~~~ig~~~~~~~~~~~t~~~vv~~~~v~~~g~~~~v~~~~r~~~~~~~~~~~~~~~~~~~---------~~~f~~~~ 572 (715) T protein:vir:26 502 SSTSYIIGTSYYGGLGSTSTETQVVNGADVVVNGSDNVVATLYRDYLEGDSEIKLLVRDGTTG---------KMTFATFR 572 (715) T ss_pred cccceeeeeeeeCCcccccchhheeccceEEEeccceEEEEeecccccccceEEEEEEcCCce---------eEEEeccc Confidence 22 11110 00 11111111121 11111222222211110 00011110 Q ss_pred cce-eeecccccCccccccceEEeeeeeeEeeccEEcccEecCCCeEEEeecCCcCCceEEEEEeeeeEEEeCCeEEECC Q lcl|NC_012418. 643 EGE-LELTKQHWDLIKDAPAVYQLQPVAGAFMERYQLGVKRETNTKVFLDVPEAVVGSVYVVGCEFWSKVEFTPPVLRDH 721 (826) Q Consensus 643 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~v~vG~~y~~~~~~~~~~~~~~ 721 (826) +.+ .+|.+ -+.++ ++ ..+.++.|.....+ ...-||..+ +++ -.=++.+. T Consensus 573 ~~~~~dw~s------~d~~~-~~---~~gy~~~gd~~~~k--~~pyvt~~~-----------------~~t-edg~v~~~ 622 (715) T protein:vir:26 573 GDTYLDWGS------ADYKS-FA---EAGYDFMGDITTFK--NAPYVTTYM-----------------RVT-EDGYVASG 622 (715) T ss_pred Cceeeeccc------cchhh-HH---Hhhhhhcccceeee--cCceEEEEE-----------------EEe-cccceecc Confidence 000 01110 01110 00 00111111110000 000111110 000 00011111 Q ss_pred CCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeeeccccccccccccCc-cccccceEEEEeecccceeEEEE Q lcl|NC_012418. 722 NGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGE-PLVDSAVVPLPARVAMATSKFEL 800 (826) Q Consensus 722 ~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~tg~~~~p~~~~~~~~~v~i 800 (826) .|=.-.+...-+-.+..+..+++ ......|.+..+++...+....+ -|-.+-+-+..++|..+-.+++| T Consensus 623 ~g~~p~n~sSclm~~sw~ws~s~----------st~~eaYk~~~~~~~~p~~~s~~~yp~~~VvTKsriRG~Gr~~~~rf 692 (715) T protein:vir:26 623 AGYEFINPSSCLMSVSWNLSKSG----------STPREIYKLKDVPVVNPNDLSSINYPTDTVVTKSKVRGRGRSMKFRF 692 (715) T ss_pred CCccccCCcceEEEEEeeeccCC----------CChhhhheecceeeeCCCccccccCCcceeEeeeeeeccceEEEEEE Confidence 11000011111222222332322 11112233222222222221110 11222334556789999999999 Q ss_pred EECCCCCEEEEEEEEEEEEeccc Q lcl|NC_012418. 
801 SCHSPYDMNVRAVEYNFKSNQTY 823 (826) Q Consensus 801 ~~~~p~P~tvl~i~~eg~y~~r~ 823 (826) .+...-.|+|++.+..|--|+.+ T Consensus 693 ~s~~gKdlhl~Gysilg~~~~~~ 715 (715) T protein:vir:26 693 ESVAGKDFHLVGYEVIGAKNNSY 715 (715) T ss_pred EecCCcceEEEeEEEEecccCCC Confidence 99999999999999999998888 No 29 >protein:vir:95475 Length: 771 # NCBI annotation: hypothetical protein ORF038 # Family: family:all:5234 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294631;genbank:gi:149408197;genbank:GeneID:5237042 Probab=98.34 E-value=1.2e-06 Score=53.09 Aligned_cols=658 Identities=13% Similarity=0.071 Sum_probs=279.2 Q ss_pred Ccce--eeechhhhcccccCChhHhccc-chhhhhcceeeccCCcccCChhHhHh-------hhcCcccccccccEEEEE Q lcl|NC_012418. 1 MSYK--QSAYPNLLMGVSQQVPFERLPG-QLSEQINMVSDPVSGLRRRSGIELMA-------HLLHTDQPWPRPFLYHTN 70 (826) Q Consensus 1 M~~v--~~s~~n~~gGVSqQ~D~~Ry~~-q~~~~~N~~~~~~gGl~rRpGt~fv~-------~~~~~~~~~~~~~~~~~~ 70 (826) |++- +-....|++|.=--.-++-||. ..-.-+||...-.|--+||-|+-|-. ....++.....--.+.+. T Consensus 1 m~~~~~~~~vNtFv~GliTEas~ltfpqnasiDe~N~~l~rdG~r~RR~g~~~E~~~~~vls~~~vpa~g~~~v~~~~W~ 80 (771) T protein:vir:95 1 MAKTTNAAEFNTFVGGLITEASPLTFPQNASIDEVNFILNRDGSRNRRNGMDFENGATKVVCNTLVPADGTIAVTSHNWE 80 (771) T ss_pred CCcccchhHHhhhhhheeeccccccCCccceeeeeeeeecCCCcchhhceeeeecCCceEEEEEEecccceEEeeeechh Confidence 9953 3467889999444444455554 44467899998888888888875532 111111111101122222 Q ss_pred --eCCCceEEEEEecCCeEEEEECCCCEEEEecCcccccccCC---Ccc-cEEEEEecCEEEEeeCCcceeeeecccCCC Q lcl|NC_012418. 71 --LGGRSIAMLVAQHRGELYLFDERDGRLLMGQPLVHDYLKAA---DYR-QLRAATVADDLFIANLSVKPEADRTDVKGV 144 (826) Q Consensus 71 --rd~~e~~~i~~~~~g~irv~d~~~g~~~~~~~~~~~yl~~~---~~~-~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~ 144 (826) -++....|+++.-.--+++|.+.+-++.-... ++.+. .|. .|++..+..++.|+||..-+-...-+.+.. 
T Consensus 81 na~G~v~~~~livqvg~~l~f~q~t~~pLs~~n~----~~~a~~nlSPsh~isv~v~~G~livanp~i~~~~~~~d~~t~ 156 (771) T protein:vir:95 81 NAGGEVGRWISLVQVGTELKFFQTTGETLSEGNF----YNYQFVNMSPSHKLSYAVVDGLLVVANGSRDIYVFEYDSGSV 156 (771) T ss_pred hcccccCcEEEEEEeccEEEEEecCCCcccccce----eeeecceeccceeEEEEEeeeEEEEecCCccEEEEEecCCcc Confidence 23445556555432236666543323321111 11111 122 488888899999999988765422111111 Q ss_pred CCCc-cEEEEEcccccCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceE Q lcl|NC_012418. 145 DPNK-AGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYT 223 (826) Q Consensus 145 ~~~~-~a~~~vr~g~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~ 223 (826) .-.+ .-++.-|.+=- .| .+|. .+..+.-|.+...+- +.+++ .+.++ .+|. T Consensus 157 s~t~~~ll~r~rf~~q--~~---~~G~------d~~~~~~~~~~gt~~------------tn~~i----ynlyN--~gw~ 207 (771) T protein:vir:95 157 SVTTKRLLVRDLFGVQ--DI---VNGV------DLRQGNDIATRPTVQ------------TNAHI----YNLRN--QTFG 207 (771) T ss_pred eeEeeeeeeeehhhcc--cc---cccc------ceecccccccCCccc------------Cchhh----eeccc--ccee Confidence 0000 00111111000 00 0000 011111111111110 01111 11111 1111 Q ss_pred EeeeeeccceecccccccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccce Q lcl|NC_012418. 224 LPNSTKKYPKVDPDTAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPG 303 (826) Q Consensus 224 ~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g 303 (826) ..-. +........+ .+...-. +....|++.. T Consensus 208 ~pk~-------------------~~~snt~~~~------------------------iV~~y~a----~~g~~pS~sd-- 238 (771) T protein:vir:95 208 VPRV-------------------TWHSNEPSDP------------------------IVTFRSA----ASGKFPSNSD-- 238 (771) T ss_pred cccc-------------------ccccCCcccc------------------------ceEeeec----cCCCCcCCce-- Confidence 0000 0000000000 0000000 0111111100 Q ss_pred eEEEEEeeeEecCCCccceEEEEEecCCceEEEeeccc--ccccccceeEEEEEecCCCeeEEeecCCcccc-cCC-ccc Q lcl|NC_012418. 
304 TGVQFMDGAVMATGSTKAPVYFEWDSANRRWAERAAYG--TDWVLKKMPLALRWDEATDTYSLNELDYDRRG-SGD-EDT 379 (826) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~w~E~~~~g--~~~~~~tmp~~~~~~~~~~~f~~~~~~w~~r~-~gd-~~t 379 (826) .++ .. .+...|+|...-. ..++....|..- ...+.|-|-+.+-.-.... --- ..+ T Consensus 239 -~~N---~a----------------~~k~~~~Ei~t~~~f~~~~~~~~~~Gt-~~~~~G~yi~da~~~g~~~Lt~~ve~~ 297 (771) T protein:vir:95 239 -SVN---LA----------------LSKRADVEPSTTDRFRAEDIVLNPIGT-YETARGFFIIDAMARGKSRLEEIVKLK 297 (771) T ss_pred -eec---cc----------------cchhhccceeeecccchhhhhhcccCc-ccccCcceeeehhhhcccccceeeecc Confidence 000 00 0011122222100 000000000000 0111222211111000000 000 113 Q ss_pred ccCcccc-----------CCCceEEEEEcceEEEec-----------C----CeEEEEecCC--------cccCcccccc Q lcl|NC_012418. 380 NPTFNFV-----------TRGITGMTTFQGRLVLLS-----------Q----EYVCMSASNN--------PHRWFKKSAA 425 (826) Q Consensus 380 np~psf~-----------g~~~~~v~~~q~RL~~~~-----------~----~~v~~S~~gd--------~~nF~~~s~~ 425 (826) .|+|+.. -+..+.|+=|-.|.|+++ + .+|.+||.=| |.+=+|++.. T Consensus 298 gr~~s~~~~~~~l~~~~t~~~~~~vaeyagRvwYag~~~~~iD~dkng~~~~~~ilfSqLv~s~~di~nCyQd~DPTsee 377 (771) T protein:vir:95 298 QRYPSLSFGVSSLPQDETPGGASVVCEYAGRVWYAGFSGQIIDGDDQSPRLVSYILFSQLVDSPADIVNCYQDGDPTSTE 377 (771) T ss_pred ccchhhhccccccccccCCCCceeEEeeeeeEEEecceeEEeeccccCCceeeeEeeehhhcchhhcccccccCCCchhh Confidence 3444321 123466889999999886 1 1489998633 5555555533 Q ss_pred --cCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeC--CccccccceEEEEEEeecccCCCCcEEeCCeEE Q lcl|NC_012418. 426 --ALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPG--GGIVTPRTAVISITTQYDLDTRAAPAVTGRSVY 501 (826) Q Consensus 426 --~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~~--~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~ 501 (826) .+.|.|..-+.+-+-. 
.|.-|+.|+..|+||...+-|+|.+ +...|.++..+...++.+|++.=.=+++|+.++ T Consensus 378 ~~dLidTDGg~iri~gah--~ii~Lv~f~~sLlvfc~NGVWAi~ggsd~g~tAtdY~ltKIs~vg~sspnSvVvvg~~i~ 455 (771) T protein:vir:95 378 EPELVDTDGGFIRIEGAH--DIINLVNVGSAVMVVAANGIWMIQGGSDYGFTATNYLVTKISEHGCSSPNSVVVVDNSFM 455 (771) T ss_pred hhhhhhcCCCEEEecCCC--CceeEEEecceEEEEEecceEEEEeccCCceeeeeeEEEEeeeeccCCCccEEEecceEE Confidence 4678899888886653 3666899999999999999999954 458999999999999999998777799999999 Q ss_pred EEecCCCceeEEEEEeeccccccccchhHHH-HHHHHhcCC----CeEEE--EEcCCCCEEEEEEcC--CCe---E--EE Q lcl|NC_012418. 502 FAAERALGFMGLHEMAPSPSTDSHYVAEDVT-SHIPSYMPG----PAEYI--QAAASSGYLVFGTST--ADE---M--IC 567 (826) Q Consensus 502 f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls-~~~~~~~~~----~v~~~--~~s~~p~~~v~~~~~--~g~---l--~~ 567 (826) |-+++| |..+.-+. .+-+.++.|| ..+.+|.+. .+... .|-.-++.+.|+..+ |++ + ++ T Consensus 456 ywsdtg-----Iyal~~Nd--fn~~tAqnLTekTIq~~~~~I~~dk~knVtg~fd~~e~rvyw~yPn~~D~~~e~~t~LV 528 (771) T protein:vir:95 456 YWGDDG-----IYHLTRNQ--YGDYVANNLTEKTIQKYYEKIPSDAILNATGFYDSYDKKVKWLYNTVLDGRTEPVTELV 528 (771) T ss_pred EeeCCc-----eEEEeecc--cCcchhhccchHHHHHHHhhcchhhhcceEEEEEccCCEEEEEecceecCCCcceeeee Confidence 999876 77776664 4458999999 788887654 22222 345556677776442 111 1 22 Q ss_pred EEEeeCCCceeeEeeEee---e-cCCcE----EEEEE-------ECCeEE----------------EEEEeCCCEEEEEE Q lcl|NC_012418. 568 HQYLWQGNEKVQNAFHRW---T-LRHQI----IGTYF-------TGDNLM----------------VLIQKGQEIALGRM 616 (826) Q Consensus 568 ~tyl~~~~e~~v~aW~~w---~-~~g~v----~~~~~-------~~d~l~----------------~vv~R~~~~~~~r~ 616 (826) |. -...|+-+| + .+|.. .++.. 
.+.++- ..+|-..-+-++.+ T Consensus 529 ~d-------LalgaFYp~~i~~~~ag~l~~~vg~~~~p~~~lv~T~~eV~v~~~~v~~tG~~vtV~~~~r~~~~~~~~y~ 601 (771) T protein:vir:95 529 FD-------LALGAFYPSKIGSLTAGRLPIPVGSVKIPPYKLVETGEEVTVASEQVTATGELVTVKVSTRSPVIRETKYI 601 (771) T ss_pred ee-------ecccccccccccccccCccceeeeeeecCccccccccceEEecceeeEecCCceEEEEEEeeccccceEEE Confidence 22 123466666 3 23322 11100 011111 11111111112222 Q ss_pred EEeecCCcCCCCcccccceEEEEeeccceeeecccccCccccccceEEeeeeee-EeeccEEcccEecCCCeEEEeecCC Q lcl|NC_012418. 617 HLNSLPAREGLQYPKYDYWRRIEATVEGELELTKQHWDLIKDAPAVYQLQPVAG-AFMERYQLGVKRETNTKVFLDVPEA 695 (826) Q Consensus 617 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~g~~~l~~~~~ 695 (826) ..+++ ...+........++.+-+-+..-..+.-.. .+.-++..+ + T Consensus 602 ~~~~d-------------------g~~g~~~Fa~~~~~~f~DW~sv~~~~vdy~sy~~~gY~~~--------------g- 647 (771) T protein:vir:95 602 IVEKL-------------------SSPMRISFGGYTDEEFVDWKSVDGIGVDAPAYLLTGYLAG--------------G- 647 (771) T ss_pred EEEec-------------------CCCeeEEeccccCcceeecccCCCcccchHHHHHhhhhcc--------------c- Confidence 11111 111111110000111000000000000000 000011111 1 Q ss_pred cCCceEEEEEeeeeEEEe-----CCeEEECCCCceee-ecceEEEEEEEEeeccceEEEEecCCCCCceeeeeecccccc Q lcl|NC_012418. 696 VVGSVYVVGCEFWSKVEF-----TPPVLRDHNGLPMT-SARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLF 769 (826) Q Consensus 696 ~~~~~v~vG~~y~~~~~~-----~~~~~~~~~g~~~~-~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~ 769 (826) ..+|..+--++++ -.=++.+..|...- +...-+-.+..+...++. .++-...+..|.+...++. T Consensus 648 -----d~~~~k~~PYit~y~~~tedg~v~~~~g~~~p~n~sSclm~~sw~ws~s~~-----t~k~~~~~eaYk~~~~~~p 717 (771) T protein:vir:95 648 -----DYQREKFVPYITFHFKKTEDGFVEDAEGDWTPTNQSSCMVQSQWSWTNSPA-----SNKWGRTWQAYRFRRHFFP 717 (771) T ss_pred -----hheeeeccceEEEEEEeecccceecccccccccCCcceEEEEEeeeecCCC-----CCccccchheeeecceecc Confidence 1111111111111 00111122221110 011112223333333331 1222222333433322221 Q ss_pred ccccccCccccccce--EEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEeccc Q lcl|NC_012418. 
770 SRQLNAGEPLVDSAV--VPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKSNQTY 823 (826) Q Consensus 770 ~~~~~~~~~~~~tg~--~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y~~r~ 823 (826) . + .....-..... -+..++|..+-.+++|.+...-.|+|++.++....|-.. T Consensus 718 ~-~-~~~~~yp~~~VV~TKsriRG~Gr~~~~rf~s~~gKdlhl~Gysil~~~~~~~ 771 (771) T protein:vir:95 718 D-N-IDNQFDDGNSVVETKSRLRGSGKVLSLYITTEPKKNLHIYGWSMLVDVNGTV 771 (771) T ss_pred C-C-cchhcCCccceeeeeheeeecceEEEEEEEecCCcceEEEeEEEEEeecCcC Confidence 1 1 11111111222 344678889999999999999999999999999888776 No 30 >protein:vir:8837 Length: 513 # NCBI annotation: constituent protein # Family: family:all:4957 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775245;genbank:gi:27476043;genbank:GeneID:2700591 Probab=98.01 E-value=7.1e-06 Score=48.81 Aligned_cols=475 Identities=10% Similarity=0.007 Sum_probs=184.0 Q ss_pred ecccceEEeeeeeccceeccc-ccccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeecccccccc Q lcl|NC_012418. 217 FGAPEYTLPNSTKKYPKVDPD-TAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPAL 295 (826) Q Consensus 217 ~s~~~~~~~~~t~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~ 295 (826) .+.....+-.....+...+|. +....-+...+.....+.+... .+......+-+.....+-.. . T Consensus 1 ~~~~~~~~~~~~g~~~d~~p~~lp~~a~s~~~N~~~~~~~~~~~---~g~~pv~a~~~~~~~g~~~~---~--------- 65 (513) T protein:vir:88 1 MALERQEVKNPTGIVTDIAPADLPLDKWSFGNNVRFKNGKAQKA---LGHSPIFDTAQAPILDMFPF---I--------- 65 (513) T ss_pred CCcCChhhcccccceeccChhhcCCCcceeeeeeeEecceeeec---CccceeeecCCCCceeeeee---e--------- Confidence 111111111111111111211 1110001111111111111110 00000000000000000000 0 Q ss_pred ccCCccceeEEEEEeeeEecCCCccceEEEEEecCCceEEEeecc------cccccccceeEEEEEecCCCeeEEeecCC Q lcl|NC_012418. 296 LPGAGTPGTGVQFMDGAVMATGSTKAPVYFEWDSANRRWAERAAY------GTDWVLKKMPLALRWDEATDTYSLNELDY 369 (826) Q Consensus 296 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~w~E~~~~------g~~~~~~tmp~~~~~~~~~~~f~~~~~~w 369 (826) ..|..+.+. .+ ...|..+++. .|.-..+. ...|.+.++--.++..+.+..... 
.+- T Consensus 66 -----~~g~~~~~~-------~~--~~~~~~~~~~--t~~dvs~~~~~~~~~~~w~~~~f~~~i~a~ng~~~~q~--~~~ 127 (513) T protein:vir:88 66 -----RNNIPYWLL-------CS--EKRLYLADGT--TIIDVSPGPYSASVTNRWSVGSFNGVIFANDGVNPPHH--LPP 127 (513) T ss_pred -----cCCCeEEEE-------ee--ceEEEEecCc--eeeeccccceeecccCceeeeeecCEEEEEcCCCcceE--EcC Confidence 000010000 00 0111111111 12111000 001111110001111222221111 110 Q ss_pred cccccCCcccccCccccCCCceEEEEEcceEEEec--------CCeEEEEecCCc----ccCcccccccCCCCccEEEEE Q lcl|NC_012418. 370 DRRGSGDEDTNPTFNFVTRGITGMTTFQGRLVLLS--------QEYVCMSASNNP----HRWFKKSAAALNDDDPIEIAA 437 (826) Q Consensus 370 ~~r~~gd~~tnp~psf~g~~~~~v~~~q~RL~~~~--------~~~v~~S~~gd~----~nF~~~s~~~~~ddD~i~~~~ 437 (826) ......|-..+| + ...-..|.+|++||++++ |+.|+.|..+|. ..|.... ...+.+=.++ T Consensus 128 ~s~~f~dl~g~p--~--~~~a~~i~v~~~flv~~~~t~~~~~~PnrV~wS~~~D~~~~P~~W~~t~--~t~~a~~~~l-- 199 (513) T protein:vir:88 128 TESVFRVLPNFP--A--NTTFRRLKSFKNFLIGLNVTSNSIEMPQMVWWSTSADAGGVPASWDPTD--PTKDAGQNTL-- 199 (513) T ss_pred CCceeeeccCCC--c--ccceEEEEEEeeEEEEeecccCcCCCCceEEEecccCCccccccccccc--ccCccccccc-- Confidence 001111111222 1 114466889999999975 678999999996 4443221 1122222222 Q ss_pred cCCCceeEEEEeecCCcEEEEecCcEEEEe-CCccccccceEEEEEEeeccc-CCCCcEEeCCeEEEEecCCCceeEEEE Q lcl|NC_012418. 438 QGSLTEPYEHAVTFNKDLIVFAKKYQAVVP-GGGIVTPRTAVISITTQYDLD-TRAAPAVTGRSVYFAAERALGFMGLHE 515 (826) Q Consensus 438 ~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~-~~~~lTP~~~~~~~~s~~~~~-~~~~Pv~vg~~v~f~~~~g~~~~~v~e 515 (826) .+....|...++....|+||++.+-|.++ .++ |....++....-.|. +.-.=+.+|+.+||++++| ++. 
T Consensus 200 -~d~~g~~v~g~~~g~~liif~e~~i~~m~y~g~---~~if~~~~i~~~~G~~~p~SI~~~~~~~ffls~~G-----f~~ 270 (513) T protein:vir:88 200 -ADTNGAIVDGVKLRDSFIIYKEDSVYSMRYIGG---LYIFQFQQLFNDVGILGPNCAIEFDGNHFVVGHGD-----VYV 270 (513) T ss_pred -CCCccceeeeeecccceEEEecccEEEEEecCC---CceEEEEeecccccccCCceeEEECCeEEEEeCCc-----eEE Confidence 34445566677888899999999999996 322 334455554433333 2223367999999999865 542 Q ss_pred EeeccccccccchhHHHHHHHHhcCC-----CeEEEEEcC-CCCE-EEEEEcC-C-------CeEEEEEEeeCCCceeeE Q lcl|NC_012418. 516 MAPSPSTDSHYVAEDVTSHIPSYMPG-----PAEYIQAAA-SSGY-LVFGTST-A-------DEMICHQYLWQGNEKVQN 580 (826) Q Consensus 516 ~~~~~~~~~~~~~~dls~~~~~~~~~-----~v~~~~~s~-~p~~-~v~~~~~-~-------g~l~~~tyl~~~~e~~v~ 580 (826) + ...+ .+.- ....+++.|-. ....+...- ..+. +.|+-.+ + .++++|-|+ + + T Consensus 271 ~--~G~~---~~~I-g~ekVdk~f~~~~n~~~~~~~~~~~d~~~~~v~~~y~s~~~~~~~~~~~~lVYd~~----~---~ 337 (513) T protein:vir:88 271 H--NGVQ---KQSV-IDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSEPGKHCDRAIIWNWK----E---N 337 (513) T ss_pred e--cCce---eeec-ccchhhhhhhccCCcccceEEEEEEcCcccEEEEEecCCCCCCCcccceEEEEEcc----C---C Confidence 2 2111 1110 11234443322 222222222 2233 3343111 1 356777774 2 3 Q ss_pred eeEeeecCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEeecCCcCCCCcccccceEEEEeeccceeeecccccCcccccc Q lcl|NC_012418. 581 AFHRWTLRHQIIGTYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEATVEGELELTKQHWDLIKDAP 660 (826) Q Consensus 581 aW~~w~~~g~v~~~~~~~d~l~~vv~R~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 660 (826) .|+.-+.+..+-.+..+-+.+............. ...++.. .+ ....... T Consensus 338 ~Ws~~~~p~~~~g~~g~~~~~~~~~~~~~~~~~d--------------~~~~~~~--~~----~~~~~~~---------- 387 (513) T protein:vir:88 338 TWSIRDLPNVLSGAYGIIDPKTSNLWDDDSNPWD--------------TDTSVWG--EG----SYNPAKS---------- 387 (513) T ss_pred eEEEEeccchhhcccccccccccceecccccccc--------------cchhhhh--cc----ccccccc---------- Confidence 5765555544333322211111111111000000 0000000 00 0000000 Q ss_pred ceEEeeeeeeEeeccEEcccEecCCCeEEEeecCCcCCceEEEEEeeeeEEEeCCeEEECCCCceeeecc-eEEEEEEEE Q lcl|NC_012418. 
661 AVYQLQPVAGAFMERYQLGVKRETNTKVFLDVPEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSAR-AVLHRYNVN 739 (826) Q Consensus 661 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r-~~i~~~~~~ 739 (826) .+.. ....++... .++.- .-.-|-++++.++.....+.+. ++ .+|+++... T Consensus 388 sl~~-----~~~~~~~~~----------~fd~~------~~f~G~~lea~~~t~~~~~~~~-------~~~~~i~~v~~~ 439 (513) T protein:vir:88 388 SMIF-----TSFQDAKLF----------LFGET------STFSGQSFTSTLERSDIYLGDD-------RMMKTVSAVIPH 439 (513) T ss_pred eeEe-----eeccCCcee----------eeccc------ccccCCceEEEEEecCccccCc-------hhheeeeeeeee Confidence 0000 011111111 01100 1145778888888665544221 23 357777777 Q ss_pred eeccceEEEEecCCCCCceeeeeeccccccccccccCccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEE Q lcl|NC_012418. 740 FGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLVDSAVVPLPARVAMATSKFELSCHSPYDMNVRAVEYNFKS 819 (826) Q Consensus 740 ~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~y 819 (826) +...|.+.+.++......... .+... ..-...++..++.+...+..+++|+...--|+++.++++|..= T Consensus 440 ~t~~g~~t~~vg~~~~~~~~~------~~s~~-----~~~~~~~~~~~~~r~~gRy~~~ri~i~~~~~w~~~G~~ve~~~ 508 (513) T protein:vir:88 440 ITGNGVCNIWVGNAQVQGSGI------RWKGP-----YPYRIGQDYKIDTKHVGRYIALKFDFASAGDWYFNGYTLEMAP 508 (513) T ss_pred eecceEEEEEEeeeccCcccc------ccccc-----eeeecccCceEEeccCCceEEEEEEccCCCceEEeeEEEEEec Confidence 777787777766544332211 11100 0001123455677777888888998888999999888887653 Q ss_pred ecc-ccc Q lcl|NC_012418. 
820 NQT-YRR 825 (826) Q Consensus 820 ~~r-~rr 825 (826) | .|| T Consensus 509 --~~g~R 513 (513) T protein:vir:88 509 --KAGMR 513 (513) T ss_pred --CCCCC Confidence 3 444 No 31 >protein:vir:3133 Length: 911 # NCBI annotation: hypothetical protein # Family: family:all:5234 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640315;genbank:gi:21234408;genbank:GeneID:956056 Probab=95.50 E-value=0.002 Score=35.44 Aligned_cols=669 Identities=14% Similarity=0.141 Sum_probs=250.0 Q ss_pred Ccceeeechhhh--cccccCChhHhcccc-hhhhhcceeeccCCcccCChh-------HhHhhhcCccccccccc--EEE Q lcl|NC_012418. 1 MSYKQSAYPNLL--MGVSQQVPFERLPGQ-LSEQINMVSDPVSGLRRRSGI-------ELMAHLLHTDQPWPRPF--LYH 68 (826) Q Consensus 1 M~~v~~s~~n~~--gGVSqQ~D~~Ry~~q-~~~~~N~~~~~~gGl~rRpGt-------~fv~~~~~~~~~~~~~~--~~~ 68 (826) |+.-.+....|+ +|---.-.+..|... .-..+||=...+|=.+||-|. +|+......++ +++. +.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~ 78 (911) T protein:vir:31 1 MAARKGAVNRFTPVRGWVTEGNLANYGQDVALDVENMDIEKTGLTQRRFGLFAETSSEQFLSTFTATAR--ARGLLAVKE 78 (911) T ss_pred CccccccccccccceeeeecCchhhcCceeEeeeccccchhcccchhheeeeeccchhhhhhhhhhhhh--hcceeehhh Confidence 887666666654 342222244444332 345788888777777888775 34433222221 1111 000 Q ss_pred EE--eCCCceEEEEEecCCe-EEEEECCCCE-----EEEecCcccccccCCC-----c-ccEEEEEecCEEEEeeCCcce Q lcl|NC_012418. 69 TN--LGGRSIAMLVAQHRGE-LYLFDERDGR-----LLMGQPLVHDYLKAAD-----Y-RQLRAATVADDLFIANLSVKP 134 (826) Q Consensus 69 ~~--rd~~e~~~i~~~~~g~-irv~d~~~g~-----~~~~~~~~~~yl~~~~-----~-~~l~~~~vaD~~fi~n~~~~~ 134 (826) +. -++.+.-|+ +++.|+ +.|-. ++.+ .+..- ..|.+.- . 
+...+.---.+..|.||...| T Consensus 79 ~~~~~~~~~~~~~-~~~~~~~~~~~~-~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (911) T protein:vir:31 79 WREAWGDKDVNML-IFHAGYKVHVVQ-DTAPLRDANILLTI----DLLEAGIKLDGVIDSPVHISVGVGFAIITNPRIEP 152 (911) T ss_pred HHHhhCCCcceEE-EEecCcEEEEEe-cccCccccceEEEe----eeeccCceeeeeecCceeEEeeceEEEeecCccce Confidence 10 112223333 334453 22221 1111 11100 1111110 0 001111112455667777666 Q ss_pred eee-ecccCCCCCCccEEEEEcccccCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhh Q lcl|NC_012418. 135 EAD-RTDVKGVDPNKAGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLY 213 (826) Q Consensus 135 ~~~-~~~~~~~~~~~~a~~~vr~g~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~ 213 (826) ... .+.... +++- .-.|- ..++.|.... -..+|++-..-+... .+.+-- T Consensus 153 ~~~~~~~~~~-----~~~~---~~~~~-~~~~~~~~~~--------~~~~~~~~~~~~~~~-------------~~~~~~ 202 (911) T protein:vir:31 153 VLIKLDDVDD-----EGVP---TLSYE-PLTLLIRTRE--------LLTPYTTGTNYGDTL-------------TPEEEW 202 (911) T ss_pred EEEEeeccCc-----cCcc---ccccc-ceeeEeeehh--------hccccccccccCccc-------------Cchhhc Confidence 431 111100 0000 00010 1111111100 011222211000000 011101 Q ss_pred heeecccceEEeeeeeccceecccccccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeecccccc Q lcl|NC_012418. 214 GKFFGAPEYTLPNSTKKYPKVDPDTAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLP 293 (826) Q Consensus 214 ~~~~s~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~ 293 (826) +.++ .+|.... + .+.+..|+++ ...+.||---+-. T Consensus 203 ~~~~--~~~~~~~--------~--------------------------------~~~~~~~~~~---~~~~~~~~~~~~~ 237 (911) T protein:vir:31 203 NLYN--SGWATIT--------R--------------------------------ATKDKSGSGT---VYVNPVQYYFDKR 237 (911) T ss_pred cccc--ccceeee--------e--------------------------------ecccCCccce---EEEchhheeeccc Confidence 1111 1221100 0 0001111111 0001111000001 Q ss_pred ccccCCc----------------------cceeEEEEEeeeEecC-C-CccceEEEEEecCCceEEEeecccccccccce Q lcl|NC_012418. 
294 ALLPGAG----------------------TPGTGVQFMDGAVMAT-G-STKAPVYFEWDSANRRWAERAAYGTDWVLKKM 349 (826) Q Consensus 294 ~~~~~~~----------------------~~g~~~~~~~~~~~~~-~-~~~~~~y~~~~~~~~~w~E~~~~g~~~~~~tm 349 (826) ...|++. +.-++.++ +...+. | -.-..||+ +.+..- . .+... T Consensus 238 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~--~~~~~~-----~----~~~~~- 303 (911) T protein:vir:31 238 GVYPSHSVLYNSMKQESAKEIVALNVFSPWADEKINF--GTTTPPLGRYIHSAYYF--DSAAIL-----S----LGIGN- 303 (911) T ss_pred CcCcchhhhhhhhhhhccceeEEEeeecccccccccc--ccCCCchhhhhhhheee--ccceee-----e----ecccc- Confidence 1111110 00000000 000000 0 00112221 111000 0 00000 Q ss_pred eEEEEEecCCCeeEEeecCCccc-ccCCcccccCc-------------------cccCCCceEEEEEcceEEEec----- Q lcl|NC_012418. 350 PLALRWDEATDTYSLNELDYDRR-GSGDEDTNPTF-------------------NFVTRGITGMTTFQGRLVLLS----- 404 (826) Q Consensus 350 p~~~~~~~~~~~f~~~~~~w~~r-~~gd~~tnp~p-------------------sf~g~~~~~v~~~q~RL~~~~----- 404 (826) |....++|+- +.+ .+.+..+||.. .-+...|+|++||.+|++|+. T Consensus 304 ---~~~~~~~~~~-------~~~~p~~~e~~np~gl~~igt~~n~k~~a~~~~~~~~~~r~r~~~~yaGRVfyaD~dkng 373 (911) T protein:vir:31 304 ---LTPPTSDGTT-------EGSGPAEEEISNPIGLDNIGTVNNLKLIAEGTVRWTVKDRPRCSGYHNGHVYFGDRDKNG 373 (911) T ss_pred ---cCCCCCCCcc-------CCCCCchhhhcCCCCcccccchhceeeeeccceeeeecccccceeeeccEEEEeeeccCc Confidence 0000011110 000 01122344421 011235789999999999995 Q ss_pred CCeEEEEecC--------CcccCcccccc--cCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCC--ccc Q lcl|NC_012418. 405 QEYVCMSASN--------NPHRWFKKSAA--ALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGG--GIV 472 (826) Q Consensus 405 ~~~v~~S~~g--------d~~nF~~~s~~--~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~~~--~~l 472 (826) ...|.+|+.- +|++=++.+.. .+.|.|-.-+.+.. ...|+-+|.+++.|++|..++.|.|.|. ... 
T Consensus 374 k~rIlFSqLv~sl~di~nCYQdaDPTSeee~DLIdTDGg~vri~g--ah~Ii~LV~~G~sLlVFcaNGVWAI~G~d~~g~ 451 (911) T protein:vir:31 374 KTRILVSQLVNSLDNIPKCFQDADPTAEEINDLIATDGFTMYPVG--MGAPITMVEFNKRLLLLCTNGVWAIRGTSGGGA 451 (911) T ss_pred ceeEEEEeeccccccccccccCCCccccccchhhhcCCcEEecCC--CCCceEEEEecCeEEEEEeCcEEEEeccCCCce Confidence 3479999874 36666665533 24467887777654 4568889999999999999999999774 479 Q ss_pred cccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHH-HHHHHhcCC----CeEEE- Q lcl|NC_012418. 473 TPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVT-SHIPSYMPG----PAEYI- 546 (826) Q Consensus 473 TP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls-~~~~~~~~~----~v~~~- 546 (826) |.++..|...+..+|++.=.=|++|+.++|-+++| |..+...+..+ +.++.+| ..+..|.+. .|+.. T Consensus 452 TATdy~ItKIsdvGcsspNSVVvVgn~i~fWSd~G-----IyaLganqfnD--~tAnNLTesTIQ~y~d~I~~dkIkNVt 524 (911) T protein:vir:31 452 TATDFTLDKVASVEFNSPQSVVDIGTAIVFWSERG-----IIAIGVNDFGD--LTSNNLTENTIDEYYDSLDRDIIKNVK 524 (911) T ss_pred eeeeeEEEEEeeeeeCCCCeEEEecCceEEeeCCc-----EEEEeecccCc--cccccccHHHHHHHHhhcChhhhceEE Confidence 99999999999999998777799999999999876 66666665444 6788888 567777653 23332 Q ss_pred -EEcCCCCEEEEEEcC---CCeEEEEE----EeeCCCceeeEeeEeeecC-CcEEEEEEE-----CCeE---------EE Q lcl|NC_012418. 547 -QAAASSGYLVFGTST---ADEMICHQ----YLWQGNEKVQNAFHRWTLR-HQIIGTYFT-----GDNL---------MV 603 (826) Q Consensus 547 -~~s~~p~~~v~~~~~---~g~l~~~t----yl~~~~e~~v~aW~~w~~~-g~v~~~~~~-----~d~l---------~~ 603 (826) .|-..++.+.|+..+ ..+.+.+. +.. .-...+|-+|... |.++..-.. 
+.++ ++ T Consensus 525 gtyd~de~rVyW~yPn~lDe~teykt~~~~ILVf---dLatgaFYPwtvs~gpLl~~p~y~Lv~TreEvtvPi~~etgai 601 (911) T protein:vir:31 525 GTFINDENRVYWVVPNKQDSNGEYKTDGELVLVL---NLDTGGFYKHTVSGGPLLHAPFRRLVNTRAEVSIPITETDGTV 601 (911) T ss_pred EEEEccCCEEEEEecCccCCccceeecCceEEEE---EeccCcccceeeecceeecccccccccccccceeeEEeecceE Confidence 344566778887552 22343321 100 1123588888663 444321111 1111 12 Q ss_pred EEEeCCC-EEEEEEEEeecCCcCCCCcccccceEEEEeeccceeee-cccccCccccccceEE----------------- Q lcl|NC_012418. 604 LIQKGQE-IALGRMHLNSLPAREGLQYPKYDYWRRIEATVEGELEL-TKQHWDLIKDAPAVYQ----------------- 664 (826) Q Consensus 604 vv~R~~~-~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~----------------- 664 (826) |++...+ ....+... .+.-..+-+....+....+..+. -...++++-+-..... T Consensus 602 Ive~gsdPV~~tl~vd-------ttGvDg~ayLl~frdg~~g~~~f~a~~~~~~~~dw~~~~~~~~~~y~s~~~~~y~~~ 674 (911) T protein:vir:31 602 ITDTLGDPVTVTRTVT-------TTGVDGLAYFASFDDGVNGQFNFIAEHQPWGFADWANVPNMTRVNYSSYVDFAYEYP 674 (911) T ss_pred EEecCCCCeEEEEeee-------cccccceeEEEeeccCCcceEEEEEeecCCeeeccccCccccccchhHHHHhhhhhh Confidence 2222211 11111100 00000000000011111111111 0001111100000000 Q ss_pred ----eeeeeeEeeccEEcccEecCCCeEEEeecCCcCCceEEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEEEE- Q lcl|NC_012418. 665 ----LQPVAGAFMERYQLGVKRETNTKVFLDVPEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYNVN- 739 (826) Q Consensus 665 ----~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~~~- 739 (826) ......+++..++-+.+..+.+-++ ++ .|+.|- +.|.-.+..-|.++.|++.+. T Consensus 675 ~~~~~~~~~pyi~sy~~~~~rv~~~~y~~---------~~--a~~~f~----------~~~~~~~~~~~~~~~~~~~~~~ 733 (911) T protein:vir:31 675 EVMIGNISLPYIHSYYLTGIRVQTEQYTT---------ET--AHLSFH----------RVQAHQTTALGTVTFHKVDMMV 733 (911) T ss_pred hhhhhcccCceeeeeeeeeeEEeccceee---------ec--ccceeE----------eeecccceeeeeeeeeeeeehh Confidence 0000001111111111111111000 00 011100 000000111112223332221 Q ss_pred -------------eeccceEEEEecCCCCCceeeeeeccccccccccccCc-cccccceEEEEeecccceeEEEEEEC-- Q lcl|NC_012418. 
740 -------------FGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGE-PLVDSAVVPLPARVAMATSKFELSCH-- 803 (826) Q Consensus 740 -------------~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~tg~~~~p~~~~~~~~~v~i~~~-- 803 (826) +.++-...+ |+ .+. ...+++|..+...+..+.. -|+..|.+=|-..+ .+.-.+.|+ T Consensus 734 ~~~~~~~~~~~~~~~~~~~~~v-VN-GDA---E~GtmTGWtvtaG~~d~~Ta~p~~rGSyfFa~~n---n~n~aL~QDID 805 (911) T protein:vir:31 734 STGMQVISFHKDDLLRTEAVTL-VN-PDA---ETGDATGWTVTAGTLDVRTAAPLYQGSYYFWSDS---NANFAAYQDID 805 (911) T ss_pred hccceeeeeccccceeeeeeEE-Ec-CCC---CCCCCCcceeeccchhhccCCchhcceEeEcCCC---Ccchhhheecc Confidence 112222222 11 111 1233344333333332222 24556665553211 222223332 Q ss_pred ---------CCCCEEEEEEEEEEEEeccc--cc--C Q lcl|NC_012418. 804 ---------SPYDMNVRAVEYNFKSNQTY--RR--V 826 (826) Q Consensus 804 ---------~p~P~tvl~i~~eg~y~~r~--rr--v 826 (826) .-+|.++. +|-+.|-.+. -| | T Consensus 806 SagaaaIDAG~v~ynvS--awl~gyAaqnd~Dr~~l 839 (911) T protein:vir:31 806 PVGGGYITAGELANNVI--EAKLSWAARGNTDLGTV 839 (911) T ss_pred ccccceeeeccchhhhh--hhhhhhccCCCCccceE Confidence 23344443 3777775552 44 3 No 32 >protein:vir:105563 Length: 396 # NCBI annotation: hypothetical protein # Family: family:all:27455 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164316;genbank:gi:56692963;genbank:GeneID:3197174 Probab=94.62 E-value=0.0039 Score=33.78 Aligned_cols=375 Identities=12% Similarity=0.077 Sum_probs=149.7 Q ss_pred Ccceeeechhhhcc--cccCChhHh----cccchhhhhcceeeccCCcccCChhHhHhhhcCccccccc---ccEEEEEe Q lcl|NC_012418. 1 MSYKQSAYPNLLMG--VSQQVPFER----LPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPR---PFLYHTNL 71 (826) Q Consensus 1 M~~v~~s~~n~~gG--VSqQ~D~~R----y~~q~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~~---~~~~~~~r 71 (826) |+-+ +|--|+|= |+.-.++.| -..-+++.+|.=.+..|=.+||-|.+-+....... .+.. .+.|.. 
T Consensus 1 ~~~~--~~~~~~ginnv~~e~~l~~~~~~~~~~~r~a~nvdi~~~G~~~~r~~~tr~~~g~l~~-~~~~~~~~~~~~~-- 75 (396) T protein:vir:10 1 MATT--SLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQ-LWQSPLHGDAFGA-- 75 (396) T ss_pred Ccce--eeeeeecccccccccccccCCCcccceeeeeeeecccCCCchhhhccCcccCCceecc-cccCccccceeee-- Confidence 8866 33333322 444444544 34459999999999999999999988776443321 1111 122221 Q ss_pred CCCceEEEEEecCCeEEEEECCCCEEEEecCcccccccCCCcccEEEEEecCEEEEeeCCcceeeeecccCCCCCCccEE Q lcl|NC_012418. 72 GGRSIAMLVAQHRGELYLFDERDGRLLMGQPLVHDYLKAADYRQLRAATVADDLFIANLSVKPEADRTDVKGVDPNKAGW 151 (826) Q Consensus 72 d~~e~~~i~~~~~g~irv~d~~~g~~~~~~~~~~~yl~~~~~~~l~~~~vaD~~fi~n~~~~~~~~~~~~~~~~~~~~a~ 151 (826) ..+.|.-++.+.-........+ +..+.....+|-+|..+...+--+.. T Consensus 76 -----------~~~tl~~~~~~~w~~~~~v~v~--------~~pva~d~~~~Rvy~t~~~~p~~~~~------------- 123 (396) T protein:vir:10 76 -----------LGDQWGKVDPHSWTFEPLAQIG--------EGDLSHEVLNNRVCVAGTAGIFTYDG------------- 123 (396) T ss_pred -----------CCceEEEEeCCeEEEEeeeeec--------cCchhccccCCeEEEEcCCCceeeeC------------- Confidence 1222322222221111110001 11233445566667666444321110 Q ss_pred EEEcccccCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhhheeecccceEEeeeeecc Q lcl|NC_012418. 152 LYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKY 231 (826) Q Consensus 152 ~~vr~g~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~~~~~s~~~~~~~~~t~~~ 231 (826) ++.|.+.+..-. .+...+.. + .+.... ..|+. .|. ....+. T Consensus 124 --------~~~y~L~vp~P~----------~a~~~a~~-G---sl~~~~----~~Y~~-----------t~V-~~~gEE- 164 (396) T protein:vir:10 124 --------AQAERLTLDTPA----------PPLLVAGA-G---SLSQGT----YGAAV-----------AWL-RGPQES- 164 (396) T ss_pred --------CcceecCcCCCc----------cccccccc-C---ccCCce----EEEEE-----------EEE-ecCCCc- Confidence 111222221110 01111110 0 000000 00100 000 000000 Q ss_pred ceecccccccccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCCccceeEEEEEee Q lcl|NC_012418. 
232 PKVDPDTAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDG 311 (826) Q Consensus 232 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~ 311 (826) .++.... .. .+.++.-. |+- + .+.++ .+...+++- T Consensus 165 --s~p~~~S--------------------------~~-v~~~gg~~--------vtl----~--~~~~~-~i~~~RiYr- 199 (396) T protein:vir:10 165 --APSLIAF--------------------------AE-VTDAGALE--------VTF----P--LCLDA-SVTGARLYL- 199 (396) T ss_pred --Ccccccc--------------------------cc-cCCCCCcE--------EEE----E--cccCC-CcceEEEEE- Confidence 0000000 00 00000000 000 0 00000 112223321 Q ss_pred eEecCCCccceEEEEEecCCceEEEeecccccccccceeEEEEEecCCCeeEEeecCCcccccCCcccccCccccCCCce Q lcl|NC_012418. 312 AVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFVTRGIT 391 (826) Q Consensus 312 ~~~~~~~~~~~~y~~~~~~~~~w~E~~~~g~~~~~~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~~~ 391 (826) +++....+|+.-+ ....+.+|.+...+|........-=-|+|.. . T Consensus 200 ----S~~~G~~~~l~aE--------------------------~~a~~~s~vlPs~~w~gpP~~~~gL~pmP~G-----~ 244 (396) T protein:vir:10 200 ----TRANGGELLLAGD--------------------------YPLGAATVILPTLPELGRPAQFRHLSPMPTG-----K 244 (396) T ss_pred ----eCCChhhhhheeh--------------------------hccceeeeeeecCCCCCCCccccccccCchh-----H Confidence 1222222222111 1112233444556675443222112334332 2 Q ss_pred EEEEEcceEEEecCCeEEEEecCCcccCc-ccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCc Q lcl|NC_012418. 392 GMTTFQGRLVLLSQEYVCMSASNNPHRWF-KKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGG 470 (826) Q Consensus 392 ~v~~~q~RL~~~~~~~v~~S~~gd~~nF~-~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~~~~~~ 470 (826) .+.||.+||+++.++.||+|...-++=+. +..-++ . 
...|.-+.+.+.+|+++|+++-|.+.|.+ T Consensus 245 ~~A~faGRi~~A~Gn~V~FSEp~~Ph~~~~~~~~~~------------~--~~~Iv~lapv~~gL~Vgt~~~~y~~~G~d 310 (396) T protein:vir:10 245 HLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQ------------M--PQRITFVQPVDGGIWVGQVDHVAFLDGAD 310 (396) T ss_pred hhhhhcceEEEEeCCEEEEecCCCCceecchhccCC------------C--CCceEEEEEecCeEEEEEcCcEEEEEcCC Confidence 47899999999999999999998763221 111111 1 12356677888999999999999999864 Q ss_pred c--ccccceEEEEEEeecccC---------CCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHHHHHhc Q lcl|NC_012418. 471 I--VTPRTAVISITTQYDLDT---------RAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYM 539 (826) Q Consensus 471 ~--lTP~~~~~~~~s~~~~~~---------~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~ 539 (826) + ++.......+ ..-|+. .-..+..|..++|+++.| +. . ...++ - ...++. ..+ T Consensus 311 P~sms~~~l~~~~--pvp~S~v~~p~~~~s~rs~~~~~~~~lwas~dG-----l~---~-g~~~G-~-v~~l~~---~~i 374 (396) T protein:vir:10 311 PASLSVSRRASRA--PVPGSAVLVPAEVVGTNASPDGSPVAVWLAENG-----YV---M-GTSSG-A-IAEVHA---GVL 374 (396) T ss_pred hhHcceeecccCC--CcccchhcccchhhhcccccccCcEEEEccCCc-----EE---E-EcCCc-e-eeeecc---ccc Confidence 2 3333332110 011222 222345688999999887 22 1 12222 1 111222 122 Q ss_pred CCCeEEEEEcCCCCEEEEEEcCCCeEEEEEEeeC Q lcl|NC_012418. 540 PGPAEYIQAAASSGYLVFGTSTADEMICHQYLWQ 573 (826) Q Consensus 540 ~~~v~~~~~s~~p~~~v~~~~~~g~l~~~tyl~~ 573 (826) ... 
...+++ | +..|++.+++ + + T Consensus 375 ~p~-~~~A~~-------~-~~~drRy~~~--~-~ 396 (396) T protein:vir:10 375 AGI-TGRAGT-------S-VVFDRRLLTA--V-S 396 (396) T ss_pred CCC-cccceE-------E-EeecCeEEEE--e-C Confidence 211 111211 1 1122222221 1 0 No 33 >protein:vir:108312 Length: 458 # NCBI annotation: hypothetical protein # Family: family:all:1540 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552278;genbank:gi:160700603;genbank:GeneID:5758828 Probab=85.14 E-value=0.055 Score=27.49 Aligned_cols=426 Identities=10% Similarity=0.048 Sum_probs=157.9 Q ss_pred cEEEEEcccccCeeEEEEEeccCCcceeeeeeeeeEEeccCccccccccccceeeccchhhhhhh-----heeecccceE Q lcl|NC_012418. 149 AGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLY-----GKFFGAPEYT 223 (826) Q Consensus 149 ~a~~~vr~g~Y~r~ytv~i~g~~~s~~~t~~~ta~y~~p~~~~t~~~~~~~~~~~~~~~i~~~l~-----~~~~s~~~~~ 223 (826) -+..-|..|+|... +. ..+ .+-..++.+.... ..+...++.. T Consensus 1 m~~~~ip~gsy~a~-------------------------~~------~~d--aq~~VN~yp~~~e~g~ss~~l~~tPGl~ 47 (458) T protein:vir:10 1 MVQRQIPLVATTAE-------------------------GD------VSG--QEILVNVYPRKSDGGKYPFTLRHTPGLA 47 (458) T ss_pred Cceeeeceeeeecc-------------------------cc------ccc--ceeeeeeeeecccccccccceEecCCce Confidence 11222222222110 00 000 0111111111100 0000001100 Q ss_pred Eeeeeeccceecccccccccccc----eEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEEEeeccccccccccCC Q lcl|NC_012418. 224 LPNSTKKYPKVDPDTAAATVAGY----LNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGA 299 (826) Q Consensus 224 ~~~~t~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~~~~~~~ 299 (826) .- . +.......+- .......+..+|....+.... .+.+.+-... 
-++ T Consensus 48 ~f---------~-~~~~~~~~g~~~~~g~ly~v~g~~LY~V~~~~~~~-----------------~iG~i~gsg~--VsM 98 (458) T protein:vir:10 48 FF---------C-ELPTFPVMAMHQNGSRAFAVTPRDMYEISKDGTYK-----------------RLGSVDFKGR--VVM 98 (458) T ss_pred ee---------e-cCCCCceeeEEecCCEEEEeeCceEEEEeCCceEE-----------------EEecccCcee--EEE Confidence 00 0 0000000000 000011111222111111100 0100000000 000 Q ss_pred ccceeEEEEEeeeEecCCCccceEEEEEecCCceEEEeecccccccccceeEEEEEecCCCeeEEeecCCcccccCCccc Q lcl|NC_012418. 300 GTPGTGVQFMDGAVMATGSTKAPVYFEWDSANRRWAERAAYGTDWVLKKMPLALRWDEATDTYSLNELDYDRRGSGDEDT 379 (826) Q Consensus 300 ~~~g~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~w~E~~~~g~~~~~~tmp~~~~~~~~~~~f~~~~~~w~~r~~gd~~t 379 (826) +..|. +.++..|. .+++|+..+ ..+. ..|.+ T Consensus 99 a~ng~------q~vi~~G~----~gY~yd~at----------------------------~~~~---~i~d~-------- 129 (458) T protein:vir:10 99 EDNGK------QIVMVDGE----KGYYYDSET----------------------------EIVQ---EIKAE-------- 129 (458) T ss_pred eeCCc------EEEEEECC----eEEEEeecc----------------------------cEEE---eccCc-------- Confidence 00010 11111121 011222211 1111 11110 Q ss_pred ccCccccCCCceEEEEEcceEEEec--CCeEEEEecCCcccCcccccccCCCCccEEEEEcCCCceeEEEEeecCCcEEE Q lcl|NC_012418. 380 NPTFNFVTRGITGMTTFQGRLVLLS--QEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIV 457 (826) Q Consensus 380 np~psf~g~~~~~v~~~q~RL~~~~--~~~v~~S~~gd~~nF~~~s~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l 457 (826) .| ..+..|.|..+|++|.. +..++.|...| . -=||++++-+.++++.|.-++.+.+.|++ T Consensus 130 ----~~--~~~~~v~~~dGy~V~~~~g~~~~~is~L~d------~------s~d~l~fa~Ae~~pD~iv~i~~~~~~i~~ 191 (458) T protein:vir:10 130 ----GF--YPASTVTYQDGYFIFDRKGTGQFFISELLD------V------AFDPLDFATAEGQPDPLLAVLSDHREVFM 191 (458) T ss_pred ----cc--cCcceEEEeCcEEEEEeeCCCEEEEEecCc------c------eeCcceeeeecCCCCceEEEEeeccEEEE Confidence 11 12578999999999885 44566675444 1 14799999999999999999999999999 Q ss_pred EecCcE--EEEeCCccccccceE-EEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHHHH Q lcl|NC_012418. 
458 FAKKYQ--AVVPGGGIVTPRTAV-ISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSH 534 (826) Q Consensus 458 ~t~~~q--~~~~~~~~lTP~~~~-~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~ 534 (826) |.+..- |..+|+..+.=.... ... ..+|++.-.=..+|++++|+...+ .|+.+. .|+++-||-| T Consensus 192 fG~~TiEvw~ntG~a~fpy~r~~ga~i--~~Gcaa~~sv~~~~~t~~~l~~d~----~Vy~l~-------g~~~~rIST~ 258 (458) T protein:vir:10 192 FGQETIEVWYNSGAADFPFERNQGAFI--EKGIGAPYSVAKTNNTVYFIGSDL----MIYQIT-------GYTPVRISTH 258 (458) T ss_pred EeccceEEEEecCCCCcceeeccccee--eecccCcchhhhhCceEEEEcCCe----EEEEec-------CceeEEeeCH Confidence 987764 777775332211111 111 246776555578999999998643 454431 2333334433 Q ss_pred -HHHhcCCCeEEEEEcCCCCEEEEEEcCCC-eEEEEEEeeCCCcee--------eEeeEeeecCC--cEEEEEEE-CCeE Q lcl|NC_012418. 535 -IPSYMPGPAEYIQAAASSGYLVFGTSTAD-EMICHQYLWQGNEKV--------QNAFHRWTLRH--QIIGTYFT-GDNL 601 (826) Q Consensus 535 -~~~~~~~~v~~~~~s~~p~~~v~~~~~~g-~l~~~tyl~~~~e~~--------v~aW~~w~~~g--~v~~~~~~-~d~l 601 (826) +++.|.. | .-.+.+.++-..+| .+|.++| +... ...||.-++.+ ....-|++ .+.- T Consensus 259 aIE~~i~s------y-~~~da~a~t~~~eGH~fy~Ltf----P~a~~Tw~yD~~t~~Wher~Sg~~~~~Ra~~~v~~~g~ 327 (458) T protein:vir:10 259 AVEQTLKG------V-NLSDAFAYTYQSEGHLFYVLTI----PGKNLTWCYDISSGSWHVRQSYQFDRHVSNNSIYFDQK 327 (458) T ss_pred HHHHHHhc------C-ChhheEEEEEEecCeEEEEEEC----CCCCceeEEecccccceeeccCCCCceEEEEEEEeCCe Confidence 3333322 1 11124444444444 3666665 2110 01244322110 01111110 0000 Q ss_pred EEEEEeCCCEEEEEEEEeecCCcCCCCcccccceEEEEeeccceeeecccccCccccccceEEeeeeeeEeeccEEcccE Q lcl|NC_012418. 602 MVLIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEATVEGELELTKQHWDLIKDAPAVYQLQPVAGAFMERYQLGVK 681 (826) Q Consensus 602 ~~vv~R~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 681 (826) +++=.. .++.+-+++. 
T Consensus 328 ~~vGD~-~ng~ly~ld~--------------------------------------------------------------- 343 (458) T protein:vir:10 328 TLVGDF-QNGRIYIMAD--------------------------------------------------------------- 343 (458) T ss_pred EEEEEc-CCCeEEEEcc--------------------------------------------------------------- Confidence 000000 0001111100 Q ss_pred ecCCCeEEEeecCCcCCceEEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCcee-- Q lcl|NC_012418. 682 RETNTKVFLDVPEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQP-- 759 (826) Q Consensus 682 ~~~~g~~~l~~~~~~~~~~v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~-- 759 (826) ..+.+ -|-+.+..+.+ |+ +. . .+.|++++++.|.+. ||-..+ .++.-+.+. T Consensus 344 ---------~~~td-------~g~~i~~~~~~-p~-~~-~-----~~~rl~~~~~el~~~-tGvg~~--~~~~~~p~~~l 396 (458) T protein:vir:10 344 ---------NYYTD-------DGDPVVREFIL-PV-VN-N-----GREFLTVDSLELDLS-SGVGLT--VGQGSDPELRV 396 (458) T ss_pred ---------cCcCC-------CCceeeeeeec-cc-ee-C-----CCCeEEEEEEEEEEe-cceeee--eCCCCCceEEE Confidence 00000 12222233222 11 11 1 112444555554331 111000 011111111 Q ss_pred -eeeecccccccccc--ccCccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEE Q lcl|NC_012418. 760 -WYDTTPLRLFSRQL--NAGEPLVDSAVVPLPARVAMATSKFELSCHSPYDMNVRAVEYNFK 818 (826) Q Consensus 760 -~~~~~~~~~~~~~~--~~~~~~~~tg~~~~p~~~~~~~~~v~i~~~~p~P~tvl~i~~eg~ 818 (826) +-+..+.-++.... 
.+|++-.+...+.+.-.|..++--++|+-..|.|.+|+++..+.+ T Consensus 397 ~~S~d~g~~~s~~~~~~~lg~~gey~tr~~~~rlG~ar~rvf~v~~s~p~~~~l~ga~~~~r 458 (458) T protein:vir:10 397 YFSKDNGNEYSQNFKVGKIGRKGEFLTRAKVNRFGCARQFTFKVEISDPIPVDIGGAWVEVR 458 (458) T ss_pred EEeeCCCcccchhHHHhhcCCcchhhhhhhhhhhccCcceEEEEEEecchhhcceeeeEEeC Confidence 11111111111111 124443333444443346666666999999999999999999998 No 34 >protein:vir:3529 Length: 477 # NCBI annotation: P28 # Family: family:all:1540 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050989;genbank:gi:9633575;genbank:GeneID:1262322 Probab=64.33 E-value=0.3 Score=23.44 Aligned_cols=437 Identities=10% Similarity=0.081 Sum_probs=155.5 Q ss_pred cccccceEecccCCcEEEEEcCCCeEEEEeecCCCcceEEEEEE-EeeccccccccccCCccceeEEEEEeeeEecCCCc Q lcl|NC_012418. 241 ATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGM-SLNATADLPALLPGAGTPGTGVQFMDGAVMATGST 319 (826) Q Consensus 241 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~~~~~~ 319 (826) .....+-.. .-+.-..+........++..... .+.. ........+.-++. .+|-.. ....+|-. T Consensus 1 ~~~~~~m~~-------~~ipl~~g~~~~~~~~d~~~~~P-VN~~a~p~~~~~s~~~L~~--~pG~~~-----~~~~~G~~ 65 (477) T protein:vir:35 1 MLSEVFMPK-------IQIPLAKGLVKDIKTADYIDALP-VNMLATPKEVLNASGYLRS--FPGIEK-----KQDAKGVS 65 (477) T ss_pred Ccccceeee-------eccccccccccccccccceeeee-eccceeecccccccccccc--CCccee-----eccCCccc Confidence 000000000 00000001000000111111110 0000 01111111111111 112111 11111211 Q ss_pred cceEEEEEe------cCCceEEEeecccccc--cccceeEE-----EEEecCCCeeEEeecCCcccccCCcccccCcccc Q lcl|NC_012418. 320 KAPVYFEWD------SANRRWAERAAYGTDW--VLKKMPLA-----LRWDEATDTYSLNELDYDRRGSGDEDTNPTFNFV 386 (826) Q Consensus 320 ~~~~y~~~~------~~~~~w~E~~~~g~~~--~~~tmp~~-----~~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~ 386 (826) ...+|...+ .++..|+-+..-|... +...|-+- ++..-...-|..+-..++-+ ....+-+|.|. 
T Consensus 66 RG~~~~~~~g~lY~V~G~~LY~v~~~vG~I~gsg~VsMa~n~~~~aIv~~g~~~gy~y~~t~~~~~---~~~~~~~p~~~ 142 (477) T protein:vir:35 66 RGVHFNTKNNALYRVCGNTLYRNDKEVADIAGMSRVSMSHSSHSQAICFEGKVKLYRYDGTEKALS---NWPKDKYPQYD 142 (477) T ss_pred cceeEeecCCeEEEEecCeeEeeeeeeeeecccccEEEeeCCcEEEEEECCcceeEEEecccceee---ecCccccCCcc Confidence 222222111 1223343111112111 12222221 11010111122222222221 12233467777 Q ss_pred CCCceEEEEEcceEEEecC--CeEEEEecCCcccCcccccccCCCCccEE-EEEcCCCceeEEEEeecCCcEEEEecCcE Q lcl|NC_012418. 387 TRGITGMTTFQGRLVLLSQ--EYVCMSASNNPHRWFKKSAAALNDDDPIE-IAAQGSLTEPYEHAVTFNKDLIVFAKKYQ 463 (826) Q Consensus 387 g~~~~~v~~~q~RL~~~~~--~~v~~S~~gd~~nF~~~s~~~~~ddD~i~-~~~~~~~~~~i~~~v~~~~~L~l~t~~~q 463 (826) ...+..|+|..+|++|.-+ +.++.|-.-|-.- -|+++ ++.+.++++.|.-++.+.+.|++|.+..- T Consensus 143 l~~~~~v~f~dGyfV~~~~gt~~~~iS~L~d~s~-----------~d~~~~FasAE~~pD~Ivgi~~~~~~i~lfG~~Ti 211 (477) T protein:vir:35 143 LGEVIDVCRNRGRYIWLQKGGERFGVTDLEDESK-----------PDRYQPFYRAESQPDGIVSVDAWRDLIVCFGSSSI 211 (477) T ss_pred ccceeEEEeeCceEEEeecCCCeEEEeecCCccc-----------cccccccccccCCCCceEEEEeeccEEEEEeccce Confidence 7778889999999988753 3344464333222 26666 66677888889999999999999987774 Q ss_pred --EEEeCCcccc-c-c--ceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccccccchhHHH-HHHH Q lcl|NC_012418. 
464 --AVVPGGGIVT-P-R--TAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVT-SHIP 536 (826) Q Consensus 464 --~~~~~~~~lT-P-~--~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls-~~~~ 536 (826) |..+|+..++ | - ......+ .+|++.-.=..+|++++|+......--.|+.+ +.|+++-|| .-++ T Consensus 212 Evw~ntG~a~f~~p~~r~~~~~mIq--~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~-------~g~q~~rIST~aIE 282 (477) T protein:vir:35 212 EYFTLTGSADTSQPLYIHQAAYMIQ--AGIAGRDCKCRYQDKYAILSHQSTGQPAVYLI-------GAGEKNKISTATID 282 (477) T ss_pred EEEEecCCCCCCcceeecCCceeee--ecccCchhhhhhCceEEEEecCCCcccEEEEc-------cCceeEEecCHHHH Confidence 7778765454 2 1 1122233 47776655588999999998753221234322 124444453 3344 Q ss_pred HhcCCCeEEEEEcCCCCE--EEEEEcCCC-eEEEEEEe------eCCCceeeEeeEeeecC---CcEEEEEEE-CCeEEE Q lcl|NC_012418. 537 SYMPGPAEYIQAAASSGY--LVFGTSTAD-EMICHQYL------WQGNEKVQNAFHRWTLR---HQIIGTYFT-GDNLMV 603 (826) Q Consensus 537 ~~~~~~v~~~~~s~~p~~--~v~~~~~~g-~l~~~tyl------~~~~e~~v~aW~~w~~~---g~v~~~~~~-~d~l~~ 603 (826) +.|.. |...... ++++...+| ++|+++|= +-.-++..-.|+--..+ ......|++ -+--++ T Consensus 283 ~~i~a------y~~~e~a~af~~t~~~eGH~fy~LtfP~~Tw~yD~at~~w~e~W~~~~~g~~~~~~Ra~~~~~~~g~~~ 356 (477) T protein:vir:35 283 KIIRY------YSADELAASFMESIRFDNHELLLLHLPKHTLCFDGSASHQYSQWSLLKSGFYDEPYRAIDFMFFDNQIT 356 (477) T ss_pred HHHHh------cCCcchhceeEEEEEeCCeeEEEEEcCCceEEEecccccccceeeeeccCCccCceEEEEEEEeCCeEE Confidence 44432 2222221 223333333 24444440 00011100112211111 122222222 122212 Q ss_pred EEEeCCCEEEEEEEEeecCCcCCCCcccccceEEEEeeccceeeecccccCccccccceEEeeeeeeEeeccEEcccEec Q lcl|NC_012418. 604 LIQKGQEIALGRMHLNSLPAREGLQYPKYDYWRRIEATVEGELELTKQHWDLIKDAPAVYQLQPVAGAFMERYQLGVKRE 683 (826) Q Consensus 604 vv~R~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 683 (826) +=.. .++.+-+++..... ++ +.......+.. 
T Consensus 357 vGD~-~ng~l~~ld~~~~~-----d~----------------------------------------g~~i~~~~~~p--- 387 (477) T protein:vir:35 357 VGDK-KEGVLGHLIFNASN-----QY----------------------------------------EQQTEHLLYTP--- 387 (477) T ss_pred EEEc-CCCeEEEECCCCcc-----cC----------------------------------------CCccceEEecc--- Confidence 2111 12222222110000 00 00000000000 Q ss_pred CCCeEEEeecCCcCCceEEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeee Q lcl|NC_012418. 684 TNTKVFLDVPEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDT 763 (826) Q Consensus 684 ~~g~~~l~~~~~~~~~~v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~ 763 (826) .+..++ ..+. ..+++ +..+-| ....+ +-|+..+-| ... T Consensus 388 -----~~~~d~----~Rv~-----~~el~-----~~tGvg--q~~d~-----v~L~~sddG-~~~--------------- 425 (477) T protein:vir:35 388 -----MIKADN----ARLF-----DFELE-----ASTGVA--QIADK-----LFLSVTTDG-INY--------------- 425 (477) T ss_pred -----eeeCCC----CeEE-----EEEEE-----EecCcC--ccCce-----EEEEEeccc-ccc--------------- Confidence 000000 0111 11111 110111 11111 122221110 000 Q ss_pred ccccccccccccCccccccceEEEEeecccc---eeEEEEEECCCCCEEEEEEEEE Q lcl|NC_012418. 
764 TPLRLFSRQLNAGEPLVDSAVVPLPARVAMA---TSKFELSCHSPYDMNVRAVEYN 816 (826) Q Consensus 764 ~~~~~~~~~~~~~~~~~~tg~~~~p~~~~~~---~~~v~i~~~~p~P~tvl~i~~e 816 (826) ...++ ...|++--+.....+--.|..+ ..++++....|.+++++++-.| T Consensus 426 ~~~~~----~~~g~~g~~~~r~~~~RlG~~r~~vgf~~r~~~~~pv~l~~~~~~~e 477 (477) T protein:vir:35 426 SREQL----IEQNSPFQYDKRILWRRIGRVRKNIGFKIRIITKSPVTLSDLSIRME 477 (477) T ss_pred cccee----ecCCCccccccceeeeeeeeceeccceEEEEEecCCceeccceeEeC Confidence 00000 0112222222233333334333 2679999999999999999999 No 35 >protein:vir:95324 Length: 823 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512269;genbank:gi:89152436;genbank:GeneID:3952993 Probab=21.34 E-value=2.6 Score=18.29 Aligned_cols=666 Identities=9% Similarity=0.018 Sum_probs=192.7 Q ss_pred hHhHhhhcCcccccccccEEEEEeCCCceEEEEEecCCeEEEEECCCCEEEEe-cCcccccccCCCcccEEEEEecCEEE Q lcl|NC_012418. 48 IELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLVAQHRGELYLFDERDGRLLMG-QPLVHDYLKAADYRQLRAATVADDLF 126 (826) Q Consensus 48 t~fv~~~~~~~~~~~~~~~~~~~rd~~e~~~i~~~~~g~irv~d~~~g~~~~~-~~~~~~yl~~~~~~~l~~~~vaD~~f 126 (826) ++ +..++ .+|+-||-.-.|. +...+.-|. .+...+.+ -+....-+.- ....+++-.. T Consensus 1 m~-i~~~q-----------~sF~~GElsP~l~---gR~Dl~ry~-~q~~~~~N~~~~~~GGl~r--RpGt~fva~~---- 58 (823) T protein:vir:95 1 MA-ISWIQ-----------PSFAGGEIGPSLY---GRIDMAKYQ-VALRKCDNFIVRQYGGVEN--RPGTRFVGAA---- 58 (823) T ss_pred Cc-ceeec-----------hhccCceechhee---ccchHHHHH-HHHhhhhCcEeeecCCcee--cCchhhhhhh---- Confidence 11 11111 1121111111110 000000000 00000000 0000000000 0011111100 Q ss_pred EeeCCcceeeeecccCCCCCCccEEEEEcccccCeeEEEE--EeccCCcceeeeeeeeeEEeccCcc-ccccccccceee Q lcl|NC_012418. 127 IANLSVKPEADRTDVKGVDPNKAGWLYIKAGQYSKAFSMT--IKVKDNATGTTYSHTATYVTPDNAS-TNPNLAEAPFQT 203 (826) Q Consensus 127 i~n~~~~~~~~~~~~~~~~~~~~a~~~vr~g~Y~r~ytv~--i~g~~~s~~~t~~~ta~y~~p~~~~-t~~~~~~~~~~~ 203 (826) -+++-+.+-. ...++. ...++-.-++.|-|-|.-. +.. 
..+..+..+.+|+..+..+ -....+|....+ T Consensus 59 -~~~~g~~rLi---pf~~s~-~q~y~Lefg~~~irV~~~~g~vv~---~~~~~~ev~tPy~~~~l~~Lr~~qsaD~~fiv 130 (823) T protein:vir:95 59 -KYPNRKCRLI---PFQFST-VQTYALEFGHQYMRVIKDGALVLN---SSNVIYEIATPYTEADLFRIKFTQSADVLTLV 130 (823) T ss_pred -cCCCCCeeEE---EEEeCC-CcEEEEEEcCCeEEEEeCCcEEEe---cCCceeEEecccccccccceeEEEeccEEEEE Confidence 0000000000 000111 1122222233444444210 000 1111222223333221111 122346778888 Q ss_pred ccchhhhhhhheeecccceEEeeeeeccceecccccccccccceEecccCCcEEEEEcCCCeEEEEeecCC-------Cc Q lcl|NC_012418. 204 SVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDTAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMG-------NN 276 (826) Q Consensus 204 ~~~~i~~~l~~~~~s~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-------~~ 276 (826) |.++.++++.+. +...|.+....+........... .+ -+.......+.++..+...... ....+. .. T Consensus 131 h~~~~p~~L~r~--~~~~w~l~~~~~~~gp~~~~~~~--~t-~~v~~~~~~~~~t~ta~~~~~~-~d~vg~~~~l~~~~~ 204 (823) T protein:vir:95 131 HPAYPPKELRRY--AHDNWQLVDVVTKNGPFEDINID--ES-LTVYASASTGTITLTASASIFG-AEQVGKLFYLEQPAV 204 (823) T ss_pred cCCccceEEEec--CCCCceEEEEEEecccccccccc--ce-eEEeccccCceeEEeecccccc-hhhccceEEEecccc Confidence 888888877543 33456666554322111100000 00 0000111111111111000000 000000 00 Q ss_pred ceEEE--------EEEEeeccccccccccCCccceeEEEEEeeeEecCCCccceEEEEEecCC--ceEEEeecccccccc Q lcl|NC_012418. 277 YGIAS--------GGMSLNATADLPALLPGAGTPGTGVQFMDGAVMATGSTKAPVYFEWDSAN--RRWAERAAYGTDWVL 346 (826) Q Consensus 277 ~~~~~--------~~~~v~~~~~l~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~y~~~~~~~--~~w~E~~~~g~~~~~ 346 (826) ..... .........++...... +. .+ ..........+|+.+.... ..|.|.... 
T Consensus 205 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~-----~g-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 268 (823) T protein:vir:95 205 DSVPVWETSKSTSIGDIRRADSNYYRAVTA----GK-----TG-TLRPSHTEGTSWDGWGGSGDDDTGIEWEYL------ 268 (823) T ss_pred ceeeecceeeeecccceEEecccceeeeec----cc-----cc-eeecccCCcceEEeceecccccceeEEEEE------ Confidence 00000 00000000000000000 00 00 0011122233333322211 112221111 Q ss_pred cceeEEEEEecCCCeeEEeecCCcccc-------cCCcccccCcccc--CCCceEEEEEcceEEEecCCe-EEEEecCCc Q lcl|NC_012418. 347 KKMPLALRWDEATDTYSLNELDYDRRG-------SGDEDTNPTFNFV--TRGITGMTTFQGRLVLLSQEY-VCMSASNNP 416 (826) Q Consensus 347 ~tmp~~~~~~~~~~~f~~~~~~w~~r~-------~gd~~tnp~psf~--g~~~~~v~~~q~RL~~~~~~~-v~~S~~gd~ 416 (826) ....|.+.+...+..-.. -.....+.-++|. -...+...=|-.-..|- .+. ++++-.+.. T Consensus 269 ---------~~~~g~~~~t~v~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~g~Ps~v~f~-q~RL~f~g~~~~p 338 (823) T protein:vir:95 269 ---------HSGFGIARITAVNGTTATAEVISYIPSQVVGEDNASYKWAKYAWNSVNGYPGTVVYY-QQRLYFAASTAFP 338 (823) T ss_pred ---------eCCcceEEEEeecceeeeceEeeeeccccccCCcCCccccccccCcCCCCccEEEEE-eceEEEEEcCCCC Confidence 111222222211110000 0000001111111 00000000000000111 122 333333344 Q ss_pred ccCcccccccCCCCccEEEEEcC--CCceeEEEEeecCCcEEEEecCcEEEEeCCc--cccccceEEEEEEeecccCCCC Q lcl|NC_012418. 417 HRWFKKSAAALNDDDPIEIAAQG--SLTEPYEHAVTFNKDLIVFAKKYQAVVPGGG--IVTPRTAVISITTQYDLDTRAA 492 (826) Q Consensus 417 ~nF~~~s~~~~~ddD~i~~~~~~--~~~~~i~~~v~~~~~L~l~t~~~q~~~~~~~--~lTP~~~~~~~~s~~~~~~~~~ 492 (826) ++.+-+- -.|..+|..+. ...+.|.-.+..++.=. -+|+++.++ ++|-. .+...... 
....+- T Consensus 339 ~~v~~Sr-----tgd~~nF~~~~~~~DdD~I~~~~s~~~~~~-----i~~~v~~~~Lli~t~~-~e~~l~~~--~~~~lT 405 (823) T protein:vir:95 339 QTIWASR-----TGDYKDFGKSNPTQDDDRIIYTYAGRQVNE-----IRHLIDVGSLVALTSG-GEYVITGD--QNKVLT 405 (823) T ss_pred cEEEEec-----cCCccccccccCCCCCCcEEEEEcCCcceE-----EEEEeecCcEEEEecC-cEEEEEcC--CCcccc Confidence 4443331 23444443322 23344554444332111 122332221 11111 11111100 001112 Q ss_pred cEEeCCeEE-EEecCCCceeEEEEEeeccccccccchhHHHHHHHHhcCCCeEEEEEcCCCCEEE---EE---E--cCCC Q lcl|NC_012418. 493 PAVTGRSVY-FAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYIQAAASSGYLV---FG---T--STAD 563 (826) Q Consensus 493 Pv~vg~~v~-f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~~~~~~~v~~~~~s~~p~~~v---~~---~--~~~g 563 (826) |-.+ .+ -.+..|- +.++ |....+....+.+ -...|++++|+-..+.-. .+ . .... T Consensus 406 P~~~---~~~~~s~~g~--~~~~----------Pv~vg~~~~Fv~~-~g~~vre~~~~~~~d~~~~~dlT~~a~hl~~~~ 469 (823) T protein:vir:95 406 PSSF---AFSSQGSNGS--SNVP----------PIAVANIALFVQE-KGSVVRDLAYSFDVDGYQGNDLTILANHLFQKH 469 (823) T ss_pred eeeE---EEEEeecccc--cccc----------ceEeCCeEEEEec-CCCEEEEEEEeeecCceecchhhhhhhhhcCCC Confidence 2111 11 0000000 0010 0000000011110 012456666654322111 00 0 0112 Q ss_pred eEEEEEEeeCCCceeeEeeEeeecCCcEEEEEEECC-eEEEEEEeCCCEEEEEEE-------------EeecCCcCC--- Q lcl|NC_012418. 564 EMICHQYLWQGNEKVQNAFHRWTLRHQIIGTYFTGD-NLMVLIQKGQEIALGRMH-------------LNSLPAREG--- 626 (826) Q Consensus 564 ~l~~~tyl~~~~e~~v~aW~~w~~~g~v~~~~~~~d-~l~~vv~R~~~~~~~r~~-------------~~~~~~~~~--- 626 (826) .+..+.|. ++-.-..|. +.-+|++.+++.+.+ +++-=-+..-++..+.++ .++...... 
T Consensus 470 ~i~~~a~~---~~p~~~~~~-v~~dG~l~~~ty~~~q~v~aW~~~~~~g~~~~~~~i~~~~~d~l~~~v~R~i~g~~~~y 545 (823) T protein:vir:95 470 SIVDWCFS---IVPYSSAFC-IRDDGKLLVMTYLRDQQVFAWAPQSSTGKYESTCSISEGNEDAVYFVVNRTVNGQTVRY 545 (823) T ss_pred ceEEEEEe---cCCCeEEEE-EecCCcEEEEEEecccceeeeEEEecCCcEEEEEEecCCCCCEEEEEEEeccCCeEEEE Confidence 33333432 111112343 222566666555522 222222222223222221 222111000 Q ss_pred ---C---CcccccceEEEEeeccceeeecccccCcccc--ccceEEeeeeeeEeeccEEcccEecCCCeEEEeecCCcCC Q lcl|NC_012418. 627 ---L---QYPKYDYWRRIEATVEGELELTKQHWDLIKD--APAVYQLQPVAGAFMERYQLGVKRETNTKVFLDVPEAVVG 698 (826) Q Consensus 627 ---~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~ 698 (826) . .....+...++||...+...........+.. ....|+++.++.. +|+.+.....+ .++++|+.+ . T Consensus 546 iE~~~~~~~~~~~~~~~lD~~~s~~g~~~~~~~~~l~~g~~~l~~l~g~~v~~-adg~~~~~~~v-~g~i~l~~~----~ 619 (823) T protein:vir:95 546 IERLSSRLFTSDEDAFFVDSGLSYDGRNTSDRTMTITGGSGEWDYLAEYTISV-SGGAYFTSSDV-GAQLQFPYT----G 619 (823) T ss_pred EEeeccccCCCccceeEEEEEEEeecCcccceeeEecCCCCcccccCceEEEe-cCcceECCccc-eeEEEeCcC----C Confidence 0 1123344455666555444322211111111 1123667776644 77777666555 588988754 4 Q ss_pred ceEEEEEeeeeEEEeCCeEEECCCCceeeecceEEEEEEEEeeccceEEEEecCCCCCceeeeee-ccccccccccccCc Q lcl|NC_012418. 699 SVYVVGCEFWSKVEFTPPVLRDHNGLPMTSARAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDT-TPLRLFSRQLNAGE 777 (826) Q Consensus 699 ~~v~vG~~y~~~~~~~~~~~~~~~g~~~~~~r~~i~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 777 (826) ..+++|++|++.++++++...++. ...+... ++..+.++...+.....-.. ....... .+... ..+..|. 
T Consensus 620 ~~~~vGl~~~~~i~~~~~~v~~~~-a~~~~~~-r~v~a~l~~~~t~~~~~~~~-----~~~gL~hleg~tv--~v~~dg~ 690 (823) T protein:vir:95 620 ADPDTGYEVSKELRCDIISVTSNT-AVVVRAN-RNVPPSLRNVATTNWQMARR-----TFGGLSHLEGQTV--NILSDAN 690 (823) T ss_pred CccccccceEEEEEEeeceeeCCc-eEEEccC-Ccccceeeeeeccccccccc-----eeeeccccccceE--EEEEcCe Confidence 568999999999999988876443 3332222 23333333322222111000 0000000 00000 0000121 Q ss_pred --cc--cccceEEEEeecccceeEEEEEEC-CCCCEEEEEEEEEEEEecccccC Q lcl|NC_012418. 778 --PL--VDSAVVPLPARVAMATSKFELSCH-SPYDMNVRAVEYNFKSNQTYRRV 826 (826) Q Consensus 778 --~~--~~tg~~~~p~~~~~~~~~v~i~~~-~p~P~tvl~i~~eg~y~~r~rrv 826 (826) |+ ...|.+++|.....-++-+..++. .|||+.+. +.|..--|.||| T Consensus 691 ~~~~~~v~~G~vtl~~~~~~v~vGl~~~~~~~~l~~~~~---~~g~~~g~~~ri 741 (823) T protein:vir:95 691 VEPQKVVSGGAVTLESPGAVVHIGLPITAEFETLDININ---GQETLLDKKQVI 741 (823) T ss_pred eeCCeEecCCEEEecCCCCEEEEeecceeeEEecchhcC---CCcccCCceeEE Confidence 22 235777776543222222333332 47777654 357777888888 Done!