Query lcl|NC_015271.1_cdsid_YP_004306686.1 [gene=12] [protein=predicted tail tubular protein B] [protein_id=YP_004306686.1] [location=21693..24080] Match_columns 795 No_of_seqs 150 out of 213 Neff 8.1 Searched_HMMs 1612 Date Thu Nov 7 12:50:07 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_35 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_35_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:10452 Length: 794 100.0 7E-258 4E-261 1430.5 90.3 794 1-795 1-794 (794) 2 protein:vir:2203 Length: 794 # 100.0 8E-257 5E-260 1424.7 92.4 794 1-795 1-794 (794) 3 protein:vir:99677 Length: 794 100.0 7E-252 4E-255 1397.6 89.4 792 1-795 1-794 (794) 4 protein:vir:1543 Length: 801 # 100.0 1E-249 6E-253 1385.8 87.1 792 1-795 1-801 (801) 5 protein:vir:94583 Length: 792 100.0 3E-248 2E-251 1377.8 88.8 789 1-795 1-792 (792) 6 protein:vir:3366 Length: 801 # 100.0 2E-247 1E-250 1373.2 87.2 792 1-795 1-801 (801) 7 protein:vir:94713 Length: 785 100.0 2E-246 1E-249 1367.2 89.8 783 1-795 1-785 (785) 8 protein:vir:8887 Length: 808 # 100.0 1E-237 6E-241 1319.8 86.7 787 1-795 1-808 (808) 9 protein:vir:97014 Length: 800 100.0 9E-226 6E-229 1254.4 84.5 771 2-795 1-800 (800) 10 protein:vir:105647 Length: 800 100.0 1E-223 6E-227 1243.1 82.6 782 2-795 1-800 (800) 11 protein:vir:7021 Length: 803 # 100.0 5E-223 3E-226 1239.3 84.9 779 2-795 1-803 (803) 12 protein:vir:103341 Length: 806 100.0 1E-220 8E-224 1226.0 85.4 775 2-795 1-806 (806) 13 protein:vir:80253 Length: 777 100.0 1E-218 9E-222 1215.0 84.5 764 1-795 1-777 (777) 14 protein:vir:6326 Length: 826 # 100.0 5E-214 3E-217 1190.0 84.9 769 1-795 1-826 (826) 15 protein:vir:100022 Length: 976 100.0 2E-211 1E-214 1175.5 81.9 773 1-795 1-976 (976) 16 protein:vir:78957 Length: 826 100.0 6E-210 4E-213 1167.7 83.4 771 1-795 1-826 (826) 17 protein:vir:78703 Length: 905 100.0 1E-209 8E-213 1165.8 83.9 770 1-795 1-905 (905) 18 protein:vir:103790 Length: 768 100.0 3E-179 2E-182 999.4 74.7 725 1-792 1-768 (768) 19 protein:vir:95324 Length: 823 100.0 8E-169 5E-172 942.3 73.1 722 1-791 1-823 (823) 20 protein:vir:107802 Length: 681 100.0 7E-165 4E-168 920.7 70.1 666 1-790 1-681 (681) 21 protein:vir:107423 Length: 681 100.0 7E-165 4E-168 920.7 70.1 666 1-790 1-681 (681) 22 protein:vir:98487 Length: 681 100.0 7E-165 4E-168 920.7 70.1 666 1-790 1-681 (681) 23 protein:vir:7329 Length: 825 # 100.0 8E-165 5E-168 920.3 70.1 720 1-791 1-825 (825) 24 protein:vir:1778 Length: 680 # 100.0 2E-161 1E-164 902.2 54.0 541 1-551 1-680 (680) 25 protein:vir:102644 Length: 594 100.0 1E-140 7E-144 787.7 59.4 561 1-791 1-594 (594) 26 protein:vir:94602 Length: 1012 99.6 6.5E-13 4E-16 87.4 35.4 761 1-795 1-1010(1012) 27 protein:vir:80177 Length: 1027 99.1 4.7E-10 2.9E-13 71.7 27.1 725 1-795 1-936 (1027) 28 protein:vir:2625 Length: 715 # 99.1 1.3E-09 8.3E-13 69.2 40.3 627 1-792 1-715 (715) 29 protein:vir:95475 Length: 771 98.2 2.7E-06 1.7E-09 51.1 39.2 653 1-792 1-771 (771) 30 protein:vir:3133 Length: 911 # 98.0 8.5E-06 5.3E-09 48.4 38.5 686 1-774 1-911 (911) 31 protein:vir:8837 Length: 513 # 97.6 4.4E-05 2.8E-08 44.4 39.2 481 1-794 1-513 (513) 32 protein:vir:105563 Length: 396 95.3 0.0024 1.5E-06 35.0 21.7 380 1-532 1-396 (396) 33 protein:vir:95324 Length: 823 89.9 0.024 1.5E-05 29.5 38.0 628 31-795 1-741 (823) 34 protein:vir:3529 Length: 477 # 89.2 0.028 1.7E-05 29.1 31.9 430 200-712 1-477 (477) 35 protein:vir:105428 Length: 472 87.7 0.037 2.3E-05 28.4 33.7 423 175-712 1-472 (472) 36 protein:vir:177 Length: 472 # 79.6 0.1 6.4E-05 26.0 34.5 427 175-712 1-472 (472) 37 protein:vir:2109 Length: 472 # 55.6 0.48 0.0003 22.3 31.3 423 209-712 1-472 (472) 38 protein:vir:108312 Length: 458 34.5 1.3 0.0008 20.0 34.9 437 175-787 1-458 (458) No 1 >protein:vir:10452 Length: 794 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848299;genbank:gi:30387490;genbank:GeneID:1733952 Probab=100.00 E-value=6.7e-258 Score=1430.49 Aligned_cols=794 Identities=85% Similarity=1.361 Sum_probs=772.6 Q ss_pred CCceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCceEEEE Q lcl|NC_015271. 1 MALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESEQYYAV 80 (795) Q Consensus 1 M~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (795) ||+|+|+||||++|||||+|++||++||++|+||+|+|+|||+||||++||+++++++......++++|++|++|+|+|+ T Consensus 1 M~~i~~s~~n~~~GvSqq~D~~Ry~~q~~~~~N~~~~~~gG~~rRpGt~fv~~l~~~~~~~~~~~~~~~~rd~~e~~~v~ 80 (794) T protein:vir:10 1 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLKTLGDNGALGQAPYIHLINRDENEQYYAV 80 (794) T ss_pred CcceeeecchhhcccccCCchHHhhhhHhhhhcceeeeccCcccCcchhhheeccCCCccccceeeeEEecCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999998877777778888899999999999 Q ss_pred EeCCeEEEEecCCcEEEEEECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEeccc Q lcl|NC_015271. 81 FTGTGIRVFDLAGNERQVRYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINVRGG 160 (795) Q Consensus 81 ~~~~~~rv~~~~g~~~~v~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v~~~ 160 (795) |++++||||+++|.++.+.. ++.+||+.+++++++|+|+|+||+|||+|++++|++..++.+...+++..++|++++++ T Consensus 81 ~~~~~irv~~~~G~~~~v~~-~~~~~Y~~aa~~~~~l~~~q~aD~~fivn~~~~~~~~~~~~~~~~~~~~~~~~~~v~~g 159 (794) T protein:vir:10 81 FTGTGIRVFDLAGNEKQVRY-PNGSNYIKTANPRSDLRMVTVADYTFIVNRNVVVQKDPNSVNLANYNPKQDGLINIRGG 159 (794) T ss_pred EeCCeEEEEEcCCcEEEEEc-CCCCcceecCCCcceEEEEEEcCEEEEEcCCeeeeeeccccccCCCCCCccEEEEeccc Confidence 99999999999999888765 56789999999999999999999999999999999999988888888889999999999 Q ss_pred ccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhcccccCceeeeecCceEEEEecCCcceeeEEEecCc Q lcl|NC_015271. 161 QYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQCRVSAPGWTFNVGQGYIHIIAPEGQQIDSLTTKDGY 240 (795) Q Consensus 161 ~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~~~~~~~~dg~ 240 (795) +|+++|++++++...+++++|++++++.+.+++++|++.+|.+++.+..++|++..++++++|+++++..+.++++.+++ T Consensus 160 ~y~r~y~i~i~~~~~at~~tpdgt~~~~~~~~s~~~ia~~L~~~l~a~~~g~t~~~~g~~i~i~a~s~~~~~t~s~~~~~ 239 (794) T protein:vir:10 160 QYGRELIVHINGKDVATYKIPDGSKPEHVNNTDAQWLAERLAKQMRINLSGWTVNVGQGFIHVTAPSGQQIDSFTTKDGY 239 (794) T ss_pred ccceEEEeccCCcceeEEEecCCCCcccceecchhhhhhhhhhhhhcccCCceEEeCCeEEEEEeccCceeccccccCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999989999999999 Q ss_pred CcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeEEEEeecCce Q lcl|NC_015271. 241 ADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALVRAADGN 320 (795) Q Consensus 241 ~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~~~v~~~~~t 320 (795) .++++.++.++++++++||++|++|++|+|.++++++.++||++|+...+.|+||++++...+++.+|||+.++++++++ T Consensus 240 ~~~~~~~v~~~~~~~~~lp~~~~~G~~v~i~~~~~~~~~~yyv~~~~~~~~w~E~~~~g~~~~~~~~tmP~~l~r~~~~t 319 (794) T protein:vir:10 240 ADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWTETLGWNTENQVLLETMPHALVRAADGN 319 (794) T ss_pred CcceeEEEEeccCcceecccCCCCCcEEEEEeCCCCCcceeEEEEEcCCcEEEEecccceeEEEecccceeEEEEeccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeecccCCccccCCcccccccccccCCCccEEEEEcceEEEecCCeEEEEecCCccccccccccccCCCCcEEEEEcCCC Q lcl|NC_015271. 321 FELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSGENIILSRTAKYFNFYPASIATLSDDDPIDVAVSTNR 400 (795) Q Consensus 321 ~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~~ 400 (795) |+++..+|++|.+||+++||+|+|+|++|++|+||||||+|+++++|||||+||||||+++|++++.|||||++++++++ T Consensus 320 ~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~ 399 (794) T protein:vir:10 320 FDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLSGENIILSRTAKYFNFYPASIANLSNDDPIDVAVSTNR 399 (794) T ss_pred EEeeecccccccccccccCccCcccCCCccEEEEEcceEEEeeCCeEEEEecCCcccccccccccCCCCccEEEEecCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCeeEEEEEEeeccc Q lcl|NC_015271. 401 IAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDV 480 (795) Q Consensus 401 ~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~~~v~~~~~~~~~ 480 (795) +|+|+|+++++++|+|||+++||+|+++++|||+|++++++|+|+|+++|+|+.+|++++|++++|+|++++||++|+++ T Consensus 400 ~~~i~~~v~~~~~L~i~T~~~q~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~ 479 (794) T protein:vir:10 400 IAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSVELNLTTQFDVQDRARPYGIGRNVYFASPRSSYTSIHRYYAVQDV 479 (794) T ss_pred ceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEeecccCCCCceEeCCeEEEEecCCCeeEEEEEeeeccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecCCCeEEEEEEEeCC Q lcl|NC_015271. 481 SSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGSNVQVLACQCINS 560 (795) Q Consensus 481 ~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~g~~~~~~~~~~~d 560 (795) +|+|+++|||+|++|||+++++.+++++++|.+++|+++++|+|++|+||++++||+|+|||||+|+|.++++|+.+.+| T Consensus 480 ~d~y~a~Dlt~~~~hl~~~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d 559 (794) T protein:vir:10 480 SSVKNSEDITSHVPNYIPNGVFSICGSGTENFCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISS 559 (794) T ss_pred cCceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceEEEeEEEEEcCCcEEEEEEEecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999988899 Q ss_pred EEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeecccccCCcccCceEEEEecCCc Q lcl|NC_015271. 561 DMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIYGADFAKGKITVLEADGK 640 (795) Q Consensus 561 ~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~~g~~v~~~adg~ 640 (795) ++|++|+|+++++++||.+.++..++.++++++||||++++..++++|+++++.+.+++++++|++|+||+++.+.+||. T Consensus 560 ~l~~iv~r~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~g~~~~eg~~v~~~adg~ 639 (794) T protein:vir:10 560 DMYVILRNEFNTFLARISFTKNAIDLQGEPYRAFMDMKIRYTIPSGTYNDDTFTTSIHIPTIYGANFGRGKITVLEPDGK 639 (794) T ss_pred eEEEEEEeCCCEEEEEEEEeecCCCCCCccceeeeecceEEEecCcccccccccceEEcccccCcccccccEEEEecCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEEEEEEeeccceEEEE Q lcl|NC_015271. 641 ITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRRAWVNYEDSGTFDIY 720 (795) Q Consensus 641 ~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~~~t~~~~v~ 720 (795) ......+..++++.++++|++++++++|+|||+|+++++++||++++++|++++..+.+||+||+|++++|.+||+|.+. T Consensus 640 ~~~~~~~~~~~~g~~~l~i~~~~~a~~v~vGl~y~s~~~~~~~~i~~~~~~~~~~~~~~gr~~l~r~~~~~~~tg~~~v~ 719 (794) T protein:vir:10 640 ITVFEQPTSGWQSDPWLRLSGNLEGREVFIGFNINFVYEFSKFLIKQTTDDGSTSTEDIGRLQLRRAWVNYEDSGTFDIY 719 (794) T ss_pred eeeeeeeeeeeecceEEEecCCCCCceEEEeeeeeEEEEecceEEEccCCCcceeeeccccEEEEEEEEEeeccccEEEE Confidence 99998999999899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCcccccccccccccccccccccccccccceEEEEeeecccceEEEEEECCCCCEEEEEEEEEEEEeccccCC Q lcl|NC_015271. 721 VENQSSNWKYTMAGARLGAHVMRTGKLNLGTGQYRFPVVGNAKFNTVFILSDATTPLNIIGCGWEGNYLRRSSGI 795 (795) Q Consensus 721 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r~rrv 795 (795) |+++++++.+.+.+.+++++.+.++++|+.+|++++|+++|+++.+|+|+|++|+||+|+||+|||+||+|+||| T Consensus 720 v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~eg~y~~r~~~v 794 (794) T protein:vir:10 720 VENQSSNWKYTMAGARLGSNTLRAGRLNLGTGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEGNYLRRSSGI 794 (794) T ss_pred EcCCccccceeeccceeccccccccccccccceEEEEecccCceEEEEEEECCCCceEEEEEEEEEEEeccccCC Confidence 999999888889999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:2203 Length: 794 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_042000;swissprot:sw:p03747;genbank:gi:9627472;uniprot:P03747;genbank:GeneID:1261024 Probab=100.00 E-value=7.6e-257 Score=1424.73 Aligned_cols=794 Identities=85% Similarity=1.357 Sum_probs=774.5 Q ss_pred CCceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCceEEEE Q lcl|NC_015271. 1 MALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESEQYYAV 80 (795) Q Consensus 1 M~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (795) ||+|+|+||||++|||||+|++||++||++|+||+|+|+|||+||||++||++|+++..+....+|++|+|+++|+|+|+ T Consensus 1 M~~i~~s~~n~~~GvSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~~~~~~~~~y~l~ 80 (794) T protein:vir:22 1 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTLGDNGALGQAPYIHLINRDEHEQYYAV 80 (794) T ss_pred CceeeeecchhhcccccCCchHHhhhHHhhhhcceeeccCCceeCCchHhhhhhcccCCCCCccEEEEEEeCCCcEEEEE Confidence 99999999999999999999999999999999999999999999999999999998777777889999999999999999 Q ss_pred EeCCeEEEEecCCcEEEEEECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEeccc Q lcl|NC_015271. 81 FTGTGIRVFDLAGNERQVRYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINVRGG 160 (795) Q Consensus 81 ~~~~~~rv~~~~g~~~~v~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v~~~ 160 (795) |++++||||+++|.++.+... +..+|+.++++.++|+|+|+||+|||+|++++|+++.|+.+..+|.+.+++|++++++ T Consensus 81 ~~~~~irv~~~~G~~~~v~~~-~~~~y~~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~~~g~v~v~~g 159 (794) T protein:vir:22 81 FTGSGIRVFDLSGNEKQVRYP-NGSNYIKTANPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGG 159 (794) T ss_pred EcCCeEEEEecCCcEEEeecC-CCccceecCCCcccEEEEEEcCEEEEEcCCeeeeEeeccccCCCCCCCceEEEEccCC Confidence 999999999999998887654 5689999999999999999999999999999999999999999999999999999999 Q ss_pred ccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhcccccCceeeeecCceEEEEecCCcceeeEEEecCc Q lcl|NC_015271. 161 QYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQCRVSAPGWTFNVGQGYIHIIAPEGQQIDSLTTKDGY 240 (795) Q Consensus 161 ~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~~~~~~~~dg~ 240 (795) +|+++|++++++...+++++|++++.......+++|++.+|..++....++|++..++++++|++++++.+.+++++|++ T Consensus 160 ~y~~ty~v~I~~~~~a~~~~p~gt~~~~~~~~~~~~ia~~L~~~l~~~~~~~t~~~~~~~~~i~a~~~~~~~~~t~~~g~ 239 (794) T protein:vir:22 160 QYGRELIVHINGKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRTNLSDWTVNVGQGFIHVTAPSGQQIDSFTTKDGY 239 (794) T ss_pred ccceeEEEEeccCcceEEEEcCCCccccceeechhhhhhhhhhhheeccccceEEeCCceEEEEEcCCceEEEEeeeccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeEEEEeecCce Q lcl|NC_015271. 241 ADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALVRAADGN 320 (795) Q Consensus 241 ~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~~~v~~~~~t 320 (795) +++++.++.++++++++||+.|++|++++|.+++++..++||++|+..++.|+||++++...+++.+|||+.++++++++ T Consensus 240 ~~t~~~~~~~~~~~~~~lp~~~~~G~~v~i~~~~~~~~~~Y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~lv~~~~~~ 319 (794) T protein:vir:22 240 ADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWTETLGWNTEDQVLWETMPHALVRAADGN 319 (794) T ss_pred CcceeEEEEeccccceeccccCCCCeEEEEEeCCCCCcceeEEEEeccceEEEEeeeccceeeecccceeeEeeeccCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeecccCCccccCCcccccccccccCCCccEEEEEcceEEEecCCeEEEEecCCccccccccccccCCCCcEEEEEcCCC Q lcl|NC_015271. 321 FELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSGENIILSRTAKYFNFYPASIATLSDDDPIDVAVSTNR 400 (795) Q Consensus 321 ~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~~ 400 (795) |++++.+|++|.+||+++||+|||+|++|++|+||||||+|+++++|||||+||||||+++|++++.|||||++++++++ T Consensus 320 ~~~~~~~w~~r~~Gd~~tnp~psf~g~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~ss~~ 399 (794) T protein:vir:22 320 FDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLSGENIILSRTAKYFNFYPASIANLSDDDPIDVAVSTNR 399 (794) T ss_pred EEEeeccccccccCccccCCcceecCCCcceEEEEcceEEEecCCeEEEEccCCccccccccCcCCCCCccEEEEecCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCeeEEEEEEeeccc Q lcl|NC_015271. 401 IAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDV 480 (795) Q Consensus 401 ~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~~~v~~~~~~~~~ 480 (795) +|+|+|+++++++|+|||+++||+|+++++|||+|++++++|+|+|+++|+|+.+|++++|+|++|+|++++|+++++++ T Consensus 400 ~~~i~~~v~~~~~L~i~t~~~e~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~ 479 (794) T protein:vir:22 400 IAILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDV 479 (794) T ss_pred ceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEEeeccCCCCceEeCCeEEEEecCCCeeEEEEeEeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecCCCeEEEEEEEeCC Q lcl|NC_015271. 481 SSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGSNVQVLACQCINS 560 (795) Q Consensus 481 ~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~g~~~~~~~~~~~d 560 (795) +|+|+++|||+|++|||+|+++.+++++++|.+++|+++++|+|++|+||++++||+|+|||||+|+|.++++|+.+.+| T Consensus 480 ~d~y~~~Dlt~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d 559 (794) T protein:vir:22 480 SSVKNAEDITSHVPNYIPNGVFSICGSGTENFCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVLACQSISS 559 (794) T ss_pred cCceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceeEEeeEEEEcCCCEEEEEEEecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999988899 Q ss_pred EEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeecccccCCcccCceEEEEecCCc Q lcl|NC_015271. 561 DMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIYGADFAKGKITVLEADGK 640 (795) Q Consensus 561 ~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~~g~~v~~~adg~ 640 (795) +||++|+|+++++++||.+.++..++.++++++||||+.++++++++|+++++.+.+.+++++|++|++|+++.+.+||. T Consensus 560 ~l~~iv~r~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~g~~~~~~~~t~~~~~~~~g~~~~~g~~v~~~~dg~ 639 (794) T protein:vir:22 560 DMYVILRNEFNTFLARISFTKNAIDLQGEPYRAFMDMKIRYTIPSGTYNDDTFTTSIHIPTIYGANFGRGKITVLEPDGK 639 (794) T ss_pred EEEEEEEeCCCEEEEEEEEeeccccCCCccceeeeeeeEEEeeccceeecCCcceEEEcccccCcccccceEEEEEcCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEEEEEEeeccceEEEE Q lcl|NC_015271. 641 ITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRRAWVNYEDSGTFDIY 720 (795) Q Consensus 641 ~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~~~t~~~~v~ 720 (795) +.....++++++..++++|++++++++|||||+|+++++++||++++++|+++......||+||+|++++|.+||+|.++ T Consensus 640 ~~~~~~~~~~~~~~~~~~v~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~~~~~~~~~~~grl~l~r~~~~~~~tg~~~v~ 719 (794) T protein:vir:22 640 ITVFEQPTAGWNSDPWLRLSGNLEGRMVYIGFNINFVYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWVNYENSGTFDIY 719 (794) T ss_pred eeeceeeeeeeeccceEEeCCCCCCcEEEEeeeeeEEEEecceEEEecCCCccceeeecceEEEEEEEEEeccccceEEE Confidence 99999999999999999999999999999999999999999999999999999888889999999999999999999999 Q ss_pred ecCCcccccccccccccccccccccccccccceEEEEeeecccceEEEEEECCCCCEEEEEEEEEEEEeccccCC Q lcl|NC_015271. 721 VENQSSNWKYTMAGARLGAHVMRTGKLNLGTGQYRFPVVGNAKFNTVFILSDATTPLNIIGCGWEGNYLRRSSGI 795 (795) Q Consensus 721 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r~rrv 795 (795) |+++++++.+++++.+++++.+..+.+++.++.+++|+++|+++.+|+|+|++|+||+|++|+|||+||+|+||| T Consensus 720 v~~~~~~~~~~~~~~~~g~~~~~~g~~~~~tg~~~vp~~~~~~~~~v~i~~d~p~P~tvlsi~~eg~y~~r~~~v 794 (794) T protein:vir:22 720 VENQSSNWKYTMAGARLGSNTLRAGRLNLGTGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEGNYLRRSSGI 794 (794) T ss_pred EcCCCcccceeecCceecccccccCcccccCceEEEEecccCceEEEEEEECCCCCEEEEEEeEEEEEeccccCC Confidence 999999888889999999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:99677 Length: 794 # NCBI annotation: Tail tubular protein B # Family: family:all:825 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249591;genbank:gi:68299742;genbank:GeneID:3799992 Probab=100.00 E-value=6.6e-252 Score=1397.62 Aligned_cols=792 Identities=52% Similarity=0.922 Sum_probs=759.4 Q ss_pred CCceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCceEEEE Q lcl|NC_015271. 1 MALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESEQYYAV 80 (795) Q Consensus 1 M~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (795) ||+|+|+||||++|||||+|++||++||++|+||+|+|+||++||||++||++++++..+.++++||+|+|+++|+|+|+ T Consensus 1 M~~i~~s~~n~~~GvS~q~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (794) T protein:vir:99 1 MALISQSIKNLKGGISQQPDILRYSDQGSKQINGFSSEVEGLQKRPPSVHIKRLTDQFGLGQKPYCHIINRDEVERYAVF 80 (794) T ss_pred CceeeeecchhhcceecCCchHHhhhhHhhhhcceeeeccCcccCCccceeeeecCCCCCccccEEEEEEeCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999999888888999999999999999999 Q ss_pred EeCCeEEEEec-CCcEEEEEECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEecc Q lcl|NC_015271. 81 FTGTGIRVFDL-AGNERQVRYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINVRG 159 (795) Q Consensus 81 ~~~~~~rv~~~-~g~~~~v~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v~~ 159 (795) |++++||||++ +|.+++|.. +...+|+.+++++++|+|+|+||+|||+|++++|+++.+...........++|++++. T Consensus 81 f~~~~irv~~~~~g~~~~v~~-~~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~~~v~~ 159 (794) T protein:vir:99 81 FTGSNIRVFDLFTGDEKTVNA-PNGLSYVSSSNPRKDLRMVTVADYTFILNRNVATAQGTTNTPSGLAPFGHFGLVVIRG 159 (794) T ss_pred EcCCeEEEEECCCCeEEEeec-cccccccccCCccceeeEEEEccEEEEEcCCeeeeEeeeeccccCcCCCceEEEEecc Confidence 99999999997 577777755 4568899999999999999999999999999999999876666667788899999999 Q ss_pred cccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhcccccCceeeeecCceEEEEecCCcceeeEEEecC Q lcl|NC_015271. 160 GQYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQCRVSAPGWTFNVGQGYIHIIAPEGQQIDSLTTKDG 239 (795) Q Consensus 160 ~~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~~~~~~~~dg 239 (795) ++|+++|++++++..++++++|++++.....+.+.+|++.+|...+. ..+|++...++++++.++++..+.+++++++ T Consensus 160 g~y~~~y~v~i~gs~ta~~~tp~~~~~~~~~~~s~~~ia~~l~~~l~--~~g~~v~~~~g~~~i~~~~~~~v~t~s~~~g 237 (794) T protein:vir:99 160 GQYGRTYRIKVNGSVEASFETPLGDQVAHAKQIDIAYIIDQLAAGLI--NKGWAVTKGSGYFYFSKSGSVIINSLEVEDG 237 (794) T ss_pred CCCCceEEEEecCCcccceeeccCcccccccccchhhhhhhhHhhhh--cccceEEeCCeEEEEEecCCceeEEEEeecC Confidence 99999999999999999999999999999999999999999987764 4678999999999999999999999999999 Q ss_pred cCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeEEEEeecCc Q lcl|NC_015271. 240 YADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALVRAADG 319 (795) Q Consensus 240 ~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~~~v~~~~~ 319 (795) +++++++.+++.|+++++||+.|++|++|+|.+..++++++||++|+...++|+||++++..++++.+|||+.+++++++ T Consensus 238 ~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~~~~~~~~y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~~v~~~~~ 317 (794) T protein:vir:99 238 YNGQLAWGIINDVQKTTQLPVYAPNNYIIRVSGDPTLNQDDYYVRFDASRNVWTECPAPNIKADYNKATMPHVLIREADG 317 (794) T ss_pred CCCceeeEEeeeccceeecccCCCCCeEEEEeccCCCCCCceEEEEEcCCceEEeeccceeecceeccceEEEEeccCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeecccCCccccCCcccccccccccCCCccEEEEEcceEEEecCCeEEEEecCCccccccccccccCCCCcEEEEEcCC Q lcl|NC_015271. 320 NFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSGENIILSRTAKYFNFYPASIATLSDDDPIDVAVSTN 399 (795) Q Consensus 320 t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~ 399 (795) +|++++.+|++|.+||+++||+|||+|++|++|+||||||+|+++++|||||+||||||+++|++++.|||||+++++++ T Consensus 318 ~~~~~~~~w~~r~~Gd~~tnp~psf~g~~is~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~~~~ 397 (794) T protein:vir:99 318 TFTFKQADWTHRAAGDDETNPYPSFIGNSINDIFFFRNRLGFLSGENVILSGSGNYFNFFPESVAVLTDTDPIDVAVSTN 397 (794) T ss_pred ceeEeeccccccccCCcccCCCccccCcceeEEEEEeeeEEEecCCeEEEEecCCccccccccccCCCCCccEEEEecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCeeEEEEEEeecc Q lcl|NC_015271. 400 RIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQD 479 (795) Q Consensus 400 ~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~~~v~~~~~~~~ 479 (795) ++|+|+|+++++++|+|||+++||+|+|+++|||+|++++++|+|+|+++|+|+.+|++++|+|++|+|++++|+|+|++ T Consensus 398 ~~~~i~~~v~~~~~L~l~t~~~q~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~v~r~~~~~~ 477 (794) T protein:vir:99 398 RISILKYAVPFSEELILWSDQAQFVLSSDGGLTPTTIRLDLTTEFEVTEQARPYGIGRGVYFVSPRAKFSSVRRFYAVQD 477 (794) T ss_pred cceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEEeeccCCCCceEeCCeEEEEecCCCeeEEEEeeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecCCCeEEEEEEEeC Q lcl|NC_015271. 480 VSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGSNVQVLACQCIN 559 (795) Q Consensus 480 ~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~g~~~~~~~~~~~ 559 (795) ++|+|+++|||+|++|||+|+++++++++++|.+++|++++||+|++||||++++||+|+|||||+|+|+++++|+.+.+ T Consensus 478 ~~d~y~a~Dlt~~~~hl~~~~~~~~~a~~~~~~~~v~~~~~~g~l~~~~y~~~~~eq~v~aW~~~~~~g~~~~~~~~~~~ 557 (794) T protein:vir:99 478 VTQVKNAEDISAHVPYYVENGVFKMSGSSTENFLTILTEGNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRVLCCDMIG 557 (794) T ss_pred ccCceehhhHHHHHHHhcCCCeEEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceEEEeEEEEEcCCCeEEEEEEEcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeecccccCCcccCceEEEEecCC Q lcl|NC_015271. 560 SDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIYGADFAKGKITVLEADG 639 (795) Q Consensus 560 d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~~g~~v~~~adg 639 (795) |+||++|+|+++.++|||.+.++..+...++++++|||+++++.+.++++...+.+.++++.++|++|++|+++.+.+|| T Consensus 558 d~l~~~v~r~~~~~ler~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~g~~v~~~~dg 637 (794) T protein:vir:99 558 AVMHLIIDSPSGVLMEKIEFTQNTKDYPDEPYRLYVDRKIEYTFPEGSYNDDDFKTRVKLKDIYGSTPANGQYVFISLGG 637 (794) T ss_pred CEEEEEEEeCCCEEEEEEEeeeCCCCCCCcccceeeeeeeeeeecccccccCcceeEEeccccccccccCCceEEEEeCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccceeeeccC-CCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEEEEEEeeccceEE Q lcl|NC_015271. 640 KITEFEEPEVGWK-NDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRRAWVNYEDSGTFD 718 (795) Q Consensus 640 ~~~~~~~~~~g~~-~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~~~t~~~~ 718 (795) .......+..++. ....+++++++++++|+|||+|+++++++||++++++++++..+..+|||||||++|++.+|++|+ T Consensus 638 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~vGl~y~s~~~~~~~~~~~~~~~g~~~~~~~gr~~l~r~~~~~~~tg~~~ 717 (794) T protein:vir:99 638 VTFTFDPPAGGWQANDGLIEFDGDLRGTKFFVGEAYTFLYEFSKFLIKTTDTADGVATEDIGRLQLRRAWVNYDKSGNFR 717 (794) T ss_pred ceeeeecccceEecCccEEEecCCCCCcEEEEeeeeeEEEeecceEEeecCCCCceeeeccceEEEEEEEEEeecccceE Confidence 9887766655543 345789999999999999999999999999999999999999888999999999999999999999 Q ss_pred EEecCCcccccccccccccccccccccccccccceEEEEeeecccceEEEEEECCCCCEEEEEEEEEEEEeccccCC Q lcl|NC_015271. 719 IYVENQSSNWKYTMAGARLGAHVMRTGKLNLGTGQYRFPVVGNAKFNTVFILSDATTPLNIIGCGWEGNYLRRSSGI 795 (795) Q Consensus 719 v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r~rrv 795 (795) +.++++.+++.+.+.+.+++++.+..+.+|+.+|++++|+.+|+++.+|+|+|++|+||+|+||+|||+||+|+||| T Consensus 718 v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~e~~y~~r~~~v 794 (794) T protein:vir:99 718 VEVNNQGRTFTYNMTGNRLSTNELILGDESLDTGQFRYAVSGNATQVTVSLISDTPNPLSIIGGGWEGYYVRRSSGI 794 (794) T ss_pred EEECCCccceeeeccccccccccccccccccccceEEEEecccccceEEEEEECCCCCEEEEEEEEEEEEeccccCC Confidence 99999998887788999999999999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:1543 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052111;swissprot:trembl:q9t105;genbank:gi:9634037;uniprot:Q9T105;genbank:GeneID:1262408 Probab=100.00 E-value=9.5e-250 Score=1385.81 Aligned_cols=792 Identities=68% Similarity=1.144 Sum_probs=759.8 Q ss_pred CCceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCceEEEE Q lcl|NC_015271. 1 MALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESEQYYAV 80 (795) Q Consensus 1 M~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (795) ||+|+|+||||++|||||+|++||++||++|+||+|+|+|||+||||++||+++++++...+++++|+|+|+++|+|+|+ T Consensus 1 M~~i~~s~~n~~~GvSqq~d~~r~~~q~~~~~N~~~~~~gGl~rRpGt~~va~~~~~~~~~~~~~~~~~~~~~~e~y~l~ 80 (801) T protein:vir:15 1 MALISQSIKNLKGGISQQPDILRFAEQGSVQINGWSSESEGLQKRPPMIHLKTLGPAGYVGAQPYVHLINRDEFEQYFVV 80 (801) T ss_pred CceeeeecchhhcceecCcchHhhhhhHhhhhcceeccccCcccCCchheeeeecCCCCcccceeEEEEEeCCceEEEEE Confidence 99999999999999999999999999999999999999999999999999999998888888899999999999999999 Q ss_pred EeCCeEEEEecCCcEEEEEECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEeccc Q lcl|NC_015271. 81 FTGTGIRVFDLAGNERQVRYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINVRGG 160 (795) Q Consensus 81 ~~~~~~rv~~~~g~~~~v~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v~~~ 160 (795) |++++||||+++|.++.+.. ..+|+.+++++++|+++|+||+|||+|++++|+++.|..+...++...++|++++++ T Consensus 81 ~~~~~irv~~~~G~~~~v~~---~~~y~~~~~~~~~l~~~~~aD~~fi~nr~~~~~~~~~~~~~~~~~~~~~alv~v~~~ 157 (801) T protein:vir:15 81 FTGEDIKVFDLDGKEYQVRG---DRSYVRTANPREDLRMITVADYTFVTNRKVVVQSNDQSVNLPGFKDQGDALINVRGG 157 (801) T ss_pred EcCCeEEEEccCCcEEEEec---CCccccccCchhheeEEEEcCEEEEeeCCeeeecccCccccCccCCCCceEEEeeec Confidence 99999999999999887754 457998899999999999999999999999999999987777788888999999999 Q ss_pred ccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhcc---------cccCceeeeecCceEEEEecCCcce Q lcl|NC_015271. 161 QYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQCR---------VSAPGWTFNVGQGYIHIIAPEGQQI 231 (795) Q Consensus 161 ~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~---------s~~~g~t~~~~g~~~~i~~~~~~~~ 231 (795) +|+++|+++++|...+.+++|.++.+......+..+++..|..++. ...++|++...++++++.++++... T Consensus 158 ~yg~t~~I~i~gs~~~~~t~~~gs~~~~~~~~s~~~ia~~l~~~~~~~~p~~~~~~~~~~w~~~~~~g~~~i~a~~~~~~ 237 (801) T protein:vir:15 158 QYGRRLSIEFNGAERAAVQLPDGSQPAHVNEVDGQAIAEKLAAQLRNNLGNPNNDQDPNKWRFNVGPGFIHILAPNNDNV 237 (801) T ss_pred cCceeEEEEeCCcceEEEEeccCcccchhhhcceeechHHHhhhhhhccCccceeccCccEEEEecCcEEEEeCCCCccc Confidence 9999999999999999999999988888888887777777665443 3556799999999999999998888 Q ss_pred eeEEEecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeE Q lcl|NC_015271. 232 DSLTTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPH 311 (795) Q Consensus 232 ~~~~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~ 311 (795) ..++++|++.++++.++.+.++++++||.++++|++|+|.++.+++.++||++|+...++|+|+++++..++++.++||| T Consensus 238 ~~~~t~dg~~~~~~~~~~~~v~~~~~lp~~~~~G~~v~v~~~~~~~~~~y~v~~~~~~~~w~E~a~~g~~~~~~~~tmp~ 317 (801) T protein:vir:15 238 WGLQTKDGYADQLINPVTHYTQSFQKLPINAPDGYIVKIVGDTSKTADQYYVRFDLNRKVWVETIGWNTRTHLYYHTMPW 317 (801) T ss_pred ceeeeccccCceeeeEEeecccceeeeeeecCCCcEEEEEecCCCccceEEEEEEcCCeeEEeecccccceeeeccccce Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeecCceeeecccCCccccCCcccccccccccCCCccEEEEEcceEEEecCCeEEEEecCCccccccccccccCCCCc Q lcl|NC_015271. 312 ALVRAADGNFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSGENIILSRTAKYFNFYPASIATLSDDDP 391 (795) Q Consensus 312 ~~v~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~ 391 (795) .+++.++++|+++.++|++|.+||+++||+|+|.|++|++|+||||||+|+++++|||||+||||||+++|++++.|||| T Consensus 318 ~lv~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~ 397 (801) T protein:vir:15 318 ALVRASDGNFDFKVLEWGARTVGDDTTNPYPSFTGQTINDIFFFRNRLGFLSGENIILSRTSKYFNFFPASVSNYSDDDP 397 (801) T ss_pred EEEeeccceEEEeccccccccCCccccCCcccccCCCceEEEEEcceEEEeeCCeEEEEecCCccccccccccCCCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCeeEE Q lcl|NC_015271. 392 IDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSI 471 (795) Q Consensus 392 i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~~~v 471 (795) |+++++++++|+|+|+++++++|+|||+++||+|+|+++|||+|++++++|+|+|+++|+|+.+|++++|+|++|+|+++ T Consensus 398 i~~~~~~~~~~~i~~~v~~~~~L~i~t~~~q~~ls~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~ 477 (801) T protein:vir:15 398 IDVAVSHNRVSTLKYAVPFSEELLLWSDQAQFVLTASGILSSRSVELNLTTQFDVQDRARPHGVGRNVYFASPRASFTSI 477 (801) T ss_pred EEEEecCCcceeeEEEeecCCcEEEEecCcEEEEcCCCcccceeEEEEEEEeeeccCCCCceEeCCeEEEEecCCCeeEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecCCCeE Q lcl|NC_015271. 472 HRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGSNVQ 551 (795) Q Consensus 472 ~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~g~~~ 551 (795) +|+++|++++|+|+++|||+|++|||+++++++++++++|.+++|+++++|+|++||||++++||+|+|||||+|+|+++ T Consensus 478 ~r~~~~~~~~d~y~a~Dlt~~~~hl~~~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~ 557 (801) T protein:vir:15 478 NRYYAVQDVSSVKNAEDMTAHVPNYIPNGVFSISGTTAENFAAILTSGAPNRVYIYKFLYIDEEIRQQSWSHWDFGDNVT 557 (801) T ss_pred EEEEeecccccceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCEEEEEEEecCCCceEEEeeEEEEcCCCEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEeCCEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeecccccCCcccCce Q lcl|NC_015271. 552 VLACQCINSDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIYGADFAKGK 631 (795) Q Consensus 552 ~~~~~~~~d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~~g~ 631 (795) ++|+.+.+|+||++|+|+++.+++||.+..++.+...+++++||||+.++.+++++++.....++.+.+.++|++|++|+ T Consensus 558 ~~~~~~~~d~l~~~v~r~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~t~~~~~~~~~~~~~~~~gl~~l~g~ 637 (801) T protein:vir:15 558 VFAAQVINSTMTVLMGNEHAVWMGRLHFTKNSIDIPGEPYRLYIDAKRKYTIPAGTYNDDTYQTSISLATIYGMNFTKGR 637 (801) T ss_pred EEEEEecCCEEEEEEEecCcEEEEEEEEccccccCCCcceeeeeeeeeeEeeccceeccCceecccccccccccccccce Confidence 99998889999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEEEEEEe Q lcl|NC_015271. 632 ITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRRAWVNY 711 (795) Q Consensus 632 ~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~ 711 (795) ++.+++||.+++...+..|+...+++++++++++++|+|||+|+++++++||+++.++|++......+|||||||++|++ T Consensus 638 ~v~v~~dG~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~~~~~~~~~~~~rl~l~r~~~~~ 717 (801) T protein:vir:15 638 VSVVFPDGKIIEVDQPINGWSSDPVLRLDGNQEGQVVYIGFNIPFTYTFSKFLIKKTAEDGSTATEDIGRLQLRRAWVNY 717 (801) T ss_pred EEEEEeCCceeeeeeecCcccCcceEEEcCCCCCcEEEEeeeeeEEEEecceEEeccCCCCCceeeeeccEEEEEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999988888889999999999999 Q ss_pred eccceEEEEecCCcccccccccccccccccccccccccccceEEEEeeecccceEEEEEECCCCCEEEEEEEEEEEEecc Q lcl|NC_015271. 712 EDSGTFDIYVENQSSNWKYTMAGARLGAHVMRTGKLNLGTGQYRFPVVGNAKFNTVFILSDATTPLNIIGCGWEGNYLRR 791 (795) Q Consensus 712 ~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r 791 (795) .+||.|.+.|+++.++..+...+.+++++.+..+.+++.++.+++|+.+|+++.+|+|+|++|+||+|+||+|||+||+| T Consensus 718 ~~tg~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~e~~y~~r 797 (801) T protein:vir:15 718 EDSGAFTIRVNNLSREFIYTMAGARLGSDNLRVGRSNIGTGQYRFPVVGNAQTNLVTIESDASTPLNIIGCGWEGNYLRR 797 (801) T ss_pred ccCcceEEEECCcccccceeecCcccccccccccccccccceEEEEEeecCceEEEEEEECCCCcEEEEEEEEEEEEecc Confidence 99999999999998887788999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCC Q lcl|NC_015271. 792 SSGI 795 (795) Q Consensus 792 ~rrv 795 (795) +||| T Consensus 798 ~~~~ 801 (801) T protein:vir:15 798 SSGI 801 (801) T ss_pred ccCC Confidence 9999 No 5 >protein:vir:94583 Length: 792 # NCBI annotation: Tubular tail protein B # Family: family:all:825 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919014;genbank:gi:119637778;genbank:GeneID:5179343 Probab=100.00 E-value=2.8e-248 Score=1377.77 Aligned_cols=789 Identities=63% Similarity=1.098 Sum_probs=749.2 Q ss_pred CCceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCceEEEE Q lcl|NC_015271. 1 MALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESEQYYAV 80 (795) Q Consensus 1 M~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (795) ||+|+|+||||++|||||+|++||++||++|+||+|+|+||++||||++||+.+++++.+..+++|++|+||++|+|+|+ T Consensus 1 M~~i~~s~~n~~~GiSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~~~~~~~q~y~l~ 80 (792) T protein:vir:94 1 MALISQSVKNLKGGISQQPNILRFPEQGSEQINGWSSETEGLQKRPPFVFTKTIGDQNALGAKPLVHLINRDSAEQYYVV 80 (792) T ss_pred CcceeeecchhhcceecCcchHHhhhhhhhhhcceeeeccccccCChhHHHHhhhcCCCCCcccEEEEEEeCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999998888888999999999999999999 Q ss_pred EeCCeEEEEecCCcEEEEEECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEeccc Q lcl|NC_015271. 81 FTGTGIRVFDLAGNERQVRYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINVRGG 160 (795) Q Consensus 81 ~~~~~~rv~~~~g~~~~v~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v~~~ 160 (795) |++++||||+.+|.++.+ .+..||+.+++++++|+|+|+||+|||+|++++|+++.|.....+ +.++++++++++ T Consensus 81 f~~~~~rv~~~~g~~~~~---~~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~~~~~~~~~~~~~--~~~~~~v~i~~g 155 (792) T protein:vir:94 81 FTGQGVRVFDLNGKEYDV---KGDLSYVKVENPRDDLRMVTVADYTFIVNRNMVVRPDTTPLYTLK--ENGDCLINIRGG 155 (792) T ss_pred EcCCeEEEEecCCceEEe---cccCceeeecCCcceeEEEEEcCEEEEEeCCccceeEecCcCCCC--CCceEEEEccCC Confidence 999999999999987755 445799999999999999999999999999999999998766543 456899999999 Q ss_pred ccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhc--ccccCceeeeecCceEEEEecCCcceeeEEEec Q lcl|NC_015271. 161 QYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQC--RVSAPGWTFNVGQGYIHIIAPEGQQIDSLTTKD 238 (795) Q Consensus 161 ~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~--~s~~~g~t~~~~g~~~~i~~~~~~~~~~~~~~d 238 (795) +|+++|++++++. .+.+++|.++.+......+++|++.+|.... +....+|++...+++++|+++++..+.+++++| T Consensus 156 ~y~~~y~i~i~~~-~~~~~~~~~t~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 234 (792) T protein:vir:94 156 MYGRTLAFTINNT-KIAYEIAHGDAPEHSKQTDAQWLVKKLAGLARLNVAFKGWTFTEGPGYIHVIAPSNSQINSLSTED 234 (792) T ss_pred CcceeEEEEecCc-eeeeeeecCcccceecccchhhhhhhhhhhccccccccccEEEECCeEEEEEecCCceeeeeeccc Confidence 9999999999986 4678889989999999999999999987754 445568999999999999999999999999999 Q ss_pred CcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeEEEEeecC Q lcl|NC_015271. 239 GYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALVRAAD 318 (795) Q Consensus 239 g~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~~~v~~~~ 318 (795) |+.+++++++++.++++++||+.+++|++|+|.+..+++.+.||++|+...++|+||++++..++++..+||+.++++++ T Consensus 235 g~~~~~~~~~~~~v~~~~~lp~~~~~G~~v~i~~~~~~~~d~y~v~~~~~~~~w~E~~~~~~~~~~~~~tmp~~lv~~~~ 314 (792) T protein:vir:94 235 GYADQLMNAVMHTSQSFSRLPVEAPNGYTVKIVGDTSKTSDMFYVQYDNMKKVWKEVAGWGVQKGLNGGTMPHALVRQAD 314 (792) T ss_pred CcCcceeeeeeecccccccccccCCCCcEEEEEccCCCCccceEEEEEcCCceEEEecccceeeeecccccCeeEEEcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeecccCCccccCCcccccccccccCCCccEEEEEcceEEEecCCeEEEEecCCccccccccccccCCCCcEEEEEcC Q lcl|NC_015271. 319 GNFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSGENIILSRTAKYFNFYPASIATLSDDDPIDVAVST 398 (795) Q Consensus 319 ~t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~ 398 (795) ++|++...+|++|.+||+++||.|+|.|++|++|+||||||+|+|+++|||||+||||||+++|++++.|||||++++++ T Consensus 315 ~~~~~~~~~w~~r~~gd~~tnp~psf~g~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss 394 (792) T protein:vir:94 315 GSFQMQVLPWTQRTCGDMDTNPTPSIVDQKINDVFFFRNRLGFLAGENIVMSRTSKYFSLFPASVANLSDDDPIDVAVSH 394 (792) T ss_pred CcEEEEeccccccccCccccCccceeccCCcceEEEEcceEEEecCCeEEEEccCCcccCccccccCCCCCccEEEEecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCeeEEEEEEeec Q lcl|NC_015271. 399 NRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQ 478 (795) Q Consensus 399 ~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~~~v~~~~~~~ 478 (795) +++|+|+|+++++++|+|||+++||+|+|+++|||+|++++++|+|+|+++|+|+.+|++++|++++|+|++++|+++|+ T Consensus 395 ~~~~~i~~~v~~~~~L~l~T~~~q~~l~~~~~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~~~~v~r~~~~~ 474 (792) T protein:vir:94 395 NRISILKYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTEFDVSDRARPFGVGRGVYFASPRASYTSLNRYYAVQ 474 (792) T ss_pred CcceeeeEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEEeeccCCCCceEeCCeEEEeecCCCeeEEEeeeeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecCCCeEEEEEEEe Q lcl|NC_015271. 479 DVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGSNVQVLACQCI 558 (795) Q Consensus 479 ~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~g~~~~~~~~~~ 558 (795) +++|+|+++|||+|++|||+|+++++++++++|.+++|+++++|+|++||||++++||+|+|||||+|+|+++++|+.+. T Consensus 475 ~~~d~y~a~DlT~~~~hl~~~~v~~~~a~~~~~~~vv~~~~~~g~l~~~ty~~~~~e~~v~aW~~~~~~g~~~~~~~~~~ 554 (792) T protein:vir:94 475 DVSSVKSAEDMSAHVPNYIPNGVFSIRGSSTENFISVLSSNAPSRIFLYKFLYLNEEIAQQSWSHWELGSNVTVLACDSI 554 (792) T ss_pred cccCceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCeEEEEEEeecCCceEEEeEEEEEcCCcEEEEEEeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999889 Q ss_pred CCEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeecccccCCcccCceEEEEecC Q lcl|NC_015271. 559 NSDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIYGADFAKGKITVLEAD 638 (795) Q Consensus 559 ~d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~~g~~v~~~ad 638 (795) +|+||++|+|+++++++||.+.++..+...+++++||||+.++..++++|++.+..+.....+++||+|++|+++.+.+| T Consensus 555 ~D~l~~~v~r~~~~~~~r~~~~~~~~d~~~~~~~~~lD~~~~~~~~~~~~~~~~~~T~~~~~~~~gl~~l~G~~v~v~~d 634 (792) T protein:vir:94 555 GSTMYLVLRNQSHTWMCRAHFTKNSIDFPDEPYRLYIDNKVKYVIPEGSYNDDTYATTVKPVDVYGMKYWTGKFYIVASD 634 (792) T ss_pred CCEEEEEEEeCCCEEEEEEEEeecccccCCCcceeeeeeeeeEEecCcceecCceeeeeccccccCcccccCcEEEEEec Confidence 99999999999999999999999999999999999999999999999999999999988889999999999999999999 Q ss_pred Ccccccceeeecc-CCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEEEEEEeeccceE Q lcl|NC_015271. 639 GKITEFEEPEVGW-KNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRRAWVNYEDSGTF 717 (795) Q Consensus 639 g~~~~~~~~~~g~-~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~~~t~~~ 717 (795) |.......+..++ ...+++++++++++++|+|||+|+++++++||+++.++|+++......||+||||++++|.+||.| T Consensus 635 G~~~~~~~~~~~~~~~~~~i~~~g~~~a~~v~VGl~y~~~~~~~~~~~~~~~g~~~~~~~~~gr~rl~r~~~~~~~tg~~ 714 (792) T protein:vir:94 635 GLVSWFEPPRGGWPNGVPMLTMSGNREGETIYVGLAISFRYVFSKFLIKKTADDGSIATEDIGRLQLRRAWVNYEDSGAF 714 (792) T ss_pred CceeEeecccceecCCccEEEecCCccCCeEEEeeeeeEEEEeccceeeccCCCcCccccceeeEEEEEEEEeeecccee Confidence 9876554444333 345678999999999999999999999999999999999988888889999999999999999999 Q ss_pred EEEecCCcccccccccccccccccccccccccccceEEEEeeecccceEEEEEECCCCCEEEEEEEEEEEEeccccCC Q lcl|NC_015271. 718 DIYVENQSSNWKYTMAGARLGAHVMRTGKLNLGTGQYRFPVVGNAKFNTVFILSDATTPLNIIGCGWEGNYLRRSSGI 795 (795) Q Consensus 718 ~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r~rrv 795 (795) .+++++.+++..+.+.+++++++.+..+.+++.++++++|+.+|+++.+|+|+|++|+||+|+||+|||+||+|+||| T Consensus 715 ~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlai~~eg~y~~r~~~v 792 (792) T protein:vir:94 715 TVEVENTSRLFSYDMAGARLGSNVLRAGGLNVGTGQFRFPVTGNAQLNEVRIISEHTTPLNVIGCGWEGNYLRRSSGI 792 (792) T ss_pred EEEEcCCCcceeeeeccceeccccccccccccccceEEEEeeccCceEEEEEEECCCCCEEEEEEEEEEEEeccccCC Confidence 999999888877778999999999999999999999999999999999999999999999999999999999999999 No 6 >protein:vir:3366 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523337;swissprot:trembl:q8w5u3;genbank:gi:17570828;goa:Q8W5U3;uniprot:Q8W5U3;genbank:GeneID:927453 Probab=100.00 E-value=1.9e-247 Score=1373.19 Aligned_cols=792 Identities=68% Similarity=1.137 Sum_probs=757.9 Q ss_pred CCceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCceEEEE Q lcl|NC_015271. 1 MALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESEQYYAV 80 (795) Q Consensus 1 M~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (795) ||+|+|+||||++|||||+|++||++||++|+||+|+|+|||+||||++||++++.++....++++|+|+|+++|+|+|+ T Consensus 1 M~~i~~~~~nl~~GvSqq~d~~r~~~q~~~~~N~~~~~~gG~~rRpGt~~va~~~~~~~~~~~~~~~~~~r~~~~~y~l~ 80 (801) T protein:vir:33 1 MALISQSIKNLKGGISQQPDILRFTEQGSVQINGWSSESEGIQKRPPMIHLKTLGTAGYVGAQPYVHLINRDEFEQYFVV 80 (801) T ss_pred CceeEeeccceecceeccchhHhhhhhHhhhhcceeecccCcccCchhHhhhhhcCCCccccceEEEEEEeCCceEEEEE Confidence 99999999999999999999999999999999999999999999999999999998888888999999999999999999 Q ss_pred EeCCeEEEEecCCcEEEEEECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEeccc Q lcl|NC_015271. 81 FTGTGIRVFDLAGNERQVRYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINVRGG 160 (795) Q Consensus 81 ~~~~~~rv~~~~g~~~~v~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v~~~ 160 (795) |++++||||+++|.++.+.. ..+|+.+++++++|+++|+||+|||+|++++|++..+......++..++++++++.+ T Consensus 81 ~~~~~irv~~~~G~~~~v~~---~~~y~~~~~~~~~l~~~t~aD~~fi~nr~~~p~~~~~~~~~~~~~~~~~~li~v~~~ 157 (801) T protein:vir:33 81 FTGEDIKVFDLDGKEYQVRG---DRSYVRTANPREDLRMVTVADYTFVTNRKVVVQSNDQSVNLPGFKDQGDALINVRGG 157 (801) T ss_pred EcCCeEEEEccCCcEEEEec---CCcceeecCcchheEEEEEcCEEEEeeCCeeecccCCcccccccCCCcceEEEEeec Confidence 99999999999999887754 467998999999999999999999999999999988877777788888999999999 Q ss_pred ccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhcc---------cccCceeeeecCceEEEEecCCcce Q lcl|NC_015271. 161 QYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQCR---------VSAPGWTFNVGQGYIHIIAPEGQQI 231 (795) Q Consensus 161 ~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~---------s~~~g~t~~~~g~~~~i~~~~~~~~ 231 (795) +|+++|+++++|...+.+++|.++.+....+.+..+++..|..++. ...++|++..+++.+++.++++... T Consensus 158 ~yg~t~~I~i~gs~~~~~~~~~gs~~~~v~~~s~~~~A~~l~~~~~~~~~~~~~~~~~~~w~~~~~~g~~~i~~p~~~~~ 237 (801) T protein:vir:33 158 QYGRRLSIEFNGAERAAVQLPDGSQPAHVNEVDGQAIAEKLAAQLRNNLGNPNNDQDPNKWRFNVGPGFIHILAPNNDNV 237 (801) T ss_pred ccceEEEEEECCcceEEEEeeccccccccccccchhhhhhhhhhhhccCccceeeecCceEEEEecCeEEEEecCCCccc Confidence 9999999999999999999999888888888888888777765543 2456788888888899999998888 Q ss_pred eeEEEecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeE Q lcl|NC_015271. 232 DSLTTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPH 311 (795) Q Consensus 232 ~~~~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~ 311 (795) ..+++.+++.++++.++.+.++++++||.++++|++|+|.++.+.+.++||++|+...++|+||++++...+++.++||+ T Consensus 238 ~~itt~~g~~~~~~~~~~~~v~~~~~lp~~~~~g~~v~v~~~~~~~~~~y~v~~~~~~~~w~e~~~~g~~~~~~~~tmp~ 317 (801) T protein:vir:33 238 WGLQTKDGYADQLINPVTHYTQSFQKLPINAPDGYIVKIVGDTSKTADQYYVRFDLNRKVWVETIGWNTRTHLHYHTMPW 317 (801) T ss_pred ccccccCCccceeEEEEeecccceeeeeeecCCCcEEEEEecCCCcccceEEEEEcCCcEEEEeeccccceeeeecccce Confidence 88899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeecCceeeecccCCccccCCcccccccccccCCCccEEEEEcceEEEecCCeEEEEecCCccccccccccccCCCCc Q lcl|NC_015271. 312 ALVRAADGNFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSGENIILSRTAKYFNFYPASIATLSDDDP 391 (795) Q Consensus 312 ~~v~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~ 391 (795) .|++.++++|+++.++|++|.+||+++||+|+|.|++|++|+||||||+|+++++|||||+||||||+++|++++.|||| T Consensus 318 ~l~~~~~~tf~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~ 397 (801) T protein:vir:33 318 ALVRASDGNFDFKYLEWGARTVGDDTTNPYPSFTGQTINDIFFFRNRLGFLSGENIILSRTSKYFNFFPASVSNYSDDDP 397 (801) T ss_pred EEEEccCceEEecccCccccccCCccccCcccccCCCceEEEEEcceEEEeeCCeEEEEecCCccccccccccCCCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCeeEE Q lcl|NC_015271. 392 IDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSI 471 (795) Q Consensus 392 i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~~~v 471 (795) |+++++++++|+|+|+++++++|+|||+++||+|+|+++|||+|++++++|+|+|+++|+|+.+|++++|+|++|+|+++ T Consensus 398 i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~v 477 (801) T protein:vir:33 398 IDVAVSHDRVSTLKYAVPFSEELLLWSDQAQFVLTASDILSSRSVGLNLTTQFDVQDRARPHGVGRNVYFSSPRASFTSI 477 (801) T ss_pred EEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEeecccCCCCceEecCeEEEEecCCCeeEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecCCCeE Q lcl|NC_015271. 472 HRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGSNVQ 551 (795) Q Consensus 472 ~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~g~~~ 551 (795) +|+++|++++|+|+++|||+|++|||+|++++++++++++.+++|+++++|+|++|+||++++||+|+|||||+|+|+++ T Consensus 478 ~r~~~~~~~~d~y~~~Dlt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~ 557 (801) T protein:vir:33 478 NRYYAVQDVSSVKNAEDMTAHVPNYIPNGVFSISGTTAENFVAILTSGAPNRVYIYKFLYIDEEIRQQSWSHWDFGDNVT 557 (801) T ss_pred EEEEeecccccceehhhHHHHHHHhcCCceEEEEEcCCCCeEEEEEecCCCEEEEEEEecCCCceEEEeeEEEEcCCCEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEeCCEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeecccccCCcccCce Q lcl|NC_015271. 552 VLACQCINSDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIYGADFAKGK 631 (795) Q Consensus 552 ~~~~~~~~d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~~g~ 631 (795) ++|+.+.+|+||++|+|+++.+++||.+.++..+...+++++||||+.++++.+++|+..+..++..++.++|++|+||+ T Consensus 558 ~~~~~~~~d~l~~vv~r~~~~~le~~~~~~~~~d~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~eg~ 637 (801) T protein:vir:33 558 VFAAQVINSTMTVLMSNEHAVWMGRLHFTKDSIDLPGEPYRLYIDAKRKYTIPAGTYNDDTYQTSISLSTIYGMNFTKGR 637 (801) T ss_pred EEEEecCCCEEEEEEEcCCcEEEEEEEEeeccccCCCccceEEeecceEEEecccceecCccccccccccccCCccccce Confidence 99998889999999999999999999989999999999999999999999999999999999999999999999999999 Q ss_pred EEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEEEEEEe Q lcl|NC_015271. 632 ITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRRAWVNY 711 (795) Q Consensus 632 ~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~ 711 (795) ++.+++||.+++...+..++.+..++++++++++++|+|||+|+++++++||+++.++|++......+||+||||++|++ T Consensus 638 ~v~~~~dG~v~~~~~~~~~~~~~~~l~i~~~~~~~~v~vGl~y~s~~~~~~~~~~~~~~~~~~~~~~~~r~~l~r~~~~~ 717 (801) T protein:vir:33 638 VSVVFPDGKIVEIDQPINGWSSDPMLRLDGNQEGQVVYIGFNIPFTYTFSKFLIKKTAEDGSTATEDIGRLQLRRAWVNY 717 (801) T ss_pred EEEEEeCCceEeeeeccccccCceeEEecCCCCCCEEEEeeeeeEEEEeCceEEeccCCCCceeeeeeccEEEEEEEEEe Confidence 99999999999888888888888999999999999999999999999999999999999888888889999999999999 Q ss_pred eccceEEEEecCCcccccccccccccccccccccccccccceEEEEeeecccceEEEEEECCCCCEEEEEEEEEEEEecc Q lcl|NC_015271. 712 EDSGTFDIYVENQSSNWKYTMAGARLGAHVMRTGKLNLGTGQYRFPVVGNAKFNTVFILSDATTPLNIIGCGWEGNYLRR 791 (795) Q Consensus 712 ~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r 791 (795) .+|++|++.|+++.++..+...++++++..+..+.+++.++.+++|+.+|+++.+|+|+|++|+||+||||+|||+||+| T Consensus 718 ~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvl~i~~eg~y~~r 797 (801) T protein:vir:33 718 EDSGAFIIRVNNLSREFIYTMAGARLGSDNLRVGGSNIGTGQYRFPVVGNAQTNTVTIESDASTPLNIIGCGWEGNYLRR 797 (801) T ss_pred ecCcceEEEECCcccceeeeecccccccccccccccccccceEEEEeeccCceEEEEEEeCCCCCEEEEEEEEEEEEecc Confidence 99999999999998888888999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCC Q lcl|NC_015271. 792 SSGI 795 (795) Q Consensus 792 ~rrv 795 (795) +||| T Consensus 798 ~~~~ 801 (801) T protein:vir:33 798 SSGI 801 (801) T ss_pred ccCC Confidence 9999 No 7 >protein:vir:94713 Length: 785 # NCBI annotation: tail tube # Family: family:all:825 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338122;genbank:gi:77118200;genbank:GeneID:3707736 Probab=100.00 E-value=2.3e-246 Score=1367.24 Aligned_cols=783 Identities=50% Similarity=0.888 Sum_probs=740.5 Q ss_pred CCceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCceEEEE Q lcl|NC_015271. 1 MALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESEQYYAV 80 (795) Q Consensus 1 M~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (795) ||+|+|+||||++|||||+|++||++||++|+||+|+|+||++||||++||++++..+ +.+.++|+|+|+++|+|+|+ T Consensus 1 M~~~~~s~~n~~~GvSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~~v~~l~~~~--~~~~~~~~f~~~~~~~y~l~ 78 (785) T protein:vir:94 1 MPLITQSIKNLKGGISQQPDILRFSDQGEAQVNCWSSESDGLQKRPPTVFKRRLNIDV--GSNPKFHLINRDEQEQYYIV 78 (785) T ss_pred CcceeeecchhhcceecCCchHHhhhHHhhhhcceeeeccCcccCChhHhhhcccCCC--CcCcEEEEEEeCCCceEEEE Confidence 9999999999999999999999999999999999999999999999999999986543 45779999999999999999 Q ss_pred EeCCeEEEEecCCcEEEEEECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEeccc Q lcl|NC_015271. 81 FTGTGIRVFDLAGNERQVRYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINVRGG 160 (795) Q Consensus 81 ~~~~~~rv~~~~g~~~~v~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v~~~ 160 (795) |++++||||+++|.++.+. +..||+.+++++++|+|+|+||+|||+|++++|+++.|..+. .|.+.+++|+.++.+ T Consensus 79 ~~~~~irv~~~~G~~~~v~---~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~~~~~~~~~~~-~~~~~~~~~~~i~~g 154 (785) T protein:vir:94 79 FNGSNIQIVDLSGNQYSVS---GSVDYVKSSNPRDDIRVVTVADYTFVVNRKVVVKGGSEKSHS-GYNRKARALINLRGG 154 (785) T ss_pred EcCCeEEEEecCCcEEEEe---cCCCceeecCchhheeeEeeCCEEEEEcCCcceeeeeccCCc-CCCCCCceEEEeccc Confidence 9999999999999887764 567899999999999999999999999999999999886654 477889999999999 Q ss_pred ccCeEEEEEECCceeEEEEecCCCCccc-ccccchhHHhHhhhhhcccccCceeeeecCceEEEEecCCcceeeEEEecC Q lcl|NC_015271. 161 QYGRTLQIIINGNTQATYQIPDGSQPEH-VNNTDAQWLAEELARQCRVSAPGWTFNVGQGYIHIIAPEGQQIDSLTTKDG 239 (795) Q Consensus 161 ~~~~ty~vt~~g~~~a~~ttp~~s~~~~-~~~~~~~~i~~~l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~~~~~~~~dg 239 (795) +|+++|++.+++..++++++++++++.. ....+.++++.++.+++..+.++|++...++++++++++++.+.+++++++ T Consensus 155 ~y~~~y~i~i~g~~~at~~t~~~s~a~~s~~~~s~~~i~~~l~~~l~a~~t~~t~~~~g~~i~i~a~s~t~~~~~s~~~~ 234 (785) T protein:vir:94 155 QYGRTLKVGINGGVKVSHKLPAGNDAENDPPKVDAQAIGAALRDLLVTAYPTFTFDLGSGFLLITAPSGTDINSVETEDG 234 (785) T ss_pred ccceeEEEeeCCcceeEEEEccCccccccccccchHHHHHHHHHHhhccccceeEEecCcEEEEEecCCccccceeeecc Confidence 9999999999999999999998887654 577888999999999999999999999999999999999999999999999 Q ss_pred cCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeEEEEeecCc Q lcl|NC_015271. 240 YADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALVRAADG 319 (795) Q Consensus 240 ~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~~~v~~~~~ 319 (795) +++++++++.++++++++||.+|++|++|+|.++.++++++||++|+...++|+||+++++..+++.++||+.|++.+++ T Consensus 235 ~~~t~~~~~~~~~~~~~~Lp~~~~~G~~v~v~~~~~~~~~~y~v~~~~~~g~w~e~~~~g~~~~~~~~tmp~~l~~~~~~ 314 (785) T protein:vir:94 235 YANQLISPVLDTVQTISKLPLAAPNGYIIKIQGETNSSADEYYVMYDSNTKTWKETVEPGVVTGFDNTTMPHALVRQSDG 314 (785) T ss_pred cCCeEEEEEEeeccceeccccccCCCCEEEEEccCCCCccceEEEEEcCCceEEEecccceeeeeeccccceEEEeccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeecccCCccccCCcccccccccccCCCccEEEEEcceEEEecCCeEEEEecCCccccccccccccCCCCcEEEEEcCC Q lcl|NC_015271. 320 NFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSGENIILSRTAKYFNFYPASIATLSDDDPIDVAVSTN 399 (795) Q Consensus 320 t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~ 399 (795) +|++++.+|++|.+||+++||+|||.|++|++|+||||||+|+++++|||||+||||||+++|++++.|||||+++++++ T Consensus 315 ~~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~ 394 (785) T protein:vir:94 315 SFEFKALDWSKRGAGNDDTNPMPSFVDATINDVFFYRNRLGFLSGENVIMSRSASYFAFFPKSVATLSDDDPIDVAVSHP 394 (785) T ss_pred ceEEeccccccccCCCcccCCcceecccccceEEEEeceEEEecCCeEEEEccCCcccCccccccCCCCCccEEEEecCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCeeEEEEEEeecc Q lcl|NC_015271. 400 RIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQD 479 (795) Q Consensus 400 ~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~~~v~~~~~~~~ 479 (795) |+|+|+|+++++++|+|||+++||+|+++++|||+|++++++|+|+|+++++|+.+|++++|++++|+|++++|++++++ T Consensus 395 ~~~~i~~~v~~~~~L~l~T~~~e~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~v~r~~~~~~ 474 (785) T protein:vir:94 395 RISILKYAVPFSEQLLLWSDEVQFVMTSSGVLTSKSIQLDVGSEFALGDNARPFAVGRSVFFSAPRGSFTSIKRYFAVAD 474 (785) T ss_pred cceeeEEEeecCCcEEEEecCcEEEEcCCCcccceeEEEEEEEeeeccCCCCceEeCCeEEEEecCCCeeEEEeeeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecCCCeEEEEEEEeC Q lcl|NC_015271. 480 VSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGSNVQVLACQCIN 559 (795) Q Consensus 480 ~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~g~~~~~~~~~~~ 559 (795) ++|+|+++|||+|++|||+|+++++++++++|++++|++++||+|++||||++++||+|+|||||+|+|.+.++|++..+ T Consensus 475 ~~d~y~~~dlt~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~g~l~~~~y~~~~~e~~v~aW~r~~~~~~~~~~~~~~~~ 554 (785) T protein:vir:94 475 VSDVKDADDTTGHVLSYIPNGVFDIQGTGTENYICVNSTGAYNRIYIYKFLFKDSVQLQASWSHWEFPKDDKILASASIG 554 (785) T ss_pred cccceehhhHHHHHHHhcCCCcEEEEEecCCCcEEEEEEcCCCEEEEEEEeecCCceEEEEEEEEEeCCCeEEEEEEEeC Confidence 99999999999999999999999999999999999999999999999999999999999999999999988888988899 Q ss_pred CEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeeccccc-CCcccCceEEEEecC Q lcl|NC_015271. 560 SDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIY-GADFAKGKITVLEAD 638 (795) Q Consensus 560 d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~-gl~~~~g~~v~~~ad 638 (795) |++|++++|.++.+++++...+...|...+++++||||+.++..++++|++++..+...++..+ |+.|+||++|.+++| T Consensus 555 d~~~~vv~r~~g~~~~~ie~~~~~~d~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~~g~~~leg~~v~v~ad 634 (785) T protein:vir:94 555 STMFIVRQHQGGVDIEHLKFIKEATDFPSEPYRLHVDSKVSMVIPIGSFNADTYKTTVDIGAAYGGNAPSPGRYYLIDSQ 634 (785) T ss_pred CEEEEEEEcCCCEEEEEEEeecccCCCCCcceeEEeeeeeEEEecCcceeccccccccccccccccCCccCCeEEEEeeC Confidence 9999999999888888888788888999999999999999999999999999988887666655 799999999999999 Q ss_pred CcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEEEEEEeeccceEE Q lcl|NC_015271. 639 GKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRRAWVNYEDSGTFD 718 (795) Q Consensus 639 g~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~~~t~~~~ 718 (795) |..++...+.. +..++++++++++++|+|||+|+++++|+||++++++|++. ..+.+||+||+|++++|.+|++|+ T Consensus 635 G~~~~~~~v~~---~~~tl~~~g~~~~~~v~vGl~y~~~~~~~~~~~~~~~~~~~-~~~~~gr~~l~r~~~~~~~sg~~~ 710 (785) T protein:vir:94 635 GAYLDLGELTS---ISTVITLNGDWSGRTVFIGRSYLMSYKFSRFLIKIEDDSGT-QSEDTGRLQLRRAWVNYRDTGALR 710 (785) T ss_pred CcCccCceEcC---CCcEEEecCCCCCceEEEeeeeeEEEeecceeEEecCCCcc-cccccccEEEEEEEEEeecccceE Confidence 99888776654 34688999999999999999999999999999999997654 567789999999999999999999 Q ss_pred EEecCCcccccccccccccccccccccccccccceEEEEeeecccceEEEEEECCCCCEEEEEEEEEEEEeccccCC Q lcl|NC_015271. 719 IYVENQSSNWKYTMAGARLGAHVMRTGKLNLGTGQYRFPVVGNAKFNTVFILSDATTPLNIIGCGWEGNYLRRSSGI 795 (795) Q Consensus 719 v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r~rrv 795 (795) +++++..+++.+.+.+++++++ ..+.+|+.++++++|+.+|+++.+|+|+|++|+||+|+||+|||+||+|+||| T Consensus 711 v~v~~~~~~~~~~~~~~~~g~~--~~~~~~~~tg~~~vp~~g~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r~~~v 785 (785) T protein:vir:94 711 LIVRNGEREFVNTFNGYTLGQQ--TIGTTNIGDGQYRFAMNGNALTTSLTLESDYPTPVSIVGCGWEASYAKKARSV 785 (785) T ss_pred EEecCCCccceeeecCcccCcc--cccccccccceEEEEeecccceEEEEEEECCCCceEEEEEEEEEEEeccccCC Confidence 9999988888788899999864 45788999999999999999999999999999999999999999999999999 No 8 >protein:vir:8887 Length: 808 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813776;genbank:gi:29366731;genbank:GeneID:1258831 Probab=100.00 E-value=1e-237 Score=1319.85 Aligned_cols=787 Identities=55% Similarity=0.937 Sum_probs=729.0 Q ss_pred CCceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCceEEEE Q lcl|NC_015271. 1 MALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESEQYYAV 80 (795) Q Consensus 1 M~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (795) ||+|+|+||||++|||||+|++||++||++|+||+|+|+|||+||||++||+++++++....++++++|+|+++|+|+|+ T Consensus 1 M~~v~~s~~n~~~GvSqq~d~~R~~~q~~~~~N~~~~~~gG~~rRpgt~~v~~l~~~~~~~~~~~~~~~~~~~~~~y~v~ 80 (808) T protein:vir:88 1 MGLVSQSVKNLKGGISQQPDILRFSNQGALQINGWSSETQGLQKRPPTTFTKRLQNKGFLGTKPLVHLINRDAQEQYFVG 80 (808) T ss_pred CcceeeecchhccceeccchhHhhhhhhhhhhcceeeeccccccCCchheeeeeeccCCCCCCcEEEEEEeCcCceEEEE Confidence 99999999999999999999999999999999999999999999999999999988887778899999999999999999 Q ss_pred EeCCeEEEEecCCcEEEEEECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEeccc Q lcl|NC_015271. 81 FTGTGIRVFDLAGNERQVRYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINVRGG 160 (795) Q Consensus 81 ~~~~~~rv~~~~g~~~~v~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v~~~ 160 (795) |++++||||+++|.++.+. +..+|+.+++++++|+|+|+||+|||+|++++|++..+.......+...+++++++.+ T Consensus 81 ~~~~~i~v~~~~G~~~~v~---~~~~y~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~~~~~~~~~~~~~~~vr~g 157 (808) T protein:vir:88 81 FSGTGLAVWDLKGNNYTVR---GYNGYANCANPRTDLRLITVADYTFVVNRNTVCQMGSTLTHAAYPRLDGRAIINVRGG 157 (808) T ss_pred EeCCeEEEEEcCCceEEEe---ecCcceEecCChhheeEEEEcCEEEEEcCCcceeecccccccCCCCCCccEEEEEccc Confidence 9999999999999998875 4568999999999999999999999999999999877765656667788899999999 Q ss_pred ccCeEEEEEECCc-----eeEEEEecCCCCcc-----------cccccchhHHhHhhhhhccccc--CceeeeecCceEE Q lcl|NC_015271. 161 QYGRTLQIIINGN-----TQATYQIPDGSQPE-----------HVNNTDAQWLAEELARQCRVSA--PGWTFNVGQGYIH 222 (795) Q Consensus 161 ~~~~ty~vt~~g~-----~~a~~ttp~~s~~~-----------~~~~~~~~~i~~~l~~~~~s~~--~g~t~~~~g~~~~ 222 (795) +|+++|+++++|. .++++.++.++.+. ......+.|++..+..++.+.. .+|++...++.++ T Consensus 158 ~y~~~y~i~i~g~~s~~~~t~t~~~~~~s~~~v~~~~~~~~~~~~~~~~~~~ia~~l~~~~~~~~~~~~~~~~~~~~~~~ 237 (808) T protein:vir:88 158 QYGRTLSITINGDGTGSSPQASIKMPNGSAEKVPAGDPYAGMNQVDMTDASWIAAELARQLTVSLGGSGWSFQAGTGWIL 237 (808) T ss_pred ccCceEEEEEecCCcceeeeEeEEEccCcccceeeccceeecccCCccccccchhhheeeeeecccccceEEEeccceEE Confidence 9999999999873 34566666654322 3345667888888887766533 4678888889999 Q ss_pred EEecCCcceeeEEEecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEee Q lcl|NC_015271. 223 IIAPEGQQIDSLTTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVND 302 (795) Q Consensus 223 i~~~~~~~~~~~~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~ 302 (795) +..+.+..+.++++++|++++.+.++.+.|+++++||+.+++|++++|.++.++..++||++|+...++|+|+++++... T Consensus 238 i~~~a~~~~~~~~t~~g~~~~~~~~~~~~v~~~~~lp~~~p~g~~v~i~~~~~~~~~~~yv~~~~~~~~w~e~~~~~~~~ 317 (808) T protein:vir:88 238 INAPANDNVRQIATKDGYADTLLSGFIYQVQTFTKLPANAPPGYLVEITGESARSGDNYWVQYDASGKVWKETAKPKIIA 317 (808) T ss_pred EEeccCceeEEEcccCCcCcceeeeeeeeccceeeccccCCCCcEEEEEecCCCCCceeEEEEEcCCeEEEEeeecccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEeccceeEEEEeecCceeeecccCCccccCCcccccccccccCCCccEEEEEcceEEEecCCeEEEEecCCcccccccc Q lcl|NC_015271. 303 QLLFETMPHALVRAADGNFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSGENIILSRTAKYFNFYPAS 382 (795) Q Consensus 303 ~~~~~t~p~~~v~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t 382 (795) +++.++||+.++++++++|+++..+|++|.+||+++||.|+|+|++|++|+||||||+|+|+++|||||+||||||++++ T Consensus 318 ~~~~~tmp~~lv~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t 397 (808) T protein:vir:88 318 GFNNATLPHALVRAADGQFDWTPLTWDGRNAGDDDTNPMPSFVGATINDVFFFRNRLGFLSGENVVMSRTSKYFNFFPSS 397 (808) T ss_pred eecccceeEEEEecCCceEEEEecccccccccccccCccceecCCceeEEEEEcceEEEeeCCeEEEEeccCcccccCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCCCcEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecCcCCCCcEEeCCeEEEE Q lcl|NC_015271. 383 IATLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFA 462 (795) Q Consensus 383 ~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv 462 (795) ++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|+++|||+|++++++|+|+|+++|+|+.+|++++|+ T Consensus 398 ~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~i~T~~~e~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~f~ 477 (808) T protein:vir:88 398 VATLSDDDPIDVAISHNRISILKYAVPFSEQLLLWSDQAQFVLSSKTILSSKTIELDLTTEFDVSDGARPYGIGRGVYFA 477 (808) T ss_pred ccCCCCCccEEEEecCCccceeeEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEEecccCCCCceEeCCeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCeeEEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeE Q lcl|NC_015271. 463 SPRSSFTSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWS 542 (795) Q Consensus 463 ~~~g~~~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~ 542 (795) +++|+|++++|+|+|++++|+|+++|||+|++|||+|+++++++++++|.+++|++++||+|++||||++++||+|+||| T Consensus 478 ~~~g~~~~v~r~~~~~~~~d~y~~~dlt~~~~h~~~~~~~~~~~~~~~~~~~v~~~~~~g~l~~~~y~~~~~e~~v~aW~ 557 (808) T protein:vir:88 478 APRASFTSLKRYYAIQDVSDVKSAEDVSAHVPSYITNTVHAIHGSGTENFVSILSDGSPNKVFIYKFLYLDEILQQQSFS 557 (808) T ss_pred ecCCCeeEEEEEEEeeeccCceehhhHHHHHHHhcCCCeEEEEEeCCCCeEEEEEEcCCCEEEEEEEeccCCceeEEeeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeecCCCeEEEEE--EEeCCEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeecc Q lcl|NC_015271. 543 HWDFGSNVQVLAC--QCINSDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLP 620 (795) Q Consensus 543 ~w~~~g~~~~~~~--~~~~d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~ 620 (795) ||+|+|+++++++ ...+|+||++|+|+++.++|||.+.++..+...+++++||||+.++. .+.+++.+..+.+... T Consensus 558 r~~~~g~~~~~~~~~~~~~d~l~~vV~r~~~~~ler~~~~~~~~~~~~~~~~~~lD~~~~~~--~g~~~~~~~~t~~~~~ 635 (808) T protein:vir:88 558 HWEFGDAATTRVLAASCIGSYCYLMIDRPEGLCLERMEFTQHTIDYSIEPYRTYMDMKKTIV--LGAYNIDTNLTSFDVR 635 (808) T ss_pred EEecCCCeeEEEEEEeccCCEEEEEEEcCCcEEEEEEeeccCCCCCccccceeeeeeeeeec--cccccCccccceeecc Confidence 9999998875544 44689999999999999999999999999999999999999998875 4678888888877765 Q ss_pred cc-cCCcccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceecccc Q lcl|NC_015271. 621 TI-YGADFAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDI 699 (795) Q Consensus 621 ~~-~gl~~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~ 699 (795) .. .++.|+++..+.+.+||..... ...++.+.+++++++++++++|+|||+|+++++|+||++++++|++++..+.. T Consensus 636 ~~~~~~~~~~~~~~~~~~dg~~~~~--~~~~~~~~~~~~~~~~~~~~~v~vGl~y~s~~~~~p~~~~~~~g~~~~~~~~~ 713 (808) T protein:vir:88 636 TAYGGTPGPESTFYTIDQQGVLIEH--EARDWATNPYISFVGNRAGEQMVIGKQYTFQYEFSKFLIKQTADDGSTSTEDI 713 (808) T ss_pred cccccccccceeEEEEcCCceEEee--ecccccCcceEEeCCCccCceEEEeeeeeEEEEecceEEecCCCCcceeeccc Confidence 54 4778999999999999876543 23356677889999999999999999999999999999999999999999999 Q ss_pred ccEEEEEEEEEeeccceEEEEecCCcccccccccccccccccccccccccccceEEEEeeecccceEEEEEECCCCCEEE Q lcl|NC_015271. 700 GRLQLRRAWVNYEDSGTFDIYVENQSSNWKYTMAGARLGAHVMRTGKLNLGTGQYRFPVVGNAKFNTVFILSDATTPLNI 779 (795) Q Consensus 700 grl~l~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tv 779 (795) ||+||+|+++++.+|++|.+.++++.++..+.+.+++++++. ..+.+|+.+|++++|+.+|+++.+|+|+|++|+||+| T Consensus 714 gr~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~-~~~~~~~~tg~~~vp~~~~~~~~~v~i~~d~P~P~ti 792 (808) T protein:vir:88 714 GRLQLRRAWLNYEESGAFEINVNNGSSEFVYVMTGGRLGIQR-VLGELSVGTGQFKFPVTGNAVNQRVTITSSNPNPLNV 792 (808) T ss_pred ceEEEEEEEEEeecccceEEEeCCCcccceeeccCcccCccc-ccCccccccceEEEEecccCceeEEEEEECCCCceEE Confidence 999999999999999999999999888888889999999764 5888999999999999999999999999999999999 Q ss_pred EEEEEEEEEeccccCC Q lcl|NC_015271. 780 IGCGWEGNYLRRSSGI 795 (795) Q Consensus 780 lsi~~eg~y~~r~rrv 795 (795) |||+|||+||+|+||| T Consensus 793 lsi~~eg~y~~r~~~v 808 (808) T protein:vir:88 793 IGCGWEGNYIRRSSGI 808 (808) T ss_pred EEEEEEEEEeccccCC Confidence 9999999999999999 No 9 >protein:vir:97014 Length: 800 # NCBI annotation: 33 # Family: family:all:825 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654134;genbank:gi:108862018;genbank:GeneID:5075963 Probab=100.00 E-value=8.9e-226 Score=1254.38 Aligned_cols=771 Identities=24% Similarity=0.363 Sum_probs=688.7 Q ss_pred CceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEE--e-CCCceEE Q lcl|NC_015271. 2 ALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLIN--R-DESEQYY 78 (795) Q Consensus 2 ~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~--~-~~~~~y~ 78 (795) =.|+|+||||++|||||+|++||++||++|+||+|+|+|||+||||++||+++++++. ..++++|. + +.+|+|+ T Consensus 1 ~~v~~s~~n~~~GvSqq~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~fva~l~~~~~---~~~~~~~~~~~d~~eq~~v 77 (800) T protein:vir:97 1 MEVQGSLGRQIQGISQQPPAVRLDGQCTAMVNMIPDVVNGTQSRMGTTHIAKILDAGT---DDMATHHYRRGDGDEEYFF 77 (800) T ss_pred CeeEeechhhhcccccCchhHhhhhhhhhhhcceeccccccccCCchhhheeecCCCc---ccceeEEEEEcCCceEEEE Confidence 3589999999999999999999999999999999999999999999999999987654 33555544 3 4568889 Q ss_pred EEEeCCeEEEEecCCcEEEEEECCCccccee-cCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEe Q lcl|NC_015271. 79 AVFTGTGIRVFDLAGNERQVRYTTDGSTYIN-TNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINV 157 (795) Q Consensus 79 l~~~~~~~rv~~~~g~~~~v~~~~~~~~yl~-~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v 157 (795) |+|+++++|||+++|+++.|.......+|+. +++++++|+|+|+||+|||+|++++|++..+.. ..++.++++++ T Consensus 78 ~~~~~~~~rv~~~~G~~~~v~~~~~~~~y~~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~----~~~~~~~~~~v 153 (800) T protein:vir:97 78 TLKKGQVPEIFDKYGRKCNVTSQDAPMTYLSEVVNPREDVQFMTIADVTFMLNRRKVVKASSRKS----PKVGNKAIVFC 153 (800) T ss_pred EEEcCCEEEEEecCCcEEEEecCCcceEEEeccCCCccceeEEEEcCEEEEeeCceecccccccc----cCCCcceEEEE Confidence 9999999999999999999988887778874 456888999999999999999999999876543 35678899999 Q ss_pred cccccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhccc--ccCceeeeecCceEEEEecCCcceeeEE Q lcl|NC_015271. 158 RGGQYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQCRV--SAPGWTFNVGQGYIHIIAPEGQQIDSLT 235 (795) Q Consensus 158 ~~~~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s--~~~g~t~~~~g~~~~i~~~~~~~~~~~~ 235 (795) ++|+|+++|+++|++..++++++|+++++..+.+.++++++.+|...+.. ...+|++...+++++|+++++... +++ T Consensus 154 ~~g~y~~~y~i~I~~~~~~~~~t~~~t~~~~~~~~~~~~ia~ql~~~~~~~~~~~~~t~~~~G~~~~i~~~~~~~~-~v~ 232 (800) T protein:vir:97 154 AYGQYGTSYSIVINGANAASFKTPDGGSADHVEQIRTERITSELYSKLQQWSGVSDYEIQRDGTSIFIERRDGASF-TIT 232 (800) T ss_pred eecccceeeeeccCCcceEEEEEcCCCCcccceeccHHHHHHHHHHhhhccccccceEEEeCCcEEEEEEcCCceE-EEE Confidence 99999999999999999999999999999999999999999999998865 347799999999999999887654 688 Q ss_pred EecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeec---CceEEEeeeeeEeeeEeccceeEE Q lcl|NC_015271. 236 TKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTT---RKVWSETLGWNVNDQLLFETMPHA 312 (795) Q Consensus 236 ~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~---~~~w~E~~~~~~~~~~~~~t~p~~ 312 (795) +.++++++++.+++++|+++++||+.+++|+.++|+++++++.+.||++|+.+ .++|+|+++++...+++.++|||. T Consensus 233 t~~g~~~~~~~~~~~~v~~~~~lp~~~~~g~~v~i~~~~~~~~~~y~~~~~~~~~~~~~w~e~~~~~~~~~~~~~tmp~~ 312 (800) T protein:vir:97 233 TTDGAKGKDLVAIKNKVSSTDLLPSRAPAGYKVQVWPTGSKPESRYWLQAEPKEGNLVSWKETIAADVLLGFDKGTMPYI 312 (800) T ss_pred ecCCcCceeeeEEeeeccchhhchhhCCCCcEEEEEccCCCCCceEEEEEEecccCcceEEEeeccccccceecccceEE Confidence 99999999999999999999999999999999999999999999999999864 568999999999999999999999 Q ss_pred EEee----cCceeeecccCCccccCCcccccccccccC----CCccEEEEEcceEEEecCCeEEEEecCCcccccccccc Q lcl|NC_015271. 313 LVRA----ADGNFELKRIEWSPKTCGDDDTNPWPSFMD----STINDVFFFRNRLGLLSGENIILSRTAKYFNFYPASIA 384 (795) Q Consensus 313 ~v~~----~~~t~~~~~~~w~~~~~gd~~~np~psf~~----~~~~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~ 384 (795) +++. ..++|++++.+|++|.+||+++||.|+|++ ++|++|+||||||+|+++++|||||+||||||+++|++ T Consensus 313 ~~~~~~~~~~g~~~~~~~~w~~r~~gd~~tnp~p~f~~~~~~~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~ 392 (800) T protein:vir:97 313 IERTDIINGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTIGGMFMVQNRLCFTAGEAVIASRTSYFFDFFRYTVI 392 (800) T ss_pred EEEeecccccceeEEEeccccccccCccccCccccccCCcCCCCceeEEEEeeeEEEecCCeEEEEecCCcccccccccc Confidence 9987 577999999999999999999999999998 78999999999999999999999999999999999999 Q ss_pred ccCCCCcEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecCcCCCCcEEeCCeEEEEec Q lcl|NC_015271. 385 TLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASP 464 (795) Q Consensus 385 ~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~ 464 (795) ++.|||||+++++++|+|+|+|+++++++|+|||+++||+|+|+++|||+|++++++|+|+|+++|+|+.+|++++|+++ T Consensus 393 ~~~DdD~I~~~~ss~~v~~i~~~v~~~~~L~i~T~~~q~~ls~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~fv~~ 472 (800) T protein:vir:97 393 SALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFILPGDKPLEKSNALLKPVTTFEVNNKVKPVVTGESVMFATN 472 (800) T ss_pred CCCCCccEEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEeeeccCCCCcEEeCCeEEEeeC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCeeEEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEee Q lcl|NC_015271. 465 RSSFTSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHW 544 (795) Q Consensus 465 ~g~~~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w 544 (795) +|+|++++|| +|++.+|+|+++|||+|++|||+|+++++++++++|.+++|+++++|+|++|+||++++||+|+||||| T Consensus 473 ~g~~s~vre~-~~~~~~d~~~a~DlT~~~~hl~~~~v~~~~~~~~~~~~v~~~~~~~~~l~~~~y~~~~~e~~~~aW~~~ 551 (800) T protein:vir:97 473 DGSYSGVREF-YTDSYSDTKKAQAITSHVNKLIEGNITNMAASTNVNRLLVTTDKYRNIIYCYDWLWQGTDRVQSAWHVW 551 (800) T ss_pred CCCeeEEEEE-eeeecccceehhhHHHHHHHhcCCceEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceEEEeEEEE Confidence 9999888876 568999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCeEEEEEEEeCCEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecC----------ccccccccc Q lcl|NC_015271. 545 DFGSNVQVLACQCINSDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPN----------GTYNDDTFT 614 (795) Q Consensus 545 ~~~g~~~~~~~~~~~d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~----------~~~~~~~~~ 614 (795) ++++...++|+.+.+|+||++|+|+++.++|||...+.. + ..++++++||+..+..... +......+. T Consensus 552 ~~~~~~~~~~~~~~~d~l~~vv~r~~~~~ler~~~~~~~-~-~~~~~~~~lD~~~~~~~~~~~~~~~~v~~~~~~~~~~~ 629 (800) T protein:vir:97 552 KWPIGTKVRGMFYSGELLYLLLERGDGVYLEKMDMGDAL-T-YGLNDRIRMDRQAELVFKHFKAEDEWVSEPLPWVPTNP 629 (800) T ss_pred ecCCCeEEEEEEEcCCeEEEEEEcCCcEEEEEEecccCc-C-cccccceeccccceeeeeeeecccceEeccccccCCCc Confidence 999988888988899999999999999999998765443 2 3456678888653332111 111122233 Q ss_pred ceeecccccCCcccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccce Q lcl|NC_015271. 615 TTLHLPTIYGADFAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGST 694 (795) Q Consensus 615 t~~~~~~~~gl~~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~ 694 (795) +......++|++|++|.+|.+.. +.......+ ....+++++++++|||||+|+++++|+||++++++|+.+. T Consensus 630 ~~~~~~~v~g~~~~~G~~v~~~~-~~~~~~~~~-------~~~~~~~~~~~~~v~vGl~Y~~~~~~~p~~i~~~~g~~~~ 701 (800) T protein:vir:97 630 ELLDCILIEGWDSYIGGSFLFKY-NPSDNTLST-------TFDMYDDSHVKAKVIVGQIYPQEFEPTPVVIRDNQDRVSY 701 (800) T ss_pred ceeEEEEecccccccCceEEEEe-cCccCcccc-------cceEEeCCCCCcEEEEeeeeeEEEEecceEEEecCCCcee Confidence 34445567899999999986544 322222222 2347899999999999999999999999999999886654 Q ss_pred eccccccEEEEEEEEEeeccceEEEEecCCcccc--cccccccccccccccccccccccceEEEEeeecccceEEEEEEC Q lcl|NC_015271. 695 STEDIGRLQLRRAWVNYEDSGTFDIYVENQSSNW--KYTMAGARLGAHVMRTGKLNLGTGQYRFPVVGNAKFNTVFILSD 772 (795) Q Consensus 695 ~~~~~grl~l~~~~~~~~~t~~~~v~v~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~ 772 (795) .+|+||+|++|+|.+|++|++.|++..++. .....+++++++.+..+.+|+.+|++++||.+|+++.+|+|+|+ T Consensus 702 ----~~r~~i~r~~~~~~~sg~~~~~v~~~~~~~~~~~~~~~~~~g~~~~~~g~~~~~tg~~~vp~~g~~~~~~v~i~~d 777 (800) T protein:vir:97 702 ----IDVPVVGLVHLNLDMYPDFSVEVKNVKSGKVRRVLASNRIGGALNNTVGYVEPREGVFRFPLRAKSTDVVYRIIVE 777 (800) T ss_pred ----ecceEEEEEEEeecccccEEEEEccccCCceeeeecCccccccccccCCccccccceEEEEeecccceeEEEEEEC Confidence 488999999999999999999999987753 33578899999999999999999999999999999999999999 Q ss_pred CCCCEEEEEEEEEEEEeccccCC Q lcl|NC_015271. 773 ATTPLNIIGCGWEGNYLRRSSGI 795 (795) Q Consensus 773 ~P~P~tvlsi~~eg~y~~r~rrv 795 (795) +||||+|+||+|||+||+|+||| T Consensus 778 ~PlP~tvlsi~~eg~y~~r~~rv 800 (800) T protein:vir:97 778 SPHTFQLRDIEWEGSYNPTKRRV 800 (800) T ss_pred CCCcEEEEEEEEEEEeecccccC Confidence 99999999999999999999999 No 10 >protein:vir:105647 Length: 800 # NCBI annotation: putative tail tubular B protein # Family: family:all:825 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425011;genbank:gi:83571759;uniprot:Q2WC41;genbank:GeneID:3837288 Probab=100.00 E-value=1e-223 Score=1243.10 Aligned_cols=782 Identities=24% Similarity=0.352 Sum_probs=685.2 Q ss_pred CceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCceEEEEE Q lcl|NC_015271. 2 ALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESEQYYAVF 81 (795) Q Consensus 2 ~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~~y~l~~ 81 (795) =.|+|+||||++|||||+|++||++||++|+||+|+|+|||+||||++||++++++.......+.+.-..++++.|+++| T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~fva~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (800) T protein:vir:10 1 MEVQGSLGRQIQGISQQPPAVRLDGQCTTMVNMVPDVVNGTQSRMGTTHIAKLLDEGTDNMATHHYRRGEGDEEYFFTLK 80 (800) T ss_pred CeEEeecchhcccccccchhHhhhhhhhhhhcceeeeccCcccCCcceEEEeecCCCCCccEEEEEecCCccceEEEEEE Confidence 35899999999999999999999999999999999999999999999999999876654332222211235567778888 Q ss_pred eCCeEEEEecCCcEEEEEECCCccccee-cCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEeccc Q lcl|NC_015271. 82 TGTGIRVFDLAGNERQVRYTTDGSTYIN-TNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINVRGG 160 (795) Q Consensus 82 ~~~~~rv~~~~g~~~~v~~~~~~~~yl~-~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v~~~ 160 (795) .++++|||+++|+++++........|+. +.+++++|+|+|+||+|||+|++++|+++.|..+.. ..+++++++.+ T Consensus 81 ~g~~~rv~~~~G~~~~v~~~~~~~~~~~~~~~~~~~l~~~tvaD~tfi~n~~~~~~~~~~~~~~~----~~~~~~~vr~g 156 (800) T protein:vir:10 81 KGQVPEIFDKHGRKCNVISQDAPMTYLSEVVNPREDVQFMTIADVTFMLNRRKVVKVSNRKSPKV----GDKAIVFCAYG 156 (800) T ss_pred cCCeEEEEecCCcEEEeecCCcceeeeeccCCchhhEEEEEEcCEEEEecCcccccccccCCCCC----CceEEEEEecc Confidence 9999999999999999988777666765 567888999999999999999999999988865443 36789999999 Q ss_pred ccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhccc--ccCceeeeecCceEEEEecCCcceeeEEEec Q lcl|NC_015271. 161 QYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQCRV--SAPGWTFNVGQGYIHIIAPEGQQIDSLTTKD 238 (795) Q Consensus 161 ~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s--~~~g~t~~~~g~~~~i~~~~~~~~~~~~~~d 238 (795) +|+++|++++++..++.+++|.++++..+.+.++++++.+|...+.. +.++|++...++.++|+++++. ..++++.+ T Consensus 157 ~y~~~y~i~i~g~~~~~~~t~~~~~~~~~~~~s~~~i~~~L~~~l~~~~~~~~~t~~~~g~~i~i~~~~~~-~~~~~~~~ 235 (800) T protein:vir:10 157 QYGTSYSIIINGTTAASFKTPDGGSAEHVEQIRTERITSELYSKLQQWSGVNDYEIQRDGTSIFIERRDGK-SFTVTTTD 235 (800) T ss_pred ccccceeEEeccceEEEEEecCCCcccccccccHHHHHHHHHhhhhhcCcccceEEEEcCcEEEEEEecCC-ceEEEEee Confidence 99999999999999999999999999999999999999999888754 4467999999999999988765 44788999 Q ss_pred CcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEee---cCceEEEeeeeeEeeeEeccceeEEEEe Q lcl|NC_015271. 239 GYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDT---TRKVWSETLGWNVNDQLLFETMPHALVR 315 (795) Q Consensus 239 g~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~---~~~~w~E~~~~~~~~~~~~~t~p~~~v~ 315 (795) ++.++++..++++|++.++||..+++|+.++|.++.+++.+.||++|+. +.+.|+|+++++...+++.++||+++++ T Consensus 236 ~~~~~~~~~~~~~v~~~~~Lp~~~~~g~~~~i~~~~~~~~~~y~~~~~~~~~~~~~w~e~~~~~~~~~~~~~tmp~~lv~ 315 (800) T protein:vir:10 236 GAKGKDLVAIKNKVSSTDLLPSRAPAGYKVQVWPTGSKPESRYWLQAEPKEGNLVSWKETIAADVLLGFDKGTMPYIIER 315 (800) T ss_pred cCCcceEEEEEeeccceeeccccCCCCceEEEEcCCCCCCceeEEEEEeccccceEEEeecccCceeeeecccccEEEEE Confidence 9999999999999999999999999999999999999999999999986 4567999999999999999999999998 Q ss_pred ec----CceeeecccCCccccCCcccccccccccC----CCccEEEEEcceEEEecCCeEEEEecCCccccccccccccC Q lcl|NC_015271. 316 AA----DGNFELKRIEWSPKTCGDDDTNPWPSFMD----STINDVFFFRNRLGLLSGENIILSRTAKYFNFYPASIATLS 387 (795) Q Consensus 316 ~~----~~t~~~~~~~w~~~~~gd~~~np~psf~~----~~~~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~ 387 (795) .+ +++|++.+.+|++|.+||+++||+|+|++ ++|++|+||||||+|+++++|||||+||||||+++|++++. T Consensus 316 ~~~~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~~~~~~~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~ 395 (800) T protein:vir:10 316 TGIIDGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTIGGMFMVQNRLCFTAGEAVIASRTSYFFDFFRYTVISAL 395 (800) T ss_pred eeeeecceeEEEEeccccccccCCCCCCCCchhcCCCCCCCceeEEEEeeeEEEeeCCeEEEEccCCccccccccccCCC Confidence 86 78999999999999999999999999997 57999999999999999999999999999999999999999 Q ss_pred CCCcEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCC Q lcl|NC_015271. 388 DDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSS 467 (795) Q Consensus 388 DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~ 467 (795) |||||+++++++++|+|+|+++++++|+|||+++||+|+|+++|||+|++++++|+|+|+++|+|+.+|++++|++++|+ T Consensus 396 DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~q~~l~g~~~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~ 475 (800) T protein:vir:10 396 ATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFILPGDKPLEKSNALLKPVTTFEVNNKVKPVVTGESVMFATNDGS 475 (800) T ss_pred CCccEEEEEcCCcceeeeeEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEeeeccCCCCceEeCCeEEEecCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecC Q lcl|NC_015271. 468 FTSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFG 547 (795) Q Consensus 468 ~~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~ 547 (795) |++|+||. |++.+|+|+++|||+|++|||+|++++++++++++.+++|+++++|+|++||||++++||+|+|||||+++ T Consensus 476 ~s~vre~~-~~~~~d~~~a~DlT~~~~hl~~~~v~~~~~~~~~~~~v~~~~~~~~~l~~~~yl~~~~e~~~~aW~~w~~~ 554 (800) T protein:vir:10 476 YSGVREFY-TDSYSDTKKAQAITSHVNKLIEGNITNMAASTNVNRLLVTTDKYRNIIYCYDWLWQGTDRVQSAWHVWEWP 554 (800) T ss_pred eeEEEEEe-eeecccceehhhHHhHHHHhcCCceEEEEEeCCCCeEEEEEEcCCCeEEEEEEeecCCceEEEEEEEEEcC Confidence 98888874 68999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred CCeEEEEEEEeCCEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccc--eeecccccCC Q lcl|NC_015271. 548 SNVQVLACQCINSDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTT--TLHLPTIYGA 625 (795) Q Consensus 548 g~~~~~~~~~~~d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t--~~~~~~~~gl 625 (795) +...++||...+|+||++|+|+++.++|||...... ...+++++||||..++.....++....... +.......++ T Consensus 555 ~~~~~~~~~~~~d~l~~iv~r~~~~~ier~~~~~~~--~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 632 (800) T protein:vir:10 555 MGTKVRGMFYSGELLYLLLERGDGVYLEKMDMGDAL--TYGLNDRIRMDRQAELIFKHFKAEDEWISEPLPWTPTNPELL 632 (800) T ss_pred CCcEEEEEEEeCCeEEEEEECCCcEEEEEEecccCc--cccccceeeeecceeecccccccCcceEEEeccccccCCcce Confidence 777777888889999999999999999998654432 246778899999988766544332221100 0111122344 Q ss_pred cccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEE Q lcl|NC_015271. 626 DFAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLR 705 (795) Q Consensus 626 ~~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~ 705 (795) .+++......+++|.+.....+..|+...+..-.+++.++++|+|||+|+++++++||++++++|+.+. .+|+||+ T Consensus 633 ~~~~~~~~~~~~~g~v~~~~~~~~g~~~~~~~~~~g~~~~~~v~VGl~Y~s~~~~~~~~i~~~~g~~~~----~~r~~i~ 708 (800) T protein:vir:10 633 DCILIEGWDSYIGGSFLFKYKPSDNTLSTTFDMHDDNHVKAKVIVGQIYPQEFEPTPVVIRDRQDRVSY----IDVPVVG 708 (800) T ss_pred EEeeeccceeecCceeEEEEEecCCceEeeeeecCCCcccceEEEeeeeeEEEeecceEEEcCCCcccc----cCCeEEE Confidence 444444455556677777766666543332222366789999999999999999999999999886543 3889999 Q ss_pred EEEEEeeccceEEEEecCCcccc--cccccccccccccccccccccccceEEEEeeecccceEEEEEECCCCCEEEEEEE Q lcl|NC_015271. 706 RAWVNYEDSGTFDIYVENQSSNW--KYTMAGARLGAHVMRTGKLNLGTGQYRFPVVGNAKFNTVFILSDATTPLNIIGCG 783 (795) Q Consensus 706 ~~~~~~~~t~~~~v~v~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlsi~ 783 (795) |++|+|.+||+|.+++++..++. .+...+++.+.+.+..|.+|+.+|++++|+.+|+++.+|+|+|++|+||+|++|+ T Consensus 709 r~~~~~~~sg~~~~~v~~~~~~~~~~~~~~~~~~g~~~~~~g~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlai~ 788 (800) T protein:vir:10 709 LVHLNLDMYPDFSVEVKNVKSGKVRRVLASNRIGGALNNTVGYVEPREGVFRFPLRAKSTDAVYRIIVESPHTFQLRDIE 788 (800) T ss_pred EEEEEeecCceEEEEeccCcccceeEEccCCeeccccccccCcccccCceEEEEEeccCceeEEEEEECCCCcEEEEEEE Confidence 99999999999999999877642 3456778888888889999999999999999999999999999999999999999 Q ss_pred EEEEEeccccCC Q lcl|NC_015271. 784 WEGNYLRRSSGI 795 (795) Q Consensus 784 ~eg~y~~r~rrv 795 (795) |||+||+|+||| T Consensus 789 ~eg~y~~r~~rv 800 (800) T protein:vir:10 789 WEGSYNPTKRRV 800 (800) T ss_pred EEEEeecccccC Confidence 999999999999 No 11 >protein:vir:7021 Length: 803 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853594;genbank:gi:31711676;genbank:GeneID:1481802 Probab=100.00 E-value=5.1e-223 Score=1239.26 Aligned_cols=779 Identities=23% Similarity=0.349 Sum_probs=680.3 Q ss_pred CceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEE---eCCCceEE Q lcl|NC_015271. 2 ALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLIN---RDESEQYY 78 (795) Q Consensus 2 ~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~---~~~~~~y~ 78 (795) =+|+|+||||++|||||+|++||++||++|+||+|+|+|||+||||++||++|++++.. ..+++.+. ++++|+|+ T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~~va~l~~~~~~--~~~~~~~~~~~~~~e~~~~ 78 (803) T protein:vir:70 1 MEVQGSLGRQIQGISQQPPAVRLDGQCSEMVNMVPDVVEGTKSRMGTTHIAKLLEYGED--DMAVHHYRRGGEGEEEYFF 78 (803) T ss_pred CeEEeecchhccccccCchHHhhhhhhhhhhcceeeeccccccCChhhhhhhhcCCCcc--cceeeEEEecCCCceEEEE Confidence 36899999999999999999999999999999999999999999999999998876542 33444444 34679999 Q ss_pred EEEeCCeEEEEecCCcEEEEEECCCccccee-cCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEe Q lcl|NC_015271. 79 AVFTGTGIRVFDLAGNERQVRYTTDGSTYIN-TNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINV 157 (795) Q Consensus 79 l~~~~~~~rv~~~~g~~~~v~~~~~~~~yl~-~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v 157 (795) |+|++++||||+++|+++.+........|+. +++++++|+|+|+||+|||+|++++|++..+.. ..++.++++++ T Consensus 79 ~~~~~~~irv~~~~G~~~~v~~~~~~~~~l~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~----~~~~~~~~~~v 154 (803) T protein:vir:70 79 IMKKGQVPEIFDKQGRKCMVQSQDAPMTYLSEVTNPREDVQFMTIADVTFMLNRKKIVKARPERS----PQVGSTAIVFM 154 (803) T ss_pred EEecCCeEEEEEcCCcEEEEecCCceeEEEeecCCChhheeEEEEcCEEEEecCceeeeeccccC----CCCCCceEEEE Confidence 9999999999999999998877666566664 557788999999999999999999999865543 34567899999 Q ss_pred cccccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhccc--ccCceeeeecCceEEEEecCCcceeeEE Q lcl|NC_015271. 158 RGGQYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQCRV--SAPGWTFNVGQGYIHIIAPEGQQIDSLT 235 (795) Q Consensus 158 ~~~~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s--~~~g~t~~~~g~~~~i~~~~~~~~~~~~ 235 (795) +.++|+++|+++++|..++++++++++.+..+.+++.+|++.++...+.+ +.++|++...+++++|.++++....+++ T Consensus 155 r~g~y~~~y~itIng~~~a~~~t~~~~~~~~~~~~~~~~ia~~l~~~~~~~~s~a~~~~~~~g~~~~i~~~~~~~~~~~~ 234 (803) T protein:vir:70 155 AYGQYGTHYKIIIDGVVAAGYKTRDGAEAHHIEDIRTESIAYNLYQSLQSWDKIADYEIQLDGTSIYITRRDGSTTFDIT 234 (803) T ss_pred eecCCcceEEEEeCCcceEEEEeCCCcccccccccchhhhhhhhhhheeccccccceEEEECCcEEEEEEcCCCCeeEEE Confidence 99999999999999999999999999999999999999999998877643 4467999999999999999998889999 Q ss_pred EecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeec---CceEEEeeeeeEeeeEeccceeEE Q lcl|NC_015271. 236 TKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTT---RKVWSETLGWNVNDQLLFETMPHA 312 (795) Q Consensus 236 ~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~---~~~w~E~~~~~~~~~~~~~t~p~~ 312 (795) ++++..++++..+++.|+++++||+.|++|+.+.|.+++.++.|.||++|+.. .++|+|+++++...+++.+|||+. T Consensus 235 t~~g~~~~~~~~~~~~v~~~~~Lp~~~~~g~~v~v~~~g~~~~d~y~v~~~~~~~~~~~w~e~a~~g~~~~~~~~t~p~~ 314 (803) T protein:vir:70 235 TEDGAKGKDLVAIKYKVASTDLLPSRAPEGYKVQVWPTGSKPESRYWLQAEKQNGNIVSWKETLAADVLIGFDKSTMPYI 314 (803) T ss_pred eecCcCCcEEEEEEecccceeeccccCCCCceEEEEcCCCCCCceeeEEEEeccCCccceEeeeccceeeeeecccccEE Confidence 99999999999999999999999999999999999999999999999999874 458999999999999999999999 Q ss_pred EEeec----CceeeecccCCccccCCcccccccccccC----CCccEEEEEcceEEEecCCeEEEEecCCcccccccccc Q lcl|NC_015271. 313 LVRAA----DGNFELKRIEWSPKTCGDDDTNPWPSFMD----STINDVFFFRNRLGLLSGENIILSRTAKYFNFYPASIA 384 (795) Q Consensus 313 ~v~~~----~~t~~~~~~~w~~~~~gd~~~np~psf~~----~~~~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~ 384 (795) +++.+ .++|++.+.+|+.|.+|||++||+|+|.+ ++|++|+||||||+|+++++|||||+||||||+++|++ T Consensus 315 ~v~~~~~~~~~~~~~~~~~~~~r~~gdd~tnp~psf~~~~~~~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~ 394 (803) T protein:vir:70 315 IERTGFVNGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTLGGMFMVQNRLCVTAGEAVIATRTSYFFDFFRYTAV 394 (803) T ss_pred EEEEEEeecceeEEEEeeccccccccccccCccccccCccCCCCceeEEEEeceEEEeeCCeEEEEccCCcccccccccc Confidence 99865 46799999999999999999999999987 67999999999999999999999999999999999999 Q ss_pred ccCCCCcEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecCcCCCCcEEeCCeEEEEec Q lcl|NC_015271. 385 TLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASP 464 (795) Q Consensus 385 ~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~ 464 (795) ++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|+++|||+|++++++|+|+|+++|+|+.+|++++|+++ T Consensus 395 ~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~i~T~~~q~~l~g~~~lTP~~~~i~~~s~~~~~~~~~Pv~vg~~v~fv~~ 474 (803) T protein:vir:70 395 SAVATDPFDVFSDASEVYQLKHAVTLDGSTVLFADKSQFILPGDKPLEKSNVLLKPVTTFEVNNNVKPVATGESVMFATS 474 (803) T ss_pred CCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEEeeccCCCccEEeCCeEEEecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCeeEEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEee Q lcl|NC_015271. 465 RSSFTSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHW 544 (795) Q Consensus 465 ~g~~~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w 544 (795) +|+|++++|| +|++.+|+|+++|||+|++|||++++++++++++++.+++|+.+++++|++|+||++++||+|+||||| T Consensus 475 ~g~~s~vre~-~~~~~~d~y~a~Dlt~~a~hl~~~~v~~~~~~~~~~~~v~~~~~~~~~l~~~~yl~~~~e~~v~aW~r~ 553 (803) T protein:vir:70 475 EGAYSGIREF-YTDSYSDTKKAQAITSHVNKLLEGNVIMMSASTNVNRLLVLTDKYRNIIYCYDWLWQGTERVQAAWHKW 553 (803) T ss_pred CCCeeEEEEE-eccccccceehhhhhhhhHhhcCCceEEEEEeCCCCeEEEEEEcCCCeEEEEEEEecCCcEEEEeEEEE Confidence 9999888876 568999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCeEEEEEEEeCCEEEEEEEeCCCE-EEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeec---- Q lcl|NC_015271. 545 DFGSNVQVLACQCINSDMYVILRNEFNT-FLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHL---- 619 (795) Q Consensus 545 ~~~g~~~~~~~~~~~d~l~~~v~R~~~~-~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~---- 619 (795) +|+|+++++|+...+|++|++|+|++++ ++|||..... ....+++++||||+.++....- +.....+.... T Consensus 554 ~~~g~~~~~~~~~~~d~l~~vv~r~~~g~~ier~~~~~~--~~~~~~~~~~lD~~~~~~~~~~--~~~~~~~~~~~~~~~ 629 (803) T protein:vir:70 554 EWPLGTFIRGMFYSGEHLYLLIERGSTGVYLERMDMGDA--LVYNLNDRIRMDRQAELIFRHI--KAEDVWVSEPLPWQP 629 (803) T ss_pred EcCCCEEEEEEEecCCEEEEEEEECCCeEEEEEEecccc--cccCCcceeEeccceeEeeccc--cCCceeeeecccccC Confidence 9999999999988999999999999876 5788765544 3346778999999988765421 11111111111 Q ss_pred ccccCCcccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceecccc Q lcl|NC_015271. 620 PTIYGADFAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDI 699 (795) Q Consensus 620 ~~~~gl~~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~ 699 (795) ..+.++.+.++......++|.+..............-..+++++++++|+|||+|+++++++||++++++|+.+. . T Consensus 630 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~g~~t~~~~~~~~~~~~~a~~v~VGl~Y~~~~~~~~~~i~~~~~~~~~----~ 705 (803) T protein:vir:70 630 TDVTLLDCVLIDGWDSYIGGSFLFSYNPGDNTLTTTFDMHDDDHVKAKVVVGQLYPQEFEPTQVVIRDNQERVSY----I 705 (803) T ss_pred cccceeeEEEeeeeeeecCCeEEEEEcCCCccceeeeeEECCCCcccEEEEeeeeeEEEeecceEEEcCCCcccc----c Confidence 122334444443333334443322221111111111236799999999999999999999999999999986543 3 Q ss_pred ccEEEEEEEEEeeccceEEEEecCCccccc--ccccccccccccccccccccccceEEEEeeecccceEEEEEECCCCCE Q lcl|NC_015271. 700 GRLQLRRAWVNYEDSGTFDIYVENQSSNWK--YTMAGARLGAHVMRTGKLNLGTGQYRFPVVGNAKFNTVFILSDATTPL 777 (795) Q Consensus 700 grl~l~~~~~~~~~t~~~~v~v~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~ 777 (795) +|+||+|++|++++|++|.+.|.+..++.. +.++++++++..+..|.+|+.+|+++|||++|+++.+|+|+|++|+|| T Consensus 706 ~~~rl~r~~~~~~~sg~~~v~v~~~~~~~~~~~~~s~~~~g~~~~~~g~~~~~tg~~~vP~~~~~~~~~v~i~~d~P~P~ 785 (803) T protein:vir:70 706 DVPTVGLVHLNLDKYPDFKVEVKNLKSGKVRNVLASNRVGGAINNIVGYVEPREGVFKFPLRSLSTDTVYRVMVESPHTF 785 (803) T ss_pred cccEEEEEEEEeecccceEEEEecCCccccceeeccchhccccccccCccccccceEEEEeeccCcceEEEEEECCCCCe Confidence 678899999999999999999998877643 458899999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEEEEeccccCC Q lcl|NC_015271. 778 NIIGCGWEGNYLRRSSGI 795 (795) Q Consensus 778 tvlsi~~eg~y~~r~rrv 795 (795) +|++|+|||+||+|+||| T Consensus 786 tvlsi~weg~y~~r~rrv 803 (803) T protein:vir:70 786 QLRDIEWEGSYNPTKRRV 803 (803) T ss_pred EEEEEEEEEEEecccccC Confidence 999999999999999999 No 12 >protein:vir:103341 Length: 806 # NCBI annotation: tail tubular protein B-like protein # Family: family:all:825 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039670;genbank:gi:125999999;genbank:GeneID:4818417 Probab=100.00 E-value=1.4e-220 Score=1225.98 Aligned_cols=775 Identities=23% Similarity=0.372 Sum_probs=685.4 Q ss_pred CceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEe-CCCceEEEE Q lcl|NC_015271. 2 ALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINR-DESEQYYAV 80 (795) Q Consensus 2 ~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~-~~~~~y~l~ 80 (795) =+|+|+||||++|||||+|++||++||++|+||+|+|+|||+||||++||+++.+. ...+.+++.|++ ++.|.|+|+ T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~~Gl~rRPgt~~va~l~~~--~~~~~~~~~~~~~~~~~~y~v~ 78 (806) T protein:vir:10 1 MEVQGSYGRQLQGVSQQPIAVRLPGQVTSQLNAVPNVVDGLKTRMGSKHLARILNS--LDANSLIHHYKRGDDAEEYFVI 78 (806) T ss_pred CeeEeecchhccceeccChhHhhhhhhhhhhcceeccccccccCCchhhhhhhcCC--CCccceEEEEEecCCceEEEEE Confidence 48999999999999999999999999999999999999999999999999998753 345678899999 556899999 Q ss_pred EeCCeEEEEec-CCcEEEEEECCCcccceecC-CchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEec Q lcl|NC_015271. 81 FTGTGIRVFDL-AGNERQVRYTTDGSTYINTN-NPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINVR 158 (795) Q Consensus 81 ~~~~~~rv~~~-~g~~~~v~~~~~~~~yl~~~-~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v~ 158 (795) |.++++|+|++ +|.++.+...+....|+.+. .++++|+|+|+||+|||+|++++|+++.+..+. .+.+++++++ T Consensus 79 ~~~g~i~v~~~~~G~~~~v~~~~~~~~yl~~~~~~~~~l~~~tvaD~tfi~n~~~~~~~~~~~~~~----~~~~~~v~v~ 154 (806) T protein:vir:10 79 LQPGQVPVIFTVGGLACPVNTQGSAATYLSSSSLPRETTQLMTIGDYTFVLNRKMPVQARGDVTPS----LDNKGLVYVA 154 (806) T ss_pred EcCCcEEEEEcCCCcEEEecCCCceEEEeccCCCCcceeeEEEEcCEEEEecCcEeeeecccccCC----CCcceEEEEe Confidence 99999999995 78888888777766677654 477899999999999999999999998775432 3456899999 Q ss_pred ccccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhccc---ccCceeeeecCceEEEEecCCcceeeEE Q lcl|NC_015271. 159 GGQYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQCRV---SAPGWTFNVGQGYIHIIAPEGQQIDSLT 235 (795) Q Consensus 159 ~~~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s---~~~g~t~~~~g~~~~i~~~~~~~~~~~~ 235 (795) +++|+++|++++++...+++++++++++......+++|++.++++++.. ..++|+....+..++|+++++. ...++ T Consensus 155 ~g~y~~~y~i~Ing~~~a~~~t~~~~~~~~~~~~~~~~~a~~l~~~l~~~~~~~~~~~~~~~g~~~~i~~~~~~-~~~~~ 233 (806) T protein:vir:10 155 YANFSFTYQILINGQVAAEHKTASSEDVKNEDLVRTDYVAGKLLENFNSRTASFPGFSMYQDGNVLVVDNSNGA-NYALT 233 (806) T ss_pred ecccCceeeEEeccceEEEEEeccCCCcccccccchhHHHHHHHhhhcccccccceeEEEEcccEEEEecCCCC-ccEEE Confidence 9999999999999999999999999999999999999999999988765 5567888889999999887654 45677 Q ss_pred EecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEee---cCceEEEeeeeeEeeeEeccceeEE Q lcl|NC_015271. 236 TKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDT---TRKVWSETLGWNVNDQLLFETMPHA 312 (795) Q Consensus 236 ~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~---~~~~w~E~~~~~~~~~~~~~t~p~~ 312 (795) +.+|+.++.+.+++++++++++||+.+++|+.++|+++.++..+.||++|+. +.++|+|+++++...+++.++||+. T Consensus 234 ~~~g~~~~~~~~~~~~v~~~~~lp~~~~~g~~v~i~~~~~~~~~~y~v~~~~~~~~~~~w~e~~~~~~~~~~~~~t~p~~ 313 (806) T protein:vir:10 234 TVDGADGQDLVAIRHKVTNLDTLPNRAPVGYKVQVWPTGSKPESRYWLQAESQDGSKVTWVETIAPGVRKGWNAATMPHV 313 (806) T ss_pred EeeCCCCceeEEeecccCccccCccccCCCcEEEEeccCCCCCCceEEEEEeeccCceEEEeecccccccceeccccceE Confidence 8899999999999999999999999999999999999999999999999975 4568999999999999999999999 Q ss_pred EEeec-----CceeeecccCCccccCCcccccccccccC----CCccEEEEEcceEEEecCCeEEEEecCCccccccccc Q lcl|NC_015271. 313 LVRAA-----DGNFELKRIEWSPKTCGDDDTNPWPSFMD----STINDVFFFRNRLGLLSGENIILSRTAKYFNFYPASI 383 (795) Q Consensus 313 ~v~~~-----~~t~~~~~~~w~~~~~gd~~~np~psf~~----~~~~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~ 383 (795) +++.+ +++|.++.++|++|.+||+++||.|+|.+ ++|++|+||||||+|+++++|||||+||||||+++|+ T Consensus 314 ~v~~~~~~~~~~~~~~~~~~w~~r~~Gd~~tn~~psF~~~~~~~~it~v~f~q~RL~f~s~~~v~~Srsgd~~nF~~~t~ 393 (806) T protein:vir:10 314 LVRESLNANGSANFTYRPGEWEDRDVGDDLTNDFPSLLNDSSPQPISSMLMVQNRLMLTSGEAVVASRTSRFFDFFRYTV 393 (806) T ss_pred EEeeeeeecccceeEEEecccccccccccccCccCcccCCCCCccceEEEEEeeeEEEecCCeEEEEccCCcccCccccc Confidence 99876 78999999999999999999999999987 6899999999999999999999999999999999999 Q ss_pred cccCCCCcEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecCcCCCCcEEeCCeEEEEe Q lcl|NC_015271. 384 ATLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFAS 463 (795) Q Consensus 384 ~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~ 463 (795) +++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|+++|||+|++++++|+|+|+++|+|+.+|++++|++ T Consensus 394 ~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~Fv~ 473 (806) T protein:vir:10 394 LATVDTDPFDVFADIEEVYNIRWSAQMDGDVVLFTSDQQFTLPGDKPLTPTSAVIRPVTQFKMTPGVKPAPSGDSILFAF 473 (806) T ss_pred cCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEeecccCCCCceEeCCeEEEee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCeeEEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEe Q lcl|NC_015271. 464 PRSSFTSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSH 543 (795) Q Consensus 464 ~~g~~~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~ 543 (795) ++|+|++++|| +|++.+|+|+++|||+|++|||+|+++++++++++|.+++|++++||+|++||||++++||+|+|||| T Consensus 474 ~~g~~s~vre~-~y~~~~d~~~~~DlT~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~e~~v~aW~r 552 (806) T protein:vir:10 474 DQGSYSGIREF-FTDSYSDTKKAQPATSHVDKYIRGKVLELSASSSFNRAFIITSPDRNILYVYDWLYEGTEKVQNAWHK 552 (806) T ss_pred CCCCeeEEEEE-EeeeeccceehhhHHHHHHHhcCCCeEEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceEEEeEEe Confidence 99998888887 48899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecCCCeEEEEEEEeCCEEEEEEEeCC--CEEEEEEEEeeccccC--CCCcceeeeeeeeeEeecCcccccccccceeec Q lcl|NC_015271. 544 WDFGSNVQVLACQCINSDMYVILRNEF--NTFLTRVSFTKSTVDL--QGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHL 619 (795) Q Consensus 544 w~~~g~~~~~~~~~~~d~l~~~v~R~~--~~~~~r~~~~~~~~~~--~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~ 619 (795) |+++|...++|+.+.+|++|++|+|++ ++...++.|+.+..+. ..+++++||||+.++.....++ ... ... T Consensus 553 w~~~~~~~~~~~~~~~d~l~~vv~R~~~~~g~~~~~iE~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~---~~~--~~~ 627 (806) T protein:vir:10 553 WSFPAGTVLHAVSYSNEKLYLVLTRTNTSGGVAGVYIEVMDMGDELEYGLQDRVRMDRRATLSMTYNAT---TRV--WTS 627 (806) T ss_pred eeeCCCeEEEEEEEecCeEEEEEEEcCCcccEEEEEEEeecCCCCCCcccceeeeccccceEEEecccc---ccc--eee Confidence 999998888888889999999999987 5555556555543322 3567889999999987653322 111 123 Q ss_pred ccccCCcccCceEEEEecCCcccccc----eeeeccCCCceEEEe---cCCCCcEEEEeEeeeEEEEecceeEEccCCcc Q lcl|NC_015271. 620 PTIYGADFAKGKITVLEADGKITEFE----EPEVGWKNDPELRLN---GNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDG 692 (795) Q Consensus 620 ~~~~gl~~~~g~~v~~~adg~~~~~~----~~~~g~~~~~~~~i~---~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~ 692 (795) ..++++.|++|+.+.+.+||..+... .+..+. ..++++. .+.++++|+|||+|+++++|+||++++++++. T Consensus 628 ~~l~~~~~~~~~~~~~~~~g~~~~~g~~~~~~~~~~--~~~v~~~~~~~~~~~~~v~vGl~Y~s~~~~t~p~~~~~~~~~ 705 (806) T protein:vir:10 628 SALPWLPQDLSSLDAVLVSGWAGYVGGAFQFSYNAS--NNTISTNFDLAEGNTATIVVGETYWYEVEPTPPLIKDSKDRV 705 (806) T ss_pred eeeccccccccceeEEEEeeccccCCceEEEEEcCc--cceEeeeeeecCCCCcEEEEeeeeeEEEEECCeeEeccCCCc Confidence 34567899999999999988644221 111111 1122222 25678899999999999999999999887655 Q ss_pred ceeccccccEEEEEEEEEeeccceEEEEecCCccc--ccccccccccccccccccccccccceEEEEeeecccceEEEEE Q lcl|NC_015271. 693 STSTEDIGRLQLRRAWVNYEDSGTFDIYVENQSSN--WKYTMAGARLGAHVMRTGKLNLGTGQYRFPVVGNAKFNTVFIL 770 (795) Q Consensus 693 ~~~~~~~grl~l~~~~~~~~~t~~~~v~v~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~ 770 (795) .. .+|+||+|+++++.+|++|.+.+.++.+. ..+.+.+.+++++.+..+.+|+.+++++|||++|+++.+|+|+ T Consensus 706 ~~----~~r~~l~r~~~~~~~s~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~tg~~~vp~~~~~~~~~v~i~ 781 (806) T protein:vir:10 706 SY----LDTPTVGNVYLNLDMYPDFSVVVTDKETLQERTVYLANKTAGSITNVIGYIAPHEGTLRIPLRRKSTDVSFKIR 781 (806) T ss_pred cc----cccEEEEEEEEEeecceeeEEEEcccCCCcceeeeccCcccccccccccccccccceEEEEeeecCceeEEEEE Confidence 43 47899999999999999999999887653 3456889999999999999999999999999999999999999 Q ss_pred ECCCCCEEEEEEEEEEEEeccccCC Q lcl|NC_015271. 771 SDATTPLNIIGCGWEGNYLRRSSGI 795 (795) Q Consensus 771 ~~~P~P~tvlsi~~eg~y~~r~rrv 795 (795) |++|+||+|+||+|||+||+|+||| T Consensus 782 ~d~P~P~tvlai~~eg~y~~r~~rv 806 (806) T protein:vir:10 782 SKSPATFQLRDIEWTGSYNPRKRRV 806 (806) T ss_pred ECCCCceEEEEEEEEEEeecccccC Confidence 9999999999999999999999999 No 13 >protein:vir:80253 Length: 777 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522886;genbank:gi:158345179;genbank:GeneID:5687516 Probab=100.00 E-value=1.4e-218 Score=1214.97 Aligned_cols=764 Identities=20% Similarity=0.294 Sum_probs=661.8 Q ss_pred CCceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCceEEEE Q lcl|NC_015271. 1 MALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESEQYYAV 80 (795) Q Consensus 1 M~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (795) ||+|+++||||++|||||+|++||++||++|+||+|+|+|||+||||++||+++++++......++++|.++++|+|+|+ T Consensus 1 M~~i~~~~~nf~~GvS~q~D~~ry~~q~~~~~N~~~~~~gG~~rRpGt~fv~~l~~~~~~~~~~~~~~~~~~~e~~~~l~ 80 (777) T protein:vir:80 1 MSYFAGSYRQLLFGVSQQTAKDRLEGQVESQLNMQSDLVTGPRRRSPVHLIADAMAATDANRLAYSLATFSGREVLLLVD 80 (777) T ss_pred CceeeeecchhhcccccCCchHHhhhHHhhhhcceeeeccCceeCcchHhhhhhcCCCcccceeEEEEecCCCeeEEEEE Confidence 99999999999999999999999999999999999999999999999999999998877777778888999999999999 Q ss_pred EeCCeEEEEecCCcEEEEEECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEeccc Q lcl|NC_015271. 81 FTGTGIRVFDLAGNERQVRYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINVRGG 160 (795) Q Consensus 81 ~~~~~~rv~~~~g~~~~v~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v~~~ 160 (795) |++++||||+++|..+.+ ....||+.+. +.++|+|+|+||+|||+|++++|+++.|..+...|.++.++|++++++ T Consensus 81 ~g~g~irv~~~~~g~~~~---~~~~~Yl~a~-~~~~l~~~q~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~~~v~~~ 156 (777) T protein:vir:80 81 TLDGTLTILDDATGEVLF---TGTNSYLTAG-TGRSIRFAALDDSVFVANTEVIPQTQLWSGASAYPDPTRAGYLYVVAG 156 (777) T ss_pred ecCCeEEEEECCCCeEEE---ecCCCceeec-cccceeEEEEcCEEEEEeCCccceeeecccCCCccCcccceEEEeecc Confidence 999999999987765433 4467898765 456899999999999999999999999998888899999999999999 Q ss_pred ccCeEEEEEECCcee-EEEEecCCCCcccccccchhHHhHhhhhhccc-----ccCceeeeecCceEEEEecCCcceeeE Q lcl|NC_015271. 161 QYGRTLQIIINGNTQ-ATYQIPDGSQPEHVNNTDAQWLAEELARQCRV-----SAPGWTFNVGQGYIHIIAPEGQQIDSL 234 (795) Q Consensus 161 ~~~~ty~vt~~g~~~-a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s-----~~~g~t~~~~g~~~~i~~~~~~~~~~~ 234 (795) +|+++|++.+++... .+++.+.+++.....+.+++|++.+|..++.+ ..++|++...+++++++++++.. + T Consensus 157 ~~g~~y~i~i~~~~~~~~~t~~~~t~~~~~~~~~~~~ia~~L~~~~~~~~~~~s~~~~~~~~~g~~~~i~~~~~~~---~ 233 (777) T protein:vir:80 157 AFSKQYRLSITNQVTGVTTSVDVTTSATEASQATGEYVITQLRTAAEADATIGTAAGFAYYQDGAYLYVTAPEAIA---V 233 (777) T ss_pred CCCceeeEeecCCcCceeEEEecCCcccccccccchhhhhhhhhhhccccceeecCceEEEeCCcEEEEEecCcee---E Confidence 999999999986543 45555666667777888899999999766543 44679999999999999887643 3 Q ss_pred EEecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeEEEE Q lcl|NC_015271. 235 TTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALV 314 (795) Q Consensus 235 ~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~~~v 314 (795) + .+++.+.++....+.|++..+||+.++.++.+.+.++++++ ++||++|+..+++|+|+++++...++ .+||+.++ T Consensus 234 t-~~~g~~~~~~~~~~~v~~~~~lp~~~~~~~~~~~~~~~~~~-~~~y~~~~~~~~~w~e~~~~~~~~~~--~t~p~~l~ 309 (777) T protein:vir:80 234 S-TDSGSNFLRASNAASIRDAAELPAKLPADADGFIIATGAAK-NKTYFRWVDLERKWDEDASRGAQAEL--IDMPLRIT 309 (777) T ss_pred e-cCCcCccceeeeeEEEeeccccccccccccceEEEeCCCCC-CceEEEEEccCcEEEEeecccccccc--cccceEEE Confidence 3 34556677888889999999999999999888888766654 67999999999999999999998876 69999999 Q ss_pred eecCceeeecccCCccccCCcccccccccccCCCccEEEEEcceEEEecCCeEEEEecCCccccccccccccCCCCcEEE Q lcl|NC_015271. 315 RAADGNFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSGENIILSRTAKYFNFYPASIATLSDDDPIDV 394 (795) Q Consensus 315 ~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~ 394 (795) +.++ +|+++..+|++|.+||+++||.|||+|++|++|+||||||+|+|+++|||||+||||||+++|++++.|||||++ T Consensus 310 ~~~~-~~~~~~~~w~~r~~gd~~tn~~Psf~g~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdDpI~~ 388 (777) T protein:vir:80 310 YSAP-NFSLTALNYERRASGDATSNPALKFTEQGISGMTTMQGRLVLLAGEYVCMSASGNPLRWFRASVSTQSDDDPIEV 388 (777) T ss_pred ecCC-ceEeeccCCccccccccccCCCceecCCceeEEEEEcceeeeecCCeEEEEeccCccccccccccCCCCCccEEE Confidence 8764 899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecCcCCCCcEEeCCeEEEEec-CCCeeEEEE Q lcl|NC_015271. 395 AVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASP-RSSFTSIHR 473 (795) Q Consensus 395 ~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~-~g~~~~v~~ 473 (795) +++++++|+|+|+++++++|+|||+++||+|+|+++|||+|++++++|+|+|+++|+|+.+|++++|+++ .|+|++++| T Consensus 389 ~~ss~~~~~i~~~v~~~~~L~i~T~~~e~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~Fv~~r~g~~s~v~e 468 (777) T protein:vir:80 389 AATAPVASPYEYAVAFNKDLVLFAKTHQGLVPGANLLTSRNATAAVVTEYSFQNSCSPVVAGRTVFFASPRSGPWSAVWE 468 (777) T ss_pred EEcCCcceeeeeeeecCCcEEEEecCceEEEeCCCcccceeEEEEEEEeeccCCCCCceEeCCeEEEEecCCCceeEEee Confidence 9999999999999999999999999999999999999999999999999999999999999999999975 467899999 Q ss_pred EEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecCCCeEEE Q lcl|NC_015271. 474 YYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGSNVQVL 553 (795) Q Consensus 474 ~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~g~~~~~ 553 (795) |+++++++|+|+++|||+|++|||++++. +++++++|.+++|++++||+|++||||++++||+|+|||||+|+|+++++ T Consensus 469 ~~~~~~~~d~y~a~Dlt~~~~hl~~~~v~-~~a~s~~p~~v~~~~~~dg~l~~~ty~~~~~e~~v~aW~r~~~~g~v~~v 547 (777) T protein:vir:80 469 MLPSQYTDAQVEASDSTSHLPKYIAGPVR-FLATSSTTSIVVVGTSNLRELVVHEYLWQGGEKVHAAWHKWSFPQDITGA 547 (777) T ss_pred eeecccccCceehhHHHHHHHHhcCCceE-EEEEcCCCceEEEEEcCCCeEEEEEEeecCCceEEEeeEEeccCCcEEEE Confidence 98777899999999999999999999855 55778888889999999999999999999999999999999999998877 Q ss_pred EEEEeCCEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeecccccCCcccCceEE Q lcl|NC_015271. 554 ACQCINSDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIYGADFAKGKIT 633 (795) Q Consensus 554 ~~~~~~d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~~g~~v 633 (795) |+ ++|+||++|+|++..++|||..... .|...++ .+||||... .+.+|+.....+ ..+++|+.+... T Consensus 548 ~~--i~d~l~~iv~r~~~~~le~~~~~~~-~d~~~~~-~~~~D~~~~---~~~~~~~~~~~~------~~~~~~~~~~~~ 614 (777) T protein:vir:80 548 YF--RGDRLILLFHVAGRVILGELFMQRL-GDAQSIP-GGFLDLYRV---GAANADEEVAIP------AFAADLYPEDST 614 (777) T ss_pred EE--ECCEEEEEEEcCCeEEEEEEeeccC-CCCcccc-eeeeeeeee---eeeeeCCcccee------EeeccccCCcce Confidence 66 5899999999999999999854433 3444444 589999643 233444433222 234555554443 Q ss_pred EEecCCccccc-----ceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEEEE Q lcl|NC_015271. 634 VLEADGKITEF-----EEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRRAW 708 (795) Q Consensus 634 ~~~adg~~~~~-----~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~~~ 708 (795) ...+.+..... ..+... .....++++++.++++|+|||+|+++++|+||++++++|+.+.. +|+||+|++ T Consensus 615 ~~v~~~~~~~~~~~~~~~v~~~-~~~~~~~v~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~g~~~~~----~r~~i~r~~ 689 (777) T protein:vir:80 615 FAYKLSGEFQSLGQRCGDRRVD-GATVYIKVVGAQAGDQYRIGLRYLSKLGPTRPILRDPNGVPITT----ERTQLHRLT 689 (777) T ss_pred eEEEecCcccccceeeeeEEeC-CceeeEEEcCCCCCCEEEEeeeeEEEEEeCceEEeCCCCceeee----cCeEEEEEE Confidence 33332221111 112211 22345789999999999999999999999999999998866543 789999999 Q ss_pred EEeeccceEEEEecCCccc-ccccccccccccccccccccccccceEEEEeeecccceEEEEEECCCCCEEEEEEEEEEE Q lcl|NC_015271. 709 VNYEDSGTFDIYVENQSSN-WKYTMAGARLGAHVMRTGKLNLGTGQYRFPVVGNAKFNTVFILSDATTPLNIIGCGWEGN 787 (795) Q Consensus 709 ~~~~~t~~~~v~v~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlsi~~eg~ 787 (795) |+|++|++|.+.|+++.++ ..+.+++.+++++.+.++.+++.++++++|+.+|+++.+|+|+|++|+||+|+||+|||+ T Consensus 690 ~~~~~sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~tg~~~vp~~~~~~~~~v~i~~d~P~P~tilsi~~e~~ 769 (777) T protein:vir:80 690 WSLDSTGEVTFRVADQARGESAYTTTPLRLYSRDLGAGLPLAATATLDTPARVDMQTAQFSLETDDYYDMNITSLEYGFR 769 (777) T ss_pred EEeeccccEEEEEcCCCCcceeeeecCceecccccccccccccceEEEEEEeecCcceEEEEEECCCCceEEEEEEEEEE Confidence 9999999999999887774 456788999999999999999999999999999999999999999999999999999999 Q ss_pred EeccccCC Q lcl|NC_015271. 788 YLRRSSGI 795 (795) Q Consensus 788 y~~r~rrv 795 (795) ||+|+||= T Consensus 770 y~~r~~r~ 777 (777) T protein:vir:80 770 YNQRYRRQ 777 (777) T ss_pred eecccccC Confidence 99996666 No 14 >protein:vir:6326 Length: 826 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877473;genbank:gi:33300845;uniprot:Q7Y2D1;genbank:GeneID:1482615 Probab=100.00 E-value=4.9e-214 Score=1190.01 Aligned_cols=769 Identities=22% Similarity=0.295 Sum_probs=640.1 Q ss_pred CCceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCC-CceEEE Q lcl|NC_015271. 1 MALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDE-SEQYYA 79 (795) Q Consensus 1 M~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~-~~~y~l 79 (795) ||+|+|+||||++|||||+|++||++||++|+||+|+|+|||+||||++||+++++++....++++|.++|++ +|+|+| T Consensus 1 M~~i~~s~~n~~~GvSqq~d~~r~~~q~~~~~N~~~~~~~G~~rRpg~~~v~~~~~~~~~~~~~~~~~~~r~~~~~~~~~ 80 (826) T protein:vir:63 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) T ss_pred CceeeeecchhhcceeccCchHhhhhhhhhhhcceeeccCCcccCchhHhhhhhccCCccccccEEEEEecCCCceEEEE Confidence 9999999999999999999999999999999999999999999999999999999887777788888888865 567888 Q ss_pred EEeCCeEEEEecCCcEEEEEECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEecc Q lcl|NC_015271. 80 VFTGTGIRVFDLAGNERQVRYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINVRG 159 (795) Q Consensus 80 ~~~~~~~rv~~~~g~~~~v~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v~~ 159 (795) +|++++||||+++|.... ......++|+.+ +++++|+|+|+||+|||+|++++|++..+.. ...+++.++++++++ T Consensus 81 ~~~~g~irv~~~~~g~~~-~~~~~~~~y~~~-~~~~~l~~~t~aD~~fi~n~~~~p~~~~~~~--~~~~~~~~~~~~v~~ 156 (826) T protein:vir:63 81 AQHRGELYLFDERDGRLL-MGQPLVHDYLKA-NDYRQLRAATVADDLFIANLSVKPEADRTDI--KGVDPNKAGWLYIKA 156 (826) T ss_pred EecCCcEEEEEcCCCeEE-EcCCCCCceeee-cCccceEEEEeCCEEEEEeCCeeeeeccccc--cccCCCCcEEEEeec Confidence 999999999998665432 222345678765 5678899999999999999999999865532 345667889999999 Q ss_pred cccCeEEEEEECCc---------eeEEEEecCCCCcc-----cccccchhHHhHhhhhhcccc----------------- Q lcl|NC_015271. 160 GQYGRTLQIIINGN---------TQATYQIPDGSQPE-----HVNNTDAQWLAEELARQCRVS----------------- 208 (795) Q Consensus 160 ~~~~~ty~vt~~g~---------~~a~~ttp~~s~~~-----~~~~~~~~~i~~~l~~~~~s~----------------- 208 (795) ++|+++|++++++. .+++++++.+.... .....+..|++.++...+... T Consensus 157 g~Y~~~y~vti~~~~~~~gt~~s~t~t~~t~~~~~a~~~~~~~~~~~s~~yia~~l~~~~~a~~~~~~~~~t~~~~~~~~ 236 (826) T protein:vir:63 157 GQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDP 236 (826) T ss_pred cccCceEEEEEEeccccCCccccceEEEEeccCCcccccccccceeeeeeeeeeeceeeeeeccccccCCCccccceecC Confidence 99999999999763 34677777654322 223345566665554332110 Q ss_pred ------cCce--eeeecCceEEEEecCCcceeeEEEecCcCcccceeEEEeccceeecccccCCC----eEEE----EEc Q lcl|NC_015271. 209 ------APGW--TFNVGQGYIHIIAPEGQQIDSLTTKDGYADQLINPVTHYAQSFSKLPTNAPEG----YVVK----IVG 272 (795) Q Consensus 209 ------~~g~--t~~~~g~~~~i~~~~~~~~~~~~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G----~~v~----v~~ 272 (795) ..++ .....+.++++..+..... ..+.++++.....+....+++.++||+.++++ +.+. +.+ T Consensus 237 ~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~g~~~~~~~~~~~~~~~~~l~~~~p~~~~~~~~~~~~~~~~~ 314 (826) T protein:vir:63 237 DANAATIAGYLNQRGVQDGYIAFRGDADIHV--EVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVM 314 (826) T ss_pred CcccceeecceeEecccccEEEEeeCCcccE--EEccCCCCcceEEEEEeeccceeeccccCCCcccceEEEeeEEeEEe Confidence 0011 1112234455444332211 12234556667778888999999999888764 3332 344 Q ss_pred CCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeEEEE-eecCceeeecccCCccccCCcccccccccccCCCccE Q lcl|NC_015271. 273 DASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALV-RAADGNFELKRIEWSPKTCGDDDTNPWPSFMDSTIND 351 (795) Q Consensus 273 ~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~~~v-~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~ 351 (795) ..+...+.||++|+...++|+||++++.. +..++||+.|+ +.++++|++++++|++|.+||+++||+|+|+|++|++ T Consensus 315 ~~g~~~d~~y~~~~~~~~~w~e~~~~~~~--~~~~tmp~~l~~~~~~~~f~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~ 392 (826) T protein:vir:63 315 ATGSTKAPVYFEWDSANRRWAERAAYGTD--WVLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTRGITG 392 (826) T ss_pred cCCCcccceEEEEEcCCceEEEEeecCcc--cccccceEEEEEeccCCeEEEeccccccccccccccCCCccccCCCceE Confidence 55667789999999999999999999975 45589999998 5688999999999999999999999999999999999 Q ss_pred EEEEcceEEEecCCeEEEEecCCccccccccccccCCCCcEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCccc Q lcl|NC_015271. 352 VFFFRNRLGLLSGENIILSRTAKYFNFYPASIATLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTL 431 (795) Q Consensus 352 v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~l 431 (795) |+||||||+|+++++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|+++| T Consensus 393 v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~~~q~~ls~~~~l 472 (826) T protein:vir:63 393 MTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIV 472 (826) T ss_pred EEEEeceEEEeeCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceEEEEEEeecCcCCCCcEEeCCeEEEEecCC-CeeEEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCC Q lcl|NC_015271. 432 TSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRS-SFTSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTE 510 (795) Q Consensus 432 TP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g-~~~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~ 510 (795) ||+|++++++|+|+|+++|+|+.+|++++|+|++| +|++++||++..+.++.|+++|||+|++|||++++.. ++++++ T Consensus 473 TP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~~~s~v~e~~~~~d~~~~y~~~dlt~~~~~l~~~~v~~-~a~s~~ 551 (826) T protein:vir:63 473 TPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEY-IQAAAS 551 (826) T ss_pred cceeEEEEEEEeecccCCCCceEeCCeEEEEecCCCceeEEEEEEeeeccccceehhHHHHHHHHhcCCCeEE-EEEcCC Confidence 99999999999999999999999999999999987 5889999875556667799999999999999998665 566777 Q ss_pred CeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecCCCeEEEEEEEeCCEEEEEEEeCCCEEEEEEEEeec----cccC Q lcl|NC_015271. 511 NFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGSNVQVLACQCINSDMYVILRNEFNTFLTRVSFTKS----TVDL 586 (795) Q Consensus 511 ~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~g~~~~~~~~~~~d~l~~~v~R~~~~~~~r~~~~~~----~~~~ 586 (795) |.+++|++++||+|++|+||++++||+|+|||||+|+|+++++|+ ++|+||++|+|+++++++|+.++.. ..++ T Consensus 552 ~~~v~~~~~~dg~l~~~~y~~~~~e~~v~aW~~~~~~g~v~~~~~--i~d~l~~iv~r~~~~~~~r~~~e~~~~~~~~~~ 629 (826) T protein:vir:63 552 SGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYF--TGDNLMVLIQKGQEIALGRMHLNSLPAREGLQY 629 (826) T ss_pred CCEEEEEEcCCCEEEEEEEeeCCCcEEEEeEEEEecCCcEEEEEE--ECCeEEEEEEeCCCEEEEEEEEEecCCcccccc Confidence 788899999999999999999999999999999999998876665 5899999999999999999855443 2334 Q ss_pred CCCcceeeeeeeeeEeecCcccccccccceeecccccCCcccCceEEEEecCCcccccceeeec-cCCCceEEEecCCCC Q lcl|NC_015271. 587 QGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIYGADFAKGKITVLEADGKITEFEEPEVG-WKNDPELRLNGNLEG 665 (795) Q Consensus 587 ~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~~g~~v~~~adg~~~~~~~~~~g-~~~~~~~~i~~~~~~ 665 (795) ..++++.++||...+...... . ..-.+++|+++..+.+.+|+.+........- ..+..++++++++.+ T Consensus 630 ~~~d~~~~~d~~~~~~~~~~~-------~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v~l~~~~~~~~ 698 (826) T protein:vir:63 630 PKYDYWRRIEATVAGELELTK-------Q----HWDLIKDASAVYQLQPVAGAYMERTHLGVKRETNTKVFLDVPEAVVG 698 (826) T ss_pred CCccceEEEEEeeeeeeccCc-------c----eeecccCcccccEEEEeeCccccCCccceEEecCCEEEEecCCCccc Confidence 556778899998776543211 0 1113689999999999999987654332221 122235677888999 Q ss_pred cEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEEEEEEeeccceEEEEecCCccc--cccccccccccccccc Q lcl|NC_015271. 666 SVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRRAWVNYEDSGTFDIYVENQSSN--WKYTMAGARLGAHVMR 743 (795) Q Consensus 666 ~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~~~t~~~~v~v~~~~~~--~~~~~~~~~~~~~~~~ 743 (795) ++|+|||+|+++++++||++++++|+++.. ||+||||++|+|.+||+|.+.|+++.++ ..+...+.++++..+. T Consensus 699 ~~v~VGl~y~s~~~~~~~~~~~~~g~~~~~----gr~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~ 774 (826) T protein:vir:63 699 AVYVVGCEFWSKVEFTPPVLRDHNGLPMTS----TRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLN 774 (826) T ss_pred cEEEEeeeeeEEEEecceEEEccCCCccee----ccEEEEEEEEEeeccccEEEEecCccccceeEeecCCceecccccc Confidence 999999999999999999999999987654 7999999999999999999999988775 3457888999998888 Q ss_pred ccccccccceEEEEeeecccceEEEEEECCCCCEEEEEEEEEEEEeccccCC Q lcl|NC_015271. 744 TGKLNLGTGQYRFPVVGNAKFNTVFILSDATTPLNIIGCGWEGNYLRRSSGI 795 (795) Q Consensus 744 ~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r~rrv 795 (795) .|++++.++++++||.+++++.+|+|+|+.|+||+|++|+|||+||+|+||| T Consensus 775 ~g~p~~~t~~~~vP~~~~~~~~~i~i~~d~P~p~~il~i~~~~~yn~r~rrv 826 (826) T protein:vir:63 775 AGEPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) T ss_pred cccccccceEEEEEEeeccceEEEEEEeCCCCcEEEEEEEEEEEEeceeecC Confidence 9999999999999999999999999999999999999999999999999999 No 15 >protein:vir:100022 Length: 976 # NCBI annotation: T7-like tail tubular protein B # Family: family:all:825 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214208;genbank:gi:61806431;genbank:GeneID:3294702 Probab=100.00 E-value=2.2e-211 Score=1175.53 Aligned_cols=773 Identities=24% Similarity=0.401 Sum_probs=658.2 Q ss_pred CCceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCccc----CcEEEEEEeCCCce Q lcl|NC_015271. 1 MALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGP----APYIHLINRDESEQ 76 (795) Q Consensus 1 M~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~----~~~l~~f~~~~~~~ 76 (795) ||+|+|+||||++|||||||.+|+|+|+++|+||+|||+.||+||||++||+.|.++..... +.++|+++|+++|+ T Consensus 1 M~~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRp~~~~v~~l~~~~~~~~~~~~~~~~~~~~r~~~e~ 80 (976) T protein:vir:10 1 MASVTQTIPTLTGGLSQQPDELKIPGQVSVANNVIPDVTHGLLKRPGGKLVASISDNGTAALNSQTNGKWFSYYRDETES 80 (976) T ss_pred CcceeecchhhhCcceecchhhcCCchhhhhhccccccccccccCCcceeeeeecCCCcccccccccceEEEEEcCCCcE Confidence 99999999999999999999999999999999999999999999999999999887654432 45779999999999 Q ss_pred EEEEEeCCe-EEEEec-CCcEEEEEECCCccc----ceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCC Q lcl|NC_015271. 77 YYAVFTGTG-IRVFDL-AGNERQVRYTTDGST----YINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPK 150 (795) Q Consensus 77 y~l~~~~~~-~rv~~~-~g~~~~v~~~~~~~~----yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~ 150 (795) |++.+...+ |+|||+ +|.+++|+...+..+ |+. ++++++|+++++||++||+|+++.|++... .....+ T Consensus 81 y~~~~~~~g~~~v~~~~~G~~~~v~~~~~~~~~~~~yl~-~~~~~~~~~~tv~d~tfi~N~~~~~~~~~~----~~~~~~ 155 (976) T protein:vir:10 81 YIGQVSRSGDINMWRCSDGQAMTVNYDSGTATALTTYLT-HTNDEDIQTLTLNDYTFLTNRTKTVAMSST----VEPVRP 155 (976) T ss_pred EEEEEecCCceEEEEccCCeEEEEEcCCCcccccchhhc-cCCcceeEEEEEccEEEEecCceEEeeccc----ccCCCC Confidence 999998775 999998 799999998876543 443 578889999999999999999999987533 224456 Q ss_pred cccEEEecccccCeEEEEEECCceeE---EEEec---------------------------------------------- Q lcl|NC_015271. 151 QDALINVRGGQYGRTLQIIINGNTQA---TYQIP---------------------------------------------- 181 (795) Q Consensus 151 ~~~~~~v~~~~~~~ty~vt~~g~~~a---~~ttp---------------------------------------------- 181 (795) +.+|+++++++|+++|+++|+|...+ ++.++ T Consensus 156 ~~~~~~v~~~~y~~~y~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~s~~~G~~~~~~~ 235 (976) T protein:vir:10 156 PEVFIDLKATAYARQYAVNLFDNTTTTAVSTVTRIDVELIKSSNNYCDSNGAMVARTSRPSNSTRCDDSAGDGRDAYAPN 235 (976) T ss_pred ceEEEEeeeeccceEEEEEEcCCcccceeeeeeeeeeccccCCcccccccccchhhHhHhhhhhcccccccccccccCce Confidence 67999999999999999999654311 11110 Q ss_pred ---------CCC-------------------------------------------------------------------- Q lcl|NC_015271. 182 ---------DGS-------------------------------------------------------------------- 184 (795) Q Consensus 182 ---------~~s-------------------------------------------------------------------- 184 (795) ++. T Consensus 236 v~~~~f~~~~G~~~~i~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~Y~~~y~~~~~v~~ 315 (976) T protein:vir:10 236 VGTKVFNVTDGASLTDEANSGSYTYTIDVKDSSNNSVNRGVNLYFRIRTVGQSVPFTTGSGSSATTTYQARYTTTFDLLY 315 (976) T ss_pred eeeeEEEeccCccceEcCCcceEEEEeeccccceEEeecCCceEEEEccccceeecccccccceeeeeeEEEEeEEEEec Confidence 000 Q ss_pred ---------C-----------------------------------cccccccchhHHhHhhhhhcccc--cCceeeeecC Q lcl|NC_015271. 185 ---------Q-----------------------------------PEHVNNTDAQWLAEELARQCRVS--APGWTFNVGQ 218 (795) Q Consensus 185 ---------~-----------------------------------~~~~~~~~~~~i~~~l~~~~~s~--~~g~t~~~~g 218 (795) . .+.....++++++.+|...+... ..++++...+ T Consensus 316 ~~~g~~~~~~~~V~v~g~~Y~it~~~~~~~~~~a~~~~~~~~~t~~d~~~~~~~~~ia~~L~~~l~a~~~~~g~tv~~~g 395 (976) T protein:vir:10 316 GGTGWQEGDYFYVWMKDGYYKITVEAISTANVQANLGLIRPNPTPFDTETAVTAESIIGDIRTAIIATGNFTSANVQQIG 395 (976) T ss_pred CCCCcccCceEEEEccccceeeEEEEeeceeEEeccccccCcCCcCcccccccHHHHHHHHHHhhcccccccceEEEEcC Confidence 0 00001134566777777766532 3677888899 Q ss_pred ceEEEEecCCcceeeEEEecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecC-----ceEE Q lcl|NC_015271. 219 GYIHIIAPEGQQIDSLTTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTR-----KVWS 293 (795) Q Consensus 219 ~~~~i~~~~~~~~~~~~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~-----~~w~ 293 (795) ++++|+++++... +++. .++.+..+.++|+++++||..|++|++|+| +...+..|+||++|+..+ ++|+ T Consensus 396 ~~~~i~~~~~~~~--~s~~---~~~~~~~~~~~V~~~~~LP~~~~~g~~v~V-~~~~~~~d~yyv~~~~~~~~~~~~~w~ 469 (976) T protein:vir:10 396 TGLYVTRPSGTFN--VTAP---SSDLLRVMSGEVANVDDLPSQCKHGYVVKV-ANSEADADDYYVKFFGHNNRDGDGVWE 469 (976) T ss_pred cEEEEEecCcceE--ecCC---CceeEEEEEeeecchhhhhhhccCCcEEEE-ecCCCCceeEEEEeeccccccccceEE Confidence 9999998886422 3322 235689999999999999999999999998 555567799999998644 5899 Q ss_pred EeeeeeEeeeEeccceeEEEEeecCceeeecccCCccccCCcccccccccccCCCccEEEEEcceEEEecCCeEEEEecC Q lcl|NC_015271. 294 ETLGWNVNDQLLFETMPHALVRAADGNFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSGENIILSRTA 373 (795) Q Consensus 294 E~~~~~~~~~~~~~t~p~~~v~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~~~v~~Sr~g 373 (795) |+++++..++++.++||+.|+++++++|++++++|+.|.+||+++||+|+|+|++|++|+||||||+|+++++|||||+| T Consensus 470 E~~~~g~~~g~~~~tmP~~l~~~~~g~f~~~~~~w~~r~vGd~~tnp~psf~g~~is~v~f~q~RL~f~s~~~v~~Srtg 549 (976) T protein:vir:10 470 ECAKPSRNIEFDKGTMPIQLVRQANGTFTVSQATWQNAEVGDELTNPNPSFVGKTINQLVFFRNRLVFLSDENVIMSRPG 549 (976) T ss_pred EeeccccccccccccccEEEEecccCeEEeeeccccccccCCcccCcCceecccccceEEEEcceEEEecCCeEEEEecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccccccccccCCCCcEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCC-ccccccceEEEEEEeecCcCCCCc Q lcl|NC_015271. 374 KYFNFYPASIATLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTAS-GTLTSRSIELNLTTQFDVQDRARP 452 (795) Q Consensus 374 d~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~-~~lTP~~~~~~~~s~~~~~~~~~P 452 (795) |||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+++ ++|||+|++++++|+|+|+++|+| T Consensus 550 d~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~e~~lsg~~~~lTP~t~~i~~~s~~~~~~~v~P 629 (976) T protein:vir:10 550 EFFNFWSKTATTFTPQDVIDLSCSSTYPAIVYDGIQVNAGLLLFTKNQQFMLTTDSDILSPETAKINAVSSYNFNEKTHP 629 (976) T ss_pred CccccccccccCCCCCccEEEEecCCcceeeEEEEecCCcEEEEecCceEEEecCCceecceeEEEEEEEeeeccCCCcc Confidence 9999999999999999999999999999999999999999999999999999985 599999999999999999999999 Q ss_pred EEeCCeEEEEecCCCeeEEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeC Q lcl|NC_015271. 453 FGIGRNVYFASPRSSFTSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYL 532 (795) Q Consensus 453 v~vg~~v~fv~~~g~~~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~ 532 (795) +.+|++++|++++|+++|+++| .++..+++|.+.|||+|++|||+|.+ .+++++++|.+++|+++++|+|++||||++ T Consensus 630 v~vG~~v~Fv~~~g~~~r~~~~-~~~~~~~~~~~~dlt~~~~~l~~g~~-~~~a~~~~~~~vv~~~~~~g~l~~~ty~~~ 707 (976) T protein:vir:10 630 VSLGTTVAFIDNANQFTRFFEM-SNVVRQGEPDVVDQSKVISRLLDKNI-SLVSVSRENSVVFFSQKDTDKIYCFRYFTS 707 (976) T ss_pred EEeCCeEEEEecCCCeEEEEEE-eecccccccchhHHHHHhhhhcCCce-EEEEEcCCCcEEEEEEcCCCEEEEEEEeec Confidence 9999999999999999999887 45667789999999999999999875 567899999999999999999999999999 Q ss_pred CCceeEEeeEeeecCCCeEEEEEEEeCCEEEEEEEeCCCEEEEEEEEeeccccC------------CCCcceeeeeeeee Q lcl|NC_015271. 533 NEELRQQSWSHWDFGSNVQVLACQCINSDMYVILRNEFNTFLTRVSFTKSTVDL------------QGEPYRAFMDMKIR 600 (795) Q Consensus 533 ~~eq~v~aW~~w~~~g~~~~~~~~~~~d~l~~~v~R~~~~~~~r~~~~~~~~~~------------~~~~~~~~lD~~~~ 600 (795) ++||+|+|||||+|+|+++++|+ ++|++|++|+|+++++++|+.|++.+... ..+.++++||++.+ T Consensus 708 ~~eq~v~aWsr~~~~G~v~sv~~--i~D~ly~vV~r~~~g~~~r~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~ 785 (976) T protein:vir:10 708 GEKRLLQAWTTWTITGNIQYHCM--LDDALYVVTRNNNKDQIVKYSLKLDDAGHFVTDTQGTTSTDDDSIYRVHLDHSSS 785 (976) T ss_pred CCceeEEeeEEEecCCcEEEEEE--eCCeEEEEEEecCCeEEEEEEEEECCccceeeeccCccccccCCcceeeeccceE Confidence 99999999999999999988776 48999999999999999999988764321 24567889999999 Q ss_pred EeecCcccccccccceeecccccCCcccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEe Q lcl|NC_015271. 601 YMIPNGTYNDDTFTTTLHLPTIYGADFAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEF 680 (795) Q Consensus 601 ~~~~~~~~~~~~~~t~~~~~~~~gl~~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~ 680 (795) +.+.+++|+.++..+.++++++. +++++.+...+|+.... ..+......+++++|++++++++|||||+|+++++| T Consensus 786 ~~~~~~t~~~~t~~t~~~~~~~~---~~~~~~~~~~~d~~~~~-~~~~~~~v~g~~i~l~g~~~~~~v~VGl~Y~s~~~~ 861 (976) T protein:vir:10 786 VTAASNTYNTTTIKTTIPKPNGY---ESTKQLVAYDTDAGNDL-GRYALVTVSGSNLEIPGNWSNNSFIIGYLYEMDVQL 861 (976) T ss_pred EEeccccccCCceeEEeecCccc---cCceeEEEEecccCccc-ccceeeeecCCeeEecCCCCCCeEEEeeeeEEEEee Confidence 99999999999988888888654 45678888888875422 222223344467899999999999999999999999 Q ss_pred cceeEEccCCccceeccccccEEEEEEEEEeeccceEEEEecCCccc-ccccccccccccccccccccccccc-eEEEEe Q lcl|NC_015271. 681 SKFRIKQVDNDGSTSTEDIGRLQLRRAWVNYEDSGTFDIYVENQSSN-WKYTMAGARLGAHVMRTGKLNLGTG-QYRFPV 758 (795) Q Consensus 681 ~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~~~t~~~~v~v~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~tg-~~~vp~ 758 (795) +||++++++|+++. ....|||+|||++|+|.+|++|++.+++.+++ +...+...+. .....+.+|+.++ .+++|| T Consensus 862 ~~~~i~~~~g~~~~-~~~~gRl~i~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~--~~~~~~~~pl~~~~~~~vP~ 938 (976) T protein:vir:10 862 PTLYVTQQVGDKYR-SDAKSSLIVHRIKFSFGPLGVYSTTIQRDGKPDFTETKELGLA--GVVGASRLPIVPEVIETVPC 938 (976) T ss_pred cceeEEeCCCCccc-ccceeeEEEEEEEEEeecccceEEEEcCCCCcccccccccccc--CcccccccceecCcEEEEEe Confidence 99999999987754 46778999999999999999999999886654 3333322222 2233456777664 578999 Q ss_pred eecccceEEEEEECCCCCEEEEEEEEEEEEecc-ccCC Q lcl|NC_015271. 759 VGNAKFNTVFILSDATTPLNIIGCGWEGNYLRR-SSGI 795 (795) Q Consensus 759 ~~~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r-~rrv 795 (795) ++|+++.+|+|+|++|+||+|++|+|||+||+| +||| T Consensus 939 ~~~~~~~~v~i~~d~PlP~tilsi~~eg~yn~r~~r~~ 976 (976) T protein:vir:10 939 YERNTNLKVNVKSEHPAPATLYSLAWEGDFTNRFYKRV 976 (976) T ss_pred ccCCceeEEEEEECCCCceEEEEEEEEEEeccceeecC Confidence 999999999999999999999999999999999 6667 No 16 >protein:vir:78957 Length: 826 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522826;genbank:gi:158345061;genbank:GeneID:5687447 Probab=100.00 E-value=5.8e-210 Score=1167.68 Aligned_cols=771 Identities=22% Similarity=0.307 Sum_probs=618.2 Q ss_pred CCceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCC-ceEEE Q lcl|NC_015271. 1 MALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDES-EQYYA 79 (795) Q Consensus 1 M~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~-~~y~l 79 (795) ||+|+|+||||++|||||+|++||++||++|+||+|+|+|||+||||++||+++++++....+++.|.++++++ |+|+| T Consensus 1 M~~i~~~~~nl~gGvSqq~d~~r~~~q~~~~~N~~~~~~gG~~rRpgt~~va~~~~~~~~~~~~f~~~~~r~s~e~~~~l 80 (826) T protein:vir:78 1 MSYKQSAYPNLLMGVSQQVAFERLPGQLSEQINMVSDPVSGLRRRSGIELMASLLHTDQPWPRPYLYHTNLGGRSIAMLV 80 (826) T ss_pred CcceeeecchhccceecccchHhhhhhhhhhhcceeccccccccCCchHhhhhhccCCcCCceeEEEEeccCCcceEEEE Confidence 99999999999999999999999999999999999999999999999999999988776655666666666554 57889 Q ss_pred EEeCCeEEEEecCCcEEEEEECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEecc Q lcl|NC_015271. 80 VFTGTGIRVFDLAGNERQVRYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINVRG 159 (795) Q Consensus 80 ~~~~~~~rv~~~~g~~~~v~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v~~ 159 (795) +|++++||||+++|....+ ..+....|+.+++ .++|+|+|+||+|||+|++++|++...... ..++..++++++++ T Consensus 81 ~~~~g~irv~~~~~g~~~~-~~~~~~~y~~~~~-~~~l~~~t~aD~~fi~n~~~~p~~~~~~~~--~~~~~~~~~~~v~~ 156 (826) T protein:vir:78 81 AQHRGELYLFDEKDGRLLM-GQPLVHDYLKASD-YRQLRAATVADDLFIANLEVRPEADKADVL--GVDPSKTGWLYIKA 156 (826) T ss_pred EEcCCcEEEEECCCCEEEE-ecCcccceeecCC-cceeEEEEEcCEEEEEcCcEeeeecccccc--CCCCCceEEEEecc Confidence 9999999999976554432 2233456766554 468999999999999999999987432222 23345679999999 Q ss_pred cccCeEEEEEECCc---------eeEEEEecCCCCc-----ccccccchhHHhHhhhhhcccccCceeee---------- Q lcl|NC_015271. 160 GQYGRTLQIIINGN---------TQATYQIPDGSQP-----EHVNNTDAQWLAEELARQCRVSAPGWTFN---------- 215 (795) Q Consensus 160 ~~~~~ty~vt~~g~---------~~a~~ttp~~s~~-----~~~~~~~~~~i~~~l~~~~~s~~~g~t~~---------- 215 (795) ++|+++|+|++++. .+++|.+|.++.. .........|++.++..... ....|... T Consensus 157 g~y~~~y~v~i~~~~~~~~~~~s~t~~y~t~~~~~~~~~~~~~~~~~~~~~~a~~l~~~~~-~~~~~~~~~~t~~~~~~~ 235 (826) T protein:vir:78 157 GQYSKAFSLTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLFGKFF-GAPEYTLPNSTKKYPKVD 235 (826) T ss_pred cccCceeEEEeccceeecccccceeEEEEeccCCccccccccccceecchhhheecceeec-cccceeeeccceeEeecc Confidence 99999999999862 3467777776543 23344456677777765432 11111111 Q ss_pred ----------------ecCceEEEEecCCcceeeEEEecCcCcccceeEEEeccceeecc----cccCCCeEEEE----E Q lcl|NC_015271. 216 ----------------VGQGYIHIIAPEGQQIDSLTTKDGYADQLINPVTHYAQSFSKLP----TNAPEGYVVKI----V 271 (795) Q Consensus 216 ----------------~~g~~~~i~~~~~~~~~~~~~~dg~~~t~~~~~~~~v~~~~~l~----~~~~~G~~v~v----~ 271 (795) ..++++++.+++.... ..+.+++++.........|+.+++|| ..+.+|+.+.+ . T Consensus 236 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~--~~~~~~g~~~~~~~~~~~v~~~~~l~a~~p~~~~~~~~~~~~~~~~ 313 (826) T protein:vir:78 236 PDPAAATVAGYLNQRGVQDGYIAFRGDGDIVV--EVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAI 313 (826) T ss_pred ccccceeeccceeecccccceEEEecCCCeEE--EeccCCCccceEEEeeEEEecccceeeeecccccceEEEEEEeeeE Confidence 1123344443322211 12335555566677777788877765 44556666553 4 Q ss_pred cCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeEEEEe-ecCceeeecccCCccccCCcccccccccccCCCcc Q lcl|NC_015271. 272 GDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALVR-AADGNFELKRIEWSPKTCGDDDTNPWPSFMDSTIN 350 (795) Q Consensus 272 ~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~~~v~-~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~~~ 350 (795) ...++..+.||++|+..+++|+|++++++. ++.++||+.+++ .++++|+++..+|++|.+||+++||+|+|.|++|+ T Consensus 314 ~~~g~~~~~~y~~~~~~~~~w~e~a~~g~~--~~~~tmp~~l~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~i~ 391 (826) T protein:vir:78 314 MATGSTKAPVYFAWDAANRRWAERAAYGTD--WVLKKMPLALRWDESTDTYSLNELEYDRRGSGDEETNPTFNFVKRGIT 391 (826) T ss_pred ecCCCcccceeEEEEcCCceEEEeeccCcc--cccccccEEEEEecCCCeEEEeeccccccccCcccccCcccccCCCce Confidence 445667789999999999999999999975 577899999984 67899999999999999999999999999999999 Q ss_pred EEEEEcceEEEecCCeEEEEecCCccccccccccccCCCCcEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCcc Q lcl|NC_015271. 351 DVFFFRNRLGLLSGENIILSRTAKYFNFYPASIATLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGT 430 (795) Q Consensus 351 ~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~ 430 (795) +|+||||||+|+++++|||||+||||||++++++++.|||||+++++++++|.|+|+++++++|+|||+++||+|+|+++ T Consensus 392 ~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~~s~~~~~i~~~v~~~~~L~l~T~~~e~~l~~~~~ 471 (826) T protein:vir:78 392 GMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGI 471 (826) T ss_pred EEEEEeceEEEeeCCeEEEEeccCccccccccccCCCCCCcEEEEEccCcceeEEEEEecCCcEEEEecCcEEEEeCCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCC-CeeEEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCC Q lcl|NC_015271. 431 LTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRS-SFTSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSST 509 (795) Q Consensus 431 lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g-~~~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~ 509 (795) |||+|++++++|+|+|+++|+|+.+|++++|++++| +|++++||++..+.++.|+++|||+|++|||++++..+ ++++ T Consensus 472 lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~F~~~r~~~~s~v~e~~~~~~~~~~y~~~dlt~~~~~l~~~~v~~~-a~s~ 550 (826) T protein:vir:78 472 VTPRTAVISITTQYDVDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYI-QAAA 550 (826) T ss_pred ccceeEEEEEEEeecccCCCCceEeCCeEEEEecCCCceeEEEEEEeeecccCccchHHHHHHHHHhcCCCeEEE-EEeC Confidence 999999999999999999999999999999999887 58899998755566667999999999999999987654 6666 Q ss_pred CCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecCCCeEEEEEEEeCCEEEEEEEeCCCEEEEEEEEeeccccCC-C Q lcl|NC_015271. 510 ENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGSNVQVLACQCINSDMYVILRNEFNTFLTRVSFTKSTVDLQ-G 588 (795) Q Consensus 510 ~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~g~~~~~~~~~~~d~l~~~v~R~~~~~~~r~~~~~~~~~~~-~ 588 (795) +|.+++|++++||+|++|||+++++||+|+|||||+|+|+++++|+ ++|+||++|+|+++++++||.++..+.+.. . T Consensus 551 ~~~~~v~~~~~~g~l~~~ty~~~~~e~~v~aW~~~~~~g~v~~v~~--i~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~ 628 (826) T protein:vir:78 551 SSGYLVFGTSAADEMICHQYLWQGNEKVQNAYHRWTLRHQIIGAYF--TGDNLMVLIQKGQEIALGRMHLNSLPAREGLQ 628 (826) T ss_pred CCCeEEEEEcCCCeEEEEEEEecCCcEEEEeEEEEccCCcEEEEEE--ECCeEEEEEEeCCCEEEEEEEEEecCCCcccc Confidence 7777899999999999999999999999999999999998886665 589999999999999999996655433221 1 Q ss_pred Ccc-eeeeeeeeeEeecCcccccccccceeecccccCCcccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCcE Q lcl|NC_015271. 589 EPY-RAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIYGADFAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSV 667 (795) Q Consensus 589 ~~~-~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~ 667 (795) ... .++.++...+. +.... ....-..+....++.|++|+.+....++.+.... .+.+.+++++++++.+++ T Consensus 629 ~~~~~~~~~~~~~~~--~~~~~--~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~----~~~~~~~l~~~~~~~~~~ 700 (826) T protein:vir:78 629 YPKYDYWRRIEATVD--GELEL--TKQHWDLIKDGAAVYQLQPQVGAYMERYQLGVKR----ETSTKVFLDVPEAVVGSV 700 (826) T ss_pred ccccceeEEEEEEEc--ceecc--ccceeEEecCCceeeeeccceeeeccccceeccc----cCCCceEEEeCCCccccE Confidence 111 12334333221 11110 1111123344456778888777666665543222 223345788999999999 Q ss_pred EEEeEeeeEEEEecceeEEccCCccceeccccccEEEEEEEEEeeccceEEEEecCCccccc--cccccccccccccccc Q lcl|NC_015271. 668 VYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRRAWVNYEDSGTFDIYVENQSSNWK--YTMAGARLGAHVMRTG 745 (795) Q Consensus 668 v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~~~t~~~~v~v~~~~~~~~--~~~~~~~~~~~~~~~~ 745 (795) |+|||+|+++++++||++++++|+++.. ||+||+|++|+|.+|+.|.+.|+++.++.. +...+.++....+..+ T Consensus 701 v~VGl~y~s~~~~~~~~~~~~~g~~~~~----~r~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~l~~g 776 (826) T protein:vir:78 701 YVVGCEFWSKVEFTPPVLRDHNGLPMTS----TRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLSSRQLNAG 776 (826) T ss_pred EEEeeceeEEEEeCceEEecCCCcceee----cceEEEEEEEEeeccccEEEEeCCCccCcceeeeecccccccccccCC Confidence 9999999999999999999999987654 799999999999999999999998877643 3455566666666778 Q ss_pred ccccccceEEEEeeecccceEEEEEECCCCCEEEEEEEEEEEEeccccCC Q lcl|NC_015271. 746 KLNLGTGQYRFPVVGNAKFNTVFILSDATTPLNIIGCGWEGNYLRRSSGI 795 (795) Q Consensus 746 ~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r~rrv 795 (795) ++...++++++|+.+++++.+|+|+|+.|+||+|++|+|||+||+|+||| T Consensus 777 ~~~~~t~~v~vp~~~~~~~~~i~i~~d~P~P~tvlai~~~~~y~~r~rrv 826 (826) T protein:vir:78 777 EPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) T ss_pred cccccceEEEEeeeccCceEEEEEEeCCCCcEEEEEEeEEEEecceeecC Confidence 88888999999999999999999999999999999999999999999999 No 17 >protein:vir:78703 Length: 905 # NCBI annotation: tail tube B # Family: family:all:825 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285450;genbank:gi:148724484;genbank:GeneID:5220174 Probab=100.00 E-value=1.3e-209 Score=1165.81 Aligned_cols=770 Identities=25% Similarity=0.406 Sum_probs=648.5 Q ss_pred CCceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCceEEEE Q lcl|NC_015271. 1 MALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESEQYYAV 80 (795) Q Consensus 1 M~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (795) ||+|+|+||||++|||||||.+|+|+|+++|+||+|||+.||+||||++||+.|.++ +..+.++|+|+||+.|+|++. T Consensus 1 M~~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRp~~~~i~~l~~~--~~~~~~~~~~~r~~~e~y~~~ 78 (905) T protein:vir:78 1 MGAVLQKIPNLLGGVSQQPDPVKLPGQVREAENVYLDPTFGCRKRPATKFVGELATN--LPSDTRWFPIFRDAGERYAVA 78 (905) T ss_pred CccceecchhhhCceeecchhhcCCcchhhhhccccccccccccCchhhhhhhhcCC--CCCCceEEEEEeCCCceEEEE Confidence 999999999999999999999999999999999999999999999999999998765 346789999999999999999 Q ss_pred EeCCe-----EEEEec-CCcEEEEEECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccE Q lcl|NC_015271. 81 FTGTG-----IRVFDL-AGNERQVRYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDAL 154 (795) Q Consensus 81 ~~~~~-----~rv~~~-~g~~~~v~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~ 154 (795) +...+ |||||+ +|.+++|+..+....||.+. ++++|+++++||++||+|++++|++... ....+++++| T Consensus 79 ~~~~g~~~~~i~v~d~~~G~~~~V~~~~~~~~yl~~~-~~~~l~~~tv~d~tfi~N~~~~~~~~~~----~~~~~~~~~~ 153 (905) T protein:vir:78 79 LYKDGSGNTQVRVWDMQTGAERTVTPDATATAYLATT-NLNNLNWLTVADYTLLSNKERIVTMSGA----SEVDSNQRAL 153 (905) T ss_pred EeeCCCCCcceEEEEccCCcEEEEecCCCccceeecC-CCcceEEEEEcCEEEEEcCceeeeecCC----CCcCCCCeEE Confidence 97655 999999 79999999888888999765 5889999999999999999999987432 3457778999 Q ss_pred EEecccccCeEEEEEECCceeEEE---EecCC------------------------------------------------ Q lcl|NC_015271. 155 INVRGGQYGRTLQIIINGNTQATY---QIPDG------------------------------------------------ 183 (795) Q Consensus 155 ~~v~~~~~~~ty~vt~~g~~~a~~---ttp~~------------------------------------------------ 183 (795) +++++|+|+++|+++|++.....+ +++.+ T Consensus 154 ~~v~~g~y~r~y~v~I~~~~~~~~~t~~~~~a~~~s~~s~~~~~~g~~~~~~~~~~~~~t~~~~~~l~f~~~~~~~~~~~ 233 (905) T protein:vir:78 154 VEINAISYNTTYSIDLDRDGASQQVKVYRAKALEISPGSFEVEDGGVCTEHDVQNYTNQTIGSSTGLAFQVRVQCAAYLE 233 (905) T ss_pred EEEEeeccceeEEEEEeCCCCceeeeeeccccceeccccccccccccccccceeeeecceeeccCCceeEEeeccccccC Confidence 999999999999999976422111 00000 Q ss_pred ---------CCc--------------------------------------------------ccccccchhHHhHhhhhh Q lcl|NC_015271. 184 ---------SQP--------------------------------------------------EHVNNTDAQWLAEELARQ 204 (795) Q Consensus 184 ---------s~~--------------------------------------------------~~~~~~~~~~i~~~l~~~ 204 (795) +.. ..+...+.++++.+|.++ T Consensus 234 ~~~~~~~~~~~~~l~~g~~~~~~~~~~~v~~~g~~y~i~i~~~~~~~~~~~~~~~~~~t~~d~~a~~~~~~~i~~~l~~~ 313 (905) T protein:vir:78 234 NNEYRSRYNVSVVLQNGGTGFRKGDMITVNLNGRDYNIRVTQEEFVYTYASDGTAAHTTPQDSTAGTLDIGQITAGLVNS 313 (905) T ss_pred CCcccccccceeeeeccccccccCccEEEeeccceEEEEEecceeEEEecCCCcccccCccCccCccccHHHHHHHHHHh Confidence 000 001112235777777776 Q ss_pred cccccCceeeeecCceEEEEecCCcceeeEEEecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEE Q lcl|NC_015271. 205 CRVSAPGWTFNVGQGYIHIIAPEGQQIDSLTTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVR 284 (795) Q Consensus 205 ~~s~~~g~t~~~~g~~~~i~~~~~~~~~~~~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~ 284 (795) +. ..+.|++...+++++|.++++... .+++++|+.+++++++++.|+++++||+++++|++|+|.++.+++.|+||++ T Consensus 314 ~~-~~~~~~~~~~g~~i~v~~~~~~~~-~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~~~~g~~v~v~~~~~~~~d~yyv~ 391 (905) T protein:vir:78 314 VN-LISNYSAQAVGNVIEIERTDGRDF-NLGVRGGATNRAMTAIKGTANSIVDLPGQCFDGFELKVINTENAESDDYYVV 391 (905) T ss_pred hc-ccccEEEEecCcEEEEEecCCCcc-EEEEeccCCcceEEEEeccccccccCccccCCCcEEEEEeCCCCCcceEEEE Confidence 64 567799999999999999988654 5789999999999999999999999999999999999999999999999999 Q ss_pred Eee------cCceEEEeeeeeEeeeEeccceeEEEEeecCceeeecccC-------CccccCCcccccccccccCCCccE Q lcl|NC_015271. 285 YDT------TRKVWSETLGWNVNDQLLFETMPHALVRAADGNFELKRIE-------WSPKTCGDDDTNPWPSFMDSTIND 351 (795) Q Consensus 285 ~~~------~~~~w~E~~~~~~~~~~~~~t~p~~~v~~~~~t~~~~~~~-------w~~~~~gd~~~np~psf~~~~~~~ 351 (795) |+. +++.|+||++++...+++.+||||.|+++++++|++...+ |++|.+||+++||+|+|+|++|++ T Consensus 392 ~~~~~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~r~~~g~f~~~~~~~~~~~~~~~~r~~Gd~~Tnp~psf~g~~is~ 471 (905) T protein:vir:78 392 FRSAAEGIPGSGSWEETVAPGIERGFNTSTMPHALIRQADGNFTLEALNDEGTITGWAQREVGDDDTNPKPSFVGRGISD 471 (905) T ss_pred EEecccCCcCceeEEEecccccccccccccccEEEEEecCceEEEEEeccccccccccccccCCcccCCCCcccCCCcce Confidence 964 4668999999999999999999999999999999998886 999999999999999999999999 Q ss_pred EEEEcceEEEecCCeEEEEecCCccccccccccccCCCCcEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCc-c Q lcl|NC_015271. 352 VFFFRNRLGLLSGENIILSRTAKYFNFYPASIATLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASG-T 430 (795) Q Consensus 352 v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~-~ 430 (795) |+||||||+|+++++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|++ + T Consensus 472 v~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdDpI~~~~ss~~~~~i~~~v~~~~~L~ifT~g~ef~lsg~~~~ 551 (905) T protein:vir:78 472 MFFYNNRLGFLSEDAVIMSQPGDYFNFFVTSAITISDSDPIDVTASSTKPAILRAAIGAPKGLILFAENSQFLLASQEVV 551 (905) T ss_pred EEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeEEEeecCCcEEEEecCceEEEecCCcc Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999865 7 Q ss_pred ccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCeeEEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCC Q lcl|NC_015271. 431 LTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTE 510 (795) Q Consensus 431 lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~ 510 (795) |||+|++++++|+|+|+++|+|+.+|++++|++++|+|++++|| +|++.+|+|+++|+|+|++|||+|+++.+ ++++ T Consensus 552 lTP~s~~i~~~S~~~~~~~v~Pv~vG~~vlFv~~~g~~s~vre~-~y~~~~d~y~a~DlT~~a~hl~~g~v~~~--~~s~ 628 (905) T protein:vir:78 552 FSTATIKLTEISDYFYRSLAKPVSTGVSIAFVSEADTYSKIFEM-SIDSVDNRPQVADITRIVPEYVPTGLTWS--VSTP 628 (905) T ss_pred ccceeEEEEeEEeecccCCCCcEEeCCeEEEeecCCCeeEEEEE-EeeecccceehhHHHHHHHHhcCCceEEE--EecC Confidence 99999999999999999999999999999999999999888885 78899999999999999999999997644 4556 Q ss_pred CeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecCCCeEEEEEEEeCCEEEEEEEeCCCEEEEEEEEeeccc---cCC Q lcl|NC_015271. 511 NFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGSNVQVLACQCINSDMYVILRNEFNTFLTRVSFTKSTV---DLQ 587 (795) Q Consensus 511 ~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~g~~~~~~~~~~~d~l~~~v~R~~~~~~~r~~~~~~~~---~~~ 587 (795) |..++|+++++|+|++|+||++++||+|+|||||+|+|.++++|+. .|++|++|+|..++...++.+++... ... T Consensus 629 ~~~~v~~~~~~~~l~~ytyl~~~~eq~v~AWsrw~~~G~~~~~a~i--~d~~~~vV~r~~~G~~~~~~~~l~~~~~~~~~ 706 (905) T protein:vir:78 629 NNSMMLFGDNSNTAYIFKFFNQGNERQVAGWSKWILPGEQRMCGFF--ADTGYFVLYDSTTGSYVLSAMELLDDPDSASI 706 (905) T ss_pred CCcEEEEEcCCCeEEEEEeecCCCceeEEeEEEEecCCCeEEEEEE--cCCEEEEEEEccCCeEEEEEEeeccccCcccc Confidence 7778899999999999999999999999999999999999887665 58899999998887777766554211 111 Q ss_pred CCcceeeeeeeeeEeecCc-ccccccccceeecccccCCcccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCc Q lcl|NC_015271. 588 GEPYRAFMDMKIRYMIPNG-TYNDDTFTTTLHLPTIYGADFAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGS 666 (795) Q Consensus 588 ~~~~~~~lD~~~~~~~~~~-~~~~~~~~t~~~~~~~~gl~~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~ 666 (795) +...+.++||...+..+.. ++.... .........+++.|++++.+.+.+||..........+. .. .+. ...++ T Consensus 707 d~~~~~~~~~~d~~~~~~~~t~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~dG~~~~~~~~~~~~--~~--~~t-~~~a~ 780 (905) T protein:vir:78 707 DTAFSSFLPRLDNYVVKSDLTVVDNG-DGTLTVDLEAGQAMTGATPVIMFTDGPSEFAFSQPTIT--AG--QFT-VDTTD 780 (905) T ss_pred ccceeeeeeccceeeecccceecccC-cceEeeeccCccccccceeEEEeeCCceeeeEEEEEee--ce--eec-cccCC Confidence 2223445666555554433 222211 12223344567788888888888898764322221110 00 122 23678 Q ss_pred EEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEEEEEEeeccceEEEEecCCcccccccccccccccccccccc Q lcl|NC_015271. 667 VVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRRAWVNYEDSGTFDIYVENQSSNWKYTMAGARLGAHVMRTGK 746 (795) Q Consensus 667 ~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~ 746 (795) +|+|||+|+++++++||+++.++++.. .+|++|+|++|+|++|++|++++++.+++........+..+..+.++. T Consensus 781 ~v~VGl~Y~s~v~~~p~~~~~~~~s~~-----~~~~rI~rv~lr~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~ 855 (905) T protein:vir:78 781 DFVVGFKYETKITLPGFFTSEENKADR-----VYAPIVEFLYLDLYYSGRYQIEVDRIGYDTINIDAGSIDANIYLADGA 855 (905) T ss_pred eEEEeeeeeEEEeecceEeccCCCccc-----ccceEEEEEEEEeecceeEEEEEcCCCcceecccccceecCcccCccc Confidence 899999999999999999988776543 368899999999999999999999988775443344455555566778 Q ss_pred cccccceEEEEeeecccceEEEEEECCCCCEEEEEEEEEEEEecc-ccCC Q lcl|NC_015271. 747 LNLGTGQYRFPVVGNAKFNTVFILSDATTPLNIIGCGWEGNYLRR-SSGI 795 (795) Q Consensus 747 ~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r-~rrv 795 (795) +++.++.++|||.+|+++.+|+|+|++|+||+|+||+|||+||+| ++|| T Consensus 856 p~~~tg~~~vP~~g~~~~~~v~I~sd~PlP~tvlsi~weg~Yn~r~~~~~ 905 (905) T protein:vir:78 856 PLKEIATENVPLFTPGDQVTVTIKAPDPFPSAITGYSWQGHYNRRGIAFI 905 (905) T ss_pred ccccccEEEEEeeccCceeEEEEEECCCCcEEEEEEEEEEEeccceeecC Confidence 888999999999999999999999999999999999999999999 6677 No 18 >protein:vir:103790 Length: 768 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024932;genbank:gi:48697202;genbank:GeneID:2846114 Probab=100.00 E-value=2.9e-179 Score=999.41 Aligned_cols=725 Identities=13% Similarity=0.099 Sum_probs=556.3 Q ss_pred CCceeeechhhccc-----cccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCc Q lcl|NC_015271. 1 MALISQSIKNLKGG-----ISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESE 75 (795) Q Consensus 1 M~~v~~~~~n~~~G-----vS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~ 75 (795) ||+++++++||.+| +++|+|++||++||++|+||+++|+|||+||||++||+++++.+ ...+|+||.|+++| T Consensus 1 M~~~~~~~~~F~~GelsP~l~~r~Dl~ry~~~~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~---~~~~lipf~~~~~~ 77 (768) T protein:vir:10 1 MPKAAPQQVSFDAGELSPLLGARVDLAKYPNGCQVMENFIATVQGPAIRRGGKRFVAATKDST---KQSWLLPFIVADGI 77 (768) T ss_pred CCcceeeeeeccCceechhhcccchHHHHHHHHhhhhcceeeecCCceecCchhhhhhhcCCC---CCeeEEEEEecCcc Confidence 99999999999999 78899999999999999999999999999999999999987654 46799999999999 Q ss_pred eEEEEEeCCeEEEEecCCcEEEE-----EECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCC Q lcl|NC_015271. 76 QYYAVFTGTGIRVFDLAGNERQV-----RYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPK 150 (795) Q Consensus 76 ~y~l~~~~~~~rv~~~~g~~~~v-----~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~ 150 (795) +|+|+|++++||||+.+|.+... ...+....||.+.+++++|+|+|+||+|||+|++|||+++.|.++ +.|+.. T Consensus 78 ~y~l~fg~~~irv~~~~g~v~~~~~~~e~~tp~~~~~l~~~~~~~~L~~~q~aD~~~i~~~~~~p~~l~r~~~-~~w~l~ 156 (768) T protein:vir:10 78 AYMLEFGDHYIRFFVNRGQLVNAGAPVEIATPYALADLTTEDGTFAIRATQSADTMYLFHGGYPTQKLLRTSA-TTFSLQ 156 (768) T ss_pred EEEEEEcCCEEEEEECCcEEEecCeeEEEEcCCCcceeecccccceeEEEeecCEEEEEcCCcceeEEEEecC-CCceeE Confidence 99999999999999988865431 233455566777788889999999999999999999999999876 558888 Q ss_pred cccEEEecccccCeEEE--EEECCceeEEEEecCCCCcccccccchhHHhHhhhh--hcccccCceeeeecCceEEEEec Q lcl|NC_015271. 151 QDALINVRGGQYGRTLQ--IIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELAR--QCRVSAPGWTFNVGQGYIHIIAP 226 (795) Q Consensus 151 ~~~~~~v~~~~~~~ty~--vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~--~~~s~~~g~t~~~~g~~~~i~~~ 226 (795) ++.|...+++.++...+ ++..+...+.+.++++. ..+++++...+.. .......+|......+...+... T Consensus 157 ~~~~~~gp~~~~n~~~~vti~~s~~~~~~T~tasa~------~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~g~~~~~~~ 230 (768) T protein:vir:10 157 PVTFVGGPFAAVNSDNNVRVHASAGTGAVTLVASAS------VFRPSDVGTLFYLEQEDNSFVKPWVVHQKIGPSELRRV 230 (768) T ss_pred EeeecCccccccccceeEEEEecccceeEEEeecCC------ccchhhcceeeeeeeeccccccccEEEEeeeeEEEEec Confidence 88887777777665444 44454444333333322 2233333322211 12223345544433333333344 Q ss_pred CCcceeeEEEecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceE-----EEEEeecCceEEEeeeeeEe Q lcl|NC_015271. 227 EGQQIDSLTTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQY-----YVRYDTTRKVWSETLGWNVN 301 (795) Q Consensus 227 ~~~~~~~~~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~y-----y~~~~~~~~~w~E~~~~~~~ 301 (795) ++.......+.++... ..-.+++.+..|+...+.++.....+.+ ++++......|.+ . T Consensus 231 ~~~~~~~~~~~~~~~~-----------~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------i 293 (768) T protein:vir:10 231 GDRVYLCTAVGTATPQ-----------VTGTETPTHTSGSRWDGTGQDESATDEYGSIGAEWEYQHSGYGTVL------I 293 (768) T ss_pred CCceEEeeeecccccc-----------ccceeccccccCceEEEecCcccccccccccceEEEEEEcCCceEE------E Confidence 3333333333322211 0112334455566555545443322221 2233332222222 2 Q ss_pred eeEeccceeEEEEeecCceeee--------cccCCccccCCcccccccccccCCCccEEEEEcceEEEecCCeEEEEecC Q lcl|NC_015271. 302 DQLLFETMPHALVRAADGNFEL--------KRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSGENIILSRTA 373 (795) Q Consensus 302 ~~~~~~t~p~~~v~~~~~t~~~--------~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~~~v~~Sr~g 373 (795) .+....+|++..++....++.. ....|..+..+++++|| ||++|+||||||+|+|+++|||||+| T Consensus 294 ~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~g-------~Ps~v~f~q~RL~f~~~~~v~~Srtg 366 (768) T protein:vir:10 294 TGYTNDQVVTGTVATNDPADPGMLPNTVVTLTGTYKWARSLFNSTDG-------FPQMGTFWRNRLCLMRDRWLAMSVSA 366 (768) T ss_pred EEecCCeeEEeeeeeecCcccccccccccccCCCcccccCCCcCCCC-------CceEEEEEeeeEEEeeCCEEEEEccc Confidence 2334455666655554433332 23345555555555565 55779999999999999999999999 Q ss_pred CccccccccccccCCCCcEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCC---ccccccceEEEEEEeecCcCCC Q lcl|NC_015271. 374 KYFNFYPASIATLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTAS---GTLTSRSIELNLTTQFDVQDRA 450 (795) Q Consensus 374 d~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~---~~lTP~~~~~~~~s~~~~~~~~ 450 (795) |||||++++++++.|||||+++++++++|+|+|++++ ++|+|||+++||+|+++ ++|||+|++++++|.|+++ +| T Consensus 367 d~~nF~~~s~~~~~DdD~I~~~~ss~~~~~i~~~v~~-~~L~i~T~~~q~~l~~~~~~~~lTP~~~~i~~~s~~g~~-~~ 444 (768) T protein:vir:10 367 DFETFKTKDADQQTDDSAIVQQLNARQLNKLAWMVES-DSLLIGMTGDEWVIGPANASQPVSAANLNAARRTSYGSK-RI 444 (768) T ss_pred ccccccccccccccCCccEEEEecCCcceeEEEEeec-CcEEEEecCceEEEecCCCCcccccceEEEEEeehhccc-cc Confidence 9999999999999999999999999999999999999 58999999999999873 5899999999999999765 79 Q ss_pred CcEEeCCeEEEEecCCCeeEEEEEEeeccccCceehhhHHHHHHHhcCCCc-----EEEEEeCCCCeEEEEEEcCCCEEE Q lcl|NC_015271. 451 RPFGIGRNVYFASPRSSFTSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGV-----FDICGSSTENFCAVLSQGDQSKIF 525 (795) Q Consensus 451 ~Pv~vg~~v~fv~~~g~~~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~-----~~~~~~~~~~~~~~~~~~~dg~l~ 525 (795) +|+.+|++++|+|++|++ +|+++|++++|+|+++|+|+|++||+++.. +..+++++.|..++||+++||+|+ T Consensus 445 ~Pv~vG~~v~fv~~~g~~---vre~~y~~~~d~y~a~DlT~~a~hl~~~~~~~~~~i~~~a~~~~p~~v~~~v~~dg~l~ 521 (768) T protein:vir:10 445 QPVQVGGTIMFVQKAGRK---LRDFKYDFSSDNYVSTDVTKIADHITRGRAGTNSGIMSLCFQQEPHSVVWAARADGQLI 521 (768) T ss_pred ccEEeCCeEEEEcCCCCE---EEEEEeeeecCceecchhhhhhhhhccccCccccceeeEEEeecCCeEEEEEecCCeEE Confidence 999999999999999953 455678999999999999999999999864 677899999999999999999999 Q ss_pred EEEEeeCCCceeEEeeEeeec-CCCeEEEEEEE----eCCEEEEEEEeCCCEEEEEEEEeeccc--cCCCCcceeeeeee Q lcl|NC_015271. 526 MYKFLYLNEELRQQSWSHWDF-GSNVQVLACQC----INSDMYVILRNEFNTFLTRVSFTKSTV--DLQGEPYRAFMDMK 598 (795) Q Consensus 526 ~~ty~~~~~eq~v~aW~~w~~-~g~~~~~~~~~----~~d~l~~~v~R~~~~~~~r~~~~~~~~--~~~~~~~~~~lD~~ 598 (795) +|+|++++++|+|+|||||++ +|.++++|+.. .+|+||++|+|++++..+|+.|.+.+. ......+++||||+ T Consensus 522 ~~ty~~e~~~q~v~aW~~~~~~~g~v~~v~~i~~~~g~~d~l~~~v~r~~~g~~~~~ie~l~~~~~~~~~~~~~~~~D~~ 601 (768) T protein:vir:10 522 GCTYDEEAGRSDVYGWHRHPDANGFVECVASMPAPDGASDDLWVIVRRQVNGQTVRYVEYLNPALQDDEPQSSAFYVDAG 601 (768) T ss_pred EEEEecCCCceeEEeEEEEEcCCCEEEEEEEEecCCCCccEEEEEEEecCCCeEEEEEEecCcccccccccccceEeccc Confidence 999999888899999999985 78888887753 369999999999999999988877652 22345678999998 Q ss_pred eeEeecCcccccccccceeecccccCCcccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEE Q lcl|NC_015271. 599 IRYMIPNGTYNDDTFTTTLHLPTIYGADFAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVY 678 (795) Q Consensus 599 ~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~ 678 (795) .++... +...++|++|++|+++.+++||..++...+.+| +|+++.++++|+|||+|++++ T Consensus 602 ~~~~~~-------------~~~~~~gl~~leg~~v~v~~dG~~~~~~~v~~g-------~itl~~~~~~v~vG~~y~s~~ 661 (768) T protein:vir:10 602 ITYNGV-------------PTSTIAGLGHLEGVTVAVLTDGAVHPSRTVTAG-------AITLDWSASIVHIGVPTTCRI 661 (768) T ss_pred cccCCc-------------ceeeecCCCCcccceEEEEECCEeccCceecCC-------EEEeCCCCceEEEeEeeeEEE Confidence 765422 123578999999999999999999998888766 789999999999999999999 Q ss_pred EecceeEEccCCccceeccccccEEEEEEEEEeeccceEEEEecCCcccccccccccccccccccccccccccceEEEEe Q lcl|NC_015271. 679 EFSKFRIKQVDNDGSTSTEDIGRLQLRRAWVNYEDSGTFDIYVENQSSNWKYTMAGARLGAHVMRTGKLNLGTGQYRFPV 758 (795) Q Consensus 679 ~~~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~ 758 (795) +++||++++++|... .+|+||+|++++|.+|++++++++...+.. ...+.+..+.. ....+|+.+|++++|+ T Consensus 662 ~~~p~~~~~~~gs~~-----~~~~ri~r~~v~~~~S~~~~~~~~~~~~~~--~~~~~r~~~~~-~~~~~~l~TG~~~v~~ 733 (768) T protein:vir:10 662 QTMQLNAGAANGTAQ-----GKTKRVTNIATRFSRSLGGVVGPTFDDNDL--EQLSFRKPSNA-MDRAVPLFDGDMESDW 733 (768) T ss_pred EecceEeecCCcccc-----ccceEEEEEEEEEecccceEEEecCCCCCc--eeeeeEecCcc-cCccCCcccCEEEEEe Confidence 999999998887543 368999999999999999999886654321 11122222222 2345788999999998 Q ss_pred ee-cccceEEEEEECCCCCEEEEEEEEEEEEeccc Q lcl|NC_015271. 759 VG-NAKFNTVFILSDATTPLNIIGCGWEGNYLRRS 792 (795) Q Consensus 759 ~~-~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r~ 792 (795) .+ |+++.+|+|+|++|+||+||||+||+++|+|+ T Consensus 734 ~~~~~~~~~i~i~~d~P~P~tvlsi~~~~~~nd~~ 768 (768) T protein:vir:10 734 RGGYEGQSWICYQNDQPLPVTLLGFFPILDTQDDR 768 (768) T ss_pred cCCCCcceEEEEEECCCCCEEEEEEEEEEEEeecC Confidence 65 57889999999999999999999999999999 No 19 >protein:vir:95324 Length: 823 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512269;genbank:gi:89152436;genbank:GeneID:3952993 Probab=100.00 E-value=7.7e-169 Score=942.26 Aligned_cols=722 Identities=13% Similarity=0.102 Sum_probs=543.5 Q ss_pred CCceeeechhhccc-----cccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCc Q lcl|NC_015271. 1 MALISQSIKNLKGG-----ISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESE 75 (795) Q Consensus 1 M~~v~~~~~n~~~G-----vS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~ 75 (795) |+ |+++++||.+| +++|+|++||++||++|+||+++|+||++|||||+||+++++++ ...+|+||.||++| T Consensus 1 m~-i~~~q~sF~~GElsP~l~gR~Dl~ry~~q~~~~~N~~~~~~GGl~rRpGt~fva~~~~~~---g~~rLipf~~s~~q 76 (823) T protein:vir:95 1 MA-ISWIQPSFAGGEIGPSLYGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN---RKCRLIPFQFSTVQ 76 (823) T ss_pred Cc-ceeechhccCceechheeccchHHHHHHHHhhhhCcEeeecCCceecCchhhhhhhcCCC---CCeeEEEEEeCCCc Confidence 99 99999999999 88899999999999999999999999999999999999987754 46799999999999 Q ss_pred eEEEEEeCCeEEEEecCCcEEEE--EECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCccc Q lcl|NC_015271. 76 QYYAVFTGTGIRVFDLAGNERQV--RYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDA 153 (795) Q Consensus 76 ~y~l~~~~~~~rv~~~~g~~~~v--~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~ 153 (795) .|+|+|++++||||+.+|.+... ......+||. +..+.+|+|+|+||+|||+|++|||+++.|.++ ..|...+.. T Consensus 77 ~y~Lefg~~~irV~~~~g~vv~~~~~~~ev~tPy~--~~~l~~Lr~~qsaD~~fivh~~~~p~~L~r~~~-~~w~l~~~~ 153 (823) T protein:vir:95 77 TYALEFGHQYMRVIKDGALVLNSSNVIYEIATPYT--EADLFRIKFTQSADVLTLVHPAYPPKELRRYAH-DNWQLVDVV 153 (823) T ss_pred EEEEEEcCCeEEEEeCCcEEEecCCceeEEecccc--cccccceeEEEeccEEEEEcCCccceEEEecCC-CCceEEEEE Confidence 99999999999999866643210 1111223453 334678999999999999999999999999877 468888888 Q ss_pred EEEecccccCeEEEEEECC-ceeEEEEecCCCCcccccccchhHHhHhhhhhcccccCceeeeecCceEEEEecCCccee Q lcl|NC_015271. 154 LINVRGGQYGRTLQIIING-NTQATYQIPDGSQPEHVNNTDAQWLAEELARQCRVSAPGWTFNVGQGYIHIIAPEGQQID 232 (795) Q Consensus 154 ~~~v~~~~~~~ty~vt~~g-~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~~~ 232 (795) |...++++.+.++++++.. ..+..++.+...+...+..... .+..............+...+... T Consensus 154 ~~~gp~~~~~~~~t~~v~~~~~~~~~t~ta~~~~~~~d~vg~---------~~~l~~~~~~~~~~~~~~~~~~~~----- 219 (823) T protein:vir:95 154 TKNGPFEDINIDESLTVYASASTGTITLTASASIFGAEQVGK---------LFYLEQPAVDSVPVWETSKSTSIG----- 219 (823) T ss_pred EeccccccccccceeEEeccccCceeEEeecccccchhhccc---------eEEEeccccceeeecceeeeeccc----- Confidence 8888888877777766532 1222333332222211111110 000000000000000000000000 Q ss_pred eEEEecCcCcccceeEEEeccceeec-ccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeE-eeeEecccee Q lcl|NC_015271. 233 SLTTKDGYADQLINPVTHYAQSFSKL-PTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNV-NDQLLFETMP 310 (795) Q Consensus 233 ~~~~~dg~~~t~~~~~~~~v~~~~~l-~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~-~~~~~~~t~p 310 (795) .....+ ...... ..+.....+ |...+..+.+...++.......+|..+..+.|.|++++..+. .......+|| T Consensus 220 ~~~~~~-~~~~~~----~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~v~~~~~~~~~~~~~~ 294 (823) T protein:vir:95 220 DIRRAD-SNYYRA----VTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYLHSGFGIARITAVNGTTATAEVISYIP 294 (823) T ss_pred ceEEec-ccceee----eeccccceeecccCCcceEEeceecccccceeEEEEEeCCcceEEEEeecceeeeceEeeeec Confidence 000000 000000 011111122 233333344544444333333344444566788888765442 3344567799 Q ss_pred EEEEeecCceeeecccCCccccCCcccccccccccCCCccEEEEEcceEEEecC----CeEEEEecCCcccccccccccc Q lcl|NC_015271. 311 HALVRAADGNFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSG----ENIILSRTAKYFNFYPASIATL 386 (795) Q Consensus 311 ~~~v~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~----~~v~~Sr~gd~~nF~~~t~~~~ 386 (795) +.+++..+.++++....|+. ..+||++|+||||||+|+++ ++|||||+||||||+++++ + T Consensus 295 ~~~~~~~~~t~~~~~~~~~~--------------~~g~Ps~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~~--~ 358 (823) T protein:vir:95 295 SQVVGEDNASYKWAKYAWNS--------------VNGYPGTVVYYQQRLYFAASTAFPQTIWASRTGDYKDFGKSNP--T 358 (823) T ss_pred cccccCCcCCccccccccCc--------------CCCCccEEEEEeceEEEEEcCCCCcEEEEeccCCccccccccC--C Confidence 99999988898888777753 44677999999999999976 7999999999999999984 5 Q ss_pred CCCCcEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCC--ccccccceEEEEEEeecCcCCCCcEEeCCeEEEEec Q lcl|NC_015271. 387 SDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTAS--GTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASP 464 (795) Q Consensus 387 ~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~--~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~ 464 (795) +|||||+++++++++|.|+|+++++ +|+|||+++||+|+++ ++|||+|++++++|.|++ ++|+|+.+|+.++|+|+ T Consensus 359 ~DdD~I~~~~s~~~~~~i~~~v~~~-~Lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~~~Fv~~ 436 (823) T protein:vir:95 359 QDDDRIIYTYAGRQVNEIRHLIDVG-SLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGS-SNVPPIAVANIALFVQE 436 (823) T ss_pred CCCCcEEEEEcCCcceEEEEEeecC-cEEEEecCcEEEEEcCCCcccceeeEEEEEeecccc-ccccceEeCCeEEEEec Confidence 7999999999999999999999995 7999999999999875 689999999999999865 67999999999999999 Q ss_pred CCCeeEEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEee Q lcl|NC_015271. 465 RSSFTSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHW 544 (795) Q Consensus 465 ~g~~~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w 544 (795) +|+ ++ |++.|++++|+|+++|+|+|++||+++..+..++++++|.+++|++++||+|++|+|+. ||+|.||||| T Consensus 437 ~g~--~v-re~~~~~~~d~~~~~dlT~~a~hl~~~~~i~~~a~~~~p~~~~~~v~~dG~l~~~ty~~---~q~v~aW~~~ 510 (823) T protein:vir:95 437 KGS--VV-RDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDGKLLVMTYLR---DQQVFAWAPQ 510 (823) T ss_pred CCC--EE-EEEEEeeecCceecchhhhhhhhhcCCCceEEEEEecCCCeEEEEEecCCcEEEEEEec---ccceeeeEEE Confidence 885 34 45678899999999999999999999988888899999999999999999999999974 7889999999 Q ss_pred ecCCCeEEEEEEE--eCCEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcc--------------- Q lcl|NC_015271. 545 DFGSNVQVLACQC--INSDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGT--------------- 607 (795) Q Consensus 545 ~~~g~~~~~~~~~--~~d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~--------------- 607 (795) +++|+++++|+.+ .+|+||++|+|+++++.+++.|.+.+.....+++++||||+.+|...... T Consensus 511 ~~~g~~~~~~~i~~~~~d~l~~~v~R~i~g~~~~yiE~~~~~~~~~~~~~~~lD~~~s~~g~~~~~~~~~l~~g~~~l~~ 590 (823) T protein:vir:95 511 SSTGKYESTCSISEGNEDAVYFVVNRTVNGQTVRYIERLSSRLFTSDEDAFFVDSGLSYDGRNTSDRTMTITGGSGEWDY 590 (823) T ss_pred ecCCcEEEEEEecCCCCCEEEEEEEeccCCeEEEEEEeeccccCCCccceeEEEEEEEeecCcccceeeEecCCCCcccc Confidence 9999999998865 47999999999999999999998888777788889999999887532210 Q ss_pred ------------ccccc-ccce------------------------------------------------------eecc Q lcl|NC_015271. 608 ------------YNDDT-FTTT------------------------------------------------------LHLP 620 (795) Q Consensus 608 ------------~~~~~-~~t~------------------------------------------------------~~~~ 620 (795) +.... .... .+.. T Consensus 591 l~g~~v~~adg~~~~~~~v~g~i~l~~~~~~~~vGl~~~~~i~~~~~~v~~~~a~~~~~~r~v~a~l~~~~t~~~~~~~~ 670 (823) T protein:vir:95 591 LAEYTISVSGGAYFTSSDVGAQLQFPYTGADPDTGYEVSKELRCDIISVTSNTAVVVRANRNVPPSLRNVATTNWQMARR 670 (823) T ss_pred cCceEEEecCcceECCccceeEEEeCcCCCccccccceEEEEEEeeceeeCCceEEEccCCcccceeeeeeccccccccc Confidence 00000 0000 0123 Q ss_pred cccCCcccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccc Q lcl|NC_015271. 621 TIYGADFAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIG 700 (795) Q Consensus 621 ~~~gl~~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~g 700 (795) .++||+||||++|.+++||.+++..++.+| +|+|+.+++.|||||+|+++++++||++..+ |.+. .. T Consensus 671 ~~~gL~hleg~tv~v~~dg~~~~~~~v~~G-------~vtl~~~~~~v~vGl~~~~~~~~l~~~~~~~-g~~~-----g~ 737 (823) T protein:vir:95 671 TFGGLSHLEGQTVNILSDANVEPQKVVSGG-------AVTLESPGAVVHIGLPITAEFETLDININGQ-ETLL-----DK 737 (823) T ss_pred eeeeccccccceEEEEEcCeeeCCeEecCC-------EEEecCCCCEEEEeecceeeEEecchhcCCC-cccC-----Cc Confidence 356999999999999999999999999888 8999999999999999999999999998753 4332 12 Q ss_pred cEEEEEEEEEeeccceEEEEecCCcccccccccccccccccccccccccccceEEEEe-eecccceEEEEEECCCCCEEE Q lcl|NC_015271. 701 RLQLRRAWVNYEDSGTFDIYVENQSSNWKYTMAGARLGAHVMRTGKLNLGTGQYRFPV-VGNAKFNTVFILSDATTPLNI 779 (795) Q Consensus 701 rl~l~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~-~~~~~~~~v~i~~~~P~P~tv 779 (795) ++||++++++|++|.+++++.+....+.. . .+ .......++|+++|++++++ .+|+++++|+|+|++|||||| T Consensus 738 ~~ri~~~~~~~~~s~~~~~g~~~~~l~~~-~---~r--~~~~~~~~~~~~tG~~~~~~~~~~~~~~~~~i~q~~plp~tv 811 (823) T protein:vir:95 738 KQVIPSVTLVVNASRGIWATTPGGKWYEY-P---QR--EFEFYDDPVDDATGKVEVKLDSNWGKNGRVKIRQLDPLPLSV 811 (823) T ss_pred eeEEeEEEEEEEeeeeEEEecCCCceeEe-e---cc--CCCcccCCCCcccceEEEecCCCcCCccEEEEEEcCCCceEE Confidence 46899999999999999998755432211 1 11 12334567889999999987 689999999999999999999 Q ss_pred EEEEEEEEEecc Q lcl|NC_015271. 780 IGCGWEGNYLRR 791 (795) Q Consensus 780 lsi~~eg~y~~r 791 (795) |||..|...+-= T Consensus 812 l~v~~~~~~~g~ 823 (823) T protein:vir:95 812 LAVIPRLTVGGF 823 (823) T ss_pred EEEEEEEEecCC Confidence 999988775544 No 20 >protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878 Probab=100.00 E-value=6.7e-165 Score=920.65 Aligned_cols=666 Identities=14% Similarity=0.137 Sum_probs=508.7 Q ss_pred CCceeeechhhccc-cc----cCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCc Q lcl|NC_015271. 1 MALISQSIKNLKGG-IS----QQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESE 75 (795) Q Consensus 1 M~~v~~~~~n~~~G-vS----~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~ 75 (795) |++++.+++||.+| || .|.|++||+++|++|+||++.|+||++||||++||+++++.. ...||+||.|+++| T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~---~~~rlipf~~~~~~ 77 (681) T protein:vir:10 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSA---KKVRLIPFTYSVTQ 77 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCC---CcEEEEEEEeCCCc Confidence 99999999999999 55 589999999999999999999999999999999999988754 35799999999999 Q ss_pred eEEEEEeCCeEEEEecCCcEEEE-EECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccE Q lcl|NC_015271. 76 QYYAVFTGTGIRVFDLAGNERQV-RYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDAL 154 (795) Q Consensus 76 ~y~l~~~~~~~rv~~~~g~~~~v-~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~ 154 (795) +|+|||++++||||..+|.+... ......+||. +..+.+|+|+|+||+|||||+++||++|.|.++ +.|+..++.| T Consensus 78 ~~~l~~g~~~~r~~~~~~~~~~~~~~~~~~tpy~--~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~-~~W~l~~~~f 154 (681) T protein:vir:10 78 TMVIELGAGYFRFHTNGGTLLDGAVPYEIANPYA--EADLFNIHYVQSADVLTLVHPNYAPRELRRLGA-TNWQLATIAF 154 (681) T ss_pred eEEEEEeCCeEEEEeCCcEEeeCcEeEEecCCCC--hhhhcCceEEEEcCEEEEECCCCcceEEEEccC-CceEEEEEEe Confidence 99999999999999877765432 2223356674 345678999999999999999999999999766 5588777666 Q ss_pred EEecccccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhcccccCceeeeecCceEEEEecCCcceeeE Q lcl|NC_015271. 155 INVRGGQYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQCRVSAPGWTFNVGQGYIHIIAPEGQQIDSL 234 (795) Q Consensus 155 ~~v~~~~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~~~~~ 234 (795) ...++...+ ++.+... .....+. ..++.+.++..... T Consensus 155 ~~~p~~p~~----~~at~~~---------------------------------~~~~~t~-----~~~v~avda~t~~~- 191 (681) T protein:vir:10 155 TSPVATPTS----VTATSNN---------------------------------KGTDYTY-----RYVVTALDAEGKTE- 191 (681) T ss_pred cccccccee----eeeeccC---------------------------------CccceeE-----eEEEEEeeccccee- Confidence 544443221 1110000 0000000 00111111100000 Q ss_pred EEecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeEEEE Q lcl|NC_015271. 235 TTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALV 314 (795) Q Consensus 235 ~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~~~v 314 (795) +...... .+.... . ..+...++.+++..+...|.+. .+.+.+.+.++.......+.. T Consensus 192 -s~~~~~~----tvt~~~--~-------~~~~~~t~~w~a~~g~~~~~V~--~~~~gi~g~ig~~~~~~~~~~------- 248 (681) T protein:vir:10 192 -SAPSSAG----TCTNNL--F-------TNGGANTIAWSASSGASRYNVY--KEQGGLYGYIGQTTGTSLVDD------- 248 (681) T ss_pred -ecCCcce----EEeeee--e-------cCCcceeEEEEecCCceeeeec--ccceeEEEEeeccceeeeeec------- Confidence 0000000 000000 0 1123334455555555444321 122222222322222221111 Q ss_pred eecCceeeecccCCccccCCcccccccccccCCCccEEEEEcceEEEec----CCeEEEEecCCccccccccccccCCCC Q lcl|NC_015271. 315 RAADGNFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLS----GENIILSRTAKYFNFYPASIATLSDDD 390 (795) Q Consensus 315 ~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~----~~~v~~Sr~gd~~nF~~~t~~~~~DdD 390 (795) ...++.....+...+++ +-.++||++|+||||||+|++ +++|||||+|||+||++++ ++.||| T Consensus 249 ----------~~~~~~~~t~~~~~~~~-~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~--~~~ddD 315 (681) T protein:vir:10 249 ----------NIAPDLSVTPPIYDAVF-NAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSL--PVRDDD 315 (681) T ss_pred ----------ccccCcccccccccccc-ccCCCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccC--CCCCCc Confidence 11111122222223443 345679999999999999995 4799999999999999998 458999 Q ss_pred cEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeC--CccccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCe Q lcl|NC_015271. 391 PIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTA--SGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSF 468 (795) Q Consensus 391 ~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~--~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~ 468 (795) ||+++++++++|.|+|+++++ +|+|||+++||.|++ +++|||+|++++++|.|++ ++|+|+.+|++++|+|++|++ T Consensus 316 ~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~fv~~~g~~ 393 (681) T protein:vir:10 316 RVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTIYGAARGGH 393 (681) T ss_pred cEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeeeecc-ccccceeeCCeEEEEecCCCE Confidence 999999999999999999995 799999999999987 4699999999999999976 579999999999999999964 Q ss_pred eEEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecCC Q lcl|NC_015271. 469 TSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGS 548 (795) Q Consensus 469 ~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~g 548 (795) +|+++|++++|+|+++|+|++++|++++..+..++++++|.+++||+++||+|++|+|+ +||+|+|||||+|+| T Consensus 394 ---vre~~y~~~~d~~~~~dlt~~a~Hl~~~~~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~---~eq~v~aW~~~~~~g 467 (681) T protein:vir:10 394 ---VRELAYNWQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFISSSGKLLGLTYV---PEQQIGAWHQHDTDG 467 (681) T ss_pred ---EEEEEEeeecCceeccchhhhhhhhcCCCCeEEEEEecCCCEEEEEEecCCcEEEEEEe---cccceeeEEEEecCC Confidence 45568899999999999999999999987777789999999999999999999999997 467799999999999 Q ss_pred CeEEEEEEE--eCCEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeecccccCCc Q lcl|NC_015271. 549 NVQVLACQC--INSDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIYGAD 626 (795) Q Consensus 549 ~~~~~~~~~--~~d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~ 626 (795) +++++|+.. .+|.||++|+|++++..+++.|.++..........+|+||+.++... +...++||+ T Consensus 468 ~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~yie~~~~~~~~~~~~~~~vD~~~t~~~~-------------~~~~~sgl~ 534 (681) T protein:vir:10 468 VFESCAVVAEGNEDRLYAVVRRTIGGNEVRYVERMASRQFDAQADAFFVDSGLTYSGE-------------PVSHISGLE 534 (681) T ss_pred cEEEEEEecCCCCcEEEEEEEecCCCCeEEEEEecCCccccccccceEeeccccccCc-------------ceeeecccc Confidence 999988864 36899999999999988898888877666666677899999886532 234578999 Q ss_pred ccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEE Q lcl|NC_015271. 627 FAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRR 706 (795) Q Consensus 627 ~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~ 706 (795) |++|++|.+.+||..++..++.+| .|+++.++++|||||+|+++++++||+++.++|.+.+ .++||+| T Consensus 535 ~leG~tv~i~aDG~~~~~~~V~~G-------~itl~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g-----~~~ri~r 602 (681) T protein:vir:10 535 HLEGKTVSILADGAVHPQRVVTDG-------AIDLDVEAGTVHIGLPITAELQTLPVAMQLDGSFGQG-----RVKNINK 602 (681) T ss_pred CCCCcEEEEEeCCeecCcEeecCc-------EEEeCcCCceEEEeeeceeEEEecceeeecCCcccCC-----ceEEEEE Confidence 999999999999999999888877 6889999999999999999999999999999875542 3689999 Q ss_pred EEEEeeccceEEEEecCCcccccccccccccccccccccccccccceEEEEee-ecccceEEEEEECCCCCEEEEEEEEE Q lcl|NC_015271. 707 AWVNYEDSGTFDIYVENQSSNWKYTMAGARLGAHVMRTGKLNLGTGQYRFPVV-GNAKFNTVFILSDATTPLNIIGCGWE 785 (795) Q Consensus 707 ~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~-~~~~~~~v~i~~~~P~P~tvlsi~~e 785 (795) ++|++.+|.+++++++....+....+.+.++ ..++++.+|++++|+. +|+++.+|+|+|++|+||+|+||+|| T Consensus 603 v~lr~~~S~g~~~~~~~~~l~~~~~~~~~~~------g~~~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~e 676 (681) T protein:vir:10 603 LWLRVHRSSGIFAGPHADALTEVKQRTSEPY------GSPPALKSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAE 676 (681) T ss_pred EEEEEEcccceEEeeCCCceEEEEEeccccc------cccCCccCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEE Confidence 9999999999999987654433222223333 3457889999999985 78999999999999999999999999 Q ss_pred EEEec Q lcl|NC_015271. 786 GNYLR 790 (795) Q Consensus 786 g~y~~ 790 (795) ....- T Consensus 677 v~vgg 681 (681) T protein:vir:10 677 IAIGA 681 (681) T ss_pred EEeeC Confidence 99988 No 21 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=100.00 E-value=6.7e-165 Score=920.65 Aligned_cols=666 Identities=14% Similarity=0.137 Sum_probs=508.7 Q ss_pred CCceeeechhhccc-cc----cCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCc Q lcl|NC_015271. 1 MALISQSIKNLKGG-IS----QQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESE 75 (795) Q Consensus 1 M~~v~~~~~n~~~G-vS----~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~ 75 (795) |++++.+++||.+| || .|.|++||+++|++|+||++.|+||++||||++||+++++.. ...||+||.|+++| T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~---~~~rlipf~~~~~~ 77 (681) T protein:vir:10 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSA---KKVRLIPFTYSVTQ 77 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCC---CcEEEEEEEeCCCc Confidence 99999999999999 55 589999999999999999999999999999999999988754 35799999999999 Q ss_pred eEEEEEeCCeEEEEecCCcEEEE-EECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccE Q lcl|NC_015271. 76 QYYAVFTGTGIRVFDLAGNERQV-RYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDAL 154 (795) Q Consensus 76 ~y~l~~~~~~~rv~~~~g~~~~v-~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~ 154 (795) +|+|||++++||||..+|.+... ......+||. +..+.+|+|+|+||+|||||+++||++|.|.++ +.|+..++.| T Consensus 78 ~~~l~~g~~~~r~~~~~~~~~~~~~~~~~~tpy~--~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~-~~W~l~~~~f 154 (681) T protein:vir:10 78 TMVIELGAGYFRFHTNGGTLLDGAVPYEIANPYA--EADLFNIHYVQSADVLTLVHPNYAPRELRRLGA-TNWQLATIAF 154 (681) T ss_pred eEEEEEeCCeEEEEeCCcEEeeCcEeEEecCCCC--hhhhcCceEEEEcCEEEEECCCCcceEEEEccC-CceEEEEEEe Confidence 99999999999999877765432 2223356674 345678999999999999999999999999766 5588777666 Q ss_pred EEecccccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhcccccCceeeeecCceEEEEecCCcceeeE Q lcl|NC_015271. 155 INVRGGQYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQCRVSAPGWTFNVGQGYIHIIAPEGQQIDSL 234 (795) Q Consensus 155 ~~v~~~~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~~~~~ 234 (795) ...++...+ ++.+... .....+. ..++.+.++..... T Consensus 155 ~~~p~~p~~----~~at~~~---------------------------------~~~~~t~-----~~~v~avda~t~~~- 191 (681) T protein:vir:10 155 TSPVATPTS----VTATSNN---------------------------------KGTDYTY-----RYVVTALDAEGKTE- 191 (681) T ss_pred cccccccee----eeeeccC---------------------------------CccceeE-----eEEEEEeeccccee- Confidence 544443221 1110000 0000000 00111111100000 Q ss_pred EEecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeEEEE Q lcl|NC_015271. 235 TTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALV 314 (795) Q Consensus 235 ~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~~~v 314 (795) +...... .+.... . ..+...++.+++..+...|.+. .+.+.+.+.++.......+.. T Consensus 192 -s~~~~~~----tvt~~~--~-------~~~~~~t~~w~a~~g~~~~~V~--~~~~gi~g~ig~~~~~~~~~~------- 248 (681) T protein:vir:10 192 -SAPSSAG----TCTNNL--F-------TNGGANTIAWSASSGASRYNVY--KEQGGLYGYIGQTTGTSLVDD------- 248 (681) T ss_pred -ecCCcce----EEeeee--e-------cCCcceeEEEEecCCceeeeec--ccceeEEEEeeccceeeeeec------- Confidence 0000000 000000 0 1123334455555555444321 122222222322222221111 Q ss_pred eecCceeeecccCCccccCCcccccccccccCCCccEEEEEcceEEEec----CCeEEEEecCCccccccccccccCCCC Q lcl|NC_015271. 315 RAADGNFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLS----GENIILSRTAKYFNFYPASIATLSDDD 390 (795) Q Consensus 315 ~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~----~~~v~~Sr~gd~~nF~~~t~~~~~DdD 390 (795) ...++.....+...+++ +-.++||++|+||||||+|++ +++|||||+|||+||++++ ++.||| T Consensus 249 ----------~~~~~~~~t~~~~~~~~-~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~--~~~ddD 315 (681) T protein:vir:10 249 ----------NIAPDLSVTPPIYDAVF-NAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSL--PVRDDD 315 (681) T ss_pred ----------ccccCcccccccccccc-ccCCCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccC--CCCCCc Confidence 11111122222223443 345679999999999999995 4799999999999999998 458999 Q ss_pred cEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeC--CccccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCe Q lcl|NC_015271. 391 PIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTA--SGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSF 468 (795) Q Consensus 391 ~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~--~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~ 468 (795) ||+++++++++|.|+|+++++ +|+|||+++||.|++ +++|||+|++++++|.|++ ++|+|+.+|++++|+|++|++ T Consensus 316 ~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~fv~~~g~~ 393 (681) T protein:vir:10 316 RVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTIYGAARGGH 393 (681) T ss_pred cEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeeeecc-ccccceeeCCeEEEEecCCCE Confidence 999999999999999999995 799999999999987 4699999999999999976 579999999999999999964 Q ss_pred eEEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecCC Q lcl|NC_015271. 469 TSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGS 548 (795) Q Consensus 469 ~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~g 548 (795) +|+++|++++|+|+++|+|++++|++++..+..++++++|.+++||+++||+|++|+|+ +||+|+|||||+|+| T Consensus 394 ---vre~~y~~~~d~~~~~dlt~~a~Hl~~~~~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~---~eq~v~aW~~~~~~g 467 (681) T protein:vir:10 394 ---VRELAYNWQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFISSSGKLLGLTYV---PEQQIGAWHQHDTDG 467 (681) T ss_pred ---EEEEEEeeecCceeccchhhhhhhhcCCCCeEEEEEecCCCEEEEEEecCCcEEEEEEe---cccceeeEEEEecCC Confidence 45568899999999999999999999987777789999999999999999999999997 467799999999999 Q ss_pred CeEEEEEEE--eCCEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeecccccCCc Q lcl|NC_015271. 549 NVQVLACQC--INSDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIYGAD 626 (795) Q Consensus 549 ~~~~~~~~~--~~d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~ 626 (795) +++++|+.. .+|.||++|+|++++..+++.|.++..........+|+||+.++... +...++||+ T Consensus 468 ~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~yie~~~~~~~~~~~~~~~vD~~~t~~~~-------------~~~~~sgl~ 534 (681) T protein:vir:10 468 VFESCAVVAEGNEDRLYAVVRRTIGGNEVRYVERMASRQFDAQADAFFVDSGLTYSGE-------------PVSHISGLE 534 (681) T ss_pred cEEEEEEecCCCCcEEEEEEEecCCCCeEEEEEecCCccccccccceEeeccccccCc-------------ceeeecccc Confidence 999988864 36899999999999988898888877666666677899999886532 234578999 Q ss_pred ccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEE Q lcl|NC_015271. 627 FAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRR 706 (795) Q Consensus 627 ~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~ 706 (795) |++|++|.+.+||..++..++.+| .|+++.++++|||||+|+++++++||+++.++|.+.+ .++||+| T Consensus 535 ~leG~tv~i~aDG~~~~~~~V~~G-------~itl~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g-----~~~ri~r 602 (681) T protein:vir:10 535 HLEGKTVSILADGAVHPQRVVTDG-------AIDLDVEAGTVHIGLPITAELQTLPVAMQLDGSFGQG-----RVKNINK 602 (681) T ss_pred CCCCcEEEEEeCCeecCcEeecCc-------EEEeCcCCceEEEeeeceeEEEecceeeecCCcccCC-----ceEEEEE Confidence 999999999999999999888877 6889999999999999999999999999999875542 3689999 Q ss_pred EEEEeeccceEEEEecCCcccccccccccccccccccccccccccceEEEEee-ecccceEEEEEECCCCCEEEEEEEEE Q lcl|NC_015271. 707 AWVNYEDSGTFDIYVENQSSNWKYTMAGARLGAHVMRTGKLNLGTGQYRFPVV-GNAKFNTVFILSDATTPLNIIGCGWE 785 (795) Q Consensus 707 ~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~-~~~~~~~v~i~~~~P~P~tvlsi~~e 785 (795) ++|++.+|.+++++++....+....+.+.++ ..++++.+|++++|+. +|+++.+|+|+|++|+||+|+||+|| T Consensus 603 v~lr~~~S~g~~~~~~~~~l~~~~~~~~~~~------g~~~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~e 676 (681) T protein:vir:10 603 LWLRVHRSSGIFAGPHADALTEVKQRTSEPY------GSPPALKSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAE 676 (681) T ss_pred EEEEEEcccceEEeeCCCceEEEEEeccccc------cccCCccCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEE Confidence 9999999999999987654433222223333 3457889999999985 78999999999999999999999999 Q ss_pred EEEec Q lcl|NC_015271. 786 GNYLR 790 (795) Q Consensus 786 g~y~~ 790 (795) ....- T Consensus 677 v~vgg 681 (681) T protein:vir:10 677 IAIGA 681 (681) T ss_pred EEeeC Confidence 99988 No 22 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=100.00 E-value=6.7e-165 Score=920.65 Aligned_cols=666 Identities=14% Similarity=0.137 Sum_probs=508.7 Q ss_pred CCceeeechhhccc-cc----cCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCc Q lcl|NC_015271. 1 MALISQSIKNLKGG-IS----QQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESE 75 (795) Q Consensus 1 M~~v~~~~~n~~~G-vS----~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~ 75 (795) |++++.+++||.+| || .|.|++||+++|++|+||++.|+||++||||++||+++++.. ...||+||.|+++| T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~---~~~rlipf~~~~~~ 77 (681) T protein:vir:98 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSA---KKVRLIPFTYSVTQ 77 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCC---CcEEEEEEEeCCCc Confidence 99999999999999 55 589999999999999999999999999999999999988754 35799999999999 Q ss_pred eEEEEEeCCeEEEEecCCcEEEE-EECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccE Q lcl|NC_015271. 76 QYYAVFTGTGIRVFDLAGNERQV-RYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDAL 154 (795) Q Consensus 76 ~y~l~~~~~~~rv~~~~g~~~~v-~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~ 154 (795) +|+|||++++||||..+|.+... ......+||. +..+.+|+|+|+||+|||||+++||++|.|.++ +.|+..++.| T Consensus 78 ~~~l~~g~~~~r~~~~~~~~~~~~~~~~~~tpy~--~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~-~~W~l~~~~f 154 (681) T protein:vir:98 78 TMVIELGAGYFRFHTNGGTLLDGAVPYEIANPYA--EADLFNIHYVQSADVLTLVHPNYAPRELRRLGA-TNWQLATIAF 154 (681) T ss_pred eEEEEEeCCeEEEEeCCcEEeeCcEeEEecCCCC--hhhhcCceEEEEcCEEEEECCCCcceEEEEccC-CceEEEEEEe Confidence 99999999999999877765432 2223356674 345678999999999999999999999999766 5588777666 Q ss_pred EEecccccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhcccccCceeeeecCceEEEEecCCcceeeE Q lcl|NC_015271. 155 INVRGGQYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQCRVSAPGWTFNVGQGYIHIIAPEGQQIDSL 234 (795) Q Consensus 155 ~~v~~~~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~~~~~ 234 (795) ...++...+ ++.+... .....+. ..++.+.++..... T Consensus 155 ~~~p~~p~~----~~at~~~---------------------------------~~~~~t~-----~~~v~avda~t~~~- 191 (681) T protein:vir:98 155 TSPVATPTS----VTATSNN---------------------------------KGTDYTY-----RYVVTALDAEGKTE- 191 (681) T ss_pred cccccccee----eeeeccC---------------------------------CccceeE-----eEEEEEeeccccee- Confidence 544443221 1110000 0000000 00111111100000 Q ss_pred EEecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeEEEE Q lcl|NC_015271. 235 TTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALV 314 (795) Q Consensus 235 ~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~~~v 314 (795) +...... .+.... . ..+...++.+++..+...|.+. .+.+.+.+.++.......+.. T Consensus 192 -s~~~~~~----tvt~~~--~-------~~~~~~t~~w~a~~g~~~~~V~--~~~~gi~g~ig~~~~~~~~~~------- 248 (681) T protein:vir:98 192 -SAPSSAG----TCTNNL--F-------TNGGANTIAWSASSGASRYNVY--KEQGGLYGYIGQTTGTSLVDD------- 248 (681) T ss_pred -ecCCcce----EEeeee--e-------cCCcceeEEEEecCCceeeeec--ccceeEEEEeeccceeeeeec------- Confidence 0000000 000000 0 1123334455555555444321 122222222322222221111 Q ss_pred eecCceeeecccCCccccCCcccccccccccCCCccEEEEEcceEEEec----CCeEEEEecCCccccccccccccCCCC Q lcl|NC_015271. 315 RAADGNFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLS----GENIILSRTAKYFNFYPASIATLSDDD 390 (795) Q Consensus 315 ~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~----~~~v~~Sr~gd~~nF~~~t~~~~~DdD 390 (795) ...++.....+...+++ +-.++||++|+||||||+|++ +++|||||+|||+||++++ ++.||| T Consensus 249 ----------~~~~~~~~t~~~~~~~~-~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~--~~~ddD 315 (681) T protein:vir:98 249 ----------NIAPDLSVTPPIYDAVF-NAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSL--PVRDDD 315 (681) T ss_pred ----------ccccCcccccccccccc-ccCCCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccC--CCCCCc Confidence 11111122222223443 345679999999999999995 4799999999999999998 458999 Q ss_pred cEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeC--CccccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCe Q lcl|NC_015271. 391 PIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTA--SGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSF 468 (795) Q Consensus 391 ~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~--~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~ 468 (795) ||+++++++++|.|+|+++++ +|+|||+++||.|++ +++|||+|++++++|.|++ ++|+|+.+|++++|+|++|++ T Consensus 316 ~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~fv~~~g~~ 393 (681) T protein:vir:98 316 RVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTIYGAARGGH 393 (681) T ss_pred cEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeeeecc-ccccceeeCCeEEEEecCCCE Confidence 999999999999999999995 799999999999987 4699999999999999976 579999999999999999964 Q ss_pred eEEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecCC Q lcl|NC_015271. 469 TSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGS 548 (795) Q Consensus 469 ~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~g 548 (795) +|+++|++++|+|+++|+|++++|++++..+..++++++|.+++||+++||+|++|+|+ +||+|+|||||+|+| T Consensus 394 ---vre~~y~~~~d~~~~~dlt~~a~Hl~~~~~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~---~eq~v~aW~~~~~~g 467 (681) T protein:vir:98 394 ---VRELAYNWQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFISSSGKLLGLTYV---PEQQIGAWHQHDTDG 467 (681) T ss_pred ---EEEEEEeeecCceeccchhhhhhhhcCCCCeEEEEEecCCCEEEEEEecCCcEEEEEEe---cccceeeEEEEecCC Confidence 45568899999999999999999999987777789999999999999999999999997 467799999999999 Q ss_pred CeEEEEEEE--eCCEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeecccccCCc Q lcl|NC_015271. 549 NVQVLACQC--INSDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIYGAD 626 (795) Q Consensus 549 ~~~~~~~~~--~~d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~ 626 (795) +++++|+.. .+|.||++|+|++++..+++.|.++..........+|+||+.++... +...++||+ T Consensus 468 ~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~yie~~~~~~~~~~~~~~~vD~~~t~~~~-------------~~~~~sgl~ 534 (681) T protein:vir:98 468 VFESCAVVAEGNEDRLYAVVRRTIGGNEVRYVERMASRQFDAQADAFFVDSGLTYSGE-------------PVSHISGLE 534 (681) T ss_pred cEEEEEEecCCCCcEEEEEEEecCCCCeEEEEEecCCccccccccceEeeccccccCc-------------ceeeecccc Confidence 999988864 36899999999999988898888877666666677899999886532 234578999 Q ss_pred ccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEE Q lcl|NC_015271. 627 FAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRR 706 (795) Q Consensus 627 ~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~ 706 (795) |++|++|.+.+||..++..++.+| .|+++.++++|||||+|+++++++||+++.++|.+.+ .++||+| T Consensus 535 ~leG~tv~i~aDG~~~~~~~V~~G-------~itl~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g-----~~~ri~r 602 (681) T protein:vir:98 535 HLEGKTVSILADGAVHPQRVVTDG-------AIDLDVEAGTVHIGLPITAELQTLPVAMQLDGSFGQG-----RVKNINK 602 (681) T ss_pred CCCCcEEEEEeCCeecCcEeecCc-------EEEeCcCCceEEEeeeceeEEEecceeeecCCcccCC-----ceEEEEE Confidence 999999999999999999888877 6889999999999999999999999999999875542 3689999 Q ss_pred EEEEeeccceEEEEecCCcccccccccccccccccccccccccccceEEEEee-ecccceEEEEEECCCCCEEEEEEEEE Q lcl|NC_015271. 707 AWVNYEDSGTFDIYVENQSSNWKYTMAGARLGAHVMRTGKLNLGTGQYRFPVV-GNAKFNTVFILSDATTPLNIIGCGWE 785 (795) Q Consensus 707 ~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~-~~~~~~~v~i~~~~P~P~tvlsi~~e 785 (795) ++|++.+|.+++++++....+....+.+.++ ..++++.+|++++|+. +|+++.+|+|+|++|+||+|+||+|| T Consensus 603 v~lr~~~S~g~~~~~~~~~l~~~~~~~~~~~------g~~~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~e 676 (681) T protein:vir:98 603 LWLRVHRSSGIFAGPHADALTEVKQRTSEPY------GSPPALKSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAE 676 (681) T ss_pred EEEEEEcccceEEeeCCCceEEEEEeccccc------cccCCccCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEE Confidence 9999999999999987654433222223333 3457889999999985 78999999999999999999999999 Q ss_pred EEEec Q lcl|NC_015271. 786 GNYLR 790 (795) Q Consensus 786 g~y~~ 790 (795) ....- T Consensus 677 v~vgg 681 (681) T protein:vir:98 677 IAIGA 681 (681) T ss_pred EEeeC Confidence 99988 No 23 >protein:vir:7329 Length: 825 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848220;genbank:gi:30387391;genbank:GeneID:2641863 Probab=100.00 E-value=7.6e-165 Score=920.33 Aligned_cols=720 Identities=13% Similarity=0.110 Sum_probs=519.6 Q ss_pred CCceeeechhhccc-cc----cCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCc Q lcl|NC_015271. 1 MALISQSIKNLKGG-IS----QQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESE 75 (795) Q Consensus 1 M~~v~~~~~n~~~G-vS----~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~ 75 (795) |+ ++..++||.+| || .|+|++||++||++|+||+++|+||++||||++||+.+++++ ...||+||.|+++| T Consensus 1 m~-~~~~q~sF~~GElsP~l~gR~Dl~~y~~g~~~~~N~~~~p~Gg~~rRpGt~fva~~~~~~---~~~rLipF~fs~~q 76 (825) T protein:vir:73 1 MA-FSWIQPSFAGGEIGPSLYGRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPAKYPD---RKCRLIPFQFSTVQ 76 (825) T ss_pred Cc-cceeccccccceechhhcccchHHHHHHHHHHhcCcEEEecCCceecCchHHhHhhcCCC---CCEEEEEEEeCCCc Confidence 75 55788999999 44 589999999999999999999999999999999999987754 36799999999999 Q ss_pred eEEEEEeCCeEEEEecCCcEEEE--EECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCccc Q lcl|NC_015271. 76 QYYAVFTGTGIRVFDLAGNERQV--RYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDA 153 (795) Q Consensus 76 ~y~l~~~~~~~rv~~~~g~~~~v--~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~ 153 (795) +|+|||++++||||..+|.+..- ......+||. +..+.+|+|+|+||+|||+|+++||+++.|.++. .|...+.. T Consensus 77 ~y~Lefg~~~lrv~~~gg~v~~~~~~~~e~~TPy~--~~~l~~l~~~QsaD~~~i~h~~~pp~~L~r~~~~-~W~l~~~~ 153 (825) T protein:vir:73 77 TYALEFGHNYMRVIKDGAYVLTTSNVIYELAMPYA--DTDLFRIKFTQSADVLTLVHPAYPPKELRRYAHD-NWQIVDVT 153 (825) T ss_pred EEEEEEeCCeEEEEeCCceEeccCCceEEEecccc--hhhhhhheeeeecCEEEEEcCCCceeEEEEecCC-CcEEEEEe Confidence 99999999999999977754210 1112235663 3456789999999999999999999999998764 58877777 Q ss_pred EEEecccccCeEEEEEE--CCceeEEEEecCCCCcccccccchhH-HhHhhhhhcccccCceeeeecCceEEEEecCCcc Q lcl|NC_015271. 154 LINVRGGQYGRTLQIII--NGNTQATYQIPDGSQPEHVNNTDAQW-LAEELARQCRVSAPGWTFNVGQGYIHIIAPEGQQ 230 (795) Q Consensus 154 ~~~v~~~~~~~ty~vt~--~g~~~a~~ttp~~s~~~~~~~~~~~~-i~~~l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~ 230 (795) |...++...+....+++ .+.....+.+++.. .....+.-... +.......+. .+...+....+.+ ....+ . T Consensus 154 f~~gp~~~in~~~sv~v~asg~tg~~TiTaS~a-~~~~~~vG~~i~~~~~~v~si~-~~~~~~~~~~~~v---~~~~~-~ 227 (825) T protein:vir:73 154 TKNGPFEDINVDETVKVYASASTGTITLTASSA-IFGAEQVGKLFYLEQPAVDSVP-VWETSKTTAINDV---RRADS-N 227 (825) T ss_pred ccCCccccccccccceeeecccCceeEEEeecc-ccCchhcCeEEEEecccccccc-eeeeeeEEEeeeE---EECCC-c Confidence 66666655544333332 11111111111111 10000000000 0000000000 0000000000000 00000 0 Q ss_pred eeeEEEecCcCcccceeEEEeccceeecccccCCCeEEE-EEcCCCCCcceEEEEEeecCceEEEeeeeeEeee---Eec Q lcl|NC_015271. 231 IDSLTTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVK-IVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQ---LLF 306 (795) Q Consensus 231 ~~~~~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~-v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~---~~~ 306 (795) .... .......++++.+..|.... +.+........-+.....+.+.++.+...+.... -.. T Consensus 228 ~~~~---------------~~~~~~~t~~~~a~~g~~~~~~~g~~~~~~~~~~~~~~~~~g~~~it~~~~~~~~~~~~~~ 292 (825) T protein:vir:73 228 YYRA---------------NTSGKTGTLRPSHTEGMSWDGWGGTGSDDTGIQWEYLHSGFGIAKITAVAGDGLTATADVV 292 (825) T ss_pred eeee---------------ecccccceeeccccCCceeEeeeeecccCCceEEEEEecCCceEEEeeccccceeeccccc Confidence 0000 00011112333444443222 2222211111111122334444433332221111 112 Q ss_pred cceeEEEEeecCceeeecccCCccccCCcccccccccccCCCccEEEEEcceEEEec----CCeEEEEecCCcccccccc Q lcl|NC_015271. 307 ETMPHALVRAADGNFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLS----GENIILSRTAKYFNFYPAS 382 (795) Q Consensus 307 ~t~p~~~v~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~----~~~v~~Sr~gd~~nF~~~t 382 (795) ..+|+.++..++.++++....|+. .++||++|+||||||+|++ +++|||||+||||||++++ T Consensus 293 ~~~~~~~~~~~~~t~~~~~~~~~~--------------~~gyPs~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~ 358 (825) T protein:vir:73 293 SFIPSQVVGSANASYKWAKYAWNS--------------VNGYPSTVVYYQQRLYFAASTAYPQTIWASRTGDYKDFGKNN 358 (825) T ss_pred eecccccccCCCCCcccccCCccc--------------CCCCccEEEEEcceEEEeecCCCCCEEEEEccCCccccccCC Confidence 235666666677777777766653 2357788999999999995 4899999999999999998 Q ss_pred ccccCCCCcEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCC--ccccccceEEEEEEeecCcCCCCcEEeCCeEE Q lcl|NC_015271. 383 IATLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTAS--GTLTSRSIELNLTTQFDVQDRARPFGIGRNVY 460 (795) Q Consensus 383 ~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~--~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~ 460 (795) +++|||||+++++++++|.|+|++++ ++|+|||+++||+|+++ ++|||+|++++++|.|+++ +|+|+.+|++++ T Consensus 359 --~~~DdD~I~~~~s~~~~~~i~~~~~~-~~L~~~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~~-~~~Pv~vg~~~~ 434 (825) T protein:vir:73 359 --PIQDDDRIIYTYAGRQVNEIRHLIDV-GNLVALTSGGEYTISGDQNKVLTPSAFSFSSQGNNGSS-NVPPIAVANIAL 434 (825) T ss_pred --CCCCCccEEEEEcCCcceeEEEEeec-CcEEEEecCceEEEecCCCcccceeeEEEEeeeeeccc-cccceEeCCeEE Confidence 46899999999999999999999998 58999999999999875 6999999999999999764 799999999999 Q ss_pred EEecCCCeeEEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEe Q lcl|NC_015271. 461 FASPRSSFTSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQS 540 (795) Q Consensus 461 fv~~~g~~~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~a 540 (795) |+|++|+ ++ |++.|++++|+|+++|+|+|++||+++..+..+++++.|.+++|++++||+|++|+|++ ||+|+| T Consensus 435 Fv~~~g~--~v-re~~~~~~~d~~~~~dlt~~a~hl~~~~~~~~~a~~~~p~~~~~~v~~dg~l~~~ty~~---~q~v~a 508 (825) T protein:vir:73 435 FIQEKGS--VV-RDLAYSFDVDGYQGTDLTILANHLFQKHSIVDWSFCIVPYSSAFCIRDDGKLLVLTYLR---DQQVFA 508 (825) T ss_pred EEeCCCC--eE-EEEEEeeecCceeccchhhhhHhhccCCceEEEEEcCCCceEEEEEecCCeEEEEEEec---ccccee Confidence 9999885 34 45678899999999999999999999988888999999999999999999999999974 788999 Q ss_pred eEeeecCCCeEEEEEEEe--CCEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCc----------c- Q lcl|NC_015271. 541 WSHWDFGSNVQVLACQCI--NSDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNG----------T- 607 (795) Q Consensus 541 W~~w~~~g~~~~~~~~~~--~d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~----------~- 607 (795) ||||+++|+++++|+... +|.||++|+|+++++.+||.|++.+.....+++++||||+.+|..... + T Consensus 509 W~~~~~~g~v~~~~~i~~~~~D~l~~iV~R~~~g~~~~yiE~~~~~~~~~~~~~~~vD~g~~~~g~~~~~~l~~l~g~tv 588 (825) T protein:vir:73 509 WAPQSSAGKYESTCSISEGSEDAVYFVVNRTINGQTVRYIERLSSRLFTNDEDAFFVDCGLSYDGRNTSSRTMTISGGTG 588 (825) T ss_pred eEEEecCCcEEEEEEecCCCccEEEEEEEEeeCCceEEEEEEecccccCCCcceeEEEEEeeecccceeeceeeeCCceE Confidence 999999999999998764 589999999999999999999998888888889999999887753210 0 Q ss_pred ---cc---------------cccc-------------------------------cc----------------------e Q lcl|NC_015271. 608 ---YN---------------DDTF-------------------------------TT----------------------T 616 (795) Q Consensus 608 ---~~---------------~~~~-------------------------------~t----------------------~ 616 (795) ++ .... .+ . T Consensus 589 ~~~~~g~~~~~v~~g~itl~~~~~~~i~l~~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~v~~~~~~~a~~~~~~~t~~~ 668 (825) T protein:vir:73 589 DWSYQVDYPVTVSGGAYFVNTDVGAQIQFPYTGTDPDTNEPVAKELRGDIISVTSNTAVVVRFNRNVPPVLRNVATTNWQ 668 (825) T ss_pred EEEeCCeEEEEEcCCeEEecccceEEEEecccCcccccccceeceeeEEEccccCceEEEEEecccccceeeeecccCCC Confidence 00 0000 00 0 Q ss_pred eecccccCCcccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceec Q lcl|NC_015271. 617 LHLPTIYGADFAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTST 696 (795) Q Consensus 617 ~~~~~~~gl~~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~ 696 (795) .+...++||+||||++|.+++||.+++..+|++| +|+|+.++++|||||+|++++++|||++..+ |.++ T Consensus 669 ~a~~~~~gL~hLeG~~v~v~~Dg~~~~~~~V~~G-------~vtl~~~~~~v~vGl~y~~~~~~l~~~~~~~-g~~~--- 737 (825) T protein:vir:73 669 MARQTFSGLAHLEGQTVNILSDASVEPQKTVTGG-------AVTLESPGAVVHIGLPITAEFETLDININGQ-ETLL--- 737 (825) T ss_pred cchheeccccccCCceEEEEECCeeeCCeEecCc-------EEEecCCceEEEEeeCccceEEecccccCCC-cccc--- Confidence 0012458999999999999999999999999887 8999999999999999999999999998643 3322 Q ss_pred cccccEEEEEEEEEeeccceEEEEecCCcccccccccccccccccccccccccccceEEEEe-eecccceEEEEEECCCC Q lcl|NC_015271. 697 EDIGRLQLRRAWVNYEDSGTFDIYVENQSSNWKYTMAGARLGAHVMRTGKLNLGTGQYRFPV-VGNAKFNTVFILSDATT 775 (795) Q Consensus 697 ~~~grl~l~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~-~~~~~~~~v~i~~~~P~ 775 (795) ..++||++++++|.+|.+++++.+....+. .. ..+......++|+++|++++++ .+|+++++|+|+|++|| T Consensus 738 --g~~~ri~~~~~~~~~s~~~~~g~~~~~l~~-~~-----~r~~~~~~~~~~~~tG~~~~~~~~~~~~~~~~~i~q~~Pl 809 (825) T protein:vir:73 738 --DKKQVIPTVTMVVNASRGIWATTPGGTWYE-YP-----QREFEFYDDPVDDATGKVEVKLDSNWDKNGRVKVRQLDPL 809 (825) T ss_pred --CccEEEEEEEEEEEeeeeEEEecCCCcceE-ee-----ccCCCcccCCCccccCcEEEecCCCCCCccEEEEEEcCCC Confidence 225799999999999999999865543221 11 1112334567889999999987 79999999999999999 Q ss_pred CEEEEEEEEEEEEecc Q lcl|NC_015271. 776 PLNIIGCGWEGNYLRR 791 (795) Q Consensus 776 P~tvlsi~~eg~y~~r 791 (795) |||||||..|...+-= T Consensus 810 P~tvlav~~~~~~~g~ 825 (825) T protein:vir:73 810 PLSVLAVLPRLTVGGF 825 (825) T ss_pred CEEEEEEEEEEEecCC Confidence 9999999988876655 No 24 >protein:vir:1778 Length: 680 # NCBI annotation: tail protein A # Family: family:all:825 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570344;genbank:gi:18640503;genbank:GeneID:932716 Probab=100.00 E-value=1.6e-161 Score=902.20 Aligned_cols=541 Identities=26% Similarity=0.458 Sum_probs=480.1 Q ss_pred CCceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCceEEEE Q lcl|NC_015271. 1 MALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESEQYYAV 80 (795) Q Consensus 1 M~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (795) ||+|+|+||||++|||||||.+|+|+||++|+||+|+|+.||+||||++||+.|.+. ..+.++|+++|+++|+|++. T Consensus 1 M~~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRpg~~~i~~l~~~---~~~~~~~~~~rd~~e~~~~~ 77 (680) T protein:vir:17 1 MAAVEQMVPNLLGGISQQPDPLKLPGQVKQARNVQLDPTFGALKRPGTELIMQVTGI---PKRAKWIPIMRDAREHYYVA 77 (680) T ss_pred CccceecchhhhCcceecchhhcCcchhhhhhccccCcCcccccCccceeeeeccCC---CCCceeEEEecCCCCeEEEE Confidence 999999999999999999999999999999999999999999999999999988653 45778999999999999999 Q ss_pred EeCCe--------EEEEec-CCcEEEEEECCCcc-cce-ecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCC Q lcl|NC_015271. 81 FTGTG--------IRVFDL-AGNERQVRYTTDGS-TYI-NTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNP 149 (795) Q Consensus 81 ~~~~~--------~rv~~~-~g~~~~v~~~~~~~-~yl-~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~ 149 (795) +...+ |||||+ +|.+++|....+.. .|+ .++++..+||+++++|++||+|+++.|++... ...+ T Consensus 78 ~~~~g~~~~~~~~i~v~d~~~G~~~~v~~~~~~~~~~~~~~~~~~~~lr~~tv~d~tfi~N~~v~~~~~~~-----~~~~ 152 (680) T protein:vir:17 78 IYREGANESGDLRIRVFDLKAGVERAVSFVGGEVEEYFPGDETDWEAIRSLTIGDYTFLSNPNVQPTTWSR-----SFSR 152 (680) T ss_pred EEcCCCcccccceeEEEEccCCeEEEEEcCCCceEEEeecCCCCccceEEEEEcCEEEEECCeEEEeccCC-----CCCC Confidence 98887 999998 58999998665422 222 24456678999999999999999999987642 2345 Q ss_pred CcccEEEecccccCeEEEEEECCceeEEE--------------------------------------------------- Q lcl|NC_015271. 150 KQDALINVRGGQYGRTLQIIINGNTQATY--------------------------------------------------- 178 (795) Q Consensus 150 ~~~~~~~v~~~~~~~ty~vt~~g~~~a~~--------------------------------------------------- 178 (795) ++.++++++.++|+++|+|++++...... T Consensus 153 ~~~g~~~v~~~ayg~ty~v~ing~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~Ag~~t~~~~~~a~la~~l~~~~~~~~~ 232 (680) T protein:vir:17 153 RPEGLVTIGAAGYGTSYIVDFATEDSGQQRRWAVQEMQAPKTKRKKGDGSPDEAGETTVNNWNGTGLSFRVKVEARAFLV 232 (680) T ss_pred CCeeEEEEEEeeeeeEEEEEEeccccceeeeeeeeeeeccccccccccccCCCCcceeeeeeeeeeeeeeeeeccceeee Confidence 67799999999999999999865211000 Q ss_pred -------------Ee----cCC--C--------------------------------------CcccccccchhHHhHhh Q lcl|NC_015271. 179 -------------QI----PDG--S--------------------------------------QPEHVNNTDAQWLAEEL 201 (795) Q Consensus 179 -------------tt----p~~--s--------------------------------------~~~~~~~~~~~~i~~~l 201 (795) .+ ..+ + ........++++|+.+| T Consensus 233 ~~g~~~~~~y~~~~~l~~tg~~~~~~~~t~~v~~~G~~y~IsI~~~~~~~~~~~~~s~~~~t~~~~~a~~at~~~Ia~~L 312 (680) T protein:vir:17 233 DDGEEYGHNYIPYVTLLTPGNNTSPFPDTIRVDVSGEGWDIKVTKQIQSKVYANLGTAQFTTPVDQSGGGASTSDIVTGL 312 (680) T ss_pred cCCCceEEEEeeEEEEecCCccccccCceEEEecccceeEEEEccceeeEeccCccceeeeeccCCcccceeHHHHHHHH Confidence 00 000 0 00011224567788888 Q ss_pred hhhcccccCceeeeecCceEEEEec--CCcceeeEEEecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcc Q lcl|NC_015271. 202 ARQCRVSAPGWTFNVGQGYIHIIAP--EGQQIDSLTTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSAD 279 (795) Q Consensus 202 ~~~~~s~~~g~t~~~~g~~~~i~~~--~~~~~~~~~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~ 279 (795) ...+ ...+++++.+.++++||... .+.....+++++|++++++.+++++|+++++||++|++||+|+|.++.++..+ T Consensus 313 ~~~i-~~~~~~~~~~~g~~i~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~a~~g~~v~v~~~~~~~~~ 391 (680) T protein:vir:17 313 SAAI-NGLGTFTAESIGNVIRVRYSDPTRTDEFTMSARGGTSGTGLESIKYSVDTLAELPTKCWNDYQVAVRNTQDTEVD 391 (680) T ss_pred HHhh-cccCcEEEEECCCEEEEEeccCCCceEEEeeccCCCCceeeeeeeeeeccccccccccCCCcEEEEEeCCCCccc Confidence 7666 35678999999999999765 44556788889999999999999999999999999999999999999999999 Q ss_pred eEEEEEee--------cCceEEEeeeeeEeeeEeccceeEEEEeecCceeeecccC-------CccccCCcccccccccc Q lcl|NC_015271. 280 QYYVRYDT--------TRKVWSETLGWNVNDQLLFETMPHALVRAADGNFELKRIE-------WSPKTCGDDDTNPWPSF 344 (795) Q Consensus 280 ~yy~~~~~--------~~~~w~E~~~~~~~~~~~~~t~p~~~v~~~~~t~~~~~~~-------w~~~~~gd~~~np~psf 344 (795) +||++|+. ..+.|+||++++...+++.+|||+.|++.+++.|.+...+ |++|.+|||++||+|+| T Consensus 392 ~Yyv~~~~~~~~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~r~~~g~f~~~~~~~~~~~~~~~~r~~Gdd~tnp~psF 471 (680) T protein:vir:17 392 DYYVKFETDVEDADVPGSGYWVETVKNGDDGGLVDDTMPHVLVRNALGDFTFSSLNNSSYGKTWADRSVGSEDTNPHPTF 471 (680) T ss_pred ceEEEEeccCcccCcccccceeecccCcccceeccCcceEEEEEccCceeEEEeeccccccccccccccCCcccCCCccc Confidence 99999986 4558999999999999999999999999999999998875 99999999999999999 Q ss_pred c--CCCccEEEEEcceEEEecCCeEEEEecCCccccccccccccCCCCcEEEEEcCCCceeEEEEeecCCcEEEEecCcE Q lcl|NC_015271. 345 M--DSTINDVFFFRNRLGLLSGENIILSRTAKYFNFYPASIATLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQ 422 (795) Q Consensus 345 ~--~~~~~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q 422 (795) . |++|++|+||||||+|+++++|||||+||||||++++++++.|||||+++++++++|+|+|+++++++|+|||+++| T Consensus 472 ~~~G~~p~~v~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~t~g~q 551 (680) T protein:vir:17 472 TESGNGIYGMFMYKNRLGFLTQDAVIMSQVGDYFNFYATSGVTISDADPIDMATSDTKPVKLEAAISSTSGAILFGNQAQ 551 (680) T ss_pred ccCCCCceEEEEEcceEEEeeCCeEEEEccCCcccccccccccCCCCccEEEEEcCCcceeeeEEeecCCcEEEEecCeE Confidence 8 78999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeCC-ccccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCeeEEEEEEeeccccCceehhhHHHHHHHhcCCCc Q lcl|NC_015271. 423 FVLTAS-GTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGV 501 (795) Q Consensus 423 ~~i~~~-~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~ 501 (795) |+|+++ ++|||+|++++++|+|+|+++|+|+.+|++++|++++|+|++|+|| +|++++|+|+++|||+|++|||+|++ T Consensus 552 ~~ls~~~~~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~~s~vre~-~y~~~~d~y~a~DlT~~a~hl~~g~v 630 (680) T protein:vir:17 552 FRLSSPDESFGPKTATLDKISNYTYESKADPVQTGVSMIFPTNMGTYSSVYEL-STESAKGTPVIEDSSRVIPRLIPSGL 630 (680) T ss_pred EEEecCCceecceeEEEEEEEeecccCCCCceEeCCeEEEeecCCCcceEEEE-eeeeccCceehhhHHHHHHHhcCCce Confidence 999985 6999999999999999999999999999999999999999888876 89999999999999999999999999 Q ss_pred EEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecCCCeE Q lcl|NC_015271. 502 FDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGSNVQ 551 (795) Q Consensus 502 ~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~g~~~ 551 (795) +.+++++++|.+++|++++||+|++|+||++++||+|+|||||++++.-. T Consensus 631 ~~~~~~~~~~~~~~~~~~~~~~l~~~~yl~~~~e~~v~aW~rw~~~~~d~ 680 (680) T protein:vir:17 631 TWSTASMNNDTVFFGNAKKGRNVYVFRFFNEGQERKVAGWTTWYYEDQDH 680 (680) T ss_pred EEEEeeCCCCeEEEEEEcCCCEEEEEEEeeCCCceEEEEEEEEecCCCCC Confidence 99999999999999999999999999999999999999999999988654 No 25 >protein:vir:102644 Length: 594 # NCBI annotation: Hypothetical protein # Family: family:all:780 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024422;genbank:gi:48696643;genbank:GeneID:2948111 Probab=100.00 E-value=1.2e-140 Score=787.68 Aligned_cols=561 Identities=15% Similarity=0.089 Sum_probs=446.8 Q ss_pred CCceeeechhhccc-ccc----CCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCc Q lcl|NC_015271. 1 MALISQSIKNLKGG-ISQ----QPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESE 75 (795) Q Consensus 1 M~~v~~~~~n~~~G-vS~----q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~ 75 (795) |+++ +++||++| ||+ |+|++||+++|++|+||++.|+||++||||++|++.+++++ .+.+|+||.|+++| T Consensus 1 m~~~--~~~~F~~GelsP~l~~r~Dl~~y~~~~~~~~n~~~~~~G~~~rR~G~~~~~~~~~~~---~~~~lipF~~s~~~ 75 (594) T protein:vir:10 1 MADF--SQTSFKGGVIAPRLQFNEYESAYHHSIEDAVNFVVTEQGSLITRCGSEEVGLCQDGE---VRLFRLPAVDAPSN 75 (594) T ss_pred Ccee--eccccCcceecceeccchhHHHHHHHHhhhhceEEEecCCeecCChhHhhhhccCCC---CCEEEEEEEeCCCC Confidence 9997 59999999 665 89999999999999999999999999999999999987753 36799999999999 Q ss_pred eEEEEEeCCeEEEEecCCcEEEEEECCCcccceecC-------CchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCC Q lcl|NC_015271. 76 QYYAVFTGTGIRVFDLAGNERQVRYTTDGSTYINTN-------NPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFN 148 (795) Q Consensus 76 ~y~l~~~~~~~rv~~~~g~~~~v~~~~~~~~yl~~~-------~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~ 148 (795) +|+|||+++++|+|..+|.... ..++.||..++ ....+|+|+|++|+++++|++++|+++.|.++.. |. T Consensus 76 ~~~le~g~~~~r~~~~~~~~v~---~~~~~~~~~~tp~~~t~~~~l~~i~~tqsad~~~~~~~~~~p~~L~R~~~~~-w~ 151 (594) T protein:vir:10 76 DVIVEVGNTNIAVWVNDVRQVV---ANTPSEWRNTIDRIQTAYDTIGDDAGAANTGRLIMVHPALQPKRLYRDNNNA-WQ 151 (594) T ss_pred eEEEEEcCCeEEEEecCcEEEE---ccCCCcccccccceeeccCCccceEEEEEeeEEEEEcCCCCceEEEEccCCC-ce Confidence 9999999999999987775432 22334443322 3467899999999999999999999999864422 21 Q ss_pred CCcccEEEecccccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhcccccCceeeeecCceEEEEecCC Q lcl|NC_015271. 149 PKQDALINVRGGQYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQCRVSAPGWTFNVGQGYIHIIAPEG 228 (795) Q Consensus 149 ~~~~~~~~v~~~~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s~~~g~t~~~~g~~~~i~~~~~ 228 (795) +..+ .| T Consensus 152 -----~~~~-----------------------------------------------------~~---------------- 157 (594) T protein:vir:10 152 -----FVNM-----------------------------------------------------HT---------------- 157 (594) T ss_pred -----EEec-----------------------------------------------------cc---------------- Confidence 0000 00 Q ss_pred cceeeEEEecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccc Q lcl|NC_015271. 229 QQIDSLTTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFET 308 (795) Q Consensus 229 ~~~~~~~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t 308 (795) . + . T Consensus 158 ------------~-----------------------~----------~-------------------------------- 160 (594) T protein:vir:10 158 ------------G-----------------------A----------V-------------------------------- 160 (594) T ss_pred ------------C-----------------------c----------c-------------------------------- Confidence 0 0 0 Q ss_pred eeEEEEeecCceeeecccCCccccCCcccccccccccCCCccEEEEEcceEEEecC----CeEEEEecCCcccccccccc Q lcl|NC_015271. 309 MPHALVRAADGNFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSG----ENIILSRTAKYFNFYPASIA 384 (795) Q Consensus 309 ~p~~~v~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~----~~v~~Sr~gd~~nF~~~t~~ 384 (795) ..+|. ..+||++|+||||||+|+|+ ++|||||+|||+||+++++ T Consensus 161 ----------------p~~~~---------------~~~~p~~v~f~q~RL~f~~~~~~p~~v~~Srtgd~~nF~~~~~- 208 (594) T protein:vir:10 161 ----------------PAEWS---------------PSNYPQTVGIFQNRVWYVGSPVHRTYFWATRAGKLEDIAPSTA- 208 (594) T ss_pred ----------------ccccc---------------CCccceEEEEEeeeEEEEeCCCCCceEEEEecccccccccCCC- Confidence 00000 02466899999999999998 4699999999999999985 Q ss_pred ccCCCCcEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCC--ccccccceEEEEEEeecCcCCCCcEEeCCeEEEE Q lcl|NC_015271. 385 TLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTAS--GTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFA 462 (795) Q Consensus 385 ~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~--~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv 462 (795) +.|||||++.+ +++.+.| |+++++++|+|||+++||+|+++ ++|||+|++++++|.+ +++.|+|+.+|+.++|+ T Consensus 209 -~~ddd~i~~~~-s~~~~~~-~~v~~~~~L~i~t~~~e~~l~~~~~~~lTp~~~~~~~~s~~-g~~~~~P~~vg~~~~fv 284 (594) T protein:vir:10 209 -NNPNDPISFVG-IMEGTPC-WIIASSDVLTIGTTINDYQLAASTGVSVTAATAILRRSSVQ-GTAAVQGIPAEEQVIFC 284 (594) T ss_pred -CCCCccEEEEE-ecccceE-EEEecCCceEEEecCceEEEecCCCcccccceEEEEEeeee-ccCCCcceeeCCeEEEE Confidence 47999999854 5565555 55777889999999999999875 5899999999999975 66899999999999999 Q ss_pred ecCCCeeEEEEEEeeccccCceehhhHHHHHHHhcC------CCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCce Q lcl|NC_015271. 463 SPRSSFTSIHRYYAVQDVSSVKNAEDITAHVQNYIP------NGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEEL 536 (795) Q Consensus 463 ~~~g~~~~v~~~~~~~~~~d~~~~~dls~~~~hl~~------g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq 536 (795) |++|++ +|++.|++++|+|+++|+|+|++|||+ +..+..+++++.|.+++||+++||.|.+++|+ +|| T Consensus 285 ~~~g~~---vre~~y~~~~d~y~~~dlt~~a~hl~~~~~~~~~~~i~~~a~~~~p~~~~~~v~~dG~l~~~ty~---~eq 358 (594) T protein:vir:10 285 SRNKSK---VYAMNYVREQDNWIPDEMSSQAQHLFTPISSAKGASVRRVAYISDAAKSLWVVLENGQINYCCFD---RTT 358 (594) T ss_pred cCCCCE---EEEEEEeeccCceeccchhhhhhhhcCccccccCceEEEEEEecCCceEEEEEeCCCeEEEEEEe---ccc Confidence 998853 455678899999999999999999984 45567788888889999999999999999996 578 Q ss_pred eEEeeEeee-cCCCeEEEEEEE--eCCEEEEEEEeC--CCEEEEEE--EEeeccccCCCCcceeeeeeeeeEeecCcccc Q lcl|NC_015271. 537 RQQSWSHWD-FGSNVQVLACQC--INSDMYVILRNE--FNTFLTRV--SFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYN 609 (795) Q Consensus 537 ~v~aW~~w~-~~g~~~~~~~~~--~~d~l~~~v~R~--~~~~~~r~--~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~ 609 (795) +|.|||||+ ++|.|+++|+.. .+|++|++|+|. +++..+++ .|+.+......++..+|+||..++. T Consensus 359 ~v~aWs~~~~t~G~v~~va~i~~~~~d~l~~~V~R~~ti~g~~~~y~~lE~~~~~~~~~~~~~~~~d~~~~~~------- 431 (594) T protein:vir:10 359 DTKAWTQLELSGGKVIDIAAAFNPDSDYAYVAVVRSKAINGVQKNYTVLEKISSPRTDWKRADGWVVAQVNQN------- 431 (594) T ss_pred ceeeeEeeccCCCcEEEEEEeecCCCCEEEEEEEECCccccceeeEEEeecCCCccccccccceeeeeccccc------- Confidence 899999998 589999988864 479999999995 56777665 3444444444445556777765532 Q ss_pred cccccceeecccccCCcccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccC Q lcl|NC_015271. 610 DDTFTTTLHLPTIYGADFAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVD 689 (795) Q Consensus 610 ~~~~~t~~~~~~~~gl~~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~ 689 (795) ..++||+||+|++|.+++||..++..++..|. .++...++.++++|||||+|+++++++||++++++ T Consensus 432 ----------~~vsgl~hLeg~tv~v~aDG~~~~~~~V~~g~---itL~~~~~~~~~~v~VGl~Y~s~i~~lp~~~~~~~ 498 (594) T protein:vir:10 432 ----------GDVLNLDRYIGRTAVIFSKYGLEAEVEVNNIG---LTHRINGYDPNTVYYVGYKMDSYFRTLTPSNGDMK 498 (594) T ss_pred ----------ceeecccccCCceEEEEeCCeecCCeEEcCCe---eEeeccCCCCcceEEEeeeeeEEEEeecccccCCc Confidence 12568999999999999999999988887652 23444557889999999999999999999999988 Q ss_pred CccceeccccccEEEEEEEEEeeccceEEEEecCCcccccccccccccccccccccccccccce--EEEEeeecccceEE Q lcl|NC_015271. 690 NDGSTSTEDIGRLQLRRAWVNYEDSGTFDIYVENQSSNWKYTMAGARLGAHVMRTGKLNLGTGQ--YRFPVVGNAKFNTV 767 (795) Q Consensus 690 g~~~~~~~~~grl~l~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~--~~vp~~~~~~~~~v 767 (795) |+.++ +|+||+|++|+|.+|.+++++.+.. +++.... .......+..+.+++.+|+ +.+++.||+++.+| T Consensus 499 gs~~g-----~r~ri~r~~v~~~~S~g~~vg~~~~--~~r~~~~-~~~~~~~~~~g~~~~~tg~~~v~~~~~G~~~~~~i 570 (594) T protein:vir:10 499 KSMFG-----SKIRISKVQLALFDSIEPTVNGEPA--DDRSTDD-IMDARLLDFSSNSGSSNGTRLVDYNPLGWENDGKM 570 (594) T ss_pred ccccC-----ccEEEEEEEEEEEcceeeEECCccc--ccccchh-hccccCCcccCcccccCCceEEEEccCCcCcccEE Confidence 75443 4899999999999999999875432 2221111 1112234556677777765 55566799999999 Q ss_pred EEEECCCCCEEEEEEEEEEEEecc Q lcl|NC_015271. 768 FILSDATTPLNIIGCGWEGNYLRR 791 (795) Q Consensus 768 ~i~~~~P~P~tvlsi~~eg~y~~r 791 (795) +|+|++|||||||||.+|...|+- T Consensus 571 ~I~qd~PlPltvlai~~ev~~~~~ 594 (594) T protein:vir:10 571 VIAVEQPFLCEVVGVFSVVQSNKV 594 (594) T ss_pred EEEECCCcCEEEEEEEEEEEeccC Confidence 999999999999999999999999 No 26 >protein:vir:94602 Length: 1012 # NCBI annotation: PfWMP4_35 # Family: family:all:12083 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762665;genbank:gi:115304373;genbank:GeneID:5142302 Probab=99.56 E-value=6.5e-13 Score=87.39 Aligned_cols=761 Identities=12% Similarity=0.139 Sum_probs=318.3 Q ss_pred CCc-----eeeechhhccc--cccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCC Q lcl|NC_015271. 1 MAL-----ISQSIKNLKGG--ISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDE 73 (795) Q Consensus 1 M~~-----v~~~~~n~~~G--vS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~ 73 (795) |-. +++...+=++| +|.-+-.--|. +---..||=++..|-+.||.|+..+....-... ..+.+-+++.--- T Consensus 1 mtqQQ~~eiqG~~t~~F~GL~~s~S~~~IP~~-~SP~~~N~DV~~~G~V~rR~GT~l~~~Y~inn~-s~~~~s~~irt~L 78 (1012) T protein:vir:94 1 MTQQQATEIQGPFTREFSGLDISNSVGAIPVS-GSPVFHNCDVSDDGAVVRRRGTALVNTYNINNA-SGRAWSDTIRTKL 78 (1012) T ss_pred CCcccccccccccccccccccccccccccccc-CCCceEEeecccCcceeehhhhhhhhhhccccc-Ccceeeeeehhhc Confidence 542 33444444555 33322111122 123467999999999999999999976441111 1233444443322 Q ss_pred C-ceEEEEEeCCeEEEEecCCcE----EEEEECCCcccceecCCchhheeEEEEc---CEEEEEeCCcccEEE---Eecc Q lcl|NC_015271. 74 S-EQYYAVFTGTGIRVFDLAGNE----RQVRYTTDGSTYINTNNPRNDLRMVTVA---DYTFIVNRNVRVTRD---TNSV 142 (795) Q Consensus 74 ~-~~y~l~~~~~~~rv~~~~g~~----~~v~~~~~~~~yl~~~~~~~~l~~~q~a---D~~~i~~~~~~p~~~---~r~~ 142 (795) . |+|+|--..+-+.+.-.+|+. ..+...... -.++-.+ +++-|.-+. |.++|.-+++||-.+ .|+. T Consensus 79 G~eYfiLs~~~GLL~~~~~~~~AVG~~K~~a~V~~s--s~~~V~P-ssm~F~~~S~~~~R~LILT~~~~~VQ~~F~E~T~ 155 (1012) T protein:vir:94 79 GSEYFILSNDVGLLISLMRDDEAVGMPKEVAVVSKS--SIWTVPP-SSMCFIPVSAPYDRLLILTPEHPIVQLSFLERTL 155 (1012) T ss_pred cceeEEEecCCceEEEeeecccccccchhhhhhhhh--hccccCC-cceEEEeccCCCCcEEEEcCCCceEEEEEeeeee Confidence 3 444555444444443333321 111111110 0111112 335555443 577888788877553 3332 Q ss_pred cCCCCCCCcccEEE----------------------ecccccCeEEEEEECCce-----eEEEEecCCCCcccccccchh Q lcl|NC_015271. 143 NLAGFNPKQDALIN----------------------VRGGQYGRTLQIIINGNT-----QATYQIPDGSQPEHVNNTDAQ 195 (795) Q Consensus 143 ~~~~~~~~~~~~~~----------------------v~~~~~~~ty~vt~~g~~-----~a~~ttp~~s~~~~~~~~~~~ 195 (795) ....-++ ..+-++ +-..+.+++|.++++.-+ +.....|. +..-..-+.+=+ T Consensus 156 s~T~~t~-~~~~V~~~~a~~~~~~~~L~~~~N~sS~~~~~~~~T~~AmT~~NP~~S~~ls~~~V~~q-tytltirqi~W~ 233 (1012) T protein:vir:94 156 SFTCTTN-HGGGVFSFTAPISVNDTTLWRDTNASSYIVTDAAGTVYAMTQKNPDFSFRLSGSFVVGQ-TYTLTIRQITWQ 233 (1012) T ss_pred eeeccCC-ccceEeecccceeecCeeEEecccccceeEeeccceEEEEEeeCCceeEEEEEEEecCc-ccceeehhhhhh Confidence 2211111 111110 011112334444432100 00000000 000000111112 Q ss_pred HHhHh-------hhhh---cccccCceee-----------------eecCceEEEE---------ecCCccee--eEEEe Q lcl|NC_015271. 196 WLAEE-------LARQ---CRVSAPGWTF-----------------NVGQGYIHII---------APEGQQID--SLTTK 237 (795) Q Consensus 196 ~i~~~-------l~~~---~~s~~~g~t~-----------------~~~g~~~~i~---------~~~~~~~~--~~~~~ 237 (795) |-++. +.+. ++-...+..+ ..-+-.++.. .++++... +-.-+ T Consensus 234 WWAESm~~~G~~~~~~~SRFNV~~~DQ~V~IP~~L~tDiD~v~~~~~~~~l~~~~ss~F~~~~~~~~T~~P~~AD~YG~~ 313 (1012) T protein:vir:94 234 WWAESMYYEGQDMMQNTSRFNVTSIDQNVKIPDRLITDIDPVYKNSQGLGLFVFWSSRFDSNGWAGPTTSPNTADEYGFS 313 (1012) T ss_pred hhhhhHhhhhhHHHhhhhhcccccccccccchhHHhhhhhhhhhccCCccEEEEEeeeecCceeecCCCCCCCccccccc Confidence 22221 1110 1100000000 0000111111 11111000 00000 Q ss_pred cCcCcccceeEEEeccceeecccccCCCe-------------E-----EEEEcCCCCCcceEEEEEeecCc--------- Q lcl|NC_015271. 238 DGYADQLINPVTHYAQSFSKLPTNAPEGY-------------V-----VKIVGDASKSADQYYVRYDTTRK--------- 290 (795) Q Consensus 238 dg~~~t~~~~~~~~v~~~~~l~~~~~~G~-------------~-----v~v~~~~~~~~~~yy~~~~~~~~--------- 290 (795) +|+.-+.+.... .. ....-|.-+.-|. + .+..+..+...++.-+.-+..+- T Consensus 314 ~G~~~tpp~~~~-~A-~L~~aPFF~TFG~~~s~TP~P~~~V~iLR~RELRFN~G~GA~~~~L~V~~D~~~~t~Nnvpfsp 391 (1012) T protein:vir:94 314 GGGRFTPPSLVP-GA-TLQAAPFFITFGGIYSGTPTPINQVNILRLRELRFNGGTGAKPDDLQVYNDTVEHTWNNVPFSP 391 (1012) T ss_pred CCceeccccccc-cc-eeeccceEEEeccccCCCCCChhheeeeeeeeeeeccCCCCCCcceEEEEcceeeeccccccCc Confidence 111100000000 00 0000000000010 1 11111111111222221111111 Q ss_pred ----eEEEe-eeeeEeeeEecc----------------------ceeEEEEeecCceee-ecccCCccccCCccc---cc Q lcl|NC_015271. 291 ----VWSET-LGWNVNDQLLFE----------------------TMPHALVRAADGNFE-LKRIEWSPKTCGDDD---TN 339 (795) Q Consensus 291 ----~w~E~-~~~~~~~~~~~~----------------------t~p~~~v~~~~~t~~-~~~~~w~~~~~gd~~---~n 339 (795) .|..+ ++-++++.+..+ +.|..|--.+..++. ....-|....--+-. .. T Consensus 392 snfqt~atT~~~T~R~~~L~~A~G~~~~~A~Y~A~~GATnnlpanaPL~IS~~sA~s~~~~~R~v~~~~~~T~~~~~~G~ 471 (1012) T protein:vir:94 392 SNFQTWATTYTATDRVITLMSAVGDRFNNANYFAILGATNNLPANAPLHISCLSASSYLGGSRRVWYRNLPTTGGTLDGC 471 (1012) T ss_pred ccccceeeeeeecceeEEEeeeccccccCcceEEEeecccccccCCccccccccceeeeccceeeeeeccccCCceEeee Confidence 23222 122222222111 111111111111110 001112221111100 00 Q ss_pred -----cccc---c-cCCCccEEEEEcceEEEecC----CeEEEEecCC------cccccc-ccccccCCCCcEEEEEcCC Q lcl|NC_015271. 340 -----PWPS---F-MDSTINDVFFFRNRLGLLSG----ENIILSRTAK------YFNFYP-ASIATLSDDDPIDVAVSTN 399 (795) Q Consensus 340 -----p~ps---f-~~~~~~~v~f~q~RL~f~~~----~~v~~Sr~gd------~~nF~~-~t~~~~~DdD~i~~~~~~~ 399 (795) .... + .+..+.--+.||.||++++. ..+.+|.+|| |+||+. +..+.-.|.||+++.+++. T Consensus 472 Y~r~YGiG~~~~Y~~~~F~~I~TiY~~RLiL~~~s~~~~~~~~S~~GD~~~~G~~Y~F~QiTD~L~G~~tDPF~L~VtSe 551 (1012) T protein:vir:94 472 YVRAYGIGKYVDYSKRSFHAIGTIYRDRLILVNPSTATDQLLISEIGDATVPGEFYQFMQITDMLQGVTTDPFTLNVTSE 551 (1012) T ss_pred EEEEEEeeeeeecCCccccceeeeeeeeeEEeccCCCcceEEEeecCCcccCceeeeeeeeehhhccCcCCceeEEEccc Confidence 0000 0 13446667899999999975 4589999877 899985 5556778899999999998 Q ss_pred CceeEEEEeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCeeEEEEEEeecc Q lcl|NC_015271. 400 RIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQD 479 (795) Q Consensus 400 ~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~~~v~~~~~~~~ 479 (795) -.+.|..++...+.|++||..+-|.+.|++.++|..-.+...|+|+.-+.-.-|+..-.|+|..+.| ++.++. .- T Consensus 552 ~~e~iT~~~~WQ~~LFV~T~~~T~~~~GGe~~~~s~~~VN~vSt~G~~N~~~VV~T~~~V~Ym~~~G----~F~L~~-k~ 626 (1012) T protein:vir:94 552 GRERITAVTGWQKRLFVFTGSNTYSIEGGEQFGESSYAVNLVSTYGAFNQNCVVVTNLTVLYMNKFG----LFDLMN-KP 626 (1012) T ss_pred ccceeeeeeeeceeEEEEeccceEeeccccccchhHHHHHhHHhhcccCcceEEEeeeEEEEeeccc----eeeccC-Cc Confidence 8888999999999999999999999999999999999999999996655555688888999998876 556543 45 Q ss_pred ccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeE-----------EeeEeeecCC Q lcl|NC_015271. 480 VSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQ-----------QSWSHWDFGS 548 (795) Q Consensus 480 ~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v-----------~aW~~w~~~g 548 (795) +.|+|.+.+-|+.+..+|..- ...+-.+...+....+.+.||+ =|+.+.|.-+ .+|+..++.| T Consensus 627 ~~~~Y~A~ErSvKIR~~F~~~----~~ss~~~~~Wl~~~e~~~~LYi--~L~~~~dT~~~S~~~~~N~~~DSWs~~~s~~ 700 (1012) T protein:vir:94 627 NTDSYGAFERSVKIRGLFQNL----AGSSGDNLHWLRYNESSNKLYI--GLAAEGDTRTTSRNLMLNFTWDSWSTLSSAA 700 (1012) T ss_pred cCCcchhhhhhhhhhhhhhhh----ccccccceeeeeeccCCceEEE--EecCCCcchhhhhhhhhhhhhcchhhhhccC Confidence 789999999999999998542 1222222222222333334433 2232322221 2788888877 Q ss_pred CeEEEEE-EE-eCCEEEEEEEeCCCEEEEEEE----EeeccccCCCCcceeeeeeeeeEeecCcccccccccceee---- Q lcl|NC_015271. 549 NVQVLAC-QC-INSDMYVILRNEFNTFLTRVS----FTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLH---- 618 (795) Q Consensus 549 ~~~~~~~-~~-~~d~l~~~v~R~~~~~~~r~~----~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~---- 618 (795) .|+.-.. .. .++...+.|.-....++-+.. ....++....- +|--|+.+..+.-.+-+.+....+.+ T Consensus 701 ~Fq~YP~V~~~~~~t~L~~i~~~~TV~ML~~~~~~YiDFatirthiy---pF~~CaG~~~~~Vms~~~GIY~~~~P~tP~ 777 (1012) T protein:vir:94 701 PFQMYPAVQLFKYMTWLTNINAPLTVAMLATEMPFYIDFATIRTHIY---PFTFCAGQRDVSVMSDSRGIYNLPLPVTPG 777 (1012) T ss_pred CcccchhhhhhhhhhhhhhhcCchhhhhhhhccceeeeeehhccccc---ceeeeccceeeEEEecCCceEEecccccce Confidence 7763322 11 122111111111111110000 00000000000 11122222211111111111111100 Q ss_pred ------------------ccccc-CC--------------------cccCceEEEEecCCcccccceeeeccCCC----- Q lcl|NC_015271. 619 ------------------LPTIY-GA--------------------DFAKGKITVLEADGKITEFEEPEVGWKND----- 654 (795) Q Consensus 619 ------------------~~~~~-gl--------------------~~~~g~~v~~~adg~~~~~~~~~~g~~~~----- 654 (795) ++.-+ |. +.-.|+++.+.-.....++.+..-.+-++ T Consensus 778 I~~~tit~ss~~~~k~Yq~~T~~~GT~tLt~~~~~~~~~~~l~LL~~~~~~~~~a~V~~~~~~~~TT~~TV~~N~~~~lQ 857 (1012) T protein:vir:94 778 ILDYTITASSKAGAKTYQRNTASAGTETLTLRNPMMDYADTLELLGGNVNASQFAMVMSNGFEPYTTYPTVTYNGVAPLQ 857 (1012) T ss_pred eeeeEeeccchhhhheeccccccccceeeeecChhhhcCcEEEEecCCCCccEEEEEeecccccccccceEEecceeeee Confidence 00000 00 01112222222111111111111111000 Q ss_pred ------ceEEEecCCC-CcEEEEeEeeeEEEEecceeEEccCCccceeccccccE-EEEEEEEEeecc--ceEEEEecCC Q lcl|NC_015271. 655 ------PELRLNGNLE-GSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRL-QLRRAWVNYEDS--GTFDIYVENQ 724 (795) Q Consensus 655 ------~~~~i~~~~~-~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl-~l~~~~~~~~~t--~~~~v~v~~~ 724 (795) ..+....-.. ...+.+|.-|.|..+-+-+.+ ..-||| ||.++.|-|.-+ ..++..+..+ T Consensus 858 ~T~~~GS~L~~~~~LsqN~~~~~G~~Y~S~Y~SP~F~L-----------~SL~~LKr~K~~~L~~Dttvtsqlkynltsg 926 (1012) T protein:vir:94 858 WTVTGGSGLNNRPILSQNNNCIMGMIYPSVYASPIFDL-----------ESLGRLKRLKKLHLQMDTTVTSQLKYNLTSG 926 (1012) T ss_pred EEEecCCccccccccccCceEEEeecchhhhcchhhhh-----------hhhhhhhheeeeeEEeeeeeeeeeeeehhcc Confidence 0000000001 346888999998877554432 122453 677877776543 4444444332 Q ss_pred ccc-ccc--cccccccccccccccccc---------c-ccceEEEEeeecccceEEEEEECCCCCEEEEEEEEEEEEecc Q lcl|NC_015271. 725 SSN-WKY--TMAGARLGAHVMRTGKLN---------L-GTGQYRFPVVGNAKFNTVFILSDATTPLNIIGCGWEGNYLRR 791 (795) Q Consensus 725 ~~~-~~~--~~~~~~~~~~~~~~~~~~---------~-~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r 791 (795) ... ... -+....-.-.++...+.. + ..-+..+|+.|..-+.++.|.+-.--.+.+-+.+++.+=-+- T Consensus 927 fsqvsvlntawvavvsnynenivpavvsyqvgnsyeirrvvelsiplqgygcdyqfyiasvgaeafklaayefdiqpqrd 1006 (1012) T protein:vir:94 927 FSQVSVLNTAWVAVVSNYNENIVPAVVSYQVGNSYEIRRVVELSIPLQGYGCDYQFYIASVGAEAFKLAAYEFDIQPQRD 1006 (1012) T ss_pred cceeeeecceeeeeeeccCccccceeeeeecCCceeeeEEEEEeecccccccceeEeeeeccccceeeeeeeeccccchh Confidence 211 000 000000000111110000 0 012445688888888999999999999999999988764322 Q ss_pred ccCC Q lcl|NC_015271. 792 SSGI 795 (795) Q Consensus 792 ~rrv 795 (795) .|-| T Consensus 1007 kryv 1010 (1012) T protein:vir:94 1007 KRYV 1010 (1012) T ss_pred hhhc Confidence 2222 No 27 >protein:vir:80177 Length: 1027 # NCBI annotation: tail tubular protein B # Family: family:all:12083 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285795;genbank:gi:148747829;genbank:GeneID:5220453 Probab=99.15 E-value=4.7e-10 Score=71.72 Aligned_cols=725 Identities=14% Similarity=0.143 Sum_probs=284.0 Q ss_pred CCc-----ee-----eechhhccc--cccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEE Q lcl|NC_015271. 1 MAL-----IS-----QSIKNLKGG--ISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHL 68 (795) Q Consensus 1 M~~-----v~-----~~~~n~~~G--vS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~ 68 (795) |-. -+ +...+=++| +|.-+=.--|. +---+.||=++..|-+.||.|+..+....++. +.+-++ T Consensus 1 mvnsferrtQQ~~dlG~~s~~F~GL~~t~S~~~IP~~-~SP~~~N~DV~~~G~V~kR~GT~i~~~Y~~t~----~~~t~~ 75 (1027) T protein:vir:80 1 MVNSFERRTQQGDDLGIRSSNFGGLNTTASPLNIPYE-DSPNLLNVDVDVSGNVSKRQGTEILLKYANTT----PVYTFP 75 (1027) T ss_pred CCcchhhhhcccccccccccccccccccccccccccc-CCCceEEeecccCcceeehhhhhhhhhhccCC----ceeeee Confidence 211 00 122233344 23222111111 12246789999999999999999997655332 345555 Q ss_pred EEeCCCceEEEEEeCCeEEEEecCCcEE----EEEEC-----CCcccceecCCchhheeEEEEcCEEEEEeCCcccEEE- Q lcl|NC_015271. 69 INRDESEQYYAVFTGTGIRVFDLAGNER----QVRYT-----TDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRD- 138 (795) Q Consensus 69 f~~~~~~~y~l~~~~~~~rv~~~~g~~~----~v~~~-----~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~- 138 (795) +.---.-.|+|--..+-+.+--.+|+.. .+... ....||.. -.--.-|.++|.-+.+||-.+ T Consensus 76 vks~LG~dYvLt~~~GLL~~~~~~~~AVG~~K~~s~V~~aa~~~V~P~F~--------~~S~~~~R~LILT~~~~~VQ~~ 147 (1027) T protein:vir:80 76 VKSVLGYDYVLTKSGGLLEVAGVIGKAVGAYKSFSNVFSAAAANVKPYFT--------LLSDVEPRVLILTGTNTPVQVK 147 (1027) T ss_pred ehhhccceeeEecCCceEEEeeecccccccchhhhhhhhhhhcccCceeE--------EccCCCCcEEEEcCCCceEEEE Confidence 5444444566655555454443333211 11000 11112211 011233567777777776543 Q ss_pred --EecccCCCCCCCcccEEEecc-----------------------cccCeEEEEEECCceeEEEEecCCCC--cccccc Q lcl|NC_015271. 139 --TNSVNLAGFNPKQDALINVRG-----------------------GQYGRTLQIIINGNTQATYQIPDGSQ--PEHVNN 191 (795) Q Consensus 139 --~r~~~~~~~~~~~~~~~~v~~-----------------------~~~~~ty~vt~~g~~~a~~ttp~~s~--~~~~~~ 191 (795) .|+.....-++ ..+-+ +++ ...+++|.++++. .|+-+- .....- T Consensus 148 F~E~T~t~T~~s~-~~~~V-~~~~s~~~~~~~~L~~~~N~tS~~~~~~~~T~~AlT~~N-------lP~~S~~mt~~~V~ 218 (1027) T protein:vir:80 148 FVEQTFTTTSGSP-TTTVV-IPNASRFQYDTPILYMNRNFTSGATYSYNSTTRALTISN-------LPSWSGSMTFDLVL 218 (1027) T ss_pred EeeeeeeeeccCC-ccceE-eecccceeecCeeEEecccccceeEeeccceEEEEEecc-------CCcceeEEEEeEEe Confidence 33322211111 11111 111 1112233333321 011110 000000 Q ss_pred cchhHHhH-------hhhhh---cccccCceee-----------------eecCceEEEE---------ecCCcceeeEE Q lcl|NC_015271. 192 TDAQWLAE-------ELARQ---CRVSAPGWTF-----------------NVGQGYIHII---------APEGQQIDSLT 235 (795) Q Consensus 192 ~~~~~i~~-------~l~~~---~~s~~~g~t~-----------------~~~g~~~~i~---------~~~~~~~~~~~ 235 (795) ..=+|-++ ++.+. ++-......+ ..-+-.++.+ .++++... . T Consensus 219 ~~W~WWAESl~~~G~~~~~~~SRFNV~~~DQ~V~IP~~L~sDlD~i~~~~~~~~m~~~~ta~F~~~~~~~~T~~P~~-A- 296 (1027) T protein:vir:80 219 PVWSWWAESLRWFGDRFYDAVSRFNVNKADQSVAIPAALRSDLDTIQGTYGRYPMLLYKTATFNDTYTFSNTGQPAN-A- 296 (1027) T ss_pred cchhhhhhHHhhhhhHHHhhhhhcccccccccccchhHHhhhhhhhhhccCCccEEEEEeeeecCceeecCCCCCCC-c- Confidence 01112111 11110 1000000000 0000111111 11111000 0 Q ss_pred EecCcCcccceeEEEecccee---ecccccCCCe-------------E-----EEEEcCCCCCcceEEEEEeecCce--E Q lcl|NC_015271. 236 TKDGYADQLINPVTHYAQSFS---KLPTNAPEGY-------------V-----VKIVGDASKSADQYYVRYDTTRKV--W 292 (795) Q Consensus 236 ~~dg~~~t~~~~~~~~v~~~~---~l~~~~~~G~-------------~-----v~v~~~~~~~~~~yy~~~~~~~~~--w 292 (795) -+=|+.+-. ...+-.-+ .-|.-+.-|. + .+..+..+...++.-+.-+..+-. | T Consensus 297 D~YG~~~G~----~~~~~~~A~L~~sPFF~TFG~~~t~TP~P~~~V~lLR~RELRFN~G~GA~~~~L~V~~D~~~~s~N~ 372 (1027) T protein:vir:80 297 DSYGWGDGS----VYNVGASAYLNTSPFFATFGDTRTPTPQPPETVHLLRQRELRFNYGNGATGANLRVTVDGTALSANY 372 (1027) T ss_pred ccccccCCc----eEeecccceeeccceEEEeccccCCCCCchhheeeeeeeeeeeccCCCCCCcceEEEEcceeeeeee Confidence 000000000 00000000 0010000111 1 111111111112222211111112 2 Q ss_pred EEe-eeeeEeeeEec--------------------cceeEEEEeecCceeee--cccCCccccCCccc---cc-----cc Q lcl|NC_015271. 293 SET-LGWNVNDQLLF--------------------ETMPHALVRAADGNFEL--KRIEWSPKTCGDDD---TN-----PW 341 (795) Q Consensus 293 ~E~-~~~~~~~~~~~--------------------~t~p~~~v~~~~~t~~~--~~~~w~~~~~gd~~---~n-----p~ 341 (795) ..+ ++-++.+.+.. +| |..|--.+..++.- +..-|....--|-. .. .. T Consensus 373 ssT~~~T~R~~~L~~A~G~~~~~A~dlayY~A~~GAT-PL~IS~~aA~t~~~~~R~yi~~~~~~T~~~~~~G~Y~k~YGl 451 (1027) T protein:vir:80 373 SSTVAGTNRAYALYKADGTLCTSASDLAYYIAFTGAT-PLGISPTAAVTITNVDRTYIGSAATQTDNAYVQGGYFKVYGL 451 (1027) T ss_pred eeeeeecceeEEEeeeccccccccccceeeeeeeccc-cccccccceeeeecCceeeeeeeccccCCceEeeeEEEEEEe Confidence 211 11222222111 11 11111111111100 00012211110000 00 00 Q ss_pred c---cc-cCCCccEEEEEcceEEEecC----CeEEEEecCC------cccccc-ccccccCCCCcEEEEEcCCCc-eeEE Q lcl|NC_015271. 342 P---SF-MDSTINDVFFFRNRLGLLSG----ENIILSRTAK------YFNFYP-ASIATLSDDDPIDVAVSTNRI-AILK 405 (795) Q Consensus 342 p---sf-~~~~~~~v~f~q~RL~f~~~----~~v~~Sr~gd------~~nF~~-~t~~~~~DdD~i~~~~~~~~~-~~i~ 405 (795) . .+ .+..|.--+.||.||++++. ..+.+|.+|| ++||+. +..+.-.|.||+++.++++|. +.|. T Consensus 452 G~~~~Y~~~~F~~I~TvY~~RLvL~~~t~~~~~~~~S~~GD~~~~G~~Y~F~QvTD~L~G~~sDPF~L~VsSsq~~d~vT 531 (1027) T protein:vir:80 452 GLWANYGTGQFPRIATVYQSRLVLGGFTNDPTRVVFSATGDTVEGGVKYNFFQVTDDLDGLDSDPFDLVVSSSQADDYVT 531 (1027) T ss_pred eeeeecCCccccceeeeeeeeeEEeccCCCcceEEEeecCCcccCceeeeeeeeehhhccCcCCceeEEEecccccceee Confidence 0 01 24456778899999999975 4599999877 899985 555677889999999988554 5588 Q ss_pred EEeecCCcEEEEecCcEEEEeCCcc-ccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCeeEEEEEEeeccccCce Q lcl|NC_015271. 406 YAVPFSEELLIWSDEAQFVLTASGT-LTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDVSSVK 484 (795) Q Consensus 406 ~~v~~~~~L~l~t~~~q~~i~~~~~-lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~~~v~~~~~~~~~~d~~ 484 (795) -++...+.|++||..+-|.+.|++. ++|..-.+...|+|+.-+.-.-|+....|+|.++.| ++.++. .-+.++| T Consensus 532 ~~~~WQ~~LFV~T~~~T~~~~GGd~t~~~a~~~VN~iSs~G~~N~~~VV~T~~~V~Yl~~~G----~F~L~~-r~~~~~Y 606 (1027) T protein:vir:80 532 GLVEWQSSLFVLTRRATFRANGGDATISPARRFVNYISSLGLVNPFSVVRTDTAVFYLSDSG----VFNLTP-RVEDGEY 606 (1027) T ss_pred eeeeeceeEEEEecceeEEeecCccccchhHHHHHHHHhhcccCcceEEEeeeEEEEeeccc----eeeccC-CccCCcc Confidence 8999999999999999999998775 999999999999997666566688899999998876 556543 4578999 Q ss_pred ehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeE-----------EeeEeeecCCCeEEE Q lcl|NC_015271. 485 NAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQ-----------QSWSHWDFGSNVQVL 553 (795) Q Consensus 485 ~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v-----------~aW~~w~~~g~~~~~ 553 (795) .+.+-|+.+..+|..- ...+-.+...+....+.+.||+ =|+.+.|.-+ .+|+..++.|.|+.- T Consensus 607 ~A~EkSiKIR~~F~~~----~~ta~~~~~Wm~~~q~~~~LYv--~L~~~~eT~~~S~~~~~N~~~DSWt~~~t~~~Fk~Y 680 (1027) T protein:vir:80 607 QAIEKSIKIRKVFGKT----TSTAVSSAAWMSFDQNRKVLYV--ALPRGSETTVASALYVYNTFRDSWTQYDTLGGFKTY 680 (1027) T ss_pred hhhhhhhhhhhhhhhh----ccccccceeeeeeccCCceEEE--EecCCCcchhhhhhhhhhhhhcchhhhhcccCcccc Confidence 9999999999998542 1122222222222233333333 2333333222 378888888887754 Q ss_pred EE----E-EeCCEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeeccc------- Q lcl|NC_015271. 554 AC----Q-CINSDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPT------- 621 (795) Q Consensus 554 ~~----~-~~~d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~------- 621 (795) .. . +.++...+.|.-....++-+....+ -.| +|--| .+..+.-.+-++++...+.|.+. T Consensus 681 tghP~V~~~~~~s~L~~v~~~~TV~ML~~~~~~-YvD-------FF~~C-G~~~~~Vlt~~~GIY~~~~P~wnsP~I~~~ 751 (1027) T protein:vir:80 681 TGHPYVDTVLGDSFLLMVAYGGTVCMLKLYGSR-YVD-------FFNKC-GSFTGNVLTANSGIYTWTAPFWNSPVISNI 751 (1027) T ss_pred cCCchhhhhhhhhhhhhhcCchhhhhhhhhcch-hhh-------hhhhc-ccceeeEEecCCceeEeecccccCCeeeEE Confidence 21 1 1233333333333333222211000 001 12223 33333333344444444444322 Q ss_pred -ccCCcccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccc Q lcl|NC_015271. 622 -IYGADFAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIG 700 (795) Q Consensus 622 -~~gl~~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~g 700 (795) ++|...+..+++..-.|-.+.++..++.- .|- ..+.+...|-+..- +--.+++...++++... T Consensus 752 svs~tt~~~~q~Ye~~T~~~vvpydnvedl-------siy--vnGT~Ls~~~~~~~--~~~~i~LL~~~~~~~~~----- 815 (1027) T protein:vir:80 752 SVSGTTTLAVQRYELPTDLQVVPYDNVEDL-------SIY--VNGTRLSFGTDWVK--QGKAIYLLSDPGDGKTV----- 815 (1027) T ss_pred Eeeccchhhhheeccccccccccccccccc-------eee--ecceeEeecCchhh--cCCEEEEecCCCCcceE----- Confidence 22333444445554455555555444421 110 11112222211111 11122333333333211 Q ss_pred cEEEEEEEEEee--------ccceEEEEecCCcccccccccccccccccccccccccccceEEE----------Ee---- Q lcl|NC_015271. 701 RLQLRRAWVNYE--------DSGTFDIYVENQSSNWKYTMAGARLGAHVMRTGKLNLGTGQYRF----------PV---- 758 (795) Q Consensus 701 rl~l~~~~~~~~--------~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~v----------p~---- 758 (795) -+|-|+-+++. +|..-.|.++.-..- .++...++...+.. .+-+....+.+ |+ T Consensus 816 -s~Vprcpvnvsy~~~~~~~~TT~~TV~~N~~~~i---Q~Tdy~~~GS~L~~-~~~LtN~~~~~G~~Y~S~Y~SP~F~L~ 890 (1027) T protein:vir:80 816 -SIVPRCPVNVSYQGDVTFDETTAQTVWVNNLLQI---QGTDYTLSGSTLTF-TDTLTNAVVEVGNAYISYYQSPMFLLG 890 (1027) T ss_pred -EEEecccccccccccccccccccceEEecceeee---ccceeeeccCcccc-ccccccceEEEeecchhhhcchhhhhh Confidence 12434333322 111111211110000 00000111110000 00111111111 11 Q ss_pred -e---ecccceEEEEEECCCCCEEEEEEEEEEEEec-----cccCC Q lcl|NC_015271. 759 -V---GNAKFNTVFILSDATTPLNIIGCGWEGNYLR-----RSSGI 795 (795) Q Consensus 759 -~---~~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~-----r~rrv 795 (795) . ++-+..-+..-.++-||.--++=--.|+=-. -.-|- T Consensus 891 SL~~LKk~K~~~L~~Dnedvlpvytigdlasgqdvddlvgkwktra 936 (1027) T protein:vir:80 891 SLSNLKKVKHVYLYFDNEDVLPVYTIGDLASGQDVDDLVGKWKTRA 936 (1027) T ss_pred hhhhhhheeeeEEEEcCCcceeeeeeccccCCCchhHhhhhhcccc Confidence 0 1112222333333444432211111111000 00000 No 28 >protein:vir:2625 Length: 715 # NCBI annotation: gp27 # Family: family:all:5234 # MgeID: mge:55 # MgeName: SIO1 # Cross-refs: genbank:acc:NP_064766;genbank:gi:9964636;genbank:GeneID:1263056 Probab=99.14 E-value=1.3e-09 Score=69.23 Aligned_cols=627 Identities=12% Similarity=0.102 Sum_probs=276.9 Q ss_pred CCcee--eechhhccc-cccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhh-----hhcCCCCcccCcEE--EEEE Q lcl|NC_015271. 1 MALIS--QSIKNLKGG-ISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLK-----TLGGSDTLGPAPYI--HLIN 70 (795) Q Consensus 1 M~~v~--~~~~n~~~G-vS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~-----~l~~~~~~~~~~~l--~~f~ 70 (795) |++-. -....|++| |-+--.+-==++..-.-+||.....|--+||-|+-|-. .+..+ ++..+ +.+. T Consensus 1 m~~~~~~~~vNtFv~GliTEas~ltfpqnasiDe~N~~l~rdG~r~RR~g~~~E~~~vls~~~vp----~galv~~~~W~ 76 (715) T protein:vir:26 1 MPQSLTQRTVNTFIKGLITEASELTFPENASVDELNCSLGRDGTRRRRKAVTLEDNHVLSDVVVP----EGALVQTLDWY 76 (715) T ss_pred CCcccchhHHhhhhhheeeccccccCCccceeeeeeeeecCCCcchhhccceeecceEEEEEeec----Cceeeeeechh Confidence 99733 456889999 44433333234455567899999999889998876532 11111 11111 1111 Q ss_pred --eCCCceEEEEEeCCe-EEEEecCCcEEE-EEECCC-cccceecC---Cchh-heeEEEEcCEEEEEeCCcccEEEEec Q lcl|NC_015271. 71 --RDESEQYYAVFTGTG-IRVFDLAGNERQ-VRYTTD-GSTYINTN---NPRN-DLRMVTVADYTFIVNRNVRVTRDTNS 141 (795) Q Consensus 71 --~~~~~~y~l~~~~~~-~rv~~~~g~~~~-v~~~~~-~~~yl~~~---~~~~-~l~~~q~aD~~~i~~~~~~p~~~~r~ 141 (795) -+.-..-||++..++ +.+|...+..+. ..+... ...|-.+. .|.+ .++++.+..+++|+||..-|..+.-. T Consensus 77 na~G~v~~~~livqvg~~l~f~q~t~~pLs~~n~~~svdl~~~~~~vn~SPsh~~v~v~~~~G~livanp~i~~~~~~~d 156 (715) T protein:vir:26 77 NVAGQVNLEFLVVQVNNILYFYEKSTDPLSANKYSGSVDLNTHSASNNLSPSEERVQVTSLNGYLIVASPAINTFYLGFN 156 (715) T ss_pred hcccccCcEEEEEEeccEEEEEeccCCccccCceeEEeeecceecccccccceeEEEEEEeeeEEEEecCCccEEEEEec Confidence 122345555555444 666665442211 111000 00111111 1222 47888889999999999988776443 Q ss_pred ccCCCCCCCcccEEEecccccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhcccccCc----eeeeec Q lcl|NC_015271. 142 VNLAGFNPKQDALINVRGGQYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQCRVSAPG----WTFNVG 217 (795) Q Consensus 142 ~~~~~~~~~~~~~~~v~~~~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s~~~g----~t~~~~ 217 (795) ..+-.+++....+-...++--+..|... ..| .......+++-+ .+..+.++.+ +-+... T Consensus 157 ~~t~s~t~~~ll~r~r~f~~qg~d~~~g------~~y-------~~~gt~~tn~~i----ynlyN~gw~~p~gt~~~N~~ 219 (715) T protein:vir:26 157 TSTEAFTATSISFKERDFEWQGSDVDVT------SLY-------FGEGTSVSNQRI----YDTYNVGWVGPKGSAALNTY 219 (715) T ss_pred CCcceeEeeEEEEEeeeheeeccccccc------ccc-------ccCCcccCchhh----eecccceeecceeEEEEcCC Confidence 3222222111111111111111111110 011 000111111111 1111111111 111111 Q ss_pred CceEEEEecCCcceeeEEEecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeee Q lcl|NC_015271. 218 QGYIHIIAPEGQQIDSLTTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLG 297 (795) Q Consensus 218 g~~~~i~~~~~~~~~~~~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~ 297 (795) +. ||..++.+..+.- .+++-.. ++.. .|.|.-- T Consensus 220 ~~--yiVypa~s~~~~S-------------~kd~n~a---fsk~-----------------------------ad~ei~t 252 (715) T protein:vir:26 220 GS--YIVYPALTHPWYS-------------GKDANGA---FNKA-----------------------------DWLEIYT 252 (715) T ss_pred CC--ceEecccccccCC-------------Ccccccc---cChh-----------------------------hcccccc Confidence 11 2222222211100 0000000 0000 0111000 Q ss_pred eeEeeeEeccceeEEEEeecCceeeecccCCccccCCcccccccccccCCCccEEEEEcceEEEec------CCeEEEEe Q lcl|NC_015271. 298 WNVNDQLLFETMPHALVRAADGNFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLS------GENIILSR 371 (795) Q Consensus 298 ~~~~~~~~~~t~p~~~v~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~------~~~v~~Sr 371 (795) +-.....-|.+. +...+-+. .-+. .+ ..+.++|++.|.+|.+|++ +..|.+|| T Consensus 253 -----Gt~~~~~G~yi~---D~~~~g~~-~lee-ev-----------~k~R~rsv~~yaGrV~yagiD~dkng~rilfSq 311 (715) T protein:vir:26 253 -----GSSLASNGHYVL---DVFNKART-GLTT-EV-----------ETGRFRSVAAYAGRVFYAGIDSAKNGGKVYFSR 311 (715) T ss_pred -----ccccccCceEEE---eeeecCCc-cchh-hh-----------hcCCCcceeeecceEEEeecccccCCCeEEEeh Confidence 000000001110 00000000 0000 00 0244578999999999994 34699998 Q ss_pred cCC--------cccccccccc--ccCCCCcEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCC-ccccccceEEEE Q lcl|NC_015271. 372 TAK--------YFNFYPASIA--TLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTAS-GTLTSRSIELNL 440 (795) Q Consensus 372 ~gd--------~~nF~~~t~~--~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~-~~lTP~~~~~~~ 440 (795) .=+ |.+=++++.. .+.|.|...+.+.+.. .|.-|+.++..|+||...+.|+|.|. ...|.++..+.+ T Consensus 312 Lv~s~~di~nCyQd~DPTsee~~dLidTDGg~iri~gah--~ii~Lv~f~~sLlvf~~NGVWAi~G~d~g~tATdY~ltK 389 (715) T protein:vir:26 312 LTERMSDVGNCYQVNDPTSEVLSDLLDTDGGVVRIPDAH--NIRKLHVLGASLLVFAENGVWAVAGVDNVFRATEYAITR 389 (715) T ss_pred hhcchhhcccccccCCCchhhhhhhhhcCCCEEEecCCC--CceeEEEecceEEEEEecceEEEeccCCceeeeeeEEEE Confidence 643 4444454422 4778999999997754 36668999999999999999999774 589999999999 Q ss_pred EEeecCcCCCCcEEeCCeEEEEecCCCeeEEEEEEeeccccCceehhhHH-HHHHHhcCCC---cE--EEEEeCCCCeEE Q lcl|NC_015271. 441 TTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDVSSVKNAEDIT-AHVQNYIPNG---VF--DICGSSTENFCA 514 (795) Q Consensus 441 ~s~~~~~~~~~Pv~vg~~v~fv~~~g~~~~v~~~~~~~~~~d~~~~~dls-~~~~hl~~g~---~~--~~~~~~~~~~~~ 514 (795) ++..+|++.=.=|++|+.++|-.++| |+..++.+ .-.-+.++.|| ..+..|.+.= .+ ....+-.-+.-+ T Consensus 390 Is~vg~sspnSvVvv~~~i~~WsdtG----Iyal~~Nd-~fn~~tAqNLTekTIq~~~~~I~~dk~knVtg~fd~~e~rV 464 (715) T protein:vir:26 390 ISDVGLSNENSFVVADGIPIWWGKTG----IYAVQQSE-NLNTPTAQNLSLSTIQTLWNNISNAKKAQVTVEYDKINQRV 464 (715) T ss_pred eeeeccCCCccEEEecceEEEeeCCc----EEEEEecc-ccCcchhhccchHHHHHHHhhcchhhhcceEEEEEccCCEE Confidence 99999998888899999999999887 56665543 13458899999 7777776431 11 122343344455 Q ss_pred EEEEcCCCEEEEEEEeeCC-CceeEEeeEeeec---CCCeEEE-EEEE--------e----------------------- Q lcl|NC_015271. 515 VLSQGDQSKIFMYKFLYLN-EELRQQSWSHWDF---GSNVQVL-ACQC--------I----------------------- 558 (795) Q Consensus 515 ~~~~~~dg~l~~~ty~~~~-~eq~v~aW~~w~~---~g~~~~~-~~~~--------~----------------------- 558 (795) .|..-+..++.-|+|-.-- =+-...|+-+|.. .|...++ .... . T Consensus 465 yW~yPn~dt~vdykyd~vLV~dLalgaFYp~~v~~~a~~~~~~ig~~~~~~~~~~~t~~~vv~~~~v~~~g~~~~v~~~~ 544 (715) T protein:vir:26 465 FWFYPDNDESVDYKYNNILVMDLALQAFYPWRVEDEASSTSYIIGTSYYGGLGSTSTETQVVNGADVVVNGSDNVVATLY 544 (715) T ss_pred EEEEcCCceeeceeecCeEEEEecccccccccccccccccceeeeeeeeCCcccccchhheeccceEEEeccceEEEEee Confidence 6666555566555552100 0011224455543 2221111 1000 0 Q ss_pred -----CCEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeecccccCCcccCceEE Q lcl|NC_015271. 559 -----NSDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIYGADFAKGKIT 633 (795) Q Consensus 559 -----~d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~~g~~v 633 (795) +|.-|.++.|.+. ..||.+.-. .+.-|||.+. ++...|. +.|.. T Consensus 545 r~~~~~~~~~~~~~~~~~--~~~~~f~~~-------~~~~~~dw~s---~d~~~~~------------------~~gy~- 593 (715) T protein:vir:26 545 RDYLEGDSEIKLLVRDGT--TGKMTFATF-------RGDTYLDWGS---ADYKSFA------------------EAGYD- 593 (715) T ss_pred cccccccceEEEEEEcCC--ceeEEEecc-------cCceeeeccc---cchhhHH------------------Hhhhh- Confidence 1111122222110 111111110 0112344432 0111000 00000 Q ss_pred EEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEEEEEEeec Q lcl|NC_015271. 634 VLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRRAWVNYED 713 (795) Q Consensus 634 ~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~~~ 713 (795) +-|....+..+.-. ...+++++|. -|.=|-.| +|..+ -+-|+ .+.++..+ T Consensus 594 ---~~gd~~~~k~~pyv---t~~~~~tedg---~v~~~~g~----~p~n~---------------sSclm--~~sw~ws~ 643 (715) T protein:vir:26 594 ---FMGDITTFKNAPYV---TTYMRVTEDG---YVASGAGY----EFINP---------------SSCLM--SVSWNLSK 643 (715) T ss_pred ---hcccceeeecCceE---EEEEEEeccc---ceeccCCc----cccCC---------------cceEE--EEEeeecc Confidence 00000000000000 0000111110 00000001 01110 01111 11222222 Q ss_pred cceEEEEecCCccccccccccccccccccccccc-ccccceEEEEeeecccceEEEEEECCCCCEEEEEEEEEEEEeccc Q lcl|NC_015271. 714 SGTFDIYVENQSSNWKYTMAGARLGAHVMRTGKL-NLGTGQYRFPVVGNAKFNTVFILSDATTPLNIIGCGWEGNYLRRS 792 (795) Q Consensus 714 t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r~ 792 (795) ++. .+ ...+.+..+++..+.+.+.+- |..+-.-...++|..+-.+++|++...-.|+|++.+.-|--|+.+ T Consensus 644 s~s------t~--~eaYk~~~~~~~~p~~~s~~~yp~~~VvTKsriRG~Gr~~~~rf~s~~gKdlhl~Gysilg~~~~~~ 715 (715) T protein:vir:26 644 SGS------TP--REIYKLKDVPVVNPNDLSSINYPTDTVVTKSKVRGRGRSMKFRFESVAGKDFHLVGYEVIGAKNNSY 715 (715) T ss_pred CCC------Ch--hhhheecceeeeCCCccccccCCcceeEeeeeeeccceEEEEEEEecCCcceEEEeEEEEecccCCC Confidence 211 11 111222222232222222111 111111133567888999999999999999999999999988888 No 29 >protein:vir:95475 Length: 771 # NCBI annotation: hypothetical protein ORF038 # Family: family:all:5234 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294631;genbank:gi:149408197;genbank:GeneID:5237042 Probab=98.20 E-value=2.7e-06 Score=51.09 Aligned_cols=653 Identities=12% Similarity=0.100 Sum_probs=276.5 Q ss_pred CCcee--eechhhccc-cccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhh----hcCCCCcccCcEE----EEE Q lcl|NC_015271. 1 MALIS--QSIKNLKGG-ISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKT----LGGSDTLGPAPYI----HLI 69 (795) Q Consensus 1 M~~v~--~~~~n~~~G-vS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~----l~~~~~~~~~~~l----~~f 69 (795) |++-+ -....|++| |-+--.+-==++..-.-+||.....|--+||-|+-|-.. |.+.-. +++.++ +.+ T Consensus 1 m~~~~~~~~vNtFv~GliTEas~ltfpqnasiDe~N~~l~rdG~r~RR~g~~~E~~~~~vls~~~v-pa~g~~~v~~~~W 79 (771) T protein:vir:95 1 MAKTTNAAEFNTFVGGLITEASPLTFPQNASIDEVNFILNRDGSRNRRNGMDFENGATKVVCNTLV-PADGTIAVTSHNW 79 (771) T ss_pred CCcccchhHHhhhhhheeeccccccCCccceeeeeeeeecCCCcchhhceeeeecCCceEEEEEEe-cccceEEeeeech Confidence 99743 456889999 444333333345555678999999999999988765321 000001 111222 111 Q ss_pred E--eCCCceEEEEEeCCe-EEEEecCCcEE-EEEECCCcccceecCC---chhheeEEEEcCEEEEEeCCcccEEEEecc Q lcl|NC_015271. 70 N--RDESEQYYAVFTGTG-IRVFDLAGNER-QVRYTTDGSTYINTNN---PRNDLRMVTVADYTFIVNRNVRVTRDTNSV 142 (795) Q Consensus 70 ~--~~~~~~y~l~~~~~~-~rv~~~~g~~~-~v~~~~~~~~yl~~~~---~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~ 142 (795) . -++-..-||++..++ +.+|...+..+ ...+ |+.+.. |.+.|+++.+..+++|+||..-|..+.-.. T Consensus 80 ~na~G~v~~~~livqvg~~l~f~q~t~~pLs~~n~------~~~a~~nlSPsh~isv~v~~G~livanp~i~~~~~~~d~ 153 (771) T protein:vir:95 80 ENAGGEVGRWISLVQVGTELKFFQTTGETLSEGNF------YNYQFVNMSPSHKLSYAVVDGLLVVANGSRDIYVFEYDS 153 (771) T ss_pred hhcccccCcEEEEEEeccEEEEEecCCCcccccce------eeeecceeccceeEEEEEeeeEEEEecCCccEEEEEecC Confidence 1 122345556555544 66776555222 1111 222222 334588888889999999998887764322 Q ss_pred cCCCCCCCcccEEEecccccCeEEEEEECCcee---EEEEecCCCCcccccccchhHHhHhhhhhcccccCceeeeecCc Q lcl|NC_015271. 143 NLAGFNPKQDALINVRGGQYGRTLQIIINGNTQ---ATYQIPDGSQPEHVNNTDAQWLAEELARQCRVSAPGWTFNVGQG 219 (795) Q Consensus 143 ~~~~~~~~~~~~~~v~~~~~~~ty~vt~~g~~~---a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s~~~g~t~~~~g~ 219 (795) .+-.+.+.... +.-+++ ++...+|..- ..+-+ .....+++-+ .+.-..+|...... T Consensus 154 ~t~s~t~~~ll-~r~rf~-----~q~~~~G~d~~~~~~~~~-------~gt~~tn~~i-------ynlyN~gw~~pk~~- 212 (771) T protein:vir:95 154 GSVSVTTKRLL-VRDLFG-----VQDIVNGVDLRQGNDIAT-------RPTVQTNAHI-------YNLRNQTFGVPRVT- 212 (771) T ss_pred CcceeEeeeee-eeehhh-----ccccccccceeccccccc-------CCcccCchhh-------eeccccceeccccc- Confidence 21111111000 000010 0001111110 00000 0000000000 00111112111000 Q ss_pred eEEEEecCCcceeeEEEecCcCcccce-eEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeee Q lcl|NC_015271. 220 YIHIIAPEGQQIDSLTTKDGYADQLIN-PVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGW 298 (795) Q Consensus 220 ~~~i~~~~~~~~~~~~~~dg~~~t~~~-~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~ 298 (795) . +.++... .++.. ..+....+.+ .|.|-+.+ +...|+|...- T Consensus 213 -----~--------------~snt~~~~iV~~y----~a~~g~~pS~------------sd~~N~a~--~k~~~~Ei~t~ 255 (771) T protein:vir:95 213 -----W--------------HSNEPSDPIVTFR----SAASGKFPSN------------SDSVNLAL--SKRADVEPSTT 255 (771) T ss_pred -----c--------------ccCCccccceEee----eccCCCCcCC------------ceeecccc--chhhccceeee Confidence 0 0011000 00000 0011111111 11111000 11234454443 Q ss_pred eEeeeEeccceeEEEEeecCceeeecccCCcc-ccCCc-ccccccccccC-----------CCccEEEEEcceEEEecC- Q lcl|NC_015271. 299 NVNDQLLFETMPHALVRAADGNFELKRIEWSP-KTCGD-DDTNPWPSFMD-----------STINDVFFFRNRLGLLSG- 364 (795) Q Consensus 299 ~~~~~~~~~t~p~~~v~~~~~t~~~~~~~w~~-~~~gd-~~~np~psf~~-----------~~~~~v~f~q~RL~f~~~- 364 (795) +.....+..-.|......+.+-|...+..-.. +...- +.+.|+|+..- +..+.|+=|-.|.|+++. T Consensus 256 ~~f~~~~~~~~~~Gt~~~~~G~yi~da~~~g~~~Lt~~ve~~gr~~s~~~~~~~l~~~~t~~~~~~vaeyagRvwYag~~ 335 (771) T protein:vir:95 256 DRFRAEDIVLNPIGTYETARGFFIIDAMARGKSRLEEIVKLKQRYPSLSFGVSSLPQDETPGGASVVCEYAGRVWYAGFS 335 (771) T ss_pred cccchhhhhhcccCcccccCcceeeehhhhcccccceeeeccccchhhhccccccccccCCCCceeEEeeeeeEEEecce Confidence 22222211111111111122222211110000 00000 11333333211 233569999999999872 Q ss_pred --------------CeEEEEecCC--------cccccccccc--ccCCCCcEEEEEcCCCceeEEEEeecCCcEEEEecC Q lcl|NC_015271. 365 --------------ENIILSRTAK--------YFNFYPASIA--TLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDE 420 (795) Q Consensus 365 --------------~~v~~Sr~gd--------~~nF~~~t~~--~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~ 420 (795) ..|.+||.=| |.+=++++.. .+.|.|...+.+.+.. .|.-|+.+++.|+||... T Consensus 336 ~~~iD~dkng~~~~~~ilfSqLv~s~~di~nCyQd~DPTsee~~dLidTDGg~iri~gah--~ii~Lv~f~~sLlvfc~N 413 (771) T protein:vir:95 336 GQIIDGDDQSPRLVSYILFSQLVDSPADIVNCYQDGDPTSTEEPELVDTDGGFIRIEGAH--DIINLVNVGSAVMVVAAN 413 (771) T ss_pred eEEeeccccCCceeeeEeeehhhcchhhcccccccCCCchhhhhhhhhcCCCEEEecCCC--CceeEEEecceEEEEEec Confidence 1489998633 4444454422 4778999999998754 366689999999999999 Q ss_pred cEEEEeC-C-ccccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCeeEEEEEEeeccccCceehhhHH-HHHHHhc Q lcl|NC_015271. 421 AQFVLTA-S-GTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDVSSVKNAEDIT-AHVQNYI 497 (795) Q Consensus 421 ~q~~i~~-~-~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~~~v~~~~~~~~~~d~~~~~dls-~~~~hl~ 497 (795) +-|+|.| + ...|.++..+.+++..+|++.=.=|++|+.++|-.++| |+..++.+ -.-+.++.|| ..+..|. T Consensus 414 GVWAi~ggsd~g~tAtdY~ltKIs~vg~sspnSvVvvg~~i~ywsdtg----Iyal~~Nd--fn~~tAqnLTekTIq~~~ 487 (771) T protein:vir:95 414 GIWMIQGGSDYGFTATNYLVTKISEHGCSSPNSVVVVDNSFMYWGDDG----IYHLTRNQ--YGDYVANNLTEKTIQKYY 487 (771) T ss_pred ceEEEEeccCCceeeeeeEEEEeeeeccCCCccEEEecceEEEeeCCc----eEEEeecc--cCcchhhccchHHHHHHH Confidence 9999954 3 48999999999999999998888899999999999887 56666554 3458899999 7777776 Q ss_pred CCCc---EE--EEEeCCCCeEEEEEEcC--C-C--EE--EEEEEeeCCCceeEEeeEee---ec-CCCeEEE-EEE---- Q lcl|NC_015271. 498 PNGV---FD--ICGSSTENFCAVLSQGD--Q-S--KI--FMYKFLYLNEELRQQSWSHW---DF-GSNVQVL-ACQ---- 556 (795) Q Consensus 498 ~g~~---~~--~~~~~~~~~~~~~~~~~--d-g--~l--~~~ty~~~~~eq~v~aW~~w---~~-~g~~~~~-~~~---- 556 (795) +.=. +. ...+-.-+.-+.|...+ | + -+ ++|. -...|+-+| +. .|...+. ... T Consensus 488 ~~I~~dk~knVtg~fd~~e~rvyw~yPn~~D~~~e~~t~LV~d-------LalgaFYp~~i~~~~ag~l~~~vg~~~~p~ 560 (771) T protein:vir:95 488 EKIPSDAILNATGFYDSYDKKVKWLYNTVLDGRTEPVTELVFD-------LALGAFYPSKIGSLTAGRLPIPVGSVKIPP 560 (771) T ss_pred hhcchhhhcceEEEEEccCCEEEEEecceecCCCcceeeeeee-------ecccccccccccccccCccceeeeeeecCc Confidence 4311 11 12222223333343321 0 0 01 1222 123366666 32 2222100 000 Q ss_pred ----EeCCEEEEE------------EE---e-CCCEEEEEEEEeeccc----cCCCCcceeeeeeeeeEeecCccccccc Q lcl|NC_015271. 557 ----CINSDMYVI------------LR---N-EFNTFLTRVSFTKSTV----DLQGEPYRAFMDMKIRYMIPNGTYNDDT 612 (795) Q Consensus 557 ----~~~d~l~~~------------v~---R-~~~~~~~r~~~~~~~~----~~~~~~~~~~lD~~~~~~~~~~~~~~~~ 612 (795) ..+.++-+- |. | ..-+-++.+....+.. .+..-.+.-|+|.+. +++.+ T Consensus 561 ~~lv~T~~eV~v~~~~v~~tG~~vtV~~~~r~~~~~~~~y~~~~~dg~~g~~~Fa~~~~~~f~DW~s---v~~~~----- 632 (771) T protein:vir:95 561 YKLVETGEEVTVASEQVTATGELVTVKVSTRSPVIRETKYIIVEKLSSPMRISFGGYTDEEFVDWKS---VDGIG----- 632 (771) T ss_pred cccccccceEEecceeeEecCCceEEEEEEeeccccceEEEEEEecCCCeeEEeccccCcceeeccc---CCCcc----- Confidence 001111110 00 0 0000000000000000 000000001222211 00000 Q ss_pred ccceeecccccCCcccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEE-----ecceeEEc Q lcl|NC_015271. 613 FTTTLHLPTIYGADFAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYE-----FSKFRIKQ 687 (795) Q Consensus 613 ~~t~~~~~~~~gl~~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~-----~~~~~~~~ 687 (795) +|-...- ..|. .+--.++|..+--+++ +-.=++-+ T Consensus 633 ------------------------vdy~sy~----~~gY------------~~~gd~~~~k~~PYit~y~~~tedg~v~~ 672 (771) T protein:vir:95 633 ------------------------VDAPAYL----LTGY------------LAGGDYQREKFVPYITFHFKKTEDGFVED 672 (771) T ss_pred ------------------------cchHHHH----Hhhh------------hccchheeeeccceEEEEEEeecccceec Confidence 0000000 0000 0000111111111110 00111111 Q ss_pred cCCccceeccccccEEEEEEEEEeeccceEEEEecCCccccccccccccccccccccc---ccccccceE--EEEeeecc Q lcl|NC_015271. 688 VDNDGSTSTEDIGRLQLRRAWVNYEDSGTFDIYVENQSSNWKYTMAGARLGAHVMRTG---KLNLGTGQY--RFPVVGNA 762 (795) Q Consensus 688 ~~g~~~~~~~~~grl~l~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~tg~~--~vp~~~~~ 762 (795) ..|+--. ..+-+-|+ .+. +........+.+.-.+...++.+..+..+ +.-.....+ ...++|.. T Consensus 673 ~~g~~~p-~n~sSclm--~~s--------w~ws~s~~t~k~~~~~eaYk~~~~~~p~~~~~~~yp~~~VV~TKsriRG~G 741 (771) T protein:vir:95 673 AEGDWTP-TNQSSCMV--QSQ--------WSWTNSPASNKWGRTWQAYRFRRHFFPDNIDNQFDDGNSVVETKSRLRGSG 741 (771) T ss_pred ccccccc-cCCcceEE--EEE--------eeeecCCCCCccccchheeeecceeccCCcchhcCCccceeeeeheeeecc Confidence 1110000 00001111 111 22222222222222333333433222111 111111122 23567888 Q ss_pred cceEEEEEECCCCCEEEEEEEEEEEEeccc Q lcl|NC_015271. 763 KFNTVFILSDATTPLNIIGCGWEGNYLRRS 792 (795) Q Consensus 763 ~~~~v~i~~~~P~P~tvlsi~~eg~y~~r~ 792 (795) +-.+++|++...-.|+|++.+.--..|-.. T Consensus 742 r~~~~rf~s~~gKdlhl~Gysil~~~~~~~ 771 (771) T protein:vir:95 742 KVLSLYITTEPKKNLHIYGWSMLVDVNGTV 771 (771) T ss_pred eEEEEEEEecCCcceEEEeEEEEEeecCcC Confidence 999999999999999999999888888776 No 30 >protein:vir:3133 Length: 911 # NCBI annotation: hypothetical protein # Family: family:all:5234 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640315;genbank:gi:21234408;genbank:GeneID:956056 Probab=97.97 E-value=8.5e-06 Score=48.38 Aligned_cols=686 Identities=12% Similarity=0.100 Sum_probs=267.8 Q ss_pred CCceeeechhhc--cc-cccCCcHHHhhhh-hhhhhcceeeccCCceeCCch-------HhhhhhcCCCCcccCcEEEE- Q lcl|NC_015271. 1 MALISQSIKNLK--GG-ISQQPDILRYPDQ-GSQQVNGWSSESEGLQKRPPM-------VFLKTLGGSDTLGPAPYIHL- 68 (795) Q Consensus 1 M~~v~~~~~n~~--~G-vS~q~d~~ry~~~-~~~~~N~~~~p~gGl~rRpGt-------~~v~~l~~~~~~~~~~~l~~- 68 (795) |+.-.+....|+ +| |-+ -.+..|..- +-..+||=....|=-+||-|. +|+..+..... ++..|.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 77 (911) T protein:vir:31 1 MAARKGAVNRFTPVRGWVTE-GNLANYGQDVALDVENMDIEKTGLTQRRFGLFAETSSEQFLSTFTATAR--ARGLLAVK 77 (911) T ss_pred Cccccccccccccceeeeec-CchhhcCceeEeeeccccchhcccchhheeeeeccchhhhhhhhhhhhh--hcceeehh Confidence 887767666664 33 221 123333222 335789988888888888775 34443332221 2222211 Q ss_pred -E--EeCCCceEEEEEeCCe-EEEEecCC------cEEEEEECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEE Q lcl|NC_015271. 69 -I--NRDESEQYYAVFTGTG-IRVFDLAG------NERQVRYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRD 138 (795) Q Consensus 69 -f--~~~~~~~y~l~~~~~~-~rv~~~~g------~~~~v~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~ 138 (795) + ..+..+.-||+|..++ +.|..... -+++...-..+- -|.. ..-+-....---...+|.||...|.-+ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (911) T protein:vir:31 78 EWREAWGDKDVNMLIFHAGYKVHVVQDTAPLRDANILLTIDLLEAGI-KLDG-VIDSPVHISVGVGFAIITNPRIEPVLI 155 (911) T ss_pred hHHHhhCCCcceEEEEecCcEEEEEecccCccccceEEEeeeeccCc-eeee-eecCceeEEeeceEEEeecCccceEEE Confidence 1 1244566788888777 22322111 111111100000 0000 000112223333567788988887554 Q ss_pred EecccCC--CCCCCcccEE-----------EecccccCeEEEEEE------CCceeEEEEecCCCCcccccccch-hHHh Q lcl|NC_015271. 139 TNSVNLA--GFNPKQDALI-----------NVRGGQYGRTLQIII------NGNTQATYQIPDGSQPEHVNNTDA-QWLA 198 (795) Q Consensus 139 ~r~~~~~--~~~~~~~~~~-----------~v~~~~~~~ty~vt~------~g~~~a~~ttp~~s~~~~~~~~~~-~~i~ 198 (795) .-..-.+ -..++-.++. +.-+-.|+.+.+..- +|=.+.+-.+.+-+-.+. ....+ ++.- T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 234 (911) T protein:vir:31 156 KLDDVDDEGVPTLSYEPLTLLIRTRELLTPYTTGTNYGDTLTPEEEWNLYNSGWATITRATKDKSGSGT-VYVNPVQYYF 234 (911) T ss_pred EeeccCccCcccccccceeeEeeehhhccccccccccCcccCchhhcccccccceeeeeecccCCccce-EEEchhheee Confidence 2211111 0111111111 011122332222111 111111111111000000 00000 0000 Q ss_pred Hh---------hhhhcccccCceeeeecCceEEEEecCCcceeeEEEecCcCccc-------------cee-E-EEe--- Q lcl|NC_015271. 199 EE---------LARQCRVSAPGWTFNVGQGYIHIIAPEGQQIDSLTTKDGYADQL-------------INP-V-THY--- 251 (795) Q Consensus 199 ~~---------l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~~~~~~~~dg~~~t~-------------~~~-~-~~~--- 251 (795) .+ |.+.+. +. ++..+..+.+-.-+.+.- +.+ . .++ T Consensus 235 ~~~~~~~~~~~~~~~~~-~~-----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 296 (911) T protein:vir:31 235 DKRGVYPSHSVLYNSMK-QE-----------------SAKEIVALNVFSPWADEKINFGTTTPPLGRYIHSAYYFDSAAI 296 (911) T ss_pred cccCcCcchhhhhhhhh-hh-----------------ccceeEEEeeeccccccccccccCCCchhhhhhhheeecccee Confidence 00 000000 00 000000000000000000 000 0 000 Q ss_pred -ccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeEEEEeecCceeeecccCCcc Q lcl|NC_015271. 252 -AQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALVRAADGNFELKRIEWSP 330 (795) Q Consensus 252 -v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~~~v~~~~~t~~~~~~~w~~ 330 (795) .-.+.+|.+...+|. +.+++.. =+|.. .+.+++.-....-+-..+.++.+|. T Consensus 297 ~~~~~~~~~~~~~~~~------~~~~~p~------------~~e~~---np~gl~~igt~~n~k~~a~~~~~~~------ 349 (911) T protein:vir:31 297 LSLGIGNLTPPTSDGT------TEGSGPA------------EEEIS---NPIGLDNIGTVNNLKLIAEGTVRWT------ 349 (911) T ss_pred eeecccccCCCCCCCc------cCCCCCc------------hhhhc---CCCCcccccchhceeeeeccceeee------ Confidence 000111111111220 0000000 00000 0112221110111111233333322 Q ss_pred ccCCcccccccccccCCCccEEEEEcceEEEecC-----CeEEEEecCC--------cccccccccc--ccCCCCcEEEE Q lcl|NC_015271. 331 KTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSG-----ENIILSRTAK--------YFNFYPASIA--TLSDDDPIDVA 395 (795) Q Consensus 331 ~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~-----~~v~~Sr~gd--------~~nF~~~t~~--~~~DdD~i~~~ 395 (795) +...|+|++||.+|++|+.. ..|.+|+.-+ |++=++++.. .+.|.|-..+. T Consensus 350 --------------~~~r~r~~~~yaGRVfyaD~dkngk~rIlFSqLv~sl~di~nCYQdaDPTSeee~DLIdTDGg~vr 415 (911) T protein:vir:31 350 --------------VKDRPRCSGYHNGHVYFGDRDKNGKTRILVSQLVNSLDNIPKCFQDADPTAEEINDLIATDGFTMY 415 (911) T ss_pred --------------ecccccceeeeccEEEEeeeccCcceeEEEEeeccccccccccccCCCccccccchhhhcCCcEEe Confidence 22346999999999999953 3699998743 5555555432 35567888888 Q ss_pred EcCCCceeEEEEeecCCcEEEEecCcEEEEeCCc--cccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCeeEEEE Q lcl|NC_015271. 396 VSTNRIAILKYAVPFSEELLIWSDEAQFVLTASG--TLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHR 473 (795) Q Consensus 396 ~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~--~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~~~v~~ 473 (795) +... ..|+-|+.+++.|++|..++.|.|.|.+ ..|.++..+.+.+..+|++.=.=|++|+.++|-++.| |+. T Consensus 416 i~ga--h~Ii~LV~~G~sLlVFcaNGVWAI~G~d~~g~TATdy~ItKIsdvGcsspNSVVvVgn~i~fWSd~G----Iya 489 (911) T protein:vir:31 416 PVGM--GAPITMVEFNKRLLLLCTNGVWAIRGTSGGGATATDFTLDKVASVEFNSPQSVVDIGTAIVFWSERG----IIA 489 (911) T ss_pred cCCC--CCceEEEEecCeEEEEEeCcEEEEeccCCCceeeeeeEEEEEeeeeeCCCCeEEEecCceEEeeCCc----EEE Confidence 7664 4588899999999999999999997754 7999999999999999998888899999999999987 556 Q ss_pred EEeeccccCceehhhHH-HHHHHhcCC----CcEEEE-EeCCCCeEEEEEEc---CCCEEEEEE----EeeCCCceeEEe Q lcl|NC_015271. 474 YYAVQDVSSVKNAEDIT-AHVQNYIPN----GVFDIC-GSSTENFCAVLSQG---DQSKIFMYK----FLYLNEELRQQS 540 (795) Q Consensus 474 ~~~~~~~~d~~~~~dls-~~~~hl~~g----~~~~~~-~~~~~~~~~~~~~~---~dg~l~~~t----y~~~~~eq~v~a 540 (795) +.+.+ -.-+.++.+| ..+..|.+. .+...+ .+-..+..+.|..- +.++++.+. +.. .-...+ T Consensus 490 Lganq--fnD~tAnNLTesTIQ~y~d~I~~dkIkNVtgtyd~de~rVyW~yPn~lDe~teykt~~~~ILVf---dLatga 564 (911) T protein:vir:31 490 IGVND--FGDLTSNNLTENTIDEYYDSLDRDIIKNVKGTFINDENRVYWVVPNKQDSNGEYKTDGELVLVL---NLDTGG 564 (911) T ss_pred Eeecc--cCccccccccHHHHHHHHhhcChhhhceEEEEEEccCCEEEEEecCccCCccceeecCceEEEE---EeccCc Confidence 65554 3347888888 556666542 111112 12222223344432 333444321 111 111347 Q ss_pred eEeeecCCCeEE---------------E----------------------EEEE--eCCEEEEE-EEeCCCEEEEEEEEe Q lcl|NC_015271. 541 WSHWDFGSNVQV---------------L----------------------ACQC--INSDMYVI-LRNEFNTFLTRVSFT 580 (795) Q Consensus 541 W~~w~~~g~~~~---------------~----------------------~~~~--~~d~l~~~-v~R~~~~~~~r~~~~ 580 (795) |-+|+..+.-.. + +... .+...|++ +|...++++.-+.+- T Consensus 565 FYPwtvs~gpLl~~p~y~Lv~TreEvtvPi~~etgaiIve~gsdPV~~tl~vdttGvDg~ayLl~frdg~~g~~~f~a~~ 644 (911) T protein:vir:31 565 FYKHTVSGGPLLHAPFRRLVNTRAEVSIPITETDGTVITDTLGDPVTVTRTVTTTGVDGLAYFASFDDGVNGQFNFIAEH 644 (911) T ss_pred ccceeeecceeecccccccccccccceeeEEeecceEEEecCCCCeEEEEeeecccccceeEEEeeccCCcceEEEEEee Confidence 888875332111 0 0000 12456666 333344443332222 Q ss_pred ecc--ccCCC------CcceeeeeeeeeEeecCcccccccccceeecc--------cccCCcccCceEE----------- Q lcl|NC_015271. 581 KST--VDLQG------EPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLP--------TIYGADFAKGKIT----------- 633 (795) Q Consensus 581 ~~~--~~~~~------~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~--------~~~gl~~~~g~~v----------- 633 (795) +.- .|+.. ..|.-|.|.+-.|...-.+.-+....++..+. +...-.|++=..| T Consensus 645 ~~~~~~dw~~~~~~~~~~y~s~~~~~y~~~~~~~~~~~~pyi~sy~~~~~rv~~~~y~~~~a~~~f~~~~~~~~~~~~~~ 724 (911) T protein:vir:31 645 QPWGFADWANVPNMTRVNYSSYVDFAYEYPEVMIGNISLPYIHSYYLTGIRVQTEQYTTETAHLSFHRVQAHQTTALGTV 724 (911) T ss_pred cCCeeeccccCccccccchhHHHHhhhhhhhhhhhcccCceeeeeeeeeeEEeccceeeecccceeEeeecccceeeeee Confidence 111 01111 11111222222221111111111111111000 0000011111111 Q ss_pred ---------------EEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecce--------------- Q lcl|NC_015271. 634 ---------------VLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKF--------------- 683 (795) Q Consensus 634 ---------------~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~--------------- 683 (795) .-+....+++...++..+.+. + .++.++.++..|-+ .-....|+ T Consensus 725 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vVNGDA---E-~GtmTGWtvtaG~~--d~~Ta~p~~rGSyfFa~~nn~n~ 798 (911) T protein:vir:31 725 TFHKVDMMVSTGMQVISFHKDDLLRTEAVTLVNPDA---E-TGDATGWTVTAGTL--DVRTAAPLYQGSYYFWSDSNANF 798 (911) T ss_pred eeeeeeehhhccceeeeeccccceeeeeeEEEcCCC---C-CCCCCcceeeccch--hhccCCchhcceEeEcCCCCcch Confidence 000000111111111000000 0 11222333333321 11111111 Q ss_pred -eEEccCCccceeccccccE-----------------EEEEEEEEeeccceEEEEecCCccccccccccccccccccccc Q lcl|NC_015271. 684 -RIKQVDNDGSTSTEDIGRL-----------------QLRRAWVNYEDSGTFDIYVENQSSNWKYTMAGARLGAHVMRTG 745 (795) Q Consensus 684 -~~~~~~g~~~~~~~~~grl-----------------~l~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~ 745 (795) .+++.+ +.....-.-|++ -..++.+.+.+-.|-...+.+.++ +.+........... T Consensus 799 aL~QDID-SagaaaIDAG~v~ynvSawl~gyAaqnd~Dr~~l~vEfLDAsGTVL~~tdTgr-----~sa~d~ltrntdt~ 872 (911) T protein:vir:31 799 AAYQDID-PVGGGYITAGELANNVIEAKLSWAARGNTDLGTVYIECLDAIGTVLASTDTTR-----FSGHDTWTRYGDAV 872 (911) T ss_pred hhheecc-ccccceeeeccchhhhhhhhhhhccCCCCccceEEEEEEeccCceecceecCc-----ccceeeeecccccc Confidence 011111 000000001111 012344455444442222222221 22223332222345 Q ss_pred ccccccceEEEEee----------ecccceEEEEEECCC Q lcl|NC_015271. 746 KLNLGTGQYRFPVV----------GNAKFNTVFILSDAT 774 (795) Q Consensus 746 ~~~~~tg~~~vp~~----------~~~~~~~v~i~~~~P 774 (795) ..|..+-++++-+. +.-++.++.++...- T Consensus 873 ~lP~GTRTiRI~Lv~TrvAgndnDgYaDDISL~L~~~~~ 911 (911) T protein:vir:31 873 VLPTDTDTIRVWIVGTLVATNDVNYYVDDIQLNLEVHNV 911 (911) T ss_pred cCCCCCcEEEEEEEEEEeccCccceEEeceEEEEEEecC Confidence 67777777777552 122556677665544 No 31 >protein:vir:8837 Length: 513 # NCBI annotation: constituent protein # Family: family:all:4957 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775245;genbank:gi:27476043;genbank:GeneID:2700591 Probab=97.56 E-value=4.4e-05 Score=44.44 Aligned_cols=481 Identities=13% Similarity=0.093 Sum_probs=224.9 Q ss_pred CCceeeechhhccccccCCcHHHhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCCceEEEE Q lcl|NC_015271. 1 MALISQSIKNLKGGISQQPDILRYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDESEQYYAV 80 (795) Q Consensus 1 M~~v~~~~~n~~~GvS~q~d~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (795) ||...+..-.-.|.|--+-+.+-=.++-..+.|+++. .|++.||||-.=+.+..... -.-+++|. ..+-.++|. T Consensus 1 ~~~~~~~~~~~~g~~~d~~p~~lp~~a~s~~~N~~~~-~~~~~~~~g~~pv~a~~~~~----~~g~~~~~-~~g~~~~~~ 74 (513) T protein:vir:88 1 MALERQEVKNPTGIVTDIAPADLPLDKWSFGNNVRFK-NGKAQKALGHSPIFDTAQAP----ILDMFPFI-RNNIPYWLL 74 (513) T ss_pred CCcCChhhcccccceeccChhhcCCCcceeeeeeeEe-cceeeecCccceeeecCCCC----ceeeeeee-cCCCeEEEE Confidence 8887777777677775554444423444666777765 68889999998874321111 12245553 456677777 Q ss_pred EeCCeEEEEecCCcEEEEEECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEeccc Q lcl|NC_015271. 81 FTGTGIRVFDLAGNERQVRYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINVRGG 160 (795) Q Consensus 81 ~~~~~~rv~~~~g~~~~v~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v~~~ 160 (795) .+...++.++ +...+ .+ ...+|..+. -...+|+|-+|+++++|...+|+...- .. T Consensus 75 ~~~~~~~~~~--~~t~~-dv--s~~~~~~~~--~~~w~~~~f~~~i~a~ng~~~~q~~~~----~s-------------- 129 (513) T protein:vir:88 75 CSEKRLYLAD--GTTII-DV--SPGPYSASV--TNRWSVGSFNGVIFANDGVNPPHHLPP----TE-------------- 129 (513) T ss_pred eeceEEEEec--Cceee-ec--cccceeecc--cCceeeeeecCEEEEEcCCCcceEEcC----CC-------------- Confidence 7776655554 22111 11 113342111 112445555555555443333322100 00 Q ss_pred ccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhcccccCceeeeecCceEEEEecCCcceeeEEEecCc Q lcl|NC_015271. 161 QYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQCRVSAPGWTFNVGQGYIHIIAPEGQQIDSLTTKDGY 240 (795) Q Consensus 161 ~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~~~~~~~~dg~ 240 (795) . T Consensus 130 --------------------------------------------------------------------~----------- 130 (513) T protein:vir:88 130 --------------------------------------------------------------------S----------- 130 (513) T ss_pred --------------------------------------------------------------------c----------- Confidence 0 Q ss_pred CcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeEEEEeecCce Q lcl|NC_015271. 241 ADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALVRAADGN 320 (795) Q Consensus 241 ~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~~~v~~~~~t 320 (795) .+++||.. | T Consensus 131 -------------~f~dl~g~------------------------------------------------p---------- 139 (513) T protein:vir:88 131 -------------VFRVLPNF------------------------------------------------P---------- 139 (513) T ss_pred -------------eeeeccCC------------------------------------------------C---------- Confidence 00000000 0 Q ss_pred eeecccCCccccCCcccccccccccCCCccEEEEEcceEEEecC--------CeEEEEecCCcc----ccccccccccCC Q lcl|NC_015271. 321 FELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSG--------ENIILSRTAKYF----NFYPASIATLSD 388 (795) Q Consensus 321 ~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~--------~~v~~Sr~gd~~----nF~~~t~~~~~D 388 (795) ..| .-..|.+|++||++++. +.|+.|..+|.. .|..+. ...+ T Consensus 140 -----~~~-------------------~a~~i~v~~~flv~~~~t~~~~~~PnrV~wS~~~D~~~~P~~W~~t~--~t~~ 193 (513) T protein:vir:88 140 -----ANT-------------------TFRRLKSFKNFLIGLNVTSNSIEMPQMVWWSTSADAGGVPASWDPTD--PTKD 193 (513) T ss_pred -----ccc-------------------ceEEEEEEeeEEEEeecccCcCCCCceEEEecccCCccccccccccc--ccCc Confidence 000 11356778888888642 469999999963 342221 1122 Q ss_pred CCcEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEe-CCccccccceEEEEEEe-ecCcCCCCcEEeCCeEEEEecCC Q lcl|NC_015271. 389 DDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLT-ASGTLTSRSIELNLTTQ-FDVQDRARPFGIGRNVYFASPRS 466 (795) Q Consensus 389 dD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~-~~~~lTP~~~~~~~~s~-~~~~~~~~Pv~vg~~v~fv~~~g 466 (795) .+=.++ .+....|...+.....|+||++.+-|.++ .++ |....++.... -+|.+.-.=+.+|+.++|+.+.| T Consensus 194 a~~~~l---~d~~g~~v~g~~~g~~liif~e~~i~~m~y~g~---~~if~~~~i~~~~G~~~p~SI~~~~~~~ffls~~G 267 (513) T protein:vir:88 194 AGQNTL---ADTNGAIVDGVKLRDSFIIYKEDSVYSMRYIGG---LYIFQFQQLFNDVGILGPNCAIEFDGNHFVVGHGD 267 (513) T ss_pred cccccc---CCCccceeeeeecccceEEEecccEEEEEecCC---CceEEEEeecccccccCCceeEEECCeEEEEeCCc Confidence 222222 33344556677778899999999999996 332 23344444433 23333333488999999999987 Q ss_pred CeeEEEEEEeeccccCceehhhHHHHHHHhc-----CCCcEEEEEeCCCCe-EEEEEEcC-C-------CEEEEEEEeeC Q lcl|NC_015271. 467 SFTSIHRYYAVQDVSSVKNAEDITAHVQNYI-----PNGVFDICGSSTENF-CAVLSQGD-Q-------SKIFMYKFLYL 532 (795) Q Consensus 467 ~~~~v~~~~~~~~~~d~~~~~dls~~~~hl~-----~g~~~~~~~~~~~~~-~~~~~~~~-d-------g~l~~~ty~~~ 532 (795) ++ ++ +..+-...+. ..++++| +.....+...-.+.. -++|+-.+ + ..+++|-|+ T Consensus 268 ----f~-~~--~G~~~~~Ig~---ekVdk~f~~~~n~~~~~~~~~~~d~~~~~v~~~y~s~~~~~~~~~~~~lVYd~~-- 335 (513) T protein:vir:88 268 ----VY-VH--NGVQKQSVID---AQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSEPGKHCDRAIIWNWK-- 335 (513) T ss_pred ----eE-Ee--cCceeeeccc---chhhhhhhccCCcccceEEEEEEcCcccEEEEEecCCCCCCCcccceEEEEEcc-- Confidence 22 22 2111111111 1222222 122223333333333 23333111 0 245666653 Q ss_pred CCceeEEeeEeeecCCCeEEEEEEEeCCEEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCccccccc Q lcl|NC_015271. 533 NEELRQQSWSHWDFGSNVQVLACQCINSDMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDT 612 (795) Q Consensus 533 ~~eq~v~aW~~w~~~g~~~~~~~~~~~d~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~ 612 (795) . ..|+.-+.+..+ ..+...-+.+.......... + .|... T Consensus 336 --~---~~Ws~~~~p~~~--~g~~g~~~~~~~~~~~~~~~----------~-----------~d~~~------------- 374 (513) T protein:vir:88 336 --E---NTWSIRDLPNVL--SGAYGIIDPKTSNLWDDDSN----------P-----------WDTDT------------- 374 (513) T ss_pred --C---CeEEEEeccchh--hcccccccccccceeccccc----------c-----------cccch------------- Confidence 1 256655544321 00100001111111100000 0 01000 Q ss_pred ccceeecccccCCcccC--ceEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCC Q lcl|NC_015271. 613 FTTTLHLPTIYGADFAK--GKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDN 690 (795) Q Consensus 613 ~~t~~~~~~~~gl~~~~--g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g 690 (795) +....++.... --.+...++|.++... ..+ -.-|-++++.++...+.+.+ T Consensus 375 ------~~~~~~~~~~~~~sl~~~~~~~~~~~~fd--~~~-----------------~f~G~~lea~~~t~~~~~~~--- 426 (513) T protein:vir:88 375 ------SVWGEGSYNPAKSSMIFTSFQDAKLFLFG--ETS-----------------TFSGQSFTSTLERSDIYLGD--- 426 (513) T ss_pred ------hhhhccccccccceeEeeeccCCceeeec--ccc-----------------cccCCceEEEEEecCccccC--- Confidence 00000100000 0011222333322110 000 13467788888887776521 Q ss_pred ccceecccccc-EEEEEEEEEeeccceEEEEecCCcccccccccccccccccccccccccc-cceEEEEeeecccceEEE Q lcl|NC_015271. 691 DGSTSTEDIGR-LQLRRAWVNYEDSGTFDIYVENQSSNWKYTMAGARLGAHVMRTGKLNLG-TGQYRFPVVGNAKFNTVF 768 (795) Q Consensus 691 ~~~~~~~~~gr-l~l~~~~~~~~~t~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-tg~~~vp~~~~~~~~~v~ 768 (795) .++ ++|+++...+...+.+.+.+......... .....+.... ..+..++++...+-.+++ T Consensus 427 --------~~~~~~i~~v~~~~t~~g~~t~~vg~~~~~~~~----------~~~s~~~~~~~~~~~~~~~r~~gRy~~~r 488 (513) T protein:vir:88 427 --------DRMMKTVSAVIPHITGNGVCNIWVGNAQVQGSG----------IRWKGPYPYRIGQDYKIDTKHVGRYIALK 488 (513) T ss_pred --------chhheeeeeeeeeeecceEEEEEEeeeccCccc----------cccccceeeecccCceEEeccCCceEEEE Confidence 123 36777777777777776665443221110 0000111111 123445666777888888 Q ss_pred EEECCCCCEEEEEEEEEEEEeccccC Q lcl|NC_015271. 769 ILSDATTPLNIIGCGWEGNYLRRSSG 794 (795) Q Consensus 769 i~~~~P~P~tvlsi~~eg~y~~r~rr 794 (795) |+...--|+++.++++|..--. .|| T Consensus 489 i~i~~~~~w~~~G~~ve~~~~~-g~R 513 (513) T protein:vir:88 489 FDFASAGDWYFNGYTLEMAPKA-GMR 513 (513) T ss_pred EEccCCCceEEeeEEEEEecCC-CCC Confidence 8888899999999999887531 233 No 32 >protein:vir:105563 Length: 396 # NCBI annotation: hypothetical protein # Family: family:all:27455 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164316;genbank:gi:56692963;genbank:GeneID:3197174 Probab=95.28 E-value=0.0024 Score=34.96 Aligned_cols=380 Identities=11% Similarity=-0.013 Sum_probs=150.8 Q ss_pred CCceeeechhhccc--cccCCcHH----HhhhhhhhhhcceeeccCCceeCCchHhhhhhcCCCCcccCcEEEEEEeCCC Q lcl|NC_015271. 1 MALISQSIKNLKGG--ISQQPDIL----RYPDQGSQQVNGWSSESEGLQKRPPMVFLKTLGGSDTLGPAPYIHLINRDES 74 (795) Q Consensus 1 M~~v~~~~~n~~~G--vS~q~d~~----ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~l~~~~~~~~~~~l~~f~~~~~ 74 (795) |+.+. +--|+|= |+.-.+++ --..-+|++.|+=.++.|=++||-|.+-+......+. -..-++-+.|... T Consensus 1 ~~~~~--~~~~~ginnv~~e~~l~~~~~~~~~~~r~a~nvdi~~~G~~~~r~~~tr~~~g~l~~~--~~~~~~~~~~~~~ 76 (396) T protein:vir:10 1 MATTS--LVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQL--WQSPLHGDAFGAL 76 (396) T ss_pred Cccee--eeeeecccccccccccccCCCcccceeeeeeeecccCCCchhhhccCcccCCceeccc--ccCccccceeeeC Confidence 88763 3222221 44433444 3556799999999999999999999887754222110 0011222222223 Q ss_pred ceEEEEEeCCeEEEEecCCcEEEEEECCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccE Q lcl|NC_015271. 75 EQYYAVFTGTGIRVFDLAGNERQVRYTTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDAL 154 (795) Q Consensus 75 ~~y~l~~~~~~~rv~~~~g~~~~v~~~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~ 154 (795) +.-+..+.++.-++|- -+.+ ...-+.+.+..|-+|.++...+-.+. .. T Consensus 77 ~~tl~~~~~~~w~~~~------~v~v------------~~~pva~d~~~~Rvy~t~~~~p~~~~----~~---------- 124 (396) T protein:vir:10 77 GDQWGKVDPHSWTFEP------LAQI------------GEGDLSHEVLNNRVCVAGTAGIFTYD----GA---------- 124 (396) T ss_pred CceEEEEeCCeEEEEe------eeee------------ccCchhccccCCeEEEEcCCCceeee----CC---------- Confidence 3333333322222221 0000 11224456667777877744442221 00 Q ss_pred EEecccccCeEEEEEECCceeEEEEecCCCCcccccccchhHHhHhhhhhcccccCceeeeecCceEEEEecCCcceeeE Q lcl|NC_015271. 155 INVRGGQYGRTLQIIINGNTQATYQIPDGSQPEHVNNTDAQWLAEELARQCRVSAPGWTFNVGQGYIHIIAPEGQQIDSL 234 (795) Q Consensus 155 ~~v~~~~~~~ty~vt~~g~~~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~~~~~ 234 (795) ..|.+.+. +|.. .+.... .| +. ..+.+.|+... + T Consensus 125 ---------~~y~L~vp--------~P~~-a~~~a~-------------------~G-sl-~~~~~~Y~~t~-------V 158 (396) T protein:vir:10 125 ---------QAERLTLD--------TPAP-PLLVAG-------------------AG-SL-SQGTYGAAVAW-------L 158 (396) T ss_pred ---------cceecCcC--------CCcc-cccccc-------------------cC-cc-CCceEEEEEEE-------E Confidence 01111110 0000 000000 00 00 00111111100 0 Q ss_pred EEecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeEEEE Q lcl|NC_015271. 235 TTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALV 314 (795) Q Consensus 235 ~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~~~v 314 (795) .+.+....+......++ ...|..+++..-.+...+...+.....+|. .. +-.+.+| T Consensus 159 --~~~gEEs~p~~~S~~v~--------~~gg~~vtl~~~~~~~i~~~RiYrS~~~G~---------~~-~l~aE~~---- 214 (396) T protein:vir:10 159 --RGPQESAPSLIAFAEVT--------DAGALEVTFPLCLDASVTGARLYLTRANGG---------EL-LLAGDYP---- 214 (396) T ss_pred --ecCCCcCcccccccccC--------CCCCcEEEEEcccCCCcceEEEEEeCCChh---------hh-hheehhc---- Confidence 00000000000000000 011112222211111111111100111110 00 0001111 Q ss_pred eecCceeeecccCCccccCCcccccccccccCCCccEEEEEcceEEEecCCeEEEEecCCccccc-cccccccCCCCcEE Q lcl|NC_015271. 315 RAADGNFELKRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSGENIILSRTAKYFNFY-PASIATLSDDDPID 393 (795) Q Consensus 315 ~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~-~~t~~~~~DdD~i~ 393 (795) ....+|.++..+|.....-...=-|.|+. .-+++|.+||+++.++.||+|...-++=++ ++.-+ T Consensus 215 -a~~~s~vlPs~~w~gpP~~~~gL~pmP~G-----~~~A~faGRi~~A~Gn~V~FSEp~~Ph~~~~~~~~~--------- 279 (396) T protein:vir:10 215 -LGAATVILPTLPELGRPAQFRHLSPMPTG-----KHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFV--------- 279 (396) T ss_pred -cceeeeeeecCCCCCCCccccccccCchh-----HhhhhhcceEEEEeCCEEEEecCCCCceecchhccC--------- Confidence 11224445556775432111111233321 258899999999999999999999874222 22111 Q ss_pred EEEcCCCceeEEEEeecCCcEEEEecCcEEEEeCCcc--ccccceEEE---EEEeec----CcCCCCcEEeCCeEEEEec Q lcl|NC_015271. 394 VAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGT--LTSRSIELN---LTTQFD----VQDRARPFGIGRNVYFASP 464 (795) Q Consensus 394 ~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~--lTP~~~~~~---~~s~~~----~~~~~~Pv~vg~~v~fv~~ 464 (795) .. ...|.-+.+...+|+++|+++-|.+.|.++ |+.+..... ++|..- +.+.-..+..+..++|.++ T Consensus 280 ---~~--~~~Iv~lapv~~gL~Vgt~~~~y~~~G~dP~sms~~~l~~~~pvp~S~v~~p~~~~s~rs~~~~~~~~lwas~ 354 (396) T protein:vir:10 280 ---QM--PQRITFVQPVDGGIWVGQVDHVAFLDGADPASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLAE 354 (396) T ss_pred ---CC--CCceEEEEEecCeEEEEEcCcEEEEEcCChhHcceeecccCCCcccchhcccchhhhcccccccCcEEEEccC Confidence 11 123566778889999999999999998653 443333211 111100 0122333456889999999 Q ss_pred CCCeeEEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeC Q lcl|NC_015271. 465 RSSFTSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLYL 532 (795) Q Consensus 465 ~g~~~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~ 532 (795) .|=+ -- . .++- +..+....+.+...... -+ +..|++.+++- + T Consensus 355 dGl~----~g----~-~~G~----v~~l~~~~i~p~~~~A~---------~~-~~~drRy~~~~---~ 396 (396) T protein:vir:10 355 NGYV----MG----T-SSGA----IAEVHAGVLAGITGRAG---------TS-VVFDRRLLTAV---S 396 (396) T ss_pred CcEE----EE----c-CCce----eeeecccccCCCcccce---------EE-EeecCeEEEEe---C Confidence 8822 11 0 1111 11111122222211111 01 11233332211 0 No 33 >protein:vir:95324 Length: 823 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512269;genbank:gi:89152436;genbank:GeneID:3952993 Probab=89.94 E-value=0.024 Score=29.51 Aligned_cols=628 Identities=10% Similarity=0.040 Sum_probs=180.3 Q ss_pred hhcce--eeccCC-----ceeCCchHhhhh-hcC-------C-CCcccCc--EEE-EEEeCCCceEEEEE--eCCe--EE Q lcl|NC_015271. 31 QVNGW--SSESEG-----LQKRPPMVFLKT-LGG-------S-DTLGPAP--YIH-LINRDESEQYYAVF--TGTG--IR 87 (795) Q Consensus 31 ~~N~~--~~p~gG-----l~rRpGt~~v~~-l~~-------~-~~~~~~~--~l~-~f~~~~~~~y~l~~--~~~~--~r 87 (795) |.--+ ..-.|| |..|.=...... ++. . .....++ +++ .........-++.| .... +- T Consensus 1 m~i~~~q~sF~~GElsP~l~gR~Dl~ry~~q~~~~~N~~~~~~GGl~rRpGt~fva~~~~~~g~~rLipf~~s~~q~y~L 80 (823) T protein:vir:95 1 MAISWIQPSFAGGEIGPSLYGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPNRKCRLIPFQFSTVQTYAL 80 (823) T ss_pred CcceeechhccCceechheeccchHHHHHHHHhhhhCcEeeecCCceecCchhhhhhhcCCCCCeeEEEEEeCCCcEEEE Confidence 22111 233678 888876654321 100 0 0000011 100 00011111122211 1111 11 Q ss_pred EEecCCcEEEEEE------CCCcccceecCCchhheeEEEEcCEEEEEeCCcccEEEEecccCCCCCCCcccEEEecccc Q lcl|NC_015271. 88 VFDLAGNERQVRY------TTDGSTYINTNNPRNDLRMVTVADYTFIVNRNVRVTRDTNSVNLAGFNPKQDALINVRGGQ 161 (795) Q Consensus 88 v~~~~g~~~~v~~------~~~~~~yl~~~~~~~~l~~~q~aD~~~i~~~~~~p~~~~r~~~~~~~~~~~~~~~~v~~~~ 161 (795) +| .+.-..+-- ..+..+|..+.+ +...+..|.-|.--.++. .+.. .+ -++..+ .+.+. T Consensus 81 ef--g~~~irV~~~~g~vv~~~~~~~ev~tP----y~~~~l~~Lr~~qsaD~~--fivh-~~-----~~p~~L--~r~~~ 144 (823) T protein:vir:95 81 EF--GHQYMRVIKDGALVLNSSNVIYEIATP----YTEADLFRIKFTQSADVL--TLVH-PA-----YPPKEL--RRYAH 144 (823) T ss_pred EE--cCCeEEEEeCCcEEEecCCceeEEecc----cccccccceeEEEeccEE--EEEc-CC-----ccceEE--EecCC Confidence 11 111011100 000111111110 000111121121111110 0000 00 000000 00111 Q ss_pred cCeEEEEE-E-CCc---e---eEEEEecCCCCcccccccchhHHhHhhhhhcccccCceeeeecCceEEEEecCCcceee Q lcl|NC_015271. 162 YGRTLQII-I-NGN---T---QATYQIPDGSQPEHVNNTDAQWLAEELARQCRVSAPGWTFNVGQGYIHIIAPEGQQIDS 233 (795) Q Consensus 162 ~~~ty~vt-~-~g~---~---~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~~~~ 233 (795) .+...... + .+. . .....++++...... +.+....+.....+..+++.......... T Consensus 145 ~~w~l~~~~~~~gp~~~~~~~~t~~v~~~~~~~~~t---------------~ta~~~~~~~d~vg~~~~l~~~~~~~~~~ 209 (823) T protein:vir:95 145 DNWQLVDVVTKNGPFEDINIDESLTVYASASTGTIT---------------LTASASIFGAEQVGKLFYLEQPAVDSVPV 209 (823) T ss_pred CCceEEEEEEeccccccccccceeEEeccccCceeE---------------EeecccccchhhccceEEEeccccceeee Confidence 11000000 0 000 0 000011111111000 01112223333334444443222111110 Q ss_pred EEEecCcCcccceeEEEeccceeecccccCCCeEEEEEcCCCCCcceEEEEEee--cCceEEEeeeeeEeeeEeccceeE Q lcl|NC_015271. 234 LTTKDGYADQLINPVTHYAQSFSKLPTNAPEGYVVKIVGDASKSADQYYVRYDT--TRKVWSETLGWNVNDQLLFETMPH 311 (795) Q Consensus 234 ~~~~dg~~~t~~~~~~~~v~~~~~l~~~~~~G~~v~v~~~~~~~~~~yy~~~~~--~~~~w~E~~~~~~~~~~~~~t~p~ 311 (795) ...+ ..+.+.... .. ..+.......|....+.+..... .+|+.+.. .+..|.|..- T Consensus 210 -~~~~--~~~~~~~~~-~~--~~~~~~~~~~~~~g~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~-------------- 267 (823) T protein:vir:95 210 -WETS--KSTSIGDIR-RA--DSNYYRAVTAGKTGTLRPSHTEG--TSWDGWGGSGDDDTGIEWEY-------------- 267 (823) T ss_pred -ccee--eeecccceE-Ee--cccceeeeeccccceeecccCCc--ceEEeceecccccceeEEEE-------------- Confidence 0000 000000000 00 00111111111111111111111 12222111 1111222111 Q ss_pred EEEeecCceeeecccCCc-------cccCCcccccccccccCCCccEEEEEcceEEEecCCeEEEEecCCcccccccccc Q lcl|NC_015271. 312 ALVRAADGNFELKRIEWS-------PKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSGENIILSRTAKYFNFYPASIA 384 (795) Q Consensus 312 ~~v~~~~~t~~~~~~~w~-------~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~ 384 (795) .+...+.++....... .........+..++|. ++ + ..|- + .|= T Consensus 268 --~~~~~g~~~~t~v~~~~~~~~~~~~~~~~~~~~~~~t~~---------~~-~-------~~~~----~-~~g------ 317 (823) T protein:vir:95 268 --LHSGFGIARITAVNGTTATAEVISYIPSQVVGEDNASYK---------WA-K-------YAWN----S-VNG------ 317 (823) T ss_pred --EeCCcceEEEEeecceeeeceEeeeeccccccCCcCCcc---------cc-c-------cccC----c-CCC------ Confidence 1111111111110000 0000000000000000 00 0 0010 0 110 Q ss_pred ccCCCCcEEEEEcCCCceeEEEEeecCCcEEEEecC--cEEEE-e--CC-------ccccccc-eEEEEEEeecCcCCCC Q lcl|NC_015271. 385 TLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDE--AQFVL-T--AS-------GTLTSRS-IELNLTTQFDVQDRAR 451 (795) Q Consensus 385 ~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~--~q~~i-~--~~-------~~lTP~~-~~~~~~s~~~~~~~~~ 451 (795) -|-. +.-+...|+++... -|+++ + |+ .+++... +.+...+. -.+.++ T Consensus 318 -----~Ps~-------------v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~~~~DdD~I~~~~s~~--~~~~i~ 377 (823) T protein:vir:95 318 -----YPGT-------------VVYYQQRLYFAASTAFPQTIWASRTGDYKDFGKSNPTQDDDRIIYTYAGR--QVNEIR 377 (823) T ss_pred -----CccE-------------EEEEeceEEEEEcCCCCcEEEEeccCCccccccccCCCCCCcEEEEEcCC--cceEEE Confidence 1110 11222234333321 13332 1 11 1222211 22222221 122233 Q ss_pred cEEeCCeEEEEecCCCeeEEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEee Q lcl|NC_015271. 452 PFGIGRNVYFASPRSSFTSIHRYYAVQDVSSVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQSKIFMYKFLY 531 (795) Q Consensus 452 Pv~vg~~v~fv~~~g~~~~v~~~~~~~~~~d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~ 531 (795) =++-.+.++....++.+ . ......+......+++-..-.+ | ...+-... -...++++.+....+.-|.|-. T Consensus 378 ~~v~~~~Lli~t~~~e~----~--l~~~~~~~lTP~~~~~~~~s~~-g-~~~~~Pv~-vg~~~~Fv~~~g~~vre~~~~~ 448 (823) T protein:vir:95 378 HLIDVGSLVALTSGGEY----V--ITGDQNKVLTPSSFAFSSQGSN-G-SSNVPPIA-VANIALFVQEKGSVVRDLAYSF 448 (823) T ss_pred EEeecCcEEEEecCcEE----E--EEcCCCcccceeeEEEEEeecc-c-cccccceE-eCCeEEEEecCCCEEEEEEEee Confidence 33334455555655543 1 1122222222222221100001 1 00000001 1123445555555565555543 Q ss_pred CCC---ceeEEeeEeeecCCC-eEEEEEEEeCCEEEEEEEeCCCEEE---------------------EEEE-------- Q lcl|NC_015271. 532 LNE---ELRQQSWSHWDFGSN-VQVLACQCINSDMYVILRNEFNTFL---------------------TRVS-------- 578 (795) Q Consensus 532 ~~~---eq~v~aW~~w~~~g~-~~~~~~~~~~d~l~~~v~R~~~~~~---------------------~r~~-------- 578 (795) ... -+++.-=+.|-+.|. +..+|.....+.+.++++-++.... +.+. T Consensus 449 ~~d~~~~~dlT~~a~hl~~~~~i~~~a~~~~p~~~~~~v~~dG~l~~~ty~~~q~v~aW~~~~~~g~~~~~~~i~~~~~d 528 (823) T protein:vir:95 449 DVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDGKLLVMTYLRDQQVFAWAPQSSTGKYESTCSISEGNED 528 (823) T ss_pred ecCceecchhhhhhhhhcCCCceEEEEEecCCCeEEEEEecCCcEEEEEEecccceeeeEEEecCCcEEEEEEecCCCCC Confidence 221 112222234544432 1112211111222112222221111 1110 Q ss_pred ----EeeccccCCCCcceeeeee------eeeEeecCc-ccccccccce--eecc-cccCCcccCceEEEEecCCccccc Q lcl|NC_015271. 579 ----FTKSTVDLQGEPYRAFMDM------KIRYMIPNG-TYNDDTFTTT--LHLP-TIYGADFAKGKITVLEADGKITEF 644 (795) Q Consensus 579 ----~~~~~~~~~~~~~~~~lD~------~~~~~~~~~-~~~~~~~~t~--~~~~-~~~gl~~~~g~~v~~~adg~~~~~ 644 (795) ..+...+.....+.-.++. ...+.++.+ +|+.. +... ..+. ....+.|++|++|.+ +||...+. T Consensus 529 ~l~~~v~R~i~g~~~~yiE~~~~~~~~~~~~~~~lD~~~s~~g~-~~~~~~~~l~~g~~~l~~l~g~~v~~-adg~~~~~ 606 (823) T protein:vir:95 529 AVYFVVNRTVNGQTVRYIERLSSRLFTSDEDAFFVDSGLSYDGR-NTSDRTMTITGGSGEWDYLAEYTISV-SGGAYFTS 606 (823) T ss_pred EEEEEEEeccCCeEEEEEEeeccccCCCccceeEEEEEEEeecC-cccceeeEecCCCCcccccCceEEEe-cCcceECC Confidence 0000011110001001111 111222222 23322 2211 1221 123478999999876 88888777 Q ss_pred ceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEEEEEEeeccceEEEEecCC Q lcl|NC_015271. 645 EEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRRAWVNYEDSGTFDIYVENQ 724 (795) Q Consensus 645 ~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~~~t~~~~v~v~~~ 724 (795) ..+ .+ +|.++++++.+++|++|++.++++++++.++. ..... .+ ++....++...+.+ .++... T Consensus 607 ~~v-~g-------~i~l~~~~~~~~vGl~~~~~i~~~~~~v~~~~-a~~~~---~~--r~v~a~l~~~~t~~--~~~~~~ 670 (823) T protein:vir:95 607 SDV-GA-------QLQFPYTGADPDTGYEVSKELRCDIISVTSNT-AVVVR---AN--RNVPPSLRNVATTN--WQMARR 670 (823) T ss_pred ccc-ee-------EEEeCcCCCccccccceEEEEEEeeceeeCCc-eEEEc---cC--Ccccceeeeeeccc--cccccc Confidence 666 34 78889999999999999999999999886432 11111 01 22233333322222 221111 Q ss_pred cccccccccccccccccccccc--cc--cccceEEEEeeecccceEEEE---EECCCCCEEEEEEEEEEEEeccccCC Q lcl|NC_015271. 725 SSNWKYTMAGARLGAHVMRTGK--LN--LGTGQYRFPVVGNAKFNTVFI---LSDATTPLNIIGCGWEGNYLRRSSGI 795 (795) Q Consensus 725 ~~~~~~~~~~~~~~~~~~~~~~--~~--~~tg~~~vp~~~~~~~~~v~i---~~~~P~P~tvlsi~~eg~y~~r~rrv 795 (795) ...-....+|+.+.- +.+|. ++ +..|.+++|.. .....+-| .--.|||+.+. +.|..--|.||| T Consensus 671 ~~~gL~hleg~tv~v--~~dg~~~~~~~v~~G~vtl~~~--~~~v~vGl~~~~~~~~l~~~~~---~~g~~~g~~~ri 741 (823) T protein:vir:95 671 TFGGLSHLEGQTVNI--LSDANVEPQKVVSGGAVTLESP--GAVVHIGLPITAEFETLDININ---GQETLLDKKQVI 741 (823) T ss_pred eeeeccccccceEEE--EEcCeeeCCeEecCCEEEecCC--CCEEEEeecceeeEEecchhcC---CCcccCCceeEE Confidence 111112333433321 12222 22 23577766642 22333322 12256776644 357777888888 No 34 >protein:vir:3529 Length: 477 # NCBI annotation: P28 # Family: family:all:1540 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050989;genbank:gi:9633575;genbank:GeneID:1262322 Probab=89.22 E-value=0.028 Score=29.13 Aligned_cols=430 Identities=12% Similarity=0.105 Sum_probs=153.1 Q ss_pred hhhhhcccccCceeeeecCceEEEEecCCcceeeEEEecCcCcccceeE-EEeccceeecccccCCCeEEEEEcCCC-CC Q lcl|NC_015271. 200 ELARQCRVSAPGWTFNVGQGYIHIIAPEGQQIDSLTTKDGYADQLINPV-THYAQSFSKLPTNAPEGYVVKIVGDAS-KS 277 (795) Q Consensus 200 ~l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~~~~~~~~dg~~~t~~~~~-~~~v~~~~~l~~~~~~G~~v~v~~~~~-~~ 277 (795) -|.+-+ .+-..+....+-.. ....+..+..+.+ + +.+. +..-.+...|++ ..|...+..+++- .+ T Consensus 1 ~~~~~~---m~~~~ipl~~g~~~-~~~~~d~~~~~PV-----N--~~a~p~~~~~s~~~L~~--~pG~~~~~~~~G~~RG 67 (477) T protein:vir:35 1 MLSEVF---MPKIQIPLAKGLVK-DIKTADYIDALPV-----N--MLATPKEVLNASGYLRS--FPGIEKKQDAKGVSRG 67 (477) T ss_pred Ccccce---eeeecccccccccc-ccccccceeeeee-----c--cceeecccccccccccc--CCcceeeccCCccccc Confidence 110000 00000110000000 0000000111100 0 0000 000001111111 0111111110000 00 Q ss_pred ------cceEEEEEeecCceEE------EeeeeeEeeeEeccceeEEEEeec-CceeeecccCCccccCCcccccccccc Q lcl|NC_015271. 278 ------ADQYYVRYDTTRKVWS------ETLGWNVNDQLLFETMPHALVRAA-DGNFELKRIEWSPKTCGDDDTNPWPSF 344 (795) Q Consensus 278 ------~~~yy~~~~~~~~~w~------E~~~~~~~~~~~~~t~p~~~v~~~-~~t~~~~~~~w~~~~~gd~~~np~psf 344 (795) ....|+ ..++..|+ +.++-+.+.-..+++.- .++... ..-|..+..++..+. ...+.+|+| T Consensus 68 ~~~~~~~g~lY~--V~G~~LY~v~~~vG~I~gsg~VsMa~n~~~~-aIv~~g~~~gy~y~~t~~~~~~---~~~~~~p~~ 141 (477) T protein:vir:35 68 VHFNTKNNALYR--VCGNTLYRNDKEVADIAGMSRVSMSHSSHSQ-AICFEGKVKLYRYDGTEKALSN---WPKDKYPQY 141 (477) T ss_pred eeEeecCCeEEE--EecCeeEeeeeeeeeecccccEEEeeCCcEE-EEEECCcceeEEEecccceeee---cCccccCCc Confidence 000111 11222222 22222222222222211 122111 112333222222211 112235666 Q ss_pred cCCCccEEEEEcceEEEecC--CeEEEEecCCccccccccccccCCCCcEE-EEEcCCCceeEEEEeecCCcEEEEecCc Q lcl|NC_015271. 345 MDSTINDVFFFRNRLGLLSG--ENIILSRTAKYFNFYPASIATLSDDDPID-VAVSTNRIAILKYAVPFSEELLIWSDEA 421 (795) Q Consensus 345 ~~~~~~~v~f~q~RL~f~~~--~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~-~~~~~~~~~~i~~~v~~~~~L~l~t~~~ 421 (795) ....+..|+|..+|++|.-+ +.++.|-.-|-.- -|+++ +..+..+++.|.-++...+.|++|.+.. T Consensus 142 ~l~~~~~v~f~dGyfV~~~~gt~~~~iS~L~d~s~-----------~d~~~~FasAE~~pD~Ivgi~~~~~~i~lfG~~T 210 (477) T protein:vir:35 142 DLGEVIDVCRNRGRYIWLQKGGERFGVTDLEDESK-----------PDRYQPFYRAESQPDGIVSVDAWRDLIVCFGSSS 210 (477) T ss_pred cccceeEEEeeCceEEEeecCCCeEEEeecCCccc-----------cccccccccccCCCCceEEEEeeccEEEEEeccc Confidence 66777789999999887643 4455564333222 34555 5567778888888999999999997766 Q ss_pred E--EEEeCCcccc-c-cceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCee-EEEEEEeeccccCceehhhHH-HHHHH Q lcl|NC_015271. 422 Q--FVLTASGTLT-S-RSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSFT-SIHRYYAVQDVSSVKNAEDIT-AHVQN 495 (795) Q Consensus 422 q--~~i~~~~~lT-P-~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~~-~v~~~~~~~~~~d~~~~~dls-~~~~h 495 (795) - |..+|+..++ | ...+-..+-..+|++.-.=..+|++++|+......- .+++ .++|+++=|| .-++. T Consensus 211 iEvw~ntG~a~f~~p~~r~~~~~mIq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~-------~~g~q~~rIST~aIE~ 283 (477) T protein:vir:35 211 IEYFTLTGSADTSQPLYIHQAAYMIQAGIAGRDCKCRYQDKYAILSHQSTGQPAVYL-------IGAGEKNKISTATIDK 283 (477) T ss_pred eEEEEecCCCCCCcceeecCCceeeeecccCchhhhhhCceEEEEecCCCcccEEEE-------ccCceeEEecCHHHHH Confidence 5 7788875554 2 111111112357877666689999999998864322 2333 3345565553 33444 Q ss_pred hcCC------CcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecC---CCeEEEEEE---------- Q lcl|NC_015271. 496 YIPN------GVFDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFG---SNVQVLACQ---------- 556 (795) Q Consensus 496 l~~g------~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~---g~~~~~~~~---------- 556 (795) .|+. ....+++++.+.+.+.....-+- -+| |.-..+ +..-.|+--... ......+++ T Consensus 284 ~i~ay~~~e~a~af~~t~~~eGH~fy~LtfP~~-Tw~--yD~at~-~w~e~W~~~~~g~~~~~~Ra~~~~~~~g~~~vGD 359 (477) T protein:vir:35 284 IIRYYSADELAASFMESIRFDNHELLLLHLPKH-TLC--FDGSAS-HQYSQWSLLKSGFYDEPYRAIDFMFFDNQITVGD 359 (477) T ss_pred HHHhcCCcchhceeEEEEEeCCeeEEEEEcCCc-eEE--Eecccc-cccceeeeeccCCccCceEEEEEEEeCCeEEEEE Confidence 4433 11234566677765544444443 223 322121 111124332222 222222322 Q ss_pred EeCCEEEEEE---EeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeeccccc-CCcccCceE Q lcl|NC_015271. 557 CINSDMYVIL---RNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIY-GADFAKGKI 632 (795) Q Consensus 557 ~~~d~l~~~v---~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~-gl~~~~g~~ 632 (795) ..+..||.+- .+..+-.++++.. +.-...+..|++ |-.+.....-++. ..++-|.+-- |..+ |.. T Consensus 360 ~~ng~l~~ld~~~~~d~g~~i~~~~~---~p~~~~d~~Rv~-~~el~~~tGvgq~-----~d~v~L~~sddG~~~--~~~ 428 (477) T protein:vir:35 360 KKEGVLGHLIFNASNQYEQQTEHLLY---TPMIKADNARLF-DFELEASTGVAQI-----ADKLFLSVTTDGINY--SRE 428 (477) T ss_pred cCCCeEEEECCCCcccCCCccceEEe---cceeeCCCCeEE-EEEEEEecCcCcc-----CceEEEEEecccccc--ccc Confidence 2245566553 3333333444321 111222333433 4333322111110 0011111000 0000 000 Q ss_pred EEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEEEEEEee Q lcl|NC_015271. 633 TVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRRAWVNYE 712 (795) Q Consensus 633 v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~~ 712 (795) ..+ +-|+.+++.+-..= -++- . + +=-||+. .++..+-|.. |+-+..+|. T Consensus 429 ~~~-~~g~~g~~~~r~~~------~RlG--~-~-r~~vgf~--~r~~~~~pv~------------------l~~~~~~~e 477 (477) T protein:vir:35 429 QLI-EQNSPFQYDKRILW------RRIG--R-V-RKNIGFK--IRIITKSPVT------------------LSDLSIRME 477 (477) T ss_pred eee-cCCCccccccceee------eeee--e-c-eeccceE--EEEEecCCce------------------eccceeEeC Confidence 000 11211111111100 0000 0 0 0023433 3333333321 222222222 No 35 >protein:vir:105428 Length: 472 # NCBI annotation: gene 8 protein # Family: family:all:1540 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958184;genbank:gi:41057286;genbank:GeneID:2716675 Probab=87.70 E-value=0.037 Score=28.43 Aligned_cols=423 Identities=11% Similarity=0.046 Sum_probs=153.0 Q ss_pred eEEEEecC--CCCcccccccchhHHhHhhhhhcccccCceeeeecCceEEEEecCCcceeeEEEecCcCcccceeEEEec Q lcl|NC_015271. 175 QATYQIPD--GSQPEHVNNTDAQWLAEELARQCRVSAPGWTFNVGQGYIHIIAPEGQQIDSLTTKDGYADQLINPVTHYA 252 (795) Q Consensus 175 ~a~~ttp~--~s~~~~~~~~~~~~i~~~l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~~~~~~~~dg~~~t~~~~~~~~v 252 (795) -..-..|- +... ....+.++....++-.-.. .......++ +.+..|- . T Consensus 1 m~~~~~pl~~G~~~---~~~~~d~~~~~pVN~~a~~---~~~~~s~~~-------------l~~tPGl------~----- 50 (472) T protein:vir:10 1 MPIQQLPLMKGVGK---DFRNADYIDYLPVNMLATP---KEILNSSGY-------------LRSFPGI------A----- 50 (472) T ss_pred CCeeeeeeccCcee---eccccchhheeeeeeeeec---cCCCcccce-------------eecCCCc------e----- Confidence 11111111 1100 0011111100001000000 000000000 1110000 0 Q ss_pred cceeeccccc-------CCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeEEEEeecCce--eee Q lcl|NC_015271. 253 QSFSKLPTNA-------PEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALVRAADGN--FEL 323 (795) Q Consensus 253 ~~~~~l~~~~-------~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~~~v~~~~~t--~~~ 323 (795) ..+++++.+ ++|-.-.|.++ .-|.+.. .|-+.++-+.+.-.++++..- +...+.. |.. T Consensus 51 -~~a~v~G~~RG~~~~~~~g~lY~V~G~-----~LY~v~~-----~iGsiag~grVsMa~n~~~~a--v~~~g~~~~Y~y 117 (472) T protein:vir:10 51 -KRSDVNGVSRGVEYNMAQNAVYRVCGG-----KLYKGES-----EVGDVAGSGRVSMAHGRTSQA--VGVNGQLVEYRY 117 (472) T ss_pred -eeccCCccccceEEEeeCCeEEEEecc-----eEeeeec-----ceecccCcccEEEecCCcEEE--EEECCceeEEEe Confidence 011222222 12222223221 1222211 244444444443333333211 1111111 222 Q ss_pred cccCCccccCCcccccccccccCCCccEEEEEcceEEEecC--CeEEEEecCCccccccccccccCCCCcEEEEEcCCCc Q lcl|NC_015271. 324 KRIEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSG--ENIILSRTAKYFNFYPASIATLSDDDPIDVAVSTNRI 401 (795) Q Consensus 324 ~~~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~--~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~~~ 401 (795) ....-....-.++..+|..+ -.....|+|...|++|.-. +.++.|-..|-+.+ ++.-.++.+..++ T Consensus 118 d~~v~t~~~~~~d~~~p~~d--lg~~~dv~f~dGyfV~~~~Gt~~~~is~l~d~~~~----------~~y~~fa~AE~~p 185 (472) T protein:vir:10 118 DGTVKTVSNWPTDSGFTQYE--LGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHP----------DRYSAQYRAESQP 185 (472) T ss_pred eccchhhhcccccccccccc--ccceeeeeeecceEEEeccCcceEEEeccCCcccc----------ccccccccccCCC Confidence 11110000001122223222 2345689999999988753 33445655553321 2233345577778 Q ss_pred eeEEEEeecCCcEEEEecCcE--EEEeCCccccccceEEEEEE----eecCcCCCCcEEeCCeEEEEecCCCee-EEEEE Q lcl|NC_015271. 402 AILKYAVPFSEELLIWSDEAQ--FVLTASGTLTSRSIELNLTT----QFDVQDRARPFGIGRNVYFASPRSSFT-SIHRY 474 (795) Q Consensus 402 ~~i~~~v~~~~~L~l~t~~~q--~~i~~~~~lTP~~~~~~~~s----~~~~~~~~~Pv~vg~~v~fv~~~g~~~-~v~~~ 474 (795) +.|.-++...+.|++|.+..- |..+|+. +|+-+-..+++ ..+|++.-.=..+|++++|+......- .+++ T Consensus 186 D~Ivgi~~~~~~i~lfG~~TiEvw~ntG~a--~~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~- 262 (472) T protein:vir:10 186 DGIIGIGTWRDFIVCFGSSTIEYFSLTGAT--TAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISNPATGAPSVYI- 262 (472) T ss_pred CceEEEEeeccEEEEEeccceEEEEecCCC--CcccCceeecccceeeecccCcchhhecCceEEEEecCCccccEEEE- Confidence 888888999999999977665 7777753 22222233322 346777666689999999998863221 2333 Q ss_pred EeeccccCceehhhH-HHHHHHhcCCC----cE--EEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEe-eec Q lcl|NC_015271. 475 YAVQDVSSVKNAEDI-TAHVQNYIPNG----VF--DICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSH-WDF 546 (795) Q Consensus 475 ~~~~~~~d~~~~~dl-s~~~~hl~~g~----~~--~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~-w~~ 546 (795) .++|+++=+ |.-++..|+.. +. .+.+++.+.+.+.....-+ .-+||.-- . .-||. |-. T Consensus 263 ------~~g~~~~rIST~aIE~~i~~y~~~e~~dA~~~t~~~~GH~fy~LtfP~-~Tw~yD~~--t-----~~Wherw~~ 328 (472) T protein:vir:10 263 ------IGSGQVSPIASASIEKILRSYTADELADGVMESLRFDAHELLIIHLPR-HVLVYDAS--S-----SANGPQWCV 328 (472) T ss_pred ------ccCceeEEecCHHHHHHHHhcCCccccceeEEEEEeCCeEEEEEEcCC-ceeEeecc--c-----ccCceeeee Confidence 334566666 33444555443 22 2455666666554444444 33343311 1 13443 222 Q ss_pred C--C----CeEEEEE----------EEeCCEEEEEEEe---CCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcc Q lcl|NC_015271. 547 G--S----NVQVLAC----------QCINSDMYVILRN---EFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGT 607 (795) Q Consensus 547 ~--g----~~~~~~~----------~~~~d~l~~~v~R---~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~ 607 (795) . | .....++ ...+..||.+--. ..+-.++++.. +.-...+..|+ +|..+.....- T Consensus 329 ~~~g~~~~~~Ra~~~~~~~g~~~vGD~~ng~ly~l~~~~~td~G~~i~~~~~---~p~~~~d~~Rv-~d~~ve~~~G~-- 402 (472) T protein:vir:10 329 LKTGLYDDVYRAIDFIYEGNQITCGDKLESVTGKLQFDISSQYGLQQEHLLF---TPLFKADNARC-FDLEVESSTGV-- 402 (472) T ss_pred ecCCCccCceEEEEEEEeCCeEEEEEcCCCeEEEEcccCcCcCCCcceEEEe---ccceeCCCCeE-EEEEEEeecCC-- Confidence 1 1 1122222 2234556655433 22223333321 11222233443 34443322211 Q ss_pred cccccccceeecccccCCcccCc-eEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEE Q lcl|NC_015271. 608 YNDDTFTTTLHLPTIYGADFAKG-KITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIK 686 (795) Q Consensus 608 ~~~~~~~t~~~~~~~~gl~~~~g-~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~ 686 (795) ++-+- ..+.-++||........ +....+ -.|..++....+-.- T Consensus 403 ------------------~~~adp~~~~~~sDg~~~g~~~~-----------~~~~~~-------g~~~~R~~~~RlG~~ 446 (472) T protein:vir:10 403 ------------------AQYADRLFLSATTDGINYGREQM-----------IEQNEP-------FVYDKRVLWKRVGRI 446 (472) T ss_pred ------------------CcccCceEEEeccCCcccchhhh-----------hhhccC-------cccccceeeeeeeec Confidence 11110 11122333211000000 000000 011111111111100 Q ss_pred ccC-CccceeccccccEEEEEEEEEee Q lcl|NC_015271. 687 QVD-NDGSTSTEDIGRLQLRRAWVNYE 712 (795) Q Consensus 687 ~~~-g~~~~~~~~~grl~l~~~~~~~~ 712 (795) ..+ |-.... ....++.|..+++++. T Consensus 447 r~~vgf~~r~-~~~~~v~l~ga~~~~e 472 (472) T protein:vir:10 447 RKNVGFKLRV-ITKSPVTLSGAQIRIE 472 (472) T ss_pred cccceEEEEE-EeccccceeeeeEEeC Confidence 000 000000 0012334556666665 No 36 >protein:vir:177 Length: 472 # NCBI annotation: DNA stabilization protein # Family: family:all:1540 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112082;genbank:gi:13559872;genbank:GeneID:920982 Probab=79.59 E-value=0.1 Score=26.00 Aligned_cols=427 Identities=11% Similarity=0.064 Sum_probs=156.5 Q ss_pred eEEEEecC--CCCcccccccchhHHhHhhhhhcccccCceeeeecCceEEEEecCCcceeeEEEecCcCcccceeEEEec Q lcl|NC_015271. 175 QATYQIPD--GSQPEHVNNTDAQWLAEELARQCRVSAPGWTFNVGQGYIHIIAPEGQQIDSLTTKDGYADQLINPVTHYA 252 (795) Q Consensus 175 ~a~~ttp~--~s~~~~~~~~~~~~i~~~l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~~~~~~~~dg~~~t~~~~~~~~v 252 (795) -..-..|- +... ....+.++....++-.-.. .......++ +.+..|- . T Consensus 1 m~~~~~Pl~~G~~~---~~~~~d~~~~~pVN~~a~~---~~~~~s~~~-------------l~~tPGl------~----- 50 (472) T protein:vir:17 1 MPIQQLPLMKGVGK---DFRNADYIDYLPVNMLATP---KEILNSSGY-------------LRSFPGI------A----- 50 (472) T ss_pred CCeeeeeeccCcee---eccccchhheeeeeeeeec---cCCCcccce-------------eecCCCc------e----- Confidence 11111111 1100 0011111100001000000 000000000 1110000 0 Q ss_pred cceeeccccc-------CCCeEEEEEcCCCCCcceEEEEEeecCceEEEeeeeeEeeeEeccceeEEEEeecCceeeecc Q lcl|NC_015271. 253 QSFSKLPTNA-------PEGYVVKIVGDASKSADQYYVRYDTTRKVWSETLGWNVNDQLLFETMPHALVRAADGNFELKR 325 (795) Q Consensus 253 ~~~~~l~~~~-------~~G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~~~~~~~~~~~~t~p~~~v~~~~~t~~~~~ 325 (795) ..+++++.+ ++|-.-.|.++ .-|.+. ..|-+.++-+.+.-.++++..-.++.....-|.... T Consensus 51 -~~a~v~G~~RG~~~~~~~g~lY~V~G~-----~LY~v~-----~~iGsiag~grVsMa~n~~~~av~~~g~~~~Y~y~~ 119 (472) T protein:vir:17 51 -KRSDVNGVSRGVEYNMAQNAVYRVCGG-----KLYKGE-----SEVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDG 119 (472) T ss_pred -eeccCCccccceEEEeeCCeEEEEecc-----eEeeee-----cceecccCcccEEEecCCcEEEEEECCceeEEEeec Confidence 011222222 12222223221 122221 124444444444443444332112111111122211 Q ss_pred cCCccccCCcccccccccccCCCccEEEEEcceEEEecC--CeEEEEecCCccccccccccccCCCCcEEEEEcCCCcee Q lcl|NC_015271. 326 IEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLSG--ENIILSRTAKYFNFYPASIATLSDDDPIDVAVSTNRIAI 403 (795) Q Consensus 326 ~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~~--~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~ 403 (795) ..-....-.++..+|-.+ -.....|+|...|++|.-. +.++.|-..|-+.+ ++.-.++.+..+++. T Consensus 120 ~v~t~~~~~~d~~~~~~d--lg~~~dv~f~dGyfV~~~~Gt~~~~is~l~d~~~~----------~~y~~fa~AE~~pD~ 187 (472) T protein:vir:17 120 TVKTVSNWPTDSGFTQYE--LGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHP----------DRYSAQYRAESQPDG 187 (472) T ss_pred cchhhhcccccccccccc--ccceeeeeeecceEEEeccCcceEEEeccCCcccc----------ccccccccccCCCCc Confidence 110000001122223222 2345689999999988753 33445655553321 223334557777888 Q ss_pred EEEEeecCCcEEEEecCcE--EEEeCCccccccceEEEEEE----eecCcCCCCcEEeCCeEEEEecCCCee-EEEEEEe Q lcl|NC_015271. 404 LKYAVPFSEELLIWSDEAQ--FVLTASGTLTSRSIELNLTT----QFDVQDRARPFGIGRNVYFASPRSSFT-SIHRYYA 476 (795) Q Consensus 404 i~~~v~~~~~L~l~t~~~q--~~i~~~~~lTP~~~~~~~~s----~~~~~~~~~Pv~vg~~v~fv~~~g~~~-~v~~~~~ 476 (795) |.-++...+.|++|.+..- |..+|+...+- +-+.+++ ..+|++.-.=..+|++++|+......- .+++ T Consensus 188 Ivgi~~~~~~i~lfG~~TiEvw~ntG~a~~~~--fpy~r~~g~~iq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~--- 262 (472) T protein:vir:17 188 IIGIGTWRDFIVCFGSSTIEYFSLTGATTVGA--ALYVAQPSLMVQKGIAGTYCKTPFADSYAFISNPATGAPSVYI--- 262 (472) T ss_pred eEEEEeeccEEEEEeccceEEEEeeCCCCCCc--CceeecCcceeeecccCcchhhecCceEEEEecCCccccEEEE--- Confidence 8888999999999977665 77888643321 1122221 346777666689999999998863221 2333 Q ss_pred eccccCceehhhHH-HHHHHhcCCC----cE--EEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEe-eecC- Q lcl|NC_015271. 477 VQDVSSVKNAEDIT-AHVQNYIPNG----VF--DICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSH-WDFG- 547 (795) Q Consensus 477 ~~~~~d~~~~~dls-~~~~hl~~g~----~~--~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~-w~~~- 547 (795) .++|+++=|| .-++..|+.. +. .+.+++.+.+.+.....-+ .-+||.-- . .-||. |-.. T Consensus 263 ----~~g~~~~rIST~aIE~~i~~y~~~e~~dA~~~t~~~~GH~fy~LtfP~-~Tw~yD~~--t-----~~Wherw~~~~ 330 (472) T protein:vir:17 263 ----IGSGQVSPISSASIEKILRSYTADELADGVMESLRFDAHELLIIHLPR-HVLVYDAS--S-----SANGPQWCVLK 330 (472) T ss_pred ----ccCceeEEecCHHHHHHHHhcCCccccceeEEEEEeCCeEEEEEEcCC-ceeEeecc--c-----ccCceeeeeec Confidence 3445666663 3444555443 22 2455666666554444444 33343311 1 13443 2221 Q ss_pred -C----CeEEEE----------EEEeCCEEEEEEEe---CCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccc Q lcl|NC_015271. 548 -S----NVQVLA----------CQCINSDMYVILRN---EFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYN 609 (795) Q Consensus 548 -g----~~~~~~----------~~~~~d~l~~~v~R---~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~ 609 (795) | .....+ ....+..||.+--. ..+-.++++... .-...+..+++ |..+.... ++ T Consensus 331 ~g~~~~~~Ra~~~~~~~g~~~vGD~~ng~ly~ld~~~~td~g~pi~~~~~~---p~~~~~~~RV~-d~el~~~t--G~-- 402 (472) T protein:vir:17 331 TGLYDDVYRAIDFIYEGNQITCGDKLESVTGKLQFDISSQYDKQQEHLLFT---PLFKADNARVF-DLEVESST--GV-- 402 (472) T ss_pred CCCccCceEEEEEEEeCCeEEEEEcCCCeEEEEcccCcCCCCceeEEEEec---ceeeCCCceEE-EEEEeeeC--Cc-- Confidence 1 112222 22234556665432 222334443222 11222223443 43332211 11 Q ss_pred cccccceeecccccCCcccCceEEEEecCCcccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccC Q lcl|NC_015271. 610 DDTFTTTLHLPTIYGADFAKGKITVLEADGKITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVD 689 (795) Q Consensus 610 ~~~~~t~~~~~~~~gl~~~~g~~v~~~adg~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~ 689 (795) +....+ ....-++||....... -+....+ -.|..++....+-.-..+ T Consensus 403 -~~~adp--------------~~l~~~sDg~~~g~~~-----------~~~~~~~-------g~~~~R~~~~RlG~~r~~ 449 (472) T protein:vir:17 403 -AQYADR--------------LFLSATTDGINYGREQ-----------MIEQNEP-------FVYDKRVLWKRVGRIRKN 449 (472) T ss_pred -ccCCCc--------------eEEEcccCCcccchhh-----------hhhhccC-------cccccceeeeeeeecccc Confidence 000000 1111233321000000 0000000 012222222211100000 Q ss_pred CccceeccccccEEEEEEEEEee Q lcl|NC_015271. 690 NDGSTSTEDIGRLQLRRAWVNYE 712 (795) Q Consensus 690 g~~~~~~~~~grl~l~~~~~~~~ 712 (795) =.=...-....++.|..+++++. T Consensus 450 v~f~~~~~~~~~~~l~~a~~~~e 472 (472) T protein:vir:17 450 VGFKLRVITKSPVTLSGCQIRIE 472 (472) T ss_pred ceEEEEEeecccceeeeeEEEeC Confidence 00000001113455777777777 No 37 >protein:vir:2109 Length: 472 # NCBI annotation: head completion protein # Family: family:all:1540 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059633;genbank:gi:9635541;genbank:GeneID:1262840 Probab=55.57 E-value=0.48 Score=22.35 Aligned_cols=423 Identities=10% Similarity=0.007 Sum_probs=149.8 Q ss_pred cCceeeeecCceEEEEecCCcceeeEEEecCcCcccceeEEEec-cceeecccccCCCeEEEE------EcCC-CCCcce Q lcl|NC_015271. 209 APGWTFNVGQGYIHIIAPEGQQIDSLTTKDGYADQLINPVTHYA-QSFSKLPTNAPEGYVVKI------VGDA-SKSADQ 280 (795) Q Consensus 209 ~~g~t~~~~g~~~~i~~~~~~~~~~~~~~dg~~~t~~~~~~~~v-~~~~~l~~~~~~G~~v~v------~~~~-~~~~~~ 280 (795) .+-..+....+-..- ...+..+..+.+ | +.+....+ ....- -.+..|...+. ++-- ++-.+. T Consensus 1 m~~~q~Pl~~g~~~~-~~~~d~~~~~pV-N------~~a~~~~~~~s~~~--lr~tPG~~~~~~~~g~~RG~~~~t~~~~ 70 (472) T protein:vir:21 1 MPIQQLPMMKGMGKD-FKNADYIDYLPV-N------MLATPKEILNSSGY--LRSFPGITKRYDMNGVSRGVEYNTAQNA 70 (472) T ss_pred CceEEeecccccccc-ccccceeeeeee-e------eeeeccCCccccee--eeecCCcceeccCCCceeeeeecccCCe Confidence 222222222111000 000111111110 0 00000000 00000 00112211111 1100 010111 Q ss_pred EEEEEeecCceEEEeeeeeEee------eEeccceeEEEEeecCce--eeecccCCccccCCcccccccccccCCCccEE Q lcl|NC_015271. 281 YYVRYDTTRKVWSETLGWNVND------QLLFETMPHALVRAADGN--FELKRIEWSPKTCGDDDTNPWPSFMDSTINDV 352 (795) Q Consensus 281 yy~~~~~~~~~w~E~~~~~~~~------~~~~~t~p~~~v~~~~~t--~~~~~~~w~~~~~gd~~~np~psf~~~~~~~v 352 (795) .|+ ..+...|+-.+.-+++. -..+.+. ..+ ...+.. |......+....-.++..+|-. .-..+-.| T Consensus 71 ly~--V~G~~LY~v~~~~G~i~gsgrVsMa~n~~~-~~v-~~~~~~~~Y~~~~~~~t~~~~~~d~~f~~~--dl~~~~dv 144 (472) T protein:vir:21 71 VYR--VCGGKLYKGESEVGDVAGSGRVSMAHGRTS-QAV-GVNGQLVEYRYDGTVKTVSNWPADSGFTQY--ELGSVRDI 144 (472) T ss_pred EEE--EeCCceEEEeeeeeeecccccEEEeeCCeE-EEE-EECCceeEEEEecchhhhhcccCccccccc--cccceeEE Confidence 121 11333343222222221 1112211 111 111111 2222222211111111122211 11223479 Q ss_pred EEEcceEEEecC--CeEEEEecCCccccccccccccCCCCcEEEEEcCCCceeEEEEeecCCcEEEEecCcE--EEEeCC Q lcl|NC_015271. 353 FFFRNRLGLLSG--ENIILSRTAKYFNFYPASIATLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQ--FVLTAS 428 (795) Q Consensus 353 ~f~q~RL~f~~~--~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q--~~i~~~ 428 (795) +|...|++|.-. +..+-|-.-|-+..++- ++ ++-+..+++.|.-++...+.|++|.+..- |..+|+ T Consensus 145 ~f~dGyfV~~~~gt~~f~is~l~d~~~~~~y--------~~--FatAE~~pD~Iv~i~~~~~~l~lfG~~TiEvw~ntG~ 214 (472) T protein:vir:21 145 TRLRGRYAWSKDGTDSWFITDLEDESHPDRY--------SA--QYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGA 214 (472) T ss_pred EEecceEEEccCCcceeEEecCCCCccccCC--------cc--ceeeccCCCceEEEEeeccEEEEEeccceEEEEecCC Confidence 999999988753 33444544443221110 11 46677778888889999999999977665 777775 Q ss_pred ccccccceEEEEEE----eecCcCCCCcEEeCCeEEEEecCCCeeEEEEEEeeccccCceehhhHH-HHHHHhcCCCc-- Q lcl|NC_015271. 429 GTLTSRSIELNLTT----QFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDVSSVKNAEDIT-AHVQNYIPNGV-- 501 (795) Q Consensus 429 ~~lTP~~~~~~~~s----~~~~~~~~~Pv~vg~~v~fv~~~g~~~~v~~~~~~~~~~d~~~~~dls-~~~~hl~~g~~-- 501 (795) . ++..+-+.+++ ..+|++.-.=..+|++++|+...+..--.+.. .++|+++=|| .-++..|+... T Consensus 215 a--d~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~------~~g~qa~rIST~aIE~~i~~y~~~ 286 (472) T protein:vir:21 215 T--TAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYI------IGSGQASPIATASIEKIIRSYTAE 286 (472) T ss_pred C--CcCcCceEEcCcceeeecccCcchhhecCceEEEEecCCCcccEEEE------ccCceeEEecCHHHHHHHHhcCCc Confidence 3 23333344433 35787766668999999999988643222222 3455666663 33444444331 Q ss_pred ----EEEEEeCCCCeEEEEEEcCCCEEEEEEEeeCCCceeEEeeEeeecCCC---eEEEEEEE----------eCCEEEE Q lcl|NC_015271. 502 ----FDICGSSTENFCAVLSQGDQSKIFMYKFLYLNEELRQQSWSHWDFGSN---VQVLACQC----------INSDMYV 564 (795) Q Consensus 502 ----~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~eq~v~aW~~w~~~g~---~~~~~~~~----------~~d~l~~ 564 (795) ..+.+++.+.+.+.....-+. -+||.- ..+ +...-||.-+..+. ....++++ .+..||. T Consensus 287 e~~~A~~~t~~~eGH~fy~LtfP~~-Tw~yD~--at~-~~~e~W~~~~sg~~~~~~R~~~~~~~~g~~ivGD~~nG~ly~ 362 (472) T protein:vir:21 287 EMATGVMETLRFDSHELLIIHLPRH-VLVYDA--SSS-QNGPQWCVLKTGLYDDVYRGVDFMYEGNQITCGDKSEAVVGQ 362 (472) T ss_pred cccceEEEEEEeCCeEEEEEEcCCe-eEEEEc--ccC-ccCceeeeeccCCCcCceeEEEEEeeCCeEEEEEcCCCeEEE Confidence 235566677765554444453 334331 111 11112777766432 22223322 2345555 Q ss_pred EEEeC---CCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeecccccCCcccCceEEEE--ecCC Q lcl|NC_015271. 565 ILRNE---FNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIYGADFAKGKITVL--EADG 639 (795) Q Consensus 565 ~v~R~---~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~~g~~v~~--~adg 639 (795) +--.. .+...+++.. ..-...+..++ +|..+... .|....+- .+.+ ..|| T Consensus 363 L~fd~~~~~d~~~~~~r~---~p~~~~dn~R~-fd~eve~~--------------------~Gv~q~~d-~v~L~wSddG 417 (472) T protein:vir:21 363 LQFDISSQYDKQQEHLLF---TPLFKADNARC-FDLEVESS--------------------TGVAQYAD-RLFLSATTDG 417 (472) T ss_pred EEecccccCCCcCcEEEE---ccceeCCCCEE-EEEeeecc--------------------CCCCCcCc-EEEEEeeccc Confidence 42111 1111112111 00011111222 23222211 11111110 1111 1122 Q ss_pred cccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEEEEEEee Q lcl|NC_015271. 640 KITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRRAWVNYE 712 (795) Q Consensus 640 ~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~~ 712 (795) ...... ..+.++.++ .|..++....+-.-..+=.=...-....++.|+.+.+++. T Consensus 418 ~~~~~~-----------~~~~~g~~g-------~~~tr~~~~RlG~~r~~v~f~~r~~~~~~~~l~g~~~~~E 472 (472) T protein:vir:21 418 INYGRE-----------QMIEQNEPF-------VYDKRVLWKRVGRIRRLIGFKLRVITKSPVTLSGCQIRLE 472 (472) T ss_pred cccccc-----------eeeccCCcc-------chhcceeeeeeeecccceeEEEEEEecCcceeeeeEEeeC Confidence 211000 011111110 0111111111110000000000000113445667777766 No 38 >protein:vir:108312 Length: 458 # NCBI annotation: hypothetical protein # Family: family:all:1540 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552278;genbank:gi:160700603;genbank:GeneID:5758828 Probab=34.49 E-value=1.3 Score=19.98 Aligned_cols=437 Identities=13% Similarity=0.089 Sum_probs=173.2 Q ss_pred eEEEEecCCCCcccccccchhHHhHhhhhhcccccCceeeeecCceEEEEecCCcceeeEEEecCcCcccceeEEEeccc Q lcl|NC_015271. 175 QATYQIPDGSQPEHVNNTDAQWLAEELARQCRVSAPGWTFNVGQGYIHIIAPEGQQIDSLTTKDGYADQLINPVTHYAQS 254 (795) Q Consensus 175 ~a~~ttp~~s~~~~~~~~~~~~i~~~l~~~~~s~~~g~t~~~~g~~~~i~~~~~~~~~~~~~~dg~~~t~~~~~~~~v~~ 254 (795) -+.-..|.+++.......+. ..+++-+-.. ........ +++..-+... T Consensus 1 m~~~~ip~gsy~a~~~~~da----q~~VN~yp~~---~e~g~ss~--~l~~tPGl~~----------------------- 48 (458) T protein:vir:10 1 MVQRQIPLVATTAEGDVSGQ----EILVNVYPRK---SDGGKYPF--TLRHTPGLAF----------------------- 48 (458) T ss_pred Cceeeeceeeeecccccccc----eeeeeeeeec---cccccccc--ceEecCCcee----------------------- Confidence 22223333221111000000 0000000000 00000000 0111111000 Q ss_pred eeecccccCC------CeEEEEEcCCCCCcceEEEEEeecCceEEEee---eeeEeeeEeccceeEEEEeecCceeeecc Q lcl|NC_015271. 255 FSKLPTNAPE------GYVVKIVGDASKSADQYYVRYDTTRKVWSETL---GWNVNDQLLFETMPHALVRAADGNFELKR 325 (795) Q Consensus 255 ~~~l~~~~~~------G~~v~v~~~~~~~~~~yy~~~~~~~~~w~E~~---~~~~~~~~~~~t~p~~~v~~~~~t~~~~~ 325 (795) .+++++.... |....+++ .+=|- ..++++.+|.. +-+.+...++++ ..++.....-|...- T Consensus 49 f~~~~~~~~~g~~~~~g~ly~v~g-----~~LY~---V~~~~~~~~iG~i~gsg~VsMa~ng~--q~vi~~G~~gY~yd~ 118 (458) T protein:vir:10 49 FCELPTFPVMAMHQNGSRAFAVTP-----RDMYE---ISKDGTYKRLGSVDFKGRVVMEDNGK--QIVMVDGEKGYYYDS 118 (458) T ss_pred eecCCCCceeeEEecCCEEEEeeC-----ceEEE---EeCCceEEEEecccCceeEEEeeCCc--EEEEEECCeEEEEee Confidence 1222222221 22222221 11121 12222222222 223333333333 111112222222211 Q ss_pred cCCccccCCcccccccccccCCCccEEEEEcceEEEec--CCeEEEEecCCccccccccccccCCCCcEEEEEcCCCcee Q lcl|NC_015271. 326 IEWSPKTCGDDDTNPWPSFMDSTINDVFFFRNRLGLLS--GENIILSRTAKYFNFYPASIATLSDDDPIDVAVSTNRIAI 403 (795) Q Consensus 326 ~~w~~~~~gd~~~np~psf~~~~~~~v~f~q~RL~f~~--~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~ 403 (795) ..+.-- .+-.+.| ..+..|.|..+|++|.. ++.++.|-.+| . --||++++-+.++++. T Consensus 119 at~~~~------~i~d~~~--~~~~~v~~~dGy~V~~~~g~~~~~is~L~d------~------s~d~l~fa~Ae~~pD~ 178 (458) T protein:vir:10 119 ETEIVQ------EIKAEGF--YPASTVTYQDGYFIFDRKGTGQFFISELLD------V------AFDPLDFATAEGQPDP 178 (458) T ss_pred cccEEE------eccCccc--cCcceEEEeCcEEEEEeeCCCEEEEEecCc------c------eeCcceeeeecCCCCc Confidence 110000 0011122 22578999999999873 45677785444 1 1469999989999999 Q ss_pred EEEEeecCCcEEEEecCcE--EEEeCCccccccceEEEEEEeecCcCCCCcEEeCCeEEEEecCCCeeEEEEEEeecccc Q lcl|NC_015271. 404 LKYAVPFSEELLIWSDEAQ--FVLTASGTLTSRSIELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDVS 481 (795) Q Consensus 404 i~~~v~~~~~L~l~t~~~q--~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~fv~~~g~~~~v~~~~~~~~~~ 481 (795) |.-++...+.|++|.+..- |..+|+..+.=.......+ .++|++.-.=..+|++++|+...+ .|++. T Consensus 179 iv~i~~~~~~i~~fG~~TiEvw~ntG~a~fpy~r~~ga~i-~~Gcaa~~sv~~~~~t~~~l~~d~---~Vy~l------- 247 (458) T protein:vir:10 179 LLAVLSDHREVFMFGQETIEVWYNSGAADFPFERNQGAFI-EKGIGAPYSVAKTNNTVYFIGSDL---MIYQI------- 247 (458) T ss_pred eEEEEeeccEEEEEeccceEEEEecCCCCcceeeccccee-eecccCcchhhhhCceEEEEcCCe---EEEEe------- Confidence 9999999999999977665 7778864322111111111 346776666688999999998865 34443 Q ss_pred CceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCC-CEEEEEEEeeCCCceeEEeeEeeecCCCeEEEEEEEeCC Q lcl|NC_015271. 482 SVKNAEDITAHVQNYIPNGVFDICGSSTENFCAVLSQGDQ-SKIFMYKFLYLNEELRQQSWSHWDFGSNVQVLACQCINS 560 (795) Q Consensus 482 d~~~~~dls~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~d-g~l~~~ty~~~~~eq~v~aW~~w~~~g~~~~~~~~~~~d 560 (795) ++|+++-+|-|+ |+.-+ .++ +-...+.++-..+ ..+|+++| ++.. | .|-+|..- T Consensus 248 ~g~~~~rIST~a---IE~~i---~sy-~~~da~a~t~~~eGH~fy~Ltf-P~a~------~-Tw~yD~~t---------- 302 (458) T protein:vir:10 248 TGYTPVRISTHA---VEQTL---KGV-NLSDAFAYTYQSEGHLFYVLTI-PGKN------L-TWCYDISS---------- 302 (458) T ss_pred cCceeEEeeCHH---HHHHH---hcC-ChhheEEEEEEecCeEEEEEEC-CCCC------c-eeEEeccc---------- Confidence 345555444332 33322 222 1222344444333 35666665 2111 1 12333211 Q ss_pred EEEEEEEeCCCEEEEEEEEeeccccCCCCcceeeeeeeeeEeecCcccccccccceeecccccCCcccCceEEEEecCCc Q lcl|NC_015271. 561 DMYVILRNEFNTFLTRVSFTKSTVDLQGEPYRAFMDMKIRYMIPNGTYNDDTFTTTLHLPTIYGADFAKGKITVLEADGK 640 (795) Q Consensus 561 ~l~~~v~R~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~~g~~v~~~adg~ 640 (795) .+|..-+ + +..++ + ...|...+ .+. .+-| ++++|..... .+.. T Consensus 303 ~~Wher~-S--g~~~~--------------~--Ra~~~v~~---~g~-------------~~vG-D~~ng~ly~l-d~~~ 345 (458) T protein:vir:10 303 GSWHVRQ-S--YQFDR--------------H--VSNNSIYF---DQK-------------TLVG-DFQNGRIYIM-ADNY 345 (458) T ss_pred ccceeec-c--CCCCc--------------e--EEEEEEEe---CCe-------------EEEE-EcCCCeEEEE-cccC Confidence 1232211 0 00111 1 11221111 000 0001 3334433222 1010 Q ss_pred ccccceeeeccCCCceEEEecCCCCcEEEEeEeeeEEEEecceeEEccCCccceeccccccEEEEEEEEEeeccceEEEE Q lcl|NC_015271. 641 ITEFEEPEVGWKNDPELRLNGNLEGSVVYVGFNIDFVYEFSKFRIKQVDNDGSTSTEDIGRLQLRRAWVNYEDSGTFDIY 720 (795) Q Consensus 641 ~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~grl~l~~~~~~~~~t~~~~v~ 720 (795) ... -|-+.+..+.++++ ++ .+.|++++++.|.+. ||--.+ T Consensus 346 ------~td--------------------~g~~i~~~~~~p~~-----~~-------~~~rl~~~~~el~~~-tGvg~~- 385 (458) T protein:vir:10 346 ------YTD--------------------DGDPVVREFILPVV-----NN-------GREFLTVDSLELDLS-SGVGLT- 385 (458) T ss_pred ------cCC--------------------CCceeeeeeeccce-----eC-------CCCeEEEEEEEEEEe-cceeee- Confidence 000 12223333334332 11 113566677666653 221111 Q ss_pred ecCCcc--ccccccc---ccccccccc--cccccccccceEEEEeeecccceEEEEEECCCCCEEEEEEEEEEE Q lcl|NC_015271. 721 VENQSS--NWKYTMA---GARLGAHVM--RTGKLNLGTGQYRFPVVGNAKFNTVFILSDATTPLNIIGCGWEGN 787 (795) Q Consensus 721 v~~~~~--~~~~~~~---~~~~~~~~~--~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlsi~~eg~ 787 (795) .++.. +-...++ |..+++... ..|++-......+..-.|..++--++|+-.+|.|.+|+++..+.+ T Consensus 386 -~~~~~~p~~~l~~S~d~g~~~s~~~~~~~lg~~gey~tr~~~~rlG~ar~rvf~v~~s~p~~~~l~ga~~~~r 458 (458) T protein:vir:10 386 -VGQGSDPELRVYFSKDNGNEYSQNFKVGKIGRKGEFLTRAKVNRFGCARQFTFKVEISDPIPVDIGGAWVEVR 458 (458) T ss_pred -eCCCCCceEEEEEeeCCCcccchhHHHhhcCCcchhhhhhhhhhhccCcceEEEEEEecchhhcceeeeEEeC Confidence 11110 0001111 111111100 112222222222222245666666999999999999999999988 Done!