Query lcl|NC_019510.1_cdsid_YP_007005461.1 [gene=F409_gp37] [protein=tail tubular protein B] [protein_id=YP_007005461.1] [location=22710..25109] Match_columns 799 No_of_seqs 150 out of 216 Neff 8.1 Searched_HMMs 1612 Date Thu Nov 7 17:00:28 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_37 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_37_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:2203 Length: 794 # 100.0 6E-248 4E-251 1376.0 90.7 790 1-799 1-794 (794) 2 protein:vir:10452 Length: 794 100.0 3E-247 2E-250 1372.5 88.7 790 1-799 1-794 (794) 3 protein:vir:1543 Length: 801 # 100.0 4E-246 3E-249 1365.8 88.9 793 1-799 1-801 (801) 4 protein:vir:94713 Length: 785 100.0 1E-245 8E-249 1363.3 87.7 785 1-799 1-785 (785) 5 protein:vir:99677 Length: 794 100.0 7E-245 4E-248 1359.2 88.7 788 1-799 1-794 (794) 6 protein:vir:3366 Length: 801 # 100.0 2E-243 1E-246 1351.2 88.3 793 1-799 1-801 (801) 7 protein:vir:94583 Length: 792 100.0 1E-242 8E-246 1346.6 87.5 789 1-799 1-792 (792) 8 protein:vir:8887 Length: 808 # 100.0 1E-238 6E-242 1325.4 87.7 794 1-799 1-808 (808) 9 protein:vir:7021 Length: 803 # 100.0 7E-221 4E-224 1227.5 85.0 777 2-799 1-803 (803) 10 protein:vir:97014 Length: 800 100.0 2E-220 1E-223 1224.8 84.3 768 2-799 1-800 (800) 11 protein:vir:105647 Length: 800 100.0 7E-220 5E-223 1222.0 82.6 780 2-799 1-800 (800) 12 protein:vir:103341 Length: 806 100.0 4E-217 2E-220 1207.0 86.2 773 2-799 1-806 (806) 13 protein:vir:80253 Length: 777 100.0 8E-217 5E-220 1205.2 84.7 766 1-799 1-777 (777) 14 protein:vir:6326 Length: 826 # 100.0 7E-214 5E-217 1189.0 85.0 770 1-799 1-826 (826) 15 protein:vir:78957 Length: 826 100.0 2E-210 1E-213 1170.0 85.6 775 1-799 1-826 (826) 16 protein:vir:100022 Length: 976 100.0 4E-209 2E-212 1163.4 80.4 778 1-799 1-976 (976) 17 protein:vir:78703 Length: 905 100.0 3E-206 2E-209 1147.5 81.4 772 1-799 1-905 (905) 18 protein:vir:103790 Length: 768 100.0 3E-179 2E-182 999.6 74.4 730 1-796 1-768 (768) 19 protein:vir:95324 Length: 823 100.0 4E-169 2E-172 944.0 70.5 721 1-795 1-823 (823) 20 protein:vir:7329 Length: 825 # 100.0 5E-167 3E-170 932.2 68.7 721 1-795 1-825 (825) 21 protein:vir:107423 Length: 681 100.0 1E-164 6E-168 919.6 70.1 663 1-794 1-681 (681) 22 protein:vir:98487 Length: 681 100.0 1E-164 6E-168 919.6 70.1 663 1-794 1-681 (681) 23 protein:vir:107802 Length: 681 100.0 1E-164 6E-168 919.6 70.1 663 1-794 1-681 (681) 24 protein:vir:1778 Length: 680 # 100.0 4E-160 2E-163 894.5 54.9 547 1-556 1-680 (680) 25 protein:vir:102644 Length: 594 100.0 6E-143 3E-146 800.5 59.0 563 1-795 1-594 (594) 26 protein:vir:94602 Length: 1012 99.3 1.1E-10 6.6E-14 75.3 38.5 765 1-798 1-1012(1012) 27 protein:vir:80177 Length: 1027 99.1 5.7E-10 3.5E-13 71.3 26.2 743 1-799 1-936 (1027) 28 protein:vir:2625 Length: 715 # 99.1 3.5E-09 2.2E-12 66.9 41.8 621 1-796 1-715 (715) 29 protein:vir:95475 Length: 771 98.2 2.7E-06 1.7E-09 51.1 40.7 654 1-796 1-771 (771) 30 protein:vir:8837 Length: 513 # 97.6 3.4E-05 2.1E-08 45.1 39.6 483 1-798 1-513 (513) 31 protein:vir:3133 Length: 911 # 97.2 0.00012 7.4E-08 42.1 36.0 655 1-799 1-839 (911) 32 protein:vir:105563 Length: 396 92.0 0.013 8.3E-06 30.9 18.9 372 1-537 1-396 (396) 33 protein:vir:108312 Length: 458 91.5 0.016 9.7E-06 30.5 33.2 439 149-791 1-458 (458) 34 protein:vir:105428 Length: 472 81.3 0.087 5.4E-05 26.4 34.4 426 180-754 1-472 (472) 35 protein:vir:177 Length: 472 # 79.0 0.11 6.8E-05 25.9 35.0 426 180-745 1-472 (472) 36 protein:vir:7329 Length: 825 # 61.5 0.35 0.00022 23.1 34.3 610 62-799 1-743 (825) 37 protein:vir:95324 Length: 823 51.8 0.57 0.00035 21.9 39.0 630 86-799 1-741 (823) 38 protein:vir:105525 Length: 472 44.0 0.82 0.00051 21.0 32.6 426 191-716 1-472 (472) 39 protein:vir:2109 Length: 472 # 41.2 0.94 0.00058 20.7 29.9 422 214-718 1-472 (472) 40 protein:vir:3529 Length: 477 # 32.0 1.5 0.0009 19.7 33.2 429 206-744 1-477 (477) No 1 >protein:vir:2203 Length: 794 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_042000;swissprot:sw:p03747;genbank:gi:9627472;uniprot:P03747;genbank:GeneID:1261024 Probab=100.00 E-value=5.8e-248 Score=1376.02 Aligned_cols=790 Identities=63% Similarity=1.070 Sum_probs=754.8 Q ss_pred CCceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCceEEEEE Q lcl|NC_019510. 1 MGLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYEQYYVV 80 (799) Q Consensus 1 M~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~~y~l~ 80 (799) ||+|+|+||||+||||||+|++||++||++|+||+|+|+||++||||++||++++++..+.+..++++|+|+++|+|+|+ T Consensus 1 M~~i~~s~~n~~~GvSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~~~~~~~~~y~l~ 80 (794) T protein:vir:22 1 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTLGDNGALGQAPYIHLINRDEHEQYYAV 80 (794) T ss_pred CceeeeecchhhcccccCCchHHhhhHHhhhhcceeeccCCceeCCchHhhhhhcccCCCCCccEEEEEEeCCCcEEEEE Confidence 99999999999999999999999999999999999999999999999999999988777778889999999999999999 Q ss_pred EeCCeEEEEeCCCeEEEEE--ecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceEEEEecCC Q lcl|NC_019510. 81 FTGGDIKVFDLNGQEYAVR--GDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDALINVRGGQ 158 (799) Q Consensus 81 ~~~g~irv~~~~G~~~~v~--~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~~~v~~~~ 158 (799) |++++||||+++|.++.|. ...+|+.+++++.+|+|+|+||++||+|++++|+++.|.++..++++.++++++++.++ T Consensus 81 ~~~~~irv~~~~G~~~~v~~~~~~~y~~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~~~g~v~v~~g~ 160 (794) T protein:vir:22 81 FTGSGIRVFDLSGNEKQVRYPNGSNYIKTANPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVRGGQ 160 (794) T ss_pred EcCCeEEEEecCCcEEEeecCCCccceecCCCcccEEEEEEcCEEEEEcCCeeeeEeeccccCCCCCCCceEEEEccCCc Confidence 9999999999999888875 55678888899999999999999999999999999999999999999999999999999 Q ss_pred CCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEecCCCceeE Q lcl|NC_019510. 159 YGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAPDGDSIRG 238 (799) Q Consensus 159 y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~~~~~~~~ 238 (799) |+++|.+++++...+++++|+++.. .+....++++|+.+|..++. +...+|++.+++++++++++++..++. T Consensus 161 y~~ty~v~I~~~~~a~~~~p~gt~~-----~~~~~~~~~~ia~~L~~~l~---~~~~~~t~~~~~~~~~i~a~~~~~~~~ 232 (794) T protein:vir:22 161 YGRELIVHINGKDVAKYKIPDGSQP-----EHVNNTDAQWLAEELAKQMR---TNLSDWTVNVGQGFIHVTAPSGQQIDS 232 (794) T ss_pred cceeEEEEeccCcceEEEEcCCCcc-----ccceeechhhhhhhhhhhhe---eccccceEEeCCceEEEEEcCCceEEE Confidence 9999999999999999999988653 34456678889988887764 356789999999999999999999999 Q ss_pred EEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccceeeccceeeEE Q lcl|NC_019510. 239 LQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVGLNNGTMPWSL 318 (799) Q Consensus 239 ~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~~t~p~~l 318 (799) +++.+|.+++.+.++.+.++++++||++|++|++++|.+++++..++||++|+..++.|+||+++++..+++.+|||+.+ T Consensus 233 ~t~~~g~~~t~~~~~~~~~~~~~~lp~~~~~G~~v~i~~~~~~~~~~Y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~l 312 (794) T protein:vir:22 233 FTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWTETLGWNTEDQVLWETMPHAL 312 (794) T ss_pred EeeecccCcceeEEEEeccccceeccccCCCCeEEEEEeCCCCCcceeEEEEeccceEEEEeeeccceeeecccceeeEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEE Q lcl|NC_019510. 319 IRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLSGENIILSRTAKYFNMYPASVAVLSDDDPID 398 (799) Q Consensus 319 v~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~ 398 (799) +++++++|+++..+|++|.+||+++||.|||+|++|++|+||||||+|+++++|||||+||||||+++|++++.|||||+ T Consensus 313 v~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~ 392 (794) T protein:vir:22 313 VRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLSGENIILSRTAKYFNFYPASIANLSDDDPID 392 (794) T ss_pred eeccCCcEEEeeccccccccCccccCCcceecCCCcceEEEEcceEEEecCCeEEEEccCCccccccccCcCCCCCccEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEE Q lcl|NC_019510. 399 VAVSHNRVSILKYAVPFSEELLLWADEAQFVLNASGVLSAKSVELNLTTEFDVNDGARPYGIGRGVYFASPRATFTSINR 478 (799) Q Consensus 399 ~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r 478 (799) ++++++++|+|+|+++++++|+|||+++||+|+++++|||+|+++.++|+|+|+++|+|+.+|++++|+|++|+|++++| T Consensus 393 ~~~ss~~~~~i~~~v~~~~~L~i~t~~~e~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r 472 (794) T protein:vir:22 393 VAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFASPRSSFTSIHR 472 (794) T ss_pred EEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEEeeccCCCCceEeCCeEEEEecCCCeeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEeeEeeecCCCeEEE Q lcl|NC_019510. 479 YYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQSWSHWDFGDNVTVL 558 (799) Q Consensus 479 ~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~w~~~g~~~~~ 558 (799) +++|++++|+|+++|||+|++|||+|+++++++++++|.+++|+++++|+|++|+|||+++||+|+|||||+|+|+++++ T Consensus 473 ~~~~~~~~d~y~~~Dlt~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~~ 552 (794) T protein:vir:22 473 YYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVL 552 (794) T ss_pred eEeeecccCceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceeEEeeEEEEcCCCEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeCCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEccccccccccccccccccccCccccccceE Q lcl|NC_019510. 559 AANSIGSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVDLNAVFGGMRWQVGKI 638 (799) Q Consensus 559 ~~~~~~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~ 638 (799) |+.+.+|+||++|+|+++++++||.+.++..+...+++++||||+.+++++++.++++.+.+.......+ ++.|++|++ T Consensus 553 ~~~~~~d~l~~iv~r~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~g~~~~~~~~t~~~~~~~~-g~~~~~g~~ 631 (794) T protein:vir:22 553 ACQSISSDMYVILRNEFNTFLARISFTKNAIDLQGEPYRAFMDMKIRYTIPSGTYNDDTFTTSIHIPTIY-GANFGRGKI 631 (794) T ss_pred EEEecCCEEEEEEEeCCCEEEEEEEEeeccccCCCccceeeeeeeEEEeeccceeecCCcceEEEccccc-CcccccceE Confidence 9998899999999999999999999999999999999999999999999999999999888877666554 778999999 Q ss_pred EEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccceeeeecceEEEEEEEEEee Q lcl|NC_019510. 639 LVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGRLQHRRAWLNYE 718 (799) Q Consensus 639 v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl~l~~~~~~~~ 718 (799) +.+.+||.++...++.+|+...+.+++++++++++|+|||+|+++++|+||++++++|.+++++...|||||||++++|. T Consensus 632 v~~~~dg~~~~~~~~~~~~~~~~~~~v~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~~~~~~~~~~~grl~l~r~~~~~~ 711 (794) T protein:vir:22 632 TVLEPDGKITVFEQPTAGWNSDPWLRLSGNLEGRMVYIGFNINFVYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWVNYE 711 (794) T ss_pred EEEEcCCceeeceeeeeeeeccceEEeCCCCCCcEEEEeeeeeEEEEecceEEEecCCCccceeeecceEEEEEEEEEec Confidence 99999999999999999998989999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceEEEEecCCCcccceeccccccCcc--cccccccccceEEEEEeecccceEEEEEECCCCcEEEEEEEEEEEEeccc Q lcl|NC_019510. 719 QSGAFYVDVTNLGRSYRYTMSGKPLGDT--TLGQANLESGQFRFPLAGNAQYNRVVLTSDYTTPLSIIGCGWEGNYIRRS 796 (799) Q Consensus 719 ~t~~~~~~v~~~~~~~~~~~~~~~~~~~--~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~~eg~y~~r~ 796 (799) +||+|.+.|++.++++.+.+.+.+++.+ ..+.+++.++.++||+++|+++.+|+|+|++|+||+|++|+|||+||+|+ T Consensus 712 ~tg~~~v~v~~~~~~~~~~~~~~~~g~~~~~~g~~~~~tg~~~vp~~~~~~~~~v~i~~d~p~P~tvlsi~~eg~y~~r~ 791 (794) T protein:vir:22 712 NSGTFDIYVENQSSNWKYTMAGARLGSNTLRAGRLNLGTGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEGNYLRRS 791 (794) T ss_pred cccceEEEEcCCCcccceeecCceecccccccCcccccCceEEEEecccCceEEEEEEECCCCCEEEEEEeEEEEEeccc Confidence 9999999999988888888999999864 45788999999999999999999999999999999999999999999999 Q ss_pred cCC Q lcl|NC_019510. 797 TGI 799 (799) Q Consensus 797 rrv 799 (799) ||| T Consensus 792 ~~v 794 (794) T protein:vir:22 792 SGI 794 (794) T ss_pred cCC Confidence 999 No 2 >protein:vir:10452 Length: 794 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848299;genbank:gi:30387490;genbank:GeneID:1733952 Probab=100.00 E-value=2.5e-247 Score=1372.52 Aligned_cols=790 Identities=63% Similarity=1.069 Sum_probs=749.9 Q ss_pred CCceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCceEEEEE Q lcl|NC_019510. 1 MGLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYEQYYVV 80 (799) Q Consensus 1 M~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~~y~l~ 80 (799) ||+|+|+||||++|||||+|++||++||++|+||+|+|+||++||||++||++++..+......+++++++++.|+|+|+ T Consensus 1 M~~i~~s~~n~~~GvSqq~D~~Ry~~q~~~~~N~~~~~~gG~~rRpGt~fv~~l~~~~~~~~~~~~~~~~rd~~e~~~v~ 80 (794) T protein:vir:10 1 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLKTLGDNGALGQAPYIHLINRDENEQYYAV 80 (794) T ss_pred CcceeeecchhhcccccCCchHHhhhhHhhhhcceeeeccCcccCcchhhheeccCCCccccceeeeEEecCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999988777777788888999999999999 Q ss_pred EeCCeEEEEeCCCeEEEE--EecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceEEEEecCC Q lcl|NC_019510. 81 FTGGDIKVFDLNGQEYAV--RGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDALINVRGGQ 158 (799) Q Consensus 81 ~~~g~irv~~~~G~~~~v--~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~~~v~~~~ 158 (799) |++++||||+.+|.++.| ++..+|+.+++++++|+|+|+||+|||+|++++|++..+......+++..++++.+++++ T Consensus 81 ~~~~~irv~~~~G~~~~v~~~~~~~Y~~aa~~~~~l~~~q~aD~~fivn~~~~~~~~~~~~~~~~~~~~~~~~~~v~~g~ 160 (794) T protein:vir:10 81 FTGTGIRVFDLAGNEKQVRYPNGSNYIKTANPRSDLRMVTVADYTFIVNRNVVVQKDPNSVNLANYNPKQDGLINIRGGQ 160 (794) T ss_pred EeCCeEEEEEcCCcEEEEEcCCCCcceecCCCcceEEEEEEcCEEEEEcCCeeeeeeccccccCCCCCCccEEEEecccc Confidence 999999999999987766 466789888999999999999999999999999999999887777888899999999999 Q ss_pred CCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEecCCCceeE Q lcl|NC_019510. 159 YGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAPDGDSIRG 238 (799) Q Consensus 159 y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~~~~~~~~ 238 (799) |+++|++++++...+++++|+++.. .+....+.++++.+|..++.. ..++|++.+++++++|+++++..+++ T Consensus 161 y~r~y~i~i~~~~~at~~tpdgt~~-----~~~~~~s~~~ia~~L~~~l~a---~~~g~t~~~~g~~i~i~a~s~~~~~t 232 (794) T protein:vir:10 161 YGRELIVHINGKDVATYKIPDGSKP-----EHVNNTDAQWLAERLAKQMRI---NLSGWTVNVGQGFIHVTAPSGQQIDS 232 (794) T ss_pred cceEEEeccCCcceeEEEecCCCCc-----ccceecchhhhhhhhhhhhhc---ccCCceEEeCCeEEEEEeccCceecc Confidence 9999999999999999999988653 344556778899888877654 45689999999999999999988899 Q ss_pred EEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccceeeccceeeEE Q lcl|NC_019510. 239 LQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVGLNNGTMPWSL 318 (799) Q Consensus 239 ~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~~t~p~~l 318 (799) +++.++..++.+.++.+.++++++||++|++|++++|.++++++.++||++|+...+.|+||++++...+++.++||+.| T Consensus 233 ~s~~~~~~~~~~~~v~~~~~~~~~lp~~~~~G~~v~i~~~~~~~~~~yyv~~~~~~~~w~E~~~~g~~~~~~~~tmP~~l 312 (794) T protein:vir:10 233 FTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWTETLGWNTENQVLLETMPHAL 312 (794) T ss_pred ccccCCcCcceeEEEEeccCcceecccCCCCCcEEEEEeCCCCCcceeEEEEEcCCcEEEEecccceeEEEecccceeEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEE Q lcl|NC_019510. 319 IRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLSGENIILSRTAKYFNMYPASVAVLSDDDPID 398 (799) Q Consensus 319 v~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~ 398 (799) +++++++|+++..+|++|.+||+++||.|||+|++|++|+||||||+|+++++|||||+||||||+++|++++.|||||+ T Consensus 313 ~r~~~~t~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~ 392 (794) T protein:vir:10 313 VRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLSGENIILSRTAKYFNFYPASIANLSNDDPID 392 (794) T ss_pred EEeccceEEeeecccccccccccccCccCcccCCCccEEEEEcceEEEeeCCeEEEEecCCcccccccccccCCCCccEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEE Q lcl|NC_019510. 399 VAVSHNRVSILKYAVPFSEELLLWADEAQFVLNASGVLSAKSVELNLTTEFDVNDGARPYGIGRGVYFASPRATFTSINR 478 (799) Q Consensus 399 ~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r 478 (799) ++++++++|+|+|+++++++|+|||+++||+|+++++|||+|++++++|+|+|++.|+|+.+|++++|++++|+|++++| T Consensus 393 ~~~ss~~~~~i~~~v~~~~~L~i~T~~~q~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r 472 (794) T protein:vir:10 393 VAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTASGTLTSRSVELNLTTQFDVQDRARPYGIGRNVYFASPRSSYTSIHR 472 (794) T ss_pred EEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEeecccCCCCceEeCCeEEEEecCCCeeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEeeEeeecCCCeEEE Q lcl|NC_019510. 479 YYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQSWSHWDFGDNVTVL 558 (799) Q Consensus 479 ~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~w~~~g~~~~~ 558 (799) +++|++++|+|+++|||+|++|||+++++.+++++++|.+++|+++++|+|++|+|||+++||+|+|||||+|+|.++++ T Consensus 473 ~~~~~~~~d~y~a~Dlt~~~~hl~~~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~~ 552 (794) T protein:vir:10 473 YYAVQDVSSVKNSEDITSHVPNYIPNGVFSICGSGTENFCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQVL 552 (794) T ss_pred EeeeccccCceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceEEEeEEEEEcCCcEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeCCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEccccccccccccccccccccCccccccceE Q lcl|NC_019510. 559 AANSIGSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVDLNAVFGGMRWQVGKI 638 (799) Q Consensus 559 ~~~~~~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~ 638 (799) |+.+.+|+||++|+|+++++++|+.+.++..+...+++++||||+.++.++++.++...+.+....... .|+.|++|++ T Consensus 553 ~~~~~~d~l~~iv~r~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~-~g~~~~eg~~ 631 (794) T protein:vir:10 553 ACQSISSDMYVILRNEFNTFLARISFTKNAIDLQGEPYRAFMDMKIRYTIPSGTYNDDTFTTSIHIPTI-YGANFGRGKI 631 (794) T ss_pred EEEecCCeEEEEEEeCCCEEEEEEEEeecCCCCCCccceeeeecceEEEecCcccccccccceEEcccc-cCcccccccE Confidence 999889999999999999999999999999999999999999999999999999888888777755544 5899999999 Q ss_pred EEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccceeeeecceEEEEEEEEEee Q lcl|NC_019510. 639 LVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGRLQHRRAWLNYE 718 (799) Q Consensus 639 v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl~l~~~~~~~~ 718 (799) +.+.+||......++..++...+++++++++++++|+|||+|+++++|+||++++++|+++++....|||||||+++++. T Consensus 632 v~~~adg~~~~~~~~~~~~~g~~~l~i~~~~~a~~v~vGl~y~s~~~~~~~~i~~~~~~~~~~~~~~gr~~l~r~~~~~~ 711 (794) T protein:vir:10 632 TVLEPDGKITVFEQPTSGWQSDPWLRLSGNLEGREVFIGFNINFVYEFSKFLIKQTTDDGSTSTEDIGRLQLRRAWVNYE 711 (794) T ss_pred EEEecCCceeeeeeeeeeeecceEEEecCCCCCceEEEeeeeeEEEEecceEEEccCCCcceeeeccccEEEEEEEEEee Confidence 99999999999888888888888999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceEEEEecCCCcccceeccccccCcc--cccccccccceEEEEEeecccceEEEEEECCCCcEEEEEEEEEEEEeccc Q lcl|NC_019510. 719 QSGAFYVDVTNLGRSYRYTMSGKPLGDT--TLGQANLESGQFRFPLAGNAQYNRVVLTSDYTTPLSIIGCGWEGNYIRRS 796 (799) Q Consensus 719 ~t~~~~~~v~~~~~~~~~~~~~~~~~~~--~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~~eg~y~~r~ 796 (799) +||+|.+.|++.++++.+.+.+.+++.+ ..+++|+.+|+++||+.+|+++.+|+|+|++|+||+|+||+|||+||+|+ T Consensus 712 ~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~eg~y~~r~ 791 (794) T protein:vir:10 712 DSGTFDIYVENQSSNWKYTMAGARLGSNTLRAGRLNLGTGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEGNYLRRS 791 (794) T ss_pred ccccEEEEEcCCccccceeeccceeccccccccccccccceEEEEecccCceEEEEEEECCCCceEEEEEEEEEEEeccc Confidence 9999999999988888788889888854 45788999999999999999999999999999999999999999999999 Q ss_pred cCC Q lcl|NC_019510. 797 TGI 799 (799) Q Consensus 797 rrv 799 (799) ||| T Consensus 792 ~~v 794 (794) T protein:vir:10 792 SGI 794 (794) T ss_pred cCC Confidence 999 No 3 >protein:vir:1543 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052111;swissprot:trembl:q9t105;genbank:gi:9634037;uniprot:Q9T105;genbank:GeneID:1262408 Probab=100.00 E-value=4.2e-246 Score=1365.84 Aligned_cols=793 Identities=65% Similarity=1.102 Sum_probs=751.4 Q ss_pred CCceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCceEEEEE Q lcl|NC_019510. 1 MGLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYEQYYVV 80 (799) Q Consensus 1 M~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~~y~l~ 80 (799) ||+|+|+||||++|||||+|++||++||++|+||+|+|+|||+||||++||++++.++.....+++|+|+|+++|+|+|| T Consensus 1 M~~i~~s~~n~~~GvSqq~d~~r~~~q~~~~~N~~~~~~gGl~rRpGt~~va~~~~~~~~~~~~~~~~~~~~~~e~y~l~ 80 (801) T protein:vir:15 1 MALISQSIKNLKGGISQQPDILRFAEQGSVQINGWSSESEGLQKRPPMIHLKTLGPAGYVGAQPYVHLINRDEFEQYFVV 80 (801) T ss_pred CceeeeecchhhcceecCcchHhhhhhHhhhhcceeccccCcccCCchheeeeecCCCCcccceeEEEEEeCCceEEEEE Confidence 99999999999999999999999999999999999999999999999999999998888888899999999999999999 Q ss_pred EeCCeEEEEeCCCeEEEEEecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceEEEEecCCCC Q lcl|NC_019510. 81 FTGGDIKVFDLNGQEYAVRGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDALINVRGGQYG 160 (799) Q Consensus 81 ~~~g~irv~~~~G~~~~v~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~~~v~~~~y~ 160 (799) |++++||||+++|.+++++++.+|+.+++++++|+++|+||++||+|++++|+++.+..+...+++..+++++++.++|+ T Consensus 81 ~~~~~irv~~~~G~~~~v~~~~~y~~~~~~~~~l~~~~~aD~~fi~nr~~~~~~~~~~~~~~~~~~~~~alv~v~~~~yg 160 (801) T protein:vir:15 81 FTGEDIKVFDLDGKEYQVRGDRSYVRTANPREDLRMITVADYTFVTNRKVVVQSNDQSVNLPGFKDQGDALINVRGGQYG 160 (801) T ss_pred EcCCeEEEEccCCcEEEEecCCccccccCchhheeEEEEcCEEEEeeCCeeeecccCccccCccCCCCceEEEeeeccCc Confidence 99999999999999999999999999999999999999999999999999999999988777788888999999999999 Q ss_pred ceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccc------cccccceEEEECCcEEEEEecCCC Q lcl|NC_019510. 161 RTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRES------LAGNPGWTINVGTGFVNIIAPDGD 234 (799) Q Consensus 161 ~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~------~a~~~~~t~~~~g~~i~i~a~~~~ 234 (799) ++|++++++...+.+++|+++.. ....+.+++++++.|...+... ..+..+|++...++.+++.++++. T Consensus 161 ~t~~I~i~gs~~~~~t~~~gs~~-----~~~~~~s~~~ia~~l~~~~~~~~p~~~~~~~~~~w~~~~~~g~~~i~a~~~~ 235 (801) T protein:vir:15 161 RRLSIEFNGAERAAVQLPDGSQP-----AHVNEVDGQAIAEKLAAQLRNNLGNPNNDQDPNKWRFNVGPGFIHILAPNND 235 (801) T ss_pred eeEEEEeCCcceEEEEeccCccc-----chhhhcceeechHHHhhhhhhccCccceeccCccEEEEecCcEEEEeCCCCc Confidence 99999999988888899887643 3345566777777776655432 134568999999999999999998 Q ss_pred ceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccceeeccce Q lcl|NC_019510. 235 SIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVGLNNGTM 314 (799) Q Consensus 235 ~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~~t~ 314 (799) ....+++++|.+++.+.++.+.++++++||.++++|++|+|.+++++++++||++|+...++|+||++++...+++.+|| T Consensus 236 ~~~~~~t~dg~~~~~~~~~~~~v~~~~~lp~~~~~G~~v~v~~~~~~~~~~y~v~~~~~~~~w~E~a~~g~~~~~~~~tm 315 (801) T protein:vir:15 236 NVWGLQTKDGYADQLINPVTHYTQSFQKLPINAPDGYIVKIVGDTSKTADQYYVRFDLNRKVWVETIGWNTRTHLYYHTM 315 (801) T ss_pred ccceeeeccccCceeeeEEeecccceeeeeeecCCCcEEEEEecCCCccceEEEEEEcCCeeEEeecccccceeeecccc Confidence 88889999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEEeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEecCCeEEEEccCCccccccccccCCCCC Q lcl|NC_019510. 315 PWSLIRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLSGENIILSRTAKYFNMYPASVAVLSDD 394 (799) Q Consensus 315 p~~lv~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~Dd 394 (799) ||.|++.++++|+++.++|++|.+||+++||.|+|+|++|++|+||||||+|+++++|||||+||||||+++|++++.|| T Consensus 316 p~~lv~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~Dd 395 (801) T protein:vir:15 316 PWALVRASDGNFDFKVLEWGARTVGDDTTNPYPSFTGQTINDIFFFRNRLGFLSGENIILSRTSKYFNFFPASVSNYSDD 395 (801) T ss_pred ceEEEeeccceEEEeccccccccCCccccCCcccccCCCceEEEEEcceEEEeeCCeEEEEecCCccccccccccCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCce Q lcl|NC_019510. 395 DPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNASGVLSAKSVELNLTTEFDVNDGARPYGIGRGVYFASPRATFT 474 (799) Q Consensus 395 D~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~ 474 (799) |||+++++++++|+|+|+++++++|+|||+++||+|+++++|||+|+++.++|+|+|+++|+|+.+|++++|+|++|+|+ T Consensus 396 D~i~~~~~~~~~~~i~~~v~~~~~L~i~t~~~q~~ls~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~ 475 (801) T protein:vir:15 396 DPIDVAVSHNRVSTLKYAVPFSEELLLWSDQAQFVLTASGILSSRSVELNLTTQFDVQDRARPHGVGRNVYFASPRASFT 475 (801) T ss_pred ccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEcCCCcccceeEEEEEEEeeeccCCCCceEeCCeEEEEecCCCee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEeeEeeecCCC Q lcl|NC_019510. 475 SINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQSWSHWDFGDN 554 (799) Q Consensus 475 ~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~w~~~g~ 554 (799) +++|+|+|++++|+|+++|||+|++|||+++++++++++++|.+++|+++++|+|++|||||+++||+|+|||||+|+|+ T Consensus 476 ~~~r~~~~~~~~d~y~a~Dlt~~~~hl~~~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~ 555 (801) T protein:vir:15 476 SINRYYAVQDVSSVKNAEDMTAHVPNYIPNGVFSISGTTAENFAAILTSGAPNRVYIYKFLYIDEEIRQQSWSHWDFGDN 555 (801) T ss_pred EEEEEEeecccccceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCEEEEEEEecCCCceEEEeeEEEEcCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEEEeCCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEccccccccccccccccccccCccccc Q lcl|NC_019510. 555 VTVLAANSIGSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVDLNAVFGGMRWQ 634 (799) Q Consensus 555 ~~~~~~~~~~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 634 (799) ++++|+...+|+||++|+|+++.+++++.+..+..+...+++++||||+.++.+.++.+......++.. .....|+.|+ T Consensus 556 ~~~~~~~~~~d~l~~~v~r~~~~~~~r~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~t~~~~~~~~~~~-~~~~~gl~~l 634 (801) T protein:vir:15 556 VTVFAAQVINSTMTVLMGNEHAVWMGRLHFTKNSIDIPGEPYRLYIDAKRKYTIPAGTYNDDTYQTSIS-LATIYGMNFT 634 (801) T ss_pred EEEEEEEecCCEEEEEEEecCcEEEEEEEEccccccCCCcceeeeeeeeeeEeeccceeccCceecccc-cccccccccc Confidence 999999888999999999999999999999999999999999999999999999888887776665543 4556799999 Q ss_pred cceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccceeeeecceEEEEEEE Q lcl|NC_019510. 635 VGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGRLQHRRAW 714 (799) Q Consensus 635 ~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl~l~~~~ 714 (799) +|+++.+.+||.+++..++.+|+...+++++++++++++|+|||+|+++++|+||+++.++|.++.+....+||||||++ T Consensus 635 ~g~~v~v~~dG~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~~~~~~~~~~~~rl~l~r~~ 714 (801) T protein:vir:15 635 KGRVSVVFPDGKIIEVDQPINGWSSDPVLRLDGNQEGQVVYIGFNIPFTYTFSKFLIKKTAEDGSTATEDIGRLQLRRAW 714 (801) T ss_pred cceEEEEEeCCceeeeeeecCcccCcceEEEcCCCCCcEEEEeeeeeEEEEecceEEeccCCCCCceeeeeccEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999888888889999999999 Q ss_pred EEeeccceEEEEecCCCcccceeccccccCcc--cccccccccceEEEEEeecccceEEEEEECCCCcEEEEEEEEEEEE Q lcl|NC_019510. 715 LNYEQSGAFYVDVTNLGRSYRYTMSGKPLGDT--TLGQANLESGQFRFPLAGNAQYNRVVLTSDYTTPLSIIGCGWEGNY 792 (799) Q Consensus 715 ~~~~~t~~~~~~v~~~~~~~~~~~~~~~~~~~--~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~~eg~y 792 (799) |++.+||.|.+.|++.++++.+.+.+.+++.+ .++.|++.+|+++||+.+|+++.+|+|+|++|+||+|+||+|||+| T Consensus 715 ~~~~~tg~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~e~~y 794 (801) T protein:vir:15 715 VNYEDSGAFTIRVNNLSREFIYTMAGARLGSDNLRVGRSNIGTGQYRFPVVGNAQTNLVTIESDASTPLNIIGCGWEGNY 794 (801) T ss_pred EEeccCcceEEEECCcccccceeecCcccccccccccccccccceEEEEEeecCceEEEEEEECCCCcEEEEEEEEEEEE Confidence 99999999999999999998888899999853 5678899999999999999999999999999999999999999999 Q ss_pred eccccCC Q lcl|NC_019510. 793 IRRSTGI 799 (799) Q Consensus 793 ~~r~rrv 799 (799) |+|+||| T Consensus 795 ~~r~~~~ 801 (801) T protein:vir:15 795 LRRSSGI 801 (801) T ss_pred eccccCC Confidence 9999999 No 4 >protein:vir:94713 Length: 785 # NCBI annotation: tail tube # Family: family:all:825 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338122;genbank:gi:77118200;genbank:GeneID:3707736 Probab=100.00 E-value=1.2e-245 Score=1363.26 Aligned_cols=785 Identities=52% Similarity=0.902 Sum_probs=741.7 Q ss_pred CCceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCceEEEEE Q lcl|NC_019510. 1 MGLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYEQYYVV 80 (799) Q Consensus 1 M~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~~y~l~ 80 (799) ||+|+|+||||++|||||+|++||++||++|+||+|+|+||++||||++||++++.. .....++|+|+|+++|+|+|| T Consensus 1 M~~~~~s~~n~~~GvSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~~v~~l~~~--~~~~~~~~~f~~~~~~~y~l~ 78 (785) T protein:vir:94 1 MPLITQSIKNLKGGISQQPDILRFSDQGEAQVNCWSSESDGLQKRPPTVFKRRLNID--VGSNPKFHLINRDEQEQYYIV 78 (785) T ss_pred CcceeeecchhhcceecCCchHHhhhHHhhhhcceeeeccCcccCChhHhhhcccCC--CCcCcEEEEEEeCCCceEEEE Confidence 999999999999999999999999999999999999999999999999999988644 356778999999999999999 Q ss_pred EeCCeEEEEeCCCeEEEEEecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceEEEEecCCCC Q lcl|NC_019510. 81 FTGGDIKVFDLNGQEYAVRGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDALINVRGGQYG 160 (799) Q Consensus 81 ~~~g~irv~~~~G~~~~v~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~~~v~~~~y~ 160 (799) |++++||||+++|.++.+++..+|+.+.+++++|+|+|+||+|||+|++++|+++.+.++. .+++.+++++.++.++|+ T Consensus 79 ~~~~~irv~~~~G~~~~v~~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~~~~~~~~~~~-~~~~~~~~~~~i~~g~y~ 157 (785) T protein:vir:94 79 FNGSNIQIVDLSGNQYSVSGSVDYVKSSNPRDDIRVVTVADYTFVVNRKVVVKGGSEKSHS-GYNRKARALINLRGGQYG 157 (785) T ss_pred EcCCeEEEEecCCcEEEEecCCCceeecCchhheeeEeeCCEEEEEcCCcceeeeeccCCc-CCCCCCceEEEecccccc Confidence 9999999999999999999999999889999999999999999999999999999887764 366788999999999999 Q ss_pred ceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEecCCCceeEEE Q lcl|NC_019510. 161 RTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAPDGDSIRGLQ 240 (799) Q Consensus 161 ~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~~~~~~~~~~ 240 (799) ++|++.+++...+++++++++... .+....+.+++..++..++. +...+|++...+++++++++++...++++ T Consensus 158 ~~y~i~i~g~~~at~~t~~~s~a~----~s~~~~s~~~i~~~l~~~l~---a~~t~~t~~~~g~~i~i~a~s~t~~~~~s 230 (785) T protein:vir:94 158 RTLKVGINGGVKVSHKLPAGNDAE----NDPPKVDAQAIGAALRDLLV---TAYPTFTFDLGSGFLLITAPSGTDINSVE 230 (785) T ss_pred eeEEEeeCCcceeEEEEccCcccc----ccccccchHHHHHHHHHHhh---ccccceeEEecCcEEEEEecCCcccccee Confidence 999999999999999998876532 22344556677777776654 35678999999999999999999899999 Q ss_pred EEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccceeeccceeeEEEe Q lcl|NC_019510. 241 TKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVGLNNGTMPWSLIR 320 (799) Q Consensus 241 ~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~~t~p~~lv~ 320 (799) +.+|.+++.+.++++.++++++||.+|++|++++|.+++++++++||++|+...++|+||+++++..+++..+||+.|++ T Consensus 231 ~~~~~~~t~~~~~~~~~~~~~~Lp~~~~~G~~v~v~~~~~~~~~~y~v~~~~~~g~w~e~~~~g~~~~~~~~tmp~~l~~ 310 (785) T protein:vir:94 231 TEDGYANQLISPVLDTVQTISKLPLAAPNGYIIKIQGETNSSADEYYVMYDSNTKTWKETVEPGVVTGFDNTTMPHALVR 310 (785) T ss_pred eecccCCeEEEEEEeeccceeccccccCCCCEEEEEccCCCCccceEEEEEcCCceEEEecccceeeeeeccccceEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEE Q lcl|NC_019510. 321 AADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLSGENIILSRTAKYFNMYPASVAVLSDDDPIDVA 400 (799) Q Consensus 321 ~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~ 400 (799) .++++|+++..+|++|.+||+++||.|||+|++|++|+||||||+|+++++|||||+||||||+++|++++.|||||+++ T Consensus 311 ~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~ 390 (785) T protein:vir:94 311 QSDGSFEFKALDWSKRGAGNDDTNPMPSFVDATINDVFFYRNRLGFLSGENVIMSRSASYFAFFPKSVATLSDDDPIDVA 390 (785) T ss_pred ccCCceEEeccccccccCCCcccCCcceecccccceEEEEeceEEEecCCeEEEEccCCcccCccccccCCCCCccEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EcCCcceeeeeeeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEE Q lcl|NC_019510. 401 VSHNRVSILKYAVPFSEELLLWADEAQFVLNASGVLSAKSVELNLTTEFDVNDGARPYGIGRGVYFASPRATFTSINRYY 480 (799) Q Consensus 401 ~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~ 480 (799) ++++|+|+|+|+++++++|+|||+++||+|+++++|||+|++++++|+|+|+++++|+.+|++++|++++|+|++++|++ T Consensus 391 ~~~~~~~~i~~~v~~~~~L~l~T~~~e~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~v~r~~ 470 (785) T protein:vir:94 391 VSHPRISILKYAVPFSEQLLLWSDEVQFVMTSSGVLTSKSIQLDVGSEFALGDNARPFAVGRSVFFSAPRGSFTSIKRYF 470 (785) T ss_pred ecCCcceeeEEEeecCCcEEEEecCcEEEEcCCCcccceeEEEEEEEeeeccCCCCceEeCCeEEEEecCCCeeEEEeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEeeEeeecCCCeEEEEE Q lcl|NC_019510. 481 AVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQSWSHWDFGDNVTVLAA 560 (799) Q Consensus 481 ~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~w~~~g~~~~~~~ 560 (799) +|++++|+|+++|||+|++|||+|+++++++++++|++++|++++||+|++|||||+++||+|+|||||+|+|.+.++|+ T Consensus 471 ~~~~~~d~y~~~dlt~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~g~l~~~~y~~~~~e~~v~aW~r~~~~~~~~~~~~ 550 (785) T protein:vir:94 471 AVADVSDVKDADDTTGHVLSYIPNGVFDIQGTGTENYICVNSTGAYNRIYIYKFLFKDSVQLQASWSHWEFPKDDKILAS 550 (785) T ss_pred eecccccceehhhHHHHHHHhcCCCcEEEEEecCCCcEEEEEEcCCCEEEEEEEeecCCceEEEEEEEEEeCCCeEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998888888 Q ss_pred EEeCCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEccccccccccccccccccccCccccccceEEE Q lcl|NC_019510. 561 NSIGSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVDLNAVFGGMRWQVGKILV 640 (799) Q Consensus 561 ~~~~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~ 640 (799) ...+|++|++++|.++.+++++.+..+..|...+++++||||+.++.++.+.++.+.+.++..+...+.++.|++|+++. T Consensus 551 ~~~~d~~~~vv~r~~g~~~~~ie~~~~~~d~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~~g~~~leg~~v~ 630 (785) T protein:vir:94 551 ASIGSTMFIVRQHQGGVDIEHLKFIKEATDFPSEPYRLHVDSKVSMVIPIGSFNADTYKTTVDIGAAYGGNAPSPGRYYL 630 (785) T ss_pred EEeCCEEEEEEEcCCCEEEEEEEeecccCCCCCcceeEEeeeeeEEEecCcceeccccccccccccccccCCccCCeEEE Confidence 88899999999999888999998888999999999999999999999999999999999999999999999999999999 Q ss_pred EecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccceeeeecceEEEEEEEEEeecc Q lcl|NC_019510. 641 SDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGRLQHRRAWLNYEQS 720 (799) Q Consensus 641 ~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl~l~~~~~~~~~t 720 (799) +++||..++..++..+. .++++++|+++++|+|||+|+++++|+||++++++|+.. .+..+||+||||++++|.+| T Consensus 631 v~adG~~~~~~~v~~~~---~tl~~~g~~~~~~v~vGl~y~~~~~~~~~~~~~~~~~~~-~~~~~gr~~l~r~~~~~~~s 706 (785) T protein:vir:94 631 IDSQGAYLDLGELTSIS---TVITLNGDWSGRTVFIGRSYLMSYKFSRFLIKIEDDSGT-QSEDTGRLQLRRAWVNYRDT 706 (785) T ss_pred EeeCCcCccCceEcCCC---cEEEecCCCCCceEEEeeeeeEEEeecceeEEecCCCcc-cccccccEEEEEEEEEeecc Confidence 99999999888776654 467899999999999999999999999999999997664 45667999999999999999 Q ss_pred ceEEEEecCCCcccceeccccccCcccccccccccceEEEEEeecccceEEEEEECCCCcEEEEEEEEEEEEeccccCC Q lcl|NC_019510. 721 GAFYVDVTNLGRSYRYTMSGKPLGDTTLGQANLESGQFRFPLAGNAQYNRVVLTSDYTTPLSIIGCGWEGNYIRRSTGI 799 (799) Q Consensus 721 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~~eg~y~~r~rrv 799 (799) ++|++++++..+++.+.+.++++|++.++.+|+++|+++||+.+|+++.+|+|+|++|+||+|+||+|||+||+|+||| T Consensus 707 g~~~v~v~~~~~~~~~~~~~~~~g~~~~~~~~~~tg~~~vp~~g~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r~~~v 785 (785) T protein:vir:94 707 GALRLIVRNGEREFVNTFNGYTLGQQTIGTTNIGDGQYRFAMNGNALTTSLTLESDYPTPVSIVGCGWEASYAKKARSV 785 (785) T ss_pred cceEEEecCCCccceeeecCcccCcccccccccccceEEEEeecccceEEEEEEECCCCceEEEEEEEEEEEeccccCC Confidence 9999999988888888899999999999999999999999999999999999999999999999999999999999999 No 5 >protein:vir:99677 Length: 794 # NCBI annotation: Tail tubular protein B # Family: family:all:825 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249591;genbank:gi:68299742;genbank:GeneID:3799992 Probab=100.00 E-value=6.7e-245 Score=1359.23 Aligned_cols=788 Identities=51% Similarity=0.897 Sum_probs=741.9 Q ss_pred CCceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCceEEEEE Q lcl|NC_019510. 1 MGLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYEQYYVV 80 (799) Q Consensus 1 M~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~~y~l~ 80 (799) ||+|+|+||||++|||||+|++||++||++|+||+++|+||++||||++||+++++++.++++++||+|+|+++|+|+|+ T Consensus 1 M~~i~~s~~n~~~GvS~q~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (794) T protein:vir:99 1 MALISQSIKNLKGGISQQPDILRYSDQGSKQINGFSSEVEGLQKRPPSVHIKRLTDQFGLGQKPYCHIINRDEVERYAVF 80 (794) T ss_pred CceeeeecchhhcceecCCchHHhhhhHhhhhcceeeeccCcccCCccceeeeecCCCCCccccEEEEEEeCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999998888888999999999999999999 Q ss_pred EeCCeEEEEeC-CCeEEEEEec--CccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceEEEEecC Q lcl|NC_019510. 81 FTGGDIKVFDL-NGQEYAVRGD--KSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDALINVRGG 157 (799) Q Consensus 81 ~~~g~irv~~~-~G~~~~v~~~--~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~~~v~~~ 157 (799) |++++||||+. +|.++.|..+ .+|+++++++++|+|+|+||+|||+|++++|+++.+..+...+++..+++++++.+ T Consensus 81 f~~~~irv~~~~~g~~~~v~~~~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~~~v~~g 160 (794) T protein:vir:99 81 FTGSNIRVFDLFTGDEKTVNAPNGLSYVSSSNPRKDLRMVTVADYTFILNRNVATAQGTTNTPSGLAPFGHFGLVVIRGG 160 (794) T ss_pred EcCCeEEEEECCCCeEEEeeccccccccccCCccceeeEEEEccEEEEEcCCeeeeEeeeeccccCcCCCceEEEEeccC Confidence 99999999986 6888877544 67888999999999999999999999999999998877777888899999999999 Q ss_pred CCCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEecCCCcee Q lcl|NC_019510. 158 QYGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAPDGDSIR 237 (799) Q Consensus 158 ~y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~~~~~~~ 237 (799) +|+++|+++++++.++++.+|+++... .....++++++.++...+. ..+|++...++.+++.++.+.... T Consensus 161 ~y~~~y~v~i~gs~ta~~~tp~~~~~~-----~~~~~s~~~ia~~l~~~l~-----~~g~~v~~~~g~~~i~~~~~~~v~ 230 (794) T protein:vir:99 161 QYGRTYRIKVNGSVEASFETPLGDQVA-----HAKQIDIAYIIDQLAAGLI-----NKGWAVTKGSGYFYFSKSGSVIIN 230 (794) T ss_pred CCCceEEEEecCCcccceeeccCcccc-----cccccchhhhhhhhHhhhh-----cccceEEeCCeEEEEEecCCceeE Confidence 999999999999999999999876543 2344567788888776553 357899999999999999999999 Q ss_pred EEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccceeeccceeeE Q lcl|NC_019510. 238 GLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVGLNNGTMPWS 317 (799) Q Consensus 238 ~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~~t~p~~ 317 (799) ++++++|.+++.+..+++.++++++||+.|++|++|+|.+..++++++||++|+...++|+||+++++..+++.+|||+. T Consensus 231 t~s~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~~~~~~~~y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~ 310 (794) T protein:vir:99 231 SLEVEDGYNGQLAWGIINDVQKTTQLPVYAPNNYIIRVSGDPTLNQDDYYVRFDASRNVWTECPAPNIKADYNKATMPHV 310 (794) T ss_pred EEEeecCCCCceeeEEeeeccceeecccCCCCCeEEEEeccCCCCCCceEEEEEcCCceEEeeccceeecceeccceEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccE Q lcl|NC_019510. 318 LIRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLSGENIILSRTAKYFNMYPASVAVLSDDDPI 397 (799) Q Consensus 318 lv~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i 397 (799) ++++++++|++++.+|++|.+||+++||.|||+|++|++|+||||||+|+++++|||||+||||||+++|++++.||||| T Consensus 311 ~v~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~is~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I 390 (794) T protein:vir:99 311 LIREADGTFTFKQADWTHRAAGDDETNPYPSFIGNSINDIFFFRNRLGFLSGENVILSGSGNYFNFFPESVAVLTDTDPI 390 (794) T ss_pred EeccCCCceeEeeccccccccCCcccCCCccccCcceeEEEEEeeeEEEecCCeEEEEecCCccccccccccCCCCCccE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEE Q lcl|NC_019510. 398 DVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNASGVLSAKSVELNLTTEFDVNDGARPYGIGRGVYFASPRATFTSIN 477 (799) Q Consensus 398 ~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~ 477 (799) +++++++++|+|+|+++++++|+|||+++||+|+|+++|||+|+++.++|+|+|+++|+|+.+|++++|+|++|+|++++ T Consensus 391 ~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~v~ 470 (794) T protein:vir:99 391 DVAVSTNRISILKYAVPFSEELILWSDQAQFVLSSDGGLTPTTIRLDLTTEFEVTEQARPYGIGRGVYFVSPRAKFSSVR 470 (794) T ss_pred EEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEEeeccCCCCceEeCCeEEEEecCCCeeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEeeEeeecCCCeEE Q lcl|NC_019510. 478 RYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQSWSHWDFGDNVTV 557 (799) Q Consensus 478 r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~w~~~g~~~~ 557 (799) |+|+|++++|+|+++|||+|++|||+|+++++++++++|++++|++++||+|++||||++++||+|+|||||+|+|++++ T Consensus 471 r~~~~~~~~d~y~a~Dlt~~~~hl~~~~~~~~~a~~~~~~~~v~~~~~~g~l~~~~y~~~~~eq~v~aW~~~~~~g~~~~ 550 (794) T protein:vir:99 471 RFYAVQDVTQVKNAEDISAHVPYYVENGVFKMSGSSTENFLTILTEGNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRV 550 (794) T ss_pred EeeeeccccCceehhhHHHHHHHhcCCCeEEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceEEEeEEEEEcCCCeEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeCCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEccccccccccccccccccccCccccccce Q lcl|NC_019510. 558 LAANSIGSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVDLNAVFGGMRWQVGK 637 (799) Q Consensus 558 ~~~~~~~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~ 637 (799) +|+.+.+|+||++|+|+++.+++||.+.++..+...++++++|||+.++..+++.++.+.+.+...... ..|+.|++|+ T Consensus 551 ~~~~~~~d~l~~~v~r~~~~~ler~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~l~g~ 629 (794) T protein:vir:99 551 LCCDMIGAVMHLIIDSPSGVLMEKIEFTQNTKDYPDEPYRLYVDRKIEYTFPEGSYNDDDFKTRVKLKD-IYGSTPANGQ 629 (794) T ss_pred EEEEEcCCEEEEEEEeCCCEEEEEEEeeeCCCCCCCcccceeeeeeeeeeecccccccCcceeEEeccc-cccccccCCc Confidence 999999999999999999999999999999999999999999999999999999988887777765544 5788999999 Q ss_pred EEEEecCCcccccccccccee-cCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccceeeeecceEEEEEEEEE Q lcl|NC_019510. 638 ILVSDEVGEVRQYEPPAGGWA-SDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGRLQHRRAWLN 716 (799) Q Consensus 638 ~v~~~adG~~~~~~~v~~g~~-~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl~l~~~~~~ 716 (799) ++.+.+||.+.....+..+.. ....+++++++++++|+|||+|+++++|+||++++++++++..+...|||+|||++|+ T Consensus 630 ~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~vGl~y~s~~~~~~~~~~~~~~~g~~~~~~~gr~~l~r~~~~ 709 (794) T protein:vir:99 630 YVFISLGGVTFTFDPPAGGWQANDGLIEFDGDLRGTKFFVGEAYTFLYEFSKFLIKTTDTADGVATEDIGRLQLRRAWVN 709 (794) T ss_pred eEEEEeCCceeeeecccceEecCccEEEecCCCCCcEEEEeeeeeEEEeecceEEeecCCCCceeeeccceEEEEEEEEE Confidence 999999999877666544432 4467889999999999999999999999999999999999999999999999999999 Q ss_pred eeccceEEEEecCCCcccceeccccccCc--ccccccccccceEEEEEeecccceEEEEEECCCCcEEEEEEEEEEEEec Q lcl|NC_019510. 717 YEQSGAFYVDVTNLGRSYRYTMSGKPLGD--TTLGQANLESGQFRFPLAGNAQYNRVVLTSDYTTPLSIIGCGWEGNYIR 794 (799) Q Consensus 717 ~~~t~~~~~~v~~~~~~~~~~~~~~~~~~--~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~~eg~y~~ 794 (799) +.+|++|++.+++.++++.+.+.+.+++. ..++.+|++||++++|+.+|+++.+|+|+|++|+||+|+||+|||+||+ T Consensus 710 ~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~e~~y~~ 789 (794) T protein:vir:99 710 YDKSGNFRVEVNNQGRTFTYNMTGNRLSTNELILGDESLDTGQFRYAVSGNATQVTVSLISDTPNPLSIIGGGWEGYYVR 789 (794) T ss_pred eecccceEEEECCCccceeeeccccccccccccccccccccceEEEEecccccceEEEEEECCCCCEEEEEEEEEEEEec Confidence 99999999999999998888888889874 5567899999999999999999999999999999999999999999999 Q ss_pred cccCC Q lcl|NC_019510. 795 RSTGI 799 (799) Q Consensus 795 r~rrv 799 (799) |+||| T Consensus 790 r~~~v 794 (794) T protein:vir:99 790 RSSGI 794 (794) T ss_pred cccCC Confidence 99999 No 6 >protein:vir:3366 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523337;swissprot:trembl:q8w5u3;genbank:gi:17570828;goa:Q8W5U3;uniprot:Q8W5U3;genbank:GeneID:927453 Probab=100.00 E-value=1.9e-243 Score=1351.25 Aligned_cols=793 Identities=64% Similarity=1.099 Sum_probs=748.1 Q ss_pred CCceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCceEEEEE Q lcl|NC_019510. 1 MGLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYEQYYVV 80 (799) Q Consensus 1 M~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~~y~l~ 80 (799) ||+|+|+||||+||||||+|++||++||++|+||+|+|+|||+||||++||++++.++.....+++++++|+++|+|+|+ T Consensus 1 M~~i~~~~~nl~~GvSqq~d~~r~~~q~~~~~N~~~~~~gG~~rRpGt~~va~~~~~~~~~~~~~~~~~~r~~~~~y~l~ 80 (801) T protein:vir:33 1 MALISQSIKNLKGGISQQPDILRFTEQGSVQINGWSSESEGIQKRPPMIHLKTLGTAGYVGAQPYVHLINRDEFEQYFVV 80 (801) T ss_pred CceeEeeccceecceeccchhHhhhhhHhhhhcceeecccCcccCchhHhhhhhcCCCccccceEEEEEEeCCceEEEEE Confidence 99999999999999999999999999999999999999999999999999999998877888999999999999999999 Q ss_pred EeCCeEEEEeCCCeEEEEEecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceEEEEecCCCC Q lcl|NC_019510. 81 FTGGDIKVFDLNGQEYAVRGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDALINVRGGQYG 160 (799) Q Consensus 81 ~~~g~irv~~~~G~~~~v~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~~~v~~~~y~ 160 (799) |++++||||+++|.+++++++.+|+.+++++++|+++|+||++||+|++++|++..+..+...+++.++++++++.++|+ T Consensus 81 ~~~~~irv~~~~G~~~~v~~~~~y~~~~~~~~~l~~~t~aD~~fi~nr~~~p~~~~~~~~~~~~~~~~~~li~v~~~~yg 160 (801) T protein:vir:33 81 FTGEDIKVFDLDGKEYQVRGDRSYVRTANPREDLRMVTVADYTFVTNRKVVVQSNDQSVNLPGFKDQGDALINVRGGQYG 160 (801) T ss_pred EcCCeEEEEccCCcEEEEecCCcceeecCcchheEEEEEcCEEEEeeCCeeecccCCcccccccCCCcceEEEEeecccc Confidence 99999999999999999999999999999999999999999999999999999988877767778888999999999999 Q ss_pred ceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccc------cccccceEEEECCcEEEEEecCCC Q lcl|NC_019510. 161 RTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRES------LAGNPGWTINVGTGFVNIIAPDGD 234 (799) Q Consensus 161 ~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~------~a~~~~~t~~~~g~~i~i~a~~~~ 234 (799) ++|++++++...+.+++++++.. ....+....+++..|..++... ..+..+|++...++.+++.++++. T Consensus 161 ~t~~I~i~gs~~~~~~~~~gs~~-----~~v~~~s~~~~A~~l~~~~~~~~~~~~~~~~~~~w~~~~~~g~~~i~~p~~~ 235 (801) T protein:vir:33 161 RRLSIEFNGAERAAVQLPDGSQP-----AHVNEVDGQAIAEKLAAQLRNNLGNPNNDQDPNKWRFNVGPGFIHILAPNND 235 (801) T ss_pred eEEEEEECCcceEEEEeeccccc-----cccccccchhhhhhhhhhhhccCccceeeecCceEEEEecCeEEEEecCCCc Confidence 99999999988888888887543 2344455666777666554432 234678999999999999999998 Q ss_pred ceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccceeeccce Q lcl|NC_019510. 235 SIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVGLNNGTM 314 (799) Q Consensus 235 ~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~~t~ 314 (799) ..+.+++.+|..++.+.++.+.++++++||.++++|++++|.++++++.++||++|+...++|+||++++...++++++| T Consensus 236 ~~~~itt~~g~~~~~~~~~~~~v~~~~~lp~~~~~g~~v~v~~~~~~~~~~y~v~~~~~~~~w~e~~~~g~~~~~~~~tm 315 (801) T protein:vir:33 236 NVWGLQTKDGYADQLINPVTHYTQSFQKLPINAPDGYIVKIVGDTSKTADQYYVRFDLNRKVWVETIGWNTRTHLHYHTM 315 (801) T ss_pred ccccccccCCccceeEEEEeecccceeeeeeecCCCcEEEEEecCCCcccceEEEEEcCCcEEEEeeccccceeeeeccc Confidence 88889999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEEeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEecCCeEEEEccCCccccccccccCCCCC Q lcl|NC_019510. 315 PWSLIRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLSGENIILSRTAKYFNMYPASVAVLSDD 394 (799) Q Consensus 315 p~~lv~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~Dd 394 (799) |+.|+++++++|+++.++|++|.+||+++||.|+|.|++|++|+||||||+|+++++|||||+||||||+++|++++.|| T Consensus 316 p~~l~~~~~~tf~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~Dd 395 (801) T protein:vir:33 316 PWALVRASDGNFDFKYLEWGARTVGDDTTNPYPSFTGQTINDIFFFRNRLGFLSGENIILSRTSKYFNFFPASVSNYSDD 395 (801) T ss_pred ceEEEEccCceEEecccCccccccCCccccCcccccCCCceEEEEEcceEEEeeCCeEEEEecCCccccccccccCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCce Q lcl|NC_019510. 395 DPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNASGVLSAKSVELNLTTEFDVNDGARPYGIGRGVYFASPRATFT 474 (799) Q Consensus 395 D~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~ 474 (799) |||+++++++++|+|+|+++++++|+|||+++||+|+++++|||+|+++.++|+|+|+++|+|+.+|++++|+|++|+|+ T Consensus 396 D~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~ 475 (801) T protein:vir:33 396 DPIDVAVSHDRVSTLKYAVPFSEELLLWSDQAQFVLTASDILSSRSVGLNLTTQFDVQDRARPHGVGRNVYFSSPRASFT 475 (801) T ss_pred ccEEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEeecccCCCCceEecCeEEEEecCCCee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEeeEeeecCCC Q lcl|NC_019510. 475 SINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQSWSHWDFGDN 554 (799) Q Consensus 475 ~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~w~~~g~ 554 (799) +++|+|+|++++|+|+++|||+|++|||+|++++++++++++..++|+++++|+|++|+||++++||+|+|||||+|+|+ T Consensus 476 ~v~r~~~~~~~~d~y~~~Dlt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~ 555 (801) T protein:vir:33 476 SINRYYAVQDVSSVKNAEDMTAHVPNYIPNGVFSISGTTAENFVAILTSGAPNRVYIYKFLYIDEEIRQQSWSHWDFGDN 555 (801) T ss_pred EEEEEEeecccccceehhhHHHHHHHhcCCceEEEEEcCCCCeEEEEEecCCCEEEEEEEecCCCceEEEeeEEEEcCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEEEEeCCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEccccccccccccccccccccCccccc Q lcl|NC_019510. 555 VTVLAANSIGSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVDLNAVFGGMRWQ 634 (799) Q Consensus 555 ~~~~~~~~~~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 634 (799) ++++|+...+|+||++|+|+++.++++|.+..+.++...+++++||||+.++.+.++.++.+...++...... .|+.|+ T Consensus 556 ~~~~~~~~~~d~l~~vv~r~~~~~le~~~~~~~~~d~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~-~gl~~~ 634 (801) T protein:vir:33 556 VTVFAAQVINSTMTVLMSNEHAVWMGRLHFTKDSIDLPGEPYRLYIDAKRKYTIPAGTYNDDTYQTSISLSTI-YGMNFT 634 (801) T ss_pred EEEEEEecCCCEEEEEEEcCCcEEEEEEEEeeccccCCCccceEEeecceEEEecccceecCccccccccccc-cCCccc Confidence 9999988889999999999999999999999999999999999999999999999999988877776654443 489999 Q ss_pred cceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccceeeeecceEEEEEEE Q lcl|NC_019510. 635 VGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGRLQHRRAW 714 (799) Q Consensus 635 ~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl~l~~~~ 714 (799) ||+++.+++||.+++..++..++....++++++++++++|+|||+|+++++|+||+++.+++.++......+|+||||++ T Consensus 635 eg~~v~~~~dG~v~~~~~~~~~~~~~~~l~i~~~~~~~~v~vGl~y~s~~~~~~~~~~~~~~~~~~~~~~~~r~~l~r~~ 714 (801) T protein:vir:33 635 KGRVSVVFPDGKIVEIDQPINGWSSDPMLRLDGNQEGQVVYIGFNIPFTYTFSKFLIKKTAEDGSTATEDIGRLQLRRAW 714 (801) T ss_pred cceEEEEEeCCceEeeeeccccccCceeEEecCCCCCCEEEEeeeeeEEEEeCceEEeccCCCCceeeeeeccEEEEEEE Confidence 99999999999999888888888888999999999999999999999999999999999999888888889999999999 Q ss_pred EEeeccceEEEEecCCCcccceeccccccCcc--cccccccccceEEEEEeecccceEEEEEECCCCcEEEEEEEEEEEE Q lcl|NC_019510. 715 LNYEQSGAFYVDVTNLGRSYRYTMSGKPLGDT--TLGQANLESGQFRFPLAGNAQYNRVVLTSDYTTPLSIIGCGWEGNY 792 (799) Q Consensus 715 ~~~~~t~~~~~~v~~~~~~~~~~~~~~~~~~~--~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~~eg~y 792 (799) +++.+|++|++.|++.++++.+.+.+++++.. .++.+++++|++++|+.+|+++.+|+|+|++|+||+|+||+|||+| T Consensus 715 ~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvl~i~~eg~y 794 (801) T protein:vir:33 715 VNYEDSGAFIIRVNNLSREFIYTMAGARLGSDNLRVGGSNIGTGQYRFPVVGNAQTNTVTIESDASTPLNIIGCGWEGNY 794 (801) T ss_pred EEeecCcceEEEECCcccceeeeecccccccccccccccccccceEEEEeeccCceEEEEEEeCCCCCEEEEEEEEEEEE Confidence 99999999999999999998888999999854 5678999999999999999999999999999999999999999999 Q ss_pred eccccCC Q lcl|NC_019510. 793 IRRSTGI 799 (799) Q Consensus 793 ~~r~rrv 799 (799) |+|+||| T Consensus 795 ~~r~~~~ 801 (801) T protein:vir:33 795 LRRSSGI 801 (801) T ss_pred eccccCC Confidence 9999999 No 7 >protein:vir:94583 Length: 792 # NCBI annotation: Tubular tail protein B # Family: family:all:825 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919014;genbank:gi:119637778;genbank:GeneID:5179343 Probab=100.00 E-value=1.3e-242 Score=1346.60 Aligned_cols=789 Identities=60% Similarity=1.031 Sum_probs=736.1 Q ss_pred CCceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCceEEEEE Q lcl|NC_019510. 1 MGLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYEQYYVV 80 (799) Q Consensus 1 M~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~~y~l~ 80 (799) ||+|+|+||||++|||||+|++||++||++|+||+|+|+||++||||++||+++++.+.+.++.+|++|+|+++|+|+|+ T Consensus 1 M~~i~~s~~n~~~GiSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~~~~~~~q~y~l~ 80 (792) T protein:vir:94 1 MALISQSVKNLKGGISQQPNILRFPEQGSEQINGWSSETEGLQKRPPFVFTKTIGDQNALGAKPLVHLINRDSAEQYYVV 80 (792) T ss_pred CcceeeecchhhcceecCcchHHhhhhhhhhhcceeeeccccccCChhHHHHhhhcCCCCCcccEEEEEEeCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999998888899999999999999999999 Q ss_pred EeCCeEEEEeCCCeEEEEEecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceEEEEecCCCC Q lcl|NC_019510. 81 FTGGDIKVFDLNGQEYAVRGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDALINVRGGQYG 160 (799) Q Consensus 81 ~~~g~irv~~~~G~~~~v~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~~~v~~~~y~ 160 (799) |++++||||+.+|.++.++++.+|+.+.++.++|+|+|+||++||+|++++|+++.|..+..+ +.+++++++++++|+ T Consensus 81 f~~~~~rv~~~~g~~~~~~~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~~~~~~~~~~~~~--~~~~~~v~i~~g~y~ 158 (792) T protein:vir:94 81 FTGQGVRVFDLNGKEYDVKGDLSYVKVENPRDDLRMVTVADYTFIVNRNMVVRPDTTPLYTLK--ENGDCLINIRGGMYG 158 (792) T ss_pred EcCCeEEEEecCCceEEecccCceeeecCCcceeEEEEEcCEEEEEeCCccceeEecCcCCCC--CCceEEEEccCCCcc Confidence 999999999999999999999999999999999999999999999999999999998776543 567899999999999 Q ss_pred ceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEecCCCceeEEE Q lcl|NC_019510. 161 RTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAPDGDSIRGLQ 240 (799) Q Consensus 161 ~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~~~~~~~~~~ 240 (799) ++|++++++.. +++.+++++.. .......+++++.+|...... ..+..+|++.+.+++++|+++++..+.+++ T Consensus 159 ~~y~i~i~~~~-~~~~~~~~t~~-----~~~~~~~~~~i~~~l~~~~~~-~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 231 (792) T protein:vir:94 159 RTLAFTINNTK-IAYEIAHGDAP-----EHSKQTDAQWLVKKLAGLARL-NVAFKGWTFTEGPGYIHVIAPSNSQINSLS 231 (792) T ss_pred eeEEEEecCce-eeeeeecCccc-----ceecccchhhhhhhhhhhccc-cccccccEEEECCeEEEEEecCCceeeeee Confidence 99999998764 45667766543 233455678888887665443 344678999999999999999998888999 Q ss_pred EEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccceeeccceeeEEEe Q lcl|NC_019510. 241 TKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVGLNNGTMPWSLIR 320 (799) Q Consensus 241 ~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~~t~p~~lv~ 320 (799) +.+|..++.+..+++.++++++||+.|++|++++|++..+++.+.||++|+...++|+||+++++..++++++||+.+++ T Consensus 232 ~~~g~~~~~~~~~~~~v~~~~~lp~~~~~G~~v~i~~~~~~~~d~y~v~~~~~~~~w~E~~~~~~~~~~~~~tmp~~lv~ 311 (792) T protein:vir:94 232 TEDGYADQLMNAVMHTSQSFSRLPVEAPNGYTVKIVGDTSKTSDMFYVQYDNMKKVWKEVAGWGVQKGLNGGTMPHALVR 311 (792) T ss_pred cccCcCcceeeeeeecccccccccccCCCCcEEEEEccCCCCccceEEEEEcCCceEEEecccceeeeecccccCeeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEE Q lcl|NC_019510. 321 AADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLSGENIILSRTAKYFNMYPASVAVLSDDDPIDVA 400 (799) Q Consensus 321 ~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~ 400 (799) +++++|+++.++|++|.+||+++||.|||+|+.|++|+||||||+|+++++|||||+||||||+++|++++.|||||+++ T Consensus 312 ~~~~~~~~~~~~w~~r~~gd~~tnp~psf~g~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~ 391 (792) T protein:vir:94 312 QADGSFQMQVLPWTQRTCGDMDTNPTPSIVDQKINDVFFFRNRLGFLAGENIVMSRTSKYFSLFPASVANLSDDDPIDVA 391 (792) T ss_pred cCCCcEEEEeccccccccCccccCccceeccCCcceEEEEcceEEEecCCeEEEEccCCcccCccccccCCCCCccEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EcCCcceeeeeeeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEE Q lcl|NC_019510. 401 VSHNRVSILKYAVPFSEELLLWADEAQFVLNASGVLSAKSVELNLTTEFDVNDGARPYGIGRGVYFASPRATFTSINRYY 480 (799) Q Consensus 401 ~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~ 480 (799) ++++++|+|+|+++++++|+|||+++||+|+|+++|||+|+++.++|+|+|+++|+|+.+|++++|++++|+|++++|+| T Consensus 392 ~ss~~~~~i~~~v~~~~~L~l~T~~~q~~l~~~~~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~~~~v~r~~ 471 (792) T protein:vir:94 392 VSHNRISILKYAVPFSEELLLWSDQAQFVLSAQGILSPKSVELNLTTEFDVSDRARPFGVGRGVYFASPRASYTSLNRYY 471 (792) T ss_pred ecCCcceeeeEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEEeeccCCCCceEeCCeEEEeecCCCeeEEEeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEeeEeeecCCCeEEEEE Q lcl|NC_019510. 481 AVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQSWSHWDFGDNVTVLAA 560 (799) Q Consensus 481 ~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~w~~~g~~~~~~~ 560 (799) +|++++|+|+++|||+|++|||+|+++.+++++++|++++|++++||+|++|||||+++||+|+|||||+++|+++++|+ T Consensus 472 ~~~~~~d~y~a~DlT~~~~hl~~~~v~~~~a~~~~~~~vv~~~~~~g~l~~~ty~~~~~e~~v~aW~~~~~~g~~~~~~~ 551 (792) T protein:vir:94 472 AVQDVSSVKSAEDMSAHVPNYIPNGVFSIRGSSTENFISVLSSNAPSRIFLYKFLYLNEEIAQQSWSHWELGSNVTVLAC 551 (792) T ss_pred eeccccCceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCeEEEEEEeecCCceEEEeEEEEEcCCcEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeCCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEccccccccccccccccccccCccccccceEEE Q lcl|NC_019510. 561 NSIGSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVDLNAVFGGMRWQVGKILV 640 (799) Q Consensus 561 ~~~~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~ 640 (799) .+.+|+||++|+|+++++++||.+.++..+...++++++|||+.++...++.+......+...+.. ..++.|++|+++. T Consensus 552 ~~~~D~l~~~v~r~~~~~~~r~~~~~~~~d~~~~~~~~~lD~~~~~~~~~~~~~~~~~~T~~~~~~-~~gl~~l~G~~v~ 630 (792) T protein:vir:94 552 DSIGSTMYLVLRNQSHTWMCRAHFTKNSIDFPDEPYRLYIDNKVKYVIPEGSYNDDTYATTVKPVD-VYGMKYWTGKFYI 630 (792) T ss_pred eecCCEEEEEEEeCCCEEEEEEEEeecccccCCCcceeeeeeeeeEEecCcceecCceeeeecccc-ccCcccccCcEEE Confidence 888999999999999999999999999999888999999999999999988887777666655544 4699999999999 Q ss_pred EecCCccccccccccce-ecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccceeeeecceEEEEEEEEEeec Q lcl|NC_019510. 641 SDEVGEVRQYEPPAGGW-ASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGRLQHRRAWLNYEQ 719 (799) Q Consensus 641 ~~adG~~~~~~~v~~g~-~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl~l~~~~~~~~~ 719 (799) +.+||.......+..+. ...+++++++++++++|+|||+|+++++|+||+++.++|.++.+....||+||||++++|.+ T Consensus 631 v~~dG~~~~~~~~~~~~~~~~~~i~~~g~~~a~~v~VGl~y~~~~~~~~~~~~~~~g~~~~~~~~~gr~rl~r~~~~~~~ 710 (792) T protein:vir:94 631 VASDGLVSWFEPPRGGWPNGVPMLTMSGNREGETIYVGLAISFRYVFSKFLIKKTADDGSIATEDIGRLQLRRAWVNYED 710 (792) T ss_pred EEecCceeEeecccceecCCccEEEecCCccCCeEEEeeeeeEEEEeccceeeccCCCcCccccceeeEEEEEEEEeeec Confidence 99999876544443332 23457899999999999999999999999999999999999989889999999999999999 Q ss_pred cceEEEEecCCCcccceeccccccCcc--cccccccccceEEEEEeecccceEEEEEECCCCcEEEEEEEEEEEEecccc Q lcl|NC_019510. 720 SGAFYVDVTNLGRSYRYTMSGKPLGDT--TLGQANLESGQFRFPLAGNAQYNRVVLTSDYTTPLSIIGCGWEGNYIRRST 797 (799) Q Consensus 720 t~~~~~~v~~~~~~~~~~~~~~~~~~~--~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~~eg~y~~r~r 797 (799) |+.|.+++++.+++..+.+.+++++++ ..++|++.+++++||+.+|+++.+|+|+|++|+||+|+||+|||+||+|+| T Consensus 711 tg~~~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlai~~eg~y~~r~~ 790 (792) T protein:vir:94 711 SGAFTVEVENTSRLFSYDMAGARLGSNVLRAGGLNVGTGQFRFPVTGNAQLNEVRIISEHTTPLNVIGCGWEGNYLRRSS 790 (792) T ss_pred cceeEEEEcCCCcceeeeeccceeccccccccccccccceEEEEeeccCceEEEEEEECCCCCEEEEEEEEEEEEecccc Confidence 999999999998888888888998864 457888999999999999999999999999999999999999999999999 Q ss_pred CC Q lcl|NC_019510. 798 GI 799 (799) Q Consensus 798 rv 799 (799) || T Consensus 791 ~v 792 (792) T protein:vir:94 791 GI 792 (792) T ss_pred CC Confidence 99 No 8 >protein:vir:8887 Length: 808 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813776;genbank:gi:29366731;genbank:GeneID:1258831 Probab=100.00 E-value=9.8e-239 Score=1325.42 Aligned_cols=794 Identities=57% Similarity=0.970 Sum_probs=736.5 Q ss_pred CCceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCceEEEEE Q lcl|NC_019510. 1 MGLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYEQYYVV 80 (799) Q Consensus 1 M~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~~y~l~ 80 (799) ||+|+|+||||+||||||+|++||++||++|+||+|+|+|||+||||++||+++++++....+.++++|+|+++|+|+|+ T Consensus 1 M~~v~~s~~n~~~GvSqq~d~~R~~~q~~~~~N~~~~~~gG~~rRpgt~~v~~l~~~~~~~~~~~~~~~~~~~~~~y~v~ 80 (808) T protein:vir:88 1 MGLVSQSVKNLKGGISQQPDILRFSNQGALQINGWSSETQGLQKRPPTTFTKRLQNKGFLGTKPLVHLINRDAQEQYFVG 80 (808) T ss_pred CcceeeecchhccceeccchhHhhhhhhhhhhcceeeeccccccCCchheeeeeeccCCCCCCcEEEEEEeCcCceEEEE Confidence 99999999999999999999999999999999999999999999999999999988888888899999999999999999 Q ss_pred EeCCeEEEEeCCCeEEEEEecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceEEEEecCCCC Q lcl|NC_019510. 81 FTGGDIKVFDLNGQEYAVRGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDALINVRGGQYG 160 (799) Q Consensus 81 ~~~g~irv~~~~G~~~~v~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~~~v~~~~y~ 160 (799) |++++||||+.+|.++.+++..+|+.+++++++|+|+|+||++||+|++++|++..+......++...|++++++.++|+ T Consensus 81 ~~~~~i~v~~~~G~~~~v~~~~~y~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~~~~~~~~~~~~~~~vr~g~y~ 160 (808) T protein:vir:88 81 FSGTGLAVWDLKGNNYTVRGYNGYANCANPRTDLRLITVADYTFVVNRNTVCQMGSTLTHAAYPRLDGRAIINVRGGQYG 160 (808) T ss_pred EeCCeEEEEEcCCceEEEeecCcceEecCChhheeEEEEcCEEEEEcCCcceeecccccccCCCCCCccEEEEEcccccC Confidence 99999999999999999999999999999999999999999999999999999887776667777888999999999999 Q ss_pred ceeeEEeccc-----eEEEEEecCCcccccee------cccccccccccchhhhhhccccccccccceEEEECCcEEEEE Q lcl|NC_019510. 161 RTLNVIFNEA-----TRATIKLPSGTGTTPPI------EEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNII 229 (799) Q Consensus 161 ~ty~v~~~~~-----~~~~~~tp~gt~~~~~~------~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~ 229 (799) ++|++++++. .+.++.+++++...... ........+++++..+..++.+.. ...+|++...++.+++. T Consensus 161 ~~y~i~i~g~~s~~~~t~t~~~~~~s~~~v~~~~~~~~~~~~~~~~~~~ia~~l~~~~~~~~-~~~~~~~~~~~~~~~i~ 239 (808) T protein:vir:88 161 RTLSITINGDGTGSSPQASIKMPNGSAEKVPAGDPYAGMNQVDMTDASWIAAELARQLTVSL-GGSGWSFQAGTGWILIN 239 (808) T ss_pred ceEEEEEecCCcceeeeEeEEEccCcccceeeccceeecccCCccccccchhhheeeeeecc-cccceEEEeccceEEEE Confidence 9999999863 34566667665443322 223334556778877776665432 33578999999999999 Q ss_pred ecCCCceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEecccccccee Q lcl|NC_019510. 230 APDGDSIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVGL 309 (799) Q Consensus 230 a~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~ 309 (799) .+.+..++.+++++|.+++.+.++.+.|+++++||..+++|++++|.+..+++.++||++|+...+.|+||+++++..++ T Consensus 240 ~~a~~~~~~~~t~~g~~~~~~~~~~~~v~~~~~lp~~~p~g~~v~i~~~~~~~~~~~yv~~~~~~~~w~e~~~~~~~~~~ 319 (808) T protein:vir:88 240 APANDNVRQIATKDGYADTLLSGFIYQVQTFTKLPANAPPGYLVEITGESARSGDNYWVQYDASGKVWKETAKPKIIAGF 319 (808) T ss_pred eccCceeEEEcccCCcCcceeeeeeeeccceeeccccCCCCcEEEEEecCCCCCceeEEEEEcCCeEEEEeeeccceeee Confidence 99998899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccceeeEEEeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEecCCeEEEEccCCcccccccccc Q lcl|NC_019510. 310 NNGTMPWSLIRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLSGENIILSRTAKYFNMYPASVA 389 (799) Q Consensus 310 ~~~t~p~~lv~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~ 389 (799) ++++|||.++++++++|+++.++|++|.+||+++||.|||+|++|++|+||||||+|+++++|||||+||||||++++++ T Consensus 320 ~~~tmp~~lv~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~ 399 (808) T protein:vir:88 320 NNATLPHALVRAADGQFDWTPLTWDGRNAGDDDTNPMPSFVGATINDVFFFRNRLGFLSGENVVMSRTSKYFNFFPSSVA 399 (808) T ss_pred cccceeEEEEecCCceEEEEecccccccccccccCccceecCCceeEEEEEcceEEEeeCCeEEEEeccCcccccCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecccCCCCcEEeCCeEEEEec Q lcl|NC_019510. 390 VLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNASGVLSAKSVELNLTTEFDVNDGARPYGIGRGVYFASP 469 (799) Q Consensus 390 ~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~ 469 (799) ++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|+++|||+|+++.++|+|+|++.|+|+.+|++++|+++ T Consensus 400 ~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~i~T~~~e~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~f~~~ 479 (808) T protein:vir:88 400 TLSDDDPIDVAISHNRISILKYAVPFSEQLLLWSDQAQFVLSSKTILSSKTIELDLTTEFDVSDGARPYGIGRGVYFAAP 479 (808) T ss_pred CCCCCccEEEEecCCccceeeEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEEecccCCCCceEeCCeEEEEec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCceeEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEeeEee Q lcl|NC_019510. 470 RATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQSWSHW 549 (799) Q Consensus 470 ~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~w 549 (799) +|+|++++|+|+|++++|+|+++|||+|++|||+|+++++++++++|.+++|++++||+|++|||||+++||+|+||||| T Consensus 480 ~g~~~~v~r~~~~~~~~d~y~~~dlt~~~~h~~~~~~~~~~~~~~~~~~~v~~~~~~g~l~~~~y~~~~~e~~v~aW~r~ 559 (808) T protein:vir:88 480 RASFTSLKRYYAIQDVSDVKSAEDVSAHVPSYITNTVHAIHGSGTENFVSILSDGSPNKVFIYKFLYLDEILQQQSFSHW 559 (808) T ss_pred CCCeeEEEEEEEeeeccCceehhhHHHHHHHhcCCCeEEEEEeCCCCeEEEEEEcCCCEEEEEEEeccCCceeEEeeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCeEEEEEEE--eCCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEccccccccccccccccccc Q lcl|NC_019510. 550 DFGDNVTVLAANS--IGSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVDLNAV 627 (799) Q Consensus 550 ~~~g~~~~~~~~~--~~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~ 627 (799) +|+|+++++|... .+|+||++|+|+++.+++||.+..+..+...+++++||||..++. ++.++.....+....... T Consensus 560 ~~~g~~~~~~~~~~~~~d~l~~vV~r~~~~~ler~~~~~~~~~~~~~~~~~~lD~~~~~~--~g~~~~~~~~t~~~~~~~ 637 (808) T protein:vir:88 560 EFGDAATTRVLAASCIGSYCYLMIDRPEGLCLERMEFTQHTIDYSIEPYRTYMDMKKTIV--LGAYNIDTNLTSFDVRTA 637 (808) T ss_pred ecCCCeeEEEEEEeccCCEEEEEEEcCCcEEEEEEeeccCCCCCccccceeeeeeeeeec--cccccCccccceeecccc Confidence 9999988766544 479999999999999999999988888998899999999998764 456666666777777888 Q ss_pred cCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccceeeeecce Q lcl|NC_019510. 628 FGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGR 707 (799) Q Consensus 628 ~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~r 707 (799) ++++.|+++..+.+.+||..+... ..++...+++++++++++++|+|||+|+++++|+||++++++|+++++++.+|| T Consensus 638 ~~~~~~~~~~~~~~~~dg~~~~~~--~~~~~~~~~~~~~~~~~~~~v~vGl~y~s~~~~~p~~~~~~~g~~~~~~~~~gr 715 (808) T protein:vir:88 638 YGGTPGPESTFYTIDQQGVLIEHE--ARDWATNPYISFVGNRAGEQMVIGKQYTFQYEFSKFLIKQTADDGSTSTEDIGR 715 (808) T ss_pred cccccccceeEEEEcCCceEEeee--cccccCcceEEeCCCccCceEEEeeeeeEEEEecceEEecCCCCcceeecccce Confidence 999999999999999999875443 345667788999999999999999999999999999999999999999999999 Q ss_pred EEEEEEEEEeeccceEEEEecCCCcccceeccccccCc-ccccccccccceEEEEEeecccceEEEEEECCCCcEEEEEE Q lcl|NC_019510. 708 LQHRRAWLNYEQSGAFYVDVTNLGRSYRYTMSGKPLGD-TTLGQANLESGQFRFPLAGNAQYNRVVLTSDYTTPLSIIGC 786 (799) Q Consensus 708 l~l~~~~~~~~~t~~~~~~v~~~~~~~~~~~~~~~~~~-~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i 786 (799) |||||+++++.+|++|.+.++++.+++.+.+.+++++. +.++.+|+++|++++|+.+|+++.+|+|+|++|+||+|+|| T Consensus 716 ~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~d~P~P~tilsi 795 (808) T protein:vir:88 716 LQLRRAWLNYEESGAFEINVNNGSSEFVYVMTGGRLGIQRVLGELSVGTGQFKFPVTGNAVNQRVTITSSNPNPLNVIGC 795 (808) T ss_pred EEEEEEEEEeecccceEEEeCCCcccceeeccCcccCcccccCccccccceEEEEecccCceeEEEEEECCCCceEEEEE Confidence 99999999999999999999988888888899999985 46678899999999999999999999999999999999999 Q ss_pred EEEEEEeccccCC Q lcl|NC_019510. 787 GWEGNYIRRSTGI 799 (799) Q Consensus 787 ~~eg~y~~r~rrv 799 (799) +|||+||+|+||| T Consensus 796 ~~eg~y~~r~~~v 808 (808) T protein:vir:88 796 GWEGNYIRRSSGI 808 (808) T ss_pred EEEEEEeccccCC Confidence 9999999999999 No 9 >protein:vir:7021 Length: 803 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853594;genbank:gi:31711676;genbank:GeneID:1481802 Probab=100.00 E-value=7e-221 Score=1227.55 Aligned_cols=777 Identities=23% Similarity=0.358 Sum_probs=669.2 Q ss_pred CceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEE---cCCceEEE Q lcl|NC_019510. 2 GLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLIN---RDEYEQYY 78 (799) Q Consensus 2 ~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~---~~~~~~y~ 78 (799) =+|+|+||||+||||||+|++||++||++|+||+|+|+|||+||||++||+++++++. ...+++++. ++++|+|+ T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~~va~l~~~~~--~~~~~~~~~~~~~~~e~~~~ 78 (803) T protein:vir:70 1 MEVQGSLGRQIQGISQQPPAVRLDGQCSEMVNMVPDVVEGTKSRMGTTHIAKLLEYGE--DDMAVHHYRRGGEGEEEYFF 78 (803) T ss_pred CeEEeecchhccccccCchHHhhhhhhhhhhcceeeeccccccCChhhhhhhhcCCCc--ccceeeEEEecCCCceEEEE Confidence 3699999999999999999999999999999999999999999999999999876543 333444444 34679999 Q ss_pred EEEeCCeEEEEeCCCeEEEEEecCccc----cccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceEEEE Q lcl|NC_019510. 79 VVFTGGDIKVFDLNGQEYAVRGDKSYV----QTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDALINV 154 (799) Q Consensus 79 l~~~~g~irv~~~~G~~~~v~~~~~y~----~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~~~v 154 (799) |+|++++||||+++|.++.+++..+|. ++++++++|+|+|+||+|||+|++++|++..+.. ..++.++++++ T Consensus 79 ~~~~~~~irv~~~~G~~~~v~~~~~~~~~l~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~----~~~~~~~~~~v 154 (803) T protein:vir:70 79 IMKKGQVPEIFDKQGRKCMVQSQDAPMTYLSEVTNPREDVQFMTIADVTFMLNRKKIVKARPERS----PQVGSTAIVFM 154 (803) T ss_pred EEecCCeEEEEEcCCcEEEEecCCceeEEEeecCCChhheeEEEEcCEEEEecCceeeeeccccC----CCCCCceEEEE Confidence 999999999999999999999888763 4678889999999999999999999999866543 34567899999 Q ss_pred ecCCCCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEecCCC Q lcl|NC_019510. 155 RGGQYGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAPDGD 234 (799) Q Consensus 155 ~~~~y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~~~~ 234 (799) +.++|+++|++++++...+.+++++++.. ..+...++++|+.++...+.+ ..+..+|+..+.++.++|.++++. T Consensus 155 r~g~y~~~y~itIng~~~a~~~t~~~~~~-----~~~~~~~~~~ia~~l~~~~~~-~~s~a~~~~~~~g~~~~i~~~~~~ 228 (803) T protein:vir:70 155 AYGQYGTHYKIIIDGVVAAGYKTRDGAEA-----HHIEDIRTESIAYNLYQSLQS-WDKIADYEIQLDGTSIYITRRDGS 228 (803) T ss_pred eecCCcceEEEEeCCcceEEEEeCCCccc-----ccccccchhhhhhhhhhheec-cccccceEEEECCcEEEEEEcCCC Confidence 99999999999999999999999987653 334556788999988777653 334568999999999999999999 Q ss_pred ceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecC---ceEEEEeccccccceeec Q lcl|NC_019510. 235 SIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLT---RKVWEETVGWNIQVGLNN 311 (799) Q Consensus 235 ~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~---~~~w~e~~~~~~~~~~~~ 311 (799) ..+.+++++|..++.+..+++.|+++++||+.|++|+.++|.++.+++.|.||++|+.. .++|+|++++++..+++. T Consensus 229 ~~~~~~t~~g~~~~~~~~~~~~v~~~~~Lp~~~~~g~~v~v~~~g~~~~d~y~v~~~~~~~~~~~w~e~a~~g~~~~~~~ 308 (803) T protein:vir:70 229 TTFDITTEDGAKGKDLVAIKYKVASTDLLPSRAPEGYKVQVWPTGSKPESRYWLQAEKQNGNIVSWKETLAADVLIGFDK 308 (803) T ss_pred CeeEEEeecCcCCcEEEEEEecccceeeccccCCCCceEEEEcCCCCCCceeeEEEEeccCCccceEeeeccceeeeeec Confidence 88999999999999999999999999999999999999999999999999999999754 458999999999999999 Q ss_pred cceeeEEEecc----CceEEEeecccCccccCccccccCccccC----CCeeEEEEEcceEEEecCCeEEEEccCCcccc Q lcl|NC_019510. 312 GTMPWSLIRAA----DGQFDFVANSWVGRTAGDDDTNPHPSFVG----QAITDVFFYRNRLGMLSGENIILSRTAKYFNM 383 (799) Q Consensus 312 ~t~p~~lv~~~----~~t~~~~~~~w~~~~~gd~~~np~psf~~----~~p~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF 383 (799) +||||.+++.+ .++|+++.++|+.|.+||+++||.|+|.+ +||++|+||||||+|+++++|||||+|||||| T Consensus 309 ~t~p~~~v~~~~~~~~~~~~~~~~~~~~r~~gdd~tnp~psf~~~~~~~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF 388 (803) T protein:vir:70 309 STMPYIIERTGFVNGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTLGGMFMVQNRLCVTAGEAVIATRTSYFFDF 388 (803) T ss_pred ccccEEEEEEEEeecceeEEEEeeccccccccccccCccccccCccCCCCceeEEEEeceEEEeeCCeEEEEccCCcccc Confidence 99999999875 46799999999999999999999999997 68999999999999999999999999999999 Q ss_pred ccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecccCCCCcEEeCCe Q lcl|NC_019510. 384 YPASVAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNASGVLSAKSVELNLTTEFDVNDGARPYGIGRG 463 (799) Q Consensus 384 ~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~ 463 (799) +++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|+++|||+|++++++|+|+|+++|+|+.+|++ T Consensus 389 ~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~i~T~~~q~~l~g~~~lTP~~~~i~~~s~~~~~~~~~Pv~vg~~ 468 (803) T protein:vir:70 389 FRYTAVSAVATDPFDVFSDASEVYQLKHAVTLDGSTVLFADKSQFILPGDKPLEKSNVLLKPVTTFEVNNNVKPVATGES 468 (803) T ss_pred ccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEEeeccCCCccEEeCCe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEecCCCceeEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceee Q lcl|NC_019510. 464 VYFASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQ 543 (799) Q Consensus 464 v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v 543 (799) ++|++++|+|++++ +|.|++.+|+|+++|||+|++|||++++++++++++++.+++|+.+++++|++|+||++++||+| T Consensus 469 v~fv~~~g~~s~vr-e~~~~~~~d~y~a~Dlt~~a~hl~~~~v~~~~~~~~~~~~v~~~~~~~~~l~~~~yl~~~~e~~v 547 (803) T protein:vir:70 469 VMFATSEGAYSGIR-EFYTDSYSDTKKAQAITSHVNKLLEGNVIMMSASTNVNRLLVLTDKYRNIIYCYDWLWQGTERVQ 547 (803) T ss_pred EEEeccCCCeeEEE-EEeccccccceehhhhhhhhHhhcCCceEEEEEeCCCCeEEEEEEcCCCeEEEEEEEecCCcEEE Confidence 99999999886655 46799999999999999999999999999998888888889999999999999999999999999 Q ss_pred EeeEeeecCCCeEEEEEEEeCCEEEEEEEeCCCE-EEEEEEEEeeccCCCccccceeeceeEEEEEcccccccccccccc Q lcl|NC_019510. 544 QSWSHWDFGDNVTVLAANSIGSHMHVILQNGYDI-FMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTV 622 (799) Q Consensus 544 ~aW~~w~~~g~~~~~~~~~~~d~l~~~v~r~~~~-~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~ 622 (799) +|||||+|+|+++++|+.+.+|+||++|+|++++ +++||..... .+ ..+++++||||..++....-.. ....+.. T Consensus 548 ~aW~r~~~~g~~~~~~~~~~~d~l~~vv~r~~~g~~ier~~~~~~-~~-~~~~~~~~lD~~~~~~~~~~~~--~~~~~~~ 623 (803) T protein:vir:70 548 AAWHKWEWPLGTFIRGMFYSGEHLYLLIERGSTGVYLERMDMGDA-LV-YNLNDRIRMDRQAELIFRHIKA--EDVWVSE 623 (803) T ss_pred EeEEEEEcCCCEEEEEEEecCCEEEEEEEECCCeEEEEEEecccc-cc-cCCcceeEeccceeEeeccccC--Cceeeee Confidence 9999999999999999999999999999999865 4677754333 22 2456789999987765432100 0011100 Q ss_pred ---ccccccCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccc Q lcl|NC_019510. 623 ---DLNAVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGG 699 (799) Q Consensus 623 ---~~~~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~ 699 (799) .......++.+.++......++|.+.......+....-.-..+++++++++|+|||+|+++++|+||++++++|+.+ T Consensus 624 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~g~~t~~~~~~~~~~~~~a~~v~VGl~Y~~~~~~~~~~i~~~~~~~~ 703 (803) T protein:vir:70 624 PLPWQPTDVTLLDCVLIDGWDSYIGGSFLFSYNPGDNTLTTTFDMHDDDHVKAKVVVGQLYPQEFEPTQVVIRDNQERVS 703 (803) T ss_pred cccccCcccceeeEEEeeeeeeecCCeEEEEEcCCCccceeeeeEECCCCcccEEEEeeeeeEEEeecceEEEcCCCccc Confidence 11112233344444444445555444333322211111224578899999999999999999999999999998776 Q ss_pred eeeeecceEEEEEEEEEeeccceEEEEecCCCccc--ceeccccccCc--ccccccccccceEEEEEeecccceEEEEEE Q lcl|NC_019510. 700 FSTEDVGRLQHRRAWLNYEQSGAFYVDVTNLGRSY--RYTMSGKPLGD--TTLGQANLESGQFRFPLAGNAQYNRVVLTS 775 (799) Q Consensus 700 ~~~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~~--~~~~~~~~~~~--~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~ 775 (799) .. +|+||+|++|++++|++|.+.|++..++. .+.+.++++|+ +.++.+|+++|+++||+++|+++.+|+|++ T Consensus 704 ~~----~~~rl~r~~~~~~~sg~~~v~v~~~~~~~~~~~~~s~~~~g~~~~~~g~~~~~tg~~~vP~~~~~~~~~v~i~~ 779 (803) T protein:vir:70 704 YI----DVPTVGLVHLNLDKYPDFKVEVKNLKSGKVRNVLASNRVGGAINNIVGYVEPREGVFKFPLRSLSTDTVYRVMV 779 (803) T ss_pred cc----cccEEEEEEEEeecccceEEEEecCCccccceeeccchhccccccccCccccccceEEEEeeccCcceEEEEEE Confidence 55 66789999999999999999999877753 35578888885 467889999999999999999999999999 Q ss_pred CCCCcEEEEEEEEEEEEeccccCC Q lcl|NC_019510. 776 DYTTPLSIIGCGWEGNYIRRSTGI 799 (799) Q Consensus 776 ~~P~P~tvl~i~~eg~y~~r~rrv 799 (799) ++|+||+|++|+|||+||+|+||| T Consensus 780 d~P~P~tvlsi~weg~y~~r~rrv 803 (803) T protein:vir:70 780 ESPHTFQLRDIEWEGSYNPTKRRV 803 (803) T ss_pred CCCCCeEEEEEEEEEEEecccccC Confidence 999999999999999999999999 No 10 >protein:vir:97014 Length: 800 # NCBI annotation: 33 # Family: family:all:825 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654134;genbank:gi:108862018;genbank:GeneID:5075963 Probab=100.00 E-value=2.3e-220 Score=1224.76 Aligned_cols=768 Identities=24% Similarity=0.383 Sum_probs=673.4 Q ss_pred CceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEE--c-CCceEEE Q lcl|NC_019510. 2 GLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLIN--R-DEYEQYY 78 (799) Q Consensus 2 ~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~--~-~~~~~y~ 78 (799) =.|+|+||||+||||||+|++||++||++|+||+|+|+|||+||||++||+++++++. ..++++|. + +++|+|+ T Consensus 1 ~~v~~s~~n~~~GvSqq~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~fva~l~~~~~---~~~~~~~~~~~d~~eq~~v 77 (800) T protein:vir:97 1 MEVQGSLGRQIQGISQQPPAVRLDGQCTAMVNMIPDVVNGTQSRMGTTHIAKILDAGT---DDMATHHYRRGDGDEEYFF 77 (800) T ss_pred CeeEeechhhhcccccCchhHhhhhhhhhhhcceeccccccccCCchhhheeecCCCc---ccceeEEEEEcCCceEEEE Confidence 2589999999999999999999999999999999999999999999999999877643 33444444 3 4678889 Q ss_pred EEEeCCeEEEEeCCCeEEEEEecCccc----cccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceEEEE Q lcl|NC_019510. 79 VVFTGGDIKVFDLNGQEYAVRGDKSYV----QTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDALINV 154 (799) Q Consensus 79 l~~~~g~irv~~~~G~~~~v~~~~~y~----~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~~~v 154 (799) |+|+++++|||+++|.++.|++..+|. .+++++++|+|+|+||++||+|++++|++..+.. ..+..++++++ T Consensus 78 ~~~~~~~~rv~~~~G~~~~v~~~~~~~~y~~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~----~~~~~~~~~~v 153 (800) T protein:vir:97 78 TLKKGQVPEIFDKYGRKCNVTSQDAPMTYLSEVVNPREDVQFMTIADVTFMLNRRKVVKASSRKS----PKVGNKAIVFC 153 (800) T ss_pred EEEcCCEEEEEecCCcEEEEecCCcceEEEeccCCCccceeEEEEcCEEEEeeCceecccccccc----cCCCcceEEEE Confidence 999999999999999999999887763 4578899999999999999999999999876543 34677899999 Q ss_pred ecCCCCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEecCCC Q lcl|NC_019510. 155 RGGQYGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAPDGD 234 (799) Q Consensus 155 ~~~~y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~~~~ 234 (799) +.++|+++|++++++..++++++|+++.. ......++++|+.+|...+.. ..+..+|++...+.+++|.++++. T Consensus 154 ~~g~y~~~y~i~I~~~~~~~~~t~~~t~~-----~~~~~~~~~~ia~ql~~~~~~-~~~~~~~t~~~~G~~~~i~~~~~~ 227 (800) T protein:vir:97 154 AYGQYGTSYSIVINGANAASFKTPDGGSA-----DHVEQIRTERITSELYSKLQQ-WSGVSDYEIQRDGTSIFIERRDGA 227 (800) T ss_pred eecccceeeeeccCCcceEEEEEcCCCCc-----ccceeccHHHHHHHHHHhhhc-cccccceEEEeCCcEEEEEEcCCc Confidence 99999999999999999999999987643 344556778888888777643 344578999999999999999877 Q ss_pred ceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEec---CceEEEEeccccccceeec Q lcl|NC_019510. 235 SIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNL---TRKVWEETVGWNIQVGLNN 311 (799) Q Consensus 235 ~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~---~~~~w~e~~~~~~~~~~~~ 311 (799) .. .+++.+|++++.+..+++.++++++||..|++|+.++|+++.+++.+.||++|+. +.++|+||++++...++++ T Consensus 228 ~~-~v~t~~g~~~~~~~~~~~~v~~~~~lp~~~~~g~~v~i~~~~~~~~~~y~~~~~~~~~~~~~w~e~~~~~~~~~~~~ 306 (800) T protein:vir:97 228 SF-TITTTDGAKGKDLVAIKNKVSSTDLLPSRAPAGYKVQVWPTGSKPESRYWLQAEPKEGNLVSWKETIAADVLLGFDK 306 (800) T ss_pred eE-EEEecCCcCceeeeEEeeeccchhhchhhCCCCcEEEEEccCCCCCceEEEEEEecccCcceEEEeeccccccceec Confidence 65 6899999999999999999999999999999999999999999999999999985 4578999999999999999 Q ss_pred cceeeEEEec----cCceEEEeecccCccccCccccccCccccC----CCeeEEEEEcceEEEecCCeEEEEccCCcccc Q lcl|NC_019510. 312 GTMPWSLIRA----ADGQFDFVANSWVGRTAGDDDTNPHPSFVG----QAITDVFFYRNRLGMLSGENIILSRTAKYFNM 383 (799) Q Consensus 312 ~t~p~~lv~~----~~~t~~~~~~~w~~~~~gd~~~np~psf~~----~~p~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF 383 (799) ++|||.+++. ..++|+++..+|++|.+||+++||.|+|++ ++|++|+||||||+|+++++|||||+|||||| T Consensus 307 ~tmp~~~~~~~~~~~~g~~~~~~~~w~~r~~gd~~tnp~p~f~~~~~~~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF 386 (800) T protein:vir:97 307 GTMPYIIERTDIINGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTIGGMFMVQNRLCFTAGEAVIASRTSYFFDF 386 (800) T ss_pred ccceEEEEEeecccccceeEEEeccccccccCccccCccccccCCcCCCCceeEEEEeeeEEEecCCeEEEEecCCcccc Confidence 9999999987 578999999999999999999999999998 79999999999999999999999999999999 Q ss_pred ccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecccCCCCcEEeCCe Q lcl|NC_019510. 384 YPASVAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNASGVLSAKSVELNLTTEFDVNDGARPYGIGRG 463 (799) Q Consensus 384 ~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~ 463 (799) +++|++++.|||||+++++++|+|+|+|+++++++|+|||+++||+|+|+++|||+|++++++|+|+|+++|+|+.+|++ T Consensus 387 ~~~t~~~~~DdD~I~~~~ss~~v~~i~~~v~~~~~L~i~T~~~q~~ls~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~ 466 (800) T protein:vir:97 387 FRYTVISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFILPGDKPLEKSNALLKPVTTFEVNNKVKPVVTGES 466 (800) T ss_pred ccccccCCCCCccEEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEeeeccCCCCcEEeCCe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEecCCCceeEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceee Q lcl|NC_019510. 464 VYFASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQ 543 (799) Q Consensus 464 v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v 543 (799) ++|++++|+|++++ +|.|++.+|+|+++|||+|++|||+|+++++++++++|..++|+++++|+|++||||++++||+| T Consensus 467 v~fv~~~g~~s~vr-e~~~~~~~d~~~a~DlT~~~~hl~~~~v~~~~~~~~~~~~v~~~~~~~~~l~~~~y~~~~~e~~~ 545 (800) T protein:vir:97 467 VMFATNDGSYSGVR-EFYTDSYSDTKKAQAITSHVNKLIEGNITNMAASTNVNRLLVTTDKYRNIIYCYDWLWQGTDRVQ 545 (800) T ss_pred EEEeeCCCCeeEEE-EEeeeecccceehhhHHHHHHHhcCCceEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceEE Confidence 99999999887654 56899999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeEeeecCCCeEEEEEEEeCCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEE----------EEccccc Q lcl|NC_019510. 544 QSWSHWDFGDNVTVLAANSIGSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRY----------DIPANAF 613 (799) Q Consensus 544 ~aW~~w~~~g~~~~~~~~~~~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~----------~~~~~~~ 613 (799) +|||||++++...++|+.+.+|+||++|+|+++.++|||.+..+ .+. .++++.+||+..+. .++++.+ T Consensus 546 ~aW~~~~~~~~~~~~~~~~~~d~l~~vv~r~~~~~ler~~~~~~-~~~-~~~~~~~lD~~~~~~~~~~~~~~~~v~~~~~ 623 (800) T protein:vir:97 546 SAWHVWKWPIGTKVRGMFYSGELLYLLLERGDGVYLEKMDMGDA-LTY-GLNDRIRMDRQAELVFKHFKAEDEWVSEPLP 623 (800) T ss_pred EeEEEEecCCCeEEEEEEEcCCeEEEEEEcCCcEEEEEEecccC-cCc-ccccceeccccceeeeeeeecccceEecccc Confidence 99999999998888888888999999999999999999865433 232 24457788854332 3444444 Q ss_pred cccccccccccccccCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEc Q lcl|NC_019510. 614 NNDRYETTVDLNAVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKK 693 (799) Q Consensus 614 ~~~~~~~~~~~~~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~ 693 (799) +...+.... .....+|+.|++|.++... ++.... .++.+...+++++++++|||||+|+++++|+||++++ T Consensus 624 ~~~~~~~~~-~~~~v~g~~~~~G~~v~~~-~~~~~~-------~~~~~~~~~~~~~~~~~v~vGl~Y~~~~~~~p~~i~~ 694 (800) T protein:vir:97 624 WVPTNPELL-DCILIEGWDSYIGGSFLFK-YNPSDN-------TLSTTFDMYDDSHVKAKVIVGQIYPQEFEPTPVVIRD 694 (800) T ss_pred ccCCCccee-EEEEecccccccCceEEEE-ecCccC-------cccccceEEeCCCCCcEEEEeeeeeEEEEecceEEEe Confidence 443333222 2234578999999988654 343322 2333345677889999999999999999999999999 Q ss_pred CCCccceeeeecceEEEEEEEEEeeccceEEEEecCCCccc--ceeccccccCc--ccccccccccceEEEEEeecccce Q lcl|NC_019510. 694 QDESGGFSTEDVGRLQHRRAWLNYEQSGAFYVDVTNLGRSY--RYTMSGKPLGD--TTLGQANLESGQFRFPLAGNAQYN 769 (799) Q Consensus 694 ~~g~~~~~~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~~--~~~~~~~~~~~--~~~~~~~~~tg~~~vp~~~~~~~~ 769 (799) ++|+.+.. +|+||+|++|++.+|++|++.|++.+++. ...+.++++++ ...+.+++++|+++||+.+|+++. T Consensus 695 ~~g~~~~~----~r~~i~r~~~~~~~sg~~~~~v~~~~~~~~~~~~~~~~~~g~~~~~~g~~~~~tg~~~vp~~g~~~~~ 770 (800) T protein:vir:97 695 NQDRVSYI----DVPVVGLVHLNLDMYPDFSVEVKNVKSGKVRRVLASNRIGGALNNTVGYVEPREGVFRFPLRAKSTDV 770 (800) T ss_pred cCCCceee----cceEEEEEEEeecccccEEEEEccccCCceeeeecCccccccccccCCccccccceEEEEeeccccee Confidence 98876653 78999999999999999999999887753 33467788875 346778999999999999999999 Q ss_pred EEEEEECCCCcEEEEEEEEEEEEeccccCC Q lcl|NC_019510. 770 RVVLTSDYTTPLSIIGCGWEGNYIRRSTGI 799 (799) Q Consensus 770 ~v~i~~~~P~P~tvl~i~~eg~y~~r~rrv 799 (799) +|+|+|++|+||+|+||+|||+||+|+||| T Consensus 771 ~v~i~~d~PlP~tvlsi~~eg~y~~r~~rv 800 (800) T protein:vir:97 771 VYRIIVESPHTFQLRDIEWEGSYNPTKRRV 800 (800) T ss_pred EEEEEECCCCcEEEEEEEEEEEeecccccC Confidence 999999999999999999999999999999 No 11 >protein:vir:105647 Length: 800 # NCBI annotation: putative tail tubular B protein # Family: family:all:825 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425011;genbank:gi:83571759;uniprot:Q2WC41;genbank:GeneID:3837288 Probab=100.00 E-value=7.3e-220 Score=1221.98 Aligned_cols=780 Identities=24% Similarity=0.366 Sum_probs=680.2 Q ss_pred CceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCceEEEEEE Q lcl|NC_019510. 2 GLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYEQYYVVF 81 (799) Q Consensus 2 ~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~~y~l~~ 81 (799) =.|+|+||||+||||||+|++||++||++|+||+|+|+|||+||||++||+++++++......+.+....++++.|+++| T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~fva~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (800) T protein:vir:10 1 MEVQGSLGRQIQGISQQPPAVRLDGQCTTMVNMVPDVVNGTQSRMGTTHIAKLLDEGTDNMATHHYRRGEGDEEYFFTLK 80 (800) T ss_pred CeEEeecchhcccccccchhHhhhhhhhhhhcceeeeccCcccCCcceEEEeecCCCCCccEEEEEecCCccceEEEEEE Confidence 25899999999999999999999999999999999999999999999999999877655444433333335667788899 Q ss_pred eCCeEEEEeCCCeEEEEEecCccc----cccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceEEEEecC Q lcl|NC_019510. 82 TGGDIKVFDLNGQEYAVRGDKSYV----QTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDALINVRGG 157 (799) Q Consensus 82 ~~g~irv~~~~G~~~~v~~~~~y~----~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~~~v~~~ 157 (799) .++++|||+++|.++.++...++. .+.+++++|+|+|+||++||+|++++|+++.|..+. ...+++++++.+ T Consensus 81 ~g~~~rv~~~~G~~~~v~~~~~~~~~~~~~~~~~~~l~~~tvaD~tfi~n~~~~~~~~~~~~~~----~~~~~~~~vr~g 156 (800) T protein:vir:10 81 KGQVPEIFDKHGRKCNVISQDAPMTYLSEVVNPREDVQFMTIADVTFMLNRRKVVKVSNRKSPK----VGDKAIVFCAYG 156 (800) T ss_pred cCCeEEEEecCCcEEEeecCCcceeeeeccCCchhhEEEEEEcCEEEEecCcccccccccCCCC----CCceEEEEEecc Confidence 999999999999999998776542 467788899999999999999999999999886654 346799999999 Q ss_pred CCCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEecCCCcee Q lcl|NC_019510. 158 QYGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAPDGDSIR 237 (799) Q Consensus 158 ~y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~~~~~~~ 237 (799) +|+++|++++++...+.+++|+++.. ....+.++++++.+|...+... .+..+|++...++.++|.++++.+ + T Consensus 157 ~y~~~y~i~i~g~~~~~~~t~~~~~~-----~~~~~~s~~~i~~~L~~~l~~~-~~~~~~t~~~~g~~i~i~~~~~~~-~ 229 (800) T protein:vir:10 157 QYGTSYSIIINGTTAASFKTPDGGSA-----EHVEQIRTERITSELYSKLQQW-SGVNDYEIQRDGTSIFIERRDGKS-F 229 (800) T ss_pred ccccceeEEeccceEEEEEecCCCcc-----cccccccHHHHHHHHHhhhhhc-CcccceEEEEcCcEEEEEEecCCc-e Confidence 99999999999999999999987653 3455677888998888776543 355789999999999999988765 4 Q ss_pred EEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEec---CceEEEEeccccccceeeccce Q lcl|NC_019510. 238 GLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNL---TRKVWEETVGWNIQVGLNNGTM 314 (799) Q Consensus 238 ~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~---~~~~w~e~~~~~~~~~~~~~t~ 314 (799) .+++.+|+.++.+..+++.++++++||..|++|+.++|.++.+++.+.||++|+. +.+.|+||+++++..++++++| T Consensus 230 ~~~~~~~~~~~~~~~~~~~v~~~~~Lp~~~~~g~~~~i~~~~~~~~~~y~~~~~~~~~~~~~w~e~~~~~~~~~~~~~tm 309 (800) T protein:vir:10 230 TVTTTDGAKGKDLVAIKNKVSSTDLLPSRAPAGYKVQVWPTGSKPESRYWLQAEPKEGNLVSWKETIAADVLLGFDKGTM 309 (800) T ss_pred EEEEeecCCcceEEEEEeeccceeeccccCCCCceEEEEcCCCCCCceeEEEEEeccccceEEEeecccCceeeeecccc Confidence 7899999999999999999999999999999999999999999999999999985 4578999999999999999999 Q ss_pred eeEEEecc----CceEEEeecccCccccCccccccCccccC----CCeeEEEEEcceEEEecCCeEEEEccCCccccccc Q lcl|NC_019510. 315 PWSLIRAA----DGQFDFVANSWVGRTAGDDDTNPHPSFVG----QAITDVFFYRNRLGMLSGENIILSRTAKYFNMYPA 386 (799) Q Consensus 315 p~~lv~~~----~~t~~~~~~~w~~~~~gd~~~np~psf~~----~~p~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~ 386 (799) |+.+++.+ .++|+++..+|++|.+||+++||.|+|+| ++|++|+||||||+|++|++|||||+||||||+++ T Consensus 310 p~~lv~~~~~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~~~~~~~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~ 389 (800) T protein:vir:10 310 PYIIERTGIIDGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTIGGMFMVQNRLCFTAGEAVIASRTSYFFDFFRY 389 (800) T ss_pred cEEEEEeeeeecceeEEEEeccccccccCCCCCCCCchhcCCCCCCCceeEEEEeeeEEEeeCCeEEEEccCCccccccc Confidence 99999887 78999999999999999999999999998 57999999999999999999999999999999999 Q ss_pred cccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecccCCCCcEEeCCeEEE Q lcl|NC_019510. 387 SVAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNASGVLSAKSVELNLTTEFDVNDGARPYGIGRGVYF 466 (799) Q Consensus 387 t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f 466 (799) |++++.|||||+++++++|+|+|+|+++++++|+|||+++||+|+|+++|||+|++++++|+|+|+++|+|+.+|++++| T Consensus 390 t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~q~~l~g~~~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~F 469 (800) T protein:vir:10 390 TVISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFILPGDKPLEKSNALLKPVTTFEVNNKVKPVVTGESVMF 469 (800) T ss_pred cccCCCCCccEEEEEcCCcceeeeeEeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEeeeccCCCCceEeCCeEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EecCCCceeEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEee Q lcl|NC_019510. 467 ASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQSW 546 (799) Q Consensus 467 ~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW 546 (799) ++++|+|++++| |.|++.+|+|+++|||+|++|||+|++++++++++++..++|+++++|+|++|||||+++||+|+|| T Consensus 470 v~~~g~~s~vre-~~~~~~~d~~~a~DlT~~~~hl~~~~v~~~~~~~~~~~~v~~~~~~~~~l~~~~yl~~~~e~~~~aW 548 (800) T protein:vir:10 470 ATNDGSYSGVRE-FYTDSYSDTKKAQAITSHVNKLIEGNITNMAASTNVNRLLVTTDKYRNIIYCYDWLWQGTDRVQSAW 548 (800) T ss_pred ecCCCCeeEEEE-EeeeecccceehhhHHhHHHHhcCCceEEEEEeCCCCeEEEEEEcCCCeEEEEEEeecCCceEEEEE Confidence 999998877655 5789999999999999999999999999999888899999999999999999999999999999999 Q ss_pred EeeecCCCeEEEEEEEeCCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEcccccccccccccc-ccc Q lcl|NC_019510. 547 SHWDFGDNVTVLAANSIGSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTV-DLN 625 (799) Q Consensus 547 ~~w~~~g~~~~~~~~~~~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~-~~~ 625 (799) |||++++...++|+.+.+|+||++|+|+++.++|||....+ .+. .++++++|||..++................ ... T Consensus 549 ~~w~~~~~~~~~~~~~~~d~l~~iv~r~~~~~ier~~~~~~-~~~-~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~ 626 (800) T protein:vir:10 549 HVWEWPMGTKVRGMFYSGELLYLLLERGDGVYLEKMDMGDA-LTY-GLNDRIRMDRQAELIFKHFKAEDEWISEPLPWTP 626 (800) T ss_pred EEEEcCCCcEEEEEEEeCCeEEEEEECCCcEEEEEEecccC-ccc-cccceeeeecceeecccccccCcceEEEeccccc Confidence 99999876666777777999999999999999999865433 232 456789999987765443222111100000 001 Q ss_pred cccCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccceeeeec Q lcl|NC_019510. 626 AVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDV 705 (799) Q Consensus 626 ~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~ 705 (799) ....++.++....+...++|.++....+.+|.++++....+++.++++|+|||+|+++++|+||++++++|+.++. T Consensus 627 ~~~~~~~~~~~~~~~~~~~g~v~~~~~~~~g~~~~~~~~~~g~~~~~~v~VGl~Y~s~~~~~~~~i~~~~g~~~~~---- 702 (800) T protein:vir:10 627 TNPELLDCILIEGWDSYIGGSFLFKYKPSDNTLSTTFDMHDDNHVKAKVIVGQIYPQEFEPTPVVIRDRQDRVSYI---- 702 (800) T ss_pred cCCcceEEeeeccceeecCceeEEEEEecCCceEeeeeecCCCcccceEEEeeeeeEEEeecceEEEcCCCccccc---- Confidence 1123455555555666777888888888899888876666788999999999999999999999999999877655 Q ss_pred ceEEEEEEEEEeeccceEEEEecCCCcc--cceeccccccCc--ccccccccccceEEEEEeecccceEEEEEECCCCcE Q lcl|NC_019510. 706 GRLQHRRAWLNYEQSGAFYVDVTNLGRS--YRYTMSGKPLGD--TTLGQANLESGQFRFPLAGNAQYNRVVLTSDYTTPL 781 (799) Q Consensus 706 ~rl~l~~~~~~~~~t~~~~~~v~~~~~~--~~~~~~~~~~~~--~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~ 781 (799) +|+||+|++|++.+||+|.+++++.+++ ....+.+++.|. +.++.+|+++|+++||+.+|+++.+|+|++++|+|| T Consensus 703 ~r~~i~r~~~~~~~sg~~~~~v~~~~~~~~~~~~~~~~~~g~~~~~~g~~~~~tg~~~vp~~g~~~~~~v~i~~d~P~P~ 782 (800) T protein:vir:10 703 DVPVVGLVHLNLDMYPDFSVEVKNVKSGKVRRVLASNRIGGALNNTVGYVEPREGVFRFPLRAKSTDAVYRIIVESPHTF 782 (800) T ss_pred CCeEEEEEEEEeecCceEEEEeccCcccceeEEccCCeeccccccccCcccccCceEEEEEeccCceeEEEEEECCCCcE Confidence 8899999999999999999999887764 334456677764 467889999999999999999999999999999999 Q ss_pred EEEEEEEEEEEeccccCC Q lcl|NC_019510. 782 SIIGCGWEGNYIRRSTGI 799 (799) Q Consensus 782 tvl~i~~eg~y~~r~rrv 799 (799) +|+||+|||+||+|+||| T Consensus 783 tvlai~~eg~y~~r~~rv 800 (800) T protein:vir:10 783 QLRDIEWEGSYNPTKRRV 800 (800) T ss_pred EEEEEEEEEEeecccccC Confidence 999999999999999999 No 12 >protein:vir:103341 Length: 806 # NCBI annotation: tail tubular protein B-like protein # Family: family:all:825 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039670;genbank:gi:125999999;genbank:GeneID:4818417 Probab=100.00 E-value=3.9e-217 Score=1207.04 Aligned_cols=773 Identities=24% Similarity=0.380 Sum_probs=673.6 Q ss_pred CceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEc-CCceEEEEE Q lcl|NC_019510. 2 GLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINR-DEYEQYYVV 80 (799) Q Consensus 2 ~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~-~~~~~y~l~ 80 (799) =+|+|+||||+||||||+|++||++||++|+||+|+|+|||+||||++||++++.. ..+..+++.|++ ++.++|+|+ T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~~Gl~rRPgt~~va~l~~~--~~~~~~~~~~~~~~~~~~y~v~ 78 (806) T protein:vir:10 1 MEVQGSYGRQLQGVSQQPIAVRLPGQVTSQLNAVPNVVDGLKTRMGSKHLARILNS--LDANSLIHHYKRGDDAEEYFVI 78 (806) T ss_pred CeeEeecchhccceeccChhHhhhhhhhhhhcceeccccccccCCchhhhhhhcCC--CCccceEEEEEecCCceEEEEE Confidence 48999999999999999999999999999999999999999999999999998654 345667888888 567899999 Q ss_pred EeCCeEEEEe-CCCeEEEEEecCcc---c-cccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceEEEEe Q lcl|NC_019510. 81 FTGGDIKVFD-LNGQEYAVRGDKSY---V-QTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDALINVR 155 (799) Q Consensus 81 ~~~g~irv~~-~~G~~~~v~~~~~y---~-~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~~~v~ 155 (799) |.+|.||||+ .+|.++.+..++.. + .+.+++.+|+|+|+||++||+|++++|+++.+..+ .+..+++++++ T Consensus 79 ~~~g~i~v~~~~~G~~~~v~~~~~~~~yl~~~~~~~~~l~~~tvaD~tfi~n~~~~~~~~~~~~~----~~~~~~~v~v~ 154 (806) T protein:vir:10 79 LQPGQVPVIFTVGGLACPVNTQGSAATYLSSSSLPRETTQLMTIGDYTFVLNRKMPVQARGDVTP----SLDNKGLVYVA 154 (806) T ss_pred EcCCcEEEEEcCCCcEEEecCCCceEEEeccCCCCcceeeEEEEcCEEEEecCcEeeeecccccC----CCCcceEEEEe Confidence 9999999998 58988887765442 2 23557889999999999999999999999977554 24557999999 Q ss_pred cCCCCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEecCCCc Q lcl|NC_019510. 156 GGQYGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAPDGDS 235 (799) Q Consensus 156 ~~~y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~~~~~ 235 (799) +++|+++|++++++...+++++++++.. .......+++++.++..++....+..++|+....+.+++|.++++.. T Consensus 155 ~g~y~~~y~i~Ing~~~a~~~t~~~~~~-----~~~~~~~~~~~a~~l~~~l~~~~~~~~~~~~~~~g~~~~i~~~~~~~ 229 (806) T protein:vir:10 155 YANFSFTYQILINGQVAAEHKTASSEDV-----KNEDLVRTDYVAGKLLENFNSRTASFPGFSMYQDGNVLVVDNSNGAN 229 (806) T ss_pred ecccCceeeEEeccceEEEEEeccCCCc-----ccccccchhHHHHHHHhhhcccccccceeEEEEcccEEEEecCCCCc Confidence 9999999999999999999999987643 23345567888999999888877888899999999999999887754 Q ss_pred eeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEec---CceEEEEeccccccceeecc Q lcl|NC_019510. 236 IRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNL---TRKVWEETVGWNIQVGLNNG 312 (799) Q Consensus 236 ~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~---~~~~w~e~~~~~~~~~~~~~ 312 (799) ..+++.+|..++.+.++++.++++++||.+|++|+.++|+++.++..+.||++|+. +.++|+|++++++..+++.+ T Consensus 230 -~~~~~~~g~~~~~~~~~~~~v~~~~~lp~~~~~g~~v~i~~~~~~~~~~y~v~~~~~~~~~~~w~e~~~~~~~~~~~~~ 308 (806) T protein:vir:10 230 -YALTTVDGADGQDLVAIRHKVTNLDTLPNRAPVGYKVQVWPTGSKPESRYWLQAESQDGSKVTWVETIAPGVRKGWNAA 308 (806) T ss_pred -cEEEEeeCCCCceeEEeecccCccccCccccCCCcEEEEeccCCCCCCceEEEEEeeccCceEEEeecccccccceecc Confidence 46888999999999999999999999999999999999999999999999999954 56789999999999999999 Q ss_pred ceeeEEEecc-----CceEEEeecccCccccCccccccCccccC----CCeeEEEEEcceEEEecCCeEEEEccCCcccc Q lcl|NC_019510. 313 TMPWSLIRAA-----DGQFDFVANSWVGRTAGDDDTNPHPSFVG----QAITDVFFYRNRLGMLSGENIILSRTAKYFNM 383 (799) Q Consensus 313 t~p~~lv~~~-----~~t~~~~~~~w~~~~~gd~~~np~psf~~----~~p~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF 383 (799) +||+.+++.+ .++|.++.++|++|.+||+++||.|+|++ ++|++|+||||||+|++|++|||||+|||||| T Consensus 309 t~p~~~v~~~~~~~~~~~~~~~~~~w~~r~~Gd~~tn~~psF~~~~~~~~it~v~f~q~RL~f~s~~~v~~Srsgd~~nF 388 (806) T protein:vir:10 309 TMPHVLVRESLNANGSANFTYRPGEWEDRDVGDDLTNDFPSLLNDSSPQPISSMLMVQNRLMLTSGEAVVASRTSRFFDF 388 (806) T ss_pred ccceEEEeeeeeecccceeEEEecccccccccccccCccCcccCCCCCccceEEEEEeeeEEEecCCeEEEEccCCcccC Confidence 9999999876 78999999999999999999999999998 67899999999999999999999999999999 Q ss_pred ccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecccCCCCcEEeCCe Q lcl|NC_019510. 384 YPASVAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNASGVLSAKSVELNLTTEFDVNDGARPYGIGRG 463 (799) Q Consensus 384 ~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~ 463 (799) +++|+++++|||||+++++++++|+|+|+++++++|+|||+++||+|+|+++|||+|++++++|+|+|+++|+|+.+|++ T Consensus 389 ~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~ 468 (806) T protein:vir:10 389 FRYTVLATVDTDPFDVFADIEEVYNIRWSAQMDGDVVLFTSDQQFTLPGDKPLTPTSAVIRPVTQFKMTPGVKPAPSGDS 468 (806) T ss_pred ccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCCcccceeEEEEEEEeecccCCCCceEeCCe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEecCCCceeEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceee Q lcl|NC_019510. 464 VYFASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQ 543 (799) Q Consensus 464 v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v 543 (799) ++|++++|+|++++| |.|++.+|+|.++|||+|++|||+|+++++++++++|.+++|++++||+|++|||||+++||+| T Consensus 469 v~Fv~~~g~~s~vre-~~y~~~~d~~~~~DlT~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~e~~v 547 (806) T protein:vir:10 469 ILFAFDQGSYSGIRE-FFTDSYSDTKKAQPATSHVDKYIRGKVLELSASSSFNRAFIITSPDRNILYVYDWLYEGTEKVQ 547 (806) T ss_pred EEEeeCCCCeeEEEE-EEeeeeccceehhhHHHHHHHhcCCCeEEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceEE Confidence 999999998877655 5799999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeEeeecCCCeEEEEEEEeCCEEEEEEEeCC--CEEEEEEEEEeeccCC--CccccceeeceeEEEEEccccccccccc Q lcl|NC_019510. 544 QSWSHWDFGDNVTVLAANSIGSHMHVILQNGY--DIFMGSISFTKKTLDF--GNEPYRLYMDAKTRYDIPANAFNNDRYE 619 (799) Q Consensus 544 ~aW~~w~~~g~~~~~~~~~~~d~l~~~v~r~~--~~~~~~~~~~~~~~~~--~~~~~~~~lD~~~~~~~~~~~~~~~~~~ 619 (799) +|||||+++|...+.|+.+.+|+||++|+|++ ++...++.|.++..+. ..++++++|||+.++.....+.... T Consensus 548 ~aW~rw~~~~~~~~~~~~~~~d~l~~vv~R~~~~~g~~~~~iE~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~--- 624 (806) T protein:vir:10 548 NAWHKWSFPAGTVLHAVSYSNEKLYLVLTRTNTSGGVAGVYIEVMDMGDELEYGLQDRVRMDRRATLSMTYNATTRV--- 624 (806) T ss_pred EeEEeeeeCCCeEEEEEEEecCeEEEEEEEcCCcccEEEEEEEeecCCCCCCcccceeeeccccceEEEeccccccc--- Confidence 99999999998888888888999999999986 4555555544432211 1345778999998887654332111 Q ss_pred cccccccccCccccccceEEEEecCCcccccc----ccccceecCceEEEc---cCCCCCeEEEeEeeeEEEEecCeeEE Q lcl|NC_019510. 620 TTVDLNAVFGGMRWQVGKILVSDEVGEVRQYE----PPAGGWASDPTLRIV---GDMAGKRVFIGFAYEFRYEFSKFLIK 692 (799) Q Consensus 620 ~~~~~~~~~~g~~~~~g~~v~~~adG~~~~~~----~v~~g~~~~~~~~i~---~~~~~~~v~vGl~y~~~~~~~~~~~~ 692 (799) .......++.|++|+.+.+.++|..+... ++..|. .++++.. .+.++++|+|||+|+++++|+||+++ T Consensus 625 ---~~~~~l~~~~~~~~~~~~~~~~g~~~~~g~~~~~~~~~~--~~~v~~~~~~~~~~~~~v~vGl~Y~s~~~~t~p~~~ 699 (806) T protein:vir:10 625 ---WTSSALPWLPQDLSSLDAVLVSGWAGYVGGAFQFSYNAS--NNTISTNFDLAEGNTATIVVGETYWYEVEPTPPLIK 699 (806) T ss_pred ---eeeeeeccccccccceeEEEEeeccccCCceEEEEEcCc--cceEeeeeeecCCCCcEEEEeeeeeEEEEECCeeEe Confidence 12234568899999999999998765321 111111 1122222 25578899999999999999999999 Q ss_pred cCCCccceeeeecceEEEEEEEEEeeccceEEEEecCCCc--ccceeccccccCc--ccccccccccceEEEEEeecccc Q lcl|NC_019510. 693 KQDESGGFSTEDVGRLQHRRAWLNYEQSGAFYVDVTNLGR--SYRYTMSGKPLGD--TTLGQANLESGQFRFPLAGNAQY 768 (799) Q Consensus 693 ~~~g~~~~~~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~--~~~~~~~~~~~~~--~~~~~~~~~tg~~~vp~~~~~~~ 768 (799) +++++.... +|+||+|+++++.+|++|.+.|.+.++ ++.+.+.++++++ +.++.+|+.+++++||+.+|+++ T Consensus 700 ~~~~~~~~~----~r~~l~r~~~~~~~s~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~tg~~~vp~~~~~~~ 775 (806) T protein:vir:10 700 DSKDRVSYL----DTPTVGNVYLNLDMYPDFSVVVTDKETLQERTVYLANKTAGSITNVIGYIAPHEGTLRIPLRRKSTD 775 (806) T ss_pred ccCCCcccc----ccEEEEEEEEEeecceeeEEEEcccCCCcceeeeccCcccccccccccccccccceEEEEeeecCce Confidence 987665543 789999999999999999999987765 3456678888874 45678999999999999999999 Q ss_pred eEEEEEECCCCcEEEEEEEEEEEEeccccCC Q lcl|NC_019510. 769 NRVVLTSDYTTPLSIIGCGWEGNYIRRSTGI 799 (799) Q Consensus 769 ~~v~i~~~~P~P~tvl~i~~eg~y~~r~rrv 799 (799) .+|+|+|++|+||+|+||+|||+||+|+||| T Consensus 776 ~~v~i~~d~P~P~tvlai~~eg~y~~r~~rv 806 (806) T protein:vir:10 776 VSFKIRSKSPATFQLRDIEWTGSYNPRKRRV 806 (806) T ss_pred eEEEEEECCCCceEEEEEEEEEEeecccccC Confidence 9999999999999999999999999999999 No 13 >protein:vir:80253 Length: 777 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522886;genbank:gi:158345179;genbank:GeneID:5687516 Probab=100.00 E-value=8.2e-217 Score=1205.25 Aligned_cols=766 Identities=20% Similarity=0.273 Sum_probs=652.2 Q ss_pred CCceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCceEEEEE Q lcl|NC_019510. 1 MGLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYEQYYVV 80 (799) Q Consensus 1 M~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~~y~l~ 80 (799) ||+|+|+||||++|||||+|++||++||++|+||+|+|+||++||||++||++++.++......+++++.++++|+|+|+ T Consensus 1 M~~i~~~~~nf~~GvS~q~D~~ry~~q~~~~~N~~~~~~gG~~rRpGt~fv~~l~~~~~~~~~~~~~~~~~~~e~~~~l~ 80 (777) T protein:vir:80 1 MSYFAGSYRQLLFGVSQQTAKDRLEGQVESQLNMQSDLVTGPRRRSPVHLIADAMAATDANRLAYSLATFSGREVLLLVD 80 (777) T ss_pred CceeeeecchhhcccccCCchHHhhhHHhhhhcceeeeccCceeCcchHhhhhhcCCCcccceeEEEEecCCCeeEEEEE Confidence 99999999999999999999999999999999999999999999999999999998877788888888889999999999 Q ss_pred EeCCeEEEEeCCCeEEEEEecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceEEEEecCCCC Q lcl|NC_019510. 81 FTGGDIKVFDLNGQEYAVRGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDALINVRGGQYG 160 (799) Q Consensus 81 ~~~g~irv~~~~G~~~~v~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~~~v~~~~y~ 160 (799) |++++||||++.|..+.+.+..+|+. +++.++|+|+|+||++||+|++++|+++.+..++..|.++.+++++++.++|+ T Consensus 81 ~g~g~irv~~~~~g~~~~~~~~~Yl~-a~~~~~l~~~q~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~~~v~~~~~g 159 (777) T protein:vir:80 81 TLDGTLTILDDATGEVLFTGTNSYLT-AGTGRSIRFAALDDSVFVANTEVIPQTQLWSGASAYPDPTRAGYLYVVAGAFS 159 (777) T ss_pred ecCCeEEEEECCCCeEEEecCCCcee-eccccceeEEEEcCEEEEEeCCccceeeecccCCCccCcccceEEEeeccCCC Confidence 99999999998777777788889975 45677899999999999999999999999999988899999999999999999 Q ss_pred ceeeEEeccceEE-EEEecCCccccceecccccccccccchhhhhhccccc--cccccceEEEECCcEEEEEecCCCcee Q lcl|NC_019510. 161 RTLNVIFNEATRA-TIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRES--LAGNPGWTINVGTGFVNIIAPDGDSIR 237 (799) Q Consensus 161 ~ty~v~~~~~~~~-~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~--~a~~~~~t~~~~g~~i~i~a~~~~~~~ 237 (799) ++|++.+++...+ +++.+.++. ..+....+..+++.+|...+... ....++|++.+.+++++|+++++.. T Consensus 160 ~~y~i~i~~~~~~~~~t~~~~t~-----~~~~~~~~~~~ia~~L~~~~~~~~~~~s~~~~~~~~~g~~~~i~~~~~~~-- 232 (777) T protein:vir:80 160 KQYRLSITNQVTGVTTSVDVTTS-----ATEASQATGEYVITQLRTAAEADATIGTAAGFAYYQDGAYLYVTAPEAIA-- 232 (777) T ss_pred ceeeEeecCCcCceeEEEecCCc-----ccccccccchhhhhhhhhhhccccceeecCceEEEeCCcEEEEEecCcee-- Confidence 9999999865433 232222221 12334556778888887665443 2445789999999999999887643 Q ss_pred EEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccceeeccceeeE Q lcl|NC_019510. 238 GLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVGLNNGTMPWS 317 (799) Q Consensus 238 ~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~~t~p~~ 317 (799) ++.. ++.+..+....+.+.+..+||..++.++.+.+....+ ..++||++|+..++.|+||++++...++ .+||+. T Consensus 233 -~t~~-~g~~~~~~~~~~~v~~~~~lp~~~~~~~~~~~~~~~~-~~~~~y~~~~~~~~~w~e~~~~~~~~~~--~t~p~~ 307 (777) T protein:vir:80 233 -VSTD-SGSNFLRASNAASIRDAAELPAKLPADADGFIIATGA-AKNKTYFRWVDLERKWDEDASRGAQAEL--IDMPLR 307 (777) T ss_pred -EecC-CcCccceeeeeEEEeeccccccccccccceEEEeCCC-CCCceEEEEEccCcEEEEeecccccccc--cccceE Confidence 3333 4455567778888999999999999998888876554 4577999999999999999999998877 589999 Q ss_pred EEeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccE Q lcl|NC_019510. 318 LIRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLSGENIILSRTAKYFNMYPASVAVLSDDDPI 397 (799) Q Consensus 318 lv~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i 397 (799) |++.+ ++|+++..+|++|.+||+++||.|||+|++|++|+||||||+|+++++|||||+||||||+++|++++.||||| T Consensus 308 l~~~~-~~~~~~~~~w~~r~~gd~~tn~~Psf~g~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdDpI 386 (777) T protein:vir:80 308 ITYSA-PNFSLTALNYERRASGDATSNPALKFTEQGISGMTTMQGRLVLLAGEYVCMSASGNPLRWFRASVSTQSDDDPI 386 (777) T ss_pred EEecC-CceEeeccCCccccccccccCCCceecCCceeEEEEEcceeeeecCCeEEEEeccCccccccccccCCCCCccE Confidence 99876 49999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecccCCCCcEEeCCeEEEEec-CCCceeE Q lcl|NC_019510. 398 DVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNASGVLSAKSVELNLTTEFDVNDGARPYGIGRGVYFASP-RATFTSI 476 (799) Q Consensus 398 ~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~-~g~~~~~ 476 (799) +++++++++|+|+|+++++++|+|||+++||+|+|+++|||+|++++++|+|+|+++|+|+.+|++++|+++ .|+|+++ T Consensus 387 ~~~~ss~~~~~i~~~v~~~~~L~i~T~~~e~~l~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~Fv~~r~g~~s~v 466 (777) T protein:vir:80 387 EVAATAPVASPYEYAVAFNKDLVLFAKTHQGLVPGANLLTSRNATAAVVTEYSFQNSCSPVVAGRTVFFASPRSGPWSAV 466 (777) T ss_pred EEEEcCCcceeeeeeeecCCcEEEEecCceEEEeCCCcccceeEEEEEEEeeccCCCCCceEeCCeEEEEecCCCceeEE Confidence 999999999999999999999999999999999999999999999999999999999999999999999975 5678889 Q ss_pred EEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEeeEeeecCCCeE Q lcl|NC_019510. 477 NRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQSWSHWDFGDNVT 556 (799) Q Consensus 477 ~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~w~~~g~~~ 556 (799) +|++..++++|+|+++|||+|++|||++++ .++++|++|.+++|++++||+|++|||||+++||+|+|||||+|+|+|+ T Consensus 467 ~e~~~~~~~~d~y~a~Dlt~~~~hl~~~~v-~~~a~s~~p~~v~~~~~~dg~l~~~ty~~~~~e~~v~aW~r~~~~g~v~ 545 (777) T protein:vir:80 467 WEMLPSQYTDAQVEASDSTSHLPKYIAGPV-RFLATSSTTSIVVVGTSNLRELVVHEYLWQGGEKVHAAWHKWSFPQDIT 545 (777) T ss_pred eeeeecccccCceehhHHHHHHHHhcCCce-EEEEEcCCCceEEEEEcCCCeEEEEEEeecCCceEEEeeEEeccCCcEE Confidence 888766689999999999999999999985 5568999999999999999999999999999999999999999999999 Q ss_pred EEEEEEeCCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEccccccccccccccccccccCccccccc Q lcl|NC_019510. 557 VLAANSIGSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVDLNAVFGGMRWQVG 636 (799) Q Consensus 557 ~~~~~~~~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g 636 (799) ++|++ +|+||++|+|++..++|+|.... ..|...+ ...++||..... ..+.+... ....++.|+.+ T Consensus 546 ~v~~i--~d~l~~iv~r~~~~~le~~~~~~-~~d~~~~-~~~~~D~~~~~~----~~~~~~~~------~~~~~~~~~~~ 611 (777) T protein:vir:80 546 GAYFR--GDRLILLFHVAGRVILGELFMQR-LGDAQSI-PGGFLDLYRVGA----ANADEEVA------IPAFAADLYPE 611 (777) T ss_pred EEEEE--CCEEEEEEEcCCeEEEEEEeecc-CCCCccc-ceeeeeeeeeee----eeeCCccc------eeEeeccccCC Confidence 99876 89999999999999999985433 2333333 357999864322 22222111 11234444444 Q ss_pred eEEEEecCCccccccccccceec----CceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccceeeeecceEEEEE Q lcl|NC_019510. 637 KILVSDEVGEVRQYEPPAGGWAS----DPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGRLQHRR 712 (799) Q Consensus 637 ~~v~~~adG~~~~~~~v~~g~~~----~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl~l~~ 712 (799) ......+++...+......+.+. ...++++++.++++|+|||+|+++++|+||++++++|+.+++ +|+|||| T Consensus 612 ~~~~~v~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~g~~~~~----~r~~i~r 687 (777) T protein:vir:80 612 DSTFAYKLSGEFQSLGQRCGDRRVDGATVYIKVVGAQAGDQYRIGLRYLSKLGPTRPILRDPNGVPITT----ERTQLHR 687 (777) T ss_pred cceeEEEecCcccccceeeeeEEeCCceeeEEEcCCCCCCEEEEeeeeEEEEEeCceEEeCCCCceeee----cCeEEEE Confidence 33333333222222111222221 135788899999999999999999999999999999987765 7899999 Q ss_pred EEEEeeccceEEEEecCCCc-ccceeccccccCcc--cccccccccceEEEEEeecccceEEEEEECCCCcEEEEEEEEE Q lcl|NC_019510. 713 AWLNYEQSGAFYVDVTNLGR-SYRYTMSGKPLGDT--TLGQANLESGQFRFPLAGNAQYNRVVLTSDYTTPLSIIGCGWE 789 (799) Q Consensus 713 ~~~~~~~t~~~~~~v~~~~~-~~~~~~~~~~~~~~--~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~~e 789 (799) ++|+|++|++|.+.|++.++ ++++.+.+.++++. .++.|++.+|++++|+.+|+++.+|+|+|++|+||+|+||+|| T Consensus 688 ~~~~~~~sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~tg~~~vp~~~~~~~~~v~i~~d~P~P~tilsi~~e 767 (777) T protein:vir:80 688 LTWSLDSTGEVTFRVADQARGESAYTTTPLRLYSRDLGAGLPLAATATLDTPARVDMQTAQFSLETDDYYDMNITSLEYG 767 (777) T ss_pred EEEEeeccccEEEEEcCCCCcceeeeecCceecccccccccccccceEEEEEEeecCcceEEEEEECCCCceEEEEEEEE Confidence 99999999999999988776 56777888888853 4678999999999999999999999999999999999999999 Q ss_pred EEEeccccCC Q lcl|NC_019510. 790 GNYIRRSTGI 799 (799) Q Consensus 790 g~y~~r~rrv 799 (799) |+||+|+||= T Consensus 768 ~~y~~r~~r~ 777 (777) T protein:vir:80 768 FRYNQRYRRQ 777 (777) T ss_pred EEeecccccC Confidence 9999996666 No 14 >protein:vir:6326 Length: 826 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877473;genbank:gi:33300845;uniprot:Q7Y2D1;genbank:GeneID:1482615 Probab=100.00 E-value=7.5e-214 Score=1189.01 Aligned_cols=770 Identities=21% Similarity=0.273 Sum_probs=640.2 Q ss_pred CCceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcC-CceEEEE Q lcl|NC_019510. 1 MGLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRD-EYEQYYV 79 (799) Q Consensus 1 M~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~-~~~~y~l 79 (799) ||+|+|+||||++|||||+|++||++||++|+||+|+|+|||+||||++||++++.++......++|.++|+ .+|+|+| T Consensus 1 M~~i~~s~~n~~~GvSqq~d~~r~~~q~~~~~N~~~~~~~G~~rRpg~~~v~~~~~~~~~~~~~~~~~~~r~~~~~~~~~ 80 (826) T protein:vir:63 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) T ss_pred CceeeeecchhhcceeccCchHhhhhhhhhhhcceeeccCCcccCchhHhhhhhccCCccccccEEEEEecCCCceEEEE Confidence 999999999999999999999999999999999999999999999999999999887666667788888885 5667889 Q ss_pred EEeCCeEEEEeCC-CeEEEEEecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceEEEEecCC Q lcl|NC_019510. 80 VFTGGDIKVFDLN-GQEYAVRGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDALINVRGGQ 158 (799) Q Consensus 80 ~~~~g~irv~~~~-G~~~~v~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~~~v~~~~ 158 (799) +|++|+||||+++ |..+.+........+++++++|+|+|+||++||+|++++|++..+. ....++..+++++++.++ T Consensus 81 ~~~~g~irv~~~~~g~~~~~~~~~~~y~~~~~~~~l~~~t~aD~~fi~n~~~~p~~~~~~--~~~~~~~~~~~~~v~~g~ 158 (826) T protein:vir:63 81 AQHRGELYLFDERDGRLLMGQPLVHDYLKANDYRQLRAATVADDLFIANLSVKPEADRTD--IKGVDPNKAGWLYIKAGQ 158 (826) T ss_pred EecCCcEEEEEcCCCeEEEcCCCCCceeeecCccceEEEEeCCEEEEEeCCeeeeecccc--ccccCCCCcEEEEeeccc Confidence 9999999999974 5555555443333456789999999999999999999999875432 234556788999999999 Q ss_pred CCceeeEEeccc---------eEEEEEecCCccccceecccccccccccchhhhhhcccccc------------------ Q lcl|NC_019510. 159 YGRTLNVIFNEA---------TRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESL------------------ 211 (799) Q Consensus 159 y~~ty~v~~~~~---------~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~------------------ 211 (799) |+++|++++++. .+++++++.+.+....+........+++++.++..++.... T Consensus 159 Y~~~y~vti~~~~~~~gt~~s~t~t~~t~~~~~a~~~~~~~~~~~s~~yia~~l~~~~~a~~~~~~~~~t~~~~~~~~~~ 238 (826) T protein:vir:63 159 YSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDA 238 (826) T ss_pred cCceEEEEEEeccccCCccccceEEEEeccCCcccccccccceeeeeeeeeeeceeeeeeccccccCCCccccceecCCc Confidence 999999999852 35778888877666555555555566677766544322110 Q ss_pred --ccccceE--EEECCcEEEEEecCCCceeEEEEEeccCcceeeEEeeeeccceeccccccCC----eEEE----EEecC Q lcl|NC_019510. 212 --AGNPGWT--INVGTGFVNIIAPDGDSIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDG----YTVK----IVGDT 279 (799) Q Consensus 212 --a~~~~~t--~~~~g~~i~i~a~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G----~~v~----i~~~~ 279 (799) ....+++ .....+++++..+..... . -+.++++.+.+......+++..+||..++++ +.+. +.... T Consensus 239 ~a~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~~g~~~~~~~~~~~~~~~~~l~~~~p~~~~~~~~~~~~~~~~~~~ 316 (826) T protein:vir:63 239 NAATIAGYLNQRGVQDGYIAFRGDADIHV-E-VSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVMAT 316 (826) T ss_pred ccceeecceeEecccccEEEEeeCCcccE-E-EccCCCCcceEEEEEeeccceeeccccCCCcccceEEEeeEEeEEecC Confidence 0011222 222345555555544322 1 2234455667777788889999998887764 3332 34455 Q ss_pred CCCcceeEEEEecCceEEEEeccccccceeeccceeeEEE-eccCceEEEeecccCccccCccccccCccccCCCeeEEE Q lcl|NC_019510. 280 SRSADKYYVRYNLTRKVWEETVGWNIQVGLNNGTMPWSLI-RAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVF 358 (799) Q Consensus 280 ~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~~t~p~~lv-~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~ 358 (799) +...+.||++|+...++|+||+++++.. ..++||+.|+ +.++++|+++.++|++|.+||+++||+|+|+|+||++|+ T Consensus 317 g~~~d~~y~~~~~~~~~w~e~~~~~~~~--~~~tmp~~l~~~~~~~~f~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~ 394 (826) T protein:vir:63 317 GSTKAPVYFEWDSANRRWAERAAYGTDW--VLKKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVTRGITGMT 394 (826) T ss_pred CCcccceEEEEEcCCceEEEEeecCccc--ccccceEEEEEeccCCeEEEeccccccccccccccCCCccccCCCceEEE Confidence 6677899999999999999999999754 4479999998 568899999999999999999999999999999999999 Q ss_pred EEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCccccc Q lcl|NC_019510. 359 FYRNRLGMLSGENIILSRTAKYFNMYPASVAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNASGVLSA 438 (799) Q Consensus 359 f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~lTP 438 (799) ||||||+|+++++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|+++||| T Consensus 395 f~q~RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~~~q~~ls~~~~lTP 474 (826) T protein:vir:63 395 TFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVTP 474 (826) T ss_pred EEeceEEEeeCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeEEEeecCCcEEEEecCcEEEEeCCCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceEEEEEEeecccCCCCcEEeCCeEEEEecCC-CceeEEEEEeecc-ccCceehhhHHHHHHHhcCCCcEEEEEcCCCC Q lcl|NC_019510. 439 KSVELNLTTEFDVNDGARPYGIGRGVYFASPRA-TFTSINRYYAVQD-VSAVKNAEDMTMHVPSYIPNGVFSISGSSTEN 516 (799) Q Consensus 439 ~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~~~r~~~~~~-~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~ 516 (799) +|++++++|+|+|+++|+|+.+|++++|+|++| +|++++|+ .|+. .++.|+++|||+|++|||++++. .+++|++| T Consensus 475 ~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~~~s~v~e~-~~~~d~~~~y~~~dlt~~~~~l~~~~v~-~~a~s~~~ 552 (826) T protein:vir:63 475 RTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEM-APSPSTDSHYVAEDVTSHIPSYMPGPAE-YIQAAASS 552 (826) T ss_pred eeEEEEEEEeecccCCCCceEeCCeEEEEecCCCceeEEEEE-EeeeccccceehhHHHHHHHHhcCCCeE-EEEEcCCC Confidence 999999999999999999999999999999987 57777775 5765 45569999999999999998755 45889999 Q ss_pred cEEEEEEcCCCeEEEEEeeeCCCceeeEeeEeeecCCCeEEEEEEEeCCEEEEEEEeCCCEEEEEEEEEe-e---ccCCC Q lcl|NC_019510. 517 FATVLTSGAKGKVFIYKFLYIDEQIQQQSWSHWDFGDNVTVLAANSIGSHMHVILQNGYDIFMGSISFTK-K---TLDFG 592 (799) Q Consensus 517 ~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~w~~~g~~~~~~~~~~~d~l~~~v~r~~~~~~~~~~~~~-~---~~~~~ 592 (799) ++++|++++||+|++|+|||+++||+|+|||||+|+|+|+++|++ +|+||++|+|+++++++|+.++. + ..+.. T Consensus 553 ~~v~~~~~~dg~l~~~~y~~~~~e~~v~aW~~~~~~g~v~~~~~i--~d~l~~iv~r~~~~~~~r~~~e~~~~~~~~~~~ 630 (826) T protein:vir:63 553 GYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFT--GDNLMVLIQKGQEIALGRMHLNSLPAREGLQYP 630 (826) T ss_pred CEEEEEEcCCCEEEEEEEeeCCCcEEEEeEEEEecCCcEEEEEEE--CCeEEEEEEeCCCEEEEEEEEEecCCccccccC Confidence 999999999999999999999999999999999999999998887 89999999999999999985432 2 12233 Q ss_pred ccccceeeceeEEEEEccccccccccccccccccccCccccccceEEEEecCCccccc----cccccceecCceEEEccC Q lcl|NC_019510. 593 NEPYRLYMDAKTRYDIPANAFNNDRYETTVDLNAVFGGMRWQVGKILVSDEVGEVRQY----EPPAGGWASDPTLRIVGD 668 (799) Q Consensus 593 ~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~adG~~~~~----~~v~~g~~~~~~~~i~~~ 668 (799) .++++.++||...+.... .......++.|+++..+.++++|.+... .++.+|.+ +++++++ T Consensus 631 ~~d~~~~~d~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v---~l~~~~~ 695 (826) T protein:vir:63 631 KYDYWRRIEATVAGELEL------------TKQHWDLIKDASAVYQLQPVAGAYMERTHLGVKRETNTKV---FLDVPEA 695 (826) T ss_pred CccceEEEEEeeeeeecc------------CcceeecccCcccccEEEEeeCccccCCccceEEecCCEE---EEecCCC Confidence 344556677765543211 0111124788999999999999988654 45566665 4578889 Q ss_pred CCCCeEEEeEeeeEEEEecCeeEEcCCCccceeeeecceEEEEEEEEEeeccceEEEEecCCCcc--cceeccccccCc- Q lcl|NC_019510. 669 MAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGRLQHRRAWLNYEQSGAFYVDVTNLGRS--YRYTMSGKPLGD- 745 (799) Q Consensus 669 ~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~--~~~~~~~~~~~~- 745 (799) +.+++|+|||+|+++++|+||++++++|+.+++ +|+||||+++++.+||+|.++|++..++ ..+.+.+.+++. T Consensus 696 ~~~~~v~VGl~y~s~~~~~~~~~~~~~g~~~~~----gr~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~ 771 (826) T protein:vir:63 696 VVGAVYVVGCEFWSKVEFTPPVLRDHNGLPMTS----TRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSR 771 (826) T ss_pred ccccEEEEeeeeeEEEEecceEEEccCCCccee----ccEEEEEEEEEeeccccEEEEecCccccceeEeecCCceeccc Confidence 999999999999999999999999999988765 8999999999999999999999988775 345677888874 Q ss_pred -ccccccccccceEEEEEeecccceEEEEEECCCCcEEEEEEEEEEEEeccccCC Q lcl|NC_019510. 746 -TTLGQANLESGQFRFPLAGNAQYNRVVLTSDYTTPLSIIGCGWEGNYIRRSTGI 799 (799) Q Consensus 746 -~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~~eg~y~~r~rrv 799 (799) ..++.|++.++++++|+.+++++.+|+|+|+.|+||+|++|+|||+||+|+||| T Consensus 772 ~~~~g~p~~~t~~~~vP~~~~~~~~~i~i~~d~P~p~~il~i~~~~~yn~r~rrv 826 (826) T protein:vir:63 772 QLNAGEPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) T ss_pred ccccccccccceEEEEEEeeccceEEEEEEeCCCCcEEEEEEEEEEEEeceeecC Confidence 356788889999999999999999999999999999999999999999999999 No 15 >protein:vir:78957 Length: 826 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522826;genbank:gi:158345061;genbank:GeneID:5687447 Probab=100.00 E-value=2.2e-210 Score=1170.03 Aligned_cols=775 Identities=20% Similarity=0.282 Sum_probs=618.7 Q ss_pred CCceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcC-CceEEEE Q lcl|NC_019510. 1 MGLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRD-EYEQYYV 79 (799) Q Consensus 1 M~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~-~~~~y~l 79 (799) ||+|+|+||||+||||||+|++||++||++|+||+|+|+|||+||||++||++++.++......+.|.++++ ++|+|+| T Consensus 1 M~~i~~~~~nl~gGvSqq~d~~r~~~q~~~~~N~~~~~~gG~~rRpgt~~va~~~~~~~~~~~~f~~~~~r~s~e~~~~l 80 (826) T protein:vir:78 1 MSYKQSAYPNLLMGVSQQVAFERLPGQLSEQINMVSDPVSGLRRRSGIELMASLLHTDQPWPRPYLYHTNLGGRSIAMLV 80 (826) T ss_pred CcceeeecchhccceecccchHhhhhhhhhhhcceeccccccccCCchHhhhhhccCCcCCceeEEEEeccCCcceEEEE Confidence 999999999999999999999999999999999999999999999999999999887665555666666665 5567889 Q ss_pred EEeCCeEEEEeC-CCeEEEEEecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeec-cccCCCCcCCCcceEEEEecC Q lcl|NC_019510. 80 VFTGGDIKVFDL-NGQEYAVRGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADG-NLTNGGTFNDQKDALINVRGG 157 (799) Q Consensus 80 ~~~~g~irv~~~-~G~~~~v~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~-~~~~~~~~~~~~~~~~~v~~~ 157 (799) +|++|+||||++ +|..+..+......+++.+.++|+|+|+||+|||+|++++|++.. +.. ..++..+++++++.+ T Consensus 81 ~~~~g~irv~~~~~g~~~~~~~~~~~y~~~~~~~~l~~~t~aD~~fi~n~~~~p~~~~~~~~---~~~~~~~~~~~v~~g 157 (826) T protein:vir:78 81 AQHRGELYLFDEKDGRLLMGQPLVHDYLKASDYRQLRAATVADDLFIANLEVRPEADKADVL---GVDPSKTGWLYIKAG 157 (826) T ss_pred EEcCCcEEEEECCCCEEEEecCcccceeecCCcceeEEEEEcCEEEEEcCcEeeeecccccc---CCCCCceEEEEeccc Confidence 999999999985 555554443233333455667999999999999999999998743 222 234567899999999 Q ss_pred CCCceeeEEeccc---------eEEEEEecCCccccceecccccccccccchhhhhhcccccc----cc----------- Q lcl|NC_019510. 158 QYGRTLNVIFNEA---------TRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESL----AG----------- 213 (799) Q Consensus 158 ~y~~ty~v~~~~~---------~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~----a~----------- 213 (799) +|+++|++++++. .++++++|++++..........+....+++.+|..+..... .. T Consensus 158 ~y~~~y~v~i~~~~~~~~~~~s~t~~y~t~~~~~~~~~~~~~~~~~~~~~~a~~l~~~~~~~~~~~~~~~t~~~~~~~~~ 237 (826) T protein:vir:78 158 QYSKAFSLTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLFGKFFGAPEYTLPNSTKKYPKVDPD 237 (826) T ss_pred ccCceeEEEeccceeecccccceeEEEEeccCCccccccccccceecchhhheecceeeccccceeeeccceeEeecccc Confidence 9999999999863 24778888888776666666666677777777654321110 00 Q ss_pred -----ccceEE--EECCcEEEEEecCCCceeEEEEEeccCcceeeEEeeeeccceec----cccccCCeEEEE----Eec Q lcl|NC_019510. 214 -----NPGWTI--NVGTGFVNIIAPDGDSIRGLQTKDGYADQLISPVTHYAQTFAKL----PQNAPDGYTVKI----VGD 278 (799) Q Consensus 214 -----~~~~t~--~~~g~~i~i~a~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l----~~~~~~G~~v~i----~~~ 278 (799) ..++.. ...++++++.++....+ ..+.+++++.........++.+++| |..+.+|+.+.+ ... T Consensus 238 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~--~~~~~~g~~~~~~~~~~~v~~~~~l~a~~p~~~~~~~~~~~~~~~~~~ 315 (826) T protein:vir:78 238 PAAATVAGYLNQRGVQDGYIAFRGDGDIVV--EVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAIMA 315 (826) T ss_pred ccceeeccceeecccccceEEEecCCCeEE--EeccCCCccceEEEeeEEEecccceeeeecccccceEEEEEEeeeEec Confidence 000000 11223455555433322 2233444455555555566666665 555666766653 344 Q ss_pred CCCCcceeEEEEecCceEEEEeccccccceeeccceeeEEE-eccCceEEEeecccCccccCccccccCccccCCCeeEE Q lcl|NC_019510. 279 TSRSADKYYVRYNLTRKVWEETVGWNIQVGLNNGTMPWSLI-RAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDV 357 (799) Q Consensus 279 ~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~~t~p~~lv-~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v 357 (799) .++..+.||++|+..++.|+||+++++. ++.++||+.++ ++++++|+++..+|++|.+||+++||.|||+|++|++| T Consensus 316 ~g~~~~~~y~~~~~~~~~w~e~a~~g~~--~~~~tmp~~l~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g~~i~~v 393 (826) T protein:vir:78 316 TGSTKAPVYFAWDAANRRWAERAAYGTD--WVLKKMPLALRWDESTDTYSLNELEYDRRGSGDEETNPTFNFVKRGITGM 393 (826) T ss_pred CCCcccceeEEEEcCCceEEEeeccCcc--cccccccEEEEEecCCCeEEEeeccccccccCcccccCcccccCCCceEE Confidence 5677889999999999999999999975 56779999998 45789999999999999999999999999999999999 Q ss_pred EEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCcccc Q lcl|NC_019510. 358 FFYRNRLGMLSGENIILSRTAKYFNMYPASVAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNASGVLS 437 (799) Q Consensus 358 ~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~lT 437 (799) +||||||+|+++++|||||+||||||++++++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|+++|| T Consensus 394 ~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~~s~~~~~i~~~v~~~~~L~l~T~~~e~~l~~~~~lT 473 (826) T protein:vir:78 394 TTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPGGGIVT 473 (826) T ss_pred EEEeceEEEeeCCeEEEEeccCccccccccccCCCCCCcEEEEEccCcceeEEEEEecCCcEEEEecCcEEEEeCCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceEEEEEEeecccCCCCcEEeCCeEEEEecCC-CceeEEEEEeecc-ccCceehhhHHHHHHHhcCCCcEEEEEcCCC Q lcl|NC_019510. 438 AKSVELNLTTEFDVNDGARPYGIGRGVYFASPRA-TFTSINRYYAVQD-VSAVKNAEDMTMHVPSYIPNGVFSISGSSTE 515 (799) Q Consensus 438 P~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~~~r~~~~~~-~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~ 515 (799) |+|++++++|+|+|+++|+|+.+|++++|++++| .|++++|+ .|+. .++.|+++|||+|++|||+++++. +|+|++ T Consensus 474 P~~~~~~~~s~~~~~~~~~Pv~vG~~v~F~~~r~~~~s~v~e~-~~~~~~~~~y~~~dlt~~~~~l~~~~v~~-~a~s~~ 551 (826) T protein:vir:78 474 PRTAVISITTQYDVDTRAAPAVTGRSVYFAAERALGFMGLHEM-APSPSTDSHYVAEDVTSHIPSYMPGPAEY-IQAAAS 551 (826) T ss_pred ceeEEEEEEEeecccCCCCceEeCCeEEEEecCCCceeEEEEE-EeeecccCccchHHHHHHHHHhcCCCeEE-EEEeCC Confidence 9999999999999999999999999999998887 57777765 5765 455699999999999999997655 588999 Q ss_pred CcEEEEEEcCCCeEEEEEeeeCCCceeeEeeEeeecCCCeEEEEEEEeCCEEEEEEEeCCCEEEEEEEEEeeccCCCcc- Q lcl|NC_019510. 516 NFATVLTSGAKGKVFIYKFLYIDEQIQQQSWSHWDFGDNVTVLAANSIGSHMHVILQNGYDIFMGSISFTKKTLDFGNE- 594 (799) Q Consensus 516 ~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~w~~~g~~~~~~~~~~~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~- 594 (799) |++++|++++||+|++|||||+++||+|+|||||+|+|+|++||++ +|+||++|+|+++++++|+.++....+...+ T Consensus 552 ~~~~v~~~~~~g~l~~~ty~~~~~e~~v~aW~~~~~~g~v~~v~~i--~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~~ 629 (826) T protein:vir:78 552 SGYLVFGTSAADEMICHQYLWQGNEKVQNAYHRWTLRHQIIGAYFT--GDNLMVLIQKGQEIALGRMHLNSLPAREGLQY 629 (826) T ss_pred CCeEEEEEcCCCeEEEEEEEecCCcEEEEeEEEEccCCcEEEEEEE--CCeEEEEEEeCCCEEEEEEEEEecCCCccccc Confidence 9999999999999999999999999999999999999999998877 8999999999999999998654333222111 Q ss_pred cc-ceeeceeEEEEEccccccccccccccccccccCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCe Q lcl|NC_019510. 595 PY-RLYMDAKTRYDIPANAFNNDRYETTVDLNAVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKR 673 (799) Q Consensus 595 ~~-~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~ 673 (799) .. .++.++. +.+++......... +......+..|++|+.+....++.+.... ...|. +++++++++.+++ T Consensus 630 ~~~~~~~~~~--~~~~~~~~~~~~~~---~~~~~~~~~~~~~g~~~~~~~~~~~~~~~-~~~~~---~~l~~~~~~~~~~ 700 (826) T protein:vir:78 630 PKYDYWRRIE--ATVDGELELTKQHW---DLIKDGAAVYQLQPQVGAYMERYQLGVKR-ETSTK---VFLDVPEAVVGSV 700 (826) T ss_pred cccceeEEEE--EEEcceecccccee---EEecCCceeeeeccceeeeccccceeccc-cCCCc---eEEEeCCCccccE Confidence 11 1122222 22232222211111 11122345678888877766666654333 23333 4678999999999 Q ss_pred EEEeEeeeEEEEecCeeEEcCCCccceeeeecceEEEEEEEEEeeccceEEEEecCCCcccc--eeccccccCccc--cc Q lcl|NC_019510. 674 VFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGRLQHRRAWLNYEQSGAFYVDVTNLGRSYR--YTMSGKPLGDTT--LG 749 (799) Q Consensus 674 v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~~~--~~~~~~~~~~~~--~~ 749 (799) |+|||+|+++++|+||++++++|+.+++ +|+||||+++++.+|+.|.+.|++..++.. +...+.+++.+. .+ T Consensus 701 v~VGl~y~s~~~~~~~~~~~~~g~~~~~----~r~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~l~~g 776 (826) T protein:vir:78 701 YVVGCEFWSKVEFTPPVLRDHNGLPMTS----TRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLSSRQLNAG 776 (826) T ss_pred EEEeeceeEEEEeCceEEecCCCcceee----cceEEEEEEEEeeccccEEEEeCCCccCcceeeeecccccccccccCC Confidence 9999999999999999999999988766 889999999999999999999998877543 334444555433 35 Q ss_pred ccccccceEEEEEeecccceEEEEEECCCCcEEEEEEEEEEEEeccccCC Q lcl|NC_019510. 750 QANLESGQFRFPLAGNAQYNRVVLTSDYTTPLSIIGCGWEGNYIRRSTGI 799 (799) Q Consensus 750 ~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~~eg~y~~r~rrv 799 (799) +|++.++++++|+.+|+++.+|+|+|+.|+||+|++|+|||+||+|+||| T Consensus 777 ~~~~~t~~v~vp~~~~~~~~~i~i~~d~P~P~tvlai~~~~~y~~r~rrv 826 (826) T protein:vir:78 777 EPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) T ss_pred cccccceEEEEeeeccCceEEEEEEeCCCCcEEEEEEeEEEEecceeecC Confidence 56677999999999999999999999999999999999999999999999 No 16 >protein:vir:100022 Length: 976 # NCBI annotation: T7-like tail tubular protein B # Family: family:all:825 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214208;genbank:gi:61806431;genbank:GeneID:3294702 Probab=100.00 E-value=3.5e-209 Score=1163.39 Aligned_cols=778 Identities=23% Similarity=0.376 Sum_probs=651.1 Q ss_pred CCceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCccc----ceEEEEEEcCCceE Q lcl|NC_019510. 1 MGLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGA----APLVHLINRDEYEQ 76 (799) Q Consensus 1 M~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~----~~~l~~~~~~~~~~ 76 (799) ||+|+|+||||++|||||+|.+|+|+||++|+||+|||+.||+||||++||+.|.+.+.... ...+|.++|++.++ T Consensus 1 M~~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRp~~~~v~~l~~~~~~~~~~~~~~~~~~~~r~~~e~ 80 (976) T protein:vir:10 1 MASVTQTIPTLTGGLSQQPDELKIPGQVSVANNVIPDVTHGLLKRPGGKLVASISDNGTAALNSQTNGKWFSYYRDETES 80 (976) T ss_pred CcceeecchhhhCcceecchhhcCCchhhhhhccccccccccccCCcceeeeeecCCCcccccccccceEEEEEcCCCcE Confidence 99999999999999999999999999999999999999999999999999999887765543 34669999999999 Q ss_pred EEEEEe-CCeEEEEeC-CCeEEEEEecCcc---c---cccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCc Q lcl|NC_019510. 77 YYVVFT-GGDIKVFDL-NGQEYAVRGDKSY---V---QTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQK 148 (799) Q Consensus 77 y~l~~~-~g~irv~~~-~G~~~~v~~~~~y---~---~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~ 148 (799) |++.+. +|.|+|||+ +|.+++|++.... + .++++.++|+++++||++||+|+++.|++.... ...+.+ T Consensus 81 y~~~~~~~g~~~v~~~~~G~~~~v~~~~~~~~~~~~yl~~~~~~~~~~~tv~d~tfi~N~~~~~~~~~~~----~~~~~~ 156 (976) T protein:vir:10 81 YIGQVSRSGDINMWRCSDGQAMTVNYDSGTATALTTYLTHTNDEDIQTLTLNDYTFLTNRTKTVAMSSTV----EPVRPP 156 (976) T ss_pred EEEEEecCCceEEEEccCCeEEEEEcCCCcccccchhhccCCcceeEEEEEccEEEEecCceEEeecccc----cCCCCc Confidence 998887 678999997 7999999877642 1 235888999999999999999999999875433 234566 Q ss_pred ceEEEEecCCCCceeeEEeccceEE---EEEecC---------------------------------------------- Q lcl|NC_019510. 149 DALINVRGGQYGRTLNVIFNEATRA---TIKLPS---------------------------------------------- 179 (799) Q Consensus 149 ~~~~~v~~~~y~~ty~v~~~~~~~~---~~~tp~---------------------------------------------- 179 (799) .++++++.++|+++|++++++...+ ++.++. T Consensus 157 ~~~~~v~~~~y~~~y~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~s~~~G~~~~~~~v 236 (976) T protein:vir:10 157 EVFIDLKATAYARQYAVNLFDNTTTTAVSTVTRIDVELIKSSNNYCDSNGAMVARTSRPSNSTRCDDSAGDGRDAYAPNV 236 (976) T ss_pred eEEEEeeeeccceEEEEEEcCCcccceeeeeeeeeeccccCCcccccccccchhhHhHhhhhhcccccccccccccCcee Confidence 7999999999999999999765322 111110 Q ss_pred ---------Cc----------------------------------------------cc--------------------- Q lcl|NC_019510. 180 ---------GT----------------------------------------------GT--------------------- 183 (799) Q Consensus 180 ---------gt----------------------------------------------~~--------------------- 183 (799) |. ++ T Consensus 237 ~~~~f~~~~G~~~~i~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~Y~~~y~~~~~v~~~ 316 (976) T protein:vir:10 237 GTKVFNVTDGASLTDEANSGSYTYTIDVKDSSNNSVNRGVNLYFRIRTVGQSVPFTTGSGSSATTTYQARYTTTFDLLYG 316 (976) T ss_pred eeeEEEeccCccceEcCCcceEEEEeeccccceEEeecCCceEEEEccccceeecccccccceeeeeeEEEEeEEEEecC Confidence 00 00 Q ss_pred -cce-------ec--------------------------------ccccccccccchhhhhhccccccccccceEEEECC Q lcl|NC_019510. 184 -TPP-------IE--------------------------------EQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGT 223 (799) Q Consensus 184 -~~~-------~~--------------------------------~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g 223 (799) +.+ +. +..+...+++++.+|...+... ....++++...+ T Consensus 317 ~~g~~~~~~~~V~v~g~~Y~it~~~~~~~~~~a~~~~~~~~~t~~d~~~~~~~~~ia~~L~~~l~a~-~~~~g~tv~~~g 395 (976) T protein:vir:10 317 GTGWQEGDYFYVWMKDGYYKITVEAISTANVQANLGLIRPNPTPFDTETAVTAESIIGDIRTAIIAT-GNFTSANVQQIG 395 (976) T ss_pred CCCcccCceEEEEccccceeeEEEEeeceeEEeccccccCcCCcCcccccccHHHHHHHHHHhhccc-ccccceEEEEcC Confidence 000 00 0000112223333333333211 223567888899 Q ss_pred cEEEEEecCCCceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCc-----eEEE Q lcl|NC_019510. 224 GFVNIIAPDGDSIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTR-----KVWE 298 (799) Q Consensus 224 ~~i~i~a~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~-----~~w~ 298 (799) ++++|+++++. +..+..+ +..++.+++.|+++++||..|++|++|+|.+ +++..|+||++|+..+ ++|+ T Consensus 396 ~~~~i~~~~~~--~~~s~~~---~~~~~~~~~~V~~~~~LP~~~~~g~~v~V~~-~~~~~d~yyv~~~~~~~~~~~~~w~ 469 (976) T protein:vir:10 396 TGLYVTRPSGT--FNVTAPS---SDLLRVMSGEVANVDDLPSQCKHGYVVKVAN-SEADADDYYVKFFGHNNRDGDGVWE 469 (976) T ss_pred cEEEEEecCcc--eEecCCC---ceeEEEEEeeecchhhhhhhccCCcEEEEec-CCCCceeEEEEeeccccccccceEE Confidence 99999988864 2333222 3468999999999999999999999999955 4457799999997543 5899 Q ss_pred EeccccccceeeccceeeEEEeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEecCCeEEEEccC Q lcl|NC_019510. 299 ETVGWNIQVGLNNGTMPWSLIRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLSGENIILSRTA 378 (799) Q Consensus 299 e~~~~~~~~~~~~~t~p~~lv~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~~~~v~~Sr~g 378 (799) ||++++...++++++|||.|+++++++|+++.++|+.|.+||+++||+|+|+|++|++|+||||||+|+++++|||||+| T Consensus 470 E~~~~g~~~g~~~~tmP~~l~~~~~g~f~~~~~~w~~r~vGd~~tnp~psf~g~~is~v~f~q~RL~f~s~~~v~~Srtg 549 (976) T protein:vir:10 470 ECAKPSRNIEFDKGTMPIQLVRQANGTFTVSQATWQNAEVGDELTNPNPSFVGKTINQLVFFRNRLVFLSDENVIMSRPG 549 (976) T ss_pred EeeccccccccccccccEEEEecccCeEEeeeccccccccCCcccCcCceecccccceEEEEcceEEEecCCeEEEEecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccccccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCC-ccccccceEEEEEEeecccCCCCc Q lcl|NC_019510. 379 KYFNMYPASVAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNAS-GVLSAKSVELNLTTEFDVNDGARP 457 (799) Q Consensus 379 d~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~-~~lTP~~~~~~~~s~~~~~~~~~P 457 (799) |||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+++ ++|||+|++++++|+|+|++.|+| T Consensus 550 d~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~e~~lsg~~~~lTP~t~~i~~~s~~~~~~~v~P 629 (976) T protein:vir:10 550 EFFNFWSKTATTFTPQDVIDLSCSSTYPAIVYDGIQVNAGLLLFTKNQQFMLTTDSDILSPETAKINAVSSYNFNEKTHP 629 (976) T ss_pred CccccccccccCCCCCccEEEEecCCcceeeEEEEecCCcEEEEecCceEEEecCCceecceeEEEEEEEeeeccCCCcc Confidence 9999999999999999999999999999999999999999999999999999986 599999999999999999999999 Q ss_pred EEeCCeEEEEecCCCceeEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeC Q lcl|NC_019510. 458 YGIGRGVYFASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYI 537 (799) Q Consensus 458 v~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~ 537 (799) +.+|++++|++++|+++|+++ |.|+..+++|.+.|||+|++|||+|.+ .+++++++|.+++|++++||+|++|||||+ T Consensus 630 v~vG~~v~Fv~~~g~~~r~~~-~~~~~~~~~~~~~dlt~~~~~l~~g~~-~~~a~~~~~~~vv~~~~~~g~l~~~ty~~~ 707 (976) T protein:vir:10 630 VSLGTTVAFIDNANQFTRFFE-MSNVVRQGEPDVVDQSKVISRLLDKNI-SLVSVSRENSVVFFSQKDTDKIYCFRYFTS 707 (976) T ss_pred EEeCCeEEEEecCCCeEEEEE-EeecccccccchhHHHHHhhhhcCCce-EEEEEcCCCcEEEEEEcCCCEEEEEEEeec Confidence 999999999999999888776 467878899999999999999999875 567899999999999999999999999999 Q ss_pred CCceeeEeeEeeecCCCeEEEEEEEeCCEEEEEEEeCCCEEEEEEEEEeeccCC------------CccccceeeceeEE Q lcl|NC_019510. 538 DEQIQQQSWSHWDFGDNVTVLAANSIGSHMHVILQNGYDIFMGSISFTKKTLDF------------GNEPYRLYMDAKTR 605 (799) Q Consensus 538 ~~e~~v~aW~~w~~~g~~~~~~~~~~~d~l~~~v~r~~~~~~~~~~~~~~~~~~------------~~~~~~~~lD~~~~ 605 (799) ++||+|+|||||+|+|++++||++ +|+||++|+|+++++++|+.+.++.... ..+.++++||+... T Consensus 708 ~~eq~v~aWsr~~~~G~v~sv~~i--~D~ly~vV~r~~~g~~~r~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~ 785 (976) T protein:vir:10 708 GEKRLLQAWTTWTITGNIQYHCML--DDALYVVTRNNNKDQIVKYSLKLDDAGHFVTDTQGTTSTDDDSIYRVHLDHSSS 785 (976) T ss_pred CCceeEEeeEEEecCCcEEEEEEe--CCeEEEEEEecCCeEEEEEEEEECCccceeeeccCccccccCCcceeeeccceE Confidence 999999999999999999999987 7999999999999999999887654211 12456789999999 Q ss_pred EEEccccccccccccccccccccCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEE Q lcl|NC_019510. 606 YDIPANAFNNDRYETTVDLNAVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYE 685 (799) Q Consensus 606 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~ 685 (799) +.+..+.++.+...+....+. ..+.+++.+...+||.... ..+....+..++++|++++++++|||||+|+++++ T Consensus 786 ~~~~~~t~~~~t~~t~~~~~~----~~~~~~~~~~~~~d~~~~~-~~~~~~~v~g~~i~l~g~~~~~~v~VGl~Y~s~~~ 860 (976) T protein:vir:10 786 VTAASNTYNTTTIKTTIPKPN----GYESTKQLVAYDTDAGNDL-GRYALVTVSGSNLEIPGNWSNNSFIIGYLYEMDVQ 860 (976) T ss_pred EEeccccccCCceeEEeecCc----cccCceeEEEEecccCccc-ccceeeeecCCeeEecCCCCCCeEEEeeeeEEEEe Confidence 999888888777666655443 3445677777778775432 22334456667889999999999999999999999 Q ss_pred ecCeeEEcCCCccceeeeecceEEEEEEEEEeeccceEEEEecCCCcc-cceeccccccCcccccccccccce-EEEEEe Q lcl|NC_019510. 686 FSKFLIKKQDESGGFSTEDVGRLQHRRAWLNYEQSGAFYVDVTNLGRS-YRYTMSGKPLGDTTLGQANLESGQ-FRFPLA 763 (799) Q Consensus 686 ~~~~~~~~~~g~~~~~~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~-~~~~~~~~~~~~~~~~~~~~~tg~-~~vp~~ 763 (799) |+||++++++|.++ .....+||+|||+++++.+|+.|++.+++.+++ +...+...+.+....+.+|+.++. ++||+. T Consensus 861 ~~~~~i~~~~g~~~-~~~~~gRl~i~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~pl~~~~~~~vP~~ 939 (976) T protein:vir:10 861 LPTLYVTQQVGDKY-RSDAKSSLIVHRIKFSFGPLGVYSTTIQRDGKPDFTETKELGLAGVVGASRLPIVPEVIETVPCY 939 (976) T ss_pred ecceeEEeCCCCcc-cccceeeEEEEEEEEEeecccceEEEEcCCCCccccccccccccCcccccccceecCcEEEEEec Confidence 99999999987664 456778999999999999999999999887664 344444455555556678887765 789999 Q ss_pred ecccceEEEEEECCCCcEEEEEEEEEEEEecc-ccCC Q lcl|NC_019510. 764 GNAQYNRVVLTSDYTTPLSIIGCGWEGNYIRR-STGI 799 (799) Q Consensus 764 ~~~~~~~v~i~~~~P~P~tvl~i~~eg~y~~r-~rrv 799 (799) +|+++.+|+|+|+.|+||+|++|+|||+||+| +||| T Consensus 940 ~~~~~~~v~i~~d~PlP~tilsi~~eg~yn~r~~r~~ 976 (976) T protein:vir:10 940 ERNTNLKVNVKSEHPAPATLYSLAWEGDFTNRFYKRV 976 (976) T ss_pred cCCceeEEEEEECCCCceEEEEEEEEEEeccceeecC Confidence 99999999999999999999999999999999 6677 No 17 >protein:vir:78703 Length: 905 # NCBI annotation: tail tube B # Family: family:all:825 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285450;genbank:gi:148724484;genbank:GeneID:5220174 Probab=100.00 E-value=2.8e-206 Score=1147.48 Aligned_cols=772 Identities=26% Similarity=0.406 Sum_probs=637.3 Q ss_pred CCceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCceEEEEE Q lcl|NC_019510. 1 MGLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYEQYYVV 80 (799) Q Consensus 1 M~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~~y~l~ 80 (799) ||+|+|+||||++|||||+|.+|+|+||++|+||+|||+.||+||||++||+.|.++ +.+..++|+++|++.|+|++. T Consensus 1 M~~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRp~~~~i~~l~~~--~~~~~~~~~~~r~~~e~y~~~ 78 (905) T protein:vir:78 1 MGAVLQKIPNLLGGVSQQPDPVKLPGQVREAENVYLDPTFGCRKRPATKFVGELATN--LPSDTRWFPIFRDAGERYAVA 78 (905) T ss_pred CccceecchhhhCceeecchhhcCCcchhhhhccccccccccccCchhhhhhhhcCC--CCCCceEEEEEeCCCceEEEE Confidence 999999999999999999999999999999999999999999999999999998765 456788999999999999999 Q ss_pred EeC-C----eEEEEeC-CCeEEEEEecC---ccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceE Q lcl|NC_019510. 81 FTG-G----DIKVFDL-NGQEYAVRGDK---SYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDAL 151 (799) Q Consensus 81 ~~~-g----~irv~~~-~G~~~~v~~~~---~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~ 151 (799) ++. | .|||||+ +|.+++|++++ .|+. +.+.++|+++++||++||+|+++.|++... ....+.++|+ T Consensus 79 ~~~~g~~~~~i~v~d~~~G~~~~V~~~~~~~~yl~-~~~~~~l~~~tv~d~tfi~N~~~~~~~~~~----~~~~~~~~~~ 153 (905) T protein:vir:78 79 LYKDGSGNTQVRVWDMQTGAERTVTPDATATAYLA-TTNLNNLNWLTVADYTLLSNKERIVTMSGA----SEVDSNQRAL 153 (905) T ss_pred EeeCCCCCcceEEEEccCCcEEEEecCCCccceee-cCCCcceEEEEEcCEEEEEcCceeeeecCC----CCcCCCCeEE Confidence 863 3 4999998 89999998753 5664 456889999999999999999999976543 3456778999 Q ss_pred EEEecCCCCceeeEEeccceEEEE---EecCCcccc-------------------------------------------- Q lcl|NC_019510. 152 INVRGGQYGRTLNVIFNEATRATI---KLPSGTGTT-------------------------------------------- 184 (799) Q Consensus 152 ~~v~~~~y~~ty~v~~~~~~~~~~---~tp~gt~~~-------------------------------------------- 184 (799) ++++.++|+++|.++|++...... +++.+...+ T Consensus 154 ~~v~~g~y~r~y~v~I~~~~~~~~~t~~~~~a~~~s~~s~~~~~~g~~~~~~~~~~~~~t~~~~~~l~f~~~~~~~~~~~ 233 (905) T protein:vir:78 154 VEINAISYNTTYSIDLDRDGASQQVKVYRAKALEISPGSFEVEDGGVCTEHDVQNYTNQTIGSSTGLAFQVRVQCAAYLE 233 (905) T ss_pred EEEEeeccceeEEEEEeCCCCceeeeeeccccceeccccccccccccccccceeeeecceeeccCCceeEEeeccccccC Confidence 999999999999999987543221 111110000 Q ss_pred -------------------------------------ceeccc----------c--c-----ccccccchhhhhhccccc Q lcl|NC_019510. 185 -------------------------------------PPIEEQ----------V--A-----AVDAQHIAEELAKQIRES 210 (799) Q Consensus 185 -------------------------------------~~~~~~----------~--a-----~~~~~~i~~~l~~~~~~~ 210 (799) +.+... + . ....+.+...|..++..+ T Consensus 234 ~~~~~~~~~~~~~l~~g~~~~~~~~~~~v~~~g~~y~i~i~~~~~~~~~~~~~~~~~~t~~d~~a~~~~~~~i~~~l~~~ 313 (905) T protein:vir:78 234 NNEYRSRYNVSVVLQNGGTGFRKGDMITVNLNGRDYNIRVTQEEFVYTYASDGTAAHTTPQDSTAGTLDIGQITAGLVNS 313 (905) T ss_pred CCcccccccceeeeeccccccccCccEEEeeccceEEEEEecceeEEEecCCCcccccCccCccCccccHHHHHHHHHHh Confidence 000000 0 0 000000111233334444 Q ss_pred cccccceEEEECCcEEEEEecCCCceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEE Q lcl|NC_019510. 211 LAGNPGWTINVGTGFVNIIAPDGDSIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRY 290 (799) Q Consensus 211 ~a~~~~~t~~~~g~~i~i~a~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~ 290 (799) ....++|++.+.++.++|.++++... .+++++|..++.+.++++.|+++++||.+|++|++++|.++.+.+.|.||++| T Consensus 314 ~~~~~~~~~~~~g~~i~v~~~~~~~~-~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~~~~g~~v~v~~~~~~~~d~yyv~~ 392 (905) T protein:vir:78 314 VNLISNYSAQAVGNVIEIERTDGRDF-NLGVRGGATNRAMTAIKGTANSIVDLPGQCFDGFELKVINTENAESDDYYVVF 392 (905) T ss_pred hcccccEEEEecCcEEEEEecCCCcc-EEEEeccCCcceEEEEeccccccccCccccCCCcEEEEEeCCCCCcceEEEEE Confidence 55678899999999999999988654 58999999999999999999999999999999999999999999999999999 Q ss_pred e------cCceEEEEeccccccceeeccceeeEEEeccCceEEEeecc-------cCccccCccccccCccccCCCeeEE Q lcl|NC_019510. 291 N------LTRKVWEETVGWNIQVGLNNGTMPWSLIRAADGQFDFVANS-------WVGRTAGDDDTNPHPSFVGQAITDV 357 (799) Q Consensus 291 ~------~~~~~w~e~~~~~~~~~~~~~t~p~~lv~~~~~t~~~~~~~-------w~~~~~gd~~~np~psf~~~~p~~v 357 (799) + .+++.|+||+++++..+++++||||.|++++.|+|+++..+ |++|.+||+++||.|+|+|++|++| T Consensus 393 ~~~~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~r~~~g~f~~~~~~~~~~~~~~~~r~~Gd~~Tnp~psf~g~~is~v 472 (905) T protein:vir:78 393 RSAAEGIPGSGSWEETVAPGIERGFNTSTMPHALIRQADGNFTLEALNDEGTITGWAQREVGDDDTNPKPSFVGRGISDM 472 (905) T ss_pred EecccCCcCceeEEEecccccccccccccccEEEEEecCceEEEEEeccccccccccccccCCcccCCCCcccCCCcceE Confidence 5 34679999999999999999999999999999999999887 9999999999999999999999999 Q ss_pred EEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCc-cc Q lcl|NC_019510. 358 FFYRNRLGMLSGENIILSRTAKYFNMYPASVAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNASG-VL 436 (799) Q Consensus 358 ~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~-~l 436 (799) +||||||+|+++++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|++ +| T Consensus 473 ~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdDpI~~~~ss~~~~~i~~~v~~~~~L~ifT~g~ef~lsg~~~~l 552 (905) T protein:vir:78 473 FFYNNRLGFLSEDAVIMSQPGDYFNFFVTSAITISDSDPIDVTASSTKPAILRAAIGAPKGLILFAENSQFLLASQEVVF 552 (905) T ss_pred EEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeEEEeecCCcEEEEecCceEEEecCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999865 79 Q ss_pred cccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCC Q lcl|NC_019510. 437 SAKSVELNLTTEFDVNDGARPYGIGRGVYFASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTEN 516 (799) Q Consensus 437 TP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~ 516 (799) ||+|++++++|.|+|+++|+|+.+|++++|++++|+|++++| |+|++++|+|.++|+|+|++|||+|+++. +++++| T Consensus 553 TP~s~~i~~~S~~~~~~~v~Pv~vG~~vlFv~~~g~~s~vre-~~y~~~~d~y~a~DlT~~a~hl~~g~v~~--~~~s~~ 629 (905) T protein:vir:78 553 STATIKLTEISDYFYRSLAKPVSTGVSIAFVSEADTYSKIFE-MSIDSVDNRPQVADITRIVPEYVPTGLTW--SVSTPN 629 (905) T ss_pred cceeEEEEeEEeecccCCCCcEEeCCeEEEeecCCCeeEEEE-EEeeecccceehhHHHHHHHHhcCCceEE--EEecCC Confidence 999999999999999999999999999999999999877666 68999999999999999999999998764 456778 Q ss_pred cEEEEEEcCCCeEEEEEeeeCCCceeeEeeEeeecCCCeEEEEEEEeCCEEEEEEEeCCCEEEEEEEEEeeccC--C-Cc Q lcl|NC_019510. 517 FATVLTSGAKGKVFIYKFLYIDEQIQQQSWSHWDFGDNVTVLAANSIGSHMHVILQNGYDIFMGSISFTKKTLD--F-GN 593 (799) Q Consensus 517 ~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~w~~~g~~~~~~~~~~~d~l~~~v~r~~~~~~~~~~~~~~~~~--~-~~ 593 (799) .++||+++++|+|++|+||++++||+|+|||||+|+|.++++|++ .|++|++|+|..++...++.+.+.... . .+ T Consensus 630 ~~~v~~~~~~~~l~~ytyl~~~~eq~v~AWsrw~~~G~~~~~a~i--~d~~~~vV~r~~~G~~~~~~~~l~~~~~~~~~d 707 (905) T protein:vir:78 630 NSMMLFGDNSNTAYIFKFFNQGNERQVAGWSKWILPGEQRMCGFF--ADTGYFVLYDSTTGSYVLSAMELLDDPDSASID 707 (905) T ss_pred CcEEEEEcCCCeEEEEEeecCCCceeEEeEEEEecCCCeEEEEEE--cCCEEEEEEEccCCeEEEEEEeeccccCccccc Confidence 889999999999999999999999999999999999999998877 477899999987766555554442210 0 11 Q ss_pred cccceeeceeEEEEEccccccccccccccccccccCccccccceEEEEecCCcccccc-c--cccceecCceEEEccCCC Q lcl|NC_019510. 594 EPYRLYMDAKTRYDIPANAFNNDRYETTVDLNAVFGGMRWQVGKILVSDEVGEVRQYE-P--PAGGWASDPTLRIVGDMA 670 (799) Q Consensus 594 ~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~adG~~~~~~-~--v~~g~~~~~~~~i~~~~~ 670 (799) ...+.++||...+.++.+..+....... ......+++.|++++.+.+.+||...... . +.++.. ++ .+ T Consensus 708 ~~~~~~~~~~d~~~~~~~~t~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~dG~~~~~~~~~~~~~~~~-----t~---~~ 778 (905) T protein:vir:78 708 TAFSSFLPRLDNYVVKSDLTVVDNGDGT-LTVDLEAGQAMTGATPVIMFTDGPSEFAFSQPTITAGQF-----TV---DT 778 (905) T ss_pred cceeeeeeccceeeecccceecccCcce-EeeeccCccccccceeEEEeeCCceeeeEEEEEeeceee-----cc---cc Confidence 1123455555556555544332211111 12233467889999998889998764322 1 222222 22 35 Q ss_pred CCeEEEeEeeeEEEEecCeeEEcCCCccceeeeecceEEEEEEEEEeeccceEEEEecCCCcccceec-cccccCcc-cc Q lcl|NC_019510. 671 GKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGRLQHRRAWLNYEQSGAFYVDVTNLGRSYRYTM-SGKPLGDT-TL 748 (799) Q Consensus 671 ~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~~~~~~-~~~~~~~~-~~ 748 (799) +++|+|||+|+++++|+||+++.++++.++ +|++|+|++|+|++|++|.+++++.+++..... ..++.+.. .. T Consensus 779 a~~v~VGl~Y~s~v~~~p~~~~~~~~s~~~-----~~~rI~rv~lr~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~ 853 (905) T protein:vir:78 779 TDDFVVGFKYETKITLPGFFTSEENKADRV-----YAPIVEFLYLDLYYSGRYQIEVDRIGYDTINIDAGSIDANIYLAD 853 (905) T ss_pred CCeEEEeeeeeEEEeecceEeccCCCcccc-----cceEEEEEEEEeecceeEEEEEcCCCcceecccccceecCcccCc Confidence 788999999999999999999888776654 578999999999999999999998887654333 33444443 34 Q ss_pred cccccccceEEEEEeecccceEEEEEECCCCcEEEEEEEEEEEEecc-ccCC Q lcl|NC_019510. 749 GQANLESGQFRFPLAGNAQYNRVVLTSDYTTPLSIIGCGWEGNYIRR-STGI 799 (799) Q Consensus 749 ~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~~eg~y~~r-~rrv 799 (799) ++|++++|+++||+.+|+++.+|+|+|++|+||+|+||+|||+||+| ++|| T Consensus 854 ~~p~~~tg~~~vP~~g~~~~~~v~I~sd~PlP~tvlsi~weg~Yn~r~~~~~ 905 (905) T protein:vir:78 854 GAPLKEIATENVPLFTPGDQVTVTIKAPDPFPSAITGYSWQGHYNRRGIAFI 905 (905) T ss_pred ccccccccEEEEEeeccCceeEEEEEECCCCcEEEEEEEEEEEeccceeecC Confidence 56677899999999999999999999999999999999999999999 6777 No 18 >protein:vir:103790 Length: 768 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024932;genbank:gi:48697202;genbank:GeneID:2846114 Probab=100.00 E-value=2.6e-179 Score=999.64 Aligned_cols=730 Identities=13% Similarity=0.083 Sum_probs=550.7 Q ss_pred CCceeeeccceecc-----cccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCce Q lcl|NC_019510. 1 MGLVSQSIKNLKGG-----ISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYE 75 (799) Q Consensus 1 M~~v~~s~~~~~gG-----VSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~ 75 (799) |++++++++||.+| +++|+|++||++||++|+||+++|+|||+|||||+||+++++. ....+|+||.|+++| T Consensus 1 M~~~~~~~~~F~~GelsP~l~~r~Dl~ry~~~~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~---~~~~~lipf~~~~~~ 77 (768) T protein:vir:10 1 MPKAAPQQVSFDAGELSPLLGARVDLAKYPNGCQVMENFIATVQGPAIRRGGKRFVAATKDS---TKQSWLLPFIVADGI 77 (768) T ss_pred CCcceeeeeeccCceechhhcccchHHHHHHHHhhhhcceeeecCCceecCchhhhhhhcCC---CCCeeEEEEEecCcc Confidence 99999999999999 8999999999999999999999999999999999999987643 467789999999999 Q ss_pred EEEEEEeCCeEEEEeCCCeEEE----EEecCcc----ccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCC Q lcl|NC_019510. 76 QYYVVFTGGDIKVFDLNGQEYA----VRGDKSY----VQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQ 147 (799) Q Consensus 76 ~y~l~~~~g~irv~~~~G~~~~----v~~~~~y----~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~ 147 (799) +|+|||++++||||+.+|.++. .+...|| +.+.+++.+|+|+|+||+|||+|++|||++|.|.++.+ |... T Consensus 78 ~y~l~fg~~~irv~~~~g~v~~~~~~~e~~tp~~~~~l~~~~~~~~L~~~q~aD~~~i~~~~~~p~~l~r~~~~~-w~l~ 156 (768) T protein:vir:10 78 AYMLEFGDHYIRFFVNRGQLVNAGAPVEIATPYALADLTTEDGTFAIRATQSADTMYLFHGGYPTQKLLRTSATT-FSLQ 156 (768) T ss_pred EEEEEEcCCEEEEEECCcEEEecCeeEEEEcCCCcceeecccccceeEEEeecCEEEEEcCCcceeEEEEecCCC-ceeE Confidence 9999999999999998887653 2333443 44566778899999999999999999999999988754 3333 Q ss_pred cceEEEEecCCCCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEE Q lcl|NC_019510. 148 KDALINVRGGQYGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVN 227 (799) Q Consensus 148 ~~~~~~v~~~~y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~ 227 (799) +..+...+.+.++...++++ +++++....+.+.+.+.+.+++++..+....... ....+|+.....+... T Consensus 157 ~~~~~~gp~~~~n~~~~vti---------~~s~~~~~~T~tasa~~~~~~~v~~~~~l~~~~~-~~~~~~~~~~~~g~~~ 226 (768) T protein:vir:10 157 PVTFVGGPFAAVNSDNNVRV---------HASAGTGAVTLVASASVFRPSDVGTLFYLEQEDN-SFVKPWVVHQKIGPSE 226 (768) T ss_pred EeeecCccccccccceeEEE---------EecccceeEEEeecCCccchhhcceeeeeeeecc-ccccccEEEEeeeeEE Confidence 33333222233322222222 1222222334444556667777776544332222 2234665544444444 Q ss_pred EEecCCCceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcce-----e---EEEEecCceEEEE Q lcl|NC_019510. 228 IIAPDGDSIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADK-----Y---YVRYNLTRKVWEE 299 (799) Q Consensus 228 i~a~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~-----~---y~~~~~~~~~w~e 299 (799) +..+.+.......+.++..+ ....+++.+..|+.+.+........+. . |.+...+.+.|.+ T Consensus 227 ~~~~~~~~~~~~~~~~~~~~-----------~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 295 (768) T protein:vir:10 227 LRRVGDRVYLCTAVGTATPQ-----------VTGTETPTHTSGSRWDGTGQDESATDEYGSIGAEWEYQHSGYGTVLITG 295 (768) T ss_pred EEecCCceEEeeeecccccc-----------ccceeccccccCceEEEecCcccccccccccceEEEEEEcCCceEEEEE Confidence 44444433322222221111 111234555566555443333222111 1 2223334444444 Q ss_pred eccccccceeeccceeeEEEeccCceEEEeecccCccccC-ccccccCccccCCCeeEEEEEcceEEEecCCeEEEEccC Q lcl|NC_019510. 300 TVGWNIQVGLNNGTMPWSLIRAADGQFDFVANSWVGRTAG-DDDTNPHPSFVGQAITDVFFYRNRLGMLSGENIILSRTA 378 (799) Q Consensus 300 ~~~~~~~~~~~~~t~p~~lv~~~~~t~~~~~~~w~~~~~g-d~~~np~psf~~~~p~~v~f~q~RL~f~~~~~v~~Sr~g 378 (799) + ...+|++..++...+++......+.....+ .++....|+-.++||++|+||||||+|++|++|||||+| T Consensus 296 ~---------~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~g~Ps~v~f~q~RL~f~~~~~v~~Srtg 366 (768) T protein:vir:10 296 Y---------TNDQVVTGTVATNDPADPGMLPNTVVTLTGTYKWARSLFNSTDGFPQMGTFWRNRLCLMRDRWLAMSVSA 366 (768) T ss_pred e---------cCCeeEEeeeeeecCcccccccccccccCCCcccccCCCcCCCCCceEEEEEeeeEEEeeCCEEEEEccc Confidence 3 444566666665555554444443332222 122233444456788999999999999999999999999 Q ss_pred CccccccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCC---ccccccceEEEEEEeecccCCC Q lcl|NC_019510. 379 KYFNMYPASVAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNAS---GVLSAKSVELNLTTEFDVNDGA 455 (799) Q Consensus 379 d~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~---~~lTP~~~~~~~~s~~~~~~~~ 455 (799) |||||++++++++.|||||+++++++++|+|+|++++ ++|+|||+++||+|+++ ++|||+|++++++|.|++ ++| T Consensus 367 d~~nF~~~s~~~~~DdD~I~~~~ss~~~~~i~~~v~~-~~L~i~T~~~q~~l~~~~~~~~lTP~~~~i~~~s~~g~-~~~ 444 (768) T protein:vir:10 367 DFETFKTKDADQQTDDSAIVQQLNARQLNKLAWMVES-DSLLIGMTGDEWVIGPANASQPVSAANLNAARRTSYGS-KRI 444 (768) T ss_pred ccccccccccccccCCccEEEEecCCcceeEEEEeec-CcEEEEecCceEEEecCCCCcccccceEEEEEeehhcc-ccc Confidence 9999999999999999999999999999999999999 58999999999999873 589999999999999976 479 Q ss_pred CcEEeCCeEEEEecCCCceeEEEEEeeccccCceehhhHHHHHHHhcCCCc-----EEEEEcCCCCcEEEEEEcCCCeEE Q lcl|NC_019510. 456 RPYGIGRGVYFASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNGV-----FSISGSSTENFATVLTSGAKGKVF 530 (799) Q Consensus 456 ~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~-----~~~~a~~~~~~~~v~~~~~dg~l~ 530 (799) +|+.+|++++|+|++|+ .+|+|+|++++|+|+++|+|+|++||+++.. +..||++++|.+++||+++||+|+ T Consensus 445 ~Pv~vG~~v~fv~~~g~---~vre~~y~~~~d~y~a~DlT~~a~hl~~~~~~~~~~i~~~a~~~~p~~v~~~v~~dg~l~ 521 (768) T protein:vir:10 445 QPVQVGGTIMFVQKAGR---KLRDFKYDFSSDNYVSTDVTKIADHITRGRAGTNSGIMSLCFQQEPHSVVWAARADGQLI 521 (768) T ss_pred ccEEeCCeEEEEcCCCC---EEEEEEeeeecCceecchhhhhhhhhccccCccccceeeEEEeecCCeEEEEEecCCeEE Confidence 99999999999999994 4788899999999999999999999999764 677899999999999999999999 Q ss_pred EEEeeeCCCceeeEeeEeeec-CCCeEEEEEEEe----CCEEEEEEEeCCCEEEEEEEEEeeccC--CCccccceeecee Q lcl|NC_019510. 531 IYKFLYIDEQIQQQSWSHWDF-GDNVTVLAANSI----GSHMHVILQNGYDIFMGSISFTKKTLD--FGNEPYRLYMDAK 603 (799) Q Consensus 531 ~~tyl~~~~e~~v~aW~~w~~-~g~~~~~~~~~~----~d~l~~~v~r~~~~~~~~~~~~~~~~~--~~~~~~~~~lD~~ 603 (799) +|||++++++|+|+|||||++ +|.|++||+++. +|+||++|+|++++..+++.+.++... .....+++++||+ T Consensus 522 ~~ty~~e~~~q~v~aW~~~~~~~g~v~~v~~i~~~~g~~d~l~~~v~r~~~g~~~~~ie~l~~~~~~~~~~~~~~~~D~~ 601 (768) T protein:vir:10 522 GCTYDEEAGRSDVYGWHRHPDANGFVECVASMPAPDGASDDLWVIVRRQVNGQTVRYVEYLNPALQDDEPQSSAFYVDAG 601 (768) T ss_pred EEEEecCCCceeEEeEEEEEcCCCEEEEEEEEecCCCCccEEEEEEEecCCCeEEEEEEecCcccccccccccceEeccc Confidence 999998888899999999985 788999998863 589999999999999898877665421 1122345666665 Q ss_pred EEEEEccccccccccccccccccccCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEE Q lcl|NC_019510. 604 TRYDIPANAFNNDRYETTVDLNAVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFR 683 (799) Q Consensus 604 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~ 683 (799) .++ .+.. ....+|+.|++|+++.+++||.+++..++.+|.++ ++.++++|+|||+|+++ T Consensus 602 ~~~--------~~~~------~~~~~gl~~leg~~v~v~~dG~~~~~~~v~~g~it-------l~~~~~~v~vG~~y~s~ 660 (768) T protein:vir:10 602 ITY--------NGVP------TSTIAGLGHLEGVTVAVLTDGAVHPSRTVTAGAIT-------LDWSASIVHIGVPTTCR 660 (768) T ss_pred ccc--------CCcc------eeeecCCCCcccceEEEEECCEeccCceecCCEEE-------eCCCCceEEEeEeeeEE Confidence 443 3322 23457999999999999999999999999888554 45889999999999999 Q ss_pred EEecCeeEEcCCCccceeeeecceEEEEEEEEEeeccceEEEEecCCCcccceeccccccCcccccccccccceEEEEEe Q lcl|NC_019510. 684 YEFSKFLIKKQDESGGFSTEDVGRLQHRRAWLNYEQSGAFYVDVTNLGRSYRYTMSGKPLGDTTLGQANLESGQFRFPLA 763 (799) Q Consensus 684 ~~~~~~~~~~~~g~~~~~~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~~ 763 (799) |+|+||++++++|+.++ +|+||+|+++++.+|+++.+.++...+.... ...+..++....++|++||++++|+. T Consensus 661 ~~~~p~~~~~~~gs~~~-----~~~ri~r~~v~~~~S~~~~~~~~~~~~~~~~-~~~r~~~~~~~~~~~l~TG~~~v~~~ 734 (768) T protein:vir:10 661 IQTMQLNAGAANGTAQG-----KTKRVTNIATRFSRSLGGVVGPTFDDNDLEQ-LSFRKPSNAMDRAVPLFDGDMESDWR 734 (768) T ss_pred EEecceEeecCCccccc-----cceEEEEEEEEEecccceEEEecCCCCCcee-eeeEecCcccCccCCcccCEEEEEec Confidence 99999999999887765 5789999999999999999987655433211 11133344444467899999999984 Q ss_pred -ecccceEEEEEECCCCcEEEEEEEEEEEEeccc Q lcl|NC_019510. 764 -GNAQYNRVVLTSDYTTPLSIIGCGWEGNYIRRS 796 (799) Q Consensus 764 -~~~~~~~v~i~~~~P~P~tvl~i~~eg~y~~r~ 796 (799) +|+++.+|+|+|++|+||+|++|+||+++|+|+ T Consensus 735 ~~~~~~~~i~i~~d~P~P~tvlsi~~~~~~nd~~ 768 (768) T protein:vir:10 735 GGYEGQSWICYQNDQPLPVTLLGFFPILDTQDDR 768 (768) T ss_pred CCCCcceEEEEEECCCCCEEEEEEEEEEEEeecC Confidence 578999999999999999999999999999999 No 19 >protein:vir:95324 Length: 823 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512269;genbank:gi:89152436;genbank:GeneID:3952993 Probab=100.00 E-value=3.6e-169 Score=944.03 Aligned_cols=721 Identities=13% Similarity=0.106 Sum_probs=540.9 Q ss_pred CCceeeeccceecc-----cccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCce Q lcl|NC_019510. 1 MGLVSQSIKNLKGG-----ISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYE 75 (799) Q Consensus 1 M~~v~~s~~~~~gG-----VSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~ 75 (799) |+ +++.++||.+| +++|+|++||++||++|+||+++|+||++|||||+||+++++. ....+|+||.|++.| T Consensus 1 m~-i~~~q~sF~~GElsP~l~gR~Dl~ry~~q~~~~~N~~~~~~GGl~rRpGt~fva~~~~~---~g~~rLipf~~s~~q 76 (823) T protein:vir:95 1 MA-ISWIQPSFAGGEIGPSLYGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYP---NRKCRLIPFQFSTVQ 76 (823) T ss_pred Cc-ceeechhccCceechheeccchHHHHHHHHhhhhCcEeeecCCceecCchhhhhhhcCC---CCCeeEEEEEeCCCc Confidence 99 99999999999 9999999999999999999999999999999999999987654 467899999999999 Q ss_pred EEEEEEeCCeEEEEeCCCeEEE-----EEecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcce Q lcl|NC_019510. 76 QYYVVFTGGDIKVFDLNGQEYA-----VRGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDA 150 (799) Q Consensus 76 ~y~l~~~~g~irv~~~~G~~~~-----v~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~ 150 (799) +|+|||++++||||+++|.++. ++...| +++.++.+|+|+|+||+|||+|++|||++|.|.++.. |...+.. T Consensus 77 ~y~Lefg~~~irV~~~~g~vv~~~~~~~ev~tP--y~~~~l~~Lr~~qsaD~~fivh~~~~p~~L~r~~~~~-w~l~~~~ 153 (823) T protein:vir:95 77 TYALEFGHQYMRVIKDGALVLNSSNVIYEIATP--YTEADLFRIKFTQSADVLTLVHPAYPPKELRRYAHDN-WQLVDVV 153 (823) T ss_pred EEEEEEcCCeEEEEeCCcEEEecCCceeEEecc--cccccccceeEEEeccEEEEEcCCccceEEEecCCCC-ceEEEEE Confidence 9999999999999987665432 122333 4667788999999999999999999999999988854 3333333 Q ss_pred EEEEecCCCCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEe Q lcl|NC_019510. 151 LINVRGGQYGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIA 230 (799) Q Consensus 151 ~~~v~~~~y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a 230 (799) +...+.+..+.++++++ .+++......+....+.+.+++++..+.... ........|..... .. T Consensus 154 ~~~gp~~~~~~~~t~~v---------~~~~~~~~~t~ta~~~~~~~d~vg~~~~l~~-~~~~~~~~~~~~~~------~~ 217 (823) T protein:vir:95 154 TKNGPFEDINIDESLTV---------YASASTGTITLTASASIFGAEQVGKLFYLEQ-PAVDSVPVWETSKS------TS 217 (823) T ss_pred EeccccccccccceeEE---------eccccCceeEEeecccccchhhccceEEEec-cccceeeecceeee------ec Confidence 32222222222222222 2333333334444445555555543211100 00001111110000 00 Q ss_pred cCCCceeEEEEEeccCcceeeEEeeeeccceeccccccCC-eEEEEEecCCCCcceeEEEEecCceEEEEecccc-ccce Q lcl|NC_019510. 231 PDGDSIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDG-YTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWN-IQVG 308 (799) Q Consensus 231 ~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G-~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~-~~~~ 308 (799) .......+...... ..+.....+++.+..| +.+...++.......+|..+..+.+.|++++..+ .... T Consensus 218 -----~~~~~~~~~~~~~~-----~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~t~v~~~~~~~ 287 (823) T protein:vir:95 218 -----IGDIRRADSNYYRA-----VTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEWEYLHSGFGIARITAVNGTTATA 287 (823) T ss_pred -----ccceEEecccceee-----eeccccceeecccCCcceEEeceecccccceeEEEEEeCCcceEEEEeecceeeec Confidence 00000000000000 0111122333444333 4444443332222222332345667888775443 3344 Q ss_pred eeccceeeEEEeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEec----CCeEEEEccCCccccc Q lcl|NC_019510. 309 LNNGTMPWSLIRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLS----GENIILSRTAKYFNMY 384 (799) Q Consensus 309 ~~~~t~p~~lv~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~----~~~v~~Sr~gd~~nF~ 384 (799) ...++||+.+++...++|.+....|.. .++||++|+||||||+|++ |++|||||+|||+||+ T Consensus 288 ~~~~~~~~~~~~~~~~t~~~~~~~~~~--------------~~g~Ps~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~ 353 (823) T protein:vir:95 288 EVISYIPSQVVGEDNASYKWAKYAWNS--------------VNGYPGTVVYYQQRLYFAASTAFPQTIWASRTGDYKDFG 353 (823) T ss_pred eEeeeeccccccCCcCCccccccccCc--------------CCCCccEEEEEeceEEEEEcCCCCcEEEEeccCCccccc Confidence 566789999998888888777666643 5689999999999999995 6899999999999999 Q ss_pred cccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCC--ccccccceEEEEEEeecccCCCCcEEeCC Q lcl|NC_019510. 385 PASVAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNAS--GVLSAKSVELNLTTEFDVNDGARPYGIGR 462 (799) Q Consensus 385 ~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~--~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~ 462 (799) +++++ +|||||+++++++++|.|+|+++++ +|++||+++||+|+++ ++|||+|++++++|.|++ ++|+|+.+|+ T Consensus 354 ~~~~~--~DdD~I~~~~s~~~~~~i~~~v~~~-~Lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~ 429 (823) T protein:vir:95 354 KSNPT--QDDDRIIYTYAGRQVNEIRHLIDVG-SLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGS-SNVPPIAVAN 429 (823) T ss_pred cccCC--CCCCcEEEEEcCCcceEEEEEeecC-cEEEEecCcEEEEEcCCCcccceeeEEEEEeecccc-ccccceEeCC Confidence 99854 7999999999999999999999995 7999999999999875 689999999999999854 6899999999 Q ss_pred eEEEEecCCCceeEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCcee Q lcl|NC_019510. 463 GVYFASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQ 542 (799) Q Consensus 463 ~v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~ 542 (799) .++|+|++|+ .+|+|.|++++|+|+++|+|+|++||+++..+..+|++++|++++|++++||+|++|+|+ +||+ T Consensus 430 ~~~Fv~~~g~---~vre~~~~~~~d~~~~~dlT~~a~hl~~~~~i~~~a~~~~p~~~~~~v~~dG~l~~~ty~---~~q~ 503 (823) T protein:vir:95 430 IALFVQEKGS---VVRDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDGKLLVMTYL---RDQQ 503 (823) T ss_pred eEEEEecCCC---EEEEEEEeeecCceecchhhhhhhhhcCCCceEEEEEecCCCeEEEEEecCCcEEEEEEe---cccc Confidence 9999999984 477889999999999999999999999998888889999999999999999999999996 7888 Q ss_pred eEeeEeeecCCCeEEEEEEEe--CCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEcc---------- Q lcl|NC_019510. 543 QQSWSHWDFGDNVTVLAANSI--GSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPA---------- 610 (799) Q Consensus 543 v~aW~~w~~~g~~~~~~~~~~--~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~---------- 610 (799) |.|||||+++|+|+++|++++ +|+||++|+|++++...++.|.+++.....+.+.+||||+.+|.... T Consensus 504 v~aW~~~~~~g~~~~~~~i~~~~~d~l~~~v~R~i~g~~~~yiE~~~~~~~~~~~~~~~lD~~~s~~g~~~~~~~~~l~~ 583 (823) T protein:vir:95 504 VFAWAPQSSTGKYESTCSISEGNEDAVYFVVNRTVNGQTVRYIERLSSRLFTSDEDAFFVDSGLSYDGRNTSDRTMTITG 583 (823) T ss_pred eeeeEEEecCCcEEEEEEecCCCCCEEEEEEEeccCCeEEEEEEeeccccCCCccceeEEEEEEEeecCcccceeeEecC Confidence 999999999999999999875 58999999999999988888888776666667889999997764211 Q ss_pred -----------------cccccccc------------------------------------------------------c Q lcl|NC_019510. 611 -----------------NAFNNDRY------------------------------------------------------E 619 (799) Q Consensus 611 -----------------~~~~~~~~------------------------------------------------------~ 619 (799) +....... . T Consensus 584 g~~~l~~l~g~~v~~adg~~~~~~~v~g~i~l~~~~~~~~vGl~~~~~i~~~~~~v~~~~a~~~~~~r~v~a~l~~~~t~ 663 (823) T protein:vir:95 584 GSGEWDYLAEYTISVSGGAYFTSSDVGAQLQFPYTGADPDTGYEVSKELRCDIISVTSNTAVVVRANRNVPPSLRNVATT 663 (823) T ss_pred CCCcccccCceEEEecCcceECCccceeEEEeCcCCCccccccceEEEEEEeeceeeCCceEEEccCCcccceeeeeecc Confidence 11100000 0 Q ss_pred cccccccccCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccc Q lcl|NC_019510. 620 TTVDLNAVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGG 699 (799) Q Consensus 620 ~~~~~~~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~ 699 (799) ....+...++||.||||++|.+++||.+++..+|.+|.++++ .+++.|||||+|+++++||||++..+ |.++ T Consensus 664 ~~~~~~~~~~gL~hleg~tv~v~~dg~~~~~~~v~~G~vtl~-------~~~~~v~vGl~~~~~~~~l~~~~~~~-g~~~ 735 (823) T protein:vir:95 664 NWQMARRTFGGLSHLEGQTVNILSDANVEPQKVVSGGAVTLE-------SPGAVVHIGLPITAEFETLDININGQ-ETLL 735 (823) T ss_pred ccccccceeeeccccccceEEEEEcCeeeCCeEecCCEEEec-------CCCCEEEEeecceeeEEecchhcCCC-cccC Confidence 000023467799999999999999999999999999976654 78999999999999999999998864 7766 Q ss_pred eeeeecceEEEEEEEEEeeccceEEEEecCCCcccceeccccccCcccccccccccceEEEEE-eecccceEEEEEECCC Q lcl|NC_019510. 700 FSTEDVGRLQHRRAWLNYEQSGAFYVDVTNLGRSYRYTMSGKPLGDTTLGQANLESGQFRFPL-AGNAQYNRVVLTSDYT 778 (799) Q Consensus 700 ~~~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~-~~~~~~~~v~i~~~~P 778 (799) ++ ++||++++++|++|.++.++.+.... +.+. .|-.+.+..++|++||++++++ .+|+++++|+|+|++| T Consensus 736 g~-----~~ri~~~~~~~~~s~~~~~g~~~~~l-~~~~---~r~~~~~~~~~~~~tG~~~~~~~~~~~~~~~~~i~q~~p 806 (823) T protein:vir:95 736 DK-----KQVIPSVTLVVNASRGIWATTPGGKW-YEYP---QREFEFYDDPVDDATGKVEVKLDSNWGKNGRVKIRQLDP 806 (823) T ss_pred Cc-----eeEEeEEEEEEEeeeeEEEecCCCce-eEee---ccCCCcccCCCCcccceEEEecCCCcCCccEEEEEEcCC Confidence 65 35899999999999999987654322 2222 2223456667899999999997 7999999999999999 Q ss_pred CcEEEEEEEEEEEEecc Q lcl|NC_019510. 779 TPLSIIGCGWEGNYIRR 795 (799) Q Consensus 779 ~P~tvl~i~~eg~y~~r 795 (799) ||||||||..|-..+== T Consensus 807 lp~tvl~v~~~~~~~g~ 823 (823) T protein:vir:95 807 LPLSVLAVIPRLTVGGF 823 (823) T ss_pred CceEEEEEEEEEEecCC Confidence 99999999987664433 No 20 >protein:vir:7329 Length: 825 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848220;genbank:gi:30387391;genbank:GeneID:2641863 Probab=100.00 E-value=5.2e-167 Score=932.21 Aligned_cols=721 Identities=13% Similarity=0.117 Sum_probs=525.7 Q ss_pred CCceeeeccceecc-----cccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCce Q lcl|NC_019510. 1 MGLVSQSIKNLKGG-----ISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYE 75 (799) Q Consensus 1 M~~v~~s~~~~~gG-----VSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~ 75 (799) |+ ++..++||.+| +..|.|++||++||++|+||+++|+||++||||++||+++++. ++..+|+||.|+++| T Consensus 1 m~-~~~~q~sF~~GElsP~l~gR~Dl~~y~~g~~~~~N~~~~p~Gg~~rRpGt~fva~~~~~---~~~~rLipF~fs~~q 76 (825) T protein:vir:73 1 MA-FSWIQPSFAGGEIGPSLYGRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPAKYP---DRKCRLIPFQFSTVQ 76 (825) T ss_pred Cc-cceeccccccceechhhcccchHHHHHHHHHHhcCcEEEecCCceecCchHHhHhhcCC---CCCEEEEEEEeCCCc Confidence 86 44677999999 7789999999999999999999999999999999999987654 567899999999999 Q ss_pred EEEEEEeCCeEEEEeCCCeEEE-----EEecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcce Q lcl|NC_019510. 76 QYYVVFTGGDIKVFDLNGQEYA-----VRGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDA 150 (799) Q Consensus 76 ~y~l~~~~g~irv~~~~G~~~~-----v~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~ 150 (799) +|+|||++++||||+++|.++. .+..+|| ++.++.+|+|+|+||+|||+|+++||++|.|.++.+| . T Consensus 77 ~y~Lefg~~~lrv~~~gg~v~~~~~~~~e~~TPy--~~~~l~~l~~~QsaD~~~i~h~~~pp~~L~r~~~~~W------~ 148 (825) T protein:vir:73 77 TYALEFGHNYMRVIKDGAYVLTTSNVIYELAMPY--ADTDLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNW------Q 148 (825) T ss_pred EEEEEEeCCeEEEEeCCceEeccCCceEEEeccc--chhhhhhheeeeecCEEEEEcCCCceeEEEEecCCCc------E Confidence 9999999999999998775432 3344555 5677899999999999999999999999999887542 2 Q ss_pred EEEEecCCCCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEe Q lcl|NC_019510. 151 LINVRGGQYGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIA 230 (799) Q Consensus 151 ~~~v~~~~y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a 230 (799) +..+... ++..+.+..+ .. ....+++.....+++.+.+.+.+++.+..+....... .....|..........+.. T Consensus 149 l~~~~f~-~gp~~~in~~--~s-v~v~asg~tg~~TiTaS~a~~~~~~vG~~i~~~~~~v-~si~~~~~~~~~~~~~v~~ 223 (825) T protein:vir:73 149 IVDVTTK-NGPFEDINVD--ET-VKVYASASTGTITLTASSAIFGAEQVGKLFYLEQPAV-DSVPVWETSKTTAINDVRR 223 (825) T ss_pred EEEEecc-CCcccccccc--cc-ceeeecccCceeEEEeeccccCchhcCeEEEEecccc-cccceeeeeeEEEeeeEEE Confidence 2222211 1111111111 11 1122333333444555555555555553322111100 0111111100000000011 Q ss_pred cCCCceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEe-cCCCCcceeEEEEecCceEEEEecccccccee Q lcl|NC_019510. 231 PDGDSIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVG-DTSRSADKYYVRYNLTRKVWEETVGWNIQVGL 309 (799) Q Consensus 231 ~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~-~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~ 309 (799) .++. ..... .......+++.+..|..+.... .........|.....+.+.++.+...+..... T Consensus 224 ~~~~-~~~~~---------------~~~~~~t~~~~a~~g~~~~~~~g~~~~~~~~~~~~~~~~~g~~~it~~~~~~~~~ 287 (825) T protein:vir:73 224 ADSN-YYRAN---------------TSGKTGTLRPSHTEGMSWDGWGGTGSDDTGIQWEYLHSGFGIAKITAVAGDGLTA 287 (825) T ss_pred CCCc-eeeee---------------cccccceeeccccCCceeEeeeeecccCCceEEEEEecCCceEEEeeccccceee Confidence 1110 00000 0111123444555554443222 11111111122223334444333322211111 Q ss_pred e---ccceeeEEEeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEe----cCCeEEEEccCCccc Q lcl|NC_019510. 310 N---NGTMPWSLIRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGML----SGENIILSRTAKYFN 382 (799) Q Consensus 310 ~---~~t~p~~lv~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~----~~~~v~~Sr~gd~~n 382 (799) . ...+|..++..+++++++....|.. .++||++|+||||||+|+ +|++|||||+|||+| T Consensus 288 ~~~~~~~~~~~~~~~~~~t~~~~~~~~~~--------------~~gyPs~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~n 353 (825) T protein:vir:73 288 TADVVSFIPSQVVGSANASYKWAKYAWNS--------------VNGYPSTVVYYQQRLYFAASTAYPQTIWASRTGDYKD 353 (825) T ss_pred ccccceecccccccCCCCCcccccCCccc--------------CCCCccEEEEEcceEEEeecCCCCCEEEEEccCCccc Confidence 1 1124455555555555555555543 458999999999999999 579999999999999 Q ss_pred cccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCC--ccccccceEEEEEEeecccCCCCcEEe Q lcl|NC_019510. 383 MYPASVAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNAS--GVLSAKSVELNLTTEFDVNDGARPYGI 460 (799) Q Consensus 383 F~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~--~~lTP~~~~~~~~s~~~~~~~~~Pv~v 460 (799) |+++++ ++|||||+++++++++|.|+|+++++ +|+|||+++||+|+++ ++|||+|++++++|.|++ ++++|+.+ T Consensus 354 F~~~~~--~~DdD~I~~~~s~~~~~~i~~~~~~~-~L~~~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~v 429 (825) T protein:vir:73 354 FGKNNP--IQDDDRIIYTYAGRQVNEIRHLIDVG-NLVALTSGGEYTISGDQNKVLTPSAFSFSSQGNNGS-SNVPPIAV 429 (825) T ss_pred cccCCC--CCCCccEEEEEcCCcceeEEEEeecC-cEEEEecCceEEEecCCCcccceeeEEEEeeeeecc-ccccceEe Confidence 999985 57999999999999999999999985 8999999999999875 699999999999999966 57999999 Q ss_pred CCeEEEEecCCCceeEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCc Q lcl|NC_019510. 461 GRGVYFASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQ 540 (799) Q Consensus 461 g~~v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e 540 (799) |++++|+|++|++ +|+|.|++++|+|+++|+|+|++||+++..+..+|++++|++++|++++||+|++|+|+ +| T Consensus 430 g~~~~Fv~~~g~~---vre~~~~~~~d~~~~~dlt~~a~hl~~~~~~~~~a~~~~p~~~~~~v~~dg~l~~~ty~---~~ 503 (825) T protein:vir:73 430 ANIALFIQEKGSV---VRDLAYSFDVDGYQGTDLTILANHLFQKHSIVDWSFCIVPYSSAFCIRDDGKLLVLTYL---RD 503 (825) T ss_pred CCeEEEEeCCCCe---EEEEEEeeecCceeccchhhhhHhhccCCceEEEEEcCCCceEEEEEecCCeEEEEEEe---cc Confidence 9999999999853 77889999999999999999999999998888899999999999999999999999996 78 Q ss_pred eeeEeeEeeecCCCeEEEEEEEe--CCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEc--------- Q lcl|NC_019510. 541 IQQQSWSHWDFGDNVTVLAANSI--GSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIP--------- 609 (799) Q Consensus 541 ~~v~aW~~w~~~g~~~~~~~~~~--~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~--------- 609 (799) |++.|||||+++|+|+++|++++ +|+||++|+|+++++.+++.|.++.....++++.+||||+.+|... T Consensus 504 q~v~aW~~~~~~g~v~~~~~i~~~~~D~l~~iV~R~~~g~~~~yiE~~~~~~~~~~~~~~~vD~g~~~~g~~~~~~l~~l 583 (825) T protein:vir:73 504 QQVFAWAPQSSAGKYESTCSISEGSEDAVYFVVNRTINGQTVRYIERLSSRLFTNDEDAFFVDCGLSYDGRNTSSRTMTI 583 (825) T ss_pred ccceeeEEEecCCcEEEEEEecCCCccEEEEEEEEeeCCceEEEEEEecccccCCCcceeEEEEEeeecccceeeceeee Confidence 88999999999999999999986 4899999999999998998888877666667788999997655421 Q ss_pred ------------------cccccccc-------------c--------------------c------------------- Q lcl|NC_019510. 610 ------------------ANAFNNDR-------------Y--------------------E------------------- 619 (799) Q Consensus 610 ------------------~~~~~~~~-------------~--------------------~------------------- 619 (799) +|+..... . . T Consensus 584 ~g~tv~~~~~g~~~~~v~~g~itl~~~~~~~i~l~~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~v~~~~~~~a~~~~~~ 663 (825) T protein:vir:73 584 SGGTGDWSYQVDYPVTVSGGAYFVNTDVGAQIQFPYTGTDPDTNEPVAKELRGDIISVTSNTAVVVRFNRNVPPVLRNVA 663 (825) T ss_pred CCceEEEEeCCeEEEEEcCCeEEecccceEEEEecccCcccccccceeceeeEEEccccCceEEEEEecccccceeeeec Confidence 11000000 0 0 Q ss_pred --cccccccccCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCc Q lcl|NC_019510. 620 --TTVDLNAVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDES 697 (799) Q Consensus 620 --~~~~~~~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~ 697 (799) ....+...++||.||||++|.+++||.+++..+|++|.++++ .++++|||||+|+++++||||++..| |. T Consensus 664 ~t~~~~a~~~~~gL~hLeG~~v~v~~Dg~~~~~~~V~~G~vtl~-------~~~~~v~vGl~y~~~~~~l~~~~~~~-g~ 735 (825) T protein:vir:73 664 TTNWQMARQTFSGLAHLEGQTVNILSDASVEPQKTVTGGAVTLE-------SPGAVVHIGLPITAEFETLDININGQ-ET 735 (825) T ss_pred ccCCCcchheeccccccCCceEEEEECCeeeCCeEecCcEEEec-------CCceEEEEeeCccceEEecccccCCC-cc Confidence 000112356899999999999999999999999999987765 78999999999999999999998754 66 Q ss_pred cceeeeecceEEEEEEEEEeeccceEEEEecCCCcccceeccccccCcccccccccccceEEEEE-eecccceEEEEEEC Q lcl|NC_019510. 698 GGFSTEDVGRLQHRRAWLNYEQSGAFYVDVTNLGRSYRYTMSGKPLGDTTLGQANLESGQFRFPL-AGNAQYNRVVLTSD 776 (799) Q Consensus 698 ~~~~~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~-~~~~~~~~v~i~~~ 776 (799) .+++ ++||+++.++|++|.++.+..+.... +.+.+ |-.+.+..++|++||++++++ .+|+++.+|+|+|+ T Consensus 736 ~~g~-----~~ri~~~~~~~~~s~~~~~g~~~~~l-~~~~~---r~~~~~~~~~~~~tG~~~~~~~~~~~~~~~~~i~q~ 806 (825) T protein:vir:73 736 LLDK-----KQVIPTVTMVVNASRGIWATTPGGTW-YEYPQ---REFEFYDDPVDDATGKVEVKLDSNWDKNGRVKVRQL 806 (825) T ss_pred ccCc-----cEEEEEEEEEEEeeeeEEEecCCCcc-eEeec---cCCCcccCCCccccCcEEEecCCCCCCccEEEEEEc Confidence 6655 45899999999999999987654422 22222 223445557899999999997 79999999999999 Q ss_pred CCCcEEEEEEEEEEEEecc Q lcl|NC_019510. 777 YTTPLSIIGCGWEGNYIRR 795 (799) Q Consensus 777 ~P~P~tvl~i~~eg~y~~r 795 (799) +|||||||||..|...+== T Consensus 807 ~PlP~tvlav~~~~~~~g~ 825 (825) T protein:vir:73 807 DPLPLSVLAVLPRLTVGGF 825 (825) T ss_pred CCCCEEEEEEEEEEEecCC Confidence 9999999999988775544 No 21 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=100.00 E-value=1e-164 Score=919.61 Aligned_cols=663 Identities=14% Similarity=0.122 Sum_probs=492.2 Q ss_pred CCceeeeccceecc-----cccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCce Q lcl|NC_019510. 1 MGLVSQSIKNLKGG-----ISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYE 75 (799) Q Consensus 1 M~~v~~s~~~~~gG-----VSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~ 75 (799) |+++...++||.+| +..|.|++||+++|++|+||++.|+||++|||||+||++++.. ....||+||.|+++| T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~---~~~~rlipf~~~~~~ 77 (681) T protein:vir:10 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDS---AKKVRLIPFTYSVTQ 77 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCC---CCcEEEEEEEeCCCc Confidence 99999999999999 6689999999999999999999999999999999999987654 467899999999999 Q ss_pred EEEEEEeCCeEEEEeCCCeEEE----EEecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceE Q lcl|NC_019510. 76 QYYVVFTGGDIKVFDLNGQEYA----VRGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDAL 151 (799) Q Consensus 76 ~y~l~~~~g~irv~~~~G~~~~----v~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~ 151 (799) +|+|||++++||||..+|.++. ++..+|| ++.++.+|+|+|+||+|||||++|||++|.|.++.+ |....+.| T Consensus 78 ~~~l~~g~~~~r~~~~~~~~~~~~~~~~~~tpy--~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~-W~l~~~~f 154 (681) T protein:vir:10 78 TMVIELGAGYFRFHTNGGTLLDGAVPYEIANPY--AEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATN-WQLATIAF 154 (681) T ss_pred eEEEEEeCCeEEEEeCCcEEeeCcEeEEecCCC--ChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCc-eEEEEEEe Confidence 9999999999999987776542 3445554 667789999999999999999999999999988754 32222221 Q ss_pred EEEecCCCCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEec Q lcl|NC_019510. 152 INVRGGQYGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAP 231 (799) Q Consensus 152 ~~v~~~~y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~ 231 (799) ... .+.. -+++ + +....+. ..++. .++.+. T Consensus 155 ~~~---p~~p-~~~~------a---t~~~~~~--------------------------------~~t~~-----~~v~av 184 (681) T protein:vir:10 155 TSP---VATP-TSVT------A---TSNNKGT--------------------------------DYTYR-----YVVTAL 184 (681) T ss_pred ccc---cccc-eeee------e---eccCCcc--------------------------------ceeEe-----EEEEEe Confidence 110 0000 0000 0 0000000 00000 001111 Q ss_pred CCCceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccceeec Q lcl|NC_019510. 232 DGDSIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVGLNN 311 (799) Q Consensus 232 ~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~ 311 (799) +...... +......+ ++.. .+ ..+....+..........+ --+....+.|.- .+......+ T Consensus 185 da~t~~~--s~~~~~~t----vt~~--~~-------~~~~~~t~~w~a~~g~~~~-~V~~~~~gi~g~-ig~~~~~~~-- 245 (681) T protein:vir:10 185 DAEGKTE--SAPSSAGT----CTNN--LF-------TNGGANTIAWSASSGASRY-NVYKEQGGLYGY-IGQTTGTSL-- 245 (681) T ss_pred eccccee--ecCCcceE----Eeee--ee-------cCCcceeEEEEecCCceee-eecccceeEEEE-eeccceeee-- Confidence 1000000 00000000 0000 00 0001111111111111111 111122222321 111111111 Q ss_pred cceeeEEEeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEe----cCCeEEEEccCCcccccccc Q lcl|NC_019510. 312 GTMPWSLIRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGML----SGENIILSRTAKYFNMYPAS 387 (799) Q Consensus 312 ~t~p~~lv~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~----~~~~v~~Sr~gd~~nF~~~t 387 (799) ......+......+...+ .++-.++||++|+||||||+|+ +|++|||||+|||+||++++ T Consensus 246 ---------------~~~~~~~~~~~t~~~~~~-~~~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~ 309 (681) T protein:vir:10 246 ---------------VDDNIAPDLSVTPPIYDA-VFNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSL 309 (681) T ss_pred ---------------eecccccCcccccccccc-ccccCCCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccC Confidence 111111111111222223 3455678999999999999999 47899999999999999998 Q ss_pred ccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCC--ccccccceEEEEEEeecccCCCCcEEeCCeEE Q lcl|NC_019510. 388 VAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNAS--GVLSAKSVELNLTTEFDVNDGARPYGIGRGVY 465 (799) Q Consensus 388 ~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~--~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~ 465 (799) + ++|||||+++++++++|.|+|+++++ +|+|||+++||.|+++ ++|||+|++++++|.|++ ++|+|+.+|++++ T Consensus 310 ~--~~ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~ 385 (681) T protein:vir:10 310 P--VRDDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTI 385 (681) T ss_pred C--CCCCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeeeecc-ccccceeeCCeEE Confidence 5 57999999999999999999999995 7999999999999874 699999999999999975 5799999999999 Q ss_pred EEecCCCceeEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEe Q lcl|NC_019510. 466 FASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQS 545 (799) Q Consensus 466 f~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~a 545 (799) |+|++|++ +|+|.|++++|+|+++|+|++++|++++..+..+|++++|.+++|++++||+|++|+|+ +||+|.| T Consensus 386 fv~~~g~~---vre~~y~~~~d~~~~~dlt~~a~Hl~~~~~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~---~eq~v~a 459 (681) T protein:vir:10 386 YGAARGGH---VRELAYNWQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFISSSGKLLGLTYV---PEQQIGA 459 (681) T ss_pred EEecCCCE---EEEEEEeeecCceeccchhhhhhhhcCCCCeEEEEEecCCCEEEEEEecCCcEEEEEEe---cccceee Confidence 99999954 78889999999999999999999999987777889999999999999999999999996 7888999 Q ss_pred eEeeecCCCeEEEEEEEe--CCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEccccccccccccccc Q lcl|NC_019510. 546 WSHWDFGDNVTVLAANSI--GSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVD 623 (799) Q Consensus 546 W~~w~~~g~~~~~~~~~~--~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~ 623 (799) ||||+++|+|++||++.+ +|.||++|+|++++..+++.+.++...........++||+.++. +. T Consensus 460 W~~~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~yie~~~~~~~~~~~~~~~vD~~~t~~--------~~------ 525 (681) T protein:vir:10 460 WHQHDTDGVFESCAVVAEGNEDRLYAVVRRTIGGNEVRYVERMASRQFDAQADAFFVDSGLTYS--------GE------ 525 (681) T ss_pred EEEEecCCcEEEEEEecCCCCcEEEEEEEecCCCCeEEEEEecCCccccccccceEeecccccc--------Cc------ Confidence 999999999999999875 48999999999998888887777654333333445666665442 21 Q ss_pred cccccCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccceeee Q lcl|NC_019510. 624 LNAVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTE 703 (799) Q Consensus 624 ~~~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~ 703 (799) +...++|+.|++|+++.+.+||.+++..+|.+|.+++ +.++++|+|||+|+++++||||+++.++|..+++ T Consensus 526 ~~~~~sgl~~leG~tv~i~aDG~~~~~~~V~~G~itl-------~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g~-- 596 (681) T protein:vir:10 526 PVSHISGLEHLEGKTVSILADGAVHPQRVVTDGAIDL-------DVEAGTVHIGLPITAELQTLPVAMQLDGSFGQGR-- 596 (681) T ss_pred ceeeeccccCCCCcEEEEEeCCeecCcEeecCcEEEe-------CcCCceEEEeeeceeEEEecceeeecCCcccCCc-- Confidence 2234679999999999999999999999999997654 4789999999999999999999999999877765 Q ss_pred ecceEEEEEEEEEeeccceEEEEecCCCcccceeccccccCcccccccccccceEEEEE-eecccceEEEEEECCCCcEE Q lcl|NC_019510. 704 DVGRLQHRRAWLNYEQSGAFYVDVTNLGRSYRYTMSGKPLGDTTLGQANLESGQFRFPL-AGNAQYNRVVLTSDYTTPLS 782 (799) Q Consensus 704 ~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~-~~~~~~~~v~i~~~~P~P~t 782 (799) +++|+|+.+++++|.++++.++....+... ++-++.+..+++++||++++|+ .+|+++.+|+|+|++|+||+ T Consensus 597 ---~~ri~rv~lr~~~S~g~~~~~~~~~l~~~~----~~~~~~~g~~~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~t 669 (681) T protein:vir:10 597 ---VKNINKLWLRVHRSSGIFAGPHADALTEVK----QRTSEPYGSPPALKSEEIPLVLSPKWGDSGQLFVRQADPLPLM 669 (681) T ss_pred ---eEEEEEEEEEEEcccceEEeeCCCceEEEE----EeccccccccCCccCCeEEEEeCCCcCcceEEEEEECCCcCEE Confidence 568999999999999999988755443222 2223344446889999999998 58999999999999999999 Q ss_pred EEEEEEEEEEec Q lcl|NC_019510. 783 IIGCGWEGNYIR 794 (799) Q Consensus 783 vl~i~~eg~y~~ 794 (799) |+||+||-...- T Consensus 670 vlsi~~ev~vgg 681 (681) T protein:vir:10 670 IVSMSAEIAIGA 681 (681) T ss_pred EEEeeEEEEeeC Confidence 999999998888 No 22 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=100.00 E-value=1e-164 Score=919.61 Aligned_cols=663 Identities=14% Similarity=0.122 Sum_probs=492.2 Q ss_pred CCceeeeccceecc-----cccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCce Q lcl|NC_019510. 1 MGLVSQSIKNLKGG-----ISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYE 75 (799) Q Consensus 1 M~~v~~s~~~~~gG-----VSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~ 75 (799) |+++...++||.+| +..|.|++||+++|++|+||++.|+||++|||||+||++++.. ....||+||.|+++| T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~---~~~~rlipf~~~~~~ 77 (681) T protein:vir:98 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDS---AKKVRLIPFTYSVTQ 77 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCC---CCcEEEEEEEeCCCc Confidence 99999999999999 6689999999999999999999999999999999999987654 467899999999999 Q ss_pred EEEEEEeCCeEEEEeCCCeEEE----EEecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceE Q lcl|NC_019510. 76 QYYVVFTGGDIKVFDLNGQEYA----VRGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDAL 151 (799) Q Consensus 76 ~y~l~~~~g~irv~~~~G~~~~----v~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~ 151 (799) +|+|||++++||||..+|.++. ++..+|| ++.++.+|+|+|+||+|||||++|||++|.|.++.+ |....+.| T Consensus 78 ~~~l~~g~~~~r~~~~~~~~~~~~~~~~~~tpy--~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~-W~l~~~~f 154 (681) T protein:vir:98 78 TMVIELGAGYFRFHTNGGTLLDGAVPYEIANPY--AEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATN-WQLATIAF 154 (681) T ss_pred eEEEEEeCCeEEEEeCCcEEeeCcEeEEecCCC--ChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCc-eEEEEEEe Confidence 9999999999999987776542 3445554 667789999999999999999999999999988754 32222221 Q ss_pred EEEecCCCCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEec Q lcl|NC_019510. 152 INVRGGQYGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAP 231 (799) Q Consensus 152 ~~v~~~~y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~ 231 (799) ... .+.. -+++ + +....+. ..++. .++.+. T Consensus 155 ~~~---p~~p-~~~~------a---t~~~~~~--------------------------------~~t~~-----~~v~av 184 (681) T protein:vir:98 155 TSP---VATP-TSVT------A---TSNNKGT--------------------------------DYTYR-----YVVTAL 184 (681) T ss_pred ccc---cccc-eeee------e---eccCCcc--------------------------------ceeEe-----EEEEEe Confidence 110 0000 0000 0 0000000 00000 001111 Q ss_pred CCCceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccceeec Q lcl|NC_019510. 232 DGDSIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVGLNN 311 (799) Q Consensus 232 ~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~ 311 (799) +...... +......+ ++.. .+ ..+....+..........+ --+....+.|.- .+......+ T Consensus 185 da~t~~~--s~~~~~~t----vt~~--~~-------~~~~~~t~~w~a~~g~~~~-~V~~~~~gi~g~-ig~~~~~~~-- 245 (681) T protein:vir:98 185 DAEGKTE--SAPSSAGT----CTNN--LF-------TNGGANTIAWSASSGASRY-NVYKEQGGLYGY-IGQTTGTSL-- 245 (681) T ss_pred eccccee--ecCCcceE----Eeee--ee-------cCCcceeEEEEecCCceee-eecccceeEEEE-eeccceeee-- Confidence 1000000 00000000 0000 00 0001111111111111111 111122222321 111111111 Q ss_pred cceeeEEEeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEe----cCCeEEEEccCCcccccccc Q lcl|NC_019510. 312 GTMPWSLIRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGML----SGENIILSRTAKYFNMYPAS 387 (799) Q Consensus 312 ~t~p~~lv~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~----~~~~v~~Sr~gd~~nF~~~t 387 (799) ......+......+...+ .++-.++||++|+||||||+|+ +|++|||||+|||+||++++ T Consensus 246 ---------------~~~~~~~~~~~t~~~~~~-~~~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~ 309 (681) T protein:vir:98 246 ---------------VDDNIAPDLSVTPPIYDA-VFNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSL 309 (681) T ss_pred ---------------eecccccCcccccccccc-ccccCCCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccC Confidence 111111111111222223 3455678999999999999999 47899999999999999998 Q ss_pred ccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCC--ccccccceEEEEEEeecccCCCCcEEeCCeEE Q lcl|NC_019510. 388 VAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNAS--GVLSAKSVELNLTTEFDVNDGARPYGIGRGVY 465 (799) Q Consensus 388 ~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~--~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~ 465 (799) + ++|||||+++++++++|.|+|+++++ +|+|||+++||.|+++ ++|||+|++++++|.|++ ++|+|+.+|++++ T Consensus 310 ~--~~ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~ 385 (681) T protein:vir:98 310 P--VRDDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTI 385 (681) T ss_pred C--CCCCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeeeecc-ccccceeeCCeEE Confidence 5 57999999999999999999999995 7999999999999874 699999999999999975 5799999999999 Q ss_pred EEecCCCceeEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEe Q lcl|NC_019510. 466 FASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQS 545 (799) Q Consensus 466 f~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~a 545 (799) |+|++|++ +|+|.|++++|+|+++|+|++++|++++..+..+|++++|.+++|++++||+|++|+|+ +||+|.| T Consensus 386 fv~~~g~~---vre~~y~~~~d~~~~~dlt~~a~Hl~~~~~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~---~eq~v~a 459 (681) T protein:vir:98 386 YGAARGGH---VRELAYNWQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFISSSGKLLGLTYV---PEQQIGA 459 (681) T ss_pred EEecCCCE---EEEEEEeeecCceeccchhhhhhhhcCCCCeEEEEEecCCCEEEEEEecCCcEEEEEEe---cccceee Confidence 99999954 78889999999999999999999999987777889999999999999999999999996 7888999 Q ss_pred eEeeecCCCeEEEEEEEe--CCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEccccccccccccccc Q lcl|NC_019510. 546 WSHWDFGDNVTVLAANSI--GSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVD 623 (799) Q Consensus 546 W~~w~~~g~~~~~~~~~~--~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~ 623 (799) ||||+++|+|++||++.+ +|.||++|+|++++..+++.+.++...........++||+.++. +. T Consensus 460 W~~~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~yie~~~~~~~~~~~~~~~vD~~~t~~--------~~------ 525 (681) T protein:vir:98 460 WHQHDTDGVFESCAVVAEGNEDRLYAVVRRTIGGNEVRYVERMASRQFDAQADAFFVDSGLTYS--------GE------ 525 (681) T ss_pred EEEEecCCcEEEEEEecCCCCcEEEEEEEecCCCCeEEEEEecCCccccccccceEeecccccc--------Cc------ Confidence 999999999999999875 48999999999998888887777654333333445666665442 21 Q ss_pred cccccCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccceeee Q lcl|NC_019510. 624 LNAVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTE 703 (799) Q Consensus 624 ~~~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~ 703 (799) +...++|+.|++|+++.+.+||.+++..+|.+|.+++ +.++++|+|||+|+++++||||+++.++|..+++ T Consensus 526 ~~~~~sgl~~leG~tv~i~aDG~~~~~~~V~~G~itl-------~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g~-- 596 (681) T protein:vir:98 526 PVSHISGLEHLEGKTVSILADGAVHPQRVVTDGAIDL-------DVEAGTVHIGLPITAELQTLPVAMQLDGSFGQGR-- 596 (681) T ss_pred ceeeeccccCCCCcEEEEEeCCeecCcEeecCcEEEe-------CcCCceEEEeeeceeEEEecceeeecCCcccCCc-- Confidence 2234679999999999999999999999999997654 4789999999999999999999999999877765 Q ss_pred ecceEEEEEEEEEeeccceEEEEecCCCcccceeccccccCcccccccccccceEEEEE-eecccceEEEEEECCCCcEE Q lcl|NC_019510. 704 DVGRLQHRRAWLNYEQSGAFYVDVTNLGRSYRYTMSGKPLGDTTLGQANLESGQFRFPL-AGNAQYNRVVLTSDYTTPLS 782 (799) Q Consensus 704 ~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~-~~~~~~~~v~i~~~~P~P~t 782 (799) +++|+|+.+++++|.++++.++....+... ++-++.+..+++++||++++|+ .+|+++.+|+|+|++|+||+ T Consensus 597 ---~~ri~rv~lr~~~S~g~~~~~~~~~l~~~~----~~~~~~~g~~~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~t 669 (681) T protein:vir:98 597 ---VKNINKLWLRVHRSSGIFAGPHADALTEVK----QRTSEPYGSPPALKSEEIPLVLSPKWGDSGQLFVRQADPLPLM 669 (681) T ss_pred ---eEEEEEEEEEEEcccceEEeeCCCceEEEE----EeccccccccCCccCCeEEEEeCCCcCcceEEEEEECCCcCEE Confidence 568999999999999999988755443222 2223344446889999999998 58999999999999999999 Q ss_pred EEEEEEEEEEec Q lcl|NC_019510. 783 IIGCGWEGNYIR 794 (799) Q Consensus 783 vl~i~~eg~y~~ 794 (799) |+||+||-...- T Consensus 670 vlsi~~ev~vgg 681 (681) T protein:vir:98 670 IVSMSAEIAIGA 681 (681) T ss_pred EEEeeEEEEeeC Confidence 999999998888 No 23 >protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878 Probab=100.00 E-value=1e-164 Score=919.61 Aligned_cols=663 Identities=14% Similarity=0.122 Sum_probs=492.2 Q ss_pred CCceeeeccceecc-----cccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCce Q lcl|NC_019510. 1 MGLVSQSIKNLKGG-----ISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYE 75 (799) Q Consensus 1 M~~v~~s~~~~~gG-----VSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~ 75 (799) |+++...++||.+| +..|.|++||+++|++|+||++.|+||++|||||+||++++.. ....||+||.|+++| T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~---~~~~rlipf~~~~~~ 77 (681) T protein:vir:10 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDS---AKKVRLIPFTYSVTQ 77 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCC---CCcEEEEEEEeCCCc Confidence 99999999999999 6689999999999999999999999999999999999987654 467899999999999 Q ss_pred EEEEEEeCCeEEEEeCCCeEEE----EEecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceE Q lcl|NC_019510. 76 QYYVVFTGGDIKVFDLNGQEYA----VRGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDAL 151 (799) Q Consensus 76 ~y~l~~~~g~irv~~~~G~~~~----v~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~ 151 (799) +|+|||++++||||..+|.++. ++..+|| ++.++.+|+|+|+||+|||||++|||++|.|.++.+ |....+.| T Consensus 78 ~~~l~~g~~~~r~~~~~~~~~~~~~~~~~~tpy--~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~-W~l~~~~f 154 (681) T protein:vir:10 78 TMVIELGAGYFRFHTNGGTLLDGAVPYEIANPY--AEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATN-WQLATIAF 154 (681) T ss_pred eEEEEEeCCeEEEEeCCcEEeeCcEeEEecCCC--ChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCc-eEEEEEEe Confidence 9999999999999987776542 3445554 667789999999999999999999999999988754 32222221 Q ss_pred EEEecCCCCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEec Q lcl|NC_019510. 152 INVRGGQYGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAP 231 (799) Q Consensus 152 ~~v~~~~y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~ 231 (799) ... .+.. -+++ + +....+. ..++. .++.+. T Consensus 155 ~~~---p~~p-~~~~------a---t~~~~~~--------------------------------~~t~~-----~~v~av 184 (681) T protein:vir:10 155 TSP---VATP-TSVT------A---TSNNKGT--------------------------------DYTYR-----YVVTAL 184 (681) T ss_pred ccc---cccc-eeee------e---eccCCcc--------------------------------ceeEe-----EEEEEe Confidence 110 0000 0000 0 0000000 00000 001111 Q ss_pred CCCceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccceeec Q lcl|NC_019510. 232 DGDSIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVGLNN 311 (799) Q Consensus 232 ~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~ 311 (799) +...... +......+ ++.. .+ ..+....+..........+ --+....+.|.- .+......+ T Consensus 185 da~t~~~--s~~~~~~t----vt~~--~~-------~~~~~~t~~w~a~~g~~~~-~V~~~~~gi~g~-ig~~~~~~~-- 245 (681) T protein:vir:10 185 DAEGKTE--SAPSSAGT----CTNN--LF-------TNGGANTIAWSASSGASRY-NVYKEQGGLYGY-IGQTTGTSL-- 245 (681) T ss_pred eccccee--ecCCcceE----Eeee--ee-------cCCcceeEEEEecCCceee-eecccceeEEEE-eeccceeee-- Confidence 1000000 00000000 0000 00 0001111111111111111 111122222321 111111111 Q ss_pred cceeeEEEeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEe----cCCeEEEEccCCcccccccc Q lcl|NC_019510. 312 GTMPWSLIRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGML----SGENIILSRTAKYFNMYPAS 387 (799) Q Consensus 312 ~t~p~~lv~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~----~~~~v~~Sr~gd~~nF~~~t 387 (799) ......+......+...+ .++-.++||++|+||||||+|+ +|++|||||+|||+||++++ T Consensus 246 ---------------~~~~~~~~~~~t~~~~~~-~~~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~ 309 (681) T protein:vir:10 246 ---------------VDDNIAPDLSVTPPIYDA-VFNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSL 309 (681) T ss_pred ---------------eecccccCcccccccccc-ccccCCCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccC Confidence 111111111111222223 3455678999999999999999 47899999999999999998 Q ss_pred ccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCC--ccccccceEEEEEEeecccCCCCcEEeCCeEE Q lcl|NC_019510. 388 VAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNAS--GVLSAKSVELNLTTEFDVNDGARPYGIGRGVY 465 (799) Q Consensus 388 ~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~--~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~ 465 (799) + ++|||||+++++++++|.|+|+++++ +|+|||+++||.|+++ ++|||+|++++++|.|++ ++|+|+.+|++++ T Consensus 310 ~--~~ddD~i~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~ 385 (681) T protein:vir:10 310 P--VRDDDRVAFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTI 385 (681) T ss_pred C--CCCCccEEEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeeeecc-ccccceeeCCeEE Confidence 5 57999999999999999999999995 7999999999999874 699999999999999975 5799999999999 Q ss_pred EEecCCCceeEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEe Q lcl|NC_019510. 466 FASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQS 545 (799) Q Consensus 466 f~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~a 545 (799) |+|++|++ +|+|.|++++|+|+++|+|++++|++++..+..+|++++|.+++|++++||+|++|+|+ +||+|.| T Consensus 386 fv~~~g~~---vre~~y~~~~d~~~~~dlt~~a~Hl~~~~~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~---~eq~v~a 459 (681) T protein:vir:10 386 YGAARGGH---VRELAYNWQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFISSSGKLLGLTYV---PEQQIGA 459 (681) T ss_pred EEecCCCE---EEEEEEeeecCceeccchhhhhhhhcCCCCeEEEEEecCCCEEEEEEecCCcEEEEEEe---cccceee Confidence 99999954 78889999999999999999999999987777889999999999999999999999996 7888999 Q ss_pred eEeeecCCCeEEEEEEEe--CCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEccccccccccccccc Q lcl|NC_019510. 546 WSHWDFGDNVTVLAANSI--GSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVD 623 (799) Q Consensus 546 W~~w~~~g~~~~~~~~~~--~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~ 623 (799) ||||+++|+|++||++.+ +|.||++|+|++++..+++.+.++...........++||+.++. +. T Consensus 460 W~~~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g~~~~yie~~~~~~~~~~~~~~~vD~~~t~~--------~~------ 525 (681) T protein:vir:10 460 WHQHDTDGVFESCAVVAEGNEDRLYAVVRRTIGGNEVRYVERMASRQFDAQADAFFVDSGLTYS--------GE------ 525 (681) T ss_pred EEEEecCCcEEEEEEecCCCCcEEEEEEEecCCCCeEEEEEecCCccccccccceEeecccccc--------Cc------ Confidence 999999999999999875 48999999999998888887777654333333445666665442 21 Q ss_pred cccccCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccceeee Q lcl|NC_019510. 624 LNAVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTE 703 (799) Q Consensus 624 ~~~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~ 703 (799) +...++|+.|++|+++.+.+||.+++..+|.+|.+++ +.++++|+|||+|+++++||||+++.++|..+++ T Consensus 526 ~~~~~sgl~~leG~tv~i~aDG~~~~~~~V~~G~itl-------~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g~-- 596 (681) T protein:vir:10 526 PVSHISGLEHLEGKTVSILADGAVHPQRVVTDGAIDL-------DVEAGTVHIGLPITAELQTLPVAMQLDGSFGQGR-- 596 (681) T ss_pred ceeeeccccCCCCcEEEEEeCCeecCcEeecCcEEEe-------CcCCceEEEeeeceeEEEecceeeecCCcccCCc-- Confidence 2234679999999999999999999999999997654 4789999999999999999999999999877765 Q ss_pred ecceEEEEEEEEEeeccceEEEEecCCCcccceeccccccCcccccccccccceEEEEE-eecccceEEEEEECCCCcEE Q lcl|NC_019510. 704 DVGRLQHRRAWLNYEQSGAFYVDVTNLGRSYRYTMSGKPLGDTTLGQANLESGQFRFPL-AGNAQYNRVVLTSDYTTPLS 782 (799) Q Consensus 704 ~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~vp~-~~~~~~~~v~i~~~~P~P~t 782 (799) +++|+|+.+++++|.++++.++....+... ++-++.+..+++++||++++|+ .+|+++.+|+|+|++|+||+ T Consensus 597 ---~~ri~rv~lr~~~S~g~~~~~~~~~l~~~~----~~~~~~~g~~~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~t 669 (681) T protein:vir:10 597 ---VKNINKLWLRVHRSSGIFAGPHADALTEVK----QRTSEPYGSPPALKSEEIPLVLSPKWGDSGQLFVRQADPLPLM 669 (681) T ss_pred ---eEEEEEEEEEEEcccceEEeeCCCceEEEE----EeccccccccCCccCCeEEEEeCCCcCcceEEEEEECCCcCEE Confidence 568999999999999999988755443222 2223344446889999999998 58999999999999999999 Q ss_pred EEEEEEEEEEec Q lcl|NC_019510. 783 IIGCGWEGNYIR 794 (799) Q Consensus 783 vl~i~~eg~y~~ 794 (799) |+||+||-...- T Consensus 670 vlsi~~ev~vgg 681 (681) T protein:vir:10 670 IVSMSAEIAIGA 681 (681) T ss_pred EEEeeEEEEeeC Confidence 999999998888 No 24 >protein:vir:1778 Length: 680 # NCBI annotation: tail protein A # Family: family:all:825 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570344;genbank:gi:18640503;genbank:GeneID:932716 Probab=100.00 E-value=3.9e-160 Score=894.52 Aligned_cols=547 Identities=27% Similarity=0.442 Sum_probs=476.3 Q ss_pred CCceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCceEEEEE Q lcl|NC_019510. 1 MGLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYEQYYVV 80 (799) Q Consensus 1 M~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~~y~l~ 80 (799) |++|+|+||||++|||||+|.+|+|+||++|+||+|+|+.||+||||++||+.|... .....+|+++|++.|+|++. T Consensus 1 M~~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRpg~~~i~~l~~~---~~~~~~~~~~rd~~e~~~~~ 77 (680) T protein:vir:17 1 MAAVEQMVPNLLGGISQQPDPLKLPGQVKQARNVQLDPTFGALKRPGTELIMQVTGI---PKRAKWIPIMRDAREHYYVA 77 (680) T ss_pred CccceecchhhhCcceecchhhcCcchhhhhhccccCcCcccccCccceeeeeccCC---CCCceeEEEecCCCCeEEEE Confidence 999999999999999999999999999999999999999999999999999988654 45677899999999999988 Q ss_pred EeCCe--------EEEEeC-CCeEEEEEecCcccc-----ccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCC Q lcl|NC_019510. 81 FTGGD--------IKVFDL-NGQEYAVRGDKSYVQ-----TANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFND 146 (799) Q Consensus 81 ~~~g~--------irv~~~-~G~~~~v~~~~~y~~-----t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~ 146 (799) +.... |||||+ +|.+++|++...+++ ++++..+||++++||++||+|+++.|++.+. +..+ T Consensus 78 ~~~~g~~~~~~~~i~v~d~~~G~~~~v~~~~~~~~~~~~~~~~~~~~lr~~tv~d~tfi~N~~v~~~~~~~-----~~~~ 152 (680) T protein:vir:17 78 IYREGANESGDLRIRVFDLKAGVERAVSFVGGEVEEYFPGDETDWEAIRSLTIGDYTFLSNPNVQPTTWSR-----SFSR 152 (680) T ss_pred EEcCCCcccccceeEEEEccCCeEEEEEcCCCceEEEeecCCCCccceEEEEEcCEEEEECCeEEEeccCC-----CCCC Confidence 87543 999996 799999998887653 3556679999999999999999999987753 3456 Q ss_pred CcceEEEEecCCCCceeeEEeccceEEE-------EEe-------------------------------------c---- Q lcl|NC_019510. 147 QKDALINVRGGQYGRTLNVIFNEATRAT-------IKL-------------------------------------P---- 178 (799) Q Consensus 147 ~~~~~~~v~~~~y~~ty~v~~~~~~~~~-------~~t-------------------------------------p---- 178 (799) .+.+++.++.++|+++|.|++++..... ... . T Consensus 153 ~~~g~~~v~~~ayg~ty~v~ing~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~Ag~~t~~~~~~a~la~~l~~~~~~~~~ 232 (680) T protein:vir:17 153 RPEGLVTIGAAGYGTSYIVDFATEDSGQQRRWAVQEMQAPKTKRKKGDGSPDEAGETTVNNWNGTGLSFRVKVEARAFLV 232 (680) T ss_pred CCeeEEEEEEeeeeeEEEEEEeccccceeeeeeeeeeeccccccccccccCCCCcceeeeeeeeeeeeeeeeeccceeee Confidence 7789999999999999999987632100 000 0 Q ss_pred -CCcc----------------------ccc-----------eeccc-----------------ccccccccchhhhhhcc Q lcl|NC_019510. 179 -SGTG----------------------TTP-----------PIEEQ-----------------VAAVDAQHIAEELAKQI 207 (799) Q Consensus 179 -~gt~----------------------~~~-----------~~~~~-----------------~a~~~~~~i~~~l~~~~ 207 (799) .+.. ..+ .+... .........++.|+.++ T Consensus 233 ~~g~~~~~~y~~~~~l~~tg~~~~~~~~t~~v~~~G~~y~IsI~~~~~~~~~~~~~s~~~~t~~~~~a~~at~~~Ia~~L 312 (680) T protein:vir:17 233 DDGEEYGHNYIPYVTLLTPGNNTSPFPDTIRVDVSGEGWDIKVTKQIQSKVYANLGTAQFTTPVDQSGGGASTSDIVTGL 312 (680) T ss_pred cCCCceEEEEeeEEEEecCCccccccCceEEEecccceeEEEEccceeeEeccCccceeeeeccCCcccceeHHHHHHHH Confidence 0000 000 00000 00000001222344444 Q ss_pred ccccccccceEEEECCcEEEEEec--CCCceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcce Q lcl|NC_019510. 208 RESLAGNPGWTINVGTGFVNIIAP--DGDSIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADK 285 (799) Q Consensus 208 ~~~~a~~~~~t~~~~g~~i~i~a~--~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~ 285 (799) .......++|++.+.+++++|... .+...+.+++++|.+++++.++++.|+++++||.+|++||.|+|.++.+++.++ T Consensus 313 ~~~i~~~~~~~~~~~g~~i~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~a~~g~~v~v~~~~~~~~~~ 392 (680) T protein:vir:17 313 SAAINGLGTFTAESIGNVIRVRYSDPTRTDEFTMSARGGTSGTGLESIKYSVDTLAELPTKCWNDYQVAVRNTQDTEVDD 392 (680) T ss_pred HHhhcccCcEEEEECCCEEEEEeccCCCceEEEeeccCCCCceeeeeeeeeeccccccccccCCCcEEEEEeCCCCcccc Confidence 444456689999999999999654 555678889999999999999999999999999999999999999999999999 Q ss_pred eEEEEec--------CceEEEEeccccccceeeccceeeEEEeccCceEEEeecc-------cCccccCccccccCcccc Q lcl|NC_019510. 286 YYVRYNL--------TRKVWEETVGWNIQVGLNNGTMPWSLIRAADGQFDFVANS-------WVGRTAGDDDTNPHPSFV 350 (799) Q Consensus 286 ~y~~~~~--------~~~~w~e~~~~~~~~~~~~~t~p~~lv~~~~~t~~~~~~~-------w~~~~~gd~~~np~psf~ 350 (799) ||++|+. ..+.|+||++|++..+++.+||||.|++.+.|.|.++..+ |++|.+|||++||.|+|+ T Consensus 393 Yyv~~~~~~~~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~r~~~g~f~~~~~~~~~~~~~~~~r~~Gdd~tnp~psF~ 472 (680) T protein:vir:17 393 YYVKFETDVEDADVPGSGYWVETVKNGDDGGLVDDTMPHVLVRNALGDFTFSSLNNSSYGKTWADRSVGSEDTNPHPTFT 472 (680) T ss_pred eEEEEeccCcccCcccccceeecccCcccceeccCcceEEEEEccCceeEEEeeccccccccccccccCCcccCCCcccc Confidence 9999975 4568999999999999999999999999999999999876 999999999999999999 Q ss_pred --CCCeeEEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEE Q lcl|NC_019510. 351 --GQAITDVFFYRNRLGMLSGENIILSRTAKYFNMYPASVAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQF 428 (799) Q Consensus 351 --~~~p~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~ 428 (799) |+||++|+||||||+|+++++|||||+||||||++++++++.|||||+++++++++|+|+|+++++++|+|||+++|| T Consensus 473 ~~G~~p~~v~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~t~g~q~ 552 (680) T protein:vir:17 473 ESGNGIYGMFMYKNRLGFLTQDAVIMSQVGDYFNFYATSGVTISDADPIDMATSDTKPVKLEAAISSTSGAILFGNQAQF 552 (680) T ss_pred cCCCCceEEEEEcceEEEeeCCeEEEEccCCcccccccccccCCCCccEEEEEcCCcceeeeEEeecCCcEEEEecCeEE Confidence 889999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeCC-ccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccCceehhhHHHHHHHhcCCCcE Q lcl|NC_019510. 429 VLNAS-GVLSAKSVELNLTTEFDVNDGARPYGIGRGVYFASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVF 507 (799) Q Consensus 429 ~i~~~-~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~ 507 (799) +|+++ ++|||+|++++++|+|+|++.|+|+.+|+.++|++++|+|++++| |+|++++|+|+++|||+|++|||+|+++ T Consensus 553 ~ls~~~~~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~~s~vre-~~y~~~~d~y~a~DlT~~a~hl~~g~v~ 631 (680) T protein:vir:17 553 RLSSPDESFGPKTATLDKISNYTYESKADPVQTGVSMIFPTNMGTYSSVYE-LSTESAKGTPVIEDSSRVIPRLIPSGLT 631 (680) T ss_pred EEecCCceecceeEEEEEEEeecccCCCCceEeCCeEEEeecCCCcceEEE-EeeeeccCceehhhHHHHHHHhcCCceE Confidence 99985 599999999999999999999999999999999999998877655 6999999999999999999999999999 Q ss_pred EEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEeeEeeecCCCeE Q lcl|NC_019510. 508 SISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQSWSHWDFGDNVT 556 (799) Q Consensus 508 ~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~w~~~g~~~ 556 (799) .+++++++|.+++|++++||+|++|+|||+++||+|+|||||+|++.-. T Consensus 632 ~~~~~~~~~~~~~~~~~~~~~l~~~~yl~~~~e~~v~aW~rw~~~~~d~ 680 (680) T protein:vir:17 632 WSTASMNNDTVFFGNAKKGRNVYVFRFFNEGQERKVAGWTTWYYEDQDH 680 (680) T ss_pred EEEeeCCCCeEEEEEEcCCCEEEEEEEeeCCCceEEEEEEEEecCCCCC Confidence 9999999999999999999999999999999999999999999987633 No 25 >protein:vir:102644 Length: 594 # NCBI annotation: Hypothetical protein # Family: family:all:780 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024422;genbank:gi:48696643;genbank:GeneID:2948111 Probab=100.00 E-value=5.6e-143 Score=800.46 Aligned_cols=563 Identities=11% Similarity=0.011 Sum_probs=453.7 Q ss_pred CCceeeeccceecc-----cccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCce Q lcl|NC_019510. 1 MGLVSQSIKNLKGG-----ISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYE 75 (799) Q Consensus 1 M~~v~~s~~~~~gG-----VSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~ 75 (799) |+++. ++||.+| +..|.|++||+++|++|+||++.|+||++||||++|++++++. +++.+|+||.|+++| T Consensus 1 m~~~~--~~~F~~GelsP~l~~r~Dl~~y~~~~~~~~n~~~~~~G~~~rR~G~~~~~~~~~~---~~~~~lipF~~s~~~ 75 (594) T protein:vir:10 1 MADFS--QTSFKGGVIAPRLQFNEYESAYHHSIEDAVNFVVTEQGSLITRCGSEEVGLCQDG---EVRLFRLPAVDAPSN 75 (594) T ss_pred Cceee--ccccCcceecceeccchhHHHHHHHHhhhhceEEEecCCeecCChhHhhhhccCC---CCCEEEEEEEeCCCC Confidence 99985 6999999 5579999999999999999999999999999999999987643 677899999999999 Q ss_pred EEEEEEeCCeEEEEeCCCeEEEEEecCccc----cc---cCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCc Q lcl|NC_019510. 76 QYYVVFTGGDIKVFDLNGQEYAVRGDKSYV----QT---ANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQK 148 (799) Q Consensus 76 ~y~l~~~~g~irv~~~~G~~~~v~~~~~y~----~t---~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~ 148 (799) +|+|||+++++|||..+|.++..+.+.+|. ++ ..++.+|+|+|++|++|++|++++|++|.|.++.. T Consensus 76 ~~~le~g~~~~r~~~~~~~~v~~~~~~~~~~~tp~~~t~~~~l~~i~~tqsad~~~~~~~~~~p~~L~R~~~~~------ 149 (594) T protein:vir:10 76 DVIVEVGNTNIAVWVNDVRQVVANTPSEWRNTIDRIQTAYDTIGDDAGAANTGRLIMVHPALQPKRLYRDNNNA------ 149 (594) T ss_pred eEEEEEcCCeEEEEecCcEEEEccCCCcccccccceeeccCCccceEEEEEeeEEEEEcCCCCceEEEEccCCC------ Confidence 999999999999999888776655555542 11 34578899999999999999999999998754321 Q ss_pred ceEEEEecCCCCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEE Q lcl|NC_019510. 149 DALINVRGGQYGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNI 228 (799) Q Consensus 149 ~~~~~v~~~~y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i 228 (799) |.+..+ . T Consensus 150 w~~~~~-------------------------------------------------------------~------------ 156 (594) T protein:vir:10 150 WQFVNM-------------------------------------------------------------H------------ 156 (594) T ss_pred ceEEec-------------------------------------------------------------c------------ Confidence 110000 0 Q ss_pred EecCCCceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccce Q lcl|NC_019510. 229 IAPDGDSIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVG 308 (799) Q Consensus 229 ~a~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~ 308 (799) |... T Consensus 157 --------------------------------------------------------------------~~~~-------- 160 (594) T protein:vir:10 157 --------------------------------------------------------------------TGAV-------- 160 (594) T ss_pred --------------------------------------------------------------------cCcc-------- Confidence 0000 Q ss_pred eeccceeeEEEeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEecC----CeEEEEccCCccccc Q lcl|NC_019510. 309 LNNGTMPWSLIRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLSG----ENIILSRTAKYFNMY 384 (799) Q Consensus 309 ~~~~t~p~~lv~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~~----~~v~~Sr~gd~~nF~ 384 (799) ...|. ..+||++|+||||||+|++. ++|||||+|||+||+ T Consensus 161 ---------------------p~~~~---------------~~~~p~~v~f~q~RL~f~~~~~~p~~v~~Srtgd~~nF~ 204 (594) T protein:vir:10 161 ---------------------PAEWS---------------PSNYPQTVGIFQNRVWYVGSPVHRTYFWATRAGKLEDIA 204 (594) T ss_pred ---------------------ccccc---------------CCccceEEEEEeeeEEEEeCCCCCceEEEEecccccccc Confidence 00000 12689999999999999984 689999999999999 Q ss_pred cccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCC--ccccccceEEEEEEeecccCCCCcEEeCC Q lcl|NC_019510. 385 PASVAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNAS--GVLSAKSVELNLTTEFDVNDGARPYGIGR 462 (799) Q Consensus 385 ~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~--~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~ 462 (799) +++++ .|||||++.+ +++.++| |+++++++|+|||+++||+|+++ ++|||+|+.++++|.+ |++.|+|+.+|+ T Consensus 205 ~~~~~--~ddd~i~~~~-s~~~~~~-~~v~~~~~L~i~t~~~e~~l~~~~~~~lTp~~~~~~~~s~~-g~~~~~P~~vg~ 279 (594) T protein:vir:10 205 PSTAN--NPNDPISFVG-IMEGTPC-WIIASSDVLTIGTTINDYQLAASTGVSVTAATAILRRSSVQ-GTAAVQGIPAEE 279 (594) T ss_pred cCCCC--CCCccEEEEE-ecccceE-EEEecCCceEEEecCceEEEecCCCcccccceEEEEEeeee-ccCCCcceeeCC Confidence 99865 6999999954 4565555 56778889999999999999885 5899999999999965 778999999999 Q ss_pred eEEEEecCCCceeEEEEEeeccccCceehhhHHHHHHHhcC------CCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeee Q lcl|NC_019510. 463 GVYFASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIP------NGVFSISGSSTENFATVLTSGAKGKVFIYKFLY 536 (799) Q Consensus 463 ~v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~------g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~ 536 (799) .++|+|++|+ .+|+|.|++++|+|+++|||+|++|||+ ++.+..+|++++|++++|++++||.|++++|+ T Consensus 280 ~~~fv~~~g~---~vre~~y~~~~d~y~~~dlt~~a~hl~~~~~~~~~~~i~~~a~~~~p~~~~~~v~~dG~l~~~ty~- 355 (594) T protein:vir:10 280 QVIFCSRNKS---KVYAMNYVREQDNWIPDEMSSQAQHLFTPISSAKGASVRRVAYISDAAKSLWVVLENGQINYCCFD- 355 (594) T ss_pred eEEEEcCCCC---EEEEEEEeeccCceeccchhhhhhhhcCccccccCceEEEEEEecCCceEEEEEeCCCeEEEEEEe- Confidence 9999999985 4788899999999999999999999984 45577789999999999999999999999995 Q ss_pred CCCceeeEeeEeee-cCCCeEEEEEEEe--CCEEEEEEEeC--CCEEEEEEEEEeeccCCCccccceeeceeEEEEEccc Q lcl|NC_019510. 537 IDEQIQQQSWSHWD-FGDNVTVLAANSI--GSHMHVILQNG--YDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPAN 611 (799) Q Consensus 537 ~~~e~~v~aW~~w~-~~g~~~~~~~~~~--~d~l~~~v~r~--~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~ 611 (799) +||+|.|||||+ ++|.|+++|++++ +|++|++|+|. +++..+++ +.+|.++.. ...|+...++++++ T Consensus 356 --~eq~v~aWs~~~~t~G~v~~va~i~~~~~d~l~~~V~R~~ti~g~~~~y-~~lE~~~~~-----~~~~~~~~~~~d~~ 427 (594) T protein:vir:10 356 --RTTDTKAWTQLELSGGKVIDIAAAFNPDSDYAYVAVVRSKAINGVQKNY-TVLEKISSP-----RTDWKRADGWVVAQ 427 (594) T ss_pred --cccceeeeEeeccCCCcEEEEEEeecCCCCEEEEEEEECCccccceeeE-EEeecCCCc-----cccccccceeeeec Confidence 889999999998 5899999999875 68999999995 46666665 223333222 22344444555554 Q ss_pred cccccccccccccccccCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeE Q lcl|NC_019510. 612 AFNNDRYETTVDLNAVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLI 691 (799) Q Consensus 612 ~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~ 691 (799) ..+. ...+++.||+|+++.+++||.+++..+|.+|.++++ ..++.++++|||||+|+++|+|+||++ T Consensus 428 ~~~~----------~~vsgl~hLeg~tv~v~aDG~~~~~~~V~~g~itL~---~~~~~~~~~v~VGl~Y~s~i~~lp~~~ 494 (594) T protein:vir:10 428 VNQN----------GDVLNLDRYIGRTAVIFSKYGLEAEVEVNNIGLTHR---INGYDPNTVYYVGYKMDSYFRTLTPSN 494 (594) T ss_pred cccc----------ceeecccccCCceEEEEeCCeecCCeEEcCCeeEee---ccCCCCcceEEEeeeeeEEEEeecccc Confidence 4432 224689999999999999999999999999987653 567789999999999999999999999 Q ss_pred EcCCCccceeeeecceEEEEEEEEEeeccceEEEEecCCCcccceeccccccCcccccccccccceEE--EEEeecccce Q lcl|NC_019510. 692 KKQDESGGFSTEDVGRLQHRRAWLNYEQSGAFYVDVTNLGRSYRYTMSGKPLGDTTLGQANLESGQFR--FPLAGNAQYN 769 (799) Q Consensus 692 ~~~~g~~~~~~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~--vp~~~~~~~~ 769 (799) ++++|+.+++ |+||+|++|+|++|.+++++.+.............+ .....+.+++++|+.+ ++..||+++. T Consensus 495 ~~~~gs~~g~-----r~ri~r~~v~~~~S~g~~vg~~~~~~r~~~~~~~~~-~~~~~g~~~~~tg~~~v~~~~~G~~~~~ 568 (594) T protein:vir:10 495 GDMKKSMFGS-----KIRISKVQLALFDSIEPTVNGEPADDRSTDDIMDAR-LLDFSSNSGSSNGTRLVDYNPLGWENDG 568 (594) T ss_pred cCCcccccCc-----cEEEEEEEEEEEcceeeEECCcccccccchhhcccc-CCcccCcccccCCceEEEEccCCcCccc Confidence 9999887765 789999999999999998876532211111100011 1234456777777654 5567999999 Q ss_pred EEEEEECCCCcEEEEEEEEEEEEecc Q lcl|NC_019510. 770 RVVLTSDYTTPLSIIGCGWEGNYIRR 795 (799) Q Consensus 770 ~v~i~~~~P~P~tvl~i~~eg~y~~r 795 (799) +|+|+|++||||||+||.+|...|+= T Consensus 569 ~i~I~qd~PlPltvlai~~ev~~~~~ 594 (594) T protein:vir:10 569 KMVIAVEQPFLCEVVGVFSVVQSNKV 594 (594) T ss_pred EEEEEECCCcCEEEEEEEEEEEeccC Confidence 99999999999999999999999998 No 26 >protein:vir:94602 Length: 1012 # NCBI annotation: PfWMP4_35 # Family: family:all:12083 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762665;genbank:gi:115304373;genbank:GeneID:5142302 Probab=99.32 E-value=1.1e-10 Score=75.25 Aligned_cols=765 Identities=12% Similarity=0.112 Sum_probs=313.5 Q ss_pred CCc-----eeeeccceeccc--ccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcC- Q lcl|NC_019510. 1 MGL-----VSQSIKNLKGGI--SQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRD- 72 (799) Q Consensus 1 M~~-----v~~s~~~~~gGV--Sqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~- 72 (799) |-. +++-..+=++|+ |.-+-.--| .+---..|+=++..|-+.||.|+..+-... ......+.+-+++..- T Consensus 1 mtqQQ~~eiqG~~t~~F~GL~~s~S~~~IP~-~~SP~~~N~DV~~~G~V~rR~GT~l~~~Y~-inn~s~~~~s~~irt~L 78 (1012) T protein:vir:94 1 MTQQQATEIQGPFTREFSGLDISNSVGAIPV-SGSPVFHNCDVSDDGAVVRRRGTALVNTYN-INNASGRAWSDTIRTKL 78 (1012) T ss_pred CCccccccccccccccccccccccccccccc-cCCCceEEeecccCcceeehhhhhhhhhhc-ccccCcceeeeeehhhc Confidence 442 333334445552 221111111 112346799999999999999999987643 2223344444444332 Q ss_pred CceEEEEEEeCCeEEEEeCCCeEEE----EEec-CccccccCCcceeEEEEE---cCEEEEEeCCeeeeeeccccCCCCc Q lcl|NC_019510. 73 EYEQYYVVFTGGDIKVFDLNGQEYA----VRGD-KSYVQTANPRNSIRCVTV---ADYTFVVNRERVVQADGNLTNGGTF 144 (799) Q Consensus 73 ~~~~y~l~~~~g~irv~~~~G~~~~----v~~~-~~y~~t~~~~~~l~~~q~---aD~~~l~~~~~~p~~l~~~~~~~~~ 144 (799) +-++|+|.-..|-+.+.-.+|+.+- +... ....++--| ++..|.-+ -|-+.|.-+++||-.+.-....-.+ T Consensus 79 G~eYfiLs~~~GLL~~~~~~~~AVG~~K~~a~V~~ss~~~V~P-ssm~F~~~S~~~~R~LILT~~~~~VQ~~F~E~T~s~ 157 (1012) T protein:vir:94 79 GSEYFILSNDVGLLISLMRDDEAVGMPKEVAVVSKSSIWTVPP-SSMCFIPVSAPYDRLLILTPEHPIVQLSFLERTLSF 157 (1012) T ss_pred cceeEEEecCCceEEEeeecccccccchhhhhhhhhhccccCC-cceEEEeccCCCCcEEEEcCCCceEEEEEeeeeeee Confidence 2334455544555544433333221 1000 011111111 23344433 3567777777777554322211111 Q ss_pred C-CCcce--E-------------EE--------EecCCCCceeeEEeccce-----EEEEEecCCccccceecccccccc Q lcl|NC_019510. 145 N-DQKDA--L-------------IN--------VRGGQYGRTLNVIFNEAT-----RATIKLPSGTGTTPPIEEQVAAVD 195 (799) Q Consensus 145 ~-~~~~~--~-------------~~--------v~~~~y~~ty~v~~~~~~-----~~~~~tp~gt~~~~~~~~~~a~~~ 195 (799) . ..+.+ . .| +....-+++|.+++.... +-.+..|. .-+....++. T Consensus 158 T~~t~~~~~V~~~~a~~~~~~~~L~~~~N~sS~~~~~~~~T~~AmT~~NP~~S~~ls~~~V~~q------tytltirqi~ 231 (1012) T protein:vir:94 158 TCTTNHGGGVFSFTAPISVNDTTLWRDTNASSYIVTDAAGTVYAMTQKNPDFSFRLSGSFVVGQ------TYTLTIRQIT 231 (1012) T ss_pred eccCCccceEeecccceeecCeeEEecccccceeEeeccceEEEEEeeCCceeEEEEEEEecCc------ccceeehhhh Confidence 0 00000 0 01 011112333333332100 00111110 0011112223 Q ss_pred cccchhhhhhccccccccccceEEE------------------------ECCcEEEEEec-CCCcee-------EEEE-- Q lcl|NC_019510. 196 AQHIAEELAKQIRESLAGNPGWTIN------------------------VGTGFVNIIAP-DGDSIR-------GLQT-- 241 (799) Q Consensus 196 ~~~i~~~l~~~~~~~~a~~~~~t~~------------------------~~g~~i~i~a~-~~~~~~-------~~~~-- 241 (799) .+|-++.++-....--.....+++. .-+=-++.++. +.+.++ +.+. T Consensus 232 W~WWAESm~~~G~~~~~~~SRFNV~~~DQ~V~IP~~L~tDiD~v~~~~~~~~l~~~~ss~F~~~~~~~~T~~P~~AD~YG 311 (1012) T protein:vir:94 232 WQWWAESMYYEGQDMMQNTSRFNVTSIDQNVKIPDRLITDIDPVYKNSQGLGLFVFWSSRFDSNGWAGPTTSPNTADEYG 311 (1012) T ss_pred hhhhhhhHhhhhhHHHhhhhhcccccccccccchhHHhhhhhhhhhccCCccEEEEEeeeecCceeecCCCCCCCccccc Confidence 3333322211100000000000000 00000111111 000000 0000 Q ss_pred -EeccCcceeeEEe-------------eeeccceeccccccC---CeEEEEEecCCCCcceeEEEEecCceE-------- Q lcl|NC_019510. 242 -KDGYADQLISPVT-------------HYAQTFAKLPQNAPD---GYTVKIVGDTSRSADKYYVRYNLTRKV-------- 296 (799) Q Consensus 242 -~~g~~~~~~~~~~-------------~~v~~~~~l~~~~~~---G~~v~i~~~~~~~~~~~y~~~~~~~~~-------- 296 (799) .+|+..+....+. +..-+..--|+...+ -...+.-...++..++.-+.-+..+-. T Consensus 312 ~~~G~~~tpp~~~~~A~L~~aPFF~TFG~~~s~TP~P~~~V~iLR~RELRFN~G~GA~~~~L~V~~D~~~~t~Nnvpfsp 391 (1012) T protein:vir:94 312 FSGGGRFTPPSLVPGATLQAAPFFITFGGIYSGTPTPINQVNILRLRELRFNGGTGAKPDDLQVYNDTVEHTWNNVPFSP 391 (1012) T ss_pred ccCCceeccccccccceeeccceEEEeccccCCCCCChhheeeeeeeeeeeccCCCCCCcceEEEEcceeeeccccccCc Confidence 1111111000000 000000000110000 001112222222223222222222222 Q ss_pred -----EEEecc-ccccce--------eeccc--------------eeeEEEeccCceEE-EeecccCccccCccccc--- Q lcl|NC_019510. 297 -----WEETVG-WNIQVG--------LNNGT--------------MPWSLIRAADGQFD-FVANSWVGRTAGDDDTN--- 344 (799) Q Consensus 297 -----w~e~~~-~~~~~~--------~~~~t--------------~p~~lv~~~~~t~~-~~~~~w~~~~~gd~~~n--- 344 (799) |..+.+ -+.... ++++. .|..+--.+..++. ....-|....-.|.-+- T Consensus 392 snfqt~atT~~~T~R~~~L~~A~G~~~~~A~Y~A~~GATnnlpanaPL~IS~~sA~s~~~~~R~v~~~~~~T~~~~~~G~ 471 (1012) T protein:vir:94 392 SNFQTWATTYTATDRVITLMSAVGDRFNNANYFAILGATNNLPANAPLHISCLSASSYLGGSRRVWYRNLPTTGGTLDGC 471 (1012) T ss_pred ccccceeeeeeecceeEEEeeeccccccCcceEEEeecccccccCCccccccccceeeeccceeeeeeccccCCceEeee Confidence 222211 011111 01111 11111000000000 00011222111110000 Q ss_pred -----cCccc----cCCCeeEEEEEcceEEEecC----CeEEEEccCC------ccccc-cccccCCCCCccEEEEEcCC Q lcl|NC_019510. 345 -----PHPSF----VGQAITDVFFYRNRLGMLSG----ENIILSRTAK------YFNMY-PASVAVLSDDDPIDVAVSHN 404 (799) Q Consensus 345 -----p~psf----~~~~p~~v~f~q~RL~f~~~----~~v~~Sr~gd------~~nF~-~~t~~~~~DdD~i~~~~~~~ 404 (799) ....+ .+.+|.--+.||.||++.++ ..+.+|.+|| |+||+ .+..+...|.||+++.+++. T Consensus 472 Y~r~YGiG~~~~Y~~~~F~~I~TiY~~RLiL~~~s~~~~~~~~S~~GD~~~~G~~Y~F~QiTD~L~G~~tDPF~L~VtSe 551 (1012) T protein:vir:94 472 YVRAYGIGKYVDYSKRSFHAIGTIYRDRLILVNPSTATDQLLISEIGDATVPGEFYQFMQITDMLQGVTTDPFTLNVTSE 551 (1012) T ss_pred EEEEEEeeeeeecCCccccceeeeeeeeeEEeccCCCcceEEEeecCCcccCceeeeeeeeehhhccCcCCceeEEEccc Confidence 00011 24568888999999999985 4589999877 89998 55556778999999999997 Q ss_pred cceeeeeeeecCCcEEEEecCcEEEEeCCccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeecc Q lcl|NC_019510. 405 RVSILKYAVPFSEELLLWADEAQFVLNASGVLSAKSVELNLTTEFDVNDGARPYGIGRGVYFASPRATFTSINRYYAVQD 484 (799) Q Consensus 405 ~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~ 484 (799) -.+.|.-++...+.|++||..+-|.+.|++.++|..-.+...|+|+--+.---|+..-.|+|..+.| ++..+ -.- T Consensus 552 ~~e~iT~~~~WQ~~LFV~T~~~T~~~~GGe~~~~s~~~VN~vSt~G~~N~~~VV~T~~~V~Ym~~~G----~F~L~-~k~ 626 (1012) T protein:vir:94 552 GRERITAVTGWQKRLFVFTGSNTYSIEGGEQFGESSYAVNLVSTYGAFNQNCVVVTNLTVLYMNKFG----LFDLM-NKP 626 (1012) T ss_pred ccceeeeeeeeceeEEEEeccceEeeccccccchhHHHHHhHHhhcccCcceEEEeeeEEEEeeccc----eeecc-CCc Confidence 7888999999999999999999999999999999999999999995544445577888999998877 44443 356 Q ss_pred ccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCcee-----------eEeeEeeecCC Q lcl|NC_019510. 485 VSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQ-----------QQSWSHWDFGD 553 (799) Q Consensus 485 ~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~-----------v~aW~~w~~~g 553 (799) +.|+|.+.|-|+-+..+|..- ...+-++...+....+.++||+- |..+.|.- -.+|+..+..| T Consensus 627 ~~~~Y~A~ErSvKIR~~F~~~----~~ss~~~~~Wl~~~e~~~~LYi~--L~~~~dT~~~S~~~~~N~~~DSWs~~~s~~ 700 (1012) T protein:vir:94 627 NTDSYGAFERSVKIRGLFQNL----AGSSGDNLHWLRYNESSNKLYIG--LAAEGDTRTTSRNLMLNFTWDSWSTLSSAA 700 (1012) T ss_pred cCCcchhhhhhhhhhhhhhhh----ccccccceeeeeeccCCceEEEE--ecCCCcchhhhhhhhhhhhhcchhhhhccC Confidence 789999999999999999642 22222332333334444555542 22222221 13788877777 Q ss_pred CeEEEEEE--EeCCEEEEEE-EeCCCEEEE--EEEEEeeccCCCccccceeeceeEEEEEcc-----cccccccccccc- Q lcl|NC_019510. 554 NVTVLAAN--SIGSHMHVIL-QNGYDIFMG--SISFTKKTLDFGNEPYRLYMDAKTRYDIPA-----NAFNNDRYETTV- 622 (799) Q Consensus 554 ~~~~~~~~--~~~d~l~~~v-~r~~~~~~~--~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~-----~~~~~~~~~~~~- 622 (799) .|+.-..+ +.++ -|++. .-....++- .|.++.+-......-| +|--|+-++.+.. |.+.-..+.+.. T Consensus 701 ~Fq~YP~V~~~~~~-t~L~~i~~~~TV~ML~~~~~~YiDFatirthiy-pF~~CaG~~~~~Vms~~~GIY~~~~P~tP~I 778 (1012) T protein:vir:94 701 PFQMYPAVQLFKYM-TWLTNINAPLTVAMLATEMPFYIDFATIRTHIY-PFTFCAGQRDVSVMSDSRGIYNLPLPVTPGI 778 (1012) T ss_pred Ccccchhhhhhhhh-hhhhhhcCchhhhhhhhccceeeeeehhccccc-ceeeeccceeeEEEecCCceEEeccccccee Confidence 76532211 0111 11111 111000000 0000000000000000 0111111111100 000000000000 Q ss_pred -----------------ccc-------cccCccccccc------------eEEEEecCCccccccccccceec------- Q lcl|NC_019510. 623 -----------------DLN-------AVFGGMRWQVG------------KILVSDEVGEVRQYEPPAGGWAS------- 659 (799) Q Consensus 623 -----------------~~~-------~~~~g~~~~~g------------~~v~~~adG~~~~~~~v~~g~~~------- 659 (799) .+. -.+..-+.-.| +++.+.-.....+..+..-.++. T Consensus 779 ~~~tit~ss~~~~k~Yq~~T~~~GT~tLt~~~~~~~~~~~l~LL~~~~~~~~~a~V~~~~~~~~TT~~TV~~N~~~~lQ~ 858 (1012) T protein:vir:94 779 LDYTITASSKAGAKTYQRNTASAGTETLTLRNPMMDYADTLELLGGNVNASQFAMVMSNGFEPYTTYPTVTYNGVAPLQW 858 (1012) T ss_pred eeeEeeccchhhhheeccccccccceeeeecChhhhcCcEEEEecCCCCccEEEEEeecccccccccceEEecceeeeeE Confidence 000 00000011112 22222211111111110000000 Q ss_pred ----CceEEEccCCC-CCeEEEeEeeeEEEEecCeeEEcCCCccceeeeecceE-EEEEEEEEeec--cceEEEEecCCC Q lcl|NC_019510. 660 ----DPTLRIVGDMA-GKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGRL-QHRRAWLNYEQ--SGAFYVDVTNLG 731 (799) Q Consensus 660 ----~~~~~i~~~~~-~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl-~l~~~~~~~~~--t~~~~~~v~~~~ 731 (799) +..++...-.+ ...+.+|.-|.|..+-+=+.+ + .. ||| +|.++.+-|.- +..++..+..+. T Consensus 859 T~~~GS~L~~~~~LsqN~~~~~G~~Y~S~Y~SP~F~L----~-SL------~~LKr~K~~~L~~Dttvtsqlkynltsgf 927 (1012) T protein:vir:94 859 TVTGGSGLNNRPILSQNNNCIMGMIYPSVYASPIFDL----E-SL------GRLKRLKKLHLQMDTTVTSQLKYNLTSGF 927 (1012) T ss_pred EEecCCccccccccccCceEEEeecchhhhcchhhhh----h-hh------hhhhheeeeeEEeeeeeeeeeeeehhccc Confidence 00000000001 345889999999887543322 1 11 233 35555554443 344544444332 Q ss_pred cccceecccc-----ccCcccccccc---------cc-cceEEEEEeecccceEEEEEECCCCcEEEEEEEEEEEEec-- Q lcl|NC_019510. 732 RSYRYTMSGK-----PLGDTTLGQAN---------LE-SGQFRFPLAGNAQYNRVVLTSDYTTPLSIIGCGWEGNYIR-- 794 (799) Q Consensus 732 ~~~~~~~~~~-----~~~~~~~~~~~---------~~-tg~~~vp~~~~~~~~~v~i~~~~P~P~tvl~i~~eg~y~~-- 794 (799) ......-..+ ...++.+...- +. --+..+|+.|..-+.++.|.+-.--.|.+-+.+++.+=-+ T Consensus 928 sqvsvlntawvavvsnynenivpavvsyqvgnsyeirrvvelsiplqgygcdyqfyiasvgaeafklaayefdiqpqrdk 1007 (1012) T protein:vir:94 928 SQVSVLNTAWVAVVSNYNENIVPAVVSYQVGNSYEIRRVVELSIPLQGYGCDYQFYIASVGAEAFKLAAYEFDIQPQRDK 1007 (1012) T ss_pred ceeeeecceeeeeeeccCccccceeeeeecCCceeeeEEEEEeecccccccceeEeeeeccccceeeeeeeeccccchhh Confidence 2211111111 11122211110 00 0124568888899999999999999999999988776332 Q ss_pred c-ccC Q lcl|NC_019510. 795 R-STG 798 (799) Q Consensus 795 r-~rr 798 (799) | .|| T Consensus 1008 ryvrr 1012 (1012) T protein:vir:94 1008 RYVRR 1012 (1012) T ss_pred hhccC Confidence 2 223 No 27 >protein:vir:80177 Length: 1027 # NCBI annotation: tail tubular protein B # Family: family:all:12083 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285795;genbank:gi:148747829;genbank:GeneID:5220453 Probab=99.12 E-value=5.7e-10 Score=71.26 Aligned_cols=743 Identities=14% Similarity=0.102 Sum_probs=274.6 Q ss_pred CC----c-ee-----eeccceeccc--ccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEE Q lcl|NC_019510. 1 MG----L-VS-----QSIKNLKGGI--SQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHL 68 (799) Q Consensus 1 M~----~-v~-----~s~~~~~gGV--Sqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~ 68 (799) |- | .+ +...+=++|+ |.-+=.--| .+---..|+=++..|-+.||.|+..+-+..+. ...+.++ T Consensus 1 mvnsferrtQQ~~dlG~~s~~F~GL~~t~S~~~IP~-~~SP~~~N~DV~~~G~V~kR~GT~i~~~Y~~t----~~~~t~~ 75 (1027) T protein:vir:80 1 MVNSFERRTQQGDDLGIRSSNFGGLNTTASPLNIPY-EDSPNLLNVDVDVSGNVSKRQGTEILLKYANT----TPVYTFP 75 (1027) T ss_pred CCcchhhhhccccccccccccccccccccccccccc-cCCCceEEeecccCcceeehhhhhhhhhhccC----Cceeeee Confidence 21 1 00 1112223331 111100001 11224678999999999999999999765433 2334444 Q ss_pred EEcCCceEEEEEEeCCeEEEEeCCCeEEE----EEec-CccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCC Q lcl|NC_019510. 69 INRDEYEQYYVVFTGGDIKVFDLNGQEYA----VRGD-KSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGT 143 (799) Q Consensus 69 ~~~~~~~~y~l~~~~g~irv~~~~G~~~~----v~~~-~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~ 143 (799) +..--.-.|+|.-..|-+.+.-.+|+.+- +... ....++--|.. +-.--.-|-+.|.-+++||-.+.-....-. T Consensus 76 vks~LG~dYvLt~~~GLL~~~~~~~~AVG~~K~~s~V~~aa~~~V~P~F-~~~S~~~~R~LILT~~~~~VQ~~F~E~T~t 154 (1027) T protein:vir:80 76 VKSVLGYDYVLTKSGGLLEVAGVIGKAVGAYKSFSNVFSAAAANVKPYF-TLLSDVEPRVLILTGTNTPVQVKFVEQTFT 154 (1027) T ss_pred ehhhccceeeEecCCceEEEeeecccccccchhhhhhhhhhhcccCcee-EEccCCCCcEEEEcCCCceEEEEEeeeeee Confidence 43322334655555555554433333221 0000 00011111110 001113456777777777654432221111 Q ss_pred cC-CCcceEEEEec-----------------------CCCCceeeEEecc------ceEEEEEecCCcccc---ceec-- Q lcl|NC_019510. 144 FN-DQKDALINVRG-----------------------GQYGRTLNVIFNE------ATRATIKLPSGTGTT---PPIE-- 188 (799) Q Consensus 144 ~~-~~~~~~~~v~~-----------------------~~y~~ty~v~~~~------~~~~~~~tp~gt~~~---~~~~-- 188 (799) +. -.+.+-..++. ..-+++|.+++.. .-+-.+..|..+... -+.+ T Consensus 155 ~T~~s~~~~~V~~~~s~~~~~~~~L~~~~N~tS~~~~~~~~T~~AlT~~NlP~~S~~mt~~~V~~~W~WWAESl~~~G~~ 234 (1027) T protein:vir:80 155 TTSGSPTTTVVIPNASRFQYDTPILYMNRNFTSGATYSYNSTTRALTISNLPSWSGSMTFDLVLPVWSWWAESLRWFGDR 234 (1027) T ss_pred eeccCCccceEeecccceeecCeeEEecccccceeEeeccceEEEEEeccCCcceeEEEEeEEecchhhhhhHHhhhhhH Confidence 10 00111111111 1112333333321 001111111111000 0000 Q ss_pred --cccccccc------ccchhhhhhccccccccccceEEEECCcEEEEEe---------cCCCceeEEEEEeccCcceee Q lcl|NC_019510. 189 --EQVAAVDA------QHIAEELAKQIRESLAGNPGWTINVGTGFVNIIA---------PDGDSIRGLQTKDGYADQLIS 251 (799) Q Consensus 189 --~~~a~~~~------~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a---------~~~~~~~~~~~~~g~~~~~~~ 251 (799) ++..-+.. -.|+..|...+..-.....++ +=-++.++ +++... +....+-+++.... T Consensus 235 ~~~~~SRFNV~~~DQ~V~IP~~L~sDlD~i~~~~~~~-----~m~~~~ta~F~~~~~~~~T~~P~-~AD~YG~~~G~~~~ 308 (1027) T protein:vir:80 235 FYDAVSRFNVNKADQSVAIPAALRSDLDTIQGTYGRY-----PMLLYKTATFNDTYTFSNTGQPA-NADSYGWGDGSVYN 308 (1027) T ss_pred HHhhhhhcccccccccccchhHHhhhhhhhhhccCCc-----cEEEEEeeeecCceeecCCCCCC-CcccccccCCceEe Confidence 00010100 011222221111000000000 00011111 111100 00000000010000 Q ss_pred EEe-------------eeeccceeccccccC---CeEEEEEecCCCCcceeEEEEecCceE--EEEecc-ccccce---- Q lcl|NC_019510. 252 PVT-------------HYAQTFAKLPQNAPD---GYTVKIVGDTSRSADKYYVRYNLTRKV--WEETVG-WNIQVG---- 308 (799) Q Consensus 252 ~~~-------------~~v~~~~~l~~~~~~---G~~v~i~~~~~~~~~~~y~~~~~~~~~--w~e~~~-~~~~~~---- 308 (799) ... +..-+..--|+...+ -...+.-...++..++.-+.-+..+-. |..+.+ -+.... T Consensus 309 ~~~~A~L~~sPFF~TFG~~~t~TP~P~~~V~lLR~RELRFN~G~GA~~~~L~V~~D~~~~s~N~ssT~~~T~R~~~L~~A 388 (1027) T protein:vir:80 309 VGASAYLNTSPFFATFGDTRTPTPQPPETVHLLRQRELRFNYGNGATGANLRVTVDGTALSANYSSTVAGTNRAYALYKA 388 (1027) T ss_pred ecccceeeccceEEEeccccCCCCCchhheeeeeeeeeeeccCCCCCCcceEEEEcceeeeeeeeeeeeecceeEEEeee Confidence 000 000000000111000 001111222222222222222222222 222111 011111 Q ss_pred ----------------eeccceeeEEEeccCceEEE--eecccCccccCccccc--------cCc----cccCCCeeEEE Q lcl|NC_019510. 309 ----------------LNNGTMPWSLIRAADGQFDF--VANSWVGRTAGDDDTN--------PHP----SFVGQAITDVF 358 (799) Q Consensus 309 ----------------~~~~t~p~~lv~~~~~t~~~--~~~~w~~~~~gd~~~n--------p~p----sf~~~~p~~v~ 358 (799) +..+| |..+--.+..++.= ...-|......|.-+- ... .-.+++|+--+ T Consensus 389 ~G~~~~~A~dlayY~A~~GAT-PL~IS~~aA~t~~~~~R~yi~~~~~~T~~~~~~G~Y~k~YGlG~~~~Y~~~~F~~I~T 467 (1027) T protein:vir:80 389 DGTLCTSASDLAYYIAFTGAT-PLGISPTAAVTITNVDRTYIGSAATQTDNAYVQGGYFKVYGLGLWANYGTGQFPRIAT 467 (1027) T ss_pred ccccccccccceeeeeeeccc-cccccccceeeeecCceeeeeeeccccCCceEeeeEEEEEEeeeeeecCCccccceee Confidence 11111 11110000000000 0001221111110000 000 11256788999 Q ss_pred EEcceEEEecC----CeEEEEccCC------ccccc-cccccCCCCCccEEEEEcCCc-ceeeeeeeecCCcEEEEecCc Q lcl|NC_019510. 359 FYRNRLGMLSG----ENIILSRTAK------YFNMY-PASVAVLSDDDPIDVAVSHNR-VSILKYAVPFSEELLLWADEA 426 (799) Q Consensus 359 f~q~RL~f~~~----~~v~~Sr~gd------~~nF~-~~t~~~~~DdD~i~~~~~~~~-~~~i~~~v~~~~~L~l~t~~~ 426 (799) .||.||++.++ ..+.+|.+|| ++||+ .+..+...|.||+++.++++| .+.|.-++...+.|++||..+ T Consensus 468 vY~~RLvL~~~t~~~~~~~~S~~GD~~~~G~~Y~F~QvTD~L~G~~sDPF~L~VsSsq~~d~vT~~~~WQ~~LFV~T~~~ 547 (1027) T protein:vir:80 468 VYQSRLVLGGFTNDPTRVVFSATGDTVEGGVKYNFFQVTDDLDGLDSDPFDLVVSSSQADDYVTGLVEWQSSLFVLTRRA 547 (1027) T ss_pred eeeeeeEEeccCCCcceEEEeecCCcccCceeeeeeeeehhhccCcCCceeEEEecccccceeeeeeeeceeEEEEecce Confidence 99999999985 4599999877 89998 555567789999999999876 556888889999999999999 Q ss_pred EEEEeCCcc-ccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccCceehhhHHHHHHHhcCCC Q lcl|NC_019510. 427 QFVLNASGV-LSAKSVELNLTTEFDVNDGARPYGIGRGVYFASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNG 505 (799) Q Consensus 427 e~~i~~~~~-lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~ 505 (799) -|.+.|++. ++|..-.+...|+|+--+.-.-|+..-.|+|.++.| ++..+ -.-+.++|.+.|-|+-+..+|..- T Consensus 548 T~~~~GGd~t~~~a~~~VN~iSs~G~~N~~~VV~T~~~V~Yl~~~G----~F~L~-~r~~~~~Y~A~EkSiKIR~~F~~~ 622 (1027) T protein:vir:80 548 TFRANGGDATISPARRFVNYISSLGLVNPFSVVRTDTAVFYLSDSG----VFNLT-PRVEDGEYQAIEKSIKIRKVFGKT 622 (1027) T ss_pred eEEeecCccccchhHHHHHHHHhhcccCcceEEEeeeEEEEeeccc----eeecc-CCccCCcchhhhhhhhhhhhhhhh Confidence 999999875 999999999999996545555577888999998877 44443 356789999999999999999642 Q ss_pred cEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceee-----------EeeEeeecCCCeEEEEEEE-----eCCEEEE Q lcl|NC_019510. 506 VFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQ-----------QSWSHWDFGDNVTVLAANS-----IGSHMHV 569 (799) Q Consensus 506 ~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v-----------~aW~~w~~~g~~~~~~~~~-----~~d~l~~ 569 (799) ...+-++...+....+.+.||+ =|..+.|.-+ .+|+.-++.|.|+.-..-+ .++ -|+ T Consensus 623 ----~~ta~~~~~Wm~~~q~~~~LYv--~L~~~~eT~~~S~~~~~N~~~DSWt~~~t~~~Fk~YtghP~V~~~~~~-s~L 695 (1027) T protein:vir:80 623 ----TSTAVSSAAWMSFDQNRKVLYV--ALPRGSETTVASALYVYNTFRDSWTQYDTLGGFKTYTGHPYVDTVLGD-SFL 695 (1027) T ss_pred ----ccccccceeeeeeccCCceEEE--EecCCCcchhhhhhhhhhhhhcchhhhhcccCcccccCCchhhhhhhh-hhh Confidence 1222222223333344445554 2222322222 3788888888876431111 122 233 Q ss_pred E-EEeCCCEEEEEEEEEeeccCCCccccc-eeeceeEEEEEccccccccccccc-------cccccccCccccccceEEE Q lcl|NC_019510. 570 I-LQNGYDIFMGSISFTKKTLDFGNEPYR-LYMDAKTRYDIPANAFNNDRYETT-------VDLNAVFGGMRWQVGKILV 640 (799) Q Consensus 570 ~-v~r~~~~~~~~~~~~~~~~~~~~~~~~-~~lD~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~g~~~~~g~~v~ 640 (799) + |.-. +.++.+..+... |- +|--| -++.+....-..+.+..+ .-.....++...+.-+... T Consensus 696 ~~v~~~--~TV~ML~~~~~~-------YvDFF~~C-G~~~~~Vlt~~~GIY~~~~P~wnsP~I~~~svs~tt~~~~q~Ye 765 (1027) T protein:vir:80 696 LMVAYG--GTVCMLKLYGSR-------YVDFFNKC-GSFTGNVLTANSGIYTWTAPFWNSPVISNISVSGTTTLAVQRYE 765 (1027) T ss_pred hhhcCc--hhhhhhhhhcch-------hhhhhhhc-ccceeeEEecCCceeEeecccccCCeeeEEEeeccchhhhheec Confidence 2 2222 221211111100 00 11112 122211111111111111 1111122233333334444 Q ss_pred EecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccceeeeecceEEEEEEEEEeecc Q lcl|NC_019510. 641 SDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGRLQHRRAWLNYEQS 720 (799) Q Consensus 641 ~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl~l~~~~~~~~~t 720 (799) ...|-.++|..-+.+-.|-.+ +..+..|-+..-+-+. +++-..++.+. ... ++-|+-+++..- T Consensus 766 ~~T~~~vvpydnvedlsiyvn---------GT~Ls~~~~~~~~~~~--i~LL~~~~~~~----~~s--~Vprcpvnvsy~ 828 (1027) T protein:vir:80 766 LPTDLQVVPYDNVEDLSIYVN---------GTRLSFGTDWVKQGKA--IYLLSDPGDGK----TVS--IVPRCPVNVSYQ 828 (1027) T ss_pred cccccccccccccccceeeec---------ceeEeecCchhhcCCE--EEEecCCCCcc----eEE--EEeccccccccc Confidence 444444444444443222110 1111222111111111 11111111111 111 233433333221 Q ss_pred c--------eEEEEecCCCc--ccceeccccccCcccccccccccceEEE----------EEe--------ecccceEEE Q lcl|NC_019510. 721 G--------AFYVDVTNLGR--SYRYTMSGKPLGDTTLGQANLESGQFRF----------PLA--------GNAQYNRVV 772 (799) Q Consensus 721 ~--------~~~~~v~~~~~--~~~~~~~~~~~~~~~~~~~~~~tg~~~v----------p~~--------~~~~~~~v~ 772 (799) + .-.+.++.... -.++...|.-+. ..+-|....+++ |+. ++-+..-+. T Consensus 829 ~~~~~~~TT~~TV~~N~~~~iQ~Tdy~~~GS~L~----~~~~LtN~~~~~G~~Y~S~Y~SP~F~L~SL~~LKk~K~~~L~ 904 (1027) T protein:vir:80 829 GDVTFDETTAQTVWVNNLLQIQGTDYTLSGSTLT----FTDTLTNAVVEVGNAYISYYQSPMFLLGSLSNLKKVKHVYLY 904 (1027) T ss_pred ccccccccccceEEecceeeeccceeeeccCccc----cccccccceEEEeecchhhhcchhhhhhhhhhhhheeeeEEE Confidence 1 11122211100 000111111111 111111112111 110 011112223 Q ss_pred EEECCCCcEEEEEEEEEEEEec-----cccCC Q lcl|NC_019510. 773 LTSDYTTPLSIIGCGWEGNYIR-----RSTGI 799 (799) Q Consensus 773 i~~~~P~P~tvl~i~~eg~y~~-----r~rrv 799 (799) .-..+-||.--++=--.|+=-. -..|- T Consensus 905 ~Dnedvlpvytigdlasgqdvddlvgkwktra 936 (1027) T protein:vir:80 905 FDNEDVLPVYTIGDLASGQDVDDLVGKWKTRA 936 (1027) T ss_pred EcCCcceeeeeeccccCCCchhHhhhhhcccc Confidence 3333334432221111111000 00000 No 28 >protein:vir:2625 Length: 715 # NCBI annotation: gp27 # Family: family:all:5234 # MgeID: mge:55 # MgeName: SIO1 # Cross-refs: genbank:acc:NP_064766;genbank:gi:9964636;genbank:GeneID:1263056 Probab=99.06 E-value=3.5e-09 Score=66.91 Aligned_cols=621 Identities=13% Similarity=0.141 Sum_probs=275.4 Q ss_pred CCce--eeeccceecc-cccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhh-----hhcCCCCcccceEEEEEEc- Q lcl|NC_019510. 1 MGLV--SQSIKNLKGG-ISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNR-----KLGAAGFLGAAPLVHLINR- 71 (799) Q Consensus 1 M~~v--~~s~~~~~gG-VSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~-----~~~~~~~~~~~~~l~~~~~- 71 (799) ||+- +-....|++| |-+--.+--=++..-.-+||.....|--+||-|+-+-. .+... ..+....+.+.- T Consensus 1 m~~~~~~~~vNtFv~GliTEas~ltfpqnasiDe~N~~l~rdG~r~RR~g~~~E~~~vls~~~vp--~galv~~~~W~na 78 (715) T protein:vir:26 1 MPQSLTQRTVNTFIKGLITEASELTFPENASVDELNCSLGRDGTRRRRKAVTLEDNHVLSDVVVP--EGALVQTLDWYNV 78 (715) T ss_pred CCcccchhHHhhhhhheeeccccccCCccceeeeeeeeecCCCcchhhccceeecceEEEEEeec--Cceeeeeechhhc Confidence 9963 3356778999 44333333334555567899999999999998876531 11111 111112222221 Q ss_pred -CCceEEEEEEeC-CeEEEEeCCCe-------EEEEEecCccc-cccCCcc-eeEEEEEcCEEEEEeCCeeeeeeccccC Q lcl|NC_019510. 72 -DEYEQYYVVFTG-GDIKVFDLNGQ-------EYAVRGDKSYV-QTANPRN-SIRCVTVADYTFVVNRERVVQADGNLTN 140 (799) Q Consensus 72 -~~~~~y~l~~~~-g~irv~~~~G~-------~~~v~~~~~y~-~t~~~~~-~l~~~q~aD~~~l~~~~~~p~~l~~~~~ 140 (799) ++-..-||++.- ..+.+|.+.+- ..++....-+. ....|.+ .++++....++.|+||..-|.-+.--.. T Consensus 79 ~G~v~~~~livqvg~~l~f~q~t~~pLs~~n~~~svdl~~~~~~vn~SPsh~~v~v~~~~G~livanp~i~~~~~~~d~~ 158 (715) T protein:vir:26 79 AGQVNLEFLVVQVNNILYFYEKSTDPLSANKYSGSVDLNTHSASNNLSPSEERVQVTSLNGYLIVASPAINTFYLGFNTS 158 (715) T ss_pred ccccCcEEEEEEeccEEEEEeccCCccccCceeEEeeecceecccccccceeEEEEEEeeeEEEEecCCccEEEEEecCC Confidence 222335555554 44677766552 22333322221 1123444 6888889999999999988766542221 Q ss_pred CCCcCCCcceEEEEecCCCCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEE Q lcl|NC_019510. 141 GGTFNDQKDALINVRGGQYGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTIN 220 (799) Q Consensus 141 ~~~~~~~~~~~~~v~~~~y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~ 220 (799) +-.+.+. . +-++.-++.++ |... . ++.. -.+.. ...+.+++- .++++ .+.+ +--+.. T Consensus 159 t~s~t~~--~-ll~r~r~f~~q------g~d~---~----~g~~-y~~~g-t~~tn~~iy-nlyN~---gw~~-p~gt~~ 215 (715) T protein:vir:26 159 TEAFTAT--S-ISFKERDFEWQ------GSDV---D----VTSL-YFGEG-TSVSNQRIY-DTYNV---GWVG-PKGSAA 215 (715) T ss_pred cceeEee--E-EEEEeeeheee------cccc---c----cccc-cccCC-cccCchhhe-ecccc---eeec-ceeEEE Confidence 1111100 0 11111112111 1100 0 0000 00000 000111111 12221 1111 000111 Q ss_pred ECCcEEEEEecCCCceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEe Q lcl|NC_019510. 221 VGTGFVNIIAPDGDSIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEET 300 (799) Q Consensus 221 ~~g~~i~i~a~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~ 300 (799) ...-.-||.-|..... .- +++.+... +..-.|.|. T Consensus 216 ~N~~~~yiVypa~s~~-~~---------------------------------------S~kd~n~a-----fsk~ad~ei 250 (715) T protein:vir:26 216 LNTYGSYIVYPALTHP-WY---------------------------------------SGKDANGA-----FNKADWLEI 250 (715) T ss_pred EcCCCCceEecccccc-cC---------------------------------------CCcccccc-----cChhhcccc Confidence 1100001111100000 00 00000000 011112221 Q ss_pred ccccccceeeccceeeEEEeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEec------CCeEEE Q lcl|NC_019510. 301 VGWNIQVGLNNGTMPWSLIRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLS------GENIIL 374 (799) Q Consensus 301 ~~~~~~~~~~~~t~p~~lv~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~------~~~v~~ 374 (799) -- +-.- .+.|-|...... +.+.--.++- ..+.|++++.|.+|.+|++ +..|.+ T Consensus 251 ~t-----Gt~~---------~~~G~yi~D~~~-~g~~~leeev------~k~R~rsv~~yaGrV~yagiD~dkng~rilf 309 (715) T protein:vir:26 251 YT-----GSSL---------ASNGHYVLDVFN-KARTGLTTEV------ETGRFRSVAAYAGRVFYAGIDSAKNGGKVYF 309 (715) T ss_pred cc-----cccc---------ccCceEEEeeee-cCCccchhhh------hcCCCcceeeecceEEEeecccccCCCeEEE Confidence 00 0000 001111110000 0000000000 0245778999999999995 447999 Q ss_pred EccC--------Ccccccccccc--CCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCC-ccccccceEE Q lcl|NC_019510. 375 SRTA--------KYFNMYPASVA--VLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNAS-GVLSAKSVEL 443 (799) Q Consensus 375 Sr~g--------d~~nF~~~t~~--~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~-~~lTP~~~~~ 443 (799) ||.= .|.+=++++.. .+.|.|...+.+.+. -.|.-|+.+++.|+||...+.|+|.|. ...|.++..+ T Consensus 310 SqLv~s~~di~nCyQd~DPTsee~~dLidTDGg~iri~ga--h~ii~Lv~f~~sLlvf~~NGVWAi~G~d~g~tATdY~l 387 (715) T protein:vir:26 310 SRLTERMSDVGNCYQVNDPTSEVLSDLLDTDGGVVRIPDA--HNIRKLHVLGASLLVFAENGVWAVAGVDNVFRATEYAI 387 (715) T ss_pred ehhhcchhhcccccccCCCchhhhhhhhhcCCCEEEecCC--CCceeEEEecceEEEEEecceEEEeccCCceeeeeeEE Confidence 9843 25555555433 466889988888763 455668999999999999999999775 4899999999 Q ss_pred EEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccCceehhhHH-HHHHHhcCC---CcEE--EEEcCCCCc Q lcl|NC_019510. 444 NLTTEFDVNDGARPYGIGRGVYFASPRATFTSINRYYAVQDVSAVKNAEDMT-MHVPSYIPN---GVFS--ISGSSTENF 517 (799) Q Consensus 444 ~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dls-~~~~h~~~g---~~~~--~~a~~~~~~ 517 (799) .+++..+|++.=.=|++|+.++|-.++| |+...+ +..-.-+.++.|| ..++.|.+. ..+. ...|-+... T Consensus 388 tKIs~vg~sspnSvVvv~~~i~~WsdtG----Iyal~~-Nd~fn~~tAqNLTekTIq~~~~~I~~dk~knVtg~fd~~e~ 462 (715) T protein:vir:26 388 TRISDVGLSNENSFVVADGIPIWWGKTG----IYAVQQ-SENLNTPTAQNLSLSTIQTLWNNISNAKKAQVTVEYDKINQ 462 (715) T ss_pred EEeeeeccCCCccEEEecceEEEeeCCc----EEEEEe-ccccCcchhhccchHHHHHHHhhcchhhhcceEEEEEccCC Confidence 9999999998888899999999999987 444433 2223458899999 777777542 1111 123445556 Q ss_pred EEEEEEcCCCeEEEEEeeeCCC----ceeeEeeEeeec---CCCeE-EEEEEE--------eCCEE-------------- Q lcl|NC_019510. 518 ATVLTSGAKGKVFIYKFLYIDE----QIQQQSWSHWDF---GDNVT-VLAANS--------IGSHM-------------- 567 (799) Q Consensus 518 ~~v~~~~~dg~l~~~tyl~~~~----e~~v~aW~~w~~---~g~~~-~~~~~~--------~~d~l-------------- 567 (799) -+.|+.-+..++.-|+| ++ +-...|+-+|.. .|..- .+.... .+.++ T Consensus 463 rVyW~yPn~dt~vdyky---d~vLV~dLalgaFYp~~v~~~a~~~~~~ig~~~~~~~~~~~t~~~vv~~~~v~~~g~~~~ 539 (715) T protein:vir:26 463 RVFWFYPDNDESVDYKY---NNILVMDLALQAFYPWRVEDEASSTSYIIGTSYYGGLGSTSTETQVVNGADVVVNGSDNV 539 (715) T ss_pred EEEEEEcCCceeeceee---cCeEEEEecccccccccccccccccceeeeeeeeCCcccccchhheeccceEEEeccceE Confidence 77888866666665555 11 111124555543 22211 111110 01111 Q ss_pred EEEEEeCC-----C-------EEEEEEEEEeeccCCCccccceeeceeEEEEEccccccccccccccccccccCcccccc Q lcl|NC_019510. 568 HVILQNGY-----D-------IFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVDLNAVFGGMRWQV 635 (799) Q Consensus 568 ~~~v~r~~-----~-------~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 635 (799) ..+.-|.- . +...+|.+ .. ..+.-+||... .+. ++....|+ T Consensus 540 v~~~~r~~~~~~~~~~~~~~~~~~~~~~f--~~-----~~~~~~~dw~s--------~d~--------~~~~~~gy---- 592 (715) T protein:vir:26 540 VATLYRDYLEGDSEIKLLVRDGTTGKMTF--AT-----FRGDTYLDWGS--------ADY--------KSFAEAGY---- 592 (715) T ss_pred EEEeecccccccceEEEEEEcCCceeEEE--ec-----ccCceeeeccc--------cch--------hhHHHhhh---- Confidence 11111110 0 00001110 00 00111222221 000 00000000 Q ss_pred ceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEE------EEecCeeEEcCCCccceeeeecceEE Q lcl|NC_019510. 636 GKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFR------YEFSKFLIKKQDESGGFSTEDVGRLQ 709 (799) Q Consensus 636 g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~------~~~~~~~~~~~~g~~~~~~~~~~rl~ 709 (799) .++|.+.- ++ ..|.-+.|.-..-+.. .+|..+ . +-|+ T Consensus 593 -----------------~~~gd~~~--~k---~~pyvt~~~~~tedg~v~~~~g~~p~n~------s---------Sclm 635 (715) T protein:vir:26 593 -----------------DFMGDITT--FK---NAPYVTTYMRVTEDGYVASGAGYEFINP------S---------SCLM 635 (715) T ss_pred -----------------hhccccee--ee---cCceEEEEEEEecccceeccCCccccCC------c---------ceEE Confidence 01111100 00 0011111110000000 011111 0 1111 Q ss_pred EEEEEEEeeccceEEEEecCCCcccceecccccc--CcccccccccccceE-EEEEeecccceEEEEEECCCCcEEEEEE Q lcl|NC_019510. 710 HRRAWLNYEQSGAFYVDVTNLGRSYRYTMSGKPL--GDTTLGQANLESGQF-RFPLAGNAQYNRVVLTSDYTTPLSIIGC 786 (799) Q Consensus 710 l~~~~~~~~~t~~~~~~v~~~~~~~~~~~~~~~~--~~~~~~~~~~~tg~~-~vp~~~~~~~~~v~i~~~~P~P~tvl~i 786 (799) .+.++..+++. .....|++ ..+++ +.+....--+.+..+ ...++|..+..+++|++..--.|+|++. T Consensus 636 --~~sw~ws~s~s------t~~eaYk~--~~~~~~~p~~~s~~~yp~~~VvTKsriRG~Gr~~~~rf~s~~gKdlhl~Gy 705 (715) T protein:vir:26 636 --SVSWNLSKSGS------TPREIYKL--KDVPVVNPNDLSSINYPTDTVVTKSKVRGRGRSMKFRFESVAGKDFHLVGY 705 (715) T ss_pred --EEEeeeccCCC------Chhhhhee--cceeeeCCCccccccCCcceeEeeeeeeccceEEEEEEEecCCcceEEEeE Confidence 11222222211 00111111 11111 111111000111111 2457888999999999999999999999 Q ss_pred EEEEEEeccc Q lcl|NC_019510. 787 GWEGNYIRRS 796 (799) Q Consensus 787 ~~eg~y~~r~ 796 (799) +.-|--|+.+ T Consensus 706 silg~~~~~~ 715 (715) T protein:vir:26 706 EVIGAKNNSY 715 (715) T ss_pred EEEecccCCC Confidence 9999999988 No 29 >protein:vir:95475 Length: 771 # NCBI annotation: hypothetical protein ORF038 # Family: family:all:5234 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294631;genbank:gi:149408197;genbank:GeneID:5237042 Probab=98.20 E-value=2.7e-06 Score=51.14 Aligned_cols=654 Identities=14% Similarity=0.098 Sum_probs=271.1 Q ss_pred CCce--eeeccceecc-cccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhh----hcCCCCc--ccc-eEEEEEE Q lcl|NC_019510. 1 MGLV--SQSIKNLKGG-ISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRK----LGAAGFL--GAA-PLVHLIN 70 (799) Q Consensus 1 M~~v--~~s~~~~~gG-VSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~----~~~~~~~--~~~-~~l~~~~ 70 (799) |++- +-....|++| |-+--.+--=++..-.-+||.....|--+||-|+-|-.. |.+...+ .+. ...+.+. T Consensus 1 m~~~~~~~~vNtFv~GliTEas~ltfpqnasiDe~N~~l~rdG~r~RR~g~~~E~~~~~vls~~~vpa~g~~~v~~~~W~ 80 (771) T protein:vir:95 1 MAKTTNAAEFNTFVGGLITEASPLTFPQNASIDEVNFILNRDGSRNRRNGMDFENGATKVVCNTLVPADGTIAVTSHNWE 80 (771) T ss_pred CCcccchhHHhhhhhheeeccccccCCccceeeeeeeeecCCCcchhhceeeeecCCceEEEEEEecccceEEeeeechh Confidence 9973 3346778999 444333333345555678999999999999998765320 0000000 010 1112222 Q ss_pred c--CCceEEEEEEeCC-eEEEEeCCCeEEEEEecCccccc---cCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCc Q lcl|NC_019510. 71 R--DEYEQYYVVFTGG-DIKVFDLNGQEYAVRGDKSYVQT---ANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTF 144 (799) Q Consensus 71 ~--~~~~~y~l~~~~g-~irv~~~~G~~~~v~~~~~y~~t---~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~ 144 (799) - ++-..-||++.-| .+.+|.+.+-.+. -..-|.++ ..|.+-|+++.+..++.|+||..-|.-+.--..+-++ T Consensus 81 na~G~v~~~~livqvg~~l~f~q~t~~pLs--~~n~~~~a~~nlSPsh~isv~v~~G~livanp~i~~~~~~~d~~t~s~ 158 (771) T protein:vir:95 81 NAGGEVGRWISLVQVGTELKFFQTTGETLS--EGNFYNYQFVNMSPSHKLSYAVVDGLLVVANGSRDIYVFEYDSGSVSV 158 (771) T ss_pred hcccccCcEEEEEEeccEEEEEecCCCccc--ccceeeeecceeccceeEEEEEeeeEEEEecCCccEEEEEecCCccee Confidence 1 2223345555544 4667766653332 11112222 2455568888888999999998877655422211111 Q ss_pred CCCcceEEEEecCCCCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCc Q lcl|NC_019510. 145 NDQKDALINVRGGQYGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTG 224 (799) Q Consensus 145 ~~~~~~~~~v~~~~y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~ 224 (799) .+. .-++..+ +..+. ..+|.. -..++..+.-........+.+.- +.+|...... T Consensus 159 t~~-~ll~r~r---f~~q~--~~~G~d-------------~~~~~~~~~~gt~~tn~~iynly------N~gw~~pk~~- 212 (771) T protein:vir:95 159 TTK-RLLVRDL---FGVQD--IVNGVD-------------LRQGNDIATRPTVQTNAHIYNLR------NQTFGVPRVT- 212 (771) T ss_pred Eee-eeeeeeh---hhccc--cccccc-------------eecccccccCCcccCchhheecc------ccceeccccc- Confidence 100 0011111 11110 001110 00000000000000011111110 0122111000 Q ss_pred EEEEEecCCCceeEEEEEeccCcce-eeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEeccc Q lcl|NC_019510. 225 FVNIIAPDGDSIRGLQTKDGYADQL-ISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGW 303 (799) Q Consensus 225 ~i~i~a~~~~~~~~~~~~~g~~~~~-~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~ 303 (799) +. .++. -..++..-......| .+ .|.|-.- ...-.|+|...- T Consensus 213 -----~~--------------snt~~~~iV~~y~a~~g~~p----S~------------sd~~N~a--~~k~~~~Ei~t~ 255 (771) T protein:vir:95 213 -----WH--------------SNEPSDPIVTFRSAASGKFP----SN------------SDSVNLA--LSKRADVEPSTT 255 (771) T ss_pred -----cc--------------cCCccccceEeeeccCCCCc----CC------------ceeeccc--cchhhccceeee Confidence 00 0000 000000000000000 00 0000000 001112332221 Q ss_pred cccceeeccceeeEEEeccCceEEEeecccCcccc--CccccccCcccc--------C---CCeeEEEEEcceEEEecC- Q lcl|NC_019510. 304 NIQVGLNNGTMPWSLIRAADGQFDFVANSWVGRTA--GDDDTNPHPSFV--------G---QAITDVFFYRNRLGMLSG- 369 (799) Q Consensus 304 ~~~~~~~~~t~p~~lv~~~~~t~~~~~~~w~~~~~--gd~~~np~psf~--------~---~~p~~v~f~q~RL~f~~~- 369 (799) +..-..+..-.|..-...+.|-|...+..-....- --+.+.|+||-. . +.-+.|+=|-.|.|++++ T Consensus 256 ~~f~~~~~~~~~~Gt~~~~~G~yi~da~~~g~~~Lt~~ve~~gr~~s~~~~~~~l~~~~t~~~~~~vaeyagRvwYag~~ 335 (771) T protein:vir:95 256 DRFRAEDIVLNPIGTYETARGFFIIDAMARGKSRLEEIVKLKQRYPSLSFGVSSLPQDETPGGASVVCEYAGRVWYAGFS 335 (771) T ss_pred cccchhhhhhcccCcccccCcceeeehhhhcccccceeeeccccchhhhccccccccccCCCCceeEEeeeeeEEEecce Confidence 11000000000000000011111111100000000 001122222211 1 234679999999999871 Q ss_pred --------------CeEEEEccC--------Ccccccccccc--CCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecC Q lcl|NC_019510. 370 --------------ENIILSRTA--------KYFNMYPASVA--VLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADE 425 (799) Q Consensus 370 --------------~~v~~Sr~g--------d~~nF~~~t~~--~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~ 425 (799) ..|.+||.= .|.+=++++.. .+.|.|...+.+.+. -.|.-|+.+++.|++|... T Consensus 336 ~~~iD~dkng~~~~~~ilfSqLv~s~~di~nCyQd~DPTsee~~dLidTDGg~iri~ga--h~ii~Lv~f~~sLlvfc~N 413 (771) T protein:vir:95 336 GQIIDGDDQSPRLVSYILFSQLVDSPADIVNCYQDGDPTSTEEPELVDTDGGFIRIEGA--HDIINLVNVGSAVMVVAAN 413 (771) T ss_pred eEEeeccccCCceeeeEeeehhhcchhhcccccccCCCchhhhhhhhhcCCCEEEecCC--CCceeEEEecceEEEEEec Confidence 138999842 25555555433 467889988888773 4556689999999999999 Q ss_pred cEEEEeC-C-ccccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccCceehhhHH-HHHHHhc Q lcl|NC_019510. 426 AQFVLNA-S-GVLSAKSVELNLTTEFDVNDGARPYGIGRGVYFASPRATFTSINRYYAVQDVSAVKNAEDMT-MHVPSYI 502 (799) Q Consensus 426 ~e~~i~~-~-~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dls-~~~~h~~ 502 (799) +.|+|.| + ...|.++..+.+++..+|++.=.=|++|+.++|-.++| |+...+-+ -.-+.++.|| ..++.|. T Consensus 414 GVWAi~ggsd~g~tAtdY~ltKIs~vg~sspnSvVvvg~~i~ywsdtg----Iyal~~Nd--fn~~tAqnLTekTIq~~~ 487 (771) T protein:vir:95 414 GIWMIQGGSDYGFTATNYLVTKISEHGCSSPNSVVVVDNSFMYWGDDG----IYHLTRNQ--YGDYVANNLTEKTIQKYY 487 (771) T ss_pred ceEEEEeccCCceeeeeeEEEEeeeeccCCCccEEEecceEEEeeCCc----eEEEeecc--cCcchhhccchHHHHHHH Confidence 9999965 3 48999999999999999998888899999999999987 44444333 3458899999 7888875 Q ss_pred CCCc---EE--EEEcCCCCcEEEEEEcCC--C--e-E--EEEEeeeCCCceeeEeeEee---e-cCCCeE-EEEEE---- Q lcl|NC_019510. 503 PNGV---FS--ISGSSTENFATVLTSGAK--G--K-V--FIYKFLYIDEQIQQQSWSHW---D-FGDNVT-VLAAN---- 561 (799) Q Consensus 503 ~g~~---~~--~~a~~~~~~~~v~~~~~d--g--~-l--~~~tyl~~~~e~~v~aW~~w---~-~~g~~~-~~~~~---- 561 (799) +.=. +. ...|-+...-+.|+..+. + + + +++. -...|+-+| + .+|..- .+... T Consensus 488 ~~I~~dk~knVtg~fd~~e~rvyw~yPn~~D~~~e~~t~LV~d-------LalgaFYp~~i~~~~ag~l~~~vg~~~~p~ 560 (771) T protein:vir:95 488 EKIPSDAILNATGFYDSYDKKVKWLYNTVLDGRTEPVTELVFD-------LALGAFYPSKIGSLTAGRLPIPVGSVKIPP 560 (771) T ss_pred hhcchhhhcceEEEEEccCCEEEEEecceecCCCcceeeeeee-------ecccccccccccccccCccceeeeeeecCc Confidence 4311 11 112333344455554311 0 0 1 1211 112366667 3 223220 00000 Q ss_pred ----EeCCEE-----------------------------EEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEE Q lcl|NC_019510. 562 ----SIGSHM-----------------------------HVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDI 608 (799) Q Consensus 562 ----~~~d~l-----------------------------~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~ 608 (799) ..+.++ |+++.|.+ ...+|.+ . ...+.-|+|.+. + T Consensus 561 ~~lv~T~~eV~v~~~~v~~tG~~vtV~~~~r~~~~~~~~y~~~~~dg--~~g~~~F--a-----~~~~~~f~DW~s---v 628 (771) T protein:vir:95 561 YKLVETGEEVTVASEQVTATGELVTVKVSTRSPVIRETKYIIVEKLS--SPMRISF--G-----GYTDEEFVDWKS---V 628 (771) T ss_pred cccccccceEEecceeeEecCCceEEEEEEeeccccceEEEEEEecC--CCeeEEe--c-----cccCcceeeccc---C Confidence 001122 11111110 0011111 0 000111222221 0 Q ss_pred ccccccccccccccccccccCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeE--EEEe Q lcl|NC_019510. 609 PANAFNNDRYETTVDLNAVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEF--RYEF 686 (799) Q Consensus 609 ~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~--~~~~ 686 (799) ++. +...++....|+ ..+. +.-+.+..|.++.--. +-.=|+--.. +.+| T Consensus 629 ~~~--------~vdy~sy~~~gY---------------~~~g--d~~~~k~~PYit~y~~----~tedg~v~~~~g~~~p 679 (771) T protein:vir:95 629 DGI--------GVDAPAYLLTGY---------------LAGG--DYQREKFVPYITFHFK----KTEDGFVEDAEGDWTP 679 (771) T ss_pred CCc--------ccchHHHHHhhh---------------hccc--hheeeeccceEEEEEE----eecccceecccccccc Confidence 000 000000000000 0000 0011122221110000 0000000000 0111 Q ss_pred cCeeEEcCCCccceeeeecceEEEEEEEEEeeccceEEEEecCCCcc-cceeccccccCcccccccccccceE--EEEEe Q lcl|NC_019510. 687 SKFLIKKQDESGGFSTEDVGRLQHRRAWLNYEQSGAFYVDVTNLGRS-YRYTMSGKPLGDTTLGQANLESGQF--RFPLA 763 (799) Q Consensus 687 ~~~~~~~~~g~~~~~~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~-~~~~~~~~~~~~~~~~~~~~~tg~~--~vp~~ 763 (799) ..+ . +-|+ .+.++...++.- .+-++. ..+-+...++.++.....-+.+..+ ...++ T Consensus 680 ~n~------s---------Sclm--~~sw~ws~s~~t----~k~~~~~eaYk~~~~~~p~~~~~~~yp~~~VV~TKsriR 738 (771) T protein:vir:95 680 TNQ------S---------SCMV--QSQWSWTNSPAS----NKWGRTWQAYRFRRHFFPDNIDNQFDDGNSVVETKSRLR 738 (771) T ss_pred cCC------c---------ceEE--EEEeeeecCCCC----CccccchheeeecceeccCCcchhcCCccceeeeeheee Confidence 111 0 1111 112222222110 011111 1111222233332222222233333 23568 Q ss_pred ecccceEEEEEECCCCcEEEEEEEEEEEEeccc Q lcl|NC_019510. 764 GNAQYNRVVLTSDYTTPLSIIGCGWEGNYIRRS 796 (799) Q Consensus 764 ~~~~~~~v~i~~~~P~P~tvl~i~~eg~y~~r~ 796 (799) |..+-.+++|.+..--.|+|++.++--..|-.. T Consensus 739 G~Gr~~~~rf~s~~gKdlhl~Gysil~~~~~~~ 771 (771) T protein:vir:95 739 GSGKVLSLYITTEPKKNLHIYGWSMLVDVNGTV 771 (771) T ss_pred ecceEEEEEEEecCCcceEEEeEEEEEeecCcC Confidence 889999999999999999999999888887776 No 30 >protein:vir:8837 Length: 513 # NCBI annotation: constituent protein # Family: family:all:4957 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775245;genbank:gi:27476043;genbank:GeneID:2700591 Probab=97.64 E-value=3.4e-05 Score=45.09 Aligned_cols=483 Identities=12% Similarity=0.042 Sum_probs=220.1 Q ss_pred CCceeeeccceecccccCCcHHHhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCceEEEEE Q lcl|NC_019510. 1 MGLVSQSIKNLKGGISQQPNILRFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEYEQYYVV 80 (799) Q Consensus 1 M~~v~~s~~~~~gGVSqq~D~~ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~~~y~l~ 80 (799) ||...+..-.-.|.|--+.+.+.=.++-..+.|+++. .|++.||||-.=+++- ......-+++|. .....+.|. T Consensus 1 ~~~~~~~~~~~~g~~~d~~p~~lp~~a~s~~~N~~~~-~~~~~~~~g~~pv~a~----~~~~~~g~~~~~-~~g~~~~~~ 74 (513) T protein:vir:88 1 MALERQEVKNPTGIVTDIAPADLPLDKWSFGNNVRFK-NGKAQKALGHSPIFDT----AQAPILDMFPFI-RNNIPYWLL 74 (513) T ss_pred CCcCChhhcccccceeccChhhcCCCcceeeeeeeEe-cceeeecCccceeeec----CCCCceeeeeee-cCCCeEEEE Confidence 8777667777777776555444433444667777765 6888999998877321 111122234554 445567777 Q ss_pred EeCCeEEEEeCCCeEEEEEecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceEEEEecCCCC Q lcl|NC_019510. 81 FTGGDIKVFDLNGQEYAVRGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDALINVRGGQYG 160 (799) Q Consensus 81 ~~~g~irv~~~~G~~~~v~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~~~v~~~~y~ 160 (799) .+.+.++.++..+ +..+.. .+|..+.. ..-+++|-+|+++++|...+|+.+-- T Consensus 75 ~~~~~~~~~~~~t-~~dvs~-~~~~~~~~--~~w~~~~f~~~i~a~ng~~~~q~~~~----------------------- 127 (513) T protein:vir:88 75 CSEKRLYLADGTT-IIDVSP-GPYSASVT--NRWSVGSFNGVIFANDGVNPPHHLPP----------------------- 127 (513) T ss_pred eeceEEEEecCce-eeeccc-cceeeccc--CceeeeeecCEEEEEcCCCcceEEcC----------------------- Confidence 7777666665322 332222 23321111 11344555555554443322221100 Q ss_pred ceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEecCCCceeEEE Q lcl|NC_019510. 161 RTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAPDGDSIRGLQ 240 (799) Q Consensus 161 ~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~~~~~~~~~~ 240 (799) + + T Consensus 128 --------------------~---------------------------------------------------s------- 129 (513) T protein:vir:88 128 --------------------T---------------------------------------------------E------- 129 (513) T ss_pred --------------------C---------------------------------------------------C------- Confidence 0 0 Q ss_pred EEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccceeeccceeeEEEe Q lcl|NC_019510. 241 TKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVGLNNGTMPWSLIR 320 (799) Q Consensus 241 ~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~~t~p~~lv~ 320 (799) ..+++|| T Consensus 130 -----------------~~f~dl~-------------------------------------------------------- 136 (513) T protein:vir:88 130 -----------------SVFRVLP-------------------------------------------------------- 136 (513) T ss_pred -----------------ceeeecc-------------------------------------------------------- Confidence 0000000 Q ss_pred ccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEec--------CCeEEEEccCCcc----ccccccc Q lcl|NC_019510. 321 AADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLS--------GENIILSRTAKYF----NMYPASV 388 (799) Q Consensus 321 ~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~--------~~~v~~Sr~gd~~----nF~~~t~ 388 (799) +|.+ ...-..|.+|++||++++ |+.|+.|..+|.. .|..+. T Consensus 137 -----------g~p~---------------~~~a~~i~v~~~flv~~~~t~~~~~~PnrV~wS~~~D~~~~P~~W~~t~- 189 (513) T protein:vir:88 137 -----------NFPA---------------NTTFRRLKSFKNFLIGLNVTSNSIEMPQMVWWSTSADAGGVPASWDPTD- 189 (513) T ss_pred -----------CCCc---------------ccceEEEEEEeeEEEEeecccCcCCCCceEEEecccCCccccccccccc- Confidence 0000 001235667899999874 5679999999963 342221 Q ss_pred cCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEe-CCccccccceEEEEEEeecc-cCCCCcEEeCCeEEE Q lcl|NC_019510. 389 AVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLN-ASGVLSAKSVELNLTTEFDV-NDGARPYGIGRGVYF 466 (799) Q Consensus 389 ~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~-~~~~lTP~~~~~~~~s~~~~-~~~~~Pv~vg~~v~f 466 (799) . ..+.+=.++ .+....|...++....|+||++.+-|.++ .++ |....++....-.| .+.-.=+.+|+.++| T Consensus 190 ~-t~~a~~~~l---~d~~g~~v~g~~~g~~liif~e~~i~~m~y~g~---~~if~~~~i~~~~G~~~p~SI~~~~~~~ff 262 (513) T protein:vir:88 190 P-TKDAGQNTL---ADTNGAIVDGVKLRDSFIIYKEDSVYSMRYIGG---LYIFQFQQLFNDVGILGPNCAIEFDGNHFV 262 (513) T ss_pred c-cCccccccc---CCCccceeeeeecccceEEEecccEEEEEecCC---CceEEEEeecccccccCCceeEEECCeEEE Confidence 1 122222222 33345555567777899999999999996 332 23344444443233 333333789999999 Q ss_pred EecCCCceeEEEEEeeccccCceehhhHHHHHHHhc-----CCCcEEEEEcCCCCcE-EEEEEcC------C--CeEEEE Q lcl|NC_019510. 467 ASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYI-----PNGVFSISGSSTENFA-TVLTSGA------K--GKVFIY 532 (799) Q Consensus 467 ~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~-----~g~~~~~~a~~~~~~~-~v~~~~~------d--g~l~~~ 532 (799) +.+.| +.++ +-.+-...+. ..++.+| ....-.+.+.--+... +.|+-.+ + .++++| T Consensus 263 ls~~G-----f~~~--~G~~~~~Ig~---ekVdk~f~~~~n~~~~~~~~~~~d~~~~~v~~~y~s~~~~~~~~~~~~lVY 332 (513) T protein:vir:88 263 VGHGD-----VYVH--NGVQKQSVID---AQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSEPGKHCDRAIIW 332 (513) T ss_pred EeCCc-----eEEe--cCceeeeccc---chhhhhhhccCCcccceEEEEEEcCcccEEEEEecCCCCCCCcccceEEEE Confidence 99987 3232 2111111111 1222222 1222222222222222 3333111 0 246666 Q ss_pred EeeeCCCceeeEeeEeeecCCCeEEEEEEEeCCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEcccc Q lcl|NC_019510. 533 KFLYIDEQIQQQSWSHWDFGDNVTVLAANSIGSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANA 612 (799) Q Consensus 533 tyl~~~~e~~v~aW~~w~~~g~~~~~~~~~~~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~ 612 (799) .|+ + ..|+.-+.+..+-- +...-+.+.......... -.|... T Consensus 333 d~~----~---~~Ws~~~~p~~~~g--~~g~~~~~~~~~~~~~~~---------------------~~d~~~-------- 374 (513) T protein:vir:88 333 NWK----E---NTWSIRDLPNVLSG--AYGIIDPKTSNLWDDDSN---------------------PWDTDT-------- 374 (513) T ss_pred Ecc----C---CeEEEEeccchhhc--ccccccccccceeccccc---------------------ccccch-------- Confidence 662 2 24654444332100 000001011000000000 000000 Q ss_pred ccccccccccccccccCccccc-cceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeE Q lcl|NC_019510. 613 FNNDRYETTVDLNAVFGGMRWQ-VGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLI 691 (799) Q Consensus 613 ~~~~~~~~~~~~~~~~~g~~~~-~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~ 691 (799) .....+.+.-. ........++|.+... ..++ -.-|-++++.++.....+ T Consensus 375 -----------~~~~~~~~~~~~~sl~~~~~~~~~~~~f--d~~~-----------------~f~G~~lea~~~t~~~~~ 424 (513) T protein:vir:88 375 -----------SVWGEGSYNPAKSSMIFTSFQDAKLFLF--GETS-----------------TFSGQSFTSTLERSDIYL 424 (513) T ss_pred -----------hhhhccccccccceeEeeeccCCceeee--cccc-----------------cccCCceEEEEEecCccc Confidence 00000000000 0001111222222111 0011 134667888888776654 Q ss_pred EcCCCccceeeeecceEEEEEEEEEeeccceEEEEecCCCcccceeccccccCccccccccc-ccceEEEEEeecccceE Q lcl|NC_019510. 692 KKQDESGGFSTEDVGRLQHRRAWLNYEQSGAFYVDVTNLGRSYRYTMSGKPLGDTTLGQANL-ESGQFRFPLAGNAQYNR 770 (799) Q Consensus 692 ~~~~g~~~~~~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~-~tg~~~vp~~~~~~~~~ 770 (799) .+ + .+. .+|+++...+...+.+.+.+......... -.+. .+..- ..++..++++...+..+ T Consensus 425 ~~--~-~~~-------~~i~~v~~~~t~~g~~t~~vg~~~~~~~~----~~~s----~~~~~~~~~~~~~~~r~~gRy~~ 486 (513) T protein:vir:88 425 GD--D-RMM-------KTVSAVIPHITGNGVCNIWVGNAQVQGSG----IRWK----GPYPYRIGQDYKIDTKHVGRYIA 486 (513) T ss_pred cC--c-hhh-------eeeeeeeeeeecceEEEEEEeeeccCccc----cccc----cceeeecccCceEEeccCCceEE Confidence 21 1 111 14555555566666666655432111000 0000 01010 11234466667778888 Q ss_pred EEEEECCCCcEEEEEEEEEEEEeccccC Q lcl|NC_019510. 771 VVLTSDYTTPLSIIGCGWEGNYIRRSTG 798 (799) Q Consensus 771 v~i~~~~P~P~tvl~i~~eg~y~~r~rr 798 (799) ++|+...--|+++.++++|..--. .|| T Consensus 487 ~ri~i~~~~~w~~~G~~ve~~~~~-g~R 513 (513) T protein:vir:88 487 LKFDFASAGDWYFNGYTLEMAPKA-GMR 513 (513) T ss_pred EEEEccCCCceEEeeEEEEEecCC-CCC Confidence 888888899999999999887521 333 No 31 >protein:vir:3133 Length: 911 # NCBI annotation: hypothetical protein # Family: family:all:5234 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640315;genbank:gi:21234408;genbank:GeneID:956056 Probab=97.24 E-value=0.00012 Score=42.09 Aligned_cols=655 Identities=15% Similarity=0.156 Sum_probs=253.8 Q ss_pred CCceeeecccee--cc-cccCCcHHHhHhH-HhhhhcceeeccCCceeCCch-------hhhhhhcCCCCcccceEEEEE Q lcl|NC_019510. 1 MGLVSQSIKNLK--GG-ISQQPNILRFPDQ-GELQVNGWSSETEGLQKRPPM-------VFNRKLGAAGFLGAAPLVHLI 69 (799) Q Consensus 1 M~~v~~s~~~~~--gG-VSqq~D~~ry~~~-~~~~~N~~~~p~gGl~rRpGt-------~~v~~~~~~~~~~~~~~l~~~ 69 (799) |+.-.+....|+ +| |- .-.+..|..- +-..+||-....|=-+||=|. +|+..+.+.........+-.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (911) T protein:vir:31 1 MAARKGAVNRFTPVRGWVT-EGNLANYGQDVALDVENMDIEKTGLTQRRFGLFAETSSEQFLSTFTATARARGLLAVKEW 79 (911) T ss_pred Cccccccccccccceeeee-cCchhhcCceeEeeeccccchhcccchhheeeeeccchhhhhhhhhhhhhhcceeehhhH Confidence 887666666664 23 21 1134444333 335789988888888888774 455444332211111111111 Q ss_pred E--cCCceEEEEEEeCCe-EEEEeCCC------eEEEEEecCccccccCCc-----ceeEEEEEcCEEEEEeCCeeeeee Q lcl|NC_019510. 70 N--RDEYEQYYVVFTGGD-IKVFDLNG------QEYAVRGDKSYVQTANPR-----NSIRCVTVADYTFVVNRERVVQAD 135 (799) Q Consensus 70 ~--~~~~~~y~l~~~~g~-irv~~~~G------~~~~v~~~~~y~~t~~~~-----~~l~~~q~aD~~~l~~~~~~p~~l 135 (799) . -+....-||+|..|| +.|..+.. .++++. .+++.-.+ +-.+..---....|.||...|--+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (911) T protein:vir:31 80 REAWGDKDVNMLIFHAGYKVHVVQDTAPLRDANILLTID----LLEAGIKLDGVIDSPVHISVGVGFAIITNPRIEPVLI 155 (911) T ss_pred HHhhCCCcceEEEEecCcEEEEEecccCccccceEEEee----eeccCceeeeeecCceeEEeeceEEEeecCccceEEE Confidence 0 133345677887776 33332221 122211 11111100 001222223466778888877533 Q ss_pred c--cccCCCCcCCC--cce-EEEE--------ecCCCCce------eeEEeccceEEEEEecCCccccceecccccc--c Q lcl|NC_019510. 136 G--NLTNGGTFNDQ--KDA-LINV--------RGGQYGRT------LNVIFNEATRATIKLPSGTGTTPPIEEQVAA--V 194 (799) Q Consensus 136 ~--~~~~~~~~~~~--~~~-~~~v--------~~~~y~~t------y~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~--~ 194 (799) . ...+..-...+ +.. ++.. .+.+|+.+ +.+-..+..+.+-.+.+-++.. +..-...+ + T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 234 (911) T protein:vir:31 156 KLDDVDDEGVPTLSYEPLTLLIRTRELLTPYTTGTNYGDTLTPEEEWNLYNSGWATITRATKDKSGSG-TVYVNPVQYYF 234 (911) T ss_pred EeeccCccCcccccccceeeEeeehhhccccccccccCcccCchhhcccccccceeeeeecccCCccc-eEEEchhheee Confidence 2 11111111000 100 0100 01111111 0111111111111111111100 00000000 0 Q ss_pred c-----cccchhhhhhccccccccccceEEEECCcEEEEEecCCCceeEEEEEeccCcceeeEEeeeecccee-cccccc Q lcl|NC_019510. 195 D-----AQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAPDGDSIRGLQTKDGYADQLISPVTHYAQTFAK-LPQNAP 268 (799) Q Consensus 195 ~-----~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~-l~~~~~ 268 (799) + +.|. -|.+.++... +..+..+..-. ....+.-++.. .|+ T Consensus 235 ~~~~~~~~~~--~~~~~~~~~~---------------------~~~~~~~~~~~--------~~~~~~~~~~~~~~~--- 280 (911) T protein:vir:31 235 DKRGVYPSHS--VLYNSMKQES---------------------AKEIVALNVFS--------PWADEKINFGTTTPP--- 280 (911) T ss_pred cccCcCcchh--hhhhhhhhhc---------------------cceeEEEeeec--------cccccccccccCCCc--- Confidence 0 0000 0001000000 00000000000 00000000000 000 Q ss_pred CCeEEEEEecCCCCcceeEEEEe----cC---------ceEEEEecccc---c--cceeec-cceeeEEEeccCceEEEe Q lcl|NC_019510. 269 DGYTVKIVGDTSRSADKYYVRYN----LT---------RKVWEETVGWN---I--QVGLNN-GTMPWSLIRAADGQFDFV 329 (799) Q Consensus 269 ~G~~v~i~~~~~~~~~~~y~~~~----~~---------~~~w~e~~~~~---~--~~~~~~-~t~p~~lv~~~~~t~~~~ 329 (799) -|..+ -..||..-. -+ .++ .|..+|. + ..++++ +++ ..+--.+.|+.+ T Consensus 281 ~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~p~~~e~~np~gl~~igt~-~n~k~~a~~~~~-- 347 (911) T protein:vir:31 281 LGRYI---------HSAYYFDSAAILSLGIGNLTPPTSDGT-TEGSGPAEEEISNPIGLDNIGTV-NNLKLIAEGTVR-- 347 (911) T ss_pred hhhhh---------hhheeeccceeeeecccccCCCCCCCc-cCCCCCchhhhcCCCCcccccch-hceeeeecccee-- Confidence 00000 011111100 00 000 0111111 0 111211 110 001111222222 Q ss_pred ecccCccccCccccccCccccCCCeeEEEEEcceEEEec-----CCeEEEEccC--------Ccccccccccc--CCCCC Q lcl|NC_019510. 330 ANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLS-----GENIILSRTA--------KYFNMYPASVA--VLSDD 394 (799) Q Consensus 330 ~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~-----~~~v~~Sr~g--------d~~nF~~~t~~--~~~Dd 394 (799) |. +...|+|++||.+|++|+. ...|.+|+.- +|++=++++.. .+.|. T Consensus 348 ---~~---------------~~~r~r~~~~yaGRVfyaD~dkngk~rIlFSqLv~sl~di~nCYQdaDPTSeee~DLIdT 409 (911) T protein:vir:31 348 ---WT---------------VKDRPRCSGYHNGHVYFGDRDKNGKTRILVSQLVNSLDNIPKCFQDADPTAEEINDLIAT 409 (911) T ss_pred ---ee---------------ecccccceeeeccEEEEeeeccCcceeEEEEeeccccccccccccCCCccccccchhhhc Confidence 21 2245899999999999995 2369999843 45555555433 24467 Q ss_pred ccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCc--cccccceEEEEEEeecccCCCCcEEeCCeEEEEecCCC Q lcl|NC_019510. 395 DPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNASG--VLSAKSVELNLTTEFDVNDGARPYGIGRGVYFASPRAT 472 (799) Q Consensus 395 D~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~--~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~ 472 (799) |...+.+.. ...|+-|+.+++.|++|..++.|+|.|.+ ..|.++..|.+++..+|++.=.=|++|+.++|-++.| T Consensus 410 DGg~vri~g--ah~Ii~LV~~G~sLlVFcaNGVWAI~G~d~~g~TATdy~ItKIsdvGcsspNSVVvVgn~i~fWSd~G- 486 (911) T protein:vir:31 410 DGFTMYPVG--MGAPITMVEFNKRLLLLCTNGVWAIRGTSGGGATATDFTLDKVASVEFNSPQSVVDIGTAIVFWSERG- 486 (911) T ss_pred CCcEEecCC--CCCceEEEEecCeEEEEEeCcEEEEeccCCCceeeeeeEEEEEeeeeeCCCCeEEEecCceEEeeCCc- Confidence 888887765 56778899999999999999999998754 6899999999999999998888899999999999987 Q ss_pred ceeEEEEEeeccccCceehhhHH-HHHHHhcCC----CcEEEE-EcCCCCcEEEEEEcC--C-CeEEEEE----e-eeCC Q lcl|NC_019510. 473 FTSINRYYAVQDVSAVKNAEDMT-MHVPSYIPN----GVFSIS-GSSTENFATVLTSGA--K-GKVFIYK----F-LYID 538 (799) Q Consensus 473 ~~~~~r~~~~~~~~d~~~~~dls-~~~~h~~~g----~~~~~~-a~~~~~~~~v~~~~~--d-g~l~~~t----y-l~~~ 538 (799) |....+.+ -.-+.++.+| ..++.|.+. .+...+ .|-.....+.|+.-+ | .+.+.+. + | T Consensus 487 ---IyaLganq--fnD~tAnNLTesTIQ~y~d~I~~dkIkNVtgtyd~de~rVyW~yPn~lDe~teykt~~~~ILVf--- 558 (911) T protein:vir:31 487 ---IIAIGVND--FGDLTSNNLTENTIDEYYDSLDRDIIKNVKGTFINDENRVYWVVPNKQDSNGEYKTDGELVLVL--- 558 (911) T ss_pred ---EEEEeecc--cCccccccccHHHHHHHHhhcChhhhceEEEEEEccCCEEEEEecCccCCccceeecCceEEEE--- Confidence 44443333 3347888888 566666532 111111 233334456666642 2 2333311 0 1 Q ss_pred CceeeEeeEeeecCCCeEEEEEE----Ee-----------------------------------CCEEEEE-EEeCCCEE Q lcl|NC_019510. 539 EQIQQQSWSHWDFGDNVTVLAAN----SI-----------------------------------GSHMHVI-LQNGYDIF 578 (799) Q Consensus 539 ~e~~v~aW~~w~~~g~~~~~~~~----~~-----------------------------------~d~l~~~-v~r~~~~~ 578 (799) .-...+|-+|+..+.-...... .. +..+|++ +|-+.+ T Consensus 559 -dLatgaFYPwtvs~gpLl~~p~y~Lv~TreEvtvPi~~etgaiIve~gsdPV~~tl~vdttGvDg~ayLl~frdg~~-- 635 (911) T protein:vir:31 559 -NLDTGGFYKHTVSGGPLLHAPFRRLVNTRAEVSIPITETDGTVITDTLGDPVTVTRTVTTTGVDGLAYFASFDDGVN-- 635 (911) T ss_pred -EeccCcccceeeecceeecccccccccccccceeeEEeecceEEEecCCCCeEEEEeeecccccceeEEEeeccCCc-- Confidence 1112378888764332110000 00 1123333 111111 Q ss_pred EEEEEEEeeccCCCccccceeeceeEEEEEccccccccccccccccccccCccccccceEEEEecC-Cccccccccccce Q lcl|NC_019510. 579 MGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVDLNAVFGGMRWQVGKILVSDEV-GEVRQYEPPAGGW 657 (799) Q Consensus 579 ~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~ad-G~~~~~~~v~~g~ 657 (799) ++|.+..+. ...-|+|.+.- + +.+-.+ ...+++ |.. ...+.-|. T Consensus 636 -g~~~f~a~~------~~~~~~dw~~~---~--------~~~~~~---------------y~s~~~~~y~--~~~~~~~~ 680 (911) T protein:vir:31 636 -GQFNFIAEH------QPWGFADWANV---P--------NMTRVN---------------YSSYVDFAYE--YPEVMIGN 680 (911) T ss_pred -ceEEEEEee------cCCeeeccccC---c--------cccccc---------------hhHHHHhhhh--hhhhhhhc Confidence 112111000 00012221110 0 000000 000010 110 11112233 Q ss_pred ecCceEEE--------ccC-CCCCeEEEeEeeeEEEEecCeeEEcCCCccceeeeecceEEEEEEEEE------------ Q lcl|NC_019510. 658 ASDPTLRI--------VGD-MAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGRLQHRRAWLN------------ 716 (799) Q Consensus 658 ~~~~~~~i--------~~~-~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl~l~~~~~~------------ 716 (799) +.+|.+.- ..+ +.++. .||.|- +++. . .+...|.+..|++.+- T Consensus 681 ~~~pyi~sy~~~~~rv~~~~y~~~~--a~~~f~-~~~~-------~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 744 (911) T protein:vir:31 681 ISLPYIHSYYLTGIRVQTEQYTTET--AHLSFH-RVQA-------H------QTTALGTVTFHKVDMMVSTGMQVISFHK 744 (911) T ss_pred ccCceeeeeeeeeeEEeccceeeec--ccceeE-eeec-------c------cceeeeeeeeeeeeehhhccceeeeecc Confidence 33332221 000 00111 111111 0000 0 0011122333333221 Q ss_pred --eeccceEEEEecCCCcccceeccccccCc---ccccccccccceEEEEEeecccceEEEEEECCC-----------Cc Q lcl|NC_019510. 717 --YEQSGAFYVDVTNLGRSYRYTMSGKPLGD---TTLGQANLESGQFRFPLAGNAQYNRVVLTSDYT-----------TP 780 (799) Q Consensus 717 --~~~t~~~~~~v~~~~~~~~~~~~~~~~~~---~~~~~~~~~tg~~~vp~~~~~~~~~v~i~~~~P-----------~P 780 (799) +.++-...+ |+..+ +..++.|+.+.. +..-+.|+..|+.-+- + ...++-.+-|+-- ++ T Consensus 745 ~~~~~~~~~~v-VNGDA--E~GtmTGWtvtaG~~d~~Ta~p~~rGSyfFa--~-~nn~n~aL~QDIDSagaaaIDAG~v~ 818 (911) T protein:vir:31 745 DDLLRTEAVTL-VNPDA--ETGDATGWTVTAGTLDVRTAAPLYQGSYYFW--S-DSNANFAAYQDIDPVGGGYITAGELA 818 (911) T ss_pred ccceeeeeeEE-EcCCC--CCCCCCcceeeccchhhccCCchhcceEeEc--C-CCCcchhhheeccccccceeeeccch Confidence 112212111 22211 233445554432 1122345566665332 1 1133333333322 33 Q ss_pred EEEEEEEEEEEEeccc--cC--C Q lcl|NC_019510. 781 LSIIGCGWEGNYIRRS--TG--I 799 (799) Q Consensus 781 ~tvl~i~~eg~y~~r~--rr--v 799 (799) .++. .|-+-|-.+. -| | T Consensus 819 ynvS--awl~gyAaqnd~Dr~~l 839 (911) T protein:vir:31 819 NNVI--EAKLSWAARGNTDLGTV 839 (911) T ss_pred hhhh--hhhhhhccCCCCccceE Confidence 3332 2666666653 33 3 No 32 >protein:vir:105563 Length: 396 # NCBI annotation: hypothetical protein # Family: family:all:27455 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164316;genbank:gi:56692963;genbank:GeneID:3197174 Probab=91.97 E-value=0.013 Score=30.85 Aligned_cols=372 Identities=10% Similarity=0.026 Sum_probs=148.0 Q ss_pred CCceeeeccceecc--cccCCcHH----HhHhHHhhhhcceeeccCCceeCCchhhhhhhcCCCCcccceEEEEEEcCCc Q lcl|NC_019510. 1 MGLVSQSIKNLKGG--ISQQPNIL----RFPDQGELQVNGWSSETEGLQKRPPMVFNRKLGAAGFLGAAPLVHLINRDEY 74 (799) Q Consensus 1 M~~v~~s~~~~~gG--VSqq~D~~----ry~~~~~~~~N~~~~p~gGl~rRpGt~~v~~~~~~~~~~~~~~l~~~~~~~~ 74 (799) |+.+ ++--|+|= |+.-.+++ --..-++++.|+=.++.|=.+||-|.+-+.... +.++-.+.. T Consensus 1 ~~~~--~~~~~~ginnv~~e~~l~~~~~~~~~~~r~a~nvdi~~~G~~~~r~~~tr~~~g~----------l~~~~~~~~ 68 (396) T protein:vir:10 1 MATT--SLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQP----------FRQLWQSPL 68 (396) T ss_pred Ccce--eeeeeecccccccccccccCCCcccceeeeeeeecccCCCchhhhccCcccCCce----------ecccccCcc Confidence 8865 33333322 44333333 345678999999999999999999988665322 122222333 Q ss_pred eEEEEEEeCCeEEEEeCCCeE--EEEE-ecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCcCCCcceE Q lcl|NC_019510. 75 EQYYVVFTGGDIKVFDLNGQE--YAVR-GDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTFNDQKDAL 151 (799) Q Consensus 75 ~~y~l~~~~g~irv~~~~G~~--~~v~-~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~~~~~~~~ 151 (799) ..+.|-...+.+--++.+.-. ..++ +..| +.+.+.+|-+|.++...+=.+.. T Consensus 69 ~~~~~~~~~~tl~~~~~~~w~~~~~v~v~~~p----------va~d~~~~Rvy~t~~~~p~~~~~--------------- 123 (396) T protein:vir:10 69 HGDAFGALGDQWGKVDPHSWTFEPLAQIGEGD----------LSHEVLNNRVCVAGTAGIFTYDG--------------- 123 (396) T ss_pred ccceeeeCCceEEEEeCCeEEEEeeeeeccCc----------hhccccCCeEEEEcCCCceeeeC--------------- Confidence 334443334444433322110 0111 1222 23345677788777554321110 Q ss_pred EEEecCCCCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEec Q lcl|NC_019510. 152 INVRGGQYGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAP 231 (799) Q Consensus 152 ~~v~~~~y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~ 231 (799) ...|.+.+. +|+..-. . ......++ +.+++. .-+++.. T Consensus 124 --------~~~y~L~vp--------~P~~a~~-~---a~~Gsl~~------------------~~~~Y~----~t~V~~~ 161 (396) T protein:vir:10 124 --------AQAERLTLD--------TPAPPLL-V---AGAGSLSQ------------------GTYGAA----VAWLRGP 161 (396) T ss_pred --------CcceecCcC--------CCccccc-c---cccCccCC------------------ceEEEE----EEEEecC Confidence 011111111 0110000 0 00000000 000000 0001000 Q ss_pred --CC-CceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccce Q lcl|NC_019510. 232 --DG-DSIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVG 308 (799) Q Consensus 232 --~~-~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~ 308 (799) ++ ....+.....++.-+ + +-+++...+..-.+|=-+..++. .+|+-.+ T Consensus 162 gEEs~p~~~S~~v~~~gg~~--------v---tl~~~~~~~i~~~RiYrS~~~G~-~~~l~aE----------------- 212 (396) T protein:vir:10 162 QESAPSLIAFAEVTDAGALE--------V---TFPLCLDASVTGARLYLTRANGG-ELLLAGD----------------- 212 (396) T ss_pred CCcCcccccccccCCCCCcE--------E---EEEcccCCCcceEEEEEeCCChh-hhhheeh----------------- Confidence 00 000000000000000 0 00011111111112211111111 1121111 Q ss_pred eeccceeeEEEeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEecCCeEEEEccCCccccc-ccc Q lcl|NC_019510. 309 LNNGTMPWSLIRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLSGENIILSRTAKYFNMY-PAS 387 (799) Q Consensus 309 ~~~~t~p~~lv~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~-~~t 387 (799) ++ ....+|.+...+|........-=-|.|+ | ..+++|.+||+++.++.||+|...-++=++ +.. T Consensus 213 -----~~-----a~~~s~vlPs~~w~gpP~~~~gL~pmP~--G---~~~A~faGRi~~A~Gn~V~FSEp~~Ph~~~~~~~ 277 (396) T protein:vir:10 213 -----YP-----LGAATVILPTLPELGRPAQFRHLSPMPT--G---KHLAYWRGRLLIARANVLRFSEALAYHLHDERYG 277 (396) T ss_pred -----hc-----cceeeeeeecCCCCCCCccccccccCch--h---HhhhhhcceEEEEeCCEEEEecCCCCceecchhc Confidence 11 1122333444566543211111112221 1 157899999999999999999999873221 111 Q ss_pred ccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCCcc--ccccceEEEEEEeeccc---------CCCC Q lcl|NC_019510. 388 VAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQFVLNASGV--LSAKSVELNLTTEFDVN---------DGAR 456 (799) Q Consensus 388 ~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e~~i~~~~~--lTP~~~~~~~~s~~~~~---------~~~~ 456 (799) -++ . ...|.-+.+...+|+++|+++-|.+.|.++ |+.......+- .-|+ +.-. T Consensus 278 ~~~------------~--~~~Iv~lapv~~gL~Vgt~~~~y~~~G~dP~sms~~~l~~~~p--vp~S~v~~p~~~~s~rs 341 (396) T protein:vir:10 278 FVQ------------M--PQRITFVQPVDGGIWVGQVDHVAFLDGADPASLSVSRRASRAP--VPGSAVLVPAEVVGTNA 341 (396) T ss_pred cCC------------C--CCceEEEEEecCeEEEEEcCcEEEEEcCChhHcceeecccCCC--cccchhcccchhhhccc Confidence 111 1 123455677888999999999999999653 44433321110 1122 1222 Q ss_pred cEEeCCeEEEEecCCCceeEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeee Q lcl|NC_019510. 457 PYGIGRGVYFASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLY 536 (799) Q Consensus 457 Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~ 536 (799) .+..+..++|+++.|= +- . ..++-. ..+....+.+..... + -| +..|++.+++ + T Consensus 342 ~~~~~~~~lwas~dGl----~~----g-~~~G~v----~~l~~~~i~p~~~~A-~--------~~-~~~drRy~~~--~- 395 (396) T protein:vir:10 342 SPDGSPVAVWLAENGY----VM----G-TSSGAI----AEVHAGVLAGITGRA-G--------TS-VVFDRRLLTA--V- 395 (396) T ss_pred ccccCcEEEEccCCcE----EE----E-cCCcee----eeecccccCCCcccc-e--------EE-EeecCeEEEE--e- Confidence 3456888999999882 11 1 111111 111112222211100 0 11 1123332221 1 Q ss_pred C Q lcl|NC_019510. 537 I 537 (799) Q Consensus 537 ~ 537 (799) + T Consensus 396 ~ 396 (396) T protein:vir:10 396 S 396 (396) T ss_pred C Confidence 1 No 33 >protein:vir:108312 Length: 458 # NCBI annotation: hypothetical protein # Family: family:all:1540 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552278;genbank:gi:160700603;genbank:GeneID:5758828 Probab=91.47 E-value=0.016 Score=30.47 Aligned_cols=439 Identities=11% Similarity=0.054 Sum_probs=168.6 Q ss_pred ceEEEEecCCCCceeeEEeccceEEEEE-ecCCccccceecccccccc-cccchhhhhhcccc--ccccccceEEEECCc Q lcl|NC_019510. 149 DALINVRGGQYGRTLNVIFNEATRATIK-LPSGTGTTPPIEEQVAAVD-AQHIAEELAKQIRE--SLAGNPGWTINVGTG 224 (799) Q Consensus 149 ~~~~~v~~~~y~~ty~v~~~~~~~~~~~-tp~gt~~~~~~~~~~a~~~-~~~i~~~l~~~~~~--~~a~~~~~t~~~~g~ 224 (799) .....++.++|...-.+ .+..-.+-+. +|.-.+ ....... ..-........... ..-..++.-+.+.+. T Consensus 1 m~~~~ip~gsy~a~~~~-~daq~~VN~yp~~~e~g------~ss~~l~~tPGl~~f~~~~~~~~~g~~~~~g~ly~v~g~ 73 (458) T protein:vir:10 1 MVQRQIPLVATTAEGDV-SGQEILVNVYPRKSDGG------KYPFTLRHTPGLAFFCELPTFPVMAMHQNGSRAFAVTPR 73 (458) T ss_pred Cceeeeceeeeeccccc-ccceeeeeeeeeccccc------ccccceEecCCceeeecCCCCceeeEEecCCEEEEeeCc Confidence 22233333333222110 0000000000 000000 0000000 00000000000000 000001111112222 Q ss_pred EEEEEecCCCceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCCCCcceeEEEEecCceEEEEecccc Q lcl|NC_019510. 225 FVNIIAPDGDSIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWN 304 (799) Q Consensus 225 ~i~i~a~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~ 304 (799) .+|-...++.. +.+.+++. .|-++-..+ .....+ T Consensus 74 ~LY~V~~~~~~----------------------~~iG~i~g---sg~VsMa~n--------------g~q~vi------- 107 (458) T protein:vir:10 74 DMYEISKDGTY----------------------KRLGSVDF---KGRVVMEDN--------------GKQIVM------- 107 (458) T ss_pred eEEEEeCCceE----------------------EEEecccC---ceeEEEeeC--------------CcEEEE------- Confidence 22211110000 00011110 011111100 000000 Q ss_pred ccceeeccceeeEEEeccCceEEEeecccCccccCccccccCccccCCCeeEEEEEcceEEEec--CCeEEEEccCCccc Q lcl|NC_019510. 305 IQVGLNNGTMPWSLIRAADGQFDFVANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLS--GENIILSRTAKYFN 382 (799) Q Consensus 305 ~~~~~~~~t~p~~lv~~~~~t~~~~~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~--~~~v~~Sr~gd~~n 382 (799) ..+.+...+ +.++..+ . ..|. +.| ..+..|+|..+|+++.. +..++.|-.+| T Consensus 108 -----~~G~~gY~y-d~at~~~--~-~i~d------------~~~--~~~~~v~~~dGy~V~~~~g~~~~~is~L~d--- 161 (458) T protein:vir:10 108 -----VDGEKGYYY-DSETEIV--Q-EIKA------------EGF--YPASTVTYQDGYFIFDRKGTGQFFISELLD--- 161 (458) T ss_pred -----EECCeEEEE-eecccEE--E-eccC------------ccc--cCcceEEEeCcEEEEEeeCCCEEEEEecCc--- Confidence 001111100 0111111 1 1111 111 22689999999999885 44567785544 Q ss_pred cccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcE--EEEeCCccccccceE-EEEEEeecccCCCCcEE Q lcl|NC_019510. 383 MYPASVAVLSDDDPIDVAVSHNRVSILKYAVPFSEELLLWADEAQ--FVLNASGVLSAKSVE-LNLTTEFDVNDGARPYG 459 (799) Q Consensus 383 F~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e--~~i~~~~~lTP~~~~-~~~~s~~~~~~~~~Pv~ 459 (799) . --||++++-+.++++.|.-++...+.|++|.+..- |..+|+..+.=.... ...+ .+|++.-.=.. T Consensus 162 ---~------s~d~l~fa~Ae~~pD~iv~i~~~~~~i~~fG~~TiEvw~ntG~a~fpy~r~~ga~i~--~Gcaa~~sv~~ 230 (458) T protein:vir:10 162 ---V------AFDPLDFATAEGQPDPLLAVLSDHREVFMFGQETIEVWYNSGAADFPFERNQGAFIE--KGIGAPYSVAK 230 (458) T ss_pred ---c------eeCcceeeeecCCCCceEEEEeeccEEEEEeccceEEEEecCCCCcceeecccceee--ecccCcchhhh Confidence 1 14799998899999999999999999999977665 777776322211111 1122 46776655578 Q ss_pred eCCeEEEEecCCCceeEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCC-eEEEEEeeeCC Q lcl|NC_019510. 460 IGRGVYFASPRATFTSINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKG-KVFIYKFLYID 538 (799) Q Consensus 460 vg~~v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg-~l~~~tyl~~~ 538 (799) +|++++|+...+ .+++. ++|+++-+| -|=|+.-+ .+| +-...+.++...+| .+|+++| T Consensus 231 ~~~t~~~l~~d~---~Vy~l-------~g~~~~rIS---T~aIE~~i---~sy-~~~da~a~t~~~eGH~fy~Ltf---- 289 (458) T protein:vir:10 231 TNNTVYFIGSDL---MIYQI-------TGYTPVRIS---THAVEQTL---KGV-NLSDAFAYTYQSEGHLFYVLTI---- 289 (458) T ss_pred hCceEEEEcCCe---EEEEe-------cCceeEEee---CHHHHHHH---hcC-ChhheEEEEEEecCeEEEEEEC---- Confidence 999999999865 34442 356555443 33333321 233 23335666665555 4677776 Q ss_pred CceeeEeeEeeecCCCeEEEEEEEeCCEEEEEEEeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEcccccccccc Q lcl|NC_019510. 539 EQIQQQSWSHWDFGDNVTVLAANSIGSHMHVILQNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRY 618 (799) Q Consensus 539 ~e~~v~aW~~w~~~g~~~~~~~~~~~d~l~~~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~ 618 (799) +.. .| .|-+|..- .+|..-+- +..++ ....|..-+ .+....+ T Consensus 290 P~a---~~-Tw~yD~~t----------~~Wher~S---g~~~~----------------~Ra~~~v~~---~g~~~vG-- 331 (458) T protein:vir:10 290 PGK---NL-TWCYDISS----------GSWHVRQS---YQFDR----------------HVSNNSIYF---DQKTLVG-- 331 (458) T ss_pred CCC---Cc-eeEEeccc----------ccceeecc---CCCCc----------------eEEEEEEEe---CCeEEEE-- Confidence 211 11 23333321 12222110 00011 111111111 0000000 Q ss_pred ccccccccccCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCcc Q lcl|NC_019510. 619 ETTVDLNAVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESG 698 (799) Q Consensus 619 ~~~~~~~~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~ 698 (799) ++++|.... +.+... . .-|-+.+..+.++++ + ++ T Consensus 332 -------------D~~ng~ly~-ld~~~~------t--------------------d~g~~i~~~~~~p~~---~-~~-- 365 (458) T protein:vir:10 332 -------------DFQNGRIYI-MADNYY------T--------------------DDGDPVVREFILPVV---N-NG-- 365 (458) T ss_pred -------------EcCCCeEEE-EcccCc------C--------------------CCCceeeeeeeccce---e-CC-- Confidence 111221111 110000 0 012223334444222 1 11 Q ss_pred ceeeeecceEEEEEEEEEeeccceEEEEecCCCcccc--e---eccccccCcc----cccccccccceEEEEEeecccce Q lcl|NC_019510. 699 GFSTEDVGRLQHRRAWLNYEQSGAFYVDVTNLGRSYR--Y---TMSGKPLGDT----TLGQANLESGQFRFPLAGNAQYN 769 (799) Q Consensus 699 ~~~~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~~~--~---~~~~~~~~~~----~~~~~~~~tg~~~vp~~~~~~~~ 769 (799) ..||+++++.+.+.- |--.+ .+++.+.. + ...+..++.+ .+|++--+...+...-.|..++- T Consensus 366 ------~~rl~~~~~el~~~t-Gvg~~--~~~~~~p~~~l~~S~d~g~~~s~~~~~~~lg~~gey~tr~~~~rlG~ar~r 436 (458) T protein:vir:10 366 ------REFLTVDSLELDLSS-GVGLT--VGQGSDPELRVYFSKDNGNEYSQNFKVGKIGRKGEFLTRAKVNRFGCARQF 436 (458) T ss_pred ------CCeEEEEEEEEEEec-ceeee--eCCCCCceEEEEEeeCCCcccchhHHHhhcCCcchhhhhhhhhhhccCcce Confidence 135566666654432 11111 11111111 1 1122333321 22333333333333334566666 Q ss_pred EEEEEECCCCcEEEEEEEEEEE Q lcl|NC_019510. 770 RVVLTSDYTTPLSIIGCGWEGN 791 (799) Q Consensus 770 ~v~i~~~~P~P~tvl~i~~eg~ 791 (799) -++|+-..|.|.+|+++..+-+ T Consensus 437 vf~v~~s~p~~~~l~ga~~~~r 458 (458) T protein:vir:10 437 TFKVEISDPIPVDIGGAWVEVR 458 (458) T ss_pred EEEEEEecchhhcceeeeEEeC Confidence 6999999999999999999888 No 34 >protein:vir:105428 Length: 472 # NCBI annotation: gene 8 protein # Family: family:all:1540 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958184;genbank:gi:41057286;genbank:GeneID:2716675 Probab=81.33 E-value=0.087 Score=26.41 Aligned_cols=426 Identities=11% Similarity=0.046 Sum_probs=138.8 Q ss_pred Cccccceeccc--ccccccccchhhhhhccccccccccceEEEECCcEEEEEecCCCceeEEEEEecc-CcceeeEEeee Q lcl|NC_019510. 180 GTGTTPPIEEQ--VAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAPDGDSIRGLQTKDGY-ADQLISPVTHY 256 (799) Q Consensus 180 gt~~~~~~~~~--~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~~~~~~~~~~~~~g~-~~~~~~~~~~~ 256 (799) -.-.-+.+... ....+++++..... .+++. ..++. +.-.+....+. T Consensus 1 m~~~~~pl~~G~~~~~~~~d~~~~~pV----------------------N~~a~---------~~~~~~s~~~l~~tPGl 49 (472) T protein:vir:10 1 MPIQQLPLMKGVGKDFRNADYIDYLPV----------------------NMLAT---------PKEILNSSGYLRSFPGI 49 (472) T ss_pred CCeeeeeeccCceeeccccchhheeee----------------------eeeee---------ccCCCcccceeecCCCc Confidence 00000000000 00000111100000 00000 00000 00011111111 Q ss_pred eccceeccccc-------cCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccceeeccceeeEEEeccCceEEEe Q lcl|NC_019510. 257 AQTFAKLPQNA-------PDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVGLNNGTMPWSLIRAADGQFDFV 329 (799) Q Consensus 257 v~~~~~l~~~~-------~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~~t~p~~lv~~~~~t~~~~ 329 (799) + ..++++..+ ++|....|.+. .-|.+.. .|-+.++-+...=.++++....++.....-|.+. T Consensus 50 ~-~~a~v~G~~RG~~~~~~~g~lY~V~G~-----~LY~v~~-----~iGsiag~grVsMa~n~~~~av~~~g~~~~Y~yd 118 (472) T protein:vir:10 50 A-KRSDVNGVSRGVEYNMAQNAVYRVCGG-----KLYKGES-----EVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYD 118 (472) T ss_pred e-eeccCCccccceEEEeeCCeEEEEecc-----eEeeeec-----ceecccCcccEEEecCCcEEEEEECCceeEEEee Confidence 1 011222222 12222233221 1122111 1333333332111223333222221111112221 Q ss_pred ecccCccccCccccccCccccCCCeeEEEEEcceEEEecCC--eEEEEccCCccccccccccCCCCCccEEEEEcCCcce Q lcl|NC_019510. 330 ANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLSGE--NIILSRTAKYFNMYPASVAVLSDDDPIDVAVSHNRVS 407 (799) Q Consensus 330 ~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~~~--~v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~~~~ 407 (799) ...-......++... |-.-.+....|+|+..|++|.-+. .++.|-..|-+.. ++.-.++.+.++++ T Consensus 119 ~~v~t~~~~~~d~~~--p~~dlg~~~dv~f~dGyfV~~~~Gt~~~~is~l~d~~~~----------~~y~~fa~AE~~pD 186 (472) T protein:vir:10 119 GTVKTVSNWPTDSGF--TQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHP----------DRYSAQYRAESQPD 186 (472) T ss_pred ccchhhhcccccccc--ccccccceeeeeeecceEEEeccCcceEEEeccCCcccc----------ccccccccccCCCC Confidence 100000000011111 111234567899999999998743 3445655552211 22223455777888 Q ss_pred eeeeeeecCCcEEEEecCcE--EEEeCCccccccceEEEEEE----eecccCCCCcEEeCCeEEEEecCCCceeEEEEEe Q lcl|NC_019510. 408 ILKYAVPFSEELLLWADEAQ--FVLNASGVLSAKSVELNLTT----EFDVNDGARPYGIGRGVYFASPRATFTSINRYYA 481 (799) Q Consensus 408 ~i~~~v~~~~~L~l~t~~~e--~~i~~~~~lTP~~~~~~~~s----~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~ 481 (799) .|.-++...+.|++|.+..- |..+|+. +|+-+-..+++ ..+|++.=.=..+|++++|+.........+.. T Consensus 187 ~Ivgi~~~~~~i~lfG~~TiEvw~ntG~a--~~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~-- 262 (472) T protein:vir:10 187 GIIGIGTWRDFIVCFGSSTIEYFSLTGAT--TAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISNPATGAPSVYI-- 262 (472) T ss_pred ceEEEEeeccEEEEEeccceEEEEecCCC--CcccCceeecccceeeecccCcchhhecCceEEEEecCCccccEEEE-- Confidence 88888999999999977665 7777753 22222222222 24677665558999999999987432223332 Q ss_pred eccccCceehhhH-HHHHHHhcCCCc------EEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEeeEe-eecC- Q lcl|NC_019510. 482 VQDVSAVKNAEDM-TMHVPSYIPNGV------FSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQSWSH-WDFG- 552 (799) Q Consensus 482 ~~~~~d~~~~~dl-s~~~~h~~~g~~------~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~-w~~~- 552 (799) -++|+++=+ |.-++..|+.-. ..+.+++.+.+.+.....-+ .-+||.- -.. -||. |-.. T Consensus 263 ----~~g~~~~rIST~aIE~~i~~y~~~e~~dA~~~t~~~~GH~fy~LtfP~-~Tw~yD~-~t~------~Wherw~~~~ 330 (472) T protein:vir:10 263 ----IGSGQVSPIASASIEKILRSYTADELADGVMESLRFDAHELLIIHLPR-HVLVYDA-SSS------ANGPQWCVLK 330 (472) T ss_pred ----ccCceeEEecCHHHHHHHHhcCCccccceeEEEEEeCCeEEEEEEcCC-ceeEeec-ccc------cCceeeeeec Confidence 246666666 333444444321 12344445555444444333 2333321 111 3443 2221 Q ss_pred -----CCeEEEEEEEe----------CCEEEEEEEe---CCCEEEEEEEEEeeccCCCccccceeeceeEEEEEcccccc Q lcl|NC_019510. 553 -----DNVTVLAANSI----------GSHMHVILQN---GYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFN 614 (799) Q Consensus 553 -----g~~~~~~~~~~----------~d~l~~~v~r---~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~ 614 (799) ......|+++- +..||.+--. ..+-.++++... ..+. .+..|+ +|......+ T Consensus 331 ~g~~~~~~Ra~~~~~~~g~~~vGD~~ng~ly~l~~~~~td~G~~i~~~~~~-p~~~--~d~~Rv-~d~~ve~~~------ 400 (472) T protein:vir:10 331 TGLYDDVYRAIDFIYEGNQITCGDKLESVTGKLQFDISSQYGLQQEHLLFT-PLFK--ADNARC-FDLEVESST------ 400 (472) T ss_pred CCCccCceEEEEEEEeCCeEEEEEcCCCeEEEEcccCcCcCCCcceEEEec-ccee--CCCCeE-EEEEEEeec------ Confidence 12223333222 2334443322 111122222111 1110 011121 122211110 Q ss_pred ccccccccccccccCccccc-cceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEc Q lcl|NC_019510. 615 NDRYETTVDLNAVFGGMRWQ-VGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKK 693 (799) Q Consensus 615 ~~~~~~~~~~~~~~~g~~~~-~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~ 693 (799) |..+. .-..+....||......... ..+.+ -.|..++ T Consensus 401 ---------------G~~~~adp~~~~~~sDg~~~g~~~~~-----------~~~~~-------g~~~~R~--------- 438 (472) T protein:vir:10 401 ---------------GVAQYADRLFLSATTDGINYGREQMI-----------EQNEP-------FVYDKRV--------- 438 (472) T ss_pred ---------------CCCcccCceEEEeccCCcccchhhhh-----------hhccC-------cccccce--------- Confidence 11000 00111112222111000000 00000 1122221 Q ss_pred CCCccceeeeecceEEEEEEEEEeeccceEEEEecCCCcccceeccccccCcccccccccc Q lcl|NC_019510. 694 QDESGGFSTEDVGRLQHRRAWLNYEQSGAFYVDVTNLGRSYRYTMSGKPLGDTTLGQANLE 754 (799) Q Consensus 694 ~~g~~~~~~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 754 (799) +.+|+-- ..+--+|++.+...-+. .. .|- .+.+. T Consensus 439 ---------------~~~RlG~-~r~~vgf~~r~~~~~~v---~l----~ga----~~~~e 472 (472) T protein:vir:10 439 ---------------LWKRVGR-IRKNVGFKLRVITKSPV---TL----SGA----QIRIE 472 (472) T ss_pred ---------------eeeeeee-ccccceEEEEEEecccc---ce----eee----eEEeC Confidence 1221110 01111233332221110 00 000 00000 No 35 >protein:vir:177 Length: 472 # NCBI annotation: DNA stabilization protein # Family: family:all:1540 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112082;genbank:gi:13559872;genbank:GeneID:920982 Probab=78.96 E-value=0.11 Score=25.86 Aligned_cols=426 Identities=11% Similarity=0.045 Sum_probs=140.8 Q ss_pred Cccccceeccc--ccccccccchhhhhhccccccccccceEEEECCcEEEEEecCCCceeEEEEEecc-CcceeeEEeee Q lcl|NC_019510. 180 GTGTTPPIEEQ--VAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAPDGDSIRGLQTKDGY-ADQLISPVTHY 256 (799) Q Consensus 180 gt~~~~~~~~~--~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~~~~~~~~~~~~~g~-~~~~~~~~~~~ 256 (799) -.-.-+.+... ....+++.+..... .+++. ..++. +.-.+....+. T Consensus 1 m~~~~~Pl~~G~~~~~~~~d~~~~~pV----------------------N~~a~---------~~~~~~s~~~l~~tPGl 49 (472) T protein:vir:17 1 MPIQQLPLMKGVGKDFRNADYIDYLPV----------------------NMLAT---------PKEILNSSGYLRSFPGI 49 (472) T ss_pred CCeeeeeeccCceeeccccchhheeee----------------------eeeee---------ccCCCcccceeecCCCc Confidence 00000000000 00000111100000 00000 00000 00011111111 Q ss_pred eccceeccccc-------cCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccceeeccceeeEEEeccCceEEEe Q lcl|NC_019510. 257 AQTFAKLPQNA-------PDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVGLNNGTMPWSLIRAADGQFDFV 329 (799) Q Consensus 257 v~~~~~l~~~~-------~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~~t~p~~lv~~~~~t~~~~ 329 (799) + ..++++..+ ++|....|.+. .-|.+.. .|-+.++-+...=.++++....++.....-|.+. T Consensus 50 ~-~~a~v~G~~RG~~~~~~~g~lY~V~G~-----~LY~v~~-----~iGsiag~grVsMa~n~~~~av~~~g~~~~Y~y~ 118 (472) T protein:vir:17 50 A-KRSDVNGVSRGVEYNMAQNAVYRVCGG-----KLYKGES-----EVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYD 118 (472) T ss_pred e-eeccCCccccceEEEeeCCeEEEEecc-----eEeeeec-----ceecccCcccEEEecCCcEEEEEECCceeEEEee Confidence 1 011222222 12222233221 1222111 2333333332222233333322222111112221 Q ss_pred ecccCccccCccccccCccccCCCeeEEEEEcceEEEecCCe--EEEEccCCccccccccccCCCCCccEEEEEcCCcce Q lcl|NC_019510. 330 ANSWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLSGEN--IILSRTAKYFNMYPASVAVLSDDDPIDVAVSHNRVS 407 (799) Q Consensus 330 ~~~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~~~~--v~~Sr~gd~~nF~~~t~~~~~DdD~i~~~~~~~~~~ 407 (799) ...-......++... |-.-.+....|+|+..|++|.-+.+ ++.|-..|-+.. ++.-.++.+.++++ T Consensus 119 ~~v~t~~~~~~d~~~--~~~dlg~~~dv~f~dGyfV~~~~Gt~~~~is~l~d~~~~----------~~y~~fa~AE~~pD 186 (472) T protein:vir:17 119 GTVKTVSNWPTDSGF--TQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHP----------DRYSAQYRAESQPD 186 (472) T ss_pred ccchhhhcccccccc--ccccccceeeeeeecceEEEeccCcceEEEeccCCcccc----------ccccccccccCCCC Confidence 100000000011111 1112345678999999999987433 445655552211 22223455777888 Q ss_pred eeeeeeecCCcEEEEecCcE--EEEeCCcccc--c--cceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEe Q lcl|NC_019510. 408 ILKYAVPFSEELLLWADEAQ--FVLNASGVLS--A--KSVELNLTTEFDVNDGARPYGIGRGVYFASPRATFTSINRYYA 481 (799) Q Consensus 408 ~i~~~v~~~~~L~l~t~~~e--~~i~~~~~lT--P--~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~ 481 (799) .|.-++...+.|++|.+..- |..+|+...+ | .+.....+ .+|++.=.=..+|++++|+.........+.. T Consensus 187 ~Ivgi~~~~~~i~lfG~~TiEvw~ntG~a~~~~fpy~r~~g~~iq--~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~-- 262 (472) T protein:vir:17 187 GIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVAQPSLMVQ--KGIAGTYCKTPFADSYAFISNPATGAPSVYI-- 262 (472) T ss_pred ceEEEEeeccEEEEEeccceEEEEeeCCCCCCcCceeecCcceee--ecccCcchhhecCceEEEEecCCccccEEEE-- Confidence 88888999999999977665 7777764322 1 11112233 4677665558999999999987432223332 Q ss_pred eccccCceehhhH-HHHHHHhcCCCc------EEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEeeEe-eecC- Q lcl|NC_019510. 482 VQDVSAVKNAEDM-TMHVPSYIPNGV------FSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQSWSH-WDFG- 552 (799) Q Consensus 482 ~~~~~d~~~~~dl-s~~~~h~~~g~~------~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~-w~~~- 552 (799) -++|+++=+ |.-++..|+.-. ..+.+++.+.+.+.....-+ .-+||.- -.. -||. |-.. T Consensus 263 ----~~g~~~~rIST~aIE~~i~~y~~~e~~dA~~~t~~~~GH~fy~LtfP~-~Tw~yD~-~t~------~Wherw~~~~ 330 (472) T protein:vir:17 263 ----IGSGQVSPISSASIEKILRSYTADELADGVMESLRFDAHELLIIHLPR-HVLVYDA-SSS------ANGPQWCVLK 330 (472) T ss_pred ----ccCceeEEecCHHHHHHHHhcCCccccceeEEEEEeCCeEEEEEEcCC-ceeEeec-ccc------cCceeeeeec Confidence 246666666 333444444321 12344445555444444333 2333321 111 3443 2221 Q ss_pred -----CCeEEEEEEE----------eCCEEEEEEEe---CCCEEEEEEEEEeeccCCCccccceeeceeEEEEEcccccc Q lcl|NC_019510. 553 -----DNVTVLAANS----------IGSHMHVILQN---GYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFN 614 (799) Q Consensus 553 -----g~~~~~~~~~----------~~d~l~~~v~r---~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~ 614 (799) ......|+++ .+..||.+--. ..+-.++++... ..+.. ...|++ |.+.... -|.. T Consensus 331 ~g~~~~~~Ra~~~~~~~g~~~vGD~~ng~ly~ld~~~~td~g~pi~~~~~~-p~~~~--~~~RV~-d~el~~~--tG~~- 403 (472) T protein:vir:17 331 TGLYDDVYRAIDFIYEGNQITCGDKLESVTGKLQFDISSQYDKQQEHLLFT-PLFKA--DNARVF-DLEVESS--TGVA- 403 (472) T ss_pred CCCccCceEEEEEEEeCCeEEEEEcCCCeEEEEcccCcCCCCceeEEEEec-ceeeC--CCceEE-EEEEeee--CCcc- Confidence 1222233322 22344443322 111122222221 11111 111221 2211111 0000 Q ss_pred ccccccccccccccCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcC Q lcl|NC_019510. 615 NDRYETTVDLNAVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQ 694 (799) Q Consensus 615 ~~~~~~~~~~~~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~ 694 (799) .+..-.......||......... ..+.+ -.|..++ T Consensus 404 -----------------~~adp~~l~~~sDg~~~g~~~~~-----------~~~~~-------g~~~~R~---------- 438 (472) T protein:vir:17 404 -----------------QYADRLFLSATTDGINYGREQMI-----------EQNEP-------FVYDKRV---------- 438 (472) T ss_pred -----------------cCCCceEEEcccCCcccchhhhh-----------hhccC-------cccccce---------- Confidence 00000111112222111000000 00000 1122221 Q ss_pred CCccceeeeecceEEEEEEEEEeeccceEEEEecCCCcccceecccc-ccCc Q lcl|NC_019510. 695 DESGGFSTEDVGRLQHRRAWLNYEQSGAFYVDVTNLGRSYRYTMSGK-PLGD 745 (799) Q Consensus 695 ~g~~~~~~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~~~~~~~~~-~~~~ 745 (799) +.+|+--. .+--+|++.+....+- ...+- .-.+ T Consensus 439 --------------~~~RlG~~-r~~v~f~~~~~~~~~~---~l~~a~~~~e 472 (472) T protein:vir:17 439 --------------LWKRVGRI-RKNVGFKLRVITKSPV---TLSGCQIRIE 472 (472) T ss_pred --------------eeeeeeec-cccceEEEEEeecccc---eeeeeEEEeC Confidence 22221100 0101223322221110 00000 0000 No 36 >protein:vir:7329 Length: 825 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848220;genbank:gi:30387391;genbank:GeneID:2641863 Probab=61.47 E-value=0.35 Score=23.06 Aligned_cols=610 Identities=11% Similarity=0.039 Sum_probs=166.2 Q ss_pred cceEEEEEEcCCceEEEEEEeCCeEEEEeCCCeEEEEEecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeecc---- Q lcl|NC_019510. 62 AAPLVHLINRDEYEQYYVVFTGGDIKVFDLNGQEYAVRGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGN---- 137 (799) Q Consensus 62 ~~~~l~~~~~~~~~~y~l~~~~g~irv~~~~G~~~~v~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~---- 137 (799) .+..++.-+|++.|=--+..+-..+.-| .+.++. .-+.+-+.|....=|.-.+ T Consensus 1 m~~~~~q~sF~~GElsP~l~gR~Dl~~y------------------~~g~~~-----~~N~~~~p~Gg~~rRpGt~fva~ 57 (825) T protein:vir:73 1 MAFSWIQPSFAGGEIGPSLYGRIDMSKY------------------QVALRK-----CDNFIVRQYGGVENRPGTRFVGP 57 (825) T ss_pred CccceeccccccceechhhcccchHHHH------------------HHHHHH-----hcCcEEEecCCceecCchHHhHh Confidence 0111111122222111111111110000 000000 0011111111100000000 Q ss_pred ccCCC--------CcCCCcceEEEEecCCCCceee----EEeccceEEEEEecCCccc--cceecccccc---cccccch Q lcl|NC_019510. 138 LTNGG--------TFNDQKDALINVRGGQYGRTLN----VIFNEATRATIKLPSGTGT--TPPIEEQVAA---VDAQHIA 200 (799) Q Consensus 138 ~~~~~--------~~~~~~~~~~~v~~~~y~~ty~----v~~~~~~~~~~~tp~gt~~--~~~~~~~~a~---~~~~~i~ 200 (799) ..+.+ .++.. ..++-+-+..|=+-|. +..++...-...||-.... .+..++.... ....+.. T Consensus 58 ~~~~~~~~rLipF~fs~~-q~y~Lefg~~~lrv~~~gg~v~~~~~~~~e~~TPy~~~~l~~l~~~QsaD~~~i~h~~~pp 136 (825) T protein:vir:73 58 AKYPDRKCRLIPFQFSTV-QTYALEFGHNYMRVIKDGAYVLTTSNVIYELAMPYADTDLFRIKFTQSADVLTLVHPAYPP 136 (825) T ss_pred hcCCCCCEEEEEEEeCCC-cEEEEEEeCCeEEEEeCCceEeccCCceEEEecccchhhhhhheeeeecCEEEEEcCCCce Confidence 00000 00000 1111111111100000 0001111111112110000 0000000000 0011111 Q ss_pred hhhhhccccccccccceEEEECCcEEEEEecCCCceeEEEEEeccCcceeeEEeeeeccceeccccccCCeEEEEEecCC Q lcl|NC_019510. 201 EELAKQIRESLAGNPGWTINVGTGFVNIIAPDGDSIRGLQTKDGYADQLISPVTHYAQTFAKLPQNAPDGYTVKIVGDTS 280 (799) Q Consensus 201 ~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~~~~G~~v~i~~~~~ 280 (799) .+|. ..++.+|.........-...+-+.........++..+. ..++.....+ .+ .-.|..+.+..... T Consensus 137 ~~L~------r~~~~~W~l~~~~f~~gp~~~in~~~sv~v~asg~tg~--~TiTaS~a~~---~~-~~vG~~i~~~~~~v 204 (825) T protein:vir:73 137 KELR------RYAHDNWQIVDVTTKNGPFEDINVDETVKVYASASTGT--ITLTASSAIF---GA-EQVGKLFYLEQPAV 204 (825) T ss_pred eEEE------EecCCCcEEEEEeccCCccccccccccceeeecccCce--eEEEeecccc---Cc-hhcCeEEEEecccc Confidence 1111 12233444321100000000000000000000000000 0000000000 00 00122222211110 Q ss_pred CCcc------------------eeEE-EEecCc----------eEEEEeccccccceeeccceeeEEEeccCceEEEeec Q lcl|NC_019510. 281 RSAD------------------KYYV-RYNLTR----------KVWEETVGWNIQVGLNNGTMPWSLIRAADGQFDFVAN 331 (799) Q Consensus 281 ~~~~------------------~~y~-~~~~~~----------~~w~e~~~~~~~~~~~~~t~p~~lv~~~~~t~~~~~~ 331 (799) .... .+|. ....+. ..|....+.. ..........++...+.++++.. T Consensus 205 ~si~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~t~~~~a~~g~~~~~~~g~~----~~~~~~~~~~~~~~~g~~~it~~ 280 (825) T protein:vir:73 205 DSVPVWETSKTTAINDVRRADSNYYRANTSGKTGTLRPSHTEGMSWDGWGGTG----SDDTGIQWEYLHSGFGIAKITAV 280 (825) T ss_pred cccceeeeeeEEEeeeEEECCCceeeeecccccceeeccccCCceeEeeeeec----ccCCceEEEEEecCCceEEEeec Confidence 0000 0000 000000 0000000000 00000000111111111111110 Q ss_pred ccCccccCccccccCccccCCCeeEEEEEcceEEEecCCeEEEEccCCccccccccccCCCC-CccEEEE---EcCCcce Q lcl|NC_019510. 332 SWVGRTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLSGENIILSRTAKYFNMYPASVAVLSD-DDPIDVA---VSHNRVS 407 (799) Q Consensus 332 ~w~~~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~~~~v~~Sr~gd~~nF~~~t~~~~~D-dD~i~~~---~~~~~~~ 407 (799) .. +. ...+++=.+|.++......+ +...... .....+. T Consensus 281 ~~------------------------------------~~--~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~gyPs 322 (825) T protein:vir:73 281 AG------------------------------------DG--LTATADVVSFIPSQVVGSANASYKWAKYAWNSVNGYPS 322 (825) T ss_pred cc------------------------------------cc--eeeccccceecccccccCCCCCcccccCCcccCCCCcc Confidence 00 00 11222223332221111000 0001110 0111121 Q ss_pred eeeeeeecCCcEEEEec--CcEEEE-e--CC-ccccccc-------eEEEEEEeecccCCCCcEEeCCeEEEEecCCCce Q lcl|NC_019510. 408 ILKYAVPFSEELLLWAD--EAQFVL-N--AS-GVLSAKS-------VELNLTTEFDVNDGARPYGIGRGVYFASPRATFT 474 (799) Q Consensus 408 ~i~~~v~~~~~L~l~t~--~~e~~i-~--~~-~~lTP~~-------~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~ 474 (799) . +.-+...|+++.+ ..|+++ + |+ .-+.|++ +.+...+. -.+.++=++-.+.+++...++.+. T Consensus 323 ~---v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~~~~DdD~I~~~~s~~--~~~~i~~~~~~~~L~~~t~~~e~~ 397 (825) T protein:vir:73 323 T---VVYYQQRLYFAASTAYPQTIWASRTGDYKDFGKNNPIQDDDRIIYTYAGR--QVNEIRHLIDVGNLVALTSGGEYT 397 (825) T ss_pred E---EEEEcceEEEeecCCCCCEEEEEccCCccccccCCCCCCCccEEEEEcCC--cceeEEEEeecCcEEEEecCceEE Confidence 1 2233344555433 234443 2 21 0122222 22322222 112233333344566656666541 Q ss_pred eEEEEEeeccccCceehhhHHHHHHHhcCCCcEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCC---CceeeEeeEeeec Q lcl|NC_019510. 475 SINRYYAVQDVSAVKNAEDMTMHVPSYIPNGVFSISGSSTENFATVLTSGAKGKVFIYKFLYID---EQIQQQSWSHWDF 551 (799) Q Consensus 475 ~~~r~~~~~~~~d~~~~~dls~~~~h~~~g~~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~---~e~~v~aW~~w~~ 551 (799) + .+...+......+++-....+ |- ...-.. .-...++++.+.-.++.-|.|-+.. .-+++..=+.|-+ T Consensus 398 --l----~~~~~~~lTP~~~~~~~~s~~-g~-~~~~Pv-~vg~~~~Fv~~~g~~vre~~~~~~~d~~~~~dlt~~a~hl~ 468 (825) T protein:vir:73 398 --I----SGDQNKVLTPSAFSFSSQGNN-GS-SNVPPI-AVANIALFIQEKGSVVRDLAYSFDVDGYQGTDLTILANHLF 468 (825) T ss_pred --E----ecCCCcccceeeEEEEeeeee-cc-ccccce-EeCCeEEEEeCCCCeEEEEEEeeecCceeccchhhhhHhhc Confidence 1 122122222222211111111 10 000000 0113345555444455555552221 1122222334444 Q ss_pred CCC-eEEEEEEEeC-CEEEEEE----------EeCCC----------EEEEEE-----------------------EEEe Q lcl|NC_019510. 552 GDN-VTVLAANSIG-SHMHVIL----------QNGYD----------IFMGSI-----------------------SFTK 586 (799) Q Consensus 552 ~g~-~~~~~~~~~~-d~l~~~v----------~r~~~----------~~~~~~-----------------------~~~~ 586 (799) .|. +...+..... ..+|++. .|..+ +..+.+ ..++ T Consensus 469 ~~~~~~~~a~~~~p~~~~~~v~~dg~l~~~ty~~~q~v~aW~~~~~~g~v~~~~~i~~~~~D~l~~iV~R~~~g~~~~yi 548 (825) T protein:vir:73 469 QKHSIVDWSFCIVPYSSAFCIRDDGKLLVLTYLRDQQVFAWAPQSSAGKYESTCSISEGSEDAVYFVVNRTINGQTVRYI 548 (825) T ss_pred cCCceEEEEEcCCCceEEEEEecCCeEEEEEEeccccceeeEEEecCCcEEEEEEecCCCccEEEEEEEEeeCCceEEEE Confidence 442 2111111111 1122221 11111 000000 0112 Q ss_pred eccCCCccccceeeceeEEEEEccccccccccccccccccccCccccccceEEEEecCCccccccccccceecCceEEEc Q lcl|NC_019510. 587 KTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVDLNAVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIV 666 (799) Q Consensus 587 ~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~ 666 (799) |.++ .+.+.|....+++||+..+.+. .+.+++.||+|+++.+.+||.+ ..++.+|.++++ T Consensus 549 E~~~-----~~~~~~~~~~~~vD~g~~~~g~--------~~~~~l~~l~g~tv~~~~~g~~--~~~v~~g~itl~----- 608 (825) T protein:vir:73 549 ERLS-----SRLFTNDEDAFFVDCGLSYDGR--------NTSSRTMTISGGTGDWSYQVDY--PVTVSGGAYFVN----- 608 (825) T ss_pred EEec-----ccccCCCcceeEEEEEeeeccc--------ceeeceeeeCCceEEEEeCCeE--EEEEcCCeEEec----- Confidence 2221 1234456677888888776543 2346999999999999999987 455678877654 Q ss_pred cCCCCCeEEEeEeeeEEEEecCeeEEcCCCccceeeeecceEE-EEEEEEEeeccceEE-E--EecCCCccc---c--ee Q lcl|NC_019510. 667 GDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGRLQ-HRRAWLNYEQSGAFY-V--DVTNLGRSY---R--YT 737 (799) Q Consensus 667 ~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl~-l~~~~~~~~~t~~~~-~--~v~~~~~~~---~--~~ 737 (799) .+. .++|||+|......+++.... ......+|++ -..+.++....-... . .+...+... . .. T Consensus 609 --~~~-~~~i~l~~~~~~~~~~~~~~~------~~~~~i~~~~~~~~v~v~~~~~~~a~~~~~~~t~~~~a~~~~~gL~h 679 (825) T protein:vir:73 609 --TDV-GAQIQFPYTGTDPDTNEPVAK------ELRGDIISVTSNTAVVVRFNRNVPPVLRNVATTNWQMARQTFSGLAH 679 (825) T ss_pred --ccc-eEEEEecccCcccccccceec------eeeEEEccccCceEEEEEecccccceeeeecccCCCcchheeccccc Confidence 233 578999999988877764321 1111112211 111122221111100 0 011111000 0 01 Q ss_pred cccccc---Cc-ccccccccccceEEEEEeecccceEEEE-----EECCCCcEEEEEEEEEEEEeccccCC Q lcl|NC_019510. 738 MSGKPL---GD-TTLGQANLESGQFRFPLAGNAQYNRVVL-----TSDYTTPLSIIGCGWEGNYIRRSTGI 799 (799) Q Consensus 738 ~~~~~~---~~-~~~~~~~~~tg~~~vp~~~~~~~~~v~i-----~~~~P~P~tvl~i~~eg~y~~r~rrv 799 (799) ++|+.+ .+ ...++.-+..|.+++|... ..|.| .--.|||..+.+ .|.--.|.||| T Consensus 680 LeG~~v~v~~Dg~~~~~~~V~~G~vtl~~~~----~~v~vGl~y~~~~~~l~~~~~~---~g~~~g~~~ri 743 (825) T protein:vir:73 680 LEGQTVNILSDASVEPQKTVTGGAVTLESPG----AVVHIGLPITAEFETLDINING---QETLLDKKQVI 743 (825) T ss_pred cCCceEEEEECCeeeCCeEecCcEEEecCCc----eEEEEeeCccceEEecccccCC---CccccCccEEE Confidence 223222 11 1111222344677776311 11111 111456655532 35555688888 No 37 >protein:vir:95324 Length: 823 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512269;genbank:gi:89152436;genbank:GeneID:3952993 Probab=51.83 E-value=0.57 Score=21.92 Aligned_cols=630 Identities=11% Similarity=0.063 Sum_probs=192.5 Q ss_pred EEEEeC----CCeEEE--EEecCccccccCCcceeEEEEEcCEEEEEeCCeeeeeeccccCCCCc--CCCcceEEEEecC Q lcl|NC_019510. 86 IKVFDL----NGQEYA--VRGDKSYVQTANPRNSIRCVTVADYTFVVNRERVVQADGNLTNGGTF--NDQKDALINVRGG 157 (799) Q Consensus 86 irv~~~----~G~~~~--v~~~~~y~~t~~~~~~l~~~q~aD~~~l~~~~~~p~~l~~~~~~~~~--~~~~~~~~~v~~~ 157 (799) ||+.-. +|.++. +.+-.......++++.+ -+ +++++.-..++-.-+..-... ......++..... T Consensus 1 m~i~~~q~sF~~GElsP~l~gR~Dl~ry~~q~~~~-----~N--~~~~~~GGl~rRpGt~fva~~~~~~g~~rLipf~~s 73 (823) T protein:vir:95 1 MAISWIQPSFAGGEIGPSLYGRIDMAKYQVALRKC-----DN--FIVRQYGGVENRPGTRFVGAAKYPNRKCRLIPFQFS 73 (823) T ss_pred CcceeechhccCceechheeccchHHHHHHHHhhh-----hC--cEeeecCCceecCchhhhhhhcCCCCCeeEEEEEeC Confidence 887541 333332 11111111111221110 01 111111111111101100000 0111123322221 Q ss_pred CCCceeeEEeccceEEEEEecCCccccceecccccccccccchhhhhhccccccccccceEEEECCcEEEEEecCCCcee Q lcl|NC_019510. 158 QYGRTLNVIFNEATRATIKLPSGTGTTPPIEEQVAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAPDGDSIR 237 (799) Q Consensus 158 ~y~~ty~v~~~~~~~~~~~tp~gt~~~~~~~~~~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~~~~~~~ 237 (799) -+..|-+.+.+...--+. ++... .......-.+..-|.+. ......+...+.+++|.+++-... T Consensus 74 -~~q~y~Lefg~~~irV~~--~~g~v-v~~~~~~~ev~tPy~~~-----------~l~~Lr~~qsaD~~fivh~~~~p~- 137 (823) T protein:vir:95 74 -TVQTYALEFGHQYMRVIK--DGALV-LNSSNVIYEIATPYTEA-----------DLFRIKFTQSADVLTLVHPAYPPK- 137 (823) T ss_pred -CCcEEEEEEcCCeEEEEe--CCcEE-EecCCceeEEecccccc-----------cccceeEEEeccEEEEEcCCccce- Confidence 233444443322211110 00000 00000000000001111 111233344455555554432221 Q ss_pred EEEEEeccCcceeeEEeeeeccce--------eccccccCCeEEEEEecCCCCcc----eeEEEEe--cCceEEEEec-- Q lcl|NC_019510. 238 GLQTKDGYADQLISPVTHYAQTFA--------KLPQNAPDGYTVKIVGDTSRSAD----KYYVRYN--LTRKVWEETV-- 301 (799) Q Consensus 238 ~~~~~~g~~~~~~~~~~~~v~~~~--------~l~~~~~~G~~v~i~~~~~~~~~----~~y~~~~--~~~~~w~e~~-- 301 (799) .+. ..+..++.+..+......+. ...+....+.+........-..+ ..|++-. .....|.... T Consensus 138 ~L~-r~~~~~w~l~~~~~~~gp~~~~~~~~t~~v~~~~~~~~~t~ta~~~~~~~d~vg~~~~l~~~~~~~~~~~~~~~~~ 216 (823) T protein:vir:95 138 ELR-RYAHDNWQLVDVVTKNGPFEDINIDESLTVYASASTGTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKST 216 (823) T ss_pred EEE-ecCCCCceEEEEEEeccccccccccceeEEeccccCceeEEeecccccchhhccceEEEeccccceeeecceeeee Confidence 111 11111111111111000000 00011111111000000000000 0011000 0001111000 Q ss_pred cccccceeeccceeeEEEeccCceEEEeec---ccCc-cccCccccccCccccCCCeeEEEEEcceEEEecCC----eEE Q lcl|NC_019510. 302 GWNIQVGLNNGTMPWSLIRAADGQFDFVAN---SWVG-RTAGDDDTNPHPSFVGQAITDVFFYRNRLGMLSGE----NII 373 (799) Q Consensus 302 ~~~~~~~~~~~t~p~~lv~~~~~t~~~~~~---~w~~-~~~gd~~~np~psf~~~~p~~v~f~q~RL~f~~~~----~v~ 373 (799) ..+..... ....+........++...... .|.. ..+++++..-.+ |.+..+.. +.. T Consensus 217 ~~~~~~~~-~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~g~~~~t~v 280 (823) T protein:vir:95 217 SIGDIRRA-DSNYYRAVTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIEW---------------EYLHSGFGIARITAV 280 (823) T ss_pred cccceEEe-cccceeeeeccccceeecccCCcceEEeceecccccceeEE---------------EEEeCCcceEEEEee Confidence 00000000 000000000000011000000 0000 000011111001 11111100 000 Q ss_pred EEccCC--ccccccccccCCCCCccEEE-----EEcCCcceeeeeeeecCCcEEEEecC--cEEEEeC---C-------c Q lcl|NC_019510. 374 LSRTAK--YFNMYPASVAVLSDDDPIDV-----AVSHNRVSILKYAVPFSEELLLWADE--AQFVLNA---S-------G 434 (799) Q Consensus 374 ~Sr~gd--~~nF~~~t~~~~~DdD~i~~-----~~~~~~~~~i~~~v~~~~~L~l~t~~--~e~~i~~---~-------~ 434 (799) -+.+++ .--|.++.... ..+..... ......+ ..+.-+...|+++... -|+++-+ + . T Consensus 281 ~~~~~~~~~~~~~~~~~~~-~~~~t~~~~~~~~~~~~g~P---s~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~ 356 (823) T protein:vir:95 281 NGTTATAEVISYIPSQVVG-EDNASYKWAKYAWNSVNGYP---GTVVYYQQRLYFAASTAFPQTIWASRTGDYKDFGKSN 356 (823) T ss_pred cceeeeceEeeeecccccc-CCcCCccccccccCcCCCCc---cEEEEEeceEEEEEcCCCCcEEEEeccCCcccccccc Confidence 111111 11121121111 11111111 1111112 1223344456655442 2444422 1 1 Q ss_pred cccccc-eEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccCceehh--hHHHHHHHhcCCCcEEEEE Q lcl|NC_019510. 435 VLSAKS-VELNLTTEFDVNDGARPYGIGRGVYFASPRATFTSINRYYAVQDVSAVKNAE--DMTMHVPSYIPNGVFSISG 511 (799) Q Consensus 435 ~lTP~~-~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~~~~~--dls~~~~h~~~g~~~~~~a 511 (799) +++... +.+...+. -.+.++=++-.+.++....++.+. ..+...+..... .+..+..+ | ....-. T Consensus 357 ~~~DdD~I~~~~s~~--~~~~i~~~v~~~~Lli~t~~~e~~------l~~~~~~~lTP~~~~~~~~s~~---g-~~~~~P 424 (823) T protein:vir:95 357 PTQDDDRIIYTYAGR--QVNEIRHLIDVGSLVALTSGGEYV------ITGDQNKVLTPSSFAFSSQGSN---G-SSNVPP 424 (823) T ss_pred CCCCCCcEEEEEcCC--cceEEEEEeecCcEEEEecCcEEE------EEcCCCcccceeeEEEEEeecc---c-cccccc Confidence 222211 22222222 122343344445666667666531 112222222222 22222111 1 000000 Q ss_pred cCCCCcEEEEEEcCCCeEEEEEeeeCCC---ceeeEeeEeeecCCC-eEEEEEEEeC-CEEEEEE----------EeCCC Q lcl|NC_019510. 512 SSTENFATVLTSGAKGKVFIYKFLYIDE---QIQQQSWSHWDFGDN-VTVLAANSIG-SHMHVIL----------QNGYD 576 (799) Q Consensus 512 ~~~~~~~~v~~~~~dg~l~~~tyl~~~~---e~~v~aW~~w~~~g~-~~~~~~~~~~-d~l~~~v----------~r~~~ 576 (799) . .-...++++.+.-..+.-|.|-+... -.++.-=+.|-+.|. +..+|..... ..+|++. .|..+ T Consensus 425 v-~vg~~~~Fv~~~g~~vre~~~~~~~d~~~~~dlT~~a~hl~~~~~i~~~a~~~~p~~~~~~v~~dG~l~~~ty~~~q~ 503 (823) T protein:vir:95 425 I-AVANIALFVQEKGSVVRDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDGKLLVMTYLRDQQ 503 (823) T ss_pred e-EeCCeEEEEecCCCEEEEEEEeeecCceecchhhhhhhhhcCCCceEEEEEecCCCeEEEEEecCCcEEEEEEecccc Confidence 0 11234455555444555555522211 112222224444432 1112211111 1122111 11110 Q ss_pred ----------EEEEEE-----------------------EEEeeccCCCccccceeeceeEEEEEccccccccccccc-- Q lcl|NC_019510. 577 ----------IFMGSI-----------------------SFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETT-- 621 (799) Q Consensus 577 ----------~~~~~~-----------------------~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~-- 621 (799) +..+.+ ..++|.++.. .+.+....+++||+..+.+.+... T Consensus 504 v~aW~~~~~~g~~~~~~~i~~~~~d~l~~~v~R~i~g~~~~yiE~~~~~-----~~~~~~~~~~lD~~~s~~g~~~~~~~ 578 (823) T protein:vir:95 504 VFAWAPQSSTGKYESTCSISEGNEDAVYFVVNRTVNGQTVRYIERLSSR-----LFTSDEDAFFVDSGLSYDGRNTSDRT 578 (823) T ss_pred eeeeEEEecCCcEEEEEEecCCCCCEEEEEEEeccCCeEEEEEEeeccc-----cCCCccceeEEEEEEEeecCccccee Confidence 000000 0112222111 112234578899988877654432 Q ss_pred cccccccCccccccceEEEEecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCcccee Q lcl|NC_019510. 622 VDLNAVFGGMRWQVGKILVSDEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFS 701 (799) Q Consensus 622 ~~~~~~~~g~~~~~g~~v~~~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~ 701 (799) ..+....+.+.|++|+++.+ +||.+.+..++ +|.+++ +++++.+++|++|++.++++++++.++ ....++ T Consensus 579 ~~l~~g~~~l~~l~g~~v~~-adg~~~~~~~v-~g~i~l-------~~~~~~~~vGl~~~~~i~~~~~~v~~~-~a~~~~ 648 (823) T protein:vir:95 579 MTITGGSGEWDYLAEYTISV-SGGAYFTSSDV-GAQLQF-------PYTGADPDTGYEVSKELRCDIISVTSN-TAVVVR 648 (823) T ss_pred eEecCCCCcccccCceEEEe-cCcceECCccc-eeEEEe-------CcCCCccccccceEEEEEEeeceeeCC-ceEEEc Confidence 23334445688999999876 89999888877 565544 478899999999999999999888653 223332 Q ss_pred eeecceEEEEEEEEEeeccceEEEEecCCCcccceeccccccCccccc----ccccccceEEEEEeecccceEEEE--E- Q lcl|NC_019510. 702 TEDVGRLQHRRAWLNYEQSGAFYVDVTNLGRSYRYTMSGKPLGDTTLG----QANLESGQFRFPLAGNAQYNRVVL--T- 774 (799) Q Consensus 702 ~~~~~rl~l~~~~~~~~~t~~~~~~v~~~~~~~~~~~~~~~~~~~~~~----~~~~~tg~~~vp~~~~~~~~~v~i--~- 774 (799) .. ++....++...+.. ..+....-.-=-.++++.+.-...| ..-+..|.+++|... ....+-| + T Consensus 649 ~~-----r~v~a~l~~~~t~~--~~~~~~~~~gL~hleg~tv~v~~dg~~~~~~~v~~G~vtl~~~~--~~v~vGl~~~~ 719 (823) T protein:vir:95 649 AN-----RNVPPSLRNVATTN--WQMARRTFGGLSHLEGQTVNILSDANVEPQKVVSGGAVTLESPG--AVVHIGLPITA 719 (823) T ss_pred cC-----Ccccceeeeeeccc--cccccceeeeccccccceEEEEEcCeeeCCeEecCCEEEecCCC--CEEEEeeccee Confidence 11 12222222222211 1111000000001333333211111 122335777766422 2222221 1 Q ss_pred ECCCCcEEEEEEEEEEEEeccccCC Q lcl|NC_019510. 775 SDYTTPLSIIGCGWEGNYIRRSTGI 799 (799) Q Consensus 775 ~~~P~P~tvl~i~~eg~y~~r~rrv 799 (799) --.|||+.+. +.|..--|.||| T Consensus 720 ~~~~l~~~~~---~~g~~~g~~~ri 741 (823) T protein:vir:95 720 EFETLDININ---GQETLLDKKQVI 741 (823) T ss_pred eEEecchhcC---CCcccCCceeEE Confidence 1156666544 357777888888 No 38 >protein:vir:105525 Length: 472 # NCBI annotation: phage DNA stabilization protein # Family: family:all:1540 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516194;genbank:gi:89885997;genbank:GeneID:3964385 Probab=43.97 E-value=0.82 Score=21.04 Aligned_cols=426 Identities=11% Similarity=0.070 Sum_probs=156.4 Q ss_pred ccccccccchhhhhhccccccccccceEEEECCcEEEEEecCCCceeEEEEEeccCcceeeEEeeeeccceecccc---- Q lcl|NC_019510. 191 VAAVDAQHIAEELAKQIRESLAGNPGWTINVGTGFVNIIAPDGDSIRGLQTKDGYADQLISPVTHYAQTFAKLPQN---- 266 (799) Q Consensus 191 ~a~~~~~~i~~~l~~~~~~~~a~~~~~t~~~~g~~i~i~a~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~l~~~---- 266 (799) ..+. +++ |...+..+. ...++.-.. + +..++.-....+......+..+ . ..+.++++. T Consensus 1 m~~~---q~p--l~~g~~~~~-~~~~~~~~l-p--vN~y~~p~~~~~ss~~lr~~PG--------~-~~~~~~~g~~RG~ 62 (472) T protein:vir:10 1 MAIM---QLP--LLRGLGKAR-DDADYIDAL-P--VNMLATPKPVLNASGYLRSFPG--------I-THKAEVAGVSRGV 62 (472) T ss_pred CCce---eee--cccccccCc-cccCceeee-e--eeeeeccccccccceeecccCC--------c-eeecCCCccccee Confidence 1110 000 111000000 000000000 0 0011100000000000000000 0 011111111 Q ss_pred ---ccCCeEEEEEecCCCCcceeEEEEecCceEEEEeccccccceeeccceeeEEEeccCceEEE-----eecccCcccc Q lcl|NC_019510. 267 ---APDGYTVKIVGDTSRSADKYYVRYNLTRKVWEETVGWNIQVGLNNGTMPWSLIRAADGQFDF-----VANSWVGRTA 338 (799) Q Consensus 267 ---~~~G~~v~i~~~~~~~~~~~y~~~~~~~~~w~e~~~~~~~~~~~~~t~p~~lv~~~~~t~~~-----~~~~w~~~~~ 338 (799) ..+|..-.|.+.+ -|.+ ...|-+.++-+...=.++...+..++.-...-|.. +..+|....- T Consensus 63 ~~~~~~~~lY~V~G~~-----Ly~v-----~~~vG~iagsg~VsMa~~~~~q~v~v~g~~~~y~y~g~~~t~~~~~~~~~ 132 (472) T protein:vir:10 63 QYNTHEKTVYRGLGNQ-----LYKG-----HKPIADLAGKGRISMAFSRNSQAVVAAGKMTLYRYDGTVKTLENWPKEKK 132 (472) T ss_pred EeeeeCCeEEEEecce-----EEEE-----EeeeeeecccccEEEEecCCceEEEEecceeEEEeccchhhhhhcccccc Confidence 1122222222211 1211 11233333322221112223333333332211222 1122222110 Q ss_pred CccccccCccccCCCeeEEEEEcceEEEecC--CeEEEEccCCccccccccccCCCCCccEE-EEEcCCcceeeeeeeec Q lcl|NC_019510. 339 GDDDTNPHPSFVGQAITDVFFYRNRLGMLSG--ENIILSRTAKYFNMYPASVAVLSDDDPID-VAVSHNRVSILKYAVPF 415 (799) Q Consensus 339 gd~~~np~psf~~~~p~~v~f~q~RL~f~~~--~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~-~~~~~~~~~~i~~~v~~ 415 (799) .+.+.| ..+..|+|..+|++|..+ +.++.|...|-.- .|+++ ++.+.++++.|.-++.. T Consensus 133 -----it~~dl--~~~~~v~~~dGyfV~~~~gt~~~~iS~L~d~s~-----------~~~~~~FatAE~~pD~Ivgi~~~ 194 (472) T protein:vir:10 133 -----YTQYDI--GNVRDMCHLRGRYVWCKDGSDIFGVTDLEDESH-----------PDRYRALYRAESQPDGIIGIDSW 194 (472) T ss_pred -----CCcccc--CCceeEEEeCceEEEeecCCceEEEeecCCccc-----------CCcccceeeecCCCCceEEEEee Confidence 011111 247789999999988864 3344676554221 24555 56677788888889999 Q ss_pred CCcEEEEecCcE--EEEeCCcccc--c--cceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccCce Q lcl|NC_019510. 416 SEELLLWADEAQ--FVLNASGVLS--A--KSVELNLTTEFDVNDGARPYGIGRGVYFASPRATFTSINRYYAVQDVSAVK 489 (799) Q Consensus 416 ~~~L~l~t~~~e--~~i~~~~~lT--P--~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~~ 489 (799) .+.|++|.+..- |..+|+..++ . .....-.+ .+|++.-.=..+|++++|+.........+..+ ++| T Consensus 195 ~~~i~lfG~~TiEvw~ntG~a~fpf~r~~~~pg~~iq--~Gcaa~~sv~~~~~s~~~l~~d~~g~~~V~~~------~g~ 266 (472) T protein:vir:10 195 RDFIVCFGASTIEYFSLTGAADGQSAIYAAQPALMVE--KGIAGTHCKTRLGDAHVIISHQATGAPSVFLI------NQA 266 (472) T ss_pred ccEEEEEeccceEEEEecCCCCcceeeeccCccceee--ecccCchhhhhhCceEEEEecCCCcceEEEEc------cCc Confidence 999999977665 7777764222 1 12233344 36776655579999999999885433334332 456 Q ss_pred ehhhH-HHHHHHhcCCC----c--EEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEeeEeeecC-CCeEEEEEE Q lcl|NC_019510. 490 NAEDM-TMHVPSYIPNG----V--FSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQSWSHWDFG-DNVTVLAAN 561 (799) Q Consensus 490 ~~~dl-s~~~~h~~~g~----~--~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~w~~~-g~~~~~~~~ 561 (799) +++=+ |.-++..|+.- + ..+.+++.+.+.+.....-+. -+||. +-..+++.++.|.+-.+. ......|++ T Consensus 267 q~~rIST~aIE~~i~~y~~~e~~dA~~~s~~~eGH~fy~LtfP~~-Tw~yD-~at~~~~~~w~~~~~g~~~~~~Ra~~~~ 344 (472) T protein:vir:10 267 QATSIATATIEKILRSYTHDELASAVMETVRFDSHELVLIHLSRQ-VLCYD-AAANQNGLQWSLLKTGFYHAPYRGIDFM 344 (472) T ss_pred eEEEecCHHHHHHHHhCCcccccceeEEEEEeCCeEEEEEEcCCe-eEEEe-ccCCccceeeeeeecCCccCceEEEEEE Confidence 66666 33444455433 1 224566666766655555554 33333 112233444333322221 233333333 Q ss_pred Ee----------CCEEEEEEEe---CCCEEEEEEEEEeeccCCCccccceeeceeEEEEEcccccccccccccccccccc Q lcl|NC_019510. 562 SI----------GSHMHVILQN---GYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVDLNAVF 628 (799) Q Consensus 562 ~~----------~d~l~~~v~r---~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~ 628 (799) +- +..||.+--. ..+-.++++.... .+.. +..|++ |.. ..+.- T Consensus 345 ~~~g~~~vGD~~ng~l~~ld~~~~td~g~pi~~~~~tp-~~~~--~n~Rvf-d~e--l~~~t------------------ 400 (472) T protein:vir:10 345 FADHHLTCGDKNDSLLGQLDFASSAQYEKPQEHVLYTP-LFKA--DNARVF-DFE--LEAST------------------ 400 (472) T ss_pred EeCCeEEEEEcCCCeEEEEcCcCcCCCCceeEEEeecc-ceec--CCCeEE-EEE--EEeeC------------------ Confidence 32 2345553211 1222233322211 1110 111211 211 11111 Q ss_pred CccccccceE-EEEecCCcccc-cc-ccccceecCceEEEccCCCC-CeEEEeEeeeEEEEecCeeEEcCCCccceeeee Q lcl|NC_019510. 629 GGMRWQVGKI-LVSDEVGEVRQ-YE-PPAGGWASDPTLRIVGDMAG-KRVFIGFAYEFRYEFSKFLIKKQDESGGFSTED 704 (799) Q Consensus 629 ~g~~~~~g~~-v~~~adG~~~~-~~-~v~~g~~~~~~~~i~~~~~~-~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~ 704 (799) |.....++. +....||...- .. +-..|.+.. ..++-..+-+ ..=-|| |..+|...-|.... T Consensus 401 -Gvg~~~~~v~L~wSddg~~~~~~~~~~~~g~~~~-~~r~~w~RlG~ar~~vg--f~~rv~~s~pv~~~----------- 465 (472) T protein:vir:10 401 -GVAHIADRLFLSATADGLHFGREQMINQNAPFAY-DRRILWRRMGRVRKNLG--FKVRVITSSPVTLS----------- 465 (472) T ss_pred -CcCccCceEEEEEeccccccchhHHHhhcCccch-hheeeeheeeccccccc--eEEEEEEecccccc----------- Confidence 111222221 11123332221 11 111121110 0010000000 000123 44556655553321 Q ss_pred cceEEEEEEEEE Q lcl|NC_019510. 705 VGRLQHRRAWLN 716 (799) Q Consensus 705 ~~rl~l~~~~~~ 716 (799) +..+++. T Consensus 466 -----~~~a~~e 472 (472) T protein:vir:10 466 -----GCQIRME 472 (472) T ss_pred -----cceeeeC Confidence 1112222 No 39 >protein:vir:2109 Length: 472 # NCBI annotation: head completion protein # Family: family:all:1540 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059633;genbank:gi:9635541;genbank:GeneID:1262840 Probab=41.24 E-value=0.94 Score=20.74 Aligned_cols=422 Identities=10% Similarity=0.042 Sum_probs=149.0 Q ss_pred ccceEEEECCcEEEEEecCCCceeEEEEEeccCcceeeEEeeee-ccceeccccccCCeEEEE------EecCCCC-cce Q lcl|NC_019510. 214 NPGWTINVGTGFVNIIAPDGDSIRGLQTKDGYADQLISPVTHYA-QTFAKLPQNAPDGYTVKI------VGDTSRS-ADK 285 (799) Q Consensus 214 ~~~~t~~~~g~~i~i~a~~~~~~~~~~~~~g~~~~~~~~~~~~v-~~~~~l~~~~~~G~~v~i------~~~~~~~-~~~ 285 (799) .+---+-...+- .++....+..+... .-+.+....+ .+...| .+..|.+.+. ++.-..+ .+. T Consensus 1 m~~~q~Pl~~g~-------~~~~~~~d~~~~~p-VN~~a~~~~~~~s~~~l--r~tPG~~~~~~~~g~~RG~~~~t~~~~ 70 (472) T protein:vir:21 1 MPIQQLPMMKGM-------GKDFKNADYIDYLP-VNMLATPKEILNSSGYL--RSFPGITKRYDMNGVSRGVEYNTAQNA 70 (472) T ss_pred CceEEeeccccc-------cccccccceeeeee-eeeeeeccCCcccceee--eecCCcceeccCCCceeeeeecccCCe Confidence 111111111110 01111111111000 0001111000 000111 1223322221 1111111 111 Q ss_pred eEEEEecCceEEEEec------cccccceeeccceeeEEEeccCceEEEeecccCccccCccccccCc-cccC---CCee Q lcl|NC_019510. 286 YYVRYNLTRKVWEETV------GWNIQVGLNNGTMPWSLIRAADGQFDFVANSWVGRTAGDDDTNPHP-SFVG---QAIT 355 (799) Q Consensus 286 ~y~~~~~~~~~w~e~~------~~~~~~~~~~~t~p~~lv~~~~~t~~~~~~~w~~~~~gd~~~np~p-sf~~---~~p~ 355 (799) .|.. .++.-|+-.+ +-+...=.++++...........-|++..-.+... ++|.. .|.+ .-+. T Consensus 71 ly~V--~G~~LY~v~~~~G~i~gsgrVsMa~n~~~~~v~~~~~~~~Y~~~~~~~t~~------~~~~d~~f~~~dl~~~~ 142 (472) T protein:vir:21 71 VYRV--CGGKLYKGESEVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVS------NWPADSGFTQYELGSVR 142 (472) T ss_pred EEEE--eCCceEEEeeeeeeecccccEEEeeCCeEEEEEECCceeEEEEecchhhhh------cccCcccccccccccee Confidence 1221 1222232111 11110001122221111111111122222111111 11211 1221 2345 Q ss_pred EEEEEcceEEEecCC--eEEEEccCCccccccccccCCCCCccE-EEEEcCCcceeeeeeeecCCcEEEEecCcE--EEE Q lcl|NC_019510. 356 DVFFYRNRLGMLSGE--NIILSRTAKYFNMYPASVAVLSDDDPI-DVAVSHNRVSILKYAVPFSEELLLWADEAQ--FVL 430 (799) Q Consensus 356 ~v~f~q~RL~f~~~~--~v~~Sr~gd~~nF~~~t~~~~~DdD~i-~~~~~~~~~~~i~~~v~~~~~L~l~t~~~e--~~i 430 (799) .|+|+..|++|..+. ..+-|-.-|-+.. |.. .++-+.++++.|.-++...+.|++|.+..- |.. T Consensus 143 dv~f~dGyfV~~~~gt~~f~is~l~d~~~~-----------~~y~~FatAE~~pD~Iv~i~~~~~~l~lfG~~TiEvw~n 211 (472) T protein:vir:21 143 DITRLRGRYAWSKDGTDSWFITDLEDESHP-----------DRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSL 211 (472) T ss_pred EEEEecceEEEccCCcceeEEecCCCCccc-----------cCCccceeeccCCCceEEEEeeccEEEEEeccceEEEEe Confidence 799999999988643 3344544442221 111 145566778888888999999999977665 777 Q ss_pred eCCccccccceEEEEEE----eecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccCceehhhHH-HHHHHhcCCC Q lcl|NC_019510. 431 NASGVLSAKSVELNLTT----EFDVNDGARPYGIGRGVYFASPRATFTSINRYYAVQDVSAVKNAEDMT-MHVPSYIPNG 505 (799) Q Consensus 431 ~~~~~lTP~~~~~~~~s----~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dls-~~~~h~~~g~ 505 (799) +|+. ++..+-+.+++ ..+|++.-.=..+|++++|+...+..-..+.. .++|+++=+| .-++..|+.. T Consensus 212 tG~a--d~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~s~~~l~~~~~g~~~V~~------~~g~qa~rIST~aIE~~i~~y 283 (472) T protein:vir:21 212 TGAT--TAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGAPSVYI------IGSGQASPIATASIEKIIRSY 283 (472) T ss_pred cCCC--CcCcCceEEcCcceeeecccCcchhhecCceEEEEecCCCcccEEEE------ccCceeEEecCHHHHHHHHhc Confidence 7752 22233344432 34777665568999999999998753323332 2456666663 3344444432 Q ss_pred c------EEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEeeEeeecCC---CeEEEEEEEeCCEEEEEEEeCCC Q lcl|NC_019510. 506 V------FSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQSWSHWDFGD---NVTVLAANSIGSHMHVILQNGYD 576 (799) Q Consensus 506 ~------~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~w~~~g---~~~~~~~~~~~d~l~~~v~r~~~ 576 (799) . -.+.+++.+.+.+.....-+. -+||. +-..+..+ -||.-+..+ ....+++++-+.+ +++=.. .+ T Consensus 284 ~~~e~~~A~~~t~~~eGH~fy~LtfP~~-Tw~yD-~at~~~~e--~W~~~~sg~~~~~~R~~~~~~~~g~-~ivGD~-~n 357 (472) T protein:vir:21 284 TAEEMATGVMETLRFDSHELLIIHLPRH-VLVYD-ASSSQNGP--QWCVLKTGLYDDVYRGVDFMYEGNQ-ITCGDK-SE 357 (472) T ss_pred CCccccceEEEEEEeCCeEEEEEEcCCe-eEEEE-cccCccCc--eeeeeccCCCcCceeEEEEEeeCCe-EEEEEc-CC Confidence 1 224456666665555555543 33333 11122222 277776642 2344444432211 111111 11 Q ss_pred EEEEEEEEEeeccCCCcccc-----ceeeceeEEEEEccccccccccccccccccccCccccccceEEEE--ecCCcccc Q lcl|NC_019510. 577 IFMGSISFTKKTLDFGNEPY-----RLYMDAKTRYDIPANAFNNDRYETTVDLNAVFGGMRWQVGKILVS--DEVGEVRQ 649 (799) Q Consensus 577 ~~~~~~~~~~~~~~~~~~~~-----~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v~~--~adG~~~~ 649 (799) +.+..+.+......-...+. ....|.+.-+...-+. ..|....+- .+.+ ..||.... T Consensus 358 G~ly~L~fd~~~~~d~~~~~~r~~p~~~~dn~R~fd~eve~---------------~~Gv~q~~d-~v~L~wSddG~~~~ 421 (472) T protein:vir:21 358 AVVGQLQFDISSQYDKQQEHLLFTPLFKADNARCFDLEVES---------------STGVAQYAD-RLFLSATTDGINYG 421 (472) T ss_pred CeEEEEEecccccCCCcCcEEEEccceeCCCCEEEEEeeec---------------cCCCCCcCc-EEEEEeeccccccc Confidence 22222221111100000000 0011111111100000 001111110 1221 23333221 Q ss_pred cc-ccccceecCceEEEccCCCCCeEEEe-----EeeeEEEEecCeeEEcCCCccceeeeecceEEEEEEEEEee Q lcl|NC_019510. 650 YE-PPAGGWASDPTLRIVGDMAGKRVFIG-----FAYEFRYEFSKFLIKKQDESGGFSTEDVGRLQHRRAWLNYE 718 (799) Q Consensus 650 ~~-~v~~g~~~~~~~~i~~~~~~~~v~vG-----l~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl~l~~~~~~~~ 718 (799) .. .+.-|.+....-++.. =.+| +.|+.++.. ..++.|+.+++++. T Consensus 422 ~~~~~~~g~~g~~~tr~~~------~RlG~~r~~v~f~~r~~~------------------~~~~~l~g~~~~~E 472 (472) T protein:vir:21 422 REQMIEQNEPFVYDKRVLW------KRVGRIRRLIGFKLRVIT------------------KSPVTLSGCQIRLE 472 (472) T ss_pred cceeeccCCccchhcceee------eeeeecccceeEEEEEEe------------------cCcceeeeeEEeeC Confidence 11 1111111110000000 0012 112222221 23455666666666 No 40 >protein:vir:3529 Length: 477 # NCBI annotation: P28 # Family: family:all:1540 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050989;genbank:gi:9633575;genbank:GeneID:1262322 Probab=32.00 E-value=1.5 Score=19.69 Aligned_cols=429 Identities=13% Similarity=0.100 Sum_probs=143.9 Q ss_pred ccccccccccceEEEECCcEEEEEecCCCceeEEEEEeccCcceeeEE-eeeeccceeccccccCCeEEEEEecCCCCcc Q lcl|NC_019510. 206 QIRESLAGNPGWTINVGTGFVNIIAPDGDSIRGLQTKDGYADQLISPV-THYAQTFAKLPQNAPDGYTVKIVGDTSRSAD 284 (799) Q Consensus 206 ~~~~~~a~~~~~t~~~~g~~i~i~a~~~~~~~~~~~~~g~~~~~~~~~-~~~v~~~~~l~~~~~~G~~v~i~~~~~~~~~ 284 (799) -+.+.. .+-..+-...+-.. ...++.....+-.. +.+. +..-.+...|+. ..|...+..+ ++-..- T Consensus 1 ~~~~~~--m~~~~ipl~~g~~~-~~~~~d~~~~~PVN-------~~a~p~~~~~s~~~L~~--~pG~~~~~~~-~G~~RG 67 (477) T protein:vir:35 1 MLSEVF--MPKIQIPLAKGLVK-DIKTADYIDALPVN-------MLATPKEVLNASGYLRS--FPGIEKKQDA-KGVSRG 67 (477) T ss_pred Ccccce--eeeecccccccccc-ccccccceeeeeec-------cceeecccccccccccc--CCcceeeccC-Cccccc Confidence 111100 00000000000000 00000000011000 0000 000011111111 1122111110 000000 Q ss_pred eeEEEE------ecCceEEE---E---eccccccceeeccceeeEEEec-cCceEEEeecccCccccCccccccCccccC Q lcl|NC_019510. 285 KYYVRY------NLTRKVWE---E---TVGWNIQVGLNNGTMPWSLIRA-ADGQFDFVANSWVGRTAGDDDTNPHPSFVG 351 (799) Q Consensus 285 ~~y~~~------~~~~~~w~---e---~~~~~~~~~~~~~t~p~~lv~~-~~~t~~~~~~~w~~~~~gd~~~np~psf~~ 351 (799) -+|... ..++..|+ | .++-+...=.++++.- .++.. ...-|..+.-.++.+. ...+..|+|.. T Consensus 68 ~~~~~~~g~lY~V~G~~LY~v~~~vG~I~gsg~VsMa~n~~~~-aIv~~g~~~gy~y~~t~~~~~~---~~~~~~p~~~l 143 (477) T protein:vir:35 68 VHFNTKNNALYRVCGNTLYRNDKEVADIAGMSRVSMSHSSHSQ-AICFEGKVKLYRYDGTEKALSN---WPKDKYPQYDL 143 (477) T ss_pred eeEeecCCeEEEEecCeeEeeeeeeeeecccccEEEeeCCcEE-EEEECCcceeEEEecccceeee---cCccccCCccc Confidence 001000 11222222 1 1111111001111111 11111 1111222221222111 11123566666 Q ss_pred CCeeEEEEEcceEEEecC--CeEEEEccCCccccccccccCCCCCccEE-EEEcCCcceeeeeeeecCCcEEEEecCcE- Q lcl|NC_019510. 352 QAITDVFFYRNRLGMLSG--ENIILSRTAKYFNMYPASVAVLSDDDPID-VAVSHNRVSILKYAVPFSEELLLWADEAQ- 427 (799) Q Consensus 352 ~~p~~v~f~q~RL~f~~~--~~v~~Sr~gd~~nF~~~t~~~~~DdD~i~-~~~~~~~~~~i~~~v~~~~~L~l~t~~~e- 427 (799) +-+..|+|..+|+++.-+ +.++.|-.-|-.- -|+++ +..+.++++.|.-++...+.|++|.+..- T Consensus 144 ~~~~~v~f~dGyfV~~~~gt~~~~iS~L~d~s~-----------~d~~~~FasAE~~pD~Ivgi~~~~~~i~lfG~~TiE 212 (477) T protein:vir:35 144 GEVIDVCRNRGRYIWLQKGGERFGVTDLEDESK-----------PDRYQPFYRAESQPDGIVSVDAWRDLIVCFGSSSIE 212 (477) T ss_pred cceeEEEeeCceEEEeecCCCeEEEeecCCccc-----------cccccccccccCCCCceEEEEeeccEEEEEeccceE Confidence 677899999999988764 3455574443222 36666 66677788888889999999999977665 Q ss_pred -EEEeCCcccc-c---cceEEEEEEeecccCCCCcEEeCCeEEEEecCCCceeEEEEEeeccccCceehhhH-HHHHHHh Q lcl|NC_019510. 428 -FVLNASGVLS-A---KSVELNLTTEFDVNDGARPYGIGRGVYFASPRATFTSINRYYAVQDVSAVKNAEDM-TMHVPSY 501 (799) Q Consensus 428 -~~i~~~~~lT-P---~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~~~~~dl-s~~~~h~ 501 (799) |..+|+..++ | .+.....+ .+|++.=.=..+|++++|+.........+..+ ++|+++=+ |.-++.. T Consensus 213 vw~ntG~a~f~~p~~r~~~~~mIq--~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~~------~g~q~~rIST~aIE~~ 284 (477) T protein:vir:35 213 YFTLTGSADTSQPLYIHQAAYMIQ--AGIAGRDCKCRYQDKYAILSHQSTGQPAVYLI------GAGEKNKISTATIDKI 284 (477) T ss_pred EEEecCCCCCCcceeecCCceeee--ecccCchhhhhhCceEEEEecCCCcccEEEEc------cCceeEEecCHHHHHH Confidence 7778865554 2 11222233 47776655579999999999875432333322 45666666 3334444 Q ss_pred cCCC------cEEEEEcCCCCcEEEEEEcCCCeEEEEEeeeCCCceeeEeeEeeecC---CCeEEEEEEEeC-------- Q lcl|NC_019510. 502 IPNG------VFSISGSSTENFATVLTSGAKGKVFIYKFLYIDEQIQQQSWSHWDFG---DNVTVLAANSIG-------- 564 (799) Q Consensus 502 ~~g~------~~~~~a~~~~~~~~v~~~~~dg~l~~~tyl~~~~e~~v~aW~~w~~~---g~~~~~~~~~~~-------- 564 (799) |+.- ...+++++.+.+.+.....-+ .-+||. ..-++..-.|+--... ......|+++-+ T Consensus 285 i~ay~~~e~a~af~~t~~~eGH~fy~LtfP~-~Tw~yD---~at~~w~e~W~~~~~g~~~~~~Ra~~~~~~~g~~~vGD~ 360 (477) T protein:vir:35 285 IRYYSADELAASFMESIRFDNHELLLLHLPK-HTLCFD---GSASHQYSQWSLLKSGFYDEPYRAIDFMFFDNQITVGDK 360 (477) T ss_pred HHhcCCcchhceeEEEEEeCCeeEEEEEcCC-ceEEEe---cccccccceeeeeccCCccCceEEEEEEEeCCeEEEEEc Confidence 4321 122445556665554444444 222322 1112111124332221 233333333322 Q ss_pred --CEEEEEE---EeCCCEEEEEEEEEeeccCCCccccceeeceeEEEEEccccccccccccccccccccCccccccceEE Q lcl|NC_019510. 565 --SHMHVIL---QNGYDIFMGSISFTKKTLDFGNEPYRLYMDAKTRYDIPANAFNNDRYETTVDLNAVFGGMRWQVGKIL 639 (799) Q Consensus 565 --d~l~~~v---~r~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~g~~v 639 (799) ..||.+- .+..+-.++++... ..+. .+..|+ +|...... .|....+.. + T Consensus 361 ~ng~l~~ld~~~~~d~g~~i~~~~~~-p~~~--~d~~Rv-~~~el~~~---------------------tGvgq~~d~-v 414 (477) T protein:vir:35 361 KEGVLGHLIFNASNQYEQQTEHLLYT-PMIK--ADNARL-FDFELEAS---------------------TGVAQIADK-L 414 (477) T ss_pred CCCeEEEECCCCcccCCCccceEEec-ceee--CCCCeE-EEEEEEEe---------------------cCcCccCce-E Confidence 2344332 12222122222111 0000 011121 12111111 011111111 1 Q ss_pred EE--ecCCccccccccccceecCceEEEccCCCCCeEEEeEeeeEEEEecCeeEEcCCCccceeeeecceEEEEEEEEEe Q lcl|NC_019510. 640 VS--DEVGEVRQYEPPAGGWASDPTLRIVGDMAGKRVFIGFAYEFRYEFSKFLIKKQDESGGFSTEDVGRLQHRRAWLNY 717 (799) Q Consensus 640 ~~--~adG~~~~~~~v~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~g~~~~~~~~~~rl~l~~~~~~~ 717 (799) .+ ..|| ..|-.+... .++ ..++.. -|++.+|+-- . T Consensus 415 ~L~~sddG--------------------------------~~~~~~~~~------~~g--~~g~~~--~r~~~~RlG~-~ 451 (477) T protein:vir:35 415 FLSVTTDG--------------------------------INYSREQLI------EQN--SPFQYD--KRILWRRIGR-V 451 (477) T ss_pred EEEEeccc--------------------------------cccccceee------cCC--Cccccc--cceeeeeeee-c Confidence 11 1122 111111110 000 011111 1122222110 1 Q ss_pred eccceEEEEecCCCcccceeccccccC Q lcl|NC_019510. 718 EQSGAFYVDVTNLGRSYRYTMSGKPLG 744 (799) Q Consensus 718 ~~t~~~~~~v~~~~~~~~~~~~~~~~~ 744 (799) .+-.+|++++....+- ...-.+.++- T Consensus 452 r~~vgf~~r~~~~~pv-~l~~~~~~~e 477 (477) T protein:vir:35 452 RKNIGFKIRIITKSPV-TLSDLSIRME 477 (477) T ss_pred eeccceEEEEEecCCc-eeccceeEeC Confidence 1111233332211100 0000000000 Done!