Query lcl|NC_012662.1_cdsid_YP_002875655.1 [gene=VPP93_gp31] [protein=putative tail tubular protein B] [protein_id=YP_002875655.1] [location=24741..27083] Match_columns 780 No_of_seqs 153 out of 218 Neff 8.2 Searched_HMMs 1612 Date Thu Nov 7 12:42:34 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_31 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_31_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:10452 Length: 794 100.0 2E-226 1E-229 1258.4 85.3 765 1-780 1-794 (794) 2 protein:vir:94713 Length: 785 100.0 3E-222 2E-225 1234.7 85.1 758 1-780 1-785 (785) 3 protein:vir:2203 Length: 794 # 100.0 7E-222 4E-225 1233.1 86.4 765 1-780 1-794 (794) 4 protein:vir:6326 Length: 826 # 100.0 3E-222 2E-225 1235.1 83.7 769 1-780 1-826 (826) 5 protein:vir:94583 Length: 792 100.0 3E-220 2E-223 1224.4 84.1 764 1-780 1-792 (792) 6 protein:vir:1543 Length: 801 # 100.0 6E-220 4E-223 1222.6 85.0 765 1-780 1-801 (801) 7 protein:vir:99677 Length: 794 100.0 7E-220 4E-223 1222.1 85.2 763 1-780 1-794 (794) 8 protein:vir:80253 Length: 777 100.0 8E-220 5E-223 1221.7 83.3 761 1-780 1-777 (777) 9 protein:vir:97014 Length: 800 100.0 1E-219 6E-223 1221.1 83.5 754 1-780 1-800 (800) 10 protein:vir:3366 Length: 801 # 100.0 3E-219 2E-222 1218.4 82.8 764 1-780 1-801 (801) 11 protein:vir:7021 Length: 803 # 100.0 3E-218 2E-221 1213.2 82.1 761 1-780 1-803 (803) 12 protein:vir:78957 Length: 826 100.0 2E-217 1E-220 1208.2 82.3 767 1-780 1-826 (826) 13 protein:vir:105647 Length: 800 100.0 7E-217 4E-220 1205.7 83.9 762 1-780 1-800 (800) 14 protein:vir:103341 Length: 806 100.0 1E-216 8E-220 1204.2 84.4 765 1-780 1-806 (806) 15 protein:vir:8887 Length: 808 # 100.0 3E-215 2E-218 1196.9 84.1 764 1-780 1-808 (808) 16 protein:vir:78703 Length: 905 100.0 1E-208 9E-212 1160.0 81.2 760 1-780 1-905 (905) 17 protein:vir:100022 Length: 976 100.0 1E-205 6E-209 1144.4 83.4 765 1-780 1-976 (976) 18 protein:vir:103790 Length: 768 100.0 5E-173 3E-176 965.4 74.7 718 1-777 1-768 (768) 19 protein:vir:95324 Length: 823 100.0 1E-162 6E-166 908.8 70.7 704 1-776 1-823 (823) 20 protein:vir:7329 Length: 825 # 100.0 3E-160 2E-163 894.9 66.2 706 1-776 1-825 (825) 21 protein:vir:107802 Length: 681 100.0 7E-158 4E-161 882.1 71.3 659 1-775 1-681 (681) 22 protein:vir:98487 Length: 681 100.0 7E-158 4E-161 882.1 71.3 659 1-775 1-681 (681) 23 protein:vir:107423 Length: 681 100.0 7E-158 4E-161 882.1 71.3 659 1-775 1-681 (681) 24 protein:vir:1778 Length: 680 # 100.0 2E-157 1E-160 879.3 56.3 535 1-545 1-680 (680) 25 protein:vir:102644 Length: 594 100.0 1E-136 8E-140 765.6 57.7 557 1-776 1-594 (594) 26 protein:vir:94602 Length: 1012 99.5 1.8E-12 1.1E-15 85.0 34.4 722 1-780 1-1010(1012) 27 protein:vir:2625 Length: 715 # 99.5 7.1E-12 4.4E-15 81.7 42.9 630 1-777 1-715 (715) 28 protein:vir:8837 Length: 513 # 98.8 3.1E-08 1.9E-11 61.8 40.0 483 1-779 1-513 (513) 29 protein:vir:80177 Length: 1027 98.7 1.1E-07 6.9E-11 58.7 33.8 727 1-780 1-981 (1027) 30 protein:vir:95475 Length: 771 97.1 0.00017 1E-07 41.3 39.3 667 1-777 1-771 (771) 31 protein:vir:105563 Length: 396 97.1 0.00018 1.1E-07 41.1 19.8 362 1-526 1-396 (396) 32 protein:vir:3133 Length: 911 # 96.4 0.00064 3.9E-07 38.1 37.0 684 1-780 1-839 (911) 33 protein:vir:93631 Length: 580 66.6 0.27 0.00016 23.7 27.1 495 86-686 1-580 (580) No 1 >protein:vir:10452 Length: 794 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848299;genbank:gi:30387490;genbank:GeneID:1733952 Probab=100.00 E-value=1.7e-226 Score=1258.37 Aligned_cols=765 Identities=19% Similarity=0.301 Sum_probs=682.7 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) || +|+|+||||++|||||||++||++||++|+||+|+|+|||+||||++||+++++... ....+++|+++|++.|++| T Consensus 1 M~-~i~~s~~n~~~GvSqq~D~~Ry~~q~~~~~N~~~~~~gG~~rRpGt~fv~~l~~~~~-~~~~~~~~~~~rd~~e~~~ 78 (794) T protein:vir:10 1 MA-LISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLKTLGDNGA-LGQAPYIHLINRDENEQYY 78 (794) T ss_pred Cc-ceeeecchhhcccccCCchHHhhhhHhhhhcceeeeccCcccCcchhhheeccCCCc-cccceeeeEEecCCCceEE Confidence 99 599999999999999999999999999999999999999999999999999987644 3556788999999999998 Q ss_pred EEEEcCcEEEEEeCCCcE--EEecCCCCccccC-CcceEEEEEeCCEEEEecCCEeeeEeeccccc--CCCCcceEEEec Q lcl|NC_012662. 81 VINTNTGGWWLLDREAKN--IVSEGNLSYLLAA-DRRSIQTTSMGGVTYILNTEKRPSATTDNSDK--KDPKTTGFYFVK 155 (780) Q Consensus 81 ~l~~~~g~~~v~d~~~~~--~~~~~~~~y~~~~-~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~--~~~~~~g~v~v~ 155 (780) ++++++ +++||+.+|.. ++.+++++|+.++ ++++|+|+|+||+|||+|++++|++.++.... ..+..+++++++ T Consensus 79 v~~~~~-~irv~~~~G~~~~v~~~~~~~Y~~aa~~~~~l~~~q~aD~~fivn~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 157 (794) T protein:vir:10 79 AVFTGT-GIRVFDLAGNEKQVRYPNGSNYIKTANPRSDLRMVTVADYTFIVNRNVVVQKDPNSVNLANYNPKQDGLINIR 157 (794) T ss_pred EEEeCC-eEEEEEcCCcEEEEEcCCCCcceecCCCcceEEEEEEcCEEEEEcCCeeeeeeccccccCCCCCCccEEEEec Confidence 887766 59999987644 5567888887554 66789999999999999999999988776543 446678999999 Q ss_pred CccccceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheecccceEE-EcCeEEEEEcCCCcee-EEE Q lcl|NC_012662. 156 SGAFSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVGVDFAV-RVGPYIYFELITGTDL-KIT 233 (780) Q Consensus 156 ~g~y~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g~~~~~-~~g~~i~~~~~s~~~~-~vt 233 (780) +|+|+++|++++ ++..++++++|+++.+++..++++++|+.+|.+++++...+|+. +.|+++++.+.++... .++ T Consensus 158 ~g~y~r~y~i~i---~~~~~at~~tpdgt~~~~~~~~s~~~ia~~L~~~l~a~~~g~t~~~~g~~i~i~a~s~~~~~t~s 234 (794) T protein:vir:10 158 GGQYGRELIVHI---NGKDVATYKIPDGSKPEHVNNTDAQWLAERLAKQMRINLSGWTVNVGQGFIHVTAPSGQQIDSFT 234 (794) T ss_pred ccccceEEEecc---CCcceeEEEecCCCCcccceecchhhhhhhhhhhhhcccCCceEEeCCeEEEEEeccCceecccc Confidence 999999999999 44557889999999999999999999999999999888777775 6788999998877653 456 Q ss_pred eecCCcceeEE-EEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEecccceeEEc-ccceeEEee Q lcl|NC_012662. 234 STSGSPYIGYS-NQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVTAIS-VDVPYKIVD 311 (780) Q Consensus 234 ~~~g~~~~~~~-~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~~~~~~~~-~~~p~~l~~ 311 (780) ..++....... ..+.++++++||+.+++|+.++|...+++..+.||++|+..++.|+||++++...+++ ++|||.|++ T Consensus 235 ~~~~~~~~~~~~v~~~~~~~~~lp~~~~~G~~v~i~~~~~~~~~~yyv~~~~~~~~w~E~~~~g~~~~~~~~tmP~~l~r 314 (794) T protein:vir:10 235 TKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWTETLGWNTENQVLLETMPHALVR 314 (794) T ss_pred ccCCcCcceeEEEEeccCcceecccCCCCCcEEEEEeCCCCCcceeEEEEEcCCcEEEEecccceeEEEecccceeEEEE Confidence 66665555544 3567899999999999999999999888989999999999999999999999999987 689999986 Q ss_pred cc-----ccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEE Q lcl|NC_012662. 312 DN-----VEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLSGAYVCMSATGEPDRFFRSTVSSLDPTDRIDI 386 (780) Q Consensus 312 ~~-----~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~t~~~~~ddD~i~~ 386 (780) .+ ++..+|++|.+||+++||+|+|++ ++|++|+||||||+|++|++|||||+||||||+++|+++++|||||++ T Consensus 315 ~~~~t~~~~~~~w~~r~~Gd~~tnp~psf~g-~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~ 393 (794) T protein:vir:10 315 AADGNFDFKWLEWSPKSCGDVDTNPWPSFVG-SSINDVFFFRNRLGFLSGENIILSRTAKYFNFYPASIANLSNDDPIDV 393 (794) T ss_pred eccceEEeeecccccccccccccCccCcccC-CCccEEEEEcceEEEeeCCeEEEEecCCcccccccccccCCCCccEEE Confidence 43 567789999999999999999997 789999999999999999999999999999999999999999999999 Q ss_pred EEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCceEEEE Q lcl|NC_012662. 387 ASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFSAVL 466 (780) Q Consensus 387 ~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~ 466 (780) +++++++|+|+|+++++++|+|||+++||+|+| +++|||+|++++++|+|+|++.|+|+.+|++++|++++| ++++++ T Consensus 394 ~~ss~~~~~i~~~v~~~~~L~i~T~~~q~~l~~-~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~~~ 471 (794) T protein:vir:10 394 AVSTNRIAILKYAVPFSEELLIWSDEAQFVLTA-SGTLTSRSVELNLTTQFDVQDRARPYGIGRNVYFASPRS-SYTSIH 471 (794) T ss_pred EecCCcceeeEEEeecCCcEEEEecCcEEEEeC-CCcccceeEEEEEEEeecccCCCCceEeCCeEEEEecCC-CeeEEE Confidence 999999999999999999999999999999998 569999999999999999999999999999999998877 678888 Q ss_pred EeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeeeEeeccCCcEEE Q lcl|NC_012662. 467 ELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAWHKWVFPYRVAS 546 (780) Q Consensus 467 e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~w~~~G~v~~ 546 (780) |++..++.+++|+++|||+|++|||+++++.+++++++|.+++|+++++|+|++|+|+++++||+|+|||||+|+|.|++ T Consensus 472 r~~~~~~~~d~y~a~Dlt~~~~hl~~~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~ 551 (794) T protein:vir:10 472 RYYAVQDVSSVKNSEDITSHVPNYIPNGVFSICGSGTENFCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQV 551 (794) T ss_pred EEeeeccccCceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceEEEeEEEEEcCCcEEE Confidence 87655588999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEE--CCcEEEEEEEcCCCeEEEEEEeeeccCCcccccccceeeeeccceeeecCcce---------eEEeeccccCCC Q lcl|NC_012662. 547 LHFA--RDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKG---------IVPIYMRPWVSE 615 (780) Q Consensus 547 ~~~~--~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~---------~~~~~~~~~~l~ 615 (780) +|+. +|+||++|+|+++++.+||.|+++.... .+.++++||||+.++....+.. ..+...|+.|+| T Consensus 552 ~~~~~~~d~l~~iv~r~~~~~~~r~~~~~~~~~~---~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~g~~~~e 628 (794) T protein:vir:10 552 LACQSISSDMYVILRNEFNTFLARISFTKNAIDL---QGEPYRAFMDMKIRYTIPSGTYNDDTFTTSIHIPTIYGANFGR 628 (794) T ss_pred EEEEecCCeEEEEEEeCCCEEEEEEEEeecCCCC---CCccceeeeecceEEEecCcccccccccceEEcccccCccccc Confidence 9865 7999999999999999999888875433 3456788999999887654432 223456899999 Q ss_pred CeEEEEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCceee----ecceEEEEEEE Q lcl|NC_012662. 616 GKLTGSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLIS----TAPVRLLRYEL 691 (780) Q Consensus 616 g~~v~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~----~~r~~v~rv~v 691 (780) |+++.+.+||..........+.....++++++++++++|+|||+|+++++|+||++++++|+.++ .+|+||+|+++ T Consensus 629 g~~v~~~adg~~~~~~~~~~~~~g~~~l~i~~~~~a~~v~vGl~y~s~~~~~~~~i~~~~~~~~~~~~~~gr~~l~r~~~ 708 (794) T protein:vir:10 629 GKITVLEPDGKITVFEQPTSGWQSDPWLRLSGNLEGREVFIGFNINFVYEFSKFLIKQTTDDGSTSTEDIGRLQLRRAWV 708 (794) T ss_pred ccEEEEecCCceeeeeeeeeeeecceEEEecCCCCCceEEEeeeeeEEEEecceEEEccCCCcceeeeccccEEEEEEEE Confidence 99999999998877666666666667889999999999999999999999999999999987664 47999999999 Q ss_pred EEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEE Q lcl|NC_012662. 692 TTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQSTEMYLSTDGTQDMNILEIEYII 771 (780) Q Consensus 692 ~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlai~~eg 771 (780) +|.+||+|.++|+++.++. .+.+.+.++++.++.+|.+| ..+|++++|+.+|+++.+|+|+|++|+||+|+||+||| T Consensus 709 ~~~~tg~~~v~v~~~~~~~--~~~~~~~~~~~~~~~~g~~~-~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~eg 785 (794) T protein:vir:10 709 NYEDSGTFDIYVENQSSNW--KYTMAGARLGSNTLRAGRLN-LGTGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEG 785 (794) T ss_pred EeeccccEEEEEcCCcccc--ceeeccceeccccccccccc-cccceEEEEecccCceEEEEEEECCCCceEEEEEEEEE Confidence 9999999999999987653 34578888999998899876 57999999999999999999999999999999999999 Q ss_pred EEecceecC Q lcl|NC_012662. 772 RYNQRRRRV 780 (780) Q Consensus 772 ~y~~r~rrv 780 (780) +||+|+||| T Consensus 786 ~y~~r~~~v 794 (794) T protein:vir:10 786 NYLRRSSGI 794 (794) T ss_pred EEeccccCC Confidence 999999999 No 2 >protein:vir:94713 Length: 785 # NCBI annotation: tail tube # Family: family:all:825 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338122;genbank:gi:77118200;genbank:GeneID:3707736 Probab=100.00 E-value=3.5e-222 Score=1234.71 Aligned_cols=758 Identities=20% Similarity=0.293 Sum_probs=672.5 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) || +|+|+||||++|||||+|++||++||++|+||+|+|+||++||||++||++++... +...+++.+++++ ++.| T Consensus 1 M~-~~~~s~~n~~~GvSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~~v~~l~~~~---~~~~~~~~f~~~~-~~~y 75 (785) T protein:vir:94 1 MP-LITQSIKNLKGGISQQPDILRFSDQGEAQVNCWSSESDGLQKRPPTVFKRRLNIDV---GSNPKFHLINRDE-QEQY 75 (785) T ss_pred Cc-ceeeecchhhcceecCCchHHhhhHHhhhhcceeeeccCcccCChhHhhhcccCCC---CcCcEEEEEEeCC-CceE Confidence 99 59999999999999999999999999999999999999999999999999887533 3455666677754 4557 Q ss_pred EEEEcCcEEEEEeCCCcEEEecCCCCccccCCc-ceEEEEEeCCEEEEecCCEeeeEeeccccc-CCCCcceEEEecCcc Q lcl|NC_012662. 81 VINTNTGGWWLLDREAKNIVSEGNLSYLLAADR-RSIQTTSMGGVTYILNTEKRPSATTDNSDK-KDPKTTGFYFVKSGA 158 (780) Q Consensus 81 ~l~~~~g~~~v~d~~~~~~~~~~~~~y~~~~~~-~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~-~~~~~~g~v~v~~g~ 158 (780) +|++++++|+|||.+|..+.+++..+|+.+.+. ++|+|+|+||+|||+|++++|+++.+.++. +.+..++++++++++ T Consensus 76 ~l~~~~~~irv~~~~G~~~~v~~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~~~~~~~~~~~~~~~~~~~~~~i~~g~ 155 (785) T protein:vir:94 76 YIVFNGSNIQIVDLSGNQYSVSGSVDYVKSSNPRDDIRVVTVADYTFVVNRKVVVKGGSEKSHSGYNRKARALINLRGGQ 155 (785) T ss_pred EEEEcCCeEEEEecCCcEEEEecCCCceeecCchhheeeEeeCCEEEEEcCCcceeeeeccCCcCCCCCCceEEEecccc Confidence 888899999999998888888899999876654 479999999999999999999998887654 567789999999999 Q ss_pred ccceeEEEEeeCCceEEEEEEeccCCCCcc-ccccchhhhhhhhhhhheecccceEE-EcCeEEEEEcCCCce-eEEEee Q lcl|NC_012662. 159 FSKEYDISVVWSEGSQTVTYTTPDGTTAGD-ADQSVPEAIARKLVEALIAVGVDFAV-RVGPYIYFELITGTD-LKITST 235 (780) Q Consensus 159 y~~~y~vti~~~~~~~t~t~tt~~~s~~~~-~~~~~~~~i~~~l~~~~~s~g~~~~~-~~g~~i~~~~~s~~~-~~vt~~ 235 (780) |+++|++++ ++..++++++++++.+.. .+..+.++++.++.+++++...+|+. ..++++++.+.++.. ..+++. T Consensus 156 y~~~y~i~i---~g~~~at~~t~~~s~a~~s~~~~s~~~i~~~l~~~l~a~~t~~t~~~~g~~i~i~a~s~t~~~~~s~~ 232 (785) T protein:vir:94 156 YGRTLKVGI---NGGVKVSHKLPAGNDAENDPPKVDAQAIGAALRDLLVTAYPTFTFDLGSGFLLITAPSGTDINSVETE 232 (785) T ss_pred cceeEEEee---CCcceeEEEEccCccccccccccchHHHHHHHHHHhhccccceeEEecCcEEEEEecCCccccceeee Confidence 999999999 566788899998887665 46678889999999999988888764 567888888877654 457777 Q ss_pred cCCcceeEE-EEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEecccceeEEc-ccceeEEeecc Q lcl|NC_012662. 236 SGSPYIGYS-NQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVTAIS-VDVPYKIVDDN 313 (780) Q Consensus 236 ~g~~~~~~~-~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~~~~~~~~-~~~p~~l~~~~ 313 (780) ++..+..+. ..+.++++++||..+++|+.++|..++++..+.||++|+..+|+|+||+++|...+++ ++|||.|++.+ T Consensus 233 ~~~~~t~~~~~~~~~~~~~~Lp~~~~~G~~v~v~~~~~~~~~~y~v~~~~~~g~w~e~~~~g~~~~~~~~tmp~~l~~~~ 312 (785) T protein:vir:94 233 DGYANQLISPVLDTVQTISKLPLAAPNGYIIKIQGETNSSADEYYVMYDSNTKTWKETVEPGVVTGFDNTTMPHALVRQS 312 (785) T ss_pred cccCCeEEEEEEeeccceeccccccCCCCEEEEEccCCCCccceEEEEEcCCceEEEecccceeeeeeccccceEEEecc Confidence 777766654 4678899999999999999999999999999999999999999999999999999998 57999999754 Q ss_pred -----ccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEE Q lcl|NC_012662. 314 -----VEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLSGAYVCMSATGEPDRFFRSTVSSLDPTDRIDIAS 388 (780) Q Consensus 314 -----~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~t~~~~~ddD~i~~~~ 388 (780) ++..+|+.|.+||+++||+|||++ ++|++|+||||||+|++|++|||||+||||||+++|+++++|||||++++ T Consensus 313 ~~~~~~~~~~w~~r~~Gd~~tnp~psf~g-~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~ 391 (785) T protein:vir:94 313 DGSFEFKALDWSKRGAGNDDTNPMPSFVD-ATINDVFFYRNRLGFLSGENVIMSRSASYFAFFPKSVATLSDDDPIDVAV 391 (785) T ss_pred CCceEEeccccccccCCCcccCCcceecc-cccceEEEEeceEEEecCCeEEEEccCCcccCccccccCCCCCccEEEEe Confidence 567789999999999999999997 79999999999999999999999999999999999999999999999999 Q ss_pred cCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCceEEEEEe Q lcl|NC_012662. 389 GSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFSAVLEL 468 (780) Q Consensus 389 ~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~ 468 (780) +++++|+|+|+++++++|+|||+++||+|+| +++|||+|++++++|+|+|+++|+|+.+|++++|++++| ++++++|+ T Consensus 392 ~~~~~~~i~~~v~~~~~L~l~T~~~e~~l~~-~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~v~r~ 469 (785) T protein:vir:94 392 SHPRISILKYAVPFSEQLLLWSDEVQFVMTS-SGVLTSKSIQLDVGSEFALGDNARPFAVGRSVFFSAPRG-SFTSIKRY 469 (785) T ss_pred cCCcceeeEEEeecCCcEEEEecCcEEEEcC-CCcccceeEEEEEEEeeeccCCCCceEeCCeEEEEecCC-CeeEEEee Confidence 9999999999999999999999999999987 569999999999999999999999999999999998887 67889888 Q ss_pred eeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeeeEeeccCC--cEEE Q lcl|NC_012662. 469 VPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAWHKWVFPY--RVAS 546 (780) Q Consensus 469 ~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~w~~~G--~v~~ 546 (780) ++.++++++|+++|||+|++|||+|+++.+++++++|.+++|++++||+|++|||+++++||+|+|||||+|+| .+++ T Consensus 470 ~~~~~~~d~y~~~dlt~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~g~l~~~~y~~~~~e~~v~aW~r~~~~~~~~~~~ 549 (785) T protein:vir:94 470 FAVADVSDVKDADDTTGHVLSYIPNGVFDIQGTGTENYICVNSTGAYNRIYIYKFLFKDSVQLQASWSHWEFPKDDKILA 549 (785) T ss_pred eeecccccceehhhHHHHHHHhcCCCcEEEEEecCCCcEEEEEEcCCCEEEEEEEeecCCceEEEEEEEEEeCCCeEEEE Confidence 87778899999999999999999999999999999999999999999999999999999999999999999976 6889 Q ss_pred EEEECCcEEEEEEEcCCCeEEEEEEeeeccCCcccccccceeeeeccceeeecCcceeE----------EeeccccCCCC Q lcl|NC_012662. 547 LHFARDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKGIV----------PIYMRPWVSEG 616 (780) Q Consensus 547 ~~~~~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~----------~~~~~~~~l~g 616 (780) +|+++|++|++++|. +|...+++|++.... +..+..++.||||+..+....+.... ..+.++.|+|| T Consensus 550 ~~~~~d~~~~vv~r~-~g~~~~~ie~~~~~~--d~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~~g~~~leg 626 (785) T protein:vir:94 550 SASIGSTMFIVRQHQ-GGVDIEHLKFIKEAT--DFPSEPYRLHVDSKVSMVIPIGSFNADTYKTTVDIGAAYGGNAPSPG 626 (785) T ss_pred EEEeCCEEEEEEEcC-CCEEEEEEEeecccC--CCCCcceeEEeeeeeEEEecCcceeccccccccccccccccCCccCC Confidence 999999999999997 788899999875533 23455688899999988665544332 23568999999 Q ss_pred eEEEEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCceee---ecceEEEEEEEEE Q lcl|NC_012662. 617 KLTGSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLIS---TAPVRLLRYELTT 693 (780) Q Consensus 617 ~~v~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~---~~r~~v~rv~v~~ 693 (780) +++.+++||.+++...+. ...+++++++++++++|+|||+|+++++|+||++++++|+++. .+|+||+|++|+| T Consensus 627 ~~v~v~adG~~~~~~~v~---~~~~tl~~~g~~~~~~v~vGl~y~~~~~~~~~~~~~~~~~~~~~~~~gr~~l~r~~~~~ 703 (785) T protein:vir:94 627 RYYLIDSQGAYLDLGELT---SISTVITLNGDWSGRTVFIGRSYLMSYKFSRFLIKIEDDSGTQSEDTGRLQLRRAWVNY 703 (785) T ss_pred eEEEEeeCCcCccCceEc---CCCcEEEecCCCCCceEEEeeeeeEEEeecceeEEecCCCcccccccccEEEEEEEEEe Confidence 999999999998765443 3346789999999999999999999999999999999986544 4899999999999 Q ss_pred eccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEE Q lcl|NC_012662. 694 RNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQSTEMYLSTDGTQDMNILEIEYIIRY 773 (780) Q Consensus 694 ~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlai~~eg~y 773 (780) .+|++|+++++++.++ +.+.+++++++. ..++.+| +.+|+++||+.+|+++.+|+|+|++|+||+|+||+|||+| T Consensus 704 ~~sg~~~v~v~~~~~~--~~~~~~~~~~g~--~~~~~~~-~~tg~~~vp~~g~~~~~~v~i~~~~P~P~tvlsi~~eg~y 778 (785) T protein:vir:94 704 RDTGALRLIVRNGERE--FVNTFNGYTLGQ--QTIGTTN-IGDGQYRFAMNGNALTTSLTLESDYPTPVSIVGCGWEASY 778 (785) T ss_pred ecccceEEEecCCCcc--ceeeecCcccCc--ccccccc-cccceEEEEeecccceEEEEEEECCCCceEEEEEEEEEEE Confidence 9999999999987764 456677777763 4667665 5799999999999999999999999999999999999999 Q ss_pred ecceecC Q lcl|NC_012662. 774 NQRRRRV 780 (780) Q Consensus 774 ~~r~rrv 780 (780) |+|+||| T Consensus 779 ~~r~~~v 785 (785) T protein:vir:94 779 AKKARSV 785 (785) T ss_pred eccccCC Confidence 9999999 No 3 >protein:vir:2203 Length: 794 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_042000;swissprot:sw:p03747;genbank:gi:9627472;uniprot:P03747;genbank:GeneID:1261024 Probab=100.00 E-value=6.9e-222 Score=1233.08 Aligned_cols=765 Identities=19% Similarity=0.288 Sum_probs=679.2 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) || +|+|+||||++|||||+|++||++||++|+||+|+|+|||+||||++||+++++... ....++++++++++.+ +| T Consensus 1 M~-~i~~s~~n~~~GvSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~-~~~~~~l~~~~~~~~~-~y 77 (794) T protein:vir:22 1 MA-LISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTLGDNGA-LGQAPYIHLINRDEHE-QY 77 (794) T ss_pred Cc-eeeeecchhhcccccCCchHHhhhHHhhhhcceeeccCCceeCCchHhhhhhcccCC-CCCccEEEEEEeCCCc-EE Confidence 99 599999999999999999999999999999999999999999999999999886543 3346677778776655 45 Q ss_pred EEEEcCcEEEEEeCCCcEE--EecCCCCcccc-CCcceEEEEEeCCEEEEecCCEeeeEeecccccC--CCCcceEEEec Q lcl|NC_012662. 81 VINTNTGGWWLLDREAKNI--VSEGNLSYLLA-ADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKK--DPKTTGFYFVK 155 (780) Q Consensus 81 ~l~~~~g~~~v~d~~~~~~--~~~~~~~y~~~-~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~--~~~~~g~v~v~ 155 (780) ++++++++|+||+.++... ..++..+|+.+ .+..+|+|+|+||+|||+|++++|+++.+..+.. .+..+|+++++ T Consensus 78 ~l~~~~~~irv~~~~G~~~~v~~~~~~~y~~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~~~g~v~v~ 157 (794) T protein:vir:22 78 YAVFTGSGIRVFDLSGNEKQVRYPNGSNYIKTANPRNDLRMVTVADYTFIVNRNVVAQKNTKSVNLPNYNPNQDGLINVR 157 (794) T ss_pred EEEEcCCeEEEEecCCcEEEeecCCCccceecCCCcccEEEEEEcCEEEEEcCCeeeeEeeccccCCCCCCCceEEEEcc Confidence 6788888999999876544 44667778644 4567899999999999999999999999876654 46678999999 Q ss_pred CccccceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheecccceEEE-cCeEEEEEcCCCce-eEEE Q lcl|NC_012662. 156 SGAFSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVGVDFAVR-VGPYIYFELITGTD-LKIT 233 (780) Q Consensus 156 ~g~y~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g~~~~~~-~g~~i~~~~~s~~~-~~vt 233 (780) +++|+++|.++|+. ...+++++++++......++++++|+.+|..++++.+.+|+.. .++++++.+.++.. ..++ T Consensus 158 ~g~y~~ty~v~I~~---~~~a~~~~p~gt~~~~~~~~~~~~ia~~L~~~l~~~~~~~t~~~~~~~~~i~a~~~~~~~~~t 234 (794) T protein:vir:22 158 GGQYGRELIVHING---KDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRTNLSDWTVNVGQGFIHVTAPSGQQIDSFT 234 (794) T ss_pred CCccceeEEEEecc---CcceEEEEcCCCccccceeechhhhhhhhhhhheeccccceEEeCCceEEEEEcCCceEEEEe Confidence 99999999999954 4467888999999999999999999999999999988888865 45678888877654 4577 Q ss_pred eecCCcceeEE-EEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEecccceeEEc-ccceeEEee Q lcl|NC_012662. 234 STSGSPYIGYS-NQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVTAIS-VDVPYKIVD 311 (780) Q Consensus 234 ~~~g~~~~~~~-~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~~~~~~~~-~~~p~~l~~ 311 (780) ..+|..+.... ..+.++++++||+.+++|+.++|..++++..+.||++|+..++.|+||++++...+++ ++|||.|++ T Consensus 235 ~~~g~~~t~~~~~~~~~~~~~~lp~~~~~G~~v~i~~~~~~~~~~Y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~lv~ 314 (794) T protein:vir:22 235 TKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAERKVWTETLGWNTEDQVLWETMPHALVR 314 (794) T ss_pred eecccCcceeEEEEeccccceeccccCCCCeEEEEEeCCCCCcceeEEEEeccceEEEEeeeccceeeecccceeeEeee Confidence 77777665554 4577899999999999999999999998889999999999999999999999999997 589999996 Q ss_pred cc-----ccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEE Q lcl|NC_012662. 312 DN-----VEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLSGAYVCMSATGEPDRFFRSTVSSLDPTDRIDI 386 (780) Q Consensus 312 ~~-----~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~t~~~~~ddD~i~~ 386 (780) .. ++..+|++|.|||+++||+|||++ ++|++|+||||||+|++|++|||||+||||||+++|+++++|||||++ T Consensus 315 ~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g-~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~ 393 (794) T protein:vir:22 315 AADGNFDFKWLEWSPKSCGDVDTNPWPSFVG-SSINDVFFFRNRLGFLSGENIILSRTAKYFNFYPASIANLSDDDPIDV 393 (794) T ss_pred ccCCcEEEeeccccccccCccccCCcceecC-CCcceEEEEcceEEEecCCeEEEEccCCccccccccCcCCCCCccEEE Confidence 43 566689999999999999999997 789999999999999999999999999999999999999999999999 Q ss_pred EEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCceEEEE Q lcl|NC_012662. 387 ASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFSAVL 466 (780) Q Consensus 387 ~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~ 466 (780) +++++++|+|+|+++++++|+|||+++||+|+| +++|||+|++++++|+|+|+++|+|+.+|++++|++++| ++++++ T Consensus 394 ~~ss~~~~~i~~~v~~~~~L~i~t~~~e~~l~~-~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~~~ 471 (794) T protein:vir:22 394 AVSTNRIAILKYAVPFSEELLIWSDEAQFVLTA-SGTLTSKSVELNLTTQFDVQDRARPFGIGRNVYFASPRS-SFTSIH 471 (794) T ss_pred EecCCcceeeEEEeecCCcEEEEecCcEEEEeC-CCcccceeEEEEEEEEeeccCCCCceEeCCeEEEEecCC-CeeEEE Confidence 999999999999999999999999999999997 569999999999999999999999999999999998876 678888 Q ss_pred EeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeeeEeeccCCcEEE Q lcl|NC_012662. 467 ELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAWHKWVFPYRVAS 546 (780) Q Consensus 467 e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~w~~~G~v~~ 546 (780) |+++.++.+++|+++|||+|++|||+|+++++++++++|.+++|+++++|+|++|+|+++++||+|+|||||+|+|.|++ T Consensus 472 r~~~~~~~~d~y~~~Dlt~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~ 551 (794) T protein:vir:22 472 RYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFCSVLSHGDPSKIFMYKFLYLNEELRQQSWSHWDFGENVQV 551 (794) T ss_pred EeEeeecccCceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceeEEeeEEEEcCCCEEE Confidence 87766688999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEE--CCcEEEEEEEcCCCeEEEEEEeeeccCCcccccccceeeeeccceeeecCcce---------eEEeeccccCCC Q lcl|NC_012662. 547 LHFA--RDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKG---------IVPIYMRPWVSE 615 (780) Q Consensus 547 ~~~~--~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~---------~~~~~~~~~~l~ 615 (780) +|+. +|+||++|+|+++++.+||.|+++.... .+.++++||||+..+....+.. ..+...++.|++ T Consensus 552 ~~~~~~~d~l~~iv~r~~~~~~~r~~~~~~~~~~---~~~~~~~~lD~~~~~~~~~g~~~~~~~~t~~~~~~~~g~~~~~ 628 (794) T protein:vir:22 552 LACQSISSDMYVILRNEFNTFLARISFTKNAIDL---QGEPYRAFMDMKIRYTIPSGTYNDDTFTTSIHIPTIYGANFGR 628 (794) T ss_pred EEEEecCCEEEEEEEeCCCEEEEEEEEeeccccC---CCccceeeeeeeEEEeeccceeecCCcceEEEcccccCccccc Confidence 8865 7999999999999999999988875443 3456778999998776544321 222346889999 Q ss_pred CeEEEEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCc----eeeecceEEEEEEE Q lcl|NC_012662. 616 GKLTGSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDT----LISTAPVRLLRYEL 691 (780) Q Consensus 616 g~~v~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~----~~~~~r~~v~rv~v 691 (780) |+++.+.+||.+.....+..+......+++++++++++|+|||+|+++++|+||++++++|. ....+|+||+|++| T Consensus 629 g~~v~~~~dg~~~~~~~~~~~~~~~~~~~v~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~~~~~~~~~~~grl~l~r~~~ 708 (794) T protein:vir:22 629 GKITVLEPDGKITVFEQPTAGWNSDPWLRLSGNLEGRMVYIGFNINFVYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWV 708 (794) T ss_pred ceEEEEEcCCceeeceeeeeeeeccceEEeCCCCCCcEEEEeeeeeEEEEecceEEEecCCCccceeeecceEEEEEEEE Confidence 99999999999988877777777777899999999999999999999999999999998874 34458999999999 Q ss_pred EEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEE Q lcl|NC_012662. 692 TTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQSTEMYLSTDGTQDMNILEIEYII 771 (780) Q Consensus 692 ~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlai~~eg 771 (780) +|.+||+|+++++++.++. .+.+++.+++++++.+|++| ..+|++++|+.+|+++.+|+|+|++|+||+|++|+||| T Consensus 709 ~~~~tg~~~v~v~~~~~~~--~~~~~~~~~g~~~~~~g~~~-~~tg~~~vp~~~~~~~~~v~i~~d~p~P~tvlsi~~eg 785 (794) T protein:vir:22 709 NYENSGTFDIYVENQSSNW--KYTMAGARLGSNTLRAGRLN-LGTGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEG 785 (794) T ss_pred EeccccceEEEEcCCCccc--ceeecCceecccccccCccc-ccCceEEEEecccCceEEEEEEECCCCCEEEEEEeEEE Confidence 9999999999999877654 45678888999999999876 47999999999999999999999999999999999999 Q ss_pred EEecceecC Q lcl|NC_012662. 772 RYNQRRRRV 780 (780) Q Consensus 772 ~y~~r~rrv 780 (780) +||+|+||| T Consensus 786 ~y~~r~~~v 794 (794) T protein:vir:22 786 NYLRRSSGI 794 (794) T ss_pred EEeccccCC Confidence 999999999 No 4 >protein:vir:6326 Length: 826 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877473;genbank:gi:33300845;uniprot:Q7Y2D1;genbank:GeneID:1482615 Probab=100.00 E-value=3e-222 Score=1235.09 Aligned_cols=769 Identities=28% Similarity=0.435 Sum_probs=666.2 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) || +|+|+||||++|||||||++||++||++|+||+|+|+|||+||||++||++++++.. .++.+|+|+++|++.||+| T Consensus 1 M~-~i~~s~~n~~~GvSqq~d~~r~~~q~~~~~N~~~~~~~G~~rRpg~~~v~~~~~~~~-~~~~~~~~~~~r~~~~~~~ 78 (826) T protein:vir:63 1 MS-YKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQ-PWPRPFLYHTNLGGRSIAM 78 (826) T ss_pred Cc-eeeeecchhhcceeccCchHhhhhhhhhhhcceeeccCCcccCchhHhhhhhccCCc-cccccEEEEEecCCCceEE Confidence 99 599999999999999999999999999999999999999999999999999997654 4678899999999999999 Q ss_pred EEEEcCcEEEEEeCCCcEEEecC--CCCccccCCcceEEEEEeCCEEEEecCCEeeeEeecccccCCCCcceEEEecCcc Q lcl|NC_012662. 81 VINTNTGGWWLLDREAKNIVSEG--NLSYLLAADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTTGFYFVKSGA 158 (780) Q Consensus 81 ~l~~~~g~~~v~d~~~~~~~~~~--~~~y~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~g~v~v~~g~ 158 (780) ++++++|.||||+.++|.++..+ ..+|+++++.++|+|+|+||+|||+|++++|++..+.....++..++++++++++ T Consensus 79 ~~~~~~g~irv~~~~~g~~~~~~~~~~~y~~~~~~~~l~~~t~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~v~~g~ 158 (826) T protein:vir:63 79 LVAQHRGELYLFDERDGRLLMGQPLVHDYLKANDYRQLRAATVADDLFIANLSVKPEADRTDIKGVDPNKAGWLYIKAGQ 158 (826) T ss_pred EEEecCCcEEEEEcCCCeEEEcCCCCCceeeecCccceEEEEeCCEEEEEeCCeeeeeccccccccCCCCcEEEEeeccc Confidence 99999999999999998876544 4678888888999999999999999999999987777777788899999999999 Q ss_pred ccceeEEEEeeCCc------eEEEEEEeccCCCCcc-----ccccchhhhhhhhhhhheec------------------- Q lcl|NC_012662. 159 FSKEYDISVVWSEG------SQTVTYTTPDGTTAGD-----ADQSVPEAIARKLVEALIAV------------------- 208 (780) Q Consensus 159 y~~~y~vti~~~~~------~~t~t~tt~~~s~~~~-----~~~~~~~~i~~~l~~~~~s~------------------- 208 (780) |+++|.|++++... +.+++++++++..+.. ....+.++++.++...+.+. T Consensus 159 Y~~~y~vti~~~~~~~gt~~s~t~t~~t~~~~~a~~~~~~~~~~~s~~yia~~l~~~~~a~~~~~~~~~t~~~~~~~~~~ 238 (826) T protein:vir:63 159 YSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLYGKFFGAPEYTLPNSTKKYPKVDPDA 238 (826) T ss_pred cCceEEEEEEeccccCCccccceEEEEeccCCcccccccccceeeeeeeeeeeceeeeeeccccccCCCccccceecCCc Confidence 99999999987543 3457888887654432 22335566766655433221 Q ss_pred -------ccceEEEcCeEEEEEcCCCceeEEEeecCCcceeEEEEEeecceecccccccCCc---------eEEEEeccC Q lcl|NC_012662. 209 -------GVDFAVRVGPYIYFELITGTDLKITSTSGSPYIGYSNQSQVNLETDLPARLHPSA---------DGALCAVGQ 272 (780) Q Consensus 209 -------g~~~~~~~g~~i~~~~~s~~~~~vt~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~---------~~~v~~~~~ 272 (780) ........+.++++.......+.++..+|..+...+...+++++++||+.+|... ++.++.++ T Consensus 239 ~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~~p~~~~~~~~~~~~~~~~~~~g- 317 (826) T protein:vir:63 239 NAATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATADLPALLPGVGAPGVGVQFMDGAVMATG- 317 (826) T ss_pred ccceeecceeEecccccEEEEeeCCcccEEEccCCCCcceEEEEEeeccceeeccccCCCcccceEEEeeEEeEEecCC- Confidence 0111122345667777666677777888888877788889999999999888642 33444544 Q ss_pred CCCCceEEEEEecCccEEEEecccceeEEcccceeEEeec------cccccccchhhcCCcccCCCcccccCCCceEEEE Q lcl|NC_012662. 273 SERALVWYRYSSEKGVWLESGDYNSVTAISVDVPYKIVDD------NVEQHIMEGRLAGDDLTNPAPTFLEERRITGIGT 346 (780) Q Consensus 273 ~~~~~~y~~~~~~~g~w~e~~~~~~~~~~~~~~p~~l~~~------~~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~ 346 (780) ...+.||++|+..+|+|+||++++....+ ++|||.|++. .++..+|++|.+||+++||+|+|++ ++|++|+| T Consensus 318 ~~~d~~y~~~~~~~~~w~e~~~~~~~~~~-~tmp~~l~~~~~~~~f~~~~~~w~~r~~Gd~~tnp~psf~g-~~~~~v~f 395 (826) T protein:vir:63 318 STKAPVYFEWDSANRRWAERAAYGTDWVL-KKMPLALRWDEATDTYSLNELEYDRRGSGDEDTNPTFNFVT-RGITGMTT 395 (826) T ss_pred CcccceEEEEEcCCceEEEEeecCccccc-ccceEEEEEeccCCeEEEeccccccccccccccCCCccccC-CCceEEEE Confidence 44588999999999999999999986655 5999999842 3567889999999999999999997 78999999 Q ss_pred EcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccc Q lcl|NC_012662. 347 FQGRLVLLSGAYVCMSATGEPDRFFRSTVSSLDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAP 426 (780) Q Consensus 347 ~q~RL~f~~~~~v~~S~~gd~~nF~~~t~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP 426 (780) |||||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+| +++||| T Consensus 396 ~q~RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~~~q~~ls~-~~~lTP 474 (826) T protein:vir:63 396 FQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPG-GGIVTP 474 (826) T ss_pred EeceEEEeeCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeEEEeecCCcEEEEecCcEEEEeC-CCcccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999998 579999 Q ss_pred cceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCceEEEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCe Q lcl|NC_012662. 427 DNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANI 506 (780) Q Consensus 427 ~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~ 506 (780) +|++++++|+|+|+++|+|+.+|++++|++++|++|++||||+|+++.+++|+++|||+|++|||++++..+ +++++|. T Consensus 475 ~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~~~s~v~e~~~~~d~~~~y~~~dlt~~~~~l~~~~v~~~-a~s~~~~ 553 (826) T protein:vir:63 475 RTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYI-QAAASSG 553 (826) T ss_pred eeEEEEEEEeecccCCCCceEeCCeEEEEecCCCceeEEEEEEeeeccccceehhHHHHHHHHhcCCCeEEE-EEcCCCC Confidence 999999999999999999999999999999999889999999999888888999999999999999987766 5556667 Q ss_pred EEEEEEcCCCEEEEEEEeecCCceeeeeeEeeccCCcEEEEEEECCcEEEEEEEcCCCeEEEE-EEeeeccCCccccccc Q lcl|NC_012662. 507 VLMATTGDNRQVIAHEYHFTSQGKVHQAWHKWVFPYRVASLHFARDRVVLFAADDAGSTDKIT-ISTIDPKQGGVTFDVD 585 (780) Q Consensus 507 ~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~w~~~G~v~~~~~~~d~l~~vv~R~~~g~~~~~-~e~~~~~~~~~~~~~~ 585 (780) .++|++++||+|++|+|+++++||+|+|||||+|+|+|++||+++|+||++|+|++++..+|+ +|+|++....+....+ T Consensus 554 ~v~~~~~~dg~l~~~~y~~~~~e~~v~aW~~~~~~g~v~~~~~i~d~l~~iv~r~~~~~~~r~~~e~~~~~~~~~~~~~d 633 (826) T protein:vir:63 554 YLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYD 633 (826) T ss_pred EEEEEEcCCCEEEEEEEeeCCCcEEEEeEEEEecCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEEecCCccccccCCcc Confidence 788999999999999999999999999999999999999999999999999999999987765 9999876654444556 Q ss_pred ceeeeeccceeeecCcceeEEeeccccCCCCeEEEEEecCccccceeccccccccc--eEEEcCCCCCCEEEEeEeeeEE Q lcl|NC_012662. 586 RLPHLDSMSIVPVNDGKGIVPIYMRPWVSEGKLTGSVATGALASEEVAIDVDEVSW--EFTVEPGFKDSQIYLGFRYESL 663 (780) Q Consensus 586 ~~~~lD~~~~~~~~~~~~~~~~~~~~~~l~g~~v~~~adG~~~~~~~~~~~~~~~~--~~~i~~~~~~~~v~vGl~y~~~ 663 (780) +..++||+..+......... ..+++|+++..+.+++++.+++....... +..+ ++.++++..+++|+|||+|+++ T Consensus 634 ~~~~~d~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~v~l~~~~~~~~~~v~VGl~y~s~ 710 (826) T protein:vir:63 634 YWRRIEATVAGELELTKQHW--DLIKDASAVYQLQPVAGAYMERTHLGVKR-ETNTKVFLDVPEAVVGAVYVVGCEFWSK 710 (826) T ss_pred ceEEEEEeeeeeeccCccee--ecccCcccccEEEEeeCccccCCccceEE-ecCCEEEEecCCCccccEEEEeeeeeEE Confidence 77789988665443222221 24789999999999999998776543322 2233 4566778889999999999999 Q ss_pred EEcCceEEecCCCceeeecceEEEEEEEEEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEe Q lcl|NC_012662. 664 FAPTPPMLKDQNDTLISTAPVRLLRYELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPC 743 (780) Q Consensus 664 v~~~~~~i~~~~g~~~~~~r~~v~rv~v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~ 743 (780) ++|+||++++++|+.++.+|+||||++|+|.+||+|+++|+++.++..+.+..+++++++.+..+|.|+ ..+|++++|+ T Consensus 711 ~~~~~~~~~~~~g~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~p~-~~t~~~~vP~ 789 (826) T protein:vir:63 711 VEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPL-VDSAVVPLPA 789 (826) T ss_pred EEecceEEEccCCCcceeccEEEEEEEEEeeccccEEEEecCccccceeEeecCCceeccccccccccc-ccceEEEEEE Confidence 999999999999999999999999999999999999999999999888888889999999988888776 4799999999 Q ss_pred ecccceeEEEEEECCCCCEEEEEEEEEEEEecceecC Q lcl|NC_012662. 744 RSNAQSTEMYLSTDGTQDMNILEIEYIIRYNQRRRRV 780 (780) Q Consensus 744 ~~~~~~~~v~i~~~~P~P~tvlai~~eg~y~~r~rrv 780 (780) .+++++.+|+|+|+.|+||+|++|+|||+||+|+||| T Consensus 790 ~~~~~~~~i~i~~d~P~p~~il~i~~~~~yn~r~rrv 826 (826) T protein:vir:63 790 RVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) T ss_pred eeccceEEEEEEeCCCCcEEEEEEEEEEEEeceeecC Confidence 9999999999999999999999999999999999999 No 5 >protein:vir:94583 Length: 792 # NCBI annotation: Tubular tail protein B # Family: family:all:825 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919014;genbank:gi:119637778;genbank:GeneID:5179343 Probab=100.00 E-value=2.7e-220 Score=1224.38 Aligned_cols=764 Identities=17% Similarity=0.244 Sum_probs=670.5 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) || +|+|+||||++|||||+|++||++||++|+||+|+|+|||+||||++||++++......... +.+.+.++ .++.| T Consensus 1 M~-~i~~s~~n~~~GiSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~-~l~~~~~~-~~q~y 77 (792) T protein:vir:94 1 MA-LISQSVKNLKGGISQQPNILRFPEQGSEQINGWSSETEGLQKRPPFVFTKTIGDQNALGAKP-LVHLINRD-SAEQY 77 (792) T ss_pred Cc-ceeeecchhhcceecCcchHHhhhhhhhhhcceeeeccccccCChhHHHHhhhcCCCCCccc-EEEEEEeC-CCceE Confidence 99 59999999999999999999999999999999999999999999999999988765544433 44455554 45667 Q ss_pred EEEEcCcEEEEEeCCCcEEEecCCCCccccCCc-ceEEEEEeCCEEEEecCCEeeeEeecccccCCCCcceEEEecCccc Q lcl|NC_012662. 81 VINTNTGGWWLLDREAKNIVSEGNLSYLLAADR-RSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTTGFYFVKSGAF 159 (780) Q Consensus 81 ~l~~~~g~~~v~d~~~~~~~~~~~~~y~~~~~~-~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~g~v~v~~g~y 159 (780) ++++++++++|||.+++.+++.+..+|+.++.. ++|+|+|+||+|||+|++++|+++.+..+..|+.+++++++++|+| T Consensus 78 ~l~f~~~~~rv~~~~g~~~~~~~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~~~~~~~~~~~~~~~~~~~v~i~~g~y 157 (792) T protein:vir:94 78 YVVFTGQGVRVFDLNGKEYDVKGDLSYVKVENPRDDLRMVTVADYTFIVNRNMVVRPDTTPLYTLKENGDCLINIRGGMY 157 (792) T ss_pred EEEEcCCeEEEEecCCceEEecccCceeeecCCcceeEEEEEcCEEEEEeCCccceeEecCcCCCCCCceEEEEccCCCc Confidence 888888899999999999999999999877654 5799999999999999999999999999999999999999999999 Q ss_pred cceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheec--ccceEE-EcCeEEEEEcCCCce-eEEEee Q lcl|NC_012662. 160 SKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAV--GVDFAV-RVGPYIYFELITGTD-LKITST 235 (780) Q Consensus 160 ~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~--g~~~~~-~~g~~i~~~~~s~~~-~~vt~~ 235 (780) +++|.+++.. ..++++++.++.+...++.++++++..|....... ..+|+. +.+.++++.+.++.. ..+++. T Consensus 158 ~~~y~i~i~~----~~~~~~~~~~t~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 233 (792) T protein:vir:94 158 GRTLAFTINN----TKIAYEIAHGDAPEHSKQTDAQWLVKKLAGLARLNVAFKGWTFTEGPGYIHVIAPSNSQINSLSTE 233 (792) T ss_pred ceeEEEEecC----ceeeeeeecCcccceecccchhhhhhhhhhhccccccccccEEEECCeEEEEEecCCceeeeeecc Confidence 9999999954 35667788888888888999999999997754443 345654 567888888877654 346667 Q ss_pred cCCcceeE-EEEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEecccceeEEc-ccceeEEeecc Q lcl|NC_012662. 236 SGSPYIGY-SNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVTAIS-VDVPYKIVDDN 313 (780) Q Consensus 236 ~g~~~~~~-~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~~~~~~~~-~~~p~~l~~~~ 313 (780) +|..+... ...+.++++++||+.+|+|+.++|.++++++.++||++|+..+++|+||++++...+++ ++|||++++.. T Consensus 234 ~g~~~~~~~~~~~~v~~~~~lp~~~~~G~~v~i~~~~~~~~d~y~v~~~~~~~~w~E~~~~~~~~~~~~~tmp~~lv~~~ 313 (792) T protein:vir:94 234 DGYADQLMNAVMHTSQSFSRLPVEAPNGYTVKIVGDTSKTSDMFYVQYDNMKKVWKEVAGWGVQKGLNGGTMPHALVRQA 313 (792) T ss_pred cCcCcceeeeeeecccccccccccCCCCcEEEEEccCCCCccceEEEEEcCCceEEEecccceeeeecccccCeeEEEcC Confidence 77665444 45688999999999999999999999999999999999999999999999999999997 58999999754 Q ss_pred -----ccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEE Q lcl|NC_012662. 314 -----VEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLSGAYVCMSATGEPDRFFRSTVSSLDPTDRIDIAS 388 (780) Q Consensus 314 -----~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~t~~~~~ddD~i~~~~ 388 (780) ++.++|++|.+||+++||+|+|++ ++|++|+||||||+|++|++|||||+||||||+++|+++++|||||++++ T Consensus 314 ~~~~~~~~~~w~~r~~gd~~tnp~psf~g-~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ 392 (792) T protein:vir:94 314 DGSFQMQVLPWTQRTCGDMDTNPTPSIVD-QKINDVFFFRNRLGFLAGENIVMSRTSKYFSLFPASVANLSDDDPIDVAV 392 (792) T ss_pred CCcEEEEeccccccccCccccCccceecc-CCcceEEEEcceEEEecCCeEEEEccCCcccCccccccCCCCCccEEEEe Confidence 456679999999999999999997 78999999999999999999999999999999999999999999999999 Q ss_pred cCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCceEEEEEe Q lcl|NC_012662. 389 GSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFSAVLEL 468 (780) Q Consensus 389 ~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~ 468 (780) +++++|+|+|+++++++|+|||+++||+|+| +++|||+|++++++|+|+|+++|+|+.+|++++|++++| ++++++|+ T Consensus 393 ss~~~~~i~~~v~~~~~L~l~T~~~q~~l~~-~~~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g-~~~~v~r~ 470 (792) T protein:vir:94 393 SHNRISILKYAVPFSEELLLWSDQAQFVLSA-QGILSPKSVELNLTTEFDVSDRARPFGVGRGVYFASPRA-SYTSLNRY 470 (792) T ss_pred cCCcceeeeEEeecCCcEEEEecCcEEEEeC-CCcccceeEEEEEEEEeeccCCCCceEeCCeEEEeecCC-CeeEEEee Confidence 9999999999999999999999999999998 579999999999999999999999999999999998887 67889887 Q ss_pred eeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeeeEeeccCCcEEEEE Q lcl|NC_012662. 469 VPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAWHKWVFPYRVASLH 548 (780) Q Consensus 469 ~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~w~~~G~v~~~~ 548 (780) ++.++.+|+|+++|||+|++|||+|+++.+++++++|.+++|++++||+|++|||+++++||+|+|||||+++|.++++| T Consensus 471 ~~~~~~~d~y~a~DlT~~~~hl~~~~v~~~~a~~~~~~~vv~~~~~~g~l~~~ty~~~~~e~~v~aW~~~~~~g~~~~~~ 550 (792) T protein:vir:94 471 YAVQDVSSVKSAEDMSAHVPNYIPNGVFSIRGSSTENFISVLSSNAPSRIFLYKFLYLNEEIAQQSWSHWELGSNVTVLA 550 (792) T ss_pred eeeccccCceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCeEEEEEEeecCCceEEEeEEEEEcCCcEEEEE Confidence 76678899999999999999999999999999999999999999999999999999999999999999999999988877 Q ss_pred E--ECCcEEEEEEEcCCCeEEEEEEeeeccCCcccccccceeeeeccceeeecCcceeE---------EeeccccCCCCe Q lcl|NC_012662. 549 F--ARDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKGIV---------PIYMRPWVSEGK 617 (780) Q Consensus 549 ~--~~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~---------~~~~~~~~l~g~ 617 (780) + .+|+||++|+|++++..+|+.|+++.... .++++..||||+.++...+++.+. ....++.|++|+ T Consensus 551 ~~~~~D~l~~~v~r~~~~~~~r~~~~~~~~d~---~~~~~~~~lD~~~~~~~~~~~~~~~~~~T~~~~~~~~gl~~l~G~ 627 (792) T protein:vir:94 551 CDSIGSTMYLVLRNQSHTWMCRAHFTKNSIDF---PDEPYRLYIDNKVKYVIPEGSYNDDTYATTVKPVDVYGMKYWTGK 627 (792) T ss_pred EeecCCEEEEEEEeCCCEEEEEEEEeeccccc---CCCcceeeeeeeeeEEecCcceecCceeeeeccccccCcccccCc Confidence 6 58999999999999888888777655432 345678899999998766554321 124689999999 Q ss_pred EEEEEecCccccceec-cccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCc----eeeecceEEEEEEEE Q lcl|NC_012662. 618 LTGSVATGALASEEVA-IDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDT----LISTAPVRLLRYELT 692 (780) Q Consensus 618 ~v~~~adG~~~~~~~~-~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~----~~~~~r~~v~rv~v~ 692 (780) ++.+++||....-... .......+++++++++++++|+|||+|+++++|+||+++++.|. ....||+||||++++ T Consensus 628 ~v~v~~dG~~~~~~~~~~~~~~~~~~i~~~g~~~a~~v~VGl~y~~~~~~~~~~~~~~~g~~~~~~~~~gr~rl~r~~~~ 707 (792) T protein:vir:94 628 FYIVASDGLVSWFEPPRGGWPNGVPMLTMSGNREGETIYVGLAISFRYVFSKFLIKKTADDGSIATEDIGRLQLRRAWVN 707 (792) T ss_pred EEEEEecCceeEeecccceecCCccEEEecCCccCCeEEEeeeeeEEEEeccceeeccCCCcCccccceeeEEEEEEEEe Confidence 9999999976432211 12234566889999999999999999999999999999977664 334579999999999 Q ss_pred EeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEE Q lcl|NC_012662. 693 TRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQSTEMYLSTDGTQDMNILEIEYIIR 772 (780) Q Consensus 693 ~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlai~~eg~ 772 (780) |.+||.|+++++++.++. .+.+.+.++++..+.+|.|| ..+|++++|+.+|+++.+|+|+|++|+||+|+||+|||+ T Consensus 708 ~~~tg~~~v~~~~~~~~~--~~~~~~~~~~~~~~~~g~~~-~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlai~~eg~ 784 (792) T protein:vir:94 708 YEDSGAFTVEVENTSRLF--SYDMAGARLGSNVLRAGGLN-VGTGQFRFPVTGNAQLNEVRIISEHTTPLNVIGCGWEGN 784 (792) T ss_pred eeccceeEEEEcCCCcce--eeeeccceeccccccccccc-cccceEEEEeeccCceEEEEEEECCCCCEEEEEEEEEEE Confidence 999999999999887653 34577888898888888765 689999999999999999999999999999999999999 Q ss_pred EecceecC Q lcl|NC_012662. 773 YNQRRRRV 780 (780) Q Consensus 773 y~~r~rrv 780 (780) ||+|+||| T Consensus 785 y~~r~~~v 792 (792) T protein:vir:94 785 YLRRSSGI 792 (792) T ss_pred EeccccCC Confidence 99999999 No 6 >protein:vir:1543 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052111;swissprot:trembl:q9t105;genbank:gi:9634037;uniprot:Q9T105;genbank:GeneID:1262408 Probab=100.00 E-value=5.7e-220 Score=1222.56 Aligned_cols=765 Identities=18% Similarity=0.277 Sum_probs=672.3 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) || +|+|+||||++|||||||++||++||++|+||+|+|+|||+||||++||++++..... ...+++|+++|++. ++| T Consensus 1 M~-~i~~s~~n~~~GvSqq~d~~r~~~q~~~~~N~~~~~~gGl~rRpGt~~va~~~~~~~~-~~~~~~~~~~~~~~-e~y 77 (801) T protein:vir:15 1 MA-LISQSIKNLKGGISQQPDILRFAEQGSVQINGWSSESEGLQKRPPMIHLKTLGPAGYV-GAQPYVHLINRDEF-EQY 77 (801) T ss_pred Cc-eeeeecchhhcceecCcchHhhhhhHhhhhcceeccccCcccCCchheeeeecCCCCc-ccceeEEEEEeCCc-eEE Confidence 99 5999999999999999999999999999999999999999999999999999875533 56788888888754 456 Q ss_pred EEEEcCcEEEEEeCCCcEEEecCCCCccccCCc-ceEEEEEeCCEEEEecCCEeeeEeecccccC--CCCcceEEEecCc Q lcl|NC_012662. 81 VINTNTGGWWLLDREAKNIVSEGNLSYLLAADR-RSIQTTSMGGVTYILNTEKRPSATTDNSDKK--DPKTTGFYFVKSG 157 (780) Q Consensus 81 ~l~~~~g~~~v~d~~~~~~~~~~~~~y~~~~~~-~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~--~~~~~g~v~v~~g 157 (780) +|+++++.|+|||.+|..+..++..+|+.+++. ++|+++|+||+|||+|++++|++..+..+.. .+..+++++++++ T Consensus 78 ~l~~~~~~irv~~~~G~~~~v~~~~~y~~~~~~~~~l~~~~~aD~~fi~nr~~~~~~~~~~~~~~~~~~~~~alv~v~~~ 157 (801) T protein:vir:15 78 FVVFTGEDIKVFDLDGKEYQVRGDRSYVRTANPREDLRMITVADYTFVTNRKVVVQSNDQSVNLPGFKDQGDALINVRGG 157 (801) T ss_pred EEEEcCCeEEEEccCCcEEEEecCCccccccCchhheeEEEEcCEEEEeeCCeeeecccCccccCccCCCCceEEEeeec Confidence 788899999999998888888888888776654 5899999999999999999999988765433 4456789999999 Q ss_pred cccceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheec---------ccceEEE-cCeEEEEEcCCC Q lcl|NC_012662. 158 AFSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAV---------GVDFAVR-VGPYIYFELITG 227 (780) Q Consensus 158 ~y~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~---------g~~~~~~-~g~~i~~~~~s~ 227 (780) +|+++|+|++ +++..+++++++++.+...++.+.++++..|..++.+. ...|+.. .+.++++.++.+ T Consensus 158 ~yg~t~~I~i---~gs~~~~~t~~~gs~~~~~~~~s~~~ia~~l~~~~~~~~p~~~~~~~~~~w~~~~~~g~~~i~a~~~ 234 (801) T protein:vir:15 158 QYGRRLSIEF---NGAERAAVQLPDGSQPAHVNEVDGQAIAEKLAAQLRNNLGNPNNDQDPNKWRFNVGPGFIHILAPNN 234 (801) T ss_pred cCceeEEEEe---CCcceEEEEeccCcccchhhhcceeechHHHhhhhhhccCccceeccCccEEEEecCcEEEEeCCCC Confidence 9999999999 45667889999999888888888888888887666542 2345543 345667777666 Q ss_pred ce-eEEEeecCCcceeE-EEEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEecccceeEEc-cc Q lcl|NC_012662. 228 TD-LKITSTSGSPYIGY-SNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVTAIS-VD 304 (780) Q Consensus 228 ~~-~~vt~~~g~~~~~~-~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~~~~~~~~-~~ 304 (780) .. +.+++.+|...... ...+.++++++||.++|+|+.++|..+.++..+.||++|+...+.|+||++++...+++ ++ T Consensus 235 ~~~~~~~t~dg~~~~~~~~~~~~v~~~~~lp~~~~~G~~v~v~~~~~~~~~~y~v~~~~~~~~w~E~a~~g~~~~~~~~t 314 (801) T protein:vir:15 235 DNVWGLQTKDGYADQLINPVTHYTQSFQKLPINAPDGYIVKIVGDTSKTADQYYVRFDLNRKVWVETIGWNTRTHLYYHT 314 (801) T ss_pred cccceeeeccccCceeeeEEeecccceeeeeeecCCCcEEEEEecCCCccceEEEEEEcCCeeEEeecccccceeeeccc Confidence 54 46777777665554 45678999999999999999999999888999999999999999999999999999997 58 Q ss_pred ceeEEeecc-----ccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEecCCeEEEEccCCccccccccccCCC Q lcl|NC_012662. 305 VPYKIVDDN-----VEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLSGAYVCMSATGEPDRFFRSTVSSLD 379 (780) Q Consensus 305 ~p~~l~~~~-----~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~t~~~~~ 379 (780) |||.|++.. ++..+|++|.+||+++||+|+|++ ++|++|+||||||+|++|++|||||+||||||+++|+++++ T Consensus 315 mp~~lv~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~g-~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~ 393 (801) T protein:vir:15 315 MPWALVRASDGNFDFKVLEWGARTVGDDTTNPYPSFTG-QTINDIFFFRNRLGFLSGENIILSRTSKYFNFFPASVSNYS 393 (801) T ss_pred cceEEEeeccceEEEeccccccccCCccccCCcccccC-CCceEEEEEcceEEEeeCCeEEEEecCCccccccccccCCC Confidence 999999754 457789999999999999999987 89999999999999999999999999999999999999999 Q ss_pred CCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEecC Q lcl|NC_012662. 380 PTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRS 459 (780) Q Consensus 380 ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g 459 (780) |||||+++++++++|+|+|+++++++|+|||+++||+|+| +++|||+|++++++|+|+|+++|+|+.+|++++|++++| T Consensus 394 DdD~i~~~~~~~~~~~i~~~v~~~~~L~i~t~~~q~~ls~-~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g 472 (801) T protein:vir:15 394 DDDPIDVAVSHNRVSTLKYAVPFSEELLLWSDQAQFVLTA-SGILSSRSVELNLTTQFDVQDRARPHGVGRNVYFASPRA 472 (801) T ss_pred CCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEcC-CCcccceeEEEEEEEeeeccCCCCceEeCCeEEEEecCC Confidence 9999999999999999999999999999999999999988 569999999999999999999999999999999998887 Q ss_pred CceEEEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeeeEeec Q lcl|NC_012662. 460 EAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAWHKWV 539 (780) Q Consensus 460 ~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~w~ 539 (780) ++++++|+++.++++|+|+++|||+|++|||+++++++++++++|.+++|+++++|+|++|+|+++++||+|+|||||+ T Consensus 473 -~~~~~~r~~~~~~~~d~y~a~Dlt~~~~hl~~~~v~~~~~~~~~~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~ 551 (801) T protein:vir:15 473 -SFTSINRYYAVQDVSSVKNAEDMTAHVPNYIPNGVFSISGTTAENFAAILTSGAPNRVYIYKFLYIDEEIRQQSWSHWD 551 (801) T ss_pred -CeeEEEEEEeecccccceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCEEEEEEEecCCCceEEEeeEEEE Confidence 6788887766668899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCcEEEEEE--ECCcEEEEEEEcCCCeEEEEEEeeeccCCcccccccceeeeeccceeeecCcceeE---------Eee Q lcl|NC_012662. 540 FPYRVASLHF--ARDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKGIV---------PIY 608 (780) Q Consensus 540 ~~G~v~~~~~--~~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~---------~~~ 608 (780) |+|.++++|+ .+|+||++|+|+++.+.+++.++.+.. +..+.+++.||||+..++...++.+. ... T Consensus 552 ~~g~~~~~~~~~~~d~l~~~v~r~~~~~~~r~~~~~~~~---~~~~~~~~~~lD~~~~~~~~~~t~~~~~~~~~~~~~~~ 628 (801) T protein:vir:15 552 FGDNVTVFAAQVINSTMTVLMGNEHAVWMGRLHFTKNSI---DIPGEPYRLYIDAKRKYTIPAGTYNDDTYQTSISLATI 628 (801) T ss_pred cCCCEEEEEEEecCCEEEEEEEecCcEEEEEEEEccccc---cCCCcceeeeeeeeeeEeeccceeccCceecccccccc Confidence 9999998886 479999999999654443333333322 22245677899999888765544322 235 Q ss_pred ccccCCCCeEEEEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCC----ceeeecce Q lcl|NC_012662. 609 MRPWVSEGKLTGSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQND----TLISTAPV 684 (780) Q Consensus 609 ~~~~~l~g~~v~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g----~~~~~~r~ 684 (780) .|+.|+||+++.+++||.+.+..++.++....+++++++++++++|+|||+|+++++|+||+++.++| +.+..+|+ T Consensus 629 ~gl~~l~g~~v~v~~dG~~~~~~~~~~g~~~~~~~~i~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~~~~~~~~~~~~rl 708 (801) T protein:vir:15 629 YGMNFTKGRVSVVFPDGKIIEVDQPINGWSSDPVLRLDGNQEGQVVYIGFNIPFTYTFSKFLIKKTAEDGSTATEDIGRL 708 (801) T ss_pred cccccccceEEEEEeCCceeeeeeecCcccCcceEEEcCCCCCcEEEEeeeeeEEEEecceEEeccCCCCCceeeeeccE Confidence 68999999999999999999998888888888899999999999999999999999999999995543 56677899 Q ss_pred EEEEEEEEEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEeecccceeEEEEEECCCCCEEE Q lcl|NC_012662. 685 RLLRYELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQSTEMYLSTDGTQDMNI 764 (780) Q Consensus 685 ~v~rv~v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tv 764 (780) ||||++|++.+||.+++.|+++.++. .+..++.++++.++.+|.+| ..+|+++||+.+|+++.+|+|+|++|+||+| T Consensus 709 ~l~r~~~~~~~tg~~~~~v~~~~~~~--~~~~~~~~~~~~~~~~~~~~-~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tv 785 (801) T protein:vir:15 709 QLRRAWVNYEDSGAFTIRVNNLSREF--IYTMAGARLGSDNLRVGRSN-IGTGQYRFPVVGNAQTNLVTIESDASTPLNI 785 (801) T ss_pred EEEEEEEEeccCcceEEEECCccccc--ceeecCcccccccccccccc-cccceEEEEEeecCceEEEEEEECCCCcEEE Confidence 99999999999999999999987754 46778999999999999876 5899999999999999999999999999999 Q ss_pred EEEEEEEEEecceecC Q lcl|NC_012662. 765 LEIEYIIRYNQRRRRV 780 (780) Q Consensus 765 lai~~eg~y~~r~rrv 780 (780) +||+|||+||+|+||| T Consensus 786 lsi~~e~~y~~r~~~~ 801 (801) T protein:vir:15 786 IGCGWEGNYLRRSSGI 801 (801) T ss_pred EEEEEEEEEeccccCC Confidence 9999999999999999 No 7 >protein:vir:99677 Length: 794 # NCBI annotation: Tail tubular protein B # Family: family:all:825 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249591;genbank:gi:68299742;genbank:GeneID:3799992 Probab=100.00 E-value=6.9e-220 Score=1222.10 Aligned_cols=763 Identities=21% Similarity=0.281 Sum_probs=664.5 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) || +|+|+||||++|||||+|++||++||++|+||+|+|+||++||||++||++++++..... ..+.+.+++++ +++| T Consensus 1 M~-~i~~s~~n~~~GvS~q~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~-~~~l~~f~~~~-~~~y 77 (794) T protein:vir:99 1 MA-LISQSIKNLKGGISQQPDILRYSDQGSKQINGFSSEVEGLQKRPPSVHIKRLTDQFGLGQ-KPYCHIINRDE-VERY 77 (794) T ss_pred Cc-eeeeecchhhcceecCCchHHhhhhHhhhhcceeeeccCcccCCccceeeeecCCCCCcc-ccEEEEEEeCC-CceE Confidence 99 599999999999999999999999999999999999999999999999999987654433 34455555554 5577 Q ss_pred EEEEcCcEEEEEeCCCcE---EEecCCCCccccC-CcceEEEEEeCCEEEEecCCEeeeEeeccc--ccCCCCcceEEEe Q lcl|NC_012662. 81 VINTNTGGWWLLDREAKN---IVSEGNLSYLLAA-DRRSIQTTSMGGVTYILNTEKRPSATTDNS--DKKDPKTTGFYFV 154 (780) Q Consensus 81 ~l~~~~g~~~v~d~~~~~---~~~~~~~~y~~~~-~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~--~~~~~~~~g~v~v 154 (780) ++++++++++||+..+|. +......+|+.++ ++++|+|+|+||+|||+|++++|+++.+.. ....+..++++++ T Consensus 78 ~l~f~~~~irv~~~~~g~~~~v~~~~~~~y~~~~~~~~~l~~~q~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~~~v 157 (794) T protein:vir:99 78 AVFFTGSNIRVFDLFTGDEKTVNAPNGLSYVSSSNPRKDLRMVTVADYTFILNRNVATAQGTTNTPSGLAPFGHFGLVVI 157 (794) T ss_pred EEEEcCCeEEEEECCCCeEEEeeccccccccccCCccceeeEEEEccEEEEEcCCeeeeEeeeeccccCcCCCceEEEEe Confidence 889999999999876654 3445667776554 567999999999999999999999886543 4556778999999 Q ss_pred cCccccceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheecccceEEEcCeEEEEEcCCCce-eEEE Q lcl|NC_012662. 155 KSGAFSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVGVDFAVRVGPYIYFELITGTD-LKIT 233 (780) Q Consensus 155 ~~g~y~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g~~~~~~~g~~i~~~~~s~~~-~~vt 233 (780) ++++|+++|++++. ++.++++++++++..+..++.++++|+.++...+...+... ...+.++++.+..+.. ..++ T Consensus 158 ~~g~y~~~y~v~i~---gs~ta~~~tp~~~~~~~~~~~s~~~ia~~l~~~l~~~g~~v-~~~~g~~~i~~~~~~~v~t~s 233 (794) T protein:vir:99 158 RGGQYGRTYRIKVN---GSVEASFETPLGDQVAHAKQIDIAYIIDQLAAGLINKGWAV-TKGSGYFYFSKSGSVIINSLE 233 (794) T ss_pred ccCCCCceEEEEec---CCcccceeeccCcccccccccchhhhhhhhHhhhhcccceE-EeCCeEEEEEecCCceeEEEE Confidence 99999999999994 45678889999998899999999999999998887655433 3456778888777665 4677 Q ss_pred eecCCcceeEE-EEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEecccceeEEc-ccceeEEee Q lcl|NC_012662. 234 STSGSPYIGYS-NQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVTAIS-VDVPYKIVD 311 (780) Q Consensus 234 ~~~g~~~~~~~-~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~~~~~~~~-~~~p~~l~~ 311 (780) +.+|..+..++ ..+.++++++||+.+|+|+.++|...+.++.++||++|+..++.|+||++++...+++ ++|||.+++ T Consensus 234 ~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~~~~~~~~y~v~~~~~~~~w~e~~~~~~~~~~~~~t~p~~~v~ 313 (794) T protein:vir:99 234 VEDGYNGQLAWGIINDVQKTTQLPVYAPNNYIIRVSGDPTLNQDDYYVRFDASRNVWTECPAPNIKADYNKATMPHVLIR 313 (794) T ss_pred eecCCCCceeeEEeeeccceeecccCCCCCeEEEEeccCCCCCCceEEEEEcCCceEEeeccceeecceeccceEEEEec Confidence 77877666654 4678999999999999999999999888999999999999999999999999999886 589999987 Q ss_pred cc-----ccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEE Q lcl|NC_012662. 312 DN-----VEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLSGAYVCMSATGEPDRFFRSTVSSLDPTDRIDI 386 (780) Q Consensus 312 ~~-----~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~t~~~~~ddD~i~~ 386 (780) .. ++..+|++|.+||+++||+|||++ ++|++|+||||||+|+++++|||||+||||||+++|+++++|||||++ T Consensus 314 ~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g-~~is~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~ 392 (794) T protein:vir:99 314 EADGTFTFKQADWTHRAAGDDETNPYPSFIG-NSINDIFFFRNRLGFLSGENVILSGSGNYFNFFPESVAVLTDTDPIDV 392 (794) T ss_pred cCCCceeEeeccccccccCCcccCCCccccC-cceeEEEEEeeeEEEecCCeEEEEecCCccccccccccCCCCCccEEE Confidence 53 566789999999999999999997 789999999999999999999999999999999999999999999999 Q ss_pred EEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCceEEEE Q lcl|NC_012662. 387 ASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFSAVL 466 (780) Q Consensus 387 ~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~ 466 (780) +++++++|+|+|+++++++|+|||+++||+|+| +++|||+|++++++|+|+|+++|+|+.+|++++|++++| ++++++ T Consensus 393 ~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~l~~-~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g-~~~~v~ 470 (794) T protein:vir:99 393 AVSTNRISILKYAVPFSEELILWSDQAQFVLSS-DGGLTPTTIRLDLTTEFEVTEQARPYGIGRGVYFVSPRA-KFSSVR 470 (794) T ss_pred EecCCcceeeEEEeecCCcEEEEecCcEEEEeC-CCcccceeEEEEEEEEeeccCCCCceEeCCeEEEEecCC-CeeEEE Confidence 999999999999999999999999999999998 569999999999999999999999999999999998887 678887 Q ss_pred EeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeeeEeeccCCcEEE Q lcl|NC_012662. 467 ELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAWHKWVFPYRVAS 546 (780) Q Consensus 467 e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~w~~~G~v~~ 546 (780) |++..++.+|+|+++|||+|++|||+|+++++++++++|.+++|++++||+|++|||+++++||+|+|||||+|+|.+++ T Consensus 471 r~~~~~~~~d~y~a~Dlt~~~~hl~~~~~~~~~a~~~~~~~~v~~~~~~g~l~~~~y~~~~~eq~v~aW~~~~~~g~~~~ 550 (794) T protein:vir:99 471 RFYAVQDVTQVKNAEDISAHVPYYVENGVFKMSGSSTENFLTILTEGNEQRVYFYKFLYLQEQLVQQSWSHWDFGVNCRV 550 (794) T ss_pred EeeeeccccCceehhhHHHHHHHhcCCCeEEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceEEEeEEEEEcCCCeEE Confidence 77543488999999999999999999999999999999999999999999999999999999999999999999999888 Q ss_pred EEE--ECCcEEEEEEEcCCCeEEEEEEeeeccCC-cccccccceeeeeccceeeecCcce---------eEEeeccccCC Q lcl|NC_012662. 547 LHF--ARDRVVLFAADDAGSTDKITISTIDPKQG-GVTFDVDRLPHLDSMSIVPVNDGKG---------IVPIYMRPWVS 614 (780) Q Consensus 547 ~~~--~~d~l~~vv~R~~~g~~~~~~e~~~~~~~-~~~~~~~~~~~lD~~~~~~~~~~~~---------~~~~~~~~~~l 614 (780) +|+ .+|+||++|+|+++ +|+|||+.... .+..++++..||||+..+....+.. ..+...|+.|+ T Consensus 551 ~~~~~~~d~l~~~v~r~~~----~~ler~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l 626 (794) T protein:vir:99 551 LCCDMIGAVMHLIIDSPSG----VLMEKIEFTQNTKDYPDEPYRLYVDRKIEYTFPEGSYNDDDFKTRVKLKDIYGSTPA 626 (794) T ss_pred EEEEEcCCEEEEEEEeCCC----EEEEEEEeeeCCCCCCCcccceeeeeeeeeeecccccccCcceeEEecccccccccc Confidence 776 48999999999876 57777764443 2334567788999999886544332 23456889999 Q ss_pred CCeEEEEEecCccccceecc-ccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCceee----ecceEEEEE Q lcl|NC_012662. 615 EGKLTGSVATGALASEEVAI-DVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLIS----TAPVRLLRY 689 (780) Q Consensus 615 ~g~~v~~~adG~~~~~~~~~-~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~----~~r~~v~rv 689 (780) +|+++.+++||......... .+....+.+++++++++++|+|||+|+++++|+||++++++++.+. .||+||||+ T Consensus 627 ~g~~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~vGl~y~s~~~~~~~~~~~~~~~g~~~~~~~gr~~l~r~ 706 (794) T protein:vir:99 627 NGQYVFISLGGVTFTFDPPAGGWQANDGLIEFDGDLRGTKFFVGEAYTFLYEFSKFLIKTTDTADGVATEDIGRLQLRRA 706 (794) T ss_pred CCceEEEEeCCceeeeecccceEecCccEEEecCCCCCcEEEEeeeeeEEEeecceEEeecCCCCceeeeccceEEEEEE Confidence 99999999999876644332 3345567789999999999999999999999999999977654333 479999999 Q ss_pred EEEEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEE Q lcl|NC_012662. 690 ELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQSTEMYLSTDGTQDMNILEIEY 769 (780) Q Consensus 690 ~v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlai~~ 769 (780) +|+|.+||+|++.++++.++. .+.+++.++++.++.+|.+| ..||++++|+.+|+++.+|+|+|++|+||+|+||+| T Consensus 707 ~~~~~~tg~~~v~v~~~~~~~--~~~~~~~~~~~~~~~~g~~~-~~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~ 783 (794) T protein:vir:99 707 WVNYDKSGNFRVEVNNQGRTF--TYNMTGNRLSTNELILGDES-LDTGQFRYAVSGNATQVTVSLISDTPNPLSIIGGGW 783 (794) T ss_pred EEEeecccceEEEECCCccce--eeeccccccccccccccccc-cccceEEEEecccccceEEEEEECCCCCEEEEEEEE Confidence 999999999999999988764 45678888999999999876 579999999999999999999999999999999999 Q ss_pred EEEEecceecC Q lcl|NC_012662. 770 IIRYNQRRRRV 780 (780) Q Consensus 770 eg~y~~r~rrv 780 (780) ||+||+|+||| T Consensus 784 e~~y~~r~~~v 794 (794) T protein:vir:99 784 EGYYVRRSSGI 794 (794) T ss_pred EEEEeccccCC Confidence 99999999999 No 8 >protein:vir:80253 Length: 777 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522886;genbank:gi:158345179;genbank:GeneID:5687516 Probab=100.00 E-value=8.3e-220 Score=1221.66 Aligned_cols=761 Identities=31% Similarity=0.502 Sum_probs=663.7 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) ||+ |+|+||||++|||||+|++||++||++|+||+|+|+|||+||||++||++++......++.+ +..++++.|++| T Consensus 1 M~~-i~~~~~nf~~GvS~q~D~~ry~~q~~~~~N~~~~~~gG~~rRpGt~fv~~l~~~~~~~~~~~--~~~~~~~~e~~~ 77 (777) T protein:vir:80 1 MSY-FAGSYRQLLFGVSQQTAKDRLEGQVESQLNMQSDLVTGPRRRSPVHLIADAMAATDANRLAY--SLATFSGREVLL 77 (777) T ss_pred Cce-eeeecchhhcccccCCchHHhhhHHhhhhcceeeeccCceeCcchHhhhhhcCCCcccceeE--EEEecCCCeeEE Confidence 995 89999999999999999999999999999999999999999999999999987665555544 345677899999 Q ss_pred EEEEcCcEEEEEeCCCcEEEecCCCCccccCCcceEEEEEeCCEEEEecCCEeeeEeeccccc--CCCCcceEEEecCcc Q lcl|NC_012662. 81 VINTNTGGWWLLDREAKNIVSEGNLSYLLAADRRSIQTTSMGGVTYILNTEKRPSATTDNSDK--KDPKTTGFYFVKSGA 158 (780) Q Consensus 81 ~l~~~~g~~~v~d~~~~~~~~~~~~~y~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~--~~~~~~g~v~v~~g~ 158 (780) +++++++.|+|||+.+|.++..+..+|+++.++++|+|+|+||+|||+|++++|+++.+..+. +.+..++++++++++ T Consensus 78 ~l~~g~g~irv~~~~~g~~~~~~~~~Yl~a~~~~~l~~~q~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~~~v~~~~ 157 (777) T protein:vir:80 78 LVDTLDGTLTILDDATGEVLFTGTNSYLTAGTGRSIRFAALDDSVFVANTEVIPQTQLWSGASAYPDPTRAGYLYVVAGA 157 (777) T ss_pred EEEecCCeEEEEECCCCeEEEecCCCceeeccccceeEEEEcCEEEEEeCCccceeeecccCCCccCcccceEEEeeccC Confidence 999999999999999999999999999999889999999999999999999999999887654 566778999999999 Q ss_pred ccceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheecc-----cceEE-EcCeEEEEEcCCCceeEE Q lcl|NC_012662. 159 FSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVG-----VDFAV-RVGPYIYFELITGTDLKI 232 (780) Q Consensus 159 y~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g-----~~~~~-~~g~~i~~~~~s~~~~~v 232 (780) |+++|+|+++....+.+.+ .+.++......+.+.++++.+|..+..+.. .+|+. +.|+++++++.++. .+ T Consensus 158 ~g~~y~i~i~~~~~~~~~t--~~~~t~~~~~~~~~~~~ia~~L~~~~~~~~~~~s~~~~~~~~~g~~~~i~~~~~~--~~ 233 (777) T protein:vir:80 158 FSKQYRLSITNQVTGVTTS--VDVTTSATEASQATGEYVITQLRTAAEADATIGTAAGFAYYQDGAYLYVTAPEAI--AV 233 (777) T ss_pred CCceeeEeecCCcCceeEE--EecCCcccccccccchhhhhhhhhhhccccceeecCceEEEeCCcEEEEEecCce--eE Confidence 9999999997665554443 334455566677888999998876554432 34443 56788888877654 56 Q ss_pred EeecCCcceeEEEEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEecccceeEEcccceeEEeec Q lcl|NC_012662. 233 TSTSGSPYIGYSNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVTAISVDVPYKIVDD 312 (780) Q Consensus 233 t~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~~~~~~~~~~~p~~l~~~ 312 (780) +..+|.++...+..+++.++++||+++|.++.+++..+++++ ++||++|+..+++|+||++++...++ ++||+.|++. T Consensus 234 t~~~g~~~~~~~~~~~v~~~~~lp~~~~~~~~~~~~~~~~~~-~~~y~~~~~~~~~w~e~~~~~~~~~~-~t~p~~l~~~ 311 (777) T protein:vir:80 234 STDSGSNFLRASNAASIRDAAELPAKLPADADGFIIATGAAK-NKTYFRWVDLERKWDEDASRGAQAEL-IDMPLRITYS 311 (777) T ss_pred ecCCcCccceeeeeEEEeeccccccccccccceEEEeCCCCC-CceEEEEEccCcEEEEeecccccccc-cccceEEEec Confidence 677777777777788999999999999999999998877654 67999999999999999999998877 5999999864 Q ss_pred c----ccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEE Q lcl|NC_012662. 313 N----VEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLSGAYVCMSATGEPDRFFRSTVSSLDPTDRIDIAS 388 (780) Q Consensus 313 ~----~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~t~~~~~ddD~i~~~~ 388 (780) . ++..+|++|.+||+++||+|||++ ++|++|+||||||+|++|++|||||+||||||+++|++++.|||||++++ T Consensus 312 ~~~~~~~~~~w~~r~~gd~~tn~~Psf~g-~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdDpI~~~~ 390 (777) T protein:vir:80 312 APNFSLTALNYERRASGDATSNPALKFTE-QGISGMTTMQGRLVLLAGEYVCMSASGNPLRWFRASVSTQSDDDPIEVAA 390 (777) T ss_pred CCceEeeccCCccccccccccCCCceecC-CceeEEEEEcceeeeecCCeEEEEeccCccccccccccCCCCCccEEEEE Confidence 3 567789999999999999999997 78999999999999999999999999999999999999999999999999 Q ss_pred cCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCceEEEEEe Q lcl|NC_012662. 389 GSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFSAVLEL 468 (780) Q Consensus 389 ~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~ 468 (780) +++++|+|+|+++++++|+|||+++||+|+| +++|||+|++++++|+|+|+++|+|+.+|++++|+++|++++++|||| T Consensus 391 ss~~~~~i~~~v~~~~~L~i~T~~~e~~l~~-~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~Fv~~r~g~~s~v~e~ 469 (777) T protein:vir:80 391 TAPVASPYEYAVAFNKDLVLFAKTHQGLVPG-ANLLTSRNATAAVVTEYSFQNSCSPVVAGRTVFFASPRSGPWSAVWEM 469 (777) T ss_pred cCCcceeeeeeeecCCcEEEEecCceEEEeC-CCcccceeEEEEEEEeeccCCCCCceEeCCeEEEEecCCCceeEEeee Confidence 9999999999999999999999999999998 579999999999999999999999999999999999999889999999 Q ss_pred eeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeeeEeeccCCcEEEEE Q lcl|NC_012662. 469 VPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAWHKWVFPYRVASLH 548 (780) Q Consensus 469 ~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~w~~~G~v~~~~ 548 (780) +|+++++++|+++|||+|++|||++++.++ +++++|..++|++++||+|++|||+++++||+|+|||||+|+|+|++|| T Consensus 470 ~~~~~~~d~y~a~Dlt~~~~hl~~~~v~~~-a~s~~p~~v~~~~~~dg~l~~~ty~~~~~e~~v~aW~r~~~~g~v~~v~ 548 (777) T protein:vir:80 470 LPSQYTDAQVEASDSTSHLPKYIAGPVRFL-ATSSTTSIVVVGTSNLRELVVHEYLWQGGEKVHAAWHKWSFPQDITGAY 548 (777) T ss_pred eecccccCceehhHHHHHHHHhcCCceEEE-EEcCCCceEEEEEcCCCeEEEEEEeecCCceEEEeeEEeccCCcEEEEE Confidence 999889999999999999999999986655 6667777789999999999999999999999999999999999999999 Q ss_pred EECCcEEEEEEEcCCCeEEEEEEeeeccCCcccccccceeeeeccceeeec-CcceeEEeeccccCCCCeEEEEEecCcc Q lcl|NC_012662. 549 FARDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVN-DGKGIVPIYMRPWVSEGKLTGSVATGAL 627 (780) Q Consensus 549 ~~~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~-~~~~~~~~~~~~~~l~g~~v~~~adG~~ 627 (780) +++|+||++|+|+. ++|||||++....+...+ ..+||||....... ++..+++....+++.++....+..+|.. T Consensus 549 ~i~d~l~~iv~r~~----~~~le~~~~~~~~d~~~~-~~~~~D~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~ 623 (777) T protein:vir:80 549 FRGDRLILLFHVAG----RVILGELFMQRLGDAQSI-PGGFLDLYRVGAANADEEVAIPAFAADLYPEDSTFAYKLSGEF 623 (777) T ss_pred EECCEEEEEEEcCC----eEEEEEEeeccCCCCccc-ceeeeeeeeeeeeeeCCccceeEeeccccCCcceeEEEecCcc Confidence 99999999999963 599999988777655444 35799997543322 3444556666777777766666666543 Q ss_pred ccc---eeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCceeeecceEEEEEEEEEeccccEEEEec Q lcl|NC_012662. 628 ASE---EVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLISTAPVRLLRYELTTRNTGEFDVRIV 704 (780) Q Consensus 628 ~~~---~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~~~r~~v~rv~v~~~~T~~~~v~v~ 704 (780) ... ...........+++++++.++++|+|||+|+++++|+||++++++|++++.+|+||+|++|+|++|++|++.|+ T Consensus 624 ~~~~~~~~~v~~~~~~~~~~v~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~g~~~~~~r~~i~r~~~~~~~sg~~~v~v~ 703 (777) T protein:vir:80 624 QSLGQRCGDRRVDGATVYIKVVGAQAGDQYRIGLRYLSKLGPTRPILRDPNGVPITTERTQLHRLTWSLDSTGEVTFRVA 703 (777) T ss_pred cccceeeeeEEeCCceeeEEEcCCCCCCEEEEeeeeEEEEEeCceEEeCCCCceeeecCeEEEEEEEEeeccccEEEEEc Confidence 221 11112223345688999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCcceecccCceecccccccCCcccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecceecC Q lcl|NC_012662. 705 DPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQSTEMYLSTDGTQDMNILEIEYIIRYNQRRRRV 780 (780) Q Consensus 705 ~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlai~~eg~y~~r~rrv 780 (780) ++.++ .+.+.+++.++++.++.+|.|| ..+|++++|+.+|+++.+|+|+|++|+||+|+||+|||+||+|+||= T Consensus 704 ~~~~~-~~~~~~~~~~~~~~~~~~g~~~-~~tg~~~vp~~~~~~~~~v~i~~d~P~P~tilsi~~e~~y~~r~~r~ 777 (777) T protein:vir:80 704 DQARG-ESAYTTTPLRLYSRDLGAGLPL-AATATLDTPARVDMQTAQFSLETDDYYDMNITSLEYGFRYNQRYRRQ 777 (777) T ss_pred CCCCc-ceeeeecCceeccccccccccc-ccceEEEEEEeecCcceEEEEEECCCCceEEEEEEEEEEeecccccC Confidence 98876 4567788999999999898866 57899999999999999999999999999999999999999999966 No 9 >protein:vir:97014 Length: 800 # NCBI annotation: 33 # Family: family:all:825 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654134;genbank:gi:108862018;genbank:GeneID:5075963 Probab=100.00 E-value=1e-219 Score=1221.12 Aligned_cols=754 Identities=22% Similarity=0.324 Sum_probs=665.9 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) |- |+|+||||++|||||||++||++||++|+||+|+|+|||+||||++||+++++.....++. .|++++++.||+| T Consensus 1 ~~--v~~s~~n~~~GvSqq~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~fva~l~~~~~~~~~~--~~~~~~d~~eq~~ 76 (800) T protein:vir:97 1 ME--VQGSLGRQIQGISQQPPAVRLDGQCTAMVNMIPDVVNGTQSRMGTTHIAKILDAGTDDMAT--HHYRRGDGDEEYF 76 (800) T ss_pred Ce--eEeechhhhcccccCchhHhhhhhhhhhhcceeccccccccCCchhhheeecCCCccccee--EEEEEcCCceEEE Confidence 86 8999999999999999999999999999999999999999999999999998876654443 3456688999999 Q ss_pred EEEEcCcEEEEEeCCCcEEEecCCCCc---c--ccCCcceEEEEEeCCEEEEecCCEeeeEeecccccCCCCcceEEEec Q lcl|NC_012662. 81 VINTNTGGWWLLDREAKNIVSEGNLSY---L--LAADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTTGFYFVK 155 (780) Q Consensus 81 ~l~~~~g~~~v~d~~~~~~~~~~~~~y---~--~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~g~v~v~ 155 (780) +++++++.++||+.+|..+.+.+..+| + ++.++++|+++|+||+|||+|++++|++.++..+ .+..+++++++ T Consensus 77 v~~~~~~~~rv~~~~G~~~~v~~~~~~~~y~~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~~--~~~~~~~~~v~ 154 (800) T protein:vir:97 77 FTLKKGQVPEIFDKYGRKCNVTSQDAPMTYLSEVVNPREDVQFMTIADVTFMLNRRKVVKASSRKSP--KVGNKAIVFCA 154 (800) T ss_pred EEEEcCCEEEEEecCCcEEEEecCCcceEEEeccCCCccceeEEEEcCEEEEeeCceeccccccccc--CCCcceEEEEe Confidence 999999999999999888776666543 2 3456789999999999999999999998766543 56788999999 Q ss_pred CccccceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheecc--cceEE-EcCeEEEEEcCCCceeEE Q lcl|NC_012662. 156 SGAFSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVG--VDFAV-RVGPYIYFELITGTDLKI 232 (780) Q Consensus 156 ~g~y~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g--~~~~~-~~g~~i~~~~~s~~~~~v 232 (780) +|+|+++|+|+| ++..++++++++++++..++++++++|+.+|...+.+.+ ..|+. ..|.++++.+..+.++.+ T Consensus 155 ~g~y~~~y~i~I---~~~~~~~~~t~~~t~~~~~~~~~~~~ia~ql~~~~~~~~~~~~~t~~~~G~~~~i~~~~~~~~~v 231 (800) T protein:vir:97 155 YGQYGTSYSIVI---NGANAASFKTPDGGSADHVEQIRTERITSELYSKLQQWSGVSDYEIQRDGTSIFIERRDGASFTI 231 (800) T ss_pred ecccceeeeecc---CCcceEEEEEcCCCCcccceeccHHHHHHHHHHhhhccccccceEEEeCCcEEEEEEcCCceEEE Confidence 999999999999 566788999999999999999999999999998887654 44543 678899999988889999 Q ss_pred EeecCCcceeE-EEEEeecceecccccccCCceEEEEeccCCCCCceEEEEEec---CccEEEEecccceeEEc-cccee Q lcl|NC_012662. 233 TSTSGSPYIGY-SNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSE---KGVWLESGDYNSVTAIS-VDVPY 307 (780) Q Consensus 233 t~~~g~~~~~~-~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~---~g~w~e~~~~~~~~~~~-~~~p~ 307 (780) ++.++..+..+ ...++++++.+||+++|+|++++|+++++++.+.||++|+.. .++|+|+++++...+++ ++||| T Consensus 232 ~t~~g~~~~~~~~~~~~v~~~~~lp~~~~~g~~v~i~~~~~~~~~~y~~~~~~~~~~~~~w~e~~~~~~~~~~~~~tmp~ 311 (800) T protein:vir:97 232 TTTDGAKGKDLVAIKNKVSSTDLLPSRAPAGYKVQVWPTGSKPESRYWLQAEPKEGNLVSWKETIAADVLLGFDKGTMPY 311 (800) T ss_pred EecCCcCceeeeEEeeeccchhhchhhCCCCcEEEEEccCCCCCceEEEEEEecccCcceEEEeeccccccceecccceE Confidence 99999887665 457789999999999999999999999999999999999975 45799999999999987 58999 Q ss_pred EEeecc---------ccccccchhhcCCcccCCCcccccC---CCceEEEEEcceEEEecCCeEEEEccCCccccccccc Q lcl|NC_012662. 308 KIVDDN---------VEQHIMEGRLAGDDLTNPAPTFLEE---RRITGIGTFQGRLVLLSGAYVCMSATGEPDRFFRSTV 375 (780) Q Consensus 308 ~l~~~~---------~~~~~w~~~~~gd~~t~~~psf~~~---~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~t~ 375 (780) .+++++ ++..+|++|.+||+++||+|+|++. ++|++|+||||||+|++|++|||||+||||||+++|+ T Consensus 312 ~~~~~~~~~~~g~~~~~~~~w~~r~~gd~~tnp~p~f~~~~~~~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~ 391 (800) T protein:vir:97 312 IIERTDIINGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTIGGMFMVQNRLCFTAGEAVIASRTSYFFDFFRYTV 391 (800) T ss_pred EEEEeecccccceeEEEeccccccccCccccCccccccCCcCCCCceeEEEEeeeEEEecCCeEEEEecCCccccccccc Confidence 999863 4667899999999999999999973 6899999999999999999999999999999999999 Q ss_pred cCCCCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEEEEeeecCCCCCCcEEeCCeEEEE Q lcl|NC_012662. 376 SSLDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYP 455 (780) Q Consensus 376 ~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~ 455 (780) +++.|||||+++++++++|+|+|+++++++|+|||+++||+|+| +++|||+|++++++|+|+|+++|+|+.+|++++|+ T Consensus 392 ~~~~DdD~I~~~~ss~~v~~i~~~v~~~~~L~i~T~~~q~~ls~-~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~fv 470 (800) T protein:vir:97 392 ISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFILPG-DKPLEKSNALLKPVTTFEVNNKVKPVVTGESVMFA 470 (800) T ss_pred cCCCCCccEEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeC-CCcccceeEEEEEEEeeeccCCCCcEEeCCeEEEe Confidence 99999999999999999999999999999999999999999998 57999999999999999999999999999999999 Q ss_pred EecCCceEEEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeee Q lcl|NC_012662. 456 APRSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAW 535 (780) Q Consensus 456 ~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW 535 (780) +++| ++++||||.|+ +.+|+|+++|||+|++|||+|+++++++++++|.+++|+++++|+|++|+|+++++||+|+|| T Consensus 471 ~~~g-~~s~vre~~~~-~~~d~~~a~DlT~~~~hl~~~~v~~~~~~~~~~~~v~~~~~~~~~l~~~~y~~~~~e~~~~aW 548 (800) T protein:vir:97 471 TNDG-SYSGVREFYTD-SYSDTKKAQAITSHVNKLIEGNITNMAASTNVNRLLVTTDKYRNIIYCYDWLWQGTDRVQSAW 548 (800) T ss_pred eCCC-CeeEEEEEeee-ecccceehhhHHHHHHHhcCCceEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceEEEeE Confidence 8876 67899999997 678999999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeccCC--cEEEEEEECCcEEEEEEEcCCCeEEEEEEeeeccCCcccccccceeeeeccceee-----e---------- Q lcl|NC_012662. 536 HKWVFPY--RVASLHFARDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVP-----V---------- 598 (780) Q Consensus 536 ~~w~~~G--~v~~~~~~~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~-----~---------- 598 (780) |||+++| .++++++++|+||++|+|+++ +|||||++.+..+ ...++.++||+..+.. . T Consensus 549 ~~~~~~~~~~~~~~~~~~d~l~~vv~r~~~----~~ler~~~~~~~~-~~~~~~~~lD~~~~~~~~~~~~~~~~v~~~~~ 623 (800) T protein:vir:97 549 HVWKWPIGTKVRGMFYSGELLYLLLERGDG----VYLEKMDMGDALT-YGLNDRIRMDRQAELVFKHFKAEDEWVSEPLP 623 (800) T ss_pred EEEecCCCeEEEEEEEcCCeEEEEEEcCCc----EEEEEEecccCcC-cccccceeccccceeeeeeeecccceEecccc Confidence 9999976 677888889999999999864 8999998765433 3445566777543221 0 Q ss_pred ----cCcceeEEeeccccCCCCeEEEEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecC Q lcl|NC_012662. 599 ----NDGKGIVPIYMRPWVSEGKLTGSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQ 674 (780) Q Consensus 599 ----~~~~~~~~~~~~~~~l~g~~v~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~ 674 (780) ...........++.|++|.++.+. ++...+ ..+...+.+.+++++++|||||+|+++++|+||+++++ T Consensus 624 ~~~~~~~~~~~~~v~g~~~~~G~~v~~~-~~~~~~-------~~~~~~~~~~~~~~~~~v~vGl~Y~~~~~~~p~~i~~~ 695 (800) T protein:vir:97 624 WVPTNPELLDCILIEGWDSYIGGSFLFK-YNPSDN-------TLSTTFDMYDDSHVKAKVIVGQIYPQEFEPTPVVIRDN 695 (800) T ss_pred ccCCCcceeEEEEecccccccCceEEEE-ecCccC-------cccccceEEeCCCCCcEEEEeeeeeEEEEecceEEEec Confidence 011122344578999999998655 443222 12233567888999999999999999999999999999 Q ss_pred CCceeeecceEEEEEEEEEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEeecccceeEEEE Q lcl|NC_012662. 675 NDTLISTAPVRLLRYELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQSTEMYL 754 (780) Q Consensus 675 ~g~~~~~~r~~v~rv~v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~~~~v~i 754 (780) +|+++..+|+||+|++|+|.+|++|++.|++..++....+.++++++++..+.+|.+| +++|++++||.+|+++.+|+| T Consensus 696 ~g~~~~~~r~~i~r~~~~~~~sg~~~~~v~~~~~~~~~~~~~~~~~~g~~~~~~g~~~-~~tg~~~vp~~g~~~~~~v~i 774 (800) T protein:vir:97 696 QDRVSYIDVPVVGLVHLNLDMYPDFSVEVKNVKSGKVRRVLASNRIGGALNNTVGYVE-PREGVFRFPLRAKSTDVVYRI 774 (800) T ss_pred CCCceeecceEEEEEEEeecccccEEEEEccccCCceeeeecCccccccccccCCccc-cccceEEEEeecccceeEEEE Confidence 9999999999999999999999999999999998876667788899999999999876 689999999999999999999 Q ss_pred EECCCCCEEEEEEEEEEEEecceecC Q lcl|NC_012662. 755 STDGTQDMNILEIEYIIRYNQRRRRV 780 (780) Q Consensus 755 ~~~~P~P~tvlai~~eg~y~~r~rrv 780 (780) +|++|+||+|+||+|||+||+|+||| T Consensus 775 ~~d~PlP~tvlsi~~eg~y~~r~~rv 800 (800) T protein:vir:97 775 IVESPHTFQLRDIEWEGSYNPTKRRV 800 (800) T ss_pred EECCCCcEEEEEEEEEEEeecccccC Confidence 99999999999999999999999999 No 10 >protein:vir:3366 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523337;swissprot:trembl:q8w5u3;genbank:gi:17570828;goa:Q8W5U3;uniprot:Q8W5U3;genbank:GeneID:927453 Probab=100.00 E-value=3.3e-219 Score=1218.38 Aligned_cols=764 Identities=19% Similarity=0.301 Sum_probs=670.4 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) || +|+|+||||++|||||||++||++||++|+||+|+|+|||+||||++||++++.+.. .+..+++|+++|++.|+ | T Consensus 1 M~-~i~~~~~nl~~GvSqq~d~~r~~~q~~~~~N~~~~~~gG~~rRpGt~~va~~~~~~~-~~~~~~~~~~~r~~~~~-y 77 (801) T protein:vir:33 1 MA-LISQSIKNLKGGISQQPDILRFTEQGSVQINGWSSESEGIQKRPPMIHLKTLGTAGY-VGAQPYVHLINRDEFEQ-Y 77 (801) T ss_pred Cc-eeEeeccceecceeccchhHhhhhhHhhhhcceeecccCcccCchhHhhhhhcCCCc-cccceEEEEEEeCCceE-E Confidence 99 599999999999999999999999999999999999999999999999999987644 46789999999976555 5 Q ss_pred EEEEcCcEEEEEeCCCcEEEecCCCCccccCC-cceEEEEEeCCEEEEecCCEeeeEeecccc--cCCCCcceEEEecCc Q lcl|NC_012662. 81 VINTNTGGWWLLDREAKNIVSEGNLSYLLAAD-RRSIQTTSMGGVTYILNTEKRPSATTDNSD--KKDPKTTGFYFVKSG 157 (780) Q Consensus 81 ~l~~~~g~~~v~d~~~~~~~~~~~~~y~~~~~-~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~--~~~~~~~g~v~v~~g 157 (780) +++++++.|+|||.+|+.+.+.+..+|+.+++ .++|+++|+||+|||+|++++|++..+..+ ...+..+++++++++ T Consensus 78 ~l~~~~~~irv~~~~G~~~~v~~~~~y~~~~~~~~~l~~~t~aD~~fi~nr~~~p~~~~~~~~~~~~~~~~~~li~v~~~ 157 (801) T protein:vir:33 78 FVVFTGEDIKVFDLDGKEYQVRGDRSYVRTANPREDLRMVTVADYTFVTNRKVVVQSNDQSVNLPGFKDQGDALINVRGG 157 (801) T ss_pred EEEEcCCeEEEEccCCcEEEEecCCcceeecCcchheEEEEEcCEEEEeeCCeeecccCCcccccccCCCcceEEEEeec Confidence 67888999999999988888888888876655 467999999999999999999998765433 344566899999999 Q ss_pred cccceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheec---------ccceEEEcC-eEEEEEcCCC Q lcl|NC_012662. 158 AFSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAV---------GVDFAVRVG-PYIYFELITG 227 (780) Q Consensus 158 ~y~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~---------g~~~~~~~g-~~i~~~~~s~ 227 (780) +|+++|+|++ +++..+++++++++.+...++.+.++++.++..++.+. ...|+...+ .++++.++++ T Consensus 158 ~yg~t~~I~i---~gs~~~~~~~~~gs~~~~v~~~s~~~~A~~l~~~~~~~~~~~~~~~~~~~w~~~~~~g~~~i~~p~~ 234 (801) T protein:vir:33 158 QYGRRLSIEF---NGAERAAVQLPDGSQPAHVNEVDGQAIAEKLAAQLRNNLGNPNNDQDPNKWRFNVGPGFIHILAPNN 234 (801) T ss_pred ccceEEEEEE---CCcceEEEEeeccccccccccccchhhhhhhhhhhhccCccceeeecCceEEEEecCeEEEEecCCC Confidence 9999999999 45567888899998888888888888888887665543 234554443 3455666555 Q ss_pred ce-eEEEeecCCcceeE-EEEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEecccceeEEc-cc Q lcl|NC_012662. 228 TD-LKITSTSGSPYIGY-SNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVTAIS-VD 304 (780) Q Consensus 228 ~~-~~vt~~~g~~~~~~-~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~~~~~~~~-~~ 304 (780) .. +.+++.++...... ...++++++++||.++++|+.++|+..+++..+.||++|+..++.|+||++++...+++ ++ T Consensus 235 ~~~~~itt~~g~~~~~~~~~~~~v~~~~~lp~~~~~g~~v~v~~~~~~~~~~y~v~~~~~~~~w~e~~~~g~~~~~~~~t 314 (801) T protein:vir:33 235 DNVWGLQTKDGYADQLINPVTHYTQSFQKLPINAPDGYIVKIVGDTSKTADQYYVRFDLNRKVWVETIGWNTRTHLHYHT 314 (801) T ss_pred cccccccccCCccceeEEEEeecccceeeeeeecCCCcEEEEEecCCCcccceEEEEEcCCcEEEEeeccccceeeeecc Confidence 44 45777777665444 45688999999999999999999999999999999999999999999999999999997 58 Q ss_pred ceeEEeecc-----ccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEecCCeEEEEccCCccccccccccCCC Q lcl|NC_012662. 305 VPYKIVDDN-----VEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLSGAYVCMSATGEPDRFFRSTVSSLD 379 (780) Q Consensus 305 ~p~~l~~~~-----~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~t~~~~~ 379 (780) |||+|++.. ++..+|+.|.+||+++||+|+|++ ++|++|+||||||+|++|++|||||+||||||+++|+++++ T Consensus 315 mp~~l~~~~~~tf~~~~~~w~~r~~gd~~tnp~psf~g-~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~ 393 (801) T protein:vir:33 315 MPWALVRASDGNFDFKYLEWGARTVGDDTTNPYPSFTG-QTINDIFFFRNRLGFLSGENIILSRTSKYFNFFPASVSNYS 393 (801) T ss_pred cceEEEEccCceEEecccCccccccCCccccCcccccC-CCceEEEEEcceEEEeeCCeEEEEecCCccccccccccCCC Confidence 999999754 467789999999999999999987 79999999999999999999999999999999999999999 Q ss_pred CCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEecC Q lcl|NC_012662. 380 PTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRS 459 (780) Q Consensus 380 ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g 459 (780) |||||+++++++++|+|+|+++++++|+|||+++||+|+| +++|||+|++++++|+|+|+++|+|+.+|++++|++++| T Consensus 394 DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~l~~-~~~lTP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g 472 (801) T protein:vir:33 394 DDDPIDVAVSHDRVSTLKYAVPFSEELLLWSDQAQFVLTA-SDILSSRSVGLNLTTQFDVQDRARPHGVGRNVYFSSPRA 472 (801) T ss_pred CCccEEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeC-CCcccceeEEEEEEEeecccCCCCceEecCeEEEEecCC Confidence 9999999999999999999999999999999999999998 579999999999999999999999999999999998876 Q ss_pred CceEEEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeeeEeec Q lcl|NC_012662. 460 EAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAWHKWV 539 (780) Q Consensus 460 ~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~w~ 539 (780) ++++++|+++.++.+|+|+++|||+|++|||+|++++++++++++++++|+++++|+|++|+|+++++||+|+|||||+ T Consensus 473 -~~~~v~r~~~~~~~~d~y~~~Dlt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~y~~~~~e~~v~aW~~~~ 551 (801) T protein:vir:33 473 -SFTSINRYYAVQDVSSVKNAEDMTAHVPNYIPNGVFSISGTTAENFVAILTSGAPNRVYIYKFLYIDEEIRQQSWSHWD 551 (801) T ss_pred -CeeEEEEEEeecccccceehhhHHHHHHHhcCCceEEEEEcCCCCeEEEEEecCCCEEEEEEEecCCCceEEEeeEEEE Confidence 6788888766668899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCcEEEEEE--ECCcEEEEEEEcCCCeEEEEEEeeeccCCc-ccccccceeeeeccceeeecCcceeE---------Ee Q lcl|NC_012662. 540 FPYRVASLHF--ARDRVVLFAADDAGSTDKITISTIDPKQGG-VTFDVDRLPHLDSMSIVPVNDGKGIV---------PI 607 (780) Q Consensus 540 ~~G~v~~~~~--~~d~l~~vv~R~~~g~~~~~~e~~~~~~~~-~~~~~~~~~~lD~~~~~~~~~~~~~~---------~~ 607 (780) |+|.++++|+ .+|+|||+|+|+++ .++|||+..... +....++++|||++..++...+.+.. +. T Consensus 552 ~~g~~~~~~~~~~~d~l~~vv~r~~~----~~le~~~~~~~~~d~~~~~~~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~ 627 (801) T protein:vir:33 552 FGDNVTVFAAQVINSTMTVLMSNEHA----VWMGRLHFTKDSIDLPGEPYRLYIDAKRKYTIPAGTYNDDTYQTSISLST 627 (801) T ss_pred cCCCEEEEEEecCCCEEEEEEEcCCc----EEEEEEEEeeccccCCCccceEEeecceEEEecccceecCcccccccccc Confidence 9999988885 68999999999753 678888643322 22345678899998877655433322 22 Q ss_pred eccccCCCCeEEEEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCC----ceeeecc Q lcl|NC_012662. 608 YMRPWVSEGKLTGSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQND----TLISTAP 683 (780) Q Consensus 608 ~~~~~~l~g~~v~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g----~~~~~~r 683 (780) ..|+.|+||+++.+++||.+....+...++....++++++++++++|+|||+|+++++|+||+++.++| +++..+| T Consensus 628 ~~gl~~~eg~~v~~~~dG~v~~~~~~~~~~~~~~~l~i~~~~~~~~v~vGl~y~s~~~~~~~~~~~~~~~~~~~~~~~~r 707 (801) T protein:vir:33 628 IYGMNFTKGRVSVVFPDGKIVEIDQPINGWSSDPMLRLDGNQEGQVVYIGFNIPFTYTFSKFLIKKTAEDGSTATEDIGR 707 (801) T ss_pred ccCCccccceEEEEEeCCceEeeeeccccccCceeEEecCCCCCCEEEEeeeeeEEEEeCceEEeccCCCCceeeeeecc Confidence 358999999999999999998777777777777889999999999999999999999999999995543 5667789 Q ss_pred eEEEEEEEEEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEeecccceeEEEEEECCCCCEE Q lcl|NC_012662. 684 VRLLRYELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQSTEMYLSTDGTQDMN 763 (780) Q Consensus 684 ~~v~rv~v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~t 763 (780) +||+|++|++.+|++|++.|+++.++ +.+.++++++++.++.++.+|+ .+|++++|+.+|+++.+|+|+|++|+||+ T Consensus 708 ~~l~r~~~~~~~tg~~~v~v~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-~tg~~~vp~~g~~~~~~v~i~~d~P~P~t 784 (801) T protein:vir:33 708 LQLRRAWVNYEDSGAFIIRVNNLSRE--FIYTMAGARLGSDNLRVGGSNI-GTGQYRFPVVGNAQTNTVTIESDASTPLN 784 (801) T ss_pred EEEEEEEEEeecCcceEEEECCcccc--eeeeeccccccccccccccccc-ccceEEEEeeccCceEEEEEEeCCCCCEE Confidence 99999999999999999999988764 4567889999999999998875 79999999999999999999999999999 Q ss_pred EEEEEEEEEEecceecC Q lcl|NC_012662. 764 ILEIEYIIRYNQRRRRV 780 (780) Q Consensus 764 vlai~~eg~y~~r~rrv 780 (780) |+||+|||+||+|+||| T Consensus 785 vl~i~~eg~y~~r~~~~ 801 (801) T protein:vir:33 785 IIGCGWEGNYLRRSSGI 801 (801) T ss_pred EEEEEEEEEEeccccCC Confidence 99999999999999999 No 11 >protein:vir:7021 Length: 803 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853594;genbank:gi:31711676;genbank:GeneID:1481802 Probab=100.00 E-value=2.9e-218 Score=1213.22 Aligned_cols=761 Identities=21% Similarity=0.320 Sum_probs=660.0 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCC--Ccc Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGA--DGR 78 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~--~~~ 78 (780) |- |+|+||||++|||||||++||++||++|+||+|+|+|||+||||++||++++.... ..+++|+++|++ .|+ T Consensus 1 ~~--v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~~va~l~~~~~---~~~~~~~~~~~~~~~e~ 75 (803) T protein:vir:70 1 ME--VQGSLGRQIQGISQQPPAVRLDGQCSEMVNMVPDVVEGTKSRMGTTHIAKLLEYGE---DDMAVHHYRRGGEGEEE 75 (803) T ss_pred Ce--EEeecchhccccccCchHHhhhhhhhhhhcceeeeccccccCChhhhhhhhcCCCc---ccceeeEEEecCCCceE Confidence 86 89999999999999999999999999999999999999999999999999876543 355667777754 569 Q ss_pred EEEEEEcCcEEEEEeCCCcEEEecCCCCcc-----ccCCcceEEEEEeCCEEEEecCCEeeeEeecccccCCCCcceEEE Q lcl|NC_012662. 79 HLVINTNTGGWWLLDREAKNIVSEGNLSYL-----LAADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTTGFYF 153 (780) Q Consensus 79 ~y~l~~~~g~~~v~d~~~~~~~~~~~~~y~-----~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~g~v~ 153 (780) +|++++++++||||+++|+.+.+.+..+|. ++...++|+++|+||+|||+|++++|++..+.. ..+..+|+++ T Consensus 76 ~~~~~~~~~~irv~~~~G~~~~v~~~~~~~~~l~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~--~~~~~~~~~~ 153 (803) T protein:vir:70 76 YFFIMKKGQVPEIFDKQGRKCMVQSQDAPMTYLSEVTNPREDVQFMTIADVTFMLNRKKIVKARPERS--PQVGSTAIVF 153 (803) T ss_pred EEEEEecCCeEEEEEcCCcEEEEecCCceeEEEeecCCChhheeEEEEcCEEEEecCceeeeeccccC--CCCCCceEEE Confidence 999999999999999999888877766542 334567899999999999999999999876554 3566789999 Q ss_pred ecCccccceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheecc--cceEE-EcCeEEEEEcCCCc-e Q lcl|NC_012662. 154 VKSGAFSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVG--VDFAV-RVGPYIYFELITGT-D 229 (780) Q Consensus 154 v~~g~y~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g--~~~~~-~~g~~i~~~~~s~~-~ 229 (780) +++++|+++|.|+| ++..++++++++++.+..+.++++++|+.++...+.+.+ .+|+. +.|.++++.+.++. . T Consensus 154 vr~g~y~~~y~itI---ng~~~a~~~t~~~~~~~~~~~~~~~~ia~~l~~~~~~~~s~a~~~~~~~g~~~~i~~~~~~~~ 230 (803) T protein:vir:70 154 MAYGQYGTHYKIII---DGVVAAGYKTRDGAEAHHIEDIRTESIAYNLYQSLQSWDKIADYEIQLDGTSIYITRRDGSTT 230 (803) T ss_pred EeecCCcceEEEEe---CCcceEEEEeCCCcccccccccchhhhhhhhhhheeccccccceEEEECCcEEEEEEcCCCCe Confidence 99999999999999 456688899999999999999999999999988876654 45654 56888999887754 5 Q ss_pred eEEEeecCCcceeEE-EEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecC---ccEEEEecccceeEEc-cc Q lcl|NC_012662. 230 LKITSTSGSPYIGYS-NQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEK---GVWLESGDYNSVTAIS-VD 304 (780) Q Consensus 230 ~~vt~~~g~~~~~~~-~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~---g~w~e~~~~~~~~~~~-~~ 304 (780) +.+++.+|..+..++ ..+.++++++||++||++++++|++++++..+.||++|+..+ ++|+|+++++...+++ ++ T Consensus 231 ~~~~t~~g~~~~~~~~~~~~v~~~~~Lp~~~~~g~~v~v~~~g~~~~d~y~v~~~~~~~~~~~w~e~a~~g~~~~~~~~t 310 (803) T protein:vir:70 231 FDITTEDGAKGKDLVAIKYKVASTDLLPSRAPEGYKVQVWPTGSKPESRYWLQAEKQNGNIVSWKETLAADVLIGFDKST 310 (803) T ss_pred eEEEeecCcCCcEEEEEEecccceeeccccCCCCceEEEEcCCCCCCceeeEEEEeccCCccceEeeeccceeeeeeccc Confidence 788888887776665 478899999999999999999999999999999999998765 4799999999999987 68 Q ss_pred ceeEEeec---------cccccccchhhcCCcccCCCccccc---CCCceEEEEEcceEEEecCCeEEEEccCCcccccc Q lcl|NC_012662. 305 VPYKIVDD---------NVEQHIMEGRLAGDDLTNPAPTFLE---ERRITGIGTFQGRLVLLSGAYVCMSATGEPDRFFR 372 (780) Q Consensus 305 ~p~~l~~~---------~~~~~~w~~~~~gd~~t~~~psf~~---~~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~ 372 (780) |||.+++. .++..+|+.|.+||++|||+|+|++ +++|++|+||||||+|++|++|||||+||||||++ T Consensus 311 ~p~~~v~~~~~~~~~~~~~~~~~~~~r~~gdd~tnp~psf~~~~~~~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~ 390 (803) T protein:vir:70 311 MPYIIERTGFVNGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTLGGMFMVQNRLCVTAGEAVIATRTSYFFDFFR 390 (803) T ss_pred ccEEEEEEEEeecceeEEEEeeccccccccccccCccccccCccCCCCceeEEEEeceEEEeeCCeEEEEccCCcccccc Confidence 99999873 4678889999999999999999987 35799999999999999999999999999999999 Q ss_pred ccccCCCCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEEEEeeecCCCCCCcEEeCCeE Q lcl|NC_012662. 373 STVSSLDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSDLACNAFVAPVTTSQTL 452 (780) Q Consensus 373 ~t~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v 452 (780) +|+++++|||||+++++++++|+|+|+++++++|+|||+++||+|+| +++|||+|++++++|+|+|+++|+|+.+|+++ T Consensus 391 ~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~i~T~~~q~~l~g-~~~lTP~~~~i~~~s~~~~~~~~~Pv~vg~~v 469 (803) T protein:vir:70 391 YTAVSAVATDPFDVFSDASEVYQLKHAVTLDGSTVLFADKSQFILPG-DKPLEKSNVLLKPVTTFEVNNNVKPVATGESV 469 (803) T ss_pred ccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeC-CCcccceeEEEEEEEEeeccCCCccEEeCCeE Confidence 99999999999999999999999999999999999999999999998 56999999999999999999999999999999 Q ss_pred EEEEecCCceEEEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceee Q lcl|NC_012662. 453 MYPAPRSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVH 532 (780) Q Consensus 453 ~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v 532 (780) +|++++| ++++||||.|+ +.+|+|+++|||+|++|||++++++++++++++.+++|+.+++++|++|+|+++++||+| T Consensus 470 ~fv~~~g-~~s~vre~~~~-~~~d~y~a~Dlt~~a~hl~~~~v~~~~~~~~~~~~v~~~~~~~~~l~~~~yl~~~~e~~v 547 (803) T protein:vir:70 470 MFATSEG-AYSGIREFYTD-SYSDTKKAQAITSHVNKLLEGNVIMMSASTNVNRLLVLTDKYRNIIYCYDWLWQGTERVQ 547 (803) T ss_pred EEeccCC-CeeEEEEEecc-ccccceehhhhhhhhHhhcCCceEEEEEeCCCCeEEEEEEcCCCeEEEEEEEecCCcEEE Confidence 9998877 67899999997 678999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeEeeccCCcEEEEEEE--CCcEEEEEEEcCCCeEEEEEEeeeccCCcccccccceeeeeccceeeecCcceeEEeecc Q lcl|NC_012662. 533 QAWHKWVFPYRVASLHFA--RDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKGIVPIYMR 610 (780) Q Consensus 533 ~aW~~w~~~G~v~~~~~~--~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~ 610 (780) +|||||+|+|.++++|++ +|+|||+|+|+++| +|||+|++....+ .++++++||||+.++...... ....+.. T Consensus 548 ~aW~r~~~~g~~~~~~~~~~~d~l~~vv~r~~~g---~~ier~~~~~~~~-~~~~~~~~lD~~~~~~~~~~~-~~~~~~~ 622 (803) T protein:vir:70 548 AAWHKWEWPLGTFIRGMFYSGEHLYLLIERGSTG---VYLERMDMGDALV-YNLNDRIRMDRQAELIFRHIK-AEDVWVS 622 (803) T ss_pred EeEEEEEcCCCEEEEEEEecCCEEEEEEEECCCe---EEEEEEecccccc-cCCcceeEeccceeEeecccc-CCceeee Confidence 999999999999999887 79999999999876 5999999876533 357788999998766432111 1111222 Q ss_pred ccCCCCeE------------EEEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCce Q lcl|NC_012662. 611 PWVSEGKL------------TGSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTL 678 (780) Q Consensus 611 ~~~l~g~~------------v~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~ 678 (780) ..|+++.+ .....+|.+..............-+.+++++++++|+|||+|+++++|+||++++++|++ T Consensus 623 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~g~~t~~~~~~~~~~~~~a~~v~VGl~Y~~~~~~~~~~i~~~~~~~ 702 (803) T protein:vir:70 623 EPLPWQPTDVTLLDCVLIDGWDSYIGGSFLFSYNPGDNTLTTTFDMHDDDHVKAKVVVGQLYPQEFEPTQVVIRDNQERV 702 (803) T ss_pred ecccccCcccceeeEEEeeeeeeecCCeEEEEEcCCCccceeeeeEECCCCcccEEEEeeeeeEEEeecceEEEcCCCcc Confidence 23333322 122222322222221111112223567899999999999999999999999999999999 Q ss_pred eeecceEEEEEEEEEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEeecccceeEEEEEECC Q lcl|NC_012662. 679 ISTAPVRLLRYELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQSTEMYLSTDG 758 (780) Q Consensus 679 ~~~~r~~v~rv~v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~~~~v~i~~~~ 758 (780) +..+|+||+|++|++++|++|.++|+++.++..+.+.++++++++.++.+|.+| ..+|+++||+.+|+++.+|+|++++ T Consensus 703 ~~~~~~rl~r~~~~~~~sg~~~v~v~~~~~~~~~~~~~s~~~~g~~~~~~g~~~-~~tg~~~vP~~~~~~~~~v~i~~d~ 781 (803) T protein:vir:70 703 SYIDVPTVGLVHLNLDKYPDFKVEVKNLKSGKVRNVLASNRVGGAINNIVGYVE-PREGVFKFPLRSLSTDTVYRVMVES 781 (803) T ss_pred ccccccEEEEEEEEeecccceEEEEecCCccccceeeccchhccccccccCccc-cccceEEEEeeccCcceEEEEEECC Confidence 999999999999999999999999999888877777888999999999999887 4789999999999999999999999 Q ss_pred CCCEEEEEEEEEEEEecceecC Q lcl|NC_012662. 759 TQDMNILEIEYIIRYNQRRRRV 780 (780) Q Consensus 759 P~P~tvlai~~eg~y~~r~rrv 780 (780) |+||+|++|+|||+||+|+||| T Consensus 782 P~P~tvlsi~weg~y~~r~rrv 803 (803) T protein:vir:70 782 PHTFQLRDIEWEGSYNPTKRRV 803 (803) T ss_pred CCCeEEEEEEEEEEEecccccC Confidence 9999999999999999999999 No 12 >protein:vir:78957 Length: 826 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522826;genbank:gi:158345061;genbank:GeneID:5687447 Probab=100.00 E-value=2.4e-217 Score=1208.20 Aligned_cols=767 Identities=27% Similarity=0.423 Sum_probs=641.0 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) || +|+|+||||++|||||||++||++||++|+||+|+|+|||+||||++||++++++.. .+.++|+|++||++.|++| T Consensus 1 M~-~i~~~~~nl~gGvSqq~d~~r~~~q~~~~~N~~~~~~gG~~rRpgt~~va~~~~~~~-~~~~~f~~~~~r~s~e~~~ 78 (826) T protein:vir:78 1 MS-YKQSAYPNLLMGVSQQVAFERLPGQLSEQINMVSDPVSGLRRRSGIELMASLLHTDQ-PWPRPYLYHTNLGGRSIAM 78 (826) T ss_pred Cc-ceeeecchhccceecccchHhhhhhhhhhhcceeccccccccCCchHhhhhhccCCc-CCceeEEEEeccCCcceEE Confidence 99 599999999999999999999999999999999999999999999999999987754 4778999999999999999 Q ss_pred EEEEcCcEEEEEeCCCcEEEecC--CCCccccCCcceEEEEEeCCEEEEecCCEeeeEeecccccCCCCcceEEEecCcc Q lcl|NC_012662. 81 VINTNTGGWWLLDREAKNIVSEG--NLSYLLAADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTTGFYFVKSGA 158 (780) Q Consensus 81 ~l~~~~g~~~v~d~~~~~~~~~~--~~~y~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~g~v~v~~g~ 158 (780) ++++++|+||||+..+|.++..+ ..+||+++++++|+|+|+||+|||+|++++|++........++..++++++++|+ T Consensus 79 ~l~~~~g~irv~~~~~g~~~~~~~~~~~y~~~~~~~~l~~~t~aD~~fi~n~~~~p~~~~~~~~~~~~~~~~~~~v~~g~ 158 (826) T protein:vir:78 79 LVAQHRGELYLFDEKDGRLLMGQPLVHDYLKASDYRQLRAATVADDLFIANLEVRPEADKADVLGVDPSKTGWLYIKAGQ 158 (826) T ss_pred EEEEcCCcEEEEECCCCEEEEecCcccceeecCCcceeEEEEEcCEEEEEcCcEeeeeccccccCCCCCceEEEEecccc Confidence 99999999999998888877654 5678999999999999999999999999999987666666677788999999999 Q ss_pred ccceeEEEEeeCCce------EEEEEEeccCCCCc-----cccccchhhhhhhhhhhheeccc----------------- Q lcl|NC_012662. 159 FSKEYDISVVWSEGS------QTVTYTTPDGTTAG-----DADQSVPEAIARKLVEALIAVGV----------------- 210 (780) Q Consensus 159 y~~~y~vti~~~~~~------~t~t~tt~~~s~~~-----~~~~~~~~~i~~~l~~~~~s~g~----------------- 210 (780) |+++|.|++++...+ .++++.+++++.+. ...+....+++.++..+...... T Consensus 159 y~~~y~v~i~~~~~~~~~~~s~t~~y~t~~~~~~~~~~~~~~~~~~~~~~a~~l~~~~~~~~~~~~~~~t~~~~~~~~~~ 238 (826) T protein:vir:78 159 YSKAFSLTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQLFGKFFGAPEYTLPNSTKKYPKVDPDP 238 (826) T ss_pred cCceeEEEeccceeecccccceeEEEEeccCCccccccccccceecchhhheecceeeccccceeeeccceeEeeccccc Confidence 999999999875432 35778888776543 22233456677766543222110 Q ss_pred ---------ceEEEcCeEEEEEcCCCceeEEEeecCCcceeEEEEEeecceecccccccCCc----eEE----EEeccCC Q lcl|NC_012662. 211 ---------DFAVRVGPYIYFELITGTDLKITSTSGSPYIGYSNQSQVNLETDLPARLHPSA----DGA----LCAVGQS 273 (780) Q Consensus 211 ---------~~~~~~g~~i~~~~~s~~~~~vt~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~----~~~----v~~~~~~ 273 (780) .+....+.++++.+.....+.++...|..+...++.++++.+++||+.+|.+. .+. +....++ T Consensus 239 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~l~a~~p~~~~~~~~~~~~~~~~~~~g~ 318 (826) T protein:vir:78 239 AAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATADLPALLPGAGTPGTGVQFMDGAIMATGS 318 (826) T ss_pred cceeeccceeecccccceEEEecCCCeEEEeccCCCccceEEEeeEEEecccceeeeecccccceEEEEEEeeeEecCCC Confidence 01111224555666555566677777888888888889999999988887653 212 2222235 Q ss_pred CCCceEEEEEecCccEEEEecccceeEEcccceeEEeec------cccccccchhhcCCcccCCCcccccCCCceEEEEE Q lcl|NC_012662. 274 ERALVWYRYSSEKGVWLESGDYNSVTAISVDVPYKIVDD------NVEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTF 347 (780) Q Consensus 274 ~~~~~y~~~~~~~g~w~e~~~~~~~~~~~~~~p~~l~~~------~~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~ 347 (780) ..++||++|+..++.|+||+++++...+ ++|||+|++. .++..+|++|.+||+++||+|+|++ ++|++|+|| T Consensus 319 ~~~~~y~~~~~~~~~w~e~a~~g~~~~~-~tmp~~l~~~~~~~~f~~~~~~w~~r~~gd~~tnp~psf~g-~~i~~v~f~ 396 (826) T protein:vir:78 319 TKAPVYFAWDAANRRWAERAAYGTDWVL-KKMPLALRWDESTDTYSLNELEYDRRGSGDEETNPTFNFVK-RGITGMTTF 396 (826) T ss_pred cccceeEEEEcCCceEEEeeccCccccc-ccccEEEEEecCCCeEEEeeccccccccCcccccCcccccC-CCceEEEEE Confidence 5688999999999999999999986444 5999999842 2567789999999999999999997 789999999 Q ss_pred cceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCccccc Q lcl|NC_012662. 348 QGRLVLLSGAYVCMSATGEPDRFFRSTVSSLDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPD 427 (780) Q Consensus 348 q~RL~f~~~~~v~~S~~gd~~nF~~~t~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~ 427 (780) ||||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+| +++|||+ T Consensus 397 q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~~s~~~~~i~~~v~~~~~L~l~T~~~e~~l~~-~~~lTP~ 475 (826) T protein:vir:78 397 QGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTEPYEHAVTFNKDLIVFAKKYQAVVPG-GGIVTPR 475 (826) T ss_pred eceEEEeeCCeEEEEeccCccccccccccCCCCCCcEEEEEccCcceeEEEEEecCCcEEEEecCcEEEEeC-CCcccce Confidence 999999999999999999999999999999999999999999999999999999999999999999999998 5699999 Q ss_pred ceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCceEEEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeE Q lcl|NC_012662. 428 NASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIV 507 (780) Q Consensus 428 ~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~ 507 (780) |++++++|+|+|+++|+|+.+|++++|+++||++|++||||.|+++++++|+++|||+|++|||+++++.+ +++++|.. T Consensus 476 ~~~~~~~s~~~~~~~~~Pv~vG~~v~F~~~r~~~~s~v~e~~~~~~~~~~y~~~dlt~~~~~l~~~~v~~~-a~s~~~~~ 554 (826) T protein:vir:78 476 TAVISITTQYDVDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPSTDSHYVAEDVTSHIPSYMPGPAEYI-QAAASSGY 554 (826) T ss_pred eEEEEEEEeecccCCCCceEeCCeEEEEecCCCceeEEEEEEeeecccCccchHHHHHHHHHhcCCCeEEE-EEeCCCCe Confidence 99999999999999999999999999999999889999999999888888999999999999999988766 45555666 Q ss_pred EEEEEcCCCEEEEEEEeecCCceeeeeeEeeccCCcEEEEEEECCcEEEEEEEcCCCeEEEE-EEeeeccCCcccccccc Q lcl|NC_012662. 508 LMATTGDNRQVIAHEYHFTSQGKVHQAWHKWVFPYRVASLHFARDRVVLFAADDAGSTDKIT-ISTIDPKQGGVTFDVDR 586 (780) Q Consensus 508 ~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~w~~~G~v~~~~~~~d~l~~vv~R~~~g~~~~~-~e~~~~~~~~~~~~~~~ 586 (780) ++|++++||+|++|||+++++||+|+|||||+|+|+|++||+++|+||++|+|++++..+|+ +|+++.....+.....+ T Consensus 555 ~v~~~~~~g~l~~~ty~~~~~e~~v~aW~~~~~~g~v~~v~~i~d~l~~vv~r~~~~~~~r~~~~~~~~~~~~~~~~~~~ 634 (826) T protein:vir:78 555 LVFGTSAADEMICHQYLWQGNEKVQNAYHRWTLRHQIIGAYFTGDNLMVLIQKGQEIALGRMHLNSLPAREGLQYPKYDY 634 (826) T ss_pred EEEEEcCCCeEEEEEEEecCCcEEEEeEEEEccCCcEEEEEEECCeEEEEEEeCCCEEEEEEEEEecCCCccccccccce Confidence 88999999999999999999999999999999999999999999999999999999988876 88887655443332222 Q ss_pred eeeeeccceee----ecCcce-eEEeeccccCCCCeEEEEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeee Q lcl|NC_012662. 587 LPHLDSMSIVP----VNDGKG-IVPIYMRPWVSEGKLTGSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYE 661 (780) Q Consensus 587 ~~~lD~~~~~~----~~~~~~-~~~~~~~~~~l~g~~v~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~ 661 (780) +.++...+. ...... ......+..|++|+...+..++.+..... .....++++++++.+++|+|||+|+ T Consensus 635 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~----~~~~~~l~~~~~~~~~~v~VGl~y~ 708 (826) T protein:vir:78 635 --WRRIEATVDGELELTKQHWDLIKDGAAVYQLQPQVGAYMERYQLGVKRE----TSTKVFLDVPEAVVGSVYVVGCEFW 708 (826) T ss_pred --eEEEEEEEcceeccccceeEEecCCceeeeeccceeeeccccceecccc----CCCceEEEeCCCccccEEEEeecee Confidence 222221111 111110 11122345567777777666665532221 1122357788888899999999999 Q ss_pred EEEEcCceEEecCCCceeeecceEEEEEEEEEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEE Q lcl|NC_012662. 662 SLFAPTPPMLKDQNDTLISTAPVRLLRYELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPV 741 (780) Q Consensus 662 ~~v~~~~~~i~~~~g~~~~~~r~~v~rv~v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~v 741 (780) ++++|+||++++++|++++.+|+||+|++|+|.+|+.|++.|+++.++..+.+..++..+.+.+..+|.| ...+|++++ T Consensus 709 s~~~~~~~~~~~~~g~~~~~~r~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~l~~g~~-~~~t~~v~v 787 (826) T protein:vir:78 709 SKVEFTPPVLRDHNGLPMTSTRAVLHRYNVNFGWTGEFLWRISDTARPNQPWYDTTPLRLSSRQLNAGEP-LVDSAVVPL 787 (826) T ss_pred EEEEeCceEEecCCCcceeecceEEEEEEEEeeccccEEEEeCCCccCcceeeeecccccccccccCCcc-cccceEEEE Confidence 9999999999999999999999999999999999999999999999987777777787777777777755 468999999 Q ss_pred EeecccceeEEEEEECCCCCEEEEEEEEEEEEecceecC Q lcl|NC_012662. 742 PCRSNAQSTEMYLSTDGTQDMNILEIEYIIRYNQRRRRV 780 (780) Q Consensus 742 p~~~~~~~~~v~i~~~~P~P~tvlai~~eg~y~~r~rrv 780 (780) |+.+++++.+|+|+|+.|+||+|++|+|||+||+|+||| T Consensus 788 p~~~~~~~~~i~i~~d~P~P~tvlai~~~~~y~~r~rrv 826 (826) T protein:vir:78 788 PARVDMATSKFELSCHSPYDMNVRAVEYNFKSNQTYRRV 826 (826) T ss_pred eeeccCceEEEEEEeCCCCcEEEEEEeEEEEecceeecC Confidence 999999999999999999999999999999999999999 No 13 >protein:vir:105647 Length: 800 # NCBI annotation: putative tail tubular B protein # Family: family:all:825 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425011;genbank:gi:83571759;uniprot:Q2WC41;genbank:GeneID:3837288 Probab=100.00 E-value=6.8e-217 Score=1205.70 Aligned_cols=762 Identities=22% Similarity=0.328 Sum_probs=655.2 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) |- |+|+||||++|||||||++||++||++|+||+|+|+|||+||||++||+++++.... ...++.+..++++ +++| T Consensus 1 ~~--v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~fva~l~~~~~~-~~~~~~~~~~~~~-~~~~ 76 (800) T protein:vir:10 1 ME--VQGSLGRQIQGISQQPPAVRLDGQCTTMVNMVPDVVNGTQSRMGTTHIAKLLDEGTD-NMATHHYRRGEGD-EEYF 76 (800) T ss_pred Ce--EEeecchhcccccccchhHhhhhhhhhhhcceeeeccCcccCCcceEEEeecCCCCC-ccEEEEEecCCcc-ceEE Confidence 86 899999999999999999999999999999999999999999999999999876533 3445555555444 4445 Q ss_pred EEEEcCcEEEEEeCCCcEEEecCCC---Cccc-cCC-cceEEEEEeCCEEEEecCCEeeeEeecccccCCCCcceEEEec Q lcl|NC_012662. 81 VINTNTGGWWLLDREAKNIVSEGNL---SYLL-AAD-RRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTTGFYFVK 155 (780) Q Consensus 81 ~l~~~~g~~~v~d~~~~~~~~~~~~---~y~~-~~~-~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~g~v~v~ 155 (780) ++++..+.++||+.+|..+...... .|+. +.+ .++|+|+|+||+|||+|++++|+++.+..+..| .+++++++ T Consensus 77 ~~~~~g~~~rv~~~~G~~~~v~~~~~~~~~~~~~~~~~~~l~~~tvaD~tfi~n~~~~~~~~~~~~~~~~--~~~~~~vr 154 (800) T protein:vir:10 77 FTLKKGQVPEIFDKHGRKCNVISQDAPMTYLSEVVNPREDVQFMTIADVTFMLNRRKVVKVSNRKSPKVG--DKAIVFCA 154 (800) T ss_pred EEEEcCCeEEEEecCCcEEEeecCCcceeeeeccCCchhhEEEEEEcCEEEEecCcccccccccCCCCCC--ceEEEEEe Confidence 5566667899999888776654333 3332 333 468999999999999999999999877765544 56899999 Q ss_pred CccccceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheecc--cceE-EEcCeEEEEEcCCCceeEE Q lcl|NC_012662. 156 SGAFSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVG--VDFA-VRVGPYIYFELITGTDLKI 232 (780) Q Consensus 156 ~g~y~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g--~~~~-~~~g~~i~~~~~s~~~~~v 232 (780) +|+|+++|+|++ ++..++++++++++.+....++++++|+.+|...+.+.. .+++ ...|+++++.+.++.++.+ T Consensus 155 ~g~y~~~y~i~i---~g~~~~~~~t~~~~~~~~~~~~s~~~i~~~L~~~l~~~~~~~~~t~~~~g~~i~i~~~~~~~~~~ 231 (800) T protein:vir:10 155 YGQYGTSYSIII---NGTTAASFKTPDGGSAEHVEQIRTERITSELYSKLQQWSGVNDYEIQRDGTSIFIERRDGKSFTV 231 (800) T ss_pred ccccccceeEEe---ccceEEEEEecCCCcccccccccHHHHHHHHHhhhhhcCcccceEEEEcCcEEEEEEecCCceEE Confidence 999999999999 455688999999999999999999999999998887654 3444 4678899999988888999 Q ss_pred EeecCCcceeEE-EEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecC---ccEEEEecccceeEEc-cccee Q lcl|NC_012662. 233 TSTSGSPYIGYS-NQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEK---GVWLESGDYNSVTAIS-VDVPY 307 (780) Q Consensus 233 t~~~g~~~~~~~-~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~---g~w~e~~~~~~~~~~~-~~~p~ 307 (780) ++.++..+..+. ..+.++++.+||+++|++++++|++++++..+.||++|+..+ +.|+||++++...+++ ++||| T Consensus 232 ~~~~~~~~~~~~~~~~~v~~~~~Lp~~~~~g~~~~i~~~~~~~~~~y~~~~~~~~~~~~~w~e~~~~~~~~~~~~~tmp~ 311 (800) T protein:vir:10 232 TTTDGAKGKDLVAIKNKVSSTDLLPSRAPAGYKVQVWPTGSKPESRYWLQAEPKEGNLVSWKETIAADVLLGFDKGTMPY 311 (800) T ss_pred EEeecCCcceEEEEEeeccceeeccccCCCCceEEEEcCCCCCCceeEEEEEeccccceEEEeecccCceeeeecccccE Confidence 999998887775 567899999999999999999999999999999999998754 5699999999999997 58999 Q ss_pred EEeec---------cccccccchhhcCCcccCCCcccccC---CCceEEEEEcceEEEecCCeEEEEccCCccccccccc Q lcl|NC_012662. 308 KIVDD---------NVEQHIMEGRLAGDDLTNPAPTFLEE---RRITGIGTFQGRLVLLSGAYVCMSATGEPDRFFRSTV 375 (780) Q Consensus 308 ~l~~~---------~~~~~~w~~~~~gd~~t~~~psf~~~---~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~t~ 375 (780) .|++. .+...+|++|.+||+++||+|+|++. ++|++|+||||||+|++|++|||||+||||||+++|+ T Consensus 312 ~lv~~~~~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~~~~~~~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~ 391 (800) T protein:vir:10 312 IIERTGIIDGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTIGGMFMVQNRLCFTAGEAVIASRTSYFFDFFRYTV 391 (800) T ss_pred EEEEeeeeecceeEEEEeccccccccCCCCCCCCchhcCCCCCCCceeEEEEeeeEEEeeCCeEEEEccCCccccccccc Confidence 99864 35677899999999999999999985 4699999999999999999999999999999999999 Q ss_pred cCCCCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEEEEeeecCCCCCCcEEeCCeEEEE Q lcl|NC_012662. 376 SSLDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYP 455 (780) Q Consensus 376 ~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~ 455 (780) ++++|||||+++++++++|+|+|+++++++|+|||+++||+|+| +++|||+|++++++|+|+|+++|+|+.+|++++|+ T Consensus 392 ~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~q~~l~g-~~~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv 470 (800) T protein:vir:10 392 ISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFILPG-DKPLEKSNALLKPVTTFEVNNKVKPVVTGESVMFA 470 (800) T ss_pred cCCCCCccEEEEEcCCcceeeeeEeecCCcEEEEecCcEEEEeC-CCcccceeEEEEEEEeeeccCCCCceEeCCeEEEe Confidence 99999999999999999999999999999999999999999998 56999999999999999999999999999999999 Q ss_pred EecCCceEEEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeee Q lcl|NC_012662. 456 APRSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAW 535 (780) Q Consensus 456 ~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW 535 (780) +++| ++++||||+|+ +.+|+|+++|||+|++|||+|++++++++++++++++|+++++|+|++|||+++++||+|+|| T Consensus 471 ~~~g-~~s~vre~~~~-~~~d~~~a~DlT~~~~hl~~~~v~~~~~~~~~~~~v~~~~~~~~~l~~~~yl~~~~e~~~~aW 548 (800) T protein:vir:10 471 TNDG-SYSGVREFYTD-SYSDTKKAQAITSHVNKLIEGNITNMAASTNVNRLLVTTDKYRNIIYCYDWLWQGTDRVQSAW 548 (800) T ss_pred cCCC-CeeEEEEEeee-ecccceehhhHHhHHHHhcCCceEEEEEeCCCCeEEEEEEcCCCeEEEEEEeecCCceEEEEE Confidence 8876 67899999997 678899999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeccC--CcEEEEEEECCcEEEEEEEcCCCeEEEEEEeeeccCCcccccccceeeeeccceeeecCcce---eEEeecc Q lcl|NC_012662. 536 HKWVFP--YRVASLHFARDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKG---IVPIYMR 610 (780) Q Consensus 536 ~~w~~~--G~v~~~~~~~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~---~~~~~~~ 610 (780) |||+++ +.+++|++++|+||++|+|+++ +|||||++....+ .+.++++|||++..+....... .+....+ T Consensus 549 ~~w~~~~~~~~~~~~~~~d~l~~iv~r~~~----~~ier~~~~~~~~-~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~ 623 (800) T protein:vir:10 549 HVWEWPMGTKVRGMFYSGELLYLLLERGDG----VYLEKMDMGDALT-YGLNDRIRMDRQAELIFKHFKAEDEWISEPLP 623 (800) T ss_pred EEEEcCCCcEEEEEEEeCCeEEEEEECCCc----EEEEEEecccCcc-ccccceeeeecceeecccccccCcceEEEecc Confidence 999975 4677888889999999999764 9999998866533 3577889999887654322111 1222344 Q ss_pred ccCCCCeEEEEEe--------cCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCceeeec Q lcl|NC_012662. 611 PWVSEGKLTGSVA--------TGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLISTA 682 (780) Q Consensus 611 ~~~l~g~~v~~~a--------dG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~~~ 682 (780) +.+++++.+.+.+ +|.+.....+.++..+......+++.++++|+|||+|+++++|+||++++++|+++..+ T Consensus 624 ~~~~~~~~~~~~~~~~~~~~~~g~v~~~~~~~~g~~~~~~~~~~g~~~~~~v~VGl~Y~s~~~~~~~~i~~~~g~~~~~~ 703 (800) T protein:vir:10 624 WTPTNPELLDCILIEGWDSYIGGSFLFKYKPSDNTLSTTFDMHDDNHVKAKVIVGQIYPQEFEPTPVVIRDRQDRVSYID 703 (800) T ss_pred ccccCCcceEEeeeccceeecCceeEEEEEecCCceEeeeeecCCCcccceEEEeeeeeEEEeecceEEEcCCCcccccC Confidence 4555555554444 34444443333332221111124577889999999999999999999999999999999 Q ss_pred ceEEEEEEEEEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEeecccceeEEEEEECCCCCE Q lcl|NC_012662. 683 PVRLLRYELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQSTEMYLSTDGTQDM 762 (780) Q Consensus 683 r~~v~rv~v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~ 762 (780) |+||+|++|+|.+||+|.+++++..++..+.+..++++.++.++.+|.+| ++||+++||+.+|+++.+|+|++++|+|| T Consensus 704 r~~i~r~~~~~~~sg~~~~~v~~~~~~~~~~~~~~~~~~g~~~~~~g~~~-~~tg~~~vp~~g~~~~~~v~i~~d~P~P~ 782 (800) T protein:vir:10 704 VPVVGLVHLNLDMYPDFSVEVKNVKSGKVRRVLASNRIGGALNNTVGYVE-PREGVFRFPLRAKSTDAVYRIIVESPHTF 782 (800) T ss_pred CeEEEEEEEEeecCceEEEEeccCcccceeEEccCCeeccccccccCccc-ccCceEEEEEeccCceeEEEEEECCCCcE Confidence 99999999999999999999999988877777888899999999999887 57999999999999999999999999999 Q ss_pred EEEEEEEEEEEecceecC Q lcl|NC_012662. 763 NILEIEYIIRYNQRRRRV 780 (780) Q Consensus 763 tvlai~~eg~y~~r~rrv 780 (780) +|+||+|||+||+|+||| T Consensus 783 tvlai~~eg~y~~r~~rv 800 (800) T protein:vir:10 783 QLRDIEWEGSYNPTKRRV 800 (800) T ss_pred EEEEEEEEEEeecccccC Confidence 999999999999999999 No 14 >protein:vir:103341 Length: 806 # NCBI annotation: tail tubular protein B-like protein # Family: family:all:825 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039670;genbank:gi:125999999;genbank:GeneID:4818417 Probab=100.00 E-value=1.3e-216 Score=1204.21 Aligned_cols=765 Identities=20% Similarity=0.296 Sum_probs=665.9 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) |= |+|+||||++|||||||++||++||++|+||+|+|+|||+||||++||++++.... ....+|.++++..+++| T Consensus 1 ~~--v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~~Gl~rRPgt~~va~l~~~~~---~~~~~~~~~~~~~~~~y 75 (806) T protein:vir:10 1 ME--VQGSYGRQLQGVSQQPIAVRLPGQVTSQLNAVPNVVDGLKTRMGSKHLARILNSLD---ANSLIHHYKRGDDAEEY 75 (806) T ss_pred Ce--eEeecchhccceeccChhHhhhhhhhhhhcceeccccccccCCchhhhhhhcCCCC---ccceEEEEEecCCceEE Confidence 86 89999999999999999999999999999999999999999999999998875432 34456777776666778 Q ss_pred EEEEcCcEEEEEeCCCcEEEecCCC----Ccccc--CCcceEEEEEeCCEEEEecCCEeeeEeecccccCCCCcceEEEe Q lcl|NC_012662. 81 VINTNTGGWWLLDREAKNIVSEGNL----SYLLA--ADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTTGFYFV 154 (780) Q Consensus 81 ~l~~~~g~~~v~d~~~~~~~~~~~~----~y~~~--~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~g~v~v 154 (780) ++++.+|.++||+..+|........ .|+.+ .++++|+|+|+||+|||+|++++|+++.+..+ .+..++++++ T Consensus 76 ~v~~~~g~i~v~~~~~G~~~~v~~~~~~~~yl~~~~~~~~~l~~~tvaD~tfi~n~~~~~~~~~~~~~--~~~~~~~v~v 153 (806) T protein:vir:10 76 FVILQPGQVPVIFTVGGLACPVNTQGSAATYLSSSSLPRETTQLMTIGDYTFVLNRKMPVQARGDVTP--SLDNKGLVYV 153 (806) T ss_pred EEEEcCCcEEEEEcCCCcEEEecCCCceEEEeccCCCCcceeeEEEEcCEEEEecCcEeeeecccccC--CCCcceEEEE Confidence 8999999999998666654433332 24443 35679999999999999999999998876654 4556899999 Q ss_pred cCccccceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheeccc---ceE-EEcCeEEEEEcCCCcee Q lcl|NC_012662. 155 KSGAFSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVGV---DFA-VRVGPYIYFELITGTDL 230 (780) Q Consensus 155 ~~g~y~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g~---~~~-~~~g~~i~~~~~s~~~~ 230 (780) ++|+|+++|++++ ++..++++++++++++...+++++++++.++..++.+... +++ .+.|.++++.+.+.... T Consensus 154 ~~g~y~~~y~i~I---ng~~~a~~~t~~~~~~~~~~~~~~~~~a~~l~~~l~~~~~~~~~~~~~~~g~~~~i~~~~~~~~ 230 (806) T protein:vir:10 154 AYANFSFTYQILI---NGQVAAEHKTASSEDVKNEDLVRTDYVAGKLLENFNSRTASFPGFSMYQDGNVLVVDNSNGANY 230 (806) T ss_pred eecccCceeeEEe---ccceEEEEEeccCCCcccccccchhHHHHHHHhhhcccccccceeEEEEcccEEEEecCCCCcc Confidence 9999999999999 4667889999999999999999999999999988877543 333 46788999998888888 Q ss_pred EEEeecCCcceeE-EEEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecC---ccEEEEecccceeEEc-ccc Q lcl|NC_012662. 231 KITSTSGSPYIGY-SNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEK---GVWLESGDYNSVTAIS-VDV 305 (780) Q Consensus 231 ~vt~~~g~~~~~~-~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~---g~w~e~~~~~~~~~~~-~~~ 305 (780) .+++.+|.....+ +..++++++++||.++|+|+++++.+++++..+.||++|+..+ +.|+|+++++...+++ ++| T Consensus 231 ~~~~~~g~~~~~~~~~~~~v~~~~~lp~~~~~g~~v~i~~~~~~~~~~y~v~~~~~~~~~~~w~e~~~~~~~~~~~~~t~ 310 (806) T protein:vir:10 231 ALTTVDGADGQDLVAIRHKVTNLDTLPNRAPVGYKVQVWPTGSKPESRYWLQAESQDGSKVTWVETIAPGVRKGWNAATM 310 (806) T ss_pred EEEEeeCCCCceeEEeecccCccccCccccCCCcEEEEeccCCCCCCceEEEEEeeccCceEEEeecccccccceecccc Confidence 8888888766554 4578899999999999999999999999999999999997654 4699999999988887 689 Q ss_pred eeEEeecc----------ccccccchhhcCCcccCCCcccccC---CCceEEEEEcceEEEecCCeEEEEccCCcccccc Q lcl|NC_012662. 306 PYKIVDDN----------VEQHIMEGRLAGDDLTNPAPTFLEE---RRITGIGTFQGRLVLLSGAYVCMSATGEPDRFFR 372 (780) Q Consensus 306 p~~l~~~~----------~~~~~w~~~~~gd~~t~~~psf~~~---~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~ 372 (780) |+.+++.. ++.++|+.|.+||+++||+|+|++. ++|++|+||||||+|++|++|||||+||||||++ T Consensus 311 p~~~v~~~~~~~~~~~~~~~~~~w~~r~~Gd~~tn~~psF~~~~~~~~it~v~f~q~RL~f~s~~~v~~Srsgd~~nF~~ 390 (806) T protein:vir:10 311 PHVLVRESLNANGSANFTYRPGEWEDRDVGDDLTNDFPSLLNDSSPQPISSMLMVQNRLMLTSGEAVVASRTSRFFDFFR 390 (806) T ss_pred ceEEEeeeeeecccceeEEEecccccccccccccCccCcccCCCCCccceEEEEEeeeEEEecCCeEEEEccCCcccCcc Confidence 99998753 4567899999999999999999873 5789999999999999999999999999999999 Q ss_pred ccccCCCCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEEEEeeecCCCCCCcEEeCCeE Q lcl|NC_012662. 373 STVSSLDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSDLACNAFVAPVTTSQTL 452 (780) Q Consensus 373 ~t~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v 452 (780) +|+++++|||||+++++++++|+|+|+++++++|+|||+++||+|+| +++|||+|++++++|+|+|+++|+|+.+|+++ T Consensus 391 ~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~t~~~q~~l~~-~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v 469 (806) T protein:vir:10 391 YTVLATVDTDPFDVFADIEEVYNIRWSAQMDGDVVLFTSDQQFTLPG-DKPLTPTSAVIRPVTQFKMTPGVKPAPSGDSI 469 (806) T ss_pred ccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeC-CCcccceeEEEEEEEeecccCCCCceEeCCeE Confidence 99999999999999999999999999999999999999999999998 56999999999999999999999999999999 Q ss_pred EEEEecCCceEEEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceee Q lcl|NC_012662. 453 MYPAPRSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVH 532 (780) Q Consensus 453 ~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v 532 (780) +|++++| ++++||||+|+ +.+|+|+++|||+|++|||+|+++.+++++++|.+++|++++||+|++|||+++++||+| T Consensus 470 ~Fv~~~g-~~s~vre~~y~-~~~d~~~~~DlT~~~~hl~~g~~~~~~~~~~~~~~~~~~~~~dg~l~~~ty~~~~~e~~v 547 (806) T protein:vir:10 470 LFAFDQG-SYSGIREFFTD-SYSDTKKAQPATSHVDKYIRGKVLELSASSSFNRAFIITSPDRNILYVYDWLYEGTEKVQ 547 (806) T ss_pred EEeeCCC-CeeEEEEEEee-eeccceehhhHHHHHHHhcCCCeEEEEEeCCCCcEEEEEEcCCCEEEEEEEeecCCceEE Confidence 9998887 67899999998 678899999999999999999999999999999999999999999999999999999999 Q ss_pred eeeEeeccCCc--EEEEEEECCcEEEEEEEcC--CCeEEEEEEeeeccCCcccccccceeeeeccceeeecCcce----e Q lcl|NC_012662. 533 QAWHKWVFPYR--VASLHFARDRVVLFAADDA--GSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKG----I 604 (780) Q Consensus 533 ~aW~~w~~~G~--v~~~~~~~d~l~~vv~R~~--~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~----~ 604 (780) +|||||+++|. +++|++++|+||++|+|++ +|..++|+|+|+..... ..++.++.|||+++++....... . T Consensus 548 ~aW~rw~~~~~~~~~~~~~~~d~l~~vv~R~~~~~g~~~~~iE~~~~~~~~-~~~~~~~~~lD~~~~~~~~~~~~~~~~~ 626 (806) T protein:vir:10 548 NAWHKWSFPAGTVLHAVSYSNEKLYLVLTRTNTSGGVAGVYIEVMDMGDEL-EYGLQDRVRMDRRATLSMTYNATTRVWT 626 (806) T ss_pred EeEEeeeeCCCeEEEEEEEecCeEEEEEEEcCCcccEEEEEEEeecCCCCC-CcccceeeeccccceEEEecccccccee Confidence 99999999865 7788889999999999986 78999999999875443 33567788999999886532221 2 Q ss_pred EEeeccccCCCCeEEEEEecCccccceecc--ccccccceEEEc---CCCCCCEEEEeEeeeEEEEcCceEEecCCCcee Q lcl|NC_012662. 605 VPIYMRPWVSEGKLTGSVATGALASEEVAI--DVDEVSWEFTVE---PGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLI 679 (780) Q Consensus 605 ~~~~~~~~~l~g~~v~~~adG~~~~~~~~~--~~~~~~~~~~i~---~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~ 679 (780) .....++.|++++.+.+.++|..+...... .......+++.. .+.++++|+|||+|+++++|+||++++++++++ T Consensus 627 ~~~l~~~~~~~~~~~~~~~~g~~~~~g~~~~~~~~~~~~~v~~~~~~~~~~~~~v~vGl~Y~s~~~~t~p~~~~~~~~~~ 706 (806) T protein:vir:10 627 SSALPWLPQDLSSLDAVLVSGWAGYVGGAFQFSYNASNNTISTNFDLAEGNTATIVVGETYWYEVEPTPPLIKDSKDRVS 706 (806) T ss_pred eeeeccccccccceeEEEEeeccccCCceEEEEEcCccceEeeeeeecCCCCcEEEEeeeeeEEEEECCeeEeccCCCcc Confidence 334578899999999999999765321111 111112223322 255688999999999999999999999999998 Q ss_pred eecceEEEEEEEEEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEeecccceeEEEEEECCC Q lcl|NC_012662. 680 STAPVRLLRYELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQSTEMYLSTDGT 759 (780) Q Consensus 680 ~~~r~~v~rv~v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~~~~v~i~~~~P 759 (780) ..+|+||+|+++++.+|++|++.+.++.+...+.+.+++.++++.++.+|.+| .++|+++||+.+|+++.+|+|+|++| T Consensus 707 ~~~r~~l~r~~~~~~~s~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-~~tg~~~vp~~~~~~~~~v~i~~d~P 785 (806) T protein:vir:10 707 YLDTPTVGNVYLNLDMYPDFSVVVTDKETLQERTVYLANKTAGSITNVIGYIA-PHEGTLRIPLRRKSTDVSFKIRSKSP 785 (806) T ss_pred ccccEEEEEEEEEeecceeeEEEEcccCCCcceeeeccCcccccccccccccc-cccceEEEEeeecCceeEEEEEECCC Confidence 89999999999999999999999999888777888889999999988888777 68999999999999999999999999 Q ss_pred CCEEEEEEEEEEEEecceecC Q lcl|NC_012662. 760 QDMNILEIEYIIRYNQRRRRV 780 (780) Q Consensus 760 ~P~tvlai~~eg~y~~r~rrv 780 (780) +||+|+||+|||+||+|+||| T Consensus 786 ~P~tvlai~~eg~y~~r~~rv 806 (806) T protein:vir:10 786 ATFQLRDIEWTGSYNPRKRRV 806 (806) T ss_pred CceEEEEEEEEEEeecccccC Confidence 999999999999999999999 No 15 >protein:vir:8887 Length: 808 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813776;genbank:gi:29366731;genbank:GeneID:1258831 Probab=100.00 E-value=2.7e-215 Score=1196.92 Aligned_cols=764 Identities=19% Similarity=0.270 Sum_probs=657.6 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) || +|+|+||||++|||||||++||++||++|+||+|+|+|||+||||++||++++..... ....++++++|++.|++| T Consensus 1 M~-~v~~s~~n~~~GvSqq~d~~R~~~q~~~~~N~~~~~~gG~~rRpgt~~v~~l~~~~~~-~~~~~~~~~~~~~~~~y~ 78 (808) T protein:vir:88 1 MG-LVSQSVKNLKGGISQQPDILRFSNQGALQINGWSSETQGLQKRPPTTFTKRLQNKGFL-GTKPLVHLINRDAQEQYF 78 (808) T ss_pred Cc-ceeeecchhccceeccchhHhhhhhhhhhhcceeeeccccccCCchheeeeeeccCCC-CCCcEEEEEEeCcCceEE Confidence 99 5999999999999999999999999999999999999999999999999998876544 345678888898888887 Q ss_pred EEEEcCcEEEEEeCCCcEEEecCCCCccccCC-cceEEEEEeCCEEEEecCCEeeeEeecc--cccCCCCcceEEEecCc Q lcl|NC_012662. 81 VINTNTGGWWLLDREAKNIVSEGNLSYLLAAD-RRSIQTTSMGGVTYILNTEKRPSATTDN--SDKKDPKTTGFYFVKSG 157 (780) Q Consensus 81 ~l~~~~g~~~v~d~~~~~~~~~~~~~y~~~~~-~~~l~~~q~aD~~fi~~~~~~p~~~~~~--~~~~~~~~~g~v~v~~g 157 (780) +++.++ +|+||+.+++.....+..+|+.+++ .++|+++|+||+|||+|++++|++..+. ...+.+..+++++++++ T Consensus 79 v~~~~~-~i~v~~~~G~~~~v~~~~~y~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~~~~~~~~~~~~~~~vr~g 157 (808) T protein:vir:88 79 VGFSGT-GLAVWDLKGNNYTVRGYNGYANCANPRTDLRLITVADYTFVVNRNTVCQMGSTLTHAAYPRLDGRAIINVRGG 157 (808) T ss_pred EEEeCC-eEEEEEcCCceEEEeecCcceEecCChhheeEEEEcCEEEEEcCCcceeecccccccCCCCCCccEEEEEccc Confidence 766655 5999999998888888889876654 4589999999999999999999986664 33455677899999999 Q ss_pred cccceeEEEEeeCCc--eEEEEEEeccCCCCc-----------cccccchhhhhhhhhhhheecc--cceEE-EcCeEEE Q lcl|NC_012662. 158 AFSKEYDISVVWSEG--SQTVTYTTPDGTTAG-----------DADQSVPEAIARKLVEALIAVG--VDFAV-RVGPYIY 221 (780) Q Consensus 158 ~y~~~y~vti~~~~~--~~t~t~tt~~~s~~~-----------~~~~~~~~~i~~~l~~~~~s~g--~~~~~-~~g~~i~ 221 (780) +|+++|+++++.... ..+.++.+++++.+. ..+...+++++..+...+.+.. .+|.. ..+.+++ T Consensus 158 ~y~~~y~i~i~g~~s~~~~t~t~~~~~~s~~~v~~~~~~~~~~~~~~~~~~~ia~~l~~~~~~~~~~~~~~~~~~~~~~~ 237 (808) T protein:vir:88 158 QYGRTLSITINGDGTGSSPQASIKMPNGSAEKVPAGDPYAGMNQVDMTDASWIAAELARQLTVSLGGSGWSFQAGTGWIL 237 (808) T ss_pred ccCceEEEEEecCCcceeeeEeEEEccCcccceeeccceeecccCCccccccchhhheeeeeecccccceEEEeccceEE Confidence 999999999986433 346677777665332 2344566778887777666654 34543 3457788 Q ss_pred EEcCCCce-eEEEeecCCcceeEEE-EEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEeccccee Q lcl|NC_012662. 222 FELITGTD-LKITSTSGSPYIGYSN-QSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVT 299 (780) Q Consensus 222 ~~~~s~~~-~~vt~~~g~~~~~~~~-~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~~~~~ 299 (780) +...++.. +.+++.+|..+...+. .+.|+++++||+.+|+|+.++|..+.++..++||++|+..++.|+||++++... T Consensus 238 i~~~a~~~~~~~~t~~g~~~~~~~~~~~~v~~~~~lp~~~p~g~~v~i~~~~~~~~~~~yv~~~~~~~~w~e~~~~~~~~ 317 (808) T protein:vir:88 238 INAPANDNVRQIATKDGYADTLLSGFIYQVQTFTKLPANAPPGYLVEITGESARSGDNYWVQYDASGKVWKETAKPKIIA 317 (808) T ss_pred EEeccCceeEEEcccCCcCcceeeeeeeeccceeeccccCCCCcEEEEEecCCCCCceeEEEEEcCCeEEEEeeecccee Confidence 88877655 5788888888877765 568999999999999999999999999999999999999999999999999999 Q ss_pred EEc-ccceeEEeecc-----ccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEecCCeEEEEccCCccccccc Q lcl|NC_012662. 300 AIS-VDVPYKIVDDN-----VEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLSGAYVCMSATGEPDRFFRS 373 (780) Q Consensus 300 ~~~-~~~p~~l~~~~-----~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~ 373 (780) +++ ++|||.|++.. ++..+|++|.+||++|||+|+|++ ++|++|+||||||+|++|++|||||+||||||+++ T Consensus 318 ~~~~~tmp~~lv~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g-~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~ 396 (808) T protein:vir:88 318 GFNNATLPHALVRAADGQFDWTPLTWDGRNAGDDDTNPMPSFVG-ATINDVFFFRNRLGFLSGENVVMSRTSKYFNFFPS 396 (808) T ss_pred eecccceeEEEEecCCceEEEEecccccccccccccCccceecC-CceeEEEEEcceEEEeeCCeEEEEeccCcccccCC Confidence 997 58999998854 456779999999999999999997 78999999999999999999999999999999999 Q ss_pred cccCCCCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEEEEeeecCCCCCCcEEeCCeEE Q lcl|NC_012662. 374 TVSSLDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSDLACNAFVAPVTTSQTLM 453 (780) Q Consensus 374 t~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~ 453 (780) |++++.|||||+++++++++|+|+|+|+++++|+|||+++||+|+| +++|||+|++++++|+|+|+++|+|+.+|++++ T Consensus 397 t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~i~T~~~e~~l~~-~~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~ 475 (808) T protein:vir:88 397 SVATLSDDDPIDVAISHNRISILKYAVPFSEQLLLWSDQAQFVLSS-KTILSSKTIELDLTTEFDVSDGARPYGIGRGVY 475 (808) T ss_pred cccCCCCCccEEEEecCCccceeeEEeecCCcEEEEecCcEEEEeC-CCcccceeEEEEEEEEecccCCCCceEeCCeEE Confidence 9999999999999999999999999999999999999999999998 569999999999999999999999999999999 Q ss_pred EEEecCCceEEEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeee Q lcl|NC_012662. 454 YPAPRSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQ 533 (780) Q Consensus 454 f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~ 533 (780) |++++| ++++++|+++.++++|+|+++|||+|++|||+|+++++++++++|.+++|++++||+|++|+|+++++||+|+ T Consensus 476 f~~~~g-~~~~v~r~~~~~~~~d~y~~~dlt~~~~h~~~~~~~~~~~~~~~~~~~v~~~~~~g~l~~~~y~~~~~e~~v~ 554 (808) T protein:vir:88 476 FAAPRA-SFTSLKRYYAIQDVSDVKSAEDVSAHVPSYITNTVHAIHGSGTENFVSILSDGSPNKVFIYKFLYLDEILQQQ 554 (808) T ss_pred EEecCC-CeeEEEEEEEeeeccCceehhhHHHHHHHhcCCCeEEEEEeCCCCeEEEEEEcCCCEEEEEEEeccCCceeEE Confidence 998887 6778877655558899999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEeeccCCcEEE----EEEECCcEEEEEEEcCCCeEEEEEEeeeccCC-cccccccceeeeeccceeeecC---cc-ee Q lcl|NC_012662. 534 AWHKWVFPYRVAS----LHFARDRVVLFAADDAGSTDKITISTIDPKQG-GVTFDVDRLPHLDSMSIVPVND---GK-GI 604 (780) Q Consensus 534 aW~~w~~~G~v~~----~~~~~d~l~~vv~R~~~g~~~~~~e~~~~~~~-~~~~~~~~~~~lD~~~~~~~~~---~~-~~ 604 (780) |||||+|+|.+++ +++++|+||++|+|.++ .|+|||++... .+.....+++||||+.++.... .. .+ T Consensus 555 aW~r~~~~g~~~~~~~~~~~~~d~l~~vV~r~~~----~~ler~~~~~~~~~~~~~~~~~~lD~~~~~~~g~~~~~~~~t 630 (808) T protein:vir:88 555 SFSHWEFGDAATTRVLAASCIGSYCYLMIDRPEG----LCLERMEFTQHTIDYSIEPYRTYMDMKKTIVLGAYNIDTNLT 630 (808) T ss_pred eeEEEecCCCeeEEEEEEeccCCEEEEEEEcCCc----EEEEEEeeccCCCCCccccceeeeeeeeeeccccccCccccc Confidence 9999999987665 44458999999999764 68999987554 3444567789999998764311 11 11 Q ss_pred E----EeeccccCCCCeEEEEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCceee Q lcl|NC_012662. 605 V----PIYMRPWVSEGKLTGSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLIS 680 (780) Q Consensus 605 ~----~~~~~~~~l~g~~v~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~ 680 (780) . ..+.++.|++++.+.+.+||.+... ...++....++++++++++++|+|||+|+++++|+||++++++|+.++ T Consensus 631 ~~~~~~~~~~~~~~~~~~~~~~~dg~~~~~--~~~~~~~~~~~~~~~~~~~~~v~vGl~y~s~~~~~p~~~~~~~g~~~~ 708 (808) T protein:vir:88 631 SFDVRTAYGGTPGPESTFYTIDQQGVLIEH--EARDWATNPYISFVGNRAGEQMVIGKQYTFQYEFSKFLIKQTADDGST 708 (808) T ss_pred eeecccccccccccceeEEEEcCCceEEee--ecccccCcceEEeCCCccCceEEEeeeeeEEEEecceEEecCCCCcce Confidence 1 2346778999999999999986533 234445667899999999999999999999999999999999998765 Q ss_pred e----cceEEEEEEEEEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEeecccceeEEEEEE Q lcl|NC_012662. 681 T----APVRLLRYELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQSTEMYLST 756 (780) Q Consensus 681 ~----~r~~v~rv~v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~~~~v~i~~ 756 (780) . +|+||+|+++++.+|+++.+.++++.++. .+.++++++++.+ ..+.+| +.+|++++|+.+|+++.+|+|+| T Consensus 709 ~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~~--~~~~~~~~~~~~~-~~~~~~-~~tg~~~vp~~~~~~~~~v~i~~ 784 (808) T protein:vir:88 709 STEDIGRLQLRRAWLNYEESGAFEINVNNGSSEF--VYVMTGGRLGIQR-VLGELS-VGTGQFKFPVTGNAVNQRVTITS 784 (808) T ss_pred eecccceEEEEEEEEEeecccceEEEeCCCcccc--eeeccCcccCccc-ccCccc-cccceEEEEecccCceeEEEEEE Confidence 4 78999999999999999999999877754 4567788887653 355554 57999999999999999999999 Q ss_pred CCCCCEEEEEEEEEEEEecceecC Q lcl|NC_012662. 757 DGTQDMNILEIEYIIRYNQRRRRV 780 (780) Q Consensus 757 ~~P~P~tvlai~~eg~y~~r~rrv 780 (780) ++|+||+|+||+|||+||+|+||| T Consensus 785 d~P~P~tilsi~~eg~y~~r~~~v 808 (808) T protein:vir:88 785 SNPNPLNVIGCGWEGNYIRRSSGI 808 (808) T ss_pred CCCCceEEEEEEEEEEEeccccCC Confidence 999999999999999999999999 No 16 >protein:vir:78703 Length: 905 # NCBI annotation: tail tube B # Family: family:all:825 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285450;genbank:gi:148724484;genbank:GeneID:5220174 Probab=100.00 E-value=1.5e-208 Score=1160.00 Aligned_cols=760 Identities=18% Similarity=0.237 Sum_probs=633.3 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) || +|+|+||||++|||||||++|+|+||++|+||+|||+.||+||||++||+.+.... ...+++|+++|++.|+|+ T Consensus 1 M~-~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRp~~~~i~~l~~~~---~~~~~~~~~~r~~~e~y~ 76 (905) T protein:vir:78 1 MG-AVLQKIPNLLGGVSQQPDPVKLPGQVREAENVYLDPTFGCRKRPATKFVGELATNL---PSDTRWFPIFRDAGERYA 76 (905) T ss_pred Cc-cceecchhhhCceeecchhhcCCcchhhhhccccccccccccCchhhhhhhhcCCC---CCCceEEEEEeCCCceEE Confidence 99 69999999999999999999999999999999999999999999999999887543 456789999999999999 Q ss_pred EEEEcCc----EEEEEeCCCcE---EEe-cCCCCccccCCcceEEEEEeCCEEEEecCCEeeeEeecccccCCCCcceEE Q lcl|NC_012662. 81 VINTNTG----GWWLLDREAKN---IVS-EGNLSYLLAADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTTGFY 152 (780) Q Consensus 81 ~l~~~~g----~~~v~d~~~~~---~~~-~~~~~y~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~g~v 152 (780) +++..+| .|+|||+.+|+ ++. ++...|+++.+.++|+++++||++||+|++++|++.... ...++.+|++ T Consensus 77 ~~~~~~g~~~~~i~v~d~~~G~~~~V~~~~~~~~yl~~~~~~~l~~~tv~d~tfi~N~~~~~~~~~~~--~~~~~~~~~~ 154 (905) T protein:vir:78 77 VALYKDGSGNTQVRVWDMQTGAERTVTPDATATAYLATTNLNNLNWLTVADYTLLSNKERIVTMSGAS--EVDSNQRALV 154 (905) T ss_pred EEEeeCCCCCcceEEEEccCCcEEEEecCCCccceeecCCCcceEEEEEcCEEEEEcCceeeeecCCC--CcCCCCeEEE Confidence 9998887 49999996553 333 334789988888899999999999999999999875443 4466778999 Q ss_pred EecCccccceeEEEEeeCCceEEEEEEeccCCC----------------------------------------------- Q lcl|NC_012662. 153 FVKSGAFSKEYDISVVWSEGSQTVTYTTPDGTT----------------------------------------------- 185 (780) Q Consensus 153 ~v~~g~y~~~y~vti~~~~~~~t~t~tt~~~s~----------------------------------------------- 185 (780) +|++|+|+|+|.|+|+....+.+.++.++.+.. T Consensus 155 ~v~~g~y~r~y~v~I~~~~~~~~~t~~~~~a~~~s~~s~~~~~~g~~~~~~~~~~~~~t~~~~~~l~f~~~~~~~~~~~~ 234 (905) T protein:vir:78 155 EINAISYNTTYSIDLDRDGASQQVKVYRAKALEISPGSFEVEDGGVCTEHDVQNYTNQTIGSSTGLAFQVRVQCAAYLEN 234 (905) T ss_pred EEEeeccceeEEEEEeCCCCceeeeeeccccceeccccccccccccccccceeeeecceeeccCCceeEEeeccccccCC Confidence 999999999999999876554333322111100 Q ss_pred ----------------------------------------------------C--------ccccccchhhhhhhhhhhh Q lcl|NC_012662. 186 ----------------------------------------------------A--------GDADQSVPEAIARKLVEAL 205 (780) Q Consensus 186 ----------------------------------------------------~--------~~~~~~~~~~i~~~l~~~~ 205 (780) . ..+...+.++|+.+|..++ T Consensus 235 ~~~~~~~~~~~~l~~g~~~~~~~~~~~v~~~g~~y~i~i~~~~~~~~~~~~~~~~~~t~~d~~a~~~~~~~i~~~l~~~~ 314 (905) T protein:vir:78 235 NEYRSRYNVSVVLQNGGTGFRKGDMITVNLNGRDYNIRVTQEEFVYTYASDGTAAHTTPQDSTAGTLDIGQITAGLVNSV 314 (905) T ss_pred CcccccccceeeeeccccccccCccEEEeeccceEEEEEecceeEEEecCCCcccccCccCccCccccHHHHHHHHHHhh Confidence 0 0011112245666666666 Q ss_pred eecccceEEEcCeEEEEEcCCCceeEEEeecCCcceeEE-EEEeecceecccccccCCceEEEEeccCCCCCceEEEEEe Q lcl|NC_012662. 206 IAVGVDFAVRVGPYIYFELITGTDLKITSTSGSPYIGYS-NQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSS 284 (780) Q Consensus 206 ~s~g~~~~~~~g~~i~~~~~s~~~~~vt~~~g~~~~~~~-~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~ 284 (780) ...+.......|+++++.+.++.++.+++++|..+..++ ..+.|+++++||++||+|+++++++++++..|.||++|+. T Consensus 315 ~~~~~~~~~~~g~~i~v~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~~~~g~~v~v~~~~~~~~d~yyv~~~~ 394 (905) T protein:vir:78 315 NLISNYSAQAVGNVIEIERTDGRDFNLGVRGGATNRAMTAIKGTANSIVDLPGQCFDGFELKVINTENAESDDYYVVFRS 394 (905) T ss_pred cccccEEEEecCcEEEEEecCCCccEEEEeccCCcceEEEEeccccccccCccccCCCcEEEEEeCCCCCcceEEEEEEe Confidence 555554556889999999999998999999999887766 4788999999999999999999999999999999999975 Q ss_pred c------CccEEEEecccceeEEc-ccceeEEeeccc------------cccccchhhcCCcccCCCcccccCCCceEEE Q lcl|NC_012662. 285 E------KGVWLESGDYNSVTAIS-VDVPYKIVDDNV------------EQHIMEGRLAGDDLTNPAPTFLEERRITGIG 345 (780) Q Consensus 285 ~------~g~w~e~~~~~~~~~~~-~~~p~~l~~~~~------------~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~ 345 (780) . ++.|+||++++...+++ ++|||+|++... ..++|++|.+||++|||+|+|++ ++|++|+ T Consensus 395 ~~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~r~~~g~f~~~~~~~~~~~~~~~~r~~Gd~~Tnp~psf~g-~~is~v~ 473 (905) T protein:vir:78 395 AAEGIPGSGSWEETVAPGIERGFNTSTMPHALIRQADGNFTLEALNDEGTITGWAQREVGDDDTNPKPSFVG-RGISDMF 473 (905) T ss_pred cccCCcCceeEEEecccccccccccccccEEEEEecCceEEEEEeccccccccccccccCCcccCCCCcccC-CCcceEE Confidence 4 45899999999999886 589999997543 23359999999999999999997 7899999 Q ss_pred EEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCccc Q lcl|NC_012662. 346 TFQGRLVLLSGAYVCMSATGEPDRFFRSTVSSLDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLA 425 (780) Q Consensus 346 ~~q~RL~f~~~~~v~~S~~gd~~nF~~~t~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~lt 425 (780) ||||||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|++.+|| T Consensus 474 f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdDpI~~~~ss~~~~~i~~~v~~~~~L~ifT~g~ef~lsg~~~~lT 553 (905) T protein:vir:78 474 FYNNRLGFLSEDAVIMSQPGDYFNFFVTSAITISDSDPIDVTASSTKPAILRAAIGAPKGLILFAENSQFLLASQEVVFS 553 (905) T ss_pred EEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeEEEeecCCcEEEEecCceEEEecCCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998666899 Q ss_pred ccceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCceEEEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCC Q lcl|NC_012662. 426 PDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAAN 505 (780) Q Consensus 426 P~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~ 505 (780) |+|++++++|+|+|+++|+|+.+|++++|++++| ++++||||+|+ +.+|+|+++|+|+|++|||+|+++++. +++| T Consensus 554 P~s~~i~~~S~~~~~~~v~Pv~vG~~vlFv~~~g-~~s~vre~~y~-~~~d~y~a~DlT~~a~hl~~g~v~~~~--~s~~ 629 (905) T protein:vir:78 554 TATIKLTEISDYFYRSLAKPVSTGVSIAFVSEAD-TYSKIFEMSID-SVDNRPQVADITRIVPEYVPTGLTWSV--STPN 629 (905) T ss_pred ceeEEEEeEEeecccCCCCcEEeCCeEEEeecCC-CeeEEEEEEee-ecccceehhHHHHHHHHhcCCceEEEE--ecCC Confidence 9999999999999999999999999999998886 67899999998 567899999999999999999987553 3456 Q ss_pred eEEEEEEcCCCEEEEEEEeecCCceeeeeeEeeccCCcEEEEEEECCcEEEEEEEcCCCeEEEEEEeeeccCCccccccc Q lcl|NC_012662. 506 IVLMATTGDNRQVIAHEYHFTSQGKVHQAWHKWVFPYRVASLHFARDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVD 585 (780) Q Consensus 506 ~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~w~~~G~v~~~~~~~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~ 585 (780) ..++|+++++|+|++|+|+++++||+|+|||||+|+|.++++|.+.|++|++++|..+|+..+++|+|.........+.. T Consensus 630 ~~~v~~~~~~~~l~~ytyl~~~~eq~v~AWsrw~~~G~~~~~a~i~d~~~~vV~r~~~G~~~~~~~~l~~~~~~~~~d~~ 709 (905) T protein:vir:78 630 NSMMLFGDNSNTAYIFKFFNQGNERQVAGWSKWILPGEQRMCGFFADTGYFVLYDSTTGSYVLSAMELLDDPDSASIDTA 709 (905) T ss_pred CcEEEEEcCCCeEEEEEeecCCCceeEEeEEEEecCCCeEEEEEEcCCEEEEEEEccCCeEEEEEEeeccccCccccccc Confidence 66889999999999999999999999999999999999999999999999999999999999999999655443333433 Q ss_pred ---ceeeeeccce------eeecCcceeEEeeccccCCCCeEEEEEecCccccceeccccccccceEEEcCCCCCCEEEE Q lcl|NC_012662. 586 ---RLPHLDSMSI------VPVNDGKGIVPIYMRPWVSEGKLTGSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYL 656 (780) Q Consensus 586 ---~~~~lD~~~~------~~~~~~~~~~~~~~~~~~l~g~~v~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~v 656 (780) +.+|+|++.. +....+........++.|++++.+.+.+||...+...... +.++.+++ .++++|+| T Consensus 710 ~~~~~~~~d~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dG~~~~~~~~~~--~~~~~~t~---~~a~~v~V 784 (905) T protein:vir:78 710 FSSFLPRLDNYVVKSDLTVVDNGDGTLTVDLEAGQAMTGATPVIMFTDGPSEFAFSQPT--ITAGQFTV---DTTDDFVV 784 (905) T ss_pred eeeeeeccceeeecccceecccCcceEeeeccCccccccceeEEEeeCCceeeeEEEEE--eeceeecc---ccCCeEEE Confidence 3444454421 1112222334445678899999988899998754332222 22333333 35788999 Q ss_pred eEeeeEEEEcCceEEecCCCceeeecceEEEEEEEEEeccccEEEEecCCCCCcceecccCceecccccccCCccccccc Q lcl|NC_012662. 657 GFRYESLFAPTPPMLKDQNDTLISTAPVRLLRYELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDL 736 (780) Q Consensus 657 Gl~y~~~v~~~~~~i~~~~g~~~~~~r~~v~rv~v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~t 736 (780) ||+|+++++|+||+++.+++ +...+|++|+|++|+|++|++|++++++..++... .. ... ........+.+|+.++ T Consensus 785 Gl~Y~s~v~~~p~~~~~~~~-s~~~~~~rI~rv~lr~~~Sg~~~v~v~~~~~~~~~-~~-~~~-~~~~~~~~~~~p~~~t 860 (905) T protein:vir:78 785 GFKYETKITLPGFFTSEENK-ADRVYAPIVEFLYLDLYYSGRYQIEVDRIGYDTIN-ID-AGS-IDANIYLADGAPLKEI 860 (905) T ss_pred eeeeeEEEeecceEeccCCC-cccccceEEEEEEEEeecceeEEEEEcCCCcceec-cc-ccc-eecCcccCcccccccc Confidence 99999999999998876554 45567899999999999999999999988776432 22 223 3333345567788899 Q ss_pred ceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEeccee-cC Q lcl|NC_012662. 737 SRVPVPCRSNAQSTEMYLSTDGTQDMNILEIEYIIRYNQRRR-RV 780 (780) Q Consensus 737 g~~~vp~~~~~~~~~v~i~~~~P~P~tvlai~~eg~y~~r~r-rv 780 (780) |+++|||.+|+++.+|+|+|++|+||+|+||+|||+||+|+| || T Consensus 861 g~~~vP~~g~~~~~~v~I~sd~PlP~tvlsi~weg~Yn~r~~~~~ 905 (905) T protein:vir:78 861 ATENVPLFTPGDQVTVTIKAPDPFPSAITGYSWQGHYNRRGIAFI 905 (905) T ss_pred cEEEEEeeccCceeEEEEEECCCCcEEEEEEEEEEEeccceeecC Confidence 999999999999999999999999999999999999999999 78 No 17 >protein:vir:100022 Length: 976 # NCBI annotation: T7-like tail tubular protein B # Family: family:all:825 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214208;genbank:gi:61806431;genbank:GeneID:3294702 Probab=100.00 E-value=1e-205 Score=1144.41 Aligned_cols=765 Identities=18% Similarity=0.241 Sum_probs=625.5 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCc---eeEEEEEecCCCc Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGD---ALYTQYLERGADG 77 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~---~~~~~~~~r~~~~ 77 (780) || +|+|+||||++|||||||++|+|+||++|+||+|||+.||+||||++||+.+.+....... ...+|+++|+++| T Consensus 1 M~-~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRp~~~~v~~l~~~~~~~~~~~~~~~~~~~~r~~~e 79 (976) T protein:vir:10 1 MA-SVTQTIPTLTGGLSQQPDELKIPGQVSVANNVIPDVTHGLLKRPGGKLVASISDNGTAALNSQTNGKWFSYYRDETE 79 (976) T ss_pred Cc-ceeecchhhhCcceecchhhcCCchhhhhhccccccccccccCCcceeeeeecCCCcccccccccceEEEEEcCCCc Confidence 99 6999999999999999999999999999999999999999999999999998865443221 2456899999999 Q ss_pred cEEEEEEcCcEEEEEeCCCcEEE---ecCC-----CCccccCCcceEEEEEeCCEEEEecCCEeeeEeecccccCCCCcc Q lcl|NC_012662. 78 RHLVINTNTGGWWLLDREAKNIV---SEGN-----LSYLLAADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTT 149 (780) Q Consensus 78 ~~y~l~~~~g~~~v~d~~~~~~~---~~~~-----~~y~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~ 149 (780) +|++.+.++|.|+|||..+|+.. ..++ ..|+++...++|+++++||++||+|+++.|++..... ..+... T Consensus 80 ~y~~~~~~~g~~~v~~~~~G~~~~v~~~~~~~~~~~~yl~~~~~~~~~~~tv~d~tfi~N~~~~~~~~~~~~--~~~~~~ 157 (976) T protein:vir:10 80 SYIGQVSRSGDINMWRCSDGQAMTVNYDSGTATALTTYLTHTNDEDIQTLTLNDYTFLTNRTKTVAMSSTVE--PVRPPE 157 (976) T ss_pred EEEEEEecCCceEEEEccCCeEEEEEcCCCcccccchhhccCCcceeEEEEEccEEEEecCceEEeeccccc--CCCCce Confidence 99999999999999998766543 3332 2366676678999999999999999999998865443 445567 Q ss_pred eEEEecCccccceeEEEEeeCCceEEEEEEeccCCCC------------------------------------------- Q lcl|NC_012662. 150 GFYFVKSGAFSKEYDISVVWSEGSQTVTYTTPDGTTA------------------------------------------- 186 (780) Q Consensus 150 g~v~v~~g~y~~~y~vti~~~~~~~t~t~tt~~~s~~------------------------------------------- 186 (780) |+++|++|+|+|+|+|+|++...+.+.++.++...+. T Consensus 158 ~~~~v~~~~y~~~y~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~s~~~G~~~~~~~v~ 237 (976) T protein:vir:10 158 VFIDLKATAYARQYAVNLFDNTTTTAVSTVTRIDVELIKSSNNYCDSNGAMVARTSRPSNSTRCDDSAGDGRDAYAPNVG 237 (976) T ss_pred EEEEeeeeccceEEEEEEcCCcccceeeeeeeeeeccccCCcccccccccchhhHhHhhhhhcccccccccccccCceee Confidence 9999999999999999996554443333332100000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_012662. 187 -------------------------------------------------------------------------------- 186 (780) Q Consensus 187 -------------------------------------------------------------------------------- 186 (780) T Consensus 238 ~~~f~~~~G~~~~i~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~Y~~~y~~~~~v~~~~ 317 (976) T protein:vir:10 238 TKVFNVTDGASLTDEANSGSYTYTIDVKDSSNNSVNRGVNLYFRIRTVGQSVPFTTGSGSSATTTYQARYTTTFDLLYGG 317 (976) T ss_pred eeEEEeccCccceEcCCcceEEEEeeccccceEEeecCCceEEEEccccceeecccccccceeeeeeEEEEeEEEEecCC Confidence Q ss_pred --------------------------------------------ccccccchhhhhhhhhhhheecc--cc-eEEEcCeE Q lcl|NC_012662. 187 --------------------------------------------GDADQSVPEAIARKLVEALIAVG--VD-FAVRVGPY 219 (780) Q Consensus 187 --------------------------------------------~~~~~~~~~~i~~~l~~~~~s~g--~~-~~~~~g~~ 219 (780) +......+++++.+|...++..+ .+ .....|++ T Consensus 318 ~g~~~~~~~~V~v~g~~Y~it~~~~~~~~~~a~~~~~~~~~t~~d~~~~~~~~~ia~~L~~~l~a~~~~~g~tv~~~g~~ 397 (976) T protein:vir:10 318 TGWQEGDYFYVWMKDGYYKITVEAISTANVQANLGLIRPNPTPFDTETAVTAESIIGDIRTAIIATGNFTSANVQQIGTG 397 (976) T ss_pred CCcccCceEEEEccccceeeEEEEeeceeEEeccccccCcCCcCcccccccHHHHHHHHHHhhcccccccceEEEEcCcE Confidence 00001224455555555555432 12 23456788 Q ss_pred EEEEcCCCceeEEEeecCCcceeEEEEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecC-----ccEEEEec Q lcl|NC_012662. 220 IYFELITGTDLKITSTSGSPYIGYSNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEK-----GVWLESGD 294 (780) Q Consensus 220 i~~~~~s~~~~~vt~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~-----g~w~e~~~ 294 (780) +++.++++ .+.++..++. ..-...++|+++++||++||+|+.++|..+++ ..|.||++|.... ++|+||++ T Consensus 398 ~~i~~~~~-~~~~s~~~~~--~~~~~~~~V~~~~~LP~~~~~g~~v~V~~~~~-~~d~yyv~~~~~~~~~~~~~w~E~~~ 473 (976) T protein:vir:10 398 LYVTRPSG-TFNVTAPSSD--LLRVMSGEVANVDDLPSQCKHGYVVKVANSEA-DADDYYVKFFGHNNRDGDGVWEECAK 473 (976) T ss_pred EEEEecCc-ceEecCCCce--eEEEEEeeecchhhhhhhccCCcEEEEecCCC-CceeEEEEeeccccccccceEEEeec Confidence 88876665 3555544432 22234667999999999999999999966654 5689999997654 47999999 Q ss_pred ccceeEEc-ccceeEEeecc-----ccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEecCCeEEEEccCCcc Q lcl|NC_012662. 295 YNSVTAIS-VDVPYKIVDDN-----VEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLSGAYVCMSATGEPD 368 (780) Q Consensus 295 ~~~~~~~~-~~~p~~l~~~~-----~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~ 368 (780) ++..++++ ++|||.|++.+ ++.++|+.|.+||++|||+|+|++ ++|++|+||||||+|++|++|||||+|||| T Consensus 474 ~g~~~g~~~~tmP~~l~~~~~g~f~~~~~~w~~r~vGd~~tnp~psf~g-~~is~v~f~q~RL~f~s~~~v~~Srtgd~~ 552 (976) T protein:vir:10 474 PSRNIEFDKGTMPIQLVRQANGTFTVSQATWQNAEVGDELTNPNPSFVG-KTINQLVFFRNRLVFLSDENVIMSRPGEFF 552 (976) T ss_pred cccccccccccccEEEEecccCeEEeeeccccccccCCcccCcCceecc-cccceEEEEcceEEEecCCeEEEEecCCcc Confidence 99999987 58999999864 578889999999999999999997 899999999999999999999999999999 Q ss_pred ccccccccCCCCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEEEEeeecCCCCCCcEEe Q lcl|NC_012662. 369 RFFRSTVSSLDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSDLACNAFVAPVTT 448 (780) Q Consensus 369 nF~~~t~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~~~s~~~~~~~~~Pv~v 448 (780) ||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|++++|||+|++++++|+|+|+++|+|+.+ T Consensus 553 nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~e~~lsg~~~~lTP~t~~i~~~s~~~~~~~v~Pv~v 632 (976) T protein:vir:10 553 NFWSKTATTFTPQDVIDLSCSSTYPAIVYDGIQVNAGLLLFTKNQQFMLTTDSDILSPETAKINAVSSYNFNEKTHPVSL 632 (976) T ss_pred ccccccccCCCCCccEEEEecCCcceeeEEEEecCCcEEEEecCceEEEecCCceecceeEEEEEEEeeeccCCCccEEe Confidence 99999999999999999999999999999999999999999999999999866799999999999999999999999999 Q ss_pred CCeEEEEEecCCceEEEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCC Q lcl|NC_012662. 449 SQTLMYPAPRSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQ 528 (780) Q Consensus 449 g~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~ 528 (780) |++++|++++| ++++++||.++.+ +++|++.|||+|++|||+|.+. +++++++|.+++|+++++|+|++|||+++++ T Consensus 633 G~~v~Fv~~~g-~~~r~~~~~~~~~-~~~~~~~dlt~~~~~l~~g~~~-~~a~~~~~~~vv~~~~~~g~l~~~ty~~~~~ 709 (976) T protein:vir:10 633 GTTVAFIDNAN-QFTRFFEMSNVVR-QGEPDVVDQSKVISRLLDKNIS-LVSVSRENSVVFFSQKDTDKIYCFRYFTSGE 709 (976) T ss_pred CCeEEEEecCC-CeEEEEEEeeccc-ccccchhHHHHHhhhhcCCceE-EEEEcCCCcEEEEEEcCCCEEEEEEEeecCC Confidence 99999998776 6889999999865 4689999999999999999765 5788999999999999999999999999999 Q ss_pred ceeeeeeEeeccCCcEEEEEEECCcEEEEEEEcCCCeEEEEEEeeeccCC---------cccccccceeeeeccceeeec Q lcl|NC_012662. 529 GKVHQAWHKWVFPYRVASLHFARDRVVLFAADDAGSTDKITISTIDPKQG---------GVTFDVDRLPHLDSMSIVPVN 599 (780) Q Consensus 529 e~~v~aW~~w~~~G~v~~~~~~~d~l~~vv~R~~~g~~~~~~e~~~~~~~---------~~~~~~~~~~~lD~~~~~~~~ 599 (780) ||+|+|||||+|+|+|++||+++|+|||+|+|+++++.+||+|++++... ....+..++.|||++..+++. T Consensus 710 eq~v~aWsr~~~~G~v~sv~~i~D~ly~vV~r~~~g~~~r~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~ 789 (976) T protein:vir:10 710 KRLLQAWTTWTITGNIQYHCMLDDALYVVTRNNNKDQIVKYSLKLDDAGHFVTDTQGTTSTDDDSIYRVHLDHSSSVTAA 789 (976) T ss_pred ceeEEeeEEEecCCcEEEEEEeCCeEEEEEEecCCeEEEEEEEEECCccceeeeccCccccccCCcceeeeccceEEEec Confidence 99999999999999999999999999999999999999999999976432 111234567899999888765 Q ss_pred Cccee------EEeeccccCCCCeEEEEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEec Q lcl|NC_012662. 600 DGKGI------VPIYMRPWVSEGKLTGSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKD 673 (780) Q Consensus 600 ~~~~~------~~~~~~~~~l~g~~v~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~ 673 (780) .++.+ +.......+++++.+.+.+|+...... .....+.+++++|++++++++|||||+|+++++|+||++++ T Consensus 790 ~~t~~~~t~~t~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~v~g~~i~l~g~~~~~~v~VGl~Y~s~~~~~~~~i~~ 868 (976) T protein:vir:10 790 SNTYNTTTIKTTIPKPNGYESTKQLVAYDTDAGNDLGR-YALVTVSGSNLEIPGNWSNNSFIIGYLYEMDVQLPTLYVTQ 868 (976) T ss_pred cccccCCceeEEeecCccccCceeEEEEecccCccccc-ceeeeecCCeeEecCCCCCCeEEEeeeeEEEEeecceeEEe Confidence 54432 122233455677888888887653322 23345677889999999999999999999999999999998 Q ss_pred CCCceee---ecceEEEEEEEEEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEeeccccee Q lcl|NC_012662. 674 QNDTLIS---TAPVRLLRYELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQST 750 (780) Q Consensus 674 ~~g~~~~---~~r~~v~rv~v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~~~ 750 (780) ++|..+. .+||+|||++|+|.+|++|++.++...++. +.+.+.. ........+.+|+...+.+++|+.+|+++. T Consensus 869 ~~g~~~~~~~~gRl~i~r~~~~~~~tg~~~v~v~~~~~~~-~~~~~~~--~~~~~~~~~~~pl~~~~~~~vP~~~~~~~~ 945 (976) T protein:vir:10 869 QVGDKYRSDAKSSLIVHRIKFSFGPLGVYSTTIQRDGKPD-FTETKEL--GLAGVVGASRLPIVPEVIETVPCYERNTNL 945 (976) T ss_pred CCCCcccccceeeEEEEEEEEEeecccceEEEEcCCCCcc-ccccccc--cccCcccccccceecCcEEEEEeccCCcee Confidence 8765443 378999999999999999999998866542 2222221 223344455677666677899999999999 Q ss_pred EEEEEECCCCCEEEEEEEEEEEEeccee-cC Q lcl|NC_012662. 751 EMYLSTDGTQDMNILEIEYIIRYNQRRR-RV 780 (780) Q Consensus 751 ~v~i~~~~P~P~tvlai~~eg~y~~r~r-rv 780 (780) +|+|+|++|+||+|++|+|||+||+|+| || T Consensus 946 ~v~i~~d~PlP~tilsi~~eg~yn~r~~r~~ 976 (976) T protein:vir:10 946 KVNVKSEHPAPATLYSLAWEGDFTNRFYKRV 976 (976) T ss_pred EEEEEECCCCceEEEEEEEEEEeccceeecC Confidence 9999999999999999999999999998 77 No 18 >protein:vir:103790 Length: 768 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024932;genbank:gi:48697202;genbank:GeneID:2846114 Probab=100.00 E-value=4.6e-173 Score=965.42 Aligned_cols=718 Identities=13% Similarity=0.081 Sum_probs=543.4 Q ss_pred Cceeeeeeechhhcc-----cccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCC Q lcl|NC_012662. 1 MARPFEGALNDLLQG-----VSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGA 75 (780) Q Consensus 1 Ma~~v~~~~~~l~~G-----vSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~ 75 (780) ||+ ++++.+||.+| +++|+|++||++||++|+||+++|+|||+|||||+||+++++.....++++|. + . T Consensus 1 M~~-~~~~~~~F~~GelsP~l~~r~Dl~ry~~~~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~lipf~---~--~ 74 (768) T protein:vir:10 1 MPK-AAPQQVSFDAGELSPLLGARVDLAKYPNGCQVMENFIATVQGPAIRRGGKRFVAATKDSTKQSWLLPFI---V--A 74 (768) T ss_pred CCc-ceeeeeeccCceechhhcccchHHHHHHHHhhhhcceeeecCCceecCchhhhhhhcCCCCCeeEEEEE---e--c Confidence 995 89999999999 89999999999999999999999999999999999999998766555555544 2 4 Q ss_pred CccEEEEEEcCcEEEEEeCCCcEEEecCC----CCccccCCc------ceEEEEEeCCEEEEecCCEeeeEeecccccCC Q lcl|NC_012662. 76 DGRHLVINTNTGGWWLLDREAKNIVSEGN----LSYLLAADR------RSIQTTSMGGVTYILNTEKRPSATTDNSDKKD 145 (780) Q Consensus 76 ~~~~y~l~~~~g~~~v~d~~~~~~~~~~~----~~y~~~~~~------~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~ 145 (780) .++.|+++|++++||||+..+ .+...+. ..||+++++ .+|+|+|+||+|||+|+++||+++.+.++..| T Consensus 75 ~~~~y~l~fg~~~irv~~~~g-~v~~~~~~~e~~tp~~~~~l~~~~~~~~L~~~q~aD~~~i~~~~~~p~~l~r~~~~~w 153 (768) T protein:vir:10 75 DGIAYMLEFGDHYIRFFVNRG-QLVNAGAPVEIATPYALADLTTEDGTFAIRATQSADTMYLFHGGYPTQKLLRTSATTF 153 (768) T ss_pred CccEEEEEEcCCEEEEEECCc-EEEecCeeEEEEcCCCcceeecccccceeEEEeecCEEEEEcCCcceeEEEEecCCCc Confidence 678999999999999998654 4433321 223444444 57999999999999999999999999888766 Q ss_pred CCcceEEEecCccc---cceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhh--heecccceE--EEcCe Q lcl|NC_012662. 146 PKTTGFYFVKSGAF---SKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEA--LIAVGVDFA--VRVGP 218 (780) Q Consensus 146 ~~~~g~v~v~~g~y---~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~--~~s~g~~~~--~~~g~ 218 (780) .. +.+..+.+.| +.+.++++..++.+...+++.... ..+++++...+... .......|. ...|. T Consensus 154 ~l--~~~~~~~gp~~~~n~~~~vti~~s~~~~~~T~tasa~-------~~~~~~v~~~~~l~~~~~~~~~~~~~~~~~g~ 224 (768) T protein:vir:10 154 SL--QPVTFVGGPFAAVNSDNNVRVHASAGTGAVTLVASAS-------VFRPSDVGTLFYLEQEDNSFVKPWVVHQKIGP 224 (768) T ss_pred ee--EEeeecCccccccccceeEEEEecccceeEEEeecCC-------ccchhhcceeeeeeeeccccccccEEEEeeee Confidence 43 2344555555 445667777666666666654221 12222222222111 111222332 23344 Q ss_pred EEEEEcCCCceeEEEeecCCcceeEEEEEeecceecccccccCCceEEEEeccCC-----CCCceEEEEEecCccEEEEe Q lcl|NC_012662. 219 YIYFELITGTDLKITSTSGSPYIGYSNQSQVNLETDLPARLHPSADGALCAVGQS-----ERALVWYRYSSEKGVWLESG 293 (780) Q Consensus 219 ~i~~~~~s~~~~~vt~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~-----~~~~~y~~~~~~~g~w~e~~ 293 (780) +++....+.. ......+..+...+. + ++.....++...+...... .....+++|...+..|.++. T Consensus 225 ~~~~~~~~~~--~~~~~~~~~~~~~~~-------t-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 294 (768) T protein:vir:10 225 SELRRVGDRV--YLCTAVGTATPQVTG-------T-ETPTHTSGSRWDGTGQDESATDEYGSIGAEWEYQHSGYGTVLIT 294 (768) T ss_pred EEEEecCCce--EEeeeeccccccccc-------e-eccccccCceEEEecCcccccccccccceEEEEEEcCCceEEEE Confidence 4443332222 222222222211111 1 1111222333333222211 12234556666666777766 Q ss_pred cccceeEEcccceeEEeec--------cccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEecCCeEEEEccC Q lcl|NC_012662. 294 DYNSVTAISVDVPYKIVDD--------NVEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLSGAYVCMSATG 365 (780) Q Consensus 294 ~~~~~~~~~~~~p~~l~~~--------~~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~~~~v~~S~~g 365 (780) +++......+........+ ......|+.+..+++++||+|+ +|+||||||+|++|++|||||+| T Consensus 295 ~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~g~Ps--------~v~f~q~RL~f~~~~~v~~Srtg 366 (768) T protein:vir:10 295 GYTNDQVVTGTVATNDPADPGMLPNTVVTLTGTYKWARSLFNSTDGFPQ--------MGTFWRNRLCLMRDRWLAMSVSA 366 (768) T ss_pred EecCCeeEEeeeeeecCcccccccccccccCCCcccccCCCcCCCCCce--------EEEEEeeeEEEeeCCEEEEEccc Confidence 6554433322111110001 1233346666777777777765 57999999999999999999999 Q ss_pred CccccccccccCCCCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecC--CCcccccceEEEEEeeecCCCCC Q lcl|NC_012662. 366 EPDRFFRSTVSSLDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSL--QQLLAPDNASVVLTSDLACNAFV 443 (780) Q Consensus 366 d~~nF~~~t~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~--~~~ltP~~~~~~~~s~~~~~~~~ 443 (780) |||||++++++.+.|||||+++++++++|+|+|++++ ++|+|||+++||+|+|+ +++|||+|++++++|.|+++ +| T Consensus 367 d~~nF~~~s~~~~~DdD~I~~~~ss~~~~~i~~~v~~-~~L~i~T~~~q~~l~~~~~~~~lTP~~~~i~~~s~~g~~-~~ 444 (768) T protein:vir:10 367 DFETFKTKDADQQTDDSAIVQQLNARQLNKLAWMVES-DSLLIGMTGDEWVIGPANASQPVSAANLNAARRTSYGSK-RI 444 (768) T ss_pred ccccccccccccccCCccEEEEecCCcceeEEEEeec-CcEEEEecCceEEEecCCCCcccccceEEEEEeehhccc-cc Confidence 9999999999999999999999999999999999999 58999999999999875 46899999999999999765 69 Q ss_pred CcEEeCCeEEEEEecCCceEEEEEeeeeccccCceehhhHHHHHHHhcCCCc-----EEEEEeCCCCeEEEEEEcCCCEE Q lcl|NC_012662. 444 APVTTSQTLMYPAPRSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEA-----RFMQSASAANIVLMATTGDNRQV 518 (780) Q Consensus 444 ~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~-----~~~~~~~~~~~~~~~~~~~~g~l 518 (780) +|+.+|++++|++++| ++||||.|+ +++|+|+++|+|+|++||+++.. +..++++++|..++||+++||+| T Consensus 445 ~Pv~vG~~v~fv~~~g---~~vre~~y~-~~~d~y~a~DlT~~a~hl~~~~~~~~~~i~~~a~~~~p~~v~~~v~~dg~l 520 (768) T protein:vir:10 445 QPVQVGGTIMFVQKAG---RKLRDFKYD-FSSDNYVSTDVTKIADHITRGRAGTNSGIMSLCFQQEPHSVVWAARADGQL 520 (768) T ss_pred ccEEeCCeEEEEcCCC---CEEEEEEee-eecCceecchhhhhhhhhccccCccccceeeEEEeecCCeEEEEEecCCeE Confidence 9999999999998887 579999998 78999999999999999999864 56788889999999999999999 Q ss_pred EEEEEeecCCceeeeeeEeecc-CCcEEEEEEE------CCcEEEEEEEcCCCeEEEEEEeeeccCCcccccccceeeee Q lcl|NC_012662. 519 IAHEYHFTSQGKVHQAWHKWVF-PYRVASLHFA------RDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLD 591 (780) Q Consensus 519 ~~~ty~~~~~e~~v~aW~~w~~-~G~v~~~~~~------~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD 591 (780) ++|+|++++++|+|+|||||++ +|.|++||++ +|+||++|+|+++|+.++|+|+|++.... ....++++||| T Consensus 521 ~~~ty~~e~~~q~v~aW~~~~~~~g~v~~v~~i~~~~g~~d~l~~~v~r~~~g~~~~~ie~l~~~~~~-~~~~~~~~~~D 599 (768) T protein:vir:10 521 IGCTYDEEAGRSDVYGWHRHPDANGFVECVASMPAPDGASDDLWVIVRRQVNGQTVRYVEYLNPALQD-DEPQSSAFYVD 599 (768) T ss_pred EEEEEecCCCceeEEeEEEEEcCCCEEEEEEEEecCCCCccEEEEEEEecCCCeEEEEEEecCccccc-ccccccceEec Confidence 9999999888899999999975 7999999998 58999999999999999999999875433 33456788999 Q ss_pred ccceeeecCcceeEEeeccccCCCCeEEEEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEE Q lcl|NC_012662. 592 SMSIVPVNDGKGIVPIYMRPWVSEGKLTGSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPML 671 (780) Q Consensus 592 ~~~~~~~~~~~~~~~~~~~~~~l~g~~v~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i 671 (780) |+.++... .+..+.++.||||+++.+++||++++...+.++ +++++.++++|+|||+|+++++|+||++ T Consensus 600 ~~~~~~~~----~~~~~~gl~~leg~~v~v~~dG~~~~~~~v~~g-------~itl~~~~~~v~vG~~y~s~~~~~p~~~ 668 (768) T protein:vir:10 600 AGITYNGV----PTSTIAGLGHLEGVTVAVLTDGAVHPSRTVTAG-------AITLDWSASIVHIGVPTTCRIQTMQLNA 668 (768) T ss_pred cccccCCc----ceeeecCCCCcccceEEEEECCEeccCceecCC-------EEEeCCCCceEEEeEeeeEEEEecceEe Confidence 99876432 344568999999999999999999887665433 5667889999999999999999999999 Q ss_pred ecCCCceeeecceEEEEEEEEEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEeecc-ccee Q lcl|NC_012662. 672 KDQNDTLISTAPVRLLRYELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSN-AQST 750 (780) Q Consensus 672 ~~~~g~~~~~~r~~v~rv~v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~-~~~~ 750 (780) ++++|..+ .+|+||+|++|+|++|++++++++...++.... + +......+|+++.+.||++++|+.++ +++. T Consensus 669 ~~~~gs~~-~~~~ri~r~~v~~~~S~~~~~~~~~~~~~~~~~----~--~r~~~~~~~~~~~l~TG~~~v~~~~~~~~~~ 741 (768) T protein:vir:10 669 GAANGTAQ-GKTKRVTNIATRFSRSLGGVVGPTFDDNDLEQL----S--FRKPSNAMDRAVPLFDGDMESDWRGGYEGQS 741 (768) T ss_pred ecCCcccc-ccceEEEEEEEEEecccceEEEecCCCCCceee----e--eEecCcccCccCCcccCEEEEEecCCCCcce Confidence 99888665 468999999999999999999887665432211 1 12233457888888999999998765 6789 Q ss_pred EEEEEECCCCCEEEEEEEEEEEEecce Q lcl|NC_012662. 751 EMYLSTDGTQDMNILEIEYIIRYNQRR 777 (780) Q Consensus 751 ~v~i~~~~P~P~tvlai~~eg~y~~r~ 777 (780) +|+|+|++|+||+|+||+||+++|.|+ T Consensus 742 ~i~i~~d~P~P~tvlsi~~~~~~nd~~ 768 (768) T protein:vir:10 742 WICYQNDQPLPVTLLGFFPILDTQDDR 768 (768) T ss_pred EEEEEECCCCCEEEEEEEEEEEEeecC Confidence 999999999999999999999999999 No 19 >protein:vir:95324 Length: 823 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512269;genbank:gi:89152436;genbank:GeneID:3952993 Probab=100.00 E-value=9.6e-163 Score=908.83 Aligned_cols=704 Identities=13% Similarity=0.086 Sum_probs=520.3 Q ss_pred Cceeeeeeechhhcc-----cccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCC Q lcl|NC_012662. 1 MARPFEGALNDLLQG-----VSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGA 75 (780) Q Consensus 1 Ma~~v~~~~~~l~~G-----vSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~ 75 (780) |. +++..+||.+| +++|+|++||++||++|+||+++|+|||+|||||+||++++++....++++|.+ . T Consensus 1 m~--i~~~q~sF~~GElsP~l~gR~Dl~ry~~q~~~~~N~~~~~~GGl~rRpGt~fva~~~~~~g~~rLipf~~-----s 73 (823) T protein:vir:95 1 MA--ISWIQPSFAGGEIGPSLYGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPNRKCRLIPFQF-----S 73 (823) T ss_pred Cc--ceeechhccCceechheeccchHHHHHHHHhhhhCcEeeecCCceecCchhhhhhhcCCCCCeeEEEEEe-----C Confidence 99 67888999999 999999999999999999999999999999999999999988776666665543 3 Q ss_pred CccEEEEEEcCcEEEEEeCCCcEEEecCC-----CCccccCCcceEEEEEeCCEEEEecCCEeeeEeecccccCCCCcce Q lcl|NC_012662. 76 DGRHLVINTNTGGWWLLDREAKNIVSEGN-----LSYLLAADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTTG 150 (780) Q Consensus 76 ~~~~y~l~~~~g~~~v~d~~~~~~~~~~~-----~~y~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~g 150 (780) .++.|+++|++++||||+..+ .++..++ ..||+++++++|+|+|+||+|||+|+++||+++.|.++..|.... T Consensus 74 ~~q~y~Lefg~~~irV~~~~g-~vv~~~~~~~ev~tPy~~~~l~~Lr~~qsaD~~fivh~~~~p~~L~r~~~~~w~l~~- 151 (823) T protein:vir:95 74 TVQTYALEFGHQYMRVIKDGA-LVLNSSNVIYEIATPYTEADLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQLVD- 151 (823) T ss_pred CCcEEEEEEcCCeEEEEeCCc-EEEecCCceeEEecccccccccceeEEEeccEEEEEcCCccceEEEecCCCCceEEE- Confidence 578999999999999996544 4444433 245788999999999999999999999999999999887664432 Q ss_pred EEEecCcccc---ceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheecccceEEEcCeEEEEEcCCC Q lcl|NC_012662. 151 FYFVKSGAFS---KEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVGVDFAVRVGPYIYFELITG 227 (780) Q Consensus 151 ~v~v~~g~y~---~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g~~~~~~~g~~i~~~~~s~ 227 (780) +..+.++|. .++++++.........+ .++......+++.....- +..........| .+-..... T Consensus 152 -~~~~~gp~~~~~~~~t~~v~~~~~~~~~t--~ta~~~~~~~d~vg~~~~---l~~~~~~~~~~~-----~~~~~~~~-- 218 (823) T protein:vir:95 152 -VVTKNGPFEDINIDESLTVYASASTGTIT--LTASASIFGAEQVGKLFY---LEQPAVDSVPVW-----ETSKSTSI-- 218 (823) T ss_pred -EEEeccccccccccceeEEeccccCceeE--EeecccccchhhccceEE---Eeccccceeeec-----ceeeeecc-- Confidence 333455553 44555665444333333 323322333332221100 000000000000 00000000 Q ss_pred ceeEEEeecCCcceeEEEEEeecceecccccccCCceEEEEeccCCCCCceEEEE---EecCccEEEEeccccee--EEc Q lcl|NC_012662. 228 TDLKITSTSGSPYIGYSNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRY---SSEKGVWLESGDYNSVT--AIS 302 (780) Q Consensus 228 ~~~~vt~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~---~~~~g~w~e~~~~~~~~--~~~ 302 (780) ......+...+..+.. + .+.+..|...+..+..... .+..+++|..| ....|.|+++...+... ... T Consensus 219 --~~~~~~~~~~~~~~~~-~--~~g~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~g~~~~t~v~~~~~~~~~~ 290 (823) T protein:vir:95 219 --GDIRRADSNYYRAVTA-G--KTGTLRPSHTEGTSWDGWG---GSGDDDTGIEWEYLHSGFGIARITAVNGTTATAEVI 290 (823) T ss_pred --cceEEecccceeeeec-c--ccceeecccCCcceEEece---ecccccceeEEEEEeCCcceEEEEeecceeeeceEe Confidence 0000001111111110 0 0111112222222222221 22334444433 34456777765443322 222 Q ss_pred ccceeEEeeccccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEec----CCeEEEEccCCccccccccccCC Q lcl|NC_012662. 303 VDVPYKIVDDNVEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLS----GAYVCMSATGEPDRFFRSTVSSL 378 (780) Q Consensus 303 ~~~p~~l~~~~~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~----~~~v~~S~~gd~~nF~~~t~~~~ 378 (780) ..||+.+.+.....++|+...|++ +| +||++|+||||||+|++ |++|||||+||||||+++++ + T Consensus 291 ~~~~~~~~~~~~~t~~~~~~~~~~--~~--------g~Ps~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~~--~ 358 (823) T protein:vir:95 291 SYIPSQVVGEDNASYKWAKYAWNS--VN--------GYPGTVVYYQQRLYFAASTAFPQTIWASRTGDYKDFGKSNP--T 358 (823) T ss_pred eeeccccccCCcCCccccccccCc--CC--------CCccEEEEEeceEEEEEcCCCCcEEEEeccCCccccccccC--C Confidence 468898888888888888876653 23 46678999999999995 79999999999999999984 5 Q ss_pred CCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCC-CcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEe Q lcl|NC_012662. 379 DPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQ-QLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAP 457 (780) Q Consensus 379 ~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~-~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~ 457 (780) +|||||+++++++++|.|+|+++++ +|++||+++||+|++++ ++|||+|++++++|.|++ ++|+|+.+|+.++|+++ T Consensus 359 ~DdD~I~~~~s~~~~~~i~~~v~~~-~Lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~~~Fv~~ 436 (823) T protein:vir:95 359 QDDDRIIYTYAGRQVNEIRHLIDVG-SLVALTSGGEYVITGDQNKVLTPSSFAFSSQGSNGS-SNVPPIAVANIALFVQE 436 (823) T ss_pred CCCCcEEEEEcCCcceEEEEEeecC-cEEEEecCcEEEEEcCCCcccceeeEEEEEeecccc-ccccceEeCCeEEEEec Confidence 7999999999999999999999995 79999999999998754 689999999999999865 67999999999999987 Q ss_pred cCCceEEEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeeeEe Q lcl|NC_012662. 458 RSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAWHK 537 (780) Q Consensus 458 ~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~ 537 (780) +| ++||||+|+ +.+++|+++|+|+|++||+++..+..++++++|..++||+++||+|++|+|+ +||+|.|||| T Consensus 437 ~g---~~vre~~~~-~~~d~~~~~dlT~~a~hl~~~~~i~~~a~~~~p~~~~~~v~~dG~l~~~ty~---~~q~v~aW~~ 509 (823) T protein:vir:95 437 KG---SVVRDLAYS-FDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRDDGKLLVMTYL---RDQQVFAWAP 509 (823) T ss_pred CC---CEEEEEEEe-eecCceecchhhhhhhhhcCCCceEEEEEecCCCeEEEEEecCCcEEEEEEe---cccceeeeEE Confidence 65 579999998 6789999999999999999998888889999999999999999999999997 5788999999 Q ss_pred eccCCcEEEEEEE----CCcEEEEEEEcCCCeEEEEEEeeeccCCcccccccceeeeeccceeeec-------------- Q lcl|NC_012662. 538 WVFPYRVASLHFA----RDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVN-------------- 599 (780) Q Consensus 538 w~~~G~v~~~~~~----~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~-------------- 599 (780) |+++|+|+++|++ +|+|||+|+|++||+.++|||+|++.++... .+.+||||+.++... T Consensus 510 ~~~~g~~~~~~~i~~~~~d~l~~~v~R~i~g~~~~yiE~~~~~~~~~~---~~~~~lD~~~s~~g~~~~~~~~~l~~g~~ 586 (823) T protein:vir:95 510 QSSTGKYESTCSISEGNEDAVYFVVNRTVNGQTVRYIERLSSRLFTSD---EDAFFVDSGLSYDGRNTSDRTMTITGGSG 586 (823) T ss_pred EecCCcEEEEEEecCCCCCEEEEEEEeccCCeEEEEEEeeccccCCCc---cceeEEEEEEEeecCcccceeeEecCCCC Confidence 9999999999998 5889999999999999999999998776433 346677776443210 Q ss_pred -------------Ccce--------------------------------------------------------------- Q lcl|NC_012662. 600 -------------DGKG--------------------------------------------------------------- 603 (780) Q Consensus 600 -------------~~~~--------------------------------------------------------------- 603 (780) ++.. T Consensus 587 ~l~~l~g~~v~~adg~~~~~~~v~g~i~l~~~~~~~~vGl~~~~~i~~~~~~v~~~~a~~~~~~r~v~a~l~~~~t~~~~ 666 (823) T protein:vir:95 587 EWDYLAEYTISVSGGAYFTSSDVGAQLQFPYTGADPDTGYEVSKELRCDIISVTSNTAVVVRANRNVPPSLRNVATTNWQ 666 (823) T ss_pred cccccCceEEEecCcceECCccceeEEEeCcCCCccccccceEEEEEEeeceeeCCceEEEccCCcccceeeeeeccccc Confidence 0000 Q ss_pred -eEEeeccccCCCCeEEEEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCceeeec Q lcl|NC_012662. 604 -IVPIYMRPWVSEGKLTGSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLISTA 682 (780) Q Consensus 604 -~~~~~~~~~~l~g~~v~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~~~ 682 (780) ....+.||+||||++|.+++||++++..++.+| +++++.+++.|||||+|+++++||||++..+ |.++++ T Consensus 667 ~~~~~~~gL~hleg~tv~v~~dg~~~~~~~v~~G-------~vtl~~~~~~v~vGl~~~~~~~~l~~~~~~~-g~~~g~- 737 (823) T protein:vir:95 667 MARRTFGGLSHLEGQTVNILSDANVEPQKVVSGG-------AVTLESPGAVVHIGLPITAEFETLDININGQ-ETLLDK- 737 (823) T ss_pred cccceeeeccccccceEEEEEcCeeeCCeEecCC-------EEEecCCCCEEEEeecceeeEEecchhcCCC-cccCCc- Confidence 012346799999999999999999998876544 4556788999999999999999999998764 655433 Q ss_pred ceEEEEEEEEEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEe-ecccceeEEEEEECCCCC Q lcl|NC_012662. 683 PVRLLRYELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPC-RSNAQSTEMYLSTDGTQD 761 (780) Q Consensus 683 r~~v~rv~v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~-~~~~~~~~v~i~~~~P~P 761 (780) ++||+++.++|++|.+++++.+....++. .++. ...+|+||.++||++++++ .+|+++++|+|+|++||| T Consensus 738 ~~ri~~~~~~~~~s~~~~~g~~~~~l~~~--------~~r~-~~~~~~~~~~~tG~~~~~~~~~~~~~~~~~i~q~~plp 808 (823) T protein:vir:95 738 KQVIPSVTLVVNASRGIWATTPGGKWYEY--------PQRE-FEFYDDPVDDATGKVEVKLDSNWGKNGRVKIRQLDPLP 808 (823) T ss_pred eeEEeEEEEEEEeeeeEEEecCCCceeEe--------eccC-CCcccCCCCcccceEEEecCCCcCCccEEEEEEcCCCc Confidence 46899999999999999997755433221 1122 2457999999999999988 689999999999999999 Q ss_pred EEEEEEEEEEEEecc Q lcl|NC_012662. 762 MNILEIEYIIRYNQR 776 (780) Q Consensus 762 ~tvlai~~eg~y~~r 776 (780) ||||||..|...+-= T Consensus 809 ~tvl~v~~~~~~~g~ 823 (823) T protein:vir:95 809 LSVLAVIPRLTVGGF 823 (823) T ss_pred eEEEEEEEEEEecCC Confidence 999999987665443 No 20 >protein:vir:7329 Length: 825 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848220;genbank:gi:30387391;genbank:GeneID:2641863 Probab=100.00 E-value=3.4e-160 Score=894.87 Aligned_cols=706 Identities=11% Similarity=0.067 Sum_probs=505.0 Q ss_pred Cceeeeeeechhhcc-----cccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCC Q lcl|NC_012662. 1 MARPFEGALNDLLQG-----VSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGA 75 (780) Q Consensus 1 Ma~~v~~~~~~l~~G-----vSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~ 75 (780) |+-...| +||.+| +..|.|++||++||++|+||+++|+||++||||++||++++++....++++|.+ + T Consensus 1 m~~~~~q--~sF~~GElsP~l~gR~Dl~~y~~g~~~~~N~~~~p~Gg~~rRpGt~fva~~~~~~~~~rLipF~f-----s 73 (825) T protein:vir:73 1 MAFSWIQ--PSFAGGEIGPSLYGRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPAKYPDRKCRLIPFQF-----S 73 (825) T ss_pred Cccceec--cccccceechhhcccchHHHHHHHHHHhcCcEEEecCCceecCchHHhHhhcCCCCCEEEEEEEe-----C Confidence 9954444 899999 667999999999999999999999999999999999999998877777776654 4 Q ss_pred CccEEEEEEcCcEEEEEeCCCcEEEecCC-----CCccccCCcceEEEEEeCCEEEEecCCEeeeEeecccccCCCCcce Q lcl|NC_012662. 76 DGRHLVINTNTGGWWLLDREAKNIVSEGN-----LSYLLAADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTTG 150 (780) Q Consensus 76 ~~~~y~l~~~~g~~~v~d~~~~~~~~~~~-----~~y~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~g 150 (780) .+|+|+|+|++++||||...+ .++..++ .+||+++++++|+|+|++|+|||+|+++||+++.|.++..|... . T Consensus 74 ~~q~y~Lefg~~~lrv~~~gg-~v~~~~~~~~e~~TPy~~~~l~~l~~~QsaD~~~i~h~~~pp~~L~r~~~~~W~l~-~ 151 (825) T protein:vir:73 74 TVQTYALEFGHNYMRVIKDGA-YVLTTSNVIYELAMPYADTDLFRIKFTQSADVLTLVHPAYPPKELRRYAHDNWQIV-D 151 (825) T ss_pred CCcEEEEEEeCCeEEEEeCCc-eEeccCCceEEEecccchhhhhhheeeeecCEEEEEcCCCceeEEEEecCCCcEEE-E Confidence 689999999999999997544 4443332 34578899999999999999999999999999999887655432 2 Q ss_pred EEEecC--ccccceeEEEEeeCCceEEEEEEeccCCCCccccccchh-hhhhhhhhhheecccceEEEcCeEEEEEcCCC Q lcl|NC_012662. 151 FYFVKS--GAFSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPE-AIARKLVEALIAVGVDFAVRVGPYIYFELITG 227 (780) Q Consensus 151 ~v~v~~--g~y~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~-~i~~~l~~~~~s~g~~~~~~~g~~i~~~~~s~ 227 (780) +.+..+ ...+.+..+++..++...+.+.+... +....++.... ++...................+.+.. ... T Consensus 152 ~~f~~gp~~~in~~~sv~v~asg~tg~~TiTaS~--a~~~~~~vG~~i~~~~~~v~si~~~~~~~~~~~~~v~~---~~~ 226 (825) T protein:vir:73 152 VTTKNGPFEDINVDETVKVYASASTGTITLTASS--AIFGAEQVGKLFYLEQPAVDSVPVWETSKTTAINDVRR---ADS 226 (825) T ss_pred EeccCCccccccccccceeeecccCceeEEEeec--cccCchhcCeEEEEecccccccceeeeeeEEEeeeEEE---CCC Confidence 222211 12233344444444333333333222 12222211110 00000011111111111111111110 011 Q ss_pred ceeEEEeecCCcceeEEEEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEecc-cceeEEcc--- Q lcl|NC_012662. 228 TDLKITSTSGSPYIGYSNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDY-NSVTAISV--- 303 (780) Q Consensus 228 ~~~~vt~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~-~~~~~~~~--- 303 (780) ..+..+... ...+++..+..+.................+.+...++.+.++.+. +......+ T Consensus 227 ~~~~~~~~~--------------~~~t~~~~a~~g~~~~~~~g~~~~~~~~~~~~~~~~~g~~~it~~~~~~~~~~~~~~ 292 (825) T protein:vir:73 227 NYYRANTSG--------------KTGTLRPSHTEGMSWDGWGGTGSDDTGIQWEYLHSGFGIAKITAVAGDGLTATADVV 292 (825) T ss_pred ceeeeeccc--------------ccceeeccccCCceeEeeeeecccCCceEEEEEecCCceEEEeeccccceeeccccc Confidence 111110000 000111111112111111111111111222333333334444433 22222222 Q ss_pred -cceeEEeeccccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEe----cCCeEEEEccCCccccccccccCC Q lcl|NC_012662. 304 -DVPYKIVDDNVEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLL----SGAYVCMSATGEPDRFFRSTVSSL 378 (780) Q Consensus 304 -~~p~~l~~~~~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~----~~~~v~~S~~gd~~nF~~~t~~~~ 378 (780) .+|+.+++.+...++|+...| .++++ ||+.|+||||||+|+ +|++|||||+||||||++++ ++ T Consensus 293 ~~~~~~~~~~~~~t~~~~~~~~--~~~~g--------yPs~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~--~~ 360 (825) T protein:vir:73 293 SFIPSQVVGSANASYKWAKYAW--NSVNG--------YPSTVVYYQQRLYFAASTAYPQTIWASRTGDYKDFGKNN--PI 360 (825) T ss_pred eecccccccCCCCCcccccCCc--ccCCC--------CccEEEEEcceEEEeecCCCCCEEEEEccCCccccccCC--CC Confidence 345555555555555555443 33444 455689999999999 58999999999999999998 46 Q ss_pred CCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecC-CCcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEe Q lcl|NC_012662. 379 DPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSL-QQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAP 457 (780) Q Consensus 379 ~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~-~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~ 457 (780) +|||||+++++++++|.|+|+++++ +|+|||+++||+|+++ +++|||+|++++++|.|+++ +|+|+.+|++++|+++ T Consensus 361 ~DdD~I~~~~s~~~~~~i~~~~~~~-~L~~~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~~-~~~Pv~vg~~~~Fv~~ 438 (825) T protein:vir:73 361 QDDDRIIYTYAGRQVNEIRHLIDVG-NLVALTSGGEYTISGDQNKVLTPSAFSFSSQGNNGSS-NVPPIAVANIALFIQE 438 (825) T ss_pred CCCccEEEEEcCCcceeEEEEeecC-cEEEEecCceEEEecCCCcccceeeEEEEeeeeeccc-cccceEeCCeEEEEeC Confidence 8999999999999999999999985 8999999999999975 46999999999999999764 6999999999999976 Q ss_pred cCCceEEEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeeeEe Q lcl|NC_012662. 458 RSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAWHK 537 (780) Q Consensus 458 ~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~ 537 (780) +| ++||||.|+ +.+++|+++|+|+|++||+++..+..++++++|..++|++++||+|++|+|+ +||+|.|||| T Consensus 439 ~g---~~vre~~~~-~~~d~~~~~dlt~~a~hl~~~~~~~~~a~~~~p~~~~~~v~~dg~l~~~ty~---~~q~v~aW~~ 511 (825) T protein:vir:73 439 KG---SVVRDLAYS-FDVDGYQGTDLTILANHLFQKHSIVDWSFCIVPYSSAFCIRDDGKLLVLTYL---RDQQVFAWAP 511 (825) T ss_pred CC---CeEEEEEEe-eecCceeccchhhhhHhhccCCceEEEEEcCCCceEEEEEecCCeEEEEEEe---ccccceeeEE Confidence 65 579999998 6789999999999999999998888889999999999999999999999997 5788999999 Q ss_pred eccCCcEEEEEEE----CCcEEEEEEEcCCCeEEEEEEeeeccCCcccccccceeeeeccceeee--------------- Q lcl|NC_012662. 538 WVFPYRVASLHFA----RDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPV--------------- 598 (780) Q Consensus 538 w~~~G~v~~~~~~----~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~--------------- 598 (780) |+++|+|+++|++ +|+||++|+|+++|+.++|+|+|++..+.+. ++.+||||+..++. T Consensus 512 ~~~~g~v~~~~~i~~~~~D~l~~iV~R~~~g~~~~yiE~~~~~~~~~~---~~~~~vD~g~~~~g~~~~~~l~~l~g~tv 588 (825) T protein:vir:73 512 QSSAGKYESTCSISEGSEDAVYFVVNRTINGQTVRYIERLSSRLFTND---EDAFFVDCGLSYDGRNTSSRTMTISGGTG 588 (825) T ss_pred EecCCcEEEEEEecCCCccEEEEEEEEeeCCceEEEEEEecccccCCC---cceeEEEEEeeecccceeeceeeeCCceE Confidence 9999999999999 5789999999999999999999998776544 34567777543321 Q ss_pred ------------cCcce--------------------------------------------------------------- Q lcl|NC_012662. 599 ------------NDGKG--------------------------------------------------------------- 603 (780) Q Consensus 599 ------------~~~~~--------------------------------------------------------------- 603 (780) .++.. T Consensus 589 ~~~~~g~~~~~v~~g~itl~~~~~~~i~l~~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~v~~~~~~~a~~~~~~~t~~~ 668 (825) T protein:vir:73 589 DWSYQVDYPVTVSGGAYFVNTDVGAQIQFPYTGTDPDTNEPVAKELRGDIISVTSNTAVVVRFNRNVPPVLRNVATTNWQ 668 (825) T ss_pred EEEeCCeEEEEEcCCeEEecccceEEEEecccCcccccccceeceeeEEEccccCceEEEEEecccccceeeeecccCCC Confidence 00000 Q ss_pred -eEEeeccccCCCCeEEEEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCceeeec Q lcl|NC_012662. 604 -IVPIYMRPWVSEGKLTGSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLISTA 682 (780) Q Consensus 604 -~~~~~~~~~~l~g~~v~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~~~ 682 (780) ....++||+||||++|.|++||+++++.++.+| .++++.++++|||||+|+++++||||++..+ |.++++ T Consensus 669 ~a~~~~~gL~hLeG~~v~v~~Dg~~~~~~~V~~G-------~vtl~~~~~~v~vGl~y~~~~~~l~~~~~~~-g~~~g~- 739 (825) T protein:vir:73 669 MARQTFSGLAHLEGQTVNILSDASVEPQKTVTGG-------AVTLESPGAVVHIGLPITAEFETLDININGQ-ETLLDK- 739 (825) T ss_pred cchheeccccccCCceEEEEECCeeeCCeEecCc-------EEEecCCceEEEEeeCccceEEecccccCCC-ccccCc- Confidence 012347899999999999999999998876543 4556678999999999999999999998754 655533 Q ss_pred ceEEEEEEEEEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEe-ecccceeEEEEEECCCCC Q lcl|NC_012662. 683 PVRLLRYELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPC-RSNAQSTEMYLSTDGTQD 761 (780) Q Consensus 683 r~~v~rv~v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~-~~~~~~~~v~i~~~~P~P 761 (780) ++||+++.++|++|.+++++.+....+..+ +.. .+.+|+||.++||++++++ .+|+++++|+|+|++||| T Consensus 740 ~~ri~~~~~~~~~s~~~~~g~~~~~l~~~~--------~r~-~~~~~~~~~~~tG~~~~~~~~~~~~~~~~~i~q~~PlP 810 (825) T protein:vir:73 740 KQVIPTVTMVVNASRGIWATTPGGTWYEYP--------QRE-FEFYDDPVDDATGKVEVKLDSNWDKNGRVKVRQLDPLP 810 (825) T ss_pred cEEEEEEEEEEEeeeeEEEecCCCcceEee--------ccC-CCcccCCCccccCcEEEecCCCCCCccEEEEEEcCCCC Confidence 568999999999999999987655332211 122 2457999999999999998 689999999999999999 Q ss_pred EEEEEEEEEEEEecc Q lcl|NC_012662. 762 MNILEIEYIIRYNQR 776 (780) Q Consensus 762 ~tvlai~~eg~y~~r 776 (780) ||||||..|...+-= T Consensus 811 ~tvlav~~~~~~~g~ 825 (825) T protein:vir:73 811 LSVLAVLPRLTVGGF 825 (825) T ss_pred EEEEEEEEEEEecCC Confidence 999999988775544 No 21 >protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878 Probab=100.00 E-value=7.2e-158 Score=882.10 Aligned_cols=659 Identities=13% Similarity=0.102 Sum_probs=494.0 Q ss_pred Cceeeeeeechhhcc-----cccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCC Q lcl|NC_012662. 1 MARPFEGALNDLLQG-----VSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGA 75 (780) Q Consensus 1 Ma~~v~~~~~~l~~G-----vSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~ 75 (780) ||+ +.-..+||.+| +..|.|++||+++|++|+||++.|+||++|||||+||+++++.+...+++||++ + T Consensus 1 m~~-~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~-----~ 74 (681) T protein:vir:10 1 MSN-VRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTY-----S 74 (681) T ss_pred Ccc-eeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEe-----C Confidence 997 67788999999 557999999999999999999999999999999999999999887777777765 4 Q ss_pred CccEEEEEEcCcEEEEEeCCCcEEEecC----CCCccccCCcceEEEEEeCCEEEEecCCEeeeEeecccccCCCCcceE Q lcl|NC_012662. 76 DGRHLVINTNTGGWWLLDREAKNIVSEG----NLSYLLAADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTTGF 151 (780) Q Consensus 76 ~~~~y~l~~~~g~~~v~d~~~~~~~~~~----~~~y~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~g~ 151 (780) .+|+|+|+|++++||||. .++.++..+ ..+||+++++.+|+|+|+||+|||+|+++||+++.|.++..|.... + T Consensus 75 ~~~~~~l~~g~~~~r~~~-~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~W~l~~-~ 152 (681) T protein:vir:10 75 VTQTMVIELGAGYFRFHT-NGGTLLDGAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATNWQLAT-I 152 (681) T ss_pred CCceEEEEEeCCeEEEEe-CCcEEeeCcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCceEEEE-E Confidence 689999999999999994 444544322 1346899999999999999999999999999999998887664321 1 Q ss_pred EEecCccccceeEEEEeeCCc--eEEEEEEeccCCCCccccccchhhhhhhhhhhheecccceEEEcCeEEEEEcCC-Cc Q lcl|NC_012662. 152 YFVKSGAFSKEYDISVVWSEG--SQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVGVDFAVRVGPYIYFELIT-GT 228 (780) Q Consensus 152 v~v~~g~y~~~y~vti~~~~~--~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g~~~~~~~g~~i~~~~~s-~~ 228 (780) ....+.+. ..+++...... ..+..+... ............+ .. T Consensus 153 -~f~~~p~~-p~~~~at~~~~~~~~t~~~~v~--------------------------------avda~t~~~s~~~~~~ 198 (681) T protein:vir:10 153 -AFTSPVAT-PTSVTATSNNKGTDYTYRYVVT--------------------------------ALDAEGKTESAPSSAG 198 (681) T ss_pred -Eecccccc-ceeeeeeccCCccceeEeEEEE--------------------------------EeecccceeecCCcce Confidence 11112211 00011000000 000000000 0000000000000 00 Q ss_pred eeEEEeecCCcceeEEEEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEecccceeEEcccceeE Q lcl|NC_012662. 229 DLKITSTSGSPYIGYSNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVTAISVDVPYK 308 (780) Q Consensus 229 ~~~vt~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~~~~~~~~~~~p~~ 308 (780) .......++ +....+...... ...+| +.....+.+.+..+...... ..+.. T Consensus 199 tvt~~~~~~------------------------~~~~t~~w~a~~-g~~~~-~V~~~~~gi~g~ig~~~~~~---~~~~~ 249 (681) T protein:vir:10 199 TCTNNLFTN------------------------GGANTIAWSASS-GASRY-NVYKEQGGLYGYIGQTTGTS---LVDDN 249 (681) T ss_pred EEeeeeecC------------------------CcceeEEEEecC-Cceee-eecccceeEEEEeeccceee---eeecc Confidence 000000000 001111111111 11111 22122222222222222221 11222 Q ss_pred EeeccccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEe----cCCeEEEEccCCccccccccccCCCCCccE Q lcl|NC_012662. 309 IVDDNVEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLL----SGAYVCMSATGEPDRFFRSTVSSLDPTDRI 384 (780) Q Consensus 309 l~~~~~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~----~~~~v~~S~~gd~~nF~~~t~~~~~ddD~i 384 (780) +.......++|.. ++|..+ ++||++|+||||||+|+ +||+|||||+||||||++++ +++||||| T Consensus 250 ~~~~~~~t~~~~~--------~~~~~~--~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~--~~~ddD~i 317 (681) T protein:vir:10 250 IAPDLSVTPPIYD--------AVFNAA--GDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSL--PVRDDDRV 317 (681) T ss_pred cccCccccccccc--------cccccC--CCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccC--CCCCCccE Confidence 2222222334543 333332 46899999999999999 58999999999999999988 46799999 Q ss_pred EEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecC-CCcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCceE Q lcl|NC_012662. 385 DIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSL-QQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFS 463 (780) Q Consensus 385 ~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~-~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~ 463 (780) +++++++++|.|+|+++++ +|+|||+++||.|+++ +++|||+|++++++|.|++ ++|+|+.+|++++|++++| + T Consensus 318 ~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~fv~~~g---~ 392 (681) T protein:vir:10 318 AFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTIYGAARG---G 392 (681) T ss_pred EEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeeeecc-ccccceeeCCeEEEEecCC---C Confidence 9999999999999999995 7999999999999864 5799999999999999976 5799999999999998876 4 Q ss_pred EEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeeeEeeccCCc Q lcl|NC_012662. 464 AVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAWHKWVFPYR 543 (780) Q Consensus 464 ~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~w~~~G~ 543 (780) +||||.|+ +.+|+|+++|+|++++|++++..+..++++++|..++||+++||+|++|+|+ +||+|.|||||+++|+ T Consensus 393 ~vre~~y~-~~~d~~~~~dlt~~a~Hl~~~~~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~---~eq~v~aW~~~~~~g~ 468 (681) T protein:vir:10 393 HVRELAYN-WQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFISSSGKLLGLTYV---PEQQIGAWHQHDTDGV 468 (681) T ss_pred EEEEEEEe-eecCceeccchhhhhhhhcCCCCeEEEEEecCCCEEEEEEecCCcEEEEEEe---cccceeeEEEEecCCc Confidence 79999998 6889999999999999999986677778888999999999999999999997 5788999999999999 Q ss_pred EEEEEEE----CCcEEEEEEEcCCCeEEEEEEeeeccCCcccccccceeeeeccceeeecCcceeEEeeccccCCCCeEE Q lcl|NC_012662. 544 VASLHFA----RDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKGIVPIYMRPWVSEGKLT 619 (780) Q Consensus 544 v~~~~~~----~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~l~g~~v 619 (780) |++||++ +|+||++|+|+++|..++|+|+|+...... .....|+||+.++.. .+...+.+++|+||+++ T Consensus 469 v~~v~~i~~~~~d~l~~vv~r~~~g~~~~yie~~~~~~~~~---~~~~~~vD~~~t~~~----~~~~~~sgl~~leG~tv 541 (681) T protein:vir:10 469 FESCAVVAEGNEDRLYAVVRRTIGGNEVRYVERMASRQFDA---QADAFFVDSGLTYSG----EPVSHISGLEHLEGKTV 541 (681) T ss_pred EEEEEEecCCCCcEEEEEEEecCCCCeEEEEEecCCccccc---cccceEeeccccccC----cceeeeccccCCCCcEE Confidence 9999998 578999999999999999999998765322 334579999987643 23445789999999999 Q ss_pred EEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCceeeecceEEEEEEEEEeccccE Q lcl|NC_012662. 620 GSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLISTAPVRLLRYELTTRNTGEF 699 (780) Q Consensus 620 ~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~~~r~~v~rv~v~~~~T~~~ 699 (780) .+++||.++++.++.+| .++++.++++|+|||+|+++++|+||+++.++|..++ ++++|+|+.|++++|.++ T Consensus 542 ~i~aDG~~~~~~~V~~G-------~itl~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g-~~~ri~rv~lr~~~S~g~ 613 (681) T protein:vir:10 542 SILADGAVHPQRVVTDG-------AIDLDVEAGTVHIGLPITAELQTLPVAMQLDGSFGQG-RVKNINKLWLRVHRSSGI 613 (681) T ss_pred EEEeCCeecCcEeecCc-------EEEeCcCCceEEEeeeceeEEEecceeeecCCcccCC-ceEEEEEEEEEEEcccce Confidence 99999999988876544 4557788999999999999999999999998886654 467999999999999999 Q ss_pred EEEecCCCCCcceecccCceecccccccCCcccccccceEEEEee-cccceeEEEEEECCCCCEEEEEEEEEEEEec Q lcl|NC_012662. 700 DVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCR-SNAQSTEMYLSTDGTQDMNILEIEYIIRYNQ 775 (780) Q Consensus 700 ~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~-~~~~~~~v~i~~~~P~P~tvlai~~eg~y~~ 775 (780) ++.++.+..+.. .. ...+.+|.+|.+.||++++|+. +|+++.+|+|+|++|+||+|+||+||....- T Consensus 614 ~~~~~~~~l~~~--~~-------~~~~~~g~~~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~vgg 681 (681) T protein:vir:10 614 FAGPHADALTEV--KQ-------RTSEPYGSPPALKSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIAIGA 681 (681) T ss_pred EEeeCCCceEEE--EE-------eccccccccCCccCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEEeeC Confidence 998876544321 11 1235678888889999999986 6799999999999999999999999999888 No 22 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=100.00 E-value=7.2e-158 Score=882.10 Aligned_cols=659 Identities=13% Similarity=0.102 Sum_probs=494.0 Q ss_pred Cceeeeeeechhhcc-----cccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCC Q lcl|NC_012662. 1 MARPFEGALNDLLQG-----VSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGA 75 (780) Q Consensus 1 Ma~~v~~~~~~l~~G-----vSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~ 75 (780) ||+ +.-..+||.+| +..|.|++||+++|++|+||++.|+||++|||||+||+++++.+...+++||++ + T Consensus 1 m~~-~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~-----~ 74 (681) T protein:vir:98 1 MSN-VRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTY-----S 74 (681) T ss_pred Ccc-eeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEe-----C Confidence 997 67788999999 557999999999999999999999999999999999999999887777777765 4 Q ss_pred CccEEEEEEcCcEEEEEeCCCcEEEecC----CCCccccCCcceEEEEEeCCEEEEecCCEeeeEeecccccCCCCcceE Q lcl|NC_012662. 76 DGRHLVINTNTGGWWLLDREAKNIVSEG----NLSYLLAADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTTGF 151 (780) Q Consensus 76 ~~~~y~l~~~~g~~~v~d~~~~~~~~~~----~~~y~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~g~ 151 (780) .+|+|+|+|++++||||. .++.++..+ ..+||+++++.+|+|+|+||+|||+|+++||+++.|.++..|.... + T Consensus 75 ~~~~~~l~~g~~~~r~~~-~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~W~l~~-~ 152 (681) T protein:vir:98 75 VTQTMVIELGAGYFRFHT-NGGTLLDGAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATNWQLAT-I 152 (681) T ss_pred CCceEEEEEeCCeEEEEe-CCcEEeeCcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCceEEEE-E Confidence 689999999999999994 444544322 1346899999999999999999999999999999998887664321 1 Q ss_pred EEecCccccceeEEEEeeCCc--eEEEEEEeccCCCCccccccchhhhhhhhhhhheecccceEEEcCeEEEEEcCC-Cc Q lcl|NC_012662. 152 YFVKSGAFSKEYDISVVWSEG--SQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVGVDFAVRVGPYIYFELIT-GT 228 (780) Q Consensus 152 v~v~~g~y~~~y~vti~~~~~--~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g~~~~~~~g~~i~~~~~s-~~ 228 (780) ....+.+. ..+++...... ..+..+... ............+ .. T Consensus 153 -~f~~~p~~-p~~~~at~~~~~~~~t~~~~v~--------------------------------avda~t~~~s~~~~~~ 198 (681) T protein:vir:98 153 -AFTSPVAT-PTSVTATSNNKGTDYTYRYVVT--------------------------------ALDAEGKTESAPSSAG 198 (681) T ss_pred -Eecccccc-ceeeeeeccCCccceeEeEEEE--------------------------------EeecccceeecCCcce Confidence 11112211 00011000000 000000000 0000000000000 00 Q ss_pred eeEEEeecCCcceeEEEEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEecccceeEEcccceeE Q lcl|NC_012662. 229 DLKITSTSGSPYIGYSNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVTAISVDVPYK 308 (780) Q Consensus 229 ~~~vt~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~~~~~~~~~~~p~~ 308 (780) .......++ +....+...... ...+| +.....+.+.+..+...... ..+.. T Consensus 199 tvt~~~~~~------------------------~~~~t~~w~a~~-g~~~~-~V~~~~~gi~g~ig~~~~~~---~~~~~ 249 (681) T protein:vir:98 199 TCTNNLFTN------------------------GGANTIAWSASS-GASRY-NVYKEQGGLYGYIGQTTGTS---LVDDN 249 (681) T ss_pred EEeeeeecC------------------------CcceeEEEEecC-Cceee-eecccceeEEEEeeccceee---eeecc Confidence 000000000 001111111111 11111 22122222222222222221 11222 Q ss_pred EeeccccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEe----cCCeEEEEccCCccccccccccCCCCCccE Q lcl|NC_012662. 309 IVDDNVEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLL----SGAYVCMSATGEPDRFFRSTVSSLDPTDRI 384 (780) Q Consensus 309 l~~~~~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~----~~~~v~~S~~gd~~nF~~~t~~~~~ddD~i 384 (780) +.......++|.. ++|..+ ++||++|+||||||+|+ +||+|||||+||||||++++ +++||||| T Consensus 250 ~~~~~~~t~~~~~--------~~~~~~--~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~--~~~ddD~i 317 (681) T protein:vir:98 250 IAPDLSVTPPIYD--------AVFNAA--GDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSL--PVRDDDRV 317 (681) T ss_pred cccCccccccccc--------cccccC--CCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccC--CCCCCccE Confidence 2222222334543 333332 46899999999999999 58999999999999999988 46799999 Q ss_pred EEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecC-CCcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCceE Q lcl|NC_012662. 385 DIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSL-QQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFS 463 (780) Q Consensus 385 ~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~-~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~ 463 (780) +++++++++|.|+|+++++ +|+|||+++||.|+++ +++|||+|++++++|.|++ ++|+|+.+|++++|++++| + T Consensus 318 ~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~fv~~~g---~ 392 (681) T protein:vir:98 318 AFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTIYGAARG---G 392 (681) T ss_pred EEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeeeecc-ccccceeeCCeEEEEecCC---C Confidence 9999999999999999995 7999999999999864 5799999999999999976 5799999999999998876 4 Q ss_pred EEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeeeEeeccCCc Q lcl|NC_012662. 464 AVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAWHKWVFPYR 543 (780) Q Consensus 464 ~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~w~~~G~ 543 (780) +||||.|+ +.+|+|+++|+|++++|++++..+..++++++|..++||+++||+|++|+|+ +||+|.|||||+++|+ T Consensus 393 ~vre~~y~-~~~d~~~~~dlt~~a~Hl~~~~~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~---~eq~v~aW~~~~~~g~ 468 (681) T protein:vir:98 393 HVRELAYN-WQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFISSSGKLLGLTYV---PEQQIGAWHQHDTDGV 468 (681) T ss_pred EEEEEEEe-eecCceeccchhhhhhhhcCCCCeEEEEEecCCCEEEEEEecCCcEEEEEEe---cccceeeEEEEecCCc Confidence 79999998 6889999999999999999986677778888999999999999999999997 5788999999999999 Q ss_pred EEEEEEE----CCcEEEEEEEcCCCeEEEEEEeeeccCCcccccccceeeeeccceeeecCcceeEEeeccccCCCCeEE Q lcl|NC_012662. 544 VASLHFA----RDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKGIVPIYMRPWVSEGKLT 619 (780) Q Consensus 544 v~~~~~~----~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~l~g~~v 619 (780) |++||++ +|+||++|+|+++|..++|+|+|+...... .....|+||+.++.. .+...+.+++|+||+++ T Consensus 469 v~~v~~i~~~~~d~l~~vv~r~~~g~~~~yie~~~~~~~~~---~~~~~~vD~~~t~~~----~~~~~~sgl~~leG~tv 541 (681) T protein:vir:98 469 FESCAVVAEGNEDRLYAVVRRTIGGNEVRYVERMASRQFDA---QADAFFVDSGLTYSG----EPVSHISGLEHLEGKTV 541 (681) T ss_pred EEEEEEecCCCCcEEEEEEEecCCCCeEEEEEecCCccccc---cccceEeeccccccC----cceeeeccccCCCCcEE Confidence 9999998 578999999999999999999998765322 334579999987643 23445789999999999 Q ss_pred EEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCceeeecceEEEEEEEEEeccccE Q lcl|NC_012662. 620 GSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLISTAPVRLLRYELTTRNTGEF 699 (780) Q Consensus 620 ~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~~~r~~v~rv~v~~~~T~~~ 699 (780) .+++||.++++.++.+| .++++.++++|+|||+|+++++|+||+++.++|..++ ++++|+|+.|++++|.++ T Consensus 542 ~i~aDG~~~~~~~V~~G-------~itl~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g-~~~ri~rv~lr~~~S~g~ 613 (681) T protein:vir:98 542 SILADGAVHPQRVVTDG-------AIDLDVEAGTVHIGLPITAELQTLPVAMQLDGSFGQG-RVKNINKLWLRVHRSSGI 613 (681) T ss_pred EEEeCCeecCcEeecCc-------EEEeCcCCceEEEeeeceeEEEecceeeecCCcccCC-ceEEEEEEEEEEEcccce Confidence 99999999988876544 4557788999999999999999999999998886654 467999999999999999 Q ss_pred EEEecCCCCCcceecccCceecccccccCCcccccccceEEEEee-cccceeEEEEEECCCCCEEEEEEEEEEEEec Q lcl|NC_012662. 700 DVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCR-SNAQSTEMYLSTDGTQDMNILEIEYIIRYNQ 775 (780) Q Consensus 700 ~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~-~~~~~~~v~i~~~~P~P~tvlai~~eg~y~~ 775 (780) ++.++.+..+.. .. ...+.+|.+|.+.||++++|+. +|+++.+|+|+|++|+||+|+||+||....- T Consensus 614 ~~~~~~~~l~~~--~~-------~~~~~~g~~~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~vgg 681 (681) T protein:vir:98 614 FAGPHADALTEV--KQ-------RTSEPYGSPPALKSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIAIGA 681 (681) T ss_pred EEeeCCCceEEE--EE-------eccccccccCCccCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEEeeC Confidence 998876544321 11 1235678888889999999986 6799999999999999999999999999888 No 23 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=100.00 E-value=7.2e-158 Score=882.10 Aligned_cols=659 Identities=13% Similarity=0.102 Sum_probs=494.0 Q ss_pred Cceeeeeeechhhcc-----cccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCC Q lcl|NC_012662. 1 MARPFEGALNDLLQG-----VSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGA 75 (780) Q Consensus 1 Ma~~v~~~~~~l~~G-----vSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~ 75 (780) ||+ +.-..+||.+| +..|.|++||+++|++|+||++.|+||++|||||+||+++++.+...+++||++ + T Consensus 1 m~~-~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~-----~ 74 (681) T protein:vir:10 1 MSN-VRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTY-----S 74 (681) T ss_pred Ccc-eeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEe-----C Confidence 997 67788999999 557999999999999999999999999999999999999999887777777765 4 Q ss_pred CccEEEEEEcCcEEEEEeCCCcEEEecC----CCCccccCCcceEEEEEeCCEEEEecCCEeeeEeecccccCCCCcceE Q lcl|NC_012662. 76 DGRHLVINTNTGGWWLLDREAKNIVSEG----NLSYLLAADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTTGF 151 (780) Q Consensus 76 ~~~~y~l~~~~g~~~v~d~~~~~~~~~~----~~~y~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~g~ 151 (780) .+|+|+|+|++++||||. .++.++..+ ..+||+++++.+|+|+|+||+|||+|+++||+++.|.++..|.... + T Consensus 75 ~~~~~~l~~g~~~~r~~~-~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~W~l~~-~ 152 (681) T protein:vir:10 75 VTQTMVIELGAGYFRFHT-NGGTLLDGAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATNWQLAT-I 152 (681) T ss_pred CCceEEEEEeCCeEEEEe-CCcEEeeCcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCceEEEE-E Confidence 689999999999999994 444544322 1346899999999999999999999999999999998887664321 1 Q ss_pred EEecCccccceeEEEEeeCCc--eEEEEEEeccCCCCccccccchhhhhhhhhhhheecccceEEEcCeEEEEEcCC-Cc Q lcl|NC_012662. 152 YFVKSGAFSKEYDISVVWSEG--SQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVGVDFAVRVGPYIYFELIT-GT 228 (780) Q Consensus 152 v~v~~g~y~~~y~vti~~~~~--~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g~~~~~~~g~~i~~~~~s-~~ 228 (780) ....+.+. ..+++...... ..+..+... ............+ .. T Consensus 153 -~f~~~p~~-p~~~~at~~~~~~~~t~~~~v~--------------------------------avda~t~~~s~~~~~~ 198 (681) T protein:vir:10 153 -AFTSPVAT-PTSVTATSNNKGTDYTYRYVVT--------------------------------ALDAEGKTESAPSSAG 198 (681) T ss_pred -Eecccccc-ceeeeeeccCCccceeEeEEEE--------------------------------EeecccceeecCCcce Confidence 11112211 00011000000 000000000 0000000000000 00 Q ss_pred eeEEEeecCCcceeEEEEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEecccceeEEcccceeE Q lcl|NC_012662. 229 DLKITSTSGSPYIGYSNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVTAISVDVPYK 308 (780) Q Consensus 229 ~~~vt~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~~~~~~~~~~~p~~ 308 (780) .......++ +....+...... ...+| +.....+.+.+..+...... ..+.. T Consensus 199 tvt~~~~~~------------------------~~~~t~~w~a~~-g~~~~-~V~~~~~gi~g~ig~~~~~~---~~~~~ 249 (681) T protein:vir:10 199 TCTNNLFTN------------------------GGANTIAWSASS-GASRY-NVYKEQGGLYGYIGQTTGTS---LVDDN 249 (681) T ss_pred EEeeeeecC------------------------CcceeEEEEecC-Cceee-eecccceeEEEEeeccceee---eeecc Confidence 000000000 001111111111 11111 22122222222222222221 11222 Q ss_pred EeeccccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEe----cCCeEEEEccCCccccccccccCCCCCccE Q lcl|NC_012662. 309 IVDDNVEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLL----SGAYVCMSATGEPDRFFRSTVSSLDPTDRI 384 (780) Q Consensus 309 l~~~~~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~----~~~~v~~S~~gd~~nF~~~t~~~~~ddD~i 384 (780) +.......++|.. ++|..+ ++||++|+||||||+|+ +||+|||||+||||||++++ +++||||| T Consensus 250 ~~~~~~~t~~~~~--------~~~~~~--~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~--~~~ddD~i 317 (681) T protein:vir:10 250 IAPDLSVTPPIYD--------AVFNAA--GDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSL--PVRDDDRV 317 (681) T ss_pred cccCccccccccc--------cccccC--CCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccC--CCCCCccE Confidence 2222222334543 333332 46899999999999999 58999999999999999988 46799999 Q ss_pred EEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecC-CCcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCceE Q lcl|NC_012662. 385 DIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSL-QQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFS 463 (780) Q Consensus 385 ~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~-~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~ 463 (780) +++++++++|.|+|+++++ +|+|||+++||.|+++ +++|||+|++++++|.|++ ++|+|+.+|++++|++++| + T Consensus 318 ~~~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~fv~~~g---~ 392 (681) T protein:vir:10 318 AFRVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTIYGAARG---G 392 (681) T ss_pred EEEEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeeeecc-ccccceeeCCeEEEEecCC---C Confidence 9999999999999999995 7999999999999864 5799999999999999976 5799999999999998876 4 Q ss_pred EEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeeeEeeccCCc Q lcl|NC_012662. 464 AVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAWHKWVFPYR 543 (780) Q Consensus 464 ~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~w~~~G~ 543 (780) +||||.|+ +.+|+|+++|+|++++|++++..+..++++++|..++||+++||+|++|+|+ +||+|.|||||+++|+ T Consensus 393 ~vre~~y~-~~~d~~~~~dlt~~a~Hl~~~~~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~---~eq~v~aW~~~~~~g~ 468 (681) T protein:vir:10 393 HVRELAYN-WQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFISSSGKLLGLTYV---PEQQIGAWHQHDTDGV 468 (681) T ss_pred EEEEEEEe-eecCceeccchhhhhhhhcCCCCeEEEEEecCCCEEEEEEecCCcEEEEEEe---cccceeeEEEEecCCc Confidence 79999998 6889999999999999999986677778888999999999999999999997 5788999999999999 Q ss_pred EEEEEEE----CCcEEEEEEEcCCCeEEEEEEeeeccCCcccccccceeeeeccceeeecCcceeEEeeccccCCCCeEE Q lcl|NC_012662. 544 VASLHFA----RDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKGIVPIYMRPWVSEGKLT 619 (780) Q Consensus 544 v~~~~~~----~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~l~g~~v 619 (780) |++||++ +|+||++|+|+++|..++|+|+|+...... .....|+||+.++.. .+...+.+++|+||+++ T Consensus 469 v~~v~~i~~~~~d~l~~vv~r~~~g~~~~yie~~~~~~~~~---~~~~~~vD~~~t~~~----~~~~~~sgl~~leG~tv 541 (681) T protein:vir:10 469 FESCAVVAEGNEDRLYAVVRRTIGGNEVRYVERMASRQFDA---QADAFFVDSGLTYSG----EPVSHISGLEHLEGKTV 541 (681) T ss_pred EEEEEEecCCCCcEEEEEEEecCCCCeEEEEEecCCccccc---cccceEeeccccccC----cceeeeccccCCCCcEE Confidence 9999998 578999999999999999999998765322 334579999987643 23445789999999999 Q ss_pred EEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCceeeecceEEEEEEEEEeccccE Q lcl|NC_012662. 620 GSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLISTAPVRLLRYELTTRNTGEF 699 (780) Q Consensus 620 ~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~~~r~~v~rv~v~~~~T~~~ 699 (780) .+++||.++++.++.+| .++++.++++|+|||+|+++++|+||+++.++|..++ ++++|+|+.|++++|.++ T Consensus 542 ~i~aDG~~~~~~~V~~G-------~itl~~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g-~~~ri~rv~lr~~~S~g~ 613 (681) T protein:vir:10 542 SILADGAVHPQRVVTDG-------AIDLDVEAGTVHIGLPITAELQTLPVAMQLDGSFGQG-RVKNINKLWLRVHRSSGI 613 (681) T ss_pred EEEeCCeecCcEeecCc-------EEEeCcCCceEEEeeeceeEEEecceeeecCCcccCC-ceEEEEEEEEEEEcccce Confidence 99999999988876544 4557788999999999999999999999998886654 467999999999999999 Q ss_pred EEEecCCCCCcceecccCceecccccccCCcccccccceEEEEee-cccceeEEEEEECCCCCEEEEEEEEEEEEec Q lcl|NC_012662. 700 DVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCR-SNAQSTEMYLSTDGTQDMNILEIEYIIRYNQ 775 (780) Q Consensus 700 ~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~-~~~~~~~v~i~~~~P~P~tvlai~~eg~y~~ 775 (780) ++.++.+..+.. .. ...+.+|.+|.+.||++++|+. +|+++.+|+|+|++|+||+|+||+||....- T Consensus 614 ~~~~~~~~l~~~--~~-------~~~~~~g~~~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~vgg 681 (681) T protein:vir:10 614 FAGPHADALTEV--KQ-------RTSEPYGSPPALKSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIAIGA 681 (681) T ss_pred EEeeCCCceEEE--EE-------eccccccccCCccCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEEeeC Confidence 998876544321 11 1235678888889999999986 6799999999999999999999999999888 No 24 >protein:vir:1778 Length: 680 # NCBI annotation: tail protein A # Family: family:all:825 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570344;genbank:gi:18640503;genbank:GeneID:932716 Probab=100.00 E-value=2.3e-157 Score=879.34 Aligned_cols=535 Identities=20% Similarity=0.309 Sum_probs=458.4 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) || +|+|+||||++|||||||++|+|+||++|+||+|||+.||+||||++||+.+.. .+..+.+|+++|+++|||| T Consensus 1 M~-~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRpg~~~i~~l~~----~~~~~~~~~~~rd~~e~~~ 75 (680) T protein:vir:17 1 MA-AVEQMVPNLLGGISQQPDPLKLPGQVKQARNVQLDPTFGALKRPGTELIMQVTG----IPKRAKWIPIMRDAREHYY 75 (680) T ss_pred Cc-cceecchhhhCcceecchhhcCcchhhhhhccccCcCcccccCccceeeeeccC----CCCCceeEEEecCCCCeEE Confidence 99 699999999999999999999999999999999999999999999999998763 3567778999999999999 Q ss_pred EEEEcCc-------EEEEEeCCCcE-E--EecCCCC--c--cccCCcceEEEEEeCCEEEEecCCEeeeEeecccccCCC Q lcl|NC_012662. 81 VINTNTG-------GWWLLDREAKN-I--VSEGNLS--Y--LLAADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDP 146 (780) Q Consensus 81 ~l~~~~g-------~~~v~d~~~~~-~--~~~~~~~--y--~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~ 146 (780) +++..+| .|+|||..+|+ . .+.++.. | .++.+..+||+++++|++||+|+++.|++.+. ...+ T Consensus 76 ~~~~~~g~~~~~~~~i~v~d~~~G~~~~v~~~~~~~~~~~~~~~~~~~~lr~~tv~d~tfi~N~~v~~~~~~~---~~~~ 152 (680) T protein:vir:17 76 VAIYREGANESGDLRIRVFDLKAGVERAVSFVGGEVEEYFPGDETDWEAIRSLTIGDYTFLSNPNVQPTTWSR---SFSR 152 (680) T ss_pred EEEEcCCCcccccceeEEEEccCCeEEEEEcCCCceEEEeecCCCCccceEEEEEcCEEEEECCeEEEeccCC---CCCC Confidence 9999887 39999987653 3 3333221 2 23445569999999999999999999987654 3455 Q ss_pred CcceEEEecCccccceeEEEEeeCCceEE----------------------------------E---EE----------- Q lcl|NC_012662. 147 KTTGFYFVKSGAFSKEYDISVVWSEGSQT----------------------------------V---TY----------- 178 (780) Q Consensus 147 ~~~g~v~v~~g~y~~~y~vti~~~~~~~t----------------------------------~---t~----------- 178 (780) ...|+++|++++|+++|.|+++....+.. + .. T Consensus 153 ~~~g~~~v~~~ayg~ty~v~ing~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~Ag~~t~~~~~~a~la~~l~~~~~~~~~ 232 (680) T protein:vir:17 153 RPEGLVTIGAAGYGTSYIVDFATEDSGQQRRWAVQEMQAPKTKRKKGDGSPDEAGETTVNNWNGTGLSFRVKVEARAFLV 232 (680) T ss_pred CCeeEEEEEEeeeeeEEEEEEeccccceeeeeeeeeeeccccccccccccCCCCcceeeeeeeeeeeeeeeeeccceeee Confidence 67799999999999999999866321110 0 00 Q ss_pred -------------Eec----cC-----C-----------------------------------CCccccccchhhhhhhh Q lcl|NC_012662. 179 -------------TTP----DG-----T-----------------------------------TAGDADQSVPEAIARKL 201 (780) Q Consensus 179 -------------tt~----~~-----s-----------------------------------~~~~~~~~~~~~i~~~l 201 (780) .+. .+ . ....++++++++|+.+| T Consensus 233 ~~g~~~~~~y~~~~~l~~tg~~~~~~~~t~~v~~~G~~y~IsI~~~~~~~~~~~~~s~~~~t~~~~~a~~at~~~Ia~~L 312 (680) T protein:vir:17 233 DDGEEYGHNYIPYVTLLTPGNNTSPFPDTIRVDVSGEGWDIKVTKQIQSKVYANLGTAQFTTPVDQSGGGASTSDIVTGL 312 (680) T ss_pred cCCCceEEEEeeEEEEecCCccccccCceEEEecccceeEEEEccceeeEeccCccceeeeeccCCcccceeHHHHHHHH Confidence 000 00 0 00012335577788888 Q ss_pred hhhheecccceEEEcCeEEEEEcCCC---ceeEEEeecCCcceeEE-EEEeecceecccccccCCceEEEEeccCCCCCc Q lcl|NC_012662. 202 VEALIAVGVDFAVRVGPYIYFELITG---TDLKITSTSGSPYIGYS-NQSQVNLETDLPARLHPSADGALCAVGQSERAL 277 (780) Q Consensus 202 ~~~~~s~g~~~~~~~g~~i~~~~~s~---~~~~vt~~~g~~~~~~~-~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~ 277 (780) ...+.+.+.....+.|++|++...+. ..+.+++.+|..+..+. ..++|+++++||++||+|+.++|.+++++..++ T Consensus 313 ~~~i~~~~~~~~~~~g~~i~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~a~~g~~v~v~~~~~~~~~~ 392 (680) T protein:vir:17 313 SAAINGLGTFTAESIGNVIRVRYSDPTRTDEFTMSARGGTSGTGLESIKYSVDTLAELPTKCWNDYQVAVRNTQDTEVDD 392 (680) T ss_pred HHhhcccCcEEEEECCCEEEEEeccCCCceEEEeeccCCCCceeeeeeeeeeccccccccccCCCcEEEEEeCCCCcccc Confidence 87776655555678899999975432 34678888998887764 567899999999999999999999999999999 Q ss_pred eEEEEEec--------CccEEEEecccceeEEc-ccceeEEeeccc-----c-------ccccchhhcCCcccCCCcccc Q lcl|NC_012662. 278 VWYRYSSE--------KGVWLESGDYNSVTAIS-VDVPYKIVDDNV-----E-------QHIMEGRLAGDDLTNPAPTFL 336 (780) Q Consensus 278 ~y~~~~~~--------~g~w~e~~~~~~~~~~~-~~~p~~l~~~~~-----~-------~~~w~~~~~gd~~t~~~psf~ 336 (780) ||++|+.. .+.|+||++++...+++ ++|||+|++... + .+.|++|.+||+++||+|+|+ T Consensus 393 Yyv~~~~~~~~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~r~~~g~f~~~~~~~~~~~~~~~~r~~Gdd~tnp~psF~ 472 (680) T protein:vir:17 393 YYVKFETDVEDADVPGSGYWVETVKNGDDGGLVDDTMPHVLVRNALGDFTFSSLNNSSYGKTWADRSVGSEDTNPHPTFT 472 (680) T ss_pred eEEEEeccCcccCcccccceeecccCcccceeccCcceEEEEEccCceeEEEeeccccccccccccccCCcccCCCcccc Confidence 99999863 45899999999999887 589999997542 2 335999999999999999998 Q ss_pred c-CCCceEEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEE Q lcl|NC_012662. 337 E-ERRITGIGTFQGRLVLLSGAYVCMSATGEPDRFFRSTVSSLDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQA 415 (780) Q Consensus 337 ~-~~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~t~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~ 415 (780) + +++|++|+||||||+|++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++|| T Consensus 473 ~~G~~p~~v~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~t~g~q~ 552 (680) T protein:vir:17 473 ESGNGIYGMFMYKNRLGFLTQDAVIMSQVGDYFNFYATSGVTISDADPIDMATSDTKPVKLEAAISSTSGAILFGNQAQF 552 (680) T ss_pred cCCCCceEEEEEcceEEEeeCCeEEEEccCCcccccccccccCCCCccEEEEEcCCcceeeeEEeecCCcEEEEecCeEE Confidence 6 679999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEecCCCcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCceEEEEEeeeeccccCceehhhHHHHHHHhcCCCc Q lcl|NC_012662. 416 VVPSLQQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEA 495 (780) Q Consensus 416 ~i~~~~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~ 495 (780) +|+|++++|||+|++++++|+|+|+++|+|+.+|+.++|++++| ++++||||.|+ +.+|+|+++|||+|++|||+|++ T Consensus 553 ~ls~~~~~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g-~~s~vre~~y~-~~~d~y~a~DlT~~a~hl~~g~v 630 (680) T protein:vir:17 553 RLSSPDESFGPKTATLDKISNYTYESKADPVQTGVSMIFPTNMG-TYSSVYELSTE-SAKGTPVIEDSSRVIPRLIPSGL 630 (680) T ss_pred EEecCCceecceeEEEEEEEeecccCCCCceEeCCeEEEeecCC-CcceEEEEeee-eccCceehhhHHHHHHHhcCCce Confidence 99986679999999999999999999999999999999998887 57899999997 78999999999999999999999 Q ss_pred EEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeeeEeeccCCcEE Q lcl|NC_012662. 496 RFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAWHKWVFPYRVA 545 (780) Q Consensus 496 ~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~~w~~~G~v~ 545 (780) +++++++++|.+++|++++||+|++|+|+++++||+|+|||||+|+|.=+ T Consensus 631 ~~~~~~~~~~~~~~~~~~~~~~l~~~~yl~~~~e~~v~aW~rw~~~~~d~ 680 (680) T protein:vir:17 631 TWSTASMNNDTVFFGNAKKGRNVYVFRFFNEGQERKVAGWTTWYYEDQDH 680 (680) T ss_pred EEEEeeCCCCeEEEEEEcCCCEEEEEEEeeCCCceEEEEEEEEecCCCCC Confidence 99999999999999999999999999999999999999999999986544 No 25 >protein:vir:102644 Length: 594 # NCBI annotation: Hypothetical protein # Family: family:all:780 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024422;genbank:gi:48696643;genbank:GeneID:2948111 Probab=100.00 E-value=1.3e-136 Score=765.58 Aligned_cols=557 Identities=11% Similarity=0.065 Sum_probs=442.7 Q ss_pred Cceeeeeeechhhcc-----cccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCC Q lcl|NC_012662. 1 MARPFEGALNDLLQG-----VSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGA 75 (780) Q Consensus 1 Ma~~v~~~~~~l~~G-----vSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~ 75 (780) ||+ +.| .||.+| +..|.|++||+++|++|+||++.|+||++||||++|++++++..+..+++||.+ + T Consensus 1 m~~-~~~--~~F~~GelsP~l~~r~Dl~~y~~~~~~~~n~~~~~~G~~~rR~G~~~~~~~~~~~~~~~lipF~~-----s 72 (594) T protein:vir:10 1 MAD-FSQ--TSFKGGVIAPRLQFNEYESAYHHSIEDAVNFVVTEQGSLITRCGSEEVGLCQDGEVRLFRLPAVD-----A 72 (594) T ss_pred Cce-eec--cccCcceecceeccchhHHHHHHHHhhhhceEEEecCCeecCChhHhhhhccCCCCCEEEEEEEe-----C Confidence 997 556 899999 446999999999999999999999999999999999999998887778888765 4 Q ss_pred CccEEEEEEcCcEEEEEeCCCcEEEecCCCC-----cccc---CCcceEEEEEeCCEEEEecCCEeeeEeecccccCCCC Q lcl|NC_012662. 76 DGRHLVINTNTGGWWLLDREAKNIVSEGNLS-----YLLA---ADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPK 147 (780) Q Consensus 76 ~~~~y~l~~~~g~~~v~d~~~~~~~~~~~~~-----y~~~---~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~ 147 (780) .+++|+++++++++|+|...+..+...++.+ |++. +++.+|+|+|++|++|++|++++|+++.|..+..|. T Consensus 73 ~~~~~~le~g~~~~r~~~~~~~~v~~~~~~~~~~~tp~~~t~~~~l~~i~~tqsad~~~~~~~~~~p~~L~R~~~~~w~- 151 (594) T protein:vir:10 73 PSNDVIVEVGNTNIAVWVNDVRQVVANTPSEWRNTIDRIQTAYDTIGDDAGAANTGRLIMVHPALQPKRLYRDNNNAWQ- 151 (594) T ss_pred CCCeEEEEEcCCeEEEEecCcEEEEccCCCcccccccceeeccCCccceEEEEEeeEEEEEcCCCCceEEEEccCCCce- Confidence 7899999999999999965544344444433 2332 457899999999999999999999876654322110 Q ss_pred cceEEEecCccccceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheecccceEEEcCeEEEEEcCCC Q lcl|NC_012662. 148 TTGFYFVKSGAFSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVGVDFAVRVGPYIYFELITG 227 (780) Q Consensus 148 ~~g~v~v~~g~y~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g~~~~~~~g~~i~~~~~s~ 227 (780) +.. T Consensus 152 ---~~~-------------------------------------------------------------------------- 154 (594) T protein:vir:10 152 ---FVN-------------------------------------------------------------------------- 154 (594) T ss_pred ---EEe-------------------------------------------------------------------------- Confidence 000 Q ss_pred ceeEEEeecCCcceeEEEEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEecccceeEEccccee Q lcl|NC_012662. 228 TDLKITSTSGSPYIGYSNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVTAISVDVPY 307 (780) Q Consensus 228 ~~~~vt~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~~~~~~~~~~~p~ 307 (780) + + T Consensus 155 ---------------~-------~-------------------------------------------------------- 156 (594) T protein:vir:10 155 ---------------M-------H-------------------------------------------------------- 156 (594) T ss_pred ---------------c-------c-------------------------------------------------------- Confidence 0 0 Q ss_pred EEeeccccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEec----CCeEEEEccCCccccccccccCCCCCcc Q lcl|NC_012662. 308 KIVDDNVEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLS----GAYVCMSATGEPDRFFRSTVSSLDPTDR 383 (780) Q Consensus 308 ~l~~~~~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~----~~~v~~S~~gd~~nF~~~t~~~~~ddD~ 383 (780) |. ++++.+.+.++|++|+||||||+|++ |++|||||+||||||+++++ ..|||| T Consensus 157 -----------~~---------~~p~~~~~~~~p~~v~f~q~RL~f~~~~~~p~~v~~Srtgd~~nF~~~~~--~~ddd~ 214 (594) T protein:vir:10 157 -----------TG---------AVPAEWSPSNYPQTVGIFQNRVWYVGSPVHRTYFWATRAGKLEDIAPSTA--NNPNDP 214 (594) T ss_pred -----------cC---------cccccccCCccceEEEEEeeeEEEEeCCCCCceEEEEecccccccccCCC--CCCCcc Confidence 00 00000111357889999999999998 57899999999999999985 469999 Q ss_pred EEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCC-CcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCce Q lcl|NC_012662. 384 IDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQ-QLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAF 462 (780) Q Consensus 384 i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~-~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~ 462 (780) |++.+ +++.+.| |++++.++|+|||+++||+|++++ .+|||+|++++++|.+ +++.|+|+.+|+.++|++++| T Consensus 215 i~~~~-s~~~~~~-~~v~~~~~L~i~t~~~e~~l~~~~~~~lTp~~~~~~~~s~~-g~~~~~P~~vg~~~~fv~~~g--- 288 (594) T protein:vir:10 215 ISFVG-IMEGTPC-WIIASSDVLTIGTTINDYQLAASTGVSVTAATAILRRSSVQ-GTAAVQGIPAEEQVIFCSRNK--- 288 (594) T ss_pred EEEEE-ecccceE-EEEecCCceEEEecCceEEEecCCCcccccceEEEEEeeee-ccCCCcceeeCCeEEEEcCCC--- Confidence 99954 4565555 567788899999999999998753 5899999999999865 678999999999999998766 Q ss_pred EEEEEeeeeccccCceehhhHHHHHHHhcC------CCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCceeeeeeE Q lcl|NC_012662. 463 SAVLELVPSQYTSSQYVSQDVTTHIPRYIE------GEARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKVHQAWH 536 (780) Q Consensus 463 ~~v~e~~~~~~~~~~~~~~dls~~~~h~~~------g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~v~aW~ 536 (780) ++||||+|+ +.+++|+++|||+|++|||+ +..+..++++++|..++||+++||.|++++|+ +||+|.||| T Consensus 289 ~~vre~~y~-~~~d~y~~~dlt~~a~hl~~~~~~~~~~~i~~~a~~~~p~~~~~~v~~dG~l~~~ty~---~eq~v~aWs 364 (594) T protein:vir:10 289 SKVYAMNYV-REQDNWIPDEMSSQAQHLFTPISSAKGASVRRVAYISDAAKSLWVVLENGQINYCCFD---RTTDTKAWT 364 (594) T ss_pred CEEEEEEEe-eccCceeccchhhhhhhhcCccccccCceEEEEEEecCCceEEEEEeCCCeEEEEEEe---cccceeeeE Confidence 579999998 67899999999999999984 45556677888888899999999999999996 688899999 Q ss_pred eec-cCCcEEEEEEE----CCcEEEEEEE--cCCCeEEEE--EEeeeccCCcccccccceeeeeccceeeecCcceeEEe Q lcl|NC_012662. 537 KWV-FPYRVASLHFA----RDRVVLFAAD--DAGSTDKIT--ISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKGIVPI 607 (780) Q Consensus 537 ~w~-~~G~v~~~~~~----~d~l~~vv~R--~~~g~~~~~--~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~ 607 (780) ||+ ++|+|++||++ +|++|++|+| +++|..++| ||+|+.....+. ....|+|+...+. .. T Consensus 365 ~~~~t~G~v~~va~i~~~~~d~l~~~V~R~~ti~g~~~~y~~lE~~~~~~~~~~---~~~~~~d~~~~~~--------~~ 433 (594) T protein:vir:10 365 QLELSGGKVIDIAAAFNPDSDYAYVAVVRSKAINGVQKNYTVLEKISSPRTDWK---RADGWVVAQVNQN--------GD 433 (594) T ss_pred eeccCCCcEEEEEEeecCCCCEEEEEEEECCccccceeeEEEeecCCCcccccc---ccceeeeeccccc--------ce Confidence 998 58999999998 5899999999 568999987 999988765443 3346788876542 12 Q ss_pred eccccCCCCeEEEEEecCccccceeccccccccceEEE--cCCCCCCEEEEeEeeeEEEEcCceEEecCCCceeeecceE Q lcl|NC_012662. 608 YMRPWVSEGKLTGSVATGALASEEVAIDVDEVSWEFTV--EPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLISTAPVR 685 (780) Q Consensus 608 ~~~~~~l~g~~v~~~adG~~~~~~~~~~~~~~~~~~~i--~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~~~r~~ 685 (780) ..+++||||+++.+++||..+++..+.+ +.+++ .++.++++|||||+|+++++++||++++++|+.++. |+| T Consensus 434 vsgl~hLeg~tv~v~aDG~~~~~~~V~~-----g~itL~~~~~~~~~~v~VGl~Y~s~i~~lp~~~~~~~gs~~g~-r~r 507 (594) T protein:vir:10 434 VLNLDRYIGRTAVIFSKYGLEAEVEVNN-----IGLTHRINGYDPNTVYYVGYKMDSYFRTLTPSNGDMKKSMFGS-KIR 507 (594) T ss_pred eecccccCCceEEEEeCCeecCCeEEcC-----CeeEeeccCCCCcceEEEeeeeeEEEEeecccccCCcccccCc-cEE Confidence 3589999999999999999998877544 33444 356789999999999999999999999998876655 889 Q ss_pred EEEEEEEEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccc--eEEEEeecccceeEEEEEECCCCCEE Q lcl|NC_012662. 686 LLRYELTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLS--RVPVPCRSNAQSTEMYLSTDGTQDMN 763 (780) Q Consensus 686 v~rv~v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg--~~~vp~~~~~~~~~v~i~~~~P~P~t 763 (780) |+|++|+|++|.+++++.+........... ........+ .|...+| .+.+++.||+++.+|+|+|++|+||| T Consensus 508 i~r~~v~~~~S~g~~vg~~~~~~r~~~~~~-----~~~~~~~~g-~~~~~tg~~~v~~~~~G~~~~~~i~I~qd~PlPlt 581 (594) T protein:vir:10 508 ISKVQLALFDSIEPTVNGEPADDRSTDDIM-----DARLLDFSS-NSGSSNGTRLVDYNPLGWENDGKMVIAVEQPFLCE 581 (594) T ss_pred EEEEEEEEEcceeeEECCcccccccchhhc-----cccCCcccC-cccccCCceEEEEccCCcCcccEEEEEECCCcCEE Confidence 999999999999999876543322111111 111122233 2333455 45566789999999999999999999 Q ss_pred EEEEEEEEEEecc Q lcl|NC_012662. 764 ILEIEYIIRYNQR 776 (780) Q Consensus 764 vlai~~eg~y~~r 776 (780) |+||.+|...+.= T Consensus 582 vlai~~ev~~~~~ 594 (594) T protein:vir:10 582 VVGVFSVVQSNKV 594 (594) T ss_pred EEEEEEEEEeccC Confidence 9999999999998 No 26 >protein:vir:94602 Length: 1012 # NCBI annotation: PfWMP4_35 # Family: family:all:12083 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762665;genbank:gi:115304373;genbank:GeneID:5142302 Probab=99.51 E-value=1.8e-12 Score=84.98 Aligned_cols=722 Identities=14% Similarity=0.138 Sum_probs=318.9 Q ss_pred Ccee----eeeeechhhcc--cccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecC Q lcl|NC_012662. 1 MARP----FEGALNDLLQG--VSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERG 74 (780) Q Consensus 1 Ma~~----v~~~~~~l~~G--vSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~ 74 (780) |-.. ++|-..+-.+| +|.-|-..-|.+ ---..||=++..|-+.||.|++.+-.-......-+ .+.+.-|. T Consensus 1 mtqQQ~~eiqG~~t~~F~GL~~s~S~~~IP~~~-SP~~~N~DV~~~G~V~rR~GT~l~~~Y~inn~s~~---~~s~~irt 76 (1012) T protein:vir:94 1 MTQQQATEIQGPFTREFSGLDISNSVGAIPVSG-SPVFHNCDVSDDGAVVRRRGTALVNTYNINNASGR---AWSDTIRT 76 (1012) T ss_pred CCccccccccccccccccccccccccccccccC-CCceEEeecccCcceeehhhhhhhhhhcccccCcc---eeeeeehh Confidence 5421 11222222333 333332222222 23468999999999999999999854332222222 22333455 Q ss_pred CCccEEEEEEcCcEEEEEeCCCcEEEe-------cCCCCccccCCcceEEEEEe---CCEEEEecCCEeeeEe---eccc Q lcl|NC_012662. 75 ADGRHLVINTNTGGWWLLDREAKNIVS-------EGNLSYLLAADRRSIQTTSM---GGVTYILNTEKRPSAT---TDNS 141 (780) Q Consensus 75 ~~~~~y~l~~~~g~~~v~d~~~~~~~~-------~~~~~y~~~~~~~~l~~~q~---aD~~fi~~~~~~p~~~---~~~~ 141 (780) .-+..|++..+++++.+--...++.+. -...+-|+ -.+++.-|+.+ -|-+.|..+++||..+ .+.. T Consensus 77 ~LG~eYfiLs~~~GLL~~~~~~~~AVG~~K~~a~V~~ss~~~-V~Pssm~F~~~S~~~~R~LILT~~~~~VQ~~F~E~T~ 155 (1012) T protein:vir:94 77 KLGSEYFILSNDVGLLISLMRDDEAVGMPKEVAVVSKSSIWT-VPPSSMCFIPVSAPYDRLLILTPEHPIVQLSFLERTL 155 (1012) T ss_pred hccceeEEEecCCceEEEeeecccccccchhhhhhhhhhccc-cCCcceEEEeccCCCCcEEEEcCCCceEEEEEeeeee Confidence 567778777777776543222222110 00111111 12334444443 4567787777777432 1221 Q ss_pred ccCC-CCcceEEEecC----------------------ccccceeEEEEeeCCceE--EEEEEeccCC------------ Q lcl|NC_012662. 142 DKKD-PKTTGFYFVKS----------------------GAFSKEYDISVVWSEGSQ--TVTYTTPDGT------------ 184 (780) Q Consensus 142 ~~~~-~~~~g~v~v~~----------------------g~y~~~y~vti~~~~~~~--t~t~tt~~~s------------ 184 (780) .... ..+.+.++--. ..-+..|.++++..+-+. +..+..+..- T Consensus 156 s~T~~t~~~~~V~~~~a~~~~~~~~L~~~~N~sS~~~~~~~~T~~AmT~~NP~~S~~ls~~~V~~qtytltirqi~W~WW 235 (1012) T protein:vir:94 156 SFTCTTNHGGGVFSFTAPISVNDTTLWRDTNASSYIVTDAAGTVYAMTQKNPDFSFRLSGSFVVGQTYTLTIRQITWQWW 235 (1012) T ss_pred eeeccCCccceEeecccceeecCeeEEecccccceeEeeccceEEEEEeeCCceeEEEEEEEecCcccceeehhhhhhhh Confidence 1110 01111111000 111333444443322221 1111111000 Q ss_pred ---------------CCccccccc-hhhhhhhhhhhhe------------------------------------------ Q lcl|NC_012662. 185 ---------------TAGDADQSV-PEAIARKLVEALI------------------------------------------ 206 (780) Q Consensus 185 ---------------~~~~~~~~~-~~~i~~~l~~~~~------------------------------------------ 206 (780) +-+....++ ...|...|...+. T Consensus 236 AESm~~~G~~~~~~~SRFNV~~~DQ~V~IP~~L~tDiD~v~~~~~~~~l~~~~ss~F~~~~~~~~T~~P~~AD~YG~~~G 315 (1012) T protein:vir:94 236 AESMYYEGQDMMQNTSRFNVTSIDQNVKIPDRLITDIDPVYKNSQGLGLFVFWSSRFDSNGWAGPTTSPNTADEYGFSGG 315 (1012) T ss_pred hhhHhhhhhHHHhhhhhcccccccccccchhHHhhhhhhhhhccCCccEEEEEeeeecCceeecCCCCCCCcccccccCC Confidence 000000000 0001111100000 Q ss_pred --------------ecccceEEEcC----------e--------EEEEEcCC---CceeEEEeec--------------- Q lcl|NC_012662. 207 --------------AVGVDFAVRVG----------P--------YIYFELIT---GTDLKITSTS--------------- 236 (780) Q Consensus 207 --------------s~g~~~~~~~g----------~--------~i~~~~~s---~~~~~vt~~~--------------- 236 (780) .+.+.+ ..-| + -+.+.+.. +.++.+..+. T Consensus 316 ~~~tpp~~~~~A~L~~aPFF-~TFG~~~s~TP~P~~~V~iLR~RELRFN~G~GA~~~~L~V~~D~~~~t~Nnvpfspsnf 394 (1012) T protein:vir:94 316 GRFTPPSLVPGATLQAAPFF-ITFGGIYSGTPTPINQVNILRLRELRFNGGTGAKPDDLQVYNDTVEHTWNNVPFSPSNF 394 (1012) T ss_pred ceeccccccccceeeccceE-EEeccccCCCCCChhheeeeeeeeeeeccCCCCCCcceEEEEcceeeeccccccCcccc Confidence 000000 0000 0 00111111 1122221110 Q ss_pred --------C--CcceeEEE----------EEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEeccc Q lcl|NC_012662. 237 --------G--SPYIGYSN----------QSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYN 296 (780) Q Consensus 237 --------g--~~~~~~~~----------~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~~ 296 (780) + -.+..+++ ..-...+.+||+++|-|-...-..+-..+.-..|.+....+|.... ..|. T Consensus 395 qt~atT~~~T~R~~~L~~A~G~~~~~A~Y~A~~GATnnlpanaPL~IS~~sA~s~~~~~R~v~~~~~~T~~~~~~-G~Y~ 473 (1012) T protein:vir:94 395 QTWATTYTATDRVITLMSAVGDRFNNANYFAILGATNNLPANAPLHISCLSASSYLGGSRRVWYRNLPTTGGTLD-GCYV 473 (1012) T ss_pred cceeeeeeecceeEEEeeeccccccCcceEEEeecccccccCCccccccccceeeeccceeeeeeccccCCceEe-eeEE Confidence 0 00111111 1112345567777765533211100000001112111111111100 0011 Q ss_pred ceeEEcccceeEEeeccccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEec----CCeEEEEccCC------ Q lcl|NC_012662. 297 SVTAISVDVPYKIVDDNVEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLS----GAYVCMSATGE------ 366 (780) Q Consensus 297 ~~~~~~~~~p~~l~~~~~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~----~~~v~~S~~gd------ 366 (780) ...++ -.|.+ +..+..|.--+.||.||++.+ +..+.+|.+|| T Consensus 474 r~YGi---------------G~~~~-------------Y~~~~F~~I~TiY~~RLiL~~~s~~~~~~~~S~~GD~~~~G~ 525 (1012) T protein:vir:94 474 RAYGI---------------GKYVD-------------YSKRSFHAIGTIYRDRLILVNPSTATDQLLISEIGDATVPGE 525 (1012) T ss_pred EEEEe---------------eeeee-------------cCCccccceeeeeeeeeEEeccCCCcceEEEeecCCcccCce Confidence 11111 11222 111223445579999999997 45699999887 Q ss_pred ccccccc-cccCCCCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEEEEeeecCCCCCCc Q lcl|NC_012662. 367 PDRFFRS-TVSSLDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSDLACNAFVAP 445 (780) Q Consensus 367 ~~nF~~~-t~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~~~s~~~~~~~~~P 445 (780) ++||+.- ..+.-.|.||+++.+++.-.+.|..++...+.|++||..+-|.+.|++ .++|..-.+..+|+|+.-+.--- T Consensus 526 ~Y~F~QiTD~L~G~~tDPF~L~VtSe~~e~iT~~~~WQ~~LFV~T~~~T~~~~GGe-~~~~s~~~VN~vSt~G~~N~~~V 604 (1012) T protein:vir:94 526 FYQFMQITDMLQGVTTDPFTLNVTSEGRERITAVTGWQKRLFVFTGSNTYSIEGGE-QFGESSYAVNLVSTYGAFNQNCV 604 (1012) T ss_pred eeeeeeeehhhccCcCCceeEEEcccccceeeeeeeeceeEEEEeccceEeecccc-ccchhHHHHHhHHhhcccCcceE Confidence 8999854 457778999999999997788899999999999999999999999865 59999999999999965444445 Q ss_pred EEeCCeEEEEEecCCceEEEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEee Q lcl|NC_012662. 446 VTTSQTLMYPAPRSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHF 525 (780) Q Consensus 446 v~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~ 525 (780) |+..-.|+|..+- +++++.... ++++|.+.+-|+.+..+|..-. .++-++...+....+.+.||+ =|+ T Consensus 605 V~T~~~V~Ym~~~-----G~F~L~~k~-~~~~Y~A~ErSvKIR~~F~~~~----~ss~~~~~Wl~~~e~~~~LYi--~L~ 672 (1012) T protein:vir:94 605 VVTNLTVLYMNKF-----GLFDLMNKP-NTDSYGAFERSVKIRGLFQNLA----GSSGDNLHWLRYNESSNKLYI--GLA 672 (1012) T ss_pred EEeeeEEEEeecc-----ceeeccCCc-cCCcchhhhhhhhhhhhhhhhc----cccccceeeeeeccCCceEEE--Eec Confidence 6778888898653 589988874 6799999999999999986421 222222222222223333332 122 Q ss_pred cCCcee-----------eeeeEeeccCCcEEEEEEEC---CcEEEEEEEcCCC------eEEEEEEe------------- Q lcl|NC_012662. 526 TSQGKV-----------HQAWHKWVFPYRVASLHFAR---DRVVLFAADDAGS------TDKITIST------------- 572 (780) Q Consensus 526 ~~~e~~-----------v~aW~~w~~~G~v~~~~~~~---d~l~~vv~R~~~g------~~~~~~e~------------- 572 (780) .+.|.- -.+|+.-+..|.|.---.+. ..-|+..-.-... ...+|++- T Consensus 673 ~~~dT~~~S~~~~~N~~~DSWs~~~s~~~Fq~YP~V~~~~~~t~L~~i~~~~TV~ML~~~~~~YiDFatirthiypF~~C 752 (1012) T protein:vir:94 673 AEGDTRTTSRNLMLNFTWDSWSTLSSAAPFQMYPAVQLFKYMTWLTNINAPLTVAMLATEMPFYIDFATIRTHIYPFTFC 752 (1012) T ss_pred CCCcchhhhhhhhhhhhhcchhhhhccCCcccchhhhhhhhhhhhhhhcCchhhhhhhhccceeeeeehhcccccceeee Confidence 111111 13677777777655432221 1111111000000 00122221 Q ss_pred --------eeccCCccccccccee-eeecc---------ceee---ecCcceeEEeeccccCC------------CCeEE Q lcl|NC_012662. 573 --------IDPKQGGVTFDVDRLP-HLDSM---------SIVP---VNDGKGIVPIYMRPWVS------------EGKLT 619 (780) Q Consensus 573 --------~~~~~~~~~~~~~~~~-~lD~~---------~~~~---~~~~~~~~~~~~~~~~l------------~g~~v 619 (780) |....+--...++..| -+|-. .+|. ...++.+ -++..++.. .++++ T Consensus 753 aG~~~~~Vms~~~GIY~~~~P~tP~I~~~tit~ss~~~~k~Yq~~T~~~GT~t-Lt~~~~~~~~~~~l~LL~~~~~~~~~ 831 (1012) T protein:vir:94 753 AGQRDVSVMSDSRGIYNLPLPVTPGILDYTITASSKAGAKTYQRNTASAGTET-LTLRNPMMDYADTLELLGGNVNASQF 831 (1012) T ss_pred ccceeeEEEecCCceEEecccccceeeeeEeeccchhhhheecccccccccee-eeecChhhhcCcEEEEecCCCCccEE Confidence 0000000000000000 01100 0010 0011110 111222222 33444 Q ss_pred EEEecCccccceec----------cccccccce-EEEcCCCC-CCEEEEeEeeeEEEEcCceEEecCCCceeeecce-EE Q lcl|NC_012662. 620 GSVATGALASEEVA----------IDVDEVSWE-FTVEPGFK-DSQIYLGFRYESLFAPTPPMLKDQNDTLISTAPV-RL 686 (780) Q Consensus 620 ~~~adG~~~~~~~~----------~~~~~~~~~-~~i~~~~~-~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~~~r~-~v 686 (780) ++.......+.-.. +.+.+.++. ++...-.. ...+.+|.-|.+.++-+-+.+ . + -+|| || T Consensus 832 a~V~~~~~~~~TT~~TV~~N~~~~lQ~T~~~GS~L~~~~~LsqN~~~~~G~~Y~S~Y~SP~F~L-~----S--L~~LKr~ 904 (1012) T protein:vir:94 832 AMVMSNGFEPYTTYPTVTYNGVAPLQWTVTGGSGLNNRPILSQNNNCIMGMIYPSVYASPIFDL-E----S--LGRLKRL 904 (1012) T ss_pred EEEeecccccccccceEEecceeeeeEEEecCCccccccccccCceEEEeecchhhhcchhhhh-h----h--hhhhhhe Confidence 43322211111110 111111111 11110011 346889999999888554433 1 1 1343 57 Q ss_pred EEEEEEEecc--ccEEEEecCCCCCcceecccCceecccccccCCccccc------------ccceEEEEeecccceeEE Q lcl|NC_012662. 687 LRYELTTRNT--GEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVS------------DLSRVPVPCRSNAQSTEM 752 (780) Q Consensus 687 ~rv~v~~~~T--~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~------------~tg~~~vp~~~~~~~~~v 752 (780) .++.|-|.-+ ..++.++..+....... .+-.+.--.++.-+..|.. ..-+..+|+.|..-+.++ T Consensus 905 K~~~L~~Dttvtsqlkynltsgfsqvsvl--ntawvavvsnynenivpavvsyqvgnsyeirrvvelsiplqgygcdyqf 982 (1012) T protein:vir:94 905 KKLHLQMDTTVTSQLKYNLTSGFSQVSVL--NTAWVAVVSNYNENIVPAVVSYQVGNSYEIRRVVELSIPLQGYGCDYQF 982 (1012) T ss_pred eeeeEEeeeeeeeeeeeehhcccceeeee--cceeeeeeeccCccccceeeeeecCCceeeeEEEEEeecccccccceeE Confidence 7777666553 45555554443321111 1111100011111111111 112445788888889999 Q ss_pred EEEECCCCCEEEEEEEEEEEEecceecC Q lcl|NC_012662. 753 YLSTDGTQDMNILEIEYIIRYNQRRRRV 780 (780) Q Consensus 753 ~i~~~~P~P~tvlai~~eg~y~~r~rrv 780 (780) .|.+-..-.+.+-+.+++.+=-+-.|-| T Consensus 983 yiasvgaeafklaayefdiqpqrdkryv 1010 (1012) T protein:vir:94 983 YIASVGAEAFKLAAYEFDIQPQRDKRYV 1010 (1012) T ss_pred eeeeccccceeeeeeeeccccchhhhhc Confidence 9999999999999999887644333322 No 27 >protein:vir:2625 Length: 715 # NCBI annotation: gp27 # Family: family:all:5234 # MgeID: mge:55 # MgeName: SIO1 # Cross-refs: genbank:acc:NP_064766;genbank:gi:9964636;genbank:GeneID:1263056 Probab=99.48 E-value=7.1e-12 Score=81.69 Aligned_cols=630 Identities=13% Similarity=0.104 Sum_probs=289.2 Q ss_pred Cceeeeee-echhhcccccCCchHhhh-hhhhhhhcceeeecCCceeCCccee-----eeeecccccCCCceeEEEEEec Q lcl|NC_012662. 1 MARPFEGA-LNDLLQGVSQQVPRERVA-GQCSAQVNMLSDPVTGIRRRPGSLF-----VSVHDFGPIGEGDALYTQYLER 73 (780) Q Consensus 1 Ma~~v~~~-~~~l~~GvSqq~D~~Ry~-~q~~~~~N~~~~p~gGl~rRpGt~f-----va~~~~~~~~~~~~~~~~~~~r 73 (780) |+++++|. .-.|++|.=--.-++-|| +..-.-+||.....|--+||-|+-| +......+ +....++.... T Consensus 1 m~~~~~~~~vNtFv~GliTEas~ltfpqnasiDe~N~~l~rdG~r~RR~g~~~E~~~vls~~~vp~---galv~~~~W~n 77 (715) T protein:vir:26 1 MPQSLTQRTVNTFIKGLITEASELTFPENASVDELNCSLGRDGTRRRRKAVTLEDNHVLSDVVVPE---GALVQTLDWYN 77 (715) T ss_pred CCcccchhHHhhhhhheeeccccccCCccceeeeeeeeecCCCcchhhccceeecceEEEEEeecC---ceeeeeechhh Confidence 99988764 567899933333333443 4555678999999999999999854 33333232 22233333222 Q ss_pred --CCCccEEEEEEcCcEEEEEeCCCcEEE-----ecCCCCc-cccCC---cc-eEEEEEeCCEEEEecCCEeeeEeeccc Q lcl|NC_012662. 74 --GADGRHLVINTNTGGWWLLDREAKNIV-----SEGNLSY-LLAAD---RR-SIQTTSMGGVTYILNTEKRPSATTDNS 141 (780) Q Consensus 74 --~~~~~~y~l~~~~g~~~v~d~~~~~~~-----~~~~~~y-~~~~~---~~-~l~~~q~aD~~fi~~~~~~p~~~~~~~ 141 (780) ++-+..+.++..+.-+.++.+.+-.+. .+-...+ +...+ .. .++++.+..++.|+||..-|-...-+. T Consensus 78 a~G~v~~~~livqvg~~l~f~q~t~~pLs~~n~~~svdl~~~~~~vn~SPsh~~v~v~~~~G~livanp~i~~~~~~~d~ 157 (715) T protein:vir:26 78 VAGQVNLEFLVVQVNNILYFYEKSTDPLSANKYSGSVDLNTHSASNNLSPSEERVQVTSLNGYLIVASPAINTFYLGFNT 157 (715) T ss_pred cccccCcEEEEEEeccEEEEEeccCCccccCceeEEeeecceecccccccceeEEEEEEeeeEEEEecCCccEEEEEecC Confidence 245566677766666777765542221 1111111 11111 12 588888999999999987776543333 Q ss_pred ccCCCCcceEEEecCccccceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheecccceEEE------ Q lcl|NC_012662. 142 DKKDPKTTGFYFVKSGAFSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVGVDFAVR------ 215 (780) Q Consensus 142 ~~~~~~~~g~v~v~~g~y~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g~~~~~~------ 215 (780) +.....+.- +-+| +..+-+++.+..++..| ++.....+++.|.. +. +.+|... T Consensus 158 ~t~s~t~~~-ll~r------~r~f~~qg~d~~~g~~y-------~~~gt~~tn~~iyn-ly------N~gw~~p~gt~~~ 216 (715) T protein:vir:26 158 STEAFTATS-ISFK------ERDFEWQGSDVDVTSLY-------FGEGTSVSNQRIYD-TY------NVGWVGPKGSAAL 216 (715) T ss_pred CcceeEeeE-EEEE------eeeheeecccccccccc-------ccCCcccCchhhee-cc------cceeecceeEEEE Confidence 211100000 1111 11112222221111111 11222223322211 11 1122211 Q ss_pred --cCeEEEEEcCCCceeEEEeecCCcceeEEEEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEe Q lcl|NC_012662. 216 --VGPYIYFELITGTDLKITSTSGSPYIGYSNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESG 293 (780) Q Consensus 216 --~g~~i~~~~~s~~~~~vt~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~ 293 (780) .++++. +++ .+.......-+++ ......|.|+- T Consensus 217 N~~~~yiV--ypa----------------~s~~~~S~kd~n~---------------------------afsk~ad~ei~ 251 (715) T protein:vir:26 217 NTYGSYIV--YPA----------------LTHPWYSGKDANG---------------------------AFNKADWLEIY 251 (715) T ss_pred cCCCCceE--ecc----------------cccccCCCccccc---------------------------ccChhhccccc Confidence 111111 000 0000000000000 00001122221 Q ss_pred c-----ccceeEEcccceeEEeeccccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEec------CCeEEEE Q lcl|NC_012662. 294 D-----YNSVTAISVDVPYKIVDDNVEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLS------GAYVCMS 362 (780) Q Consensus 294 ~-----~~~~~~~~~~~p~~l~~~~~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~------~~~v~~S 362 (780) - +..-...++. . +. ..|.-.-+....+++|+.|.+|.+|++ +..|.+| T Consensus 252 tGt~~~~~G~yi~D~~----------~----~g-------~~~leeev~k~R~rsv~~yaGrV~yagiD~dkng~rilfS 310 (715) T protein:vir:26 252 TGSSLASNGHYVLDVF----------N----KA-------RTGLTTEVETGRFRSVAAYAGRVFYAGIDSAKNGGKVYFS 310 (715) T ss_pred cccccccCceEEEeee----------e----cC-------CccchhhhhcCCCcceeeecceEEEeecccccCCCeEEEe Confidence 1 0000111100 0 00 000111122345678999999999994 4479999 Q ss_pred ccCC--------ccccccccc--cCCCCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEE Q lcl|NC_012662. 363 ATGE--------PDRFFRSTV--SSLDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVV 432 (780) Q Consensus 363 ~~gd--------~~nF~~~t~--~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~ 432 (780) |.=+ |.+=.|++. ..+.|.|...+.+-+. -.|.-|+.++..|+||...+-|+|.|.+...|.++..+. T Consensus 311 qLv~s~~di~nCyQd~DPTsee~~dLidTDGg~iri~ga--h~ii~Lv~f~~sLlvf~~NGVWAi~G~d~g~tATdY~lt 388 (715) T protein:vir:26 311 RLTERMSDVGNCYQVNDPTSEVLSDLLDTDGGVVRIPDA--HNIRKLHVLGASLLVFAENGVWAVAGVDNVFRATEYAIT 388 (715) T ss_pred hhhcchhhcccccccCCCchhhhhhhhhcCCCEEEecCC--CCceeEEEecceEEEEEecceEEEeccCCceeeeeeEEE Confidence 8643 344444432 2466889999988663 335668999999999999999999886778999999999 Q ss_pred EEeeecCCCCCCcEEeCCeEEEEEecCCceEEEEEeeeeccccCceehhhHH-HHHHHhcCCCc---EE--EEEeCCCCe Q lcl|NC_012662. 433 LTSDLACNAFVAPVTTSQTLMYPAPRSEAFSAVLELVPSQYTSSQYVSQDVT-THIPRYIEGEA---RF--MQSASAANI 506 (780) Q Consensus 433 ~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls-~~~~h~~~g~~---~~--~~~~~~~~~ 506 (780) +++..+|++.=.=+++|+.++|-.++| |+.+..++ .-+-+.++.|| ..+..|.+.=. +. ...+-+-+. T Consensus 389 KIs~vg~sspnSvVvv~~~i~~WsdtG-----Iyal~~Nd-~fn~~tAqNLTekTIq~~~~~I~~dk~knVtg~fd~~e~ 462 (715) T protein:vir:26 389 RISDVGLSNENSFVVADGIPIWWGKTG-----IYAVQQSE-NLNTPTAQNLSLSTIQTLWNNISNAKKAQVTVEYDKINQ 462 (715) T ss_pred EeeeeccCCCccEEEecceEEEeeCCc-----EEEEEecc-ccCcchhhccchHHHHHHHhhcchhhhcceEEEEEccCC Confidence 999999998777789999999997775 88888764 34568999999 77887764322 11 123333344 Q ss_pred EEEEEEcCCCEEEEEEEeecC-CceeeeeeEeeccC---Cc----EEEEEE--------------------E-CCcEEEE Q lcl|NC_012662. 507 VLMATTGDNRQVIAHEYHFTS-QGKVHQAWHKWVFP---YR----VASLHF--------------------A-RDRVVLF 557 (780) Q Consensus 507 ~~~~~~~~~g~l~~~ty~~~~-~e~~v~aW~~w~~~---G~----v~~~~~--------------------~-~d~l~~v 557 (780) -+.|..-+..++.-|+|.+.- =+-...|+-+|..+ |. |-++.. . ++.+..+ T Consensus 463 rVyW~yPn~dt~vdykyd~vLV~dLalgaFYp~~v~~~a~~~~~~ig~~~~~~~~~~~t~~~vv~~~~v~~~g~~~~v~~ 542 (715) T protein:vir:26 463 RVFWFYPDNDESVDYKYNNILVMDLALQAFYPWRVEDEASSTSYIIGTSYYGGLGSTSTETQVVNGADVVVNGSDNVVAT 542 (715) T ss_pred EEEEEEcCCceeeceeecCeEEEEecccccccccccccccccceeeeeeeeCCcccccchhheeccceEEEeccceEEEE Confidence 456666555566555552200 01112355555432 21 111110 0 0111112 Q ss_pred EEEc--CCCeEEEEEEeeeccCCcccccccceeeeeccceeeecCcceeEEeeccccCCCCeEEEEEecCccccceeccc Q lcl|NC_012662. 558 AADD--AGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKGIVPIYMRPWVSEGKLTGSVATGALASEEVAID 635 (780) Q Consensus 558 v~R~--~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~l~g~~v~~~adG~~~~~~~~~~ 635 (780) ..|+ +....-+.+||+.... ..+...+-+...+..+.+.. . .+... T Consensus 543 ~~r~~~~~~~~~~~~~~~~~~~------------------------~~~f~~~~~~~~~dw~s~d~----~----~~~~~ 590 (715) T protein:vir:26 543 LYRDYLEGDSEIKLLVRDGTTG------------------------KMTFATFRGDTYLDWGSADY----K----SFAEA 590 (715) T ss_pred eecccccccceEEEEEEcCCce------------------------eEEEecccCceeeeccccch----h----hHHHh Confidence 2221 1111112223221100 00111111111111111100 0 00000 Q ss_pred cccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCceeeecceEEEEEEEEEeccccEEEEecCCCCCcceecc Q lcl|NC_012662. 636 VDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLISTAPVRLLRYELTTRNTGEFDVRIVDPTIGLDYSNS 715 (780) Q Consensus 636 ~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~~~r~~v~rv~v~~~~T~~~~v~v~~~~~~~~~~~~ 715 (780) |....++++-.-..|.-+.|. +++ -.=++-+..|=.-.+..--+-.+.++..+++. ..+..|. T Consensus 591 gy~~~gd~~~~k~~pyvt~~~------~~t-edg~v~~~~g~~p~n~sSclm~~sw~ws~s~s----------t~~eaYk 653 (715) T protein:vir:26 591 GYDFMGDITTFKNAPYVTTYM------RVT-EDGYVASGAGYEFINPSSCLMSVSWNLSKSGS----------TPREIYK 653 (715) T ss_pred hhhhcccceeeecCceEEEEE------EEe-cccceeccCCccccCCcceEEEEEeeeccCCC----------Chhhhhe Confidence 111111111100111001110 000 00011111110000001111122223222211 1111222 Q ss_pred cCceecccccccCCcccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecce Q lcl|NC_012662. 716 KTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQSTEMYLSTDGTQDMNILEIEYIIRYNQRR 777 (780) Q Consensus 716 ~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlai~~eg~y~~r~ 777 (780) +-.++..-+++.-.+--+.++-+-+..++|..+-.+++|.+...-.|+|++.+.-|--|+.+ T Consensus 654 ~~~~~~~~p~~~s~~~yp~~~VvTKsriRG~Gr~~~~rf~s~~gKdlhl~Gysilg~~~~~~ 715 (715) T protein:vir:26 654 LKDVPVVNPNDLSSINYPTDTVVTKSKVRGRGRSMKFRFESVAGKDFHLVGYEVIGAKNNSY 715 (715) T ss_pred ecceeeeCCCccccccCCcceeEeeeeeeccceEEEEEEEecCCcceEEEeEEEEecccCCC Confidence 22211111111111001112222334678999999999999999999999999999988877 No 28 >protein:vir:8837 Length: 513 # NCBI annotation: constituent protein # Family: family:all:4957 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775245;genbank:gi:27476043;genbank:GeneID:2700591 Probab=98.84 E-value=3.1e-08 Score=61.76 Aligned_cols=483 Identities=10% Similarity=0.020 Sum_probs=228.2 Q ss_pred CceeeeeeechhhcccccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecCCCccEE Q lcl|NC_012662. 1 MARPFEGALNDLLQGVSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERGADGRHL 80 (780) Q Consensus 1 Ma~~v~~~~~~l~~GvSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~~~~~~y 80 (780) || .--+.+..-.|.|-.+.+.+-=.++-..+.|+++. .|++.||||-.=+.+.- +....-.+.++. .+..+ T Consensus 1 ~~-~~~~~~~~~~g~~~d~~p~~lp~~a~s~~~N~~~~-~~~~~~~~g~~pv~a~~------~~~~~g~~~~~~-~g~~~ 71 (513) T protein:vir:88 1 MA-LERQEVKNPTGIVTDIAPADLPLDKWSFGNNVRFK-NGKAQKALGHSPIFDTA------QAPILDMFPFIR-NNIPY 71 (513) T ss_pred CC-cCChhhcccccceeccChhhcCCCcceeeeeeeEe-cceeeecCccceeeecC------CCCceeeeeeec-CCCeE Confidence 88 44555555555566555444333444677777765 68889999998774321 111111122232 34445 Q ss_pred EEEEcCcEEEEEeCCCcEEEecCCCCccccCCcceEEEEEeCCEEEEecCCEeeeEeecccccCCCCcceEEEecCcccc Q lcl|NC_012662. 81 VINTNTGGWWLLDREAKNIVSEGNLSYLLAADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTTGFYFVKSGAFS 160 (780) Q Consensus 81 ~l~~~~g~~~v~d~~~~~~~~~~~~~y~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~g~v~v~~g~y~ 160 (780) +++.+...++.++..+ ....++ .+ +++..-..-++++-+|+++++|...+|+..- T Consensus 72 ~~~~~~~~~~~~~~~t-~~dvs~-~~-~~~~~~~~w~~~~f~~~i~a~ng~~~~q~~~---------------------- 126 (513) T protein:vir:88 72 WLLCSEKRLYLADGTT-IIDVSP-GP-YSASVTNRWSVGSFNGVIFANDGVNPPHHLP---------------------- 126 (513) T ss_pred EEEeeceEEEEecCce-eeeccc-cc-eeecccCceeeeeecCEEEEEcCCCcceEEc---------------------- Confidence 5555555565555332 222222 22 2332223345566666666655443333210 Q ss_pred ceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheecccceEEEcCeEEEEEcCCCceeEEEeecCCcc Q lcl|NC_012662. 161 KEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVGVDFAVRVGPYIYFELITGTDLKITSTSGSPY 240 (780) Q Consensus 161 ~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g~~~~~~~g~~i~~~~~s~~~~~vt~~~g~~~ 240 (780) +. T Consensus 127 ----------------------------------------------------------------------------~~-- 128 (513) T protein:vir:88 127 ----------------------------------------------------------------------------PT-- 128 (513) T ss_pred ----------------------------------------------------------------------------CC-- Confidence 00 Q ss_pred eeEEEEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEecccceeEEcccceeEEeeccccccccc Q lcl|NC_012662. 241 IGYSNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVTAISVDVPYKIVDDNVEQHIME 320 (780) Q Consensus 241 ~~~~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~~~~~~~~~~~p~~l~~~~~~~~~w~ 320 (780) ...+++||. T Consensus 129 --------s~~f~dl~g--------------------------------------------------------------- 137 (513) T protein:vir:88 129 --------ESVFRVLPN--------------------------------------------------------------- 137 (513) T ss_pred --------CceeeeccC--------------------------------------------------------------- Confidence 000111110 Q ss_pred hhhcCCcccCCCcccccCCCceEEEEEcceEEEec--------CCeEEEEccCCccc----cccccccCCCCCccEEEEE Q lcl|NC_012662. 321 GRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLS--------GAYVCMSATGEPDR----FFRSTVSSLDPTDRIDIAS 388 (780) Q Consensus 321 ~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~--------~~~v~~S~~gd~~n----F~~~t~~~~~ddD~i~~~~ 388 (780) +|+ ...-..|.+|++||++++ |+.|+.|..+|... |..+. ...+.+=.++ T Consensus 138 -----------~p~---~~~a~~i~v~~~flv~~~~t~~~~~~PnrV~wS~~~D~~~~P~~W~~t~--~t~~a~~~~l-- 199 (513) T protein:vir:88 138 -----------FPA---NTTFRRLKSFKNFLIGLNVTSNSIEMPQMVWWSTSADAGGVPASWDPTD--PTKDAGQNTL-- 199 (513) T ss_pred -----------CCc---ccceEEEEEEeeEEEEeecccCcCCCCceEEEecccCCccccccccccc--ccCccccccc-- Confidence 000 001234667999999864 67899999999643 42221 1122222233 Q ss_pred cCCcceeEEEEeecCCcEEEEecCcEEEEe-cCCCcccccceEEEEEeeecCCCCCC-cEEeCCeEEEEEecCCceEEEE Q lcl|NC_012662. 389 GSAQNSVFRQALQFNKDLILLGDSTQAVVP-SLQQLLAPDNASVVLTSDLACNAFVA-PVTTSQTLMYPAPRSEAFSAVL 466 (780) Q Consensus 389 ~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~-~~~~~ltP~~~~~~~~s~~~~~~~~~-Pv~vg~~v~f~~~~g~~~~~v~ 466 (780) .+....|...++....|+||++.+-|.++ .+ +|....+++...-.|...-+ =+.+|+.++|+.++| ++ T Consensus 200 -~d~~g~~v~g~~~g~~liif~e~~i~~m~y~g----~~~if~~~~i~~~~G~~~p~SI~~~~~~~ffls~~G-----f~ 269 (513) T protein:vir:88 200 -ADTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG----GLYIFQFQQLFNDVGILGPNCAIEFDGNHFVVGHGD-----VY 269 (513) T ss_pred -CCCccceeeeeecccceEEEecccEEEEEecC----CCceEEEEeecccccccCCceeEEECCeEEEEeCCc-----eE Confidence 33344455567777899999999999996 32 24455666655444443222 366999999998865 54 Q ss_pred EeeeeccccCceehhhHHHHHHHhc-----CCCcEEEEEeCCCCe-EEEEEEcC------C--CEEEEEEEeecCCceee Q lcl|NC_012662. 467 ELVPSQYTSSQYVSQDVTTHIPRYI-----EGEARFMQSASAANI-VLMATTGD------N--RQVIAHEYHFTSQGKVH 532 (780) Q Consensus 467 e~~~~~~~~~~~~~~dls~~~~h~~-----~g~~~~~~~~~~~~~-~~~~~~~~------~--g~l~~~ty~~~~~e~~v 532 (780) .. .+-+.. ..+. ..+++.| .....++...-.+.. .+.|+-.+ . ..+++|-|+ .+ T Consensus 270 ~~--~G~~~~-~Ig~---ekVdk~f~~~~n~~~~~~~~~~~d~~~~~v~~~y~s~~~~~~~~~~~~lVYd~~----~~-- 337 (513) T protein:vir:88 270 VH--NGVQKQ-SVID---AQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSEPGKHCDRAIIWNWK----EN-- 337 (513) T ss_pred Ee--cCceee-eccc---chhhhhhhccCCcccceEEEEEEcCcccEEEEEecCCCCCCCcccceEEEEEcc----CC-- Confidence 22 221111 1111 1222222 122233333333322 33333111 1 245666663 22 Q ss_pred eeeEeeccCCcEEEEEEECCcEEEEEEEcCCCeEEEEEEeeeccCCcccccccceeeeeccceeeecCcceeEEeecccc Q lcl|NC_012662. 533 QAWHKWVFPYRVASLHFARDRVVLFAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKGIVPIYMRPW 612 (780) Q Consensus 533 ~aW~~w~~~G~v~~~~~~~d~l~~vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~ 612 (780) .|+.-+.+..+-.+-...+.+.........+ . .|........+ .+. T Consensus 338 -~Ws~~~~p~~~~g~~g~~~~~~~~~~~~~~~-------------------~-----~d~~~~~~~~~---------~~~ 383 (513) T protein:vir:88 338 -TWSIRDLPNVLSGAYGIIDPKTSNLWDDDSN-------------------P-----WDTDTSVWGEG---------SYN 383 (513) T ss_pred -eEEEEeccchhhcccccccccccceeccccc-------------------c-----cccchhhhhcc---------ccc Confidence 4765554443222211111111111110000 0 01100000000 000 Q ss_pred CCCC-eEEEEEecCccccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCceeeecc-eEEEEEE Q lcl|NC_012662. 613 VSEG-KLTGSVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLISTAP-VRLLRYE 690 (780) Q Consensus 613 ~l~g-~~v~~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~~~r-~~v~rv~ 690 (780) .... ....-.++|.+. ..+. ..-.-|-++++.++...+.+.+ + ++ ++|+++. T Consensus 384 ~~~~sl~~~~~~~~~~~----~fd~---------------~~~f~G~~lea~~~t~~~~~~~--~-----~~~~~i~~v~ 437 (513) T protein:vir:88 384 PAKSSMIFTSFQDAKLF----LFGE---------------TSTFSGQSFTSTLERSDIYLGD--D-----RMMKTVSAVI 437 (513) T ss_pred cccceeEeeeccCCcee----eecc---------------cccccCCceEEEEEecCccccC--c-----hhheeeeeee Confidence 0000 001111222211 0110 0114578888888877665532 1 23 3577777 Q ss_pred EEEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEEEEeecccceeEEEEEECCCCCEEEEEEEEE Q lcl|NC_012662. 691 LTTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQSTEMYLSTDGTQDMNILEIEYI 770 (780) Q Consensus 691 v~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~~~~v~i~~~~P~P~tvlai~~e 770 (780) ..+...+.+.+.+.............++. .....+...++++...+..+++|+...--|+++.++++| T Consensus 438 ~~~t~~g~~t~~vg~~~~~~~~~~~s~~~------------~~~~~~~~~~~~r~~gRy~~~ri~i~~~~~w~~~G~~ve 505 (513) T protein:vir:88 438 PHITGNGVCNIWVGNAQVQGSGIRWKGPY------------PYRIGQDYKIDTKHVGRYIALKFDFASAGDWYFNGYTLE 505 (513) T ss_pred eeeecceEEEEEEeeeccCccccccccce------------eeecccCceEEeccCCceEEEEEEccCCCceEEeeEEEE Confidence 77777777766665444332221111110 111122344677778888889999988999999999998 Q ss_pred EEEecceec Q lcl|NC_012662. 771 IRYNQRRRR 779 (780) Q Consensus 771 g~y~~r~rr 779 (780) ..--. .|| T Consensus 506 ~~~~~-g~R 513 (513) T protein:vir:88 506 MAPKA-GMR 513 (513) T ss_pred EecCC-CCC Confidence 87521 333 No 29 >protein:vir:80177 Length: 1027 # NCBI annotation: tail tubular protein B # Family: family:all:12083 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285795;genbank:gi:148747829;genbank:GeneID:5220453 Probab=98.69 E-value=1.1e-07 Score=58.72 Aligned_cols=727 Identities=15% Similarity=0.128 Sum_probs=280.5 Q ss_pred Cceeeee---------eechhhcc--cccCCchHhhhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEE Q lcl|NC_012662. 1 MARPFEG---------ALNDLLQG--VSQQVPRERVAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQ 69 (780) Q Consensus 1 Ma~~v~~---------~~~~l~~G--vSqq~D~~Ry~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~ 69 (780) |-.+.+. -..+-.+| +|.-|=..-|.+ ---..|+=++..|-+.||.|++.+-.-.. .-+. +. T Consensus 1 mvnsferrtQQ~~dlG~~s~~F~GL~~t~S~~~IP~~~-SP~~~N~DV~~~G~V~kR~GT~i~~~Y~~---t~~~---~t 73 (1027) T protein:vir:80 1 MVNSFERRTQQGDDLGIRSSNFGGLNTTASPLNIPYED-SPNLLNVDVDVSGNVSKRQGTEILLKYAN---TTPV---YT 73 (1027) T ss_pred CCcchhhhhccccccccccccccccccccccccccccC-CCceEEeecccCcceeehhhhhhhhhhcc---CCce---ee Confidence 4322211 11111222 222222222221 23467999999999999999999854321 1122 22 Q ss_pred EEecCCCccEEEEEEcCcEE-EEEeCCCcEEEe------cCCCCccccCCcceEEEEEeCCEEEEecCCEeeeEe---ec Q lcl|NC_012662. 70 YLERGADGRHLVINTNTGGW-WLLDREAKNIVS------EGNLSYLLAADRRSIQTTSMGGVTYILNTEKRPSAT---TD 139 (780) Q Consensus 70 ~~~r~~~~~~y~l~~~~g~~-~v~d~~~~~~~~------~~~~~y~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~---~~ 139 (780) +.-|.--+..|+|- +++++ .+.-.++..+.. -...+-|+-- +--+-.--.-|-+.|..+.+||... .+ T Consensus 74 ~~vks~LG~dYvLt-~~~GLL~~~~~~~~AVG~~K~~s~V~~aa~~~V~-P~F~~~S~~~~R~LILT~~~~~VQ~~F~E~ 151 (1027) T protein:vir:80 74 FPVKSVLGYDYVLT-KSGGLLEVAGVIGKAVGAYKSFSNVFSAAAANVK-PYFTLLSDVEPRVLILTGTNTPVQVKFVEQ 151 (1027) T ss_pred eeehhhccceeeEe-cCCceEEEeeecccccccchhhhhhhhhhhcccC-ceeEEccCCCCcEEEEcCCCceEEEEEeee Confidence 23344456667544 44444 333222222210 0000001000 0001112245667777777776432 22 Q ss_pred ccccCC-CCcceEEEecCccc----------------------cceeEEEEee-CCce--EEEEEEeccCC--------- Q lcl|NC_012662. 140 NSDKKD-PKTTGFYFVKSGAF----------------------SKEYDISVVW-SEGS--QTVTYTTPDGT--------- 184 (780) Q Consensus 140 ~~~~~~-~~~~g~v~v~~g~y----------------------~~~y~vti~~-~~~~--~t~t~tt~~~s--------- 184 (780) +..... ..+.+.+.-....+ +..|.++++. .+-+ .+..+..+.=+ T Consensus 152 T~t~T~~s~~~~~V~~~~s~~~~~~~~L~~~~N~tS~~~~~~~~T~~AlT~~NlP~~S~~mt~~~V~~~W~WWAESl~~~ 231 (1027) T protein:vir:80 152 TFTTTSGSPTTTVVIPNASRFQYDTPILYMNRNFTSGATYSYNSTTRALTISNLPSWSGSMTFDLVLPVWSWWAESLRWF 231 (1027) T ss_pred eeeeeccCCccceEeecccceeecCeeEEecccccceeEeeccceEEEEEeccCCcceeEEEEeEEecchhhhhhHHhhh Confidence 221111 11112222111111 1222222211 0001 11111111000 Q ss_pred --------CCccccccc-hhhhhhhhhhhheecccceEEEcCeEEEEEcCCCce--eEEEee-cCCcceeEEE--EEeec Q lcl|NC_012662. 185 --------TAGDADQSV-PEAIARKLVEALIAVGVDFAVRVGPYIYFELITGTD--LKITST-SGSPYIGYSN--QSQVN 250 (780) Q Consensus 185 --------~~~~~~~~~-~~~i~~~l~~~~~s~g~~~~~~~g~~i~~~~~s~~~--~~vt~~-~g~~~~~~~~--~~~v~ 250 (780) +-+....++ ...|...|...+...-. ..+..+-.++..+.-+.+ +.-+.. ..+.-.+|.. ..++. T Consensus 232 G~~~~~~~SRFNV~~~DQ~V~IP~~L~sDlD~i~~-~~~~~~m~~~~ta~F~~~~~~~~T~~P~~AD~YG~~~G~~~~~~ 310 (1027) T protein:vir:80 232 GDRFYDAVSRFNVNKADQSVAIPAALRSDLDTIQG-TYGRYPMLLYKTATFNDTYTFSNTGQPANADSYGWGDGSVYNVG 310 (1027) T ss_pred hhHHHhhhhhcccccccccccchhHHhhhhhhhhh-ccCCccEEEEEeeeecCceeecCCCCCCCcccccccCCceEeec Confidence 000000000 00122222211111000 001111122222111111 111100 0001111111 00110 Q ss_pred cee---cccccccC----------CceEEEE---------eccCCCCCc------------eEEEEEecCccEEEEeccc Q lcl|NC_012662. 251 LET---DLPARLHP----------SADGALC---------AVGQSERAL------------VWYRYSSEKGVWLESGDYN 296 (780) Q Consensus 251 ~~~---~l~~~~~~----------~~~~~v~---------~~~~~~~~~------------~y~~~~~~~g~w~e~~~~~ 296 (780) ..+ .-|.-+.- -..+.+. .++.++.+- |--++...+.++.-..+.| T Consensus 311 ~~A~L~~sPFF~TFG~~~t~TP~P~~~V~lLR~RELRFN~G~GA~~~~L~V~~D~~~~s~N~ssT~~~T~R~~~L~~A~G 390 (1027) T protein:vir:80 311 ASAYLNTSPFFATFGDTRTPTPQPPETVHLLRQRELRFNYGNGATGANLRVTVDGTALSANYSSTVAGTNRAYALYKADG 390 (1027) T ss_pred ccceeeccceEEEeccccCCCCCchhheeeeeeeeeeeccCCCCCCcceEEEEcceeeeeeeeeeeeecceeEEEeeecc Confidence 000 00100000 0011111 011111110 0001111111111111111 Q ss_pred ce----------eEEcccceeEEeeccc-------cccccchhhcCCcccCC--------------CcccccCCCceEEE Q lcl|NC_012662. 297 SV----------TAISVDVPYKIVDDNV-------EQHIMEGRLAGDDLTNP--------------APTFLEERRITGIG 345 (780) Q Consensus 297 ~~----------~~~~~~~p~~l~~~~~-------~~~~w~~~~~gd~~t~~--------------~psf~~~~~~~~v~ 345 (780) .. +.+.+..|..+--.+. ...-|....- .++. |-.+..+..|.--+ T Consensus 391 ~~~~~A~dlayY~A~~GATPL~IS~~aA~t~~~~~R~yi~~~~~~---T~~~~~~G~Y~k~YGlG~~~~Y~~~~F~~I~T 467 (1027) T protein:vir:80 391 TLCTSASDLAYYIAFTGATPLGISPTAAVTITNVDRTYIGSAATQ---TDNAYVQGGYFKVYGLGLWANYGTGQFPRIAT 467 (1027) T ss_pred ccccccccceeeeeeeccccccccccceeeeecCceeeeeeeccc---cCCceEeeeEEEEEEeeeeeecCCccccceee Confidence 11 1111111222211100 0111222111 0111 11111123466668 Q ss_pred EEcceEEEec----CCeEEEEccCC------ccccccc-cccCCCCCccEEEEEcCCc-ceeEEEEeecCCcEEEEecCc Q lcl|NC_012662. 346 TFQGRLVLLS----GAYVCMSATGE------PDRFFRS-TVSSLDPTDRIDIASGSAQ-NSVFRQALQFNKDLILLGDST 413 (780) Q Consensus 346 ~~q~RL~f~~----~~~v~~S~~gd------~~nF~~~-t~~~~~ddD~i~~~~~~~~-~~~i~~~v~~~~~L~l~t~~~ 413 (780) .||.||++.+ +..+.+|.+|| ++||+.- ..+.-.|.||+++.++++| .+.|.-++...+.|++||..+ T Consensus 468 vY~~RLvL~~~t~~~~~~~~S~~GD~~~~G~~Y~F~QvTD~L~G~~sDPF~L~VsSsq~~d~vT~~~~WQ~~LFV~T~~~ 547 (1027) T protein:vir:80 468 VYQSRLVLGGFTNDPTRVVFSATGDTVEGGVKYNFFQVTDDLDGLDSDPFDLVVSSSQADDYVTGLVEWQSSLFVLTRRA 547 (1027) T ss_pred eeeeeeEEeccCCCcceEEEeecCCcccCceeeeeeeeehhhccCcCCceeEEEecccccceeeeeeeeceeEEEEecce Confidence 9999999997 45699999987 8999854 4577789999999999866 555888899999999999999 Q ss_pred EEEEecCCCcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCceEEEEEeeeeccccCceehhhHHHHHHHhcCC Q lcl|NC_012662. 414 QAVVPSLQQLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEG 493 (780) Q Consensus 414 q~~i~~~~~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g 493 (780) -|.+.|++..++|..-.+..+|+|+.-+.-.-|+....|+|..+- +++++.... +++.|.+.+-|+.+..+|.. T Consensus 548 T~~~~GGd~t~~~a~~~VN~iSs~G~~N~~~VV~T~~~V~Yl~~~-----G~F~L~~r~-~~~~Y~A~EkSiKIR~~F~~ 621 (1027) T protein:vir:80 548 TFRANGGDATISPARRFVNYISSLGLVNPFSVVRTDTAVFYLSDS-----GVFNLTPRV-EDGEYQAIEKSIKIRKVFGK 621 (1027) T ss_pred eEEeecCccccchhHHHHHHHHhhcccCcceEEEeeeEEEEeecc-----ceeeccCCc-cCCcchhhhhhhhhhhhhhh Confidence 999999777799999999999999654545557778888998653 589988874 67999999999999999864 Q ss_pred CcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeecCCcee-----------eeeeEeeccCCcEEEE---EEE---CCcEEE Q lcl|NC_012662. 494 EARFMQSASAANIVLMATTGDNRQVIAHEYHFTSQGKV-----------HQAWHKWVFPYRVASL---HFA---RDRVVL 556 (780) Q Consensus 494 ~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~~~e~~-----------v~aW~~w~~~G~v~~~---~~~---~d~l~~ 556 (780) -. .++-.+...+....+.+.||+ =|+.+.|.- -.+|+.-++.|.|.-- -.+ ...-|+ T Consensus 622 ~~----~ta~~~~~Wm~~~q~~~~LYv--~L~~~~eT~~~S~~~~~N~~~DSWt~~~t~~~Fk~YtghP~V~~~~~~s~L 695 (1027) T protein:vir:80 622 TT----STAVSSAAWMSFDQNRKVLYV--ALPRGSETTVASALYVYNTFRDSWTQYDTLGGFKTYTGHPYVDTVLGDSFL 695 (1027) T ss_pred hc----cccccceeeeeeccCCceEEE--EecCCCcchhhhhhhhhhhhhcchhhhhcccCcccccCCchhhhhhhhhhh Confidence 22 111222211222222233322 122222111 1368777776655433 111 111111 Q ss_pred EEEEcCCCeE-------EEEEEeeeccCCcccc----c-----ccceee------eecc---------cee--------- Q lcl|NC_012662. 557 FAADDAGSTD-------KITISTIDPKQGGVTF----D-----VDRLPH------LDSM---------SIV--------- 596 (780) Q Consensus 557 vv~R~~~g~~-------~~~~e~~~~~~~~~~~----~-----~~~~~~------lD~~---------~~~--------- 596 (780) ..-.- ++.. .+|++-+.. .+..+. + ...+|+ +|-. .+| T Consensus 696 ~~v~~-~~TV~ML~~~~~~YvDFF~~-CG~~~~~Vlt~~~GIY~~~~P~wnsP~I~~~svs~tt~~~~q~Ye~~T~~~vv 773 (1027) T protein:vir:80 696 LMVAY-GGTVCMLKLYGSRYVDFFNK-CGSFTGNVLTANSGIYTWTAPFWNSPVISNISVSGTTTLAVQRYELPTDLQVV 773 (1027) T ss_pred hhhcC-chhhhhhhhhcchhhhhhhh-cccceeeEEecCCceeEeecccccCCeeeEEEeeccchhhhheeccccccccc Confidence 11000 0000 011111000 000000 0 000011 1100 000 Q ss_pred eecCcc-------eeEEeeccccC------------CCCeEEEEEe--------cCccccceec-cc----------cc- Q lcl|NC_012662. 597 PVNDGK-------GIVPIYMRPWV------------SEGKLTGSVA--------TGALASEEVA-ID----------VD- 637 (780) Q Consensus 597 ~~~~~~-------~~~~~~~~~~~------------l~g~~v~~~a--------dG~~~~~~~~-~~----------~~- 637 (780) ++..-. .+.-++..++. -.|+++++.. .|....++.. .. +. T Consensus 774 pydnvedlsiyvnGT~Ls~~~~~~~~~~~i~LL~~~~~~~~~s~Vprcpvnvsy~~~~~~~~TT~~TV~~N~~~~iQ~Td 853 (1027) T protein:vir:80 774 PYDNVEDLSIYVNGTRLSFGTDWVKQGKAIYLLSDPGDGKTVSIVPRCPVNVSYQGDVTFDETTAQTVWVNNLLQIQGTD 853 (1027) T ss_pred cccccccceeeecceeEeecCchhhcCCEEEEecCCCCcceEEEEecccccccccccccccccccceEEecceeeeccce Confidence 000000 00011122222 2344444431 1111111110 00 00 Q ss_pred -ccc-ceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCceeeecce-EEEEEEEEEeccccEEEE-ecCCCCCccee Q lcl|NC_012662. 638 -EVS-WEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLISTAPV-RLLRYELTTRNTGEFDVR-IVDPTIGLDYS 713 (780) Q Consensus 638 -~~~-~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~~~r~-~v~rv~v~~~~T~~~~v~-v~~~~~~~~~~ 713 (780) +.+ ..++...-.....+.+|.-|.+.++-+-+.+ . + -+|| ||.++.|-|.+---+-+. +.+...+.+.. T Consensus 854 y~~~GS~L~~~~~LtN~~~~~G~~Y~S~Y~SP~F~L-~----S--L~~LKk~K~~~L~~Dnedvlpvytigdlasgqdvd 926 (1027) T protein:vir:80 854 YTLSGSTLTFTDTLTNAVVEVGNAYISYYQSPMFLL-G----S--LSNLKKVKHVYLYFDNEDVLPVYTIGDLASGQDVD 926 (1027) T ss_pred eeeccCccccccccccceEEEeecchhhhcchhhhh-h----h--hhhhhheeeeEEEEcCCcceeeeeeccccCCCchh Confidence 000 0111111122456888999998888544433 1 1 1344 688888888765444332 22221111111 Q ss_pred cccCceecccccccCCcccccccceEEEEeecccc-eeEEEEEECCCCCEEEEEEEEEEEEecce------ecC Q lcl|NC_012662. 714 NSKTSLVFGTDDVQLNQALVSDLSRVPVPCRSNAQ-STEMYLSTDGTQDMNILEIEYIIRYNQRR------RRV 780 (780) Q Consensus 714 ~~~~~~~~~~~~~~~g~~p~~~tg~~~vp~~~~~~-~~~v~i~~~~P~P~tvlai~~eg~y~~r~------rrv 780 (780) +.+|.=-..-...+.+...+.++ +.+.-|-+- ..+-|+--|+.-- -|- T Consensus 927 ------------dlvgkwktrananisvtydsentsetsydiysf-------sdlvwdnaffdvdptnlqstry 981 (1027) T protein:vir:80 927 ------------DLVGKWKTRANANISVTYDSENTSETSYDIYSF-------SDLVWDNAFFDVDPTNLQSTRY 981 (1027) T ss_pred ------------HhhhhhcccccceeEEEecCcCcccceeeeeeh-------hhhhcccceecccccccchhhH Confidence 11111000011233333333332 344433322 2233554443210 011 No 30 >protein:vir:95475 Length: 771 # NCBI annotation: hypothetical protein ORF038 # Family: family:all:5234 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294631;genbank:gi:149408197;genbank:GeneID:5237042 Probab=97.11 E-value=0.00017 Score=41.32 Aligned_cols=667 Identities=14% Similarity=0.091 Sum_probs=274.8 Q ss_pred Cceeeee-eechhhcccccCCchHhh-hhhhhhhhcceeeecCCceeCCcceee-------eeecccccCCCceeEEEEE Q lcl|NC_012662. 1 MARPFEG-ALNDLLQGVSQQVPRERV-AGQCSAQVNMLSDPVTGIRRRPGSLFV-------SVHDFGPIGEGDALYTQYL 71 (780) Q Consensus 1 Ma~~v~~-~~~~l~~GvSqq~D~~Ry-~~q~~~~~N~~~~p~gGl~rRpGt~fv-------a~~~~~~~~~~~~~~~~~~ 71 (780) ||+..++ +.-.|++|.=--.-++-| ++..-.-+||.....|--+||-|+-|- ......+.+ +....++.. T Consensus 1 m~~~~~~~~vNtFv~GliTEas~ltfpqnasiDe~N~~l~rdG~r~RR~g~~~E~~~~~vls~~~vpa~g-~~~v~~~~W 79 (771) T protein:vir:95 1 MAKTTNAAEFNTFVGGLITEASPLTFPQNASIDEVNFILNRDGSRNRRNGMDFENGATKVVCNTLVPADG-TIAVTSHNW 79 (771) T ss_pred CCcccchhHHhhhhhheeeccccccCCccceeeeeeeeecCCCcchhhceeeeecCCceEEEEEEecccc-eEEeeeech Confidence 9988766 456789993333333344 345556789999999999999987553 222222111 111222222 Q ss_pred ec--CCCccEEEEEEcCcEEEEEeCCCcEEEecCCCCccccCC---cceEEEEEeCCEEEEecCCEeeeEeecccccCCC Q lcl|NC_012662. 72 ER--GADGRHLVINTNTGGWWLLDREAKNIVSEGNLSYLLAAD---RRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDP 146 (780) Q Consensus 72 ~r--~~~~~~y~l~~~~g~~~v~d~~~~~~~~~~~~~y~~~~~---~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~ 146 (780) .. ++-+..+.++..+.-+.++...+-.+....-.-+ ...+ ...|++..+..++.|+||..-|....-+.+.-.. T Consensus 80 ~na~G~v~~~~livqvg~~l~f~q~t~~pLs~~n~~~~-a~~nlSPsh~isv~v~~G~livanp~i~~~~~~~d~~t~s~ 158 (771) T protein:vir:95 80 ENAGGEVGRWISLVQVGTELKFFQTTGETLSEGNFYNY-QFVNMSPSHKLSYAVVDGLLVVANGSRDIYVFEYDSGSVSV 158 (771) T ss_pred hhcccccCcEEEEEEeccEEEEEecCCCcccccceeee-ecceeccceeEEEEEeeeEEEEecCCccEEEEEecCCccee Confidence 21 3455666777666667777655423211100000 1111 1248888889999999998777554333221110 Q ss_pred CcceEEEecCccccceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheecccceEEEc--------Ce Q lcl|NC_012662. 147 KTTGFYFVKSGAFSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVGVDFAVRV--------GP 218 (780) Q Consensus 147 ~~~g~v~v~~g~y~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g~~~~~~~--------g~ 218 (780) .+.- +-+|.- +..+-- .++.+..++-- .+......+++.|.. |. +.+|.+.. +. T Consensus 159 t~~~-ll~r~r-f~~q~~--~~G~d~~~~~~-------~~~~gt~~tn~~iyn-ly------N~gw~~pk~~~~snt~~~ 220 (771) T protein:vir:95 159 TTKR-LLVRDL-FGVQDI--VNGVDLRQGND-------IATRPTVQTNAHIYN-LR------NQTFGVPRVTWHSNEPSD 220 (771) T ss_pred Eeee-eeeeeh-hhcccc--ccccceecccc-------cccCCcccCchhhee-cc------ccceeccccccccCCccc Confidence 0000 000100 000000 00000000000 011111222222211 11 11121110 11 Q ss_pred EEE--EEcCCCceeEEEeecCCcceeEEEEEeecceecccccccCCceEEEEeccCC--CCCceEEE-EEecCccEEEEe Q lcl|NC_012662. 219 YIY--FELITGTDLKITSTSGSPYIGYSNQSQVNLETDLPARLHPSADGALCAVGQS--ERALVWYR-YSSEKGVWLESG 293 (780) Q Consensus 219 ~i~--~~~~s~~~~~vt~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~--~~~~~y~~-~~~~~g~w~e~~ 293 (780) ++. ..+..+.- .+..+.-+ -.+.-...+...+..-.. ...++++ +.++. ...-|-.- |....+...+++ T Consensus 221 ~iV~~y~a~~g~~--pS~sd~~N-~a~~k~~~~Ei~t~~~f~--~~~~~~~-~~Gt~~~~~G~yi~da~~~g~~~Lt~~v 294 (771) T protein:vir:95 221 PIVTFRSAASGKF--PSNSDSVN-LALSKRADVEPSTTDRFR--AEDIVLN-PIGTYETARGFFIIDAMARGKSRLEEIV 294 (771) T ss_pred cceEeeeccCCCC--cCCceeec-cccchhhccceeeecccc--hhhhhhc-ccCcccccCcceeeehhhhcccccceee Confidence 110 00000000 00000000 000000000000000000 0001110 00111 01100000 000111111111 Q ss_pred cccceeEEcccceeEEeeccccccccchhhcCCcccCCCcccc---------c-CCCceEEEEEcceEEEec-------- Q lcl|NC_012662. 294 DYNSVTAISVDVPYKIVDDNVEQHIMEGRLAGDDLTNPAPTFL---------E-ERRITGIGTFQGRLVLLS-------- 355 (780) Q Consensus 294 ~~~~~~~~~~~~p~~l~~~~~~~~~w~~~~~gd~~t~~~psf~---------~-~~~~~~v~~~q~RL~f~~-------- 355 (780) . .+.+.||-+ + -+..+-|+-|-.|.|+++ T Consensus 295 e---------------------------------~~gr~~s~~~~~~~l~~~~t~~~~~~vaeyagRvwYag~~~~~iD~ 341 (771) T protein:vir:95 295 K---------------------------------LKQRYPSLSFGVSSLPQDETPGGASVVCEYAGRVWYAGFSGQIIDG 341 (771) T ss_pred e---------------------------------ccccchhhhccccccccccCCCCceeEEeeeeeEEEecceeEEeec Confidence 1 011111100 0 023456788889988876 Q ss_pred ----C---CeEEEEccCC--------ccccccccc--cCCCCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEe Q lcl|NC_012662. 356 ----G---AYVCMSATGE--------PDRFFRSTV--SSLDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVP 418 (780) Q Consensus 356 ----~---~~v~~S~~gd--------~~nF~~~t~--~~~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~ 418 (780) | ..|.+||.=+ |.+=.|++. ..+.|.|...+.+-+. -.|.-|+.++..|+||...+-|+|. T Consensus 342 dkng~~~~~~ilfSqLv~s~~di~nCyQd~DPTsee~~dLidTDGg~iri~ga--h~ii~Lv~f~~sLlvfc~NGVWAi~ 419 (771) T protein:vir:95 342 DDQSPRLVSYILFSQLVDSPADIVNCYQDGDPTSTEEPELVDTDGGFIRIEGA--HDIINLVNVGSAVMVVAANGIWMIQ 419 (771) T ss_pred cccCCceeeeEeeehhhcchhhcccccccCCCchhhhhhhhhcCCCEEEecCC--CCceeEEEecceEEEEEecceEEEE Confidence 1 1488887543 344444432 2466889999988663 3356689999999999999999997 Q ss_pred cCC-CcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCceEEEEEeeeeccccCceehhhHH-HHHHHhcCCCcE Q lcl|NC_012662. 419 SLQ-QLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFSAVLELVPSQYTSSQYVSQDVT-THIPRYIEGEAR 496 (780) Q Consensus 419 ~~~-~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls-~~~~h~~~g~~~ 496 (780) |.+ ...|.++..+.++++.+|++.=.=+++|+.++|-.++| |+.+..++ -+-+.++.|| ..+..|.+.=.. T Consensus 420 ggsd~g~tAtdY~ltKIs~vg~sspnSvVvvg~~i~ywsdtg-----Iyal~~Nd--fn~~tAqnLTekTIq~~~~~I~~ 492 (771) T protein:vir:95 420 GGSDYGFTATNYLVTKISEHGCSSPNSVVVVDNSFMYWGDDG-----IYHLTRNQ--YGDYVANNLTEKTIQKYYEKIPS 492 (771) T ss_pred eccCCceeeeeeEEEEeeeeccCCCccEEEecceEEEeeCCc-----eEEEeecc--cCcchhhccchHHHHHHHhhcch Confidence 654 57999999999999999998777789999999998775 88888875 3568999999 788887653221 Q ss_pred ---E--EEEeCCCCeEEEEEEcC--CC---EE--EEEEEeecCCceeeeeeEee---c-cCCcE----EEEEE------- Q lcl|NC_012662. 497 ---F--MQSASAANIVLMATTGD--NR---QV--IAHEYHFTSQGKVHQAWHKW---V-FPYRV----ASLHF------- 549 (780) Q Consensus 497 ---~--~~~~~~~~~~~~~~~~~--~g---~l--~~~ty~~~~~e~~v~aW~~w---~-~~G~v----~~~~~------- 549 (780) . ...+-+-+.-+.|..-+ |+ -+ +++ +-...|+-+| + .+|.. -++.. T Consensus 493 dk~knVtg~fd~~e~rvyw~yPn~~D~~~e~~t~LV~-------dLalgaFYp~~i~~~~ag~l~~~vg~~~~p~~~lv~ 565 (771) T protein:vir:95 493 DAILNATGFYDSYDKKVKWLYNTVLDGRTEPVTELVF-------DLALGAFYPSKIGSLTAGRLPIPVGSVKIPPYKLVE 565 (771) T ss_pred hhhcceEEEEEccCCEEEEEecceecCCCcceeeeee-------eecccccccccccccccCccceeeeeeecCcccccc Confidence 1 11222222223343321 00 00 111 1123466666 3 22322 11100 Q ss_pred ECCcEEE-EEEEcCCCeEEEEEEeeeccCCcccccccceeeeeccceeeecCcceeEEeeccccCCCCeEEE-EEecCcc Q lcl|NC_012662. 550 ARDRVVL-FAADDAGSTDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKGIVPIYMRPWVSEGKLTG-SVATGAL 627 (780) Q Consensus 550 ~~d~l~~-vv~R~~~g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~l~g~~v~-~~adG~~ 627 (780) .+.++-+ ..+-+.+|..+ .+.+.--....+ +..+..-.|-... +..+...+.+...+..+.|. +.+|- T Consensus 566 T~~eV~v~~~~v~~tG~~v-tV~~~~r~~~~~--~~~y~~~~~dg~~-----g~~~Fa~~~~~~f~DW~sv~~~~vdy-- 635 (771) T protein:vir:95 566 TGEEVTVASEQVTATGELV-TVKVSTRSPVIR--ETKYIIVEKLSSP-----MRISFGGYTDEEFVDWKSVDGIGVDA-- 635 (771) T ss_pred ccceEEecceeeEecCCce-EEEEEEeecccc--ceEEEEEEecCCC-----eeEEeccccCcceeecccCCCcccch-- Confidence 0111100 11111111110 000000000000 0000000000000 00011011111111111100 00000 Q ss_pred ccceeccccccccceEEEcCCCCCCEEEEeEeeeEEEEcCceEEe-cCCCce-eeec-------ceEEEEEEEEEecccc Q lcl|NC_012662. 628 ASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFRYESLFAPTPPMLK-DQNDTL-ISTA-------PVRLLRYELTTRNTGE 698 (780) Q Consensus 628 ~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~-~~~g~~-~~~~-------r~~v~rv~v~~~~T~~ 698 (780) +.++ .- .....-..+|..+--+|.+ +++ .++|=. ...| .--+-.+.++...++. T Consensus 636 -~sy~-~~------------gY~~~gd~~~~k~~PYit~---y~~~tedg~v~~~~g~~~p~n~sSclm~~sw~ws~s~~ 698 (771) T protein:vir:95 636 -PAYL-LT------------GYLAGGDYQREKFVPYITF---HFKKTEDGFVEDAEGDWTPTNQSSCMVQSQWSWTNSPA 698 (771) T ss_pred -HHHH-Hh------------hhhccchheeeeccceEEE---EEEeecccceecccccccccCCcceEEEEEeeeecCCC Confidence 0000 00 0001112222222222221 222 222210 0111 1112222333333322 Q ss_pred EEEEecCCCCCcceecccCceecccccccCCcccccccceE--EEEeecccceeEEEEEECCCCCEEEEEEEEEEEEecc Q lcl|NC_012662. 699 FDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRV--PVPCRSNAQSTEMYLSTDGTQDMNILEIEYIIRYNQR 776 (780) Q Consensus 699 ~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~--~vp~~~~~~~~~v~i~~~~P~P~tvlai~~eg~y~~r 776 (780) .++-...+..|.+-..++. ++...+.- +....+ +..++|..+-.+++|.+...-.|+|++.+.--..|-. T Consensus 699 -----t~k~~~~~eaYk~~~~~~p--~~~~~~~y-p~~~VV~TKsriRG~Gr~~~~rf~s~~gKdlhl~Gysil~~~~~~ 770 (771) T protein:vir:95 699 -----SNKWGRTWQAYRFRRHFFP--DNIDNQFD-DGNSVVETKSRLRGSGKVLSLYITTEPKKNLHIYGWSMLVDVNGT 770 (771) T ss_pred -----CCccccchheeeecceecc--CCcchhcC-CccceeeeeheeeecceEEEEEEEecCCcceEEEeEEEEEeecCc Confidence 1111112223333222211 11111111 111122 3467899999999999999999999999988877776 Q ss_pred e Q lcl|NC_012662. 777 R 777 (780) Q Consensus 777 ~ 777 (780) . T Consensus 771 ~ 771 (771) T protein:vir:95 771 V 771 (771) T ss_pred C Confidence 6 No 31 >protein:vir:105563 Length: 396 # NCBI annotation: hypothetical protein # Family: family:all:27455 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164316;genbank:gi:56692963;genbank:GeneID:3197174 Probab=97.08 E-value=0.00018 Score=41.11 Aligned_cols=362 Identities=12% Similarity=0.019 Sum_probs=149.6 Q ss_pred Cceeeeeeechhhcc--cccCCchHh----hhhhhhhhhcceeeecCCceeCCcceeeeeecccccCCCceeEEEEEecC Q lcl|NC_012662. 1 MARPFEGALNDLLQG--VSQQVPRER----VAGQCSAQVNMLSDPVTGIRRRPGSLFVSVHDFGPIGEGDALYTQYLERG 74 (780) Q Consensus 1 Ma~~v~~~~~~l~~G--vSqq~D~~R----y~~q~~~~~N~~~~p~gGl~rRpGt~fva~~~~~~~~~~~~~~~~~~~r~ 74 (780) ||. -+|-+|+|= |+.-.++.| -..-+|+.+|.=.++.|=.+||-|.+-+...+... ....+..-+.+- T Consensus 1 ~~~---~~~~~~~ginnv~~e~~l~~~~~~~~~~~r~a~nvdi~~~G~~~~r~~~tr~~~g~l~~--~~~~~~~~~~~~- 74 (396) T protein:vir:10 1 MAT---TSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQ--LWQSPLHGDAFG- 74 (396) T ss_pred Ccc---eeeeeeecccccccccccccCCCcccceeeeeeeecccCCCchhhhccCcccCCceecc--cccCccccceee- Confidence 993 456666543 555454444 34478999999999999999999998776433211 111111111110 Q ss_pred CCccEEEEEEcCcEEEEEeCCCcEEEecCCCCccccCCcceEEEEEeCCEEEEecCCEeeeEeecccccCCCCcceEEEe Q lcl|NC_012662. 75 ADGRHLVINTNTGGWWLLDREAKNIVSEGNLSYLLAADRRSIQTTSMGGVTYILNTEKRPSATTDNSDKKDPKTTGFYFV 154 (780) Q Consensus 75 ~~~~~y~l~~~~g~~~v~d~~~~~~~~~~~~~y~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~~~~~~~~~~~~~g~v~v 154 (780) ..+..+..+. ++.++.+- . + .....-+.+.+.+|-+|.++-..+-.+.. T Consensus 75 ~~~~tl~~~~-~~~w~~~~-~---v----------~v~~~pva~d~~~~Rvy~t~~~~p~~~~~---------------- 123 (396) T protein:vir:10 75 ALGDQWGKVD-PHSWTFEP-L---A----------QIGEGDLSHEVLNNRVCVAGTAGIFTYDG---------------- 123 (396) T ss_pred eCCceEEEEe-CCeEEEEe-e---e----------eeccCchhccccCCeEEEEcCCCceeeeC---------------- Confidence 1233333332 22233220 0 0 01112233445667777776443322110 Q ss_pred cCccccceeEEEEeeCC-------------ce--EEEEEEeccCCCCccccccchhhhhhhhhhhheecccceEEEcCeE Q lcl|NC_012662. 155 KSGAFSKEYDISVVWSE-------------GS--QTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVGVDFAVRVGPY 219 (780) Q Consensus 155 ~~g~y~~~y~vti~~~~-------------~~--~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g~~~~~~~g~~ 219 (780) ...|.+.+..-+ +. +..|+.+..+.. +... T Consensus 124 -----~~~y~L~vp~P~~a~~~a~~Gsl~~~~~~Y~~t~V~~~gEE-s~p~----------------------------- 168 (396) T protein:vir:10 124 -----AQAERLTLDTPAPPLLVAGAGSLSQGTYGAAVAWLRGPQES-APSL----------------------------- 168 (396) T ss_pred -----CcceecCcCCCcccccccccCccCCceEEEEEEEEecCCCc-Cccc----------------------------- Confidence 011111110000 00 111111100000 0000 Q ss_pred EEEEcCCCceeEEEeecCCcceeEEEEEeecceecccccccCCceEEEEeccCCCCCceEEEEEecCccEEEEeccccee Q lcl|NC_012662. 220 IYFELITGTDLKITSTSGSPYIGYSNQSQVNLETDLPARLHPSADGALCAVGQSERALVWYRYSSEKGVWLESGDYNSVT 299 (780) Q Consensus 220 i~~~~~s~~~~~vt~~~g~~~~~~~~~~~v~~~~~l~~~~~~~~~~~v~~~~~~~~~~~y~~~~~~~g~w~e~~~~~~~~ 299 (780) .....++ ++.. ...+-++..-.+.....|..+++.+.+ +|+..+- T Consensus 169 -------~~S~~v~--~~gg----------~~vtl~~~~~~~i~~~RiYrS~~~G~~-~~l~aE~--------------- 213 (396) T protein:vir:10 169 -------IAFAEVT--DAGA----------LEVTFPLCLDASVTGARLYLTRANGGE-LLLAGDY--------------- 213 (396) T ss_pred -------ccccccC--CCCC----------cEEEEEcccCCCcceEEEEEeCCChhh-hhheehh--------------- Confidence 0000000 0000 000000111111222334444333322 2221111 Q ss_pred EEcccceeEEeeccccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEecCCeEEEEccCCccccc-cccccCC Q lcl|NC_012662. 300 AISVDVPYKIVDDNVEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLSGAYVCMSATGEPDRFF-RSTVSSL 378 (780) Q Consensus 300 ~~~~~~p~~l~~~~~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~-~~t~~~~ 378 (780) |...........+|.....-..-=.|+|. + .-+.+|.+||+++.++.||+|...-++=+. +..-++ T Consensus 214 ------~a~~~s~vlPs~~w~gpP~~~~gL~pmP~--G----~~~A~faGRi~~A~Gn~V~FSEp~~Ph~~~~~~~~~~- 280 (396) T protein:vir:10 214 ------PLGAATVILPTLPELGRPAQFRHLSPMPT--G----KHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFVQ- 280 (396) T ss_pred ------ccceeeeeeecCCCCCCCccccccccCch--h----HhhhhhcceEEEEeCCEEEEecCCCCceecchhccCC- Confidence 00000001122345433211111122222 1 136799999999999999999998763221 111111 Q ss_pred CCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCCCcccccceEEEEEee---ecCCC---------CCCcE Q lcl|NC_012662. 379 DPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQQLLAPDNASVVLTSD---LACNA---------FVAPV 446 (780) Q Consensus 379 ~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~~~ltP~~~~~~~~s~---~~~~~---------~~~Pv 446 (780) . ...|.-+.+...+|+++|+++-|.+.|. +|++.+..+... .-|+. .-..+ T Consensus 281 -----------~--~~~Iv~lapv~~gL~Vgt~~~~y~~~G~----dP~sms~~~l~~~~pvp~S~v~~p~~~~s~rs~~ 343 (396) T protein:vir:10 281 -----------M--PQRITFVQPVDGGIWVGQVDHVAFLDGA----DPASLSVSRRASRAPVPGSAVLVPAEVVGTNASP 343 (396) T ss_pred -----------C--CCceEEEEEecCeEEEEEcCcEEEEEcC----ChhHcceeecccCCCcccchhcccchhhhccccc Confidence 1 1235667778899999999999999984 455554444421 11221 12234 Q ss_pred EeCCeEEEEEecCCceEEEEEeeeeccccCceehhhHHHHHHHhcCCCcEEEEEeCCCCeEEEEEEcCCCEEEEEEEeec Q lcl|NC_012662. 447 TTSQTLMYPAPRSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEGEARFMQSASAANIVLMATTGDNRQVIAHEYHFT 526 (780) Q Consensus 447 ~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g~~~~~~~~~~~~~~~~~~~~~~g~l~~~ty~~~ 526 (780) ..|..++|+++.| +.- +.. ++- . ..+....+.+....... + +..|++.+++- + T Consensus 344 ~~~~~~lwas~dG-----l~~----g~~-~G~-v---~~l~~~~i~p~~~~A~~---------~-~~~drRy~~~~---~ 396 (396) T protein:vir:10 344 DGSPVAVWLAENG-----YVM----GTS-SGA-I---AEVHAGVLAGITGRAGT---------S-VVFDRRLLTAV---S 396 (396) T ss_pred ccCcEEEEccCCc-----EEE----EcC-Cce-e---eeecccccCCCcccceE---------E-EeecCeEEEEe---C Confidence 5688899998876 221 111 111 1 11222223222211110 1 11233332211 0 No 32 >protein:vir:3133 Length: 911 # NCBI annotation: hypothetical protein # Family: family:all:5234 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640315;genbank:gi:21234408;genbank:GeneID:956056 Probab=96.43 E-value=0.00064 Score=38.11 Aligned_cols=684 Identities=14% Similarity=0.079 Sum_probs=248.4 Q ss_pred Cceeeeeeechhh--cccccCCchHhhhhh-hhhhhcceeeecCCceeCCcce-------eeeeecccccCCCceeEEEE Q lcl|NC_012662. 1 MARPFEGALNDLL--QGVSQQVPRERVAGQ-CSAQVNMLSDPVTGIRRRPGSL-------FVSVHDFGPIGEGDALYTQY 70 (780) Q Consensus 1 Ma~~v~~~~~~l~--~GvSqq~D~~Ry~~q-~~~~~N~~~~p~gGl~rRpGt~-------fva~~~~~~~~~~~~~~~~~ 70 (780) ||. -.|....|. +|---.-.+..|..- +-..+||-...+|=-+||-|.- |+...+..+++..+ ..+. T Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~- 77 (911) T protein:vir:31 1 MAA-RKGAVNRFTPVRGWVTEGNLANYGQDVALDVENMDIEKTGLTQRRFGLFAETSSEQFLSTFTATARARGL-LAVK- 77 (911) T ss_pred Ccc-ccccccccccceeeeecCchhhcCceeEeeeccccchhcccchhheeeeeccchhhhhhhhhhhhhhcce-eehh- Confidence 873 345554442 331112234444332 3457899988888888888862 22111111111000 0000 Q ss_pred EecC--CCccEEEEEEcCcE-EEEEeCCCcEEEecCCCCccccCCc-----------c-------eEEEEEeCCEEEEec Q lcl|NC_012662. 71 LERG--ADGRHLVINTNTGG-WWLLDREAKNIVSEGNLSYLLAADR-----------R-------SIQTTSMGGVTYILN 129 (780) Q Consensus 71 ~~r~--~~~~~y~l~~~~g~-~~v~d~~~~~~~~~~~~~y~~~~~~-----------~-------~l~~~q~aD~~fi~~ 129 (780) .-|. .+..--|++|..|+ +-|. ..++|+..+++ + -+...---....|.| T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 147 (911) T protein:vir:31 78 EWREAWGDKDVNMLIFHAGYKVHVV----------QDTAPLRDANILLTIDLLEAGIKLDGVIDSPVHISVGVGFAIITN 147 (911) T ss_pred hHHHhhCCCcceEEEEecCcEEEEE----------ecccCccccceEEEeeeeccCceeeeeecCceeEEeeceEEEeec Confidence 0011 11122244444442 2221 11222222211 0 112222234556777 Q ss_pred CCEeeeEeecccccCCCCcceEEEecCccccceeEEEEeeCCceEEEEEEeccCCCCccccccchh--hhhhhhhhhhee Q lcl|NC_012662. 130 TEKRPSATTDNSDKKDPKTTGFYFVKSGAFSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPE--AIARKLVEALIA 207 (780) Q Consensus 130 ~~~~p~~~~~~~~~~~~~~~g~v~v~~g~y~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~--~i~~~l~~~~~s 207 (780) |...|.-.-.+. -..+|+..+.-.+. .+-|.... --..|++ ..+-.++.+++ |-...-.-++++ T Consensus 148 ~~~~~~~~~~~~----~~~~~~~~~~~~~~----~~~~~~~~--~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (911) T protein:vir:31 148 PRIEPVLIKLDD----VDDEGVPTLSYEPL----TLLIRTRE--LLTPYTT----GTNYGDTLTPEEEWNLYNSGWATIT 213 (911) T ss_pred CccceEEEEeec----cCccCcccccccce----eeEeeehh--hcccccc----ccccCcccCchhhcccccccceeee Confidence 777664321110 01111111110000 00010000 0000111 11111111221 110000000000 Q ss_pred ------cccceEEEcCeEEEEEcCCCceeEEEeecCCccee---EEEEEeecceecccccccCCceEEEEe-ccCCC--- Q lcl|NC_012662. 208 ------VGVDFAVRVGPYIYFELITGTDLKITSTSGSPYIG---YSNQSQVNLETDLPARLHPSADGALCA-VGQSE--- 274 (780) Q Consensus 208 ------~g~~~~~~~g~~i~~~~~s~~~~~vt~~~g~~~~~---~~~~~~v~~~~~l~~~~~~~~~~~v~~-~~~~~--- 274 (780) .|++......-..|+....... +. .-.+.. .+++- ...|-.-+| ..+-||.- +.... T Consensus 214 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~--~~~~~~~~~~~~~~----~~~~~~~~~-~~~~~~~~~~~~~~~~~ 283 (911) T protein:vir:31 214 RATKDKSGSGTVYVNPVQYYFDKRGVYP---SH--SVLYNSMKQESAKE----IVALNVFSP-WADEKINFGTTTPPLGR 283 (911) T ss_pred eecccCCccceEEEchhheeecccCcCc---ch--hhhhhhhhhhccce----eEEEeeecc-ccccccccccCCCchhh Confidence 0011000000001111100000 00 000000 00000 000000011 11111110 00000 Q ss_pred --CCceEEEEEecCccEEEEecccceeEEcccceeEEe-eccccccccchhhcCCcccCCCc----------ccc----- Q lcl|NC_012662. 275 --RALVWYRYSSEKGVWLESGDYNSVTAISVDVPYKIV-DDNVEQHIMEGRLAGDDLTNPAP----------TFL----- 336 (780) Q Consensus 275 --~~~~y~~~~~~~g~w~e~~~~~~~~~~~~~~p~~l~-~~~~~~~~w~~~~~gd~~t~~~p----------sf~----- 336 (780) -++||. ++. .+ -..+.|.. + |+..- .+.-+- .+.++.+||.- .+. T Consensus 284 ~~~~~~~~--~~~--~~-~~~~~~~~-----~-~~~~~~~~~~~~------p~~~e~~np~gl~~igt~~n~k~~a~~~~ 346 (911) T protein:vir:31 284 YIHSAYYF--DSA--AI-LSLGIGNL-----T-PPTSDGTTEGSG------PAEEEISNPIGLDNIGTVNNLKLIAEGTV 346 (911) T ss_pred hhhhheee--ccc--ee-eeeccccc-----C-CCCCCCccCCCC------CchhhhcCCCCcccccchhceeeeeccce Confidence 112331 110 00 00000000 0 00000 000000 00011222210 000 Q ss_pred ---cCCCceEEEEEcceEEEec-----CCeEEEEccCC--------ccccccccc--cCCCCCccEEEEEcCCcceeEEE Q lcl|NC_012662. 337 ---EERRITGIGTFQGRLVLLS-----GAYVCMSATGE--------PDRFFRSTV--SSLDPTDRIDIASGSAQNSVFRQ 398 (780) Q Consensus 337 ---~~~~~~~v~~~q~RL~f~~-----~~~v~~S~~gd--------~~nF~~~t~--~~~~ddD~i~~~~~~~~~~~i~~ 398 (780) -...|+|++||.+|++|+. ...|.+|+.-+ |+.=.+.+. ..+-|.|...+.+.. ...|+- T Consensus 347 ~~~~~~r~r~~~~yaGRVfyaD~dkngk~rIlFSqLv~sl~di~nCYQdaDPTSeee~DLIdTDGg~vri~g--ah~Ii~ 424 (911) T protein:vir:31 347 RWTVKDRPRCSGYHNGHVYFGDRDKNGKTRILVSQLVNSLDNIPKCFQDADPTAEEINDLIATDGFTMYPVG--MGAPIT 424 (911) T ss_pred eeeecccccceeeeccEEEEeeeccCcceeEEEEeeccccccccccccCCCccccccchhhhcCCcEEecCC--CCCceE Confidence 1246899999999999994 34799998653 555455432 124457887777754 455788 Q ss_pred EeecCCcEEEEecCcEEEEecCC-CcccccceEEEEEeeecCCCCCCcEEeCCeEEEEEecCCceEEEEEeeeeccccCc Q lcl|NC_012662. 399 ALQFNKDLILLGDSTQAVVPSLQ-QLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPAPRSEAFSAVLELVPSQYTSSQ 477 (780) Q Consensus 399 ~v~~~~~L~l~t~~~q~~i~~~~-~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~v~e~~~~~~~~~~ 477 (780) ||.+++.|++|..++-|.|.|.+ ...|.++..+.+++..+|++.=.=|++|+.++|.+++| |..+...++ .- T Consensus 425 LV~~G~sLlVFcaNGVWAI~G~d~~g~TATdy~ItKIsdvGcsspNSVVvVgn~i~fWSd~G-----IyaLganqf--nD 497 (911) T protein:vir:31 425 MVEFNKRLLLLCTNGVWAIRGTSGGGATATDFTLDKVASVEFNSPQSVVDIGTAIVFWSERG-----IIAIGVNDF--GD 497 (911) T ss_pred EEEecCeEEEEEeCcEEEEeccCCCceeeeeeEEEEEeeeeeCCCCeEEEecCceEEeeCCc-----EEEEeeccc--Cc Confidence 99999999999999999998866 57999999999999999998777899999999998876 888888753 34 Q ss_pred eehhhHH-HHHHHhcCCC----cEEEE-EeCCCCeEEEEEEc---CCCEEEEEE----EeecCCceeeeeeEeeccC-Cc Q lcl|NC_012662. 478 YVSQDVT-THIPRYIEGE----ARFMQ-SASAANIVLMATTG---DNRQVIAHE----YHFTSQGKVHQAWHKWVFP-YR 543 (780) Q Consensus 478 ~~~~dls-~~~~h~~~g~----~~~~~-~~~~~~~~~~~~~~---~~g~l~~~t----y~~~~~e~~v~aW~~w~~~-G~ 543 (780) +.++.+| ..+..|.+.= +...+ .+-..+..+.|..- +.++.+.+. +.. .-...+|-+|... |. T Consensus 498 ~tAnNLTesTIQ~y~d~I~~dkIkNVtgtyd~de~rVyW~yPn~lDe~teykt~~~~ILVf---dLatgaFYPwtvs~gp 574 (911) T protein:vir:31 498 LTSNNLTENTIDEYYDSLDRDIIKNVKGTFINDENRVYWVVPNKQDSNGEYKTDGELVLVL---NLDTGGFYKHTVSGGP 574 (911) T ss_pred cccccccHHHHHHHHhhcChhhhceEEEEEEccCCEEEEEecCccCCccceeecCceEEEE---EeccCcccceeeecce Confidence 7899998 6677766421 11112 22222223444443 233333311 110 1123488888754 44 Q ss_pred EEEEEEE-----CCcE---------EEEEEEcCC-CeEEEEEEeeeccCCcccccccceeeeeccceeeecCcceeE-Ee Q lcl|NC_012662. 544 VASLHFA-----RDRV---------VLFAADDAG-STDKITISTIDPKQGGVTFDVDRLPHLDSMSIVPVNDGKGIV-PI 607 (780) Q Consensus 544 v~~~~~~-----~d~l---------~~vv~R~~~-g~~~~~~e~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~~~~~-~~ 607 (780) ++..... +.|. +++++--.+ -...++.|-+ +.+ ..+..++|-|- ..++.+. .. T Consensus 575 Ll~~p~y~Lv~TreEvtvPi~~etgaiIve~gsdPV~~tl~vdtt----GvD-g~ayLl~frdg------~~g~~~f~a~ 643 (911) T protein:vir:31 575 LLHAPFRRLVNTRAEVSIPITETDGTVITDTLGDPVTVTRTVTTT----GVD-GLAYFASFDDG------VNGQFNFIAE 643 (911) T ss_pred eecccccccccccccceeeEEeecceEEEecCCCCeEEEEeeecc----ccc-ceeEEEeeccC------CcceEEEEEe Confidence 4422211 1111 122211111 0011122211 000 00111111110 0111111 11 Q ss_pred eccccCCCCeEEE--------EEecCccccceeccccccccceEEEcCCCCCCEEEEeEe-----eeEEEEcCceEEecC Q lcl|NC_012662. 608 YMRPWVSEGKLTG--------SVATGALASEEVAIDVDEVSWEFTVEPGFKDSQIYLGFR-----YESLFAPTPPMLKDQ 674 (780) Q Consensus 608 ~~~~~~l~g~~v~--------~~adG~~~~~~~~~~~~~~~~~~~i~~~~~~~~v~vGl~-----y~~~v~~~~~~i~~~ 674 (780) ..+...+....|- -+++-.+...+ ++.++.. ++.--+-+.-|++ |+.+-.-+.+ -+.| T Consensus 644 ~~~~~~~dw~~~~~~~~~~y~s~~~~~y~~~~-~~~~~~~-------~pyi~sy~~~~~rv~~~~y~~~~a~~~f-~~~~ 714 (911) T protein:vir:31 644 HQPWGFADWANVPNMTRVNYSSYVDFAYEYPE-VMIGNIS-------LPYIHSYYLTGIRVQTEQYTTETAHLSF-HRVQ 714 (911) T ss_pred ecCCeeeccccCccccccchhHHHHhhhhhhh-hhhhccc-------CceeeeeeeeeeEEeccceeeeccccee-Eeee Confidence 1111111111100 00010000000 0000000 0000001111111 1110000000 0111 Q ss_pred CCceeeecceEEEEEEE--------------EEeccccEEEEecCCCCCcceecccCceecccccccCCcccccccceEE Q lcl|NC_012662. 675 NDTLISTAPVRLLRYEL--------------TTRNTGEFDVRIVDPTIGLDYSNSKTSLVFGTDDVQLNQALVSDLSRVP 740 (780) Q Consensus 675 ~g~~~~~~r~~v~rv~v--------------~~~~T~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~p~~~tg~~~ 740 (780) .-....-|.++.|++.+ .+.++-...+ | +++.+ ..+.++.....+...+..+-++..|++- T Consensus 715 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v-V-NGDAE---~GtmTGWtvtaG~~d~~Ta~p~~rGSyf 789 (911) T protein:vir:31 715 AHQTTALGTVTFHKVDMMVSTGMQVISFHKDDLLRTEAVTL-V-NPDAE---TGDATGWTVTAGTLDVRTAAPLYQGSYY 789 (911) T ss_pred cccceeeeeeeeeeeeehhhccceeeeeccccceeeeeeEE-E-cCCCC---CCCCCcceeeccchhhccCCchhcceEe Confidence 11111112233333322 2222322222 1 11111 2334444444443334434445677776 Q ss_pred EEeecccceeEEEEEECC-----------CCCEEEEEEEEEEEEecce--ec--C Q lcl|NC_012662. 741 VPCRSNAQSTEMYLSTDG-----------TQDMNILEIEYIIRYNQRR--RR--V 780 (780) Q Consensus 741 vp~~~~~~~~~v~i~~~~-----------P~P~tvlai~~eg~y~~r~--rr--v 780 (780) |-- +. ++.-.+.|+- -+|.++. +|-+.|..+. -| | T Consensus 790 Fa~-~n--n~n~aL~QDIDSagaaaIDAG~v~ynvS--awl~gyAaqnd~Dr~~l 839 (911) T protein:vir:31 790 FWS-DS--NANFAAYQDIDPVGGGYITAGELANNVI--EAKLSWAARGNTDLGTV 839 (911) T ss_pred EcC-CC--Ccchhhheeccccccceeeeccchhhhh--hhhhhhccCCCCccceE Confidence 541 22 2333333332 2233332 3777777663 23 3 No 33 >protein:vir:93631 Length: 580 # NCBI annotation: Bcep22gp67 # Family: family:all:1544 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944296;genbank:gi:38640373;genbank:GeneID:2658280 Probab=66.60 E-value=0.27 Score=23.75 Aligned_cols=495 Identities=11% Similarity=0.005 Sum_probs=159.9 Q ss_pred CcEEEEEeCCCcEE--EecCCCCccccCCcceEEEEEeCCEEEEecCCEeeeEe------------------ecccccCC Q lcl|NC_012662. 86 TGGWWLLDREAKNI--VSEGNLSYLLAADRRSIQTTSMGGVTYILNTEKRPSAT------------------TDNSDKKD 145 (780) Q Consensus 86 ~g~~~v~d~~~~~~--~~~~~~~y~~~~~~~~l~~~q~aD~~fi~~~~~~p~~~------------------~~~~~~~~ 145 (780) .-.|++-.-. |.+ +..-..|- ...|.+--+-+=+....|-+. .++ ...| T Consensus 1 M~~i~i~~f~-Ge~Prl~p~lLP~---------~~a~~a~n~~~~~G~i~P~~~~~~~~~~~~i~~~~~~t~~~~-~~~W 69 (580) T protein:vir:93 1 MTIIKITGFS-GEIPRLVPRLLPD---------TAAQNATNARLESGGLTPYRKPKFITRISTIPAGQIETIYRN-GETW 69 (580) T ss_pred CeeEeecccc-cccccchhhhccc---------cccceEEeeeccCCeeeeeeCchhhccccccCcCcceEEEec-Ccee Confidence 1112221101 111 01111111 112222222233333333222 121 1122 Q ss_pred CCcceEEEecCccccceeEEEEeeCCceEEEEEEeccCCCCccccccchhhhhhhhhhhheecccceEEEcCeEEEEEcC Q lcl|NC_012662. 146 PKTTGFYFVKSGAFSKEYDISVVWSEGSQTVTYTTPDGTTAGDADQSVPEAIARKLVEALIAVGVDFAVRVGPYIYFELI 225 (780) Q Consensus 146 ~~~~g~v~v~~g~y~~~y~vti~~~~~~~t~t~tt~~~s~~~~~~~~~~~~i~~~l~~~~~s~g~~~~~~~g~~i~~~~~ 225 (780) ....+.+.+--+.-..+ .+-++ ++. -.-.+.+.+++.......+ .....+...+-....+.+.|... T Consensus 70 ~~w~~~V~~i~~PvA~D-Rvy~T--d~g-~Pkvt~~g~sy~lgVpaPs--------~Apt~~~~g~g~l~~~~y~Yv~T- 136 (580) T protein:vir:93 70 MAWDKPVYAAPGPVAAD-RLYVM--GDG-APKMIVGGTTYPLAVPMPS--------AALTAATSGTGTGDVFSRVYVYT- 136 (580) T ss_pred EEeCCceeeecCccccc-eeEEc--CCc-ccceecCCccccccCCCcc--------cCceeeecCCCCcCccceEEEEE- Confidence 22223333222222111 11111 110 0111122222221100000 00000100011112233333210 Q ss_pred CCceeE-EEeecCCcce----eEEEEEeecceecccccccCC--ceEEEEeccCCCC-CceEEEEEecCccEEEEecccc Q lcl|NC_012662. 226 TGTDLK-ITSTSGSPYI----GYSNQSQVNLETDLPARLHPS--ADGALCAVGQSER-ALVWYRYSSEKGVWLESGDYNS 297 (780) Q Consensus 226 s~~~~~-vt~~~g~~~~----~~~~~~~v~~~~~l~~~~~~~--~~~~v~~~~~~~~-~~~y~~~~~~~g~w~e~~~~~~ 297 (780) ++ -..+++.... .....+..-+.+.+|+-.-.+ ....|.++..+.. .+||+..+-..+. T Consensus 137 ----fVt~~GeES~PS~~S~~vtv~~g~tVtLs~~p~p~~~~~i~~~RIYRS~tG~~gtdy~lVAel~Ag~--------- 203 (580) T protein:vir:93 137 ----FVTGFGEESEPSAISNEVNWQAGQTVTLSGFQAAPAGRNITKQRIYRSQTSLSGTDLYFIAERDASA--------- 203 (580) T ss_pred ----EEcCCCCcCCCcccccceeeCCCCeEEEEecCCCCCCCccceEEEEEeccCCCceeEEEEeeeccce--------- Confidence 00 0001111110 000011111333343322111 1133444433322 3444443322110 Q ss_pred eeEEcccceeEEeeccccccccchhhcCCcccCCCcccccCCCceEEEEEcceEEEecCCeEEEEccCCccccccccccC Q lcl|NC_012662. 298 VTAISVDVPYKIVDDNVEQHIMEGRLAGDDLTNPAPTFLEERRITGIGTFQGRLVLLSGAYVCMSATGEPDRFFRSTVSS 377 (780) Q Consensus 298 ~~~~~~~~p~~l~~~~~~~~~w~~~~~gd~~t~~~psf~~~~~~~~v~~~q~RL~f~~~~~v~~S~~gd~~nF~~~t~~~ 377 (780) ..+....+-........+.+|. .|.+.- . .-+.|.++++.-..++.||+|..=-.+-| |..-.. T Consensus 204 -~sF~Dd~s~a~Lge~Lps~~~~-----------~PP~~m-~--gL~~m~nGi~agF~Gnev~fsEpy~P~AW-P~~yr~ 267 (580) T protein:vir:93 204 -ANFVDNVPLSDQNEPLPSLEWN-----------APPDDL-T--GLISLPNGMMAAFRGKELWLCEPWRPHAW-PQKYVL 267 (580) T ss_pred -eeeeecccccccccccchhhcc-----------CcCCCc-c--eEEeeccceEEEEeCCEEEEecCCCCccc-hhhcCC Confidence 1111111111111111122221 122111 1 12357777765556999999999555553 221111 Q ss_pred CCCCccEEEEEcCCcceeEEEEeecCCcEEEEecCcEEEEecCC-CcccccceEEEEEeeecCCCCCCcEEeCCeEEEEE Q lcl|NC_012662. 378 LDPTDRIDIASGSAQNSVFRQALQFNKDLILLGDSTQAVVPSLQ-QLLAPDNASVVLTSDLACNAFVAPVTTSQTLMYPA 456 (780) Q Consensus 378 ~~ddD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~i~~~~-~~ltP~~~~~~~~s~~~~~~~~~Pv~vg~~v~f~~ 456 (780) ..|. .|--+.+++..|+++|.+.-|+++|.+ +.|+.....+. -.|-+.=..|.+|..++|++ T Consensus 268 t~~~-------------~Ivaia~~g~~LvV~T~g~pyl~~G~~P~~ms~~kL~~~----q~CvS~rsiV~~~~~v~Yas 330 (580) T protein:vir:93 268 TMDY-------------NIVALGAYGTTIVVATDGQPYIVSGASPDAMSQEKLELN----LPCINARGLVDLGYAIAYPS 330 (580) T ss_pred CCCC-------------CceeEeeeCceEEEEEcCceEEEEccChhhccccccccc----cccccccceeecCceEEeec Confidence 1122 234456677899999999999999842 23444333222 24666667899999999998 Q ss_pred ecCCceEEEEEeeeeccccCceehhhHHHHHHHhcCC------CcEEEEEeCCCCeEEEEEEc-CC-----CEEEEEEEe Q lcl|NC_012662. 457 PRSEAFSAVLELVPSQYTSSQYVSQDVTTHIPRYIEG------EARFMQSASAANIVLMATTG-DN-----RQVIAHEYH 524 (780) Q Consensus 457 ~~g~~~~~v~e~~~~~~~~~~~~~~dls~~~~h~~~g------~~~~~~~~~~~~~~~~~~~~-~~-----g~l~~~ty~ 524 (780) ..| +-- .+. ++ +.=+| ..||.- .+..+.+.+-+..-+..-.. ++ ....++.. T Consensus 331 ~dG-----Lv~--i~~---~g--a~vvT---~~l~t~~qW~~~~P~ti~a~~~eG~Y~a~Y~~~~~~~~~~~g~fi~d~- 394 (580) T protein:vir:93 331 HDG-----LVV--ASS---SG--ARVVT---DQLMTRNDWLKTAPGRFVSGQFFGRYLASYEYIDPAGTARRGSFIIDL- 394 (580) T ss_pred CCc-----EEE--EeC---Ch--HHHHH---hhccChhHHHhcCCceEEEEeecCeEEEEEcccccccccccceEEEec- Confidence 876 221 111 11 21112 222211 11122233333222211111 10 11122211 Q ss_pred ecCCceeeeeeEeeccCCcEEEEEEECCcEEEEEEEcCC----Ce----EEEEEEeeeccCCccccccccee-eeeccc- Q lcl|NC_012662. 525 FTSQGKVHQAWHKWVFPYRVASLHFARDRVVLFAADDAG----ST----DKITISTIDPKQGGVTFDVDRLP-HLDSMS- 594 (780) Q Consensus 525 ~~~~e~~v~aW~~w~~~G~v~~~~~~~d~l~~vv~R~~~----g~----~~~~~e~~~~~~~~~~~~~~~~~-~lD~~~- 594 (780) .++ ...|.+-++.......-...|.||++-.+.+- +. .-++.-+... ......+.. .+++.. T Consensus 395 -~~~---~~~~~~~~~~~d~~~~d~~~d~Ly~~~~~~i~~~~~~~~~~~~~~WrSK~f~----~~~~~sf~~~rV~s~~~ 466 (580) T protein:vir:93 395 -TGQ---EAFLHRTNYKADATFYDITEGKLYLCIGQDIYEWDALDSENEILVWRSKQYV----VQKPTNFGVILIEGSVL 466 (580) T ss_pred -CCC---cceeEEeccccceeeeeccCCeEEEEeCCEEEEEcCCCCCcceEEEecceEE----ecCCcCceEEEEeeccc Confidence 111 11355555543333333346788887644321 10 0111111100 000000100 011000 Q ss_pred -----------------------------eeeecCcceeEEeecccc-----CCCCeEEEEEecCccccceecccccccc Q lcl|NC_012662. 595 -----------------------------IVPVNDGKGIVPIYMRPW-----VSEGKLTGSVATGALASEEVAIDVDEVS 640 (780) Q Consensus 595 -----------------------------~~~~~~~~~~~~~~~~~~-----~l~g~~v~~~adG~~~~~~~~~~~~~~~ 640 (780) .--.+....+...+.|++ ...-.++.+++||.+..... .++ T Consensus 467 ~~~~~~~a~~~~~~~~~a~n~~~~~~~~~~~~~~~~~v~~~~i~gd~~~~~~~~~~~~~~~~adG~~~~t~~-----~~~ 541 (580) T protein:vir:93 467 MTPEEEAAEQAAIDAAKAHNDSIFGDASIGGELNGAALNVYPIDGDALVRIESSRFVAATVYADGKAVATVS-----KLN 541 (580) T ss_pred cchhhhhhhhhhhhhhhhhhhhcccccccccccccccceeeeeccccccccccccceEEEEeeCCeEEEEEe-----cCC Confidence 000011111111122222 12245667778887654332 122 Q ss_pred ceEEEcCCCCCCEEEEeEeeeEEEEcCceEEecCCCceeeecceEE Q lcl|NC_012662. 641 WEFTVEPGFKDSQIYLGFRYESLFAPTPPMLKDQNDTLISTAPVRL 686 (780) Q Consensus 641 ~~~~i~~~~~~~~v~vGl~y~~~v~~~~~~i~~~~g~~~~~~r~~v 686 (780) .-++++.+..+.+..|=+.=..+|+ .+.+-.. +.. -++| T Consensus 542 ~~~RLPag~~a~~Wev~vsg~~~V~--~v~la~s----~~E-L~~~ 580 (580) T protein:vir:93 542 RMCRLPSGFLAQTWEVEVSANADIA--QVTLAGT----GAE-LAGV 580 (580) T ss_pred ceEEccCCccccEEEEEEEecccee--EEEEecC----hHH-HhcC Confidence 3345554444444433222222222 1111110 000 0112 Done!