Query lcl|NC_020838.1_cdsid_YP_007673160.1 [gene=SWQG_00014] [protein=hypothetical protein] [protein_id=YP_007673160.1] [location=complement(16018..18963)] Match_columns 981 No_of_seqs 192 out of 224 Neff 7.1 Searched_HMMs 1612 Date Thu Nov 7 16:37:02 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_14 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_14_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:100022 Length: 976 100.0 2E-249 1E-252 1383.8 93.3 928 1-981 1-976 (976) 2 protein:vir:78703 Length: 905 100.0 6E-239 4E-242 1326.6 87.0 861 1-981 1-905 (905) 3 protein:vir:10452 Length: 794 100.0 9E-220 6E-223 1221.3 85.1 766 1-980 1-794 (794) 4 protein:vir:3366 Length: 801 # 100.0 1E-218 7E-222 1215.5 83.7 764 1-980 1-801 (801) 5 protein:vir:94713 Length: 785 100.0 2E-217 1E-220 1208.5 85.5 760 1-980 1-785 (785) 6 protein:vir:1543 Length: 801 # 100.0 1E-217 8E-221 1209.7 82.7 764 1-980 1-801 (801) 7 protein:vir:99677 Length: 794 100.0 3E-217 2E-220 1207.9 83.7 766 1-980 1-794 (794) 8 protein:vir:94583 Length: 792 100.0 2E-216 1E-219 1203.5 85.0 766 1-980 1-792 (792) 9 protein:vir:8887 Length: 808 # 100.0 4E-214 2E-217 1190.5 81.2 762 1-980 1-808 (808) 10 protein:vir:2203 Length: 794 # 100.0 5E-213 3E-216 1184.7 86.1 766 1-980 1-794 (794) 11 protein:vir:97014 Length: 800 100.0 4E-213 2E-216 1185.1 85.2 758 2-980 1-800 (800) 12 protein:vir:105647 Length: 800 100.0 1E-212 7E-216 1182.7 84.2 767 2-980 1-800 (800) 13 protein:vir:103341 Length: 806 100.0 8E-211 5E-214 1172.5 84.4 768 2-980 1-806 (806) 14 protein:vir:7021 Length: 803 # 100.0 4E-210 3E-213 1168.4 84.7 763 2-980 1-803 (803) 15 protein:vir:6326 Length: 826 # 100.0 3E-210 2E-213 1169.5 82.0 759 1-980 1-826 (826) 16 protein:vir:80253 Length: 777 100.0 2E-206 1E-209 1148.6 84.4 755 1-980 1-777 (777) 17 protein:vir:78957 Length: 826 100.0 1E-203 8E-207 1133.0 83.1 760 1-980 1-826 (826) 18 protein:vir:1778 Length: 680 # 100.0 2E-186 1E-189 1038.5 61.9 637 1-728 1-680 (680) 19 protein:vir:103790 Length: 768 100.0 2E-154 1E-157 863.2 71.6 723 1-977 1-768 (768) 20 protein:vir:95324 Length: 823 100.0 7E-144 4E-147 805.4 68.8 699 1-976 1-823 (823) 21 protein:vir:7329 Length: 825 # 100.0 9E-131 6E-134 733.4 66.4 697 1-976 1-825 (825) 22 protein:vir:107423 Length: 681 100.0 2E-127 1E-130 715.6 66.5 664 184-975 1-681 (681) 23 protein:vir:107802 Length: 681 100.0 2E-127 1E-130 715.6 66.5 664 184-975 1-681 (681) 24 protein:vir:98487 Length: 681 100.0 2E-127 1E-130 715.6 66.5 664 184-975 1-681 (681) 25 protein:vir:102644 Length: 594 100.0 2E-105 1E-108 595.0 55.9 560 333-976 1-594 (594) 26 protein:vir:100022 Length: 976 99.9 3.1E-22 1.9E-25 138.5 46.9 704 189-981 1-919 (976) 27 protein:vir:78703 Length: 905 99.9 6.9E-20 4.3E-23 125.6 42.1 703 189-981 1-901 (905) 28 protein:vir:1778 Length: 680 # 99.8 1.7E-19 1.1E-22 123.5 32.5 545 189-801 1-680 (680) 29 protein:vir:94602 Length: 1012 99.6 1.8E-13 1.1E-16 90.5 39.0 832 1-980 1-1012(1012) 30 protein:vir:80177 Length: 1027 98.8 2.8E-08 1.8E-11 62.0 42.2 806 7-981 1-982 (1027) 31 protein:vir:2625 Length: 715 # 98.3 1.6E-06 9.6E-10 52.4 43.0 639 232-977 1-715 (715) 32 protein:vir:95324 Length: 823 97.6 3.5E-05 2.2E-08 45.0 41.6 643 180-981 1-742 (823) 33 protein:vir:95475 Length: 771 97.4 7.4E-05 4.6E-08 43.2 45.3 671 180-977 1-771 (771) 34 protein:vir:8837 Length: 513 # 97.0 0.00023 1.4E-07 40.6 39.8 473 340-979 1-513 (513) 35 protein:vir:107423 Length: 681 96.3 0.00074 4.6E-07 37.8 45.1 576 1-662 1-681 (681) 36 protein:vir:98487 Length: 681 96.3 0.00074 4.6E-07 37.8 45.1 576 1-662 1-681 (681) 37 protein:vir:107802 Length: 681 96.3 0.00074 4.6E-07 37.8 45.1 576 1-662 1-681 (681) 38 protein:vir:7329 Length: 825 # 92.4 0.012 7.2E-06 31.2 41.1 649 180-981 1-744 (825) 39 protein:vir:3529 Length: 477 # 87.5 0.038 2.4E-05 28.3 31.3 435 374-923 1-477 (477) 40 protein:vir:105563 Length: 396 82.4 0.077 4.8E-05 26.7 20.3 336 304-706 1-396 (396) 41 protein:vir:97014 Length: 800 79.9 0.1 6.2E-05 26.1 41.1 644 1-747 11-800 (800) 42 protein:vir:3133 Length: 911 # 68.9 0.23 0.00014 24.1 35.0 666 204-981 1-827 (911) 43 protein:vir:108312 Length: 458 59.7 0.39 0.00024 22.8 35.3 430 355-972 1-458 (458) 44 protein:vir:105647 Length: 800 50.7 0.6 0.00037 21.8 42.7 657 1-747 27-800 (800) 45 protein:vir:2109 Length: 472 # 44.7 0.8 0.00049 21.1 31.4 422 374-897 1-472 (472) 46 protein:vir:9268 Length: 472 # 36.3 1.2 0.00073 20.2 30.7 424 374-897 1-472 (472) 47 protein:vir:105428 Length: 472 28.7 1.7 0.0011 19.3 30.1 429 377-918 1-472 (472) 48 protein:vir:107669 Length: 123 21.3 1.4 0.00087 19.8 3.4 119 232-387 1-123 (123) No 1 >protein:vir:100022 Length: 976 # NCBI annotation: T7-like tail tubular protein B # Family: family:all:825 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214208;genbank:gi:61806431;genbank:GeneID:3294702 Probab=100.00 E-value=2.2e-249 Score=1383.80 Aligned_cols=928 Identities=29% Similarity=0.479 Sum_probs=746.5 Q ss_pred CCceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCC-------CCceEEEEEeCCCce Q lcl|NC_020838. 1 MSTISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAE-------ARGRWFPILRDEEEK 73 (981) Q Consensus 1 m~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~-------~~~~~~~~~rd~~e~ 73 (981) ||+|+||||||++|||||||++|||||+++|+||+||||+||+||||++||++|++.+ .+++||+|+||+.|+ T Consensus 1 M~~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRp~~~~v~~l~~~~~~~~~~~~~~~~~~~~r~~~e~ 80 (976) T protein:vir:10 1 MASVTQTIPTLTGGLSQQPDELKIPGQVSVANNVIPDVTHGLLKRPGGKLVASISDNGTAALNSQTNGKWFSYYRDETES 80 (976) T ss_pred CcceeecchhhhCcceecchhhcCCchhhhhhccccccccccccCCcceeeeeecCCCcccccccccceEEEEEcCCCcE Confidence 9999999999999999999999999999999999999999999999999999998654 267899999999999 Q ss_pred EEEEEEcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCccc---------ccccce Q lcl|NC_020838. 74 YVCQYDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDL---------NSKQAT 144 (981) Q Consensus 74 y~~~~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~---------~~~~~~ 144 (981) |++++.+ +|.|+||||.||++++|++++++++++..||.+ +++++||++||+|||||+|++++ .++..| T Consensus 81 y~~~~~~-~g~~~v~~~~~G~~~~v~~~~~~~~~~~~yl~~-~~~~~~~~~tv~d~tfi~N~~~~~~~~~~~~~~~~~~~ 158 (976) T protein:vir:10 81 YIGQVSR-SGDINMWRCSDGQAMTVNYDSGTATALTTYLTH-TNDEDIQTLTLNDYTFLTNRTKTVAMSSTVEPVRPPEV 158 (976) T ss_pred EEEEEec-CCceEEEEccCCeEEEEEcCCCcccccchhhcc-CCcceeEEEEEccEEEEecCceEEeecccccCCCCceE Confidence 9999976 678999999999999999999999999999987 68999999999999999999654 334458 Q ss_pred EEEEcCCcccccceeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceec---ccccccc-ccccCCccccc Q lcl|NC_020838. 145 YTKTNDGQTATKVNLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAA---GSAMPSG-YSLGNERTDDY 220 (981) Q Consensus 145 ~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~-~~~~~~~~~~~ 220 (981) +++||+|||+++|++ .|.....+.+..+...+ .++...+..++. ++..... .....-++.+ T Consensus 159 ~~~v~~~~y~~~y~~---~i~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~- 223 (976) T protein:vir:10 159 FIDLKATAYARQYAV---NLFDNTTTTAVSTVTRI-----------DVELIKSSNNYCDSNGAMVARTSRPSNSTRCDD- 223 (976) T ss_pred EEEeeeeccceEEEE---EEcCCcccceeeeeeee-----------eeccccCCcccccccccchhhHhHhhhhhcccc- Confidence 888888888876543 22211111111111111 111122222211 1000000 0000001111 Q ss_pred ceEeecceeEeeeeeeeEEeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEE-Eeeeec-----ccceeec Q lcl|NC_020838. 221 PWFKRDGYRVYEVEKEVAAAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTAC-NIGSSN-----IPASAYL 294 (981) Q Consensus 221 ~~~~~~g~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~-~~~~~~-----~~~~~y~ 294 (981) -+.+|.. +....+++|++.- ++-.++++.+++..+++.-+.+|... .++... .+...+- T Consensus 224 --s~~~G~~---------~~~~~v~~~~f~~----~~G~~~~i~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~ 288 (976) T protein:vir:10 224 --SAGDGRD---------AYAPNVGTKVFNV----TDGASLTDEANSGSYTYTIDVKDSSNNSVNRGVNLYFRIRTVGQS 288 (976) T ss_pred --ccccccc---------ccCceeeeeEEEe----ccCccceEcCCcceEEEEeeccccceEEeecCCceEEEEccccce Confidence 1123332 3344666666532 33445555566666665544444211 111111 0011111 Q ss_pred ccCCcc-eeeEEEEcCEEEEeCCceeEeccCCCCCCCCceEEEEEeeeccceEEEEeeCceEEE-EE-----eccCCCcc Q lcl|NC_020838. 295 KDAAPE-DIEILTINDYTFVLNKNKTTAMKTTTSAAVPNVAFVVIRIVAYNSDYSVTLNGTTVT-HS-----TPDTVAGA 367 (981) Q Consensus 295 ~~~~~~-dl~~~t~ad~tfi~n~~~~~~~~~~~~~~~~~~~~v~v~~g~y~~~~~v~~ng~~~~-~~-----t~~~~a~~ 367 (981) ...++. +++ +++||+|..++.+.+...+. +......++|++++++|+.+++..+++...+ .. ++.....+ T Consensus 289 ~~~~~gt~~~--~~~~Y~~~y~~~~~v~~~~~-g~~~~~~~~V~v~g~~Y~it~~~~~~~~~~a~~~~~~~~~t~~d~~~ 365 (976) T protein:vir:10 289 VPFTTGSGSS--ATTTYQARYTTTFDLLYGGT-GWQEGDYFYVWMKDGYYKITVEAISTANVQANLGLIRPNPTPFDTET 365 (976) T ss_pred eecccccccc--eeeeeeEEEEeEEEEecCCC-CcccCceEEEEccccceeeEEEEeeceeEEeccccccCcCCcCcccc Confidence 111222 223 67899999999999987554 4456678899999999999998777664322 11 12223344 Q ss_pred eecHHHHHHHHHhhhhc---cCceEEEEcCCEEEEEeCCC-ceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEE Q lcl|NC_020838. 368 TTDSGSIAAALTSSINA---LTGFSATQVGPGIYIEGTSA-FSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRV 443 (981) Q Consensus 368 ~~~~~~ia~~l~~~i~~---~~~~~~~~vg~~i~i~~~~~-~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v 443 (981) ..++++|+.+|...+.+ +.++++.+.|++|+|.+++. +.++.++ +..++.++++|+++++||..||+|++|+| T Consensus 366 ~~~~~~ia~~L~~~l~a~~~~~g~tv~~~g~~~~i~~~~~~~~~s~~~---~~~~~~~~~~V~~~~~LP~~~~~g~~v~V 442 (976) T protein:vir:10 366 AVTAESIIGDIRTAIIATGNFTSANVQQIGTGLYVTRPSGTFNVTAPS---SDLLRVMSGEVANVDDLPSQCKHGYVVKV 442 (976) T ss_pred cccHHHHHHHHHHhhcccccccceEEEEcCcEEEEEecCcceEecCCC---ceeEEEEEeeecchhhhhhhccCCcEEEE Confidence 57899999999998865 46788999999999998876 3444333 35789999999999999999999999999 Q ss_pred EccCCCcccceEEEEEcccCCcccceEEEEeeccceeEEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCccc Q lcl|NC_020838. 444 TNSGDVTADDIYVEFQTTNSAARGPGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSF 523 (981) Q Consensus 444 ~~~g~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF 523 (981) ..+++ +.|+||++|+... ++.++++|+||++|+..++++.+||||.|+++++|+|++++++|++|.|||++|||+|+| T Consensus 443 ~~~~~-~~d~yyv~~~~~~-~~~~~~~w~E~~~~g~~~g~~~~tmP~~l~~~~~g~f~~~~~~w~~r~vGd~~tnp~psf 520 (976) T protein:vir:10 443 ANSEA-DADDYYVKFFGHN-NRDGDGVWEECAKPSRNIEFDKGTMPIQLVRQANGTFTVSQATWQNAEVGDELTNPNPSF 520 (976) T ss_pred ecCCC-CceeEEEEeeccc-cccccceEEEeeccccccccccccccEEEEecccCeEEeeeccccccccCCcccCcCcee Confidence 88765 6699999997764 468899999999999999999999999999999999999999999999999999999999 Q ss_pred cCCceeEEEEEcceEEEecCCeEEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEE Q lcl|NC_020838. 524 IGKKINNMFFYRNRLGLLSNEAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFV 603 (981) Q Consensus 524 ~g~~ps~v~ffq~RL~f~s~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~ 603 (981) +|++|++|+||||||||+++++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+ T Consensus 521 ~g~~is~v~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~e~~ 600 (976) T protein:vir:10 521 VGKTINQLVFFRNRLVFLSDENVIMSRPGEFFNFWSKTATTFTPQDVIDLSCSSTYPAIVYDGIQVNAGLLLFTKNQQFM 600 (976) T ss_pred cccccceEEEEcceEEEecCCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeEEEEecCCcEEEEecCceEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEcCCccccccceEEEEEeeeccccCCCcEEeCCeEEEEecCCCeeEEEEEeecccccceehhhHHHHHHHhcCCCeEEE Q lcl|NC_020838. 604 LSTDADILSPTTTKINTISTFECDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEATTNVPEYVPSDIDSM 683 (981) Q Consensus 604 l~g~~~~LTP~~~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~~~~h~~~~~i~~~ 683 (981) |+|++++|||+|++++++|+|+|++.++|+.+|++++|++++|++++++||.|++++++|.+.|||+|++|||++.+..+ T Consensus 601 lsg~~~~lTP~t~~i~~~s~~~~~~~v~Pv~vG~~v~Fv~~~g~~~r~~~~~~~~~~~~~~~~dlt~~~~~l~~g~~~~~ 680 (976) T protein:vir:10 601 LTTDSDILSPETAKINAVSSYNFNEKTHPVSLGTTVAFIDNANQFTRFFEMSNVVRQGEPDVVDQSKVISRLLDKNISLV 680 (976) T ss_pred EecCCceecceeEEEEEEEeeeccCCCccEEeCCeEEEEecCCCeEEEEEEeecccccccchhHHHHHhhhhcCCceEEE Confidence 99987799999999999999999999999999999999999999999999999999999999999999999999999889 Q ss_pred EEcCCCcEEEEEecCCcEEEEEEeecCc-chheeeeEeeccCCceEEEEEeCCeEEEEEEcCCcEEEEEEEeecCCceeE Q lcl|NC_020838. 684 TASPAMSIVSLGKSGSNTVYQHRFFMQG-ENRVQTWYKWQLTGDLRLQFFDKTTFYAVTSSGSNVYLTSYDLTQASESGY 762 (981) Q Consensus 684 ~~s~~~~~~~~~~~g~~~l~~y~y~~~~-eq~V~aWsrw~~~G~v~sv~~~~d~ly~vv~r~~~~~l~~~~~~~~~~~~~ 762 (981) +++.+|..++|++.+++.|++|+|+|.. ||+|+|||||+|+|+|+++|+++|+||++++|+++.+++|+.+........ T Consensus 681 a~~~~~~~vv~~~~~~g~l~~~ty~~~~~eq~v~aWsr~~~~G~v~sv~~i~D~ly~vV~r~~~g~~~r~~e~~~~~~~~ 760 (976) T protein:vir:10 681 SVSRENSVVFFSQKDTDKIYCFRYFTSGEKRLLQAWTTWTITGNIQYHCMLDDALYVVTRNNNKDQIVKYSLKLDDAGHF 760 (976) T ss_pred EEcCCCcEEEEEEcCCCEEEEEEEeecCCceeEEeeEEEecCCcEEEEEEeCCeEEEEEEecCCeEEEEEEEEECCccce Confidence 9999999999999999999999999864 556789999999999999999999999999999999999987765443222 Q ss_pred E-------EecCCCccccccccee--eeeeeeeccCCceEEEcccCCC-cCCceEEEEEcCcccCceeEEEEecCceEEE Q lcl|NC_020838. 763 L-------TLPTGEKTDVCLDMFN--VNPYRTYSTSTKKTTVNLPFDH-ITGKKLAVVAIGTYIGDTISATSESEGSVFY 832 (981) Q Consensus 763 ~-------~~~~~~~~~~~lD~~~--vd~~~ty~~~~~~tt~~l~~~~-l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~ 832 (981) . .....+.++++||++. +..+++|++.+..+++.+|+.. +.++-+.+.+++.. .+.. T Consensus 761 ~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~t~~~~t~~t~~~~~~~~~~~~~~~~~~~d~~~-------------~~~~ 827 (976) T protein:vir:10 761 VTDTQGTTSTDDDSIYRVHLDHSSSVTAASNTYNTTTIKTTIPKPNGYESTKQLVAYDTDAGN-------------DLGR 827 (976) T ss_pred eeeccCccccccCCcceeeeccceEEEeccccccCCceeEEeecCccccCceeEEEEecccCc-------------cccc Confidence 1 1233566778899874 4567899999998888887432 22333333333321 1112 Q ss_pred ccccccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeEEEEEEEEEeecccceEEEEccCCCC Q lcl|NC_020838. 833 FEDSDISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDLILHRLKVSTGLSGPITYKVDITGKD 912 (981) Q Consensus 833 ~~~~tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~ 912 (981) +...++.+++|+|++++++++|||||+|+++++|+||+++.++|.+......+||+|||++|++.+||.|.+.+++.+++ T Consensus 828 ~~~~~v~g~~i~l~g~~~~~~v~VGl~Y~s~~~~~~~~i~~~~g~~~~~~~~gRl~i~r~~~~~~~tg~~~v~v~~~~~~ 907 (976) T protein:vir:10 828 YALVTVSGSNLEIPGNWSNNSFIIGYLYEMDVQLPTLYVTQQVGDKYRSDAKSSLIVHRIKFSFGPLGVYSTTIQRDGKP 907 (976) T ss_pred ceeeeecCCeeEecCCCCCCeEEEeeeeEEEEeecceeEEeCCCCcccccceeeEEEEEEEEEeecccceEEEEcCCCCc Confidence 33456889999999999999999999999999999999999888877777889999999999999999999999998888 Q ss_pred ceeeEecccccCccccCccccccCCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeeccccccC Q lcl|NC_020838. 913 EWTNIINVTLPNTYVLNNVNLSASALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRFYRRS 981 (981) Q Consensus 913 ~~~~~~~~~~~~~~~~~~~p~~~~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~rRr~ 981 (981) .+.........+....+.+|+.++..+++|+.+|+++.+|+|+|++|+||+||+|+|||+||+|+|||- T Consensus 908 ~~~~~~~~~~~~~~~~~~~pl~~~~~~~vP~~~~~~~~~v~i~~d~PlP~tilsi~~eg~yn~r~~r~~ 976 (976) T protein:vir:10 908 DFTETKELGLAGVVGASRLPIVPEVIETVPCYERNTNLKVNVKSEHPAPATLYSLAWEGDFTNRFYKRV 976 (976) T ss_pred cccccccccccCcccccccceecCcEEEEEeccCCceeEEEEEECCCCceEEEEEEEEEEeccceeecC Confidence 777766666667777788999999999999999999999999999999999999999999999999999 No 2 >protein:vir:78703 Length: 905 # NCBI annotation: tail tube B # Family: family:all:825 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285450;genbank:gi:148724484;genbank:GeneID:5220174 Probab=100.00 E-value=6e-239 Score=1326.60 Aligned_cols=861 Identities=30% Similarity=0.483 Sum_probs=688.0 Q ss_pred CCceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCC-CCCceEEEEEeCCCceEEEEEE Q lcl|NC_020838. 1 MSTISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNA-EARGRWFPILRDEEEKYVCQYD 79 (981) Q Consensus 1 m~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~-~~~~~~~~~~rd~~e~y~~~~~ 79 (981) ||+|+||||||++|||||||++|||||++||+||+||||+||+||||++||++|++. .++.+||+|+||+.|+|++++. T Consensus 1 M~~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRp~~~~i~~l~~~~~~~~~~~~~~r~~~e~y~~~~~ 80 (905) T protein:vir:78 1 MGAVLQKIPNLLGGVSQQPDPVKLPGQVREAENVYLDPTFGCRKRPATKFVGELATNLPSDTRWFPIFRDAGERYAVALY 80 (905) T ss_pred CccceecchhhhCceeecchhhcCCcchhhhhccccccccccccCchhhhhhhhcCCCCCCceEEEEEeCCCceEEEEEe Confidence 999999999999999999999999999999999999999999999999999999886 4788999999999999999997 Q ss_pred cCC---CcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCccc---------ccccceEEE Q lcl|NC_020838. 80 TTD---GQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDL---------NSKQATYTK 147 (981) Q Consensus 80 ~~~---g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~---------~~~~~~~~~ 147 (981) +.+ ..|+||||.||++++|++++.. ..||+++ ++++||++||+|||||+|++++ .++..|+++ T Consensus 81 ~~g~~~~~i~v~d~~~G~~~~V~~~~~~----~~yl~~~-~~~~l~~~tv~d~tfi~N~~~~~~~~~~~~~~~~~~~~~~ 155 (905) T protein:vir:78 81 KDGSGNTQVRVWDMQTGAERTVTPDATA----TAYLATT-NLNNLNWLTVADYTLLSNKERIVTMSGASEVDSNQRALVE 155 (905) T ss_pred eCCCCCcceEEEEccCCcEEEEecCCCc----cceeecC-CCcceEEEEEcCEEEEEcCceeeeecCCCCcCCCCeEEEE Confidence 643 2599999999999999986654 3567665 4899999999999999998644 445568888 Q ss_pred EcCCcccccceeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecc Q lcl|NC_020838. 148 TNDGQTATKVNLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDG 227 (981) Q Consensus 148 ~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 227 (981) +|+|||+++|+ +.|.....+... .+..+..+++.. ++. ++. ..| T Consensus 156 v~~g~y~r~y~---v~I~~~~~~~~~-------------------t~~~~~a~~~s~--------~s~---~~~---~~g 199 (905) T protein:vir:78 156 INAISYNTTYS---IDLDRDGASQQV-------------------KVYRAKALEISP--------GSF---EVE---DGG 199 (905) T ss_pred EEeeccceeEE---EEEeCCCCceee-------------------eeeccccceecc--------ccc---ccc---ccc Confidence 88888887644 233222111111 111222121110 000 000 001 Q ss_pred eeEeeeeeeeEEeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeE-EE Q lcl|NC_020838. 228 YRVYEVEKEVAAAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEI-LT 306 (981) Q Consensus 228 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~-~t 306 (981) -..... ++... +.++.+.. +|+| .+ T Consensus 200 ~~~~~~---~~~~~------------------------------------------------~~t~~~~~---~l~f~~~ 225 (905) T protein:vir:78 200 VCTEHD---VQNYT------------------------------------------------NQTIGSST---GLAFQVR 225 (905) T ss_pred cccccc---eeeee------------------------------------------------cceeeccC---CceeEEe Confidence 000000 00000 00111000 1111 12 Q ss_pred EcCEEEEeCCceeEeccCCC-----CCCCCceEEEEEeeeccceEEEEeeCceEEEE----------EeccCCCcceecH Q lcl|NC_020838. 307 INDYTFVLNKNKTTAMKTTT-----SAAVPNVAFVVIRIVAYNSDYSVTLNGTTVTH----------STPDTVAGATTDS 371 (981) Q Consensus 307 ~ad~tfi~n~~~~~~~~~~~-----~~~~~~~~~v~v~~g~y~~~~~v~~ng~~~~~----------~t~~~~a~~~~~~ 371 (981) +++++|+.|.++.....+.. .....+..++. .++|+++|+|.|++..... +++....+..... T Consensus 226 ~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~~~~~~~--v~~~g~~y~i~i~~~~~~~~~~~~~~~~~~t~~d~~a~~~~~ 303 (905) T protein:vir:78 226 VQCAAYLENNEYRSRYNVSVVLQNGGTGFRKGDMIT--VNLNGRDYNIRVTQEEFVYTYASDGTAAHTTPQDSTAGTLDI 303 (905) T ss_pred eccccccCCCcccccccceeeeeccccccccCccEE--EeeccceEEEEEecceeEEEecCCCcccccCccCccCccccH Confidence 24444544444332211110 11111112333 4789999999998865333 2222222334455 Q ss_pred HHHHHHHHhhhhccCceEEEEcCCEEEEEeCCC--ceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCC Q lcl|NC_020838. 372 GSIAAALTSSINALTGFSATQVGPGIYIEGTSA--FSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDV 449 (981) Q Consensus 372 ~~ia~~l~~~i~~~~~~~~~~vg~~i~i~~~~~--~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~ 449 (981) +.|+.+|.+++++.++|++..+|+.|+|.++++ +.+++++|+.++++.+++++|+++++||++||+||+|+|++++++ T Consensus 304 ~~i~~~l~~~~~~~~~~~~~~~g~~i~v~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~~~~g~~v~v~~~~~~ 383 (905) T protein:vir:78 304 GQITAGLVNSVNLISNYSAQAVGNVIEIERTDGRDFNLGVRGGATNRAMTAIKGTANSIVDLPGQCFDGFELKVINTENA 383 (905) T ss_pred HHHHHHHHHhhcccccEEEEecCcEEEEEecCCCccEEEEeccCCcceEEEEeccccccccCccccCCCcEEEEEeCCCC Confidence 799999999999999999999999999988765 678999999999999999999999999999999999999999999 Q ss_pred cccceEEEEEcccCCcccceEEEEeeccceeEEEccccceEEEEeccCCceeeeccc-------CCcccCCCcccCcCcc Q lcl|NC_020838. 450 TADDIYVEFQTTNSAARGPGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYEPVT-------WDDRLVGDNTTNPIPS 522 (981) Q Consensus 450 ~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~~-------w~~r~~GDd~tnp~ps 522 (981) +.|+|||+|+....+..++++|+||++|++.++++.+||||.++|+++|+|+|+.++ |++|.+||++|||+|+ T Consensus 384 ~~d~yyv~~~~~~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~r~~~g~f~~~~~~~~~~~~~~~~r~~Gd~~Tnp~ps 463 (905) T protein:vir:78 384 ESDDYYVVFRSAAEGIPGSGSWEETVAPGIERGFNTSTMPHALIRQADGNFTLEALNDEGTITGWAQREVGDDDTNPKPS 463 (905) T ss_pred CcceEEEEEEecccCCcCceeEEEecccccccccccccccEEEEEecCceEEEEEeccccccccccccccCCcccCCCCc Confidence 999999999998888888999999999999999999999999999999999999987 9999999999999999 Q ss_pred ccCCceeEEEEEcceEEEecCCeEEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEE Q lcl|NC_020838. 523 FIGKKINNMFFYRNRLGLLSNEAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQF 602 (981) Q Consensus 523 F~g~~ps~v~ffq~RL~f~s~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~ 602 (981) |+|++|++|+||||||||++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++|| T Consensus 464 f~g~~is~v~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdDpI~~~~ss~~~~~i~~~v~~~~~L~ifT~g~ef 543 (905) T protein:vir:78 464 FVGRGISDMFFYNNRLGFLSEDAVIMSQPGDYFNFFVTSAITISDSDPIDVTASSTKPAILRAAIGAPKGLILFAENSQF 543 (905) T ss_pred ccCCCcceEEEEcceEEEecCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeEEEeecCCcEEEEecCceE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEcCCccccccceEEEEEeeeccccCCCcEEeCCeEEEEecCCCeeEEEEEeecccccceehhhHHHHHHHhcCCCeEE Q lcl|NC_020838. 603 VLSTDADILSPTTTKINTISTFECDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEATTNVPEYVPSDIDS 682 (981) Q Consensus 603 ~l~g~~~~LTP~~~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~~~~h~~~~~i~~ 682 (981) +|+|++++|||+|++++++|+|+|+++|+|+.+|+++||++++|+|++||||+|++++|+|+++|||+|++|||++++.. T Consensus 544 ~lsg~~~~lTP~s~~i~~~S~~~~~~~v~Pv~vG~~vlFv~~~g~~s~vre~~y~~~~d~y~a~DlT~~a~hl~~g~v~~ 623 (905) T protein:vir:78 544 LLASQEVVFSTATIKLTEISDYFYRSLAKPVSTGVSIAFVSEADTYSKIFEMSIDSVDNRPQVADITRIVPEYVPTGLTW 623 (905) T ss_pred EEecCCccccceeEEEEeEEeecccCCCCcEEeCCeEEEeecCCCeeEEEEEEeeecccceehhHHHHHHHHhcCCceEE Confidence 99997768999999999999999999999999999999999999999999999999999999999999999999999876 Q ss_pred EEEcCCCcEEEEEecCCcEEEEEEeecC-cchheeeeEeeccCCceEEEEEeCCeEEEEEEcC--CcEEEEEEEeecCCc Q lcl|NC_020838. 683 MTASPAMSIVSLGKSGSNTVYQHRFFMQ-GENRVQTWYKWQLTGDLRLQFFDKTTFYAVTSSG--SNVYLTSYDLTQASE 759 (981) Q Consensus 683 ~~~s~~~~~~~~~~~g~~~l~~y~y~~~-~eq~V~aWsrw~~~G~v~sv~~~~d~ly~vv~r~--~~~~l~~~~~~~~~~ 759 (981) ++++ .|..|+|+.++++.||+|+|+|. +||+|+|||||+|+|.++++|++.|.+|++++|. +...++++++....+ T Consensus 624 ~~~s-~~~~~v~~~~~~~~l~~ytyl~~~~eq~v~AWsrw~~~G~~~~~a~i~d~~~~vV~r~~~G~~~~~~~~l~~~~~ 702 (905) T protein:vir:78 624 SVST-PNNSMMLFGDNSNTAYIFKFFNQGNERQVAGWSKWILPGEQRMCGFFADTGYFVLYDSTTGSYVLSAMELLDDPD 702 (905) T ss_pred EEec-CCCcEEEEEcCCCeEEEEEeecCCCceeEEeEEEEecCCCeEEEEEEcCCEEEEEEEccCCeEEEEEEeeccccC Confidence 6554 56667888889999999999987 5566889999999999999999999999999885 445566666655444 Q ss_pred eeEEEecCCCcccccccceeeeeeeeeccCCceE---EEcccCCCcCCceEEEEEcCcccCceeEEEEecCceEEEcccc Q lcl|NC_020838. 760 SGYLTLPTGEKTDVCLDMFNVNPYRTYSTSTKKT---TVNLPFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDS 836 (981) Q Consensus 760 ~~~~~~~~~~~~~~~lD~~~vd~~~ty~~~~~~t---t~~l~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~ 836 (981) ...++. ....+..++|.+..++.++|...+..+ ....++.|+.++.+.+.+||...+ .+.++ T Consensus 703 ~~~~d~-~~~~~~~~~d~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dG~~~~--------------~~~~~ 767 (905) T protein:vir:78 703 SASIDT-AFSSFLPRLDNYVVKSDLTVVDNGDGTLTVDLEAGQAMTGATPVIMFTDGPSEF--------------AFSQP 767 (905) T ss_pred cccccc-ceeeeeeccceeeecccceecccCcceEeeeccCccccccceeEEEeeCCceee--------------eEEEE Confidence 433321 112244567888888888875443321 122344566666655566665322 22333 Q ss_pred ccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeEEEEEEEEEeecccceEEEEccCCCCceee Q lcl|NC_020838. 837 DISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDLILHRLKVSTGLSGPITYKVDITGKDEWTN 916 (981) Q Consensus 837 tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~ 916 (981) ++.++.++++ ++++|+|||+|+++++|+||+++.++++. ..+|++|+|++|+|++||+|.++|++.+++.+.. T Consensus 768 ~~~~~~~t~~---~a~~v~VGl~Y~s~v~~~p~~~~~~~~s~----~~~~~rI~rv~lr~~~Sg~~~v~v~~~~~~~~~~ 840 (905) T protein:vir:78 768 TITAGQFTVD---TTDDFVVGFKYETKITLPGFFTSEENKAD----RVYAPIVEFLYLDLYYSGRYQIEVDRIGYDTINI 840 (905) T ss_pred Eeeceeeccc---cCCeEEEeeeeeEEEeecceEeccCCCcc----cccceEEEEEEEEeecceeEEEEEcCCCcceecc Confidence 4556666653 57889999999999999999998766543 3578999999999999999999999999998888 Q ss_pred EecccccCccccCccccccCCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeeccccccC Q lcl|NC_020838. 917 IINVTLPNTYVLNNVNLSASALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRFYRRS 981 (981) Q Consensus 917 ~~~~~~~~~~~~~~~p~~~~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~rRr~ 981 (981) ....+..+.++.+.+|+..++.+++|+.+|+++.+|+|+|++|+||+||+|+|||+||+|+|||- T Consensus 841 ~~~~~~~~~~~~~~~p~~~tg~~~vP~~g~~~~~~v~I~sd~PlP~tvlsi~weg~Yn~r~~~~~ 905 (905) T protein:vir:78 841 DAGSIDANIYLADGAPLKEIATENVPLFTPGDQVTVTIKAPDPFPSAITGYSWQGHYNRRGIAFI 905 (905) T ss_pred cccceecCcccCcccccccccEEEEEeeccCceeEEEEEECCCCcEEEEEEEEEEEeccceeecC Confidence 87778888888888889899999999999999999999999999999999999999999999999 No 3 >protein:vir:10452 Length: 794 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848299;genbank:gi:30387490;genbank:GeneID:1733952 Probab=100.00 E-value=9.5e-220 Score=1221.35 Aligned_cols=766 Identities=27% Similarity=0.424 Sum_probs=657.6 Q ss_pred CCceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCCC---CceEEEEEeCCCceEEEE Q lcl|NC_020838. 1 MSTISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAEA---RGRWFPILRDEEEKYVCQ 77 (981) Q Consensus 1 m~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~~---~~~~~~~~rd~~e~y~~~ 77 (981) ||+|+|+||||++|||||||++|+|||+++|+||+|||+.||+||||++||++|..... ...+|+|+||+.|+|+++ T Consensus 1 M~~i~~s~~n~~~GvSqq~D~~Ry~~q~~~~~N~~~~~~gG~~rRpGt~fv~~l~~~~~~~~~~~~~~~~rd~~e~~~v~ 80 (794) T protein:vir:10 1 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLKTLGDNGALGQAPYIHLINRDENEQYYAV 80 (794) T ss_pred CcceeeecchhhcccccCCchHHhhhhHhhhhcceeeeccCcccCcchhhheeccCCCccccceeeeEEecCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999976543 446899999999999999 Q ss_pred EEcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccccc Q lcl|NC_020838. 78 YDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTATKV 157 (981) Q Consensus 78 ~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~~~ 157 (981) +.+. .|+|||+ +|++++|+++++ .+|+.+++++ T Consensus 81 ~~~~--~irv~~~-~G~~~~v~~~~~-----~~Y~~aa~~~--------------------------------------- 113 (794) T protein:vir:10 81 FTGT--GIRVFDL-AGNEKQVRYPNG-----SNYIKTANPR--------------------------------------- 113 (794) T ss_pred EeCC--eEEEEEc-CCcEEEEEcCCC-----CcceecCCCc--------------------------------------- Confidence 8653 4999999 599988753322 1122222222 Q ss_pred eeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeeeee Q lcl|NC_020838. 158 NLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEKEV 237 (981) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 237 (981) T Consensus 114 -------------------------------------------------------------------------------- 113 (794) T protein:vir:10 114 -------------------------------------------------------------------------------- 113 (794) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EEeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCc Q lcl|NC_020838. 238 AAAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKN 317 (981) Q Consensus 238 ~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~ 317 (981) ++||++|+||+|||+|++ T Consensus 114 --------------------------------------------------------------~~l~~~q~aD~~fivn~~ 131 (794) T protein:vir:10 114 --------------------------------------------------------------SDLRMVTVADYTFIVNRN 131 (794) T ss_pred --------------------------------------------------------------ceEEEEEEcCEEEEEcCC Confidence 246667777777777777 Q ss_pred eeEeccCCCC----CCCCceEEEEEeeeccceEEEEeeCceE-EEEEeccCCCc---ceecHHHHHHHHHhhhh-ccCce Q lcl|NC_020838. 318 KTTAMKTTTS----AAVPNVAFVVIRIVAYNSDYSVTLNGTT-VTHSTPDTVAG---ATTDSGSIAAALTSSIN-ALTGF 388 (981) Q Consensus 318 ~~~~~~~~~~----~~~~~~~~v~v~~g~y~~~~~v~~ng~~-~~~~t~~~~a~---~~~~~~~ia~~l~~~i~-~~~~~ 388 (981) ++|++..... ..+...+++++++|+|+++|++.+++.. +.+++|++.+. ..+++++|+.+|..++. +.++| T Consensus 132 ~~~~~~~~~~~~~~~~~~~~~~~~v~~g~y~r~y~i~i~~~~~at~~tpdgt~~~~~~~~s~~~ia~~L~~~l~a~~~g~ 211 (794) T protein:vir:10 132 VVVQKDPNSVNLANYNPKQDGLINIRGGQYGRELIVHINGKDVATYKIPDGSKPEHVNNTDAQWLAERLAKQMRINLSGW 211 (794) T ss_pred eeeeeeccccccCCCCCCccEEEEecccccceEEEeccCCcceeEEEecCCCCcccceecchhhhhhhhhhhhhcccCCc Confidence 7777544432 2345678999999999999999999864 55788887753 45778999999998874 56899 Q ss_pred EEEEcCCEEEEEeCCCc---eEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCc Q lcl|NC_020838. 389 SATQVGPGIYIEGTSAF---SISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAA 465 (981) Q Consensus 389 ~~~~vg~~i~i~~~~~~---~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~ 465 (981) ++.+.|++++|.++++. ++++.++..++++.++.++++.+++||+.|++|++|+|+++++++.+.|||+|+... T Consensus 212 t~~~~g~~i~i~a~s~~~~~t~s~~~~~~~~~~~~v~~~~~~~~~lp~~~~~G~~v~i~~~~~~~~~~yyv~~~~~~--- 288 (794) T protein:vir:10 212 TVNVGQGFIHVTAPSGQQIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAER--- 288 (794) T ss_pred eEEeCCeEEEEEeccCceeccccccCCcCcceeEEEEeccCcceecccCCCCCcEEEEEeCCCCCcceeEEEEEcCC--- Confidence 99999999999987754 456778888999999999999999999999999999999999999999999998754 Q ss_pred ccceEEEEeeccceeEEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEecCCe Q lcl|NC_020838. 466 RGPGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLLSNEA 545 (981) Q Consensus 466 ~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~s~~~ 545 (981) ++|+||++|+...+++.+||||.++|+++++|+++.++|++|.+||+++||+|+|+|++|++|+||||||+|+++++ T Consensus 289 ---~~w~E~~~~g~~~~~~~~tmP~~l~r~~~~t~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~ 365 (794) T protein:vir:10 289 ---KVWTETLGWNTENQVLLETMPHALVRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLSGEN 365 (794) T ss_pred ---cEEEEecccceeEEEecccceeEEEEeccceEEeeecccccccccccccCccCcccCCCccEEEEEcceEEEeeCCe Confidence 37999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEeeec Q lcl|NC_020838. 546 VIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTISTFE 625 (981) Q Consensus 546 V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S~~~ 625 (981) |||||+||||||+++|++++.|||||+++++++++++|+|+++++++|+|||+++||+|+|+ ++|||+|++++++|+|+ T Consensus 366 v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~i~T~~~q~~l~~~-~~lTP~~~~~~~~s~~~ 444 (794) T protein:vir:10 366 IILSRTAKYFNFYPASIANLSNDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTAS-GTLTSRSVELNLTTQFD 444 (794) T ss_pred EEEEecCCcccccccccccCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCC-CcccceeEEEEEEEeec Confidence 99999999999999999999999999999999999999999999999999999999999986 49999999999999999 Q ss_pred cccCCCcEEeCCeEEEEecCCCeeEEEE-EeecccccceehhhHHHHHHHhcCCCeEEEEEcCCC-cEEEEEecCCcEEE Q lcl|NC_020838. 626 CDAEIDAVAVGTTQAFISKSNLYSKLFL-MLNVQKEAAATIDEATTNVPEYVPSDIDSMTASPAM-SIVSLGKSGSNTVY 703 (981) Q Consensus 626 ~s~~v~Pv~vG~~v~Fv~~~g~~s~vre-~~y~~~~d~~~a~DlS~~~~h~~~~~i~~~~~s~~~-~~~~~~~~g~~~l~ 703 (981) |++.++|+.+|++++|++++|++++++| |.|++++|+|+++|||+|++|||++++..+++++.+ ..++|+..+++.|+ T Consensus 445 ~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~y~a~Dlt~~~~hl~~~~v~~~~~~~~~~~~~~~~~~~~~~l~ 524 (794) T protein:vir:10 445 VQDRARPYGIGRNVYFASPRSSYTSIHRYYAVQDVSSVKNSEDITSHVPNYIPNGVFSICGSGTENFCSVLSHGDPSKIF 524 (794) T ss_pred ccCCCCceEeCCeEEEEecCCCeeEEEEEeeeccccCceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCEEE Confidence 9999999999999999999999988765 568899999999999999999999999988776655 46667777789999 Q ss_pred EEEeecCcchh-eeeeEeeccCCceEEEEE--eCCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCccccccccee- Q lcl|NC_020838. 704 QHRFFMQGENR-VQTWYKWQLTGDLRLQFF--DKTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFN- 779 (981) Q Consensus 704 ~y~y~~~~eq~-V~aWsrw~~~G~v~sv~~--~~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~- 779 (981) +|+|+|..+|| |+|||||+|+|.|+++|+ .+|+||++|+|+++.+++|+.+.....+. ..+.++.+||+.. T Consensus 525 ~~~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~iv~r~~~~~~~r~~~~~~~~~~-----~~~~~~~~lD~~~~ 599 (794) T protein:vir:10 525 MYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSDMYVILRNEFNTFLARISFTKNAIDL-----QGEPYRAFMDMKIR 599 (794) T ss_pred EEEEeecCCceEEEeEEEEEcCCcEEEEEEEecCCeEEEEEEeCCCEEEEEEEEeecCCCC-----CCccceeeeecceE Confidence 99999876554 679999999999999986 48999999999999999999876554322 2345566777653 Q ss_pred -eeeeeeeccCCceEEEcc----cCCCcCCceEEEEEcCcccCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEE Q lcl|NC_020838. 780 -VNPYRTYSTSTKKTTVNL----PFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDL 854 (981) Q Consensus 780 -vd~~~ty~~~~~~tt~~l----~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v 854 (981) ..++++|++++..|++.+ +++|++|+++.+++||......... . ......+++|++++++++| T Consensus 600 ~~~~~~~~~~~~~~t~~~~~~~~g~~~~eg~~v~~~adg~~~~~~~~~-~-----------~~~g~~~l~i~~~~~a~~v 667 (794) T protein:vir:10 600 YTIPSGTYNDDTFTTSIHIPTIYGANFGRGKITVLEPDGKITVFEQPT-S-----------GWQSDPWLRLSGNLEGREV 667 (794) T ss_pred EEecCcccccccccceEEcccccCcccccccEEEEecCCceeeeeeee-e-----------eeecceEEEecCCCCCceE Confidence 235678888888777644 5789999999999998644322211 0 0122367899999999999 Q ss_pred EEEEeeeEEEEeCCceeeccCCCcccc-eEeeeEEEEEEEEEeecccceEEEEccCCCCceeeEecccccCccc-cCccc Q lcl|NC_020838. 855 IIGYVYDMELELPTLYPTQVEGRSSVS-DVTSDLILHRLKVSTGLSGPITYKVDITGKDEWTNIINVTLPNTYV-LNNVN 932 (981) Q Consensus 855 ~VGl~y~s~v~~~~~~i~~~~g~~~~~-~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~~~~-~~~~p 932 (981) +|||+|+++++|+||++++++|+.... +..+||||+|+++++.+||+|.+.|++.+++....+.+.++..... .+.+| T Consensus 668 ~vGl~y~s~~~~~~~~i~~~~~~~~~~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~ 747 (794) T protein:vir:10 668 FIGFNINFVYEFSKFLIKQTTDDGSTSTEDIGRLQLRRAWVNYEDSGTFDIYVENQSSNWKYTMAGARLGSNTLRAGRLN 747 (794) T ss_pred EEeeeeeEEEEecceEEEccCCCcceeeeccccEEEEEEEEEeeccccEEEEEcCCccccceeeccceeccccccccccc Confidence 999999999999999999999876654 5679999999999999999999999998887666666666665544 45555 Q ss_pred cccCCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeecccccc Q lcl|NC_020838. 933 LSASALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRFYRR 980 (981) Q Consensus 933 ~~~~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~rRr 980 (981) + .++.+++|+.+|+++.+|+|+|++|+||+|++|+|||+||+|.||= T Consensus 748 ~-~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~eg~y~~r~~~v 794 (794) T protein:vir:10 748 L-GTGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEGNYLRRSSGI 794 (794) T ss_pred c-ccceEEEEecccCceEEEEEEECCCCceEEEEEEEEEEEeccccCC Confidence 5 4567899999999999999999999999999999999999999988 No 4 >protein:vir:3366 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523337;swissprot:trembl:q8w5u3;genbank:gi:17570828;goa:Q8W5U3;uniprot:Q8W5U3;genbank:GeneID:927453 Probab=100.00 E-value=1.1e-218 Score=1215.49 Aligned_cols=764 Identities=26% Similarity=0.397 Sum_probs=657.9 Q ss_pred CCceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCC---CCCceEEEEEeCCCceEEEE Q lcl|NC_020838. 1 MSTISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNA---EARGRWFPILRDEEEKYVCQ 77 (981) Q Consensus 1 m~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~---~~~~~~~~~~rd~~e~y~~~ 77 (981) ||+|+|+||||++|||||||++|+|||+++|+||+|||+.||+||||++||++|... .....+|+|+|++.|+|++. T Consensus 1 M~~i~~~~~nl~~GvSqq~d~~r~~~q~~~~~N~~~~~~gG~~rRpGt~~va~~~~~~~~~~~~~~~~~~r~~~~~y~l~ 80 (801) T protein:vir:33 1 MALISQSIKNLKGGISQQPDILRFTEQGSVQINGWSSESEGIQKRPPMIHLKTLGTAGYVGAQPYVHLINRDEFEQYFVV 80 (801) T ss_pred CceeEeeccceecceeccchhHhhhhhHhhhhcceeecccCcccCchhHhhhhhcCCCccccceEEEEEEeCCceEEEEE Confidence 999999999999999999999999999999999999999999999999999998765 34678999999999999998 Q ss_pred EEcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccccc Q lcl|NC_020838. 78 YDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTATKV 157 (981) Q Consensus 78 ~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~~~ 157 (981) +. ++.|+|||+ +|++++|+... .|+.+++++++|++++++|||||+|++.. T Consensus 81 ~~--~~~irv~~~-~G~~~~v~~~~-------~y~~~~~~~~~l~~~t~aD~~fi~nr~~~------------------- 131 (801) T protein:vir:33 81 FT--GEDIKVFDL-DGKEYQVRGDR-------SYVRTANPREDLRMVTVADYTFVTNRKVV------------------- 131 (801) T ss_pred Ec--CCeEEEEcc-CCcEEEEecCC-------cceeecCcchheEEEEEcCEEEEeeCCee------------------- Confidence 85 577999998 69999987433 47788888889999999999999996210 Q ss_pred eeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeeeee Q lcl|NC_020838. 158 NLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEKEV 237 (981) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 237 (981) T Consensus 132 -------------------------------------------------------------------------------- 131 (801) T protein:vir:33 132 -------------------------------------------------------------------------------- 131 (801) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EEeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCc Q lcl|NC_020838. 238 AAAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKN 317 (981) Q Consensus 238 ~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~ 317 (981) T Consensus 132 -------------------------------------------------------------------------------- 131 (801) T protein:vir:33 132 -------------------------------------------------------------------------------- 131 (801) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eeEeccC----CCCCCCCceEEEEEeeeccceEEEEeeCceE-EEEEeccCCCcc---eecHHHHHHHHHhhhhc----- Q lcl|NC_020838. 318 KTTAMKT----TTSAAVPNVAFVVIRIVAYNSDYSVTLNGTT-VTHSTPDTVAGA---TTDSGSIAAALTSSINA----- 384 (981) Q Consensus 318 ~~~~~~~----~~~~~~~~~~~v~v~~g~y~~~~~v~~ng~~-~~~~t~~~~a~~---~~~~~~ia~~l~~~i~~----- 384 (981) |++.+ ...+.....+++++++++|+++|+|.++|.. +.++++++..+. .++.+.+|..|..++.+ T Consensus 132 --p~~~~~~~~~~~~~~~~~~li~v~~~~yg~t~~I~i~gs~~~~~~~~~gs~~~~v~~~s~~~~A~~l~~~~~~~~~~~ 209 (801) T protein:vir:33 132 --VQSNDQSVNLPGFKDQGDALINVRGGQYGRRLSIEFNGAERAAVQLPDGSQPAHVNEVDGQAIAEKLAAQLRNNLGNP 209 (801) T ss_pred --ecccCCcccccccCCCcceEEEEeecccceEEEEEECCcceEEEEeeccccccccccccchhhhhhhhhhhhccCccc Confidence 00000 0001122356899999999999999999964 557888766543 45667778777766533 Q ss_pred -----cCceEEEEcCCEEEEEeCCC---ceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEE Q lcl|NC_020838. 385 -----LTGFSATQVGPGIYIEGTSA---FSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYV 456 (981) Q Consensus 385 -----~~~~~~~~vg~~i~i~~~~~---~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv 456 (981) ..+|.....+..+++..++. +.+++++|.+++.+.++.++|+.+++||.++++|++|+|+++++++.++||| T Consensus 210 ~~~~~~~~w~~~~~~g~~~i~~p~~~~~~~itt~~g~~~~~~~~~~~~v~~~~~lp~~~~~g~~v~v~~~~~~~~~~y~v 289 (801) T protein:vir:33 210 NNDQDPNKWRFNVGPGFIHILAPNNDNVWGLQTKDGYADQLINPVTHYTQSFQKLPINAPDGYIVKIVGDTSKTADQYYV 289 (801) T ss_pred eeeecCceEEEEecCeEEEEecCCCcccccccccCCccceeEEEEeecccceeeeeeecCCCcEEEEEecCCCcccceEE Confidence 23566666677778876665 4578899999999999999999999999999999999999999999999999 Q ss_pred EEEcccCCcccceEEEEeeccceeEEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCCceeEEEEEcc Q lcl|NC_020838. 457 EFQTTNSAARGPGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRN 536 (981) Q Consensus 457 ~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~ 536 (981) +|+... ++|+||++|+...+++.+||||.+++.++++|+++.++|++|.+||+++||+|+|+|++|++|+|||| T Consensus 290 ~~~~~~------~~w~e~~~~g~~~~~~~~tmp~~l~~~~~~tf~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~f~q~ 363 (801) T protein:vir:33 290 RFDLNR------KVWVETIGWNTRTHLHYHTMPWALVRASDGNFDFKYLEWGARTVGDDTTNPYPSFTGQTINDIFFFRN 363 (801) T ss_pred EEEcCC------cEEEEeeccccceeeeecccceEEEEccCceEEecccCccccccCCccccCcccccCCCceEEEEEcc Confidence 998754 47999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEecCCeEEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccce Q lcl|NC_020838. 537 RLGLLSNEAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTT 616 (981) Q Consensus 537 RL~f~s~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~ 616 (981) ||||+++++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|+ ++|||+|+ T Consensus 364 RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~-~~lTP~~~ 442 (801) T protein:vir:33 364 RLGFLSGENIILSRTSKYFNFFPASVSNYSDDDPIDVAVSHDRVSTLKYAVPFSEELLLWSDQAQFVLTAS-DILSSRSV 442 (801) T ss_pred eEEEeeCCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeCC-CcccceeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999986 59999999 Q ss_pred EEEEEeeeccccCCCcEEeCCeEEEEecCCCeeEEEE-EeecccccceehhhHHHHHHHhcCCCeEEEEEcCCCcEEEEE Q lcl|NC_020838. 617 KINTISTFECDAEIDAVAVGTTQAFISKSNLYSKLFL-MLNVQKEAAATIDEATTNVPEYVPSDIDSMTASPAMSIVSLG 695 (981) Q Consensus 617 ~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre-~~y~~~~d~~~a~DlS~~~~h~~~~~i~~~~~s~~~~~~~~~ 695 (981) +++++|+|+|++.++|+.+|+++||++++|+|++++| |.|++++|+|+++|||+|++|||++++..|++++++++++|. T Consensus 443 ~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~v~r~~~~~~~~d~y~~~Dlt~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (801) T protein:vir:33 443 GLNLTTQFDVQDRARPHGVGRNVYFSSPRASFTSINRYYAVQDVSSVKNAEDMTAHVPNYIPNGVFSISGTTAENFVAIL 522 (801) T ss_pred EEEEEEeecccCCCCceEecCeEEEEecCCCeeEEEEEEeecccccceehhhHHHHHHHhcCCceEEEEEcCCCCeEEEE Confidence 9999999999999999999999999999999987755 677888999999999999999999999999999999988876 Q ss_pred ecC-CcEEEEEEeecCcchh-eeeeEeeccCCceEEEEE--eCCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCcc Q lcl|NC_020838. 696 KSG-SNTVYQHRFFMQGENR-VQTWYKWQLTGDLRLQFF--DKTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKT 771 (981) Q Consensus 696 ~~g-~~~l~~y~y~~~~eq~-V~aWsrw~~~G~v~sv~~--~~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 771 (981) .++ ++.|++|+|+|+.+|| |+|||||+|+|.++++|+ .+|+||++|+|++++++|++++.....+. ..+.+ T Consensus 523 ~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~vv~r~~~~~le~~~~~~~~~d~-----~~~~~ 597 (801) T protein:vir:33 523 TSGAPNRVYIYKFLYIDEEIRQQSWSHWDFGDNVTVFAAQVINSTMTVLMSNEHAVWMGRLHFTKDSIDL-----PGEPY 597 (801) T ss_pred EecCCCEEEEEEEecCCCceEEEeeEEEEcCCCEEEEEEecCCCEEEEEEEcCCcEEEEEEEEeeccccC-----CCccc Confidence 664 5899999999976555 679999999999988875 79999999999999999999876543221 23456 Q ss_pred cccccce--eeeeeeeeccCCceEEEcc----cCCCcCCceEEEEEcCcccCceeEEEEecCceEEEccccccCCCEEEe Q lcl|NC_020838. 772 DVCLDMF--NVNPYRTYSTSTKKTTVNL----PFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLL 845 (981) Q Consensus 772 ~~~lD~~--~vd~~~ty~~~~~~tt~~l----~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL 845 (981) +.+||+. ++.++++|+.++..+++.+ +++|++|+++.+++||........ . .....+.++++ T Consensus 598 ~~~lD~~~~~~~~~~~~~~~~~~t~~~~~~~~gl~~~eg~~v~~~~dG~v~~~~~~------~------~~~~~~~~l~i 665 (801) T protein:vir:33 598 RLYIDAKRKYTIPAGTYNDDTYQTSISLSTIYGMNFTKGRVSVVFPDGKIVEIDQP------I------NGWSSDPMLRL 665 (801) T ss_pred eEEeecceEEEecccceecCccccccccccccCCccccceEEEEEeCCceEeeeec------c------ccccCceeEEe Confidence 6778864 5678999999888877654 689999999999999864322110 0 01124577899 Q ss_pred CCCCCCCEEEEEEeeeEEEEeCCceeeccCCC-cccceEeeeEEEEEEEEEeecccceEEEEccCCCCceeeEecccccC Q lcl|NC_020838. 846 NGDYRGRDLIIGYVYDMELELPTLYPTQVEGR-SSVSDVTSDLILHRLKVSTGLSGPITYKVDITGKDEWTNIINVTLPN 924 (981) Q Consensus 846 ~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~-~~~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~ 924 (981) ++++++++|+|||+|+++++|+||+++.++|+ +.+++..+||||||++|++++||+|.+.|++.+++......+.++.. T Consensus 666 ~~~~~~~~v~vGl~y~s~~~~~~~~~~~~~~~~~~~~~~~~r~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~ 745 (801) T protein:vir:33 666 DGNQEGQVVYIGFNIPFTYTFSKFLIKKTAEDGSTATEDIGRLQLRRAWVNYEDSGAFIIRVNNLSREFIYTMAGARLGS 745 (801) T ss_pred cCCCCCCEEEEeeeeeEEEEeCceEEeccCCCCceeeeeeccEEEEEEEEEeecCcceEEEECCcccceeeeeccccccc Confidence 99999999999999999999999999988865 44557789999999999999999999999998887665555666555 Q ss_pred cc-ccCccccccCCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeecccccc Q lcl|NC_020838. 925 TY-VLNNVNLSASALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRFYRR 980 (981) Q Consensus 925 ~~-~~~~~p~~~~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~rRr 980 (981) .+ ..+.+|+. ++.+++|+.+|+++.+|+|+|++||||+||+|+|||+||+|.||= T Consensus 746 ~~~~~~~~~~~-tg~~~vp~~g~~~~~~v~i~~d~P~P~tvl~i~~eg~y~~r~~~~ 801 (801) T protein:vir:33 746 DNLRVGGSNIG-TGQYRFPVVGNAQTNTVTIESDASTPLNIIGCGWEGNYLRRSSGI 801 (801) T ss_pred ccccccccccc-cceEEEEeeccCceEEEEEEeCCCCCEEEEEEEEEEEEeccccCC Confidence 44 45556654 557899999999999999999999999999999999999999988 No 5 >protein:vir:94713 Length: 785 # NCBI annotation: tail tube # Family: family:all:825 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338122;genbank:gi:77118200;genbank:GeneID:3707736 Probab=100.00 E-value=2.1e-217 Score=1208.54 Aligned_cols=760 Identities=26% Similarity=0.419 Sum_probs=661.7 Q ss_pred CCceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCC-CCCceEEEEEeCCCceEEEEEE Q lcl|NC_020838. 1 MSTISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNA-EARGRWFPILRDEEEKYVCQYD 79 (981) Q Consensus 1 m~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~-~~~~~~~~~~rd~~e~y~~~~~ 79 (981) ||+|+|+||||++|||||||++|+|||+++|+||+|||+.||+||||++||++|... ..+.++|+++|++.|+|++.+. T Consensus 1 M~~~~~s~~n~~~GvSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~~v~~l~~~~~~~~~~~~f~~~~~~~y~l~~~ 80 (785) T protein:vir:94 1 MPLITQSIKNLKGGISQQPDILRFSDQGEAQVNCWSSESDGLQKRPPTVFKRRLNIDVGSNPKFHLINRDEQEQYYIVFN 80 (785) T ss_pred CcceeeecchhhcceecCCchHHhhhHHhhhhcceeeeccCcccCChhHhhhcccCCCCcCcEEEEEEeCCCceEEEEEc Confidence 999999999999999999999999999999999999999999999999999998654 4567899999999999999994 Q ss_pred cCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCccccccee Q lcl|NC_020838. 80 TTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTATKVNL 159 (981) Q Consensus 80 ~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~~~~~ 159 (981) +|.|+|||+ +|+++.|+. ...|+.+++++ T Consensus 81 --~~~irv~~~-~G~~~~v~~-------~~~y~~~~~~~----------------------------------------- 109 (785) T protein:vir:94 81 --GSNIQIVDL-SGNQYSVSG-------SVDYVKSSNPR----------------------------------------- 109 (785) T ss_pred --CCeEEEEec-CCcEEEEec-------CCCceeecCch----------------------------------------- Confidence 577999998 588876530 01111111111 Q ss_pred eEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeeeeeEE Q lcl|NC_020838. 160 FDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEKEVAA 239 (981) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 239 (981) T Consensus 110 -------------------------------------------------------------------------------- 109 (785) T protein:vir:94 110 -------------------------------------------------------------------------------- 109 (785) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCcee Q lcl|NC_020838. 240 AYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKNKT 319 (981) Q Consensus 240 a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~~~ 319 (981) .||+++|+||+|||+|++++ T Consensus 110 ------------------------------------------------------------~~l~~~q~aD~~fi~n~~~~ 129 (785) T protein:vir:94 110 ------------------------------------------------------------DDIRVVTVADYTFVVNRKVV 129 (785) T ss_pred ------------------------------------------------------------hheeeEeeCCEEEEEcCCcc Confidence 24778888888888888888 Q ss_pred EeccCC---CCCCCCceEEEEEeeeccceEEEEeeCceE-EEEEeccCCCc----ceecHHHHHHHHHhhhhc-cCceEE Q lcl|NC_020838. 320 TAMKTT---TSAAVPNVAFVVIRIVAYNSDYSVTLNGTT-VTHSTPDTVAG----ATTDSGSIAAALTSSINA-LTGFSA 390 (981) Q Consensus 320 ~~~~~~---~~~~~~~~~~v~v~~g~y~~~~~v~~ng~~-~~~~t~~~~a~----~~~~~~~ia~~l~~~i~~-~~~~~~ 390 (981) |++... ...+.++.+++++++++|+++|++.+||.. ++++++++.++ ..++.+.|+++|..++.. ..+|+. T Consensus 130 ~~~~~~~~~~~~~~~~~~~~~i~~g~y~~~y~i~i~g~~~at~~t~~~s~a~~s~~~~s~~~i~~~l~~~l~a~~t~~t~ 209 (785) T protein:vir:94 130 VKGGSEKSHSGYNRKARALINLRGGQYGRTLKVGINGGVKVSHKLPAGNDAENDPPKVDAQAIGAALRDLLVTAYPTFTF 209 (785) T ss_pred eeeeeccCCcCCCCCCceEEEecccccceeEEEeeCCcceeEEEEccCccccccccccchHHHHHHHHHHhhccccceeE Confidence 876544 345567789999999999999999999954 66888876653 457788999999888754 568999 Q ss_pred EEcCCEEEEEeCCC---ceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCccc Q lcl|NC_020838. 391 TQVGPGIYIEGTSA---FSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAARG 467 (981) Q Consensus 391 ~~vg~~i~i~~~~~---~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g 467 (981) .+.++++++.+++. +.+++++|.+++.+..++++|+++++||.+|++|++|+|++++++++++||++|+... T Consensus 210 ~~~g~~i~i~a~s~t~~~~~s~~~~~~~t~~~~~~~~~~~~~~Lp~~~~~G~~v~v~~~~~~~~~~y~v~~~~~~----- 284 (785) T protein:vir:94 210 DLGSGFLLITAPSGTDINSVETEDGYANQLISPVLDTVQTISKLPLAAPNGYIIKIQGETNSSADEYYVMYDSNT----- 284 (785) T ss_pred EecCcEEEEEecCCccccceeeecccCCeEEEEEEeeccceeccccccCCCCEEEEEccCCCCccceEEEEEcCC----- Confidence 99999999987654 5678999999999999999999999999999999999999999999999999999864 Q ss_pred ceEEEEeeccceeEEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEecCCeEE Q lcl|NC_020838. 468 PGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLLSNEAVI 547 (981) Q Consensus 468 ~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~s~~~V~ 547 (981) ++|+||++|++..+++.++|||.+++.++++|++++++|++|.+||+++||+|+|+|++|++|+||||||||+++++|| T Consensus 285 -g~w~e~~~~g~~~~~~~~tmp~~l~~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~ 363 (785) T protein:vir:94 285 -KTWKETVEPGVVTGFDNTTMPHALVRQSDGSFEFKALDWSKRGAGNDDTNPMPSFVDATINDVFFYRNRLGFLSGENVI 363 (785) T ss_pred -ceEEEecccceeeeeeccccceEEEeccCCceEEeccccccccCCCcccCCcceecccccceEEEEeceEEEecCCeEE Confidence 3799999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEeeeccc Q lcl|NC_020838. 548 MSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTISTFECD 627 (981) Q Consensus 548 ~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S~~~~s 627 (981) |||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|+ ++|||+|++++++|+|+|+ T Consensus 364 ~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~l~T~~~e~~l~~~-~~lTP~~~~~~~~s~~~~~ 442 (785) T protein:vir:94 364 MSRSASYFAFFPKSVATLSDDDPIDVAVSHPRISILKYAVPFSEQLLLWSDEVQFVMTSS-GVLTSKSIQLDVGSEFALG 442 (785) T ss_pred EEccCCcccCccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEcCC-CcccceeEEEEEEEeeecc Confidence 999999999999999999999999999999999999999999999999999999999986 4999999999999999999 Q ss_pred cCCCcEEeCCeEEEEecCCCeeEEEEE-eecccccceehhhHHHHHHHhcCCCeE-EEEEcCCCcEEEEEecCCcEEEEE Q lcl|NC_020838. 628 AEIDAVAVGTTQAFISKSNLYSKLFLM-LNVQKEAAATIDEATTNVPEYVPSDID-SMTASPAMSIVSLGKSGSNTVYQH 705 (981) Q Consensus 628 ~~v~Pv~vG~~v~Fv~~~g~~s~vre~-~y~~~~d~~~a~DlS~~~~h~~~~~i~-~~~~s~~~~~~~~~~~g~~~l~~y 705 (981) +.++|+.+|++++|++++|++++++|+ .|++++|+|+++|||+|++|||++++. .++++.+|..++|+..+++.|++| T Consensus 443 ~~~~Pv~vg~~v~f~~~~g~~~~v~r~~~~~~~~d~y~~~dlt~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~g~l~~~ 522 (785) T protein:vir:94 443 DNARPFAVGRSVFFSAPRGSFTSIKRYFAVADVSDVKDADDTTGHVLSYIPNGVFDIQGTGTENYICVNSTGAYNRIYIY 522 (785) T ss_pred CCCCceEeCCeEEEEecCCCeeEEEeeeeecccccceehhhHHHHHHHhcCCCcEEEEEecCCCcEEEEEEcCCCEEEEE Confidence 999999999999999999999887666 667779999999999999999999876 456777888999999999999999 Q ss_pred EeecCc-chheeeeEeeccCC--ceEEEEEeCCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCcccccccceee-- Q lcl|NC_020838. 706 RFFMQG-ENRVQTWYKWQLTG--DLRLQFFDKTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFNV-- 780 (981) Q Consensus 706 ~y~~~~-eq~V~aWsrw~~~G--~v~sv~~~~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~v-- 780 (981) +|+|+. ||+|+|||||+|+| .++++|+++|+||++++|.++.+++++++.....+ ...+.++.+||+... T Consensus 523 ~y~~~~~e~~v~aW~r~~~~~~~~~~~~~~~~d~~~~vv~r~~g~~~~~ie~~~~~~d-----~~~~~~~~~lD~~~~~~ 597 (785) T protein:vir:94 523 KFLFKDSVQLQASWSHWEFPKDDKILASASIGSTMFIVRQHQGGVDIEHLKFIKEATD-----FPSEPYRLHVDSKVSMV 597 (785) T ss_pred EEeecCCceEEEEEEEEEeCCCeEEEEEEEeCCEEEEEEEcCCCEEEEEEEeecccCC-----CCCcceeEEeeeeeEEE Confidence 999865 55678999999987 58888899999999999999999999986544322 133455667776532 Q ss_pred eeeeeeccCCceEEEc-----ccCCCcCCceEEEEEcCcccCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEEE Q lcl|NC_020838. 781 NPYRTYSTSTKKTTVN-----LPFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLI 855 (981) Q Consensus 781 d~~~ty~~~~~~tt~~-----l~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~ 855 (981) ..+++|+.++..++.. .+++|++|+++.+++||..++...+ +...++++++||+++++|+ T Consensus 598 ~~~~~~~~~~~~t~~~~~~~~~g~~~leg~~v~v~adG~~~~~~~v---------------~~~~~tl~~~g~~~~~~v~ 662 (785) T protein:vir:94 598 IPIGSFNADTYKTTVDIGAAYGGNAPSPGRYYLIDSQGAYLDLGEL---------------TSISTVITLNGDWSGRTVF 662 (785) T ss_pred ecCcceeccccccccccccccccCCccCCeEEEEeeCCcCccCceE---------------cCCCcEEEecCCCCCceEE Confidence 3456777766655533 3678999999999999987665432 2345789999999999999 Q ss_pred EEEeeeEEEEeCCceeeccCCCcccceEeeeEEEEEEEEEeecccceEEEEccCCCCceeeEecccccCccccCcccccc Q lcl|NC_020838. 856 IGYVYDMELELPTLYPTQVEGRSSVSDVTSDLILHRLKVSTGLSGPITYKVDITGKDEWTNIINVTLPNTYVLNNVNLSA 935 (981) Q Consensus 856 VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 935 (981) |||+|+++++|+||+++.++|.....+..+|+||+|++|++.+||+|.++|++..++ +.+.+.++..+++.++.+|+.+ T Consensus 663 vGl~y~~~~~~~~~~~~~~~~~~~~~~~~gr~~l~r~~~~~~~sg~~~v~v~~~~~~-~~~~~~~~~~g~~~~~~~~~~t 741 (785) T protein:vir:94 663 IGRSYLMSYKFSRFLIKIEDDSGTQSEDTGRLQLRRAWVNYRDTGALRLIVRNGERE-FVNTFNGYTLGQQTIGTTNIGD 741 (785) T ss_pred EeeeeeEEEeecceeEEecCCCcccccccccEEEEEEEEEeecccceEEEecCCCcc-ceeeecCcccCccccccccccc Confidence 999999999999999999988777777789999999999999999999999877765 4555666666677788888876 Q ss_pred CCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeecccccc Q lcl|NC_020838. 936 SALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRFYRR 980 (981) Q Consensus 936 ~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~rRr 980 (981) + .+++|+.+|+++.+|+|+|++|+||+|++|+|||+||+|+||= T Consensus 742 g-~~~vp~~g~~~~~~v~i~~~~P~P~tvlsi~~eg~y~~r~~~v 785 (785) T protein:vir:94 742 G-QYRFAMNGNALTTSLTLESDYPTPVSIVGCGWEASYAKKARSV 785 (785) T ss_pred c-eEEEEeecccceEEEEEEECCCCceEEEEEEEEEEEeccccCC Confidence 6 6799999999999999999999999999999999999996666 No 6 >protein:vir:1543 Length: 801 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052111;swissprot:trembl:q9t105;genbank:gi:9634037;uniprot:Q9T105;genbank:GeneID:1262408 Probab=100.00 E-value=1.3e-217 Score=1209.68 Aligned_cols=764 Identities=26% Similarity=0.401 Sum_probs=652.2 Q ss_pred CCceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCC---CCceEEEEEeCCCceEEEE Q lcl|NC_020838. 1 MSTISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAE---ARGRWFPILRDEEEKYVCQ 77 (981) Q Consensus 1 m~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~---~~~~~~~~~rd~~e~y~~~ 77 (981) ||+|+|+||||++|||||||.+|+|||+++|+||+|||+.||+||||++||++|+..+ ....+|+|+||+.|+|++. T Consensus 1 M~~i~~s~~n~~~GvSqq~d~~r~~~q~~~~~N~~~~~~gGl~rRpGt~~va~~~~~~~~~~~~~~~~~~~~~~e~y~l~ 80 (801) T protein:vir:15 1 MALISQSIKNLKGGISQQPDILRFAEQGSVQINGWSSESEGLQKRPPMIHLKTLGPAGYVGAQPYVHLINRDEFEQYFVV 80 (801) T ss_pred CceeeeecchhhcceecCcchHhhhhhHhhhhcceeccccCcccCCchheeeeecCCCCcccceeEEEEEeCCceEEEEE Confidence 9999999999999999999999999999999999999999999999999999997654 4568999999999999988 Q ss_pred EEcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccccc Q lcl|NC_020838. 78 YDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTATKV 157 (981) Q Consensus 78 ~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~~~ 157 (981) +. ++.|+|||+ +|++++|+..+ .|+..++++++|++++++|||||+| T Consensus 81 ~~--~~~irv~~~-~G~~~~v~~~~-------~y~~~~~~~~~l~~~~~aD~~fi~n----------------------- 127 (801) T protein:vir:15 81 FT--GEDIKVFDL-DGKEYQVRGDR-------SYVRTANPREDLRMITVADYTFVTN----------------------- 127 (801) T ss_pred Ec--CCeEEEEcc-CCcEEEEecCC-------ccccccCchhheeEEEEcCEEEEee----------------------- Confidence 84 677999998 69998876332 2444555555555555555555555 Q ss_pred eeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeeeee Q lcl|NC_020838. 158 NLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEKEV 237 (981) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 237 (981) T Consensus 128 -------------------------------------------------------------------------------- 127 (801) T protein:vir:15 128 -------------------------------------------------------------------------------- 127 (801) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EEeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCc Q lcl|NC_020838. 238 AAAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKN 317 (981) Q Consensus 238 ~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~ 317 (981) ++ T Consensus 128 ------------------------------------------------------------------------------r~ 129 (801) T protein:vir:15 128 ------------------------------------------------------------------------------RK 129 (801) T ss_pred ------------------------------------------------------------------------------CC Confidence 44 Q ss_pred eeEeccCCC----CCCCCceEEEEEeeeccceEEEEeeCce-EEEEEeccCCCcc---eecHHHHHHHHHhhhhc----- Q lcl|NC_020838. 318 KTTAMKTTT----SAAVPNVAFVVIRIVAYNSDYSVTLNGT-TVTHSTPDTVAGA---TTDSGSIAAALTSSINA----- 384 (981) Q Consensus 318 ~~~~~~~~~----~~~~~~~~~v~v~~g~y~~~~~v~~ng~-~~~~~t~~~~a~~---~~~~~~ia~~l~~~i~~----- 384 (981) ++|++.+.. .......+++++++++|+++|+|.+||. .+.++++++..++ .++.+.+|..|..++.. T Consensus 130 ~~~~~~~~~~~~~~~~~~~~alv~v~~~~yg~t~~I~i~gs~~~~~t~~~gs~~~~~~~~s~~~ia~~l~~~~~~~~p~~ 209 (801) T protein:vir:15 130 VVVQSNDQSVNLPGFKDQGDALINVRGGQYGRRLSIEFNGAERAAVQLPDGSQPAHVNEVDGQAIAEKLAAQLRNNLGNP 209 (801) T ss_pred eeeecccCccccCccCCCCceEEEeeeccCceeEEEEeCCcceEEEEeccCcccchhhhcceeechHHHhhhhhhccCcc Confidence 444333221 1122235789999999999999999995 4558888776543 35566777777655532 Q ss_pred -----cCceEEEEcCCEEEEEeCCC---ceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEE Q lcl|NC_020838. 385 -----LTGFSATQVGPGIYIEGTSA---FSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYV 456 (981) Q Consensus 385 -----~~~~~~~~vg~~i~i~~~~~---~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv 456 (981) ...|.+...++.+++..+.. ..+++++|.+++.+.++.+.|+++++||..+++|++|+|++++++++++||| T Consensus 210 ~~~~~~~~w~~~~~~g~~~i~a~~~~~~~~~~t~dg~~~~~~~~~~~~v~~~~~lp~~~~~G~~v~v~~~~~~~~~~y~v 289 (801) T protein:vir:15 210 NNDQDPNKWRFNVGPGFIHILAPNNDNVWGLQTKDGYADQLINPVTHYTQSFQKLPINAPDGYIVKIVGDTSKTADQYYV 289 (801) T ss_pred ceeccCccEEEEecCcEEEEeCCCCcccceeeeccccCceeeeEEeecccceeeeeeecCCCcEEEEEecCCCccceEEE Confidence 24678888888898887665 4578899999999999999999999999999999999999999999999999 Q ss_pred EEEcccCCcccceEEEEeeccceeEEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCCceeEEEEEcc Q lcl|NC_020838. 457 EFQTTNSAARGPGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRN 536 (981) Q Consensus 457 ~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~ 536 (981) +|+... ++|+||++|+..++++.+||||.+++.++++|+++.++|++|.+||+++||+|+|+|++|++|+|||| T Consensus 290 ~~~~~~------~~w~E~a~~g~~~~~~~~tmp~~lv~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~g~~~~~v~f~q~ 363 (801) T protein:vir:15 290 RFDLNR------KVWVETIGWNTRTHLYYHTMPWALVRASDGNFDFKVLEWGARTVGDDTTNPYPSFTGQTINDIFFFRN 363 (801) T ss_pred EEEcCC------eeEEeecccccceeeeccccceEEEeeccceEEEeccccccccCCccccCCcccccCCCceEEEEEcc Confidence 998754 47999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEecCCeEEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccce Q lcl|NC_020838. 537 RLGLLSNEAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTT 616 (981) Q Consensus 537 RL~f~s~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~ 616 (981) ||||++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|+ ++|||+|+ T Consensus 364 RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~i~t~~~q~~ls~~-~~lTP~~~ 442 (801) T protein:vir:15 364 RLGFLSGENIILSRTSKYFNFFPASVSNYSDDDPIDVAVSHNRVSTLKYAVPFSEELLLWSDQAQFVLTAS-GILSSRSV 442 (801) T ss_pred eEEEeeCCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEcCC-CcccceeE Confidence 99999999999999999999999999999999999999999999999999999999999999999999986 49999999 Q ss_pred EEEEEeeeccccCCCcEEeCCeEEEEecCCCeeEEEE-EeecccccceehhhHHHHHHHhcCCCeEEEEEcCCCcEEE-E Q lcl|NC_020838. 617 KINTISTFECDAEIDAVAVGTTQAFISKSNLYSKLFL-MLNVQKEAAATIDEATTNVPEYVPSDIDSMTASPAMSIVS-L 694 (981) Q Consensus 617 ~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre-~~y~~~~d~~~a~DlS~~~~h~~~~~i~~~~~s~~~~~~~-~ 694 (981) +++++|+|+|++.++|+.+|+++||++++|+|++++| |.|++++|+|+++|||+|++|||++++..+++++.++.++ | T Consensus 443 ~~~~~s~~~~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~y~a~Dlt~~~~hl~~~~v~~~~~~~~~~~~~~~ 522 (801) T protein:vir:15 443 ELNLTTQFDVQDRARPHGVGRNVYFASPRASFTSINRYYAVQDVSSVKNAEDMTAHVPNYIPNGVFSISGTTAENFAAIL 522 (801) T ss_pred EEEEEEeeeccCCCCceEeCCeEEEEecCCCeeEEEEEEeecccccceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEE Confidence 9999999999999999999999999999999887655 6788889999999999999999999999999988877665 5 Q ss_pred EecCCcEEEEEEeecCcch-heeeeEeeccCCceEEEEE--eCCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCcc Q lcl|NC_020838. 695 GKSGSNTVYQHRFFMQGEN-RVQTWYKWQLTGDLRLQFF--DKTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKT 771 (981) Q Consensus 695 ~~~g~~~l~~y~y~~~~eq-~V~aWsrw~~~G~v~sv~~--~~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 771 (981) +...++.|++|+|+|+.+| +|+|||||+|+|.++++|+ ++|+||++++|+++.+++++++..+..+. ..+.+ T Consensus 523 ~~~~~~~l~~~~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~~v~r~~~~~~~r~~~~~~~~~~-----~~~~~ 597 (801) T protein:vir:15 523 TSGAPNRVYIYKFLYIDEEIRQQSWSHWDFGDNVTVFAAQVINSTMTVLMGNEHAVWMGRLHFTKNSIDI-----PGEPY 597 (801) T ss_pred EEcCCCEEEEEEEecCCCceEEEeeEEEEcCCCEEEEEEEecCCEEEEEEEecCcEEEEEEEEccccccC-----CCcce Confidence 5667799999999996655 4679999999999998876 68999999999999999999877655432 23445 Q ss_pred cccccce--eeeeeeeeccCCceEEE----cccCCCcCCceEEEEEcCcccCceeEEEEecCceEEEccccccCCCEEEe Q lcl|NC_020838. 772 DVCLDMF--NVNPYRTYSTSTKKTTV----NLPFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLL 845 (981) Q Consensus 772 ~~~lD~~--~vd~~~ty~~~~~~tt~----~l~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL 845 (981) +.+||+. ++.++++|++.+..++. ..+++|++|+.+.+++||......++. +|. ...+++++ T Consensus 598 ~~~lD~~~~~~~~~~t~~~~~~~~~~~~~~~~gl~~l~g~~v~v~~dG~~~~~~~~~----~g~--------~~~~~~~i 665 (801) T protein:vir:15 598 RLYIDAKRKYTIPAGTYNDDTYQTSISLATIYGMNFTKGRVSVVFPDGKIIEVDQPI----NGW--------SSDPVLRL 665 (801) T ss_pred eeeeeeeeeEeeccceeccCceecccccccccccccccceEEEEEeCCceeeeeeec----Ccc--------cCcceEEE Confidence 6678874 45678888887776654 446799999999999999765544321 111 13468999 Q ss_pred CCCCCCCEEEEEEeeeEEEEeCCceeeccCCCc-ccceEeeeEEEEEEEEEeecccceEEEEccCCCCceeeEecccccC Q lcl|NC_020838. 846 NGDYRGRDLIIGYVYDMELELPTLYPTQVEGRS-SVSDVTSDLILHRLKVSTGLSGPITYKVDITGKDEWTNIINVTLPN 924 (981) Q Consensus 846 ~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~-~~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~ 924 (981) ++++++++|+|||+|+++++|+||+++.++|++ .+.+..+||||||++|++++||.|.+.|++.+++..+...+.++.. T Consensus 666 ~~~~~~~~v~vGl~y~~~~~~~~~~~~~~~~~~~~~~~~~~rl~l~r~~~~~~~tg~~~~~v~~~~~~~~~~~~~~~~~~ 745 (801) T protein:vir:15 666 DGNQEGQVVYIGFNIPFTYTFSKFLIKKTAEDGSTATEDIGRLQLRRAWVNYEDSGAFTIRVNNLSREFIYTMAGARLGS 745 (801) T ss_pred cCCCCCcEEEEeeeeeEEEEecceEEeccCCCCCceeeeeccEEEEEEEEEeccCcceEEEECCcccccceeecCccccc Confidence 999999999999999999999999999888754 4557789999999999999999999999999988766677777766 Q ss_pred ccc-cCccccccCCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeecccccc Q lcl|NC_020838. 925 TYV-LNNVNLSASALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRFYRR 980 (981) Q Consensus 925 ~~~-~~~~p~~~~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~rRr 980 (981) ..+ .+.+|+ .++.+++|+.+|+++.+|+|+|++|+||+||+|+|||+||+|.||= T Consensus 746 ~~~~~~~~~~-~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~e~~y~~r~~~~ 801 (801) T protein:vir:15 746 DNLRVGRSNI-GTGQYRFPVVGNAQTNLVTIESDASTPLNIIGCGWEGNYLRRSSGI 801 (801) T ss_pred cccccccccc-ccceEEEEEeecCceEEEEEEECCCCcEEEEEEEEEEEEeccccCC Confidence 544 455555 5557899999999999999999999999999999999999999988 No 7 >protein:vir:99677 Length: 794 # NCBI annotation: Tail tubular protein B # Family: family:all:825 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249591;genbank:gi:68299742;genbank:GeneID:3799992 Probab=100.00 E-value=2.7e-217 Score=1207.88 Aligned_cols=766 Identities=28% Similarity=0.449 Sum_probs=659.9 Q ss_pred CCceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCC---CCCceEEEEEeCCCceEEEE Q lcl|NC_020838. 1 MSTISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNA---EARGRWFPILRDEEEKYVCQ 77 (981) Q Consensus 1 m~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~---~~~~~~~~~~rd~~e~y~~~ 77 (981) ||+|+|+||||++|||||||++|+|||+++|+||++||+.||+||||++||++|.++ +.+.++|+++||+.|+|++. T Consensus 1 M~~i~~s~~n~~~GvS~q~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~f~~~~~~~y~l~ 80 (794) T protein:vir:99 1 MALISQSIKNLKGGISQQPDILRYSDQGSKQINGFSSEVEGLQKRPPSVHIKRLTDQFGLGQKPYCHIINRDEVERYAVF 80 (794) T ss_pred CceeeeecchhhcceecCCchHHhhhhHhhhhcceeeeccCcccCCccceeeeecCCCCCccccEEEEEEeCCCceEEEE Confidence 999999999999999999999999999999999999999999999999999999765 34678999999999999999 Q ss_pred EEcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccccc Q lcl|NC_020838. 78 YDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTATKV 157 (981) Q Consensus 78 ~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~~~ 157 (981) |. ++.|+|||+.+|++++|+.+.. .. T Consensus 81 f~--~~~irv~~~~~g~~~~v~~~~~-----~~----------------------------------------------- 106 (794) T protein:vir:99 81 FT--GSNIRVFDLFTGDEKTVNAPNG-----LS----------------------------------------------- 106 (794) T ss_pred Ec--CCeEEEEECCCCeEEEeecccc-----cc----------------------------------------------- Confidence 94 5779999998898876531111 00 Q ss_pred eeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeeeee Q lcl|NC_020838. 158 NLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEKEV 237 (981) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 237 (981) T Consensus 107 -------------------------------------------------------------------------------- 106 (794) T protein:vir:99 107 -------------------------------------------------------------------------------- 106 (794) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EEeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCc-ceeeEEEEcCEEEEeCC Q lcl|NC_020838. 238 AAAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAP-EDIEILTINDYTFVLNK 316 (981) Q Consensus 238 ~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~-~dl~~~t~ad~tfi~n~ 316 (981) |+.++++ .+|+++|+||+|||+|+ T Consensus 107 -------------------------------------------------------y~~~~~~~~~l~~~q~aD~~fi~n~ 131 (794) T protein:vir:99 107 -------------------------------------------------------YVSSSNPRKDLRMVTVADYTFILNR 131 (794) T ss_pred -------------------------------------------------------ccccCCccceeeEEEEccEEEEEcC Confidence 1112222 36788888888888888 Q ss_pred ceeEeccCCC----CCCCCceEEEEEeeeccceEEEEeeCceEE-EEEeccCCCc---ceecHHHHHHHHHhhhhccCce Q lcl|NC_020838. 317 NKTTAMKTTT----SAAVPNVAFVVIRIVAYNSDYSVTLNGTTV-THSTPDTVAG---ATTDSGSIAAALTSSINALTGF 388 (981) Q Consensus 317 ~~~~~~~~~~----~~~~~~~~~v~v~~g~y~~~~~v~~ng~~~-~~~t~~~~a~---~~~~~~~ia~~l~~~i~~~~~~ 388 (981) +++|++..+. .......++++++.++|+++|+|.++|..+ .+++|++.+. .+.++++|+.+|...+.. .+| T Consensus 132 ~~~p~~~~~~~~~~~~~~~~~~~~~v~~g~y~~~y~v~i~gs~ta~~~tp~~~~~~~~~~~s~~~ia~~l~~~l~~-~g~ 210 (794) T protein:vir:99 132 NVATAQGTTNTPSGLAPFGHFGLVVIRGGQYGRTYRIKVNGSVEASFETPLGDQVAHAKQIDIAYIIDQLAAGLIN-KGW 210 (794) T ss_pred CeeeeEeeeeccccCcCCCceEEEEeccCCCCceEEEEecCCcccceeeccCcccccccccchhhhhhhhHhhhhc-ccc Confidence 8888865432 234556789999999999999999998654 4777776644 457789999999988864 678 Q ss_pred EEEEcCCEEEEEeCCC---ceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCc Q lcl|NC_020838. 389 SATQVGPGIYIEGTSA---FSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAA 465 (981) Q Consensus 389 ~~~~vg~~i~i~~~~~---~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~ 465 (981) +....+..+++.++.. +.+++++|.+++++..+.++|+++++||+.||+|++|+|++++.+++++||++|+... T Consensus 211 ~v~~~~g~~~i~~~~~~~v~t~s~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~~~~~~~~y~v~~~~~~--- 287 (794) T protein:vir:99 211 AVTKGSGYFYFSKSGSVIINSLEVEDGYNGQLAWGIINDVQKTTQLPVYAPNNYIIRVSGDPTLNQDDYYVRFDASR--- 287 (794) T ss_pred eEEeCCeEEEEEecCCceeEEEEeecCCCCceeeEEeeeccceeecccCCCCCeEEEEeccCCCCCCceEEEEEcCC--- Confidence 9999999999988765 4678889999999999999999999999999999999999999999999999998754 Q ss_pred ccceEEEEeeccceeEEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEecCCe Q lcl|NC_020838. 466 RGPGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLLSNEA 545 (981) Q Consensus 466 ~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~s~~~ 545 (981) ++|+||+++++..+++.+|||+.++++++++|++++++|++|.+||+++||+|+|+|++|++|+||||||||+++++ T Consensus 288 ---~~w~e~~~~~~~~~~~~~t~p~~~v~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~is~v~f~q~RL~f~~~~~ 364 (794) T protein:vir:99 288 ---NVWTECPAPNIKADYNKATMPHVLIREADGTFTFKQADWTHRAAGDDETNPYPSFIGNSINDIFFFRNRLGFLSGEN 364 (794) T ss_pred ---ceEEeeccceeecceeccceEEEEeccCCCceeEeeccccccccCCcccCCCccccCcceeEEEEEeeeEEEecCCe Confidence 38999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEeeec Q lcl|NC_020838. 546 VIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTISTFE 625 (981) Q Consensus 546 V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S~~~ 625 (981) |||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|+ ++|||+|++++++|+|+ T Consensus 365 v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~~~~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~-~~lTP~~~~~~~~s~~~ 443 (794) T protein:vir:99 365 VILSGSGNYFNFFPESVAVLTDTDPIDVAVSTNRISILKYAVPFSEELILWSDQAQFVLSSD-GGLTPTTIRLDLTTEFE 443 (794) T ss_pred EEEEecCCccccccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCC-CcccceeEEEEEEEEee Confidence 99999999999999999999999999999999999999999999999999999999999996 49999999999999999 Q ss_pred cccCCCcEEeCCeEEEEecCCCeeEEEE-EeecccccceehhhHHHHHHHhcCCCeEEE-EEcCCCcEEEEEecCCcEEE Q lcl|NC_020838. 626 CDAEIDAVAVGTTQAFISKSNLYSKLFL-MLNVQKEAAATIDEATTNVPEYVPSDIDSM-TASPAMSIVSLGKSGSNTVY 703 (981) Q Consensus 626 ~s~~v~Pv~vG~~v~Fv~~~g~~s~vre-~~y~~~~d~~~a~DlS~~~~h~~~~~i~~~-~~s~~~~~~~~~~~g~~~l~ 703 (981) |++.++|+.+|+++||+|++|+|++++| |.|++++|+|+++|||+|++|||++++..+ +++.+|..++|++.+++.|+ T Consensus 444 ~~~~~~Pv~vg~~v~f~~~~g~~~~v~r~~~~~~~~d~y~a~Dlt~~~~hl~~~~~~~~~a~~~~~~~~v~~~~~~g~l~ 523 (794) T protein:vir:99 444 VTEQARPYGIGRGVYFVSPRAKFSSVRRFYAVQDVTQVKNAEDISAHVPYYVENGVFKMSGSSTENFLTILTEGNEQRVY 523 (794) T ss_pred ccCCCCceEeCCeEEEEecCCCeeEEEEeeeeccccCceehhhHHHHHHHhcCCCeEEEEEeCCCCcEEEEEEcCCCEEE Confidence 9999999999999999999999988765 579999999999999999999999987655 67778889999999999999 Q ss_pred EEEeecC-cchheeeeEeeccCCceEEEEE--eCCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCccccccccee- Q lcl|NC_020838. 704 QHRFFMQ-GENRVQTWYKWQLTGDLRLQFF--DKTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFN- 779 (981) Q Consensus 704 ~y~y~~~-~eq~V~aWsrw~~~G~v~sv~~--~~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~- 779 (981) +|+|+|+ +||+|+|||||+|+|.++++|+ ++|+||++++|+++.+||||++..+..+. ..+.++.++|+.. T Consensus 524 ~~~y~~~~~eq~v~aW~~~~~~g~~~~~~~~~~~d~l~~~v~r~~~~~ler~~~~~~~~~~-----~~~~~~~~lD~~~~ 598 (794) T protein:vir:99 524 FYKFLYLQEQLVQQSWSHWDFGVNCRVLCCDMIGAVMHLIIDSPSGVLMEKIEFTQNTKDY-----PDEPYRLYVDRKIE 598 (794) T ss_pred EEEEeecCCceEEEeEEEEEcCCCeEEEEEEEcCCEEEEEEEeCCCEEEEEEEeeeCCCCC-----CCcccceeeeeeee Confidence 9999986 4556789999999999988776 69999999999999999999877665432 2344556666642 Q ss_pred -eeeeeeeccCCceEEE----cccCCCcCCceEEEEEcCcccCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEE Q lcl|NC_020838. 780 -VNPYRTYSTSTKKTTV----NLPFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDL 854 (981) Q Consensus 780 -vd~~~ty~~~~~~tt~----~l~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v 854 (981) .....+|+.+...+++ ..+++||+|+.+.+++||........ ....+.+.+.++|++++++++| T Consensus 599 ~~~~~~~~~~~~~~~~~~~~~~~g~~~l~g~~v~~~~dg~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~v 667 (794) T protein:vir:99 599 YTFPEGSYNDDDFKTRVKLKDIYGSTPANGQYVFISLGGVTFTFDPP-----------AGGWQANDGLIEFDGDLRGTKF 667 (794) T ss_pred eeecccccccCcceeEEeccccccccccCCceEEEEeCCceeeeecc-----------cceEecCccEEEecCCCCCcEE Confidence 3345566666666665 35678999999999999864321110 0012345688999999999999 Q ss_pred EEEEeeeEEEEeCCceeeccCCCccc-ceEeeeEEEEEEEEEeecccceEEEEccCCCCceeeEecccccC-ccccCccc Q lcl|NC_020838. 855 IIGYVYDMELELPTLYPTQVEGRSSV-SDVTSDLILHRLKVSTGLSGPITYKVDITGKDEWTNIINVTLPN-TYVLNNVN 932 (981) Q Consensus 855 ~VGl~y~s~v~~~~~~i~~~~g~~~~-~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~-~~~~~~~p 932 (981) +|||+|+++++|+||+++.+++.+.. ....|||+|||++|++++||+|.+.+++.+++....+.+.+... ...++.+| T Consensus 668 ~vGl~y~s~~~~~~~~~~~~~~~g~~~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~ 747 (794) T protein:vir:99 668 FVGEAYTFLYEFSKFLIKTTDTADGVATEDIGRLQLRRAWVNYDKSGNFRVEVNNQGRTFTYNMTGNRLSTNELILGDES 747 (794) T ss_pred EEeeeeeEEEeecceEEeecCCCCceeeeccceEEEEEEEEEeecccceEEEECCCccceeeeccccccccccccccccc Confidence 99999999999999999987754433 35568999999999999999999999999987655555666543 45566777 Q ss_pred cccCCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeecccccc Q lcl|NC_020838. 933 LSASALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRFYRR 980 (981) Q Consensus 933 ~~~~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~rRr 980 (981) +.++ .+++|+.+|+++.+|+|+|++|+||+|++|+|||+||+|.||= T Consensus 748 ~~tg-~~~vp~~g~~~~~~v~i~~d~P~P~tvlsi~~e~~y~~r~~~v 794 (794) T protein:vir:99 748 LDTG-QFRYAVSGNATQVTVSLISDTPNPLSIIGGGWEGYYVRRSSGI 794 (794) T ss_pred cccc-eEEEEecccccceEEEEEECCCCCEEEEEEEEEEEEeccccCC Confidence 7655 7899999999999999999999999999999999999999988 No 8 >protein:vir:94583 Length: 792 # NCBI annotation: Tubular tail protein B # Family: family:all:825 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919014;genbank:gi:119637778;genbank:GeneID:5179343 Probab=100.00 E-value=1.7e-216 Score=1203.54 Aligned_cols=766 Identities=26% Similarity=0.394 Sum_probs=657.1 Q ss_pred CCceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCC---CCceEEEEEeCCCceEEEE Q lcl|NC_020838. 1 MSTISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAE---ARGRWFPILRDEEEKYVCQ 77 (981) Q Consensus 1 m~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~---~~~~~~~~~rd~~e~y~~~ 77 (981) ||+|+|+||||++|||||||++|+|||+++|+||+|||+.||+||||++||+.|.+.+ ....+|+++||+.|+|++. T Consensus 1 M~~i~~s~~n~~~GiSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~~~~~~~q~y~l~ 80 (792) T protein:vir:94 1 MALISQSVKNLKGGISQQPNILRFPEQGSEQINGWSSETEGLQKRPPFVFTKTIGDQNALGAKPLVHLINRDSAEQYYVV 80 (792) T ss_pred CcceeeecchhhcceecCcchHHhhhhhhhhhcceeeeccccccCChhHHHHhhhcCCCCCcccEEEEEEeCCCceEEEE Confidence 9999999999999999999999999999999999999999999999999999987663 4567999999999999999 Q ss_pred EEcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccccc Q lcl|NC_020838. 78 YDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTATKV 157 (981) Q Consensus 78 ~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~~~ 157 (981) +.. +.++|||+ +|.++.|+.+ .+|+.+.+++++ T Consensus 81 f~~--~~~rv~~~-~g~~~~~~~~-------~~y~~~~~~~~~------------------------------------- 113 (792) T protein:vir:94 81 FTG--QGVRVFDL-NGKEYDVKGD-------LSYVKVENPRDD------------------------------------- 113 (792) T ss_pred EcC--CeEEEEec-CCceEEeccc-------CceeeecCCcce------------------------------------- Confidence 964 44999998 5888765321 123333333333 Q ss_pred eeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeeeee Q lcl|NC_020838. 158 NLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEKEV 237 (981) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 237 (981) T Consensus 114 -------------------------------------------------------------------------------- 113 (792) T protein:vir:94 114 -------------------------------------------------------------------------------- 113 (792) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EEeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCc Q lcl|NC_020838. 238 AAAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKN 317 (981) Q Consensus 238 ~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~ 317 (981) |+++|+||+|||+|++ T Consensus 114 ----------------------------------------------------------------l~~~q~aD~~fi~n~~ 129 (792) T protein:vir:94 114 ----------------------------------------------------------------LRMVTVADYTFIVNRN 129 (792) T ss_pred ----------------------------------------------------------------eEEEEEcCEEEEEeCC Confidence 4556666666666666 Q ss_pred eeEeccC--CCCCCCCceEEEEEeeeccceEEEEeeCceEEEEEeccCCCcce---ecHHHHHHHHHhhhhc---cCceE Q lcl|NC_020838. 318 KTTAMKT--TTSAAVPNVAFVVIRIVAYNSDYSVTLNGTTVTHSTPDTVAGAT---TDSGSIAAALTSSINA---LTGFS 389 (981) Q Consensus 318 ~~~~~~~--~~~~~~~~~~~v~v~~g~y~~~~~v~~ng~~~~~~t~~~~a~~~---~~~~~ia~~l~~~i~~---~~~~~ 389 (981) ++|++.. ....++...+++++++|+|+++|++.++++.++++++.+.+++. ..+++++++|...... ..+|+ T Consensus 130 ~~~~~~~~~~~~~~~~~~~~v~i~~g~y~~~y~i~i~~~~~~~~~~~~t~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~ 209 (792) T protein:vir:94 130 MVVRPDTTPLYTLKENGDCLINIRGGMYGRTLAFTINNTKIAYEIAHGDAPEHSKQTDAQWLVKKLAGLARLNVAFKGWT 209 (792) T ss_pred ccceeEecCcCCCCCCceEEEEccCCCcceeEEEEecCceeeeeeecCcccceecccchhhhhhhhhhhccccccccccE Confidence 6666432 22345556789999999999999999999999999988876654 5678899988776543 35788 Q ss_pred EEEcCCEEEEEeCCC---ceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCcc Q lcl|NC_020838. 390 ATQVGPGIYIEGTSA---FSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAAR 466 (981) Q Consensus 390 ~~~vg~~i~i~~~~~---~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~ 466 (981) ..+.+++++|.+++. .++++++|.+++++.++++.|+++++||+.|++|++|+|+++++++.|+||++|+...+ T Consensus 210 ~~~~~~~~~i~~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~lp~~~~~G~~v~i~~~~~~~~d~y~v~~~~~~~--- 286 (792) T protein:vir:94 210 FTEGPGYIHVIAPSNSQINSLSTEDGYADQLMNAVMHTSQSFSRLPVEAPNGYTVKIVGDTSKTSDMFYVQYDNMKK--- 286 (792) T ss_pred EEECCeEEEEEecCCceeeeeecccCcCcceeeeeeecccccccccccCCCCcEEEEEccCCCCccceEEEEEcCCc--- Confidence 888999999987664 45678899999999999999999999999999999999999999999999999987643 Q ss_pred cceEEEEeeccceeEEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEecCCeE Q lcl|NC_020838. 467 GPGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLLSNEAV 546 (981) Q Consensus 467 g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~s~~~V 546 (981) +|+||+++++..+++.+||||+++++++++|+++.++|++|.+||+++||.|+|+|++|++|+||||||+|+++++| T Consensus 287 ---~w~E~~~~~~~~~~~~~tmp~~lv~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~g~~i~~v~f~q~RL~f~~~~~v 363 (792) T protein:vir:94 287 ---VWKEVAGWGVQKGLNGGTMPHALVRQADGSFQMQVLPWTQRTCGDMDTNPTPSIVDQKINDVFFFRNRLGFLAGENI 363 (792) T ss_pred ---eEEEecccceeeeecccccCeeEEEcCCCcEEEEeccccccccCccccCccceeccCCcceEEEEcceEEEecCCeE Confidence 79999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEeeecc Q lcl|NC_020838. 547 IMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTISTFEC 626 (981) Q Consensus 547 ~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S~~~~ 626 (981) ||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|+ ++|||+|++++++|+|+| T Consensus 364 ~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~~~q~~l~~~-~~lTP~~~~i~~~s~~~~ 442 (792) T protein:vir:94 364 VMSRTSKYFSLFPASVANLSDDDPIDVAVSHNRISILKYAVPFSEELLLWSDQAQFVLSAQ-GILSPKSVELNLTTEFDV 442 (792) T ss_pred EEEccCCcccCccccccCCCCCccEEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeCC-CcccceeEEEEEEEEeec Confidence 9999999999999999999999999999999999999999999999999999999999986 499999999999999999 Q ss_pred ccCCCcEEeCCeEEEEecCCCeeEEEE-EeecccccceehhhHHHHHHHhcCCCeEEEE-EcCCCcEEEEEecCCcEEEE Q lcl|NC_020838. 627 DAEIDAVAVGTTQAFISKSNLYSKLFL-MLNVQKEAAATIDEATTNVPEYVPSDIDSMT-ASPAMSIVSLGKSGSNTVYQ 704 (981) Q Consensus 627 s~~v~Pv~vG~~v~Fv~~~g~~s~vre-~~y~~~~d~~~a~DlS~~~~h~~~~~i~~~~-~s~~~~~~~~~~~g~~~l~~ 704 (981) ++.++|+.+|++++|++++|+|++++| |.|++++|+|+++|||+|++|||++++..++ ++..|..++|++.+++.|++ T Consensus 443 ~~~~~Pv~vG~~v~Fv~~~g~~~~v~r~~~~~~~~d~y~a~DlT~~~~hl~~~~v~~~~a~~~~~~~vv~~~~~~g~l~~ 522 (792) T protein:vir:94 443 SDRARPFGVGRGVYFASPRASYTSLNRYYAVQDVSSVKSAEDMSAHVPNYIPNGVFSIRGSSTENFISVLSSNAPSRIFL 522 (792) T ss_pred cCCCCceEeCCeEEEeecCCCeeEEEeeeeeccccCceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCeEEE Confidence 999999999999999999999988776 5677779999999999999999999987765 55677789999999999999 Q ss_pred EEeecCcch-heeeeEeeccCCceEEEEE--eCCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCccccccccee-- Q lcl|NC_020838. 705 HRFFMQGEN-RVQTWYKWQLTGDLRLQFF--DKTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFN-- 779 (981) Q Consensus 705 y~y~~~~eq-~V~aWsrw~~~G~v~sv~~--~~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~-- 779 (981) |+|+|+.+| +|+|||||+|+|+++++|+ .+|+||++|+|++++++||+.+.....+. ..+.++.++|+.. T Consensus 523 ~ty~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~D~l~~~v~r~~~~~~~r~~~~~~~~d~-----~~~~~~~~lD~~~~~ 597 (792) T protein:vir:94 523 YKFLYLNEEIAQQSWSHWELGSNVTVLACDSIGSTMYLVLRNQSHTWMCRAHFTKNSIDF-----PDEPYRLYIDNKVKY 597 (792) T ss_pred EEEeecCCceEEEeEEEEEcCCcEEEEEEeecCCEEEEEEEeCCCEEEEEEEEeeccccc-----CCCcceeeeeeeeeE Confidence 999987555 5789999999999988776 68999999999999999999876544321 2234555666643 Q ss_pred eeeeeeeccCCceEEE----cccCCCcCCceEEEEEcCcccCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEEE Q lcl|NC_020838. 780 VNPYRTYSTSTKKTTV----NLPFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLI 855 (981) Q Consensus 780 vd~~~ty~~~~~~tt~----~l~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~ 855 (981) .....+|++++..|+. ..+++|++|+.+.+++||....... .....+..+++|+|+|++++++|+ T Consensus 598 ~~~~~~~~~~~~~T~~~~~~~~gl~~l~G~~v~v~~dG~~~~~~~-----------~~~~~~~~~~~i~~~g~~~a~~v~ 666 (792) T protein:vir:94 598 VIPEGSYNDDTYATTVKPVDVYGMKYWTGKFYIVASDGLVSWFEP-----------PRGGWPNGVPMLTMSGNREGETIY 666 (792) T ss_pred EecCcceecCceeeeeccccccCcccccCcEEEEEecCceeEeec-----------ccceecCCccEEEecCCccCCeEE Confidence 2234567767766654 3467999999999999985221000 001123567899999999999999 Q ss_pred EEEeeeEEEEeCCceeeccCCCcc-cceEeeeEEEEEEEEEeecccceEEEEccCCCCceeeEecccccCccccCccccc Q lcl|NC_020838. 856 IGYVYDMELELPTLYPTQVEGRSS-VSDVTSDLILHRLKVSTGLSGPITYKVDITGKDEWTNIINVTLPNTYVLNNVNLS 934 (981) Q Consensus 856 VGl~y~s~v~~~~~~i~~~~g~~~-~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~p~~ 934 (981) |||+|+++++|+||+++.+.|... +....||+||||+++++.+||.|.+++++.+++.+....+.++........+|+. T Consensus 667 VGl~y~~~~~~~~~~~~~~~g~~~~~~~~~gr~rl~r~~~~~~~tg~~~v~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 746 (792) T protein:vir:94 667 VGLAISFRYVFSKFLIKKTADDGSIATEDIGRLQLRRAWVNYEDSGAFTVEVENTSRLFSYDMAGARLGSNVLRAGGLNV 746 (792) T ss_pred EeeeeeEEEEeccceeeccCCCcCccccceeeEEEEEEEEeeeccceeEEEEcCCCcceeeeeccceecccccccccccc Confidence 999999999999999998887644 4567799999999999999999999999999888777777787766655555566 Q ss_pred cCCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeecccccc Q lcl|NC_020838. 935 ASALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRFYRR 980 (981) Q Consensus 935 ~~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~rRr 980 (981) .++.+++|+.+|+++.+|+|+|++|+||+||||+|||+||+|.||= T Consensus 747 ~tg~~~vp~~g~~~~~~v~i~~d~P~P~tvlai~~eg~y~~r~~~v 792 (792) T protein:vir:94 747 GTGQFRFPVTGNAQLNEVRIISEHTTPLNVIGCGWEGNYLRRSSGI 792 (792) T ss_pred ccceEEEEeeccCceEEEEEEECCCCCEEEEEEEEEEEEeccccCC Confidence 6678999999999999999999999999999999999999999988 No 9 >protein:vir:8887 Length: 808 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813776;genbank:gi:29366731;genbank:GeneID:1258831 Probab=100.00 E-value=4e-214 Score=1190.52 Aligned_cols=762 Identities=26% Similarity=0.430 Sum_probs=635.9 Q ss_pred CCceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCC---CCceEEEEEeCCCceEEEE Q lcl|NC_020838. 1 MSTISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAE---ARGRWFPILRDEEEKYVCQ 77 (981) Q Consensus 1 m~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~---~~~~~~~~~rd~~e~y~~~ 77 (981) ||+|+|+||||++|||||||.+|+|||+++|+||+|||+.||+||||++||++|.+++ .+..+|+++||+.|+|+++ T Consensus 1 M~~v~~s~~n~~~GvSqq~d~~R~~~q~~~~~N~~~~~~gG~~rRpgt~~v~~l~~~~~~~~~~~~~~~~~~~~~~y~v~ 80 (808) T protein:vir:88 1 MGLVSQSVKNLKGGISQQPDILRFSNQGALQINGWSSETQGLQKRPPTTFTKRLQNKGFLGTKPLVHLINRDAQEQYFVG 80 (808) T ss_pred CcceeeecchhccceeccchhHhhhhhhhhhhcceeeeccccccCCchheeeeeeccCCCCCCcEEEEEEeCcCceEEEE Confidence 9999999999999999999999999999999999999999999999999999987654 4567899999999999999 Q ss_pred EEcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccccc Q lcl|NC_020838. 78 YDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTATKV 157 (981) Q Consensus 78 ~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~~~ 157 (981) +... .|+|||+ +|++++|+. ...|+.+++++++ T Consensus 81 ~~~~--~i~v~~~-~G~~~~v~~-------~~~y~~~~~~~~~------------------------------------- 113 (808) T protein:vir:88 81 FSGT--GLAVWDL-KGNNYTVRG-------YNGYANCANPRTD------------------------------------- 113 (808) T ss_pred EeCC--eEEEEEc-CCceEEEee-------cCcceEecCChhh------------------------------------- Confidence 8653 3999999 598887751 1224444444444 Q ss_pred eeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeeeee Q lcl|NC_020838. 158 NLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEKEV 237 (981) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 237 (981) T Consensus 114 -------------------------------------------------------------------------------- 113 (808) T protein:vir:88 114 -------------------------------------------------------------------------------- 113 (808) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EEeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCc Q lcl|NC_020838. 238 AAAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKN 317 (981) Q Consensus 238 ~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~ 317 (981) ||++|+||+|||+|++ T Consensus 114 ----------------------------------------------------------------l~~~tvaD~~fi~n~~ 129 (808) T protein:vir:88 114 ----------------------------------------------------------------LRLITVADYTFVVNRN 129 (808) T ss_pred ----------------------------------------------------------------eeEEEEcCEEEEEcCC Confidence 4455555555555555 Q ss_pred eeEeccCCC----CCCCCceEEEEEeeeccceEEEEeeCc------eEEEEEeccCCCc--------------ceecHHH Q lcl|NC_020838. 318 KTTAMKTTT----SAAVPNVAFVVIRIVAYNSDYSVTLNG------TTVTHSTPDTVAG--------------ATTDSGS 373 (981) Q Consensus 318 ~~~~~~~~~----~~~~~~~~~v~v~~g~y~~~~~v~~ng------~~~~~~t~~~~a~--------------~~~~~~~ 373 (981) ++|++.++. .+.....+++++++|+|+++|.|++|| ..+.+.++.+... ..+.+++ T Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~vr~g~y~~~y~i~i~g~~s~~~~t~t~~~~~~s~~~v~~~~~~~~~~~~~~~~~~~ 209 (808) T protein:vir:88 130 TVCQMGSTLTHAAYPRLDGRAIINVRGGQYGRTLSITINGDGTGSSPQASIKMPNGSAEKVPAGDPYAGMNQVDMTDASW 209 (808) T ss_pred cceeecccccccCCCCCCccEEEEEcccccCceEEEEEecCCcceeeeEeEEEccCcccceeeccceeecccCCcccccc Confidence 555443322 223445678999999999999999987 3344555544322 2456678 Q ss_pred HHHHHHhhhhcc---CceEEEEcCCEEEEEeCCC---ceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccC Q lcl|NC_020838. 374 IAAALTSSINAL---TGFSATQVGPGIYIEGTSA---FSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSG 447 (981) Q Consensus 374 ia~~l~~~i~~~---~~~~~~~vg~~i~i~~~~~---~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g 447 (981) |+..|..++.+. .+|+....++.++|..+.+ +.+++++|.+++.+.++.+.|+.+++||..+|+|+.++|++++ T Consensus 210 ia~~l~~~~~~~~~~~~~~~~~~~~~~~i~~~a~~~~~~~~t~~g~~~~~~~~~~~~v~~~~~lp~~~p~g~~v~i~~~~ 289 (808) T protein:vir:88 210 IAAELARQLTVSLGGSGWSFQAGTGWILINAPANDNVRQIATKDGYADTLLSGFIYQVQTFTKLPANAPPGYLVEITGES 289 (808) T ss_pred chhhheeeeeecccccceEEEeccceEEEEeccCceeEEEcccCCcCcceeeeeeeeccceeeccccCCCCcEEEEEecC Confidence 888888777653 4577788889999987664 5678899999999999999999999999999999999999999 Q ss_pred CCcccceEEEEEcccCCcccceEEEEeeccceeEEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCCc Q lcl|NC_020838. 448 DVTADDIYVEFQTTNSAARGPGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGKK 527 (981) Q Consensus 448 ~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ 527 (981) +++.++||++|+... ++|+||++|+..++++.+||||.++++++|+|++++++|++|.+||++|||+|+|+|++ T Consensus 290 ~~~~~~~yv~~~~~~------~~w~e~~~~~~~~~~~~~tmp~~lv~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~ 363 (808) T protein:vir:88 290 ARSGDNYWVQYDASG------KVWKETAKPKIIAGFNNATLPHALVRAADGQFDWTPLTWDGRNAGDDDTNPMPSFVGAT 363 (808) T ss_pred CCCCceeEEEEEcCC------eEEEEeeeccceeeecccceeEEEEecCCceEEEEecccccccccccccCccceecCCc Confidence 999999999998754 37999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEEEEcceEEEecCCeEEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcC Q lcl|NC_020838. 528 INNMFFYRNRLGLLSNEAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTD 607 (981) Q Consensus 528 ps~v~ffq~RL~f~s~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~ 607 (981) |++|+||||||||+++++|||||+||||||++++++++.|||||+++++++++++|+|+|+++++|+|||+++||+|+|+ T Consensus 364 ~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~~~~~~~~i~~~v~~~~~L~i~T~~~e~~l~~~ 443 (808) T protein:vir:88 364 INDVFFFRNRLGFLSGENVVMSRTSKYFNFFPSSVATLSDDDPIDVAISHNRISILKYAVPFSEQLLLWSDQAQFVLSSK 443 (808) T ss_pred eeEEEEEcceEEEeeCCeEEEEeccCcccccCCcccCCCCCccEEEEecCCccceeeEEeecCCcEEEEecCcEEEEeCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999996 Q ss_pred CccccccceEEEEEeeeccccCCCcEEeCCeEEEEecCCCeeEEE-EEeecccccceehhhHHHHHHHhcCCCeEEEEEc Q lcl|NC_020838. 608 ADILSPTTTKINTISTFECDAEIDAVAVGTTQAFISKSNLYSKLF-LMLNVQKEAAATIDEATTNVPEYVPSDIDSMTAS 686 (981) Q Consensus 608 ~~~LTP~~~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vr-e~~y~~~~d~~~a~DlS~~~~h~~~~~i~~~~~s 686 (981) ++|||+|++++++|+|+|++.++|+.+|++++|++++|+|++++ +|.|++++|+|+++|||+|++|||++++..++++ T Consensus 444 -~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~f~~~~g~~~~v~r~~~~~~~~d~y~~~dlt~~~~h~~~~~~~~~~~~ 522 (808) T protein:vir:88 444 -TILSSKTIELDLTTEFDVSDGARPYGIGRGVYFAAPRASFTSLKRYYAIQDVSDVKSAEDVSAHVPSYITNTVHAIHGS 522 (808) T ss_pred -CcccceeEEEEEEEEecccCCCCceEeCCeEEEEecCCCeeEEEEEEEeeeccCceehhhHHHHHHHhcCCCeEEEEEe Confidence 49999999999999999999999999999999999999988765 5688999999999999999999999998887665 Q ss_pred -CCCcEEEEEecCCcEEEEEEeecCc-chheeeeEeeccCCceEE----EEEeCCeEEEEEEcCCcEEEEEEEeecCCce Q lcl|NC_020838. 687 -PAMSIVSLGKSGSNTVYQHRFFMQG-ENRVQTWYKWQLTGDLRL----QFFDKTTFYAVTSSGSNVYLTSYDLTQASES 760 (981) Q Consensus 687 -~~~~~~~~~~~g~~~l~~y~y~~~~-eq~V~aWsrw~~~G~v~s----v~~~~d~ly~vv~r~~~~~l~~~~~~~~~~~ 760 (981) ..+..++|++.+++.|++|+|+|+. ||+|+|||||+|+|.+++ +++++|.||++|+|++++++||+.+.....+ T Consensus 523 ~~~~~~~v~~~~~~g~l~~~~y~~~~~e~~v~aW~r~~~~g~~~~~~~~~~~~~d~l~~vV~r~~~~~ler~~~~~~~~~ 602 (808) T protein:vir:88 523 GTENFVSILSDGSPNKVFIYKFLYLDEILQQQSFSHWEFGDAATTRVLAASCIGSYCYLMIDRPEGLCLERMEFTQHTID 602 (808) T ss_pred CCCCeEEEEEEcCCCEEEEEEEeccCCceeEEeeEEEecCCCeeEEEEEEeccCCEEEEEEEcCCcEEEEEEeeccCCCC Confidence 4556678999999999999999864 556789999999997665 4556999999999999999999987654433 Q ss_pred eEEEecCCCcccccccceeeeeeeeeccCCceEEEcc-----cCCCcCCceEEEEEcCcccCceeEEEEecCceEEEccc Q lcl|NC_020838. 761 GYLTLPTGEKTDVCLDMFNVNPYRTYSTSTKKTTVNL-----PFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFED 835 (981) Q Consensus 761 ~~~~~~~~~~~~~~lD~~~vd~~~ty~~~~~~tt~~l-----~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~ 835 (981) . ..+.++.++|+.....+.+|+.++..++... ++.|+++..+.+.++|...... . . T Consensus 603 ~-----~~~~~~~~lD~~~~~~~g~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~dg~~~~~~------~--------~ 663 (808) T protein:vir:88 603 Y-----SIEPYRTYMDMKKTIVLGAYNIDTNLTSFDVRTAYGGTPGPESTFYTIDQQGVLIEHE------A--------R 663 (808) T ss_pred C-----ccccceeeeeeeeeeccccccCccccceeecccccccccccceeEEEEcCCceEEeee------c--------c Confidence 2 2234556677665555666776666665432 3457777777776666532110 1 1 Q ss_pred cccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccc-eEeeeEEEEEEEEEeecccceEEEEccCCCCce Q lcl|NC_020838. 836 SDISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVS-DVTSDLILHRLKVSTGLSGPITYKVDITGKDEW 914 (981) Q Consensus 836 ~tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~-~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~ 914 (981) ....+.++++++++++++|+|||+|+++++|+||++++++|+..++ ...||+||+|+++++.+||+|.+.|++..++.. T Consensus 664 ~~~~~~~~~~~~~~~~~~v~vGl~y~s~~~~~p~~~~~~~g~~~~~~~~~gr~~l~r~~~~~~~tg~~~v~v~~~~~~~~ 743 (808) T protein:vir:88 664 DWATNPYISFVGNRAGEQMVIGKQYTFQYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWLNYEESGAFEINVNNGSSEFV 743 (808) T ss_pred cccCcceEEeCCCccCceEEEeeeeeEEEEecceEEecCCCCcceeecccceEEEEEEEEEeecccceEEEeCCCcccce Confidence 1245678999999999999999999999999999999998876654 557899999999999999999999998776644 Q ss_pred eeEecccccCccccCccccccCCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeecccccc Q lcl|NC_020838. 915 TNIINVTLPNTYVLNNVNLSASALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRFYRR 980 (981) Q Consensus 915 ~~~~~~~~~~~~~~~~~p~~~~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~rRr 980 (981) ....+.++...+..+.+|+. ++.+++|+.+|+++.+|+|+|++|+||+||||+|||+||+|+||= T Consensus 744 ~~~~~~~~~~~~~~~~~~~~-tg~~~vp~~~~~~~~~v~i~~d~P~P~tilsi~~eg~y~~r~~~v 808 (808) T protein:vir:88 744 YVMTGGRLGIQRVLGELSVG-TGQFKFPVTGNAVNQRVTITSSNPNPLNVIGCGWEGNYIRRSSGI 808 (808) T ss_pred eeccCcccCcccccCccccc-cceEEEEecccCceeEEEEEECCCCceEEEEEEEEEEEeccccCC Confidence 43434444444555566665 457899999999999999999999999999999999999997766 No 10 >protein:vir:2203 Length: 794 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_042000;swissprot:sw:p03747;genbank:gi:9627472;uniprot:P03747;genbank:GeneID:1261024 Probab=100.00 E-value=4.7e-213 Score=1184.66 Aligned_cols=766 Identities=26% Similarity=0.417 Sum_probs=656.6 Q ss_pred CCceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCC---CCceEEEEEeCCCceEEEE Q lcl|NC_020838. 1 MSTISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAE---ARGRWFPILRDEEEKYVCQ 77 (981) Q Consensus 1 m~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~---~~~~~~~~~rd~~e~y~~~ 77 (981) ||+|+|+||||++|||||||++|+|+|+++|+||+|||+.||+||||++||++|.... ....+|+++|++.|+|++. T Consensus 1 M~~i~~s~~n~~~GvSqq~D~~ry~~q~~~~~N~~~~~~gG~~rRpG~~fv~~l~~~~~~~~~~~l~~~~~~~~~~y~l~ 80 (794) T protein:vir:22 1 MALISQSIKNLKGGISQQPDILRYPDQGSRQVNGWSSETEGLQKRPPLVFLNTLGDNGALGQAPYIHLINRDEHEQYYAV 80 (794) T ss_pred CceeeeecchhhcccccCCchHHhhhHHhhhhcceeeccCCceeCCchHhhhhhcccCCCCCccEEEEEEeCCCcEEEEE Confidence 9999999999999999999999999999999999999999999999999999997654 3446789999999999998 Q ss_pred EEcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccccc Q lcl|NC_020838. 78 YDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTATKV 157 (981) Q Consensus 78 ~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~~~ 157 (981) +. ++.|+||++ +|++++|++.++ .+|+.+ T Consensus 81 ~~--~~~irv~~~-~G~~~~v~~~~~-----~~y~~~------------------------------------------- 109 (794) T protein:vir:22 81 FT--GSGIRVFDL-SGNEKQVRYPNG-----SNYIKT------------------------------------------- 109 (794) T ss_pred Ec--CCeEEEEec-CCcEEEeecCCC-----ccceec------------------------------------------- Confidence 85 455999998 588876631111 000000 Q ss_pred eeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeeeee Q lcl|NC_020838. 158 NLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEKEV 237 (981) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 237 (981) T Consensus 110 -------------------------------------------------------------------------------- 109 (794) T protein:vir:22 110 -------------------------------------------------------------------------------- 109 (794) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EEeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCc Q lcl|NC_020838. 238 AAAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKN 317 (981) Q Consensus 238 ~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~ 317 (981) ++..++|+++|+||++||+|++ T Consensus 110 ----------------------------------------------------------~~~~~~l~~~q~aD~~fi~~~~ 131 (794) T protein:vir:22 110 ----------------------------------------------------------ANPRNDLRMVTVADYTFIVNRN 131 (794) T ss_pred ----------------------------------------------------------CCCcccEEEEEEcCEEEEEcCC Confidence 0112368899999999999999 Q ss_pred eeEeccCCCC----CCCCceEEEEEeeeccceEEEEeeCceEE-EEEeccCCCc---ceecHHHHHHHHHhhhhc-cCce Q lcl|NC_020838. 318 KTTAMKTTTS----AAVPNVAFVVIRIVAYNSDYSVTLNGTTV-THSTPDTVAG---ATTDSGSIAAALTSSINA-LTGF 388 (981) Q Consensus 318 ~~~~~~~~~~----~~~~~~~~v~v~~g~y~~~~~v~~ng~~~-~~~t~~~~a~---~~~~~~~ia~~l~~~i~~-~~~~ 388 (981) ++|++..... ..+.+.|+++++.|+|+++|++.+++... .+++|++.+. ..+++++|+.+|..++.+ .++| T Consensus 132 ~~p~~~~~~~~~~~~~~~~~g~v~v~~g~y~~ty~v~I~~~~~a~~~~p~gt~~~~~~~~~~~~ia~~L~~~l~~~~~~~ 211 (794) T protein:vir:22 132 VVAQKNTKSVNLPNYNPNQDGLINVRGGQYGRELIVHINGKDVAKYKIPDGSQPEHVNNTDAQWLAEELAKQMRTNLSDW 211 (794) T ss_pred eeeeEeeccccCCCCCCCceEEEEccCCccceeEEEEeccCcceEEEEcCCCccccceeechhhhhhhhhhhheeccccc Confidence 9998755432 23456899999999999999999988654 4777776654 457889999999988754 5789 Q ss_pred EEEEcCCEEEEEeCCCc---eEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCc Q lcl|NC_020838. 389 SATQVGPGIYIEGTSAF---SISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAA 465 (981) Q Consensus 389 ~~~~vg~~i~i~~~~~~---~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~ 465 (981) +..+.++.++|.+++.. .+++.+|++++++.++.++++++++||+.|++|++++|++++++..+.||++|+... T Consensus 212 t~~~~~~~~~i~a~~~~~~~~~t~~~g~~~t~~~~~~~~~~~~~~lp~~~~~G~~v~i~~~~~~~~~~Y~v~~~~~~--- 288 (794) T protein:vir:22 212 TVNVGQGFIHVTAPSGQQIDSFTTKDGYADQLINPVTHYAQSFSKLPPNAPNGYMVKIVGDASKSADQYYVRYDAER--- 288 (794) T ss_pred eEEeCCceEEEEEcCCceEEEEeeecccCcceeEEEEeccccceeccccCCCCeEEEEEeCCCCCcceeEEEEeccc--- Confidence 99999999999887653 467888999999999999999999999999999999999999999999999998754 Q ss_pred ccceEEEEeeccceeEEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEecCCe Q lcl|NC_020838. 466 RGPGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLLSNEA 545 (981) Q Consensus 466 ~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~s~~~ 545 (981) ++|+||++|++..+++.+||||.++++++|+|++++++|++|.+||+++||+|+|+|++|++|+||||||+|+++++ T Consensus 289 ---~~w~e~~~~~~~~~~~~~t~p~~lv~~~~~~~~~~~~~w~~r~~Gd~~tnp~psf~g~~i~~v~f~q~RL~f~~~~~ 365 (794) T protein:vir:22 289 ---KVWTETLGWNTEDQVLWETMPHALVRAADGNFDFKWLEWSPKSCGDVDTNPWPSFVGSSINDVFFFRNRLGFLSGEN 365 (794) T ss_pred ---eEEEEeeeccceeeecccceeeEeeeccCCcEEEeeccccccccCccccCCcceecCCCcceEEEEcceEEEecCCe Confidence 37999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEeeec Q lcl|NC_020838. 546 VIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTISTFE 625 (981) Q Consensus 546 V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S~~~ 625 (981) |||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|+ ++|||+|++++++|+|+ T Consensus 366 v~~Srtgd~~nF~~~t~~~~~DdD~i~~~~ss~~~~~i~~~v~~~~~L~i~t~~~e~~l~~~-~~lTP~~~~~~~~s~~~ 444 (794) T protein:vir:22 366 IILSRTAKYFNFYPASIANLSDDDPIDVAVSTNRIAILKYAVPFSEELLIWSDEAQFVLTAS-GTLTSKSVELNLTTQFD 444 (794) T ss_pred EEEEccCCccccccccCcCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCC-CcccceeEEEEEEEEee Confidence 99999999999999999999999999999999999999999999999999999999999986 49999999999999999 Q ss_pred cccCCCcEEeCCeEEEEecCCCeeEEEE-EeecccccceehhhHHHHHHHhcCCCeEEEEEcCCC-cEEEEEecCCcEEE Q lcl|NC_020838. 626 CDAEIDAVAVGTTQAFISKSNLYSKLFL-MLNVQKEAAATIDEATTNVPEYVPSDIDSMTASPAM-SIVSLGKSGSNTVY 703 (981) Q Consensus 626 ~s~~v~Pv~vG~~v~Fv~~~g~~s~vre-~~y~~~~d~~~a~DlS~~~~h~~~~~i~~~~~s~~~-~~~~~~~~g~~~l~ 703 (981) |++.++|+.+|+++||++++|++++++| |+|++++|+|+++|||+|++|||++++..+++++.+ ..++|+..+++.|+ T Consensus 445 ~~~~~~Pv~vg~~v~f~~~~g~~~~~~r~~~~~~~~d~y~~~Dlt~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~ 524 (794) T protein:vir:22 445 VQDRARPFGIGRNVYFASPRSSFTSIHRYYAVQDVSSVKNAEDITSHVPNYIPNGVFSICGSGTENFCSVLSHGDPSKIF 524 (794) T ss_pred ccCCCCceEeCCeEEEEecCCCeeEEEEeEeeecccCceehhhHHHHHHHhcCCceEEEEEeCCCCcEEEEEEcCCCEEE Confidence 9999999999999999999999987655 667888999999999999999999999988876665 45667777789999 Q ss_pred EEEeecCcch-heeeeEeeccCCceEEEEE--eCCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCccccccccee- Q lcl|NC_020838. 704 QHRFFMQGEN-RVQTWYKWQLTGDLRLQFF--DKTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFN- 779 (981) Q Consensus 704 ~y~y~~~~eq-~V~aWsrw~~~G~v~sv~~--~~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~- 779 (981) +|+|+|..+| +|+|||||+|+|.|+++|+ .+|+||++|+|+++.+++|+.+.....+. ..+.++.+||++. T Consensus 525 ~~~y~~~~~e~~v~aW~~~~~~g~~~~~~~~~~~d~l~~iv~r~~~~~~~r~~~~~~~~~~-----~~~~~~~~lD~~~~ 599 (794) T protein:vir:22 525 MYKFLYLNEELRQQSWSHWDFGENVQVLACQSISSDMYVILRNEFNTFLARISFTKNAIDL-----QGEPYRAFMDMKIR 599 (794) T ss_pred EEEEeecCCceeEEeeEEEEcCCCEEEEEEEecCCEEEEEEEeCCCEEEEEEEEeeccccC-----CCccceeeeeeeEE Confidence 9999987655 5689999999999999886 48999999999999999999876554332 3455667788763 Q ss_pred -eeeeeeeccCCceEEEcc----cCCCcCCceEEEEEcCcccCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEE Q lcl|NC_020838. 780 -VNPYRTYSTSTKKTTVNL----PFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDL 854 (981) Q Consensus 780 -vd~~~ty~~~~~~tt~~l----~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v 854 (981) +..+++|+..++.+++.+ +++|++|+++.+.+||.......+.. -..+.+.++|++++++++| T Consensus 600 ~~~~~g~~~~~~~~t~~~~~~~~g~~~~~g~~v~~~~dg~~~~~~~~~~------------~~~~~~~~~v~~~~~~~~v 667 (794) T protein:vir:22 600 YTIPSGTYNDDTFTTSIHIPTIYGANFGRGKITVLEPDGKITVFEQPTA------------GWNSDPWLRLSGNLEGRMV 667 (794) T ss_pred EeeccceeecCCcceEEEcccccCcccccceEEEEEcCCceeeceeeee------------eeeccceEEeCCCCCCcEE Confidence 334677887877777643 56899999999999987544332210 0123467899999999999 Q ss_pred EEEEeeeEEEEeCCceeeccCCCccc-ceEeeeEEEEEEEEEeecccceEEEEccCCCCceeeEecccccCc-cccCccc Q lcl|NC_020838. 855 IIGYVYDMELELPTLYPTQVEGRSSV-SDVTSDLILHRLKVSTGLSGPITYKVDITGKDEWTNIINVTLPNT-YVLNNVN 932 (981) Q Consensus 855 ~VGl~y~s~v~~~~~~i~~~~g~~~~-~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~~-~~~~~~p 932 (981) +|||+|+++++|+||++++++|++.. .+..+||||+|++|+|.+||+|.+.|++..++..+.+.+.++... ...+.+| T Consensus 668 ~VGl~y~s~~~~~~~~~~~~~~~~~~~~~~~grl~l~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~g~~~~~~g~~~ 747 (794) T protein:vir:22 668 YIGFNINFVYEFSKFLIKQTADDGSTSTEDIGRLQLRRAWVNYENSGTFDIYVENQSSNWKYTMAGARLGSNTLRAGRLN 747 (794) T ss_pred EEeeeeeEEEEecceEEEecCCCccceeeecceEEEEEEEEEeccccceEEEEcCCCcccceeecCceecccccccCccc Confidence 99999999999999999998887654 467899999999999999999999999887776555555555433 3345666 Q ss_pred cccCCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeecccccc Q lcl|NC_020838. 933 LSASALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRFYRR 980 (981) Q Consensus 933 ~~~~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~rRr 980 (981) +. ++.+++|+++|+++.+|+|+|++|+||+|++|+|||+||+|.||= T Consensus 748 ~~-tg~~~vp~~~~~~~~~v~i~~d~p~P~tvlsi~~eg~y~~r~~~v 794 (794) T protein:vir:22 748 LG-TGQYRFPVVGNAKFNTVYILSDETTPLNIIGCGWEGNYLRRSSGI 794 (794) T ss_pred cc-CceEEEEecccCceEEEEEEECCCCCEEEEEEeEEEEEeccccCC Confidence 54 556899999999999999999999999999999999999999988 No 11 >protein:vir:97014 Length: 800 # NCBI annotation: 33 # Family: family:all:825 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654134;genbank:gi:108862018;genbank:GeneID:5075963 Probab=100.00 E-value=3.8e-213 Score=1185.12 Aligned_cols=758 Identities=22% Similarity=0.368 Sum_probs=641.8 Q ss_pred CceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCC--CCceEEEEEeCCCceEEEEEE Q lcl|NC_020838. 2 STISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAE--ARGRWFPILRDEEEKYVCQYD 79 (981) Q Consensus 2 ~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~--~~~~~~~~~rd~~e~y~~~~~ 79 (981) -.|+||||||++|||||||++|+|||+++|+||+|||+.||+||||++||++|++.+ ....+|+++||+.|+|+++.. T Consensus 1 ~~v~~s~~n~~~GvSqq~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~fva~l~~~~~~~~~~~~~~~~d~~eq~~v~~~ 80 (800) T protein:vir:97 1 MEVQGSLGRQIQGISQQPPAVRLDGQCTAMVNMIPDVVNGTQSRMGTTHIAKILDAGTDDMATHHYRRGDGDEEYFFTLK 80 (800) T ss_pred CeeEeechhhhcccccCchhHhhhhhhhhhhcceeccccccccCCchhhheeecCCCcccceeEEEEEcCCceEEEEEEE Confidence 368999999999999999999999999999999999999999999999999998765 345678899999999988764 Q ss_pred cCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCccccccee Q lcl|NC_020838. 80 TTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTATKVNL 159 (981) Q Consensus 80 ~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~~~~~ 159 (981) .++.++|||+ +|+++.|+..+. T Consensus 81 -~~~~~rv~~~-~G~~~~v~~~~~-------------------------------------------------------- 102 (800) T protein:vir:97 81 -KGQVPEIFDK-YGRKCNVTSQDA-------------------------------------------------------- 102 (800) T ss_pred -cCCEEEEEec-CCcEEEEecCCc-------------------------------------------------------- Confidence 3577999999 588876531000 Q ss_pred eEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeeeeeEE Q lcl|NC_020838. 160 FDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEKEVAA 239 (981) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 239 (981) + + T Consensus 103 ------------------------------------------------------------~--~---------------- 104 (800) T protein:vir:97 103 ------------------------------------------------------------P--M---------------- 104 (800) T ss_pred ------------------------------------------------------------c--e---------------- Confidence 0 0 Q ss_pred eeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCcee Q lcl|NC_020838. 240 AYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKNKT 319 (981) Q Consensus 240 a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~~~ 319 (981) +|. ..+++..++||++|+||||||+|++++ T Consensus 105 -------------------~y~-------------------------------~~~~~~~~~l~~~tvaD~~fi~n~~~~ 134 (800) T protein:vir:97 105 -------------------TYL-------------------------------SEVVNPREDVQFMTIADVTFMLNRRKV 134 (800) T ss_pred -------------------EEE-------------------------------eccCCCccceeEEEEcCEEEEeeCcee Confidence 000 000112347899999999999999999 Q ss_pred EeccCCCCCCCCceEEEEEeeeccceEEEEeeCceEEE-EEeccCCCc---ceecHHHHHHHHHhhhhc---cCceEEEE Q lcl|NC_020838. 320 TAMKTTTSAAVPNVAFVVIRIVAYNSDYSVTLNGTTVT-HSTPDTVAG---ATTDSGSIAAALTSSINA---LTGFSATQ 392 (981) Q Consensus 320 ~~~~~~~~~~~~~~~~v~v~~g~y~~~~~v~~ng~~~~-~~t~~~~a~---~~~~~~~ia~~l~~~i~~---~~~~~~~~ 392 (981) |++.....+..+..+++++++|+|+|+|+|.||+...+ +.++++.+. ..+++++||.+|...+.. .++|++.+ T Consensus 135 ~~~~~~~~~~~~~~~~~~v~~g~y~~~y~i~I~~~~~~~~~t~~~t~~~~~~~~~~~~ia~ql~~~~~~~~~~~~~t~~~ 214 (800) T protein:vir:97 135 VKASSRKSPKVGNKAIVFCAYGQYGTSYSIVINGANAASFKTPDGGSADHVEQIRTERITSELYSKLQQWSGVSDYEIQR 214 (800) T ss_pred cccccccccCCCcceEEEEeecccceeeeeccCCcceEEEEEcCCCCcccceeccHHHHHHHHHHhhhccccccceEEEe Confidence 99988888888899999999999999999999996544 777776543 457889999999999865 36789999 Q ss_pred cCCEEEEEeCCC--ceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCcccceE Q lcl|NC_020838. 393 VGPGIYIEGTSA--FSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAARGPGV 470 (981) Q Consensus 393 vg~~i~i~~~~~--~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g~~~ 470 (981) .|.+++|..++. +.+++++|++++++.++.++|+++++||++|++|+.++|+++++++.++||++|+... .+.++ T Consensus 215 ~G~~~~i~~~~~~~~~v~t~~g~~~~~~~~~~~~v~~~~~lp~~~~~g~~v~i~~~~~~~~~~y~~~~~~~~---~~~~~ 291 (800) T protein:vir:97 215 DGTSIFIERRDGASFTITTTDGAKGKDLVAIKNKVSSTDLLPSRAPAGYKVQVWPTGSKPESRYWLQAEPKE---GNLVS 291 (800) T ss_pred CCcEEEEEEcCCceEEEEecCCcCceeeeEEeeeccchhhchhhCCCCcEEEEEccCCCCCceEEEEEEecc---cCcce Confidence 999999987654 6789999999999999999999999999999999999999999999999999998754 36779 Q ss_pred EEEeeccceeEEEccccceEEEEec----cCCceeeecccCCcccCCCcccCcCccccC----CceeEEEEEcceEEEec Q lcl|NC_020838. 471 WEETIGPSLEFEIDETTMPHQLIRQ----ANGVFKYEPVTWDDRLVGDNTTNPIPSFIG----KKINNMFFYRNRLGLLS 542 (981) Q Consensus 471 W~E~a~~~~~~~~~~~Tmp~~lv~~----a~g~f~~~~~~w~~r~~GDd~tnp~psF~g----~~ps~v~ffq~RL~f~s 542 (981) |+||++++...+++.+||||.+++. ++|+|++++++|++|.+|||++||+|+|+| ++|++|+||||||||++ T Consensus 292 w~e~~~~~~~~~~~~~tmp~~~~~~~~~~~~g~~~~~~~~w~~r~~gd~~tnp~p~f~~~~~~~~~~~v~f~q~RL~f~~ 371 (800) T protein:vir:97 292 WKETIAADVLLGFDKGTMPYIIERTDIINGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTIGGMFMVQNRLCFTA 371 (800) T ss_pred EEEeeccccccceecccceEEEEEeecccccceeEEEeccccccccCccccCccccccCCcCCCCceeEEEEeeeEEEec Confidence 9999999999999999999999997 578999999999999999999999999998 78999999999999999 Q ss_pred CCeEEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEe Q lcl|NC_020838. 543 NEAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTIS 622 (981) Q Consensus 543 ~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S 622 (981) +++|||||+||||||+++|++++.|||||+++++++++|+|+|+|+++++|+|||+++||+|+|+ ++|||+|++++++| T Consensus 372 ~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~v~~i~~~v~~~~~L~i~T~~~q~~ls~~-~~lTP~~~~~~~~s 450 (800) T protein:vir:97 372 GEAVIASRTSYFFDFFRYTVISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFILPGD-KPLEKSNALLKPVT 450 (800) T ss_pred CCeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeCC-CcccceeEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999996 49999999999999 Q ss_pred eeccccCCCcEEeCCeEEEEecCCCeeEEEEEeecccccceehhhHHHHHHHhcCCCeEEEEEcCCCcEE-EEEecCCcE Q lcl|NC_020838. 623 TFECDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEATTNVPEYVPSDIDSMTASPAMSIV-SLGKSGSNT 701 (981) Q Consensus 623 ~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~~~~h~~~~~i~~~~~s~~~~~~-~~~~~g~~~ 701 (981) +|+|+++++|+.+|++++|++++|+|++||||+|++++|+|+++|||+|++|||++++..++++++++.+ +|+...++. T Consensus 451 ~~~~~~~~~Pv~vG~~v~fv~~~g~~s~vre~~~~~~~d~~~a~DlT~~~~hl~~~~v~~~~~~~~~~~~v~~~~~~~~~ 530 (800) T protein:vir:97 451 TFEVNNKVKPVVTGESVMFATNDGSYSGVREFYTDSYSDTKKAQAITSHVNKLIEGNITNMAASTNVNRLLVTTDKYRNI 530 (800) T ss_pred eeeccCCCCcEEeCCeEEEeeCCCCeeEEEEEeeeecccceehhhHHHHHHHhcCCceEEEEEeCCCCeEEEEEEcCCCE Confidence 9999999999999999999999999999999999999999999999999999999999999998887765 466667799 Q ss_pred EEEEEeecCcchh-eeeeEeeccCC--ceEEEEEeCCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCcccccccce Q lcl|NC_020838. 702 VYQHRFFMQGENR-VQTWYKWQLTG--DLRLQFFDKTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMF 778 (981) Q Consensus 702 l~~y~y~~~~eq~-V~aWsrw~~~G--~v~sv~~~~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~ 778 (981) |++|+|+++.+|| |+|||||++++ .++++++++|+||++++|+++.+||||++.+... ..+..+.++|.. T Consensus 531 l~~~~y~~~~~e~~~~aW~~~~~~~~~~~~~~~~~~d~l~~vv~r~~~~~ler~~~~~~~~-------~~~~~~~~lD~~ 603 (800) T protein:vir:97 531 IYCYDWLWQGTDRVQSAWHVWKWPIGTKVRGMFYSGELLYLLLERGDGVYLEKMDMGDALT-------YGLNDRIRMDRQ 603 (800) T ss_pred EEEEEEeecCCceEEEeEEEEecCCCeEEEEEEEcCCeEEEEEEcCCcEEEEEEecccCcC-------cccccceecccc Confidence 9999999976655 67999999987 4667778899999999999999999998755432 123334455543 Q ss_pred -------------eeeeeeeeccCCce---EEEcccCCCcCCceEEEEEcCcccCceeEEEEecCceEEEccccccCCCE Q lcl|NC_020838. 779 -------------NVNPYRTYSTSTKK---TTVNLPFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQ 842 (981) Q Consensus 779 -------------~vd~~~ty~~~~~~---tt~~l~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~ 842 (981) ++++.+.|...... +....+++|++|+++.+..+.. .+ ++.... T Consensus 604 ~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~v~g~~~~~G~~v~~~~~~~-~~-------------------~~~~~~ 663 (800) T protein:vir:97 604 AELVFKHFKAEDEWVSEPLPWVPTNPELLDCILIEGWDSYIGGSFLFKYNPS-DN-------------------TLSTTF 663 (800) T ss_pred ceeeeeeeecccceEeccccccCCCcceeEEEEecccccccCceEEEEecCc-cC-------------------cccccc Confidence 34455555333221 2223467888888876543321 11 122334 Q ss_pred EEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeEEEEEEEEEeecccceEEEEccCCCCceee--Eecc Q lcl|NC_020838. 843 VLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDLILHRLKVSTGLSGPITYKVDITGKDEWTN--IINV 920 (981) Q Consensus 843 itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~--~~~~ 920 (981) ..+++++++++|||||+|+++++|+||+++.++|+ ....+|+||+|++|++++||+|.+.|++.++++... ..+. T Consensus 664 ~~~~~~~~~~~v~vGl~Y~~~~~~~p~~i~~~~g~---~~~~~r~~i~r~~~~~~~sg~~~~~v~~~~~~~~~~~~~~~~ 740 (800) T protein:vir:97 664 DMYDDSHVKAKVIVGQIYPQEFEPTPVVIRDNQDR---VSYIDVPVVGLVHLNLDMYPDFSVEVKNVKSGKVRRVLASNR 740 (800) T ss_pred eEEeCCCCCcEEEEeeeeeEEEEecceEEEecCCC---ceeecceEEEEEEEeecccccEEEEEccccCCceeeeecCcc Confidence 45556788999999999999999999999988774 345689999999999999999999999998876433 3333 Q ss_pred cccCc-cccCccccccCCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeecccccc Q lcl|NC_020838. 921 TLPNT-YVLNNVNLSASALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRFYRR 980 (981) Q Consensus 921 ~~~~~-~~~~~~p~~~~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~rRr 980 (981) +.... ...+.+|+ .++.+++||.+|+++.+|+|+|++||||+||+|+|||+||+|+||= T Consensus 741 ~~g~~~~~~g~~~~-~tg~~~vp~~g~~~~~~v~i~~d~PlP~tvlsi~~eg~y~~r~~rv 800 (800) T protein:vir:97 741 IGGALNNTVGYVEP-REGVFRFPLRAKSTDVVYRIIVESPHTFQLRDIEWEGSYNPTKRRV 800 (800) T ss_pred ccccccccCCcccc-ccceEEEEeecccceeEEEEEECCCCcEEEEEEEEEEEeecccccC Confidence 44333 34555665 4567899999999999999999999999999999999999999999 No 12 >protein:vir:105647 Length: 800 # NCBI annotation: putative tail tubular B protein # Family: family:all:825 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425011;genbank:gi:83571759;uniprot:Q2WC41;genbank:GeneID:3837288 Probab=100.00 E-value=1.1e-212 Score=1182.67 Aligned_cols=767 Identities=22% Similarity=0.379 Sum_probs=657.0 Q ss_pred CceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCC-CCceEEEEEeCCCc-e-EEEEE Q lcl|NC_020838. 2 STISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAE-ARGRWFPILRDEEE-K-YVCQY 78 (981) Q Consensus 2 ~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~-~~~~~~~~~rd~~e-~-y~~~~ 78 (981) -.|+||||||++|||||||.+|+|||+++|+||+|||+.||+||||++||++|.+.+ .+..+|+++|++.+ + |++.+ T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~fva~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (800) T protein:vir:10 1 MEVQGSLGRQIQGISQQPPAVRLDGQCTTMVNMVPDVVNGTQSRMGTTHIAKLLDEGTDNMATHHYRRGEGDEEYFFTLK 80 (800) T ss_pred CeEEeecchhcccccccchhHhhhhhhhhhhcceeeeccCcccCCcceEEEeecCCCCCccEEEEEecCCccceEEEEEE Confidence 368999999999999999999999999999999999999999999999999997765 45567777774443 3 33343 Q ss_pred EcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccccce Q lcl|NC_020838. 79 DTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTATKVN 158 (981) Q Consensus 79 ~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~~~~ 158 (981) .++.++|||+ +|++++|++.+.. T Consensus 81 --~g~~~rv~~~-~G~~~~v~~~~~~------------------------------------------------------ 103 (800) T protein:vir:10 81 --KGQVPEIFDK-HGRKCNVISQDAP------------------------------------------------------ 103 (800) T ss_pred --cCCeEEEEec-CCcEEEeecCCcc------------------------------------------------------ Confidence 3467999998 5877655300000 Q ss_pred eeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeeeeeE Q lcl|NC_020838. 159 LFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEKEVA 238 (981) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 238 (981) T Consensus 104 -------------------------------------------------------------------------------- 103 (800) T protein:vir:10 104 -------------------------------------------------------------------------------- 103 (800) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCc-ceeeEEEEcCEEEEeCCc Q lcl|NC_020838. 239 AAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAP-EDIEILTINDYTFVLNKN 317 (981) Q Consensus 239 ~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~-~dl~~~t~ad~tfi~n~~ 317 (981) ..++..++++ ++|+++|+||+|||+|++ T Consensus 104 ---------------------------------------------------~~~~~~~~~~~~~l~~~tvaD~tfi~n~~ 132 (800) T protein:vir:10 104 ---------------------------------------------------MTYLSEVVNPREDVQFMTIADVTFMLNRR 132 (800) T ss_pred ---------------------------------------------------eeeeeccCCchhhEEEEEEcCEEEEecCc Confidence 0011122334 379999999999999999 Q ss_pred eeEeccCCCCCCCCceEEEEEeeeccceEEEEeeCceEEE-EEeccCCCc---ceecHHHHHHHHHhhhhcc---CceEE Q lcl|NC_020838. 318 KTTAMKTTTSAAVPNVAFVVIRIVAYNSDYSVTLNGTTVT-HSTPDTVAG---ATTDSGSIAAALTSSINAL---TGFSA 390 (981) Q Consensus 318 ~~~~~~~~~~~~~~~~~~v~v~~g~y~~~~~v~~ng~~~~-~~t~~~~a~---~~~~~~~ia~~l~~~i~~~---~~~~~ 390 (981) ++|++.+..++..+..++++++.|+|+++|+|.++|...+ ++++++.++ ..++++.|+.+|..++.+. ++|++ T Consensus 133 ~~~~~~~~~~~~~~~~~~~~vr~g~y~~~y~i~i~g~~~~~~~t~~~~~~~~~~~~s~~~i~~~L~~~l~~~~~~~~~t~ 212 (800) T protein:vir:10 133 KVVKVSNRKSPKVGDKAIVFCAYGQYGTSYSIIINGTTAASFKTPDGGSAEHVEQIRTERITSELYSKLQQWSGVNDYEI 212 (800) T ss_pred ccccccccCCCCCCceEEEEEeccccccceeEEeccceEEEEEecCCCcccccccccHHHHHHHHHhhhhhcCcccceEE Confidence 9999988888888899999999999999999999997644 667766543 4578999999999998653 56889 Q ss_pred EEcCCEEEEEeCCC--ceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCcccc Q lcl|NC_020838. 391 TQVGPGIYIEGTSA--FSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAARGP 468 (981) Q Consensus 391 ~~vg~~i~i~~~~~--~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g~ 468 (981) .+.|+.++|..+++ +++++.+|++++++..+.++|+++++||..||+|+.++|+++++++.+.||++|+... .+. T Consensus 213 ~~~g~~i~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~Lp~~~~~g~~~~i~~~~~~~~~~y~~~~~~~~---~~~ 289 (800) T protein:vir:10 213 QRDGTSIFIERRDGKSFTVTTTDGAKGKDLVAIKNKVSSTDLLPSRAPAGYKVQVWPTGSKPESRYWLQAEPKE---GNL 289 (800) T ss_pred EEcCcEEEEEEecCCceEEEEeecCCcceEEEEEeeccceeeccccCCCCceEEEEcCCCCCCceeEEEEEecc---ccc Confidence 99999999987654 6688999999999999999999999999999999999999999999999999998754 357 Q ss_pred eEEEEeeccceeEEEccccceEEEEecc----CCceeeecccCCcccCCCcccCcCccccC----CceeEEEEEcceEEE Q lcl|NC_020838. 469 GVWEETIGPSLEFEIDETTMPHQLIRQA----NGVFKYEPVTWDDRLVGDNTTNPIPSFIG----KKINNMFFYRNRLGL 540 (981) Q Consensus 469 ~~W~E~a~~~~~~~~~~~Tmp~~lv~~a----~g~f~~~~~~w~~r~~GDd~tnp~psF~g----~~ps~v~ffq~RL~f 540 (981) ++|+||+++++.++++.+||||++++.+ +++|++++++|++|.+|||++||+|+|+| ++|++|+|||||||| T Consensus 290 ~~w~e~~~~~~~~~~~~~tmp~~lv~~~~~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~~~~~~~~i~~v~f~q~RL~f 369 (800) T protein:vir:10 290 VSWKETIAADVLLGFDKGTMPYIIERTGIIDGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTIGGMFMVQNRLCF 369 (800) T ss_pred eEEEeecccCceeeeecccccEEEEEeeeeecceeEEEEeccccccccCCCCCCCCchhcCCCCCCCceeEEEEeeeEEE Confidence 7999999999999999999999999987 78999999999999999999999999998 579999999999999 Q ss_pred ecCCeEEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEE Q lcl|NC_020838. 541 LSNEAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINT 620 (981) Q Consensus 541 ~s~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~ 620 (981) ++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+|+++++|+|||+++||+|+|+ ++|||+|+++++ T Consensus 370 ~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~q~~l~g~-~~lTP~~~~i~~ 448 (800) T protein:vir:10 370 TAGEAVIASRTSYFFDFFRYTVISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFILPGD-KPLEKSNALLKP 448 (800) T ss_pred eeCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcceeeeeEeecCCcEEEEecCcEEEEeCC-CcccceeEEEEE Confidence 9999999999999999999999999999999999999999999999999999999999999999996 499999999999 Q ss_pred EeeeccccCCCcEEeCCeEEEEecCCCeeEEEEEeecccccceehhhHHHHHHHhcCCCeEEEEEcCCCcEEEEE-ecCC Q lcl|NC_020838. 621 ISTFECDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEATTNVPEYVPSDIDSMTASPAMSIVSLG-KSGS 699 (981) Q Consensus 621 ~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~~~~h~~~~~i~~~~~s~~~~~~~~~-~~g~ 699 (981) +|+|+|++.|+|+.+|++++|++++|+|++||||+|++++|+|+++|||+|++|||++++.+|+++++++.++|. ...+ T Consensus 449 ~s~~~~~~~~~Pv~vG~~v~Fv~~~g~~s~vre~~~~~~~d~~~a~DlT~~~~hl~~~~v~~~~~~~~~~~~v~~~~~~~ 528 (800) T protein:vir:10 449 VTTFEVNNKVKPVVTGESVMFATNDGSYSGVREFYTDSYSDTKKAQAITSHVNKLIEGNITNMAASTNVNRLLVTTDKYR 528 (800) T ss_pred EEeeeccCCCCceEeCCeEEEecCCCCeeEEEEEeeeecccceehhhHHhHHHHhcCCceEEEEEeCCCCeEEEEEEcCC Confidence 999999999999999999999999999999999999999999999999999999999999999999988876654 4466 Q ss_pred cEEEEEEeecCcchh-eeeeEeeccCC--ceEEEEEeCCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCccccccc Q lcl|NC_020838. 700 NTVYQHRFFMQGENR-VQTWYKWQLTG--DLRLQFFDKTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLD 776 (981) Q Consensus 700 ~~l~~y~y~~~~eq~-V~aWsrw~~~G--~v~sv~~~~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD 776 (981) +.|++|+|+|+.+|| |+|||||++++ .++++++++|+||++++|+++.+||||++....+ ..+.++.++| T Consensus 529 ~~l~~~~yl~~~~e~~~~aW~~w~~~~~~~~~~~~~~~d~l~~iv~r~~~~~ier~~~~~~~~-------~~~~~~~~lD 601 (800) T protein:vir:10 529 NIIYCYDWLWQGTDRVQSAWHVWEWPMGTKVRGMFYSGELLYLLLERGDGVYLEKMDMGDALT-------YGLNDRIRMD 601 (800) T ss_pred CeEEEEEEeecCCceEEEEEEEEEcCCCcEEEEEEEeCCeEEEEEECCCcEEEEEEecccCcc-------ccccceeeee Confidence 899999999987665 57999999864 5677788999999999999999999997654332 2455677889 Q ss_pred ceeeeeeeeeccCCceEEEcccCCCcCCceEEEEEcCcccCceeEEEEecCceEEEccccccCCCEEEe-----CCCCCC Q lcl|NC_020838. 777 MFNVNPYRTYSTSTKKTTVNLPFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLL-----NGDYRG 851 (981) Q Consensus 777 ~~~vd~~~ty~~~~~~tt~~l~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL-----~gd~~~ 851 (981) ++...+...+......++..+++.|++++.+.++++.... ...+|.+. ....+.++.+++ +|+.++ T Consensus 602 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~g~v~--~~~~~~~g~~~~~~~~~~g~~~~ 672 (800) T protein:vir:10 602 RQAELIFKHFKAEDEWISEPLPWTPTNPELLDCILIEGWD-------SYIGGSFL--FKYKPSDNTLSTTFDMHDDNHVK 672 (800) T ss_pred cceeecccccccCcceEEEeccccccCCcceEEeeeccce-------eecCceeE--EEEEecCCceEeeeeecCCCccc Confidence 8888888888777777888889999999999888764311 12233332 223345666666 567889 Q ss_pred CEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeEEEEEEEEEeecccceEEEEccCCCCce-eeEecccccC--cccc Q lcl|NC_020838. 852 RDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDLILHRLKVSTGLSGPITYKVDITGKDEW-TNIINVTLPN--TYVL 928 (981) Q Consensus 852 ~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~-~~~~~~~~~~--~~~~ 928 (981) ++|+|||+|+++++|+||++++++|+. ...+|+||+|++|+|++||.|.+++++..+++. ......+..+ .+.. T Consensus 673 ~~v~VGl~Y~s~~~~~~~~i~~~~g~~---~~~~r~~i~r~~~~~~~sg~~~~~v~~~~~~~~~~~~~~~~~~g~~~~~~ 749 (800) T protein:vir:10 673 AKVIVGQIYPQEFEPTPVVIRDRQDRV---SYIDVPVVGLVHLNLDMYPDFSVEVKNVKSGKVRRVLASNRIGGALNNTV 749 (800) T ss_pred ceEEEeeeeeEEEeecceEEEcCCCcc---cccCCeEEEEEEEEeecCceEEEEeccCcccceeEEccCCeecccccccc Confidence 999999999999999999999888753 355899999999999999999999999888653 3333344332 3456 Q ss_pred CccccccCCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeecccccc Q lcl|NC_020838. 929 NNVNLSASALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRFYRR 980 (981) Q Consensus 929 ~~~p~~~~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~rRr 980 (981) |.+|+.+ +.+++|+.+|+++.+|+|+|++||||+||+|+|||+||+|+||= T Consensus 750 g~~~~~t-g~~~vp~~g~~~~~~v~i~~d~P~P~tvlai~~eg~y~~r~~rv 800 (800) T protein:vir:10 750 GYVEPRE-GVFRFPLRAKSTDAVYRIIVESPHTFQLRDIEWEGSYNPTKRRV 800 (800) T ss_pred CcccccC-ceEEEEEeccCceeEEEEEECCCCcEEEEEEEEEEEeecccccC Confidence 7777765 57899999999999999999999999999999999999999999 No 13 >protein:vir:103341 Length: 806 # NCBI annotation: tail tubular protein B-like protein # Family: family:all:825 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039670;genbank:gi:125999999;genbank:GeneID:4818417 Probab=100.00 E-value=7.8e-211 Score=1172.46 Aligned_cols=768 Identities=23% Similarity=0.371 Sum_probs=639.8 Q ss_pred CceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCC-CCCceEEEEEe-CCCceEEEEEE Q lcl|NC_020838. 2 STISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNA-EARGRWFPILR-DEEEKYVCQYD 79 (981) Q Consensus 2 ~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~-~~~~~~~~~~r-d~~e~y~~~~~ 79 (981) -.|+||||||++|||||||++|+|||+++|+||+|||+.||+||||++||++|.+. .++..||+|+| |+.|+|++.+. T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~~Gl~rRPgt~~va~l~~~~~~~~~~~~~~~~~~~~~y~v~~~ 80 (806) T protein:vir:10 1 MEVQGSYGRQLQGVSQQPIAVRLPGQVTSQLNAVPNVVDGLKTRMGSKHLARILNSLDANSLIHHYKRGDDAEEYFVILQ 80 (806) T ss_pred CeeEeecchhccceeccChhHhhhhhhhhhhcceeccccccccCCchhhhhhhcCCCCccceEEEEEecCCceEEEEEEc Confidence 48999999999999999999999999999999999999999999999999999765 56888999999 66778887774 Q ss_pred cCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCccccccee Q lcl|NC_020838. 80 TTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTATKVNL 159 (981) Q Consensus 80 ~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~~~~~ 159 (981) +|.|+|||+.+|++++|+++.... . T Consensus 81 --~g~i~v~~~~~G~~~~v~~~~~~~----~------------------------------------------------- 105 (806) T protein:vir:10 81 --PGQVPVIFTVGGLACPVNTQGSAA----T------------------------------------------------- 105 (806) T ss_pred --CCcEEEEEcCCCcEEEecCCCceE----E------------------------------------------------- Confidence 577999998889887763111100 0 Q ss_pred eEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeeeeeEE Q lcl|NC_020838. 160 FDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEKEVAA 239 (981) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 239 (981) T Consensus 106 -------------------------------------------------------------------------------- 105 (806) T protein:vir:10 106 -------------------------------------------------------------------------------- 105 (806) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred eeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecc--cCCcceeeEEEEcCEEEEeCCc Q lcl|NC_020838. 240 AYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLK--DAAPEDIEILTINDYTFVLNKN 317 (981) Q Consensus 240 a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~--~~~~~dl~~~t~ad~tfi~n~~ 317 (981) |++ ....++|+|+|+||+|||+|++ T Consensus 106 -----------------------------------------------------yl~~~~~~~~~l~~~tvaD~tfi~n~~ 132 (806) T protein:vir:10 106 -----------------------------------------------------YLSSSSLPRETTQLMTIGDYTFVLNRK 132 (806) T ss_pred -----------------------------------------------------EeccCCCCcceeeEEEEcCEEEEecCc Confidence 111 1122468899999999999999 Q ss_pred eeEeccCCCCCCCCceEEEEEeeeccceEEEEeeCceE-EEEEeccCCCc---ceecHHHHHHHHHhhhh----ccCceE Q lcl|NC_020838. 318 KTTAMKTTTSAAVPNVAFVVIRIVAYNSDYSVTLNGTT-VTHSTPDTVAG---ATTDSGSIAAALTSSIN----ALTGFS 389 (981) Q Consensus 318 ~~~~~~~~~~~~~~~~~~v~v~~g~y~~~~~v~~ng~~-~~~~t~~~~a~---~~~~~~~ia~~l~~~i~----~~~~~~ 389 (981) ++|++.....+..+.++++++++|+|+++|++++|+.. +.++++++.+. ..+.+++++.+|..++. +.++|+ T Consensus 133 ~~~~~~~~~~~~~~~~~~v~v~~g~y~~~y~i~Ing~~~a~~~t~~~~~~~~~~~~~~~~~a~~l~~~l~~~~~~~~~~~ 212 (806) T protein:vir:10 133 MPVQARGDVTPSLDNKGLVYVAYANFSFTYQILINGQVAAEHKTASSEDVKNEDLVRTDYVAGKLLENFNSRTASFPGFS 212 (806) T ss_pred EeeeecccccCCCCcceEEEEeecccCceeeEEeccceEEEEEeccCCCcccccccchhHHHHHHHhhhcccccccceeE Confidence 99998888777788889999999999999999999874 55777776644 35667888888887764 457789 Q ss_pred EEEcCCEEEEEeCCC--ceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCccc Q lcl|NC_020838. 390 ATQVGPGIYIEGTSA--FSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAARG 467 (981) Q Consensus 390 ~~~vg~~i~i~~~~~--~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g 467 (981) +++.|.+++|.+++. ..+++.+|+.++++.+++++|+++++||.+||+|+.++|+++++++.++||++|+..++ + T Consensus 213 ~~~~g~~~~i~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~lp~~~~~g~~v~i~~~~~~~~~~y~v~~~~~~~---~ 289 (806) T protein:vir:10 213 MYQDGNVLVVDNSNGANYALTTVDGADGQDLVAIRHKVTNLDTLPNRAPVGYKVQVWPTGSKPESRYWLQAESQDG---S 289 (806) T ss_pred EEEcccEEEEecCCCCccEEEEeeCCCCceeEEeecccCccccCccccCCCcEEEEeccCCCCCCceEEEEEeecc---C Confidence 999999999998775 45778899999999999999999999999999999999999999999999999988654 3 Q ss_pred ceEEEEeeccceeEEEccccceEEEEecc-----CCceeeecccCCcccCCCcccCcCccccC----CceeEEEEEcceE Q lcl|NC_020838. 468 PGVWEETIGPSLEFEIDETTMPHQLIRQA-----NGVFKYEPVTWDDRLVGDNTTNPIPSFIG----KKINNMFFYRNRL 538 (981) Q Consensus 468 ~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a-----~g~f~~~~~~w~~r~~GDd~tnp~psF~g----~~ps~v~ffq~RL 538 (981) .++|+|+++|+...+++.+||||.+++++ +++|+++.++|++|.+|||++||+|+|.| ++|++|+|||||| T Consensus 290 ~~~w~e~~~~~~~~~~~~~t~p~~~v~~~~~~~~~~~~~~~~~~w~~r~~Gd~~tn~~psF~~~~~~~~it~v~f~q~RL 369 (806) T protein:vir:10 290 KVTWVETIAPGVRKGWNAATMPHVLVRESLNANGSANFTYRPGEWEDRDVGDDLTNDFPSLLNDSSPQPISSMLMVQNRL 369 (806) T ss_pred ceEEEeecccccccceeccccceEEEeeeeeecccceeEEEecccccccccccccCccCcccCCCCCccceEEEEEeeeE Confidence 56899999999999999999999999986 78999999999999999999999999988 6899999999999 Q ss_pred EEecCCeEEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEE Q lcl|NC_020838. 539 GLLSNEAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKI 618 (981) Q Consensus 539 ~f~s~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i 618 (981) ||++|++|||||+||||||+++|+++++|||||+++++++++|+|+|+++++++|+|||+++||+|+|+ ++|||+|+++ T Consensus 370 ~f~s~~~v~~Srsgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~t~~~q~~l~~~-~~lTP~~~~~ 448 (806) T protein:vir:10 370 MLTSGEAVVASRTSRFFDFFRYTVLATVDTDPFDVFADIEEVYNIRWSAQMDGDVVLFTSDQQFTLPGD-KPLTPTSAVI 448 (806) T ss_pred EEecCCeEEEEccCCcccCccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCcEEEEeCC-CcccceeEEE Confidence 999999999999999999999999999999999999999999999999999999999999999999996 4999999999 Q ss_pred EEEeeeccccCCCcEEeCCeEEEEecCCCeeEEEEEeecccccceehhhHHHHHHHhcCCCeEEEEE-cCCCcEEEEEec Q lcl|NC_020838. 619 NTISTFECDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEATTNVPEYVPSDIDSMTA-SPAMSIVSLGKS 697 (981) Q Consensus 619 ~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~~~~h~~~~~i~~~~~-s~~~~~~~~~~~ 697 (981) +++|+|+|+++|+|+.+|++++|++++|++++||||+|++++|+|+++|||+|++|||++++..+++ +..|..++|+.. T Consensus 449 ~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~~~DlT~~~~hl~~g~~~~~~~~~~~~~~~~~~~~ 528 (806) T protein:vir:10 449 RPVTQFKMTPGVKPAPSGDSILFAFDQGSYSGIREFFTDSYSDTKKAQPATSHVDKYIRGKVLELSASSSFNRAFIITSP 528 (806) T ss_pred EEEEeecccCCCCceEeCCeEEEeeCCCCeeEEEEEEeeeeccceehhhHHHHHHHhcCCCeEEEEEeCCCCcEEEEEEc Confidence 9999999999999999999999999999999999999999999999999999999999998877665 455667889998 Q ss_pred CCcEEEEEEeecCcc-hheeeeEeeccCCc--eEEEEEeCCeEEEEEEcCC------cEEEEEEEeecCCceeEEEecCC Q lcl|NC_020838. 698 GSNTVYQHRFFMQGE-NRVQTWYKWQLTGD--LRLQFFDKTTFYAVTSSGS------NVYLTSYDLTQASESGYLTLPTG 768 (981) Q Consensus 698 g~~~l~~y~y~~~~e-q~V~aWsrw~~~G~--v~sv~~~~d~ly~vv~r~~------~~~l~~~~~~~~~~~~~~~~~~~ 768 (981) +++.|++|+|+|+.+ |+|+|||||+|+|. ++++++++|+||++|+|++ ..++|+|+..... ... T Consensus 529 ~dg~l~~~ty~~~~~e~~v~aW~rw~~~~~~~~~~~~~~~d~l~~vv~R~~~~~g~~~~~iE~~~~~~~~-------~~~ 601 (806) T protein:vir:10 529 DRNILYVYDWLYEGTEKVQNAWHKWSFPAGTVLHAVSYSNEKLYLVLTRTNTSGGVAGVYIEVMDMGDEL-------EYG 601 (806) T ss_pred CCCEEEEEEEeecCCceEEEeEEeeeeCCCeEEEEEEEecCeEEEEEEEcCCcccEEEEEEEeecCCCCC-------Ccc Confidence 999999999998754 45789999999876 6677888999999999975 2477776532111 223 Q ss_pred CcccccccceeeeeeeeeccCCc-eEE-EcccCCCcCCceEEEEEcCcccCceeEEEEecCceEEEccccccCCCEEEeC Q lcl|NC_020838. 769 EKTDVCLDMFNVNPYRTYSTSTK-KTT-VNLPFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLLN 846 (981) Q Consensus 769 ~~~~~~lD~~~vd~~~ty~~~~~-~tt-~~l~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~ 846 (981) +.++.+||+... ..+.++..+. .+. ...+..|++|+.+.++++|....+........+|... +++.+...+ T Consensus 602 ~~~~~~lD~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~g~~~~~g~~~~~~~~~~~~-----~v~~~~~~~- 674 (806) T protein:vir:10 602 LQDRVRMDRRAT-LSMTYNATTRVWTSSALPWLPQDLSSLDAVLVSGWAGYVGGAFQFSYNASNN-----TISTNFDLA- 674 (806) T ss_pred cceeeeccccce-EEEeccccccceeeeeeccccccccceeEEEEeeccccCCceEEEEEcCccc-----eEeeeeeec- Confidence 445566665322 1122332222 111 2234689999999999999755443333333333221 222222222 Q ss_pred CCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeEEEEEEEEEeecccceEEEEccCCCC--ceeeEecccccC Q lcl|NC_020838. 847 GDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDLILHRLKVSTGLSGPITYKVDITGKD--EWTNIINVTLPN 924 (981) Q Consensus 847 gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~--~~~~~~~~~~~~ 924 (981) +.++++|+|||+|+++++|++|+++++++. ....+|+||+|++|++++||.|.+.+.+.++. ....+.+.+... T Consensus 675 -~~~~~~v~vGl~Y~s~~~~t~p~~~~~~~~---~~~~~r~~l~r~~~~~~~s~~~~~~v~~~~~~~~~~~~~~~~~~~~ 750 (806) T protein:vir:10 675 -EGNTATIVVGETYWYEVEPTPPLIKDSKDR---VSYLDTPTVGNVYLNLDMYPDFSVVVTDKETLQERTVYLANKTAGS 750 (806) T ss_pred -CCCCcEEEEeeeeeEEEEECCeeEeccCCC---ccccccEEEEEEEEEeecceeeEEEEcccCCCcceeeeccCccccc Confidence 456889999999999999999999876653 34568999999999999999999999887653 344555555554 Q ss_pred ccc-cCccccccCCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeecccccc Q lcl|NC_020838. 925 TYV-LNNVNLSASALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRFYRR 980 (981) Q Consensus 925 ~~~-~~~~p~~~~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~rRr 980 (981) .+. .+.+|+ .++.+++|+.+|+++.+|+|+|++|+||+|+||+|||+||+|+||= T Consensus 751 ~~~~~g~~~~-~tg~~~vp~~~~~~~~~v~i~~d~P~P~tvlai~~eg~y~~r~~rv 806 (806) T protein:vir:10 751 ITNVIGYIAP-HEGTLRIPLRRKSTDVSFKIRSKSPATFQLRDIEWTGSYNPRKRRV 806 (806) T ss_pred cccccccccc-ccceEEEEeeecCceeEEEEEECCCCceEEEEEEEEEEeecccccC Confidence 443 455554 5668899999999999999999999999999999999999999999 No 14 >protein:vir:7021 Length: 803 # NCBI annotation: tail protein # Family: family:all:825 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853594;genbank:gi:31711676;genbank:GeneID:1481802 Probab=100.00 E-value=4.2e-210 Score=1168.44 Aligned_cols=763 Identities=22% Similarity=0.334 Sum_probs=627.7 Q ss_pred CceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCCC-CceEEEEEeCC--CceEEEEE Q lcl|NC_020838. 2 STISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAEA-RGRWFPILRDE--EEKYVCQY 78 (981) Q Consensus 2 ~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~~-~~~~~~~~rd~--~e~y~~~~ 78 (981) -.|+||||||++|||||||++|+|||+++|+||+|||+.||+||||++||++|++++. ...+|+|+|++ .|+|+++. T Consensus 1 ~~v~~s~~nl~~GvSqQ~d~~R~~~q~~~~~N~~~~~~gGl~rRpGt~~va~l~~~~~~~~~~~~~~~~~~~~e~~~~~~ 80 (803) T protein:vir:70 1 MEVQGSLGRQIQGISQQPPAVRLDGQCSEMVNMVPDVVEGTKSRMGTTHIAKLLEYGEDDMAVHHYRRGGEGEEEYFFIM 80 (803) T ss_pred CeEEeecchhccccccCchHHhhhhhhhhhhcceeeeccccccCChhhhhhhhcCCCcccceeeEEEecCCCceEEEEEE Confidence 4689999999999999999999999999999999999999999999999999988764 55678898864 57888887 Q ss_pred EcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccccce Q lcl|NC_020838. 79 DTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTATKVN 158 (981) Q Consensus 79 ~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~~~~ 158 (981) .+ +|.|||||+ +|++++|+..+.... T Consensus 81 ~~-~~~irv~~~-~G~~~~v~~~~~~~~---------------------------------------------------- 106 (803) T protein:vir:70 81 KK-GQVPEIFDK-QGRKCMVQSQDAPMT---------------------------------------------------- 106 (803) T ss_pred ec-CCeEEEEEc-CCcEEEEecCCceeE---------------------------------------------------- Confidence 54 788999998 588876631110000 Q ss_pred eeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeeeeeE Q lcl|NC_020838. 159 LFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEKEVA 238 (981) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 238 (981) | + T Consensus 107 -------------------------------------------------------------~--l--------------- 108 (803) T protein:vir:70 107 -------------------------------------------------------------Y--L--------------- 108 (803) T ss_pred -------------------------------------------------------------E--E--------------- Confidence 0 0 Q ss_pred EeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCce Q lcl|NC_020838. 239 AAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKNK 318 (981) Q Consensus 239 ~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~~ 318 (981) . .+.+..++||++|+||+|||+|+++ T Consensus 109 --------~----------------------------------------------~~~~~~~~l~~~tvaD~~fi~n~~~ 134 (803) T protein:vir:70 109 --------S----------------------------------------------EVTNPREDVQFMTIADVTFMLNRKK 134 (803) T ss_pred --------e----------------------------------------------ecCCChhheeEEEEcCEEEEecCce Confidence 0 0001113688889999999999999 Q ss_pred eEeccCCCCCCCCceEEEEEeeeccceEEEEeeCceE-EEEEeccCCCc---ceecHHHHHHHHHhhhhcc---CceEEE Q lcl|NC_020838. 319 TTAMKTTTSAAVPNVAFVVIRIVAYNSDYSVTLNGTT-VTHSTPDTVAG---ATTDSGSIAAALTSSINAL---TGFSAT 391 (981) Q Consensus 319 ~~~~~~~~~~~~~~~~~v~v~~g~y~~~~~v~~ng~~-~~~~t~~~~a~---~~~~~~~ia~~l~~~i~~~---~~~~~~ 391 (981) +|++.....+.+++.|++++++|+|+++|.|++||+. +.++++++.+. ...++++||.+|...+.+. ++|++. T Consensus 135 ~~~~~~~~~~~~~~~~~~~vr~g~y~~~y~itIng~~~a~~~t~~~~~~~~~~~~~~~~ia~~l~~~~~~~~s~a~~~~~ 214 (803) T protein:vir:70 135 IVKARPERSPQVGSTAIVFMAYGQYGTHYKIIIDGVVAAGYKTRDGAEAHHIEDIRTESIAYNLYQSLQSWDKIADYEIQ 214 (803) T ss_pred eeeeccccCCCCCCceEEEEeecCCcceEEEEeCCcceEEEEeCCCcccccccccchhhhhhhhhhheeccccccceEEE Confidence 9998888778888899999999999999999999985 55888887755 4567899999998888653 578999 Q ss_pred EcCCEEEEEeCCC---ceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCcccc Q lcl|NC_020838. 392 QVGPGIYIEGTSA---FSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAARGP 468 (981) Q Consensus 392 ~vg~~i~i~~~~~---~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g~ 468 (981) +.|..++|.++++ +.+++++|..++++..++++|+++++||+.|++|+.++|++++++++|+|||+|+..+. +. T Consensus 215 ~~g~~~~i~~~~~~~~~~~~t~~g~~~~~~~~~~~~v~~~~~Lp~~~~~g~~v~v~~~g~~~~d~y~v~~~~~~~---~~ 291 (803) T protein:vir:70 215 LDGTSIYITRRDGSTTFDITTEDGAKGKDLVAIKYKVASTDLLPSRAPEGYKVQVWPTGSKPESRYWLQAEKQNG---NI 291 (803) T ss_pred ECCcEEEEEEcCCCCeeEEEeecCcCCcEEEEEEecccceeeccccCCCCceEEEEcCCCCCCceeeEEEEeccC---Cc Confidence 9999999987553 67899999999999999999999999999999999999999999999999999987654 45 Q ss_pred eEEEEeeccceeEEEccccceEEEEeccC----CceeeecccCCcccCCCcccCcCccccC----CceeEEEEEcceEEE Q lcl|NC_020838. 469 GVWEETIGPSLEFEIDETTMPHQLIRQAN----GVFKYEPVTWDDRLVGDNTTNPIPSFIG----KKINNMFFYRNRLGL 540 (981) Q Consensus 469 ~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~----g~f~~~~~~w~~r~~GDd~tnp~psF~g----~~ps~v~ffq~RL~f 540 (981) ++|+||+++++..+++.+||||.++|.++ +.|++++++|++|.+|||+|||+|+|++ ++|++|+|||||||| T Consensus 292 ~~w~e~a~~g~~~~~~~~t~p~~~v~~~~~~~~~~~~~~~~~~~~r~~gdd~tnp~psf~~~~~~~~~~~v~f~q~RL~f 371 (803) T protein:vir:70 292 VSWKETLAADVLIGFDKSTMPYIIERTGFVNGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTLGGMFMVQNRLCV 371 (803) T ss_pred cceEeeeccceeeeeecccccEEEEEEEEeecceeEEEEeeccccccccccccCccccccCccCCCCceeEEEEeceEEE Confidence 68999999999999999999999999754 6799999999999999999999999997 579999999999999 Q ss_pred ecCCeEEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEE Q lcl|NC_020838. 541 LSNEAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINT 620 (981) Q Consensus 541 ~s~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~ 620 (981) ++|++|||||+||||||+++|++++.|||||+++++++++|+|+|+++++++|+|||+++||+|+|+ ++|||+|+++++ T Consensus 372 ~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~i~T~~~q~~l~g~-~~lTP~~~~i~~ 450 (803) T protein:vir:70 372 TAGEAVIATRTSYFFDFFRYTAVSAVATDPFDVFSDASEVYQLKHAVTLDGSTVLFADKSQFILPGD-KPLEKSNVLLKP 450 (803) T ss_pred eeCCeEEEEccCCccccccccccCCCCCccEEEEecCCcceeeEEEeecCCcEEEEecCcEEEEeCC-CcccceeEEEEE Confidence 9999999999999999999999999999999999999999999999999999999999999999996 499999999999 Q ss_pred EeeeccccCCCcEEeCCeEEEEecCCCeeEEEEEeecccccceehhhHHHHHHHhcCCCeEEEEEcCCCcEEEEEec-CC Q lcl|NC_020838. 621 ISTFECDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEATTNVPEYVPSDIDSMTASPAMSIVSLGKS-GS 699 (981) Q Consensus 621 ~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~~~~h~~~~~i~~~~~s~~~~~~~~~~~-g~ 699 (981) +|+|+|++.++|+.+|++++|++++|+|++||||+|++++|+|+++|||+|++|||++++.+++++++++.++|..+ .+ T Consensus 451 ~s~~~~~~~~~Pv~vg~~v~fv~~~g~~s~vre~~~~~~~d~y~a~Dlt~~a~hl~~~~v~~~~~~~~~~~~v~~~~~~~ 530 (803) T protein:vir:70 451 VTTFEVNNNVKPVATGESVMFATSEGAYSGIREFYTDSYSDTKKAQAITSHVNKLLEGNVIMMSASTNVNRLLVLTDKYR 530 (803) T ss_pred EEEeeccCCCccEEeCCeEEEeccCCCeeEEEEEeccccccceehhhhhhhhHhhcCCceEEEEEeCCCCeEEEEEEcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999887665554 56 Q ss_pred cEEEEEEeecCcc-hheeeeEeeccCCceEEEEEe--CCeEEEEEEcCC-cEEEEEEEeecCCceeEEEecCCCcccccc Q lcl|NC_020838. 700 NTVYQHRFFMQGE-NRVQTWYKWQLTGDLRLQFFD--KTTFYAVTSSGS-NVYLTSYDLTQASESGYLTLPTGEKTDVCL 775 (981) Q Consensus 700 ~~l~~y~y~~~~e-q~V~aWsrw~~~G~v~sv~~~--~d~ly~vv~r~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~l 775 (981) +.|++|+|++..+ |+|+|||||+|+|.++++|++ +|+||++++|+. +.+||||++....+ ..+.+ T Consensus 531 ~~l~~~~yl~~~~e~~v~aW~r~~~~g~~~~~~~~~~~d~l~~vv~r~~~g~~ier~~~~~~~~-------~~~~~---- 599 (803) T protein:vir:70 531 NIIYCYDWLWQGTERVQAAWHKWEWPLGTFIRGMFYSGEHLYLLIERGSTGVYLERMDMGDALV-------YNLND---- 599 (803) T ss_pred CeEEEEEEEecCCcEEEEeEEEEEcCCCEEEEEEEecCCEEEEEEEECCCeEEEEEEecccccc-------cCCcc---- Confidence 8899999998654 557899999999999998875 899999999975 46789987654322 12333 Q ss_pred cceeeeeeeeeccCCce-EEEcccCCCcCCce----E-EEEEcCcccCceeEEEEecCceEEEccc-----cccCCCEEE Q lcl|NC_020838. 776 DMFNVNPYRTYSTSTKK-TTVNLPFDHITGKK----L-AVVAIGTYIGDTISATSESEGSVFYFED-----SDISSNQVL 844 (981) Q Consensus 776 D~~~vd~~~ty~~~~~~-tt~~l~~~~l~g~~----v-~v~adG~~~~~~~~~~~~~dG~~~~~~~-----~tv~gg~it 844 (981) ++++||+.+|...... +......+|+++.+ + .+..+|... ..+|....... .++.. .. T Consensus 600 -~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~--------~~~~~~~~~~~~g~~t~~~~~--~~ 668 (803) T protein:vir:70 600 -RIRMDRQAELIFRHIKAEDVWVSEPLPWQPTDVTLLDCVLIDGWDS--------YIGGSFLFSYNPGDNTLTTTF--DM 668 (803) T ss_pred -eeEeccceeEeeccccCCceeeeecccccCcccceeeEEEeeeeee--------ecCCeEEEEEcCCCccceeee--eE Confidence 4444554444322110 11111223444433 1 122223211 11222211111 11222 33 Q ss_pred eCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeEEEEEEEEEeecccceEEEEccCCCCce-eeEeccc-c Q lcl|NC_020838. 845 LNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDLILHRLKVSTGLSGPITYKVDITGKDEW-TNIINVT-L 922 (981) Q Consensus 845 L~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~-~~~~~~~-~ 922 (981) +++++++++|+|||+|+++++|+||++++++|+ ....+|+||+|++|++++||+|.++|++..++.. .....++ . T Consensus 669 ~~~~~~a~~v~VGl~Y~~~~~~~~~~i~~~~~~---~~~~~~~rl~r~~~~~~~sg~~~v~v~~~~~~~~~~~~~s~~~~ 745 (803) T protein:vir:70 669 HDDDHVKAKVVVGQLYPQEFEPTQVVIRDNQER---VSYIDVPTVGLVHLNLDKYPDFKVEVKNLKSGKVRNVLASNRVG 745 (803) T ss_pred ECCCCcccEEEEeeeeeEEEeecceEEEcCCCc---cccccccEEEEEEEEeecccceEEEEecCCccccceeeccchhc Confidence 466889999999999999999999999988774 3455778999999999999999999999887743 2333333 3 Q ss_pred cC-ccccCccccccCCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeecccccc Q lcl|NC_020838. 923 PN-TYVLNNVNLSASALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRFYRR 980 (981) Q Consensus 923 ~~-~~~~~~~p~~~~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~rRr 980 (981) .. .+..+.+|+.+ +.+++|+.+|+++.+|+|+|++||||+||+|+|||+||+|+||= T Consensus 746 g~~~~~~g~~~~~t-g~~~vP~~~~~~~~~v~i~~d~P~P~tvlsi~weg~y~~r~rrv 803 (803) T protein:vir:70 746 GAINNIVGYVEPRE-GVFKFPLRSLSTDTVYRVMVESPHTFQLRDIEWEGSYNPTKRRV 803 (803) T ss_pred cccccccCcccccc-ceEEEEeeccCcceEEEEEECCCCCeEEEEEEEEEEEecccccC Confidence 32 34566777766 47899999999999999999999999999999999999996666 No 15 >protein:vir:6326 Length: 826 # NCBI annotation: tail tubular protein B # Family: family:all:825 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877473;genbank:gi:33300845;uniprot:Q7Y2D1;genbank:GeneID:1482615 Probab=100.00 E-value=2.7e-210 Score=1169.54 Aligned_cols=759 Identities=20% Similarity=0.292 Sum_probs=629.2 Q ss_pred CCceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCC---CCceEEEEEeCCCceEEEE Q lcl|NC_020838. 1 MSTISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAE---ARGRWFPILRDEEEKYVCQ 77 (981) Q Consensus 1 m~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~---~~~~~~~~~rd~~e~y~~~ 77 (981) ||+|+|+||||++|||||||++|+|||+++|+||+|||+.||+||||++||++|.... ....||+|+||+.|+|+++ T Consensus 1 M~~i~~s~~n~~~GvSqq~d~~r~~~q~~~~~N~~~~~~~G~~rRpg~~~v~~~~~~~~~~~~~~~~~~~r~~~~~~~~~ 80 (826) T protein:vir:63 1 MSYKQSAYPNLLMGVSQQVPFERLPGQLSEQINMVSDPVSGLRRRSGIELMAHLLHTDQPWPRPFLYHTNLGGRSIAMLV 80 (826) T ss_pred CceeeeecchhhcceeccCchHhhhhhhhhhhcceeeccCCcccCchhHhhhhhccCCccccccEEEEEecCCCceEEEE Confidence 9999999999999999999999999999999999999999999999999999987653 3568999999999998877 Q ss_pred EEcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccccc Q lcl|NC_020838. 78 YDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTATKV 157 (981) Q Consensus 78 ~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~~~ 157 (981) .. .+|.|||||+.+|+.+.+. T Consensus 81 ~~-~~g~irv~~~~~g~~~~~~---------------------------------------------------------- 101 (826) T protein:vir:63 81 AQ-HRGELYLFDERDGRLLMGQ---------------------------------------------------------- 101 (826) T ss_pred Ee-cCCcEEEEEcCCCeEEEcC---------------------------------------------------------- Confidence 64 4688999999866443210 Q ss_pred eeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeeeee Q lcl|NC_020838. 158 NLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEKEV 237 (981) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 237 (981) . + T Consensus 102 ----------------------------~----------------~---------------------------------- 103 (826) T protein:vir:63 102 ----------------------------P----------------L---------------------------------- 103 (826) T ss_pred ----------------------------C----------------C---------------------------------- Confidence 0 0 Q ss_pred EEeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCc Q lcl|NC_020838. 238 AAAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKN 317 (981) Q Consensus 238 ~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~ 317 (981) +.+|+++++.++|||+|+||+|||+|++ T Consensus 104 ----------------------------------------------------~~~y~~~~~~~~l~~~t~aD~~fi~n~~ 131 (826) T protein:vir:63 104 ----------------------------------------------------VHDYLKANDYRQLRAATVADDLFIANLS 131 (826) T ss_pred ----------------------------------------------------CCceeeecCccceEEEEeCCEEEEEeCC Confidence 0012222233579999999999999999 Q ss_pred eeEeccCCCC--CCCCceEEEEEeeeccceEEEEeeCce----------EEEEEeccCCCcc--------eecHHHHHHH Q lcl|NC_020838. 318 KTTAMKTTTS--AAVPNVAFVVIRIVAYNSDYSVTLNGT----------TVTHSTPDTVAGA--------TTDSGSIAAA 377 (981) Q Consensus 318 ~~~~~~~~~~--~~~~~~~~v~v~~g~y~~~~~v~~ng~----------~~~~~t~~~~a~~--------~~~~~~ia~~ 377 (981) ++|++..+.. .++++.|++++++|+|+++|+|++++. .++++++++..+. ..++++|+.+ T Consensus 132 ~~p~~~~~~~~~~~~~~~~~~~v~~g~Y~~~y~vti~~~~~~~gt~~s~t~t~~t~~~~~a~~~~~~~~~~~s~~yia~~ 211 (826) T protein:vir:63 132 VKPEADRTDIKGVDPNKAGWLYIKAGQYSKAFSMTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQ 211 (826) T ss_pred eeeeeccccccccCCCCcEEEEeeccccCceEEEEEEeccccCCccccceEEEEeccCCcccccccccceeeeeeeeeee Confidence 9999876543 345677999999999999999999863 4667777765332 2345566666 Q ss_pred HHhhhhccC------------------------ceE--EEEcCCEEEEEeCCCceEEEec-CcCcceeEEEEEEEeechh Q lcl|NC_020838. 378 LTSSINALT------------------------GFS--ATQVGPGIYIEGTSAFSISTSG-STTEEGIFAFQDQINVASR 430 (981) Q Consensus 378 l~~~i~~~~------------------------~~~--~~~vg~~i~i~~~~~~~vt~~~-g~~~t~~~~~~~~v~~~~~ 430 (981) +...+.+.+ ++. ....+.++++.++....+..+. +++.+.+......+++.++ T Consensus 212 l~~~~~a~~~~~~~~~t~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 291 (826) T protein:vir:63 212 LYGKFFGAPEYTLPNSTKKYPKVDPDANAATIAGYLNQRGVQDGYIAFRGDADIHVEVSTDMGNNYGIASGGMSLNATAD 291 (826) T ss_pred ceeeeeeccccccCCCccccceecCCcccceeecceeEecccccEEEEeeCCcccEEEccCCCCcceEEEEEeeccceee Confidence 544332221 111 1224566777777666554333 3455677788888999999 Q ss_pred ccccCCC----CcEE-----EEEccCCCcccceEEEEEcccCCcccceEEEEeeccceeEEEccccceEEEE-eccCCce Q lcl|NC_020838. 431 LPNQCEN----GYRV-----RVTNSGDVTADDIYVEFQTTNSAARGPGVWEETIGPSLEFEIDETTMPHQLI-RQANGVF 500 (981) Q Consensus 431 Lp~~~~~----G~~v-----~v~~~g~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv-~~a~g~f 500 (981) ||+.+|. |+.+ .++.++ .+.+.||++|+... ++|+||++|+..+++ +||||.|+ +.++++| T Consensus 292 l~~~~p~~~~~~~~~~~~~~~~~~~g-~~~d~~y~~~~~~~------~~w~e~~~~~~~~~~--~tmp~~l~~~~~~~~f 362 (826) T protein:vir:63 292 LPALLPGVGAPGVGVQFMDGAVMATG-STKAPVYFEWDSAN------RRWAERAAYGTDWVL--KKMPLALRWDEATDTY 362 (826) T ss_pred ccccCCCcccceEEEeeEEeEEecCC-CcccceEEEEEcCC------ceEEEEeecCccccc--ccceEEEEEeccCCeE Confidence 9888775 3333 344554 45688999998753 489999999986554 79999998 5689999 Q ss_pred eeecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEecCCeEEEEeccccccccccccccccCCccEEEEEcCCCc Q lcl|NC_020838. 501 KYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLLSNEAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKP 580 (981) Q Consensus 501 ~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~s~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~ 580 (981) ++++++|++|.+||++|||+|+|+|++|++|+||||||||+++++|||||+||||||+++|++++.|||||+++++++++ T Consensus 363 ~~~~~~w~~r~~Gd~~tnp~psf~g~~~~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdD~I~~~~ss~~~ 442 (826) T protein:vir:63 363 SLNELEYDRRGSGDEDTNPTFNFVTRGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLT 442 (826) T ss_pred EEeccccccccccccccCCCccccCCCceEEEEEeceEEEeeCCeEEEEccCCccccccccccCCCCCccEEEEEcCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEeeeccccCCCcEEeCCeEEEEecCC-CeeEEEEEeeccc Q lcl|NC_020838. 581 VTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTISTFECDAEIDAVAVGTTQAFISKSN-LYSKLFLMLNVQK 659 (981) Q Consensus 581 n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g-~~s~vre~~y~~~ 659 (981) |+|+|+++++++|+|||+++||+|+|+ ++|||+|++++++|+|+|++.|+|+.+|++++|+|++| +|++||||.|+++ T Consensus 443 ~~i~~~v~~~~~L~l~T~~~q~~ls~~-~~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~~~s~v~e~~~~~d 521 (826) T protein:vir:63 443 EPYEHAVTFNKDLIVFAKKYQAVVPGG-GIVTPRTAVISITTQYDLDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPS 521 (826) T ss_pred eeeEEEeecCCcEEEEecCcEEEEeCC-CcccceeEEEEEEEeecccCCCCceEeCCeEEEEecCCCceeEEEEEEeeec Confidence 999999999999999999999999986 59999999999999999999999999999999999987 5899999999998 Q ss_pred ccc-eehhhHHHHHHHhcCCCeEEEEEcCCCcEEEEEecCCcEEEEEEeecCcc-hheeeeEeeccCCceEEEEEeCCeE Q lcl|NC_020838. 660 EAA-ATIDEATTNVPEYVPSDIDSMTASPAMSIVSLGKSGSNTVYQHRFFMQGE-NRVQTWYKWQLTGDLRLQFFDKTTF 737 (981) Q Consensus 660 ~d~-~~a~DlS~~~~h~~~~~i~~~~~s~~~~~~~~~~~g~~~l~~y~y~~~~e-q~V~aWsrw~~~G~v~sv~~~~d~l 737 (981) .|+ |+++|||+|++|||+++|.+|++|++|++++|++.+++.|++|+|+|+.+ |+|+|||||+|+|+|+++|+++|+| T Consensus 522 ~~~~y~~~dlt~~~~~l~~~~v~~~a~s~~~~~v~~~~~~dg~l~~~~y~~~~~e~~v~aW~~~~~~g~v~~~~~i~d~l 601 (826) T protein:vir:63 522 TDSHYVAEDVTSHIPSYMPGPAEYIQAAASSGYLVFGTSTADEMICHQYLWQGNEKVQNAFHRWTLRHQIIGAYFTGDNL 601 (826) T ss_pred cccceehhHHHHHHHHhcCCCeEEEEEcCCCCEEEEEEcCCCEEEEEEEeeCCCcEEEEeEEEEecCCcEEEEEEECCeE Confidence 775 99999999999999999999999999999999999999999999998654 5578999999999999999999999 Q ss_pred EEEEEcCCcEEEEEEEeecCCceeEEEecCCCcccccccceeeeeeeeeccCCceEEEcccCCCcCCceEEEEEcCcccC Q lcl|NC_020838. 738 YAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFNVNPYRTYSTSTKKTTVNLPFDHITGKKLAVVAIGTYIG 817 (981) Q Consensus 738 y~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~vd~~~ty~~~~~~tt~~l~~~~l~g~~v~v~adG~~~~ 817 (981) |++|+|+++.+++|+.++....+.-.. +...-+++++||.+++...........+++|+++..+.++++|...+ T Consensus 602 ~~iv~r~~~~~~~r~~~e~~~~~~~~~------~~~~d~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 675 (826) T protein:vir:63 602 MVLIQKGQEIALGRMHLNSLPAREGLQ------YPKYDYWRRIEATVAGELELTKQHWDLIKDASAVYQLQPVAGAYMER 675 (826) T ss_pred EEEEEeCCCEEEEEEEEEecCCccccc------cCCccceEEEEEeeeeeeccCcceeecccCcccccEEEEeeCccccC Confidence 999999999999999765544332221 11122356788998888877777777788999999999999998665 Q ss_pred ceeEEEEecCceEEEccccccCCCE--EEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeEEEEEEEEE Q lcl|NC_020838. 818 DTISATSESEGSVFYFEDSDISSNQ--VLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDLILHRLKVS 895 (981) Q Consensus 818 ~~~~~~~~~dG~~~~~~~~tv~gg~--itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v~ 895 (981) ....... +.+|. +.+++++.+++|+|||+|+++++|+||++++++|... ..+|+||||++|+ T Consensus 676 ~~~~~~~-------------~~~g~v~l~~~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~g~~~---~~gr~~l~r~~~~ 739 (826) T protein:vir:63 676 THLGVKR-------------ETNTKVFLDVPEAVVGAVYVVGCEFWSKVEFTPPVLRDHNGLPM---TSTRAVLHRYNVN 739 (826) T ss_pred CccceEE-------------ecCCEEEEecCCCccccEEEEeeeeeEEEEecceEEEccCCCcc---eeccEEEEEEEEE Confidence 4432222 23454 4457888899999999999999999999998888644 4589999999999 Q ss_pred eecccceEEEEccCCCCc-e-eeEecccccCccccCccccccCCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEe Q lcl|NC_020838. 896 TGLSGPITYKVDITGKDE-W-TNIINVTLPNTYVLNNVNLSASALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNY 973 (981) Q Consensus 896 ~~~Sg~~~v~v~~~~~~~-~-~~~~~~~~~~~~~~~~~p~~~~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y 973 (981) +++||+|.+.|++..++. . ....+.++...++....|+..++.+++|+.+++++.+|+|+|+.|+||+|++|+|||+| T Consensus 740 ~~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~g~p~~~t~~~~vP~~~~~~~~~i~i~~d~P~p~~il~i~~~~~y 819 (826) T protein:vir:63 740 FGWTGEFLWRISDTARPNQPWYDTTPLRLFSRQLNAGEPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKS 819 (826) T ss_pred eeccccEEEEecCccccceeEeecCCceecccccccccccccceEEEEEEeeccceEEEEEEeCCCCcEEEEEEEEEEEE Confidence 999999999999988764 3 33344556555555555666778899999999999999999999999999999999999 Q ss_pred ecccccc Q lcl|NC_020838. 974 NRRFYRR 980 (981) Q Consensus 974 ~~r~rRr 980 (981) |+|+||= T Consensus 820 n~r~rrv 826 (826) T protein:vir:63 820 NQTYRRV 826 (826) T ss_pred eceeecC Confidence 9999998 No 16 >protein:vir:80253 Length: 777 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522886;genbank:gi:158345179;genbank:GeneID:5687516 Probab=100.00 E-value=1.8e-206 Score=1148.60 Aligned_cols=755 Identities=20% Similarity=0.284 Sum_probs=629.0 Q ss_pred CCceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCCC-Cce-EEEEEeCCCceEEEEE Q lcl|NC_020838. 1 MSTISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAEA-RGR-WFPILRDEEEKYVCQY 78 (981) Q Consensus 1 m~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~~-~~~-~~~~~rd~~e~y~~~~ 78 (981) ||+|+|+||||++|||||||++|+|||+++|+||+|||+.||+||||++||++|.+.+. +.. +|.++||+.|+|+++. T Consensus 1 M~~i~~~~~nf~~GvS~q~D~~ry~~q~~~~~N~~~~~~gG~~rRpGt~fv~~l~~~~~~~~~~~~~~~~~~~e~~~~l~ 80 (777) T protein:vir:80 1 MSYFAGSYRQLLFGVSQQTAKDRLEGQVESQLNMQSDLVTGPRRRSPVHLIADAMAATDANRLAYSLATFSGREVLLLVD 80 (777) T ss_pred CceeeeecchhhcccccCCchHHhhhHHhhhhcceeeeccCceeCcchHhhhhhcCCCcccceeEEEEecCCCeeEEEEE Confidence 99999999999999999999999999999999999999999999999999999987653 333 5677889999999987 Q ss_pred EcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccccce Q lcl|NC_020838. 79 DTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTATKVN 158 (981) Q Consensus 79 ~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~~~~ 158 (981) .+ +|.|+|||+.+|..+.+. T Consensus 81 ~g-~g~irv~~~~~g~~~~~~----------------------------------------------------------- 100 (777) T protein:vir:80 81 TL-DGTLTILDDATGEVLFTG----------------------------------------------------------- 100 (777) T ss_pred ec-CCeEEEEECCCCeEEEec----------------------------------------------------------- Confidence 54 688999999755221100 Q ss_pred eeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeeeeeE Q lcl|NC_020838. 159 LFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEKEVA 238 (981) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 238 (981) . T Consensus 101 -------------------------------------~------------------------------------------ 101 (777) T protein:vir:80 101 -------------------------------------T------------------------------------------ 101 (777) T ss_pred -------------------------------------C------------------------------------------ Confidence 0 Q ss_pred EeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCce Q lcl|NC_020838. 239 AAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKNK 318 (981) Q Consensus 239 ~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~~ 318 (981) . .|++++++++|+|+|+||+|||+|+++ T Consensus 102 -------------------~---------------------------------~Yl~a~~~~~l~~~q~aD~~fi~n~~~ 129 (777) T protein:vir:80 102 -------------------N---------------------------------SYLTAGTGRSIRFAALDDSVFVANTEV 129 (777) T ss_pred -------------------C---------------------------------CceeeccccceeEEEEcCEEEEEeCCc Confidence 0 022233345799999999999999999 Q ss_pred eEeccCCCC----CCCCceEEEEEeeeccceEEEEeeCceEEEE--Eec---cCCCcceecHHHHHHHHHhhhh------ Q lcl|NC_020838. 319 TTAMKTTTS----AAVPNVAFVVIRIVAYNSDYSVTLNGTTVTH--STP---DTVAGATTDSGSIAAALTSSIN------ 383 (981) Q Consensus 319 ~~~~~~~~~----~~~~~~~~v~v~~g~y~~~~~v~~ng~~~~~--~t~---~~~a~~~~~~~~ia~~l~~~i~------ 383 (981) +|++..... +.....++++++.|+|+++|+|.+++..... +.+ ...+...++.++++.+|..++. T Consensus 130 ~p~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~y~i~i~~~~~~~~~t~~~~t~~~~~~~~~~~~ia~~L~~~~~~~~~~~ 209 (777) T protein:vir:80 130 IPQTQLWSGASAYPDPTRAGYLYVVAGAFSKQYRLSITNQVTGVTTSVDVTTSATEASQATGEYVITQLRTAAEADATIG 209 (777) T ss_pred cceeeecccCCCccCcccceEEEeeccCCCceeeEeecCCcCceeEEEecCCcccccccccchhhhhhhhhhhcccccee Confidence 998754432 3445678999999999999999998754332 222 2334456788899999986663 Q ss_pred ccCceEEEEcCCEEEEEeCCCceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccC Q lcl|NC_020838. 384 ALTGFSATQVGPGIYIEGTSAFSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNS 463 (981) Q Consensus 384 ~~~~~~~~~vg~~i~i~~~~~~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~ 463 (981) +.++|+..+.|+++++.+++...+++.+|.+ ..+....+.|++..+||..+|.++.+++..+++. .++||++|+... T Consensus 210 s~~~~~~~~~g~~~~i~~~~~~~~t~~~g~~-~~~~~~~~~v~~~~~lp~~~~~~~~~~~~~~~~~-~~~~y~~~~~~~- 286 (777) T protein:vir:80 210 TAAGFAYYQDGAYLYVTAPEAIAVSTDSGSN-FLRASNAASIRDAAELPAKLPADADGFIIATGAA-KNKTYFRWVDLE- 286 (777) T ss_pred ecCceEEEeCCcEEEEEecCceeEecCCcCc-cceeeeeEEEeeccccccccccccceEEEeCCCC-CCceEEEEEccC- Confidence 3478899999999999999998888776644 5678889999999999999999999999987765 477999998643 Q ss_pred CcccceEEEEeeccceeEEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEecC Q lcl|NC_020838. 464 AARGPGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLLSN 543 (981) Q Consensus 464 ~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~s~ 543 (981) ++|+||++|+...++ +||||.+++.+ ++|+++.++|++|.+||+++||+|||+|++|++|+||||||||+++ T Consensus 287 -----~~w~e~~~~~~~~~~--~t~p~~l~~~~-~~~~~~~~~w~~r~~gd~~tn~~Psf~g~~i~~v~f~q~RL~f~~~ 358 (777) T protein:vir:80 287 -----RKWDEDASRGAQAEL--IDMPLRITYSA-PNFSLTALNYERRASGDATSNPALKFTEQGISGMTTMQGRLVLLAG 358 (777) T ss_pred -----cEEEEeecccccccc--cccceEEEecC-CceEeeccCCccccccccccCCCceecCCceeEEEEEcceeeeecC Confidence 379999999998887 69999999976 5899999999999999999999999999999999999999999999 Q ss_pred CeEEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEee Q lcl|NC_020838. 544 EAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTIST 623 (981) Q Consensus 544 ~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S~ 623 (981) ++|||||+||||||+++|++++.|||||+++++++++|+|+|+|+++++|+|||+++||+|+|+ ++|||+|++++++|+ T Consensus 359 ~~v~~Srtgd~~nF~~~s~~~~~DdDpI~~~~ss~~~~~i~~~v~~~~~L~i~T~~~e~~l~~~-~~lTP~~~~~~~~s~ 437 (777) T protein:vir:80 359 EYVCMSASGNPLRWFRASVSTQSDDDPIEVAATAPVASPYEYAVAFNKDLVLFAKTHQGLVPGA-NLLTSRNATAAVVTE 437 (777) T ss_pred CeEEEEeccCccccccccccCCCCCccEEEEEcCCcceeeeeeeecCCcEEEEecCceEEEeCC-CcccceeEEEEEEEe Confidence 9999999999999999999999999999999999999999999999999999999999999986 599999999999999 Q ss_pred eccccCCCcEEeCCeEEEEec-CCCeeEEEEEeecc-cccceehhhHHHHHHHhcCCCeEEEEEcCCCcEEEEEecCCcE Q lcl|NC_020838. 624 FECDAEIDAVAVGTTQAFISK-SNLYSKLFLMLNVQ-KEAAATIDEATTNVPEYVPSDIDSMTASPAMSIVSLGKSGSNT 701 (981) Q Consensus 624 ~~~s~~v~Pv~vG~~v~Fv~~-~g~~s~vre~~y~~-~~d~~~a~DlS~~~~h~~~~~i~~~~~s~~~~~~~~~~~g~~~ 701 (981) |+|+++++|+.+|++++|+++ .|++++||||+|++ ++|+|+++|||+|++|||+++|.+|+++++|++++|++++++. T Consensus 438 ~~~~~~~~Pv~vG~~v~Fv~~r~g~~s~v~e~~~~~~~~d~y~a~Dlt~~~~hl~~~~v~~~a~s~~p~~v~~~~~~dg~ 517 (777) T protein:vir:80 438 YSFQNSCSPVVAGRTVFFASPRSGPWSAVWEMLPSQYTDAQVEASDSTSHLPKYIAGPVRFLATSSTTSIVVVGTSNLRE 517 (777) T ss_pred eccCCCCCceEeCCeEEEEecCCCceeEEeeeeecccccCceehhHHHHHHHHhcCCceEEEEEcCCCceEEEEEcCCCe Confidence 999999999999999999975 56789999999995 5799999999999999999999999999999999999999999 Q ss_pred EEEEEeecCcc-hheeeeEeeccCCceEEEEEeCCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCcccccccceee Q lcl|NC_020838. 702 VYQHRFFMQGE-NRVQTWYKWQLTGDLRLQFFDKTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFNV 780 (981) Q Consensus 702 l~~y~y~~~~e-q~V~aWsrw~~~G~v~sv~~~~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~v 780 (981) |++|+|+|+.+ |+|+|||||+|+|+|+++|+++|+||++++|++.++||+|+.....+.. .+ ...++|++ . T Consensus 518 l~~~ty~~~~~e~~v~aW~r~~~~g~v~~v~~i~d~l~~iv~r~~~~~le~~~~~~~~d~~------~~-~~~~~D~~-~ 589 (777) T protein:vir:80 518 LVVHEYLWQGGEKVHAAWHKWSFPQDITGAYFRGDRLILLFHVAGRVILGELFMQRLGDAQ------SI-PGGFLDLY-R 589 (777) T ss_pred EEEEEEeecCCceEEEeeEEeccCCcEEEEEEECCEEEEEEEcCCeEEEEEEeeccCCCCc------cc-ceeeeeee-e Confidence 99999998655 4578999999999999999999999999999999999999765544321 11 22567765 4 Q ss_pred eeeeeeccCCceEEEcccCCCcCCceEEEEEcCcccCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEEEEEEee Q lcl|NC_020838. 781 NPYRTYSTSTKKTTVNLPFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLIIGYVY 860 (981) Q Consensus 781 d~~~ty~~~~~~tt~~l~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~VGl~y 860 (981) .++.+|++....+...+ +|+.+.+...++.+...+.... ..|.+ +.+ .....+++.++.++++|+|||+| T Consensus 590 ~~~~~~~~~~~~~~~~~--~~~~~~~~~~v~~~~~~~~~~~----~~~~v--~~~--~~~~~~~v~~~~~~~~v~VGl~y 659 (777) T protein:vir:80 590 VGAANADEEVAIPAFAA--DLYPEDSTFAYKLSGEFQSLGQ----RCGDR--RVD--GATVYIKVVGAQAGDQYRIGLRY 659 (777) T ss_pred eeeeeeCCccceeEeec--cccCCcceeEEEecCcccccce----eeeeE--EeC--CceeeEEEcCCCCCCEEEEeeee Confidence 56888988776655544 4555545444444332222111 01111 111 11235677778999999999999 Q ss_pred eEEEEeCCceeeccCCCcccceEeeeEEEEEEEEEeecccceEEEEccCCCCceeeE-ecccccCccc-cCccccccCCe Q lcl|NC_020838. 861 DMELELPTLYPTQVEGRSSVSDVTSDLILHRLKVSTGLSGPITYKVDITGKDEWTNI-INVTLPNTYV-LNNVNLSASAL 938 (981) Q Consensus 861 ~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~-~~~~~~~~~~-~~~~p~~~~~~ 938 (981) +++++|+||+++.++|+. ...+|+||+|++|+|++||+|.+.|++..+++.... .+.++...+. .+.||+. ++. T Consensus 660 ~s~~~~~~~~~~~~~g~~---~~~~r~~i~r~~~~~~~sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~-tg~ 735 (777) T protein:vir:80 660 LSKLGPTRPILRDPNGVP---ITTERTQLHRLTWSLDSTGEVTFRVADQARGESAYTTTPLRLYSRDLGAGLPLAA-TAT 735 (777) T ss_pred EEEEEeCceEEeCCCCce---eeecCeEEEEEEEEeeccccEEEEEcCCCCcceeeeecCceeccccccccccccc-ceE Confidence 999999999999887743 345899999999999999999999999888765444 4555555444 4555554 567 Q ss_pred EEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeecccccc Q lcl|NC_020838. 939 HDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRFYRR 980 (981) Q Consensus 939 ~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~rRr 980 (981) +++|+.+|+++.+|+|+|++|+||+||||+|||+||+|+||| T Consensus 736 ~~vp~~~~~~~~~v~i~~d~P~P~tilsi~~e~~y~~r~~r~ 777 (777) T protein:vir:80 736 LDTPARVDMQTAQFSLETDDYYDMNITSLEYGFRYNQRYRRQ 777 (777) T ss_pred EEEEEeecCcceEEEEEECCCCceEEEEEEEEEEeecccccC Confidence 899999999999999999999999999999999999999999 No 17 >protein:vir:78957 Length: 826 # NCBI annotation: putative tail tubular protein B # Family: family:all:825 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522826;genbank:gi:158345061;genbank:GeneID:5687447 Probab=100.00 E-value=1.2e-203 Score=1133.04 Aligned_cols=760 Identities=19% Similarity=0.284 Sum_probs=602.7 Q ss_pred CCceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCCC---CceEEEEEeCCCceEEEE Q lcl|NC_020838. 1 MSTISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAEA---RGRWFPILRDEEEKYVCQ 77 (981) Q Consensus 1 m~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~~---~~~~~~~~rd~~e~y~~~ 77 (981) ||+|+|+||||++|||||||++|+|||+++|+||+|||+.||+||||++||+++..... ...+|+++||+.|+|+++ T Consensus 1 M~~i~~~~~nl~gGvSqq~d~~r~~~q~~~~~N~~~~~~gG~~rRpgt~~va~~~~~~~~~~~~f~~~~~r~s~e~~~~l 80 (826) T protein:vir:78 1 MSYKQSAYPNLLMGVSQQVAFERLPGQLSEQINMVSDPVSGLRRRSGIELMASLLHTDQPWPRPYLYHTNLGGRSIAMLV 80 (826) T ss_pred CcceeeecchhccceecccchHhhhhhhhhhhcceeccccccccCCchHhhhhhccCCcCCceeEEEEeccCCcceEEEE Confidence 99999999999999999999999999999999999999999999999999999876543 346788999999998776 Q ss_pred EEcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccccc Q lcl|NC_020838. 78 YDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTATKV 157 (981) Q Consensus 78 ~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~~~ 157 (981) .. .+|.|||||+.+|+.+.++ T Consensus 81 ~~-~~g~irv~~~~~g~~~~~~---------------------------------------------------------- 101 (826) T protein:vir:78 81 AQ-HRGELYLFDEKDGRLLMGQ---------------------------------------------------------- 101 (826) T ss_pred EE-cCCcEEEEECCCCEEEEec---------------------------------------------------------- Confidence 64 3688999998755321000 Q ss_pred eeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeeeee Q lcl|NC_020838. 158 NLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEKEV 237 (981) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 237 (981) T Consensus 102 -------------------------------------------------------------------------------- 101 (826) T protein:vir:78 102 -------------------------------------------------------------------------------- 101 (826) T ss_pred -------------------------------------------------------------------------------- Confidence Q ss_pred EEeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCc Q lcl|NC_020838. 238 AAAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKN 317 (981) Q Consensus 238 ~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~ 317 (981) | ...+|++++++++||++|+||+|||+|++ T Consensus 102 ---------------------~-----------------------------~~~~y~~~~~~~~l~~~t~aD~~fi~n~~ 131 (826) T protein:vir:78 102 ---------------------P-----------------------------LVHDYLKASDYRQLRAATVADDLFIANLE 131 (826) T ss_pred ---------------------C-----------------------------cccceeecCCcceeEEEEEcCEEEEEcCc Confidence 0 00135667778899999999999999999 Q ss_pred eeEeccCCCCC--CCCceEEEEEeeeccceEEEEeeCce----------EEEEEeccCCCc--------ceecHHHHHHH Q lcl|NC_020838. 318 KTTAMKTTTSA--AVPNVAFVVIRIVAYNSDYSVTLNGT----------TVTHSTPDTVAG--------ATTDSGSIAAA 377 (981) Q Consensus 318 ~~~~~~~~~~~--~~~~~~~v~v~~g~y~~~~~v~~ng~----------~~~~~t~~~~a~--------~~~~~~~ia~~ 377 (981) ++|++..+... +.+..++++++.|+|+++|.|++++. .+++.++++..+ ...+..+++.+ T Consensus 132 ~~p~~~~~~~~~~~~~~~~~~~v~~g~y~~~y~v~i~~~~~~~~~~~s~t~~y~t~~~~~~~~~~~~~~~~~~~~~~a~~ 211 (826) T protein:vir:78 132 VRPEADKADVLGVDPSKTGWLYIKAGQYSKAFSLTIKVKDNATGTTYSHTATYVTPDNASTNPNLAEAPFQTSVGYIAWQ 211 (826) T ss_pred EeeeeccccccCCCCCceEEEEecccccCceeEEEeccceeecccccceeEEEEeccCCccccccccccceecchhhhee Confidence 99997655443 34567999999999999999999972 356777765532 22445677777 Q ss_pred HHhhhhccCceEE--------------------------EEcCCEEEEEeCCCceEE-EecCcCcceeEEEEEEEeechh Q lcl|NC_020838. 378 LTSSINALTGFSA--------------------------TQVGPGIYIEGTSAFSIS-TSGSTTEEGIFAFQDQINVASR 430 (981) Q Consensus 378 l~~~i~~~~~~~~--------------------------~~vg~~i~i~~~~~~~vt-~~~g~~~t~~~~~~~~v~~~~~ 430 (981) |.........|.. ...+.++++.++....+. +.++++...+....+.|+++++ T Consensus 212 l~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~ 291 (826) T protein:vir:78 212 LFGKFFGAPEYTLPNSTKKYPKVDPDPAAATVAGYLNQRGVQDGYIAFRGDGDIVVEVSTDMGNNYGIASGGMSLNATAD 291 (826) T ss_pred cceeeccccceeeeccceeEeeccccccceeeccceeecccccceEEEecCCCeEEEeccCCCccceEEEeeEEEecccc Confidence 7544322111110 112356777776655543 2334445566777788888887 Q ss_pred cccc----CCCCcEEE----EEccCCCcccceEEEEEcccCCcccceEEEEeeccceeEEEccccceEEEE-eccCCcee Q lcl|NC_020838. 431 LPNQ----CENGYRVR----VTNSGDVTADDIYVEFQTTNSAARGPGVWEETIGPSLEFEIDETTMPHQLI-RQANGVFK 501 (981) Q Consensus 431 Lp~~----~~~G~~v~----v~~~g~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv-~~a~g~f~ 501 (981) ||+. +++|+.+. +....+++.++||++|+...+ +|+||++|++. ++.+||||+++ ++++++|+ T Consensus 292 l~a~~p~~~~~~~~~~~~~~~~~~~g~~~~~~y~~~~~~~~------~w~e~a~~g~~--~~~~tmp~~l~~~~~~~~f~ 363 (826) T protein:vir:78 292 LPALLPGAGTPGTGVQFMDGAIMATGSTKAPVYFAWDAANR------RWAERAAYGTD--WVLKKMPLALRWDESTDTYS 363 (826) T ss_pred eeeeecccccceEEEEEEeeeEecCCCcccceeEEEEcCCc------eEEEeeccCcc--cccccccEEEEEecCCCeEE Confidence 7554 45555554 333445677899999987653 79999999975 56689999998 46789999 Q ss_pred eecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEecCCeEEEEeccccccccccccccccCCccEEEEEcCCCce Q lcl|NC_020838. 502 YEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLLSNEAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPV 581 (981) Q Consensus 502 ~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~s~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n 581 (981) ++.++|++|.+||+++||+|+|+|++|++|+||||||||+++++|||||+||||||++++++++.|||||+++++++++| T Consensus 364 ~~~~~w~~r~~gd~~tnp~psf~g~~i~~v~f~q~RL~f~~~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~~s~~~~ 443 (826) T protein:vir:78 364 LNELEYDRRGSGDEETNPTFNFVKRGITGMTTFQGRLVLLSQEYVCMSASNNPHRWFKKSAAALNDDDPIEIAAQGSLTE 443 (826) T ss_pred EeeccccccccCcccccCcccccCCCceEEEEEeceEEEeeCCeEEEEeccCccccccccccCCCCCCcEEEEEccCcce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEeeeccccCCCcEEeCCeEEEEecCC-CeeEEEEEeecccc Q lcl|NC_020838. 582 TLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTISTFECDAEIDAVAVGTTQAFISKSN-LYSKLFLMLNVQKE 660 (981) Q Consensus 582 ~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g-~~s~vre~~y~~~~ 660 (981) +|+|+++++++|+|||+++||+|+|+ ++|||+|++++++|+|+|+++++|+.+|+++||++++| ++++||||.|+++. T Consensus 444 ~i~~~v~~~~~L~l~T~~~e~~l~~~-~~lTP~~~~~~~~s~~~~~~~~~Pv~vG~~v~F~~~r~~~~s~v~e~~~~~~~ 522 (826) T protein:vir:78 444 PYEHAVTFNKDLIVFAKKYQAVVPGG-GIVTPRTAVISITTQYDVDTRAAPAVTGRSVYFAAERALGFMGLHEMAPSPST 522 (826) T ss_pred eEEEEEecCCcEEEEecCcEEEEeCC-CcccceeEEEEEEEeecccCCCCceEeCCeEEEEecCCCceeEEEEEEeeecc Confidence 99999999999999999999999986 49999999999999999999999999999999999887 58899999999987 Q ss_pred cc-eehhhHHHHHHHhcCCCeEEEEEcCCCcEEEEEecCCcEEEEEEeecCcc-hheeeeEeeccCCceEEEEEeCCeEE Q lcl|NC_020838. 661 AA-ATIDEATTNVPEYVPSDIDSMTASPAMSIVSLGKSGSNTVYQHRFFMQGE-NRVQTWYKWQLTGDLRLQFFDKTTFY 738 (981) Q Consensus 661 d~-~~a~DlS~~~~h~~~~~i~~~~~s~~~~~~~~~~~g~~~l~~y~y~~~~e-q~V~aWsrw~~~G~v~sv~~~~d~ly 738 (981) ++ |+++|||+|++|||+++|.+|+++++|++++|+..+++.|++|+|+|..+ |+|+|||||+|+|+|+++|+++|+|| T Consensus 523 ~~~y~~~dlt~~~~~l~~~~v~~~a~s~~~~~~v~~~~~~g~l~~~ty~~~~~e~~v~aW~~~~~~g~v~~v~~i~d~l~ 602 (826) T protein:vir:78 523 DSHYVAEDVTSHIPSYMPGPAEYIQAAASSGYLVFGTSAADEMICHQYLWQGNEKVQNAYHRWTLRHQIIGAYFTGDNLM 602 (826) T ss_pred cCccchHHHHHHHHHhcCCCeEEEEEeCCCCeEEEEEcCCCeEEEEEEEecCCcEEEEeEEEEccCCcEEEEEEECCeEE Confidence 75 99999999999999999999999999999999999999999999998654 45789999999999999999999999 Q ss_pred EEEEcCCcEEEEEEEeecCCceeEEEecCCCcccccccceeeeeeeeeccCCceEEEcccCCCcCCceEEEEEcCcccCc Q lcl|NC_020838. 739 AVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFNVNPYRTYSTSTKKTTVNLPFDHITGKKLAVVAIGTYIGD 818 (981) Q Consensus 739 ~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~vd~~~ty~~~~~~tt~~l~~~~l~g~~v~v~adG~~~~~ 818 (981) ++|+|+++.++||+.+.....+..++ .....++.++...+++ ..|+++..+.++.++..... T Consensus 603 ~vv~r~~~~~~~r~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~ 664 (826) T protein:vir:78 603 VLIQKGQEIALGRMHLNSLPAREGLQ--------YPKYDYWRRIEATVDG----------ELELTKQHWDLIKDGAAVYQ 664 (826) T ss_pred EEEEeCCCEEEEEEEEEecCCCcccc--------ccccceeEEEEEEEcc----------eeccccceeEEecCCceeee Confidence 99999999999999765444332221 1112233333333321 12333444433333321110 Q ss_pred e--eEEEEecCceEEEccccccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeEEEEEEEEEe Q lcl|NC_020838. 819 T--ISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDLILHRLKVST 896 (981) Q Consensus 819 ~--~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v~~ 896 (981) . .......++.... ...+.+...+.+|+++++++|+|||+|+++++|+||+++.++|.. ...+|+||+|++|++ T Consensus 665 ~~g~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~~~~v~VGl~y~s~~~~~~~~~~~~~g~~---~~~~r~~l~r~~~~~ 740 (826) T protein:vir:78 665 LQPQVGAYMERYQLGV-KRETSTKVFLDVPEAVVGSVYVVGCEFWSKVEFTPPVLRDHNGLP---MTSTRAVLHRYNVNF 740 (826) T ss_pred eccceeeeccccceec-cccCCCceEEEeCCCccccEEEEeeceeEEEEeCceEEecCCCcc---eeecceEEEEEEEEe Confidence 0 0111111111110 011122245888999999999999999999999999999988854 355899999999999 Q ss_pred ecccceEEEEccCCCCc--eeeEecccccCccccCccccccCCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEee Q lcl|NC_020838. 897 GLSGPITYKVDITGKDE--WTNIINVTLPNTYVLNNVNLSASALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYN 974 (981) Q Consensus 897 ~~Sg~~~v~v~~~~~~~--~~~~~~~~~~~~~~~~~~p~~~~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~ 974 (981) ++||.|.+.|++..++. .....+.++.+..+...+|+..++.+++|+.+|+++.+|+|+|+.||||+||+|+|||+|| T Consensus 741 ~~tg~~~v~v~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~t~~v~vp~~~~~~~~~i~i~~d~P~P~tvlai~~~~~y~ 820 (826) T protein:vir:78 741 GWTGEFLWRISDTARPNQPWYDTTPLRLSSRQLNAGEPLVDSAVVPLPARVDMATSKFELSCHSPYDMNVRAVEYNFKSN 820 (826) T ss_pred eccccEEEEeCCCccCcceeeeecccccccccccCCcccccceEEEEeeeccCceEEEEEEeCCCCcEEEEEEeEEEEec Confidence 99999999999888764 2344455555555444556666788999999999999999999999999999999999999 Q ss_pred cccccc Q lcl|NC_020838. 975 RRFYRR 980 (981) Q Consensus 975 ~r~rRr 980 (981) +|+||= T Consensus 821 ~r~rrv 826 (826) T protein:vir:78 821 QTYRRV 826 (826) T ss_pred ceeecC Confidence 888888 No 18 >protein:vir:1778 Length: 680 # NCBI annotation: tail protein A # Family: family:all:825 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570344;genbank:gi:18640503;genbank:GeneID:932716 Probab=100.00 E-value=2.1e-186 Score=1038.54 Aligned_cols=637 Identities=27% Similarity=0.472 Sum_probs=501.6 Q ss_pred CCceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCCCCceEEEEEeCCCceEEEEEEc Q lcl|NC_020838. 1 MSTISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAEARGRWFPILRDEEEKYVCQYDT 80 (981) Q Consensus 1 m~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~~~~~~~~~~rd~~e~y~~~~~~ 80 (981) ||+|+|+||||++|||||||++|||||+++|+||+||||+||+||||++||++|++.+.+++||+|+||+.|+|++++.. T Consensus 1 M~~v~~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~v~Gl~kRpg~~~i~~l~~~~~~~~~~~~~rd~~e~~~~~~~~ 80 (680) T protein:vir:17 1 MAAVEQMVPNLLGGISQQPDPLKLPGQVKQARNVQLDPTFGALKRPGTELIMQVTGIPKRAKWIPIMRDAREHYYVAIYR 80 (680) T ss_pred CccceecchhhhCcceecchhhcCcchhhhhhccccCcCcccccCccceeeeeccCCCCCceeEEEecCCCCeEEEEEEc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999987 Q ss_pred CCC------cEEEEEcCCCeEEEEeeccccccccccce-eeccccceeEEEEeCcEEEEecCccc--------ccccceE Q lcl|NC_020838. 81 TDG------QFRIWSLIDGQPRAVDMGTTAATGQPSGC-NITNLKSDLDVYNTAQDDTDTKLNDL--------NSKQATY 145 (981) Q Consensus 81 ~~g------~~~v~d~~~g~~~~v~~~~~~~~~~~~y~-~~~~~~~~l~~~tv~d~t~i~n~~~~--------~~~~~~~ 145 (981) +++ +|+|||+.+|++++|+++++.. ..|+ +.++++++|||+||+|||||+|++++ .++..++ T Consensus 81 ~g~~~~~~~~i~v~d~~~G~~~~v~~~~~~~---~~~~~~~~~~~~~lr~~tv~d~tfi~N~~v~~~~~~~~~~~~~~g~ 157 (680) T protein:vir:17 81 EGANESGDLRIRVFDLKAGVERAVSFVGGEV---EEYFPGDETDWEAIRSLTIGDYTFLSNPNVQPTTWSRSFSRRPEGL 157 (680) T ss_pred CCCcccccceeEEEEccCCeEEEEEcCCCce---EEEeecCCCCccceEEEEEcCEEEEECCeEEEeccCCCCCCCCeeE Confidence 643 4999999999999999888732 2233 34567889999999999999998643 4555678 Q ss_pred EEEcCCcccccceeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEee Q lcl|NC_020838. 146 TKTNDGQTATKVNLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKR 225 (981) Q Consensus 146 ~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (981) ++||+|||+++|.+ .+... ......+.+.+.+.+. .. ....+++..+... .. T Consensus 158 ~~v~~~ayg~ty~v---~ing~--------------------~~~~~~~~~~~~~~~~-~~--~~~~g~~~~~~Ag-~~- 209 (680) T protein:vir:17 158 VTIGAAGYGTSYIV---DFATE--------------------DSGQQRRWAVQEMQAP-KT--KRKKGDGSPDEAG-ET- 209 (680) T ss_pred EEEEEeeeeeEEEE---EEecc--------------------ccceeeeeeeeeeecc-cc--ccccccccCCCCc-ce- Confidence 88888888876543 22111 0000111111111100 00 0111111111000 00 Q ss_pred cceeEeeeeeeeEEeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEE Q lcl|NC_020838. 226 DGYRVYEVEKEVAAAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEIL 305 (981) Q Consensus 226 ~g~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~ 305 (981) +. + ...+....+ .+...... ..++. +.. |.. T Consensus 210 --t~-~-----~~~~a~la~--~l~~~~~~------~~~~~--g~~---------------------~~~---------- 240 (680) T protein:vir:17 210 --TV-N-----NWNGTGLSF--RVKVEARA------FLVDD--GEE---------------------YGH---------- 240 (680) T ss_pred --ee-e-----eeeeeeeee--eeeeccce------eeecC--CCc---------------------eEE---------- Confidence 00 0 000000000 10000000 00000 000 100 Q ss_pred EEcCEEEEeCCceeEeccCCCCCCCCceEEEEEeeeccceEEEEeeCc----------eEEEEEeccCCCcceecHHHHH Q lcl|NC_020838. 306 TINDYTFVLNKNKTTAMKTTTSAAVPNVAFVVIRIVAYNSDYSVTLNG----------TTVTHSTPDTVAGATTDSGSIA 375 (981) Q Consensus 306 t~ad~tfi~n~~~~~~~~~~~~~~~~~~~~v~v~~g~y~~~~~v~~ng----------~~~~~~t~~~~a~~~~~~~~ia 375 (981) +.+-++.+.|+.. ..........+. .-++.|.|.+++ ..+.++++....+...+++.|| T Consensus 241 ~y~~~~~l~~tg~-------~~~~~~~t~~v~----~~G~~y~IsI~~~~~~~~~~~~~s~~~~t~~~~~a~~at~~~Ia 309 (680) T protein:vir:17 241 NYIPYVTLLTPGN-------NTSPFPDTIRVD----VSGEGWDIKVTKQIQSKVYANLGTAQFTTPVDQSGGGASTSDIV 309 (680) T ss_pred EEeeEEEEecCCc-------cccccCceEEEe----cccceeEEEEccceeeEeccCccceeeeeccCCcccceeHHHHH Confidence 0011222222110 000001111111 223344444433 2356777777778888999999 Q ss_pred HHHHhhhhccCceEEEEcCCEEEEEeC-----CCceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCc Q lcl|NC_020838. 376 AALTSSINALTGFSATQVGPGIYIEGT-----SAFSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVT 450 (981) Q Consensus 376 ~~l~~~i~~~~~~~~~~vg~~i~i~~~-----~~~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~ 450 (981) .+|..+|.+.+++++.++|++|+|++. ..+.+++++|++++++..++++|+++++||++||+||.|+|.++++++ T Consensus 310 ~~L~~~i~~~~~~~~~~~g~~i~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~Lp~~a~~g~~v~v~~~~~~~ 389 (680) T protein:vir:17 310 TGLSAAINGLGTFTAESIGNVIRVRYSDPTRTDEFTMSARGGTSGTGLESIKYSVDTLAELPTKCWNDYQVAVRNTQDTE 389 (680) T ss_pred HHHHHhhcccCcEEEEECCCEEEEEeccCCCceEEEeeccCCCCceeeeeeeeeeccccccccccCCCcEEEEEeCCCCc Confidence 999999999999999999999999763 357789999999999999999999999999999999999999999999 Q ss_pred ccceEEEEEcccCC--cccceEEEEeeccceeEEEccccceEEEEeccCCceeeeccc-------CCcccCCCcccCcCc Q lcl|NC_020838. 451 ADDIYVEFQTTNSA--ARGPGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYEPVT-------WDDRLVGDNTTNPIP 521 (981) Q Consensus 451 ~d~yyv~~~~~~~~--~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~~-------w~~r~~GDd~tnp~p 521 (981) .|+|||+|+...+. ..++++|+||++|+..++++.+||||+++|+++|.|.++.++ |++|.+|||++||+| T Consensus 390 ~~~Yyv~~~~~~~~~~~~~~~~W~E~~~~~~~~~~~~~tmp~~l~r~~~g~f~~~~~~~~~~~~~~~~r~~Gdd~tnp~p 469 (680) T protein:vir:17 390 VDDYYVKFETDVEDADVPGSGYWVETVKNGDDGGLVDDTMPHVLVRNALGDFTFSSLNNSSYGKTWADRSVGSEDTNPHP 469 (680) T ss_pred ccceEEEEeccCcccCcccccceeecccCcccceeccCcceEEEEEccCceeEEEeeccccccccccccccCCcccCCCc Confidence 99999999886543 356789999999999999999999999999999999999886 999999999999999 Q ss_pred ccc--CCceeEEEEEcceEEEecCCeEEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecC Q lcl|NC_020838. 522 SFI--GKKINNMFFYRNRLGLLSNEAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPN 599 (981) Q Consensus 522 sF~--g~~ps~v~ffq~RL~f~s~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g 599 (981) +|+ |++|++|+||||||+|+++++|||||+||||||++++++++.|||||+++++++++++|+|+++++++|+|||++ T Consensus 470 sF~~~G~~p~~v~f~q~RL~f~s~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~t~g 549 (680) T protein:vir:17 470 TFTESGNGIYGMFMYKNRLGFLTQDAVIMSQVGDYFNFYATSGVTISDADPIDMATSDTKPVKLEAAISSTSGAILFGNQ 549 (680) T ss_pred ccccCCCCceEEEEEcceEEEeeCCeEEEEccCCcccccccccccCCCCccEEEEEcCCcceeeeEEeecCCcEEEEecC Confidence 998 889999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cEEEEEcCCccccccceEEEEEeeeccccCCCcEEeCCeEEEEecCCCeeEEEEEeecccccceehhhHHHHHHHhcCCC Q lcl|NC_020838. 600 EQFVLSTDADILSPTTTKINTISTFECDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEATTNVPEYVPSD 679 (981) Q Consensus 600 ~q~~l~g~~~~LTP~~~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~~~~h~~~~~ 679 (981) +||+|+|++++|||+|++++++|+|+|++.|+|+.+|+++||++++|++++||||.|++++|+|+++|||+|++|||+++ T Consensus 550 ~q~~ls~~~~~lTP~~~~i~~~s~~~~~~~~~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~y~a~DlT~~a~hl~~g~ 629 (680) T protein:vir:17 550 AQFRLSSPDESFGPKTATLDKISNYTYESKADPVQTGVSMIFPTNMGTYSSVYELSTESAKGTPVIEDSSRVIPRLIPSG 629 (680) T ss_pred eEEEEecCCceecceeEEEEEEEeecccCCCCceEeCCeEEEeecCCCcceEEEEeeeeccCceehhhHHHHHHHhcCCc Confidence 99999987679999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEE-EEcCCCcEEEEEecCCcEEEEEEeecCcch-heeeeEeeccCCceE Q lcl|NC_020838. 680 IDSM-TASPAMSIVSLGKSGSNTVYQHRFFMQGEN-RVQTWYKWQLTGDLR 728 (981) Q Consensus 680 i~~~-~~s~~~~~~~~~~~g~~~l~~y~y~~~~eq-~V~aWsrw~~~G~v~ 728 (981) +..+ +++++|.+|+|++.+++.|++|+|||..+| +|+|||||+|++.=+ T Consensus 630 v~~~~~~~~~~~~~~~~~~~~~~l~~~~yl~~~~e~~v~aW~rw~~~~~d~ 680 (680) T protein:vir:17 630 LTWSTASMNNDTVFFGNAKKGRNVYVFRFFNEGQERKVAGWTTWYYEDQDH 680 (680) T ss_pred eEEEEeeCCCCeEEEEEEcCCCEEEEEEEeeCCCceEEEEEEEEecCCCCC Confidence 8876 556677788999999999999999987555 567999999998654 No 19 >protein:vir:103790 Length: 768 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024932;genbank:gi:48697202;genbank:GeneID:2846114 Probab=100.00 E-value=2e-154 Score=863.17 Aligned_cols=723 Identities=15% Similarity=0.133 Sum_probs=525.5 Q ss_pred CCceeecchhhhcc-----ccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCCCCceEEEEEeCCCceEE Q lcl|NC_020838. 1 MSTISQRIPNLLLG-----VSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAEARGRWFPILRDEEEKYV 75 (981) Q Consensus 1 m~~v~~s~~~l~~G-----vSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~~~~~~~~~~rd~~e~y~ 75 (981) ||+++++.+||.+| +++|+|++|+++|+++|+||+++|+.||+||||++||+++.+.......+++.|++.+.|+ T Consensus 1 M~~~~~~~~~F~~GelsP~l~~r~Dl~ry~~~~~~~~N~~~~~~gGl~rRpGt~fv~~~~~~~~~~~lipf~~~~~~~y~ 80 (768) T protein:vir:10 1 MPKAAPQQVSFDAGELSPLLGARVDLAKYPNGCQVMENFIATVQGPAIRRGGKRFVAATKDSTKQSWLLPFIVADGIAYM 80 (768) T ss_pred CCcceeeeeeccCceechhhcccchHHHHHHHHhhhhcceeeecCCceecCchhhhhhhcCCCCCeeEEEEEecCccEEE Confidence 99999999999999 8999999999999999999999999999999999999998765544444555699999999 Q ss_pred EEEEcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccc Q lcl|NC_020838. 76 CQYDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTAT 155 (981) Q Consensus 76 ~~~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~ 155 (981) +.|. ++.|+||+. +|..+.- +. + + T Consensus 81 l~fg--~~~irv~~~-~g~v~~~---~~-------------~-----------~-------------------------- 104 (768) T protein:vir:10 81 LEFG--DHYIRFFVN-RGQLVNA---GA-------------P-----------V-------------------------- 104 (768) T ss_pred EEEc--CCEEEEEEC-CcEEEec---Ce-------------e-----------E-------------------------- Confidence 9984 677999987 3543210 00 0 0 Q ss_pred cceeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeee Q lcl|NC_020838. 156 KVNLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEK 235 (981) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 235 (981) +. T Consensus 105 -----------------------------------------------e~------------------------------- 106 (768) T protein:vir:10 105 -----------------------------------------------EI------------------------------- 106 (768) T ss_pred -----------------------------------------------EE------------------------------- Confidence 00 Q ss_pred eeEEeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcc-eeeEEEEcCEEEEe Q lcl|NC_020838. 236 EVAAAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPE-DIEILTINDYTFVL 314 (981) Q Consensus 236 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~-dl~~~t~ad~tfi~ 314 (981) ..|| ..+|++..+++ +|+++|+||++||+ T Consensus 107 ---------------------~tp~-----------------------------~~~~l~~~~~~~~L~~~q~aD~~~i~ 136 (768) T protein:vir:10 107 ---------------------ATPY-----------------------------ALADLTTEDGTFAIRATQSADTMYLF 136 (768) T ss_pred ---------------------EcCC-----------------------------CcceeecccccceeEEEeecCEEEEE Confidence 0000 00133333343 68888888888888 Q ss_pred CCceeEeccCCCCCCCCceEEEEEeeeccce-----EEEEeeCceEEEEEeccCCCcceecHHHHHHHHHhhhh---ccC Q lcl|NC_020838. 315 NKNKTTAMKTTTSAAVPNVAFVVIRIVAYNS-----DYSVTLNGTTVTHSTPDTVAGATTDSGSIAAALTSSIN---ALT 386 (981) Q Consensus 315 n~~~~~~~~~~~~~~~~~~~~v~v~~g~y~~-----~~~v~~ng~~~~~~t~~~~a~~~~~~~~ia~~l~~~i~---~~~ 386 (981) |++++|+.....+...+..+.+..+.+.|.. .+++...+...+.+. ...++...++++...+..... ... T Consensus 137 ~~~~~p~~l~r~~~~~w~l~~~~~~~gp~~~~n~~~~vti~~s~~~~~~T~--tasa~~~~~~~v~~~~~l~~~~~~~~~ 214 (768) T protein:vir:10 137 HGGYPTQKLLRTSATTFSLQPVTFVGGPFAAVNSDNNVRVHASAGTGAVTL--VASASVFRPSDVGTLFYLEQEDNSFVK 214 (768) T ss_pred cCCcceeEEEEecCCCceeEEeeecCccccccccceeEEEEecccceeEEE--eecCCccchhhcceeeeeeeecccccc Confidence 8888887655555555556666666665543 334444444333332 122333445555444322211 123 Q ss_pred ceEEE-EcCCEEEEEeCCCceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCcccc-----eEEEEEc Q lcl|NC_020838. 387 GFSAT-QVGPGIYIEGTSAFSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADD-----IYVEFQT 460 (981) Q Consensus 387 ~~~~~-~vg~~i~i~~~~~~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~-----yyv~~~~ 460 (981) +|... ..|..++....+...+.+..|..... ....+++.+..|+...+.....+..+. .++++.. T Consensus 215 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~---------~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 285 (768) T protein:vir:10 215 PWVVHQKIGPSELRRVGDRVYLCTAVGTATPQ---------VTGTETPTHTSGSRWDGTGQDESATDEYGSIGAEWEYQH 285 (768) T ss_pred ccEEEEeeeeEEEEecCCceEEeeeecccccc---------ccceeccccccCceEEEecCcccccccccccceEEEEEE Confidence 44433 34444444444443443333321110 011123344456555444433222111 1122211 Q ss_pred ccCCcccceEEEEeeccceeEEEccccceEEEEeccCCceeee--------cccCCcccCCCcccCcCccccCCceeEEE Q lcl|NC_020838. 461 TNSAARGPGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYE--------PVTWDDRLVGDNTTNPIPSFIGKKINNMF 532 (981) Q Consensus 461 ~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~--------~~~w~~r~~GDd~tnp~psF~g~~ps~v~ 532 (981) . ..+...+.++...+|++..++..++++... ...|.-+..+++++||. |++|+ T Consensus 286 ~------------~~~~~~i~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~g~-------Ps~v~ 346 (768) T protein:vir:10 286 S------------GYGTVLITGYTNDQVVTGTVATNDPADPGMLPNTVVTLTGTYKWARSLFNSTDGF-------PQMGT 346 (768) T ss_pred c------------CCceEEEEEecCCeeEEeeeeeecCcccccccccccccCCCcccccCCCcCCCCC-------ceEEE Confidence 1 112333456677788888887766655443 33455566666666665 55689 Q ss_pred EEcceEEEecCCeEEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcC--Ccc Q lcl|NC_020838. 533 FYRNRLGLLSNEAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTD--ADI 610 (981) Q Consensus 533 ffq~RL~f~s~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~--~~~ 610 (981) ||||||+|++|++|||||+||||||++++++.+.|||||+++++++++++|+|++++ ++|+|||+++||+|+|+ +++ T Consensus 347 f~q~RL~f~~~~~v~~Srtgd~~nF~~~s~~~~~DdD~I~~~~ss~~~~~i~~~v~~-~~L~i~T~~~q~~l~~~~~~~~ 425 (768) T protein:vir:10 347 FWRNRLCLMRDRWLAMSVSADFETFKTKDADQQTDDSAIVQQLNARQLNKLAWMVES-DSLLIGMTGDEWVIGPANASQP 425 (768) T ss_pred EEeeeEEEeeCCEEEEEcccccccccccccccccCCccEEEEecCCcceeEEEEeec-CcEEEEecCceEEEecCCCCcc Confidence 999999999999999999999999999999999999999999999999999999999 58999999999999985 458 Q ss_pred ccccceEEEEEeeeccccCCCcEEeCCeEEEEecCCCeeEEEEEeecccccceehhhHHHHHHHhcCC------CeEEEE Q lcl|NC_020838. 611 LSPTTTKINTISTFECDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEATTNVPEYVPS------DIDSMT 684 (981) Q Consensus 611 LTP~~~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~~~~h~~~~------~i~~~~ 684 (981) |||+|++++++|.|+++ +++|+.+|++++|+|++|+ +||||.|++++|+|+++|||+|++||+++ +|..|+ T Consensus 426 lTP~~~~i~~~s~~g~~-~~~Pv~vG~~v~fv~~~g~--~vre~~y~~~~d~y~a~DlT~~a~hl~~~~~~~~~~i~~~a 502 (768) T protein:vir:10 426 VSAANLNAARRTSYGSK-RIQPVQVGGTIMFVQKAGR--KLRDFKYDFSSDNYVSTDVTKIADHITRGRAGTNSGIMSLC 502 (768) T ss_pred cccceEEEEEeehhccc-ccccEEeCCeEEEEcCCCC--EEEEEEeeeecCceecchhhhhhhhhccccCccccceeeEE Confidence 99999999999999775 7999999999999999995 79999999999999999999999999985 478899 Q ss_pred EcCCCcEEEEEecCCcEEEEEEeecCcc-hheeeeEeecc-CCceEEEEEe------CCeEEEEEEcCCcEEEEEEEeec Q lcl|NC_020838. 685 ASPAMSIVSLGKSGSNTVYQHRFFMQGE-NRVQTWYKWQL-TGDLRLQFFD------KTTFYAVTSSGSNVYLTSYDLTQ 756 (981) Q Consensus 685 ~s~~~~~~~~~~~g~~~l~~y~y~~~~e-q~V~aWsrw~~-~G~v~sv~~~------~d~ly~vv~r~~~~~l~~~~~~~ 756 (981) ++.+|..++|+.++++.|++|+|+++.+ |+|+|||||++ +|.|+++|++ +|.||++|+|+.+...+++.... T Consensus 503 ~~~~p~~v~~~v~~dg~l~~~ty~~e~~~q~v~aW~~~~~~~g~v~~v~~i~~~~g~~d~l~~~v~r~~~g~~~~~ie~l 582 (768) T protein:vir:10 503 FQQEPHSVVWAARADGQLIGCTYDEEAGRSDVYGWHRHPDANGFVECVASMPAPDGASDDLWVIVRRQVNGQTVRYVEYL 582 (768) T ss_pred EeecCCeEEEEEecCCeEEEEEEecCCCceeEEeEEEEEcCCCEEEEEEEEecCCCCccEEEEEEEecCCCeEEEEEEec Confidence 9999998888888889999999986543 66889999975 7889999987 68999999998775554432111 Q ss_pred CCceeEEEecCCCcccccccceeeeeeeeeccCCceEEEcccCCCcCCceEEEEEcCcccCceeEEEEecCceEEEcccc Q lcl|NC_020838. 757 ASESGYLTLPTGEKTDVCLDMFNVNPYRTYSTSTKKTTVNLPFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDS 836 (981) Q Consensus 757 ~~~~~~~~~~~~~~~~~~lD~~~vd~~~ty~~~~~~tt~~l~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~ 836 (981) .. ....+....|++++||+++|++.+..+ ..+++||+|+++.+++||..+ ++. T Consensus 583 ~~--------~~~~~~~~~~~~~~D~~~~~~~~~~~~--~~gl~~leg~~v~v~~dG~~~-----------------~~~ 635 (768) T protein:vir:10 583 NP--------ALQDDEPQSSAFYVDAGITYNGVPTST--IAGLGHLEGVTVAVLTDGAVH-----------------PSR 635 (768) T ss_pred Cc--------ccccccccccceEeccccccCCcceee--ecCCCCcccceEEEEECCEec-----------------cCc Confidence 11 012223345789999999998765433 346789999999999998643 344 Q ss_pred ccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeEEEEEEEEEeecccceEEEEccCCCCceee Q lcl|NC_020838. 837 DISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDLILHRLKVSTGLSGPITYKVDITGKDEWTN 916 (981) Q Consensus 837 tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~ 916 (981) ++.+|+|+|+ +++++|+|||+|+++++|+||+++.++|... .+|+||+|++|+|.+|+++.++++...++ .. T Consensus 636 ~v~~g~itl~--~~~~~v~vG~~y~s~~~~~p~~~~~~~gs~~----~~~~ri~r~~v~~~~S~~~~~~~~~~~~~--~~ 707 (768) T protein:vir:10 636 TVTAGAITLD--WSASIVHIGVPTTCRIQTMQLNAGAANGTAQ----GKTKRVTNIATRFSRSLGGVVGPTFDDND--LE 707 (768) T ss_pred eecCCEEEeC--CCCceEEEeEeeeEEEEecceEeecCCcccc----ccceEEEEEEEEEecccceEEEecCCCCC--ce Confidence 5678899996 6799999999999999999999988877543 47899999999999999999987554432 34 Q ss_pred EecccccCccccCccccccCCeEEEEe-eccCcceEEEEEECCCCCEEEEEEEEEEEeeccc Q lcl|NC_020838. 917 IINVTLPNTYVLNNVNLSASALHDVPI-YQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRF 977 (981) Q Consensus 917 ~~~~~~~~~~~~~~~p~~~~~~~~vp~-~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~ 977 (981) .++.|..+.. ++.+++..++.+++|+ .+|+++.+|+|+|++|+||+||+|+||+++|.|+ T Consensus 708 ~~~~r~~~~~-~~~~~~l~TG~~~v~~~~~~~~~~~i~i~~d~P~P~tvlsi~~~~~~nd~~ 768 (768) T protein:vir:10 708 QLSFRKPSNA-MDRAVPLFDGDMESDWRGGYEGQSWICYQNDQPLPVTLLGFFPILDTQDDR 768 (768) T ss_pred eeeeEecCcc-cCccCCcccCEEEEEecCCCCcceEEEEEECCCCCEEEEEEEEEEEEeecC Confidence 4555655443 4555444566789998 5568999999999999999999999999999998 No 20 >protein:vir:95324 Length: 823 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512269;genbank:gi:89152436;genbank:GeneID:3952993 Probab=100.00 E-value=7.1e-144 Score=805.37 Aligned_cols=699 Identities=15% Similarity=0.156 Sum_probs=502.4 Q ss_pred CCceeecchhhhcc-----ccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCCCCceEEEEEeCCCceEE Q lcl|NC_020838. 1 MSTISQRIPNLLLG-----VSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAEARGRWFPILRDEEEKYV 75 (981) Q Consensus 1 m~~v~~s~~~l~~G-----vSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~~~~~~~~~~rd~~e~y~ 75 (981) |+ +++..+||.+| +++|+|++|+++|+++|+||+++|+.||+||||++||+.+.+.+.....++..+++.|+|+ T Consensus 1 m~-i~~~q~sF~~GElsP~l~gR~Dl~ry~~q~~~~~N~~~~~~GGl~rRpGt~fva~~~~~~g~~rLipf~~s~~q~y~ 79 (823) T protein:vir:95 1 MA-ISWIQPSFAGGEIGPSLYGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPNRKCRLIPFQFSTVQTYA 79 (823) T ss_pred Cc-ceeechhccCceechheeccchHHHHHHHHhhhhCcEeeecCCceecCchhhhhhhcCCCCCeeEEEEEeCCCcEEE Confidence 99 99999999999 9999999999999999999999999999999999999998776544444455789999999 Q ss_pred EEEEcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccc Q lcl|NC_020838. 76 CQYDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTAT 155 (981) Q Consensus 76 ~~~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~ 155 (981) +.|. ++.||||+.. |.. | .. + T Consensus 80 Lefg--~~~irV~~~~-g~v--v---------------~~-----------------------------------~---- 100 (823) T protein:vir:95 80 LEFG--HQYMRVIKDG-ALV--L---------------NS-----------------------------------S---- 100 (823) T ss_pred EEEc--CCeEEEEeCC-cEE--E---------------ec-----------------------------------C---- Confidence 9984 6779999642 200 0 00 0 Q ss_pred cceeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeeee Q lcl|NC_020838. 156 KVNLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVEK 235 (981) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~ 235 (981) +. T Consensus 101 ----------------------------------------------------------~~-------------------- 102 (823) T protein:vir:95 101 ----------------------------------------------------------NV-------------------- 102 (823) T ss_pred ----------------------------------------------------------Cc-------------------- Confidence 00 Q ss_pred eeEEeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeC Q lcl|NC_020838. 236 EVAAAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLN 315 (981) Q Consensus 236 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n 315 (981) +| .+.+++++++.++|||+|+||++||+| T Consensus 103 -----------------------~~----------------------------ev~tPy~~~~l~~Lr~~qsaD~~fivh 131 (823) T protein:vir:95 103 -----------------------IY----------------------------EIATPYTEADLFRIKFTQSADVLTLVH 131 (823) T ss_pred -----------------------ee----------------------------EEecccccccccceeEEEeccEEEEEc Confidence 00 001123445557999999999999999 Q ss_pred CceeEeccCCCCCCCCceEEEEEeeeccceEEEEeeCceEEEEEeccCCCcceecHHHHHHHHHhhhhccCceEEEEcCC Q lcl|NC_020838. 316 KNKTTAMKTTTSAAVPNVAFVVIRIVAYNSDYSVTLNGTTVTHSTPDTVAGATTDSGSIAAALTSSINALTGFSATQVGP 395 (981) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~v~v~~g~y~~~~~v~~ng~~~~~~t~~~~a~~~~~~~~ia~~l~~~i~~~~~~~~~~vg~ 395 (981) ++++|+.....+...+..+.+..+.+.|...- .+. ....++.+.+...+.+. +.+.|....+|. T Consensus 132 ~~~~p~~L~r~~~~~w~l~~~~~~~gp~~~~~---~~~--t~~v~~~~~~~~~t~ta-----------~~~~~~~d~vg~ 195 (823) T protein:vir:95 132 PAYPPKELRRYAHDNWQLVDVVTKNGPFEDIN---IDE--SLTVYASASTGTITLTA-----------SASIFGAEQVGK 195 (823) T ss_pred CCccceEEEecCCCCceEEEEEEecccccccc---ccc--eeEEeccccCceeEEee-----------cccccchhhccc Confidence 99999876655555556666777777776521 111 11112222222222111 223455566677 Q ss_pred EEEEEeCCCceEE--EecC--cCcceeE-EEE-E----EEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCc Q lcl|NC_020838. 396 GIYIEGTSAFSIS--TSGS--TTEEGIF-AFQ-D----QINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAA 465 (981) Q Consensus 396 ~i~i~~~~~~~vt--~~~g--~~~t~~~-~~~-~----~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~ 465 (981) .+++.....-.+. .... ..+.... ... + ..+..+..|...+..+.+...+ +..+++|+.+.... T Consensus 196 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~--- 269 (823) T protein:vir:95 196 LFYLEQPAVDSVPVWETSKSTSIGDIRRADSNYYRAVTAGKTGTLRPSHTEGTSWDGWGG---SGDDDTGIEWEYLH--- 269 (823) T ss_pred eEEEeccccceeeecceeeeecccceEEecccceeeeeccccceeecccCCcceEEecee---cccccceeEEEEEe--- Confidence 7776543321111 1110 0011000 000 0 0011122334444444444333 23456666664432 Q ss_pred ccceEEEEeecccee-EEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEec-- Q lcl|NC_020838. 466 RGPGVWEETIGPSLE-FEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLLS-- 542 (981) Q Consensus 466 ~g~~~W~E~a~~~~~-~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~s-- 542 (981) .+.+.|++++.++.. .+....+||+.++++++++|.++...|++ .+.||++|+||||||+|++ T Consensus 270 ~~~g~~~~t~v~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~--------------~~g~Ps~v~f~q~RL~f~g~~ 335 (823) T protein:vir:95 270 SGFGIARITAVNGTTATAEVISYIPSQVVGEDNASYKWAKYAWNS--------------VNGYPGTVVYYQQRLYFAAST 335 (823) T ss_pred CCcceEEEEeecceeeeceEeeeeccccccCCcCCccccccccCc--------------CCCCccEEEEEeceEEEEEcC Confidence 234578887655533 33445679999999999999888777754 3367889999999999995 Q ss_pred --CCeEEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCC-ccccccceEEE Q lcl|NC_020838. 543 --NEAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDA-DILSPTTTKIN 619 (981) Q Consensus 543 --~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~-~~LTP~~~~i~ 619 (981) |++|||||+||||||+++++ ++|||||+++++++++|.|+|+++++ +|++||+++||+|+|++ ++|||+|++++ T Consensus 336 ~~p~~v~~Srtgd~~nF~~~~~--~~DdD~I~~~~s~~~~~~i~~~v~~~-~Lli~t~~~e~~l~~~~~~~lTP~~~~~~ 412 (823) T protein:vir:95 336 AFPQTIWASRTGDYKDFGKSNP--TQDDDRIIYTYAGRQVNEIRHLIDVG-SLVALTSGGEYVITGDQNKVLTPSSFAFS 412 (823) T ss_pred CCCcEEEEeccCCccccccccC--CCCCCcEEEEEcCCcceEEEEEeecC-cEEEEecCcEEEEEcCCCcccceeeEEEE Confidence 68999999999999999984 57999999999999999999999995 79999999999999864 58999999999 Q ss_pred EEeeeccccCCCcEEeCCeEEEEecCCCeeEEEEEeecccccceehhhHHHHHHHhcCC-CeEEEEEcCCCcEEEEEecC Q lcl|NC_020838. 620 TISTFECDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEATTNVPEYVPS-DIDSMTASPAMSIVSLGKSG 698 (981) Q Consensus 620 ~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~~~~h~~~~-~i~~~~~s~~~~~~~~~~~g 698 (981) ++|+|++ ++++|+.+|++++|+|++|+ +||||.|++++|+|+++|||+|++|++++ ++.+|+++..|+.++|+..+ T Consensus 413 ~~s~~g~-~~~~Pv~vg~~~~Fv~~~g~--~vre~~~~~~~d~~~~~dlT~~a~hl~~~~~i~~~a~~~~p~~~~~~v~~ 489 (823) T protein:vir:95 413 SQGSNGS-SNVPPIAVANIALFVQEKGS--VVRDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCIRD 489 (823) T ss_pred Eeecccc-ccccceEeCCeEEEEecCCC--EEEEEEEeeecCceecchhhhhhhhhcCCCceEEEEEecCCCeEEEEEec Confidence 9999865 57999999999999999985 89999999999999999999999999986 78899988888877777777 Q ss_pred CcEEEEEEeecCcchheeeeEeeccCCceEEEEEe----CCeEEEEEEcCCc----EEEEEEEeecCCceeEEEecCCCc Q lcl|NC_020838. 699 SNTVYQHRFFMQGENRVQTWYKWQLTGDLRLQFFD----KTTFYAVTSSGSN----VYLTSYDLTQASESGYLTLPTGEK 770 (981) Q Consensus 699 ~~~l~~y~y~~~~eq~V~aWsrw~~~G~v~sv~~~----~d~ly~vv~r~~~----~~l~~~~~~~~~~~~~~~~~~~~~ 770 (981) ++.|++|+|+ +||+|.|||||+++|+|+++|++ +|.||++|+|+.+ .|+|||... . T Consensus 490 dG~l~~~ty~--~~q~v~aW~~~~~~g~~~~~~~i~~~~~d~l~~~v~R~i~g~~~~yiE~~~~~--------------~ 553 (823) T protein:vir:95 490 DGKLLVMTYL--RDQQVFAWAPQSSTGKYESTCSISEGNEDAVYFVVNRTVNGQTVRYIERLSSR--------------L 553 (823) T ss_pred CCcEEEEEEe--cccceeeeEEEecCCcEEEEEEecCCCCCEEEEEEEeccCCeEEEEEEeeccc--------------c Confidence 7789999885 79999999999999999999986 6899999999644 456665421 1 Q ss_pred ccccccceeeeeeeeeccCCceE---EE---------------------------------------------------- Q lcl|NC_020838. 771 TDVCLDMFNVNPYRTYSTSTKKT---TV---------------------------------------------------- 795 (981) Q Consensus 771 ~~~~lD~~~vd~~~ty~~~~~~t---t~---------------------------------------------------- 795 (981) +....|++++||+++|++.+... ++ T Consensus 554 ~~~~~~~~~lD~~~s~~g~~~~~~~~~l~~g~~~l~~l~g~~v~~adg~~~~~~~v~g~i~l~~~~~~~~vGl~~~~~i~ 633 (823) T protein:vir:95 554 FTSDEDAFFVDSGLSYDGRNTSDRTMTITGGSGEWDYLAEYTISVSGGAYFTSSDVGAQLQFPYTGADPDTGYEVSKELR 633 (823) T ss_pred CCCccceeEEEEEEEeecCcccceeeEecCCCCcccccCceEEEecCcceECCccceeEEEeCcCCCccccccceEEEEE Confidence 22223455555555554432211 11 Q ss_pred --------------------------------------cccCCCcCCceEEEEEcCcccCceeEEEEecCceEEEccccc Q lcl|NC_020838. 796 --------------------------------------NLPFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDSD 837 (981) Q Consensus 796 --------------------------------------~l~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~t 837 (981) ..+|+||||++|.|++||+. +++.. T Consensus 634 ~~~~~v~~~~a~~~~~~r~v~a~l~~~~t~~~~~~~~~~~gL~hleg~tv~v~~dg~~-----------------~~~~~ 696 (823) T protein:vir:95 634 CDIISVTSNTAVVVRANRNVPPSLRNVATTNWQMARRTFGGLSHLEGQTVNILSDANV-----------------EPQKV 696 (823) T ss_pred EeeceeeCCceEEEccCCcccceeeeeeccccccccceeeeccccccceEEEEEcCee-----------------eCCeE Confidence 22334444444444444442 34556 Q ss_pred cCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeEEEEEEEEEeecccceEEEEccCCCCceeeE Q lcl|NC_020838. 838 ISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDLILHRLKVSTGLSGPITYKVDITGKDEWTNI 917 (981) Q Consensus 838 v~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~ 917 (981) |.+|+|+|+ .++..|||||+|+++++|+|+++.. +|...+ .++||+++.++|++|.++.++.+....++ T Consensus 697 v~~G~vtl~--~~~~~v~vGl~~~~~~~~l~~~~~~-~g~~~g----~~~ri~~~~~~~~~s~~~~~g~~~~~l~~---- 765 (823) T protein:vir:95 697 VSGGAVTLE--SPGAVVHIGLPITAEFETLDINING-QETLLD----KKQVIPSVTLVVNASRGIWATTPGGKWYE---- 765 (823) T ss_pred ecCCEEEec--CCCCEEEEeecceeeEEecchhcCC-CcccCC----ceeEEeEEEEEEEeeeeEEEecCCCceeE---- Confidence 789999997 5689999999999999999999875 354332 35689999999999999999875543333 Q ss_pred ecccccCccccCccccccCCeEEEEe-eccCcceEEEEEECCCCCEEEEEEEEEEEeecc Q lcl|NC_020838. 918 INVTLPNTYVLNNVNLSASALHDVPI-YQRNENVNIKIIGDTPFPISLLNIVWEGNYNRR 976 (981) Q Consensus 918 ~~~~~~~~~~~~~~p~~~~~~~~vp~-~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r 976 (981) ++.|.. ..+|++|+..++.+++++ .+|+++++|+|+|++|||||||||..|...+== T Consensus 766 ~~~r~~--~~~~~~~~~~tG~~~~~~~~~~~~~~~~~i~q~~plp~tvl~v~~~~~~~g~ 823 (823) T protein:vir:95 766 YPQREF--EFYDDPVDDATGKVEVKLDSNWGKNGRVKIRQLDPLPLSVLAVIPRLTVGGF 823 (823) T ss_pred eeccCC--CcccCCCCcccceEEEecCCCcCCccEEEEEEcCCCceEEEEEEEEEEecCC Confidence 334432 235776555566677877 899999999999999999999999988776544 No 21 >protein:vir:7329 Length: 825 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848220;genbank:gi:30387391;genbank:GeneID:2641863 Probab=100.00 E-value=9.4e-131 Score=733.43 Aligned_cols=697 Identities=14% Similarity=0.143 Sum_probs=460.6 Q ss_pred CCceeecchhhhcc-----ccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCCCCceEEEEEeCCCc-eE Q lcl|NC_020838. 1 MSTISQRIPNLLLG-----VSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAEARGRWFPILRDEEE-KY 74 (981) Q Consensus 1 m~~v~~s~~~l~~G-----vSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~~~~~~~~~~rd~~e-~y 74 (981) |+.- ..-+||-+| +.-..|..|+..++++|.||+++|..||+||||++|++.+...+ ...+..-+....| .| T Consensus 1 m~~~-~~q~sF~~GElsP~l~gR~Dl~~y~~g~~~~~N~~~~p~Gg~~rRpGt~fva~~~~~~-~~~rLipF~fs~~q~y 78 (825) T protein:vir:73 1 MAFS-WIQPSFAGGEIGPSLYGRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPAKYPD-RKCRLIPFQFSTVQTY 78 (825) T ss_pred Cccc-eeccccccceechhhcccchHHHHHHHHHHhcCcEEEecCCceecCchHHhHhhcCCC-CCEEEEEEEeCCCcEE Confidence 8743 444699999 89999999999999999999999999999999999999875554 3334444454445 46 Q ss_pred EEEEEcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCccc Q lcl|NC_020838. 75 VCQYDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTA 154 (981) Q Consensus 75 ~~~~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~ 154 (981) ++.| .++.|+||.. +|... ...+ T Consensus 79 ~Lef--g~~~lrv~~~-gg~v~----------------~~~~-------------------------------------- 101 (825) T protein:vir:73 79 ALEF--GHNYMRVIKD-GAYVL----------------TTSN-------------------------------------- 101 (825) T ss_pred EEEE--eCCeEEEEeC-CceEe----------------ccCC-------------------------------------- Confidence 6666 3678999965 23110 0000 Q ss_pred ccceeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccccCCcccccceEeecceeEeeee Q lcl|NC_020838. 155 TKVNLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYPWFKRDGYRVYEVE 234 (981) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 234 (981) .| + T Consensus 102 -----------------------------------------------------------------~~-~----------- 104 (825) T protein:vir:73 102 -----------------------------------------------------------------VI-Y----------- 104 (825) T ss_pred -----------------------------------------------------------------ce-E----------- Confidence 00 0 Q ss_pred eeeEEeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEe Q lcl|NC_020838. 235 KEVAAAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVL 314 (981) Q Consensus 235 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~ 314 (981) . ..+.+++++.++|+++|++|++||+ T Consensus 105 ----------e--------------------------------------------~~TPy~~~~l~~l~~~QsaD~~~i~ 130 (825) T protein:vir:73 105 ----------E--------------------------------------------LAMPYADTDLFRIKFTQSADVLTLV 130 (825) T ss_pred ----------E--------------------------------------------EecccchhhhhhheeeeecCEEEEE Confidence 0 0011233455789999999999999 Q ss_pred CCceeEeccCCCCCCCCceEEEEEeeeccceEEEEeeCceEEEEEeccCCCcceecHHHHHHHHHhhhhccCceEEEEcC Q lcl|NC_020838. 315 NKNKTTAMKTTTSAAVPNVAFVVIRIVAYNSDYSVTLNGTTVTHSTPDTVAGATTDSGSIAAALTSSINALTGFSATQVG 394 (981) Q Consensus 315 n~~~~~~~~~~~~~~~~~~~~v~v~~g~y~~~~~v~~ng~~~~~~t~~~~a~~~~~~~~ia~~l~~~i~~~~~~~~~~vg 394 (981) |++++|+.....+...+....+....+ ....+.++... ...+.+.+...+- ..+.+.+....+| T Consensus 131 h~~~pp~~L~r~~~~~W~l~~~~f~~g---p~~~in~~~sv--~v~asg~tg~~Ti-----------TaS~a~~~~~~vG 194 (825) T protein:vir:73 131 HPAYPPKELRRYAHDNWQIVDVTTKNG---PFEDINVDETV--KVYASASTGTITL-----------TASSAIFGAEQVG 194 (825) T ss_pred cCCCceeEEEEecCCCcEEEEEeccCC---ccccccccccc--eeeecccCceeEE-----------EeeccccCchhcC Confidence 999999865433332222222222211 11112111110 0011111111100 0112334444556 Q ss_pred CEEEEEeCCCceEEEecCcCcceeEEEEEEEeec----------hhccccCCCCcEEE-EEccCCCcccceEEEEEcccC Q lcl|NC_020838. 395 PGIYIEGTSAFSISTSGSTTEEGIFAFQDQINVA----------SRLPNQCENGYRVR-VTNSGDVTADDIYVEFQTTNS 463 (981) Q Consensus 395 ~~i~i~~~~~~~vt~~~g~~~t~~~~~~~~v~~~----------~~Lp~~~~~G~~v~-v~~~g~~~~d~yyv~~~~~~~ 463 (981) ..+++.+...-.+..- +...+............ .+++..+..|.... +.+.... ....-.++... T Consensus 195 ~~i~~~~~~v~si~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~t~~~~a~~g~~~~~~~g~~~~-~~~~~~~~~~~-- 270 (825) T protein:vir:73 195 KLFYLEQPAVDSVPVW-ETSKTTAINDVRRADSNYYRANTSGKTGTLRPSHTEGMSWDGWGGTGSD-DTGIQWEYLHS-- 270 (825) T ss_pred eEEEEeccccccccee-eeeeEEEeeeEEECCCceeeeecccccceeeccccCCceeEeeeeeccc-CCceEEEEEec-- Confidence 5566554332211110 00000000000000000 01122222221111 1111110 00010111111 Q ss_pred CcccceEEEEeeccceeEEEc---cccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCCceeEEEEEcceEEE Q lcl|NC_020838. 464 AARGPGVWEETIGPSLEFEID---ETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGL 540 (981) Q Consensus 464 ~~~g~~~W~E~a~~~~~~~~~---~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f 540 (981) +.+.++.++.++...... ...+|+.+++.++++++++...|+++ +.||++|+||||||+| T Consensus 271 ---~~g~~~it~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~--------------~gyPs~v~f~q~RL~f 333 (825) T protein:vir:73 271 ---GFGIAKITAVAGDGLTATADVVSFIPSQVVGSANASYKWAKYAWNSV--------------NGYPSTVVYYQQRLYF 333 (825) T ss_pred ---CCceEEEeeccccceeeccccceecccccccCCCCCcccccCCcccC--------------CCCccEEEEEcceEEE Confidence 111233332222111111 11245666666667777766666543 3578889999999999 Q ss_pred e----cCCeEEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCC-ccccccc Q lcl|NC_020838. 541 L----SNEAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDA-DILSPTT 615 (981) Q Consensus 541 ~----s~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~-~~LTP~~ 615 (981) + +|++|||||+||||||++++ +++|||||+++++++++|.|+|+++++ +|+|||+++||+|+|++ ++|||+| T Consensus 334 ~g~~~~p~~v~~Srtgd~~nF~~~~--~~~DdD~I~~~~s~~~~~~i~~~~~~~-~L~~~t~~~e~~l~~~~~~~lTP~~ 410 (825) T protein:vir:73 334 AASTAYPQTIWASRTGDYKDFGKNN--PIQDDDRIIYTYAGRQVNEIRHLIDVG-NLVALTSGGEYTISGDQNKVLTPSA 410 (825) T ss_pred eecCCCCCEEEEEccCCccccccCC--CCCCCccEEEEEcCCcceeEEEEeecC-cEEEEecCceEEEecCCCcccceee Confidence 9 57999999999999999998 468999999999999999999999985 89999999999999863 6999999 Q ss_pred eEEEEEeeeccccCCCcEEeCCeEEEEecCCCeeEEEEEeecccccceehhhHHHHHHHhcCC-CeEEEEEcCCCcEEEE Q lcl|NC_020838. 616 TKINTISTFECDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEATTNVPEYVPS-DIDSMTASPAMSIVSL 694 (981) Q Consensus 616 ~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~~~~h~~~~-~i~~~~~s~~~~~~~~ 694 (981) ++++++|+|+++ +++|+.+|++++|+|++|+ +||||.|++++|+|+++|||+|++|++++ ++.+|++++.|+.++| T Consensus 411 ~~~~~~s~~g~~-~~~Pv~vg~~~~Fv~~~g~--~vre~~~~~~~d~~~~~dlt~~a~hl~~~~~~~~~a~~~~p~~~~~ 487 (825) T protein:vir:73 411 FSFSSQGNNGSS-NVPPIAVANIALFIQEKGS--VVRDLAYSFDVDGYQGTDLTILANHLFQKHSIVDWSFCIVPYSSAF 487 (825) T ss_pred EEEEeeeeeccc-cccceEeCCeEEEEeCCCC--eEEEEEEeeecCceeccchhhhhHhhccCCceEEEEEcCCCceEEE Confidence 999999999765 7999999999999999885 79999999999999999999999999986 7889999988887777 Q ss_pred EecCCcEEEEEEeecCcchheeeeEeeccCCceEEEEEe----CCeEEEEEEcCCc----EEEEEEEeecCCceeEEEec Q lcl|NC_020838. 695 GKSGSNTVYQHRFFMQGENRVQTWYKWQLTGDLRLQFFD----KTTFYAVTSSGSN----VYLTSYDLTQASESGYLTLP 766 (981) Q Consensus 695 ~~~g~~~l~~y~y~~~~eq~V~aWsrw~~~G~v~sv~~~----~d~ly~vv~r~~~----~~l~~~~~~~~~~~~~~~~~ 766 (981) +..+++.|++|+|+ +||+|.|||||+|+|+|+++|++ +|.||++|+|..+ .|+|+|.. T Consensus 488 ~v~~dg~l~~~ty~--~~q~v~aW~~~~~~g~v~~~~~i~~~~~D~l~~iV~R~~~g~~~~yiE~~~~------------ 553 (825) T protein:vir:73 488 CIRDDGKLLVLTYL--RDQQVFAWAPQSSAGKYESTCSISEGSEDAVYFVVNRTINGQTVRYIERLSS------------ 553 (825) T ss_pred EEecCCeEEEEEEe--ccccceeeEEEecCCcEEEEEEecCCCccEEEEEEEEeeCCceEEEEEEecc------------ Confidence 77778899999985 79999999999999999999997 6799999999644 45666632 Q ss_pred CCCcccccccceeeeeeeeeccCCce------------------------------------------------------ Q lcl|NC_020838. 767 TGEKTDVCLDMFNVNPYRTYSTSTKK------------------------------------------------------ 792 (981) Q Consensus 767 ~~~~~~~~lD~~~vd~~~ty~~~~~~------------------------------------------------------ 792 (981) +.+...-|++++||+++|++.+.. T Consensus 554 --~~~~~~~~~~~vD~g~~~~g~~~~~~l~~l~g~tv~~~~~g~~~~~v~~g~itl~~~~~~~i~l~~~~~~~~~~~~~~ 631 (825) T protein:vir:73 554 --RLFTNDEDAFFVDCGLSYDGRNTSSRTMTISGGTGDWSYQVDYPVTVSGGAYFVNTDVGAQIQFPYTGTDPDTNEPVA 631 (825) T ss_pred --cccCCCcceeEEEEEeeecccceeeceeeeCCceEEEEeCCeEEEEEcCCeEEecccceEEEEecccCccccccccee Confidence 222222244455555555432211 Q ss_pred ---------------------------------------EEEcccCCCcCCceEEEEEcCcccCceeEEEEecCceEEEc Q lcl|NC_020838. 793 ---------------------------------------TTVNLPFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYF 833 (981) Q Consensus 793 ---------------------------------------tt~~l~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~ 833 (981) .....+|+||||++|.|+|||.. + T Consensus 632 ~~~~~~i~~~~~~~~v~v~~~~~~~a~~~~~~~t~~~~a~~~~~gL~hLeG~~v~v~~Dg~~-----------------~ 694 (825) T protein:vir:73 632 KELRGDIISVTSNTAVVVRFNRNVPPVLRNVATTNWQMARQTFSGLAHLEGQTVNILSDASV-----------------E 694 (825) T ss_pred ceeeEEEccccCceEEEEEecccccceeeeecccCCCcchheeccccccCCceEEEEECCee-----------------e Confidence 01123445555555555555543 4 Q ss_pred cccccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeEEEEEEEEEeecccceEEEEccCCCCc Q lcl|NC_020838. 834 EDSDISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDLILHRLKVSTGLSGPITYKVDITGKDE 913 (981) Q Consensus 834 ~~~tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~ 913 (981) ++.+|.+|+|+|+ .++++|||||+|+++++||||++..+ |...+ .++||+++.++|.+|.++.++.+....++ T Consensus 695 ~~~~V~~G~vtl~--~~~~~v~vGl~y~~~~~~l~~~~~~~-g~~~g----~~~ri~~~~~~~~~s~~~~~g~~~~~l~~ 767 (825) T protein:vir:73 695 PQKTVTGGAVTLE--SPGAVVHIGLPITAEFETLDININGQ-ETLLD----KKQVIPTVTMVVNASRGIWATTPGGTWYE 767 (825) T ss_pred CCeEecCcEEEec--CCceEEEEeeCccceEEecccccCCC-ccccC----ccEEEEEEEEEEEeeeeEEEecCCCcceE Confidence 4566889999997 57899999999999999999998643 54332 35789999999999999999865554333 Q ss_pred eeeEecccccCccccCccccccCCeEEEEe-eccCcceEEEEEECCCCCEEEEEEEEEEEeecc Q lcl|NC_020838. 914 WTNIINVTLPNTYVLNNVNLSASALHDVPI-YQRNENVNIKIIGDTPFPISLLNIVWEGNYNRR 976 (981) Q Consensus 914 ~~~~~~~~~~~~~~~~~~p~~~~~~~~vp~-~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r 976 (981) ++.|.. ..++++|+..++.+++++ .+|+++++|+|+|++|||||||||..|...+== T Consensus 768 ----~~~r~~--~~~~~~~~~~tG~~~~~~~~~~~~~~~~~i~q~~PlP~tvlav~~~~~~~g~ 825 (825) T protein:vir:73 768 ----YPQREF--EFYDDPVDDATGKVEVKLDSNWDKNGRVKVRQLDPLPLSVLAVLPRLTVGGF 825 (825) T ss_pred ----eeccCC--CcccCCCccccCcEEEecCCCCCCccEEEEEEcCCCCEEEEEEEEEEEecCC Confidence 334433 345777555555667877 899999999999999999999999988776554 No 22 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=100.00 E-value=1.7e-127 Score=715.61 Aligned_cols=664 Identities=12% Similarity=0.082 Sum_probs=457.0 Q ss_pred cCCceeEEEeeccccceecccccccc---ccccCCcccccceEeecceeEeeeeeeeEEeeCceeeEeeeccCCcCcCce Q lcl|NC_020838. 184 DNGQRIVKDDGTNAGSIAAGSAMPSG---YSLGNERTDDYPWFKRDGYRVYEVEKEVAAAYNSTELSTANTNMGTAQTAY 260 (981) Q Consensus 184 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 260 (981) ..-..+.+.+-+..+ +...+....+ |..+--++-+......+|...|.++.+++++...-...+|++.-+.-++.| T Consensus 1 m~~~~~~~~~f~~Ge-~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~~~~~ 79 (681) T protein:vir:10 1 MSNVRVLQRSFGGGE-ISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSVTQTM 79 (681) T ss_pred CcceeEeeeecCCce-eeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCCCceE Confidence 111111222222222 3333332222 222223444666677899999999999999998876678887766656665 Q ss_pred EEEEEcccccceE--EecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCceeEeccCCCCCCCCceEEEEE Q lcl|NC_020838. 261 DNAVSTESTEKGD--YDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKNKTTAMKTTTSAAVPNVAFVVI 338 (981) Q Consensus 261 ~~~i~~~~~~~~~--~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~~~~~~~~~~~~~~~~~~~v~v 338 (981) +-. .++.|+ |..+|.+.. .+..++..+.+++++..+|+++|+||++||+|++++|+.....+...+...-+.. T Consensus 80 ~l~----~g~~~~r~~~~~~~~~~-~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~W~l~~~~f 154 (681) T protein:vir:10 80 VIE----LGAGYFRFHTNGGTLLD-GAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATNWQLATIAF 154 (681) T ss_pred EEE----EeCCeEEEEeCCcEEee-CcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCceEEEEEEe Confidence 422 233333 555555443 3455556777888999999999999999999999999864332222222111111 Q ss_pred eeeccceEEEEeeCceEEEEEeccCCCcceecHHHHHHHHHhhhhccCceEEEEcCCEEEEEeCCCceEEEecCcCccee Q lcl|NC_020838. 339 RIVAYNSDYSVTLNGTTVTHSTPDTVAGATTDSGSIAAALTSSINALTGFSATQVGPGIYIEGTSAFSISTSGSTTEEGI 418 (981) Q Consensus 339 ~~g~y~~~~~v~~ng~~~~~~t~~~~a~~~~~~~~ia~~l~~~i~~~~~~~~~~vg~~i~i~~~~~~~vt~~~g~~~t~~ 418 (981) ..+-+ +.. ....++.++..+... T Consensus 155 ~~~p~----------------------~p~-----------------------------------~~~at~~~~~~~~t~ 177 (681) T protein:vir:10 155 TSPVA----------------------TPT-----------------------------------SVTATSNNKGTDYTY 177 (681) T ss_pred ccccc----------------------cce-----------------------------------eeeeeccCCccceeE Confidence 11111 000 000011111111111 Q ss_pred EEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCcccceEEEEeeccceeEEEccccceEEEEeccCC Q lcl|NC_020838. 419 FAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAARGPGVWEETIGPSLEFEIDETTMPHQLIRQANG 498 (981) Q Consensus 419 ~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g 498 (981) ......+++..+.+........++...... ...-.+.|...++ +....+|++..+ ++++ +..+.+ T Consensus 178 ~~~v~avda~t~~~s~~~~~~tvt~~~~~~--~~~~t~~w~a~~g-~~~~~V~~~~~g---i~g~---------ig~~~~ 242 (681) T protein:vir:10 178 RYVVTALDAEGKTESAPSSAGTCTNNLFTN--GGANTIAWSASSG-ASRYNVYKEQGG---LYGY---------IGQTTG 242 (681) T ss_pred eEEEEEeecccceeecCCcceEEeeeeecC--CcceeEEEEecCC-ceeeeeccccee---EEEE---------eeccce Confidence 111111111111111111111111111000 0000112222221 112223333222 2222 221111 Q ss_pred -ceeeecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEe----cCCeEEEEeccccccccccccccccCCccEEE Q lcl|NC_020838. 499 -VFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLL----SNEAVIMSRAGDYFNFFANSSQVVAPDDPIDL 573 (981) Q Consensus 499 -~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~----s~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~ 573 (981) ++....+.++.....+...+|+.++ +.||++|+||||||+|+ +||+|||||+||||||++++ ++.|||||++ T Consensus 243 ~~~~~~~~~~~~~~t~~~~~~~~~~~-~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~--~~~ddD~i~~ 319 (681) T protein:vir:10 243 TSLVDDNIAPDLSVTPPIYDAVFNAA-GDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSL--PVRDDDRVAF 319 (681) T ss_pred eeeeecccccCccccccccccccccC-CCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccC--CCCCCccEEE Confidence 2222223333444444555666554 46899999999999999 57999999999999999988 4689999999 Q ss_pred EEcCCCceeEEEEeecCCcEEEEecCcEEEEEcC-CccccccceEEEEEeeeccccCCCcEEeCCeEEEEecCCCeeEEE Q lcl|NC_020838. 574 QATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTD-ADILSPTTTKINTISTFECDAEIDAVAVGTTQAFISKSNLYSKLF 652 (981) Q Consensus 574 ~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~-~~~LTP~~~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vr 652 (981) ++++++++.|+|+++++ +|+|||+++||.|+++ +++|||+|++++++|.|++ ++++|+.+|++++|+|++|+ +|| T Consensus 320 ~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~fv~~~g~--~vr 395 (681) T protein:vir:10 320 RVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTIYGAARGG--HVR 395 (681) T ss_pred EEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeeeecc-ccccceeeCCeEEEEecCCC--EEE Confidence 99999999999999995 6999999999999875 4699999999999999976 57999999999999999996 799 Q ss_pred EEeecccccceehhhHHHHHHHhcCC-CeEEEEEcCCCcEEEEEecCCcEEEEEEeecCcchheeeeEeeccCCceEEEE Q lcl|NC_020838. 653 LMLNVQKEAAATIDEATTNVPEYVPS-DIDSMTASPAMSIVSLGKSGSNTVYQHRFFMQGENRVQTWYKWQLTGDLRLQF 731 (981) Q Consensus 653 e~~y~~~~d~~~a~DlS~~~~h~~~~-~i~~~~~s~~~~~~~~~~~g~~~l~~y~y~~~~eq~V~aWsrw~~~G~v~sv~ 731 (981) ||.|++++|+|+++|||++++|++++ ++.+|+++.+|+.++|+..+++.+++|+|+ +||+|+|||||+|+|+|+++| T Consensus 396 e~~y~~~~d~~~~~dlt~~a~Hl~~~~~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~--~eq~v~aW~~~~~~g~v~~v~ 473 (681) T protein:vir:10 396 ELAYNWQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFISSSGKLLGLTYV--PEQQIGAWHQHDTDGVFESCA 473 (681) T ss_pred EEEEeeecCceeccchhhhhhhhcCCCCeEEEEEecCCCEEEEEEecCCcEEEEEEe--cccceeeEEEEecCCcEEEEE Confidence 99999999999999999999999987 899999999999888888888899999985 789999999999999999999 Q ss_pred Ee----CCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCcccccccceeeeeeeeeccCCceEEEcccCCCcCCceE Q lcl|NC_020838. 732 FD----KTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFNVNPYRTYSTSTKKTTVNLPFDHITGKKL 807 (981) Q Consensus 732 ~~----~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~vd~~~ty~~~~~~tt~~l~~~~l~g~~v 807 (981) ++ +|.||++++|+.+. ....++|++.....+...|++++||+++|.+.+..+ ..+++||+|+++ T Consensus 474 ~i~~~~~d~l~~vv~r~~~g----------~~~~yie~~~~~~~~~~~~~~~vD~~~t~~~~~~~~--~sgl~~leG~tv 541 (681) T protein:vir:10 474 VVAEGNEDRLYAVVRRTIGG----------NEVRYVERMASRQFDAQADAFFVDSGLTYSGEPVSH--ISGLEHLEGKTV 541 (681) T ss_pred EecCCCCcEEEEEEEecCCC----------CeEEEEEecCCccccccccceEeeccccccCcceee--eccccCCCCcEE Confidence 97 67899999996442 222333333333445556889999999998776543 346799999999 Q ss_pred EEEEcCcccCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeE Q lcl|NC_020838. 808 AVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDL 887 (981) Q Consensus 808 ~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl 887 (981) .+++||... ++.+|.+|.|+|+ .++++|+|||+|+++++|+||+++.++|...+ .++ T Consensus 542 ~i~aDG~~~-----------------~~~~V~~G~itl~--~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g----~~~ 598 (681) T protein:vir:10 542 SILADGAVH-----------------PQRVVTDGAIDLD--VEAGTVHIGLPITAELQTLPVAMQLDGSFGQG----RVK 598 (681) T ss_pred EEEeCCeec-----------------CcEeecCcEEEeC--cCCceEEEeeeceeEEEecceeeecCCcccCC----ceE Confidence 999998744 3445778999996 56899999999999999999999988876443 467 Q ss_pred EEEEEEEEeecccceEEEEccCCCCceeeEecccccCccccCccccccCCeEEEEe-eccCcceEEEEEECCCCCEEEEE Q lcl|NC_020838. 888 ILHRLKVSTGLSGPITYKVDITGKDEWTNIINVTLPNTYVLNNVNLSASALHDVPI-YQRNENVNIKIIGDTPFPISLLN 966 (981) Q Consensus 888 ~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~vp~-~g~~~~~~v~I~~~~PlPltvls 966 (981) +|+|+.|++.+|+++.++++....+++..+ ..+.+|.++...++.+++|+ .+|+++.+|+|+|++|+||+|+| T Consensus 599 ri~rv~lr~~~S~g~~~~~~~~~l~~~~~~------~~~~~g~~~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvls 672 (681) T protein:vir:10 599 NINKLWLRVHRSSGIFAGPHADALTEVKQR------TSEPYGSPPALKSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVS 672 (681) T ss_pred EEEEEEEEEEcccceEEeeCCCceEEEEEe------ccccccccCCccCCeEEEEeCCCcCcceEEEEEECCCcCEEEEE Confidence 999999999999999999877665554332 23445666555556778887 58999999999999999999999 Q ss_pred EEEEEEeec Q lcl|NC_020838. 967 IVWEGNYNR 975 (981) Q Consensus 967 i~weg~y~~ 975 (981) |+||-...= T Consensus 673 i~~ev~vgg 681 (681) T protein:vir:10 673 MSAEIAIGA 681 (681) T ss_pred eeEEEEeeC Confidence 999999988 No 23 >protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878 Probab=100.00 E-value=1.7e-127 Score=715.61 Aligned_cols=664 Identities=12% Similarity=0.082 Sum_probs=457.0 Q ss_pred cCCceeEEEeeccccceecccccccc---ccccCCcccccceEeecceeEeeeeeeeEEeeCceeeEeeeccCCcCcCce Q lcl|NC_020838. 184 DNGQRIVKDDGTNAGSIAAGSAMPSG---YSLGNERTDDYPWFKRDGYRVYEVEKEVAAAYNSTELSTANTNMGTAQTAY 260 (981) Q Consensus 184 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 260 (981) ..-..+.+.+-+..+ +...+....+ |..+--++-+......+|...|.++.+++++...-...+|++.-+.-++.| T Consensus 1 m~~~~~~~~~f~~Ge-~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~~~~~ 79 (681) T protein:vir:10 1 MSNVRVLQRSFGGGE-ISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSVTQTM 79 (681) T ss_pred CcceeEeeeecCCce-eeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCCCceE Confidence 111111222222222 3333332222 222223444666677899999999999999998876678887766656665 Q ss_pred EEEEEcccccceE--EecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCceeEeccCCCCCCCCceEEEEE Q lcl|NC_020838. 261 DNAVSTESTEKGD--YDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKNKTTAMKTTTSAAVPNVAFVVI 338 (981) Q Consensus 261 ~~~i~~~~~~~~~--~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~~~~~~~~~~~~~~~~~~~v~v 338 (981) +-. .++.|+ |..+|.+.. .+..++..+.+++++..+|+++|+||++||+|++++|+.....+...+...-+.. T Consensus 80 ~l~----~g~~~~r~~~~~~~~~~-~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~W~l~~~~f 154 (681) T protein:vir:10 80 VIE----LGAGYFRFHTNGGTLLD-GAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATNWQLATIAF 154 (681) T ss_pred EEE----EeCCeEEEEeCCcEEee-CcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCceEEEEEEe Confidence 422 233333 555555443 3455556777888999999999999999999999999864332222222111111 Q ss_pred eeeccceEEEEeeCceEEEEEeccCCCcceecHHHHHHHHHhhhhccCceEEEEcCCEEEEEeCCCceEEEecCcCccee Q lcl|NC_020838. 339 RIVAYNSDYSVTLNGTTVTHSTPDTVAGATTDSGSIAAALTSSINALTGFSATQVGPGIYIEGTSAFSISTSGSTTEEGI 418 (981) Q Consensus 339 ~~g~y~~~~~v~~ng~~~~~~t~~~~a~~~~~~~~ia~~l~~~i~~~~~~~~~~vg~~i~i~~~~~~~vt~~~g~~~t~~ 418 (981) ..+-+ +.. ....++.++..+... T Consensus 155 ~~~p~----------------------~p~-----------------------------------~~~at~~~~~~~~t~ 177 (681) T protein:vir:10 155 TSPVA----------------------TPT-----------------------------------SVTATSNNKGTDYTY 177 (681) T ss_pred ccccc----------------------cce-----------------------------------eeeeeccCCccceeE Confidence 11111 000 000011111111111 Q ss_pred EEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCcccceEEEEeeccceeEEEccccceEEEEeccCC Q lcl|NC_020838. 419 FAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAARGPGVWEETIGPSLEFEIDETTMPHQLIRQANG 498 (981) Q Consensus 419 ~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g 498 (981) ......+++..+.+........++...... ...-.+.|...++ +....+|++..+ ++++ +..+.+ T Consensus 178 ~~~v~avda~t~~~s~~~~~~tvt~~~~~~--~~~~t~~w~a~~g-~~~~~V~~~~~g---i~g~---------ig~~~~ 242 (681) T protein:vir:10 178 RYVVTALDAEGKTESAPSSAGTCTNNLFTN--GGANTIAWSASSG-ASRYNVYKEQGG---LYGY---------IGQTTG 242 (681) T ss_pred eEEEEEeecccceeecCCcceEEeeeeecC--CcceeEEEEecCC-ceeeeeccccee---EEEE---------eeccce Confidence 111111111111111111111111111000 0000112222221 112223333222 2222 221111 Q ss_pred -ceeeecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEe----cCCeEEEEeccccccccccccccccCCccEEE Q lcl|NC_020838. 499 -VFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLL----SNEAVIMSRAGDYFNFFANSSQVVAPDDPIDL 573 (981) Q Consensus 499 -~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~----s~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~ 573 (981) ++....+.++.....+...+|+.++ +.||++|+||||||+|+ +||+|||||+||||||++++ ++.|||||++ T Consensus 243 ~~~~~~~~~~~~~~t~~~~~~~~~~~-~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~--~~~ddD~i~~ 319 (681) T protein:vir:10 243 TSLVDDNIAPDLSVTPPIYDAVFNAA-GDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSL--PVRDDDRVAF 319 (681) T ss_pred eeeeecccccCccccccccccccccC-CCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccC--CCCCCccEEE Confidence 2222223333444444555666554 46899999999999999 57999999999999999988 4689999999 Q ss_pred EEcCCCceeEEEEeecCCcEEEEecCcEEEEEcC-CccccccceEEEEEeeeccccCCCcEEeCCeEEEEecCCCeeEEE Q lcl|NC_020838. 574 QATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTD-ADILSPTTTKINTISTFECDAEIDAVAVGTTQAFISKSNLYSKLF 652 (981) Q Consensus 574 ~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~-~~~LTP~~~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vr 652 (981) ++++++++.|+|+++++ +|+|||+++||.|+++ +++|||+|++++++|.|++ ++++|+.+|++++|+|++|+ +|| T Consensus 320 ~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~fv~~~g~--~vr 395 (681) T protein:vir:10 320 RVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTIYGAARGG--HVR 395 (681) T ss_pred EEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeeeecc-ccccceeeCCeEEEEecCCC--EEE Confidence 99999999999999995 6999999999999875 4699999999999999976 57999999999999999996 799 Q ss_pred EEeecccccceehhhHHHHHHHhcCC-CeEEEEEcCCCcEEEEEecCCcEEEEEEeecCcchheeeeEeeccCCceEEEE Q lcl|NC_020838. 653 LMLNVQKEAAATIDEATTNVPEYVPS-DIDSMTASPAMSIVSLGKSGSNTVYQHRFFMQGENRVQTWYKWQLTGDLRLQF 731 (981) Q Consensus 653 e~~y~~~~d~~~a~DlS~~~~h~~~~-~i~~~~~s~~~~~~~~~~~g~~~l~~y~y~~~~eq~V~aWsrw~~~G~v~sv~ 731 (981) ||.|++++|+|+++|||++++|++++ ++.+|+++.+|+.++|+..+++.+++|+|+ +||+|+|||||+|+|+|+++| T Consensus 396 e~~y~~~~d~~~~~dlt~~a~Hl~~~~~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~--~eq~v~aW~~~~~~g~v~~v~ 473 (681) T protein:vir:10 396 ELAYNWQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFISSSGKLLGLTYV--PEQQIGAWHQHDTDGVFESCA 473 (681) T ss_pred EEEEeeecCceeccchhhhhhhhcCCCCeEEEEEecCCCEEEEEEecCCcEEEEEEe--cccceeeEEEEecCCcEEEEE Confidence 99999999999999999999999987 899999999999888888888899999985 789999999999999999999 Q ss_pred Ee----CCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCcccccccceeeeeeeeeccCCceEEEcccCCCcCCceE Q lcl|NC_020838. 732 FD----KTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFNVNPYRTYSTSTKKTTVNLPFDHITGKKL 807 (981) Q Consensus 732 ~~----~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~vd~~~ty~~~~~~tt~~l~~~~l~g~~v 807 (981) ++ +|.||++++|+.+. ....++|++.....+...|++++||+++|.+.+..+ ..+++||+|+++ T Consensus 474 ~i~~~~~d~l~~vv~r~~~g----------~~~~yie~~~~~~~~~~~~~~~vD~~~t~~~~~~~~--~sgl~~leG~tv 541 (681) T protein:vir:10 474 VVAEGNEDRLYAVVRRTIGG----------NEVRYVERMASRQFDAQADAFFVDSGLTYSGEPVSH--ISGLEHLEGKTV 541 (681) T ss_pred EecCCCCcEEEEEEEecCCC----------CeEEEEEecCCccccccccceEeeccccccCcceee--eccccCCCCcEE Confidence 97 67899999996442 222333333333445556889999999998776543 346799999999 Q ss_pred EEEEcCcccCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeE Q lcl|NC_020838. 808 AVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDL 887 (981) Q Consensus 808 ~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl 887 (981) .+++||... ++.+|.+|.|+|+ .++++|+|||+|+++++|+||+++.++|...+ .++ T Consensus 542 ~i~aDG~~~-----------------~~~~V~~G~itl~--~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g----~~~ 598 (681) T protein:vir:10 542 SILADGAVH-----------------PQRVVTDGAIDLD--VEAGTVHIGLPITAELQTLPVAMQLDGSFGQG----RVK 598 (681) T ss_pred EEEeCCeec-----------------CcEeecCcEEEeC--cCCceEEEeeeceeEEEecceeeecCCcccCC----ceE Confidence 999998744 3445778999996 56899999999999999999999988876443 467 Q ss_pred EEEEEEEEeecccceEEEEccCCCCceeeEecccccCccccCccccccCCeEEEEe-eccCcceEEEEEECCCCCEEEEE Q lcl|NC_020838. 888 ILHRLKVSTGLSGPITYKVDITGKDEWTNIINVTLPNTYVLNNVNLSASALHDVPI-YQRNENVNIKIIGDTPFPISLLN 966 (981) Q Consensus 888 ~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~vp~-~g~~~~~~v~I~~~~PlPltvls 966 (981) +|+|+.|++.+|+++.++++....+++..+ ..+.+|.++...++.+++|+ .+|+++.+|+|+|++|+||+|+| T Consensus 599 ri~rv~lr~~~S~g~~~~~~~~~l~~~~~~------~~~~~g~~~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvls 672 (681) T protein:vir:10 599 NINKLWLRVHRSSGIFAGPHADALTEVKQR------TSEPYGSPPALKSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVS 672 (681) T ss_pred EEEEEEEEEEcccceEEeeCCCceEEEEEe------ccccccccCCccCCeEEEEeCCCcCcceEEEEEECCCcCEEEEE Confidence 999999999999999999877665554332 23445666555556778887 58999999999999999999999 Q ss_pred EEEEEEeec Q lcl|NC_020838. 967 IVWEGNYNR 975 (981) Q Consensus 967 i~weg~y~~ 975 (981) |+||-...= T Consensus 673 i~~ev~vgg 681 (681) T protein:vir:10 673 MSAEIAIGA 681 (681) T ss_pred eeEEEEeeC Confidence 999999988 No 24 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=100.00 E-value=1.7e-127 Score=715.61 Aligned_cols=664 Identities=12% Similarity=0.082 Sum_probs=457.0 Q ss_pred cCCceeEEEeeccccceecccccccc---ccccCCcccccceEeecceeEeeeeeeeEEeeCceeeEeeeccCCcCcCce Q lcl|NC_020838. 184 DNGQRIVKDDGTNAGSIAAGSAMPSG---YSLGNERTDDYPWFKRDGYRVYEVEKEVAAAYNSTELSTANTNMGTAQTAY 260 (981) Q Consensus 184 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 260 (981) ..-..+.+.+-+..+ +...+....+ |..+--++-+......+|...|.++.+++++...-...+|++.-+.-++.| T Consensus 1 m~~~~~~~~~f~~Ge-~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~~~~~ 79 (681) T protein:vir:98 1 MSNVRVLQRSFGGGE-ISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSVTQTM 79 (681) T ss_pred CcceeEeeeecCCce-eeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCCCceE Confidence 111111222222222 3333332222 222223444666677899999999999999998876678887766656665 Q ss_pred EEEEEcccccceE--EecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCceeEeccCCCCCCCCceEEEEE Q lcl|NC_020838. 261 DNAVSTESTEKGD--YDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKNKTTAMKTTTSAAVPNVAFVVI 338 (981) Q Consensus 261 ~~~i~~~~~~~~~--~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~~~~~~~~~~~~~~~~~~~v~v 338 (981) +-. .++.|+ |..+|.+.. .+..++..+.+++++..+|+++|+||++||+|++++|+.....+...+...-+.. T Consensus 80 ~l~----~g~~~~r~~~~~~~~~~-~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~~L~r~~~~~W~l~~~~f 154 (681) T protein:vir:98 80 VIE----LGAGYFRFHTNGGTLLD-GAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPRELRRLGATNWQLATIAF 154 (681) T ss_pred EEE----EeCCeEEEEeCCcEEee-CcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcceEEEEccCCceEEEEEEe Confidence 422 233333 555555443 3455556777888999999999999999999999999864332222222111111 Q ss_pred eeeccceEEEEeeCceEEEEEeccCCCcceecHHHHHHHHHhhhhccCceEEEEcCCEEEEEeCCCceEEEecCcCccee Q lcl|NC_020838. 339 RIVAYNSDYSVTLNGTTVTHSTPDTVAGATTDSGSIAAALTSSINALTGFSATQVGPGIYIEGTSAFSISTSGSTTEEGI 418 (981) Q Consensus 339 ~~g~y~~~~~v~~ng~~~~~~t~~~~a~~~~~~~~ia~~l~~~i~~~~~~~~~~vg~~i~i~~~~~~~vt~~~g~~~t~~ 418 (981) ..+-+ +.. ....++.++..+... T Consensus 155 ~~~p~----------------------~p~-----------------------------------~~~at~~~~~~~~t~ 177 (681) T protein:vir:98 155 TSPVA----------------------TPT-----------------------------------SVTATSNNKGTDYTY 177 (681) T ss_pred ccccc----------------------cce-----------------------------------eeeeeccCCccceeE Confidence 11111 000 000011111111111 Q ss_pred EEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCcccceEEEEeeccceeEEEccccceEEEEeccCC Q lcl|NC_020838. 419 FAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAARGPGVWEETIGPSLEFEIDETTMPHQLIRQANG 498 (981) Q Consensus 419 ~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g 498 (981) ......+++..+.+........++...... ...-.+.|...++ +....+|++..+ ++++ +..+.+ T Consensus 178 ~~~v~avda~t~~~s~~~~~~tvt~~~~~~--~~~~t~~w~a~~g-~~~~~V~~~~~g---i~g~---------ig~~~~ 242 (681) T protein:vir:98 178 RYVVTALDAEGKTESAPSSAGTCTNNLFTN--GGANTIAWSASSG-ASRYNVYKEQGG---LYGY---------IGQTTG 242 (681) T ss_pred eEEEEEeecccceeecCCcceEEeeeeecC--CcceeEEEEecCC-ceeeeeccccee---EEEE---------eeccce Confidence 111111111111111111111111111000 0000112222221 112223333222 2222 221111 Q ss_pred -ceeeecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEe----cCCeEEEEeccccccccccccccccCCccEEE Q lcl|NC_020838. 499 -VFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLL----SNEAVIMSRAGDYFNFFANSSQVVAPDDPIDL 573 (981) Q Consensus 499 -~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~----s~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~ 573 (981) ++....+.++.....+...+|+.++ +.||++|+||||||+|+ +||+|||||+||||||++++ ++.|||||++ T Consensus 243 ~~~~~~~~~~~~~~t~~~~~~~~~~~-~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~--~~~ddD~i~~ 319 (681) T protein:vir:98 243 TSLVDDNIAPDLSVTPPIYDAVFNAA-GDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSL--PVRDDDRVAF 319 (681) T ss_pred eeeeecccccCccccccccccccccC-CCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccC--CCCCCccEEE Confidence 2222223333444444555666554 46899999999999999 57999999999999999988 4689999999 Q ss_pred EEcCCCceeEEEEeecCCcEEEEecCcEEEEEcC-CccccccceEEEEEeeeccccCCCcEEeCCeEEEEecCCCeeEEE Q lcl|NC_020838. 574 QATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTD-ADILSPTTTKINTISTFECDAEIDAVAVGTTQAFISKSNLYSKLF 652 (981) Q Consensus 574 ~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~-~~~LTP~~~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vr 652 (981) ++++++++.|+|+++++ +|+|||+++||.|+++ +++|||+|++++++|.|++ ++++|+.+|++++|+|++|+ +|| T Consensus 320 ~~~~~~~~~i~~~v~~~-~lli~t~~~e~~l~~~~~~~lTP~~~~~~~~s~~g~-~~~~Pv~vg~~v~fv~~~g~--~vr 395 (681) T protein:vir:98 320 RVAAREANAIRHIVPLT-ELLLLTSSGEWRVASVNSDAVTPTTISVRPQSYVGA-TDVQPVVVNNTTIYGAARGG--HVR 395 (681) T ss_pred EEcCCcceeEEEEEecC-cEEEEEcCcEEEEecCCCccccceeEEEEEeeeecc-ccccceeeCCeEEEEecCCC--EEE Confidence 99999999999999995 6999999999999875 4699999999999999976 57999999999999999996 799 Q ss_pred EEeecccccceehhhHHHHHHHhcCC-CeEEEEEcCCCcEEEEEecCCcEEEEEEeecCcchheeeeEeeccCCceEEEE Q lcl|NC_020838. 653 LMLNVQKEAAATIDEATTNVPEYVPS-DIDSMTASPAMSIVSLGKSGSNTVYQHRFFMQGENRVQTWYKWQLTGDLRLQF 731 (981) Q Consensus 653 e~~y~~~~d~~~a~DlS~~~~h~~~~-~i~~~~~s~~~~~~~~~~~g~~~l~~y~y~~~~eq~V~aWsrw~~~G~v~sv~ 731 (981) ||.|++++|+|+++|||++++|++++ ++.+|+++.+|+.++|+..+++.+++|+|+ +||+|+|||||+|+|+|+++| T Consensus 396 e~~y~~~~d~~~~~dlt~~a~Hl~~~~~i~~~a~~~~p~~~~~~v~~dg~l~~~ty~--~eq~v~aW~~~~~~g~v~~v~ 473 (681) T protein:vir:98 396 ELAYNWQANGFVTGDLSLRAAHLFDNLDILDMAYAKAPQPIVWFISSSGKLLGLTYV--PEQQIGAWHQHDTDGVFESCA 473 (681) T ss_pred EEEEeeecCceeccchhhhhhhhcCCCCeEEEEEecCCCEEEEEEecCCcEEEEEEe--cccceeeEEEEecCCcEEEEE Confidence 99999999999999999999999987 899999999999888888888899999985 789999999999999999999 Q ss_pred Ee----CCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCcccccccceeeeeeeeeccCCceEEEcccCCCcCCceE Q lcl|NC_020838. 732 FD----KTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFNVNPYRTYSTSTKKTTVNLPFDHITGKKL 807 (981) Q Consensus 732 ~~----~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~vd~~~ty~~~~~~tt~~l~~~~l~g~~v 807 (981) ++ +|.||++++|+.+. ....++|++.....+...|++++||+++|.+.+..+ ..+++||+|+++ T Consensus 474 ~i~~~~~d~l~~vv~r~~~g----------~~~~yie~~~~~~~~~~~~~~~vD~~~t~~~~~~~~--~sgl~~leG~tv 541 (681) T protein:vir:98 474 VVAEGNEDRLYAVVRRTIGG----------NEVRYVERMASRQFDAQADAFFVDSGLTYSGEPVSH--ISGLEHLEGKTV 541 (681) T ss_pred EecCCCCcEEEEEEEecCCC----------CeEEEEEecCCccccccccceEeeccccccCcceee--eccccCCCCcEE Confidence 97 67899999996442 222333333333445556889999999998776543 346799999999 Q ss_pred EEEEcCcccCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeE Q lcl|NC_020838. 808 AVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDL 887 (981) Q Consensus 808 ~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl 887 (981) .+++||... ++.+|.+|.|+|+ .++++|+|||+|+++++|+||+++.++|...+ .++ T Consensus 542 ~i~aDG~~~-----------------~~~~V~~G~itl~--~~~~~v~VGl~Y~s~i~~lp~~~~~~~g~~~g----~~~ 598 (681) T protein:vir:98 542 SILADGAVH-----------------PQRVVTDGAIDLD--VEAGTVHIGLPITAELQTLPVAMQLDGSFGQG----RVK 598 (681) T ss_pred EEEeCCeec-----------------CcEeecCcEEEeC--cCCceEEEeeeceeEEEecceeeecCCcccCC----ceE Confidence 999998744 3445778999996 56899999999999999999999988876443 467 Q ss_pred EEEEEEEEeecccceEEEEccCCCCceeeEecccccCccccCccccccCCeEEEEe-eccCcceEEEEEECCCCCEEEEE Q lcl|NC_020838. 888 ILHRLKVSTGLSGPITYKVDITGKDEWTNIINVTLPNTYVLNNVNLSASALHDVPI-YQRNENVNIKIIGDTPFPISLLN 966 (981) Q Consensus 888 ~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~vp~-~g~~~~~~v~I~~~~PlPltvls 966 (981) +|+|+.|++.+|+++.++++....+++..+ ..+.+|.++...++.+++|+ .+|+++.+|+|+|++|+||+|+| T Consensus 599 ri~rv~lr~~~S~g~~~~~~~~~l~~~~~~------~~~~~g~~~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvls 672 (681) T protein:vir:98 599 NINKLWLRVHRSSGIFAGPHADALTEVKQR------TSEPYGSPPALKSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVS 672 (681) T ss_pred EEEEEEEEEEcccceEEeeCCCceEEEEEe------ccccccccCCccCCeEEEEeCCCcCcceEEEEEECCCcCEEEEE Confidence 999999999999999999877665554332 23445666555556778887 58999999999999999999999 Q ss_pred EEEEEEeec Q lcl|NC_020838. 967 IVWEGNYNR 975 (981) Q Consensus 967 i~weg~y~~ 975 (981) |+||-...= T Consensus 673 i~~ev~vgg 681 (681) T protein:vir:98 673 MSAEIAIGA 681 (681) T ss_pred eeEEEEeeC Confidence 999999988 No 25 >protein:vir:102644 Length: 594 # NCBI annotation: Hypothetical protein # Family: family:all:780 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024422;genbank:gi:48696643;genbank:GeneID:2948111 Probab=100.00 E-value=1.7e-105 Score=595.02 Aligned_cols=560 Identities=12% Similarity=0.093 Sum_probs=379.4 Q ss_pred eEEEEEeeeccceEEEEeeCceEEEEEeccCCCcceecHHHHHHHHHhhhhccCceEEEEcCC------EEEEEe---CC Q lcl|NC_020838. 333 VAFVVIRIVAYNSDYSVTLNGTTVTHSTPDTVAGATTDSGSIAAALTSSINALTGFSATQVGP------GIYIEG---TS 403 (981) Q Consensus 333 ~~~v~v~~g~y~~~~~v~~ng~~~~~~t~~~~a~~~~~~~~ia~~l~~~i~~~~~~~~~~vg~------~i~i~~---~~ 403 (981) ++. ..+-.+++-..+... ..-++.+..+..+...- ++-....|+ +.++.. ++ T Consensus 1 m~~----------~~~~~F~~GelsP~l-----~~r~Dl~~y~~~~~~~~----n~~~~~~G~~~rR~G~~~~~~~~~~~ 61 (594) T protein:vir:10 1 MAD----------FSQTSFKGGVIAPRL-----QFNEYESAYHHSIEDAV----NFVVTEQGSLITRCGSEEVGLCQDGE 61 (594) T ss_pred Cce----------eeccccCcceeccee-----ccchhHHHHHHHHhhhh----ceEEEecCCeecCChhHhhhhccCCC Confidence 000 000011111111100 12234444444443332 233332232 222211 11 Q ss_pred C--ceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCC-cccceEEEEEcccCCcccceEEEEeecccee Q lcl|NC_020838. 404 A--FSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDV-TADDIYVEFQTTNSAARGPGVWEETIGPSLE 480 (981) Q Consensus 404 ~--~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~-~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~ 480 (981) . .-+...-+.....+-.+.+.- +......|..+... .+.. +....| .....+.-.+..|... .+.. T Consensus 62 ~~~~lipF~~s~~~~~~le~g~~~-----~r~~~~~~~~v~~~-~~~~~~~~tp~---~~t~~~~l~~i~~tqs--ad~~ 130 (594) T protein:vir:10 62 VRLFRLPAVDAPSNDVIVEVGNTN-----IAVWVNDVRQVVAN-TPSEWRNTIDR---IQTAYDTIGDDAGAAN--TGRL 130 (594) T ss_pred CCEEEEEEEeCCCCeEEEEEcCCe-----EEEEecCcEEEEcc-CCCcccccccc---eeeccCCccceEEEEE--eeEE Confidence 0 111111111111111111000 00111122222111 1110 001111 1111112233456544 3456 Q ss_pred EEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEecC----CeEEEEecccccc Q lcl|NC_020838. 481 FEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLLSN----EAVIMSRAGDYFN 556 (981) Q Consensus 481 ~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~s~----~~V~~Srtgdy~N 556 (981) +...+++||+.|+|.+++.|++...+|..+..++.+ .+||++|+||||||+|++. ++|||||+||||| T Consensus 131 ~~~~~~~~p~~L~R~~~~~w~~~~~~~~~~p~~~~~--------~~~p~~v~f~q~RL~f~~~~~~p~~v~~Srtgd~~n 202 (594) T protein:vir:10 131 IMVHPALQPKRLYRDNNNAWQFVNMHTGAVPAEWSP--------SNYPQTVGIFQNRVWYVGSPVHRTYFWATRAGKLED 202 (594) T ss_pred EEEcCCCCceEEEEccCCCceEEecccCcccccccC--------CccceEEEEEeeeEEEEeCCCCCceEEEEecccccc Confidence 778899999999999999999999999876555442 4588999999999999984 6899999999999 Q ss_pred ccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCC-ccccccceEEEEEeeeccccCCCcEEe Q lcl|NC_020838. 557 FFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDA-DILSPTTTKINTISTFECDAEIDAVAV 635 (981) Q Consensus 557 F~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~-~~LTP~~~~i~~~S~~~~s~~v~Pv~v 635 (981) |+++++ +.|||||++++ +++.+.| |++++.++|+|||+++||+|++++ ++|||+|++++++|.+ +++.++|+.+ T Consensus 203 F~~~~~--~~ddd~i~~~~-s~~~~~~-~~v~~~~~L~i~t~~~e~~l~~~~~~~lTp~~~~~~~~s~~-g~~~~~P~~v 277 (594) T protein:vir:10 203 IAPSTA--NNPNDPISFVG-IMEGTPC-WIIASSDVLTIGTTINDYQLAASTGVSVTAATAILRRSSVQ-GTAAVQGIPA 277 (594) T ss_pred cccCCC--CCCCccEEEEE-ecccceE-EEEecCCceEEEecCceEEEecCCCcccccceEEEEEeeee-ccCCCcceee Confidence 999985 47999999954 4565555 557788899999999999999864 5899999999999965 6789999999 Q ss_pred CCeEEEEecCCCeeEEEEEeecccccceehhhHHHHHHHhcC-------CCeEEEEEcCCCcEEEEEecCCcEEEEEEee Q lcl|NC_020838. 636 GTTQAFISKSNLYSKLFLMLNVQKEAAATIDEATTNVPEYVP-------SDIDSMTASPAMSIVSLGKSGSNTVYQHRFF 708 (981) Q Consensus 636 G~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~~~~h~~~-------~~i~~~~~s~~~~~~~~~~~g~~~l~~y~y~ 708 (981) |+.++|+|++|+ +||||.|++++|+|+++|||+|++|++. ++|.+|+++.+|+.++|+..+++.+++++| T Consensus 278 g~~~~fv~~~g~--~vre~~y~~~~d~y~~~dlt~~a~hl~~~~~~~~~~~i~~~a~~~~p~~~~~~v~~dG~l~~~ty- 354 (594) T protein:vir:10 278 EEQVIFCSRNKS--KVYAMNYVREQDNWIPDEMSSQAQHLFTPISSAKGASVRRVAYISDAAKSLWVVLENGQINYCCF- 354 (594) T ss_pred CCeEEEEcCCCC--EEEEEEEeeccCceeccchhhhhhhhcCccccccCceEEEEEEecCCceEEEEEeCCCeEEEEEE- Confidence 999999999985 7999999999999999999999999984 578999999888888777777778877776 Q ss_pred cCcchheeeeEeec-cCCceEEEEEe----CCeEEEEEEcCCcE--EEEEEEeecCCceeEEEecCCCcccccccceeee Q lcl|NC_020838. 709 MQGENRVQTWYKWQ-LTGDLRLQFFD----KTTFYAVTSSGSNV--YLTSYDLTQASESGYLTLPTGEKTDVCLDMFNVN 781 (981) Q Consensus 709 ~~~eq~V~aWsrw~-~~G~v~sv~~~----~d~ly~vv~r~~~~--~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~vd 781 (981) ++||+|+|||||+ ++|.|+++|++ +|++|++|+|.+.+ .-.++. ++|++.........+.++++ T Consensus 355 -~~eq~v~aWs~~~~t~G~v~~va~i~~~~~d~l~~~V~R~~ti~g~~~~y~--------~lE~~~~~~~~~~~~~~~~d 425 (594) T protein:vir:10 355 -DRTTDTKAWTQLELSGGKVIDIAAAFNPDSDYAYVAVVRSKAINGVQKNYT--------VLEKISSPRTDWKRADGWVV 425 (594) T ss_pred -ecccceeeeEeeccCCCcEEEEEEeecCCCCEEEEEEEECCccccceeeEE--------EeecCCCccccccccceeee Confidence 5899999999998 58999999987 68999999996532 111111 12222222222233456788 Q ss_pred eeeeeccCCceEEEcccCCCcCCceEEEEEcCcccCceeEEEEecCceEEEccccccCCCEEEeC--CCCCCCEEEEEEe Q lcl|NC_020838. 782 PYRTYSTSTKKTTVNLPFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLLN--GDYRGRDLIIGYV 859 (981) Q Consensus 782 ~~~ty~~~~~~tt~~l~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~--gd~~~~~v~VGl~ 859 (981) ++.+|+.. ..+++||+|+++.|++||..+++ .+|.+|.|+|+ ++.++++|||||+ T Consensus 426 ~~~~~~~~------vsgl~hLeg~tv~v~aDG~~~~~-----------------~~V~~g~itL~~~~~~~~~~v~VGl~ 482 (594) T protein:vir:10 426 AQVNQNGD------VLNLDRYIGRTAVIFSKYGLEAE-----------------VEVNNIGLTHRINGYDPNTVYYVGYK 482 (594) T ss_pred ecccccce------eecccccCCceEEEEeCCeecCC-----------------eEEcCCeeEeeccCCCCcceEEEeee Confidence 88887521 23679999999999999975554 34678888886 4678999999999 Q ss_pred eeEEEEeCCceeeccCCCcccceEeeeEEEEEEEEEeecccceEEEEccCCCCceeeEecccccCccccCccccccCC-e Q lcl|NC_020838. 860 YDMELELPTLYPTQVEGRSSVSDVTSDLILHRLKVSTGLSGPITYKVDITGKDEWTNIINVTLPNTYVLNNVNLSASA-L 938 (981) Q Consensus 860 y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~-~ 938 (981) |+++++++||+++.++|+.. .+|+||+|++|+|.+|+++.++.+...... .....+.+.....+.+++.++. . T Consensus 483 Y~s~i~~lp~~~~~~~gs~~----g~r~ri~r~~v~~~~S~g~~vg~~~~~~r~--~~~~~~~~~~~~~g~~~~~tg~~~ 556 (594) T protein:vir:10 483 MDSYFRTLTPSNGDMKKSMF----GSKIRISKVQLALFDSIEPTVNGEPADDRS--TDDIMDARLLDFSSNSGSSNGTRL 556 (594) T ss_pred eeEEEEeecccccCCccccc----CccEEEEEEEEEEEcceeeEECCccccccc--chhhccccCCcccCcccccCCceE Confidence 99999999999988877543 358999999999999999988754322111 1111222223344556666554 3 Q ss_pred EEEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeecc Q lcl|NC_020838. 939 HDVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRR 976 (981) Q Consensus 939 ~~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r 976 (981) +.++..||+++++|+|+|++|||||||||.+|...+.= T Consensus 557 v~~~~~G~~~~~~i~I~qd~PlPltvlai~~ev~~~~~ 594 (594) T protein:vir:10 557 VDYNPLGWENDGKMVIAVEQPFLCEVVGVFSVVQSNKV 594 (594) T ss_pred EEEccCCcCcccEEEEEECCCcCEEEEEEEEEEEeccC Confidence 45667899999999999999999999999999999999 No 26 >protein:vir:100022 Length: 976 # NCBI annotation: T7-like tail tubular protein B # Family: family:all:825 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214208;genbank:gi:61806431;genbank:GeneID:3294702 Probab=99.92 E-value=3.1e-22 Score=138.51 Aligned_cols=704 Identities=13% Similarity=0.090 Sum_probs=238.0 Q ss_pred eEEEeeccccceeccccccccccccCCcccccc----eEeecceeEeeeeeeeEEeeCceeeEeeeccCCcCcCc----e Q lcl|NC_020838. 189 IVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYP----WFKRDGYRVYEVEKEVAAAYNSTELSTANTNMGTAQTA----Y 260 (981) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~g~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~----~ 260 (981) +..+. +.--+...|-+-+...-=..+|..++. -. -.|...|++++ |+++|.+.+..+..+ + T Consensus 1 M~~v~-~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~-v~Gl~kRp~~~---------~v~~l~~~~~~~~~~~~~~~ 69 (976) T protein:vir:10 1 MASVT-QTIPTLTGGLSQQPDELKIPGQVSVANNVIPDV-THGLLKRPGGK---------LVASISDNGTAALNSQTNGK 69 (976) T ss_pred Cccee-ecchhhhCcceecchhhcCCchhhhhhcccccc-ccccccCCcce---------eeeeecCCCcccccccccce Confidence 11111 111112223222222211222222211 11 13555555554 445555555555444 4 Q ss_pred EEEEEcccccceE-----------Eec-ceEEEEeeeeccc---ceeecccCCcceeeEEEEcCEEEEeCCceeEeccCC Q lcl|NC_020838. 261 DNAVSTESTEKGD-----------YDS-EVTACNIGSSNIP---ASAYLKDAAPEDIEILTINDYTFVLNKNKTTAMKTT 325 (981) Q Consensus 261 ~~~i~~~~~~~~~-----------~~~-~g~~~~~~~~~~~---~~~y~~~~~~~dl~~~t~ad~tfi~n~~~~~~~~~~ 325 (981) .|+|+||+.|+|+ ++. +|+.+.|+..+.. ..+|++++..+||||+||||||||+|++++|++.++ T Consensus 70 ~~~~~r~~~e~y~~~~~~~g~~~v~~~~~G~~~~v~~~~~~~~~~~~yl~~~~~~~~~~~tv~d~tfi~N~~~~~~~~~~ 149 (976) T protein:vir:10 70 WFSYYRDETESYIGQVSRSGDINMWRCSDGQAMTVNYDSGTATALTTYLTHTNDEDIQTLTLNDYTFLTNRTKTVAMSST 149 (976) T ss_pred EEEEEcCCCcEEEEEEecCCceEEEEccCCeEEEEEcCCCcccccchhhccCCcceeEEEEEccEEEEecCceEEeeccc Confidence 5999999999996 445 5777777765532 456888765568999999999999999999999998 Q ss_pred CCCCCCceEEEEEeeeccceEEEEeeCceEEE----EEeccC---------CCcceecHHHHHHHHHhhhhcc------- Q lcl|NC_020838. 326 TSAAVPNVAFVVIRIVAYNSDYSVTLNGTTVT----HSTPDT---------VAGATTDSGSIAAALTSSINAL------- 385 (981) Q Consensus 326 ~~~~~~~~~~v~v~~g~y~~~~~v~~ng~~~~----~~t~~~---------~a~~~~~~~~ia~~l~~~i~~~------- 385 (981) ..+..++.|+++|++|+|+|+|+|+|+|...+ ++++.. .+.+.-....++.+...+++.. T Consensus 150 ~~~~~~~~~~~~v~~~~y~~~y~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~s~~~G~ 229 (976) T protein:vir:10 150 VEPVRPPEVFIDLKATAYARQYAVNLFDNTTTTAVSTVTRIDVELIKSSNNYCDSNGAMVARTSRPSNSTRCDDSAGDGR 229 (976) T ss_pred ccCCCCceEEEEeeeeccceEEEEEEcCCcccceeeeeeeeeeccccCCcccccccccchhhHhHhhhhhcccccccccc Confidence 88888899999999999999999999986433 222211 1112222222333333332211 Q ss_pred ---CceE----EEE-cCCEEEEEeCCCce--EEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceE Q lcl|NC_020838. 386 ---TGFS----ATQ-VGPGIYIEGTSAFS--ISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIY 455 (981) Q Consensus 386 ---~~~~----~~~-vg~~i~i~~~~~~~--vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yy 455 (981) +.++ +.. .|....+...+... ....++.....+....+.+..+..+|...+. ..++.....++|+ T Consensus 230 ~~~~~~v~~~~f~~~~G~~~~i~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~-----~~gt~~~~~~~Y~ 304 (976) T protein:vir:10 230 DAYAPNVGTKVFNVTDGASLTDEANSGSYTYTIDVKDSSNNSVNRGVNLYFRIRTVGQSVPF-----TTGSGSSATTTYQ 304 (976) T ss_pred cccCceeeeeEEEeccCccceEcCCcceEEEEeeccccceEEeecCCceEEEEccccceeec-----ccccccceeeeee Confidence 1111 111 12222232222222 2334455555566666666666666554432 2344456678899 Q ss_pred EEEEcccCCcccceEEEEeeccceeEEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCc-------cccCC-- Q lcl|NC_020838. 456 VEFQTTNSAARGPGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIP-------SFIGK-- 526 (981) Q Consensus 456 v~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~p-------sF~g~-- 526 (981) ++++.... +|.+..+.. .+......++...-..+++...+....+++....|.+ .+... T Consensus 305 ~~y~~~~~------v~~~~~g~~------~~~~~~V~v~g~~Y~it~~~~~~~~~~a~~~~~~~~~t~~d~~~~~~~~~i 372 (976) T protein:vir:10 305 ARYTTTFD------LLYGGTGWQ------EGDYFYVWMKDGYYKITVEAISTANVQANLGLIRPNPTPFDTETAVTAESI 372 (976) T ss_pred EEEEeEEE------EecCCCCcc------cCceEEEEccccceeeEEEEeeceeEEeccccccCcCCcCcccccccHHHH Confidence 98877543 344333221 1122222233332233333334433333333322211 11110 Q ss_pred ------------ceeEEEEEcceEEEecCCeEEEEeccccccc----------cccccc-----cccCCccEEEEEc--C Q lcl|NC_020838. 527 ------------KINNMFFYRNRLGLLSNEAVIMSRAGDYFNF----------FANSSQ-----VVAPDDPIDLQAT--S 577 (981) Q Consensus 527 ------------~ps~v~ffq~RL~f~s~~~V~~Srtgdy~NF----------~~~s~~-----~~~DdDpI~~~i~--s 577 (981) ....+++.+. +..++++++...|+. ...++. |..--|-..+.+. . T Consensus 373 a~~L~~~l~a~~~~~g~tv~~~------g~~~~i~~~~~~~~~s~~~~~~~~~~~~~V~~~~~LP~~~~~g~~v~V~~~~ 446 (976) T protein:vir:10 373 IGDIRTAIIATGNFTSANVQQI------GTGLYVTRPSGTFNVTAPSSDLLRVMSGEVANVDDLPSQCKHGYVVKVANSE 446 (976) T ss_pred HHHHHHhhcccccccceEEEEc------CcEEEEEecCcceEecCCCceeEEEEEeeecchhhhhhhccCCcEEEEecCC Confidence 0122222222 222333332222211 111111 1110011112221 1 Q ss_pred CCce--eEEEEeecCCcEEEEecCcEEEEE---cCCccccccceEEEEEeeeccc-----------------cCCCcEEe Q lcl|NC_020838. 578 VKPV--TLNYTLATSIGLLVFGPNEQFVLS---TDADILSPTTTKINTISTFECD-----------------AEIDAVAV 635 (981) Q Consensus 578 ~~~n--~I~~~v~~~~~L~l~T~g~q~~l~---g~~~~LTP~~~~i~~~S~~~~s-----------------~~v~Pv~v 635 (981) ...+ -++|...... ...+.|.-. +..--+.+.+......-.=.|+ .+-.|--+ T Consensus 447 ~~~d~yyv~~~~~~~~-----~~~~~w~E~~~~g~~~g~~~~tmP~~l~~~~~g~f~~~~~~w~~r~vGd~~tnp~psf~ 521 (976) T protein:vir:10 447 ADADDYYVKFFGHNNR-----DGDGVWEECAKPSRNIEFDKGTMPIQLVRQANGTFTVSQATWQNAEVGDELTNPNPSFV 521 (976) T ss_pred CCceeEEEEeeccccc-----cccceEEEeeccccccccccccccEEEEecccCeEEeeeccccccccCCcccCcCceec Confidence 1111 1222111000 001123221 1111133333222111100111 11233444 Q ss_pred CCe---EEEEecCCCeeEEEEEeecccccceehhhHHHHHHHhcCC-CeEEEEEcCCCcEEEEEecCCcEEEEEEeecCc Q lcl|NC_020838. 636 GTT---QAFISKSNLYSKLFLMLNVQKEAAATIDEATTNVPEYVPS-DIDSMTASPAMSIVSLGKSGSNTVYQHRFFMQG 711 (981) Q Consensus 636 G~~---v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~~~~h~~~~-~i~~~~~s~~~~~~~~~~~g~~~l~~y~y~~~~ 711 (981) |+. |.|-|.|=.+.+=..+..+...|-++-. .+. +..+-.. .|..-..+...+.+-|+..-...|+.++ .+ T Consensus 522 g~~is~v~f~q~RL~f~s~~~v~~Srtgd~~nF~-~~t-~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T---~g 596 (976) T protein:vir:10 522 GKTINQLVFFRNRLVFLSDENVIMSRPGEFFNFW-SKT-ATTFTPQDVIDLSCSSTYPAIVYDGIQVNAGLLLFT---KN 596 (976) T ss_pred ccccceEEEEcceEEEecCCeEEEEecCCccccc-ccc-ccCCCCCccEEEEecCCcceeeEEEEecCCcEEEEe---cC Confidence 443 3333332000000011111111111110 110 0011122 3443344555566555544322344443 23 Q ss_pred chheeeeEe-eccCCce--EE-----------EEEeCCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCcccccccc Q lcl|NC_020838. 712 ENRVQTWYK-WQLTGDL--RL-----------QFFDKTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDM 777 (981) Q Consensus 712 eq~V~aWsr-w~~~G~v--~s-----------v~~~~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~ 777 (981) .|=+.+..- --++.++ .. ...+++.++++-+++...-+-.+......+.... .....+. T Consensus 597 ~e~~lsg~~~~lTP~t~~i~~~s~~~~~~~v~Pv~vG~~v~Fv~~~g~~~r~~~~~~~~~~~~~~~-----~dlt~~~-- 669 (976) T protein:vir:10 597 QQFMLTTDSDILSPETAKINAVSSYNFNEKTHPVSLGTTVAFIDNANQFTRFFEMSNVVRQGEPDV-----VDQSKVI-- 669 (976) T ss_pred ceEEEecCCceecceeEEEEEEEeeeccCCCccEEeCCeEEEEecCCCeEEEEEEeecccccccch-----hHHHHHh-- Confidence 332222210 1122221 10 1124777887766664332222222221111100 0000011 Q ss_pred eeeeeeeeeccCCceEEEcccCCCcCCceEEEEEcCcccCceeEEEEecCceEEEcccc------c-cCCCEEEeCCCCC Q lcl|NC_020838. 778 FNVNPYRTYSTSTKKTTVNLPFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDS------D-ISSNQVLLNGDYR 850 (981) Q Consensus 778 ~~vd~~~ty~~~~~~tt~~l~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~------t-v~gg~itL~gd~~ 850 (981) .||-...+...+.. ..+..++.....+|.+..+... . ..|-.-+++|... T Consensus 670 ----------------------~~l~~g~~~~~a~~-~~~~~vv~~~~~~g~l~~~ty~~~~~eq~v~aWsr~~~~G~v~ 726 (976) T protein:vir:10 670 ----------------------SRLLDKNISLVSVS-RENSVVFFSQKDTDKIYCFRYFTSGEKRLLQAWTTWTITGNIQ 726 (976) T ss_pred ----------------------hhhcCCceEEEEEc-CCCcEEEEEEcCCCEEEEEEEeecCCceeEEeeEEEecCCcEE Confidence 11111111111111 1122222222223332222110 0 0111111111100 Q ss_pred C-----CEEEE-------E--EeeeEEEEeCCceeeccCCCccc-ceEeeeEEEEE-EEEE------------------- Q lcl|NC_020838. 851 G-----RDLII-------G--YVYDMELELPTLYPTQVEGRSSV-SDVTSDLILHR-LKVS------------------- 895 (981) Q Consensus 851 ~-----~~v~V-------G--l~y~s~v~~~~~~i~~~~g~~~~-~~~~grl~l~r-~~v~------------------- 895 (981) . ..+++ | ..|.-++.+.........+.... .....|+.|.. ..+. T Consensus 727 sv~~i~D~ly~vV~r~~~g~~~r~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~~~~~~~~t~~~~t~~t~~~~~~ 806 (976) T protein:vir:10 727 YHCMLDDALYVVTRNNNKDQIVKYSLKLDDAGHFVTDTQGTTSTDDDSIYRVHLDHSSSVTAASNTYNTTTIKTTIPKPN 806 (976) T ss_pred EEEEeCCeEEEEEEecCCeEEEEEEEEECCccceeeeccCccccccCCcceeeeccceEEEeccccccCCceeEEeecCc Confidence 0 00000 0 00000110000000000000000 00000000000 0000 Q ss_pred -eecccceEEEEccCCCCceeeEecccccCccccCccc--cccCCeEEEEe--------eccCcce-----EEEEEECCC Q lcl|NC_020838. 896 -TGLSGPITYKVDITGKDEWTNIINVTLPNTYVLNNVN--LSASALHDVPI--------YQRNENV-----NIKIIGDTP 959 (981) Q Consensus 896 -~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~p--~~~~~~~~vp~--------~g~~~~~-----~v~I~~~~P 959 (981) .+.++.+.+- ...+.. ..+.+. ..+++..++|- -|..-.. .+.|+..+. T Consensus 807 ~~~~~~~~~~~-~~d~~~--------------~~~~~~~~~v~g~~i~l~g~~~~~~v~VGl~Y~s~~~~~~~~i~~~~g 871 (976) T protein:vir:10 807 GYESTKQLVAY-DTDAGN--------------DLGRYALVTVSGSNLEIPGNWSNNSFIIGYLYEMDVQLPTLYVTQQVG 871 (976) T ss_pred cccCceeEEEE-ecccCc--------------ccccceeeeecCCeeEecCCCCCCeEEEeeeeEEEEeecceeEEeCCC Confidence 0000000000 000000 000000 01111111110 0110000 111221111 Q ss_pred CC--------EEEEEEEE----EEEe--------------eccccccC Q lcl|NC_020838. 960 FP--------ISLLNIVW----EGNY--------------NRRFYRRS 981 (981) Q Consensus 960 lP--------ltvlsi~w----eg~y--------------~~r~rRr~ 981 (981) -+ +.|..+.+ -|-| ..+..|+. T Consensus 872 ~~~~~~~~gRl~i~r~~~~~~~tg~~~v~v~~~~~~~~~~~~~~~~~~ 919 (976) T protein:vir:10 872 DKYRSDAKSSLIVHRIKFSFGPLGVYSTTIQRDGKPDFTETKELGLAG 919 (976) T ss_pred CcccccceeeEEEEEEEEEeecccceEEEEcCCCCccccccccccccC Confidence 11 11111111 1111 00111100 No 27 >protein:vir:78703 Length: 905 # NCBI annotation: tail tube B # Family: family:all:825 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285450;genbank:gi:148724484;genbank:GeneID:5220174 Probab=99.87 E-value=6.9e-20 Score=125.61 Aligned_cols=703 Identities=13% Similarity=0.090 Sum_probs=210.7 Q ss_pred eEEEeeccccceeccccccccccccCCcccccc----eEeecceeEeeeeeeeEEeeCceeeEeeeccCCcCcCceEEEE Q lcl|NC_020838. 189 IVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYP----WFKRDGYRVYEVEKEVAAAYNSTELSTANTNMGTAQTAYDNAV 264 (981) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~g~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~i 264 (981) +..+. +.--+...|-+-+...-=..+|..++. -. -.|...|+++++ +++|.+. .++++++|+| T Consensus 1 M~~v~-~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~-v~Gl~kRp~~~~---------i~~l~~~--~~~~~~~~~~ 67 (905) T protein:vir:78 1 MGAVL-QKIPNLLGGVSQQPDPVKLPGQVREAENVYLDP-TFGCRKRPATKF---------VGELATN--LPSDTRWFPI 67 (905) T ss_pred Cccce-ecchhhhCceeecchhhcCCcchhhhhcccccc-ccccccCchhhh---------hhhhcCC--CCCCceEEEE Confidence 11111 111111122222221111122222111 11 136665555544 4555543 4688999999 Q ss_pred EcccccceE---------------Eec-ceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCceeEeccCCCCC Q lcl|NC_020838. 265 STESTEKGD---------------YDS-EVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKNKTTAMKTTTSA 328 (981) Q Consensus 265 ~~~~~~~~~---------------~~~-~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~~~~~~~~~~~~ 328 (981) +||+.|+|+ |++ +|+.+.|+.+. .+.+|+++++.++|||+||||||||+|++++|++.++..+ T Consensus 68 ~r~~~e~y~~~~~~~g~~~~~i~v~d~~~G~~~~V~~~~-~~~~yl~~~~~~~l~~~tv~d~tfi~N~~~~~~~~~~~~~ 146 (905) T protein:vir:78 68 FRDAGERYAVALYKDGSGNTQVRVWDMQTGAERTVTPDA-TATAYLATTNLNNLNWLTVADYTLLSNKERIVTMSGASEV 146 (905) T ss_pred EeCCCceEEEEEeeCCCCCcceEEEEccCCcEEEEecCC-CccceeecCCCcceEEEEEcCEEEEEcCceeeeecCCCCc Confidence 999999986 344 46666665433 4678999887679999999999999999999999999999 Q ss_pred CCCceEEEEEeeeccceEEEEeeCceEEEEEec----cCCCcceecHHHHHHHHHhhhhc--cCceEEE-EcCCEEEEEe Q lcl|NC_020838. 329 AVPNVAFVVIRIVAYNSDYSVTLNGTTVTHSTP----DTVAGATTDSGSIAAALTSSINA--LTGFSAT-QVGPGIYIEG 401 (981) Q Consensus 329 ~~~~~~~v~v~~g~y~~~~~v~~ng~~~~~~t~----~~~a~~~~~~~~ia~~l~~~i~~--~~~~~~~-~vg~~i~i~~ 401 (981) ..+++|+++|++|+|+|+|+|.||+..+..+.. ........+.+.+.......+.. ..+.+.. ..+..+.+.- T Consensus 147 ~~~~~~~~~v~~g~y~r~y~v~I~~~~~~~~~t~~~~~a~~~s~~s~~~~~~g~~~~~~~~~~~~~t~~~~~~l~f~~~~ 226 (905) T protein:vir:78 147 DSNQRALVEINAISYNTTYSIDLDRDGASQQVKVYRAKALEISPGSFEVEDGGVCTEHDVQNYTNQTIGSSTGLAFQVRV 226 (905) T ss_pred CCCCeEEEEEEeeccceeEEEEEeCCCCceeeeeeccccceeccccccccccccccccceeeeecceeeccCCceeEEee Confidence 999999999999999999999999875543322 22211111222221111111110 0110000 0011111110 Q ss_pred -------------CCCceEEEecCcCcc----eeE----EEEEEE-------------------------ee-------- Q lcl|NC_020838. 402 -------------TSAFSISTSGSTTEE----GIF----AFQDQI-------------------------NV-------- 427 (981) Q Consensus 402 -------------~~~~~vt~~~g~~~t----~~~----~~~~~v-------------------------~~-------- 427 (981) +.+..+++-.|..++ .+. .-.+.+ ++ T Consensus 227 ~~~~~~~~~~~~~~~~~~~~l~~g~~~~~~~~~~~v~~~g~~y~i~i~~~~~~~~~~~~~~~~~~t~~d~~a~~~~~~~i 306 (905) T protein:vir:78 227 QCAAYLENNEYRSRYNVSVVLQNGGTGFRKGDMITVNLNGRDYNIRVTQEEFVYTYASDGTAAHTTPQDSTAGTLDIGQI 306 (905) T ss_pred ccccccCCCcccccccceeeeeccccccccCccEEEeeccceEEEEEecceeEEEecCCCcccccCccCccCccccHHHH Confidence 000111111111110 000 000000 00 Q ss_pred chhccccC--CC-------CcEEEEEccCCCcccceEEEEEcccCCcccceEEEEeeccceeEEEccccceE----EEEe Q lcl|NC_020838. 428 ASRLPNQC--EN-------GYRVRVTNSGDVTADDIYVEFQTTNSAARGPGVWEETIGPSLEFEIDETTMPH----QLIR 494 (981) Q Consensus 428 ~~~Lp~~~--~~-------G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~----~lv~ 494 (981) .+.|.... .. |..+.|......+ + .+... .|..+...|.-........++ ++.+|+ +++. T Consensus 307 ~~~l~~~~~~~~~~~~~~~g~~i~v~~~~~~~---~--~~~~~-~g~~~~~~~~~~~~v~~~~~L-p~~~~~g~~v~v~~ 379 (905) T protein:vir:78 307 TAGLVNSVNLISNYSAQAVGNVIEIERTDGRD---F--NLGVR-GGATNRAMTAIKGTANSIVDL-PGQCFDGFELKVIN 379 (905) T ss_pred HHHHHHhhcccccEEEEecCcEEEEEecCCCc---c--EEEEe-ccCCcceEEEEeccccccccC-ccccCCCcEEEEEe Confidence 00010000 00 1111111110000 0 00000 111111111100000011111 111221 1111 Q ss_pred ccCCc---ee----------eecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEecCCeEEEEecc---cccccc Q lcl|NC_020838. 495 QANGV---FK----------YEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLLSNEAVIMSRAG---DYFNFF 558 (981) Q Consensus 495 ~a~g~---f~----------~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~s~~~V~~Srtg---dy~NF~ 558 (981) ..... |- ..+..|.+....+ -...|.... | .-+|+=.+..+.-+.... .+..|. T Consensus 380 ~~~~~~d~yyv~~~~~~~~~~~~~~W~E~~~~~----~~~~~~~~t---m---p~~l~r~~~g~f~~~~~~~~~~~~~~~ 449 (905) T protein:vir:78 380 TENAESDDYYVVFRSAAEGIPGSGSWEETVAPG----IERGFNTST---M---PHALIRQADGNFTLEALNDEGTITGWA 449 (905) T ss_pred CCCCCcceEEEEEEecccCCcCceeEEEecccc----ccccccccc---c---cEEEEEecCceEEEEEecccccccccc Confidence 11111 00 0111232211100 000111111 1 112221122222221111 111232 Q ss_pred ccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEE-E--cCCccccccceE-------EEEEeeecccc Q lcl|NC_020838. 559 ANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVL-S--TDADILSPTTTK-------INTISTFECDA 628 (981) Q Consensus 559 ~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l-~--g~~~~LTP~~~~-------i~~~S~~~~s~ 628 (981) ... +-|||.=.. .+---+.|..+.-+...|.+.+ .|+++ + |+-.-+.|+++. |....+-.-.+ T Consensus 450 ~r~---~Gd~~Tnp~--psf~g~~is~v~f~q~RL~f~s--~~~v~~Srtgd~~nF~~~t~~~~~DdDpI~~~~ss~~~~ 522 (905) T protein:vir:78 450 QRE---VGDDDTNPK--PSFVGRGISDMFFYNNRLGFLS--EDAVIMSQPGDYFNFFVTSAITISDSDPIDVTASSTKPA 522 (905) T ss_pred ccc---cCCcccCCC--CcccCCCcceEEEEcceEEEec--CCeEEEEccCCccccccccccCCCCCccEEEEEcCCcce Confidence 222 122211111 0111244566666766665554 34444 3 221235555431 11111111112 Q ss_pred CCCc-EEeCCeEEEEecCCCeeEEEEEeecccccceehhhHHHHHH-Hh-cCCCeEEEEEcCCCcEEEEEecCC-cEEEE Q lcl|NC_020838. 629 EIDA-VAVGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEATTNVP-EY-VPSDIDSMTASPAMSIVSLGKSGS-NTVYQ 704 (981) Q Consensus 629 ~v~P-v~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~~~~-h~-~~~~i~~~~~s~~~~~~~~~~~g~-~~l~~ 704 (981) .++= +..+..++.-..++.| ....+.+......+++... .+ +...+.= -.-...++++...|. ..+.- T Consensus 523 ~i~~~v~~~~~L~ifT~g~ef------~lsg~~~~lTP~s~~i~~~S~~~~~~~v~P--v~vG~~vlFv~~~g~~s~vre 594 (905) T protein:vir:78 523 ILRAAIGAPKGLILFAENSQF------LLASQEVVFSTATIKLTEISDYFYRSLAKP--VSTGVSIAFVSEADTYSKIFE 594 (905) T ss_pred eeEEEeecCCcEEEEecCceE------EEecCCccccceeEEEEeEEeecccCCCCc--EEeCCeEEEeecCCCeeEEEE Confidence 2332 2233334333343332 2222111111111111000 00 0000000 000112333333332 12422 Q ss_pred EEeecCc----chheeeeEeeccCCceEEEEEeCCeEEEEEEcCC-cEEEEEE--------------------------- Q lcl|NC_020838. 705 HRFFMQG----ENRVQTWYKWQLTGDLRLQFFDKTTFYAVTSSGS-NVYLTSY--------------------------- 752 (981) Q Consensus 705 y~y~~~~----eq~V~aWsrw~~~G~v~sv~~~~d~ly~vv~r~~-~~~l~~~--------------------------- 752 (981) |.|-... .+++..=.+|-+.|.+..++.-....+++..+++ .+++-++ T Consensus 595 ~~y~~~~d~y~a~DlT~~a~hl~~g~v~~~~~s~~~~~v~~~~~~~~l~~ytyl~~~~eq~v~AWsrw~~~G~~~~~a~i 674 (905) T protein:vir:78 595 MSIDSVDNRPQVADITRIVPEYVPTGLTWSVSTPNNSMMLFGDNSNTAYIFKFFNQGNERQVAGWSKWILPGEQRMCGFF 674 (905) T ss_pred EEeeecccceehhHHHHHHHHhcCCceEEEEecCCCcEEEEEcCCCeEEEEEeecCCCceeEEeEEEEecCCCeEEEEEE Confidence 2221111 1233345556666665544332222222222222 2222111 Q ss_pred ---------EeecCCceeEEEecCCCcccccccc---eeeeeeeeeccCCceEEEcccCCCcCCceEEEEE-cCcccCce Q lcl|NC_020838. 753 ---------DLTQASESGYLTLPTGEKTDVCLDM---FNVNPYRTYSTSTKKTTVNLPFDHITGKKLAVVA-IGTYIGDT 819 (981) Q Consensus 753 ---------~~~~~~~~~~~~~~~~~~~~~~lD~---~~vd~~~ty~~~~~~tt~~l~~~~l~g~~v~v~a-dG~~~~~~ 819 (981) +...+....+.+++..+.....+|. ++.++...|..++..+....+- ....+.. .|..+ T Consensus 675 ~d~~~~vV~r~~~G~~~~~~~~l~~~~~~~~~d~~~~~~~~~~d~~~~~~~~t~~~~~~-----~~~~~~~~~~~~~--- 746 (905) T protein:vir:78 675 ADTGYFVLYDSTTGSYVLSAMELLDDPDSASIDTAFSSFLPRLDNYVVKSDLTVVDNGD-----GTLTVDLEAGQAM--- 746 (905) T ss_pred cCCEEEEEEEccCCeEEEEEEeeccccCccccccceeeeeeccceeeecccceecccCc-----ceEeeeccCcccc--- Confidence 1111111111111111111111111 1112222222222111110000 0000000 00000 Q ss_pred eEEEEecCceEEEccccccCCCEEEeCCCCCCCEEEEEEeeeEEE--EeCCceeeccCCCcccceEeeeEEEEE-EEEEe Q lcl|NC_020838. 820 ISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLIIGYVYDMEL--ELPTLYPTQVEGRSSVSDVTSDLILHR-LKVST 896 (981) Q Consensus 820 ~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~VGl~y~s~v--~~~~~~i~~~~g~~~~~~~~grl~l~r-~~v~~ 896 (981) .++.+ +.+..+..+ ....+.+ .+.....+......|.. ....+ T Consensus 747 -------------------~~~~~------------~~~~~dG~~~~~~~~~~~---~~~~~t~~~a~~v~VGl~Y~s~v 792 (905) T protein:vir:78 747 -------------------TGATP------------VIMFTDGPSEFAFSQPTI---TAGQFTVDTTDDFVVGFKYETKI 792 (905) T ss_pred -------------------cccee------------EEEeeCCceeeeEEEEEe---eceeeccccCCeEEEeeeeeEEE Confidence 00111 000001000 0000100 01000000001111111 11111 Q ss_pred ecccceEEEEccC---CCCceeeEecccccCccccCccccccCCeEEEEee----ccC----------cceEEEEEECCC Q lcl|NC_020838. 897 GLSGPITYKVDIT---GKDEWTNIINVTLPNTYVLNNVNLSASALHDVPIY----QRN----------ENVNIKIIGDTP 959 (981) Q Consensus 897 ~~Sg~~~v~v~~~---~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~vp~~----g~~----------~~~~v~I~~~~P 959 (981) . ...+.+..+.. .+.........++..+... ..-+..++...+... ..+ .++.+++-...+ T Consensus 793 ~-~~p~~~~~~~~s~~~~~~rI~rv~lr~~~Sg~~-~v~v~~~~~~~~~~~~~~~~~~~~~~~~~p~~~tg~~~vP~~g~ 870 (905) T protein:vir:78 793 T-LPGFFTSEENKADRVYAPIVEFLYLDLYYSGRY-QIEVDRIGYDTINIDAGSIDANIYLADGAPLKEIATENVPLFTP 870 (905) T ss_pred e-ecceEeccCCCcccccceEEEEEEEEeecceeE-EEEEcCCCcceecccccceecCcccCcccccccccEEEEEeecc Confidence 1 22333222111 1111122222222111000 000000000000000 000 011111111111 Q ss_pred CCEEEEEEE---------EEEEeeccccccC Q lcl|NC_020838. 960 FPISLLNIV---------WEGNYNRRFYRRS 981 (981) Q Consensus 960 lPltvlsi~---------weg~y~~r~rRr~ 981 (981) ..=+-+-|. --..|-=++=||+ T Consensus 871 ~~~~~v~I~sd~PlP~tvlsi~weg~Yn~r~ 901 (905) T protein:vir:78 871 GDQVTVTIKAPDPFPSAITGYSWQGHYNRRG 901 (905) T ss_pred CceeEEEEEECCCCcEEEEEEEEEEEeccce Confidence 100001111 1122222222333 No 28 >protein:vir:1778 Length: 680 # NCBI annotation: tail protein A # Family: family:all:825 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570344;genbank:gi:18640503;genbank:GeneID:932716 Probab=99.82 E-value=1.7e-19 Score=123.45 Aligned_cols=545 Identities=12% Similarity=0.044 Sum_probs=195.5 Q ss_pred eEEEeeccccceeccccccccccccCCcccccc----eEeecceeEeeeeeeeEEeeCceeeEeeeccCCcCcCceEEEE Q lcl|NC_020838. 189 IVKDDGTNAGSIAAGSAMPSGYSLGNERTDDYP----WFKRDGYRVYEVEKEVAAAYNSTELSTANTNMGTAQTAYDNAV 264 (981) Q Consensus 189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~g~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~i 264 (981) +..+. +.--+...|-+-+...-=..+|+.++. -. -.|...|++++++ +.|.. .+.++++|+| T Consensus 1 M~~v~-~si~nl~~GvSqQp~~~r~pgQ~~~q~N~~~d~-v~Gl~kRpg~~~i---------~~l~~---~~~~~~~~~~ 66 (680) T protein:vir:17 1 MAAVE-QMVPNLLGGISQQPDPLKLPGQVKQARNVQLDP-TFGALKRPGTELI---------MQVTG---IPKRAKWIPI 66 (680) T ss_pred Cccce-ecchhhhCcceecchhhcCcchhhhhhccccCc-CcccccCccceee---------eeccC---CCCCceeEEE Confidence 11111 111111122222222111222332221 11 1366655555554 44432 4688999999 Q ss_pred EcccccceEE------------------ec-ceEEEEeeeecccceeec--ccCCcceeeEEEEcCEEEEeCCceeEecc Q lcl|NC_020838. 265 STESTEKGDY------------------DS-EVTACNIGSSNIPASAYL--KDAAPEDIEILTINDYTFVLNKNKTTAMK 323 (981) Q Consensus 265 ~~~~~~~~~~------------------~~-~g~~~~~~~~~~~~~~y~--~~~~~~dl~~~t~ad~tfi~n~~~~~~~~ 323 (981) +||+.|+|+. +. .|..+.|+..+..+..|+ +++...+|||+||||||||+||++++++. T Consensus 67 ~rd~~e~~~~~~~~~g~~~~~~~~i~v~d~~~G~~~~v~~~~~~~~~~~~~~~~~~~~lr~~tv~d~tfi~N~~v~~~~~ 146 (680) T protein:vir:17 67 MRDAREHYYVAIYREGANESGDLRIRVFDLKAGVERAVSFVGGEVEEYFPGDETDWEAIRSLTIGDYTFLSNPNVQPTTW 146 (680) T ss_pred ecCCCCeEEEEEEcCCCcccccceeEEEEccCCeEEEEEcCCCceEEEeecCCCCccceEEEEEcCEEEEECCeEEEecc Confidence 9999999873 11 244555565554444443 33333489999999999999999999998 Q ss_pred CCCCCCCCceEEEEEeeeccceEEEEeeCceEEEE------EeccCC-------------CcceecHHHHHHHHHhhhhc Q lcl|NC_020838. 324 TTTSAAVPNVAFVVIRIVAYNSDYSVTLNGTTVTH------STPDTV-------------AGATTDSGSIAAALTSSINA 384 (981) Q Consensus 324 ~~~~~~~~~~~~v~v~~g~y~~~~~v~~ng~~~~~------~t~~~~-------------a~~~~~~~~ia~~l~~~i~~ 384 (981) +. ..+++..|+++|++|+|+|+|.|.+||....- +..++. +...+..+.++..|..++.. T Consensus 147 ~~-~~~~~~~g~~~v~~~ayg~ty~v~ing~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~Ag~~t~~~~~~a~la~~l~~ 225 (680) T protein:vir:17 147 SR-SFSRRPEGLVTIGAAGYGTSYIVDFATEDSGQQRRWAVQEMQAPKTKRKKGDGSPDEAGETTVNNWNGTGLSFRVKV 225 (680) T ss_pred CC-CCCCCCeeEEEEEEeeeeeEEEEEEeccccceeeeeeeeeeeccccccccccccCCCCcceeeeeeeeeeeeeeeee Confidence 76 44567789999999999999999999854221 111111 12233455555555555432 Q ss_pred -cCceEEEEcCCEEEEEeCCCceEEEecCcCc--cee----EEEEEEEe--e--chhccccCCCCcEEEEEccCCCcccc Q lcl|NC_020838. 385 -LTGFSATQVGPGIYIEGTSAFSISTSGSTTE--EGI----FAFQDQIN--V--ASRLPNQCENGYRVRVTNSGDVTADD 453 (981) Q Consensus 385 -~~~~~~~~vg~~i~i~~~~~~~vt~~~g~~~--t~~----~~~~~~v~--~--~~~Lp~~~~~G~~v~v~~~g~~~~d~ 453 (981) ..+|........++...+....+.+.++... ..+ ....+.+. . ....+.....+..........+.... T Consensus 226 ~~~~~~~~~g~~~~~~y~~~~~l~~tg~~~~~~~~t~~v~~~G~~y~IsI~~~~~~~~~~~~~s~~~~t~~~~~a~~at~ 305 (680) T protein:vir:17 226 EARAFLVDDGEEYGHNYIPYVTLLTPGNNTSPFPDTIRVDVSGEGWDIKVTKQIQSKVYANLGTAQFTTPVDQSGGGAST 305 (680) T ss_pred ccceeeecCCCceEEEEeeEEEEecCCccccccCceEEEecccceeEEEEccceeeEeccCccceeeeeccCCcccceeH Confidence 2334443334444443332111111111110 000 00001110 0 00001100000000000000000000 Q ss_pred eEEEEE--cccCCc-------ccceEEEEeecccee--------------------------EEEccccceE----EEEe Q lcl|NC_020838. 454 IYVEFQ--TTNSAA-------RGPGVWEETIGPSLE--------------------------FEIDETTMPH----QLIR 494 (981) Q Consensus 454 yyv~~~--~~~~~~-------~g~~~W~E~a~~~~~--------------------------~~~~~~Tmp~----~lv~ 494 (981) .++--. ....+. .|..+..++..+... ..+ ++.+|+ .++. T Consensus 306 ~~Ia~~L~~~i~~~~~~~~~~~g~~i~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~v~~~~~L-p~~a~~g~~v~v~~ 384 (680) T protein:vir:17 306 SDIVTGLSAAINGLGTFTAESIGNVIRVRYSDPTRTDEFTMSARGGTSGTGLESIKYSVDTLAEL-PTKCWNDYQVAVRN 384 (680) T ss_pred HHHHHHHHHhhcccCcEEEEECCCEEEEEeccCCCceEEEeeccCCCCceeeeeeeeeecccccc-ccccCCCcEEEEEe Confidence 000000 000000 000111111110000 000 111221 1111 Q ss_pred ccCCc---e--eee----------cccCCcccCCCcccCcCccccCCceeEEEEEcceEEEecCCeEEEEecc-cc--cc Q lcl|NC_020838. 495 QANGV---F--KYE----------PVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLLSNEAVIMSRAG-DY--FN 556 (981) Q Consensus 495 ~a~g~---f--~~~----------~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~s~~~V~~Srtg-dy--~N 556 (981) ..... | ++. +..|.+....+. ...|.... | .=||+-.+..+.-+.+.+ .. .. T Consensus 385 ~~~~~~~~Yyv~~~~~~~~~~~~~~~~W~E~~~~~~----~~~~~~~t---m---p~~l~r~~~g~f~~~~~~~~~~~~~ 454 (680) T protein:vir:17 385 TQDTEVDDYYVKFETDVEDADVPGSGYWVETVKNGD----DGGLVDDT---M---PHVLVRNALGDFTFSSLNNSSYGKT 454 (680) T ss_pred CCCCcccceEEEEeccCcccCcccccceeecccCcc----cceeccCc---c---eEEEEEccCceeEEEeecccccccc Confidence 11110 0 111 012322111100 00111110 1 112222222221222211 00 11 Q ss_pred ccccccccccCC--ccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEE-E--cCCccccccceE-------EEEEeee Q lcl|NC_020838. 557 FFANSSQVVAPD--DPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVL-S--TDADILSPTTTK-------INTISTF 624 (981) Q Consensus 557 F~~~s~~~~~Dd--DpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l-~--g~~~~LTP~~~~-------i~~~S~~ 624 (981) |... .+-|| .|.--.+. +...|..+.-+...|.+.+ .|+++ + |+-.-|.|+++. |....+- T Consensus 455 ~~~r---~~Gdd~tnp~psF~~--~G~~p~~v~f~q~RL~f~s--~~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss 527 (680) T protein:vir:17 455 WADR---SVGSEDTNPHPTFTE--SGNGIYGMFMYKNRLGFLT--QDAVIMSQVGDYFNFYATSGVTISDADPIDMATSD 527 (680) T ss_pred cccc---ccCCcccCCCccccc--CCCCceEEEEEcceEEEee--CCeEEEEccCCcccccccccccCCCCccEEEEEcC Confidence 2211 12222 22211122 2234666666766675554 34444 3 221235555431 1111110 Q ss_pred ccccCCCc-EEeCCeEEEEecCCCeeEEEEEeecccccceehhhHHH--HHHHhcCCCeEEEEEcCCCcEEEEEecCC-c Q lcl|NC_020838. 625 ECDAEIDA-VAVGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEATT--NVPEYVPSDIDSMTASPAMSIVSLGKSGS-N 700 (981) Q Consensus 625 ~~s~~v~P-v~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~--~~~h~~~~~i~~~~~s~~~~~~~~~~~g~-~ 700 (981) .-.+.++= +...+.++--+.++. +..+...+......++. +...-....+.= -.-...++++...|. . T Consensus 528 ~~~~~i~~~v~~~~~L~l~t~g~q------~~ls~~~~~lTP~~~~i~~~s~~~~~~~~~P--v~vG~~v~Fv~~~g~~s 599 (680) T protein:vir:17 528 TKPVKLEAAISSTSGAILFGNQAQ------FRLSSPDESFGPKTATLDKISNYTYESKADP--VQTGVSMIFPTNMGTYS 599 (680) T ss_pred CcceeeeEEeecCCcEEEEecCeE------EEEecCCceecceeEEEEEEEeecccCCCCc--eEeCCeEEEeecCCCcc Confidence 11112221 222222222223222 22222111111111111 100000000000 000122344444443 2 Q ss_pred EEEEEEeecCcc----hheeeeEeeccCCceEEEEEe--CCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCccccc Q lcl|NC_020838. 701 TVYQHRFFMQGE----NRVQTWYKWQLTGDLRLQFFD--KTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVC 774 (981) Q Consensus 701 ~l~~y~y~~~~e----q~V~aWsrw~~~G~v~sv~~~--~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 774 (981) .+.-|.|-+..+ +++..=..|-+.|.+..++.. +..+++.+.++++..+..--+..+.++.. T Consensus 600 ~vre~~y~~~~d~y~a~DlT~~a~hl~~g~v~~~~~~~~~~~~~~~~~~~~~~l~~~~yl~~~~e~~v------------ 667 (680) T protein:vir:17 600 SVYELSTESAKGTPVIEDSSRVIPRLIPSGLTWSTASMNNDTVFFGNAKKGRNVYVFRFFNEGQERKV------------ 667 (680) T ss_pred eEEEEeeeeccCceehhhHHHHHHHhcCCceEEEEeeCCCCeEEEEEEcCCCEEEEEEEeeCCCceEE------------ Confidence 343332222222 234456778888887766553 45677777776654333211111111100 Q ss_pred ccceeeeeeeeeccCCceEEEccc-CCC Q lcl|NC_020838. 775 LDMFNVNPYRTYSTSTKKTTVNLP-FDH 801 (981) Q Consensus 775 lD~~~vd~~~ty~~~~~~tt~~l~-~~~ 801 (981) .+| ..+.++ -+| T Consensus 668 -~aW--------------~rw~~~~~d~ 680 (680) T protein:vir:17 668 -AGW--------------TTWYYEDQDH 680 (680) T ss_pred -EEE--------------EEEecCCCCC Confidence 011 111111 233 No 29 >protein:vir:94602 Length: 1012 # NCBI annotation: PfWMP4_35 # Family: family:all:12083 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762665;genbank:gi:115304373;genbank:GeneID:5142302 Probab=99.63 E-value=1.8e-13 Score=90.47 Aligned_cols=832 Identities=14% Similarity=0.131 Sum_probs=353.3 Q ss_pred CCceeecchhhhccccccchhhc------CCCh----------hh---hhhccccccccccccCchhhhhhhhc--CCCC Q lcl|NC_020838. 1 MSTISQRIPNLLLGVSQQPDKLK------FPGQ----------VK---QATNVFPDYALGLLKRPGGKFEAELY--NAEA 59 (981) Q Consensus 1 m~~v~~s~~~l~~GvSqQ~~~~r------~~gQ----------~~---~q~N~~sd~~~Gl~kRp~~~~~~~l~--~~~~ 59 (981) |- -||.+.+. |.|= ++ .-.||=-|.-.-..||.|++++.+-. +... T Consensus 1 mt-------------qQQ~~eiqG~~t~~F~GL~~s~S~~~IP~~~SP~~~N~DV~~~G~V~rR~GT~l~~~Y~inn~s~ 67 (1012) T protein:vir:94 1 MT-------------QQQATEIQGPFTREFSGLDISNSVGAIPVSGSPVFHNCDVSDDGAVVRRRGTALVNTYNINNASG 67 (1012) T ss_pred CC-------------ccccccccccccccccccccccccccccccCCCceEEeecccCcceeehhhhhhhhhhcccccCc Confidence 21 13333321 1110 00 12355444555678999999997643 2322 Q ss_pred CceEEEEEeCCCceEEEEEEcCCCcEEE---EEcCCCeEEEEeeccccc-ccccc---ce-eeccccceeEEEEeCc--- Q lcl|NC_020838. 60 RGRWFPILRDEEEKYVCQYDTTDGQFRI---WSLIDGQPRAVDMGTTAA-TGQPS---GC-NITNLKSDLDVYNTAQ--- 128 (981) Q Consensus 60 ~~~~~~~~rd~~e~y~~~~~~~~g~~~v---~d~~~g~~~~v~~~~~~~-~~~~~---y~-~~~~~~~~l~~~tv~d--- 128 (981) ....+.|-.--+-+|+++-. ..|-+.+ -|++=|..|.|...-.++ ...++ |+ --+++...+-++|=.. T Consensus 68 ~~~s~~irt~LG~eYfiLs~-~~GLL~~~~~~~~AVG~~K~~a~V~~ss~~~V~Pssm~F~~~S~~~~R~LILT~~~~~V 146 (1012) T protein:vir:94 68 RAWSDTIRTKLGSEYFILSN-DVGLLISLMRDDEAVGMPKEVAVVSKSSIWTVPPSSMCFIPVSAPYDRLLILTPEHPIV 146 (1012) T ss_pred ceeeeeehhhccceeEEEec-CCceEEEeeecccccccchhhhhhhhhhccccCCcceEEEeccCCCCcEEEEcCCCceE Confidence 32335665555657866543 3343332 344445444443111111 11222 11 1112222233333111 Q ss_pred -EEEEecC---cccccccceEEEEcC-Ccccccc----eeeEEEEEEeeeeeeeeeccCccccccCCcee----EEEeec Q lcl|NC_020838. 129 -DDTDTKL---NDLNSKQATYTKTND-GQTATKV----NLFDVDVTYKNGYYEESLKSGVLERIDNGQRI----VKDDGT 195 (981) Q Consensus 129 -~t~i~n~---~~~~~~~~~~~~~r~-~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~ 195 (981) ..|+--. .-+.+-..+++.-.+ -+|..+. .-++.+++.. ..+..|+..+..+.+++.-. +.-.++ T Consensus 147 Q~~F~E~T~s~T~~t~~~~~V~~~~a~~~~~~~~L~~~~N~sS~~~~~--~~~T~~AmT~~NP~~S~~ls~~~V~~qtyt 224 (1012) T protein:vir:94 147 QLSFLERTLSFTCTTNHGGGVFSFTAPISVNDTTLWRDTNASSYIVTD--AAGTVYAMTQKNPDFSFRLSGSFVVGQTYT 224 (1012) T ss_pred EEEEeeeeeeeeccCCccceEeecccceeecCeeEEecccccceeEee--ccceEEEEEeeCCceeEEEEEEEecCcccc Confidence 1111000 000111112211111 0111110 0011111111 12222333333333322211 110111 Q ss_pred c----cccee-cccccc------ccccccCCccccc-------------ceEeecceeEeeeeeeeEEeeCceeeEeeec Q lcl|NC_020838. 196 N----AGSIA-AGSAMP------SGYSLGNERTDDY-------------PWFKRDGYRVYEVEKEVAAAYNSTELSTANT 251 (981) Q Consensus 196 ~----~~~~~-~~~~~~------~~~~~~~~~~~~~-------------~~~~~~g~~~~~~~~~~~~a~~~~~~~~~~~ 251 (981) - -.|.+ +|+-.. ...+-+|.+-++. |++ |.+.+.+ =-++...-.+ T Consensus 225 ltirqi~W~WWAESm~~~G~~~~~~~SRFNV~~~DQ~V~IP~~L~tDiD~v~---~~~~~~~--------l~~~~ss~F~ 293 (1012) T protein:vir:94 225 LTIRQITWQWWAESMYYEGQDMMQNTSRFNVTSIDQNVKIPDRLITDIDPVY---KNSQGLG--------LFVFWSSRFD 293 (1012) T ss_pred eeehhhhhhhhhhhHhhhhhHHHhhhhhcccccccccccchhHHhhhhhhhh---hccCCcc--------EEEEEeeeec Confidence 1 11221 333322 2233333211111 111 1110100 0011111111 Q ss_pred cCCc-CcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCceeEeccCCCCCCC Q lcl|NC_020838. 252 NMGT-AQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKNKTTAMKTTTSAAV 330 (981) Q Consensus 252 ~~~~-~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~~~~~~~~~~~~~~ 330 (981) +..| |... ...+.+.|=|..+|....-+-+.. .|+.++. =|+|-+|. ..+.|.+ T Consensus 294 ~~~~~~~T~-----~P~~AD~YG~~~G~~~tpp~~~~~---A~L~~aP----FF~TFG~~-------------~s~TP~P 348 (1012) T protein:vir:94 294 SNGWAGPTT-----SPNTADEYGFSGGGRFTPPSLVPG---ATLQAAP----FFITFGGI-------------YSGTPTP 348 (1012) T ss_pred CceeecCCC-----CCCCcccccccCCceecccccccc---ceeeccc----eEEEeccc-------------cCCCCCC Confidence 1111 1000 001112222222222111111110 1332221 12222221 0111111 Q ss_pred C-ceEEEEEe-------eeccceEEEEeeCceEEEEEeccCCCcceecHHHHHHHHHhhhhccCceEEEE--cCCEEEEE Q lcl|NC_020838. 331 P-NVAFVVIR-------IVAYNSDYSVTLNGTTVTHSTPDTVAGATTDSGSIAAALTSSINALTGFSATQ--VGPGIYIE 400 (981) Q Consensus 331 ~-~~~~v~v~-------~g~y~~~~~v~~ng~~~~~~t~~~~a~~~~~~~~ia~~l~~~i~~~~~~~~~~--vg~~i~i~ 400 (981) . +.-++..| .|+-+....|+.+++..+.+-..- ++++. -.|..+. ..-.+.+. T Consensus 349 ~~~V~iLR~RELRFN~G~GA~~~~L~V~~D~~~~t~Nnvpf------spsnf-----------qt~atT~~~T~R~~~L~ 411 (1012) T protein:vir:94 349 INQVNILRLRELRFNGGTGAKPDDLQVYNDTVEHTWNNVPF------SPSNF-----------QTWATTYTATDRVITLM 411 (1012) T ss_pred hhheeeeeeeeeeeccCCCCCCcceEEEEcceeeecccccc------Ccccc-----------cceeeeeeecceeEEEe Confidence 1 11122222 344555666666666543331000 00000 0011111 11222222 Q ss_pred eCCCceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCcccceEEEEeecccee Q lcl|NC_020838. 401 GTSAFSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAARGPGVWEETIGPSLE 480 (981) Q Consensus 401 ~~~~~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~ 480 (981) ..+..-. .+..-.++. -+..+||+.+|=| |.....++ +. ++ ...+|.... | T Consensus 412 ~A~G~~~------~~A~Y~A~~---GATnnlpanaPL~----IS~~sA~s-------~~---~~--~R~v~~~~~-~--- 462 (1012) T protein:vir:94 412 SAVGDRF------NNANYFAIL---GATNNLPANAPLH----ISCLSASS-------YL---GG--SRRVWYRNL-P--- 462 (1012) T ss_pred eeccccc------cCcceEEEe---ecccccccCCccc----ccccccee-------ee---cc--ceeeeeecc-c--- Confidence 2211100 000011222 2456788888744 33332211 10 11 011443211 1 Q ss_pred EEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEecC----CeEEEEeccc--- Q lcl|NC_020838. 481 FEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLLSN----EAVIMSRAGD--- 553 (981) Q Consensus 481 ~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~s~----~~V~~Srtgd--- 553 (981) -+.+++...++|. -|--++ .++..| +.|.--+.||.||++.++ ..+.+|.+|| T Consensus 463 --T~~~~~~G~Y~r~-YGiG~~--~~Y~~~---------------~F~~I~TiY~~RLiL~~~s~~~~~~~~S~~GD~~~ 522 (1012) T protein:vir:94 463 --TTGGTLDGCYVRA-YGIGKY--VDYSKR---------------SFHAIGTIYRDRLILVNPSTATDQLLISEIGDATV 522 (1012) T ss_pred --cCCceEeeeEEEE-EEeeee--eecCCc---------------cccceeeeeeeeeEEeccCCCcceEEEeecCCccc Confidence 1222333333331 011111 122222 345566899999999985 4588898765 Q ss_pred ---cccccc-cccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEeeeccccC Q lcl|NC_020838. 554 ---YFNFFA-NSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTISTFECDAE 629 (981) Q Consensus 554 ---y~NF~~-~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S~~~~s~~ 629 (981) |+||+. +..+.-.|.||+++.+++.-.+.|.-++...+.|++||..+-|.+.|+ +.++|..-.++..|+|+.-+. T Consensus 523 ~G~~Y~F~QiTD~L~G~~tDPF~L~VtSe~~e~iT~~~~WQ~~LFV~T~~~T~~~~GG-e~~~~s~~~VN~vSt~G~~N~ 601 (1012) T protein:vir:94 523 PGEFYQFMQITDMLQGVTTDPFTLNVTSEGRERITAVTGWQKRLFVFTGSNTYSIEGG-EQFGESSYAVNLVSTYGAFNQ 601 (1012) T ss_pred CceeeeeeeeehhhccCcCCceeEEEcccccceeeeeeeeceeEEEEeccceEeeccc-cccchhHHHHHhHHhhcccCc Confidence 899996 566778899999999999888999999999999999999999999986 479999999999999976555 Q ss_pred CCcEEeCCeEEEEecCCCeeEEEEEeecccccceehhhHHHHHHHhcCC-------CeEEEEEcCCCcEEEE-Eec-C-- Q lcl|NC_020838. 630 IDAVAVGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEATTNVPEYVPS-------DIDSMTASPAMSIVSL-GKS-G-- 698 (981) Q Consensus 630 v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~~~~h~~~~-------~i~~~~~s~~~~~~~~-~~~-g-- 698 (981) -.-|+..-.|+|.++.| |+++....+.|+|.+.|-|+.+..+|.+ ....|.+..+.+.+.+ +.. + T Consensus 602 ~~VV~T~~~V~Ym~~~G----~F~L~~k~~~~~Y~A~ErSvKIR~~F~~~~~ss~~~~~Wl~~~e~~~~LYi~L~~~~dT 677 (1012) T protein:vir:94 602 NCVVVTNLTVLYMNKFG----LFDLMNKPNTDSYGAFERSVKIRGLFQNLAGSSGDNLHWLRYNESSNKLYIGLAAEGDT 677 (1012) T ss_pred ceEEEeeeEEEEeeccc----eeeccCCccCCcchhhhhhhhhhhhhhhhccccccceeeeeeccCCceEEEEecCCCcc Confidence 55688888999999865 8999999999999999999999999964 2334555555443332 222 2 Q ss_pred --CcEEEEEEeecCcchheeeeEeeccCCceEEEEE----eCCeEEEEEEcCCcEEEEEEEeecCCceeEEEe----cCC Q lcl|NC_020838. 699 --SNTVYQHRFFMQGENRVQTWYKWQLTGDLRLQFF----DKTTFYAVTSSGSNVYLTSYDLTQASESGYLTL----PTG 768 (981) Q Consensus 699 --~~~l~~y~y~~~~eq~V~aWsrw~~~G~v~sv~~----~~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~----~~~ 768 (981) ...+|.|.|+| .+|+..+..|.|.---. ..+...+.+..+..++|.+..+ .+|++. ..- T Consensus 678 ~~~S~~~~~N~~~------DSWs~~~s~~~Fq~YP~V~~~~~~t~L~~i~~~~TV~ML~~~~-----~~YiDFatirthi 746 (1012) T protein:vir:94 678 RTTSRNLMLNFTW------DSWSTLSSAAPFQMYPAVQLFKYMTWLTNINAPLTVAMLATEM-----PFYIDFATIRTHI 746 (1012) T ss_pred hhhhhhhhhhhhh------cchhhhhccCCcccchhhhhhhhhhhhhhhcCchhhhhhhhcc-----ceeeeeehhcccc Confidence 24577777665 58998888887653211 1233223333343333322111 011111 000 Q ss_pred Ccccccccce----eeeeee--------------------------eeccCC-----c------------eEEEcccCCC Q lcl|NC_020838. 769 EKTDVCLDMF----NVNPYR--------------------------TYSTST-----K------------KTTVNLPFDH 801 (981) Q Consensus 769 ~~~~~~lD~~----~vd~~~--------------------------ty~~~~-----~------------~tt~~l~~~~ 801 (981) .++..|...+ ..+..+ +|.-.+ . .+...|+ ++ T Consensus 747 ypF~~CaG~~~~~Vms~~~GIY~~~~P~tP~I~~~tit~ss~~~~k~Yq~~T~~~GT~tLt~~~~~~~~~~~l~LL~-~~ 825 (1012) T protein:vir:94 747 YPFTFCAGQRDVSVMSDSRGIYNLPLPVTPGILDYTITASSKAGAKTYQRNTASAGTETLTLRNPMMDYADTLELLG-GN 825 (1012) T ss_pred cceeeeccceeeEEEecCCceEEecccccceeeeeEeeccchhhhheeccccccccceeeeecChhhhcCcEEEEec-CC Confidence 0011110000 000111 111000 0 0111111 34 Q ss_pred cCCceEEEEEcCcccCceeEEEEecCceEEEccccccCCCEEEeCCCCC-CCEEEEEEeeeEEEEeCCceeeccCCCccc Q lcl|NC_020838. 802 ITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYR-GRDLIIGYVYDMELELPTLYPTQVEGRSSV 880 (981) Q Consensus 802 l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~-~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~ 880 (981) .+|++++++-.....++...-.+-..| +.+..-.+..|..++...-.+ ...+++|..|.+.++-+-+.+ |+ T Consensus 826 ~~~~~~a~V~~~~~~~~TT~~TV~~N~-~~~lQ~T~~~GS~L~~~~~LsqN~~~~~G~~Y~S~Y~SP~F~L----~S--- 897 (1012) T protein:vir:94 826 VNASQFAMVMSNGFEPYTTYPTVTYNG-VAPLQWTVTGGSGLNNRPILSQNNNCIMGMIYPSVYASPIFDL----ES--- 897 (1012) T ss_pred CCccEEEEEeecccccccccceEEecc-eeeeeEEEecCCccccccccccCceEEEeecchhhhcchhhhh----hh--- Confidence 445555554322222222111111111 111111111122222211111 357999999999998655443 11 Q ss_pred ceEeeeE-EEEEEEEEeecc--cceEEEEccCCCCc------eeeEecccccCccc--------cC-ccccccCCeEEEE Q lcl|NC_020838. 881 SDVTSDL-ILHRLKVSTGLS--GPITYKVDITGKDE------WTNIINVTLPNTYV--------LN-NVNLSASALHDVP 942 (981) Q Consensus 881 ~~~~grl-~l~r~~v~~~~S--g~~~v~v~~~~~~~------~~~~~~~~~~~~~~--------~~-~~p~~~~~~~~vp 942 (981) -+|| +|.++.|-+.-+ ..+...+....... |.-..+ ..+.+. .| .-.+..-....+| T Consensus 898 ---L~~LKr~K~~~L~~Dttvtsqlkynltsgfsqvsvlntawvavvs--nynenivpavvsyqvgnsyeirrvvelsip 972 (1012) T protein:vir:94 898 ---LGRLKRLKKLHLQMDTTVTSQLKYNLTSGFSQVSVLNTAWVAVVS--NYNENIVPAVVSYQVGNSYEIRRVVELSIP 972 (1012) T ss_pred ---hhhhhheeeeeEEeeeeeeeeeeeehhcccceeeeecceeeeeee--ccCccccceeeeeecCCceeeeEEEEEeec Confidence 1232 455555544332 22222222211110 110000 001111 00 1112112234578 Q ss_pred eeccCcceEEEEEECCCCCEEEEEEEEEEEe--ecccccc Q lcl|NC_020838. 943 IYQRNENVNIKIIGDTPFPISLLNIVWEGNY--NRRFYRR 980 (981) Q Consensus 943 ~~g~~~~~~v~I~~~~PlPltvlsi~weg~y--~~r~rRr 980 (981) +.|..-+.++.|.+-..-.+.+-+.+++.+= -+|+-|| T Consensus 973 lqgygcdyqfyiasvgaeafklaayefdiqpqrdkryvrr 1012 (1012) T protein:vir:94 973 LQGYGCDYQFYIASVGAEAFKLAAYEFDIQPQRDKRYVRR 1012 (1012) T ss_pred ccccccceeEeeeeccccceeeeeeeeccccchhhhhccC Confidence 8888888999999998888999888877653 3455566 No 30 >protein:vir:80177 Length: 1027 # NCBI annotation: tail tubular protein B # Family: family:all:12083 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285795;genbank:gi:148747829;genbank:GeneID:5220453 Probab=98.85 E-value=2.8e-08 Score=61.96 Aligned_cols=806 Identities=17% Similarity=0.163 Sum_probs=318.2 Q ss_pred cchhhhccccccchhhcC-----CC--------h--hh---hhhccccccccccccCchhhhhhhhcCCCCCceEEEEEe Q lcl|NC_020838. 7 RIPNLLLGVSQQPDKLKF-----PG--------Q--VK---QATNVFPDYALGLLKRPGGKFEAELYNAEARGRWFPILR 68 (981) Q Consensus 7 s~~~l~~GvSqQ~~~~r~-----~g--------Q--~~---~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~~~~~~~~~~r 68 (981) ...+ ..---||.+.+-. .| - ++ .-.||=-|.-.-..||.|++++.+-.+.+ ....+.|-. T Consensus 1 mvns-ferrtQQ~~dlG~~s~~F~GL~~t~S~~~IP~~~SP~~~N~DV~~~G~V~kR~GT~i~~~Y~~t~-~~~t~~vks 78 (1027) T protein:vir:80 1 MVNS-FERRTQQGDDLGIRSSNFGGLNTTASPLNIPYEDSPNLLNVDVDVSGNVSKRQGTEILLKYANTT-PVYTFPVKS 78 (1027) T ss_pred CCcc-hhhhhccccccccccccccccccccccccccccCCCceEEeecccCcceeehhhhhhhhhhccCC-ceeeeeehh Confidence 1111 1122345444310 00 0 00 12355444555678999999998765332 222355555 Q ss_pred CCCceEEEEEEcCCCcEEE---EEcCCCeEEEEeeccc-cccccccceeec-cccceeEEEEeCc----EEEEecC---c Q lcl|NC_020838. 69 DEEEKYVCQYDTTDGQFRI---WSLIDGQPRAVDMGTT-AATGQPSGCNIT-NLKSDLDVYNTAQ----DDTDTKL---N 136 (981) Q Consensus 69 d~~e~y~~~~~~~~g~~~v---~d~~~g~~~~v~~~~~-~~~~~~~y~~~~-~~~~~l~~~tv~d----~t~i~n~---~ 136 (981) --+-+|++ ....|-+.+ -|++=|..|.|...-. ++...++|+.-. ++...+-++|=.. .-|+--. . T Consensus 79 ~LG~dYvL--t~~~GLL~~~~~~~~AVG~~K~~s~V~~aa~~~V~P~F~~~S~~~~R~LILT~~~~~VQ~~F~E~T~t~T 156 (1027) T protein:vir:80 79 VLGYDYVL--TKSGGLLEVAGVIGKAVGAYKSFSNVFSAAAANVKPYFTLLSDVEPRVLILTGTNTPVQVKFVEQTFTTT 156 (1027) T ss_pred hccceeeE--ecCCceEEEeeecccccccchhhhhhhhhhhcccCceeEEccCCCCcEEEEcCCCceEEEEEeeeeeeee Confidence 55557844 333443333 3555555555542222 222345554432 3333344444221 1111000 0 Q ss_pred ccccccceEEEEcC-Ccccccc----eeeEEEEEEeeeeeeeeeccCccc-cccCCceeEEEeecccccee-ccccc--- Q lcl|NC_020838. 137 DLNSKQATYTKTND-GQTATKV----NLFDVDVTYKNGYYEESLKSGVLE-RIDNGQRIVKDDGTNAGSIA-AGSAM--- 206 (981) Q Consensus 137 ~~~~~~~~~~~~r~-~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~~--- 206 (981) -..|-..+++.-.+ -||..+. .-+...++... .+..|++.+.. +.+++.- .-+-.-..|.+ +|+.. T Consensus 157 ~~s~~~~~V~~~~s~~~~~~~~L~~~~N~tS~~~~~~--~~T~~AlT~~NlP~~S~~m--t~~~V~~~W~WWAESl~~~G 232 (1027) T protein:vir:80 157 SGSPTTTVVIPNASRFQYDTPILYMNRNFTSGATYSY--NSTTRALTISNLPSWSGSM--TFDLVLPVWSWWAESLRWFG 232 (1027) T ss_pred ccCCccceEeecccceeecCeeEEecccccceeEeec--cceEEEEEeccCCcceeEE--EEeEEecchhhhhhHHhhhh Confidence 01111112221111 1111110 00111111111 11222222222 1121111 11111112222 23222 Q ss_pred ---cccccccCCcccc-------------cceEeecceeEeeeeeeeEEeeCceeeE-eeecc---CCcCcCceEEEEEc Q lcl|NC_020838. 207 ---PSGYSLGNERTDD-------------YPWFKRDGYRVYEVEKEVAAAYNSTELS-TANTN---MGTAQTAYDNAVST 266 (981) Q Consensus 207 ---~~~~~~~~~~~~~-------------~~~~~~~g~~~~~~~~~~~~a~~~~~~~-~~~~~---~~~~~~~~~~~i~~ 266 (981) ....+-+|.+-++ .|++ |.+.+..- -++.. +..+. +.+++ | -|- T Consensus 233 ~~~~~~~SRFNV~~~DQ~V~IP~~L~sDlD~i~---~~~~~~~m--------~~~~ta~F~~~~~~~~T~~-P----~~A 296 (1027) T protein:vir:80 233 DRFYDAVSRFNVNKADQSVAIPAALRSDLDTIQ---GTYGRYPM--------LLYKTATFNDTYTFSNTGQ-P----ANA 296 (1027) T ss_pred hHHHhhhhhcccccccccccchhHHhhhhhhhh---hccCCccE--------EEEEeeeecCceeecCCCC-C----CCc Confidence 1223333321111 1111 11111100 01111 11111 11122 1 122 Q ss_pred ccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCceeEeccCCCCCCCCc-eEEEEEe------ Q lcl|NC_020838. 267 ESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKNKTTAMKTTTSAAVPN-VAFVVIR------ 339 (981) Q Consensus 267 ~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~~~~~~~~~~~~~~~~-~~~v~v~------ 339 (981) |+..| .+|..+.+.... |+.++. =|+|.+|. ..+.|.+.+ .-++..| T Consensus 297 D~YG~----~~G~~~~~~~~A-----~L~~sP----FF~TFG~~-------------~t~TP~P~~~V~lLR~RELRFN~ 350 (1027) T protein:vir:80 297 DSYGW----GDGSVYNVGASA-----YLNTSP----FFATFGDT-------------RTPTPQPPETVHLLRQRELRFNY 350 (1027) T ss_pred ccccc----cCCceEeecccc-----eeeccc----eEEEeccc-------------cCCCCCchhheeeeeeeeeeecc Confidence 22222 344444432222 433321 12232221 011121211 1122221 Q ss_pred -eeccceEEEEeeCceEEEEEeccCCCcceecHHHHHHHHHhhhhccCceEEEEc--CCEEEEEeCCCceEEEecCcCcc Q lcl|NC_020838. 340 -IVAYNSDYSVTLNGTTVTHSTPDTVAGATTDSGSIAAALTSSINALTGFSATQV--GPGIYIEGTSAFSISTSGSTTEE 416 (981) Q Consensus 340 -~g~y~~~~~v~~ng~~~~~~t~~~~a~~~~~~~~ia~~l~~~i~~~~~~~~~~v--g~~i~i~~~~~~~vt~~~g~~~t 416 (981) .|+-+....|+.+++..+.+ |..+.. .-.+.+..++..-. + T Consensus 351 G~GA~~~~L~V~~D~~~~s~N----------------------------~ssT~~~T~R~~~L~~A~G~~~--------~ 394 (1027) T protein:vir:80 351 GNGATGANLRVTVDGTALSAN----------------------------YSSTVAGTNRAYALYKADGTLC--------T 394 (1027) T ss_pred CCCCCCcceEEEEcceeeeee----------------------------eeeeeeecceeEEEeeeccccc--------c Confidence 23444455555555544322 110000 11111111111000 0 Q ss_pred eeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCcccceEEEEeeccceeEEEccccceEEEEecc Q lcl|NC_020838. 417 GIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAARGPGVWEETIGPSLEFEIDETTMPHQLIRQA 496 (981) Q Consensus 417 ~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a 496 (981) ...-..+...-...-|..--.-..+.+.+ .+.-|+-+.... -+.+++...+++. T Consensus 395 ~A~dlayY~A~~GATPL~IS~~aA~t~~~-----~~R~yi~~~~~~--------------------T~~~~~~G~Y~k~- 448 (1027) T protein:vir:80 395 SASDLAYYIAFTGATPLGISPTAAVTITN-----VDRTYIGSAATQ--------------------TDNAYVQGGYFKV- 448 (1027) T ss_pred ccccceeeeeeeccccccccccceeeeec-----Cceeeeeeeccc--------------------cCCceEeeeEEEE- Confidence 00000000000000011000000011111 011111111100 0111111112211 Q ss_pred CCceeeecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEecC----CeEEEEeccc------cccccc-cccccc Q lcl|NC_020838. 497 NGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLLSN----EAVIMSRAGD------YFNFFA-NSSQVV 565 (981) Q Consensus 497 ~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~s~----~~V~~Srtgd------y~NF~~-~s~~~~ 565 (981) +.+ .-|..-. .++.|.--+.||.||++.++ ..+.+|.+|| ++||+. +..+.- T Consensus 449 ---YGl--G~~~~Y~------------~~~F~~I~TvY~~RLvL~~~t~~~~~~~~S~~GD~~~~G~~Y~F~QvTD~L~G 511 (1027) T protein:vir:80 449 ---YGL--GLWANYG------------TGQFPRIATVYQSRLVLGGFTNDPTRVVFSATGDTVEGGVKYNFFQVTDDLDG 511 (1027) T ss_pred ---EEe--eeeeecC------------CccccceeeeeeeeeEEeccCCCcceEEEeecCCcccCceeeeeeeeehhhcc Confidence 000 0122111 12445566899999999984 4689998765 799996 566778 Q ss_pred cCCccEEEEEcCC-CceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEeeeccccCCCcEEeCCeEEEEec Q lcl|NC_020838. 566 APDDPIDLQATSV-KPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTISTFECDAEIDAVAVGTTQAFISK 644 (981) Q Consensus 566 ~DdDpI~~~i~s~-~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~ 644 (981) .|.||+++.++++ -.+.|.-++...+.|++||..+-|.+.|....++|..-.++..|+|+.-+.-.-|+....|+|.++ T Consensus 512 ~~sDPF~L~VsSsq~~d~vT~~~~WQ~~LFV~T~~~T~~~~GGd~t~~~a~~~VN~iSs~G~~N~~~VV~T~~~V~Yl~~ 591 (1027) T protein:vir:80 512 LDSDPFDLVVSSSQADDYVTGLVEWQSSLFVLTRRATFRANGGDATISPARRFVNYISSLGLVNPFSVVRTDTAVFYLSD 591 (1027) T ss_pred CcCCceeEEEecccccceeeeeeeeceeEEEEecceeEEeecCccccchhHHHHHHHHhhcccCcceEEEeeeEEEEeec Confidence 8999999999884 456678889999999999999999999875569999999999999976555566888899999998 Q ss_pred CCCeeEEEEEeecccccceehhhHHHHHHHhcCC-------CeEEEEEcCCCcEEEE-EecC-----CcEEEEEEeecCc Q lcl|NC_020838. 645 SNLYSKLFLMLNVQKEAAATIDEATTNVPEYVPS-------DIDSMTASPAMSIVSL-GKSG-----SNTVYQHRFFMQG 711 (981) Q Consensus 645 ~g~~s~vre~~y~~~~d~~~a~DlS~~~~h~~~~-------~i~~~~~s~~~~~~~~-~~~g-----~~~l~~y~y~~~~ 711 (981) .| |+++....+.|+|.+.|-|+.+.++|.+ ....|.+..+.+.+.+ +..+ ...+|.|.++| T Consensus 592 ~G----~F~L~~r~~~~~Y~A~EkSiKIR~~F~~~~~ta~~~~~Wm~~~q~~~~LYv~L~~~~eT~~~S~~~~~N~~~-- 665 (1027) T protein:vir:80 592 SG----VFNLTPRVEDGEYQAIEKSIKIRKVFGKTTSTAVSSAAWMSFDQNRKVLYVALPRGSETTVASALYVYNTFR-- 665 (1027) T ss_pred cc----eeeccCCccCCcchhhhhhhhhhhhhhhhccccccceeeeeeccCCceEEEEecCCCcchhhhhhhhhhhhh-- Confidence 65 8999999999999999999999999965 2334555555544333 2222 25577777765 Q ss_pred chheeeeEeeccCCceEEEE-------EeCCeEEEEEEcCCcEEEEEEEe------------------ecCCceeEE--- Q lcl|NC_020838. 712 ENRVQTWYKWQLTGDLRLQF-------FDKTTFYAVTSSGSNVYLTSYDL------------------TQASESGYL--- 763 (981) Q Consensus 712 eq~V~aWsrw~~~G~v~sv~-------~~~d~ly~vv~r~~~~~l~~~~~------------------~~~~~~~~~--- 763 (981) .+|+..++.|.|.--. +..+...+.+..+..++|.+..+ .....-... T Consensus 666 ----DSWt~~~t~~~Fk~YtghP~V~~~~~~s~L~~v~~~~TV~ML~~~~~~YvDFF~~CG~~~~~Vlt~~~GIY~~~~P 741 (1027) T protein:vir:80 666 ----DSWTQYDTLGGFKTYTGHPYVDTVLGDSFLLMVAYGGTVCMLKLYGSRYVDFFNKCGSFTGNVLTANSGIYTWTAP 741 (1027) T ss_pred ----cchhhhhcccCcccccCCchhhhhhhhhhhhhhcCchhhhhhhhhcchhhhhhhhcccceeeEEecCCceeEeecc Confidence 5888887777654220 11222222222222222211110 000000000 Q ss_pred -----------------------EecCCCc---ccccccc--eeeeeeeeecc---CCceEEEcccCCCcCCceEEEEE- Q lcl|NC_020838. 764 -----------------------TLPTGEK---TDVCLDM--FNVNPYRTYST---STKKTTVNLPFDHITGKKLAVVA- 811 (981) Q Consensus 764 -----------------------~~~~~~~---~~~~lD~--~~vd~~~ty~~---~~~~tt~~l~~~~l~g~~v~v~a- 811 (981) +++.... ++...|- +.--..+++.. +-..+...|+ ++.+|++++++- T Consensus 742 ~wnsP~I~~~svs~tt~~~~q~Ye~~T~~~vvpydnvedlsiyvnGT~Ls~~~~~~~~~~~i~LL~-~~~~~~~~s~Vpr 820 (1027) T protein:vir:80 742 FWNSPVISNISVSGTTTLAVQRYELPTDLQVVPYDNVEDLSIYVNGTRLSFGTDWVKQGKAIYLLS-DPGDGKTVSIVPR 820 (1027) T ss_pred cccCCeeeEEEeeccchhhhheeccccccccccccccccceeeecceeEeecCchhhcCCEEEEec-CCCCcceEEEEec Confidence 0000000 0000000 00000011100 0011111221 455566666651 Q ss_pred --------cCcccCceeEEEEecCceEEEc--cccccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccc Q lcl|NC_020838. 812 --------IGTYIGDTISATSESEGSVFYF--EDSDISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVS 881 (981) Q Consensus 812 --------dG~~~~~~~~~~~~~dG~~~~~--~~~tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~ 881 (981) .....++.. +...+-..+.+. .+.+..|..++...-.+...+++|..|.+.++-+-|.+ |+ T Consensus 821 cpvnvsy~~~~~~~~TT-~~TV~~N~~~~iQ~Tdy~~~GS~L~~~~~LtN~~~~~G~~Y~S~Y~SP~F~L----~S---- 891 (1027) T protein:vir:80 821 CPVNVSYQGDVTFDETT-AQTVWVNNLLQIQGTDYTLSGSTLTFTDTLTNAVVEVGNAYISYYQSPMFLL----GS---- 891 (1027) T ss_pred ccccccccccccccccc-cceEEecceeeeccceeeeccCccccccccccceEEEeecchhhhcchhhhh----hh---- Confidence 110111111 111111111111 11222333333332334567999999999998655543 21 Q ss_pred eEeeeE-EEEEEEEEeecccceEEEEccCCCCceeeEecccccCcc---ccCccccccCCeEEEEeeccC-cceEEEEEE Q lcl|NC_020838. 882 DVTSDL-ILHRLKVSTGLSGPITYKVDITGKDEWTNIINVTLPNTY---VLNNVNLSASALHDVPIYQRN-ENVNIKIIG 956 (981) Q Consensus 882 ~~~grl-~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~~~---~~~~~p~~~~~~~~vp~~g~~-~~~~v~I~~ 956 (981) -+|| +|.++.|-+.+.--+-+ +..+-...+.+ +.|.=.-....-..+.....+ .+.+.-|-+ T Consensus 892 --L~~LKk~K~~~L~~Dnedvlpv-----------ytigdlasgqdvddlvgkwktrananisvtydsentsetsydiys 958 (1027) T protein:vir:80 892 --LSNLKKVKHVYLYFDNEDVLPV-----------YTIGDLASGQDVDDLVGKWKTRANANISVTYDSENTSETSYDIYS 958 (1027) T ss_pred --hhhhhheeeeEEEEcCCcceee-----------eeeccccCCCchhHhhhhhcccccceeEEEecCcCcccceeeeee Confidence 1333 57777776655422211 11111001110 001000000001111111111 122222211 Q ss_pred CCCCCEEEEEEEEEEEeecc------ccccC Q lcl|NC_020838. 957 DTPFPISLLNIVWEGNYNRR------FYRRS 981 (981) Q Consensus 957 ~~PlPltvlsi~weg~y~~r------~rRr~ 981 (981) ...+.|+--|+.- .-|-| T Consensus 959 -------fsdlvwdnaffdvdptnlqstrya 982 (1027) T protein:vir:80 959 -------FSDLVWDNAFFDVDPTNLQSTRYA 982 (1027) T ss_pred -------hhhhhcccceecccccccchhhHH Confidence 2223455444321 11111 No 31 >protein:vir:2625 Length: 715 # NCBI annotation: gp27 # Family: family:all:5234 # MgeID: mge:55 # MgeName: SIO1 # Cross-refs: genbank:acc:NP_064766;genbank:gi:9964636;genbank:GeneID:1263056 Probab=98.30 E-value=1.6e-06 Score=52.43 Aligned_cols=639 Identities=13% Similarity=0.102 Sum_probs=265.5 Q ss_pred eeeeeeEEeeCceeeEeeeccCCc---CcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEc Q lcl|NC_020838. 232 EVEKEVAAAYNSTELSTANTNMGT---AQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTIN 308 (981) Q Consensus 232 ~~~~~~~~a~~~~~~~~~~~~~~~---~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~a 308 (981) |--+..++..| .|++-++...-. +++.-+--.| .+..++|.- + |. .-++.. T Consensus 1 m~~~~~~~~vN-tFv~GliTEas~ltfpqnasiDe~N------~~l~rdG~r-----------------~-RR-~g~~~E 54 (715) T protein:vir:26 1 MPQSLTQRTVN-TFIKGLITEASELTFPENASVDELN------CSLGRDGTR-----------------R-RR-KAVTLE 54 (715) T ss_pred CCcccchhHHh-hhhhheeeccccccCCccceeeeee------eeecCCCcc-----------------h-hh-ccceee Confidence 11111222222 234444433322 2222221111 011111110 0 00 000111 Q ss_pred CEEEEeCCceeEeccCCCCCCCCceEEEEEeeeccceEEEEeeCceEEE--EEeccCCCcce----ecHHHHHHHHHhhh Q lcl|NC_020838. 309 DYTFVLNKNKTTAMKTTTSAAVPNVAFVVIRIVAYNSDYSVTLNGTTVT--HSTPDTVAGAT----TDSGSIAAALTSSI 382 (981) Q Consensus 309 d~tfi~n~~~~~~~~~~~~~~~~~~~~v~v~~g~y~~~~~v~~ng~~~~--~~t~~~~a~~~----~~~~~ia~~l~~~i 382 (981) +.-|+++.-+|-.+.-+.-.|..-.| +-++...|---|...- -++.+...++. ++. .+--+..... T Consensus 55 ~~~vls~~~vp~galv~~~~W~na~G-------~v~~~~livqvg~~l~f~q~t~~pLs~~n~~~svdl-~~~~~~vn~S 126 (715) T protein:vir:26 55 DNHVLSDVVVPEGALVQTLDWYNVAG-------QVNLEFLVVQVNNILYFYEKSTDPLSANKYSGSVDL-NTHSASNNLS 126 (715) T ss_pred cceEEEEEeecCceeeeeechhhccc-------ccCcEEEEEEeccEEEEEeccCCccccCceeEEeee-cceecccccc Confidence 11233332222111111111211111 1111111100011100 00111111100 000 0000000000 Q ss_pred hccCceEEEEcCCEEEEEeCCC--ceEEEecCcC-cceeE-EEE---EEEeechhccccCCCCcEEEEEccCCCcccceE Q lcl|NC_020838. 383 NALTGFSATQVGPGIYIEGTSA--FSISTSGSTT-EEGIF-AFQ---DQINVASRLPNQCENGYRVRVTNSGDVTADDIY 455 (981) Q Consensus 383 ~~~~~~~~~~vg~~i~i~~~~~--~~vt~~~g~~-~t~~~-~~~---~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yy 455 (981) .++--..++.+.+.+.|..|.. +.+..+++.- -+... .++ ..+|-. ++ .. |-...-.++..+.+.-|= T Consensus 127 Psh~~v~v~~~~G~livanp~i~~~~~~~d~~t~s~t~~~ll~r~r~f~~qg~-d~--~~--g~~y~~~gt~~tn~~iyn 201 (715) T protein:vir:26 127 PSEERVQVTSLNGYLIVASPAINTFYLGFNTSTEAFTATSISFKERDFEWQGS-DV--DV--TSLYFGEGTSVSNQRIYD 201 (715) T ss_pred cceeEEEEEEeeeEEEEecCCccEEEEEecCCcceeEeeEEEEEeeeheeecc-cc--cc--ccccccCCcccCchhhee Confidence 1111244555677777776654 3345444321 11111 111 112211 11 00 000000111111111110 Q ss_pred EEEEcccCCcccceEEEEeeccceeEEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCcccc----------- Q lcl|NC_020838. 456 VEFQTTNSAARGPGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFI----------- 524 (981) Q Consensus 456 v~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~----------- 524 (981) + + ..+-+.+.++|- .-.|+.++.+.+.+.|+.--.++++.|. +-+|.+-..|... -|..-|+ T Consensus 202 l-y--N~gw~~p~gt~~-~N~~~~yiVypa~s~~~~S~kd~n~afs--k~ad~ei~tGt~~-~~~G~yi~D~~~~g~~~l 274 (715) T protein:vir:26 202 T-Y--NVGWVGPKGSAA-LNTYGSYIVYPALTHPWYSGKDANGAFN--KADWLEIYTGSSL-ASNGHYVLDVFNKARTGL 274 (715) T ss_pred c-c--cceeecceeEEE-EcCCCCceEecccccccCCCcccccccC--hhhcccccccccc-ccCceEEEeeeecCCccc Confidence 1 1 011112344553 2235555666677777666666655443 3334433333211 1111111 Q ss_pred -----CCceeEEEEEcceEEEec------CCeEEEEec----cc----cccccccccc--cccCCccEEEEEcCCCceeE Q lcl|NC_020838. 525 -----GKKINNMFFYRNRLGLLS------NEAVIMSRA----GD----YFNFFANSSQ--VVAPDDPIDLQATSVKPVTL 583 (981) Q Consensus 525 -----g~~ps~v~ffq~RL~f~s------~~~V~~Srt----gd----y~NF~~~s~~--~~~DdDpI~~~i~s~~~n~I 583 (981) .+.+.+++.|.+|.+|++ +..|.+||. .| |.+=+|++.. .+.|.|...+.+-+-. .| T Consensus 275 eeev~k~R~rsv~~yaGrV~yagiD~dkng~rilfSqLv~s~~di~nCyQd~DPTsee~~dLidTDGg~iri~gah--~i 352 (715) T protein:vir:26 275 TTEVETGRFRSVAAYAGRVFYAGIDSAKNGGKVYFSRLTERMSDVGNCYQVNDPTSEVLSDLLDTDGGVVRIPDAH--NI 352 (715) T ss_pred hhhhhcCCCcceeeecceEEEeecccccCCCeEEEehhhcchhhcccccccCCCchhhhhhhhhcCCCEEEecCCC--Cc Confidence 134677999999999994 457999975 33 4444454322 4778899999887754 36 Q ss_pred EEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEeeeccccCCCcEEeCCeEEEEecCCCeeEEEEEeecccccce Q lcl|NC_020838. 584 NYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTISTFECDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQKEAAA 663 (981) Q Consensus 584 ~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~ 663 (981) +-|+.|+..|+||...+-|+|.|.+..+|.+...+.++++.+|++.=.=+++|+.++|-+++| |..+.-++.-.-+ T Consensus 353 i~Lv~f~~sLlvf~~NGVWAi~G~d~g~tATdY~ltKIs~vg~sspnSvVvv~~~i~~WsdtG----Iyal~~Nd~fn~~ 428 (715) T protein:vir:26 353 RKLHVLGASLLVFAENGVWAVAGVDNVFRATEYAITRISDVGLSNENSFVVADGIPIWWGKTG----IYAVQQSENLNTP 428 (715) T ss_pred eeEEEecceEEEEEecceEEEeccCCceeeeeeEEEEeeeeccCCCccEEEecceEEEeeCCc----EEEEEeccccCcc Confidence 668999999999999999999876668999999999999999998877899999999999987 6666655555678 Q ss_pred ehhhHH-HHHHHhcCC---C-eEEE--EEcCCCcEEEEEecCCcEEEEEEeec--CcchheeeeEeeccCCceEEEEE-e Q lcl|NC_020838. 664 TIDEAT-TNVPEYVPS---D-IDSM--TASPAMSIVSLGKSGSNTVYQHRFFM--QGENRVQTWYKWQLTGDLRLQFF-D 733 (981) Q Consensus 664 ~a~DlS-~~~~h~~~~---~-i~~~--~~s~~~~~~~~~~~g~~~l~~y~y~~--~~eq~V~aWsrw~~~G~v~sv~~-~ 733 (981) .|+.|| ..+..|.+. . +... .+-..+..+.|.-.+..++.-|+|-. .-+-..+|+-+|..+..--+... + T Consensus 429 tAqNLTekTIq~~~~~I~~dk~knVtg~fd~~e~rVyW~yPn~dt~vdykyd~vLV~dLalgaFYp~~v~~~a~~~~~~i 508 (715) T protein:vir:26 429 TAQNLSLSTIQTLWNNISNAKKAQVTVEYDKINQRVFWFYPDNDESVDYKYNNILVMDLALQAFYPWRVEDEASSTSYII 508 (715) T ss_pred hhhccchHHHHHHHhhcchhhhcceEEEEEccCCEEEEEEcCCceeeceeecCeEEEEecccccccccccccccccceee Confidence 999999 778888643 1 2221 12333445555443332333333200 00223467777764432111111 0 Q ss_pred CCeE---EEEEEcCCcEEEEEEEeecCCceeEEEecCCCcccccccceeeeeeeeeccCCceEEEcccCCCcCCceEEEE Q lcl|NC_020838. 734 KTTF---YAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFNVNPYRTYSTSTKKTTVNLPFDHITGKKLAVV 810 (981) Q Consensus 734 ~d~l---y~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~vd~~~ty~~~~~~tt~~l~~~~l~g~~v~v~ 810 (981) +... |-+++-+..+ +-..+. +-+++.+. .-.+--+++++.+-.++ T Consensus 509 g~~~~~~~~~~~t~~~v-v~~~~v------------------------------~~~g~~~~-v~~~~r~~~~~~~~~~~ 556 (715) T protein:vir:26 509 GTSYYGGLGSTSTETQV-VNGADV------------------------------VVNGSDNV-VATLYRDYLEGDSEIKL 556 (715) T ss_pred eeeeeCCcccccchhhe-eccceE------------------------------EEeccceE-EEEeecccccccceEEE Confidence 1100 0011111110 000000 00011100 00011123444332222 Q ss_pred --EcCcccCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCC------ceeeccC------C Q lcl|NC_020838. 811 --AIGTYIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLIIGYVYDMELELPT------LYPTQVE------G 876 (981) Q Consensus 811 --adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~------~~i~~~~------g 876 (981) -+|.++.... +.... +.-+.++.+.-+ +-...|+.+..+++.-+ ++++..+ | T Consensus 557 ~~~~~~~~~~~f-~~~~~--------~~~~dw~s~d~~-----~~~~~gy~~~gd~~~~k~~pyvt~~~~~tedg~v~~~ 622 (715) T protein:vir:26 557 LVRDGTTGKMTF-ATFRG--------DTYLDWGSADYK-----SFAEAGYDFMGDITTFKNAPYVTTYMRVTEDGYVASG 622 (715) T ss_pred EEEcCCceeEEE-ecccC--------ceeeeccccchh-----hHHHhhhhhcccceeeecCceEEEEEEEecccceecc Confidence 2222211110 00000 000112221111 11223433333333221 2333222 1 Q ss_pred CcccceEeeeEEEEEEEEEeecccceEEEEccCCCCceeeEecccccCccccCcc-ccccCCeEEEEeeccCcceEEEEE Q lcl|NC_020838. 877 RSSVSDVTSDLILHRLKVSTGLSGPITYKVDITGKDEWTNIINVTLPNTYVLNNV-NLSASALHDVPIYQRNENVNIKII 955 (981) Q Consensus 877 ~~~~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~-p~~~~~~~~vp~~g~~~~~~v~I~ 955 (981) .+-.+.-.. -.+-.+.++..+++. ..++-+.. -..+....+..+.. -+.....-+--++|..+-.+++|. T Consensus 623 ~g~~p~n~s-Sclm~~sw~ws~s~s-------t~~eaYk~-~~~~~~~p~~~s~~~yp~~~VvTKsriRG~Gr~~~~rf~ 693 (715) T protein:vir:26 623 AGYEFINPS-SCLMSVSWNLSKSGS-------TPREIYKL-KDVPVVNPNDLSSINYPTDTVVTKSKVRGRGRSMKFRFE 693 (715) T ss_pred CCccccCCc-ceEEEEEeeeccCCC-------Chhhhhee-cceeeeCCCccccccCCcceeEeeeeeeccceEEEEEEE Confidence 222221111 112223333333322 11111111 11111111111111 111222335567899999999999 Q ss_pred ECCCCCEEEEEEEEEEEeeccc Q lcl|NC_020838. 956 GDTPFPISLLNIVWEGNYNRRF 977 (981) Q Consensus 956 ~~~PlPltvlsi~weg~y~~r~ 977 (981) +..--.|+|++.+.-|--|+.+ T Consensus 694 s~~gKdlhl~Gysilg~~~~~~ 715 (715) T protein:vir:26 694 SVAGKDFHLVGYEVIGAKNNSY 715 (715) T ss_pred ecCCcceEEEeEEEEecccCCC Confidence 9999999999999999988887 No 32 >protein:vir:95324 Length: 823 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512269;genbank:gi:89152436;genbank:GeneID:3952993 Probab=97.63 E-value=3.5e-05 Score=44.98 Aligned_cols=643 Identities=11% Similarity=0.035 Sum_probs=196.6 Q ss_pred cccccCCceeEEEeeccccceecccccccc---ccccCCcccccceEeecceeEeeeeeeeEEeeCceeeEeeeccCCcC Q lcl|NC_020838. 180 LERIDNGQRIVKDDGTNAGSIAAGSAMPSG---YSLGNERTDDYPWFKRDGYRVYEVEKEVAAAYNSTELSTANTNMGTA 256 (981) Q Consensus 180 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~a~~~~~~~~~~~~~~~~ 256 (981) +. ....+.+- ++.-+...+....+ ++..-..+-+......+|...|+++.+++++.+.- | T Consensus 1 m~-----i~~~q~sF-~~GElsP~l~gR~Dl~ry~~q~~~~~N~~~~~~GGl~rRpGt~fva~~~~~~-----------g 63 (823) T protein:vir:95 1 MA-----ISWIQPSF-AGGEIGPSLYGRIDMAKYQVALRKCDNFIVRQYGGVENRPGTRFVGAAKYPN-----------R 63 (823) T ss_pred Cc-----ceeechhc-cCceechheeccchHHHHHHHHhhhhCcEeeecCCceecCchhhhhhhcCCC-----------C Confidence 00 00000000 01111222332222 22222233344445568999999888887765432 4 Q ss_pred cCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCceeEeccCCCCCCCCceEEE Q lcl|NC_020838. 257 QTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKNKTTAMKTTTSAAVPNVAFV 336 (981) Q Consensus 257 ~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~~~~~~~~~~~~~~~~~~~v 336 (981) ...++|++ ++++|+|+....+..+.+...+. ++ ++.. T Consensus 64 ~~rLipf~-~s~~q~y~Lefg~~~irV~~~~g----~v-------------------v~~~------------------- 100 (823) T protein:vir:95 64 KCRLIPFQ-FSTVQTYALEFGHQYMRVIKDGA----LV-------------------LNSS------------------- 100 (823) T ss_pred CeeEEEEE-eCCCcEEEEEEcCCeEEEEeCCc----EE-------------------EecC------------------- Confidence 44566665 78888877554433333321110 11 0000 Q ss_pred EEeeeccceEEEEeeCceEEEEEeccCCCcceecHHHHHHHHHhhhhccCceEEEEcCCEEEEEeCCCceEEEec-CcCc Q lcl|NC_020838. 337 VIRIVAYNSDYSVTLNGTTVTHSTPDTVAGATTDSGSIAAALTSSINALTGFSATQVGPGIYIEGTSAFSISTSG-STTE 415 (981) Q Consensus 337 ~v~~g~y~~~~~v~~ng~~~~~~t~~~~a~~~~~~~~ia~~l~~~i~~~~~~~~~~vg~~i~i~~~~~~~vt~~~-g~~~ 415 (981) +..|.+ .+|+..+ .+..+...+++++++|.+++-+..+... +..+ T Consensus 101 -------~~~~ev---------~tPy~~~------------------~l~~Lr~~qsaD~~fivh~~~~p~~L~r~~~~~ 146 (823) T protein:vir:95 101 -------NVIYEI---------ATPYTEA------------------DLFRIKFTQSADVLTLVHPAYPPKELRRYAHDN 146 (823) T ss_pred -------CceeEE---------ecccccc------------------cccceeEEEeccEEEEEcCCccceEEEecCCCC Confidence 000001 1111110 1123445556666666666555444332 2223 Q ss_pred ceeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccC--CcccceEEEEeeccceeEE--EccccceEE Q lcl|NC_020838. 416 EGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNS--AARGPGVWEETIGPSLEFE--IDETTMPHQ 491 (981) Q Consensus 416 t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~--~~~g~~~W~E~a~~~~~~~--~~~~Tmp~~ 491 (981) +.+..+........+. .-...+.+.+.++. ..+.+.+..+.. ...|...|.+..++..+.. ....+.+.. T Consensus 147 w~l~~~~~~~gp~~~~----~~~~t~~v~~~~~~--~~~t~ta~~~~~~~d~vg~~~~l~~~~~~~~~~~~~~~~~~~~~ 220 (823) T protein:vir:95 147 WQLVDVVTKNGPFEDI----NIDESLTVYASAST--GTITLTASASIFGAEQVGKLFYLEQPAVDSVPVWETSKSTSIGD 220 (823) T ss_pred ceEEEEEEeccccccc----cccceeEEeccccC--ceeEEeecccccchhhccceEEEeccccceeeecceeeeecccc Confidence 3332222211111100 00011112222111 111121111100 0111222222222211110 000111111 Q ss_pred EEeccCCceeeecccCCcccCCCcc--cCcCcccc---CC--ceeEEEEEcceEEEecCC----eEEEEecc--cccccc Q lcl|NC_020838. 492 LIRQANGVFKYEPVTWDDRLVGDNT--TNPIPSFI---GK--KINNMFFYRNRLGLLSNE----AVIMSRAG--DYFNFF 558 (981) Q Consensus 492 lv~~a~g~f~~~~~~w~~r~~GDd~--tnp~psF~---g~--~ps~v~ffq~RL~f~s~~----~V~~Srtg--dy~NF~ 558 (981) +.+.+...+.. |..-..|+.. ..+.-.++ +. ....+. -|....+.. +..-+.++ +.--|. T Consensus 221 ~~~~~~~~~~~----~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~g~~~~t~v~~~~~~~~~~~~~ 293 (823) T protein:vir:95 221 IRRADSNYYRA----VTAGKTGTLRPSHTEGTSWDGWGGSGDDDTGIE---WEYLHSGFGIARITAVNGTTATAEVISYI 293 (823) T ss_pred eEEecccceee----eeccccceeecccCCcceEEeceecccccceeE---EEEEeCCcceEEEEeecceeeeceEeeee Confidence 11111111100 0000000000 00000000 00 000110 011111111 11111111 111122 Q ss_pred ccccccccCCccEEE-----EEcCCCceeEEEEeecCCcEEEEecC--cEEEEEcCC-c--cccccc-------eEEEEE Q lcl|NC_020838. 559 ANSSQVVAPDDPIDL-----QATSVKPVTLNYTLATSIGLLVFGPN--EQFVLSTDA-D--ILSPTT-------TKINTI 621 (981) Q Consensus 559 ~~s~~~~~DdDpI~~-----~i~s~~~n~I~~~v~~~~~L~l~T~g--~q~~l~g~~-~--~LTP~~-------~~i~~~ 621 (981) +.... ...+...+. .-....+ ..+.-+...|++.... -|+++-+.+ + -+.|++ +.+... T Consensus 294 ~~~~~-~~~~~t~~~~~~~~~~~~g~P---s~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~~~~DdD~I~~~~s 369 (823) T protein:vir:95 294 PSQVV-GEDNASYKWAKYAWNSVNGYP---GTVVYYQQRLYFAASTAFPQTIWASRTGDYKDFGKSNPTQDDDRIIYTYA 369 (823) T ss_pred ccccc-cCCcCCccccccccCcCCCCc---cEEEEEeceEEEEEcCCCCcEEEEeccCCccccccccCCCCCCcEEEEEc Confidence 22111 111111111 1111122 2334455556665543 355553322 1 233332 122221 Q ss_pred eeeccccCCCcEEeCCeEEEEecCCCeeEEEEEeeccc-ccc--eehhhHHHHHHHhcCCCeEEEEEcCCCcEEEEEecC Q lcl|NC_020838. 622 STFECDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQK-EAA--ATIDEATTNVPEYVPSDIDSMTASPAMSIVSLGKSG 698 (981) Q Consensus 622 S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~-~d~--~~a~DlS~~~~h~~~~~i~~~~~s~~~~~~~~~~~g 698 (981) +. -.+.++=++-.+.++....++.+ ..... .+. +..-.+..+ ..+ ++....--...+.+++..++ T Consensus 370 ~~--~~~~i~~~v~~~~Lli~t~~~e~------~l~~~~~~~lTP~~~~~~~~-s~~---g~~~~~Pv~vg~~~~Fv~~~ 437 (823) T protein:vir:95 370 GR--QVNEIRHLIDVGSLVALTSGGEY------VITGDQNKVLTPSSFAFSSQ-GSN---GSSNVPPIAVANIALFVQEK 437 (823) T ss_pred CC--cceEEEEEeecCcEEEEecCcEE------EEEcCCCcccceeeEEEEEe-ecc---ccccccceEeCCeEEEEecC Confidence 11 11123323334455555555532 22221 122 222222221 111 11111111111233333333 Q ss_pred CcEEEEEEeecCcch----heeeeEeeccCC-ceEEEEEeC--C-eEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCc Q lcl|NC_020838. 699 SNTVYQHRFFMQGEN----RVQTWYKWQLTG-DLRLQFFDK--T-TFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEK 770 (981) Q Consensus 699 ~~~l~~y~y~~~~eq----~V~aWsrw~~~G-~v~sv~~~~--d-~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~ 770 (981) ...+.-|.|-+..+. ++.-=+.|-+.| .+..+|... + .+|++ +.++.++...+...++-..+.--...+.. T Consensus 438 g~~vre~~~~~~~d~~~~~dlT~~a~hl~~~~~i~~~a~~~~p~~~~~~v-~~dG~l~~~ty~~~q~v~aW~~~~~~g~~ 516 (823) T protein:vir:95 438 GSVVRDLAYSFDVDGYQGNDLTILANHLFQKHSIVDWCFSIVPYSSAFCI-RDDGKLLVMTYLRDQQVFAWAPQSSTGKY 516 (823) T ss_pred CCEEEEEEEeeecCceecchhhhhhhhhcCCCceEEEEEecCCCeEEEEE-ecCCcEEEEEEecccceeeeEEEecCCcE Confidence 334444443322221 222233566655 344444432 2 23333 22333332222222211111000000000 Q ss_pred cccc------ccceeeeeeeeeccCCc----------------eEEEccc---------------------CCCcCCceE Q lcl|NC_020838. 771 TDVC------LDMFNVNPYRTYSTSTK----------------KTTVNLP---------------------FDHITGKKL 807 (981) Q Consensus 771 ~~~~------lD~~~vd~~~ty~~~~~----------------~tt~~l~---------------------~~~l~g~~v 807 (981) ..++ -|..++....+.++... ...++.+ +.|++|++| T Consensus 517 ~~~~~i~~~~~d~l~~~v~R~i~g~~~~yiE~~~~~~~~~~~~~~~lD~~~s~~g~~~~~~~~~l~~g~~~l~~l~g~~v 596 (823) T protein:vir:95 517 ESTCSISEGNEDAVYFVVNRTVNGQTVRYIERLSSRLFTSDEDAFFVDSGLSYDGRNTSDRTMTITGGSGEWDYLAEYTI 596 (823) T ss_pred EEEEEecCCCCCEEEEEEEeccCCeEEEEEEeeccccCCCccceeEEEEEEEeecCcccceeeEecCCCCcccccCceEE Confidence 0000 01111000000011000 0001111 233333333 Q ss_pred EEEEcCcccCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeE Q lcl|NC_020838. 808 AVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDL 887 (981) Q Consensus 808 ~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl 887 (981) .+ ++|... +.. +.+++|+|+ ++++.+++|++|++.++++++++....- ...... T Consensus 597 ~~----------------adg~~~--~~~-~v~g~i~l~--~~~~~~~vGl~~~~~i~~~~~~v~~~~a---~~~~~~-- 650 (823) T protein:vir:95 597 SV----------------SGGAYF--TSS-DVGAQLQFP--YTGADPDTGYEVSKELRCDIISVTSNTA---VVVRAN-- 650 (823) T ss_pred Ee----------------cCcceE--CCc-cceeEEEeC--cCCCccccccceEEEEEEeeceeeCCce---EEEccC-- Confidence 22 333332 222 337999996 6788999999999999999887643211 111111 Q ss_pred EEEEEEEEeecccceEEEEccCCCCceeeEecccccCccccCc---c-ccccCCeEEEEeeccCcceEEEE---EECCCC Q lcl|NC_020838. 888 ILHRLKVSTGLSGPITYKVDITGKDEWTNIINVTLPNTYVLNN---V-NLSASALHDVPIYQRNENVNIKI---IGDTPF 960 (981) Q Consensus 888 ~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~---~-p~~~~~~~~vp~~g~~~~~~v~I---~~~~Pl 960 (981) ++..+.++...+.........-. -.....++.- ..+++. + -.+.++....|.. .....+-| .--.|| T Consensus 651 r~v~a~l~~~~t~~~~~~~~~~~---gL~hleg~tv-~v~~dg~~~~~~~v~~G~vtl~~~--~~~v~vGl~~~~~~~~l 724 (823) T protein:vir:95 651 RNVPPSLRNVATTNWQMARRTFG---GLSHLEGQTV-NILSDANVEPQKVVSGGAVTLESP--GAVVHIGLPITAEFETL 724 (823) T ss_pred Ccccceeeeeeccccccccceee---eccccccceE-EEEEcCeeeCCeEecCCEEEecCC--CCEEEEeecceeeEEec Confidence 12222222222211111100000 0011111100 001111 1 1123444444432 23333322 112566 Q ss_pred CEEEEEEEEEEEeeccccccC Q lcl|NC_020838. 961 PISLLNIVWEGNYNRRFYRRS 981 (981) Q Consensus 961 Pltvlsi~weg~y~~r~rRr~ 981 (981) |+.+. +.|..--|.||=. T Consensus 725 ~~~~~---~~g~~~g~~~ri~ 742 (823) T protein:vir:95 725 DININ---GQETLLDKKQVIP 742 (823) T ss_pred chhcC---CCcccCCceeEEe Confidence 66543 3455544444422 No 33 >protein:vir:95475 Length: 771 # NCBI annotation: hypothetical protein ORF038 # Family: family:all:5234 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294631;genbank:gi:149408197;genbank:GeneID:5237042 Probab=97.41 E-value=7.4e-05 Score=43.23 Aligned_cols=671 Identities=12% Similarity=0.106 Sum_probs=275.7 Q ss_pred cccccCCceeEEEeeccccceeccccccccccccCCccc--ccceEeecceeEe------------eeeeeeEEeeCcee Q lcl|NC_020838. 180 LERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERTD--DYPWFKRDGYRVY------------EVEKEVAAAYNSTE 245 (981) Q Consensus 180 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~g~~~~------------~~~~~~~~a~~~~~ 245 (981) +-+ ......++.--..+|.+++-.+ -+-|..+| ++ .+.+||...| -.+..++-|.+..- T Consensus 1 m~~---~~~~~~vNtFv~GliTEas~lt---fpqnasiDe~N~-~l~rdG~r~RR~g~~~E~~~~~vls~~~vpa~g~~~ 73 (771) T protein:vir:95 1 MAK---TTNAAEFNTFVGGLITEASPLT---FPQNASIDEVNF-ILNRDGSRNRRNGMDFENGATKVVCNTLVPADGTIA 73 (771) T ss_pred CCc---ccchhHHhhhhhheeecccccc---CCccceeeeeee-eecCCCcchhhceeeeecCCceEEEEEEecccceEE Confidence 000 0000111222223444332211 11111222 22 2556666443 11222333333443 Q ss_pred eEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcc-eeeEEEEcCEEEEeCCceeEecc- Q lcl|NC_020838. 246 LSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPE-DIEILTINDYTFVLNKNKTTAMK- 323 (981) Q Consensus 246 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~-dl~~~t~ad~tfi~n~~~~~~~~- 323 (981) +..+.=...+|+.-.-++|..-..+-++|-+-+-.+ .-.++-....++- .|+ .|+...+.++..|.|+..-+--- T Consensus 74 v~~~~W~na~G~v~~~~livqvg~~l~f~q~t~~pL--s~~n~~~~a~~nl-SPsh~isv~v~~G~livanp~i~~~~~~ 150 (771) T protein:vir:95 74 VTSHNWENAGGEVGRWISLVQVGTELKFFQTTGETL--SEGNFYNYQFVNM-SPSHKLSYAVVDGLLVVANGSRDIYVFE 150 (771) T ss_pred eeeechhhcccccCcEEEEEEeccEEEEEecCCCcc--cccceeeeeccee-ccceeEEEEEeeeEEEEecCCccEEEEE Confidence 444444455566555555554334444443322100 0001100011111 233 48888889999999987655311 Q ss_pred -CCCC-CCCCceEEEEEeeeccceEEEEeeCceEEEEEeccCCCcceecHHHHHHHHHhhhhccCceEEEEcCCEEEEEe Q lcl|NC_020838. 324 -TTTS-AAVPNVAFVVIRIVAYNSDYSVTLNGTTVTHSTPDTVAGATTDSGSIAAALTSSINALTGFSATQVGPGIYIEG 401 (981) Q Consensus 324 -~~~~-~~~~~~~~v~v~~g~y~~~~~v~~ng~~~~~~t~~~~a~~~~~~~~ia~~l~~~i~~~~~~~~~~vg~~i~i~~ 401 (981) .+++ +....+-++..+ |+ ++..+||+....+.-+.......+++.| |..-.+| +.+ T Consensus 151 ~d~~t~s~t~~~ll~r~r---f~--~q~~~~G~d~~~~~~~~~~gt~~tn~~i-------------ynlyN~g----w~~ 208 (771) T protein:vir:95 151 YDSGSVSVTTKRLLVRDL---FG--VQDIVNGVDLRQGNDIATRPTVQTNAHI-------------YNLRNQT----FGV 208 (771) T ss_pred ecCCcceeEeeeeeeeeh---hh--ccccccccceecccccccCCcccCchhh-------------eeccccc----eec Confidence 1111 111112223332 21 2234456555444433333334444443 1111111 001 Q ss_pred CCCceEEEecCcCcc-eeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCcccceEEEEeecccee Q lcl|NC_020838. 402 TSAFSISTSGSTTEE-GIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAARGPGVWEETIGPSLE 480 (981) Q Consensus 402 ~~~~~vt~~~g~~~t-~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~ 480 (981) |-. + .+ .++ ....+++.-++.-..|. + +|.| .++...-.|+|...-++. T Consensus 209 pk~---~--~~-snt~~~~iV~~y~a~~g~~pS----~------------sd~~--------N~a~~k~~~~Ei~t~~~f 258 (771) T protein:vir:95 209 PRV---T--WH-SNEPSDPIVTFRSAASGKFPS----N------------SDSV--------NLALSKRADVEPSTTDRF 258 (771) T ss_pred ccc---c--cc-cCCccccceEeeeccCCCCcC----C------------ceee--------ccccchhhccceeeeccc Confidence 100 0 00 111 11122222221111111 1 1111 011111224443332211 Q ss_pred EEEccccceEEEEeccCCceeeecccCCc-ccCCC-cccCcCcccc-----------CCceeEEEEEcceEEEecC---- Q lcl|NC_020838. 481 FEIDETTMPHQLIRQANGVFKYEPVTWDD-RLVGD-NTTNPIPSFI-----------GKKINNMFFYRNRLGLLSN---- 543 (981) Q Consensus 481 ~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~-r~~GD-d~tnp~psF~-----------g~~ps~v~ffq~RL~f~s~---- 543 (981) -..+-.-.|....-.+.|-|......-.. +...- +.+.|.|+.. -+..+.|+=|-.|.|++++ T Consensus 259 ~~~~~~~~~~Gt~~~~~G~yi~da~~~g~~~Lt~~ve~~gr~~s~~~~~~~l~~~~t~~~~~~vaeyagRvwYag~~~~~ 338 (771) T protein:vir:95 259 RAEDIVLNPIGTYETARGFFIIDAMARGKSRLEEIVKLKQRYPSLSFGVSSLPQDETPGGASVVCEYAGRVWYAGFSGQI 338 (771) T ss_pred chhhhhhcccCcccccCcceeeehhhhcccccceeeeccccchhhhccccccccccCCCCceeEEeeeeeEEEecceeEE Confidence 11111111111111222333222211000 00001 1244555421 1345678999999998861 Q ss_pred -----------CeEEEEec----cc----cccccccccc--cccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEE Q lcl|NC_020838. 544 -----------EAVIMSRA----GD----YFNFFANSSQ--VVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQF 602 (981) Q Consensus 544 -----------~~V~~Srt----gd----y~NF~~~s~~--~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~ 602 (981) ..|.+||. .| |.+=+|++.. .+.|.|...+.+-+-. .|+-|+.|+..|+||...+-| T Consensus 339 iD~dkng~~~~~~ilfSqLv~s~~di~nCyQd~DPTsee~~dLidTDGg~iri~gah--~ii~Lv~f~~sLlvfc~NGVW 416 (771) T protein:vir:95 339 IDGDDQSPRLVSYILFSQLVDSPADIVNCYQDGDPTSTEEPELVDTDGGFIRIEGAH--DIINLVNVGSAVMVVAANGIW 416 (771) T ss_pred eeccccCCceeeeEeeehhhcchhhcccccccCCCchhhhhhhhhcCCCEEEecCCC--CceeEEEecceEEEEEecceE Confidence 13888864 33 4444454322 4778899999887754 366689999999999999999 Q ss_pred EEEcCC-ccccccceEEEEEeeeccccCCCcEEeCCeEEEEecCCCeeEEEEEeecccccceehhhHH-HHHHHhcCC-- Q lcl|NC_020838. 603 VLSTDA-DILSPTTTKINTISTFECDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEAT-TNVPEYVPS-- 678 (981) Q Consensus 603 ~l~g~~-~~LTP~~~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS-~~~~h~~~~-- 678 (981) +|.|.+ ..+|.++..+.++++.+|++.=.=+++|+.++|-+++| |..+.-++ -.-+.|+.|| ..+..|.+. T Consensus 417 Ai~ggsd~g~tAtdY~ltKIs~vg~sspnSvVvvg~~i~ywsdtg----Iyal~~Nd-fn~~tAqnLTekTIq~~~~~I~ 491 (771) T protein:vir:95 417 MIQGGSDYGFTATNYLVTKISEHGCSSPNSVVVVDNSFMYWGDDG----IYHLTRNQ-YGDYVANNLTEKTIQKYYEKIP 491 (771) T ss_pred EEEeccCCceeeeeeEEEEeeeeccCCCccEEEecceEEEeeCCc----eEEEeecc-cCcchhhccchHHHHHHHhhcc Confidence 997654 47999999999999999998877899999999999987 55555444 4568999999 778888643 Q ss_pred -C-eEEE--EEcCCCcEEEEEec----CC-cEE--EEEEeecCcchheeeeEee---c-cCCce----EEEEE------- Q lcl|NC_020838. 679 -D-IDSM--TASPAMSIVSLGKS----GS-NTV--YQHRFFMQGENRVQTWYKW---Q-LTGDL----RLQFF------- 732 (981) Q Consensus 679 -~-i~~~--~~s~~~~~~~~~~~----g~-~~l--~~y~y~~~~eq~V~aWsrw---~-~~G~v----~sv~~------- 732 (981) . +... .+-..+..+.|.-. +. ..+ |++ +-..+|+-+| + .+|.. .++.. T Consensus 492 ~dk~knVtg~fd~~e~rvyw~yPn~~D~~~e~~t~LV~------dLalgaFYp~~i~~~~ag~l~~~vg~~~~p~~~lv~ 565 (771) T protein:vir:95 492 SDAILNATGFYDSYDKKVKWLYNTVLDGRTEPVTELVF------DLALGAFYPSKIGSLTAGRLPIPVGSVKIPPYKLVE 565 (771) T ss_pred hhhhcceEEEEEccCCEEEEEecceecCCCcceeeeee------eecccccccccccccccCccceeeeeeecCcccccc Confidence 1 2221 12333444444332 22 222 444 2235788888 4 23332 11111 Q ss_pred eCCeEEE----EEEcCCcEEEEEEEeecC-CceeEE-EecCCCcccccccceeeeeeeeeccCCceEEEcccCCCcCCce Q lcl|NC_020838. 733 DKTTFYA----VTSSGSNVYLTSYDLTQA-SESGYL-TLPTGEKTDVCLDMFNVNPYRTYSTSTKKTTVNLPFDHITGKK 806 (981) Q Consensus 733 ~~d~ly~----vv~r~~~~~l~~~~~~~~-~~~~~~-~~~~~~~~~~~lD~~~vd~~~ty~~~~~~tt~~l~~~~l~g~~ 806 (981) .+.++-+ ++-.++.+.+.+.-.... .+..++ ....+- -+++ ....|.+.+ .+-|....|-. T Consensus 566 T~~eV~v~~~~v~~tG~~vtV~~~~r~~~~~~~~y~~~~~dg~------~g~~--~Fa~~~~~~-----f~DW~sv~~~~ 632 (771) T protein:vir:95 566 TGEEVTVASEQVTATGELVTVKVSTRSPVIRETKYIIVEKLSS------PMRI--SFGGYTDEE-----FVDWKSVDGIG 632 (771) T ss_pred ccceEEecceeeEecCCceEEEEEEeeccccceEEEEEEecCC------CeeE--EeccccCcc-----eeecccCCCcc Confidence 0111111 000111111111100000 000000 000000 0000 000000000 00011111100 Q ss_pred EEEEEcCcccCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccC-C-----Cccc Q lcl|NC_020838. 807 LAVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVE-G-----RSSV 880 (981) Q Consensus 807 v~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~-g-----~~~~ 880 (981) +. ++... +- .+..--..+|..+--++++ +++..+ | .+.- T Consensus 633 vd------------------------y~sy~-------~~-gY~~~gd~~~~k~~PYit~---y~~~tedg~v~~~~g~~ 677 (771) T protein:vir:95 633 VD------------------------APAYL-------LT-GYLAGGDYQREKFVPYITF---HFKKTEDGFVEDAEGDW 677 (771) T ss_pred cc------------------------hHHHH-------Hh-hhhccchheeeeccceEEE---EEEeecccceecccccc Confidence 00 00000 00 0000011222222112221 222211 1 1100 Q ss_pred ceEeeeEEEEEEEEEeecccceEEEEccCCCCceeeEecccccCccccCccccccCCe-EEEEeeccCcceEEEEEECCC Q lcl|NC_020838. 881 SDVTSDLILHRLKVSTGLSGPITYKVDITGKDEWTNIINVTLPNTYVLNNVNLSASAL-HDVPIYQRNENVNIKIIGDTP 959 (981) Q Consensus 881 ~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~-~~vp~~g~~~~~~v~I~~~~P 959 (981) .-..-.-.+-.+.++...++.- ..-++....+.+..+...++.....-+..+.+ -+--++|..+-.+++|.+..- T Consensus 678 ~p~n~sSclm~~sw~ws~s~~t----~k~~~~~eaYk~~~~~~p~~~~~~~yp~~~VV~TKsriRG~Gr~~~~rf~s~~g 753 (771) T protein:vir:95 678 TPTNQSSCMVQSQWSWTNSPAS----NKWGRTWQAYRFRRHFFPDNIDNQFDDGNSVVETKSRLRGSGKVLSLYITTEPK 753 (771) T ss_pred cccCCcceEEEEEeeeecCCCC----CccccchheeeecceeccCCcchhcCCccceeeeeheeeecceEEEEEEEecCC Confidence 0000011222333444444321 00011111122222222222221221222212 244678899999999999999 Q ss_pred CCEEEEEEEEEEEeeccc Q lcl|NC_020838. 960 FPISLLNIVWEGNYNRRF 977 (981) Q Consensus 960 lPltvlsi~weg~y~~r~ 977 (981) -.|+|++.+.--..|-.. T Consensus 754 Kdlhl~Gysil~~~~~~~ 771 (771) T protein:vir:95 754 KNLHIYGWSMLVDVNGTV 771 (771) T ss_pred cceEEEeEEEEEeecCcC Confidence 999999999888888777 No 34 >protein:vir:8837 Length: 513 # NCBI annotation: constituent protein # Family: family:all:4957 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775245;genbank:gi:27476043;genbank:GeneID:2700591 Probab=96.98 E-value=0.00023 Score=40.56 Aligned_cols=473 Identities=11% Similarity=0.059 Sum_probs=185.4 Q ss_pred eeccceEEEEeeCceEEEEEeccCCCcceecHHHHHHHHHhhhhccCceE---EEEcCCEEEEEeCCCceEEEecCcCcc Q lcl|NC_020838. 340 IVAYNSDYSVTLNGTTVTHSTPDTVAGATTDSGSIAAALTSSINALTGFS---ATQVGPGIYIEGTSAFSISTSGSTTEE 416 (981) Q Consensus 340 ~g~y~~~~~v~~ng~~~~~~t~~~~a~~~~~~~~ia~~l~~~i~~~~~~~---~~~vg~~i~i~~~~~~~vt~~~g~~~t 416 (981) .+-|-++... .+|+...+ +..-+.- ..|+ ........+...+..-++...=+..-. T Consensus 1 ~~~~~~~~~~-~~g~~~d~-------------------~p~~lp~-~a~s~~~N~~~~~~~~~~~~g~~pv~a~~~~~~~ 59 (513) T protein:vir:88 1 MALERQEVKN-PTGIVTDI-------------------APADLPL-DKWSFGNNVRFKNGKAQKALGHSPIFDTAQAPIL 59 (513) T ss_pred CCcCChhhcc-cccceecc-------------------ChhhcCC-CcceeeeeeeEecceeeecCccceeeecCCCCce Confidence 1222111111 11111110 0000000 0010 111111112222222222111011111 Q ss_pred eeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEccc------C--CcccceEEEEeeccceeEEEccccc Q lcl|NC_020838. 417 GIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTN------S--AARGPGVWEETIGPSLEFEIDETTM 488 (981) Q Consensus 417 ~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~------~--~~~g~~~W~E~a~~~~~~~~~~~Tm 488 (981) ++.++ .. +|..+.+..... .|..++... + .+..+..|.-+.=-+.+..-+..-- T Consensus 60 g~~~~------------~~-~g~~~~~~~~~~-----~~~~~~~~t~~dvs~~~~~~~~~~~w~~~~f~~~i~a~ng~~~ 121 (513) T protein:vir:88 60 DMFPF------------IR-NNIPYWLLCSEK-----RLYLADGTTIIDVSPGPYSASVTNRWSVGSFNGVIFANDGVNP 121 (513) T ss_pred eeeee------------ec-CCCeEEEEeece-----EEEEecCceeeeccccceeecccCceeeeeecCEEEEEcCCCc Confidence 11110 00 122111111110 111111100 0 0001112332221122222111111 Q ss_pred eEEEEeccCCceeeecccCCcccCCCccc-CcCccccCCceeEEEEEcceEEEec--------CCeEEEEecccc----c Q lcl|NC_020838. 489 PHQLIRQANGVFKYEPVTWDDRLVGDNTT-NPIPSFIGKKINNMFFYRNRLGLLS--------NEAVIMSRAGDY----F 555 (981) Q Consensus 489 p~~lv~~a~g~f~~~~~~w~~r~~GDd~t-np~psF~g~~ps~v~ffq~RL~f~s--------~~~V~~Srtgdy----~ 555 (981) |..+ .....+|. +.. +| | ...-..|.+|++||++++ |+.|+.|..+|. . T Consensus 122 ~q~~-~~~s~~f~-------------dl~g~p-~---~~~a~~i~v~~~flv~~~~t~~~~~~PnrV~wS~~~D~~~~P~ 183 (513) T protein:vir:88 122 PHHL-PPTESVFR-------------VLPNFP-A---NTTFRRLKSFKNFLIGLNVTSNSIEMPQMVWWSTSADAGGVPA 183 (513) T ss_pred ceEE-cCCCceee-------------eccCCC-c---ccceEEEEEEeeEEEEeecccCcCCCCceEEEecccCCccccc Confidence 1111 11111111 111 11 1 124566889999999975 567999999996 4 Q ss_pred cccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEeeec-cccCCCcEE Q lcl|NC_020838. 556 NFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTISTFE-CDAEIDAVA 634 (981) Q Consensus 556 NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S~~~-~s~~v~Pv~ 634 (981) .|..+. ...+.+=.++ .+....|...++....|+||++.+-|.++=.+ +|...+++....-. |.+.-.=+. T Consensus 184 ~W~~t~--~t~~a~~~~l---~d~~g~~v~g~~~g~~liif~e~~i~~m~y~g---~~~if~~~~i~~~~G~~~p~SI~~ 255 (513) T protein:vir:88 184 SWDPTD--PTKDAGQNTL---ADTNGAIVDGVKLRDSFIIYKEDSVYSMRYIG---GLYIFQFQQLFNDVGILGPNCAIE 255 (513) T ss_pred cccccc--ccCccccccc---CCCccceeeeeecccceEEEecccEEEEEecC---CCceEEEEeecccccccCCceeEE Confidence 343221 1122222222 33344566677888899999999999997212 24455666555323 333333388 Q ss_pred eCCeEEEEecCCCeeEEEEEeecccccceehhhHHHHHHHhcCC-----CeE--EEEEcCCCcE--EEEEec-C---C-- Q lcl|NC_020838. 635 VGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEATTNVPEYVPS-----DID--SMTASPAMSI--VSLGKS-G---S-- 699 (981) Q Consensus 635 vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~~~~h~~~~-----~i~--~~~~s~~~~~--~~~~~~-g---~-- 699 (981) +|+.++|+++.| ++.+ +...+++- ....+.+.|-. ... ..+.-+.... |+..+- + . T Consensus 256 ~~~~~ffls~~G----f~~~----~G~~~~~I-g~ekVdk~f~~~~n~~~~~~~~~~~d~~~~~v~~~y~s~~~~~~~~~ 326 (513) T protein:vir:88 256 FDGNHFVVGHGD----VYVH----NGVQKQSV-IDAQVRKFFFSDINPDNYQRTFVLADHVNTEMWVCYSSTRSEPGKHC 326 (513) T ss_pred ECCeEEEEeCCc----eEEe----cCceeeec-ccchhhhhhhccCCcccceEEEEEEcCcccEEEEEecCCCCCCCccc Confidence 999999999987 4322 11112111 01233343322 121 2223333322 222221 1 1 Q ss_pred cEEEEEEeecCcchheeeeEeeccCCceEEEEEeCCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCccccccccee Q lcl|NC_020838. 700 NTVYQHRFFMQGENRVQTWYKWQLTGDLRLQFFDKTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFN 779 (981) Q Consensus 700 ~~l~~y~y~~~~eq~V~aWsrw~~~G~v~sv~~~~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~ 779 (981) ..+++|.|. .+.|+.-+.+..+-.....-+.+.. +.+.... ..++...-.+. T Consensus 327 ~~~lVYd~~------~~~Ws~~~~p~~~~g~~g~~~~~~~----------~~~~~~~------------~~~d~~~~~~~ 378 (513) T protein:vir:88 327 DRAIIWNWK------ENTWSIRDLPNVLSGAYGIIDPKTS----------NLWDDDS------------NPWDTDTSVWG 378 (513) T ss_pred ceEEEEEcc------CCeEEEEeccchhhccccccccccc----------ceecccc------------cccccchhhhh Confidence 457788764 3567765555432111111110000 0000000 00000000000 Q ss_pred eeeeeeeccCCceEEEcccCCCcCCceEEEEEcCcccCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEEEEEEe Q lcl|NC_020838. 780 VNPYRTYSTSTKKTTVNLPFDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLIIGYV 859 (981) Q Consensus 780 vd~~~ty~~~~~~tt~~l~~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~VGl~ 859 (981) .+.+.+....+ ......+|.+..+... .-.-|-+ T Consensus 379 -------------------~~~~~~~~~sl-----------~~~~~~~~~~~~fd~~----------------~~f~G~~ 412 (513) T protein:vir:88 379 -------------------EGSYNPAKSSM-----------IFTSFQDAKLFLFGET----------------STFSGQS 412 (513) T ss_pred -------------------cccccccccee-----------EeeeccCCceeeeccc----------------ccccCCc Confidence 00001100000 0011223333332210 1134778 Q ss_pred eeEEEEeCCceeeccCCCcccceEeeeEEEEEEEEEeecccceEEEEccCCCCceeeEecccccCccccCccccccCCeE Q lcl|NC_020838. 860 YDMELELPTLYPTQVEGRSSVSDVTSDLILHRLKVSTGLSGPITYKVDITGKDEWTNIINVTLPNTYVLNNVNLSASALH 939 (981) Q Consensus 860 y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 939 (981) +++.++...+... ++... .+|+++...+..+|.+.+.+......... .... ...+....... T Consensus 413 lea~~~t~~~~~~--~~~~~-------~~i~~v~~~~t~~g~~t~~vg~~~~~~~~----~~~s-----~~~~~~~~~~~ 474 (513) T protein:vir:88 413 FTSTLERSDIYLG--DDRMM-------KTVSAVIPHITGNGVCNIWVGNAQVQGSG----IRWK-----GPYPYRIGQDY 474 (513) T ss_pred eEEEEEecCcccc--Cchhh-------eeeeeeeeeeecceEEEEEEeeeccCccc----cccc-----cceeeecccCc Confidence 9999988776542 22111 24555555566666666655332211110 0000 01111112234 Q ss_pred EEEeeccCcceEEEEEECCCCCEEEEEEEEEEEeeccccc Q lcl|NC_020838. 940 DVPIYQRNENVNIKIIGDTPFPISLLNIVWEGNYNRRFYR 979 (981) Q Consensus 940 ~vp~~g~~~~~~v~I~~~~PlPltvlsi~weg~y~~r~rR 979 (981) .++++...+..+++|+...--|+++.++++|..--. .|| T Consensus 475 ~~~~r~~gRy~~~ri~i~~~~~w~~~G~~ve~~~~~-g~R 513 (513) T protein:vir:88 475 KIDTKHVGRYIALKFDFASAGDWYFNGYTLEMAPKA-GMR 513 (513) T ss_pred eEEeccCCceEEEEEEccCCCceEEeeEEEEEecCC-CCC Confidence 577788888889999998999999999998876421 222 No 35 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=96.33 E-value=0.00074 Score=37.76 Aligned_cols=576 Identities=14% Similarity=0.099 Sum_probs=177.2 Q ss_pred CCceeecchhhhccc-----cccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCCCCceEEEEEeCCCceEE Q lcl|NC_020838. 1 MSTISQRIPNLLLGV-----SQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAEARGRWFPILRDEEEKYV 75 (981) Q Consensus 1 m~~v~~s~~~l~~Gv-----SqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~~~~~~~~~~rd~~e~y~ 75 (981) |+.+.-.-+||-+|. .-+.|..|+..=++++.||++-|-.|+.||||++|++++.......+...--.++...|+ T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~~~~~~ 80 (681) T protein:vir:10 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSVTQTMV 80 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCCCceEE Confidence 999999999999994 677888888888999999999999999999999999998766443333333344445677 Q ss_pred EEEEcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccc Q lcl|NC_020838. 76 CQYDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTAT 155 (981) Q Consensus 76 ~~~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~ 155 (981) +.| .++.+|||.- .|.... .+....--.+|.... + .+|++.-.+|++||+++.- .|++ + +|.+.... T Consensus 81 l~~--g~~~~r~~~~-~~~~~~---~~~~~~~~tpy~~~~-l-~~l~~~q~aD~~~i~h~~~-~p~~--L--~r~~~~~W 147 (681) T protein:vir:10 81 IEL--GAGYFRFHTN-GGTLLD---GAVPYEIANPYAEAD-L-FNIHYVQSADVLTLVHPNY-APRE--L--RRLGATNW 147 (681) T ss_pred EEE--eCCeEEEEeC-CcEEee---CcEeEEecCCCChhh-h-cCceEEEEcCEEEEECCCC-cceE--E--EEccCCce Confidence 777 3688999943 343321 111111112354433 2 5699999999999999732 2322 1 22222222 Q ss_pred cceeeE----------EEEEEeeee--eeeeeccCcc------ccccCCceeEEEeeccccceeccccccccccccCCcc Q lcl|NC_020838. 156 KVNLFD----------VDVTYKNGY--YEESLKSGVL------ERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERT 217 (981) Q Consensus 156 ~~~~~~----------~~~~~~~~~--~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (981) ++.... ...+...+. ++..|....- +........++... ...+.+.+..+....+.. T Consensus 148 ~l~~~~f~~~p~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~~~tvt~~~-----~~~~~~~t~~w~a~~g~~ 222 (681) T protein:vir:10 148 QLATIAFTSPVATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSSAGTCTNNL-----FTNGGANTIAWSASSGAS 222 (681) T ss_pred EEEEEEeccccccceeeeeeccCCccceeEeEEEEEeecccceeecCCcceEEeeee-----ecCCcceeEEEEecCCce Confidence 111100 000000000 0000000000 00000000000000 000111111111111110 Q ss_pred cccceEee-cceeEeeeeeeeEEeeCceeeEeee-----ccCCcCcCce--------E--EEEEcccccceEEe----cc Q lcl|NC_020838. 218 DDYPWFKR-DGYRVYEVEKEVAAAYNSTELSTAN-----TNMGTAQTAY--------D--NAVSTESTEKGDYD----SE 277 (981) Q Consensus 218 ~~~~~~~~-~g~~~~~~~~~~~~a~~~~~~~~~~-----~~~~~~~~~~--------~--~~i~~~~~~~~~~~----~~ 277 (981) .+-+..+ .|+..+.+.. .+ +.+...-. ....+...++ . ++=|| -.|. .. T Consensus 223 -~~~V~~~~~gi~g~ig~~-~~----~~~~~~~~~~~~~~t~~~~~~~~~~~~gyP~~v~f~q~R-----L~f~~~~~~p 291 (681) T protein:vir:10 223 -RYNVYKEQGGLYGYIGQT-TG----TSLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAVSYFEQR-----RCFAGTTNKP 291 (681) T ss_pred -eeeecccceeEEEEeecc-ce----eeeeecccccCccccccccccccccCCCceEEEEEEcce-----EEEeeCCCCC Confidence 0000110 1111110000 00 00000000 0011111111 0 00011 0011 00 Q ss_pred eEEEEeeeeccc-------------ceeecccCCcceeeEEE-EcCEEEEeCC-ceeEeccCCCCCCCCceEEEEEeeec Q lcl|NC_020838. 278 VTACNIGSSNIP-------------ASAYLKDAAPEDIEILT-INDYTFVLNK-NKTTAMKTTTSAAVPNVAFVVIRIVA 342 (981) Q Consensus 278 g~~~~~~~~~~~-------------~~~y~~~~~~~dl~~~t-~ad~tfi~n~-~~~~~~~~~~~~~~~~~~~v~v~~g~ 342 (981) -+++..+..++. ....+.......|+.+. ..+.++.... +.... .....+-.+.++.+.. ... T Consensus 292 ~~v~~Srsgdy~nF~~~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~lli~t~~~e~~l~-~~~~~~lTP~~~~~~~-~s~ 369 (681) T protein:vir:10 292 QNIWMTRSGTESAMSYSLPVRDDDRVAFRVAAREANAIRHIVPLTELLLLTSSGEWRVA-SVNSDAVTPTTISVRP-QSY 369 (681) T ss_pred cEEEEEcccCcccccccCCCCCCccEEEEEcCCcceeEEEEEecCcEEEEEcCcEEEEe-cCCCccccceeEEEEE-eee Confidence 122222222211 11122222223343332 2332222221 11111 1111111111111111 111 Q ss_pred cceE-EE-EeeCceEEEEEeccC-----------CCcce-ecHHHHHHHHHhhhhccCceEEEEcCCEEEEEeCCCceEE Q lcl|NC_020838. 343 YNSD-YS-VTLNGTTVTHSTPDT-----------VAGAT-TDSGSIAAALTSSINALTGFSATQVGPGIYIEGTSAFSIS 408 (981) Q Consensus 343 y~~~-~~-v~~ng~~~~~~t~~~-----------~a~~~-~~~~~ia~~l~~~i~~~~~~~~~~vg~~i~i~~~~~~~vt 408 (981) |+.. .. +.+ |..+-|-.+.+ .+... .+....|..|...+ ..-.+..++.+..+.+ +. T Consensus 370 ~g~~~~~Pv~v-g~~v~fv~~~g~~vre~~y~~~~d~~~~~dlt~~a~Hl~~~~-~i~~~a~~~~p~~~~~-------~v 440 (681) T protein:vir:10 370 VGATDVQPVVV-NNTTIYGAARGGHVRELAYNWQANGFVTGDLSLRAAHLFDNL-DILDMAYAKAPQPIVW-------FI 440 (681) T ss_pred eccccccceee-CCeEEEEecCCCEEEEEEEeeecCceeccchhhhhhhhcCCC-CeEEEEEecCCCEEEE-------EE Confidence 2210 00 001 11111111111 11100 11111222222111 1112333333332221 11 Q ss_pred EecCcCcceeEEEEE----EEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCcccceEEEEeeccceeEEEc Q lcl|NC_020838. 409 TSGSTTEEGIFAFQD----QINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAARGPGVWEETIGPSLEFEID 484 (981) Q Consensus 409 ~~~g~~~t~~~~~~~----~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~~~~~ 484 (981) .+|| + +..+.+ +|.+-.+ +..+|....+..-....+|..|+...-...+ ......|.-.+.... T Consensus 441 ~~dg---~-l~~~ty~~eq~v~aW~~---~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g--~~~~yie~~~~~~~~--- 508 (681) T protein:vir:10 441 SSSG---K-LLGLTYVPEQQIGAWHQ---HDTDGVFESCAVVAEGNEDRLYAVVRRTIGG--NEVRYVERMASRQFD--- 508 (681) T ss_pred ecCC---c-EEEEEEecccceeeEEE---EecCCcEEEEEEecCCCCcEEEEEEEecCCC--CeEEEEEecCCcccc--- Confidence 1222 1 112222 1111111 1124554444333333456666655433222 112233332221100 Q ss_pred cccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCCceeE---EEEEcceEEEecCCeEEEEecccc------c Q lcl|NC_020838. 485 ETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINN---MFFYRNRLGLLSNEAVIMSRAGDY------F 555 (981) Q Consensus 485 ~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~---v~ffq~RL~f~s~~~V~~Srtgdy------~ 555 (981) ...++-|....+.+.... ..+.+. ++-..|++... =.++..+.+. +..|.+.+.+.- + T Consensus 509 ---------~~~~~~~vD~~~t~~~~~-~~~~sg-l~~leG~tv~i~aDG~~~~~~~V~--~G~itl~~~~~~v~VGl~Y 575 (681) T protein:vir:10 509 ---------AQADAFFVDSGLTYSGEP-VSHISG-LEHLEGKTVSILADGAVHPQRVVT--DGAIDLDVEAGTVHIGLPI 575 (681) T ss_pred ---------ccccceEeeccccccCcc-eeeecc-ccCCCCcEEEEEeCCeecCcEeec--CcEEEeCcCCceEEEeeec Confidence 001111111111111100 000000 00011111110 0112222211 122333333211 1 Q ss_pred cccccccccccCCccEEEEEc-----C--CCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEeeecccc Q lcl|NC_020838. 556 NFFANSSQVVAPDDPIDLQAT-----S--VKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTISTFECDA 628 (981) Q Consensus 556 NF~~~s~~~~~DdDpI~~~i~-----s--~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S~~~~s~ 628 (981) .+... -=||++... . .+++.+..-+--+.++.+....+ .|.+ +. .+.+..-+ T Consensus 576 ~s~i~-------~lp~~~~~~~g~~~g~~~ri~rv~lr~~~S~g~~~~~~~~---------~l~~--~~-~~~~~~~g-- 634 (681) T protein:vir:10 576 TAELQ-------TLPVAMQLDGSFGQGRVKNINKLWLRVHRSSGIFAGPHAD---------ALTE--VK-QRTSEPYG-- 634 (681) T ss_pred eeEEE-------ecceeeecCCcccCCceEEEEEEEEEEEcccceEEeeCCC---------ceEE--EE-Eecccccc-- Confidence 11111 111222111 1 11222222222222232322111 1111 00 00010001 Q ss_pred CCCcEEeCCeEEEE-------------ecCCCeeEEEEEeecccccc Q lcl|NC_020838. 629 EIDAVAVGTTQAFI-------------SKSNLYSKLFLMLNVQKEAA 662 (981) Q Consensus 629 ~v~Pv~vG~~v~Fv-------------~~~g~~s~vre~~y~~~~d~ 662 (981) ...|+..|+.=+-+ |+..--..|..+.+.-+..+ T Consensus 635 ~~~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~vgg 681 (681) T protein:vir:10 635 SPPALKSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIAIGA 681 (681) T ss_pred ccCCccCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEEeeC Confidence 11222222211111 11111112222222222222 No 36 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=96.33 E-value=0.00074 Score=37.76 Aligned_cols=576 Identities=14% Similarity=0.099 Sum_probs=177.2 Q ss_pred CCceeecchhhhccc-----cccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCCCCceEEEEEeCCCceEE Q lcl|NC_020838. 1 MSTISQRIPNLLLGV-----SQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAEARGRWFPILRDEEEKYV 75 (981) Q Consensus 1 m~~v~~s~~~l~~Gv-----SqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~~~~~~~~~~rd~~e~y~ 75 (981) |+.+.-.-+||-+|. .-+.|..|+..=++++.||++-|-.|+.||||++|++++.......+...--.++...|+ T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~~~~~~ 80 (681) T protein:vir:98 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSVTQTMV 80 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCCCceEE Confidence 999999999999994 677888888888999999999999999999999999998766443333333344445677 Q ss_pred EEEEcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccc Q lcl|NC_020838. 76 CQYDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTAT 155 (981) Q Consensus 76 ~~~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~ 155 (981) +.| .++.+|||.- .|.... .+....--.+|.... + .+|++.-.+|++||+++.- .|++ + +|.+.... T Consensus 81 l~~--g~~~~r~~~~-~~~~~~---~~~~~~~~tpy~~~~-l-~~l~~~q~aD~~~i~h~~~-~p~~--L--~r~~~~~W 147 (681) T protein:vir:98 81 IEL--GAGYFRFHTN-GGTLLD---GAVPYEIANPYAEAD-L-FNIHYVQSADVLTLVHPNY-APRE--L--RRLGATNW 147 (681) T ss_pred EEE--eCCeEEEEeC-CcEEee---CcEeEEecCCCChhh-h-cCceEEEEcCEEEEECCCC-cceE--E--EEccCCce Confidence 777 3688999943 343321 111111112354433 2 5699999999999999732 2322 1 22222222 Q ss_pred cceeeE----------EEEEEeeee--eeeeeccCcc------ccccCCceeEEEeeccccceeccccccccccccCCcc Q lcl|NC_020838. 156 KVNLFD----------VDVTYKNGY--YEESLKSGVL------ERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERT 217 (981) Q Consensus 156 ~~~~~~----------~~~~~~~~~--~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (981) ++.... ...+...+. ++..|....- +........++... ...+.+.+..+....+.. T Consensus 148 ~l~~~~f~~~p~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~~~tvt~~~-----~~~~~~~t~~w~a~~g~~ 222 (681) T protein:vir:98 148 QLATIAFTSPVATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSSAGTCTNNL-----FTNGGANTIAWSASSGAS 222 (681) T ss_pred EEEEEEeccccccceeeeeeccCCccceeEeEEEEEeecccceeecCCcceEEeeee-----ecCCcceeEEEEecCCce Confidence 111100 000000000 0000000000 00000000000000 000111111111111110 Q ss_pred cccceEee-cceeEeeeeeeeEEeeCceeeEeee-----ccCCcCcCce--------E--EEEEcccccceEEe----cc Q lcl|NC_020838. 218 DDYPWFKR-DGYRVYEVEKEVAAAYNSTELSTAN-----TNMGTAQTAY--------D--NAVSTESTEKGDYD----SE 277 (981) Q Consensus 218 ~~~~~~~~-~g~~~~~~~~~~~~a~~~~~~~~~~-----~~~~~~~~~~--------~--~~i~~~~~~~~~~~----~~ 277 (981) .+-+..+ .|+..+.+.. .+ +.+...-. ....+...++ . ++=|| -.|. .. T Consensus 223 -~~~V~~~~~gi~g~ig~~-~~----~~~~~~~~~~~~~~t~~~~~~~~~~~~gyP~~v~f~q~R-----L~f~~~~~~p 291 (681) T protein:vir:98 223 -RYNVYKEQGGLYGYIGQT-TG----TSLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAVSYFEQR-----RCFAGTTNKP 291 (681) T ss_pred -eeeecccceeEEEEeecc-ce----eeeeecccccCccccccccccccccCCCceEEEEEEcce-----EEEeeCCCCC Confidence 0000110 1111110000 00 00000000 0011111111 0 00011 0011 00 Q ss_pred eEEEEeeeeccc-------------ceeecccCCcceeeEEE-EcCEEEEeCC-ceeEeccCCCCCCCCceEEEEEeeec Q lcl|NC_020838. 278 VTACNIGSSNIP-------------ASAYLKDAAPEDIEILT-INDYTFVLNK-NKTTAMKTTTSAAVPNVAFVVIRIVA 342 (981) Q Consensus 278 g~~~~~~~~~~~-------------~~~y~~~~~~~dl~~~t-~ad~tfi~n~-~~~~~~~~~~~~~~~~~~~v~v~~g~ 342 (981) -+++..+..++. ....+.......|+.+. ..+.++.... +.... .....+-.+.++.+.. ... T Consensus 292 ~~v~~Srsgdy~nF~~~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~lli~t~~~e~~l~-~~~~~~lTP~~~~~~~-~s~ 369 (681) T protein:vir:98 292 QNIWMTRSGTESAMSYSLPVRDDDRVAFRVAAREANAIRHIVPLTELLLLTSSGEWRVA-SVNSDAVTPTTISVRP-QSY 369 (681) T ss_pred cEEEEEcccCcccccccCCCCCCccEEEEEcCCcceeEEEEEecCcEEEEEcCcEEEEe-cCCCccccceeEEEEE-eee Confidence 122222222211 11122222223343332 2332222221 11111 1111111111111111 111 Q ss_pred cceE-EE-EeeCceEEEEEeccC-----------CCcce-ecHHHHHHHHHhhhhccCceEEEEcCCEEEEEeCCCceEE Q lcl|NC_020838. 343 YNSD-YS-VTLNGTTVTHSTPDT-----------VAGAT-TDSGSIAAALTSSINALTGFSATQVGPGIYIEGTSAFSIS 408 (981) Q Consensus 343 y~~~-~~-v~~ng~~~~~~t~~~-----------~a~~~-~~~~~ia~~l~~~i~~~~~~~~~~vg~~i~i~~~~~~~vt 408 (981) |+.. .. +.+ |..+-|-.+.+ .+... .+....|..|...+ ..-.+..++.+..+.+ +. T Consensus 370 ~g~~~~~Pv~v-g~~v~fv~~~g~~vre~~y~~~~d~~~~~dlt~~a~Hl~~~~-~i~~~a~~~~p~~~~~-------~v 440 (681) T protein:vir:98 370 VGATDVQPVVV-NNTTIYGAARGGHVRELAYNWQANGFVTGDLSLRAAHLFDNL-DILDMAYAKAPQPIVW-------FI 440 (681) T ss_pred eccccccceee-CCeEEEEecCCCEEEEEEEeeecCceeccchhhhhhhhcCCC-CeEEEEEecCCCEEEE-------EE Confidence 2210 00 001 11111111111 11100 11111222222111 1112333333332221 11 Q ss_pred EecCcCcceeEEEEE----EEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCcccceEEEEeeccceeEEEc Q lcl|NC_020838. 409 TSGSTTEEGIFAFQD----QINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAARGPGVWEETIGPSLEFEID 484 (981) Q Consensus 409 ~~~g~~~t~~~~~~~----~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~~~~~ 484 (981) .+|| + +..+.+ +|.+-.+ +..+|....+..-....+|..|+...-...+ ......|.-.+.... T Consensus 441 ~~dg---~-l~~~ty~~eq~v~aW~~---~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g--~~~~yie~~~~~~~~--- 508 (681) T protein:vir:98 441 SSSG---K-LLGLTYVPEQQIGAWHQ---HDTDGVFESCAVVAEGNEDRLYAVVRRTIGG--NEVRYVERMASRQFD--- 508 (681) T ss_pred ecCC---c-EEEEEEecccceeeEEE---EecCCcEEEEEEecCCCCcEEEEEEEecCCC--CeEEEEEecCCcccc--- Confidence 1222 1 112222 1111111 1124554444333333456666655433222 112233332221100 Q ss_pred cccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCCceeE---EEEEcceEEEecCCeEEEEecccc------c Q lcl|NC_020838. 485 ETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINN---MFFYRNRLGLLSNEAVIMSRAGDY------F 555 (981) Q Consensus 485 ~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~---v~ffq~RL~f~s~~~V~~Srtgdy------~ 555 (981) ...++-|....+.+.... ..+.+. ++-..|++... =.++..+.+. +..|.+.+.+.- + T Consensus 509 ---------~~~~~~~vD~~~t~~~~~-~~~~sg-l~~leG~tv~i~aDG~~~~~~~V~--~G~itl~~~~~~v~VGl~Y 575 (681) T protein:vir:98 509 ---------AQADAFFVDSGLTYSGEP-VSHISG-LEHLEGKTVSILADGAVHPQRVVT--DGAIDLDVEAGTVHIGLPI 575 (681) T ss_pred ---------ccccceEeeccccccCcc-eeeecc-ccCCCCcEEEEEeCCeecCcEeec--CcEEEeCcCCceEEEeeec Confidence 001111111111111100 000000 00011111110 0112222211 122333333211 1 Q ss_pred cccccccccccCCccEEEEEc-----C--CCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEeeecccc Q lcl|NC_020838. 556 NFFANSSQVVAPDDPIDLQAT-----S--VKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTISTFECDA 628 (981) Q Consensus 556 NF~~~s~~~~~DdDpI~~~i~-----s--~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S~~~~s~ 628 (981) .+... -=||++... . .+++.+..-+--+.++.+....+ .|.+ +. .+.+..-+ T Consensus 576 ~s~i~-------~lp~~~~~~~g~~~g~~~ri~rv~lr~~~S~g~~~~~~~~---------~l~~--~~-~~~~~~~g-- 634 (681) T protein:vir:98 576 TAELQ-------TLPVAMQLDGSFGQGRVKNINKLWLRVHRSSGIFAGPHAD---------ALTE--VK-QRTSEPYG-- 634 (681) T ss_pred eeEEE-------ecceeeecCCcccCCceEEEEEEEEEEEcccceEEeeCCC---------ceEE--EE-Eecccccc-- Confidence 11111 111222111 1 11222222222222232322111 1111 00 00010001 Q ss_pred CCCcEEeCCeEEEE-------------ecCCCeeEEEEEeecccccc Q lcl|NC_020838. 629 EIDAVAVGTTQAFI-------------SKSNLYSKLFLMLNVQKEAA 662 (981) Q Consensus 629 ~v~Pv~vG~~v~Fv-------------~~~g~~s~vre~~y~~~~d~ 662 (981) ...|+..|+.=+-+ |+..--..|..+.+.-+..+ T Consensus 635 ~~~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~vgg 681 (681) T protein:vir:98 635 SPPALKSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIAIGA 681 (681) T ss_pred ccCCccCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEEeeC Confidence 11222222211111 11111112222222222222 No 37 >protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878 Probab=96.33 E-value=0.00074 Score=37.76 Aligned_cols=576 Identities=14% Similarity=0.099 Sum_probs=177.2 Q ss_pred CCceeecchhhhccc-----cccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCCCCceEEEEEeCCCceEE Q lcl|NC_020838. 1 MSTISQRIPNLLLGV-----SQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAEARGRWFPILRDEEEKYV 75 (981) Q Consensus 1 m~~v~~s~~~l~~Gv-----SqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~~~~~~~~~~rd~~e~y~ 75 (981) |+.+.-.-+||-+|. .-+.|..|+..=++++.||++-|-.|+.||||++|++++.......+...--.++...|+ T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~g~~~~~~~~~~~~~~rlipf~~~~~~~~~ 80 (681) T protein:vir:10 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRAGFAFVREVKDSAKKVRLIPFTYSVTQTMV 80 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecChhHhhhhcCCCCCcEEEEEEEeCCCceEE Confidence 999999999999994 677888888888999999999999999999999999998766443333333344445677 Q ss_pred EEEEcCCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCcccccccceEEEEcCCcccc Q lcl|NC_020838. 76 CQYDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDLNSKQATYTKTNDGQTAT 155 (981) Q Consensus 76 ~~~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~~~~~~~~~~~r~~~~~~ 155 (981) +.| .++.+|||.- .|.... .+....--.+|.... + .+|++.-.+|++||+++.- .|++ + +|.+.... T Consensus 81 l~~--g~~~~r~~~~-~~~~~~---~~~~~~~~tpy~~~~-l-~~l~~~q~aD~~~i~h~~~-~p~~--L--~r~~~~~W 147 (681) T protein:vir:10 81 IEL--GAGYFRFHTN-GGTLLD---GAVPYEIANPYAEAD-L-FNIHYVQSADVLTLVHPNY-APRE--L--RRLGATNW 147 (681) T ss_pred EEE--eCCeEEEEeC-CcEEee---CcEeEEecCCCChhh-h-cCceEEEEcCEEEEECCCC-cceE--E--EEccCCce Confidence 777 3688999943 343321 111111112354433 2 5699999999999999732 2322 1 22222222 Q ss_pred cceeeE----------EEEEEeeee--eeeeeccCcc------ccccCCceeEEEeeccccceeccccccccccccCCcc Q lcl|NC_020838. 156 KVNLFD----------VDVTYKNGY--YEESLKSGVL------ERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSLGNERT 217 (981) Q Consensus 156 ~~~~~~----------~~~~~~~~~--~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (981) ++.... ...+...+. ++..|....- +........++... ...+.+.+..+....+.. T Consensus 148 ~l~~~~f~~~p~~p~~~~at~~~~~~~~t~~~~v~avda~t~~~s~~~~~~tvt~~~-----~~~~~~~t~~w~a~~g~~ 222 (681) T protein:vir:10 148 QLATIAFTSPVATPTSVTATSNNKGTDYTYRYVVTALDAEGKTESAPSSAGTCTNNL-----FTNGGANTIAWSASSGAS 222 (681) T ss_pred EEEEEEeccccccceeeeeeccCCccceeEeEEEEEeecccceeecCCcceEEeeee-----ecCCcceeEEEEecCCce Confidence 111100 000000000 0000000000 00000000000000 000111111111111110 Q ss_pred cccceEee-cceeEeeeeeeeEEeeCceeeEeee-----ccCCcCcCce--------E--EEEEcccccceEEe----cc Q lcl|NC_020838. 218 DDYPWFKR-DGYRVYEVEKEVAAAYNSTELSTAN-----TNMGTAQTAY--------D--NAVSTESTEKGDYD----SE 277 (981) Q Consensus 218 ~~~~~~~~-~g~~~~~~~~~~~~a~~~~~~~~~~-----~~~~~~~~~~--------~--~~i~~~~~~~~~~~----~~ 277 (981) .+-+..+ .|+..+.+.. .+ +.+...-. ....+...++ . ++=|| -.|. .. T Consensus 223 -~~~V~~~~~gi~g~ig~~-~~----~~~~~~~~~~~~~~t~~~~~~~~~~~~gyP~~v~f~q~R-----L~f~~~~~~p 291 (681) T protein:vir:10 223 -RYNVYKEQGGLYGYIGQT-TG----TSLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAVSYFEQR-----RCFAGTTNKP 291 (681) T ss_pred -eeeecccceeEEEEeecc-ce----eeeeecccccCccccccccccccccCCCceEEEEEEcce-----EEEeeCCCCC Confidence 0000110 1111110000 00 00000000 0011111111 0 00011 0011 00 Q ss_pred eEEEEeeeeccc-------------ceeecccCCcceeeEEE-EcCEEEEeCC-ceeEeccCCCCCCCCceEEEEEeeec Q lcl|NC_020838. 278 VTACNIGSSNIP-------------ASAYLKDAAPEDIEILT-INDYTFVLNK-NKTTAMKTTTSAAVPNVAFVVIRIVA 342 (981) Q Consensus 278 g~~~~~~~~~~~-------------~~~y~~~~~~~dl~~~t-~ad~tfi~n~-~~~~~~~~~~~~~~~~~~~v~v~~g~ 342 (981) -+++..+..++. ....+.......|+.+. ..+.++.... +.... .....+-.+.++.+.. ... T Consensus 292 ~~v~~Srsgdy~nF~~~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~lli~t~~~e~~l~-~~~~~~lTP~~~~~~~-~s~ 369 (681) T protein:vir:10 292 QNIWMTRSGTESAMSYSLPVRDDDRVAFRVAAREANAIRHIVPLTELLLLTSSGEWRVA-SVNSDAVTPTTISVRP-QSY 369 (681) T ss_pred cEEEEEcccCcccccccCCCCCCccEEEEEcCCcceeEEEEEecCcEEEEEcCcEEEEe-cCCCccccceeEEEEE-eee Confidence 122222222211 11122222223343332 2332222221 11111 1111111111111111 111 Q ss_pred cceE-EE-EeeCceEEEEEeccC-----------CCcce-ecHHHHHHHHHhhhhccCceEEEEcCCEEEEEeCCCceEE Q lcl|NC_020838. 343 YNSD-YS-VTLNGTTVTHSTPDT-----------VAGAT-TDSGSIAAALTSSINALTGFSATQVGPGIYIEGTSAFSIS 408 (981) Q Consensus 343 y~~~-~~-v~~ng~~~~~~t~~~-----------~a~~~-~~~~~ia~~l~~~i~~~~~~~~~~vg~~i~i~~~~~~~vt 408 (981) |+.. .. +.+ |..+-|-.+.+ .+... .+....|..|...+ ..-.+..++.+..+.+ +. T Consensus 370 ~g~~~~~Pv~v-g~~v~fv~~~g~~vre~~y~~~~d~~~~~dlt~~a~Hl~~~~-~i~~~a~~~~p~~~~~-------~v 440 (681) T protein:vir:10 370 VGATDVQPVVV-NNTTIYGAARGGHVRELAYNWQANGFVTGDLSLRAAHLFDNL-DILDMAYAKAPQPIVW-------FI 440 (681) T ss_pred eccccccceee-CCeEEEEecCCCEEEEEEEeeecCceeccchhhhhhhhcCCC-CeEEEEEecCCCEEEE-------EE Confidence 2210 00 001 11111111111 11100 11111222222111 1112333333332221 11 Q ss_pred EecCcCcceeEEEEE----EEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCcccceEEEEeeccceeEEEc Q lcl|NC_020838. 409 TSGSTTEEGIFAFQD----QINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAARGPGVWEETIGPSLEFEID 484 (981) Q Consensus 409 ~~~g~~~t~~~~~~~----~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~~~~~ 484 (981) .+|| + +..+.+ +|.+-.+ +..+|....+..-....+|..|+...-...+ ......|.-.+.... T Consensus 441 ~~dg---~-l~~~ty~~eq~v~aW~~---~~~~g~v~~v~~i~~~~~d~l~~vv~r~~~g--~~~~yie~~~~~~~~--- 508 (681) T protein:vir:10 441 SSSG---K-LLGLTYVPEQQIGAWHQ---HDTDGVFESCAVVAEGNEDRLYAVVRRTIGG--NEVRYVERMASRQFD--- 508 (681) T ss_pred ecCC---c-EEEEEEecccceeeEEE---EecCCcEEEEEEecCCCCcEEEEEEEecCCC--CeEEEEEecCCcccc--- Confidence 1222 1 112222 1111111 1124554444333333456666655433222 112233332221100 Q ss_pred cccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCCceeE---EEEEcceEEEecCCeEEEEecccc------c Q lcl|NC_020838. 485 ETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINN---MFFYRNRLGLLSNEAVIMSRAGDY------F 555 (981) Q Consensus 485 ~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~---v~ffq~RL~f~s~~~V~~Srtgdy------~ 555 (981) ...++-|....+.+.... ..+.+. ++-..|++... =.++..+.+. +..|.+.+.+.- + T Consensus 509 ---------~~~~~~~vD~~~t~~~~~-~~~~sg-l~~leG~tv~i~aDG~~~~~~~V~--~G~itl~~~~~~v~VGl~Y 575 (681) T protein:vir:10 509 ---------AQADAFFVDSGLTYSGEP-VSHISG-LEHLEGKTVSILADGAVHPQRVVT--DGAIDLDVEAGTVHIGLPI 575 (681) T ss_pred ---------ccccceEeeccccccCcc-eeeecc-ccCCCCcEEEEEeCCeecCcEeec--CcEEEeCcCCceEEEeeec Confidence 001111111111111100 000000 00011111110 0112222211 122333333211 1 Q ss_pred cccccccccccCCccEEEEEc-----C--CCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEeeecccc Q lcl|NC_020838. 556 NFFANSSQVVAPDDPIDLQAT-----S--VKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTISTFECDA 628 (981) Q Consensus 556 NF~~~s~~~~~DdDpI~~~i~-----s--~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S~~~~s~ 628 (981) .+... -=||++... . .+++.+..-+--+.++.+....+ .|.+ +. .+.+..-+ T Consensus 576 ~s~i~-------~lp~~~~~~~g~~~g~~~ri~rv~lr~~~S~g~~~~~~~~---------~l~~--~~-~~~~~~~g-- 634 (681) T protein:vir:10 576 TAELQ-------TLPVAMQLDGSFGQGRVKNINKLWLRVHRSSGIFAGPHAD---------ALTE--VK-QRTSEPYG-- 634 (681) T ss_pred eeEEE-------ecceeeecCCcccCCceEEEEEEEEEEEcccceEEeeCCC---------ceEE--EE-Eecccccc-- Confidence 11111 111222111 1 11222222222222232322111 1111 00 00010001 Q ss_pred CCCcEEeCCeEEEE-------------ecCCCeeEEEEEeecccccc Q lcl|NC_020838. 629 EIDAVAVGTTQAFI-------------SKSNLYSKLFLMLNVQKEAA 662 (981) Q Consensus 629 ~v~Pv~vG~~v~Fv-------------~~~g~~s~vre~~y~~~~d~ 662 (981) ...|+..|+.=+-+ |+..--..|..+.+.-+..+ T Consensus 635 ~~~~l~TG~~~v~v~~~~~~~~~v~I~qd~PlP~tvlsi~~ev~vgg 681 (681) T protein:vir:10 635 SPPALKSEEIPLVLSPKWGDSGQLFVRQADPLPLMIVSMSAEIAIGA 681 (681) T ss_pred ccCCccCCeEEEEeCCCcCcceEEEEEECCCcCEEEEEeeEEEEeeC Confidence 11222222211111 11111112222222222222 No 38 >protein:vir:7329 Length: 825 # NCBI annotation: hypothetical protein # Family: family:all:780 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848220;genbank:gi:30387391;genbank:GeneID:2641863 Probab=92.38 E-value=0.012 Score=31.19 Aligned_cols=649 Identities=9% Similarity=0.006 Sum_probs=207.4 Q ss_pred cccccCCceeEEEeeccccceecccccccc---ccccCCcccccceEeecceeEeeeeeeeEEeeCceeeEeeeccCCcC Q lcl|NC_020838. 180 LERIDNGQRIVKDDGTNAGSIAAGSAMPSG---YSLGNERTDDYPWFKRDGYRVYEVEKEVAAAYNSTELSTANTNMGTA 256 (981) Q Consensus 180 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~a~~~~~~~~~~~~~~~~ 256 (981) +.+ ...+.+-+. .-+...+....+ |+.+-.++-+....-.+|...|+++.+++.+.+.--..+|++. T Consensus 1 m~~-----~~~q~sF~~-GElsP~l~gR~Dl~~y~~g~~~~~N~~~~p~Gg~~rRpGt~fva~~~~~~~~~rLipF---- 70 (825) T protein:vir:73 1 MAF-----SWIQPSFAG-GEIGPSLYGRIDMSKYQVALRKCDNFIVRQYGGVENRPGTRFVGPAKYPDRKCRLIPF---- 70 (825) T ss_pred Ccc-----ceecccccc-ceechhhcccchHHHHHHHHHHhcCcEEEecCCceecCchHHhHhhcCCCCCEEEEEE---- Confidence 000 001111111 112222222221 2212223345555667899999999999887655444455522 Q ss_pred cCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCcceeeEEEEcCEEEEeCCceeEeccCCCCCCCCceEEE Q lcl|NC_020838. 257 QTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDIEILTINDYTFVLNKNKTTAMKTTTSAAVPNVAFV 336 (981) Q Consensus 257 ~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl~~~t~ad~tfi~n~~~~~~~~~~~~~~~~~~~~v 336 (981) + ..+++.|+.....+ .||+..-.. +++.+ T Consensus 71 -------~-fs~~q~y~Lefg~~---------------------~lrv~~~gg-~v~~~--------------------- 99 (825) T protein:vir:73 71 -------Q-FSTVQTYALEFGHN---------------------YMRVIKDGA-YVLTT--------------------- 99 (825) T ss_pred -------E-eCCCcEEEEEEeCC---------------------eEEEEeCCc-eEecc--------------------- Confidence 2 11222332111111 111111000 00000 Q ss_pred EEeeeccceEEEEeeCceEEEEEeccCCCcceecHHHHHHHHHhhhhccCceEEEEcCCEEEEEeCCCceEEEec-CcCc Q lcl|NC_020838. 337 VIRIVAYNSDYSVTLNGTTVTHSTPDTVAGATTDSGSIAAALTSSINALTGFSATQVGPGIYIEGTSAFSISTSG-STTE 415 (981) Q Consensus 337 ~v~~g~y~~~~~v~~ng~~~~~~t~~~~a~~~~~~~~ia~~l~~~i~~~~~~~~~~vg~~i~i~~~~~~~vt~~~-g~~~ 415 (981) ++......||+.. .. +..+..+|++++++|.|++-+.-+... +.++ T Consensus 100 ---------------~~~~~e~~TPy~~-----------~~-------l~~l~~~QsaD~~~i~h~~~pp~~L~r~~~~~ 146 (825) T protein:vir:73 100 ---------------SNVIYELAMPYAD-----------TD-------LFRIKFTQSADVLTLVHPAYPPKELRRYAHDN 146 (825) T ss_pred ---------------CCceEEEecccch-----------hh-------hhhheeeeecCEEEEEcCCCceeEEEEecCCC Confidence 0000111122211 11 123456678888888888877666544 3344 Q ss_pred ceeEEEEEEEeechhccccCC-CCcEEEEEccCCCcccceEEEEEcccCCc--ccceEEEEeecccee--EEEccccceE Q lcl|NC_020838. 416 EGIFAFQDQINVASRLPNQCE-NGYRVRVTNSGDVTADDIYVEFQTTNSAA--RGPGVWEETIGPSLE--FEIDETTMPH 490 (981) Q Consensus 416 t~~~~~~~~v~~~~~Lp~~~~-~G~~v~v~~~g~~~~d~yyv~~~~~~~~~--~g~~~W~E~a~~~~~--~~~~~~Tmp~ 490 (981) +.+..+..... |.... -...+.++.++.. ..+.+++..+.... .+.....+......+ +......... T Consensus 147 W~l~~~~f~~g-----p~~~in~~~sv~v~asg~t--g~~TiTaS~a~~~~~~vG~~i~~~~~~v~si~~~~~~~~~~~~ 219 (825) T protein:vir:73 147 WQIVDVTTKNG-----PFEDINVDETVKVYASAST--GTITLTASSAIFGAEQVGKLFYLEQPAVDSVPVWETSKTTAIN 219 (825) T ss_pred cEEEEEeccCC-----ccccccccccceeeecccC--ceeEEEeeccccCchhcCeEEEEecccccccceeeeeeEEEee Confidence 54444433221 11111 0112333443332 22333333322111 111111111111000 0011111111 Q ss_pred EEEeccCCceeeecccCCccc---CCCcccCcCccccCCceeE--EEEEcceEEEecCCeEE---EEecccccccccccc Q lcl|NC_020838. 491 QLIRQANGVFKYEPVTWDDRL---VGDNTTNPIPSFIGKKINN--MFFYRNRLGLLSNEAVI---MSRAGDYFNFFANSS 562 (981) Q Consensus 491 ~lv~~a~g~f~~~~~~w~~r~---~GDd~tnp~psF~g~~ps~--v~ffq~RL~f~s~~~V~---~Srtgdy~NF~~~s~ 562 (981) .+++.....+.+......... ..+...+......+.+... .-+++..-....-..++ ...+++=.+|.+... T Consensus 220 ~v~~~~~~~~~~~~~~~~~t~~~~a~~g~~~~~~~g~~~~~~~~~~~~~~~~~g~~~it~~~~~~~~~~~~~~~~~~~~~ 299 (825) T protein:vir:73 220 DVRRADSNYYRANTSGKTGTLRPSHTEGMSWDGWGGTGSDDTGIQWEYLHSGFGIAKITAVAGDGLTATADVVSFIPSQV 299 (825) T ss_pred eEEECCCceeeeecccccceeeccccCCceeEeeeeecccCCceEEEEEecCCceEEEeeccccceeeccccceeccccc Confidence 122222222221111000000 0000000000000000000 00111100000000111 123444455544322 Q ss_pred ccccCCccEEEEEc-----CCCceeEEEEeecCCcEEEEec--CcEEEE-E--cCCccccccce-------EEEEEeeec Q lcl|NC_020838. 563 QVVAPDDPIDLQAT-----SVKPVTLNYTLATSIGLLVFGP--NEQFVL-S--TDADILSPTTT-------KINTISTFE 625 (981) Q Consensus 563 ~~~~DdDpI~~~i~-----s~~~n~I~~~v~~~~~L~l~T~--g~q~~l-~--g~~~~LTP~~~-------~i~~~S~~~ 625 (981) ..- .+.-.+.... ...+ ..+.-+...|++..+ ..|+++ + |+-.-+.|++. .+...+. T Consensus 300 ~~~-~~~t~~~~~~~~~~~~gyP---s~v~f~q~RL~f~g~~~~p~~v~~Srtgd~~nF~~~~~~~DdD~I~~~~s~~-- 373 (825) T protein:vir:73 300 VGS-ANASYKWAKYAWNSVNGYP---STVVYYQQRLYFAASTAYPQTIWASRTGDYKDFGKNNPIQDDDRIIYTYAGR-- 373 (825) T ss_pred ccC-CCCCcccccCCcccCCCCc---cEEEEEcceEEEeecCCCCCEEEEEccCCccccccCCCCCCCccEEEEEcCC-- Confidence 111 1111222111 1122 223344555666543 345554 3 22112444442 2222211 Q ss_pred cccCCCcEEeCCeEEEEecCCCeeEEEEEeeccc-ccc--eehhhHHHHHHHhcCCCeEEEEEcC-CCcEEEEEecCCcE Q lcl|NC_020838. 626 CDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQK-EAA--ATIDEATTNVPEYVPSDIDSMTASP-AMSIVSLGKSGSNT 701 (981) Q Consensus 626 ~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~-~d~--~~a~DlS~~~~h~~~~~i~~~~~s~-~~~~~~~~~~g~~~ 701 (981) -.+.++=++-.+.+++...++.+ ..+.. .+. +..-.+..+-. + ++....--. ...++++ .++... T Consensus 374 ~~~~i~~~~~~~~L~~~t~~~e~------~l~~~~~~~lTP~~~~~~~~s~--~--g~~~~~Pv~vg~~~~Fv-~~~g~~ 442 (825) T protein:vir:73 374 QVNEIRHLIDVGNLVALTSGGEY------TISGDQNKVLTPSAFSFSSQGN--N--GSSNVPPIAVANIALFI-QEKGSV 442 (825) T ss_pred cceeEEEEeecCcEEEEecCceE------EEecCCCcccceeeEEEEeeee--e--ccccccceEeCCeEEEE-eCCCCe Confidence 11123222333455555555542 22221 122 22222222111 1 111100001 1123333 333334 Q ss_pred EEEEEeecCcch----heeeeEeeccCC-ceEEEEEeC--C-eEEEEEEcCCcEEEEEEEeecCCceeE----------- Q lcl|NC_020838. 702 VYQHRFFMQGEN----RVQTWYKWQLTG-DLRLQFFDK--T-TFYAVTSSGSNVYLTSYDLTQASESGY----------- 762 (981) Q Consensus 702 l~~y~y~~~~eq----~V~aWsrw~~~G-~v~sv~~~~--d-~ly~vv~r~~~~~l~~~~~~~~~~~~~----------- 762 (981) +.-|.|-+..+. ++..=+.|-+.| .+..++... . .+|++ +-++.++...+...++-..+. T Consensus 443 vre~~~~~~~d~~~~~dlt~~a~hl~~~~~~~~~a~~~~p~~~~~~v-~~dg~l~~~ty~~~q~v~aW~~~~~~g~v~~~ 521 (825) T protein:vir:73 443 VRDLAYSFDVDGYQGTDLTILANHLFQKHSIVDWSFCIVPYSSAFCI-RDDGKLLVLTYLRDQQVFAWAPQSSAGKYEST 521 (825) T ss_pred EEEEEEeeecCceeccchhhhhHhhccCCceEEEEEcCCCceEEEEE-ecCCeEEEEEEeccccceeeEEEecCCcEEEE Confidence 444433222221 222333566665 355554432 2 23333 223333322222222110000 Q ss_pred --------------EEecCCCcccccc---cc-eeeeeee-eeccCC---ceEEEcccCCCcCCceEEEEEcCcccCcee Q lcl|NC_020838. 763 --------------LTLPTGEKTDVCL---DM-FNVNPYR-TYSTST---KKTTVNLPFDHITGKKLAVVAIGTYIGDTI 820 (981) Q Consensus 763 --------------~~~~~~~~~~~~l---D~-~~vd~~~-ty~~~~---~~tt~~l~~~~l~g~~v~v~adG~~~~~~~ 820 (981) +++.....-..++ +. ++.+... .|-... .......++.||+|+++.++++|.. T Consensus 522 ~~i~~~~~D~l~~iV~R~~~g~~~~yiE~~~~~~~~~~~~~~~vD~g~~~~g~~~~~~l~~l~g~tv~~~~~g~~----- 596 (825) T protein:vir:73 522 CSISEGSEDAVYFVVNRTINGQTVRYIERLSSRLFTNDEDAFFVDCGLSYDGRNTSSRTMTISGGTGDWSYQVDY----- 596 (825) T ss_pred EEecCCCccEEEEEEEEeeCCceEEEEEEecccccCCCcceeEEEEEeeecccceeeceeeeCCceEEEEeCCeE----- Confidence 0000000000011 10 1110000 011000 0112346789999999999998752 Q ss_pred EEEEecCceEEEccccccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeE---EEEEEEEEee Q lcl|NC_020838. 821 SATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDL---ILHRLKVSTG 897 (981) Q Consensus 821 ~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl---~l~r~~v~~~ 897 (981) +.+|.+|.|+|+.+. .++|||+|......+++.... +. .....|. ++.++.+... T Consensus 597 --------------~~~v~~g~itl~~~~---~~~i~l~~~~~~~~~~~~~~~--~~---~~~i~~~~~~~~v~v~~~~~ 654 (825) T protein:vir:73 597 --------------PVTVSGGAYFVNTDV---GAQIQFPYTGTDPDTNEPVAK--EL---RGDIISVTSNTAVVVRFNRN 654 (825) T ss_pred --------------EEEEcCCeEEecccc---eEEEEecccCcccccccceec--ee---eEEEccccCceEEEEEeccc Confidence 235789999997543 589999999887766654321 11 0011111 1222111111 Q ss_pred cccceEEEEccCCCCceeeEecccc----------cCccccCccccccCCeEEEEeeccCcceEEE---EEECCCCCEEE Q lcl|NC_020838. 898 LSGPITYKVDITGKDEWTNIINVTL----------PNTYVLNNVNLSASALHDVPIYQRNENVNIK---IIGDTPFPISL 964 (981) Q Consensus 898 ~Sg~~~v~v~~~~~~~~~~~~~~~~----------~~~~~~~~~p~~~~~~~~vp~~g~~~~~~v~---I~~~~PlPltv 964 (981) . -++................++.. .+...+.. ..+.++....|.. .....|= ..--.|||+.+ T Consensus 655 ~-~a~~~~~~~t~~~~a~~~~~gL~hLeG~~v~v~~Dg~~~~~-~~V~~G~vtl~~~--~~~v~vGl~y~~~~~~l~~~~ 730 (825) T protein:vir:73 655 V-PPVLRNVATTNWQMARQTFSGLAHLEGQTVNILSDASVEPQ-KTVTGGAVTLESP--GAVVHIGLPITAEFETLDINI 730 (825) T ss_pred c-cceeeeecccCCCcchheeccccccCCceEEEEECCeeeCC-eEecCcEEEecCC--ceEEEEeeCccceEEeccccc Confidence 0 01111111111111111111110 00011111 1123344444421 1111111 01124556554 Q ss_pred EEEEEEEEeeccccccC Q lcl|NC_020838. 965 LNIVWEGNYNRRFYRRS 981 (981) Q Consensus 965 lsi~weg~y~~r~rRr~ 981 (981) .+ .|.---|.||=. T Consensus 731 ~~---~g~~~g~~~ri~ 744 (825) T protein:vir:73 731 NG---QETLLDKKQVIP 744 (825) T ss_pred CC---CccccCccEEEE Confidence 32 233333333322 No 39 >protein:vir:3529 Length: 477 # NCBI annotation: P28 # Family: family:all:1540 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050989;genbank:gi:9633575;genbank:GeneID:1262322 Probab=87.47 E-value=0.038 Score=28.34 Aligned_cols=435 Identities=10% Similarity=0.039 Sum_probs=141.6 Q ss_pred HHHHHHhhhhccC---ceE-EEEcCCEEEEEeCCCceEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCC Q lcl|NC_020838. 374 IAAALTSSINALT---GFS-ATQVGPGIYIEGTSAFSISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDV 449 (981) Q Consensus 374 ia~~l~~~i~~~~---~~~-~~~vg~~i~i~~~~~~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~ 449 (981) +.....=.+.+++ +.. ....++++--. |-+..-.+..+.. +...|+ +.+|...+..+.|-. T Consensus 1 ~~~~~~m~~~~ipl~~g~~~~~~~~d~~~~~-PVN~~a~p~~~~~------------s~~~L~--~~pG~~~~~~~~G~~ 65 (477) T protein:vir:35 1 MLSEVFMPKIQIPLAKGLVKDIKTADYIDAL-PVNMLATPKEVLN------------ASGYLR--SFPGIEKKQDAKGVS 65 (477) T ss_pred Ccccceeeeeccccccccccccccccceeee-eeccceeeccccc------------cccccc--cCCcceeeccCCccc Confidence 1000000000111 000 00111111000 0000001111100 001111 112222222211110 Q ss_pred cccceEEEEEcccCCcccceEEEE------eeccceeEEEccccceEEEEec-cCCceeeecccCCcccCCCcccCcCcc Q lcl|NC_020838. 450 TADDIYVEFQTTNSAARGPGVWEE------TIGPSLEFEIDETTMPHQLIRQ-ANGVFKYEPVTWDDRLVGDNTTNPIPS 522 (981) Q Consensus 450 ~~d~yyv~~~~~~~~~~g~~~W~E------~a~~~~~~~~~~~Tmp~~lv~~-a~g~f~~~~~~w~~r~~GDd~tnp~ps 522 (981) .--||+.....-=..-|.-.|+- .++-+.+.--...+.. +++.+ ...-|..+--.+.-+. ...+..|+ T Consensus 66 -RG~~~~~~~g~lY~V~G~~LY~v~~~vG~I~gsg~VsMa~n~~~~-aIv~~g~~~gy~y~~t~~~~~~---~~~~~~p~ 140 (477) T protein:vir:35 66 -RGVHFNTKNNALYRVCGNTLYRNDKEVADIAGMSRVSMSHSSHSQ-AICFEGKVKLYRYDGTEKALSN---WPKDKYPQ 140 (477) T ss_pred -cceeEeecCCeEEEEecCeeEeeeeeeeeecccccEEEeeCCcEE-EEEECCcceeEEEecccceeee---cCccccCC Confidence 11111111110000001112221 1122222222222111 11111 1111332222222221 22234677 Q ss_pred ccCCceeEEEEEcceEEEecC--CeEEEEeccccccccccccccccCCccEE-EEEcCCCceeEEEEeecCCcEEEEecC Q lcl|NC_020838. 523 FIGKKINNMFFYRNRLGLLSN--EAVIMSRAGDYFNFFANSSQVVAPDDPID-LQATSVKPVTLNYTLATSIGLLVFGPN 599 (981) Q Consensus 523 F~g~~ps~v~ffq~RL~f~s~--~~V~~Srtgdy~NF~~~s~~~~~DdDpI~-~~i~s~~~n~I~~~v~~~~~L~l~T~g 599 (981) |....+..|+|...|++|..+ +.++.|-.-|-. --|+++ ++-+.++++.|.-++.+.+.|++|.+. T Consensus 141 ~~l~~~~~v~f~dGyfV~~~~gt~~~~iS~L~d~s-----------~~d~~~~FasAE~~pD~Ivgi~~~~~~i~lfG~~ 209 (477) T protein:vir:35 141 YDLGEVIDVCRNRGRYIWLQKGGERFGVTDLEDES-----------KPDRYQPFYRAESQPDGIVSVDAWRDLIVCFGSS 209 (477) T ss_pred ccccceeEEEeeCceEEEeecCCCeEEEeecCCcc-----------ccccccccccccCCCCceEEEEeeccEEEEEecc Confidence 777778899999999888753 334446322211 135566 566778889999999999999999877 Q ss_pred cE--EEEEcCCcccc-c-cc--eEEEEEeeeccccCCCcEEeCCeEEEEecCCC-eeEEEEEeecccccceehhhHHHH- Q lcl|NC_020838. 600 EQ--FVLSTDADILS-P-TT--TKINTISTFECDAEIDAVAVGTTQAFISKSNL-YSKLFLMLNVQKEAAATIDEATTN- 671 (981) Q Consensus 600 ~q--~~l~g~~~~LT-P-~~--~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~-~s~vre~~y~~~~d~~~a~DlS~~- 671 (981) .- |..+|.. .++ | -- ..... ..+|++.-.=..+|++++|++.... --.|+. .++|+++-+|-| T Consensus 210 TiEvw~ntG~a-~f~~p~~r~~~~~mI--q~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~------~~g~q~~rIST~a 280 (477) T protein:vir:35 210 SIEYFTLTGSA-DTSQPLYIHQAAYMI--QAGIAGRDCKCRYQDKYAILSHQSTGQPAVYL------IGAGEKNKISTAT 280 (477) T ss_pred ceEEEEecCCC-CCCcceeecCCceee--eecccCchhhhhhCceEEEEecCCCcccEEEE------ccCceeEEecCHH Confidence 66 7778764 232 2 11 11112 3577766666889999999987532 112332 356777777533 Q ss_pred HHHhcCC-----CeEEE--E--EcCCCcEEEEEecCCcEEEEEEeecCcchh-eeeeEeeccC---CceEEEEEe-CCeE Q lcl|NC_020838. 672 VPEYVPS-----DIDSM--T--ASPAMSIVSLGKSGSNTVYQHRFFMQGENR-VQTWYKWQLT---GDLRLQFFD-KTTF 737 (981) Q Consensus 672 ~~h~~~~-----~i~~~--~--~s~~~~~~~~~~~g~~~l~~y~y~~~~eq~-V~aWsrw~~~---G~v~sv~~~-~d~l 737 (981) +++.+.. .+... + ...+...++-+- + .-++|+- ..++ --.|+--... ...+..+++ -+.- T Consensus 281 IE~~i~ay~~~e~a~af~~t~~~eGH~fy~LtfP--~-~Tw~yD~---at~~w~e~W~~~~~g~~~~~~Ra~~~~~~~g~ 354 (477) T protein:vir:35 281 IDKIIRYYSADELAASFMESIRFDNHELLLLHLP--K-HTLCFDG---SASHQYSQWSLLKSGFYDEPYRAIDFMFFDNQ 354 (477) T ss_pred HHHHHHhcCCcchhceeEEEEEeCCeeEEEEEcC--C-ceEEEec---ccccccceeeeeccCCccCceEEEEEEEeCCe Confidence 4444432 11111 2 223333333333 2 4556641 1111 0024332222 234443332 2222 Q ss_pred EEEEEcCCcEEEEEEEeecCCceeEEEecCCCcccccccceeeeeeeeeccCCceEEEcccCCCcCCceEEEEEc----- Q lcl|NC_020838. 738 YAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFNVNPYRTYSTSTKKTTVNLPFDHITGKKLAVVAI----- 812 (981) Q Consensus 738 y~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~vd~~~ty~~~~~~tt~~l~~~~l~g~~v~v~ad----- 812 (981) +++=+..++ .|-.+++....+ +...+.+..+. |+-|.++..+-.+.- T Consensus 355 ~~vGD~~ng-~l~~ld~~~~~d-----------~g~~i~~~~~~----------------p~~~~d~~Rv~~~el~~~tG 406 (477) T protein:vir:35 355 ITVGDKKEG-VLGHLIFNASNQ-----------YEQQTEHLLYT----------------PMIKADNARLFDFELEASTG 406 (477) T ss_pred EEEEEcCCC-eEEEECCCCccc-----------CCCccceEEec----------------ceeeCCCCeEEEEEEEEecC Confidence 222222211 111111100000 00000111110 111111111100000 Q ss_pred -CcccCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeEEEEE Q lcl|NC_020838. 813 -GTYIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDLILHR 891 (981) Q Consensus 813 -G~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r 891 (981) |...... .-....||.. |-.+... ..|... ....|++.+| T Consensus 407 vgq~~d~v-~L~~sddG~~-----------------------------~~~~~~~-------~~g~~g--~~~~r~~~~R 447 (477) T protein:vir:35 407 VAQIADKL-FLSVTTDGIN-----------------------------YSREQLI-------EQNSPF--QYDKRILWRR 447 (477) T ss_pred cCccCceE-EEEEeccccc-----------------------------cccceee-------cCCCcc--ccccceeeee Confidence 0000000 0000112222 1111110 001100 0112222332 Q ss_pred EEEEeecccceEEEEccCCCCceeeEeccccc Q lcl|NC_020838. 892 LKVSTGLSGPITYKVDITGKDEWTNIINVTLP 923 (981) Q Consensus 892 ~~v~~~~Sg~~~v~v~~~~~~~~~~~~~~~~~ 923 (981) +-. ..+-.+|++.+.....- ...-...+.- T Consensus 448 lG~-~r~~vgf~~r~~~~~pv-~l~~~~~~~e 477 (477) T protein:vir:35 448 IGR-VRKNIGFKIRIITKSPV-TLSDLSIRME 477 (477) T ss_pred eee-ceeccceEEEEEecCCc-eeccceeEeC Confidence 211 11111222222111000 0000000000 No 40 >protein:vir:105563 Length: 396 # NCBI annotation: hypothetical protein # Family: family:all:27455 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164316;genbank:gi:56692963;genbank:GeneID:3197174 Probab=82.38 E-value=0.077 Score=26.68 Aligned_cols=336 Identities=10% Similarity=0.039 Sum_probs=115.1 Q ss_pred EEEEcCEEEEeCCceeE--eccCCCCCCCCc-eEEEEEe---eeccceEEEE-eeC-ceEEEEEeccCCCcceecHHHHH Q lcl|NC_020838. 304 ILTINDYTFVLNKNKTT--AMKTTTSAAVPN-VAFVVIR---IVAYNSDYSV-TLN-GTTVTHSTPDTVAGATTDSGSIA 375 (981) Q Consensus 304 ~~t~ad~tfi~n~~~~~--~~~~~~~~~~~~-~~~v~v~---~g~y~~~~~v-~~n-g~~~~~~t~~~~a~~~~~~~~ia 375 (981) |.++.=+.|.-=.++.. +..+.+...... +.-++|. .|++++.-.. ++. |.-. +.... T Consensus 1 ~~~~~~~~~~ginnv~~e~~l~~~~~~~~~~~r~a~nvdi~~~G~~~~r~~~tr~~~g~l~----~~~~~---------- 66 (396) T protein:vir:10 1 MATTSLVPLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQLRASVRQVTDQPFR----QLWQS---------- 66 (396) T ss_pred CcceeeeeeecccccccccccccCCCcccceeeeeeeecccCCCchhhhccCcccCCceec----ccccC---------- Confidence 22211111111111111 111111111110 1112332 2333321000 000 0000 00000 Q ss_pred HHHHhhhhccCceEEEEcCCEEEEEeCCCceEEEec----Cc------------CcceeEEEEEEEeechhccccCCCCc Q lcl|NC_020838. 376 AALTSSINALTGFSATQVGPGIYIEGTSAFSISTSG----ST------------TEEGIFAFQDQINVASRLPNQCENGY 439 (981) Q Consensus 376 ~~l~~~i~~~~~~~~~~vg~~i~i~~~~~~~vt~~~----g~------------~~t~~~~~~~~v~~~~~Lp~~~~~G~ 439 (981) .+.+++....+++|+-.+++..+....- +. +++..-.+.+-.++- .|-.-+|. - T Consensus 67 --------~~~~~~~~~~~~tl~~~~~~~w~~~~~v~v~~~pva~d~~~~Rvy~t~~~~p~~~~~~~~y-~L~vp~P~-~ 136 (396) T protein:vir:10 67 --------PLHGDAFGALGDQWGKVDPHSWTFEPLAQIGEGDLSHEVLNNRVCVAGTAGIFTYDGAQAE-RLTLDTPA-P 136 (396) T ss_pred --------ccccceeeeCCceEEEEeCCeEEEEeeeeeccCchhccccCCeEEEEcCCCceeeeCCcce-ecCcCCCc-c Confidence 0011222222333332222221111000 00 000000000000000 01111111 1 Q ss_pred EEEEEccCCCcccceEEEEEcccC----Cc----ccceEEEEeeccceeEEEccccceEEEEec-c-------------- Q lcl|NC_020838. 440 RVRVTNSGDVTADDIYVEFQTTNS----AA----RGPGVWEETIGPSLEFEIDETTMPHQLIRQ-A-------------- 496 (981) Q Consensus 440 ~v~v~~~g~~~~d~yyv~~~~~~~----~~----~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~-a-------------- 496 (981) .+.+...|+-..++|.+.+..-+. +. ...+.|+-....+.....+....-..+.+. + T Consensus 137 a~~~a~~Gsl~~~~~~Y~~t~V~~~gEEs~p~~~S~~v~~~gg~~vtl~~~~~~~i~~~RiYrS~~~G~~~~l~aE~~a~ 216 (396) T protein:vir:10 137 PLLVAGAGSLSQGTYGAAVAWLRGPQESAPSLIAFAEVTDAGALEVTFPLCLDASVTGARLYLTRANGGELLLAGDYPLG 216 (396) T ss_pred cccccccCccCCceEEEEEEEEecCCCcCcccccccccCCCCCcEEEEEcccCCCcceEEEEEeCCChhhhhheehhccc Confidence 122334455555666332222211 11 111222110000011111111111222222 1 Q ss_pred CCceeeecccCCcccCCCcccCcCccccCCceeEEEEEcceEEEecCCeEEEEecccccccc-ccccccccCCccEEEEE Q lcl|NC_020838. 497 NGVFKYEPVTWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLLSNEAVIMSRAGDYFNFF-ANSSQVVAPDDPIDLQA 575 (981) Q Consensus 497 ~g~f~~~~~~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~s~~~V~~Srtgdy~NF~-~~s~~~~~DdDpI~~~i 575 (981) ..+|.+...+|.....-..-=-|+|.. .-+.||..||+++.++.||+|...-++=+. +..-+ T Consensus 217 ~~s~vlPs~~w~gpP~~~~gL~pmP~G-----~~~A~faGRi~~A~Gn~V~FSEp~~Ph~~~~~~~~~------------ 279 (396) T protein:vir:10 217 AATVILPTLPELGRPAQFRHLSPMPTG-----KHLAYWRGRLLIARANVLRFSEALAYHLHDERYGFV------------ 279 (396) T ss_pred eeeeeeecCCCCCCCccccccccCchh-----HhhhhhcceEEEEeCCEEEEecCCCCceecchhccC------------ Confidence 123444555665432111111222211 037899999999999999999998873222 11111 Q ss_pred cCCCceeEEEEeecCCcEEEEecCcEEEEEcCCccccccceEEEEEee---ecccc---------CCCcEEeCCeEEEEe Q lcl|NC_020838. 576 TSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDADILSPTTTKINTIST---FECDA---------EIDAVAVGTTQAFIS 643 (981) Q Consensus 576 ~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~~~LTP~~~~i~~~S~---~~~s~---------~v~Pv~vG~~v~Fv~ 643 (981) .. +..|+-+.+.+.+|+++|.++-|.+.|.+ |++.+..+... .-|+. .-..+..|..++|++ T Consensus 280 ~~--~~~Iv~lapv~~gL~Vgt~~~~y~~~G~d----P~sms~~~l~~~~pvp~S~v~~p~~~~s~rs~~~~~~~~lwas 353 (396) T protein:vir:10 280 QM--PQRITFVQPVDGGIWVGQVDHVAFLDGAD----PASLSVSRRASRAPVPGSAVLVPAEVVGTNASPDGSPVAVWLA 353 (396) T ss_pred CC--CCceEEEEEecCeEEEEEcCcEEEEEcCC----hhHcceeecccCCCcccchhcccchhhhcccccccCcEEEEcc Confidence 11 12466677888999999999999999853 55555444421 12322 122244588899999 Q ss_pred cCCCeeEEEEEeecccccceehhhHHHHHHHhcCCCeEEEEEcCCCcEEEEEecCCcEEEEEE Q lcl|NC_020838. 644 KSNLYSKLFLMLNVQKEAAATIDEATTNVPEYVPSDIDSMTASPAMSIVSLGKSGSNTVYQHR 706 (981) Q Consensus 644 ~~g~~s~vre~~y~~~~d~~~a~DlS~~~~h~~~~~i~~~~~s~~~~~~~~~~~g~~~l~~y~ 706 (981) +.|-++ -..++-... ++. ..+... ...+. .++. .| ...+..|. T Consensus 354 ~dGl~~--------g~~~G~v~~-l~~---~~i~p~-~~~A~-----~~~~-~d-rRy~~~~~ 396 (396) T protein:vir:10 354 ENGYVM--------GTSSGAIAE-VHA---GVLAGI-TGRAG-----TSVV-FD-RRLLTAVS 396 (396) T ss_pred CCcEEE--------EcCCceeee-ecc---cccCCC-cccce-----EEEe-ec-CeEEEEeC Confidence 988311 111211111 111 111110 01111 0111 01 11121111 No 41 >protein:vir:97014 Length: 800 # NCBI annotation: 33 # Family: family:all:825 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654134;genbank:gi:108862018;genbank:GeneID:5075963 Probab=79.91 E-value=0.1 Score=26.07 Aligned_cols=644 Identities=10% Similarity=0.042 Sum_probs=171.0 Q ss_pred CCceeecchh---------hhcccc-ccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCCCCceEEEEEeCC Q lcl|NC_020838. 1 MSTISQRIPN---------LLLGVS-QQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAEARGRWFPILRDE 70 (981) Q Consensus 1 m~~v~~s~~~---------l~~GvS-qQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~~~~~~~~~~rd~ 70 (981) -+-|+|..+- ++.++| .+-...|+|| .+.+.-+.++-. +.....|+ .+..... ...+.. - T Consensus 11 ~~GvSqq~d~~R~~~q~~~~~N~~~~~~gGl~rRpG--t~fva~l~~~~~---~~~~~~~~---~~~d~~e-q~~v~~-~ 80 (800) T protein:vir:97 11 IQGISQQPPAVRLDGQCTAMVNMIPDVVNGTQSRMG--TTHIAKILDAGT---DDMATHHY---RRGDGDE-EYFFTL-K 80 (800) T ss_pred hcccccCchhHhhhhhhhhhhcceeccccccccCCc--hhhheeecCCCc---ccceeEEE---EEcCCce-EEEEEE-E Confidence 1113332221 122221 2334567777 344433444321 11111121 1111211 222222 1 Q ss_pred CceEEEEEEcCCCcEEEEEcCCCeEEEEeeccccccccccce-eeccccceeEEEEeCcEEEEecCccc---------cc Q lcl|NC_020838. 71 EEKYVCQYDTTDGQFRIWSLIDGQPRAVDMGTTAATGQPSGC-NITNLKSDLDVYNTAQDDTDTKLNDL---------NS 140 (981) Q Consensus 71 ~e~y~~~~~~~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~-~~~~~~~~l~~~tv~d~t~i~n~~~~---------~~ 140 (981) ...|+.+++- +|.....+..++.+- |+ .+++++++|++++++|||||+|+... .+ T Consensus 81 ~~~~~rv~~~-~G~~~~v~~~~~~~~--------------y~~~~~~~~~~l~~~tvaD~~fi~n~~~~~~~~~~~~~~~ 145 (800) T protein:vir:97 81 KGQVPEIFDK-YGRKCNVTSQDAPMT--------------YLSEVVNPREDVQFMTIADVTFMLNRRKVVKASSRKSPKV 145 (800) T ss_pred cCCEEEEEec-CCcEEEEecCCcceE--------------EEeccCCCccceeEEEEcCEEEEeeCceecccccccccCC Confidence 2357777754 566666666655443 22 23467889999999999999998532 34 Q ss_pred ccceEEEEcCCcccccceeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceeccccccccccc-------- Q lcl|NC_020838. 141 KQATYTKTNDGQTATKVNLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYSL-------- 212 (981) Q Consensus 141 ~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------- 212 (981) ...+++++|+|+|++++++ .| ++..+..|....+ ++..+....++++|+.+++....... T Consensus 146 ~~~~~~~v~~g~y~~~y~i---~I---~~~~~~~~~t~~~------t~~~~~~~~~~~~ia~ql~~~~~~~~~~~~~t~~ 213 (800) T protein:vir:97 146 GNKAIVFCAYGQYGTSYSI---VI---NGANAASFKTPDG------GSADHVEQIRTERITSELYSKLQQWSGVSDYEIQ 213 (800) T ss_pred CcceEEEEeecccceeeee---cc---CCcceEEEEEcCC------CCcccceeccHHHHHHHHHHhhhccccccceEEE Confidence 5568999999999987653 22 2333444443222 22222333344444433322111000 Q ss_pred cCCcc--------cccceEeeccee----------EeeeeeeeEEeeCceeeEeeeccCCcCcCceEEEEEcccccc-eE Q lcl|NC_020838. 213 GNERT--------DDYPWFKRDGYR----------VYEVEKEVAAAYNSTELSTANTNMGTAQTAYDNAVSTESTEK-GD 273 (981) Q Consensus 213 ~~~~~--------~~~~~~~~~g~~----------~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~-~~ 273 (981) ..+.. .+..+.+.+|++ .....++.+.+.+...++..........+.|+.+...+.+.. |. T Consensus 214 ~~G~~~~i~~~~~~~~~v~t~~g~~~~~~~~~~~~v~~~~~lp~~~~~g~~v~i~~~~~~~~~~y~~~~~~~~~~~~~w~ 293 (800) T protein:vir:97 214 RDGTSIFIERRDGASFTITTTDGAKGKDLVAIKNKVSSTDLLPSRAPAGYKVQVWPTGSKPESRYWLQAEPKEGNLVSWK 293 (800) T ss_pred eCCcEEEEEEcCCceEEEEecCCcCceeeeEEeeeccchhhchhhCCCCcEEEEEccCCCCCceEEEEEEecccCcceEE Confidence 00000 011122233332 222233333333433333332222222222332222111100 10 Q ss_pred -EecceEEEEeeeeccc------------ceeecc------------cCCcceeeEE------EEcCEEEEeCC------ Q lcl|NC_020838. 274 -YDSEVTACNIGSSNIP------------ASAYLK------------DAAPEDIEIL------TINDYTFVLNK------ 316 (981) Q Consensus 274 -~~~~g~~~~~~~~~~~------------~~~y~~------------~~~~~dl~~~------t~ad~tfi~n~------ 316 (981) -...+....+.....| ++.-++ .++| .-.|. ++.+++|.=|| T Consensus 294 e~~~~~~~~~~~~~tmp~~~~~~~~~~~~g~~~~~~~~w~~r~~gd~~tnp-~p~f~~~~~~~~~~~v~f~q~RL~f~~~ 372 (800) T protein:vir:97 294 ETIAADVLLGFDKGTMPYIIERTDIINGIAQFKIRQGDWEDRKVGDDLTNP-MPSFIDEEVPQTIGGMFMVQNRLCFTAG 372 (800) T ss_pred EeeccccccceecccceEEEEEeecccccceeEEEeccccccccCccccCc-cccccCCcCCCCceeEEEEeeeEEEecC Confidence 0000000000000000 000000 0000 00000 12222222222 Q ss_pred ceeEeccCCCCCCC---------CceEE-EEEeee---------ccceEEEEeeCceEEEEEeccCCCcceecHHHHHHH Q lcl|NC_020838. 317 NKTTAMKTTTSAAV---------PNVAF-VVIRIV---------AYNSDYSVTLNGTTVTHSTPDTVAGATTDSGSIAAA 377 (981) Q Consensus 317 ~~~~~~~~~~~~~~---------~~~~~-v~v~~g---------~y~~~~~v~~ng~~~~~~t~~~~a~~~~~~~~ia~~ 377 (981) +.....++....+. +...+ +.+.+. .+.+.+-+-.++..-..... ..++|.++.-. T Consensus 373 ~~v~~Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~v~~i~~~v~~~~~L~i~T~~~q~~ls~~-----~~lTP~~~~~~ 447 (800) T protein:vir:97 373 EAVIASRTSYFFDFFRYTVISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFILPGD-----KPLEKSNALLK 447 (800) T ss_pred CeEEEEecCCccccccccccCCCCCccEEEEecCCcceeeeEEeecCCcEEEEecCcEEEEeCC-----CcccceeEEEE Confidence 11111111100000 01111 011111 11111111111111111100 01222211100 Q ss_pred HHhhhhccCceEEEEcCCEEEEEeCCC-c----eEEEecCcCcceeEEEEEEEeechhccccCCCCcEEEEEccCCCccc Q lcl|NC_020838. 378 LTSSINALTGFSATQVGPGIYIEGTSA-F----SISTSGSTTEEGIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTAD 452 (981) Q Consensus 378 l~~~i~~~~~~~~~~vg~~i~i~~~~~-~----~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d 452 (981) ............-..+|+.+++..+.. . .+..+.. .-.+.+++.+.|-.+-..|.+..++...... T Consensus 448 ~~s~~~~~~~~~Pv~vG~~v~fv~~~g~~s~vre~~~~~~-------~d~~~a~DlT~~~~hl~~~~v~~~~~~~~~~-- 518 (800) T protein:vir:97 448 PVTTFEVNNKVKPVVTGESVMFATNDGSYSGVREFYTDSY-------SDTKKAQAITSHVNKLIEGNITNMAASTNVN-- 518 (800) T ss_pred EEEeeeccCCCCcEEeCCeEEEeeCCCCeeEEEEEeeeec-------ccceehhhHHHHHHHhcCCceEEEEEeCCCC-- Confidence 000000011122223454444432221 1 1111110 1112333444444444456555555543221 Q ss_pred ceEEEEEcccCC-cc---------cc--eEEE--EeeccceeE--EEccccceEEEEeccCCceeeecccCCcccCCCcc Q lcl|NC_020838. 453 DIYVEFQTTNSA-AR---------GP--GVWE--ETIGPSLEF--EIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNT 516 (981) Q Consensus 453 ~yyv~~~~~~~~-~~---------g~--~~W~--E~a~~~~~~--~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~ 516 (981) +-+.+.....+ -. .. ..|. +..+...+. .+. ..-.+.+++...+. -+..++..... +.. T Consensus 519 -~~v~~~~~~~~~l~~~~y~~~~~e~~~~aW~~~~~~~~~~~~~~~~~-~d~l~~vv~r~~~~-~ler~~~~~~~--~~~ 593 (800) T protein:vir:97 519 -RLLVTTDKYRNIIYCYDWLWQGTDRVQSAWHVWKWPIGTKVRGMFYS-GELLYLLLERGDGV-YLEKMDMGDAL--TYG 593 (800) T ss_pred -eEEEEEEcCCCEEEEEEEeecCCceEEEeEEEEecCCCeEEEEEEEc-CCeEEEEEEcCCcE-EEEEEecccCc--Ccc Confidence 11212111100 00 00 1232 111111111 111 12224444443332 23333221110 000 Q ss_pred cCcCccccCCceeEEEEEcceEEEecCCeEEEEeccccccccccccccccC--CccEEEEEcCCCceeEEEEeecCCcEE Q lcl|NC_020838. 517 TNPIPSFIGKKINNMFFYRNRLGLLSNEAVIMSRAGDYFNFFANSSQVVAP--DDPIDLQATSVKPVTLNYTLATSIGLL 594 (981) Q Consensus 517 tnp~psF~g~~ps~v~ffq~RL~f~s~~~V~~Srtgdy~NF~~~s~~~~~D--dDpI~~~i~s~~~n~I~~~v~~~~~L~ 594 (981) -+.|+.+-....++++.-. ..+++....++..+ ++.+.... +..+.|+.. +. + T Consensus 594 -----------------~~~~~~lD~~~~~~~~~~~-~~~~~v~~~~~~~~~~~~~~~~~~----v~g~~~~~G--~~-v 648 (800) T protein:vir:97 594 -----------------LNDRIRMDRQAELVFKHFK-AEDEWVSEPLPWVPTNPELLDCIL----IEGWDSYIG--GS-F 648 (800) T ss_pred -----------------cccceeccccceeeeeeee-cccceEeccccccCCCcceeEEEE----ecccccccC--ce-E Confidence 0111111111112221110 11111111111111 11111100 000111110 11 1 Q ss_pred EEecC--------cEEEEEcCCccccccceEE--EEEeeeccccCCCcEE--eCCeEEEEecCCCeeEEEEEeecccc-c Q lcl|NC_020838. 595 VFGPN--------EQFVLSTDADILSPTTTKI--NTISTFECDAEIDAVA--VGTTQAFISKSNLYSKLFLMLNVQKE-A 661 (981) Q Consensus 595 l~T~g--------~q~~l~g~~~~LTP~~~~i--~~~S~~~~s~~v~Pv~--vG~~v~Fv~~~g~~s~vre~~y~~~~-d 661 (981) ++..+ ....++++. ..+.+.+ .-.+.++ ..+|.. ..+..+.. ++ .+|+++..+... . T Consensus 649 ~~~~~~~~~~~~~~~~~~~~~~---~~~~v~vGl~Y~~~~~---~~p~~i~~~~g~~~~~---~r-~~i~r~~~~~~~sg 718 (800) T protein:vir:97 649 LFKYNPSDNTLSTTFDMYDDSH---VKAKVIVGQIYPQEFE---PTPVVIRDNQDRVSYI---DV-PVVGLVHLNLDMYP 718 (800) T ss_pred EEEecCccCcccccceEEeCCC---CCcEEEEeeeeeEEEE---ecceEEEecCCCceee---cc-eEEEEEEEeecccc Confidence 11111 001111110 1112211 1111111 011111 11222221 11 245555444432 1 Q ss_pred ceehhhHHHHHHHhcCCCeEEE------EEcCCCcE-EEEEecCCcEEEEEEeecCcchheeeeEeeccCCceEEEEEeC Q lcl|NC_020838. 662 AATIDEATTNVPEYVPSDIDSM------TASPAMSI-VSLGKSGSNTVYQHRFFMQGENRVQTWYKWQLTGDLRLQFFDK 734 (981) Q Consensus 662 ~~~a~DlS~~~~h~~~~~i~~~------~~s~~~~~-~~~~~~g~~~l~~y~y~~~~eq~V~aWsrw~~~G~v~sv~~~~ 734 (981) .+.. .+...-.+..... ..++.... ......|+..+-+.. |+.+.+|.==+...++=+|+++.-.+ T Consensus 719 ~~~~-----~v~~~~~~~~~~~~~~~~~~g~~~~~~g~~~~~tg~~~vp~~g--~~~~~~v~i~~d~PlP~tvlsi~~eg 791 (800) T protein:vir:97 719 DFSV-----EVKNVKSGKVRRVLASNRIGGALNNTVGYVEPREGVFRFPLRA--KSTDVVYRIIVESPHTFQLRDIEWEG 791 (800) T ss_pred cEEE-----EEccccCCceeeeecCccccccccccCCccccccceEEEEeec--ccceeEEEEEECCCCcEEEEEEEEEE Confidence 2211 0000000000000 00000000 000111111111111 01111110000011111222222111 Q ss_pred CeEEEEEEcCCcE Q lcl|NC_020838. 735 TTFYAVTSSGSNV 747 (981) Q Consensus 735 d~ly~vv~r~~~~ 747 (981) . ..+|.-++ T Consensus 792 ~----y~~r~~rv 800 (800) T protein:vir:97 792 S----YNPTKRRV 800 (800) T ss_pred E----eecccccC Confidence 1 01111111 No 42 >protein:vir:3133 Length: 911 # NCBI annotation: hypothetical protein # Family: family:all:5234 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640315;genbank:gi:21234408;genbank:GeneID:956056 Probab=68.90 E-value=0.23 Score=24.08 Aligned_cols=666 Identities=14% Similarity=0.104 Sum_probs=214.0 Q ss_pred ccccccccccCCcccccceEeecceeEeeeeeeeEEeeCceeeEeeeccCCc---------CcCceEEEEEcccccceEE Q lcl|NC_020838. 204 SAMPSGYSLGNERTDDYPWFKRDGYRVYEVEKEVAAAYNSTELSTANTNMGT---------AQTAYDNAVSTESTEKGDY 274 (981) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~a~~~~~~~~~~~~~~~---------~~~~~~~~i~~~~~~~~~~ 274 (981) ++ +.. .-+.++.+-..| |++--...-|-|- |+ - T Consensus 1 ~~---------------------------------~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~ 42 (911) T protein:vir:31 1 MA---------------------------------ARK--GAVNRFTPVRGWVTEGNLANYGQDVALDVENMDI-EK--T 42 (911) T ss_pred Cc---------------------------------ccc--ccccccccceeeeecCchhhcCceeEeeeccccc-hh--c Confidence 11 111 111122211111 1110000000000 00 0 Q ss_pred ecceEEEEe---eeecccceeecccCCcceeeEEE-----E--cC-EEEEeCCceeEeccCCCCCCCCceEEEEEeeecc Q lcl|NC_020838. 275 DSEVTACNI---GSSNIPASAYLKDAAPEDIEILT-----I--ND-YTFVLNKNKTTAMKTTTSAAVPNVAFVVIRIVAY 343 (981) Q Consensus 275 ~~~g~~~~~---~~~~~~~~~y~~~~~~~dl~~~t-----~--ad-~tfi~n~~~~~~~~~~~~~~~~~~~~v~v~~g~y 343 (981) ..-.+-+.+ ...+--..++-..+..|.|-.+- - .| .+.|.|.-+.+-..-+..|-.+..-++-+.--.. T Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 122 (911) T protein:vir:31 43 GLTQRRFGLFAETSSEQFLSTFTATARARGLLAVKEWREAWGDKDVNMLIFHAGYKVHVVQDTAPLRDANILLTIDLLEA 122 (911) T ss_pred ccchhheeeeeccchhhhhhhhhhhhhhcceeehhhHHHhhCCCcceEEEEecCcEEEEEecccCccccceEEEeeeecc Confidence 000000000 01110011121122222211110 0 11 3555565444433223333333333332222122 Q ss_pred ceEEEEeeCceEEE-EEeccCCCcceecHHHHHHHH-------------------Hhhhhc---cCc-eEEEEcCCEEEE Q lcl|NC_020838. 344 NSDYSVTLNGTTVT-HSTPDTVAGATTDSGSIAAAL-------------------TSSINA---LTG-FSATQVGPGIYI 399 (981) Q Consensus 344 ~~~~~v~~ng~~~~-~~t~~~~a~~~~~~~~ia~~l-------------------~~~i~~---~~~-~~~~~vg~~i~i 399 (981) | |.++|+.-+ ++...++.-+.+.-..|--.| .-.|.. +.. .+-+..|.++.- T Consensus 123 ~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 198 (911) T protein:vir:31 123 G----IKLDGVIDSPVHISVGVGFAIITNPRIEPVLIKLDDVDDEGVPTLSYEPLTLLIRTRELLTPYTTGTNYGDTLTP 198 (911) T ss_pred C----ceeeeeecCceeEEeeceEEEeecCccceEEEEeeccCccCcccccccceeeEeeehhhccccccccccCcccCc Confidence 2 112222111 111111111111111000000 000000 000 011112222211 Q ss_pred ------EeCC--CceEEEec-CcCcc-eeEEEEEEEeechhccccCCCCcEEEEEccCCCcccceEEEEEcccCCcccce Q lcl|NC_020838. 400 ------EGTS--AFSISTSG-STTEE-GIFAFQDQINVASRLPNQCENGYRVRVTNSGDVTADDIYVEFQTTNSAARGPG 469 (981) Q Consensus 400 ------~~~~--~~~vt~~~-g~~~t-~~~~~~~~v~~~~~Lp~~~~~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g~~ 469 (981) .... .++..+.| +++++ ....++|.-+.-.-.|.+.- .--.-+..+ T Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~-------------------- 254 (911) T protein:vir:31 199 EEEWNLYNSGWATITRATKDKSGSGTVYVNPVQYYFDKRGVYPSHSV----LYNSMKQES-------------------- 254 (911) T ss_pred hhhcccccccceeeeeecccCCccceEEEchhheeecccCcCcchhh----hhhhhhhhc-------------------- Confidence 0110 11111222 12222 11222222221111111110 000000000 Q ss_pred EEEEeeccce-------eEEEccccceE-EEEecc----------CCceeeecccCCc-----ccCCCcccCcCc----- Q lcl|NC_020838. 470 VWEETIGPSL-------EFEIDETTMPH-QLIRQA----------NGVFKYEPVTWDD-----RLVGDNTTNPIP----- 521 (981) Q Consensus 470 ~W~E~a~~~~-------~~~~~~~Tmp~-~lv~~a----------~g~f~~~~~~w~~-----r~~GDd~tnp~p----- 521 (981) -||.++-+. -+.|...|-|. .++..+ -|.-.+.++.-+. -.+.++.+||.. T Consensus 255 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~e~~np~gl~~ig 333 (911) T protein:vir:31 255 -AKEIVALNVFSPWADEKINFGTTTPPLGRYIHSAYYFDSAAILSLGIGNLTPPTSDGTTEGSGPAEEEISNPIGLDNIG 333 (911) T ss_pred -cceeEEEeeeccccccccccccCCCchhhhhhhheeeccceeeeecccccCCCCCCCccCCCCCchhhhcCCCCccccc Confidence 011111000 01111111111 011100 0111111111111 112245577742 Q ss_pred c-c-------------cCCceeEEEEEcceEEEec-----CCeEEEEec--------cccccccccccc--cccCCccEE Q lcl|NC_020838. 522 S-F-------------IGKKINNMFFYRNRLGLLS-----NEAVIMSRA--------GDYFNFFANSSQ--VVAPDDPID 572 (981) Q Consensus 522 s-F-------------~g~~ps~v~ffq~RL~f~s-----~~~V~~Srt--------gdy~NF~~~s~~--~~~DdDpI~ 572 (981) + | +...|+|++||.+|++|+. ...|.+|+. -+|+.=++++.. .+.|.|-.. T Consensus 334 t~~n~k~~a~~~~~~~~~~r~r~~~~yaGRVfyaD~dkngk~rIlFSqLv~sl~di~nCYQdaDPTSeee~DLIdTDGg~ 413 (911) T protein:vir:31 334 TVNNLKLIAEGTVRWTVKDRPRCSGYHNGHVYFGDRDKNGKTRILVSQLVNSLDNIPKCFQDADPTAEEINDLIATDGFT 413 (911) T ss_pred chhceeeeeccceeeeecccccceeeeccEEEEeeeccCcceeEEEEeeccccccccccccCCCccccccchhhhcCCcE Confidence 0 0 1245899999999999995 237999975 456666665422 345678888 Q ss_pred EEEcCCCceeEEEEeecCCcEEEEecCcEEEEEcCC-ccccccceEEEEEeeeccccCCCcEEeCCeEEEEecCCCeeEE Q lcl|NC_020838. 573 LQATSVKPVTLNYTLATSIGLLVFGPNEQFVLSTDA-DILSPTTTKINTISTFECDAEIDAVAVGTTQAFISKSNLYSKL 651 (981) Q Consensus 573 ~~i~s~~~n~I~~~v~~~~~L~l~T~g~q~~l~g~~-~~LTP~~~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~v 651 (981) +.+... ..|+-+|.+++.|++|..++-|.|.|.+ ...|.++..|.+++..+|++.=.=|++|+.++|-++.| | T Consensus 414 vri~ga--h~Ii~LV~~G~sLlVFcaNGVWAI~G~d~~g~TATdy~ItKIsdvGcsspNSVVvVgn~i~fWSd~G----I 487 (911) T protein:vir:31 414 MYPVGM--GAPITMVEFNKRLLLLCTNGVWAIRGTSGGGATATDFTLDKVASVEFNSPQSVVDIGTAIVFWSERG----I 487 (911) T ss_pred EecCCC--CCceEEEEecCeEEEEEeCcEEEEeccCCCceeeeeeEEEEEeeeeeCCCCeEEEecCceEEeeCCc----E Confidence 877665 4588899999999999999999998765 57999999999999999998888899999999999987 5 Q ss_pred EEEeecccccceehhhHH-HHHHHhcCC----CeEEEE--EcCCCcEEEEEecC---CcEEEEEEe--ecCcchheeeeE Q lcl|NC_020838. 652 FLMLNVQKEAAATIDEAT-TNVPEYVPS----DIDSMT--ASPAMSIVSLGKSG---SNTVYQHRF--FMQGENRVQTWY 719 (981) Q Consensus 652 re~~y~~~~d~~~a~DlS-~~~~h~~~~----~i~~~~--~s~~~~~~~~~~~g---~~~l~~y~y--~~~~eq~V~aWs 719 (981) ..+...+ ..-+.|+.|| ..+..|... .+...+ +-..+..++|.-.+ ....+.+.= ...-+-+.++|- T Consensus 488 yaLganq-fnD~tAnNLTesTIQ~y~d~I~~dkIkNVtgtyd~de~rVyW~yPn~lDe~teykt~~~~ILVfdLatgaFY 566 (911) T protein:vir:31 488 IAIGVND-FGDLTSNNLTENTIDEYYDSLDRDIIKNVKGTFINDENRVYWVVPNKQDSNGEYKTDGELVLVLNLDTGGFY 566 (911) T ss_pred EEEeecc-cCccccccccHHHHHHHHhhcChhhhceEEEEEEccCCEEEEEecCccCCccceeecCceEEEEEeccCccc Confidence 5555544 4557888888 567777643 222222 22233344443331 223333210 000022457999 Q ss_pred eeccCCc-eEEEEEeCCeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCcccccccceeeeeeeeeccCCceEEEccc Q lcl|NC_020838. 720 KWQLTGD-LRLQFFDKTTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFNVNPYRTYSTSTKKTTVNLP 798 (981) Q Consensus 720 rw~~~G~-v~sv~~~~d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~vd~~~ty~~~~~~tt~~l~ 798 (981) +|...+. ++...+. =+++-.....+-..+ +. ...+..-.+.+.... +.++.-+. ++. ...++ T Consensus 567 Pwtvs~gpLl~~p~y-----~Lv~TreEvtvPi~~-et---gaiIve~gsdPV~~t---l~vdttGv-Dg~----ayLl~ 629 (911) T protein:vir:31 567 KHTVSGGPLLHAPFR-----RLVNTRAEVSIPITE-TD---GTVITDTLGDPVTVT---RTVTTTGV-DGL----AYFAS 629 (911) T ss_pred ceeeecceeeccccc-----ccccccccceeeEEe-ec---ceEEEecCCCCeEEE---Eeeecccc-cce----eEEEe Confidence 9976433 3322211 111111111110000 00 000100001111000 00000000 000 00001 Q ss_pred CCCcCCceEEEEEcCcccCceeEEEEecCceEEEccccccC--------CCEEEeCCCCCCCEEEEEEeeeEEEEeCCce Q lcl|NC_020838. 799 FDHITGKKLAVVAIGTYIGDTISATSESEGSVFYFEDSDIS--------SNQVLLNGDYRGRDLIIGYVYDMELELPTLY 870 (981) Q Consensus 799 ~~~l~g~~v~v~adG~~~~~~~~~~~~~dG~~~~~~~~tv~--------gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~ 870 (981) ..+..--++..+| .-..+|-.=+-..+..+ .-.-..+++. .=.+-+||...+-++-+. T Consensus 630 frdg~~g~~~f~a-----------~~~~~~~~dw~~~~~~~~~~y~s~~~~~y~~~~~~---~~~~~~pyi~sy~~~~~r 695 (911) T protein:vir:31 630 FDDGVNGQFNFIA-----------EHQPWGFADWANVPNMTRVNYSSYVDFAYEYPEVM---IGNISLPYIHSYYLTGIR 695 (911) T ss_pred eccCCcceEEEEE-----------eecCCeeeccccCccccccchhHHHHhhhhhhhhh---hhcccCceeeeeeeeeeE Confidence 1111111111111 11111110000000000 0000000000 012235666655444444 Q ss_pred eeccCCCcccc------------eEeeeEEEEEEEEEeecccceEEEE---ccCCCCceeeEeccc------------cc Q lcl|NC_020838. 871 PTQVEGRSSVS------------DVTSDLILHRLKVSTGLSGPITYKV---DITGKDEWTNIINVT------------LP 923 (981) Q Consensus 871 i~~~~g~~~~~------------~~~grl~l~r~~v~~~~Sg~~~v~v---~~~~~~~~~~~~~~~------------~~ 923 (981) ++...-...++ ..-|.+..|++.+- -|-++.+.- ++.-|.+.....++. .+ T Consensus 696 v~~~~y~~~~a~~~f~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~vVNGDAE~GtmTGWtvtaG 773 (911) T protein:vir:31 696 VQTEQYTTETAHLSFHRVQAHQTTALGTVTFHKVDMM--VSTGMQVISFHKDDLLRTEAVTLVNPDAETGDATGWTVTAG 773 (911) T ss_pred EeccceeeecccceeEeeecccceeeeeeeeeeeeeh--hhccceeeeeccccceeeeeeEEEcCCCCCCCCCcceeecc Confidence 43221111110 11122333433221 122222210 000010100000000 00 Q ss_pred CccccCccccccCCeEEEEeeccCcceEEEEEECCCCCEEEEEEEEE-EEeeccccccC Q lcl|NC_020838. 924 NTYVLNNVNLSASALHDVPIYQRNENVNIKIIGDTPFPISLLNIVWE-GNYNRRFYRRS 981 (981) Q Consensus 924 ~~~~~~~~p~~~~~~~~vp~~g~~~~~~v~I~~~~PlPltvlsi~we-g~y~~r~rRr~ 981 (981) .-+..-..|+..+..+--| +. +++-.+.|+-- |..=-.|.-- ..|+=-.-++. T Consensus 774 ~~d~~Ta~p~~rGSyfFa~--~n--n~n~aL~QDID-SagaaaIDAG~v~ynvSawl~g 827 (911) T protein:vir:31 774 TLDVRTAAPLYQGSYYFWS--DS--NANFAAYQDID-PVGGGYITAGELANNVIEAKLS 827 (911) T ss_pred chhhccCCchhcceEeEcC--CC--Ccchhhheecc-ccccceeeeccchhhhhhhhhh Confidence 0000011122222211111 11 11111222111 1111111100 00000011111 No 43 >protein:vir:108312 Length: 458 # NCBI annotation: hypothetical protein # Family: family:all:1540 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552278;genbank:gi:160700603;genbank:GeneID:5758828 Probab=59.70 E-value=0.39 Score=22.84 Aligned_cols=430 Identities=15% Similarity=0.105 Sum_probs=160.1 Q ss_pred EEEEEeccCCCcceecHHHHHHHHHhhhhccCceEEEEcCCEEEEEeCCCceEEEecCcCcceeEEEEEEEeechhcccc Q lcl|NC_020838. 355 TVTHSTPDTVAGATTDSGSIAAALTSSINALTGFSATQVGPGIYIEGTSAFSISTSGSTTEEGIFAFQDQINVASRLPNQ 434 (981) Q Consensus 355 ~~~~~t~~~~a~~~~~~~~ia~~l~~~i~~~~~~~~~~vg~~i~i~~~~~~~vt~~~g~~~t~~~~~~~~v~~~~~Lp~~ 434 (981) -+..+.|-+...+...... ++.+.+.+....- . -.+-.++........ +.++|.. T Consensus 1 m~~~~ip~gsy~a~~~~~d-aq~~VN~yp~~~e--~--g~ss~~l~~tPGl~~--------------------f~~~~~~ 55 (458) T protein:vir:10 1 MVQRQIPLVATTAEGDVSG-QEILVNVYPRKSD--G--GKYPFTLRHTPGLAF--------------------FCELPTF 55 (458) T ss_pred Cceeeeceeeeeccccccc-ceeeeeeeeeccc--c--cccccceEecCCcee--------------------eecCCCC Confidence 0111111100000000000 0000000000000 0 000001111111000 1122221 Q ss_pred CC------CCcEEEEEccCCCcccceEEEEEcccCCcccceEEEEe---eccceeEEEccccceEEEEeccCCceeeecc Q lcl|NC_020838. 435 CE------NGYRVRVTNSGDVTADDIYVEFQTTNSAARGPGVWEET---IGPSLEFEIDETTMPHQLIRQANGVFKYEPV 505 (981) Q Consensus 435 ~~------~G~~v~v~~~g~~~~d~yyv~~~~~~~~~~g~~~W~E~---a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~ 505 (981) .. +|....+.+. .-|-|..+. +.+|. ++-+.+...+.++ ...+.+...-|.++-. T Consensus 56 ~~~g~~~~~g~ly~v~g~-----~LY~V~~~~---------~~~~iG~i~gsg~VsMa~ng~--q~vi~~G~~gY~yd~a 119 (458) T protein:vir:10 56 PVMAMHQNGSRAFAVTPR-----DMYEISKDG---------TYKRLGSVDFKGRVVMEDNGK--QIVMVDGEKGYYYDSE 119 (458) T ss_pred ceeeEEecCCEEEEeeCc-----eEEEEeCCc---------eEEEEecccCceeEEEeeCCc--EEEEEECCeEEEEeec Confidence 11 1111111111 011111100 11111 1122233333322 1111111111111100 Q ss_pred cCCcccCCCcccCcCccccCCceeEEEEEcceEEEec--CCeEEEEeccccccccccccccccCCccEEEEEcCCCceeE Q lcl|NC_020838. 506 TWDDRLVGDNTTNPIPSFIGKKINNMFFYRNRLGLLS--NEAVIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTL 583 (981) Q Consensus 506 ~w~~r~~GDd~tnp~psF~g~~ps~v~ffq~RL~f~s--~~~V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I 583 (981) .+.- ..+-.+.|.+ +..|.|...|++|.. +..++.|-..| .-=||++++-+.++++.| T Consensus 120 t~~~------~~i~d~~~~~--~~~v~~~dGy~V~~~~g~~~~~is~L~d------------~s~d~l~fa~Ae~~pD~i 179 (458) T protein:vir:10 120 TEIV------QEIKAEGFYP--ASTVTYQDGYFIFDRKGTGQFFISELLD------------VAFDPLDFATAEGQPDPL 179 (458) T ss_pred ccEE------EeccCccccC--cceEEEeCcEEEEEeeCCCEEEEEecCc------------ceeCcceeeeecCCCCce Confidence 0000 0011122332 789999999999874 34566674433 115699998899999999 Q ss_pred EEEeecCCcEEEEecCcE--EEEEcCCc-cccccceEEEEEeeeccccCCCcEEeCCeEEEEecCCCeeEEEEEeecccc Q lcl|NC_020838. 584 NYTLATSIGLLVFGPNEQ--FVLSTDAD-ILSPTTTKINTISTFECDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQKE 660 (981) Q Consensus 584 ~~~v~~~~~L~l~T~g~q--~~l~g~~~-~LTP~~~~i~~~S~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~ 660 (981) .-++.+.+.|++|.+..- |..+|..+ ++-.... ... ..+|++.-.=..+|++++|++..+ .|+.+ T Consensus 180 v~i~~~~~~i~~fG~~TiEvw~ntG~a~fpy~r~~g-a~i--~~Gcaa~~sv~~~~~t~~~l~~d~---~Vy~l------ 247 (458) T protein:vir:10 180 LAVLSDHREVFMFGQETIEVWYNSGAADFPFERNQG-AFI--EKGIGAPYSVAKTNNTVYFIGSDL---MIYQI------ 247 (458) T ss_pred EEEEeeccEEEEEeccceEEEEecCCCCcceeeccc-cee--eecccCcchhhhhCceEEEEcCCe---EEEEe------ Confidence 999999999999987765 77787643 2111110 111 346776655688999999999865 35544 Q ss_pred cceehhhHHHH-HHHhcCC-Ce---EEEEEcCCCcEEEEEe-cCCcEEEEEEeecCcchheeeeEeeccCCceEEEEEeC Q lcl|NC_020838. 661 AAATIDEATTN-VPEYVPS-DI---DSMTASPAMSIVSLGK-SGSNTVYQHRFFMQGENRVQTWYKWQLTGDLRLQFFDK 734 (981) Q Consensus 661 d~~~a~DlS~~-~~h~~~~-~i---~~~~~s~~~~~~~~~~-~g~~~l~~y~y~~~~eq~V~aWsrw~~~G~v~sv~~~~ 734 (981) ++|++.-+|-| +++.+.. .+ ....++.+-..+.+++ .+...-++|+ .. -.-||.-+..+ T Consensus 248 ~g~~~~rIST~aIE~~i~sy~~~da~a~t~~~eGH~fy~LtfP~a~~Tw~yD----~~--t~~Wher~Sg~--------- 312 (458) T protein:vir:10 248 TGYTPVRISTHAVEQTLKGVNLSDAFAYTYQSEGHLFYVLTIPGKNLTWCYD----IS--SGSWHVRQSYQ--------- 312 (458) T ss_pred cCceeEEeeCHHHHHHHhcCChhheEEEEEEecCeEEEEEECCCCCceeEEe----cc--cccceeeccCC--------- Confidence 46666666544 2333321 11 1111111221111111 1112223332 11 11254422110 Q ss_pred CeEEEEEEcCCcEEEEEEEeecCCceeEEEecCCCcccccccceeeeeeeeeccCCceEEEcccCCCcCCceEEEEEcCc Q lcl|NC_020838. 735 TTFYAVTSSGSNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFNVNPYRTYSTSTKKTTVNLPFDHITGKKLAVVAIGT 814 (981) Q Consensus 735 d~ly~vv~r~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~vd~~~ty~~~~~~tt~~l~~~~l~g~~v~v~adG~ 814 (981) +|.+...+...+++ +..+ | ++. T Consensus 313 ----------------------------------------~~~~Ra~~~v~~~g---~~~v--G-D~~------------ 334 (458) T protein:vir:10 313 ----------------------------------------FDRHVSNNSIYFDQ---KTLV--G-DFQ------------ 334 (458) T ss_pred ----------------------------------------CCceEEEEEEEeCC---eEEE--E-EcC------------ Confidence 01111111111110 0000 0 111 Q ss_pred ccCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeEEEEEEEE Q lcl|NC_020838. 815 YIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDLILHRLKV 894 (981) Q Consensus 815 ~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v 894 (981) +|.+..+....-+ + -|-+++..+.++.+ ++. ..||+++++.| T Consensus 335 ------------ng~ly~ld~~~~t--------d-------~g~~i~~~~~~p~~-----~~~------~~rl~~~~~el 376 (458) T protein:vir:10 335 ------------NGRIYIMADNYYT--------D-------DGDPVVREFILPVV-----NNG------REFLTVDSLEL 376 (458) T ss_pred ------------CCeEEEEcccCcC--------C-------CCceeeeeeeccce-----eCC------CCeEEEEEEEE Confidence 1212111111000 0 13344444444332 110 13555666555 Q ss_pred EeecccceEEEEccCCCCce--eeE--ecccccCcc----ccCccccccCCeEEEEeeccCcceEEEEEECCCCCEEEEE Q lcl|NC_020838. 895 STGLSGPITYKVDITGKDEW--TNI--INVTLPNTY----VLNNVNLSASALHDVPIYQRNENVNIKIIGDTPFPISLLN 966 (981) Q Consensus 895 ~~~~Sg~~~v~v~~~~~~~~--~~~--~~~~~~~~~----~~~~~p~~~~~~~~vp~~g~~~~~~v~I~~~~PlPltvls 966 (981) .+. +|--.+ ..++.++. ... -.+..++.. .+|++=-..+. .++--.|..++--++|+-.+|.|.+|++ T Consensus 377 ~~~-tGvg~~--~~~~~~p~~~l~~S~d~g~~~s~~~~~~~lg~~gey~tr-~~~~rlG~ar~rvf~v~~s~p~~~~l~g 452 (458) T protein:vir:10 377 DLS-SGVGLT--VGQGSDPELRVYFSKDNGNEYSQNFKVGKIGRKGEFLTR-AKVNRFGCARQFTFKVEISDPIPVDIGG 452 (458) T ss_pred EEe-cceeee--eCCCCCceEEEEEeeCCCcccchhHHHhhcCCcchhhhh-hhhhhhccCcceEEEEEEecchhhccee Confidence 432 221111 11111111 000 011112211 11222111111 1122245556666899999999999999 Q ss_pred EEEEEE Q lcl|NC_020838. 967 IVWEGN 972 (981) Q Consensus 967 i~weg~ 972 (981) +..+.+ T Consensus 453 a~~~~r 458 (458) T protein:vir:10 453 AWVEVR 458 (458) T ss_pred eeEEeC Confidence 999988 No 44 >protein:vir:105647 Length: 800 # NCBI annotation: putative tail tubular B protein # Family: family:all:825 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425011;genbank:gi:83571759;uniprot:Q2WC41;genbank:GeneID:3837288 Probab=50.74 E-value=0.6 Score=21.80 Aligned_cols=657 Identities=11% Similarity=0.022 Sum_probs=180.3 Q ss_pred CCceeecchhhhccccccchhhcCCChhhhhhccccccccccccCchhhhhhhhcCCCCCceEEEEEeCCCceEEEEEEc Q lcl|NC_020838. 1 MSTISQRIPNLLLGVSQQPDKLKFPGQVKQATNVFPDYALGLLKRPGGKFEAELYNAEARGRWFPILRDEEEKYVCQYDT 80 (981) Q Consensus 1 m~~v~~s~~~l~~GvSqQ~~~~r~~gQ~~~q~N~~sd~~~Gl~kRp~~~~~~~l~~~~~~~~~~~~~rd~~e~y~~~~~~ 80 (981) +....--++.+.+ ...|+|| .+-|.-+.+.. ..++- .+.-......-..++....+.|+.++.. T Consensus 27 ~~~~~N~~~~~~g------Gl~rRpG--t~fva~l~~~~--~~~~~------~~~~~~~~~~~~~~~~~~~g~~~rv~~~ 90 (800) T protein:vir:10 27 CTTMVNMVPDVVN------GTQSRMG--TTHIAKLLDEG--TDNMA------THHYRRGEGDEEYFFTLKKGQVPEIFDK 90 (800) T ss_pred hhhhhcceeeecc------CcccCCc--ceEEEeecCCC--CCccE------EEEEecCCccceEEEEEEcCCeEEEEec Confidence 2222223333333 4467777 23333232221 00110 1111111111112222223357777764 Q ss_pred CCCcEEEEEcCCCeEEEEeeccccccccccceeeccccceeEEEEeCcEEEEecCccc---------ccccceEEEEcCC Q lcl|NC_020838. 81 TDGQFRIWSLIDGQPRAVDMGTTAATGQPSGCNITNLKSDLDVYNTAQDDTDTKLNDL---------NSKQATYTKTNDG 151 (981) Q Consensus 81 ~~g~~~v~d~~~g~~~~v~~~~~~~~~~~~y~~~~~~~~~l~~~tv~d~t~i~n~~~~---------~~~~~~~~~~r~~ 151 (981) +|....-...++..-. +..+.+++++|++++++|||||+|++.. .++..+++++|+| T Consensus 91 -~G~~~~v~~~~~~~~~-------------~~~~~~~~~~l~~~tvaD~tfi~n~~~~~~~~~~~~~~~~~~~~~~vr~g 156 (800) T protein:vir:10 91 -HGRKCNVISQDAPMTY-------------LSEVVNPREDVQFMTIADVTFMLNRRKVVKVSNRKSPKVGDKAIVFCAYG 156 (800) T ss_pred -CCcEEEeecCCcceee-------------eeccCCchhhEEEEEEcCEEEEecCcccccccccCCCCCCceEEEEEecc Confidence 5655555555554432 2235677889999999999999998644 3344589999999 Q ss_pred cccccceeeEEEEEEeeeeeeeeeccCccccccCCceeEEEeeccccceecccccccccc---------ccCC------- Q lcl|NC_020838. 152 QTATKVNLFDVDVTYKNGYYEESLKSGVLERIDNGQRIVKDDGTNAGSIAAGSAMPSGYS---------LGNE------- 215 (981) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~------- 215 (981) +|++++++ .+ +++....|..+....+ ..+...++++|+..+....... ..+. T Consensus 157 ~y~~~y~i---~i---~g~~~~~~~t~~~~~~------~~~~~~s~~~i~~~L~~~l~~~~~~~~~t~~~~g~~i~i~~~ 224 (800) T protein:vir:10 157 QYGTSYSI---II---NGTTAASFKTPDGGSA------EHVEQIRTERITSELYSKLQQWSGVNDYEIQRDGTSIFIERR 224 (800) T ss_pred ccccceeE---Ee---ccceEEEEEecCCCcc------cccccccHHHHHHHHHhhhhhcCcccceEEEEcCcEEEEEEe Confidence 99988754 22 2333333443332211 1223334444443322111000 0000 Q ss_pred cccccceEeecceeEeeeeeeeEEeeCceeeEeeeccCCcCcCceEEEEEccccc--ceE--EecceEEEEeeeeccc-- Q lcl|NC_020838. 216 RTDDYPWFKRDGYRVYEVEKEVAAAYNSTELSTANTNMGTAQTAYDNAVSTESTE--KGD--YDSEVTACNIGSSNIP-- 289 (981) Q Consensus 216 ~~~~~~~~~~~g~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~--~~~--~~~~g~~~~~~~~~~~-- 289 (981) .-.++.+.+.+|+..........++.....+.. .......+++.+....+ .|. |++.+....++..... T Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~Lp~-----~~~~g~~~~i~~~~~~~~~~y~~~~~~~~~~~~~w~e~~~~~ 299 (800) T protein:vir:10 225 DGKSFTVTTTDGAKGKDLVAIKNKVSSTDLLPS-----RAPAGYKVQVWPTGSKPESRYWLQAEPKEGNLVSWKETIAAD 299 (800) T ss_pred cCCceEEEEeecCCcceEEEEEeeccceeeccc-----cCCCCceEEEEcCCCCCCceeEEEEEeccccceEEEeecccC Confidence 001122223333322211111111111111000 01112233344332222 122 2221111111111100 Q ss_pred -ceeecccCCcceeeEEEEc--CEEEEe-CCceeEeccCCCCCCCCc--e-----EEEEEeeeccceEEEEeeCceEEEE Q lcl|NC_020838. 290 -ASAYLKDAAPEDIEILTIN--DYTFVL-NKNKTTAMKTTTSAAVPN--V-----AFVVIRIVAYNSDYSVTLNGTTVTH 358 (981) Q Consensus 290 -~~~y~~~~~~~dl~~~t~a--d~tfi~-n~~~~~~~~~~~~~~~~~--~-----~~v~v~~g~y~~~~~v~~ng~~~~~ 358 (981) ....-...-|..|....+. +-+|-. ..+...+..++...++.. . .++ ....|-+.-.+-..+.++.. T Consensus 300 ~~~~~~~~tmp~~lv~~~~~~~~~~~~~~~~~w~~r~~gd~~tnp~psf~~~~~~~~i--~~v~f~q~RL~f~~~~~v~~ 377 (800) T protein:vir:10 300 VLLGFDKGTMPYIIERTGIIDGIAQFKIRQGDWEDRKVGDDLTNPMPSFIDEEVPQTI--GGMFMVQNRLCFTAGEAVIA 377 (800) T ss_pred ceeeeecccccEEEEEeeeeecceeEEEEeccccccccCCCCCCCCchhcCCCCCCCc--eeEEEEeeeEEEeeCCeEEE Confidence 0111112222222111111 111111 111111111111111110 0 001 00111111111112222222 Q ss_pred Eec-------cCCCcceecHHHH--------HHHHHhhhhccCceEEEEcCCEEEEEeCCCceEEEecCcCcceeEEEEE Q lcl|NC_020838. 359 STP-------DTVAGATTDSGSI--------AAALTSSINALTGFSATQVGPGIYIEGTSAFSISTSGSTTEEGIFAFQD 423 (981) Q Consensus 359 ~t~-------~~~a~~~~~~~~i--------a~~l~~~i~~~~~~~~~~vg~~i~i~~~~~~~vt~~~g~~~t~~~~~~~ 423 (981) .-+ ...+....+.+.| ...|.-.+....++-+--.|.-..|.+.+. ++. .+-.+.... T Consensus 378 Srtgd~~nF~~~t~~~~~DdD~I~~~~ss~~~~~i~~~v~~~~~L~l~T~g~q~~l~g~~~--lTP----~~~~i~~~s- 450 (800) T protein:vir:10 378 SRTSYFFDFFRYTVISALATDPFDIFSDASEVYQLKHAVTLDGATVLFSDKSQFILPGDKP--LEK----SNALLKPVT- 450 (800) T ss_pred EccCCccccccccccCCCCCccEEEEEcCCcceeeeeEeecCCcEEEEecCcEEEEeCCCc--ccc----eeEEEEEEE- Confidence 110 0000000000100 001111111101111111122222222111 110 000011111 Q ss_pred EEeechhccccCCCCcEEEEEccCCCc-ccceEEEEEcccC---------Cc--ccce-EEEEeeccceeEEEccccceE Q lcl|NC_020838. 424 QINVASRLPNQCENGYRVRVTNSGDVT-ADDIYVEFQTTNS---------AA--RGPG-VWEETIGPSLEFEIDETTMPH 490 (981) Q Consensus 424 ~v~~~~~Lp~~~~~G~~v~v~~~g~~~-~d~yyv~~~~~~~---------~~--~g~~-~W~E~a~~~~~~~~~~~Tmp~ 490 (981) +..--+++++..-.+-++=+...+..+ --.|+..+..... .. .+.+ .+.-...++... T Consensus 451 ~~~~~~~~~Pv~vG~~v~Fv~~~g~~s~vre~~~~~~~d~~~a~DlT~~~~hl~~~~v~~~~~~~~~~~~v--------- 521 (800) T protein:vir:10 451 TFEVNNKVKPVVTGESVMFATNDGSYSGVREFYTDSYSDTKKAQAITSHVNKLIEGNITNMAASTNVNRLL--------- 521 (800) T ss_pred eeeccCCCCceEeCCeEEEecCCCCeeEEEEEeeeecccceehhhHHhHHHHhcCCceEEEEEeCCCCeEE--------- Confidence 001112333333333333333332211 1112111111100 00 0000 000000111111 Q ss_pred EEEeccCC-ceeee---------cccCCcccCCCcccCcCccc-cCCceeEEEEEcceEEEec--CCeEEEEeccccccc Q lcl|NC_020838. 491 QLIRQANG-VFKYE---------PVTWDDRLVGDNTTNPIPSF-IGKKINNMFFYRNRLGLLS--NEAVIMSRAGDYFNF 557 (981) Q Consensus 491 ~lv~~a~g-~f~~~---------~~~w~~r~~GDd~tnp~psF-~g~~ps~v~ffq~RL~f~s--~~~V~~Srtgdy~NF 557 (981) .++...++ -+-++ ...|..- .| .+..+.++.+=++.|+++= ...+|+-|...-.++ T Consensus 522 ~~~~~~~~~l~~~~yl~~~~e~~~~aW~~w-----------~~~~~~~~~~~~~~~d~l~~iv~r~~~~~ier~~~~~~~ 590 (800) T protein:vir:10 522 VTTDKYRNIIYCYDWLWQGTDRVQSAWHVW-----------EWPMGTKVRGMFYSGELLYLLLERGDGVYLEKMDMGDAL 590 (800) T ss_pred EEEEcCCCeEEEEEEeecCCceEEEEEEEE-----------EcCCCcEEEEEEEeCCeEEEEEECCCcEEEEEEecccCc Confidence 11111111 11111 1124321 22 2346777777789998873 456777775221111 Q ss_pred cccccccccCCccEEE----EEc-----CCCce-eEEEEeecCCcEEEEecCcEEEEEcCCccc-----cccceEEEEEe Q lcl|NC_020838. 558 FANSSQVVAPDDPIDL----QAT-----SVKPV-TLNYTLATSIGLLVFGPNEQFVLSTDADIL-----SPTTTKINTIS 622 (981) Q Consensus 558 ~~~s~~~~~DdDpI~~----~i~-----s~~~n-~I~~~v~~~~~L~l~T~g~q~~l~g~~~~L-----TP~~~~i~~~S 622 (981) +.. ..+.--+|. ... ..... .+-|.....+.|.+....+.-.+.+.. ++ +-..+.+.... T Consensus 591 ~~~----~~~~~~lD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-v~~~~~~~~g~~~~~~~~ 665 (800) T protein:vir:10 591 TYG----LNDRIRMDRQAELIFKHFKAEDEWISEPLPWTPTNPELLDCILIEGWDSYIGGS-FLFKYKPSDNTLSTTFDM 665 (800) T ss_pred ccc----ccceeeeecceeecccccccCcceEEEeccccccCCcceEEeeeccceeecCce-eEEEEEecCCceEeeeee Confidence 110 000000110 000 00000 001111112233333332221111110 10 11111111111 Q ss_pred eeccccCCCcEEeCCeE--------EEEecC-C-----CeeEEEEEeeccccc-ceehh--hHH-HHHHHhcCCCeEEEE Q lcl|NC_020838. 623 TFECDAEIDAVAVGTTQ--------AFISKS-N-----LYSKLFLMLNVQKEA-AATID--EAT-TNVPEYVPSDIDSMT 684 (981) Q Consensus 623 ~~~~s~~v~Pv~vG~~v--------~Fv~~~-g-----~~s~vre~~y~~~~d-~~~a~--DlS-~~~~h~~~~~i~~~~ 684 (981) . .+......|.||-.- ++++.. | ++.+|+++......- .+... +.. ...+++...+ ... T Consensus 666 ~-~g~~~~~~v~VGl~Y~s~~~~~~~~i~~~~g~~~~~~r~~i~r~~~~~~~sg~~~~~v~~~~~~~~~~~~~~~--~~~ 742 (800) T protein:vir:10 666 H-DDNHVKAKVIVGQIYPQEFEPTPVVIRDRQDRVSYIDVPVVGLVHLNLDMYPDFSVEVKNVKSGKVRRVLASN--RIG 742 (800) T ss_pred c-CCCcccceEEEeeeeeEEEeecceEEEcCCCcccccCCeEEEEEEEEeecCceEEEEeccCcccceeEEccCC--eec Confidence 0 111122233333110 111111 0 122566665555422 22221 110 0111111000 000 Q ss_pred EcCCCcE-EEEEecCCcEEEEEEeecCcchheeeeEeeccCCceEEEEEeCCeEEEEEEcCCcE Q lcl|NC_020838. 685 ASPAMSI-VSLGKSGSNTVYQHRFFMQGENRVQTWYKWQLTGDLRLQFFDKTTFYAVTSSGSNV 747 (981) Q Consensus 685 ~s~~~~~-~~~~~~g~~~l~~y~y~~~~eq~V~aWsrw~~~G~v~sv~~~~d~ly~vv~r~~~~ 747 (981) ..+.+.. ..-...|.-.+-++. |+.+.+|.==+...++=+|+++.-.+. ..+|.-++ T Consensus 743 g~~~~~~g~~~~~tg~~~vp~~g--~~~~~~v~i~~d~P~P~tvlai~~eg~----y~~r~~rv 800 (800) T protein:vir:10 743 GALNNTVGYVEPREGVFRFPLRA--KSTDAVYRIIVESPHTFQLRDIEWEGS----YNPTKRRV 800 (800) T ss_pred cccccccCcccccCceEEEEEec--cCceeEEEEEECCCCcEEEEEEEEEEE----eecccccC Confidence 0000000 000001111111111 011111100011111112333222111 11111111 No 45 >protein:vir:2109 Length: 472 # NCBI annotation: head completion protein # Family: family:all:1540 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059633;genbank:gi:9635541;genbank:GeneID:1262840 Probab=44.71 E-value=0.8 Score=21.13 Aligned_cols=422 Identities=10% Similarity=0.036 Sum_probs=144.9 Q ss_pred HHHHHHhhhhccCceE-EEEcCCEEEEEe-CCCceEEEecCcCcce-eEEEEEEEeechhccccC-------CCCcEEEE Q lcl|NC_020838. 374 IAAALTSSINALTGFS-ATQVGPGIYIEG-TSAFSISTSGSTTEEG-IFAFQDQINVASRLPNQC-------ENGYRVRV 443 (981) Q Consensus 374 ia~~l~~~i~~~~~~~-~~~vg~~i~i~~-~~~~~vt~~~g~~~t~-~~~~~~~v~~~~~Lp~~~-------~~G~~v~v 443 (981) +. ..++.-+.+.. ....++++ +. |-+..-+.+++..... +++.-.-+.. ..+|... .+|..-.| T Consensus 1 m~---~~q~Pl~~g~~~~~~~~d~~--~~~pVN~~a~~~~~~~s~~~lr~tPG~~~~-~~~~g~~RG~~~~t~~~~ly~V 74 (472) T protein:vir:21 1 MP---IQQLPMMKGMGKDFKNADYI--DYLPVNMLATPKEILNSSGYLRSFPGITKR-YDMNGVSRGVEYNTAQNAVYRV 74 (472) T ss_pred Cc---eEEeecccccccccccccee--eeeeeeeeeeccCCcccceeeeecCCccee-ccCCCceeeeeecccCCeEEEE Confidence 00 01111112211 11223322 11 1111112333322221 2221111110 1112110 01222222 Q ss_pred EccCCCcccceEEEEEcccCCcccceEEEEeeccceeEEEccccceEEEEeccCCc--eeeecccCCcccCCCcccCcCc Q lcl|NC_020838. 444 TNSGDVTADDIYVEFQTTNSAARGPGVWEETIGPSLEFEIDETTMPHQLIRQANGV--FKYEPVTWDDRLVGDNTTNPIP 521 (981) Q Consensus 444 ~~~g~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~--f~~~~~~w~~r~~GDd~tnp~p 521 (981) .+. .-|.+..+ .+ +.++-+.+.-....+.... ...+.. |....-.+.-. ++|.+ T Consensus 75 ~G~-----~LY~v~~~--~G---------~i~gsgrVsMa~n~~~~~v--~~~~~~~~Y~~~~~~~t~~------~~~~d 130 (472) T protein:vir:21 75 CGG-----KLYKGESE--VG---------DVAGSGRVSMAHGRTSQAV--GVNGQLVEYRYDGTVKTVS------NWPAD 130 (472) T ss_pred eCC-----ceEEEeee--ee---------eecccccEEEeeCCeEEEE--EECCceeEEEEecchhhhh------cccCc Confidence 211 12323221 00 1112222222222222111 111111 22222111111 11211 Q ss_pred -cccC---CceeEEEEEcceEEEecCC--eEEEEeccccccccccccccccCCccE-EEEEcCCCceeEEEEeecCCcEE Q lcl|NC_020838. 522 -SFIG---KKINNMFFYRNRLGLLSNE--AVIMSRAGDYFNFFANSSQVVAPDDPI-DLQATSVKPVTLNYTLATSIGLL 594 (981) Q Consensus 522 -sF~g---~~ps~v~ffq~RL~f~s~~--~V~~Srtgdy~NF~~~s~~~~~DdDpI-~~~i~s~~~n~I~~~v~~~~~L~ 594 (981) .|.+ ..+-.|+|+..|++|..+. ..+-|-.-|-+.. |.. .++-+.++++.|.-++.+.+.|+ T Consensus 131 ~~f~~~dl~~~~dv~f~dGyfV~~~~gt~~f~is~l~d~~~~-----------~~y~~FatAE~~pD~Iv~i~~~~~~l~ 199 (472) T protein:vir:21 131 SGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHP-----------DRYSAQYRAESQPDGIIGIGTWRDFIV 199 (472) T ss_pred cccccccccceeEEEEecceEEEccCCcceeEEecCCCCccc-----------cCCccceeeccCCCceEEEEeeccEEE Confidence 2222 2345799999999888633 2333444432211 111 14567778888998999999999 Q ss_pred EEecCcE--EEEEcCCccccccceEEEEEe----eeccccCCCcEEeCCeEEEEecCCCeeEEEEEeecccccceehhhH Q lcl|NC_020838. 595 VFGPNEQ--FVLSTDADILSPTTTKINTIS----TFECDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEA 668 (981) Q Consensus 595 l~T~g~q--~~l~g~~~~LTP~~~~i~~~S----~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~Dl 668 (981) +|.+..- |..+|.. ++..+-..+++ ..+|++.-.=..+|++++|++..+..- ......++|+++-+ T Consensus 200 lfG~~TiEvw~ntG~a---d~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~s~~~l~~~~~g~-----~~V~~~~g~qa~rI 271 (472) T protein:vir:21 200 CFGSSTIEYFSLTGAT---TAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGA-----PSVYIIGSGQASPI 271 (472) T ss_pred EEeccceEEEEecCCC---CcCcCceEEcCcceeeecccCcchhhecCceEEEEecCCCcc-----cEEEEccCceeEEe Confidence 9987766 7777754 33334444443 456776666689999999998876310 11223456777777 Q ss_pred HHH-HHHhcCC-------CeEEEEEc--CCCcEEEEEecCCcEEEEEEeecCcchheeeeEeeccCCc---eEEEEEe-C Q lcl|NC_020838. 669 TTN-VPEYVPS-------DIDSMTAS--PAMSIVSLGKSGSNTVYQHRFFMQGENRVQTWYKWQLTGD---LRLQFFD-K 734 (981) Q Consensus 669 S~~-~~h~~~~-------~i~~~~~s--~~~~~~~~~~~g~~~l~~y~y~~~~eq~V~aWsrw~~~G~---v~sv~~~-~ 734 (981) |-| +++.|.. .....+++ .+...++-+-+ .-++|+- ..++.-.-||.-+.++- -+..+++ - T Consensus 272 ST~aIE~~i~~y~~~e~~~A~~~t~~~eGH~fy~LtfP~---~Tw~yD~--at~~~~e~W~~~~sg~~~~~~R~~~~~~~ 346 (472) T protein:vir:21 272 ATASIEKIIRSYTAEEMATGVMETLRFDSHELLIIHLPR---HVLVYDA--SSSQNGPQWCVLKTGLYDDVYRGVDFMYE 346 (472) T ss_pred cCHHHHHHHHhcCCccccceEEEEEEeCCeEEEEEEcCC---eeEEEEc--ccCccCceeeeeccCCCcCceeEEEEEee Confidence 533 3444422 12222323 33333333332 3677752 12222223887776532 3333332 2 Q ss_pred CeEEEEEEcC-CcEEEEEEEeecCCceeEEEecCCCcccccccceeeeeeeeeccCCceEEEcccCCCcCCceEEEEEcC Q lcl|NC_020838. 735 TTFYAVTSSG-SNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFNVNPYRTYSTSTKKTTVNLPFDHITGKKLAVVAIG 813 (981) Q Consensus 735 d~ly~vv~r~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~vd~~~ty~~~~~~tt~~l~~~~l~g~~v~v~adG 813 (981) +.-+++=+.. +.++. +.+..... .+ ....++. + .|+-+.++..+..+.-+ T Consensus 347 ~g~~ivGD~~nG~ly~--L~fd~~~~---~d-----~~~~~~r---~----------------~p~~~~dn~R~fd~eve 397 (472) T protein:vir:21 347 GNQITCGDKSEAVVGQ--LQFDISSQ---YD-----KQQEHLL---F----------------TPLFKADNARCFDLEVE 397 (472) T ss_pred CCeEEEEEcCCCeEEE--EEeccccc---CC-----CcCcEEE---E----------------ccceeCCCCEEEEEeee Confidence 2233333332 22222 22111110 00 0000000 0 01111111111111000 Q ss_pred cc-----cCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEEEEEEe--eeEEEEeCCceeeccCCCc---ccceE Q lcl|NC_020838. 814 TY-----IGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLIIGYV--YDMELELPTLYPTQVEGRS---SVSDV 883 (981) Q Consensus 814 ~~-----~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~VGl~--y~s~v~~~~~~i~~~~g~~---~~~~~ 883 (981) .. ..+.+.-....||..+- .+. .+-.|-+ |..++..-.+-. ....- -.-.. T Consensus 398 ~~~Gv~q~~d~v~L~wSddG~~~~--------~~~---------~~~~g~~g~~~tr~~~~RlG~--~r~~v~f~~r~~~ 458 (472) T protein:vir:21 398 SSTGVAQYADRLFLSATTDGINYG--------REQ---------MIEQNEPFVYDKRVLWKRVGR--IRRLIGFKLRVIT 458 (472) T ss_pred ccCCCCCcCcEEEEEeeccccccc--------cce---------eeccCCccchhcceeeeeeee--cccceeEEEEEEe Confidence 00 00000001111222110 000 0111111 111111111000 00000 00000 Q ss_pred eeeEEEEEEEEEee Q lcl|NC_020838. 884 TSDLILHRLKVSTG 897 (981) Q Consensus 884 ~grl~l~r~~v~~~ 897 (981) ..++.|+.++++++ T Consensus 459 ~~~~~l~g~~~~~E 472 (472) T protein:vir:21 459 KSPVTLSGCQIRLE 472 (472) T ss_pred cCcceeeeeEEeeC Confidence 11122233332222 No 46 >protein:vir:9268 Length: 472 # NCBI annotation: 10 # Family: family:all:1540 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720332;genbank:gi:24371590;genbank:GeneID:955815 Probab=36.33 E-value=1.2 Score=20.19 Aligned_cols=424 Identities=10% Similarity=0.040 Sum_probs=143.5 Q ss_pred HHHHHHhhhhccCceE-EEEcCCEEEEEeCCCceEEEecCcC-cceeEEEEEEEeechhccccC-------CCCcEEEEE Q lcl|NC_020838. 374 IAAALTSSINALTGFS-ATQVGPGIYIEGTSAFSISTSGSTT-EEGIFAFQDQINVASRLPNQC-------ENGYRVRVT 444 (981) Q Consensus 374 ia~~l~~~i~~~~~~~-~~~vg~~i~i~~~~~~~vt~~~g~~-~t~~~~~~~~v~~~~~Lp~~~-------~~G~~v~v~ 444 (981) ++. .+|.=..+.. ..+.++++--. |-+..-+...+.. ...++++-.-+. ++.++... .+|..-.|. T Consensus 1 m~~---~~ipl~~g~~~~~~~a~~~~~~-pvn~y~~~~~~~~ss~~Lr~~pG~~~-~a~~~G~~RG~~~~~~~~~ly~V~ 75 (472) T protein:vir:92 1 MPI---QQLPMMKGMGKDFKNADYIDYL-PINMLATPKEVLDSSGYLRSFPGIAK-RNDVNGVSRGVEYNTAQNAVYRVC 75 (472) T ss_pred Cce---eeccccccccccCccCcceeee-ecccccccccccccccceeeccccee-ecCCCCcccceeeeeeCCeEEEEe Confidence 100 1111001111 11122221000 1010001111111 111221111111 11222221 012122222 Q ss_pred ccCCCcccceEEEEEcccCCcccceEEEEeeccceeEEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCc-cc Q lcl|NC_020838. 445 NSGDVTADDIYVEFQTTNSAARGPGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIP-SF 523 (981) Q Consensus 445 ~~g~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~p-sF 523 (981) +.. -|.+..+- + +.++-+.+.--...+..........--|....-.+.-. ++|.+ .| T Consensus 76 G~~-----Ly~v~~~i--G---------~i~gsgrVsMa~n~~~~av~~~~~~~~Y~~~~~~~t~~------~~~~d~~f 133 (472) T protein:vir:92 76 GGK-----LYKGEAVV--G---------DVAGSGRVSMAHGRTSQAVGVNGQLIEYRYDGAVKTVS------NWPADSGF 133 (472) T ss_pred Ccc-----eEEEEeeE--e---------eccCcccEEEecCCeEEEEEECCceeEEEEecchhhhh------cccCcccc Confidence 211 23222211 1 01122222222222221111111000122221111111 11211 22 Q ss_pred cC---CceeEEEEEcceEEEecC--CeEEEEeccccccccccccccccCCccEE-EEEcCCCceeEEEEeecCCcEEEEe Q lcl|NC_020838. 524 IG---KKINNMFFYRNRLGLLSN--EAVIMSRAGDYFNFFANSSQVVAPDDPID-LQATSVKPVTLNYTLATSIGLLVFG 597 (981) Q Consensus 524 ~g---~~ps~v~ffq~RL~f~s~--~~V~~Srtgdy~NF~~~s~~~~~DdDpI~-~~i~s~~~n~I~~~v~~~~~L~l~T 597 (981) .+ ..+-.|+|+..|++|..+ +.++.|-.-|-+.. |+.+ ++.+..+++.|.-++.+.+.|++|. T Consensus 134 ~~~dl~~~~dv~f~dGyfV~~~~gt~~~~iS~l~d~~~~-----------~~y~~fa~AE~~pD~Ivgi~~~~~~l~lfG 202 (472) T protein:vir:92 134 TQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHP-----------DRYSAEYRAESQPDGIIGIGSWRDFIVCFG 202 (472) T ss_pred ccccccceeEEEEecceEEEccCCCceEEEeccCCcccc-----------ccccccccccCCCCceEEEEeeccEEEEEe Confidence 22 235579999999998863 34555655543322 1111 3556778888888999999999998 Q ss_pred cCcE--EEEEcCCccccccceEEEEEe----eeccccCCCcEEeCCeEEEEecCCCeeEEEEEeecccccceehhhHHHH Q lcl|NC_020838. 598 PNEQ--FVLSTDADILSPTTTKINTIS----TFECDAEIDAVAVGTTQAFISKSNLYSKLFLMLNVQKEAAATIDEATTN 671 (981) Q Consensus 598 ~g~q--~~l~g~~~~LTP~~~~i~~~S----~~~~s~~v~Pv~vG~~v~Fv~~~g~~s~vre~~y~~~~d~~~a~DlS~~ 671 (981) +..- |..+|.. ++..+-..+++ ..+|++.-.=..+|++++|++..+..- ......++|+++-+|-| T Consensus 203 ~~TiEvw~ntG~a---d~~~fpy~r~~g~~i~~Gcaa~~sv~~~~~s~~~l~~~~~g~-----~~V~~~~g~qa~rIST~ 274 (472) T protein:vir:92 203 SSTIEYFSLTGAT---TVGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISHPATGA-----PSVYIIGSGQASPIATA 274 (472) T ss_pred ccceEEEEecCCC---CcCcCceEEcCcceeeecccCcchhhecCceEEEEecCCCcc-----cEEEEccCceeEEecCH Confidence 7766 7777754 33334444443 456776666689999999998876310 11223466777777533 Q ss_pred -HHHhcCC----CeE-----EEEEcCCCcEEEEEecCCcEEEEEEeecCcchheeeeEeeccCCc---eEEEEEe-CCeE Q lcl|NC_020838. 672 -VPEYVPS----DID-----SMTASPAMSIVSLGKSGSNTVYQHRFFMQGENRVQTWYKWQLTGD---LRLQFFD-KTTF 737 (981) Q Consensus 672 -~~h~~~~----~i~-----~~~~s~~~~~~~~~~~g~~~l~~y~y~~~~eq~V~aWsrw~~~G~---v~sv~~~-~d~l 737 (981) +++.|.. .+. ..++..+...++-+- + .-++|+- ..++.-.-||.-..+.- -+..+++ -+.- T Consensus 275 aIE~~i~~y~~~e~~~a~~~s~~~eGH~fy~LtfP--~-~Tw~yD~--at~~~~e~W~~~~sg~~~~~~R~~~~~~~~g~ 349 (472) T protein:vir:92 275 SIEKIIRSYTADELATGVMEALRFDSHELLIIHLP--R-HVLVYDA--SSSQNGPQWCVLKTGLYDDVYRAIDFMYEGNQ 349 (472) T ss_pred HHHHHHHhcCcchhceeeEEEEEecCeeEEEEEcC--C-ceEEEEc--ccCcCCceeeeecCCCcccceeEEEEEeeCCe Confidence 3444432 122 222233333333333 2 5677752 12222223887776532 3333332 2222 Q ss_pred EEEEEcC-CcEEEEEEEeecCCceeEEEecCCCcccccccceeeeeeeeeccCCceEEEcccCCCcCCceEEEEEc---- Q lcl|NC_020838. 738 YAVTSSG-SNVYLTSYDLTQASESGYLTLPTGEKTDVCLDMFNVNPYRTYSTSTKKTTVNLPFDHITGKKLAVVAI---- 812 (981) Q Consensus 738 y~vv~r~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~lD~~~vd~~~ty~~~~~~tt~~l~~~~l~g~~v~v~ad---- 812 (981) +++=+.. +.++....+.....+.. ..++ .+. |.-|.++..+..+.- T Consensus 350 ~ivGD~~nG~ly~l~~~~~t~~~~~----------~~~~---~~~----------------P~~~~dn~R~~d~eve~~~ 400 (472) T protein:vir:92 350 ITCGDKSEAVTGQLQFDISSQYDKQ----------QEHL---LFT----------------PIFKADNARCFDLEVESST 400 (472) T ss_pred EEEEEcCCCeEEEEeccccccCCCc----------ceEE---EEe----------------ceEecCCCEEEEEeeeccC Confidence 3333332 22222211110000000 0000 000 000111111110000 Q ss_pred --CcccCceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEEEEEEe--eeEEEEeCCceeeccCCCc---ccceEee Q lcl|NC_020838. 813 --GTYIGDTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLIIGYV--YDMELELPTLYPTQVEGRS---SVSDVTS 885 (981) Q Consensus 813 --G~~~~~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~VGl~--y~s~v~~~~~~i~~~~g~~---~~~~~~g 885 (981) |... +.+.-....||..+- .+. .+-.|-+ |..++..-.+-. ....- -.-.... T Consensus 401 Gv~q~~-d~v~L~wSddG~~~~--------~~~---------~~~~g~~g~~~tr~~~~RlG~--~r~~v~f~~r~~~~~ 460 (472) T protein:vir:92 401 GVAQYA-DRLFLSATTDGINYG--------REQ---------MIEQNEPFVYDKRVLWKRVGR--IRRLIGFKLRVITKS 460 (472) T ss_pred CCCCcC-ceEEEEeeccccccc--------cce---------eeccCCccchhcceeeeeeee--cccceeEEEEEEecC Confidence 0000 000011111222210 000 0111111 111111111100 00000 0000011 Q ss_pred eEEEEEEEEEee Q lcl|NC_020838. 886 DLILHRLKVSTG 897 (981) Q Consensus 886 rl~l~r~~v~~~ 897 (981) ++.|+.++++++ T Consensus 461 ~~~l~g~~~~~E 472 (472) T protein:vir:92 461 PVTLSGCQIRLE 472 (472) T ss_pred cceeeeeEEeeC Confidence 122333333322 No 47 >protein:vir:105428 Length: 472 # NCBI annotation: gene 8 protein # Family: family:all:1540 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958184;genbank:gi:41057286;genbank:GeneID:2716675 Probab=28.70 E-value=1.7 Score=19.29 Aligned_cols=429 Identities=10% Similarity=0.021 Sum_probs=138.0 Q ss_pred HHHhhhhccCceEE-EEcCCEEEEEe-CCCceEEEecCcCcce-eEEEEEEEeechhccccC-------CCCcEEEEEcc Q lcl|NC_020838. 377 ALTSSINALTGFSA-TQVGPGIYIEG-TSAFSISTSGSTTEEG-IFAFQDQINVASRLPNQC-------ENGYRVRVTNS 446 (981) Q Consensus 377 ~l~~~i~~~~~~~~-~~vg~~i~i~~-~~~~~vt~~~g~~~t~-~~~~~~~v~~~~~Lp~~~-------~~G~~v~v~~~ 446 (981) --..++.=..+... +..++++ .. |-+...+.+++....+ ++..-.- ...+.++..+ .+|..-.|.+. T Consensus 1 m~~~~~pl~~G~~~~~~~~d~~--~~~pVN~~a~~~~~~~s~~~l~~tPGl-~~~a~v~G~~RG~~~~~~~g~lY~V~G~ 77 (472) T protein:vir:10 1 MPIQQLPLMKGVGKDFRNADYI--DYLPVNMLATPKEILNSSGYLRSFPGI-AKRSDVNGVSRGVEYNMAQNAVYRVCGG 77 (472) T ss_pred CCeeeeeeccCceeeccccchh--heeeeeeeeeccCCCcccceeecCCCc-eeeccCCccccceEEEeeCCeEEEEecc Confidence 00011111112111 1222221 00 0010111111111111 1110000 0112233322 11111122111 Q ss_pred CCCcccceEEEEEcccCCcccceEEEEeeccceeEEEccccceEEEEeccCCceeeecccCCcccCCCcccCcCccccCC Q lcl|NC_020838. 447 GDVTADDIYVEFQTTNSAARGPGVWEETIGPSLEFEIDETTMPHQLIRQANGVFKYEPVTWDDRLVGDNTTNPIPSFIGK 526 (981) Q Consensus 447 g~~~~d~yyv~~~~~~~~~~g~~~W~E~a~~~~~~~~~~~Tmp~~lv~~a~g~f~~~~~~w~~r~~GDd~tnp~psF~g~ 526 (981) .-|.+.. .|-+.++-+.+.--+..+..........--|.+....-....-.+|... |.+.-. T Consensus 78 -----~LY~v~~-----------~iGsiag~grVsMa~n~~~~av~~~g~~~~Y~yd~~v~t~~~~~~d~~~--p~~dlg 139 (472) T protein:vir:10 78 -----KLYKGES-----------EVGDVAGSGRVSMAHGRTSQAVGVNGQLVEYRYDGTVKTVSNWPTDSGF--TQYELG 139 (472) T ss_pred -----eEeeeec-----------ceecccCcccEEEecCCcEEEEEECCceeEEEeeccchhhhcccccccc--cccccc Confidence 1122211 1222333333333333333211111111012222111000001112222 222234 Q ss_pred ceeEEEEEcceEEEecCCe--EEEEeccccccccccccccccCCccEEEEEcCCCceeEEEEeecCCcEEEEecCcE--E Q lcl|NC_020838. 527 KINNMFFYRNRLGLLSNEA--VIMSRAGDYFNFFANSSQVVAPDDPIDLQATSVKPVTLNYTLATSIGLLVFGPNEQ--F 602 (981) Q Consensus 527 ~ps~v~ffq~RL~f~s~~~--V~~Srtgdy~NF~~~s~~~~~DdDpI~~~i~s~~~n~I~~~v~~~~~L~l~T~g~q--~ 602 (981) ....|+|+..|++|..+.+ ++.|-.-|-+.+ ++.-.++.+..+++.|.-++.+.+.|++|.+..- | T Consensus 140 ~~~dv~f~dGyfV~~~~Gt~~~~is~l~d~~~~----------~~y~~fa~AE~~pD~Ivgi~~~~~~i~lfG~~TiEvw 209 (472) T protein:vir:10 140 SVRDITRLRGRYAWSKDGTDSWFITDLEDESHP----------DRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYF 209 (472) T ss_pred ceeeeeeecceEEEeccCcceEEEeccCCcccc----------ccccccccccCCCCceEEEEeeccEEEEEeccceEEE Confidence 4678999999999886432 334544442211 2222344577788888889999999999987766 7 Q ss_pred EEEcCCccccccceEEEEEe----eeccccCCCcEEeCCeEEEEecCCC-eeEEEEEeecccccceehhhHHHH-HHHhc Q lcl|NC_020838. 603 VLSTDADILSPTTTKINTIS----TFECDAEIDAVAVGTTQAFISKSNL-YSKLFLMLNVQKEAAATIDEATTN-VPEYV 676 (981) Q Consensus 603 ~l~g~~~~LTP~~~~i~~~S----~~~~s~~v~Pv~vG~~v~Fv~~~g~-~s~vre~~y~~~~d~~~a~DlS~~-~~h~~ 676 (981) ..+|.. +|.-+-..+++ ..+|++.-.=..+|++++|++.... --.|+. .++|+++-+|-| +++.+ T Consensus 210 ~ntG~a---~~~~fpy~r~~g~~iq~Gcaa~~sv~~~~~t~~~l~~d~~g~~~V~~------~~g~~~~rIST~aIE~~i 280 (472) T protein:vir:10 210 SLTGAT---TAGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISNPATGAPSVYI------IGSGQVSPIASASIEKIL 280 (472) T ss_pred EecCCC---CcccCceeecccceeeecccCcchhhecCceEEEEecCCccccEEEE------ccCceeEEecCHHHHHHH Confidence 778764 33333333333 3467766666899999999987521 012333 356677777533 34444 Q ss_pred CC----Ce---EEEEE--cCCCcEEEEEecCCcEEEEEEeecCcchheeeeEe-eccC--C----ceEEEEEe-CCeEEE Q lcl|NC_020838. 677 PS----DI---DSMTA--SPAMSIVSLGKSGSNTVYQHRFFMQGENRVQTWYK-WQLT--G----DLRLQFFD-KTTFYA 739 (981) Q Consensus 677 ~~----~i---~~~~~--s~~~~~~~~~~~g~~~l~~y~y~~~~eq~V~aWsr-w~~~--G----~v~sv~~~-~d~ly~ 739 (981) .. .+ ...++ ..+...++-+- + .-+||+-. -.-||. |-.. | .-+..|++ -+.-++ T Consensus 281 ~~y~~~e~~dA~~~t~~~~GH~fy~LtfP--~-~Tw~yD~~------t~~Wherw~~~~~g~~~~~~Ra~~~~~~~g~~~ 351 (472) T protein:vir:10 281 RSYTADELADGVMESLRFDAHELLIIHLP--R-HVLVYDAS------SSANGPQWCVLKTGLYDDVYRAIDFIYEGNQIT 351 (472) T ss_pred HhcCCccccceeEEEEEeCCeEEEEEEcC--C-ceeEeecc------cccCceeeeeecCCCccCceEEEEEEEeCCeEE Confidence 22 11 12222 23333333333 2 46666421 123554 3221 1 11222221 122222 Q ss_pred EEEc-CCcEEEEEEEe-ecCCceeEEEecCCCcccccccceeeeeeeeeccCCceEEEcccCCCcCCceEEEEEcCcccC Q lcl|NC_020838. 740 VTSS-GSNVYLTSYDL-TQASESGYLTLPTGEKTDVCLDMFNVNPYRTYSTSTKKTTVNLPFDHITGKKLAVVAIGTYIG 817 (981) Q Consensus 740 vv~r-~~~~~l~~~~~-~~~~~~~~~~~~~~~~~~~~lD~~~vd~~~ty~~~~~~tt~~l~~~~l~g~~v~v~adG~~~~ 817 (981) +=+. ++.++...... .+..+ .+.+.++ .|+-|.++..+-.+.-.... T Consensus 352 vGD~~ng~ly~l~~~~~td~G~--------------~i~~~~~----------------~p~~~~d~~Rv~d~~ve~~~- 400 (472) T protein:vir:10 352 CGDKLESVTGKLQFDISSQYGL--------------QQEHLLF----------------TPLFKADNARCFDLEVESST- 400 (472) T ss_pred EEEcCCCeEEEEcccCcCcCCC--------------cceEEEe----------------ccceeCCCCeEEEEEEEeec- Confidence 2222 22222211110 00000 0001011 01111111111000000000 Q ss_pred ceeEEEEecCceEEEccccccCCCEEEeCCCCCCCEEEEEEeeeEEEEeCCceeeccCCCcccceEeeeEEEEEEEEEee Q lcl|NC_020838. 818 DTISATSESEGSVFYFEDSDISSNQVLLNGDYRGRDLIIGYVYDMELELPTLYPTQVEGRSSVSDVTSDLILHRLKVSTG 897 (981) Q Consensus 818 ~~~~~~~~~dG~~~~~~~~tv~gg~itL~gd~~~~~v~VGl~y~s~v~~~~~~i~~~~g~~~~~~~~grl~l~r~~v~~~ 897 (981) |. .....+.+-|. ++. |..|-.+.- ... .+.|+ ...|++.+|+-. .. T Consensus 401 ----------G~------~~~adp~~~~~--~sD-----g~~~g~~~~---~~~-~~~g~-----~~~R~~~~RlG~-~r 447 (472) T protein:vir:10 401 ----------GV------AQYADRLFLSA--TTD-----GINYGREQM---IEQ-NEPFV-----YDKRVLWKRVGR-IR 447 (472) T ss_pred ----------CC------CcccCceEEEe--ccC-----Ccccchhhh---hhh-ccCcc-----cccceeeeeeee-cc Confidence 00 00000111110 000 000000000 000 00010 111222222210 00 Q ss_pred cccceEEEEccCCCCc----eeeEe Q lcl|NC_020838. 898 LSGPITYKVDITGKDE----WTNII 918 (981) Q Consensus 898 ~Sg~~~v~v~~~~~~~----~~~~~ 918 (981) +--+|++.+....+-. +..++ T Consensus 448 ~~vgf~~r~~~~~~v~l~ga~~~~e 472 (472) T protein:vir:10 448 KNVGFKLRVITKSPVTLSGAQIRIE 472 (472) T ss_pred ccceEEEEEEeccccceeeeeEEeC Confidence 1112222221111000 00000 No 48 >protein:vir:107669 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:704 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003901;genbank:gi:45686317;genbank:GeneID:2773009 Probab=21.29 E-value=1.4 Score=19.79 Aligned_cols=119 Identities=11% Similarity=0.105 Sum_probs=57.8 Q ss_pred eeeeeeEEeeCceeeEeeeccCCcCcCceEEEEEcccccceEEecceEEEEeeeecccceeecccCCccee--eEEEEcC Q lcl|NC_020838. 232 EVEKEVAAAYNSTELSTANTNMGTAQTAYDNAVSTESTEKGDYDSEVTACNIGSSNIPASAYLKDAAPEDI--EILTIND 309 (981) Q Consensus 232 ~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~~~~~~~~~~~~~y~~~~~~~dl--~~~t~ad 309 (981) |.+..+.+. .+++++.+..+.-+|.- + | ..-|+++.+|.....--..+++.+-...-.+|+| .+++..| T Consensus 1 mNY~~i~~~-----a~~~I~~fsd~~g~~~l-~-t--~~~~~~~v~G~Ev~~p~~~~~~~G~~~~y~~reIDG~lI~~gD 71 (123) T protein:vir:10 1 MNYSQIERM-----ARKGVAFFTDPSRPMNL-I-K--QGEYGYDENGFEIPPMEQVIPISGATRRPNAREIDGETIRASD 71 (123) T ss_pred CChHHHHHH-----HHHHHhhhcCCCCeEEe-e-e--CCcccccCCCeecccCCeeeeeEEEEeeccccccccceeeecc Confidence 222222221 11223333233334421 1 1 2224555666544444455667777777788876 6778899 Q ss_pred EEEEeCCceeEeccCCCCCCCCceEEEEEeeeccceEEEEeeCceEEEEEeccCCCcceecHHHHHHHHHhhhh--ccCc Q lcl|NC_020838. 310 YTFVLNKNKTTAMKTTTSAAVPNVAFVVIRIVAYNSDYSVTLNGTTVTHSTPDTVAGATTDSGSIAAALTSSIN--ALTG 387 (981) Q Consensus 310 ~tfi~n~~~~~~~~~~~~~~~~~~~~v~v~~g~y~~~~~v~~ng~~~~~~t~~~~a~~~~~~~~ia~~l~~~i~--~~~~ 387 (981) .-+|+|+++..+ .|. .|.++|....+-.+.-..++.++- +-+ .++. +..+ T Consensus 72 vk~if~a~veik------------------~Gd-----~I~vDg~~~rVV~~~pvkPa~~~I---~y~--~qLRrv~~~~ 123 (123) T protein:vir:10 72 ILGIFNNDHEIN------------------EGD-----YIEIDGIRHVVVDARPVQASLEPV---AYR--PVLRRVSVGG 123 (123) T ss_pred EEEeeccceeec------------------cCC-----EEEECCeEEEEecCcccchhhhhh---hhh--hhhceeccCC Confidence 999999998653 222 355666655554444433333221 111 1221 1122 Done!