Query         lcl|NC_016571.1_cdsid_YP_004957909.1 [gene=OBP_002] [protein=putative major tailsheath] [protein_id=YP_004957909.1] [location=complement(916..3117)]
Match_columns 733
No_of_seqs    8 out of 12
Neff          3.3
Searched_HMMs 1612
Date          Thu Nov 7 15:46:17 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_2 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_2_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM Template HMM
  1 protein:vir:94259 Length: 722  100.0  2E-301  1E-304 1669.5  37.6  702     1-733 1-720 (722)
  2 protein:vir:8933 Length: 695 # 100.0  1E-284  9E-288 1576.7  34.1  666     1-733 1-694 (695)
  3 protein:vir:100829 Length: 607  69.8    0.22 0.00014   24.2  25.7  559     1-733 1-602 (607)
  4 protein:vir:6594 Length: 666 #  61.5    0.35 0.00022   23.1  31.9  610     4-733 1-658 (666)
  5 protein:vir:106427 Length: 679  56.1    0.46 0.00029   22.4  34.0  637     1-733 1-672 (679)
  6 protein:vir:99306 Length: 587   45.4    0.77 0.00048   21.2  24.7  566     1-732 1-587 (587)
  7 protein:vir:6894 Length: 660 #  39.9       1 0.00062   20.6  31.2  615     1-733 1-653 (660)
  8 protein:vir:80779 Length: 569   34.4     1.3  0.0008   20.0  32.2  545     1-732 1-569 (569)
  9 protein:vir:104477 Length: 749  32.4     1.4 0.00088   19.7  36.2  625     1-733 1-746 (749)
 10 protein:vir:101187 Length: 663  31.2     1.5 0.00094   19.6  31.1  610     1-733 1-655 (663)
 11 protein:vir:95741 Length: 587   25.4     2.1  0.0013   18.9  23.3  562     1-732 1-587 (587)
 12 protein:vir:5663 Length: 671 #  23.6     2.3  0.0014   18.6  30.2  618     1-733 1-668 (671)
 13 protein:vir:98263 Length: 664   20.6     2.7  0.0017   18.2  35.0  619     1-733 1-657 (664)

No 1
>protein:vir:94259 Length: 722 # NCBI annotation: hypothetical protein # Family: family:all:12145 # MgeID: mge:1500 # MgeName: phiEL # Cross-refs: genbank:acc:YP_418039;genbank:gi:82700939;uniprot:Q2Z175;genbank:GeneID:5176710
Probab=100.00 E-value=1.7e-301 Score=1669.50 Aligned_cols=702 Identities=34% Similarity=0.576 Sum_probs=667.1

Q ss_pred
CceeeecccceeeccceeeccccccccCCCcccccccceeeeccccc--Cccc--ccCCcchhhhcccccCCCccccchH Q lcl|NC_016571. 1 METFNKVIPGKVVNEGIVSRETFPAAVPVISSPAHFPDFAAVTPRGS--TKRA--TVSTGTFTSKFGDTTDPFGLYYNPV 76 (733) Q Consensus 1 M~~~~n~~P~~v~~~GirD~S~~~l~~~~~~~p~H~Plf~~~~p~G~--~~~~--~v~~gd~~siyGD~~D~~s~yfn~~ 76 (733) |++|||++||||+|+||||||+|+|+|||+++|||||++|.++|||+ ++.| .|.++||++||||++|||||||||| T Consensus 1 m~~f~~~vP~rV~~sGIRD~S~~~~~~~~~t~~~H~Pll~~~~~~~rL~tetGTt~V~~~D~a~IyGq~~D~rS~ff~~q 80 (722) T protein:vir:94 1 MTVFNRIVPGKVDISGIRDNSIPEYDTSPPTTPLHLPVIHGIFPKGKLASKAGTQWVAPQDVKKIFGDIFDEKSPYYGPT 80 (722) T ss_pred CceeeccCCceeEEecccccCcCcccCCCCcchhhccchhccccccceehhccceeccchhhHHHHHhhccccccccChh Confidence 99999999999999999999999999999999999999999999998 4444 4899999999999999999999999 Q ss_pred HHHHHHHhhcccceEEEEEeccccccceeeeEEeeecccccceeEcCcccEecCCCCCcccCccccccCCeeEEe----- Q lcl|NC_016571. 77 TYAIQKLGQAGQASFSFKRLTNNTAKSRTIIGLAIFSGDIPNYLRDGNGDYVLDAAGNPKVDETTATIAGKWVCT----- 151 (733) Q Consensus 77 t~laq~l~~a~~n~f~~~RL~p~dA~s~~~i~~dv~~~~VP~Y~R~~nG~~~~d~~G~p~v~~t~~tI~gk~V~~----- 151 (733) |||||||++||||+|+||||+||||+++++++.+|..+|||+||||.+|+|+|+++|. .++|+||...+ T Consensus 81 SlL~~~L~LGr~N~~~vkRL~~eda~n~a~l~v~i~~Vev~~~~Rd~~g~~~y~~sg~------~iPip~tt~~~gl~i~ 154 (722) T protein:vir:94 81 SVLIQALALGRQNTIGIRRLSVNEVVSKASLSAFVQKVEVQAYERDANGRWKYNENGE------RIPIPGKTYPNGLNIE 154 (722) T ss_pred HHHHHHhhhccCceeEEEEechhhccChhheeeeeehhhhhhhhhhcCCccccccccc------ccccCCccccccccee Confidence 9999999999999999999999999999888888888889999999999999999998 45555544322 Q ss_pred -eEeeccc--ccccceeeccccccccCccccCcCceeeeeheehhhccccccccccccccchhh-cchhhhhhhccccce Q lcl|NC_016571. 
152 -GVLKSEG--EVGEAKAFEVTSTDETPGIPTGTNGKFYPLAEVIGGIGDTYNANWISIGHNFQT-DWNEVARFVQTNGSY 227 (733) Q Consensus 152 -~~~~~~~--~~g~~~a~~~~~~~~~~~~~~g~q~~~yPL~E~~~~~Gd~~n~n~i~~g~~~~~-dw~~~~~fv~~~~~~ 227 (733) ++.++++ |+|.++++ +..++++ |+|+.+|||||...+.||+||+.+.++|+|..+ .|+.+++|+.+++.+ T Consensus 155 i~idd~~~g~E~Gt~~V~--t~~ss~D----~~qslv~PLFE~~~~~~~~f~k~G~nlG~Rv~s~t~~dieef~~at~t~ 228 (722) T protein:vir:94 155 IKIDDAAAGKEPGALEVR--TIAASGD----TPETLVLPLFEGVAGVGDEYNKSGLNLGVRADALNWRSISEFVRSTGTF 228 (722) T ss_pred Eeeeccccccccchhhhh--hhhccCC----cchhhhhhhhhhhhhhhhhhhcccccccceeeccchhhhhhHhhhcccc Confidence 5555554 55555555 4444444 678889999999999999999999999999999 999999999999999 Q ss_pred eEEEeeeeecccccccceeeeccCcceeEeecccccccccceeeeeeeeeeeccCcccCcCCCCCCccceEEEhhhHHHH Q lcl|NC_016571. 228 PFILNMGVLLDNGLRVPANTINGTPDTTFTMFDTVNEFNTRYGLKVAVDQYTGNNVNRPVEVQDAPFDDVFMYTQNLSDV 307 (733) Q Consensus 228 pf~~~~~s~~d~g~~~~~~t~~~~~d~~~~~fd~~~~~~~~y~~~v~~d~y~~~~~~~p~~p~~~PF~~~yvY~~Nie~V 307 (733) ||.+++++...+|++|++||+++|+.+.++.| +.....++|++||++++|+++++++|++|+|+||+|+||||+||++| T Consensus 229 ~fd~RQF~~~v~gt~v~~KTalqqd~~~~tv~-~~s~n~~~Y~~dv~~~sy~~~~~~s~~~Pl~sPFsq~~vY~dnId~v 307 (722) T protein:vir:94 229 PYDLRQFTDGVDGTRVYSKTTLQQESAKFTLF-PVSLNGVQYSLQTGFGSFTGRNKNAPGNPVPAPFNDLVVYQDNIDTL 307 (722) T ss_pred chhhhhhhhccCCcceEEEeccccceeeEEEe-eecccchhhhhhhhheeeccCcccCCCCcCcCcccceEEEeccHHHH Confidence 99999999999999999999999999999999 88899999999999999999999999999999999999999999999 Q ss_pred HHHHHhhhcccccccCCcccchhccCCcce--eeccccccccCCCcceeEEecccccccccccceeeccceeecccCCCC Q lcl|NC_016571. 
308 AKELYAAEYGTDDPTNQPPNVLSKRLPKHA--IMNMFDLVNHNGKPYKHIVFGGNFDQAGKVTGSRFSMNHYLQSNGGIN 385 (733) Q Consensus 308 l~~ly~aE~~~~d~t~~p~~~~~~~~~~~~--~~n~f~~~~~nG~PY~~Iq~~g~~d~~g~v~gs~~~~n~~i~~sGG~d 385 (733) |||||++|++.||+ ++++++|++| |||||+|+||||+||++||++|++ +|+.|+++++||||+ T Consensus 308 ~Qmiy~~E~~~N~~------~v~v~~pg~~~~qid~~T~~n~dG~PY~~Iq~~G~l---------~~~~~~~i~ssGGi~ 372 (722) T protein:vir:94 308 CQMMYVVEQPENDT------LVEVGVPGEYYKQMNPFTCTNHNGAPYYAITTSGVI---------KWDLSGAIKSSGGIS 372 (722) T ss_pred HHHHhhhhcccccc------cceeccCccccccccceeeeecCCCcceeEEEeeee---------ecccceeeeccCCCc Confidence 99999999999974 5699999998 999999999999999999999998 688899999999999 Q ss_pred ccccccccccccccccccccccchhcccccchhhhhhhhhhhhhhhhhhhhhcccccceeehheeeccceeeEecCCCHH Q lcl|NC_016571. 386 PFADKDGKFPEAPTTWLPAIDGPWVADVSDPDLVISHKQAWEMNQMLIEAYLTTYITSLDVKDVIRNRSSFMWDVGYNQK 465 (733) Q Consensus 386 gt~d~dgk~~d~~t~~~~~ld~~~~~d~~~~~~~~~~~~~~~m~~~l~~a~l~~~~~~~~~nd~~ry~~~~~~DsGfs~~ 465 (733) ||+|||||+|||++. |++++|| +.++|.|+||+|+||||+.|+||+|+|++|+|+++|||+.|||||||||+||||+ T Consensus 373 ~~ldkDg~v~~Yv~~--~t~n~~f-g~ltd~e~~~~~~~~~~i~n~~~~~~~~~~vn~~~~~D~~rnq~~~~~D~G~~~e 449 (722) T protein:vir:94 373 PFLDKDGKVPDYVTK--PTVNDPF-GLLTNTERPLTHLQAWEITNKLMVADLTSYVNGVEMKDTTRNRQSIFWDIGYSQE 449 (722) T ss_pred cccCCCCCcccceec--CcCCccc-cccccchhhhhhhhhhhhhhhhhhhhhhhhccceeeeeeeeceeeeeeecCCchH Confidence 999999999999998 9999998 9999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCcceEEEEEEEeeC--CCCCHHHHHHHHHHHHhhhheeeeeeeccCccceeEeecceeEEEeccccccc Q lcl|NC_016571. 466 IKDIMIQFLSKRKDIIVVPCATEYL--RKKTQDELYSTATMLNTRIVMIPESEVYKSEACRASINLWDARYINEPTWGRF 543 (733) Q Consensus 466 ~K~~m~~fl~~RkD~~vv~~T~~~~--~~~s~~e~~s~a~~L~trl~~~PES~~YgTpa~Ra~I~~~s~~l~ns~y~~~v 543 (733) +|++|||||++|||++||+.|+++. 
++.+++|+|||++||+|||+||||||+|||||||||||+||||||+++|+||| T Consensus 450 ~K~~a~q~L~aR~Dl~~~~~t~v~~~~~~~~~~~~~SR~~~i~tRLk~~~ESt~~GTpvcRA~i~l~s~Kl~d~~~~~y~ 529 (722) T protein:vir:94 450 VKDIAEQFLGARKDILVVADACVWRPGEKNSLEEIYSRAAMITNRLRMTVESEKWGTPACRAAVNLIEAKVTDEPTGWYF 529 (722) T ss_pred HHHHHHHHHhcccceeEEeeeeeecCCCCCchHHHHHHHHHHHHHhhcccccccccchhhhhhhhhhhceeecCccceec Confidence 9999999999999999999777774 46799999999999999999999999999999999999999999999999999 Q ss_pred eehHHHHHHHHHHhcCcCCcccCCCccccCCcceeeeeccCcceecCCchhhhhhhCceeEeeeeeee-EEEcccccccc Q lcl|NC_016571. 544 SLNIENMYAFAVAGGGADGRIYAADMPDHEGNRILRIAHDPFVEFEADDPAANNLIQGCITVTPITES-QFCRPALPTVY 622 (733) Q Consensus 544 pln~dlA~k~A~y~Ga~dG~~~~~~~~D~~pN~~v~~~~dlnv~f~~~~v~a~~w~ng~i~v~~yD~~-~~y~PaL~TVy 622 (733) |+|+|+||+||+|+|++||.++|+++|||+|||+||+||+|||+|++|+|||++|+||+||+++||+| |.||||||||| T Consensus 530 ~~~lD~A~~~A~~AG~~~G~lv~~~e~D~s~N~~v~~v~~~Nv~F~~d~vaa~~~~NG~~~~~~~D~~~qtY~P~l~sv~ 609 (722) T protein:vir:94 530 SGNIDLAYAFALWAGNIDGLIVTPNAPDHADNRKLRLMHSPNIVFEEDEVAAENFSNGHISLKPWDWNVQTYRPGLPTIH 609 (722) T ss_pred chhHHHHHHHHhhhccccccccCCcccCcCCCccEEEEecCCceecchhhHHhhhccCcccccccccccccccCCCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999966 66999999999 Q ss_pred CCchhhhhhhhhhhhHHHHHHHHHHHhhhhccccccChhHHHHHHHHHHHHHhhhhhcceecceeEecccceeccchhcC Q lcl|NC_016571. 
623 GNINSVLKDLTNVWKCVVVEKILQDIWIQVSGDTQLGKEGYLSFVKDGAEKRIRDLFGSVISNWEVVPSFREDSPTSKSV 702 (733) Q Consensus 623 ~ndtSVLns~~tv~~c~~~~kv~~~~w~~fsG~~slt~~ql~~~~~d~I~~~~rd~fg~rv~n~~vi~p~~~~T~~d~~~ 702 (733) +||+|||+|++|+++||+|||+||++||+|||+++||+|||+++|||+|+++|||+||+||+||...+++++.|.+++++ T Consensus 610 ~d~~SVL~s~~t~~iC~v~~rl~~~~~~~~~Gn~Tlt~eq~v~~~~~eI~d~~Rd~~G~~V~nIv~~T~~~~~~~~~~~~ 689 (722) T protein:vir:94 610 PNPDSVLKDLANPFLCVAMEKISADQWRIVCGDRTITADNYASIMKDEIESACRNAVGGEVSNIVAETFYETGTLGSRAK 689 (722) T ss_pred cCchhhhhhccchHHHHHHHHHHHHHHHHhhCccccCHHHHHHHHHHHHHHHHHHhhccchhhcchhhhhhccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceEEEeeeccCCcEEEEEEEEEEEccCCCCC Q lcl|NC_016571. 703 MYSITRLWFGKGIYMLNSVLEAYNEDSLNAE 733 (733) Q Consensus 703 g~SwT~~~~nn~~~~m~~~leayr~~~l~a~ 733 (733) .+.+.|+||||.||||+|+|++|+++||-.. T Consensus 690 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 720 (722) T protein:vir:94 690 LRVILHAWFNKAKYMMEFDLYAYNQQDLATT 720 (722) T ss_pred eeeeeehhhcccceeeeeeehhccchhhhhc Confidence 9999999999999999999999999999655 No 2 >protein:vir:8933 Length: 695 # NCBI annotation: ORF029 # Family: family:all:12145 # MgeID: mge:163 # MgeName: phiKZ # Cross-refs: genbank:acc:NP_803595;genbank:gi:29134965;genbank:GeneID:1258217 Probab=100.00 E-value=1.5e-284 Score=1576.67 Aligned_cols=666 Identities=22% Similarity=0.305 Sum_probs=629.3 Q ss_pred CceeeecccceeeccceeeccccccccCCCcccccccc---eeeecccccCcccccCCcchhhhcccc-cCCCccccchH Q lcl|NC_016571. 
1 METFNKVIPGKVVNEGIVSRETFPAAVPVISSPAHFPD---FAAVTPRGSTKRATVSTGTFTSKFGDT-TDPFGLYYNPV 76 (733) Q Consensus 1 M~~~~n~~P~~v~~~GirD~S~~~l~~~~~~~p~H~Pl---f~~~~p~G~~~~~~v~~gd~~siyGD~-~D~~s~yfn~~ 76 (733) |+ |+|++| ||+|+||||||+|+|+|||+++|||||+ |++.+|+.++++|..++| |++||||+ +||||||||+| T Consensus 1 ~~-f~navP-rVvfsGIRDrSrr~lirpd~t~aqH~PllrLftetGp~~TtyVgd~Ddg-fa~IyGq~slDprSkffn~q 77 (695) T protein:vir:89 1 MA-YYNAVP-RVVFNGIRDRSRRPLIRPDITFAQHCPLLRLFTETGPTETTYVGDSDDG-FASIYGQASLDPRSKFFNTQ 77 (695) T ss_pred Cc-cccccc-eeEeecccccCcCcccCCCCcchhhcchhhhhhhcCCcceeeecCcccc-cceeeeeeeecccCcccChH Confidence 65 677999 9999999999999999999999999999 566666668888888888 99999997 69999999999 Q ss_pred HHHHHHHhhcccceEEEEEecccccc--ceeeeEEeeecccccceeEcCcccEecCCCCCc----ccCccccccCCeeEE Q lcl|NC_016571. 77 TYAIQKLGQAGQASFSFKRLTNNTAK--SRTIIGLAIFSGDIPNYLRDGNGDYVLDAAGNP----KVDETTATIAGKWVC 150 (733) Q Consensus 77 t~laq~l~~a~~n~f~~~RL~p~dA~--s~~~i~~dv~~~~VP~Y~R~~nG~~~~d~~G~p----~v~~t~~tI~gk~V~ 150 (733) ||||||| +||||+|+||||+||||+ +++|+++++|+++||+||||+|| |+|+++|+. .|+.++..-|-+..| T Consensus 78 SlLalnL-LGrgNgfyvkRL~peda~n~arlivai~~Ved~vp~~irrlsG-fnyp~~~~~i~~~pvpt~d~v~Gl~arI 155 (695) T protein:vir:89 78 SLLALNL-LGRGNGFYVKRLRPEDAANPSRLIVAIEIVEDEIPLTIRRLSG-FNYPNSVRDIGNAPVPTTDKVDGLKARI 155 (695) T ss_pred HHHHHHH-hccCceeEEEEechhhccchhhhhhHHHHHHHHHHHHHHhhcC-ccCCccccccccCCCCccccccceeEEE Confidence 9999999 999999999999999999 55566789999999999999999 999999663 355556655558889 Q ss_pred eeEeecccccccceeeccccccccCccccCcCceeeeehe----ehhhcccccccc-ccccccchhh-cchhhhhhhccc Q lcl|NC_016571. 
151 TGVLKSEGEVGEAKAFEVTSTDETPGIPTGTNGKFYPLAE----VIGGIGDTYNAN-WISIGHNFQT-DWNEVARFVQTN 224 (733) Q Consensus 151 ~~~~~~~~~~g~~~a~~~~~~~~~~~~~~g~q~~~yPL~E----~~~~~Gd~~n~n-~i~~g~~~~~-dw~~~~~fv~~~ 224 (733) ++|+|+++|+|+++++|++..++++ |+|+.+||||| |||++||+.|++ |+...+++++ |.+.+.+|.++ T Consensus 156 ~lI~DntsEvGtqrVlpgt~~ss~D----g~qslvyPLFE~p~sffgklGdnlG~Rvws~t~adieefdea~t~kfdtR- 230 (695) T protein:vir:89 156 ILIEDNTSEVGTQRVLPGTLVSDKD----GSQSLVYPLFEAPVSFFGKLGDSNGMRVWSTTTADIEEFDEAAMAKFKTR- 230 (695) T ss_pred EEeeccCcCcchhhccccceeecCC----CchhhhhhhHhhHHHHHhhccccccceeeccchhhhhhhhhhcCcchhhh- Confidence 9999999999999999999999998 77999999999 899999999999 9999999988 99999999999 Q ss_pred cceeEEEeeeeeccc-ccccceeeeccCcceeEeecccccccc----cceeeeeeeeeeeccCcccCcCCCCCCccceEE Q lcl|NC_016571. 225 GSYPFILNMGVLLDN-GLRVPANTINGTPDTTFTMFDTVNEFN----TRYGLKVAVDQYTGNNVNRPVEVQDAPFDDVFM 299 (733) Q Consensus 225 ~~~pf~~~~~s~~d~-g~~~~~~t~~~~~d~~~~~fd~~~~~~----~~y~~~v~~d~y~~~~~~~p~~p~~~PF~~~yv 299 (733) +|++|++++++. +++|++||+++ +|++..+||++++.+ ++|++||++++|+++++++|++|+|+||+|+|| T Consensus 231 ---qF~~q~~e~~ev~~tpv~~KTalq-qdy~~~tfd~gv~s~s~n~dlY~gdvl~~sy~ddg~~sg~~PlysPFsq~yv 306 (695) T protein:vir:89 231 ---QFRIQLIEKPEVGTSPVIVKTADQ-QDYLNITFDKGVYSDMYNADLYVGDVLVDSYSDDGVVSGLSPLYSPFSQFYV 306 (695) T ss_pred ---eeeeeeeeccCCCCcceEEEeccc-cceeeeeecccccchhccchhhhhhhheeeeccCcccCCCCcccCcccceEE Confidence 778888886554 56666677666 899999999988877 999999999999999999999999999999999 Q ss_pred EhhhHHHHHHHHHhhhcccccccCCcccchhccCCcceeeccccccccCCCcceeEEecccccccccccceeeccceeec Q lcl|NC_016571. 
300 YTQNLSDVAKELYAAEYGTDDPTNQPPNVLSKRLPKHAIMNMFDLVNHNGKPYKHIVFGGNFDQAGKVTGSRFSMNHYLQ 379 (733) Q Consensus 300 Y~~Nie~Vl~~ly~aE~~~~d~t~~p~~~~~~~~~~~~~~n~f~~~~~nG~PY~~Iq~~g~~d~~g~v~gs~~~~n~~i~ 379 (733) ||+||++||||||++|++.| |+++++.++|| +||||+|+||||+||++||++|+++ +|++|+|++++| T Consensus 307 Y~dnId~v~Qmiyd~E~~~N-----pa~~~~~~~PG--eidflT~ln~dGdPY~~Iq~~G~l~-----GGi~LgKdg~vy 374 (695) T protein:vir:89 307 YHENIDLVRQMIYDTEMRVN-----PAAAAHTTAPG--EIDFLTFLAVDGDPYQGIQVLGPLD-----GGITLGKDGNIY 374 (695) T ss_pred EeccHHHHHHHHhhhccccc-----hhhhhhccCCc--cccceeeeecCCCcceeEEEeeeec-----cceeecCCceEE Confidence 99999999999999997665 68899999999 6999999999999999999999998 899999999999 Q ss_pred ccCCCCccccccc--cccccccccccccccchhcccccchhhhhhhhhhhhhhhhhhhhhcccccceeehheeeccceee Q lcl|NC_016571. 380 SNGGINPFADKDG--KFPEAPTTWLPAIDGPWVADVSDPDLVISHKQAWEMNQMLIEAYLTTYITSLDVKDVIRNRSSFM 457 (733) Q Consensus 380 ~sGG~dgt~d~dg--k~~d~~t~~~~~ld~~~~~d~~~~~~~~~~~~~~~m~~~l~~a~l~~~~~~~~~nd~~ry~~~~~ 457 (733) +|||+||++|.|+ |++||+|+||++||| +|+|++||||||+ T Consensus 375 asggtdgtTd~eey~klvdI~N~Nfg~l~D-------------------------------------~y~n~a~yqfg~l 417 (695) T protein:vir:89 375 ASGGTDGTTDLEEYAKLVDIENINFGKLND-------------------------------------RYNNIAEYQFGVL 417 (695) T ss_pred EeCCCCCccchhhhheeeeeecccceeecc-------------------------------------chhhhhhhheeee Confidence 9999999999999 999999999999999 4999999999999 Q ss_pred EecCCCHHHHHHHHHHHhcCcceEEEEEEEeeCCCCCHHH--HHHHHHHHHhhhheeeeeeeccCccceeEeecceeEEE Q lcl|NC_016571. 
458 WDVGYNQKIKDIMIQFLSKRKDIIVVPCATEYLRKKTQDE--LYSTATMLNTRIVMIPESEVYKSEACRASINLWDARYI 535 (733) Q Consensus 458 ~DsGfs~~~K~~m~~fl~~RkD~~vv~~T~~~~~~~s~~e--~~s~a~~L~trl~~~PES~~YgTpa~Ra~I~~~s~~l~ 535 (733) ||+||||++|++|||||++|||++|+++|++++|.+-+.| ++||++||+|||+||||||+|||||||||||+|||||| T Consensus 418 yDtGlpmesKy~amq~L~aR~Dl~~~ftt~vetD~r~p~e~~elSR~~~i~tRLkafpEStlyGTpvcRA~ivlqsgKlm 497 (695) T protein:vir:89 418 YDTGLPMESKYRAMRVLSARRDLQYFFTTFVETDSRLPDEATELSRVQQIITRLKAFPESTLYGTGVCRAMIVMQSGKLM 497 (695) T ss_pred eecCCchHHHHHHHHHHhhhhhhhhhhhhheeecccCCCCchhHHHHHHHHHHhhcccccccccchhhhhhhhheeceee Confidence 9999999999999999999999999999999999876665 78999999999999999999999999999999999999 Q ss_pred eccccccceehHHHHHHHHHHhcCcCCcccCCCccccCCcceeeeeccCcceecCCchhhhhhhCceeEeeeeeeeEEEc Q lcl|NC_016571. 536 NEPTWGRFSLNIENMYAFAVAGGGADGRIYAADMPDHEGNRILRIAHDPFVEFEADDPAANNLIQGCITVTPITESQFCR 615 (733) Q Consensus 536 ns~y~~~vpln~dlA~k~A~y~Ga~dG~~~~~~~~D~~pN~~v~~~~dlnv~f~~~~v~a~~w~ng~i~v~~yD~~~~y~ 615 (733) +++|+||||+|+|+|||||+|+|+++|.++|+++|||+|||+||++|+|||+|++|+||+++|+||+||+++||+||+|| T Consensus 498 dg~y~kyvpqllD~Am~wA~yAGag~G~lvpg~emDvspNn~vtfvk~lNv~Ff~drvaa~~w~NG~t~s~~yD~r~~Yy 577 (695) T protein:vir:89 498 DGTYRKYVPQLLDVAMSWARYAGAGTGNLVPGMEMDVSPNNRVTFVKDLNVKFFDDRVRAQAWANGATWSQSYDHRSSYY 577 (695) T ss_pred cCccceecchhHHHHHHHhhhhccccccccCCcccCcCCCccEEEEecCCceecchhhHHhhhccCcccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccCCchhhhhhhhhhhhHHHHHHHHHHHhhhhccccccChhHHHHHHHHHHHHHhhhhhcceecceeEeccccee Q lcl|NC_016571. 
616 PALPTVYGNINSVLKDLTNVWKCVVVEKILQDIWIQVSGDTQLGKEGYLSFVKDGAEKRIRDLFGSVISNWEVVPSFRED 695 (733) Q Consensus 616 PaL~TVy~ndtSVLns~~tv~~c~~~~kv~~~~w~~fsG~~slt~~ql~~~~~d~I~~~~rd~fg~rv~n~~vi~p~~~~ 695 (733) |||||||+||+|||+|++|+++||+|||+||++||+|||+++||+|||++|||++|+++|||+||+||+ |+|++|| T Consensus 578 Pclrsv~lddtSVLlsp~tvniCcv~~rlih~vha~f~GnaTlt~eqlv~rcd~eIld~~Rd~fG~rVn----I~p~Tei 653 (695) T protein:vir:89 578 PCLRSVMLDDTSVLLSPITVNICCVLIRLIHKVHAQFSGNATLTPEQLVERCDEYILDLVRDMFGTRVN----IIPRTEI 653 (695) T ss_pred CCCcccccCchhhhhhccchHHHHHHHHHHHHHHHHhhCccccCHHHHHHHHHHHHHHHHHhhcCceee----eeeccce Confidence 999999999999999999999999999999999999999999999999999999999999999999998 9999999 Q ss_pred ccchhcCceEEEeeec---cCCcEEEEEEEEEEEccCCCCC Q lcl|NC_016571. 696 SPTSKSVMYSITRLWF---GKGIYMLNSVLEAYNEDSLNAE 733 (733) Q Consensus 696 T~~d~~~g~SwT~~~~---nn~~~~m~~~leayr~~~l~a~ 733 (733) ||+|.+|||||||.++ |||+|+|+|+||++|||+++|| T Consensus 654 t~~d~~ng~swtc~vtv~annprt~~~f~letvr~et~~a~ 694 (695) T protein:vir:89 654 TPIDANNGTSWTCNVTVEANNPRTTLNFNLETVRIETPPAQ 694 (695) T ss_pred eecccCCCceEEEEEEEeeCCCceEEeeeeeeEEecCCCCC Confidence 9999999999999866 9999999999999999999999 No 3 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=69.76 E-value=0.22 Score=24.21 Aligned_cols=559 Identities=13% Similarity=0.054 Sum_probs=229.4 Q ss_pred Cce-------eeeccccee-eccceeeccccccccCCCcccccccceeeecccccCccccc--CCcchhhhcc--cccCC Q lcl|NC_016571. 1 MET-------FNKVIPGKV-VNEGIVSRETFPAAVPVISSPAHFPDFAAVTPRGSTKRATV--STGTFTSKFG--DTTDP 68 (733) Q Consensus 1 M~~-------~~n~~P~~v-~~~GirD~S~~~l~~~~~~~p~H~Plf~~~~p~G~~~~~~v--~~gd~~siyG--D~~D~ 68 (733) |++ +-.+-|||= .=-|..=...-.-+.+..+.+-+...|-+.+.+|..+.-+. 
+-.++.++|| |+-|- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~G~~~~~~~~~~~~~a~~~f~~g~l~~a 80 (607) T protein:vir:10 1 MTTTITSAESYKRIYPLFYDSRPHVETNFDDSRLSNTASDSAKNIFMLGSATNGDPTKVYEIRTSQQATKIFGSGDLVDG 80 (607) T ss_pred CcceecchhhHHHHhCCCCccCCceEEEEecCcCcCCCCCCcceEEEEEEeCCCCCceEEEEcchhHHHHhhcCcchHHH Confidence 433 223334430 00122222223345667777788888999999999877663 4445889995 65554 Q ss_pred CccccchHHHHHHHHhhcccceEEEEEeccccccceeeeEEeeecccccceeEcCccc-EecCCCCCcccCccccccCC- Q lcl|NC_016571. 69 FGLYYNPVTYAIQKLGQAGQASFSFKRLTNNTAKSRTIIGLAIFSGDIPNYLRDGNGD-YVLDAAGNPKVDETTATIAG- 146 (733) Q Consensus 69 ~s~yfn~~t~laq~l~~a~~n~f~~~RL~p~dA~s~~~i~~dv~~~~VP~Y~R~~nG~-~~~d~~G~p~v~~t~~tI~g- 146 (733) -..-|+|.+-+ +++++ .|+.-|.-+..+.+...=.+-+ -=..|-.++|.. +.+|+. |+| T Consensus 81 ~~~a~~~~~~~----~~g~~-~~~~~rv~~~~~a~~~~~~~~~---~~~~~~~~~~~i~~~l~~~-----------~~~~ 141 (607) T protein:vir:10 81 IKLAFDPTGNS----VTNGG-TVYALRVDNAKQASLVKDGLTF---TSSIFGTNANQVSVALDND-----------VFGV 141 (607) T ss_pred HHHhhccccCC----ccCCc-eEEEEeCCCccccceecccccc---cccccccCCCceEEEEEec-----------CCCc Confidence 44444444322 24444 3777777555554433212111 112222222221 222211 121 Q ss_pred eeEEeeEeecc-cccc------cceeeccccccccCccccCcCceeeeeheehhhccccccccccccccchhhcchhhhh Q lcl|NC_016571. 147 KWVCTGVLKSE-GEVG------EAKAFEVTSTDETPGIPTGTNGKFYPLAEVIGGIGDTYNANWISIGHNFQTDWNEVAR 219 (733) Q Consensus 147 k~V~~~~~~~~-~~~g------~~~a~~~~~~~~~~~~~~g~q~~~yPL~E~~~~~Gd~~n~n~i~~g~~~~~dw~~~~~ 219 (733) |.....+..+. .++. 
-.-.+.++..+++..+.-+..|+-..| ....|+..+..=.-+..++ T Consensus 142 ~~~~~~~~~d~~~~~~~n~g~~~~i~y~g~~~~a~~~v~~~~~g~~~~l---t~~~~~~~~~~~~V~~~~l--------- 209 (607) T protein:vir:10 142 PRITVNYSPDNYERTYTNIGQMFSITYSGKSASAGYTVSHDTDGKAILL---TLGSGDSIDKLTNVATFDL--------- 209 (607) T ss_pred cceeEEeecccceeeeeeccceeecccCcccccccceeeecCCCceeEE---EecCCCccceeeeeecccc--------- Confidence 22222222221 1110 011223333333322221222221111 1122222222100001111 Q ss_pred hhccccceeEEEeeeeecccccccceeeeccCcceeEeeccccccccccee----ee--eeeeeeeccCcccCcCCCCCC Q lcl|NC_016571. 220 FVQTNGSYPFILNMGVLLDNGLRVPANTINGTPDTTFTMFDTVNEFNTRYG----LK--VAVDQYTGNNVNRPVEVQDAP 293 (733) Q Consensus 220 fv~~~~~~pf~~~~~s~~d~g~~~~~~t~~~~~d~~~~~fd~~~~~~~~y~----~~--v~~d~y~~~~~~~p~~p~~~P 293 (733) ..+.||..-.++. + ++.-+++.-..++..+ .+..|. .+ |.+..+.- +-.....+.|-+ T Consensus 210 ---~~~~~~t~~~l~~---d--------in~~~~~~A~~~g~~~-i~tky~d~~~~~i~V~~~~~iv-~a~~~D~~~~~~ 273 (607) T protein:vir:10 210 ---TMSKYDTIAKLMQ---A--------ISATPNFSASVVGSPS-VNTSYLDEVTSPVDVKTAPAVV-TAKIGDAISKLG 273 (607) T ss_pred ---cccccchHHHHHH---H--------hhcCCceEEEEecccc-eeeeccccccceeEEEEeeeee-chhhhhhhhccc Confidence 1122222111110 0 1112233222222111 111120 01 11111100 011112344444 Q ss_pred ccceEEEhh--hHHHHHHHHHhhhcccccccCCcccchhccCCcceeeccccccccCCCcceeEEeccccccccccccee Q lcl|NC_016571. 294 FDDVFMYTQ--NLSDVAKELYAAEYGTDDPTNQPPNVLSKRLPKHAIMNMFDLVNHNGKPYKHIVFGGNFDQAGKVTGSR 371 (733) Q Consensus 294 F~~~yvY~~--Nie~Vl~~ly~aE~~~~d~t~~p~~~~~~~~~~~~~~n~f~~~~~nG~PY~~Iq~~g~~d~~g~v~gs~ 371 (733) ..+++..+. ..+.+.... .+| +++..+. 
-....+|.| T Consensus 274 ~~~~~~~t~~~~~~~~~~~~-~~~-------------------~~~~~~~-~~~~~~~~~-------------------- 312 (607) T protein:vir:10 274 YDPYVVVTQTSNNKPIVNGV-SAG-------------------TGSATAS-VTTAPESFP-------------------- 312 (607) T ss_pred ccceEEeeecccchhhhhhh-hcc-------------------ccceeee-eeccccccc-------------------- Confidence 444444322 111111100 000 0011110 001111111 Q ss_pred eccceeecccCCCCccccccccccccccccccccccchhcccccchhhhhhhhhhhhhhhhhhhhhcccccceeehheee Q lcl|NC_016571. 372 FSMNHYLQSNGGINPFADKDGKFPEAPTTWLPAIDGPWVADVSDPDLVISHKQAWEMNQMLIEAYLTTYITSLDVKDVIR 451 (733) Q Consensus 372 ~~~n~~i~~sGG~dgt~d~dgk~~d~~t~~~~~ld~~~~~d~~~~~~~~~~~~~~~m~~~l~~a~l~~~~~~~~~nd~~r 451 (733) -+. .+....||+||+.. ++|+=+-..|++ .+. T Consensus 313 a~~-a~~~LtGGtdG~~~----------------------------------~ty~dal~aLe~--------~e~----- 344 (607) T protein:vir:10 313 ANF-DTAFLTGGSTGDVP----------------------------------VSWADKFNGAIG--------NNV----- 344 (607) T ss_pred ccc-ceeeeeCCCCCCch----------------------------------hhHHHHHHHHhh--------cCc----- Confidence 111 23456799999611 111111111110 011 Q ss_pred ccceeeEecCCCHHHHHHHHHHHhcCcc----eEEEEEEEeeCCCCCHHHHHHHHHHHHhhhheeeeeeeccCccceeEe Q lcl|NC_016571. 452 NRSSFMWDVGYNQKIKDIMIQFLSKRKD----IIVVPCATEYLRKKTQDELYSTATMLNTRIVMIPESEVYKSEACRASI 527 (733) Q Consensus 452 y~~~~~~DsGfs~~~K~~m~~fl~~RkD----~~vv~~T~~~~~~~s~~e~~s~a~~L~trl~~~PES~~YgTpa~Ra~I 527 (733) |-+.++ + -.+.+|..+..|+.++++ .+.|+.-. ..-++++..+++..++.. |... T Consensus 345 ~~i~~~--t-~d~ai~~~l~a~vkr~~~~g~~~~aVlg~~---~~~t~~~~~t~a~~~N~e---------------rvv~ 403 (607) T protein:vir:10 345 YYIIPL--T-SEENIHAELQAFIDEQHVLGYNYHAFVGGG---FAEPLEQILSRQVNINDS---------------RFGL 403 (607) T ss_pred eEEEec--C-CCHHHHHHHHHHHHHHHhCCCcEEEEecCC---CCCCHHHHHHHHHhhCCC---------------cEEE Confidence 111111 1 246789999999988877 55554322 234677777777776432 5556 Q ss_pred ecceeEEEeccccccceehHHHHHHHHHHhcCcCCcccCCCccccCCcceeeeeccCcceecCCchhhhhhhCceeEeee Q lcl|NC_016571. 
528 NLWDARYINEPTWGRFSLNIENMYAFAVAGGGADGRIYAADMPDHEGNRILRIAHDPFVEFEADDPAANNLIQGCITVTP 607 (733) Q Consensus 528 ~~~s~~l~ns~y~~~vpln~dlA~k~A~y~Ga~dG~~~~~~~~D~~pN~~v~~~~dlnv~f~~~~v~a~~w~ng~i~v~~ 607 (733) +.-++.+.+.......|--.=-|+-++.++|.. +-.++.+++-.+.++.-.|... -..+.-.+|.+.+.. T Consensus 404 V~~~~~~~~~G~~~~~~~~~~Aa~vAGl~Ag~~---------~~~SlT~k~i~~~~v~~~lt~~-e~e~ai~~Gv~~l~~ 473 (607) T protein:vir:10 404 VGQSGHVQEGGESVHVPAYLMAAYVGGLSSSLG---------VAVPITNKKLALVDLDQNFSGD-DLNTLNQNGVIGIEH 473 (607) T ss_pred EecCeeEeeCCcceeccHHHHHHHHHHHHhcCc---------cccCcccceeccccccccCCHH-HHHHHHhCCeEEEEE Confidence 666777777644443433222334344444432 3344555444456777777754 445677888665532 Q ss_pred eeeeE------EEccccccccCCchhhhhhhhhhhhHHHHHHHHHHHhh-hhccccccChhHHHHHHHHHHHHHhh--hh Q lcl|NC_016571. 608 ITESQ------FCRPALPTVYGNINSVLKDLTNVWKCVVVEKILQDIWI-QVSGDTQLGKEGYLSFVKDGAEKRIR--DL 678 (733) Q Consensus 608 yD~~~------~y~PaL~TVy~ndtSVLns~~tv~~c~~~~kv~~~~w~-~fsG~~slt~~ql~~~~~d~I~~~~r--d~ 678 (733) +.++ ..-=++-|.-..++..++.+-++=..=+|.+-+...++ .|-|. ...+.....+++.|...+. .+ T Consensus 474 -~~~~~~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yIGk--~nnd~~~~~vk~~i~~~L~~~~l 550 (607) T protein:vir:10 474 -LVNRNATGGYYIVQDVSTNTVSSSHVDGSLYLGELTDFLFDNLRFVLRDTYIGS--NIRSTSADDIKSTVASYLYSEMN 550 (607) T ss_pred -ccCccccceEEEeeeeeeccCCCCcchheeehhhhHHHHHHHHHHHHhhcCCcc--cCCcchHHHHHHHHHHHHHHHHH Confidence 1111 12234445444444455555555555566666666665 58884 3455666667777766531 11 Q ss_pred h-cceecceeEecccceeccchhcCceEEEeeeccCCcEEEEEEEEEEEccCCCCC Q lcl|NC_016571. 679 F-GSVISNWEVVPSFREDSPTSKSVMYSITRLWFGKGIYMLNSVLEAYNEDSLNAE 733 (733) Q Consensus 679 f-g~rv~n~~vi~p~~~~T~~d~~~g~SwT~~~~nn~~~~m~~~leayr~~~l~a~ 733 (733) - ++.+.+.. 
..+-+++......-..|.+...+-|+++.-+ ..+|.++|+|| T Consensus 551 ~~~gaI~df~--~edv~v~~~~D~v~v~~~v~Pv~~iekIyvt--v~v~~~~~~~~ 602 (607) T protein:vir:10 551 NDDGLIVDFS--ESDIVVTISGTVVYIQFAVAPTQEIKNIVVS--GTYSNYSATSE 602 (607) T ss_pred HhcCceeCCC--ccccEEeeCCCEEEEEEEEEEcccceEEEEE--EEEEEEEEeec Confidence 1 22332211 0112223322223333444344444444322 24677888999 No 4 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=61.51 E-value=0.35 Score=23.07 Aligned_cols=610 Identities=11% Similarity=0.048 Sum_probs=232.1 Q ss_pred eeecccceeeccceeeccccccccCCCcccccccceeeecccccCcccc--cCCcchhhhcccccCCCccccchHHHHHH Q lcl|NC_016571. 4 FNKVIPGKVVNEGIVSRETFPAAVPVISSPAHFPDFAAVTPRGSTKRAT--VSTGTFTSKFGDTTDPFGLYYNPVTYAIQ 81 (733) Q Consensus 4 ~~n~~P~~v~~~GirD~S~~~l~~~~~~~p~H~Plf~~~~p~G~~~~~~--v~~gd~~siyGD~~D~~s~yfn~~t~laq 81 (733) .+-.+|| |-++=| |.++ ... .+ +--.+-|-..++||-.+... .+-.||..+||...+-. ....+.+ T Consensus 1 ~~~~~Pg-Vyv~e~-~~~~-~i~-~v---~ts~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~-----~~~~~~~ 68 (666) T protein:vir:65 1 MTLLSPG-FETKET-TLST-TIV-QS---ETGRAALVGKFQWGPAFQIIQVTNEVELVNKFGQPDNNT-----ADYFMSG 68 (666) T ss_pred CceecCc-eEEEEe-cCcc-ccc-cc---CcccceEEecccCCCCccCEEecCHHHHHHHcCCccccc-----hhHHHHH Confidence 4446798 566555 5333 322 22 23356789999999875444 37778999999754432 2223444 Q ss_pred HHhhcccceEEEEEeccccccceeeeEEeeecccccceeE--cCcccEecCCCCC---cccCccccccCCeeEEeeEeec Q lcl|NC_016571. 82 KLGQAGQASFSFKRLTNNTAKSRTIIGLAIFSGDIPNYLR--DGNGDYVLDAAGN---PKVDETTATIAGKWVCTGVLKS 156 (733) Q Consensus 82 ~l~~a~~n~f~~~RL~p~dA~s~~~i~~dv~~~~VP~Y~R--~~nG~~~~d~~G~---p~v~~t~~tI~gk~V~~~~~~~ 156 (733) .-++..|+.|++-|+...++.+++....+ .+..-.+ ...| ..|+ .+.......+++.... 
T Consensus 69 ~~f~ngg~~~~vvrv~~~~~~~~~~~~~~----~~~~~~~~~g~~~-----~~g~~~~V~~~~~~~~~~~~~~~------ 133 (666) T protein:vir:65 69 ANFLQYGNDLRVVRVLNKEKAKNATALAG----NVEFEITNEGSNY-----EVGDTIKIKHNRQDIETAGKVTK------ 133 (666) T ss_pred HHHHhcCceEEEEEccCcccccccccccC----ceeeeEeeccccc-----cccceEEEEeccccccccccccc------ Confidence 55567788899999877666544332111 0100000 0000 0011 0000000111110000 Q ss_pred ccccccceeeccccccccCccccCcCceeeeeheehhhcccccccccccccc--chhh----cch--hhhhhhcccccee Q lcl|NC_016571. 157 EGEVGEAKAFEVTSTDETPGIPTGTNGKFYPLAEVIGGIGDTYNANWISIGH--NFQT----DWN--EVARFVQTNGSYP 228 (733) Q Consensus 157 ~~~~g~~~a~~~~~~~~~~~~~~g~q~~~yPL~E~~~~~Gd~~n~n~i~~g~--~~~~----dw~--~~~~fv~~~~~~p 228 (733) .+.. ....+...|..+.+..........++-... .+.. .-. .+..+........ T Consensus 134 --------------~~~~----~~~~g~~~~t~~~~~~~~~~g~~~~l~~~~~~~~~~~~~~~~~a~sv~~~~~~g~~~~ 195 (666) T protein:vir:65 134 --------------VDGD----GKVKGVFIPTGKIIAHAKAIGVYPELDGGWTAEFTSSSGNGSAALSVTKIVTDSGLLL 195 (666) T ss_pred --------------cccc----ccccccccccceeeccccccCcceeEeeccceeecccCcccccceeeeecccccceee Confidence 0000 000000111111111100000000000000 0000 000 0000000000000 Q ss_pred EEEeeeeecccccccceeee-ccCcceeEeecccccccccceeeeeeeeeeeccCcccCc-CCCCCCccceEEEhhhHHH Q lcl|NC_016571. 229 FILNMGVLLDNGLRVPANTI-NGTPDTTFTMFDTVNEFNTRYGLKVAVDQYTGNNVNRPV-EVQDAPFDDVFMYTQNLSD 306 (733) Q Consensus 229 f~~~~~s~~d~g~~~~~~t~-~~~~d~~~~~fd~~~~~~~~y~~~v~~d~y~~~~~~~p~-~p~~~PF~~~yvY~~Nie~ 306 (733) ........ +-...+....+ .....+..... +..+-+ .+.+............ .+.-.++........ . 
T Consensus 196 ~~~~~a~~-~~~~~~~~~~~~~~~~~a~~A~~-~g~~g~-----~i~v~i~~~~~~~~~~~~l~~~~~~~~~~~~~---~ 265 (666) T protein:vir:65 196 TDLETSRA-NITNQTFLTKLKKYDMPAVSAIY-AGEIGN-----SLEVEILARSAFKNTAPDLTMYPYGGERTAAR---N 265 (666) T ss_pred eeeccccc-ccccccccccccccccceeeeee-cccccc-----ceeEEeecccccccccccccccccccccccce---e Confidence 00000000 00000000000 00000000000 000000 0001111111000000 000000100000000 0 Q ss_pred HHHHHHhhhcccccc-----cCCcccchhcc----------CCcceeeccccccccCCCcceeEEeccccccccccccee Q lcl|NC_016571. 307 VAKELYAAEYGTDDP-----TNQPPNVLSKR----------LPKHAIMNMFDLVNHNGKPYKHIVFGGNFDQAGKVTGSR 371 (733) Q Consensus 307 Vl~~ly~aE~~~~d~-----t~~p~~~~~~~----------~~~~~~~n~f~~~~~nG~PY~~Iq~~g~~d~~g~v~gs~ 371 (733) + +. +....+. +..+...++.. ..+. ..+..+.+.-....|-........+ T Consensus 266 ~---~~--~~~~~~~~~~~~v~~~g~~~e~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~~~~~~~~~--------- 330 (666) T protein:vir:65 266 L---IP--YAPQNDNQYAFIVRRDGVVVESYVLSTLKGDKDVYGN-SIYMDDFFARGSSQYIYATAQGWVD--------- 330 (666) T ss_pred e---ec--ccccccccceeeeecCCcccceeecccCcccccccch-hhhhhhhhcccccceeeeecccccc--------- Confidence 0 00 0000000 00000000000 0000 0011111111111221111100000 Q ss_pred eccceeecccCCCCccccccccccccccccccccccchhcccccchhhhhhhhhhhhhhhhhhhhhcccccceeehheee Q lcl|NC_016571. 372 FSMNHYLQSNGGINPFADKDGKFPEAPTTWLPAIDGPWVADVSDPDLVISHKQAWEMNQMLIEAYLTTYITSLDVKDVIR 451 (733) Q Consensus 372 ~~~n~~i~~sGG~dgt~d~dgk~~d~~t~~~~~ld~~~~~d~~~~~~~~~~~~~~~m~~~l~~a~l~~~~~~~~~nd~~r 451 (733) .....+-.+||.|+.....+.+ ..++. .-+-.++++.. .+... T Consensus 331 -~~~~~~~~~~g~~~~~~~~~~~---------g~~~~----------~~~~~~~~~~~-----------------~~~~~ 373 (666) T protein:vir:65 331 -GFSGIISLAGGVSANEATTGGV---------GADPF----------IGAMMQGWDLF-----------------AERES 373 (666) T ss_pred -cccceEEccCCCCcCccccccc---------ccccc----------cccHHHHHHHH-----------------hhhhh Confidence 0111222345554431111000 00000 00011222211 11111 Q ss_pred ccceeeEecCC------CHHHHHHHHHHHhcCcceEEEEEEEee----CC-CCCHHHHHHHHHHHHhhhheeeeeeeccC Q lcl|NC_016571. 
452 NRSSFMWDVGY------NQKIKDIMIQFLSKRKDIIVVPCATEY----LR-KKTQDELYSTATMLNTRIVMIPESEVYKS 520 (733) Q Consensus 452 y~~~~~~DsGf------s~~~K~~m~~fl~~RkD~~vv~~T~~~----~~-~~s~~e~~s~a~~L~trl~~~PES~~YgT 520 (733) .-..++.--|+ ...+-.+|...-.+|||.+.++..... .. ..+.++.... ++.+.++-+.. -+. T Consensus 374 ~~~~~l~~p~~~~~~~~~~~v~~~l~~~~~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~-~~~ 448 (666) T protein:vir:65 374 IHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNIPVTTAIDNLIAW----REGSGNYNENN-MNI 448 (666) T ss_pred ccCCceeecCcCCccchhHHHHHHHHHHHhhccceEEEeccccceeeecCCCCCHHHHHHH----HHhcccccccc-ccc Confidence 12233444344 357778899999999999998854422 22 3455554333 23333322221 122 Q ss_pred ccceeEeecceeEEEeccccc--cceehHHHHHHHHHHhcCcCCcccCCCccccCCcceeeeeccCcceecCCchhhhhh Q lcl|NC_016571. 521 EACRASINLWDARYINEPTWG--RFSLNIENMYAFAVAGGGADGRIYAADMPDHEGNRILRIAHDPFVEFEADDPAANNL 598 (733) Q Consensus 521 pa~Ra~I~~~s~~l~ns~y~~--~vpln~dlA~k~A~y~Ga~dG~~~~~~~~D~~pN~~v~~~~dlnv~f~~~~v~a~~w 598 (733) -..|++++.---++.|....+ .+|..-.+|=-+|+--. ..|.|+.+---.+ ..|+-+-++.+... +.-+...- T Consensus 449 ~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~-~~g~~~span~~~---~~i~g~~~~~~~~~-~~~~~~Ln 523 (666) T protein:vir:65 449 NTTYAVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDA-VSQPWMSPAGYNR---GQIMNVVKLAIEPR-KAHRDRLY 523 (666) T ss_pred CcceEEEEcCceEEecccCCceeEechHHHHHHHHHHHhc-cCCcEEccCCeec---ceeeccccceeecC-hhHHHhhh Confidence 346777776555655544333 46777777766776543 4466664322222 12333345554443 34456667 Q ss_pred hCceeEeeeeeeeEEEccccccccCCchhhhhhhhhhhhHHHHHHHHHHHhhhhccccccChhHHHHHHHHHHHHHhhhh Q lcl|NC_016571. 599 IQGCITVTPITESQFCRPALPTVYGNINSVLKDLTNVWKCVVVEKILQDIWIQVSGDTQLGKEGYLSFVKDGAEKRIRDL 678 (733) Q Consensus 599 ~ng~i~v~~yD~~~~y~PaL~TVy~ndtSVLns~~tv~~c~~~~kv~~~~w~~fsG~~slt~~ql~~~~~d~I~~~~rd~ 678 (733) .+|...+..+..+-...=+=+|.-. 
+.|-.+-.-..-+.-++++-+.+...++.++- -++.+.+++.+-|+..++++ T Consensus 524 ~~gIn~i~~~~~~G~~~wG~rT~~~-~~s~~~~i~vrR~~~~i~~si~~~~~~~v~ep--n~~~l~~~i~~~i~~~L~~l 600 (666) T protein:vir:65 524 QAAINPVIGAGGEGFILMGDKTATT-VPSPFDRINVRRLFNMLKKNIGDSSKYKLFEN--NDNFTRASFRMEVSQYLSTI 600 (666) T ss_pred hCCceEEEEeCCCeEEEEecccCCC-CCcccceEehhhHHHHHHHHHHHHHHHhccCC--CCHHHHHHHHHHHHHHHHHH Confidence 7888888776544333334455533 33433332222344588999999988888843 25666666666666666664 Q ss_pred hcc-eecceeEecccceeccchhcCceEEEeeec--cCCcEEEEEEEEEEEccCCCCC Q lcl|NC_016571. 679 FGS-VISNWEVVPSFREDSPTSKSVMYSITRLWF--GKGIYMLNSVLEAYNEDSLNAE 733 (733) Q Consensus 679 fg~-rv~n~~vi~p~~~~T~~d~~~g~SwT~~~~--nn~~~~m~~~leayr~~~l~a~ 733 (733) +.. .+.--.|.....+-|+.|-.+|.-.-+..+ .+|.--+.+.|...|.+.--+| T Consensus 601 ~~~gal~g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e 658 (666) T protein:vir:65 601 RSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKPAKSINYIMLNFTAVATGSDFDE 658 (666) T ss_pred HhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCccHHH Confidence 421 122223467778889999999988777655 4444555666666665433333 No 5 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=56.12 E-value=0.46 Score=22.41 Aligned_cols=637 Identities=10% Similarity=0.006 Sum_probs=238.6 Q ss_pred CceeeecccceeeccceeeccccccccCCCcccccccceeeecccccCcccc--cCCcchhhhcccccCCCccccchHHH Q lcl|NC_016571. 1 METFNKVIPGKVVNEGIVSRETFPAAVPVISSPAHFPDFAAVTPRGSTKRAT--VSTGTFTSKFGDTTDPFGLYYNPVTY 78 (733) Q Consensus 1 M~~~~n~~P~~v~~~GirD~S~~~l~~~~~~~p~H~Plf~~~~p~G~~~~~~--v~~gd~~siyGD~~D~~s~yfn~~t~ 78 (733) |.+ .+|| |.++=+ |.+ ++..... --.+-|-..+++|-.+... .+-.||...||...|-.-. .. 
T Consensus 1 ~~~---~~Pg-vyv~e~-~~~-~~i~~~~----t~~~~~vg~~~~gp~~~p~~i~s~~~~~~~fg~~~~~~~~-----~~ 65 (679) T protein:vir:10 1 MTL---LSPG-VETKEI-NLQ-TTIARSS----TGRAALVGKFNWGPAYQISQVVSEVDLVDKFGRPDDQTAD-----SF 65 (679) T ss_pred Cce---ecCc-eEEEee-cCC-cccccCc----cccceeeecccCCCCccCEEecCHHHHHHHcCCcccccch-----HH Confidence 444 6798 566555 643 3332222 2356788999999875544 3777899999976543322 22 Q ss_pred HHHHHhhcccceEEEEEeccccccceeeeEEeeecc---------cccceeEcCcccEecCCCCCcccCccccccCCeeE Q lcl|NC_016571. 79 AIQKLGQAGQASFSFKRLTNNTAKSRTIIGLAIFSG---------DIPNYLRDGNGDYVLDAAGNPKVDETTATIAGKWV 149 (733) Q Consensus 79 laq~l~~a~~n~f~~~RL~p~dA~s~~~i~~dv~~~---------~VP~Y~R~~nG~~~~d~~G~p~v~~t~~tI~gk~V 149 (733) +.+..++..|+.|++-|+...++..++.-..+.+.. .+-..+|- .+..++......+....+|+.+ T Consensus 66 ~~~~~f~~gg~~~~vvrv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~-----~~~~s~~~~~~~~~~~~~~~~~ 140 (679) T protein:vir:10 66 FSGVNFLNYGNDLRLVRVLNETKSRNSSALYQSLSYTITSPGVDYKVGDVVNV-----LQGGNVIATGKVTVVNASGGIV 140 (679) T ss_pred HHHHHHHhCCCeEEEEEccCcccccccccccccccccccccccccccccceee-----eeCCCcccceeEEEeeccCcee Confidence 344556677888999999877655332211111110 11111110 0111110000011111222222 Q ss_pred EeeEeeccccccc---ceeeccccccccCcccc--CcCceeeee--heehhhccccccc-ccccccc-chhhcchhhhhh Q lcl|NC_016571. 150 CTGVLKSEGEVGE---AKAFEVTSTDETPGIPT--GTNGKFYPL--AEVIGGIGDTYNA-NWISIGH-NFQTDWNEVARF 220 (733) Q Consensus 150 ~~~~~~~~~~~g~---~~a~~~~~~~~~~~~~~--g~q~~~yPL--~E~~~~~Gd~~n~-n~i~~g~-~~~~dw~~~~~f 220 (733) ..-+...+ .... ...++....+....... ..+|...+. .......+-.... .+..... .....-..+... 
T Consensus 141 ~~~v~~~~-~~~~a~~~~~~~~l~~a~~~~~~~~t~~~g~~~~~~v~~v~~~~~~~~~~~~~a~~~i~~~~~~~~t~~~~ 219 (679) T protein:vir:10 141 AFYVPTAA-IIDKAKSLNDYPALDNAWQIQFAAGGPGAGQAATATVVGINLDSTIFVPNDEYAMSAISERSETKRTFIDI 219 (679) T ss_pred eeeecccc-cccccccccccceecccceeeeeeccccccceeeeeeeeeccCCceeeccccccccccccccccchhhhhh Confidence 11110000 0000 00000000000000000 111111111 1111100000000 0000000 000000000111 Q ss_pred hccccceeEEEeeeeecccccccceeeeccCcceeEeecccc-----cccc-cceeeeeeeeeeeccCcccCcCCCCCCc Q lcl|NC_016571. 221 VQTNGSYPFILNMGVLLDNGLRVPANTINGTPDTTFTMFDTV-----NEFN-TRYGLKVAVDQYTGNNVNRPVEVQDAPF 294 (733) Q Consensus 221 v~~~~~~pf~~~~~s~~d~g~~~~~~t~~~~~d~~~~~fd~~-----~~~~-~~y~~~v~~d~y~~~~~~~p~~p~~~PF 294 (733) .+.........+ ...+.|.......+.. +++ +..++.. .... ......+......-..... .. ..+- T Consensus 220 ~~~~~~~~~~A~--~~g~~gn~i~v~~va~-~~~-~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~ 292 (679) T protein:vir:10 220 CEEMKVPAIVAR--YAGTYGDNIKVLMIAY-KDY-YKFNEAGKIVSVNTINPKVFPTGLDYGNVTPSSYLE-FG--PQNE 292 (679) T ss_pred hhccccceeeee--cccccCCcceEEEEee-ccc-ccccccccccccccccccccccccccccceeeeecc-cc--cccc Confidence 111110000000 0111111111111000 000 0000000 0000 0000000000000000000 00 0000 Q ss_pred cceEEEhhhHHHHHHHHHhhhcccccccCCcccchhccCCcceeeccccccccCCCcceeEEecccccccccccceeecc Q lcl|NC_016571. 295 DDVFMYTQNLSDVAKELYAAEYGTDDPTNQPPNVLSKRLPKHAIMNMFDLVNHNGKPYKHIVFGGNFDQAGKVTGSRFSM 374 (733) Q Consensus 295 ~~~yvY~~Nie~Vl~~ly~aE~~~~d~t~~p~~~~~~~~~~~~~~n~f~~~~~nG~PY~~Iq~~g~~d~~g~v~gs~~~~ 374 (733) +++.+--.+-..+.. -+......++. . .... ...+.... +.++........ .++.... 
T Consensus 293 ~~~~vvv~~~g~~~~-~~~~~~~~~~~--~--~~~~-------~~~~~~~~---~~~~~~~v~~~~-------~~~~~~~ 350 (679) T protein:vir:10 293 SQFAFIVFNNGVAVE-SKILSTKPGDR--D--IYGT-------SIYINEYF---GNGYSSFVQGVA-------ESWPVGY 350 (679) T ss_pred cceeeEEeccccccc-ceeeecccccc--c--ccch-------hhhhhhhh---cCcccceeeecc-------ccccccc Confidence 111111000000000 00000000000 0 0000 01111111 111111111100 0111122 Q ss_pred ceeecccCCCCccccccccccccccccccccccchhcccccchhhhhhhhhhhhhhhhhhhhhcccccceeehheeeccc Q lcl|NC_016571. 375 NHYLQSNGGINPFADKDGKFPEAPTTWLPAIDGPWVADVSDPDLVISHKQAWEMNQMLIEAYLTTYITSLDVKDVIRNRS 454 (733) Q Consensus 375 n~~i~~sGG~dgt~d~dgk~~d~~t~~~~~ld~~~~~d~~~~~~~~~~~~~~~m~~~l~~a~l~~~~~~~~~nd~~ry~~ 454 (733) ...+..+||.|+-++.. . ++. ..++.+. .....++.+=++ =|. T Consensus 351 ~~~~~~~gg~~~~~~~~-------~-----------~~~---------~~~~~~~---------~~~~~~~~~~l~-~p~ 393 (679) T protein:vir:10 351 TGVLAFGGGQSSNTDIS-------A-----------AEF---------MKGWDMF---------ADREHTDVNLFI-AGA 393 (679) T ss_pred cceeeccCCccCCCccc-------h-----------hhh---------hhhhhhh---------hcccccccceEE-ecC Confidence 23444557766532110 0 000 0111110 000001111111 110 Q ss_pred eeeEec-CCCHHHHHHHHHHHhcCcceEEEEEEEeeCC-CCCHHHHHHHHHHHHhhhheeeeeeec--cCccceeEeecc Q lcl|NC_016571. 455 SFMWDV-GYNQKIKDIMIQFLSKRKDIIVVPCATEYLR-KKTQDELYSTATMLNTRIVMIPESEVY--KSEACRASINLW 530 (733) Q Consensus 455 ~~~~Ds-Gfs~~~K~~m~~fl~~RkD~~vv~~T~~~~~-~~s~~e~~s~a~~L~trl~~~PES~~Y--gTpa~Ra~I~~~ 530 (733) +..+. .-...+-.+|...-.+|||.|.++..+.... ..+.....+-+...++-+...-++... 
..-..|++++.- T Consensus 394 -~~~~~~~~~~~v~~~l~~~~~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~~~~~p 472 (679) T protein:vir:10 394 -VAGEGAQIASTVQKAVVAIADERRDCLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYASVDGN 472 (679) T ss_pred -CCCCchhhhHHHHHHHHHHHHhhCCeEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceEEEEcc Confidence 00111 1124566778888899999998886543322 112222233333333322222222221 123578887766 Q ss_pred eeEEEeccccc--cceehHHHHHHHHHHhcCcCCcccCCCccccCCcceeeeeccCcceecCCchhhhhhhCceeEeeee Q lcl|NC_016571. 531 DARYINEPTWG--RFSLNIENMYAFAVAGGGADGRIYAADMPDHEGNRILRIAHDPFVEFEADDPAANNLIQGCITVTPI 608 (733) Q Consensus 531 s~~l~ns~y~~--~vpln~dlA~k~A~y~Ga~dG~~~~~~~~D~~pN~~v~~~~dlnv~f~~~~v~a~~w~ng~i~v~~y 608 (733) --+..|..-.+ .+|..=.+|=-+|+--. ..|.|+.+--..++. |+-+.++.+.+. +.-++..-.+|.+.+..+ T Consensus 473 ~~~~~d~~~~~~~~~p~sg~vAGl~Ar~D~-~~g~~~sPan~~~~~---i~g~~~~~~~~~-~~~~~~Ln~~gin~i~~~ 547 (679) T protein:vir:10 473 YKYQYDKYNDVNRWIPLAADIAGLCARTDT-VGQPWQSPAGFNRGQ---IVNVIKLAVDTR-QAHRDEMYTNGINPIVGF 547 (679) T ss_pred ceeeecccCCceEEechHHHHHHHHHHhhc-cCCcEECcCCeeecc---ccccccceeecC-hhhHHhhhhCCceEEEEe Confidence 55555533222 46777777777777653 447887654333322 222334555543 344666667888888876 Q ss_pred eeeEEEccccccccCCchhhhhhhhhhhhHHHHHHHHHHHhhhhccccccChhHHHHHHHHHHHHHhhhhhcc-eeccee Q lcl|NC_016571. 609 TESQFCRPALPTVYGNINSVLKDLTNVWKCVVVEKILQDIWIQVSGDTQLGKEGYLSFVKDGAEKRIRDLFGS-VISNWE 687 (733) Q Consensus 609 D~~~~y~PaL~TVy~ndtSVLns~~tv~~c~~~~kv~~~~w~~fsG~~slt~~ql~~~~~d~I~~~~rd~fg~-rv~n~~ 687 (733) -.+-...=+=+|.-.++ |-.+-.-..=++-++|+-+.+..+.+.++ . -++.+.+.+.+-|+..+++++.. -+.--. 
T Consensus 548 ~g~G~~~wG~rT~~~~~-s~~~~i~vrR~~~~i~~si~~~~~~~v~e-p-n~~~~~~~i~~~i~~fL~~l~~~gal~gf~ 624 (679) T protein:vir:10 548 AGQGYILYGDKTASQAP-TPFDRINVRRLFNLLKKSISESAKYKLFE-L-NDAFTRSSFRSEVGSYLDTIRSLGGIYDFR 624 (679) T ss_pred cCCeEEEEcccccCCCC-cccceEehhhHHHHHHHHHHHHHHHhccC-C-CCHHHHHHHHHHHHHHHHHHHhCCceeeeE Confidence 54433333445553333 32322222234557899999999999984 2 36667777777777777765431 121222 Q ss_pred EecccceeccchhcCceEEEeeec--cCCcEEEEEEEEEEEccCCCCC Q lcl|NC_016571. 688 VVPSFREDSPTSKSVMYSITRLWF--GKGIYMLNSVLEAYNEDSLNAE 733 (733) Q Consensus 688 vi~p~~~~T~~d~~~g~SwT~~~~--nn~~~~m~~~leayr~~~l~a~ 733 (733) |+.-...=|+.|-.+|.-.-...+ .+|.--+.+.|...|.+--=+| T Consensus 625 v~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e 672 (679) T protein:vir:10 625 VVCDESNNTPAVIDRNEFVATILIKPARSINYITLSFVATSTGADFDE 672 (679) T ss_pred EEEcCCCCCHHHhhCCeEEEEEEEEecCCccEEEEEEEEeecCccHHH Confidence 466667788999999988766655 3444445555555554422222 No 6 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=45.36 E-value=0.77 Score=21.20 Aligned_cols=566 Identities=12% Similarity=0.049 Sum_probs=216.2 Q ss_pred Cceeeecccce-eeccceeeccccccccCCCcccccccceeeecccccCcccc-c-CCcchhhhcc--cccCCCccccch Q lcl|NC_016571. 1 METFNKVIPGK-VVNEGIVSRETFPAAVPVISSPAHFPDFAAVTPRGSTKRAT-V-STGTFTSKFG--DTTDPFGLYYNP 75 (733) Q Consensus 1 M~~~~n~~P~~-v~~~GirD~S~~~l~~~~~~~p~H~Plf~~~~p~G~~~~~~-v-~~gd~~siyG--D~~D~~s~yfn~ 75 (733) |+.-. 
-|+| +.--|..=.....-+++..+..--...|-..+.+|.++.-+ + +-.+++++|| |+.|.=.-.|.| T Consensus 1 ~a~~~--~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~~~~~~~~~~g~l~~~~~~a~~~ 78 (587) T protein:vir:99 1 MAVEP--FPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNYSQAKRLFRSGELLDAIELAWGS 78 (587) T ss_pred Ccccc--cCCcccccCceEEEEecCCccccCCCCCceEEEEEEecCCccceeEEeccHHHHHHHhcCcchHHHHHHHhcc Confidence 65432 1211 11113222333334455555566666789999999876655 2 4455889997 543331111111 Q ss_pred HHHHHHHHhhcccceEEEEEeccccccceeeeEEeeecccccceeEcCcccEecCCCCC-cccCccccccCC-eeEEeeE Q lcl|NC_016571. 76 VTYAIQKLGQAGQASFSFKRLTNNTAKSRTIIGLAIFSGDIPNYLRDGNGDYVLDAAGN-PKVDETTATIAG-KWVCTGV 153 (733) Q Consensus 76 ~t~laq~l~~a~~n~f~~~RL~p~dA~s~~~i~~dv~~~~VP~Y~R~~nG~~~~d~~G~-p~v~~t~~tI~g-k~V~~~~ 153 (733) . ++.++-.|+.=|+.++++.+...=.|-+.+ - .+-.-|| ..|--.+-+|++ |.....+ T Consensus 79 ~-------~~~g~~~~~~~rv~~~~~a~~~~~~l~~~a-----~--------~~G~~gN~i~v~~~~~~~~~~~~~~~~~ 138 (587) T protein:vir:99 79 N-------PNYTAGRILAMRIEDAKPASAEIGGLKITS-----K--------IYGNVANNIQVGLEKNTLSDSLRLRVIF 138 (587) T ss_pred c-------cCCCceEEEEEEcCCCceeEEEecCeEEEE-----e--------eccccccceEEEEccCCCCcceeEEEEE Confidence 1 124556788888866665443221211111 0 0111122 011111234444 2221122 Q ss_pred ee-cccccccceeeccccccccCccccCcCceeeeeheehhhccccccccccccccchhhcchhhhhhhccccceeEEEe Q lcl|NC_016571. 154 LK-SEGEVGEAKAFEVTSTDETPGIPTGTNGKFYPLAEVIGGIGDTYNANWISIGHNFQTDWNEVARFVQTNGSYPFILN 232 (733) Q Consensus 154 ~~-~~~~~g~~~a~~~~~~~~~~~~~~g~q~~~yPL~E~~~~~Gd~~n~n~i~~g~~~~~dw~~~~~fv~~~~~~pf~~~ 232 (733) .+ ++.++. ....+++.+. +. |+ ...+++..++..+.-.-.. ..+..+-.+...+.-..+.|+.... 
T Consensus 139 ~~~~~~~~~-~~~g~v~~i~---y~--g~--~~~a~~~v~~~~~t~~a~~-----~~l~~g~~~v~~yrL~~g~~~~~~~ 205 (587) T protein:vir:99 139 QDDRFNEVY-DNIGNIFTIK---YK--GE--EANATFSVEHDEETQKASR-----LVLKVGDQEVKSYDLTGGAYDYTNA 205 (587) T ss_pred ecccceeee-eeccceeeEE---ee--cc--cccceeeEeecCcceeeee-----eeeecCCceeEEEEecCCchHHHHH Confidence 22 121110 0011222221 11 11 1224444443332211111 0000010111111111111211111 Q ss_pred eeeecccccccceeeeccCcceeEeecccccccccceeeeeeeeeeeccCcccCcCCCCCCccceEEEhhhHHHHHHHHH Q lcl|NC_016571. 233 MGVLLDNGLRVPANTINGTPDTTFTMFDTVNEFNTRYGLKVAVDQYTGNNVNRPVEVQDAPFDDVFMYTQNLSDVAKELY 312 (733) Q Consensus 233 ~~s~~d~g~~~~~~t~~~~~d~~~~~fd~~~~~~~~y~~~v~~d~y~~~~~~~p~~p~~~PF~~~yvY~~Nie~Vl~~ly 312 (733) + .+....++.. -+.|..++ .+.++..|.- ..+..++.--.....+.+.++-.|.. ...++ T Consensus 206 ~---~~~i~~~~~~------tAky~~~~-~~~i~~~~~~-----~~~~~~v~~~~~~v~a~~~D~~~~~~-----~~~~~ 265 (587) T protein:vir:99 206 I---ITDINQLPDF------EAKLSPFG-DKNLESSKLD-----KIENANIKDKAVYVKAVFGDLEKQTA-----YNGIV 265 (587) T ss_pred H---Hhhhccccce------eEEeeccC-CceeEeeccc-----ccccceeeeeeeeeehhccceeeecc-----cceee Confidence 1 0000000000 11121111 0111111100 00000000000000111122211111 00011 Q ss_pred hhhcccccccCCcccchhccCCcceeeccccccccCCCcceeEEecccccccccccceeeccceeecccCCCCccccccc Q lcl|NC_016571. 313 AAEYGTDDPTNQPPNVLSKRLPKHAIMNMFDLVNHNGKPYKHIVFGGNFDQAGKVTGSRFSMNHYLQSNGGINPFADKDG 392 (733) Q Consensus 313 ~aE~~~~d~t~~p~~~~~~~~~~~~~~n~f~~~~~nG~PY~~Iq~~g~~d~~g~v~gs~~~~n~~i~~sGG~dgt~d~dg 392 (733) ..+ .+++. .....+.-+..+.+.......+..-. +..-......||+||+.+. 
T Consensus 266 ~~~----------------~~~g~-~~~~~~~~~~~~~~~~~~~~~~~~~~--------~a~~~~t~LtGG~dG~~~~-- 318 (587) T protein:vir:99 266 SFE----------------QLNAE-GEVPSNVEVEAGEESATVTATSPIKT--------IEPFELTKLKGGTNGEPPA-- 318 (587) T ss_pred eee----------------ecccc-cchhhhhhhhhccccceeeeeccccc--------eecccceeeecCCCCCccc-- Confidence 000 00110 00111222334444444444332110 0110123466999996221 Q ss_pred cccccccccccccccchhcccccchhhhhhhhhhhhhhhhhhhhhcccccceeehheeeccceeeEecCCCHHHHHHHHH Q lcl|NC_016571. 393 KFPEAPTTWLPAIDGPWVADVSDPDLVISHKQAWEMNQMLIEAYLTTYITSLDVKDVIRNRSSFMWDVGYNQKIKDIMIQ 472 (733) Q Consensus 393 k~~d~~t~~~~~ld~~~~~d~~~~~~~~~~~~~~~m~~~l~~a~l~~~~~~~~~nd~~ry~~~~~~DsGfs~~~K~~m~~ 472 (733) ++. ..|| .|+ .. +.+ .+...+. .+.+|..+.+ T Consensus 319 ---sy~----~al~-------------------------ale---~~-----~~~-------~i~~~t~-d~~i~a~l~a 350 (587) T protein:vir:99 319 ---TWA----DKLD-------------------------KFA---HE-----GGY-------YIVPLSS-KQSVHAEVAS 350 (587) T ss_pred ---cHH----HHHH-------------------------HHh---hC-----CcE-------EEEecCC-CHHHHHHHHH Confidence 100 0010 111 00 111 1111222 5688999999 Q ss_pred HHhcCcc----eEEEEEEEeeCCCCCHHHHHHHHHHHHhhhheeeeeeeccCccceeEeecceeEEEec-cccccceehH Q lcl|NC_016571. 473 FLSKRKD----IIVVPCATEYLRKKTQDELYSTATMLNTRIVMIPESEVYKSEACRASINLWDARYINE-PTWGRFSLNI 547 (733) Q Consensus 473 fl~~RkD----~~vv~~T~~~~~~~s~~e~~s~a~~L~trl~~~PES~~YgTpa~Ra~I~~~s~~l~ns-~y~~~vpln~ 547 (733) |+.++++ .+.|+... ..-++++...++..+... |...+.-.+.+.++ .-...+|- T Consensus 351 ~vk~~r~~g~~~~aVlg~~---~~~~~~~~~~~a~~~n~e---------------~vi~v~~~~~~~~~dg~~~~~~~-- 410 (587) T protein:vir:99 351 FVKERSDAGEPMRAIVGGG---FNESKEQLFGRQASLSNP---------------RVSLVANSGTFVMDDGRKNHVPA-- 410 (587) T ss_pred HHHHHHhCCCcEEEEecCC---CCCCHHHHHHHhhhcCCC---------------cEEEEeccceEecCCCceeeech-- Confidence 9987766 66666431 234677777666666431 33333333333322 11222332 Q ss_pred HHHHHHHHHhcCcCCcccCCCccccCCcceeeeeccCcceecCCchhhhhhhCceeEeeeeeeeE----EEccccccccC Q lcl|NC_016571. 
548 ENMYAFAVAGGGADGRIYAADMPDHEGNRILRIAHDPFVEFEADDPAANNLIQGCITVTPITESQ----FCRPALPTVYG 623 (733) Q Consensus 548 dlA~k~A~y~Ga~dG~~~~~~~~D~~pN~~v~~~~dlnv~f~~~~v~a~~w~ng~i~v~~yD~~~----~y~PaL~TVy~ 623 (733) ++..|..+|..-|. .+-.++.+.+-.+.++.-.|... -..+.-.+|.+.+.....+. ..-=++-|.-. T Consensus 411 --~~~aa~vAGl~Ag~-----~~~~SlT~~~i~~~~v~~~~t~~-e~e~li~~Gvl~l~~~~~~~~~~vriv~~ItT~t~ 482 (587) T protein:vir:99 411 --YMVAVALGGLASGL-----EIGESITFKPLRVSSLDQIYESI-DLDELNENGIISIEFVRNRTNTFFRIVDDVTTFND 482 (587) T ss_pred --HHHHHHHHHHHhcC-----chhcCccceeeecccccccCCHH-HHHHHHhCCeEEEEEecCCcceEEEEeeceeeccC Confidence 11222233333222 13345555444466777777644 36667778877765443321 11134444333 Q ss_pred CchhhhhhhhhhhhHHHHHHHHHHHhh-hhccccccChhHHHHHHHHHHHHHhhhhhccee-cceeEecccceeccchhc Q lcl|NC_016571. 624 NINSVLKDLTNVWKCVVVEKILQDIWI-QVSGDTQLGKEGYLSFVKDGAEKRIRDLFGSVI-SNWEVVPSFREDSPTSKS 701 (733) Q Consensus 624 ndtSVLns~~tv~~c~~~~kv~~~~w~-~fsG~~slt~~ql~~~~~d~I~~~~rd~fg~rv-~n~~vi~p~~~~T~~d~~ 701 (733) +.+-.++.+-+.=..=++.+-+.+.+. .|-|. ...+.....++..|...+..+-..+. .+-.. +..+++..+.. T Consensus 483 ~~~~~~~~i~viRv~D~i~~di~~~~~~~yiGk--~Nn~~~r~~i~~~i~~~L~~l~~~gaI~~~~~--~dv~v~~~~d~ 558 (587) T protein:vir:99 483 KSDPVKAEMAVGEANDFLVSELKVQLEDQFIGT--RTINTSASIIKDFIQSYLGRKKRDNEIQDFPA--EDVQVIVEGNE 558 (587) T ss_pred CCCchhhhhhhhhhHHHHHHHHHHHHHhhCCcc--ccchHHHHHHHHHHHHHHHHHHhCCcccCCCc--cceEEEecCCE Confidence 334345555444444566666666664 68884 34445555656666666655554433 11100 12233332222 Q ss_pred CceEEEeeeccCCcEEEEEEEE--EEEccCCCC Q lcl|NC_016571. 
702 VMYSITRLWFGKGIYMLNSVLE--AYNEDSLNA 732 (733) Q Consensus 702 ~g~SwT~~~~nn~~~~m~~~le--ayr~~~l~a 732 (733) +.+.|.-.|...|++-.- .+|.|+|+| T Consensus 559 ----~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:99 559 ----ARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred ----EEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 333444445555554443 467788888 No 7 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=39.91 E-value=1 Score=20.59 Aligned_cols=615 Identities=11% Similarity=0.038 Sum_probs=225.6 Q ss_pred CceeeecccceeeccceeeccccccccCCCcccccccceeeecccccCcccc--cCCcchhhhcccccCCCccccchHHH Q lcl|NC_016571. 1 METFNKVIPGKVVNEGIVSRETFPAAVPVISSPAHFPDFAAVTPRGSTKRAT--VSTGTFTSKFGDTTDPFGLYYNPVTY 78 (733) Q Consensus 1 M~~~~n~~P~~v~~~GirD~S~~~l~~~~~~~p~H~Plf~~~~p~G~~~~~~--v~~gd~~siyGD~~D~~s~yfn~~t~ 78 (733) |++ .+|| |-++=| |. +++...... -.+-|-..++||-.+... .+-.||..+||...|-...+ . T Consensus 1 ~~~---~~Pg-Vyv~e~-~~-~~~i~~v~t----s~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~-----~ 65 (660) T protein:vir:68 1 MAL---LSPG-VELKET-TV-QSTVVNNST----GTAALAGKFQWGPAFQIKQITDEVALVDMFGTPNTDTADY-----F 65 (660) T ss_pred Ccc---ccCc-eEEEEe-cC-CcccccCCC----cceeEEecccCCCCccCEEecCHHHHHHhcCCccCccchh-----H Confidence 555 5798 677655 63 344333333 356688999999874444 36778999999765433222 2 Q ss_pred HHHHHhhcccceEEEEEeccccccceeeeEEeeecccccceeEcCcccEecCCCCCcccCccccccCC----ee-EEeeE Q lcl|NC_016571. 79 AIQKLGQAGQASFSFKRLTNNTAKSRTIIGLAIFSGDIPNYLRDGNGDYVLDAAGNPKVDETTATIAG----KW-VCTGV 153 (733) Q Consensus 79 laq~l~~a~~n~f~~~RL~p~dA~s~~~i~~dv~~~~VP~Y~R~~nG~~~~d~~G~p~v~~t~~tI~g----k~-V~~~~ 153 (733) +...-++..|+.|++-|+.......++.-....+. .-.+. -|... 
..|+ +.++.+++ -+ ....+ T Consensus 66 ~~~~~f~~~g~~~~vvRv~~~~~~~~~~~~~~~~~----~t~~~-~g~~~--~~g~----~~~v~~~~~~~~~~~~~~~~ 134 (660) T protein:vir:68 66 MSAMNFLQYGNDLRVVRAVDRDTAKNSSPVAGNIN----FTISS-AGTNY--RVGD----KVVVKYSTDIIEPDGEVTSV 134 (660) T ss_pred HHHHHHHhCCCeEEEEEecccccccccccccccce----eeeec-cCcce--eeee----eeeeecccccccccccceee Confidence 22333456777899999864333222211111111 00000 00000 0000 00000000 00 00000 Q ss_pred eecccccccceeeccccccccCccccCcCceeeeeh------eehhhccccccccccccccchhhcchhhhhhhccccce Q lcl|NC_016571. 154 LKSEGEVGEAKAFEVTSTDETPGIPTGTNGKFYPLA------EVIGGIGDTYNANWISIGHNFQTDWNEVARFVQTNGSY 227 (733) Q Consensus 154 ~~~~~~~g~~~a~~~~~~~~~~~~~~g~q~~~yPL~------E~~~~~Gd~~n~n~i~~g~~~~~dw~~~~~fv~~~~~~ 227 (733) +.... +.......+.... ...+...+|.. ++.++.+...- .+. . .........- T Consensus 135 -~~~~~-----~~~~~~~ta~~~~-~a~~~~~~~~~~~~~~~~v~~~~~~~~~-~~~-----v-------~~~~~d~~~~ 194 (660) T protein:vir:68 135 -DSDGK-----ILNIFIPSGKIIA-KAKEIGEYPELGSNWTAEMSGSSSGLSA-VIT-----I-------DSVVMDSGIL 194 (660) T ss_pred -eecCc-----eeeeeeccccccc-cceeeccccccccceeEEeeccccccee-eee-----e-------ccccccccce Confidence 00000 0000000000000 00000011110 00000000000 000 0 0000000000 Q ss_pred eEEEeeeeecccccccceeeeccCcceeEeecccccccccceeeeeeeeeeeccCcccC--cCCCCC--------Cccce Q lcl|NC_016571. 228 PFILNMGVLLDNGLRVPANTINGTPDTTFTMFDTVNEFNTRYGLKVAVDQYTGNNVNRP--VEVQDA--------PFDDV 297 (733) Q Consensus 228 pf~~~~~s~~d~g~~~~~~t~~~~~d~~~~~fd~~~~~~~~y~~~v~~d~y~~~~~~~p--~~p~~~--------PF~~~ 297 (733) ... ....+..+............+..-..-.. ....+-++.+......+.... ..+... 
...++ T Consensus 195 ~~~--~~ta~~~~~~~~~~~~~~~~~~~~~~A~~----~g~~G~~i~v~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~ 268 (660) T protein:vir:68 195 LTE--VETSEEAITSLTFQESIKKYGVPGVVALY----PGELGDQLEIEIVSKADYDKGASAQLKIYPDGGTRYSTAKAI 268 (660) T ss_pred eee--eccccccccccceeeeecccCcccccccc----ccccccceEEEEeccccccccccccceeeecccccccceeeE Confidence 000 00000000000000000000000000000 000000011111111000000 000000 00000 Q ss_pred EEEhhhHHHHHHHHHhhhcccccccCCcccchhc-----cCCcceeeccccccccCCCcceeEEecccccccccccceee Q lcl|NC_016571. 298 FMYTQNLSDVAKELYAAEYGTDDPTNQPPNVLSK-----RLPKHAIMNMFDLVNHNGKPYKHIVFGGNFDQAGKVTGSRF 372 (733) Q Consensus 298 yvY~~Nie~Vl~~ly~aE~~~~d~t~~p~~~~~~-----~~~~~~~~n~f~~~~~nG~PY~~Iq~~g~~d~~g~v~gs~~ 372 (733) .... ... -. .++.....++... ..+.+.. ...+. .....+.....+..|-.....+. .. T Consensus 269 ~~~~-~~~--~~-~~~~~v~~~~~~~-~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~~~~~~~----------~~ 332 (660) T protein:vir:68 269 FGYG-PQT--DD-QYAIIVRRNDSVV-QSVVLSTKRGERDIYGS-NIFIDDFFAKGASNYIFATAQGW----------PK 332 (660) T ss_pred eecc-ccc--cc-ceeeeeecCCcce-eeeeeeccccccccccc-ceeeehhhccCcccEEEEeecCC----------Cc Confidence 0000 000 00 0000000000000 0000000 00000 00111111111111111100000 00 Q ss_pred ccceeecccCCCCccccccccccccccccccccccchhcccccchhhhhhhhhhhhhhhhhhhhhcccccceeehheeec Q lcl|NC_016571. 373 SMNHYLQSNGGINPFADKDGKFPEAPTTWLPAIDGPWVADVSDPDLVISHKQAWEMNQMLIEAYLTTYITSLDVKDVIRN 452 (733) Q Consensus 373 ~~n~~i~~sGG~dgt~d~dgk~~d~~t~~~~~ld~~~~~d~~~~~~~~~~~~~~~m~~~l~~a~l~~~~~~~~~nd~~ry 452 (733) ...+.+...||.|+.+. +++.+. ..+++.-..+ ..+..+-++-- T Consensus 333 ~~~~~~~~~gg~~~~~~-----------------------~~~~~~----~~~~~~~~~~---------~~~~~~~l~~~ 376 (660) T protein:vir:68 333 GFSGVIKLNGGLSSNET-----------------------VEAGDL----MEAWDLFADR---------ESVNAQLFIAG 376 (660) T ss_pred cccceeeeccccccccc-----------------------cccchh----hhHHHHhhhh---------hccccceeecc Confidence 11122233455554211 000000 0111111100 00111111111 Q ss_pred cceeeEecCCCHHHHHHHHHHHhcCcceEEEEEEEee----CC-CCCHHHHHHHHHHHHhhhheeeeeeeccCccceeEe Q lcl|NC_016571. 
453 RSSFMWDVGYNQKIKDIMIQFLSKRKDIIVVPCATEY----LR-KKTQDELYSTATMLNTRIVMIPESEVYKSEACRASI 527 (733) Q Consensus 453 ~~~~~~DsGfs~~~K~~m~~fl~~RkD~~vv~~T~~~----~~-~~s~~e~~s~a~~L~trl~~~PES~~YgTpa~Ra~I 527 (733) +.+-. +..=...++.+|.+.-.+|||.|.++..... .. ..+.+ .+...++.+..+.+... +.-..|+++ T Consensus 377 ~~~~~-~~~~~~~v~~~l~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~-~~~s~~~~~ 450 (660) T protein:vir:68 377 SCAGE-SLEVASTVQKHVVAIGDSRQDCLVLCSPPRAAVVGIPVNRAVD----NLVDWRTASGTYTDNNF-NISSTYAAI 450 (660) T ss_pred ccCCC-chHHHHHHHHHHHHHHHhhCCeEEEEcccceeEecCCCCCCHH----HHHHHHhhccccccccc-ccCcceEEE Confidence 11100 0001246788999999999999999854432 12 23433 34444555444433221 233456777 Q ss_pred ecceeEEEecccc--ccceehHHHHHHHHHHhcCcCCcccCCCccccCCcceeeeeccCcceecCCchhhhhhhCceeEe Q lcl|NC_016571. 528 NLWDARYINEPTW--GRFSLNIENMYAFAVAGGGADGRIYAADMPDHEGNRILRIAHDPFVEFEADDPAANNLIQGCITV 605 (733) Q Consensus 528 ~~~s~~l~ns~y~--~~vpln~dlA~k~A~y~Ga~dG~~~~~~~~D~~pN~~v~~~~dlnv~f~~~~v~a~~w~ng~i~v 605 (733) +.---+..|.... ..+|..=.+|=-+|+--. ..|.|+.+--..+ ..|+-.-++.+.. .+.-++..=.+|.+.+ T Consensus 451 ~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~d~-~~g~~~span~~~---~~i~g~~~~~~~~-~~~~~~~Ln~~gIn~i 525 (660) T protein:vir:68 451 DGNYKYQYDKYNDVNRWVPLAADIAGLCARTDN-ISQPWMSPAGYNR---GQILNVIKLAIET-RQAQRDRLYQEAINPV 525 (660) T ss_pred EcCceEEecccCCceEEechhHHHHHHHHHHhc-cCCcEEccCCeee---ceeeccceeeecC-ChhHHHHHhhCCceEE Confidence 6554444443222 246777777777777653 3367775421111 1233333444443 3445666668888888 Q ss_pred eeeeeeEEEccccccccCCchhhhhhhhhhhhHHHHHHHHHHHhhhhccccccChhHHHHHHHHHHHHHhhhhhcc-eec Q lcl|NC_016571. 606 TPITESQFCRPALPTVYGNINSVLKDLTNVWKCVVVEKILQDIWIQVSGDTQLGKEGYLSFVKDGAEKRIRDLFGS-VIS 684 (733) Q Consensus 606 ~~yD~~~~y~PaL~TVy~ndtSVLns~~tv~~c~~~~kv~~~~w~~fsG~~slt~~ql~~~~~d~I~~~~rd~fg~-rv~ 684 (733) ..+..+-..+=+=+|.-.++ |..+-.-..=+.-++|+-+.+..+++.++ .+ ++.+.+++.+-|+..+++++.. .+. 
T Consensus 526 ~~~~~~G~~~wG~rT~~~~~-s~~~~i~vrR~~~~i~~si~~~~~~~v~e-pn-~~~~~~~i~~~i~~~L~~l~~~gal~ 602 (660) T protein:vir:68 526 TGTGGDGYVLYGDKTATSVP-SPFDRINVRRLFNMVKTNIGSASKYRLFE-LN-NAFTRSSFRTETSQYLQGIKALGGVY 602 (660) T ss_pred EEecCCeEEEEcceecCCCC-cccceEehhhHHHHHHHHHHHHHHHhccC-CC-CHHHHHHHHHHHHHHHHHHHhcCcee Confidence 87765544444445664443 43333222245567889888888888884 33 5566677777777777765432 222 Q ss_pred ceeEecccceeccchhcCceEEEeeec--cCCcEEEEEEEEEEEccCCCCC Q lcl|NC_016571. 685 NWEVVPSFREDSPTSKSVMYSITRLWF--GKGIYMLNSVLEAYNEDSLNAE 733 (733) Q Consensus 685 n~~vi~p~~~~T~~d~~~g~SwT~~~~--nn~~~~m~~~leayr~~~l~a~ 733 (733) .-.|+....+-|+.|-.+|.-.-...+ .+|.--+.+.|...|.+--=+| T Consensus 603 gf~V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~i~l~~~~~~~~~~~~e 653 (660) T protein:vir:68 603 NFKVVCDTTNNTPAVIDRNEFVATFYLQPARSINYITLNFVATATGADFDE 653 (660) T ss_pred eeEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCccHHH Confidence 233466778888999899987666544 4444445566666655422223 No 8 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=34.43 E-value=1.3 Score=19.97 Aligned_cols=545 Identities=14% Similarity=0.041 Sum_probs=218.4 Q ss_pred Cceeeeccccee-eccceeeccccccccCCCcccccccceeeecccccCcccc-c-CCcchhhhcc--cccCCCccccch Q lcl|NC_016571. 1 METFNKVIPGKV-VNEGIVSRETFPAAVPVISSPAHFPDFAAVTPRGSTKRAT-V-STGTFTSKFG--DTTDPFGLYYNP 75 (733) Q Consensus 1 M~~~~n~~P~~v-~~~GirD~S~~~l~~~~~~~p~H~Plf~~~~p~G~~~~~~-v-~~gd~~siyG--D~~D~~s~yfn~ 75 (733) |+.-. 
-|||- .--|..=...---++++++.+-+..-|-..+++|..+.-+ + +-.++.++|| ++-|-...+|+| T Consensus 1 ~~~~~--~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~G~~~~~~~~~~~~~~~~~f~~g~l~~a~~~a~~~ 78 (569) T protein:vir:80 1 MAVEQ--FPRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKGGKPDTVYRFRNYQQAKQVLRSGDLLDAIELAWNA 78 (569) T ss_pred Ceeee--ecCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeCCCCCceeEEecCHHHHHHHhcCCchhHHHHhhccC Confidence 65432 34331 1112222222334567777788888899999999877665 3 4555899994 766777777777 Q ss_pred HHHHHHHHhhcccceEEEEEeccccccceeeeEEeeecccccceeEcCcccEecCCCCC-cccCccccccCCeeEEeeEe Q lcl|NC_016571. 76 VTYAIQKLGQAGQASFSFKRLTNNTAKSRTIIGLAIFSGDIPNYLRDGNGDYVLDAAGN-PKVDETTATIAGKWVCTGVL 154 (733) Q Consensus 76 ~t~laq~l~~a~~n~f~~~RL~p~dA~s~~~i~~dv~~~~VP~Y~R~~nG~~~~d~~G~-p~v~~t~~tI~gk~V~~~~~ 154 (733) ..-+. ++++ .|+.-|.-+..|.+...=.+-+-. ..-| +.|| ..|.-.+.++.|- T Consensus 79 ~~~~~----~~~~-~~~~~rv~~a~~a~~~~~~~~~~a--------~~~g-----~~~n~i~v~l~~~~~~~~------- 133 (569) T protein:vir:80 79 SDVNT----ASAG-DILAVRVEDAKNATLTKGGLTFAS--------TIYG-----VDANEIQVALEDNNLTHT------- 133 (569) T ss_pred ccccc----cCce-EEEEEEcCCCeeeeeeccceeeee--------eecc-----CCCceEEEEEecCcCCcc------- Confidence 62221 2333 477777744444333321110000 0001 0011 0011111111110 Q ss_pred ecccccccceeeccccccccCccccCcCceeeeeheehhhccccccccccccccchhhcchhhhhhhccccceeEEEeee Q lcl|NC_016571. 155 KSEGEVGEAKAFEVTSTDETPGIPTGTNGKFYPLAEVIGGIGDTYNANWISIGHNFQTDWNEVARFVQTNGSYPFILNMG 234 (733) Q Consensus 155 ~~~~~~g~~~a~~~~~~~~~~~~~~g~q~~~yPL~E~~~~~Gd~~n~n~i~~g~~~~~dw~~~~~fv~~~~~~pf~~~~~ 234 (733) +.+. ..... +.+.++|+ .+|++..-.+.. -..++-+. ........|-+.... 
T Consensus 134 ---------~~~~---v~~~~----~~~~~~~~---~ig~v~si~ytg-~~~~a~~~--------~~~~~~~~~a~~l~~ 185 (569) T protein:vir:80 134 ---------KRLT---VAFSK----DGYKKVFD---NLGKIFSIQYKG-SEAQANFT--------IAQDSISKKATTLTL 185 (569) T ss_pred ---------eeeE---Eeeec----CCCccccc---cccceeeEEEee-ccccceEE--------eecCcCcceeEEEEE Confidence 0000 00000 00111111 111110000000 00000000 000111122222111 Q ss_pred eeccccccc---ceeeec-cCcceeEeecccccccccceeeeeeeeeeeccCcccCcCCCCCCccceEE-EhhhHHHHHH Q lcl|NC_016571. 235 VLLDNGLRV---PANTIN-GTPDTTFTMFDTVNEFNTRYGLKVAVDQYTGNNVNRPVEVQDAPFDDVFM-YTQNLSDVAK 309 (733) Q Consensus 235 s~~d~g~~~---~~~t~~-~~~d~~~~~fd~~~~~~~~y~~~v~~d~y~~~~~~~p~~p~~~PF~~~yv-Y~~Nie~Vl~ 309 (733) ...+++... ..+.+. +..+....+ +..|.. -.+|+--|+ +..|+...-. T Consensus 186 ~~g~~~~~~~~v~~~~~~~~~~~~~~~l----------------v~~~~~----------~~~f~a~~~~~~~~~~~~~~ 239 (569) T protein:vir:80 186 NVGSEPESTTEVMKYELGQGVYSETNVL----------------VSAINS----------LPDWEAKFFPIGDKNLPTDA 239 (569) T ss_pred EecCCcceeEEEEeeccCCccchhhhhh----------------hhhcCC----------ccCceEEEEecCCCcceehh Confidence 212211111 111100 000000000 000000 000111110 0011000000 Q ss_pred HHHhhhcccccccCCcccchhccCCcceeeccccccccCCCcceeEEecccccccccccceeeccceeecccCCCCcccc Q lcl|NC_016571. 310 ELYAAEYGTDDPTNQPPNVLSKRLPKHAIMNMFDLVNHNGKPYKHIVFGGNFDQAGKVTGSRFSMNHYLQSNGGINPFAD 389 (733) Q Consensus 310 ~ly~aE~~~~d~t~~p~~~~~~~~~~~~~~n~f~~~~~nG~PY~~Iq~~g~~d~~g~v~gs~~~~n~~i~~sGG~dgt~d 389 (733) ++..+. .+..+ .+..+-. +.+ .+-..+ +..+|=..+..|.- . .-+ -.+....||.||+.. T Consensus 240 -~d~~~~-~~~~t-~~~~~~~--~~~----di~~~~--~~~~~v~~~~~~~~----~----l~~-~~~~~LtGG~dG~~~ 299 (569) T protein:vir:80 240 -LEAVTK-VDVKT-EAVFVGA--LAG----DIAKQL--EYNDYVTVAVDATK----P----VED-FELTNLTGGSDGTAP 299 (569) T ss_pred -ccchhh-eeccc-cceeeeh--hHH----HHHHhh--cCCceEEEEecCCc----c----eee-ecceeecCCCCCCcc Confidence 111110 00000 0000000 000 000001 23455555543321 1 111 134567799999621 Q ss_pred ccccccccccccccccccchhcccccchhhhhhhhhhhhhhhhhhhhhcccccceeehheeeccce-eeEecCCCHHHHH Q lcl|NC_016571. 
390 KDGKFPEAPTTWLPAIDGPWVADVSDPDLVISHKQAWEMNQMLIEAYLTTYITSLDVKDVIRNRSS-FMWDVGYNQKIKD 468 (733) Q Consensus 390 ~dgk~~d~~t~~~~~ld~~~~~d~~~~~~~~~~~~~~~m~~~l~~a~l~~~~~~~~~nd~~ry~~~-~~~DsGfs~~~K~ 468 (733) . ++..+ |.++ ..+.+. +..++. .+.+|. T Consensus 300 ~------------------------------~~~~~-------l~~l-------------e~~~~~~i~~~t~-d~av~~ 328 (569) T protein:vir:80 300 E------------------------------SWANK-------FPLL-------------ANEGGYYLVPLTD-KQAVHS 328 (569) T ss_pred c------------------------------hHHHH-------HHHH-------------hhCCcEEEEecCC-ChHHHH Confidence 1 00111 1110 011111 223333 678899 Q ss_pred HHHHHHhcCcc----eEEEEEEEeeCCCCCHHHHHHHHHHHHhhhheeeeeeeccCccceeEeecceeEEEeccccccce Q lcl|NC_016571. 469 IMIQFLSKRKD----IIVVPCATEYLRKKTQDELYSTATMLNTRIVMIPESEVYKSEACRASINLWDARYINEPTWGRFS 544 (733) Q Consensus 469 ~m~~fl~~RkD----~~vv~~T~~~~~~~s~~e~~s~a~~L~trl~~~PES~~YgTpa~Ra~I~~~s~~l~ns~y~~~vp 544 (733) .+..|+.+++| .+.|+. ....-++++.-.++..++.. |.+++.-++...++.-+.. T Consensus 329 ~l~a~vkr~r~~g~~~~aVvg---~~~~~~~~~~~~~a~~~n~e---------------~vv~v~~~~~~~~~~g~~~-- 388 (569) T protein:vir:80 329 EALAFVKDRTDNGDPMRIIVG---GGTNETVEESITRATNLRDP---------------RASLVGFSGTRKMDDGRLL-- 388 (569) T ss_pred HHHHHHHHHHhCCCcEEEEec---CCCCCCHHHHHHHHhhcCCC---------------eEEEEecCceeecCCCcce-- Confidence 99999998866 565553 11224666666665555322 5555554554444322111 Q ss_pred ehHHHHHHHHHHhcCcCCcccCCCccccCCcceeeeeccCcceecCCchhhhhhhCceeEeeeeeeeEE--Ec--ccccc Q lcl|NC_016571. 545 LNIENMYAFAVAGGGADGRIYAADMPDHEGNRILRIAHDPFVEFEADDPAANNLIQGCITVTPITESQF--CR--PALPT 620 (733) Q Consensus 545 ln~dlA~k~A~y~Ga~dG~~~~~~~~D~~pN~~v~~~~dlnv~f~~~~v~a~~w~ng~i~v~~yD~~~~--y~--PaL~T 620 (733) .|.-+...|..+|..-|.= +..++-+++-.+.++.-.|.. +-..+.-.+|.+.+..-+.+.. 
++ =.+.| T Consensus 389 -~~~~~~~aa~vAG~~A~~~-----~~~S~T~k~i~~~~i~~~lt~-~e~~~li~~G~~~l~~~~~~~~~v~~~vn~itT 461 (569) T protein:vir:80 389 -KLPGYMMASQIAGIASGLE-----VGEAITFKHFNVTSVDRVFES-SQLDMLNESGVISIEFVRNRTLTAFRVVQDVTT 461 (569) T ss_pred -eechhhHHHHHHHHHhcCc-----cccCccceeeccccccccCCH-HHHHHHHhCCeEEEEEecCceEEEEEEecccee Confidence 1222334445556444422 334555554446677777753 3455666778777766543322 21 35555 Q ss_pred ccCCchhhhhhhhhhhhHHHHHHHHHHHhh-hhccccccChhHHHHHHHHHHHHHhhhhhccee-cceeEecccceeccc Q lcl|NC_016571. 621 VYGNINSVLKDLTNVWKCVVVEKILQDIWI-QVSGDTQLGKEGYLSFVKDGAEKRIRDLFGSVI-SNWEVVPSFREDSPT 698 (733) Q Consensus 621 Vy~ndtSVLns~~tv~~c~~~~kv~~~~w~-~fsG~~slt~~ql~~~~~d~I~~~~rd~fg~rv-~n~~vi~p~~~~T~~ 698 (733) .-.+.+..++..-+.=..=++++-+.+.|+ .|-|. ...+.....++..|...+..+-..+. .+.. ..+.+++.. T Consensus 462 ~t~~~~~~~~~i~viRv~D~i~~dir~~~~~~yiGk--~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~--~~dv~v~~~ 537 (569) T protein:vir:80 462 YNDKSDPVKNEMSVGEANDFLVSELKIELDNNFIGT--KVIDTSASLIKNFIQSFLDNKKRAREIQDYT--PEEVQVVLE 537 (569) T ss_pred cCCCCCchhhhhhhhHHHHHHHHHHHHHHHhhcCcc--cCChhHHHHHHHHHHHHHHHHHhCCcccCCC--ccceEEEec Confidence 444555555555555555566666666665 58884 34444455556666665555554332 1100 011222222 Q ss_pred hhcCceEEEeeeccCCcEEEEEEEE--EEEccCCCC Q lcl|NC_016571. 699 SKSVMYSITRLWFGKGIYMLNSVLE--AYNEDSLNA 732 (733) Q Consensus 699 d~~~g~SwT~~~~nn~~~~m~~~le--ayr~~~l~a 732 (733) .. 
.+.+.|.-.|-.-|++.+- .+|.|+|+| T Consensus 538 ~d----~~~v~~~v~Pv~~~ekI~~ti~~~~~~~~~ 569 (569) T protein:vir:80 538 GD----VASISMTVMPIRSLNKITVQLVYKQQILTA 569 (569) T ss_pred CC----EEEEEEEEEEcccccEEEEEEEEeeeeecC Confidence 11 2445555556666665554 467899999 No 9 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=32.44 E-value=1.4 Score=19.74 Aligned_cols=625 Identities=15% Similarity=0.099 Sum_probs=236.3 Q ss_pred CceeeecccceeeccceeeccccccccCCCcccccccceeeecccccCcccc--cCCcchhhhcccccCCCccccchHHH Q lcl|NC_016571. 1 METFNKVIPGKVVNEGIVSRETFPAAVPVISSPAHFPDFAAVTPRGSTKRAT--VSTGTFTSKFGDTTDPFGLYYNPVTY 78 (733) Q Consensus 1 M~~~~n~~P~~v~~~GirD~S~~~l~~~~~~~p~H~Plf~~~~p~G~~~~~~--v~~gd~~siyGD~~D~~s~yfn~~t~ 78 (733) |.+=. ..|| |-++=+ |.+ +....+..+ ..-|-..++||-.+... .+-.||..+||...+-. |.+.. T Consensus 1 M~~~~-~~Pg-Vyv~e~-~~~-~~~~~~~t~----~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~---~~~~~- 68 (749) T protein:vir:10 1 MATNQ-SSPG-VVIQER-DLT-TVSTIPTAN----VGVIAAPFTKGPVEEVIEITSERQLAEKFGEPNESN---YEYWF- 68 (749) T ss_pred CCccc-cCCe-eEEEEe-cCC-cccccccCc----eeEEEeccCCCCCccCEEcCCHHHHHHHcCCccCCc---ccHHH- Confidence 98622 5798 566544 543 333333332 44588999999875555 26677999999754422 33332 Q ss_pred HHHHHhhcccceEEEEEeccccccceee----eEE---eee----ccccccee-----------------EcCccc---- Q lcl|NC_016571. 79 AIQKLGQAGQASFSFKRLTNNTAKSRTI----IGL---AIF----SGDIPNYL-----------------RDGNGD---- 126 (733) Q Consensus 79 laq~l~~a~~n~f~~~RL~p~dA~s~~~----i~~---dv~----~~~VP~Y~-----------------R~~nG~---- 126 (733) ....++..|+.|++-|+....++.+.. +.+ +.. ....+.+. 
.+...+ T Consensus 69 -v~~~F~ngg~~~~vvRv~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~pG~~gn~l~v~v~~~~~~~~~~ 147 (749) T protein:vir:10 69 -SAAQFLSYGGLLKTIRVNSSSLKNAVDTGTAPLVKNLQDYETSIEDASNNFSWVARTPGDTGNSIGIFVTDAGADQVVV 147 (749) T ss_pred -HHHHHhhcCCeEEEEEccCccccccccccccccccccccccccccccccceEEEeccCCCcCCceEEEEEcCCCceeee Confidence 334456777889999997666552211 011 000 01111110 000000 Q ss_pred -----------EecCC-----CCCc-cc--Ccccccc--CCeeEEeeEeecccccccceeeccccccccCccccCcCcee Q lcl|NC_016571. 127 -----------YVLDA-----AGNP-KV--DETTATI--AGKWVCTGVLKSEGEVGEAKAFEVTSTDETPGIPTGTNGKF 185 (733) Q Consensus 127 -----------~~~d~-----~G~p-~v--~~t~~tI--~gk~V~~~~~~~~~~~g~~~a~~~~~~~~~~~~~~g~q~~~ 185 (733) |+.+. +|.. .+ .+...+. ....+..+........+...+..+...+. .... T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~a~~~~~~~~--------~~~~ 219 (749) T protein:vir:10 148 VPAPGSGNEHEFVADAAVSAASGAAGKVFKYSIILTIDDVVGTFAPGSATTITIGGSAESVNVLAYDA--------TNKK 219 (749) T ss_pred eecCCccceeeEEeeecccccccccccccccceeeeeccccceeecccceeeeccCcccccccccccC--------Ccce Confidence 00000 0000 00 0000000 00000000000000000000111110000 0011 Q ss_pred eeeheehhhcccccccccccc-ccchhhcchhhh------------hhhc--------------cccceeEEE----eee Q lcl|NC_016571. 186 YPLAEVIGGIGDTYNANWISI-GHNFQTDWNEVA------------RFVQ--------------TNGSYPFIL----NMG 234 (733) Q Consensus 186 yPL~E~~~~~Gd~~n~n~i~~-g~~~~~dw~~~~------------~fv~--------------~~~~~pf~~----~~~ 234 (733) +.+.--.+..+......|.-+ +.........+. +|.. +.....+.. ... T Consensus 220 ~~~~~~s~~~~~~~a~~~~v~~~~~~~~~~~~i~~~~~~~~~~~~~~~a~~~~~~~~~g~~~~it~v~~~~~~~~~~t~~ 299 (749) T protein:vir:10 220 LEIGLPSGGVTGILADNQVITQGTNTAKINVTIERKLLVALNKSSIEFAASDVVQDTNSTNITITSVRDEYTEREYLPGV 299 (749) T ss_pred EEEeeecccccceeeeeecccccccccccccccccchhhhhccccceeeccccccCCccceeEEEeeeccccccccccce Confidence 111110110000000000000 000000000000 0000 000000000 000 Q ss_pred eecccccccce----eeeccCcceeEeecccccccccceeeeeee------------eeeeccCcccCcCCCCCCccc-- Q lcl|NC_016571. 
235 VLLDNGLRVPA----NTINGTPDTTFTMFDTVNEFNTRYGLKVAV------------DQYTGNNVNRPVEVQDAPFDD-- 296 (733) Q Consensus 235 s~~d~g~~~~~----~t~~~~~d~~~~~fd~~~~~~~~y~~~v~~------------d~y~~~~~~~p~~p~~~PF~~-- 296 (733) +..+.+.++.+ -...++.|. .+...++. ..|.. + ..+..+.+.. T Consensus 300 ~~~~~a~~~gt~~~~~~~~g~~D~-------------~~v~v~~~~g~~~~~~g~v~e~~~~--~---~~~~~~~~~~~~ 361 (749) T protein:vir:10 300 KWINVAPRPGTSLYANGVGGHRDE-------------MHVILVDIDGGVTGTVGALLERYID--V---SKASDAKTSVGE 361 (749) T ss_pred eeccccccccceeeeecccCCCCc-------------eEEEEecCCCeeeecccceeeeeee--c---cccccccccccc Confidence 01111111111 111111111 11111111 11110 0 0111111110 Q ss_pred -eEEEhhhHHHHHHHHHhhhcccccccCCcccchhccCCcceeeccccccccCCCcceeEEe-------ccccccccccc Q lcl|NC_016571. 297 -VFMYTQNLSDVAKELYAAEYGTDDPTNQPPNVLSKRLPKHAIMNMFDLVNHNGKPYKHIVF-------GGNFDQAGKVT 368 (733) Q Consensus 297 -~yvY~~Nie~Vl~~ly~aE~~~~d~t~~p~~~~~~~~~~~~~~n~f~~~~~nG~PY~~Iq~-------~g~~d~~g~v~ 368 (733) .|+ .+-+.+--+.++..+.... +............ ..++.+..+..+.. ....+. . T Consensus 362 ~~~~-~~~~~~~s~~v~~~~~~~~--~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~----~ 425 (749) T protein:vir:10 362 TNYY-AEVIKQKSEFIYWAEHEST--LYAATSSASDGLF---------GQTAANRQFNLFRSAAGSVDYPAGVTT----L 425 (749) T ss_pred cchh-hhhhccCCCEEEEEecccc--ccccccccccccc---------ccccccceeeccccccccceecccccc----c Confidence 000 0000000011111110000 0000000000000 00011110000000 000000 0 Q ss_pred ceeeccceeecccCCCCccccccccccccccccccccccchhcccccchhhhhhhhhhhhhhhhhhhhhcccccceeehh Q lcl|NC_016571. 369 GSRFSMNHYLQSNGGINPFADKDGKFPEAPTTWLPAIDGPWVADVSDPDLVISHKQAWEMNQMLIEAYLTTYITSLDVKD 448 (733) Q Consensus 369 gs~~~~n~~i~~sGG~dgt~d~dgk~~d~~t~~~~~ld~~~~~d~~~~~~~~~~~~~~~m~~~l~~a~l~~~~~~~~~nd 448 (733) ...-.........||.|++....+--+. ..|+ ..+|+.-.. .. 
T Consensus 426 ~~~~~~~~~~~~~gg~d~~~~~~~~~~~-------------~~~~---------~~~~~~l~~---------------~~ 468 (749) T protein:vir:10 426 GSKNNATYYYRLSGGVNYTVSAGQYTIT-------------NTDI---------GSAYELIGD---------------PE 468 (749) T ss_pred cccCCcEEEEEccCCccccccccccccc-------------chhH---------HHHHHHhhh---------------hh Confidence 0011111223345666665332221100 0011 111111110 11 Q ss_pred eeeccceeeEecCCC----HHHHHHHHHHHhcCcceEEEEEEEeeCCCCCHHHHHHHHHHHHhhhheeeeeeeccCccce Q lcl|NC_016571. 449 VIRNRSSFMWDVGYN----QKIKDIMIQFLSKRKDIIVVPCATEYLRKKTQDELYSTATMLNTRIVMIPESEVYKSEACR 524 (733) Q Consensus 449 ~~ry~~~~~~DsGfs----~~~K~~m~~fl~~RkD~~vv~~T~~~~~~~s~~e~~s~a~~L~trl~~~PES~~YgTpa~R 524 (733) ......-|+..-|++ ..+..+|++...+|+|.++++......................-|-.. ....| T Consensus 469 ~~~~~~li~~~~~~~~~~~~~v~~al~~~~~~~~~~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~~--------~~s~~ 540 (749) T protein:vir:10 469 SQIVDFIISGPSGTSDANALAKITSLVNIAEERRDCMVFVSPRRGNVIGISNTTTITTNIVDFFKKL--------PSSSY 540 (749) T ss_pred hcccceEEEecCCCCcchhHHHHHHHHHHHhhcCCEEEEEcCCCCcccccccchhhhhHHHHHHhhc--------cCcee Confidence 111223344444554 346678888999999999888655432211000001111111111111 23457 Q ss_pred eEeecceeEEEeccc--cccceehHHHHHHHHHHhcCcCCcccCCCccccCCcceeeeeccCcceecCCchhhhhhhCce Q lcl|NC_016571. 525 ASINLWDARYINEPT--WGRFSLNIENMYAFAVAGGGADGRIYAADMPDHEGNRILRIAHDPFVEFEADDPAANNLIQGC 602 (733) Q Consensus 525 a~I~~~s~~l~ns~y--~~~vpln~dlA~k~A~y~Ga~dG~~~~~~~~D~~pN~~v~~~~dlnv~f~~~~v~a~~w~ng~ 602 (733) ++++.--.+..|..- ...+|..=.+|=-+|+--- ..|-|+.+--..+ ..|+-+-++.+..... -+...-.+|. 
T Consensus 541 ~~~~~p~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~-~~g~~~SPan~~~---~~i~g~~~~~~~~~~~-e~~~Ln~~gI 615 (749) T protein:vir:10 541 MVFDSGYKYIYDKYNDVYRYIPCNGDTAGLCLQTNE-ISEPWFSPAGFQR---GVLRNAIKLAYTPNKA-QRDQLYANRV 615 (749) T ss_pred EEEEccceeeeccccCceEEechHHHHHHHHHHhhc-cCCcEECcCCcee---eeeeccccceeecChh-HHHhhhhCCc Confidence 777765555544322 2357887778877777643 3467764322211 2355555666655433 3556667887 Q ss_pred eEeeeeeeeEEEccccccccCCchhhhhhhhhhhhHHHHHHHHHHHhhhhccccccChhHHHHHHHHHHHHHhhhhhc-c Q lcl|NC_016571. 603 ITVTPITESQFCRPALPTVYGNINSVLKDLTNVWKCVVVEKILQDIWIQVSGDTQLGKEGYLSFVKDGAEKRIRDLFG-S 681 (733) Q Consensus 603 i~v~~yD~~~~y~PaL~TVy~ndtSVLns~~tv~~c~~~~kv~~~~w~~fsG~~slt~~ql~~~~~d~I~~~~rd~fg-~ 681 (733) +.+..+..+-...=+=+|.-..| |-.+-.-.-=+.-++||-+.+.-+++.++ .+ ++.+.+++.+-|+..++++.+ + T Consensus 616 n~i~~~~g~G~~~wG~rT~~s~d-~~~~~i~vRRl~~~ie~si~~~~~~~v~e-pn-~~~l~~~i~~~i~~fL~~l~~~G 692 (749) T protein:vir:10 616 NPIVSFPGQGVVLYGDKTALGFA-SAFDRINIRRLFLTVERVISTAAKAQLFE-QN-DEAQRSLFINIVEPYLRDVQGRR 692 (749) T ss_pred eEEEEecCCeEEEEcceecCCCC-cccceeehhhhHHHHHHHHHHHHHHhhcC-CC-CHHHHHHHHHHHHHHHHHHHhcC Confidence 87777643322222335543333 33332222234458999999988888884 33 677888888888888888765 3 Q ss_pred eecceeEecccceeccchhcCceEEEeeec--cCCcEEEEEEEEEEEccCCCCC Q lcl|NC_016571. 
682 VISNWEVVPSFREDSPTSKSVMYSITRLWF--GKGIYMLNSVLEAYNEDSLNAE 733 (733) Q Consensus 682 rv~n~~vi~p~~~~T~~d~~~g~SwT~~~~--nn~~~~m~~~leayr~~~l~a~ 733 (733) -+.--.|..-.+.=|++|-.+|.-.-...+ ..|.--+.+.|...|.+.--+| T Consensus 693 ~i~~f~V~~d~~~Nt~~~i~~G~~~~~i~~~P~~pae~I~~~~~~~~~~~~~~e 746 (749) T protein:vir:10 693 GVVDFLVKCDSTNNTPEAVDRGEFYAEVFLKPTRTINYVQLTFVATRTGVSFAE 746 (749) T ss_pred CeeeeEEEEcCCCCCHHHhhCCEEEEEEEEEecCCccEEEEEEEEeecCcchHH Confidence 333334566667778888888877665544 4444445566667766655555 No 10 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=31.20 E-value=1.5 Score=19.59 Aligned_cols=610 Identities=11% Similarity=0.005 Sum_probs=231.6 Q ss_pred CceeeecccceeeccceeeccccccccCCCcccccccceeeecccccCcccc--cCCcchhhhcccccCCCccccchHHH Q lcl|NC_016571. 1 METFNKVIPGKVVNEGIVSRETFPAAVPVISSPAHFPDFAAVTPRGSTKRAT--VSTGTFTSKFGDTTDPFGLYYNPVTY 78 (733) Q Consensus 1 M~~~~n~~P~~v~~~GirD~S~~~l~~~~~~~p~H~Plf~~~~p~G~~~~~~--v~~gd~~siyGD~~D~~s~yfn~~t~ 78 (733) |++ .+|| |-++=+ |. +++.. +-+--..-|-..++||-.+... .+-+||..+||...+-. |. .+ T Consensus 1 ~~~---~~Pg-Vyv~e~-~~-~~~i~----~v~ts~~~fvG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~---~~--~~ 65 (663) T protein:vir:10 1 MAL---LSPG-IEMKET-SI-NSTVV----RSATGRAAIVGKFAWGPAYEVRQVTNEVELVDMFGSPDNVT---AP--YF 65 (663) T ss_pred Cce---ecCc-eEEEEe-cC-ccccc----ccCccceeEEeeeccCCCCccEEecCHHHHHHHhCCcCccc---hh--HH Confidence 544 6798 677644 53 33333 3333355688899998864444 37788999999644322 22 23 Q ss_pred HHHHHhhcccceEEEEEeccccccceeeeEEeeecccccceeEcCcccEecCCCCCcccCccccccCCeeE-----EeeE Q lcl|NC_016571. 79 AIQKLGQAGQASFSFKRLTNNTAKSRTIIGLAIFSGDIPNYLRDGNGDYVLDAAGNPKVDETTATIAGKWV-----CTGV 153 (733) Q Consensus 79 laq~l~~a~~n~f~~~RL~p~dA~s~~~i~~dv~~~~VP~Y~R~~nG~~~~d~~G~p~v~~t~~tI~gk~V-----~~~~ 153 (733) +...-++..|+.|++-|+...++...+.. +...++..+..... 
...-|++ ..+.+.+.-+ ..++ T Consensus 66 ~v~~~f~ngg~~~~vvRv~~~~~~~~a~~----~~~~~~~~~~~~~~---~~~~g~~----~~v~~~~~~~~~~~~~~~~ 134 (663) T protein:vir:10 66 MSAMNFLQYGNDLRLVRVIDMEKAKNASP----LVNQVSVTITTEGQ---GYTVGDA----ITVKYNNATITEAGKVTAV 134 (663) T ss_pred HHHHHHHhCCCeEEEEEccCCcccccccc----cCCcceeeeecccc---Ccccccc----ccccccccccccccceeee Confidence 34445577888999999987654432211 12222222221110 1122331 1222211110 1111 Q ss_pred eecccccccceeeccc-cccccCccccCcCceeeeehe-----ehhhccccccccccccccchhhcchhhhhhhccccce Q lcl|NC_016571. 154 LKSEGEVGEAKAFEVT-STDETPGIPTGTNGKFYPLAE-----VIGGIGDTYNANWISIGHNFQTDWNEVARFVQTNGSY 227 (733) Q Consensus 154 ~~~~~~~g~~~a~~~~-~~~~~~~~~~g~q~~~yPL~E-----~~~~~Gd~~n~n~i~~g~~~~~dw~~~~~fv~~~~~~ 227 (733) ..+... ...+-.+ ...+.. .+. ..++.+. .+...+...... + .+..++...+.. T Consensus 135 ~~n~~~---~~v~~~~a~~~~~~-~~v----~~~~~~~~~~~~~~~~~~~~~~~~-----------~-~v~~vv~~~~~~ 194 (663) T protein:vir:10 135 DSDGKI---KSLFVPTAEIIAKT-RQL----GTYPTLGDNWRIDVSGASGGSAAA-----------L-ALGNIVVDSGVT 194 (663) T ss_pred ccCCce---EEEEeccccccccc-ccc----ceeeeccccceeEeeecccccccc-----------c-cccceeccccee Confidence 111100 0000000 000000 000 0111110 000000000000 0 001122111111 Q ss_pred eEEEeeeeecccccccceeeeccCcceeEeec-ccccccccceeeeeeeeeeeccCcccCcCCCCCCccceEEEhhhHHH Q lcl|NC_016571. 228 PFILNMGVLLDNGLRVPANTINGTPDTTFTMF-DTVNEFNTRYGLKVAVDQYTGNNVNRPVEVQDAPFDDVFMYTQNLSD 306 (733) Q Consensus 228 pf~~~~~s~~d~g~~~~~~t~~~~~d~~~~~f-d~~~~~~~~y~~~v~~d~y~~~~~~~p~~p~~~PF~~~yvY~~Nie~ 306 (733) .....- ...+.+.....+........-... ..+.+-+ .+.+...... ...+....-++..+... 
T Consensus 195 ~~~~~~--a~~~~~~~~~~~~~~~~~~~~~~a~~~G~~Gn-----~i~v~i~~~~--------~~~~~~~~~v~~~~~~~ 259 (663) T protein:vir:10 195 FGNSED--APAVMTSPAVMEKYAKFGMPLVSAVYPGEIGS-----TVEVEIVSKT--------AFNSGAQQTIYPFGGTR 259 (663) T ss_pred eEeecc--ccccccccchhhhcccccceeeeeeccccccc-----ceeEEecccc--------ccccccccccccccccc Confidence 111000 000011100000000000000000 0000000 0111111100 01111111111111110 Q ss_pred HHHHHHhhhcccccccCCcccchhccC-Ccce---------------eeccccccccCCCcceeEEecccccccccccce Q lcl|NC_016571. 307 VAKELYAAEYGTDDPTNQPPNVLSKRL-PKHA---------------IMNMFDLVNHNGKPYKHIVFGGNFDQAGKVTGS 370 (733) Q Consensus 307 Vl~~ly~aE~~~~d~t~~p~~~~~~~~-~~~~---------------~~n~f~~~~~nG~PY~~Iq~~g~~d~~g~v~gs 370 (733) ........+.+.. ..++-..++.... ..++ ..-..+.+...|.+|-.....+. T Consensus 260 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------- 328 (663) T protein:vir:10 260 TSNARGVIQYGPM-TDDQFAIIVRRDGIVVESTVLSTRKGDRDVYGSNIFMDDYFRNGGSNFIFASSEGW---------- 328 (663) T ss_pred ccccceeeeeccc-cccceeEEEecCCcceeeeeeeecccccccchhhhhhhhhhccCcceEEEEeeccc---------- Confidence 0000000000000 0000000000000 0000 00000111112233222111110 Q ss_pred eeccceeecccCCCCccccccccccccccccccccccchhcccccchhhhhhhhhhhhhhhhhhhhhcccccceeehhee Q lcl|NC_016571. 371 RFSMNHYLQSNGGINPFADKDGKFPEAPTTWLPAIDGPWVADVSDPDLVISHKQAWEMNQMLIEAYLTTYITSLDVKDVI 450 (733) Q Consensus 371 ~~~~n~~i~~sGG~dgt~d~dgk~~d~~t~~~~~ld~~~~~d~~~~~~~~~~~~~~~m~~~l~~a~l~~~~~~~~~nd~~ 450 (733) .......+..+||.|+.. .+++.+. ..+++.- +....++.+-+ T Consensus 329 ~~~~~~~~~l~gg~d~~~-----------------------~~~~~~~----~~~~~~l---------~~~~~~~~~~~- 371 (663) T protein:vir:10 329 PAGFTGIIQLGGGTSANA-----------------------DVGADEL----IKGWDLF---------SDREALHVNLM- 371 (663) T ss_pred CccccceeEcccccCCCc-----------------------cccchhh----HHHHHhh---------hcccccceeEE- Confidence 111122345556666531 1111111 1111110 11111122211 Q ss_pred eccceeeEecCCC-----HHHHHHHHHHHhcCcceEEEEEEEeeCC-CCCHHHHHHHHHHHHhhhheeeeeeecc--Ccc Q lcl|NC_016571. 
451 RNRSSFMWDVGYN-----QKIKDIMIQFLSKRKDIIVVPCATEYLR-KKTQDELYSTATMLNTRIVMIPESEVYK--SEA 522 (733) Q Consensus 451 ry~~~~~~DsGfs-----~~~K~~m~~fl~~RkD~~vv~~T~~~~~-~~s~~e~~s~a~~L~trl~~~PES~~Yg--Tpa 522 (733) |.-..|.. ..+..+|...-.+|||.|+++....... .........-+...+..+...-+...+. --. T Consensus 372 -----i~~~~~~~~~~~~~~v~~al~~~a~~~~~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s 446 (663) T protein:vir:10 372 -----IAGACGSDGAEIASTVQKYVVSLADDRQDCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISS 446 (663) T ss_pred -----EeccCCCCchhhHHHHHHHHHHHHHhhCCEEEEEecCcccccccccccchHHHHHHHHhccccccchhhccccCc Confidence 11122222 4567788888889999999986543321 1111122222334434333332222221 223 Q ss_pred ceeEeecceeEEEecccc--ccceehHHHHHHHHHHhcCcCCcccCCCccccCCcceeeeeccCcceecCCchhhhhhhC Q lcl|NC_016571. 523 CRASINLWDARYINEPTW--GRFSLNIENMYAFAVAGGGADGRIYAADMPDHEGNRILRIAHDPFVEFEADDPAANNLIQ 600 (733) Q Consensus 523 ~Ra~I~~~s~~l~ns~y~--~~vpln~dlA~k~A~y~Ga~dG~~~~~~~~D~~pN~~v~~~~dlnv~f~~~~v~a~~w~n 600 (733) .|++++.--.++.|..-. ..+|..-.+|=-+|+--. ..|.|+.+--... ..|+-+.++.+.+... -+...-.+ T Consensus 447 ~~~~l~~P~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~-~~g~~~sPan~~~---~~i~g~~~~~~~~~~~-~~~~Ln~~ 521 (663) T protein:vir:10 447 TYAFIIGNYKYQYDKYNDINRWVPLAADIAGLCAYTDQ-VSHPWMSPAGYRR---GQIRNCIKLAIEPKQS-MRDTMYQV 521 (663) T ss_pred ceEEEEccceEEecccCCceEEechhHHHHHHHHHhhc-cCCceEccCCcee---ccccccccceeccChh-HHHHHhhC Confidence 467766544444433222 246766666655666543 3466665321111 1344445565555443 35566677 Q ss_pred ceeEeeeeeee-EEEccccccccCCchhhhhhhhhh-hhHHHHHHHHHHHhhhhccccccChhHHHHHHHHHHHHHhhhh Q lcl|NC_016571. 601 GCITVTPITES-QFCRPALPTVYGNINSVLKDLTNV-WKCVVVEKILQDIWIQVSGDTQLGKEGYLSFVKDGAEKRIRDL 678 (733) Q Consensus 601 g~i~v~~yD~~-~~y~PaL~TVy~ndtSVLns~~tv-~~c~~~~kv~~~~w~~fsG~~slt~~ql~~~~~d~I~~~~rd~ 678 (733) |...+..+... 
-...=+=+|.-.++ |-.+ ++++ =+.-++++-+.+.++.+.++ .+ ++.+.+++.+-|+..++++ T Consensus 522 gin~i~~~~~~~G~~~wG~rT~s~~~-s~~~-~i~vrR~~~~i~~si~~~~~~~v~e-~n-~~~l~~~i~~~i~~~L~~l 597 (663) T protein:vir:10 522 AINPVTGFAGGDGFVLFGDKMATQVP-SPFD-RINVRRLFNMLKKNIGDTSKYELFE-NN-DAFTRQSFRMETSQYLDGI 597 (663) T ss_pred CceEEEEEeCCCcEEEEcccccCCCC-cccc-eEehhhHHHHHHHHHHHHHHHhccC-CC-CHHHHHHHHHHHHHHHHHH Confidence 77777766532 22222345543332 3322 2222 23347899999999999985 34 5667777777777777664 Q ss_pred hcc-eecceeEecccceeccchhcCceEEEeeec--cCCcEEEEEEEEEEEccCCCCC Q lcl|NC_016571. 679 FGS-VISNWEVVPSFREDSPTSKSVMYSITRLWF--GKGIYMLNSVLEAYNEDSLNAE 733 (733) Q Consensus 679 fg~-rv~n~~vi~p~~~~T~~d~~~g~SwT~~~~--nn~~~~m~~~leayr~~~l~a~ 733 (733) ... .+-.-.|....++=|+.|-.+|.-.-+..+ .+|.--+.+.|...|.+-==.| T Consensus 598 ~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~e 655 (663) T protein:vir:10 598 RSLGGCYDFRVVCDTTNNTPNVIDRNEFVGTIYVKPPRSINYITLNMVATSTGANFDE 655 (663) T ss_pred HhcCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCccHHH Confidence 421 222223466677888999999987766655 4444444555554443311111 No 11 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=25.42 E-value=2.1 Score=18.86 Aligned_cols=562 Identities=11% Similarity=0.043 Sum_probs=213.8 Q ss_pred Cceee-ecccceeeccceeeccccccccCCCcccccccceeeecccccCcccc-c-CCcchhhhcc--cccCCCccccch Q lcl|NC_016571. 1 METFN-KVIPGKVVNEGIVSRETFPAAVPVISSPAHFPDFAAVTPRGSTKRAT-V-STGTFTSKFG--DTTDPFGLYYNP 75 (733) Q Consensus 1 M~~~~-n~~P~~v~~~GirD~S~~~l~~~~~~~p~H~Plf~~~~p~G~~~~~~-v-~~gd~~siyG--D~~D~~s~yfn~ 75 (733) |+.-. +.-| +.--|..=.....-+++..+.+--...|-..+.+|.++.-. + +-.+++++|| |+-|. 
T Consensus 1 ~a~~~~~~~~--~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~G~~~~~~~~~~~~~~~~~~~~g~l~~~------- 71 (587) T protein:vir:95 1 MAVEPFPRRP--ITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEGGEPNTVYELRNYSQAKRLFRSGELLDA------- 71 (587) T ss_pred CcccccCCcc--cccCceEEEEecCCccccCCCCCceEEEEEEeCCCCCceeEEeccHHHHHHHhcCcchHHH------- Confidence 65432 1221 11123333333344555566666667799999999876655 2 4455889997 53332 Q ss_pred HHHHHHHHh---hcccceEEEEEeccccccceeeeEEeeecccccceeEcCcccEecCCCCC-cccCccccccCC-eeEE Q lcl|NC_016571. 76 VTYAIQKLG---QAGQASFSFKRLTNNTAKSRTIIGLAIFSGDIPNYLRDGNGDYVLDAAGN-PKVDETTATIAG-KWVC 150 (733) Q Consensus 76 ~t~laq~l~---~a~~n~f~~~RL~p~dA~s~~~i~~dv~~~~VP~Y~R~~nG~~~~d~~G~-p~v~~t~~tI~g-k~V~ 150 (733) +.++.. +.++-.|+.=|+.++.|.+...=.|-+.. -.-+ ..|| ..|.-.+-+|+| |... T Consensus 72 ---~~~a~~~~~~~g~~~~~~~rv~~~~~a~~~~~~l~~~a-----~~~G--------~~gN~i~v~~~~~~~~~~~~~~ 135 (587) T protein:vir:95 72 ---IELAWGSNPNYTAGRILAMRIEDAKPASAEIGGLKITS-----KIYG--------NVANNIQVGLEKNTLSDSLRLR 135 (587) T ss_pred ---HHHHhccccCCCceEEEEEEcCCCceeEEEecCeEEEE-----eccc--------ccccceEEEEecCCCCCceeEE Confidence 122221 23556677778755555432221111111 1111 1122 111111223445 2222 Q ss_pred eeEee-cccccccceeeccccccccCccccCcCceeeeeheehhhccccccccccccccchhhcchhhhhhhccccceeE Q lcl|NC_016571. 151 TGVLK-SEGEVGEAKAFEVTSTDETPGIPTGTNGKFYPLAEVIGGIGDTYNANWISIGHNFQTDWNEVARFVQTNGSYPF 229 (733) Q Consensus 151 ~~~~~-~~~~~g~~~a~~~~~~~~~~~~~~g~q~~~yPL~E~~~~~Gd~~n~n~i~~g~~~~~dw~~~~~fv~~~~~~pf 229 (733) ....+ ++.++. ....+++.+. +.++ ...+++..++..+.-.... ..+-.+-.+...+.-..+.|+. 
T Consensus 136 ~~~~~~~~~~~~-~~~g~v~si~---y~g~----~~~~~~~v~~~~~t~~a~~-----~~l~~g~~~v~~yrL~~g~~~~ 202 (587) T protein:vir:95 136 VIFQDDRFNEVY-DNIGNIFTIK---YKGE----EANATFSVEHDEETQKASR-----LVLKVGDQEVKSYDLTGGAYDY 202 (587) T ss_pred EEEecccceeee-eeccceeeee---eecc----ccccceeeeecccceeeee-----eeeecCCceEEEEEecCCchHH Confidence 12222 221110 0011122221 1111 1245555544443322111 0000000111111111111111 Q ss_pred EEeeeeecccccccceeeeccCcceeEeecccccccccceeeeeeeeeeeccCcccCcCCCCCCccceEEEhhhHHHHHH Q lcl|NC_016571. 230 ILNMGVLLDNGLRVPANTINGTPDTTFTMFDTVNEFNTRYGLKVAVDQYTGNNVNRPVEVQDAPFDDVFMYTQNLSDVAK 309 (733) Q Consensus 230 ~~~~~s~~d~g~~~~~~t~~~~~d~~~~~fd~~~~~~~~y~~~v~~d~y~~~~~~~p~~p~~~PF~~~yvY~~Nie~Vl~ 309 (733) ...+ .++...++. -.+.+..++ .+.++..|. +.-+..++.--....-+=+.++-.|..- . T Consensus 203 ~~~~---~~~in~~~~------~tAky~g~~-~~~i~~~~~-----~~~~~~~v~~~~~~v~a~~~d~~~~~~~-----~ 262 (587) T protein:vir:95 203 TNAI---ITDINQLPD------FEAKLSPFG-DKNLESSKL-----DKIENANIKDKAVYVKAVFGDLEKQTAY-----N 262 (587) T ss_pred HHHH---HHhhccccc------eEEEEeccc-CceeEEeec-----Ccccccceehhhhhhhhhhcceeeeeec-----e Confidence 1111 000000000 012222221 011111100 0000000000000000000111111000 0 Q ss_pred HHHhhhcccccccCCcccchhccCCcce-eeccccccccCCCcceeEEecccccccccccceeeccceeecccCCCCccc Q lcl|NC_016571. 310 ELYAAEYGTDDPTNQPPNVLSKRLPKHA-IMNMFDLVNHNGKPYKHIVFGGNFDQAGKVTGSRFSMNHYLQSNGGINPFA 388 (733) Q Consensus 310 ~ly~aE~~~~d~t~~p~~~~~~~~~~~~-~~n~f~~~~~nG~PY~~Iq~~g~~d~~g~v~gs~~~~n~~i~~sGG~dgt~ 388 (733) .++..+ .+.+.. ....+ -+.++.+-......+... ..-.. .+-...||.||+. 
T Consensus 263 ~~v~~~----------------~~~g~~~~~~~~--~~~~~~~~a~~~~~~~~~-------~~a~~-~~t~LtGG~dG~~ 316 (587) T protein:vir:95 263 GIVSFE----------------QLNAEGEVPSNV--EVEAGEESATVTATSPIK-------TIEPF-ELTKLKGGTNGEP 316 (587) T ss_pred eeeeee----------------cccccceeccch--hhhhcccchheecccccc-------ceecc-ceeeeecCCCCCC Confidence 000000 000000 11111 112222222222111100 00000 1123569999963 Q ss_pred cccccccccccccccccccchhcccccchhhhhhhhhhhhhhhhhhhhhcccccceeehheeeccceeeEecCCCHHHHH Q lcl|NC_016571. 389 DKDGKFPEAPTTWLPAIDGPWVADVSDPDLVISHKQAWEMNQMLIEAYLTTYITSLDVKDVIRNRSSFMWDVGYNQKIKD 468 (733) Q Consensus 389 d~dgk~~d~~t~~~~~ld~~~~~d~~~~~~~~~~~~~~~m~~~l~~a~l~~~~~~~~~nd~~ry~~~~~~DsGfs~~~K~ 468 (733) +. ++ ..+ |.|+-.. +.+ .+...+. .+.+|. T Consensus 317 ~~-----~y-------------------------~~~-------l~ale~~-----~~~-------~i~~~t~-d~~v~a 346 (587) T protein:vir:95 317 PA-----TW-------------------------ADK-------LDKFAHE-----GGY-------YIVPLSS-KQSVHA 346 (587) T ss_pred cc-----cH-------------------------HHH-------HHHHHhC-----CcE-------EEEecCC-CHHHHH Confidence 21 10 000 1110000 111 1112232 568899 Q ss_pred HHHHHHhcCcc----eEEEEEEEeeCCCCCHHHHHHHHHHHHhhhheeeeeeeccCccceeEeecceeEEEec-cccccc Q lcl|NC_016571. 469 IMIQFLSKRKD----IIVVPCATEYLRKKTQDELYSTATMLNTRIVMIPESEVYKSEACRASINLWDARYINE-PTWGRF 543 (733) Q Consensus 469 ~m~~fl~~RkD----~~vv~~T~~~~~~~s~~e~~s~a~~L~trl~~~PES~~YgTpa~Ra~I~~~s~~l~ns-~y~~~v 543 (733) .+..|+.++++ .+.|+.- ...-++++..+++..++.. |...+.-.+.+.++ .-...+ T Consensus 347 ~l~a~vk~~~~~g~~~~aVvg~---~~~~~~~~~~~~a~~~n~e---------------rvi~v~~~~~~~~~dg~~~~~ 408 (587) T protein:vir:95 347 EVASFVKERSDAGEPMRAIVGG---GFNESKEQLFGRQESLSNP---------------RVSLVANSGTFVMDDGRKNHV 408 (587) T ss_pred HHHHHHHHHHhCCCcEEEEEcC---CCCCCHHHHHHHHhhcCCC---------------cEEEecccceEecCCCceeee Confidence 99999977766 6666642 1234677777777666431 33333333333222 111222 Q ss_pred eehHHHHHHHHHHhcCcCCcccCCCccccCCcceeeeeccCcceecCCchhhhhhhCceeEeeeeeeeEE--E--ccccc Q lcl|NC_016571. 
544 SLNIENMYAFAVAGGGADGRIYAADMPDHEGNRILRIAHDPFVEFEADDPAANNLIQGCITVTPITESQF--C--RPALP 619 (733) Q Consensus 544 pln~dlA~k~A~y~Ga~dG~~~~~~~~D~~pN~~v~~~~dlnv~f~~~~v~a~~w~ng~i~v~~yD~~~~--y--~PaL~ 619 (733) |--.--|+-++.++|.. +-.++.+..-.+.++.-.|... -..+.-.+|.+.+.....+.- + -=++- T Consensus 409 ~~~~~aa~vAGl~Ag~~---------~~~SlT~~~i~~~~v~~~~t~~-e~e~ai~~Gvl~l~~~~~~~~~~vriv~~it 478 (587) T protein:vir:95 409 PAYMVAVALGGLASGLE---------IGESITFKPLRVSSLDQIYESI-DLDELNENGIISIEFVRNRTNTFFRIVDDVT 478 (587) T ss_pred chHHHHHHHHHHHhcCc---------hhcCccceeeecccccccCCHH-HHHHHHhCCeEEEEEecCCcceEEEEeecce Confidence 22111122233333322 2344555444456777777543 466677888777764443321 1 12444 Q ss_pred cccCCchhhhhhhhhhhhHHHHHHHHHHHhh-hhccccccChhHHHHHHHHHHHHHhhhhhccee-cceeEecccceecc Q lcl|NC_016571. 620 TVYGNINSVLKDLTNVWKCVVVEKILQDIWI-QVSGDTQLGKEGYLSFVKDGAEKRIRDLFGSVI-SNWEVVPSFREDSP 697 (733) Q Consensus 620 TVy~ndtSVLns~~tv~~c~~~~kv~~~~w~-~fsG~~slt~~ql~~~~~d~I~~~~rd~fg~rv-~n~~vi~p~~~~T~ 697 (733) |.-...+-.++.+-++=..=++.+-+.+.+. .|-|. ...+.....++..|...+..+-..+. .+-.. +..+++. T Consensus 479 T~t~~~d~~~~~i~viRv~D~i~~dir~~~~~~~iGk--~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~--~dv~v~~ 554 (587) T protein:vir:95 479 TFNDKSDPVKAEMAVGEANDFLVSELKVQLEDQFIGT--RTINTSASIIKDFIQSYLGRKKRDNEIQDFPA--EDVQVIV 554 (587) T ss_pred eccCCCCcchhhhhhhhhHHHHHHHHHHHHHhhCCcc--ccchHHHHHHHHHHHHHHHHHHhCCcccCCCc--cceEEEe Confidence 4323333345454444444466666766664 68884 34455666666667776666654443 21100 1222333 Q ss_pred chhcCceEEEeeeccCCcEEEEEEEEE--EEccCCCC Q lcl|NC_016571. 698 TSKSVMYSITRLWFGKGIYMLNSVLEA--YNEDSLNA 732 (733) Q Consensus 698 ~d~~~g~SwT~~~~nn~~~~m~~~lea--yr~~~l~a 732 (733) .+. 
.+.+.|.-.|-..|++...+ ||.++|+| T Consensus 555 ~~d----~~~v~~~v~Pv~~mekI~vt~~~~~~~~~~ 587 (587) T protein:vir:95 555 EGN----EARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred cCC----EEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 222 23444555555666655544 56888888 No 12 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=23.63 E-value=2.3 Score=18.62 Aligned_cols=618 Identities=13% Similarity=0.040 Sum_probs=232.9 Q ss_pred CceeeecccceeeccceeeccccccccCCCcccccccceeeecccccCcccc--cCCcchhhhcccccCCCccccchHHH Q lcl|NC_016571. 1 METFNKVIPGKVVNEGIVSRETFPAAVPVISSPAHFPDFAAVTPRGSTKRAT--VSTGTFTSKFGDTTDPFGLYYNPVTY 78 (733) Q Consensus 1 M~~~~n~~P~~v~~~GirD~S~~~l~~~~~~~p~H~Plf~~~~p~G~~~~~~--v~~gd~~siyGD~~D~~s~yfn~~t~ 78 (733) |. -..|| |-++=+ | .+|+..-...+ .+-|-..+++|-.+... .+-.||..+||...+-. | ... T Consensus 1 ~~---~~~Pg-vyv~e~-~-~~~~i~~v~t~----~~~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~--~---~~~ 65 (671) T protein:vir:56 1 MT---LLSPG-IENKEI-N-LASAIGRAATG----RAAMVGKFEWGPAYSITQVTSESDLVTIFGRPNDYT--A---ASF 65 (671) T ss_pred Cc---eecCc-eEEEee-c-CcccccccCcc----cceEEecccCCCCccCEEcCCHHHHHHHcCCcCCCc--c---hhH Confidence 44 46798 566644 6 34443333333 56688999999875554 26778999999754322 2 222 Q ss_pred HHHHHhhcccceEEEEEeccccccceeeeEEeeecccccceeEcCcccEecCCCCCcccCcccccc-CCeeEEeeEeecc Q lcl|NC_016571. 79 AIQKLGQAGQASFSFKRLTNNTAKSRTIIGLAIFSGDIPNYLRDGNGDYVLDAAGNPKVDETTATI-AGKWVCTGVLKSE 157 (733) Q Consensus 79 laq~l~~a~~n~f~~~RL~p~dA~s~~~i~~dv~~~~VP~Y~R~~nG~~~~d~~G~p~v~~t~~tI-~gk~V~~~~~~~~ 157 (733) .....++..|+.|++-|+...++..++....+-.+.+. .-.++--.-+.-+ .....+...+ +++........+. 
T Consensus 66 ~v~~~f~ngg~~~~vvrv~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~a~~ 140 (671) T protein:vir:56 66 MTANNFLKYGNDLRLVRICDATTAQNATPLYNAVEYTI----GASNGCVVGDDIT-ITYSGVGALTAKGKVLEVDAGNNN 140 (671) T ss_pred HHHHHHHhcCCeEEEEEecCccccccchhhcccccccc----ccCcceeeceeee-eecCcccccccCcceeEEeeeccc Confidence 33445567788899999987765544432222221111 1111100000000 0000011111 1111110000000 Q ss_pred cccc--------cc-----eeeccccccccCccccCcCceeeeehee--------hhhc-cccccccccccccchhhcch Q lcl|NC_016571. 158 GEVG--------EA-----KAFEVTSTDETPGIPTGTNGKFYPLAEV--------IGGI-GDTYNANWISIGHNFQTDWN 215 (733) Q Consensus 158 ~~~g--------~~-----~a~~~~~~~~~~~~~~g~q~~~yPL~E~--------~~~~-Gd~~n~n~i~~g~~~~~dw~ 215 (733) .... .. ...+...... ..+..+.....--. ++.. .+......+.... -...|. T Consensus 141 ~~~~~~~~~~~~v~~~~~~~~~~~~~~~t----~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 215 (671) T protein:vir:56 141 AASKIFLPSAEIVAAAKSDGNYPSVGTIT----LQPTQGDIALTNIEIIDTGSVYFPNIELAFDALTAIETEG-GALKYA 215 (671) T ss_pred eeeeeeccceeEEEeeecccccccccccc----ccccccceeeeeecccccceEEEecccccccccccccccc-ccccch Confidence 0000 00 0000000000 00000000000000 0000 0000000000000 000111 Q ss_pred hhhhhhccccceeEEEeeeeecccccccceeeeccCcceeEeecccccccccce-eeeeeeeeeeccCcccCcCC----- Q lcl|NC_016571. 216 EVARFVQTNGSYPFILNMGVLLDNGLRVPANTINGTPDTTFTMFDTVNEFNTRY-GLKVAVDQYTGNNVNRPVEV----- 289 (733) Q Consensus 216 ~~~~fv~~~~~~pf~~~~~s~~d~g~~~~~~t~~~~~d~~~~~fd~~~~~~~~y-~~~v~~d~y~~~~~~~p~~p----- 289 (733) .+. ...+ .|- +......+.|......... .......+. ..+.+ +.++....+... ...+... 
T Consensus 216 ~~~---~~~~-~~~-~~a~~~g~~g~~~~v~v~~--~~~~~~~~a----~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 283 (671) T protein:vir:56 216 DLI---EKQG-FPR-LSARYVGDFGDAISVEIIN--YADYQTAFA----FAAGHTLGDIELPIYPDG-GTRSINLSSYFT 283 (671) T ss_pred hhh---hccc-ccc-cccccccccCcceEEEEec--ccccccccc----cccceeeeeccccccccc-cccccccceeec Confidence 110 0000 000 0000000000000000000 000000000 00000 000000000000 0000000 Q ss_pred -CCCCccc--eEEEhhhHHHHHHHHHhhhcccccccCCcccchhccCCcceeecc-ccccccCC-CcceeEEeccccccc Q lcl|NC_016571. 290 -QDAPFDD--VFMYTQNLSDVAKELYAAEYGTDDPTNQPPNVLSKRLPKHAIMNM-FDLVNHNG-KPYKHIVFGGNFDQA 364 (733) Q Consensus 290 -~~~PF~~--~yvY~~Nie~Vl~~ly~aE~~~~d~t~~p~~~~~~~~~~~~~~n~-f~~~~~nG-~PY~~Iq~~g~~d~~ 364 (733) .....++ +.++..+.- . ..+....+. + .+..+-.+. ......+| .+|......... T Consensus 284 ~~~~~~~~~~~~v~~~g~~---~---~~~~~~~~~-~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 344 (671) T protein:vir:56 284 FGPSNSNQYAVIVRVSGEV---E---EAFIVSTNP-G---------DKDVNGQSIFIDEYFENSGSAYITAIAEGWK--- 344 (671) T ss_pred ccccccccceeEEeecCcc---c---eeEEEeecc-c---------ccccchhhhhhhhhhcccCceEEEecCcccC--- Confidence 0000011 111111100 0 000000000 0 000000000 00011111 111110000000 Q ss_pred ccccceeeccceeecccCCCCccccccccccccccccccccccchhcccccchhhhhhhhhhhhhhhhhhhhhcccccce Q lcl|NC_016571. 365 GKVTGSRFSMNHYLQSNGGINPFADKDGKFPEAPTTWLPAIDGPWVADVSDPDLVISHKQAWEMNQMLIEAYLTTYITSL 444 (733) Q Consensus 365 g~v~gs~~~~n~~i~~sGG~dgt~d~dgk~~d~~t~~~~~ld~~~~~d~~~~~~~~~~~~~~~m~~~l~~a~l~~~~~~~ 444 (733) .....+-..||.|++...+.. +.+++. +..+.++ T Consensus 345 --------~~~~~~~~~gg~d~~~~~~~~-----------------------------~~~~~~---------~~~~~~~ 378 (671) T protein:vir:56 345 --------TESGAYNFGGGSDANAGADDW-----------------------------MFGLDM---------LSDPEVL 378 (671) T ss_pred --------CccccccccCccccccchhHH-----------------------------HHHHHh---------hhhcccc Confidence 001112223555554211110 011110 0111111 Q ss_pred eehheeeccceeeEe-cCCCHHHHHHHHHHHhcCcceEEEEEEEeeC---C--CCCHHHHHHHHHHHHhhhheeeeeee- Q lcl|NC_016571. 
445 DVKDVIRNRSSFMWD-VGYNQKIKDIMIQFLSKRKDIIVVPCATEYL---R--KKTQDELYSTATMLNTRIVMIPESEV- 517 (733) Q Consensus 445 ~~nd~~ry~~~~~~D-sGfs~~~K~~m~~fl~~RkD~~vv~~T~~~~---~--~~s~~e~~s~a~~L~trl~~~PES~~- 517 (733) . -+++.=+.++=-+ ..-....|.++......|+|.++++...... . ..+++++.+.. ..+...-+... T Consensus 379 ~-~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~ 453 (671) T protein:vir:56 379 Y-TNLVIAGNAAAEEVSIASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWR----TGIDPTNGQAVV 453 (671) T ss_pred c-eeEEEcCCCCCccchhHHHHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHh----hhccccchhhhh Confidence 1 1111111111111 1112334566666667899999998654331 1 24555544333 22222222111 Q ss_pred --ccCccceeEeecceeEEEeccccc--cceehHHHHHHHHHHhcCcCCcccCCCccccCCcceeeeeccCcceecCCch Q lcl|NC_016571. 518 --YKSEACRASINLWDARYINEPTWG--RFSLNIENMYAFAVAGGGADGRIYAADMPDHEGNRILRIAHDPFVEFEADDP 593 (733) Q Consensus 518 --YgTpa~Ra~I~~~s~~l~ns~y~~--~vpln~dlA~k~A~y~Ga~dG~~~~~~~~D~~pN~~v~~~~dlnv~f~~~~v 593 (733) -+....|++++.---++.|....+ .+|..-.+|=-+|+--. ..|.|+.+--..++ .|+-+.++.+.+.. .- T Consensus 454 ~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AGl~Ar~D~-~~g~~~span~~~~---~i~g~~~~~~~~~~-~~ 528 (671) T protein:vir:56 454 DNLNVSTTYAVIDGNYKYQYDKYNDRNRWVPLAGDIAGLCAYTDQ-VSQPWMSPAGFNRG---QIKGVNRLAVDLRR-AH 528 (671) T ss_pred hhccCCcceEEEecCceEEecccCCceeEechHHHHHHHHHHhhc-cCCcEECcCCceec---cccccccceeecCh-hH Confidence 123346777777666666654443 46777777777777663 34677754322222 23334445544433 34 Q ss_pred hhhhhhCceeEeeeeeeeEEEccccccccCCchhhhhhhhhhhhHHHHHHHHHHHhhhhccccccChhHHHHHHHHHHHH Q lcl|NC_016571. 594 AANNLIQGCITVTPITESQFCRPALPTVYGNINSVLKDLTNVWKCVVVEKILQDIWIQVSGDTQLGKEGYLSFVKDGAEK 673 (733) Q Consensus 594 ~a~~w~ng~i~v~~yD~~~~y~PaL~TVy~ndtSVLns~~tv~~c~~~~kv~~~~w~~fsG~~slt~~ql~~~~~d~I~~ 673 (733) +...-.+|.+.+.++-.+-...=+=+|. .++.|..+-.-..=+.-++|+-+.+..+++.++ .+ ++.+.+++.+-|+. 
T Consensus 529 ~~~Ln~~gIn~i~~~~~~G~~~wG~rT~-~~~~~~~~~i~vrR~~~~i~~si~~~~~~~v~e-pn-~~~~~~~i~~~i~~ 605 (671) T protein:vir:56 529 RDALYQIGINPVVGFAGQGFVLYGDKTA-TQQASAFDRINVRRLFNLLKKAISDAAKYRLFE-LN-DEFTRSSFKSEIDA 605 (671) T ss_pred HHHHhhCCceEEEEecCCeEEEEcceec-CCCCcccceEehhhHHHHHHHHHHHHHHHhcCC-CC-CHHHHHHHHHHHHH Confidence 5666778888888764332222233555 334455544444445668999999999989884 33 46677777777777 Q ss_pred Hhhhhhcc-eecceeEecccceeccchhcCceEEEeeec--cCCcEEEEEEEEEEEccCCCCC Q lcl|NC_016571. 674 RIRDLFGS-VISNWEVVPSFREDSPTSKSVMYSITRLWF--GKGIYMLNSVLEAYNEDSLNAE 733 (733) Q Consensus 674 ~~rd~fg~-rv~n~~vi~p~~~~T~~d~~~g~SwT~~~~--nn~~~~m~~~leayr~~~l~a~ 733 (733) .+++++.. .+.--.|.....+=|+.|-++|.-.-...+ ..|.--+.+.|-..|.+-==+| T Consensus 606 fL~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~Pae~I~~~~~~~~~~~~f~e 668 (671) T protein:vir:56 606 YLTNIQDLGGVYDFRVVCDETNNPGSVIDRNEFVASIYVKPAKSINFITLNFVATSTDADFAE 668 (671) T ss_pred HHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCcchhh Confidence 77775432 122223466677888889899987666655 4454555666655555411233 No 13 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=20.59 E-value=2.7 Score=18.18 Aligned_cols=619 Identities=10% Similarity=0.020 Sum_probs=230.1 Q ss_pred CceeeecccceeeccceeeccccccccCCCcccccccceeeecccccCcccc--cCCcchhhhcccccCCCccccchHHH Q lcl|NC_016571. 1 METFNKVIPGKVVNEGIVSRETFPAAVPVISSPAHFPDFAAVTPRGSTKRAT--VSTGTFTSKFGDTTDPFGLYYNPVTY 78 (733) Q Consensus 1 M~~~~n~~P~~v~~~GirD~S~~~l~~~~~~~p~H~Plf~~~~p~G~~~~~~--v~~gd~~siyGD~~D~~s~yfn~~t~ 78 (733) |++ + .|| |-++=+ |. .++......+ ..-|-..+++|-.+... .+-.||...||...|-. .-.. 
T Consensus 1 ma~-~--~Pg-Vyv~E~-~~-~~~i~~~~ts----~~~~vG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~-----~~~~ 65 (664) T protein:vir:98 1 MAL-Q--SPG-IETKET-SV-QSTVVRNSTG----RAAIVGKFSWGPAYQIRQISNEVELVNYFGAPDNLT-----ADYF 65 (664) T ss_pred Cce-e--cCc-eEEEec-CC-Cccccccccc----ceEEEeeccCCCCCccEEecCHHHHHHhcCCccccc-----hhHH Confidence 994 3 898 566544 64 3444433333 45688999999875554 36778999999754422 2233 Q ss_pred HHHHHhhcccceEEEEEeccccccceeeeEEeeecccccceeEc-Ccc-cEecCCCCCcccCccccccCCeeEE---eeE Q lcl|NC_016571. 79 AIQKLGQAGQASFSFKRLTNNTAKSRTIIGLAIFSGDIPNYLRD-GNG-DYVLDAAGNPKVDETTATIAGKWVC---TGV 153 (733) Q Consensus 79 laq~l~~a~~n~f~~~RL~p~dA~s~~~i~~dv~~~~VP~Y~R~-~nG-~~~~d~~G~p~v~~t~~tI~gk~V~---~~~ 153 (733) +.+..++..|+.|++-|+..+.+..++.-...-+..+.-.+-.+ ..| .+....++.+....-.+.-+|.|-. .-+ T Consensus 66 ~v~~~f~ngg~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~gn~~~v~i 145 (664) T protein:vir:98 66 MSAVNFLQYGNDLRLVRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNGKILAVTI 145 (664) T ss_pred HHHHHHHhcCCeEEEEEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCCceeeEee Confidence 44455577888899999876554322211111111111111000 011 0111111221211112222332210 000 Q ss_pred eec-ccccccceeeccccccccCccccCcCceeeee-heehhhccccccccccccccchhhcchhhhhhhccccceeEEE Q lcl|NC_016571. 154 LKS-EGEVGEAKAFEVTSTDETPGIPTGTNGKFYPL-AEVIGGIGDTYNANWISIGHNFQTDWNEVARFVQTNGSYPFIL 231 (733) Q Consensus 154 ~~~-~~~~g~~~a~~~~~~~~~~~~~~g~q~~~yPL-~E~~~~~Gd~~n~n~i~~g~~~~~dw~~~~~fv~~~~~~pf~~ 231 (733) .+. ........+... +. -.+.. .+.....+...+...+. . +.....+++-. 
T Consensus 146 ~~~~~~~~~~~~~~~~-----------~~--~~~~~~~s~~~~s~g~a~a~~v~--~------------v~~d~~~~~~~ 198 (664) T protein:vir:98 146 PKRKKSLLVLNRSVLT-----------QI--FLLVGTTEIVSQSSGVSASITID--G------------IESDSGITLLN 198 (664) T ss_pred ccCccceeeccccccc-----------cc--ceecccceeeeeecccceeeecc--c------------ccccceeeccc Confidence 000 000000000000 00 00000 00000000000000000 0 00000000000 Q ss_pred eeeeecc-cccccceeeeccCcceeEeecccccccccceeeeeeeeeeecc----CcccCcCCCCCCccceEEEhhhHHH Q lcl|NC_016571. 232 NMGVLLD-NGLRVPANTINGTPDTTFTMFDTVNEFNTRYGLKVAVDQYTGN----NVNRPVEVQDAPFDDVFMYTQNLSD 306 (733) Q Consensus 232 ~~~s~~d-~g~~~~~~t~~~~~d~~~~~fd~~~~~~~~y~~~v~~d~y~~~----~~~~p~~p~~~PF~~~yvY~~Nie~ 306 (733) ....... .++.+.........-...... ++.+-+ .+.+..-... ....++.+.-.+....- ++-+.. T Consensus 199 ~~~a~~~i~~~~~~~~~~~~~~~~~~a~~-~G~~Gn-----~isv~i~s~~~~~~~~~i~~~~~~~~~~~~~--~~~~~~ 270 (664) T protein:vir:98 199 LDIAKETIQGTSFQTLTQKYQIPSVVALY-PGELGS-----TVQVEIISKAAYDTGAMISGYPSGISVKNSG--RSVMTY 270 (664) T ss_pred cceeeeccccccceeeeeccccceeeeee-cccccc-----eeeeeecccccccCcceEeeccCceecccce--eeeeec Confidence 0000000 000000000000000000000 000000 0000000000 00000000000000000 000000 Q ss_pred HHH--HHHhhhcccccccCCcccchh--ccCCcce--eeccccccccCCCcceeEEecccccccccccceeeccceeecc Q lcl|NC_016571. 307 VAK--ELYAAEYGTDDPTNQPPNVLS--KRLPKHA--IMNMFDLVNHNGKPYKHIVFGGNFDQAGKVTGSRFSMNHYLQS 380 (733) Q Consensus 307 Vl~--~ly~aE~~~~d~t~~p~~~~~--~~~~~~~--~~n~f~~~~~nG~PY~~Iq~~g~~d~~g~v~gs~~~~n~~i~~ 380 (733) +.+ ..|+.-...++.... -+.+. ...+... ..+..+.....+. .+.+....+....+ ...+.. 
T Consensus 271 ~~~~~~~~~~~~~~~~~~~e-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~-------~~~~~~ 339 (664) T protein:vir:98 271 GPQTDNQYAFVVRRGGIVQE-SFIVSTDKTDKDIYGVNIYMDDFFANGGS---QYVFGTSMNWPKGF-------SGILEF 339 (664) T ss_pred cccCccceeEEEecCCceee-eEEeecccCcccceeeeeechhheecccc---eeeeeecccCCccc-------ceeEec Confidence 000 000000000000000 00000 0000000 0011111100011 11111111100000 111222 Q ss_pred cCCCCccccccccccccccccccccccchhcccccchhhhhhhhhhhhhhhhhhhhhcccccceeehheeeccceeeEec Q lcl|NC_016571. 381 NGGINPFADKDGKFPEAPTTWLPAIDGPWVADVSDPDLVISHKQAWEMNQMLIEAYLTTYITSLDVKDVIRNRSSFMWDV 460 (733) Q Consensus 381 sGG~dgt~d~dgk~~d~~t~~~~~ld~~~~~d~~~~~~~~~~~~~~~m~~~l~~a~l~~~~~~~~~nd~~ry~~~~~~Ds 460 (733) .||.|.. |+ +.+.+. +.+|+.-+ ...+++.+ ++.== T Consensus 340 ~gg~~~~--------~~---------------~g~~~~----~tgl~~l~---------~~~~~~~~--------ll~~p 375 (664) T protein:vir:98 340 GGGLSSN--------DT---------------VGADEL----MTGWDMFA---------DREALHVP--------LLIAG 375 (664) T ss_pred cCccccc--------cc---------------cCchhH----HHHHHhhh---------cccccccc--------eEEec Confidence 3444431 00 000000 12222111 11112222 23222 Q ss_pred CCC-------HHHHHHHHHHHhcCcceEEEEEEEee----CC-CCCHHHHHHHHHHHHhhhheeeeeeeccCccceeEee Q lcl|NC_016571. 461 GYN-------QKIKDIMIQFLSKRKDIIVVPCATEY----LR-KKTQDELYSTATMLNTRIVMIPESEVYKSEACRASIN 528 (733) Q Consensus 461 Gfs-------~~~K~~m~~fl~~RkD~~vv~~T~~~----~~-~~s~~e~~s~a~~L~trl~~~PES~~YgTpa~Ra~I~ 528 (733) ||+ +.+..+|.....+|||.|.++..+.. .. ..+.+++..-...+... ........-+.-..|++++ T Consensus 376 ~~~~~~~~~~~~v~~al~~~a~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~s~~~~l~ 454 (664) T protein:vir:98 376 GCAGESVEIASTVQKHVISIGDERQDCTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKIS-GGTPVDNNLNVSSSYGFLD 454 (664) T ss_pred CCCCCcHHHHHHHHHHHHHHHHhcCCeEEEEccccceeccCCccccHHHHHHHhhhcccc-ccchhhhhcCCccceEEEE Confidence 332 35778888889999999999965533 12 24555544333332111 0111111224455677777 Q ss_pred cceeEEEecccc--ccceehHHHHHHHHHHhcCcCCcccCCCccccCCcceeeeeccCcceecCCchhhhhhhCceeEee Q lcl|NC_016571. 
529 LWDARYINEPTW--GRFSLNIENMYAFAVAGGGADGRIYAADMPDHEGNRILRIAHDPFVEFEADDPAANNLIQGCITVT 606 (733) Q Consensus 529 ~~s~~l~ns~y~--~~vpln~dlA~k~A~y~Ga~dG~~~~~~~~D~~pN~~v~~~~dlnv~f~~~~v~a~~w~ng~i~v~ 606 (733) ----++.|.... ..+|..-.+|=-+|+-- ...|.|+.+--..++ .|+-..++.+... +.-+...=.+|...+. T Consensus 455 ~p~~~~~d~~~~~~~~~p~sg~~AGl~A~~D-~~~g~~~span~~~~---~i~g~~~~~~~~~-~~~~~~Ln~~gIn~i~ 529 (664) T protein:vir:98 455 GNYKYQYDKYNDVNRWVPLAGDIAGLCVYTD-SVANPWMSPAGYNRG---QIRNCIKLAIEPR-TAHRDAMYQVQINPVT 529 (664) T ss_pred cCeEEEecccCCceEEechHHHHHHHHHHhh-hcCCcEECcCCceee---eeeccccceeecC-hhhHHHHHhCCCeEEE Confidence 655555543332 34677777777777655 344666654322221 2333344443332 2334444567776676 Q ss_pred eeee-eEEEccccccccCCchhhhhhhhhhhhHHHHHHHHHHHhhhhccccccChhHHHHHHHHHHHHHhhhhhc-ceec Q lcl|NC_016571. 607 PITE-SQFCRPALPTVYGNINSVLKDLTNVWKCVVVEKILQDIWIQVSGDTQLGKEGYLSFVKDGAEKRIRDLFG-SVIS 684 (733) Q Consensus 607 ~yD~-~~~y~PaL~TVy~ndtSVLns~~tv~~c~~~~kv~~~~w~~fsG~~slt~~ql~~~~~d~I~~~~rd~fg-~rv~ 684 (733) .+.. +-..+=+=+|.-.++ |-.+-.-..-+.-++++-+.+.++++.++ .+ ++.+.+++.+-|+..+++++. +.+- T Consensus 530 ~~~~~~G~~~wG~rT~~~~~-s~~~~i~vrR~~~~i~~si~~~~~~~v~e-pn-~~~l~~~i~~~i~~~L~~l~~~gal~ 606 (664) T protein:vir:98 530 GFAGGSGFVLYGDKTLTSVP-SPFDRINVRRLFNMIKKDIGDNAKYKLFE-NN-DDFTRASFRMDTGQYMTNIRALGGCY 606 (664) T ss_pred EeeCCCcEEEEcccccCCCC-cccceEeehhHHHHHHHHHHHHHHHhhcC-CC-CHHHHHHHHHHHHHHHHHHHhcCcee Confidence 6543 222222335553333 33332222234558999999999999984 34 567777777778877777654 2232 Q ss_pred ceeEecccceeccchhcCceEEEeeec--cCCcEEEEEEEEEEEccCCCCC Q lcl|NC_016571. 
685 NWEVVPSFREDSPTSKSVMYSITRLWF--GKGIYMLNSVLEAYNEDSLNAE 733 (733) Q Consensus 685 n~~vi~p~~~~T~~d~~~g~SwT~~~~--nn~~~~m~~~leayr~~~l~a~ 733 (733) .-.|....++=|+.|-.+|.-.-...+ ..|.--+.+.+..-|.+--=+| T Consensus 607 g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~q~~~~~~~~e 657 (664) T protein:vir:98 607 DYRVICDTTNNTPDVIDRNEFVATVYVKPPRSINYITLNFVATSTGADFDE 657 (664) T ss_pred eeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCcchhH Confidence 334567777889999999987666544 4555555666666665522222 Done!