BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|Aclame:protein:vir:105656|NCBI_annot:putative large terminase subunit|genbank:acc:YP_425018;genbank:gi:83571766;uniprot:Q2WC34;genbank:GeneID:3837297
         (405 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

gi|10294|lcl|protein:vir:105656 Length: 405 # NCBI annotation: p...   847   0.0
gi|8927|lcl|protein:vir:97005 Length: 632 # NCBI annotation: 40 ...   812   0.0
gi|15201|lcl|protein:vir:7028 Length: 631 # NCBI annotation: lar...   682   0.0
gi|7495|lcl|protein:vir:103312 Length: 624 # NCBI annotation: Te...   569   e-164
gi|11728|lcl|protein:vir:78939 Length: 601 # NCBI annotation: pu...   259   4e-71
gi|12767|lcl|protein:vir:80218 Length: 595 # NCBI annotation: pu...   259   4e-71
gi|18767|lcl|protein:vir:6335 Length: 601 # NCBI annotation: put...   254   1e-69
gi|3994|lcl|protein:vir:99686 Length: 586 # NCBI annotation: DNA...   199   4e-53
gi|4214|lcl|protein:vir:94726 Length: 575 # NCBI annotation: mat...   192   6e-51
gi|14546|lcl|protein:vir:8897 Length: 582 # NCBI annotation: DNA...   184   2e-48
gi|18462|lcl|protein:vir:2213 Length: 586 # NCBI annotation: DNA...   181   2e-47
gi|16146|lcl|protein:vir:10462 Length: 586 # NCBI annotation: DN...   180   3e-47
gi|3734|lcl|protein:vir:94555 Length: 585 # NCBI annotation: DNA...   172   4e-45
gi|16205|lcl|protein:vir:1554 Length: 587 # NCBI annotation: DNA...   172   8e-45
gi|13826|lcl|protein:vir:3376 Length: 586 # NCBI annotation: DNA...   171   1e-44
gi|7204|lcl|protein:vir:100065 Length: 577 # NCBI annotation: DN...   166   5e-43
gi|11501|lcl|protein:vir:78718 Length: 574 # NCBI annotation: DN...   145   1e-36
gi|5868|lcl|protein:vir:95431 Length: 535 # NCBI annotation: hyp...    52   1e-08
gi|9169|lcl|protein:vir:99234 Length: 550 # NCBI annotation: hyp...    40   6e-05
gi|3935|lcl|protein:vir:103868 Length: 551 # NCBI annotation: hy...    40   7e-05
gi|12711|lcl|protein:vir:80169 Length: 565 # NCBI annotation: te...    39   1e-04
gi|4089|lcl|protein:vir:94598 Length: 581 # NCBI annotation: PfW...    32   0.011
gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: g...    31   0.033
gi|9289|lcl|protein:vir:97165 Length: 431 # NCBI annotation: ORF...    29   0.100
gi|15319|lcl|protein:vir:3140 Length: 535 # NCBI annotation: hyp...    29   0.12
gi|20804|lcl|protein:vir:106290 Length: 633 # NCBI annotation: g...    28   0.16
gi|16897|lcl|protein:vir:5867 Length: 508 # NCBI annotation: sim...    28   0.18
gi|6093|lcl|protein:vir:95761 Length: 432 # NCBI annotation: ter...    28   0.26
gi|22942|lcl|protein:vir:101803 Length: 613 # NCBI annotation: g...    27   0.30
gi|8062|lcl|protein:vir:103374 Length: 536 # NCBI annotation: pu...    27   0.31
gi|7770|lcl|protein:vir:96410 Length: 536 # NCBI annotation: hyp...    27   0.31
gi|23191|lcl|protein:vir:101186 Length: 613 # NCBI annotation: t...    27   0.32
gi|21077|lcl|protein:vir:100538 Length: 612 # NCBI annotation: g...    27   0.33
gi|24596|lcl|protein:vir:98262 Length: 609 # NCBI annotation: gp...    27   0.41
gi|7375|lcl|protein:vir:99083 Length: 566 # NCBI annotation: gp7...    27   0.49
gi|18586|lcl|protein:vir:8649 Length: 564 # NCBI annotation: gp7...    27   0.52
gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: ter...    27   0.60
gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: g...    27   0.64
gi|13703|lcl|protein:vir:4908 Length: 168 # NCBI annotation: gp1...    26   1.00
gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T...    26   1.1
gi|16791|lcl|protein:vir:2742 Length: 168 # NCBI annotation: put...    25   1.7
gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: t...    24   2.5
gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: ter...    24   3.7
gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp...    24   3.7
gi|8026|lcl|protein:vir:96484 Length: 169 # NCBI annotation: tai...    24   3.8
gi|3997|lcl|protein:vir:100199 Length: 629 # NCBI annotation: pu...    23   7.0
gi|360|lcl|protein:vir:344 Length: 482 # NCBI annotation: termin...    22   9.6

>gi|10294|lcl|protein:vir:105656 Length: 405 # NCBI annotation: putative large terminase subunit # Family: family:all:697 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425018;genbank:gi:83571766;uniprot:Q2WC34;genbank:GeneID:3837297
          Length = 405

 Score =  847 bits (2188), Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 405/405 (100%), Positives = 405/405 (100%)

Query: 1   MAKARESQAEALARWEMLQELQQTFPYTAEGLLLFADTVIHNLIAGNPHLIRMQADILKF 60
           MAKARESQAEALARWEMLQELQQTFPYTAEGLLLFADTVIHNLIAGNPHLIRMQADILKF
Sbjct: 1   MAKARESQAEALARWEMLQELQQTFPYTAEGLLLFADTVIHNLIAGNPHLIRMQADILKF 60

Query: 61  LFYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFR 120
           LFYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFR
Sbjct: 61  LFYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFR 120

Query: 121 GLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAGMQGARADIILADDV 180
           GLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAGMQGARADIILADDV
Sbjct: 121 GLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAGMQGARADIILADDV 180

Query: 181 ESMQNARTAAGRALLEELTKEFESINQFGDIIYLGTPQNVNSIYNNLPARGYSVRIWTAR 240
           ESMQNARTAAGRALLEELTKEFESINQFGDIIYLGTPQNVNSIYNNLPARGYSVRIWTAR
Sbjct: 181 ESMQNARTAAGRALLEELTKEFESINQFGDIIYLGTPQNVNSIYNNLPARGYSVRIWTAR 240

Query: 241 YPSVEQEQCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAPCAPEMYDDDVLIEKEISQG 300
           YPSVEQEQCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAPCAPEMYDDDVLIEKEISQG
Sbjct: 241
YPSVEQEQCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAPCAPEMYDDDVLIEKEISQG 300 Query: 301 AAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWSNDSINIIGDAPKYGNK 360 AAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWSNDSINIIGDAPKYGNK Sbjct: 301 AAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWSNDSINIIGDAPKYGNK 360 Query: 361 PTDFMYRPVARPYEWGAVTRKIMYIDPAGKHLCQRKTLLIRWNSH 405 PTDFMYRPVARPYEWGAVTRKIMYIDPAGKHLCQRKTLLIRWNSH Sbjct: 361 PTDFMYRPVARPYEWGAVTRKIMYIDPAGKHLCQRKTLLIRWNSH 405 >gi|8927|lcl|protein:vir:97005 Length: 632 # NCBI annotation: 40 # Family: family:all:697 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654141;genbank:gi:108862025;genbank:GeneI D:5075954 Length = 632 Score = 812 bits (2097), Expect = 0.0, Method: Compositional matrix adjust. Identities = 387/389 (99%), Positives = 389/389 (100%) Query: 1 MAKARESQAEALARWEMLQELQQTFPYTAEGLLLFADTVIHNLIAGNPHLIRMQADILKF 60 MAKARESQAEALARWEMLQELQQTFPYTAEGLLLFADTVIHNLIAGNPHLIRMQADILKF Sbjct: 1 MAKARESQAEALARWEMLQELQQTFPYTAEGLLLFADTVIHNLIAGNPHLIRMQADILKF 60 Query: 61 LFYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFR 120 LFYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFR Sbjct: 61 LFYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFR 120 Query: 121 GLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAGMQGARADIILADDV 180 GLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAGMQGARADIILADDV Sbjct: 121 GLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAGMQGARADIILADDV 180 Query: 181 ESMQNARTAAGRALLEELTKEFESINQFGDIIYLGTPQNVNSIYNNLPARGYSVRIWTAR 240 ESMQNARTAAGRALLEELTKEFESINQFGDIIYLGTPQNVNSIYNNLPARGYSVRIWTAR Sbjct: 181 ESMQNARTAAGRALLEELTKEFESINQFGDIIYLGTPQNVNSIYNNLPARGYSVRIWTAR 240 Query: 241 YPSVEQEQCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAPCAPEMYDDDVLIEKEISQG 300 YPSVEQEQCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAPCAPEMYDD+VLIEKEISQG Sbjct: 241 YPSVEQEQCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAPCAPEMYDDEVLIEKEISQG 300 Query: 301 AAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWSNDSINIIGDAPKYGNK 360 AAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWSNDSINIIGDAPKYGNK Sbjct: 301 
AAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWSNDSINIIGDAPKYGNK 360 Query: 361 PTDFMYRPVARPYEWGAVTRKIMYIDPAG 389 PTDFMYRPVARPYEWGAV+RKIMYIDPAG Sbjct: 361 PTDFMYRPVARPYEWGAVSRKIMYIDPAG 389 >gi|15201|lcl|protein:vir:7028 Length: 631 # NCBI annotation: large terminase subunit # Family: family:all:697 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853601;genbank:gi:31711683;genbank:GeneID :1481809 Length = 631 Score = 682 bits (1759), Expect = 0.0, Method: Compositional matrix adjust. Identities = 316/389 (81%), Positives = 351/389 (90%) Query: 1 MAKARESQAEALARWEMLQELQQTFPYTAEGLLLFADTVIHNLIAGNPHLIRMQADILKF 60 MA+ARESQAEALARWE L ELQQTFPYT GLL FA VI+NLI GNP L R+QADILKF Sbjct: 1 MARARESQAEALARWEALHELQQTFPYTVAGLLSFAQVVINNLITGNPDLNRVQADILKF 60 Query: 61 LFYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFR 120 LF G+KYR++EA RG AKTT++AIY VFRIIHEPHKRIM+VSQ AKRAEEIAGWV+KIFR Sbjct: 61 LFGGNKYRMVEAQRGQAKTTIAAIYAVFRIIHEPHKRIMIVSQTAKRAEEIAGWVIKIFR 120 Query: 121 GLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAGMQGARADIILADDV 180 GLDFLEFMLPDIYAGD+AS+K FEIHYTLRGSDKSPSV+CYSIEAGMQGARADIILADDV Sbjct: 121 GLDFLEFMLPDIYAGDKASIKGFEIHYTLRGSDKSPSVACYSIEAGMQGARADIILADDV 180 Query: 181 ESMQNARTAAGRALLEELTKEFESINQFGDIIYLGTPQNVNSIYNNLPARGYSVRIWTAR 240 ES+QN+RTAAGRALLE+LTKEFESINQFGDIIYLGTPQ+VNSIYNNLPARGY +RIW R Sbjct: 181 ESLQNSRTAAGRALLEDLTKEFESINQFGDIIYLGTPQSVNSIYNNLPARGYQIRIWPGR 240 Query: 241 YPSVEQEQCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAPCAPEMYDDDVLIEKEISQG 300 YP++EQE CYGDFLAPMI QDM D+P+LRSGYG+DG GAP PEMYDD+ LIEKEISQG Sbjct: 241 YPTLEQEACYGDFLAPMIRQDMIDDPSLRSGYGIDGTQGAPTCPEMYDDEKLIEKEISQG 300 Query: 301 AAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWSNDSINIIGDAPKYGNK 360 AKFQLQFMLNTR+MDADRYPLRLN LI SFGT+ VP MPTWSNDS+N+I DAP++GNK Sbjct: 301 TAKFQLQFMLNTRLMDADRYPLRLNQLILMSFGTDVVPEMPTWSNDSVNLISDAPRFGNK 360 Query: 361 PTDFMYRPVARPYEWGAVTRKIMYIDPAG 389 PTD++YRPV RPYEW + R++MYIDPAG Sbjct: 361 PTDYLYRPVPRPYEWRPIQRRLMYIDPAG 389 >gi|7495|lcl|protein:vir:103312 Length: 624 # NCBI annotation: 
TerL large terminase subunit-like protein # Family: family:all:697 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039677;genbank:gi:126000006;genbank:Ge neID:4818388 Length = 624 Score = 569 bits (1467), Expect = e-164, Method: Compositional matrix adjust. Identities = 264/389 (67%), Positives = 318/389 (81%) Query: 1 MAKARESQAEALARWEMLQELQQTFPYTAEGLLLFADTVIHNLIAGNPHLIRMQADILKF 60 MAKARES AL RWE+L +LQ+ FP T EGLL FA+ VIHNLI GNPHL R+QADIL+F Sbjct: 1 MAKARESIQAALERWELLSQLQEAFPNTVEGLLEFAEVVIHNLIPGNPHLNRIQADILRF 60 Query: 61 LFYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFR 120 +F G KYR++EA RG AKTT++AIY VF IIH PH RI++ SQ +KRAEEIAGWV+KIFR Sbjct: 61 MFTGKKYRMVEAQRGQAKTTIAAIYAVFCIIHRPHFRILISSQTSKRAEEIAGWVIKIFR 120 Query: 121 GLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAGMQGARADIILADDV 180 GLD LEFM+PDIY+GD+AS++ FEIHYTLRGS SPSV+CYSIE MQGARAD+I+ADDV Sbjct: 121 GLDILEFMMPDIYSGDKASIRGFEIHYTLRGSGASPSVACYSIEGSMQGARADLIIADDV 180 Query: 181 ESMQNARTAAGRALLEELTKEFESINQFGDIIYLGTPQNVNSIYNNLPARGYSVRIWTAR 240 ES+QN+ TAAGR LEE TKEFESINQ GDI+YLGTPQ++NSIYNNLP+RGY +RIW R Sbjct: 181 ESLQNSATAAGRVKLEEATKEFESINQTGDILYLGTPQSINSIYNNLPSRGYQLRIWPGR 240 Query: 241 YPSVEQEQCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAPCAPEMYDDDVLIEKEISQG 300 YP+VEQ+ YGDFLAP+I++DM+ NP LR G G+ G P PEMY+D+ LIEKEISQG Sbjct: 241 YPTVEQQVSYGDFLAPLIIEDMEANPELRRGGGITRLQGQPTCPEMYNDEALIEKEISQG 300 Query: 301 AAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWSNDSINIIGDAPKYGNK 360 AKFQLQFMLNTR+ D++R+PL+L++++F +FG ++VP MP S DSIN I +A + GNK Sbjct: 301 TAKFQLQFMLNTRLSDSERFPLKLSSIMFGNFGVDKVPEMPLHSTDSINEIKEAQRPGNK 360 Query: 361 PTDFMYRPVARPYEWGAVTRKIMYIDPAG 389 TD YR RPYEW TR+IMYIDPAG Sbjct: 361 STDRFYRMAPRPYEWKPATRRIMYIDPAG 389 >gi|11728|lcl|protein:vir:78939 Length: 601 # NCBI annotation: putative DNA maturase B # Family: family:all:697 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522835;genbank:gi:158345070;genbank:Ge neID:5687429 Length = 601 Score = 259 bits (662), Expect = 4e-71, Method: 
Compositional matrix adjust. Identities = 141/384 (36%), Positives = 221/384 (57%), Gaps = 19/384 (4%) Query: 14 RWEMLQELQQTFPYTAEGLLLFADTVIHNLIAGNPHLIRMQADILKFLFYGHKYRLIEAP 73 R+++ E+ +P F D + ++ + MQ DI F+ ++ A Sbjct: 6 RFQIAHEVMDMYPR-------FRDFCLDAMLFLGFKMTWMQLDIADFMQDSPNKAMVAAQ 58 Query: 74 RGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFRGLDFLEFMLPDIY 133 RG AK+T++ IY V+ I+ +P R M+VS + +AEE + K+ D L ++ P+ Sbjct: 59 RGEAKSTIACIYVVWCIVRDPRTRAMLVSGSGDKAEENGQLITKLIMHWDLLAYLRPEAR 118 Query: 134 AGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAGMQGARADIILADDVESMQNARTAAGRA 193 GDR S +F++++ L+G +KS S++C I A +QG RADI++ DD+E+ +N TA RA Sbjct: 119 MGDRTSATSFDVNWALKGVEKSASINCIGITAALQGYRADILIPDDIETTKNGLTATERA 178 Query: 194 LLEELTKEFESINQFGDIIYLGTPQNVNSIYNNLPARGYSVRIWTARYPSVEQEQCYGDF 253 L ++EF SI G I+YLGTPQ+ SIYN LPARG+ +RIW R+P++++++ YGD+ Sbjct: 179 KLTRQSQEFTSICTHGKILYLGTPQSRESIYNGLPARGFLMRIWPGRFPTLDEQERYGDW 238 Query: 254 LAPMI------VQDMKDNPALRSGYGLDGNSGAPCAPEMYDDDVLIEKEISQGAAKFQLQ 307 LAP I +++ NP R+G GLDG G P+ Y+++ LI+KE+ QGA FQLQ Sbjct: 239 LAPSILERIARLEERGHNP--RTGKGLDGTRGWAADPQRYNEEDLIDKELDQGAEGFQLQ 296 Query: 308 FMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWSNDS-INIIGDAPKYG-NKPTDFM 365 +ML+T + D R L+L +L+F E VP W+ D + DA ++ KP + Sbjct: 297 YMLDTSLADEQRMQLKLRDLLFIDATHESVPEQVAWAADERFKLKFDAHRFPIIKPE--L 354 Query: 366 YRPVARPYEWGAVTRKIMYIDPAG 389 Y P W + + M++DPAG Sbjct: 355 YLPALMAGGWAPLQQMTMFVDPAG 378 >gi|12767|lcl|protein:vir:80218 Length: 595 # NCBI annotation: putative DNA maturase B # Family: family:all:697 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522892;genbank:gi:158345185;genbank:Ge neID:5687481 Length = 595 Score = 259 bits (662), Expect = 4e-71, Method: Compositional matrix adjust. 
Identities = 134/340 (39%), Positives = 204/340 (60%), Gaps = 4/340 (1%) Query: 53 MQADILKFLFYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIA 112 MQ DI +F+ YG + ++ A RG AK+T++ ++ ++ ++ +P R+++VS +AEE Sbjct: 38 MQEDIAEFMQYGPQRSMVAAQRGEAKSTIACLFGLWNLVQDPTHRVVLVSGAQDKAEENG 97 Query: 113 GWVVKIFRGLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAGMQGARA 172 + + L+++ PD YAGDR SV F++H++L+G DKS SV+C I + +QG R Sbjct: 98 KLMHGLIHNWPLLQYLAPDKYAGDRTSVLEFDVHWSLKGVDKSASVNCLGITSSLQGYRG 157 Query: 173 DIILADDVESMQNARTAAGRALLEELTKEFESI--NQFGDIIYLGTPQNVNSIYNNLPAR 230 D+++ DD+E+ +N TA RA L L+KEF SI ++ G I+YLGTPQ SIYN LP R Sbjct: 158 DLLIPDDIETTKNGLTATERAKLITLSKEFTSIVADRNGRILYLGTPQTRESIYNTLPGR 217 Query: 231 GYSVRIWTARYPSVEQEQCYGDFLAPMIVQDMK-DNPALRSGYGLDGNSGAPCAPEMYDD 289 G++VR+W R+P + YGD LAP I++ M ++G GLDG G PE Y + Sbjct: 218 GFTVRVWPGRFPKASELPKYGDALAPSILERMALLGDRCQTGRGLDGTRGWSTDPERYSE 277 Query: 290 DVLIEKEISQGAAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWSNDSIN 349 + L +KE+ QG F+LQFMLNT + DA R L+L +LI F E+VP W+ D Sbjct: 278 EELCDKELDQGPETFELQFMLNTSLSDAARQQLKLRDLIVADFSHEQVPESVFWAADPRF 337 Query: 350 IIGDAPKYGNKPTDFMYRPVARPYEWGAVTRKIMYIDPAG 389 I D P+ + M+RP + + + +++DPAG Sbjct: 338 KI-DLPQEFPVQSVEMFRPASVHEHFAQIKSMTLFLDPAG 376 >gi|18767|lcl|protein:vir:6335 Length: 601 # NCBI annotation: putative DNA maturase B # Family: family:all:697 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877482;genbank:gi:33300854;uniprot:Q7Y2C2 ;genbank:GeneID:1482617 Length = 601 Score = 254 bits (649), Expect = 1e-69, Method: Compositional matrix adjust. 
Identities = 139/384 (36%), Positives = 218/384 (56%), Gaps = 19/384 (4%) Query: 14 RWEMLQELQQTFPYTAEGLLLFADTVIHNLIAGNPHLIRMQADILKFLFYGHKYRLIEAP 73 R+++ E++ +P F D + ++ + MQ DI F+ ++ A Sbjct: 6 RFQIAHEVRDMYPR-------FRDFCLDAMLFLGFKMTWMQLDIADFMQDSPNKAMVAAQ 58 Query: 74 RGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFRGLDFLEFMLPDIY 133 RG AK+T++ IY V+ I P R M+VS + +AEE + K+ D L ++ P+ Sbjct: 59 RGEAKSTIACIYVVWCITQNPATRAMLVSGSGDKAEENGQLITKLIMHWDLLAYLRPEAR 118 Query: 134 AGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAGMQGARADIILADDVESMQNARTAAGRA 193 GDR S +F++++ L+G +KS S++C I A +QG RADI++ DD+E+ +N TA RA Sbjct: 119 MGDRTSATSFDVNWALKGVEKSASINCIGITAALQGYRADILIPDDIETTKNGLTATERA 178 Query: 194 LLEELTKEFESINQFGDIIYLGTPQNVNSIYNNLPARGYSVRIWTARYPSVEQEQCYGDF 253 L ++EF SI G I+YLGTPQ+ SIYN LPARG+ +RIW R+P+++++ YGD+ Sbjct: 179 KLTRQSQEFTSICTHGKILYLGTPQSRESIYNGLPARGFLMRIWPGRFPTLDEQARYGDW 238 Query: 254 LAPMI------VQDMKDNPALRSGYGLDGNSGAPCAPEMYDDDVLIEKEISQGAAKFQLQ 307 LAP I +++ NP R+G GLDG G P+ Y+++ L++KE+ QG FQLQ Sbjct: 239 LAPSILARIARLEEKGHNP--RTGKGLDGTRGWAADPQRYNEEDLLDKELDQGPEGFQLQ 296 Query: 308 FMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWSNDS-INIIGDAPKYGN-KPTDFM 365 +ML+T + D R L+L +L+F E VP W+ D + DA ++ KP + Sbjct: 297 YMLDTSLADEQRMQLKLRDLLFIDATHESVPEQVAWAADERFKLKFDAHRFPVIKPE--L 354 Query: 366 YRPVARPYEWGAVTRKIMYIDPAG 389 Y P W + + M++DPAG Sbjct: 355 YLPALMAGGWAPLQQMTMFVDPAG 378 >gi|3994|lcl|protein:vir:99686 Length: 586 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249597;genbank:gi:68299748;genbank:GeneID :3800001 Length = 586 Score = 199 bits (507), Expect = 4e-53, Method: Compositional matrix adjust. 
Identities = 125/346 (36%), Positives = 190/346 (54%), Gaps = 16/346 (4%) Query: 48 PHLIRMQADILKFLFYGHKYRLI-EAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAK 106 P R Q D+ + L G + R I +A RGI K+ ++ + V+++ + P + M+VS + + Sbjct: 33 PEPTRCQKDMARKLAAGDERRFILQAFRGIGKSFITCAFVVWKLWNNPQLKFMIVSASKE 92 Query: 107 RAEEIAGWVVKIFRGLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAG 166 RA+ + ++ +I L FL + P R SV +F++ L D SPSV I Sbjct: 93 RADANSIFIKRIIDLLPFLHELKP--RPEQRDSVISFDV--GLAKPDHSPSVKSVGITGQ 148 Query: 167 MQGARADIILADDVESMQNARTAAGRALLEELTKEFESI-NQFGDIIYLGTPQNVNSIYN 225 + G+RADI++ADDVE N+ T A R L EL KEF++I G IIYLGTPQ ++Y Sbjct: 149 LTGSRADILIADDVEVPNNSATQAARDRLGELVKEFDAILKPNGTIIYLGTPQCEMTLYR 208 Query: 226 NLPARGYSVRIWTARYPS-VEQEQCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAPCAP 284 L RGY IW ARYP + + YG+ LAPM+ ++ +NP + P P Sbjct: 209 ELENRGYKTTIWPARYPKDMNDLETYGNRLAPMLKDELMENP--------EAYWWQPTDP 260 Query: 285 EMYDDDVLIEKEISQGAAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWS 344 +DD+ L E+E+S G A F LQFMLN + DA++YPL+L + I + ++ P+ W Sbjct: 261 VRFDDEDLRERELSYGKAGFALQFMLNPNLSDAEKYPLKLRDFIVAALEVDKAPLTYGWL 320 Query: 345 NDSINIIGDAPKYGNKPTDFMYRPVARPYEWGAVTRKIMYIDPAGK 390 + N++ + P+ G K + VA + + T KIM IDP+G+ Sbjct: 321 PNPQNLLQNVPQVGLKGDTYHRYDVADKRQ-ASYTSKIMAIDPSGR 365 >gi|4214|lcl|protein:vir:94726 Length: 575 # NCBI annotation: maturation protein # Family: family:all:697 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338131;genbank:gi:77118209;genbank:GeneID :3707752 Length = 575 Score = 192 bits (488), Expect = 6e-51, Method: Compositional matrix adjust. 
Identities = 126/347 (36%), Positives = 189/347 (54%), Gaps = 18/347 (5%) Query: 48 PHLIRMQADILKFLFYGHKYRLI-EAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAK 106 P R Q D+ K L G R I +A RGI K+ ++ + V+++ + P + M+VS + + Sbjct: 23 PVPTRCQIDMAKKLSAGDNRRFILQAFRGIGKSFITCAFVVWKLWNNPDLKFMIVSASKE 82 Query: 107 RAEEIAGWVVKIFRGLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAG 166 RA+ + ++ +I + L+ + P G R +V +F++ D SPSV I Sbjct: 83 RADANSIFIKRIIDLMPQLKELKPK--QGQRDAVISFDVGPA--KPDHSPSVKSVGITGQ 138 Query: 167 MQGARADIILADDVESMQNARTAAGRALLEELTKEFESI-NQFGDIIYLGTPQNVNSIYN 225 + G+RADI++ADDVE N+ T A R L EL KEF++I G IIYLGTPQN ++Y Sbjct: 139 LTGSRADILIADDVEVPNNSATQAARDRLSELVKEFDAILKPGGTIIYLGTPQNEMTLYR 198 Query: 226 NLPARGYSVRIWTARYPSVEQE-QCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAPCAP 284 L RGY+ IW ARYP ++ Q YGD LAPM+ +++++P S Y P Sbjct: 199 ELEGRGYTTTIWPARYPRDRKDWQSYGDRLAPMLQAELEEDP--ESFYW------RPTDE 250 Query: 285 EMYDDDVLIEKEISQGAAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWS 344 +DD L E+E+S G A F LQFMLN + DA++YPL+L +LI P++ W Sbjct: 251 VRFDDTDLKERELSYGKAGFALQFMLNPNLSDAEKYPLKLRDLIVADLDPASSPMVYQWL 310 Query: 345 NDSINIIGDAPKYGNKPTDF-MYRPVARPYEWGAVTRKIMYIDPAGK 390 + N D P G + Y+ V + + T+KI+ IDP+G+ Sbjct: 311 PNPQNKREDVPNVGLMGDSYHTYQTVGSAF--SSYTQKILVIDPSGR 355 >gi|14546|lcl|protein:vir:8897 Length: 582 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813786;genbank:gi:29366741;genbank:GeneID :1258833 Length = 582 Score = 184 bits (466), Expect = 2e-48, Method: Compositional matrix adjust. 
Identities = 121/347 (34%), Positives = 181/347 (52%), Gaps = 19/347 (5%) Query: 48 PHLIRMQADILKFLFYGHKYRLI-EAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAK 106 P + Q D+ K L G + R I +A RGI K+ ++ + V+++ + P + M+VS + + Sbjct: 32 PKPTKCQIDMAKKLSAGDERRFILQAFRGIGKSFITCAFVVWKLWNNPDLKFMIVSASKE 91 Query: 107 RAEEIAGWVVKIFRGLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAG 166 RA+ + ++ +I L FL + P G R S AF++ D SPSV I Sbjct: 92 RADANSVFIKRIIDLLPFLHELKPG--PGQRDSSLAFDVGPA--KPDHSPSVKSVGITGQ 147 Query: 167 MQGARADIILADDVESMQNARTAAGRALLEELTKEFESI-NQFGDIIYLGTPQNVNSIYN 225 + G+RADI++ADDVE N+ T R L EL KEF++I G IIYLGTPQ ++Y Sbjct: 148 LTGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAILKPGGTIIYLGTPQTEMTLYR 207 Query: 226 NLPARGYSVRIWTARYPSVEQE-QCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAPCAP 284 L RGY IW ARYP + + YG LAPM+ +++ + +L AP Sbjct: 208 ELEGRGYVTTIWPARYPKDQADWDSYGPRLAPMLAAELQADGSL---------FWAPTDE 258 Query: 285 EMYDDDVLIEKEISQGAAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWS 344 +DD L E+E+S G F LQFMLN + D ++YPL+L + I +F ++ P W Sbjct: 259 VRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQDKGPTTLIWM 318 Query: 345 NDSINIIGDAPKYGNKPTDFM-YRPVARPYEWGAVTRKIMYIDPAGK 390 ++ N P G K F Y V + + +KI+ IDP+G+ Sbjct: 319 PNAANECKGVPVVGLKGDRFHRYESVGQAT--ASYAQKILVIDPSGR 363 >gi|18462|lcl|protein:vir:2213 Length: 586 # NCBI annotation: DNA maturation protein # Family: family:all:697 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_042010;swissprot:sw:p03694;genbank:gi:962 7482;goa:P03694;uniprot:P03694;genbank:GeneID:1261062 Length = 586 Score = 181 bits (458), Expect = 2e-47, Method: Compositional matrix adjust. 
Identities = 122/350 (34%), Positives = 182/350 (52%), Gaps = 21/350 (6%) Query: 48 PHLIRMQADILKFLFYG-HKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAK 106 P + Q D+ K L G +K +++A RGI K+ ++ + V+ + +P +I++VS + + Sbjct: 33 PVPTKCQIDMAKVLANGDNKKFILQAFRGIGKSFITCAFVVWSLWRDPQLKILIVSASKE 92 Query: 107 RAEEIAGWVVKIFRGLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAG 166 RA+ + ++ I L FL + P G R SV +F++ D SPSV I Sbjct: 93 RADANSIFIKNIIDLLPFLSELKP--RPGQRDSVISFDVGPA--NPDHSPSVKSVGITGQ 148 Query: 167 MQGARADIILADDVESMQNARTAAGRALLEELTKEFESINQ---FGDIIYLGTPQNVNSI 223 + G+RADII+ADDVE N+ T R L L +EF ++ + +IYLGTPQ ++ Sbjct: 149 LTGSRADIIIADDVEIPSNSATMGAREKLWTLVQEFAALLKPLPSSRVIYLGTPQTEMTL 208 Query: 224 YNNLP-ARGYSVRIWTARYPSVEQEQCY-GDFLAPMIVQDMKDNPALRSGYGLDGNSGAP 281 Y L RGY+ IW A YP +E Y LAPM+ + +NP + +G P Sbjct: 209 YKELEDNRGYTTIIWPALYPRTREENLYYSQRLAPMLRAEYDENP--------EALAGTP 260 Query: 282 CAPEMYDDDVLIEKEISQGAAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMP 341 P +D D L E+E+ G A F LQFMLN + DA++YPLRL + I + E+ P+ Sbjct: 261 TDPVRFDRDDLRERELEYGKAGFTLQFMLNPNLSDAEKYPLRLRDAIVAALDLEKAPMHY 320 Query: 342 TWSNDSINIIGDAPKYGNKPTDF-MYRPVARPYEWGAVTRKIMYIDPAGK 390 W + NII D P G K D Y + G +KI+ IDP+G+ Sbjct: 321 QWLPNRQNIIEDLPNVGLKGDDLHTYHDCSN--NSGQYQQKILVIDPSGR 368 >gi|16146|lcl|protein:vir:10462 Length: 586 # NCBI annotation: DNA maturase B # Family: family:all:697 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848309;genbank:gi:30387500;genbank:GeneID :1733974 Length = 586 Score = 180 bits (456), Expect = 3e-47, Method: Compositional matrix adjust. 
Identities = 122/350 (34%), Positives = 182/350 (52%), Gaps = 21/350 (6%) Query: 48 PHLIRMQADILKFLFYG-HKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAK 106 P + Q D+ K L G +K +++A RGI K+ ++ + V+ + +P +I++VS + + Sbjct: 33 PVPTKCQIDMAKVLANGDNKKFILQAFRGIGKSFITCAFVVWSLWRDPQLKILIVSASKE 92 Query: 107 RAEEIAGWVVKIFRGLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAG 166 RA+ + ++ I L FL + P G R SV +F++ D SPSV I Sbjct: 93 RADANSIFIKNIIDLLPFLAELKP--RPGQRDSVISFDVGPA--KPDHSPSVKSVGITGQ 148 Query: 167 MQGARADIILADDVESMQNARTAAGRALLEELTKEFESINQ---FGDIIYLGTPQNVNSI 223 + G+RADII+ADDVE N+ T R L L +EF ++ + +IYLGTPQ ++ Sbjct: 149 LTGSRADIIIADDVEIPSNSATMGAREKLWTLVQEFAALLKPLTSSRVIYLGTPQTEMTL 208 Query: 224 YNNLP-ARGYSVRIWTARYPSVEQEQCY-GDFLAPMIVQDMKDNPALRSGYGLDGNSGAP 281 Y L RGY+ IW A YP +E Y LAPM+ + +NP + +G P Sbjct: 209 YKELEDNRGYTTIIWPALYPRTREENLYYSQRLAPMLRAEYDENP--------EALAGTP 260 Query: 282 CAPEMYDDDVLIEKEISQGAAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMP 341 P +D D L E+E+ G A F LQFMLN + DA++YPLRL + I + E+ P+ Sbjct: 261 TDPVRFDRDDLRERELEYGKAGFTLQFMLNPNLSDAEKYPLRLRDAIVAALDLEKAPMHY 320 Query: 342 TWSNDSINIIGDAPKYGNKPTDF-MYRPVARPYEWGAVTRKIMYIDPAGK 390 W + NII D P G K D Y + G +KI+ IDP+G+ Sbjct: 321 QWLPNRQNIIEDLPNVGLKGDDLHTYHDCSN--NSGQYQQKILVIDPSGR 368 >gi|3734|lcl|protein:vir:94555 Length: 585 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919024;genbank:gi:119637788;genbank:GeneI D:5179315 Length = 585 Score = 172 bits (437), Expect = 4e-45, Method: Compositional matrix adjust. 
Identities = 119/351 (33%), Positives = 184/351 (52%), Gaps = 23/351 (6%) Query: 48 PHLIRMQADILKFLFYG-HKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAK 106 P + Q D+ + L G HK +++A RGI K+ ++ + V+ + +P ++++VS + + Sbjct: 33 PKPTKCQIDMARTLADGDHKKFILQAFRGIGKSFITCAFVVWVLWRDPQLKVLIVSASKE 92 Query: 107 RAEEIAGWVVKIFRGLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAG 166 RA+ + ++ I L FL + P G R SV +F++ L D SPSV I Sbjct: 93 RADANSIFIKNIIDLLPFLAELKP--RPGQRDSVISFDV--GLAKPDHSPSVKSVGITGQ 148 Query: 167 MQGARADIILADDVESMQNARTAAGRALLEELTKEFESINQ---FGDIIYLGTPQNVNSI 223 + G+RADII+ADDVE N+ T++ R L L EF ++ + +IYLGTPQ ++ Sbjct: 149 LTGSRADIIIADDVEVPGNSSTSSAREKLWTLVTEFAALLKPLPTSRVIYLGTPQTEMTL 208 Query: 224 YNNLP-ARGYSVRIWTARYPSVEQEQCY-GDFLAPMIVQDMKDNPALRSGYGLDGNSGAP 281 Y L +GYS IW A+YP + E Y GD LAPM+ + + G + G P Sbjct: 209 YKELEDNKGYSTVIWPAQYPRNDAEALYYGDRLAPMLKAEYDE--------GFELLRGQP 260 Query: 282 CAPEMYDDDVLIEKEISQGAAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMP 341 P +D D L E+E+ G A + LQFMLN + DA++YPLRL + I + E P+ Sbjct: 261 TDPVRFDTDDLRERELEYGKAGYTLQFMLNPNLSDAEKYPLRLRDAIVCAVDPERAPLSY 320 Query: 342 TWSNDSINIIGDAPKYGNKPTDF--MYRPVARPYEWGAVTRKIMYIDPAGK 390 W + N + P G K D + +R E+ + KI+ IDP+G+ Sbjct: 321 QWLPNRQNRNEELPNVGLKGDDIHSFHTCSSRTAEYQS---KILVIDPSGR 368 >gi|16205|lcl|protein:vir:1554 Length: 587 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052122;swissprot:trembl:q9t0z4;genbank:gi :9634048;uniprot:Q9T0Z4;genbank:GeneID:1262409 Length = 587 Score = 172 bits (435), Expect = 8e-45, Method: Compositional matrix adjust. 
Identities = 119/350 (34%), Positives = 183/350 (52%), Gaps = 21/350 (6%) Query: 48 PHLIRMQADILKFLFYG-HKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAK 106 P + Q D+ + L G +K +++A RGI K+ ++ + V+ + +P +I++VS + + Sbjct: 34 PPPTKCQIDMARCLANGDNKKFILQAFRGIGKSFITCAFVVWTLWRDPQLKILIVSASKE 93 Query: 107 RAEEIAGWVVKIFRGLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAG 166 RA+ + ++ I L FL + P G R SV +F++ D SPSV I Sbjct: 94 RADANSIFIKNIIDLLPFLAELKP--RPGQRDSVISFDVGPA--KPDHSPSVKSVGITGQ 149 Query: 167 MQGARADIILADDVESMQNARTAAGRALLEELTKEFESINQ---FGDIIYLGTPQNVNSI 223 + G+RADII+ADDVE N+ T R L L +EF ++ + +IYLGTPQ ++ Sbjct: 150 LTGSRADIIIADDVEIPSNSATQGAREKLWTLVQEFAALLKPLPTSRVIYLGTPQTEMTL 209 Query: 224 YNNLP-ARGYSVRIWTARYP-SVEQEQCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAP 281 Y L RGY+ IW A YP S E++ YGD LAPM+ ++ D G + G P Sbjct: 210 YKELEDNRGYTTIIWPALYPRSREEDLYYGDRLAPMLREEFND--------GFEMLQGQP 261 Query: 282 CAPEMYDDDVLIEKEISQGAAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMP 341 P +D + L E+E+ G A F LQFMLN + DA++YPLRL + I E+ P+ Sbjct: 262 TDPVRFDMEDLRERELEYGKAGFTLQFMLNPNLSDAEKYPLRLRDAIVCGLDFEKAPMHY 321 Query: 342 TWSNDSINIIGDAPKYGNKPTDFM-YRPVARPYEWGAVTRKIMYIDPAGK 390 W + N + P G K D Y ++ G ++I+ IDP+G+ Sbjct: 322 QWLPNRQNRNEELPNVGLKGDDIHSYHSCSQ--NTGQYQQRILVIDPSGR 369 >gi|13826|lcl|protein:vir:3376 Length: 586 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523347;swissprot:trembl:q8w5t4;genbank:gi :17570838;uniprot:Q8W5T4;genbank:GeneID:927460 Length = 586 Score = 171 bits (434), Expect = 1e-44, Method: Compositional matrix adjust. 
Identities = 119/350 (34%), Positives = 183/350 (52%), Gaps = 21/350 (6%) Query: 48 PHLIRMQADILKFLFYG-HKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAK 106 P + Q D+ K L G +K +++A RGI K+ ++ + V+ + +P +I++VS + + Sbjct: 33 PVPTKCQIDMAKVLANGDNKKFILQAFRGIGKSFITCAFVVWTLWRDPQLKILIVSASKE 92 Query: 107 RAEEIAGWVVKIFRGLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAG 166 RA+ + ++ I L FL + P G R SV +F++ D SPSV I Sbjct: 93 RADANSIFIKNIIDLLPFLAELKP--RPGQRDSVISFDVGPA--KPDHSPSVKSVGITGQ 148 Query: 167 MQGARADIILADDVESMQNARTAAGRALLEELTKEFESINQ---FGDIIYLGTPQNVNSI 223 + G+RADII+ADDVE N+ T R L L +EF ++ + +IYLGTPQ ++ Sbjct: 149 LTGSRADIIIADDVEIPSNSATQGAREKLWTLVQEFAALLKPLPTSRVIYLGTPQTEMTL 208 Query: 224 YNNLP-ARGYSVRIWTARYP-SVEQEQCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAP 281 Y L RGY+ IW A YP S E++ YG+ LAPM+ ++ D G + G P Sbjct: 209 YKELEDNRGYTTIIWPALYPRSREEDLYYGERLAPMLREEFND--------GFEMLQGQP 260 Query: 282 CAPEMYDDDVLIEKEISQGAAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMP 341 P +D + L E+E+ G A F LQFMLN + DA++YPLRL + I E+ P+ Sbjct: 261 TDPVRFDMEDLRERELEYGKAGFTLQFMLNPNLSDAEKYPLRLRDAIVCGLDFEKAPMHY 320 Query: 342 TWSNDSINIIGDAPKYGNKPTDFM-YRPVARPYEWGAVTRKIMYIDPAGK 390 W + N + P G K D Y ++ G ++I+ IDP+G+ Sbjct: 321 QWLPNRQNRNEELPNVGLKGDDIHSYHSCSQ--NTGQYQQRILVIDPSGR 368 >gi|7204|lcl|protein:vir:100065 Length: 577 # NCBI annotation: DNA maturase beta subunit # Family: family:all:697 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214228;genbank:gi:61806451;genbank:GeneID :3294745 Length = 577 Score = 166 bits (420), Expect = 5e-43, Method: Compositional matrix adjust. 
Identities = 113/346 (32%), Positives = 182/346 (52%), Gaps = 20/346 (5%) Query: 48 PHLIRMQADILKFLFYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKR 107 P R Q I +L G K I+A RG+ K+ ++ + ++ + ++ K+IM++S + +R Sbjct: 26 PSPTRAQYAIADYLQSGPKRLQIQAFRGVGKSWITGAFVLWTLFNDAEKKIMIISASKER 85 Query: 108 AEEIAGWVVKIFRGLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAGM 167 A+ ++ ++ K+ +L+ + P R S +F++ L ++PSV I + Sbjct: 86 ADNMSIFLQKLIIETPWLKHLRPKSDDA-RWSRISFDV---LCSPHQAPSVKSVGITGQL 141 Query: 168 QGARADIILADDVESMQNARTAAGRALLEELTKEFESINQFGD---IIYLGTPQNVNSIY 224 G+RAD+++ DD+E N+ T R L +L E ESI D I+YLGTPQ ++Y Sbjct: 142 TGSRADLMILDDIEVPGNSMTELMREKLLQLCTEAESILTPKDDSRIMYLGTPQTTFTVY 201 Query: 225 NNLPARGYSVRIWTARYPSVEQEQCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAPCAP 284 L R Y +W ARYP + Y +AP + +D+ DN A SG P Sbjct: 202 RKLAERAYRPFVWPARYP--KDITPYEGLIAPQLQEDI-DNGA---------ESGTVTDP 249 Query: 285 EMYDDDVLIEKEISQGAAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWS 344 + +DDD L ++E + G + F LQFML+T + DA+++PL++ +L+ TS E P W Sbjct: 250 DRFDDDDLQQRESAMGRSNFMLQFMLDTTLSDAEKFPLKMADLVITSVNPTEAPDNVIWC 309 Query: 345 NDSINIIGDAPKYGNKPTDFMYRPVARPYEWGAVTRKIMYIDPAGK 390 +D NII DAP G P D+ Y P+ EW I +DP+G+ Sbjct: 310 SDPQNIIKDAPTVG-LPGDYFYSPMQLQGEWTPYQETICSVDPSGR 354 >gi|11501|lcl|protein:vir:78718 Length: 574 # NCBI annotation: DNA packaging protein # Family: family:all:697 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285469;genbank:gi:148724503;genbank:Ge neID:5220189 Length = 574 Score = 145 bits (365), Expect = 1e-36, Method: Compositional matrix adjust. 
Identities = 106/353 (30%), Positives = 174/353 (49%), Gaps = 34/353 (9%) Query: 48 PHLIRMQADILKFLFYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKR 107 P R Q I +L +G K I A RG+ K+ ++A + ++ + +P ++IMV+S + +R Sbjct: 28 PKPTRAQLAIADYLQHGPKRLQISAFRGVGKSWITAAFVLWVLFVDPDRKIMVISASKER 87 Query: 108 AEEIAGWVVKIFRGLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAGM 167 A+ + + K+ +++L + P + R S +F++ ++PSV I M Sbjct: 88 ADNFSIFCQKLILDIEWLSHLRPRD-SDQRWSRISFDVGPA--KPHQAPSVKSVGITGQM 144 Query: 168 QGARADIILADDVESMQNARTAAGRALLEELTKEFESI---NQFGDIIYLGTPQNVNSIY 224 G+RA +++ DDVE N+ T R L +L E ESI + I++LGTPQ+ +IY Sbjct: 145 TGSRAHLMVFDDVEVPANSATDMQREKLLQLVSESESILVPDDDARIMFLGTPQSTFTIY 204 Query: 225 NNLPARGYSVRIWTARYPSVEQEQCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAPCAP 284 L R Y +W ARYP Y LAP +V D++ +P L + P Sbjct: 205 RKLAERSYRPFVWPARYP--RDLSKYEGLLAPQLVADLEKDPEL---------TWKPTDT 253 Query: 285 EMYDDDVLIEKEISQGAAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWS 344 +++ L+E+E + G + F LQFML+T + DA+++PL+ +LI T G E WS Sbjct: 254 R-FNELNLMERESAMGRSNFMLQFMLDTSLSDAEKFPLKFQDLIVTPLGAECAEAY-AWS 311 Query: 345 NDSINIIGDAPKYGNK-------PTDFMYRPVARPYEWGAVTRKIMYIDPAGK 390 D P+Y K P D Y P+ + I+ +DP+G+ Sbjct: 312 AD--------PRYMRKELNPVGLPGDRFYGPMYIDEGIVPYSETIVSVDPSGR 356 >gi|5868|lcl|protein:vir:95431 Length: 535 # NCBI annotation: hypothetical protein ORF049 # Family: family:all:543 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294642;genbank:gi:149408208;genbank:Ge neID:5236998 Length = 535 Score = 52.0 bits (123), Expect = 1e-08, Method: Compositional matrix adjust. 
Identities = 58/255 (22%), Positives = 105/255 (41%), Gaps = 27/255 (10%) Query: 30 EGLLLFADTVIHNLIAGNPHLIRMQADILKFLFYGH-----KYRLIEAPRGIAKTTLSAI 84 E L FA V + G H + A + + +G +LI PR K+ + A Sbjct: 30 EDLYFFAKLVNPGYVYGEVHR-EIFAWMQDYTLFGRGSDLTSNKLIMLPRAHLKSHMVAT 88 Query: 85 YTVFRIIHEPHKRIMVVSQNAKRAEE----IAGWVVKIFRGLDFLEFMLPDIYAGDRASV 140 + + I P I+ +S A AE + + F E++ P ++ S Sbjct: 89 WCAWIITRHPEVTILYISATATLAETQLYAVKNILASSVYNRYFPEYIHPQEGKREKWSS 148 Query: 141 KAFEIHYTLRGSD--KSPSVSCYSIEAGMQGARADIILADDVESMQNARTAAGRALLEEL 198 A I + R + + +++ + G ADII+ADD+ +NA T GR +++ Sbjct: 149 NAMSIDHVQRKKEGIRDATIATAGLTTNTTGWHADIIVADDLVVPENAYTEDGRESVQKK 208 Query: 199 TKEFESI-NQFGDIIYLGTPQNVNSIYNNLPARGYSV-----------RIWTARYPSVEQ 246 + +F SI N G + GT + + IY ++ Y + +W + +VE+ Sbjct: 209 SSQFTSIRNAGGFTMACGTRYHPSDIYATWRSQKYDIFDDEGMKIDEHPVWEIKEYAVEK 268 Query: 247 EQCYGDFLAPMIVQD 261 + FL P +++ Sbjct: 269 DNI---FLWPRTIRE 280 >gi|9169|lcl|protein:vir:99234 Length: 550 # NCBI annotation: hypothetical protein # Family: family:all:543 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950450;genbank:gi:119953651;genbank:GeneI D:4643094 Length = 550 Score = 40.0 bits (92), Expect = 6e-05, Method: Compositional matrix adjust. 
Identities = 45/179 (25%), Positives = 77/179 (43%), Gaps = 30/179 (16%) Query: 66 KYRLIEAPRGIAKTTL-SAIYTVFRII----HEPHKRIMVVSQNAKRAEEIAGWVVKIFR 120 ++ I APRG AK+TL S I+ ++ ++ H P + Q A E I + R Sbjct: 85 QHEAIAAPRGNAKSTLVSQIFVIWCVLTGRKHYPLIIMDAFEQAATMLEAIKAELEFNPR 144 Query: 121 -GLDFLE-------FMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAGMQGARA 172 +DF + + + I + A V+ F +RG P R Sbjct: 145 LAMDFPQGAGKGRVWQVGTIVTANDAKVQVFGSGKRMRGLRHGPH-------------RP 191 Query: 173 DIILADDVESMQNARTAAGRALLEELTKE----FESINQFGDIIYLGTPQNVNSIYNNL 227 D+++ DD+E+ +N R+ R LE K+ S + D+I +GT + +S+ + L Sbjct: 192 DLVIGDDLENDENVRSPEQRDKLENWLKKTVLSLGSADDTMDVIIIGTILHYDSVLSRL 250 >gi|3935|lcl|protein:vir:103868 Length: 551 # NCBI annotation: hypothetical protein # Family: family:all:543 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938233;genbank:gi:38229138;genbank:GeneID :2648183 Length = 551 Score = 39.7 bits (91), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 45/179 (25%), Positives = 77/179 (43%), Gaps = 30/179 (16%) Query: 66 KYRLIEAPRGIAKTTL-SAIYTVFRII----HEPHKRIMVVSQNAKRAEEIAGWVVKIFR 120 ++ I APRG AK+TL S I+ ++ ++ H P + Q A E I + R Sbjct: 85 QHEAIAAPRGNAKSTLVSQIFVIWCVLTGRKHYPLIIMDAFEQAATMLEAIKAELEFNPR 144 Query: 121 -GLDFLE-------FMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAGMQGARA 172 +DF + + + I + A V+ F +RG P R Sbjct: 145 LAMDFPQGAGKGRVWQVGTIVTANDAKVQVFGSGKRMRGLRHGPH-------------RP 191 Query: 173 DIILADDVESMQNARTAAGRALLEELTKE----FESINQFGDIIYLGTPQNVNSIYNNL 227 D+++ DD+E+ +N R+ R LE K+ S + D+I +GT + +S+ + L Sbjct: 192 DLVVGDDLENDENVRSPEQRDKLENWLKKTVLSLGSADDTMDVIIIGTILHYDSVLSRL 250 >gi|12711|lcl|protein:vir:80169 Length: 565 # NCBI annotation: terminase # Family: family:all:543 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285801;genbank:gi:148747835;genbank:Ge neID:5220445 Length = 565 Score = 38.5 bits (88), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 42/178 (23%), Positives = 74/178 (41%), Gaps = 38/178 (21%) Query: 65 HKYRLIEAPRGIAKTTL-SAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFRGLD 123 ++ RL+ APRG K+T+ S +Y ++RI P R++V N KR ++ + Sbjct: 56 NRRRLVLAPRGHLKSTVCSVLYVLWRIYRNPDIRVLV-GTNLKRLSRAFIRELRQYFEDT 114 Query: 124 FLE------------FMLPDIYAGDRA-------SVKAFEIHYTLRGSDK---------- 154 +L+ ++P + A DR +V E TL K Sbjct: 115 WLQQNVWNVRPHIEGALVPALSASDRRKRNSQRNNVDYDEALATLTDDTKLIWSMEALQV 174 Query: 155 -------SPSVSCYSIEAGMQGARADIILADDVESMQNARTAAGRALLEELTKEFESI 205 P+V SI + G D+++ DD+ +N++T + E T++ ES+ Sbjct: 175 IRPTVMKEPTVQTVSIGTTVTGDHYDLLILDDIVDFENSKTEDKAENILEWTRDLESV 232 >gi|4089|lcl|protein:vir:94598 Length: 581 # NCBI annotation: PfWMP4_40 # Family: family:all:543 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762670;genbank:gi:115304378;genbank:GeneI D:5142298 Length = 581 Score = 32.3 bits (72), Expect = 0.011, Method: Compositional matrix adjust. Identities = 14/35 (40%), Positives = 23/35 (65%) Query: 68 RLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVS 102 RL++ PRG K+TL+ Y ++RI P+ R++ S Sbjct: 89 RLLQMPRGHLKSTLTVGYIMWRIYRNPNIRMLHAS 123 >gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214662;genbank:gi:61806303;genbank:GeneID :3294591 Length = 550 Score = 30.8 bits (68), Expect = 0.033, Method: Compositional matrix adjust. Identities = 11/52 (21%), Positives = 28/52 (53%) Query: 62 FYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAG 113 F+ H++ + + PR K+T+ Y ++ ++ + + +++ A A E+ G Sbjct: 71 FHQHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAPTAREMLG 122 >gi|9289|lcl|protein:vir:97165 Length: 431 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239721;genbank:gi:66394878;genbank:GeneID :5130898 Length = 431 Score = 28.9 bits (63), Expect = 0.100, Method: Compositional matrix adjust. 
Identities = 14/45 (31%), Positives = 25/45 (55%) Query: 59 KFLFYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQ 103 KF + YR+++ RG K+ +AI ++RI+ I+VV + Sbjct: 17 KFWHNKNFYRVVKGSRGSKKSKTTAINLIYRIMKYDWANILVVRR 61 >gi|15319|lcl|protein:vir:3140 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:543 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640322;genbank:gi:21234401;genbank:GeneID :956064 Length = 535 Score = 28.9 bits (63), Expect = 0.12, Method: Compositional matrix adjust. Identities = 43/203 (21%), Positives = 78/203 (38%), Gaps = 22/203 (10%) Query: 22 QQTFPYTAEGLLLFADTVIHNLI----AGNPHLIRMQADILKFLF-YGHKYRLIEA---- 72 Q P L+ + +H+L A PH R+ + K LF + + L E Sbjct: 9 QNDIPLEDPRLIELREACLHSLWIFAQAVEPH--RVYGECHKELFDWWQEMELEEVLNTL 66 Query: 73 ---PRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFRGLDFLEFML 129 PR K+ A++ ++I P I V A + + +K D + Sbjct: 67 ALMPRDHQKSHCIAVWVCWQIFKNPAVTIAYVCATESLAI-LQLYDIKQILTSDEFTRLS 125 Query: 130 PDIY-----AGDRASVKAFEIHYTLRGSDK--SPSVSCYSIEAGMQGARADIILADDVES 182 PD+ + + A + + +R ++ P+V +++ GA +I++ DDV Sbjct: 126 PDMIEPMEKKRQKWAETAIIVDHPIRKKERPRDPTVLATGLDSNNIGAHCNIMVKDDVVI 185 Query: 183 MQNARTAAGRALLEELTKEFESI 205 +N+ T R +E SI Sbjct: 186 DKNSLTETARQKVEAKAGHLSSI 208 Score = 23.9 bits (50), Expect = 4.1, Method: Compositional matrix adjust. 
Identities = 28/103 (27%), Positives = 41/103 (39%), Gaps = 32/103 (31%) Query: 273 GLDGNS-GAPCAPEMYDDDVLIEKEISQGAAKFQ----------------LQFMLNTRMM 315 GLD N+ GA C M DDV+I+K A+ + ++F + TR Sbjct: 165 GLDSNNIGAHCNI-MVKDDVVIDKNSLTETARQKVEAKAGHLSSILTTDGMEFCVGTRYH 223 Query: 316 DADRYPLRLNNLIFTSFGTEEVPVMPTWSNDSINIIGDAPKYG 358 D Y ++ TEEV W D ++G+ P Y Sbjct: 224 PKDHYQTLID-------MTEEV-----WEGD--QLVGERPVYA 252 >gi|20804|lcl|protein:vir:106290 Length: 633 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944105;genbank:gi:38640149;genbank:GeneID :2658038 Length = 633 Score = 28.5 bits (62), Expect = 0.16, Method: Compositional matrix adjust. Identities = 28/141 (19%), Positives = 68/141 (48%), Gaps = 15/141 (10%) Query: 8 QAEALARW--EMLQELQQTFPYTAEGLLLFADT---VIH-NLIAGNPHLIRMQADILKFL 61 +A +W EM++E ++ + ++ FA+T +IH + L Q D+L+ + Sbjct: 119 RANVPTKWTREMVEEWKRC----RDDIVYFAETYCSIIHIDWGVIKVQLRDYQKDMLRIM 174 Query: 62 FYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFRG 121 + + PR + KTT +AI+ ++ K + V++ ++E+ + + + Sbjct: 175 -ASERMSMHNLPRQLGKTTATAIFLTHFVVFNEAKAVGVLAHKGDMSKEV---LERTKQS 230 Query: 122 LDFL-EFMLPDIYAGDRASVK 141 ++ L +F+ P I ++ +++ Sbjct: 231 IELLPDFLQPGIVEWNKGNIE 251 >gi|16897|lcl|protein:vir:5867 Length: 508 # NCBI annotation: similar to terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835653;genbank:gi:30044056 Length = 508 Score = 28.1 bits (61), Expect = 0.18, Method: Compositional matrix adjust. 
Identities = 12/62 (19%), Positives = 32/62 (51%), Gaps = 1/62 (1%) Query: 50 LIRMQADILKFLFYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAE 109 L +Q ++ F ++ H+Y + E PR + T + Y + ++I + ++++ + A+ Sbjct: 40 LYPIQEKLINF-YHTHRYVITEKPRQMGVTWCAVAYALHQMIFNSNYKVLIAANKEATAK 98 Query: 110 EI 111 + Sbjct: 99 NV 100 >gi|6093|lcl|protein:vir:95761 Length: 432 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950582;genbank:gi:119953777;genbank:GeneI D:5076831 Length = 432 Score = 27.7 bits (60), Expect = 0.26, Method: Compositional matrix adjust. Identities = 23/107 (21%), Positives = 44/107 (41%), Gaps = 12/107 (11%) Query: 67 YRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVV----SQNAKRAEEIAGWVVKIFRGL 122 YR+++ RG K+ +A+Y + I+ ++VV + N + W Sbjct: 29 YRVVKGGRGSKKSKTTALYYIVAILKYNWANLLVVRRFSNTNKQSTYTDLKWAANRLNVS 88 Query: 123 DFLEF--MLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAGM 167 +F LP+I +VKA RG D ++ +++ G+ Sbjct: 89 HLFKFNESLPEI------TVKATGQKILFRGLDDPLKITSITVDTGL 129 >gi|22942|lcl|protein:vir:101803 Length: 613 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238880;genbank:gi:66391955;genbank:GeneID :3416630 Length = 613 Score = 27.3 bits (59), Expect = 0.30, Method: Compositional matrix adjust. 
Identities = 19/88 (21%), Positives = 43/88 (48%), Gaps = 5/88 (5%) Query: 54 QADILKFLFYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAG 113 Q D+L+ + G++ R + KTT+ AI+ + K + +++ A + E+ Sbjct: 134 QKDMLRIM-AGNRLMAANLSRQLGKTTVVAIFLAHFVCFNSAKNVGILAHKASMSAEV-- 190 Query: 114 WVVKIFRGLDFL-EFMLPDIYAGDRASV 140 + + + L+ L +F+ P I ++ S+ Sbjct: 191 -LHRTKQALELLPDFLQPGIVEWNKGSI 217 >gi|8062|lcl|protein:vir:103374 Length: 536 # NCBI annotation: putative head assembly cofactor # Family: family:all:543 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024735;genbank:gi:48697077;genbank:GeneID :2846042 Length = 536 Score = 27.3 bits (59), Expect = 0.31, Method: Compositional matrix adjust. Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 5/51 (9%) Query: 171 RADIILADDVESMQNART-AAGRALLEELTKEF-ESINQFGD---IIYLGT 216 R D+I+ DDV++ + A + ALLE T + I+ +G IIYLG Sbjct: 205 RPDLIVCDDVQTRECALSEVQNAALLEWFTATLVKCIDNYGSNRRIIYLGN 255 >gi|7770|lcl|protein:vir:96410 Length: 536 # NCBI annotation: hypothetical protein # Family: family:all:543 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218809;genbank:gi:147917326;genbank:Ge neID:5142613 Length = 536 Score = 27.3 bits (59), Expect = 0.31, Method: Compositional matrix adjust. Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 5/51 (9%) Query: 171 RADIILADDVESMQNART-AAGRALLEELTKEF-ESINQFGD---IIYLGT 216 R D+I+ DDV++ + A + ALLE T + I+ +G IIYLG Sbjct: 205 RPDLIVCDDVQTRECALSEVQNAALLEWFTATLVKCIDNYGSNRRIIYLGN 255 >gi|23191|lcl|protein:vir:101186 Length: 613 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932508;genbank:gi:37651634;genbank:GeneID :2610679 Length = 613 Score = 27.3 bits (59), Expect = 0.32, Method: Compositional matrix adjust. 
Identities = 19/88 (21%), Positives = 43/88 (48%), Gaps = 5/88 (5%) Query: 54 QADILKFLFYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAG 113 Q D+L+ + G++ R + KTT+ AI+ + K + +++ A + E+ Sbjct: 134 QKDMLRIM-AGNRLMAANLSRQLGKTTVVAIFLAHFVCFNSAKNVGILAHKASMSAEV-- 190 Query: 114 WVVKIFRGLDFL-EFMLPDIYAGDRASV 140 + + + L+ L +F+ P I ++ S+ Sbjct: 191 -LHRTKQALELLPDFLQPGIVEWNKGSI 217 >gi|21077|lcl|protein:vir:100538 Length: 612 # NCBI annotation: gp17 terminase subunit # Family: family:all:147 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656379;genbank:gi:109290130;genbank:GeneI D:4156516 Length = 612 Score = 27.3 bits (59), Expect = 0.33, Method: Compositional matrix adjust. Identities = 19/88 (21%), Positives = 43/88 (48%), Gaps = 5/88 (5%) Query: 54 QADILKFLFYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAG 113 Q D+L+ + G++ R + KTT+ AI+ + K + +++ A + E+ Sbjct: 133 QKDMLRIM-AGNRLMAANLSRQLGKTTVVAIFLAHFVCFNSAKNVGILAHKASMSAEV-- 189 Query: 114 WVVKIFRGLDFL-EFMLPDIYAGDRASV 140 + + + L+ L +F+ P I ++ S+ Sbjct: 190 -LHRTKQALELLPDFLQPGIVEWNKGSI 216 >gi|24596|lcl|protein:vir:98262 Length: 609 # NCBI annotation: gp17 terminase subunit, nuclease and ATPase # Family: family:all:147 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239195;genbank:gi:66391670;genbank:GeneID :3416364 Length = 609 Score = 26.9 bits (58), Expect = 0.41, Method: Compositional matrix adjust. 
Identities = 18/80 (22%), Positives = 40/80 (50%), Gaps = 6/80 (7%) Query: 65 HKYRLIEA--PRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFRGL 122 HK R++ R + KTT+ AI+ + K + V++ A + E+ + + + + Sbjct: 149 HKNRMVTCNLSRQLGKTTVVAIFLAHFVCFNEDKYVGVLAHKASMSAEV---LDRTKQAI 205 Query: 123 DFL-EFMLPDIYAGDRASVK 141 + L +F+ P I ++ S++ Sbjct: 206 ELLPDFLQPGIVEWNKGSIE 225 >gi|7375|lcl|protein:vir:99083 Length: 566 # NCBI annotation: gp7 # Family: family:all:144 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655687;genbank:gi:109521765;genbank:GeneI D:4157805 Length = 566 Score = 26.9 bits (58), Expect = 0.49, Method: Compositional matrix adjust. Identities = 39/147 (26%), Positives = 67/147 (45%), Gaps = 32/147 (21%) Query: 69 LIEAPRGIAKTTLSAIYTVFRIIH-EPHKRIMVVSQNAKRAEEIAGWVVKIFRGLDFLEF 127 LI P K+T++++YTV R + P+ RI++ A +++A + R L Sbjct: 95 LITCPPQEGKSTMASVYTVLRALQLNPNARIIL----ACYGQDLAHGHSRKCRDL----- 145 Query: 128 MLPDIYAGDRASVKAFEIHYTL-----RGSDKSPSVSCYSIEAG------------MQGA 170 + +G R ++ +I L RG++K VS +SIE G + G Sbjct: 146 -IKRHGSGVRDAMTGAQIEDKLGLKLERGANK---VSEWSIEGGTGGLVATGLGGTITGK 201 Query: 171 RADIILADD-VESMQNARTAAGRALLE 196 AD+ + DD + M A +A RA ++ Sbjct: 202 PADLFIIDDPYKHMSEADSATYRAKVD 228 >gi|18586|lcl|protein:vir:8649 Length: 564 # NCBI annotation: gp7 # Family: family:all:144 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817768;genbank:gi:29566200;genbank:GeneID :1259404 Length = 564 Score = 26.6 bits (57), Expect = 0.52, Method: Compositional matrix adjust. 
Identities = 39/147 (26%), Positives = 67/147 (45%), Gaps = 32/147 (21%) Query: 69 LIEAPRGIAKTTLSAIYTVFRIIH-EPHKRIMVVSQNAKRAEEIAGWVVKIFRGLDFLEF 127 LI P K+T++++YTV R + P+ RI++ A +++A + R L Sbjct: 93 LITCPPQEGKSTMASVYTVLRALQLNPNARIIL----ACYGQDLAHGHSRKCRDL----- 143 Query: 128 MLPDIYAGDRASVKAFEIHYTL-----RGSDKSPSVSCYSIEAG------------MQGA 170 + +G R ++ +I L RG++K VS +SIE G + G Sbjct: 144 -IKRHGSGVRDAMTGAQIEDKLGLKLERGANK---VSEWSIEGGSGGLVATGLGGTITGK 199 Query: 171 RADIILADD-VESMQNARTAAGRALLE 196 AD+ + DD + M A +A RA ++ Sbjct: 200 PADLFIIDDPYKHMSEADSATYRAKVD 226 >gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899601;genbank:gi:34419588;genbank:GeneID :2546011 Length = 600 Score = 26.6 bits (57), Expect = 0.60, Method: Compositional matrix adjust. Identities = 23/115 (20%), Positives = 54/115 (46%), Gaps = 8/115 (6%) Query: 18 LQELQQTFPYTAEGLLLFAD---TVIHNLIAGNPHLIR--MQADILKFLFYGHKYRLIEA 72 L E++ F + ++ FA+ +++H + GN ++ Q ++L+ + ++ + Sbjct: 89 LSEIKAEFQKCRDDIVYFAENYCSIVH-IDLGNIKMVPRPYQKEMLE-VADRSRFSIFLL 146 Query: 73 PRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFRGL-DFLE 126 PR + KTT+ I+ ++ K +++ + E+ V + L DFL+ Sbjct: 147 PRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLERVKNVIENLPDFLQ 201 >gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: gp123 # Family: family:all:147 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717790;genbank:gi:113200627;genbank:GeneI D:4239178 Length = 549 Score = 26.6 bits (57), Expect = 0.64, Method: Compositional matrix adjust. 
Identities = 9/50 (18%), Positives = 27/50 (54%) Query: 62 FYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEI 111 F+ +++ + + PR K+T+ Y ++ ++ + + +++ A A E+ Sbjct: 70 FHDNRFNIAKLPRQSGKSTIVTSYLLWYVLFNANVNVAILANKAATAREM 119 >gi|13703|lcl|protein:vir:4908 Length: 168 # NCBI annotation: gp168 # Family: family:all:464 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056686;genbank:gi:9635021;genbank:GeneID: 1262686 Length = 168 Score = 25.8 bits (55), Expect = 1.00, Method: Compositional matrix adjust. Identities = 12/45 (26%), Positives = 18/45 (40%) Query: 211 IIYLGTPQNVNSIYNNLPARGYSVRIWTARYPSVEQEQCYGDFLA 255 I +GT ++N + N GY V +W + YG A Sbjct: 65 ISAIGTKDDLNEMLKNSVVDGYKVEVWEIDLADKKSGGKYGALYA 109 >gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T4-like DNA packaging large subunit terminase # Family: family:all:147 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214360;genbank:gi:61806000;genbank:GeneID :3294466 Length = 547 Score = 25.8 bits (55), Expect = 1.1, Method: Compositional matrix adjust. Identities = 10/52 (19%), Positives = 25/52 (48%) Query: 62 FYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAG 113 F+ +++ + + PR K+T Y + + + + V++ A A ++ G Sbjct: 68 FHENRFNICKMPRQTGKSTTCISYLLHYAVFNDNVNVAVLANKASTARDLLG 119 >gi|16791|lcl|protein:vir:2742 Length: 168 # NCBI annotation: putative structural protein # Family: family:all:464 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695115;genbank:gi:23455884;genbank:GeneID :955649 Length = 168 Score = 25.0 bits (53), Expect = 1.7, Method: Compositional matrix adjust. 
Identities = 11/45 (24%), Positives = 18/45 (40%) Query: 211 IIYLGTPQNVNSIYNNLPARGYSVRIWTARYPSVEQEQCYGDFLA 255 I +GT ++N + GY V +W + + YG A Sbjct: 65 ISAIGTKDDLNEMLKKSVVDGYKVEVWEIDLADKKSDGKYGALYA 109 >gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: terminase large subunit gp17 # Family: family:all:147 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195134;genbank:gi:58532911;uniprot:Q5GQN8 ;genbank:GeneID:3260486 Length = 548 Score = 24.3 bits (51), Expect = 2.5, Method: Compositional matrix adjust. Identities = 10/50 (20%), Positives = 25/50 (50%) Query: 62 FYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEI 111 F+ +++ + + PR K+T Y + +I + I +++ A A ++ Sbjct: 70 FHKNRFNIAKLPRQTGKSTTVVSYLLHYLIFNDNVNIGILANKASTARDL 119 >gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891724;genbank:gi:33620526;genbank:GeneID :1725332 Length = 607 Score = 23.9 bits (50), Expect = 3.7, Method: Compositional matrix adjust. Identities = 20/90 (22%), Positives = 43/90 (47%), Gaps = 9/90 (10%) Query: 54 QADILKFLFYGHKYRLI--EAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEI 111 Q D+LK + H+ R+ + R + KTT AI+ + K + +++ A E+ Sbjct: 141 QKDMLKIM---HENRMSAHKLSRQLGKTTAVAIFLAHYVCFNKDKAVGILAHKGSMAVEV 197 Query: 112 AGWVVKIFRGLDFL-EFMLPDIYAGDRASV 140 + + + ++ L +F+ P I ++ S+ Sbjct: 198 ---LERTKQAIELLPDFLQPGIVEWNKKSI 224 >gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469498;genbank:gi:157311455;genbank:Ge neID:5602125 Length = 607 Score = 23.9 bits (50), Expect = 3.7, Method: Compositional matrix adjust. 
Identities = 20/90 (22%), Positives = 43/90 (47%), Gaps = 9/90 (10%) Query: 54 QADILKFLFYGHKYRLI--EAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEI 111 Q D+LK + H+ R+ + R + KTT AI+ + K + +++ A E+ Sbjct: 141 QKDMLKIM---HENRMSAHKLSRQLGKTTAVAIFLAHYVCFNKDKAVGILAHKGSMAVEV 197 Query: 112 AGWVVKIFRGLDFL-EFMLPDIYAGDRASV 140 + + + ++ L +F+ P I ++ S+ Sbjct: 198 ---LERTKQAIELLPDFLQPGIVEWNKKSI 224 >gi|8026|lcl|protein:vir:96484 Length: 169 # NCBI annotation: tail protein # Family: family:all:464 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238498;genbank:gi:66391774;genbank:GeneID :5176906 Length = 169 Score = 23.9 bits (50), Expect = 3.8, Method: Compositional matrix adjust. Identities = 10/42 (23%), Positives = 16/42 (38%) Query: 214 LGTPQNVNSIYNNLPARGYSVRIWTARYPSVEQEQCYGDFLA 255 +GT ++N + GY V +W + YG A Sbjct: 69 IGTKDDLNEMLKKSVVDGYKVEVWEIDLADKKSNGKYGALYA 110 >gi|3997|lcl|protein:vir:100199 Length: 629 # NCBI annotation: putative terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025027;genbank:gi:48697260;genbank:GeneID :2948297 Length = 629 Score = 23.1 bits (48), Expect = 7.0, Method: Compositional matrix adjust. Identities = 15/44 (34%), Positives = 25/44 (56%), Gaps = 2/44 (4%) Query: 195 LEELTKEFESINQFGDIIYLGTPQNVNSIYNN--LPARGYSVRI 236 L E TK+F+++ G+I L P ++ + N + RG SV+I Sbjct: 528 LNEPTKDFQNLFINGNITMLNDPLLIDGLNNAVLVEDRGGSVKI 571 >gi|360|lcl|protein:vir:344 Length: 482 # NCBI annotation: terminase # Family: family:all:147 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203458;genbank:gi:15320614;genbank:GeneID :921717 Length = 482 Score = 22.3 bits (46), Expect = 9.6, Method: Compositional matrix adjust. 
Identities = 8/19 (42%), Positives = 13/19 (68%)

Query: 239 ARYPSVEQEQCYGDFLAPM 257
           ++YPS ++   YGD LA +
Sbjct: 237 SKYPSHDKGSIYGDLLAAL 255

Database: capsid_neck_tail
  Posted date: Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database: 514

Lambda     K       H
 0.321   0.137   0.411

Lambda     K       H
 0.267   0.0410  0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 176,293
Number of Sequences: 514
Number of extensions: 7959
Number of successful extensions: 112
Number of sequences better than 100.0: 48
Number of HSP's better than 100.0 without gapping: 40
Number of HSP's successfully gapped in prelim test: 8
Number of HSP's that attempted gapping in prelim test: 12
Number of HSP's gapped (non-prelim): 54
length of query: 405
length of database: 206,069
effective HSP length: 74
effective length of query: 331
effective length of database: 168,033
effective search space: 55618923
effective search space used: 55618923
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 38 (19.2 bits)
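The footer figures can be cross-checked with a little arithmetic. The sketch below is not part of the BLAST report; it assumes the standard Karlin-Altschul relations (effective lengths obtained by subtracting the effective HSP length, bit score S' = (lambda*S - ln K) / ln 2, and E = search space * 2^-S'), and it assumes that the first Lambda/K row is the ungapped pair and the second the gapped pair. The function names are illustrative only. Because the hits were scored with compositional matrix adjustment, the Expect values printed in the report may differ slightly from this plain calculation.

```python
import math

# Values copied from the statistics footer of the report.
QUERY_LEN = 405
DB_LETTERS = 206_069
DB_SEQS = 514
EFF_HSP_LEN = 74

# (lambda, K) pairs; the ungapped/gapped labelling is an assumption.
UNGAPPED = (0.321, 0.137)
GAPPED = (0.267, 0.0410)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # Every database sequence is shortened by the effective HSP length.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
    )
    print("effective query length:", eff_q)
    print("effective database length:", eff_db)
    print("effective search space:", space)

    # Cutoff scores S1 and S2 from the footer, converted to bits.
    print("S1 bits:", round(bit_score(41, *UNGAPPED), 1))
    print("S2 bits:", round(bit_score(38, *GAPPED), 1))

    # The Syn5 hit reported as 145 bits with raw score 365.
    syn5_bits = bit_score(365, *GAPPED)
    print("Syn5 bits:", round(syn5_bits, 1))
    print("Syn5 E-value (unadjusted):", expect(syn5_bits, space))
```

Under these assumptions the computed effective lengths and search space reproduce the footer values (331, 168,033 and 55618923), and the S1 and S2 cutoffs convert to the bit values printed next to them.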