BLASTP 2.2.18 [Mar-02-2008] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= lcl|NC_015283.1_cdsid_YP_004323261.1 [gene=gp17] [protein=terminase DNA packaging enzyme large subunit] [protein_id=YP_004323261.1] [location=110903..112546] (547 letters) Database: capsid_neck_tail 514 sequences; 206,069 total letters Searching...................................................done Score E Sequences producing significant alignments: (bits) Value gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: g... 997 0.0 gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: g... 986 0.0 gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T... 851 0.0 gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: t... 791 0.0 gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: ter... 370 e-104 gi|27701|lcl|protein:vir:6893 Length: 611 # NCBI annotation: gp1... 360 e-101 gi|28182|lcl|protein:vir:108053 Length: 611 # NCBI annotation: g... 356 e-100 gi|27407|lcl|protein:vir:7202 Length: 610 # NCBI annotation: gp1... 353 2e-99 gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: ter... 353 3e-99 gi|22056|lcl|protein:vir:103455 Length: 610 # NCBI annotation: t... 353 3e-99 gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp... 352 5e-99 gi|24596|lcl|protein:vir:98262 Length: 609 # NCBI annotation: gp... 348 9e-98 gi|20804|lcl|protein:vir:106290 Length: 633 # NCBI annotation: g... 347 2e-97 gi|22942|lcl|protein:vir:101803 Length: 613 # NCBI annotation: g... 333 2e-93 gi|23191|lcl|protein:vir:101186 Length: 613 # NCBI annotation: t... 333 2e-93 gi|21077|lcl|protein:vir:100538 Length: 612 # NCBI annotation: g... 319 6e-89 gi|16897|lcl|protein:vir:5867 Length: 508 # NCBI annotation: sim... 185 1e-48 gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: ph... 74 5e-15 gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: te... 74 5e-15 gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: t... 60 5e-11 gi|17621|lcl|protein:vir:816 Length: 568 # NCBI annotation: hypo... 47 4e-07 gi|25680|lcl|protein:vir:10116 Length: 568 # NCBI annotation: hy... 47 4e-07 gi|51|lcl|protein:vir:3295 Length: 568 # NCBI annotation: putati... 47 4e-07 gi|26240|lcl|protein:vir:2763 Length: 568 # NCBI annotation: hyp... 47 4e-07 gi|25849|lcl|protein:vir:9949 Length: 568 # NCBI annotation: hyp... 47 4e-07 gi|1476|lcl|protein:vir:104436 Length: 568 # NCBI annotation: pu... 47 4e-07 gi|12711|lcl|protein:vir:80169 Length: 565 # NCBI annotation: te... 44 4e-06 gi|20547|lcl|protein:vir:105155 Length: 580 # NCBI annotation: c... 40 6e-05 gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: te... 38 3e-04 gi|26461|lcl|protein:vir:573 Length: 589 # NCBI annotation: unkn... 37 6e-04 gi|10061|lcl|protein:vir:99071 Length: 537 # NCBI annotation: gp... 36 0.001 gi|16631|lcl|protein:vir:9699 Length: 584 # NCBI annotation: hyp... 36 0.001 gi|14676|lcl|protein:vir:2499 Length: 474 # NCBI annotation: put... 34 0.004 gi|7504|lcl|protein:vir:99915 Length: 478 # NCBI annotation: gp2... 34 0.004 gi|14207|lcl|protein:vir:8315 Length: 503 # NCBI annotation: gp3... 34 0.006 gi|16385|lcl|protein:vir:3744 Length: 607 # NCBI annotation: orf... 33 0.009 gi|19904|lcl|protein:vir:3781 Length: 607 # NCBI annotation: ter... 33 0.010 gi|4240|lcl|protein:vir:94734 Length: 469 # NCBI annotation: lar... 32 0.015 gi|15774|lcl|protein:vir:6239 Length: 527 # NCBI annotation: gp3... 32 0.016 gi|15412|lcl|protein:vir:1325 Length: 519 # NCBI annotation: gp3... 32 0.026 gi|4089|lcl|protein:vir:94598 Length: 581 # NCBI annotation: PfW... 31 0.036 gi|18494|lcl|protein:vir:1636 Length: 469 # NCBI annotation: put... 30 0.065 gi|3859|lcl|protein:vir:107748 Length: 532 # NCBI annotation: gp... 30 0.084 gi|8927|lcl|protein:vir:97005 Length: 632 # NCBI annotation: 40 ... 30 0.11 gi|10294|lcl|protein:vir:105656 Length: 405 # NCBI annotation: p... 29 0.11 gi|11501|lcl|protein:vir:78718 Length: 574 # NCBI annotation: DN... 29 0.11 gi|14590|lcl|protein:vir:8098 Length: 515 # NCBI annotation: gp2... 29 0.16 gi|15101|lcl|protein:vir:3867 Length: 563 # NCBI annotation: put... 28 0.29 gi|15319|lcl|protein:vir:3140 Length: 535 # NCBI annotation: hyp... 27 0.55 gi|7204|lcl|protein:vir:100065 Length: 577 # NCBI annotation: DN... 27 0.57 gi|8126|lcl|protein:vir:102661 Length: 568 # NCBI annotation: te... 26 1.2 gi|3734|lcl|protein:vir:94555 Length: 585 # NCBI annotation: DNA... 25 1.9 gi|3994|lcl|protein:vir:99686 Length: 586 # NCBI annotation: DNA... 25 2.2 gi|27798|lcl|protein:vir:8408 Length: 214 # NCBI annotation: gp3... 25 3.1 gi|13482|lcl|protein:vir:9757 Length: 471 # NCBI annotation: put... 25 3.3 gi|18767|lcl|protein:vir:6335 Length: 601 # NCBI annotation: put... 24 4.8 gi|19975|lcl|protein:vir:108214 Length: 544 # NCBI annotation: g... 24 5.7 gi|10332|lcl|protein:vir:97407 Length: 514 # NCBI annotation: te... 23 8.8 gi|12298|lcl|protein:vir:79536 Length: 247 # NCBI annotation: pu... 23 9.8 >gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214662;genbank:gi:61806303;genbank:GeneID :3294591 Length = 550 Score = 997 bits (2578), Expect = 0.0, Method: Compositional matrix adjust. Identities = 458/543 (84%), Positives = 503/543 (92%) Query: 5 YLGNPNLKKVNTSIDFKPEEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHFQEEM 64 YLGNPNLKK N S F ++VAE +KC DP+YFI+ YI+IVSLDEG++PF+MY+FQE+M Sbjct: 8 YLGNPNLKKANVSTQFTKKQVAEYMKCAQDPVYFIRKYIRIVSLDEGVIPFDMYNFQEDM 67 Query: 65 VSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQLS 124 V+KFH HRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKA TAREML RLQLS Sbjct: 68 VTKFHQHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAPTAREMLGRLQLS 127 Query: 125 YENLPNWMQQGILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPNHIAD 184 YENLP WMQQGIL WN+GSLELENGSKI+A+STSASAVRGMSFN+IFLDEFAF+PNHIA+ Sbjct: 128 YENLPKWMQQGILGWNKGSLELENGSKILASSTSASAVRGMSFNIIFLDEFAFVPNHIAE 187 Query: 185 QFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERGTNEYVPTEVHWSEVPGRDDV 244 QFF+SVYPTISSGKSTKVIIISTPHGMN FYKLWHDAERG N YV TEVHWS+VPGRDD Sbjct: 188 QFFASVYPTISSGKSTKVIIISTPHGMNQFYKLWHDAERGANNYVATEVHWSQVPGRDDK 247 Query: 245 WKEQTIKNTSESQFRVEFECEFLGSVDTLIAPSKLRIMPYHDPITSNRGLAVYEQVIPEH 304 WK+QTI+NTSE+QFRVEFECEFLGSVDTLI PSKLRIMPY DPI NRGLAVYE V H Sbjct: 248 WKQQTIENTSEAQFRVEFECEFLGSVDTLITPSKLRIMPYKDPIQENRGLAVYEHVQENH 307 Query: 305 NYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNIIVDIAKNYNNAY 364 NYIITVDVSRGVGNDYSAFCVIDTTT+PYK+VARYKNN+IKP+V PN+IVD+A NYN AY Sbjct: 308 NYIITVDVSRGVGNDYSAFCVIDTTTVPYKVVARYKNNQIKPLVFPNLIVDVATNYNGAY 367 Query: 365 ILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMSTATKQV 424 +LCEVNDIGGQVADIIQ+DLEYENLLM +MRGRAGQQLGQGFSGKKTQLG+KMSTA KQV Sbjct: 368 VLCEVNDIGGQVADIIQYDLEYENLLMVSMRGRAGQQLGQGFSGKKTQLGIKMSTAVKQV 427 Query: 425 GCSNLKALIEEDKLLINDYDTIAELTTFIAKGQTFQAEEGCNDDLAMCLVIFGWMAMQPY 484 GCSNLKALIE+DKL++ DYDTIAELTTFI KGQ+FQAE+GCNDDLAMCLVIF WMAMQPY Sbjct: 428 GCSNLKALIEDDKLIVEDYDTIAELTTFIQKGQSFQAEDGCNDDLAMCLVIFSWMAMQPY 487 Query: 485 FKEMHDNDVRQRIYEQQKDMIEQDMAPFGFVTDGMEDDYFADAQGDVWKVAEYGDKSYMW 544 FKEMHDNDVRQRIYE Q+D IEQDMAPFGFV+DG+E+D F DAQGDVW++AEYGDKSYMW Sbjct: 488 FKEMHDNDVRQRIYEDQRDQIEQDMAPFGFVSDGLEEDQFQDAQGDVWQIAEYGDKSYMW 547 Query: 545 EFR 547 E+R Sbjct: 548 EYR 550 >gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: gp123 # Family: family:all:147 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717790;genbank:gi:113200627;genbank:GeneI D:4239178 Length = 549 Score = 986 bits (2549), Expect = 0.0, Method: Compositional matrix adjust. Identities = 451/546 (82%), Positives = 506/546 (92%) Query: 2 SDQYLGNPNLKKVNTSIDFKPEEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHFQ 61 ++QYLGNPNLKK N S +F P++VAEV+KC ++P+YFIKNYIKIVSLD+GL+PF+MY+FQ Sbjct: 4 TEQYLGNPNLKKANVSQEFTPDQVAEVIKCSENPVYFIKNYIKIVSLDKGLIPFDMYYFQ 63 Query: 62 EEMVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRL 121 EEMV KFHD+RFNIAKLPRQSGKSTIVT+YLLWYVLFNANVNVAILANKAATAREMLQRL Sbjct: 64 EEMVQKFHDNRFNIAKLPRQSGKSTIVTSYLLWYVLFNANVNVAILANKAATAREMLQRL 123 Query: 122 QLSYENLPNWMQQGILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPNH 181 QLSYENLP W+QQGILQWNRGSLELENGSKI+AASTSASAVRGMSFNVIFLDEFAF+PNH Sbjct: 124 QLSYENLPKWLQQGILQWNRGSLELENGSKILAASTSASAVRGMSFNVIFLDEFAFVPNH 183 Query: 182 IADQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERGTNEYVPTEVHWSEVPGR 241 +ADQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAER NEY+PTEVHWSEVPGR Sbjct: 184 VADQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERKANEYIPTEVHWSEVPGR 243 Query: 242 DDVWKEQTIKNTSESQFRVEFECEFLGSVDTLIAPSKLRIMPYHDPITSNRGLAVYEQVI 301 D WKEQTIKNTSE QFRVEFECEFLGSVDTLI+PSKLR M Y DPI GL++YE+ I Sbjct: 244 DAAWKEQTIKNTSEQQFRVEFECEFLGSVDTLISPSKLRTMVYGDPIAEKNGLSMYEKTI 303 Query: 302 PEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNIIVDIAKNYN 361 H Y+IT DVSRGV DYSAF VIDTTTIPYK+VA+Y+NN+IKPI+ PNIIVD+A+NYN Sbjct: 304 QGHTYVITADVSRGVSGDYSAFLVIDTTTIPYKLVAKYRNNDIKPILFPNIIVDVARNYN 363 Query: 362 NAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMSTAT 421 +A++L EVND+GGQVADIIQ+DLEY+NLLM AMRGRAGQQLGQGFSGKKTQ+G+KMS+AT Sbjct: 364 HAFVLVEVNDVGGQVADIIQYDLEYDNLLMCAMRGRAGQQLGQGFSGKKTQMGIKMSSAT 423 Query: 422 KQVGCSNLKALIEEDKLLINDYDTIAELTTFIAKGQTFQAEEGCNDDLAMCLVIFGWMAM 481 KQVGCSNLKAL+E+DK L+NDYD I+ELTTFI KGQTFQAEEGCNDDLAMC+VIF WMAM Sbjct: 424 KQVGCSNLKALLEDDKFLLNDYDCISELTTFIQKGQTFQAEEGCNDDLAMCMVIFAWMAM 483 Query: 482 QPYFKEMHDNDVRQRIYEQQKDMIEQDMAPFGFVTDGMEDDYFADAQGDVWKVAEYGDKS 541 QPYFKE+HDNDVRQRIY+ Q++ IEQDMAPFGF+ DG+ ++YFADAQGDVW AEYGDKS Sbjct: 484 QPYFKELHDNDVRQRIYDDQREAIEQDMAPFGFMDDGLGEEYFADAQGDVWMTAEYGDKS 543 Query: 542 YMWEFR 547 YMWE+R Sbjct: 544 YMWEYR 549 >gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T4-like DNA packaging large subunit terminase # Family: family:all:147 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214360;genbank:gi:61806000;genbank:GeneID :3294466 Length = 547 Score = 851 bits (2198), Expect = 0.0, Method: Compositional matrix adjust. Identities = 383/546 (70%), Positives = 469/546 (85%) Query: 1 MSDQYLGNPNLKKVNTSIDFKPEEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHF 60 M++ YLGNPNLKK NT I+F + + E LKC++DP+YF +NYIKIVSLDEGLVPFNMY F Sbjct: 1 MAEVYLGNPNLKKANTPIEFSKDNIREFLKCKEDPVYFTRNYIKIVSLDEGLVPFNMYDF 60 Query: 61 QEEMVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQR 120 QE+++++FH++RFNI K+PRQ+GKST +YLL Y +FN NVNVA+LANKA+TAR++L R Sbjct: 61 QEKLITRFHENRFNICKMPRQTGKSTTCISYLLHYAVFNDNVNVAVLANKASTARDLLGR 120 Query: 121 LQLSYENLPNWMQQGILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPN 180 LQL+YENLP WMQQGI+ WN+GSLELENGSKI A STS+SAVRG S+NVIFLDEFAFIPN Sbjct: 121 LQLAYENLPRWMQQGIISWNKGSLELENGSKISANSTSSSAVRGGSYNVIFLDEFAFIPN 180 Query: 181 HIADQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERGTNEYVPTEVHWSEVPG 240 HIAD FF+SVYPTI+SG+STKVII+STP GMN FY++WHD+E+G +EYV T+VHWSEVPG Sbjct: 181 HIADDFFASVYPTITSGQSTKVIIVSTPRGMNHFYRMWHDSEKGKSEYVATDVHWSEVPG 240 Query: 241 RDDVWKEQTIKNTSESQFRVEFECEFLGSVDTLIAPSKLRIMPYHDPITSNRGLAVYEQV 300 RD+ WKEQTI NTSE QF++EFECEFLGSV+TLI P+KLR + Y P T N GL +YE Sbjct: 241 RDEEWKEQTIANTSEQQFKIEFECEFLGSVNTLINPAKLRNLVYEAPKTRNAGLDIYETP 300 Query: 301 IPEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNIIVDIAKNY 360 + EHNYIITVDV+RG+GNDYSAF V DTT PYK+VA+Y+NNEIKP++ PNII+D+AK Y Sbjct: 301 VKEHNYIITVDVARGLGNDYSAFIVFDTTEFPYKVVAKYRNNEIKPMLFPNIILDVAKGY 360 Query: 361 NNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMSTA 420 NNAY+L EVNDIG QVA I+Q+DLEYEN+LMA+MRGRAGQ +GQGFSGKKTQLGV+M++A Sbjct: 361 NNAYLLIEVNDIGDQVASILQYDLEYENVLMASMRGRAGQIVGQGFSGKKTQLGVRMTSA 420 Query: 421 TKQVGCSNLKALIEEDKLLINDYDTIAELTTFIAKGQTFQAEEGCNDDLAMCLVIFGWMA 480 K++GCSNLK ++E+DKLL DY+ I+ELTTF + +F+AEEGCNDDLAMCLVIF W+ Sbjct: 421 VKKLGCSNLKTMMEDDKLLTCDYEIISELTTFAQRHNSFEAEEGCNDDLAMCLVIFSWLV 480 Query: 481 MQPYFKEMHDNDVRQRIYEQQKDMIEQDMAPFGFVTDGMEDDYFADAQGDVWKVAEYGDK 540 Q YFKEM DND+R+RIYE+QK+ IEQDMAPFGF+ DG++D F D GD W + EYGDK Sbjct: 481 AQDYFKEMSDNDIRKRIYEEQKNQIEQDMAPFGFIADGLDDTSFVDKDGDTWHLDEYGDK 540 Query: 541 SYMWEF 546 SYMW++ Sbjct: 541 SYMWDY 546 >gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: terminase large subunit gp17 # Family: family:all:147 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195134;genbank:gi:58532911;uniprot:Q5GQN8 ;genbank:GeneID:3260486 Length = 548 Score = 791 bits (2043), Expect = 0.0, Method: Compositional matrix adjust. Identities = 368/550 (66%), Positives = 449/550 (81%), Gaps = 9/550 (1%) Query: 2 SDQ-YLGNPNLKKVNTSIDFKPEEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHF 60 SDQ YLGNP LKK N IDF E+V E +KC +DP+YF KNY+KIVSLDEGLVPF M+ F Sbjct: 3 SDQIYLGNPLLKKANVKIDFTKEQVKEWIKCANDPVYFTKNYVKIVSLDEGLVPFKMWDF 62 Query: 61 QEEMVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQR 120 QEE++ KFH +RFNIAKLPRQ+GKST V +YLL Y++FN NVN+ ILANKA+TAR++L R Sbjct: 63 QEELIMKFHKNRFNIAKLPRQTGKSTTVVSYLLHYLIFNDNVNIGILANKASTARDLLAR 122 Query: 121 LQLSYENLPNWMQQGILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPN 180 L +YENLP W+QQG++ WN+G++ELENGSKI+AASTSASAVRGMSFN+IFLDEFAF+PN Sbjct: 123 LATAYENLPKWIQQGVVVWNKGNIELENGSKILAASTSASAVRGMSFNIIFLDEFAFVPN 182 Query: 181 HIADQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERGTNEYVPTEVHWSEVPG 240 HIAD FF+SVYPTI+SGKSTKVIIISTP GMN FYK+W DA G N Y EVHWS+VPG Sbjct: 183 HIADSFFASVYPTITSGKSTKVIIISTPQGMNHFYKMWVDATNGRNGYTFHEVHWSQVPG 242 Query: 241 RDDVWKEQTIKNTSESQFRVEFECEFLGSVDTLIAPSKLRIMPYHDPITSNRGLAVYEQV 300 RD+ WKE+TIKNTSE QF EFECEFLGSVDTLIA SKL+ + ++DPI N+GL +YE+ Sbjct: 243 RDEKWKEETIKNTSERQFTQEFECEFLGSVDTLIAASKLKALVFNDPIKRNKGLDIYEEP 302 Query: 301 IPEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNIIVDIAKNY 360 + Y++TVDVSRG+G DYSAF + D TT+PYK+V +Y+NNEIKP++ PNII D+A++Y Sbjct: 303 KEKSEYLMTVDVSRGIGGDYSAFIIFDITTVPYKVVGKYRNNEIKPMLFPNIINDLARSY 362 Query: 361 NNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMSTA 420 NNA++LCEVNDIG QVA I+ +DLEY N+LM AMRGRAGQ +GQGFSG KTQLGVKMS Sbjct: 363 NNAWVLCEVNDIGDQVASILNYDLEYPNVLMCAMRGRAGQLVGQGFSGSKTQLGVKMSIT 422 Query: 421 TKQVGCSNLKALIEEDKLLINDYDTIAELTTFIAKGQTFQAEEGCNDDLAMCLVIFGWMA 480 K+VGC+NLK ++EEDKL+ NDYD I ELTTFI K Q+F+A+EG +DDL MC+VIF W+ Sbjct: 423 VKKVGCANLKTIVEEDKLIFNDYDIINELTTFIQKKQSFEADEGFHDDLVMCMVIFAWLV 482 Query: 481 MQPYFKEMHDNDVRQRIYEQQKDMIEQDMAPFGFVTDGMEDDYFADAQGDVWKVAEYGDK 540 Q YFKEM DND+RQRIY++QK+ IEQDMAPFGF+T G+E + + G +W YGD Sbjct: 483 QQDYFKEMTDNDIRQRIYDEQKNQIEQDMAPFGFITTGLEGEEGFVSDGTIW----YGDT 538 Query: 541 ----SYMWEF 546 YMW + Sbjct: 539 QENVGYMWNY 548 >gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899601;genbank:gi:34419588;genbank:GeneID :2546011 Length = 600 Score = 370 bits (949), Expect = e-104, Method: Compositional matrix adjust. Identities = 206/533 (38%), Positives = 318/533 (59%), Gaps = 29/533 (5%) Query: 4 QYLGNPNLKKVNTSIDFKP---EEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHF 60 +Y+ PNL++ N I F E AE KC+DD +YF +NY IV +D G + + Sbjct: 69 RYMNIPNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRPY 128 Query: 61 QEEMVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQR 120 Q+EM+ RF+I LPRQ GK+TI+ +L Y++FN + ILA+K + + E+L+R Sbjct: 129 QKEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLER 188 Query: 121 LQLSYENLPNWMQQGILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPN 180 ++ ENLP+++Q GI +WN+G++ +NG K+ A ++ + AVRG SF++I++DE AF+P Sbjct: 189 VKNVIENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPG 248 Query: 181 HIADQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERGTNEYVPTEVHWSEVPG 240 D F+ + +P ISSG+ +KV++ STP+G+N ++ +W+ A +G + + P W V Sbjct: 249 F--DDFWKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRAVQN 306 Query: 241 R-------DD--VWKEQTIKNTSESQFRVEFECEFLGSVDTLIAPSKLRIMPYHDPITSN 291 R DD +K +TI NTS F E C FLG+ TLI KL M D + + Sbjct: 307 RLYKDGEFDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVVKDS 366 Query: 292 RGLAVYEQVIPEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPN 351 G VY++ H YI+TVD S G G DY A +ID T+ P++ VA + +N+ ++LP Sbjct: 367 DGWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPA 426 Query: 352 IIVDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKT 411 II+ A YN AY+ CE+ G V + + DLEYEN++M RA SG + Sbjct: 427 IIMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEE---RA--------SGGRR 475 Query: 412 QLGVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFIAKGQTFQAEEGCNDDLAM 471 LG+K + TK +GCS LK LIE+D+L IN T+ E TF+ KG++++AEEG +DDL M Sbjct: 476 GLGLKPNKKTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVM 535 Query: 472 CLVIFGWMAMQPYFKEM--HDNDVRQRIYEQQ-KDMIEQDMAPFGFVTDGMED 521 L + +++ Q F + + +V I++Q+ DM++ D+ PF + DG+E+ Sbjct: 536 SLTLLAYLSTQDRFSDFVEKEYNVSYDIFKQEVHDMMDDDV-PFLMIADGIEN 587 >gi|27701|lcl|protein:vir:6893 Length: 611 # NCBI annotation: gp17 terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861869;genbank:gi:32453660;genbank:GeneID :1494295 Length = 611 Score = 360 bits (924), Expect = e-101, Method: Compositional matrix adjust. Identities = 204/525 (38%), Positives = 296/525 (56%), Gaps = 31/525 (5%) Query: 4 QYLGNPNLKKVNTSIDFKPEEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHFQEE 63 +Y+G PNLK+ N + E VAE KC+DD +YF + Y I +D G + + +Q + Sbjct: 86 RYMGLPNLKRANIKTQWTYEMVAEWKKCRDDIVYFAETYCAITHIDYGTIKVQLRDYQRD 145 Query: 64 MVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQL 123 M+ R + L RQ GK+T+V +L +V FN + V ILA+K + + E+L R + Sbjct: 146 MLKIMSSKRMTVCNLSRQLGKTTVVAIFLAHFVCFNKDKAVGILAHKGSMSAEVLDRTKQ 205 Query: 124 SYENLPNWMQQGILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPNHIA 183 + E LP+++Q GI++WN+GS++L+NGS I A ++S AVRG SF +I++DE AFIPN I Sbjct: 206 AIELLPDFLQPGIVEWNKGSIQLDNGSSIGAYASSPDAVRGNSFAMIYIDECAFIPNFI- 264 Query: 184 DQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERGTNEYVPTEVHWSEVPGR-- 241 D + ++ P ISSG+ +K+II +TP+G+N FY +W A G + + P W+ V R Sbjct: 265 DSWL-AIQPVISSGRRSKIIITTTPNGLNHFYDIWTAAVEGKSGFEPYTAIWNSVKERLY 323 Query: 242 ------DDVWK--EQTIKNTSESQFRVEFECEFLGSVDTLIAPSKLRIMPYHDPITSNRG 293 DD W+ +QTI +S +QFR E F G+ TLI+ KL I+ Y + + G Sbjct: 324 NDEDIFDDGWQWSKQTISASSLTQFRQEHTAAFEGTSGTLISGMKLAILDYIEVTPDSHG 383 Query: 294 LAVYEQVIPEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNII 353 +++ H YI T+D S G G DY A +ID TT ++ V +N I ++LP+I+ Sbjct: 384 FHQFKKPEEGHKYIATLDCSEGRGQDYHAMHIIDVTTDKWEQVGVLHSNTISHLILPDIV 443 Query: 354 VDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQL 413 YN I E+N G VA + DLEYEN++ +M L Sbjct: 444 FKYLMEYNECPIYIELNSTGVSVAKSLYMDLEYENVICDSM----------------NDL 487 Query: 414 GVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFIAKGQTFQAEEGCNDDLAMCL 473 G+K S TK VGCS LK LIE+DKL IN TI E TF KG ++ AEEG +DDL M L Sbjct: 488 GMKQSRRTKPVGCSTLKDLIEKDKLKINHRATIQEFRTFSEKGVSWAAEEGYHDDLVMGL 547 Query: 474 VIFGWMAMQPYFKEMHDND---VRQRIYEQQKDMIEQDMAPFGFV 515 VIFGW++ Q F + D D + ++ ++ + D AP FV Sbjct: 548 VIFGWLSTQQKFADYADKDDMRLASEVFSRELQDMNDDYAPVIFV 592 >gi|28182|lcl|protein:vir:108053 Length: 611 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595293;genbank:gi:161622599;genbank:Ge neID:5783772 Length = 611 Score = 356 bits (913), Expect = e-100, Method: Compositional matrix adjust. Identities = 200/528 (37%), Positives = 298/528 (56%), Gaps = 31/528 (5%) Query: 4 QYLGNPNLKKVNTSIDFKPEEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHFQEE 63 +Y+G PNLK+ N + E V+E KC+DD +YF + Y I +D G + + +Q + Sbjct: 86 RYMGLPNLKRANIKTQWTREMVSEWKKCRDDIVYFAETYCAITHIDYGTIKVQLRDYQRD 145 Query: 64 MVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQL 123 M+ +R L RQ GK+T+V +L +V FN + V ILA+K + + E+L R + Sbjct: 146 MLKIMSKNRMTTCNLSRQLGKTTVVAIFLAHFVCFNKDKAVGILAHKGSMSAEVLDRTKQ 205 Query: 124 SYENLPNWMQQGILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPNHIA 183 + E LP+++Q GI++WN+GS+EL+NGS I A ++S AVRG SF +I++DE AFIPN + Sbjct: 206 AIELLPDFLQPGIVEWNKGSIELDNGSSIGAYASSPDAVRGNSFAMIYIDECAFIPNFL- 264 Query: 184 DQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERGTNEYVPTEVHWSEVPGR-- 241 D + ++ P ISSG+ +K+II +TP+G+N FY +W A G + + P W+ V R Sbjct: 265 DSWL-AIQPVISSGRRSKIIITTTPNGLNHFYDIWTAAVEGKSGFAPYTAIWNSVKERLY 323 Query: 242 ------DDVWK--EQTIKNTSESQFRVEFECEFLGSVDTLIAPSKLRIMPYHDPITSNRG 293 DD W+ QTI +S +QFR E EF G+ TLI+ KL IM + + I N Sbjct: 324 NDADIFDDGWEWSSQTISASSLAQFRQEHCAEFQGTSGTLISGMKLAIMDWKEVIPENGY 383 Query: 294 LAVYEQVIPEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNII 353 + + P H YI ++D S G G DY A +ID TT ++ VA +NEI ++LP+I+ Sbjct: 384 FYRFHEPDPTHKYIASLDCSEGRGQDYHALHIIDVTTDEWEQVAVLHSNEISHMILPDIV 443 Query: 354 VDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQL 413 YN A + E+N G VA + DLEYEN++ +M+ L Sbjct: 444 YKYLMEYNEAPVYIELNSTGVSVAKSLYMDLEYENVICDSMQ----------------DL 487 Query: 414 GVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFIAKGQTFQAEEGCNDDLAMCL 473 G+K + TK VGCS LK LIE+DKL +N TI E TF ++ AE+G +DDL M L Sbjct: 488 GMKQTRRTKPVGCSTLKDLIEKDKLKLNHKQTIMEFRTFSQNKLSWAAEDGFHDDLVMSL 547 Query: 474 VIFGWMAMQPYFKEMHDND---VRQRIYEQQKDMIEQDMAPFGFVTDG 518 VIF W+ Q F + D D + ++ ++ + + ++ P FV G Sbjct: 548 VIFAWLTTQQKFADFIDRDEMRLASEVFSRELEDMNEEYNPVVFVDAG 595 >gi|27407|lcl|protein:vir:7202 Length: 610 # NCBI annotation: gp17 terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049776;genbank:gi:9632591;genbank:GeneID: 1258647 Length = 610 Score = 353 bits (907), Expect = 2e-99, Method: Compositional matrix adjust. Identities = 199/539 (36%), Positives = 299/539 (55%), Gaps = 31/539 (5%) Query: 4 QYLGNPNLKKVNTSIDFKPEEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHFQEE 63 +Y+G PNLK+ N + E V E KC+DD +YF + Y I +D G++ + +Q + Sbjct: 86 RYMGLPNLKRANIKTQWTREMVEEWKKCRDDIVYFAETYCAITHIDYGVIKVQLRDYQRD 145 Query: 64 MVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQL 123 M+ R + L RQ GK+T+V +L +V FN + V ILA+K + + E+L R + Sbjct: 146 MLKIMSSKRMTVCNLSRQLGKTTVVAIFLAHFVCFNKDKAVGILAHKGSMSAEVLDRTKQ 205 Query: 124 SYENLPNWMQQGILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPNHIA 183 + E LP+++Q GI++WN+GS+EL+NGS I A ++S AVRG SF +I++DE AFIPN Sbjct: 206 AIELLPDFLQPGIVEWNKGSIELDNGSSIGAYASSPDAVRGNSFAMIYIDECAFIPN-FH 264 Query: 184 DQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERGTNEYVPTEVHWSEVPGR-- 241 D + ++ P ISSG+ +K+II +TP+G+N FY +W A G + + P W+ V R Sbjct: 265 DSWL-AIQPVISSGRRSKIIITTTPNGLNHFYDIWTAAVEGKSGFEPYTAIWNSVKERLY 323 Query: 242 ------DDVWKE--QTIKNTSESQFRVEFECEFLGSVDTLIAPSKLRIMPYHDPITSNRG 293 DD W+ QTI +S +QFR E F G+ TLI+ KL +M + + + G Sbjct: 324 NDEDIFDDGWQWSIQTINGSSLAQFRQEHTAAFEGTSGTLISGMKLAVMDFIEVTPDDHG 383 Query: 294 LAVYEQVIPEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNII 353 +++ P+ YI T+D S G G DY A +ID T ++ V +N I ++LP+I+ Sbjct: 384 FHQFKKPEPDRKYIATLDCSEGRGQDYHALHIIDVTDDVWEQVGVLHSNTISHLILPDIV 443 Query: 354 VDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQL 413 + YN + E+N G VA + DLEYE ++ + T L Sbjct: 444 MRYLVEYNECPVYIELNSTGVSVAKSLYMDLEYEGVICDSY----------------TDL 487 Query: 414 GVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFIAKGQTFQAEEGCNDDLAMCL 473 G+K + TK VGCS LK LIE+DKL+I+ TI E TF KG ++ AEEG +DDL M L Sbjct: 488 GMKQTKRTKAVGCSTLKDLIEKDKLIIHHRATIQEFRTFSEKGVSWAAEEGYHDDLVMSL 547 Query: 474 VIFGWMAMQPYFKEMHDND---VRQRIYEQQKDMIEQDMAPFGFVTDGMEDDYFADAQG 529 VIFGW++ Q F + D D + ++ ++ + D AP FV +Y + G Sbjct: 548 VIFGWLSTQSKFIDYADKDDMRLASEVFSKELQDMSDDYAPVIFVDSVHSAEYVPVSHG 606 >gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891724;genbank:gi:33620526;genbank:GeneID :1725332 Length = 607 Score = 353 bits (906), Expect = 3e-99, Method: Compositional matrix adjust. Identities = 198/521 (38%), Positives = 288/521 (55%), Gaps = 32/521 (6%) Query: 4 QYLGNPNLKKVNTSIDFKPEEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHFQEE 63 +Y+ NL++ N + PE +AE +C+ D +YF + Y I +D G + + +Q++ Sbjct: 84 RYMRLKNLRRANIKTQYTPEMIAEWKRCRKDIVYFAETYCAITHIDYGTIKVQLRDYQKD 143 Query: 64 MVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQL 123 M+ H++R + KL RQ GK+T V +L YV FN + V ILA+K + A E+L+R + Sbjct: 144 MLKIMHENRMSAHKLSRQLGKTTAVAIFLAHYVCFNKDKAVGILAHKGSMAVEVLERTKQ 203 Query: 124 SYENLPNWMQQGILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPNHIA 183 + E LP+++Q GI++WN+ S+ LENGS I A ++S AVRG SF+ I++DE AFI N Sbjct: 204 AIELLPDFLQPGIVEWNKKSIVLENGSSIGAYASSPDAVRGNSFSFIYIDECAFIQNWT- 262 Query: 184 DQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERGTNEYVPTEVHWSEVPGR-- 241 F ++ P ISSG+ +K+I+ +TP+G+N FY +W A G + YVP E W V R Sbjct: 263 -DCFLAIQPVISSGRESKMIMTTTPNGLNHFYDIWQSAIDGKSGYVPYEAVWHSVKERLY 321 Query: 242 ------DD--VWKEQTIKNTSESQFRVEFECEFLGSVDTLIAPSKLRIMPYHDPITSNRG 293 DD W Q I +S QF E EF GS TLI + L + + D + N G Sbjct: 322 NKADIFDDGYEWSSQAIAGSSLEQFLQEHNAEFFGSSGTLIRATTLSRLSFIDVVNDN-G 380 Query: 294 LAVYEQVIPEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNII 353 +E+ Y+ T+D S G G DY A +ID T PYK VA Y +N +LP+I+ Sbjct: 381 FYQFEKPKEGRKYVATLDCSEGRGQDYHALQIIDITEFPYKQVAVYHSNTTSHFILPDIV 440 Query: 354 VDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQL 413 YN + E+N G +A + DLEY+N++ + L Sbjct: 441 FKYLMMYNECPVYIELNSTGVSIAKSLAMDLEYDNIICDSF----------------IDL 484 Query: 414 GVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFIAKGQTFQAEEGCNDDLAMCL 473 G+K S +K +GCS LK LIE+DKL+IN TI EL TF KG ++ AEEG +DDL M L Sbjct: 485 GMKQSKRSKAMGCSALKDLIEKDKLIINHKGTIQELRTFSEKGVSWAAEEGFHDDLVMSL 544 Query: 474 VIFGWMAMQPYFKEMHDND---VRQRIYEQQKDMIEQDMAP 511 VIFGW+ Q F E D + I+ ++ D + ++ AP Sbjct: 545 VIFGWLTTQEKFAEYAGKDEMRIASEIFRKELDELGEEYAP 585 >gi|22056|lcl|protein:vir:103455 Length: 610 # NCBI annotation: terminase subunit nuclease and ATPase # Family: family:all:147 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803107;genbank:gi:116326387;genbank:GeneI D:4405484 Length = 610 Score = 353 bits (906), Expect = 3e-99, Method: Compositional matrix adjust. Identities = 199/539 (36%), Positives = 299/539 (55%), Gaps = 31/539 (5%) Query: 4 QYLGNPNLKKVNTSIDFKPEEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHFQEE 63 +Y+G PNLK+ N + E V E KC+DD +YF + Y I +D G++ + +Q + Sbjct: 86 RYMGLPNLKRANIKTQWTREMVEEWKKCRDDIVYFAETYCAITHIDYGVIKVQLRDYQRD 145 Query: 64 MVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQL 123 M+ R + L RQ GK+T+V +L +V FN + V ILA+K + + E+L R + Sbjct: 146 MLKIMSSKRMTVCNLSRQLGKTTVVAIFLAHFVCFNKDKAVGILAHKGSMSAEVLDRTKQ 205 Query: 124 SYENLPNWMQQGILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPNHIA 183 + E LP+++Q GI++WN+GS+EL+NGS I A ++S AVRG SF +I++DE AFIPN Sbjct: 206 AIELLPDFLQPGIVEWNKGSIELDNGSSIGAYASSPDAVRGNSFAMIYIDECAFIPN-FH 264 Query: 184 DQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERGTNEYVPTEVHWSEVPGR-- 241 D + ++ P ISSG+ +K+II +TP+G+N FY +W A G + + P W+ V R Sbjct: 265 DSWL-AIQPVISSGRRSKIIITTTPNGLNHFYDIWTAAVEGKSGFEPYTAIWNSVKERLY 323 Query: 242 ------DDVWKE--QTIKNTSESQFRVEFECEFLGSVDTLIAPSKLRIMPYHDPITSNRG 293 DD W+ QTI ++ +QFR E F G+ TLI+ KL IM + + + G Sbjct: 324 NDEDIFDDGWQWSIQTINGSTLAQFRQEHTAAFEGTSGTLISGMKLAIMDFIEVTPDDHG 383 Query: 294 LAVYEQVIPEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNII 353 +++ P+ YI T+D S G G DY A +ID T ++ V +N I ++LP+I+ Sbjct: 384 FHRFKKPEPDRKYIATLDCSEGRGQDYHALHIIDVTDDVWEQVGVLHSNTISHLILPDIV 443 Query: 354 VDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQL 413 + YN + E+N G VA + DLEYE ++ + T L Sbjct: 444 MRYLVEYNECPVYIELNSTGVSVAKSLYMDLEYEGVICDSY----------------TDL 487 Query: 414 GVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFIAKGQTFQAEEGCNDDLAMCL 473 G+K + TK VGCS LK LIE+DKL+I+ TI E TF KG ++ AEEG +DDL M L Sbjct: 488 GMKQTKRTKAVGCSTLKDLIEKDKLIIHHRATIQEFRTFSEKGVSWAAEEGYHDDLVMSL 547 Query: 474 VIFGWMAMQPYFKEMHDND---VRQRIYEQQKDMIEQDMAPFGFVTDGMEDDYFADAQG 529 VIFGW++ Q F + D D + ++ ++ + D AP FV +Y + G Sbjct: 548 VIFGWLSTQSKFIDYADKDDMRLASEVFSKELQDMSDDYAPVIFVDSVHSAEYVPVSHG 606 >gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469498;genbank:gi:157311455;genbank:Ge neID:5602125 Length = 607 Score = 352 bits (904), Expect = 5e-99, Method: Compositional matrix adjust. Identities = 198/521 (38%), Positives = 288/521 (55%), Gaps = 32/521 (6%) Query: 4 QYLGNPNLKKVNTSIDFKPEEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHFQEE 63 +Y+ NL++ N + PE +AE +C+ D +YF + Y I +D G + + +Q++ Sbjct: 84 RYMRLKNLRRANIKTQYTPEMIAEWKRCRKDIVYFAETYCAITHIDYGTIKVQLRDYQKD 143 Query: 64 MVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQL 123 M+ H++R + KL RQ GK+T V +L YV FN + V ILA+K + A E+L+R + Sbjct: 144 MLKIMHENRMSAHKLSRQLGKTTAVAIFLAHYVCFNKDKAVGILAHKGSMAVEVLERTKQ 203 Query: 124 SYENLPNWMQQGILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPNHIA 183 + E LP+++Q GI++WN+ S+ LENGS I A ++S AVRG SF+ I++DE AFI N Sbjct: 204 AIELLPDFLQPGIVEWNKKSIVLENGSSIGAYASSPDAVRGNSFSFIYIDECAFIQNWT- 262 Query: 184 DQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERGTNEYVPTEVHWSEVPGR-- 241 F ++ P ISSG+ +K+I+ +TP+G+N FY +W A G + YVP E W V R Sbjct: 263 -DCFLAIQPVISSGRESKMIMTTTPNGLNHFYDIWQSAIDGKSGYVPYEAVWHSVKERLY 321 Query: 242 ------DD--VWKEQTIKNTSESQFRVEFECEFLGSVDTLIAPSKLRIMPYHDPITSNRG 293 DD W Q I +S QF E EF GS TLI + L + + D + N G Sbjct: 322 NKADIFDDGYEWSSQAIAGSSLEQFLQEHNAEFFGSSGTLIRATTLSRLSFIDVVNDN-G 380 Query: 294 LAVYEQVIPEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNII 353 +E+ Y+ T+D S G G DY A +ID T PYK VA Y +N +LP+I+ Sbjct: 381 FYQFEKPKEGRKYVATLDCSEGRGQDYHALQIIDITEFPYKPVAVYHSNTTSHFILPDIV 440 Query: 354 VDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQL 413 YN + E+N G +A + DLEY+N++ + L Sbjct: 441 FKYLMMYNECPVYIELNSTGVSIAKSLAMDLEYDNIICDSF----------------IDL 484 Query: 414 GVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFIAKGQTFQAEEGCNDDLAMCL 473 G+K S +K +GCS LK LIE+DKL+IN TI EL TF KG ++ AEEG +DDL M L Sbjct: 485 GMKQSKRSKAMGCSALKDLIEKDKLIINHKGTIQELRTFSEKGVSWAAEEGFHDDLVMSL 544 Query: 474 VIFGWMAMQPYFKEMHDND---VRQRIYEQQKDMIEQDMAP 511 VIFGW+ Q F E D + I+ ++ D + ++ AP Sbjct: 545 VIFGWLTTQEKFAEYAGKDEMRIASEIFRKELDELGEEYAP 585 >gi|24596|lcl|protein:vir:98262 Length: 609 # NCBI annotation: gp17 terminase subunit, nuclease and ATPase # Family: family:all:147 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239195;genbank:gi:66391670;genbank:GeneID :3416364 Length = 609 Score = 348 bits (893), Expect = 9e-98, Method: Compositional matrix adjust. Identities = 193/532 (36%), Positives = 300/532 (56%), Gaps = 33/532 (6%) Query: 4 QYLGNPNLKKVNTSIDFKPEEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHFQEE 63 +Y GNPNLK+ + E + E +KC+DD +YF + Y I +D G + + +Q+E Sbjct: 84 RYNGNPNLKRAYVQTKWTKEMLMEWVKCRDDIVYFAETYCAITHIDYGTIKVQLRDYQKE 143 Query: 64 MVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQL 123 M+ + H +R L RQ GK+T+V +L +V FN + V +LA+KA+ + E+L R + Sbjct: 144 MLIEMHKNRMVTCNLSRQLGKTTVVAIFLAHFVCFNEDKYVGVLAHKASMSAEVLDRTKQ 203 Query: 124 SYENLPNWMQQGILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPNHIA 183 + E LP+++Q GI++WN+GS+EL+N KI A ++S AVRG SF +I++DE AFIPN Sbjct: 204 AIELLPDFLQPGIVEWNKGSIELDNKCKIGAFASSPDAVRGNSFAMIYIDECAFIPN-FT 262 Query: 184 DQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERGTNEYVPTEVHWSEVPGR-- 241 D + ++ P ISSG+ +K++I +TP+G+N FY +W+ A G + +VP W+ V R Sbjct: 263 DAWL-AIQPVISSGRKSKILITTTPNGLNHFYDIWNAAVEGKSGFVPYTAIWTSVKERLY 321 Query: 242 --------DD--VWKEQTIKNTSESQFRVEFECEFLGSVDTLIAPSKLRIMPYHDPITSN 291 DD W + I +S+ F E EF+G+ TLI+ KL M + D + Sbjct: 322 TDGDNGVFDDGYSWSAKMIAGSSKEAFLQEHCAEFMGTNGTLISGWKLSKMSWIDIDETE 381 Query: 292 RGLAVYEQVIPEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPN 351 Y++ H Y+ +D + G G DY A +ID TT+P++ VA Y +N ++LP+ Sbjct: 382 TNFYQYKKPEEGHKYVAVLDPAEGRGQDYHAMHIIDITTLPFEQVAVYHSNRTSHLILPD 441 Query: 352 IIVDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKT 411 I++ YN A+I E+N G VA + +LEYEN++ + Sbjct: 442 ILLRYLTMYNEAWIYIELNSTGHSVAKSLFSELEYENVICDSY----------------N 485 Query: 412 QLGVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFIAKGQTFQAEEGCNDDLAM 471 LG+K + +K +GCS LK LIE+DKL+IN+ TI E TF KG ++ AEEG +DDL M Sbjct: 486 DLGMKQTKRSKAIGCSTLKDLIEKDKLIINNKKTILEFRTFSEKGVSWAAEEGFHDDLVM 545 Query: 472 CLVIFGWMAMQPYFKEMHDND---VRQRIYEQQKDMIEQDMAPFGFVTDGME 520 L FGW+ Q F E + D + ++ ++++ + +D VT G E Sbjct: 546 SLACFGWLTTQLKFAEFCEKDDLRLANEVFAREREQLYEDALCPVIVTSGDE 597 >gi|20804|lcl|protein:vir:106290 Length: 633 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944105;genbank:gi:38640149;genbank:GeneID :2658038 Length = 633 Score = 347 bits (889), Expect = 2e-97, Method: Compositional matrix adjust. Identities = 187/528 (35%), Positives = 303/528 (57%), Gaps = 31/528 (5%) Query: 4 QYLGNPNLKKVNTSIDFKPEEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHFQEE 63 +YLG PNLK+ N + E V E +C+DD +YF + Y I+ +D G++ + +Q++ Sbjct: 110 RYLGLPNLKRANVPTKWTREMVEEWKRCRDDIVYFAETYCSIIHIDWGVIKVQLRDYQKD 169 Query: 64 MVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQL 123 M+ R ++ LPRQ GK+T +L +V+FN V +LA+K ++E+L+R + Sbjct: 170 MLRIMASERMSMHNLPRQLGKTTATAIFLTHFVVFNEAKAVGVLAHKGDMSKEVLERTKQ 229 Query: 124 SYENLPNWMQQGILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPNHIA 183 S E LP+++Q GI++WN+G++ELENG I A ++S AVRG SF +I++DE AFI Sbjct: 230 SIELLPDFLQPGIVEWNKGNIELENGCSIGAYASSPDAVRGNSFALIYVDECAFIEGF-- 287 Query: 184 DQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERGTNEYVPTEVHWSEVPGR-- 241 + + ++ P ISSG+ +++I+ STP+G+N +Y LW + + + P W V R Sbjct: 288 EDTWKAILPVISSGRQSRIILTSTPNGINHWYDLWEVSLKSDKGFKPYTTTWITVKERLY 347 Query: 242 ------DD--VWKEQTIKNTSESQFRVEFECEFLGSVDTLIAPSKLRIMPYHDPITSNRG 293 DD W + I ++S F+ E C F+G+ TLI KL M + + I + Sbjct: 348 DGSDAYDDGFEWASKQINSSSVEAFQQEHLCRFMGTSGTLINGFKLSKMTWKEVIADDNF 407 Query: 294 LAVYEQVIPEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNII 353 + E+ + + YI TVD + G G DYS +ID T+ PY+ VA Y +N+I P++LP++I Sbjct: 408 YQI-EKPVEGNKYIATVDPAEGRGQDYSTIQIIDVTSYPYRQVAVYHSNKISPLLLPSVI 466 Query: 354 VDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQL 413 + A YNNA++ E+N IG VA + DLEYEN+++ + + L Sbjct: 467 MRYAMEYNNAWVYIELNSIGNMVAKSLFIDLEYENVIVDSSK----------------DL 510 Query: 414 GVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFIAKGQTFQAEEGCNDDLAMCL 473 G+K + TK VGCS LK LIE+DKL+++ TI E TF+ KG ++ A++G +DDL M L Sbjct: 511 GMKQTKVTKAVGCSTLKDLIEKDKLIVSHKGTIQEFRTFVEKGVSWAAQDGFHDDLVMSL 570 Query: 474 VIFGWMAMQPYFKEMHD--NDVRQRIYEQQKDMIEQDMAPFGFVTDGM 519 IF ++ Q F + D ++ +++ + + + +D + DG+ Sbjct: 571 CIFAYLTTQERFGDFIDATRNIGADVFQSEMEEMLEDFCVGAIIDDGI 618 >gi|22942|lcl|protein:vir:101803 Length: 613 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238880;genbank:gi:66391955;genbank:GeneID :3416630 Length = 613 Score = 333 bits (855), Expect = 2e-93, Method: Compositional matrix adjust. Identities = 193/553 (34%), Positives = 302/553 (54%), Gaps = 47/553 (8%) Query: 4 QYLGNPNLKKVNTSIDFKPEEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHFQEE 63 +YLG PNLK+ N I + E +AE +C++D +YF +NY I +D G++ + +Q++ Sbjct: 77 RYLGLPNLKRANIKIKWTKEMLAERKRCKEDIVYFAENYCCIEHIDYGIIRVQLRDYQKD 136 Query: 64 MVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQL 123 M+ +R A L RQ GK+T+V +L +V FN+ NV ILA+KA+ + E+L R + Sbjct: 137 MLRIMAGNRLMAANLSRQLGKTTVVAIFLAHFVCFNSAKNVGILAHKASMSAEVLHRTKQ 196 Query: 124 SYENLPNWMQQGILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPNHIA 183 + E LP+++Q GI++WN+GS+ L NG I A S+S AVRG SF +I++DE AFIPN Sbjct: 197 ALELLPDFLQPGIVEWNKGSITLGNGCAIGAFSSSPDAVRGNSFALIYVDEVAFIPN-FT 255 Query: 184 DQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDA-------ERGTNEYVPTEVHWS 236 D + ++ P ISSG+ +K+++ +TP+G+N +Y +W A + + +VP WS Sbjct: 256 DAWM-AIQPVISSGRRSKILMTTTPNGLNHWYDIWTAAITPNSSGDGSKSGFVPYTATWS 314 Query: 237 EVPGR---------------DD--VWKEQTIKNTSESQFRVEFECEFLGSVDTLIAPSKL 279 V R DD W +TI ++ F+ E F G+ TLI +KL Sbjct: 315 SVKERLYSDGKELSGSDSYFDDGYSWSSKTIAGSALDAFQQEHNTAFQGTSGTLINGTKL 374 Query: 280 RIMPYHDPITSNRGLAVYEQVIPEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARY 339 + + D I ++E+ YI T+D + G G DY A + D T PYK VA Y Sbjct: 375 SKLNWID-IPPQDNFTMFEEPKEGRKYIATLDSAEGRGQDYHAMHIFDITEFPYKQVAVY 433 Query: 340 KNNEIKPIVLPNIIVDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAG 399 +N ++LP++++ Y YI E+N G +A + +L+YEN++ + + Sbjct: 434 HSNTTSHLILPDVLLKYLNMYFQPYIYIELNSTGVSIAKSLYSELDYENVICDSYQ---- 489 Query: 400 QQLGQGFSGKKTQLGVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFIAKGQTF 459 LG+K + +K +GCS LK LIE+DKL++N +I EL TF KG ++ Sbjct: 490 ------------DLGLKQTKRSKAIGCSTLKDLIEKDKLILNHKKSIMELRTFSEKGVSW 537 Query: 460 QAEEGCNDDLAMCLVIFGWMAMQPYFKEMHDND---VRQRIYEQQKDMIEQDMAPFGFVT 516 AEEG +DDL M LVIF W+ Q F + +ND + ++ ++ + + D P V Sbjct: 538 AAEEGFHDDLVMSLVIFAWLTTQERFSDFTENDDMRLANEVFRKEMEELNDDYMPMVIVD 597 Query: 517 DGMEDDYFADAQG 529 DG ED + +G Sbjct: 598 DG-EDTFEVTHKG 609 >gi|23191|lcl|protein:vir:101186 Length: 613 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932508;genbank:gi:37651634;genbank:GeneID :2610679 Length = 613 Score = 333 bits (855), Expect = 2e-93, Method: Compositional matrix adjust. Identities = 193/553 (34%), Positives = 302/553 (54%), Gaps = 47/553 (8%) Query: 4 QYLGNPNLKKVNTSIDFKPEEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHFQEE 63 +YLG PNLK+ N I + E +AE +C++D +YF +NY I +D G++ + +Q++ Sbjct: 77 RYLGLPNLKRANIKIKWTKEMLAERKRCKEDIVYFAENYCCIEHIDYGIIRVQLRDYQKD 136 Query: 64 MVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQL 123 M+ +R A L RQ GK+T+V +L +V FN+ NV ILA+KA+ + E+L R + Sbjct: 137 MLRIMAGNRLMAANLSRQLGKTTVVAIFLAHFVCFNSAKNVGILAHKASMSAEVLHRTKQ 196 Query: 124 SYENLPNWMQQGILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPNHIA 183 + E LP+++Q GI++WN+GS+ L NG I A S+S AVRG SF +I++DE AFIPN Sbjct: 197 ALELLPDFLQPGIVEWNKGSITLGNGCAIGAFSSSPDAVRGNSFALIYVDEVAFIPN-FT 255 Query: 184 DQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDA-------ERGTNEYVPTEVHWS 236 D + ++ P ISSG+ +K+++ +TP+G+N +Y +W A + + +VP WS Sbjct: 256 DAWM-AIQPVISSGRRSKILMTTTPNGLNHWYDIWTAAITPNSSGDGSKSGFVPYTATWS 314 Query: 237 EVPGR---------------DD--VWKEQTIKNTSESQFRVEFECEFLGSVDTLIAPSKL 279 V R DD W +TI ++ F+ E F G+ TLI +KL Sbjct: 315 SVKERLYSDGKELSGSDSYFDDGYSWSSKTIAGSALDAFQQEHNTAFQGTSGTLINGTKL 374 Query: 280 RIMPYHDPITSNRGLAVYEQVIPEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARY 339 + + D I ++E+ YI T+D + G G DY A + D T PYK VA Y Sbjct: 375 SKLNWID-IPPQDNFTMFEEPKEGRKYIATLDSAEGRGQDYHAMHIFDITEFPYKQVAVY 433 Query: 340 KNNEIKPIVLPNIIVDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAG 399 +N ++LP++++ Y YI E+N G +A + +L+YEN++ + + Sbjct: 434 HSNTTSHLILPDVLLKYLNMYFQPYIYIELNSTGVSIAKSLYSELDYENVICDSYQ---- 489 Query: 400 QQLGQGFSGKKTQLGVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFIAKGQTF 459 LG+K + +K +GCS LK LIE+DKL++N +I EL TF KG ++ Sbjct: 490 ------------DLGLKQTKRSKAIGCSTLKDLIEKDKLILNHKKSIMELRTFSEKGVSW 537 Query: 460 QAEEGCNDDLAMCLVIFGWMAMQPYFKEMHDND---VRQRIYEQQKDMIEQDMAPFGFVT 516 AEEG +DDL M LVIF W+ Q F + +ND + ++ ++ + + D P V Sbjct: 538 AAEEGFHDDLVMSLVIFAWLTTQERFSDFTENDDMRLANEVFRKEMEELNDDYMPMVIVD 597 Query: 517 DGMEDDYFADAQG 529 DG ED + +G Sbjct: 598 DG-EDTFEVTHKG 609 >gi|21077|lcl|protein:vir:100538 Length: 612 # NCBI annotation: gp17 terminase subunit # Family: family:all:147 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656379;genbank:gi:109290130;genbank:GeneI D:4156516 Length = 612 Score = 319 bits (817), Expect = 6e-89, Method: Compositional matrix adjust. Identities = 184/544 (33%), Positives = 293/544 (53%), Gaps = 46/544 (8%) Query: 4 QYLGNPNLKKVNTSIDFKPEEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHFQEE 63 +Y G PNLK+ N I + E +AE +C++D +YF +NY I +D G++ + +Q++ Sbjct: 76 RYNGLPNLKRANIKIKWTKEMLAERKRCKEDIVYFAENYCCIEHIDYGIIRVQLRDYQKD 135 Query: 64 MVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQL 123 M+ +R A L RQ GK+T+V +L +V FN+ NV ILA+KA+ + E+L R + Sbjct: 136 MLRIMAGNRLMAANLSRQLGKTTVVAIFLAHFVCFNSAKNVGILAHKASMSAEVLHRTKQ 195 Query: 124 SYENLPNWMQQGILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPNHIA 183 + E LP+++Q GI++WN+GS+ L NG I A S+S AVRG SF +I++DE AFIPN Sbjct: 196 ALELLPDFLQPGIVEWNKGSITLGNGCAIGAFSSSPDAVRGNSFALIYIDEVAFIPNF-- 253 Query: 184 DQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLW-------HDAERGTNEYVPTEVHWS 236 + + ++ P ISSG+ +K+++ +TP+G+N +Y +W D + +VP WS Sbjct: 254 NDAWLAIQPVISSGRHSKILMTTTPNGLNHWYDIWTAAITPNSDGSGSKSGFVPYTATWS 313 Query: 237 EVPGR-----------------DDVWKEQTIKNTSESQFRVEFECEFLGSVDTLIAPSKL 279 V R D + + + + F+ E F G+ TLI KL Sbjct: 314 SVKERMYSDGSKTDGAIHILTTDILGQPRQSPVLALRAFQQEHNTAFQGTSGTLINGFKL 373 Query: 280 RIMPYHDPITSNRGLAVYEQVIPEHNYIITVDVSRGVGNDYSAFCVIDTTTIPYKMVARY 339 M + + + ++ ++++ I H YI T+D + G G DY A + D T PY+ VA Y Sbjct: 374 SKMTWKE-VPASDNFTMFKEPIEGHKYIATLDSAEGRGQDYHAMHIYDITEFPYEQVAVY 432 Query: 340 KNNEIKPIVLPNIIVDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAG 399 +N ++LP++++ Y YI E+N G +A + +LEYEN++ + Sbjct: 433 HSNTTSHLILPDVLLKYLNMYYQPYIYIELNATGVSIAKSLYSELEYENIICDSY----- 487 Query: 400 QQLGQGFSGKKTQLGVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFIAKGQTF 459 LG+K + +K +GCS LK LIE++KL++ TI EL TF KG ++ Sbjct: 488 -----------NDLGMKQTKRSKAIGCSTLKDLIEKEKLVLYHKGTIMELRTFSEKGVSW 536 Query: 460 QAEEGCNDDLAMCLVIFGWMAMQPYFKEMHDND---VRQRIYEQQKDMIEQDMAPFGFVT 516 AE+G +DDL M LVIF W+ QP F + + D + I+ Q+ + + D P V Sbjct: 537 AAEDGFHDDLVMSLVIFAWLTTQPRFSDFTERDDMRLANEIFRQEMENLYDDYTPVVIVD 596 Query: 517 DGME 520 G E Sbjct: 597 SGEE 600 >gi|16897|lcl|protein:vir:5867 Length: 508 # NCBI annotation: similar to terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835653;genbank:gi:30044056 Length = 508 Score = 185 bits (469), Expect = 1e-48, Method: Compositional matrix adjust. Identities = 137/463 (29%), Positives = 229/463 (49%), Gaps = 43/463 (9%) Query: 27 EVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHFQEEMVSKFHDHRFNIAKLPRQSGKST 86 E+ KC++DPIYFI+ Y+KI + ++PF++Y QE++++ +H HR+ I + PRQ G + Sbjct: 10 ELEKCKNDPIYFIRKYVKIQHPIKRVIPFDLYPIQEKLINFYHTHRYVITEKPRQMGVTW 69 Query: 87 IVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQLSYENLPNWMQQGILQWNRGSLEL 146 AY L ++FN+N V I ANK ATA+ +L+R++ +YE LP ++Q WN+ +E Sbjct: 70 CAVAYALHQMIFNSNYKVLIAANKEATAKNVLERIKFAYEQLPRFLQIKKRTWNKTYIEF 129 Query: 147 ENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPNHIADQFFSSVYPTISSGKSTKVIIIS 206 N S A S+ + + R S ++ ++E AFI N ++ ++SV T+++G K I+ S Sbjct: 130 SNYSSARAVSSKSDSGRSESITLLIVEEAAFISN--MEELWASVQQTLATG--GKCIVNS 185 Query: 207 TPHGMNMFY-KLWHDAERGTNEYVPTEVHWSEVPGRDDVWKEQTIKNTSESQFRVEFECE 265 T +G+ +Y + A+ G +E+ + WS+ P RD+ W E+ + F E C Sbjct: 186 TYNGVGNWYERTIRAAKEGKSEFKYFGIKWSDHPERDEKWFEEQKRLLPPRVFAQEILCI 245 Query: 266 FLGSVDTLIAPSKLRIMPYHDPITSNRGLAVYEQVIPEHNYIITVDVSRGVGNDYSAFCV 325 GS + +I +R + DP G +E Y I+VD + G G D SA V Sbjct: 246 PQGSGENVIPFHLIREEEFIDPFVVKYGGDYWEWYRKPGYYFISVDPASGRGEDRSAVGV 305 Query: 326 ----IDTTTIPYKMVARYKNNEIKPIVLPNIIVDIAKNYNNAYILCEVNDIGGQVADIIQ 381 +D T+ + VA + +++ V+ +I I + I E N IG + Q Sbjct: 306 QVLWVDPQTLTIEQVAEFASDKTSLPVMRQVIKQIYDEFKPQLIFIETNGIG---MGLYQ 362 Query: 382 FDLEYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMSTATKQVGCSNLKALIEED-KLLI 440 F Y ++ +T K+V S+L A + ED +L++ Sbjct: 363 FMEAYTPSIVGY-----------------------YTTQRKKVHGSDLLAKLYEDGRLIL 399 Query: 441 NDYDTIAEL--TTFIAKGQTFQAEEGCNDDLAMCLVIFGWMAM 481 + +L TT++ + E +DL M L I G MA+ Sbjct: 400 RSKRLLEQLQRTTWVKN----KVETAGRNDLYMAL-INGLMAI 437 >gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: phage terminase large subunit # Family: family:all:147 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468054;genbank:gi:157265496;genbank:Ge neID:5600564 Length = 485 Score = 73.9 bits (180), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 86/337 (25%), Positives = 145/337 (43%), Gaps = 61/337 (18%) Query: 162 VRGMSFNVIFLDEFAFIPNHIADQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDA 221 +RG + + + LDE A IP + + ++ PT+S + +IISTP G+N FY+ + Sbjct: 138 LRGATLDFVILDEAAMIPFSVWSE---AIEPTLSV-RDGWALIISTPKGLNWFYEFFLMG 193 Query: 222 ER-GTNEYVPTE--------------VHWSEVPGRDDVWKEQTIKNTSESQFRVEFECEF 266 R G E +P W P R + + E+ + + +FR E+ EF Sbjct: 194 WRGGLKEGIPNSGINQTHPDFESFHAASWDVWPERREWYMERRL-YIPDLEFRQEYGAEF 252 Query: 267 LGSVDTLIAP-SKLRIMPYHDPITSNRGLA-VYEQVIPEHNYIITVDVSRGVGNDYSAFC 324 + +++ + L ++PY RG V E P+H Y I D G DYS F Sbjct: 253 VSHSNSVFSGLDMLILLPYE-----RRGTRLVVEDYRPDHIYCIGADF--GKNQDYSVFS 305 Query: 325 VIDTTTIPYKMVARYKNNEIKPIVLPNIIVDIAKNYNNAYILCEVNDIGGQVADIIQFDL 384 V+D T + R V + ++++Y +AY++ + +G +A+ + Sbjct: 306 VLDLDTGAIVCLERMNGATWSDQVAR--LKALSEDYGHAYVVADTWGVGDAIAEELD--- 360 Query: 385 EYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMSTATKQVGCSNLKALIEEDKLLI-NDY 443 QG + T L VK S+ +Q+ SNL L+E+ ++ + ND Sbjct: 361 ------------------AQGIN--YTPLPVKSSSVKEQL-ISNLALLMEKGQVAVPNDK 399 Query: 444 DTIAELTTF----IAKG-QTFQAEEGCNDDLAMCLVI 475 + EL F A G Q +A +DD+ M L + Sbjct: 400 TILDELRNFRYYRTASGNQVMRAYGRGHDDIVMSLAL 436 >gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: terminase large subunit # Family: family:all:147 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467938;genbank:gi:157265379;genbank:Ge neID:5600506 Length = 485 Score = 73.9 bits (180), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 86/337 (25%), Positives = 145/337 (43%), Gaps = 61/337 (18%) Query: 162 VRGMSFNVIFLDEFAFIPNHIADQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDA 221 +RG + + + LDE A IP + + ++ PT+S + +IISTP G+N FY+ + Sbjct: 138 LRGATLDFVILDEAAMIPFSVWSE---AIEPTLSV-RDGWALIISTPKGLNWFYEFFLMG 193 Query: 222 ER-GTNEYVPTE--------------VHWSEVPGRDDVWKEQTIKNTSESQFRVEFECEF 266 R G E +P W P R + + E+ + + +FR E+ EF Sbjct: 194 WRGGLKEGIPNSGVNQTHPDFESFHAASWDVWPERREWYMERRL-YIPDLEFRQEYGAEF 252 Query: 267 LGSVDTLIAP-SKLRIMPYHDPITSNRGLA-VYEQVIPEHNYIITVDVSRGVGNDYSAFC 324 + +++ + L ++PY RG V E P+H Y I D G DYS F Sbjct: 253 VSHSNSVFSGLDMLILLPYE-----RRGTRLVVEDYRPDHIYCIGADF--GKNQDYSVFS 305 Query: 325 VIDTTTIPYKMVARYKNNEIKPIVLPNIIVDIAKNYNNAYILCEVNDIGGQVADIIQFDL 384 V+D T + R V + ++++Y +AY++ + +G +A+ + Sbjct: 306 VLDLDTGAIVCLERMNGATWSDQVAR--LKALSEDYGHAYVVADTWGVGDAIAEELD--- 360 Query: 385 EYENLLMAAMRGRAGQQLGQGFSGKKTQLGVKMSTATKQVGCSNLKALIEEDKLLI-NDY 443 QG + T L VK S+ +Q+ SNL L+E+ ++ + ND Sbjct: 361 ------------------AQGIN--YTPLPVKSSSVKEQL-ISNLALLMEKGQVAVPNDK 399 Query: 444 DTIAELTTF----IAKG-QTFQAEEGCNDDLAMCLVI 475 + EL F A G Q +A +DD+ M L + Sbjct: 400 TILDELRNFRYYRTASGNQVMRAYGRGHDDIVMSLAL 436 >gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: terminase, large subunit # Family: family:all:147 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006983;genbank:gi:46401884;genbank:GeneID :2777679 Length = 438 Score = 60.5 bits (145), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 64/211 (30%), Positives = 102/211 (48%), Gaps = 33/211 (15%) Query: 71 HRFNIAKLPRQSGKSTIVTAYLLWYV-LFNANVNVAILANKAATAREMLQRLQLSYENLP 129 HRF A + R+ GKS I AY L ++ L NV V ++A + A + + + Sbjct: 54 HRFVTACVSRRVGKSFI--AYTLGFLKLLEPNVKVLVVAPNYSLA-------NIGWSQIR 104 Query: 130 NWMQQGILQWNR-----GSLELENGS--KIMAASTSASAVRGMSFNVIFLDEFAFIPNHI 182 +++ LQ R +EL NGS K+ +A+ + SAV G S++ I DE A I + Sbjct: 105 GLIKKYGLQTERENAKDKEIELANGSLFKLASAAQADSAV-GRSYDFIIFDEAA-ISDVG 162 Query: 183 ADQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERGTNEYVPTEVHWSEVPG-- 240 D F + PT+ S K + ISTP G N F + + G ++ +P +W + G Sbjct: 163 GDAFRVQLRPTLDKPNS-KALFISTPRGGNWFKEFY---AYGFDDTLP---NWVSIHGTY 215 Query: 241 RDDVWK-----EQTIKNTSESQFRVEFECEF 266 RD+ E+ + S++ FR E+E +F Sbjct: 216 RDNPRADLNDIEEARRTVSKNYFRQEYEADF 246 >gi|17621|lcl|protein:vir:816 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050549;genbank:gi:9633446;genbank:GeneID: 1262208 Length = 568 Score = 47.4 bits (111), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 47/185 (25%), Positives = 85/185 (45%), Gaps = 14/185 (7%) Query: 294 LAVYEQVIPEHNYIITVDVSRGVGN-DYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNI 352 L V+E P+ Y+ D + G+ + D S+ V+ + + VA + + + + ++ Sbjct: 368 LLVWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSN--GEQVAHWFGH-LDAELFAHL 424 Query: 353 IVDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQ 412 I + + YNNA++ E N+ G V I++ Y + Q L Q + + Sbjct: 425 ISQVCRMYNNAFVGPERNNHGHAV--ILKLRELYPTRYIYNE-----QHLDQAYDDDTPR 477 Query: 413 LGVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFI--AKGQTFQAEEGCNDDLA 470 LG + +K V +K L+ I T++E+ T++ AKG + A+EGC DD Sbjct: 478 LGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKG-SMNAQEGCFDDQL 536 Query: 471 MCLVI 475 M +I Sbjct: 537 MSYMI 541 >gi|25680|lcl|protein:vir:10116 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859246;genbank:gi:32171173;genbank:GeneID :2653342 Length = 568 Score = 47.4 bits (111), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 47/185 (25%), Positives = 85/185 (45%), Gaps = 14/185 (7%) Query: 294 LAVYEQVIPEHNYIITVDVSRGVGN-DYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNI 352 L V+E P+ Y+ D + G+ + D S+ V+ + + VA + + + + ++ Sbjct: 368 LLVWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSN--GEQVAHWFGH-LDAELFAHL 424 Query: 353 IVDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQ 412 I + + YNNA++ E N+ G V I++ Y + Q L Q + + Sbjct: 425 ISQVCRMYNNAFVGPERNNHGHAV--ILKLRELYPTRYIYNE-----QHLDQAYDDDTPR 477 Query: 413 LGVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFI--AKGQTFQAEEGCNDDLA 470 LG + +K V +K L+ I T++E+ T++ AKG + A+EGC DD Sbjct: 478 LGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKG-SMNAQEGCFDDQL 536 Query: 471 MCLVI 475 M +I Sbjct: 537 MSYMI 541 >gi|51|lcl|protein:vir:3295 Length: 568 # NCBI annotation: putative large subunit terminase # Family: family:all:147 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049511;genbank:gi:9632517;genbank:GeneID: 1262002 Length = 568 Score = 47.4 bits (111), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 47/185 (25%), Positives = 85/185 (45%), Gaps = 14/185 (7%) Query: 294 LAVYEQVIPEHNYIITVDVSRGVGN-DYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNI 352 L V+E P+ Y+ D + G+ + D S+ V+ + + VA + + + + ++ Sbjct: 368 LLVWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSN--GEQVAHWFGH-LDAELFAHL 424 Query: 353 IVDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQ 412 I + + YNNA++ E N+ G V I++ Y + Q L Q + + Sbjct: 425 ISQVCRMYNNAFVGPERNNHGHAV--ILKLRELYPTRYIYNE-----QHLDQAYDDDTPR 477 Query: 413 LGVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFI--AKGQTFQAEEGCNDDLA 470 LG + +K V +K L+ I T++E+ T++ AKG + A+EGC DD Sbjct: 478 LGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKG-SMNAQEGCFDDQL 536 Query: 471 MCLVI 475 M +I Sbjct: 537 MSYMI 541 >gi|26240|lcl|protein:vir:2763 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612880;genbank:gi:20065962;genbank:GeneID :935714 Length = 568 Score = 47.4 bits (111), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 47/185 (25%), Positives = 85/185 (45%), Gaps = 14/185 (7%) Query: 294 LAVYEQVIPEHNYIITVDVSRGVGN-DYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNI 352 L V+E P+ Y+ D + G+ + D S+ V+ + + VA + + + + ++ Sbjct: 368 LLVWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSN--GEQVAHWFGH-LDAELFAHL 424 Query: 353 IVDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQ 412 I + + YNNA++ E N+ G V I++ Y + Q L Q + + Sbjct: 425 ISQVCRMYNNAFVGPERNNHGHAV--ILKLRELYPTRYIYNE-----QHLDQAYDDDTPR 477 Query: 413 LGVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFI--AKGQTFQAEEGCNDDLA 470 LG + +K V +K L+ I T++E+ T++ AKG + A+EGC DD Sbjct: 478 LGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKG-SMNAQEGCFDDQL 536 Query: 471 MCLVI 475 M +I Sbjct: 537 MSYMI 541 >gi|25849|lcl|protein:vir:9949 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859079;genbank:gi:32171001;genbank:GeneID :2653280 Length = 568 Score = 47.4 bits (111), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 47/185 (25%), Positives = 85/185 (45%), Gaps = 14/185 (7%) Query: 294 LAVYEQVIPEHNYIITVDVSRGVGN-DYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNI 352 L V+E P+ Y+ D + G+ + D S+ V+ + + VA + + + + ++ Sbjct: 368 LLVWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSN--GEQVAHWFGH-LDAELFAHL 424 Query: 353 IVDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQ 412 I + + YNNA++ E N+ G V I++ Y + Q L Q + + Sbjct: 425 ISQVCRMYNNAFVGPERNNHGHAV--ILKLRELYPTRYIYNE-----QHLDQAYDDDTPR 477 Query: 413 LGVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFI--AKGQTFQAEEGCNDDLA 470 LG + +K V +K L+ I T++E+ T++ AKG + A+EGC DD Sbjct: 478 LGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKG-SMNAQEGCFDDQL 536 Query: 471 MCLVI 475 M +I Sbjct: 537 MSYMI 541 >gi|1476|lcl|protein:vir:104436 Length: 568 # NCBI annotation: putative terminase large subunit # Family: family:all:147 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794060;genbank:gi:116222005;genbank:GeneI D:4397501 Length = 568 Score = 47.4 bits (111), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 47/185 (25%), Positives = 85/185 (45%), Gaps = 14/185 (7%) Query: 294 LAVYEQVIPEHNYIITVDVSRGVGN-DYSAFCVIDTTTIPYKMVARYKNNEIKPIVLPNI 352 L V+E P+ Y+ D + G+ + D S+ V+ + + VA + + + + ++ Sbjct: 368 LLVWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSN--GEQVAHWFGH-LDAELFAHL 424 Query: 353 IVDIAKNYNNAYILCEVNDIGGQVADIIQFDLEYENLLMAAMRGRAGQQLGQGFSGKKTQ 412 I + + YNNA++ E N+ G V I++ Y + Q L Q + + Sbjct: 425 ISQVCRMYNNAFVGPERNNHGHAV--ILKLRELYPTRYIYNE-----QHLDQAYDDDTPR 477 Query: 413 LGVKMSTATKQVGCSNLKALIEEDKLLINDYDTIAELTTFI--AKGQTFQAEEGCNDDLA 470 LG + +K V +K L+ I T++E+ T++ AKG + A+EGC DD Sbjct: 478 LGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKG-SMNAQEGCFDDQL 536 Query: 471 MCLVI 475 M +I Sbjct: 537 MSYMI 541 >gi|12711|lcl|protein:vir:80169 Length: 565 # NCBI annotation: terminase # Family: family:all:543 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285801;genbank:gi:148747835;genbank:Ge neID:5220445 Length = 565 Score = 44.3 bits (103), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 36/149 (24%), Positives = 67/149 (44%), Gaps = 15/149 (10%) Query: 34 DPIYFIKNYIKIVSLDEGLVPFNMYHFQ------EEMVSKFHDHRFNIAKLPRQSGKSTI 87 D ++ I + ++ G F +H++ + + D+R + PR KST+ Sbjct: 13 DALFDIWAFADLIGFRGGRKSFGAFHYELTDFLTQTQQHEDKDNRRRLVLAPRGHLKSTV 72 Query: 88 VTA-YLLWYVLFNANVNVAILANKAATAREMLQRLQLSYENLPNWMQQGILQWN-RGSLE 145 + Y+LW + N ++ V + N +R ++ L+ +E+ W+QQ + WN R +E Sbjct: 73 CSVLYVLWRIYRNPDIRVLVGTNLKRLSRAFIRELRQYFED--TWLQQNV--WNVRPHIE 128 Query: 146 LENGSKIMAASTSASAVRGMSFNVIFLDE 174 G+ + A S S R N + DE Sbjct: 129 ---GALVPALSASDRRKRNSQRNNVDYDE 154 >gi|20547|lcl|protein:vir:105155 Length: 580 # NCBI annotation: conserved phage-related protein # Family: family:all:4926 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398598;genbank:gi:80159854;genbank:GeneID :3772993 Length = 580 Score = 40.0 bits (92), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 61/318 (19%), Positives = 127/318 (39%), Gaps = 38/318 (11%) Query: 36 IYFIKNYIKIVSLDEGLVPFNMYHFQEEMVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWY 95 I + + YI ++ ++ +Y FQ ++ +++ + R GKS + + + Sbjct: 47 ISYYRKYIDKFCIE--VLGLKLYLFQRLILRAMARNQYVMLICCRGLGKSWLSAVFFVAS 104 Query: 96 VLFNANVNVAILANKAATARE-MLQRLQLSYENLPNWMQQGILQWNRGS----LELENGS 150 + + I + + AR ++Q+++ P+ ++ + G+ + NGS Sbjct: 105 CILYKGLKCGIASGQGQQARNVIIQKVKGELAKNPSIAREIVFPIKTGADDCVVNFRNGS 164 Query: 151 KIMA---ASTSASAVRGMSFNVIFLDEFAFIPNHIADQFFSSVYPTISS-------GKST 200 +I A R F+ + +DE + + + + + T + + Sbjct: 165 EIRAIVLGRNQGDGARSWRFHYLLVDECRLVSDKVINTILIPMTKTKRAVAIHHNKREKG 224 Query: 201 KVIIISTPH----GMNMFYKLWHDA-ERGTNEYVPTEVHW-----SEVPGRDDVWKEQTI 250 KVI IS+ + + +K + D G N Y + + + + +DD+ +E+ Sbjct: 225 KVIFISSAYLKTSDLYKRFKYFCDKMSSGANNYFVCSLDYRVGIEAGIFDQDDIDEERNK 284 Query: 251 KNTSESQFRVEFECEFLGSVDTLIAPSKLRIMPYHDPITSNRGLAVYEQVIPEHN---YI 307 + + +F+ E+E F+GS S PY + T R L E P+ + YI Sbjct: 285 PDMTIEEFQYEYEGIFVGS-------SGESYFPY-ETTTPARVLGRGEITQPKKSKSEYI 336 Query: 308 ITVDVSRGVGNDYSAFCV 325 IT DV+ +D C Sbjct: 337 ITHDVAISGASDSDNACT 354 >gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: terminase large subunit # Family: family:all:4877 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429607;genbank:gi:156564098;genbank:Ge neID:5525534 Length = 635 Score = 37.7 bits (86), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 47/212 (22%), Positives = 84/212 (39%), Gaps = 36/212 (16%) Query: 23 EEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHFQEEMVSKFHDHRFNIAKLPRQS 82 EE L D P ++ +K D +QE M+ + D + + +L R+ Sbjct: 44 EEELHYLAILDKPKFWAAETLKWFCRD----------YQEPMLQEMADSKRTVLRLGRRL 93 Query: 83 GKSTIVTAYLLWYVLFNAN------VNVAILANKAATAREMLQRLQLSYENLPNWMQQGI 136 GK+ + +LW+ N ++ I+A + +RL + G Sbjct: 94 GKTETMCIMILWHAFTQPNKGPNNQYDILIIAPYEEQVDLIFKRLSQLID------MSGD 147 Query: 137 LQWNR---GSLELENGS---KIMAASTSASA---VRGMSFNVIFLDEFAFIPNHIADQFF 187 + +R +EL NG+ I A S S S RG ++I LDE +++ + Sbjct: 148 VNPSRDIDKHIELPNGTVIHGITAGSKSGSGAANTRGQRADLIVLDEM----DYMGESEI 203 Query: 188 SSVYPTISSG-KSTKVIIISTPHGMNMFYKLW 218 +++ + + K+I+ STP G Y W Sbjct: 204 TNIMNIRNEAPERIKMIVASTPSGRRDSYYKW 235 >gi|26461|lcl|protein:vir:573 Length: 589 # NCBI annotation: unknown # Family: family:all:4926 # MgeID: mge:13 # MgeName: SPBc2 # Cross-refs: genbank:acc:NP_046608;genbank:gi:9630181;genbank:GeneID: 1261423 Length = 589 Score = 37.0 bits (84), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 35/151 (23%), Positives = 67/151 (44%), Gaps = 20/151 (13%) Query: 34 DPIYFIKNYIKI-VSLDEGLVPFNMYHFQEEMVSKFHDHRFNIAKLPRQSGKSTIVTAYL 92 +P F+K Y+ I + L + ++ + M H+H F + R GK+ + + Y Sbjct: 50 NPHRFVKEYLGITLKLFQCILIYMM----------VHNHYF-MYLASRGQGKTWLTSVYC 98 Query: 93 LWYVLFNANVNVAILANKAATAREMLQR---LQLSYENLPNWMQQGILQWNRGSLELENG 149 + + I + ARE++++ L+ NL ++ N +E NG Sbjct: 99 CVQAILFPGTKIVIASGTKGQAREVIEKIDDLRKESPNLRREIEDLKTSTNDAKVEFHNG 158 Query: 150 S--KIMAASTSASAVRGMSFNVIFLDEFAFI 178 S KI+A++ A + R N++ +DEF + Sbjct: 159 SWIKIVASNDGARSKRA---NLLIVDEFRMV 186 >gi|10061|lcl|protein:vir:99071 Length: 537 # NCBI annotation: gp26 # Family: family:all:523 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655891;genbank:gi:109521463;genbank:GeneI D:4158036 Length = 537 Score = 36.2 bits (82), Expect = 0.001, Method: Compositional matrix adjust. Identities = 44/167 (26%), Positives = 75/167 (44%), Gaps = 28/167 (16%) Query: 78 LPRQSGKSTIVTAYLL---WYVLFNANVNVAILANKAATAREMLQRLQLSYENLPNWMQQ 134 +PRQ+GK+ I+ A L + V ++ A+L N ARE RL+ EN N Sbjct: 90 VPRQNGKTAIIEARELVGLYVVCDKLCIHTAVLFN---AARESFYRLKARIEN--NETLN 144 Query: 135 GILQWNRG----SLELE---------NGSKIMAASTSASAVRGMSFNVIFLDE-FAFIPN 180 I ++ G S+E++ G +++ + + RG S +VI LDE FA Sbjct: 145 KITRFRSGNDNMSIEVKPKKESRHPNAGGRVIYMARGTAVARGFSADVIVLDEAFALDEA 204 Query: 181 HIADQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERGTNE 227 IA ++ +S ++ II ++ G+ +L +RG + Sbjct: 205 SIAAIDYA------TSARANPFIIYASSTGLEDSTELEKLHDRGMRQ 245 >gi|16631|lcl|protein:vir:9699 Length: 584 # NCBI annotation: hypothetical protein # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795461;genbank:gi:28876230;genbank:GeneID :1257775 Length = 584 Score = 35.8 bits (81), Expect = 0.001, Method: Compositional matrix adjust. Identities = 54/190 (28%), Positives = 88/190 (46%), Gaps = 36/190 (18%) Query: 10 NLKKVNTSIDFK----PEEVAEVLKCQDDPIYFIKNYIKIVSLDEGLVPFNMYHFQEEMV 65 +LK+++ DFK PE+ A DPI N+I+I+ + P+ + FQ+ ++ Sbjct: 46 DLKRIDDE-DFKFVYLPEKAA-------DPI----NFIEILPDVKTGKPYPLAMFQKFII 93 Query: 66 SKFH------DH---RFNIA--KLPRQSGKSTIVTAYLLWYVLFNANVNVA----ILAN- 109 + DH RF A + R++GK+ ++ LL+ LF N +++ AN Sbjct: 94 GNLYGWRKKTDHSLRRFRKAMISVARKNGKTILIAGILLYEFLFGHNPSMSRQLFCTAND 153 Query: 110 --KAATAREMLQRLQLSYENLPNWMQQGILQWNRGSLE-LENGSKIMAASTSASAVRGMS 166 +A A +M ++ QLS + + + R L+ L + S I A S AV G Sbjct: 154 RTQAKIAWDMAKK-QLSSLRAKDVDVRKATKIVRDELKNLHDESYIRALSRDTGAVDGFE 212 Query: 167 FNVIFLDEFA 176 V LDEFA Sbjct: 213 PYVGVLDEFA 222 >gi|14676|lcl|protein:vir:2499 Length: 474 # NCBI annotation: putative terminase gp4 # Family: family:all:523 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569740;genbank:gi:18496890;genbank:GeneID :932339 Length = 474 Score = 34.3 bits (77), Expect = 0.004, Method: Compositional matrix adjust. Identities = 29/136 (21%), Positives = 61/136 (44%), Gaps = 11/136 (8%) Query: 78 LPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQLSYENLPNWMQQGIL 137 +PRQ+GK+ ++ A + + N V A++ TA E + +Q + + + IL Sbjct: 69 IPRQTGKTYLLGALVFALCIKTPNTTVIWTAHRTRTAAETFRSMQGLAKR--DKIAPHIL 126 Query: 138 QWNRG----SLELENGSKIMAASTSASAVRGMS-FNVIFLDEFAFIPNHIADQFFSSVYP 192 + G ++ +NGS+I+ + RG + +V+ DE + + D P Sbjct: 127 NVHTGNGKEAVLFKNGSRILFGARERGFGRGFAGVDVLIFDEAQILTENAMDDMV----P 182 Query: 193 TISSGKSTKVIIISTP 208 ++ + +++ TP Sbjct: 183 ATNAAPNPLILLAGTP 198 >gi|7504|lcl|protein:vir:99915 Length: 478 # NCBI annotation: gp2 # Family: family:all:523 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655519;genbank:gi:109392289;genbank:GeneI D:4157084 Length = 478 Score = 34.3 bits (77), Expect = 0.004, Method: Compositional matrix adjust. Identities = 35/162 (21%), Positives = 71/162 (43%), Gaps = 13/162 (8%) Query: 74 NIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQ-LSYENLPNWM 132 ++ +PRQ GK+ ++ + L + V A++ TA+E ++ + L N Sbjct: 63 SVISIPRQVGKTYLIGCIVFALALLTPGLTVIWTAHRTKTAKETFGSMKAMCATPLVNAH 122 Query: 133 QQGILQWNRG--SLELENGSKIM-AASTSASAVRGMSFNVIFLDEFAFIPNHIADQFFSS 189 + + RG + L NGS+I+ A + + ++ LDE + D+ Sbjct: 123 VRNVSD-ARGDEGIYLHNGSRILFGARENGFGLGFAGVGILVLDE----AQRLTDKAMDD 177 Query: 190 VYPTISSGKSTKVIIISTP----HGMNMFYKLWHDAERGTNE 227 + PT+++ ++ +++ TP +F L DA G +E Sbjct: 178 LIPTMNTVENPLILLTGTPPRPTDSGEVFTMLRQDALDGESE 219 >gi|14207|lcl|protein:vir:8315 Length: 503 # NCBI annotation: gp32 # Family: family:all:523 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817883;genbank:gi:29566316;genbank:GeneID :1259511 Length = 503 Score = 33.9 bits (76), Expect = 0.006, Method: Compositional matrix adjust. Identities = 38/172 (22%), Positives = 69/172 (40%), Gaps = 15/172 (8%) Query: 78 LPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQ--LSYENLPNWMQQG 135 +PRQ GK+ +TA L + V ++ T E Q +Q E + ++++ Sbjct: 104 IPRQCGKTHTLTAVLFGLCVEYPGVLAIWTSHHVKTNTETFQAVQAYAKRERVAPFIRKV 163 Query: 136 ILQWNRGSLELENGSKIMAASTSASAVRGM-SFNVIFLDEFAFIPNHIADQFFSSVYPTI 194 L ++E NGS+I+ + RG+ +V+ DE + + T+ Sbjct: 164 TLGSGDEAVEFANGSRILFGARERGFGRGIPGVDVLMSDEAQILTQRAMQDMLA----TL 219 Query: 195 SSGKSTKVIIISTP----HGMNMFYKLWHDAERGTNEYVPTEVHWSEVPGRD 242 ++ + I + TP MF + +AE G T++ W E D Sbjct: 220 NTSRLGLHIYVGTPPKPTDNSEMFSVMRREAETGE----ATDIVWIECGAED 267 >gi|16385|lcl|protein:vir:3744 Length: 607 # NCBI annotation: orf15 # Family: family:all:169 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043485;genbank:gi:9628620;genbank:GeneID: 1261142 Length = 607 Score = 33.1 bits (74), Expect = 0.009, Method: Compositional matrix adjust. Identities = 81/390 (20%), Positives = 156/390 (40%), Gaps = 77/390 (19%) Query: 39 IKNYIKIVSLDEGLVPF--NMYHFQEEMVSKFHDHRFNIAKLPRQSGKSTIVTAYLLWYV 96 +KN I V+ E PF +++ +Q+ + S H NI K RQ G + + L Sbjct: 144 VKNDISHVT-PEMCQPFIDSLFDYQKHIRSNKHHDVRNILK-SRQIGATYYFSFEALEDA 201 Query: 97 LFNANVNVAILANK--AATAREMLQRLQLSYENLPNWMQQGILQWNRGSLELENGSKIMA 154 +F+ + + + A+K A + + ++ Y + + + L NG+++ Sbjct: 202 IFSGDNQIFLSASKRQAEIFKNYIVKMAREYFGV---------ELTGNPIILSNGAELHF 252 Query: 155 ASTSASAVRGMSFNVIFLDEFAFIPNHIADQFFSSVYPTISSGKSTKVIIISTP----HG 210 ST+ + +G S +V + DE+A+I + Q F V +++ + + STP H Sbjct: 253 LSTNKNTSQGNSGHV-YGDEYAWIRDF---QRFDDVASAMATHEKWRETYFSTPSSKFHE 308 Query: 211 MNMFYKL--WHDAE-RGTNEYVPTEVHWSEVPGR---DDVWK-----EQTIKNTSESQFR 259 F+ W D + + N PT + GR D W+ E +K ++ F Sbjct: 309 SYSFWSGDNWRDGDPKRKNVPFPTFAELRD-GGRLCPDGQWRYVVTIEDALKGGADKLFN 367 Query: 260 VE--------------FECEFLGSVDTLIAPSKL-----------RIMPYHDPITSNRGL 294 +E + C ++ D++ +L P D +R Sbjct: 368 IEKLKQRYSKYAFNQLYMCIWIDDADSIFNVKQLLKCGVDIAKWKDFNPKADRPFGDR-- 425 Query: 295 AVYEQVIPEHNYIITVDVSRGVGNDYSAFCVIDTTTIP---YKMVARYKNNEIKPIVLPN 351 V+ P H+ D ++F +I +P Y+M+ARY+ + + + N Sbjct: 426 EVWGGFDPAHS------------GDGASFVIIAPPALPGEKYRMLARYQWHGLSYVYQAN 473 Query: 352 IIVDIAKNYNNAYILCEVNDIGGQVADIIQ 381 I + + YN YI + +G V ++++ Sbjct: 474 QIRALYEKYNMTYIGIDATGVGYGVYELVK 503 >gi|19904|lcl|protein:vir:3781 Length: 607 # NCBI annotation: terminase # Family: family:all:169 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536821;genbank:gi:17981830;genbank:GeneID :929209 Length = 607 Score = 33.1 bits (74), Expect = 0.010, Method: Compositional matrix adjust. Identities = 84/391 (21%), Positives = 159/391 (40%), Gaps = 79/391 (20%) Query: 39 IKNYIKIVSLDEGLVPF--NMYHFQEEM-VSKFHDHRFNIAKLPRQSGKSTIVTAYLLWY 95 +KN I VS E PF +++ +Q+ + +K HD R NI K RQ G + + L Sbjct: 144 VKNDISHVS-PEMCQPFIDSLFDYQKHIRANKHHDVR-NILK-SRQIGATYYFSFEALED 200 Query: 96 VLFNANVNVAILANK--AATAREMLQRLQLSYENLPNWMQQGILQWNRGSLELENGSKIM 153 +F+ + + + A+K A + + ++ Y + + + L NG+++ Sbjct: 201 AIFSGDNQIFLSASKRQAEIFKNYIVKMAREYFGV---------ELTGNPIILSNGAELH 251 Query: 154 AASTSASAVRGMSFNVIFLDEFAFIPNHIADQFFSSVYPTISSGKSTKVIIISTP----H 209 ST+ + +G S +V + DE+A+I + Q F+ V +++ + STP H Sbjct: 252 FLSTNKNTSQGNSGHV-YGDEYAWIRDF---QRFNDVASAMATHAKWRETYFSTPSSKFH 307 Query: 210 GMNMFYKL--WHDAE-RGTNEYVPTEVHWSEVPGR---DDVWK-----EQTIKNTSESQF 258 F+ W D + + N PT + GR D W+ E +K + + F Sbjct: 308 ESYSFWSGDNWRDGDPKRKNVPFPTFAELRD-GGRLCPDGQWRYVVTIEDALKGGAGTLF 366 Query: 259 RVE--------------FECEFLGSVDTLIAPSKL-----------RIMPYHDPITSNRG 293 +E + C ++ D++ +L P D +R Sbjct: 367 NIEKLKQRYSKYAFNQLYMCVWIDDADSIFTVHQLLKCGVDISKWKDFNPKADRPFGDR- 425 Query: 294 LAVYEQVIPEHNYIITVDVSRGVGNDYSAFCVIDTTTIP---YKMVARYKNNEIKPIVLP 350 V+ P H+ D ++F +I +P Y+++ARY+ N + + Sbjct: 426 -EVWGGFDPAHS------------GDGASFVIIAPPALPSEKYRVLARYQWNGLSYVYQA 472 Query: 351 NIIVDIAKNYNNAYILCEVNDIGGQVADIIQ 381 N I + + YN YI + +G V ++++ Sbjct: 473 NQIRALYEKYNMTYIGIDATGVGYGVYELVK 503 >gi|4240|lcl|protein:vir:94734 Length: 469 # NCBI annotation: large subunit terminase # Family: family:all:523 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996704;genbank:gi:45597419;genbank:GeneID :2767958 Length = 469 Score = 32.3 bits (72), Expect = 0.015, Method: Compositional matrix adjust. Identities = 27/108 (25%), Positives = 51/108 (47%), Gaps = 20/108 (18%) Query: 78 LPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQLSYENLPNWMQ---- 133 +PR++GK+ IV LW +++ A++ +T+ SYE L +++ Sbjct: 68 IPRRNGKTEIVYILELW--ALEQGLSILHTAHRISTSHS-------SYEKLKKYLEDSGY 118 Query: 134 ------QGILQWNRGSLEL-ENGSKIMAASTSASAVRGMSFNVIFLDE 174 + I + LEL E+G I + ++S G F+++F+DE Sbjct: 119 VEGEDFKSIKAKGQERLELIESGGVIQFRTRTSSGGLGEGFDILFIDE 166 >gi|15774|lcl|protein:vir:6239 Length: 527 # NCBI annotation: gp33 # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813693;swissprot:trembl:q859c4;genbank:gi :29366753;interpro:IPR005021;uniprot:Q859C4;genbank:Gene ID:1258893 Length = 527 Score = 32.3 bits (72), Expect = 0.016, Method: Compositional matrix adjust. Identities = 38/180 (21%), Positives = 77/180 (42%), Gaps = 22/180 (12%) Query: 55 FNMYHFQEEMVSKFH-DHRFNIAKLPRQSGKSTIVTAYLLWYVLF---NANVNVAILANK 110 + Y ++ ++ HR + + R++GKSTI A +L++++ +A V AN Sbjct: 61 IDAYELTQDTFGRWRRKHRTVVVCVARKNGKSTIAAAIMLYHLIADRGDAQRQVIAAAND 120 Query: 111 AATAREMLQRLQLSYENLPNW-----MQQGILQWNRGSLELENGSKIMAASTSASAVRGM 165 AR + + P +Q+ ++++ +N ++++A A +G+ Sbjct: 121 RNQARMVFDSAKQMVNASPKLAAVCNVQRDVIRYK------DNTYRVVSA--DAGRQQGL 172 Query: 166 SFNVIFLDEFAFIPNHIADQFFSSVYPTISSGKSTKVIIIST--PHGMNMFYKLWHDAER 223 + + LDE+AF + F ++ ++ +IIST P F L ER Sbjct: 173 NPAAVSLDEYAF---SKSSDLFDALTLGSAARNQPMFLIISTAGPDPDGPFAALCEQGER 229 >gi|15412|lcl|protein:vir:1325 Length: 519 # NCBI annotation: gp33 # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047924;swissprot:trembl:q9t214;genbank:gi :9631142;uniprot:Q9T214;genbank:GeneID:2715903 Length = 519 Score = 31.6 bits (70), Expect = 0.026, Method: Compositional matrix adjust. Identities = 41/196 (20%), Positives = 83/196 (42%), Gaps = 41/196 (20%) Query: 54 PFNMYHFQEEMVSKFH------------DHRFNIAKLPRQSGKSTIVTAYLLWYVLF--- 98 PF + +Q E++ + HR + + R++GKSTI A +L++++ Sbjct: 46 PFRLLPWQRELLIDAYVLTQDTFGRWRRKHRTVVVCVARKNGKSTIAAAIMLYHLIADRG 105 Query: 99 NANVNVAILANKAATAREMLQRLQLSYENLPNW-----MQQGILQWNRGSLELENGSKIM 153 +A + AN AR + + P +Q+ ++++ +N +++ Sbjct: 106 DAQRQIIAAANDRNQARMVFDSAKQMVNASPKLAAVCDVQRDVIRYK------DNTYRVV 159 Query: 154 AASTSASAVRGMSFNVIFLDEFAFIPNHIADQFFSSVYPTISSGKSTK----VIIIST-- 207 +A A +G++ + LDE+AF + S ++ ++ G + + +IIST Sbjct: 160 SA--DAGRQQGLNPAAVSLDEYAFSKH-------SDLFDALTLGSAARNQPMFLIISTAG 210 Query: 208 PHGMNMFYKLWHDAER 223 P F L ER Sbjct: 211 PDPDGPFAALCEQGER 226 >gi|4089|lcl|protein:vir:94598 Length: 581 # NCBI annotation: PfWMP4_40 # Family: family:all:543 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762670;genbank:gi:115304378;genbank:GeneI D:5142298 Length = 581 Score = 31.2 bits (69), Expect = 0.036, Method: Compositional matrix adjust. Identities = 19/75 (25%), Positives = 38/75 (50%), Gaps = 9/75 (12%) Query: 75 IAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQLSYENLPNWMQQ 134 + ++PR KST+ Y++W + N N+ + +N + ++ L+ +E+ +QQ Sbjct: 90 LLQMPRGHLKSTLTVGYIMWRIYRNPNIRMLHASNIRELSEAFIRELRNYFED--EGLQQ 147 Query: 135 GILQWN-----RGSL 144 + WN RG+L Sbjct: 148 RV--WNSRPHIRGNL 160 >gi|18494|lcl|protein:vir:1636 Length: 469 # NCBI annotation: putative terminase # Family: family:all:523 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695057;genbank:gi:23455748;genbank:GeneID :955481 Length = 469 Score = 30.0 bits (66), Expect = 0.065, Method: Compositional matrix adjust. Identities = 26/108 (24%), Positives = 52/108 (48%), Gaps = 20/108 (18%) Query: 78 LPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQLSYENLPNWMQ---- 133 +PR++GK+ IV LW ++ +++ A++ +T+ SYE L +++ Sbjct: 68 IPRRNGKTEIVYILELWSLV--QGLSILHTAHRISTSHS-------SYEKLKKYLEDSGY 118 Query: 134 ------QGILQWNRGSLEL-ENGSKIMAASTSASAVRGMSFNVIFLDE 174 + I + LEL E+G I + ++S G F+++ +DE Sbjct: 119 VEGEDFKSIKAKGQERLELIESGGVIQFRTRTSSGGLGEGFDILVIDE 166 >gi|3859|lcl|protein:vir:107748 Length: 532 # NCBI annotation: gp33 TerL # Family: family:all:54 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024878;genbank:gi:48697520;genbank:GeneID :2948367 Length = 532 Score = 29.6 bits (65), Expect = 0.084, Method: Compositional matrix adjust. Identities = 33/178 (18%), Positives = 72/178 (40%), Gaps = 22/178 (12%) Query: 107 LANKAATAREMLQRLQLSYENLPNWMQQGILQWNRGSLELENGSKIMAASTSA------- 159 L +KA + +++ ++LP + G WN + +++ T A Sbjct: 134 LVDKAGDPDSIFFKVRFFLQHLPPEFRGG---WNPHDHTHSSHMRVIIPDTGAVIRGEAG 190 Query: 160 -SAVRGMSFNVIFLDEFAFIPNHIADQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLW 218 + RG ++ F+DE A + N + T + + I IS+ +G+N + Sbjct: 191 KNIGRGGRVSIQFVDEAAHLEN-------AQAVDTALAATTNCRIDISSVNGLNNPFA-- 241 Query: 219 HDAERGTNEYVPTEVHWSEVPGRDDVWKEQTIKNTSESQFRVEFECEFLGSVDTLIAP 276 +R + +HW + P +DD W ++ + + E + ++ S + ++ P Sbjct: 242 --EKRFSGRVKVKTMHWRDDPRKDDEWYKKQKQKFNALVVAQEIDIDYSASAEGVLIP 297 >gi|8927|lcl|protein:vir:97005 Length: 632 # NCBI annotation: 40 # Family: family:all:697 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654141;genbank:gi:108862025;genbank:GeneI D:5075954 Length = 632 Score = 29.6 bits (65), Expect = 0.11, Method: Compositional matrix adjust. Identities = 10/50 (20%), Positives = 27/50 (54%) Query: 68 FHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREM 117 F+ H++ + + PR K+T+ Y ++ ++ + + +++ A A E+ Sbjct: 62 FYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEI 111 >gi|10294|lcl|protein:vir:105656 Length: 405 # NCBI annotation: putative large terminase subunit # Family: family:all:697 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425018;genbank:gi:83571766;uniprot:Q2WC34 ;genbank:GeneID:3837297 Length = 405 Score = 29.3 bits (64), Expect = 0.11, Method: Compositional matrix adjust. Identities = 10/50 (20%), Positives = 27/50 (54%) Query: 68 FHDHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREM 117 F+ H++ + + PR K+T+ Y ++ ++ + + +++ A A E+ Sbjct: 62 FYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEI 111 >gi|11501|lcl|protein:vir:78718 Length: 574 # NCBI annotation: DNA packaging protein # Family: family:all:697 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285469;genbank:gi:148724503;genbank:Ge neID:5220189 Length = 574 Score = 29.3 bits (64), Expect = 0.11, Method: Compositional matrix adjust. Identities = 33/165 (20%), Positives = 63/165 (38%), Gaps = 17/165 (10%) Query: 80 RQSGKSTIVTAYLLWYVLFNANVNVAILANKAATARE---MLQRLQLSYENLPNWMQQGI 136 R GKS I A++LW + + + + +++ A Q+L L E L + + Sbjct: 54 RGVGKSWITAAFVLWVLFVDPDRKIMVISASKERADNFSIFCQKLILDIEWLSHLRPRDS 113 Query: 137 LQ-WNRGSLELENGSKIMAASTSASAVRGM----SFNVIFLDEFAFIPNHIADQFFSSVY 191 Q W+R S ++ A S + + G +++ D+ N D + Sbjct: 114 DQRWSRISFDVGPAKPHQAPSVKSVGITGQMTGSRAHLMVFDDVEVPANSATDMQREKLL 173 Query: 192 PTISSGKS-------TKVIIISTPHGMNMFYKLWHDAERGTNEYV 229 +S +S +++ + TP Y+ AER +V Sbjct: 174 QLVSESESILVPDDDARIMFLGTPQSTFTIYR--KLAERSYRPFV 216 >gi|14590|lcl|protein:vir:8098 Length: 515 # NCBI annotation: gp2 # Family: family:all:523 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817679;genbank:gi:29566110;genbank:GeneID :1259304 Length = 515 Score = 28.9 bits (63), Expect = 0.16, Method: Compositional matrix adjust. Identities = 17/61 (27%), Positives = 34/61 (55%), Gaps = 6/61 (9%) Query: 148 NGSKIMAASTSASAVRGMSFNVIFLDE-FAFIPNHIADQFFSSVYPTISSGKSTKVIIIS 206 +G +++ + + S RG++ N + LDE FA H+ S+ PT+S+ +++I S Sbjct: 147 DGQRVIFKARTNSGGRGLTGNKVILDEGFALRHAHMG-----SLMPTLSAVPDPQLLIGS 201 Query: 207 T 207 + Sbjct: 202 S 202 >gi|15101|lcl|protein:vir:3867 Length: 563 # NCBI annotation: putative large terminase subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680484;swissprot:trembl:q8ltc3;genbank:gi :22296524;interpro:IPR005021;uniprot:Q8LTC3;genbank:Gene ID:951698 Length = 563 Score = 28.1 bits (61), Expect = 0.29, Method: Compositional matrix adjust. Identities = 26/106 (24%), Positives = 49/106 (46%), Gaps = 7/106 (6%) Query: 78 LPRQSGKSTIVTAYLLWYVLFNAN-VNVAILANKAATAREMLQRLQLSYENLPNWMQQG- 135 + R++GKS +++ +L+ LF N N L A ++ + + L M++ Sbjct: 97 MARKNGKSLLISGVILYEFLFGKNPANKRQLYTAANDRKQAGIVFGMVKDRLRALMRKDP 156 Query: 136 ----ILQWNRGSL-ELENGSKIMAASTSASAVRGMSFNVIFLDEFA 176 +++ R L L++GS I + S V G +V +DE+A Sbjct: 157 GIKRMVKITRDELVNLDDGSTIRSFSRDTGLVDGYEPHVAVVDEYA 202 >gi|15319|lcl|protein:vir:3140 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:543 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640322;genbank:gi:21234401;genbank:GeneID :956064 Length = 535 Score = 26.9 bits (58), Expect = 0.55, Method: Compositional matrix adjust. Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 5/50 (10%) Query: 74 NIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQL 123 +A +PR KS + ++ W + N V +A + A E L LQL Sbjct: 65 TLALMPRDHQKSHCIAVWVCWQIFKNPAVTIAYVC-----ATESLAILQL 109 >gi|7204|lcl|protein:vir:100065 Length: 577 # NCBI annotation: DNA maturase beta subunit # Family: family:all:697 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214228;genbank:gi:61806451;genbank:GeneID :3294745 Length = 577 Score = 26.9 bits (58), Expect = 0.57, Method: Compositional matrix adjust. Identities = 33/170 (19%), Positives = 68/170 (40%), Gaps = 28/170 (16%) Query: 80 RQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREM---LQRLQLSYENLPNWMQQG- 135 R GKS I A++LW + +A + I++ A M LQ+L + L + + Sbjct: 52 RGVGKSWITGAFVLWTLFNDAEKKIMIISASKERADNMSIFLQKLIIETPWLKHLRPKSD 111 Query: 136 ILQWNRGSLELENGSKIMAASTSASAVR---------GMSFNVIFLDEFAFIPNHIAD-- 184 +W+R S + ++ + A +V+ G +++ LD+ N + + Sbjct: 112 DARWSRISFD------VLCSPHQAPSVKSVGITGQLTGSRADLMILDDIEVPGNSMTELM 165 Query: 185 -----QFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERGTNEYV 229 Q + ++ ++++ + TP Y+ AER +V Sbjct: 166 REKLLQLCTEAESILTPKDDSRIMYLGTPQTTFTVYR--KLAERAYRPFV 213 >gi|8126|lcl|protein:vir:102661 Length: 568 # NCBI annotation: terminase # Family: family:all:147 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024418;genbank:gi:48696639;genbank:GeneID :2948128 Length = 568 Score = 25.8 bits (55), Expect = 1.2, Method: Compositional matrix adjust. Identities = 12/24 (50%), Positives = 14/24 (58%) Query: 205 ISTPHGMNMFYKLWHDAERGTNEY 228 I+TP G N FYKL AE+ Y Sbjct: 216 ITTPRGKNWFYKLAMHAEKSEEWY 239 >gi|3734|lcl|protein:vir:94555 Length: 585 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919024;genbank:gi:119637788;genbank:GeneI D:5179315 Length = 585 Score = 25.4 bits (54), Expect = 1.9, Method: Compositional matrix adjust. Identities = 13/45 (28%), Positives = 23/45 (51%) Query: 70 DHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATA 114 DH+ I + R GKS I A+++W + + + V I++ A Sbjct: 50 DHKKFILQAFRGIGKSFITCAFVVWVLWRDPQLKVLIVSASKERA 94 >gi|3994|lcl|protein:vir:99686 Length: 586 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249597;genbank:gi:68299748;genbank:GeneID :3800001 Length = 586 Score = 25.0 bits (53), Expect = 2.2, Method: Compositional matrix adjust. Identities = 32/170 (18%), Positives = 70/170 (41%), Gaps = 23/170 (13%) Query: 37 YFIKNYIKIVSLDEGLVPFNMY-----------HFQEEMVSKFH--DHRFNIAKLPRQSG 83 Y K+ +++++ V F M Q++M K D R I + R G Sbjct: 4 YLTKDQRRLLAMKNDFVLFLMVLWRALNLPEPTRCQKDMARKLAAGDERRFILQAFRGIG 63 Query: 84 KSTIVTAYLLWYVLFNANVNVAILANKAATA-------REMLQRLQLSYENLPNWMQQ-G 135 KS I A+++W + N + I++ A + ++ L +E P Q+ Sbjct: 64 KSFITCAFVVWKLWNNPQLKFMIVSASKERADANSIFIKRIIDLLPFLHELKPRPEQRDS 123 Query: 136 ILQWNRGSLELENGSKIMAASTSASAVRGMSFNVIFLDEFAFIPNHIADQ 185 ++ ++ G + ++ + + + + G +++ D+ +PN+ A Q Sbjct: 124 VISFDVGLAKPDHSPSVKSVGITGQ-LTGSRADILIADDVE-VPNNSATQ 171 >gi|27798|lcl|protein:vir:8408 Length: 214 # NCBI annotation: gp3 # Family: family:all:30874 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818304;genbank:gi:29566740;genbank:GeneID :1260058 Length = 214 Score = 24.6 bits (52), Expect = 3.1, Method: Compositional matrix adjust. Identities = 27/105 (25%), Positives = 41/105 (39%), Gaps = 21/105 (20%) Query: 79 PRQSGKSTIVTAYLLWY----VLFNANVNVAILANKAATAREMLQRLQLSYENLPNWMQQ 134 PRQ+GK+ AY L Y + F A A AAT +L P Sbjct: 25 PRQNGKTYAALAYALQYPGRVLYFGRGFREAGEAFAAAT--------KLGANRGPGT--- 73 Query: 135 GILQWNRGSLELENG-----SKIMAASTSASAVRGMSFNVIFLDE 174 IL+ N+ L +E ++ + RGM +++ LD+ Sbjct: 74 -ILKTNKSQLSIETSLGGDFGRVNFMPYGRGSGRGMGADLVILDD 117 >gi|13482|lcl|protein:vir:9757 Length: 471 # NCBI annotation: putative terminase # Family: family:all:523 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795519;genbank:gi:28876285;genbank:GeneID :1257826 Length = 471 Score = 24.6 bits (52), Expect = 3.3, Method: Compositional matrix adjust. Identities = 29/160 (18%), Positives = 70/160 (43%), Gaps = 16/160 (10%) Query: 78 LPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQLSYENLPNWM--QQG 135 +PR++GK+ +V LW + + + A++ +T+ ++++ Y + ++ + Sbjct: 70 IPRRNGKTEVVYIVELW--ALHKGLKILHTAHRISTSHASFEKVK-KYLEMSGYVDGEDF 126 Query: 136 ILQWNRGSLELE---NGSKIMAASTSASAVRGMSFNVIFLDEFAFIPNHIADQFFSSVYP 192 I +G +E +G+ I + +++ G F+++ +DE + S++ Sbjct: 127 ISNKAKGQERIEFKASGAVIQFRTRTSNGGLGEGFDLLIIDE----AQEYTSEQESALKY 182 Query: 193 TISSGKSTKVIIISTPHGM----NMFYKLWHDAERGTNEY 228 T++ + I+ TP M +F D +G Y Sbjct: 183 TVTDSDNPMTIMCGTPPTMVSTGTVFEAYRKDCLKGNKRY 222 >gi|18767|lcl|protein:vir:6335 Length: 601 # NCBI annotation: putative DNA maturase B # Family: family:all:697 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877482;genbank:gi:33300854;uniprot:Q7Y2C2 ;genbank:GeneID:1482617 Length = 601 Score = 23.9 bits (50), Expect = 4.8, Method: Compositional matrix adjust. Identities = 10/37 (27%), Positives = 16/37 (43%) Query: 80 RQSGKSTIVTAYLLWYVLFNANVNVAILANKAATARE 116 R KSTI Y++W + N +++ A E Sbjct: 59 RGEAKSTIACIYVVWCITQNPATRAMLVSGSGDKAEE 95 >gi|19975|lcl|protein:vir:108214 Length: 544 # NCBI annotation: gp5 # Family: family:all:523 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552334;genbank:gi:160700654;genbank:Ge neID:5758934 Length = 544 Score = 23.9 bits (50), Expect = 5.7, Method: Compositional matrix adjust. Identities = 12/57 (21%), Positives = 30/57 (52%), Gaps = 1/57 (1%) Query: 78 LPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAATAREMLQRLQLSYENLPNWMQQ 134 +PRQ+GK+ ++ ++ Y LF + A + T +++ R+ + P+ +++ Sbjct: 104 IPRQNGKTQLIALRII-YGLFFLGEKIVYTAQRWQTVKDVYDRIVEIIKRRPSLLRR 159 >gi|10332|lcl|protein:vir:97407 Length: 514 # NCBI annotation: terminase # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762597;genbank:gi:115304298;genbank:GeneI D:5130610 Length = 514 Score = 23.1 bits (48), Expect = 8.8, Method: Compositional matrix adjust. Identities = 24/99 (24%), Positives = 36/99 (36%), Gaps = 12/99 (12%) Query: 149 GSKIMAASTSASAVRGMSFNVIFLDEFAFIPNHIADQFFSSVYPTISSGKSTKVIIISTP 208 G+ S + SA+ G + +DEF PN+ PTI G ++ T Sbjct: 146 GATFRIVSAADSALNGGREKLAVVDEFGTFPNN--------PLPTIREGLEKNEGLLLTI 197 Query: 209 HGMNMFYKLWHDAE----RGTNEYVPTEVHWSEVPGRDD 243 N +D E RG N+ + W + DD Sbjct: 198 TSNNKVRGGAYDEELDTFRGYNDEMDNFKQWGLIFELDD 236 >gi|12298|lcl|protein:vir:79536 Length: 247 # NCBI annotation: putative major tail subunit # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272523;genbank:gi:148609392;genbank:Ge neID:5204372 Length = 247 Score = 23.1 bits (48), Expect = 9.8, Method: Compositional matrix adjust. Identities = 12/35 (34%), Positives = 15/35 (42%), Gaps = 1/35 (2%) Query: 507 QDMAPFGFVTDGMEDDYFADAQGDVWKVAEYGDKS 541 +D+ P D +D Y D D WK G KS Sbjct: 43 KDLQPGEMTADAEDDTYLDDEDAD-WKTTTQGQKS 76 Database: capsid_neck_tail Posted date: Nov 7, 2013 12:16 PM Number of letters in database: 206,069 Number of sequences in database: 514 Lambda K H 0.320 0.135 0.405 Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 247,432 Number of Sequences: 514 Number of extensions: 11197 Number of successful extensions: 162 Number of sequences better than 100.0: 61 Number of HSP's better than 100.0 without gapping: 39 Number of HSP's successfully gapped in prelim test: 22 Number of HSP's that attempted gapping in prelim test: 56 Number of HSP's gapped (non-prelim): 66 length of query: 547 length of database: 206,069 effective HSP length: 76 effective length of query: 471 effective length of database: 167,005 effective search space: 78659355 effective search space used: 78659355 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 39 (19.6 bits)