BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_015279.1_cdsid_YP_004322266.1 [gene=gp17]
[protein=terminase DNA packaging enzyme large subunit]
[protein_id=YP_004322266.1] [location=93934..95583]
         (549 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T...   917   0.0
gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: t...   847   0.0
gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: g...   840   0.0
gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: g...   837   0.0
gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: ter...   365   e-103
gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp...   352   9e-99
gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: ter...   351   1e-98
gi|20804|lcl|protein:vir:106290 Length: 633 # NCBI annotation: g...   349   5e-98
gi|28182|lcl|protein:vir:108053 Length: 611 # NCBI annotation: g...   349   6e-98
gi|22942|lcl|protein:vir:101803 Length: 613 # NCBI annotation: g...   348   8e-98
gi|23191|lcl|protein:vir:101186 Length: 613 # NCBI annotation: t...   348   8e-98
gi|24596|lcl|protein:vir:98262 Length: 609 # NCBI annotation: gp...   345   8e-97
gi|27701|lcl|protein:vir:6893 Length: 611 # NCBI annotation: gp1...   345   1e-96
gi|27407|lcl|protein:vir:7202 Length: 610 # NCBI annotation: gp1...   335   9e-94
gi|22056|lcl|protein:vir:103455 Length: 610 # NCBI annotation: t...   333   2e-93
gi|21077|lcl|protein:vir:100538 Length: 612 # NCBI annotation: g...   317   2e-88
gi|16897|lcl|protein:vir:5867 Length: 508 # NCBI annotation: sim...   181   2e-47
gi|17621|lcl|protein:vir:816 Length: 568 # NCBI annotation: hypo...    69   2e-13
gi|25680|lcl|protein:vir:10116 Length: 568 # NCBI annotation: hy...    69   2e-13
gi|51|lcl|protein:vir:3295 Length: 568 # NCBI annotation: putati...    69   2e-13
gi|26240|lcl|protein:vir:2763 Length: 568 # NCBI annotation: hyp...    69   2e-13
gi|25849|lcl|protein:vir:9949 Length: 568 # NCBI annotation: hyp...    69   2e-13
gi|1476|lcl|protein:vir:104436 Length: 568 # NCBI annotation: pu...    69   2e-13
gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: ph...    63   7e-12
gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: te...    63   8e-12
gi|1431|lcl|protein:vir:93626 Length: 530 # NCBI annotation: Bce...    48   3e-07
gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: t...    47   8e-07
gi|4674|lcl|protein:vir:105593 Length: 543 # NCBI annotation: te...    44   3e-06
gi|20547|lcl|protein:vir:105155 Length: 580 # NCBI annotation: c...    44   4e-06
gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: te...    40   7e-05
gi|26461|lcl|protein:vir:573 Length: 589 # NCBI annotation: unkn...    39   1e-04
gi|4240|lcl|protein:vir:94734 Length: 469 # NCBI annotation: lar...    37   5e-04
gi|7504|lcl|protein:vir:99915 Length: 478 # NCBI annotation: gp2...    35   0.002
gi|3859|lcl|protein:vir:107748 Length: 532 # NCBI annotation: gp...    35   0.002
gi|19904|lcl|protein:vir:3781 Length: 607 # NCBI annotation: ter...    34   0.005
gi|18494|lcl|protein:vir:1636 Length: 469 # NCBI annotation: put...    33   0.011
gi|16385|lcl|protein:vir:3744 Length: 607 # NCBI annotation: orf...    32   0.014
gi|14676|lcl|protein:vir:2499 Length: 474 # NCBI annotation: put...    30   0.095
gi|14590|lcl|protein:vir:8098 Length: 515 # NCBI annotation: gp2...    29   0.14
gi|11877|lcl|protein:vir:79128 Length: 593 # NCBI annotation: te...    29   0.16
gi|10294|lcl|protein:vir:105656 Length: 405 # NCBI annotation: p...    26   1.0
gi|8927|lcl|protein:vir:97005 Length: 632 # NCBI annotation: 40 ...    26   1.3
gi|15201|lcl|protein:vir:7028 Length: 631 # NCBI annotation: lar...    26   1.3
gi|25398|lcl|protein:vir:8929 Length: 717 # NCBI annotation: ORF...    26   1.5
gi|10061|lcl|protein:vir:99071 Length: 537 # NCBI annotation: gp...    26   1.5
gi|25110|lcl|protein:vir:80755 Length: 501 # NCBI annotation: pu...    25   1.8
gi|19767|lcl|protein:vir:6384 Length: 677 # NCBI annotation: ter...    25   2.8
gi|8126|lcl|protein:vir:102661 Length: 568 # NCBI annotation: te...    24   3.9
gi|15101|lcl|protein:vir:3867 Length: 563 # NCBI annotation: put...    23   6.4

>gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T4-like DNA
packaging large subunit terminase # Family: family:all:147 # MgeID: mge:1630
# MgeName: P-SSM2 # Cross-refs:
genbank:acc:YP_214360;genbank:gi:61806000;genbank:GeneID:3294466
          Length = 547

 Score = 917 bits (2370), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 422/544 (77%), Positives = 487/544 (89%), Gaps = 1/544 (0%) Query: 5 VYLGNPNLKKANTPIEFTEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHPYHFQEK 64 VYLGNPNLKKANTPIEF+++ + EFLKCKEDPVYF NYIKIVSLDEGL F+ Y FQEK Sbjct: 4 VYLGNPNLKKANTPIEFSKDNIREFLKCKEDPVYFTRNYIKIVSLDEGLVPFNMYDFQEK 63 Query: 65 LINNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQT 124 LI FH NRFNICKMPRQTGKSTT +SYLLHYAVFND+VN+ +LANKA+TAR+LLGRLQ Sbjct: 64 LITRFHENRFNICKMPRQTGKSTTCISYLLHYAVFNDNVNVAVLANKASTARDLLGRLQL 123 Query: 125 AYENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFAFVPNHVA 184 AYENLPRWMQQGII+WNKGSLELENGSKI A STS+SAVRG S+N++FLDEFAF+PNH+A Sbjct: 124 AYENLPRWMQQGIISWNKGSLELENGSKISANSTSSSAVRGGSYNVIFLDEFAFIPNHIA 183 Query: 185 DSFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMWHDAEKRKNEYIPTDVHWSEVPGRDD 244 D FFASVYPTITSG++TKVIIVSTP GMNHFYRMWHD+EK K+EY+ TDVHWSEVPGRD+ Sbjct: 184 DDFFASVYPTITSGQSTKVIIVSTPRGMNHFYRMWHDSEKGKSEYVATDVHWSEVPGRDE 243 Query: 245 KWKETTIANTSEQQFKVEFECEFLGSVDTLIAPSKLRTLIYDNPIQRNAGLDVYEPPKDK 304 +WKE TIANTSEQQFK+EFECEFLGSV+TLI P+KLR L+Y+ P RNAGLD+YE P + Sbjct: 244 EWKEQTIANTSEQQFKIEFECEFLGSVNTLINPAKLRNLVYEAPKTRNAGLDIYETPVKE 303 Query: 305 HDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPMLFPNIIYEVAKNYNSA 364 H+Y++TVDVARG+G DYSAF+ D TEFP+++VAKYRNN+IKPMLFPNII +VAK YN+A Sbjct: 304 HNYIITVDVARGLGNDYSAFIVFDTTEFPYKVVAKYRNNEIKPMLFPNIILDVAKGYNNA 363 Query: 365 YILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSGKKTQLGVKMSKTVKK 424 Y+L EVNDIGDQVASILQYDLEY+N+LM SMRGRAGQIVGQGFSGKKTQLGV+M+ VKK Sbjct: 364 YLLIEVNDIGDQVASILQYDLEYENVLMASMRGRAGQIVGQGFSGKKTQLGVRMTSAVKK 423 Query: 425 VGSLNLKTLIEEDKLIFNDYEIISELTTFISKHNSFEAEEGCNDDLAMCLVIYAWLVQMD 484 +G NLKT++E+DKL+ DYEIISELTTF +HNSFEAEEGCNDDLAMCLVI++WLV D Sbjct: 424 LGCSNLKTMMEDDKLLTCDYEIISELTTFAQRHNSFEAEEGCNDDLAMCLVIFSWLVAQD 483 Query: 485 YFKELTDQDVRKRLYEEQKNQIEQDMAPFGFMNDGLDDDSFTDNEGDRWFKADEYGDRSF 544 YFKE++D D+RKR+YEEQKNQIEQDMAPFGF+ DGLDD SF D +GD W DEYGD+S+ Sbjct: 484 YFKEMSDNDIRKRIYEEQKNQIEQDMAPFGFIADGLDDTSFVDKDGDTW-HLDEYGDKSY 542 Query: 545 MWEY 548 MW+Y Sbjct: 
543 MWDY 546 >gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: terminase large subunit gp17 # Family: family:all:147 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195134;genbank:gi:58532911;uniprot:Q5GQN8 ;genbank:GeneID:3260486 Length = 548 Score = 847 bits (2187), Expect = 0.0, Method: Compositional matrix adjust. Identities = 393/547 (71%), Positives = 470/547 (85%), Gaps = 1/547 (0%) Query: 2 SDNVYLGNPNLKKANTPIEFTEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHPYHF 61 SD +YLGNP LKKAN I+FT+EQV E++KC DPVYF NY+KIVSLDEGL F + F Sbjct: 3 SDQIYLGNPLLKKANVKIDFTKEQVKEWIKCANDPVYFTKNYVKIVSLDEGLVPFKMWDF 62 Query: 62 QEKLINNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGR 121 QE+LI FH NRFNI K+PRQTGKSTTVVSYLLHY +FND+VNIGILANKA+TAR+LL R Sbjct: 63 QEELIMKFHKNRFNIAKLPRQTGKSTTVVSYLLHYLIFNDNVNIGILANKASTARDLLAR 122 Query: 122 LQTAYENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFAFVPN 181 L TAYENLP+W+QQG++ WNKG++ELENGSKILAASTSASAVRGMSFNI+FLDEFAFVPN Sbjct: 123 LATAYENLPKWIQQGVVVWNKGNIELENGSKILAASTSASAVRGMSFNIIFLDEFAFVPN 182 Query: 182 HVADSFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMWHDAEKRKNEYIPTDVHWSEVPG 241 H+ADSFFASVYPTITSGK+TKVII+STP GMNHFY+MW DA +N Y +VHWS+VPG Sbjct: 183 HIADSFFASVYPTITSGKSTKVIIISTPQGMNHFYKMWVDATNGRNGYTFHEVHWSQVPG 242 Query: 242 RDDKWKETTIANTSEQQFKVEFECEFLGSVDTLIAPSKLRTLIYDNPIQRNAGLDVYEPP 301 RD+KWKE TI NTSE+QF EFECEFLGSVDTLIA SKL+ L++++PI+RN GLD+YE P Sbjct: 243 RDEKWKEETIKNTSERQFTQEFECEFLGSVDTLIAASKLKALVFNDPIKRNKGLDIYEEP 302 Query: 302 KDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPMLFPNIIYEVAKNY 361 K+K +Y+MTVDV+RG+G DYSAF+ DIT P+++V KYRNN+IKPMLFPNII ++A++Y Sbjct: 303 KEKSEYLMTVDVSRGIGGDYSAFIIFDITTVPYKVVGKYRNNEIKPMLFPNIINDLARSY 362 Query: 362 NSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSGKKTQLGVKMSKT 421 N+A++LCEVNDIGDQVASIL YDLEY N+LMC+MRGRAGQ+VGQGFSG KTQLGVKMS T Sbjct: 363 NNAWVLCEVNDIGDQVASILNYDLEYPNVLMCAMRGRAGQLVGQGFSGSKTQLGVKMSIT 422 Query: 422 VKKVGSLNLKTLIEEDKLIFNDYEIISELTTFISKHNSFEAEEGCNDDLAMCLVIYAWLV 481 VKKVG 
NLKT++EEDKLIFNDY+II+ELTTFI K SFEA+EG +DDL MC+VI+AWLV Sbjct: 423 VKKVGCANLKTIVEEDKLIFNDYDIINELTTFIQKKQSFEADEGFHDDLVMCMVIFAWLV 482 Query: 482 QMDYFKELTDQDVRKRLYEEQKNQIEQDMAPFGFMNDGLDDDSFTDNEGDRWFKADEYGD 541 Q DYFKE+TD D+R+R+Y+EQKNQIEQDMAPFGF+ GL+ + ++G W+ D + Sbjct: 483 QQDYFKEMTDNDIRQRIYDEQKNQIEQDMAPFGFITTGLEGEEGFVSDGTIWY-GDTQEN 541 Query: 542 RSFMWEY 548 +MW Y Sbjct: 542 VGYMWNY 548 >gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: gp123 # Family: family:all:147 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717790;genbank:gi:113200627;genbank:GeneI D:4239178 Length = 549 Score = 840 bits (2169), Expect = 0.0, Method: Compositional matrix adjust. Identities = 382/544 (70%), Positives = 458/544 (84%), Gaps = 1/544 (0%) Query: 6 YLGNPNLKKANTPIEFTEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHPYHFQEKL 65 YLGNPNLKKAN EFT +QV E +KC E+PVYF NYIKIVSLD+GL F Y+FQE++ Sbjct: 7 YLGNPNLKKANVSQEFTPDQVAEVIKCSENPVYFIKNYIKIVSLDKGLIPFDMYYFQEEM 66 Query: 66 INNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQTA 125 + FH+NRFNI K+PRQ+GKST V SYLL Y +FN +VN+ ILANKAATARE+L RLQ + Sbjct: 67 VQKFHDNRFNIAKLPRQSGKSTIVTSYLLWYVLFNANVNVAILANKAATAREMLQRLQLS 126 Query: 126 YENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFAFVPNHVAD 185 YENLP+W+QQGI+ WN+GSLELENGSKILAASTSASAVRGMSFN++FLDEFAFVPNHVAD Sbjct: 127 YENLPKWLQQGILQWNRGSLELENGSKILAASTSASAVRGMSFNVIFLDEFAFVPNHVAD 186 Query: 186 SFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMWHDAEKRKNEYIPTDVHWSEVPGRDDK 245 FF+SVYPTI+SGK+TKVII+STPHGMN FY++WHDAE++ NEYIPT+VHWSEVPGRD Sbjct: 187 QFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERKANEYIPTEVHWSEVPGRDAA 246 Query: 246 WKETTIANTSEQQFKVEFECEFLGSVDTLIAPSKLRTLIYDNPIQRNAGLDVYEPPKDKH 305 WKE TI NTSEQQF+VEFECEFLGSVDTLI+PSKLRT++Y +PI GL +YE H Sbjct: 247 WKEQTIKNTSEQQFRVEFECEFLGSVDTLISPSKLRTMVYGDPIAEKNGLSMYEKTIQGH 306 Query: 306 DYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPMLFPNIIYEVAKNYNSAY 365 YV+T DV+RGV DYSAF+ +D T P+++VAKYRNNDIKP+LFPNII +VA+NYN A+ Sbjct: 307 
TYVITADVSRGVSGDYSAFLVIDTTTIPYKLVAKYRNNDIKPILFPNIIVDVARNYNHAF 366 Query: 366 ILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSGKKTQLGVKMSKTVKKV 425 +L EVND+G QVA I+QYDLEY NLLMC+MRGRAGQ +GQGFSGKKTQ+G+KMS K+V Sbjct: 367 VLVEVNDVGGQVADIIQYDLEYDNLLMCAMRGRAGQQLGQGFSGKKTQMGIKMSSATKQV 426 Query: 426 GSLNLKTLIEEDKLIFNDYEIISELTTFISKHNSFEAEEGCNDDLAMCLVIYAWLVQMDY 485 G NLK L+E+DK + NDY+ ISELTTFI K +F+AEEGCNDDLAMC+VI+AW+ Y Sbjct: 427 GCSNLKALLEDDKFLLNDYDCISELTTFIQKGQTFQAEEGCNDDLAMCMVIFAWMAMQPY 486 Query: 486 FKELTDQDVRKRLYEEQKNQIEQDMAPFGFMNDGLDDDSFTDNEGDRWFKADEYGDRSFM 545 FKEL D DVR+R+Y++Q+ IEQDMAPFGFM+DGL ++ F D +GD W A EYGD+S+M Sbjct: 487 FKELHDNDVRQRIYDDQREAIEQDMAPFGFMDDGLGEEYFADAQGDVWMTA-EYGDKSYM 545 Query: 546 WEYR 549 WEYR Sbjct: 546 WEYR 549 >gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214662;genbank:gi:61806303;genbank:GeneID :3294591 Length = 550 Score = 837 bits (2163), Expect = 0.0, Method: Compositional matrix adjust. 
Identities = 378/545 (69%), Positives = 462/545 (84%), Gaps = 1/545 (0%) Query: 5 VYLGNPNLKKANTPIEFTEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHPYHFQEK 64 +YLGNPNLKKAN +FT++QV E++KC +DPVYF YI+IVSLDEG+ F Y+FQE Sbjct: 7 IYLGNPNLKKANVSTQFTKKQVAEYMKCAQDPVYFIRKYIRIVSLDEGVIPFDMYNFQED 66 Query: 65 LINNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQT 124 ++ FH +RFNI K+PRQ+GKST V +YLL Y +FN +VN+ ILANKA TARE+LGRLQ Sbjct: 67 MVTKFHQHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAPTAREMLGRLQL 126 Query: 125 AYENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFAFVPNHVA 184 +YENLP+WMQQGI+ WNKGSLELENGSKILA+STSASAVRGMSFNI+FLDEFAFVPNH+A Sbjct: 127 SYENLPKWMQQGILGWNKGSLELENGSKILASSTSASAVRGMSFNIIFLDEFAFVPNHIA 186 Query: 185 DSFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMWHDAEKRKNEYIPTDVHWSEVPGRDD 244 + FFASVYPTI+SGK+TKVII+STPHGMN FY++WHDAE+ N Y+ T+VHWS+VPGRDD Sbjct: 187 EQFFASVYPTISSGKSTKVIIISTPHGMNQFYKLWHDAERGANNYVATEVHWSQVPGRDD 246 Query: 245 KWKETTIANTSEQQFKVEFECEFLGSVDTLIAPSKLRTLIYDNPIQRNAGLDVYEPPKDK 304 KWK+ TI NTSE QF+VEFECEFLGSVDTLI PSKLR + Y +PIQ N GL VYE ++ Sbjct: 247 KWKQQTIENTSEAQFRVEFECEFLGSVDTLITPSKLRIMPYKDPIQENRGLAVYEHVQEN 306 Query: 305 HDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPMLFPNIIYEVAKNYNSA 364 H+Y++TVDV+RGVG DYSAF +D T P+++VA+Y+NN IKP++FPN+I +VA NYN A Sbjct: 307 HNYIITVDVSRGVGNDYSAFCVIDTTTVPYKVVARYKNNQIKPLVFPNLIVDVATNYNGA 366 Query: 365 YILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSGKKTQLGVKMSKTVKK 424 Y+LCEVNDIG QVA I+QYDLEY+NLLM SMRGRAGQ +GQGFSGKKTQLG+KMS VK+ Sbjct: 367 YVLCEVNDIGGQVADIIQYDLEYENLLMVSMRGRAGQQLGQGFSGKKTQLGIKMSTAVKQ 426 Query: 425 VGSLNLKTLIEEDKLIFNDYEIISELTTFISKHNSFEAEEGCNDDLAMCLVIYAWLVQMD 484 VG NLK LIE+DKLI DY+ I+ELTTFI K SF+AE+GCNDDLAMCLVI++W+ Sbjct: 427 VGCSNLKALIEDDKLIVEDYDTIAELTTFIQKGQSFQAEDGCNDDLAMCLVIFSWMAMQP 486 Query: 485 YFKELTDQDVRKRLYEEQKNQIEQDMAPFGFMNDGLDDDSFTDNEGDRWFKADEYGDRSF 544 YFKE+ D DVR+R+YE+Q++QIEQDMAPFGF++DGL++D F D +GD W + EYGD+S+ Sbjct: 487 YFKEMHDNDVRQRIYEDQRDQIEQDMAPFGFVSDGLEEDQFQDAQGDVW-QIAEYGDKSY 545 Query: 545 MWEYR 549 MWEYR Sbjct: 546 
MWEYR 550 >gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899601;genbank:gi:34419588;genbank:GeneID :2546011 Length = 600 Score = 365 bits (938), Expect = e-103, Method: Compositional matrix adjust. Identities = 200/531 (37%), Positives = 311/531 (58%), Gaps = 27/531 (5%) Query: 6 YLGNPNLKKANTPIEF---TEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHPYHFQ 62 Y+ PNL++AN PI F E EF KC++D VYFA NY IV +D G + P +Q Sbjct: 70 YMNIPNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRPYQ 129 Query: 63 EKLINNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRL 122 ++++ +RF+I +PRQ GK+T + +L HY VFN+ GILA+K + + E+L R+ Sbjct: 130 KEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLERV 189 Query: 123 QTAYENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFAFVPNH 182 + ENLP ++Q GI WNKG++ +NG K+ A ++ + AVRG SF+++++DE AFVP Sbjct: 190 KNVIENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPGF 249 Query: 183 VADSFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMWHDAEKRKNEYIPTDVHWSEVPGR 242 D F+ + +P I+SG+ +KV++ STP+G+NH++ MW+ A + + + P W V R Sbjct: 250 --DDFWKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRAVQNR 307 Query: 243 -------DD--KWKETTIANTSEQQFKVEFECEFLGSVDTLIAPSKLRTLIYDNPIQRNA 293 DD +K TI NTS + F E C FLG+ TLI KL + + ++ + Sbjct: 308 LYKDGEFDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVVKDSD 367 Query: 294 GLDVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPMLFPNI 353 G VY+ P++ H Y++TVD + G G+DY A +D+T +P VA + +N +L P I Sbjct: 368 GWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAI 427 Query: 354 IYEVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSGKKTQ 413 I + A YN AY+ CE+ G+ V + L DLEY+N++ M RA SG + Sbjct: 428 IMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVI---MEERA--------SGGRRG 476 Query: 414 LGVKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFISKHNSFEAEEGCNDDLAMC 473 LG+K +K K +G LK LIE+D+L N + E TF+ K S+EAEEG +DDL M Sbjct: 477 
LGLKPNKKTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMS 536 Query: 474 LVIYAWLVQMDYFKELTDQD--VRKRLYEEQKNQIEQDMAPFGFMNDGLDD 522 L + A+L D F + +++ V +++++ + + D PF + DG+++ Sbjct: 537 LTLLAYLSTQDRFSDFVEKEYNVSYDIFKQEVHDMMDDDVPFLMIADGIEN 587 >gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469498;genbank:gi:157311455;genbank:Ge neID:5602125 Length = 607 Score = 352 bits (902), Expect = 9e-99, Method: Compositional matrix adjust. Identities = 199/527 (37%), Positives = 293/527 (55%), Gaps = 35/527 (6%) Query: 2 SDNV---YLGNPNLKKANTPIEFTEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHP 58 SDN+ Y+ NL++AN ++T E + E+ +C++D VYFA Y I +D G + Sbjct: 78 SDNIRTRYMRLKNLRRANIKTQYTPEMIAEWKRCRKDIVYFAETYCAITHIDYGTIKVQL 137 Query: 59 YHFQEKLINNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATAREL 118 +Q+ ++ H NR + K+ RQ GK+T V +L HY FN +GILA+K + A E+ Sbjct: 138 RDYQKDMLKIMHENRMSAHKLSRQLGKTTAVAIFLAHYVCFNKDKAVGILAHKGSMAVEV 197 Query: 119 LGRLQTAYENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFAF 178 L R + A E LP ++Q GI+ WNK S+ LENGS I A ++S AVRG SF+ +++DE AF Sbjct: 198 LERTKQAIELLPDFLQPGIVEWNKKSIVLENGSSIGAYASSPDAVRGNSFSFIYIDECAF 257 Query: 179 VPNHVADSFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMWHDAEKRKNEYIPTDVHWSE 238 + N D F A + P I+SG+ +K+I+ +TP+G+NHFY +W A K+ Y+P + W Sbjct: 258 IQNWT-DCFLA-IQPVISSGRESKMIMTTTPNGLNHFYDIWQSAIDGKSGYVPYEAVWHS 315 Query: 239 VPGR--------DD--KWKETTIANTSEQQFKVEFECEFLGSVDTLIAPSKLRTLIYDNP 288 V R DD +W IA +S +QF E EF GS TLI + L L + + Sbjct: 316 VKERLYNKADIFDDGYEWSSQAIAGSSLEQFLQEHNAEFFGSSGTLIRATTLSRLSFIDV 375 Query: 289 IQRNAGLDVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPM 348 + N G +E PK+ YV T+D + G G+DY A +DITEFP++ VA Y +N Sbjct: 376 VNDN-GFYQFEKPKEGRKYVATLDCSEGRGQDYHALQIIDITEFPYKPVAVYHSNTTSHF 434 Query: 349 LFPNIIYEVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFS 408 + P+I+++ YN + E+N G +A L DLEY N++ S Sbjct: 435 
ILPDIVFKYLMMYNECPVYIELNSTGVSIAKSLAMDLEYDNIICDSF------------- 481 Query: 409 GKKTQLGVKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFISKHNSFEAEEGCND 468 LG+K SK K +G LK LIE+DKLI N I EL TF K S+ AEEG +D Sbjct: 482 ---IDLGMKQSKRSKAMGCSALKDLIEKDKLIINHKGTIQELRTFSEKGVSWAAEEGFHD 538 Query: 469 DLAMCLVIYAWLVQMDYFKELTDQD---VRKRLYEEQKNQIEQDMAP 512 DL M LVI+ WL + F E +D + ++ ++ +++ ++ AP Sbjct: 539 DLVMSLVIFGWLTTQEKFAEYAGKDEMRIASEIFRKELDELGEEYAP 585 >gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891724;genbank:gi:33620526;genbank:GeneID :1725332 Length = 607 Score = 351 bits (901), Expect = 1e-98, Method: Compositional matrix adjust. Identities = 199/527 (37%), Positives = 293/527 (55%), Gaps = 35/527 (6%) Query: 2 SDNV---YLGNPNLKKANTPIEFTEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHP 58 SDN+ Y+ NL++AN ++T E + E+ +C++D VYFA Y I +D G + Sbjct: 78 SDNIRTRYMRLKNLRRANIKTQYTPEMIAEWKRCRKDIVYFAETYCAITHIDYGTIKVQL 137 Query: 59 YHFQEKLINNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATAREL 118 +Q+ ++ H NR + K+ RQ GK+T V +L HY FN +GILA+K + A E+ Sbjct: 138 RDYQKDMLKIMHENRMSAHKLSRQLGKTTAVAIFLAHYVCFNKDKAVGILAHKGSMAVEV 197 Query: 119 LGRLQTAYENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFAF 178 L R + A E LP ++Q GI+ WNK S+ LENGS I A ++S AVRG SF+ +++DE AF Sbjct: 198 LERTKQAIELLPDFLQPGIVEWNKKSIVLENGSSIGAYASSPDAVRGNSFSFIYIDECAF 257 Query: 179 VPNHVADSFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMWHDAEKRKNEYIPTDVHWSE 238 + N D F A + P I+SG+ +K+I+ +TP+G+NHFY +W A K+ Y+P + W Sbjct: 258 IQNWT-DCFLA-IQPVISSGRESKMIMTTTPNGLNHFYDIWQSAIDGKSGYVPYEAVWHS 315 Query: 239 VPGR--------DD--KWKETTIANTSEQQFKVEFECEFLGSVDTLIAPSKLRTLIYDNP 288 V R DD +W IA +S +QF E EF GS TLI + L L + + Sbjct: 316 VKERLYNKADIFDDGYEWSSQAIAGSSLEQFLQEHNAEFFGSSGTLIRATTLSRLSFIDV 375 Query: 289 IQRNAGLDVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPM 348 + N G +E PK+ YV T+D + G G+DY A +DITEFP++ VA Y +N Sbjct: 376 
VNDN-GFYQFEKPKEGRKYVATLDCSEGRGQDYHALQIIDITEFPYKQVAVYHSNTTSHF 434 Query: 349 LFPNIIYEVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFS 408 + P+I+++ YN + E+N G +A L DLEY N++ S Sbjct: 435 ILPDIVFKYLMMYNECPVYIELNSTGVSIAKSLAMDLEYDNIICDSF------------- 481 Query: 409 GKKTQLGVKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFISKHNSFEAEEGCND 468 LG+K SK K +G LK LIE+DKLI N I EL TF K S+ AEEG +D Sbjct: 482 ---IDLGMKQSKRSKAMGCSALKDLIEKDKLIINHKGTIQELRTFSEKGVSWAAEEGFHD 538 Query: 469 DLAMCLVIYAWLVQMDYFKELTDQD---VRKRLYEEQKNQIEQDMAP 512 DL M LVI+ WL + F E +D + ++ ++ +++ ++ AP Sbjct: 539 DLVMSLVIFGWLTTQEKFAEYAGKDEMRIASEIFRKELDELGEEYAP 585 >gi|20804|lcl|protein:vir:106290 Length: 633 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944105;genbank:gi:38640149;genbank:GeneID :2658038 Length = 633 Score = 349 bits (896), Expect = 5e-98, Method: Compositional matrix adjust. Identities = 185/528 (35%), Positives = 305/528 (57%), Gaps = 31/528 (5%) Query: 6 YLGNPNLKKANTPIEFTEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHPYHFQEKL 65 YLG PNLK+AN P ++T E V E+ +C++D VYFA Y I+ +D G+ + +Q+ + Sbjct: 111 YLGLPNLKRANVPTKWTREMVEEWKRCRDDIVYFAETYCSIIHIDWGVIKVQLRDYQKDM 170 Query: 66 INNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQTA 125 + + R ++ +PRQ GK+T +L H+ VFN++ +G+LA+K ++E+L R + + Sbjct: 171 LRIMASERMSMHNLPRQLGKTTATAIFLTHFVVFNEAKAVGVLAHKGDMSKEVLERTKQS 230 Query: 126 YENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFAFVPNHVAD 185 E LP ++Q GI+ WNKG++ELENG I A ++S AVRG SF ++++DE AF+ + Sbjct: 231 IELLPDFLQPGIVEWNKGNIELENGCSIGAYASSPDAVRGNSFALIYVDECAFIEGF--E 288 Query: 186 SFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMWHDAEKRKNEYIPTDVHWSEVPGR--- 242 + ++ P I+SG+ +++I+ STP+G+NH+Y +W + K + P W V R Sbjct: 289 DTWKAILPVISSGRQSRIILTSTPNGINHWYDLWEVSLKSDKGFKPYTTTWITVKERLYD 348 Query: 243 -----DD--KWKETTIANTSEQQFKVEFECEFLGSVDTLIAPSKLRTLIYDNPIQRNAGL 295 DD +W I ++S + F+ E C F+G+ TLI KL + + I + Sbjct: 349 
GSDAYDDGFEWASKQINSSSVEAFQQEHLCRFMGTSGTLINGFKLSKMTWKEVIADDNFY 408 Query: 296 DVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPMLFPNIIY 355 + E P + + Y+ TVD A G G+DYS +D+T +P+R VA Y +N I P+L P++I Sbjct: 409 QI-EKPVEGNKYIATVDPAEGRGQDYSTIQIIDVTSYPYRQVAVYHSNKISPLLLPSVIM 467 Query: 356 EVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSGKKTQLG 415 A YN+A++ E+N IG+ VA L DLEY+N+++ S + LG Sbjct: 468 RYAMEYNNAWVYIELNSIGNMVAKSLFIDLEYENVIVDSSK----------------DLG 511 Query: 416 VKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFISKHNSFEAEEGCNDDLAMCLV 475 +K +K K VG LK LIE+DKLI + I E TF+ K S+ A++G +DDL M L Sbjct: 512 MKQTKVTKAVGCSTLKDLIEKDKLIVSHKGTIQEFRTFVEKGVSWAAQDGFHDDLVMSLC 571 Query: 476 IYAWLVQMDYFKELTD--QDVRKRLYEEQKNQIEQDMAPFGFMNDGLD 521 I+A+L + F + D +++ +++ + ++ +D ++DG++ Sbjct: 572 IFAYLTTQERFGDFIDATRNIGADVFQSEMEEMLEDFCVGAIIDDGIN 619 >gi|28182|lcl|protein:vir:108053 Length: 611 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595293;genbank:gi:161622599;genbank:Ge neID:5783772 Length = 611 Score = 349 bits (895), Expect = 6e-98, Method: Compositional matrix adjust. 
Identities = 196/533 (36%), Positives = 295/533 (55%), Gaps = 33/533 (6%) Query: 6 YLGNPNLKKANTPIEFTEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHPYHFQEKL 65 Y+G PNLK+AN ++T E V E+ KC++D VYFA Y I +D G + +Q + Sbjct: 87 YMGLPNLKRANIKTQWTREMVSEWKKCRDDIVYFAETYCAITHIDYGTIKVQLRDYQRDM 146 Query: 66 INNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQTA 125 + NR C + RQ GK+T V +L H+ FN +GILA+K + + E+L R + A Sbjct: 147 LKIMSKNRMTTCNLSRQLGKTTVVAIFLAHFVCFNKDKAVGILAHKGSMSAEVLDRTKQA 206 Query: 126 YENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFAFVPNHVAD 185 E LP ++Q GI+ WNKGS+EL+NGS I A ++S AVRG SF ++++DE AF+PN + D Sbjct: 207 IELLPDFLQPGIVEWNKGSIELDNGSSIGAYASSPDAVRGNSFAMIYIDECAFIPNFL-D 265 Query: 186 SFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMWHDAEKRKNEYIPTDVHWSEVPGR--- 242 S+ A + P I+SG+ +K+II +TP+G+NHFY +W A + K+ + P W+ V R Sbjct: 266 SWLA-IQPVISSGRRSKIIITTTPNGLNHFYDIWTAAVEGKSGFAPYTAIWNSVKERLYN 324 Query: 243 -----DD--KWKETTIANTSEQQFKVEFECEFLGSVDTLIAPSKLRTLIYDNPIQRNAGL 295 DD +W TI+ +S QF+ E EF G+ TLI+ KL + + I N Sbjct: 325 DADIFDDGWEWSSQTISASSLAQFRQEHCAEFQGTSGTLISGMKLAIMDWKEVIPENGYF 384 Query: 296 DVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPMLFPNIIY 355 + P H Y+ ++D + G G+DY A +D+T VA +N+I M+ P+I+Y Sbjct: 385 YRFHEPDPTHKYIASLDCSEGRGQDYHALHIIDVTTDEWEQVAVLHSNEISHMILPDIVY 444 Query: 356 EVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSGKKTQLG 415 + YN A + E+N G VA L DLEY+N++ SM+ LG Sbjct: 445 KYLMEYNEAPVYIELNSTGVSVAKSLYMDLEYENVICDSMQ----------------DLG 488 Query: 416 VKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFISKHNSFEAEEGCNDDLAMCLV 475 +K ++ K VG LK LIE+DKL N + I E TF S+ AE+G +DDL M LV Sbjct: 489 MKQTRRTKPVGCSTLKDLIEKDKLKLNHKQTIMEFRTFSQNKLSWAAEDGFHDDLVMSLV 548 Query: 476 IYAWLVQMDYFKELTDQD---VRKRLYEEQKNQIEQDMAPFGFMNDGLDDDSF 525 I+AWL F + D+D + ++ + + ++ P F++ G D+S+ Sbjct: 549 IFAWLTTQQKFADFIDRDEMRLASEVFSRELEDMNEEYNPVVFVDAG--DNSY 599 >gi|22942|lcl|protein:vir:101803 Length: 613 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: 
genbank:acc:YP_238880;genbank:gi:66391955;genbank:GeneID :3416630 Length = 613 Score = 348 bits (894), Expect = 8e-98, Method: Compositional matrix adjust. Identities = 201/544 (36%), Positives = 302/544 (55%), Gaps = 48/544 (8%) Query: 6 YLGNPNLKKANTPIEFTEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHPYHFQEKL 65 YLG PNLK+AN I++T+E + E +CKED VYFA NY I +D G+ + +Q+ + Sbjct: 78 YLGLPNLKRANIKIKWTKEMLAERKRCKEDIVYFAENYCCIEHIDYGIIRVQLRDYQKDM 137 Query: 66 INNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQTA 125 + NR + RQ GK+T V +L H+ FN + N+GILA+KA+ + E+L R + A Sbjct: 138 LRIMAGNRLMAANLSRQLGKTTVVAIFLAHFVCFNSAKNVGILAHKASMSAEVLHRTKQA 197 Query: 126 YENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFAFVPNHVAD 185 E LP ++Q GI+ WNKGS+ L NG I A S+S AVRG SF ++++DE AF+PN D Sbjct: 198 LELLPDFLQPGIVEWNKGSITLGNGCAIGAFSSSPDAVRGNSFALIYVDEVAFIPN-FTD 256 Query: 186 SFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMWHDA-------EKRKNEYIPTDVHWSE 238 ++ A + P I+SG+ +K+++ +TP+G+NH+Y +W A + K+ ++P WS Sbjct: 257 AWMA-IQPVISSGRRSKILMTTTPNGLNHWYDIWTAAITPNSSGDGSKSGFVPYTATWSS 315 Query: 239 VPGR---------------DD--KWKETTIANTSEQQFKVEFECEFLGSVDTLIAPSKLR 281 V R DD W TIA ++ F+ E F G+ TLI +KL Sbjct: 316 VKERLYSDGKELSGSDSYFDDGYSWSSKTIAGSALDAFQQEHNTAFQGTSGTLINGTKLS 375 Query: 282 TLIY-DNPIQRNAGLDVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKY 340 L + D P Q N ++E PK+ Y+ T+D A G G+DY A DITEFP++ VA Y Sbjct: 376 KLNWIDIPPQDN--FTMFEEPKEGRKYIATLDSAEGRGQDYHAMHIFDITEFPYKQVAVY 433 Query: 341 RNNDIKPMLFPNIIYEVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAG 400 +N ++ P+++ + Y YI E+N G +A L +L+Y+N++ S + Sbjct: 434 HSNTTSHLILPDVLLKYLNMYFQPYIYIELNSTGVSIAKSLYSELDYENVICDSYQ---- 489 Query: 401 QIVGQGFSGKKTQLGVKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFISKHNSF 460 LG+K +K K +G LK LIE+DKLI N + I EL TF K S+ Sbjct: 490 ------------DLGLKQTKRSKAIGCSTLKDLIEKDKLILNHKKSIMELRTFSEKGVSW 537 Query: 461 EAEEGCNDDLAMCLVIYAWLVQMDYFKELTDQD---VRKRLYEEQKNQIEQDMAPFGFMN 517 AEEG +DDL M LVI+AWL + F + T+ D + ++ ++ ++ D P ++ Sbjct: 538 
AAEEGFHDDLVMSLVIFAWLTTQERFSDFTENDDMRLANEVFRKEMEELNDDYMPMVIVD 597 Query: 518 DGLD 521 DG D Sbjct: 598 DGED 601 >gi|23191|lcl|protein:vir:101186 Length: 613 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932508;genbank:gi:37651634;genbank:GeneID :2610679 Length = 613 Score = 348 bits (894), Expect = 8e-98, Method: Compositional matrix adjust. Identities = 201/544 (36%), Positives = 302/544 (55%), Gaps = 48/544 (8%) Query: 6 YLGNPNLKKANTPIEFTEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHPYHFQEKL 65 YLG PNLK+AN I++T+E + E +CKED VYFA NY I +D G+ + +Q+ + Sbjct: 78 YLGLPNLKRANIKIKWTKEMLAERKRCKEDIVYFAENYCCIEHIDYGIIRVQLRDYQKDM 137 Query: 66 INNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQTA 125 + NR + RQ GK+T V +L H+ FN + N+GILA+KA+ + E+L R + A Sbjct: 138 LRIMAGNRLMAANLSRQLGKTTVVAIFLAHFVCFNSAKNVGILAHKASMSAEVLHRTKQA 197 Query: 126 YENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFAFVPNHVAD 185 E LP ++Q GI+ WNKGS+ L NG I A S+S AVRG SF ++++DE AF+PN D Sbjct: 198 LELLPDFLQPGIVEWNKGSITLGNGCAIGAFSSSPDAVRGNSFALIYVDEVAFIPN-FTD 256 Query: 186 SFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMWHDA-------EKRKNEYIPTDVHWSE 238 ++ A + P I+SG+ +K+++ +TP+G+NH+Y +W A + K+ ++P WS Sbjct: 257 AWMA-IQPVISSGRRSKILMTTTPNGLNHWYDIWTAAITPNSSGDGSKSGFVPYTATWSS 315 Query: 239 VPGR---------------DD--KWKETTIANTSEQQFKVEFECEFLGSVDTLIAPSKLR 281 V R DD W TIA ++ F+ E F G+ TLI +KL Sbjct: 316 VKERLYSDGKELSGSDSYFDDGYSWSSKTIAGSALDAFQQEHNTAFQGTSGTLINGTKLS 375 Query: 282 TLIY-DNPIQRNAGLDVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKY 340 L + D P Q N ++E PK+ Y+ T+D A G G+DY A DITEFP++ VA Y Sbjct: 376 KLNWIDIPPQDN--FTMFEEPKEGRKYIATLDSAEGRGQDYHAMHIFDITEFPYKQVAVY 433 Query: 341 RNNDIKPMLFPNIIYEVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAG 400 +N ++ P+++ + Y YI E+N G +A L +L+Y+N++ S + Sbjct: 434 HSNTTSHLILPDVLLKYLNMYFQPYIYIELNSTGVSIAKSLYSELDYENVICDSYQ---- 489 Query: 401 QIVGQGFSGKKTQLGVKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFISKHNSF 460 
LG+K +K K +G LK LIE+DKLI N + I EL TF K S+ Sbjct: 490 ------------DLGLKQTKRSKAIGCSTLKDLIEKDKLILNHKKSIMELRTFSEKGVSW 537 Query: 461 EAEEGCNDDLAMCLVIYAWLVQMDYFKELTDQD---VRKRLYEEQKNQIEQDMAPFGFMN 517 AEEG +DDL M LVI+AWL + F + T+ D + ++ ++ ++ D P ++ Sbjct: 538 AAEEGFHDDLVMSLVIFAWLTTQERFSDFTENDDMRLANEVFRKEMEELNDDYMPMVIVD 597 Query: 518 DGLD 521 DG D Sbjct: 598 DGED 601 >gi|24596|lcl|protein:vir:98262 Length: 609 # NCBI annotation: gp17 terminase subunit, nuclease and ATPase # Family: family:all:147 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239195;genbank:gi:66391670;genbank:GeneID :3416364 Length = 609 Score = 345 bits (885), Expect = 8e-97, Method: Compositional matrix adjust. Identities = 189/519 (36%), Positives = 293/519 (56%), Gaps = 33/519 (6%) Query: 6 YLGNPNLKKANTPIEFTEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHPYHFQEKL 65 Y GNPNLK+A ++T+E ++E++KC++D VYFA Y I +D G + +Q+++ Sbjct: 85 YNGNPNLKRAYVQTKWTKEMLMEWVKCRDDIVYFAETYCAITHIDYGTIKVQLRDYQKEM 144 Query: 66 INNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQTA 125 + H NR C + RQ GK+T V +L H+ FN+ +G+LA+KA+ + E+L R + A Sbjct: 145 LIEMHKNRMVTCNLSRQLGKTTVVAIFLAHFVCFNEDKYVGVLAHKASMSAEVLDRTKQA 204 Query: 126 YENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFAFVPNHVAD 185 E LP ++Q GI+ WNKGS+EL+N KI A ++S AVRG SF ++++DE AF+PN D Sbjct: 205 IELLPDFLQPGIVEWNKGSIELDNKCKIGAFASSPDAVRGNSFAMIYIDECAFIPN-FTD 263 Query: 186 SFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMWHDAEKRKNEYIPTDVHWSEVPGR--- 242 ++ A + P I+SG+ +K++I +TP+G+NHFY +W+ A + K+ ++P W+ V R Sbjct: 264 AWLA-IQPVISSGRKSKILITTTPNGLNHFYDIWNAAVEGKSGFVPYTAIWTSVKERLYT 322 Query: 243 -------DD--KWKETTIANTSEQQFKVEFECEFLGSVDTLIAPSKLRTLIYDNPIQRNA 293 DD W IA +S++ F E EF+G+ TLI+ KL + + + + Sbjct: 323 DGDNGVFDDGYSWSAKMIAGSSKEAFLQEHCAEFMGTNGTLISGWKLSKMSWIDIDETET 382 Query: 294 GLDVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPMLFPNI 353 Y+ P++ H YV +D A G G+DY A +DIT P VA Y +N ++ P+I Sbjct: 383 NFYQYKKPEEGHKYVAVLDPAEGRGQDYHAMHIIDITTLPFEQVAVYHSNRTSHLILPDI 442 Query: 354 
IYEVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSGKKTQ 413 + YN A+I E+N G VA L +LEY+N++ S Sbjct: 443 LLRYLTMYNEAWIYIELNSTGHSVAKSLFSELEYENVICDSY----------------ND 486 Query: 414 LGVKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFISKHNSFEAEEGCNDDLAMC 473 LG+K +K K +G LK LIE+DKLI N+ + I E TF K S+ AEEG +DDL M Sbjct: 487 LGMKQTKRSKAIGCSTLKDLIEKDKLIINNKKTILEFRTFSEKGVSWAAEEGFHDDLVMS 546 Query: 474 LVIYAWLVQMDYFKELTDQD---VRKRLYEEQKNQIEQD 509 L + WL F E ++D + ++ ++ Q+ +D Sbjct: 547 LACFGWLTTQLKFAEFCEKDDLRLANEVFAREREQLYED 585 >gi|27701|lcl|protein:vir:6893 Length: 611 # NCBI annotation: gp17 terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861869;genbank:gi:32453660;genbank:GeneID :1494295 Length = 611 Score = 345 bits (884), Expect = 1e-96, Method: Compositional matrix adjust. Identities = 193/525 (36%), Positives = 289/525 (55%), Gaps = 31/525 (5%) Query: 6 YLGNPNLKKANTPIEFTEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHPYHFQEKL 65 Y+G PNLK+AN ++T E V E+ KC++D VYFA Y I +D G + +Q + Sbjct: 87 YMGLPNLKRANIKTQWTYEMVAEWKKCRDDIVYFAETYCAITHIDYGTIKVQLRDYQRDM 146 Query: 66 INNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQTA 125 + + R +C + RQ GK+T V +L H+ FN +GILA+K + + E+L R + A Sbjct: 147 LKIMSSKRMTVCNLSRQLGKTTVVAIFLAHFVCFNKDKAVGILAHKGSMSAEVLDRTKQA 206 Query: 126 YENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFAFVPNHVAD 185 E LP ++Q GI+ WNKGS++L+NGS I A ++S AVRG SF ++++DE AF+PN + D Sbjct: 207 IELLPDFLQPGIVEWNKGSIQLDNGSSIGAYASSPDAVRGNSFAMIYIDECAFIPNFI-D 265 Query: 186 SFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMWHDAEKRKNEYIPTDVHWSEVPGR--- 242 S+ A + P I+SG+ +K+II +TP+G+NHFY +W A + K+ + P W+ V R Sbjct: 266 SWLA-IQPVISSGRRSKIIITTTPNGLNHFYDIWTAAVEGKSGFEPYTAIWNSVKERLYN 324 Query: 243 -----DD--KWKETTIANTSEQQFKVEFECEFLGSVDTLIAPSKLRTLIYDNPIQRNAGL 295 DD +W + TI+ +S QF+ E F G+ TLI+ KL L Y + G Sbjct: 325 DEDIFDDGWQWSKQTISASSLTQFRQEHTAAFEGTSGTLISGMKLAILDYIEVTPDSHGF 384 Query: 296 
DVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPMLFPNIIY 355 ++ P++ H Y+ T+D + G G+DY A +D+T V +N I ++ P+I++ Sbjct: 385 HQFKKPEEGHKYIATLDCSEGRGQDYHAMHIIDVTTDKWEQVGVLHSNTISHLILPDIVF 444 Query: 356 EVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSGKKTQLG 415 + YN I E+N G VA L DLEY+N++ SM LG Sbjct: 445 KYLMEYNECPIYIELNSTGVSVAKSLYMDLEYENVICDSM----------------NDLG 488 Query: 416 VKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFISKHNSFEAEEGCNDDLAMCLV 475 +K S+ K VG LK LIE+DKL N I E TF K S+ AEEG +DDL M LV Sbjct: 489 MKQSRRTKPVGCSTLKDLIEKDKLKINHRATIQEFRTFSEKGVSWAAEEGYHDDLVMGLV 548 Query: 476 IYAWLVQMDYFKELTDQD---VRKRLYEEQKNQIEQDMAPFGFMN 517 I+ WL F + D+D + ++ + + D AP F++ Sbjct: 549 IFGWLSTQQKFADYADKDDMRLASEVFSRELQDMNDDYAPVIFVD 593 >gi|27407|lcl|protein:vir:7202 Length: 610 # NCBI annotation: gp17 terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049776;genbank:gi:9632591;genbank:GeneID: 1258647 Length = 610 Score = 335 bits (858), Expect = 9e-94, Method: Compositional matrix adjust. 
Identities = 189/525 (36%), Positives = 286/525 (54%), Gaps = 31/525 (5%) Query: 6 YLGNPNLKKANTPIEFTEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHPYHFQEKL 65 Y+G PNLK+AN ++T E V E+ KC++D VYFA Y I +D G+ + +Q + Sbjct: 87 YMGLPNLKRANIKTQWTREMVEEWKKCRDDIVYFAETYCAITHIDYGVIKVQLRDYQRDM 146 Query: 66 INNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQTA 125 + + R +C + RQ GK+T V +L H+ FN +GILA+K + + E+L R + A Sbjct: 147 LKIMSSKRMTVCNLSRQLGKTTVVAIFLAHFVCFNKDKAVGILAHKGSMSAEVLDRTKQA 206 Query: 126 YENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFAFVPNHVAD 185 E LP ++Q GI+ WNKGS+EL+NGS I A ++S AVRG SF ++++DE AF+PN D Sbjct: 207 IELLPDFLQPGIVEWNKGSIELDNGSSIGAYASSPDAVRGNSFAMIYIDECAFIPN-FHD 265 Query: 186 SFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMWHDAEKRKNEYIPTDVHWSEVPGR--- 242 S+ A + P I+SG+ +K+II +TP+G+NHFY +W A + K+ + P W+ V R Sbjct: 266 SWLA-IQPVISSGRRSKIIITTTPNGLNHFYDIWTAAVEGKSGFEPYTAIWNSVKERLYN 324 Query: 243 -----DDKWKET--TIANTSEQQFKVEFECEFLGSVDTLIAPSKLRTLIYDNPIQRNAGL 295 DD W+ + TI +S QF+ E F G+ TLI+ KL + + + G Sbjct: 325 DEDIFDDGWQWSIQTINGSSLAQFRQEHTAAFEGTSGTLISGMKLAVMDFIEVTPDDHGF 384 Query: 296 DVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPMLFPNIIY 355 ++ P+ Y+ T+D + G G+DY A +D+T+ V +N I ++ P+I+ Sbjct: 385 HQFKKPEPDRKYIATLDCSEGRGQDYHALHIIDVTDDVWEQVGVLHSNTISHLILPDIVM 444 Query: 356 EVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSGKKTQLG 415 YN + E+N G VA L DLEY+ ++ S T LG Sbjct: 445 RYLVEYNECPVYIELNSTGVSVAKSLYMDLEYEGVICDSY----------------TDLG 488 Query: 416 VKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFISKHNSFEAEEGCNDDLAMCLV 475 +K +K K VG LK LIE+DKLI + I E TF K S+ AEEG +DDL M LV Sbjct: 489 MKQTKRTKAVGCSTLKDLIEKDKLIIHHRATIQEFRTFSEKGVSWAAEEGYHDDLVMSLV 548 Query: 476 IYAWLVQMDYFKELTDQD---VRKRLYEEQKNQIEQDMAPFGFMN 517 I+ WL F + D+D + ++ ++ + D AP F++ Sbjct: 549 IFGWLSTQSKFIDYADKDDMRLASEVFSKELQDMSDDYAPVIFVD 593 >gi|22056|lcl|protein:vir:103455 Length: 610 # NCBI annotation: terminase subunit nuclease and ATPase # Family: family:all:147 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: 
genbank:acc:YP_803107;genbank:gi:116326387;genbank:GeneI D:4405484 Length = 610 Score = 333 bits (855), Expect = 2e-93, Method: Compositional matrix adjust. Identities = 188/525 (35%), Positives = 286/525 (54%), Gaps = 31/525 (5%) Query: 6 YLGNPNLKKANTPIEFTEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHPYHFQEKL 65 Y+G PNLK+AN ++T E V E+ KC++D VYFA Y I +D G+ + +Q + Sbjct: 87 YMGLPNLKRANIKTQWTREMVEEWKKCRDDIVYFAETYCAITHIDYGVIKVQLRDYQRDM 146 Query: 66 INNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQTA 125 + + R +C + RQ GK+T V +L H+ FN +GILA+K + + E+L R + A Sbjct: 147 LKIMSSKRMTVCNLSRQLGKTTVVAIFLAHFVCFNKDKAVGILAHKGSMSAEVLDRTKQA 206 Query: 126 YENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFAFVPNHVAD 185 E LP ++Q GI+ WNKGS+EL+NGS I A ++S AVRG SF ++++DE AF+PN D Sbjct: 207 IELLPDFLQPGIVEWNKGSIELDNGSSIGAYASSPDAVRGNSFAMIYIDECAFIPN-FHD 265 Query: 186 SFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMWHDAEKRKNEYIPTDVHWSEVPGR--- 242 S+ A + P I+SG+ +K+II +TP+G+NHFY +W A + K+ + P W+ V R Sbjct: 266 SWLA-IQPVISSGRRSKIIITTTPNGLNHFYDIWTAAVEGKSGFEPYTAIWNSVKERLYN 324 Query: 243 -----DDKWKET--TIANTSEQQFKVEFECEFLGSVDTLIAPSKLRTLIYDNPIQRNAGL 295 DD W+ + TI ++ QF+ E F G+ TLI+ KL + + + G Sbjct: 325 DEDIFDDGWQWSIQTINGSTLAQFRQEHTAAFEGTSGTLISGMKLAIMDFIEVTPDDHGF 384 Query: 296 DVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPMLFPNIIY 355 ++ P+ Y+ T+D + G G+DY A +D+T+ V +N I ++ P+I+ Sbjct: 385 HRFKKPEPDRKYIATLDCSEGRGQDYHALHIIDVTDDVWEQVGVLHSNTISHLILPDIVM 444 Query: 356 EVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSGKKTQLG 415 YN + E+N G VA L DLEY+ ++ S T LG Sbjct: 445 RYLVEYNECPVYIELNSTGVSVAKSLYMDLEYEGVICDSY----------------TDLG 488 Query: 416 VKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFISKHNSFEAEEGCNDDLAMCLV 475 +K +K K VG LK LIE+DKLI + I E TF K S+ AEEG +DDL M LV Sbjct: 489 MKQTKRTKAVGCSTLKDLIEKDKLIIHHRATIQEFRTFSEKGVSWAAEEGYHDDLVMSLV 548 Query: 476 IYAWLVQMDYFKELTDQD---VRKRLYEEQKNQIEQDMAPFGFMN 517 I+ WL F + D+D + ++ ++ + D AP F++ Sbjct: 549 IFGWLSTQSKFIDYADKDDMRLASEVFSKELQDMSDDYAPVIFVD 593 
>gi|21077|lcl|protein:vir:100538 Length: 612 # NCBI annotation: gp17 terminase subunit # Family: family:all:147 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656379;genbank:gi:109290130;genbank:GeneI D:4156516 Length = 612 Score = 317 bits (813), Expect = 2e-88, Method: Compositional matrix adjust. Identities = 188/541 (34%), Positives = 288/541 (53%), Gaps = 46/541 (8%) Query: 6 YLGNPNLKKANTPIEFTEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHPYHFQEKL 65 Y G PNLK+AN I++T+E + E +CKED VYFA NY I +D G+ + +Q+ + Sbjct: 77 YNGLPNLKRANIKIKWTKEMLAERKRCKEDIVYFAENYCCIEHIDYGIIRVQLRDYQKDM 136 Query: 66 INNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQTA 125 + NR + RQ GK+T V +L H+ FN + N+GILA+KA+ + E+L R + A Sbjct: 137 LRIMAGNRLMAANLSRQLGKTTVVAIFLAHFVCFNSAKNVGILAHKASMSAEVLHRTKQA 196 Query: 126 YENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFAFVPNHVAD 185 E LP ++Q GI+ WNKGS+ L NG I A S+S AVRG SF ++++DE AF+PN D Sbjct: 197 LELLPDFLQPGIVEWNKGSITLGNGCAIGAFSSSPDAVRGNSFALIYIDEVAFIPN-FND 255 Query: 186 SFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMW-------HDAEKRKNEYIPTDVHWSE 238 ++ A + P I+SG+++K+++ +TP+G+NH+Y +W D K+ ++P WS Sbjct: 256 AWLA-IQPVISSGRHSKILMTTTPNGLNHWYDIWTAAITPNSDGSGSKSGFVPYTATWSS 314 Query: 239 VPGR--DDKWK--------ETTIANTSEQQ-------FKVEFECEFLGSVDTLIAPSKLR 281 V R D K T I Q F+ E F G+ TLI KL Sbjct: 315 VKERMYSDGSKTDGAIHILTTDILGQPRQSPVLALRAFQQEHNTAFQGTSGTLINGFKLS 374 Query: 282 TLIYDNPIQRNAGLDVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYR 341 + + + + +++ P + H Y+ T+D A G G+DY A DITEFP+ VA Y Sbjct: 375 KMTWKE-VPASDNFTMFKEPIEGHKYIATLDSAEGRGQDYHAMHIYDITEFPYEQVAVYH 433 Query: 342 NNDIKPMLFPNIIYEVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQ 401 +N ++ P+++ + Y YI E+N G +A L +LEY+N++ S Sbjct: 434 SNTTSHLILPDVLLKYLNMYYQPYIYIELNATGVSIAKSLYSELEYENIICDSY------ 487 Query: 402 IVGQGFSGKKTQLGVKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFISKHNSFE 461 LG+K +K K +G LK LIE++KL+ I EL TF K S+ Sbjct: 488 ----------NDLGMKQTKRSKAIGCSTLKDLIEKEKLVLYHKGTIMELRTFSEKGVSWA 537 Query: 462 
AEEGCNDDLAMCLVIYAWLVQMDYFKELTDQD---VRKRLYEEQKNQIEQDMAPFGFMND 518 AE+G +DDL M LVI+AWL F + T++D + ++ ++ + D P ++ Sbjct: 538 AEDGFHDDLVMSLVIFAWLTTQPRFSDFTERDDMRLANEIFRQEMENLYDDYTPVVIVDS 597 Query: 519 G 519 G Sbjct: 598 G 598 >gi|16897|lcl|protein:vir:5867 Length: 508 # NCBI annotation: similar to terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835653;genbank:gi:30044056 Length = 508 Score = 181 bits (460), Expect = 2e-47, Method: Compositional matrix adjust. Identities = 141/461 (30%), Positives = 220/461 (47%), Gaps = 42/461 (9%) Query: 24 EQVI--EFLKCKEDPVYFANNYIKIVSLDEGLTQFHPYHFQEKLINNFHNNRFNICKMPR 81 +Q+I E KCK DP+YF Y+KI + + F Y QEKLIN +H +R+ I + PR Sbjct: 4 QQIIKQELEKCKNDPIYFIRKYVKIQHPIKRVIPFDLYPIQEKLINFYHTHRYVITEKPR 63 Query: 82 QTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQTAYENLPRWMQQGIIAWN 141 Q G + V+Y LH +FN + + I ANK ATA+ +L R++ AYE LPR++Q WN Sbjct: 64 QMGVTWCAVAYALHQMIFNSNYKVLIAANKEATAKNVLERIKFAYEQLPRFLQIKKRTWN 123 Query: 142 KGSLELENGSKILAASTSASAVRGMSFNILFLDEFAFVPNHVADSFFASVYPTITSGKNT 201 K +E N S A S+ + + R S +L ++E AF+ N + +ASV T+ +G Sbjct: 124 KTYIEFSNYSSARAVSSKSDSGRSESITLLIVEEAAFISN--MEELWASVQQTLATG--G 179 Query: 202 KVIIVSTPHGMNHFY-RMWHDAEKRKNEYIPTDVHWSEVPGRDDKWKETTIANTSEQQFK 260 K I+ ST +G+ ++Y R A++ K+E+ + WS+ P RD+KW E + F Sbjct: 180 KCIVNSTYNGVGNWYERTIRAAKEGKSEFKYFGIKWSDHPERDEKWFEEQKRLLPPRVFA 239 Query: 261 VEFECEFLGSVDTLIAPSKLRTLIYDNPIQRNAGLDVYEPPKDKHDYVMTVDVARGVGED 320 E C GS + +I +R + +P G D +E + Y ++VD A G GED Sbjct: 240 QEILCIPQGSGENVIPFHLIREEEFIDPFVVKYGGDYWEWYRKPGYYFISVDPASGRGED 299 Query: 321 YSA----FVCVDITEFPHRIVAKYRNNDIKPMLFPNIIYEVAKNYNSAYILCEVNDIGDQ 376 SA + VD VA++ ++ + +I ++ + I E N IG Sbjct: 300 RSAVGVQVLWVDPQTLTIEQVAEFASDKTSLPVMRQVIKQIYDEFKPQLIFIETNGIG-- 357 Query: 377 VASILQYDLEYQNLLMCSMRGRAGQIVGQGFSGKKTQLGVKMSKTVKKVGSLNLKTLIEE 436 + Q+ M IVG + +K K GS L L E+ Sbjct: 358 -MGLYQF-----------MEAYTPSIVGYYTTQRK-----------KVHGSDLLAKLYED 394 Query: 437 
DKLIFNDYEIISEL--TTFISKHNSFEAEEGCNDDLAMCLV 475 +LI ++ +L TT++ + E +DL M L+ Sbjct: 395 GRLILRSKRLLEQLQRTTWVKN----KVETAGRNDLYMALI 431 >gi|17621|lcl|protein:vir:816 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050549;genbank:gi:9633446;genbank:GeneID: 1262208 Length = 568 Score = 68.6 bits (166), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 122/548 (22%), Positives = 204/548 (37%), Gaps = 94/548 (17%) Query: 11 NLKKANTPIEFTEEQVIEFLKCK-EDPVYFANNYIKIVSLDEGLTQFHPYHFQEKLINNF 69 N + + P E TE + F+ K +P + N+ KI + L F Q +L + Sbjct: 6 NEPRCDEPSEMTEAEQRLFIMTKLSNPWWRLNHLYKIQNEKGELVTFRMRPAQRQLFRSM 65 Query: 70 HNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELL-GRLQTAYEN 128 HN NI RQ G ST + YLL A+F + GI+A A E+ ++ +++ Sbjct: 66 HNK--NIILKARQLGFSTAIDIYLLDQALFIPHLKCGIVAQDKQAASEIFRTKIAVPFDH 123 Query: 129 LPRWMQQ-------------GIIAWNKGS---------------LELENGSKILAA-STS 159 LP W++ G I + GS L + KI A Sbjct: 124 LPDWLRASFTIVERRSGASGGYILFGHGSSIQVATSFRSGTVQRLHISEHGKICAKYPAK 183 Query: 160 ASAVRGMSFNILFLDEFAFVPNHVADSFFASVYPTITSGKNTKV--IIVSTPHGMNHFYR 217 A +R + N + DE A+ Y + ++++ HFY Sbjct: 184 AKELRTGTLNAVS-DECIIFDESTAEGVGGDFYEMSNRAQEITASGLLLTAQDYKFHFYA 242 Query: 218 MWHDAE-------------KRKNEYIPTDVHWSEVPGRDD--KW---KETTIANTSEQQF 259 W D + + K Y + D+ +W KET +Q+F Sbjct: 243 WWQDPKYSARVPESGLKLSREKMTYFSAVEKAMNITLTDEQKQWYINKETEQREEMKQEF 302 Query: 260 KVEFECEFLGSVDTLI-APSKLRT--------LIYD-------------------NPIQR 291 + FL S + A S L+ ++YD N +QR Sbjct: 303 PSTPQEAFLTSGRRVFSAESTLQAESFCSPPMIVYDIEPVTGAKTKAQSLREGNKNELQR 362 Query: 292 NAG--LDVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPML 349 L V+E P +YV D A G+ ++ +D+ + + + + L Sbjct: 363 TLMNYLLVWELPDPDEEYVCGADTAEGL--EHGDRSSLDVVKRSNGEQVAHWFGHLDAEL 420 Query: 350 FPNIIYEVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSG 409 F ++I +V + YN+A++ E N+ G V IL+ Y + + Q + Q + Sbjct: 421 
FAHLISQVCRMYNNAFVGPERNNHGHAV--ILKLRELYPTRYIYN-----EQHLDQAYDD 473 Query: 410 KKTQLGVKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFI-SKHNSFEAEEGCND 468 +LG ++ K V + +KTL+ +SE+ T++ S A+EGC D Sbjct: 474 DTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQEGCFD 533 Query: 469 DLAMCLVI 476 D M +I Sbjct: 534 DQLMSYMI 541 >gi|25680|lcl|protein:vir:10116 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859246;genbank:gi:32171173;genbank:GeneID :2653342 Length = 568 Score = 68.6 bits (166), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 122/548 (22%), Positives = 204/548 (37%), Gaps = 94/548 (17%) Query: 11 NLKKANTPIEFTEEQVIEFLKCK-EDPVYFANNYIKIVSLDEGLTQFHPYHFQEKLINNF 69 N + + P E TE + F+ K +P + N+ KI + L F Q +L + Sbjct: 6 NEPRCDEPSEMTEAEQRLFIMTKLSNPWWRLNHLYKIQNEKGELVTFRMRPAQRQLFRSM 65 Query: 70 HNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELL-GRLQTAYEN 128 HN NI RQ G ST + YLL A+F + GI+A A E+ ++ +++ Sbjct: 66 HNK--NIILKARQLGFSTAIDIYLLDQALFIPHLKCGIVAQDKQAASEIFRTKIAVPFDH 123 Query: 129 LPRWMQQ-------------GIIAWNKGS---------------LELENGSKILAA-STS 159 LP W++ G I + GS L + KI A Sbjct: 124 LPDWLRASFTIVERRSGASGGYILFGHGSSIQVATSFRSGTVQRLHISEHGKICAKYPAK 183 Query: 160 ASAVRGMSFNILFLDEFAFVPNHVADSFFASVYPTITSGKNTKV--IIVSTPHGMNHFYR 217 A +R + N + DE A+ Y + ++++ HFY Sbjct: 184 AKELRTGTLNAVS-DECIIFDESTAEGVGGDFYEMSNRAQEITASGLLLTAQDYKFHFYA 242 Query: 218 MWHDAE-------------KRKNEYIPTDVHWSEVPGRDD--KW---KETTIANTSEQQF 259 W D + + K Y + D+ +W KET +Q+F Sbjct: 243 WWQDPKYSARVPESGLKLSREKMTYFSAVEKAMNITLTDEQKQWYINKETEQREEMKQEF 302 Query: 260 KVEFECEFLGSVDTLI-APSKLRT--------LIYD-------------------NPIQR 291 + FL S + A S L+ ++YD N +QR Sbjct: 303 PSTPQEAFLTSGRRVFSAESTLQAESFCSPPMIVYDIEPVTGAKTKAQSLREGNKNELQR 362 Query: 292 NAG--LDVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPML 349 L V+E P +YV D A G+ ++ +D+ + + + + L Sbjct: 363 
TLMNYLLVWELPDPDEEYVCGADTAEGL--EHGDRSSLDVVKRSNGEQVAHWFGHLDAEL 420 Query: 350 FPNIIYEVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSG 409 F ++I +V + YN+A++ E N+ G V IL+ Y + + Q + Q + Sbjct: 421 FAHLISQVCRMYNNAFVGPERNNHGHAV--ILKLRELYPTRYIYN-----EQHLDQAYDD 473 Query: 410 KKTQLGVKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFI-SKHNSFEAEEGCND 468 +LG ++ K V + +KTL+ +SE+ T++ S A+EGC D Sbjct: 474 DTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQEGCFD 533 Query: 469 DLAMCLVI 476 D M +I Sbjct: 534 DQLMSYMI 541 >gi|51|lcl|protein:vir:3295 Length: 568 # NCBI annotation: putative large subunit terminase # Family: family:all:147 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049511;genbank:gi:9632517;genbank:GeneID: 1262002 Length = 568 Score = 68.6 bits (166), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 122/548 (22%), Positives = 204/548 (37%), Gaps = 94/548 (17%) Query: 11 NLKKANTPIEFTEEQVIEFLKCK-EDPVYFANNYIKIVSLDEGLTQFHPYHFQEKLINNF 69 N + + P E TE + F+ K +P + N+ KI + L F Q +L + Sbjct: 6 NEPRCDEPSEMTEAEQRLFIMTKLSNPWWRLNHLYKIQNEKGELVTFRMRPAQRQLFRSM 65 Query: 70 HNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELL-GRLQTAYEN 128 HN NI RQ G ST + YLL A+F + GI+A A E+ ++ +++ Sbjct: 66 HNK--NIILKARQLGFSTAIDIYLLDQALFIPHLKCGIVAQDKQAASEIFRTKIAVPFDH 123 Query: 129 LPRWMQQ-------------GIIAWNKGS---------------LELENGSKILAA-STS 159 LP W++ G I + GS L + KI A Sbjct: 124 LPDWLRASFTIVERRSGASGGYILFGHGSSIQVATSFRSGTVQRLHISEHGKICAKYPAK 183 Query: 160 ASAVRGMSFNILFLDEFAFVPNHVADSFFASVYPTITSGKNTKV--IIVSTPHGMNHFYR 217 A +R + N + DE A+ Y + ++++ HFY Sbjct: 184 AKELRTGTLNAVS-DECIIFDESTAEGVGGDFYEMSNRAQEITASGLLLTAQDYKFHFYA 242 Query: 218 MWHDAE-------------KRKNEYIPTDVHWSEVPGRDD--KW---KETTIANTSEQQF 259 W D + + K Y + D+ +W KET +Q+F Sbjct: 243 WWQDPKYSARVPESGLKLSREKMTYFSAVEKAMNITLTDEQKQWYINKETEQREEMKQEF 302 Query: 260 KVEFECEFLGSVDTLI-APSKLRT--------LIYD-------------------NPIQR 291 + FL S + A S L+ ++YD N +QR Sbjct: 303 PSTPQEAFLTSGRRVFSAESTLQAESFCSPPMIVYDIEPVTGAKTKAQSLREGNKNELQR 
362 Query: 292 NAG--LDVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPML 349 L V+E P +YV D A G+ ++ +D+ + + + + L Sbjct: 363 TLMNYLLVWELPDPDEEYVCGADTAEGL--EHGDRSSLDVVKRSNGEQVAHWFGHLDAEL 420 Query: 350 FPNIIYEVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSG 409 F ++I +V + YN+A++ E N+ G V IL+ Y + + Q + Q + Sbjct: 421 FAHLISQVCRMYNNAFVGPERNNHGHAV--ILKLRELYPTRYIYN-----EQHLDQAYDD 473 Query: 410 KKTQLGVKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFI-SKHNSFEAEEGCND 468 +LG ++ K V + +KTL+ +SE+ T++ S A+EGC D Sbjct: 474 DTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQEGCFD 533 Query: 469 DLAMCLVI 476 D M +I Sbjct: 534 DQLMSYMI 541 >gi|26240|lcl|protein:vir:2763 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612880;genbank:gi:20065962;genbank:GeneID :935714 Length = 568 Score = 68.6 bits (166), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 122/548 (22%), Positives = 204/548 (37%), Gaps = 94/548 (17%) Query: 11 NLKKANTPIEFTEEQVIEFLKCK-EDPVYFANNYIKIVSLDEGLTQFHPYHFQEKLINNF 69 N + + P E TE + F+ K +P + N+ KI + L F Q +L + Sbjct: 6 NEPRCDEPSEMTEAEQRLFIMTKLSNPWWRLNHLYKIQNEKGELVTFRMRPAQRQLFRSM 65 Query: 70 HNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELL-GRLQTAYEN 128 HN NI RQ G ST + YLL A+F + GI+A A E+ ++ +++ Sbjct: 66 HNK--NIILKARQLGFSTAIDIYLLDQALFIPHLKCGIVAQDKQAASEIFRTKIAVPFDH 123 Query: 129 LPRWMQQ-------------GIIAWNKGS---------------LELENGSKILAA-STS 159 LP W++ G I + GS L + KI A Sbjct: 124 LPDWLRASFTIVERRSGASGGYILFGHGSSIQVATSFRSGTVQRLHISEHGKICAKYPAK 183 Query: 160 ASAVRGMSFNILFLDEFAFVPNHVADSFFASVYPTITSGKNTKV--IIVSTPHGMNHFYR 217 A +R + N + DE A+ Y + ++++ HFY Sbjct: 184 AKELRTGTLNAVS-DECIIFDESTAEGVGGDFYEMSNRAQEITASGLLLTAQDYKFHFYA 242 Query: 218 MWHDAE-------------KRKNEYIPTDVHWSEVPGRDD--KW---KETTIANTSEQQF 259 W D + + K Y + D+ +W KET +Q+F Sbjct: 243 WWQDPKYSARVPESGLKLSREKMTYFSAVEKAMNITLTDEQKQWYINKETEQREEMKQEF 302 Query: 260 
KVEFECEFLGSVDTLI-APSKLRT--------LIYD-------------------NPIQR 291 + FL S + A S L+ ++YD N +QR Sbjct: 303 PSTPQEAFLTSGRRVFSAESTLQAESFCSPPMIVYDIEPVTGAKTKAQSLREGNKNELQR 362 Query: 292 NAG--LDVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPML 349 L V+E P +YV D A G+ ++ +D+ + + + + L Sbjct: 363 TLMNYLLVWELPDPDEEYVCGADTAEGL--EHGDRSSLDVVKRSNGEQVAHWFGHLDAEL 420 Query: 350 FPNIIYEVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSG 409 F ++I +V + YN+A++ E N+ G V IL+ Y + + Q + Q + Sbjct: 421 FAHLISQVCRMYNNAFVGPERNNHGHAV--ILKLRELYPTRYIYN-----EQHLDQAYDD 473 Query: 410 KKTQLGVKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFI-SKHNSFEAEEGCND 468 +LG ++ K V + +KTL+ +SE+ T++ S A+EGC D Sbjct: 474 DTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQEGCFD 533 Query: 469 DLAMCLVI 476 D M +I Sbjct: 534 DQLMSYMI 541 >gi|25849|lcl|protein:vir:9949 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859079;genbank:gi:32171001;genbank:GeneID :2653280 Length = 568 Score = 68.6 bits (166), Expect = 2e-13, Method: Compositional matrix adjust. 
Identities = 122/548 (22%), Positives = 204/548 (37%), Gaps = 94/548 (17%) Query: 11 NLKKANTPIEFTEEQVIEFLKCK-EDPVYFANNYIKIVSLDEGLTQFHPYHFQEKLINNF 69 N + + P E TE + F+ K +P + N+ KI + L F Q +L + Sbjct: 6 NEPRCDEPSEMTEAEQRLFIMTKLSNPWWRLNHLYKIQNEKGELVTFRMRPAQRQLFRSM 65 Query: 70 HNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELL-GRLQTAYEN 128 HN NI RQ G ST + YLL A+F + GI+A A E+ ++ +++ Sbjct: 66 HNK--NIILKARQLGFSTAIDIYLLDQALFIPHLKCGIVAQDKQAASEIFRTKIAVPFDH 123 Query: 129 LPRWMQQ-------------GIIAWNKGS---------------LELENGSKILAA-STS 159 LP W++ G I + GS L + KI A Sbjct: 124 LPDWLRASFTIVERRSGASGGYILFGHGSSIQVATSFRSGTVQRLHISEHGKICAKYPAK 183 Query: 160 ASAVRGMSFNILFLDEFAFVPNHVADSFFASVYPTITSGKNTKV--IIVSTPHGMNHFYR 217 A +R + N + DE A+ Y + ++++ HFY Sbjct: 184 AKELRTGTLNAVS-DECIIFDESTAEGVGGDFYEMSNRAQEITASGLLLTAQDYKFHFYA 242 Query: 218 MWHDAE-------------KRKNEYIPTDVHWSEVPGRDD--KW---KETTIANTSEQQF 259 W D + + K Y + D+ +W KET +Q+F Sbjct: 243 WWQDPKYSARVPESGLKLSREKMTYFSAVEKAMNITLTDEQKQWYINKETEQREEMKQEF 302 Query: 260 KVEFECEFLGSVDTLI-APSKLRT--------LIYD-------------------NPIQR 291 + FL S + A S L+ ++YD N +QR Sbjct: 303 PSTPQEAFLTSGRRVFSAESTLQAESFCSPPMIVYDIEPVTGAKTKAQSLREGNKNELQR 362 Query: 292 NAG--LDVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPML 349 L V+E P +YV D A G+ ++ +D+ + + + + L Sbjct: 363 TLMNYLLVWELPDPDEEYVCGADTAEGL--EHGDRSSLDVVKRSNGEQVAHWFGHLDAEL 420 Query: 350 FPNIIYEVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSG 409 F ++I +V + YN+A++ E N+ G V IL+ Y + + Q + Q + Sbjct: 421 FAHLISQVCRMYNNAFVGPERNNHGHAV--ILKLRELYPTRYIYN-----EQHLDQAYDD 473 Query: 410 KKTQLGVKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFI-SKHNSFEAEEGCND 468 +LG ++ K V + +KTL+ +SE+ T++ S A+EGC D Sbjct: 474 DTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQEGCFD 533 Query: 469 DLAMCLVI 476 D M +I Sbjct: 534 DQLMSYMI 541 >gi|1476|lcl|protein:vir:104436 Length: 568 # NCBI annotation: putative terminase large subunit # Family: family:all:147 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: 
genbank:acc:YP_794060;genbank:gi:116222005;genbank:GeneI D:4397501 Length = 568 Score = 68.6 bits (166), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 122/548 (22%), Positives = 204/548 (37%), Gaps = 94/548 (17%) Query: 11 NLKKANTPIEFTEEQVIEFLKCK-EDPVYFANNYIKIVSLDEGLTQFHPYHFQEKLINNF 69 N + + P E TE + F+ K +P + N+ KI + L F Q +L + Sbjct: 6 NEPRCDEPSEMTEAEQRLFIMTKLSNPWWRLNHLYKIQNEKGELVTFRMRPAQRQLFRSM 65 Query: 70 HNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELL-GRLQTAYEN 128 HN NI RQ G ST + YLL A+F + GI+A A E+ ++ +++ Sbjct: 66 HNK--NIILKARQLGFSTAIDIYLLDQALFIPHLKCGIVAQDKQAASEIFRTKIAVPFDH 123 Query: 129 LPRWMQQ-------------GIIAWNKGS---------------LELENGSKILAA-STS 159 LP W++ G I + GS L + KI A Sbjct: 124 LPDWLRASFTIVERRSGASGGYILFGHGSSIQVATSFRSGTVQRLHISEHGKICAKYPAK 183 Query: 160 ASAVRGMSFNILFLDEFAFVPNHVADSFFASVYPTITSGKNTKV--IIVSTPHGMNHFYR 217 A +R + N + DE A+ Y + ++++ HFY Sbjct: 184 AKELRTGTLNAVS-DECIIFDESTAEGVGGDFYEMSNRAQEITASGLLLTAQDYKFHFYA 242 Query: 218 MWHDAE-------------KRKNEYIPTDVHWSEVPGRDD--KW---KETTIANTSEQQF 259 W D + + K Y + D+ +W KET +Q+F Sbjct: 243 WWQDPKYSARVPESGLKLSREKMTYFSAVEKAMNITLTDEQKQWYINKETEQREEMKQEF 302 Query: 260 KVEFECEFLGSVDTLI-APSKLRT--------LIYD-------------------NPIQR 291 + FL S + A S L+ ++YD N +QR Sbjct: 303 PSTPQEAFLTSGRRVFSAESTLQAESFCSPPMIVYDIEPVTGAKTKAQSLREGNKNELQR 362 Query: 292 NAG--LDVYEPPKDKHDYVMTVDVARGVGEDYSAFVCVDITEFPHRIVAKYRNNDIKPML 349 L V+E P +YV D A G+ ++ +D+ + + + + L Sbjct: 363 TLMNYLLVWELPDPDEEYVCGADTAEGL--EHGDRSSLDVVKRSNGEQVAHWFGHLDAEL 420 Query: 350 FPNIIYEVAKNYNSAYILCEVNDIGDQVASILQYDLEYQNLLMCSMRGRAGQIVGQGFSG 409 F ++I +V + YN+A++ E N+ G V IL+ Y + + Q + Q + Sbjct: 421 FAHLISQVCRMYNNAFVGPERNNHGHAV--ILKLRELYPTRYIYN-----EQHLDQAYDD 473 Query: 410 KKTQLGVKMSKTVKKVGSLNLKTLIEEDKLIFNDYEIISELTTFI-SKHNSFEAEEGCND 468 +LG ++ K V + +KTL+ +SE+ T++ S A+EGC D Sbjct: 474 DTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQEGCFD 533 Query: 469 DLAMCLVI 476 D M +I Sbjct: 534 DQLMSYMI 541 >gi|12256|lcl|protein:vir:79447 
Length: 485 # NCBI annotation: phage terminase large subunit # Family: family:all:147 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468054;genbank:gi:157265496;genbank:Ge neID:5600564 Length = 485 Score = 63.2 bits (152), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 91/373 (24%), Positives = 155/373 (41%), Gaps = 65/373 (17%) Query: 163 VRGMSFNILFLDEFAFVPNHVADSFFASVYPTITSGKNTKVIIVSTPHGMNHFYRM---- 218 +RG + + + LDE A +P V + ++ PT+ S ++ +I+STP G+N FY Sbjct: 138 LRGATLDFVILDEAAMIPFSV---WSEAIEPTL-SVRDGWALIISTPKGLNWFYEFFLMG 193 Query: 219 WHDAEKRKNEYIPTD--------------VHWSEVPGRDDKWKETTIANTSEQQFKVEFE 264 W K E IP W P R + + E + + +F+ E+ Sbjct: 194 WRGGLK---EGIPNSGINQTHPDFESFHAASWDVWPERREWYMERRL-YIPDLEFRQEYG 249 Query: 265 CEFLGSVDTLIAPSKLRTLIYDNPIQRNAGLDVYEPPKDKHDYVMTVDVARGVGEDYSAF 324 EF+ +++ + + L+ P +R V E + H Y + D G +DYS F Sbjct: 250 AEFVSHSNSVFSGLDMLILL---PYERRGTRLVVEDYRPDHIYCIGADF--GKNQDYSVF 304 Query: 325 VCVDITEFPHRIVAKYRNNDIKPMLFPNIIYEVAKNYNSAYILCEVNDIGDQVASILQYD 384 +D+ IV R N + ++++Y AY++ + +GD +A L Sbjct: 305 SVLDLDT--GAIVCLERMNGATWSDQVARLKALSEDYGHAYVVADTWGVGDAIAEELD-- 360 Query: 385 LEYQNLLMCSMRGRAGQIVGQGFSGKKTQLGVKMSKTVKKVGSLNLKTLIEEDKL-IFND 443 QG + T L VK S +VK+ NL L+E+ ++ + ND Sbjct: 361 -------------------AQGIN--YTPLPVK-SSSVKEQLISNLALLMEKGQVAVPND 398 Query: 444 YEIISELTTF-----ISKHNSFEAEEGCNDDLAMCLVI-YAWLVQMDYFK-ELTDQDVRK 496 I+ EL F S + A +DD+ M L + Y+ D +K EL ++ K Sbjct: 399 KTILDELRNFRYYRTASGNQVMRAYGRGHDDIVMSLALAYSQYEGKDGYKFELAEERPSK 458 Query: 497 RLYEEQKNQIEQD 509 +EE + +D Sbjct: 459 LKHEESVMSLVED 471 >gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: terminase large subunit # Family: family:all:147 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467938;genbank:gi:157265379;genbank:Ge neID:5600506 Length = 485 Score = 63.2 bits (152), Expect = 8e-12, Method: Compositional matrix adjust. 
Identities = 91/373 (24%), Positives = 155/373 (41%), Gaps = 65/373 (17%) Query: 163 VRGMSFNILFLDEFAFVPNHVADSFFASVYPTITSGKNTKVIIVSTPHGMNHFYRM---- 218 +RG + + + LDE A +P V + ++ PT+ S ++ +I+STP G+N FY Sbjct: 138 LRGATLDFVILDEAAMIPFSV---WSEAIEPTL-SVRDGWALIISTPKGLNWFYEFFLMG 193 Query: 219 WHDAEKRKNEYIPTD--------------VHWSEVPGRDDKWKETTIANTSEQQFKVEFE 264 W K E IP W P R + + E + + +F+ E+ Sbjct: 194 WRGGLK---EGIPNSGVNQTHPDFESFHAASWDVWPERREWYMERRL-YIPDLEFRQEYG 249 Query: 265 CEFLGSVDTLIAPSKLRTLIYDNPIQRNAGLDVYEPPKDKHDYVMTVDVARGVGEDYSAF 324 EF+ +++ + + L+ P +R V E + H Y + D G +DYS F Sbjct: 250 AEFVSHSNSVFSGLDMLILL---PYERRGTRLVVEDYRPDHIYCIGADF--GKNQDYSVF 304 Query: 325 VCVDITEFPHRIVAKYRNNDIKPMLFPNIIYEVAKNYNSAYILCEVNDIGDQVASILQYD 384 +D+ IV R N + ++++Y AY++ + +GD +A L Sbjct: 305 SVLDLDT--GAIVCLERMNGATWSDQVARLKALSEDYGHAYVVADTWGVGDAIAEELD-- 360 Query: 385 LEYQNLLMCSMRGRAGQIVGQGFSGKKTQLGVKMSKTVKKVGSLNLKTLIEEDKL-IFND 443 QG + T L VK S +VK+ NL L+E+ ++ + ND Sbjct: 361 -------------------AQGIN--YTPLPVK-SSSVKEQLISNLALLMEKGQVAVPND 398 Query: 444 YEIISELTTF-----ISKHNSFEAEEGCNDDLAMCLVI-YAWLVQMDYFK-ELTDQDVRK 496 I+ EL F S + A +DD+ M L + Y+ D +K EL ++ K Sbjct: 399 KTILDELRNFRYYRTASGNQVMRAYGRGHDDIVMSLALAYSQYEGKDGYKFELAEERPSK 458 Query: 497 RLYEEQKNQIEQD 509 +EE + +D Sbjct: 459 LKHEESVMSLVED 471 >gi|1431|lcl|protein:vir:93626 Length: 530 # NCBI annotation: Bcep22gp49 # Family: family:all:147 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944278;genbank:gi:38640355;genbank:GeneID :2658275 Length = 530 Score = 47.8 bits (112), Expect = 3e-07, Method: Compositional matrix adjust. 
Identities = 45/152 (29%), Positives = 71/152 (46%), Gaps = 16/152 (10%) Query: 81 RQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELL-GRLQTAYENLPRWMQQGIIA 139 RQ G +T + L +A+FN + GI+A TA L +++ AY+NLP +++ + Sbjct: 78 RQLGFTTLICIIWLDHALFNANSRCGIIAQDRETAEALFRDKVKFAYDNLPEALREAMPL 137 Query: 140 WNKGSLEL---ENGSKILAASTSASAVRGMSFNILFLDEFAFV----PNHVADSFFASVY 192 N EL N S I A++ VRG + + L + EF + P+ A+ S+ Sbjct: 138 ANCTKAELLFAHNNSSIRVATS----VRGGTIHRLHISEFGKICAKYPDKAAEVVTGSIP 193 Query: 193 PTITSGKNTKVIIVSTPHGM-NHFYRMWHDAE 223 SG ++I ST G FY + AE Sbjct: 194 AVPKSG---ILVIESTAEGREGEFYNITMQAE 222 >gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: terminase, large subunit # Family: family:all:147 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006983;genbank:gi:46401884;genbank:GeneID :2777679 Length = 438 Score = 46.6 bits (109), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 60/231 (25%), Positives = 104/231 (45%), Gaps = 32/231 (13%) Query: 51 EGLTQFHPYHFQEKLINNFHN--NRFNICKMPRQTGKSTTVVSYLLHY-AVFNDSVNIGI 107 EG+T P Q +IN + +RF + R+ GKS ++Y L + + +V + + Sbjct: 34 EGITPNGP---QIAIINALEDPRHRFVTACVSRRVGKS--FIAYTLGFLKLLEPNVKVLV 88 Query: 108 LANKAATA-------RELLGR--LQTAYENLPRWMQQGIIAWNKGSLELENGSKI-LAAS 157 +A + A R L+ + LQT EN A +K +EL NGS LA++ Sbjct: 89 VAPNYSLANIGWSQIRGLIKKYGLQTEREN----------AKDK-EIELANGSLFKLASA 137 Query: 158 TSASAVRGMSFNILFLDEFAFVPNHVADSFFASVYPTITSGKNTKVIIVSTPHGMNHFYR 217 A + G S++ + DE A + + D+F + PT+ N+K + +STP G N F Sbjct: 138 AQADSAVGRSYDFIIFDEAA-ISDVGGDAFRVQLRPTLDK-PNSKALFISTPRGGNWFKE 195 Query: 218 MW-HDAEKRKNEYIPTDVHWSEVPGRDDKWKETTIANTSEQQFKVEFECEF 267 + + + ++ + + P D E S+ F+ E+E +F Sbjct: 196 FYAYGFDDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQEYEADF 246 >gi|4674|lcl|protein:vir:105593 Length: 543 # NCBI annotation: terminase large subunit # Family: family:all:147 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164303;genbank:gi:56692921;genbank:GeneID :3197204 Length = 543 Score = 44.3 bits (103), Expect = 3e-06, 
Method: Compositional matrix adjust. Identities = 53/221 (23%), Positives = 95/221 (42%), Gaps = 34/221 (15%) Query: 28 EFLKCKEDPVY--FANNYIKIV-----------SLDEG---LTQFHPYHFQEKLINNFHN 71 E +C DP + F+ KI+ S++EG + F P Q++ I + Sbjct: 20 ELARCLADPEWRLFSGCLYKIMIKGDDKIGPDGSIEEGDSFVLPFKPNRAQKRFIRRLWH 79 Query: 72 NRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELL-GRLQTAYENLP 130 N+ RQ G +T + L +A+FN GI+A A+ + +++ AY+NLP Sbjct: 80 R--NLILKARQLGFTTLIAIMWLDHALFNGDQRCGIIAQDRDAAKVIFRDKVKFAYDNLP 137 Query: 131 RWMQQGIIAWNKGSLEL---ENGSKILAASTSASAVRGMSFNILFLDEFAFV----PNHV 183 +++ + EL N S + A++ +R + + L + EF + P+ Sbjct: 138 EEIRERFPTAAANADELLFAHNNSSVRVATS----MRSGTIHRLHVSEFGKICAKYPDKA 193 Query: 184 ADSFFASVYPTITSGKNTKVIIVSTPHGM-NHFYRMWHDAE 223 + S+ T+G ++I ST G F++M AE Sbjct: 194 QEVVTGSIPAVPTNG---ILVIESTAEGREGEFFKMVQIAE 231 >gi|20547|lcl|protein:vir:105155 Length: 580 # NCBI annotation: conserved phage-related protein # Family: family:all:4926 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398598;genbank:gi:80159854;genbank:GeneID :3772993 Length = 580 Score = 44.3 bits (103), Expect = 4e-06, Method: Compositional matrix adjust. 
Identities = 55/281 (19%), Positives = 113/281 (40%), Gaps = 30/281 (10%) Query: 59 YHFQEKLINNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARE- 117 Y FQ ++ N++ + R GKS + + + + GI + + AR Sbjct: 67 YLFQRLILRAMARNQYVMLICCRGLGKSWLSAVFFVASCILYKGLKCGIASGQGQQARNV 126 Query: 118 LLGRLQTAYENLPRWMQQGIIAWNKGS----LELENGSKILA---ASTSASAVRGMSFNI 170 ++ +++ P ++ + G+ + NGS+I A R F+ Sbjct: 127 IIQKVKGELAKNPSIAREIVFPIKTGADDCVVNFRNGSEIRAIVLGRNQGDGARSWRFHY 186 Query: 171 LFLDEFAFVPNHVADSFFASVYPTITS-------GKNTKVIIVSTPH--GMNHFYRMWHD 221 L +DE V + V ++ + T + + KVI +S+ + + + R + Sbjct: 187 LLVDECRLVSDKVINTILIPMTKTKRAVAIHHNKREKGKVIFISSAYLKTSDLYKRFKYF 246 Query: 222 AEKRK---NEYIPTDVHW-----SEVPGRDDKWKETTIANTSEQQFKVEFECEFLGSVDT 273 +K N Y + + + + +DD +E + + ++F+ E+E F+GS Sbjct: 247 CDKMSSGANNYFVCSLDYRVGIEAGIFDQDDIDEERNKPDMTIEEFQYEYEGIFVGSSGE 306 Query: 274 LIAPSKLRTLIYDNPIQRNAGLDVYEPPKDKHDYVMTVDVA 314 P + T P + ++ +P K K +Y++T DVA Sbjct: 307 SYFPYETTT-----PARVLGRGEITQPKKSKSEYIITHDVA 342 >gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: terminase large subunit # Family: family:all:4877 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429607;genbank:gi:156564098;genbank:Ge neID:5525534 Length = 635 Score = 40.0 bits (92), Expect = 7e-05, Method: Compositional matrix adjust. 
Identities = 50/222 (22%), Positives = 93/222 (41%), Gaps = 41/222 (18%) Query: 16 NTPIEFTEEQVIEFLKCKEDPVYFANNYIKIVSLDEGLTQFHPYHFQEKLINNFHNNRFN 75 +T +FTEE+ + +L + P ++A +K D +QE ++ +++ Sbjct: 37 STGEKFTEEE-LHYLAILDKPKFWAAETLKWFCRD----------YQEPMLQEMADSKRT 85 Query: 76 ICKMPRQTGKSTTVVSYLLHYAVF------NDSVNIGILANKAATARELLGRLQTAYE-- 127 + ++ R+ GK+ T+ +L +A N+ +I I+A + RL + Sbjct: 86 VLRLGRRLGKTETMCIMILWHAFTQPNKGPNNQYDILIIAPYEEQVDLIFKRLSQLIDMS 145 Query: 128 ---NLPRWMQQGIIAWNKGSLELENGS---KILAASTSASA---VRGMSFNILFLDEFAF 178 N R + + I EL NG+ I A S S S RG +++ LDE Sbjct: 146 GDVNPSRDIDKHI--------ELPNGTVIHGITAGSKSGSGAANTRGQRADLIVLDEM-- 195 Query: 179 VPNHVADSFFASVYPTITSG-KNTKVIIVSTPHGMNHFYRMW 219 +++ +S ++ + K+I+ STP G Y W Sbjct: 196 --DYMGESEITNIMNIRNEAPERIKMIVASTPSGRRDSYYKW 235 >gi|26461|lcl|protein:vir:573 Length: 589 # NCBI annotation: unknown # Family: family:all:4926 # MgeID: mge:13 # MgeName: SPBc2 # Cross-refs: genbank:acc:NP_046608;genbank:gi:9630181;genbank:GeneID: 1261423 Length = 589 Score = 39.3 bits (90), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 8/124 (6%) Query: 61 FQEKLINNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLG 120 FQ LI +N + + R GK+ Y A+ I I + ARE++ Sbjct: 66 FQCILIYMMVHNHYFMYLASRGQGKTWLTSVYCCVQAILFPGTKIVIASGTKGQAREVIE 125 Query: 121 R---LQTAYENLPRWMQQGIIAWNKGSLELENGS--KILAASTSASAVRGMSFNILFLDE 175 + L+ NL R ++ + N +E NGS KI+A++ A + R N+L +DE Sbjct: 126 KIDDLRKESPNLRREIEDLKTSTNDAKVEFHNGSWIKIVASNDGARSKRA---NLLIVDE 182 Query: 176 FAFV 179 F V Sbjct: 183 FRMV 186 >gi|4240|lcl|protein:vir:94734 Length: 469 # NCBI annotation: large subunit terminase # Family: family:all:523 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996704;genbank:gi:45597419;genbank:GeneID :2767958 Length = 469 Score = 37.4 bits (85), Expect = 5e-04, Method: Compositional matrix adjust. 
Identities = 37/142 (26%), Positives = 65/142 (45%), Gaps = 24/142 (16%) Query: 79 MPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQTAYENLPRWMQ---- 134 +PR+ GK+ V Y+L ++I A++ +T+ ++YE L ++++ Sbjct: 68 IPRRNGKTEIV--YILELWALEQGLSILHTAHRISTS-------HSSYEKLKKYLEDSGY 118 Query: 135 ------QGIIAWNKGSLEL-ENGSKILAASTSASAVRGMSFNILFLDEFAFVPNHVADSF 187 + I A + LEL E+G I + ++S G F+ILF+DE + + Sbjct: 119 VEGEDFKSIKAKGQERLELIESGGVIQFRTRTSSGGLGEGFDILFIDE---AQEYTTEQE 175 Query: 188 FASVYPTITSGKNTKVIIVSTP 209 A Y T+T N I+ TP Sbjct: 176 SALKY-TVTDSDNPMTIMCGTP 196 >gi|7504|lcl|protein:vir:99915 Length: 478 # NCBI annotation: gp2 # Family: family:all:523 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655519;genbank:gi:109392289;genbank:GeneI D:4157084 Length = 478 Score = 35.4 bits (80), Expect = 0.002, Method: Compositional matrix adjust. Identities = 31/138 (22%), Positives = 60/138 (43%), Gaps = 7/138 (5%) Query: 75 NICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQT--AYENLPRW 132 ++ +PRQ GK+ + + A+ + + A++ TA+E G ++ A + Sbjct: 63 SVISIPRQVGKTYLIGCIVFALALLTPGLTVIWTAHRTKTAKETFGSMKAMCATPLVNAH 122 Query: 133 MQQGIIAWNKGSLELENGSKIL-AASTSASAVRGMSFNILFLDEFAFVPNHVADSFFASV 191 ++ A + L NGS+IL A + + IL LDE + D + Sbjct: 123 VRNVSDARGDEGIYLHNGSRILFGARENGFGLGFAGVGILVLDE----AQRLTDKAMDDL 178 Query: 192 YPTITSGKNTKVIIVSTP 209 PT+ + +N +++ TP Sbjct: 179 IPTMNTVENPLILLTGTP 196 >gi|3859|lcl|protein:vir:107748 Length: 532 # NCBI annotation: gp33 TerL # Family: family:all:54 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024878;genbank:gi:48697520;genbank:GeneID :2948367 Length = 532 Score = 35.4 bits (80), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 47/233 (20%), Positives = 93/233 (39%), Gaps = 32/233 (13%) Query: 108 LANKAATARELLGRLQTAYENLPRWMQQGIIAWNKGSLELENGSKILAASTSA------- 160 L +KA + +++ ++LP + G WN + +++ T A Sbjct: 134 LVDKAGDPDSIFFKVRFFLQHLPPEFRGG---WNPHDHTHSSHMRVIIPDTGAVIRGEAG 190 Query: 161 -SAVRGMSFNILFLDEFAFVPNHVADSFFASVYPTITSGKNTKVIIVSTPHGMNHFYRMW 219 + RG +I F+DE A + N A T + I +S+ +G+N+ + Sbjct: 191 KNIGRGGRVSIQFVDEAAHLEN-------AQAVDTALAATTNCRIDISSVNGLNNPF--- 240 Query: 220 HDAEKRKNEYIPTD-VHWSEVPGRDDKWKETTIANTSEQQFKVEFECEFLGSVDTLIAPS 278 AEKR + + +HW + P +DD+W + + E + ++ S + ++ P Sbjct: 241 --AEKRFSGRVKVKTMHWRDDPRKDDEWYKKQKQKFNALVVAQEIDIDYSASAEGVLIPL 298 Query: 279 KLRTLIYDNPI--------QRNAGLDVYEPPKDKHDYVMTVDVARGVGEDYSA 323 + D + QR + LDV + KD + + + + E +S Sbjct: 299 EWIDAAIDADVKLGLTVTGQRFSSLDVADEGKDMNAFGSRLGIRMDYAESWSG 351 >gi|19904|lcl|protein:vir:3781 Length: 607 # NCBI annotation: terminase # Family: family:all:169 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536821;genbank:gi:17981830;genbank:GeneID :929209 Length = 607 Score = 33.9 bits (76), Expect = 0.005, Method: Compositional matrix adjust. 
Identities = 77/365 (21%), Positives = 136/365 (37%), Gaps = 66/365 (18%) Query: 59 YHFQEKLINNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANK------- 111 + +Q+ + N H++ NI K RQ G + L A+F+ I + A+K Sbjct: 164 FDYQKHIRANKHHDVRNILK-SRQIGATYYFSFEALEDAIFSGDNQIFLSASKRQAEIFK 222 Query: 112 ---AATARELLGRLQTAYENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSF 168 ARE G + + L NG+++ ST+ + +G S Sbjct: 223 NYIVKMAREYFG-----------------VELTGNPIILSNGAELHFLSTNKNTSQGNSG 265 Query: 169 NILFLDEFAFVPNHVADSFFASVYPTITSGKNTKVIIVSTPHGMNH----FYRM--WHDA 222 ++ + DE+A++ + + AS T + T STP H F+ W D Sbjct: 266 HV-YGDEYAWIRDFQRFNDVASAMATHAKWRET---YFSTPSSKFHESYSFWSGDNWRDG 321 Query: 223 E-KRKNEYIPT------------DVHWSEVPGRDDKWK---------ETTIANTSEQQFK 260 + KRKN PT D W V +D K E S+ F Sbjct: 322 DPKRKNVPFPTFAELRDGGRLCPDGQWRYVVTIEDALKGGAGTLFNIEKLKQRYSKYAFN 381 Query: 261 VEFECEFLGSVDTLIAPSKLRTLIYDNPIQRNAGLDVYEPPKDKHDYVMTVDVARGVGED 320 + C ++ D++ +L D ++ P D+ + G G Sbjct: 382 QLYMCVWIDDADSIFTVHQLLKCGVDISKWKDFNPKADRPFGDREVWGGFDPAHSGDG-- 439 Query: 321 YSAFVCVDITEFP---HRIVAKYRNNDIKPMLFPNIIYEVAKNYNSAYILCEVNDIGDQV 377 ++FV + P +R++A+Y+ N + + N I + + YN YI + +G V Sbjct: 440 -ASFVIIAPPALPSEKYRVLARYQWNGLSYVYQANQIRALYEKYNMTYIGIDATGVGYGV 498 Query: 378 ASILQ 382 +++ Sbjct: 499 YELVK 503 >gi|18494|lcl|protein:vir:1636 Length: 469 # NCBI annotation: putative terminase # Family: family:all:523 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695057;genbank:gi:23455748;genbank:GeneID :955481 Length = 469 Score = 32.7 bits (73), Expect = 0.011, Method: Compositional matrix adjust. 
Identities = 36/142 (25%), Positives = 64/142 (45%), Gaps = 24/142 (16%) Query: 79 MPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQTAYENLPRWMQ---- 134 +PR+ GK+ V Y+L ++I A++ +T+ ++YE L ++++ Sbjct: 68 IPRRNGKTEIV--YILELWSLVQGLSILHTAHRISTS-------HSSYEKLKKYLEDSGY 118 Query: 135 ------QGIIAWNKGSLEL-ENGSKILAASTSASAVRGMSFNILFLDEFAFVPNHVADSF 187 + I A + LEL E+G I + ++S G F+IL +DE + + Sbjct: 119 VEGEDFKSIKAKGQERLELIESGGVIQFRTRTSSGGLGEGFDILVIDE---AQEYTTEQE 175 Query: 188 FASVYPTITSGKNTKVIIVSTP 209 A Y T+T N I+ TP Sbjct: 176 SALKY-TVTDSDNPMTIMCGTP 196 >gi|16385|lcl|protein:vir:3744 Length: 607 # NCBI annotation: orf15 # Family: family:all:169 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043485;genbank:gi:9628620;genbank:GeneID: 1261142 Length = 607 Score = 32.3 bits (72), Expect = 0.014, Method: Compositional matrix adjust. Identities = 73/366 (19%), Positives = 142/366 (38%), Gaps = 68/366 (18%) Query: 59 YHFQEKLINNFHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANK------- 111 + +Q+ + +N H++ NI K RQ G + L A+F+ I + A+K Sbjct: 164 FDYQKHIRSNKHHDVRNILK-SRQIGATYYFSFEALEDAIFSGDNQIFLSASKRQAEIFK 222 Query: 112 ---AATARELLGRLQTAYENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSF 168 ARE G + + L NG+++ ST+ + +G S Sbjct: 223 NYIVKMAREYFG-----------------VELTGNPIILSNGAELHFLSTNKNTSQGNSG 265 Query: 169 NILFLDEFAFVPNHVADSFFASVYPTITSGKNTKVIIVSTPHGMNH----FYRM--WHDA 222 ++ + DE+A++ + F V + + + + STP H F+ W D Sbjct: 266 HV-YGDEYAWIRDF---QRFDDVASAMATHEKWRETYFSTPSSKFHESYSFWSGDNWRDG 321 Query: 223 E-KRKNEYIPTDVHWSEVPGR---DDKWK-----ETTIANTSEQQFKVE----------- 262 + KRKN PT + GR D +W+ E + +++ F +E Sbjct: 322 DPKRKNVPFPTFAELRD-GGRLCPDGQWRYVVTIEDALKGGADKLFNIEKLKQRYSKYAF 380 Query: 263 ---FECEFLGSVDTLIAPSKLRTLIYDNPIQRNAGLDVYEPPKDKHDYVMTVDVARGVGE 319 + C ++ D++ +L D ++ P D+ + G G Sbjct: 381 NQLYMCIWIDDADSIFNVKQLLKCGVDIAKWKDFNPKADRPFGDREVWGGFDPAHSGDG- 439 Query: 320 DYSAFVCVDITEFP---HRIVAKYRNNDIKPMLFPNIIYEVAKNYNSAYILCEVNDIGDQ 376 ++FV + P +R++A+Y+ + + + N I + + YN YI + +G Sbjct: 440 
--ASFVIIAPPALPGEKYRMLARYQWHGLSYVYQANQIRALYEKYNMTYIGIDATGVGYG 497 Query: 377 VASILQ 382 V +++ Sbjct: 498 VYELVK 503 >gi|14676|lcl|protein:vir:2499 Length: 474 # NCBI annotation: putative terminase gp4 # Family: family:all:523 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569740;genbank:gi:18496890;genbank:GeneID:932339 Length = 474 Score = 29.6 bits (65), Expect = 0.095, Method: Compositional matrix adjust. Identities = 27/134 (20%), Positives = 55/134 (41%), Gaps = 7/134 (5%) Query: 79 MPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQ--TAYENLPRWMQQG 136 +PRQTGK+ + + + + + + A++ TA E +Q + + + Sbjct: 69 IPRQTGKTYLLGALVFALCIKTPNTTVIWTAHRTRTAAETFRSMQGLAKRDKIAPHILNV 128 Query: 137 IIAWNKGSLELENGSKILAASTSASAVRGMS-FNILFLDEFAFVPNHVADSFFASVYPTI 195 K ++ +NGS+IL + RG + ++L DE + + D P Sbjct: 129 HTGNGKEAVLFKNGSRILFGARERGFGRGFAGVDVLIFDEAQILTENAMDDMV----PAT 184 Query: 196 TSGKNTKVIIVSTP 209 + N +++ TP Sbjct: 185 NAAPNPLILLAGTP 198 >gi|14590|lcl|protein:vir:8098 Length: 515 # NCBI annotation: gp2 # Family: family:all:523 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817679;genbank:gi:29566110;genbank:GeneID:1259304 Length = 515 Score = 28.9 bits (63), Expect = 0.14, Method: Compositional matrix adjust. Identities = 17/67 (25%), Positives = 36/67 (53%), Gaps = 4/67 (5%) Query: 142 KGSLELENGSKILAASTSASAVRGMSFNILFLDEFAFVPNHVADSFFASVYPTITSGKNT 201 K S +G +++ + + S RG++ N + LDE F H + S+ PT+++ + Sbjct: 140 KPSKACPDGQRVIFKARTNSGGRGLTGNKVILDE-GFALRH---AHMGSLMPTLSAVPDP 195 Query: 202 KVIIVST 208 +++I S+ Sbjct: 196 QLLIGSS 202 >gi|11877|lcl|protein:vir:79128 Length: 593 # NCBI annotation: terminase # Family: family:all:169 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165255;genbank:gi:145708080;genbank:GeneID:5247139 Length = 593 Score = 28.9 bits (63), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 20/70 (28%), Positives = 32/70 (45%), Gaps = 3/70 (4%) Query: 316 GVGEDYSAFVCVDITEFPH---RIVAKYRNNDIKPMLFPNIIYEVAKNYNSAYILCEVND 372 G G D +A V V P R++ +++ I I VA+ Y+ AY+ + Sbjct: 423 GGGGDSAALVVVAPPLVPGGKFRVLERHQFRGIDYEEQAGAIRRVAERYDVAYVGIDRTG 482 Query: 373 IGDQVASILQ 382 IGD V ++Q Sbjct: 483 IGDAVFRLVQ 492 >gi|10294|lcl|protein:vir:105656 Length: 405 # NCBI annotation: putative large terminase subunit # Family: family:all:697 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425018;genbank:gi:83571766;uniprot:Q2WC34;genbank:GeneID:3837297 Length = 405 Score = 26.2 bits (56), Expect = 1.0, Method: Compositional matrix adjust. Identities = 12/61 (19%), Positives = 27/61 (44%) Query: 69 FHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQTAYEN 128 F+ +++ + + PR K+T Y + + I +++ A A E+ G + + Sbjct: 62 FYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFRG 121 Query: 129 L 129 L Sbjct: 122 L 122 >gi|8927|lcl|protein:vir:97005 Length: 632 # NCBI annotation: 40 # Family: family:all:697 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654141;genbank:gi:108862025;genbank:GeneID:5075954 Length = 632 Score = 25.8 bits (55), Expect = 1.3, Method: Compositional matrix adjust. Identities = 16/80 (20%), Positives = 36/80 (45%), Gaps = 2/80 (2%) Query: 69 FHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQTAYEN 128 F+ +++ + + PR K+T Y + + I +++ A A E+ G + + Sbjct: 62 FYGHKYRLIEAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFRG 121 Query: 129 LP--RWMQQGIIAWNKGSLE 146 L +M I A ++ S++ Sbjct: 122 LDFLEFMLPDIYAGDRASVK 141 >gi|15201|lcl|protein:vir:7028 Length: 631 # NCBI annotation: large terminase subunit # Family: family:all:697 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853601;genbank:gi:31711683;genbank:GeneID:1481809 Length = 631 Score = 25.8 bits (55), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 31/163 (19%), Positives = 62/163 (38%), Gaps = 15/163 (9%) Query: 69 FHNNRFNICKMPRQTGKSTTVVSYLLHYAVFNDSVNIGILANKAATARELLGRLQTAYEN 128 F N++ + + R K+T Y + + I I++ A A E+ G + + Sbjct: 62 FGGNKYRMVEAQRGQAKTTIAAIYAVFRIIHEPHKRIMIVSQTAKRAEEIAGWVIKIFRG 121 Query: 129 LP--RWMQQGIIAWNKGS---------LELENGSKILAASTSASAVRGMSFNILFLDEFA 177 L +M I A +K S L + S +A + + ++G +I+ D+ Sbjct: 122 LDFLEFMLPDIYAGDKASIKGFEIHYTLRGSDKSPSVACYSIEAGMQGARADIILADDVE 181 Query: 178 FVPNHVADSFFASVYPTITSGKNTK----VIIVSTPHGMNHFY 216 + N + A + ++ +I + TP +N Y Sbjct: 182 SLQNSRTAAGRALLEDLTKEFESINQFGDIIYLGTPQSVNSIY 224 >gi|25398|lcl|protein:vir:8929 Length: 717 # NCBI annotation: ORF025 # Family: family:all:1546 # MgeID: mge:163 # MgeName: phiKZ # Cross-refs: genbank:acc:NP_803591;genbank:gi:29134961;genbank:GeneID:1258246 Length = 717 Score = 25.8 bits (55), Expect = 1.5, Method: Compositional matrix adjust. Identities = 14/70 (20%), Positives = 35/70 (50%), Gaps = 4/70 (5%) Query: 443 DYEIISELTTFISKHNSFEAEEGCNDDLAMCLVIYAWLV----QMDYFKELTDQDVRKRL 498 D + +EL ++ + +G +DDL + L++ WL+ + Y+ + +L Sbjct: 569 DKPLSTELLALTIRNGRIDHAKGNHDDLVVSLLLAHWLLIQGKNLSYYGINVPILGKSKL 628 Query: 499 YEEQKNQIEQ 508 +++ +Q+E+ Sbjct: 629 RDKEPSQLEK 638 >gi|10061|lcl|protein:vir:99071 Length: 537 # NCBI annotation: gp26 # Family: family:all:523 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655891;genbank:gi:109521463;genbank:GeneID:4158036 Length = 537 Score = 25.8 bits (55), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 37/149 (24%), Positives = 61/149 (40%), Gaps = 24/149 (16%) Query: 79 MPRQTGKSTTV-VSYLLHYAVFNDSVNI--GILANKAATARELLGRLQTAYEN------L 129 +PRQ GK+ + L+ V D + I +L N ARE RL+ EN + Sbjct: 90 VPRQNGKTAIIEARELVGLYVVCDKLCIHTAVLFN---AARESFYRLKARIENNETLNKI 146 Query: 130 PRWMQQG-----IIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDE-FAFVPNHV 183 R+ + K S G +++ + + RG S +++ LDE FA + Sbjct: 147 TRFRSGNDNMSIEVKPKKESRHPNAGGRVIYMARGTAVARGFSADVIVLDEAFALDEASI 206 Query: 184 ADSFFASVYPTITSGKNTKVIIVSTPHGM 212 A +A TS + II ++ G+ Sbjct: 207 AAIDYA------TSARANPFIIYASSTGL 229 >gi|25110|lcl|protein:vir:80755 Length: 501 # NCBI annotation: putative large terminase # Family: family:all:1430 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504114;genbank:gi:158079301;genbank:GeneID:5666404 Length = 501 Score = 25.4 bits (54), Expect = 1.8, Method: Compositional matrix adjust. Identities = 10/22 (45%), Positives = 15/22 (68%) Query: 361 YNSAYILCEVNDIGDQVASILQ 382 YN I+ +V D GD+VA ++Q Sbjct: 334 YNPDIIVADVGDSGDKVAKLMQ 355 Score = 23.9 bits (50), Expect = 5.9, Method: Compositional matrix adjust. Identities = 11/31 (35%), Positives = 18/31 (58%) Query: 17 TPIEFTEEQVIEFLKCKEDPVYFANNYIKIV 47 T ++ T EQ+ +F++ + DPV Y IV Sbjct: 3 TTVDSTNEQMTKFVQTRLDPVLQNGYYSTIV 33 >gi|19767|lcl|protein:vir:6384 Length: 677 # NCBI annotation: terminase large subunit TerL # Family: family:all:140 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918997;genbank:gi:34610172;genbank:gi:91214212;genbank:GeneID:2559603 Length = 677 Score = 24.6 bits (52), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 11/25 (44%), Positives = 16/25 (64%), Gaps = 1/25 (4%) Query: 236 WSEVPGRDDKW-KETTIANTSEQQF 259 WS P ++W K T +A+ SEQ+F Sbjct: 635 WSSPPSWAEEWDKNTLVADASEQRF 659 >gi|8126|lcl|protein:vir:102661 Length: 568 # NCBI annotation: terminase # Family: family:all:147 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024418;genbank:gi:48696639;genbank:GeneID:2948128 Length = 568 Score = 24.3 bits (51), Expect = 3.9, Method: Compositional matrix adjust. Identities = 10/24 (41%), Positives = 15/24 (62%) Query: 206 VSTPHGMNHFYRMWHDAEKRKNEY 229 ++TP G N FY++ AEK + Y Sbjct: 216 ITTPRGKNWFYKLAMHAEKSEEWY 239 >gi|15101|lcl|protein:vir:3867 Length: 563 # NCBI annotation: putative large terminase subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680484;swissprot:trembl:q8ltc3;genbank:gi:22296524;interpro:IPR005021;uniprot:Q8LTC3;genbank:GeneID:951698 Length = 563 Score = 23.5 bits (49), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 23/115 (20%), Positives = 41/115 (35%), Gaps = 21/115 (18%) Query: 77 CKMPRQTGKSTTVVSYLLH--------------YAVFNDSVNIGILANKAATARELLGRL 122 M R+ GKS + +L+ Y ND GI+ L R Sbjct: 95 ISMARKNGKSLLISGVILYEFLFGKNPANKRQLYTAANDRKQAGIVFGMVKDRLRALMRK 154 Query: 123 QTAYENLPRWMQQGIIAWNKGSLELENGSKILAASTSASAVRGMSFNILFLDEFA 177 + + + + ++ L++GS I + S V G ++ +DE+A Sbjct: 155 DPGIKRMVKITRDELV-------NLDDGSTIRSFSRDTGLVDGYEPHVAVVDEYA 202 Database: capsid_neck_tail Posted date: Nov 7, 2013 12:16 PM Number of letters in database: 206,069 Number of sequences in database: 514 Lambda K H 0.319 0.136 0.407 Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 256,884 Number of Sequences: 514 Number of extensions: 11754 Number of successful extensions: 139 Number of sequences better than 100.0: 57 Number of HSP's better than 100.0 without gapping: 39 Number of HSP's successfully gapped in prelim test: 18 Number of HSP's that attempted gapping in prelim test: 37 Number of HSP's gapped (non-prelim): 65 length of query: 549 length of database: 206,069 effective HSP length: 76 effective length of query: 473 effective length of database: 167,005 effective search space: 78993365 effective search space used: 78993365 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 39 (19.6 bits)