BLASTP 2.2.18 [Mar-02-2008] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= lcl|NC_015251.1_cdsid_YP_004300928.1 [gene=17] [protein=gp17 terminase DNA packaging enzyme large subunit] [protein_id=YP_004300928.1] [location=complement(70060..71988)] (642 letters) Database: capsid_neck_tail 514 sequences; 206,069 total letters Searching...................................................done Score E Sequences producing significant alignments: (bits) Value gi|20804|lcl|protein:vir:106290 Length: 633 # NCBI annotation: g... 876 0.0 gi|28182|lcl|protein:vir:108053 Length: 611 # NCBI annotation: g... 722 0.0 gi|24596|lcl|protein:vir:98262 Length: 609 # NCBI annotation: gp... 711 0.0 gi|27701|lcl|protein:vir:6893 Length: 611 # NCBI annotation: gp1... 701 0.0 gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: ter... 701 0.0 gi|23191|lcl|protein:vir:101186 Length: 613 # NCBI annotation: t... 701 0.0 gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp... 699 0.0 gi|27407|lcl|protein:vir:7202 Length: 610 # NCBI annotation: gp1... 698 0.0 gi|22056|lcl|protein:vir:103455 Length: 610 # NCBI annotation: t... 698 0.0 gi|22942|lcl|protein:vir:101803 Length: 613 # NCBI annotation: g... 698 0.0 gi|21077|lcl|protein:vir:100538 Length: 612 # NCBI annotation: g... 684 0.0 gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: ter... 609 e-176 gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: t... 364 e-102 gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: g... 363 e-102 gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T... 352 6e-99 gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: g... 349 5e-98 gi|16897|lcl|protein:vir:5867 Length: 508 # NCBI annotation: sim... 161 2e-41 gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: te... 64 4e-12 gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: ph... 64 5e-12 gi|17621|lcl|protein:vir:816 Length: 568 # NCBI annotation: hypo... 51 5e-08 gi|25680|lcl|protein:vir:10116 Length: 568 # NCBI annotation: hy... 51 5e-08 gi|51|lcl|protein:vir:3295 Length: 568 # NCBI annotation: putati... 51 5e-08 gi|26240|lcl|protein:vir:2763 Length: 568 # NCBI annotation: hyp... 51 5e-08 gi|25849|lcl|protein:vir:9949 Length: 568 # NCBI annotation: hyp... 51 5e-08 gi|1476|lcl|protein:vir:104436 Length: 568 # NCBI annotation: pu... 51 5e-08 gi|26461|lcl|protein:vir:573 Length: 589 # NCBI annotation: unkn... 46 1e-06 gi|20547|lcl|protein:vir:105155 Length: 580 # NCBI annotation: c... 43 1e-05 gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: te... 40 6e-05 gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: t... 38 3e-04 gi|16077|lcl|protein:vir:4155 Length: 468 # NCBI annotation: lar... 33 0.012 gi|18252|lcl|protein:vir:4193 Length: 467 # NCBI annotation: put... 32 0.018 gi|18781|lcl|protein:vir:7429 Length: 581 # NCBI annotation: gp6... 32 0.018 gi|1431|lcl|protein:vir:93626 Length: 530 # NCBI annotation: Bce... 30 0.090 gi|3478|lcl|protein:vir:101370 Length: 494 # NCBI annotation: Pa... 29 0.22 gi|1931|lcl|protein:vir:99847 Length: 486 # NCBI annotation: lar... 28 0.30 gi|25398|lcl|protein:vir:8929 Length: 717 # NCBI annotation: ORF... 28 0.39 gi|19608|lcl|protein:vir:4081 Length: 518 # NCBI annotation: ter... 27 0.78 gi|4674|lcl|protein:vir:105593 Length: 543 # NCBI annotation: te... 27 0.82 gi|6719|lcl|protein:vir:99411 Length: 447 # NCBI annotation: hyp... 27 0.93 gi|8927|lcl|protein:vir:97005 Length: 632 # NCBI annotation: 40 ... 27 1.0 gi|8997|lcl|protein:vir:101640 Length: 432 # NCBI annotation: te... 26 1.1 gi|10294|lcl|protein:vir:105656 Length: 405 # NCBI annotation: p... 26 1.1 gi|12378|lcl|protein:vir:79648 Length: 523 # NCBI annotation: Te... 26 1.1 gi|13269|lcl|protein:vir:81252 Length: 542 # NCBI annotation: gp... 25 1.9 gi|15201|lcl|protein:vir:7028 Length: 631 # NCBI annotation: lar... 25 2.0 gi|16205|lcl|protein:vir:1554 Length: 587 # NCBI annotation: DNA... 25 3.5 gi|13692|lcl|protein:vir:4897 Length: 411 # NCBI annotation: gp4... 25 3.7 gi|16780|lcl|protein:vir:2731 Length: 411 # NCBI annotation: hyp... 25 4.0 gi|4395|lcl|protein:vir:98555 Length: 589 # NCBI annotation: gp3... 24 4.7 gi|13826|lcl|protein:vir:3376 Length: 586 # NCBI annotation: DNA... 24 4.7 gi|17547|lcl|protein:vir:959 Length: 592 # NCBI annotation: term... 23 7.5 gi|9070|lcl|protein:vir:102238 Length: 588 # NCBI annotation: gp... 23 8.6 gi|8220|lcl|protein:vir:101493 Length: 588 # NCBI annotation: gp... 23 8.6 >gi|20804|lcl|protein:vir:106290 Length: 633 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944105;genbank:gi:38640149;genbank:GeneID :2658038 Length = 633 Score = 876 bits (2264), Expect = 0.0, Method: Compositional matrix adjust. Identities = 406/628 (64%), Positives = 507/628 (80%), Gaps = 4/628 (0%) Query: 1 MAHLPEEMIPFADLKKKVVTFDKPYEELFGTDKAVYVQQPHGVLNILDERGDKIIKSISS 60 M+ +P+EM P L++KVV+FD+P + +FG D+AVY QQP G L I D+R +K+I+ + S Sbjct: 1 MSLMPDEMTPVDQLERKVVSFDQPLD-VFGEDEAVYAQQPDGELVIYDDRREKVIRKLKS 59 Query: 61 CKYYKSQHDDTWYPERYEYYYEQSRFRKIKIQGTNKKDYKTYKNRFDKKSRYLGLPNLKK 120 C YYKSQHD WYPE Y+ Y E R +K+ +QG + D+K++K+RF+K++RYLGLPNLK+ Sbjct: 60 CVYYKSQHDGRWYPETYDIYSELKRVQKMNLQGKDPSDFKSFKDRFNKRTRYLGLPNLKR 119 Query: 121 ANVQTTWSQEMVEEWKKCRDDILYFAK-YCAIIHLDHGVIGINLRPYQKDMLRIMSEKRM 179 ANV T W++EMVEEWK+CRDDI+YFA+ YC+IIH+D GVI + LR YQKDMLRIM+ +RM Sbjct: 120 ANVPTKWTREMVEEWKRCRDDIVYFAETYCSIIHIDWGVIKVQLRDYQKDMLRIMASERM 179 Query: 180 SIHNLSRQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMSREVLERTKQIIELLPDFLQ 239 S+HNL RQLGKTTAT+++ H+V FNEAK VG+LAHK MS+EVLERTKQ IELLPDFLQ Sbjct: 180 SMHNLPRQLGKTTATAIFLTHFVVFNEAKAVGVLAHKGDMSKEVLERTKQSIELLPDFLQ 239 Query: 240 PGIVEWNKGSIELENGCAISAYSSDPDAVRGNSFALIYVDECGFVEAWEDCWKAILPVIS 299 PGIVEWNKG+IELENGC+I AY+S PDAVRGNSFALIYVDEC F+E +ED WKAILPVIS Sbjct: 240 PGIVEWNKGNIELENGCSIGAYASSPDAVRGNSFALIYVDECAFIEGFEDTWKAILPVIS 299 Query: 300 SGRNSKIVLTSTPNGMNHWYDLWQSAINGKSGFEPYEANWSAVKERLYHSETDSFDDGIS 359 SGR S+I+LTSTPNG+NHWYDLW+ ++ GF+PY W VKERLY +D++DDG Sbjct: 300 SGRQSRIILTSTPNGINHWYDLWEVSLKSDKGFKPYTTTWITVKERLYDG-SDAYDDGFE 358 Query: 360 WTTNQIGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTDIEATENLYRFKQPEVGHK 419 W + QI SSSVEAF QEH +FMG++GTLI GFKLSKMTW ++ A +N Y+ ++P G+K Sbjct: 359 WASKQINSSSVEAFQQEHLCRFMGTSGTLINGFKLSKMTWKEVIADDNFYQIEKPVEGNK 418 Query: 420 YFACVDPAEGRGQDYSTIQIIDVTKYPYEQVAVYHSNKISHIMFPRIIQRLAMEYNEAYV 479 Y A VDPAEGRGQDYSTIQIIDVT YPY QVAVYHSNKIS ++ P +I R AMEYN A+V Sbjct: 419 YIATVDPAEGRGQDYSTIQIIDVTSYPYRQVAVYHSNKISPLLLPSVIMRYAMEYNNAWV 478 Query: 480 YIELNSVGLSVAKDLYMDLEYENMIIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKLRIN 539 YIELNS+G VAK L++DLEYEN+I+DSS+DLGMKQTK TKA+GCSTLKDLIEKDKL ++ Sbjct: 479 YIELNSIGNMVAKSLFIDLEYENVIVDSSKDLGMKQTKVTKAVGCSTLKDLIEKDKLIVS 538 Query: 540 HKQTVIELRTFVEDGLSWSAAKDNHDDLVMALVIFAYLTTQDRFAEFLDMDEHSLAHDIF 599 HK T+ E RTFVE G+SW+A HDDLVM+L IFAYLTTQ+RF +F+D ++ D+F Sbjct: 539 HKGTIQEFRTFVEKGVSWAAQDGFHDDLVMSLCIFAYLTTQERFGDFIDA-TRNIGADVF 597 Query: 600 QDELESMNDDFNFSVFFNDGLNTIEINS 627 Q E+E M +DF +DG+NT E+++ Sbjct: 598 QSEMEEMLEDFCVGAIIDDGINTYEVDN 625 >gi|28182|lcl|protein:vir:108053 Length: 611 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595293;genbank:gi:161622599;genbank:Ge neID:5783772 Length = 611 Score = 722 bits (1863), Expect = 0.0, Method: Compositional matrix adjust. Identities = 349/602 (57%), Positives = 454/602 (75%), Gaps = 16/602 (2%) Query: 37 VQQPHGVLN---ILDERGDKIIKSISSCK--------YYKSQHDDTWYPERYEYYYEQSR 85 ++QP VL+ L+E +IK S + + KSQ DD WYPE++ Y + Sbjct: 1 MEQPVNVLSDDHPLNEGKTIVIKPPGSLERKTEEGINWIKSQWDDKWYPEKFSDYLRIHK 60 Query: 86 FRKIKIQGTNKKDYKTYKNRFDKKSRYLGLPNLKKANVQTTWSQEMVEEWKKCRDDILYF 145 KI G +++T+K++ +K++RY+GLPNLK+AN++T W++EMV EWKKCRDDI+YF Sbjct: 61 IVKIPNNGDRPDEFQTFKDKMNKRTRYMGLPNLKRANIKTQWTREMVSEWKKCRDDIVYF 120 Query: 146 AK-YCAIIHLDHGVIGINLRPYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTF 204 A+ YCAI H+D+G I + LR YQ+DML+IMS+ RM+ NLSRQLGKTT +++ H+V F Sbjct: 121 AETYCAITHIDYGTIKVQLRDYQRDMLKIMSKNRMTTCNLSRQLGKTTVVAIFLAHFVCF 180 Query: 205 NEAKNVGILAHKASMSREVLERTKQIIELLPDFLQPGIVEWNKGSIELENGCAISAYSSD 264 N+ K VGILAHK SMS EVL+RTKQ IELLPDFLQPGIVEWNKGSIEL+NG +I AY+S Sbjct: 181 NKDKAVGILAHKGSMSAEVLDRTKQAIELLPDFLQPGIVEWNKGSIELDNGSSIGAYASS 240 Query: 265 PDAVRGNSFALIYVDECGFVEAWEDCWKAILPVISSGRNSKIVLTSTPNGMNHWYDLWQS 324 PDAVRGNSFA+IY+DEC F+ + D W AI PVISSGR SKI++T+TPNG+NH+YD+W + Sbjct: 241 PDAVRGNSFAMIYIDECAFIPNFLDSWLAIQPVISSGRRSKIIITTTPNGLNHFYDIWTA 300 Query: 325 AINGKSGFEPYEANWSAVKERLYHSETDSFDDGISWTTNQIGSSSVEAFLQEHAGQFMGS 384 A+ GKSGF PY A W++VKERLY+ + D FDDG W++ I +SS+ F QEH +F G+ Sbjct: 301 AVEGKSGFAPYTAIWNSVKERLYN-DADIFDDGWEWSSQTISASSLAQFRQEHCAEFQGT 359 Query: 385 AGTLITGFKLSKMTWTDIEATEN--LYRFKQPEVGHKYFACVDPAEGRGQDYSTIQIIDV 442 +GTLI+G KL+ M W ++ EN YRF +P+ HKY A +D +EGRGQDY + IIDV Sbjct: 360 SGTLISGMKLAIMDWKEV-IPENGYFYRFHEPDPTHKYIASLDCSEGRGQDYHALHIIDV 418 Query: 443 TKYPYEQVAVYHSNKISHIMFPRIIQRLAMEYNEAYVYIELNSVGLSVAKDLYMDLEYEN 502 T +EQVAV HSN+ISH++ P I+ + MEYNEA VYIELNS G+SVAK LYMDLEYEN Sbjct: 419 TTDEWEQVAVLHSNEISHMILPDIVYKYLMEYNEAPVYIELNSTGVSVAKSLYMDLEYEN 478 Query: 503 MIIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVIELRTFVEDGLSWSAAKD 562 +I DS +DLGMKQT+ TK +GCSTLKDLIEKDKL++NHKQT++E RTF ++ LSW+A Sbjct: 479 VICDSMQDLGMKQTRRTKPVGCSTLKDLIEKDKLKLNHKQTIMEFRTFSQNKLSWAAEDG 538 Query: 563 NHDDLVMALVIFAYLTTQDRFAEFLDMDEHSLAHDIFQDELESMNDDFNFSVFFNDGLNT 622 HDDLVM+LVIFA+LTTQ +FA+F+D DE LA ++F ELE MN+++N VF + G N+ Sbjct: 539 FHDDLVMSLVIFAWLTTQQKFADFIDRDEMRLASEVFSRELEDMNEEYNPVVFVDAGDNS 598 Query: 623 IE 624 E Sbjct: 599 YE 600 >gi|24596|lcl|protein:vir:98262 Length: 609 # NCBI annotation: gp17 terminase subunit, nuclease and ATPase # Family: family:all:147 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239195;genbank:gi:66391670;genbank:GeneID :3416364 Length = 609 Score = 711 bits (1836), Expect = 0.0, Method: Compositional matrix adjust. Identities = 344/600 (57%), Positives = 441/600 (73%), Gaps = 12/600 (2%) Query: 31 TDKAVYVQQPHGVLNILDERGDKIIKSISSCKYYKSQHDDTWYPERYEYYYEQSRFRKIK 90 +D + + P + +DE G ++ +S+HD WYP + Y + + K++ Sbjct: 13 SDHPIGLMHPDYLKRKIDEAG---------MEWVQSEHDKKWYPYTFSDYLKINGIHKVE 63 Query: 91 IQGTNKKDYKTYKNRFDKKSRYLGLPNLKKANVQTTWSQEMVEEWKKCRDDILYFAK-YC 149 +QG N ++ TYKN+ +KKSRY G PNLK+A VQT W++EM+ EW KCRDDI+YFA+ YC Sbjct: 64 LQGKNPAEFATYKNKSNKKSRYNGNPNLKRAYVQTKWTKEMLMEWVKCRDDIVYFAETYC 123 Query: 150 AIIHLDHGVIGINLRPYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKN 209 AI H+D+G I + LR YQK+ML M + RM NLSRQLGKTT +++ H+V FNE K Sbjct: 124 AITHIDYGTIKVQLRDYQKEMLIEMHKNRMVTCNLSRQLGKTTVVAIFLAHFVCFNEDKY 183 Query: 210 VGILAHKASMSREVLERTKQIIELLPDFLQPGIVEWNKGSIELENGCAISAYSSDPDAVR 269 VG+LAHKASMS EVL+RTKQ IELLPDFLQPGIVEWNKGSIEL+N C I A++S PDAVR Sbjct: 184 VGVLAHKASMSAEVLDRTKQAIELLPDFLQPGIVEWNKGSIELDNKCKIGAFASSPDAVR 243 Query: 270 GNSFALIYVDECGFVEAWEDCWKAILPVISSGRNSKIVLTSTPNGMNHWYDLWQSAINGK 329 GNSFA+IY+DEC F+ + D W AI PVISSGR SKI++T+TPNG+NH+YD+W +A+ GK Sbjct: 244 GNSFAMIYIDECAFIPNFTDAWLAIQPVISSGRKSKILITTTPNGLNHFYDIWNAAVEGK 303 Query: 330 SGFEPYEANWSAVKERLY-HSETDSFDDGISWTTNQIGSSSVEAFLQEHAGQFMGSAGTL 388 SGF PY A W++VKERLY + FDDG SW+ I SS EAFLQEH +FMG+ GTL Sbjct: 304 SGFVPYTAIWTSVKERLYTDGDNGVFDDGYSWSAKMIAGSSKEAFLQEHCAEFMGTNGTL 363 Query: 389 ITGFKLSKMTWTDIEATE-NLYRFKQPEVGHKYFACVDPAEGRGQDYSTIQIIDVTKYPY 447 I+G+KLSKM+W DI+ TE N Y++K+PE GHKY A +DPAEGRGQDY + IID+T P+ Sbjct: 364 ISGWKLSKMSWIDIDETETNFYQYKKPEEGHKYVAVLDPAEGRGQDYHAMHIIDITTLPF 423 Query: 448 EQVAVYHSNKISHIMFPRIIQRLAMEYNEAYVYIELNSVGLSVAKDLYMDLEYENMIIDS 507 EQVAVYHSN+ SH++ P I+ R YNEA++YIELNS G SVAK L+ +LEYEN+I DS Sbjct: 424 EQVAVYHSNRTSHLILPDILLRYLTMYNEAWIYIELNSTGHSVAKSLFSELEYENVICDS 483 Query: 508 SRDLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVIELRTFVEDGLSWSAAKDNHDDL 567 DLGMKQTK +KA+GCSTLKDLIEKDKL IN+K+T++E RTF E G+SW+A + HDDL Sbjct: 484 YNDLGMKQTKRSKAIGCSTLKDLIEKDKLIINNKKTILEFRTFSEKGVSWAAEEGFHDDL 543 Query: 568 VMALVIFAYLTTQDRFAEFLDMDEHSLAHDIFQDELESMNDDFNFSVFFNDGLNTIEINS 627 VM+L F +LTTQ +FAEF + D+ LA+++F E E + +D V G TI + S Sbjct: 544 VMSLACFGWLTTQLKFAEFCEKDDLRLANEVFAREREQLYEDALCPVIVTSGDETISVGS 603 >gi|27701|lcl|protein:vir:6893 Length: 611 # NCBI annotation: gp17 terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861869;genbank:gi:32453660;genbank:GeneID :1494295 Length = 611 Score = 701 bits (1808), Expect = 0.0, Method: Compositional matrix adjust. Identities = 339/607 (55%), Positives = 446/607 (73%), Gaps = 16/607 (2%) Query: 37 VQQPHGVLN---ILDERGDKII---------KSISSCKYYKSQHDDTWYPERYEYYYEQS 84 ++QP LN L+E GDK++ K + KSQ D WYPE++ Y + Sbjct: 1 MEQPINALNDNHPLNE-GDKVVILPPHLAERKEEDGIHWIKSQWDGKWYPEKFSDYLRIN 59 Query: 85 RFRKIKIQGTNKKDYKTYKNRFDKKSRYLGLPNLKKANVQTTWSQEMVEEWKKCRDDILY 144 + KI + ++TYK++ +K++RY+GLPNLK+AN++T W+ EMV EWKKCRDDI+Y Sbjct: 60 KIVKIPNNSDKPELFQTYKDKNNKRTRYMGLPNLKRANIKTQWTYEMVAEWKKCRDDIVY 119 Query: 145 FAK-YCAIIHLDHGVIGINLRPYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVT 203 FA+ YCAI H+D+G I + LR YQ+DML+IMS KRM++ NLSRQLGKTT +++ H+V Sbjct: 120 FAETYCAITHIDYGTIKVQLRDYQRDMLKIMSSKRMTVCNLSRQLGKTTVVAIFLAHFVC 179 Query: 204 FNEAKNVGILAHKASMSREVLERTKQIIELLPDFLQPGIVEWNKGSIELENGCAISAYSS 263 FN+ K VGILAHK SMS EVL+RTKQ IELLPDFLQPGIVEWNKGSI+L+NG +I AY+S Sbjct: 180 FNKDKAVGILAHKGSMSAEVLDRTKQAIELLPDFLQPGIVEWNKGSIQLDNGSSIGAYAS 239 Query: 264 DPDAVRGNSFALIYVDECGFVEAWEDCWKAILPVISSGRNSKIVLTSTPNGMNHWYDLWQ 323 PDAVRGNSFA+IY+DEC F+ + D W AI PVISSGR SKI++T+TPNG+NH+YD+W Sbjct: 240 SPDAVRGNSFAMIYIDECAFIPNFIDSWLAIQPVISSGRRSKIIITTTPNGLNHFYDIWT 299 Query: 324 SAINGKSGFEPYEANWSAVKERLYHSETDSFDDGISWTTNQIGSSSVEAFLQEHAGQFMG 383 +A+ GKSGFEPY A W++VKERLY+ E D FDDG W+ I +SS+ F QEH F G Sbjct: 300 AAVEGKSGFEPYTAIWNSVKERLYNDE-DIFDDGWQWSKQTISASSLTQFRQEHTAAFEG 358 Query: 384 SAGTLITGFKLSKMTWTDIEA-TENLYRFKQPEVGHKYFACVDPAEGRGQDYSTIQIIDV 442 ++GTLI+G KL+ + + ++ + ++FK+PE GHKY A +D +EGRGQDY + IIDV Sbjct: 359 TSGTLISGMKLAILDYIEVTPDSHGFHQFKKPEEGHKYIATLDCSEGRGQDYHAMHIIDV 418 Query: 443 TKYPYEQVAVYHSNKISHIMFPRIIQRLAMEYNEAYVYIELNSVGLSVAKDLYMDLEYEN 502 T +EQV V HSN ISH++ P I+ + MEYNE +YIELNS G+SVAK LYMDLEYEN Sbjct: 419 TTDKWEQVGVLHSNTISHLILPDIVFKYLMEYNECPIYIELNSTGVSVAKSLYMDLEYEN 478 Query: 503 MIIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVIELRTFVEDGLSWSAAKD 562 +I DS DLGMKQ++ TK +GCSTLKDLIEKDKL+INH+ T+ E RTF E G+SW+A + Sbjct: 479 VICDSMNDLGMKQSRRTKPVGCSTLKDLIEKDKLKINHRATIQEFRTFSEKGVSWAAEEG 538 Query: 563 NHDDLVMALVIFAYLTTQDRFAEFLDMDEHSLAHDIFQDELESMNDDFNFSVFFNDGLNT 622 HDDLVM LVIF +L+TQ +FA++ D D+ LA ++F EL+ MNDD+ +F + N+ Sbjct: 539 YHDDLVMGLVIFGWLSTQQKFADYADKDDMRLASEVFSRELQDMNDDYAPVIFVDCASNS 598 Query: 623 IEINSAS 629 E N ++ Sbjct: 599 AEYNPSA 605 >gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891724;genbank:gi:33620526;genbank:GeneID :1725332 Length = 607 Score = 701 bits (1808), Expect = 0.0, Method: Compositional matrix adjust. Identities = 329/560 (58%), Positives = 432/560 (77%), Gaps = 3/560 (0%) Query: 66 SQHDDTWYPERYEYYYEQSRFRKIKIQGTNKKDYKTYKNRFDKKSRYLGLPNLKKANVQT 125 S+HDD WYP+++ Y + +R +KI++Q T+ +YK +K+ + ++RY+ L NL++AN++T Sbjct: 39 SKHDDKWYPKKFSDYLKLNRPQKIRMQSTDPTNYKFFKDSDNIRTRYMRLKNLRRANIKT 98 Query: 126 TWSQEMVEEWKKCRDDILYFAK-YCAIIHLDHGVIGINLRPYQKDMLRIMSEKRMSIHNL 184 ++ EM+ EWK+CR DI+YFA+ YCAI H+D+G I + LR YQKDML+IM E RMS H L Sbjct: 99 QYTPEMIAEWKRCRKDIVYFAETYCAITHIDYGTIKVQLRDYQKDMLKIMHENRMSAHKL 158 Query: 185 SRQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMSREVLERTKQIIELLPDFLQPGIVE 244 SRQLGKTTA +++ HYV FN+ K VGILAHK SM+ EVLERTKQ IELLPDFLQPGIVE Sbjct: 159 SRQLGKTTAVAIFLAHYVCFNKDKAVGILAHKGSMAVEVLERTKQAIELLPDFLQPGIVE 218 Query: 245 WNKGSIELENGCAISAYSSDPDAVRGNSFALIYVDECGFVEAWEDCWKAILPVISSGRNS 304 WNK SI LENG +I AY+S PDAVRGNSF+ IY+DEC F++ W DC+ AI PVISSGR S Sbjct: 219 WNKKSIVLENGSSIGAYASSPDAVRGNSFSFIYIDECAFIQNWTDCFLAIQPVISSGRES 278 Query: 305 KIVLTSTPNGMNHWYDLWQSAINGKSGFEPYEANWSAVKERLYHSETDSFDDGISWTTNQ 364 K+++T+TPNG+NH+YD+WQSAI+GKSG+ PYEA W +VKERLY+ + D FDDG W++ Sbjct: 279 KMIMTTTPNGLNHFYDIWQSAIDGKSGYVPYEAVWHSVKERLYN-KADIFDDGYEWSSQA 337 Query: 365 IGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTDIEATENLYRFKQPEVGHKYFACV 424 I SS+E FLQEH +F GS+GTLI LS++++ D+ Y+F++P+ G KY A + Sbjct: 338 IAGSSLEQFLQEHNAEFFGSSGTLIRATTLSRLSFIDVVNDNGFYQFEKPKEGRKYVATL 397 Query: 425 DPAEGRGQDYSTIQIIDVTKYPYEQVAVYHSNKISHIMFPRIIQRLAMEYNEAYVYIELN 484 D +EGRGQDY +QIID+T++PY+QVAVYHSN SH + P I+ + M YNE VYIELN Sbjct: 398 DCSEGRGQDYHALQIIDITEFPYKQVAVYHSNTTSHFILPDIVFKYLMMYNECPVYIELN 457 Query: 485 SVGLSVAKDLYMDLEYENMIIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTV 544 S G+S+AK L MDLEY+N+I DS DLGMKQ+K +KAMGCS LKDLIEKDKL INHK T+ Sbjct: 458 STGVSIAKSLAMDLEYDNIICDSFIDLGMKQSKRSKAMGCSALKDLIEKDKLIINHKGTI 517 Query: 545 IELRTFVEDGLSWSAAKDNHDDLVMALVIFAYLTTQDRFAEFLDMDEHSLAHDIFQDELE 604 ELRTF E G+SW+A + HDDLVM+LVIF +LTTQ++FAE+ DE +A +IF+ EL+ Sbjct: 518 QELRTFSEKGVSWAAEEGFHDDLVMSLVIFGWLTTQEKFAEYAGKDEMRIASEIFRKELD 577 Query: 605 SMNDDFNFSVFFNDGLNTIE 624 + +++ V + DG N IE Sbjct: 578 ELGEEYAPVVIY-DGANGIE 596 >gi|23191|lcl|protein:vir:101186 Length: 613 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932508;genbank:gi:37651634;genbank:GeneID :2610679 Length = 613 Score = 701 bits (1808), Expect = 0.0, Method: Compositional matrix adjust. Identities = 328/574 (57%), Positives = 435/574 (75%), Gaps = 14/574 (2%) Query: 66 SQHDDTWYPERYEYYYEQSRFRKIKIQGTNKKDYKTYKNRFDKKSRYLGLPNLKKANVQT 125 S HDD WYP ++ Y + +++KIQ + ++T+K++ +K++RYLGLPNLK+AN++ Sbjct: 32 SNHDDKWYPSTFDRYMKLQGVKRVKIQSDDPSMFRTFKDKTNKRTRYLGLPNLKRANIKI 91 Query: 126 TWSQEMVEEWKKCRDDILYFAK-YCAIIHLDHGVIGINLRPYQKDMLRIMSEKRMSIHNL 184 W++EM+ E K+C++DI+YFA+ YC I H+D+G+I + LR YQKDMLRIM+ R+ NL Sbjct: 92 KWTKEMLAERKRCKEDIVYFAENYCCIEHIDYGIIRVQLRDYQKDMLRIMAGNRLMAANL 151 Query: 185 SRQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMSREVLERTKQIIELLPDFLQPGIVE 244 SRQLGKTT +++ H+V FN AKNVGILAHKASMS EVL RTKQ +ELLPDFLQPGIVE Sbjct: 152 SRQLGKTTVVAIFLAHFVCFNSAKNVGILAHKASMSAEVLHRTKQALELLPDFLQPGIVE 211 Query: 245 WNKGSIELENGCAISAYSSDPDAVRGNSFALIYVDECGFVEAWEDCWKAILPVISSGRNS 304 WNKGSI L NGCAI A+SS PDAVRGNSFALIYVDE F+ + D W AI PVISSGR S Sbjct: 212 WNKGSITLGNGCAIGAFSSSPDAVRGNSFALIYVDEVAFIPNFTDAWMAIQPVISSGRRS 271 Query: 305 KIVLTSTPNGMNHWYDLWQSAI-------NGKSGFEPYEANWSAVKERLYH-----SETD 352 KI++T+TPNG+NHWYD+W +AI KSGF PY A WS+VKERLY S +D Sbjct: 272 KILMTTTPNGLNHWYDIWTAAITPNSSGDGSKSGFVPYTATWSSVKERLYSDGKELSGSD 331 Query: 353 S-FDDGISWTTNQIGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTDIEATENLYRF 411 S FDDG SW++ I S+++AF QEH F G++GTLI G KLSK+ W DI +N F Sbjct: 332 SYFDDGYSWSSKTIAGSALDAFQQEHNTAFQGTSGTLINGTKLSKLNWIDIPPQDNFTMF 391 Query: 412 KQPEVGHKYFACVDPAEGRGQDYSTIQIIDVTKYPYEQVAVYHSNKISHIMFPRIIQRLA 471 ++P+ G KY A +D AEGRGQDY + I D+T++PY+QVAVYHSN SH++ P ++ + Sbjct: 392 EEPKEGRKYIATLDSAEGRGQDYHAMHIFDITEFPYKQVAVYHSNTTSHLILPDVLLKYL 451 Query: 472 MEYNEAYVYIELNSVGLSVAKDLYMDLEYENMIIDSSRDLGMKQTKTTKAMGCSTLKDLI 531 Y + Y+YIELNS G+S+AK LY +L+YEN+I DS +DLG+KQTK +KA+GCSTLKDLI Sbjct: 452 NMYFQPYIYIELNSTGVSIAKSLYSELDYENVICDSYQDLGLKQTKRSKAIGCSTLKDLI 511 Query: 532 EKDKLRINHKQTVIELRTFVEDGLSWSAAKDNHDDLVMALVIFAYLTTQDRFAEFLDMDE 591 EKDKL +NHK++++ELRTF E G+SW+A + HDDLVM+LVIFA+LTTQ+RF++F + D+ Sbjct: 512 EKDKLILNHKKSIMELRTFSEKGVSWAAEEGFHDDLVMSLVIFAWLTTQERFSDFTENDD 571 Query: 592 HSLAHDIFQDELESMNDDFNFSVFFNDGLNTIEI 625 LA+++F+ E+E +NDD+ V +DG +T E+ Sbjct: 572 MRLANEVFRKEMEELNDDYMPMVIVDDGEDTFEV 605 >gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469498;genbank:gi:157311455;genbank:Ge neID:5602125 Length = 607 Score = 699 bits (1805), Expect = 0.0, Method: Compositional matrix adjust. Identities = 328/560 (58%), Positives = 431/560 (76%), Gaps = 3/560 (0%) Query: 66 SQHDDTWYPERYEYYYEQSRFRKIKIQGTNKKDYKTYKNRFDKKSRYLGLPNLKKANVQT 125 S+HDD WYP+++ Y + +R +KI++Q T+ +YK +K+ + ++RY+ L NL++AN++T Sbjct: 39 SKHDDKWYPKKFSDYLKLNRPQKIRMQSTDPTNYKVFKDSDNIRTRYMRLKNLRRANIKT 98 Query: 126 TWSQEMVEEWKKCRDDILYFAK-YCAIIHLDHGVIGINLRPYQKDMLRIMSEKRMSIHNL 184 ++ EM+ EWK+CR DI+YFA+ YCAI H+D+G I + LR YQKDML+IM E RMS H L Sbjct: 99 QYTPEMIAEWKRCRKDIVYFAETYCAITHIDYGTIKVQLRDYQKDMLKIMHENRMSAHKL 158 Query: 185 SRQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMSREVLERTKQIIELLPDFLQPGIVE 244 SRQLGKTTA +++ HYV FN+ K VGILAHK SM+ EVLERTKQ IELLPDFLQPGIVE Sbjct: 159 SRQLGKTTAVAIFLAHYVCFNKDKAVGILAHKGSMAVEVLERTKQAIELLPDFLQPGIVE 218 Query: 245 WNKGSIELENGCAISAYSSDPDAVRGNSFALIYVDECGFVEAWEDCWKAILPVISSGRNS 304 WNK SI LENG +I AY+S PDAVRGNSF+ IY+DEC F++ W DC+ AI PVISSGR S Sbjct: 219 WNKKSIVLENGSSIGAYASSPDAVRGNSFSFIYIDECAFIQNWTDCFLAIQPVISSGRES 278 Query: 305 KIVLTSTPNGMNHWYDLWQSAINGKSGFEPYEANWSAVKERLYHSETDSFDDGISWTTNQ 364 K+++T+TPNG+NH+YD+WQSAI+GKSG+ PYEA W +VKERLY+ + D FDDG W++ Sbjct: 279 KMIMTTTPNGLNHFYDIWQSAIDGKSGYVPYEAVWHSVKERLYN-KADIFDDGYEWSSQA 337 Query: 365 IGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTDIEATENLYRFKQPEVGHKYFACV 424 I SS+E FLQEH +F GS+GTLI LS++++ D+ Y+F++P+ G KY A + Sbjct: 338 IAGSSLEQFLQEHNAEFFGSSGTLIRATTLSRLSFIDVVNDNGFYQFEKPKEGRKYVATL 397 Query: 425 DPAEGRGQDYSTIQIIDVTKYPYEQVAVYHSNKISHIMFPRIIQRLAMEYNEAYVYIELN 484 D +EGRGQDY +QIID+T++PY+ VAVYHSN SH + P I+ + M YNE VYIELN Sbjct: 398 DCSEGRGQDYHALQIIDITEFPYKPVAVYHSNTTSHFILPDIVFKYLMMYNECPVYIELN 457 Query: 485 SVGLSVAKDLYMDLEYENMIIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTV 544 S G+S+AK L MDLEY+N+I DS DLGMKQ+K +KAMGCS LKDLIEKDKL INHK T+ Sbjct: 458 STGVSIAKSLAMDLEYDNIICDSFIDLGMKQSKRSKAMGCSALKDLIEKDKLIINHKGTI 517 Query: 545 IELRTFVEDGLSWSAAKDNHDDLVMALVIFAYLTTQDRFAEFLDMDEHSLAHDIFQDELE 604 ELRTF E G+SW+A + HDDLVM+LVIF +LTTQ++FAE+ DE +A +IF+ EL+ Sbjct: 518 QELRTFSEKGVSWAAEEGFHDDLVMSLVIFGWLTTQEKFAEYAGKDEMRIASEIFRKELD 577 Query: 605 SMNDDFNFSVFFNDGLNTIE 624 + +++ V + DG N IE Sbjct: 578 ELGEEYAPVVIY-DGANGIE 596 >gi|27407|lcl|protein:vir:7202 Length: 610 # NCBI annotation: gp17 terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049776;genbank:gi:9632591;genbank:GeneID: 1258647 Length = 610 Score = 698 bits (1802), Expect = 0.0, Method: Compositional matrix adjust. Identities = 343/601 (57%), Positives = 443/601 (73%), Gaps = 15/601 (2%) Query: 37 VQQPHGVLN---ILDERGDKIIKSIS--------SCKYYKSQHDDTWYPERYEYYYEQSR 85 ++QP VLN L+E G +IK S + KSQ D WYPE++ Y + Sbjct: 1 MEQPINVLNDFHPLNEAGKILIKHPSLAERKDEDGIHWIKSQWDGKWYPEKFSDYLRLHK 60 Query: 86 FRKIKIQGTNKKDYKTYKNRFDKKSRYLGLPNLKKANVQTTWSQEMVEEWKKCRDDILYF 145 KI + ++TYK++ +K+SRY+GLPNLK+AN++T W++EMVEEWKKCRDDI+YF Sbjct: 61 IVKIPNNSDKPELFQTYKDKNNKRSRYMGLPNLKRANIKTQWTREMVEEWKKCRDDIVYF 120 Query: 146 AK-YCAIIHLDHGVIGINLRPYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTF 204 A+ YCAI H+D+GVI + LR YQ+DML+IMS KRM++ NLSRQLGKTT +++ H+V F Sbjct: 121 AETYCAITHIDYGVIKVQLRDYQRDMLKIMSSKRMTVCNLSRQLGKTTVVAIFLAHFVCF 180 Query: 205 NEAKNVGILAHKASMSREVLERTKQIIELLPDFLQPGIVEWNKGSIELENGCAISAYSSD 264 N+ K VGILAHK SMS EVL+RTKQ IELLPDFLQPGIVEWNKGSIEL+NG +I AY+S Sbjct: 181 NKDKAVGILAHKGSMSAEVLDRTKQAIELLPDFLQPGIVEWNKGSIELDNGSSIGAYASS 240 Query: 265 PDAVRGNSFALIYVDECGFVEAWEDCWKAILPVISSGRNSKIVLTSTPNGMNHWYDLWQS 324 PDAVRGNSFA+IY+DEC F+ + D W AI PVISSGR SKI++T+TPNG+NH+YD+W + Sbjct: 241 PDAVRGNSFAMIYIDECAFIPNFHDSWLAIQPVISSGRRSKIIITTTPNGLNHFYDIWTA 300 Query: 325 AINGKSGFEPYEANWSAVKERLYHSETDSFDDGISWTTNQIGSSSVEAFLQEHAGQFMGS 384 A+ GKSGFEPY A W++VKERLY+ E D FDDG W+ I SS+ F QEH F G+ Sbjct: 301 AVEGKSGFEPYTAIWNSVKERLYNDE-DIFDDGWQWSIQTINGSSLAQFRQEHTAAFEGT 359 Query: 385 AGTLITGFKLSKMTWTDIEATEN-LYRFKQPEVGHKYFACVDPAEGRGQDYSTIQIIDVT 443 +GTLI+G KL+ M + ++ ++ ++FK+PE KY A +D +EGRGQDY + IIDVT Sbjct: 360 SGTLISGMKLAVMDFIEVTPDDHGFHQFKKPEPDRKYIATLDCSEGRGQDYHALHIIDVT 419 Query: 444 KYPYEQVAVYHSNKISHIMFPRIIQRLAMEYNEAYVYIELNSVGLSVAKDLYMDLEYENM 503 +EQV V HSN ISH++ P I+ R +EYNE VYIELNS G+SVAK LYMDLEYE + Sbjct: 420 DDVWEQVGVLHSNTISHLILPDIVMRYLVEYNECPVYIELNSTGVSVAKSLYMDLEYEGV 479 Query: 504 IIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVIELRTFVEDGLSWSAAKDN 563 I DS DLGMKQTK TKA+GCSTLKDLIEKDKL I+H+ T+ E RTF E G+SW+A + Sbjct: 480 ICDSYTDLGMKQTKRTKAVGCSTLKDLIEKDKLIIHHRATIQEFRTFSEKGVSWAAEEGY 539 Query: 564 HDDLVMALVIFAYLTTQDRFAEFLDMDEHSLAHDIFQDELESMNDDFNFSVFFNDGLNTI 623 HDDLVM+LVIF +L+TQ +F ++ D D+ LA ++F EL+ M+DD+ V F D +++ Sbjct: 540 HDDLVMSLVIFGWLSTQSKFIDYADKDDMRLASEVFSKELQDMSDDYA-PVIFVDSVHSA 598 Query: 624 E 624 E Sbjct: 599 E 599 >gi|22056|lcl|protein:vir:103455 Length: 610 # NCBI annotation: terminase subunit nuclease and ATPase # Family: family:all:147 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803107;genbank:gi:116326387;genbank:GeneI D:4405484 Length = 610 Score = 698 bits (1802), Expect = 0.0, Method: Compositional matrix adjust. Identities = 342/601 (56%), Positives = 442/601 (73%), Gaps = 15/601 (2%) Query: 37 VQQPHGVLN---ILDERGDKIIKSIS--------SCKYYKSQHDDTWYPERYEYYYEQSR 85 ++QP LN L+E G +IK S + KSQ D WYPE++ Y + Sbjct: 1 MEQPINALNDFHPLNEAGKILIKHPSLAERKDEDGIHWIKSQWDGKWYPEKFSDYLRLHK 60 Query: 86 FRKIKIQGTNKKDYKTYKNRFDKKSRYLGLPNLKKANVQTTWSQEMVEEWKKCRDDILYF 145 KI + ++TYK++ +K+SRY+GLPNLK+AN++T W++EMVEEWKKCRDDI+YF Sbjct: 61 IVKIPNNSDKPELFQTYKDKNNKRSRYMGLPNLKRANIKTQWTREMVEEWKKCRDDIVYF 120 Query: 146 AK-YCAIIHLDHGVIGINLRPYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTF 204 A+ YCAI H+D+GVI + LR YQ+DML+IMS KRM++ NLSRQLGKTT +++ H+V F Sbjct: 121 AETYCAITHIDYGVIKVQLRDYQRDMLKIMSSKRMTVCNLSRQLGKTTVVAIFLAHFVCF 180 Query: 205 NEAKNVGILAHKASMSREVLERTKQIIELLPDFLQPGIVEWNKGSIELENGCAISAYSSD 264 N+ K VGILAHK SMS EVL+RTKQ IELLPDFLQPGIVEWNKGSIEL+NG +I AY+S Sbjct: 181 NKDKAVGILAHKGSMSAEVLDRTKQAIELLPDFLQPGIVEWNKGSIELDNGSSIGAYASS 240 Query: 265 PDAVRGNSFALIYVDECGFVEAWEDCWKAILPVISSGRNSKIVLTSTPNGMNHWYDLWQS 324 PDAVRGNSFA+IY+DEC F+ + D W AI PVISSGR SKI++T+TPNG+NH+YD+W + Sbjct: 241 PDAVRGNSFAMIYIDECAFIPNFHDSWLAIQPVISSGRRSKIIITTTPNGLNHFYDIWTA 300 Query: 325 AINGKSGFEPYEANWSAVKERLYHSETDSFDDGISWTTNQIGSSSVEAFLQEHAGQFMGS 384 A+ GKSGFEPY A W++VKERLY+ E D FDDG W+ I S++ F QEH F G+ Sbjct: 301 AVEGKSGFEPYTAIWNSVKERLYNDE-DIFDDGWQWSIQTINGSTLAQFRQEHTAAFEGT 359 Query: 385 AGTLITGFKLSKMTWTDIEATEN-LYRFKQPEVGHKYFACVDPAEGRGQDYSTIQIIDVT 443 +GTLI+G KL+ M + ++ ++ +RFK+PE KY A +D +EGRGQDY + IIDVT Sbjct: 360 SGTLISGMKLAIMDFIEVTPDDHGFHRFKKPEPDRKYIATLDCSEGRGQDYHALHIIDVT 419 Query: 444 KYPYEQVAVYHSNKISHIMFPRIIQRLAMEYNEAYVYIELNSVGLSVAKDLYMDLEYENM 503 +EQV V HSN ISH++ P I+ R +EYNE VYIELNS G+SVAK LYMDLEYE + Sbjct: 420 DDVWEQVGVLHSNTISHLILPDIVMRYLVEYNECPVYIELNSTGVSVAKSLYMDLEYEGV 479 Query: 504 IIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVIELRTFVEDGLSWSAAKDN 563 I DS DLGMKQTK TKA+GCSTLKDLIEKDKL I+H+ T+ E RTF E G+SW+A + Sbjct: 480 ICDSYTDLGMKQTKRTKAVGCSTLKDLIEKDKLIIHHRATIQEFRTFSEKGVSWAAEEGY 539 Query: 564 HDDLVMALVIFAYLTTQDRFAEFLDMDEHSLAHDIFQDELESMNDDFNFSVFFNDGLNTI 623 HDDLVM+LVIF +L+TQ +F ++ D D+ LA ++F EL+ M+DD+ V F D +++ Sbjct: 540 HDDLVMSLVIFGWLSTQSKFIDYADKDDMRLASEVFSKELQDMSDDYA-PVIFVDSVHSA 598 Query: 624 E 624 E Sbjct: 599 E 599 >gi|22942|lcl|protein:vir:101803 Length: 613 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238880;genbank:gi:66391955;genbank:GeneID :3416630 Length = 613 Score = 698 bits (1801), Expect = 0.0, Method: Compositional matrix adjust. Identities = 327/574 (56%), Positives = 434/574 (75%), Gaps = 14/574 (2%) Query: 66 SQHDDTWYPERYEYYYEQSRFRKIKIQGTNKKDYKTYKNRFDKKSRYLGLPNLKKANVQT 125 S DD WYP ++ Y + +++KIQ + ++T+K++ +K++RYLGLPNLK+AN++ Sbjct: 32 SNQDDKWYPSTFDRYMKLQGVKRVKIQSDDPSMFRTFKDKTNKRTRYLGLPNLKRANIKI 91 Query: 126 TWSQEMVEEWKKCRDDILYFAK-YCAIIHLDHGVIGINLRPYQKDMLRIMSEKRMSIHNL 184 W++EM+ E K+C++DI+YFA+ YC I H+D+G+I + LR YQKDMLRIM+ R+ NL Sbjct: 92 KWTKEMLAERKRCKEDIVYFAENYCCIEHIDYGIIRVQLRDYQKDMLRIMAGNRLMAANL 151 Query: 185 SRQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMSREVLERTKQIIELLPDFLQPGIVE 244 SRQLGKTT +++ H+V FN AKNVGILAHKASMS EVL RTKQ +ELLPDFLQPGIVE Sbjct: 152 SRQLGKTTVVAIFLAHFVCFNSAKNVGILAHKASMSAEVLHRTKQALELLPDFLQPGIVE 211 Query: 245 WNKGSIELENGCAISAYSSDPDAVRGNSFALIYVDECGFVEAWEDCWKAILPVISSGRNS 304 WNKGSI L NGCAI A+SS PDAVRGNSFALIYVDE F+ + D W AI PVISSGR S Sbjct: 212 WNKGSITLGNGCAIGAFSSSPDAVRGNSFALIYVDEVAFIPNFTDAWMAIQPVISSGRRS 271 Query: 305 KIVLTSTPNGMNHWYDLWQSAI-------NGKSGFEPYEANWSAVKERLYH-----SETD 352 KI++T+TPNG+NHWYD+W +AI KSGF PY A WS+VKERLY S +D Sbjct: 272 KILMTTTPNGLNHWYDIWTAAITPNSSGDGSKSGFVPYTATWSSVKERLYSDGKELSGSD 331 Query: 353 S-FDDGISWTTNQIGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTDIEATENLYRF 411 S FDDG SW++ I S+++AF QEH F G++GTLI G KLSK+ W DI +N F Sbjct: 332 SYFDDGYSWSSKTIAGSALDAFQQEHNTAFQGTSGTLINGTKLSKLNWIDIPPQDNFTMF 391 Query: 412 KQPEVGHKYFACVDPAEGRGQDYSTIQIIDVTKYPYEQVAVYHSNKISHIMFPRIIQRLA 471 ++P+ G KY A +D AEGRGQDY + I D+T++PY+QVAVYHSN SH++ P ++ + Sbjct: 392 EEPKEGRKYIATLDSAEGRGQDYHAMHIFDITEFPYKQVAVYHSNTTSHLILPDVLLKYL 451 Query: 472 MEYNEAYVYIELNSVGLSVAKDLYMDLEYENMIIDSSRDLGMKQTKTTKAMGCSTLKDLI 531 Y + Y+YIELNS G+S+AK LY +L+YEN+I DS +DLG+KQTK +KA+GCSTLKDLI Sbjct: 452 NMYFQPYIYIELNSTGVSIAKSLYSELDYENVICDSYQDLGLKQTKRSKAIGCSTLKDLI 511 Query: 532 EKDKLRINHKQTVIELRTFVEDGLSWSAAKDNHDDLVMALVIFAYLTTQDRFAEFLDMDE 591 EKDKL +NHK++++ELRTF E G+SW+A + HDDLVM+LVIFA+LTTQ+RF++F + D+ Sbjct: 512 EKDKLILNHKKSIMELRTFSEKGVSWAAEEGFHDDLVMSLVIFAWLTTQERFSDFTENDD 571 Query: 592 HSLAHDIFQDELESMNDDFNFSVFFNDGLNTIEI 625 LA+++F+ E+E +NDD+ V +DG +T E+ Sbjct: 572 MRLANEVFRKEMEELNDDYMPMVIVDDGEDTFEV 605 >gi|21077|lcl|protein:vir:100538 Length: 612 # NCBI annotation: gp17 terminase subunit # Family: family:all:147 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656379;genbank:gi:109290130;genbank:GeneI D:4156516 Length = 612 Score = 684 bits (1765), Expect = 0.0, Method: Compositional matrix adjust. Identities = 332/615 (53%), Positives = 441/615 (71%), Gaps = 26/615 (4%) Query: 28 LFGTDKAVYVQQPHGVLNILDERGDKIIKSISSCKYYKSQHDDTWYPERYEYYYEQSRFR 87 +F +D + + P + + E G+ I+ S HDD WYP ++ Y + + Sbjct: 3 IFESDHPLQMPHPSTLKREMREDGEWIL----------SNHDDKWYPSTFDRYLKSQGVK 52 Query: 88 KIKIQGTNKKDYKTYKNRFDKKSRYLGLPNLKKANVQTTWSQEMVEEWKKCRDDILYFAK 147 ++KIQ + ++T+K++ +K+SRY GLPNLK+AN++ W++EM+ E K+C++DI+YFA+ Sbjct: 53 RVKIQADDPSMFRTFKDKTNKRSRYNGLPNLKRANIKIKWTKEMLAERKRCKEDIVYFAE 112 Query: 148 -YCAIIHLDHGVIGINLRPYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNE 206 YC I H+D+G+I + LR YQKDMLRIM+ R+ NLSRQLGKTT +++ H+V FN Sbjct: 113 NYCCIEHIDYGIIRVQLRDYQKDMLRIMAGNRLMAANLSRQLGKTTVVAIFLAHFVCFNS 172 Query: 207 AKNVGILAHKASMSREVLERTKQIIELLPDFLQPGIVEWNKGSIELENGCAISAYSSDPD 266 AKNVGILAHKASMS EVL RTKQ +ELLPDFLQPGIVEWNKGSI L NGCAI A+SS PD Sbjct: 173 AKNVGILAHKASMSAEVLHRTKQALELLPDFLQPGIVEWNKGSITLGNGCAIGAFSSSPD 232 Query: 267 AVRGNSFALIYVDECGFVEAWEDCWKAILPVISSGRNSKIVLTSTPNGMNHWYDLWQSAI 326 AVRGNSFALIY+DE F+ + D W AI PVISSGR+SKI++T+TPNG+NHWYD+W +AI Sbjct: 233 AVRGNSFALIYIDEVAFIPNFNDAWLAIQPVISSGRHSKILMTTTPNGLNHWYDIWTAAI 292 Query: 327 -------NGKSGFEPYEANWSAVKERLYHSETDSFDDGIS-WTTNQIGSS------SVEA 372 KSGF PY A WS+VKER+Y S+ D I TT+ +G ++ A Sbjct: 293 TPNSDGSGSKSGFVPYTATWSSVKERMY-SDGSKTDGAIHILTTDILGQPRQSPVLALRA 351 Query: 373 FLQEHAGQFMGSAGTLITGFKLSKMTWTDIEATENLYRFKQPEVGHKYFACVDPAEGRGQ 432 F QEH F G++GTLI GFKLSKMTW ++ A++N FK+P GHKY A +D AEGRGQ Sbjct: 352 FQQEHNTAFQGTSGTLINGFKLSKMTWKEVPASDNFTMFKEPIEGHKYIATLDSAEGRGQ 411 Query: 433 DYSTIQIIDVTKYPYEQVAVYHSNKISHIMFPRIIQRLAMEYNEAYVYIELNSVGLSVAK 492 DY + I D+T++PYEQVAVYHSN SH++ P ++ + Y + Y+YIELN+ G+S+AK Sbjct: 412 DYHAMHIYDITEFPYEQVAVYHSNTTSHLILPDVLLKYLNMYYQPYIYIELNATGVSIAK 471 Query: 493 DLYMDLEYENMIIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVIELRTFVE 552 LY +LEYEN+I DS DLGMKQTK +KA+GCSTLKDLIEK+KL + HK T++ELRTF E Sbjct: 472 SLYSELEYENIICDSYNDLGMKQTKRSKAIGCSTLKDLIEKEKLVLYHKGTIMELRTFSE 531 Query: 553 DGLSWSAAKDNHDDLVMALVIFAYLTTQDRFAEFLDMDEHSLAHDIFQDELESMNDDFNF 612 G+SW+A HDDLVM+LVIFA+LTTQ RF++F + D+ LA++IF+ E+E++ DD+ Sbjct: 532 KGVSWAAEDGFHDDLVMSLVIFAWLTTQPRFSDFTERDDMRLANEIFRQEMENLYDDYTP 591 Query: 613 SVFFNDGLNTIEINS 627 V + G T E+ S Sbjct: 592 VVIVDSGEETFEVGS 606 >gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899601;genbank:gi:34419588;genbank:GeneID :2546011 Length = 600 Score = 609 bits (1571), Expect = e-176, Method: Compositional matrix adjust. Identities = 300/577 (51%), Positives = 401/577 (69%), Gaps = 16/577 (2%) Query: 57 SISSCKYYKSQHDDTWYP---ERYEYYYEQSRFRKIKIQGTNKKDYKTYKNRFDKKSRYL 113 +I+ KY +S D WYP + ++ + K++IQ T+ D+KTYK++ +++SRY+ Sbjct: 12 TINGIKYVQSNEDMQWYPRYLDDWKVLNRMDKAHKLRIQSTDPSDFKTYKDKGNRRSRYM 71 Query: 114 GLPNLKKANVQTTWSQEMVE---EWKKCRDDILYFAK-YCAIIHLDHGVIGINLRPYQKD 169 +PNL++AN + + E E++KCRDDI+YFA+ YC+I+H+D G I + RPYQK+ Sbjct: 72 NIPNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRPYQKE 131 Query: 170 MLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMSREVLERTKQ 229 ML + R SI L RQLGKTT ++ HY+ FNE K GILAHK SMS EVLER K Sbjct: 132 MLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLERVKN 191 Query: 230 IIELLPDFLQPGIVEWNKGSIELENGCAISAYSSDPDAVRGNSFALIYVDECGFVEAWED 289 +IE LPDFLQPGI EWNKG+I +NGC + AY+S DAVRG SF++IYVDEC FV ++D Sbjct: 192 VIENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPGFDD 251 Query: 290 CWKAILPVISSGRNSKIVLTSTPNGMNHWYDLWQSAINGKSGFEPYEANWSAVKERLYHS 349 WKA PVISSG SK+VLTSTPNG+NH++D+W +A+ G S FEPY W AV+ RLY Sbjct: 252 FWKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRAVQNRLY-- 309 Query: 350 ETDSFDDGISWTTNQIGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTD-IEATENL 408 + FDDG ++ IG++S EAF QEH F+G+AGTLI GFKLSKM D ++ ++ Sbjct: 310 KDGEFDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVVKDSDGW 369 Query: 409 YRFKQPEVGHKYFACVDPAEGRGQDYSTIQIIDVTKYPYEQVAVYHSNKISHIMFPRIIQ 468 +K+PE GHKY VD +EGRGQDY + +IDVT YP+EQVAV+H NK SH++ P II Sbjct: 370 CVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAIIM 429 Query: 469 RLAMEYNEAYVYIELNSVGLSVAKDLYMDLEYENMIID-----SSRDLGMKQTKTTKAMG 523 + A YNEAYVY E+ S G V +L+ DLEYEN+I++ R LG+K K TKA+G Sbjct: 430 KQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEERASGGRRGLGLKPNKKTKAIG 489 Query: 524 CSTLKDLIEKDKLRINHKQTVIELRTFVEDGLSWSAAKDNHDDLVMALVIFAYLTTQDRF 583 CSTLKDLIEKD+L+INH T+ E TFVE G SW A + HDDLVM+L + AYL+TQDRF Sbjct: 490 CSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMSLTLLAYLSTQDRF 549 Query: 584 AEFLDMDEHSLAHDIFQDELESMNDDFNFSVFFNDGL 620 ++F++ E+++++DIF+ E+ M DD + DG+ Sbjct: 550 SDFVE-KEYNVSYDIFKQEVHDMMDDDVPFLMIADGI 585 >gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: terminase large subunit gp17 # Family: family:all:147 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195134;genbank:gi:58532911;uniprot:Q5GQN8 ;genbank:GeneID:3260486 Length = 548 Score = 364 bits (935), Expect = e-102, Method: Compositional matrix adjust. Identities = 202/521 (38%), Positives = 301/521 (57%), Gaps = 31/521 (5%) Query: 112 YLGLPNLKKANVQTTWSQEMVEEWKKCRDDILYFAK-YCAIIHLDHGVIGINLRPYQKDM 170 YLG P LKKANV+ +++E V+EW KC +D +YF K Y I+ LD G++ + +Q+++ Sbjct: 7 YLGNPLLKKANVKIDFTKEQVKEWIKCANDPVYFTKNYVKIVSLDEGLVPFKMWDFQEEL 66 Query: 171 LRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMSREVLERTKQI 230 + + R +I L RQ GK+T Y HY+ FN+ N+GILA+KAS +R++L R Sbjct: 67 IMKFHKNRFNIAKLPRQTGKSTTVVSYLLHYLIFNDNVNIGILANKASTARDLLARLATA 126 Query: 231 IELLPDFLQPGIVEWNKGSIELENGCAISAYSSDPDAVRGNSFALIYVDECGFV--EAWE 288 E LP ++Q G+V WNKG+IELENG I A S+ AVRG SF +I++DE FV + Sbjct: 127 YENLPKWIQQGVVVWNKGNIELENGSKILAASTSASAVRGMSFNIIFLDEFAFVPNHIAD 186 Query: 289 DCWKAILPVISSGRNSKIVLTSTPNGMNHWYDLWQSAINGKSGFEPYEANWSAVKERLYH 348 + ++ P I+SG+++K+++ STP GMNH+Y +W A NG++G+ +E +WS V R Sbjct: 187 SFFASVYPTITSGKSTKVIIISTPQGMNHFYKMWVDATNGRNGYTFHEVHWSQVPGR--- 243 Query: 349 SETDSFDDGISWTTNQIGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTD-IEATEN 407 D+ W I ++S F QE +F+GS TLI KL + + D I+ + Sbjct: 244 ------DE--KWKEETIKNTSERQFTQEFECEFLGSVDTLIAASKLKALVFNDPIKRNKG 295 Query: 408 LYRFKQPEVGHKYFACVDPAEGRGQDYSTIQIIDVTKYPYEQVAVYHSNKISHIMFPRII 467 L +++P+ +Y VD + G G DYS I D+T PY+ V Y +N+I ++FP II Sbjct: 296 LDIYEEPKEKSEYLMTVDVSRGIGGDYSAFIIFDITTVPYKVVGKYRNNEIKPMLFPNII 355 Query: 468 QRLAMEYNEAYVYIELNSVGLSVAKDLYMDLEYENMIIDSSR----------------DL 511 LA YN A+V E+N +G VA L DLEY N+++ + R L Sbjct: 356 NDLARSYNNAWVLCEVNDIGDQVASILNYDLEYPNVLMCAMRGRAGQLVGQGFSGSKTQL 415 Query: 512 GMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVIELRTFVEDGLSWSAAKDNHDDLVMAL 571 G+K + T K +GC+ LK ++E+DKL N + EL TF++ S+ A + HDDLVM + Sbjct: 416 GVKMSITVKKVGCANLKTIVEEDKLIFNDYDIINELTTFIQKKQSFEADEGFHDDLVMCM 475 Query: 572 VIFAYLTTQDRFAEFLDMDEHSLAHDIFQDELESMNDDFNF 612 VIFA+L QD F E D D +D ++++E F F Sbjct: 476 VIFAWLVQQDYFKEMTDNDIRQRIYDEQKNQIEQDMAPFGF 516 >gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214662;genbank:gi:61806303;genbank:GeneID :3294591 Length = 550 Score = 363 bits (932), Expect = e-102, Method: Compositional matrix adjust. Identities = 214/543 (39%), Positives = 301/543 (55%), Gaps = 34/543 (6%) Query: 108 KKSRYLGLPNLKKANVQTTWSQEMVEEWKKCRDDILYFA-KYCAIIHLDHGVIGINLRPY 166 K+ YLG PNLKKANV T ++++ V E+ KC D +YF KY I+ LD GVI ++ + Sbjct: 4 KQEIYLGNPNLKKANVSTQFTKKQVAEYMKCAQDPVYFIRKYIRIVSLDEGVIPFDMYNF 63 Query: 167 QKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMSREVLER 226 Q+DM+ + R +I L RQ GK+T + Y YV FN NV ILA+KA +RE+L R Sbjct: 64 QEDMVTKFHQHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAPTAREMLGR 123 Query: 227 TKQIIELLPDFLQPGIVEWNKGSIELENGCAISAYSSDPDAVRGNSFALIYVDECGFV-- 284 + E LP ++Q GI+ WNKGS+ELENG I A S+ AVRG SF +I++DE FV Sbjct: 124 LQLSYENLPKWMQQGILGWNKGSLELENGSKILASSTSASAVRGMSFNIIFLDEFAFVPN 183 Query: 285 EAWEDCWKAILPVISSGRNSKIVLTSTPNGMNHWYDLWQSAINGKSGFEPYEANWSAVKE 344 E + ++ P ISSG+++K+++ STP+GMN +Y LW A G + + E +WS V Sbjct: 184 HIAEQFFASVYPTISSGKSTKVIIISTPHGMNQFYKLWHDAERGANNYVATEVHWSQVPG 243 Query: 345 RLYHSETDSFDDGISWTTNQIGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTD-IE 403 R DD W I ++S F E +F+GS TLIT KL M + D I+ Sbjct: 244 R---------DD--KWKQQTIENTSEAQFRVEFECEFLGSVDTLITPSKLRIMPYKDPIQ 292 Query: 404 ATENLYRFKQPEVGHKYFACVDPAEGRGQDYSTIQIIDVTKYPYEQVAVYHSNKISHIMF 463 L ++ + H Y VD + G G DYS +ID T PY+ VA Y +N+I ++F Sbjct: 293 ENRGLAVYEHVQENHNYIITVDVSRGVGNDYSAFCVIDTTTVPYKVVARYKNNQIKPLVF 352 Query: 464 PRIIQRLAMEYNEAYVYIELNSVGLSVAKDLYMDLEYENMIIDSSR-------------- 509 P +I +A YN AYV E+N +G VA + DLEYEN+++ S R Sbjct: 353 PNLIVDVATNYNGAYVLCEVNDIGGQVADIIQYDLEYENLLMVSMRGRAGQQLGQGFSGK 412 Query: 510 --DLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVIELRTFVEDGLSWSAAKDNHDDL 567 LG+K + K +GCS LK LIE DKL + T+ EL TF++ G S+ A +DDL Sbjct: 413 KTQLGIKMSTAVKQVGCSNLKALIEDDKLIVEDYDTIAELTTFIQKGQSFQAEDGCNDDL 472 Query: 568 VMALVIFAYLTTQDRFAEFLDMDEHSLAHDIFQDELESMNDDFNFSVFFNDGLNTIEINS 627 M LVIF+++ Q F E D D + I++D+ + + D F +DGL + Sbjct: 473 AMCLVIFSWMAMQPYFKEMHDND---VRQRIYEDQRDQIEQDMAPFGFVSDGLEEDQFQD 529 Query: 628 ASG 630 A G Sbjct: 530 AQG 532 >gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T4-like DNA packaging large subunit terminase # Family: family:all:147 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214360;genbank:gi:61806000;genbank:GeneID :3294466 Length = 547 Score = 352 bits (904), Expect = 6e-99, Method: Compositional matrix adjust. Identities = 202/548 (36%), Positives = 304/548 (55%), Gaps = 36/548 (6%) Query: 112 YLGLPNLKKANVQTTWSQEMVEEWKKCRDDILYFAK-YCAIIHLDHGVIGINLRPYQKDM 170 YLG PNLKKAN +S++ + E+ KC++D +YF + Y I+ LD G++ N+ +Q+ + Sbjct: 5 YLGNPNLKKANTPIEFSKDNIREFLKCKEDPVYFTRNYIKIVSLDEGLVPFNMYDFQEKL 64 Query: 171 LRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMSREVLERTKQI 230 + E R +I + RQ GK+T Y HY FN+ NV +LA+KAS +R++L R + Sbjct: 65 ITRFHENRFNICKMPRQTGKSTTCISYLLHYAVFNDNVNVAVLANKASTARDLLGRLQLA 124 Query: 231 IELLPDFLQPGIVEWNKGSIELENGCAISAYSSDPDAVRGNSFALIYVDECGFV--EAWE 288 E LP ++Q GI+ WNKGS+ELENG ISA S+ AVRG S+ +I++DE F+ + Sbjct: 125 YENLPRWMQQGIISWNKGSLELENGSKISANSTSSSAVRGGSYNVIFLDEFAFIPNHIAD 184 Query: 289 DCWKAILPVISSGRNSKIVLTSTPNGMNHWYDLWQSAINGKSGFEPYEANWSAVKERLYH 348 D + ++ P I+SG+++K+++ STP GMNH+Y +W + GKS + + +WS V R Sbjct: 185 DFFASVYPTITSGQSTKVIIVSTPRGMNHFYRMWHDSEKGKSEYVATDVHWSEVPGR--- 241 Query: 349 SETDSFDDGISWTTNQIGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTDIEATEN- 407 D+ W I ++S + F E +F+GS TLI KL + + + T N Sbjct: 242 ------DE--EWKEQTIANTSEQQFKIEFECEFLGSVNTLINPAKLRNLVY-EAPKTRNA 292 Query: 408 -LYRFKQPEVGHKYFACVDPAEGRGQDYSTIQIIDVTKYPYEQVAVYHSNKISHIMFPRI 466 L ++ P H Y VD A G G DYS + D T++PY+ VA Y +N+I ++FP I Sbjct: 293 GLDIYETPVKEHNYIITVDVARGLGNDYSAFIVFDTTEFPYKVVAKYRNNEIKPMLFPNI 352 Query: 467 IQRLAMEYNEAYVYIELNSVGLSVAKDLYMDLEYENMIIDSSR----------------D 510 I +A YN AY+ IE+N +G VA L DLEYEN+++ S R Sbjct: 353 ILDVAKGYNNAYLLIEVNDIGDQVASILQYDLEYENVLMASMRGRAGQIVGQGFSGKKTQ 412 Query: 511 LGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVIELRTFVEDGLSWSAAKDNHDDLVMA 570 LG++ T K +GCS LK ++E DKL + + EL TF + S+ A + +DDL M Sbjct: 413 LGVRMTSAVKKLGCSNLKTMMEDDKLLTCDYEIISELTTFAQRHNSFEAEEGCNDDLAMC 472 Query: 571 LVIFAYLTTQDRFAEFLDMDEHSLAHDIFQDELESMNDDFNFSVFFNDGLNTIEINSASG 630 LVIF++L QD F E M ++ + I++++ + D F DGL+ G Sbjct: 473 LVIFSWLVAQDYFKE---MSDNDIRKRIYEEQKNQIEQDMAPFGFIADGLDDTSFVDKDG 529 Query: 631 YNNHEDNY 638 H D Y Sbjct: 530 DTWHLDEY 537 >gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: gp123 # Family: family:all:147 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717790;genbank:gi:113200627;genbank:GeneI D:4239178 Length = 549 Score = 349 bits (896), Expect = 5e-98, Method: Compositional matrix adjust. Identities = 203/540 (37%), Positives = 302/540 (55%), Gaps = 34/540 (6%) Query: 111 RYLGLPNLKKANVQTTWSQEMVEEWKKCRDDILYFAK-YCAIIHLDHGVIGINLRPYQKD 169 +YLG PNLKKANV ++ + V E KC ++ +YF K Y I+ LD G+I ++ +Q++ Sbjct: 6 QYLGNPNLKKANVSQEFTPDQVAEVIKCSENPVYFIKNYIKIVSLDKGLIPFDMYYFQEE 65 Query: 170 MLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMSREVLERTKQ 229 M++ + R +I L RQ GK+T + Y YV FN NV ILA+KA+ +RE+L+R + Sbjct: 66 MVQKFHDNRFNIAKLPRQSGKSTIVTSYLLWYVLFNANVNVAILANKAATAREMLQRLQL 125 Query: 230 IIELLPDFLQPGIVEWNKGSIELENGCAISAYSSDPDAVRGNSFALIYVDECGFV--EAW 287 E LP +LQ GI++WN+GS+ELENG I A S+ AVRG SF +I++DE FV Sbjct: 126 SYENLPKWLQQGILQWNRGSLELENGSKILAASTSASAVRGMSFNVIFLDEFAFVPNHVA 185 Query: 288 EDCWKAILPVISSGRNSKIVLTSTPNGMNHWYDLWQSAINGKSGFEPYEANWSAVKERLY 347 + + ++ P ISSG+++K+++ STP+GMN +Y LW A + + P E +WS V R Sbjct: 186 DQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERKANEYIPTEVHWSEVPGR-- 243 Query: 348 HSETDSFDDGISWTTNQIGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTDIEATEN 407 +W I ++S + F E +F+GS TLI+ KL M + D A +N Sbjct: 244 ---------DAAWKEQTIKNTSEQQFRVEFECEFLGSVDTLISPSKLRTMVYGDPIAEKN 294 Query: 408 -LYRFKQPEVGHKYFACVDPAEGRGQDYSTIQIIDVTKYPYEQVAVYHSNKISHIMFPRI 466 L +++ GH Y D + G DYS +ID T PY+ VA Y +N I I+FP I Sbjct: 295 GLSMYEKTIQGHTYVITADVSRGVSGDYSAFLVIDTTTIPYKLVAKYRNNDIKPILFPNI 354 Query: 467 IQRLAMEYNEAYVYIELNSVGLSVAKDLYMDLEYENMIIDSSR----------------D 510 I +A YN A+V +E+N VG VA + DLEY+N+++ + R Sbjct: 355 IVDVARNYNHAFVLVEVNDVGGQVADIIQYDLEYDNLLMCAMRGRAGQQLGQGFSGKKTQ 414 Query: 511 LGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVIELRTFVEDGLSWSAAKDNHDDLVMA 570 +G+K + TK +GCS LK L+E DK +N + EL TF++ G ++ A + +DDL M Sbjct: 415 MGIKMSSATKQVGCSNLKALLEDDKFLLNDYDCISELTTFIQKGQTFQAEEGCNDDLAMC 474 Query: 571 LVIFAYLTTQDRFAEFLDMDEHSLAHDIFQDELESMNDDFNFSVFFNDGLNTIEINSASG 630 +VIFA++ Q F E D D + I+ D+ E++ D F +DGL A G Sbjct: 475 MVIFAWMAMQPYFKELHDND---VRQRIYDDQREAIEQDMAPFGFMDDGLGEEYFADAQG 531 >gi|16897|lcl|protein:vir:5867 Length: 508 # NCBI annotation: similar to terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835653;genbank:gi:30044056 Length = 508 Score = 161 bits (408), Expect = 2e-41, Method: Compositional matrix adjust. Identities = 132/458 (28%), Positives = 218/458 (47%), Gaps = 42/458 (9%) Query: 128 SQEMV-EEWKKCRDDILYFA-KYCAIIHLDHGVIGINLRPYQKDMLRIMSEKRMSIHNLS 185 +Q+++ +E +KC++D +YF KY I H VI +L P Q+ ++ R I Sbjct: 3 TQQIIKQELEKCKNDPIYFIRKYVKIQHPIKRVIPFDLYPIQEKLINFYHTHRYVITEKP 62 Query: 186 RQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMSREVLERTKQIIELLPDFLQPGIVEW 245 RQ+G T Y H + FN V I A+K + ++ VLER K E LP FLQ W Sbjct: 63 RQMGVTWCAVAYALHQMIFNSNYKVLIAANKEATAKNVLERIKFAYEQLPRFLQIKKRTW 122 Query: 246 NKGSIELENGCAISAYSSDPDAVRGNSFALIYVDECGFVEAWEDCWKAILPVISSGRNSK 305 NK IE N + A SS D+ R S L+ V+E F+ E+ W ++ +++G K Sbjct: 123 NKTYIEFSNYSSARAVSSKSDSGRSESITLLIVEEAAFISNMEELWASVQQTLATG--GK 180 Query: 306 IVLTSTPNGMNHWYD-LWQSAINGKSGFEPYEANWSAVKERLYHSETDSFDDGISWTTNQ 364 ++ ST NG+ +WY+ ++A GKS F+ + WS H E D W Q Sbjct: 181 CIVNSTYNGVGNWYERTIRAAKEGKSEFKYFGIKWSD------HPERDE-----KWFEEQ 229 Query: 365 IGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTD---IEATENLYR-FKQPEVGHKY 420 F QE GS +I + + + D ++ + + +++P G+ Y Sbjct: 230 KRLLPPRVFAQEILCIPQGSGENVIPFHLIREEEFIDPFVVKYGGDYWEWYRKP--GY-Y 286 Query: 421 FACVDPAEGRGQDYSTIQI----IDVTKYPYEQVAVYHSNKISHIMFPRIIQRLAMEYNE 476 F VDPA GRG+D S + + +D EQVA + S+K S + ++I+++ E+ Sbjct: 287 FISVDPASGRGEDRSAVGVQVLWVDPQTLTIEQVAEFASDKTSLPVMRQVIKQIYDEFKP 346 Query: 477 AYVYIELNSVGLSVAKDLYMDLEYENMIIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKL 536 ++IE N +G+ + Y+ M + +G T+ K G L L E +L Sbjct: 347 QLIFIETNGIGMGL---------YQFMEAYTPSIVGYYTTQRKKVHGSDLLAKLYEDGRL 397 Query: 537 RINHKQTVIELR--TFVEDGLSWSAAKDNHDDLVMALV 572 + K+ + +L+ T+V++ + + +DL MAL+ Sbjct: 398 ILRSKRLLEQLQRTTWVKNKVETAG----RNDLYMALI 431 >gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: terminase large subunit # Family: family:all:147 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467938;genbank:gi:157265379;genbank:Ge neID:5600506 Length = 485 Score = 64.3 bits (155), Expect = 4e-12, Method: Compositional matrix adjust. Identities = 73/324 (22%), Positives = 141/324 (43%), Gaps = 37/324 (11%) Query: 265 PDAVRGNSFALIYVDECGFV--EAWEDCWKAILPVISSGRNSKIVLTSTPNGMNHWYDLW 322 PD +RG + + +DE + W + AI P +S R+ ++ STP G+N +Y+ + Sbjct: 135 PDNLRGATLDFVILDEAAMIPFSVWSE---AIEPTLSV-RDGWALIISTPKGLNWFYEFF 190 Query: 323 QSAING--KSGFEPYEANWSAVKERLYHSET-DSFDDGISWTTNQIGSSSVEAFLQEHAG 379 G K G N + +H+ + D + + W + F QE+ Sbjct: 191 LMGWRGGLKEGIPNSGVNQTHPDFESFHAASWDVWPERREWYMERRLYIPDLEFRQEYGA 250 Query: 380 QFMGSAGTLITGFKLSKMTWTDIEATENLYRFKQPEVGHKYFACVDPAEGRGQDYSTIQI 439 +F+ + ++ +G + + + T + +P+ H Y C+ G+ QDYS + Sbjct: 251 EFVSHSNSVFSGLDMLILLPYERRGTRLVVEDYRPD--HIY--CIGADFGKNQDYSVFSV 306 Query: 440 IDVTKYPYEQVAVYHSNKISHIMFPRIIQR---LAMEYNEAYVYIELNSVGLSVAKDL-Y 495 +D+ + A+ +++ + + R L+ +Y AYV + VG ++A++L Sbjct: 307 LDL-----DTGAIVCLERMNGATWSDQVARLKALSEDYGHAYVVADTWGVGDAIAEELDA 361 Query: 496 MDLEYENMIIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVI-ELRTF---- 550 + Y + + SS + K S L L+EK ++ + + +T++ ELR F Sbjct: 362 QGINYTPLPVKSS---------SVKEQLISNLALLMEKGQVAVPNDKTILDELRNFRYYR 412 Query: 551 -VEDGLSWSAAKDNHDDLVMALVI 573 A HDD+VM+L + Sbjct: 413 TASGNQVMRAYGRGHDDIVMSLAL 436 >gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: phage terminase large subunit # Family: family:all:147 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468054;genbank:gi:157265496;genbank:Ge neID:5600564 Length = 485 Score = 63.9 bits (154), Expect = 5e-12, Method: Compositional matrix adjust. Identities = 73/324 (22%), Positives = 141/324 (43%), Gaps = 37/324 (11%) Query: 265 PDAVRGNSFALIYVDECGFV--EAWEDCWKAILPVISSGRNSKIVLTSTPNGMNHWYDLW 322 PD +RG + + +DE + W + AI P +S R+ ++ STP G+N +Y+ + Sbjct: 135 PDNLRGATLDFVILDEAAMIPFSVWSE---AIEPTLSV-RDGWALIISTPKGLNWFYEFF 190 Query: 323 QSAING--KSGFEPYEANWSAVKERLYHSET-DSFDDGISWTTNQIGSSSVEAFLQEHAG 379 G K G N + +H+ + D + + W + F QE+ Sbjct: 191 LMGWRGGLKEGIPNSGINQTHPDFESFHAASWDVWPERREWYMERRLYIPDLEFRQEYGA 250 Query: 380 QFMGSAGTLITGFKLSKMTWTDIEATENLYRFKQPEVGHKYFACVDPAEGRGQDYSTIQI 439 +F+ + ++ +G + + + T + +P+ H Y C+ G+ QDYS + Sbjct: 251 EFVSHSNSVFSGLDMLILLPYERRGTRLVVEDYRPD--HIY--CIGADFGKNQDYSVFSV 306 Query: 440 IDVTKYPYEQVAVYHSNKISHIMFPRIIQR---LAMEYNEAYVYIELNSVGLSVAKDL-Y 495 +D+ + A+ +++ + + R L+ +Y AYV + VG ++A++L Sbjct: 307 LDL-----DTGAIVCLERMNGATWSDQVARLKALSEDYGHAYVVADTWGVGDAIAEELDA 361 Query: 496 MDLEYENMIIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVI-ELRTF---- 550 + Y + + SS + K S L L+EK ++ + + +T++ ELR F Sbjct: 362 QGINYTPLPVKSS---------SVKEQLISNLALLMEKGQVAVPNDKTILDELRNFRYYR 412 Query: 551 -VEDGLSWSAAKDNHDDLVMALVI 573 A HDD+VM+L + Sbjct: 413 TASGNQVMRAYGRGHDDIVMSLAL 436 >gi|17621|lcl|protein:vir:816 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050549;genbank:gi:9633446;genbank:GeneID: 1262208 Length = 568 Score = 50.8 bits (120), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 113/506 (22%), Positives = 193/506 (38%), Gaps = 103/506 (20%) Query: 158 VIGINLRPYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKNVGILAHKA 217 ++ +RP Q+ + R M K + + +RQLG +TA +Y F GI+A Sbjct: 49 LVTFRMRPAQRQLFRSMHNKNIILK--ARQLGFSTAIDIYLLDQALFIPHLKCGIVAQDK 106 Query: 218 SMSREVLERTKQIIEL--LPDFLQPG--IVEWNKGS----IELENGCAISAYSSDPDAVR 269 + E+ RTK + LPD+L+ IVE G+ I +G +I +S R Sbjct: 107 QAASEIF-RTKIAVPFDHLPDWLRASFTIVERRSGASGGYILFGHGSSIQVATS----FR 161 Query: 270 GNSFALIYVDECGFVEA-----WEDCWKAILPVIS---------------------SGRN 303 + +++ E G + A ++ L +S S R Sbjct: 162 SGTVQRLHISEHGKICAKYPAKAKELRTGTLNAVSDECIIFDESTAEGVGGDFYEMSNRA 221 Query: 304 SKI----VLTSTPNGMNHWYDLWQ----SAINGKSGFEPYEAN---WSAVKERLYHSETD 352 +I +L + + H+Y WQ SA +SG + +SAV++ + + TD Sbjct: 222 QEITASGLLLTAQDYKFHFYAWWQDPKYSARVPESGLKLSREKMTYFSAVEKAMNITLTD 281 Query: 353 SFDDGISWTTN-----------QIGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTD 401 + W N + S+ EAFL F + F M D Sbjct: 282 ---EQKQWYINKETEQREEMKQEFPSTPQEAFLTSGRRVFSAESTLQAESFCSPPMIVYD 338 Query: 402 IEATEN-----------------------LYRFKQPEVGHKYFACVDPAEG-RGQDYSTI 437 IE L ++ P+ +Y D AEG D S++ Sbjct: 339 IEPVTGAKTKAQSLREGNKNELQRTLMNYLLVWELPDPDEEYVCGADTAEGLEHGDRSSL 398 Query: 438 QIIDVTKYPYEQVAVYHSNKISHIMFPRIIQRLAMEYNEAYVYIELNSVGLSV------- 490 + V + EQVA + + + +F +I ++ YN A+V E N+ G +V Sbjct: 399 DV--VKRSNGEQVAHWFGH-LDAELFAHLISQVCRMYNNAFVGPERNNHGHAVILKLREL 455 Query: 491 --AKDLYMDLEYENMIIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVIELR 548 + +Y + + D + LG T+ +K + +K L+ I T+ E+ Sbjct: 456 YPTRYIYNEQHLDQAYDDDTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMN 515 Query: 549 TFVEDGL-SWSAAKDNHDDLVMALVI 573 T+V D S +A + DD +M+ +I Sbjct: 516 TYVYDAKGSMNAQEGCFDDQLMSYMI 541 >gi|25680|lcl|protein:vir:10116 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859246;genbank:gi:32171173;genbank:GeneID :2653342 Length = 568 Score = 50.8 bits (120), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 113/506 (22%), Positives = 193/506 (38%), Gaps = 103/506 (20%) Query: 158 VIGINLRPYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKNVGILAHKA 217 ++ +RP Q+ + R M K + + +RQLG +TA +Y F GI+A Sbjct: 49 LVTFRMRPAQRQLFRSMHNKNIILK--ARQLGFSTAIDIYLLDQALFIPHLKCGIVAQDK 106 Query: 218 SMSREVLERTKQIIEL--LPDFLQPG--IVEWNKGS----IELENGCAISAYSSDPDAVR 269 + E+ RTK + LPD+L+ IVE G+ I +G +I +S R Sbjct: 107 QAASEIF-RTKIAVPFDHLPDWLRASFTIVERRSGASGGYILFGHGSSIQVATS----FR 161 Query: 270 GNSFALIYVDECGFVEA-----WEDCWKAILPVIS---------------------SGRN 303 + +++ E G + A ++ L +S S R Sbjct: 162 SGTVQRLHISEHGKICAKYPAKAKELRTGTLNAVSDECIIFDESTAEGVGGDFYEMSNRA 221 Query: 304 SKI----VLTSTPNGMNHWYDLWQ----SAINGKSGFEPYEAN---WSAVKERLYHSETD 352 +I +L + + H+Y WQ SA +SG + +SAV++ + + TD Sbjct: 222 QEITASGLLLTAQDYKFHFYAWWQDPKYSARVPESGLKLSREKMTYFSAVEKAMNITLTD 281 Query: 353 SFDDGISWTTN-----------QIGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTD 401 + W N + S+ EAFL F + F M D Sbjct: 282 ---EQKQWYINKETEQREEMKQEFPSTPQEAFLTSGRRVFSAESTLQAESFCSPPMIVYD 338 Query: 402 IEATEN-----------------------LYRFKQPEVGHKYFACVDPAEG-RGQDYSTI 437 IE L ++ P+ +Y D AEG D S++ Sbjct: 339 IEPVTGAKTKAQSLREGNKNELQRTLMNYLLVWELPDPDEEYVCGADTAEGLEHGDRSSL 398 Query: 438 QIIDVTKYPYEQVAVYHSNKISHIMFPRIIQRLAMEYNEAYVYIELNSVGLSV------- 490 + V + EQVA + + + +F +I ++ YN A+V E N+ G +V Sbjct: 399 DV--VKRSNGEQVAHWFGH-LDAELFAHLISQVCRMYNNAFVGPERNNHGHAVILKLREL 455 Query: 491 --AKDLYMDLEYENMIIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVIELR 548 + +Y + + D + LG T+ +K + +K L+ I T+ E+ Sbjct: 456 YPTRYIYNEQHLDQAYDDDTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMN 515 Query: 549 TFVEDGL-SWSAAKDNHDDLVMALVI 573 T+V D S +A + DD +M+ +I Sbjct: 516 TYVYDAKGSMNAQEGCFDDQLMSYMI 541 >gi|51|lcl|protein:vir:3295 Length: 568 # NCBI annotation: putative large subunit terminase # Family: family:all:147 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049511;genbank:gi:9632517;genbank:GeneID: 1262002 Length = 568 Score = 50.8 bits (120), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 113/506 (22%), Positives = 193/506 (38%), Gaps = 103/506 (20%) Query: 158 VIGINLRPYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKNVGILAHKA 217 ++ +RP Q+ + R M K + + +RQLG +TA +Y F GI+A Sbjct: 49 LVTFRMRPAQRQLFRSMHNKNIILK--ARQLGFSTAIDIYLLDQALFIPHLKCGIVAQDK 106 Query: 218 SMSREVLERTKQIIEL--LPDFLQPG--IVEWNKGS----IELENGCAISAYSSDPDAVR 269 + E+ RTK + LPD+L+ IVE G+ I +G +I +S R Sbjct: 107 QAASEIF-RTKIAVPFDHLPDWLRASFTIVERRSGASGGYILFGHGSSIQVATS----FR 161 Query: 270 GNSFALIYVDECGFVEA-----WEDCWKAILPVIS---------------------SGRN 303 + +++ E G + A ++ L +S S R Sbjct: 162 SGTVQRLHISEHGKICAKYPAKAKELRTGTLNAVSDECIIFDESTAEGVGGDFYEMSNRA 221 Query: 304 SKI----VLTSTPNGMNHWYDLWQ----SAINGKSGFEPYEAN---WSAVKERLYHSETD 352 +I +L + + H+Y WQ SA +SG + +SAV++ + + TD Sbjct: 222 QEITASGLLLTAQDYKFHFYAWWQDPKYSARVPESGLKLSREKMTYFSAVEKAMNITLTD 281 Query: 353 SFDDGISWTTN-----------QIGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTD 401 + W N + S+ EAFL F + F M D Sbjct: 282 ---EQKQWYINKETEQREEMKQEFPSTPQEAFLTSGRRVFSAESTLQAESFCSPPMIVYD 338 Query: 402 IEATEN-----------------------LYRFKQPEVGHKYFACVDPAEG-RGQDYSTI 437 IE L ++ P+ +Y D AEG D S++ Sbjct: 339 IEPVTGAKTKAQSLREGNKNELQRTLMNYLLVWELPDPDEEYVCGADTAEGLEHGDRSSL 398 Query: 438 QIIDVTKYPYEQVAVYHSNKISHIMFPRIIQRLAMEYNEAYVYIELNSVGLSV------- 490 + V + EQVA + + + +F +I ++ YN A+V E N+ G +V Sbjct: 399 DV--VKRSNGEQVAHWFGH-LDAELFAHLISQVCRMYNNAFVGPERNNHGHAVILKLREL 455 Query: 491 --AKDLYMDLEYENMIIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVIELR 548 + +Y + + D + LG T+ +K + +K L+ I T+ E+ Sbjct: 456 YPTRYIYNEQHLDQAYDDDTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMN 515 Query: 549 TFVEDGL-SWSAAKDNHDDLVMALVI 573 T+V D S +A + DD +M+ +I Sbjct: 516 TYVYDAKGSMNAQEGCFDDQLMSYMI 541 >gi|26240|lcl|protein:vir:2763 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612880;genbank:gi:20065962;genbank:GeneID :935714 Length = 568 Score = 50.8 bits (120), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 113/506 (22%), Positives = 193/506 (38%), Gaps = 103/506 (20%) Query: 158 VIGINLRPYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKNVGILAHKA 217 ++ +RP Q+ + R M K + + +RQLG +TA +Y F GI+A Sbjct: 49 LVTFRMRPAQRQLFRSMHNKNIILK--ARQLGFSTAIDIYLLDQALFIPHLKCGIVAQDK 106 Query: 218 SMSREVLERTKQIIEL--LPDFLQPG--IVEWNKGS----IELENGCAISAYSSDPDAVR 269 + E+ RTK + LPD+L+ IVE G+ I +G +I +S R Sbjct: 107 QAASEIF-RTKIAVPFDHLPDWLRASFTIVERRSGASGGYILFGHGSSIQVATS----FR 161 Query: 270 GNSFALIYVDECGFVEA-----WEDCWKAILPVIS---------------------SGRN 303 + +++ E G + A ++ L +S S R Sbjct: 162 SGTVQRLHISEHGKICAKYPAKAKELRTGTLNAVSDECIIFDESTAEGVGGDFYEMSNRA 221 Query: 304 SKI----VLTSTPNGMNHWYDLWQ----SAINGKSGFEPYEAN---WSAVKERLYHSETD 352 +I +L + + H+Y WQ SA +SG + +SAV++ + + TD Sbjct: 222 QEITASGLLLTAQDYKFHFYAWWQDPKYSARVPESGLKLSREKMTYFSAVEKAMNITLTD 281 Query: 353 SFDDGISWTTN-----------QIGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTD 401 + W N + S+ EAFL F + F M D Sbjct: 282 ---EQKQWYINKETEQREEMKQEFPSTPQEAFLTSGRRVFSAESTLQAESFCSPPMIVYD 338 Query: 402 IEATEN-----------------------LYRFKQPEVGHKYFACVDPAEG-RGQDYSTI 437 IE L ++ P+ +Y D AEG D S++ Sbjct: 339 IEPVTGAKTKAQSLREGNKNELQRTLMNYLLVWELPDPDEEYVCGADTAEGLEHGDRSSL 398 Query: 438 QIIDVTKYPYEQVAVYHSNKISHIMFPRIIQRLAMEYNEAYVYIELNSVGLSV------- 490 + V + EQVA + + + +F +I ++ YN A+V E N+ G +V Sbjct: 399 DV--VKRSNGEQVAHWFGH-LDAELFAHLISQVCRMYNNAFVGPERNNHGHAVILKLREL 455 Query: 491 --AKDLYMDLEYENMIIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVIELR 548 + +Y + + D + LG T+ +K + +K L+ I T+ E+ Sbjct: 456 YPTRYIYNEQHLDQAYDDDTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMN 515 Query: 549 TFVEDGL-SWSAAKDNHDDLVMALVI 573 T+V D S +A + DD +M+ +I Sbjct: 516 TYVYDAKGSMNAQEGCFDDQLMSYMI 541 >gi|25849|lcl|protein:vir:9949 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859079;genbank:gi:32171001;genbank:GeneID :2653280 Length = 568 Score = 50.8 bits (120), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 113/506 (22%), Positives = 193/506 (38%), Gaps = 103/506 (20%) Query: 158 VIGINLRPYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKNVGILAHKA 217 ++ +RP Q+ + R M K + + +RQLG +TA +Y F GI+A Sbjct: 49 LVTFRMRPAQRQLFRSMHNKNIILK--ARQLGFSTAIDIYLLDQALFIPHLKCGIVAQDK 106 Query: 218 SMSREVLERTKQIIEL--LPDFLQPG--IVEWNKGS----IELENGCAISAYSSDPDAVR 269 + E+ RTK + LPD+L+ IVE G+ I +G +I +S R Sbjct: 107 QAASEIF-RTKIAVPFDHLPDWLRASFTIVERRSGASGGYILFGHGSSIQVATS----FR 161 Query: 270 GNSFALIYVDECGFVEA-----WEDCWKAILPVIS---------------------SGRN 303 + +++ E G + A ++ L +S S R Sbjct: 162 SGTVQRLHISEHGKICAKYPAKAKELRTGTLNAVSDECIIFDESTAEGVGGDFYEMSNRA 221 Query: 304 SKI----VLTSTPNGMNHWYDLWQ----SAINGKSGFEPYEAN---WSAVKERLYHSETD 352 +I +L + + H+Y WQ SA +SG + +SAV++ + + TD Sbjct: 222 QEITASGLLLTAQDYKFHFYAWWQDPKYSARVPESGLKLSREKMTYFSAVEKAMNITLTD 281 Query: 353 SFDDGISWTTN-----------QIGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTD 401 + W N + S+ EAFL F + F M D Sbjct: 282 ---EQKQWYINKETEQREEMKQEFPSTPQEAFLTSGRRVFSAESTLQAESFCSPPMIVYD 338 Query: 402 IEATEN-----------------------LYRFKQPEVGHKYFACVDPAEG-RGQDYSTI 437 IE L ++ P+ +Y D AEG D S++ Sbjct: 339 IEPVTGAKTKAQSLREGNKNELQRTLMNYLLVWELPDPDEEYVCGADTAEGLEHGDRSSL 398 Query: 438 QIIDVTKYPYEQVAVYHSNKISHIMFPRIIQRLAMEYNEAYVYIELNSVGLSV------- 490 + V + EQVA + + + +F +I ++ YN A+V E N+ G +V Sbjct: 399 DV--VKRSNGEQVAHWFGH-LDAELFAHLISQVCRMYNNAFVGPERNNHGHAVILKLREL 455 Query: 491 --AKDLYMDLEYENMIIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVIELR 548 + +Y + + D + LG T+ +K + +K L+ I T+ E+ Sbjct: 456 YPTRYIYNEQHLDQAYDDDTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMN 515 Query: 549 TFVEDGL-SWSAAKDNHDDLVMALVI 573 T+V D S +A + DD +M+ +I Sbjct: 516 TYVYDAKGSMNAQEGCFDDQLMSYMI 541 >gi|1476|lcl|protein:vir:104436 Length: 568 # NCBI annotation: putative terminase large subunit # Family: family:all:147 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794060;genbank:gi:116222005;genbank:GeneI D:4397501 Length = 568 Score = 50.8 bits (120), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 113/506 (22%), Positives = 193/506 (38%), Gaps = 103/506 (20%) Query: 158 VIGINLRPYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKNVGILAHKA 217 ++ +RP Q+ + R M K + + +RQLG +TA +Y F GI+A Sbjct: 49 LVTFRMRPAQRQLFRSMHNKNIILK--ARQLGFSTAIDIYLLDQALFIPHLKCGIVAQDK 106 Query: 218 SMSREVLERTKQIIEL--LPDFLQPG--IVEWNKGS----IELENGCAISAYSSDPDAVR 269 + E+ RTK + LPD+L+ IVE G+ I +G +I +S R Sbjct: 107 QAASEIF-RTKIAVPFDHLPDWLRASFTIVERRSGASGGYILFGHGSSIQVATS----FR 161 Query: 270 GNSFALIYVDECGFVEA-----WEDCWKAILPVIS---------------------SGRN 303 + +++ E G + A ++ L +S S R Sbjct: 162 SGTVQRLHISEHGKICAKYPAKAKELRTGTLNAVSDECIIFDESTAEGVGGDFYEMSNRA 221 Query: 304 SKI----VLTSTPNGMNHWYDLWQ----SAINGKSGFEPYEAN---WSAVKERLYHSETD 352 +I +L + + H+Y WQ SA +SG + +SAV++ + + TD Sbjct: 222 QEITASGLLLTAQDYKFHFYAWWQDPKYSARVPESGLKLSREKMTYFSAVEKAMNITLTD 281 Query: 353 SFDDGISWTTN-----------QIGSSSVEAFLQEHAGQFMGSAGTLITGFKLSKMTWTD 401 + W N + S+ EAFL F + F M D Sbjct: 282 ---EQKQWYINKETEQREEMKQEFPSTPQEAFLTSGRRVFSAESTLQAESFCSPPMIVYD 338 Query: 402 IEATEN-----------------------LYRFKQPEVGHKYFACVDPAEG-RGQDYSTI 437 IE L ++ P+ +Y D AEG D S++ Sbjct: 339 IEPVTGAKTKAQSLREGNKNELQRTLMNYLLVWELPDPDEEYVCGADTAEGLEHGDRSSL 398 Query: 438 QIIDVTKYPYEQVAVYHSNKISHIMFPRIIQRLAMEYNEAYVYIELNSVGLSV------- 490 + V + EQVA + + + +F +I ++ YN A+V E N+ G +V Sbjct: 399 DV--VKRSNGEQVAHWFGH-LDAELFAHLISQVCRMYNNAFVGPERNNHGHAVILKLREL 455 Query: 491 --AKDLYMDLEYENMIIDSSRDLGMKQTKTTKAMGCSTLKDLIEKDKLRINHKQTVIELR 548 + +Y + + D + LG T+ +K + +K L+ I T+ E+ Sbjct: 456 YPTRYIYNEQHLDQAYDDDTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMN 515 Query: 549 TFVEDGL-SWSAAKDNHDDLVMALVI 573 T+V D S +A + DD +M+ +I Sbjct: 516 TYVYDAKGSMNAQEGCFDDQLMSYMI 541 >gi|26461|lcl|protein:vir:573 Length: 589 # NCBI annotation: unknown # Family: family:all:4926 # MgeID: mge:13 # MgeName: SPBc2 # Cross-refs: genbank:acc:NP_046608;genbank:gi:9630181;genbank:GeneID: 1261423 Length = 589 Score = 46.2 bits (108), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 38/150 (25%), Positives = 67/150 (44%), Gaps = 5/150 (3%) Query: 159 IGINLRPYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKNVGILAHKAS 218 +GI L+ +Q ++ +M ++ SR GKT TSVY C + I + Sbjct: 59 LGITLKLFQCILIYMMVHNHYFMYLASRGQGKTWLTSVYCCVQAILFPGTKIVIASGTKG 118 Query: 219 MSREVLERTKQIIELLPDF---LQPGIVEWNKGSIELENGCAISAYSSDPDAVRGNSFAL 275 +REV+E+ + + P+ ++ N +E NG I +S+ D R L Sbjct: 119 QAREVIEKIDDLRKESPNLRREIEDLKTSTNDAKVEFHNGSWIKIVASN-DGARSKRANL 177 Query: 276 IYVDECGFVEAWEDCWKAILPVISSGRNSK 305 + VDE V+ +E K + +++ R+ K Sbjct: 178 LIVDEFRMVD-FEIISKVLRKFLTAPRSPK 206 >gi|20547|lcl|protein:vir:105155 Length: 580 # NCBI annotation: conserved phage-related protein # Family: family:all:4926 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398598;genbank:gi:80159854;genbank:GeneID :3772993 Length = 580 Score = 42.7 bits (99), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 39/160 (24%), Positives = 69/160 (43%), Gaps = 13/160 (8%) Query: 133 EEWKKCRDDILYFAKYCAIIHLDHGVIGINLRPYQKDMLRIMSEKRMSIHNLSRQLGKTT 192 EEW+K I Y+ KY ++ V+G+ L +Q+ +LR M+ + + R LGK+ Sbjct: 41 EEWEKY---ISYYRKYIDKFCIE--VLGLKLYLFQRLILRAMARNQYVMLICCRGLGKSW 95 Query: 193 ATSVYPCHYVTFNEAKNVGILAHKASMSREV-LERTKQIIELLPDFLQPGIVEWNKGS-- 249 ++V+ + GI + + +R V +++ K + P + + G+ Sbjct: 96 LSAVFFVASCILYKGLKCGIASGQGQQARNVIIQKVKGELAKNPSIAREIVFPIKTGADD 155 Query: 250 --IELENGCAISAY---SSDPDAVRGNSFALIYVDECGFV 284 + NG I A + D R F + VDEC V Sbjct: 156 CVVNFRNGSEIRAIVLGRNQGDGARSWRFHYLLVDECRLV 195 >gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: terminase large subunit # Family: family:all:4877 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429607;genbank:gi:156564098;genbank:Ge neID:5525534 Length = 635 Score = 40.4 bits (93), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 45/176 (25%), Positives = 74/176 (42%), Gaps = 26/176 (14%) Query: 164 RPYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPC--HYVTFNEAKN----VGILAHKA 217 R YQ+ ML+ M++ + ++ L R+LGKT + + N+ N + I+A Sbjct: 69 RDYQEPMLQEMADSKRTVLRLGRRLGKTETMCIMILWHAFTQPNKGPNNQYDILIIAPYE 128 Query: 218 SMSREVLERTKQIIELLPDFLQPGIVEWNKGSIELENGCAI------SAYSSDPDAVRGN 271 + +R Q+I++ D ++ IEL NG I S S RG Sbjct: 129 EQVDLIFKRLSQLIDMSGDVNPSRDID---KHIELPNGTVIHGITAGSKSGSGAANTRGQ 185 Query: 272 SFALIYVDECGFVEAWEDCWKAILPVISSGRNS-----KIVLTSTPNGMNHWYDLW 322 LI +DE ++ E + I + RN K+++ STP+G Y W Sbjct: 186 RADLIVLDEMDYMGESE------ITNIMNIRNEAPERIKMIVASTPSGRRDSYYKW 235 >gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: terminase, large subunit # Family: family:all:147 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006983;genbank:gi:46401884;genbank:GeneID :2777679 Length = 438 Score = 38.1 bits (87), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 26/94 (27%), Positives = 40/94 (42%), Gaps = 7/94 (7%) Query: 250 IELENGCAIS-AYSSDPDAVRGNSFALIYVDECGFVEAWEDCWKAILPVISSGRNSKIVL 308 IEL NG A ++ D+ G S+ I DE + D ++ L NSK + Sbjct: 124 IELANGSLFKLASAAQADSAVGRSYDFIIFDEAAISDVGGDAFRVQLRPTLDKPNSKALF 183 Query: 309 TSTPNGMNHWYDLWQSAINGKSGFEPYEANWSAV 342 STP G N + + + GF+ NW ++ Sbjct: 184 ISTPRGGNWFKEFY------AYGFDDTLPNWVSI 211 >gi|16077|lcl|protein:vir:4155 Length: 468 # NCBI annotation: large terminase subunit # Family: family:all:144 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046964;genbank:gi:9630534;genbank:GeneID: 1261708 Length = 468 Score = 32.7 bits (73), Expect = 0.012, Method: Compositional matrix adjust. Identities = 37/167 (22%), Positives = 71/167 (42%), Gaps = 17/167 (10%) Query: 161 INLRPYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMS 220 I + P+ K + ++S++R ++ + GK+ A + YV +++ + + +S Sbjct: 55 IPVNPFHKQIKFLLSDEREVLYGGAAGGGKSVALLMGALQYVHYSDYAALILRRTYPELS 114 Query: 221 REVLERTKQIIELLPDFLQPGIVEWN--KGSIELENGCAIS----AYSSDPDAVRGNSFA 274 +E +I++ D+L EWN K +G A+ + D +G+S+ Sbjct: 115 QE-----GGLIDMANDWLGGTDAEWNEQKKRWTFPSGAALQFGHMEHEKDRYRYQGSSYH 169 Query: 275 LIYVDECGFVEAWEDCWKAILPVISSGRNSKIVL----TSTPNGMNH 317 I DE E E ++ + + N I L TS P G+ H Sbjct: 170 YIAFDEL--TEFMETQYRFMFRSLRKEVNDHIPLRVRATSNPGGIGH 214 >gi|18252|lcl|protein:vir:4193 Length: 467 # NCBI annotation: putative large terminase subunit # Family: family:all:144 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071818;genbank:gi:11863101;genbank:GeneID :1257603 Length = 467 Score = 32.3 bits (72), Expect = 0.018, Method: Compositional matrix adjust. Identities = 36/167 (21%), Positives = 71/167 (42%), Gaps = 17/167 (10%) Query: 161 INLRPYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMS 220 I + P+ K + ++S++R ++ + GK+ A + YV +++ + + +S Sbjct: 55 IPVNPFHKQIKFLLSDEREVLYGGAAGGGKSVALLMGALQYVHYSDYAALILRRTYPELS 114 Query: 221 REVLERTKQIIELLPDFLQPGIVEWN--KGSIELENGCAIS----AYSSDPDAVRGNSFA 274 +E +I++ D+L EWN K +G A+ + D +G+S+ Sbjct: 115 QE-----GGLIDMANDWLGGTDAEWNEQKKRWTFPSGAALQFGHMEHEKDRYRYQGSSYH 169 Query: 275 LIYVDECGFVEAWEDCWKAILPVISSGRNSKIVL----TSTPNGMNH 317 I DE E E ++ + + + I L TS P G+ H Sbjct: 170 YIAFDEL--TEFLESQYRFMFRSLRKEADDPIPLRFRATSNPGGIGH 214 >gi|18781|lcl|protein:vir:7429 Length: 581 # NCBI annotation: gp6 # Family: family:all:147 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818544;genbank:gi:29566981;genbank:GeneID :1260231 Length = 581 Score = 32.3 bits (72), Expect = 0.018, Method: Compositional matrix adjust. Identities = 32/97 (32%), Positives = 48/97 (49%), Gaps = 14/97 (14%) Query: 249 SIELENGCAI-SAYSSD-PDAVRGNSFALIYVDECGFVEAWEDCWKA-ILPVISS-GRNS 304 ++ L +G I SA SS P+ + G ++++E + E WK I+P + G + Sbjct: 135 TVSLWDGAFIYSAKSSAVPERLVGEGLTGVHMEEA--AKQKEVVWKQMIMPTLMDFGGWA 192 Query: 305 KIVLTSTPNGMNHWYDLWQSAINGKSGFEPYEANWSA 341 K T+TP G N +YDL Q A+ P NWSA Sbjct: 193 K--FTTTPEGKNWYYDLHQKALR------PSTLNWSA 221 >gi|1431|lcl|protein:vir:93626 Length: 530 # NCBI annotation: Bcep22gp49 # Family: family:all:147 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944278;genbank:gi:38640355;genbank:GeneID :2658275 Length = 530 Score = 30.0 bits (66), Expect = 0.090, Method: Compositional matrix adjust. Identities = 27/106 (25%), Positives = 46/106 (43%), Gaps = 8/106 (7%) Query: 185 SRQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMSREVL-ERTKQIIELLPDFLQPGIV 243 +RQLG TT + + FN GI+A + + ++ K + LP+ L+ + Sbjct: 77 ARQLGFTTLICIIWLDHALFNANSRCGIIAQDRETAEALFRDKVKFAYDNLPEALREAMP 136 Query: 244 EWNKGSIEL---ENGCAISAYSSDPDAVRGNSFALIYVDECGFVEA 286 N EL N +I +S VRG + +++ E G + A Sbjct: 137 LANCTKAELLFAHNNSSIRVATS----VRGGTIHRLHISEFGKICA 178 >gi|3478|lcl|protein:vir:101370 Length: 494 # NCBI annotation: PacB # Family: family:all:144 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006576;genbank:gi:46401730;genbank:GeneID :2777432 Length = 494 Score = 28.9 bits (63), Expect = 0.22, Method: Compositional matrix adjust. Identities = 13/27 (48%), Positives = 17/27 (62%) Query: 417 GHKYFACVDPAEGRGQDYSTIQIIDVT 443 G + ACVD A G G+D S I I+ V+ Sbjct: 285 GWGWVACVDVAGGTGRDKSVINIMMVS 311 >gi|1931|lcl|protein:vir:99847 Length: 486 # NCBI annotation: large terminase subunit # Family: family:all:460 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164067;genbank:gi:56692599;genbank:GeneID :3192575 Length = 486 Score = 28.1 bits (61), Expect = 0.30, Method: Compositional matrix adjust. Identities = 42/154 (27%), Positives = 62/154 (40%), Gaps = 13/154 (8%) Query: 165 PYQKDMLRIMSEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKNVG--ILAHKASMSRE 222 PYQ I R+ + SRQ+G + +T+ Y T E+ V + + +R Sbjct: 20 PYQSRW--ITDPSRLKLMQKSRQIGLSWSTA-YAAGERTAAESARVDQWVSSRDDLQARL 76 Query: 223 VLERTKQIIELL----PDFLQPGIVEWNKGS---IELENGCAISAYSSDPDAVRGNSFAL 275 LE K ++ D + I NK S +E NG I + SS+PDA G Sbjct: 77 FLEDCKMWAGIMNQAAKDLGEIVIDVKNKISAYVLEFANGRRIHSMSSNPDAQAGKRGGR 136 Query: 276 IYVDECGFVEAWEDCWKAILPVISSGRNSKIVLT 309 I +DE W P I+ G +I+ T Sbjct: 137 I-LDEFALHPDPRKLWSIAYPGITWGGAMEIIST 169 >gi|25398|lcl|protein:vir:8929 Length: 717 # NCBI annotation: ORF025 # Family: family:all:1546 # MgeID: mge:163 # MgeName: phiKZ # Cross-refs: genbank:acc:NP_803591;genbank:gi:29134961;genbank:GeneID :1258246 Length = 717 Score = 27.7 bits (60), Expect = 0.39, Method: Compositional matrix adjust. Identities = 14/46 (30%), Positives = 25/46 (54%) Query: 560 AKDNHDDLVMALVIFAYLTTQDRFAEFLDMDEHSLAHDIFQDELES 605 AK NHDDLV++L++ +L Q + + ++ L +D+ S Sbjct: 589 AKGNHDDLVVSLLLAHWLLIQGKNLSYYGINVPILGKSKLRDKEPS 634 >gi|19608|lcl|protein:vir:4081 Length: 518 # NCBI annotation: terminase # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043560;genbank:gi:9628694;genbank:GeneID: 1261154 Length = 518 Score = 26.9 bits (58), Expect = 0.78, Method: Compositional matrix adjust. Identities = 13/34 (38%), Positives = 18/34 (52%) Query: 249 SIELENGCAISAYSSDPDAVRGNSFALIYVDECG 282 SI G IS Y+S+ D + G L+ +DE G Sbjct: 160 SILKSKGTEISIYASNEDTLDGGREQLVIIDEFG 193 >gi|4674|lcl|protein:vir:105593 Length: 543 # NCBI annotation: terminase large subunit # Family: family:all:147 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164303;genbank:gi:56692921;genbank:GeneID :3197204 Length = 543 Score = 26.9 bits (58), Expect = 0.82, Method: Compositional matrix adjust. Identities = 32/162 (19%), Positives = 70/162 (43%), Gaps = 18/162 (11%) Query: 165 PYQKDMLRIMSEKRMSIHNL---SRQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMSR 221 P++ + + +R+ NL +RQLG TT ++ + FN + GI+A ++ Sbjct: 63 PFKPNRAQKRFIRRLWHRNLILKARQLGFTTLIAIMWLDHALFNGDQRCGIIAQDRDAAK 122 Query: 222 EVL-ERTKQIIELLPDFLQPGIVEWNKGSIEL---ENGCAISAYSSDPDAVRGNSFALIY 277 + ++ K + LP+ ++ + EL N ++ +S +R + ++ Sbjct: 123 VIFRDKVKFAYDNLPEEIRERFPTAAANADELLFAHNNSSVRVATS----MRSGTIHRLH 178 Query: 278 VDECG-----FVEAWEDCWKAILPVISSGRNSKIVLTSTPNG 314 V E G + + ++ +P + + N +V+ ST G Sbjct: 179 VSEFGKICAKYPDKAQEVVTGSIPAVPT--NGILVIESTAEG 218 >gi|6719|lcl|protein:vir:99411 Length: 447 # NCBI annotation: hypothetical protein # Family: family:all:32729 # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919076;genbank:gi:119757034;genbank:GeneI D:4606064 Length = 447 Score = 26.6 bits (57), Expect = 0.93, Method: Compositional matrix adjust. Identities = 17/62 (27%), Positives = 28/62 (45%), Gaps = 4/62 (6%) Query: 270 GNSFALIYVDECGFVEAWEDCWKAILPVISSGRN----SKIVLTSTPNGMNHWYDLWQSA 325 G + I+ DE G D + +I+ R + + TST NG N +YD+ + Sbjct: 135 GGEYCRIWCDEVGHYPPNTDLYDLHEMLITRQRTEIGPNTTLWTSTGNGFNQFYDITERQ 194 Query: 326 IN 327 +N Sbjct: 195 VN 196 >gi|8927|lcl|protein:vir:97005 Length: 632 # NCBI annotation: 40 # Family: family:all:697 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654141;genbank:gi:108862025;genbank:GeneI D:5075954 Length = 632 Score = 26.6 bits (57), Expect = 1.0, Method: Compositional matrix adjust. Identities = 26/128 (20%), Positives = 55/128 (42%), Gaps = 10/128 (7%) Query: 129 QEMVEEWKKCRDDILYFAKYCAIIHLDHGVIGIN--LRPYQKDMLRIM-SEKRMSIHNLS 185 QE+ + + + +L FA H +I N L Q D+L+ + + + Sbjct: 19 QELQQTFPYTAEGLLLFADTVI-----HNLIAGNPHLIRMQADILKFLFYGHKYRLIEAP 73 Query: 186 RQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMSREVLERTKQIIELLP--DFLQPGIV 243 R + KTT +++Y + K + +++ A + E+ +I L +F+ P I Sbjct: 74 RGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFRGLDFLEFMLPDIY 133 Query: 244 EWNKGSIE 251 ++ S++ Sbjct: 134 AGDRASVK 141 >gi|8997|lcl|protein:vir:101640 Length: 432 # NCBI annotation: terminase large subunit # Family: family:all:144 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112491;genbank:gi:53793591;uniprot:Q5ZGG2 ;genbank:GeneID:3101748 Length = 432 Score = 26.2 bits (56), Expect = 1.1, Method: Compositional matrix adjust. Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 8/55 (14%) Query: 239 QPGIVEWNKGS-IELENGCAISAYSSDP--DAVRGNSFALIYVDECGFV--EAWE 288 Q G++ WN GS I L++ + AY SD D++ + ++DEC + +AW+ Sbjct: 88 QSGVIYWNNGSEILLKD---LYAYPSDQNFDSLGSLEISGAFIDECNQITYKAWQ 139 >gi|10294|lcl|protein:vir:105656 Length: 405 # NCBI annotation: putative large terminase subunit # Family: family:all:697 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425018;genbank:gi:83571766;uniprot:Q2WC34 ;genbank:GeneID:3837297 Length = 405 Score = 26.2 bits (56), Expect = 1.1, Method: Compositional matrix adjust. Identities = 28/129 (21%), Positives = 56/129 (43%), Gaps = 12/129 (9%) Query: 129 QEMVEEWKKCRDDILYFAKYCAIIHLDHGVIGIN--LRPYQKDMLRIM-SEKRMSIHNLS 185 QE+ + + + +L FA H +I N L Q D+L+ + + + Sbjct: 19 QELQQTFPYTAEGLLLFADTVI-----HNLIAGNPHLIRMQADILKFLFYGHKYRLIEAP 73 Query: 186 RQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMSREVLERTKQIIELLPDFLQ---PGI 242 R + KTT +++Y + K + +++ A + E+ +I L DFL+ P I Sbjct: 74 RGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFRGL-DFLEFMLPDI 132 Query: 243 VEWNKGSIE 251 ++ S++ Sbjct: 133 YAGDRASVK 141 >gi|12378|lcl|protein:vir:79648 Length: 523 # NCBI annotation: TerL # Family: family:all:144 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285519;genbank:gi:148734502;genbank:Ge neID:5220006 Length = 523 Score = 26.2 bits (56), Expect = 1.1, Method: Compositional matrix adjust. Identities = 20/76 (26%), Positives = 32/76 (42%), Gaps = 1/76 (1%) Query: 344 ERLYHSETDSFDDGISWTTNQIGSSSVEAFLQEHAGQFMGSAGTLIT-GFKLSKMTWTDI 402 + LY ++T + D N G +E + GQ GS G IT G +T D Sbjct: 121 QELYPAKTGTSKDDEFQILNDAGKVRLEMISKSMGGQITGSRGGYITPGVYSGCVTLDDP 180 Query: 403 EATENLYRFKQPEVGH 418 E ++++ + E G Sbjct: 181 EKPDDMFSKVKRERGQ 196 >gi|13269|lcl|protein:vir:81252 Length: 542 # NCBI annotation: gp2, terminase # Family: family:all:523 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456732;genbank:gi:157168375;interpro:I PR005021;uniprot:Q9MBK3;genbank:GeneID:5580375 Length = 542 Score = 25.4 bits (54), Expect = 1.9, Method: Compositional matrix adjust. Identities = 22/100 (22%), Positives = 38/100 (38%), Gaps = 11/100 (11%) Query: 151 IIHLDHGVIGINLRPYQK----DMLRIMSEK-------RMSIHNLSRQLGKTTATSVYPC 199 +I ++GI LRP+Q+ L + EK R + + RQ GKT + Sbjct: 33 VIDFAREILGIELRPWQEWFFIHALELDPEKNYEDFRFRQLVLLVGRQNGKTLVMVILGL 92 Query: 200 HYVTFNEAKNVGILAHKASMSREVLERTKQIIELLPDFLQ 239 + + + A S++ L + + PD Q Sbjct: 93 WKLFIDGCSEIVTAAQDLSVAEATLSNAFMLAKANPDLNQ 132 >gi|15201|lcl|protein:vir:7028 Length: 631 # NCBI annotation: large terminase subunit # Family: family:all:697 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853601;genbank:gi:31711683;genbank:GeneID :1481809 Length = 631 Score = 25.4 bits (54), Expect = 2.0, Method: Compositional matrix adjust. Identities = 21/93 (22%), Positives = 40/93 (43%), Gaps = 3/93 (3%) Query: 162 NLRPYQKDMLRIM-SEKRMSIHNLSRQLGKTTATSVYPCHYVTFNEAKNVGILAHKASMS 220 +L Q D+L+ + + + R KTT ++Y + K + I++ A + Sbjct: 49 DLNRVQADILKFLFGGNKYRMVEAQRGQAKTTIAAIYAVFRIIHEPHKRIMIVSQTAKRA 108 Query: 221 REVLERTKQIIELLP--DFLQPGIVEWNKGSIE 251 E+ +I L +F+ P I +K SI+ Sbjct: 109 EEIAGWVIKIFRGLDFLEFMLPDIYAGDKASIK 141 >gi|16205|lcl|protein:vir:1554 Length: 587 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052122;swissprot:trembl:q9t0z4;genbank:gi :9634048;uniprot:Q9T0Z4;genbank:GeneID:1262409 Length = 587 Score = 24.6 bits (52), Expect = 3.5, Method: Compositional matrix adjust. Identities = 13/47 (27%), Positives = 25/47 (53%), Gaps = 1/47 (2%) Query: 364 QIGSSSVEAFLQEHAGQFMGSAGTLITGF-KLSKMTWTDIEATENLY 409 ++ + +EAFL+EH + SAG ++T ++ W D + N + Sbjct: 538 KVEAEVLEAFLEEHMEHPIHSAGHVVTSMVDGMELYWEDDDVNSNRF 584 >gi|13692|lcl|protein:vir:4897 Length: 411 # NCBI annotation: gp411 # Family: family:all:54 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056675;genbank:gi:9635008;genbank:GeneID: 1262679 Length = 411 Score = 24.6 bits (52), Expect = 3.7, Method: Compositional matrix adjust. Identities = 17/50 (34%), Positives = 25/50 (50%), Gaps = 3/50 (6%) Query: 269 RGNSFALIYVDECGFVEAWEDCWKAILPVISSGRNSKIVLTSTPNGMNHW 318 RG + YV+E A E +K I+ SG +++V S P+ NHW Sbjct: 121 RGFTAFGAYVNEASL--ANEFVFKEIISR-CSGDGARVVWDSNPDNPNHW 167 >gi|16780|lcl|protein:vir:2731 Length: 411 # NCBI annotation: hypothetical protein # Family: family:all:54 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695104;genbank:gi:23455873;genbank:GeneID :955606 Length = 411 Score = 24.6 bits (52), Expect = 4.0, Method: Compositional matrix adjust. Identities = 16/50 (32%), Positives = 24/50 (48%), Gaps = 3/50 (6%) Query: 269 RGNSFALIYVDECGFVEAWEDCWKAILPVISSGRNSKIVLTSTPNGMNHW 318 RG + YV+E E +K I+ SG +++V S P+ NHW Sbjct: 121 RGFTAFGAYVNEASLAN--ELVFKEIISR-CSGDGARVVWDSNPDNPNHW 167 >gi|4395|lcl|protein:vir:98555 Length: 589 # NCBI annotation: gp3 # Family: family:all:169 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958058;genbank:gi:41057355;genbank:GeneID :2744226 Length = 589 Score = 24.3 bits (51), Expect = 4.7, Method: Compositional matrix adjust. Identities = 10/24 (41%), Positives = 15/24 (62%) Query: 467 IQRLAMEYNEAYVYIELNSVGLSV 490 I+RL +YN Y+ I+ +GL V Sbjct: 459 IRRLTEKYNVEYIGIDATGLGLGV 482 >gi|13826|lcl|protein:vir:3376 Length: 586 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523347;swissprot:trembl:q8w5t4;genbank:gi :17570838;uniprot:Q8W5T4;genbank:GeneID:927460 Length = 586 Score = 24.3 bits (51), Expect = 4.7, Method: Compositional matrix adjust. Identities = 10/29 (34%), Positives = 18/29 (62%) Query: 364 QIGSSSVEAFLQEHAGQFMGSAGTLITGF 392 ++ + +EAFL+EH + SAG ++T Sbjct: 537 KVEAEVLEAFLEEHMEHPIHSAGHVVTAM 565 >gi|17547|lcl|protein:vir:959 Length: 592 # NCBI annotation: terminase # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076613;genbank:gi:13095721;genbank:GeneID :920277 Length = 592 Score = 23.5 bits (49), Expect = 7.5, Method: Compositional matrix adjust. Identities = 9/31 (29%), Positives = 17/31 (54%) Query: 350 ETDSFDDGISWTTNQIGSSSVEAFLQEHAGQ 380 E D+F+ I W+ + I +++ Q+ A Q Sbjct: 3 EVDNFETAIQWSKDVISGNTLANIEQKQAAQ 33 >gi|9070|lcl|protein:vir:102238 Length: 588 # NCBI annotation: gp8 # Family: family:all:147 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655204;genbank:gi:109522784;genbank:GeneI D:4157477 Length = 588 Score = 23.5 bits (49), Expect = 8.6, Method: Compositional matrix adjust. Identities = 12/33 (36%), Positives = 17/33 (51%), Gaps = 6/33 (18%) Query: 309 TSTPNGMNHWYDLWQSAINGKSGFEPYEANWSA 341 TSTP G NH++D +Q G +P W + Sbjct: 197 TSTPEGKNHFHDKFQ------MGQDPNNPEWES 223 >gi|8220|lcl|protein:vir:101493 Length: 588 # NCBI annotation: gp8 # Family: family:all:147 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655387;genbank:gi:109522575;genbank:GeneI D:4157565 Length = 588 Score = 23.5 bits (49), Expect = 8.6, Method: Compositional matrix adjust. Identities = 12/33 (36%), Positives = 17/33 (51%), Gaps = 6/33 (18%) Query: 309 TSTPNGMNHWYDLWQSAINGKSGFEPYEANWSA 341 TSTP G NH++D +Q G +P W + Sbjct: 197 TSTPEGKNHFHDKFQ------MGQDPNNPEWES 223 Database: capsid_neck_tail Posted date: Nov 7, 2013 12:16 PM Number of letters in database: 206,069 Number of sequences in database: 514 Lambda K H 0.318 0.134 0.402 Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 310,248 Number of Sequences: 514 Number of extensions: 14815 Number of successful extensions: 166 Number of sequences better than 100.0: 54 Number of HSP's better than 100.0 without gapping: 45 Number of HSP's successfully gapped in prelim test: 9 Number of HSP's that attempted gapping in prelim test: 42 Number of HSP's gapped (non-prelim): 63 length of query: 642 length of database: 206,069 effective HSP length: 77 effective length of query: 565 effective length of database: 166,491 effective search space: 94067415 effective search space used: 94067415 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 40 (20.0 bits)