BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|Aclame:protein:vir:5662|NCBI_annot:terminase DNA packaging enzyme large subunit|genbank:acc:NP_899601;genbank:gi:34419588;genbank:GeneID:2546011
         (600 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: ter...  1258   0.0
gi|20804|lcl|protein:vir:106290 Length: 633 # NCBI annotation: g...   672   0.0
gi|27701|lcl|protein:vir:6893 Length: 611 # NCBI annotation: gp1...   642   0.0
gi|24596|lcl|protein:vir:98262 Length: 609 # NCBI annotation: gp...   642   0.0
gi|27407|lcl|protein:vir:7202 Length: 610 # NCBI annotation: gp1...   629   0.0
gi|22056|lcl|protein:vir:103455 Length: 610 # NCBI annotation: t...   627   0.0
gi|28182|lcl|protein:vir:108053 Length: 611 # NCBI annotation: g...   618   e-179
gi|22942|lcl|protein:vir:101803 Length: 613 # NCBI annotation: g...   617   e-178
gi|23191|lcl|protein:vir:101186 Length: 613 # NCBI annotation: t...   616   e-178
gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: ter...   610   e-176
gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp...   608   e-176
gi|21077|lcl|protein:vir:100538 Length: 612 # NCBI annotation: g...   598   e-173
gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: t...   375   e-106
gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T...   360   e-101
gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: g...   357   e-100
gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: g...   348   8e-98
gi|16897|lcl|protein:vir:5867 Length: 508 # NCBI annotation: sim...   149   1e-37
gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: t...    47   8e-07
gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: ph...    42   1e-05
gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: te...    42   2e-05
gi|1931|lcl|protein:vir:99847 Length: 486 # NCBI annotation: lar...    40   9e-05
gi|17621|lcl|protein:vir:816 Length: 568 # NCBI annotation: hypo...    38   3e-04
gi|25680|lcl|protein:vir:10116 Length: 568 # NCBI annotation: hy...    38   3e-04
gi|51|lcl|protein:vir:3295 Length: 568 # NCBI annotation: putati...    38   3e-04
gi|26240|lcl|protein:vir:2763 Length: 568 # NCBI annotation: hyp...    38   3e-04
gi|25849|lcl|protein:vir:9949 Length: 568 # NCBI annotation: hyp...    38   3e-04
gi|1476|lcl|protein:vir:104436 Length: 568 # NCBI annotation: pu...    38   3e-04
gi|4674|lcl|protein:vir:105593 Length: 543 # NCBI annotation: te...    37   5e-04
gi|1431|lcl|protein:vir:93626 Length: 530 # NCBI annotation: Bce...    37   9e-04
gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: te...    35   0.002
gi|26461|lcl|protein:vir:573 Length: 589 # NCBI annotation: unkn...    32   0.017
gi|14676|lcl|protein:vir:2499 Length: 474 # NCBI annotation: put...    30   0.056
gi|19975|lcl|protein:vir:108214 Length: 544 # NCBI annotation: g...    30   0.092
gi|7504|lcl|protein:vir:99915 Length: 478 # NCBI annotation: gp2...    28   0.27
gi|27798|lcl|protein:vir:8408 Length: 214 # NCBI annotation: gp3...    27   0.51
gi|11523|lcl|protein:vir:78781 Length: 604 # NCBI annotation: pu...    27   0.69
gi|10294|lcl|protein:vir:105656 Length: 405 # NCBI annotation: p...    26   1.0
gi|8927|lcl|protein:vir:97005 Length: 632 # NCBI annotation: 40 ...    26   1.3
gi|10061|lcl|protein:vir:99071 Length: 537 # NCBI annotation: gp...    26   1.4
gi|9070|lcl|protein:vir:102238 Length: 588 # NCBI annotation: gp...    26   1.6
gi|8220|lcl|protein:vir:101493 Length: 588 # NCBI annotation: gp...    26   1.7
gi|6719|lcl|protein:vir:99411 Length: 447 # NCBI annotation: hyp...    25   2.2
gi|16385|lcl|protein:vir:3744 Length: 607 # NCBI annotation: orf...    25   2.5
gi|25398|lcl|protein:vir:8929 Length: 717 # NCBI annotation: ORF...    25   3.5
gi|1165|lcl|protein:vir:102941 Length: 421 # NCBI annotation: te...    24   4.4
gi|27125|lcl|protein:vir:6595 Length: 164 # NCBI annotation: tai...    23   6.8
gi|25350|lcl|protein:vir:80985 Length: 164 # NCBI annotation: gp...    23   6.8
gi|18781|lcl|protein:vir:7429 Length: 581 # NCBI annotation: gp6...    23   6.8
gi|10332|lcl|protein:vir:97407 Length: 514 # NCBI annotation: te...    23   8.3

>gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899601;genbank:gi:34419588;genbank:GeneID:2546011
          Length = 600

 Score = 1258 bits (3255), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 600/600 (100%), Positives = 600/600 (100%) Query: 1 MSSQKKKFKKKTINGIKYVQSNEDMQWYPRYLDDWKVLNRMDKAHKLRIQSTDPSDFKTY 60 MSSQKKKFKKKTINGIKYVQSNEDMQWYPRYLDDWKVLNRMDKAHKLRIQSTDPSDFKTY Sbjct: 1 MSSQKKKFKKKTINGIKYVQSNEDMQWYPRYLDDWKVLNRMDKAHKLRIQSTDPSDFKTY 60 Query: 61 KDKGNRRSRYMNIPNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGN 120 KDKGNRRSRYMNIPNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGN Sbjct: 61 KDKGNRRSRYMNIPNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGN 120 Query: 121 IKMVPRPYQKEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGS 180 IKMVPRPYQKEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGS Sbjct: 121 IKMVPRPYQKEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGS 180 Query: 181 MSMEVLERVKNVIENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYV 240 MSMEVLERVKNVIENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYV Sbjct: 181 MSMEVLERVKNVIENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYV 240 Query: 241 DECAFVPGFDDFWKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTT 300 DECAFVPGFDDFWKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTT Sbjct: 241 DECAFVPGFDDFWKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTT 300 Query: 301 WRAVQNRLYKDGEFDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGI 360 WRAVQNRLYKDGEFDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGI Sbjct: 301 WRAVQNRLYKDGEFDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGI 360 Query: 361 DVVKDSDGWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTS 420 DVVKDSDGWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTS Sbjct: 361 DVVKDSDGWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTS 420 Query: 421 HLLLPAIIMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEERASGGRRGLGLK 480 HLLLPAIIMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEERASGGRRGLGLK Sbjct: 421 HLLLPAIIMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEERASGGRRGLGLK 480 Query: 481 PNKKTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMSLTLL 540 PNKKTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMSLTLL Sbjct: 481 PNKKTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMSLTLL 540 Query: 541 
AYLSTQDRFSDFVEKEYNVSYDIFKQEVHDMMDDDVPFLMIADGIENYGTDFSTNSFGMF 600 AYLSTQDRFSDFVEKEYNVSYDIFKQEVHDMMDDDVPFLMIADGIENYGTDFSTNSFGMF Sbjct: 541 AYLSTQDRFSDFVEKEYNVSYDIFKQEVHDMMDDDVPFLMIADGIENYGTDFSTNSFGMF 600 >gi|20804|lcl|protein:vir:106290 Length: 633 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944105;genbank:gi:38640149;genbank:GeneID :2658038 Length = 633 Score = 672 bits (1733), Expect = 0.0, Method: Compositional matrix adjust. Identities = 326/588 (55%), Positives = 424/588 (72%), Gaps = 18/588 (3%) Query: 9 KKKTINGIK---YVQSNEDMQWYPRYLDDWKVLNRMDKAHKLRIQSTDPSDFKTYKDKGN 65 ++K I +K Y +S D +WYP D + L R+ K+ +Q DPSDFK++KD+ N Sbjct: 50 REKVIRKLKSCVYYKSQHDGRWYPETYDIYSELKRV---QKMNLQGKDPSDFKSFKDRFN 106 Query: 66 RRSRYMNIPNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVP 125 +R+RY+ +PNL+RAN P ++ + E E+++CRDDIVYFAE YCSI+HID G IK+ Sbjct: 107 KRTRYLGLPNLKRANVPTKWTREMVE---EWKRCRDDIVYFAETYCSIIHIDWGVIKVQL 163 Query: 126 RPYQKEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEV 185 R YQK+ML + R S+ LPRQLGKTT IFL H++VFNE K G+LAHKG MS EV Sbjct: 164 RDYQKDMLRIMASERMSMHNLPRQLGKTTATAIFLTHFVVFNEAKAVGVLAHKGDMSKEV 223 Query: 186 LERVKNVIENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAF 245 LER K IE LPDFLQPGI EWNKGNI +NGC +GAYAS DAVRG SF++IYVDECAF Sbjct: 224 LERTKQSIELLPDFLQPGIVEWNKGNIELENGCSIGAYASSPDAVRGNSFALIYVDECAF 283 Query: 246 VPGFDDFWKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRAVQ 305 + GF+D WKA PVISSG +S+++LTSTPNG+NH++D+W +++ F+PYTTTW V+ Sbjct: 284 IEGFEDTWKAILPVISSGRQSRIILTSTPNGINHWYDLWEVSLKSDKGFKPYTTTWITVK 343 Query: 306 NRLYKDGE--FDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVV 363 RLY DG +DDG + + I ++S EAF QEHLC F+GT+GTLINGFKLSKM +V+ Sbjct: 344 ERLY-DGSDAYDDGFEWASKQINSSSVEAFQQEHLCRFMGTSGTLINGFKLSKMTWKEVI 402 Query: 364 KDSDGWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLL 423 D D + +KP EG+KYI TVD +EGRGQDY + +IDVTSYP+ QVAV+H NK S LL Sbjct: 403 
AD-DNFYQIEKPVEGNKYIATVDPAEGRGQDYSTIQIIDVTSYPYRQVAVYHSNKISPLL 461 Query: 424 LPAIIMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEERASGGRRGLGLKPNK 483 LP++IM+ A YN A+VY E+ S G +V LF DLEYENVI++ + LG+K K Sbjct: 462 LPSVIMRYAMEYNNAWVYIELNSIGNMVAKSLFIDLEYENVIVD-----SSKDLGMKQTK 516 Query: 484 KTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMSLTLLAYL 543 TKA+GCSTLKDLIEKD+L ++H T++EF TFVEKG SW A++GFHDDLVMSL + AYL Sbjct: 517 VTKAVGCSTLKDLIEKDKLIVSHKGTIQEFRTFVEKGVSWAAQDGFHDDLVMSLCIFAYL 576 Query: 544 STQDRFSDFVEKEYNVSYDIFKQEVHDMMDDDVPFLMIADGIENYGTD 591 +TQ+RF DF++ N+ D+F+ E+ +M++D +I DGI Y D Sbjct: 577 TTQERFGDFIDATRNIGADVFQSEMEEMLEDFCVGAIIDDGINTYEVD 624 >gi|27701|lcl|protein:vir:6893 Length: 611 # NCBI annotation: gp17 terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861869;genbank:gi:32453660;genbank:GeneID :1494295 Length = 611 Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust. Identities = 312/575 (54%), Positives = 409/575 (71%), Gaps = 13/575 (2%) Query: 9 KKKTINGIKYVQSNEDMQWYPRYLDDWKVLNRMDKAHKLRIQSTDPSDFKTYKDKGNRRS 68 ++K +GI +++S D +WYP D+ R++K K+ S P F+TYKDK N+R+ Sbjct: 29 ERKEEDGIHWIKSQWDGKWYPEKFSDYL---RINKIVKIPNNSDKPELFQTYKDKNNKRT 85 Query: 69 RYMNIPNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRPY 128 RYM +PNL+RAN ++ E+ AE++KCRDDIVYFAE YC+I HID G IK+ R Y Sbjct: 86 RYMGLPNLKRANIKTQW---TYEMVAEWKKCRDDIVYFAETYCAITHIDYGTIKVQLRDY 142 Query: 129 QKEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLER 188 Q++ML++ R ++ L RQLGKTT++ IFLAH++ FN+DK GILAHKGSMS EVL+R Sbjct: 143 QRDMLKIMSSKRMTVCNLSRQLGKTTVVAIFLAHFVCFNKDKAVGILAHKGSMSAEVLDR 202 Query: 189 VKNVIENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPG 248 K IE LPDFLQPGI EWNKG+I DNG +GAYAS DAVRG SF+MIY+DECAF+P Sbjct: 203 TKQAIELLPDFLQPGIVEWNKGSIQLDNGSSIGAYASSPDAVRGNSFAMIYIDECAFIPN 262 Query: 249 FDDFWKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRAVQNRL 308 F D W A PVISSG SK+++T+TPNGLNH++D+W AAV+G S FEPYT 
W +V+ RL Sbjct: 263 FIDSWLAIQPVISSGRRSKIIITTTPNGLNHFYDIWTAAVEGKSGFEPYTAIWNSVKERL 322 Query: 309 YKDGE-FDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVVKDSD 367 Y D + FDDG + ++TI +S F QEH F GT+GTLI+G KL+ + I+V DS Sbjct: 323 YNDEDIFDDGWQWSKQTISASSLTQFRQEHTAAFEGTSGTLISGMKLAILDYIEVTPDSH 382 Query: 368 GWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAI 427 G+ +KKPEEGHKYI T+D SEGRGQDYHA+H+IDVT+ +EQV V H N SHL+LP I Sbjct: 383 GFHQFKKPEEGHKYIATLDCSEGRGQDYHAMHIIDVTTDKWEQVGVLHSNTISHLILPDI 442 Query: 428 IMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEERASGGRRGLGLKPNKKTKA 487 + K YNE +Y E+ STG V L+ DLEYENVI + LG+K +++TK Sbjct: 443 VFKYLMEYNECPIYIELNSTGVSVAKSLYMDLEYENVICD-----SMNDLGMKQSRRTKP 497 Query: 488 IGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMSLTLLAYLSTQD 547 +GCSTLKDLIEKD+LKINH T++EF TF EKG SW AEEG+HDDLVM L + +LSTQ Sbjct: 498 VGCSTLKDLIEKDKLKINHRATIQEFRTFSEKGVSWAAEEGYHDDLVMGLVIFGWLSTQQ 557 Query: 548 RFSDFVEK-EYNVSYDIFKQEVHDMMDDDVPFLMI 581 +F+D+ +K + ++ ++F +E+ DM DD P + + Sbjct: 558 KFADYADKDDMRLASEVFSRELQDMNDDYAPVIFV 592 >gi|24596|lcl|protein:vir:98262 Length: 609 # NCBI annotation: gp17 terminase subunit, nuclease and ATPase # Family: family:all:147 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239195;genbank:gi:66391670;genbank:GeneID :3416364 Length = 609 Score = 642 bits (1657), Expect = 0.0, Method: Compositional matrix adjust. 
Identities = 312/577 (54%), Positives = 412/577 (71%), Gaps = 15/577 (2%) Query: 15 GIKYVQSNEDMQWYPRYLDDWKVLNRMDKAHKLRIQSTDPSDFKTYKDKGNRRSRYMNIP 74 G+++VQS D +WYP D+ +N + HK+ +Q +P++F TYK+K N++SRY P Sbjct: 33 GMEWVQSEHDKKWYPYTFSDYLKINGI---HKVELQGKNPAEFATYKNKSNKKSRYNGNP 89 Query: 75 NLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRPYQKEMLE 134 NL+RA ++ E+ E+ KCRDDIVYFAE YC+I HID G IK+ R YQKEML Sbjct: 90 NLKRAYVQTKW---TKEMLMEWVKCRDDIVYFAETYCAITHIDYGTIKVQLRDYQKEMLI 146 Query: 135 VADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLERVKNVIE 194 ++R L RQLGKTT++ IFLAH++ FNEDK G+LAHK SMS EVL+R K IE Sbjct: 147 EMHKNRMVTCNLSRQLGKTTVVAIFLAHFVCFNEDKYVGVLAHKASMSAEVLDRTKQAIE 206 Query: 195 NLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPGFDDFWK 254 LPDFLQPGI EWNKG+I DN CK+GA+AS DAVRG SF+MIY+DECAF+P F D W Sbjct: 207 LLPDFLQPGIVEWNKGSIELDNKCKIGAFASSPDAVRGNSFAMIYIDECAFIPNFTDAWL 266 Query: 255 ATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRAVQNRLYKDGE- 313 A PVISSG +SK+++T+TPNGLNH++D+WNAAV+G S F PYT W +V+ RLY DG+ Sbjct: 267 AIQPVISSGRKSKILITTTPNGLNHFYDIWNAAVEGKSGFVPYTAIWTSVKERLYTDGDN 326 Query: 314 --FDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVVKDSDGWCV 371 FDDG ++ + I +S+EAF QEH F+GT GTLI+G+KLSKM ID+ + + Sbjct: 327 GVFDDGYSWSAKMIAGSSKEAFLQEHCAEFMGTNGTLISGWKLSKMSWIDIDETETNFYQ 386 Query: 372 YKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAIIMKQ 431 YKKPEEGHKY+ +D +EGRGQDYHA+H+ID+T+ PFEQVAV+H N+TSHL+LP I+++ Sbjct: 387 YKKPEEGHKYVAVLDPAEGRGQDYHAMHIIDITTLPFEQVAVYHSNRTSHLILPDILLRY 446 Query: 432 AYRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEERASGGRRGLGLKPNKKTKAIGCS 491 YNEA++Y E+ STG V LF +LEYENVI + LG+K K++KAIGCS Sbjct: 447 LTMYNEAWIYIELNSTGHSVAKSLFSELEYENVICD-----SYNDLGMKQTKRSKAIGCS 501 Query: 492 TLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMSLTLLAYLSTQDRFSD 551 TLKDLIEKD+L IN+ T+ EF TF EKG SW AEEGFHDDLVMSL +L+TQ +F++ Sbjct: 502 TLKDLIEKDKLIINNKKTILEFRTFSEKGVSWAAEEGFHDDLVMSLACFGWLTTQLKFAE 561 Query: 552 FVEK-EYNVSYDIFKQEVHDMMDDDVPFLMIADGIEN 587 F EK + ++ ++F +E + +D + +++ G 
E Sbjct: 562 FCEKDDLRLANEVFAREREQLYEDALCPVIVTSGDET 598 >gi|27407|lcl|protein:vir:7202 Length: 610 # NCBI annotation: gp17 terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049776;genbank:gi:9632591;genbank:GeneID: 1258647 Length = 610 Score = 629 bits (1622), Expect = 0.0, Method: Compositional matrix adjust. Identities = 311/575 (54%), Positives = 400/575 (69%), Gaps = 13/575 (2%) Query: 9 KKKTINGIKYVQSNEDMQWYPRYLDDWKVLNRMDKAHKLRIQSTDPSDFKTYKDKGNRRS 68 ++K +GI +++S D +WYP D+ R+ K K+ S P F+TYKDK N+RS Sbjct: 29 ERKDEDGIHWIKSQWDGKWYPEKFSDYL---RLHKIVKIPNNSDKPELFQTYKDKNNKRS 85 Query: 69 RYMNIPNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRPY 128 RYM +PNL+RAN ++ + E E++KCRDDIVYFAE YC+I HID G IK+ R Y Sbjct: 86 RYMGLPNLKRANIKTQWTREMVE---EWKKCRDDIVYFAETYCAITHIDYGVIKVQLRDY 142 Query: 129 QKEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLER 188 Q++ML++ R ++ L RQLGKTT++ IFLAH++ FN+DK GILAHKGSMS EVL+R Sbjct: 143 QRDMLKIMSSKRMTVCNLSRQLGKTTVVAIFLAHFVCFNKDKAVGILAHKGSMSAEVLDR 202 Query: 189 VKNVIENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPG 248 K IE LPDFLQPGI EWNKG+I DNG +GAYAS DAVRG SF+MIY+DECAF+P Sbjct: 203 TKQAIELLPDFLQPGIVEWNKGSIELDNGSSIGAYASSPDAVRGNSFAMIYIDECAFIPN 262 Query: 249 FDDFWKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRAVQNRL 308 F D W A PVISSG SK+++T+TPNGLNH++D+W AAV+G S FEPYT W +V+ RL Sbjct: 263 FHDSWLAIQPVISSGRRSKIIITTTPNGLNHFYDIWTAAVEGKSGFEPYTAIWNSVKERL 322 Query: 309 YKDGE-FDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVVKDSD 367 Y D + FDDG + +TI +S F QEH F GT+GTLI+G KL+ M I+V D Sbjct: 323 YNDEDIFDDGWQWSIQTINGSSLAQFRQEHTAAFEGTSGTLISGMKLAVMDFIEVTPDDH 382 Query: 368 GWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAI 427 G+ +KKPE KYI T+D SEGRGQDYHALH+IDVT +EQV V H N SHL+LP I Sbjct: 383 GFHQFKKPEPDRKYIATLDCSEGRGQDYHALHIIDVTDDVWEQVGVLHSNTISHLILPDI 442 Query: 428 IMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEERASGGRRGLGLKPNKKTKA 487 +M+ YNE VY E+ STG V L+ 
DLEYE VI + LG+K K+TKA Sbjct: 443 VMRYLVEYNECPVYIELNSTGVSVAKSLYMDLEYEGVICDSYTD-----LGMKQTKRTKA 497 Query: 488 IGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMSLTLLAYLSTQD 547 +GCSTLKDLIEKD+L I+H T++EF TF EKG SW AEEG+HDDLVMSL + +LSTQ Sbjct: 498 VGCSTLKDLIEKDKLIIHHRATIQEFRTFSEKGVSWAAEEGYHDDLVMSLVIFGWLSTQS 557 Query: 548 RFSDFVEK-EYNVSYDIFKQEVHDMMDDDVPFLMI 581 +F D+ +K + ++ ++F +E+ DM DD P + + Sbjct: 558 KFIDYADKDDMRLASEVFSKELQDMSDDYAPVIFV 592 >gi|22056|lcl|protein:vir:103455 Length: 610 # NCBI annotation: terminase subunit nuclease and ATPase # Family: family:all:147 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803107;genbank:gi:116326387;genbank:GeneI D:4405484 Length = 610 Score = 627 bits (1618), Expect = 0.0, Method: Compositional matrix adjust. Identities = 310/575 (53%), Positives = 400/575 (69%), Gaps = 13/575 (2%) Query: 9 KKKTINGIKYVQSNEDMQWYPRYLDDWKVLNRMDKAHKLRIQSTDPSDFKTYKDKGNRRS 68 ++K +GI +++S D +WYP D+ R+ K K+ S P F+TYKDK N+RS Sbjct: 29 ERKDEDGIHWIKSQWDGKWYPEKFSDYL---RLHKIVKIPNNSDKPELFQTYKDKNNKRS 85 Query: 69 RYMNIPNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRPY 128 RYM +PNL+RAN ++ + E E++KCRDDIVYFAE YC+I HID G IK+ R Y Sbjct: 86 RYMGLPNLKRANIKTQWTREMVE---EWKKCRDDIVYFAETYCAITHIDYGVIKVQLRDY 142 Query: 129 QKEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLER 188 Q++ML++ R ++ L RQLGKTT++ IFLAH++ FN+DK GILAHKGSMS EVL+R Sbjct: 143 QRDMLKIMSSKRMTVCNLSRQLGKTTVVAIFLAHFVCFNKDKAVGILAHKGSMSAEVLDR 202 Query: 189 VKNVIENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPG 248 K IE LPDFLQPGI EWNKG+I DNG +GAYAS DAVRG SF+MIY+DECAF+P Sbjct: 203 TKQAIELLPDFLQPGIVEWNKGSIELDNGSSIGAYASSPDAVRGNSFAMIYIDECAFIPN 262 Query: 249 FDDFWKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRAVQNRL 308 F D W A PVISSG SK+++T+TPNGLNH++D+W AAV+G S FEPYT W +V+ RL Sbjct: 263 FHDSWLAIQPVISSGRRSKIIITTTPNGLNHFYDIWTAAVEGKSGFEPYTAIWNSVKERL 322 Query: 309 YKDGE-FDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVVKDSD 367 Y D + FDDG + +TI ++ F QEH F GT+GTLI+G KL+ M I+V D 
Sbjct: 323 YNDEDIFDDGWQWSIQTINGSTLAQFRQEHTAAFEGTSGTLISGMKLAIMDFIEVTPDDH 382 Query: 368 GWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAI 427 G+ +KKPE KYI T+D SEGRGQDYHALH+IDVT +EQV V H N SHL+LP I Sbjct: 383 GFHRFKKPEPDRKYIATLDCSEGRGQDYHALHIIDVTDDVWEQVGVLHSNTISHLILPDI 442 Query: 428 IMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEERASGGRRGLGLKPNKKTKA 487 +M+ YNE VY E+ STG V L+ DLEYE VI + LG+K K+TKA Sbjct: 443 VMRYLVEYNECPVYIELNSTGVSVAKSLYMDLEYEGVICDSYTD-----LGMKQTKRTKA 497 Query: 488 IGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMSLTLLAYLSTQD 547 +GCSTLKDLIEKD+L I+H T++EF TF EKG SW AEEG+HDDLVMSL + +LSTQ Sbjct: 498 VGCSTLKDLIEKDKLIIHHRATIQEFRTFSEKGVSWAAEEGYHDDLVMSLVIFGWLSTQS 557 Query: 548 RFSDFVEK-EYNVSYDIFKQEVHDMMDDDVPFLMI 581 +F D+ +K + ++ ++F +E+ DM DD P + + Sbjct: 558 KFIDYADKDDMRLASEVFSKELQDMSDDYAPVIFV 592 >gi|28182|lcl|protein:vir:108053 Length: 611 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595293;genbank:gi:161622599;genbank:Ge neID:5783772 Length = 611 Score = 618 bits (1593), Expect = e-179, Method: Compositional matrix adjust. 
Identities = 301/583 (51%), Positives = 409/583 (70%), Gaps = 13/583 (2%) Query: 8 FKKKTINGIKYVQSNEDMQWYPRYLDDWKVLNRMDKAHKLRIQSTDPSDFKTYKDKGNRR 67 ++KT GI +++S D +WYP D+ R+ K K+ P +F+T+KDK N+R Sbjct: 28 LERKTEEGINWIKSQWDDKWYPEKFSDYL---RIHKIVKIPNNGDRPDEFQTFKDKMNKR 84 Query: 68 SRYMNIPNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRP 127 +RYM +PNL+RAN ++ E+ +E++KCRDDIVYFAE YC+I HID G IK+ R Sbjct: 85 TRYMGLPNLKRANIKTQW---TREMVSEWKKCRDDIVYFAETYCAITHIDYGTIKVQLRD 141 Query: 128 YQKEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLE 187 YQ++ML++ ++R + L RQLGKTT++ IFLAH++ FN+DK GILAHKGSMS EVL+ Sbjct: 142 YQRDMLKIMSKNRMTTCNLSRQLGKTTVVAIFLAHFVCFNKDKAVGILAHKGSMSAEVLD 201 Query: 188 RVKNVIENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVP 247 R K IE LPDFLQPGI EWNKG+I DNG +GAYAS DAVRG SF+MIY+DECAF+P Sbjct: 202 RTKQAIELLPDFLQPGIVEWNKGSIELDNGSSIGAYASSPDAVRGNSFAMIYIDECAFIP 261 Query: 248 GFDDFWKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRAVQNR 307 F D W A PVISSG SK+++T+TPNGLNH++D+W AAV+G S F PYT W +V+ R Sbjct: 262 NFLDSWLAIQPVISSGRRSKIIITTTPNGLNHFYDIWTAAVEGKSGFAPYTAIWNSVKER 321 Query: 308 LYKDGE-FDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVVKDS 366 LY D + FDDG + +TI +S F QEH F GT+GTLI+G KL+ M +V+ ++ Sbjct: 322 LYNDADIFDDGWEWSSQTISASSLAQFRQEHCAEFQGTSGTLISGMKLAIMDWKEVIPEN 381 Query: 367 DGWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPA 426 + + +P+ HKYI ++D SEGRGQDYHALH+IDVT+ +EQVAV H N+ SH++LP Sbjct: 382 GYFYRFHEPDPTHKYIASLDCSEGRGQDYHALHIIDVTTDEWEQVAVLHSNEISHMILPD 441 Query: 427 IIMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEERASGGRRGLGLKPNKKTK 486 I+ K YNEA VY E+ STG V L+ DLEYENVI + + LG+K ++TK Sbjct: 442 IVYKYLMEYNEAPVYIELNSTGVSVAKSLYMDLEYENVICD-----SMQDLGMKQTRRTK 496 Query: 487 AIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMSLTLLAYLSTQ 546 +GCSTLKDLIEKD+LK+NH T+ EF TF + SW AE+GFHDDLVMSL + A+L+TQ Sbjct: 497 PVGCSTLKDLIEKDKLKLNHKQTIMEFRTFSQNKLSWAAEDGFHDDLVMSLVIFAWLTTQ 556 Query: 547 DRFSDFVEK-EYNVSYDIFKQEVHDMMDDDVPFLMIADGIENY 588 +F+DF+++ E ++ ++F +E+ 
DM ++ P + + G +Y Sbjct: 557 QKFADFIDRDEMRLASEVFSRELEDMNEEYNPVVFVDAGDNSY 599 >gi|22942|lcl|protein:vir:101803 Length: 613 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238880;genbank:gi:66391955;genbank:GeneID :3416630 Length = 613 Score = 617 bits (1591), Expect = e-178, Method: Compositional matrix adjust. Identities = 300/597 (50%), Positives = 417/597 (69%), Gaps = 28/597 (4%) Query: 17 KYVQSNEDMQWYPRYLDDWKVLNRMDKAHKLRIQSTDPSDFKTYKDKGNRRSRYMNIPNL 76 ++V SN+D +WYP D + ++ +++IQS DPS F+T+KDK N+R+RY+ +PNL Sbjct: 28 EWVLSNQDDKWYPSTFDRYM---KLQGVKRVKIQSDDPSMFRTFKDKTNKRTRYLGLPNL 84 Query: 77 RRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRPYQKEMLEVA 136 +RAN I++ E+ AE ++C++DIVYFAENYC I HID G I++ R YQK+ML + Sbjct: 85 KRANIKIKW---TKEMLAERKRCKEDIVYFAENYCCIEHIDYGIIRVQLRDYQKDMLRIM 141 Query: 137 DRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLERVKNVIENL 196 +R L RQLGKTT++ IFLAH++ FN K GILAHK SMS EVL R K +E L Sbjct: 142 AGNRLMAANLSRQLGKTTVVAIFLAHFVCFNSAKNVGILAHKASMSAEVLHRTKQALELL 201 Query: 197 PDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPGFDDFWKAT 256 PDFLQPGI EWNKG+IT NGC +GA++S DAVRG SF++IYVDE AF+P F D W A Sbjct: 202 PDFLQPGIVEWNKGSITLGNGCAIGAFSSSPDAVRGNSFALIYVDEVAFIPNFTDAWMAI 261 Query: 257 FPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGIST-------FEPYTTTWRAVQNRLY 309 PVISSG SK+++T+TPNGLNH++D+W AA+ S+ F PYT TW +V+ RLY Sbjct: 262 QPVISSGRRSKILMTTTPNGLNHWYDIWTAAITPNSSGDGSKSGFVPYTATWSSVKERLY 321 Query: 310 KDGE--------FDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGID 361 DG+ FDDG ++ +TI ++ +AF QEH F GT+GTLING KLSK+ ID Sbjct: 322 SDGKELSGSDSYFDDGYSWSSKTIAGSALDAFQQEHNTAFQGTSGTLINGTKLSKLNWID 381 Query: 362 VVKDSDGWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSH 421 + D + ++++P+EG KYI T+D++EGRGQDYHA+H+ D+T +P++QVAV+H N TSH Sbjct: 382 -IPPQDNFTMFEEPKEGRKYIATLDSAEGRGQDYHAMHIFDITEFPYKQVAVYHSNTTSH 440 Query: 422 LLLPAIIMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEERASGGRRGLGLKP 481 L+LP +++K Y + Y+Y E+ STG + L+ +L+YENVI + + 
LGLK Sbjct: 441 LILPDVLLKYLNMYFQPYIYIELNSTGVSIAKSLYSELDYENVICD-----SYQDLGLKQ 495 Query: 482 NKKTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMSLTLLA 541 K++KAIGCSTLKDLIEKD+L +NH ++ E TF EKG SW AEEGFHDDLVMSL + A Sbjct: 496 TKRSKAIGCSTLKDLIEKDKLILNHKKSIMELRTFSEKGVSWAAEEGFHDDLVMSLVIFA 555 Query: 542 YLSTQDRFSDFVEK-EYNVSYDIFKQEVHDMMDDDVPFLMIADGIENYGTDFSTNSF 597 +L+TQ+RFSDF E + ++ ++F++E+ ++ DD +P +++ DG + + SF Sbjct: 556 WLTTQERFSDFTENDDMRLANEVFRKEMEELNDDYMPMVIVDDGEDTFEVTHKGMSF 612 >gi|23191|lcl|protein:vir:101186 Length: 613 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932508;genbank:gi:37651634;genbank:GeneID :2610679 Length = 613 Score = 616 bits (1589), Expect = e-178, Method: Compositional matrix adjust. Identities = 300/597 (50%), Positives = 416/597 (69%), Gaps = 28/597 (4%) Query: 17 KYVQSNEDMQWYPRYLDDWKVLNRMDKAHKLRIQSTDPSDFKTYKDKGNRRSRYMNIPNL 76 ++V SN D +WYP D + ++ +++IQS DPS F+T+KDK N+R+RY+ +PNL Sbjct: 28 EWVLSNHDDKWYPSTFDRYM---KLQGVKRVKIQSDDPSMFRTFKDKTNKRTRYLGLPNL 84 Query: 77 RRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRPYQKEMLEVA 136 +RAN I++ E+ AE ++C++DIVYFAENYC I HID G I++ R YQK+ML + Sbjct: 85 KRANIKIKW---TKEMLAERKRCKEDIVYFAENYCCIEHIDYGIIRVQLRDYQKDMLRIM 141 Query: 137 DRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLERVKNVIENL 196 +R L RQLGKTT++ IFLAH++ FN K GILAHK SMS EVL R K +E L Sbjct: 142 AGNRLMAANLSRQLGKTTVVAIFLAHFVCFNSAKNVGILAHKASMSAEVLHRTKQALELL 201 Query: 197 PDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPGFDDFWKAT 256 PDFLQPGI EWNKG+IT NGC +GA++S DAVRG SF++IYVDE AF+P F D W A Sbjct: 202 PDFLQPGIVEWNKGSITLGNGCAIGAFSSSPDAVRGNSFALIYVDEVAFIPNFTDAWMAI 261 Query: 257 FPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGIST-------FEPYTTTWRAVQNRLY 309 PVISSG SK+++T+TPNGLNH++D+W AA+ S+ F PYT TW +V+ RLY Sbjct: 262 QPVISSGRRSKILMTTTPNGLNHWYDIWTAAITPNSSGDGSKSGFVPYTATWSSVKERLY 321 Query: 310 KDGE--------FDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGID 
361 DG+ FDDG ++ +TI ++ +AF QEH F GT+GTLING KLSK+ ID Sbjct: 322 SDGKELSGSDSYFDDGYSWSSKTIAGSALDAFQQEHNTAFQGTSGTLINGTKLSKLNWID 381 Query: 362 VVKDSDGWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSH 421 + D + ++++P+EG KYI T+D++EGRGQDYHA+H+ D+T +P++QVAV+H N TSH Sbjct: 382 -IPPQDNFTMFEEPKEGRKYIATLDSAEGRGQDYHAMHIFDITEFPYKQVAVYHSNTTSH 440 Query: 422 LLLPAIIMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEERASGGRRGLGLKP 481 L+LP +++K Y + Y+Y E+ STG + L+ +L+YENVI + + LGLK Sbjct: 441 LILPDVLLKYLNMYFQPYIYIELNSTGVSIAKSLYSELDYENVICD-----SYQDLGLKQ 495 Query: 482 NKKTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMSLTLLA 541 K++KAIGCSTLKDLIEKD+L +NH ++ E TF EKG SW AEEGFHDDLVMSL + A Sbjct: 496 TKRSKAIGCSTLKDLIEKDKLILNHKKSIMELRTFSEKGVSWAAEEGFHDDLVMSLVIFA 555 Query: 542 YLSTQDRFSDFVEK-EYNVSYDIFKQEVHDMMDDDVPFLMIADGIENYGTDFSTNSF 597 +L+TQ+RFSDF E + ++ ++F++E+ ++ DD +P +++ DG + + SF Sbjct: 556 WLTTQERFSDFTENDDMRLANEVFRKEMEELNDDYMPMVIVDDGEDTFEVTHKGMSF 612 >gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891724;genbank:gi:33620526;genbank:GeneID :1725332 Length = 607 Score = 610 bits (1573), Expect = e-176, Method: Compositional matrix adjust. 
Identities = 299/579 (51%), Positives = 406/579 (70%), Gaps = 16/579 (2%) Query: 14 NGIKYVQSNEDMQWYPRYLDDWKVLNRMDKAHKLRIQSTDPSDFKTYKDKGNRRSRYMNI 73 NGI+++ S D +WYP+ D+ LNR K+R+QSTDP+++K +KD N R+RYM + Sbjct: 32 NGIEWILSKHDDKWYPKKFSDYLKLNR---PQKIRMQSTDPTNYKFFKDSDNIRTRYMRL 88 Query: 74 PNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRPYQKEML 133 NLRRAN ++ E+ AE+++CR DIVYFAE YC+I HID G IK+ R YQK+ML Sbjct: 89 KNLRRANIKTQYTP---EMIAEWKRCRKDIVYFAETYCAITHIDYGTIKVQLRDYQKDML 145 Query: 134 EVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLERVKNVI 193 ++ +R S L RQLGKTT + IFLAHY+ FN+DK GILAHKGSM++EVLER K I Sbjct: 146 KIMHENRMSAHKLSRQLGKTTAVAIFLAHYVCFNKDKAVGILAHKGSMAVEVLERTKQAI 205 Query: 194 ENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPGFDDFW 253 E LPDFLQPGI EWNK +I +NG +GAYAS DAVRG SFS IY+DECAF+ + D + Sbjct: 206 ELLPDFLQPGIVEWNKKSIVLENGSSIGAYASSPDAVRGNSFSFIYIDECAFIQNWTDCF 265 Query: 254 KATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRAVQNRLYKDGE 313 A PVISSG ESK+++T+TPNGLNH++D+W +A+ G S + PY W +V+ RLY + Sbjct: 266 LAIQPVISSGRESKMIMTTTPNGLNHFYDIWQSAIDGKSGYVPYEAVWHSVKERLYNKAD 325 Query: 314 -FDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVVKDSDGWCVY 372 FDDG + + I +S E F QEH F G++GTLI LS++ IDVV D +G+ + Sbjct: 326 IFDDGYEWSSQAIAGSSLEQFLQEHNAEFFGSSGTLIRATTLSRLSFIDVVND-NGFYQF 384 Query: 373 KKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAIIMKQA 432 +KP+EG KY+ T+D SEGRGQDYHAL +ID+T +P++QVAV+H N TSH +LP I+ K Sbjct: 385 EKPKEGRKYVATLDCSEGRGQDYHALQIIDITEFPYKQVAVYHSNTTSHFILPDIVFKYL 444 Query: 433 YRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEERASGGRRGLGLKPNKKTKAIGCST 492 YNE VY E+ STG + L DLEY+N+I + LG+K +K++KA+GCS Sbjct: 445 MMYNECPVYIELNSTGVSIAKSLAMDLEYDNIICDSFID-----LGMKQSKRSKAMGCSA 499 Query: 493 LKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMSLTLLAYLSTQDRFSDF 552 LKDLIEKD+L INH T++E TF EKG SW AEEGFHDDLVMSL + +L+TQ++F+++ Sbjct: 500 LKDLIEKDKLIINHKGTIQELRTFSEKGVSWAAEEGFHDDLVMSLVIFGWLTTQEKFAEY 559 Query: 553 VEK-EYNVSYDIFKQEVHDMMDDDVPFLMI--ADGIENY 588 K E ++ +IF++E+ ++ ++ P 
++ A+GIE Y Sbjct: 560 AGKDEMRIASEIFRKELDELGEEYAPVVIYDGANGIEEY 598 >gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469498;genbank:gi:157311455;genbank:Ge neID:5602125 Length = 607 Score = 608 bits (1568), Expect = e-176, Method: Compositional matrix adjust. Identities = 298/579 (51%), Positives = 405/579 (69%), Gaps = 16/579 (2%) Query: 14 NGIKYVQSNEDMQWYPRYLDDWKVLNRMDKAHKLRIQSTDPSDFKTYKDKGNRRSRYMNI 73 NGI+++ S D +WYP+ D+ LNR K+R+QSTDP+++K +KD N R+RYM + Sbjct: 32 NGIEWILSKHDDKWYPKKFSDYLKLNR---PQKIRMQSTDPTNYKVFKDSDNIRTRYMRL 88 Query: 74 PNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRPYQKEML 133 NLRRAN ++ E+ AE+++CR DIVYFAE YC+I HID G IK+ R YQK+ML Sbjct: 89 KNLRRANIKTQYTP---EMIAEWKRCRKDIVYFAETYCAITHIDYGTIKVQLRDYQKDML 145 Query: 134 EVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLERVKNVI 193 ++ +R S L RQLGKTT + IFLAHY+ FN+DK GILAHKGSM++EVLER K I Sbjct: 146 KIMHENRMSAHKLSRQLGKTTAVAIFLAHYVCFNKDKAVGILAHKGSMAVEVLERTKQAI 205 Query: 194 ENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPGFDDFW 253 E LPDFLQPGI EWNK +I +NG +GAYAS DAVRG SFS IY+DECAF+ + D + Sbjct: 206 ELLPDFLQPGIVEWNKKSIVLENGSSIGAYASSPDAVRGNSFSFIYIDECAFIQNWTDCF 265 Query: 254 KATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRAVQNRLYKDGE 313 A PVISSG ESK+++T+TPNGLNH++D+W +A+ G S + PY W +V+ RLY + Sbjct: 266 LAIQPVISSGRESKMIMTTTPNGLNHFYDIWQSAIDGKSGYVPYEAVWHSVKERLYNKAD 325 Query: 314 -FDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVVKDSDGWCVY 372 FDDG + + I +S E F QEH F G++GTLI LS++ IDVV D +G+ + Sbjct: 326 IFDDGYEWSSQAIAGSSLEQFLQEHNAEFFGSSGTLIRATTLSRLSFIDVVND-NGFYQF 384 Query: 373 KKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAIIMKQA 432 +KP+EG KY+ T+D SEGRGQDYHAL +ID+T +P++ VAV+H N TSH +LP I+ K Sbjct: 385 EKPKEGRKYVATLDCSEGRGQDYHALQIIDITEFPYKPVAVYHSNTTSHFILPDIVFKYL 444 Query: 433 YRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEERASGGRRGLGLKPNKKTKAIGCST 
492 YNE VY E+ STG + L DLEY+N+I + LG+K +K++KA+GCS Sbjct: 445 MMYNECPVYIELNSTGVSIAKSLAMDLEYDNIICDSFID-----LGMKQSKRSKAMGCSA 499 Query: 493 LKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMSLTLLAYLSTQDRFSDF 552 LKDLIEKD+L INH T++E TF EKG SW AEEGFHDDLVMSL + +L+TQ++F+++ Sbjct: 500 LKDLIEKDKLIINHKGTIQELRTFSEKGVSWAAEEGFHDDLVMSLVIFGWLTTQEKFAEY 559 Query: 553 VEK-EYNVSYDIFKQEVHDMMDDDVPFLMI--ADGIENY 588 K E ++ +IF++E+ ++ ++ P ++ A+GIE Y Sbjct: 560 AGKDEMRIASEIFRKELDELGEEYAPVVIYDGANGIEEY 598 >gi|21077|lcl|protein:vir:100538 Length: 612 # NCBI annotation: gp17 terminase subunit # Family: family:all:147 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656379;genbank:gi:109290130;genbank:GeneI D:4156516 Length = 612 Score = 598 bits (1543), Expect = e-173, Method: Compositional matrix adjust. Identities = 297/597 (49%), Positives = 405/597 (67%), Gaps = 28/597 (4%) Query: 17 KYVQSNEDMQWYPRYLDDWKVLNRMDKAHKLRIQSTDPSDFKTYKDKGNRRSRYMNIPNL 76 +++ SN D +WYP D + + +++IQ+ DPS F+T+KDK N+RSRY +PNL Sbjct: 27 EWILSNHDDKWYPSTFDRYL---KSQGVKRVKIQADDPSMFRTFKDKTNKRSRYNGLPNL 83 Query: 77 RRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRPYQKEMLEVA 136 +RAN I++ E+ AE ++C++DIVYFAENYC I HID G I++ R YQK+ML + Sbjct: 84 KRANIKIKW---TKEMLAERKRCKEDIVYFAENYCCIEHIDYGIIRVQLRDYQKDMLRIM 140 Query: 137 DRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLERVKNVIENL 196 +R L RQLGKTT++ IFLAH++ FN K GILAHK SMS EVL R K +E L Sbjct: 141 AGNRLMAANLSRQLGKTTVVAIFLAHFVCFNSAKNVGILAHKASMSAEVLHRTKQALELL 200 Query: 197 PDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPGFDDFWKAT 256 PDFLQPGI EWNKG+IT NGC +GA++S DAVRG SF++IY+DE AF+P F+D W A Sbjct: 201 PDFLQPGIVEWNKGSITLGNGCAIGAFSSSPDAVRGNSFALIYIDEVAFIPNFNDAWLAI 260 Query: 257 FPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQ-------GISTFEPYTTTWRAVQNRLY 309 PVISSG SK+++T+TPNGLNH++D+W AA+ S F PYT TW +V+ R+Y Sbjct: 261 QPVISSGRHSKILMTTTPNGLNHWYDIWTAAITPNSDGSGSKSGFVPYTATWSSVKERMY 320 Query: 310 KDGEFDDG--EAFKRETIGNTSR------EAFSQEHLCNFLGTAGTLINGFKLSKMKGID 361 DG DG + +G + AF QEH F 
GT+GTLINGFKLSKM + Sbjct: 321 SDGSKTDGAIHILTTDILGQPRQSPVLALRAFQQEHNTAFQGTSGTLINGFKLSKMTWKE 380 Query: 362 VVKDSDGWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSH 421 V SD + ++K+P EGHKYI T+D++EGRGQDYHA+H+ D+T +P+EQVAV+H N TSH Sbjct: 381 -VPASDNFTMFKEPIEGHKYIATLDSAEGRGQDYHAMHIYDITEFPYEQVAVYHSNTTSH 439 Query: 422 LLLPAIIMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEERASGGRRGLGLKP 481 L+LP +++K Y + Y+Y E+ +TG + L+ +LEYEN+I + LG+K Sbjct: 440 LILPDVLLKYLNMYYQPYIYIELNATGVSIAKSLYSELEYENIICD-----SYNDLGMKQ 494 Query: 482 NKKTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMSLTLLA 541 K++KAIGCSTLKDLIEK++L + H T+ E TF EKG SW AE+GFHDDLVMSL + A Sbjct: 495 TKRSKAIGCSTLKDLIEKEKLVLYHKGTIMELRTFSEKGVSWAAEDGFHDDLVMSLVIFA 554 Query: 542 YLSTQDRFSDFVEK-EYNVSYDIFKQEVHDMMDDDVPFLMIADGIENYGTDFSTNSF 597 +L+TQ RFSDF E+ + ++ +IF+QE+ ++ DD P +++ G E + + SF Sbjct: 555 WLTTQPRFSDFTERDDMRLANEIFRQEMENLYDDYTPVVIVDSGEETFEVGSNGMSF 611 >gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: terminase large subunit gp17 # Family: family:all:147 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195134;genbank:gi:58532911;uniprot:Q5GQN8 ;genbank:GeneID:3260486 Length = 548 Score = 375 bits (964), Expect = e-106, Method: Compositional matrix adjust. 
Identities = 203/530 (38%), Positives = 310/530 (58%), Gaps = 27/530 (5%) Query: 70 YMNIPNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRPYQ 129 Y+ P L++AN I F E E+ KC +D VYF +NY IV +D G + +Q Sbjct: 7 YLGNPLLKKANVKIDF---TKEQVKEWIKCANDPVYFTKNYVKIVSLDEGLVPFKMWDFQ 63 Query: 130 KEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLERV 189 +E++ ++RF+I LPRQ GK+T + +L HYL+FN++ GILA+K S + ++L R+ Sbjct: 64 EELIMKFHKNRFNIAKLPRQTGKSTTVVSYLLHYLIFNDNVNIGILANKASTARDLLARL 123 Query: 190 KNVIENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPGF 249 ENLP ++Q G+ WNKGNI +NG K+ A ++ + AVRG SF++I++DE AFVP Sbjct: 124 ATAYENLPKWIQQGVVVWNKGNIELENGSKILAASTSASAVRGMSFNIIFLDEFAFVPNH 183 Query: 250 --DDFWKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRAVQNR 307 D F+ + +P I+SG+ +KV++ STP G+NH++ MW A G + + + W V R Sbjct: 184 IADSFFASVYPTITSGKSTKVIIISTPQGMNHFYKMWVDATNGRNGYTFHEVHWSQVPGR 243 Query: 308 LYKDGEFDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVVKDSD 367 E +K ETI NTS F+QE C FLG+ TLI KL + D +K + Sbjct: 244 ---------DEKWKEETIKNTSERQFTQEFECEFLGSVDTLIAASKLKALVFNDPIKRNK 294 Query: 368 GWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAI 427 G +Y++P+E +Y++TVD S G G DY A + D+T+ P++ V + +N+ +L P I Sbjct: 295 GLDIYEEPKEKSEYLMTVDVSRGIGGDYSAFIIFDITTVPYKVVGKYRNNEIKPMLFPNI 354 Query: 428 IMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVI---MEERA--------SGGRRG 476 I A YN A+V CE+ G+ V + L DLEY NV+ M RA SG + Sbjct: 355 INDLARSYNNAWVLCEVNDIGDQVASILNYDLEYPNVLMCAMRGRAGQLVGQGFSGSKTQ 414 Query: 477 LGLKPNKKTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMS 536 LG+K + K +GC+ LK ++E+D+L N + E TF++K +S+EA+EGFHDDLVM Sbjct: 415 LGVKMSITVKKVGCANLKTIVEEDKLIFNDYDIINELTTFIQKKQSFEADEGFHDDLVMC 474 Query: 537 LTLLAYLSTQDRFSDFVEKEYNVSYDIFKQEVHDMMDDDVPFLMIADGIE 586 + + A+L QD F + + + + I+ ++ + + D PF I G+E Sbjct: 475 MVIFAWLVQQDYFKEMTDND--IRQRIYDEQKNQIEQDMAPFGFITTGLE 522 >gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T4-like DNA packaging large subunit terminase # Family: family:all:147 # 
MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214360;genbank:gi:61806000;genbank:GeneID :3294466 Length = 547 Score = 360 bits (923), Expect = e-101, Method: Compositional matrix adjust. Identities = 196/531 (36%), Positives = 305/531 (57%), Gaps = 27/531 (5%) Query: 70 YMNIPNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRPYQ 129 Y+ PNL++AN PI F I+ EF KC++D VYF NY IV +D G + +Q Sbjct: 5 YLGNPNLKKANTPIEFSK--DNIR-EFLKCKEDPVYFTRNYIKIVSLDEGLVPFNMYDFQ 61 Query: 130 KEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLERV 189 ++++ +RF+I +PRQ GK+T +L HY VFN++ +LA+K S + ++L R+ Sbjct: 62 EKLITRFHENRFNICKMPRQTGKSTTCISYLLHYAVFNDNVNVAVLANKASTARDLLGRL 121 Query: 190 KNVIENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPGF 249 + ENLP ++Q GI WNKG++ +NG K+ A ++ S AVRG S+++I++DE AF+P Sbjct: 122 QLAYENLPRWMQQGIISWNKGSLELENGSKISANSTSSSAVRGGSYNVIFLDEFAFIPNH 181 Query: 250 --DDFWKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRAVQNR 307 DDF+ + +P I+SG+ +KV++ STP G+NH++ MW+ + +G S + W V R Sbjct: 182 IADDFFASVYPTITSGQSTKVIIVSTPRGMNHFYRMWHDSEKGKSEYVATDVHWSEVPGR 241 Query: 308 LYKDGEFDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVVKDSD 367 E +K +TI NTS + F E C FLG+ TLIN KL + + Sbjct: 242 ---------DEEWKEQTIANTSEQQFKIEFECEFLGSVNTLINPAKLRNLVYEAPKTRNA 292 Query: 368 GWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAI 427 G +Y+ P + H YI+TVD + G G DY A + D T +P++ VA + +N+ +L P I Sbjct: 293 GLDIYETPVKEHNYIITVDVARGLGNDYSAFIVFDTTEFPYKVVAKYRNNEIKPMLFPNI 352 Query: 428 IMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEE---RA--------SGGRRG 476 I+ A YN AY+ E+ G+ V + L DLEYENV+M RA SG + Sbjct: 353 ILDVAKGYNNAYLLIEVNDIGDQVASILQYDLEYENVLMASMRGRAGQIVGQGFSGKKTQ 412 Query: 477 LGLKPNKKTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVMS 536 LG++ K +GCS LK ++E D+L + E TF ++ S+EAEEG +DDL M Sbjct: 413 LGVRMTSAVKKLGCSNLKTMMEDDKLLTCDYEIISELTTFAQRHNSFEAEEGCNDDLAMC 472 Query: 537 LTLLAYLSTQDRFSDFVEKEYNVSYDIFKQEVHDMMDDDVPFLMIADGIEN 587 L + ++L QD F + + + + I++++ + + D PF IADG+++ Sbjct: 
473 LVIFSWLVAQDYFKEMSDND--IRKRIYEEQKNQIEQDMAPFGFIADGLDD 521 >gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214662;genbank:gi:61806303;genbank:GeneID :3294591 Length = 550 Score = 357 bits (916), Expect = e-100, Method: Compositional matrix adjust. Identities = 199/534 (37%), Positives = 310/534 (58%), Gaps = 27/534 (5%) Query: 66 RRSRYMNIPNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVP 125 ++ Y+ PNL++AN +F + AE+ KC D VYF Y IV +D G I Sbjct: 4 KQEIYLGNPNLKKANVSTQF---TKKQVAEYMKCAQDPVYFIRKYIRIVSLDEGVIPFDM 60 Query: 126 RPYQKEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEV 185 +Q++M+ + RF+I LPRQ GK+TI+ +L Y++FN + ILA+K + E+ Sbjct: 61 YNFQEDMVTKFHQHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAPTAREM 120 Query: 186 LERVKNVIENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAF 245 L R++ ENLP ++Q GI WNKG++ +NG K+ A ++ + AVRG SF++I++DE AF Sbjct: 121 LGRLQLSYENLPKWMQQGILGWNKGSLELENGSKILASSTSASAVRGMSFNIIFLDEFAF 180 Query: 246 VPGF--DDFWKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRA 303 VP + F+ + +P ISSG+ +KV++ STP+G+N ++ +W+ A +G + + W Sbjct: 181 VPNHIAEQFFASVYPTISSGKSTKVIIISTPHGMNQFYKLWHDAERGANNYVATEVHWSQ 240 Query: 304 VQNRLYKDGEFDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVV 363 V R DD +K++TI NTS F E C FLG+ TLI KL M D + Sbjct: 241 VPGR-------DD--KWKQQTIENTSEAQFRVEFECEFLGSVDTLITPSKLRIMPYKDPI 291 Query: 364 KDSDGWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLL 423 +++ G VY+ +E H YI+TVD S G G DY A +ID T+ P++ VA + +N+ L+ Sbjct: 292 QENRGLAVYEHVQENHNYIITVDVSRGVGNDYSAFCVIDTTTVPYKVVARYKNNQIKPLV 351 Query: 424 LPAIIMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVI---MEERA--------SG 472 P +I+ A YN AYV CE+ G V + + DLEYEN++ M RA SG Sbjct: 352 FPNLIVDVATNYNGAYVLCEVNDIGGQVADIIQYDLEYENLLMVSMRGRAGQQLGQGFSG 411 Query: 473 GRRGLGLKPNKKTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDD 532 + LG+K + K +GCS LK LIE D+L + T+ E TF++KG+S++AE+G +DD Sbjct: 412 
KKTQLGIKMSTAVKQVGCSNLKALIEDDKLIVEDYDTIAELTTFIQKGQSFQAEDGCNDD 471 Query: 533 LVMSLTLLAYLSTQDRFSDFVEKEYNVSYDIFKQEVHDMMDDDVPFLMIADGIE 586 L M L + ++++ Q F + + +V I++ + + D PF ++DG+E Sbjct: 472 LAMCLVIFSWMAMQPYFKEM--HDNDVRQRIYEDQRDQIEQDMAPFGFVSDGLE 523 >gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: gp123 # Family: family:all:147 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717790;genbank:gi:113200627;genbank:GeneI D:4239178 Length = 549 Score = 348 bits (894), Expect = 8e-98, Method: Compositional matrix adjust. Identities = 195/537 (36%), Positives = 310/537 (57%), Gaps = 28/537 (5%) Query: 69 RYMNIPNLRRANAPIRFGAPLSEIKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRPY 128 +Y+ PNL++AN F + AE KC ++ VYF +NY IV +D G I + Sbjct: 6 QYLGNPNLKKANVSQEF---TPDQVAEVIKCSENPVYFIKNYIKIVSLDKGLIPFDMYYF 62 Query: 129 QKEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLER 188 Q+EM++ +RF+I LPRQ GK+TI+ +L Y++FN + ILA+K + + E+L+R Sbjct: 63 QEEMVQKFHDNRFNIAKLPRQSGKSTIVTSYLLWYVLFNANVNVAILANKAATAREMLQR 122 Query: 189 VKNVIENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPG 248 ++ ENLP +LQ GI +WN+G++ +NG K+ A ++ + AVRG SF++I++DE AFVP Sbjct: 123 LQLSYENLPKWLQQGILQWNRGSLELENGSKILAASTSASAVRGMSFNVIFLDEFAFVPN 182 Query: 249 F--DDFWKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRAVQN 306 D F+ + +P ISSG+ +KV++ STP+G+N ++ +W+ A + + + P W V Sbjct: 183 HVADQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERKANEYIPTEVHWSEVPG 242 Query: 307 RLYKDGEFDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVVKDS 366 R A+K +TI NTS + F E C FLG+ TLI+ KL M D + + Sbjct: 243 R---------DAAWKEQTIKNTSEQQFRVEFECEFLGSVDTLISPSKLRTMVYGDPIAEK 293 Query: 367 DGWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPA 426 +G +Y+K +GH Y++T D S G DY A +ID T+ P++ VA + +N +L P Sbjct: 294 NGLSMYEKTIQGHTYVITADVSRGVSGDYSAFLVIDTTTIPYKLVAKYRNNDIKPILFPN 353 Query: 427 IIMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVI---MEERA--------SGGRR 475 II+ A YN A+V E+ G V + + DLEY+N++ M RA SG + Sbjct: 354 
IIVDVARNYNHAFVLVEVNDVGGQVADIIQYDLEYDNLLMCAMRGRAGQQLGQGFSGKKT 413 Query: 476 GLGLKPNKKTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEGFHDDLVM 535 +G+K + TK +GCS LK L+E D+ +N + E TF++KG++++AEEG +DDL M Sbjct: 414 QMGIKMSSATKQVGCSNLKALLEDDKFLLNDYDCISELTTFIQKGQTFQAEEGCNDDLAM 473 Query: 536 SLTLLAYLSTQDRFSDFVEKEYNVSYDIFKQEVHDMMDDDVPFLMIADGI-ENYGTD 591 + + A+++ Q F + + + V I+ + + D PF + DG+ E Y D Sbjct: 474 CMVIFAWMAMQPYFKELHDND--VRQRIYDDQREAIEQDMAPFGFMDDGLGEEYFAD 528 >gi|16897|lcl|protein:vir:5867 Length: 508 # NCBI annotation: similar to terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835653;genbank:gi:30044056 Length = 508 Score = 149 bits (375), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 113/352 (32%), Positives = 175/352 (49%), Gaps = 27/352 (7%) Query: 92 IKAEFQKCRDDIVYFAENYCSIVHIDLGNIKMVPRPYQKEMLEVADRSRFSIFLLPRQLG 151 IK E +KC++D +YF Y I H I P Q++++ R+ I PRQ+G Sbjct: 7 IKQELEKCKNDPIYFIRKYVKIQHPIKRVIPFDLYPIQEKLINFYHTHRYVITEKPRQMG 66 Query: 152 KTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLERVKNVIENLPDFLQPGIEEWNKGN 211 T + H ++FN + + I A+K + + VLER+K E LP FLQ WNK Sbjct: 67 VTWCAVAYALHQMIFNSNYKVLIAANKEATAKNVLERIKFAYEQLPRFLQIKKRTWNKTY 126 Query: 212 ITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPGFDDFWKATFPVISSGEESKVVLT 271 I F N A +S SD+ R +S +++ V+E AF+ ++ W + +++G K ++ Sbjct: 127 IEFSNYSSARAVSSKSDSGRSESITLLIVEEAAFISNMEELWASVQQTLATG--GKCIVN 184 Query: 272 STPNGL-NHYHDMWNAAVQGISTFEPYTTTWRAVQNRLYKDGEFDDGEAFKRETIGNTSR 330 ST NG+ N Y AA +G S F+ + W D D + F+ + R Sbjct: 185 STYNGVGNWYERTIRAAKEGKSEFKYFGIKW--------SDHPERDEKWFEEQKRLLPPR 236 Query: 331 EAFSQEHLCNFLGTAGTLINGFKLSKMKGID--VVK-DSDGWCVYKKPEEGHKYILTVDT 387 F+QE LC G+ +I + + + ID VVK D W Y+KP G+ Y ++VD Sbjct: 237 -VFAQEILCIPQGSGENVIPFHLIREEEFIDPFVVKYGGDYWEWYRKP--GY-YFISVDP 292 Query: 388 SEGRGQDYHALHM----IDVTSYPFEQVAVFHDNKTSHLLLPAI--IMKQAY 433 + GRG+D A+ + +D + EQVA F +KTS LP + ++KQ Y Sbjct: 293 
ASGRGEDRSAVGVQVLWVDPQTLTIEQVAEFASDKTS---LPVMRQVIKQIY 341 >gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: terminase, large subunit # Family: family:all:147 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006983;genbank:gi:46401884;genbank:GeneID :2777679 Length = 438 Score = 46.6 bits (109), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 61/258 (23%), Positives = 105/258 (40%), Gaps = 31/258 (12%) Query: 138 RSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGI----LAHKGSMSMEVLERVKNVI 193 R RF + R++GK+ F+A+ L F + E + +A S++ +++ +I Sbjct: 53 RHRFVTACVSRRVGKS-----FIAYTLGFLKLLEPNVKVLVVAPNYSLANIGWSQIRGLI 107 Query: 194 ENLPDFLQPGIEEWNKGNITFDNGCKLG-AYASGSDAVRGKSFSMIYVDECAFVPGFDDF 252 + LQ E I NG A A+ +D+ G+S+ I DE A D Sbjct: 108 KKYG--LQTERENAKDKEIELANGSLFKLASAAQADSAVGRSYDFIIFDEAAISDVGGDA 165 Query: 253 WKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRAVQNRLYKD- 311 ++ SK + STP G N + + + F+ W ++ Y+D Sbjct: 166 FRVQLRPTLDKPNSKALFISTPRGGNWFKEFYAYG------FDDTLPNWVSIHG-TYRDN 218 Query: 312 --GEFDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVVKDSDGW 369 + +D E +R S+ F QE+ +F G + + F ID VKD G Sbjct: 219 PRADLNDIEEARR----TVSKNYFRQEYEADFSVFEGQIFDTF-----NAIDHVKDLKGM 269 Query: 370 CVYKKPEEGHKYILTVDT 387 + K +E + +L +D Sbjct: 270 RHFFKDDEAFETLLGIDV 287 >gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: phage terminase large subunit # Family: family:all:147 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468054;genbank:gi:157265496;genbank:Ge neID:5600564 Length = 485 Score = 42.4 bits (98), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 75/344 (21%), Positives = 133/344 (38%), Gaps = 61/344 (17%) Query: 221 GAYASGSDAVRGKSFSMIYVDECAFVPGFDDFWKATFPVISSGEESKVVLTSTPNGLNHY 280 G A D +RG + + +DE A +P F + +A P +S + ++ STP GLN + Sbjct: 129 GKSADRPDNLRGATLDFVILDEAAMIP-FSVWSEAIEPTLSV-RDGWALIISTPKGLNWF 186 Query: 281 HDM----WNAAVQ------GISTFEPYTTTWRAVQNRLYKDGEFDDGEAFKRETIGNTSR 330 ++ W ++ GI+ P ++ A ++ E + +R I + Sbjct: 187 YEFFLMGWRGGLKEGIPNSGINQTHPDFESFHAASWDVWP--ERREWYMERRLYIPDLE- 243 Query: 331 EAFSQEHLCNFLGTAGTLINGFKLSKM-----KGIDVVKDSDGWCVYKKPEEGHKYILTV 385 F QE+ F+ + ++ +G + + +G +V + +P+ H Y + Sbjct: 244 --FRQEYGAEFVSHSNSVFSGLDMLILLPYERRGTRLVVED------YRPD--HIYCIGA 293 Query: 386 DTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAIIMKQAYRYNEAYVYCEIA 445 D G+ QDY ++D+ + V + N + A + + Y AYV + Sbjct: 294 DF--GKNQDYSVFSVLDLDTGAI--VCLERMNGATWSDQVARLKALSEDYGHAYVVADTW 349 Query: 446 STGELVMNELFRDLEYENVIMEERASGGRRGLGLKP----NKKTKAIGCSTLKDLIEKDQ 501 G+ + EL +G+ P + K S L L+EK Q Sbjct: 350 GVGDAIAEEL-----------------DAQGINYTPLPVKSSSVKEQLISNLALLMEKGQ 392 Query: 502 LKI-NHIPTLKEFHTF-----VEKGKSWEAEEGFHDDLVMSLTL 539 + + N L E F + A HDD+VMSL L Sbjct: 393 VAVPNDKTILDELRNFRYYRTASGNQVMRAYGRGHDDIVMSLAL 436 >gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: terminase large subunit # Family: family:all:147 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467938;genbank:gi:157265379;genbank:Ge neID:5600506 Length = 485 Score = 42.0 bits (97), Expect = 2e-05, Method: Compositional matrix adjust. 
Identities = 74/344 (21%), Positives = 133/344 (38%), Gaps = 61/344 (17%) Query: 221 GAYASGSDAVRGKSFSMIYVDECAFVPGFDDFWKATFPVISSGEESKVVLTSTPNGLNHY 280 G A D +RG + + +DE A +P F + +A P +S + ++ STP GLN + Sbjct: 129 GKSADRPDNLRGATLDFVILDEAAMIP-FSVWSEAIEPTLSV-RDGWALIISTPKGLNWF 186 Query: 281 HDM----WNAAVQ------GISTFEPYTTTWRAVQNRLYKDGEFDDGEAFKRETIGNTSR 330 ++ W ++ G++ P ++ A ++ E + +R I + Sbjct: 187 YEFFLMGWRGGLKEGIPNSGVNQTHPDFESFHAASWDVWP--ERREWYMERRLYIPDLE- 243 Query: 331 EAFSQEHLCNFLGTAGTLINGFKLSKM-----KGIDVVKDSDGWCVYKKPEEGHKYILTV 385 F QE+ F+ + ++ +G + + +G +V + +P+ H Y + Sbjct: 244 --FRQEYGAEFVSHSNSVFSGLDMLILLPYERRGTRLVVED------YRPD--HIYCIGA 293 Query: 386 DTSEGRGQDYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAIIMKQAYRYNEAYVYCEIA 445 D G+ QDY ++D+ + V + N + A + + Y AYV + Sbjct: 294 DF--GKNQDYSVFSVLDLDTGAI--VCLERMNGATWSDQVARLKALSEDYGHAYVVADTW 349 Query: 446 STGELVMNELFRDLEYENVIMEERASGGRRGLGLKP----NKKTKAIGCSTLKDLIEKDQ 501 G+ + EL +G+ P + K S L L+EK Q Sbjct: 350 GVGDAIAEEL-----------------DAQGINYTPLPVKSSSVKEQLISNLALLMEKGQ 392 Query: 502 LKI-NHIPTLKEFHTF-----VEKGKSWEAEEGFHDDLVMSLTL 539 + + N L E F + A HDD+VMSL L Sbjct: 393 VAVPNDKTILDELRNFRYYRTASGNQVMRAYGRGHDDIVMSLAL 436 >gi|1931|lcl|protein:vir:99847 Length: 486 # NCBI annotation: large terminase subunit # Family: family:all:460 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164067;genbank:gi:56692599;genbank:GeneID :3192575 Length = 486 Score = 40.0 bits (92), Expect = 9e-05, Method: Compositional matrix adjust. 
Identities = 54/254 (21%), Positives = 97/254 (38%), Gaps = 39/254 (15%) Query: 119 GNIKMVPR-------PYQKEMLEVADRSRFSIFLLPRQLG-----------KTTIMGIFL 160 GN K++P PYQ + D SR + RQ+G +T + Sbjct: 5 GNAKVIPANPDAIFLPYQSRW--ITDPSRLKLMQKSRQIGLSWSTAYAAGERTAAESARV 62 Query: 161 AHYLVFNEDKEAGILAHKGSMSMEVLERVKNVIENLPDFLQPGIEEWNKGNITFDNGCKL 220 ++ +D +A + M ++ + + + ++ I + + F NG ++ Sbjct: 63 DQWVSSRDDLQARLFLEDCKMWAGIMNQAAKDLGEIVIDVKNKISAYV---LEFANGRRI 119 Query: 221 GAYASGSDAVRGKSFSMIYVDECAFVPGFDDFWKATFPVISSGEESKVVLTSTPNGLNHY 280 + +S DA GK I +DE A P W +P I+ G +++ T N + Sbjct: 120 HSMSSNPDAQAGKRGGRI-LDEFALHPDPRKLWSIAYPGITWGGAMEIISTHR-GSQNFF 177 Query: 281 HDMWNAAVQG--ISTFEPYTTTWR---------AVQNRLYKDGEF---DDGEAFKRETIG 326 + + V+G +T T + +Q L D E D+ + F G Sbjct: 178 NQLVREIVEGGNPKNISLHTVTLQDALNQGFLFKLQQMLPADDEIQGMDEAQYFDFIRAG 237 Query: 327 NTSREAFSQEHLCN 340 E+F QE++CN Sbjct: 238 CADEESFQQEYMCN 251 >gi|17621|lcl|protein:vir:816 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050549;genbank:gi:9633446;genbank:GeneID: 1262208 Length = 568 Score = 38.1 bits (87), Expect = 3e-04, Method: Compositional matrix adjust. 
Identities = 48/174 (27%), Positives = 82/174 (47%), Gaps = 13/174 (7%) Query: 371 VYKKPEEGHKYILTVDTSEGRGQ-DYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAIIM 429 V++ P+ +Y+ DT+EG D +L ++ ++ EQVA + + + L A ++ Sbjct: 370 VWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSN--GEQVAHWFGHLDAELF--AHLI 425 Query: 430 KQAYR-YNEAYVYCEIASTGELVMNELFRDLE-----YENVIMEERASGGRRGLGLKPNK 483 Q R YN A+V E + G V+ +L R+L Y +++ LG + Sbjct: 426 SQVCRMYNNAFVGPERNNHGHAVILKL-RELYPTRYIYNEQHLDQAYDDDTPRLGWLTTR 484 Query: 484 KTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGK-SWEAEEGFHDDLVMS 536 ++K + +K L+ I TL E +T+V K S A+EG DD +MS Sbjct: 485 QSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQEGCFDDQLMS 538 >gi|25680|lcl|protein:vir:10116 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859246;genbank:gi:32171173;genbank:GeneID :2653342 Length = 568 Score = 38.1 bits (87), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 48/174 (27%), Positives = 82/174 (47%), Gaps = 13/174 (7%) Query: 371 VYKKPEEGHKYILTVDTSEGRGQ-DYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAIIM 429 V++ P+ +Y+ DT+EG D +L ++ ++ EQVA + + + L A ++ Sbjct: 370 VWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSN--GEQVAHWFGHLDAELF--AHLI 425 Query: 430 KQAYR-YNEAYVYCEIASTGELVMNELFRDLE-----YENVIMEERASGGRRGLGLKPNK 483 Q R YN A+V E + G V+ +L R+L Y +++ LG + Sbjct: 426 SQVCRMYNNAFVGPERNNHGHAVILKL-RELYPTRYIYNEQHLDQAYDDDTPRLGWLTTR 484 Query: 484 KTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGK-SWEAEEGFHDDLVMS 536 ++K + +K L+ I TL E +T+V K S A+EG DD +MS Sbjct: 485 QSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQEGCFDDQLMS 538 >gi|51|lcl|protein:vir:3295 Length: 568 # NCBI annotation: putative large subunit terminase # Family: family:all:147 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049511;genbank:gi:9632517;genbank:GeneID: 1262002 Length = 568 Score = 38.1 bits (87), Expect = 3e-04, Method: Compositional matrix adjust. 
Identities = 48/174 (27%), Positives = 82/174 (47%), Gaps = 13/174 (7%) Query: 371 VYKKPEEGHKYILTVDTSEGRGQ-DYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAIIM 429 V++ P+ +Y+ DT+EG D +L ++ ++ EQVA + + + L A ++ Sbjct: 370 VWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSN--GEQVAHWFGHLDAELF--AHLI 425 Query: 430 KQAYR-YNEAYVYCEIASTGELVMNELFRDLE-----YENVIMEERASGGRRGLGLKPNK 483 Q R YN A+V E + G V+ +L R+L Y +++ LG + Sbjct: 426 SQVCRMYNNAFVGPERNNHGHAVILKL-RELYPTRYIYNEQHLDQAYDDDTPRLGWLTTR 484 Query: 484 KTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGK-SWEAEEGFHDDLVMS 536 ++K + +K L+ I TL E +T+V K S A+EG DD +MS Sbjct: 485 QSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQEGCFDDQLMS 538 >gi|26240|lcl|protein:vir:2763 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612880;genbank:gi:20065962;genbank:GeneID :935714 Length = 568 Score = 38.1 bits (87), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 48/174 (27%), Positives = 82/174 (47%), Gaps = 13/174 (7%) Query: 371 VYKKPEEGHKYILTVDTSEGRGQ-DYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAIIM 429 V++ P+ +Y+ DT+EG D +L ++ ++ EQVA + + + L A ++ Sbjct: 370 VWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSN--GEQVAHWFGHLDAELF--AHLI 425 Query: 430 KQAYR-YNEAYVYCEIASTGELVMNELFRDLE-----YENVIMEERASGGRRGLGLKPNK 483 Q R YN A+V E + G V+ +L R+L Y +++ LG + Sbjct: 426 SQVCRMYNNAFVGPERNNHGHAVILKL-RELYPTRYIYNEQHLDQAYDDDTPRLGWLTTR 484 Query: 484 KTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGK-SWEAEEGFHDDLVMS 536 ++K + +K L+ I TL E +T+V K S A+EG DD +MS Sbjct: 485 QSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQEGCFDDQLMS 538 >gi|25849|lcl|protein:vir:9949 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859079;genbank:gi:32171001;genbank:GeneID :2653280 Length = 568 Score = 38.1 bits (87), Expect = 3e-04, Method: Compositional matrix adjust. 
Identities = 48/174 (27%), Positives = 82/174 (47%), Gaps = 13/174 (7%) Query: 371 VYKKPEEGHKYILTVDTSEGRGQ-DYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAIIM 429 V++ P+ +Y+ DT+EG D +L ++ ++ EQVA + + + L A ++ Sbjct: 370 VWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSN--GEQVAHWFGHLDAELF--AHLI 425 Query: 430 KQAYR-YNEAYVYCEIASTGELVMNELFRDLE-----YENVIMEERASGGRRGLGLKPNK 483 Q R YN A+V E + G V+ +L R+L Y +++ LG + Sbjct: 426 SQVCRMYNNAFVGPERNNHGHAVILKL-RELYPTRYIYNEQHLDQAYDDDTPRLGWLTTR 484 Query: 484 KTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGK-SWEAEEGFHDDLVMS 536 ++K + +K L+ I TL E +T+V K S A+EG DD +MS Sbjct: 485 QSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQEGCFDDQLMS 538 >gi|1476|lcl|protein:vir:104436 Length: 568 # NCBI annotation: putative terminase large subunit # Family: family:all:147 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794060;genbank:gi:116222005;genbank:GeneI D:4397501 Length = 568 Score = 38.1 bits (87), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 48/174 (27%), Positives = 82/174 (47%), Gaps = 13/174 (7%) Query: 371 VYKKPEEGHKYILTVDTSEGRGQ-DYHALHMIDVTSYPFEQVAVFHDNKTSHLLLPAIIM 429 V++ P+ +Y+ DT+EG D +L ++ ++ EQVA + + + L A ++ Sbjct: 370 VWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSN--GEQVAHWFGHLDAELF--AHLI 425 Query: 430 KQAYR-YNEAYVYCEIASTGELVMNELFRDLE-----YENVIMEERASGGRRGLGLKPNK 483 Q R YN A+V E + G V+ +L R+L Y +++ LG + Sbjct: 426 SQVCRMYNNAFVGPERNNHGHAVILKL-RELYPTRYIYNEQHLDQAYDDDTPRLGWLTTR 484 Query: 484 KTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGK-SWEAEEGFHDDLVMS 536 ++K + +K L+ I TL E +T+V K S A+EG DD +MS Sbjct: 485 QSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQEGCFDDQLMS 538 >gi|4674|lcl|protein:vir:105593 Length: 543 # NCBI annotation: terminase large subunit # Family: family:all:147 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164303;genbank:gi:56692921;genbank:GeneID :3197204 Length = 543 Score = 37.4 bits (85), Expect = 5e-04, Method: Compositional matrix adjust. 
Identities = 19/58 (32%), Positives = 33/58 (56%), Gaps = 1/58 (1%) Query: 142 SIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVL-ERVKNVIENLPD 198 ++ L RQLG TT++ I + +FN D+ GI+A + + ++VK +NLP+ Sbjct: 81 NLILKARQLGFTTLIAIMWLDHALFNGDQRCGIIAQDRDAAKVIFRDKVKFAYDNLPE 138 >gi|1431|lcl|protein:vir:93626 Length: 530 # NCBI annotation: Bcep22gp49 # Family: family:all:147 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944278;genbank:gi:38640355;genbank:GeneID :2658275 Length = 530 Score = 36.6 bits (83), Expect = 9e-04, Method: Compositional matrix adjust. Identities = 30/111 (27%), Positives = 52/111 (46%), Gaps = 10/111 (9%) Query: 144 FLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVL-ERVKNVIENLPDFLQP 202 L RQLG TT++ I + +FN + GI+A + + ++VK +NLP+ L+ Sbjct: 74 ILKARQLGFTTLICIIWLDHALFNANSRCGIIAQDRETAEALFRDKVKFAYDNLPEALRE 133 Query: 203 GIEEWN--KGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDE----CAFVP 247 + N K + F + A+ +VRG + +++ E CA P Sbjct: 134 AMPLANCTKAELLFAHNNSSIRVAT---SVRGGTIHRLHISEFGKICAKYP 181 >gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: terminase large subunit # Family: family:all:4877 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429607;genbank:gi:156564098;genbank:Ge neID:5525534 Length = 635 Score = 35.4 bits (80), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 40/177 (22%), Positives = 74/177 (41%), Gaps = 16/177 (9%) Query: 120 NIKMVPRPYQKEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAG------ 173 +K R YQ+ ML+ S+ ++ L R+LGKT M I + + +K Sbjct: 63 TLKWFCRDYQEPMLQEMADSKRTVLRLGRRLGKTETMCIMILWHAFTQPNKGPNNQYDIL 122 Query: 174 ILAHKGSMSMEVLERVKNVIENLPDFLQPGIEEWNKGNITFDNGCKLGAYASGSDA---- 229 I+A + +R+ +I+ D + P + +I NG + +GS + Sbjct: 123 IIAPYEEQVDLIFKRLSQLIDMSGD-VNPSRD--IDKHIELPNGTVIHGITAGSKSGSGA 179 Query: 230 --VRGKSFSMIYVDECAFVPGFDDFWKATFPVISSGEESKVVLTSTPNGLNHYHDMW 284 RG+ +I +DE ++ G + + E K+++ STP+G + W Sbjct: 180 ANTRGQRADLIVLDEMDYM-GESEITNIMNIRNEAPERIKMIVASTPSGRRDSYYKW 235 >gi|26461|lcl|protein:vir:573 Length: 589 # NCBI annotation: unknown # Family: family:all:4926 # MgeID: mge:13 # MgeName: SPBc2 # Cross-refs: genbank:acc:NP_046608;genbank:gi:9630181;genbank:GeneID: 1261423 Length = 589 Score = 32.3 bits (72), Expect = 0.017, Method: Compositional matrix adjust. Identities = 26/110 (23%), Positives = 51/110 (46%), Gaps = 6/110 (5%) Query: 141 FSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLERVKNVIENLPDFL 200 + ++L R GKT + ++ + + I + + EV+E++ ++ + P+ L Sbjct: 79 YFMYLASRGQGKTWLTSVYCCVQAILFPGTKIVIASGTKGQAREVIEKIDDLRKESPN-L 137 Query: 201 QPGIEEW----NKGNITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFV 246 + IE+ N + F NG + AS +D R K +++ VDE V Sbjct: 138 RREIEDLKTSTNDAKVEFHNGSWIKIVAS-NDGARSKRANLLIVDEFRMV 186 >gi|14676|lcl|protein:vir:2499 Length: 474 # NCBI annotation: putative terminase gp4 # Family: family:all:523 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569740;genbank:gi:18496890;genbank:GeneID :932339 Length = 474 Score = 30.4 bits (67), Expect = 0.056, Method: Compositional matrix adjust. 
Identities = 41/169 (24%), Positives = 67/169 (39%), Gaps = 27/169 (15%) Query: 117 DLGNIKMVPRPYQKEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILA 176 DLG + R + L AD SI PRQ GKT ++G + + + A Sbjct: 46 DLGKLICAKR---DDGLYAADMFAMSI---PRQTGKTYLLGALVFALCIKTPNTTVIWTA 99 Query: 177 HKGSMSMEVLERVKNVIENLPDFLQPGIEEWNKGN----ITFDNGCKL--GAYASGSDAV 230 H+ + E ++ + + D + P I + GN + F NG ++ GA G Sbjct: 100 HRTRTAAETFRSMQGLAKR--DKIAPHILNVHTGNGKEAVLFKNGSRILFGARERGF--- 154 Query: 231 RGKSFSMIYV---DECAFVP--GFDDFWKATFPVISSGEESKVVLTSTP 274 G+ F+ + V DE + DD P ++ ++L TP Sbjct: 155 -GRGFAGVDVLIFDEAQILTENAMDDM----VPATNAAPNPLILLAGTP 198 >gi|19975|lcl|protein:vir:108214 Length: 544 # NCBI annotation: gp5 # Family: family:all:523 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552334;genbank:gi:160700654;genbank:Ge neID:5758934 Length = 544 Score = 29.6 bits (65), Expect = 0.092, Method: Compositional matrix adjust. Identities = 23/87 (26%), Positives = 39/87 (44%), Gaps = 11/87 (12%) Query: 144 FLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLERVKNVIENLPDFLQ-- 201 ++PRQ GKT ++ + + + L F +K A + +V +R+ +I+ P L+ Sbjct: 102 LIIPRQNGKTQLIALRIIYGLFFLGEKIV-YTAQRWQTVKDVYDRIVEIIKRRPSLLRRL 160 Query: 202 ---PGI-----EEWNKGNITFDNGCKL 220 PG+ E G I NG L Sbjct: 161 KPMPGVPDGYSEAGQHGEIYTTNGGSL 187 >gi|7504|lcl|protein:vir:99915 Length: 478 # NCBI annotation: gp2 # Family: family:all:523 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655519;genbank:gi:109392289;genbank:GeneI D:4157084 Length = 478 Score = 28.1 bits (61), Expect = 0.27, Method: Compositional matrix adjust. 
Identities = 37/148 (25%), Positives = 58/148 (39%), Gaps = 29/148 (19%) Query: 142 SIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGILAHK--------GSM-SMEVLERVKNV 192 S+ +PRQ+GKT ++G + + AH+ GSM +M V Sbjct: 63 SVISIPRQVGKTYLIGCIVFALALLTPGLTVIWTAHRTKTAKETFGSMKAMCATPLVNAH 122 Query: 193 IENLPDFL-QPGIEEWNKGNITF---DNGCKLGAYASGSDAVRGKSFSMIYVDECAFVP- 247 + N+ D GI N I F +NG LG G ++ +DE + Sbjct: 123 VRNVSDARGDEGIYLHNGSRILFGARENGFGLGFAGVG----------ILVLDEAQRLTD 172 Query: 248 -GFDDFWKATFPVISSGEESKVVLTSTP 274 DD P +++ E ++LT TP Sbjct: 173 KAMDDL----IPTMNTVENPLILLTGTP 196 >gi|27798|lcl|protein:vir:8408 Length: 214 # NCBI annotation: gp3 # Family: family:all:30874 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818304;genbank:gi:29566740;genbank:GeneID :1260058 Length = 214 Score = 27.3 bits (59), Expect = 0.51, Method: Compositional matrix adjust. Identities = 13/26 (50%), Positives = 16/26 (61%) Query: 128 YQKEMLEVADRSRFSIFLLPRQLGKT 153 +QK + E D+S S F PRQ GKT Sbjct: 6 HQKLIHETIDKSSISAFAAPRQNGKT 31 >gi|11523|lcl|protein:vir:78781 Length: 604 # NCBI annotation: putative terminase large subunit # Family: family:all:169 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285645;genbank:gi:148727151;genbank:Ge neID:5220131 Length = 604 Score = 26.9 bits (58), Expect = 0.69, Method: Compositional matrix adjust. 
Identities = 15/78 (19%), Positives = 35/78 (44%), Gaps = 2/78 (2%) Query: 210 GN-ITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPGFDDFWKATFPVISSGEESKV 268 GN I NG +L ++ S++ + +S +Y+DE ++P F+ + + K Sbjct: 235 GNPIVLSNGAELHFCSTNSNSAQSRS-GNVYIDEYFWIPNFEKLSDVASAMATQSHWRKT 293 Query: 269 VLTSTPNGLNHYHDMWNA 286 ++ + ++ + W Sbjct: 294 YFSTPSSKVHEAYRFWTG 311 >gi|10294|lcl|protein:vir:105656 Length: 405 # NCBI annotation: putative large terminase subunit # Family: family:all:697 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425018;genbank:gi:83571766;uniprot:Q2WC34 ;genbank:GeneID:3837297 Length = 405 Score = 26.2 bits (56), Expect = 1.0, Method: Compositional matrix adjust. Identities = 23/115 (20%), Positives = 54/115 (46%), Gaps = 8/115 (6%) Query: 89 LSEIKAEFQKCRDDIVYFAENYCSIVH-IDLGNIKMVPRPYQKEMLE-VADRSRFSIFLL 146 L E++ F + ++ FA+ +++H + GN ++ Q ++L+ + ++ + Sbjct: 18 LQELQQTFPYTAEGLLLFAD---TVIHNLIAGNPHLIR--MQADILKFLFYGHKYRLIEA 72 Query: 147 PRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLERVKNVIENLPDFLQ 201 PR + KTT+ I+ ++ K +++ + E+ V + L DFL+ Sbjct: 73 PRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFRGL-DFLE 126 >gi|8927|lcl|protein:vir:97005 Length: 632 # NCBI annotation: 40 # Family: family:all:697 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654141;genbank:gi:108862025;genbank:GeneI D:5075954 Length = 632 Score = 26.2 bits (56), Expect = 1.3, Method: Compositional matrix adjust. 
Identities = 23/115 (20%), Positives = 54/115 (46%), Gaps = 8/115 (6%) Query: 89 LSEIKAEFQKCRDDIVYFAENYCSIVH-IDLGNIKMVPRPYQKEMLE-VADRSRFSIFLL 146 L E++ F + ++ FA+ +++H + GN ++ Q ++L+ + ++ + Sbjct: 18 LQELQQTFPYTAEGLLLFAD---TVIHNLIAGNPHLIR--MQADILKFLFYGHKYRLIEA 72 Query: 147 PRQLGKTTIMGIFLAHYLVFNEDKEAGILAHKGSMSMEVLERVKNVIENLPDFLQ 201 PR + KTT+ I+ ++ K +++ + E+ V + L DFL+ Sbjct: 73 PRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAKRAEEIAGWVVKIFRGL-DFLE 126 >gi|10061|lcl|protein:vir:99071 Length: 537 # NCBI annotation: gp26 # Family: family:all:523 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655891;genbank:gi:109521463;genbank:GeneI D:4158036 Length = 537 Score = 25.8 bits (55), Expect = 1.4, Method: Compositional matrix adjust. Identities = 18/60 (30%), Positives = 28/60 (46%), Gaps = 3/60 (5%) Query: 351 GFKLSKMKGIDVVKDSDGWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQ 410 GF L G++VV Y++P EGH +I TS G + LH + P+++ Sbjct: 10 GFSLGNDDGVNVVVGER--YGYQRPPEGHCFIPPYTTSLGDKAMWF-LHQVGFELDPWQE 66 Score = 24.3 bits (51), Expect = 3.9, Method: Compositional matrix adjust. Identities = 38/151 (25%), Positives = 62/151 (41%), Gaps = 22/151 (14%) Query: 142 SIFLLPRQLGKTTIMGI--FLAHYLVFNEDKEAGILAHKGSMSMEVLERVKNVIENLPDF 199 ++ L+PRQ GKT I+ + Y+V DK A + + E R+K IEN Sbjct: 86 ALLLVPRQNGKTAIIEARELVGLYVVC--DKLCIHTAVLFNAARESFYRLKARIENNETL 143 Query: 200 LQPGIEEWNKGNITF-------------DNGCKLGAYASGSDAVRGKSFSMIYVDECAFV 246 + I + GN + G ++ A G+ RG S +I +DE AF Sbjct: 144 NK--ITRFRSGNDNMSIEVKPKKESRHPNAGGRVIYMARGTAVARGFSADVIVLDE-AFA 200 Query: 247 PGFDDFWKATFPVISSGEESKVVLTSTPNGL 277 D+ A +S + ++ ++ GL Sbjct: 201 --LDEASIAAIDYATSARANPFIIYASSTGL 229 >gi|9070|lcl|protein:vir:102238 Length: 588 # NCBI annotation: gp8 # Family: family:all:147 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655204;genbank:gi:109522784;genbank:GeneI D:4157477 Length = 588 Score = 25.8 bits (55), Expect = 1.6, Method: Compositional matrix adjust. 
Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 3/32 (9%)

Query: 271 TSTPNGLNHYHDMWNAAVQGISTFEPYTTTWR 302
           TSTP G NH+HD + G P +WR
Sbjct: 197 TSTPEGKNHFHDKFQ---MGQDPNNPEWESWR 225

>gi|8220|lcl|protein:vir:101493 Length: 588 # NCBI annotation: gp8 # Family: family:all:147 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655387;genbank:gi:109522575;genbank:GeneID:4157565
          Length = 588

Score = 25.8 bits (55), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 3/32 (9%)

Query: 271 TSTPNGLNHYHDMWNAAVQGISTFEPYTTTWR 302
           TSTP G NH+HD + G P +WR
Sbjct: 197 TSTPEGKNHFHDKFQ---MGQDPNNPEWESWR 225

>gi|6719|lcl|protein:vir:99411 Length: 447 # NCBI annotation: hypothetical protein # Family: family:all:32729 # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919076;genbank:gi:119757034;genbank:GeneID:4606064
          Length = 447

Score = 25.4 bits (54), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 38/166 (22%), Positives = 61/166 (36%), Gaps = 21/166 (12%)

Query: 222 AYASGSDA---VRGKSFSMIYVDECAFVPGFDDFWKATFPVISSGE----ESKVVLTSTP 274
           A+ G+D G + I+ DE P D + +I+ + + TST
Sbjct: 122 AWLGGADKWNRFAGGEYCRIWCDEVGHYPPNTDLYDLHEMLITRQRTEIGPNTTLWTSTG 181

Query: 275 NGLNHYHDMWNAAVQGISTFEPYTTTWRAV-----QNRLYKDGEFDDGEAFKRETIGNTS 329
           NG N ++D+ V P+ V N L DG R T+
Sbjct: 182 NGFNQFYDITERQVNADDEPLPWADQMEVVVASTEHNTLLP----PDGLDKIRRQFKGTA 237

Query: 330 REAFSQEHLCNFLGTAGTLINGF-KLSKMKGIDVVKD--SDGWCVY 372
           RE Q F G + + F + + ++ D V+D +D W +Y
Sbjct: 238 RE--EQGLHGGFAAAEGLVYDAFTRQTHVRDADDVRDRLADDWAMY 281

>gi|16385|lcl|protein:vir:3744 Length: 607 # NCBI annotation: orf15 # Family: family:all:169 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043485;genbank:gi:9628620;genbank:GeneID:1261142
          Length = 607

Score = 25.0 bits (53), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 13/39 (33%), Positives = 24/39 (61%), Gaps = 1/39 (2%)

Query: 329 SREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVVKDSD 367
           S+ AF+Q ++C ++ A ++ N +L K G+D+ K D
Sbjct: 376 SKYAFNQLYMCIWIDDADSIFNVKQLLKC-GVDIAKWKD 413

>gi|25398|lcl|protein:vir:8929 Length: 717 # NCBI annotation: ORF025 # Family: family:all:1546 # MgeID: mge:163 # MgeName: phiKZ # Cross-refs: genbank:acc:NP_803591;genbank:gi:29134961;genbank:GeneID:1258246
          Length = 717

Score = 24.6 bits (52), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 13/41 (31%), Positives = 19/41 (46%)

Query: 508 PTLKEFHTFVEKGKSWEAEEGFHDDLVMSLTLLAYLSTQDR 548
           P E + + +G HDDLV+SL L +L Q +
Sbjct: 571 PLSTELLALTIRNGRIDHAKGNHDDLVVSLLLAHWLLIQGK 611

>gi|1165|lcl|protein:vir:102941 Length: 421 # NCBI annotation: terminase, large subunit # Family: family:all:54 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945278;genbank:gi:39653713;goa:Q708N4;interpro:IPR004921;interpro:IPR006437;uniprot:Q708N4;genbank:GeneID:2672855
          Length = 421

Score = 24.3 bits (51), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 11/46 (23%), Positives = 25/46 (54%), Gaps = 2/46 (4%)

Query: 474 RRGLGLKPNKKTKAIGCSTLKDLIEKDQLKINH--IPTLKEFHTFV 517
           +RG +K + G + ++ ++++ ++ + TLKEFH +V
Sbjct: 339 KRGYKIKKARNNVLEGIRFVGSMLGQEKIAVHESCVNTLKEFHAYV 384

>gi|27125|lcl|protein:vir:6595 Length: 164 # NCBI annotation: tail tube protein # Family: family:all:1107 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891726;genbank:gi:33620609;genbank:GeneID:1725319
          Length = 164

Score = 23.5 bits (49), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 9/19 (47%), Positives = 11/19 (57%)

Query: 30 RYLDDWKVLNRMDKAHKLR 48
          R DDW + D AHK+R
Sbjct: 66 RVYDDWTITVFNDDAHKIR 84

>gi|25350|lcl|protein:vir:80985 Length: 164 # NCBI annotation: gp19 tail tube protein # Family: family:all:1107 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469500;genbank:gi:157311457;genbank:GeneID:5602118
          Length = 164

Score = 23.5 bits (49), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 9/19 (47%), Positives = 11/19 (57%)

Query: 30 RYLDDWKVLNRMDKAHKLR 48
          R DDW + D AHK+R
Sbjct: 66 RVYDDWTITVFNDDAHKIR 84

>gi|18781|lcl|protein:vir:7429 Length: 581 # NCBI annotation: gp6 # Family: family:all:147 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818544;genbank:gi:29566981;genbank:GeneID:1260231
          Length = 581

Score = 23.5 bits (49), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 12/34 (35%), Positives = 17/34 (50%), Gaps = 6/34 (17%)

Query: 270 LTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRA 303
           T+TP G N Y+D+ A++ P T W A
Sbjct: 194 FTTTPEGKNWYYDLHQKALR------PSTLNWSA 221

>gi|10332|lcl|protein:vir:97407 Length: 514 # NCBI annotation: terminase # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762597;genbank:gi:115304298;genbank:GeneID:5130610
          Length = 514

Score = 23.5 bits (49), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 26/116 (22%), Positives = 46/116 (39%), Gaps = 22/116 (18%)

Query: 200 LQPGIEEWNKG---NITFDNGCKLGAYASGSDAVRGKSFSMIYVDECAFVPGFDDF---- 252
           ++ G+E+ N+G IT +N + GAY D RG + M + + DD+
Sbjct: 183 IREGLEK-NEGLLLTITSNNKVRGGAYDEELDTFRGYNDEMDNFKQWGLIFELDDYSQIE 241

Query: 253 ----WKATFPVI--SSGEESKVVLTSTPNGLNH--------YHDMWNAAVQGISTF 294
           W P + + G S +T N H + +N +V +++F
Sbjct: 242 DPNEWYKANPALDEAKGTVSTETITRELNAARHSTIKANNLFSKRFNFSVNAVTSF 297

  Database: capsid_neck_tail
    Posted date: Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database: 514

Lambda     K        H
0.319      0.136    0.409

Lambda     K        H
0.267      0.0410   0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 283,730
Number of Sequences: 514
Number of extensions: 13641
Number of successful extensions: 191
Number of sequences better than 100.0: 56
Number of HSP's better than 100.0 without gapping: 39
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 55
Number of HSP's gapped (non-prelim): 70
length of query: 600
length of database: 206,069
effective HSP length: 77
effective length of query: 523
effective length of database: 166,491
effective search space: 87074793
effective search space used: 87074793
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 40 (20.0 bits)
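
The footer statistics can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the BLAST output. It assumes the usual conventions: each effective length is the raw length minus the effective HSP length (once for the query, once per database sequence), and a bit score is (lambda * S - ln K) / ln 2. It also assumes that the second Lambda/K pair in the footer (0.267, 0.0410) holds the gapped parameters. All function and constant names are illustrative. E-values are not recomputed here, since the report uses compositional matrix adjustment.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 600
DB_LETTERS = 206_069
DB_SEQS = 514
EFF_HSP_LEN = 77
# Assumption: the second Lambda/K pair in the footer is the gapped one.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, their product)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
)
print(eff_query, eff_db, space)  # 523 166491 87074793

# Raw scores appear in parentheses on the "Score =" lines, e.g. "26.2 bits (56)".
for raw in (56, 55, 49):
    print(raw, round(bit_score(raw), 1))
```

Under these assumptions the computed values agree with the footer (523, 166,491 and 87074793) and with the bit scores printed for raw scores 56, 55 and 49 (26.2, 25.8 and 23.5).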