BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_015269.1_cdsid_YP_004306624.1 [gene=SPC35_0141]
[protein=terminase large subunit] [protein_id=YP_004306624.1]
[location=complement(104135..105451)]
         (438 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                   Score  E
Sequences producing significant alignments:                        (bits) Value

gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: t...  907 0.0
gi|6886|lcl|protein:vir:106551 Length: 424 # NCBI annotation: pu...  91 3e-20
gi|1165|lcl|protein:vir:102941 Length: 421 # NCBI annotation: te...  71 3e-14
gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: te...  70 5e-14
gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: ph...  70 6e-14
gi|7643|lcl|protein:vir:96370 Length: 447 # NCBI annotation: ORF...  68 2e-13
gi|11588|lcl|protein:vir:78831 Length: 447 # NCBI annotation: te...  68 2e-13
gi|7301|lcl|protein:vir:96238 Length: 447 # NCBI annotation: ORF...  67 5e-13
gi|17691|lcl|protein:vir:9305 Length: 447 # NCBI annotation: lar...  67 5e-13
gi|9829|lcl|protein:vir:103950 Length: 425 # NCBI annotation: ph...  67 5e-13
gi|2779|lcl|protein:vir:99776 Length: 425 # NCBI annotation: lar...  65 2e-12
gi|5208|lcl|protein:vir:106641 Length: 446 # NCBI annotation: OR...  63 6e-12
gi|2931|lcl|protein:vir:105460 Length: 409 # NCBI annotation: pu...  62 1e-11
gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: t...  60 4e-11
gi|5321|lcl|protein:vir:99534 Length: 422 # NCBI annotation: put...  59 1e-10
gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: g...  57 4e-10
gi|15353|lcl|protein:vir:9921 Length: 425 # NCBI annotation: put...  56 8e-10
gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: g...  55 2e-09
gi|3326|lcl|protein:vir:94525 Length: 440 # NCBI annotation: put...  54 4e-09
gi|6719|lcl|protein:vir:99411 Length: 447 # NCBI annotation: hyp...  52 1e-08
gi|1596|lcl|protein:vir:93748 Length: 403 # NCBI annotation: ORF...  51 3e-08
gi|14951|lcl|protein:vir:1235 Length: 405 # NCBI annotation: sim...  51 3e-08
gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T...  51 3e-08
gi|9289|lcl|protein:vir:97165 Length: 431 # NCBI annotation: ORF...  50 4e-08
gi|8220|lcl|protein:vir:101493 Length: 588 # NCBI annotation: gp...  50 7e-08
gi|9070|lcl|protein:vir:102238 Length: 588 # NCBI annotation: gp...  50 8e-08
gi|4261|lcl|protein:vir:94806 Length: 402 # NCBI annotation: ORF...  49 1e-07
gi|9935|lcl|protein:vir:97273 Length: 402 # NCBI annotation: ORF...  49 1e-07
gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: ter...  47 4e-07
gi|18891|lcl|protein:vir:8847 Length: 482 # NCBI annotation: ter...  47 4e-07
gi|8126|lcl|protein:vir:102661 Length: 568 # NCBI annotation: te...  46 9e-07
gi|7076|lcl|protein:vir:96147 Length: 419 # NCBI annotation: ORF...  45 2e-06
gi|18781|lcl|protein:vir:7429 Length: 581 # NCBI annotation: gp6...  43 7e-06
gi|5891|lcl|protein:vir:107098 Length: 421 # NCBI annotation: pu...  43 9e-06
gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp...  42 1e-05
gi|8730|lcl|protein:vir:96840 Length: 421 # NCBI annotation: ORF...  42 1e-05
gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: ter...  42 1e-05
gi|10354|lcl|protein:vir:97448 Length: 421 # NCBI annotation: OR...  41 3e-05
gi|3182|lcl|protein:vir:94499 Length: 421 # NCBI annotation: ORF...  41 3e-05
gi|2589|lcl|protein:vir:94084 Length: 407 # NCBI annotation: ORF...  41 3e-05
gi|3523|lcl|protein:vir:105897 Length: 407 # NCBI annotation: la...  41 3e-05
gi|6284|lcl|protein:vir:95890 Length: 421 # NCBI annotation: ORF...  40 6e-05
gi|7569|lcl|protein:vir:96300 Length: 421 # NCBI annotation: ORF...  39 8e-05
gi|4983|lcl|protein:vir:95058 Length: 421 # NCBI annotation: ORF...  39 9e-05
gi|13903|lcl|protein:vir:9870 Length: 273 # NCBI annotation: put...  39 1e-04
gi|20804|lcl|protein:vir:106290 Length: 633 # NCBI annotation: g...  39 1e-04
gi|6093|lcl|protein:vir:95761 Length: 432 # NCBI annotation: ter...  39 2e-04
gi|10483|lcl|protein:vir:105297 Length: 446 # NCBI annotation: p...  38 2e-04
gi|18074|lcl|protein:vir:5954 Length: 422 # NCBI annotation: hyp...  37 4e-04
gi|360|lcl|protein:vir:344 Length: 482 # NCBI annotation: termin...  37 4e-04
gi|13692|lcl|protein:vir:4897 Length: 411 # NCBI annotation: gp4...  37 5e-04
gi|16780|lcl|protein:vir:2731 Length: 411 # NCBI annotation: hyp...  35 0.001
gi|8015|lcl|protein:vir:96495 Length: 195 # NCBI annotation: ter...  35 0.001
gi|8305|lcl|protein:vir:96700 Length: 689 # NCBI annotation: put...  35 0.002
gi|15673|lcl|protein:vir:1151 Length: 594 # NCBI annotation: pre...  33 0.005
gi|18252|lcl|protein:vir:4193 Length: 467 # NCBI annotation: put...  33 0.005
gi|13374|lcl|protein:vir:9262 Length: 517 # NCBI annotation: gp2...  32 0.015
gi|16077|lcl|protein:vir:4155 Length: 468 # NCBI annotation: lar...  32 0.017
gi|155|lcl|protein:vir:77595 Length: 499 # NCBI annotation: term...  31 0.037
gi|3734|lcl|protein:vir:94555 Length: 585 # NCBI annotation: DNA...  30 0.048
gi|10799|lcl|protein:vir:78055 Length: 440 # NCBI annotation: Te...  30 0.049
gi|5978|lcl|protein:vir:95489 Length: 659 # NCBI annotation: Put...  30 0.052
gi|11736|lcl|protein:vir:79016 Length: 417 # NCBI annotation: pu...  30 0.073
gi|15412|lcl|protein:vir:1325 Length: 519 # NCBI annotation: gp3...  29 0.12
gi|3300|lcl|protein:vir:100942 Length: 499 # NCBI annotation: Gp...  29 0.12
gi|15774|lcl|protein:vir:6239 Length: 527 # NCBI annotation: gp3...  29 0.13
gi|19539|lcl|protein:vir:10320 Length: 627 # NCBI annotation: OR...  28 0.17
gi|18586|lcl|protein:vir:8649 Length: 564 # NCBI annotation: gp7...  28 0.18
gi|7375|lcl|protein:vir:99083 Length: 566 # NCBI annotation: gp7...  27 0.36
gi|2819|lcl|protein:vir:105705 Length: 439 # NCBI annotation: gp...  27 0.44
gi|7834|lcl|protein:vir:102402 Length: 807 # NCBI annotation: gp...  27 0.47
gi|10934|lcl|protein:vir:78195 Length: 601 # NCBI annotation: gp...  27 0.70
gi|9890|lcl|protein:vir:103970 Length: 601 # NCBI annotation: ph...  27 0.70
gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: te...  26 0.97
gi|10061|lcl|protein:vir:99071 Length: 537 # NCBI annotation: gp...  25 2.1
gi|17621|lcl|protein:vir:816 Length: 568 # NCBI annotation: hypo...  24 2.8
gi|25680|lcl|protein:vir:10116 Length: 568 # NCBI annotation: hy...  24 2.8
gi|51|lcl|protein:vir:3295 Length: 568 # NCBI annotation: putati...  24 2.8
gi|26240|lcl|protein:vir:2763 Length: 568 # NCBI annotation: hyp...  24 2.8
gi|25849|lcl|protein:vir:9949 Length: 568 # NCBI annotation: hyp...  24 2.8
gi|1476|lcl|protein:vir:104436 Length: 568 # NCBI annotation: pu...  24 2.8
gi|10658|lcl|protein:vir:268 Length: 605 # NCBI annotation: puta...  24 2.9
gi|1219|lcl|protein:vir:105519 Length: 475 # NCBI annotation: ph...  24 4.3
gi|5072|lcl|protein:vir:95130 Length: 531 # NCBI annotation: hyp...  23 8.2
gi|17978|lcl|protein:vir:4335 Length: 563 # NCBI annotation: ter...  23 8.6
gi|11926|lcl|protein:vir:79177 Length: 589 # NCBI annotation: gp...
23 9.3 >gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: terminase, large subunit # Family: family:all:147 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006983;genbank:gi:46401884;genbank:GeneID :2777679 Length = 438 Score = 907 bits (2345), Expect = 0.0, Method: Compositional matrix adjust. Identities = 437/438 (99%), Positives = 438/438 (100%) Query: 1 MEVSRPYVNTVDVIDFGIDKRFFRLPVSGILAQEGITPNGPQIAIINALEDPRHRFVTAC 60 MEVSRPYVNTVDVIDFGIDKRFFRLPVSGILAQEGITPNGPQIAIINALEDPRHRFVTAC Sbjct: 1 MEVSRPYVNTVDVIDFGIDKRFFRLPVSGILAQEGITPNGPQIAIINALEDPRHRFVTAC 60 Query: 61 VSRRVGKSFIAYTLGFLKLLEPNVKVLVVAPNYSLANIGWSQIRGLIKKYGLQTERENAK 120 VSRRVGKSFIAYTLGFLKLLEPNVKVLVVAPNYSLANIGWSQIRGLIKKYGLQTERENAK Sbjct: 61 VSRRVGKSFIAYTLGFLKLLEPNVKVLVVAPNYSLANIGWSQIRGLIKKYGLQTERENAK 120 Query: 121 DKEIELANGSLFKLASAAQADSAVGRSYDFIIFDEAAISDVGGDAFRVQLRPTLDKPNSK 180 DKEIELANGSLFKLASAAQADSAVGRSYDFIIFDEAAISDVGGDAFRVQLRPTLDKPNSK Sbjct: 121 DKEIELANGSLFKLASAAQADSAVGRSYDFIIFDEAAISDVGGDAFRVQLRPTLDKPNSK 180 Query: 181 ALFISTPRGGNWFKEFYAYGFDDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQ 240 ALFISTPRGGNWFKEFYAYGFDDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQ Sbjct: 181 ALFISTPRGGNWFKEFYAYGFDDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQ 240 Query: 241 EYEADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLLGIDVGYRDPTAVLTIKY 300 EYEADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLLGIDVGYRDPTAVLTIKY Sbjct: 241 EYEADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLLGIDVGYRDPTAVLTIKY 300 Query: 301 HYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQDLAYEHEIAS 360 HYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDR+FVDSAAAQFRQDLAYEHEIAS Sbjct: 301 HYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRIFVDSAAAQFRQDLAYEHEIAS 360 Query: 361 APAKKSVLDGLACLQALFQQGKIIVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDAN 420 APAKKSVLDGLACLQALFQQGKIIVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDAN Sbjct: 361 APAKKSVLDGLACLQALFQQGKIIVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDAN 420 Query: 421 SHLCDALRYGIYSISRGK 438 SHLCDALRYGIYSISRGK Sbjct: 421 SHLCDALRYGIYSISRGK 438 >gi|6886|lcl|protein:vir:106551 Length: 424 # NCBI 
annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958579;genbank:gi:41179257;genbank:GeneID :2717087 Length = 424 Score = 90.9 bits (224), Expect = 3e-20, Method: Compositional matrix adjust. Identities = 83/310 (26%), Positives = 132/310 (42%), Gaps = 24/310 (7%) Query: 131 LFKLASAAQADSAVGRSYDFIIFDEAAISDVGGDAFRVQLRPTLDKPNSKALFISTPRGG 190 +F A D G + FDE A+ +F Q SK F P G Sbjct: 115 IFGGKDEASQDLVQGITLAGFFFDEVALMP---QSFVNQATARCSVTGSKMWFNCNPSGP 171 Query: 191 -NWFKEFYAYGFDDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQEYEADFSVF 249 +WFK + D + IH T DNP D I R S ++++ + + + Sbjct: 172 FHWFKLNWIDQMKDKRA--LRIHFTMHDNPSLDSVTINRYERMYSGVFYQRYIQGLWVMS 229 Query: 250 EGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLLGIDVGYRDPTAVLTIKYHYDTDTYYV 309 EG I+D F+ V + + HF K + D G +PTA L + + +Y+ Sbjct: 230 EGVIYDNFDKDTMVVN-ELPNHFEK------YYVSCDYGTLNPTAFLL--WGRNHGVWYL 280 Query: 310 LEEYQQAEKTTAQHAAYIQHCIDRYKV-----DRVFVDSAAAQFRQDLAYEHEIASAPAK 364 ++EY + +TT++ ++C D + + +D +AA F L ++ AK Sbjct: 281 VKEYYYSGRTTSRQKTDEEYCHDLKEFLGDIRAEMIIDPSAASFSTTLR-QNGFKVRKAK 339 Query: 365 KSVLDGLACLQALFQQGKIIVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHLC 424 VLDG+ Q +GKI +C +L L +Y WD + E + +HD H C Sbjct: 340 NDVLDGIRVTQTAMNEGKIKFSMNCPNLFKELASYVWDDKAAEHGEDKPVKQHD---HAC 396 Query: 425 DALRYGIYSI 434 DA+RY +Y+I Sbjct: 397 DAMRYFVYTI 406 >gi|1165|lcl|protein:vir:102941 Length: 421 # NCBI annotation: terminase, large subunit # Family: family:all:54 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945278;genbank:gi:39653713;goa:Q708N4;int erpro:IPR004921;interpro:IPR006437;uniprot:Q708N4;genban k:GeneID:2672855 Length = 421 Score = 70.9 bits (172), Expect = 3e-14, Method: Compositional matrix adjust. 
Identities = 85/328 (25%), Positives = 137/328 (41%), Gaps = 31/328 (9%) Query: 121 DKEIELANGS------LFKLASAAQADSAVGRSYDFIIFDEAAISDVGGDAFRVQLRPTL 174 D IE+ G +F + D G + I FDE A+ ++F Q Sbjct: 105 DNLIEITKGDVSNDFYIFGGKDESSQDLIQGLTLAGIFFDEVALMP---ESFVNQGTGRC 161 Query: 175 DKPNSKALFISTPRGG-NWFKEFYAYGFDDTLPNWVSIHGTYRDNPRADLNDIEEARRTV 233 SK F P G +WFK + + N + +H DN N I++ R+ Sbjct: 162 SVTGSKWWFNCNPDGPYHWFKVNWIDKAET--KNMLYLHFDMDDNLSLSEN-IKKRYRSQ 218 Query: 234 SKNYFRQEY-EADFSVFEGQIFDTFNAIDHV-KDLKGMRHFFKDDEAFETLLGIDVGYRD 291 + F Q Y + ++V EG ++D F+ HV L M K + +D G ++ Sbjct: 219 YQGVFYQRYIQGLWTVAEGIVYDMFSKDKHVVSTLPEMSKLGK-------YVSVDYGTQN 271 Query: 292 PTAVL-----TIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAAA 346 T L I +Y T YY + +KT A++A + + +DR+ +D +AA Sbjct: 272 ATVFLLWEKDIIGKYYLTREYYYSGRDENVQKTNAEYADDLTAWLGDTNIDRIIIDPSAA 331 Query: 347 QFRQDLAYEHEIASAPAKKSVLDGLACLQALFQQGKIIVDASCSSLIHALQNYKWDFQEG 406 F +L + A+ +VL+G+ + ++ Q KI V SC + + Y WD E Sbjct: 332 SFIAELK-KRGYKIKKARNNVLEGIRFVGSMLGQEKIAVHESCVNTLKEFHAYVWD--EK 388 Query: 407 EEKLSREKPRHDANSHLCDALRYGIYSI 434 +KP + H DALRY Y++ Sbjct: 389 ASANGEDKPIKQFD-HAMDALRYFCYTV 415 >gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: terminase large subunit # Family: family:all:147 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467938;genbank:gi:157265379;genbank:Ge neID:5600506 Length = 485 Score = 70.1 bits (170), Expect = 5e-14, Method: Compositional matrix adjust. 
Identities = 101/442 (22%), Positives = 176/442 (39%), Gaps = 59/442 (13%) Query: 35 GITPNGPQIAIINALEDPRHRFVTACVSRRVGKSFIAYTLGFLKLL-EPNVKVLVVAPNY 93 G P+ Q+AI + R AC+ R+ GKS A +L P + ++AP Y Sbjct: 15 GYKPHHVQLAIHRSTAKRR----VACLGRQSGKSEAASVEAVFELFARPGSQGWIIAPTY 70 Query: 94 SLANIGWSQIRGLIKKYG-------------LQTERENAKDKEIELANG-----SLFKLA 135 A I + ++ +++ + D+ + S F+ Sbjct: 71 DQAEIIFGRVVEKVERLAEVFPATEVQLQRRRLRLLVHHYDRPVNAPGAKRVATSEFRGK 130 Query: 136 SAAQADSAVGRSYDFIIFDEAAISDVGGDAFRVQLRPTLDKPNSKALFISTPRGGNWFKE 195 SA + D+ G + DF+I DEAA+ + + + PTL + AL ISTP+G NWF E Sbjct: 131 SADRPDNLRGATLDFVILDEAAM--IPFSVWSEAIEPTLSVRDGWALIISTPKGLNWFYE 188 Query: 196 FYAYGF-------------DDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQEY 242 F+ G+ + T P++ S H D E R + FRQEY Sbjct: 189 FFLMGWRGGLKEGIPNSGVNQTHPDFESFHAASWDVWPERREWYMERRLYIPDLEFRQEY 248 Query: 243 EADFSVFEGQIFDTFNAIDHVK-DLKGMRHFFKD---DEAFETLLGIDVGYRDPTAVLTI 298 A+F +F + + + + +G R +D D + +G D G +V ++ Sbjct: 249 GAEFVSHSNSVFSGLDMLILLPYERRGTRLVVEDYRPDHIY--CIGADFGKNQDYSVFSV 306 Query: 299 KYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDS-------AAAQFRQD 351 DT LE A T + A ++ + Y V D+ A Q Sbjct: 307 -LDLDTGAIVCLERMNGA--TWSDQVARLKALSEDYGHAYVVADTWGVGDAIAEELDAQG 363 Query: 352 LAYEH-EIASAPAKKSVLDGLACLQALFQQGKIIVDASCSSLIHALQNYKWDFQEGEEKL 410 + Y + S+ K+ ++ LA L ++G++ V + +++ L+N+++ ++ Sbjct: 364 INYTPLPVKSSSVKEQLISNLAL---LMEKGQVAV-PNDKTILDELRNFRYYRTASGNQV 419 Query: 411 SREKPRHDANSHLCDALRYGIY 432 R R + + AL Y Y Sbjct: 420 MRAYGRGHDDIVMSLALAYSQY 441 >gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: phage terminase large subunit # Family: family:all:147 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468054;genbank:gi:157265496;genbank:Ge neID:5600564 Length = 485 Score = 69.7 bits (169), Expect = 6e-14, Method: Compositional matrix adjust. 
Identities = 101/442 (22%), Positives = 176/442 (39%), Gaps = 59/442 (13%) Query: 35 GITPNGPQIAIINALEDPRHRFVTACVSRRVGKSFIAYTLGFLKLL-EPNVKVLVVAPNY 93 G P+ Q+AI + R AC+ R+ GKS A +L P + ++AP Y Sbjct: 15 GYKPHHVQLAIHRSTAKRR----VACLGRQSGKSEAASVEAVFELFARPGSQGWIIAPTY 70 Query: 94 SLANIGWSQIRGLIKKYG-------------LQTERENAKDKEIELANG-----SLFKLA 135 A I + ++ +++ + D+ + S F+ Sbjct: 71 DQAEIIFGRVVEKVERLAEVFPATEVQLQRRRLRLLVHHYDRPVNAPGAKRVATSEFRGK 130 Query: 136 SAAQADSAVGRSYDFIIFDEAAISDVGGDAFRVQLRPTLDKPNSKALFISTPRGGNWFKE 195 SA + D+ G + DF+I DEAA+ + + + PTL + AL ISTP+G NWF E Sbjct: 131 SADRPDNLRGATLDFVILDEAAM--IPFSVWSEAIEPTLSVRDGWALIISTPKGLNWFYE 188 Query: 196 FYAYGF-------------DDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQEY 242 F+ G+ + T P++ S H D E R + FRQEY Sbjct: 189 FFLMGWRGGLKEGIPNSGINQTHPDFESFHAASWDVWPERREWYMERRLYIPDLEFRQEY 248 Query: 243 EADFSVFEGQIFDTFNAIDHVK-DLKGMRHFFKD---DEAFETLLGIDVGYRDPTAVLTI 298 A+F +F + + + + +G R +D D + +G D G +V ++ Sbjct: 249 GAEFVSHSNSVFSGLDMLILLPYERRGTRLVVEDYRPDHIY--CIGADFGKNQDYSVFSV 306 Query: 299 KYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDS-------AAAQFRQD 351 DT LE A T + A ++ + Y V D+ A Q Sbjct: 307 -LDLDTGAIVCLERMNGA--TWSDQVARLKALSEDYGHAYVVADTWGVGDAIAEELDAQG 363 Query: 352 LAYEH-EIASAPAKKSVLDGLACLQALFQQGKIIVDASCSSLIHALQNYKWDFQEGEEKL 410 + Y + S+ K+ ++ LA L ++G++ V + +++ L+N+++ ++ Sbjct: 364 INYTPLPVKSSSVKEQLISNLAL---LMEKGQVAV-PNDKTILDELRNFRYYRTASGNQV 419 Query: 411 SREKPRHDANSHLCDALRYGIY 432 R R + + AL Y Y Sbjct: 420 MRAYGRGHDDIVMSLALAYSQY 441 >gi|7643|lcl|protein:vir:96370 Length: 447 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239642;genbank:gi:66395379;genbank:GeneID :5132846 Length = 447 Score = 68.2 bits (165), Expect = 2e-13, Method: Compositional matrix adjust. 
Identities = 83/329 (25%), Positives = 141/329 (42%), Gaps = 40/329 (12%) Query: 118 NAKDKEIELANGSLF---KLASAAQADSAVGRSYDFIIFDEAA---ISDVGGDAFRVQLR 171 N D ++EL NG++F L + + S G S I+ +EA+ ++D R++ R Sbjct: 119 NKTDNKVELPNGAVFLFKGLDNPEKIKSIKGISD--IVMEEASEFTLNDYTQLTLRLRER 176 Query: 172 PTLDKPNSKALFISTPRGG-NW-FKEFYAYGFDDTLPNWVSIHGTYRDNPRADLNDIEEA 229 ++K + + P NW +K F+ +G + + N + +YRDN D + Sbjct: 177 KHVNK---QIFLMFNPVSKLNWVYKYFFEHG--EPMENVMIRQSSYRDNKFLDEMTRQNL 231 Query: 230 RRTVSKN--YFRQEYEADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLLGIDV 287 ++N Y++ +FS + +F + KD +RH + G+D Sbjct: 232 ELLANRNPAYYKIYALGEFSTLDKLVFPKYEKRLINKD--ELRHL-------PSYFGLDF 282 Query: 288 GY-RDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAA- 345 GY DP+A + K Y++EEY + + A I+ Y + + DSA Sbjct: 283 GYVNDPSAFIHSKIDVKKKKLYIIEEYVKQGMLNDEIANVIKQL--GYAKEEITADSAEQ 340 Query: 346 ---AQFRQDLAYEHEIASAPAKKSVLDGLACLQALFQQGKIIVDASCSSLIHALQNYKWD 402 A+ R +L + + + K SV+ GL L Q +IIVD C I NY W Sbjct: 341 KSIAELR-NLGLKRILPTKKGKGSVVQGLQFLM----QFEIIVDERCFKTIEEFDNYTWQ 395 Query: 403 FQEGEEKLSREKPRHDANSHLCDALRYGI 431 + + + E D +H D+LRY + Sbjct: 396 KDKDTGEYTNEPV--DTYNHCIDSLRYSV 422 >gi|11588|lcl|protein:vir:78831 Length: 447 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285355;genbank:gi:148717883;genbank:Ge neID:5246962 Length = 447 Score = 68.2 bits (165), Expect = 2e-13, Method: Compositional matrix adjust. 
Identities = 83/329 (25%), Positives = 141/329 (42%), Gaps = 40/329 (12%) Query: 118 NAKDKEIELANGSLF---KLASAAQADSAVGRSYDFIIFDEAA---ISDVGGDAFRVQLR 171 N D ++EL NG++F L + + S G S I+ +EA+ ++D R++ R Sbjct: 119 NKTDNKVELPNGAVFLFKGLDNPEKIKSIKGISD--IVMEEASEFTLNDYTQLTLRLRER 176 Query: 172 PTLDKPNSKALFISTPRGG-NW-FKEFYAYGFDDTLPNWVSIHGTYRDNPRADLNDIEEA 229 ++K + + P NW +K F+ +G + + N + +YRDN D + Sbjct: 177 KHVNK---QIFLMFNPVSKLNWVYKYFFEHG--EPMENVMIRQSSYRDNKFLDEMTRQNL 231 Query: 230 RRTVSKN--YFRQEYEADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLLGIDV 287 ++N Y++ +FS + +F + KD +RH + G+D Sbjct: 232 ELLANRNPAYYKIYALGEFSTLDKLVFPKYEKRLINKD--ELRHL-------PSYFGLDF 282 Query: 288 GY-RDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAA- 345 GY DP+A + K Y++EEY + + A I+ Y + + DSA Sbjct: 283 GYVNDPSAFIHSKIDVKKKKLYIIEEYVKQGMLNDEIANVIKQL--GYAKEEITADSAEQ 340 Query: 346 ---AQFRQDLAYEHEIASAPAKKSVLDGLACLQALFQQGKIIVDASCSSLIHALQNYKWD 402 A+ R +L + + + K SV+ GL L Q +IIVD C I NY W Sbjct: 341 KSIAELR-NLGLKRILPTKKGKGSVVQGLQFLM----QFEIIVDERCFKTIEEFDNYTWQ 395 Query: 403 FQEGEEKLSREKPRHDANSHLCDALRYGI 431 + + + E D +H D+LRY + Sbjct: 396 KDKDTGEYTNEPV--DTYNHCIDSLRYSV 422 >gi|7301|lcl|protein:vir:96238 Length: 447 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239566;genbank:gi:66395301;genbank:GeneID :5132787 Length = 447 Score = 67.0 bits (162), Expect = 5e-13, Method: Compositional matrix adjust. 
Identities = 82/328 (25%), Positives = 140/328 (42%), Gaps = 38/328 (11%) Query: 118 NAKDKEIELANGSLF---KLASAAQADSAVGRSYDFIIFDEAA---ISDVGGDAFRVQLR 171 N D ++EL NG++F L + + S G S I+ +EA+ ++D R++ R Sbjct: 119 NKTDNKVELPNGAVFLFKGLDNPEKIKSIKGISD--IVMEEASEFTLNDYTQLTLRLRER 176 Query: 172 PTLDKPNSKALFISTPRGGNW-FKEFYAYGFDDTLPNWVSIHGTYRDNPRADLNDIEEAR 230 ++K L + NW +K F+ +G + + N + +YRDN D + Sbjct: 177 KHVNK--QIFLMFNPVSKLNWVYKYFFEHG--EPMENVMIRQSSYRDNKFLDEMTRQNLE 232 Query: 231 RTVSKN--YFRQEYEADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLLGIDVG 288 ++N Y++ +F+ + +F + KD +RH + G+D G Sbjct: 233 LLANRNPAYYKIYALGEFATLDKLVFPKYEKRLINKD--ELRHL-------PSYFGLDFG 283 Query: 289 Y-RDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAA-- 345 Y DP+A + K Y++EEY + + A I+ Y + + DSA Sbjct: 284 YVNDPSAFIHSKIDVKKKKLYIIEEYVKQGMLNDEIANVIKQL--GYAREEITADSAEQK 341 Query: 346 --AQFRQDLAYEHEIASAPAKKSVLDGLACLQALFQQGKIIVDASCSSLIHALQNYKWDF 403 A+ R +L + + + K SV+ GL L Q +IIVD C I NY W Sbjct: 342 SIAELR-NLGLKRILPTKKGKGSVVQGLQFLM----QFEIIVDERCFKTIEEFDNYTWQK 396 Query: 404 QEGEEKLSREKPRHDANSHLCDALRYGI 431 + + + E D +H D+LRY + Sbjct: 397 DKDTGEYTNEPV--DTYNHCIDSLRYSV 422 >gi|17691|lcl|protein:vir:9305 Length: 447 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803283;genbank:gi:29028593;genbank:GeneID :1258039 Length = 447 Score = 67.0 bits (162), Expect = 5e-13, Method: Compositional matrix adjust. 
Identities = 82/328 (25%), Positives = 140/328 (42%), Gaps = 38/328 (11%) Query: 118 NAKDKEIELANGSLF---KLASAAQADSAVGRSYDFIIFDEAA---ISDVGGDAFRVQLR 171 N D ++EL NG++F L + + S G S I+ +EA+ ++D R++ R Sbjct: 119 NKTDNKVELPNGAVFLFKGLDNPEKIKSIKGISD--IVMEEASEFTLNDYTQLTLRLRER 176 Query: 172 PTLDKPNSKALFISTPRGGNW-FKEFYAYGFDDTLPNWVSIHGTYRDNPRADLNDIEEAR 230 ++K L + NW +K F+ +G + + N + +YRDN D + Sbjct: 177 KHVNK--QIFLMFNPVSKLNWVYKYFFEHG--EPMENVMIRQSSYRDNKFLDEMTRQNLE 232 Query: 231 RTVSKN--YFRQEYEADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLLGIDVG 288 ++N Y++ +F+ + +F + KD +RH + G+D G Sbjct: 233 LLANRNPAYYKIYALGEFATLDKLVFPKYEKRLINKD--ELRHL-------PSYFGLDFG 283 Query: 289 Y-RDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAA-- 345 Y DP+A + K Y++EEY + + A I+ Y + + DSA Sbjct: 284 YVNDPSAFIHSKIDVKKKKLYIIEEYVKQGMLNDEIANVIKQL--GYAKEEITADSAEQK 341 Query: 346 --AQFRQDLAYEHEIASAPAKKSVLDGLACLQALFQQGKIIVDASCSSLIHALQNYKWDF 403 A+ R +L + + + K SV+ GL L Q +IIVD C I NY W Sbjct: 342 SIAELR-NLGLKRILPTKKGKGSVVQGLQFLM----QFEIIVDERCFKTIEEFDNYTWQK 396 Query: 404 QEGEEKLSREKPRHDANSHLCDALRYGI 431 + + + E D +H D+LRY + Sbjct: 397 DKDTGEYTNEPV--DTYNHCIDSLRYSV 422 >gi|9829|lcl|protein:vir:103950 Length: 425 # NCBI annotation: phage terminase large subunit # Family: family:all:54 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873987;genbank:gi:118430762;genbank:GeneI D:4525444 Length = 425 Score = 66.6 bits (161), Expect = 5e-13, Method: Compositional matrix adjust. 
Identities = 82/328 (25%), Positives = 140/328 (42%), Gaps = 38/328 (11%) Query: 118 NAKDKEIELANGSLF---KLASAAQADSAVGRSYDFIIFDEAA---ISDVGGDAFRVQLR 171 N D ++EL NG++F L + + S G S I+ +EA+ ++D R++ R Sbjct: 97 NKTDNKVELPNGAVFLFKGLDNPEKIKSIKGISD--IVMEEASEFTLNDYTQLTLRLRER 154 Query: 172 PTLDKPNSKALFISTPRGGNW-FKEFYAYGFDDTLPNWVSIHGTYRDNPRADLNDIEEAR 230 ++K L + NW +K F+ +G + + N + +YRDN D + Sbjct: 155 KHVNK--QIFLMFNPVSKLNWVYKYFFEHG--EPMENVMIRQSSYRDNKFLDEMTRQNLE 210 Query: 231 RTVSKN--YFRQEYEADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLLGIDVG 288 ++N Y++ +F+ + +F + KD +RH + G+D G Sbjct: 211 LLANRNPAYYKIYALGEFATLDKLVFPKYEKRLINKD--ELRHL-------PSYFGLDFG 261 Query: 289 Y-RDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAA-- 345 Y DP+A + K Y++EEY + + A I+ Y + + DSA Sbjct: 262 YVNDPSAFIHSKIDVKKKKLYIIEEYVKQGMLNDEIANVIKQL--GYAKEEITADSAEQK 319 Query: 346 --AQFRQDLAYEHEIASAPAKKSVLDGLACLQALFQQGKIIVDASCSSLIHALQNYKWDF 403 A+ R +L + + + K SV+ GL L Q +IIVD C I NY W Sbjct: 320 SIAELR-NLGLKRILPTKKGKGSVVQGLQFLM----QFEIIVDERCFKTIEEFDNYTWQK 374 Query: 404 QEGEEKLSREKPRHDANSHLCDALRYGI 431 + + + E D +H D+LRY + Sbjct: 375 DKDTGEYTNEPV--DTYNHCIDSLRYSV 400 >gi|2779|lcl|protein:vir:99776 Length: 425 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004302;genbank:gi:122891756;genbank:Ge neID:4712331 Length = 425 Score = 65.1 bits (157), Expect = 2e-12, Method: Compositional matrix adjust. 
Identities = 82/329 (24%), Positives = 140/329 (42%), Gaps = 40/329 (12%) Query: 118 NAKDKEIELANGSLF---KLASAAQADSAVGRSYDFIIFDEAA---ISDVGGDAFRVQLR 171 N D ++ L NG++F L + + S G S I+ +EA+ ++D R++ R Sbjct: 97 NKTDNKVGLPNGAVFLFKGLDNPEKIKSIKGISD--IVMEEASEFTLNDYTQLTLRLRER 154 Query: 172 PTLDKPNSKALFISTPRGG-NW-FKEFYAYGFDDTLPNWVSIHGTYRDNPRADLNDIEEA 229 ++K + I P NW +K F+ +G + + N + +YRDN D + Sbjct: 155 KHVNK---QIFLIFNPVSKLNWVYKYFFEHG--EPMENVMIRQSSYRDNKFLDEMTRQNL 209 Query: 230 RRTVSKN--YFRQEYEADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLLGIDV 287 ++N Y++ +F+ + +F + KD +RH + G+D Sbjct: 210 ELLANRNPAYYKIYALGEFATLDKLVFPKYEKRLINKD--ELRHL-------PSYFGLDF 260 Query: 288 GY-RDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAA- 345 GY DP+A + K Y++EEY + + A I+ Y + + DSA Sbjct: 261 GYVNDPSAFIHSKIDVKKKKLYIIEEYVKQGMLNDEIANVIKQL--GYAKEEITADSAEQ 318 Query: 346 ---AQFRQDLAYEHEIASAPAKKSVLDGLACLQALFQQGKIIVDASCSSLIHALQNYKWD 402 A+ R +L + + + K SV+ GL L Q +IIVD C I NY W Sbjct: 319 KSIAELR-NLGLKRILPTKKGKGSVVQGLQFLM----QFEIIVDERCFKTIEEFDNYTWQ 373 Query: 403 FQEGEEKLSREKPRHDANSHLCDALRYGI 431 + + + E D +H D+LRY + Sbjct: 374 KDKDTGEYTNEPV--DTYNHCIDSLRYSV 400 >gi|5208|lcl|protein:vir:106641 Length: 446 # NCBI annotation: ORF005 # Family: family:all:54 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239489;genbank:gi:66395220;genbank:GeneID :4555795 Length = 446 Score = 63.2 bits (152), Expect = 6e-12, Method: Compositional matrix adjust. 
Identities = 76/327 (23%), Positives = 133/327 (40%), Gaps = 36/327 (11%) Query: 118 NAKDKEIELANGSLF---KLASAAQADSAVGRSYDFIIFDEAA---ISDVGGDAFRVQLR 171 N D ++EL NG++F L + + S G S I+ +EA+ ++D R++ R Sbjct: 119 NKTDNKVELPNGAVFLFKGLDNPEKIKSIKGISD--IVMEEASEFTLNDYTQLTLRLRER 176 Query: 172 PTLDKPNSKALFISTPRGG-NW-FKEFYAYGFDDTLPNWVSIHGTYRDNPRADLNDIEEA 229 ++K + + P NW +K F+ +G + + N + +YRDN D + Sbjct: 177 KHMNK---QIFLMFNPVSKLNWVYKYFFEHG--EPMENVMIRQSSYRDNKFLDEMTRQNL 231 Query: 230 RRTVSKN--YFRQEYEADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEA--FETLLGI 285 ++N Y++ +F+ + +F + + D E + G+ Sbjct: 232 ELLANRNPAYYKIYALGEFATLDKLVFPKYE-----------KRIISDKEVGHLPSYFGL 280 Query: 286 DVGY-RDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSA 344 D GY DP+A + +K D YV+ EY + + A I Y +++ DSA Sbjct: 281 DFGYVNDPSAFIHVKIDNDNKKLYVISEYVKKGMLNNEIAQVINDL--GYSKEKITADSA 338 Query: 345 AAQFRQDLAYEHEIASAPAKKSVLDGLACLQALFQQGKIIVDASCSSLIHALQNYKWDFQ 404 + ++ PA K +A +Q Q I++D C I NY W Sbjct: 339 EQKSIMEIKTNGIDRIVPAMKGKDSVMAGIQ-FVSQFDIVIDERCYKTIEEFDNYTWKKD 397 Query: 405 EGEEKLSREKPRHDANSHLCDALRYGI 431 + + E D +H DALRY + Sbjct: 398 KNTGEYYNEPV--DTYNHCIDALRYAV 422 >gi|2931|lcl|protein:vir:105460 Length: 409 # NCBI annotation: putative phage terminase large subunit B # Family: family:all:54 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529870;genbank:gi:90592610;genbank:GeneID :3974524 Length = 409 Score = 62.0 bits (149), Expect = 1e-11, Method: Compositional matrix adjust. 
Identities = 46/158 (29%), Positives = 73/158 (46%), Gaps = 6/158 (3%) Query: 277 EAFETLLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKV 336 + + +G+D GY P ++ + D +T YVLE+Y Q K Q+ R+ Sbjct: 246 DGLDYYVGVDWGYEHPNPIILLGDDKDGNT-YVLEDYTQKHKFINYWVKVAQNLQTRFGR 304 Query: 337 DRVFVDSAAAQFRQDLAYEHEIASAPAKKSVLDGLACLQALFQQGKI-IVDASCSSLIHA 395 + +F +A + + + A K+VL G+ C+ ++GK +VD + S L+ Sbjct: 305 NLIFYADSARPDNVNEFQSNGLNCINANKNVLPGIECVARKMREGKFYVVDTASSGLLDE 364 Query: 396 LQNYKWDFQEGEEKLSREKPRHDANSHLCDALRYGIYS 433 + Y WD G L RH N L DA+RY IYS Sbjct: 365 IYQYAWDESTGLP-LKENDVRH--NDRL-DAIRYAIYS 398 >gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: terminase large subunit gp17 # Family: family:all:147 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195134;genbank:gi:58532911;uniprot:Q5GQN8 ;genbank:GeneID:3260486 Length = 548 Score = 60.5 bits (145), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 75/262 (28%), Positives = 115/262 (43%), Gaps = 46/262 (17%) Query: 53 RHRFVTACVSRRVGKS--FIAYTLGFLKLLEPNVKVLVVAPNYSLANIGWSQIRGLIKKY 110 ++RF A + R+ GKS ++Y L +L + NV + ++A S A R L+ + Sbjct: 72 KNRFNIAKLPRQTGKSTTVVSYLLHYL-IFNDNVNIGILANKASTA-------RDLLAR- 122 Query: 111 GLQTERENA-----------KDKEIELANGSLFKLASAAQADSAVGRSYDFIIFDEAA-I 158 L T EN IEL NGS LA++ A + G S++ I DE A + Sbjct: 123 -LATAYENLPKWIQQGVVVWNKGNIELENGSKI-LAASTSASAVRGMSFNIIFLDEFAFV 180 Query: 159 SDVGGDAFRVQLRPTLDKPNS-KALFISTPRGGNWFKEFYAYGFDDTLP-NWVSIHGTYR 216 + D+F + PT+ S K + ISTP+G N FY D T N + H + Sbjct: 181 PNHIADSFFASVYPTITSGKSTKVIIISTPQGMN---HFYKMWVDATNGRNGYTFHEVHW 237 Query: 217 DN-PRADLNDIEEARRTVSKNYFRQEYEADF----------SVFEGQIFDTFNAIDHVKD 265 P D EE + S+ F QE+E +F S + +F+ D +K Sbjct: 238 SQVPGRDEKWKEETIKNTSERQFTQEFECEFLGSVDTLIAASKLKALVFN-----DPIKR 292 Query: 266 LKGMRHFFKDDEAFETLLGIDV 287 KG+ + + E E L+ +DV Sbjct: 293 NKGLDIYEEPKEKSEYLMTVDV 314 >gi|5321|lcl|protein:vir:99534 Length: 422 # NCBI annotation: putative protein # Family: family:all:54 # MgeID: mge:1559 # 
MgeName: Lj928 # Cross-refs: genbank:acc:NP_958532;genbank:gi:41179314;genbank:GeneID :2717172 Length = 422 Score = 58.5 bits (140), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 81/331 (24%), Positives = 139/331 (41%), Gaps = 28/331 (8%) Query: 112 LQTERENAKDKEIELANGSLF---KLASAAQADSAVGRSYDFIIFDEAAISDVGGDAFRV 168 LQ N DK I L NG++F + + S G S D ++ + + + + Sbjct: 91 LQYCHVNRSDKTIVLPNGAIFLFQGMDDPEKIKSIKGLS-DVVMEEASEFNHNDYTQLTL 149 Query: 169 QLRPTLDKPNSKALFISTPRGGNWFKEFYAYGFDDTLPNWVSIH-GTYRDNPRADLNDIE 227 +LR K + NW + + D + V+IH TY+DN D ++I Sbjct: 150 RLREPKHKQRQIFCMFNPVSKLNWTYQTWFDPSADYDRSRVAIHQSTYKDNRFLDEDNIR 209 Query: 228 --EARRTVSKNYFRQEYEADFSVFEGQIFDTFNAID-HVKDLKGMRHFFKDDEAFETLLG 284 E + + Y++ +F+ + +F F + +D K + A G Sbjct: 210 TIEELKNTNPAYYKIYTLGEFATLDKLVFPYFETKRLNPRDPKLL--------ALNDYFG 261 Query: 285 IDVGY-RDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDS 343 +D G+ DP+A + IK T YV++E+ + Q A I+ Y + + DS Sbjct: 262 LDYGFINDPSAFMHIKLDMRNKTLYVMDEFVKKGLLNNQLAQVIKDM--GYSKEVITADS 319 Query: 344 AAAQFRQDLAYEHEIASAPAKK---SVLDGLACLQALFQQGKIIVDASCSSLIHALQNYK 400 A + ++ + PA K S++ G+ LQ Q K +VD C I LQNY Sbjct: 320 AEKKSIAEMKRDGIYRIRPALKGPDSIIQGIQFLQ----QFKWVVDDRCVKTIEELQNYT 375 Query: 401 WDFQEGEEKLSREKPRHDANSHLCDALRYGI 431 + + ++ + +P DA +H DA+RY + Sbjct: 376 YVKDKKTDEYT-NRPI-DAYNHCIDAIRYAV 404 >gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214662;genbank:gi:61806303;genbank:GeneID :3294591 Length = 550 Score = 57.0 bits (136), Expect = 4e-10, Method: Compositional matrix adjust. 
Identities = 66/236 (27%), Positives = 103/236 (43%), Gaps = 18/236 (7%) Query: 22 FFRLPVSGILAQEGITPNGPQIAIINALEDP-----RHRFVTACVSRRVGKSFI--AYTL 74 F R + + EG+ P + N ED +HRF A + R+ GKS I AY L Sbjct: 41 FIRKYIRIVSLDEGVIP----FDMYNFQEDMVTKFHQHRFNIAKLPRQSGKSTIVTAYLL 96 Query: 75 GFLKLLEPNVKVLVVAPNYSLAN--IGWSQIRGLIKKYGLQTERENAKDKEIELANGSLF 132 ++ L NV V ++A A +G Q+ +Q +EL NGS Sbjct: 97 WYV-LFNANVNVAILANKAPTAREMLGRLQLSYENLPKWMQQGILGWNKGSLELENGSKI 155 Query: 133 KLASAAQADSAVGRSYDFIIFDE-AAISDVGGDAFRVQLRPTLDKPNS-KALFISTPRGG 190 LAS+ A + G S++ I DE A + + + F + PT+ S K + ISTP G Sbjct: 156 -LASSTSASAVRGMSFNIIFLDEFAFVPNHIAEQFFASVYPTISSGKSTKVIIISTPHGM 214 Query: 191 NWFKEFYAYGFDDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQEYEADF 246 N F + + + + N+V+ + P D ++ S+ FR E+E +F Sbjct: 215 NQFYKLW-HDAERGANNYVATEVHWSQVPGRDDKWKQQTIENTSEAQFRVEFECEF 269 >gi|15353|lcl|protein:vir:9921 Length: 425 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795683;genbank:gi:28876465;genbank:GeneID :1257982 Length = 425 Score = 56.2 bits (134), Expect = 8e-10, Method: Compositional matrix adjust. 
Identities = 66/248 (26%), Positives = 100/248 (40%), Gaps = 28/248 (11%) Query: 191 NWFKEFYAYGFDDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKN--YFRQEYEADFSV 248 NW Y F T N V TY+DN D E ++N Y++ F+ Sbjct: 178 NWV---YKAFFVKTPKNTVVYQTTYKDNRFLDDVTRENIEELANRNEAYYKIYALGQFAT 234 Query: 249 FEGQIFDTFNAIDHVKD-LKGMRHFFKDDEAFETLLGIDVGY-RDPTAVLTIKYHYDTDT 306 + IF ++ KD L + FF G+D G+ DP+A+L +K Sbjct: 235 LDKLIFPKYDKQILNKDKLSHLPSFF----------GLDYGFINDPSALLHVKIDDANKK 284 Query: 307 YYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQ---DLAYEHEIASAPA 363 Y+LEEY + T + A I+ Y + + DSA + Q +L I Sbjct: 285 LYILEEYVRKNLTNDKIANAIKDL--GYAKEEIRGDSAEKKSNQELRNLGIPRMIDVTKG 342 Query: 364 KKSVLDGLACLQALFQQGKIIVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHL 423 +V+ G+ L Q IVD C I L+NY W + + + E D+ +H Sbjct: 343 PGTVMQGIQYL----LQYDWIVDERCVKTIEELENYTWKKDKKTNEYTNEPV--DSYNHC 396 Query: 424 CDALRYGI 431 DA+RY + Sbjct: 397 IDAIRYAV 404 >gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: gp123 # Family: family:all:147 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717790;genbank:gi:113200627;genbank:GeneI D:4239178 Length = 549 Score = 55.1 bits (131), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 55/204 (26%), Positives = 94/204 (46%), Gaps = 19/204 (9%) Query: 54 HRFVTACVSRRVGKSFI--AYTLGFLKLLEPNVKVLVVAPNYSLA-------NIGWSQIR 104 +RF A + R+ GKS I +Y L ++ L NV V ++A + A + + + Sbjct: 73 NRFNIAKLPRQSGKSTIVTSYLLWYV-LFNANVNVAILANKAATAREMLQRLQLSYENLP 131 Query: 105 GLIKKYGLQTERENAKDKEIELANGSLFKLASAAQADSAVGRSYDFIIFDEAA-ISDVGG 163 +++ LQ R +EL NGS LA++ A + G S++ I DE A + + Sbjct: 132 KWLQQGILQWNR-----GSLELENGSKI-LAASTSASAVRGMSFNVIFLDEFAFVPNHVA 185 Query: 164 DAFRVQLRPTLDKPNS-KALFISTPRGGNWFKEFYAYGFDDTLPNWVSIHGTYRDNPRAD 222 D F + PT+ S K + ISTP G N F + + + + ++ + + P D Sbjct: 186 DQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLW-HDAERKANEYIPTEVHWSEVPGRD 244 Query: 223 LNDIEEARRTVSKNYFRQEYEADF 246 E+ + S+ FR E+E +F Sbjct: 245 AAWKEQTIKNTSEQQFRVEFECEF 268 >gi|3326|lcl|protein:vir:94525 Length: 440 # NCBI annotation: putative large subunit terminase # Family: family:all:54 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223885;genbank:gi:62327097;genbank:GeneID :5075541 Length = 440 Score = 53.9 bits (128), Expect = 4e-09, Method: Compositional matrix adjust. 
Identities = 57/232 (24%), Positives = 101/232 (43%), Gaps = 30/232 (12%) Query: 210 SIHGTYRDNPRADLNDIEEARRTVSKNYFRQEYE--ADFSVFEGQIFDTFNAIDHVKDLK 267 +I TY+DN + + ++ + + +N R ++ + EG +FD L Sbjct: 203 AITTTYKDNDHLNADYVDSLKEMLVRNPNRARVAVLGEWGIAEGLVFD---------GLF 253 Query: 268 GMRHFFKDDEA-FETLLGIDVGYR-DPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAA 325 R F D+ A +G+D G++ DPTA I D Y+ +E+ + T Q Sbjct: 254 EQRDFSYDEIANLPKSVGLDFGFKHDPTAGEFIAVDQDNRIVYIYDEFYKQHLLTNQ--- 310 Query: 326 YIQHCIDRYKV--DRVFVDSAAAQFRQDLAYEHEIA----SAPAKKSVLDGLACLQALFQ 379 I + ++K + DSA + +L+ +H + S K SV+ G+ +Q+ Sbjct: 311 -IAQELAKHKAFGLPITADSAEQRMIVELSQQHRVPNIKPSGKGKDSVIQGIQYMQSY-- 367 Query: 380 QGKIIVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHLCDALRYGI 431 + +V L+ Y +D + L++ K DAN+H DALRY + Sbjct: 368 --RFVVHPRVKGLMEEFNTYVYDMDKEGNWLNKPK---DANNHAIDALRYAL 414 >gi|6719|lcl|protein:vir:99411 Length: 447 # NCBI annotation: hypothetical protein # Family: family:all:32729 # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919076;genbank:gi:119757034;genbank:GeneI D:4606064 Length = 447 Score = 52.0 bits (123), Expect = 1e-08, Method: Compositional matrix adjust. 
Identities = 86/350 (24%), Positives = 142/350 (40%), Gaps = 61/350 (17%) Query: 118 NAKDKEIELANGSLFKLASAAQADSAVGRSYDFIIFDEAAISDVGGDAFR-----VQLRP 172 ++ DK + G + L A + + G Y I DE D + + + Sbjct: 108 HSNDKRLVYVTGHVAWLGGADKWNRFAGGEYCRIWCDEVGHYPPNTDLYDLHEMLITRQR 167 Query: 173 TLDKPNSKALFISTPRGGNWFKEFY------AYGFDDTLPNWVS----IHGTYRDNPRAD 222 T PN+ L+ ST GN F +FY D+ LP W + + N Sbjct: 168 TEIGPNT-TLWTST---GNGFNQFYDITERQVNADDEPLP-WADQMEVVVASTEHNTLLP 222 Query: 223 LNDIEEARRTVSKNYFRQE--YEADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFE 280 + +++ RR K R+E F+ EG ++D F HV+D +R DD A Sbjct: 223 PDGLDKIRRQF-KGTAREEQGLHGGFAAAEGLVYDAFTRQTHVRDADDVRDRLADDWA-- 279 Query: 281 TLLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVF 340 + G D G+ DP +L I+ + + V +++ ++E H A + D D Sbjct: 280 -MYGYDAGWNDPRVLLDIRKTH-AGQFVVWDQFYKSES----HLAELVDPDDALPAD--- 330 Query: 341 VDS-AAAQFRQDLAYEHEIA-----------SAPAKKSVLDGLACLQ---ALFQQGK--I 383 VD A + R + EHE A + A+KS+ G+ ++ A+ +G+ + Sbjct: 331 VDPWLAGRPRGRVYAEHEPAHIEQFRKANWPAVKAEKSLDGGIDHVRSRLAMDDEGRPGV 390 Query: 384 IVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHLCDALRYGIYS 433 +V C LI +YK E+ + K A H DALRY +++ Sbjct: 391 LVTDRCGELIQEFLSYK------EDHVGTSK----AQDHALDALRYALFT 430 >gi|1596|lcl|protein:vir:93748 Length: 403 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240453;genbank:gi:66396122;genbank:GeneID :5133517 Length = 403 Score = 50.8 bits (120), Expect = 3e-08, Method: Compositional matrix adjust. 
Identities = 47/180 (26%), Positives = 82/180 (45%), Gaps = 18/180 (10%) Query: 264 KDLKGMRHFFKDDEAFET------LLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAE 317 KD K H+ K++E F+T G+D GY +++ + +D + YV+EE+ Sbjct: 228 KDFKEKVHYIKEEE-FKTKQIKRKYAGVDWGYEHYGSIMVVAEDFDGNK-YVIEEHAHRH 285 Query: 318 KTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQDLAYEHEIASAPAKKSVLDGLACLQAL 377 K A + I R+ + D+A + + E +I + A K+V+ G+ + L Sbjct: 286 KEIDDWVAIAKGVIKRHGDILFYCDTARPEHIERFRRE-KIKARYADKAVIAGIEVISRL 344 Query: 378 FQQGKI-IVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHLCDALRYGIYSISR 436 F+ KI I+ S + NY W K + ++P N DALRY +Y+ ++ Sbjct: 345 FKLNKIFIIKEKVSLFKEEIYNYVW-------KDNADEPVK-LNDDTLDALRYAVYTANK 396 >gi|14951|lcl|protein:vir:1235 Length: 405 # NCBI annotation: similar to phage O1205 ORF26 ( putative large subunit terminase) # Family: family:all:54 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510934;genbank:gi:17426268;genbank:GeneID :927381 Length = 405 Score = 50.8 bits (120), Expect = 3e-08, Method: Compositional matrix adjust. 
Identities = 47/180 (26%), Positives = 82/180 (45%), Gaps = 18/180 (10%) Query: 264 KDLKGMRHFFKDDEAFET------LLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAE 317 KD K H+ K++E F+T G+D GY +++ + +D + YV+EE+ Sbjct: 230 KDFKEKVHYIKEEE-FKTKQIKRKYAGVDWGYEHYGSIMVVAEDFDGNK-YVIEEHAHRH 287 Query: 318 KTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQDLAYEHEIASAPAKKSVLDGLACLQAL 377 K A + I R+ + D+A + + E +I + A K+V+ G+ + L Sbjct: 288 KEIDDWVAIAKGVIKRHGDILFYCDTARPEHIERFRRE-KIKARYADKAVIAGIEVISRL 346 Query: 378 FQQGKI-IVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHLCDALRYGIYSISR 436 F+ KI I+ S + NY W K + ++P N DALRY +Y+ ++ Sbjct: 347 FKLNKIFIIKEKVSLFKEEIYNYVW-------KDNADEPVK-LNDDTLDALRYAVYTANK 398 >gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T4-like DNA packaging large subunit terminase # Family: family:all:147 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214360;genbank:gi:61806000;genbank:GeneID :3294466 Length = 547 Score = 50.8 bits (120), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 58/200 (29%), Positives = 93/200 (46%), Gaps = 11/200 (5%) Query: 54 HRFVTACVSRRVGKS--FIAYTLGFLKLLEPNVKVLVVAPNYSLAN--IGWSQIRGLIKK 109 +RF + R+ GKS I+Y L + + NV V V+A S A +G Q+ Sbjct: 71 NRFNICKMPRQTGKSTTCISYLLHY-AVFNDNVNVAVLANKASTARDLLGRLQLAYENLP 129 Query: 110 YGLQTERENAKDKEIELANGSLFKLASAAQADSAV-GRSYDFIIFDEAA-ISDVGGDAFR 167 +Q + +EL NGS K+++ + + SAV G SY+ I DE A I + D F Sbjct: 130 RWMQQGIISWNKGSLELENGS--KISANSTSSSAVRGGSYNVIFLDEFAFIPNHIADDFF 187 Query: 168 VQLRPTLDKPNS-KALFISTPRGGNWFKEFYAYGFDDTLPNWVSIHGTYRDNPRADLNDI 226 + PT+ S K + +STPRG N F + + + +V+ + + P D Sbjct: 188 ASVYPTITSGQSTKVIIVSTPRGMNHFYRMW-HDSEKGKSEYVATDVHWSEVPGRDEEWK 246 Query: 227 EEARRTVSKNYFRQEYEADF 246 E+ S+ F+ E+E +F Sbjct: 247 EQTIANTSEQQFKIEFECEF 266 >gi|9289|lcl|protein:vir:97165 Length: 431 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239721;genbank:gi:66394878;genbank:GeneID :5130898 Length = 431 Score = 50.4 bits (119), Expect = 4e-08, 
Method: Compositional matrix adjust. Identities = 67/248 (27%), Positives = 102/248 (41%), Gaps = 34/248 (13%) Query: 205 LPNWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQEY--EADFSVFEGQIFDTFNAIDH 262 L N S TYR N D DIE KN R + D+ V EG +FD F Sbjct: 187 LNNTFSDTTTYRVNEWLDKVDIERYEDLYIKNPRRARIVCDGDWGVAEGLVFDNFK---- 242 Query: 263 VKDLKGMRHFFKDDEAFETLLGIDVGY-RDPTAVLTIKYHYDTDTYYVLEE-YQQAEKTT 320 V+D F + E G+D G+ +DPT V++ ++ +E Y++A T Sbjct: 243 VEDFDWFEEFKRTQEITH---GMDFGFSQDPTTVVSTVVDLKNKKLFIYDEHYKKAMLTD 299 Query: 321 AQHAAYIQHCIDRYKV--------DRVFVDSAAAQFRQDLAYEHEIASAPAKKSVLDGLA 372 I+ + + DRV + + + + A ++L G+ Sbjct: 300 DIKQMLIKKGLGDVDIAADYGAGGDRVISELKSKGIK---GIRKALKGA---NTILPGIQ 353 Query: 373 CLQALFQQGKIIVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHLCDALRYGI- 431 +Q ++I+ SC I Y +D Q+ + K KP DAN+H+ DALRY + Sbjct: 354 FIQGF----EVIIHPSCEHAIEEFNTYTFD-QDNDGKW-LNKPI-DANNHIIDALRYSLE 406 Query: 432 -YSISRGK 438 Y I R K Sbjct: 407 KYHIVRKK 414 >gi|8220|lcl|protein:vir:101493 Length: 588 # NCBI annotation: gp8 # Family: family:all:147 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655387;genbank:gi:109522575;genbank:GeneI D:4157565 Length = 588 Score = 49.7 bits (117), Expect = 7e-08, Method: Compositional matrix adjust. 
Identities = 77/332 (23%), Positives = 116/332 (34%), Gaps = 85/332 (25%) Query: 48 ALEDPRHRFVTACVSRRVGKS-----------FIAY-TLGFLKLLEPNVKVLVVAPNYSL 95 A+E+ R A RR GKS F+AY L+ + +V P YS Sbjct: 44 AIEESPARNKVATCGRRFGKSDLGGKRLVPPAFLAYYRQDLLRTTNKAMIYWIVGPEYSD 103 Query: 96 ANIGWSQIRGLIKKYGLQTERENA------KDKEIELANGSLFKLASAAQ-ADSAVGRSY 148 A + + + G+ ++ + + + L G+ A +A+ D+ VG Sbjct: 104 AEKEFRVLWNTLVSLGVPFDKPGSYYDAVGGNMHLSLWQGAYQVHAKSAKYPDTLVGEGL 163 Query: 149 DFIIFDEAAISDVGGDAFRVQLRPTLDKPNSKALFISTPRGGNWFKEFYAYGFDDTLPNW 208 +I EAA + +RPTL +L STP G N F + + G D P W Sbjct: 164 SGVIMAEAAKQKPS--VWTKHVRPTLGDFTGWSLHTSTPEGKNHFHDKFQMGQDPNNPEW 221 Query: 209 VS-------------------------------------------IHGTYRDNP------ 219 S + R+NP Sbjct: 222 ESWRMPSWANPYVYTRTGRLIALGKLPPETPIPDHEYTLDRHVTIMQQLMRENPTVSPFK 281 Query: 220 -------RADLNDIEEARRTVSKNYFRQEYEADFSVFEGQIFDTFNAIDHVKDL-----K 267 R D IE A +S F QE ADF+ + G++F ++ HV DL K Sbjct: 282 IVADHKLRVDQEVIELA-ADMSIESFNQEIGADFTEYVGKVFKDWDEEYHVADLVDYNVK 340 Query: 268 GMRHFFKDDEAFETLLGIDVGYRDPTAVLTIK 299 +F + +ET D GY +P L I+ Sbjct: 341 QFGTYFNPN--YETYAAADYGYTNPNVWLVIQ 370 >gi|9070|lcl|protein:vir:102238 Length: 588 # NCBI annotation: gp8 # Family: family:all:147 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655204;genbank:gi:109522784;genbank:GeneI D:4157477 Length = 588 Score = 49.7 bits (117), Expect = 8e-08, Method: Compositional matrix adjust. 
Identities = 77/332 (23%), Positives = 116/332 (34%), Gaps = 85/332 (25%) Query: 48 ALEDPRHRFVTACVSRRVGKS-----------FIAY-TLGFLKLLEPNVKVLVVAPNYSL 95 A+E+ R A RR GKS F+AY L+ + +V P YS Sbjct: 44 AIEESPARNKVATCGRRFGKSDLGGKRLVPPAFLAYYRQDLLRTTNKAMIYWIVGPEYSD 103 Query: 96 ANIGWSQIRGLIKKYGLQTERENA------KDKEIELANGSLFKLASAAQ-ADSAVGRSY 148 A + + + G+ ++ + + + L G+ A +A+ D+ VG Sbjct: 104 AEKEFRVLWNTLVSLGVPFDKPGSYYDAVGGNMHLSLWQGAYQVHAKSAKYPDTLVGEGL 163 Query: 149 DFIIFDEAAISDVGGDAFRVQLRPTLDKPNSKALFISTPRGGNWFKEFYAYGFDDTLPNW 208 +I EAA + +RPTL +L STP G N F + + G D P W Sbjct: 164 SGVIMAEAAKQKPS--VWTKHVRPTLGDFTGWSLHTSTPEGKNHFHDKFQMGQDPNNPEW 221 Query: 209 VS-------------------------------------------IHGTYRDNP------ 219 S + R+NP Sbjct: 222 ESWRMPSWANPYVYTRTGRLIALGKLPPETPIPDHEYTLDRHVTIMQQLMRENPTVSPFK 281 Query: 220 -------RADLNDIEEARRTVSKNYFRQEYEADFSVFEGQIFDTFNAIDHVKDL-----K 267 R D IE A +S F QE ADF+ + G++F ++ HV DL K Sbjct: 282 IVADHKLRVDQEVIELA-ADMSIESFNQEIGADFTEYVGKVFKDWDEEYHVADLVDYNVK 340 Query: 268 GMRHFFKDDEAFETLLGIDVGYRDPTAVLTIK 299 +F + +ET D GY +P L I+ Sbjct: 341 QFGTYFNPN--YETYAAADYGYTNPNVWLVIQ 370 >gi|4261|lcl|protein:vir:94806 Length: 402 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240530;genbank:gi:66396200;genbank:GeneID :5133586 Length = 402 Score = 48.9 bits (115), Expect = 1e-07, Method: Compositional matrix adjust. 
Identities = 46/180 (25%), Positives = 81/180 (45%), Gaps = 18/180 (10%) Query: 264 KDLKGMRHFFKDDEAFET------LLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAE 317 KD K H+ ++E F+T G+D GY +++ + +D + YV+EE+ Sbjct: 227 KDFKEKVHYITEEE-FKTKQIKRKYAGVDWGYEHYGSIMVVAEDFDGNK-YVIEEHAHRH 284 Query: 318 KTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQDLAYEHEIASAPAKKSVLDGLACLQAL 377 K A + I R+ + D+A + + E +I + A K+V+ G+ + L Sbjct: 285 KEIDDWVAIAKGVIKRHGDILFYCDTARPEHIERFRRE-KIKARYADKAVIAGIEVISRL 343 Query: 378 FQQGKI-IVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHLCDALRYGIYSISR 436 F+ KI I+ S + NY W K + ++P N DALRY +Y+ ++ Sbjct: 344 FKLNKIFIIKEKVSLFKEEIYNYVW-------KDNADEPVK-LNDDTLDALRYAVYTANK 395 >gi|9935|lcl|protein:vir:97273 Length: 402 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240605;genbank:gi:66396276;genbank:GeneID :5133629 Length = 402 Score = 48.9 bits (115), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 46/180 (25%), Positives = 81/180 (45%), Gaps = 18/180 (10%) Query: 264 KDLKGMRHFFKDDEAFET------LLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAE 317 KD K H+ ++E F+T G+D GY +++ + +D + YV+EE+ Sbjct: 227 KDFKEKVHYITEEE-FKTKQIKRKYAGVDWGYEHYGSIMVVAEDFDGNK-YVIEEHAHRH 284 Query: 318 KTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQDLAYEHEIASAPAKKSVLDGLACLQAL 377 K A + I R+ + D+A + + E +I + A K+V+ G+ + L Sbjct: 285 KEIDDWVAIAKGVIKRHGDILFYCDTARPEHIERFRRE-KIKARYADKAVIAGIEVISRL 343 Query: 378 FQQGKI-IVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHLCDALRYGIYSISR 436 F+ KI I+ S + NY W K + ++P N DALRY +Y+ ++ Sbjct: 344 FKLNKISIIKEKVSLFKEEIYNYVW-------KDNADEPVK-LNDDTLDALRYAVYTANK 395 >gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899601;genbank:gi:34419588;genbank:GeneID :2546011 Length = 600 Score = 47.0 bits (110), Expect = 4e-07, Method: Compositional matrix adjust. 
Identities = 85/381 (22%), Positives = 147/381 (38%), Gaps = 60/381 (15%) Query: 53 RHRFVTACVSRRVGKS-----FIAYTLGFLKLLEPNVKVLVVAPNYSLANIGWSQIRGLI 107 R RF + R++GK+ F+A+ L F + E + +A S++ +++ +I Sbjct: 138 RSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKEAGI----LAHKGSMSMEVLERVKNVI 193 Query: 108 KKYG--LQTERENAKDKEIELANGSLFKLASAAQADSAVGRSYDFIIFDEAAISDVGGDA 165 + LQ E I NG A A+ +D+ G+S+ I DE A D Sbjct: 194 ENLPDFLQPGIEEWNKGNITFDNGCKLG-AYASGSDAVRGKSFSMIYVDECAFVPGFDDF 252 Query: 166 FRVQLRPTLDKPNSKALFISTPRGGNWFKEFYAYG------FDDTLPNWVSIHG-TYRDN 218 ++ SK + STP G N + + + F+ W ++ Y+D Sbjct: 253 WKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWRAVQNRLYKD- 311 Query: 219 PRADLNDIEEARR----TVSKNYFRQEYEADFSVFEGQIFDTF-----NAIDHVKDLKGM 269 + +D E +R S+ F QE+ +F G + + F ID VKD G Sbjct: 312 --GEFDDGEAFKRETIGNTSREAFSQEHLCNFLGTAGTLINGFKLSKMKGIDVVKDSDGW 369 Query: 270 RHFFKDDEAFETLLGIDVGY---RDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQH--- 323 + K +E + +L +D +D A+ I T Y E+ H Sbjct: 370 CVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDV-----TSYPFEQVAVFHDNKTSHLLL 424 Query: 324 AAYIQHCIDRYKVDRVFVDSAAA------QFRQDLAYEHEIAS------------APAKK 365 A I RY V+ + A+ + +DL YE+ I P KK Sbjct: 425 PAIIMKQAYRYNEAYVYCEIASTGELVMNELFRDLEYENVIMEERASGGRRGLGLKPNKK 484 Query: 366 SVLDGLACLQALFQQGKIIVD 386 + G + L+ L ++ ++ ++ Sbjct: 485 TKAIGCSTLKDLIEKDQLKIN 505 >gi|18891|lcl|protein:vir:8847 Length: 482 # NCBI annotation: terminase # Family: family:all:1730 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775255;genbank:gi:27476053;genbank:GeneID :2700601 Length = 482 Score = 47.0 bits (110), Expect = 4e-07, Method: Compositional matrix adjust. 
Identities = 79/363 (21%), Positives = 130/363 (35%), Gaps = 57/363 (15%) Query: 105 GLIKKYGL-QTERENAKDKEIELA-----NGSL----FKLASAAQADSAVGRSYDFIIFD 154 G+I K + +TER K ++ +G L FK +Q D +G + D I D Sbjct: 117 GMIPKEDIVKTERREGKPGCVQAVMVRHVSGGLSSLIFKSYEMSQ-DKFMGTAIDVIWLD 175 Query: 155 EAAISDVGGDAFRVQLRPTLDKPNSKALFISTPRGGNWFKEFYAYGFDDTLPNWVSIHGT 214 E D+ Q TP G E D P IH + Sbjct: 176 EECPKDI-----YTQCVTRTATTGGIVYLTFTPEHG--LTEIVKDFLQDLKPGQFLIHAS 228 Query: 215 YRDNPRADLNDIEEARRTVSKNYFRQEYEADFSVFEGQIFDTFNAIDHVKDLKGMRHFFK 274 + D P E+ S R E + G +F + + K + F Sbjct: 229 WEDAPHLSPEVKEQLLSVYSPAERRMRAEGIPMLGSGVVFP-------ILEEKFVCEPFD 281 Query: 275 DDEAFETLLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYI----QHC 330 + F ++GID+G+ P A+ + + + D YY+ +E ++ +T HA I H Sbjct: 282 IPDHFHRIIGIDLGFDHPNAIACVAWDAEKDKYYLYDERSESGETLGMHADAIYLKGGHQ 341 Query: 331 I------DRYKVD-----RVFVDSAAAQFRQDLAYEHEIASAPAK------KSVLDGLAC 373 I D +K D R FVD ++ YE ++ P SV G+ Sbjct: 342 IPVVVPHDAFKHDGATSGRRFVDLLKDDHNLNVVYE-PFSNPPGPDGKHGGNSVEFGVNW 400 Query: 374 LQALFQQGKIIVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHLCDALRYGIYS 433 + + G + V +C++ + ++ Y ++ D N + A RY + Sbjct: 401 MLTRMENGDLKVFNTCTNFLKEMKMYH----------RKDGKIVDRNDDMISATRYALLM 450 Query: 434 ISR 436 SR Sbjct: 451 ASR 453 >gi|8126|lcl|protein:vir:102661 Length: 568 # NCBI annotation: terminase # Family: family:all:147 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024418;genbank:gi:48696639;genbank:GeneID :2948128 Length = 568 Score = 45.8 bits (107), Expect = 9e-07, Method: Compositional matrix adjust. 
Identities = 29/95 (30%), Positives = 48/95 (50%), Gaps = 6/95 (6%) Query: 124 IELANGSLFKLASAAQADSAVGRSYDFIIFDEAAISDVGGDAFRVQLRPTLDKPNSKALF 183 + NGS ++L +D VG I++ E+A+ R LRP LD+ L Sbjct: 160 VRFTNGSTYQL-QGGDSDKLVGAGPVGIVYSESALMSPN---VRTFLRPMLDETGGWELH 215 Query: 184 ISTPRGGNWFKEF--YAYGFDDTLPNWVSIHGTYR 216 I+TPRG NWF + +A ++ +++I+ T+R Sbjct: 216 ITTPRGKNWFYKLAMHAEKSEEWYYKYLTINDTWR 250 >gi|7076|lcl|protein:vir:96147 Length: 419 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240074;genbank:gi:66395738;genbank:GeneID :5133130 Length = 419 Score = 45.1 bits (105), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 67/266 (25%), Positives = 108/266 (40%), Gaps = 40/266 (15%) Query: 189 GGNWFKEFYAYGFDDTLPNWVSI--------------HGTYRDNPRADLNDIEEARRTVS 234 G ++K FY Y +WV+ H TY +NP IEEA+ + Sbjct: 158 NGLFYKFFYTYNPPKRKQSWVNKKYESSFQPDNTFVHHSTYLNNPFIAKEFIEEAKAAKA 217 Query: 235 KNYFRQEYEADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLLGI----DVGY- 289 N R +E + G+ + V +R E F+T I D GY Sbjct: 218 INELRYRWE-----YLGEAIGS-----GVVPFNNLRIETIPKEQFDTFDNIRNAVDFGYA 267 Query: 290 RDPTAVLTIKYHYDTD--TYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAAAQ 347 DP A +++HYD Y ++E+ + + + A +++ Y+ D ++ DSA + Sbjct: 268 TDPLAF--VRWHYDKKKRIIYAVDEHYGVQISNREFANWLKK--KGYQSDEIYADSAEPK 323 Query: 348 FRQDLAYEHEIASAPAKKSVLDGLA-CLQALFQQGKIIVDASCSSLIHALQNYKWDFQEG 406 +L EH I K D + Q L I++D + + I A + D+Q Sbjct: 324 SIAELKQEHSIRRIKGVKKGPDSVEHGEQWLNDLDAIVIDPTRTPNI-AREFENIDYQ-- 380 Query: 407 EEKLSREKPR-HDANSHLCDALRYGI 431 +K KPR D ++H DA RY + Sbjct: 381 TDKDGNVKPRLEDKDNHTIDATRYAL 406 >gi|18781|lcl|protein:vir:7429 Length: 581 # NCBI annotation: gp6 # Family: family:all:147 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818544;genbank:gi:29566981;genbank:GeneID :1260231 Length = 581 Score = 43.1 bits (100), Expect = 7e-06, Method: Compositional matrix adjust. 
Identities = 48/220 (21%), Positives = 87/220 (39%), Gaps = 41/220 (18%) Query: 238 FRQEYEADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLLGIDVGYRDPTAVLT 297 F QE A+F+ F G++F ++ HV++L + + +ET+ +D GYR+P L Sbjct: 304 FNQEIAAEFTDFVGKVFKEYDEDTHVREL-----VYNPSQDWETIAAVDYGYRNPNVWLL 358 Query: 298 IKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQH---CIDRYKVDRVFVDSAAAQ------- 347 I+ +++E QA+ T + A I C D + + D AA + Sbjct: 359 IQIG-PWGEINIVDELYQADLTPTEFANEILRRGLCPD--TLHSFYADPAAPEASRTLET 415 Query: 348 -FRQ-------------DLAYEHEIASAPAKKSVLDGLACLQALFQQG--------KIIV 385 FRQ D+ + K ++D + FQ G ++++ Sbjct: 416 IFRQHGKRARSRPHTGGDIDNRLNLIRFALKDRIVDAEMSAPSWFQAGASQDVRRPRMMI 475 Query: 386 DASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHLCD 425 C I Y++ + E+ + K R++ L D Sbjct: 476 STRCPKTIFEFGEYRYPKTKDEQTETSTK-RYETPMKLND 514 Score = 37.4 bits (85), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 45/189 (23%), Positives = 77/189 (40%), Gaps = 26/189 (13%) Query: 41 PQIAIINALEDPRHRFVTACVSRRVGKS------FIAYTLGFLKLLEPNV-------KVL 87 P + +ED +++ A RR+GKS FI + K + + + Sbjct: 36 PHSGQLEFMED-DAQYLCATCGRRMGKSAGIAHEFIPEAM-ITKEMATTLLDDGKRREFW 93 Query: 88 VVAPNYSLAN----IGWSQIRGL---IKKYGLQTERENAKDKEIELANGS-LFKLASAAQ 139 V PNYS A + W++ R L K G + + D + L +G+ ++ S+A Sbjct: 94 TVGPNYSDAEKPFRVFWNKCRALGIPFDKPGTYFDIKGG-DMTVSLWDGAFIYSAKSSAV 152 Query: 140 ADSAVGRSYDFIIFDEAAISDVGGDAFRVQLRPTLDKPNSKALFISTPRGGNWFKEFYAY 199 + VG + +EAA ++ + PTL A F +TP G NW+ + + Sbjct: 153 PERLVGEGLTGVHMEEAAKQKE--VVWKQMIMPTLMDFGGWAKFTTTPEGKNWYYDLHQK 210 Query: 200 GFDDTLPNW 208 + NW Sbjct: 211 ALRPSTLNW 219 >gi|5891|lcl|protein:vir:107098 Length: 421 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950600;genbank:gi:119953680;genbank:GeneI D:4643107 Length = 421 Score = 42.7 bits (99), Expect = 9e-06, Method: Compositional matrix adjust. 
Identities = 64/257 (24%), Positives = 110/257 (42%), Gaps = 36/257 (14%) Query: 188 RGGNWFKEFYAYGF--DDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKN--YFRQEYE 243 R +W + Y F D+T + H TY DNP I+EA +N +R EY Sbjct: 175 RKQSWVNKKYETSFQPDNTFVH----HSTYLDNPFISKQFIQEAESAKERNEQRYRWEYM 230 Query: 244 ADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDD--EAFETLL-GIDVGY-RDPTAVLTIK 299 + ++ G + FN + K DD + F+ + +D GY DP A ++ Sbjct: 231 GE-AIGSGVV--PFNNLQIEK--------IPDDLYKTFDNIRNAVDFGYATDPLAF--VR 277 Query: 300 YHYDTDT--YYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQDLAYEHE 357 +HYD Y ++E+ + + + A +++ Y+ D ++ DSA + +L EH Sbjct: 278 WHYDKKKRIIYAVDEHYGVQISNREFANWLKRR--GYQSDEIYADSAEPKSIAELKQEHG 335 Query: 358 IASAPAKKSVLDGLA-CLQALFQQGKIIVDAS-CSSLIHALQNYKWDFQEGEEKLSREKP 415 I K D + Q L I++D + ++ +N ++ +K KP Sbjct: 336 IKRIKGVKKGPDSVEHGEQWLDDLTAIVIDPNRTPNIAREFENIDYE----TDKDGNVKP 391 Query: 416 R-HDANSHLCDALRYGI 431 R D ++H DA RY + Sbjct: 392 RLEDKDNHTIDATRYAL 408 >gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469498;genbank:gi:157311455;genbank:Ge neID:5602125 Length = 607 Score = 42.4 bits (98), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 74/306 (24%), Positives = 120/306 (39%), Gaps = 56/306 (18%) Query: 122 KEIELANGSLFKLASAAQADSAVGRSYDFIIFDEAAISDVGGDAFRVQLRPTLDKP-NSK 180 K I L NGS A A+ D+ G S+ FI DE A D F + ++P + SK Sbjct: 222 KSIVLENGSSIG-AYASSPDAVRGNSFSFIYIDECAFIQNWTDCF-LAIQPVISSGRESK 279 Query: 181 ALFISTPRGGNWFKEFYAYGFDDT---LPNWVSIHGTY-RDNPRADLND-----IEEARR 231 + +TP G N F + + D +P H R +AD+ D +A Sbjct: 280 MIMTTTPNGLNHFYDIWQSAIDGKSGYVPYEAVWHSVKERLYNKADIFDDGYEWSSQAIA 339 Query: 232 TVSKNYFRQEYEADF-----SVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEA-------- 278 S F QE+ A+F ++ + ID V D G F K E Sbjct: 340 GSSLEQFLQEHNAEFFGSSGTLIRATTLSRLSFIDVVND-NGFYQFEKPKEGRKYVATLD 398 Query: 279 --------FETLLGIDVG---YRDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYI 327 + L ID+ Y+ P AV YH +T ++++L + ++ Sbjct: 399 CSEGRGQDYHALQIIDITEFPYK-PVAV----YHSNTTSHFILPD------IVFKYLMMY 447 Query: 328 QHCIDRYKVDRVFVDSAAAQFRQDLAYEHEIASA-------PAKKSVLDGLACLQALFQQ 380 C +++ V S A DL Y++ I + +K+S G + L+ L ++ Sbjct: 448 NECPVYIELNSTGV-SIAKSLAMDLEYDNIICDSFIDLGMKQSKRSKAMGCSALKDLIEK 506 Query: 381 GKIIVD 386 K+I++ Sbjct: 507 DKLIIN 512 >gi|8730|lcl|protein:vir:96840 Length: 421 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240151;genbank:gi:66395816;genbank:GeneID :5133181 Length = 421 Score = 42.4 bits (98), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 63/254 (24%), Positives = 105/254 (41%), Gaps = 30/254 (11%) Query: 188 RGGNWFKEFYAYGF--DDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQEYEAD 245 R +W + Y F D+T + H TY DNP I+EA +N R +E Sbjct: 175 RKQSWVNKKYESSFQPDNTFVH----HSTYLDNPFIAKQFIDEAEAAKERNELRYRWEYL 230 Query: 246 FSVFEGQIFDTFN-AIDHVKDLKGMRHFFKDDEAFETLL-GIDVGY-RDPTAVLTIKYHY 302 + N I+ + D F+ +F+ + +D GY DP A +++HY Sbjct: 231 GEAIGSGVVPFNNLQIEKIPD-----ELFR---SFDNIRNAVDFGYATDPLAF--VRWHY 280 Query: 303 DTD--TYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQDLAYEHEIAS 360 D Y ++EY + + Q ++ Y+ D ++ DSA + +L EH I Sbjct: 281 DKKKRVIYAVDEYYGVQISNRQFGKWL--WSKGYQSDDIYADSAEPKSIDELRKEHGIKR 338 Query: 361 APAKKSVLDGLA-CLQALFQQGKIIVDAS-CSSLIHALQNYKWDFQEGEEKLSREKPR-H 417 K D + Q L I++D + ++ +N DF+ +K KP+ Sbjct: 339 IKGVKKGPDSVEYGEQWLNDLDAIVIDPNRTPNIAREFENI--DFE--TDKDGNVKPKLE 394 Query: 418 DANSHLCDALRYGI 431 D ++H DA RY + Sbjct: 395 DKDNHTIDATRYAL 408 >gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891724;genbank:gi:33620526;genbank:GeneID :1725332 Length = 607 Score = 42.0 bits (97), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 71/301 (23%), Positives = 119/301 (39%), Gaps = 46/301 (15%) Query: 122 KEIELANGSLFKLASAAQADSAVGRSYDFIIFDEAAISDVGGDAFRVQLRPTLDKP-NSK 180 K I L NGS A A+ D+ G S+ FI DE A D F + ++P + SK Sbjct: 222 KSIVLENGSSIG-AYASSPDAVRGNSFSFIYIDECAFIQNWTDCF-LAIQPVISSGRESK 279 Query: 181 ALFISTPRGGNWFKEFYAYGFDDT---LPNWVSIHGTY-RDNPRADLND-----IEEARR 231 + +TP G N F + + D +P H R +AD+ D +A Sbjct: 280 MIMTTTPNGLNHFYDIWQSAIDGKSGYVPYEAVWHSVKERLYNKADIFDDGYEWSSQAIA 339 Query: 232 TVSKNYFRQEYEADF-----SVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLLGID 286 S F QE+ A+F ++ + ID V D G F K E + + +D Sbjct: 340 GSSLEQFLQEHNAEFFGSSGTLIRATTLSRLSFIDVVND-NGFYQFEKPKEGRKYVATLD 398 Query: 287 VGY---RDPTAVLTIK-----------YHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCID 332 +D A+ I YH +T ++++L + ++ C Sbjct: 399 CSEGRGQDYHALQIIDITEFPYKQVAVYHSNTTSHFILPD------IVFKYLMMYNECPV 452 Query: 333 RYKVDRVFVDSAAAQFRQDLAYEHEIASA-------PAKKSVLDGLACLQALFQQGKIIV 385 +++ V S A DL Y++ I + +K+S G + L+ L ++ K+I+ Sbjct: 453 YIELNSTGV-SIAKSLAMDLEYDNIICDSFIDLGMKQSKRSKAMGCSALKDLIEKDKLII 511 Query: 386 D 386 + Sbjct: 512 N 512 >gi|10354|lcl|protein:vir:97448 Length: 421 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240743;genbank:gi:66396415;genbank:GeneID :5133804 Length = 421 Score = 40.8 bits (94), Expect = 3e-05, Method: Compositional matrix adjust. 
Identities = 61/255 (23%), Positives = 112/255 (43%), Gaps = 32/255 (12%) Query: 188 RGGNWFKEFYAYGF--DDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKN--YFRQEYE 243 R +W + Y F D+T + H TY +NP I+EA +N +R EY Sbjct: 175 RKQSWVNKKYESSFQADNTYVH----HSTYLNNPFISKQFIQEAESAKKRNEQRYRWEYM 230 Query: 244 ADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLL-GIDVGY-RDPTAVLTIKYH 301 + ++ G + FN + ++++ ++ + F+ + +D GY DP A +++H Sbjct: 231 GE-AIGSGVV--PFNNL-RIEEIPQRQY-----DTFDNIRNAVDFGYATDPLAF--VRWH 279 Query: 302 YDTD--TYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQDLAYEHEIA 359 YD Y ++EY + + + A +++ Y+ D +F DSA + +L EH I Sbjct: 280 YDKKKRVIYAMDEYYGVQISNREFANWLKK--KGYQSDEIFADSAEPKSIAELKQEHGIK 337 Query: 360 SAPAKKSVLDGLA-CLQALFQQGKIIVDA-SCSSLIHALQNYKWDFQEGEEKLSREKPR- 416 K D + Q L I++D ++ +N ++ +K KP+ Sbjct: 338 KIKGVKKGADSVEFGEQWLDDLDAIVIDPRRTPNIAREFENIDYE----TDKDGNVKPKL 393 Query: 417 HDANSHLCDALRYGI 431 D ++H DA RY + Sbjct: 394 EDKDNHTIDATRYAL 408 >gi|3182|lcl|protein:vir:94499 Length: 421 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240671;genbank:gi:66396341;genbank:GeneID :5133763 Length = 421 Score = 40.8 bits (94), Expect = 3e-05, Method: Compositional matrix adjust. 
Identities = 61/255 (23%), Positives = 112/255 (43%), Gaps = 32/255 (12%) Query: 188 RGGNWFKEFYAYGF--DDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKN--YFRQEYE 243 R +W + Y F D+T + H TY +NP I+EA +N +R EY Sbjct: 175 RKQSWVNKKYESSFQADNTYVH----HSTYLNNPFISKQFIQEAESAKKRNEQRYRWEYM 230 Query: 244 ADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLL-GIDVGY-RDPTAVLTIKYH 301 + ++ G + FN + ++++ ++ + F+ + +D GY DP A +++H Sbjct: 231 GE-AIGSGVV--PFNNL-RIEEIPQRQY-----DTFDNIRNAVDFGYATDPLAF--VRWH 279 Query: 302 YDTD--TYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQDLAYEHEIA 359 YD Y ++EY + + + A +++ Y+ D +F DSA + +L EH I Sbjct: 280 YDKKKRVIYAMDEYYGVQISNREFANWLKK--KGYQSDEIFADSAEPKSIAELKQEHGIK 337 Query: 360 SAPAKKSVLDGLA-CLQALFQQGKIIVDA-SCSSLIHALQNYKWDFQEGEEKLSREKPR- 416 K D + Q L I++D ++ +N ++ +K KP+ Sbjct: 338 KIKGVKKGADSVEFGEQWLDDLDAIVIDPRRTPNIAREFENIDYE----TDKDGNVKPKL 393 Query: 417 HDANSHLCDALRYGI 431 D ++H DA RY + Sbjct: 394 EDKDNHTIDATRYAL 408 >gi|2589|lcl|protein:vir:94084 Length: 407 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240228;genbank:gi:66395894;genbank:GeneID :5133253 Length = 407 Score = 40.8 bits (94), Expect = 3e-05, Method: Compositional matrix adjust. 
Identities = 37/158 (23%), Positives = 68/158 (43%), Gaps = 11/158 (6%) Query: 280 ETLLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRV 339 E G+D GY +++ I D + +Y +EE+ K + + RY Sbjct: 252 EYFAGVDWGYEHYGSIVLIGRGIDGN-FYFIEEHAHQFKFIDDWVVIAKDIVSRYGNINF 310 Query: 340 FVDSAAAQFRQDLAYEHEIASAPAKKSVLDGLACLQALFQQGKIIV-DASCSSLIHALQN 398 + D+A ++ + H + + A KS L G+ + LF+Q K++V + + Sbjct: 311 YCDTARPEYITEFR-RHRLRAINADKSKLSGVEEVAKLFKQNKLLVLYDNMDRFKQEVFK 369 Query: 399 YKWDFQEGEEKLSREKPRHDANSHLCDALRYGIYSISR 436 Y W GE P + + + D+LRY IY+ ++ Sbjct: 370 YVWHPTNGE-------PIKEFDD-VLDSLRYAIYTHTK 399 >gi|3523|lcl|protein:vir:105897 Length: 407 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004370;genbank:gi:122891825;genbank:Ge neID:4712368 Length = 407 Score = 40.8 bits (94), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 37/158 (23%), Positives = 68/158 (43%), Gaps = 11/158 (6%) Query: 280 ETLLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRV 339 E G+D GY +++ I D + +Y +EE+ K + + RY Sbjct: 252 EYFAGVDWGYEHYGSIVLIGRGIDGN-FYFIEEHAHQFKFIDDWVVIAKDIVSRYGNINF 310 Query: 340 FVDSAAAQFRQDLAYEHEIASAPAKKSVLDGLACLQALFQQGKIIV-DASCSSLIHALQN 398 + D+A ++ + H + + A KS L G+ + LF+Q K++V + + Sbjct: 311 YCDTARPEYITEFR-RHRLRAINADKSKLSGVEEVAKLFKQNKLLVLYDNMDRFKQEVFK 369 Query: 399 YKWDFQEGEEKLSREKPRHDANSHLCDALRYGIYSISR 436 Y W GE P + + + D+LRY IY+ ++ Sbjct: 370 YVWHPTNGE-------PIKEFDD-VLDSLRYAIYTHTK 399 >gi|6284|lcl|protein:vir:95890 Length: 421 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240381;genbank:gi:66396048;genbank:GeneID :5133401 Length = 421 Score = 40.0 bits (92), Expect = 6e-05, Method: Compositional matrix adjust. 
Identities = 64/254 (25%), Positives = 109/254 (42%), Gaps = 30/254 (11%) Query: 188 RGGNWFKEFYAYGF--DDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKN--YFRQEYE 243 R +W + Y F D+T + H TY +NP I+EA +N +R EY Sbjct: 175 RKQSWVNKKYESSFQADNTFVH----HSTYLNNPFISKQFIQEAESAKKRNEQRYRWEYM 230 Query: 244 ADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLLGIDVGY-RDPTAVLTIKYHY 302 + ++ G + FN + + +G F + +D GY DP A +++HY Sbjct: 231 GE-AIGSGVV--PFNNLRIEEIPQGQYDTFDNIRN-----AVDFGYATDPLAF--VRWHY 280 Query: 303 DTD--TYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQDLAYEHEIAS 360 D Y ++E+ + + + A +++ Y+ D +F DSA + +L EH I Sbjct: 281 DKKKRVIYAMDEHYGVQISNREFANWLKK--KGYQSDEIFADSAEPKSIAELKQEHGIKK 338 Query: 361 APAKKSVLDGLA-CLQALFQQGKIIVDA-SCSSLIHALQNYKWDFQEGEEKLSREKPR-H 417 A K D + Q L I++D ++ +N D+Q +K KP+ Sbjct: 339 VKAVKKGADSVEYGEQWLDDLEAIVIDPRRTPNIAREFENI--DYQ--TDKDGNVKPKLE 394 Query: 418 DANSHLCDALRYGI 431 D ++H DA RY + Sbjct: 395 DKDNHAIDATRYAL 408 >gi|7569|lcl|protein:vir:96300 Length: 421 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240307;genbank:gi:66395973;genbank:GeneID :5133377 Length = 421 Score = 39.3 bits (90), Expect = 8e-05, Method: Compositional matrix adjust. 
Identities = 64/254 (25%), Positives = 109/254 (42%), Gaps = 30/254 (11%) Query: 188 RGGNWFKEFYAYGF--DDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKN--YFRQEYE 243 R +W + Y F D+T + H TY +NP I+EA +N +R EY Sbjct: 175 RKQSWVNKKYESSFQADNTFVH----HSTYLNNPFISKQFIQEAESAKKRNEQRYRWEYM 230 Query: 244 ADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLLGIDVGY-RDPTAVLTIKYHY 302 + ++ G + FN + + +G F + +D GY DP A +++HY Sbjct: 231 GE-AIGSGVV--PFNNLRIEEIPQGQYDTFDNIRN-----AVDFGYATDPLAF--VRWHY 280 Query: 303 DTD--TYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQDLAYEHEIAS 360 D Y ++E+ + + + A +++ Y+ D +F DSA + +L EH I Sbjct: 281 DKKKRVIYAMDEHYGVQISNREFANWLKK--KGYQSDEIFADSAEPKSIAELKQEHGIKK 338 Query: 361 APAKKSVLDGLA-CLQALFQQGKIIVDA-SCSSLIHALQNYKWDFQEGEEKLSREKPR-H 417 A K D + Q L I++D ++ +N D+Q +K KP+ Sbjct: 339 VKAVKKGADSVEYGEQWLDDLEAIVIDPRRTPNIAREFENI--DYQ--TDKDGNVKPKLE 394 Query: 418 DANSHLCDALRYGI 431 D ++H DA RY + Sbjct: 395 DKDNHAIDATRYAL 408 >gi|4983|lcl|protein:vir:95058 Length: 421 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240816;genbank:gi:66394679;genbank:GeneID :5133852 Length = 421 Score = 39.3 bits (90), Expect = 9e-05, Method: Compositional matrix adjust. 
Identities = 61/255 (23%), Positives = 112/255 (43%), Gaps = 32/255 (12%) Query: 188 RGGNWFKEFYAYGF--DDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKN--YFRQEYE 243 R +W + Y F D+T + H TY +NP I+EA +N +R EY Sbjct: 175 RKQSWVNKKYESSFQADNTYVH----HSTYLNNPFISKQFIQEAESAKKRNEQRYRWEYM 230 Query: 244 ADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLL-GIDVGY-RDPTAVLTIKYH 301 + ++ G + FN + ++++ ++ + F+ + +D GY DP A +++H Sbjct: 231 GE-AIGSGVV--PFNNL-RIEEIPQRQY-----DTFDNIRNAVDFGYATDPLAF--VRWH 279 Query: 302 YDTD--TYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQDLAYEHEIA 359 YD Y ++E+ + + + A +++ Y+ D VF DSA + +L EH I Sbjct: 280 YDKKKRVIYAMDEHYGVQISNREFANWLKK--KGYQSDEVFADSAEPKSIAELKQEHGIK 337 Query: 360 SAPAKKSVLDGLA-CLQALFQQGKIIVDA-SCSSLIHALQNYKWDFQEGEEKLSREKPR- 416 K D + Q L I++D ++ +N ++ +K KP+ Sbjct: 338 KIKGVKKGADSVEFGEQWLDDLDAIVIDPRRTPNIAREFENIDYE----TDKDGNVKPKL 393 Query: 417 HDANSHLCDALRYGI 431 D ++H DA RY + Sbjct: 394 EDKDNHTIDATRYAL 408 >gi|13903|lcl|protein:vir:9870 Length: 273 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795632;genbank:gi:28876409;genbank:GeneID :1257943 Length = 273 Score = 39.3 bits (90), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 69/269 (25%), Positives = 110/269 (40%), Gaps = 48/269 (17%) Query: 190 GNWFKEFYAYGFDDTLPNWVS-------------IHG-TYRDNPRADLNDIEEAR--RTV 233 G ++K FY Y +WV+ +H TY+DNP I EA R Sbjct: 8 GLFYKFFYTYNPPKRKQSWVNKKYESQFQPSNTFVHASTYKDNPFIAKEFIAEAEATRER 67 Query: 234 SKNYFRQEYEADFSVFEGQI-FDT--FNAI--DHVKDLKGMRHFFKDDEAFETLLGIDVG 288 S+ +R EY + ++ G + FD F I + V D +R+ GID G Sbjct: 68 SERRYRWEYLGE-AIGSGVVPFDNLRFERITDEQVADFDNIRN------------GIDYG 114 Query: 289 Y-RDPTAVLTIKYHYD--TDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAA 345 Y DP A +++HYD + Y ++EY + + Q A ++ Y+ D +F +SA Sbjct: 115 YATDPLAF--VRWHYDKKKNGIYAIDEYYGQKISNRQLAKWL--TTKGYQSDEMFAESAE 170 Query: 346 AQFRQDLAYEHEIASAPAKKSVLDGLACLQALFQQGKII-VDAS-CSSLIHALQNYKWDF 403 + +L E I K D + + I +D ++ +N D+ Sbjct: 171 PKSNAELKNEFGIKRIKGVKKGPDSVEFGERWLDDLDFICIDPKRTPNIAREFENI--DY 228 Query: 404 QEGEEKLSREKPR-HDANSHLCDALRYGI 431 Q ++ KPR D +H DA RY + Sbjct: 229 Q--VDRDGNPKPRLEDKVNHAIDATRYAM 255 >gi|20804|lcl|protein:vir:106290 Length: 633 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944105;genbank:gi:38640149;genbank:GeneID :2658038 Length = 633 Score = 38.9 bits (89), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 36/143 (25%), Positives = 57/143 (39%), Gaps = 10/143 (6%) Query: 124 IELANGSLFKLASAAQADSAVGRSYDFIIFDEAAISDVGGDAFRVQLRPTLDKPNSKALF 183 IEL NG A A+ D+ G S+ I DE A + D ++ L S+ + Sbjct: 250 IELENGCSIG-AYASSPDAVRGNSFALIYVDECAFIEGFEDTWKAILPVISSGRQSRIIL 308 Query: 184 ISTPRGGNWFKEFYAY------GFDDTLPNWVSIHGTYRDNPRADLNDIEEARRTV---S 234 STP G N + + + GF W+++ D A + E A + + S Sbjct: 309 TSTPNGINHWYDLWEVSLKSDKGFKPYTTTWITVKERLYDGSDAYDDGFEWASKQINSSS 368 Query: 235 KNYFRQEYEADFSVFEGQIFDTF 257 F+QE+ F G + + F Sbjct: 369 VEAFQQEHLCRFMGTSGTLINGF 391 >gi|6093|lcl|protein:vir:95761 Length: 432 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950582;genbank:gi:119953777;genbank:GeneI D:5076831 Length = 432 Score = 38.5 bits (88), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 59/235 (25%), Positives = 92/235 (39%), Gaps = 28/235 (11%) Query: 214 TYRDNPRADLNDIEEARRTVSKNYFRQEYEA--DFSVFEGQIFDTFNA--IDHVKDLKGM 269 TYR N D DI+ N R A D+ V EG +F+ + D V +K + Sbjct: 200 TYRVNEWLDQQDIDRYEDLWRTNPRRAAVVANGDWGVAEGLVFENYEVKDFDIVSTIKRI 259 Query: 270 RHFFKDDEAFETLLGIDVGY-RDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQ 328 ET G+D G+ DPT + + ++ E+ + TT I Sbjct: 260 G---------ETTAGLDFGFTHDPTTFPRLAVDLEKKELWIYAEHYEHAMTTDDIFKMIV 310 Query: 329 HCIDRYKVDRVFVDSAAAQFRQDL---AYEHEIASAPAKKSVLDGLACLQALFQQGKIIV 385 + V + DSA + +L + S K S+ G+ ++ Q KI + Sbjct: 311 DADMQNAV--ITADSAEQRLIAELQAKGIRRLVPSIKGKGSINAGIDFMK----QFKIYI 364 Query: 386 DASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHLCDALRYGI--YSISRGK 438 SC I Y + Q+ + K E D+N+H+ DA+RY + Y I K Sbjct: 365 HPSCIKTIEEFDTYIYK-QDKDGKWLNEPI--DSNNHIIDAIRYALERYHIQTSK 416 >gi|10483|lcl|protein:vir:105297 Length: 446 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950664;genbank:gi:119967835;genbank:GeneI D:4643176 Length = 446 Score = 37.7 bits (86), Expect = 2e-04, Method: Compositional matrix 
adjust. Identities = 60/272 (22%), Positives = 107/272 (39%), Gaps = 40/272 (14%) Query: 188 RGGNWFKEFYAYGF--DDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKN--YFRQEYE 243 R +W + Y F D+T + H TY DNP I+EA +N +R EY Sbjct: 174 RKQSWVNKKYETSFQPDNTFVH----HSTYLDNPFISKQFIQEAESAKERNEQRYRWEYM 229 Query: 244 AD-------------FSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEAFETLLGIDVGYR 290 + +++ +F+ I + D + + + L G R Sbjct: 230 GEAIGSGVVPFNNLQIEKIPDELYKSFDNIRNAVDFGLTKTAPLHSDVYSKLGEHISGVR 289 Query: 291 ------DPTAVLTIKYHYDTDT--YYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVD 342 DP A +++HYD Y ++E+ + + + A +++ Y+ D ++ D Sbjct: 290 KKACATDPLAF--VRWHYDKKKRIIYAVDEHYGVQISNREFANWLKRR--GYQSDEIYAD 345 Query: 343 SAAAQFRQDLAYEHEIASAPAKKSVLDGLA-CLQALFQQGKIIVDAS-CSSLIHALQNYK 400 SA + +L EH I K D + Q L I++D + ++ +N Sbjct: 346 SAEPKSIAELKQEHGIKRIKGVKKGPDSVEHGEQWLDDLTAIVIDPNRTPNIAREFENID 405 Query: 401 WDFQEGEEKLSREKPR-HDANSHLCDALRYGI 431 ++ +K KPR D ++H DA RY + Sbjct: 406 YE----TDKDGNVKPRLEDKDNHTIDATRYAL 433 >gi|18074|lcl|protein:vir:5954 Length: 422 # NCBI annotation: hypothetical protein # Family: family:all:54 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690654;genbank:geneid:6329138;genbank:gi: 22855048;goa:P54308;interpro:IPR006437;interpro:IPR00670 1;interpro:IPR011441;uniprot:P54308;genbank:GeneID:95525 4 Length = 422 Score = 37.0 bits (84), Expect = 4e-04, Method: Compositional matrix adjust. 
Identities = 53/251 (21%), Positives = 102/251 (40%), Gaps = 24/251 (9%) Query: 188 RGGNWFKEFYAYGFDDTLP-NWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQEYEADF 246 R +W + + F LP N H TY NP IEEA +N + +E Sbjct: 174 RKQSWVNKVFNSSF---LPANTFVDHSTYLQNPFLSKAFIEEAEEVKRRNELKYRHE--- 227 Query: 247 SVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEA--FETLL-GIDVGYRDPTAVLTIKYHYD 303 + G+ + + ++L+ D E F+ + G+D GY P + +++HYD Sbjct: 228 --YLGEALGS--GVVPFENLQIEEGIITDAEVARFDNIRQGLDFGY-GPDPLAFVRWHYD 282 Query: 304 --TDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQDLAYEHEIASA 361 + Y ++E + + + A +++ ++Y+ R+ DS+ + L EH I Sbjct: 283 KRKNRIYAIDELVDHKVSLKRTADFVRK--NKYESARIIADSSEPRSIDALKLEHGINRI 340 Query: 362 PAKKSVLDGLACLQALFQQ-GKIIVDA-SCSSLIHALQNYKWDFQEGEEKLSREKPRHDA 419 K D + + + I++D ++ +N + + + + R D Sbjct: 341 EGAKKGPDSVEHGERWLDELDAIVIDPLRTPNIAREFENIDYQTDKNGDPIPR---LEDK 397 Query: 420 NSHLCDALRYG 430 ++H DA RY Sbjct: 398 DNHTIDATRYA 408 >gi|360|lcl|protein:vir:344 Length: 482 # NCBI annotation: terminase # Family: family:all:147 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203458;genbank:gi:15320614;genbank:GeneID :921717 Length = 482 Score = 37.0 bits (84), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 26/72 (36%), Positives = 38/72 (52%), Gaps = 10/72 (13%) Query: 366 SVLDGLACLQALFQQGKIIVDASC--------SSLIHALQNYKWDFQEGEEKLSREKPRH 417 S+ DG+A +AL ++ I A C S + AL++Y++ + E + SRE P H Sbjct: 366 SLEDGIAAARALLER-DIRFHARCDVPQVAGLESGLEALRSYRYQYNEKLQTYSRE-PVH 423 Query: 418 DANSHLCDALRY 429 D SH DA RY Sbjct: 424 DWASHDADAFRY 435 >gi|13692|lcl|protein:vir:4897 Length: 411 # NCBI annotation: gp411 # Family: family:all:54 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056675;genbank:gi:9635008;genbank:GeneID: 1262679 Length = 411 Score = 37.0 bits (84), Expect = 5e-04, Method: Compositional matrix adjust. 
Identities = 45/205 (21%), Positives = 90/205 (43%), Gaps = 21/205 (10%) Query: 232 TVSKNYFRQEYEADFSVFEGQIFDTFNAIDHVKD-LKGMRHFFKDDEAFETLLGIDVGYR 290 T ++ ++ ++V EG I+ +++ HV D L M+ +F GID GY Sbjct: 205 TPKGKFYDRDILGHWTVAEGAIYADYDSKIHVVDELPEMKRYFG---------GIDWGYT 255 Query: 291 DPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQ 350 +++ + D + +Y+++ + K + Y + DSA + Sbjct: 256 HYGSIVIVGEGVDNN-FYLVDGVRAQFKEIDWWVEQARKLTGIYGNIPFYADSARPE--H 312 Query: 351 DLAYEHE-IASAPAKKSVLDGLACLQALFQQGKIIVDAS-CSSLIHALQNYKWDFQEGEE 408 +E+E + A KSV+ G+ + LF++ K+ V + Y+W +E Sbjct: 313 VARFENEGFDISNANKSVIAGIELIAKLFKEQKLYVKRGFVPRFFDEIYQYRW-----KE 367 Query: 409 KLSREKPRHDANSHLCDALRYGIYS 433 ++++P + + L D++RY IYS Sbjct: 368 NSTKDEPLKEFDDVL-DSVRYAIYS 391 >gi|16780|lcl|protein:vir:2731 Length: 411 # NCBI annotation: hypothetical protein # Family: family:all:54 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695104;genbank:gi:23455873;genbank:GeneID :955606 Length = 411 Score = 35.4 bits (80), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 44/191 (23%), Positives = 83/191 (43%), Gaps = 21/191 (10%) Query: 246 FSVFEGQIFDTFNAIDHVKD-LKGMRHFFKDDEAFETLLGIDVGYRDPTAVLTIKYHYDT 304 ++V EG I+ +++ HV D L M+ +F GID GY +++ + D Sbjct: 219 WTVAEGAIYADYDSKIHVVDELPEMKRYFG---------GIDWGYTHYGSIVIVGEGVDN 269 Query: 305 DTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQDLAYEHE-IASAPA 363 + +Y+++ K + Y + DSA + +E+E A Sbjct: 270 N-FYLVDGVAAQFKEIDWWVEQARKLTGIYGNIPFYADSARPE--HVARFENEGFDIMNA 326 Query: 364 KKSVLDGLACLQALFQQGKIIVDAS-CSSLIHALQNYKWDFQEGEEKLSREKPRHDANSH 422 KSV+ G+ + LF++ K+ V + Y+W +E ++++P + + Sbjct: 327 NKSVIAGIELIAKLFKEKKLYVKRGFVPRFFDEIYQYRW-----KENSTKDEPLKEFDDV 381 Query: 423 LCDALRYGIYS 433 L D++RY IYS Sbjct: 382 L-DSVRYAIYS 391 >gi|8015|lcl|protein:vir:96495 Length: 195 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238487;genbank:gi:66391763;genbank:GeneID :5176917 Length = 195 Score = 35.4 bits (80), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 44/190 (23%), Positives = 81/190 (42%), Gaps = 19/190 (10%) Query: 246 FSVFEGQIFDTFNAIDHVKD-LKGMRHFFKDDEAFETLLGIDVGYRDPTAVLTIKYHYDT 304 ++V EG I+ +++ HV D L M+ F GID GY +++ + D Sbjct: 3 WTVAEGAIYADYDSKIHVVDELPEMKRCFG---------GIDWGYTHYGSIVVVGEGVDG 53 Query: 305 DTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVDSAAAQFRQDLAYEHEIASAPAK 364 + +Y+L+ K + Y+ + DSA + E + A Sbjct: 54 N-FYLLDGVAAQFKEIDWWVEQARKLTGIYRNIPFYADSARPEHVARFESEG-FDISNAN 111 Query: 365 KSVLDGLACLQALFQQGKIIVDAS-CSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHL 423 KSV+ G+ + LF++ K+ V + Y+W +E ++++P + + L Sbjct: 112 KSVIAGIELIAKLFKEEKLYVKRGFVPRFFDEIYQYRW-----KENSTKDEPLKEFDDVL 166 Query: 424 CDALRYGIYS 433 D++RY IYS Sbjct: 167 -DSVRYAIYS 175 >gi|8305|lcl|protein:vir:96700 Length: 689 # NCBI annotation: putative phage terminase large subunit # Family: family:all:140 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039815;genbank:gi:126010850;genbank:Ge neID:5076210 Length = 689 Score = 35.0 bits (79), Expect = 0.002, Method: Compositional matrix adjust. Identities = 32/138 (23%), Positives = 53/138 (38%), Gaps = 13/138 (9%) Query: 26 PVSGILAQEGITPNGPQIAIINALEDPRHRFVTACVSRRVGKSFIAYTLGFLKLLEPNVK 85 P S I N I I++A +P++ VT + ++GKS L +L + Sbjct: 67 PTSPIPGPFNPDTNPYMIPIVSAFANPQYNRVTFVMGTQMGKSVSMENLVGWRLDDDPTP 126 Query: 86 VLVVAPNYSLANIG--------WSQIRGLIKKYGLQTERENAKDKEIELANGSLFKLASA 137 ++ VAP +L + + Q L +KY N K + G+ F+ A A Sbjct: 127 IMYVAPTSNLIDTTVEPKFMDMFQQAESLARKYDW-----NRSTKYTKWVGGTKFRFAWA 181 Query: 138 AQADSAVGRSYDFIIFDE 155 S ++ DE Sbjct: 182 GSPTELAADSAGLVLVDE 199 >gi|15673|lcl|protein:vir:1151 Length: 594 # NCBI annotation: predicted DNA-dependent ATPase terminase subunit # Family: family:all:169 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490600;genbank:gi:17313220;genbank:GeneID :927317 Length = 594 Score = 33.5 bits (75), Expect = 0.005, Method: Compositional matrix adjust. 
Identities = 31/122 (25%), Positives = 51/122 (41%), Gaps = 8/122 (6%) Query: 307 YYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRVFVD-----SAAAQF-RQDLAYEHEIAS 360 + VLE +Q K A+ A +I+ RY V + VD S AQ RQ + Sbjct: 444 FRVLERHQFRGKDFAEQAEFIRKVTQRYWVTYIGVDTTGMGSGVAQLVRQFFPGVRTFSY 503 Query: 361 APAKKSVLDGLACLQALFQQGKIIVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDAN 420 +P K+ L + ++ + G++ DA + L AL + G + + R+D Sbjct: 504 SPEVKTQL--VMKAWSVIKNGRLEFDAGWTDLAQALMAIRKTITAGGRQFTYTAGRNDNT 561 Query: 421 SH 422 H Sbjct: 562 GH 563 >gi|18252|lcl|protein:vir:4193 Length: 467 # NCBI annotation: putative large terminase subunit # Family: family:all:144 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071818;genbank:gi:11863101;genbank:GeneID :1257603 Length = 467 Score = 33.5 bits (75), Expect = 0.005, Method: Compositional matrix adjust. Identities = 35/126 (27%), Positives = 55/126 (43%), Gaps = 17/126 (13%) Query: 145 GRSYDFIIFDEAAISDVGGDAFRVQ---LRPTLDKPNSKALFISTPRGG---NWFKEFYA 198 G SY +I FDE +++ +R LR D P ++ GG W K + Sbjct: 165 GSSYHYIAFDE--LTEFLESQYRFMFRSLRKEADDPIPLRFRATSNPGGIGHEWVKTRFI 222 Query: 199 YGFDDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQEYEADFSV-FEGQIF--D 255 G +P+ T+R+NP + ++ EEA + RQ E D+ V +G +F + Sbjct: 223 TGEKTFIPS------TWRENPYLNRDEYEEALNMLDHVTRRQLKEGDWDVSIQGGVFRRE 276 Query: 256 TFNAID 261 F ID Sbjct: 277 WFEIID 282 >gi|13374|lcl|protein:vir:9262 Length: 517 # NCBI annotation: gp2 # Family: family:all:1730 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720326;genbank:gi:24371583;genbank:GeneID :955800 Length = 517 Score = 32.0 bits (71), Expect = 0.015, Method: Compositional matrix adjust. 
Identities = 38/173 (21%), Positives = 70/173 (40%), Gaps = 28/173 (16%) Query: 273 FKDDEAFETLLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCID 332 F+ + F + D G+ P A + + + D D +Y+ ++++E T Q ++ + Sbjct: 326 FECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGAVKSWAN 385 Query: 333 RYKV--------------DRVFVDSAAAQFRQDLAYEHEIASAP-AKKSVLDGLACLQAL 377 + V +++ A A F + EH A+ P SV G+ L+ L Sbjct: 386 KIPVAWPHDGHQHEKGGGEQLKTQYADAGF--SMLPEH--ATFPDGGNSVESGIGELRDL 441 Query: 378 FQQGKIIVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHLCDALRYG 430 +G+ V +C + Y D + G K+ + N + DA RYG Sbjct: 442 MLEGRFKVFNTCEPFFEEFRLYHRD-ENG--KIVK------TNDDVLDATRYG 485 >gi|16077|lcl|protein:vir:4155 Length: 468 # NCBI annotation: large terminase subunit # Family: family:all:144 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046964;genbank:gi:9630534;genbank:GeneID: 1261708 Length = 468 Score = 32.0 bits (71), Expect = 0.017, Method: Compositional matrix adjust. Identities = 33/126 (26%), Positives = 54/126 (42%), Gaps = 17/126 (13%) Query: 145 GRSYDFIIFDEAAISDVGGDAFRVQLRPTLDKPNS----KALFISTPRG--GNWFKEFYA 198 G SY +I FDE +++ +R R + N + S P G W K + Sbjct: 165 GSSYHYIAFDE--LTEFMETQYRFMFRSLRKEVNDHIPLRVRATSNPGGIGHEWVKTRFI 222 Query: 199 YGFDDTLPNWVSIHGTYRDNPRADLNDIEEARRTVSKNYFRQEYEADFSV-FEGQIF--D 255 G +P+ T+R+NP + ++ EEA + RQ + D+ V +G +F + Sbjct: 223 TGEKTFIPS------TWRENPYLNRDEYEEALNMLDHVTRRQLKDGDWDVTLQGGVFKRE 276 Query: 256 TFNAID 261 F ID Sbjct: 277 WFEVID 282 >gi|155|lcl|protein:vir:77595 Length: 499 # NCBI annotation: terminase large subunit # Family: family:all:1730 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063734;genbank:gi:51236724;genbank:GeneID :2944239 Length = 499 Score = 30.8 bits (68), Expect = 0.037, Method: Compositional matrix adjust. 
Identities = 37/173 (21%), Positives = 71/173 (41%), Gaps = 28/173 (16%) Query: 273 FKDDEAFETLLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCID 332 F+ + F + D G+ P A + + + D D +Y+ ++++E T Q ++ + Sbjct: 308 FECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGAVKSWAN 367 Query: 333 RYKV--------------DRVFVDSAAAQFRQDLAYEHEIASAP-AKKSVLDGLACLQAL 377 + V +++ A A F + +H A+ P SV G++ L+ L Sbjct: 368 KIPVAWPHDGHQHEKGGGEQLKTQYADAGF--SMLPDH--ATFPDGGNSVESGISELRDL 423 Query: 378 FQQGKIIVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHLCDALRYG 430 +G+ V +C + Y D + G K+ + N + DA RYG Sbjct: 424 MLEGRFKVFNTCEPFFEEFRLYHRD-ENG--KIVK------TNDDVLDATRYG 467 >gi|3734|lcl|protein:vir:94555 Length: 585 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919024;genbank:gi:119637788;genbank:GeneI D:5179315 Length = 585 Score = 30.4 bits (67), Expect = 0.048, Method: Compositional matrix adjust. Identities = 41/173 (23%), Positives = 63/173 (36%), Gaps = 26/173 (15%) Query: 38 PNGPQIAIINALEDPRHRFVTACVSRRVGKSFIAYTLGFLKLL-EPNVKVLVVAPNYSLA 96 P QI + L D H+ R +GKSFI L +P +KVL+V+ + A Sbjct: 35 PTKCQIDMARTLADGDHKKFILQAFRGIGKSFITCAFVVWVLWRDPQLKVLIVSASKERA 94 Query: 97 NIGWSQIRGLIKKYGLQTE---RENAKDKEIELANGSLFK------LASAAQADSAVGRS 147 + I+ +I E R +D I G L K + S G Sbjct: 95 DANSIFIKNIIDLLPFLAELKPRPGQRDSVISFDVG-LAKPDHSPSVKSVGITGQLTGSR 153 Query: 148 YDFIIFDEAAISDVGGDA------------FRVQLRPTLDKPNSKALFISTPR 188 D II D+ + + F L+P P S+ +++ TP+ Sbjct: 154 ADIIIADDVEVPGNSSTSSAREKLWTLVTEFAALLKPL---PTSRVIYLGTPQ 203 >gi|10799|lcl|protein:vir:78055 Length: 440 # NCBI annotation: TerL # Family: family:all:54 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468786;genbank:gi:157325367;genbank:Ge neID:5601817 Length = 440 Score = 30.4 bits (67), Expect = 0.049, Method: Compositional matrix adjust. 
Identities = 53/217 (24%), Positives = 83/217 (38%), Gaps = 33/217 (15%) Query: 226 IEEARRTVSKNYFRQEYEADFSVFEGQIFDTFNAI------DHVKDLKGMRHFFKDDEAF 279 +E AR KN R E+E F G+ +T N + +H+ F + Sbjct: 235 LESARLVRDKNPNRYEWE-----FLGRNVNTGNEVFPNAVQEHIT--------FDMIDGL 281 Query: 280 ETLLGIDVGYR-DPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCID-RYKVD 337 G D GY DP+ L + Y DT Y+ +E T A I + + Y + Sbjct: 282 RPYEGFDEGYTADPSVWLRVFYDEQRDTVYITDELVMKRYKTKALAKDILNVQEGSYNIV 341 Query: 338 RVFVDSAAAQF---RQDLAYEHEIASAPAKKSVLDGLACLQALFQQGKIIVDASCSSLIH 394 R DSA + +DL + +A + + SV G L + KI++D C + Sbjct: 342 R--GDSANPRVLDEMRDLGV-NALAVSKSPNSVPHGTNWLA---NRIKIVIDFKCPNTWR 395 Query: 395 ALQNYKWDFQEGEEKLSREKPRHDANSHLCDALRYGI 431 +Y +G P D ++H D RY + Sbjct: 396 EFSSYAL-LPDGVGNRKHGFP--DKDNHTIDTTRYAL 429 >gi|5978|lcl|protein:vir:95489 Length: 659 # NCBI annotation: Putative large subunit (GpA homolog) of DNA packaging dimer # Family: family:all:140 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293346;genbank:gi:148912767;genbank:Ge neID:5228141 Length = 659 Score = 30.0 bits (66), Expect = 0.052, Method: Compositional matrix adjust. Identities = 38/135 (28%), Positives = 60/135 (44%), Gaps = 16/135 (11%) Query: 34 EGITPNGP-QIAIINALEDPRHRFVTACVSRRVG-KSFIAYTLGFLKLLEPNVKVLVVAP 91 EG P Q+AI+NA+ + R V S R+G + +G+ K+ VL+ +P Sbjct: 51 EGKWKTAPFQVAILNAMGNDLIRVVNFVKSARIGYTKMLMANIGY-KIQHKRRNVLMWSP 109 Query: 92 NYSLA-NIGWSQIRGLIKK----------YGLQTERENAKDKEIELANGSLFKLASAAQA 140 A I S + GLI+ YG + +N D ++ +L+ L A A Sbjct: 110 TDPDAEGISKSHVNGLIRDVPVLLALAPWYG-RKHSDNTLDTKVFANRRTLWTLGGKA-A 167 Query: 141 DSAVGRSYDFIIFDE 155 + RS D +I+DE Sbjct: 168 RNYRERSADEVIYDE 182 >gi|11736|lcl|protein:vir:79016 Length: 417 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110720;genbank:gi:134287337;genbank:Ge neID:4955190 Length = 417 Score = 29.6 bits (65), Expect = 0.073, Method: Compositional matrix adjust. 
Identities = 72/275 (26%), Positives = 112/275 (40%), Gaps = 38/275 (13%) Query: 169 QLRPTLDKPN--SKALFISTP-RGGNWFK-EFYAYGFDDTLPNWVSIHGTYRDNPRADLN 224 +LR L PN + F P +W K +++ Y DD + H TY N D Sbjct: 156 RLRGILTNPNLYYQMTFTFNPVSATHWIKRKYFDYKNDDIFTH----HSTYLQNRFID-- 209 Query: 225 DIEEARRTVSKNYFRQEYEAD-FSVFE-GQIFDTFNAIDHVKDLKG--MRHFFKDDEAFE 280 E R + R+E + + + V+ G+ +T AI LK + F + E F+ Sbjct: 210 --EAYYRRMQ---MRKEQDPEGYKVYGLGEWGETGGAI-----LKNYVIHEFPTESEYFD 259 Query: 281 TL-LGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVDRV 339 + L D G+ VL I + D + Y E Y A + I + I K + Sbjct: 260 NMRLSQDFGFNHANVVLRIGFK-DGELYICNEIY--AHEMDTSEIIKIANSIGLEKTLFM 316 Query: 340 FVDSAAAQFRQDLAYEHEIASAPAKKSVLDGLACLQA---LFQQGKIIVDASCSSLIHAL 396 + DSA D + A AK V G ++A +Q +I V SC++ I + Sbjct: 317 YCDSAEP----DRIKMWKSAGYKAK-GVKKGPGSVKAQIDYLKQLRIHVHPSCTNTIKEI 371 Query: 397 QNYKWDFQEGEEKLSREKPRHDANSHLCDALRYGI 431 Q +KW Q+ L ++P + + ALRY I Sbjct: 372 QQWKWK-QDERTGLYLDEPVEFMDDAMA-ALRYSI 404 >gi|15412|lcl|protein:vir:1325 Length: 519 # NCBI annotation: gp33 # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047924;swissprot:trembl:q9t214;genbank:gi :9631142;uniprot:Q9T214;genbank:GeneID:2715903 Length = 519 Score = 28.9 bits (63), Expect = 0.12, Method: Compositional matrix adjust. Identities = 12/28 (42%), Positives = 17/28 (60%) Query: 53 RHRFVTACVSRRVGKSFIAYTLGFLKLL 80 +HR V CV+R+ GKS IA + L+ Sbjct: 74 KHRTVVVCVARKNGKSTIAAAIMLYHLI 101 >gi|3300|lcl|protein:vir:100942 Length: 499 # NCBI annotation: Gp2 # Family: family:all:1730 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006405;genbank:gi:46358697;genbank:GeneID :2777092 Length = 499 Score = 28.9 bits (63), Expect = 0.12, Method: Compositional matrix adjust. 
Identities = 36/173 (20%), Positives = 70/173 (40%), Gaps = 28/173 (16%) Query: 273 FKDDEAFETLLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCID 332 F+ + F + D G+ P A + + + D D +Y+ ++++E T Q ++ + Sbjct: 308 FECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSENTAVQAWGAVKSWAN 367 Query: 333 RYKV--------------DRVFVDSAAAQFRQDLAYEHEIASAP-AKKSVLDGLACLQAL 377 + V +++ A A F + +H A+ P SV G++ L+ L Sbjct: 368 KIPVAWPHDGHQHEKGGGEQLKTQYADAGF--SMLPDH--ATFPDGGNSVESGISELRDL 423 Query: 378 FQQGKIIVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHLCDALRYG 430 +G+ +C + Y D + G K+ + N + DA RYG Sbjct: 424 MLEGRFKAFNTCEPFFEEFRLYHRD-ENG--KIVK------TNDDVLDATRYG 467 >gi|15774|lcl|protein:vir:6239 Length: 527 # NCBI annotation: gp33 # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813693;swissprot:trembl:q859c4;genbank:gi :29366753;interpro:IPR005021;uniprot:Q859C4;genbank:Gene ID:1258893 Length = 527 Score = 28.9 bits (63), Expect = 0.13, Method: Compositional matrix adjust. Identities = 12/28 (42%), Positives = 17/28 (60%) Query: 53 RHRFVTACVSRRVGKSFIAYTLGFLKLL 80 +HR V CV+R+ GKS IA + L+ Sbjct: 77 KHRTVVVCVARKNGKSTIAAAIMLYHLI 104 >gi|19539|lcl|protein:vir:10320 Length: 627 # NCBI annotation: ORF22 # Family: family:all:140 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758915;genbank:gi:27311189;genbank:GeneID :956138 Length = 627 Score = 28.5 bits (62), Expect = 0.17, Method: Compositional matrix adjust. 
Identities = 44/155 (28%), Positives = 66/155 (42%), Gaps = 11/155 (7%) Query: 42 QIAIINALEDPRHRFVTACVSRRVGKSFIA-YTLGFLKLLEPNVKVLVVAPNYSLANIGW 100 QI I +A+ DP VT S RVG + I +G+ +P +LVV P A G+ Sbjct: 52 QIGIADAMCDPEEERVTVMKSMRVGYTKIVDLAIGYYMDADP-CSMLVVQPTIDDAE-GF 109 Query: 101 S--QIRGLIKKY-GLQTERENAKDKEI-ELANGSLFKLASAAQADSAVGRSYDFIIFDE- 155 S +I +++ LQ + + D + ++ G L A + +IFDE Sbjct: 110 SKDEIAPMLRDVPCLQGKVQRDDDTLLKKVYPGGSLTLVGANSPTGFRRLTVRIVIFDEM 169 Query: 156 -AAISDVG--GDAFRVQLRPTLDKPNSKALFISTP 187 A ++ G GD R T N K + STP Sbjct: 170 SAYPANTGKDGDPVRQGEGRTFSAFNRKIIAGSTP 204 >gi|18586|lcl|protein:vir:8649 Length: 564 # NCBI annotation: gp7 # Family: family:all:144 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817768;genbank:gi:29566200;genbank:GeneID :1259404 Length = 564 Score = 28.5 bits (62), Expect = 0.18, Method: Compositional matrix adjust. Identities = 28/109 (25%), Positives = 43/109 (39%), Gaps = 20/109 (18%) Query: 66 GKSFIAYTLGFLKLLE--PNVKVLVVAPNYSLANIGWSQIRGLIK--------------- 108 GKS +A L+ L+ PN ++++ LA+ + R LIK Sbjct: 101 GKSTMASVYTVLRALQLNPNARIILACYGQDLAHGHSRKCRDLIKRHGSGVRDAMTGAQI 160 Query: 109 --KYGLQTERENAKDKEIELANGSLFKLASAAQADSAVGRSYDFIIFDE 155 K GL+ ER K E + GS L + + G+ D I D+ Sbjct: 161 EDKLGLKLERGANKVSEWSIEGGS-GGLVATGLGGTITGKPADLFIIDD 208 >gi|7375|lcl|protein:vir:99083 Length: 566 # NCBI annotation: gp7 # Family: family:all:144 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655687;genbank:gi:109521765;genbank:GeneI D:4157805 Length = 566 Score = 27.3 bits (59), Expect = 0.36, Method: Compositional matrix adjust. 
Identities = 27/109 (24%), Positives = 43/109 (39%), Gaps = 20/109 (18%) Query: 66 GKSFIAYTLGFLKLLE--PNVKVLVVAPNYSLANIGWSQIRGLIK--------------- 108 GKS +A L+ L+ PN ++++ LA+ + R LIK Sbjct: 103 GKSTMASVYTVLRALQLNPNARIILACYGQDLAHGHSRKCRDLIKRHGSGVRDAMTGAQI 162 Query: 109 --KYGLQTERENAKDKEIELANGSLFKLASAAQADSAVGRSYDFIIFDE 155 K GL+ ER K E + G+ L + + G+ D I D+ Sbjct: 163 EDKLGLKLERGANKVSEWSIEGGT-GGLVATGLGGTITGKPADLFIIDD 210 >gi|2819|lcl|protein:vir:105705 Length: 439 # NCBI annotation: gp2 # Family: family:all:54 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224140;genbank:gi:62362215;genbank:GeneID :3342458 Length = 439 Score = 26.9 bits (58), Expect = 0.44, Method: Compositional matrix adjust. Identities = 24/75 (32%), Positives = 33/75 (44%), Gaps = 9/75 (12%) Query: 359 ASAPAKKSVLDGLACLQALFQQGKIIVDASCSSLIHA--LQNYKWDFQEGEEKLSREKPR 416 A+ + SV DG+ L+ KII+ C L +YK D GE E Sbjct: 350 AAQKWQGSVEDGITFLRGF---KKIIIHPRCKETAKEARLYSYKTDRITGEVLPIIE--- 403 Query: 417 HDANSHLCDALRYGI 431 D N+H D +RYG+ Sbjct: 404 -DKNNHCWDGIRYGL 417 >gi|7834|lcl|protein:vir:102402 Length: 807 # NCBI annotation: gp6 # Family: family:all:144 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655283;genbank:gi:109521846;genbank:GeneI D:4157717 Length = 807 Score = 26.9 bits (58), Expect = 0.47, Method: Compositional matrix adjust. 
Identities = 34/151 (22%), Positives = 64/151 (42%), Gaps = 26/151 (17%) Query: 41 PQIAII-NALED----PRHRFVTACVSRRVGKSFIAYTLGFLKLLE--PNVKVLVVAPNY 93 P + II +A+ED PR + + GKS + ++ L+ PN ++++ Sbjct: 60 PALRIISDAIEDVLRYPRCNLLVTMPPQE-GKSTMCAVWTPIRALQLNPNRRIILATYGD 118 Query: 94 SLANIGWSQIRGLIKKY-----------------GLQTERENAKDKEIELANGSLFKLAS 136 SLA+ + R LI +Y GL+ + AK + +G++ + + Sbjct: 119 SLADQHSTTARDLIMRYGTGVTDALTGLAVEDKLGLKINPKQAKVSSWRI-DGAIGGMVA 177 Query: 137 AAQADSAVGRSYDFIIFDEAAISDVGGDAFR 167 A + G+S D I D+ + + D+ R Sbjct: 178 AGLGSAITGKSADLFIIDDPFKNMIEADSTR 208 >gi|10934|lcl|protein:vir:78195 Length: 601 # NCBI annotation: gp4, phage terminase, ATPase subunit # Family: family:all:169 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111154;genbank:gi:134288710;genbank:Ge neID:4960655 Length = 601 Score = 26.6 bits (57), Expect = 0.70, Method: Compositional matrix adjust. Identities = 52/226 (23%), Positives = 80/226 (35%), Gaps = 38/226 (16%) Query: 222 DLNDIEEARRTVSKNYFRQEYEADFSVFEGQIFDTFNAIDHVKDLKGMR----HFFKDDE 277 DL DI+E RR S F ++ Q D ++ + DL+ + DD Sbjct: 358 DLFDIDELRREYSAEEFA-------NLLMCQFIDDSLSVFKLSDLQRCMVDSWEEWADD- 409 Query: 278 AFETLLGIDVGYR------DPTA-------VLTIKYHYDTDTYYVLEEYQQAEKTTAQHA 324 F LL GYR DP V+ D + VLE +Q + A Sbjct: 410 -FSPLLLRPFGYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQA 468 Query: 325 AYIQHCIDRYKVDRVFVDSAAAQ------FRQDLAYEHEIASAPAKKS--VLDGLACLQA 376 A I+ RY V + +D+ R+ + +P K+ VL G Q+ Sbjct: 469 AAIEAITQRYNVGYIAIDTTGMGQGVYQLVRKFFPAAVALNYSPEVKTRLVLKG----QS 524 Query: 377 LFQQGKIIVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSH 422 + + G++ DA + L A K + + R D H Sbjct: 525 VVRNGRLQFDAGWTDLAAAFMAIKQTMTASGRQATYTAGRTDETGH 570 >gi|9890|lcl|protein:vir:103970 Length: 601 # NCBI annotation: phage terminase ATPase subunit # Family: family:all:169 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293751;genbank:gi:72537721;genbank:GeneID :3608097 Length = 601 Score = 26.6 bits (57), Expect = 0.70, Method: Compositional 
matrix adjust. Identities = 52/226 (23%), Positives = 80/226 (35%), Gaps = 38/226 (16%) Query: 222 DLNDIEEARRTVSKNYFRQEYEADFSVFEGQIFDTFNAIDHVKDLKGMR----HFFKDDE 277 DL DI+E RR S F ++ Q D ++ + DL+ + DD Sbjct: 358 DLFDIDELRREYSAEEFA-------NLLMCQFIDDSLSVFKLSDLQRCMVDSWEEWADD- 409 Query: 278 AFETLLGIDVGYR------DPTA-------VLTIKYHYDTDTYYVLEEYQQAEKTTAQHA 324 F LL GYR DP V+ D + VLE +Q + A Sbjct: 410 -FSPLLLRPFGYREVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQA 468 Query: 325 AYIQHCIDRYKVDRVFVDSAAAQ------FRQDLAYEHEIASAPAKKS--VLDGLACLQA 376 A I+ RY V + +D+ R+ + +P K+ VL G Q+ Sbjct: 469 AAIEAITQRYNVGYIAIDTTGMGQGVYQLVRKFFPAAVALNYSPEVKTRLVLKG----QS 524 Query: 377 LFQQGKIIVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSH 422 + + G++ DA + L A K + + R D H Sbjct: 525 VVRNGRLQFDAGWTDLAAAFMAIKQTMTASGRQATYTAGRTDETGH 570 >gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: terminase large subunit # Family: family:all:4877 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429607;genbank:gi:156564098;genbank:Ge neID:5525534 Length = 635 Score = 25.8 bits (55), Expect = 0.97, Method: Compositional matrix adjust. 
Identities = 61/276 (22%), Positives = 100/276 (36%), Gaps = 75/276 (27%) Query: 61 VSRRVGKS-FIAYTLGFLKLLEPN------VKVLVVAPNYSLANIGWSQIRGLIKKYGLQ 113 + RR+GK+ + + + +PN +L++AP ++ + ++ LI G Sbjct: 89 LGRRLGKTETMCIMILWHAFTQPNKGPNNQYDILIIAPYEEQVDLIFKRLSQLIDMSGDV 148 Query: 114 TERENAKDKEIELANGSLFKLASAAQ-----ADSAVGRSYDFIIFDEAAISDVGGDAFRV 168 + DK IEL NG++ +A A + G+ D I+ DE D G++ Sbjct: 149 NPSRDI-DKHIELPNGTVIHGITAGSKSGSGAANTRGQRADLIVLDEM---DYMGESEIT 204 Query: 169 QLRPTLDKPNS--KALFISTPRG--GNWFK----EFYAYGFDDTLP-------------- 206 + ++ K + STP G +++K Y DD L Sbjct: 205 NIMNIRNEAPERIKMIVASTPSGRRDSYYKWCVGATKTYAQDDELTRQNGGRVTYNVKMK 264 Query: 207 -----------------------NWVSIHGT-------YRDNPRADLNDIEEARRTVSKN 236 W +IH + NP L IEE R +++ Sbjct: 265 PWKTLADGSYELNQLGDKMREGNGWTTIHAPSTVNPELLKVNPDTGLTYIEELRLDLTEM 324 Query: 237 YFRQEYEADF-----SVFEGQIFDTFNAIDHVKDLK 267 F QE A+F VF + D AI H + LK Sbjct: 325 RFIQEVMAEFGESISGVFLKKHIDI--AISHGERLK 358 >gi|10061|lcl|protein:vir:99071 Length: 537 # NCBI annotation: gp26 # Family: family:all:523 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655891;genbank:gi:109521463;genbank:GeneI D:4158036 Length = 537 Score = 24.6 bits (52), Expect = 2.1, Method: Compositional matrix adjust. Identities = 9/27 (33%), Positives = 16/27 (59%) Query: 75 GFLKLLEPNVKVLVVAPNYSLANIGWS 101 G+ +++ P V + AP+ SL I W+ Sbjct: 339 GWSRIVTPTVVAIDSAPDSSLTTISWA 365 >gi|17621|lcl|protein:vir:816 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050549;genbank:gi:9633446;genbank:GeneID: 1262208 Length = 568 Score = 24.3 bits (51), Expect = 2.8, Method: Compositional matrix adjust. 
Identities = 11/25 (44%), Positives = 15/25 (60%) Query: 147 SYDFIIFDEAAISDVGGDAFRVQLR 171 S + IIFDE+ VGGD + + R Sbjct: 196 SDECIIFDESTAEGVGGDFYEMSNR 220 >gi|25680|lcl|protein:vir:10116 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859246;genbank:gi:32171173;genbank:GeneID :2653342 Length = 568 Score = 24.3 bits (51), Expect = 2.8, Method: Compositional matrix adjust. Identities = 11/25 (44%), Positives = 15/25 (60%) Query: 147 SYDFIIFDEAAISDVGGDAFRVQLR 171 S + IIFDE+ VGGD + + R Sbjct: 196 SDECIIFDESTAEGVGGDFYEMSNR 220 >gi|51|lcl|protein:vir:3295 Length: 568 # NCBI annotation: putative large subunit terminase # Family: family:all:147 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049511;genbank:gi:9632517;genbank:GeneID: 1262002 Length = 568 Score = 24.3 bits (51), Expect = 2.8, Method: Compositional matrix adjust. Identities = 11/25 (44%), Positives = 15/25 (60%) Query: 147 SYDFIIFDEAAISDVGGDAFRVQLR 171 S + IIFDE+ VGGD + + R Sbjct: 196 SDECIIFDESTAEGVGGDFYEMSNR 220 >gi|26240|lcl|protein:vir:2763 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612880;genbank:gi:20065962;genbank:GeneID :935714 Length = 568 Score = 24.3 bits (51), Expect = 2.8, Method: Compositional matrix adjust. Identities = 11/25 (44%), Positives = 15/25 (60%) Query: 147 SYDFIIFDEAAISDVGGDAFRVQLR 171 S + IIFDE+ VGGD + + R Sbjct: 196 SDECIIFDESTAEGVGGDFYEMSNR 220 >gi|25849|lcl|protein:vir:9949 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859079;genbank:gi:32171001;genbank:GeneID :2653280 Length = 568 Score = 24.3 bits (51), Expect = 2.8, Method: Compositional matrix adjust. 
Identities = 11/25 (44%), Positives = 15/25 (60%) Query: 147 SYDFIIFDEAAISDVGGDAFRVQLR 171 S + IIFDE+ VGGD + + R Sbjct: 196 SDECIIFDESTAEGVGGDFYEMSNR 220 >gi|1476|lcl|protein:vir:104436 Length: 568 # NCBI annotation: putative terminase large subunit # Family: family:all:147 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794060;genbank:gi:116222005;genbank:GeneI D:4397501 Length = 568 Score = 24.3 bits (51), Expect = 2.8, Method: Compositional matrix adjust. Identities = 11/25 (44%), Positives = 15/25 (60%) Query: 147 SYDFIIFDEAAISDVGGDAFRVQLR 171 S + IIFDE+ VGGD + + R Sbjct: 196 SDECIIFDESTAEGVGGDFYEMSNR 220 >gi|10658|lcl|protein:vir:268 Length: 605 # NCBI annotation: putative terminase, ATPase subunit # Family: family:all:169 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536648;genbank:gi:17975126;genbank:GeneID :929082 Length = 605 Score = 24.3 bits (51), Expect = 2.9, Method: Compositional matrix adjust. Identities = 32/128 (25%), Positives = 50/128 (39%), Gaps = 8/128 (6%) Query: 222 DLNDIEEARRTVSKNYFRQEYEADFSVFEGQIFDTFNAIDHVKDLKGMRHFFKDDEA--- 278 DL DIEE R S+ F + F IF+ FN I+ + +K + A Sbjct: 364 DLFDIEELREEYSETDFNNLFMCVFVDGASSIFE-FNKIERCMVDSDIWQDYKPNAARPF 422 Query: 279 --FETLLGIDVGYRDPTAVLTIKYH--YDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRY 334 E LG D AVL + + + VLE++ + A+ I +R+ Sbjct: 423 GSREVWLGYDPSRTRDNAVLMVVAPPIVAVEKFRVLEKHTWRGLSFQHQASEISKVFERF 482 Query: 335 KVDRVFVD 342 V + +D Sbjct: 483 NVTYLGID 490 >gi|1219|lcl|protein:vir:105519 Length: 475 # NCBI annotation: phage terminase large subunit # Family: family:all:1730 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516188;genbank:gi:89885991;genbank:GeneID :3964379 Length = 475 Score = 23.9 bits (50), Expect = 4.3, Method: Compositional matrix adjust. 
 Identities = 9/39 (23%), Positives = 21/39 (53%)

Query: 284 GIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQ 322
           G+D G+  P A + + +  + + +YV   Y+  + + A+
Sbjct: 302 GMDFGWDHPQAHIQLVWDNENEMFYVTRAYKARQVSPAE 340

>gi|5072|lcl|protein:vir:95130 Length: 531 # NCBI annotation: hypothetical protein ORF006 # Family: family:all:144 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293413;genbank:gi:148912834;genbank:GeneID:5228205
          Length = 531

 Score = 22.7 bits (47), Expect = 8.2, Method: Compositional matrix adjust.
 Identities = 12/47 (25%), Positives = 20/47 (42%)

Query: 209 VSIHGTYRDNPRADLNDIEEARRTVSKNYFRQEYEADFSVFEGQIFD 255
           ++I G+Y++NP    + I E       N  +     D+ V  G   D
Sbjct: 247 IAIFGSYKENPYLPASYIAELESIKEPNLRKAWLYGDWDVTAGGAID 293

>gi|17978|lcl|protein:vir:4335 Length: 563 # NCBI annotation: terminase # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061498;genbank:gi:9635588;genbank:GeneID:1262853
          Length = 563

 Score = 22.7 bits (47), Expect = 8.6, Method: Compositional matrix adjust.
 Identities = 14/39 (35%), Positives = 19/39 (48%), Gaps = 1/39 (2%)

Query: 268 GMRHFFKDDEAFETLLGIDVGYRDPTAVLTIKYH-YDTD 305
           G     +D   FET++G       P A L  +YH +DTD
Sbjct: 181 GPMFVMEDMSKFETVIGNPGDGASPHAALVDEYHEHDTD 219

>gi|11926|lcl|protein:vir:79177 Length: 589 # NCBI annotation: gp4, phage terminase, ATPase subunit # Family: family:all:169 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111035;genbank:gi:134288784;genbank:GeneID:4960696
          Length = 589

 Score = 22.7 bits (47), Expect = 9.3, Method: Compositional matrix adjust.
 Identities = 30/131 (22%), Positives = 50/131 (38%), Gaps = 14/131 (10%)

Query: 280 ETLLGIDVGYRDPTAVLTI--KYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYKVD 337
           E  +G D      +A L +      D   + VLE +Q       + AA I+    RY V
Sbjct: 410 EVWVGYDPALTGDSAGLVVVAPPRVDDGAFRVLERHQFRGNDFEEQAAAIEAITQRYNVG 469

Query: 338 RVFVDSAAAQ------FRQDLAYEHEIASAPAKKS--VLDGLACLQALFQQGKIIVDASC 389
            + +D+           R+       +  +P  K+  VL G    Q++ + G++  DA
Sbjct: 470 YIAIDTTGMGQGVYQLVRKFFPAAVALNYSPEVKTRLVLKG----QSVVRNGRLQFDAGW 525

Query: 390 SSLIHALQNYK 400
           + L  A    K
Sbjct: 526 TDLAAAFMAIK 536

  Database: capsid_neck_tail
    Posted date:  Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database:  514

Lambda     K      H
   0.321    0.137    0.407

Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 197,663
Number of Sequences: 514
Number of extensions: 9416
Number of successful extensions: 163
Number of sequences better than 100.0: 87
Number of HSP's better than 100.0 without gapping: 66
Number of HSP's successfully gapped in prelim test: 21
Number of HSP's that attempted gapping in prelim test: 45
Number of HSP's gapped (non-prelim): 93
length of query: 438
length of database: 206,069
effective HSP length: 74
effective length of query: 364
effective length of database: 168,033
effective search space: 61164012
effective search space used: 61164012
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 38 (19.2 bits)