BLASTP 2.2.18 [Mar-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_017971.2_cdsid_YP_006382528.1 [gene=tf_70]
[protein=terminase large subunit] [protein_id=YP_006382528.1]
[location=complement(43173..44612)]
         (479 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gi|18891|lcl|protein:vir:8847 Length: 482 # NCBI annotation: ter...    766   0.0
gi|1219|lcl|protein:vir:105519 Length: 475 # NCBI annotation: ph...    258   7e-71
gi|13374|lcl|protein:vir:9262 Length: 517 # NCBI annotation: gp2...    256   6e-70
gi|3300|lcl|protein:vir:100942 Length: 499 # NCBI annotation: Gp...    254   1e-69
gi|155|lcl|protein:vir:77595 Length: 499 # NCBI annotation: term...    254   2e-69
gi|2931|lcl|protein:vir:105460 Length: 409 # NCBI annotation: pu...     44   4e-06
gi|3326|lcl|protein:vir:94525 Length: 440 # NCBI annotation: put...     44   4e-06
gi|5321|lcl|protein:vir:99534 Length: 422 # NCBI annotation: put...     37   7e-04
gi|2589|lcl|protein:vir:94084 Length: 407 # NCBI annotation: ORF...     37   7e-04
gi|3523|lcl|protein:vir:105897 Length: 407 # NCBI annotation: la...     37   7e-04
gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: t...     35   0.002
gi|4261|lcl|protein:vir:94806 Length: 402 # NCBI annotation: ORF...     31   0.039
gi|9935|lcl|protein:vir:97273 Length: 402 # NCBI annotation: ORF...     31   0.041
gi|1596|lcl|protein:vir:93748 Length: 403 # NCBI annotation: ORF...     30   0.061
gi|14951|lcl|protein:vir:1235 Length: 405 # NCBI annotation: sim...     30   0.070
gi|13903|lcl|protein:vir:9870 Length: 273 # NCBI annotation: put...     28   0.25
gi|28182|lcl|protein:vir:108053 Length: 611 # NCBI annotation: g...     28   0.30
gi|14353|lcl|protein:vir:3149 Length: 554 # NCBI annotation: unk...     27   0.44
gi|18781|lcl|protein:vir:7429 Length: 581 # NCBI annotation: gp6...     26   0.97
gi|6775|lcl|protein:vir:96067 Length: 460 # NCBI annotation: con...     26   0.99
gi|15908|lcl|protein:vir:3418 Length: 641 # NCBI annotation: DNA...     25   1.5
gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: te...     25   2.7
gi|16077|lcl|protein:vir:4155 Length: 468 # NCBI annotation: lar...     24   3.1
gi|18252|lcl|protein:vir:4193 Length: 467 # NCBI annotation: put...     24   3.2
gi|8730|lcl|protein:vir:96840 Length: 421 # NCBI annotation: ORF...     24   3.4
gi|4548|lcl|protein:vir:94970 Length: 473 # NCBI annotation: put...     24   4.6

>gi|18891|lcl|protein:vir:8847 Length: 482 # NCBI annotation: terminase #
          Family: family:all:1730 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs:
          genbank:acc:NP_775255;genbank:gi:27476053;genbank:GeneID
          :2700601
          Length = 482

 Score =  766 bits (1978), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/481 (75%), Positives = 404/481 (83%), Gaps = 3/481 (0%) Query: 1 MNQTERALALLKELQDRQKYWKIKQYTPYGWQEKFINASSNAAQLLAMTGNRCGKTYTGA 60 M+ ER L++EL +RQKY+++ QYTPYGWQEKFI ASSN AQLLAMTGNRCGKTYTGA Sbjct: 1 MDTQERLRNLVRELAERQKYFRMNQYTPYGWQEKFIAASSNCAQLLAMTGNRCGKTYTGA 60 Query: 61 FIMACHLTGLYPDWWEGRRYNKPIEAWAAGISTDTTRDILQSELLGKWSDPSRFGTGAIP 120 FIMACHLTG YP+WW GR+++KP+ WAAGISTDTTRDILQSELLG W +P FGTG IP Sbjct: 61 FIMACHLTGRYPEWWTGRKFDKPVNCWAAGISTDTTRDILQSELLGDWKNPEAFGTGMIP 120 Query: 121 KEMILETVRREGKPGCVQTVLVKHVSGGTSILTFKSYEMSQDKFMGTAIDVIWLDEECPK 180 KE I++T RREGKPGCVQ V+V+HVSGG S L FKSYEMSQDKFMGTAIDVIWLDEECPK Sbjct: 121 KEDIVKTERREGKPGCVQAVMVRHVSGGLSSLIFKSYEMSQDKFMGTAIDVIWLDEECPK 180 Query: 181 DIYTQCVTRTATTGGIVYLTFTPEHGATELVKEFMQDLKPGQFMIGATWDDAPHLSEEVK 240 DIYTQCVTRTATTGGIVYLTFTPEHG TE+VK+F+QDLKPGQF+I A+W+DAPHLS EVK Sbjct: 181 DIYTQCVTRTATTGGIVYLTFTPEHGLTEIVKDFLQDLKPGQFLIHASWEDAPHLSPEVK 240 Query: 241 EQLLSVYSPAERAMRASGKPMLGSGVVFVVPEERIVVQPIAIPNHWHHIIGIDLGFDHPN 300 EQLLSVYSPAER MRA G PMLGSGVVF + EE+ V +P IP+H+H IIGIDLGFDHPN Sbjct: 241 EQLLSVYSPAERRMRAEGIPMLGSGVVFPILEEKFVCEPFDIPDHFHRIIGIDLGFDHPN 300 Query: 301 AIACLALDPTTGTYYLYDERSERGETLSMFAQAIRAKGGDTIPVVVPHDAFKHDGATSGR 360 AIAC+A D YYLYDERSE GETL M A AI KGG IPVVVPHDAFKHDGATSGR Sbjct: 301 AIACVAWDAEKDKYYLYDERSESGETLGMHADAIYLKGGHQIPVVVPHDAFKHDGATSGR 360 Query: 361 RFVDLLRDDHKLNIVYQPFSNPPGPDGKPGGNSVEFGVNWMHTHMDNGKFKVFSTCTKFL 420 RFVDLL+DDH LN+VY+PFSNPPGPDGK GGNSVEFGVNWM T M+NG KVF+TCT FL Sbjct: 361 RFVDLLKDDHNLNVVYEPFSNPPGPDGKHGGNSVEFGVNWMLTRMENGDLKVFNTCTNFL 420 Query: 421 QEMKLYHRKDGKIIDRSDDMISATRYAALMMGRHGVPGGASNNYTW---SGPLRPAWFDG 477 +EMK+YHRKDGKI+DR+DDMISATRYA LM RH PG N+ + + L P WF Sbjct: 421 KEMKMYHRKDGKIVDRNDDMISATRYALLMASRHARPGAVRNSGYYRSDTARLIPDWFGS 480 Query: 478 V 478 + Sbjct: 481 I 481 >gi|1219|lcl|protein:vir:105519 Length: 475 # NCBI annotation: phage terminase large subunit # Family: family:all:1730 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: 
genbank:acc:YP_516188;genbank:gi:89885991;genbank:GeneID :3964379 Length = 475 Score = 258 bits (660), Expect = 7e-71, Method: Compositional matrix adjust. Identities = 170/465 (36%), Positives = 235/465 (50%), Gaps = 30/465 (6%) Query: 3 QTERALALLKELQDRQKYWKIKQY--TPYGWQEKFINASSNAAQLLAMTGNRCGKTYTGA 60 Q + + + E + R+ + + Y T Y WQ KFI S+ AQ+ + NR GKT T Sbjct: 8 QKIQQVRFINEQRRREHACRYRHYYGTRYDWQRKFIGLSAEYAQVALIAANRVGKTDTAT 67 Query: 61 FIMACHLTGLYPDWWEGRRYNKPIEAWAAGISTDTTRDILQSELLGKWSDPSRFGTGAIP 120 ++ A H G YP+ W G R++ W G S + RD+LQ+ LLG+ +D G G IP Sbjct: 68 YVDAVHALGDYPEAWSGYRFSHAPVIWCLGYSGEKCRDLLQTPLLGRKTDNGWQG-GLIP 126 Query: 121 KEMILETVRREGKPGCVQTVLVKHVSGGTSILTFKSYEMSQDKFMGTAIDVIWLDEECPK 180 E I +T G V+T ++HVSG S + F SY Q MG +D +DEE P+ Sbjct: 127 GERIADTEAMTGTTNAVRTAYIRHVSGLLSKIQFWSYSQGQHALMGDCVDWFHIDEE-PR 185 Query: 181 D--IYTQCVTRTAT----TGGIVYLTFTPEHGATELVKEFMQDLKPGQFMIGATWDDAPH 234 D IY Q +TRTAT GG LTFTPE+G T+LV FM + P Q I WDDAPH Sbjct: 186 DPTIYPQVLTRTATGDRGKGGRGILTFTPENGRTDLVIGFMDNPSPAQTCINVGWDDAPH 245 Query: 235 LSEEVKEQLLSVYSPAERAMRASGKPMLGSGVVFVVPEERIVVQPIAIPNHWHHIIGIDL 294 LS++VK LL+ + +R MR G PMLG G ++ + E+ I P +P HW I G+D Sbjct: 246 LSQKVKNDLLASFPAHQRDMRTKGIPMLGHGRIYDLGEDFITCDPFPVPAHWLVIDGMDF 305 Query: 295 GFDHPNAIACLALDPTTGTYYLYDERSERGETLSMFAQAIRAKG--GDTIPVVVPHDAFK 352 G+DHP A L D +Y+ R+ + +S A+A A + +P P D Sbjct: 306 GWDHPQAHIQLVWDNENEMFYV--TRAYKARQVSP-AEAYSAVSIWAENVPTAWPSDGLM 362 Query: 353 HDGATSGRRFVDLLRDDHKLNIVYQPFSNPPGPDGKPGGNSVEFGVNWMHTHMDNGKFKV 412 + + ++ DD ++ P P G SVE +H M GKFKV Sbjct: 363 TEKGSGIQQ--KTYYDDAGFCMLRDPAQWP------DGSRSVE-----LHDLMRRGKFKV 409 Query: 413 FSTCTKFLQEMKLYHRKD-GKIIDRSDDMISATRYAALMMGRHGV 456 FS F E YHR + +I+ DD++ A RY A MM R+ V Sbjct: 410 FSGLRDFFDEYNFYHRDEKSRIVKMRDDILDAVRY-AYMMRRYAV 453 >gi|13374|lcl|protein:vir:9262 Length: 517 # NCBI annotation: gp2 # Family: family:all:1730 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720326;genbank:gi:24371583;genbank:GeneID :955800 Length = 517 Score = 256 bits 
(653), Expect = 6e-70, Method: Compositional matrix adjust. Identities = 153/443 (34%), Positives = 231/443 (52%), Gaps = 27/443 (6%) Query: 23 IKQYTPYGWQEKFINASSNAAQLLAMTGNRCGKTYTGAFIMACHLTGLYPDW-------- 74 + ++TPY Q +FI+A + + M GN+ GK++TGA +A HLTG YP Sbjct: 52 LYEFTPYSKQREFIDAGHDYPERCFMAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGK 111 Query: 75 ----WEGRRYNKPIEAWAAGISTDTTRDILQSELLGKWSDPSRFGTGAIPKEMILETVRR 130 W+G+R+ +P+ W G + +T Q L G+ + G G+IPKE I+ + Sbjct: 112 YGGEWKGKRFYEPVVFWVGGETNETVTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKS 171 Query: 131 EGKPGCVQTVLVKH-----VSGGTSILTFKSYEMSQDKFMGTAIDVIWLDEECPKDIYTQ 185 P V +LVKH V G SI FK Y + ++ G I +W DEE P IY + Sbjct: 172 PFFPNLVDHLLVKHHTPEGVEDGISICYFKPYSQGRARWQGDTIHGVWFDEEPPYSIYGE 231 Query: 186 CVTRTATTGGIVYLTFTPEHGATELVKEFMQDLKPGQFMIGATWDDAPHLSEEVKEQLLS 245 +TRT G LTFTP G +++V +F+++ Q ++ T DA H ++E KEQ+++ Sbjct: 232 GLTRTNKYGQFSILTFTPLMGMSDVVTKFLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIA 291 Query: 246 VYSPAERAMRASGKPMLGSGVVFVVPEERIVVQPIAIPNHWHHIIGIDLGFDHPNAIACL 305 Y ER RA G P +GSG +F +PEE I QP P+H++ I D G++HP A L Sbjct: 292 SYPEHEREARARGIPTMGSGRIFQIPEETIKCQPFECPDHFYVIDAQDFGWNHPQAHIQL 351 Query: 306 ALDPTTGTYYLYDERSERGETLSMFAQAIRAKGGDTIPVVVPHDAFKHDGATSGRRFVDL 365 D +YL ++ E ++ A + IPV PHD +H+ G + Sbjct: 352 WWDKDADVFYL-ARVWKKSENTAVQAWGAVKSWANKIPVAWPHDGHQHE--KGGGEQLKT 408 Query: 366 LRDDHKLNIVYQPFSNPPGPDGKPGGNSVEFGVNWMHTHMDNGKFKVFSTCTKFLQEMKL 425 D +++ + + P GGNSVE G+ + M G+FKVF+TC F +E +L Sbjct: 409 QYADAGFSMLPEHATFP------DGGNSVESGIGELRDLMLEGRFKVFNTCEPFFEEFRL 462 Query: 426 YHR-KDGKIIDRSDDMISATRYA 447 YHR ++GKI+ +DD++ ATRY Sbjct: 463 YHRDENGKIVKTNDDVLDATRYG 485 >gi|3300|lcl|protein:vir:100942 Length: 499 # NCBI annotation: Gp2 # Family: family:all:1730 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006405;genbank:gi:46358697;genbank:GeneID :2777092 Length = 499 Score = 254 bits (649), Expect = 1e-69, Method: Compositional matrix adjust. 
Identities = 154/444 (34%), Positives = 228/444 (51%), Gaps = 29/444 (6%) Query: 23 IKQYTPYGWQEKFINASSNAAQLLAMTGNRCGKTYTGAFIMACHLTGLYPDW-------- 74 + ++TPY Q +FI+A + + M GN+ GK++TGA +A HLTG YP Sbjct: 34 LYEFTPYSKQREFIDAGHDYPERCFMAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGK 93 Query: 75 ----WEGRRYNKPIEAWAAGISTDTTRDILQSELLGKWSDPSRFGTGAIPKEMILETVRR 130 W+G+R+ +P+ W G + +T Q L G+ + G G+IPKE I+ + Sbjct: 94 YGGEWKGKRFYEPVVFWVGGETNETVTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKS 153 Query: 131 EGKPGCVQTVLVKH-----VSGGTSILTFKSYEMSQDKFMGTAIDVIWLDEECPKDIYTQ 185 P V +LVKH V G SI FK Y + ++ G I +W DEE P IY + Sbjct: 154 PFFPNLVDHLLVKHHTPEGVEDGISICYFKPYSQGRARWQGDTIHGVWFDEEPPYSIYGE 213 Query: 186 CVTRTATTGGIVYLTFTPEHGATELVKEFMQDLKPGQFMIGATWDDAPHLSEEVKEQLLS 245 +TRT G LTFTP G +++V +F+++ Q ++ T DA H ++E KEQ+++ Sbjct: 214 GLTRTNKYGQFSILTFTPLMGMSDVVTKFLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIA 273 Query: 246 VYSPAERAMRASGKPMLGSGVVFVVPEERIVVQPIAIPNHWHHIIGIDLGFDHPNAIACL 305 Y ER RA G P +GSG +F +PEE I QP P+H++ I D G++HP A L Sbjct: 274 SYPEHEREARARGIPTMGSGRIFQIPEETIKCQPFECPDHFYVIDAQDFGWNHPQAHIQL 333 Query: 306 ALDPTTGTYYLYDERSERGETLSMFAQAIRAKGGDTIPVVVPHDAFKHDGATSGRRFVDL 365 D +YL ++ E ++ A + IPV PHD +H+ Sbjct: 334 WWDKDADVFYL-ARVWKKSENTAVQAWGAVKSWANKIPVAWPHDGHQHEKGGG------- 385 Query: 366 LRDDHKLNIVYQPFSNPPGPDGKP-GGNSVEFGVNWMHTHMDNGKFKVFSTCTKFLQEMK 424 + K FS P P GGNSVE G++ + M G+FK F+TC F +E + Sbjct: 386 --EQLKTQYADAGFSMLPDHATFPDGGNSVESGISELRDLMLEGRFKAFNTCEPFFEEFR 443 Query: 425 LYHR-KDGKIIDRSDDMISATRYA 447 LYHR ++GKI+ +DD++ ATRY Sbjct: 444 LYHRDENGKIVKTNDDVLDATRYG 467 >gi|155|lcl|protein:vir:77595 Length: 499 # NCBI annotation: terminase large subunit # Family: family:all:1730 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063734;genbank:gi:51236724;genbank:GeneID :2944239 Length = 499 Score = 254 bits (648), Expect = 2e-69, Method: Compositional matrix adjust. 
Identities = 154/444 (34%), Positives = 228/444 (51%), Gaps = 29/444 (6%) Query: 23 IKQYTPYGWQEKFINASSNAAQLLAMTGNRCGKTYTGAFIMACHLTGLYPDW-------- 74 + ++ PY Q +FI+A + + M GN+ GK++TGA +A HLTG YP Sbjct: 34 LYEFAPYSKQREFIDAGHDYPERCFMAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGK 93 Query: 75 ----WEGRRYNKPIEAWAAGISTDTTRDILQSELLGKWSDPSRFGTGAIPKEMILETVRR 130 W+G+R+ +P+ W G + +T Q L G+ + G G+IPKE I+ + Sbjct: 94 YGGEWKGKRFYEPVVFWIGGETNETVTKTTQRILCGRIEENDEPGYGSIPKEDIISWKKS 153 Query: 131 EGKPGCVQTVLVKH-----VSGGTSILTFKSYEMSQDKFMGTAIDVIWLDEECPKDIYTQ 185 P V +LVKH V G SI FK Y + ++ G I +W DEE P IY + Sbjct: 154 PFFPNLVDHLLVKHHTADGVEDGISICYFKPYSQGRARWQGDTIHGVWFDEEPPYSIYGE 213 Query: 186 CVTRTATTGGIVYLTFTPEHGATELVKEFMQDLKPGQFMIGATWDDAPHLSEEVKEQLLS 245 +TRT G LTFTP G +++V +F+++ Q ++ T DA H ++E KEQ+++ Sbjct: 214 GLTRTNKYGQFSILTFTPLMGMSDVVTKFLKNPSKSQKVVNMTIYDAEHYTDEQKEQIIA 273 Query: 246 VYSPAERAMRASGKPMLGSGVVFVVPEERIVVQPIAIPNHWHHIIGIDLGFDHPNAIACL 305 Y ER RA G P +GSG +F +PEE I QP P+H++ I D G++HP A L Sbjct: 274 SYPEHEREARARGIPTMGSGRIFQIPEETIKCQPFECPDHFYVIDAQDFGWNHPQAHIQL 333 Query: 306 ALDPTTGTYYLYDERSERGETLSMFAQAIRAKGGDTIPVVVPHDAFKHDGATSGRRFVDL 365 D +YL ++ E ++ A + IPV PHD +H+ Sbjct: 334 WWDKDADVFYL-ARVWKKSENTAVQAWGAVKSWANKIPVAWPHDGHQHEKGGG------- 385 Query: 366 LRDDHKLNIVYQPFSNPPGPDGKP-GGNSVEFGVNWMHTHMDNGKFKVFSTCTKFLQEMK 424 + K FS P P GGNSVE G++ + M G+FKVF+TC F +E + Sbjct: 386 --EQLKTQYADAGFSMLPDHATFPDGGNSVESGISELRDLMLEGRFKVFNTCEPFFEEFR 443 Query: 425 LYHR-KDGKIIDRSDDMISATRYA 447 LYHR ++GKI+ +DD++ ATRY Sbjct: 444 LYHRDENGKIVKTNDDVLDATRYG 467 >gi|2931|lcl|protein:vir:105460 Length: 409 # NCBI annotation: putative phage terminase large subunit B # Family: family:all:54 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529870;genbank:gi:90592610;genbank:GeneID :3974524 Length = 409 Score = 43.9 bits (102), Expect = 4e-06, Method: Compositional matrix adjust. 
Identities = 34/145 (23%), Positives = 66/145 (45%), Gaps = 10/145 (6%) Query: 181 DIYTQCVTRTATTGGIVYLTFTPEHGATELVKEFMQDLKPGQFMIGATW--DDAPHLSEE 238 D++ + + R + G + P+ L +++ + P + T+ DD LS++ Sbjct: 139 DVFQEILQRCSIEGARIICDTNPDIPTHWLKTDYIDNHDPKARIKSFTFTIDDNTFLSKD 198 Query: 239 VKEQLLSVYSPAERAMRAS----GKPMLGSGVVFV-VPEERIVVQPIAIPNHWHHIIGID 293 E S+ + R M G+ + G G+V+ ++ +V+ +P+ + +G+D Sbjct: 199 YVE---SIKAATPRGMFYDRGILGQWVTGDGIVYQDFNKDTMVIPKNRVPDGLDYYVGVD 255 Query: 294 LGFDHPNAIACLALDPTTGTYYLYD 318 G++HPN I L D TY L D Sbjct: 256 WGYEHPNPIILLGDDKDGNTYVLED 280 >gi|3326|lcl|protein:vir:94525 Length: 440 # NCBI annotation: putative large subunit terminase # Family: family:all:54 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223885;genbank:gi:62327097;genbank:GeneID :5075541 Length = 440 Score = 43.9 bits (102), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 63/265 (23%), Positives = 107/265 (40%), Gaps = 43/265 (16%) Query: 199 LTFTPEHGATELVKEFMQDL--KPGQFMIGATWDDAPHLSEEVKEQL--LSVYSPAERAM 254 +TF P L EF D + I T+ D HL+ + + L + V +P + Sbjct: 176 ITFNPWSDRHWLKHEFFDDKTKRNHSRAITTTYKDNDHLNADYVDSLKEMLVRNPNRARV 235 Query: 255 RASGKPMLGSGVVF--VVPEERIVVQPIA-IPNHWHHIIGIDLGFDH-PNAIACLALDPT 310 G+ + G+VF + + IA +P +G+D GF H P A +A+D Sbjct: 236 AVLGEWGIAEGLVFDGLFEQRDFSYDEIANLPKS----VGLDFGFKHDPTAGEFIAVDQD 291 Query: 311 TGTYYLYDERSERGETLSMFAQAIRAKGGDTIPVVVPHDAFKHDGATSGRRFVDLLRDDH 370 Y+YDE ++ + AQ + +P+ ++ +R + L H Sbjct: 292 NRIVYIYDEFYKQHLLTNQIAQELAKHKAFGLPIT---------ADSAEQRMIVELSQQH 342 Query: 371 KL-NIVYQPFSNPPGPDGKPGGNSVEFGVNWMHTHMDNGKFKVFSTCTKFLQEMKLY--- 426 ++ NI P GK G +SV G+ +M ++ +F V ++E Y Sbjct: 343 RVPNI---------KPSGK-GKDSVIQGIQYMQSY----RFVVHPRVKGLMEEFNTYVYD 388 Query: 427 HRKDGKIIDRSDD----MISATRYA 447 K+G +++ D I A RYA Sbjct: 389 MDKEGNWLNKPKDANNHAIDALRYA 413 >gi|5321|lcl|protein:vir:99534 Length: 422 # NCBI annotation: putative protein # Family: family:all:54 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: 
genbank:acc:NP_958532;genbank:gi:41179314;genbank:GeneID :2717172 Length = 422 Score = 36.6 bits (83), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 47/175 (26%), Positives = 71/175 (40%), Gaps = 32/175 (18%) Query: 290 IGIDLGF-DHPNAIACLALDPTTGTYYLYDERSERGETLSMFAQAIRAKGGDTIPVVVPH 348 G+D GF + P+A + LD T Y+ DE ++G + AQ I+ G + V+ Sbjct: 260 FGLDYGFINDPSAFMHIKLDMRNKTLYVMDEFVKKGLLNNQLAQVIKDMGY-SKEVITAD 318 Query: 349 DAFKHDGATSGRRFVDLLRDDHKLNIVYQPFSNPPGPDGKPGGNSVEFGVNWMHTHMDNG 408 A K A R + +R K GPD S+ G+ ++ Sbjct: 319 SAEKKSIAEMKRDGIYRIRPALK------------GPD------SIIQGIQFLQQF---- 356 Query: 409 KFKVFSTCTKFLQEMKLY-HRKDGKI-------IDRSDDMISATRYAALMMGRHG 455 K+ V C K ++E++ Y + KD K ID + I A RYA HG Sbjct: 357 KWVVDDRCVKTIEELQNYTYVKDKKTDEYTNRPIDAYNHCIDAIRYAVEEENGHG 411 >gi|2589|lcl|protein:vir:94084 Length: 407 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240228;genbank:gi:66395894;genbank:GeneID :5133253 Length = 407 Score = 36.6 bits (83), Expect = 7e-04, Method: Compositional matrix adjust. 
Identities = 55/279 (19%), Positives = 113/279 (40%), Gaps = 35/279 (12%) Query: 180 KDIYTQCVTRTATTGGIVYLTFTPEHGATELVKEFMQDLKPGQFMIGATW--DDAPHLSE 237 ++++ + +R + TG + + P+H L+K+++++ P ++ + DD L++ Sbjct: 139 EEVFDEIKSRCSGTGARILVDTNPDHPEHWLLKDYIENTDPKAGILSHQFKLDDNNFLND 198 Query: 238 EVKEQL-LSVYSPAERAMRASGKPMLGSGVV---FVVPEERIVVQPIAIPNHWHHIIGID 293 KE + S S +G + G GVV F + E I + + G+D Sbjct: 199 RYKESIKASTPSGMFYERNINGMWVSGDGVVYADFDLNENTIKADELDDIPIKEYFAGVD 258 Query: 294 LGFDHPNAIACLALDPTTGTYYLYDERSERGETLS---MFAQAIRAKGGDTIPVVVPHDA 350 G++H +I + G +Y +E + + + + + A+ I ++ G+ + D Sbjct: 259 WGYEHYGSIVLIGR-GIDGNFYFIEEHAHQFKFIDDWVVIAKDIVSRYGN---INFYCDT 314 Query: 351 FKHDGATSGRRFVDLLRDDHKLNIVYQPFSNPPGPDGKPGGNSVEFGVNWMHTHMDNGKF 410 + + T RR H+L + S G + N Sbjct: 315 ARPEYITEFRR--------HRLRAINADKSKLSGVE------------EVAKLFKQNKLL 354 Query: 411 KVFSTCTKFLQEMKLY--HRKDGKIIDRSDDMISATRYA 447 ++ +F QE+ Y H +G+ I DD++ + RYA Sbjct: 355 VLYDNMDRFKQEVFKYVWHPTNGEPIKEFDDVLDSLRYA 393 >gi|3523|lcl|protein:vir:105897 Length: 407 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004370;genbank:gi:122891825;genbank:Ge neID:4712368 Length = 407 Score = 36.6 bits (83), Expect = 7e-04, Method: Compositional matrix adjust. 
Identities = 55/279 (19%), Positives = 113/279 (40%), Gaps = 35/279 (12%) Query: 180 KDIYTQCVTRTATTGGIVYLTFTPEHGATELVKEFMQDLKPGQFMIGATW--DDAPHLSE 237 ++++ + +R + TG + + P+H L+K+++++ P ++ + DD L++ Sbjct: 139 EEVFDEIKSRCSGTGARILVDTNPDHPEHWLLKDYIENTDPKAGILSHQFKLDDNNFLND 198 Query: 238 EVKEQL-LSVYSPAERAMRASGKPMLGSGVV---FVVPEERIVVQPIAIPNHWHHIIGID 293 KE + S S +G + G GVV F + E I + + G+D Sbjct: 199 RYKESIKASTPSGMFYERNINGMWVSGDGVVYADFDLNENTIKADELDDIPIKEYFAGVD 258 Query: 294 LGFDHPNAIACLALDPTTGTYYLYDERSERGETLS---MFAQAIRAKGGDTIPVVVPHDA 350 G++H +I + G +Y +E + + + + + A+ I ++ G+ + D Sbjct: 259 WGYEHYGSIVLIGR-GIDGNFYFIEEHAHQFKFIDDWVVIAKDIVSRYGN---INFYCDT 314 Query: 351 FKHDGATSGRRFVDLLRDDHKLNIVYQPFSNPPGPDGKPGGNSVEFGVNWMHTHMDNGKF 410 + + T RR H+L + S G + N Sbjct: 315 ARPEYITEFRR--------HRLRAINADKSKLSGVE------------EVAKLFKQNKLL 354 Query: 411 KVFSTCTKFLQEMKLY--HRKDGKIIDRSDDMISATRYA 447 ++ +F QE+ Y H +G+ I DD++ + RYA Sbjct: 355 VLYDNMDRFKQEVFKYVWHPTNGEPIKEFDDVLDSLRYA 393 >gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: terminase, large subunit # Family: family:all:147 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006983;genbank:gi:46401884;genbank:GeneID :2777679 Length = 438 Score = 34.7 bits (78), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 32/143 (22%), Positives = 58/143 (40%), Gaps = 24/143 (16%) Query: 286 WHHIIGIDLGFDHPNAIACLALDPTTGTYYLYDERSERGETLSMFAQAIRAKGGDTIPVV 345 + ++GID+G+ P A+ + T TYY+ +E + +T + A I+ Sbjct: 279 FETLLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQ---------- 328 Query: 346 VPH--DAFKHDGATSGRRFVDLLRDDHKLNIVYQPFSNPPGPDGKPGGNSVEFGVNWMHT 403 H D +K D R FVD + ++ Y+ P SV G+ + Sbjct: 329 --HCIDRYKVD-----RIFVDSAAAQFRQDLAYE-----HEIASAPAKKSVLDGLACLQA 376 Query: 404 HMDNGKFKVFSTCTKFLQEMKLY 426 GK V ++C+ + ++ Y Sbjct: 377 LFQQGKIIVDASCSSLIHALQNY 399 >gi|4261|lcl|protein:vir:94806 Length: 402 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240530;genbank:gi:66396200;genbank:GeneID :5133586 Length = 402 Score = 30.8 bits (68), Expect = 0.039, Method: Compositional matrix adjust. Identities = 42/191 (21%), Positives = 82/191 (42%), Gaps = 24/191 (12%) Query: 166 GTAIDVIWLDEECPKDIYTQCVTRTATTGGIVYLTFTPEHGATELVKEFM----QDLKPG 221 GTA+ +++ K+++++C + G + + PE+ + K+++ Q L G Sbjct: 125 GTALHNMFI-----KEVFSRC----SHKGARILIDTNPENPMHPVKKDYIDKSGQRLSNG 175 Query: 222 QFMIGA---TWDDAPHLSEEVKEQLLSVYSPAERAMR-ASGKPMLGSGVVFVVPEER--- 274 + I A T D L EE E +++ R GK + GVV+ +E+ Sbjct: 176 RLNIKAFQFTLFDNTFLDEEYIESIIASTPTGMFTDRDIYGKWVSAEGVVYKDFKEKVHY 235 Query: 275 IVVQPIAIPNHWHHIIGIDLGFDHPNAIACLALDPTTGTYYLYDERSERGETLS---MFA 331 I + G+D G++H +I +A D G Y+ +E + R + + A Sbjct: 236 ITEEEFKTKQIKRKYAGVDWGYEHYGSIMVVAED-FDGNKYVIEEHAHRHKEIDDWVAIA 294 Query: 332 QAIRAKGGDTI 342 + + + GD + Sbjct: 295 KGVIKRHGDIL 305 >gi|9935|lcl|protein:vir:97273 Length: 402 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240605;genbank:gi:66396276;genbank:GeneID :5133629 Length = 402 Score = 30.8 bits (68), Expect = 0.041, Method: Compositional matrix adjust. 
Identities = 42/191 (21%), Positives = 82/191 (42%), Gaps = 24/191 (12%) Query: 166 GTAIDVIWLDEECPKDIYTQCVTRTATTGGIVYLTFTPEHGATELVKEFM----QDLKPG 221 GTA+ +++ K+++++C + G + + PE+ + K+++ Q L G Sbjct: 125 GTALHNMFI-----KEVFSRC----SHKGARILIDTNPENPMHPVKKDYIDKSGQRLSNG 175 Query: 222 QFMIGA---TWDDAPHLSEEVKEQLLSVYSPAERAMR-ASGKPMLGSGVVFVVPEER--- 274 + I A T D L EE E +++ R GK + GVV+ +E+ Sbjct: 176 RLNIKAFQFTLFDNTFLDEEYIESIIASTPTGMFTDRDIYGKWVSAEGVVYKDFKEKVHY 235 Query: 275 IVVQPIAIPNHWHHIIGIDLGFDHPNAIACLALDPTTGTYYLYDERSERGETLS---MFA 331 I + G+D G++H +I +A D G Y+ +E + R + + A Sbjct: 236 ITEEEFKTKQIKRKYAGVDWGYEHYGSIMVVAED-FDGNKYVIEEHAHRHKEIDDWVAIA 294 Query: 332 QAIRAKGGDTI 342 + + + GD + Sbjct: 295 KGVIKRHGDIL 305 >gi|1596|lcl|protein:vir:93748 Length: 403 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240453;genbank:gi:66396122;genbank:GeneID :5133517 Length = 403 Score = 30.0 bits (66), Expect = 0.061, Method: Compositional matrix adjust. 
Identities = 42/191 (21%), Positives = 82/191 (42%), Gaps = 24/191 (12%) Query: 166 GTAIDVIWLDEECPKDIYTQCVTRTATTGGIVYLTFTPEHGATELVKEFM----QDLKPG 221 GTA+ +++ K+++++C + G + + PE+ + K+++ Q L G Sbjct: 126 GTALHNMFI-----KEVFSRC----SYKGARILIDTNPENPMHPVKKDYIDKSGQRLSNG 176 Query: 222 QFMIGA---TWDDAPHLSEEVKEQLLSVYSPAERAMR-ASGKPMLGSGVVFVVPEER--- 274 + I A T D L EE E +++ R GK + GVV+ +E+ Sbjct: 177 RLNIKAFQFTLFDNTFLDEEYIESIIASTPTGMFTDRDIYGKWVSAEGVVYKDFKEKVHY 236 Query: 275 IVVQPIAIPNHWHHIIGIDLGFDHPNAIACLALDPTTGTYYLYDERSERGETLS---MFA 331 I + G+D G++H +I +A D G Y+ +E + R + + A Sbjct: 237 IKEEEFKTKQIKRKYAGVDWGYEHYGSIMVVAED-FDGNKYVIEEHAHRHKEIDDWVAIA 295 Query: 332 QAIRAKGGDTI 342 + + + GD + Sbjct: 296 KGVIKRHGDIL 306 >gi|14951|lcl|protein:vir:1235 Length: 405 # NCBI annotation: similar to phage O1205 ORF26 ( putative large subunit terminase) # Family: family:all:54 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510934;genbank:gi:17426268;genbank:GeneID :927381 Length = 405 Score = 30.0 bits (66), Expect = 0.070, Method: Compositional matrix adjust. 
Identities = 42/191 (21%), Positives = 82/191 (42%), Gaps = 24/191 (12%) Query: 166 GTAIDVIWLDEECPKDIYTQCVTRTATTGGIVYLTFTPEHGATELVKEFM----QDLKPG 221 GTA+ +++ K+++++C + G + + PE+ + K+++ Q L G Sbjct: 128 GTALHNMFI-----KEVFSRC----SYKGARILIDTNPENPMHPVKKDYIDKSGQRLSNG 178 Query: 222 QFMIGA---TWDDAPHLSEEVKEQLLSVYSPAERAMR-ASGKPMLGSGVVFVVPEER--- 274 + I A T D L EE E +++ R GK + GVV+ +E+ Sbjct: 179 RLNIKAFQFTLFDNTFLDEEYIESIIASTPTGMFTDRDIYGKWVSAEGVVYKDFKEKVHY 238 Query: 275 IVVQPIAIPNHWHHIIGIDLGFDHPNAIACLALDPTTGTYYLYDERSERGETLS---MFA 331 I + G+D G++H +I +A D G Y+ +E + R + + A Sbjct: 239 IKEEEFKTKQIKRKYAGVDWGYEHYGSIMVVAED-FDGNKYVIEEHAHRHKEIDDWVAIA 297 Query: 332 QAIRAKGGDTI 342 + + + GD + Sbjct: 298 KGVIKRHGDIL 308 >gi|13903|lcl|protein:vir:9870 Length: 273 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795632;genbank:gi:28876409;genbank:GeneID :1257943 Length = 273 Score = 28.1 bits (61), Expect = 0.25, Method: Compositional matrix adjust. 
Identities = 59/271 (21%), Positives = 103/271 (38%), Gaps = 42/271 (15%) Query: 195 GIVYLTFT----PEHGATELVKEFMQDLKPGQ-FMIGATWDDAPHLSEEVKEQLLSVYSP 249 G+ Y F P+ + + K++ +P F+ +T+ D P +++E + + Sbjct: 8 GLFYKFFYTYNPPKRKQSWVNKKYESQFQPSNTFVHASTYKDNPFIAKEFIAEAEATRER 67 Query: 250 AERAMR-ASGKPMLGSGVVFV--VPEERIVVQPIAIPNHWHHIIGIDLGF-DHPNAIACL 305 +ER R +GSGVV + ERI + +A ++ + GID G+ P A Sbjct: 68 SERRYRWEYLGEAIGSGVVPFDNLRFERITDEQVADFDNIRN--GIDYGYATDPLAFVRW 125 Query: 306 ALDPTTGTYYLYDERSERGETLSMFAQAIRAKGGDTIPVVVPHDAFKHDGATSGRRFVDL 365 D Y DE + + A+ + KG + + K + + Sbjct: 126 HYDKKKNGIYAIDEYYGQKISNRQLAKWLTTKGYQSDEMFAESAEPKSNAELKNEFGIKR 185 Query: 366 LRDDHKLNIVYQPFSNPPGPDGKPGGNSVEFGVNWMHTHMDNGKF-----KVFSTCTKFL 420 ++ K GPD SVEFG W +D+ F K + Sbjct: 186 IKGVKK------------GPD------SVEFGERW----LDDLDFICIDPKRTPNIAREF 223 Query: 421 QEMKLYHRKDG----KIIDRSDDMISATRYA 447 + + +DG ++ D+ + I ATRYA Sbjct: 224 ENIDYQVDRDGNPKPRLEDKVNHAIDATRYA 254 >gi|28182|lcl|protein:vir:108053 Length: 611 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595293;genbank:gi:161622599;genbank:Ge neID:5783772 Length = 611 Score = 27.7 bits (60), Expect = 0.30, Method: Compositional matrix adjust. Identities = 25/90 (27%), Positives = 40/90 (44%), Gaps = 7/90 (7%) Query: 363 VDLLRDDHKLNIVYQPFSNPPGPDGKPGGNSVEFGVNWMHTHMDNGKF-KVFSTCTKFLQ 421 V++L DDH LN PPG + E G+NW+ + D+ + + FS + + Sbjct: 5 VNVLSDDHPLNEGKTIVIKPPGSLER----KTEEGINWIKSQWDDKWYPEKFSDYLRIHK 60 Query: 422 EMKLYHRKD--GKIIDRSDDMISATRYAAL 449 +K+ + D + D M TRY L Sbjct: 61 IVKIPNNGDRPDEFQTFKDKMNKRTRYMGL 90 >gi|14353|lcl|protein:vir:3149 Length: 554 # NCBI annotation: unknown # Family: family:all:28407 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665920;genbank:gi:22091106;genbank:GeneID :951323 Length = 554 Score = 27.3 bits (59), Expect = 0.44, Method: Compositional matrix adjust. 
Identities = 15/55 (27%), Positives = 29/55 (52%), Gaps = 3/55 (5%) Query: 392 NSVEFGVNWMHTHMDNGKF---KVFSTCTKFLQEMKLYHRKDGKIIDRSDDMISA 443 +S+E G+ + ++NG+ + T F+ M+ R+DGK+ + D I+A Sbjct: 457 HSLENGIARLRILVENGRILFHRGHQTTEDFITSMQSLERRDGKMHGHTPDYIAA 511 >gi|18781|lcl|protein:vir:7429 Length: 581 # NCBI annotation: gp6 # Family: family:all:147 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818544;genbank:gi:29566981;genbank:GeneID :1260231 Length = 581 Score = 26.2 bits (56), Expect = 0.97, Method: Compositional matrix adjust. Identities = 15/53 (28%), Positives = 24/53 (45%), Gaps = 1/53 (1%) Query: 286 WHHIIGIDLGFDHPNAIACLALDPTTGTYYLYDERSERGETLSMFAQAIRAKG 338 W I +D G+ +PN + + P G + DE + T + FA I +G Sbjct: 340 WETIAAVDYGYRNPNVWLLIQIGP-WGEINIVDELYQADLTPTEFANEILRRG 391 >gi|6775|lcl|protein:vir:96067 Length: 460 # NCBI annotation: conserved hypothetical protein ORF004 # Family: family:all:54 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294421;genbank:gi:149408318;genbank:Ge neID:5237186 Length = 460 Score = 26.2 bits (56), Expect = 0.99, Method: Compositional matrix adjust. Identities = 19/108 (17%), Positives = 41/108 (37%), Gaps = 2/108 (1%) Query: 142 VKHVSGGTSILTFKSYEMSQDKFMGTAIDVIWLDEE--CPKDIYTQCVTRTATTGGIVYL 199 +KH G+ L + + ID++WL+E ++ + +++ Sbjct: 86 IKHKRTGSEFLFYGIARNLSEIKSTEGIDILWLEEAHYLTQEQWEVIEPTIRKENSEIWI 145 Query: 200 TFTPEHGATELVKEFMQDLKPGQFMIGATWDDAPHLSEEVKEQLLSVY 247 F P + + F+ F+ W++ P LSE + + + Y Sbjct: 146 IFNPNEVTDFVYQNFVVKPPKDAFVKMINWNENPFLSETMLKVIHEAY 193 >gi|15908|lcl|protein:vir:3418 Length: 641 # NCBI annotation: DNA packaging protein # Family: family:all:140 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040581;genbank:gi:9626245;genbank:GeneID: 2703524 Length = 641 Score = 25.4 bits (54), Expect = 1.5, Method: Compositional matrix adjust. 
Identities = 20/63 (31%), Positives = 28/63 (44%), Gaps = 4/63 (6%) Query: 394 VEFGVNWMHTHMDNGKFKVF-STCTKFLQEMKLYHRKDGKIIDRSDDMISA---TRYAAL 449 V+ +WM T D GK K F +T E K+ R D +++ + SA R A L Sbjct: 337 VQIVKDWMKTKGDTGKRKTFVNTTLGETWEAKIGERPDAEVMAERKEHYSAPVPDRVAYL 396 Query: 450 MMG 452 G Sbjct: 397 TAG 399 >gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: terminase large subunit # Family: family:all:4877 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429607;genbank:gi:156564098;genbank:Ge neID:5525534 Length = 635 Score = 24.6 bits (52), Expect = 2.7, Method: Compositional matrix adjust. Identities = 17/65 (26%), Positives = 31/65 (47%), Gaps = 3/65 (4%) Query: 4 TERALALLKELQDRQKYWKIK--QYTPYGWQEKFINASSNAAQLLAMTGNRCGKTYTGAF 61 TE L L L D+ K+W + ++ +QE + +++ + + G R GKT T Sbjct: 43 TEEELHYLAIL-DKPKFWAAETLKWFCRDYQEPMLQEMADSKRTVLRLGRRLGKTETMCI 101 Query: 62 IMACH 66 ++ H Sbjct: 102 MILWH 106 >gi|16077|lcl|protein:vir:4155 Length: 468 # NCBI annotation: large terminase subunit # Family: family:all:144 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046964;genbank:gi:9630534;genbank:GeneID: 1261708 Length = 468 Score = 24.3 bits (51), Expect = 3.1, Method: Compositional matrix adjust. 
Identities = 32/150 (21%), Positives = 58/150 (38%), Gaps = 9/150 (6%) Query: 224 MIGATWDDAPHLSEEVKEQLLSVYSPAERAMRASGK-PMLGSGVVFVVPEERIVVQP--- 279 I +TW + P+L+ + E+ L++ R G + G VF ++ P Sbjct: 228 FIPSTWRENPYLNRDEYEEALNMLDHVTRRQLKDGDWDVTLQGGVFKREWFEVIDSPPNG 287 Query: 280 -IAIPNHWHHIIGIDLGFDHPNAIACLALD-PTTGTYYLYDERSERGETLSMFAQAIRAK 337 + +W G + P+ L L YY+ D R R + ++ +R Sbjct: 288 LVMSVRYWDFAATKPDGANDPDYTVGLLLGVDKEDYYYVLDVRRFRESPGKVKSKVLRTA 347 Query: 338 GGDTIPVVVPHDAFKHDGATSGRRFVDLLR 367 D V++ A + + +SG+ D LR Sbjct: 348 EEDGREVII---AKEEEPGSSGKIVTDYLR 374 >gi|18252|lcl|protein:vir:4193 Length: 467 # NCBI annotation: putative large terminase subunit # Family: family:all:144 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071818;genbank:gi:11863101;genbank:GeneID :1257603 Length = 467 Score = 24.3 bits (51), Expect = 3.2, Method: Compositional matrix adjust. Identities = 33/150 (22%), Positives = 59/150 (39%), Gaps = 9/150 (6%) Query: 224 MIGATWDDAPHLSEEVKEQLLSVYSPAERAMRASGKPMLG-SGVVFVVPEERIVVQP--- 279 I +TW + P+L+ + E+ L++ R G + G VF I+ P Sbjct: 228 FIPSTWRENPYLNRDEYEEALNMLDHVTRRQLKEGDWDVSIQGGVFRREWFEIIDTPPHD 287 Query: 280 -IAIPNHWHHIIGIDLGFDHPNAIACLALD-PTTGTYYLYDERSERGETLSMFAQAIRAK 337 + +W G + P+ L + YY+ D + RG + A+ +R Sbjct: 288 LVMKLRYWDLAATPHDGSNDPDYTVGLLMGVDQDDYYYVLDIQRFRGSPGEVKARVLRTA 347 Query: 338 GGDTIPVVVPHDAFKHDGATSGRRFVDLLR 367 D V++ A + + +SG+ D LR Sbjct: 348 EEDGREVII---AKEEEPGSSGKIVTDYLR 374 >gi|8730|lcl|protein:vir:96840 Length: 421 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240151;genbank:gi:66395816;genbank:GeneID :5133181 Length = 421 Score = 24.3 bits (51), Expect = 3.4, Method: Compositional matrix adjust. 
 Identities = 20/66 (30%), Positives = 31/66 (46%), Gaps = 7/66 (10%)

Query: 388 KPGGNSVEFGVNWMHTHMDNGKFKVFST--CTKFLQEMKLYHRKDG----KIIDRSDDMI 441
           K G +SVE+G  W++  +D        T    +  + +     KDG    K+ D+ +  I
Sbjct: 343 KKGPDSVEYGEQWLND-LDAIVIDPNRTPNIAREFENIDFETDKDGNVKPKLEDKDNHTI 401

Query: 442 SATRYA 447
            ATRYA
Sbjct: 402 DATRYA 407

>gi|4548|lcl|protein:vir:94970 Length: 473 # NCBI annotation: putative phage
          terminase large subunit # Family: family:all:144 # MgeID: mge:1538 #
          MgeName: Xp15 # Cross-refs:
          genbank:acc:YP_239275;genbank:gi:66392134;genbank:GeneID
          :5076615
          Length = 473

 Score = 23.9 bits (50), Expect = 4.6, Method: Compositional matrix adjust.
 Identities = 12/31 (38%), Positives = 13/31 (41%)

Query: 276 VVQPIAIPNHWHHIIGIDLGFDHPNAIACLA 306
           VV P  IP+ W    G D G   P A    A
Sbjct: 260 VVHPFKIPHTWKIDRGYDYGSSKPAAYLLFA 290

  Database: capsid_neck_tail
    Posted date:  Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database:  514

Lambda     K      H
   0.319    0.136    0.430

Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 236,618
Number of Sequences: 514
Number of extensions: 11495
Number of successful extensions: 70
Number of sequences better than 100.0: 33
Number of HSP's better than 100.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 33
Number of HSP's gapped (non-prelim): 42
length of query: 479
length of database: 206,069
effective HSP length: 75
effective length of query: 404
effective length of database: 167,519
effective search space: 67677676
effective search space used: 67677676
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 39 (19.6 bits)