BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|Aclame:protein:vir:95311|NCBI_annot:putative terminase large subunit|genbank:acc:YP_512256;genbank:gi:89152423;genbank:GeneID:3952979
         (491 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gi|5525|lcl|protein:vir:95311 Length: 491 # NCBI annotation: put...  1020   0.0
gi|16210|lcl|protein:vir:7319 Length: 491 # NCBI annotation: ter...   972   0.0
gi|10609|lcl|protein:vir:106508 Length: 492 # NCBI annotation: P...    82   2e-17
gi|3478|lcl|protein:vir:101370 Length: 494 # NCBI annotation: Pa...    55   2e-09
gi|8997|lcl|protein:vir:101640 Length: 432 # NCBI annotation: te...    35   0.003
gi|10218|lcl|protein:vir:107805 Length: 533 # NCBI annotation: h...    35   0.003
gi|6515|lcl|protein:vir:98503 Length: 533 # NCBI annotation: hyp...    35   0.003
gi|4516|lcl|protein:vir:107432 Length: 533 # NCBI annotation: Bb...    35   0.003
gi|5891|lcl|protein:vir:107098 Length: 421 # NCBI annotation: pu...    28   0.23
gi|17761|lcl|protein:vir:171 Length: 470 # NCBI annotation: term...    28   0.24
gi|2819|lcl|protein:vir:105705 Length: 439 # NCBI annotation: gp...    28   0.26
gi|5142|lcl|protein:vir:105417 Length: 470 # NCBI annotation: ge...    28   0.35
gi|10799|lcl|protein:vir:78055 Length: 440 # NCBI annotation: Te...    27   0.44
gi|11068|lcl|protein:vir:78311 Length: 547 # NCBI annotation: pu...    27   0.45
gi|5208|lcl|protein:vir:106641 Length: 446 # NCBI annotation: OR...    27   0.45
gi|4914|lcl|protein:vir:99572 Length: 459 # NCBI annotation: Ter...    27   0.47
gi|10483|lcl|protein:vir:105297 Length: 446 # NCBI annotation: p...    26   0.97
gi|11501|lcl|protein:vir:78718 Length: 574 # NCBI annotation: DN...    25   2.0
gi|6775|lcl|protein:vir:96067 Length: 460 # NCBI annotation: con...    25   2.1
gi|2779|lcl|protein:vir:99776 Length: 425 # NCBI annotation: lar...    25   2.4
gi|9829|lcl|protein:vir:103950 Length: 425 # NCBI annotation: ph...    25   2.4
gi|17691|lcl|protein:vir:9305 Length: 447 # NCBI annotation: lar...    25   2.7
gi|7301|lcl|protein:vir:96238 Length: 447 # NCBI annotation: ORF...    25   2.7
gi|7643|lcl|protein:vir:96370 Length: 447 # NCBI annotation: ORF...    25   2.7
gi|11588|lcl|protein:vir:78831 Length: 447 # NCBI annotation: te...    25   2.7
gi|15353|lcl|protein:vir:9921 Length: 425 # NCBI annotation: put...    24   3.8

>gi|5525|lcl|protein:vir:95311 Length: 491 # NCBI annotation: putative terminase large subunit # Family: family:all:144 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512256;genbank:gi:89152423;genbank:GeneID:3952979
          Length = 491

 Score = 1020 bits (2637), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 491/491 (100%), Positives = 491/491 (100%) Query: 1 MTDTALSPEEQLIEDIAGFTHDPLGYALYAFPWGEEGTELAHATGPRQWQADAFREIRDH 60 MTDTALSPEEQLIEDIAGFTHDPLGYALYAFPWGEEGTELAHATGPRQWQADAFREIRDH Sbjct: 1 MTDTALSPEEQLIEDIAGFTHDPLGYALYAFPWGEEGTELAHATGPRQWQADAFREIRDH 60 Query: 61 LQNPETRYQPLMLARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 LQNPETRYQPLMLARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE Sbjct: 61 LQNPETRYQPLMLARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 121 IIKWSNLAITKDWFTCTATAMYSNDPGHDKRWRADAIPWSEHNTEAFAGLHNERKRIIVV 180 IIKWSNLAITKDWFTCTATAMYSNDPGHDKRWRADAIPWSEHNTEAFAGLHNERKRIIVV Sbjct: 121 IIKWSNLAITKDWFTCTATAMYSNDPGHDKRWRADAIPWSEHNTEAFAGLHNERKRIIVV 180 Query: 181 FDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSR 240 FDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSR Sbjct: 181 FDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSR 240 Query: 241 TVEGTNKQQLQKWVDDYGEGSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVA 300 TVEGTNKQQLQKWVDDYGEGSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVA Sbjct: 241 TVEGTNKQQLQKWVDDYGEGSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVA 300 Query: 301 HAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVF 360 HAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVF Sbjct: 301 HAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVF 360 Query: 361 IDFGYGTGLKSIGDGWGRTWQLVPFGGASTDPQMLNKRGEMFNSCKTWLRLGGMLDDQET 420 IDFGYGTGLKSIGDGWGRTWQLVPFGGASTDPQMLNKRGEMFNSCKTWLRLGGMLDDQET Sbjct: 361 IDFGYGTGLKSIGDGWGRTWQLVPFGGASTDPQMLNKRGEMFNSCKTWLRLGGMLDDQET 420 Query: 421 ADDLSTAEYKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPVSKRLRIPGQQNQQ 480 ADDLSTAEYKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPVSKRLRIPGQQNQQ Sbjct: 421 ADDLSTAEYKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPVSKRLRIPGQQNQQ 480 Query: 481 GKAITDYDPYA 491 GKAITDYDPYA Sbjct: 481 GKAITDYDPYA 491 >gi|16210|lcl|protein:vir:7319 Length: 491 # NCBI annotation: terminase large subunit # Family: family:all:144 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: 
genbank:acc:NP_848210;genbank:gi:30387381;genbank:GeneID :2641874 Length = 491 Score = 972 bits (2512), Expect = 0.0, Method: Compositional matrix adjust. Identities = 463/490 (94%), Positives = 479/490 (97%) Query: 1 MTDTALSPEEQLIEDIAGFTHDPLGYALYAFPWGEEGTELAHATGPRQWQADAFREIRDH 60 MT A+S EEQL+EDIA FT+DPLGYALYAFPWGE+GTELAHATGPR+WQADAFREIRDH Sbjct: 1 MTAAAISTEEQLVEDIASFTYDPLGYALYAFPWGEDGTELAHATGPRKWQADAFREIRDH 60 Query: 61 LQNPETRYQPLMLARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 LQNP TR+QPLMLARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE Sbjct: 61 LQNPATRHQPLMLARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPE 120 Query: 121 IIKWSNLAITKDWFTCTATAMYSNDPGHDKRWRADAIPWSEHNTEAFAGLHNERKRIIVV 180 IIKWSNLAITK+WFTCTATAMYSNDPGHDKRWRADAIPWSEHNTEAFAGLHNERKRIIVV Sbjct: 121 IIKWSNLAITKEWFTCTATAMYSNDPGHDKRWRADAIPWSEHNTEAFAGLHNERKRIIVV 180 Query: 181 FDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKTAQIDSR 240 FDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWK AQIDSR Sbjct: 181 FDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHRWKCAQIDSR 240 Query: 241 TVEGTNKQQLQKWVDDYGEGSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVA 300 TVEGTNKQQLQKWVDDYGE SDFVK+RVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVA Sbjct: 241 TVEGTNKQQLQKWVDDYGEESDFVKVRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQVA 300 Query: 301 HAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVF 360 HAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVF Sbjct: 301 HAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFEDQYQADAVF 360 Query: 361 IDFGYGTGLKSIGDGWGRTWQLVPFGGASTDPQMLNKRGEMFNSCKTWLRLGGMLDDQET 420 IDFGYGTGLKSIGDGWGRTWQL+PFGG STDPQMLNKRGEMFNSCKTWL+LGG LDDQET Sbjct: 361 IDFGYGTGLKSIGDGWGRTWQLIPFGGGSTDPQMLNKRGEMFNSCKTWLKLGGALDDQET 420 Query: 421 ADDLSTAEYKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPVSKRLRIPGQQNQQ 480 ADDLS AEYKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPV+K LRIPGQ++QQ Sbjct: 421 ADDLSAAEYKVRVDGKIVIEPKEDIKERLGRSPGKGDALLLTFAFPVTKHLRIPGQESQQ 480 Query: 481 GKAITDYDPY 490 GKA+T+YDP+ Sbjct: 481 GKAVTEYDPW 490 >gi|10609|lcl|protein:vir:106508 
Length: 492 # NCBI annotation: Pas60 # Family: family:all:144 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024846;genbank:gi:48697461;genbank:GeneID :2846165 Length = 492 Score = 81.6 bits (200), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 109/426 (25%), Positives = 167/426 (39%), Gaps = 61/426 (14%) Query: 79 HGIGKSAFISMLINWGMST----CEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITKDWF 134 HG+GKS ++L+NW +T +D K++ TA+ L WPEI KW+ + F Sbjct: 58 HGLGKSFSGAILVNWFATTRDLMGKDWKIITTASAWRHLEVYLWPEIHKWAG----RINF 113 Query: 135 TCTATAMYSNDPGHD------KRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIA 188 A Y +P + K A + + E G H E ++ + DEA + Sbjct: 114 VALGRAPY--NPRTELLDLRLKLTHGAATAVASNQPERIEGAHAEE--LLYLLDEAKIVP 169 Query: 189 DLVWEVAEGALTDEDTEI----IWVAFGNPTRNTGRFRECFRK---YKHRW------KTA 235 W+ EGA ++ ++ A P +GRF + R+ Y+ W + A Sbjct: 170 PATWDSIEGAFSNAGVDVADNAYAFAMSTPGAPSGRFYDIHRRAPGYEDWWTRHVTLEEA 229 Query: 236 QIDSRTVEGTNKQQLQKWVDDYGEGSDFVKIRVRGIFPDASELQFIPTGLTDEAMKRV-- 293 R Q+ +W G S RV G F + E IP + A++R Sbjct: 230 IASGRISRAWADQRRSQW----GSDSAVFHNRVLGEFHASDEDSVIPLAWLEAAIERWHE 285 Query: 294 --VTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADFE 351 P+ GVD G D+ V+ R G V N+ D + I E Sbjct: 286 WDRQGRPSPGGPLWTGVDVGRGG-DETVLAARDGW--AVTLETNRRRDTMATVGLIQARE 342 Query: 352 DQYQADAVFIDFGYGTGLKSIGDGWGRTWQLVPFGGA-STDPQMLNKRGEMFNSCKTWLR 410 + D + + G L+ +G T L G A +T K G + Sbjct: 343 GRAIIDVIGLGAGVFDRLRELG-----TRPLAYTGSAGATVRDRSGKFGFTNTRSAAYWN 397 Query: 411 LGGMLD-----------DQETADDLSTAEYKVR--VDGKIVIEPKEDIKERLGRSPGKGD 457 L +LD D DL+T ++V V KI +EPK+ + ERLGRSP +GD Sbjct: 398 LRELLDPAFDPVLALPPDDLMISDLTTPHWEVTTGVPPKIKVEPKDKVVERLGRSPDRGD 457 Query: 458 ALLLTF 463 A+ ++ Sbjct: 458 AIAMSL 463 >gi|3478|lcl|protein:vir:101370 Length: 494 # NCBI annotation: PacB # Family: family:all:144 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006576;genbank:gi:46401730;genbank:GeneID :2777432 Length = 494 Score = 55.1 bits (131), Expect = 2e-09, Method: Compositional matrix 
adjust. Identities = 57/236 (24%), Positives = 102/236 (43%), Gaps = 21/236 (8%) Query: 77 SGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWSNLAITK----- 131 SGHG GKS S++ + +V++ AN Q+ + I A+++ Sbjct: 56 SGHGTGKSDMTSIIAILFIMFFPGARVILVANKRQQVLDGIFKYIKSNWATAVSRFPWLS 115 Query: 132 DWFTCTATAMYSNDPGHDKRWRADAIPWSEHNTEAFAGLHNERKRIIVVFDEASNIADLV 191 +F T T+ + + W N EA AG H + ++ + DEAS ++D Sbjct: 116 KYFILTETSFF--EVTGKGVWTILIKSCRPGNEEALAGEHADH--LLYIIDEASGVSDKA 171 Query: 192 WEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECFRKYKHR-------WKTAQIDSRTVEG 244 + V GALT +D I+ ++ PTR +G F + + R + ++S Sbjct: 172 FSVITGALTGKDNRILLLS--QPTRPSGYFYDSHHRLAIRPGNPDGLFTAIILNSEESPL 229 Query: 245 TNKQQLQKWVDDYGEGSD--FVKIRVRGIFPDASELQFIPTGLTDEAMKRVVTAAQ 298 + + ++ + +YG G D I+VRG FP + + + + A +R V A+ Sbjct: 230 VDAKFIRAKLAEYG-GRDNPMYMIKVRGEFPKSQDGFLLGRDEVERATRRKVKIAK 284 >gi|8997|lcl|protein:vir:101640 Length: 432 # NCBI annotation: terminase large subunit # Family: family:all:144 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112491;genbank:gi:53793591;uniprot:Q5ZGG2 ;genbank:GeneID:3101748 Length = 432 Score = 34.7 bits (78), Expect = 0.003, Method: Compositional matrix adjust. Identities = 23/86 (26%), Positives = 41/86 (47%), Gaps = 3/86 (3%) Query: 381 QLVPFGGASTDPQMLNKRGEMFNSCKTWLRL-GGMLDDQETADDLSTAEYKVRVDGKIVI 439 Q+V + T Q K E+ S ++ ++D+ T + K+ DGK+ + Sbjct: 340 QVVQYQNLKT--QCYYKLAEVIQSNNLYIHSEDATVNDEITKELEQVKRDKIDSDGKLQL 397 Query: 440 EPKEDIKERLGRSPGKGDALLLTFAF 465 K+ +K+ +GRSP DAL++ F Sbjct: 398 ISKDKVKQAIGRSPDYSDALMMRMYF 423 >gi|10218|lcl|protein:vir:107805 Length: 533 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:144 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996635;genbank:gi:45580769;genbank:GeneID :2767881 Length = 533 Score = 34.7 bits (78), Expect = 0.003, Method: Compositional matrix adjust. 
Identities = 60/230 (26%), Positives = 89/230 (38%), Gaps = 27/230 (11%) Query: 279 QFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWT--GNK 336 Q IPT + A R ++A + GVD A G D+ ++ R + V T G Sbjct: 312 QVIPTAWVEAAQARWKRPDRLAPMDSL-GVDVARGGRDNTILARRHAMWFDVPLTYPGKD 370 Query: 337 TTDDLIMAK-RIADFEDQYQADAVFIDFGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML 395 T D +A IA D I G S D + Q V + + Sbjct: 371 TPDGPTVAGLAIAALRDHAVIHLDVIGVG-----ASPYDFLAQAKQQVVGVNVAEAARGT 425 Query: 396 NKRGEM--FN-SCKTWLRLGGMLD-----------DQETADDLSTAEYKVRVDGKIVIEP 441 +K G + FN + W R+ LD D DL+ + + + + Sbjct: 426 DKSGRLRFFNLRSELWWRMREALDPTNNTGIALPPDPRLLADLTAPTWSLS-GATLKVAS 484 Query: 442 KEDIKERLGRSPGKGDALLLTFAFPVSKRLRIPGQQNQQGKAITDYDPYA 491 +EDI E++GRSP G A +L KR + + Q ++ DYDPYA Sbjct: 485 REDIIEKIGRSPDFGSAYVLAL-MDTPKRAAV--EALGQARSRLDYDPYA 531 >gi|6515|lcl|protein:vir:98503 Length: 533 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:144 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996587;genbank:gi:45569518;genbank:GeneID :2767831 Length = 533 Score = 34.7 bits (78), Expect = 0.003, Method: Compositional matrix adjust. 
Identities = 60/230 (26%), Positives = 89/230 (38%), Gaps = 27/230 (11%) Query: 279 QFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWT--GNK 336 Q IPT + A R ++A + GVD A G D+ ++ R + V T G Sbjct: 312 QVIPTAWVEAAQARWKRPDRLAPMDSL-GVDVARGGRDNTILARRHAMWFDVPLTYPGKD 370 Query: 337 TTDDLIMAK-RIADFEDQYQADAVFIDFGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML 395 T D +A IA D I G S D + Q V + + Sbjct: 371 TPDGPTVAGLAIAALRDHAVIHLDVIGVG-----ASPYDFLAQAKQQVVGVNVAEAARGT 425 Query: 396 NKRGEM--FN-SCKTWLRLGGMLD-----------DQETADDLSTAEYKVRVDGKIVIEP 441 +K G + FN + W R+ LD D DL+ + + + + Sbjct: 426 DKSGRLRFFNLRSELWWRMREALDPTNNTGIALPPDPRLLADLTAPTWSLS-GATLKVAS 484 Query: 442 KEDIKERLGRSPGKGDALLLTFAFPVSKRLRIPGQQNQQGKAITDYDPYA 491 +EDI E++GRSP G A +L KR + + Q ++ DYDPYA Sbjct: 485 REDIIEKIGRSPDFGSAYVLAL-MDTPKRAAV--EALGQARSRLDYDPYA 531 >gi|4516|lcl|protein:vir:107432 Length: 533 # NCBI annotation: Bbp25 # Family: family:all:144 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958694;genbank:gi:41179386;genbank:GeneID :2717226 Length = 533 Score = 34.7 bits (78), Expect = 0.003, Method: Compositional matrix adjust. 
Identities = 60/230 (26%), Positives = 89/230 (38%), Gaps = 27/230 (11%) Query: 279 QFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWT--GNK 336 Q IPT + A R ++A + GVD A G D+ ++ R + V T G Sbjct: 312 QVIPTAWVEAAQARWKRPDRLAPMDSL-GVDVARGGRDNTILARRHAMWFDVPLTYPGKD 370 Query: 337 TTDDLIMAK-RIADFEDQYQADAVFIDFGYGTGLKSIGDGWGRTWQLVPFGGASTDPQML 395 T D +A IA D I G S D + Q V + + Sbjct: 371 TPDGPTVAGLAIAALRDHAVIHLDVIGVG-----ASPYDFLAQAKQQVVGVNVAEAARGT 425 Query: 396 NKRGEM--FN-SCKTWLRLGGMLD-----------DQETADDLSTAEYKVRVDGKIVIEP 441 +K G + FN + W R+ LD D DL+ + + + + Sbjct: 426 DKSGRLRFFNLRSELWWRMREALDPTNNTGIALPPDPRLLADLTAPTWSLS-GATLKVAS 484 Query: 442 KEDIKERLGRSPGKGDALLLTFAFPVSKRLRIPGQQNQQGKAITDYDPYA 491 +EDI E++GRSP G A +L KR + + Q ++ DYDPYA Sbjct: 485 REDIIEKIGRSPDFGSAYVLAL-MDTPKRAAV--EALGQARSRLDYDPYA 531 >gi|5891|lcl|protein:vir:107098 Length: 421 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950600;genbank:gi:119953680;genbank:GeneI D:4643107 Length = 421 Score = 28.1 bits (61), Expect = 0.23, Method: Compositional matrix adjust. Identities = 20/57 (35%), Positives = 28/57 (49%), Gaps = 2/57 (3%) Query: 69 QPLMLARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWS 125 + L + G G GKS+ IS++I + VV TDN L T + E IKW+ Sbjct: 26 EKLNIVAKGGRGSGKSSDISIIIT-QLIMRYPMNAVVVRKTDNTLATSVF-EQIKWA 80 >gi|17761|lcl|protein:vir:171 Length: 470 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112076;genbank:gi:13559866;genbank:GeneID :921009 Length = 470 Score = 28.1 bits (61), Expect = 0.24, Method: Compositional matrix adjust. 
Identities = 15/49 (30%), Positives = 27/49 (55%), Gaps = 1/49 (2%) Query: 423 DLSTAEYKVRVDGKIVIEPKEDIKERLG-RSPGKGDALLLTFAFPVSKR 470 +L+ + K +GK+ + K ++K++LG SP DAL++ P R Sbjct: 409 ELTQIQRKFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCPALAR 457 >gi|2819|lcl|protein:vir:105705 Length: 439 # NCBI annotation: gp2 # Family: family:all:54 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224140;genbank:gi:62362215;genbank:GeneID :3342458 Length = 439 Score = 28.1 bits (61), Expect = 0.26, Method: Compositional matrix adjust. Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 9/69 (13%) Query: 166 AFAGL-HN-----ERKRIIVVF-DEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNT 218 F GL HN + RI+V + DEA +++ W+ + +E +E IWV + NP ++ Sbjct: 108 VFCGLRHNLDSIKSKARILVAWVDEAESVSSTAWKKLRPTVREEGSE-IWVTW-NPEKDG 165 Query: 219 GRFRECFRK 227 + FRK Sbjct: 166 SATDKLFRK 174 >gi|5142|lcl|protein:vir:105417 Length: 470 # NCBI annotation: gene 2 protein # Family: family:all:54 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958178;genbank:gi:41057280;genbank:GeneID :2716664 Length = 470 Score = 27.7 bits (60), Expect = 0.35, Method: Compositional matrix adjust. Identities = 15/49 (30%), Positives = 27/49 (55%), Gaps = 1/49 (2%) Query: 423 DLSTAEYKVRVDGKIVIEPKEDIKERLG-RSPGKGDALLLTFAFPVSKR 470 +L+ + K +GK+ + K ++K++LG SP DAL++ P R Sbjct: 409 ELTQIQRKFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCPALVR 457 >gi|10799|lcl|protein:vir:78055 Length: 440 # NCBI annotation: TerL # Family: family:all:54 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468786;genbank:gi:157325367;genbank:Ge neID:5601817 Length = 440 Score = 27.3 bits (59), Expect = 0.44, Method: Compositional matrix adjust. 
Identities = 16/66 (24%), Positives = 30/66 (45%), Gaps = 3/66 (4%) Query: 78 GHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWS--NLAITKDWFT 135 G G GKS+F+++++ + V+ D +RT P+ +W+ L ++ W T Sbjct: 51 GRGSGKSSFVALMVVDEIMKDPQANAVIFRKVDEGMRTTLLPQ-YQWAIDQLGVSGAWRT 109 Query: 136 CTATAM 141 M Sbjct: 110 SLQPMM 115 >gi|11068|lcl|protein:vir:78311 Length: 547 # NCBI annotation: putative terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468641;genbank:gi:157325219;genbank:Ge neID:5601657 Length = 547 Score = 27.3 bits (59), Expect = 0.45, Method: Compositional matrix adjust. Identities = 15/41 (36%), Positives = 23/41 (56%), Gaps = 3/41 (7%) Query: 78 GHGIGKSAFISMLINWGMSTCE---DCKVVVTANTDNQLRT 115 G G GK+ FIS L N+ +S + V V AN+++Q + Sbjct: 94 GRGGGKNGFISTLSNYFISPLHGINNYDVSVVANSEDQAKV 134 >gi|5208|lcl|protein:vir:106641 Length: 446 # NCBI annotation: ORF005 # Family: family:all:54 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239489;genbank:gi:66395220;genbank:GeneID :4555795 Length = 446 Score = 27.3 bits (59), Expect = 0.45, Method: Compositional matrix adjust. Identities = 14/58 (24%), Positives = 28/58 (48%) Query: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIA 348 KR+++ +V H P G+D Y A I+++ +K L+ ++ ++ IA Sbjct: 263 KRIISDKEVGHLPSYFGLDFGYVNDPSAFIHVKIDNDNKKLYVISEYVKKGMLNNEIA 320 >gi|4914|lcl|protein:vir:99572 Length: 459 # NCBI annotation: TerL-like protein # Family: family:all:54 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039811;genbank:gi:126011061;genbank:Ge neID:4818267 Length = 459 Score = 27.3 bits (59), Expect = 0.47, Method: Compositional matrix adjust. 
Identities = 20/90 (22%), Positives = 43/90 (47%), Gaps = 11/90 (12%) Query: 378 RTWQLVPFGGASTDPQMLNKRGEMFNSCKTWLRLGGMLDDQETADDLSTAEYKVRVDGKI 437 +T+++V +G ++++ E + +LD + +L++ V GK Sbjct: 370 KTYEVVTYGANHPHDELISISSEHVPA--------KILDKLKI--ELASPHKDVDGMGKF 419 Query: 438 VIEPKEDIKERLG-RSPGKGDALLLTFAFP 466 +E K+D++E+ G +SP DA ++ P Sbjct: 420 KVESKKDMREKRGIKSPNIADAFIMAMIQP 449 >gi|10483|lcl|protein:vir:105297 Length: 446 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950664;genbank:gi:119967835;genbank:GeneI D:4643176 Length = 446 Score = 26.2 bits (56), Expect = 0.97, Method: Compositional matrix adjust. Identities = 19/57 (33%), Positives = 27/57 (47%), Gaps = 2/57 (3%) Query: 69 QPLMLARASGHGIGKSAFISMLINWGMSTCEDCKVVVTANTDNQLRTKTWPEIIKWS 125 + L + G G GKS+ IS++I + VV DN L T + E IKW+ Sbjct: 25 EKLNIVAKGGRGSGKSSDISIIIT-QLIMRYPMNAVVVRKADNTLATSVF-EQIKWA 79 >gi|11501|lcl|protein:vir:78718 Length: 574 # NCBI annotation: DNA packaging protein # Family: family:all:697 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285469;genbank:gi:148724503;genbank:Ge neID:5220189 Length = 574 Score = 25.0 bits (53), Expect = 2.0, Method: Compositional matrix adjust. Identities = 15/74 (20%), Positives = 33/74 (44%), Gaps = 3/74 (4%) Query: 299 VAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIAD---FEDQYQ 355 V ++ I+ VDP+ G D+ V + + + K D + ++D +Y+ Sbjct: 342 VPYSETIVSVDPSGRGTDETVAVVLSQANGYIFVRDMKAFRDGYSDETLSDIVRLGKRYK 401 Query: 356 ADAVFIDFGYGTGL 369 A + ++ +G G+ Sbjct: 402 ASKLLVESNFGDGM 415 >gi|6775|lcl|protein:vir:96067 Length: 460 # NCBI annotation: conserved hypothetical protein ORF004 # Family: family:all:54 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294421;genbank:gi:149408318;genbank:Ge neID:5237186 Length = 460 Score = 25.0 bits (53), Expect = 2.1, Method: Compositional matrix adjust. 
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 2/48 (4%) Query: 178 IVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECF 225 I+ +EA + WEV E + E++E IW+ F NP T + F Sbjct: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWIIF-NPNEVTDFVYQNF 160 >gi|2779|lcl|protein:vir:99776 Length: 425 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004302;genbank:gi:122891756;genbank:Ge neID:4712331 Length = 425 Score = 24.6 bits (52), Expect = 2.4, Method: Compositional matrix adjust. Identities = 14/63 (22%), Positives = 27/63 (42%) Query: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 KR++ ++ H P G+D Y A I+ + + K L+ + ++ IA+ Sbjct: 241 KRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKIDVKKKKLYIIEEYVKQGMLNDEIANV 300 Query: 351 EDQ 353 Q Sbjct: 301 IKQ 303 >gi|9829|lcl|protein:vir:103950 Length: 425 # NCBI annotation: phage terminase large subunit # Family: family:all:54 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873987;genbank:gi:118430762;genbank:GeneI D:4525444 Length = 425 Score = 24.6 bits (52), Expect = 2.4, Method: Compositional matrix adjust. Identities = 14/63 (22%), Positives = 27/63 (42%) Query: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 KR++ ++ H P G+D Y A I+ + + K L+ + ++ IA+ Sbjct: 241 KRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKIDVKKKKLYIIEEYVKQGMLNDEIANV 300 Query: 351 EDQ 353 Q Sbjct: 301 IKQ 303 >gi|17691|lcl|protein:vir:9305 Length: 447 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803283;genbank:gi:29028593;genbank:GeneID :1258039 Length = 447 Score = 24.6 bits (52), Expect = 2.7, Method: Compositional matrix adjust. 
Identities = 14/63 (22%), Positives = 27/63 (42%) Query: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 KR++ ++ H P G+D Y A I+ + + K L+ + ++ IA+ Sbjct: 263 KRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKIDVKKKKLYIIEEYVKQGMLNDEIANV 322 Query: 351 EDQ 353 Q Sbjct: 323 IKQ 325 >gi|7301|lcl|protein:vir:96238 Length: 447 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239566;genbank:gi:66395301;genbank:GeneID :5132787 Length = 447 Score = 24.6 bits (52), Expect = 2.7, Method: Compositional matrix adjust. Identities = 14/63 (22%), Positives = 27/63 (42%) Query: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLWTGNKTTDDLIMAKRIADF 350 KR++ ++ H P G+D Y A I+ + + K L+ + ++ IA+ Sbjct: 263 KRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKIDVKKKKLYIIEEYVKQGMLNDEIANV 322 Query: 351 EDQ 353 Q Sbjct: 323 IKQ 325 >gi|7643|lcl|protein:vir:96370 Length: 447 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239642;genbank:gi:66395379;genbank:GeneID :5132846 Length = 447 Score = 24.6 bits (52), Expect = 2.7, Method: Compositional matrix adjust. Identities = 11/42 (26%), Positives = 20/42 (47%) Query: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLW 332 KR++ ++ H P G+D Y A I+ + + K L+ Sbjct: 263 KRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKIDVKKKKLY 304 >gi|11588|lcl|protein:vir:78831 Length: 447 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285355;genbank:gi:148717883;genbank:Ge neID:5246962 Length = 447 Score = 24.6 bits (52), Expect = 2.7, Method: Compositional matrix adjust. 
Identities = 11/42 (26%), Positives = 20/42 (47%) Query: 291 KRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKVLW 332 KR++ ++ H P G+D Y A I+ + + K L+ Sbjct: 263 KRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKIDVKKKKLY 304 >gi|15353|lcl|protein:vir:9921 Length: 425 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795683;genbank:gi:28876465;genbank:GeneID :1257982 Length = 425 Score = 24.3 bits (51), Expect = 3.8, Method: Compositional matrix adjust. Identities = 32/182 (17%), Positives = 73/182 (40%), Gaps = 30/182 (16%) Query: 167 FAGLHNERKRIIVVFDEASNIADLVWEVAEGALTDEDTEI-------------IWVAFGN 213 F G+ N K I+D+V E A D+ T++ I++ F Sbjct: 119 FKGMDNPEK-----IKSIKGISDVVMEEASEFTLDDYTQLTLRLRDKKHLEKQIYLMFNP 173 Query: 214 PTRNTGRFRECFRKYKHR---WKTAQIDSRTVEGTNKQQLQKWVDDYGEGSDFVKIRVRG 270 ++ ++ F K ++T D+R ++ ++ +++ + + KI G Sbjct: 174 VSKVNWVYKAFFVKTPKNTVVYQTTYKDNRFLDDVTRENIEELAN---RNEAYYKIYALG 230 Query: 271 IFPDASELQFIPTGLTDEAMKRVVTAAQVAHAPVIIGVDPAYSGVDDAVIYLRQGLHSKV 330 F +L F + K+++ +++H P G+D + A+++++ +K Sbjct: 231 QFATLDKLIF------PKYDKQILNKDKLSHLPSFFGLDYGFINDPSALLHVKIDDANKK 284 Query: 331 LW 332 L+ Sbjct: 285 LY 286 Database: capsid_neck_tail Posted date: Nov 7, 2013 12:16 PM Number of letters in database: 206,069 Number of sequences in database: 514 Lambda K H 0.318 0.135 0.423 Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 250,502 Number of Sequences: 514 Number of extensions: 12225 Number of successful extensions: 63 Number of sequences better than 100.0: 29 Number of HSP's better than 100.0 without gapping: 17 Number of HSP's successfully gapped in prelim test: 12 Number of HSP's that attempted gapping in prelim test: 34 Number of HSP's gapped (non-prelim): 34 length of query: 491 length of database: 206,069 effective HSP length: 75 effective length of query: 416 effective length of database: 167,519 effective search space: 69687904 
effective search space used: 69687904
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 39 (19.6 bits)