BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_019767.1_cdsid_YP_007151617.1 [gene=F865_gp10] [protein=major tail subunit] [protein_id=YP_007151617.1] [location=6684..7388]
         (234 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gi|15619|lcl|protein:vir:196 Length: 234 # NCBI annotation: majo...   475   e-136
gi|839|lcl|protein:vir:93600 Length: 281 # NCBI annotation: puta...   327   9e-92
gi|13942|lcl|protein:vir:1439 Length: 152 # NCBI annotation: put...   113   2e-27
gi|8571|lcl|protein:vir:100083 Length: 152 # NCBI annotation: gp...   113   2e-27
gi|8001|lcl|protein:vir:100241 Length: 154 # NCBI annotation: gp...   102   3e-24
gi|12836|lcl|protein:vir:80349 Length: 198 # NCBI annotation: gp...   100   1e-23
gi|2833|lcl|protein:vir:105771 Length: 245 # NCBI annotation: gp...    40   3e-05
gi|6897|lcl|protein:vir:106581 Length: 159 # NCBI annotation: pu...    36   5e-04
gi|15109|lcl|protein:vir:3875 Length: 202 # NCBI annotation: maj...    26   0.47
gi|1299|lcl|protein:vir:105078 Length: 155 # NCBI annotation: ma...    25   0.71
gi|996|lcl|protein:vir:5746 Length: 150 # NCBI annotation: hypot...    25   0.83
gi|13389|lcl|protein:vir:1263 Length: 416 # NCBI annotation: hyp...    22   8.0
gi|13914|lcl|protein:vir:9881 Length: 168 # NCBI annotation: hyp...    22   8.1

>gi|15619|lcl|protein:vir:196 Length: 234 # NCBI annotation: major tail subunit # Family: family:all:628 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037706;genbank:gi:9634159;genbank:GeneID:1262542
          Length = 234

 Score = 475 bits (1223), Expect = e-136, Method: Compositional matrix adjust.
 Identities = 234/234 (100%), Positives = 234/234 (100%)

Query: 1   MSALYEKSQLTKILISSLPSTKETMDSATFLDLSCTIKEIQFTGGQKQDIDVTTLCSTEQ 60
           MSALYEKSQLTKILISSLPSTKETMDSATFLDLSCTIKEIQFTGGQKQDIDVTTLCSTEQ
Sbjct: 1   MSALYEKSQLTKILISSLPSTKETMDSATFLDLSCTIKEIQFTGGQKQDIDVTTLCSTEQ 60

Query: 61  ENINGLPSPSEISLSGNFYNNPAQDALRDAYDNDTTYGFQIIFPSGNGFKFLAEVRQHTW 120
           ENINGLPSPSEISLSGNFYNNPAQDALRDAYDNDTTYGFQIIFPSGNGFKFLAEVRQHTW
Sbjct: 61  ENINGLPSPSEISLSGNFYNNPAQDALRDAYDNDTTYGFQIIFPSGNGFKFLAEVRQHTW 120

Query: 121 SSGTNGVVAATFSLRLKGKPVPIDSVLKLTTDLPSSLSVAVGAAISMAVVAAGGKPPYAY 180
           SSGTNGVVAATFSLRLKGKPVPIDSVLKLTTDLPSSLSVAVGAAISMAVVAAGGKPPYAY
Sbjct: 121 SSGTNGVVAATFSLRLKGKPVPIDSVLKLTTDLPSSLSVAVGAAISMAVVAAGGKPPYAY 180

Query: 181 TWKKAGSTVSGQTSDTFNKATAVSGDAGDYTCVVTDSSSPVKTVTSAACTLTIS 234
           TWKKAGSTVSGQTSDTFNKATAVSGDAGDYTCVVTDSSSPVKTVTSAACTLTIS
Sbjct: 181 TWKKAGSTVSGQTSDTFNKATAVSGDAGDYTCVVTDSSSPVKTVTSAACTLTIS 234

>gi|839|lcl|protein:vir:93600 Length: 281 # NCBI annotation: putative major tail subunit # Family: family:all:628 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449301;genbank:gi:157166049;goa:Q6H9U0;interpro:IPR007110;interpro:IPR013098;uniprot:Q6H9U0;genbank:GeneID:5580422
          Length = 281

 Score = 327 bits (837), Expect = 9e-92, Method: Compositional matrix adjust.
 Identities = 156/234 (66%), Positives = 185/234 (79%)

Query: 1   MSALYEKSQLTKILISSLPSTKETMDSATFLDLSCTIKEIQFTGGQKQDIDVTTLCSTEQ 60
           MSALYE+SQLT+++ISS P+T ETM+ A +L L CTIKE+QFT GQKQDIDVTTLCSTEQ
Sbjct: 44  MSALYERSQLTQVMISSAPATAETMEKAEYLRLDCTIKEVQFTAGQKQDIDVTTLCSTEQ 103

Query: 61  ENINGLPSPSEISLSGNFYNNPAQDALRDAYDNDTTYGFQIIFPSGNGFKFLAEVRQHTW 120
           ENINGL + SEIS+SGNFY N AQ+ALRDAYDNDT Y F++ FPSG GFKFLAEVRQHTW
Sbjct: 104 ENINGLGASSEISMSGNFYLNQAQNALRDAYDNDTVYAFKVQFPSGKGFKFLAEVRQHTW 163

Query: 121 SSGTNGVVAATFSLRLKGKPVPIDSVLKLTTDLPSSLSVAVGAAISMAVVAAGGKPPYAY 180
           SSGTNGVVAATFSLRLKGKPV     L    +L  +L+V  GA ++M+V   GG PPY +
Sbjct: 164 SSGTNGVVAATFSLRLKGKPVSYVVPLAFVKNLEKTLTVNTGALLTMSVSVNGGTPPYKH 223

Query: 181 TWKKAGSTVSGQTSDTFNKATAVSGDAGDYTCVVTDSSSPVKTVTSAACTLTIS 234
            WKK G  V GQT+DTF+KA   SGD G YTC VTDS+   +++TS ACT+T++
Sbjct: 224 AWKKDGQPVEGQTTDTFSKANTQSGDKGAYTCEVTDSAEQPQSITSDACTVTVN 277

>gi|13942|lcl|protein:vir:1439 Length: 152 # NCBI annotation: putative major tail subunit protein # Family: family:all:628 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536368;genbank:gi:17975173;genbank:GeneID:929142
          Length = 152

 Score = 113 bits (283), Expect = 2e-27, Method: Compositional matrix adjust.
 Identities = 56/135 (41%), Positives = 84/135 (62%), Gaps = 1/135 (0%)

Query: 7   KSQLTKILISSLPSTKETMDSATFLDLSCTIKEIQFTGGQKQDIDVTTLCSTEQENINGL 66
           K+Q TK+ +S + ST        F+DLS T K+IQ+ GGQ ++ID TT  S E+E+  GL
Sbjct: 10  KAQGTKVEVSKMASTDLDAADLVFVDLSATGKQIQWQGGQSEEIDATTFASDEKESELGL 69

Query: 67  PSPSEISLSGNFY-NNPAQDALRDAYDNDTTYGFQIIFPSGNGFKFLAEVRQHTWSSGTN 125
           P P E S+ GN+  N+  Q+ LR A      Y F++ F   + F F+  VRQ+TW++  N
Sbjct: 70  PDPGEFSVDGNYQSNDEGQNILRAARATGEKYVFRVTFADKSQFLFVGMVRQYTWAASVN 129

Query: 126 GVVAATFSLRLKGKP 140
           G+++AT+S+R+ G P
Sbjct: 130 GLISATYSVRVSGAP 144

>gi|8571|lcl|protein:vir:100083 Length: 152 # NCBI annotation: gp11 # Family: family:all:628 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945041;genbank:gi:38707901;genbank:GeneID:2744130
          Length = 152

 Score = 113 bits (282), Expect = 2e-27, Method: Compositional matrix adjust.
 Identities = 56/135 (41%), Positives = 84/135 (62%), Gaps = 1/135 (0%)

Query: 7   KSQLTKILISSLPSTKETMDSATFLDLSCTIKEIQFTGGQKQDIDVTTLCSTEQENINGL 66
           K+Q TK+ +S + ST        F+DLS T K+IQ+ GGQ ++ID TT  S E+E+  GL
Sbjct: 10  KAQGTKVEVSKVASTDLDAADLAFVDLSATGKQIQWQGGQSEEIDATTFASDEKESELGL 69

Query: 67  PSPSEISLSGNFY-NNPAQDALRDAYDNDTTYGFQIIFPSGNGFKFLAEVRQHTWSSGTN 125
           P P E S+ GN+  N+  Q+ LR A      Y F++ F   + F F+  VRQ+TW++  N
Sbjct: 70  PDPGEFSVDGNYQSNDEGQNILRAARATGEKYVFRVTFADKSQFLFVGMVRQYTWAASVN 129

Query: 126 GVVAATFSLRLKGKP 140
           G+++AT+S+R+ G P
Sbjct: 130 GLISATYSVRVSGAP 144

>gi|8001|lcl|protein:vir:100241 Length: 154 # NCBI annotation: gp70 # Family: family:all:628 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355406;genbank:gi:77864696;genbank:GeneID:3725963
          Length = 154

 Score = 102 bits (255), Expect = 3e-24, Method: Compositional matrix adjust.
 Identities = 54/135 (40%), Positives = 82/135 (60%), Gaps = 1/135 (0%)

Query: 7   KSQLTKILISSLPSTKETMDSATFLDLSCTIKEIQFTGGQKQDIDVTTLCSTEQENINGL 66
           K+Q TK+ +S   ST    ++  F+DL+ T K IQ+ GGQ  +ID TTL S E+E   GL
Sbjct: 9   KAQGTKVEVSKTVSTDLDDNTLVFVDLNTTGKTIQWQGGQSSEIDATTLASEEKEYELGL 68

Query: 67  PSPSEISLSGNF-YNNPAQDALRDAYDNDTTYGFQIIFPSGNGFKFLAEVRQHTWSSGTN 125
           P P E S+ GN+  ++  Q  LR A  +   + F++ F   + F F+  VRQ+TWS+  +
Sbjct: 69  PDPGEFSVDGNYSSDDEGQSLLRTARASGEKHVFRVTFADQSQFLFVGMVRQYTWSAAVD 128

Query: 126 GVVAATFSLRLKGKP 140
           G+V +T+S+R+ G P
Sbjct: 129 GIVTSTYSVRVSGAP 143

>gi|12836|lcl|protein:vir:80349 Length: 198 # NCBI annotation: gp12 # Family: family:all:628 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111091;genbank:gi:134288619;genbank:GeneID:4960596
          Length = 198

 Score = 100 bits (250), Expect = 1e-23, Method: Compositional matrix adjust.
 Identities = 49/135 (36%), Positives = 82/135 (60%), Gaps = 1/135 (0%)

Query: 7   KSQLTKILISSLPSTKETMDSATFLDLSCTIKEIQFTGGQKQDIDVTTLCSTEQENINGL 66
           +SQ TK+ +S +PS     +  TF+DL+ T K+IQ+ GGQ ++ID TT  S ++E+  GL
Sbjct: 56  RSQGTKVEVSKVPSYDLDANDITFVDLNTTTKQIQWQGGQSEEIDATTFASEQKESELGL 115

Query: 67  PSPSEISLSGNF-YNNPAQDALRDAYDNDTTYGFQIIFPSGNGFKFLAEVRQHTWSSGTN 125
             P E S+ GN+  ++  Q  LR A+     +  ++ F   + F  +  VRQ++WS G N
Sbjct: 116 GDPGEFSVQGNYSSDDEGQLILRAAHSTKAKHVLRVTFSDKSQFLMIGMVRQYSWSGGVN 175

Query: 126 GVVAATFSLRLKGKP 140
            ++++++S+RL G P
Sbjct: 176 AIISSSYSIRLSGAP 190

>gi|2833|lcl|protein:vir:105771 Length: 245 # NCBI annotation: gp16 # Family: family:all:628 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224154;genbank:gi:62362229;genbank:GeneID:3342524
          Length = 245

 Score = 40.0 bits (92), Expect = 3e-05, Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 45/79 (56%), Gaps = 3/79 (3%)

Query: 143 IDSVLKLTTDLPSSLSVAVGAAISMAVVA-AGGKPPYAYTWKKAGSTVS-GQTSDTFNKA 200
           + +V+ +TT  P   ++  G  +++ V A         Y WKK G  VS G T+ T+ K+
Sbjct: 153 VGTVITITTQ-PQGKTLTAGDTLTLTVAATVSDSSSLTYQWKKDGINVSSGGTTATYTKS 211

Query: 201 TAVSGDAGDYTCVVTDSSS 219
           +A +GD+G YTC ++ S++
Sbjct: 212 SATTGDSGSYTCQISSSTA 230

>gi|6897|lcl|protein:vir:106581 Length: 159 # NCBI annotation: putative major tail protein # Family: family:all:6477 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958590;genbank:gi:41179249;genbank:GeneID:2717117
          Length = 159

 Score = 35.8 bits (81), Expect = 5e-04, Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 33/75 (44%), Gaps = 1/75 (1%)

Query: 32  DLSCTIKEIQFTGGQKQDIDVTTLCSTEQENINGLPSPSEISLSGNFYNNPAQDALRDAY 91
           D    IK I   GG  + IDVTTL    ++ I G+ + S +     +       AL  A
Sbjct: 43  DEIADIKTIPELGGDTEKIDVTTLADDRRKQIEGIQNASNVQFQAVYKGASFAKALAQAG 102

Query: 92  DNDTTYGFQIIFPSG 106
           D    Y +++ +P G
Sbjct: 103 DR-KQYQWKVTYPDG 116

>gi|15109|lcl|protein:vir:3875 Length: 202 # NCBI annotation: major tail protein # Family: family:all:102 # ACLAME annotation(s): phi:0000082 - phage major tail protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680492;swissprot:trembl:p94216;genbank:gi:22296532;interpro:IPR006490;uniprot:P94216;genbank:GeneID:951722
          Length = 202

 Score = 25.8 bits (55), Expect = 0.47, Method: Compositional matrix adjust.
 Identities = 16/60 (26%), Positives = 27/60 (45%), Gaps = 11/60 (18%)

Query: 54  TLCSTEQENINGLPSPSEISLSGNFYNNPAQD---------ALRDAYDNDTTYGFQIIFP 104
           +L   + + ++G P PS  S+ G+F     QD            D +D D  +G+  +FP
Sbjct: 133 SLPGVDTKTVDGTPDPSADSIEGSFIPRGDQDTGNVVLIGREDNDGFDFDKFHGY--VFP 190

>gi|1299|lcl|protein:vir:105078 Length: 155 # NCBI annotation: major tail shaft subunit # Family: family:all:11396 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006592;genbank:gi:46402098;genbank:GeneID:2777944
          Length = 155

 Score = 25.4 bits (54), Expect = 0.71, Method: Compositional matrix adjust.
 Identities = 18/59 (30%), Positives = 30/59 (50%), Gaps = 5/59 (8%)

Query: 50  IDVTTLCSTEQENINGLPSPSEISLSGNFYNNPAQ---DALRDAYDNDTTYGFQIIFPS 105
           +D TTL   ++++I+ LP   E SL   F ++PA     AL +A +   T    +  P+
Sbjct: 50  VDCTTLKDKQKQSISDLPDGPEKSLG--FIDDPANASFTALLNAAEARETIQLYVELPN 106

>gi|996|lcl|protein:vir:5746 Length: 150 # NCBI annotation: hypothetical protein # Family: family:all:11396 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892057;genbank:gi:33770520;uniprot:Q7Y403;genbank:GeneID:2637472
          Length = 150

 Score = 25.0 bits (53), Expect = 0.83, Method: Compositional matrix adjust.
 Identities = 16/44 (36%), Positives = 24/44 (54%), Gaps = 5/44 (11%)

Query: 42  FTGGQKQDIDVTTLCSTEQENINGLPSPSEISLSGNFYNNPAQD 85
            TGG    +D TTL  T +++I+ LP   E SL   F ++P  +
Sbjct: 44  LTGGY---VDCTTLIDTNKQSISDLPEGPEKSLG--FIDDPENE 82

>gi|13389|lcl|protein:vir:1263 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690755;genbank:gi:22854995;genbank:GeneID:955207
          Length = 416

 Score = 21.9 bits (45), Expect = 8.0, Method: Compositional matrix adjust.
 Identities = 13/26 (50%), Positives = 16/26 (61%)

Query: 136 LKGKPVPIDSVLKLTTDLPSSLSVAV 161
           L+G PV +   L +TTDL S   VAV
Sbjct: 342 LQGLPVYLGLDLSMTTDLTSVGYVAV 367

>gi|13914|lcl|protein:vir:9881 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:629 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795643;genbank:gi:28876398;genbank:GeneID:1257929
          Length = 168

 Score = 21.6 bits (44), Expect = 8.1, Method: Compositional matrix adjust.
 Identities = 8/19 (42%), Positives = 13/19 (68%)

Query: 50  IDVTTLCSTEQENINGLPS 68
           ID++TL S + E +  +PS
Sbjct: 150 IDLSTLSSEDAEKVTAMPS 168

  Database: capsid_neck_tail
    Posted date:  Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database:  514

Lambda     K      H
   0.311    0.126    0.357

Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 100,121
Number of Sequences: 514
Number of extensions: 4321
Number of successful extensions: 22
Number of sequences better than 100.0: 17
Number of HSP's better than 100.0 without gapping: 15
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 3
Number of HSP's gapped (non-prelim): 17
length of query: 234
length of database: 206,069
effective HSP length: 69
effective length of query: 165
effective length of database: 170,603
effective search space: 28149495
effective search space used: 28149495
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 36 (18.5 bits)