BLASTP 2.2.18 [Mar-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_019487.1_cdsid_YP_007003406.1 [gene=F373_gp149]
[protein=tail tube subunit] [protein_id=YP_007003406.1]
[location=71823..72248]
         (141 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

gi|9414|lcl|protein:vir:99300 Length: 142 # NCBI annotation: hyp...   167   4e-44
gi|22597|lcl|protein:vir:95739 Length: 142 # NCBI annotation: OR...   167   4e-44
gi|24940|lcl|protein:vir:80620 Length: 140 # NCBI annotation: gp...   162   2e-42
gi|25129|lcl|protein:vir:80710 Length: 140 # NCBI annotation: st...   160   4e-42
gi|23709|lcl|protein:vir:96672 Length: 121 # NCBI annotation: OR...   149   1e-38
gi|24314|lcl|protein:vir:100800 Length: 155 # NCBI annotation: h...   149   1e-38
gi|21558|lcl|protein:vir:63764 Length: 123 # NCBI annotation: gp...   141   3e-36
gi|15780|lcl|protein:vir:6245 Length: 327 # NCBI annotation: gp3...    26   0.21
gi|976|lcl|protein:vir:6217 Length: 211 # NCBI annotation: hypot...    23   1.1
gi|6927|lcl|protein:vir:106715 Length: 507 # NCBI annotation: gp...    23   1.5
gi|11323|lcl|protein:vir:78588 Length: 507 # NCBI annotation: Te...    23   1.5
gi|4006|lcl|protein:vir:100188 Length: 210 # NCBI annotation: pu...    22   3.7

>gi|9414|lcl|protein:vir:99300 Length: 142 # NCBI annotation: hypothetical protein
          # Family: family:all:2792 # MgeID: mge:1655 # MgeName: K # Cross-refs:
          genbank:acc:YP_024480;genbank:gi:48696439;genbank:GeneID
          :2948028
          Length = 142

 Score =  167 bits (423), Expect = 4e-44, Method: Compositional matrix adjust.
 Identities = 78/138 (56%), Positives = 102/138 (73%)

Query: 1   MATQAKQTVHTGATVLLMIKNKVVGRAQGLDGRRSFGTEGVYEIGSIMPQEHVQNRYEGS 60
           MA++AKQTVHTG TVLLMIK K VGRAQ   G+R +GT GVYEIGSIMPQEHV  RYEG+
Sbjct: 1   MASEAKQTVHTGNTVLLMIKGKPVGRAQSASGQREYGTTGVYEIGSIMPQEHVYLRYEGT 60

Query: 61  FTLDRFFVKKKSLADLDLAPLGEDVLKLDIFDVVIVDKETNSILRAYRGCTVSEYSETFR 120
            T++R  +KK++ ADL  A LGE++LK DI D+++VD  T  ++ +Y GC+ + Y+ET++
Sbjct: 61  ITVERLRMKKENFADLGYASLGEEILKKDIIDILVVDNLTKQVIISYHGCSANNYNETWQ 120

Query: 121 VGQISGENATFQYLKASD 138
             +I  E   F YL ASD
Sbjct: 121 TNEIVTEEIEFSYLTASD 138


>gi|22597|lcl|protein:vir:95739 Length: 142 # NCBI annotation: ORF105
          # Family: family:all:2792 # MgeID: mge:1577 # MgeName: G1 # Cross-refs:
          genbank:acc:YP_240911;genbank:gi:66395051;genbank:GeneID
          :5132680
          Length = 142

 Score =  167 bits (423), Expect = 4e-44, Method: Compositional matrix adjust.
 Identities = 78/138 (56%), Positives = 102/138 (73%)

Query: 1   MATQAKQTVHTGATVLLMIKNKVVGRAQGLDGRRSFGTEGVYEIGSIMPQEHVQNRYEGS 60
           MA++AKQTVHTG TVLLMIK K VGRAQ   G+R +GT GVYEIGSIMPQEHV  RYEG+
Sbjct: 1   MASEAKQTVHTGNTVLLMIKGKPVGRAQSASGQREYGTTGVYEIGSIMPQEHVYLRYEGT 60

Query: 61  FTLDRFFVKKKSLADLDLAPLGEDVLKLDIFDVVIVDKETNSILRAYRGCTVSEYSETFR 120
            T++R  +KK++ ADL  A LGE++LK DI D+++VD  T  ++ +Y GC+ + Y+ET++
Sbjct: 61  ITVERLRMKKENFADLGYASLGEEILKKDIIDILVVDNLTKQVIISYHGCSANNYNETWQ 120

Query: 121 VGQISGENATFQYLKASD 138
             +I  E   F YL ASD
Sbjct: 121 TNEIVTEEIEFSYLTASD 138


>gi|24940|lcl|protein:vir:80620 Length: 140 # NCBI annotation: gp8
          # Family: family:all:2792 # MgeID: mge:1883 # MgeName: A511 # Cross-refs:
          genbank:acc:YP_001468474;genbank:gi:157325049;genbank:Ge
          neID:5601587
          Length = 140

 Score =  162 bits (410), Expect = 2e-42, Method: Compositional matrix adjust.
 Identities = 75/140 (53%), Positives = 98/140 (70%)

Query: 1   MATQAKQTVHTGATVLLMIKNKVVGRAQGLDGRRSFGTEGVYEIGSIMPQEHVQNRYEGS 60
           MA+   QTVHTG TV LMIK K+VGR Q   G RS+GT GVYEIGSIMPQEHV  +YEGS
Sbjct: 1   MASVTNQTVHTGNTVYLMIKGKIVGRCQSASGERSYGTTGVYEIGSIMPQEHVYLKYEGS 60

Query: 61  FTLDRFFVKKKSLADLDLAPLGEDVLKLDIFDVVIVDKETNSILRAYRGCTVSEYSETFR 120
            T+DR  ++K+ LA L +  LGED+LK D+ D+V++D  T  ++ AYRGC+   Y+E  +
Sbjct: 61  LTVDRLRMRKEDLAKLGITALGEDILKRDVIDIVMMDNTTKEVIIAYRGCSAETYNEEIK 120

Query: 121 VGQISGENATFQYLKASDSK 140
             +I  E+A F YL A++ K
Sbjct: 121 ANEIVSESARFLYLSAANVK 140


>gi|25129|lcl|protein:vir:80710 Length: 140 # NCBI annotation: structural protein
          # Family: family:all:2792 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs:
          genbank:acc:YP_001504133;genbank:gi:158079320;genbank:Ge
          neID:5666358
          Length = 140

 Score =  160 bits (406), Expect = 4e-42, Method: Compositional matrix adjust.
 Identities = 73/140 (52%), Positives = 99/140 (70%)

Query: 1   MATQAKQTVHTGATVLLMIKNKVVGRAQGLDGRRSFGTEGVYEIGSIMPQEHVQNRYEGS 60
           MA+   QTVHTG TV LMI NK++GRAQ   G R +GT+G+YEIGSIMPQEHV  +YEG+
Sbjct: 1   MASVGNQTVHTGNTVYLMIGNKIIGRAQSASGERQYGTQGIYEIGSIMPQEHVYLKYEGT 60

Query: 61  FTLDRFFVKKKSLADLDLAPLGEDVLKLDIFDVVIVDKETNSILRAYRGCTVSEYSETFR 120
            TL+R  +KK+ LA L +  LGED+L+ DI D+V++D  T  I+ AYRGC+   YSE+F
Sbjct: 61  ITLERMRMKKEDLASLGITALGEDILQRDIIDIVMMDNLTKEIVVAYRGCSAISYSESFT 120

Query: 121 VGQISGENATFQYLKASDSK 140
             +++ E+  F YL ++  K
Sbjct: 121 ANEVTSESTQFTYLTSAKVK 140


>gi|23709|lcl|protein:vir:96672 Length: 121 # NCBI annotation: ORF106
          # Family: family:all:2792 # MgeID: mge:1623 # MgeName: Twort # Cross-refs:
          genbank:acc:YP_238555;genbank:gi:66391358;genbank:GeneID
          :5130454
          Length = 121

 Score =  149 bits (376), Expect = 1e-38, Method: Compositional matrix adjust.
 Identities = 69/115 (60%), Positives = 87/115 (75%)

Query: 1   MATQAKQTVHTGATVLLMIKNKVVGRAQGLDGRRSFGTEGVYEIGSIMPQEHVQNRYEGS 60
           MA+QAKQTVHTG TV+LMIK K VGRAQ   G RS+GTEGVYEIGSIMPQEHV  +YEG
Sbjct: 1   MASQAKQTVHTGNTVMLMIKGKPVGRAQSASGTRSYGTEGVYEIGSIMPQEHVYLKYEGE 60

Query: 61  FTLDRFFVKKKSLADLDLAPLGEDVLKLDIFDVVIVDKETNSILRAYRGCTVSEY 115
            T++R  +KK++ A L  A LGE++LK DI D+V++D  T  +L +Y GC+ + Y
Sbjct: 61  LTVERLRMKKENFAKLGYASLGEEILKKDIIDIVVIDNLTKEVLVSYHGCSANNY 115


>gi|24314|lcl|protein:vir:100800 Length: 155 # NCBI annotation: hypothetical protein
          # Family: family:all:2792 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs:
          genbank:acc:YP_164737;genbank:gi:56693150;genbank:GeneID
          :3197433
          Length = 155

 Score =  149 bits (376), Expect = 1e-38, Method: Compositional matrix adjust.
 Identities = 66/141 (46%), Positives = 99/141 (70%)

Query: 1   MATQAKQTVHTGATVLLMIKNKVVGRAQGLDGRRSFGTEGVYEIGSIMPQEHVQNRYEGS 60
           MAT   QTV TG  + +M+K+ V+GRAQ L G RSFGT GVYE+G+IMP+EHV  +Y G+
Sbjct: 1   MATVKNQTVETGNRIYIMVKSAVLGRAQSLTGDRSFGTTGVYELGTIMPREHVFLKYTGT 60

Query: 61  FTLDRFFVKKKSLADLDLAPLGEDVLKLDIFDVVIVDKETNSILRAYRGCTVSEYSETFR 120
            +++R+ +   +  +  +A LGEDVLK+D+ D+ + D  T  ++  YRGC++ +Y ET+R
Sbjct: 61  VSVERYRMTTNNFTNTKVAALGEDVLKIDVLDIAVKDNTTGKLIIVYRGCSIDDYQETYR 120

Query: 121 VGQISGENATFQYLKASDSKN 141
             +I+GE+A F YL AS+ +N
Sbjct: 121 ANEITGESARFYYLTASNLQN 141


>gi|21558|lcl|protein:vir:63764 Length: 123 # NCBI annotation: gp25
          # Family: family:all:2792 # MgeID: mge:1517 # MgeName: P100 # Cross-refs:
          genbank:gi:82547630;genbank:GeneID:3783515
          Length = 123

 Score =  141 bits (355), Expect = 3e-36, Method: Compositional matrix adjust.
 Identities = 64/123 (52%), Positives = 86/123 (69%)

Query: 18  MIKNKVVGRAQGLDGRRSFGTEGVYEIGSIMPQEHVQNRYEGSFTLDRFFVKKKSLADLD 77
           MIK K+VGR Q   G RS+GT GVYEIGSIMPQEHV  +YEGS T+DR  ++K+ LA L
Sbjct: 1   MIKGKIVGRCQSASGERSYGTTGVYEIGSIMPQEHVYLKYEGSLTVDRLRMRKEDLAKLG 60

Query: 78  LAPLGEDVLKLDIFDVVIVDKETNSILRAYRGCTVSEYSETFRVGQISGENATFQYLKAS 137
           +  LGED+LK D+ D+V++D  T  ++ AYRGC+   Y+E  +  +I  E+A F YL A+
Sbjct: 61  ITALGEDILKRDVIDIVMMDNTTKEVIIAYRGCSAETYNEEIKANEIVSESARFLYLSAA 120

Query: 138 DSK 140
           + K
Sbjct: 121 NVK 123


>gi|15780|lcl|protein:vir:6245 Length: 327 # NCBI annotation: gp39
          # Family: family:all:11659 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs:
          genbank:acc:NP_813699;swissprot:trembl:q859b8;genbank:gi
          :29366759;uniprot:Q859B8;genbank:GeneID:1258900
          Length = 327

 Score = 25.8 bits (55), Expect = 0.21, Method: Compositional matrix adjust.
 Identities = 17/60 (28%), Positives = 26/60 (43%), Gaps = 2/60 (3%)

Query: 84  DVLKLDIFDVVIVDKETNSILRAYR--GCTVSEYSETFRVGQISGENATFQYLKASDSKN 141
           DV +   F   +V   T+    AY+  GC  +E+S T  V +    N TF +     + N
Sbjct: 104 DVSEAPSFSAQMVRPGTDGTKAAYKHKGCVATEWSLTAEVEEAVKLNVTFDFQDVEHTTN 163


>gi|976|lcl|protein:vir:6217 Length: 211 # NCBI annotation: hypothetical protein
          # Family: family:all:10887 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs:
          genbank:acc:NP_852597;genbank:gi:31415857;genbank:GeneID
          :1489215
          Length = 211

 Score = 23.5 bits (49), Expect = 1.1, Method: Compositional matrix adjust.
 Identities = 8/18 (44%), Positives = 12/18 (66%)

Query: 103 ILRAYRGCTVSEYSETFR 120
           ILR Y  CTV+   E+++
Sbjct: 127 ILRWYPKCTVAPVEESWK 144


>gi|6927|lcl|protein:vir:106715 Length: 507 # NCBI annotation: gp19
          # Family: family:all:144 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs:
          genbank:acc:NP_944327;genbank:gi:38638626;genbank:GeneID
          :2657344
          Length = 507

 Score = 23.1 bits (48), Expect = 1.5, Method: Composition-based stats.
 Identities = 11/27 (40%), Positives = 17/27 (62%), Gaps = 2/27 (7%)

Query: 43  EIGSIMPQEHVQNRYEGSFTLDRFFVK 69
           E G+I P+EH+Q  Y  +  L + FV+
Sbjct: 313 EFGAIFPREHLQ--YYHAADLPKQFVR 337


>gi|11323|lcl|protein:vir:78588 Length: 507 # NCBI annotation: TerL
          # Family: family:all:144 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs:
          genbank:acc:YP_001294855;genbank:gi:149882918;genbank:Ge
          neID:5291059
          Length = 507

 Score = 23.1 bits (48), Expect = 1.5, Method: Composition-based stats.
 Identities = 11/27 (40%), Positives = 17/27 (62%), Gaps = 2/27 (7%)

Query: 43  EIGSIMPQEHVQNRYEGSFTLDRFFVK 69
           E G+I P+EH+Q  Y  +  L + FV+
Sbjct: 313 EFGAIFPREHLQ--YYHAADLPKQFVR 337


>gi|4006|lcl|protein:vir:100188 Length: 210 # NCBI annotation: putative major tail protein
          # Family: family:all:1028 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs:
          genbank:acc:YP_025036;genbank:gi:48697269;genbank:GeneI
          D:2948286
          Length = 210

 Score = 21.9 bits (45), Expect = 3.7, Method: Compositional matrix adjust.
 Identities = 7/15 (46%), Positives = 10/15 (66%)

Query: 31  DGRRSFGTEGVYEIG 45
           D     GT+GVY++G
Sbjct: 23  DATNGLGTDGVYQVG 37


  Database: capsid_neck_tail
    Posted date:  Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database:  514

Lambda     K      H
   0.317    0.134    0.370

Lambda     K      H
   0.267   0.0410    0.140


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 53,957
Number of Sequences: 514
Number of extensions: 2192
Number of successful extensions: 13
Number of sequences better than 100.0: 13
Number of HSP's better than 100.0 without gapping: 13
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 13
length of query: 141
length of database: 206,069
effective HSP length: 64
effective length of query: 77
effective length of database: 173,173
effective search space: 13334321
effective search space used: 13334321
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 33 (17.3 bits)
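
The statistics footer of this report can be cross-checked by hand. The sketch below is not part of the BLAST output; it recomputes the effective lengths and search space from the reported query length, database size, and effective HSP length, and then applies the standard Karlin-Altschul conversions from raw score to bit score and expect value. It assumes that the second Lambda/K pair in the footer (0.267, 0.0410) holds the gapped parameters; the function names are hypothetical. Because the report rounds Lambda and K and applies compositional adjustment, bit scores and E-values computed this way only approximate the printed ones.

```python
import math

# Values copied from the statistics footer of the report.
QUERY_LEN = 141        # "length of query"
DB_LETTERS = 206_069   # "length of database"
DB_SEQS = 514          # "Number of Sequences"
HSP_LEN = 64           # "effective HSP length"
LAMBDA = 0.267         # assumed gapped Lambda (second Lambda/K/H block)
K = 0.0410             # assumed gapped K (second Lambda/K/H block)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # The length correction is applied once per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Karlin-Altschul normalisation: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(raw_score, search_space, lam=LAMBDA, k=K):
    """Expected number of chance hits: E = search_space * 2 ** (-S')."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
    )
    print(eff_q, eff_db, space)
    # Raw score 49 is the phBC6A52 hit (reported as 23.5 bits, Expect = 1.1).
    print(round(bit_score(49), 1), round(expect(49, space), 2))
```

The first three numbers match the footer exactly (77, 173,173 and 13334321), which confirms that the effective database length subtracts the HSP length once for each of the 514 sequences.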