BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_016653.1_cdsid_YP_005087137.1 [gene=RoPhRER2_gp11]
[protein=major tail protein] [protein_id=YP_005087137.1]
[location=7250..8107]
         (285 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

gi|16484|lcl|protein:vir:7778 Length: 198 # NCBI annotation: gp2...     186   2e-49
gi|11001|lcl|protein:vir:78275 Length: 282 # NCBI annotation: Pu...     173   3e-45
gi|11245|lcl|protein:vir:78506 Length: 282 # NCBI annotation: gp...     171   8e-45
gi|19796|lcl|protein:vir:2349 Length: 283 # NCBI annotation: gp1...     171   1e-44
gi|17806|lcl|protein:vir:2437 Length: 198 # NCBI annotation: maj...     150   1e-38
gi|16689|lcl|protein:vir:4232 Length: 198 # NCBI annotation: obs...     149   3e-38
gi|9502|lcl|protein:vir:104093 Length: 198 # NCBI annotation: gp...     146   3e-37
gi|18197|lcl|protein:vir:5002 Length: 202 # NCBI annotation: maj...      25   1.2
gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: t...      24   1.9
gi|11632|lcl|protein:vir:78892 Length: 228 # NCBI annotation: Ts...      23   4.1
gi|13862|lcl|protein:vir:4835 Length: 203 # NCBI annotation: MPS...      22   9.0
gi|15738|lcl|protein:vir:4957 Length: 203 # NCBI annotation: maj...      22   9.1

>gi|16484|lcl|protein:vir:7778 Length: 198 # NCBI annotation: gp23
          # Family: family:all:2431 # MgeID: mge:149 # MgeName: Bxz2
          # Cross-refs: genbank:acc:NP_817612;genbank:gi:29566042;genbank:GeneID:1259236
          Length = 198

 Score = 186 bits (473), Expect = 2e-49, Method: Compositional matrix adjust.
 Identities = 90/159 (56%), Positives = 110/159 (69%), Gaps = 6/159 (3%)

Query: 123 SLTDAGEVTVTETAVTNGWTPIGHTSEGDLPELGFEGGDTEVRNTWQRTGLREVNTDTPY 182
           +LTD G  T T      GW  IGHTS GD+PE GF+GGDTEVR +WQ+  LREV T+ P
Sbjct: 35  NLTDTGLWTPT------GWDSIGHTSRGDMPEFGFDGGDTEVRGSWQKKKLREVTTEDPV 88

Query: 183 DYMTMKLAQFDASGFQFYYGDNASTVDGVFGVDSSVIKPVERALFMLIVDGDLRVGFRAA 242
           DY+T+ L QFD   F+ YYG NAST  GVFGV ++   P E+A  ++IVDGD RVGF A
Sbjct: 89  DYLTLFLHQFDEQAFELYYGANASTTPGVFGVSAASGDPTEKAFLVVIVDGDERVGFHAH 148

Query: 243 KASIRRDESISLATDEFGTLPVRATFIKHPGNHLFEWIT 281
           KAS+RRD++I L TD+F  LPVRATF++H    LF WI
Sbjct: 149 KASVRRDDAIQLPTDDFAALPVRATFLQHNNELLFSWIN 187

 Score = 46.2 bits (108), Expect = 4e-07, Method: Compositional matrix adjust.
 Identities = 20/34 (58%), Positives = 26/34 (76%)

Query: 1   MSFNDNAVLTAARGYIFVGPTGTSRPTPAQVADL 34
           M+ ND+AVLTAA GY++  P GT+ PTPAQ+  L
Sbjct: 1   MALNDDAVLTAAVGYVYTAPVGTAAPTPAQLKTL 34

>gi|11001|lcl|protein:vir:78275 Length: 282 # NCBI annotation: Putative major tail protein
          # Family: family:all:2431 # MgeID: mge:1849 # MgeName: Bethlehem
          # Cross-refs: genbank:acc:YP_001491672;genbank:gi:157786496;genbank:GeneID:5625753
          Length = 282

 Score = 173 bits (438), Expect = 3e-45, Method: Compositional matrix adjust.
 Identities = 80/142 (56%), Positives = 105/142 (73%), Gaps = 3/142 (2%)

Query: 140 GWTPIGHTSEGDLPELGFEGGDTEVRNTWQRTGLREVNTDTPYDYMTMKLAQFDASGFQF 199
           GW  +GHTSE DLPE GF+GGD+EVR +WQ+  LREV T+   DY+ + L QFD S  +
Sbjct: 46  GWELVGHTSEDDLPEFGFDGGDSEVRGSWQKKKLREVETEEIADYVVINLTQFDESALEL 105

Query: 200 YYGDNASTVDGVFGVDS-SVIKPVERALFMLIVDGDLRVGFRAAKASIRRDESISLATDE 258
           Y+G N S   G+FGV S SV+   ERAL ++IVD D+R+GF A KAS++R+++ISLATDE
Sbjct: 106 YFGPNQSATPGIFGVKSGSVVN--ERALLIVIVDNDVRLGFHARKASLKREDAISLATDE 163

Query: 259 FGTLPVRATFIKHPGNHLFEWI 280
           FG LPVRATF+ +   +L+EWI
Sbjct: 164 FGALPVRATFLDYQSYNLYEWI 185

 Score = 42.0 bits (97), Expect = 8e-06, Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 2/37 (5%)

Query: 1   MSFNDNAVLTAARGYIFVGPTGTSRPTPAQVA--DLE 35
           M+  D+AVL AARGY++    GT+ PTP+Q+   DLE
Sbjct: 1   MALKDDAVLIAARGYVYTAAVGTAAPTPSQLKLIDLE 37

 Score = 33.9 bits (76), Expect = 0.002, Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 36/56 (64%), Gaps = 5/56 (8%)

Query: 34  LEPDTFGAHQ----YTLAVTGSPTGGTYTLTVDSKPVAALPLSADLGAIEAALGAV 85
           +E D F A +    YT+ + G+ TGG++TL V  K  A++  +A+  A+++A+GAV
Sbjct: 185 IEEDWFNAVETAPVYTVDLGGA-TGGSFTLKVGDKTTASIAYNANAAAVKSAIGAV 239

>gi|11245|lcl|protein:vir:78506 Length: 282 # NCBI annotation: gp20
          # Family: family:all:2431 # MgeID: mge:1853 # MgeName: U2
          # Cross-refs: genbank:acc:YP_001491591;genbank:gi:157786414;genbank:GeneID:5625658
          Length = 282

 Score = 171 bits (434), Expect = 8e-45, Method: Compositional matrix adjust.
 Identities = 79/142 (55%), Positives = 105/142 (73%), Gaps = 3/142 (2%)

Query: 140 GWTPIGHTSEGDLPELGFEGGDTEVRNTWQRTGLREVNTDTPYDYMTMKLAQFDASGFQF 199
           GW  +GHTSE DLPE GF+GGD+EVR +WQ+  LREV T+   DY+ + L QFD +  +
Sbjct: 46  GWDLVGHTSEDDLPEFGFDGGDSEVRGSWQKKKLREVETEEIADYVVINLTQFDETALEL 105

Query: 200 YYGDNASTVDGVFGVDS-SVIKPVERALFMLIVDGDLRVGFRAAKASIRRDESISLATDE 258
           Y+G N S   G+FGV S SV+   ERAL ++IVD D+R+GF A KAS++R+++ISLATDE
Sbjct: 106 YFGPNQSATPGIFGVKSGSVVN--ERALLIVIVDNDVRLGFHARKASLKREDAISLATDE 163

Query: 259 FGTLPVRATFIKHPGNHLFEWI 280
           FG LPVRATF+ +   +L+EWI
Sbjct: 164 FGALPVRATFLDYQSYNLYEWI 185

 Score = 42.0 bits (97), Expect = 8e-06, Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 2/37 (5%)

Query: 1   MSFNDNAVLTAARGYIFVGPTGTSRPTPAQVA--DLE 35
           M+  D+AVL AARGY++    GT+ P+PAQ+   DLE
Sbjct: 1   MALKDDAVLIAARGYVYTAAVGTAAPSPAQLKLIDLE 37

 Score = 33.9 bits (76), Expect = 0.002, Method: Compositional matrix adjust.
 Identities = 21/56 (37%), Positives = 36/56 (64%), Gaps = 5/56 (8%)

Query: 34  LEPDTFGAHQ----YTLAVTGSPTGGTYTLTVDSKPVAALPLSADLGAIEAALGAV 85
           +E D F A +    YT+ + G+ TGG++TL V  K  A++  +A+  A+++A+GAV
Sbjct: 185 IEEDWFNAVETAPVYTVDLGGA-TGGSFTLKVGDKTTASIAYNANAAAVKSAIGAV 239

>gi|19796|lcl|protein:vir:2349 Length: 283 # NCBI annotation: gp19
          # Family: family:all:2431 # MgeID: mge:51 # MgeName: Bxb1
          # Cross-refs: genbank:acc:NP_075286;genbank:gi:12657873;genbank:GeneID:920104
          Length = 283

 Score = 171 bits (432), Expect = 1e-44, Method: Compositional matrix adjust.
 Identities = 79/142 (55%), Positives = 105/142 (73%), Gaps = 3/142 (2%)

Query: 140 GWTPIGHTSEGDLPELGFEGGDTEVRNTWQRTGLREVNTDTPYDYMTMKLAQFDASGFQF 199
           GW  +GHTSE DLPE GF+GGD+EVR +WQ+  LREV T+   DY+ + L QFD +  +
Sbjct: 46  GWDLVGHTSEDDLPEFGFDGGDSEVRGSWQKKKLREVETEEIADYVVINLTQFDETALEL 105

Query: 200 YYGDNASTVDGVFGVDS-SVIKPVERALFMLIVDGDLRVGFRAAKASIRRDESISLATDE 258
           Y+G N S   G+FGV S SV+   ERAL ++IVD D+R+GF A KAS++R+++ISLATDE
Sbjct: 106 YFGPNQSATPGIFGVKSGSVVN--ERALLIVIVDNDVRLGFHARKASLKREDAISLATDE 163

Query: 259 FGTLPVRATFIKHPGNHLFEWI 280
           FG LPVRATF+ +   +L+EWI
Sbjct: 164 FGALPVRATFLDYQSYNLYEWI 185

 Score = 42.0 bits (97), Expect = 9e-06, Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 26/37 (70%), Gaps = 2/37 (5%)

Query: 1   MSFNDNAVLTAARGYIFVGPTGTSRPTPAQVA--DLE 35
           M+  D+AVL AARGY++    GT+ PTP+Q+   DLE
Sbjct: 1   MALKDDAVLIAARGYVYTAAVGTAAPTPSQLKLIDLE 37

 Score = 30.0 bits (66), Expect = 0.036, Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 30/56 (53%), Gaps = 5/56 (8%)

Query: 34  LEPDTFGAHQ----YTLAVTGSPTGGTYTLTVDSKPVAALPLSADLGAIEAALGAV 85
           +E D F A      Y L + G+ TGG YTL V  K    +  +A+  AI+ A+GAV
Sbjct: 185 IEEDWFNAVDAPVVYLLDLGGA-TGGDYTLLVGGKSTGDIAYNANASAIKTAIGAV 239

>gi|17806|lcl|protein:vir:2437 Length: 198 # NCBI annotation: major tail subunit
          # Family: family:all:2431 # MgeID: mge:52 # MgeName: D29
          # Cross-refs: genbank:acc:NP_046839;genbank:gi:9630407;genbank:GeneID:1261604
          Length = 198

 Score = 150 bits (380), Expect = 1e-38, Method: Compositional matrix adjust.
 Identities = 69/142 (48%), Positives = 92/142 (64%), Gaps = 2/142 (1%)

Query: 140 GWTPIGHTSEGDLPELGFEGGDTEVRNTWQRTGLREVNTDTPYDYMTMKLAQFDASGFQF 199
           GW+ +GHTS G LPE GFEGGD+EV+ +WQ+  LRE+ T+ P DY+ + L QFD
Sbjct: 47  GWSSVGHTSRGTLPEFGFEGGDSEVKGSWQKKKLREITTEDPIDYVVVLLHQFDEQSLGL 106

Query: 200 YYGDNASTVDGVFGVDSSVIKPVERALFMLIVDGDLRVGFRAAKASIRRDESISLATDEF 259
           YYG NAST  GVFGV +      E+A+ ++I DGD+R+G  A KA +RRD++I L  D+
Sbjct: 107 YYGPNASTTPGVFGVKTGQTN--EKAVLVVIEDGDMRLGHHAHKAGVRRDDAIELPIDDL 164

Query: 260 GTLPVRATFIKHPGNHLFEWIT 281
             LPVR T++ H     F WI
Sbjct: 165 AALPVRFTYLDHKDELPFSWIN 186

 Score = 38.1 bits (87), Expect = 1e-04, Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 26/35 (74%)

Query: 1   MSFNDNAVLTAARGYIFVGPTGTSRPTPAQVADLE 35
           M+ ND+AVLTAA GY++V   GT+  TPA++  ++
Sbjct: 1   MAENDDAVLTAAVGYVYVAEAGTAAHTPAELKTID 35

>gi|16689|lcl|protein:vir:4232 Length: 198 # NCBI annotation: observed 21.5Kd protein
          # Family: family:all:2431 # MgeID: mge:89 # MgeName: L5
          # Cross-refs: genbank:acc:NP_039687;swissprot:sw:q05229;genbank:gi:9625453;goa:Q05229;uniprot:Q05229;genbank:GeneID:2942952
          Length = 198

 Score = 149 bits (376), Expect = 3e-38, Method: Compositional matrix adjust.
 Identities = 69/142 (48%), Positives = 92/142 (64%), Gaps = 2/142 (1%)

Query: 140 GWTPIGHTSEGDLPELGFEGGDTEVRNTWQRTGLREVNTDTPYDYMTMKLAQFDASGFQF 199
           GWT +GHTS G LPE GFEGG++EV+ +WQ+  LRE+ T+ P DY+T+ L QFD
Sbjct: 47  GWTSVGHTSRGTLPEFGFEGGESEVKGSWQKKKLREITTEDPIDYVTVLLHQFDEQSLGL 106

Query: 200 YYGDNASTVDGVFGVDSSVIKPVERALFMLIVDGDLRVGFRAAKASIRRDESISLATDEF 259
           YYG NAS   GVFGV +      E+A+ ++I DGD+R+G  A KA +RRD++I L  D+
Sbjct: 107 YYGPNASETPGVFGVKTGQTN--EKAVLVVIEDGDMRLGHHAHKAGVRRDDAIELPIDDL 164

Query: 260 GTLPVRATFIKHPGNHLFEWIT 281
             LPVR T++ H     F WI
Sbjct: 165 AALPVRFTYLDHEDELPFSWIN 186

 Score = 28.9 bits (63), Expect = 0.074, Method: Compositional matrix adjust.
 Identities = 12/18 (66%), Positives = 16/18 (88%)

Query: 1   MSFNDNAVLTAARGYIFV 18
           M+ ND+AVLTAA GY++V
Sbjct: 1   MAENDDAVLTAAVGYVYV 18

>gi|9502|lcl|protein:vir:104093 Length: 198 # NCBI annotation: gp25
          # Family: family:all:2431 # MgeID: mge:1656 # MgeName: Che12
          # Cross-refs: genbank:acc:YP_655604;genbank:gi:109392475;genbank:GeneID:4156961
          Length = 198

 Score = 146 bits (369), Expect = 3e-37, Method: Compositional matrix adjust.
 Identities = 72/149 (48%), Positives = 95/149 (63%), Gaps = 3/149 (2%)

Query: 133 TETAVTNGWTPIGHTSEGDLPELGFEGGDTEVRNTWQRTGLREVNTDTPYDYMTMKLAQF 192
           T T VT GW  +GHTS G LPE GFEGGD+EV+ +WQ+  LRE+ T+ P DY+T+ L QF
Sbjct: 41  TWTGVT-GWESVGHTSRGTLPEFGFEGGDSEVKGSWQKKKLREITTEDPIDYVTVLLHQF 99

Query: 193 DASGFQFYYGDNASTVDGVFGVDSSVIKPVERALFMLIVDGDLRVGFRAAKASIRRDESI 252
           D      YYG NAS   GVFGV +      E+A+ ++I DGD+R+G  A KA +RRD++I
Sbjct: 100 DEQTLGLYYGPNASETPGVFGVKTGQTN--EKAVLVVIEDGDMRLGQHAHKAGVRRDDAI 157

Query: 253 SLATDEFGTLPVRATFIKHPGNHLFEWIT 281
            L  D+   LPVR T++ +     F WI
Sbjct: 158 ELPIDDLAALPVRFTYLDYEDELPFSWIN 186

 Score = 45.8 bits (107), Expect = 6e-07, Method: Compositional matrix adjust.
 Identities = 23/42 (54%), Positives = 32/42 (76%), Gaps = 3/42 (7%)

Query: 1   MSFNDNAVLTAARGYIFVGPTGTSRPTPAQVADLE---PDTF 39
           M+ ND+AVLTAA GY++VG  GT+ PTPAQ+  L+   P+T+
Sbjct: 1   MAENDDAVLTAAVGYVYVGAAGTAPPTPAQLKTLDLTKPETW 42

>gi|18197|lcl|protein:vir:5002 Length: 202 # NCBI annotation: major tail protein
          # Family: family:all:1028 # MgeID: mge:109 # MgeName: Sfi21
          # Cross-refs: genbank:acc:NP_049976;genbank:gi:9632948;genbank:GeneID:1262112
          Length = 202

 Score = 25.0 bits (53), Expect = 1.2, Method: Compositional matrix adjust.
 Identities = 12/33 (36%), Positives = 18/33 (54%)

Query: 187 MKLAQFDASGFQFYYGDNASTVDGVFGVDSSVI 219
           +KLA  D    Q   GD+  + DGV  +DS ++
Sbjct: 9   VKLALVDPDTQQLIKGDSGLSTDGVIAIDSKML 41

>gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: terminase large subunit gp17
          # Family: family:all:147 # MgeID: mge:1459 # MgeName: S-PM2
          # Cross-refs: genbank:acc:YP_195134;genbank:gi:58532911;uniprot:Q5GQN8;genbank:GeneID:3260486
          Length = 548

 Score = 24.3 bits (51), Expect = 1.9, Method: Compositional matrix adjust.
 Identities = 16/42 (38%), Positives = 21/42 (50%), Gaps = 1/42 (2%)

Query: 222 VERALFMLIVDGDLRVGFRAAKASIRRDESISLATDEFGTLP 263
           V   L  LI + ++ +G  A KAS  RD    LAT  +  LP
Sbjct: 91  VSYLLHYLIFNDNVNIGILANKASTARDLLARLAT-AYENLP 131

>gi|11632|lcl|protein:vir:78892 Length: 228 # NCBI annotation: Tsh
          # Family: family:all:47 # MgeID: mge:1859 # MgeName: A006
          # Cross-refs: genbank:acc:YP_001468852;genbank:gi:157325426;genbank:GeneID:5601889
          Length = 228

 Score = 23.1 bits (48), Expect = 4.1, Method: Compositional matrix adjust.
 Identities = 15/44 (34%), Positives = 19/44 (43%), Gaps = 12/44 (27%)

Query: 141 WTPIGHTSE-----GDLPELGFEG-------GDTEVRNTWQRTG 172
           +T IG   E     G   ELG +G       G  E+R TW + G
Sbjct: 42  YTTIGEVFERAVKTGAAMELGLDGKYNESDPGQNELRETWDKVG 85

>gi|13862|lcl|protein:vir:4835 Length: 203 # NCBI annotation: MPS-7201
          # Family: family:all:1028 # MgeID: mge:105 # MgeName: 7201
          # Cross-refs: genbank:acc:NP_038332;genbank:gi:9634658;genbank:GeneID:1262628
          Length = 203

 Score = 21.9 bits (45), Expect = 9.0, Method: Compositional matrix adjust.
 Identities = 11/33 (33%), Positives = 17/33 (51%)

Query: 187 MKLAQFDASGFQFYYGDNASTVDGVFGVDSSVI 219
           +KLA  D    Q   G+   + DGV  +DS ++
Sbjct: 9   VKLALVDPKTQQIIKGEEGLSTDGVIEIDSKML 41

>gi|15738|lcl|protein:vir:4957 Length: 203 # NCBI annotation: major tail protein
          # Family: family:all:1028 # MgeID: mge:108 # MgeName: Sfi19
          # Cross-refs: genbank:acc:NP_049933;genbank:gi:9632904;genbank:GeneID:1262081
          Length = 203

 Score = 21.9 bits (45), Expect = 9.1, Method: Compositional matrix adjust.
 Identities = 11/33 (33%), Positives = 17/33 (51%)

Query: 187 MKLAQFDASGFQFYYGDNASTVDGVFGVDSSVI 219
           +KLA  D    Q   G+   + DGV  +DS ++
Sbjct: 9   VKLALVDPKTQQIIKGEEGLSTDGVIEIDSKML 41

  Database: capsid_neck_tail
    Posted date:  Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database:  514

Lambda     K      H
   0.315    0.133    0.389

Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 127,064
Number of Sequences: 514
Number of extensions: 5809
Number of successful extensions: 69
Number of sequences better than 100.0: 21
Number of HSP's better than 100.0 without gapping: 19
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 32
Number of HSP's gapped (non-prelim): 33
length of query: 285
length of database: 206,069
effective HSP length: 71
effective length of query: 214
effective length of database: 169,575
effective search space: 36289050
effective search space used: 36289050
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 36 (18.5 bits)