BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|Aclame:protein:vir:9825|NCBI_annot:putative major tail shaft protein|genbank:acc:NP_795587;genbank:gi:28876334;genbank:GeneID:1257909
         (161 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gi|16310|lcl|protein:vir:3038 Length: 161 # NCBI annotation: tai...   328   3e-92
gi|16036|lcl|protein:vir:9825 Length: 161 # NCBI annotation: put...   328   3e-92
gi|19303|lcl|protein:vir:4792 Length: 149 # NCBI annotation: put...   223   9e-61
gi|5806|lcl|protein:vir:98876 Length: 152 # NCBI annotation: tai...   166   2e-43
gi|13029|lcl|protein:vir:80923 Length: 145 # NCBI annotation: Ts...    99   3e-23
gi|15846|lcl|protein:vir:47 Length: 144 # NCBI annotation: major...    99   4e-23
gi|646|lcl|protein:vir:1579 Length: 178 # NCBI annotation: minor...    50   2e-08
gi|10890|lcl|protein:vir:78144 Length: 530 # NCBI annotation: pr...    27   0.10
gi|22578|lcl|protein:vir:95547 Length: 605 # NCBI annotation: OR...    24   1.0
gi|9399|lcl|protein:vir:99333 Length: 605 # NCBI annotation: hyp...    24   1.0
gi|27798|lcl|protein:vir:8408 Length: 214 # NCBI annotation: gp3...    23   1.6
gi|19756|lcl|protein:vir:6373 Length: 258 # NCBI annotation: hyp...    22   3.1
gi|7204|lcl|protein:vir:100065 Length: 577 # NCBI annotation: DN...    22   5.2

>gi|16310|lcl|protein:vir:3038 Length: 161 # NCBI annotation: tail protein # Family: family:all:900 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438151;genbank:gi:16271814;genbank:GeneID:929244
          Length = 161

 Score = 328 bits (840), Expect = 3e-92, Method: Compositional matrix adjust.
 Identities = 161/161 (100%), Positives = 161/161 (100%)

Query: 1   MRQKNALRGHFIAPYVKGEEKTEVTKEKLLELARWIKDISDDTDEKTEDEAYYDGDGTEE 60
           MRQKNALRGHFIAPYVKGEEKTEVTKEKLLELARWIKDISDDTDEKTEDEAYYDGDGTEE
Sbjct: 1   MRQKNALRGHFIAPYVKGEEKTEVTKEKLLELARWIKDISDDTDEKTEDEAYYDGDGTEE 60

Query: 61  TTVVGVKGAYTFEGTYDPEDKAQAHIASLKYKLGDERKVWHLIVSADGKTQWLGVATVTE 120
           TTVVGVKGAYTFEGTYDPEDKAQAHIASLKYKLGDERKVWHLIVSADGKTQWLGVATVTE
Sbjct: 61  TTVVGVKGAYTFEGTYDPEDKAQAHIASLKYKLGDERKVWHLIVSADGKTQWLGVATVTE 120

Query: 121 IIAGSGAAADFEAFGCKITYNSLPKESKEIIPKKNELSVAM 161
           IIAGSGAAADFEAFGCKITYNSLPKESKEIIPKKNELSVAM
Sbjct: 121 IIAGSGAAADFEAFGCKITYNSLPKESKEIIPKKNELSVAM 161

>gi|16036|lcl|protein:vir:9825 Length: 161 # NCBI annotation: putative major tail shaft protein # Family: family:all:900 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795587;genbank:gi:28876334;genbank:GeneID:1257909
          Length = 161

 Score = 328 bits (840), Expect = 3e-92, Method: Compositional matrix adjust.
 Identities = 161/161 (100%), Positives = 161/161 (100%)

Query: 1   MRQKNALRGHFIAPYVKGEEKTEVTKEKLLELARWIKDISDDTDEKTEDEAYYDGDGTEE 60
           MRQKNALRGHFIAPYVKGEEKTEVTKEKLLELARWIKDISDDTDEKTEDEAYYDGDGTEE
Sbjct: 1   MRQKNALRGHFIAPYVKGEEKTEVTKEKLLELARWIKDISDDTDEKTEDEAYYDGDGTEE 60

Query: 61  TTVVGVKGAYTFEGTYDPEDKAQAHIASLKYKLGDERKVWHLIVSADGKTQWLGVATVTE 120
           TTVVGVKGAYTFEGTYDPEDKAQAHIASLKYKLGDERKVWHLIVSADGKTQWLGVATVTE
Sbjct: 61  TTVVGVKGAYTFEGTYDPEDKAQAHIASLKYKLGDERKVWHLIVSADGKTQWLGVATVTE 120

Query: 121 IIAGSGAAADFEAFGCKITYNSLPKESKEIIPKKNELSVAM 161
           IIAGSGAAADFEAFGCKITYNSLPKESKEIIPKKNELSVAM
Sbjct: 121 IIAGSGAAADFEAFGCKITYNSLPKESKEIIPKKNELSVAM 161

>gi|19303|lcl|protein:vir:4792 Length: 149 # NCBI annotation: putative major tail shaft protein # Family: family:all:900 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150172;swissprot:trembl:q94m39;genbank:gi:15088783;uniprot:Q94M39;genbank:GeneID:956012
          Length = 149

 Score = 223 bits (568), Expect = 9e-61, Method: Compositional matrix adjust.
 Identities = 105/146 (71%), Positives = 125/146 (85%), Gaps = 2/146 (1%)

Query: 2   RQKNALRGHFIAPYVKGEEKTEVTKEKLLELARWIKDISDDTDEKTEDEAYYDGDGTEET 61
           RQKNALRGHF+APY  G E +  T++  LELA+WI D+SDDTDEKT+D+AYYDGDG EET
Sbjct: 3   RQKNALRGHFVAPYNGGTEPS--TEDTWLELAKWISDVSDDTDEKTDDQAYYDGDGVEET 60

Query: 62  TVVGVKGAYTFEGTYDPEDKAQAHIASLKYKLGDERKVWHLIVSADGKTQWLGVATVTEI 121
           TVV VKGAYTFEGTYDP+DKAQA IA +KYK GD+RK+WH +VS+D K QW+G AT TEI
Sbjct: 61  TVVSVKGAYTFEGTYDPDDKAQALIAGMKYKTGDDRKLWHKVVSSDRKKQWVGAATATEI 120

Query: 122 IAGSGAAADFEAFGCKITYNSLPKES 147
            AGSGAA+D+EAFGCK++YNS PKE+
Sbjct: 121 KAGSGAASDYEAFGCKLSYNSTPKET 146

>gi|5806|lcl|protein:vir:98876 Length: 152 # NCBI annotation: tail shaft protein # Family: family:all:900 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164424;genbank:gi:56694914;genbank:GeneID:3197266
          Length = 152

 Score = 166 bits (419), Expect = 2e-43, Method: Compositional matrix adjust.
 Identities = 81/152 (53%), Positives = 107/152 (70%), Gaps = 3/152 (1%)

Query: 2   RQKNALRGHFIAPYVKGEEKTEVTKEKLLELARWIKDISDDTDEKTEDEAYYDGDGTEET 61
           R KNA R HF+  Y  G+++     E  LELA++I  I DDT+E+TEDEA+YDGDGT ET
Sbjct: 3   RLKNAERQHFVQAYEPGQDEP---GEDWLELAKYISTIGDDTNEETEDEAFYDGDGTPET 59

Query: 62  TVVGVKGAYTFEGTYDPEDKAQAHIASLKYKLGDERKVWHLIVSADGKTQWLGVATVTEI 121
           TV+ V   YT EG YDPED AQA IA LKYK GD RK+WH +V +DGK +W+G ATV+ I
Sbjct: 60  TVISVAQGYTPEGYYDPEDPAQALIAGLKYKTGDGRKIWHRVVRSDGKKEWVGRATVSSI 119

Query: 122 IAGSGAAADFEAFGCKITYNSLPKESKEIIPK 153
           +AG+G A+ +E F C I ++ +P+E+    P+
Sbjct: 120 VAGAGDASAYETFSCNIRFDRIPEENDLTTPE 151

>gi|13029|lcl|protein:vir:80923 Length: 145 # NCBI annotation: Tsh # Family: family:all:900 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468398;genbank:gi:157324972;genbank:GeneID:5601356
          Length = 145

 Score = 99.0 bits (245), Expect = 3e-23, Method: Compositional matrix adjust.
 Identities = 63/149 (42%), Positives = 88/149 (59%), Gaps = 9/149 (6%)

Query: 2   RQKNALRGHFIAPYVKGEEKTEVTKEKLLELARWIKDISDDTDEKTEDEAYYDGDGTEET 61
           R KNA   +F+A  V G     V +     L++WI ++SDD  + TE++  YDGDG E+T
Sbjct: 3   RIKNAKTKYFVAEIVDG-----VGEPVWKRLSKWITNVSDDGSDNTEEQGDYDGDGNEKT 57

Query: 62  TVVGVKGAYTFEGTYDPEDKAQAHIASLKYKLGDERKVWHLIVSADGKTQWLGVATVTEI 121
            V+G   AYTFEGT+D ED+AQ  I + K +  + R +   I   D +T  +G ATV+EI
Sbjct: 58  VVLGYSEAYTFEGTHDREDEAQNLIVA-KRRTPENRGIMFKIEIPDTETA-VGKATVSEI 115

Query: 122 --IAGSGAAADFEAFGCKITYNSLPKESK 148
              AG G A +F AF C+I Y+  PK +K
Sbjct: 116 KGSAGGGDATEFPAFACRIAYDETPKVTK 144

>gi|15846|lcl|protein:vir:47 Length: 144 # NCBI annotation: major tail shaft protein # Family: family:all:900 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463473;swissprot:trembl:q9t1b1;genbank:gi:16798795;uniprot:Q9T1B1;genbank:GeneID:922387
          Length = 144

 Score = 98.6 bits (244), Expect = 4e-23, Method: Compositional matrix adjust.
 Identities = 62/150 (41%), Positives = 89/150 (59%), Gaps = 9/150 (6%)

Query: 1   MRQKNALRGHFIAPYVKGEEKTEVTKEKLLELARWIKDISDDTDEKTEDEAYYDGDGTEE 60
           MR KNA   + +A  V G  + +  +     L++WI ++SDD  + TE++  YDGDG E+
Sbjct: 1   MRIKNAKTKYSVAEIVAGAGEPDWKR-----LSKWITNVSDDGSDNTEEQGDYDGDGNEK 55

Query: 61  TTVVGVKGAYTFEGTYDPEDKAQAHIASLKYKLGDERKVWHLIVSADGKTQWLGVATVTE 120
           T V+G   AYTFEGT+D ED+AQ  I + K +  + R +   I   D +T  +G ATV+E
Sbjct: 56  TVVLGYSEAYTFEGTHDREDEAQNLIVA-KRRTPENRSIMFKIEIPDTETA-IGKATVSE 113

Query: 121 I--IAGSGAAADFEAFGCKITYNSLPKESK 148
           I   AG G A +F AFGC+I Y+  P  +K
Sbjct: 114 IKGSAGGGDATEFPAFGCRIAYDETPTVTK 143

>gi|646|lcl|protein:vir:1579 Length: 178 # NCBI annotation: minor capsid protein # Family: family:all:900 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695161;swissprot:trembl:o03972;genbank:gi:23455808;goa:O03972;interpro:IPR002813;uniprot:O03972;genbank:GeneID:955560
          Length = 178

 Score = 49.7 bits (117), Expect = 2e-08, Method: Compositional matrix adjust.
 Identities = 34/126 (26%), Positives = 57/126 (45%), Gaps = 1/126 (0%)

Query: 19  EEKTEVTKEKLLELARWIKDISDDTDEKTEDEAYYDGDGTEETTVVGVKGAYTFEGTYDP 78
           ++  + TK   + LA  I  ++   +E  ++ AYYDG G  +T V G +    F G
Sbjct: 20  QDPQDTTKATFVPLAAGISGVTPAANETDDNTAYYDGAGFTDTDVTGKRITLAFSGHRVI 79

Query: 79  EDKAQAHIASLKYKLGDERKVWHLIVSADGKTQWLGVATVTEIIAGSGAAADFEAFGCKI 138
            D AQ ++AS    +G+  K        DG  + +   T+T I+   G A   + F   +
Sbjct: 80  GDAAQDYVASKFLAIGESLKTLARWTDPDGN-KIVSNVTITAIVPMGGNANAKQTFSFTL 138

Query: 139 TYNSLP 144
           ++N  P
Sbjct: 139 SFNGKP 144

>gi|10890|lcl|protein:vir:78144 Length: 530 # NCBI annotation: probable terminase # Family: family:all:523 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294797;genbank:gi:149882818;genbank:GeneID:5309172
          Length = 530

 Score = 27.3 bits (59), Expect = 0.10, Method: Composition-based stats.
 Identities = 11/24 (45%), Positives = 15/24 (62%)

Query: 97  RKVWHLIVSADGKTQWLGVATVTE 120
           R VW + +S D +T WL  A +TE
Sbjct: 336 RTVWAIDMSHDRRTTWLAAAVLTE 359

>gi|22578|lcl|protein:vir:95547 Length: 605 # NCBI annotation: ORF010 # Family: family:all:1430 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240892;genbank:gi:66394959;genbank:GeneID:5132488
          Length = 605

 Score = 23.9 bits (50), Expect = 1.0, Method: Composition-based stats.
 Identities = 11/25 (44%), Positives = 16/25 (64%), Gaps = 2/25 (8%)

Query: 132 EAFGCKITYNSLPKESKEIIPKKNE 156
           + FGC  TY S PK + ++ P+ NE
Sbjct: 466 KVFGC--TYKSSPKSTGQLRPEFNE 488

>gi|9399|lcl|protein:vir:99333 Length: 605 # NCBI annotation: hypothetical protein # Family: family:all:1430 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024465;genbank:gi:48696425;genbank:GeneID:2948061
          Length = 605

 Score = 23.9 bits (50), Expect = 1.0, Method: Composition-based stats.
 Identities = 11/25 (44%), Positives = 16/25 (64%), Gaps = 2/25 (8%)

Query: 132 EAFGCKITYNSLPKESKEIIPKKNE 156
           + FGC  TY S PK + ++ P+ NE
Sbjct: 466 KVFGC--TYKSSPKSTGQLRPEFNE 488

>gi|27798|lcl|protein:vir:8408 Length: 214 # NCBI annotation: gp3 # Family: family:all:30874 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818304;genbank:gi:29566740;genbank:GeneID:1260058
          Length = 214

 Score = 23.5 bits (49), Expect = 1.6, Method: Compositional matrix adjust.
 Identities = 20/60 (33%), Positives = 27/60 (45%), Gaps = 11/60 (18%)

Query: 93  LGDERKVWHLIVSADGKTQWLGVATVTEIIAGSGAAADFEAFGCKITYNSLPKESKEIIP 152
           L D+  + HL   ADGK  W G     EI      A+   A G   T   + +E +EI+P
Sbjct: 146 LHDQGLLGHLFRVADGKRVWRG----AEI------ASANPALGHLFTLEQVERE-REILP 194

>gi|19756|lcl|protein:vir:6373 Length: 258 # NCBI annotation: hypothetical protein # Family: family:all:29417 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918986;genbank:gi:34610161;genbank:GeneID:2559565
          Length = 258

 Score = 22.3 bits (46), Expect = 3.1, Method: Compositional matrix adjust.
 Identities = 18/57 (31%), Positives = 23/57 (40%), Gaps = 2/57 (3%)

Query: 103 IVSADGKTQWLGVATVTEIIAGSGAAADFEAFGCKITYNSLPKESKEIIPKKNELSV 159
           I+   G    LGV+     IAG GA  DF+A     T   +P  +  I  K     V
Sbjct: 106 IIVTPGAWYQLGVSPTN--IAGVGAVTDFDAKTTDATPAEIPLTNFNIDLKNGRFQV 160

>gi|7204|lcl|protein:vir:100065 Length: 577 # NCBI annotation: DNA maturase beta subunit # Family: family:all:697 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214228;genbank:gi:61806451;genbank:GeneID:3294745
          Length = 577

 Score = 21.6 bits (44), Expect = 5.2, Method: Composition-based stats.
 Identities = 11/30 (36%), Positives = 16/30 (53%)

Query: 16  VKGEEKTEVTKEKLLELARWIKDISDDTDE 45
           V G   TE+ +EKLL+L    + I    D+
Sbjct: 156 VPGNSMTELMREKLLQLCTEAESILTPKDD 185

  Database: capsid_neck_tail
    Posted date: Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database: 514

Lambda     K        H
   0.311    0.130    0.374

Lambda     K        H
   0.267    0.0410   0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 80,723
Number of Sequences: 514
Number of extensions: 3978
Number of successful extensions: 30
Number of sequences better than 100.0: 26
Number of HSP's better than 100.0 without gapping: 26
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 26
length of query: 161
length of database: 206,069
effective HSP length: 65
effective length of query: 96
effective length of database: 172,659
effective search space: 16575264
effective search space used: 16575264
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 34 (17.7 bits)