BLASTP 2.2.18 [Mar-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_020871.1_cdsid_YP_007676753.1 [gene=AG2_086] [protein=hypothetical protein] [protein_id=YP_007676753.1] [location=51754..52176]
         (140 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                    Score   E
Sequences producing significant alignments:                         (bits) Value

gi|24940|lcl|protein:vir:80620 Length: 140 # NCBI annotation: gp...   285   1e-79
gi|21558|lcl|protein:vir:63764 Length: 123 # NCBI annotation: gp...   250   6e-69
gi|25129|lcl|protein:vir:80710 Length: 140 # NCBI annotation: st...   229   1e-62
gi|9414|lcl|protein:vir:99300 Length: 142 # NCBI annotation: hyp...   204   4e-55
gi|22597|lcl|protein:vir:95739 Length: 142 # NCBI annotation: OR...   204   4e-55
gi|23709|lcl|protein:vir:96672 Length: 121 # NCBI annotation: OR...   185   2e-49
gi|24314|lcl|protein:vir:100800 Length: 155 # NCBI annotation: h...   178   2e-47
gi|11622|lcl|protein:vir:78869 Length: 439 # NCBI annotation: Te...    22   4.5
gi|5997|lcl|protein:vir:95545 Length: 195 # NCBI annotation: hyp...    21   6.6
gi|6927|lcl|protein:vir:106715 Length: 507 # NCBI annotation: gp...    21   6.9
gi|11323|lcl|protein:vir:78588 Length: 507 # NCBI annotation: Te...    21   6.9
gi|16025|lcl|protein:vir:9814 Length: 403 # NCBI annotation: put...    21   7.6
gi|16299|lcl|protein:vir:3027 Length: 375 # NCBI annotation: ter...    21   7.9

>gi|24940|lcl|protein:vir:80620 Length: 140 # NCBI annotation: gp8 # Family: family:all:2792 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468474;genbank:gi:157325049;genbank:GeneID:5601587
          Length = 140

 Score = 285 bits (730), Expect = 1e-79, Method: Compositional matrix adjust.
 Identities = 140/140 (100%), Positives = 140/140 (100%)

Query: 1   MASVTNQTVHTGNTVYLMIKGKIVGRCQSASGERSYGTTGVYEIGSIMPQEHVYLKYEGS 60
           MASVTNQTVHTGNTVYLMIKGKIVGRCQSASGERSYGTTGVYEIGSIMPQEHVYLKYEGS
Sbjct: 1   MASVTNQTVHTGNTVYLMIKGKIVGRCQSASGERSYGTTGVYEIGSIMPQEHVYLKYEGS 60

Query: 61  LTVDRLRMRKEDLAKLGITALGEDILKRDVIDIVMMDNTTKEVIIAYRGCSAETYNEEIK 120
           LTVDRLRMRKEDLAKLGITALGEDILKRDVIDIVMMDNTTKEVIIAYRGCSAETYNEEIK
Sbjct: 61  LTVDRLRMRKEDLAKLGITALGEDILKRDVIDIVMMDNTTKEVIIAYRGCSAETYNEEIK 120

Query: 121 ANEIVSESARFLYLSAANVK 140
           ANEIVSESARFLYLSAANVK
Sbjct: 121 ANEIVSESARFLYLSAANVK 140


>gi|21558|lcl|protein:vir:63764 Length: 123 # NCBI annotation: gp25 # Family: family:all:2792 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547630;genbank:GeneID:3783515
          Length = 123

 Score = 250 bits (638), Expect = 6e-69, Method: Compositional matrix adjust.
 Identities = 123/123 (100%), Positives = 123/123 (100%)

Query: 18  MIKGKIVGRCQSASGERSYGTTGVYEIGSIMPQEHVYLKYEGSLTVDRLRMRKEDLAKLG 77
           MIKGKIVGRCQSASGERSYGTTGVYEIGSIMPQEHVYLKYEGSLTVDRLRMRKEDLAKLG
Sbjct: 1   MIKGKIVGRCQSASGERSYGTTGVYEIGSIMPQEHVYLKYEGSLTVDRLRMRKEDLAKLG 60

Query: 78  ITALGEDILKRDVIDIVMMDNTTKEVIIAYRGCSAETYNEEIKANEIVSESARFLYLSAA 137
           ITALGEDILKRDVIDIVMMDNTTKEVIIAYRGCSAETYNEEIKANEIVSESARFLYLSAA
Sbjct: 61  ITALGEDILKRDVIDIVMMDNTTKEVIIAYRGCSAETYNEEIKANEIVSESARFLYLSAA 120

Query: 138 NVK 140
           NVK
Sbjct: 121 NVK 123


>gi|25129|lcl|protein:vir:80710 Length: 140 # NCBI annotation: structural protein # Family: family:all:2792 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504133;genbank:gi:158079320;genbank:GeneID:5666358
          Length = 140

 Score = 229 bits (584), Expect = 1e-62, Method: Compositional matrix adjust.
 Identities = 105/140 (75%), Positives = 124/140 (88%)

Query: 1   MASVTNQTVHTGNTVYLMIKGKIVGRCQSASGERSYGTTGVYEIGSIMPQEHVYLKYEGS 60
           MASV NQTVHTGNTVYLMI  KI+GR QSASGER YGT G+YEIGSIMPQEHVYLKYEG+
Sbjct: 1   MASVGNQTVHTGNTVYLMIGNKIIGRAQSASGERQYGTQGIYEIGSIMPQEHVYLKYEGT 60

Query: 61  LTVDRLRMRKEDLAKLGITALGEDILKRDVIDIVMMDNTTKEVIIAYRGCSAETYNEEIK 120
           +T++R+RM+KEDLA LGITALGEDIL+RD+IDIVMMDN TKE+++AYRGCSA +Y+E
Sbjct: 61  ITLERMRMKKEDLASLGITALGEDILQRDIIDIVMMDNLTKEIVVAYRGCSAISYSESFT 120

Query: 121 ANEIVSESARFLYLSAANVK 140
           ANE+ SES +F YL++A VK
Sbjct: 121 ANEVTSESTQFTYLTSAKVK 140


>gi|9414|lcl|protein:vir:99300 Length: 142 # NCBI annotation: hypothetical protein # Family: family:all:2792 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024480;genbank:gi:48696439;genbank:GeneID:2948028
          Length = 142

 Score = 204 bits (519), Expect = 4e-55, Method: Compositional matrix adjust.
 Identities = 95/138 (68%), Positives = 116/138 (84%)

Query: 1   MASVTNQTVHTGNTVYLMIKGKIVGRCQSASGERSYGTTGVYEIGSIMPQEHVYLKYEGS 60
           MAS   QTVHTGNTV LMIKGK VGR QSASG+R YGTTGVYEIGSIMPQEHVYL+YEG+
Sbjct: 1   MASEAKQTVHTGNTVLLMIKGKPVGRAQSASGQREYGTTGVYEIGSIMPQEHVYLRYEGT 60

Query: 61  LTVDRLRMRKEDLAKLGITALGEDILKRDVIDIVMMDNTTKEVIIAYRGCSAETYNEEIK 120
           +TV+RLRM+KE+ A LG  +LGE+ILK+D+IDI+++DN TK+VII+Y GCSA  YNE  +
Sbjct: 61  ITVERLRMKKENFADLGYASLGEEILKKDIIDILVVDNLTKQVIISYHGCSANNYNETWQ 120

Query: 121 ANEIVSESARFLYLSAAN 138
            NEIV+E   F YL+A++
Sbjct: 121 TNEIVTEEIEFSYLTASD 138


>gi|22597|lcl|protein:vir:95739 Length: 142 # NCBI annotation: ORF105 # Family: family:all:2792 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240911;genbank:gi:66395051;genbank:GeneID:5132680
          Length = 142

 Score = 204 bits (519), Expect = 4e-55, Method: Compositional matrix adjust.
 Identities = 95/138 (68%), Positives = 116/138 (84%)

Query: 1   MASVTNQTVHTGNTVYLMIKGKIVGRCQSASGERSYGTTGVYEIGSIMPQEHVYLKYEGS 60
           MAS   QTVHTGNTV LMIKGK VGR QSASG+R YGTTGVYEIGSIMPQEHVYL+YEG+
Sbjct: 1   MASEAKQTVHTGNTVLLMIKGKPVGRAQSASGQREYGTTGVYEIGSIMPQEHVYLRYEGT 60

Query: 61  LTVDRLRMRKEDLAKLGITALGEDILKRDVIDIVMMDNTTKEVIIAYRGCSAETYNEEIK 120
           +TV+RLRM+KE+ A LG  +LGE+ILK+D+IDI+++DN TK+VII+Y GCSA  YNE  +
Sbjct: 61  ITVERLRMKKENFADLGYASLGEEILKKDIIDILVVDNLTKQVIISYHGCSANNYNETWQ 120

Query: 121 ANEIVSESARFLYLSAAN 138
            NEIV+E   F YL+A++
Sbjct: 121 TNEIVTEEIEFSYLTASD 138


>gi|23709|lcl|protein:vir:96672 Length: 121 # NCBI annotation: ORF106 # Family: family:all:2792 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238555;genbank:gi:66391358;genbank:GeneID:5130454
          Length = 121

 Score = 185 bits (469), Expect = 2e-49, Method: Compositional matrix adjust.
 Identities = 87/115 (75%), Positives = 99/115 (86%)

Query: 1   MASVTNQTVHTGNTVYLMIKGKIVGRCQSASGERSYGTTGVYEIGSIMPQEHVYLKYEGS 60
           MAS   QTVHTGNTV LMIKGK VGR QSASG RSYGT GVYEIGSIMPQEHVYLKYEG
Sbjct: 1   MASQAKQTVHTGNTVMLMIKGKPVGRAQSASGTRSYGTEGVYEIGSIMPQEHVYLKYEGE 60

Query: 61  LTVDRLRMRKEDLAKLGITALGEDILKRDVIDIVMMDNTTKEVIIAYRGCSAETY 115
           LTV+RLRM+KE+ AKLG  +LGE+ILK+D+IDIV++DN TKEV+++Y GCSA  Y
Sbjct: 61  LTVERLRMKKENFAKLGYASLGEEILKKDIIDIVVIDNLTKEVLVSYHGCSANNY 115


>gi|24314|lcl|protein:vir:100800 Length: 155 # NCBI annotation: hypothetical protein # Family: family:all:2792 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164737;genbank:gi:56693150;genbank:GeneID:3197433
          Length = 155

 Score = 178 bits (452), Expect = 2e-47, Method: Compositional matrix adjust.
 Identities = 79/140 (56%), Positives = 109/140 (77%)

Query: 1   MASVTNQTVHTGNTVYLMIKGKIVGRCQSASGERSYGTTGVYEIGSIMPQEHVYLKYEGS 60
           MA+V NQTV TGN +Y+M+K  ++GR QS +G+RS+GTTGVYE+G+IMP+EHV+LKY G+
Sbjct: 1   MATVKNQTVETGNRIYIMVKSAVLGRAQSLTGDRSFGTTGVYELGTIMPREHVFLKYTGT 60

Query: 61  LTVDRLRMRKEDLAKLGITALGEDILKRDVIDIVMMDNTTKEVIIAYRGCSAETYNEEIK 120
           ++V+R RM   +     + ALGED+LK DV+DI + DNTT ++II YRGCS + Y E  +
Sbjct: 61  VSVERYRMTTNNFTNTKVAALGEDVLKIDVLDIAVKDNTTGKLIIVYRGCSIDDYQETYR 120

Query: 121 ANEIVSESARFLYLSAANVK 140
           ANEI  ESARF YL+A+N++
Sbjct: 121 ANEITGESARFYYLTASNLQ 140


>gi|11622|lcl|protein:vir:78869 Length: 439 # NCBI annotation: TerL # Family: family:all:54 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468842;genbank:gi:157325434;genbank:GeneID:5601865
          Length = 439

 Score = 21.6 bits (44), Expect = 4.5, Method: Composition-based stats.
 Identities = 10/25 (40%), Positives = 14/25 (56%)

Query: 67  RMRKEDLAKLGITALGEDILKRDVI 91
           R  +E+L K+G+   G D    DVI
Sbjct: 337 RWLREELEKVGVDTAGADNNAHDVI 361


>gi|5997|lcl|protein:vir:95545 Length: 195 # NCBI annotation: hypothetical protein # Family: family:all:10964 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293365;genbank:gi:148912786;genbank:GeneID:5228197
          Length = 195

 Score = 20.8 bits (42), Expect = 6.6, Method: Compositional matrix adjust.
 Identities = 9/27 (33%), Positives = 14/27 (51%)

Query: 39  TGVYEIGSIMPQEHVYLKYEGSLTVDR 65
           + VY++    P   V+  Y G +TV R
Sbjct: 163 SAVYDVDVTYPDGTVHRYYSGPITVSR 189


>gi|6927|lcl|protein:vir:106715 Length: 507 # NCBI annotation: gp19 # Family: family:all:144 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944327;genbank:gi:38638626;genbank:GeneID:2657344
          Length = 507

 Score = 20.8 bits (42), Expect = 6.9, Method: Composition-based stats.
 Identities = 9/33 (27%), Positives = 16/33 (48%)

Query: 36  YGTTGVYEIGSIMPQEHVYLKYEGSLTVDRLRM 68
           Y    + E G+I P+EH+   +   L    +R+
Sbjct: 306 YQQVPLSEFGAIFPREHLQYYHAADLPKQFVRV 338


>gi|11323|lcl|protein:vir:78588 Length: 507 # NCBI annotation: TerL # Family: family:all:144 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294855;genbank:gi:149882918;genbank:GeneID:5291059
          Length = 507

 Score = 20.8 bits (42), Expect = 6.9, Method: Composition-based stats.
 Identities = 9/33 (27%), Positives = 16/33 (48%)

Query: 36  YGTTGVYEIGSIMPQEHVYLKYEGSLTVDRLRM 68
           Y    + E G+I P+EH+   +   L    +R+
Sbjct: 306 YQQVPLSEFGAIFPREHLQYYHAADLPKQFVRV 338


>gi|16025|lcl|protein:vir:9814 Length: 403 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795576;genbank:gi:28876345;genbank:GeneID:1257867
          Length = 403

 Score = 20.8 bits (42), Expect = 7.6, Method: Compositional matrix adjust.
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 70  KEDLAKLGITALGEDILKRDV 90
           +E+L KLG+  LG     +DV
Sbjct: 306 REELHKLGVFTLGAPNNSKDV 326


>gi|16299|lcl|protein:vir:3027 Length: 375 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438140;genbank:gi:16271803;genbank:GeneID:929285
          Length = 375

 Score = 20.8 bits (42), Expect = 7.9, Method: Compositional matrix adjust.
 Identities = 9/21 (42%), Positives = 13/21 (61%)

Query: 70  KEDLAKLGITALGEDILKRDV 90
           +E+L KLG+  LG     +DV
Sbjct: 278 REELHKLGVFTLGAPNNSKDV 298


  Database: capsid_neck_tail
    Posted date:  Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database:  514

Lambda     K      H
   0.316    0.132    0.358

Lambda     K      H
   0.267   0.0410    0.140


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 52,453
Number of Sequences: 514
Number of extensions: 2121
Number of successful extensions: 13
Number of sequences better than 100.0: 13
Number of HSP's better than 100.0 without gapping: 13
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 13
length of query: 140
length of database: 206,069
effective HSP length: 63
effective length of query: 77
effective length of database: 173,687
effective search space: 13373899
effective search space used: 13373899
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 33 (17.3 bits)
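
The effective-length figures in the closing statistics can be cross-checked by hand. The sketch below is not part of the BLAST output. It assumes the usual edge-effect correction, in which the effective HSP length is subtracted once from the query and once per database sequence, and it uses only numbers printed in the report. The function name `effective_search_space` is hypothetical.

```python
# Values copied from the statistics block at the end of the report.
QUERY_LENGTH = 140          # "length of query"
DB_LETTERS = 206_069        # "length of database"
DB_SEQUENCES = 514          # "Number of sequences in database"
EFFECTIVE_HSP_LENGTH = 63   # "effective HSP length"


def effective_search_space(query_len, db_len, n_seqs, hsp_len):
    """Return (effective query length, effective database length, search space).

    Assumed correction: the HSP length is removed once from the query
    and once from every database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_len - n_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


eff_query, eff_db, space = effective_search_space(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print(eff_query, eff_db, space)
```

Under that assumption the three results agree with the report's "effective length of query", "effective length of database", and "effective search space" lines.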