BLASTP 2.2.18 [Mar-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_014459.1_cdsid_YP_003856990.1 [gene=18] [protein=gp18]
[protein_id=YP_003856990.1] [location=10472..11086]
         (204 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gi|14686|lcl|protein:vir:2509 Length: 204 # NCBI annotation: maj...   347   7e-98
gi|7515|lcl|protein:vir:99926 Length: 204 # NCBI annotation: gp1...   288   4e-80
gi|13490|lcl|protein:vir:9765 Length: 196 # NCBI annotation: put...    83   2e-18
gi|4248|lcl|protein:vir:94739 Length: 185 # NCBI annotation: maj...    75   6e-16
gi|18502|lcl|protein:vir:1644 Length: 185 # NCBI annotation: Str...    75   8e-16
gi|19189|lcl|protein:vir:9580 Length: 183 # NCBI annotation: unk...    66   3e-13
gi|6399|lcl|protein:vir:98437 Length: 184 # NCBI annotation: ORF...    55   9e-10
gi|14015|lcl|protein:vir:8194 Length: 197 # NCBI annotation: gp1...    43   3e-06
gi|13279|lcl|protein:vir:81256 Length: 314 # NCBI annotation: gp...    34   0.001
gi|4089|lcl|protein:vir:94598 Length: 581 # NCBI annotation: PfW...    30   0.029
gi|12981|lcl|protein:vir:80670 Length: 213 # NCBI annotation: gp...    26   0.30
gi|19767|lcl|protein:vir:6384 Length: 677 # NCBI annotation: ter...    24   1.1
gi|19985|lcl|protein:vir:108237 Length: 179 # NCBI annotation: g...    23   3.8
gi|3994|lcl|protein:vir:99686 Length: 586 # NCBI annotation: DNA...    22   6.4

>gi|14686|lcl|protein:vir:2509 Length: 204 # NCBI annotation: major tail subunit gp14 # Family: family:all:698 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569750;genbank:gi:18496900;genbank:GeneID:932335
          Length = 204

 Score = 347 bits (889), Expect = 7e-98, Method: Compositional matrix adjust.
 Identities = 166/198 (83%), Positives = 178/198 (89%)

Query: 4   PTPPSALGDATKVFAASPSDLETVGGLWFAPFGTKLPTDVDEPLEAAFKNLGFVSADGVT 63
           P ++ GD TKVFAASPSDLETVGGLW+APFGT LPTDVDEPL+ FKNLGF+S +GVT
Sbjct: 5   PVLENSWGDVTKVFAASPSDLETVGGLWYAPFGTPLPTDVDEPLDDKFKNLGFISVEGVT 64

Query: 64  VKIDSQTTPIEVWGGDEIGALRDKFSIEYSMSLFQVLSPEVNAAIFGAGNVSTAAATEAH 123
           VKID QT PIEVWGGDEIGALRDKF+IEYSM LFQVLSPEVNAAIFG GNV T+ AT H
Sbjct: 65  VKIDDQTKPIEVWGGDEIGALRDKFAIEYSMKLFQVLSPEVNAAIFGEGNVLTSEATAMH 124

Query: 124 GARMKVLINSKLPKRCSLVLDSVYEDKIIRQVAQIAQLSGLADIKLVHNAPMAFEPTFKV 183
           GARMKVLINSKLPKRCSLVLDSVYEDK+IRQVAQIAQ +GLAD+KLVHN PMAFEPTFKV
Sbjct: 125 GARMKVLINSKLPKRCSLVLDSVYEDKMIRQVAQIAQKAGLADLKLVHNEPMAFEPTFKV 184

Query: 184 LKGTDGNHVIQYSDDGQI 201
           LKGTDGNHV+QYSDDG I
Sbjct: 185 LKGTDGNHVVQYSDDGVI 202

>gi|7515|lcl|protein:vir:99926 Length: 204 # NCBI annotation: gp13 # Family: family:all:698 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655530;genbank:gi:109392300;genbank:GeneID:4157095
          Length = 204

 Score = 288 bits (736), Expect = 4e-80, Method: Compositional matrix adjust.
 Identities = 135/204 (66%), Positives = 165/204 (80%)

Query: 1   MTQPTPPSALGDATKVFAASPSDLETVGGLWFAPFGTKLPTDVDEPLEAAFKNLGFVSAD 60
           MT P PS LGD ++VFAA+PS L+T GGL+ AP GT LPTDVDEPL+ FK+LG+VS+D
Sbjct: 1   MTTPVVPSPLGDYSQVFAATPSGLQTAGGLYIAPAGTDLPTDVDEPLDPLFKSLGYVSSD 60

Query: 61  GVTVKIDSQTTPIEVWGGDEIGALRDKFSIEYSMSLFQVLSPEVNAAIFGAGNVSTAAAT 120
           GVT+ ID TTPIEVW G+ IG+LRD F+IEYSMSL+QVLSP VNA IFG G+V+TAAAT
Sbjct: 61  GVTISIDGSTTPIEVWSGERIGSLRDAFAIEYSMSLYQVLSPHVNAVIFGDGSVTTAAAT 120

Query: 121 EAHGARMKVLINSKLPKRCSLVLDSVYEDKIIRQVAQIAQLSGLADIKLVHNAPMAFEPT 180
           HG RMKV I+S++PK SLVLD+ +EDK IRQVA++ Q+S + DI LVHN PMAF PT
Sbjct: 121 AEHGNRMKVAISSRMPKMASLVLDAFFEDKAIRQVAELVQMSDIDDITLVHNEPMAFTPT 180

Query: 181 FKVLKGTDGNHVIQYSDDGQIVAA 204
           F V +G +G+HV+QYSDDGQ +AA
Sbjct: 181 FSVFRGHNGDHVVQYSDDGQKIAA 204

>gi|13490|lcl|protein:vir:9765 Length: 196 # NCBI annotation: putative structural protein # Family: family:all:698 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795527;genbank:gi:28876277;genbank:GeneID:1257818
          Length = 196

 Score = 83.2 bits (204), Expect = 2e-18, Method: Compositional matrix adjust.
 Identities = 53/185 (28%), Positives = 93/185 (50%), Gaps = 9/185 (4%)

Query: 12  DATKVFAASPSDLETVGGLWFAPFGTKLPTDVDEPLEAAFKNLGFVSADGVTVKIDSQTT 71
           D V +A P + G ++ AP GT LP D L+ FKNLG+VS DGV + +
Sbjct: 4   DTKNVTSAKP---KAGGAIYSAPLGTTLPEDAKSKLDTKFKNLGYVSEDGVVNEDTRSSE 60

Query: 72  PIEVWGGDEIGALRDKFSIEYSMSLFQVLSPEVNAAIFGAGNVSTAAATEAHGARMKVLI 131
           I+ WGGD +G+++ + +++ L + L+ EV ++GA NV+ H +
Sbjct: 61  NIKAWGGDIVGSVQTEKEDKFTYKLIESLNVEVLKEVYGAANVTGDLDKGIH-----IKS 115

Query: 132 NSKLPKRCSLVLDSVYEDKIIRQVAQ-IAQLSGLADIKLVHNAPMAFEPTFKVLKGTDGN 190
           NSK + ++V+D + I++++ A++ + +IK V + +E T K G+
Sbjct: 116 NSKELEAHAIVIDMIMNGGILKRIVLPNAKVDEVGEIKYVDGEVVGYETTLKCFPDEKGD 175

Query: 191 HVIQY 195
           +Y
Sbjct: 176 THHEY 180

>gi|4248|lcl|protein:vir:94739 Length: 185 # NCBI annotation: major tail protein # Family: family:all:698 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996712;genbank:gi:45597427;genbank:GeneID:2767963
          Length = 185

 Score = 75.1 bits (183), Expect = 6e-16, Method: Compositional matrix adjust.
 Identities = 51/169 (30%), Positives = 82/169 (48%), Gaps = 6/169 (3%)

Query: 28  GGLWFAPFGTKLPTDVDEPLEAAFKNLGFVSADGVTVKIDSQTTPIEVWGGDEIGALRDK 87
           G ++ AP GT LPTD L AFK LG++S DG+ K ++ I+ WGGD + ++ +
Sbjct: 16  GAIYSAPKGTALPTDARTTLNVAFKPLGYISEDGLKNKNSPKSDSIKAWGGDTVATVQTE 75

Query: 88  FSIEYSMSLFQVLSPEVNAAIFGAGNVSTAAATEAHGARMKVLINSKLPKRCSLVLDSVY 147
           +S +L + L+ EV ++GA NV+ T + V NSK +V+D
Sbjct: 76  KEDTFSYTLIEALNVEVLKEVYGADNVTGTLKT-----GITVKANSKELIEHPVVIDMTV 130

Query: 148 EDKIIRQ-VAQIAQLSGLADIKLVHNAPMAFEPTFKVLKGTDGNHVIQY 195
           + + ++ V ++S + DI + + FE T L GN Y
Sbjct: 131 RNGVFKRIVIPQGKVSEIGDISYNDSDAVGFEITLTGLPDKAGNSHYDY 179

>gi|18502|lcl|protein:vir:1644 Length: 185 # NCBI annotation: Structural protein # Family: family:all:698 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695065;genbank:gi:23455756;genbank:GeneID:955486
          Length = 185

 Score = 74.7 bits (182), Expect = 8e-16, Method: Compositional matrix adjust.
 Identities = 51/169 (30%), Positives = 82/169 (48%), Gaps = 6/169 (3%)

Query: 28  GGLWFAPFGTKLPTDVDEPLEAAFKNLGFVSADGVTVKIDSQTTPIEVWGGDEIGALRDK 87
           G ++ AP GT LPTD L AFK LG++S DG+ K ++ I+ WGGD + ++ +
Sbjct: 16  GAIYSAPKGTALPTDAKPTLNIAFKPLGYISEDGLKNKNSPKSDSIKAWGGDTVATVQTE 75

Query: 88  FSIEYSMSLFQVLSPEVNAAIFGAGNVSTAAATEAHGARMKVLINSKLPKRCSLVLDSVY 147
           +S +L + L+ EV ++GA NV+ T + V NSK +V+D
Sbjct: 76  KEDTFSYTLIEALNVEVLKEVYGADNVTGTLKT-----GITVKANSKELIEHPVVIDMTV 130

Query: 148 EDKIIRQ-VAQIAQLSGLADIKLVHNAPMAFEPTFKVLKGTDGNHVIQY 195
           + + ++ V ++S + DI + + FE T L GN Y
Sbjct: 131 RNGVFKRIVIPQGKVSEIGDISYNDSDAVGFEITLTGLPDKAGNSHYDY 179

>gi|19189|lcl|protein:vir:9580 Length: 183 # NCBI annotation: unknown # Family: family:all:698 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862885;genbank:gi:32469421;genbank:GeneID:1461322
          Length = 183

 Score = 66.2 bits (160), Expect = 3e-13, Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 73/129 (56%), Gaps = 7/129 (5%)

Query: 28  GGLWFAPFGTKLPTDVDEPLEAAFKNLGFVSADGVTVKIDSQTTPIEVWGGDEIGALRDK 87
           G ++ AP GT LPTD L+ AF+ LG++S DG+T ++ I+ WGG + +++ +
Sbjct: 16  GAVYSAPLGTALPTDATTKLDQAFEALGYISDDGMTNSNSPESENIKAWGGVVVSSVQKE 75

Query: 88  FSIEYSMSLFQVLSPEVNAAIFGAGNVSTAAATEAHGARMKVLINSK-LPKRCSLVLDSV 146
           + + L + L+ V ++G NVS ++ G +K NSK LP C LV+++V
Sbjct: 76  KTDTFKYMLIEALNLHVLKEVYGPDNVSGDLSS---GITIKA--NSKELPHHC-LVIETV 129

Query: 147 YEDKIIRQV 155
           + +++++
Sbjct: 130 LKGGVLKRI 138

>gi|6399|lcl|protein:vir:98437 Length: 184 # NCBI annotation: ORFp27 # Family: family:all:698 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958286;genbank:gi:41057260;uniprot:Q38601;genbank:GeneID:2732821
          Length = 184

 Score = 54.7 bits (130), Expect = 9e-10, Method: Compositional matrix adjust.
 Identities = 44/173 (25%), Positives = 73/173 (42%), Gaps = 7/173 (4%)

Query: 30  LWFAPFGTKLPTDVDEPLEAAFKNLGFVSADGVTVKIDSQTTPIEVWGGDEIGALRDKFS 89
           + AP GT PT+V L+ A+ +G +S DG + D +T WGG + + K
Sbjct: 16  FYAAPVGTTAPTNVATALDPAWLPVGLLSEDGASESRDQDSTDFYAWGGVLVRTAKSKHK 75

Query: 90  IEYSMSLFQVLSPEVNAAIFGAGNVSTAAATEAHGARMKVLINSKLPKRCSLVLDSVYED 149
           + ++ E N +FG N ++A T V + P+ L L
Sbjct: 76  RQIVVTCL-----EENLVVFGLVNPGSSAVTATGVTTRTVKVPKADPRAFVLELRDGAVK 130

Query: 150 KIIRQVAQIAQLSGLADIKLVHNAPMAFEPTFKVLKGTDGNHVIQYSDDGQIV 202
           K R+V ++ + ++ L +A A+E T + DG + +DD Q V
Sbjct: 131 K--RRVIPKGEVESVGEVTLSDSALTAYELTITIYPAADGTLYLDITDDPQAV 181

>gi|14015|lcl|protein:vir:8194 Length: 197 # NCBI annotation: gp14 # Family: family:all:698 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817987;genbank:gi:29566421;genbank:GeneID:2700975
          Length = 197

 Score = 42.7 bits (99), Expect = 3e-06, Method: Compositional matrix adjust.
 Identities = 47/183 (25%), Positives = 79/183 (43%), Gaps = 7/183 (3%)

Query: 10  LGDATKVFAASPS-DLETVGGLWFAPFGTKLPTDVDEPLEAAFKNLGFVSADGVTVKIDS 68
           + D+ V+AA S D E G AP GT LPTD L+AA + G++ DG I
Sbjct: 1   MADSKNVWAAGRSADDEAFFG---APLGTPLPTDAIAELDAALEPHGWMGDDGFVNNIQR 57

Query: 69  QTTPIEVWGGDEIGALRDKFSIEYSMSLFQVLSPEVNAAIFGAGNVSTAAATEAHGARMK 128
           T + + G I +D + +++ + +P V +FG NV T+ H
Sbjct: 58  DVTKHKDFAGTTIKTTQDNYEETVAVTCCES-NPVVLKTVFGDSNVD-VDFTDGHRKITI 115

Query: 129 VLINSKLPKRCSLVLDSVYEDKIIRQVAQIAQLSGLADIKLVHNAPMAFEPTFKVLKGTD 188
           + LP++ S V+ V K V Q++ + ++ + + + + T K
Sbjct: 116 RHDEAPLPRK-SFVVRVVDGVKTRMLVIPEGQVTEIGEVTWLSSELVQYTLTIDCYKPAK 174

Query: 189 GNH 191
           G+H
Sbjct: 175 GSH 177

>gi|13279|lcl|protein:vir:81256 Length: 314 # NCBI annotation: gp12, major tail protein # Family: family:all:698 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456742;genbank:gi:157168385;uniprot:Q9MBJ3;genbank:GeneID:5580379
          Length = 314

 Score = 33.9 bits (76), Expect = 0.001, Method: Compositional matrix adjust.
 Identities = 25/90 (27%), Positives = 44/90 (48%), Gaps = 12/90 (13%)

Query: 17  FAASPSDLETVGGLWFAPFGTKLPTDVD-EPLEAAFKNLGFVSADGVTVKIDSQT---TP 72
           FAA S L G +AP+GT LP E L + NLG+ + G+ ++ ++ TP
Sbjct: 13  FAAELSRLGVTGTANYAPYGTALPAPHSLERLTVPYANLGWFADSGIVETLNEESNSFTP 72

Query: 73  IEVWGGDEIGALRDKFS---IEYSMSLFQV 99
           ++ +G +R S + + M+L+ +
Sbjct: 73  LQA-----VGPIRSAVSSRELTFQMTLWSI 97

>gi|4089|lcl|protein:vir:94598 Length: 581 # NCBI annotation: PfWMP4_40 # Family: family:all:543 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762670;genbank:gi:115304378;genbank:GeneID:5142298
          Length = 581

 Score = 29.6 bits (65), Expect = 0.029, Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 34/78 (43%), Gaps = 11/78 (14%)

Query: 125 ARMKVLINSKLPKRCSLVLDSVYEDKIIRQVAQIAQLSGLADIKLVHNAPMAFEPTFKVL 184
           AR+ VL N K ++ S+Y D+II+ +A+ + D K V N E
Sbjct: 446 ARLVVLWNVKRMYVETIAFQSLYRDRIIKHLAEKKIQCAVLDYKPVGNKHKRIE------ 499

Query: 185 KGTDGNHVIQYSDDGQIV 202
           +H+ Y + G +V
Sbjct: 500 -----SHLSSYFNQGNVV 512

>gi|12981|lcl|protein:vir:80670 Length: 213 # NCBI annotation: gp11 # Family: family:all:698 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285587;genbank:gi:148727093;genbank:GeneID:5247041
          Length = 213

 Score = 26.2 bits (56), Expect = 0.30, Method: Compositional matrix adjust.
 Identities = 33/139 (23%), Positives = 57/139 (41%), Gaps = 26/139 (18%)

Query: 54  LGFVSADGVTVKIDSQTTPIEVW-GGDEIGALRDKFSIEYSMSLFQ--------VLSPEV 104
           LG++S DG +K + +T ++ W D + + + SIE S L + +V
Sbjct: 42  LGYLSDDGFKIKPERKTDDLKAWQNADVVRTVATESSIEISFQLIESKKEVIELFWQSKV 101

Query: 105 NA-AIFGAGNVSTAAATEAHGARMKVLINSKLPKRCSLVLDSVYEDKIIRQVAQIAQLSG 163
           A A G+ ++S A T H +L++D V D++IR +L
Sbjct: 102 TAGADSGSFDISPGATTGVH----------------ALLMDIVDGDQVIRYYFPEVELID 145

Query: 164 LADIKLVHNAPMAFEPTFK 182
           +IK + + T K
Sbjct: 146 RDEIKGKNGEVYGYGVTLK 164

>gi|19767|lcl|protein:vir:6384 Length: 677 # NCBI annotation: terminase large subunit TerL # Family: family:all:140 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918997;genbank:gi:34610172;genbank:gi:91214212;genbank:GeneID:2559603
          Length = 677

 Score = 24.3 bits (51), Expect = 1.1, Method: Composition-based stats.
 Identities = 10/21 (47%), Positives = 13/21 (61%)

Query: 177 FEPTFKVLKGTDGNHVIQYSD 197
           FEPTFK+LK D + +D
Sbjct: 249 FEPTFKLLKWDDCGDAVSCAD 269

>gi|19985|lcl|protein:vir:108237 Length: 179 # NCBI annotation: gp15 # Family: family:all:4816 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552344;genbank:gi:160700664;genbank:GeneID:5758957
          Length = 179

 Score = 22.7 bits (47), Expect = 3.8, Method: Compositional matrix adjust.
 Identities = 11/43 (25%), Positives = 19/43 (44%), Gaps = 4/43 (9%)

Query: 39  LPTDVDEPLEAAFKNLGF----VSADGVTVKIDSQTTPIEVWG 77
           +P + +P G+ V ADG+ ++ + T I WG
Sbjct: 29  IPESITDPFVTTTGKWGYLGLLVGADGINIQREWDETDIPAWG 71

>gi|3994|lcl|protein:vir:99686 Length: 586 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249597;genbank:gi:68299748;genbank:GeneID:3800001
          Length = 586

 Score = 21.9 bits (45), Expect = 6.4, Method: Compositional matrix adjust.
 Identities = 14/59 (23%), Positives = 26/59 (44%), Gaps = 3/59 (5%)

Query: 138 RCSLVLDSVYEDKIIRQVAQIAQLSGLADIKLVHNAPMAFEPTFKVLKGTDGNHVIQYS 196
           RC+L K +R + + G I ++ +A + ++ + DG H I+YS
Sbjct: 438 RCALTETKSTGQKEMRIADTLEPVMGAHRIVVMESA---IQKDYQTARNVDGTHDIKYS 493

  Database: capsid_neck_tail
    Posted date:  Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database:  514

Lambda     K      H
   0.316    0.133    0.381

Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 86,085
Number of Sequences: 514
Number of extensions: 3573
Number of successful extensions: 21
Number of sequences better than 100.0: 15
Number of HSP's better than 100.0 without gapping: 14
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 6
Number of HSP's gapped (non-prelim): 15
length of query: 204
length of database: 206,069
effective HSP length: 67
effective length of query: 137
effective length of database: 171,631
effective search space: 23513447
effective search space used: 23513447
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 35 (18.1 bits)