BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_018838.1_cdsid_YP_006906412.1 [gene=11] [protein=putative major tail subunit] [protein_id=YP_006906412.1] [location=7160..7795]
         (211 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gi|12981|lcl|protein:vir:80670 Length: 213 # NCBI annotation: gp...    370   e-105
gi|13490|lcl|protein:vir:9765 Length: 196 # NCBI annotation: put...     69   4e-14
gi|19189|lcl|protein:vir:9580 Length: 183 # NCBI annotation: unk...     56   3e-10
gi|18502|lcl|protein:vir:1644 Length: 185 # NCBI annotation: Str...     49   6e-08
gi|4248|lcl|protein:vir:94739 Length: 185 # NCBI annotation: maj...     48   7e-08
gi|10070|lcl|protein:vir:99006 Length: 303 # NCBI annotation: gp...     44   2e-06
gi|3569|lcl|protein:vir:101656 Length: 324 # NCBI annotation: gp...     40   3e-05
gi|19394|lcl|protein:vir:7861 Length: 324 # NCBI annotation: gp1...     40   3e-05
gi|6399|lcl|protein:vir:98437 Length: 184 # NCBI annotation: ORF...     33   0.003
gi|13279|lcl|protein:vir:81256 Length: 314 # NCBI annotation: gp...     33   0.003
gi|14015|lcl|protein:vir:8194 Length: 197 # NCBI annotation: gp1...     27   0.20
gi|11001|lcl|protein:vir:78275 Length: 282 # NCBI annotation: Pu...     27   0.28
gi|11245|lcl|protein:vir:78506 Length: 282 # NCBI annotation: gp...     26   0.31
gi|19796|lcl|protein:vir:2349 Length: 283 # NCBI annotation: gp1...     26   0.34
gi|7515|lcl|protein:vir:99926 Length: 204 # NCBI annotation: gp1...     26   0.45
gi|14224|lcl|protein:vir:8332 Length: 271 # NCBI annotation: gp4...     24   1.2
gi|14686|lcl|protein:vir:2509 Length: 204 # NCBI annotation: maj...     24   1.6
gi|7042|lcl|protein:vir:98644 Length: 576 # NCBI annotation: put...     23   2.8
gi|1944|lcl|protein:vir:99839 Length: 250 # NCBI annotation: hyp...     23   3.7
gi|18282|lcl|protein:vir:7995 Length: 269 # NCBI annotation: gp1...     22   4.6
gi|8449|lcl|protein:vir:105827 Length: 269 # NCBI annotation: gp...     22   4.8
gi|9704|lcl|protein:vir:102610 Length: 269 # NCBI annotation: gp...     22   4.8
gi|19024|lcl|protein:vir:9640 Length: 576 # NCBI annotation: lar...     22   6.7
gi|19181|lcl|protein:vir:9572 Length: 459 # NCBI annotation: gp3...     22   8.1

>gi|12981|lcl|protein:vir:80670 Length: 213 # NCBI annotation: gp11 # Family: family:all:698 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285587;genbank:gi:148727093;genbank:GeneID:5247041
          Length = 213

 Score = 370 bits (950), Expect = e-105, Method: Compositional matrix adjust.
Identities = 182/190 (95%), Positives = 185/190 (97%) Query: 5 MAATRKASNVRSAVTGDVYIGDAHAGDTIKGVEAVPSGLTALGYLSDDGFKIKPERKTDD 64 MA TRKASNVRSAVTGDVYIG AHAGDTI GV+ VP GLTALGYLSDDGFKIKPERKTDD Sbjct: 1 MAGTRKASNVRSAVTGDVYIGKAHAGDTIDGVKTVPDGLTALGYLSDDGFKIKPERKTDD 60 Query: 65 LKAWQNADVVRTVATESSIEISFQLIESKKEVIELFWQSKVTAGSDSGSFDISPGATTGV 124 LKAWQNADVVRTVATESSIEISFQLIESKKEVIELFWQSKVTAG+DSGSFDISPGATTGV Sbjct: 61 LKAWQNADVVRTVATESSIEISFQLIESKKEVIELFWQSKVTAGADSGSFDISPGATTGV 120 Query: 125 HALLMDIVDGDQVIRYYFPEVELVDRDEIKGKNGEVYGYGVTLKAYPAQINKKGDAVSGR 184 HALLMDIVDGDQVIRYYFPEVEL+DRDEIKGKNGEVYGYGVTLKAYPAQINKKGDAVSGR Sbjct: 121 HALLMDIVDGDQVIRYYFPEVELIDRDEIKGKNGEVYGYGVTLKAYPAQINKKGDAVSGR 180 Query: 185 GWMTALKADT 194 GWMTALKADT Sbjct: 181 GWMTALKADT 190 >gi|13490|lcl|protein:vir:9765 Length: 196 # NCBI annotation: putative structural protein # Family: family:all:698 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795527;genbank:gi:28876277;genbank:GeneID :1257818 Length = 196 Score = 68.9 bits (167), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 49/137 (35%), Positives = 79/137 (57%), Gaps = 7/137 (5%) Query: 46 LGYLSDDGFKIKPERKTDDLKAWQNADVVRTVATESSIEISFQLIESKK-EVI-ELFWQS 103 LGY+S+DG + R ++++KAW D+V +V TE + +++LIES EV+ E++ + Sbjct: 43 LGYVSEDGVVNEDTRSSENIKAW-GGDIVGSVQTEKEDKFTYKLIESLNVEVLKEVYGAA 101 Query: 104 KVTAGSDSGSFDISPGATTGVHALLMD-IVDGDQVIRYYFPEVELVDRDEIKGKNGEVYG 162 VT D G S HA+++D I++G + R P ++ + EIK +GEV G Sbjct: 102 NVTGDLDKGIHIKSNSKELEAHAIVIDMIMNGGILKRIVLPNAKVDEVGEIKYVDGEVVG 161 Query: 163 YGVTLKAYPAQINKKGD 179 Y TLK +P ++KGD Sbjct: 162 YETTLKCFP---DEKGD 175 >gi|19189|lcl|protein:vir:9580 Length: 183 # NCBI annotation: unknown # Family: family:all:698 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862885;genbank:gi:32469421;genbank:GeneID :1461322 Length = 183 Score = 56.2 bits (134), Expect = 3e-10, Method: Compositional matrix adjust. 
Identities = 51/178 (28%), Positives = 82/178 (46%), Gaps = 20/178 (11%) Query: 5 MAATRKASNVRSAVTGDVY---IGDAHAGDTIKGVEAVPSGLTALGYLSDDGFKIKPERK 61 MA + + + G VY +G A D ++ ALGY+SDDG + Sbjct: 1 MATEANVTTAKPKIGGAVYSAPLGTALPTDATTKLD---QAFEALGYISDDGMTNSNSPE 57 Query: 62 TDDLKAWQNADVVRTVATESSIEISFQLIESKKEVIELFWQSKVTAGSDSGSFDISPGAT 121 ++++KAW VV +V E + + LIE+ + L +V G D+ S D+S G T Sbjct: 58 SENIKAWGGV-VVSSVQKEKTDTFKYMLIEA----LNLHVLKEV-YGPDNVSGDLSSGIT 111 Query: 122 TGV-------HALLMD-IVDGDQVIRYYFPEVELVDRDEIKGKNGEVYGYGVTLKAYP 171 H L+++ ++ G + R P ++ DEI +G V GYG T+ A+P Sbjct: 112 IKANSKELPHHCLVIETVLKGGVLKRIVIPSGKVTAIDEITYNDGSVLGYGTTVTAFP 169 >gi|18502|lcl|protein:vir:1644 Length: 185 # NCBI annotation: Structural protein # Family: family:all:698 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695065;genbank:gi:23455756;genbank:GeneID :955486 Length = 185 Score = 48.5 bits (114), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 39/129 (30%), Positives = 64/129 (49%), Gaps = 4/129 (3%) Query: 46 LGYLSDDGFKIKPERKTDDLKAWQNADVVRTVATESSIEISFQLIESKK-EVI-ELFWQS 103 LGY+S+DG K K K+D +KAW D V TV TE S+ LIE+ EV+ E++ Sbjct: 42 LGYISEDGLKNKNSPKSDSIKAW-GGDTVATVQTEKEDTFSYTLIEALNVEVLKEVYGAD 100 Query: 104 KVTAGSDSGSFDISPGATTGVHALLMDIVDGDQVI-RYYFPEVELVDRDEIKGKNGEVYG 162 VT +G + H +++D+ + V R P+ ++ + +I + + G Sbjct: 101 NVTGTLKTGITVKANSKELIEHPVVIDMTVRNGVFKRIVIPQGKVSEIGDISYNDSDAVG 160 Query: 163 YGVTLKAYP 171 + +TL P Sbjct: 161 FEITLTGLP 169 >gi|4248|lcl|protein:vir:94739 Length: 185 # NCBI annotation: major tail protein # Family: family:all:698 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996712;genbank:gi:45597427;genbank:GeneID :2767963 Length = 185 Score = 48.1 bits (113), Expect = 7e-08, Method: Compositional matrix adjust. 
Identities = 39/129 (30%), Positives = 64/129 (49%), Gaps = 4/129 (3%) Query: 46 LGYLSDDGFKIKPERKTDDLKAWQNADVVRTVATESSIEISFQLIESKK-EVI-ELFWQS 103 LGY+S+DG K K K+D +KAW D V TV TE S+ LIE+ EV+ E++ Sbjct: 42 LGYISEDGLKNKNSPKSDSIKAW-GGDTVATVQTEKEDTFSYTLIEALNVEVLKEVYGAD 100 Query: 104 KVTAGSDSGSFDISPGATTGVHALLMDIVDGDQVI-RYYFPEVELVDRDEIKGKNGEVYG 162 VT +G + H +++D+ + V R P+ ++ + +I + + G Sbjct: 101 NVTGTLKTGITVKANSKELIEHPVVIDMTVRNGVFKRIVIPQGKVSEIGDISYNDSDAVG 160 Query: 163 YGVTLKAYP 171 + +TL P Sbjct: 161 FEITLTGLP 169 >gi|10070|lcl|protein:vir:99006 Length: 303 # NCBI annotation: gp35 # Family: family:all:698 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655900;genbank:gi:109521472;genbank:GeneI D:4157971 Length = 303 Score = 43.9 bits (102), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 30/130 (23%), Positives = 64/130 (49%), Gaps = 10/130 (7%) Query: 51 DDGFKIKPERKTDDLKAWQNADVVRTVATESSIEISFQLIESKKEVIELFW-----QSKV 105 D G I + ++ D++ + + VRT+ T+ + + IE+ +EVIE +W + + Sbjct: 59 DAGLSITSDIESSDIEGYGEVEPVRTIITKRTTRFNASFIETNREVIEKYWGIELDATNL 118 Query: 106 TAGSDSGSFDISPGATTGV--HALLM--DIVDGDQVIRYY-FPEVELVDRDEIKGKNGEV 160 T + G +P + +L+ D V+G+ + Y+ P+V+L + D + ++ Sbjct: 119 TVSAQGGVTVKAPPRPKNIFYRCILLGQDEVNGEDLFPYWILPKVKLTEVDNMDFRDDAE 178 Query: 161 YGYGVTLKAY 170 Y +T +A+ Sbjct: 179 IQYRMTFQAF 188 >gi|3569|lcl|protein:vir:101656 Length: 324 # NCBI annotation: gp19 # Family: family:all:1912 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654774;genbank:gi:109302772;genbank:GeneI D:4156090 Length = 324 Score = 39.7 bits (91), Expect = 3e-05, Method: Compositional matrix adjust. 
Identities = 42/155 (27%), Positives = 65/155 (41%), Gaps = 10/155 (6%) Query: 26 DAHAGDTIKGVEAVPSG-LTALGYLSDDGFKIKPERKTDDLKAWQNADVVRTVATESSIE 84 D + DT+ E P G +G +++DG + P+ TDD K WQ+ RT TE E Sbjct: 47 DGNLKDTLLS-EDFPGGRFYEIGAITEDGVEFNPKFSTDDTKIWQSRRAQRTDVTEDDEE 105 Query: 85 ISFQLIESKKEVIELFWQ---SKVTAGSDSGSFDISPGATTGVHALLMDI-VDGD----Q 136 + F E+ + L + V + +G P T V+ ++ I VDG + Sbjct: 106 VMFTAAENTPLIDYLRYNLPLENVPSVGTAGYKATKPNYTDMVYRQIVVIGVDGRMDEAE 165 Query: 137 VIRYYFPEVELVDRDEIKGKNGEVYGYGVTLKAYP 171 + P V L + K E+ G +T YP Sbjct: 166 YVAEVRPRVSLTKVGKQSFKAKEIDGTELTFGVYP 200 >gi|19394|lcl|protein:vir:7861 Length: 324 # NCBI annotation: gp18 # Family: family:all:1912 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817468;genbank:gi:29565897;genbank:GeneID :1259090 Length = 324 Score = 39.7 bits (91), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 42/155 (27%), Positives = 65/155 (41%), Gaps = 10/155 (6%) Query: 26 DAHAGDTIKGVEAVPSG-LTALGYLSDDGFKIKPERKTDDLKAWQNADVVRTVATESSIE 84 D + DT+ E P G +G +++DG + P+ TDD K WQ+ RT TE E Sbjct: 47 DGNLKDTLLS-EDFPGGRFYEIGAITEDGVEFNPKFSTDDTKIWQSRRAQRTDVTEDDEE 105 Query: 85 ISFQLIESKKEVIELFWQ---SKVTAGSDSGSFDISPGATTGVHALLMDI-VDGD----Q 136 + F E+ + L + V + +G P T V+ ++ I VDG + Sbjct: 106 VMFTAAENTPLIDYLRYNLPLENVPSVGTAGYKATKPNYTDMVYRQIVVIGVDGRMDEAE 165 Query: 137 VIRYYFPEVELVDRDEIKGKNGEVYGYGVTLKAYP 171 + P V L + K E+ G +T YP Sbjct: 166 YVAEVRPRVSLTKVGKQSFKAKEIDGTELTFGVYP 200 >gi|6399|lcl|protein:vir:98437 Length: 184 # NCBI annotation: ORFp27 # Family: family:all:698 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958286;genbank:gi:41057260;uniprot:Q38601 ;genbank:GeneID:2732821 Length = 184 Score = 33.1 bits (74), Expect = 0.003, Method: Compositional matrix adjust. 
Identities = 33/129 (25%), Positives = 55/129 (42%), Gaps = 5/129 (3%) Query: 46 LGYLSDDGFKIKPERKTDDLKAWQNADVVRTVATESSIEISFQLIESKKEVIELF--WQS 103 +G LS+DG ++ + D AW +VRT ++ +I +E V L S Sbjct: 40 VGLLSEDGASESRDQDSTDFYAWGGV-LVRTAKSKHKRQIVVTCLEENLVVFGLVNPGSS 98 Query: 104 KVTAGSDSGSFDISPGATTGVHALLMDIVDGDQVIRYYFPEVELVDRDEIKGKNGEVYGY 163 VTA + P A A ++++ DG R P+ E+ E+ + + Y Sbjct: 99 AVTATGVTTRTVKVPKADP--RAFVLELRDGAVKKRRVIPKGEVESVGEVTLSDSALTAY 156 Query: 164 GVTLKAYPA 172 +T+ YPA Sbjct: 157 ELTITIYPA 165 >gi|13279|lcl|protein:vir:81256 Length: 314 # NCBI annotation: gp12, major tail protein # Family: family:all:698 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456742;genbank:gi:157168385;uniprot:Q9 MBJ3;genbank:GeneID:5580379 Length = 314 Score = 32.7 bits (73), Expect = 0.003, Method: Compositional matrix adjust. Identities = 13/41 (31%), Positives = 24/41 (58%) Query: 129 MDIVDGDQVIRYYFPEVELVDRDEIKGKNGEVYGYGVTLKA 169 +DI++G + RY P +++R I E+ GY +T++A Sbjct: 139 IDILEGTKHRRYLLPAASVIERGAITHTKTEMTGYDLTIRA 179 Score = 21.9 bits (45), Expect = 5.9, Method: Compositional matrix adjust. Identities = 12/16 (75%), Positives = 12/16 (75%) Query: 26 DAHAGDTIKGVEAVPS 41 DA AG TIK V AVPS Sbjct: 280 DAVAGLTIKKVPAVPS 295 >gi|14015|lcl|protein:vir:8194 Length: 197 # NCBI annotation: gp14 # Family: family:all:698 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817987;genbank:gi:29566421;genbank:GeneID :2700975 Length = 197 Score = 26.9 bits (58), Expect = 0.20, Method: Compositional matrix adjust. 
Identities = 30/138 (21%), Positives = 57/138 (41%), Gaps = 6/138 (4%) Query: 41 SGLTALGYLSDDGFKIKPERKTDDLKAWQNADVVRTVATESSIEISFQLIESKKEVIE-L 99 + L G++ DDGF +R K + ++T ++ ES V++ + Sbjct: 38 AALEPHGWMGDDGFVNNIQRDVTKHKDFAGT-TIKTTQDNYEETVAVTCCESNPVVLKTV 96 Query: 100 FWQSKVTAGSDSGSFDIS---PGATTGVHALLMDIVDGDQVIRYYFPEVELVDRDEIKGK 156 F S V G I+ A + ++ +VDG + PE ++ + E+ Sbjct: 97 FGDSNVDVDFTDGHRKITIRHDEAPLPRKSFVVRVVDGVKTRMLVIPEGQVTEIGEVTWL 156 Query: 157 NGEVYGYGVTLKAY-PAQ 173 + E+ Y +T+ Y PA+ Sbjct: 157 SSELVQYTLTIDCYKPAK 174 >gi|11001|lcl|protein:vir:78275 Length: 282 # NCBI annotation: Putative major tail protein # Family: family:all:2431 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491672;genbank:gi:157786496;genbank:Ge neID:5625753 Length = 282 Score = 26.6 bits (57), Expect = 0.28, Method: Compositional matrix adjust. Identities = 20/87 (22%), Positives = 37/87 (42%), Gaps = 2/87 (2%) Query: 67 AWQNADVVRTVATESSIEISFQLIESKKEVIELFWQSKVTAGSDSGSFDISPGATTGVHA 126 +WQ + E + + L + + +EL++ +A G F + G+ A Sbjct: 73 SWQKKKLREVETEEIADYVVINLTQFDESALELYFGPNQSA--TPGIFGVKSGSVVNERA 130 Query: 127 LLMDIVDGDQVIRYYFPEVELVDRDEI 153 LL+ IVD D + ++ + L D I Sbjct: 131 LLIVIVDNDVRLGFHARKASLKREDAI 157 >gi|11245|lcl|protein:vir:78506 Length: 282 # NCBI annotation: gp20 # Family: family:all:2431 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491591;genbank:gi:157786414;genbank:Ge neID:5625658 Length = 282 Score = 26.2 bits (56), Expect = 0.31, Method: Compositional matrix adjust. 
Identities = 20/88 (22%), Positives = 37/88 (42%), Gaps = 2/88 (2%) Query: 67 AWQNADVVRTVATESSIEISFQLIESKKEVIELFWQSKVTAGSDSGSFDISPGATTGVHA 126 +WQ + E + + L + + +EL++ +A G F + G+ A Sbjct: 73 SWQKKKLREVETEEIADYVVINLTQFDETALELYFGPNQSA--TPGIFGVKSGSVVNERA 130 Query: 127 LLMDIVDGDQVIRYYFPEVELVDRDEIK 154 LL+ IVD D + ++ + L D I Sbjct: 131 LLIVIVDNDVRLGFHARKASLKREDAIS 158 >gi|19796|lcl|protein:vir:2349 Length: 283 # NCBI annotation: gp19 # Family: family:all:2431 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075286;genbank:gi:12657873;genbank:GeneID :920104 Length = 283 Score = 26.2 bits (56), Expect = 0.34, Method: Compositional matrix adjust. Identities = 20/88 (22%), Positives = 37/88 (42%), Gaps = 2/88 (2%) Query: 67 AWQNADVVRTVATESSIEISFQLIESKKEVIELFWQSKVTAGSDSGSFDISPGATTGVHA 126 +WQ + E + + L + + +EL++ +A G F + G+ A Sbjct: 73 SWQKKKLREVETEEIADYVVINLTQFDETALELYFGPNQSA--TPGIFGVKSGSVVNERA 130 Query: 127 LLMDIVDGDQVIRYYFPEVELVDRDEIK 154 LL+ IVD D + ++ + L D I Sbjct: 131 LLIVIVDNDVRLGFHARKASLKREDAIS 158 >gi|7515|lcl|protein:vir:99926 Length: 204 # NCBI annotation: gp13 # Family: family:all:698 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655530;genbank:gi:109392300;genbank:GeneI D:4157095 Length = 204 Score = 25.8 bits (55), Expect = 0.45, Method: Compositional matrix adjust. 
Identities = 27/133 (20%), Positives = 54/133 (40%), Gaps = 8/133 (6%) Query: 45 ALGYLSDDGFKIKPERKTDDLKAWQNADVVRTVATESSIEISFQLIESKKEVIE--LFWQ 102 +LGY+S DG I + T ++ W + + + ++ +IE S L + + +F Sbjct: 53 SLGYVSSDGVTISIDGSTTPIEVW-SGERIGSLRDAFAIEYSMSLYQVLSPHVNAVIFGD 111 Query: 103 SKVTAGSDSGSFDISPGATTG-----VHALLMDIVDGDQVIRYYFPEVELVDRDEIKGKN 157 VT + + + +L++D D+ IR V++ D D+I + Sbjct: 112 GSVTTAAATAEHGNRMKVAISSRMPKMASLVLDAFFEDKAIRQVAELVQMSDIDDITLVH 171 Query: 158 GEVYGYGVTLKAY 170 E + T + Sbjct: 172 NEPMAFTPTFSVF 184 >gi|14224|lcl|protein:vir:8332 Length: 271 # NCBI annotation: gp49 # Family: family:all:1912 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817900;genbank:gi:29566333;genbank:GeneID :1259528 Length = 271 Score = 24.3 bits (51), Expect = 1.2, Method: Compositional matrix adjust. Identities = 19/80 (23%), Positives = 36/80 (45%), Gaps = 1/80 (1%) Query: 42 GLTALGYLSDDGFKIK-PERKTDDLKAWQNADVVRTVATESSIEISFQLIESKKEVIELF 100 G +G L++DG + P+ D+ Q+ + T S+ I+F +E+ K +++ Sbjct: 88 GFHLIGALTEDGGPERAPDISNDNQMILQSNMPFDSDLTSESLSINFTGVETVKPLMKRL 147 Query: 101 WQSKVTAGSDSGSFDISPGA 120 + + SD S PG Sbjct: 148 RMNLALSDSDGNSIVEDPGT 167 >gi|14686|lcl|protein:vir:2509 Length: 204 # NCBI annotation: major tail subunit gp14 # Family: family:all:698 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569750;genbank:gi:18496900;genbank:GeneI D:932335 Length = 204 Score = 23.9 bits (50), Expect = 1.6, Method: Compositional matrix adjust. 
Identities = 12/46 (26%), Positives = 26/46 (56%), Gaps = 1/46 (2%) Query: 46 LGYLSDDGFKIKPERKTDDLKAWQNADVVRTVATESSIEISFQLIE 91 LG++S +G +K + +T ++ W D + + + +IE S +L + Sbjct: 55 LGFISVEGVTVKIDDQTKPIEVW-GGDEIGALRDKFAIEYSMKLFQ 99 >gi|7042|lcl|protein:vir:98644 Length: 576 # NCBI annotation: putative terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039919;genbank:gi:126011094;genbank:Ge neID:4818480 Length = 576 Score = 23.1 bits (48), Expect = 2.8, Method: Compositional matrix adjust. Identities = 22/91 (24%), Positives = 37/91 (40%), Gaps = 2/91 (2%) Query: 10 KASNVRSAVTG-DVYIGDAHAGDTIKGVEAVPSGLTALGYLSDDGFKIKPERKTDDLKAW 68 K N R G D Y+ + + V SG + KI E + DD + W Sbjct: 211 KVKNAREFYIGTDGYVREGFIDSMKDKAKKVLSGAARWNSMFPFICKIDEEHEVDDKEKW 270 Query: 69 QNAD-VVRTVATESSIEISFQLIESKKEVIE 98 Q A+ + ++ + E+ + E +E+IE Sbjct: 271 QKANPMFHQPMSDYAQELFDMVCEQYEEMIE 301 >gi|1944|lcl|protein:vir:99839 Length: 250 # NCBI annotation: hypothetical protein # Family: family:all:616 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164080;genbank:gi:56692612;genbank:GeneID :3192567 Length = 250 Score = 22.7 bits (47), Expect = 3.7, Method: Compositional matrix adjust. Identities = 10/17 (58%), Positives = 12/17 (70%) Query: 18 VTGDVYIGDAHAGDTIK 34 VTG+V GD AGD +K Sbjct: 94 VTGEVLEGDLKAGDFVK 110 >gi|18282|lcl|protein:vir:7995 Length: 269 # NCBI annotation: gp11 # Family: family:all:1912 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817349;genbank:gi:29565777;genbank:GeneID :1259036 Length = 269 Score = 22.3 bits (46), Expect = 4.6, Method: Compositional matrix adjust. 
Identities = 13/48 (27%), Positives = 23/48 (47%) Query: 51 DDGFKIKPERKTDDLKAWQNADVVRTVATESSIEISFQLIESKKEVIE 98 D G + +P+ +DDL Q+ V + TE S + F + + +I Sbjct: 97 DGGAEREPDVTSDDLMVLQSKFPVDSEVTEKSYSVRFVALGTADPLIH 144 >gi|8449|lcl|protein:vir:105827 Length: 269 # NCBI annotation: gp11 # Family: family:all:1912 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655772;genbank:gi:109522095;genbank:GeneI D:4157635 Length = 269 Score = 22.3 bits (46), Expect = 4.8, Method: Compositional matrix adjust. Identities = 13/48 (27%), Positives = 23/48 (47%) Query: 51 DDGFKIKPERKTDDLKAWQNADVVRTVATESSIEISFQLIESKKEVIE 98 D G + +P+ +DDL Q+ V + TE S + F + + +I Sbjct: 97 DGGAEREPDVTSDDLMVLQSKFPVDSEVTEKSYSVRFVALGTADPLIH 144 >gi|9704|lcl|protein:vir:102610 Length: 269 # NCBI annotation: gp11 # Family: family:all:1912 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655007;genbank:gi:109392197;genbank:GeneI D:4157232 Length = 269 Score = 22.3 bits (46), Expect = 4.8, Method: Compositional matrix adjust. Identities = 13/48 (27%), Positives = 23/48 (47%) Query: 51 DDGFKIKPERKTDDLKAWQNADVVRTVATESSIEISFQLIESKKEVIE 98 D G + +P+ +DDL Q+ V + TE S + F + + +I Sbjct: 97 DGGAEREPDVTSDDLMVLQSKFPVDSEVTEKSYSVRFVALGTADPLIH 144 >gi|19024|lcl|protein:vir:9640 Length: 576 # NCBI annotation: large terminase subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795402;genbank:gi:28876175;genbank:GeneID :1257725 Length = 576 Score = 21.9 bits (45), Expect = 6.7, Method: Compositional matrix adjust. 
Identities = 21/91 (23%), Positives = 37/91 (40%), Gaps = 2/91 (2%) Query: 10 KASNVRSAVTG-DVYIGDAHAGDTIKGVEAVPSGLTALGYLSDDGFKIKPERKTDDLKAW 68 K N R G D Y+ + + V SG + KI E + DD + W Sbjct: 211 KVKNAREFYIGTDGYVREGFIDSMKDKAKKVLSGNARWNSMFPFICKIDEEHEVDDKEKW 270 Query: 69 QNAD-VVRTVATESSIEISFQLIESKKEVIE 98 Q A+ + ++ + E+ + E +E++E Sbjct: 271 QKANPMFHEPMSDYAQELFDMVCEQYEEMVE 301
>gi|19181|lcl|protein:vir:9572 Length: 459 # NCBI annotation: gp38 # Family: family:all:523 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862877;genbank:gi:32469469;genbank:GeneID:1461314 Length = 459 Score = 21.6 bits (44), Expect = 8.1, Method: Compositional matrix adjust. Identities = 6/19 (31%), Positives = 14/19 (73%) Query: 54 FKIKPERKTDDLKAWQNAD 72 + + E++ DD++AW N++ Sbjct: 227 WSVSDEKEIDDVEAWYNSN 245

  Database: capsid_neck_tail
    Posted date:  Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database:  514

Lambda     K      H
   0.312    0.133    0.385

Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 103,553
Number of Sequences: 514
Number of extensions: 4892
Number of successful extensions: 36
Number of sequences better than 100.0: 24
Number of HSP's better than 100.0 without gapping: 23
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 6
Number of HSP's gapped (non-prelim): 25
length of query: 211
length of database: 206,069
effective HSP length: 68
effective length of query: 143
effective length of database: 171,117
effective search space: 24469731
effective search space used: 24469731
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 35 (18.1 bits)