BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_020849.1_cdsid_YP_007674305.1 [gene=PYDG_00098]
[protein=hypothetical protein] [protein_id=YP_007674305.1]
[location=complement(73089..74702)]
         (537 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gi|6267|lcl|protein:vir:95873 Length: 530 # NCBI annotation: gp6...   582   e-168
gi|11995|lcl|protein:vir:79222 Length: 591 # NCBI annotation: hy...    57   4e-10
gi|3935|lcl|protein:vir:103868 Length: 551 # NCBI annotation: hy...    48   3e-07
gi|9169|lcl|protein:vir:99234 Length: 550 # NCBI annotation: hyp...    47   4e-07
gi|6559|lcl|protein:vir:104336 Length: 523 # NCBI annotation: pu...    41   4e-05
gi|7770|lcl|protein:vir:96410 Length: 536 # NCBI annotation: hyp...    40   7e-05
gi|8062|lcl|protein:vir:103374 Length: 536 # NCBI annotation: pu...    40   7e-05
gi|12711|lcl|protein:vir:80169 Length: 565 # NCBI annotation: te...    31   0.042
gi|17058|lcl|protein:vir:5248 Length: 487 # NCBI annotation: put...    29   0.12
gi|2538|lcl|protein:vir:94056 Length: 483 # NCBI annotation: put...    27   0.47
gi|12378|lcl|protein:vir:79648 Length: 523 # NCBI annotation: Te...    27   0.57
gi|3763|lcl|protein:vir:107694 Length: 527 # NCBI annotation: pu...    27   0.62
gi|3336|lcl|protein:vir:94505 Length: 199 # NCBI annotation: maj...    26   0.89
gi|17128|lcl|protein:vir:1379 Length: 581 # NCBI annotation: put...    26   1.3
gi|12543|lcl|protein:vir:80106 Length: 486 # NCBI annotation: Te...    26   1.5
gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: te...    26   1.5
gi|13199|lcl|protein:vir:81178 Length: 567 # NCBI annotation: pu...    26   1.5
gi|8220|lcl|protein:vir:101493 Length: 588 # NCBI annotation: gp...    25   2.0
gi|9070|lcl|protein:vir:102238 Length: 588 # NCBI annotation: gp...    25   2.0
gi|20246|lcl|protein:vir:106918 Length: 204 # NCBI annotation: t...    25   2.6
gi|8690|lcl|protein:vir:102145 Length: 581 # NCBI annotation: ph...    25   3.3
gi|19292|lcl|protein:vir:4781 Length: 436 # NCBI annotation: put...    23   6.2
gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: ter...    23   7.3

>gi|6267|lcl|protein:vir:95873 Length: 530 # NCBI annotation: gp68 #
 Family: family:all:543 # MgeID: mge:1586 # MgeName: N4 #
 Cross-refs: genbank:acc:YP_950546;genbank:gi:119952237;genbank:GeneID:5075700
          Length = 530

 Score = 582 bits (1499), Expect = e-168, Method: Compositional matrix adjust.
Identities = 291/534 (54%), Positives = 384/534 (71%), Gaps = 10/534 (1%) Query: 8 DISPKTVEDYLNSANYA--EDISYTPSDFALSFITFIKLVNGSEGEENKTPVLHYKMLDT 65 ++ + ++++L+ +Y+ +Y P++FAL+F FIKLVNG EGE NKTP +H KMLD Sbjct: 3 ELIKQELDEWLDQVDYSVLNTPTYIPTEFALTFANFIKLVNGKEGESNKTPPVHLKMLDK 62 Query: 66 LAKGERRVANMVHRGAAKTTIMAEYMVLYLATYGELPNLGKVDLAIYVSDSIENGVKNLR 125 + + +AN+ RGAAKTT+ EY L+LA +G LP+LGKV+ IYVSDS++NGVK+ R Sbjct: 63 ITSKNQYIANLCFRGAAKTTLFMEYFTLFLAVFGHLPSLGKVEGMIYVSDSMDNGVKSAR 122 Query: 126 KNVEHRWGNSEFMQQYVPKIRFTDTRLEFTNIDGKTFIIKMYGAKTGVRGAKEMGKRPQL 185 KN+E R+ NS F+QQ++PK FTD LEF N +G +KM+GAKTG+RG K GKRP L Sbjct: 123 KNIEFRYNNSPFLQQWIPKATFTDNYLEFVNAEGHRLGVKMFGAKTGLRGTKIFGKRPVL 182 Query: 186 AILDDLFSDEDAKSPTVIENVEATIYKAVTYALHPKNNIIIWSGTPFNAKDPLYKAVESG 245 +LDDL SD+DA+S T +E ++ T+YK V +AL P +I++GTPFN +D L +AVESG Sbjct: 183 CVLDDLVSDDDARSRTSMEAIKDTVYKGVNHALDPTRRKVIFNGTPFNKEDILIEAVESG 242 Query: 246 AWAVNVFPVCEQFPCPREDFRGSWPDRFTYDYVKEQYDIAIKTGKADTFNQELMLRIMSD 305 AW VNV+PVCE+FPC RE+F+G+W DRF+YDY+ +QY +A+KTGK +F QELMLRI S+ Sbjct: 243 AWDVNVWPVCEKFPCTREEFQGAWEDRFSYDYINDQYQMALKTGKLASFYQELMLRISSE 302 Query: 306 EDRLIHDCDISWYKRNTLLSKREQYNFYITTDFATGAEKHNDFSVISVWAYNSNGEWFWV 365 ++RLI D +I WY R LL R YNFYITTDFAT ++ +D+SVISVWAY SNG+WFWV Sbjct: 303 DERLIQDSEIKWYSRTQLLRLRSCYNFYITTDFATSEKQVSDYSVISVWAYGSNGDWFWV 362 Query: 366 DGIVKKQLMDANINDLFRLAQMYRPQQVGIEVSGQQGGFIPWIQDEMMRRNCWFALASEN 425 DGI +Q MD N +DLFRL Q Y+PQQVG+E +GQQGGFI +Q EM+ RN +F AS + Sbjct: 363 DGIACRQTMDKNFDDLFRLVQEYQPQQVGVETTGQQGGFISLLQKEMLNRNVFFNFAS-S 421 Query: 426 NSGKPGIRPVPTQKKIDRFQVVVPWFKMNRVYFPIERKDSPEITQAMDELRLVSKQGFKS 485 G+PGI PV + K+ RF +VVPWFK ++YFP E KDSP +T M ++RL + G K Sbjct: 422 RGGQPGIHPVTS--KLSRFNLVVPWFKAGKMYFPAEMKDSPIMTLFMGQIRLATINGLKG 479 Query: 486 KHDDFSDTISMLSVLTPWKPSE--TPVGTFKDGIWEDDDDDDDMENMGMSSYVV 537 K DD DTISML L PWKP PV T D +W+D DD + +SSY+V Sbjct: 480 K-DDCIDTISMLGYLNPWKPQAGMVPVNTSGDPLWDDGDDTGSVNP--LSSYIV 530 >gi|11995|lcl|protein:vir:79222 Length: 591 # NCBI annotation: hypothetical protein # 
Family: family:all:543 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469154;genbank:gi:157834997;genbank:Ge neID:5648803 Length = 591 Score = 57.4 bits (137), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 68/291 (23%), Positives = 118/291 (40%), Gaps = 53/291 (18%) Query: 153 EFTNIDGKTFIIKMYGAKTGVRGAKEMGKRPQLAILDDLFSDEDAKSPTVIENVEATIYK 212 EF ++ G ++ +GA+ +RG RP+L + DDL +D++AKSPT N + K Sbjct: 184 EFVSLSGVK--LEAFGAEQAIRGTFHGASRPKLLLGDDLITDKEAKSPTERNNRWDWLEK 241 Query: 213 AVTYALHPKNNI-IIWSGTPFNAKDPLYKAVESGAWAVNVFPVCEQFP--------CPR- 262 A+ Y P ++ + GT N DP+ +A + V+ F E FP C Sbjct: 242 AIDYLGPPDGSVKYLGVGTVLNKDDPISRAKRTVGHLVHHFRAIETFPTHMDLWAHCEEV 301 Query: 263 ---------EDF--RGS------WPDRFTYDYVKEQYDIAIKTG---------------- 289 E + RGS P Y +EQ ++ T Sbjct: 302 MLNDDKPVMEQYAERGSVAPDSALPSFQFYQDNREQMELGAVTSWPGVRSLYWLMRQRAK 361 Query: 290 KADTFNQELMLRIMSDEDRLIHDCDISWYKRNTLLSKREQYNFYITTDFATGAEKHNDFS 349 F EL SDED+ + W R+ ++ + D + GA +D S Sbjct: 362 NKAAFATELQGDPRSDEDKTFTNPRF-WVMRSG------RWQMFGACDPSVGASAQSDPS 414 Query: 350 VISVWAYNSNGEWF-WVDGIVKKQLMDANINDLFRLAQMYRPQQVGIEVSG 399 I V +++ + ++ +K+++ +DL + + Y+ + +G E +G Sbjct: 415 AIIVGGWDTEKQVLNVIEAAIKRRVPSKLESDLIKAQREYQMRAIGFENNG 465 >gi|3935|lcl|protein:vir:103868 Length: 551 # NCBI annotation: hypothetical protein # Family: family:all:543 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938233;genbank:gi:38229138;genbank:GeneID :2648183 Length = 551 Score = 47.8 bits (112), Expect = 3e-07, Method: Compositional matrix adjust. 
Identities = 90/458 (19%), Positives = 172/458 (37%), Gaps = 71/458 (15%) Query: 79 RGAAKTTIMAEYMVLYLATYGELPNLGKVDLAIYVSDSIENG---VKNLRKNVEHRWGNS 135 RG AK+T++++ V++ G+ + + D+ E ++ ++ +E + Sbjct: 93 RGNAKSTLVSQIFVIWCVL------TGRKHYPLIIMDAFEQAATMLEAIKAELEFNPRLA 146 Query: 136 EFMQQYVPKIRFTDTRLEFTNIDGKTFIIKMYGAKTGVRGAKEMGKRPQLAILDDLFSDE 195 Q K R T D K ++++G+ +RG + RP L + DDL +DE Sbjct: 147 MDFPQGAGKGRVWQVGTIVTANDAK---VQVFGSGKRMRGLRHGPHRPDLVVGDDLENDE 203 Query: 196 DAKSPTVIENVEATIYKAVTYALHPKNNI-IIWSGTPFNAKDPLYKAVESGAWAVNVFPV 254 + +SP + +E + K V + + +I GT + L + +++ W F Sbjct: 204 NVRSPEQRDKLENWLKKTVLSLGSADDTMDVIIIGTILHYDSVLSRLLKNPLWKRRKFKA 263 Query: 255 CEQFPCPREDFRGSWPDRF---------TYDYVKEQ------------------YDIAIK 287 ++P R D W + ++ +E+ Y + +K Sbjct: 264 IIEWPH-RMDLWEKWEELLLNSDDEGVAALEFYQERAAAMEDGAIICWPDGQPLYKLMVK 322 Query: 288 TGK--ADTFNQELMLRIMSDEDRLIHDCDISWYKRNTLLSKREQYNFYITTDFATGAEKH 345 + F+ E + E+ C W R Q+ FY D + G + Sbjct: 323 RARDGHSAFDSEQQNDPVQGENAPFAACITFWVNRLA------QWMFYGACDPSLGKQGS 376 Query: 346 N-DFSVISVWAYN-SNGEWFWVDGIVKKQLMDANINDLFRLAQMYRPQQVGIEVSGQQGG 403 + D S I V +N G V+ ++K+L D I D+ + + Y G+E Sbjct: 377 SRDPSAILVGGFNRETGVLDVVEAAIRKRLPDKIIEDIIAMQRAYHCLVWGVEAV----Q 432 Query: 404 FIPWIQDEMMRRNCWFALASENNSGKPGIRPVPTQKKIDRFQVVVPWFKMNRVYFPIERK 463 F +++ E+++R+ ++ P P K+ R + + P + Sbjct: 433 FQEFLRTELVKRS------AKAGCPVPARAITPHADKLLRIESLQPHMANGLIRL----- 481 Query: 464 DSPEITQAMDELRLVSKQGFKSKHDDFSDTISMLSVLT 501 P T +LR + HDD D + ML +L Sbjct: 482 -HPSQTVLEQQLRHFP----AADHDDGPDALHMLWMLA 514 >gi|9169|lcl|protein:vir:99234 Length: 550 # NCBI annotation: hypothetical protein # Family: family:all:543 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950450;genbank:gi:119953651;genbank:GeneI D:4643094 Length = 550 Score = 47.4 bits (111), Expect = 4e-07, Method: Compositional matrix adjust. 
Identities = 91/458 (19%), Positives = 170/458 (37%), Gaps = 71/458 (15%) Query: 79 RGAAKTTIMAEYMVLYLATYGELPNLGKVDLAIYVSDSIENG---VKNLRKNVEHRWGNS 135 RG AK+T++++ V++ G+ + + D+ E ++ ++ +E + Sbjct: 93 RGNAKSTLVSQIFVIWCVL------TGRKHYPLIIMDAFEQAATMLEAIKAELEFNPRLA 146 Query: 136 EFMQQYVPKIRFTDTRLEFTNIDGKTFIIKMYGAKTGVRGAKEMGKRPQLAILDDLFSDE 195 Q K R T D K ++++G+ +RG + RP L I DDL +DE Sbjct: 147 MDFPQGAGKGRVWQVGTIVTANDAK---VQVFGSGKRMRGLRHGPHRPDLVIGDDLENDE 203 Query: 196 DAKSPTVIENVEATIYKAVTYALHPKNNI-IIWSGTPFNAKDPLYKAVESGAWAVNVFPV 254 + +SP + +E + K V + + +I GT + L + +++ W F Sbjct: 204 NVRSPEQRDKLENWLKKTVLSLGSADDTMDVIIIGTILHYDSVLSRLLKNPLWKRRKFKA 263 Query: 255 CEQFPCPREDFRGSWPDRF---------TYDYVKEQ------------------YDIAIK 287 ++P R D W + + +E+ Y + +K Sbjct: 264 IIEWPH-RMDLWEKWEELLLNSDDEGAAALAFYQERAAAMEDGAIICWPDGQPLYKLMVK 322 Query: 288 TGK--ADTFNQELMLRIMSDEDRLIHDCDISWYKRNTLLSKREQYNFYITTDFATGAE-K 344 + F+ E + E+ C W R Q+ FY D + G + Sbjct: 323 RARDGHSAFDSEQQNDPVQGENAPFAACITFWVNRLA------QWMFYGACDPSLGKQGS 376 Query: 345 HNDFSVISVWAYN-SNGEWFWVDGIVKKQLMDANINDLFRLAQMYRPQQVGIEVSGQQGG 403 D S I V +N G V+ ++K+L D I D+ + + Y G+E Sbjct: 377 SRDPSAILVGGFNRETGVLDVVEAAIRKRLPDKIIEDIIAMQRAYHCLVWGVEAV----Q 432 Query: 404 FIPWIQDEMMRRNCWFALASENNSGKPGIRPVPTQKKIDRFQVVVPWFKMNRVYFPIERK 463 F +++ E+++R+ ++ P P K+ R + + P + Sbjct: 433 FQEFLRTELVKRS------AKAGCPVPARAITPHADKLLRIESLQPHMANGLIRL----- 481 Query: 464 DSPEITQAMDELRLVSKQGFKSKHDDFSDTISMLSVLT 501 P T +LR + HDD D + ML +L Sbjct: 482 -HPSQTVLEQQLRHFP----AADHDDGPDALHMLWMLA 514 >gi|6559|lcl|protein:vir:104336 Length: 523 # NCBI annotation: putative terminase large subunit # Family: family:all:144 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398965;genbank:gi:81343949;genbank:GeneID :3778868 Length = 523 Score = 40.8 bits (94), Expect = 4e-05, Method: Compositional matrix adjust. 
Identities = 105/499 (21%), Positives = 196/499 (39%), Gaps = 72/499 (14%) Query: 40 TFIKLVNGSEGEENKTPVLHYKMLDTLAKGERR--VANMVHRGAAKTTIMAEYMVLYLAT 97 F ++ G + N +++D + +G+R+ + N V G+AKT + + + +Y Sbjct: 32 CFFQITQGERFKMNWHAKYLCRVIDEILEGKRKDTIIN-VAPGSAKTELFSIHFPVY--- 87 Query: 98 YGELPNLGKV-DLAIYVSDSIENGVKNLRKNVEHRWGNSEFMQQYVPKI-RFTDTRLEFT 155 + + KV +L++ SDS+ VK K V + EF + + D ++ Sbjct: 88 --SMIKIKKVRNLSLSFSDSL---VKRNSKRVRDLIKSKEFQELWPCSFGTCRDDEIQVL 142 Query: 156 NIDGK----TFIIKMYGAKTGVRGAKEMGKRPQLAILDDLFSDEDAKSPTVIENV----E 207 + +GK + M G TG RG +LDD +DA S E V + Sbjct: 143 DENGKVRFESISKAMAGQVTGSRGGYMTDDYSGCIMLDDPLKPDDALSNVRREAVNMLLK 202 Query: 208 ATIYKAVTYALHPKNNIIIWSGTPFNAKDPLYKAVESGAWAVNVFPVCEQFPCPREDFRG 267 TI ++ K II + D + +ESG + F V + ED+ Sbjct: 203 NTIRSRRASSVKGKETPIIAVQQRLHVLDTSH-FMESGQMGIK-FDVVKVPAIVTEDYAD 260 Query: 268 SWPD----------------------RFTYDYVKEQYD--IAIKTGKADTFNQELMLRIM 303 + PD ++Y KE + +A++ TF + + Sbjct: 261 TLPDWIKQQFIDDVLSSPFVERDGVKYYSYFPAKESIEDLMAMRDADPYTFLSQYAQEPV 320 Query: 304 SDEDRLIHDCDISWYKR-NTLLSKREQYNF-YITTDFATGAEKHNDFSVISVWAYNSNGE 361 + LI+ + W++R + +Y++ +IT D A + ++DFSV+ +W Y + + Sbjct: 321 ALGGNLIN---VDWFQRLSDTFRPPAKYDYRFITCDTAMTTKSYSDFSVLQLWGYK-DAK 376 Query: 362 WFWVD---GIVKKQLMDANINDLFRLAQMYRPQQVGIEVSGQQGGFIPWIQDEMMRRNCW 418 + +D G + ++A + D + ++ + Q G + I E Sbjct: 377 IYLIDQRRGKWEAPELEAELLDFEKKSRS----------TSQSDGILRKIIIEKKASGIG 426 Query: 419 FALASENNSGKPGIRP-VPTQKKIDRFQVVVPWFKMNRVYFPIERKDSPEITQAMDELRL 477 + S + I P VP K+ R +P K V P + +P ++ + E+ Sbjct: 427 L-IQSAGRVMRTPIEPYVPDNDKLTRVMSALPQIKAGNVVLP---ESAPWLSGLLTEIAA 482 Query: 478 VSKQGFKSKHDDFSDTISM 496 + KHDD D ++M Sbjct: 483 FTADD-SHKHDDQIDCLTM 500 >gi|7770|lcl|protein:vir:96410 Length: 536 # NCBI annotation: hypothetical protein # Family: family:all:543 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218809;genbank:gi:147917326;genbank:Ge neID:5142613 Length = 536 Score = 40.0 bits (92), Expect = 7e-05, Method: Compositional matrix adjust. 
Identities = 68/272 (25%), Positives = 107/272 (39%), Gaps = 45/272 (16%) Query: 157 IDGKTFIIKMYGAKTGVRGAKEMGKRPQLAILDDLFSDEDAKS----PTVIENVEATIYK 212 I+GK I+ GA T VRG E KRP L + DD+ + E A S ++E AT+ K Sbjct: 180 INGKVVILLPAGAGTAVRGTNEDHKRPDLIVCDDVQTRECALSEVQNAALLEWFTATLVK 239 Query: 213 AVTYALHPKNNIIIWSGTPFNA--------KDPLYKAVESGAWAVNVFPVCEQFPCPRED 264 + + N II+ G + K+P + ++ +GA ED Sbjct: 240 CIDN--YGSNRRIIYLGNMYPGDCILQMLRKNPEWISLVTGAIL--------------ED 283 Query: 265 FRGSWPDRFTYDYVKEQY--DIAIKTGK---ADTFNQELMLRIMSDEDRLIHDCDISWYK 319 WP+ + +Y D A+ G A+ N L I D+ I D W Sbjct: 284 GESLWPELKPVSVLIREYVHDEALGLGHIWFAEVQNDPLD-SIFKLLDKSIPDIPFDW-- 340 Query: 320 RNTLLSKREQYNFYITTDFATGAEKHNDFSVISVWA-YNSNGEWFWVDGIVKKQLMDANI 378 E +IT D A G K +D +V ++ Y+ N + G + + Sbjct: 341 -----ENMEADAAFITVDPA-GFRKKSDHNVATLHKLYDGNPVAVQMQGGIWTP--KETV 392 Query: 379 NDLFRLAQMYRPQQVGIEVSGQQGGFIPWIQD 410 ++ R+A +GIE +G Q W+ + Sbjct: 393 YNVIRMALDNAVCIIGIESTGYQQSLCYWMNE 424 >gi|8062|lcl|protein:vir:103374 Length: 536 # NCBI annotation: putative head assembly cofactor # Family: family:all:543 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024735;genbank:gi:48697077;genbank:GeneID :2846042 Length = 536 Score = 40.0 bits (92), Expect = 7e-05, Method: Compositional matrix adjust. 
Identities = 68/272 (25%), Positives = 107/272 (39%), Gaps = 45/272 (16%) Query: 157 IDGKTFIIKMYGAKTGVRGAKEMGKRPQLAILDDLFSDEDAKS----PTVIENVEATIYK 212 I+GK I+ GA T VRG E KRP L + DD+ + E A S ++E AT+ K Sbjct: 180 INGKVVILLPAGAGTAVRGTNEDHKRPDLIVCDDVQTRECALSEVQNAALLEWFTATLVK 239 Query: 213 AVTYALHPKNNIIIWSGTPFNA--------KDPLYKAVESGAWAVNVFPVCEQFPCPRED 264 + + N II+ G + K+P + ++ +GA ED Sbjct: 240 CIDN--YGSNRRIIYLGNMYPGDCILQMLRKNPEWISLVTGAIL--------------ED 283 Query: 265 FRGSWPDRFTYDYVKEQY--DIAIKTGK---ADTFNQELMLRIMSDEDRLIHDCDISWYK 319 WP+ + +Y D A+ G A+ N L I D+ I D W Sbjct: 284 GESLWPELKPVSVLIREYVHDEALGLGHIWFAEVQNDPLD-SIFKLLDKPIPDVPFDW-- 340 Query: 320 RNTLLSKREQYNFYITTDFATGAEKHNDFSVISVWA-YNSNGEWFWVDGIVKKQLMDANI 378 E +IT D A G K +D +V ++ Y+ N + G + + Sbjct: 341 -----ENMEADAAFITVDPA-GFRKKSDHNVATLHKLYDGNPVAVQMQGGIWTP--KETV 392 Query: 379 NDLFRLAQMYRPQQVGIEVSGQQGGFIPWIQD 410 ++ R+A +GIE +G Q W+ + Sbjct: 393 YNVIRMALDNAVCIIGIESTGYQQSLCYWMNE 424 >gi|12711|lcl|protein:vir:80169 Length: 565 # NCBI annotation: terminase # Family: family:all:543 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285801;genbank:gi:148747835;genbank:Ge neID:5220445 Length = 565 Score = 30.8 bits (68), Expect = 0.042, Method: Compositional matrix adjust. 
Identities = 52/256 (20%), Positives = 98/256 (38%), Gaps = 39/256 (15%) Query: 269 WPDRFTYDYVKEQYDIAIKTGKADTFNQELMLRIMSDEDRLIHDCDISWYKRNTL----- 323 W +RF + V+ +I + F + + RI++ +++L+ ++ ++ ++ Sbjct: 315 WEERFDAEVVE---NIKRRLNSFRRFASQYLNRIVTADEQLLPQENVQYFHPASVDVSDD 371 Query: 324 ---LSKREQYNFYI----TTDFATGAEKHNDFSVISVWAYNSNGEWFWVDGIVKKQLMDA 376 R+ Y + D A +K D +V++V Y+++ + D K Sbjct: 372 GFVSINRDGYKVRVKPMLVVDPAVSQKKTADNTVLTVGGYDNDKNLYIFDVKAGKFTPSE 431 Query: 377 NINDLFRLAQMYRPQQVGIEVSGQQGGFIPWIQDEMMRRNCWFALASENNSGKP-GIRPV 435 I +F LA Y+ V +E G ++D + +P IR Sbjct: 432 TIKHIFTLADKYKLNAVTLETVGGFALLSYQVKDAF-------------KTHRPLAIREY 478 Query: 436 -PTQKKIDRFQVVV-PWFKMNRVYFPIERKDSPEITQAMDELRLVSKQGFKSKHDDFSDT 493 P K R ++ P + +Y PE+ +D L SKHDD DT Sbjct: 479 RPKGDKQGRITAMLEPHWTNKSIYMQSYLAIMPELKDELDSFPL-------SKHDDVVDT 531 Query: 494 ISMLSVL-TPWKPSET 508 +++ L TP + T Sbjct: 532 FAIICELSTPTRKEGT 547 >gi|17058|lcl|protein:vir:5248 Length: 487 # NCBI annotation: putative terminase large subunit TerL # Family: family:all:144 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852753;genbank:gi:31544028;interpro:IPR00 4921;interpro:IPR006517;uniprot:Q7Y5U7;genbank:GeneID:27 53564 Length = 487 Score = 29.3 bits (64), Expect = 0.12, Method: Compositional matrix adjust. 
Identities = 45/195 (23%), Positives = 83/195 (42%), Gaps = 19/195 (9%) Query: 185 LAILDDLFSD-EDAKSPTVIENVEATIYKAVTYALHPKNNIII----WSGTPFNAKDPLY 239 +AI+DD D ++A S TV +++ + L PK+ +++ W + L Sbjct: 176 IAIIDDPVKDAKEANSQTVRDSIWDWYTTTLYTRLSPKSGVLLGMTRWHEDDLAGR--LI 233 Query: 240 KAVESGA--WAVNVFP-VCEQFPCPREDFRGSWPDRFTYDYVKEQYDIAIKTGKADTFNQ 296 K E+G W + FP + E+ R++ P+RF + + + I G + +N Sbjct: 234 KEAENGGDQWRIVKFPAIAEEDEEFRKEGEPLHPERFDLERLNK---IRQAVG-SQAWNA 289 Query: 297 ELMLRIMSDEDRLIHDCDISWYKRNTLLSKREQYNFYITTDFATGAEKHNDFSVISVWAY 356 R + +I YK ++ + Y D A ++HND+SV V Sbjct: 290 LYQQRPSNKGGGIIKGSWFGRYKVPPIIKVKAIY-----ADTAQKTKQHNDYSVFIVAGK 344 Query: 357 NSNGEWFWVDGIVKK 371 ++G+ + +D I K Sbjct: 345 GADGKAYILDLIRGK 359 >gi|2538|lcl|protein:vir:94056 Length: 483 # NCBI annotation: putative terminase large subunit # Family: family:all:144 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453630;genbank:gi:84662666;genbank:GeneID :5142566 Length = 483 Score = 27.3 bits (59), Expect = 0.47, Method: Compositional matrix adjust. 
Identities = 44/196 (22%), Positives = 77/196 (39%), Gaps = 26/196 (13%) Query: 303 MSDEDRLIHDCDISWYKRNTLLSKREQYNFYITTDFATGAEKHNDFSVISVWAYNSNGEW 362 MS+ + + +Y++ L + Q +T D + ++ +DF I VWA S+ Sbjct: 285 MSEMGAIFGKGGVRYYRQGELPTAFAQ--VIMTVDASFKGKETSDFCAIGVWAKTSDNR- 341 Query: 363 FWVDGIVKKQL-MDANINDLFRLAQMYRPQQVGIEVSGQQGGFIPWIQDEMMRRNCWFAL 421 W+ + +++L A + L Y PQ I + G P + EM+ R+ Sbjct: 342 VWLLAMRREKLAFTATAQAIVDLKAAY-PQCTRIYIEDAANG--PALI-EMLSRHVQ--- 394 Query: 422 ASENNSGKPGIRPVPT-QKKIDRFQVVVPWFKMNRVYFPIERKDSPEITQAMDELRLVSK 480 GI VP K R+ V ++ +V P D P I + E+ Sbjct: 395 ---------GIVGVPALGSKESRWHAVAGVWQSGQVMLP-HPDDVPSIVPVVAEIVAAP- 443 Query: 481 QGFKSKHDDFSDTISM 496 ++DD D ++M Sbjct: 444 ---DVRNDDAVDCMAM 456 >gi|12378|lcl|protein:vir:79648 Length: 523 # NCBI annotation: TerL # Family: family:all:144 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285519;genbank:gi:148734502;genbank:Ge neID:5220006 Length = 523 Score = 26.9 bits (58), Expect = 0.57, Method: Compositional matrix adjust. Identities = 15/64 (23%), Positives = 34/64 (53%), Gaps = 7/64 (10%) Query: 314 DISWYK-----RNTLLSKREQYNF-YITTDFATGAEKHNDFSVISVWAYNSNGEWFWVDG 367 ++ W++ + + K +++++ +IT D A + ND+SV+ W G +++DG Sbjct: 322 NVGWFQYYGTGEKSTMPKPDRFDYTFITADTAQKEGELNDYSVLCYWGM-FKGRIYFIDG 380 Query: 368 IVKK 371 + K Sbjct: 381 VRGK 384 >gi|3763|lcl|protein:vir:107694 Length: 527 # NCBI annotation: putative terminase large subunit # Family: family:all:144 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003892;genbank:gi:45686309;genbank:GeneID :2773034 Length = 527 Score = 26.9 bits (58), Expect = 0.62, Method: Compositional matrix adjust. 
Identities = 13/39 (33%), Positives = 22/39 (56%), Gaps = 1/39 (2%) Query: 333 YITTDFATGAEKHNDFSVISVWAYNSNGEWFWVDGIVKK 371 +IT D A + ND++V +W N + +++DGI K Sbjct: 351 FITADTAQKTGELNDYTVFCLWG-KKNDKVYFIDGIRGK 388 >gi|3336|lcl|protein:vir:94505 Length: 199 # NCBI annotation: major tail protein # Family: family:all:1095 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223895;genbank:gi:62327107;genbank:GeneID :5075521 Length = 199 Score = 26.2 bits (56), Expect = 0.89, Method: Compositional matrix adjust. Identities = 22/88 (25%), Positives = 38/88 (43%), Gaps = 4/88 (4%) Query: 405 IPWIQDEMMR-RNCWFALASENNS-GKPGIRPVPTQKKIDRFQ--VVVPWFKMNRVYFPI 460 + +++D R ++ W+ L S + G P I P + + + KM R+ P Sbjct: 8 VKFVKDTPYRGKDVWYFLQSVDAPVGDPAILPAHQESGDTSIEGDSLDEQTKMGRIVAPS 67 Query: 461 ERKDSPEITQAMDELRLVSKQGFKSKHD 488 +DS E+T M + K+KHD Sbjct: 68 TNEDSIEVTSYMVPGDEATDAIIKAKHD 95 >gi|17128|lcl|protein:vir:1379 Length: 581 # NCBI annotation: putative terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612831;genbank:gi:20065965;genbank:GeneID :935781 Length = 581 Score = 25.8 bits (55), Expect = 1.3, Method: Compositional matrix adjust. Identities = 20/70 (28%), Positives = 33/70 (47%), Gaps = 4/70 (5%) Query: 136 EFMQQYVPKIRFTDTRLEFTNIDGKTFIIKMYGAKTGVRGAKEMGKRPQLAILDDLFSDE 195 E M K RF T+ + +FI K K G G GK PQ+A++D+ + Sbjct: 156 ELMASKPLKKRFKFTQKVIKHKKSNSFI-KHLSKKAGKTGD---GKNPQMAVIDEYHAHP 211 Query: 196 DAKSPTVIEN 205 ++K V+++ Sbjct: 212 NSKMYDVMKS 221 >gi|12543|lcl|protein:vir:80106 Length: 486 # NCBI annotation: TerL # Family: family:all:144 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468706;genbank:gi:157325286;genbank:Ge neID:5601797 Length = 486 Score = 25.8 bits (55), Expect = 1.5, Method: Compositional matrix adjust. 
Identities = 39/165 (23%), Positives = 65/165 (39%), Gaps = 25/165 (15%) Query: 337 DFATGAEKHNDFSVISVWAYNSNGEWFWVDGIVKKQLMDANINDLFRLAQMYRPQQVG-- 394 D A K +DF VW ++F++D I + + +N + RL + P + Sbjct: 328 DMAFKDTKKSDFVAGHVWN-RKKADFFFIDRIHDRMGLPETLNAVRRLT-IKHPLAIAKY 385 Query: 395 IEVSGQQGGFIPWIQDEMMRRNCWFALASENNSGKPGIRPVPTQKKIDRFQVVVPWFKMN 454 IE + ++ E+ +G G+ P K R V P F+ Sbjct: 386 IEEKANGPAVMQTLKGEI--------------TGMIGVEP--EGGKETRAYAVTPLFESG 429 Query: 455 RVYFPIERKDSPEITQAMDELRLVSKQGFKSKHDDFSDTISMLSV 499 VYFP +P I+ ++E+ L G +HDD D ++ V Sbjct: 430 NVYFP-HPLYAPWISDVIEEM-LAFPNG---EHDDDVDAMTQALV 469 >gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: terminase large subunit # Family: family:all:4877 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429607;genbank:gi:156564098;genbank:Ge neID:5525534 Length = 635 Score = 25.8 bits (55), Expect = 1.5, Method: Compositional matrix adjust. Identities = 20/73 (27%), Positives = 33/73 (45%), Gaps = 10/73 (13%) Query: 168 GAKTGVRGAKEMGKRPQLAILDDLFSDEDAKSPTVIENVEATIYKAVTYALHPKNNIIIW 227 G+K+G A G+R L +LD++ D + I N+ +A P+ +I Sbjct: 172 GSKSGSGAANTRGQRADLIVLDEM----DYMGESEITNIMNIRNEA------PERIKMIV 221 Query: 228 SGTPFNAKDPLYK 240 + TP +D YK Sbjct: 222 ASTPSGRRDSYYK 234 >gi|13199|lcl|protein:vir:81178 Length: 567 # NCBI annotation: putative terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285808;genbank:gi:148747729;genbank:Ge neID:5247221 Length = 567 Score = 25.8 bits (55), Expect = 1.5, Method: Compositional matrix adjust. 
Identities = 16/62 (25%), Positives = 27/62 (43%), Gaps = 13/62 (20%) Query: 180 GKRPQLAILDDLFSDEDAKSPTVIENVEATIYKAVTYALHPKNN--IIIWSGTPFNAKDP 237 GK P LA++D+ + E ++ IY + + + N I+I + FN P Sbjct: 197 GKNPSLAVIDEYHTHETSE-----------IYDVLVSGMVARQNPLIVIITTAGFNLASP 245 Query: 238 LY 239 Y Sbjct: 246 CY 247 >gi|8220|lcl|protein:vir:101493 Length: 588 # NCBI annotation: gp8 # Family: family:all:147 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655387;genbank:gi:109522575;genbank:GeneI D:4157565 Length = 588 Score = 25.0 bits (53), Expect = 2.0, Method: Compositional matrix adjust. Identities = 27/135 (20%), Positives = 52/135 (38%), Gaps = 24/135 (17%) Query: 292 DTFNQELMLRIMSDEDRLIHDCDISWYKRNTLLSKREQYNFYITTDFATGAEKHNDFSVI 351 ++FNQE+ ++ D D ++ + + +Q+ Y ++ T A ++ Sbjct: 304 ESFNQEIGADFTEYVGKVFKDWDEEYHVADLVDYNVKQFGTYFNPNYETYAAADYGYTNP 363 Query: 352 SVWAYNSNGEWFWVDGIVKKQLMDANINDLFR---LAQMYRP------------------ 390 +VW G+W V+ ++++ M D F QM P Sbjct: 364 NVWLVIQIGKWGEVN-VLREIYMPGLTADAFADEIRRQMCNPPNLRIFYPDPADPMSSET 422 Query: 391 --QQVGIEVSGQQGG 403 Q++GI +G GG Sbjct: 423 LSQKLGIRAAGGTGG 437 >gi|9070|lcl|protein:vir:102238 Length: 588 # NCBI annotation: gp8 # Family: family:all:147 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655204;genbank:gi:109522784;genbank:GeneI D:4157477 Length = 588 Score = 25.0 bits (53), Expect = 2.0, Method: Compositional matrix adjust. 
Identities = 27/135 (20%), Positives = 52/135 (38%), Gaps = 24/135 (17%) Query: 292 DTFNQELMLRIMSDEDRLIHDCDISWYKRNTLLSKREQYNFYITTDFATGAEKHNDFSVI 351 ++FNQE+ ++ D D ++ + + +Q+ Y ++ T A ++ Sbjct: 304 ESFNQEIGADFTEYVGKVFKDWDEEYHVADLVDYNVKQFGTYFNPNYETYAAADYGYTNP 363 Query: 352 SVWAYNSNGEWFWVDGIVKKQLMDANINDLFR---LAQMYRP------------------ 390 +VW G+W V+ ++++ M D F QM P Sbjct: 364 NVWLVIQIGKWGEVN-VLREIYMPGLTADAFADEIRRQMCNPPNLRIFYPDPADPMSSET 422 Query: 391 --QQVGIEVSGQQGG 403 Q++GI +G GG Sbjct: 423 LSQKLGIRAAGGTGG 437 >gi|20246|lcl|protein:vir:106918 Length: 204 # NCBI annotation: tail tube protein gp19 # Family: family:all:1107 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195137;genbank:gi:58532914;goa:Q5GQN5;uni prot:Q5GQN5;genbank:GeneID:3260390 Length = 204 Score = 25.0 bits (53), Expect = 2.6, Method: Compositional matrix adjust. Identities = 21/68 (30%), Positives = 32/68 (47%), Gaps = 5/68 (7%) Query: 51 EENKTPVLHYKMLDTLAKGERRVANMVHRGAAKTTIMAEYMVLYLATYGELPNLGKVDLA 110 E N P L LD+ +V + G+ ++ EY VLY YG N+ ++D+A Sbjct: 105 ETNNAP-LFTPSLDSGYARNLKVKQLEKNGSESGEVLREY-VLY---YGFPTNVSQIDVA 159 Query: 111 IYVSDSIE 118 +D IE Sbjct: 160 YDSNDQIE 167 >gi|8690|lcl|protein:vir:102145 Length: 581 # NCBI annotation: phage terminase, large subunit, putative # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699944;genbank:gi:110804033;genbank:GeneI D:4206689 Length = 581 Score = 24.6 bits (52), Expect = 3.3, Method: Compositional matrix adjust. 
Identities = 15/43 (34%), Positives = 18/43 (41%) Query: 224 IIIWSGTPFNAKDPLYKAVESGAWAVNVFPVCEQFPCPREDFR 266 II P NA L E G V +F C+ P EDF+ Sbjct: 455 IIQLGYDPHNADTFLQDLEELGFDCVEIFQSCKWLNDPTEDFK 497 >gi|19292|lcl|protein:vir:4781 Length: 436 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150161;swissprot:trembl:q94m50;genbank:gi :15088772;goa:Q94M50;uniprot:Q94M50;genbank:GeneID:95597 6 Length = 436 Score = 23.5 bits (49), Expect = 6.2, Method: Compositional matrix adjust. Identities = 12/44 (27%), Positives = 21/44 (47%), Gaps = 3/44 (6%) Query: 223 NIIIWSGTPFNAKDPLYKAVESGAWAVNVFPVCEQFPCPREDFR 266 NI++ +D ++ V WA+N+F + EQF F+ Sbjct: 57 NIVVIRKVANTIRDSVFNKV---WWALNLFGIAEQFTKTVSPFK 97 >gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899601;genbank:gi:34419588;genbank:GeneID :2546011 Length = 600 Score = 23.5 bits (49), Expect = 7.3, Method: Compositional matrix adjust. 
Identities = 21/88 (23%), Positives = 40/88 (45%), Gaps = 5/88 (5%) Query: 41 FIKLVNGSEGEENKTPVLHYKMLDTLAKGERRVANMVHRGAAKTTIMAEYMVLYLATYGE 100 + +V+ G P + K + +A R ++ R KTTIM ++ YL + E Sbjct: 110 YCSIVHIDLGNIKMVPRPYQKEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYL-VFNE 168 Query: 101 LPNLG----KVDLAIYVSDSIENGVKNL 124 Sbjct: 169 DKEAGILAHKGSMSMEVLERVKNVIENL 196

  Database: capsid_neck_tail
    Posted date:  Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database:  514

Lambda     K      H
   0.318    0.135    0.411

Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 259,779
Number of Sequences: 514
Number of extensions: 12667
Number of successful extensions: 55
Number of sequences better than 100.0: 25
Number of HSP's better than 100.0 without gapping: 14
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 36
Number of HSP's gapped (non-prelim): 28
length of query: 537
length of database: 206,069
effective HSP length: 76
effective length of query: 461
effective length of database: 167,005
effective search space: 76989305
effective search space used: 76989305
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 39 (19.6 bits)