BLASTP 2.2.18 [Mar-02-2008] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= lcl|Aclame:protein:vir:105818|NCBI_annot:gp2|genbank:acc:YP_655 763;genbank:gi:109522086;genbank:GeneID:4157626 (545 letters) Database: capsid_neck_tail 514 sequences; 206,069 total letters Searching...................................................done Score E Sequences producing significant alignments: (bits) Value gi|9695|lcl|protein:vir:102601 Length: 545 # NCBI annotation: gp... 1108 0.0 gi|8440|lcl|protein:vir:105818 Length: 545 # NCBI annotation: gp... 1108 0.0 gi|18273|lcl|protein:vir:7986 Length: 545 # NCBI annotation: gp2... 1096 0.0 gi|6388|lcl|protein:vir:98470 Length: 536 # NCBI annotation: hyp... 120 5e-29 gi|15774|lcl|protein:vir:6239 Length: 527 # NCBI annotation: gp3... 78 3e-16 gi|15412|lcl|protein:vir:1325 Length: 519 # NCBI annotation: gp3... 74 6e-15 gi|16473|lcl|protein:vir:7767 Length: 566 # NCBI annotation: gp1... 73 9e-15 gi|19787|lcl|protein:vir:2340 Length: 562 # NCBI annotation: gp1... 70 5e-14 gi|16679|lcl|protein:vir:4222 Length: 593 # NCBI annotation: pre... 63 7e-12 gi|9490|lcl|protein:vir:104081 Length: 594 # NCBI annotation: gp... 62 1e-11 gi|17795|lcl|protein:vir:2426 Length: 595 # NCBI annotation: gp1... 61 3e-11 gi|10991|lcl|protein:vir:78228 Length: 903 # NCBI annotation: Pu... 53 1e-08 gi|11235|lcl|protein:vir:78494 Length: 562 # NCBI annotation: Pu... 52 2e-08 gi|13389|lcl|protein:vir:1263 Length: 416 # NCBI annotation: hyp... 44 6e-06 gi|16408|lcl|protein:vir:1883 Length: 504 # NCBI annotation: ter... 32 0.019 gi|15611|lcl|protein:vir:188 Length: 504 # NCBI annotation: term... 31 0.036 gi|5726|lcl|protein:vir:95379 Length: 573 # NCBI annotation: pha... 27 0.71 gi|12641|lcl|protein:vir:80117 Length: 572 # NCBI annotation: Ph... 27 0.73 gi|967|lcl|protein:vir:6208 Length: 574 # NCBI annotation: Termi... 25 2.0 gi|14824|lcl|protein:vir:4088 Length: 550 # NCBI annotation: ter... 25 2.2 gi|3559|lcl|protein:vir:101786 Length: 556 # NCBI annotation: gp... 25 2.3 gi|15908|lcl|protein:vir:3418 Length: 641 # NCBI annotation: DNA... 23 8.4 >gi|9695|lcl|protein:vir:102601 Length: 545 # NCBI annotation: gp2 # Family: family:all:1551 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654998;genbank:gi:109392188;genbank:GeneI D:4157223 Length = 545 Score = 1108 bits (2865), Expect = 0.0, Method: Compositional matrix adjust. Identities = 545/545 (100%), Positives = 545/545 (100%) Query: 1 MAVLQVPAVDLAFPTLGPQVCDFIEDRMVFGPGSLSGQPARLDDEKRALVYRLYELYPRG 60 MAVLQVPAVDLAFPTLGPQVCDFIEDRMVFGPGSLSGQPARLDDEKRALVYRLYELYPRG Sbjct: 1 MAVLQVPAVDLAFPTLGPQVCDFIEDRMVFGPGSLSGQPARLDDEKRALVYRLYELYPRG 60 Query: 61 HHLAGRRRFERAGVELRKGVAKTEFAAWICGVELHPEAPVRCDGFDAAGNPVGRPVRSPV 120 HHLAGRRRFERAGVELRKGVAKTEFAAWICGVELHPEAPVRCDGFDAAGNPVGRPVRSPV Sbjct: 61 HHLAGRRRFERAGVELRKGVAKTEFAAWICGVELHPEAPVRCDGFDAAGNPVGRPVRSPV 120 Query: 121 IPMMAVTEEQVSELAFGVLKYILENGPDVDLFDISKERIVRLSPSGGEDGFAVAVSNAPG 180 IPMMAVTEEQVSELAFGVLKYILENGPDVDLFDISKERIVRLSPSGGEDGFAVAVSNAPG Sbjct: 121 IPMMAVTEEQVSELAFGVLKYILENGPDVDLFDISKERIVRLSPSGGEDGFAVAVSNAPG 180 Query: 181 SRDGARTTFQHFDEPHRLFMPRHRDAHETMLQNMPKRPMEDPWTLYTSTAGQPGQGSIEE 240 SRDGARTTFQHFDEPHRLFMPRHRDAHETMLQNMPKRPMEDPWTLYTSTAGQPGQGSIEE Sbjct: 181 SRDGARTTFQHFDEPHRLFMPRHRDAHETMLQNMPKRPMEDPWTLYTSTAGQPGQGSIEE 240 Query: 241 DVLAEAESIARGERQDPSLFFFRRWAGDEHDDLSTVEKRVAAVADATGPIGEWGPGQFER 300 DVLAEAESIARGERQDPSLFFFRRWAGDEHDDLSTVEKRVAAVADATGPIGEWGPGQFER Sbjct: 241 DVLAEAESIARGERQDPSLFFFRRWAGDEHDDLSTVEKRVAAVADATGPIGEWGPGQFER 300 Query: 301 IAKDYDRTGIDRAYWERVYLNRWRKSGSQAFDMTRLVQCDETVPDGAFVTAGFDGSRWRD 360 IAKDYDRTGIDRAYWERVYLNRWRKSGSQAFDMTRLVQCDETVPDGAFVTAGFDGSRWRD Sbjct: 301 IAKDYDRTGIDRAYWERVYLNRWRKSGSQAFDMTRLVQCDETVPDGAFVTAGFDGSRWRD 360 Query: 361 ATAVVVTEIATGRQMLLGCWERPENVEEWEVPEHEVTALVVDMMSRFEVWRMYCDPWGWD 420 ATAVVVTEIATGRQMLLGCWERPENVEEWEVPEHEVTALVVDMMSRFEVWRMYCDPWGWD Sbjct: 361 ATAVVVTEIATGRQMLLGCWERPENVEEWEVPEHEVTALVVDMMSRFEVWRMYCDPWGWD 420 Query: 421 STIAAWAGRFPDRVVEWAVGGGGSLRRVAAATQGYADALATGDAALAANVWRPKFVEHMG 480 STIAAWAGRFPDRVVEWAVGGGGSLRRVAAATQGYADALATGDAALAANVWRPKFVEHMG Sbjct: 421 STIAAWAGRFPDRVVEWAVGGGGSLRRVAAATQGYADALATGDAALAANVWRPKFVEHMG 480 Query: 481 HAGRRELKLVDDTGQPLWVMQKQDGRLADKFDAAMAGMLSWEACVDARRDGARPRPKVFA 540 HAGRRELKLVDDTGQPLWVMQKQDGRLADKFDAAMAGMLSWEACVDARRDGARPRPKVFA Sbjct: 481 HAGRRELKLVDDTGQPLWVMQKQDGRLADKFDAAMAGMLSWEACVDARRDGARPRPKVFA 540 Query: 541 PRRIY 545 PRRIY Sbjct: 541 PRRIY 545 >gi|8440|lcl|protein:vir:105818 Length: 545 # NCBI annotation: gp2 # Family: family:all:1551 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655763;genbank:gi:109522086;genbank:GeneI D:4157626 Length = 545 Score = 1108 bits (2865), Expect = 0.0, Method: Compositional matrix adjust. Identities = 545/545 (100%), Positives = 545/545 (100%) Query: 1 MAVLQVPAVDLAFPTLGPQVCDFIEDRMVFGPGSLSGQPARLDDEKRALVYRLYELYPRG 60 MAVLQVPAVDLAFPTLGPQVCDFIEDRMVFGPGSLSGQPARLDDEKRALVYRLYELYPRG Sbjct: 1 MAVLQVPAVDLAFPTLGPQVCDFIEDRMVFGPGSLSGQPARLDDEKRALVYRLYELYPRG 60 Query: 61 HHLAGRRRFERAGVELRKGVAKTEFAAWICGVELHPEAPVRCDGFDAAGNPVGRPVRSPV 120 HHLAGRRRFERAGVELRKGVAKTEFAAWICGVELHPEAPVRCDGFDAAGNPVGRPVRSPV Sbjct: 61 HHLAGRRRFERAGVELRKGVAKTEFAAWICGVELHPEAPVRCDGFDAAGNPVGRPVRSPV 120 Query: 121 IPMMAVTEEQVSELAFGVLKYILENGPDVDLFDISKERIVRLSPSGGEDGFAVAVSNAPG 180 IPMMAVTEEQVSELAFGVLKYILENGPDVDLFDISKERIVRLSPSGGEDGFAVAVSNAPG Sbjct: 121 IPMMAVTEEQVSELAFGVLKYILENGPDVDLFDISKERIVRLSPSGGEDGFAVAVSNAPG 180 Query: 181 SRDGARTTFQHFDEPHRLFMPRHRDAHETMLQNMPKRPMEDPWTLYTSTAGQPGQGSIEE 240 SRDGARTTFQHFDEPHRLFMPRHRDAHETMLQNMPKRPMEDPWTLYTSTAGQPGQGSIEE Sbjct: 181 SRDGARTTFQHFDEPHRLFMPRHRDAHETMLQNMPKRPMEDPWTLYTSTAGQPGQGSIEE 240 Query: 241 DVLAEAESIARGERQDPSLFFFRRWAGDEHDDLSTVEKRVAAVADATGPIGEWGPGQFER 300 DVLAEAESIARGERQDPSLFFFRRWAGDEHDDLSTVEKRVAAVADATGPIGEWGPGQFER Sbjct: 241 DVLAEAESIARGERQDPSLFFFRRWAGDEHDDLSTVEKRVAAVADATGPIGEWGPGQFER 300 Query: 301 IAKDYDRTGIDRAYWERVYLNRWRKSGSQAFDMTRLVQCDETVPDGAFVTAGFDGSRWRD 360 IAKDYDRTGIDRAYWERVYLNRWRKSGSQAFDMTRLVQCDETVPDGAFVTAGFDGSRWRD Sbjct: 301 IAKDYDRTGIDRAYWERVYLNRWRKSGSQAFDMTRLVQCDETVPDGAFVTAGFDGSRWRD 360 Query: 361 ATAVVVTEIATGRQMLLGCWERPENVEEWEVPEHEVTALVVDMMSRFEVWRMYCDPWGWD 420 ATAVVVTEIATGRQMLLGCWERPENVEEWEVPEHEVTALVVDMMSRFEVWRMYCDPWGWD Sbjct: 361 ATAVVVTEIATGRQMLLGCWERPENVEEWEVPEHEVTALVVDMMSRFEVWRMYCDPWGWD 420 Query: 421 STIAAWAGRFPDRVVEWAVGGGGSLRRVAAATQGYADALATGDAALAANVWRPKFVEHMG 480 STIAAWAGRFPDRVVEWAVGGGGSLRRVAAATQGYADALATGDAALAANVWRPKFVEHMG Sbjct: 421 STIAAWAGRFPDRVVEWAVGGGGSLRRVAAATQGYADALATGDAALAANVWRPKFVEHMG 480 Query: 481 HAGRRELKLVDDTGQPLWVMQKQDGRLADKFDAAMAGMLSWEACVDARRDGARPRPKVFA 540 HAGRRELKLVDDTGQPLWVMQKQDGRLADKFDAAMAGMLSWEACVDARRDGARPRPKVFA Sbjct: 481 HAGRRELKLVDDTGQPLWVMQKQDGRLADKFDAAMAGMLSWEACVDARRDGARPRPKVFA 540 Query: 541 PRRIY 545 PRRIY Sbjct: 541 PRRIY 545 >gi|18273|lcl|protein:vir:7986 Length: 545 # NCBI annotation: gp2 # Family: family:all:1551 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817340;genbank:gi:29565768;genbank:GeneID :1259002 Length = 545 Score = 1096 bits (2834), Expect = 0.0, Method: Compositional matrix adjust. Identities = 539/545 (98%), Positives = 539/545 (98%) Query: 1 MAVLQVPAVDLAFPTLGPQVCDFIEDRMVFGPGSLSGQPARLDDEKRALVYRLYELYPRG 60 MAVLQVP VDL FPTLGPQVCDFIEDRMVFGPGSLSGQ ARLDDEKRALVYRLYELYPRG Sbjct: 1 MAVLQVPPVDLTFPTLGPQVCDFIEDRMVFGPGSLSGQAARLDDEKRALVYRLYELYPRG 60 Query: 61 HHLAGRRRFERAGVELRKGVAKTEFAAWICGVELHPEAPVRCDGFDAAGNPVGRPVRSPV 120 H LAGRRRFERAGVELRKGVAKTEFAAWICGVELHPEAPVRCDGFDAAGNPVGRPVRSPV Sbjct: 61 HRLAGRRRFERAGVELRKGVAKTEFAAWICGVELHPEAPVRCDGFDAAGNPVGRPVRSPV 120 Query: 121 IPMMAVTEEQVSELAFGVLKYILENGPDVDLFDISKERIVRLSPSGGEDGFAVAVSNAPG 180 IPMMAVTEEQVSELAFGVLKYILENGPD DLFDISKERIVRLSPSGGEDGFAVAVSNAPG Sbjct: 121 IPMMAVTEEQVSELAFGVLKYILENGPDADLFDISKERIVRLSPSGGEDGFAVAVSNAPG 180 Query: 181 SRDGARTTFQHFDEPHRLFMPRHRDAHETMLQNMPKRPMEDPWTLYTSTAGQPGQGSIEE 240 SRDGARTTFQHFDEPHRLFMPRHRDAHETMLQNMPKRPMEDPWTLYTSTAGQPGQGSIEE Sbjct: 181 SRDGARTTFQHFDEPHRLFMPRHRDAHETMLQNMPKRPMEDPWTLYTSTAGQPGQGSIEE 240 Query: 241 DVLAEAESIARGERQDPSLFFFRRWAGDEHDDLSTVEKRVAAVADATGPIGEWGPGQFER 300 DVLAEAESIARGERQDPSLFFFRRWAGDEHDDLSTVEKRVAAVADATGPIGEWGPGQFER Sbjct: 241 DVLAEAESIARGERQDPSLFFFRRWAGDEHDDLSTVEKRVAAVADATGPIGEWGPGQFER 300 Query: 301 IAKDYDRTGIDRAYWERVYLNRWRKSGSQAFDMTRLVQCDETVPDGAFVTAGFDGSRWRD 360 IAKDYDRTGIDRAYWERVYLNRWRKSGSQAFDMTRLVQCDETVPDGAFVTAGFDGSRWRD Sbjct: 301 IAKDYDRTGIDRAYWERVYLNRWRKSGSQAFDMTRLVQCDETVPDGAFVTAGFDGSRWRD 360 Query: 361 ATAVVVTEIATGRQMLLGCWERPENVEEWEVPEHEVTALVVDMMSRFEVWRMYCDPWGWD 420 ATAVVVTEIATGRQMLLGCWERPENVEEWEVPEHEVTALVVDMMSRFEVWRMYCDPWGWD Sbjct: 361 ATAVVVTEIATGRQMLLGCWERPENVEEWEVPEHEVTALVVDMMSRFEVWRMYCDPWGWD 420 Query: 421 STIAAWAGRFPDRVVEWAVGGGGSLRRVAAATQGYADALATGDAALAANVWRPKFVEHMG 480 STIAAWAGRFPDRVVEWAVGGGGSLRRVAAATQGYADALATGDA LAANVWRPKFVEHMG Sbjct: 421 STIAAWAGRFPDRVVEWAVGGGGSLRRVAAATQGYADALATGDAVLAANVWRPKFVEHMG 480 Query: 481 HAGRRELKLVDDTGQPLWVMQKQDGRLADKFDAAMAGMLSWEACVDARRDGARPRPKVFA 540 HAGRRELKLVDDTGQPLWVMQKQDGRLADKFDAAMAGMLSWEACVDARRDGARPRPKVFA Sbjct: 481 HAGRRELKLVDDTGQPLWVMQKQDGRLADKFDAAMAGMLSWEACVDARRDGARPRPKVFA 540 Query: 541 PRRIY 545 PRRIY Sbjct: 541 PRRIY 545 >gi|6388|lcl|protein:vir:98470 Length: 536 # NCBI annotation: hypothetical protein # Family: family:all:1551 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958275;genbank:gi:41057249;genbank:GeneID :2732854 Length = 536 Score = 120 bits (300), Expect = 5e-29, Method: Compositional matrix adjust. Identities = 147/550 (26%), Positives = 230/550 (41%), Gaps = 92/550 (16%) Query: 32 PGSLSGQPARLDDEKRALVYRLYELYPRGHHLAGRRRFERAGVELRKGVAKTEFAAWICG 91 PG G+P E+ + R YEL+P + GRR R + +G K+ F I Sbjct: 8 PGRDDGEPFIPTQEQAEFLLRFYELHP----VTGRRVIHRGLLSRPRGWGKSPFVGAIAL 63 Query: 92 VELHPEAPVRCDGFDAAGNPVGRP---VRSPVIPMMAVTEEQVSELAFGVLKYILENGPD 148 E A V DG+DA G P+GRP VR+P++ + AVTE Q +L+ Sbjct: 64 AEAC--ADVVADGWDAYGEPIGRPWHSVRTPLVRIAAVTEAQTDNTWIPLLEMARGGSLS 121 Query: 149 VDLFDISKERIVRLSPSGGEDGFAVAVSNAPGSRDGARTTFQHFDEPHRLFMPRHRDAH- 207 D + ++ L G ++++ S G F D+ R+++ Sbjct: 122 TDYGLDVLDTVIYLP-----RGEISPITSSASSVKGDPACFASLDQTEEW-----RESNG 171 Query: 208 -----ETMLQNMPKRPMEDPWTLYTSTAGQPGQGSIEEDVLAEAESIARGERQDPSLFFF 262 +TM N K + T A PG+GS+ E+ A+ ++I G + + Sbjct: 172 GIRLAKTMRFNAAKLGGS---IIETPNAFTPGEGSVAENSAADYQAIIDGRSRARGILVD 228 Query: 263 RRWA-GDEHDDLSTVEKRVAAVADATG--------------PIGE-WGPGQFERIAKDYD 306 R A GD D+S + VA + A G P G W P ER+ ++ Sbjct: 229 HREAPGDT--DMSDEQSLVAGLRYAYGDSSDHPDGCVLHDPPCGPGWSP--IERLTGEFW 284 Query: 307 RTGIDRAYWERVYLNRWRKSGSQAFDMTRL---VQCDETVPDGAFVTAGFDGSRWR---- 359 T D +LN+ + + + V G + GFDGSR R Sbjct: 285 DTSNDPQDLRADFLNQITHASDAWLSQPEVRASSDLGKVVQPGDRIVLGFDGSRKRSRGV 344 Query: 360 -DATAVVVTEIATGRQMLLGCWERPENVE--------EWEVPEHEVTALVVDMMSRFEVW 410 DATA++ ++ G LG WE+P +E EW+VP EV A V + + ++V Sbjct: 345 TDATALIGCRLSDGHLFTLGVWEQPPRLELGPDGRPVEWQVPVVEVLAAVAEAFATYDVV 404 Query: 411 RMYCDPWGWDSTIAAWAGRFPDRV---------VEWAVGGGGSLRRVAAATQGYADALA- 460 MY DP W+S +A W F R+ +EW + GG S + A + + AL Sbjct: 405 GMYADPAKWESHVADWEAAFGPRLQVKVTRNHPIEWWMTGGRST-LIVRALEKFHTALTE 463 Query: 461 ---TGDAALAANVWRPKFVEHMGHAGRRELKLVDDTGQPLWVMQKQDGRLADKFDAAMAG 517 T D + A V H+ ++ RR+ T + +M K++ +K DAA+A Sbjct: 464 CELTHDGSSA-------LVRHLLNSRRRK------TRSGIQIM-KENPDSPNKIDAAVAA 509 Query: 518 MLSWEACVDA 527 +L+W+ +DA Sbjct: 510 VLAWQCRLDA 519 >gi|15774|lcl|protein:vir:6239 Length: 527 # NCBI annotation: gp33 # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813693;swissprot:trembl:q859c4;genbank:gi :29366753;interpro:IPR005021;uniprot:Q859C4;genbank:Gene ID:1258893 Length = 527 Score = 78.2 bits (191), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 117/488 (23%), Positives = 185/488 (37%), Gaps = 54/488 (11%) Query: 5 QVPAVDLAFPTLGPQVCDFIEDRMVFGPGSLSGQPARLDDEKRALVYRLYELYPRGHHLA 64 +PA D FP+ G +V +IE+ + GS +GQP RL +R L+ YEL + Sbjct: 17 HIPA-DAPFPSEGYRVAKWIEE-FCYLTGSFAGQPFRLLPWQRTLLIDAYEL-TQDTFGR 73 Query: 65 GRRRFERAGVELRKGVAKTEFAAWICGVELHPEAPVRCDGFDAAGNPVGRPVRSPVIPMM 124 RR+ V + + K+ AA I L D DA + Sbjct: 74 WRRKHRTVVVCVARKNGKSTIAAAIMLYHLI------ADRGDAQRQVIA----------- 116 Query: 125 AVTEEQVSELAFGVLKYILENGPDVDLFDISKERIVRLSPSGGEDGFAVAVSNAPGSRDG 184 A + + + F K ++ P + + ++R +D VS G + G Sbjct: 117 AANDRNQARMVFDSAKQMVNASPKLAAVCNVQRDVIRY-----KDNTYRVVSADAGRQQG 171 Query: 185 ARTTFQHFDEPHRLFMPRHRDAHETMLQNMPKRPMEDPWTLYTSTAGQPGQGSIEEDVLA 244 DE + D + + R P L STAG G + Sbjct: 172 LNPAAVSLDE---YAFSKSSDLFDALTLGSAAR--NQPMFLIISTAGPDPDGPFAA-LCE 225 Query: 245 EAESIARGERQDPSLFFFRRWAGDEHDDLSTVEKRVAAVADATGPIGEWGPGQFERIAKD 304 + E + GE DP+LF+ R W + + ++ V A + + I P F+ A+ Sbjct: 226 QGERVNSGEADDPTLFY-RSWGPKLGETVDHLDPEVWARCNPSYDI--LNPDDFKAAAQR 282 Query: 305 YDRTGIDRAYWERVY-LNRWRKSGSQAFDM----TRLVQCDETVPDGAFVTAGFDGSRWR 359 R+Y L+++ + S + D+ + G V GFDGS Sbjct: 283 STEASF------RIYRLSQFVRGASTWLPHGLWDSLAADDDDPLEPGDEVVCGFDGSWKG 336 Query: 360 DATAVVVTEIATGRQMLLGCWERPENVEEWEVPEHEVTALVVDMMSRFEVWRMYCDPWGW 419 D+TA+V + R +LG WE P + W VP +V + + + V + DP+ W Sbjct: 337 DSTALVACRVRDLRVFVLGHWEAPADDIHWRVPMADVREALHSALDTYRVRNLVADPYRW 396 Query: 420 DSTIAAW-AGRFPDRVVEWAVGGGGSLRRVAAATQGYADALATGDAALAANVWRPKFVEH 478 + T+ A FP SL R+ ATQ DA G + N P H Sbjct: 397 EETLDNLEAEGFPVEAFP-----TNSLARMVPATQAVYDACRDGRLSHDGN---PALARH 448 Query: 479 MGHAGRRE 486 +G+A +E Sbjct: 449 IGNAVLKE 456 >gi|15412|lcl|protein:vir:1325 Length: 519 # NCBI annotation: gp33 # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047924;swissprot:trembl:q9t214;genbank:gi :9631142;uniprot:Q9T214;genbank:GeneID:2715903 Length = 519 Score = 73.6 bits (179), Expect = 6e-15, Method: Compositional matrix adjust. Identities = 127/532 (23%), Positives = 199/532 (37%), Gaps = 60/532 (11%) Query: 5 QVPAVDLAFPTLGPQVCDFIEDRMVFGPGSLSGQPARLDDEKRALVYRLYELYPRGHHLA 64 +PA D P+ G +V +IE+ + GS +GQP RL +R L+ Y L + Sbjct: 14 HIPA-DATVPSEGYRVAKWIEE-FCYLTGSFAGQPFRLLPWQRELLIDAYVL-TQDTFGR 70 Query: 65 GRRRFERAGVELRKGVAKTEFAAWICGVELHPEAPVRCDGFDAAGNPVGRPVRSPVIPMM 124 RR+ V + + K+ AA I L D DA + Sbjct: 71 WRRKHRTVVVCVARKNGKSTIAAAIMLYHL------IADRGDAQRQIIA----------- 113 Query: 125 AVTEEQVSELAFGVLKYILENGPDVDLFDISKERIVRLSPSGGEDGFAVAVSNAPGSRDG 184 A + + + F K ++ P + + ++R +D VS G + G Sbjct: 114 AANDRNQARMVFDSAKQMVNASPKLAAVCDVQRDVIRY-----KDNTYRVVSADAGRQQG 168 Query: 185 ARTTFQHFDEPHRLFMPRHRDAHETMLQNMPKRPMEDPWTLYTSTAGQPGQGSIEEDVLA 244 DE +H D + + R P L STAG G + Sbjct: 169 LNPAAVSLDE---YAFSKHSDLFDALTLGSAAR--NQPMFLIISTAGPDPDGPFAA-LCE 222 Query: 245 EAESIARGERQDPSLFFFRRWAGDEHDDLSTVEKRVAAVADATGPIGEWGPGQFERIAKD 304 + E + GE DP+LF+ R W + + ++ V + + I P F+ A+ Sbjct: 223 QGERVNSGEADDPTLFY-RSWGPKLGETVDHLDPDVWRACNPSYDI--LNPDDFKAAAQR 279 Query: 305 YDRTGIDRAYWERVY-LNRWRKSGSQAFDM---TRLVQCDETVPDGAFVTAGFDGSRWRD 360 R+Y L+++ + S L D+ + G V GFDGS D Sbjct: 280 STEASF------RIYRLSQFVRGASTWLPHGLWDSLAADDDPLEPGDEVVLGFDGSWKGD 333 Query: 361 ATAVVVTEIATGRQMLLGCWERPENVEEWEVPEHEVTALVVDMMSRFEVWRMYCDPWGWD 420 +TA+V I + +LG WE P + W VP +V + + + V + DP+ W+ Sbjct: 334 STALVACRIRDLKVFVLGHWEAPADDAHWRVPMADVREELHTALDVYRVRNLVADPYRWE 393 Query: 421 STIAAW-AGRFPDRVVEWAVGGGGSLRRVAAATQGYADALATGDAALAANVWRPKFVEHM 479 T+ A FP SL R+ ATQ DA G + N P H+ Sbjct: 394 ETLDNLEADGFPVEAFP-----TNSLARMVPATQAVYDACRDGRLSHDGN---PALGRHI 445 Query: 480 GHAGRRELKLVDDTGQPLWVMQKQDGRLADKFDAAMAGMLSWEACVDARRDG 531 G+A +E D G + K+ K D A+A +L+ V R D Sbjct: 446 GNAVLKE----DARGARI---TKEHASSRRKIDLAVAMVLAVHGAVMWREDN 490 >gi|16473|lcl|protein:vir:7767 Length: 566 # NCBI annotation: gp13 # Family: family:all:1551 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817601;genbank:gi:29566031;genbank:GeneID :1259225 Length = 566 Score = 72.8 bits (177), Expect = 9e-15, Method: Compositional matrix adjust. Identities = 122/504 (24%), Positives = 189/504 (37%), Gaps = 87/504 (17%) Query: 71 RAGVELR-KGVAKTEFAAWICGVELHPEAPVRCDGFDAAGNPVGRPVRSPVIPMMAVTEE 129 R G+ R KG K FAA + EL PV FDA GNPVG+P + I + AV+++ Sbjct: 86 REGILRRLKGWGKDPFAAALSLAELC--GPVAFSHFDADGNPVGKPRHAAWITIAAVSQD 143 Query: 130 QVSELAFGVLKYILENGPDVDLFDISKERIVRLSPSGGEDGFAVAVSNAPGSRDGARTTF 189 Q F + ++ D + + R + S +GG A +++P S +G R TF Sbjct: 144 QTKN-TFSLFPIMISKQLKED-YGLLVNRFIIYSEAGGR---IEAATSSPASVEGNRPTF 198 Query: 190 QHFDE-------PHRLFMPRHRDAHETMLQNMPKRPMEDPWTLYTSTAGQPGQGSIEEDV 242 +E P H H + N+ K P L A PG ++ E Sbjct: 199 VIENETQWWGAGPGGEINDGHA-MHGAIEGNLTKIPGAR--RLAICNAHIPGNDTVAEKD 255 Query: 243 LAEAESIARGERQDPSLFFFRRWAGDEHDDLSTVEKRVAAVADATGPIGEWGPGQFE--- 299 + I G+ D + + A A A P+ E P Q E Sbjct: 256 WDAYQDILSGKAVDTGMLY------------------DALEAPADTPVSEI-PSQREDPE 296 Query: 300 --RIAKDYDRTGIDRA----YW--------------------ERVYLNRWRKSGSQAFDM 333 ++ R GI+ A YW R +LN+ Sbjct: 297 GYQLGIKKLREGIEIARGDSYWLPVDEILMSILDIKNSITESRRKFLNQINAHEDSWISP 356 Query: 334 TRLVQCD----ETVPDGAFVTAGFDGSRWRDATAVVVTEIATGRQMLLGCWERPENVEEW 389 +C + + G +T GFDGS+ D TA+V + G L+ W PE+ E Sbjct: 357 NEWNRCQPSTIQPLTKGDRITLGFDGSKSNDWTALVACRVDDGMLFLIKVWN-PEDYESG 415 Query: 390 EVPEHEVTALVVDMMSRFEVWRMYCDPWGWDSTIAAWAGRFPDRVVEWAVGGG------- 442 EVP +V A V M + ++V D +++ + W F ++ A G Sbjct: 416 EVPREDVDATVRSMFASYDVVAFRADVKEFEAYVDQWGRDFRKKIQVNATPGNPIAFDMR 475 Query: 443 GSLRRVAAATQGYADALATGDAALAANVWRPKFVEHMGHAGRRELKLVDDTGQPLWVMQK 502 G +R A + + DA+ + N P +H+ +A RR D ++K Sbjct: 476 GQTKRFAFDCERFLDAVIEQEVFHDGN---PVLKQHVCNA-RRHPTTYDAIA-----IRK 526 Query: 503 QDGRLADKFDAAMAGMLSWEACVD 526 K DAA+ +L++ A D Sbjct: 527 ASKDSGKKIDAAVCAVLAFGARQD 550 >gi|19787|lcl|protein:vir:2340 Length: 562 # NCBI annotation: gp10 # Family: family:all:1551 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075277;genbank:gi:12657864;genbank:GeneID :920069 Length = 562 Score = 70.5 bits (171), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 119/502 (23%), Positives = 184/502 (36%), Gaps = 72/502 (14%) Query: 65 GRRRFERAGVELRKGVAKTEFAAWICGVELHPEAPVRCDGFDAAGNPVGRPVRSPVIPMM 124 G+ + + KG K + EL PV FD GNPVG+ + I + Sbjct: 77 GKYAYREGTLRRMKGWGKDPMIGALALAELC--GPVAFSHFDDNGNPVGKARHAAWITIA 134 Query: 125 AVTEEQVSELAFGVLKYILENGPDVDLFDISKERIVRLSPSGGEDGFAVAVSNAPGSRDG 184 AV+++Q F + ++ + + +S R + S GG A A +P S +G Sbjct: 135 AVSQDQTKN-TFSLFPIMVSKRLRSE-YGLSVNRFIIYSEIGGRLEAATA---SPASMEG 189 Query: 185 ARTTFQHFDE-------PHRLFMPRHRDAHETMLQNMPKRPMEDPWTLYTSTAGQPGQGS 237 R TF +E P H+ A E + NM K P TL A +PG + Sbjct: 190 NRPTFVVQNETQWWGVGPGGEVNGGHQMA-EVIEGNMTKVPGAR--TLSICNAHRPGDDT 246 Query: 238 IEEDVLAEAESIARGERQDPSLFFFRRWAGDEHDDLSTVEKRVAAVADATGPIGEWGPGQ 297 + E I GE D + + +E P E PG Sbjct: 247 VAERSYQNWLDILAGEVIDTGILY------------DALEAPADTPVSEIPPPSEDEPGY 294 Query: 298 FERIAKDYDRTGIDR--AYW--------------------ERVYLNRWRKSGSQAFDMTR 335 +AK + G+ R + W R +LN+ S Sbjct: 295 TAGVAKLLEGLGVARGDSIWLPLDDILMSVLSAKNDIIESRRKFLNQVNASEDSWLAPAD 354 Query: 336 LVQCDET----VPDGAFVTAGFDGSRWRDATAVVVTEIATGRQMLLGCWERPENVEEWEV 391 +C T + G +T GFDGS+ D TA+V + G L+ W PEN EV Sbjct: 355 WDKCHSTSLRPLTKGDKITLGFDGSKSNDWTALVACRVEDGAVFLIDYW-NPENYPSGEV 413 Query: 392 PEHEVTALVVDMMSRFEVWRMYCDPWGWDSTIAAWAGRFPDRVVEWAVGGG-------GS 444 P+ +V A+V M ++EV D +++ + W F + A G G Sbjct: 414 PKEDVDAVVRSMKDKYEVVAFRADVKEFEAYVDQWGQLFRRTIKVNASPGNPVAFDMRGQ 473 Query: 445 LRRVAAATQGYADALATGDAALAANVWRPKFVEHMGHAGRRELKLVDDTGQPLWVMQKQD 504 +R A + +ADA+ + N P H+ +A R + D ++K Sbjct: 474 TKRFALDCERFADAVLEQELVHDNN---PVMKAHITNA-HRHPTIYDAIS-----IRKPS 524 Query: 505 GRLADKFDAAMAGMLSWEACVD 526 K DAA+ +L++ A D Sbjct: 525 KASKRKIDAAVCSVLAFGARQD 546 >gi|16679|lcl|protein:vir:4222 Length: 593 # NCBI annotation: predicted 66.2Kd protein # Family: family:all:1551 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039677;swissprot:sw:q05219;genbank:gi:962 5443;uniprot:Q05219;genbank:GeneID:2942932;interpro:IPR0 05021 Length = 593 Score = 63.2 bits (152), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 122/532 (22%), Positives = 200/532 (37%), Gaps = 86/532 (16%) Query: 44 DEKRALVYRLYELYPRGHHLAGRRRFERAGVELR-KGVAKTEFAAWICGVELHPEAPVRC 102 DE+ LV Y + +G ++ R GV R KG K F A +C EL PV Sbjct: 83 DEQVRLVLWWYAVDDQGQYIY------REGVIRRLKGWGKDPFTAALCLAELC--GPVAF 134 Query: 103 DGFDAAGNPVGRPVRSPVIPMMAVTEEQVSELAFGVLKYILENGPDVDLFDISKERIVRL 162 FDA GNPVG+P + I + AV+++Q F + ++ + + + R + Sbjct: 135 SHFDADGNPVGKPRSAAWITVAAVSQDQTKN-TFSLFPVMISKKLKAE-YGLDVNRFIIY 192 Query: 163 SPSGGEDGFAVAVSNAPGSRDGARTTFQHFDE-------PHRLFMPRHRDAHETMLQNMP 215 S +GG A +++P S +G R TF +E P H A E + NM Sbjct: 193 SAAGGR---IEAATSSPASMEGNRPTFVVQNETQWWGQGPDGKVNEGHAMA-EVIEGNMT 248 Query: 216 KRPMEDPWTLYTSTAGQPGQGSIEEDVLAEAESIARG----------------------- 252 K +E TL A PG ++ E E + + G Sbjct: 249 K--VEGSRTLSICNAHIPGTETVAEKAWDEYQKVQAGDSVDTGMMYDALEAPADTPVSEI 306 Query: 253 --ERQDPSLF-----FFRRWAGDEHDDLS--TVEKRVAAVADATGPIGEWGPGQFERIAK 303 +++DP F R D + ++ + ++ PI E +F Sbjct: 307 PPQKEDPEGFEKGIEKLREGLLIARGDSTWLPIDDIIKSILSTKNPITE-SRRKFLNQVN 365 Query: 304 DYDRTGIDRAYWERVYLNRWRKSGSQAFDMTRLVQCDETVPDGAFVTAGFDGSRWRDATA 363 + + + W R ++ + + L + D +T GFDGS+ D TA Sbjct: 366 AAEDSWLSPQEWNRCQVDLAKYLDKHGREFAPLQRGDR-------ITLGFDGSKSNDWTA 418 Query: 364 VVVTEIATGRQMLLGCWERPENVEEWEVPEHEVTALVVDMMSRFEV---------WRMYC 414 +V ++ G ++ W+ P+ EVP +V A V + ++V + Y Sbjct: 419 LVGCRVSDGLLFVIDIWD-PQKYGG-EVPREDVDAKVHSAFAHYDVVAFRADVKEFEAYV 476 Query: 415 DPWGWDSTIAAWAGRFPDRVVEWAVGGGGSLRRVAAATQGYADALATGDAALAANVWRPK 474 D WG P+ V A G +R A + DA+ G+ N P Sbjct: 477 DQWGRTYKKKLKVNASPNNPV--AFDMRGQQKRFAFDCERLEDAVLEGEVWHDGN---PV 531 Query: 475 FVEHMGHAGRRELKLVDDTGQPLWVMQKQDGRLADKFDAAMAGMLSWEACVD 526 +H+ +A R T ++K + K DAA+ +L++ A D Sbjct: 532 LRQHVLNAKRHP------TNYDAIAIRKVTKDSSKKIDAAVCAVLAFGARQD 577 >gi|9490|lcl|protein:vir:104081 Length: 594 # NCBI annotation: gp13 # Family: family:all:1551 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655592;genbank:gi:109392463;genbank:GeneI D:4156949 Length = 594 Score = 62.4 bits (150), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 128/527 (24%), Positives = 194/527 (36%), Gaps = 75/527 (14%) Query: 44 DEKRALVYRLYELYPRGHHLAGRRRFERAGVELR-KGVAKTEFAAWICGVELHPEAPVRC 102 DE+ LV Y + +G ++ R GV R KG K F A +C EL PV Sbjct: 83 DEQVRLVLWWYAVDDKGQYIY------REGVIRRLKGWGKDPFTAALCLAELC--GPVAF 134 Query: 103 DGFDAAGNPVGRPVRSPVIPMMAVTEEQVSELAFGVLKYILENGPDVDLFDISKERIVRL 162 FDA GNPVG+ +P I + AV+++Q F + ++ + +++ R + Sbjct: 135 SHFDADGNPVGKRRNAPWITVAAVSQDQTKN-TFSLFPVMISKKLKAE-YNLDVNRFIIY 192 Query: 163 SPSGGEDGFAVAVSNAPGSRDGARTTFQHFDE-------PHRLFMPRHRDAHETMLQNMP 215 S G G A +++P + +G R TF +E P H A E + NM Sbjct: 193 SDGGA--GRIEAATSSPAAMEGNRPTFVVQNETQWWGQGPDGKVNEGHAMA-EVIEGNMT 249 Query: 216 KRPMEDPWTLYTSTAGQPGQGSIEEDVLAEAESIARGERQDPSLFF-------------F 262 K +E TL A PG ++ E IA + D L + Sbjct: 250 K--VEGSRTLSICNAHIPGTETVGEKSYNNWLDIATDKSVDTGLLYDALEAPADTPISEI 307 Query: 263 RRWAGDEHDDLSTVEKRVAAVADATGPIGEWGPGQFERIAKDYDRTGIDRAYWERVYLNR 322 D +EK V A G W P + I K T R +LN+ Sbjct: 308 PSQKEDPEGFERGIEKLREGVLIARGD-STWLP--IDDIIKSILSTKNSITESRRKFLNQ 364 Query: 323 WRKSGSQAFDMTRLVQC------------DETVP--DGAFVTAGFDGSRWRDATAVVVTE 368 + +C E VP G +T GFDGS+ D TA+V Sbjct: 365 VNAAEDSWLSPQEWNRCFADPEKYLERRGHEFVPLQRGDRITLGFDGSKSNDWTALVGCR 424 Query: 369 IATGRQMLLGCWERPENVEEWEVPEHEVTALVVDMMSRFEV---------WRMYCDPWGW 419 ++ G ++ W+ P+ EVP +V A V ++V + Y D WG Sbjct: 425 VSDGLLFVIDIWD-PQKYGG-EVPREDVDAKVHSAFKHYDVVAFRADVKEFEAYVDSWGR 482 Query: 420 DSTIAAWAGRFPDRVVEWAVGGGGSLRRVAAATQGYADALATGDAALAANVWRPKFVEHM 479 P+ V A G +R A + DA+ G+ N +H+ Sbjct: 483 TYKKKLKVNASPNNPV--AFDMRGQQKRFAFDCERLEDAVLEGEVWHDGNA---VLSQHV 537 Query: 480 GHAGRRELKLVDDTGQPLWVMQKQDGRLADKFDAAMAGMLSWEACVD 526 +A R T ++K + K DAA+ +L++ A D Sbjct: 538 MNAKRHP------TTYDAIAIRKVTKDSSKKIDAAVCAVLAFGARQD 578 >gi|17795|lcl|protein:vir:2426 Length: 595 # NCBI annotation: gp13 # Family: family:all:1551 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046828;genbank:gi:9630396;genbank:GeneID: 1261617 Length = 595 Score = 61.2 bits (147), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 121/532 (22%), Positives = 199/532 (37%), Gaps = 86/532 (16%) Query: 44 DEKRALVYRLYELYPRGHHLAGRRRFERAGVELR-KGVAKTEFAAWICGVELHPEAPVRC 102 DE+ LV Y + +G ++ R GV R KG K F A +C EL PV Sbjct: 85 DEQVRLVLWWYAVDEKGQYVY------REGVIRRLKGWGKDPFTAALCLAELC--GPVAF 136 Query: 103 DGFDAAGNPVGRPVRSPVIPMMAVTEEQVSELAFGVLKYILENGPDVDLFDISKERIVRL 162 FD G +G+P + I + AV+++Q F + ++ + + + R + Sbjct: 137 SHFDETGQAIGKPRPAAWITVAAVSQDQTKN-TFSLFPVMISKKLKTE-YGLDVNRFIIY 194 Query: 163 SPSGGEDGFAVAVSNAPGSRDGARTTFQHFDE-------PHRLFMPRHRDAHETMLQNMP 215 S +GG A +++P S +G R TF +E P H A E + NM Sbjct: 195 SAAGGR---IEAATSSPASMEGNRPTFVVQNETQWWGQGPDGKANEGHAMA-EVIEGNMT 250 Query: 216 KRPMEDPWTLYTSTAGQPGQGSIEEDVLAEAESIARGERQDPSLFF-------------- 261 K +E TL A PG ++ E E + + G+ D + + Sbjct: 251 K--VEGSRTLSICNAHIPGTETVAEKAYVEWQDVQSGKSVDTGMMYDALEAPADTPISEI 308 Query: 262 ---------FRRWAGDEHDDL------ST---VEKRVAAVADATGPIGEWGPGQFERIAK 303 FR + L ST ++ + ++ I E +F Sbjct: 309 PSEKENPDGFREGIEKLREGLLIARGDSTWLPIDDIIKSILSTKNSITE-SRRKFLNQVN 367 Query: 304 DYDRTGIDRAYWERVYLNRWRKSGSQAFDMTRLVQCDETVPDGAFVTAGFDGSRWRDATA 363 + + + W R + + + F++ L + G +T GFDGS+ D TA Sbjct: 368 AAEDSWLSPQEWNRCFADPDKYLDKMGFELAPLDR-------GQKITLGFDGSKSNDWTA 420 Query: 364 VVVTEIATGRQMLLGCWERPENVEEWEVPEHEVTALVVDMMSRFEV---------WRMYC 414 +V ++ G ++ W+ P+ EVP V A V SR++V + Y Sbjct: 421 LVGCRVSDGLLFVIDIWD-PQKYGG-EVPREFVDAAVHSAFSRYDVVAFRADVKEFEAYV 478 Query: 415 DPWGWDSTIAAWAGRFPDRVVEWAVGGGGSLRRVAAATQGYADALATGDAALAANVWRPK 474 D WG P+ V A G +R A + DA+ G+ N P Sbjct: 479 DSWGRTYKKKLKVNASPNNPV--AFDMRGQQKRFAFDCERLEDAVLEGEVWHDGN---PV 533 Query: 475 FVEHMGHAGRRELKLVDDTGQPLWVMQKQDGRLADKFDAAMAGMLSWEACVD 526 +H+ +A R T ++K + K DAA+ +L++ A D Sbjct: 534 LRQHVLNAKRHP------TTYDAIAIRKVTKDSSKKIDAAVCAVLAFGARQD 579 >gi|10991|lcl|protein:vir:78228 Length: 903 # NCBI annotation: Putative terminase # Family: family:all:1551 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491662;genbank:gi:157786486;genbank:Ge neID:5625706 Length = 903 Score = 52.8 bits (125), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 48/183 (26%), Positives = 77/183 (42%), Gaps = 17/183 (9%) Query: 346 GAFVTAGFDGSRWRDATAVVVTEIATGRQMLLGCWERPENVEEWEVPEHEVTALVVDMMS 405 G +T GFDGS D TA+ + G L+ W PE E +VP +V A V M Sbjct: 710 GERITLGFDGSLSNDHTALTACRVEDGALFLVKVWV-PEKYEGHKVPRQDVDAYVRSMFE 768 Query: 406 RFEVWRMYCDPWGWDSTIAAWAGRFPDRVVEWAVGGG-------GSLRRVAAATQGYADA 458 +++V M D ++ ++ AW F ++ A G G +R A + + DA Sbjct: 769 KYDVVGMRADVKEFEQSVDAWGQDFRRKLKINASPGNPVAFDMRGQQKRFALDCERFRDA 828 Query: 459 LATGDAALAANVWRPKFVEHMGHAGRRELKLVDDTGQPLWVMQKQDGRLADKFDAAMAGM 518 + G+ N P H+ +A + + D ++K K DAA+ + Sbjct: 829 VLAGEVKHDNN---PVLKAHITNA-HQHPTIYDAIS-----IRKPGKESKRKIDAAVTAV 879 Query: 519 LSW 521 L+W Sbjct: 880 LAW 882 Score = 27.7 bits (60), Expect = 0.40, Method: Compositional matrix adjust. Identities = 15/44 (34%), Positives = 23/44 (52%), Gaps = 7/44 (15%) Query: 87 AWICGVELHPEAPVRCDGFDAAGNPVGRPVRSPVIPMMAVTEEQ 130 A +CG PV FD GNPVG+ + + + AV+++Q Sbjct: 104 AELCG-------PVAFSHFDDNGNPVGKTRHAAWVTIAAVSQDQ 140 >gi|11235|lcl|protein:vir:78494 Length: 562 # NCBI annotation: Putative terminase # Family: family:all:1551 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491581;genbank:gi:157786404;genbank:Ge neID:5625646 Length = 562 Score = 51.6 bits (122), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 48/183 (26%), Positives = 77/183 (42%), Gaps = 17/183 (9%) Query: 346 GAFVTAGFDGSRWRDATAVVVTEIATGRQMLLGCWERPENVEEWEVPEHEVTALVVDMMS 405 G +T GFDGS D TA+ + G L+ W PE E +VP +V A V M Sbjct: 369 GERITLGFDGSLSNDHTALTACRVEDGALFLVKVW-VPEKYEGHKVPRQDVDAYVRSMFE 427 Query: 406 RFEVWRMYCDPWGWDSTIAAWAGRFPDRVVEWAVGGG-------GSLRRVAAATQGYADA 458 +++V M D ++ ++ AW F ++ A G G +R A + + DA Sbjct: 428 KYDVVGMRADVKEFEQSVDAWGQDFRRKLRINASPGNPVAFDMRGQQKRFALDCERFRDA 487 Query: 459 LATGDAALAANVWRPKFVEHMGHAGRRELKLVDDTGQPLWVMQKQDGRLADKFDAAMAGM 518 + G+ N P H+ +A + + D ++K K DAA+ + Sbjct: 488 VLAGEVKHDNN---PVLKAHITNA-HQHPTIYDAIS-----IRKPGKESKRKIDAAVTAV 538 Query: 519 LSW 521 L+W Sbjct: 539 LAW 541 Score = 43.1 bits (100), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 52/204 (25%), Positives = 81/204 (39%), Gaps = 17/204 (8%) Query: 65 GRRRFERAGVELRKGVAKTEFAAWICGVELHPEAPVRCDGFDAAGNPVGRPVRSPVIPMM 124 G+ + + KG K + EL PV FD GNPVG+P + + + Sbjct: 77 GKYAYREGTLRRMKGWGKDPMIGALALAELC--GPVAFSHFDDNGNPVGKPRHAAWVTVA 134 Query: 125 AVTEEQVSELAFGVLKYILENGPDVDLFDISKERIVRLSPSGGEDGFAVAVSNAPGSRDG 184 AV+++Q FG+ ++ + + +S R + S GG A A +P S +G Sbjct: 135 AVSQQQTVN-TFGLFPIMVSKKLKTE-YGLSVNRFIIYSEIGGRLEAATA---SPASMEG 189 Query: 185 ARTTFQHFDE-------PHRLFMPRHRDAHETMLQNMPKRPMEDPWTLYTSTAGQPGQGS 237 R TF +E P H+ A E + NM K P TL A +PG + Sbjct: 190 NRPTFVVQNETQWWGVGPGGEVNDGHQMA-EVIEGNMTKVPGAR--TLSICNAHRPGDDT 246 Query: 238 IEEDVLAEAESIARGERQDPSLFF 261 + E I G+ D + + Sbjct: 247 VAEMSYLNWLDILAGDAIDTGVLY 270 >gi|13389|lcl|protein:vir:1263 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690755;genbank:gi:22854995;genbank:GeneID :955207 Length = 416 Score = 43.5 bits (101), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 27/79 (34%), Positives = 40/79 (50%), Gaps = 5/79 (6%) Query: 20 VCDFIEDRMVFG--PGSLSGQPARLDDEKRALVYRLYELYPRGHHLAGRRRFERAGVELR 77 V DF E F G L+GQP L D + + +Y Y + + G RRF +A ++L Sbjct: 44 VVDFYEWSRQFNHVEGILAGQPIELTDFQLFIAANIYGFYKKEN---GARRFRKAYIQLA 100 Query: 78 KGVAKTEFAAWICGVELHP 96 + AK++F A I E+ P Sbjct: 101 RKNAKSQFLALIASYEIFP 119 >gi|16408|lcl|protein:vir:1883 Length: 504 # NCBI annotation: terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037663;genbank:gi:9634121;genbank:GeneID :1262500 Length = 504 Score = 32.0 bits (71), Expect = 0.019, Method: Compositional matrix adjust. Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 1/50 (2%) Query: 15 TLGPQVCDFIEDRMVFGPGSLSGQPARLDDEKRALVYRLYELYPRGHHLA 64 T G +V FIE + G L GQP RLD ++ + +Y+ P G +A Sbjct: 2 TRGERVIAFIERFCIVPEGKLIGQPMRLDTFQKEFILAVYD-NPAGTDMA 50 >gi|15611|lcl|protein:vir:188 Length: 504 # NCBI annotation: terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037698;genbank:gi:9634166;genbank:GeneID :1262528 Length = 504 Score = 31.2 bits (69), Expect = 0.036, Method: Compositional matrix adjust. Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 1/50 (2%) Query: 15 TLGPQVCDFIEDRMVFGPGSLSGQPARLDDEKRALVYRLYELYPRGHHLA 64 T G +V FIE + G L GQP RLD ++ + +Y+ P G +A Sbjct: 2 TRGERVIAFIERFCIVPEGKLIGQPMRLDPFQKDFILAVYD-NPAGTDMA 50 >gi|5726|lcl|protein:vir:95379 Length: 573 # NCBI annotation: phage terminase # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764473;genbank:gi:115334627;genbank:GeneI D:5179266 Length = 573 Score = 26.6 bits (57), Expect = 0.71, Method: Compositional matrix adjust. Identities = 23/86 (26%), Positives = 36/86 (41%), Gaps = 10/86 (11%) Query: 4 LQVPAVDLAFPTLGPQVCDFIEDRMVFGPGS------LSGQPARLDDEKRALVYRLYELY 57 L+ P D P V IE V G L G+P L+ ++ ++Y L Y Sbjct: 39 LENPKYDFN-PREAEFVIQIIEKTFVHDQGERLDGTPLRGEPFLLEPWQKFIIYNLLGFY 97 Query: 58 PRGHHLAGRRRFERAGVELRKGVAKT 83 +G + RRF+ A + + + KT Sbjct: 98 LKGTKI---RRFKEAFIFIPRKNGKT 120 >gi|12641|lcl|protein:vir:80117 Length: 572 # NCBI annotation: Phage terminase # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425601;genbank:gi:155042934;genbank:Ge neID:5469543 Length = 572 Score = 26.6 bits (57), Expect = 0.73, Method: Compositional matrix adjust. Identities = 23/86 (26%), Positives = 36/86 (41%), Gaps = 10/86 (11%) Query: 4 LQVPAVDLAFPTLGPQVCDFIEDRMVFGPGS------LSGQPARLDDEKRALVYRLYELY 57 L+ P D P V IE V G L G+P L+ ++ ++Y L Y Sbjct: 39 LENPKYDFN-PREAEFVIQIIEKTFVHDQGERLDGTPLRGEPFLLEPWQKFIIYNLLGFY 97 Query: 58 PRGHHLAGRRRFERAGVELRKGVAKT 83 +G + RRF+ A + + + KT Sbjct: 98 LKGTKI---RRFKEAFIFIPRKNGKT 120 >gi|967|lcl|protein:vir:6208 Length: 574 # NCBI annotation: Terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852588;genbank:gi:31415848;genbank:GeneID :1489206 Length = 574 Score = 25.0 bits (53), Expect = 2.0, Method: Compositional matrix adjust. Identities = 57/245 (23%), Positives = 92/245 (37%), Gaps = 22/245 (8%) Query: 19 QVCDFIEDRMVFGPGSLSGQPARLDDEKRALVYRLYELYPRGHHLAGRRRFERAGVELRK 78 ++ DF++ + G L+GQ L+ + + +Y Y + R + V++ K Sbjct: 53 EMLDFVQSFIRHVKGPLAGQLMELELWEMFVFANMYGWYHKNEKGKTVRVIRESYVQVPK 112 Query: 79 GVAKTEFAAWICGVELHPEAPVRCDGFDAAGN-PVGRPVRSPVIPMMAVTEEQVSELAFG 137 KT AA ++ E + D + AA + + P+ A E LA Sbjct: 113 KNGKTIIAAGALLYAMYGELELGADCYCAASDYEQAQNAAEPI----AQAIENSEPLARH 168 Query: 138 VLKYILENGPDVDLFDISKERIVRLSPSG--GEDGFAVAVSNAPGSRDGARTTFQHFDEP 195 Y NG + R S +G ++ F V N G +G F DE Sbjct: 169 TQIYKGVNGT-------VSGAMYRYSINGIAYQNKFKVLTKNTKG-LEGKNPYFVLNDEL 220 Query: 196 HRLFMPRHRDAHETMLQNMPKRPMEDPWTLYTSTAGQPGQGSIEEDVLAEAESIARGERQ 255 H + D ++ + R E P L STAG+ G S+ V A+ + + Sbjct: 221 HA---QENMDMYDNLKSAQISR--EQPMMLNISTAGK-GASSVGMRVYKYAKLVLEND-D 273 Query: 256 DPSLF 260 D SLF Sbjct: 274 DDSLF 278 >gi|14824|lcl|protein:vir:4088 Length: 550 # NCBI annotation: terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510983;swissprot:trembl:q8w607;genbank:gi :17488505;uniprot:Q8W607;genbank:GeneID:1260359 Length = 550 Score = 25.0 bits (53), Expect = 2.2, Method: Compositional matrix adjust. Identities = 31/142 (21%), Positives = 61/142 (42%), Gaps = 9/142 (6%) Query: 121 IPMMAVTEEQVSELAFGVLKYILENGPDVDLFDISKERIVRLSPSGGEDGFAVAVSNAPG 180 I ++A +E+Q ++ +F + +LE D + ++ + +V + SNA Sbjct: 132 IDIVATSEDQ-AKTSFEDIYNMLEENKDT-MKNVYRWNLVAIQHRKNGSTIKYKTSNA-R 188 Query: 181 SRDGARTTFQHFDEPHRLFMPRHRDAHETMLQNMPKRPMEDPWTLYTSTAGQPGQGSIEE 240 ++DG+R FDE H + D + E + +T G +GS+ + Sbjct: 189 TKDGSRPGAVAFDEEHA-----YEDYSNYAVHTSGLGKKERSRRFHLTTDGYV-RGSVLD 242 Query: 241 DVLAEAESIARGERQDPSLFFF 262 D A+ + GE+ + +F F Sbjct: 243 DRKVLAQQVLNGEKPNSKMFPF 264 >gi|3559|lcl|protein:vir:101786 Length: 556 # NCBI annotation: gp9 # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654764;genbank:gi:109302762;genbank:Gene ID:4156221 Length = 556 Score = 25.0 bits (53), Expect = 2.3, Method: Compositional matrix adjust. Identities = 21/71 (29%), Positives = 34/71 (47%), Gaps = 6/71 (8%) Query: 23 FIEDRMVFGPGSLSGQPARLDDEKRALVYRLYELYPRGHHL----AGRRRFERAGVELRK 78 F E+ +V G + + L+D +R + R L+ R + +RR+E A +EL + Sbjct: 31 FFEEILVHTKGQYTRKKFILEDWQRDDIVR--PLFGRVEYSDEFGCYKRRYEIAWIELAR 88 Query: 79 GVAKTEFAAWI 89 KTE A I Sbjct: 89 KNGKTELLAGI 99 >gi|15908|lcl|protein:vir:3418 Length: 641 # NCBI annotation: DNA packaging protein # Family: family:all:140 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040581;genbank:gi:9626245;genbank:GeneID: 2703524 Length = 641 Score = 23.1 bits (48), Expect = 8.4, Method: Compositional matrix adjust. Identities = 19/65 (29%), Positives = 27/65 (41%), Gaps = 8/65 (12%) Query: 254 RQDPSLFFFRRWAGDEHDDLSTVEKRVAAVADATGPIGEWGPGQFERIAKDYDRTGIDRA 313 R PSL W G +H D + KR T G W G + AK+Y +D A Sbjct: 124 RDIPSLLALAPWYGKKHRDNTLTMKRF------TNGRGFWCLGG--KAAKNYREKSVDVA 175 Query: 314 YWERV 318 ++ + Sbjct: 176 GYDEL 180 Database: capsid_neck_tail Posted date: Nov 7, 2013 12:16 PM Number of letters in database: 206,069 Number of sequences in database: 514 Lambda K H 0.321 0.137 0.437 Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 270,549 Number of Sequences: 514 Number of extensions: 13245 Number of successful extensions: 69 Number of sequences better than 100.0: 24 Number of HSP's better than 100.0 without gapping: 16 Number of HSP's successfully gapped in prelim test: 8 Number of HSP's that attempted gapping in prelim test: 28 Number of HSP's gapped (non-prelim): 34 length of query: 545 length of database: 206,069 effective HSP length: 76 effective length of query: 469 effective length of database: 167,005 effective search space: 78325345 effective search space used: 78325345 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 39 (19.6 bits)