BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|Aclame:protein:vir:8897|NCBI_annot:DNA packaging protein B|genbank:acc:NP_813786;genbank:gi:29366741;genbank:GeneID:1258833
         (582 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gi|14546|lcl|protein:vir:8897 Length: 582 # NCBI annotation: DNA...  1202   0.0
gi|4214|lcl|protein:vir:94726 Length: 575 # NCBI annotation: mat...   840   0.0
gi|3994|lcl|protein:vir:99686 Length: 586 # NCBI annotation: DNA...   822   0.0
gi|3734|lcl|protein:vir:94555 Length: 585 # NCBI annotation: DNA...   731   0.0
gi|16146|lcl|protein:vir:10462 Length: 586 # NCBI annotation: DN...   722   0.0
gi|16205|lcl|protein:vir:1554 Length: 587 # NCBI annotation: DNA...   720   0.0
gi|13826|lcl|protein:vir:3376 Length: 586 # NCBI annotation: DNA...   719   0.0
gi|18462|lcl|protein:vir:2213 Length: 586 # NCBI annotation: DNA...   719   0.0
gi|11501|lcl|protein:vir:78718 Length: 574 # NCBI annotation: DN...   440   e-125
gi|7204|lcl|protein:vir:100065 Length: 577 # NCBI annotation: DN...   398   e-113
gi|7495|lcl|protein:vir:103312 Length: 624 # NCBI annotation: Te...   301   1e-83
gi|8927|lcl|protein:vir:97005 Length: 632 # NCBI annotation: 40 ...   294   2e-81
gi|15201|lcl|protein:vir:7028 Length: 631 # NCBI annotation: lar...   291   1e-80
gi|12767|lcl|protein:vir:80218 Length: 595 # NCBI annotation: pu...   273   3e-75
gi|11728|lcl|protein:vir:78939 Length: 601 # NCBI annotation: pu...   252   9e-69
gi|18767|lcl|protein:vir:6335 Length: 601 # NCBI annotation: put...   248   1e-67
gi|10294|lcl|protein:vir:105656 Length: 405 # NCBI annotation: p...   184   4e-48
gi|5868|lcl|protein:vir:95431 Length: 535 # NCBI annotation: hyp...    58   3e-10
gi|12711|lcl|protein:vir:80169 Length: 565 # NCBI annotation: te...    56   1e-09
gi|15319|lcl|protein:vir:3140 Length: 535 # NCBI annotation: hyp...    48   4e-07
gi|692|lcl|protein:vir:3649 Length: 507 # NCBI annotation: gp18 ...    37   8e-04
gi|1758|lcl|protein:vir:101556 Length: 507 # NCBI annotation: gp...    37   8e-04
gi|11323|lcl|protein:vir:78588 Length: 507 # NCBI annotation: Te...    37   9e-04
gi|6927|lcl|protein:vir:106715 Length: 507 # NCBI annotation: gp...    37   9e-04
gi|17058|lcl|protein:vir:5248 Length: 487 # NCBI annotation: put...    36   0.001
gi|4089|lcl|protein:vir:94598 Length: 581 # NCBI annotation: PfW...    35   0.002
gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: te...    35   0.002
gi|12543|lcl|protein:vir:80106 Length: 486 # NCBI annotation: Te...    34   0.004
gi|13633|lcl|protein:vir:3963 Length: 462 # NCBI annotation: put...    28   0.33
gi|291|lcl|protein:vir:3608 Length: 462 # NCBI annotation: TerL ...    27   0.49
gi|15492|lcl|protein:vir:732 Length: 402 # NCBI annotation: unkn...    27   0.50
gi|2538|lcl|protein:vir:94056 Length: 483 # NCBI annotation: put...    27   0.50
gi|19603|lcl|protein:vir:4076 Length: 205 # NCBI annotation: maj...    26   1.2
gi|19384|lcl|protein:vir:7851 Length: 887 # NCBI annotation: gp8...    26   1.5
gi|14353|lcl|protein:vir:3149 Length: 554 # NCBI annotation: unk...    25   2.0
gi|18586|lcl|protein:vir:8649 Length: 564 # NCBI annotation: gp7...    24   4.7
gi|27412|lcl|protein:vir:7207 Length: 163 # NCBI annotation: gp1...    24   5.6
gi|19767|lcl|protein:vir:6384 Length: 677 # NCBI annotation: ter...    23   6.7
gi|2061|lcl|protein:vir:97894 Length: 595 # NCBI annotation: gp2...    23   6.7
gi|1961|lcl|protein:vir:107494 Length: 595 # NCBI annotation: gp...    23   6.7
gi|22058|lcl|protein:vir:103457 Length: 163 # NCBI annotation: t...    23   6.9
gi|7375|lcl|protein:vir:99083 Length: 566 # NCBI annotation: gp7...    23   9.5

>gi|14546|lcl|protein:vir:8897 Length: 582 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813786;genbank:gi:29366741;genbank:GeneID:1258833
          Length = 582

 Score = 1202 bits (3109), Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 582/582 (100%), Positives = 582/582 (100%)

Query: 1   MSKPRNGADDLELIKRSFVAFLFVLWRALNLPKPTKCQIDMAKKLSAGDERRFILQAFRG 60
           MSKPRNGADDLELIKRSFVAFLFVLWRALNLPKPTKCQIDMAKKLSAGDERRFILQAFRG
Sbjct: 1   MSKPRNGADDLELIKRSFVAFLFVLWRALNLPKPTKCQIDMAKKLSAGDERRFILQAFRG 60

Query: 61  IGKSFITCAFVVWKLWNNPDLKFMIVSASKERADANSVFIKRIIDLLPFLHELKPGPGQR 120
           IGKSFITCAFVVWKLWNNPDLKFMIVSASKERADANSVFIKRIIDLLPFLHELKPGPGQR
Sbjct: 61  IGKSFITCAFVVWKLWNNPDLKFMIVSASKERADANSVFIKRIIDLLPFLHELKPGPGQR 120

Query: 121 DSSLAFDVGPAKPDHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLGELV 180
           DSSLAFDVGPAKPDHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLGELV
Sbjct: 121 DSSLAFDVGPAKPDHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLGELV 180

Query: 181 KEFDAILKPGGTIIYLGTPQTEMTLYRELEGRGYVTTIWPARYPKDQADWDSYGPRLAPM 240
           KEFDAILKPGGTIIYLGTPQTEMTLYRELEGRGYVTTIWPARYPKDQADWDSYGPRLAPM
Sbjct: 181 KEFDAILKPGGTIIYLGTPQTEMTLYRELEGRGYVTTIWPARYPKDQADWDSYGPRLAPM 240

Query: 241 LAAELQADGSLFWAPTDEVRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRD 300
           LAAELQADGSLFWAPTDEVRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRD
Sbjct: 241 LAAELQADGSLFWAPTDEVRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRD 300

Query: 301 FIVGTFAQDKGPTTLIWMPNAANECKGVPVVGLKGDRFHRYESVGQATASYAQKILVIDP 360
           FIVGTFAQDKGPTTLIWMPNAANECKGVPVVGLKGDRFHRYESVGQATASYAQKILVIDP
Sbjct: 301 FIVGTFAQDKGPTTLIWMPNAANECKGVPVVGLKGDRFHRYESVGQATASYAQKILVIDP 360

Query: 361 SGRGKDETGYAVLYQLNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGD 420
           SGRGKDETGYAVLYQLNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGD
Sbjct: 361
SGRGKDETGYAVLYQLNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGD 420 Query: 421 GMYIKLLAPVVTATFPCAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTAL 480 GMYIKLLAPVVTATFPCAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTAL Sbjct: 421 GMYIKLLAPVVTATFPCAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTAL 480 Query: 481 NADGTTDTSYSLLYQLTRITRERGSLAHDDRLDALAIGVQFFTEALERDSKVGESEMLQE 540 NADGTTDTSYSLLYQLTRITRERGSLAHDDRLDALAIGVQFFTEALERDSKVGESEMLQE Sbjct: 481 NADGTTDTSYSLLYQLTRITRERGSLAHDDRLDALAIGVQFFTEALERDSKVGESEMLQE 540 Query: 541 FLESHMEDALMGHDRLLEMSISEGVSIQYEDDGSMSNYMGWR 582 FLESHMEDALMGHDRLLEMSISEGVSIQYEDDGSMSNYMGWR Sbjct: 541 FLESHMEDALMGHDRLLEMSISEGVSIQYEDDGSMSNYMGWR 582 >gi|4214|lcl|protein:vir:94726 Length: 575 # NCBI annotation: maturation protein # Family: family:all:697 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338131;genbank:gi:77118209;genbank:GeneID :3707752 Length = 575 Score = 840 bits (2170), Expect = 0.0, Method: Compositional matrix adjust. Identities = 402/560 (71%), Positives = 458/560 (81%), Gaps = 1/560 (0%) Query: 14 IKRSFVAFLFVLWRALNLPKPTKCQIDMAKKLSAGDERRFILQAFRGIGKSFITCAFVVW 73 +K FV FLFVLW+AL+LP PT+CQIDMAKKLSAGD RRFILQAFRGIGKSFITCAFVVW Sbjct: 5 MKADFVFFLFVLWKALSLPVPTRCQIDMAKKLSAGDNRRFILQAFRGIGKSFITCAFVVW 64 Query: 74 KLWNNPDLKFMIVSASKERADANSVFIKRIIDLLPFLHELKPGPGQRDSSLAFDVGPAKP 133 KLWNNPDLKFMIVSASKERADANS+FIKRIIDL+P L ELKP GQRD+ ++FDVGPAKP Sbjct: 65 KLWNNPDLKFMIVSASKERADANSIFIKRIIDLMPQLKELKPKQGQRDAVISFDVGPAKP 124 Query: 134 DHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAILKPGGTI 193 DHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQ ARD L ELVKEFDAILKPGGTI Sbjct: 125 DHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQAARDRLSELVKEFDAILKPGGTI 184 Query: 194 IYLGTPQTEMTLYRELEGRGYVTTIWPARYPKDQADWDSYGPRLAPMLAAELQAD-GSLF 252 IYLGTPQ EMTLYRELEGRGY TTIWPARYP+D+ DW SYG RLAPML AEL+ D S + Sbjct: 185 IYLGTPQNEMTLYRELEGRGYTTTIWPARYPRDRKDWQSYGDRLAPMLQAELEEDPESFY 244 Query: 253 WAPTDEVRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQDKGP 312 W PTDEVRFDD DL+ERELSYGK GFALQFMLNPNLSD EKYPLKLRD 
IV P Sbjct: 245 WRPTDEVRFDDTDLKERELSYGKAGFALQFMLNPNLSDAEKYPLKLRDLIVADLDPASSP 304 Query: 313 TTLIWMPNAANECKGVPVVGLKGDRFHRYESVGQATASYAQKILVIDPSGRGKDETGYAV 372 W+PN N+ + VP VGL GD +H Y++VG A +SY QKILVIDPSGRGKDETGYAV Sbjct: 305 MVYQWLPNPQNKREDVPNVGLMGDSYHTYQTVGSAFSSYTQKILVIDPSGRGKDETGYAV 364 Query: 373 LYQLNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGDGMYIKLLAPVVT 432 LYQLNGYIF M+ GG RGGYED+ L+ALA I + KVNE V+EGNFGDGMY++L PV Sbjct: 365 LYQLNGYIFAMEVGGMRGGYEDSTLEALAKIGRKWKVNEYVIEGNFGDGMYLELFKPVAA 424 Query: 433 ATFPCAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTALNADGTTDTSYSL 492 P A+TEVKSKGQKELRICDVLEP++GSH+L++ + I +DY++A + DG + YSL Sbjct: 425 RIHPAAVTEVKSKGQKELRICDVLEPIMGSHRLIVNAAAIVQDYQSASDKDGVRNPIYSL 484 Query: 493 LYQLTRITRERGSLAHDDRLDALAIGVQFFTEALERDSKVGESEMLQEFLESHMEDALMG 552 YQ+TRI+RERG+LAHDDRLDALAIGVQFF E++ +D+ GE E+ +E+LE ME+ G Sbjct: 485 FYQMTRISRERGALAHDDRLDALAIGVQFFVESMAKDANKGEREVTEEWLEEQMENPRKG 544 Query: 553 HDRLLEMSISEGVSIQYEDD 572 + + GV + ++ D Sbjct: 545 FESIHTEFWDNGVRVTHDTD 564 >gi|3994|lcl|protein:vir:99686 Length: 586 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249597;genbank:gi:68299748;genbank:GeneID :3800001 Length = 586 Score = 822 bits (2124), Expect = 0.0, Method: Compositional matrix adjust. 
Identities = 397/561 (70%), Positives = 453/561 (80%), Gaps = 3/561 (0%) Query: 14 IKRSFVAFLFVLWRALNLPKPTKCQIDMAKKLSAGDERRFILQAFRGIGKSFITCAFVVW 73 +K FV FL VLWRALNLP+PT+CQ DMA+KL+AGDERRFILQAFRGIGKSFITCAFVVW Sbjct: 15 MKNDFVLFLMVLWRALNLPEPTRCQKDMARKLAAGDERRFILQAFRGIGKSFITCAFVVW 74 Query: 74 KLWNNPDLKFMIVSASKERADANSVFIKRIIDLLPFLHELKPGPGQRDSSLAFDVGPAKP 133 KLWNNP LKFMIVSASKERADANS+FIKRIIDLLPFLHELKP P QRDS ++FDVG AKP Sbjct: 75 KLWNNPQLKFMIVSASKERADANSIFIKRIIDLLPFLHELKPRPEQRDSVISFDVGLAKP 134 Query: 134 DHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAILKPGGTI 193 DHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQ ARD LGELVKEFDAILKP GTI Sbjct: 135 DHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQAARDRLGELVKEFDAILKPNGTI 194 Query: 194 IYLGTPQTEMTLYRELEGRGYVTTIWPARYPKDQADWDSYGPRLAPMLAAELQAD-GSLF 252 IYLGTPQ EMTLYRELE RGY TTIWPARYPKD D ++YG RLAPML EL + + + Sbjct: 195 IYLGTPQCEMTLYRELENRGYKTTIWPARYPKDMNDLETYGNRLAPMLKDELMENPEAYW 254 Query: 253 WAPTDEVRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQDKGP 312 W PTD VRFDD+DLRERELSYGK GFALQFMLNPNLSD EKYPLKLRDFIV DK P Sbjct: 255 WQPTDPVRFDDEDLRERELSYGKAGFALQFMLNPNLSDAEKYPLKLRDFIVAALEVDKAP 314 Query: 313 TTLIWMPNAANECKGVPVVGLKGDRFHRYESVGQATASYAQKILVIDPSGRGKDETGYAV 372 T W+PN N + VP VGLKGD +HRY+ + ASY KI+ IDPSGRGKDETGY V Sbjct: 315 LTYGWLPNPQNLLQNVPQVGLKGDTYHRYDVADKRQASYTSKIMAIDPSGRGKDETGYCV 374 Query: 373 LYQLNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGDGMYIKLLAPVVT 432 LY LNGYI+LM+ GGFRGGYED+ L+ALA +AK VNE++ EGNFGDGM++K+ +PV+ Sbjct: 375 LYFLNGYIYLMETGGFRGGYEDSTLEALAKVAKRWNVNEVLCEGNFGDGMFLKIFSPVLN 434 Query: 433 ATFPCAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTALNADGTTDTSYSL 492 CA+TE KS GQKE+RI D LEPV+G+H++V+ ES I+KDY+TA N DGT D YS+ Sbjct: 435 RVHRCALTETKSTGQKEMRIADTLEPVMGAHRIVVMESAIQKDYQTARNVDGTHDIKYSM 494 Query: 493 LYQLTRITRERGSLAHDDRLDALAIGVQFFTEALERDSKVGESEMLQEFLESHM-EDALM 551 YQLTR+TRERG+LAHDDRLDA AIGV +F E LE+DS+ G + E+LE + +DAL Sbjct: 495 FYQLTRLTRERGALAHDDRLDAFAIGVAYFVEMLEKDSQAGADDTTAEWLEEMLGKDALQ 554 Query: 552 GHDRLLEMSISEGVSIQYEDD 572 + + + V + 
+EDD Sbjct: 555 ADQSHVHI-LKGDVEVYFEDD 574 >gi|3734|lcl|protein:vir:94555 Length: 585 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919024;genbank:gi:119637788;genbank:GeneI D:5179315 Length = 585 Score = 731 bits (1888), Expect = 0.0, Method: Compositional matrix adjust. Identities = 357/572 (62%), Positives = 436/572 (76%), Gaps = 5/572 (0%) Query: 14 IKRSFVAFLFVLWRALNLPKPTKCQIDMAKKLSAGDERRFILQAFRGIGKSFITCAFVVW 73 +K FVAFLFVLW+ALNLPKPTKCQIDMA+ L+ GD ++FILQAFRGIGKSFITCAFVVW Sbjct: 15 LKGDFVAFLFVLWKALNLPKPTKCQIDMARTLADGDHKKFILQAFRGIGKSFITCAFVVW 74 Query: 74 KLWNNPDLKFMIVSASKERADANSVFIKRIIDLLPFLHELKPGPGQRDSSLAFDVGPAKP 133 LW +P LK +IVSASKERADANS+FIK IIDLLPFL ELKP PGQRDS ++FDVG AKP Sbjct: 75 VLWRDPQLKVLIVSASKERADANSIFIKNIIDLLPFLAELKPRPGQRDSVISFDVGLAKP 134 Query: 134 DHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAILKPGGT- 192 DHSPSVKSVGITGQLTGSRADI+IADDVEVP NS+T +AR+ L LV EF A+LKP T Sbjct: 135 DHSPSVKSVGITGQLTGSRADIIIADDVEVPGNSSTSSAREKLWTLVTEFAALLKPLPTS 194 Query: 193 -IIYLGTPQTEMTLYRELE-GRGYVTTIWPARYPKDQADWDSYGPRLAPMLAAELQADGS 250 +IYLGTPQTEMTLY+ELE +GY T IWPA+YP++ A+ YG RLAPML AE Sbjct: 195 RVIYLGTPQTEMTLYKELEDNKGYSTVIWPAQYPRNDAEALYYGDRLAPMLKAEYDEGFE 254 Query: 251 LFWA-PTDEVRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQD 309 L PTD VRFD DLREREL YGK G+ LQFMLNPNLSD EKYPL+LRD IV + Sbjct: 255 LLRGQPTDPVRFDTDDLRERELEYGKAGYTLQFMLNPNLSDAEKYPLRLRDAIVCAVDPE 314 Query: 310 KGPTTLIWMPNAANECKGVPVVGLKGDRFHRYESVGQATASYAQKILVIDPSGRGKDETG 369 + P + W+PN N + +P VGLKGD H + + TA Y KILVIDPSGRGKDETG Sbjct: 315 RAPLSYQWLPNRQNRNEELPNVGLKGDDIHSFHTCSSRTAEYQSKILVIDPSGRGKDETG 374 Query: 370 YAVLYQLNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGDGMYIKLLAP 429 YAVLY LNGYI+LM+ GGFRGGY+D L+ LA AK KV +V E NFGDGM+ K+ +P Sbjct: 375 YAVLYSLNGYIYLMEVGGFRGGYDDATLEKLAKKAKQWKVQTVVHESNFGDGMFGKIFSP 434 Query: 430 VVTATFPCAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTALNADGTTDTS 489 V+ A+ E+++KG KE+RICD +EP++GSHKL+I++ +I 
+DY+T+ + DG D Sbjct: 435 VLLKHHKAALEEIRAKGMKEMRICDTIEPLMGSHKLIIRDEVIREDYQTSRDLDGKHDVR 494 Query: 490 YSLLYQLTRITRERGSLAHDDRLDALAIGVQFFTEALERDSKVGESEMLQEFLESHMEDA 549 YS YQ+TR+TRERG++AHDDRLDA+A+G+++ E + DSK+GE EM EFLE+HME Sbjct: 495 YSAFYQMTRMTRERGAVAHDDRLDAIALGIEWLREGMLVDSKIGEEEMTLEFLEAHMEKQ 554 Query: 550 LMGHDRLLEMSISEGVSIQYEDDGSMSNYMGW 581 +G D++ + + GV I YED+ S+++ W Sbjct: 555 TIGGDQIHSLDVG-GVDIYYEDEEGGSSFIDW 585 >gi|16146|lcl|protein:vir:10462 Length: 586 # NCBI annotation: DNA maturase B # Family: family:all:697 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848309;genbank:gi:30387500;genbank:GeneID :1733974 Length = 586 Score = 722 bits (1863), Expect = 0.0, Method: Compositional matrix adjust. Identities = 355/573 (61%), Positives = 429/573 (74%), Gaps = 6/573 (1%) Query: 14 IKRSFVAFLFVLWRALNLPKPTKCQIDMAKKLSAGDERRFILQAFRGIGKSFITCAFVVW 73 +K FVAFLFVLW+ALNLP PTKCQIDMAK L+ GD ++FILQAFRGIGKSFITCAFVVW Sbjct: 15 LKGDFVAFLFVLWKALNLPVPTKCQIDMAKVLANGDNKKFILQAFRGIGKSFITCAFVVW 74 Query: 74 KLWNNPDLKFMIVSASKERADANSVFIKRIIDLLPFLHELKPGPGQRDSSLAFDVGPAKP 133 LW +P LK +IVSASKERADANS+FIK IIDLLPFL ELKP PGQRDS ++FDVGPAKP Sbjct: 75 SLWRDPQLKILIVSASKERADANSIFIKNIIDLLPFLAELKPRPGQRDSVISFDVGPAKP 134 Query: 134 DHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAILKP--GG 191 DHSPSVKSVGITGQLTGSRADI+IADDVE+P+NSAT AR+ L LV+EF A+LKP Sbjct: 135 DHSPSVKSVGITGQLTGSRADIIIADDVEIPSNSATMGAREKLWTLVQEFAALLKPLTSS 194 Query: 192 TIIYLGTPQTEMTLYRELE-GRGYVTTIWPARYPKDQADWDSYGPRLAPMLAAELQAD-G 249 +IYLGTPQTEMTLY+ELE RGY T IWPA YP+ + + Y RLAPML AE + Sbjct: 195 RVIYLGTPQTEMTLYKELEDNRGYTTIIWPALYPRTREENLYYSQRLAPMLRAEYDENPE 254 Query: 250 SLFWAPTDEVRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQD 309 +L PTD VRFD DLREREL YGK GF LQFMLNPNLSD EKYPL+LRD IV + Sbjct: 255 ALAGTPTDPVRFDRDDLRERELEYGKAGFTLQFMLNPNLSDAEKYPLRLRDAIVAALDLE 314 Query: 310 KGPTTLIWMPNAANECKGVPVVGLKGDRFHRYESVGQATASYAQKILVIDPSGRGKDETG 369 K P W+PN N + +P VGLKGD H Y + Y QKILVIDPSGRGKDETG Sbjct: 315 
KAPMHYQWLPNRQNIIEDLPNVGLKGDDLHTYHDCSNNSGQYQQKILVIDPSGRGKDETG 374 Query: 370 YAVLYQLNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGDGMYIKLLAP 429 YAVLY LNGYI+LM+AGGFR GY D L+ LA AK V +V E NFGDGM+ K+ +P Sbjct: 375 YAVLYTLNGYIYLMEAGGFRDGYSDKTLELLAKKAKQWGVQTVVYESNFGDGMFGKVFSP 434 Query: 430 VVTATFPCAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTALNADGTTDTS 489 ++ CA+ E++++G KE+RICD LEPV+ +H+LVI++ +I DY++A + DG D Sbjct: 435 ILLKHHNCAMEEIRARGMKEMRICDTLEPVMQTHRLVIRDEVIRADYQSARDVDGKYDVK 494 Query: 490 YSLLYQLTRITRERGSLAHDDRLDALAIGVQFFTEALERDSKVGESEMLQEFLESHMEDA 549 YSL YQ+TRITRE+G+LAHDDRLDALA+G+++ E+++ DS E E+L +FLE HM Sbjct: 495 YSLFYQMTRITREKGALAHDDRLDALALGIEYLRESMQLDSVKVEGEVLADFLEEHMMRP 554 Query: 550 LMGHDRLLEMSISEGVSIQYEDD-GSMSNYMGW 581 + ++EMS+ GV + EDD G ++++ W Sbjct: 555 TVSATHIIEMSVG-GVDVYSEDDEGYGTSFIEW 586 >gi|16205|lcl|protein:vir:1554 Length: 587 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052122;swissprot:trembl:q9t0z4;genbank:gi :9634048;uniprot:Q9T0Z4;genbank:GeneID:1262409 Length = 587 Score = 720 bits (1858), Expect = 0.0, Method: Compositional matrix adjust. 
Identities = 361/574 (62%), Positives = 434/574 (75%), Gaps = 8/574 (1%) Query: 14 IKRSFVAFLFVLWRALNLPKPTKCQIDMAKKLSAGDERRFILQAFRGIGKSFITCAFVVW 73 +K FVAFLFVLW+AL LP PTKCQIDMA+ L+ GD ++FILQAFRGIGKSFITCAFVVW Sbjct: 16 LKGDFVAFLFVLWKALALPPPTKCQIDMARCLANGDNKKFILQAFRGIGKSFITCAFVVW 75 Query: 74 KLWNNPDLKFMIVSASKERADANSVFIKRIIDLLPFLHELKPGPGQRDSSLAFDVGPAKP 133 LW +P LK +IVSASKERADANS+FIK IIDLLPFL ELKP PGQRDS ++FDVGPAKP Sbjct: 76 TLWRDPQLKILIVSASKERADANSIFIKNIIDLLPFLAELKPRPGQRDSVISFDVGPAKP 135 Query: 134 DHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAILKPGGT- 192 DHSPSVKSVGITGQLTGSRADI+IADDVE+P+NSATQ AR+ L LV+EF A+LKP T Sbjct: 136 DHSPSVKSVGITGQLTGSRADIIIADDVEIPSNSATQGAREKLWTLVQEFAALLKPLPTS 195 Query: 193 -IIYLGTPQTEMTLYRELE-GRGYVTTIWPARYPKDQADWDSYGPRLAPMLAAELQADG- 249 +IYLGTPQTEMTLY+ELE RGY T IWPA YP+ + + YG RLAPML E DG Sbjct: 196 RVIYLGTPQTEMTLYKELEDNRGYTTIIWPALYPRSREEDLYYGDRLAPMLREEFN-DGF 254 Query: 250 -SLFWAPTDEVRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQ 308 L PTD VRFD +DLREREL YGK GF LQFMLNPNLSD EKYPL+LRD IV Sbjct: 255 EMLQGQPTDPVRFDMEDLRERELEYGKAGFTLQFMLNPNLSDAEKYPLRLRDAIVCGLDF 314 Query: 309 DKGPTTLIWMPNAANECKGVPVVGLKGDRFHRYESVGQATASYAQKILVIDPSGRGKDET 368 +K P W+PN N + +P VGLKGD H Y S Q T Y Q+ILVIDPSGRGKDET Sbjct: 315 EKAPMHYQWLPNRQNRNEELPNVGLKGDDIHSYHSCSQNTGQYQQRILVIDPSGRGKDET 374 Query: 369 GYAVLYQLNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGDGMYIKLLA 428 GYAVL+ LNGYI+LM+AGGFR GY D L++LA AK KV +V E NFGDGM+ K+ + Sbjct: 375 GYAVLFTLNGYIYLMEAGGFRDGYSDKTLESLAKKAKQWKVQTVVFESNFGDGMFGKVFS 434 Query: 429 PVVTATFPCAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTALNADGTTDT 488 PV+ A+ E++++G KELRICD LEPVL +H+LVI++ +I +DY+TA +ADG D Sbjct: 435 PVLLKHHAAAMEEIRARGMKELRICDTLEPVLSTHRLVIRDEVIREDYQTARDADGKHDV 494 Query: 489 SYSLLYQLTRITRERGSLAHDDRLDALAIGVQFFTEALERDSKVGESEMLQEFLESHMED 548 YSL YQLTR+ RE+G++AHDDRLDALA+GV+F +E D+ E+E+L+ FLE HME Sbjct: 495 RYSLFYQLTRMAREKGAVAHDDRLDALALGVEFLRSTMELDAVKVEAEVLEAFLEEHMEH 554 Query: 549 ALMGHDRLLEMSISEGVSIQYEDDGSMSN-YMGW 581 + ++ S+ +G+ + 
+EDD SN ++ W Sbjct: 555 PIHSAGHVV-TSMVDGMELYWEDDDVNSNRFIDW 587 >gi|13826|lcl|protein:vir:3376 Length: 586 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523347;swissprot:trembl:q8w5t4;genbank:gi :17570838;uniprot:Q8W5T4;genbank:GeneID:927460 Length = 586 Score = 719 bits (1857), Expect = 0.0, Method: Compositional matrix adjust. Identities = 359/564 (63%), Positives = 430/564 (76%), Gaps = 7/564 (1%) Query: 14 IKRSFVAFLFVLWRALNLPKPTKCQIDMAKKLSAGDERRFILQAFRGIGKSFITCAFVVW 73 +K FVAFLFVLW+ALNLP PTKCQIDMAK L+ GD ++FILQAFRGIGKSFITCAFVVW Sbjct: 15 LKGDFVAFLFVLWKALNLPVPTKCQIDMAKVLANGDNKKFILQAFRGIGKSFITCAFVVW 74 Query: 74 KLWNNPDLKFMIVSASKERADANSVFIKRIIDLLPFLHELKPGPGQRDSSLAFDVGPAKP 133 LW +P LK +IVSASKERADANS+FIK IIDLLPFL ELKP PGQRDS ++FDVGPAKP Sbjct: 75 TLWRDPQLKILIVSASKERADANSIFIKNIIDLLPFLAELKPRPGQRDSVISFDVGPAKP 134 Query: 134 DHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAILKPGGT- 192 DHSPSVKSVGITGQLTGSRADI+IADDVE+P+NSATQ AR+ L LV+EF A+LKP T Sbjct: 135 DHSPSVKSVGITGQLTGSRADIIIADDVEIPSNSATQGAREKLWTLVQEFAALLKPLPTS 194 Query: 193 -IIYLGTPQTEMTLYRELE-GRGYVTTIWPARYPKDQADWDSYGPRLAPMLAAELQADG- 249 +IYLGTPQTEMTLY+ELE RGY T IWPA YP+ + + YG RLAPML E DG Sbjct: 195 RVIYLGTPQTEMTLYKELEDNRGYTTIIWPALYPRSREEDLYYGERLAPMLREEFN-DGF 253 Query: 250 -SLFWAPTDEVRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQ 308 L PTD VRFD +DLREREL YGK GF LQFMLNPNLSD EKYPL+LRD IV Sbjct: 254 EMLQGQPTDPVRFDMEDLRERELEYGKAGFTLQFMLNPNLSDAEKYPLRLRDAIVCGLDF 313 Query: 309 DKGPTTLIWMPNAANECKGVPVVGLKGDRFHRYESVGQATASYAQKILVIDPSGRGKDET 368 +K P W+PN N + +P VGLKGD H Y S Q T Y Q+ILVIDPSGRGKDET Sbjct: 314 EKAPMHYQWLPNRQNRNEELPNVGLKGDDIHSYHSCSQNTGQYQQRILVIDPSGRGKDET 373 Query: 369 GYAVLYQLNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGDGMYIKLLA 428 GYAVL+ LNGYI+LM+AGGFR GY D L++LA AK KV +V E NFGDGM+ K+ + Sbjct: 374 GYAVLFTLNGYIYLMEAGGFRDGYSDKTLESLAKKAKQWKVQTVVFESNFGDGMFGKVFS 433 Query: 429 
PVVTATFPCAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTALNADGTTDT 488 PV+ A+ E++++G KELRICD LEPVL +H+LVI++ +I +DY+TA +ADG D Sbjct: 434 PVLLKHHAAALEEIRARGMKELRICDTLEPVLSTHRLVIRDEVIREDYQTARDADGKHDV 493 Query: 489 SYSLLYQLTRITRERGSLAHDDRLDALAIGVQFFTEALERDSKVGESEMLQEFLESHMED 548 YSL YQLTR+ RE+G++AHDDRLDALA+GV+F +E D+ E+E+L+ FLE HME Sbjct: 494 RYSLFYQLTRMAREKGAVAHDDRLDALALGVEFLRSTMELDAVKVEAEVLEAFLEEHMEH 553 Query: 549 ALMGHDRLLEMSISEGVSIQYEDD 572 + ++ ++ +G+ + +EDD Sbjct: 554 PIHSAGHVV-TAMVDGMELYWEDD 576 >gi|18462|lcl|protein:vir:2213 Length: 586 # NCBI annotation: DNA maturation protein # Family: family:all:697 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_042010;swissprot:sw:p03694;genbank:gi:962 7482;goa:P03694;uniprot:P03694;genbank:GeneID:1261062 Length = 586 Score = 719 bits (1857), Expect = 0.0, Method: Compositional matrix adjust. Identities = 354/573 (61%), Positives = 428/573 (74%), Gaps = 6/573 (1%) Query: 14 IKRSFVAFLFVLWRALNLPKPTKCQIDMAKKLSAGDERRFILQAFRGIGKSFITCAFVVW 73 +K FVAFLFVLW+ALNLP PTKCQIDMAK L+ GD ++FILQAFRGIGKSFITCAFVVW Sbjct: 15 LKGDFVAFLFVLWKALNLPVPTKCQIDMAKVLANGDNKKFILQAFRGIGKSFITCAFVVW 74 Query: 74 KLWNNPDLKFMIVSASKERADANSVFIKRIIDLLPFLHELKPGPGQRDSSLAFDVGPAKP 133 LW +P LK +IVSASKERADANS+FIK IIDLLPFL ELKP PGQRDS ++FDVGPA P Sbjct: 75 SLWRDPQLKILIVSASKERADANSIFIKNIIDLLPFLSELKPRPGQRDSVISFDVGPANP 134 Query: 134 DHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAILKP--GG 191 DHSPSVKSVGITGQLTGSRADI+IADDVE+P+NSAT AR+ L LV+EF A+LKP Sbjct: 135 DHSPSVKSVGITGQLTGSRADIIIADDVEIPSNSATMGAREKLWTLVQEFAALLKPLPSS 194 Query: 192 TIIYLGTPQTEMTLYRELE-GRGYVTTIWPARYPKDQADWDSYGPRLAPMLAAELQAD-G 249 +IYLGTPQTEMTLY+ELE RGY T IWPA YP+ + + Y RLAPML AE + Sbjct: 195 RVIYLGTPQTEMTLYKELEDNRGYTTIIWPALYPRTREENLYYSQRLAPMLRAEYDENPE 254 Query: 250 SLFWAPTDEVRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQD 309 +L PTD VRFD DLREREL YGK GF LQFMLNPNLSD EKYPL+LRD IV + Sbjct: 255 ALAGTPTDPVRFDRDDLRERELEYGKAGFTLQFMLNPNLSDAEKYPLRLRDAIVAALDLE 314 Query: 310 
KGPTTLIWMPNAANECKGVPVVGLKGDRFHRYESVGQATASYAQKILVIDPSGRGKDETG 369 K P W+PN N + +P VGLKGD H Y + Y QKILVIDPSGRGKDETG Sbjct: 315 KAPMHYQWLPNRQNIIEDLPNVGLKGDDLHTYHDCSNNSGQYQQKILVIDPSGRGKDETG 374 Query: 370 YAVLYQLNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGDGMYIKLLAP 429 YAVLY LNGYI+LM+AGGFR GY D L+ LA AK V +V E NFGDGM+ K+ +P Sbjct: 375 YAVLYTLNGYIYLMEAGGFRDGYSDKTLELLAKKAKQWGVQTVVYESNFGDGMFGKVFSP 434 Query: 430 VVTATFPCAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTALNADGTTDTS 489 ++ CA+ E++++G KE+RICD LEPV+ +H+LVI++ +I DY++A + DG D Sbjct: 435 ILLKHHNCAMEEIRARGMKEMRICDTLEPVMQTHRLVIRDEVIRADYQSARDVDGKHDVK 494 Query: 490 YSLLYQLTRITRERGSLAHDDRLDALAIGVQFFTEALERDSKVGESEMLQEFLESHMEDA 549 YSL YQ+TRITRE+G+LAHDDRLDALA+G+++ E+++ DS E E+L +FLE HM Sbjct: 495 YSLFYQMTRITREKGALAHDDRLDALALGIEYLRESMQLDSVKVEGEVLADFLEEHMMRP 554 Query: 550 LMGHDRLLEMSISEGVSIQYEDD-GSMSNYMGW 581 + ++EMS+ GV + EDD G ++++ W Sbjct: 555 TVAATHIIEMSVG-GVDVYSEDDEGYGTSFIEW 586 >gi|11501|lcl|protein:vir:78718 Length: 574 # NCBI annotation: DNA packaging protein # Family: family:all:697 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285469;genbank:gi:148724503;genbank:Ge neID:5220189 Length = 574 Score = 440 bits (1132), Expect = e-125, Method: Compositional matrix adjust. 
Identities = 238/534 (44%), Positives = 335/534 (62%), Gaps = 16/534 (2%) Query: 18 FVAFLFVLWRALNLPKPTKCQIDMAKKLSAGDERRFILQAFRGIGKSFITCAFVVWKLWN 77 F FL ++WR L+LPKPT+ Q+ +A L G +R + AFRG+GKS+IT AFV+W L+ Sbjct: 14 FKFFLSLVWRELDLPKPTRAQLAIADYLQHG-PKRLQISAFRGVGKSWITAAFVLWVLFV 72 Query: 78 NPDLKFMIVSASKERADANSVFIKRIIDLLPFLHELKP-GPGQRDSSLAFDVGPAKPDHS 136 +PD K M++SASKERAD S+F +++I + +L L+P QR S ++FDVGPAKP + Sbjct: 73 DPDRKIMVISASKERADNFSIFCQKLILDIEWLSHLRPRDSDQRWSRISFDVGPAKPHQA 132 Query: 137 PSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAILKP--GGTII 194 PSVKSVGITGQ+TGSRA +++ DDVEVP NSAT R+ L +LV E ++IL P I+ Sbjct: 133 PSVKSVGITGQMTGSRAHLMVFDDVEVPANSATDMQREKLLQLVSESESILVPDDDARIM 192 Query: 195 YLGTPQTEMTLYRELEGRGYVTTIWPARYPKDQADWDSYGPRLAPMLAAELQADGSLFWA 254 +LGTPQ+ T+YR+L R Y +WPARYP+D + ++ LAP L A+L+ D L W Sbjct: 193 FLGTPQSTFTIYRKLAERSYRPFVWPARYPRDLSKYEGL---LAPQLVADLEKDPELTWK 249 Query: 255 PTDEVRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQDKGPTT 314 PTD RF++ +L ERE + G+ F LQFML+ +LSD EK+PLK +D IV + Sbjct: 250 PTD-TRFNELNLMERESAMGRSNFMLQFMLDTSLSDAEKFPLKFQDLIVTPLGAECA-EA 307 Query: 315 LIWMPNAANECKGVPVVGLKGDRFHRYESVGQATASYAQKILVIDPSGRGKDETGYAVLY 374 W + K + VGL GDRF+ + + Y++ I+ +DPSGRG DET VL Sbjct: 308 YAWSADPRYMRKELNPVGLPGDRFYGPMYIDEGIVPYSETIVSVDPSGRGTDETVAVVLS 367 Query: 375 QLNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGDGMYIKLLAPVVTAT 434 Q NGYIF+ D FR GY D L + + K +K ++++VE NFGDGM +L ++ Sbjct: 368 QANGYIFVRDMKAFRDGYSDETLSDIVRLGKRYKASKLLVESNFGDGMITELFKRHISQM 427 Query: 435 FPCAIT-EVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTALNADGTTDTSYSLL 493 T EV++ +KE RI + LEPV+ HKL+I + E DY + +A Y L Sbjct: 428 GGGMDTEEVRASARKEERIIETLEPVMNQHKLIIDPKVWEYDYSSNPDAAPEKRLEYMLG 487 Query: 494 YQLTRITRERGSLAHDDRLDALAIGVQFFTEALERDSKVGESEMLQEFLESHME 547 YQ++R+ RE+G++ HDDR+DAL+ GVQ++ +A V +S Q+ L H E Sbjct: 488 YQMSRMCREKGAVKHDDRVDALSQGVQYYVDA------VAQSAFKQQALRKHEE 535 >gi|7204|lcl|protein:vir:100065 Length: 577 # NCBI annotation: DNA maturase beta subunit # Family: family:all:697 # MgeID: 
mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214228;genbank:gi:61806451;genbank:GeneID :3294745 Length = 577 Score = 398 bits (1023), Expect = e-113, Method: Compositional matrix adjust. Identities = 231/551 (41%), Positives = 325/551 (58%), Gaps = 20/551 (3%) Query: 9 DDLELIKRSFVAFLFVLWRALNLPKPTKCQIDMAKKLSAGDERRFILQAFRGIGKSFITC 68 D L+ ++ F FL LW L+LP PT+ Q +A L +G +R +QAFRG+GKS+IT Sbjct: 3 DTLKALQGDFKLFLQALWDQLDLPSPTRAQYAIADYLQSG-PKRLQIQAFRGVGKSWITG 61 Query: 69 AFVVWKLWNNPDLKFMIVSASKERADANSVFIKRIIDLLPFLHELKPGPGQ-RDSSLAFD 127 AFV+W L+N+ + K MI+SASKERAD S+F++++I P+L L+P R S ++FD Sbjct: 62 AFVLWTLFNDAEKKIMIISASKERADNMSIFLQKLIIETPWLKHLRPKSDDARWSRISFD 121 Query: 128 VGPAKPDHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAIL 187 V P +PSVKSVGITGQLTGSRAD++I DD+EVP NS T+ R+ L +L E ++IL Sbjct: 122 VL-CSPHQAPSVKSVGITGQLTGSRADLMILDDIEVPGNSMTELMREKLLQLCTEAESIL 180 Query: 188 KP--GGTIIYLGTPQTEMTLYRELEGRGYVTTIWPARYPKDQADWDSYGPRLAPMLAAEL 245 P I+YLGTPQT T+YR+L R Y +WPARYPKD Y +AP L ++ Sbjct: 181 TPKDDSRIMYLGTPQTTFTVYRKLAERAYRPFVWPARYPKDIT---PYEGLIAPQLQEDI 237 Query: 246 QADGSLFWAPTDEVRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGT 305 +G+ TD RFDD DL++RE + G+ F LQFML+ LSD EK+PLK+ D ++ + Sbjct: 238 D-NGAESGTVTDPDRFDDDDLQQRESAMGRSNFMLQFMLDTTLSDAEKFPLKMADLVITS 296 Query: 306 FAQDKGPTTLIWMPNAANECKGVPVVGLKGDRFHRYESVGQATASYAQKILVIDPSGRGK 365 + P +IW + N K P VGL GD F+ + Y + I +DPSGRG Sbjct: 297 VNPTEAPDNVIWCSDPQNIIKDAPTVGLPGDYFYSPMQLQGEWTPYQETICSVDPSGRGT 356 Query: 366 DETGYAVLYQLNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGDG---- 421 DET L Q NG+++L + +R GY D L + K + +VVE NFGDG Sbjct: 357 DETAACYLSQKNGFLYLHEMRAYRDGYSDATLLDILKGCKKYNATTLVVETNFGDGIVSE 416 Query: 422 MYIKLLAPVVTATFPCAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTALN 481 ++ K L A F + EV++ +KE RI D LEPVL H+L++ +I+ DY + + Sbjct: 417 LFKKHLQQTKQAIF---VDEVRANVRKEDRIIDSLEPVLNQHRLIVDRGVIDWDYSSNKD 473 Query: 482 ADGTTDTSYSLLYQLTRITRERGSLAHDDRLDALAIGVQFFTEALERDSKVGESEMLQEF 541 + Y L YQ++R+ R + ++ HDDRLD LA GV++FT++L + E + 
Sbjct: 474 CPPESRLLYMLFYQMSRMCRMKFAVKHDDRLDCLAQGVKYFTDSLS----ISAQEQINLR 529 Query: 542 LESHMEDALMG 552 ED L G Sbjct: 530 KREEWEDILQG 540 >gi|7495|lcl|protein:vir:103312 Length: 624 # NCBI annotation: TerL large terminase subunit-like protein # Family: family:all:697 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039677;genbank:gi:126000006;genbank:Ge neID:4818388 Length = 624 Score = 301 bits (771), Expect = 1e-83, Method: Compositional matrix adjust. Identities = 186/520 (35%), Positives = 274/520 (52%), Gaps = 19/520 (3%) Query: 32 PKPTKCQIDMAKKLSAGDERRFILQAFRGIGKSFITCAFVVWKLWNNPDLKFMIVSASKE 91 P + Q D+ + + G + R + +A RG K+ I + V+ + + P + +I S + + Sbjct: 48 PHLNRIQADILRFMFTGKKYRMV-EAQRGQAKTTIAAIYAVFCIIHRPHFRILISSQTSK 106 Query: 92 RADANSVFIKRIIDLLPFLHELKPG--PGQRDSSLAFDVGPA--KPDHSPSVKSVGITGQ 147 RA+ + ++ +I L L + P G + S F++ SPSV I G Sbjct: 107 RAEEIAGWVIKIFRGLDILEFMMPDIYSGDKASIRGFEIHYTLRGSGASPSVACYSIEGS 166 Query: 148 LTGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAILKPGGTIIYLGTPQTEMTLYR 207 + G+RAD++IADDVE NSAT R L E KEF++I G I+YLGTPQ+ ++Y Sbjct: 167 MQGARADLIIADDVESLQNSATAAGRVKLEEATKEFESI-NQTGDILYLGTPQSINSIYN 225 Query: 208 ELEGRGYVTTIWPARYPKDQADWDSYGPRLAPMLAAELQADGSLFWA---------PTDE 258 L RGY IWP RYP + SYG LAP++ +++A+ L PT Sbjct: 226 NLPSRGYQLRIWPGRYPTVEQQV-SYGDFLAPLIIEDMEANPELRRGGGITRLQGQPTCP 284 Query: 259 VRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQDKGPTTLIWM 318 ++D+ L E+E+S G F LQFMLN LSD E++PLKL + G F DK P + Sbjct: 285 EMYNDEALIEKEISQGTAKFQLQFMLNTRLSDSERFPLKLSSIMFGNFGVDKVPEMPLHS 344 Query: 319 PNAANECKGVPVVGLKG-DRFHRYESVGQATASYAQKILVIDPSGRGK--DETGYAVLYQ 375 ++ NE K G K DRF+R ++I+ IDP+G G+ DETG A+++ Sbjct: 345 TDSINEIKEAQRPGNKSTDRFYRMAPRPYEWKPATRRIMYIDPAGGGQNGDETGVAIVFL 404 Query: 376 LNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGDGMYIKLLAPVVTATF 435 L YI++ G +GGYED L+ + AK E+ VE NFG G + ++ P Sbjct: 405 LGTYIYVYKCFGVKGGYEDADLEQIVMAAKEANCKEVFVEKNFGHGAFQAIIKPFFERLH 464 Query: 436 PCAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTALNADGTTDTSYSLLYQ 
495 PC + E + GQKE RI D LEP+L +H+LV +I +D + SYSL +Q Sbjct: 465 PCELQEDYATGQKEERIIDTLEPLLSAHRLVFNTEIIFEDNKAIQKYALEKQASYSLFHQ 524 Query: 496 LTRITRERGSLAHDDRLDALAIGVQFFTEALERDSKVGES 535 + ITR++GSL HDDR+DAL V+ T ++ D +S Sbjct: 525 IANITRDKGSLRHDDRIDALYGAVRQLTTDIDYDEMAKQS 564 >gi|8927|lcl|protein:vir:97005 Length: 632 # NCBI annotation: 40 # Family: family:all:697 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654141;genbank:gi:108862025;genbank:GeneI D:5075954 Length = 632 Score = 294 bits (753), Expect = 2e-81, Method: Compositional matrix adjust. Identities = 179/515 (34%), Positives = 276/515 (53%), Gaps = 21/515 (4%) Query: 32 PKPTKCQIDMAKKLSAGDERRFILQAFRGIGKSFITCAFVVWKLWNNPDLKFMIVSASKE 91 P + Q D+ K L G + R I +A RGI K+ ++ + V+++ + P + M+VS + + Sbjct: 48 PHLIRMQADILKFLFYGHKYRLI-EAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAK 106 Query: 92 RADANSVFIKRIIDLLPFLHELKPG--PGQRDSSLAFDVGPA--KPDHSPSVKSVGITGQ 147 RA+ + ++ +I L FL + P G R S AF++ D SPSV I Sbjct: 107 RAEEIAGWVVKIFRGLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAG 166 Query: 148 LTGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAILKPGGTIIYLGTPQTEMTLYR 207 + G+RADI++ADDVE N+ T R L EL KEF++I G IIYLGTPQ ++Y Sbjct: 167 MQGARADIILADDVESMQNARTAAGRALLEELTKEFESI-NQFGDIIYLGTPQNVNSIYN 225 Query: 208 ELEGRGYVTTIWPARYPKDQADWDSYGPRLAPMLAAELQADGSL---------FWAPTDE 258 L RGY IW ARYP + + YG LAPM+ +++ + +L AP Sbjct: 226 NLPARGYSVRIWTARYPSVEQE-QCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAPCAP 284 Query: 259 VRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQDKGPTTLIWM 318 +DD+ L E+E+S G F LQFMLN + D ++YPL+L + I +F ++ P W Sbjct: 285 EMYDDEVLIEKEISQGAAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWS 344 Query: 319 PNAANECKGVPVVGLKGDRFHRYESVGQAT--ASYAQKILVIDPSGRGK--DETGYAVLY 374 ++ N P G K F Y V + + ++KI+ IDP+G GK DETG A+++ Sbjct: 345 NDSINIIGDAPKYGNKPTDFM-YRPVARPYEWGAVSRKIMYIDPAGGGKNGDETGVAIVF 403 Query: 375 QLNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGDGMYIKLLAPVVTAT 434 +I++ G GGY ++ L + AK V E+ +E NFG G + ++ P Sbjct: 404 
LHGTFIYVYQCFGVPGGYRESSLNRIVQAAKQAGVKEVFIEKNFGHGAFEAVIKPYFERE 463 Query: 435 FPCAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTALNADGTTDTSYSLLY 494 +P + E + GQKELRI + LEP++ +H+L+ +++ D+ + + SYSL Sbjct: 464 WPVTLEEDYATGQKELRIIETLEPLMAAHRLIFNAEMVKSDFESVQHYPLELRMSYSLFN 523 Query: 495 QLTRITRERGSLAHDDRLDALAIGVQFFTEALERD 529 Q++ IT E+ SL HDDRLDAL ++ T ++ D Sbjct: 524 QMSNITIEKNSLRHDDRLDALYGAIRQLTSQIDYD 558 >gi|15201|lcl|protein:vir:7028 Length: 631 # NCBI annotation: large terminase subunit # Family: family:all:697 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853601;genbank:gi:31711683;genbank:GeneID :1481809 Length = 631 Score = 291 bits (746), Expect = 1e-80, Method: Compositional matrix adjust. Identities = 191/568 (33%), Positives = 289/568 (50%), Gaps = 24/568 (4%) Query: 32 PKPTKCQIDMAKKLSAGDERRFILQAFRGIGKSFITCAFVVWKLWNNPDLKFMIVSASKE 91 P + Q D+ K L G++ R + +A RG K+ I + V+++ + P + MIVS + + Sbjct: 48 PDLNRVQADILKFLFGGNKYRMV-EAQRGQAKTTIAAIYAVFRIIHEPHKRIMIVSQTAK 106 Query: 92 RADANSVFIKRIIDLLPFLHELKPG--PGQRDSSLAFDVGPA--KPDHSPSVKSVGITGQ 147 RA+ + ++ +I L FL + P G + S F++ D SPSV I Sbjct: 107 RAEEIAGWVIKIFRGLDFLEFMLPDIYAGDKASIKGFEIHYTLRGSDKSPSVACYSIEAG 166 Query: 148 LTGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAILKPGGTIIYLGTPQTEMTLYR 207 + G+RADI++ADDVE NS T R L +L KEF++I G IIYLGTPQ+ ++Y Sbjct: 167 MQGARADIILADDVESLQNSRTAAGRALLEDLTKEFESI-NQFGDIIYLGTPQSVNSIYN 225 Query: 208 ELEGRGYVTTIWPARYPKDQADWDSYGPRLAPMLAAELQADGSLF---------WAPTDE 258 L RGY IWP RYP + + YG LAPM+ ++ D SL APT Sbjct: 226 NLPARGYQIRIWPGRYPTLEQE-ACYGDFLAPMIRQDMIDDPSLRSGYGIDGTQGAPTCP 284 Query: 259 VRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQDKGPTTLIWM 318 +DD+ L E+E+S G F LQFMLN L D ++YPL+L I+ +F D P W Sbjct: 285 EMYDDEKLIEKEISQGTAKFQLQFMLNTRLMDADRYPLRLNQLILMSFGTDVVPEMPTWS 344 Query: 319 PNAANECKGVPVVGLK-GDRFHRYESVGQATASYAQKILVIDPSGRGK--DETGYAVLYQ 375 ++ N P G K D +R ++++ IDP+G GK DETG A+++ Sbjct: 345 NDSVNLISDAPRFGNKPTDYLYRPVPRPYEWRPIQRRLMYIDPAGGGKNGDETGVAIVFL 404 Query: 376 
LNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGDGMYIKLLAPVVTATF 435 L +I++ G GGY ++ L + AK +V E+ +E NFG G + ++ P + Sbjct: 405 LGTFIYVYKVFGVPGGYSESALSRIVREAKQAEVKEVFIEKNFGHGAFEAVIKPYFEREW 464 Query: 436 PCAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTALNADGTTDTSYSLLYQ 495 P + E + GQKE RI + LEP++ +H+++ +I++D + + SYSL Q Sbjct: 465 PAELKEDYATGQKEARIIETLEPLMSAHRIIFNAEMIKQDIDSVQHYPLEVRMSYSLFAQ 524 Query: 496 LTRITRERGSLAHDDRLDALAIGVQFFTEALERDSKVGESEMLQEFLESHMEDAL-MGHD 554 ++ IT E+G L HDDRLDAL ++ T ++ D E+ + M + L M D Sbjct: 525 MSNITLEKGCLRHDDRLDALYGAIRQLTSQIDYD----EANRINRLRAKEMREYLEMMTD 580 Query: 555 RLLEMSISEGVSIQYEDDGSMSNYMGWR 582 L G Y ++SN M R Sbjct: 581 PLRRREFFTGQDHGYRKSTNVSNAMQSR 608 >gi|12767|lcl|protein:vir:80218 Length: 595 # NCBI annotation: putative DNA maturase B # Family: family:all:697 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522892;genbank:gi:158345185;genbank:Ge neID:5687481 Length = 595 Score = 273 bits (699), Expect = 3e-75, Method: Compositional matrix adjust. 
Identities = 186/542 (34%), Positives = 269/542 (49%), Gaps = 35/542 (6%) Query: 35 TKCQIDMAKKLSAGDERRFILQAFRGIGKSFITCAFVVWKLWNNPDLKFMIVSASKERAD 94 T Q D+A+ + G +R + A RG KS I C F +W L +P + ++VS ++++A+ Sbjct: 36 TWMQEDIAEFMQYGPQRSMV-AAQRGEAKSTIACLFGLWNLVQDPTHRVVLVSGAQDKAE 94 Query: 95 ANSVFIKRIIDLLPFLHELKPG--PGQRDSSLAFDVGPAKP--DHSPSVKSVGITGQLTG 150 N + +I P L L P G R S L FDV + D S SV +GIT L G Sbjct: 95 ENGKLMHGLIHNWPLLQYLAPDKYAGDRTSVLEFDVHWSLKGVDKSASVNCLGITSSLQG 154 Query: 151 SRADILIADDVEVPNNSATQTARDHLGELVKEFDAILKP-GGTIIYLGTPQTEMTLYREL 209 R D+LI DD+E N T T R L L KEF +I+ G I+YLGTPQT ++Y L Sbjct: 155 YRGDLLIPDDIETTKNGLTATERAKLITLSKEFTSIVADRNGRILYLGTPQTRESIYNTL 214 Query: 210 EGRGYVTTIWPARYPKDQADWDSYGPRLAP-------MLAAELQA----DGSLFWAPTDE 258 GRG+ +WP R+PK ++ YG LAP +L Q DG+ W+ TD Sbjct: 215 PGRGFTVRVWPGRFPK-ASELPKYGDALAPSILERMALLGDRCQTGRGLDGTRGWS-TDP 272 Query: 259 VRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQDKGPTTLIWM 318 R+ +++L ++EL G F LQFMLN +LSD + LKLRD IV F+ ++ P ++ W Sbjct: 273 ERYSEEELCDKELDQGPETFELQFMLNTSLSDAARQQLKLRDLIVADFSHEQVPESVFWA 332 Query: 319 PNAANECKGVPVVGLKGDRFHRYESVGQATASYAQKILVIDPSGRGKDETGYAVLYQLNG 378 + + ++ R SV + A L +DP+G G DE +A+ + Sbjct: 333 ADPRFKIDLPQEFPVQSVEMFRPASVHEHFAQIKSMTLFLDPAGNGGDELAFAIGGAVGP 392 Query: 379 YIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGDGMYIKLLAPVVTATFP-- 436 YI ++ GGF+GG + L L + K V ++VE N G G +L+ P Sbjct: 393 YIHVVAWGGFKGGVSEDNLDKLVQLCKDFGVKVVLVEKNMGAGTVTQLVRNHFLGIGPDG 452 Query: 437 ------CAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTALNADGTTDTSY 490 + E GQKELRI + + PV+ H+LV+ S +E D Sbjct: 453 KQRLAGVGVDERHKTGQKELRIINTIRPVMQKHRLVLHRSAVEMDLELLKQYPLQHRDVR 512 Query: 491 SLLYQLTRITRERGSLAHDDRLDALA------IGVQFFTEALERDSKVGESEMLQEFLES 544 S LYQ+ IT +RGSL DDRLDAL +G E E+ + ++ ++QEFL + Sbjct: 513 SGLYQMHNITSDRGSLTKDDRLDALEGLVAELMGFLVIDEVKEQQRR--DAAVVQEFLRN 570 Query: 545 HM 546 M Sbjct: 571 PM 572 >gi|11728|lcl|protein:vir:78939 Length: 601 # NCBI annotation: putative DNA maturase B # Family: family:all:697 # MgeID: 
mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522835;genbank:gi:158345070;genbank:Ge neID:5687429 Length = 601 Score = 252 bits (643), Expect = 9e-69, Method: Compositional matrix adjust. Identities = 177/534 (33%), Positives = 262/534 (49%), Gaps = 50/534 (9%) Query: 33 KPTKCQIDMAKKLSAGDERRFILQAFRGIGKSFITCAFVVWKLWNNPDLKFMIVSASKER 92 K T Q+D+A + + + A RG KS I C +VVW + +P + M+VS S ++ Sbjct: 34 KMTWMQLDIADFMQDSPNKAMV-AAQRGEAKSTIACIYVVWCIVRDPRTRAMLVSGSGDK 92 Query: 93 ADANSVFIKRIIDLLPFLHELKPGP--GQRDSSLAFDVGPAKP--DHSPSVKSVGITGQL 148 A+ N I ++I L L+P G R S+ +FDV A + S S+ +GIT L Sbjct: 93 AEENGQLITKLIMHWDLLAYLRPEARMGDRTSATSFDVNWALKGVEKSASINCIGITAAL 152 Query: 149 TGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAILKPGGTIIYLGTPQTEMTLYRE 208 G RADILI DD+E N T T R L +EF +I G I+YLGTPQ+ ++Y Sbjct: 153 QGYRADILIPDDIETTKNGLTATERAKLTRQSQEFTSICT-HGKILYLGTPQSRESIYNG 211 Query: 209 LEGRGYVTTIWPARYPKDQADWDSYGPRLAPMLAAELQA--------------DGSLFWA 254 L RG++ IWP R+P + + YG LAP + + DG+ WA Sbjct: 212 LPARGFLMRIWPGRFPT-LDEQERYGDWLAPSILERIARLEERGHNPRTGKGLDGTRGWA 270 Query: 255 PTDEVRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQDKGPTT 314 D R++++DL ++EL G GF LQ+ML+ +L+D ++ LKLRD + + P Sbjct: 271 -ADPQRYNEEDLIDKELDQGAEGFQLQYMLDTSLADEQRMQLKLRDLLFIDATHESVPEQ 329 Query: 315 LIWMPNAANECKGVPVVGLKGDRFHRYESV----------GQATASYAQKILVIDPSGRG 364 + W AA+E LK D HR+ + A Q + +DP+G G Sbjct: 330 VAW---AADE-----RFKLKFDA-HRFPIIKPELYLPALMAGGWAPLQQMTMFVDPAGDG 380 Query: 365 KDETGYAVLYQLNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGDGMYI 424 DE YAV L YI ++ GG++GG+ + L+ +A + V I VE N G G Sbjct: 381 GDELSYAVGGTLGPYIHVVSIGGWKGGFAEENLEKCIALAARYGVKVIYVEKNLGAGAVG 440 Query: 425 KLLAPVVTATFP---------CAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIEKD 475 +L + + P I + + GQKE RI D L P++ H+L+ S ++ D Sbjct: 441 QLFRNYMRSINPDTGKPRYEGIGIEDRQKSGQKERRIIDTLRPIMQRHRLIFHVSAMDSD 500 Query: 476 YRTALNADGTTDTSYSLLYQLTRITRERGSLAHDDRLDALAIGVQFFTEALERD 529 Y T S+ +Q+ IT +RGSL DDR+DAL V+ T +L +D Sbjct: 501 
YVACQQYPADKRTERSVFHQIHNITTDRGSLPKDDRIDALEGLVRELTPSLVKD 554 >gi|18767|lcl|protein:vir:6335 Length: 601 # NCBI annotation: putative DNA maturase B # Family: family:all:697 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877482;genbank:gi:33300854;uniprot:Q7Y2C2 ;genbank:GeneID:1482617 Length = 601 Score = 248 bits (634), Expect = 1e-67, Method: Compositional matrix adjust. Identities = 176/536 (32%), Positives = 262/536 (48%), Gaps = 54/536 (10%) Query: 33 KPTKCQIDMAKKLSAGDERRFILQAFRGIGKSFITCAFVVWKLWNNPDLKFMIVSASKER 92 K T Q+D+A + + + A RG KS I C +VVW + NP + M+VS S ++ Sbjct: 34 KMTWMQLDIADFMQDSPNKAMV-AAQRGEAKSTIACIYVVWCITQNPATRAMLVSGSGDK 92 Query: 93 ADANSVFIKRIIDLLPFLHELKPGP--GQRDSSLAFDVGPAKP--DHSPSVKSVGITGQL 148 A+ N I ++I L L+P G R S+ +FDV A + S S+ +GIT L Sbjct: 93 AEENGQLITKLIMHWDLLAYLRPEARMGDRTSATSFDVNWALKGVEKSASINCIGITAAL 152 Query: 149 TGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAILKPGGTIIYLGTPQTEMTLYRE 208 G RADILI DD+E N T T R L +EF +I G I+YLGTPQ+ ++Y Sbjct: 153 QGYRADILIPDDIETTKNGLTATERAKLTRQSQEFTSICT-HGKILYLGTPQSRESIYNG 211 Query: 209 LEGRGYVTTIWPARYPK--DQADWDSYGPRLAPMLAAELQA--------------DGSLF 252 L RG++ IWP R+P +QA YG LAP + A + DG+ Sbjct: 212 LPARGFLMRIWPGRFPTLDEQA---RYGDWLAPSILARIARLEEKGHNPRTGKGLDGTRG 268 Query: 253 WAPTDEVRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQDKGP 312 WA D R++++DL ++EL G GF LQ+ML+ +L+D ++ LKLRD + + P Sbjct: 269 WA-ADPQRYNEEDLLDKELDQGPEGFQLQYMLDTSLADEQRMQLKLRDLLFIDATHESVP 327 Query: 313 TTLIWMPNAANECKGVPVVGLKGDRFHRYESV----------GQATASYAQKILVIDPSG 362 + W AA+E LK D HR+ + A Q + +DP+G Sbjct: 328 EQVAW---AADE-----RFKLKFDA-HRFPVIKPELYLPALMAGGWAPLQQMTMFVDPAG 378 Query: 363 RGKDETGYAVLYQLNGYIFLMDAGGFRGGYEDTVLQALANIAKIHKVNEIVVEGNFGDGM 422 G DE YA+ L YI ++ GG++GG+ + L+ +A + V I VE N G G Sbjct: 379 DGGDELSYALGGTLGPYIHVVSIGGWKGGFAEENLEKCIALAARYGVKVIYVEKNLGAGA 438 Query: 423 YIKLLAPVVTATFP---------CAITEVKSKGQKELRICDVLEPVLGSHKLVIQESLIE 473 +L + + P + + + GQKE RI D L P++ H+L+ S ++ Sbjct: 439 
VGQLFRNHMRSIDPDTGKLRYEGIGVEDRQKSGQKERRIIDTLRPIMQRHRLIFHVSAMD 498 Query: 474 KDYRTALNADGTTDTSYSLLYQLTRITRERGSLAHDDRLDALAIGVQFFTEALERD 529 D+ + S+ +Q+ IT +RGSL DDR+DAL V+ L +D Sbjct: 499 SDHVSCQQYPADKRNERSVFHQIHNITTDRGSLPKDDRIDALEGLVRELAPTLVKD 554 >gi|10294|lcl|protein:vir:105656 Length: 405 # NCBI annotation: putative large terminase subunit # Family: family:all:697 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425018;genbank:gi:83571766;uniprot:Q2WC34 ;genbank:GeneID:3837297 Length = 405 Score = 184 bits (466), Expect = 4e-48, Method: Compositional matrix adjust. Identities = 121/347 (34%), Positives = 181/347 (52%), Gaps = 19/347 (5%) Query: 32 PKPTKCQIDMAKKLSAGDERRFILQAFRGIGKSFITCAFVVWKLWNNPDLKFMIVSASKE 91 P + Q D+ K L G + R I +A RGI K+ ++ + V+++ + P + M+VS + + Sbjct: 48 PHLIRMQADILKFLFYGHKYRLI-EAPRGIAKTTLSAIYTVFRIIHEPHKRIMVVSQNAK 106 Query: 92 RADANSVFIKRIIDLLPFLHELKPG--PGQRDSSLAFDVGPA--KPDHSPSVKSVGITGQ 147 RA+ + ++ +I L FL + P G R S AF++ D SPSV I Sbjct: 107 RAEEIAGWVVKIFRGLDFLEFMLPDIYAGDRASVKAFEIHYTLRGSDKSPSVSCYSIEAG 166 Query: 148 LTGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAILKPGGTIIYLGTPQTEMTLYR 207 + G+RADI++ADDVE N+ T R L EL KEF++I G IIYLGTPQ ++Y Sbjct: 167 MQGARADIILADDVESMQNARTAAGRALLEELTKEFESI-NQFGDIIYLGTPQNVNSIYN 225 Query: 208 ELEGRGYVTTIWPARYPKDQADWDSYGPRLAPMLAAELQADGSL---------FWAPTDE 258 L RGY IW ARYP + + YG LAPM+ +++ + +L AP Sbjct: 226 NLPARGYSVRIWTARYPSVEQE-QCYGDFLAPMIVQDMKDNPALRSGYGLDGNSGAPCAP 284 Query: 259 VRFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQDKGPTTLIWM 318 +DD L E+E+S G F LQFMLN + D ++YPL+L + I +F ++ P W Sbjct: 285 EMYDDDVLIEKEISQGAAKFQLQFMLNTRMMDADRYPLRLNNLIFTSFGTEEVPVMPTWS 344 Query: 319 PNAANECKGVPVVGLKGDRFHRYESVGQAT--ASYAQKILVIDPSGR 363 ++ N P G K F Y V + + +KI+ IDP+G+ Sbjct: 345 NDSINIIGDAPKYGNKPTDFM-YRPVARPYEWGAVTRKIMYIDPAGK 390 >gi|5868|lcl|protein:vir:95431 Length: 535 # NCBI annotation: hypothetical protein ORF049 # Family: family:all:543 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: 
genbank:acc:YP_001294642;genbank:gi:149408208;genbank:Ge neID:5236998 Length = 535 Score = 58.2 bits (139), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 39/166 (23%), Positives = 76/166 (45%), Gaps = 10/166 (6%) Query: 59 RGIGKSFITCAFVVWKLWNNPDLKFMIVSASKERADANSVFIKRIIDL----LPFLHELK 114 R KS + + W + +P++ + +SA+ A+ +K I+ F + Sbjct: 78 RAHLKSHMVATWCAWIITRHPEVTILYISATATLAETQLYAVKNILASSVYNRYFPEYIH 137 Query: 115 PGPGQRD----SSLAFDVGPAKPD--HSPSVKSVGITGQLTGSRADILIADDVEVPNNSA 168 P G+R+ ++++ D K + ++ + G+T TG ADI++ADD+ VP N+ Sbjct: 138 PQEGKREKWSSNAMSIDHVQRKKEGIRDATIATAGLTTNTTGWHADIIVADDLVVPENAY 197 Query: 169 TQTARDHLGELVKEFDAILKPGGTIIYLGTPQTEMTLYRELEGRGY 214 T+ R+ + + +F +I GG + GT +Y + Y Sbjct: 198 TEDGRESVQKKSSQFTSIRNAGGFTMACGTRYHPSDIYATWRSQKY 243 >gi|12711|lcl|protein:vir:80169 Length: 565 # NCBI annotation: terminase # Family: family:all:543 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285801;genbank:gi:148747835;genbank:Ge neID:5220445 Length = 565 Score = 56.2 bits (134), Expect = 1e-09, Method: Compositional matrix adjust. 
Identities = 114/539 (21%), Positives = 212/539 (39%), Gaps = 128/539 (23%) Query: 49 DERRFILQAFRGIGKSFI-TCAFVVWKLWNNPDLKFMIVSASKERADA----------NS 97 D RR ++ A RG KS + + +V+W+++ NPD++ ++ + K + A ++ Sbjct: 55 DNRRRLVLAPRGHLKSTVCSVLYVLWRIYRNPDIRVLVGTNLKRLSRAFIRELRQYFEDT 114 Query: 98 VFIKRIIDLLPFLH-ELKPGPGQRD--------SSLAFDVGPA----------------- 131 + + ++ P + L P D +++ +D A Sbjct: 115 WLQQNVWNVRPHIEGALVPALSASDRRKRNSQRNNVDYDEALATLTDDTKLIWSMEALQV 174 Query: 132 -KPD--HSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLGELVKEFDAILK 188 +P P+V++V I +TG D+LI DD+ NS T+ +++ E ++ +++L Sbjct: 175 IRPTVMKEPTVQTVSIGTTVTGDHYDLLILDDIVDFENSKTEDKAENILEWTRDLESVLD 234 Query: 189 PGGTIIY-------------LGTPQTEMTLYRELEGRGYVTTIWPARYPKDQADWDSYGP 235 P +Y +G + E T + G I RY + WD YG Sbjct: 235 PRQEHVYHYNPVANKAGALVVGKTKCEFTDFV-----GDEAVILGTRYFQ----WDYYGY 285 Query: 236 RL--APMLA-----------AELQADGSLFWAPTDEVRFDDK---DLRERELSYGKGGFA 279 L A L E DG L+ E RFD + +++ R S+ + FA Sbjct: 286 LLDEAEYLGIRSFMCNIYKNGEDDKDGYLW-----EERFDAEVVENIKRRLNSFRR--FA 338 Query: 280 LQFMLNPNLSDMEKYPLKLRDFIVGTFAQDKGPTTLIWMPNAANECKGVPVVGLKGDRFH 339 Q++ +D + P + + + A+ + V + D + Sbjct: 339 SQYLNRIVTADEQLLPQE----------------NVQYFHPASVDVSDDGFVSINRDGY- 381 Query: 340 RYESVGQATASYAQKILVIDPSGRGKDETGYAVLY------QLNGYIFLMDAGGFRGGYE 393 + +LV+DP+ K VL N YIF + AG F Sbjct: 382 ---------KVRVKPMLVVDPAVSQKKTADNTVLTVGGYDNDKNLYIFDVKAGKFTPS-- 430 Query: 394 DTVLQALANIAKIHKVNEIVVEGNFGDGMYIKLLAPVVTATFPCAITEVKSKGQKELRIC 453 ++ + +A +K+N + +E G + + P AI E + KG K+ RI Sbjct: 431 -ETIKHIFTLADKYKLNAVTLETVGGFALLSYQVKDAFKTHRPLAIREYRPKGDKQGRIT 489 Query: 454 DVLEPVLGSHKLVIQESL-IEKDYRTALNA------DGTTDTSYSLLYQLTRITRERGS 505 +LEP + + +Q L I + + L++ D DT ++++ +L+ TR+ G+ Sbjct: 490 AMLEPHWTNKSIYMQSYLAIMPELKDELDSFPLSKHDDVVDT-FAIICELSTPTRKEGT 547 >gi|15319|lcl|protein:vir:3140 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:543 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640322;genbank:gi:21234401;genbank:GeneID :956064 Length = 
535 Score = 47.8 bits (112), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 58/249 (23%), Positives = 97/249 (38%), Gaps = 31/249 (12%) Query: 59 RGIGKSFITCAFVVWKLWNNPDLKFMIVSASKERADANSVFIKRIIDLLPFLH----ELK 114 R KS +V W+++ NP + V A++ A IK+I+ F ++ Sbjct: 71 RDHQKSHCIAVWVCWQIFKNPAVTIAYVCATESLAILQLYDIKQILTSDEFTRLSPDMIE 130 Query: 115 PGPGQR----DSSLAFDVGPAKPDH--SPSVKSVGITGQLTGSRADILIADDVEVPNNSA 168 P +R ++++ D K + P+V + G+ G+ +I++ DDV + NS Sbjct: 131 PMEKKRQKWAETAIIVDHPIRKKERPRDPTVLATGLDSNNIGAHCNIMVKDDVVIDKNSL 190 Query: 169 TQTARDHLGELVKEFDAILKPGGTIIYLGTPQTEMTLYRELEGRGYVTTIWPAR------ 222 T+TAR + +IL G +GT Y+ L +W Sbjct: 191 TETARQKVEAKAGHLSSILTTDGMEFCVGTRYHPKDHYQTL--IDMTEEVWEGDQLVGER 248 Query: 223 --YPKDQADWDSYGPRLAPMLAAELQADGSLFWAPTDEVRFDDKDLRERELSYGKG--GF 278 Y + G L P +A E +DG +F FD L ++ Y K F Sbjct: 249 PVYAVHTRVVEVEGVFLWPRMARE--SDGKMF-------GFDRAQLSRKKAKYRKDMRNF 299 Query: 279 ALQFMLNPN 287 Q+ +PN Sbjct: 300 YCQYYNDPN 308 >gi|692|lcl|protein:vir:3649 Length: 507 # NCBI annotation: gp18 # Family: family:all:144 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705644;genbank:gi:23752329;genbank:GeneID :955741 Length = 507 Score = 36.6 bits (83), Expect = 8e-04, Method: Compositional matrix adjust. 
Identities = 45/157 (28%), Positives = 74/157 (47%), Gaps = 33/157 (21%) Query: 118 GQRDSSLAFDVGPAKPDHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLG 177 G +D+S FDV P+ + VG+ G LTG D+ I DD AT+ A + L Sbjct: 157 GNKDTSNEFDV----PEGG-EFRGVGVGGPLTGFSIDVGIIDD-------ATKNAEEALS 204 Query: 178 ELVKE-----FDAI----LKPGGTIIYLGTPQTEMTLY----RELEGRGYVTTI-WPARY 223 +V++ +D++ L+ +I +GTP + L R++EG+ T + +PA Sbjct: 205 AVVQDGLENWYDSVLLTRLQQLSGVILIGTPWSANDLLARVRRKMEGQPNFTLLSFPALN 264 Query: 224 PKDQADWDSYGP--RLAPMLAA-----ELQADGSLFW 253 DQ ++ P L P L + E++ + S FW Sbjct: 265 DPDQIGYNPDLPLGALVPHLHSADKLREMRRNISEFW 301 >gi|1758|lcl|protein:vir:101556 Length: 507 # NCBI annotation: gp18 # Family: family:all:144 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958123;genbank:gi:41057669;genbank:GeneID :2716813 Length = 507 Score = 36.6 bits (83), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 45/157 (28%), Positives = 74/157 (47%), Gaps = 33/157 (21%) Query: 118 GQRDSSLAFDVGPAKPDHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLG 177 G +D+S FDV P+ + VG+ G LTG D+ I DD AT+ A + L Sbjct: 157 GNKDTSNEFDV----PEGG-EFRGVGVGGPLTGFSIDVGIIDD-------ATKNAEEALS 204 Query: 178 ELVKE-----FDAI----LKPGGTIIYLGTPQTEMTLY----RELEGRGYVTTI-WPARY 223 +V++ +D++ L+ +I +GTP + L R++EG+ T + +PA Sbjct: 205 AVVQDGLENWYDSVLLTRLQQLSGVILIGTPWSANDLLARVRRKMEGQPNFTLLSFPALN 264 Query: 224 PKDQADWDSYGP--RLAPMLAA-----ELQADGSLFW 253 DQ ++ P L P L + E++ + S FW Sbjct: 265 DPDQIGYNPDLPLGALVPHLHSADKLREMRRNISEFW 301 >gi|11323|lcl|protein:vir:78588 Length: 507 # NCBI annotation: TerL # Family: family:all:144 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294855;genbank:gi:149882918;genbank:Ge neID:5291059 Length = 507 Score = 36.6 bits (83), Expect = 9e-04, Method: Compositional matrix adjust. 
Identities = 46/157 (29%), Positives = 75/157 (47%), Gaps = 33/157 (21%) Query: 118 GQRDSSLAFDVGPAKPDHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLG 177 G +D+S FDV PA + + VG+ G LTG D+ I DD AT+ A + L Sbjct: 157 GGKDTSNEFDV-PAGGE----FRGVGVGGPLTGFSIDVGIIDD-------ATKNAEEALS 204 Query: 178 ELVKE-----FDAI----LKPGGTIIYLGTPQTEMTLY----RELEGRGYVTTI-WPARY 223 +V++ +D++ L+ +I +GTP + L R++EG+ T + +PA Sbjct: 205 AVVQDGLENWYDSVLLTRLQQLSGVILIGTPWSANDLLARVRRKMEGQPNFTLLSFPALN 264 Query: 224 PKDQADWDSYGP--RLAPMLAA-----ELQADGSLFW 253 DQ ++ P L P L + E++ + S FW Sbjct: 265 DPDQIGYNPDLPLGALVPHLHSADKLREMRRNISEFW 301 >gi|6927|lcl|protein:vir:106715 Length: 507 # NCBI annotation: gp19 # Family: family:all:144 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944327;genbank:gi:38638626;genbank:GeneID :2657344 Length = 507 Score = 36.6 bits (83), Expect = 9e-04, Method: Compositional matrix adjust. Identities = 46/157 (29%), Positives = 75/157 (47%), Gaps = 33/157 (21%) Query: 118 GQRDSSLAFDVGPAKPDHSPSVKSVGITGQLTGSRADILIADDVEVPNNSATQTARDHLG 177 G +D+S FDV PA + + VG+ G LTG D+ I DD AT+ A + L Sbjct: 157 GGKDTSNEFDV-PAGGE----FRGVGVGGPLTGFSIDVGIIDD-------ATKNAEEALS 204 Query: 178 ELVKE-----FDAI----LKPGGTIIYLGTPQTEMTLY----RELEGRGYVTTI-WPARY 223 +V++ +D++ L+ +I +GTP + L R++EG+ T + +PA Sbjct: 205 AVVQDGLENWYDSVLLTRLQQLSGVILIGTPWSANDLLARVRRKMEGQPNFTLLSFPALN 264 Query: 224 PKDQADWDSYGP--RLAPMLAA-----ELQADGSLFW 253 DQ ++ P L P L + E++ + S FW Sbjct: 265 DPDQIGYNPDLPLGALVPHLHSADKLREMRRNISEFW 301 >gi|17058|lcl|protein:vir:5248 Length: 487 # NCBI annotation: putative terminase large subunit TerL # Family: family:all:144 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852753;genbank:gi:31544028;interpro:IPR00 4921;interpro:IPR006517;uniprot:Q7Y5U7;genbank:GeneID:27 53564 Length = 487 Score = 36.2 bits (82), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 111/496 (22%), Positives = 185/496 (37%), Gaps = 106/496 (21%) Query: 48 GDERRFILQAFRGIGKS-FITCAFVVWKLWNNPDLKFMIVSASKERADANSVFIKRIIDL 106 G + R ++ A GKS + F W NP+L+ + S S + A ++ ++RIID Sbjct: 61 GKQPRLMIYAPPRSGKSELFSRRFPAWVFGQNPELQIIACSYSADLASRMNLDVQRIIDD 120 Query: 107 LPFLHELKPGPGQRDSSLAFDVGPAKPDHSPSVKSVGITGQLTGSR------------AD 154 P H + P ++A G KP + + I G L R AD Sbjct: 121 -PIYHSIFPNTALNIKNIATISG--KPLRNSEI--FEIVGHLGAYRSAGVGGGITGMGAD 175 Query: 155 ILIADD-VEVPNNSATQTARDHLGELVKEFDAILKPGGTIIYLGTPQTEMTLYRELEGRG 213 I I DD V+ + +QT RD + + TLY L + Sbjct: 176 IAIIDDPVKDAKEANSQTVRDSIWDWYT---------------------TTLYTRLSPKS 214 Query: 214 YVTTIWPARYPKDQADWDSYGPRLAPMLAAELQADGSLFWAPTDEVRFDDKDLRERELSY 273 V + R+ +D LA L E + G + V+F + E + + Sbjct: 215 GV-LLGMTRWHEDD---------LAGRLIKEAENGGDQWRI----VKF--PAIAEEDEEF 258 Query: 274 GKGGFALQFMLNPNLSDMEKYPLKLRDFIVGTFAQDKGPTTLIWMPNAANECKGVPVVGL 333 K G L+P D+E+ K+R VG+ A + ++ +N+ G+ + Sbjct: 259 RKEGEP----LHPERFDLERLN-KIRQ-AVGSQAWNA-----LYQQRPSNKGGGI----I 303 Query: 334 KGDRFHRYE--SVGQATASYAQKILVIDPSGRGKDETGYAVLY----QLNGYIFLMDAGG 387 KG F RY+ + + A YA D + + K Y+V +G +++D Sbjct: 304 KGSWFGRYKVPPIIKVKAIYA------DTAQKTKQHNDYSVFIVAGKGADGKAYILDL-- 355 Query: 388 FRGGYEDTVL-QALANIAKIHKVNE---IVVEGNFGDGMYIKLLAPVVTATFPCAITEVK 443 RG +E L Q L ++ HK + I+ N D L + IT ++ Sbjct: 356 IRGKWEAPELEQTLKDVWAKHKAKKETGILTRANVEDKASGTSLIQTIRRNNQIPITPIQ 415 Query: 444 SKGQKELRICDVLEPVLGSHKLVIQESLIEKDYRTALNADGTTDTSYSLLYQLTRITRER 503 K R+ V + + ++ + + D+ A TD Sbjct: 416 VDADKYTRVLGVQGYIESGYVMLPESAPWIADFINECEAFTATD---------------- 459 Query: 504 GSLAHDDRLDALAIGV 519 S AHDD++DAL + + Sbjct: 460 -SHAHDDQVDALVMAI 474 >gi|4089|lcl|protein:vir:94598 Length: 581 # NCBI annotation: PfWMP4_40 # Family: family:all:543 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762670;genbank:gi:115304378;genbank:GeneI D:5142298 Length = 581 Score = 35.4 bits (80), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 15/50 (30%), Positives = 32/50 (64%), Gaps = 3/50 (6%) Query: 54 ILQAFRGIGKSFITCAFVVWKLWNNPDLKFMIVSASKERADANSVFIKRI 103 +LQ RG KS +T +++W+++ NP+++ + S +E ++A FI+ + Sbjct: 90 LLQMPRGHLKSTLTVGYIMWRIYRNPNIRMLHASNIRELSEA---FIREL 136 >gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: terminase large subunit # Family: family:all:4877 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429607;genbank:gi:156564098;genbank:Ge neID:5525534 Length = 635 Score = 35.0 bits (79), Expect = 0.002, Method: Compositional matrix adjust. Identities = 26/123 (21%), Positives = 55/123 (44%), Gaps = 14/123 (11%) Query: 48 GDERRFILQAFRGIGKSFITCAFVVWKLWNNPD------LKFMIVSASKERADANSVFIK 101 D +R +L+ R +GK+ C ++W + P+ +I++ +E+ D + K Sbjct: 80 ADSKRTVLRLGRRLGKTETMCIMILWHAFTQPNKGPNNQYDILIIAPYEEQVD---LIFK 136 Query: 102 RIIDLLPFLHELKPGPGQRDSSLAFDVGPAKPDHSPSVKSVGITG--QLTGSRADILIAD 159 R+ L+ ++ P RD ++ H + S +G G RAD+++ D Sbjct: 137 RLSQLIDMSGDVNPS---RDIDKHIELPNGTVIHGITAGSKSGSGAANTRGQRADLIVLD 193 Query: 160 DVE 162 +++ Sbjct: 194 EMD 196 >gi|12543|lcl|protein:vir:80106 Length: 486 # NCBI annotation: TerL # Family: family:all:144 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468706;genbank:gi:157325286;genbank:Ge neID:5601797 Length = 486 Score = 34.3 bits (77), Expect = 0.004, Method: Compositional matrix adjust. 
Identities = 49/197 (24%), Positives = 84/197 (42%), Gaps = 16/197 (8%) Query: 10 DLELIKRSFVAFL-FVLWRALNLPKPTKCQIDMAKKLSAGDERRFILQAFRGIGKSF-IT 67 D+EL KRS+ ++ + + L + T+ + + + G+++ +I + KS IT Sbjct: 20 DIELAKRSYRDYVTYSHFGDYQLFEHTELICEKLQHIIDGEQKYYIFEMPPRHSKSMTIT 79 Query: 68 CAFVVWKLWNNPDLKFMIVSAS----KERADANSVFIKRIIDLLPFLHELKPGPGQRDSS 123 F + L NP + + S S K+ N IK D L +H G D S Sbjct: 80 ETFPSYFLMKNPKKRVITTSYSDALAKQFGRKNRDKIKMAGDQLFDIHINPANSGVTDWS 139 Query: 124 LAFDVGPAKPDHSPSVKSVGITGQLTGSRADILIADD-VEVPNNSATQTARDHL-GELVK 181 + + + S + G TG AD+LI DD ++ + ++T RD + E Sbjct: 140 I--------DQYGGGMYSTSMLGGATGRGADLLIIDDPIKNREEAESKTIRDKIYQEWES 191 Query: 182 EFDAILKPGGTIIYLGT 198 F L G ++I + T Sbjct: 192 TFFTRLHKGHSVIVIMT 208 >gi|13633|lcl|protein:vir:3963 Length: 462 # NCBI annotation: putative terminase large subunit # Family: family:all:144 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663671;genbank:gi:21716108;genbank:GeneID :951204 Length = 462 Score = 28.1 bits (61), Expect = 0.33, Method: Compositional matrix adjust. Identities = 26/117 (22%), Positives = 42/117 (35%), Gaps = 17/117 (14%) Query: 398 QALANIAKIHKVNEIVVEGNFGDGMYIKLLAPVVTATFPCAITEVKSKGQKELRICDVLE 457 A+AN ++VN +E N G + + + + CA+ + KE RI Sbjct: 341 NAVANQLINNRVNASRIERNNGGRSFARSVRDKIQGKVACAVEDFFQGNNKEARI----- 395 Query: 458 PVLGSHKLVIQESLIEKDYRTALNADGTTDTSYSLLYQLTRITRERGSLAHDDRLDA 514 + + Q + D+R T + YQ + G HDD DA Sbjct: 396 --YSNSYWIEQHVRLPNDWR----------TRFPEYYQAMTTYQREGKNKHDDAPDA 440 >gi|291|lcl|protein:vir:3608 Length: 462 # NCBI annotation: TerL # Family: family:all:144 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112694;genbank:gi:13786562;genbank:GeneID :921033 Length = 462 Score = 27.3 bits (59), Expect = 0.49, Method: Compositional matrix adjust. 
Identities = 26/117 (22%), Positives = 41/117 (35%), Gaps = 17/117 (14%) Query: 398 QALANIAKIHKVNEIVVEGNFGDGMYIKLLAPVVTATFPCAITEVKSKGQKELRICDVLE 457 A+AN ++VN +E N G + + + + CA+ + KE RI Sbjct: 341 NAVANQLINNRVNASRIERNNGGRSFARSVRDKIQGKVACAVEDFFQGNNKEARI----- 395 Query: 458 PVLGSHKLVIQESLIEKDYRTALNADGTTDTSYSLLYQLTRITRERGSLAHDDRLDA 514 + + Q D+R T + YQ + G HDD DA Sbjct: 396 --YSNSYWIEQHVRFPNDWR----------TRFPEYYQAMTTYQREGKNKHDDAPDA 440 >gi|15492|lcl|protein:vir:732 Length: 402 # NCBI annotation: unknown # Family: family:all:144 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108709;genbank:gi:13487831;genbank:GeneID :920905 Length = 402 Score = 27.3 bits (59), Expect = 0.50, Method: Compositional matrix adjust. Identities = 26/117 (22%), Positives = 41/117 (35%), Gaps = 17/117 (14%) Query: 398 QALANIAKIHKVNEIVVEGNFGDGMYIKLLAPVVTATFPCAITEVKSKGQKELRICDVLE 457 A+AN ++VN +E N G + + + + CA+ + KE RI Sbjct: 281 NAVANQLINNRVNASRIERNNGGRSFARSVRDKIQGKVACAVEDFFQGNNKEARI----- 335 Query: 458 PVLGSHKLVIQESLIEKDYRTALNADGTTDTSYSLLYQLTRITRERGSLAHDDRLDA 514 + + Q D+R T + YQ + G HDD DA Sbjct: 336 --YSNSYWIEQHVRFPNDWR----------TRFPEYYQAMTTYQREGKNKHDDAPDA 380 >gi|2538|lcl|protein:vir:94056 Length: 483 # NCBI annotation: putative terminase large subunit # Family: family:all:144 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453630;genbank:gi:84662666;genbank:GeneID :5142566 Length = 483 Score = 27.3 bits (59), Expect = 0.50, Method: Compositional matrix adjust. 
Identities = 20/63 (31%), Positives = 30/63 (47%), Gaps = 2/63 (3%) Query: 139 VKSVGITGQLTGSRADILIADDVEVPNNSA-TQTARD-HLGELVKEFDAILKPGGTIIYL 196 +K+ G+ G LTG D+ + DD+ A +QT +D H F L+ I + Sbjct: 147 IKAQGVGGSLTGFSIDVGLNDDLTADAQDALSQTVQDGHQDWYATVFTTRLQQRSGQINM 206 Query: 197 GTP 199 GTP Sbjct: 207 GTP 209 >gi|19603|lcl|protein:vir:4076 Length: 205 # NCBI annotation: major tail shaft protein # Family: family:all:11746 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043555;genbank:gi:9628689;genbank:GeneID: 1261181 Length = 205 Score = 26.2 bits (56), Expect = 1.2, Method: Compositional matrix adjust. Identities = 13/33 (39%), Positives = 17/33 (51%), Gaps = 3/33 (9%) Query: 206 YRELEGRGYVTTIWP---ARYPKDQADWDSYGP 235 YR+ +G GY T +P A P D A+ D P Sbjct: 110 YRDDDGTGYKATFYPSVQATTPSDTAEADEESP 142 >gi|19384|lcl|protein:vir:7851 Length: 887 # NCBI annotation: gp8 # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817458;genbank:gi:29565887;genbank:GeneID :1259165 Length = 887 Score = 25.8 bits (55), Expect = 1.5, Method: Compositional matrix adjust. Identities = 22/74 (29%), Positives = 30/74 (40%), Gaps = 3/74 (4%) Query: 243 AELQADGSLFWAPTDEV---RFDDKDLRERELSYGKGGFALQFMLNPNLSDMEKYPLKLR 299 AEL A WA D + R D R +YG G + P D ++ PL L Sbjct: 148 AELVASDHHLWAVNDRLKGERVIDTAELYRTQTYGARGDRRYTVTVPEALDRDEAPLPLD 207 Query: 300 DFIVGTFAQDKGPT 313 +I+G + D T Sbjct: 208 PYILGAWLGDGTAT 221 >gi|14353|lcl|protein:vir:3149 Length: 554 # NCBI annotation: unknown # Family: family:all:28407 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665920;genbank:gi:22091106;genbank:GeneID :951323 Length = 554 Score = 25.4 bits (54), Expect = 2.0, Method: Compositional matrix adjust. 
Identities = 43/175 (24%), Positives = 77/175 (44%), Gaps = 21/175 (12%) Query: 44 KLSAGDERRFILQAFRGIGKSFITCAFVVWKLWNNPDLKFMIVSASKERA--DANSVFIK 101 K +GD IL R K+ IT A+++ L + + ++ + A++ F K Sbjct: 81 KFDSGDR---ILLCHRDGLKTTITLAYLIAGLEYKSGFRGIWAMNNQIQVGKKADTEFWK 137 Query: 102 RIIDLLPFLHELKPGPGQRDSSLAFDVGPAKPDHSPSVKSVG-ITGQLTGSRADILIADD 160 ++D P+L L P + + AK + S+ + G + G + G RA +LI DD Sbjct: 138 -MVDRNPWLINLNAPPEK-------EAVKAKVFANGSILNAGWLGGGIEGDRAHLLILDD 189 Query: 161 -VEVPNNSATQTARDHLGELVKEFDAILKPGGTIIYLGT---PQTEMTLYRELEG 211 ++ + T+ D + + ++K G + +GT P T +R LEG Sbjct: 190 IIKEKGDGDTEDVLDWIEAVCV---PMVKDHGRTVVIGTRKRPDDIYTHFRTLEG 241 >gi|18586|lcl|protein:vir:8649 Length: 564 # NCBI annotation: gp7 # Family: family:all:144 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817768;genbank:gi:29566200;genbank:GeneID :1259404 Length = 564 Score = 24.3 bits (51), Expect = 4.7, Method: Compositional matrix adjust. Identities = 29/130 (22%), Positives = 51/130 (39%), Gaps = 25/130 (19%) Query: 48 GDERRFILQAFRGIGKSFITCAFVVWK-LWNNPDLKFMIVSASKERADANSVFIKRIIDL 106 G R ++ GKS + + V + L NP+ + ++ ++ A +S + +I Sbjct: 87 GVTRNLLITCPPQEGKSTMASVYTVLRALQLNPNARIILACYGQDLAHGHSRKCRDLIK- 145 Query: 107 LPFLHELKPGPGQRDS----------SLAFDVGPAKPDH------SPSVKSVGITGQLTG 150 + G G RD+ L + G K S + + G+ G +TG Sbjct: 146 -------RHGSGVRDAMTGAQIEDKLGLKLERGANKVSEWSIEGGSGGLVATGLGGTITG 198 Query: 151 SRADILIADD 160 AD+ I DD Sbjct: 199 KPADLFIIDD 208 >gi|27412|lcl|protein:vir:7207 Length: 163 # NCBI annotation: gp19 tail tube protein # Family: family:all:1107 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049781;genbank:gi:9632593;genbank:GeneID: 1258727 Length = 163 Score = 23.9 bits (50), Expect = 5.6, Method: Compositional matrix adjust. 
 Identities = 11/29 (37%), Positives = 17/29 (58%), Gaps = 4/29 (13%)

Query: 204 TLYRELEGRGYVTTIWPARYPKDQADWDS 232
           T+ +E+E +G    +WP    + Q DWDS
Sbjct: 122 TVTKEIEIKG----LWPTNVGELQLDWDS 146

>gi|19767|lcl|protein:vir:6384 Length: 677 # NCBI annotation: terminase large subunit TerL # Family: family:all:140 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918997;genbank:gi:34610172;genbank:gi:91214212;genbank:GeneID:2559603
          Length = 677

 Score = 23.5 bits (49), Expect = 6.7, Method: Compositional matrix adjust.
 Identities = 12/34 (35%), Positives = 18/34 (52%)

Query: 308 QDKGPTTLIWMPNAANECKGVPVVGLKGDRFHRY 341
           ++ G   L   P+ A  CKG+  +  +GDR  RY
Sbjct: 206 EEDGRKWLASSPHEAPPCKGILGLYNRGDRRRRY 239

>gi|2061|lcl|protein:vir:97894 Length: 595 # NCBI annotation: gp2 # Family: family:all:144 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655098;genbank:gi:109391848;genbank:GeneID:4157257
          Length = 595

 Score = 23.5 bits (49), Expect = 6.7, Method: Compositional matrix adjust.
 Identities = 10/20 (50%), Positives = 14/20 (70%)

Query: 141 SVGITGQLTGSRADILIADD 160
           +  GI  +LTG  AD++I DD
Sbjct: 207 AAGIGSRLTGMPADLMIIDD 226

>gi|1961|lcl|protein:vir:107494 Length: 595 # NCBI annotation: gp2 # Family: family:all:144 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943780;genbank:gi:38638405;genbank:GeneID:2657174
          Length = 595

 Score = 23.5 bits (49), Expect = 6.7, Method: Compositional matrix adjust.
 Identities = 10/20 (50%), Positives = 14/20 (70%)

Query: 141 SVGITGQLTGSRADILIADD 160
           +  GI  +LTG  AD++I DD
Sbjct: 207 AAGIGSRLTGMPADLMIIDD 226

>gi|22058|lcl|protein:vir:103457 Length: 163 # NCBI annotation: tail tube monomer # Family: family:all:1107 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803109;genbank:gi:116326389;genbank:GeneID:4405486
          Length = 163

 Score = 23.5 bits (49), Expect = 6.9, Method: Compositional matrix adjust.
 Identities = 11/29 (37%), Positives = 17/29 (58%), Gaps = 4/29 (13%)

Query: 204 TLYRELEGRGYVTTIWPARYPKDQADWDS 232
           T+ +E+E +G    +WP    + Q DWDS
Sbjct: 122 TVTKEVEIKG----LWPTNVGELQLDWDS 146

>gi|7375|lcl|protein:vir:99083 Length: 566 # NCBI annotation: gp7 # Family: family:all:144 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655687;genbank:gi:109521765;genbank:GeneID:4157805
          Length = 566

 Score = 23.1 bits (48), Expect = 9.5, Method: Compositional matrix adjust.
 Identities = 28/130 (21%), Positives = 51/130 (39%), Gaps = 25/130 (19%)

Query: 48  GDERRFILQAFRGIGKSFITCAFVVWK-LWNNPDLKFMIVSASKERADANSVFIKRIIDL 106
           G  R  ++      GKS +   + V + L  NP+ + ++    ++ A  +S   + +I
Sbjct: 89  GVTRNLLITCPPQEGKSTMASVYTVLRALQLNPNARIILACYGQDLAHGHSRKCRDLIK- 147

Query: 107 LPFLHELKPGPGQRDS----------SLAFDVGPAKPDH------SPSVKSVGITGQLTG 150
                  + G G RD+           L  + G  K         +  + + G+ G +TG
Sbjct: 148 -------RHGSGVRDAMTGAQIEDKLGLKLERGANKVSEWSIEGGTGGLVATGLGGTITG 200

Query: 151 SRADILIADD 160
             AD+ I DD
Sbjct: 201 KPADLFIIDD 210

  Database: capsid_neck_tail
    Posted date:  Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database:  514

Lambda     K      H
   0.319    0.137    0.407

Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 265,944
Number of Sequences: 514
Number of extensions: 12242
Number of successful extensions: 151
Number of sequences better than 100.0: 45
Number of HSP's better than 100.0 without gapping: 39
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 32
Number of HSP's gapped (non-prelim): 56
length of query: 582
length of database: 206,069
effective HSP length: 77
effective length of query: 505
effective length of database: 166,491
effective search space: 84077955
effective search space used: 84077955
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 40 (20.0 bits)
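
Note on the statistics footer: the effective lengths and the search space reported above can be re-derived from the other footer values. The Python sketch below is not part of the BLAST output. It assumes the usual length correction (effective HSP length subtracted once from the query and once per database sequence) and the standard Karlin-Altschul conversion from raw score to bit score, using the second Lambda/K pair in the footer, which is assumed here to be the gapped pair. All function and constant names are illustrative.

```python
import math

# Values copied from the statistics footer of the report.
QUERY_LENGTH = 582
DB_LETTERS = 206_069
DB_SEQUENCES = 514
EFFECTIVE_HSP_LENGTH = 77
# Second Lambda/K pair in the footer (assumed to be the gapped parameters).
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space)."""
    eff_query = query_len - hsp_len
    # The correction is applied once for every database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


eff_query, eff_db, space = effective_search_space(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print(eff_query, eff_db, space)  # prints: 505 166491 84077955

# Raw score 49 is reported as 23.5 bits for several of the weakest hits.
print(f"{bit_score(49, GAPPED_LAMBDA, GAPPED_K):.1f} bits")
```

The three printed lengths agree with the "effective length of query", "effective length of database" and "effective search space" lines of the footer. Bit scores computed this way may differ slightly from the reported ones for other hits, since the footer gives Lambda and K to only three significant figures.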