BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_019418.1_cdsid_YP_006990344.1 [gene=phiNJ2_0025]
[protein=putative phage terminase large subunit]
[protein_id=YP_006990344.1] [location=complement(20795..22108)]
         (437 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits)  Value

gi|19292|lcl|protein:vir:4781 Length: 436 # NCBI annotation: put...   719   0.0
gi|5794|lcl|protein:vir:98922 Length: 453 # NCBI annotation: ter...   583   e-168
gi|13020|lcl|protein:vir:80960 Length: 443 # NCBI annotation: Te...   453   e-129
gi|15836|lcl|protein:vir:37 Length: 443 # NCBI annotation: putat...   442   e-126
gi|655|lcl|protein:vir:1588 Length: 447 # NCBI annotation: putat...   433   e-123
gi|5891|lcl|protein:vir:107098 Length: 421 # NCBI annotation: pu...   134   3e-33
gi|6284|lcl|protein:vir:95890 Length: 421 # NCBI annotation: ORF...   134   3e-33
gi|7569|lcl|protein:vir:96300 Length: 421 # NCBI annotation: ORF...   133   4e-33
gi|10483|lcl|protein:vir:105297 Length: 446 # NCBI annotation: p...   132   5e-33
gi|7076|lcl|protein:vir:96147 Length: 419 # NCBI annotation: ORF...   132   8e-33
gi|4983|lcl|protein:vir:95058 Length: 421 # NCBI annotation: ORF...   130   2e-32
gi|10354|lcl|protein:vir:97448 Length: 421 # NCBI annotation: OR...   130   2e-32
gi|3182|lcl|protein:vir:94499 Length: 421 # NCBI annotation: ORF...   130   2e-32
gi|8730|lcl|protein:vir:96840 Length: 421 # NCBI annotation: ORF...   124   3e-30
gi|18074|lcl|protein:vir:5954 Length: 422 # NCBI annotation: hyp...   118   1e-28
gi|9289|lcl|protein:vir:97165 Length: 431 # NCBI annotation: ORF...    87   6e-19
gi|3326|lcl|protein:vir:94525 Length: 440 # NCBI annotation: put...    84   3e-18
gi|15353|lcl|protein:vir:9921 Length: 425 # NCBI annotation: put...    80   7e-17
gi|11736|lcl|protein:vir:79016 Length: 417 # NCBI annotation: pu...    75   1e-15
gi|6093|lcl|protein:vir:95761 Length: 432 # NCBI annotation: ter...    74   3e-15
gi|10799|lcl|protein:vir:78055 Length: 440 # NCBI annotation: Te...    74   4e-15
gi|5321|lcl|protein:vir:99534 Length: 422 # NCBI annotation: put...    72   1e-14
gi|4914|lcl|protein:vir:99572 Length: 459 # NCBI annotation: Ter...    72   2e-14
gi|17761|lcl|protein:vir:171 Length: 470 # NCBI annotation: term...    70   5e-14
gi|5142|lcl|protein:vir:105417 Length: 470 # NCBI annotation: ge...    70   5e-14
gi|17691|lcl|protein:vir:9305 Length: 447 # NCBI annotation: lar...    66   9e-13
gi|7301|lcl|protein:vir:96238 Length: 447 # NCBI annotation: ORF...    66   1e-12
gi|9829|lcl|protein:vir:103950 Length: 425 # NCBI annotation: ph...    66   1e-12
gi|7643|lcl|protein:vir:96370 Length: 447 # NCBI annotation: ORF...    65   1e-12
gi|11588|lcl|protein:vir:78831 Length: 447 # NCBI annotation: te...    65   1e-12
gi|5208|lcl|protein:vir:106641 Length: 446 # NCBI annotation: OR...    65   2e-12
gi|2779|lcl|protein:vir:99776 Length: 425 # NCBI annotation: lar...    65   2e-12
gi|13903|lcl|protein:vir:9870 Length: 273 # NCBI annotation: put...    61   3e-11
gi|6775|lcl|protein:vir:96067 Length: 460 # NCBI annotation: con...    59   1e-10
gi|16299|lcl|protein:vir:3027 Length: 375 # NCBI annotation: ter...    45   2e-06
gi|16025|lcl|protein:vir:9814 Length: 403 # NCBI annotation: put...    45   2e-06
gi|1165|lcl|protein:vir:102941 Length: 421 # NCBI annotation: te...    42   1e-05
gi|11622|lcl|protein:vir:78869 Length: 439 # NCBI annotation: Te...    39   1e-04
gi|14466|lcl|protein:vir:3519 Length: 469 # NCBI annotation: P18...    39   2e-04
gi|6886|lcl|protein:vir:106551 Length: 424 # NCBI annotation: pu...    37   4e-04
gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: g...    30   0.044
gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: g...    30   0.063
gi|4089|lcl|protein:vir:94598 Length: 581 # NCBI annotation: PfW...    28   0.24
gi|12378|lcl|protein:vir:79648 Length: 523 # NCBI annotation: Te...    28   0.28
gi|6559|lcl|protein:vir:104336 Length: 523 # NCBI annotation: pu...    27   0.53
gi|13199|lcl|protein:vir:81178 Length: 567 # NCBI annotation: pu...    27   0.55
gi|2931|lcl|protein:vir:105460 Length: 409 # NCBI annotation: pu...    26   0.87
gi|19539|lcl|protein:vir:10320 Length: 627 # NCBI annotation: OR...    23   5.8
gi|3763|lcl|protein:vir:107694 Length: 527 # NCBI annotation: pu...    23   7.9
gi|17128|lcl|protein:vir:1379 Length: 581 # NCBI annotation: put...    23   8.6

>gi|19292|lcl|protein:vir:4781 Length: 436 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150161;swissprot:trembl:q94m50;genbank:gi:15088772;goa:Q94M50;uniprot:Q94M50;genbank:GeneID:955976
          Length = 436

 Score = 719 bits (1857), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/432 (78%), Positives = 384/432 (88%) Query: 5 DIQKNVNPHFKSVWLSKKPYNILKGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRK 64 ++QKN+NPHFKSVW+S PYN+LKGGRNSFKSSVI LKL MM+ YI+ GE ANIV+IRK Sbjct: 4 NVQKNINPHFKSVWISSLPYNVLKGGRNSFKSSVIVLKLAYMMIRYIIAGEAANIVVIRK 63 Query: 65 VANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKSNNI 124 VANTIRDSV+N++ W L LFG+ +F TVSPFKI HK TGSTFYFYG DD+QKLKSN+I Sbjct: 64 VANTIRDSVFNKVWWALNLFGIAEQFTKTVSPFKIVHKTTGSTFYFYGQDDFQKLKSNDI 123 Query: 125 GNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWINEWYER 184 GNII VWYEEAAEF+ E+FDQ+N+TFMRQKHP A FVQ FWSYNPP NPYSWINEW+E Sbjct: 124 GNIIPVWYEEAAEFNDQEDFDQSNVTFMRQKHPRAKFVQFFWSYNPPRNPYSWINEWFES 183 Query: 185 MNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLGNNIYNMS 244 + T NYL HSSTYLDDELGFV EQML DIERIKENDYDYYRY+YLGE VGLGNN+YNMS Sbjct: 184 IKTNKNYLAHSSTYLDDELGFVTEQMLEDIERIKENDYDYYRYLYLGEAVGLGNNVYNMS 243 Query: 245 TFHPLDALPSDDRLIGISFALDGGHQQSATACCAFGVTAKGKVILLDTWYYSPAGQAIKK 304 FH +DA PSDD+LIGISFALDGGHQQSATACCAFG+TAKGKVILLDTWYYSPAGQ +KK Sbjct: 244 MFHAIDACPSDDKLIGISFALDGGHQQSATACCAFGITAKGKVILLDTWYYSPAGQVVKK 303 Query: 305 APSQLSQDIYYFTTKVISKYKVPILQYTIDSAEGALRNQMYLDFAIRWHPVAKLKKVTMI 364 APSQLS++IY + VI KY+V LQYTIDSAEGALRNQM+LDF ++WHPVAKL+KVTMI Sbjct: 304 APSQLSKEIYAYMRSVIEKYRVQALQYTIDSAEGALRNQMFLDFGLKWHPVAKLRKVTMI 363 Query: 365 DTFQSLLAQGRFYYLDTDNNKVFIEEHKMYRWDEKTIQSDNPNVIKDDDHTCDVAQYFVL 424 D+FQSLLAQGRFYYL+T+NNK+FIEEHKMYRWDEKTI+SDNP+VIK+DDHTCD QYFVL Sbjct: 364 DSFQSLLAQGRFYYLNTENNKIFIEEHKMYRWDEKTIKSDNPSVIKEDDHTCDTTQYFVL 423 Query: 425 DNAKILSLRVGN 436 DNAK+L LRVGN Sbjct: 424 DNAKLLGLRVGN 435 >gi|5794|lcl|protein:vir:98922 Length: 453 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164412;genbank:gi:56694902;genbank:GeneID :3197313 Length = 453 Score = 583 bits (1502), Expect = e-168, Method: Compositional matrix adjust. 
Identities = 288/444 (64%), Positives = 348/444 (78%), Gaps = 12/444 (2%) Query: 3 VIDIQKNVNPHFKSVWLSKKPYNILKGGRNSFKSSVITLKLIIMMVWYIVRGETANIVII 62 + ++Q+N+NPHFK VW S KPYNILKGGRNSFKSSVI LKL+ MM+ YI++GE AN+V+I Sbjct: 10 MFNVQENINPHFKEVWTSSKPYNILKGGRNSFKSSVIALKLVFMMLLYILKGEKANVVVI 69 Query: 63 RKVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKSN 122 RKV NTIRDSV+N+IQW + LFGLT RFK TVSPFKITHK+TGSTFYFYG DD+QKLKSN Sbjct: 70 RKVGNTIRDSVFNKIQWAIKLFGLTRRFKPTVSPFKITHKRTGSTFYFYGQDDFQKLKSN 129 Query: 123 NIGNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWINEWY 182 +I +IIAVWYEEAAEF+S EEFDQ+N+TFMRQKHPLA FVQ FWSYNPP NPY WINEW Sbjct: 130 DIEDIIAVWYEEAAEFASEEEFDQSNVTFMRQKHPLAEFVQFFWSYNPPRNPYHWINEWA 189 Query: 183 ERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLGNNIYN 242 ++M ++YL H S+YLDD+LGFV QML DIERIK ND+DYYRY+YLGEPVGLG N+YN Sbjct: 190 DKMVGEEDYLVHESSYLDDQLGFVTGQMLKDIERIKNNDHDYYRYIYLGEPVGLGTNVYN 249 Query: 243 MSTFHPLDALPSDDRLIGISFALDGGHQQSATACCAFGVTAKGKVILLDTWYYSPAGQAI 302 M+ F PLD LPSDDR+I + +++DGGH SATAC +G+TA+GKVI L+T+YYSPAG+ Sbjct: 250 MNLFKPLDQLPSDDRVIALFYSVDGGHAHSATACGFYGLTARGKVIRLNTYYYSPAGRVR 309 Query: 303 KKAPSQLSQDIYYFTTKVISK---YKVPILQYTIDSAEGALRNQMYLDFAIRWHPVAKLK 359 KKAPS+LS+D++ F T + I + TID AE A+RNQ Y D+ W PV K K Sbjct: 310 KKAPSELSKDLHDFVTATAKQEYWKGARIQKRTIDDAEAAIRNQYYADYGQYWLPVGKKK 369 Query: 360 KVTMIDTFQSLLAQGRFYYL---------DTDNNKVFIEEHKMYRWDEKTIQSDNPNVIK 410 K+ MID LLAQGRFYYL D+N +FIEEHK Y++DEKT+ SD+P VIK Sbjct: 370 KIDMIDYVHDLLAQGRFYYLTNPYPTGLEHCDSNDIFIEEHKKYQFDEKTLNSDDPKVIK 429 Query: 411 DDDHTCDVAQYFVLDNAKILSLRV 434 +DDHT D QYF NA+ L L+V Sbjct: 430 EDDHTVDEFQYFCTANARDLRLKV 453 >gi|13020|lcl|protein:vir:80960 Length: 443 # NCBI annotation: TerL # Family: family:all:54 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468388;genbank:gi:157324962;genbank:Ge neID:5601395 Length = 443 Score = 453 bits (1166), Expect = e-129, Method: Compositional matrix adjust. 
Identities = 229/434 (52%), Positives = 300/434 (69%), Gaps = 6/434 (1%) Query: 2 KVIDIQKNVNPHFKSVWLSKKPYNILKGGRNSFKSSVITLKLIIMMVWYIVRGETANIVI 61 +VI++ +NP F +WLSK + I KGGR+S KSSVI+LKL+ + +N+V Sbjct: 13 QVINVTDMINPAFYDLWLSKHNHIIAKGGRSSMKSSVISLKLVEKKM----ANPMSNMVC 68 Query: 62 IRKVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKS 121 +RKVANT+ SVY QI+W L G+ +F SP +I HK+ G+ FYF G DD KLKS Sbjct: 69 LRKVANTLYKSVYQQIKWALYEMGVADQFNFGKSPMEIIHKEWGTGFYFSGCDDPAKLKS 128 Query: 122 NNI--GNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWIN 179 I G + +W+EE AEFS + D TF+R+ P V I+ S+NPP NPY W+N Sbjct: 129 MKIPVGYVSDLWFEELAEFSGVTDIDVVEDTFIREDLPQGQEVTIYMSFNPPRNPYEWVN 188 Query: 180 EWYERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLGNN 239 E+ + + D+YL H +TYLDDE GF+++Q++ IE+ K+ND DYYR++YLGE +GLG+N Sbjct: 189 EYVDSKRSDDDYLIHHTTYLDDEKGFLSKQIIKKIEKYKKNDLDYYRWMYLGEVIGLGDN 248 Query: 240 IYNMSTFHPLDALPSDDRLIGISFALDGGHQQSATACCAFGVTAKGKVILLDTWYYSPAG 299 +YNM+ F PL A+P+DDRLI I FA+D GHQ SAT C A G TAK VILLDT+YYSPA Sbjct: 249 VYNMNLFQPLKAIPADDRLILIDFAIDTGHQVSATTCLALGFTAKRNVILLDTYYYSPAN 308 Query: 300 QAIKKAPSQLSQDIYYFTTKVISKYKVPILQYTIDSAEGALRNQMYLDFAIRWHPVAKLK 359 Q +KKAPS S+++ F TKV+SKY P+ T+DSAEG LRNQ Y D+ + HPVAK K Sbjct: 309 QVVKKAPSDYSKELREFMTKVVSKYNAPVDMQTVDSAEGGLRNQYYKDYGVSLHPVAKGK 368 Query: 360 KVTMIDTFQSLLAQGRFYYLDTDNNKVFIEEHKMYRWDEKTIQSDNPNVIKDDDHTCDVA 419 KV M+D LLAQGRFYYLD N++FIEEH+ Y+WD KT+ +D P VIK+DDHTCD Sbjct: 369 KVDMVDFVCDLLAQGRFYYLDIPENQIFIEEHRKYQWDVKTVNTDKPEVIKEDDHTCDAF 428 Query: 420 QYFVLDNAKILSLR 433 QY+V DN + L L+ Sbjct: 429 QYYVKDNLRKLGLK 442 >gi|15836|lcl|protein:vir:37 Length: 443 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463463;swissprot:trembl:q9t1c1;genbank:gi :16798785;goa:Q9T1C1;uniprot:Q9T1C1;genbank:GeneID:92238 4 Length = 443 Score = 442 bits (1138), Expect = e-126, Method: Compositional matrix adjust. 
Identities = 225/434 (51%), Positives = 299/434 (68%), Gaps = 6/434 (1%) Query: 2 KVIDIQKNVNPHFKSVWLSKKPYNILKGGRNSFKSSVITLKLIIMMVWYIVRGETANIVI 61 ++I++ +NP F +WLSK + I KGGR+S KSSVI+LKL+ + +N+V Sbjct: 13 QIINVIDMINPAFYDLWLSKHNHIIAKGGRSSMKSSVISLKLVEKKM----ANPMSNMVC 68 Query: 62 IRKVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKS 121 +RKVANT+ SVY QI+W L G+ +FK SP +I HK G+ FYF G DD KLKS Sbjct: 69 LRKVANTLYKSVYQQIKWALYEMGVADQFKFGKSPMEIVHKTWGTGFYFSGCDDPAKLKS 128 Query: 122 NNI--GNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWIN 179 I G + +W+EE AEFS + D TF+R+ P V I+ S+NPP NPY W+N Sbjct: 129 MKIPVGYVSGLWFEELAEFSGVTDIDVVEDTFIREDLPQGQEVTIYMSFNPPRNPYEWVN 188 Query: 180 EWYERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLGNN 239 E+ + + D+YL H +TYLDDE GF+++Q++ IE+ K+ND DYYR++YLGE +GLG+N Sbjct: 189 EYVDSKRSDDDYLIHHTTYLDDEKGFLSKQIIKKIEKYKKNDLDYYRWMYLGEVIGLGDN 248 Query: 240 IYNMSTFHPLDALPSDDRLIGISFALDGGHQQSATACCAFGVTAKGKVILLDTWYYSPAG 299 +YNM+ F PL A+P+DDRLI I FA+D GHQ SAT +FG+TAK VILL+T+YYSPA Sbjct: 249 VYNMNLFQPLKAIPADDRLILIDFAIDTGHQVSATTYLSFGLTAKRNVILLNTYYYSPAN 308 Query: 300 QAIKKAPSQLSQDIYYFTTKVISKYKVPILQYTIDSAEGALRNQMYLDFAIRWHPVAKLK 359 Q +KKAPS+ S+++ F TKV+ Y + T+DSAEG LRNQ Y D+ + HPVAK K Sbjct: 309 QVVKKAPSEYSKELRDFMTKVVGNYNTNVDMQTVDSAEGGLRNQYYKDYGVSLHPVAKGK 368 Query: 360 KVTMIDTFQSLLAQGRFYYLDTDNNKVFIEEHKMYRWDEKTIQSDNPNVIKDDDHTCDVA 419 KV MID LLAQGRFYYLD N++FIEEH+ Y+WD KTI +D P V+K+DDHTCD Sbjct: 369 KVDMIDFVCDLLAQGRFYYLDIPENQIFIEEHRKYQWDVKTINTDKPEVVKEDDHTCDAF 428 Query: 420 QYFVLDNAKILSLR 433 QY+V DN + L L+ Sbjct: 429 QYYVKDNLRKLGLK 442 >gi|655|lcl|protein:vir:1588 Length: 447 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695170;swissprot:trembl:o03927;genbank:gi :23455799;goa:O03927;interpro:IPR006437;interpro:IPR0067 01;interpro:IPR011441;uniprot:O03927;genbank:GeneID:9555 65 Length = 447 Score = 433 bits (1113), Expect = e-123, Method: Compositional matrix 
adjust. Identities = 211/434 (48%), Positives = 288/434 (66%), Gaps = 2/434 (0%) Query: 2 KVIDIQKNVNPHFKSVWLSKKPYNILKGGRNSFKSSVITLKLIIMMVWYIVRGETANIVI 61 KVI I +NPHFK +W + KPY + GGR SFKSSVI+LKL+ M+ I++ AN++ Sbjct: 13 KVIKISDLINPHFKRMWTTDKPYIVANGGRGSFKSSVISLKLVTMVKKAIMQHRKANVIA 72 Query: 62 IRKVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKS 121 + + + D+VYNQIQW L + + + F SP I HK+TGS+FYFYG D+ KLKS Sbjct: 73 VLANKSDLHDTVYNQIQWALSMLDMDNEFIAYKSPLTIQHKRTGSSFYFYGADNPYKLKS 132 Query: 122 NNIGNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWINEW 181 N +G+++AVWYEEAA S++ FDQ N TF+RQK V++F+SYNPP NPY WINEW Sbjct: 133 NIVGDVVAVWYEEAANMKSSDVFDQANPTFIRQKPEWLDQVKVFYSYNPPKNPYDWINEW 192 Query: 182 YERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLGNNIY 241 ++++ DNYL +S Y D GF ++Q L IE+ K+NDY+YYR++YLGE +GLG +IY Sbjct: 193 IDKVSKDDNYLIDTSDYRCDVRGFTSKQTLDLIEQYKKNDYEYYRWLYLGEVIGLGTSIY 252 Query: 242 NMSTFHPLDALPSDDRLIGISFALDGGHQQSATACCAFGVTAKGKVILLDTWYYSPAGQA 301 N S PL+ P DD + + F+ D G Q SAT +TAK +VILLDT+YYSPA Q+ Sbjct: 253 NPSLLKPLEVFPDDDYIKSLYFSQDSGQQVSATTELCIALTAKKRVILLDTYYYSPAHQS 312 Query: 302 IKKAPSQLSQDIYYFTTKVISKYKVPILQYTIDSAEG--ALRNQMYLDFAIRWHPVAKLK 359 +KK PS+L+ ++Y F ++ + + D A A+ ++ + + WH V K++ Sbjct: 313 VKKPPSELADELYAFEDSREKQWHKKAWKRSADEATSDYAIDHEYFKKYGRHWHHVNKIE 372 Query: 360 KVTMIDTFQSLLAQGRFYYLDTDNNKVFIEEHKMYRWDEKTIQSDNPNVIKDDDHTCDVA 419 K MID Q LLA GRFYYLD N++FI+EH+ Y+WD T++SD P VIK DDHTCD Sbjct: 373 KTAMIDHVQDLLATGRFYYLDNKANQIFIDEHRKYQWDGDTLESDKPKVIKVDDHTCDAF 432 Query: 420 QYFVLDNAKILSLR 433 QYFVLDN + L LR Sbjct: 433 QYFVLDNLRDLELR 446 >gi|5891|lcl|protein:vir:107098 Length: 421 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950600;genbank:gi:119953680;genbank:GeneI D:4643107 Length = 421 Score = 134 bits (336), Expect = 3e-33, Method: Compositional matrix adjust. 
Identities = 91/293 (31%), Positives = 150/293 (51%), Gaps = 18/293 (6%) Query: 4 IDIQKNVNPHFKSVWLSKKPYNIL----KGGRNSFKSSVITLKLIIMMVWYIVRGETANI 59 I++ + + HF S+W + K L KGGR S KSS I++ + +++ Y + N Sbjct: 5 INLSELLPKHFHSLWKATKDREKLNIVAKGGRGSGKSSDISIIITQLIMRYPM-----NA 59 Query: 60 VIIRKVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKL 119 V++RK NT+ SV+ QI+W + ++ FK+ VSP +IT+ G+ F G + ++L Sbjct: 60 VVVRKTDNTLATSVFEQIKWAIEEQKVSHLFKVKVSPMEITYVPRGNRIIFRGAQNPERL 119 Query: 120 KS--NNIGNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSW 177 KS ++ +W EE AEF + +E + +R + F + F+SYNPP SW Sbjct: 120 KSLKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDDGLFYKFFFSYNPPKRKQSW 179 Query: 178 INEWYERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLG 237 +N+ YE DN H STYLD+ F+++Q + + E KE + YR+ Y+GE +G G Sbjct: 180 VNKKYETSFQPDNTFVHHSTYLDN--PFISKQFIQEAESAKERNEQRYRWEYMGEAIGSG 237 Query: 238 NNIYNMSTFHPLDALPSD--DRLIGISFALDGGHQQSATACCAFGVTAKGKVI 288 +N ++ +P D I A+D G+ A + K ++I Sbjct: 238 VVPFNNLQ---IEKIPDDLYKTFDNIRNAVDFGYATDPLAFVRWHYDKKKRII 287 >gi|6284|lcl|protein:vir:95890 Length: 421 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240381;genbank:gi:66396048;genbank:GeneID :5133401 Length = 421 Score = 134 bits (336), Expect = 3e-33, Method: Compositional matrix adjust. 
Identities = 90/284 (31%), Positives = 145/284 (51%), Gaps = 18/284 (6%) Query: 13 HFKSVWLSKKPYNIL----KGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRKVANT 68 HF+ +W + K IL KGGR S KSS I++ + +++ Y + N V+IRK NT Sbjct: 14 HFRPLWKATKDKGILNIIAKGGRGSGKSSDISIIITQLIMRYPM-----NAVVIRKTDNT 68 Query: 69 IRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKS--NNIGN 126 + SV+ QI+W + ++ FK+ VSP +IT+ G+ F G + ++LKS ++ Sbjct: 69 LATSVFEQIKWAIEEQKVSHLFKVKVSPMEITYIPRGNRIIFRGAQNPERLKSLKDSRFP 128 Query: 127 IIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWINEWYERMN 186 W EE AEF + +E + +R + F + F+SYNPP SW+N+ YE Sbjct: 129 FSVAWIEELAEFKTEDEVTTITNSLLRGELDEGLFYKFFFSYNPPKRKQSWVNKKYESSF 188 Query: 187 TMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLGNNIYNMSTF 246 DN H STYL++ F+++Q + + E K+ + YR+ Y+GE +G G +N Sbjct: 189 QADNTFVHHSTYLNN--PFISKQFIQEAESAKKRNEQRYRWEYMGEAIGSGVVPFNNLR- 245 Query: 247 HPLDALPSD--DRLIGISFALDGGHQQSATACCAFGVTAKGKVI 288 ++ +P D I A+D G+ A + K +VI Sbjct: 246 --IEEIPQGQYDTFDNIRNAVDFGYATDPLAFVRWHYDKKKRVI 287 >gi|7569|lcl|protein:vir:96300 Length: 421 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240307;genbank:gi:66395973;genbank:GeneID :5133377 Length = 421 Score = 133 bits (335), Expect = 4e-33, Method: Compositional matrix adjust. 
Identities = 90/284 (31%), Positives = 145/284 (51%), Gaps = 18/284 (6%) Query: 13 HFKSVWLSKKPYNIL----KGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRKVANT 68 HF +W + K ++L KGGR S KSS I++ + +++ Y + N V+IRK NT Sbjct: 14 HFHPLWKATKDKDLLNIIAKGGRGSGKSSDISIIITQLIMRYPM-----NAVVIRKTDNT 68 Query: 69 IRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKS--NNIGN 126 + SV+ QI+W + +T FK+ VSP +IT+ G+ F G + ++LKS ++ Sbjct: 69 LATSVFEQIKWAIEEQKVTHLFKVKVSPMEITYIPRGNRIIFRGAQNPERLKSLKDSRFP 128 Query: 127 IIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWINEWYERMN 186 W EE AEF + +E + +R + F + F+SYNPP SW+N+ YE Sbjct: 129 FSIAWIEELAEFKTEDEVTTITNSLLRGELDEGLFYKFFFSYNPPKRKQSWVNKKYESSF 188 Query: 187 TMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLGNNIYNMSTF 246 DN H STYL++ F+++Q + + E K+ + YR+ Y+GE +G G +N Sbjct: 189 QADNTFVHHSTYLNN--PFISKQFIQEAESAKKRNEQRYRWEYMGEAIGSGVVPFNNLR- 245 Query: 247 HPLDALPSD--DRLIGISFALDGGHQQSATACCAFGVTAKGKVI 288 ++ +P D I A+D G+ A + K +VI Sbjct: 246 --IEEIPQGQYDTFDNIRNAVDFGYATDPLAFVRWHYDKKKRVI 287 >gi|10483|lcl|protein:vir:105297 Length: 446 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950664;genbank:gi:119967835;genbank:GeneI D:4643176 Length = 446 Score = 132 bits (333), Expect = 5e-33, Method: Compositional matrix adjust. 
Identities = 82/243 (33%), Positives = 131/243 (53%), Gaps = 13/243 (5%) Query: 1 MKVIDIQKNVNPHFKSVWLSKKPYNIL----KGGRNSFKSSVITLKLIIMMVWYIVRGET 56 M I + + + HF S+W + K L KGGR S KSS I++ + +++ Y + Sbjct: 1 MISIKLSELLPKHFHSLWKATKDREKLNIVAKGGRGSGKSSDISIIITQLIMRYPM---- 56 Query: 57 ANIVIIRKVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDY 116 N V++RK NT+ SV+ QI+W + ++ FK+ VSP +IT+ G+ F G + Sbjct: 57 -NAVVVRKADNTLATSVFEQIKWAIEEQKVSHLFKVKVSPMEITYVPRGNRIIFRGAQNP 115 Query: 117 QKLKS--NNIGNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINP 174 ++LKS ++ +W EE AEF + +E + +R + F + F+SYNPP Sbjct: 116 ERLKSLKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDDGLFYKFFFSYNPPKRK 175 Query: 175 YSWINEWYERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPV 234 SW+N+ YE DN H STYLD+ F+++Q + + E KE + YR+ Y+GE + Sbjct: 176 QSWVNKKYETSFQPDNTFVHHSTYLDNP--FISKQFIQEAESAKERNEQRYRWEYMGEAI 233 Query: 235 GLG 237 G G Sbjct: 234 GSG 236 >gi|7076|lcl|protein:vir:96147 Length: 419 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240074;genbank:gi:66395738;genbank:GeneID :5133130 Length = 419 Score = 132 bits (332), Expect = 8e-33, Method: Compositional matrix adjust. 
Identities = 87/284 (30%), Positives = 145/284 (51%), Gaps = 18/284 (6%) Query: 13 HFKSVWLSKKPYNIL----KGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRKVANT 68 HF S+W + K L KGGR S KSS I + ++++++ Y V N +I+RK+ NT Sbjct: 12 HFHSLWHAAKDKGKLNIVAKGGRGSGKSSDIAIIIVLLIMRYPV-----NALILRKIDNT 66 Query: 69 IRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKSNNIGNI- 127 + SV+ QI+W + + G++ FK+ VSP +IT+ G+ F G + +++KS Sbjct: 67 LALSVFEQIKWAINVMGVSHLFKIKVSPMEITYVPRGNKMVFRGAQNPERIKSLKDAQFP 126 Query: 128 -IAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWINEWYERMN 186 W EE AEF + +E + +R + F + F++YNPP SW+N+ YE Sbjct: 127 YAIAWIEELAEFKTEDEVTTITNSLLRGELDNGLFYKFFYTYNPPKRKQSWVNKKYESSF 186 Query: 187 TMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLGNNIYNMSTF 246 DN H STYL++ F+ ++ + + + K + YR+ YLGE +G G +N Sbjct: 187 QPDNTFVHHSTYLNNP--FIAKEFIEEAKAAKAINELRYRWEYLGEAIGSGVVPFNNLR- 243 Query: 247 HPLDALPSD--DRLIGISFALDGGHQQSATACCAFGVTAKGKVI 288 ++ +P + D I A+D G+ A + K ++I Sbjct: 244 --IETIPKEQFDTFDNIRNAVDFGYATDPLAFVRWHYDKKKRII 285 >gi|4983|lcl|protein:vir:95058 Length: 421 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240816;genbank:gi:66394679;genbank:GeneID :5133852 Length = 421 Score = 130 bits (328), Expect = 2e-32, Method: Compositional matrix adjust. 
Identities = 90/284 (31%), Positives = 144/284 (50%), Gaps = 18/284 (6%) Query: 13 HFKSVWLSKKPYNIL----KGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRKVANT 68 HF +W + K IL KGGR S KSS I++ + +++ Y + N V+IRK NT Sbjct: 14 HFHPLWKATKDKEILNIVAKGGRGSGKSSDISIIITQLIMRYPM-----NAVVIRKTDNT 68 Query: 69 IRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKS--NNIGN 126 + SV+ QI+W + ++ FK+ VSP +IT+ G+ F G + ++LKS ++ Sbjct: 69 LATSVFEQIKWAIEEQKVSHLFKVKVSPMEITYIPRGNRIIFRGAQNPERLKSLKDSRFP 128 Query: 127 IIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWINEWYERMN 186 W EE AEF + +E + +R + F + F+SYNPP SW+N+ YE Sbjct: 129 FSISWIEELAEFKTEDEVTTITNSLLRGELDEGLFYKFFFSYNPPKRKQSWVNKKYESSF 188 Query: 187 TMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLGNNIYNMSTF 246 DN H STYL++ F+++Q + + E K+ + YR+ Y+GE +G G +N Sbjct: 189 QADNTYVHHSTYLNN--PFISKQFIQEAESAKKRNEQRYRWEYMGEAIGSGVVPFNNLR- 245 Query: 247 HPLDALPSD--DRLIGISFALDGGHQQSATACCAFGVTAKGKVI 288 ++ +P D I A+D G+ A + K +VI Sbjct: 246 --IEEIPQRQYDTFDNIRNAVDFGYATDPLAFVRWHYDKKKRVI 287 >gi|10354|lcl|protein:vir:97448 Length: 421 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240743;genbank:gi:66396415;genbank:GeneID :5133804 Length = 421 Score = 130 bits (328), Expect = 2e-32, Method: Compositional matrix adjust. 
Identities = 91/291 (31%), Positives = 145/291 (49%), Gaps = 18/291 (6%) Query: 13 HFKSVWLSKKPYNIL----KGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRKVANT 68 HF +W K +L KGGR S KSS I++ + +++ Y + N V+IRK NT Sbjct: 14 HFHPLWKVTKDKEVLNVVAKGGRGSGKSSDISIIITQLIMRYPM-----NAVVIRKTDNT 68 Query: 69 IRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKS--NNIGN 126 + SV+ QI+W + ++ FK+ VSP +IT+ G+ F G + ++LKS ++ Sbjct: 69 LATSVFEQIKWAIEEQKVSHLFKVKVSPMEITYIPRGNRIIFRGAQNPERLKSLKDSRFP 128 Query: 127 IIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWINEWYERMN 186 W EE AEF + +E + +R + F + F+SYNPP SW+N+ YE Sbjct: 129 FSIAWIEELAEFKTEDEVTTITNSLLRGELDEGLFYKFFFSYNPPKRKQSWVNKKYESSF 188 Query: 187 TMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLGNNIYNMSTF 246 DN H STYL++ F+++Q + + E K+ + YR+ Y+GE +G G +N Sbjct: 189 QADNTYVHHSTYLNN--PFISKQFIQEAESAKKRNEQRYRWEYMGEAIGSGVVPFNNLR- 245 Query: 247 HPLDALPSD--DRLIGISFALDGGHQQSATACCAFGVTAKGKVILLDTWYY 295 ++ +P D I A+D G+ A + K +VI YY Sbjct: 246 --IEEIPQRQYDTFDNIRNAVDFGYATDPLAFVRWHYDKKKRVIYAMDEYY 294 >gi|3182|lcl|protein:vir:94499 Length: 421 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240671;genbank:gi:66396341;genbank:GeneID :5133763 Length = 421 Score = 130 bits (328), Expect = 2e-32, Method: Compositional matrix adjust. 
Identities = 91/291 (31%), Positives = 145/291 (49%), Gaps = 18/291 (6%) Query: 13 HFKSVWLSKKPYNIL----KGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRKVANT 68 HF +W K +L KGGR S KSS I++ + +++ Y + N V+IRK NT Sbjct: 14 HFHPLWKVTKDKEVLNVVAKGGRGSGKSSDISIIITQLIMRYPM-----NAVVIRKTDNT 68 Query: 69 IRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKS--NNIGN 126 + SV+ QI+W + ++ FK+ VSP +IT+ G+ F G + ++LKS ++ Sbjct: 69 LATSVFEQIKWAIEEQKVSHLFKVKVSPMEITYIPRGNRIIFRGAQNPERLKSLKDSRFP 128 Query: 127 IIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWINEWYERMN 186 W EE AEF + +E + +R + F + F+SYNPP SW+N+ YE Sbjct: 129 FSIAWIEELAEFKTEDEVTTITNSLLRGELDEGLFYKFFFSYNPPKRKQSWVNKKYESSF 188 Query: 187 TMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLGNNIYNMSTF 246 DN H STYL++ F+++Q + + E K+ + YR+ Y+GE +G G +N Sbjct: 189 QADNTYVHHSTYLNN--PFISKQFIQEAESAKKRNEQRYRWEYMGEAIGSGVVPFNNLR- 245 Query: 247 HPLDALPSD--DRLIGISFALDGGHQQSATACCAFGVTAKGKVILLDTWYY 295 ++ +P D I A+D G+ A + K +VI YY Sbjct: 246 --IEEIPQRQYDTFDNIRNAVDFGYATDPLAFVRWHYDKKKRVIYAMDEYY 294 >gi|8730|lcl|protein:vir:96840 Length: 421 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240151;genbank:gi:66395816;genbank:GeneID :5133181 Length = 421 Score = 124 bits (310), Expect = 3e-30, Method: Compositional matrix adjust. 
Identities = 91/298 (30%), Positives = 144/298 (48%), Gaps = 14/298 (4%) Query: 4 IDIQKNVNPHFKSVWLSKKPYNILKGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIR 63 I++ + + HF +W + K NIL + S + + I++ I+R N V++R Sbjct: 5 INLSELIPEHFHDLWRATKDPNILNVVGKGGRGSGKSSDISIIITQLIMRY-PMNAVVVR 63 Query: 64 KVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKS-- 121 K NT+ SV+ QI+W + ++ FK+ VSP +IT+ G+ F G + ++LKS Sbjct: 64 KTDNTLATSVFEQIKWAIEQQKVSHLFKVKVSPMEITYMPRGNRIIFRGAQNPERLKSLK 123 Query: 122 NNIGNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWINEW 181 ++ +W EE AEF + +E + +R + F + F+SYNPP SW+N+ Sbjct: 124 DSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDEGLFYKFFFSYNPPKRKQSWVNKK 183 Query: 182 YERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLGNNIY 241 YE DN H STYLD+ F+ +Q + + E KE + YR+ YLGE +G G Sbjct: 184 YESSFQPDNTFVHHSTYLDN--PFIAKQFIDEAEAAKERNELRYRWEYLGEAIGSG---- 237 Query: 242 NMSTFHPLDALPSDDRLI----GISFALDGGHQQSATACCAFGVTAKGKVILLDTWYY 295 + F+ L D L I A+D G+ A + K +VI YY Sbjct: 238 -VVPFNNLQIEKIPDELFRSFDNIRNAVDFGYATDPLAFVRWHYDKKKRVIYAVDEYY 294 >gi|18074|lcl|protein:vir:5954 Length: 422 # NCBI annotation: hypothetical protein # Family: family:all:54 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690654;genbank:geneid:6329138;genbank:gi: 22855048;goa:P54308;interpro:IPR006437;interpro:IPR00670 1;interpro:IPR011441;uniprot:P54308;genbank:GeneID:95525 4 Length = 422 Score = 118 bits (296), Expect = 1e-28, Method: Compositional matrix adjust. 
Identities = 75/243 (30%), Positives = 126/243 (51%), Gaps = 13/243 (5%) Query: 1 MKVIDIQKNVNPHFKSVWLSKKPYN----ILKGGRNSFKSSVITLKLIIMMVWYIVRGET 56 MK + + + PHF VW + K +LKGGR S KS+ I + +I++M+ + Sbjct: 1 MKKVRLSEKFTPHFLEVWRTVKAAQHLKYVLKGGRGSAKSTHIAMWIILLMMMMPI---- 56 Query: 57 ANIVIIRKVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDY 116 ++IR+V NT+ SV+ Q++ + + + +K++ SP ++T+ G++ F G DD Sbjct: 57 -TFLVIRRVYNTVEQSVFEQLKEAIDMLEVGHLWKVSKSPLRLTYIPRGNSIIFRGGDDV 115 Query: 117 QKLKSNNIGN--IIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINP 174 QK+KS + +W EE AEF + EE + +R + P F+SYNPP Sbjct: 116 QKIKSIKASKFPVAGMWIEELAEFKTEEEVSVIEKSVLRAELPPGCRYIFFYSYNPPKRK 175 Query: 175 YSWINEWYERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPV 234 SW+N+ + N STYL + F+++ + + E +K + YR+ YLGE + Sbjct: 176 QSWVNKVFNSSFLPANTFVDHSTYLQNP--FLSKAFIEEAEEVKRRNELKYRHEYLGEAL 233 Query: 235 GLG 237 G G Sbjct: 234 GSG 236 >gi|9289|lcl|protein:vir:97165 Length: 431 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239721;genbank:gi:66394878;genbank:GeneID :5130898 Length = 431 Score = 86.7 bits (213), Expect = 6e-19, Method: Compositional matrix adjust. 
Identities = 76/304 (25%), Positives = 136/304 (44%), Gaps = 18/304 (5%) Query: 2 KVIDIQKNVNPHFKSVWLSKKPYNILKGGRNSFKSSVITLKLIIMMVWYIVRGETANIVI 61 +++D++ + + W +K Y ++KG R S KS + LI + I++ + ANI++ Sbjct: 3 EILDLKNKIGGGYNKFWHNKNFYRVVKGSRGSKKSKTTAINLI----YRIMKYDWANILV 58 Query: 62 IRKVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKS 121 +R+ +NT + S Y ++W G+ FK S +IT+K TG F GLDD K+ S Sbjct: 59 VRRFSNTNKQSTYTDLKWATNQLGVAHLFKFNESLPEITYKPTGQKILFRGLDDPLKITS 118 Query: 122 NNI--GNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFV-QIFWSYNPPINPYSWI 178 + G + W+EEA + + +F T + +R + F QI ++NP + Sbjct: 119 ITVDTGILCWAWFEEAYQIETFAKF-STVVESIRGSYDSPEFFKQITVTFNPWSERHWLK 177 Query: 179 NEWYERMNTMDNYLCHSSTYLDDELGFVNEQM-LADIERIKE---NDYDYYRYVYLGEPV 234 +++ ++N ++TY VNE + DIER ++ + R V G+ Sbjct: 178 PTFFDEETKLNNTFSDTTTYR------VNEWLDKVDIERYEDLYIKNPRRARIVCDGDWG 231 Query: 235 GLGNNIYNMSTFHPLDALPSDDRLIGISFALDGGHQQSATACCAFGVTAKGKVILLDTWY 294 +++ D R I+ +D G Q T + V K K + + + Sbjct: 232 VAEGLVFDNFKVEDFDWFEEFKRTQEITHGMDFGFSQDPTTVVSTVVDLKNKKLFIYDEH 291 Query: 295 YSPA 298 Y A Sbjct: 292 YKKA 295 >gi|3326|lcl|protein:vir:94525 Length: 440 # NCBI annotation: putative large subunit terminase # Family: family:all:54 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223885;genbank:gi:62327097;genbank:GeneID :5075541 Length = 440 Score = 84.0 bits (206), Expect = 3e-18, Method: Compositional matrix adjust. 
Identities = 85/322 (26%), Positives = 143/322 (44%), Gaps = 27/322 (8%) Query: 20 SKKPYNILKGGRNSFKSSVITLKLII--MMVWYIVRGETANIVIIRKVANTIRDSVYNQI 77 S+ Y + KG R S KS K+II MM Y+ N ++ R+ A T +DS + I Sbjct: 33 SRDRYLVYKGSRGSGKSYATAAKVIIDIMMYPYV------NWLVTRQYATTQKDSTFATI 86 Query: 78 QWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKSNN--IGNIIAVWYEEA 135 + G+ FK T SP +IT+K+TG +F G+DD K+ S G I W EEA Sbjct: 87 RKVAHSMGVLDLFKFTKSPLEITYKQTGQKVFFRGMDDPLKITSIQPVTGFICRRWCEEA 146 Query: 136 AEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWINEWYERMNTMDNYLCHS 195 E S + FD + MR + P F Q ++NP + + +E+++ ++ + Sbjct: 147 YELKSLDAFDTVEES-MRGELPPGGFYQTVITFNPWSDRHWLKHEFFDDKTKRNHSRAIT 205 Query: 196 STYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLGNNIYNMSTFHPLDALPSD 255 +TY D++ +N + ++ + + + R LGE G+ + F D + Sbjct: 206 TTYKDND--HLNADYVDSLKEMLVRNPNRARVAVLGE-WGIAEGLVFDGLFEQRDFSYDE 262 Query: 256 DRLIGISFALDGGHQQSATACCAFGVTAKGKVILLDTWYYSPAGQAIKKAPSQLSQDIYY 315 + S LD G + TA V +++ + +Y +Q++Q+ Sbjct: 263 IANLPKSVGLDFGFKHDPTAGEFIAVDQDNRIVYIYDEFYKQ-----HLLTNQIAQE--- 314 Query: 316 FTTKVISKYKVPILQYTIDSAE 337 ++K+K L T DSAE Sbjct: 315 -----LAKHKAFGLPITADSAE 331 >gi|15353|lcl|protein:vir:9921 Length: 425 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795683;genbank:gi:28876465;genbank:GeneID :1257982 Length = 425 Score = 79.7 bits (195), Expect = 7e-17, Method: Compositional matrix adjust. 
Identities = 63/213 (29%), Positives = 113/213 (53%), Gaps = 12/213 (5%) Query: 29 GGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRKVANTIRDSVYNQIQWGLGLFGLTS 88 GG +S KS + K+I+ + + I+++RKV T+RDSV+ I L FG+ Sbjct: 40 GGASSGKSHGVFQKIILKALNPKFK-HPRKILVLRKVGATVRDSVFADIMSNLSYFGILD 98 Query: 89 RFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKSNNIGNIIAVWYEEAAEFSSAEEFDQTN 148 + K+ +S F+IT G+ F F G+D+ +K+KS I I V EEA+EF + +++ Q Sbjct: 99 KCKINMSAFRITL-PNGAEFIFKGMDNPEKIKS--IKGISDVVMEEASEF-TLDDYTQLT 154 Query: 149 ITFMRQKHPLAPFVQIFWSYNPPINPYSWINEWYERMNTMDNYLCHSSTYLDDELGFVNE 208 + +KH QI+ +N P++ +W+ + + + T N + + +TY D+ F+++ Sbjct: 155 LRLRDKKHLEK---QIYLMFN-PVSKVNWVYKAF-FVKTPKNTVVYQTTYKDNR--FLDD 207 Query: 209 QMLADIERIKENDYDYYRYVYLGEPVGLGNNIY 241 +IE + + YY+ LG+ L I+ Sbjct: 208 VTRENIEELANRNEAYYKIYALGQFATLDKLIF 240 >gi|11736|lcl|protein:vir:79016 Length: 417 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110720;genbank:gi:134287337;genbank:Ge neID:4955190 Length = 417 Score = 75.1 bits (183), Expect = 1e-15, Method: Compositional matrix adjust. 
Identities = 72/255 (28%), Positives = 120/255 (47%), Gaps = 21/255 (8%) Query: 6 IQKNVNPHFKSVWLSKKPYNILKGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRKV 65 ++ N NP FK +KK Y +KG S KS + I+ + +G AN++++RK Sbjct: 7 VRVNFNPDFKEANFTKKRYRAMKGSAGSGKSVNVAQDYILKLGDKKYQG--ANLLVVRKS 64 Query: 66 ANTIRDSVYNQIQWGLG-LFGLTSR--FKMTVSPFKITHKKTGSTFYFYGLDD---YQKL 119 T + S Y ++ + ++G + +K T++P +I K TG++ F G++D +KL Sbjct: 65 EATHKYSTYAELTGAINRIYGKQADKYWKTTLNPLEIKSKVTGNSIIFRGVNDAKQREKL 124 Query: 120 KSNNI--GNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAP---FVQIFWSYNPPINP 174 KS N G + VW EEA E ++ +I R + L + Q+ +++N P++ Sbjct: 125 KSINFSKGKLTWVWCEEATELMESD----IDILDDRLRGILTNPNLYYQMTFTFN-PVSA 179 Query: 175 YSWINEWYERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPV 234 WI Y D+ H STYL + F++E ++ KE D + Y+ LGE Sbjct: 180 THWIKRKYFDYKN-DDIFTHHSTYLQNR--FIDEAYYRRMQMRKEQDPEGYKVYGLGEWG 236 Query: 235 GLGNNIYNMSTFHPL 249 G I H Sbjct: 237 ETGGAILKNYVIHEF 251 >gi|6093|lcl|protein:vir:95761 Length: 432 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950582;genbank:gi:119953777;genbank:GeneI D:5076831 Length = 432 Score = 73.9 bits (180), Expect = 3e-15, Method: Compositional matrix adjust. 
Identities = 53/171 (30%), Positives = 88/171 (51%), Gaps = 10/171 (5%) Query: 4 IDIQKNVNPHFKSVWLSKKPYNILKGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIR 63 +++ + V + W SK Y ++KGGR S KS L I+ ++ Y AN++++R Sbjct: 9 VNLPEIVGKGYGQFWRSKNFYRVVKGGRGSKKSKTTALYYIVAILKY----NWANLLVVR 64 Query: 64 KVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKSNN 123 + +NT + S Y ++W ++ FK S +IT K TG F GLDD K+ S Sbjct: 65 RFSNTNKQSTYTDLKWAANRLNVSHLFKFNESLPEITVKATGQKILFRGLDDPLKITSIT 124 Query: 124 I--GNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAP--FVQIFWSYNP 170 + G + +W EEA + + ++F +T + +R AP F QI ++NP Sbjct: 125 VDTGLLSWLWLEEAYQVENQDKF-ETLVESIRGSID-APDFFKQITVTFNP 173 >gi|10799|lcl|protein:vir:78055 Length: 440 # NCBI annotation: TerL # Family: family:all:54 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468786;genbank:gi:157325367;genbank:Ge neID:5601817 Length = 440 Score = 73.9 bits (180), Expect = 4e-15, Method: Compositional matrix adjust. Identities = 67/240 (27%), Positives = 119/240 (49%), Gaps = 20/240 (8%) Query: 27 LKGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRKVANTIRDSVYNQIQWGLGLFGL 86 +KGGR S KSS + L M+V I++ AN VI RKV +R ++ Q QW + G+ Sbjct: 48 MKGGRGSGKSSFVAL----MVVDEIMKDPQANAVIFRKVDEGMRTTLLPQYQWAIDQLGV 103 Query: 87 TSRFKMTVSPFKITHK--KTG--STFYFYGLDDYQKLKSN--NIGNIIAVWYEEAAEFSS 140 + ++ ++ P + +K +TG F G+ D +++K++ +G + YEEA E+ S Sbjct: 104 SGAWRTSLQPMMLLYKNPETGLEQQIRFKGVKDPKRVKASKFRVGYAKYLIYEEADEYES 163 Query: 141 AEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWINEWYERMNTMDNYLCHSSTYLD 200 E+F N ++MR + + F+ YNPP W+N W + + + H ST++ Sbjct: 164 EEDFSIVNSSYMRGEG--TGDSRAFYLYNPPKYKGHWLNNWVDVIRDEPSQYVHHSTFIP 221 Query: 201 DEL---GFVNEQMLADIERIKENDYDYYRYVYLGEPVGLGNNIYNMS-----TFHPLDAL 252 L ++ L +++ + + Y + +LG V GN ++ + TF +D L Sbjct: 222 IALHHPEWLGSTWLESARLVRDKNPNRYEWEFLGRNVNTGNEVFPNAVQEHITFDMIDGL 281 >gi|5321|lcl|protein:vir:99534 Length: 422 # NCBI annotation: putative protein # Family: family:all:54 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: 
genbank:acc:NP_958532;genbank:gi:41179314;genbank:GeneID :2717172 Length = 422 Score = 72.0 bits (175), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 107/420 (25%), Positives = 180/420 (42%), Gaps = 52/420 (12%) Query: 13 HFKSVWLSKKPYNILKGGRNSFKSSVITLKLII--MMVWYIVRGETANIVIIRKVANTIR 70 H VW GG +S KS + K+++ + W + R ++ +RKV T++ Sbjct: 28 HLTEVWY---------GGASSGKSHGVVQKVVLKSLQHWNVPR----KVLWLRKVDRTVK 74 Query: 71 DSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKSNNIGNIIAV 130 +S++ + L + + + S I G+ F F G+DD +K+KS I + V Sbjct: 75 NSIFTDVTECLSGWNILQYCHVNRSDKTIV-LPNGAIFLFQGMDDPEKIKS--IKGLSDV 131 Query: 131 WYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWI-NEWYERMNTMD 189 EEA+EF+ D T +T +R + P QIF +N P++ +W W++ D Sbjct: 132 VMEEASEFNHN---DYTQLT-LRLREPKHKQRQIFCMFN-PVSKLNWTYQTWFDPSADYD 186 Query: 190 --NYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLGNNIYNMSTFH 247 H STY D+ F++E + IE +K + YY+ LGE L ++ F Sbjct: 187 RSRVAIHQSTYKDNR--FLDEDNIRTIEELKNTNPAYYKIYTLGEFATLDKLVF--PYFE 242 Query: 248 PLDALPSDDRLIGIS--FALDGGHQQSATACCAFGVTAKGKVILLDTWYYSPAGQAIKKA 305 P D +L+ ++ F LD G +A + + K + + + + Sbjct: 243 TKRLNPRDPKLLALNDYFGLDYGFINDPSAFMHIKLDMRNKTLYVMDEFVKKG--LLNNQ 300 Query: 306 PSQLSQDIYYFTTKVISKYKVPILQYTIDSAEGALRNQMYLDFAIRWHPVAKLKKVTMID 365 +Q+ +D+ Y + +VI T DSAE +M D R P K ++I Sbjct: 301 LAQVIKDMGY-SKEVI----------TADSAEKKSIAEMKRDGIYRIRPALKGPD-SIIQ 348 Query: 366 TFQSLLAQGRFYYLDTDNNKVFIEEHKMYRW--DEKTIQSDNPNVIKDDDHTCDVAQYFV 423 Q L +F ++ D IEE + Y + D+KT + N I +H D +Y V Sbjct: 349 GIQFL---QQFKWVVDDRCVKTIEELQNYTYVKDKKTDEYTN-RPIDAYNHCIDAIRYAV 404 >gi|4914|lcl|protein:vir:99572 Length: 459 # NCBI annotation: TerL-like protein # Family: family:all:54 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039811;genbank:gi:126011061;genbank:Ge neID:4818267 Length = 459 Score = 71.6 bits (174), Expect = 2e-14, Method: Compositional matrix adjust. 
Identities = 65/226 (28%), Positives = 101/226 (44%), Gaps = 21/226 (9%) Query: 9 NVNPHFKSVWLSKKPYNILKGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRKVANT 68 ++NP + WL K Y L GGR S KS + Y+ R T + R+ N Sbjct: 3 DLNPALRDFWLDKARYKALYGGRASSKSHDAAGFAV-----YLARNYTVKFLCARQFQNK 57 Query: 69 IRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGL-DDYQKLKSNNIGNI 127 I +SVY I+ + G T F +T+S I HKKTG+ F FYG+ + ++KS +I Sbjct: 58 ISESVYTLIKGKIDAAGWTKEFDVTIS--SIRHKKTGAEFLFYGIARNLNEIKSTEGVDI 115 Query: 128 IAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWINEWYERMNT 187 + W EE A++ + E+++ N T R+ + + W NP + + Y+ Sbjct: 116 L--WLEE-AQYLTEEQWNVINPTIRRE----GSQIWLIW------NPDQYTDFIYQNFVV 162 Query: 188 MDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEP 233 C S E F+++ ML I + D +VY G P Sbjct: 163 NPPADCLSKQINWTENPFLSDTMLKVIYDEYQRDPKLAEHVYGGAP 208 >gi|17761|lcl|protein:vir:171 Length: 470 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112076;genbank:gi:13559866;genbank:GeneID :921009 Length = 470 Score = 70.1 bits (170), Expect = 5e-14, Method: Compositional matrix adjust. 
Identities = 65/229 (28%), Positives = 112/229 (48%), Gaps = 26/229 (11%) Query: 9 NVNPHFKSVWLSKKPYNILKGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRKVANT 68 ++NP F+ ++ Y + KGGR S KS I L+ R + I+ R++ N+ Sbjct: 3 SINPIFEP-FIEAHRYKVAKGGRGSGKSWAIARLLV-----EAARRQPVRILCARELQNS 56 Query: 69 IRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGL-DDYQKLKSNNIGNI 127 I DSV ++ + G ++ F++ S I H T + F FYG+ ++ K+KS + I Sbjct: 57 ISDSVIRLLEDTIEREGYSAEFEIQRS--MIRHLGTNAEFMFYGIKNNPTKIKS--LEGI 112 Query: 128 IAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWINEWYER--M 185 W EEA E + E +D T + PF +I+ S+NP + +++ Y+R + Sbjct: 113 DICWVEEA-EAVTKESWDILIPTIRK------PFSEIWVSFNP----KNILDDTYQRFVV 161 Query: 186 NTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPV 234 N D+ + Y D+ E + ++E K + YR+++LGEPV Sbjct: 162 NPPDDICLLTVNYTDNP--HFPEVLRLEMEECKRRNPTLYRHIWLGEPV 208 >gi|5142|lcl|protein:vir:105417 Length: 470 # NCBI annotation: gene 2 protein # Family: family:all:54 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958178;genbank:gi:41057280;genbank:GeneID :2716664 Length = 470 Score = 70.1 bits (170), Expect = 5e-14, Method: Compositional matrix adjust. 
Identities = 65/229 (28%), Positives = 112/229 (48%), Gaps = 26/229 (11%) Query: 9 NVNPHFKSVWLSKKPYNILKGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRKVANT 68 ++NP F+ ++ Y + KGGR S KS I L+ R + I+ R++ N+ Sbjct: 3 SINPIFEP-FIEAHRYKVAKGGRGSGKSWAIARLLV-----EAARRQPVRILCARELQNS 56 Query: 69 IRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGL-DDYQKLKSNNIGNI 127 I DSV ++ + G ++ F++ S I H T + F FYG+ ++ K+KS + I Sbjct: 57 ISDSVIRLLEDTIEREGYSAEFEIQRS--MIRHLGTNAEFMFYGIKNNPTKIKS--LEGI 112 Query: 128 IAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWINEWYER--M 185 W EEA E + E +D T + PF +I+ S+NP + +++ Y+R + Sbjct: 113 DICWVEEA-EAVTKESWDILIPTIRK------PFSEIWVSFNP----KNILDDTYQRFVV 161 Query: 186 NTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPV 234 N D+ + Y D+ E + ++E K + YR+++LGEPV Sbjct: 162 NPPDDICLLTVNYTDNP--HFPEVLRLEMEECKRRNPTLYRHIWLGEPV 208 >gi|17691|lcl|protein:vir:9305 Length: 447 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803283;genbank:gi:29028593;genbank:GeneID :1258039 Length = 447 Score = 65.9 bits (159), Expect = 9e-13, Method: Compositional matrix adjust. 
Identities = 63/234 (26%), Positives = 111/234 (47%), Gaps = 23/234 (9%) Query: 59 IVIIRKVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQK 118 I+ +RKV +TI+DS++ ++ L FG+ + K+ G+ F F GLD+ +K Sbjct: 85 ILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKV-ELPNGAVFLFKGLDNPEK 143 Query: 119 LKSNNIGNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWI 178 +KS I I + EEA+EF + ++ Q + +KH QIF +N P++ +W+ Sbjct: 144 IKS--IKGISDIVMEEASEF-TLNDYTQLTLRLRERKHVNK---QIFLMFN-PVSKLNWV 196 Query: 179 NEW-YERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLG 237 ++ +E M+N + S+Y D++ F++E ++E + + YY+ LGE L Sbjct: 197 YKYFFEHGEPMENVMIRQSSYRDNK--FLDEMTRQNLELLANRNPAYYKIYALGEFATLD 254 Query: 238 NNIY-----NMSTFHPLDALPSDDRLIGISFALDGGHQQSATACCAFGVTAKGK 286 ++ + L LPS F LD G+ +A + K K Sbjct: 255 KLVFPKYEKRLINKDELRHLPS-------YFGLDFGYVNDPSAFIHSKIDVKKK 301 >gi|7301|lcl|protein:vir:96238 Length: 447 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239566;genbank:gi:66395301;genbank:GeneID :5132787 Length = 447 Score = 65.9 bits (159), Expect = 1e-12, Method: Compositional matrix adjust. 
Identities = 63/234 (26%), Positives = 111/234 (47%), Gaps = 23/234 (9%) Query: 59 IVIIRKVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQK 118 I+ +RKV +TI+DS++ ++ L FG+ + K+ G+ F F GLD+ +K Sbjct: 85 ILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKV-ELPNGAVFLFKGLDNPEK 143 Query: 119 LKSNNIGNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWI 178 +KS I I + EEA+EF + ++ Q + +KH QIF +N P++ +W+ Sbjct: 144 IKS--IKGISDIVMEEASEF-TLNDYTQLTLRLRERKHVNK---QIFLMFN-PVSKLNWV 196 Query: 179 NEW-YERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLG 237 ++ +E M+N + S+Y D++ F++E ++E + + YY+ LGE L Sbjct: 197 YKYFFEHGEPMENVMIRQSSYRDNK--FLDEMTRQNLELLANRNPAYYKIYALGEFATLD 254 Query: 238 NNIY-----NMSTFHPLDALPSDDRLIGISFALDGGHQQSATACCAFGVTAKGK 286 ++ + L LPS F LD G+ +A + K K Sbjct: 255 KLVFPKYEKRLINKDELRHLPS-------YFGLDFGYVNDPSAFIHSKIDVKKK 301 >gi|9829|lcl|protein:vir:103950 Length: 425 # NCBI annotation: phage terminase large subunit # Family: family:all:54 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873987;genbank:gi:118430762;genbank:GeneI D:4525444 Length = 425 Score = 65.9 bits (159), Expect = 1e-12, Method: Compositional matrix adjust. 
Identities = 63/234 (26%), Positives = 111/234 (47%), Gaps = 23/234 (9%) Query: 59 IVIIRKVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQK 118 I+ +RKV +TI+DS++ ++ L FG+ + K+ G+ F F GLD+ +K Sbjct: 63 ILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKV-ELPNGAVFLFKGLDNPEK 121 Query: 119 LKSNNIGNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWI 178 +KS I I + EEA+EF + ++ Q + +KH QIF +N P++ +W+ Sbjct: 122 IKS--IKGISDIVMEEASEF-TLNDYTQLTLRLRERKHVNK---QIFLMFN-PVSKLNWV 174 Query: 179 NEW-YERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLG 237 ++ +E M+N + S+Y D++ F++E ++E + + YY+ LGE L Sbjct: 175 YKYFFEHGEPMENVMIRQSSYRDNK--FLDEMTRQNLELLANRNPAYYKIYALGEFATLD 232 Query: 238 NNIY-----NMSTFHPLDALPSDDRLIGISFALDGGHQQSATACCAFGVTAKGK 286 ++ + L LPS F LD G+ +A + K K Sbjct: 233 KLVFPKYEKRLINKDELRHLPS-------YFGLDFGYVNDPSAFIHSKIDVKKK 279 >gi|7643|lcl|protein:vir:96370 Length: 447 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239642;genbank:gi:66395379;genbank:GeneID :5132846 Length = 447 Score = 65.1 bits (157), Expect = 1e-12, Method: Compositional matrix adjust. 
Identities = 63/234 (26%), Positives = 111/234 (47%), Gaps = 23/234 (9%) Query: 59 IVIIRKVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQK 118 I+ +RKV +TI+DS++ ++ L FG+ + K+ G+ F F GLD+ +K Sbjct: 85 ILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKV-ELPNGAVFLFKGLDNPEK 143 Query: 119 LKSNNIGNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWI 178 +KS I I + EEA+EF + ++ Q + +KH QIF +N P++ +W+ Sbjct: 144 IKS--IKGISDIVMEEASEF-TLNDYTQLTLRLRERKHVNK---QIFLMFN-PVSKLNWV 196 Query: 179 NEW-YERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLG 237 ++ +E M+N + S+Y D++ F++E ++E + + YY+ LGE L Sbjct: 197 YKYFFEHGEPMENVMIRQSSYRDNK--FLDEMTRQNLELLANRNPAYYKIYALGEFSTLD 254 Query: 238 NNIY-----NMSTFHPLDALPSDDRLIGISFALDGGHQQSATACCAFGVTAKGK 286 ++ + L LPS F LD G+ +A + K K Sbjct: 255 KLVFPKYEKRLINKDELRHLPS-------YFGLDFGYVNDPSAFIHSKIDVKKK 301 >gi|11588|lcl|protein:vir:78831 Length: 447 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285355;genbank:gi:148717883;genbank:Ge neID:5246962 Length = 447 Score = 65.1 bits (157), Expect = 1e-12, Method: Compositional matrix adjust. 
Identities = 63/234 (26%), Positives = 111/234 (47%), Gaps = 23/234 (9%) Query: 59 IVIIRKVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQK 118 I+ +RKV +TI+DS++ ++ L FG+ + K+ G+ F F GLD+ +K Sbjct: 85 ILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKV-ELPNGAVFLFKGLDNPEK 143 Query: 119 LKSNNIGNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWI 178 +KS I I + EEA+EF + ++ Q + +KH QIF +N P++ +W+ Sbjct: 144 IKS--IKGISDIVMEEASEF-TLNDYTQLTLRLRERKHVNK---QIFLMFN-PVSKLNWV 196 Query: 179 NEW-YERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLG 237 ++ +E M+N + S+Y D++ F++E ++E + + YY+ LGE L Sbjct: 197 YKYFFEHGEPMENVMIRQSSYRDNK--FLDEMTRQNLELLANRNPAYYKIYALGEFSTLD 254 Query: 238 NNIY-----NMSTFHPLDALPSDDRLIGISFALDGGHQQSATACCAFGVTAKGK 286 ++ + L LPS F LD G+ +A + K K Sbjct: 255 KLVFPKYEKRLINKDELRHLPS-------YFGLDFGYVNDPSAFIHSKIDVKKK 301 >gi|5208|lcl|protein:vir:106641 Length: 446 # NCBI annotation: ORF005 # Family: family:all:54 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239489;genbank:gi:66395220;genbank:GeneID :4555795 Length = 446 Score = 65.1 bits (157), Expect = 2e-12, Method: Compositional matrix adjust. 
Identities = 71/285 (24%), Positives = 129/285 (45%), Gaps = 36/285 (12%) Query: 59 IVIIRKVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQK 118 I+ +RKV +TI+DS++ ++ L FG+ + K+ G+ F F GLD+ +K Sbjct: 85 ILWLRKVQSTIKDSLFEDVKGCLINFGIWDMCLWNKTDNKV-ELPNGAVFLFKGLDNPEK 143 Query: 119 LKSNNIGNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWI 178 +KS I I + EEA+EF + ++ Q + +KH QIF +N P++ +W+ Sbjct: 144 IKS--IKGISDIVMEEASEF-TLNDYTQLTLRLRERKHMNK---QIFLMFN-PVSKLNWV 196 Query: 179 NEW-YERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLG 237 ++ +E M+N + S+Y D++ F++E ++E + + YY+ LGE L Sbjct: 197 YKYFFEHGEPMENVMIRQSSYRDNK--FLDEMTRQNLELLANRNPAYYKIYALGEFATLD 254 Query: 238 NNIY-----NMSTFHPLDALPSDDRLIGISFALDGGHQQSATACCAFGVTAKGKVILLDT 292 ++ + + + LPS F LD G+ +A + K + + + Sbjct: 255 KLVFPKYEKRIISDKEVGHLPS-------YFGLDFGYVNDPSAFIHVKIDNDNKKLYVIS 307 Query: 293 WYYSPAGQAIKKAPSQLSQDIYYFTTKVISKYKVPILQYTIDSAE 337 Y + +Q+ D+ Y K+ T DSAE Sbjct: 308 EYVKKG--MLNNEIAQVINDLGYSKEKI-----------TADSAE 339 >gi|2779|lcl|protein:vir:99776 Length: 425 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004302;genbank:gi:122891756;genbank:Ge neID:4712331 Length = 425 Score = 65.1 bits (157), Expect = 2e-12, Method: Compositional matrix adjust. 
Identities = 63/234 (26%), Positives = 111/234 (47%), Gaps = 23/234 (9%) Query: 59 IVIIRKVANTIRDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQK 118 I+ +RKV +TI+DS++ ++ L FG+ + K+ G+ F F GLD+ +K Sbjct: 63 ILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKVGL-PNGAVFLFKGLDNPEK 121 Query: 119 LKSNNIGNIIAVWYEEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYSWI 178 +KS I I + EEA+EF + ++ Q + +KH QIF +N P++ +W+ Sbjct: 122 IKS--IKGISDIVMEEASEF-TLNDYTQLTLRLRERKHVNK---QIFLIFN-PVSKLNWV 174 Query: 179 NEW-YERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEPVGLG 237 ++ +E M+N + S+Y D++ F++E ++E + + YY+ LGE L Sbjct: 175 YKYFFEHGEPMENVMIRQSSYRDNK--FLDEMTRQNLELLANRNPAYYKIYALGEFATLD 232 Query: 238 NNIY-----NMSTFHPLDALPSDDRLIGISFALDGGHQQSATACCAFGVTAKGK 286 ++ + L LPS F LD G+ +A + K K Sbjct: 233 KLVFPKYEKRLINKDELRHLPS-------YFGLDFGYVNDPSAFIHSKIDVKKK 279 >gi|13903|lcl|protein:vir:9870 Length: 273 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795632;genbank:gi:28876409;genbank:GeneID :1257943 Length = 273 Score = 60.8 bits (146), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 2/86 (2%) Query: 152 MRQKHPLAPFVQIFWSYNPPINPYSWINEWYERMNTMDNYLCHSSTYLDDELGFVNEQML 211 MR + F + F++YNPP SW+N+ YE N H+STY D+ F+ ++ + Sbjct: 1 MRGELGDGLFYKFFYTYNPPKRKQSWVNKKYESQFQPSNTFVHASTYKDNP--FIAKEFI 58 Query: 212 ADIERIKENDYDYYRYVYLGEPVGLG 237 A+ E +E YR+ YLGE +G G Sbjct: 59 AEAEATRERSERRYRWEYLGEAIGSG 84 >gi|6775|lcl|protein:vir:96067 Length: 460 # NCBI annotation: conserved hypothetical protein ORF004 # Family: family:all:54 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294421;genbank:gi:149408318;genbank:Ge neID:5237186 Length = 460 Score = 59.3 bits (142), Expect = 1e-10, Method: Compositional matrix adjust. 
Identities = 73/303 (24%), Positives = 123/303 (40%), Gaps = 55/303 (18%) Query: 10 VNPHFKSVWLSKKPYNILKGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRKVANTI 69 +NP ++VW ++ Y ++ GGR S KS + + Y ++ + R+ N I Sbjct: 4 LNPALRAVWRTRARYKVIYGGRASSKSHDAGGIAVYLAANYRLK-----FLCARQFQNRI 58 Query: 70 RDSVYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGL-DDYQKLKSNNIGNII 128 +SVY I+ + F T + K HK+TGS F FYG+ + ++KS I Sbjct: 59 SESVYTLIKDKIENSEYNGEFIFTKNSIK--HKRTGSEFLFYGIARNLSEIKSTE--GID 114 Query: 129 AVWYEEAAEFSSAEEFDQTNITFMRQK---------HPLAPFVQIFWSYNPPINPYSWIN 179 +W EE A + + E+++ T ++ + + FV + PP + + + Sbjct: 115 ILWLEE-AHYLTQEQWEVIEPTIRKENSEIWIIFNPNEVTDFVYQNFVVKPPKDAFVKMI 173 Query: 180 EWYERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGEP-VGLGN 238 W +E F++E ML I E D D ++Y G P G Sbjct: 174 NW-------------------NENPFLSETMLKVIHEAYERDKDQAEHIYGGIPKTGGDK 214 Query: 239 NIYNMS-TFHPLDA------LPSDDRLIGISFALDGGHQQSATACCAFGVTAKGKVIL-L 290 ++ N+ +DA P+ + IG A DG + T G VI+ + Sbjct: 215 SVINLKFILAAIDAHKKLGWEPAGSKRIGFDVADDGEDANATTLM-------HGNVIMEV 267 Query: 291 DTW 293 D W Sbjct: 268 DEW 270 >gi|16299|lcl|protein:vir:3027 Length: 375 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438140;genbank:gi:16271803;genbank:GeneID :929285 Length = 375 Score = 45.1 bits (105), Expect = 2e-06, Method: Compositional matrix adjust. 
Identities = 66/275 (24%), Positives = 117/275 (42%), Gaps = 34/275 (12%) Query: 169 NPPINPYSWINEWYERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYV 228 NPP + I + ++ NT + +DD E+ I +K+N Y Y R V Sbjct: 106 NPPAPQHPVIKDVFDVQNTRWTHWT-----MDDNPILTAERKQNIINSLKKNPYLYKRDV 160 Query: 229 YLGEPVGLGNNIYNM--STFHPLDALPSDDRLIGISFALDGGHQQSATACCAF--GVTAK 284 LG+ V IY + + + LDAL + + + F DGG + + C V Sbjct: 161 -LGQRVMPQGVIYGLFDTEKNVLDALIGEP--VEMYFCADGGQSDATSMSCNIVTRVRDN 217 Query: 285 GKVIL----LDTWYYSPAGQAIKKAPSQLSQDIYYFTTKVISKYKVPILQYTIDSAEGAL 340 G++ + +Y+S A KA S + ++ F + KY++ + +D A +L Sbjct: 218 GRISFRLNRVAHYYHSGADTGQVKAMSTYALELKVFIDWCVKKYQMRYTEVFVDPACKSL 277 Query: 341 RNQMY-LDFAIRWHP------VAKLKKVTM-IDTFQSLLAQGRFYYLDTDNNKV----FI 388 R +++ L P +K K + + I+ Q++++ G FY ++ + F+ Sbjct: 278 REELHKLGVFTLGAPNNSKDVSSKAKGIEVGIERGQNIISDGAFYLVNHSEEEYDHYHFL 337 Query: 389 EEHKMYRWDEKTIQSDNPNVIKDDDHTCDVAQYFV 423 +E +Y D DN I D+H D +Y V Sbjct: 338 KEIGLYSRD------DNGKPIDKDNHAMDEFRYSV 366 >gi|16025|lcl|protein:vir:9814 Length: 403 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795576;genbank:gi:28876345;genbank:GeneID :1257867 Length = 403 Score = 45.1 bits (105), Expect = 2e-06, Method: Compositional matrix adjust. 
Identities = 66/275 (24%), Positives = 117/275 (42%), Gaps = 34/275 (12%) Query: 169 NPPINPYSWINEWYERMNTMDNYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYV 228 NPP + I + ++ NT + +DD E+ I +K+N Y Y R V Sbjct: 134 NPPAPQHPVIKDVFDVQNTRWTHWT-----MDDNPILTAERKQNIINSLKKNPYLYKRDV 188 Query: 229 YLGEPVGLGNNIYNM--STFHPLDALPSDDRLIGISFALDGGHQQSATACCAF--GVTAK 284 LG+ V IY + + + LDAL + + + F DGG + + C V Sbjct: 189 -LGQRVMPQGVIYGLFDTEKNVLDALIGEP--VEMYFCADGGQSDATSMSCNIVTRVRDN 245 Query: 285 GKVIL----LDTWYYSPAGQAIKKAPSQLSQDIYYFTTKVISKYKVPILQYTIDSAEGAL 340 G++ + +Y+S A KA S + ++ F + KY++ + +D A +L Sbjct: 246 GRISFRLNRVAHYYHSGADTGQVKAMSTYALELKVFIDWCVKKYQMRYTEVFVDPACKSL 305 Query: 341 RNQMY-LDFAIRWHP------VAKLKKVTM-IDTFQSLLAQGRFYYLDTDNNKV----FI 388 R +++ L P +K K + + I+ Q++++ G FY ++ + F+ Sbjct: 306 REELHKLGVFTLGAPNNSKDVSSKAKGIEVGIERGQNIISDGAFYLVNHSEEEYDHYHFL 365 Query: 389 EEHKMYRWDEKTIQSDNPNVIKDDDHTCDVAQYFV 423 +E +Y D DN I D+H D +Y V Sbjct: 366 KEIGLYSRD------DNGKPIDKDNHAMDEFRYSV 394 >gi|1165|lcl|protein:vir:102941 Length: 421 # NCBI annotation: terminase, large subunit # Family: family:all:54 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945278;genbank:gi:39653713;goa:Q708N4;int erpro:IPR004921;interpro:IPR006437;uniprot:Q708N4;genban k:GeneID:2672855 Length = 421 Score = 42.4 bits (98), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 87/420 (20%), Positives = 157/420 (37%), Gaps = 48/420 (11%) Query: 18 WLSKKPYN-----ILKGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRKVANTIRDS 72 WL P + I G S K+ ++L +I W + N + K + + Sbjct: 27 WLWNSPVHESEGIIADGAIRSGKTVSMSLAFVI---WAMTSFNHQNFAMCGKTIGSFNRN 83 Query: 73 VYNQIQWGLGLFGLTSRFKMTVSPFKITHKKTGSTFYFYGLDDYQKLKSNNIGNIIAVWY 132 V + + G + + T + +IT + FY +G D + +++ Sbjct: 84 VLKLLLVMIQSRGFSYVYHRTDNLIEITKGDVSNDFYIFGGKDESSQDLIQGLTLAGIFF 143 Query: 133 EEAAEFSSAEEFDQTNITFMRQKHPLAPFVQIFWSYNP-PINPYSWIN-EWYERMNTMDN 190 +E A +F+ Q W +N P PY W W ++ T + Sbjct: 144 DEVALMPE---------SFVNQGTGRCSVTGSKWWFNCNPDGPYHWFKVNWIDKAETKNM 194 Query: 191 YLCHSSTYLDDELGFVNEQMLADIERIKENDYD---YYRYVYLGEPVGLGNNIYNMSTF- 246 H +DD L + +I++ + Y Y RY+ V G +Y+M + Sbjct: 195 LYLHFD--MDDNL-----SLSENIKKRYRSQYQGVFYQRYIQGLWTVAEGI-VYDMFSKD 246 Query: 247 -HPLDALPSDDRLIGISFALDGGHQQSATACCAFGVTAKGKVILLDTWYYSPAGQAIKKA 305 H + LP +L G ++D G Q+AT + GK L +YYS + ++K Sbjct: 247 KHVVSTLPEMSKL-GKYVSVDYG-TQNATVFLLWEKDIIGKYYLTREYYYSGRDENVQKT 304 Query: 306 PSQLSQDIYYFTTKVISKYKVPILQYTIDSAEGALRNQMYLDFAIRWHPVAKLKKVTM-- 363 ++ + D+ + I + ID + + ++ R + + K + + Sbjct: 305 NAEYADDLTAWLGDT------NIDRIIIDPSAASFIAEL----KKRGYKIKKARNNVLEG 354 Query: 364 IDTFQSLLAQGRFYYLDTDNNKVFIEEHKMYRWDEKTIQSDNPNVIKDDDHTCDVAQYFV 423 I S+L Q + ++ N ++E Y WDEK + IK DH D +YF Sbjct: 355 IRFVGSMLGQEKIAVHESCVNT--LKEFHAYVWDEKASANGEDKPIKQFDHAMDALRYFC 412 >gi|11622|lcl|protein:vir:78869 Length: 439 # NCBI annotation: TerL # Family: family:all:54 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468842;genbank:gi:157325434;genbank:Ge neID:5601865 Length = 439 Score = 38.9 bits (89), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 51/187 (27%), Positives = 88/187 (47%), Gaps = 26/187 (13%) Query: 255 DDRLIGISFALDGGHQQSATACCAFGVTA-------KGKVILLDTWYYSPAGQAIKKAPS 307 + R I + F DGG QQ AT C + +T K K + ++Y+S KA S Sbjct: 247 EGRPIEMVFFGDGG-QQDATVCECYVITEHAADGHYKYKFNQVASYYHSGRDTGEVKAGS 305 Query: 308 QLSQDIYYFTTKVISKYKVPILQYT-IDSAEGALRNQMY---LDFAI---RWHPV-AKLK 359 + +I F + +Y+VP+ + ID A LR ++ +D A H V K + Sbjct: 306 TYAVEIKQFIQWCMKEYEVPVNEPVFIDPACRWLREELEKVGVDTAGADNNAHDVIGKAQ 365 Query: 360 KVTM-IDTFQSLLAQGRFYYLDTDNNK----VFIEEHKMYRWDEKTIQSDNPNVIKDDDH 414 + + I+ QSLL++ R+ ++ N++ +++E MY DE S P + ++H Sbjct: 366 GIEVGIERMQSLLSERRYLLVEQPNDQYDHYSWLQEIGMYVRDE---NSGKP--VDKNNH 420 Query: 415 TCDVAQY 421 D ++Y Sbjct: 421 AMDTSRY 427 >gi|14466|lcl|protein:vir:3519 Length: 469 # NCBI annotation: P18 # Family: family:all:54 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050979;genbank:gi:9633565;genbank:GeneID: 1262312 Length = 469 Score = 38.5 bits (88), Expect = 2e-04, Method: Compositional matrix adjust. 
Identities = 50/223 (22%), Positives = 96/223 (43%), Gaps = 47/223 (21%) Query: 29 GGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRKVANTIRDSVYNQIQWGLGLFGLTS 88 GGR K+ +I + R + +R+ N+I DS + +Q + GL + Sbjct: 12 GGRGGMKTVSFAKIALITASMHKRR-----FLCLREFMNSIEDSGHAVLQAEVETLGLQN 66 Query: 89 RFKMTVSPFKITHKKTGSTFYFYGLDD----YQKLKSNNIGNIIA------VWYEEAAEF 138 RF++ + Y G++D Y +L + NI +I + W EE AE Sbjct: 67 RFRILNT-------------YIEGINDSIFKYGQL-ARNIASIKSKHDFDVAWVEE-AET 111 Query: 139 SSAEEFDQTNITFMRQKHPLAPFVQIFWSYNPPINPYS----WINEWYERMNTM-----D 189 S + D T + P ++++S+NP + ++ + E ++T D Sbjct: 112 VSEKSLDSLIPTIRK------PGSELWFSFNPAEEDGAVYKRFVKPYKELIDTQGYYEDD 165 Query: 190 NYLCHSSTYLDDELGFVNEQMLADIERIKENDYDYYRYVYLGE 232 + +YLD+ ++ ++ D +++K +Y +R+VY GE Sbjct: 166 DLYVGKVSYLDNP--WLPAELKNDAQKMKRENYKKWRHVYGGE 206 >gi|6886|lcl|protein:vir:106551 Length: 424 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958579;genbank:gi:41179257;genbank:GeneID :2717087 Length = 424 Score = 37.0 bits (84), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 32/135 (23%), Positives = 56/135 (41%), Gaps = 11/135 (8%) Query: 289 LLDTWYYSPAGQAIKKAPSQLSQDIYYFTTKVISKYKVPILQYTIDSAEGALRNQMYLDF 348 L+ +YYS + +K + D+ F + ++ I+ + S LR F Sbjct: 280 LVKEYYYSGRTTSRQKTDEEYCHDLKEFLGDIRAEM---IIDPSAASFSTTLRQN---GF 333 Query: 349 AIRWHPVAKLKKVTMIDTFQSLLAQGRFYYLDTDNNKVFIEEHKMYRWDEKTIQSDNPNV 408 +R AK + I Q+ + +G+ + + N +E Y WD+K + Sbjct: 334 KVR---KAKNDVLDGIRVTQTAMNEGKIKF--SMNCPNLFKELASYVWDDKAAEHGEDKP 388 Query: 409 IKDDDHTCDVAQYFV 423 +K DH CD +YFV Sbjct: 389 VKQHDHACDAMRYFV 403 >gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: gp123 # Family: family:all:147 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717790;genbank:gi:113200627;genbank:GeneI D:4239178 Length = 549 Score = 30.4 bits (67), Expect = 0.044, Method: Compositional matrix adjust. 
Identities = 21/78 (26%), Positives = 39/78 (50%), Gaps = 11/78 (14%) Query: 1 MKVIDIQKNVNP----HFKSVWLSK---KPYNILKGGRNSFKSSVITLKLIIMMVWYIVR 53 +K++ + K + P +F+ + K +NI K R S KS+++T L+ WY++ Sbjct: 45 IKIVSLDKGLIPFDMYYFQEEMVQKFHDNRFNIAKLPRQSGKSTIVTSYLL----WYVLF 100 Query: 54 GETANIVIIRKVANTIRD 71 N+ I+ A T R+ Sbjct: 101 NANVNVAILANKAATARE 118 >gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214662;genbank:gi:61806303;genbank:GeneID :3294591 Length = 550 Score = 30.0 bits (66), Expect = 0.063, Method: Compositional matrix adjust. Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 4/48 (8%) Query: 24 YNILKGGRNSFKSSVITLKLIIMMVWYIVRGETANIVIIRKVANTIRD 71 +NI K R S KS+++T L+ WY++ N+ I+ A T R+ Sbjct: 76 FNIAKLPRQSGKSTIVTAYLL----WYVLFNANVNVAILANKAPTARE 119 >gi|4089|lcl|protein:vir:94598 Length: 581 # NCBI annotation: PfWMP4_40 # Family: family:all:543 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762670;genbank:gi:115304378;genbank:GeneI D:5142298 Length = 581 Score = 28.1 bits (61), Expect = 0.24, Method: Compositional matrix adjust. Identities = 16/55 (29%), Positives = 27/55 (49%) Query: 237 GNNIYNMSTFHPLDALPSDDRLIGISFALDGGHQQSATACCAFGVTAKGKVILLD 291 G + +S HP +L S + ++G+ FA+ + TA G T + K+ LD Sbjct: 376 GRGMDGVSFRHPDGSLDSLEPVMGVDFAISLSSRADYTAIAVGGKTFQRKLCALD 430 >gi|12378|lcl|protein:vir:79648 Length: 523 # NCBI annotation: TerL # Family: family:all:144 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285519;genbank:gi:148734502;genbank:Ge neID:5220006 Length = 523 Score = 27.7 bits (60), Expect = 0.28, Method: Compositional matrix adjust. 
Identities = 21/94 (22%), Positives = 40/94 (42%), Gaps = 10/94 (10%) Query: 207 NEQMLADIERIKENDYDYYRYVYLGEPVGLGNNIYNMSTFH-----PLDALPSDDR---- 257 +++ + D++ +++ D + Y EP+ LG N N+ F +P DR Sbjct: 287 SKESIHDLKALRDADLYTFLSQYQQEPIALGGNAINVGWFQYYGTGEKSTMPKPDRFDYT 346 Query: 258 LIGISFALDGGHQQSATACCAFGVTAKGKVILLD 291 I A G + C +G+ KG++ +D Sbjct: 347 FITADTAQKEGELNDYSVLCYWGM-FKGRIYFID 379 >gi|6559|lcl|protein:vir:104336 Length: 523 # NCBI annotation: putative terminase large subunit # Family: family:all:144 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398965;genbank:gi:81343949;genbank:GeneID :3778868 Length = 523 Score = 26.9 bits (58), Expect = 0.53, Method: Compositional matrix adjust. Identities = 13/45 (28%), Positives = 23/45 (51%) Query: 205 FVNEQMLADIERIKENDYDYYRYVYLGEPVGLGNNIYNMSTFHPL 249 F ++ + D+ +++ D + Y EPV LG N+ N+ F L Sbjct: 291 FPAKESIEDLMAMRDADPYTFLSQYAQEPVALGGNLINVDWFQRL 335 >gi|13199|lcl|protein:vir:81178 Length: 567 # NCBI annotation: putative terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285808;genbank:gi:148747729;genbank:Ge neID:5247221 Length = 567 Score = 26.6 bits (57), Expect = 0.55, Method: Compositional matrix adjust. Identities = 11/18 (61%), Positives = 14/18 (77%) Query: 208 EQMLADIERIKENDYDYY 225 E+ L DIERI E+D+ YY Sbjct: 32 ERFLNDIERITEDDFPYY 49 >gi|2931|lcl|protein:vir:105460 Length: 409 # NCBI annotation: putative phage terminase large subunit B # Family: family:all:54 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529870;genbank:gi:90592610;genbank:GeneID :3974524 Length = 409 Score = 26.2 bits (56), Expect = 0.87, Method: Compositional matrix adjust. 
Identities = 17/65 (26%), Positives = 32/65 (49%), Gaps = 2/65 (3%) Query: 364 IDTFQSLLAQGRFYYLDTDNNKVFIEEHKMYRWDEKTIQSDNPNVIKDDDHTCDVAQYFV 423 I+ + +G+FY +DT ++ + ++E Y WDE T N ++ +D D +Y + Sbjct: 339 IECVARKMREGKFYVVDTASSGL-LDEIYQYAWDESTGLPLKENDVRHNDR-LDAIRYAI 396 Query: 424 LDNAK 428 K Sbjct: 397 YSRNK 401 >gi|19539|lcl|protein:vir:10320 Length: 627 # NCBI annotation: ORF22 # Family: family:all:140 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758915;genbank:gi:27311189;genbank:GeneID :956138 Length = 627 Score = 23.5 bits (49), Expect = 5.8, Method: Compositional matrix adjust. Identities = 15/55 (27%), Positives = 24/55 (43%), Gaps = 1/55 (1%) Query: 382 DNNKVFIEEHKMYRWDEKTIQSDNPNVIKD-DDHTCDVAQYFVLDNAKILSLRVG 435 D N V EEH +W K Q + + D ++ V + + KI+ L +G Sbjct: 32 DKNYVLPEEHGGGKWKTKPFQIGIADAMCDPEEERVTVMKSMRVGYTKIVDLAIG 86 >gi|3763|lcl|protein:vir:107694 Length: 527 # NCBI annotation: putative terminase large subunit # Family: family:all:144 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003892;genbank:gi:45686309;genbank:GeneID :2773034 Length = 527 Score = 22.7 bits (47), Expect = 7.9, Method: Compositional matrix adjust. 
Identities = 54/243 (22%), Positives = 88/243 (36%), Gaps = 33/243 (13%) Query: 207 NEQMLADIERIKENDYDYYRYVYLGEPVGLGNNIYNMS--TFH--PLDALPSDD-----R 257 +++ + D+ ++E D + Y +P+ LG +++N T++ LDA D R Sbjct: 291 SKESVHDLLALREADQYTFDSQYQQKPIALGGSVFNSEWWTYYGSSLDADEPDPGKYDYR 350 Query: 258 LIGISFALDGGHQQSATACCAFGVTAKGKVILLDTWYYSPAGQAIKKAPSQLSQDIYYFT 317 I A G T C +G D Y+ + +AP Q + Sbjct: 351 FITADTAQKTGELNDYTVFCLWGKKN-------DKVYFIDGIRGKWEAPDMERQFTAFVN 403 Query: 318 TKVISKYKVPILQ--YTIDSAEG-ALRNQMYLDFAIRWHPVAKLK-KVTMIDTFQSLLAQ 373 + +L+ Y D A G L + I P+ + K KVT Q ++ Sbjct: 404 QAWRHNKSMGVLRKIYVEDKASGTGLIQNLRKKTPISITPLQRNKDKVTRAMDAQPVIKA 463 Query: 374 GRFYYLDTDNNKVFIEEHKMYRWDEKTIQSDNPNVIKDDDHTCDVAQYFVLDNAKILSLR 433 GR V EEH M I +++ DD H D +D A I L Sbjct: 464 GRV---------VLPEEHPML----AEIIAEHSAFTYDDTHPHDDIVDNFMDAANIELLT 510 Query: 434 VGN 436 + + Sbjct: 511 IDD 513 >gi|17128|lcl|protein:vir:1379 Length: 581 # NCBI annotation: putative terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612831;genbank:gi:20065965;genbank:GeneID :935781 Length = 581 Score = 22.7 bits (47), Expect = 8.6, Method: Compositional matrix adjust. 
 Identities = 7/18 (38%), Positives = 15/18 (83%)

Query: 208 EQMLADIERIKENDYDYY 225
           ++ L D+ER++ +D++YY
Sbjct: 32  KRFLNDLERMESDDFEYY 49

  Database: capsid_neck_tail
    Posted date:  Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database:  514

Lambda     K      H
   0.321    0.137    0.420

Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 205,936
Number of Sequences: 514
Number of extensions: 9874
Number of successful extensions: 132
Number of sequences better than 100.0: 51
Number of HSP's better than 100.0 without gapping: 44
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 20
Number of HSP's gapped (non-prelim): 54
length of query: 437
length of database: 206,069
effective HSP length: 74
effective length of query: 363
effective length of database: 168,033
effective search space: 60995979
effective search space used: 60995979
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 38 (19.2 bits)