BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_011810.1_cdsid_YP_002455934.1 [gene=PB1_gp04]
[protein=terminase large subunit] [protein_id=YP_002455934.1]
[location=1179..2561]
         (460 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gi|6775|lcl|protein:vir:96067 Length: 460 # NCBI annotation: con...   951   0.0
gi|4914|lcl|protein:vir:99572 Length: 459 # NCBI annotation: Ter...   653   0.0
gi|17761|lcl|protein:vir:171 Length: 470 # NCBI annotation: term...   273   4e-75
gi|5142|lcl|protein:vir:105417 Length: 470 # NCBI annotation: ge...   272   5e-75
gi|14466|lcl|protein:vir:3519 Length: 469 # NCBI annotation: P18...   212   8e-57
gi|8380|lcl|protein:vir:96782 Length: 408 # NCBI annotation: put...    97   6e-22
gi|2819|lcl|protein:vir:105705 Length: 439 # NCBI annotation: gp...    94   3e-21
gi|3859|lcl|protein:vir:107748 Length: 532 # NCBI annotation: gp...    87   4e-19
gi|15353|lcl|protein:vir:9921 Length: 425 # NCBI annotation: put...    66   7e-13
gi|18074|lcl|protein:vir:5954 Length: 422 # NCBI annotation: hyp...    56   1e-09
gi|5794|lcl|protein:vir:98922 Length: 453 # NCBI annotation: ter...    56   1e-09
gi|655|lcl|protein:vir:1588 Length: 447 # NCBI annotation: putat...    53   6e-09
gi|13020|lcl|protein:vir:80960 Length: 443 # NCBI annotation: Te...    52   2e-08
gi|15836|lcl|protein:vir:37 Length: 443 # NCBI annotation: putat...    51   3e-08
gi|6093|lcl|protein:vir:95761 Length: 432 # NCBI annotation: ter...    50   4e-08
gi|10483|lcl|protein:vir:105297 Length: 446 # NCBI annotation: p...    49   1e-07
gi|19292|lcl|protein:vir:4781 Length: 436 # NCBI annotation: put...    49   1e-07
gi|5891|lcl|protein:vir:107098 Length: 421 # NCBI annotation: pu...    49   1e-07
gi|2779|lcl|protein:vir:99776 Length: 425 # NCBI annotation: lar...    48   2e-07
gi|17691|lcl|protein:vir:9305 Length: 447 # NCBI annotation: lar...    48   3e-07
gi|7301|lcl|protein:vir:96238 Length: 447 # NCBI annotation: ORF...    48   3e-07
gi|7643|lcl|protein:vir:96370 Length: 447 # NCBI annotation: ORF...    48   3e-07
gi|11588|lcl|protein:vir:78831 Length: 447 # NCBI annotation: te...    48   3e-07
gi|9829|lcl|protein:vir:103950 Length: 425 # NCBI annotation: ph...    47   3e-07
gi|5208|lcl|protein:vir:106641 Length: 446 # NCBI annotation: OR...    45   2e-06
gi|3326|lcl|protein:vir:94525 Length: 440 # NCBI annotation: put...    45   2e-06
gi|9289|lcl|protein:vir:97165 Length: 431 # NCBI annotation: ORF...    43   6e-06
gi|7569|lcl|protein:vir:96300 Length: 421 # NCBI annotation: ORF...    40   6e-05
gi|10354|lcl|protein:vir:97448 Length: 421 # NCBI annotation: OR...    40   6e-05
gi|3182|lcl|protein:vir:94499 Length: 421 # NCBI annotation: ORF...    40   6e-05
gi|6284|lcl|protein:vir:95890 Length: 421 # NCBI annotation: ORF...    40   6e-05
gi|4983|lcl|protein:vir:95058 Length: 421 # NCBI annotation: ORF...    40   7e-05
gi|7076|lcl|protein:vir:96147 Length: 419 # NCBI annotation: ORF...    39   2e-04
gi|5321|lcl|protein:vir:99534 Length: 422 # NCBI annotation: put...    38   3e-04
gi|11736|lcl|protein:vir:79016 Length: 417 # NCBI annotation: pu...    37   4e-04
gi|8730|lcl|protein:vir:96840 Length: 421 # NCBI annotation: ORF...    33   0.005
gi|18891|lcl|protein:vir:8847 Length: 482 # NCBI annotation: ter...    30   0.042
gi|25110|lcl|protein:vir:80755 Length: 501 # NCBI annotation: pu...    27   0.36
gi|14676|lcl|protein:vir:2499 Length: 474 # NCBI annotation: put...    27   0.58
gi|3478|lcl|protein:vir:101370 Length: 494 # NCBI annotation: Pa...    27   0.64
gi|7504|lcl|protein:vir:99915 Length: 478 # NCBI annotation: gp2...    26   1.2
gi|14207|lcl|protein:vir:8315 Length: 503 # NCBI annotation: gp3...    25   1.7
gi|5525|lcl|protein:vir:95311 Length: 491 # NCBI annotation: put...    25   2.0
gi|12972|lcl|protein:vir:80678 Length: 503 # NCBI annotation: gp...    25   2.0
gi|16210|lcl|protein:vir:7319 Length: 491 # NCBI annotation: ter...    25   2.2
gi|4240|lcl|protein:vir:94734 Length: 469 # NCBI annotation: lar...    25   2.4
gi|18494|lcl|protein:vir:1636 Length: 469 # NCBI annotation: put...    23   6.5
gi|19181|lcl|protein:vir:9572 Length: 459 # NCBI annotation: gp3...    23   8.5

>gi|6775|lcl|protein:vir:96067 Length: 460 # NCBI annotation: conserved
 hypothetical protein ORF004 # Family: family:all:54 # MgeID: mge:1597
 # MgeName: F8 # Cross-refs:
 genbank:acc:YP_001294421;genbank:gi:149408318;genbank:GeneID:5237186
          Length = 460

 Score = 951 bits (2457), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/460 (100%), Positives = 460/460 (100%) Query: 1 MYKLNPALRAVWRTRARYKVIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISE 60 MYKLNPALRAVWRTRARYKVIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISE Sbjct: 1 MYKLNPALRAVWRTRARYKVIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISE 60 Query: 61 SVYTLIKDKIENSEYNGEFIFTKNSIKHKRTGSEFLFYGIARNLSEIKSTEGIDILWLEE 120 SVYTLIKDKIENSEYNGEFIFTKNSIKHKRTGSEFLFYGIARNLSEIKSTEGIDILWLEE Sbjct: 61 SVYTLIKDKIENSEYNGEFIFTKNSIKHKRTGSEFLFYGIARNLSEIKSTEGIDILWLEE 120 Query: 121 AHYLTQEQWEVIEPTIRKENSEIWIIFNPNEVTDFVYQNFVVKPPKDAFVKMINWNENPF 180 AHYLTQEQWEVIEPTIRKENSEIWIIFNPNEVTDFVYQNFVVKPPKDAFVKMINWNENPF Sbjct: 121 AHYLTQEQWEVIEPTIRKENSEIWIIFNPNEVTDFVYQNFVVKPPKDAFVKMINWNENPF 180 Query: 181 LSETMLKVIHEAYERDKDQAEHIYGGIPKTGGDKSVINLKFILAAIDAHKKLGWEPAGSK 240 LSETMLKVIHEAYERDKDQAEHIYGGIPKTGGDKSVINLKFILAAIDAHKKLGWEPAGSK Sbjct: 181 LSETMLKVIHEAYERDKDQAEHIYGGIPKTGGDKSVINLKFILAAIDAHKKLGWEPAGSK 240 Query: 241 RIGFDVADDGEDANATTLMHGNVIMEVDEWDGLEDELLKSSSRVYNLAKMKGASVTYDSI 300 RIGFDVADDGEDANATTLMHGNVIMEVDEWDGLEDELLKSSSRVYNLAKMKGASVTYDSI Sbjct: 241 RIGFDVADDGEDANATTLMHGNVIMEVDEWDGLEDELLKSSSRVYNLAKMKGASVTYDSI 300 Query: 301 GVGAHVGSKFAELNDSSPDFKLTYDPFNAGGAVDKPDDIYMKLPHTTIKNKDHFSNIKAQ 360 GVGAHVGSKFAELNDSSPDFKLTYDPFNAGGAVDKPDDIYMKLPHTTIKNKDHFSNIKAQ Sbjct: 301 GVGAHVGSKFAELNDSSPDFKLTYDPFNAGGAVDKPDDIYMKLPHTTIKNKDHFSNIKAQ 360 Query: 361 KWEEVATRFRKTYEAVVHGKVYPFDELISINSETIHPDKLNQLCIELSSPRKDLDMNGRF 420 KWEEVATRFRKTYEAVVHGKVYPFDELISINSETIHPDKLNQLCIELSSPRKDLDMNGRF Sbjct: 361 KWEEVATRFRKTYEAVVHGKVYPFDELISINSETIHPDKLNQLCIELSSPRKDLDMNGRF 420 Query: 421 KVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF 460 KVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF Sbjct: 421 KVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF 460 >gi|4914|lcl|protein:vir:99572 Length: 459 # NCBI annotation: TerL-like protein # Family: family:all:54 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039811;genbank:gi:126011061;genbank:Ge neID:4818267 Length = 459 Score = 653 bits (1684), Expect = 0.0, Method: Compositional matrix 
adjust. Identities = 309/460 (67%), Positives = 374/460 (81%), Gaps = 1/460 (0%) Query: 1 MYKLNPALRAVWRTRARYKVIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISE 60 MY LNPALR W +ARYK +YGGRASSKSHDA G AVYLA NY +KFLCARQFQN+ISE Sbjct: 1 MYDLNPALRDFWLDKARYKALYGGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISE 60 Query: 61 SVYTLIKDKIENSEYNGEFIFTKNSIKHKRTGSEFLFYGIARNLSEIKSTEGIDILWLEE 120 SVYTLIK KI+ + + EF T +SI+HK+TG+EFLFYGIARNL+EIKSTEG+DILWLEE Sbjct: 61 SVYTLIKGKIDAAGWTKEFDVTISSIRHKKTGAEFLFYGIARNLNEIKSTEGVDILWLEE 120 Query: 121 AHYLTQEQWEVIEPTIRKENSEIWIIFNPNEVTDFVYQNFVVKPPKDAFVKMINWNENPF 180 A YLT+EQW VI PTIR+E S+IW+I+NP++ TDF+YQNFVV PP D K INW ENPF Sbjct: 121 AQYLTEEQWNVINPTIRREGSQIWLIWNPDQYTDFIYQNFVVNPPADCLSKQINWTENPF 180 Query: 181 LSETMLKVIHEAYERDKDQAEHIYGGIPKTGGDKSVINLKFILAAIDAHKKLGWEPAGSK 240 LS+TMLKVI++ Y+RD AEH+YGG PK GGDK++I L+++LAAIDAHKKLGW+ GSK Sbjct: 181 LSDTMLKVIYDEYQRDPKLAEHVYGGAPKMGGDKAIIQLQYVLAAIDAHKKLGWKIEGSK 240 Query: 241 RIGFDVADDGEDANATTLMHGNVIMEVDEWDGLEDELLKSSSRVYNLAKMKGASVTYDSI 300 R GFD+ADDG+DANA GNV++ +EWDGLEDELLKSS++V+N A KG+S+ +DSI Sbjct: 241 RTGFDIADDGDDANAIVDAIGNVVVWAEEWDGLEDELLKSSTKVFNHALEKGSSIIFDSI 300 Query: 301 GVGAHVGSKFAELNDSSPDFKLTYDPFNAGGAVDKPDDIYMKLPHTTIKNKDHFSNIKAQ 360 GVGAH GSKF+ELN++ ++ Y+PFNAGGAV PD YMKLPH I N++HFSN+KAQ Sbjct: 301 GVGAHAGSKFSELNEAR-SLEIIYEPFNAGGAVYDPDGTYMKLPHVVITNREHFSNVKAQ 359 Query: 361 KWEEVATRFRKTYEAVVHGKVYPFDELISINSETIHPDKLNQLCIELSSPRKDLDMNGRF 420 W+ VATRFRKTYE V +G +P DELISI+SE + L++L IEL+SP KD+D G+F Sbjct: 360 MWDRVATRFRKTYEVVTYGANHPHDELISISSEHVPAKILDKLKIELASPHKDVDGMGKF 419 Query: 421 KVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF 460 KVESKKDMREKR IKSPNIAD+ IM+ I P R+P GFFDF Sbjct: 420 KVESKKDMREKRGIKSPNIADAFIMAMIQPKRQPAGFFDF 459 >gi|17761|lcl|protein:vir:171 Length: 470 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112076;genbank:gi:13559866;genbank:GeneID :921009 Length = 470 Score = 273 bits (697), Expect = 4e-75, Method: Compositional matrix adjust. 
Identities = 161/442 (36%), Positives = 242/442 (54%), Gaps = 12/442 (2%) Query: 17 RYKVIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISESVYTLIKDKIENSEYN 76 RYKV GGR S KS + V A ++ LCAR+ QN IS+SV L++D IE Y+ Sbjct: 16 RYKVAKGGRGSGKSWAIARLLVEAARRQPVRILCARELQNSISDSVIRLLEDTIEREGYS 75 Query: 77 GEFIFTKNSIKHKRTGSEFLFYGIARNLSEIKSTEGIDILWLEEAHYLTQEQWEVIEPTI 136 EF ++ I+H T +EF+FYGI N ++IKS EGIDI W+EEA +T+E W+++ PTI Sbjct: 76 AEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGIDICWVEEAEAVTKESWDILIPTI 135 Query: 137 RKENSEIWIIFNPNEVTDFVYQNFVVKPPKDAFVKMINWNENPFLSETMLKVIHEAYERD 196 RK SEIW+ FNP + D YQ FVV PP D + +N+ +NP E + + E R+ Sbjct: 136 RKPFSEIWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFPEVLRLEMEECKRRN 195 Query: 197 KDQAEHIYGGIPKTGGDKSVINLKFILAAIDAHKKLGWEPAGSKRIGFDVADDGEDANAT 256 HI+ G P + D ++I +++ AA DAHKKLGW+ G+ D +D G DA Sbjct: 196 PTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGY 255 Query: 257 TLMHGNVIMEVDEWDGLEDELLKSSSRVYNLAKMKGAS-VTYDSIGVGA----HVGSKFA 311 HG+V+ + E GL ++ + + +LA GA +D GVGA F+ Sbjct: 256 ASRHGSVVKRIAE--GLLMDINEGADWATSLAIEDGADHYLWDGDGVGAGLRRQTTEAFS 313 Query: 312 ELNDSSPDFKLTYDPFNAGGAVDK---PDDIYMKLPHTTIKNKDHFSNIKAQKWEEVATR 368 ++ FK + PF+ D++ TI D F N +AQ + +A R Sbjct: 314 GKKITATMFKGSESPFDEDAPYQAGAWADEVVQGDNVRTI--GDVFRNKRAQFYYALADR 371 Query: 369 FRKTYEAVVHGKVYPFDELISINSETIHPDKLNQLCIELSSPRKDLDMNGRFKVESKKDM 428 TY AVVHG+ D+++S + E I + L +L EL+ ++ + NG+ ++ +K +M Sbjct: 372 LYLTYRAVVHGEYADPDDMLSFDKEAIGENILEKLFAELTQIQRKFNNNGKLELMTKVEM 431 Query: 429 REKRKIKSPNIADSVIMSAILP 450 ++K I SPN+AD+++M P Sbjct: 432 KQKLGIPSPNLADALMMCMHCP 453 >gi|5142|lcl|protein:vir:105417 Length: 470 # NCBI annotation: gene 2 protein # Family: family:all:54 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958178;genbank:gi:41057280;genbank:GeneID :2716664 Length = 470 Score = 272 bits (696), Expect = 5e-75, Method: Compositional matrix adjust. 
Identities = 161/442 (36%), Positives = 241/442 (54%), Gaps = 12/442 (2%) Query: 17 RYKVIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISESVYTLIKDKIENSEYN 76 RYKV GGR S KS + V A ++ LCAR+ QN IS+SV L++D IE Y+ Sbjct: 16 RYKVAKGGRGSGKSWAIARLLVEAARRQPVRILCARELQNSISDSVIRLLEDTIEREGYS 75 Query: 77 GEFIFTKNSIKHKRTGSEFLFYGIARNLSEIKSTEGIDILWLEEAHYLTQEQWEVIEPTI 136 EF ++ I+H T +EF+FYGI N ++IKS EGIDI W+EEA +T+E W+++ PTI Sbjct: 76 AEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGIDICWVEEAEAVTKESWDILIPTI 135 Query: 137 RKENSEIWIIFNPNEVTDFVYQNFVVKPPKDAFVKMINWNENPFLSETMLKVIHEAYERD 196 RK SEIW+ FNP + D YQ FVV PP D + +N+ +NP E + + E R+ Sbjct: 136 RKPFSEIWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFPEVLRLEMEECKRRN 195 Query: 197 KDQAEHIYGGIPKTGGDKSVINLKFILAAIDAHKKLGWEPAGSKRIGFDVADDGEDANAT 256 HI+ G P + D ++I +++ AA DAHKKLGW+ G+ D +D G DA Sbjct: 196 PTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGY 255 Query: 257 TLMHGNVIMEVDEWDGLEDELLKSSSRVYNLAKMKGAS-VTYDSIGVGA----HVGSKFA 311 HG+V+ + E GL ++ + + +LA GA +D GVGA F+ Sbjct: 256 ASRHGSVVKRIAE--GLLMDINEGADWATSLAIEDGADHYLWDGDGVGAGLRRQTTEAFS 313 Query: 312 ELNDSSPDFKLTYDPFNAGGAVDK---PDDIYMKLPHTTIKNKDHFSNIKAQKWEEVATR 368 ++ FK + PF+ D++ TI D F N +AQ + +A R Sbjct: 314 GKKITATMFKGSESPFDEDAPYQAGAWADEVVQGDNVRTI--GDVFRNKRAQFYYALADR 371 Query: 369 FRKTYEAVVHGKVYPFDELISINSETIHPDKLNQLCIELSSPRKDLDMNGRFKVESKKDM 428 TY AVVHG+ D+++S + E I L +L EL+ ++ + NG+ ++ +K +M Sbjct: 372 LYLTYRAVVHGEYADPDDMLSFDKEAIGEKMLEKLFAELTQIQRKFNNNGKLELMTKVEM 431 Query: 429 REKRKIKSPNIADSVIMSAILP 450 ++K I SPN+AD+++M P Sbjct: 432 KQKLGIPSPNLADALMMCMHCP 453 >gi|14466|lcl|protein:vir:3519 Length: 469 # NCBI annotation: P18 # Family: family:all:54 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050979;genbank:gi:9633565;genbank:GeneID: 1262312 Length = 469 Score = 212 bits (539), Expect = 8e-57, Method: Compositional matrix adjust. 
Identities = 150/458 (32%), Positives = 241/458 (52%), Gaps = 32/458 (6%) Query: 17 RYKVIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISESVYTLIKDKIENSEYN 76 R KV +GGR K+ IA+ A+ ++ +FLC R+F N I +S + +++ ++E Sbjct: 6 RIKVYFGGRGGMKTVSFAKIALITASMHKRRFLCLREFMNSIEDSGHAVLQAEVETLGLQ 65 Query: 77 GEFIFTKNSIKHKRTGSEFLFYGIARNLSEIKSTEGIDILWLEEAHYLTQEQWEVIEPTI 136 F N+ S F + +ARN++ IKS D+ W+EEA ++++ + + PTI Sbjct: 66 NRFRIL-NTYIEGINDSIFKYGQLARNIASIKSKHDFDVAWVEEAETVSEKSLDSLIPTI 124 Query: 137 RKENSEIWIIFNPNEVTDFVYQNFVVKPPK------------DAFVKMINWNENPFLSET 184 RK SE+W FNP E VY+ FV KP K D +V +++ +NP+L Sbjct: 125 RKPGSELWFSFNPAEEDGAVYKRFV-KPYKELIDTQGYYEDDDLYVGKVSYLDNPWLPAE 183 Query: 185 MLKVIHEAYERDKDQAEHIYGGIPKTGGDKSVINLKFILAAIDAHKKLGWEPAGSKRIGF 244 + + + + H+YGG + ++I +++ AAIDAH KLG++P+G + + F Sbjct: 184 LKNDAQKMKRENYKKWRHVYGGECDANYEDALIQPEWVEAAIDAHIKLGFKPSGIRVVTF 243 Query: 245 DVADDGEDANATTLMHGNVIMEVDEWDGLEDELLKSSSRVYNLA-KMKGASVTYDSIGVG 303 D AD G+D A + +G +I + W E ++ ++ ++ A + YD+IG+G Sbjct: 244 DPADSGQDEKALSKRYGVLIEDCVSWS--EGDVADATMTAFDDAFDYRADDFIYDNIGLG 301 Query: 304 AHVGSKFAELNDSSPDFKLTYDPFNAGGAVDKPDDIYMK-----LPHTTIKNKDH---FS 355 A G+ L S+ K+ F AG + D PD+IY+ LP + ++ H F Sbjct: 302 A--GTVKTHLRHSNDGNKMVVTGFGAGDSPDYPDEIYVPGNGEYLPSSNNDDRTHRDTFR 359 Query: 356 NIKAQKWEEVATRFRKTYEAVVHGKVYPFDELISINSETIHPDKLNQLCIEL-SSPRKDL 414 N +AQ W +A RF KT+ AV G+ D LIS++S+ KL+QL EL RK Sbjct: 360 NKRAQYWVYLADRFYKTWRAVEKGEYLDPDALISLSSKIA---KLSQLKSELIKQQRKRT 416 Query: 415 DMNGRFKVESKKDMREKRKIKSPNIADSVIMSAILPIR 452 N ++ SK +MR K IKSPN+AD+++MS P+R Sbjct: 417 PGNRLIQLMSKDEMRLK-GIKSPNMADTLMMSFANPLR 453 >gi|8380|lcl|protein:vir:96782 Length: 408 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224236;genbank:gi:62362371;genbank:GeneID :3345721 Length = 408 Score = 96.7 bits (239), Expect = 6e-22, Method: Compositional matrix adjust. 
Identities = 55/175 (31%), Positives = 94/175 (53%), Gaps = 2/175 (1%) Query: 13 RTRARYKVIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISESVYTLIKDKIEN 72 R R++ YG R S KS + +A A +++ LC R+ Q I ES + +K+ I++ Sbjct: 20 RGAVRFRGAYGSRGSGKSFNFAKMAAIWGAIEKMRILCTRELQVSIKESFHAELKNAIKS 79 Query: 73 SEY-NGEFIFTKNSIKHKRTGSEFLFYGIARNLSEIKSTEGIDILWLEEAHYLTQEQWEV 131 E+ + + + I++ G+EFLF G+ + +KST ID+ +EEA + + W Sbjct: 80 DEWLSSIYDVGIDYIRNNNNGTEFLFKGLRHGMGSVKSTAQIDLTIVEEAEDVPENAWVE 139 Query: 132 IEPTI-RKENSEIWIIFNPNEVTDFVYQNFVVKPPKDAFVKMINWNENPFLSETM 185 + PTI R + +E W+I+NP + V + F P DA V +N+ +NPF + + Sbjct: 140 LLPTIFRTDKAECWVIWNPRKKGSPVDKRFRQFKPDDAVVVEMNYYDNPFFPKGL 194 >gi|2819|lcl|protein:vir:105705 Length: 439 # NCBI annotation: gp2 # Family: family:all:54 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224140;genbank:gi:62362215;genbank:GeneID :3342458 Length = 439 Score = 94.4 bits (233), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 62/222 (27%), Positives = 110/222 (49%), Gaps = 15/222 (6%) Query: 17 RYKVIYGGRASSKSHDAG---GIAVYLAANYRLK--FLCARQFQNRISESVYTLIKDKIE 71 RY+ +GGR S+K+ + Y AA + LCAR++ N + ES +K I Sbjct: 23 RYRGAHGGRGSAKTRTFALMTAVKAYQAAEANISGVILCAREYMNSLEESSMEEVKQAIR 82 Query: 72 NSEY-NGEFIFTKNSIKHKRTGSEFLFYGIARNLSEIKSTEGIDILWLEEAHYLTQEQWE 130 + + + F + I+ K ++F G+ NL IKS I + W++EA ++ W+ Sbjct: 83 SVAWLDDYFDIGEKYIRTKNRKVSYVFCGLRHNLDSIKSKARILVAWVDEAESVSSTAWK 142 Query: 131 VIEPTIRKENSEIWIIFNPNEVTDFVYQNFVVKPPKDAFVKMINWNENPFLSETMLKVIH 190 + PT+R+E SEIW+ +NP + + F PPK + + +N+ +NP+ V+ Sbjct: 143 KLRPTVREEGSEIWVTWNPEKDGSATDKLFRKNPPKSSMIVEMNYVDNPWFP----AVLE 198 Query: 191 EAYERD---KDQAEH--IYGGIPKTGGDKSVINLKFILAAID 227 E + D D A++ I+ G DK V+ K+++ + + Sbjct: 199 EERQEDLANLDYADYAWIWEGAYLENSDKQVLANKYVVQSFE 240 >gi|3859|lcl|protein:vir:107748 Length: 532 # NCBI annotation: gp33 TerL # Family: family:all:54 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024878;genbank:gi:48697520;genbank:GeneID :2948367 Length = 532 Score = 87.0 bits 
(214), Expect = 4e-19, Method: Compositional matrix adjust. Identities = 79/250 (31%), Positives = 117/250 (46%), Gaps = 20/250 (8%) Query: 216 VINLKFILAAIDAHKKLGWEPAGSKRIGFDVADDGEDANATTLMHGNVIMEVDEWDGLED 275 +I L++I AAIDA KLG G + DVAD+G+D NA G + + W G Sbjct: 295 LIPLEWIDAAIDADVKLGLTVTGQRFSSLDVADEGKDMNAFGSRLGIRMDYAESWSGKGS 354 Query: 276 ELLKSSSRVYNLA-KMKGASVTYDSIGVGAHVGSKFAELNDSSPDFK----LTYDPFNAG 330 + ++ R L G +DS G+G V AE ++ P+ K + F Sbjct: 355 NIYGTTLRTIGLVIAQNGRDFQFDSDGLGVGVRGD-AEAINALPERKAYPKIDAIAFRGS 413 Query: 331 GAVDKPDDIYMKLP--HTTIKNKDHFSNIKAQKWEEVATRFRKTYEAVVHGKVYPFDELI 388 +V +PD ++P + +KN D F N KAQ++ + RF TY AVV Y DE+I Sbjct: 414 SSVREPDK---QVPGAYKGVKNVDFFQNRKAQEYWALRMRFEATYRAVVEKLEYDPDEII 470 Query: 389 SINSETIHPDKLNQLCIELSSPRKDLDMNGRFKVESKKDMREKRKIKSPNIADSVIMSAI 448 SI+S PD L ++ +EL P G+ ++ D + SPN AD +M Sbjct: 471 SISSRI--PD-LQKIRMELHQPLYKPSTTGKIMIQKTPD-----GMVSPNYADMTMM-LY 521 Query: 449 LPIRKPKGFF 458 P + +G F Sbjct: 522 APQQTKRGIF 531 >gi|15353|lcl|protein:vir:9921 Length: 425 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795683;genbank:gi:28876465;genbank:GeneID :1257982 Length = 425 Score = 66.2 bits (160), Expect = 7e-13, Method: Compositional matrix adjust. 
Identities = 55/188 (29%), Positives = 90/188 (47%), Gaps = 15/188 (7%) Query: 20 VIYGGRASSKSHDAGGIAVYLAANYRLK----FLCARQFQNRISESVYTLIKDKIENSEY 75 V YGG +S KSH + A N + K L R+ + +SV+ D + N Y Sbjct: 37 VHYGGASSGKSHGVFQKIILKALNPKFKHPRKILVLRKVGATVRDSVFA---DIMSNLSY 93 Query: 76 NGEFIFTKNSIKHKR----TGSEFLFYGIARNLSEIKSTEGIDILWLEEAHYLTQEQWEV 131 G K ++ R G+EF+F G+ N +IKS +GI + +EEA T + + Sbjct: 94 FGILDKCKINMSAFRITLPNGAEFIFKGMD-NPEKIKSIKGISDVVMEEASEFTLDDYTQ 152 Query: 132 IEPTIRKEN---SEIWIIFNPNEVTDFVYQNFVVKPPKDAFVKMINWNENPFLSETMLKV 188 + +R + +I+++FNP ++VY+ F VK PK+ V + +N FL + + Sbjct: 153 LTLRLRDKKHLEKQIYLMFNPVSKVNWVYKAFFVKTPKNTVVYQTTYKDNRFLDDVTREN 212 Query: 189 IHEAYERD 196 I E R+ Sbjct: 213 IEELANRN 220 Score = 25.4 bits (54), Expect = 1.4, Method: Compositional matrix adjust. Identities = 16/61 (26%), Positives = 29/61 (47%), Gaps = 2/61 (3%) Query: 345 HTTIKNKDHFSNIKAQKWEEVATRFRKTYEAVVHGKVYPFDELI--SINSETIHPDKLNQ 402 TT K+ ++ + EE+A R Y+ G+ D+LI + + ++ DKL+ Sbjct: 196 QTTYKDNRFLDDVTRENIEELANRNEAYYKIYALGQFATLDKLIFPKYDKQILNKDKLSH 255 Query: 403 L 403 L Sbjct: 256 L 256 >gi|18074|lcl|protein:vir:5954 Length: 422 # NCBI annotation: hypothetical protein # Family: family:all:54 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690654;genbank:geneid:6329138;genbank:gi: 22855048;goa:P54308;interpro:IPR006437;interpro:IPR00670 1;interpro:IPR011441;uniprot:P54308;genbank:GeneID:95525 4 Length = 422 Score = 55.8 bits (133), Expect = 1e-09, Method: Compositional matrix adjust. 
Identities = 56/226 (24%), Positives = 103/226 (45%), Gaps = 26/226 (11%) Query: 3 KLNPALRAVWRTRARYK----VIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRI 58 K P VWRT + V+ GGR S+KS + L + FL R+ N + Sbjct: 9 KFTPHFLEVWRTVKAAQHLKYVLKGGRGSAKSTHIAMWIILLMMMMPITFLVIRRVYNTV 68 Query: 59 SESVYTLIKDKIENSEYNGEFIFTKNSIK--HKRTGSEFLFYGIARNLSEIKSTEG---- 112 +SV+ +K+ I+ E + +K+ ++ + G+ +F G ++ +IKS + Sbjct: 69 EQSVFEQLKEAIDMLEVGHLWKVSKSPLRLTYIPRGNSIIFRG-GDDVQKIKSIKASKFP 127 Query: 113 IDILWLEE-AHYLTQEQWEVIEPTIRKENSE------IWIIFNPNE-----VTDFVYQNF 160 + +W+EE A + T+E+ VIE ++ + + +NP + V +F Sbjct: 128 VAGMWIEELAEFKTEEEVSVIEKSVLRAELPPGCRYIFFYSYNPPKRKQSWVNKVFNSSF 187 Query: 161 VVKPPKDAFVKMINWNENPFLSETMLKVIHEAYERDKDQAEHIYGG 206 + P + FV + +NPFLS+ ++ E R++ + H Y G Sbjct: 188 L---PANTFVDHSTYLQNPFLSKAFIEEAEEVKRRNELKYRHEYLG 230 >gi|5794|lcl|protein:vir:98922 Length: 453 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164412;genbank:gi:56694902;genbank:GeneID :3197313 Length = 453 Score = 55.8 bits (133), Expect = 1e-09, Method: Compositional matrix adjust. 
Identities = 71/282 (25%), Positives = 113/282 (40%), Gaps = 45/282 (15%) Query: 4 LNPALRAVWRTRARYKVIYGGRASSKSHDAGGIAVYLAANYRLK-----FLCARQFQNRI 58 +NP + VW + Y ++ GGR S KS V++ Y LK + R+ N I Sbjct: 17 INPHFKEVWTSSKPYNILKGGRNSFKSSVIALKLVFMMLLYILKGEKANVVVIRKVGNTI 76 Query: 59 SESVYTLIKDKIENSEYNGEFIFTKNSIK--HKRTGSEFLFYGIARNLSEIKSTEGIDIL 116 +SV+ I+ I+ F T + K HKRTGS F FYG + ++KS + DI+ Sbjct: 77 RDSVFNKIQWAIKLFGLTRRFKPTVSPFKITHKRTGSTFYFYG-QDDFQKLKSNDIEDII 135 Query: 117 --WLEEAHYLTQEQWEVIEPTIRKENSEIWIIFNPNEVTDFVYQNFVVKPPKDAFVKMIN 174 W EEA + S + + + + +FV + PP++ + + Sbjct: 136 AVWYEEA--------AEFASEEEFDQSNVTFMRQKHPLAEFVQFFWSYNPPRNPYHWINE 187 Query: 175 W-------------------NENPFLSETMLKVIHEAYERDKDQAEHIYGGIPKTGGDKS 215 W ++ F++ MLK I D D +IY G P G + Sbjct: 188 WADKMVGEEDYLVHESSYLDDQLGFVTGQMLKDIERIKNNDHDYYRYIYLGEP-VGLGTN 246 Query: 216 VINLKFILAAIDAHKKLGWEPAGSKRIGFDVADDGEDANATT 257 V N+ K L P+ + I + DG A++ T Sbjct: 247 VYNMNLF-------KPLDQLPSDDRVIALFYSVDGGHAHSAT 281 >gi|655|lcl|protein:vir:1588 Length: 447 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695170;swissprot:trembl:o03927;genbank:gi :23455799;goa:O03927;interpro:IPR006437;interpro:IPR0067 01;interpro:IPR011441;uniprot:O03927;genbank:GeneID:9555 65 Length = 447 Score = 53.1 bits (126), Expect = 6e-09, Method: Compositional matrix adjust. 
Identities = 53/185 (28%), Positives = 90/185 (48%), Gaps = 26/185 (14%) Query: 4 LNPALRAVWRTRARYKVIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQF--------Q 55 +NP + +W T Y V GGR S KS I++ L + + R+ + Sbjct: 21 INPHFKRMWTTDKPYIVANGGRGSFKS---SVISLKLVTMVKKAIMQHRKANVIAVLANK 77 Query: 56 NRISESVYTLIKDKIENSEYNGEFIFTKN--SIKHKRTGSEFLFYGIARNLSEIKSTEGI 113 + + ++VY I+ + + + EFI K+ +I+HKRTGS F FYG A N ++KS Sbjct: 78 SDLHDTVYNQIQWALSMLDMDNEFIAYKSPLTIQHKRTGSSFYFYG-ADNPYKLKSNIVG 136 Query: 114 DI--LWLEEAHYL-TQEQWEVIEPTIRKENSEIWIIFNPNEVTDFVYQNFVVKPPKDAFV 170 D+ +W EEA + + + ++ PT ++ E W+ ++V F N PPK+ + Sbjct: 137 DVVAVWYEEAANMKSSDVFDQANPTFIRQKPE-WL----DQVKVFYSYN----PPKNPYD 187 Query: 171 KMINW 175 + W Sbjct: 188 WINEW 192 >gi|13020|lcl|protein:vir:80960 Length: 443 # NCBI annotation: TerL # Family: family:all:54 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468388;genbank:gi:157324962;genbank:Ge neID:5601395 Length = 443 Score = 51.6 bits (122), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 77/281 (27%), Positives = 121/281 (43%), Gaps = 40/281 (14%) Query: 4 LNPALRAVWRTRARYKVIYGGRASSKSHDAGGIAVYLA----ANYRLKFLCARQFQNRIS 59 +NPA +W ++ + + GGR+S KS I++ L AN +C R+ N + Sbjct: 21 INPAFYDLWLSKHNHIIAKGGRSSMKS---SVISLKLVEKKMANPMSNMVCLRKVANTLY 77 Query: 60 ESVYTLIKDKIENSEYNGEFIFTKN--SIKHKRTGSEFLFYGI--ARNLSEIKSTEG-ID 114 +SVY IK + +F F K+ I HK G+ F F G L +K G + Sbjct: 78 KSVYQQIKWALYEMGVADQFNFGKSPMEIIHKEWGTGFYFSGCDDPAKLKSMKIPVGYVS 137 Query: 115 ILWLEE-AHYLTQEQWEVIEPTIRKEN------SEIWIIFNPNE-----VTDFVYQNFVV 162 LW EE A + +V+E T +E+ I++ FNP V ++V Sbjct: 138 DLWFEELAEFSGVTDIDVVEDTFIREDLPQGQEVTIYMSFNPPRNPYEWVNEYVDSK--- 194 Query: 163 KPPKDAFVKMINW--NENPFLSETMLKVIHEAYERDKDQAEHIY-GGIPKTGGDKSVINL 219 + D + + +E FLS+ ++K I + + D D +Y G + G + +NL Sbjct: 195 RSDDDYLIHHTTYLDDEKGFLSKQIIKKIEKYKKNDLDYYRWMYLGEVIGLGDNVYNMNL 254 Query: 220 KFILAAIDAHKKLGWEPAGSKRIGFDVA-DDGEDANATTLM 259 L AI PA + I D A D G +ATT + Sbjct: 255 FQPLKAI---------PADDRLILIDFAIDTGHQVSATTCL 286 >gi|15836|lcl|protein:vir:37 Length: 443 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463463;swissprot:trembl:q9t1c1;genbank:gi :16798785;goa:Q9T1C1;uniprot:Q9T1C1;genbank:GeneID:92238 4 Length = 443 Score = 50.8 bits (120), Expect = 3e-08, Method: Compositional matrix adjust. 
Identities = 78/284 (27%), Positives = 120/284 (42%), Gaps = 46/284 (16%) Query: 4 LNPALRAVWRTRARYKVIYGGRASSKSHDAGGIAVYLA----ANYRLKFLCARQFQNRIS 59 +NPA +W ++ + + GGR+S KS I++ L AN +C R+ N + Sbjct: 21 INPAFYDLWLSKHNHIIAKGGRSSMKS---SVISLKLVEKKMANPMSNMVCLRKVANTLY 77 Query: 60 ESVYTLIKDKIENSEYNGEFIFTKN--SIKHKRTGSEFLFYGI--ARNLSEIKSTEG-ID 114 +SVY IK + +F F K+ I HK G+ F F G L +K G + Sbjct: 78 KSVYQQIKWALYEMGVADQFKFGKSPMEIVHKTWGTGFYFSGCDDPAKLKSMKIPVGYVS 137 Query: 115 ILWLEE-AHYLTQEQWEVIEPTIRKEN------SEIWIIFNP--------NEVTDFVYQN 159 LW EE A + +V+E T +E+ I++ FNP NE D Sbjct: 138 GLWFEELAEFSGVTDIDVVEDTFIREDLPQGQEVTIYMSFNPPRNPYEWVNEYVD----- 192 Query: 160 FVVKPPKDAFVKMINW--NENPFLSETMLKVIHEAYERDKDQAEHIY-GGIPKTGGDKSV 216 + D + + +E FLS+ ++K I + + D D +Y G + G + Sbjct: 193 -SKRSDDDYLIHHTTYLDDEKGFLSKQIIKKIEKYKKNDLDYYRWMYLGEVIGLGDNVYN 251 Query: 217 INLKFILAAIDAHKKLGWEPAGSKRIGFDVA-DDGEDANATTLM 259 +NL L AI PA + I D A D G +ATT + Sbjct: 252 MNLFQPLKAI---------PADDRLILIDFAIDTGHQVSATTYL 286 >gi|6093|lcl|protein:vir:95761 Length: 432 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950582;genbank:gi:119953777;genbank:GeneI D:5076831 Length = 432 Score = 50.4 bits (119), Expect = 4e-08, Method: Compositional matrix adjust. 
Identities = 51/187 (27%), Positives = 80/187 (42%), Gaps = 15/187 (8%) Query: 12 WRTRARYKVIYGGRASSKSHDAGGIAVYLAANYRL-KFLCARQFQNRISESVYTLIKDKI 70 WR++ Y+V+ GGR S KS + Y L R+F N +S YT +K Sbjct: 23 WRSKNFYRVVKGGRGSKKSKTTALYYIVAILKYNWANLLVVRRFSNTNKQSTYTDLKWAA 82 Query: 71 ENSEYNGEFIFTKN--SIKHKRTGSEFLFYGIARNL---SEIKSTEGIDILWLEEAHYL- 124 + F F ++ I K TG + LF G+ L S T + LWLEEA+ + Sbjct: 83 NRLNVSHLFKFNESLPEITVKATGQKILFRGLDDPLKITSITVDTGLLSWLWLEEAYQVE 142 Query: 125 TQEQWEVIEPTIRKE------NSEIWIIFNPNEVTDFVYQNFVVKPP--KDAFVKMINWN 176 Q+++E + +IR +I + FNP ++ F + KD F + Sbjct: 143 NQDKFETLVESIRGSIDAPDFFKQITVTFNPWSERHWLKSAFFDEDTRKKDVFADTTTYR 202 Query: 177 ENPFLSE 183 N +L + Sbjct: 203 VNEWLDQ 209 >gi|10483|lcl|protein:vir:105297 Length: 446 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950664;genbank:gi:119967835;genbank:GeneI D:4643176 Length = 446 Score = 48.9 bits (115), Expect = 1e-07, Method: Compositional matrix adjust. 
Identities = 53/218 (24%), Positives = 97/218 (44%), Gaps = 20/218 (9%) Query: 8 LRAVWR-TRARYK---VIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISESVY 63 ++W+ T+ R K V GGR S KS D I L Y + + R+ N ++ SV+ Sbjct: 14 FHSLWKATKDREKLNIVAKGGRGSGKSSDISIIITQLIMRYPMNAVVVRKADNTLATSVF 73 Query: 64 TLIKDKIENSEYNGEF--IFTKNSIKHKRTGSEFLFYGIARNLSEIKSTEG----IDILW 117 IK IE + + F + I + G+ +F G A+N +KS + I+W Sbjct: 74 EQIKWAIEEQKVSHLFKVKVSPMEITYVPRGNRIIFRG-AQNPERLKSLKDSRFPFSIMW 132 Query: 118 LEE-AHYLTQEQWEVIEPTIRKENSEIWIIFN-------PNEVTDFVYQNFVVK-PPKDA 168 +EE A + T+++ I ++ + + + + P +V + + P + Sbjct: 133 IEELAEFKTEDEVTTITNSMLRGELDDGLFYKFFFSYNPPKRKQSWVNKKYETSFQPDNT 192 Query: 169 FVKMINWNENPFLSETMLKVIHEAYERDKDQAEHIYGG 206 FV + +NPF+S+ ++ A ER++ + Y G Sbjct: 193 FVHHSTYLDNPFISKQFIQEAESAKERNEQRYRWEYMG 230 >gi|19292|lcl|protein:vir:4781 Length: 436 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150161;swissprot:trembl:q94m50;genbank:gi :15088772;goa:Q94M50;uniprot:Q94M50;genbank:GeneID:95597 6 Length = 436 Score = 48.9 bits (115), Expect = 1e-07, Method: Compositional matrix adjust. 
Identities = 79/303 (26%), Positives = 124/303 (40%), Gaps = 55/303 (18%) Query: 4 LNPALRAVWRTRARYKVIYGGRASSKSHD-----AGGIAVYLAANYRLKFLCARQFQNRI 58 +NP ++VW + Y V+ GGR S KS A + Y+ A + R+ N I Sbjct: 9 INPHFKSVWISSLPYNVLKGGRNSFKSSVIVLKLAYMMIRYIIAGEAANIVVIRKVANTI 68 Query: 59 SESVYTLIKDKIENSEYNGEFIFTKNSIK--HKRTGSEFLFYGIARNLSEIKSTEGIDIL 116 +SV+ + + +F T + K HK TGS F FYG + ++KS + +I+ Sbjct: 69 RDSVFNKVWWALNLFGIAEQFTKTVSPFKIVHKTTGSTFYFYG-QDDFQKLKSNDIGNII 127 Query: 117 --WLEE-AHYLTQEQWEVIEPTIRKENSEIWIIFNPNEVTDFVYQNFVVKPPKDAFVKMI 173 W EE A + QE ++ T ++ +P FV + PP++ + + Sbjct: 128 PVWYEEAAEFNDQEDFDQSNVTFMRQK-------HPR--AKFVQFFWSYNPPRNPYSWIN 178 Query: 174 NW-------------------NENPFLSETMLKVIHEAYERDKDQAEHIYGGIPKTGGDK 214 W +E F++E ML+ I E D D ++Y G G Sbjct: 179 EWFESIKTNKNYLAHSSTYLDDELGFVTEQMLEDIERIKENDYDYYRYLYLG-EAVGLGN 237 Query: 215 SVINLKFILAAIDAHKKLGWEPAGSKRIGFDVADDGEDANATTLM-------HGNVIMEV 267 +V N+ + AIDA P+ K IG A DG + T G VI+ + Sbjct: 238 NVYNMS-MFHAIDAC------PSDDKLIGISFALDGGHQQSATACCAFGITAKGKVIL-L 289 Query: 268 DEW 270 D W Sbjct: 290 DTW 292 >gi|5891|lcl|protein:vir:107098 Length: 421 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950600;genbank:gi:119953680;genbank:GeneI D:4643107 Length = 421 Score = 48.9 bits (115), Expect = 1e-07, Method: Compositional matrix adjust. 
Identities = 53/218 (24%), Positives = 97/218 (44%), Gaps = 20/218 (9%) Query: 8 LRAVWR-TRARYK---VIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISESVY 63 ++W+ T+ R K V GGR S KS D I L Y + + R+ N ++ SV+ Sbjct: 15 FHSLWKATKDREKLNIVAKGGRGSGKSSDISIIITQLIMRYPMNAVVVRKTDNTLATSVF 74 Query: 64 TLIKDKIENSEYNGEF--IFTKNSIKHKRTGSEFLFYGIARNLSEIKSTEG----IDILW 117 IK IE + + F + I + G+ +F G A+N +KS + I+W Sbjct: 75 EQIKWAIEEQKVSHLFKVKVSPMEITYVPRGNRIIFRG-AQNPERLKSLKDSRFPFSIMW 133 Query: 118 LEE-AHYLTQEQWEVIEPTIRKENSEIWIIFN-------PNEVTDFVYQNFVVK-PPKDA 168 +EE A + T+++ I ++ + + + + P +V + + P + Sbjct: 134 IEELAEFKTEDEVTTITNSMLRGELDDGLFYKFFFSYNPPKRKQSWVNKKYETSFQPDNT 193 Query: 169 FVKMINWNENPFLSETMLKVIHEAYERDKDQAEHIYGG 206 FV + +NPF+S+ ++ A ER++ + Y G Sbjct: 194 FVHHSTYLDNPFISKQFIQEAESAKERNEQRYRWEYMG 231 >gi|2779|lcl|protein:vir:99776 Length: 425 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004302;genbank:gi:122891756;genbank:Ge neID:4712331 Length = 425 Score = 48.1 bits (113), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 41/146 (28%), Positives = 75/146 (51%), Gaps = 7/146 (4%) Query: 44 YRLKFLCARQFQNRISESVYTLIKDKIENSEYNGEFIFTKNSIKHKR-TGSEFLFYGIAR 102 Y + L R+ Q+ I +S++ +KD + N ++ K K G+ FLF G+ Sbjct: 59 YPRRILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKVGLPNGAVFLFKGLD- 117 Query: 103 NLSEIKSTEGIDILWLEEAHYLTQEQWEVIEPTIRKE---NSEIWIIFNPNEVTDFVYQN 159 N +IKS +GI + +EEA T + + +R+ N +I++IFNP ++VY+ Sbjct: 118 NPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERKHVNKQIFLIFNPVSKLNWVYKY 177 Query: 160 FVV--KPPKDAFVKMINWNENPFLSE 183 F +P ++ ++ ++ +N FL E Sbjct: 178 FFEHGEPMENVMIRQSSYRDNKFLDE 203 >gi|17691|lcl|protein:vir:9305 Length: 447 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803283;genbank:gi:29028593;genbank:GeneID :1258039 Length = 447 Score = 47.8 bits (112), Expect = 3e-07, Method: Compositional matrix adjust. 
Identities = 40/146 (27%), Positives = 76/146 (52%), Gaps = 7/146 (4%) Query: 44 YRLKFLCARQFQNRISESVYTLIKDKIENSEYNGEFIFTKNSIKHKR-TGSEFLFYGIAR 102 Y + L R+ Q+ I +S++ +KD + N ++ K K + G+ FLF G+ Sbjct: 81 YPRRILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKVELPNGAVFLFKGLD- 139 Query: 103 NLSEIKSTEGIDILWLEEAHYLTQEQWEVIEPTIRKE---NSEIWIIFNPNEVTDFVYQN 159 N +IKS +GI + +EEA T + + +R+ N +I+++FNP ++VY+ Sbjct: 140 NPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERKHVNKQIFLMFNPVSKLNWVYKY 199 Query: 160 FVV--KPPKDAFVKMINWNENPFLSE 183 F +P ++ ++ ++ +N FL E Sbjct: 200 FFEHGEPMENVMIRQSSYRDNKFLDE 225 >gi|7301|lcl|protein:vir:96238 Length: 447 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239566;genbank:gi:66395301;genbank:GeneID :5132787 Length = 447 Score = 47.8 bits (112), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 40/146 (27%), Positives = 76/146 (52%), Gaps = 7/146 (4%) Query: 44 YRLKFLCARQFQNRISESVYTLIKDKIENSEYNGEFIFTKNSIKHKR-TGSEFLFYGIAR 102 Y + L R+ Q+ I +S++ +KD + N ++ K K + G+ FLF G+ Sbjct: 81 YPRRILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKVELPNGAVFLFKGLD- 139 Query: 103 NLSEIKSTEGIDILWLEEAHYLTQEQWEVIEPTIRKE---NSEIWIIFNPNEVTDFVYQN 159 N +IKS +GI + +EEA T + + +R+ N +I+++FNP ++VY+ Sbjct: 140 NPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERKHVNKQIFLMFNPVSKLNWVYKY 199 Query: 160 FVV--KPPKDAFVKMINWNENPFLSE 183 F +P ++ ++ ++ +N FL E Sbjct: 200 FFEHGEPMENVMIRQSSYRDNKFLDE 225 >gi|7643|lcl|protein:vir:96370 Length: 447 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239642;genbank:gi:66395379;genbank:GeneID :5132846 Length = 447 Score = 47.8 bits (112), Expect = 3e-07, Method: Compositional matrix adjust. 
Identities = 40/146 (27%), Positives = 76/146 (52%), Gaps = 7/146 (4%) Query: 44 YRLKFLCARQFQNRISESVYTLIKDKIENSEYNGEFIFTKNSIKHKR-TGSEFLFYGIAR 102 Y + L R+ Q+ I +S++ +KD + N ++ K K + G+ FLF G+ Sbjct: 81 YPRRILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKVELPNGAVFLFKGLD- 139 Query: 103 NLSEIKSTEGIDILWLEEAHYLTQEQWEVIEPTIRKE---NSEIWIIFNPNEVTDFVYQN 159 N +IKS +GI + +EEA T + + +R+ N +I+++FNP ++VY+ Sbjct: 140 NPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERKHVNKQIFLMFNPVSKLNWVYKY 199 Query: 160 FVV--KPPKDAFVKMINWNENPFLSE 183 F +P ++ ++ ++ +N FL E Sbjct: 200 FFEHGEPMENVMIRQSSYRDNKFLDE 225 >gi|11588|lcl|protein:vir:78831 Length: 447 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285355;genbank:gi:148717883;genbank:Ge neID:5246962 Length = 447 Score = 47.8 bits (112), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 40/146 (27%), Positives = 76/146 (52%), Gaps = 7/146 (4%) Query: 44 YRLKFLCARQFQNRISESVYTLIKDKIENSEYNGEFIFTKNSIKHKR-TGSEFLFYGIAR 102 Y + L R+ Q+ I +S++ +KD + N ++ K K + G+ FLF G+ Sbjct: 81 YPRRILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKVELPNGAVFLFKGLD- 139 Query: 103 NLSEIKSTEGIDILWLEEAHYLTQEQWEVIEPTIRKE---NSEIWIIFNPNEVTDFVYQN 159 N +IKS +GI + +EEA T + + +R+ N +I+++FNP ++VY+ Sbjct: 140 NPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERKHVNKQIFLMFNPVSKLNWVYKY 199 Query: 160 FVV--KPPKDAFVKMINWNENPFLSE 183 F +P ++ ++ ++ +N FL E Sbjct: 200 FFEHGEPMENVMIRQSSYRDNKFLDE 225 >gi|9829|lcl|protein:vir:103950 Length: 425 # NCBI annotation: phage terminase large subunit # Family: family:all:54 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873987;genbank:gi:118430762;genbank:GeneI D:4525444 Length = 425 Score = 47.4 bits (111), Expect = 3e-07, Method: Compositional matrix adjust. 
Identities = 40/146 (27%), Positives = 76/146 (52%), Gaps = 7/146 (4%) Query: 44 YRLKFLCARQFQNRISESVYTLIKDKIENSEYNGEFIFTKNSIKHKR-TGSEFLFYGIAR 102 Y + L R+ Q+ I +S++ +KD + N ++ K K + G+ FLF G+ Sbjct: 59 YPRRILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKVELPNGAVFLFKGLD- 117 Query: 103 NLSEIKSTEGIDILWLEEAHYLTQEQWEVIEPTIRKE---NSEIWIIFNPNEVTDFVYQN 159 N +IKS +GI + +EEA T + + +R+ N +I+++FNP ++VY+ Sbjct: 118 NPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERKHVNKQIFLMFNPVSKLNWVYKY 177 Query: 160 FVV--KPPKDAFVKMINWNENPFLSE 183 F +P ++ ++ ++ +N FL E Sbjct: 178 FFEHGEPMENVMIRQSSYRDNKFLDE 203 >gi|5208|lcl|protein:vir:106641 Length: 446 # NCBI annotation: ORF005 # Family: family:all:54 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239489;genbank:gi:66395220;genbank:GeneID :4555795 Length = 446 Score = 45.1 bits (105), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 39/146 (26%), Positives = 75/146 (51%), Gaps = 7/146 (4%) Query: 44 YRLKFLCARQFQNRISESVYTLIKDKIENSEYNGEFIFTKNSIKHKR-TGSEFLFYGIAR 102 Y + L R+ Q+ I +S++ +K + N ++ K K + G+ FLF G+ Sbjct: 81 YPRRILWLRKVQSTIKDSLFEDVKGCLINFGIWDMCLWNKTDNKVELPNGAVFLFKGLD- 139 Query: 103 NLSEIKSTEGIDILWLEEAHYLTQEQWEVIEPTIRKE---NSEIWIIFNPNEVTDFVYQN 159 N +IKS +GI + +EEA T + + +R+ N +I+++FNP ++VY+ Sbjct: 140 NPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERKHMNKQIFLMFNPVSKLNWVYKY 199 Query: 160 FVV--KPPKDAFVKMINWNENPFLSE 183 F +P ++ ++ ++ +N FL E Sbjct: 200 FFEHGEPMENVMIRQSSYRDNKFLDE 225 >gi|3326|lcl|protein:vir:94525 Length: 440 # NCBI annotation: putative large subunit terminase # Family: family:all:54 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223885;genbank:gi:62327097;genbank:GeneID :5075541 Length = 440 Score = 45.1 bits (105), Expect = 2e-06, Method: Compositional matrix adjust. 
Identities = 42/160 (26%), Positives = 75/160 (46%), Gaps = 12/160 (7%) Query: 2 YKLNPALRAVWRTRARYKVIYGGRASSKSH-DAGGIAVYLAANYRLKFLCARQFQNRISE 60 Y ++ A ++ +R RY V G R S KS+ A + + + + +L RQ+ + Sbjct: 21 YIVSKAYYPMFNSRDRYLVYKGSRGSGKSYATAAKVIIDIMMYPYVNWLVTRQYATTQKD 80 Query: 61 SVYTLIKDKIENSEYNGEFIFTKN--SIKHKRTGSEFLFYGI--ARNLSEIKSTEG-IDI 115 S + I+ + F FTK+ I +K+TG + F G+ ++ I+ G I Sbjct: 81 STFATIRKVAHSMGVLDLFKFTKSPLEITYKQTGQKVFFRGMDDPLKITSIQPVTGFICR 140 Query: 116 LWLEEAHYL-TQEQWEVIEPTIRKENS-----EIWIIFNP 149 W EEA+ L + + ++ +E ++R E + I FNP Sbjct: 141 RWCEEAYELKSLDAFDTVEESMRGELPPGGFYQTVITFNP 180 >gi|9289|lcl|protein:vir:97165 Length: 431 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239721;genbank:gi:66394878;genbank:GeneID :5130898 Length = 431 Score = 43.1 bits (100), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 43/160 (26%), Positives = 64/160 (40%), Gaps = 13/160 (8%) Query: 3 KLNPALRAVWRTRARYKVIYGGRASSKSHDAGGIAVYLAANYRL-KFLCARQFQNRISES 61 K+ W + Y+V+ G R S KS +Y Y L R+F N +S Sbjct: 10 KIGGGYNKFWHNKNFYRVVKGSRGSKKSKTTAINLIYRIMKYDWANILVVRRFSNTNKQS 69 Query: 62 VYTLIKDKIENSEYNGEFIFTKN--SIKHKRTGSEFLFYGIARNLSEIKSTEGIDIL--- 116 YT +K F F ++ I +K TG + LF G+ L T IL Sbjct: 70 TYTDLKWATNQLGVAHLFKFNESLPEITYKPTGQKILFRGLDDPLKITSITVDTGILCWA 129 Query: 117 WLEEAHYL-TQEQWEVIEPTIRKENS------EIWIIFNP 149 W EEA+ + T ++ + +IR +I + FNP Sbjct: 130 WFEEAYQIETFAKFSTVVESIRGSYDSPEFFKQITVTFNP 169 >gi|7569|lcl|protein:vir:96300 Length: 421 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240307;genbank:gi:66395973;genbank:GeneID :5133377 Length = 421 Score = 40.0 bits (92), Expect = 6e-05, Method: Compositional matrix adjust. 
Identities = 52/203 (25%), Positives = 88/203 (43%), Gaps = 24/203 (11%) Query: 23 GGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISESVYTLIKDKIENSEYNGEF--I 80 GGR S KS D I L Y + + R+ N ++ SV+ IK IE + F Sbjct: 34 GGRGSGKSSDISIIITQLIMRYPMNAVVIRKTDNTLATSVFEQIKWAIEEQKVTHLFKVK 93 Query: 81 FTKNSIKHKRTGSEFLFYGIARNLSEIKSTEG----IDILWLEE-AHYLTQEQWEVIEPT 135 + I + G+ +F G A+N +KS + I W+EE A + T+++ I + Sbjct: 94 VSPMEITYIPRGNRIIFRG-AQNPERLKSLKDSRFPFSIAWIEELAEFKTEDEVTTITNS 152 Query: 136 -IRKENSE-----IWIIFNPNEVTDFVYQNFVVK------PPKDAFVKMINWNENPFLSE 183 +R E E + +NP + Q++V K + FV + NPF+S+ Sbjct: 153 LLRGELDEGLFYKFFFSYNPPKRK----QSWVNKKYESSFQADNTFVHHSTYLNNPFISK 208 Query: 184 TMLKVIHEAYERDKDQAEHIYGG 206 ++ A +R++ + Y G Sbjct: 209 QFIQEAESAKKRNEQRYRWEYMG 231 >gi|10354|lcl|protein:vir:97448 Length: 421 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240743;genbank:gi:66396415;genbank:GeneID :5133804 Length = 421 Score = 40.0 bits (92), Expect = 6e-05, Method: Compositional matrix adjust. 
Identities = 53/206 (25%), Positives = 91/206 (44%), Gaps = 24/206 (11%) Query: 20 VIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISESVYTLIKDKIENSEYNGEF 79 V GGR S KS D I L Y + + R+ N ++ SV+ IK IE + + F Sbjct: 31 VAKGGRGSGKSSDISIIITQLIMRYPMNAVVIRKTDNTLATSVFEQIKWAIEEQKVSHLF 90 Query: 80 --IFTKNSIKHKRTGSEFLFYGIARNLSEIKSTEG----IDILWLEE-AHYLTQEQWEVI 132 + I + G+ +F G A+N +KS + I W+EE A + T+++ I Sbjct: 91 KVKVSPMEITYIPRGNRIIFRG-AQNPERLKSLKDSRFPFSIAWIEELAEFKTEDEVTTI 149 Query: 133 EPT-IRKENSE-----IWIIFNPNEVTDFVYQNFVVKPPKDAF------VKMINWNENPF 180 + +R E E + +NP + Q++V K + +F V + NPF Sbjct: 150 TNSLLRGELDEGLFYKFFFSYNPPKRK----QSWVNKKYESSFQADNTYVHHSTYLNNPF 205 Query: 181 LSETMLKVIHEAYERDKDQAEHIYGG 206 +S+ ++ A +R++ + Y G Sbjct: 206 ISKQFIQEAESAKKRNEQRYRWEYMG 231 >gi|3182|lcl|protein:vir:94499 Length: 421 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240671;genbank:gi:66396341;genbank:GeneID :5133763 Length = 421 Score = 40.0 bits (92), Expect = 6e-05, Method: Compositional matrix adjust. 
Identities = 53/206 (25%), Positives = 91/206 (44%), Gaps = 24/206 (11%) Query: 20 VIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISESVYTLIKDKIENSEYNGEF 79 V GGR S KS D I L Y + + R+ N ++ SV+ IK IE + + F Sbjct: 31 VAKGGRGSGKSSDISIIITQLIMRYPMNAVVIRKTDNTLATSVFEQIKWAIEEQKVSHLF 90 Query: 80 --IFTKNSIKHKRTGSEFLFYGIARNLSEIKSTEG----IDILWLEE-AHYLTQEQWEVI 132 + I + G+ +F G A+N +KS + I W+EE A + T+++ I Sbjct: 91 KVKVSPMEITYIPRGNRIIFRG-AQNPERLKSLKDSRFPFSIAWIEELAEFKTEDEVTTI 149 Query: 133 EPT-IRKENSE-----IWIIFNPNEVTDFVYQNFVVKPPKDAF------VKMINWNENPF 180 + +R E E + +NP + Q++V K + +F V + NPF Sbjct: 150 TNSLLRGELDEGLFYKFFFSYNPPKRK----QSWVNKKYESSFQADNTYVHHSTYLNNPF 205 Query: 181 LSETMLKVIHEAYERDKDQAEHIYGG 206 +S+ ++ A +R++ + Y G Sbjct: 206 ISKQFIQEAESAKKRNEQRYRWEYMG 231 >gi|6284|lcl|protein:vir:95890 Length: 421 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240381;genbank:gi:66396048;genbank:GeneID :5133401 Length = 421 Score = 40.0 bits (92), Expect = 6e-05, Method: Compositional matrix adjust. 
Identities = 51/203 (25%), Positives = 89/203 (43%), Gaps = 24/203 (11%) Query: 23 GGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISESVYTLIKDKIENSEYNGEF--I 80 GGR S KS D I L Y + + R+ N ++ SV+ IK IE + + F Sbjct: 34 GGRGSGKSSDISIIITQLIMRYPMNAVVIRKTDNTLATSVFEQIKWAIEEQKVSHLFKVK 93 Query: 81 FTKNSIKHKRTGSEFLFYGIARNLSEIKSTEG----IDILWLEE-AHYLTQEQWEVIEPT 135 + I + G+ +F G A+N +KS + + W+EE A + T+++ I + Sbjct: 94 VSPMEITYIPRGNRIIFRG-AQNPERLKSLKDSRFPFSVAWIEELAEFKTEDEVTTITNS 152 Query: 136 -IRKENSE-----IWIIFNPNEVTDFVYQNFVVK------PPKDAFVKMINWNENPFLSE 183 +R E E + +NP + Q++V K + FV + NPF+S+ Sbjct: 153 LLRGELDEGLFYKFFFSYNPPKRK----QSWVNKKYESSFQADNTFVHHSTYLNNPFISK 208 Query: 184 TMLKVIHEAYERDKDQAEHIYGG 206 ++ A +R++ + Y G Sbjct: 209 QFIQEAESAKKRNEQRYRWEYMG 231 >gi|4983|lcl|protein:vir:95058 Length: 421 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240816;genbank:gi:66394679;genbank:GeneID :5133852 Length = 421 Score = 39.7 bits (91), Expect = 7e-05, Method: Compositional matrix adjust. 
Identities = 53/206 (25%), Positives = 91/206 (44%), Gaps = 24/206 (11%) Query: 20 VIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISESVYTLIKDKIENSEYNGEF 79 V GGR S KS D I L Y + + R+ N ++ SV+ IK IE + + F Sbjct: 31 VAKGGRGSGKSSDISIIITQLIMRYPMNAVVIRKTDNTLATSVFEQIKWAIEEQKVSHLF 90 Query: 80 --IFTKNSIKHKRTGSEFLFYGIARNLSEIKSTEG----IDILWLEE-AHYLTQEQWEVI 132 + I + G+ +F G A+N +KS + I W+EE A + T+++ I Sbjct: 91 KVKVSPMEITYIPRGNRIIFRG-AQNPERLKSLKDSRFPFSISWIEELAEFKTEDEVTTI 149 Query: 133 EPT-IRKENSE-----IWIIFNPNEVTDFVYQNFVVKPPKDAF------VKMINWNENPF 180 + +R E E + +NP + Q++V K + +F V + NPF Sbjct: 150 TNSLLRGELDEGLFYKFFFSYNPPKRK----QSWVNKKYESSFQADNTYVHHSTYLNNPF 205 Query: 181 LSETMLKVIHEAYERDKDQAEHIYGG 206 +S+ ++ A +R++ + Y G Sbjct: 206 ISKQFIQEAESAKKRNEQRYRWEYMG 231 >gi|7076|lcl|protein:vir:96147 Length: 419 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240074;genbank:gi:66395738;genbank:GeneID :5133130 Length = 419 Score = 38.5 bits (88), Expect = 2e-04, Method: Compositional matrix adjust. 
Identities = 54/209 (25%), Positives = 89/209 (42%), Gaps = 22/209 (10%) Query: 20 VIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISESVYTLIKDKIENSEYNGEF 79 V GGR S KS D I V L Y + L R+ N ++ SV+ IK I + F Sbjct: 29 VAKGGRGSGKSSDIAIIIVLLIMRYPVNALILRKIDNTLALSVFEQIKWAINVMGVSHLF 88 Query: 80 IF--TKNSIKHKRTGSEFLFYGIARNLSEIKSTEGID----ILWLEE-AHYLTQEQWEVI 132 + I + G++ +F G A+N IKS + I W+EE A + T+++ I Sbjct: 89 KIKVSPMEITYVPRGNKMVFRG-AQNPERIKSLKDAQFPYAIAWIEELAEFKTEDEVTTI 147 Query: 133 EPTIRK---ENSEIWIIF----NPNEVTDFVYQNFVVK-PPKDAFVKMINWNENPFLSET 184 ++ + +N + F P +V + + P + FV + NPF+++ Sbjct: 148 TNSLLRGELDNGLFYKFFYTYNPPKRKQSWVNKKYESSFQPDNTFVHHSTYLNNPFIAKE 207 Query: 185 ML------KVIHEAYERDKDQAEHIYGGI 207 + K I+E R + E I G+ Sbjct: 208 FIEEAKAAKAINELRYRWEYLGEAIGSGV 236 >gi|5321|lcl|protein:vir:99534 Length: 422 # NCBI annotation: putative protein # Family: family:all:54 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958532;genbank:gi:41179314;genbank:GeneID :2717172 Length = 422 Score = 37.7 bits (86), Expect = 3e-04, Method: Compositional matrix adjust. 
Identities = 46/187 (24%), Positives = 83/187 (44%), Gaps = 18/187 (9%) Query: 19 KVIYGGRASSKSHDAGGIAVYLAA---NYRLKFLCARQFQNRISESVYTLIKDKIENSEY 75 +V YGG +S KSH V + N K L R+ + S++T + + + S + Sbjct: 31 EVWYGGASSGKSHGVVQKVVLKSLQHWNVPRKVLWLRKVDRTVKNSIFTDVTECL--SGW 88 Query: 76 N-GEFIFTKNSIKH--KRTGSEFLFYGIARNLSEIKSTEGIDILWLEEAHYLTQEQWEVI 132 N ++ S K G+ FLF G+ + +IKS +G+ + +EEA + + Sbjct: 89 NILQYCHVNRSDKTIVLPNGAIFLFQGMD-DPEKIKSIKGLSDVVMEEASEFNHNDYTQL 147 Query: 133 EPTIRK---ENSEIWIIFNPNEVTDFVYQNFVVKPPKD-----AFVKMINWNENPFLSET 184 +R+ + +I+ +FNP ++ YQ + P D + + +N FL E Sbjct: 148 TLRLREPKHKQRQIFCMFNPVSKLNWTYQTW-FDPSADYDRSRVAIHQSTYKDNRFLDED 206 Query: 185 MLKVIHE 191 ++ I E Sbjct: 207 NIRTIEE 213 >gi|11736|lcl|protein:vir:79016 Length: 417 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110720;genbank:gi:134287337;genbank:Ge neID:4955190 Length = 417 Score = 37.0 bits (84), Expect = 4e-04, Method: Compositional matrix adjust. 
Identities = 52/229 (22%), Positives = 96/229 (41%), Gaps = 35/229 (15%) Query: 4 LNPALRAVWRTRARYKVIYGGRASSKSHDAGGIAVYLAANYRLKF----------LCARQ 53 NP + T+ RY+ + G S KS V +A +Y LK L R+ Sbjct: 11 FNPDFKEANFTKKRYRAMKGSAGSGKS-------VNVAQDYILKLGDKKYQGANLLVVRK 63 Query: 54 FQNRISESVYTLIK---DKIENSEYNGEFIFTKN--SIKHKRTGSEFLFYGI--ARNLSE 106 + S Y + ++I + + + T N IK K TG+ +F G+ A+ + Sbjct: 64 SEATHKYSTYAELTGAINRIYGKQADKYWKTTLNPLEIKSKVTGNSIIFRGVNDAKQREK 123 Query: 107 IKS---TEG-IDILWLEEAHYLTQEQWEVIEPTIRK--ENSEIW----IIFNPNEVTDFV 156 +KS ++G + +W EEA L + ++++ +R N ++ FNP T ++ Sbjct: 124 LKSINFSKGKLTWVWCEEATELMESDIDILDDRLRGILTNPNLYYQMTFTFNPVSATHWI 183 Query: 157 YQNFVVKPPKDAFVKMINWNENPFLSETMLKVIHEAYERDKDQAEHIYG 205 + + D F + +N F+ E + + E+D + +YG Sbjct: 184 KRKYFDYKNDDIFTHHSTYLQNRFIDEAYYRRMQMRKEQDP-EGYKVYG 231 >gi|8730|lcl|protein:vir:96840 Length: 421 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240151;genbank:gi:66395816;genbank:GeneID :5133181 Length = 421 Score = 33.5 bits (75), Expect = 0.005, Method: Compositional matrix adjust. 
Identities = 47/194 (24%), Positives = 85/194 (43%), Gaps = 24/194 (12%) Query: 32 DAGGIAVYLAANYRLKFLCARQFQNRISESVYTLIKDKIENSEYNGEF--IFTKNSIKHK 89 D I L Y + + R+ N ++ SV+ IK IE + + F + I + Sbjct: 43 DISIIITQLIMRYPMNAVVVRKTDNTLATSVFEQIKWAIEQQKVSHLFKVKVSPMEITYM 102 Query: 90 RTGSEFLFYGIARNLSEIKSTEG----IDILWLEE-AHYLTQEQWEVIEPT-IRKENSE- 142 G+ +F G A+N +KS + I+W+EE A + T+++ I + +R E E Sbjct: 103 PRGNRIIFRG-AQNPERLKSLKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDEG 161 Query: 143 ----IWIIFNPNEVTDFVYQNFVVK------PPKDAFVKMINWNENPFLSETMLKVIHEA 192 + +NP + Q++V K P + FV + +NPF+++ + A Sbjct: 162 LFYKFFFSYNPPKRK----QSWVNKKYESSFQPDNTFVHHSTYLDNPFIAKQFIDEAEAA 217 Query: 193 YERDKDQAEHIYGG 206 ER++ + Y G Sbjct: 218 KERNELRYRWEYLG 231 >gi|18891|lcl|protein:vir:8847 Length: 482 # NCBI annotation: terminase # Family: family:all:1730 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775255;genbank:gi:27476053;genbank:GeneID :2700601 Length = 482 Score = 30.4 bits (67), Expect = 0.042, Method: Compositional matrix adjust. 
Identities = 52/237 (21%), Positives = 87/237 (36%), Gaps = 46/237 (19%) Query: 113 IDILWLEEAHYLTQEQWEVIEPTIRKENSEIWIIFNPNEVTDFVYQNFVVKPPKDAFVKM 172 ID++WL+E ++ + +++ F P + ++F+ F+ Sbjct: 169 IDVIWLDEE--CPKDIYTQCVTRTATTGGIVYLTFTPEHGLTEIVKDFLQDLKPGQFLIH 226 Query: 173 INWNENPFLS----ETMLKVIHEAYERDKDQAEHIYGGIPKTGGDK--SVINLKFILAAI 226 +W + P LS E +L V A R + + GIP G ++ KF+ Sbjct: 227 ASWEDAPHLSPEVKEQLLSVYSPAERRMRAE------GIPMLGSGVVFPILEEKFVCEPF 280 Query: 227 DAHKKLGWEPAGSKRIGFDVADDGEDANATTLMHGNVIMEVDEWDG------LEDELLKS 280 D + IG D+ D H N I V WD L DE +S Sbjct: 281 DIPDHF------HRIIGIDLGFD----------HPNAIACV-AWDAEKDKYYLYDERSES 323 Query: 281 SSRVYNLAK---MKGAS-----VTYDSIGV-GAHVGSKFAELNDSSPDFKLTYDPFN 328 + A +KG V +D+ GA G +F +L + + Y+PF+ Sbjct: 324 GETLGMHADAIYLKGGHQIPVVVPHDAFKHDGATSGRRFVDLLKDDHNLNVVYEPFS 380 >gi|25110|lcl|protein:vir:80755 Length: 501 # NCBI annotation: putative large terminase # Family: family:all:1430 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504114;genbank:gi:158079301;genbank:Ge neID:5666404 Length = 501 Score = 27.3 bits (59), Expect = 0.36, Method: Compositional matrix adjust. Identities = 18/67 (26%), Positives = 36/67 (53%), Gaps = 3/67 (4%) Query: 57 RISESVYTLIKDKIENSEYNGEFIFTKNSIKHKRTGSEFLFYGIARNLSEIKSTEGIDIL 116 ++++ V T + ++N Y+ NS+K K+ + FL++ R+ S+ + EG+DI Sbjct: 11 QMTKFVQTRLDPVLQNGYYSTIVDQEVNSLKAKKIRNSFLYF---RSSSKPGAVEGVDID 67 Query: 117 WLEEAHY 123 +L Y Sbjct: 68 YLSMDEY 74 >gi|14676|lcl|protein:vir:2499 Length: 474 # NCBI annotation: putative terminase gp4 # Family: family:all:523 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569740;genbank:gi:18496890;genbank:GeneID :932339 Length = 474 Score = 26.6 bits (57), Expect = 0.58, Method: Compositional matrix adjust. 
Identities = 19/66 (28%), Positives = 29/66 (43%), Gaps = 3/66 (4%) Query: 90 RTGSEFLFYGIARNLSEIKSTEGIDILWLEEAHYLTQEQWEVIEP-TIRKENSEIWIIFN 148 + GS LF R + G+D+L +EA LT+ + + P T N I + Sbjct: 140 KNGSRILFGARERGFG--RGFAGVDVLIFDEAQILTENAMDDMVPATNAAPNPLILLAGT 197 Query: 149 PNEVTD 154 P + TD Sbjct: 198 PPKPTD 203 >gi|3478|lcl|protein:vir:101370 Length: 494 # NCBI annotation: PacB # Family: family:all:144 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006576;genbank:gi:46401730;genbank:GeneID :2777432 Length = 494 Score = 26.6 bits (57), Expect = 0.64, Method: Compositional matrix adjust. Identities = 16/60 (26%), Positives = 31/60 (51%) Query: 389 SINSETIHPDKLNQLCIELSSPRKDLDMNGRFKVESKKDMREKRKIKSPNIADSVIMSAI 448 ++ S + DK E S ++ G++KV SK+DM++K + SP+ D+ + + Sbjct: 408 AVKSGRMRLDKGAATIEEASKIPVGINSAGQWKVMSKEDMKKKLNLHSPDHWDTYCFAML 467 >gi|7504|lcl|protein:vir:99915 Length: 478 # NCBI annotation: gp2 # Family: family:all:523 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655519;genbank:gi:109392289;genbank:GeneI D:4157084 Length = 478 Score = 25.8 bits (55), Expect = 1.2, Method: Compositional matrix adjust. Identities = 16/44 (36%), Positives = 23/44 (52%), Gaps = 1/44 (2%) Query: 112 GIDILWLEEAHYLTQEQWEVIEPTIRK-ENSEIWIIFNPNEVTD 154 G+ IL L+EA LT + + + PT+ EN I + P TD Sbjct: 158 GVGILVLDEAQRLTDKAMDDLIPTMNTVENPLILLTGTPPRPTD 201 >gi|14207|lcl|protein:vir:8315 Length: 503 # NCBI annotation: gp32 # Family: family:all:523 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817883;genbank:gi:29566316;genbank:GeneID :1259511 Length = 503 Score = 25.0 bits (53), Expect = 1.7, Method: Compositional matrix adjust. 
Identities = 18/64 (28%), Positives = 28/64 (43%), Gaps = 3/64 (4%) Query: 92 GSEFLFYGIARNLSEIKSTEGIDILWLEEAHYLTQEQWEVIEPTIRKENSEIWI-IFNPN 150 GS LF R + G+D+L +EA LTQ + + T+ + I + P Sbjct: 177 GSRILFGARERGFG--RGIPGVDVLMSDEAQILTQRAMQDMLATLNTSRLGLHIYVGTPP 234 Query: 151 EVTD 154 + TD Sbjct: 235 KPTD 238 >gi|5525|lcl|protein:vir:95311 Length: 491 # NCBI annotation: putative terminase large subunit # Family: family:all:144 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512256;genbank:gi:89152423;genbank:GeneID :3952979 Length = 491 Score = 25.0 bits (53), Expect = 2.0, Method: Compositional matrix adjust. Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 2/48 (4%) Query: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWIIF-NPNEVTDFVYQNF 160 I+ +EA + WEV E + E++E IW+ F NP T + F Sbjct: 178 IVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECF 225 >gi|12972|lcl|protein:vir:80678 Length: 503 # NCBI annotation: gp2 # Family: family:all:523 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285578;genbank:gi:148727084;genbank:Ge neID:5247049 Length = 503 Score = 25.0 bits (53), Expect = 2.0, Method: Compositional matrix adjust. Identities = 14/46 (30%), Positives = 23/46 (50%), Gaps = 4/46 (8%) Query: 113 IDILWLEEAHYLTQEQWEVIEPTIRKENS----EIWIIFNPNEVTD 154 +D L +EA L+ EQ E + PT+ S +I++ P + D Sbjct: 174 VDDLVCDEAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLAD 219 >gi|16210|lcl|protein:vir:7319 Length: 491 # NCBI annotation: terminase large subunit # Family: family:all:144 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848210;genbank:gi:30387381;genbank:GeneID :2641874 Length = 491 Score = 25.0 bits (53), Expect = 2.2, Method: Compositional matrix adjust. 
Identities = 16/48 (33%), Positives = 24/48 (50%), Gaps = 2/48 (4%) Query: 115 ILWLEEAHYLTQEQWEVIEPTIRKENSE-IWIIF-NPNEVTDFVYQNF 160 I+ +EA + WEV E + E++E IW+ F NP T + F Sbjct: 178 IVVFDEASNIADLVWEVAEGALTDEDTEIIWVAFGNPTRNTGRFRECF 225 >gi|4240|lcl|protein:vir:94734 Length: 469 # NCBI annotation: large subunit terminase # Family: family:all:523 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996704;genbank:gi:45597419;genbank:GeneID :2767958 Length = 469 Score = 24.6 bits (52), Expect = 2.4, Method: Compositional matrix adjust. Identities = 12/36 (33%), Positives = 22/36 (61%) Query: 111 EGIDILWLEEAHYLTQEQWEVIEPTIRKENSEIWII 146 EG DIL+++EA T EQ ++ T+ ++ + I+ Sbjct: 157 EGFDILFIDEAQEYTTEQESALKYTVTDSDNPMTIM 192 >gi|18494|lcl|protein:vir:1636 Length: 469 # NCBI annotation: putative terminase # Family: family:all:523 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695057;genbank:gi:23455748;genbank:GeneID :955481 Length = 469 Score = 23.1 bits (48), Expect = 6.5, Method: Compositional matrix adjust. Identities = 12/36 (33%), Positives = 21/36 (58%) Query: 111 EGIDILWLEEAHYLTQEQWEVIEPTIRKENSEIWII 146 EG DIL ++EA T EQ ++ T+ ++ + I+ Sbjct: 157 EGFDILVIDEAQEYTTEQESALKYTVTDSDNPMTIM 192 >gi|19181|lcl|protein:vir:9572 Length: 459 # NCBI annotation: gp38 # Family: family:all:523 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862877;genbank:gi:32469469;genbank:GeneID :1461314 Length = 459 Score = 22.7 bits (47), Expect = 8.5, Method: Compositional matrix adjust. 
 Identities = 12/36 (33%), Positives = 20/36 (55%)

Query: 111 EGIDILWLEEAHYLTQEQWEVIEPTIRKENSEIWII 146
           EG D+L ++EA   T EQ   ++ T+    + I I+
Sbjct: 158 EGFDMLIIDEAQEYTTEQESALKYTVTDSENPITIM 193

  Database: capsid_neck_tail
    Posted date:  Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database:  514

Lambda     K      H
   0.317    0.135    0.399

Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 217,942
Number of Sequences: 514
Number of extensions: 10536
Number of successful extensions: 114
Number of sequences better than 100.0: 54
Number of HSP's better than 100.0 without gapping: 41
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 26
Number of HSP's gapped (non-prelim): 56
length of query: 460
length of database: 206,069
effective HSP length: 75
effective length of query: 385
effective length of database: 167,519
effective search space: 64494815
effective search space used: 64494815
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 39 (19.6 bits)
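
The figures in the statistics footer can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST report; it is a minimal illustration that assumes the second "Lambda K H" row holds the gapped parameters and that the standard Karlin-Altschul conversions apply (bit score S' = (lambda*S - ln K) / ln 2, expect E = search space * 2^-S'). The function names are hypothetical. Because the report uses compositional matrix adjustment, the E-values it prints need not match this simple formula exactly.

```python
import math

# Values copied from the statistics footer of the report.
QUERY_LEN = 460
DB_LETTERS = 206_069
DB_SEQS = 514
EFF_HSP_LEN = 75

# Assumption: the second "Lambda K H" row is the gapped parameter set.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # The length correction is applied once per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits (Karlin-Altschul)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Approximate E-value, ignoring compositional adjustment."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


eff_q, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
)
print("effective length of query:   ", eff_q)
print("effective length of database:", eff_db)
print("effective search space:      ", space)

# Raw scores taken from the parenthesised values on the "Score =" lines.
for raw in (113, 105, 92, 47):
    print(raw, round(bit_score(raw), 1), f"{expect_value(raw, space):.1e}")
```

The effective lengths and the search space come out as whole numbers that can be compared directly with the footer; the bit scores can be compared with the "Score =" lines, while the E-values should be read as approximations only.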