BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|Aclame:protein:vir:99572|NCBI_annot:TerL-like protein|genbank:acc:YP_001039811;genbank:gi:126011061;genbank:GeneID:4818267
         (459 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gi|4914|lcl|protein:vir:99572 Length: 459 # NCBI annotation: Ter...   951   0.0
gi|6775|lcl|protein:vir:96067 Length: 460 # NCBI annotation: con...   653   0.0
gi|5142|lcl|protein:vir:105417 Length: 470 # NCBI annotation: ge...   280   2e-77
gi|17761|lcl|protein:vir:171 Length: 470 # NCBI annotation: term...   279   6e-77
gi|14466|lcl|protein:vir:3519 Length: 469 # NCBI annotation: P18...   190   4e-50
gi|2819|lcl|protein:vir:105705 Length: 439 # NCBI annotation: gp...   111   2e-26
gi|8380|lcl|protein:vir:96782 Length: 408 # NCBI annotation: put...   108   2e-25
gi|3859|lcl|protein:vir:107748 Length: 532 # NCBI annotation: gp...    92   1e-20
gi|15353|lcl|protein:vir:9921 Length: 425 # NCBI annotation: put...    67   3e-13
gi|5794|lcl|protein:vir:98922 Length: 453 # NCBI annotation: ter...    60   8e-11
gi|19292|lcl|protein:vir:4781 Length: 436 # NCBI annotation: put...    57   6e-10
gi|7076|lcl|protein:vir:96147 Length: 419 # NCBI annotation: ORF...    54   3e-09
gi|18074|lcl|protein:vir:5954 Length: 422 # NCBI annotation: hyp...    52   1e-08
gi|9289|lcl|protein:vir:97165 Length: 431 # NCBI annotation: ORF...    50   4e-08
gi|13020|lcl|protein:vir:80960 Length: 443 # NCBI annotation: Te...    50   5e-08
gi|15836|lcl|protein:vir:37 Length: 443 # NCBI annotation: putat...    49   2e-07
gi|6093|lcl|protein:vir:95761 Length: 432 # NCBI annotation: ter...    49   2e-07
gi|5208|lcl|protein:vir:106641 Length: 446 # NCBI annotation: OR...    47   5e-07
gi|10483|lcl|protein:vir:105297 Length: 446 # NCBI annotation: p...    47   6e-07
gi|5891|lcl|protein:vir:107098 Length: 421 # NCBI annotation: pu...    47   7e-07
gi|655|lcl|protein:vir:1588 Length: 447 # NCBI annotation: putat...    46   8e-07
gi|7569|lcl|protein:vir:96300 Length: 421 # NCBI annotation: ORF...    45   2e-06
gi|2779|lcl|protein:vir:99776 Length: 425 # NCBI annotation: lar...    45   2e-06
gi|9829|lcl|protein:vir:103950 Length: 425 # NCBI annotation: ph...    44   3e-06
gi|17691|lcl|protein:vir:9305 Length: 447 # NCBI annotation: lar...    44   3e-06
gi|7643|lcl|protein:vir:96370 Length: 447 # NCBI annotation: ORF...    44   4e-06
gi|7301|lcl|protein:vir:96238 Length: 447 # NCBI annotation: ORF...    44   4e-06
gi|11588|lcl|protein:vir:78831 Length: 447 # NCBI annotation: te...    44   4e-06
gi|10354|lcl|protein:vir:97448 Length: 421 # NCBI annotation: OR...    44   4e-06
gi|3182|lcl|protein:vir:94499 Length: 421 # NCBI annotation: ORF...    44   4e-06
gi|4983|lcl|protein:vir:95058 Length: 421 # NCBI annotation: ORF...    44   6e-06
gi|6284|lcl|protein:vir:95890 Length: 421 # NCBI annotation: ORF...    43   7e-06
gi|11736|lcl|protein:vir:79016 Length: 417 # NCBI annotation: pu...    42   1e-05
gi|8730|lcl|protein:vir:96840 Length: 421 # NCBI annotation: ORF...    35   0.002
gi|18891|lcl|protein:vir:8847 Length: 482 # NCBI annotation: ter...    33   0.006
gi|3478|lcl|protein:vir:101370 Length: 494 # NCBI annotation: Pa...    32   0.022
gi|14676|lcl|protein:vir:2499 Length: 474 # NCBI annotation: put...    29   0.11
gi|2931|lcl|protein:vir:105460 Length: 409 # NCBI annotation: pu...    29   0.13
gi|7504|lcl|protein:vir:99915 Length: 478 # NCBI annotation: gp2...    28   0.25
gi|5525|lcl|protein:vir:95311 Length: 491 # NCBI annotation: put...    27   0.43
gi|16210|lcl|protein:vir:7319 Length: 491 # NCBI annotation: ter...    27   0.48
gi|12972|lcl|protein:vir:80678 Length: 503 # NCBI annotation: gp...    27   0.72
gi|4240|lcl|protein:vir:94734 Length: 469 # NCBI annotation: lar...    26   0.92
gi|14207|lcl|protein:vir:8315 Length: 503 # NCBI annotation: gp3...    25   1.7
gi|18494|lcl|protein:vir:1636 Length: 469 # NCBI annotation: put...    25   2.6
gi|12378|lcl|protein:vir:79648 Length: 523 # NCBI annotation: Te...    24   3.1
gi|13482|lcl|protein:vir:9757 Length: 471 # NCBI annotation: put...    24   3.7
gi|19181|lcl|protein:vir:9572 Length: 459 # NCBI annotation: gp3...    24   4.8

>gi|4914|lcl|protein:vir:99572 Length: 459 # NCBI annotation: TerL-like protein # Family: family:all:54 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039811;genbank:gi:126011061;genbank:GeneID:4818267
          Length = 459

 Score = 951 bits (2458), Expect = 0.0, Method: Compositional matrix adjust.
 Identities = 459/459 (100%), Positives = 459/459 (100%)

Query: 1   MYDLNPALRDFWLDKARYKALYGGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISE 60
           MYDLNPALRDFWLDKARYKALYGGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISE
Sbjct: 1   MYDLNPALRDFWLDKARYKALYGGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISE 60

Query: 61  SVYTLIKGKIDAAGWTKEFDVTISSIRHKKTGAEFLFYGIARNLNEIKSTEGVDILWLEE 120
           SVYTLIKGKIDAAGWTKEFDVTISSIRHKKTGAEFLFYGIARNLNEIKSTEGVDILWLEE
Sbjct: 61  SVYTLIKGKIDAAGWTKEFDVTISSIRHKKTGAEFLFYGIARNLNEIKSTEGVDILWLEE 120

Query: 121 AQYLTEEQWNVINPTIRREGSQIWLIWNPDQYTDFIYQNFVVNPPADCLSKQINWTENPF 180
           AQYLTEEQWNVINPTIRREGSQIWLIWNPDQYTDFIYQNFVVNPPADCLSKQINWTENPF
Sbjct: 121 AQYLTEEQWNVINPTIRREGSQIWLIWNPDQYTDFIYQNFVVNPPADCLSKQINWTENPF 180

Query: 181 LSDTMLKVIYDEYQRDPKLAEHVYGGAPKMGGDKAIIQLQYVLAAIDAHKKLGWKIEGSK 240
           LSDTMLKVIYDEYQRDPKLAEHVYGGAPKMGGDKAIIQLQYVLAAIDAHKKLGWKIEGSK
Sbjct: 181 LSDTMLKVIYDEYQRDPKLAEHVYGGAPKMGGDKAIIQLQYVLAAIDAHKKLGWKIEGSK 240

Query: 241 RTGFDIADDGDDANAIVDAIGNVVVWAEEWDGLEDELLKSSTKVFNHALEKGSSIIFDSI 300
           RTGFDIADDGDDANAIVDAIGNVVVWAEEWDGLEDELLKSSTKVFNHALEKGSSIIFDSI
Sbjct: 241
RTGFDIADDGDDANAIVDAIGNVVVWAEEWDGLEDELLKSSTKVFNHALEKGSSIIFDSI 300 Query: 301 GVGAHAGSKFSELNEARSLEIIYEPFNAGGAVYDPDGTYMKLPHVVITNREHFSNVKAQM 360 GVGAHAGSKFSELNEARSLEIIYEPFNAGGAVYDPDGTYMKLPHVVITNREHFSNVKAQM Sbjct: 301 GVGAHAGSKFSELNEARSLEIIYEPFNAGGAVYDPDGTYMKLPHVVITNREHFSNVKAQM 360 Query: 361 WDRVATRFRKTYEVVTYGANHPHDELISISSEHVPAKILDKLKIELASPHKDVDGMGKFK 420 WDRVATRFRKTYEVVTYGANHPHDELISISSEHVPAKILDKLKIELASPHKDVDGMGKFK Sbjct: 361 WDRVATRFRKTYEVVTYGANHPHDELISISSEHVPAKILDKLKIELASPHKDVDGMGKFK 420 Query: 421 VESKKDMREKRGIKSPNIADAFIMAMIQPKRQPAGFFDF 459 VESKKDMREKRGIKSPNIADAFIMAMIQPKRQPAGFFDF Sbjct: 421 VESKKDMREKRGIKSPNIADAFIMAMIQPKRQPAGFFDF 459 >gi|6775|lcl|protein:vir:96067 Length: 460 # NCBI annotation: conserved hypothetical protein ORF004 # Family: family:all:54 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294421;genbank:gi:149408318;genbank:Ge neID:5237186 Length = 460 Score = 653 bits (1684), Expect = 0.0, Method: Compositional matrix adjust. Identities = 309/460 (67%), Positives = 374/460 (81%), Gaps = 1/460 (0%) Query: 1 MYDLNPALRDFWLDKARYKALYGGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISE 60 MY LNPALR W +ARYK +YGGRASSKSHDA G AVYLA NY +KFLCARQFQN+ISE Sbjct: 1 MYKLNPALRAVWRTRARYKVIYGGRASSKSHDAGGIAVYLAANYRLKFLCARQFQNRISE 60 Query: 61 SVYTLIKGKIDAAGWTKEFDVTISSIRHKKTGAEFLFYGIARNLNEIKSTEGVDILWLEE 120 SVYTLIK KI+ + + EF T +SI+HK+TG+EFLFYGIARNL+EIKSTEG+DILWLEE Sbjct: 61 SVYTLIKDKIENSEYNGEFIFTKNSIKHKRTGSEFLFYGIARNLSEIKSTEGIDILWLEE 120 Query: 121 AQYLTEEQWNVINPTIRREGSQIWLIWNPDQYTDFIYQNFVVNPPADCLSKQINWTENPF 180 A YLT+EQW VI PTIR+E S+IW+I+NP++ TDF+YQNFVV PP D K INW ENPF Sbjct: 121 AHYLTQEQWEVIEPTIRKENSEIWIIFNPNEVTDFVYQNFVVKPPKDAFVKMINWNENPF 180 Query: 181 LSDTMLKVIYDEYQRDPKLAEHVYGGAPKMGGDKAIIQLQYVLAAIDAHKKLGWKIEGSK 240 LS+TMLKVI++ Y+RD AEH+YGG PK GGDK++I L+++LAAIDAHKKLGW+ GSK Sbjct: 181 LSETMLKVIHEAYERDKDQAEHIYGGIPKTGGDKSVINLKFILAAIDAHKKLGWEPAGSK 240 Query: 241 RTGFDIADDGDDANAIVDAIGNVVVWAEEWDGLEDELLKSSTKVFNHALEKGSSIIFDSI 300 R GFD+ADDG+DANA GNV++ +EWDGLEDELLKSS++V+N A 
KG+S+ +DSI Sbjct: 241 RIGFDVADDGEDANATTLMHGNVIMEVDEWDGLEDELLKSSSRVYNLAKMKGASVTYDSI 300 Query: 301 GVGAHAGSKFSELNEAR-SLEIIYEPFNAGGAVYDPDGTYMKLPHVVITNREHFSNVKAQ 359 GVGAH GSKF+ELN++ ++ Y+PFNAGGAV PD YMKLPH I N++HFSN+KAQ Sbjct: 301 GVGAHVGSKFAELNDSSPDFKLTYDPFNAGGAVDKPDDIYMKLPHTTIKNKDHFSNIKAQ 360 Query: 360 MWDRVATRFRKTYEVVTYGANHPHDELISISSEHVPAKILDKLKIELASPHKDVDGMGKF 419 W+ VATRFRKTYE V +G +P DELISI+SE + L++L IEL+SP KD+D G+F Sbjct: 361 KWEEVATRFRKTYEAVVHGKVYPFDELISINSETIHPDKLNQLCIELSSPRKDLDMNGRF 420 Query: 420 KVESKKDMREKRGIKSPNIADAFIMAMIQPKRQPAGFFDF 459 KVESKKDMREKR IKSPNIAD+ IM+ I P R+P GFFDF Sbjct: 421 KVESKKDMREKRKIKSPNIADSVIMSAILPIRKPKGFFDF 460 >gi|5142|lcl|protein:vir:105417 Length: 470 # NCBI annotation: gene 2 protein # Family: family:all:54 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958178;genbank:gi:41057280;genbank:GeneID :2716664 Length = 470 Score = 280 bits (716), Expect = 2e-77, Method: Compositional matrix adjust. Identities = 167/458 (36%), Positives = 250/458 (54%), Gaps = 14/458 (3%) Query: 1 MYDLNPALRDFWLDKARYKALYGGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISE 60 M +NP F ++ RYK GGR S KS A V AR V+ LCAR+ QN IS+ Sbjct: 1 MTSINPIFEPF-IEAHRYKVAKGGRGSGKSWAIARLLVEAARRQPVRILCARELQNSISD 59 Query: 61 SVYTLIKGKIDAAGWTKEFDVTISSIRHKKTGAEFLFYGIARNLNEIKSTEGVDILWLEE 120 SV L++ I+ G++ EF++ S IRH T AEF+FYGI N +IKS EG+DI W+EE Sbjct: 60 SVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGIDICWVEE 119 Query: 121 AQYLTEEQWNVINPTIRREGSQIWLIWNPDQYTDFIYQNFVVNPPADCLSKQINWTENPF 180 A+ +T+E W+++ PTIR+ S+IW+ +NP D YQ FVVNPP D +N+T+NP Sbjct: 120 AEAVTKESWDILIPTIRKPFSEIWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPH 179 Query: 181 LSDTMLKVIYDEYQRDPKLAEHVYGGAPKMGGDKAIIQLQYVLAAIDAHKKLGWKIEGSK 240 + + + + +R+P L H++ G P D AII+ +++ AA DAHKKLGWK +G+ Sbjct: 180 FPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAV 239 Query: 241 RTGFDIADDGDDANAIVDAIGNVVVWAEEWDGLEDELLKSSTKVFNHALEKGSS-IIFDS 299 + D +D G DA G+VV E GL ++ + + + A+E G+ ++D Sbjct: 240 
VSAHDPSDTGPDAKGYASRHGSVVKRIAE--GLLMDINEGADWATSLAIEDGADHYLWDG 297 Query: 300 IGVGAHAGSKFSELNEARSLEIIYEPFNAGGAVYDPDGTYMKLPHV--------VITNRE 351 GVGA + +E + +I F + +D D Y V T + Sbjct: 298 DGVGAGLRRQTTEAFSGK--KITATMFKGSESPFDEDAPYQAGAWADEVVQGDNVRTIGD 355 Query: 352 HFSNVKAQMWDRVATRFRKTYEVVTYGANHPHDELISISSEHVPAKILDKLKIELASPHK 411 F N +AQ + +A R TY V +G D+++S E + K+L+KL EL + Sbjct: 356 VFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGEKMLEKLFAELTQIQR 415 Query: 412 DVDGMGKFKVESKKDMREKRGIKSPNIADAFIMAMIQP 449 + GK ++ +K +M++K GI SPN+ADA +M M P Sbjct: 416 KFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCP 453 >gi|17761|lcl|protein:vir:171 Length: 470 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112076;genbank:gi:13559866;genbank:GeneID :921009 Length = 470 Score = 279 bits (713), Expect = 6e-77, Method: Compositional matrix adjust. Identities = 167/458 (36%), Positives = 249/458 (54%), Gaps = 14/458 (3%) Query: 1 MYDLNPALRDFWLDKARYKALYGGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISE 60 M +NP F ++ RYK GGR S KS A V AR V+ LCAR+ QN IS+ Sbjct: 1 MTSINPIFEPF-IEAHRYKVAKGGRGSGKSWAIARLLVEAARRQPVRILCARELQNSISD 59 Query: 61 SVYTLIKGKIDAAGWTKEFDVTISSIRHKKTGAEFLFYGIARNLNEIKSTEGVDILWLEE 120 SV L++ I+ G++ EF++ S IRH T AEF+FYGI N +IKS EG+DI W+EE Sbjct: 60 SVIRLLEDTIEREGYSAEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGIDICWVEE 119 Query: 121 AQYLTEEQWNVINPTIRREGSQIWLIWNPDQYTDFIYQNFVVNPPADCLSKQINWTENPF 180 A+ +T+E W+++ PTIR+ S+IW+ +NP D YQ FVVNPP D +N+T+NP Sbjct: 120 AEAVTKESWDILIPTIRKPFSEIWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPH 179 Query: 181 LSDTMLKVIYDEYQRDPKLAEHVYGGAPKMGGDKAIIQLQYVLAAIDAHKKLGWKIEGSK 240 + + + + +R+P L H++ G P D AII+ +++ AA DAHKKLGWK +G+ Sbjct: 180 FPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAV 239 Query: 241 RTGFDIADDGDDANAIVDAIGNVVVWAEEWDGLEDELLKSSTKVFNHALEKGSS-IIFDS 299 + D +D G DA G+VV E GL ++ + + + A+E G+ ++D Sbjct: 240 VSAHDPSDTGPDAKGYASRHGSVVKRIAE--GLLMDINEGADWATSLAIEDGADHYLWDG 297 Query: 300 
IGVGAHAGSKFSELNEARSLEIIYEPFNAGGAVYDPDGTYMKLPHV--------VITNRE 351 GVGA + +E + +I F + +D D Y V T + Sbjct: 298 DGVGAGLRRQTTEAFSGK--KITATMFKGSESPFDEDAPYQAGAWADEVVQGDNVRTIGD 355 Query: 352 HFSNVKAQMWDRVATRFRKTYEVVTYGANHPHDELISISSEHVPAKILDKLKIELASPHK 411 F N +AQ + +A R TY V +G D+++S E + IL+KL EL + Sbjct: 356 VFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEAIGENILEKLFAELTQIQR 415 Query: 412 DVDGMGKFKVESKKDMREKRGIKSPNIADAFIMAMIQP 449 + GK ++ +K +M++K GI SPN+ADA +M M P Sbjct: 416 KFNNNGKLELMTKVEMKQKLGIPSPNLADALMMCMHCP 453 >gi|14466|lcl|protein:vir:3519 Length: 469 # NCBI annotation: P18 # Family: family:all:54 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050979;genbank:gi:9633565;genbank:GeneID: 1262312 Length = 469 Score = 190 bits (482), Expect = 4e-50, Method: Compositional matrix adjust. Identities = 142/458 (31%), Positives = 226/458 (49%), Gaps = 33/458 (7%) Query: 17 RYKALYGGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISESVYTLIKGKIDAAGWT 76 R K +GGR K+ A A+ A + +FLC R+F N I +S + +++ +++ G Sbjct: 6 RIKVYFGGRGGMKTVSFAKIALITASMHKRRFLCLREFMNSIEDSGHAVLQAEVETLGLQ 65 Query: 77 KEFDVTISSIRHKKTGAEFLFYGIARNLNEIKSTEGVDILWLEEAQYLTEEQWNVINPTI 136 F + + I F + +ARN+ IKS D+ W+EEA+ ++E+ + + PTI Sbjct: 66 NRFRILNTYIEGINDSI-FKYGQLARNIASIKSKHDFDVAWVEEAETVSEKSLDSLIPTI 124 Query: 137 RREGSQIWLIWNPDQYTDFIYQNFVVNPPADCLSKQ------------INWTENPFLSDT 184 R+ GS++W +NP + +Y+ F V P + + Q +++ +NP+L Sbjct: 125 RKPGSELWFSFNPAEEDGAVYKRF-VKPYKELIDTQGYYEDDDLYVGKVSYLDNPWLPAE 183 Query: 185 MLKVIYDEYQRDPKLAEHVYGGAPKMGGDKAIIQLQYVLAAIDAHKKLGWKIEGSKRTGF 244 + + + K HVYGG + A+IQ ++V AAIDAH KLG+K G + F Sbjct: 184 LKNDAQKMKRENYKKWRHVYGGECDANYEDALIQPEWVEAAIDAHIKLGFKPSGIRVVTF 243 Query: 245 DIADDGDDANAIVDAIGNVVVWAEEWDGLEDELLKSSTKVFNHALE-KGSSIIFDSIGVG 303 D AD G D A+ G ++ W E ++ ++ F+ A + + I+D+IG+G Sbjct: 244 DPADSGQDEKALSKRYGVLIEDCVSWS--EGDVADATMTAFDDAFDYRADDFIYDNIGLG 301 Query: 304 AHAGSKFSELNEAR-SLEIIYEPFNAGGAVYDPDGTYMK-----LPHVVITNREH---FS 354 AG+ + L + +++ F AG + PD Y+ LP +R H F Sbjct: 302 
--AGTVKTHLRHSNDGNKMVVTGFGAGDSPDYPDEIYVPGNGEYLPSSNNDDRTHRDTFR 359 Query: 355 NVKAQMWDRVATRFRKTYEVVTYGANHPHDELISISSEHVPAKILDKLKIEL-ASPHKDV 413 N +AQ W +A RF KT+ V G D LIS+SS+ L +LK EL K Sbjct: 360 NKRAQYWVYLADRFYKTWRAVEKGEYLDPDALISLSSKIAK---LSQLKSELIKQQRKRT 416 Query: 414 DGMGKFKVESKKDMREKRGIKSPNIADAFIMAMIQPKR 451 G ++ SK +MR K GIKSPN+AD +M+ P R Sbjct: 417 PGNRLIQLMSKDEMRLK-GIKSPNMADTLMMSFANPLR 453 >gi|2819|lcl|protein:vir:105705 Length: 439 # NCBI annotation: gp2 # Family: family:all:54 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224140;genbank:gi:62362215;genbank:GeneID :3342458 Length = 439 Score = 111 bits (277), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 65/228 (28%), Positives = 114/228 (50%), Gaps = 15/228 (6%) Query: 11 FWLDKARYKALYGGRASSKSHDAAGFAVYLA-----RNYTVKFLCARQFQNKISESVYTL 65 F + RY+ +GGR S+K+ A A N + LCAR++ N + ES Sbjct: 17 FATEGVRYRGAHGGRGSAKTRTFALMTAVKAYQAAEANISGVILCAREYMNSLEESSMEE 76 Query: 66 IKGKIDAAGWTKE-FDVTISSIRHKKTGAEFLFYGIARNLNEIKSTEGVDILWLEEAQYL 124 +K I + W + FD+ IR K ++F G+ NL+ IKS + + W++EA+ + Sbjct: 77 VKQAIRSVAWLDDYFDIGEKYIRTKNRKVSYVFCGLRHNLDSIKSKARILVAWVDEAESV 136 Query: 125 TEEQWNVINPTIRREGSQIWLIWNPDQYTDFIYQNFVVNPPADCLSKQINWTENPFLSDT 184 + W + PT+R EGS+IW+ WNP++ + F NPP + ++N+ +NP+ Sbjct: 137 SSTAWKKLRPTVREEGSEIWVTWNPEKDGSATDKLFRKNPPKSSMIVEMNYVDNPWFP-- 194 Query: 185 MLKVIYDEYQRDPKLAEH-----VYGGAPKMGGDKAIIQLQYVLAAID 227 V+ +E Q D ++ ++ GA DK ++ +YV+ + + Sbjct: 195 --AVLEEERQEDLANLDYADYAWIWEGAYLENSDKQVLANKYVVQSFE 240 >gi|8380|lcl|protein:vir:96782 Length: 408 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224236;genbank:gi:62362371;genbank:GeneID :3345721 Length = 408 Score = 108 bits (269), Expect = 2e-25, Method: Compositional matrix adjust. 
Identities = 63/195 (32%), Positives = 100/195 (51%), Gaps = 3/195 (1%) Query: 16 ARYKALYGGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISESVYTLIKGKIDAAGW 75 R++ YG R S KS + A A ++ LC R+ Q I ES + +K I + W Sbjct: 23 VRFRGAYGSRGSGKSFNFAKMAAIWGAIEKMRILCTRELQVSIKESFHAELKNAIKSDEW 82 Query: 76 TKE-FDVTISSIRHKKTGAEFLFYGIARNLNEIKSTEGVDILWLEEAQYLTEEQWNVINP 134 +DV I IR+ G EFLF G+ + +KST +D+ +EEA+ + E W + P Sbjct: 83 LSSIYDVGIDYIRNNNNGTEFLFKGLRHGMGSVKSTAQIDLTIVEEAEDVPENAWVELLP 142 Query: 135 TI-RREGSQIWLIWNPDQYTDFIYQNFVVNPPADCLSKQINWTENPFLSDTMLKV-IYDE 192 TI R + ++ W+IWNP + + + F P D + ++N+ +NPF + + +DE Sbjct: 143 TIFRTDKAECWVIWNPRKKGSPVDKRFRQFKPDDAVVVEMNYYDNPFFPKGLEDLRRHDE 202 Query: 193 YQRDPKLAEHVYGGA 207 P+L HV+ GA Sbjct: 203 DTMPPELYAHVWLGA 217 >gi|3859|lcl|protein:vir:107748 Length: 532 # NCBI annotation: gp33 TerL # Family: family:all:54 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024878;genbank:gi:48697520;genbank:GeneID :2948367 Length = 532 Score = 91.7 bits (226), Expect = 1e-20, Method: Compositional matrix adjust. 
Identities = 76/253 (30%), Positives = 116/253 (45%), Gaps = 27/253 (10%) Query: 216 IIQLQYVLAAIDAHKKLGWKIEGSKRTGFDIADDGDDANAIVDAIGNVVVWAEEWDGLED 275 +I L+++ AAIDA KLG + G + + D+AD+G D NA +G + +AE W G Sbjct: 295 LIPLEWIDAAIDADVKLGLTVTGQRFSSLDVADEGKDMNAFGSRLGIRMDYAESWSGKGS 354 Query: 276 ELLKSSTKVFNHAL-EKGSSIIFDSIGVG------AHAGSKFSELNEARSLEIIYEPFNA 328 + ++ + + + G FDS G+G A A + E ++ I F Sbjct: 355 NIYGTTLRTIGLVIAQNGRDFQFDSDGLGVGVRGDAEAINALPERKAYPKIDAI--AFRG 412 Query: 329 GGAVYDPD----GTYMKLPHVVITNREHFSNVKAQMWDRVATRFRKTYEVVTYGANHPHD 384 +V +PD G Y + N + F N KAQ + + RF TY V + D Sbjct: 413 SSSVREPDKQVPGAYKG-----VKNVDFFQNRKAQEYWALRMRFEATYRAVVEKLEYDPD 467 Query: 385 ELISISSEHVPAKILDKLKIELASPHKDVDGMGKFKVESKKDMREKRGIKSPNIADAFIM 444 E+ISISS +P L K+++EL P GK ++ D G+ SPN AD M Sbjct: 468 EIISISS-RIPD--LQKIRMELHQPLYKPSTTGKIMIQKTPD-----GMVSPNYAD-MTM 518 Query: 445 AMIQPKRQPAGFF 457 + P++ G F Sbjct: 519 MLYAPQQTKRGIF 531 >gi|15353|lcl|protein:vir:9921 Length: 425 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795683;genbank:gi:28876465;genbank:GeneID :1257982 Length = 425 Score = 67.4 bits (163), Expect = 3e-13, Method: Compositional matrix adjust. 
Identities = 48/171 (28%), Positives = 83/171 (48%), Gaps = 9/171 (5%) Query: 22 YGGRASSKSHDAAGFAVYLARN----YTVKFLCARQFQNKISESVYTLIKGKIDAAGWTK 77 YGG +S KSH + A N + K L R+ + +SV+ I + G Sbjct: 39 YGGASSGKSHGVFQKIILKALNPKFKHPRKILVLRKVGATVRDSVFADIMSNLSYFGILD 98 Query: 78 EFDVTISSIRHK-KTGAEFLFYGIARNLNEIKSTEGVDILWLEEAQYLTEEQWNVINPTI 136 + + +S+ R GAEF+F G+ N +IKS +G+ + +EEA T + + + + Sbjct: 99 KCKINMSAFRITLPNGAEFIFKGMD-NPEKIKSIKGISDVVMEEASEFTLDDYTQLTLRL 157 Query: 137 RREG---SQIWLIWNPDQYTDFIYQNFVVNPPADCLSKQINWTENPFLSDT 184 R + QI+L++NP +++Y+ F V P + + Q + +N FL D Sbjct: 158 RDKKHLEKQIYLMFNPVSKVNWVYKAFFVKTPKNTVVYQTTYKDNRFLDDV 208 >gi|5794|lcl|protein:vir:98922 Length: 453 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164412;genbank:gi:56694902;genbank:GeneID :3197313 Length = 453 Score = 59.7 bits (143), Expect = 8e-11, Method: Compositional matrix adjust. Identities = 40/121 (33%), Positives = 61/121 (50%), Gaps = 8/121 (6%) Query: 3 DLNPALRDFWLDKARYKALYGGRASSKSHDAAGFAVYLARNYTVK-----FLCARQFQNK 57 ++NP ++ W Y L GGR S KS A V++ Y +K + R+ N Sbjct: 16 NINPHFKEVWTSSKPYNILKGGRNSFKSSVIALKLVFMMLLYILKGEKANVVVIRKVGNT 75 Query: 58 ISESVYTLIKGKIDAAGWTKEFDVTIS--SIRHKKTGAEFLFYGIARNLNEIKSTEGVDI 115 I +SV+ I+ I G T+ F T+S I HK+TG+ F FYG + ++KS + DI Sbjct: 76 IRDSVFNKIQWAIKLFGLTRRFKPTVSPFKITHKRTGSTFYFYG-QDDFQKLKSNDIEDI 134 Query: 116 L 116 + Sbjct: 135 I 135 >gi|19292|lcl|protein:vir:4781 Length: 436 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150161;swissprot:trembl:q94m50;genbank:gi :15088772;goa:Q94M50;uniprot:Q94M50;genbank:GeneID:95597 6 Length = 436 Score = 56.6 bits (135), Expect = 6e-10, Method: Compositional matrix adjust. 
Identities = 42/147 (28%), Positives = 71/147 (48%), Gaps = 11/147 (7%) Query: 3 DLNPALRDFWLDKARYKALYGGRASSKSHDAAGFAVYLARNYTV-----KFLCARQFQNK 57 ++NP + W+ Y L GGR S KS Y+ Y + + R+ N Sbjct: 8 NINPHFKSVWISSLPYNVLKGGRNSFKSSVIVLKLAYMMIRYIIAGEAANIVVIRKVANT 67 Query: 58 ISESVYTLIKGKIDAAGWTKEFDVTIS--SIRHKKTGAEFLFYGIARNLNEIKSTEGVDI 115 I +SV+ + ++ G ++F T+S I HK TG+ F FYG + ++KS + +I Sbjct: 68 IRDSVFNKVWWALNLFGIAEQFTKTVSPFKIVHKTTGSTFYFYG-QDDFQKLKSNDIGNI 126 Query: 116 L--WLEE-AQYLTEEQWNVINPTIRRE 139 + W EE A++ +E ++ N T R+ Sbjct: 127 IPVWYEEAAEFNDQEDFDQSNVTFMRQ 153 >gi|7076|lcl|protein:vir:96147 Length: 419 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240074;genbank:gi:66395738;genbank:GeneID :5133130 Length = 419 Score = 53.9 bits (128), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 61/206 (29%), Positives = 89/206 (43%), Gaps = 21/206 (10%) Query: 23 GGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISESVYTLIKGKIDAAGWTKEFDVT 82 GGR S KS D A V L Y V L R+ N ++ SV+ IK I+ G + F + Sbjct: 32 GGRGSGKSSDIAIIIVLLIMRYPVNALILRKIDNTLALSVFEQIKWAINVMGVSHLFKIK 91 Query: 83 IS--SIRHKKTGAEFLFYGIARNLNEIKSTEGVD----ILWLEE-AQYLTEEQWNVINPT 135 +S I + G + +F G A+N IKS + I W+EE A++ TE++ I + Sbjct: 92 VSPMEITYVPRGNKMVFRG-AQNPERIKSLKDAQFPYAIAWIEELAEFKTEDEVTTITNS 150 Query: 136 IRREGSQIWLIWNPDQYTDFIYQNFVVNPPADCLSKQINWTENPFLSDTMLKVIYDEYQR 195 + R L + F Y NPP S E+ F D V + Y Sbjct: 151 LLRGELDNGLFYK------FFY---TYNPPKRKQSWVNKKYESSFQPDNTF-VHHSTYLN 200 Query: 196 DPKLAEHVYGGAPKMGGDKAIIQLQY 221 +P +A+ A KAI +L+Y Sbjct: 201 NPFIAKEFIEEA---KAAKAINELRY 223 >gi|18074|lcl|protein:vir:5954 Length: 422 # NCBI annotation: hypothetical protein # Family: family:all:54 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690654;genbank:geneid:6329138;genbank:gi: 22855048;goa:P54308;interpro:IPR006437;interpro:IPR00670 1;interpro:IPR011441;uniprot:P54308;genbank:GeneID:95525 4 Length = 422 Score = 52.0 bits (123), Expect = 1e-08, Method: 
Compositional matrix adjust. Identities = 55/203 (27%), Positives = 94/203 (46%), Gaps = 18/203 (8%) Query: 20 ALYGGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISESVYTLIKGKIDAAGWTKEF 79 L GGR S+KS A + + L + FL R+ N + +SV+ +K ID + Sbjct: 30 VLKGGRGSAKSTHIAMWIILLMMMMPITFLVIRRVYNTVEQSVFEQLKEAIDMLEVGHLW 89 Query: 80 DVTISSIR--HKKTGAEFLFYGIARNLNEIKSTEG----VDILWLEE-AQYLTEEQWNVI 132 V+ S +R + G +F G ++ +IKS + V +W+EE A++ TEE+ +VI Sbjct: 90 KVSKSPLRLTYIPRGNSIIFRG-GDDVQKIKSIKASKFPVAGMWIEELAEFKTEEEVSVI 148 Query: 133 NPTIRR----EGSQ--IWLIWNPDQYTDFIYQNFVVNP---PADCLSKQINWTENPFLSD 183 ++ R G + + +NP + + N V N PA+ + +NPFLS Sbjct: 149 EKSVLRAELPPGCRYIFFYSYNPPKRKQ-SWVNKVFNSSFLPANTFVDHSTYLQNPFLSK 207 Query: 184 TMLKVIYDEYQRDPKLAEHVYGG 206 ++ + +R+ H Y G Sbjct: 208 AFIEEAEEVKRRNELKYRHEYLG 230 >gi|9289|lcl|protein:vir:97165 Length: 431 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239721;genbank:gi:66394878;genbank:GeneID :5130898 Length = 431 Score = 50.4 bits (119), Expect = 4e-08, Method: Compositional matrix adjust. 
Identities = 57/213 (26%), Positives = 89/213 (41%), Gaps = 19/213 (8%) Query: 11 FWLDKARYKALYGGRASSKSHDAAGFAVYLARNYT-VKFLCARQFQNKISESVYTLIKGK 69 FW +K Y+ + G R S KS A +Y Y L R+F N +S YT +K Sbjct: 18 FWHNKNFYRVVKGSRGSKKSKTTAINLIYRIMKYDWANILVVRRFSNTNKQSTYTDLKWA 77 Query: 70 IDAAGWTK--EFDVTISSIRHKKTGAEFLFYGIARNLNEIKSTEGVDIL---WLEEA-QY 123 + G +F+ ++ I +K TG + LF G+ L T IL W EEA Q Sbjct: 78 TNQLGVAHLFKFNESLPEITYKPTGQKILFRGLDDPLKITSITVDTGILCWAWFEEAYQI 137 Query: 124 LTEEQWNVINPTIRREGS--------QIWLIWNPDQYTDFIYQNFVVNPPA--DCLSKQI 173 T +++ + +IR GS QI + +NP ++ F + S Sbjct: 138 ETFAKFSTVVESIR--GSYDSPEFFKQITVTFNPWSERHWLKPTFFDEETKLNNTFSDTT 195 Query: 174 NWTENPFLSDTMLKVIYDEYQRDPKLAEHVYGG 206 + N +L ++ D Y ++P+ A V G Sbjct: 196 TYRVNEWLDKVDIERYEDLYIKNPRRARIVCDG 228 >gi|13020|lcl|protein:vir:80960 Length: 443 # NCBI annotation: TerL # Family: family:all:54 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468388;genbank:gi:157324962;genbank:Ge neID:5601395 Length = 443 Score = 50.1 bits (118), Expect = 5e-08, Method: Compositional matrix adjust. 
Identities = 50/162 (30%), Positives = 73/162 (45%), Gaps = 19/162 (11%) Query: 4 LNPALRDFWLDKARYKALYGGRASSKSHDAAGFAVYLAR----NYTVKFLCARQFQNKIS 59 +NPA D WL K + GGR+S KS + ++ L N +C R+ N + Sbjct: 21 INPAFYDLWLSKHNHIIAKGGRSSMKS---SVISLKLVEKKMANPMSNMVCLRKVANTLY 77 Query: 60 ESVYTLIKGKIDAAGWTKEFDVTIS--SIRHKKTGAEFLFYGI--ARNLNEIKSTEG-VD 114 +SVY IK + G +F+ S I HK+ G F F G L +K G V Sbjct: 78 KSVYQQIKWALYEMGVADQFNFGKSPMEIIHKEWGTGFYFSGCDDPAKLKSMKIPVGYVS 137 Query: 115 ILWLEE-AQYLTEEQWNVINPTIRRE----GSQ--IWLIWNP 149 LW EE A++ +V+ T RE G + I++ +NP Sbjct: 138 DLWFEELAEFSGVTDIDVVEDTFIREDLPQGQEVTIYMSFNP 179 >gi|15836|lcl|protein:vir:37 Length: 443 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463463;swissprot:trembl:q9t1c1;genbank:gi :16798785;goa:Q9T1C1;uniprot:Q9T1C1;genbank:GeneID:92238 4 Length = 443 Score = 48.5 bits (114), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 50/162 (30%), Positives = 71/162 (43%), Gaps = 19/162 (11%) Query: 4 LNPALRDFWLDKARYKALYGGRASSKSHDAAGFAVYLAR----NYTVKFLCARQFQNKIS 59 +NPA D WL K + GGR+S KS + ++ L N +C R+ N + Sbjct: 21 INPAFYDLWLSKHNHIIAKGGRSSMKS---SVISLKLVEKKMANPMSNMVCLRKVANTLY 77 Query: 60 ESVYTLIKGKIDAAGWTKEFDVTIS--SIRHKKTGAEFLFYGI--ARNLNEIKSTEG-VD 114 +SVY IK + G +F S I HK G F F G L +K G V Sbjct: 78 KSVYQQIKWALYEMGVADQFKFGKSPMEIVHKTWGTGFYFSGCDDPAKLKSMKIPVGYVS 137 Query: 115 ILWLEE-AQYLTEEQWNVINPTIRRE----GSQ--IWLIWNP 149 LW EE A++ +V+ T RE G + I++ +NP Sbjct: 138 GLWFEELAEFSGVTDIDVVEDTFIREDLPQGQEVTIYMSFNP 179 >gi|6093|lcl|protein:vir:95761 Length: 432 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950582;genbank:gi:119953777;genbank:GeneI D:5076831 Length = 432 Score = 48.5 bits (114), Expect = 2e-07, Method: Compositional matrix adjust. 
Identities = 40/125 (32%), Positives = 57/125 (45%), Gaps = 9/125 (7%) Query: 11 FWLDKARYKALYGGRASSKSHDAAGFAVYLARNYT-VKFLCARQFQNKISESVYTLIK-- 67 FW K Y+ + GGR S KS A + + Y L R+F N +S YT +K Sbjct: 22 FWRSKNFYRVVKGGRGSKKSKTTALYYIVAILKYNWANLLVVRRFSNTNKQSTYTDLKWA 81 Query: 68 -GKIDAAGWTKEFDVTISSIRHKKTGAEFLFYGIARNLNEIKSTEGVDI---LWLEEAQY 123 +++ + K F+ ++ I K TG + LF G+ L T + LWLEEA Y Sbjct: 82 ANRLNVSHLFK-FNESLPEITVKATGQKILFRGLDDPLKITSITVDTGLLSWLWLEEA-Y 139 Query: 124 LTEEQ 128 E Q Sbjct: 140 QVENQ 144 >gi|5208|lcl|protein:vir:106641 Length: 446 # NCBI annotation: ORF005 # Family: family:all:54 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239489;genbank:gi:66395220;genbank:GeneID :4555795 Length = 446 Score = 47.0 bits (110), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 39/151 (25%), Positives = 72/151 (47%), Gaps = 17/151 (11%) Query: 44 YTVKFLCARQFQNKISESVYTLIKGKI------DAAGWTKEFDVTISSIRHKKTGAEFLF 97 Y + L R+ Q+ I +S++ +KG + D W K + GA FLF Sbjct: 81 YPRRILWLRKVQSTIKDSLFEDVKGCLINFGIWDMCLWNK-----TDNKVELPNGAVFLF 135 Query: 98 YGIARNLNEIKSTEGVDILWLEEAQYLTEEQWNVINPTIRRE---GSQIWLIWNPDQYTD 154 G+ N +IKS +G+ + +EEA T + + +R QI+L++NP + Sbjct: 136 KGLD-NPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERKHMNKQIFLMFNPVSKLN 194 Query: 155 FIYQNFVVN--PPADCLSKQINWTENPFLSD 183 ++Y+ F + P + + +Q ++ +N FL + Sbjct: 195 WVYKYFFEHGEPMENVMIRQSSYRDNKFLDE 225 >gi|10483|lcl|protein:vir:105297 Length: 446 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950664;genbank:gi:119967835;genbank:GeneI D:4643176 Length = 446 Score = 46.6 bits (109), Expect = 6e-07, Method: Compositional matrix adjust. 
Identities = 45/180 (25%), Positives = 82/180 (45%), Gaps = 16/180 (8%) Query: 23 GGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISESVYTLIKGKIDAAGWTKEFDVT 82 GGR S KS D + L Y + + R+ N ++ SV+ IK I+ + F V Sbjct: 33 GGRGSGKSSDISIIITQLIMRYPMNAVVVRKADNTLATSVFEQIKWAIEEQKVSHLFKVK 92 Query: 83 IS--SIRHKKTGAEFLFYGIARNLNEIKSTEG----VDILWLEE-AQYLTEEQWNVI-NP 134 +S I + G +F G A+N +KS + I+W+EE A++ TE++ I N Sbjct: 93 VSPMEITYVPRGNRIIFRG-AQNPERLKSLKDSRFPFSIMWIEELAEFKTEDEVTTITNS 151 Query: 135 TIRREGS-----QIWLIWN-PDQYTDFIYQNFVVN-PPADCLSKQINWTENPFLSDTMLK 187 +R E + + +N P + ++ + + + P + + +NPF+S ++ Sbjct: 152 MLRGELDDGLFYKFFFSYNPPKRKQSWVNKKYETSFQPDNTFVHHSTYLDNPFISKQFIQ 211 >gi|5891|lcl|protein:vir:107098 Length: 421 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950600;genbank:gi:119953680;genbank:GeneI D:4643107 Length = 421 Score = 46.6 bits (109), Expect = 7e-07, Method: Compositional matrix adjust. Identities = 45/180 (25%), Positives = 82/180 (45%), Gaps = 16/180 (8%) Query: 23 GGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISESVYTLIKGKIDAAGWTKEFDVT 82 GGR S KS D + L Y + + R+ N ++ SV+ IK I+ + F V Sbjct: 34 GGRGSGKSSDISIIITQLIMRYPMNAVVVRKTDNTLATSVFEQIKWAIEEQKVSHLFKVK 93 Query: 83 IS--SIRHKKTGAEFLFYGIARNLNEIKSTEG----VDILWLEE-AQYLTEEQWNVI-NP 134 +S I + G +F G A+N +KS + I+W+EE A++ TE++ I N Sbjct: 94 VSPMEITYVPRGNRIIFRG-AQNPERLKSLKDSRFPFSIMWIEELAEFKTEDEVTTITNS 152 Query: 135 TIRREGS-----QIWLIWN-PDQYTDFIYQNFVVN-PPADCLSKQINWTENPFLSDTMLK 187 +R E + + +N P + ++ + + + P + + +NPF+S ++ Sbjct: 153 MLRGELDDGLFYKFFFSYNPPKRKQSWVNKKYETSFQPDNTFVHHSTYLDNPFISKQFIQ 212 >gi|655|lcl|protein:vir:1588 Length: 447 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695170;swissprot:trembl:o03927;genbank:gi :23455799;goa:O03927;interpro:IPR006437;interpro:IPR0067 01;interpro:IPR011441;uniprot:O03927;genbank:GeneID:9555 65 Length = 447 Score = 46.2 
bits (108), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 48/174 (27%), Positives = 79/174 (45%), Gaps = 20/174 (11%) Query: 4 LNPALRDFWLDKARYKALYGGRASSKSHDAAGFAVYLARNYTVKFLCARQF-----QNKI 58 +NP + W Y GGR S KS + V + + ++ A ++ + Sbjct: 21 INPHFKRMWTTDKPYIVANGGRGSFKSSVISLKLVTMVKKAIMQHRKANVIAVLANKSDL 80 Query: 59 SESVYTLIKGKIDAAGWTKEFDVTIS--SIRHKKTGAEFLFYGIARNLNEIKSTEGVDI- 115 ++VY I+ + EF S +I+HK+TG+ F FYG A N ++KS D+ Sbjct: 81 HDTVYNQIQWALSMLDMDNEFIAYKSPLTIQHKRTGSSFYFYG-ADNPYKLKSNIVGDVV 139 Query: 116 -LWLEEAQYL-TEEQWNVINPTIRREGSQIWLIWNPDQYTDFIYQNFVVNPPAD 167 +W EEA + + + ++ NPT R+ + WL DQ F + NPP + Sbjct: 140 AVWYEEAANMKSSDVFDQANPTFIRQKPE-WL----DQVKVF----YSYNPPKN 184 >gi|7569|lcl|protein:vir:96300 Length: 421 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240307;genbank:gi:66395973;genbank:GeneID :5133377 Length = 421 Score = 45.1 bits (105), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 37/123 (30%), Positives = 58/123 (47%), Gaps = 8/123 (6%) Query: 23 GGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISESVYTLIKGKIDAAGWTKEFDVT 82 GGR S KS D + L Y + + R+ N ++ SV+ IK I+ T F V Sbjct: 34 GGRGSGKSSDISIIITQLIMRYPMNAVVIRKTDNTLATSVFEQIKWAIEEQKVTHLFKVK 93 Query: 83 IS--SIRHKKTGAEFLFYGIARNLNEIKSTEG----VDILWLEE-AQYLTEEQWNVINPT 135 +S I + G +F G A+N +KS + I W+EE A++ TE++ I + Sbjct: 94 VSPMEITYIPRGNRIIFRG-AQNPERLKSLKDSRFPFSIAWIEELAEFKTEDEVTTITNS 152 Query: 136 IRR 138 + R Sbjct: 153 LLR 155 >gi|2779|lcl|protein:vir:99776 Length: 425 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004302;genbank:gi:122891756;genbank:Ge neID:4712331 Length = 425 Score = 45.1 bits (105), Expect = 2e-06, Method: Compositional matrix adjust. 
Identities = 41/151 (27%), Positives = 72/151 (47%), Gaps = 17/151 (11%) Query: 44 YTVKFLCARQFQNKISESVYTLIK------GKIDAAGWTKEFDVTISSIRHKKTGAEFLF 97 Y + L R+ Q+ I +S++ +K G D W K D + GA FLF Sbjct: 59 YPRRILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKT-DNKVGL----PNGAVFLF 113 Query: 98 YGIARNLNEIKSTEGVDILWLEEAQYLTEEQWNVINPTIRRE---GSQIWLIWNPDQYTD 154 G+ N +IKS +G+ + +EEA T + + +R QI+LI+NP + Sbjct: 114 KGLD-NPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERKHVNKQIFLIFNPVSKLN 172 Query: 155 FIYQNFVVN--PPADCLSKQINWTENPFLSD 183 ++Y+ F + P + + +Q ++ +N FL + Sbjct: 173 WVYKYFFEHGEPMENVMIRQSSYRDNKFLDE 203 >gi|9829|lcl|protein:vir:103950 Length: 425 # NCBI annotation: phage terminase large subunit # Family: family:all:54 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873987;genbank:gi:118430762;genbank:GeneI D:4525444 Length = 425 Score = 44.3 bits (103), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 39/151 (25%), Positives = 71/151 (47%), Gaps = 17/151 (11%) Query: 44 YTVKFLCARQFQNKISESVYTLIK------GKIDAAGWTKEFDVTISSIRHKKTGAEFLF 97 Y + L R+ Q+ I +S++ +K G D W K + GA FLF Sbjct: 59 YPRRILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNK-----TDNKVELPNGAVFLF 113 Query: 98 YGIARNLNEIKSTEGVDILWLEEAQYLTEEQWNVINPTIRRE---GSQIWLIWNPDQYTD 154 G+ N +IKS +G+ + +EEA T + + +R QI+L++NP + Sbjct: 114 KGLD-NPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERKHVNKQIFLMFNPVSKLN 172 Query: 155 FIYQNFVVN--PPADCLSKQINWTENPFLSD 183 ++Y+ F + P + + +Q ++ +N FL + Sbjct: 173 WVYKYFFEHGEPMENVMIRQSSYRDNKFLDE 203 >gi|17691|lcl|protein:vir:9305 Length: 447 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803283;genbank:gi:29028593;genbank:GeneID :1258039 Length = 447 Score = 44.3 bits (103), Expect = 3e-06, Method: Compositional matrix adjust. 
Identities = 39/151 (25%), Positives = 71/151 (47%), Gaps = 17/151 (11%) Query: 44 YTVKFLCARQFQNKISESVYTLIK------GKIDAAGWTKEFDVTISSIRHKKTGAEFLF 97 Y + L R+ Q+ I +S++ +K G D W K + GA FLF Sbjct: 81 YPRRILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNK-----TDNKVELPNGAVFLF 135 Query: 98 YGIARNLNEIKSTEGVDILWLEEAQYLTEEQWNVINPTIRRE---GSQIWLIWNPDQYTD 154 G+ N +IKS +G+ + +EEA T + + +R QI+L++NP + Sbjct: 136 KGLD-NPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERKHVNKQIFLMFNPVSKLN 194 Query: 155 FIYQNFVVN--PPADCLSKQINWTENPFLSD 183 ++Y+ F + P + + +Q ++ +N FL + Sbjct: 195 WVYKYFFEHGEPMENVMIRQSSYRDNKFLDE 225 >gi|7643|lcl|protein:vir:96370 Length: 447 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239642;genbank:gi:66395379;genbank:GeneID :5132846 Length = 447 Score = 43.9 bits (102), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 39/151 (25%), Positives = 71/151 (47%), Gaps = 17/151 (11%) Query: 44 YTVKFLCARQFQNKISESVYTLIK------GKIDAAGWTKEFDVTISSIRHKKTGAEFLF 97 Y + L R+ Q+ I +S++ +K G D W K + GA FLF Sbjct: 81 YPRRILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNK-----TDNKVELPNGAVFLF 135 Query: 98 YGIARNLNEIKSTEGVDILWLEEAQYLTEEQWNVINPTIRRE---GSQIWLIWNPDQYTD 154 G+ N +IKS +G+ + +EEA T + + +R QI+L++NP + Sbjct: 136 KGLD-NPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERKHVNKQIFLMFNPVSKLN 194 Query: 155 FIYQNFVVN--PPADCLSKQINWTENPFLSD 183 ++Y+ F + P + + +Q ++ +N FL + Sbjct: 195 WVYKYFFEHGEPMENVMIRQSSYRDNKFLDE 225 >gi|7301|lcl|protein:vir:96238 Length: 447 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239566;genbank:gi:66395301;genbank:GeneID :5132787 Length = 447 Score = 43.9 bits (102), Expect = 4e-06, Method: Compositional matrix adjust. 
Identities = 39/151 (25%), Positives = 71/151 (47%), Gaps = 17/151 (11%) Query: 44 YTVKFLCARQFQNKISESVYTLIK------GKIDAAGWTKEFDVTISSIRHKKTGAEFLF 97 Y + L R+ Q+ I +S++ +K G D W K + GA FLF Sbjct: 81 YPRRILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNK-----TDNKVELPNGAVFLF 135 Query: 98 YGIARNLNEIKSTEGVDILWLEEAQYLTEEQWNVINPTIRRE---GSQIWLIWNPDQYTD 154 G+ N +IKS +G+ + +EEA T + + +R QI+L++NP + Sbjct: 136 KGLD-NPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERKHVNKQIFLMFNPVSKLN 194 Query: 155 FIYQNFVVN--PPADCLSKQINWTENPFLSD 183 ++Y+ F + P + + +Q ++ +N FL + Sbjct: 195 WVYKYFFEHGEPMENVMIRQSSYRDNKFLDE 225 >gi|11588|lcl|protein:vir:78831 Length: 447 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285355;genbank:gi:148717883;genbank:Ge neID:5246962 Length = 447 Score = 43.9 bits (102), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 39/151 (25%), Positives = 71/151 (47%), Gaps = 17/151 (11%) Query: 44 YTVKFLCARQFQNKISESVYTLIK------GKIDAAGWTKEFDVTISSIRHKKTGAEFLF 97 Y + L R+ Q+ I +S++ +K G D W K + GA FLF Sbjct: 81 YPRRILWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNK-----TDNKVELPNGAVFLF 135 Query: 98 YGIARNLNEIKSTEGVDILWLEEAQYLTEEQWNVINPTIRRE---GSQIWLIWNPDQYTD 154 G+ N +IKS +G+ + +EEA T + + +R QI+L++NP + Sbjct: 136 KGLD-NPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERKHVNKQIFLMFNPVSKLN 194 Query: 155 FIYQNFVVN--PPADCLSKQINWTENPFLSD 183 ++Y+ F + P + + +Q ++ +N FL + Sbjct: 195 WVYKYFFEHGEPMENVMIRQSSYRDNKFLDE 225 >gi|10354|lcl|protein:vir:97448 Length: 421 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240743;genbank:gi:66396415;genbank:GeneID :5133804 Length = 421 Score = 43.9 bits (102), Expect = 4e-06, Method: Compositional matrix adjust. 
Identities = 36/123 (29%), Positives = 58/123 (47%), Gaps = 8/123 (6%) Query: 23 GGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISESVYTLIKGKIDAAGWTKEFDVT 82 GGR S KS D + L Y + + R+ N ++ SV+ IK I+ + F V Sbjct: 34 GGRGSGKSSDISIIITQLIMRYPMNAVVIRKTDNTLATSVFEQIKWAIEEQKVSHLFKVK 93 Query: 83 IS--SIRHKKTGAEFLFYGIARNLNEIKSTEG----VDILWLEE-AQYLTEEQWNVINPT 135 +S I + G +F G A+N +KS + I W+EE A++ TE++ I + Sbjct: 94 VSPMEITYIPRGNRIIFRG-AQNPERLKSLKDSRFPFSIAWIEELAEFKTEDEVTTITNS 152 Query: 136 IRR 138 + R Sbjct: 153 LLR 155 >gi|3182|lcl|protein:vir:94499 Length: 421 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240671;genbank:gi:66396341;genbank:GeneID :5133763 Length = 421 Score = 43.9 bits (102), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 36/123 (29%), Positives = 58/123 (47%), Gaps = 8/123 (6%) Query: 23 GGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISESVYTLIKGKIDAAGWTKEFDVT 82 GGR S KS D + L Y + + R+ N ++ SV+ IK I+ + F V Sbjct: 34 GGRGSGKSSDISIIITQLIMRYPMNAVVIRKTDNTLATSVFEQIKWAIEEQKVSHLFKVK 93 Query: 83 IS--SIRHKKTGAEFLFYGIARNLNEIKSTEG----VDILWLEE-AQYLTEEQWNVINPT 135 +S I + G +F G A+N +KS + I W+EE A++ TE++ I + Sbjct: 94 VSPMEITYIPRGNRIIFRG-AQNPERLKSLKDSRFPFSIAWIEELAEFKTEDEVTTITNS 152 Query: 136 IRR 138 + R Sbjct: 153 LLR 155 >gi|4983|lcl|protein:vir:95058 Length: 421 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240816;genbank:gi:66394679;genbank:GeneID :5133852 Length = 421 Score = 43.5 bits (101), Expect = 6e-06, Method: Compositional matrix adjust. 
Identities = 36/123 (29%), Positives = 58/123 (47%), Gaps = 8/123 (6%) Query: 23 GGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISESVYTLIKGKIDAAGWTKEFDVT 82 GGR S KS D + L Y + + R+ N ++ SV+ IK I+ + F V Sbjct: 34 GGRGSGKSSDISIIITQLIMRYPMNAVVIRKTDNTLATSVFEQIKWAIEEQKVSHLFKVK 93 Query: 83 IS--SIRHKKTGAEFLFYGIARNLNEIKSTEG----VDILWLEE-AQYLTEEQWNVINPT 135 +S I + G +F G A+N +KS + I W+EE A++ TE++ I + Sbjct: 94 VSPMEITYIPRGNRIIFRG-AQNPERLKSLKDSRFPFSISWIEELAEFKTEDEVTTITNS 152 Query: 136 IRR 138 + R Sbjct: 153 LLR 155 >gi|6284|lcl|protein:vir:95890 Length: 421 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240381;genbank:gi:66396048;genbank:GeneID :5133401 Length = 421 Score = 43.1 bits (100), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 35/123 (28%), Positives = 58/123 (47%), Gaps = 8/123 (6%) Query: 23 GGRASSKSHDAAGFAVYLARNYTVKFLCARQFQNKISESVYTLIKGKIDAAGWTKEFDVT 82 GGR S KS D + L Y + + R+ N ++ SV+ IK I+ + F V Sbjct: 34 GGRGSGKSSDISIIITQLIMRYPMNAVVIRKTDNTLATSVFEQIKWAIEEQKVSHLFKVK 93 Query: 83 IS--SIRHKKTGAEFLFYGIARNLNEIKSTEG----VDILWLEE-AQYLTEEQWNVINPT 135 +S I + G +F G A+N +KS + + W+EE A++ TE++ I + Sbjct: 94 VSPMEITYIPRGNRIIFRG-AQNPERLKSLKDSRFPFSVAWIEELAEFKTEDEVTTITNS 152 Query: 136 IRR 138 + R Sbjct: 153 LLR 155 >gi|11736|lcl|protein:vir:79016 Length: 417 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110720;genbank:gi:134287337;genbank:Ge neID:4955190 Length = 417 Score = 42.0 bits (97), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 51/230 (22%), Positives = 96/230 (41%), Gaps = 35/230 (15%) Query: 3 DLNPALRDFWLDKARYKALYGGRASSKSHDAAGFAVYLARNYTVK----------FLCAR 52 + NP ++ K RY+A+ G S KS V +A++Y +K L R Sbjct: 10 NFNPDFKEANFTKKRYRAMKGSAGSGKS-------VNVAQDYILKLGDKKYQGANLLVVR 62 Query: 53 QFQNKISESVYTLIKGKID---AAGWTKEFDVTIS--SIRHKKTGAEFLFYGI--ARNLN 105 + + S Y + G I+ K + T++ I+ K TG +F G+ A+ Sbjct: 63 KSEATHKYSTYAELTGAINRIYGKQADKYWKTTLNPLEIKSKVTGNSIIFRGVNDAKQRE 122 Query: 106 EIKSTE----GVDILWLEEAQYLTEEQWNVINPTIRREGS------QIWLIWNPDQYTDF 155 ++KS + +W EEA L E ++++ +R + Q+ +NP T + Sbjct: 123 KLKSINFSKGKLTWVWCEEATELMESDIDILDDRLRGILTNPNLYYQMTFTFNPVSATHW 182 Query: 156 IYQNFVVNPPADCLSKQINWTENPFLSDTMLKVIYDEYQRDPKLAEHVYG 205 I + + D + + +N F+ + + + ++DP+ VYG Sbjct: 183 IKRKYFDYKNDDIFTHHSTYLQNRFIDEAYYRRMQMRKEQDPE-GYKVYG 231 >gi|8730|lcl|protein:vir:96840 Length: 421 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240151;genbank:gi:66395816;genbank:GeneID :5133181 Length = 421 Score = 35.0 bits (79), Expect = 0.002, Method: Compositional matrix adjust. Identities = 30/114 (26%), Positives = 53/114 (46%), Gaps = 8/114 (7%) Query: 32 DAAGFAVYLARNYTVKFLCARQFQNKISESVYTLIKGKIDAAGWTKEFDVTIS--SIRHK 89 D + L Y + + R+ N ++ SV+ IK I+ + F V +S I + Sbjct: 43 DISIIITQLIMRYPMNAVVVRKTDNTLATSVFEQIKWAIEQQKVSHLFKVKVSPMEITYM 102 Query: 90 KTGAEFLFYGIARNLNEIKSTEG----VDILWLEE-AQYLTEEQWNVINPTIRR 138 G +F G A+N +KS + I+W+EE A++ TE++ I ++ R Sbjct: 103 PRGNRIIFRG-AQNPERLKSLKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLR 155 >gi|18891|lcl|protein:vir:8847 Length: 482 # NCBI annotation: terminase # Family: family:all:1730 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775255;genbank:gi:27476053;genbank:GeneID :2700601 Length = 482 Score = 33.1 bits (74), Expect = 0.006, Method: Compositional matrix adjust. 
Identities = 64/276 (23%), Positives = 107/276 (38%), Gaps = 50/276 (18%) Query: 86 IRHKKTGAEFLFYGIARNLNEIKSTEGVDILWLEEAQYLTEEQWNVINPTIRRE---GSQ 142 +RH G L + + +D++WL+E E ++ + R G Sbjct: 142 VRHVSGGLSSLIFKSYEMSQDKFMGTAIDVIWLDE-----ECPKDIYTQCVTRTATTGGI 196 Query: 143 IWLIWNPDQYTDFIYQNFVVNPPADCLSKQINWTENPFLSDTMLKVIYDEYQRDPKLAEH 202 ++L + P+ I ++F+ + +W + P LS + + + Y P Sbjct: 197 VYLTFTPEHGLTEIVKDFLQDLKPGQFLIHASWEDAPHLSPEVKEQLLSVYS--PAERRM 254 Query: 203 VYGGAPKMGGDKA--IIQLQYVLAAIDA----HKKLGWKIEGSKRTGFDIADDGDDANAI 256 G P +G I++ ++V D H+ +G + GFD NAI Sbjct: 255 RAEGIPMLGSGVVFPILEEKFVCEPFDIPDHFHRIIGIDL------GFD------HPNAI 302 Query: 257 VDAIGNVVVWAEEWDG--LEDELLKSSTKVFNHA----LEKGSSIIF----DSIGV-GAH 305 V W E D L DE +S + HA L+ G I D+ GA Sbjct: 303 A-----CVAWDAEKDKYYLYDERSESGETLGMHADAIYLKGGHQIPVVVPHDAFKHDGAT 357 Query: 306 AGSKFSEL-NEARSLEIIYEPF-NAGGAVYDPDGTY 339 +G +F +L + +L ++YEPF N G PDG + Sbjct: 358 SGRRFVDLLKDDHNLNVVYEPFSNPPG----PDGKH 389 >gi|3478|lcl|protein:vir:101370 Length: 494 # NCBI annotation: PacB # Family: family:all:144 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006576;genbank:gi:46401730;genbank:GeneID :2777432 Length = 494 Score = 31.6 bits (70), Expect = 0.022, Method: Compositional matrix adjust. Identities = 13/35 (37%), Positives = 24/35 (68%) Query: 413 VDGMGKFKVESKKDMREKRGIKSPNIADAFIMAMI 447 ++ G++KV SK+DM++K + SP+ D + AM+ Sbjct: 433 INSAGQWKVMSKEDMKKKLNLHSPDHWDTYCFAML 467 >gi|14676|lcl|protein:vir:2499 Length: 474 # NCBI annotation: putative terminase gp4 # Family: family:all:523 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569740;genbank:gi:18496890;genbank:GeneID :932339 Length = 474 Score = 29.3 bits (64), Expect = 0.11, Method: Compositional matrix adjust. 
 Identities = 22/66 (33%), Positives = 29/66 (43%), Gaps = 3/66 (4%)

Query: 90  KTGAEFLFYGIARNLNEIKSTEGVDILWLEEAQYLTEEQWNVINP-TIRREGSQIWLIWN 148
           K G+  LF    R     +   GVD+L  +EAQ LTE   + + P T       I L
Sbjct: 140 KNGSRILFGARERGFG--RGFAGVDVLIFDEAQILTENAMDDMVPATNAAPNPLILLAGT 197

Query: 149 PDQYTD 154
           P + TD
Sbjct: 198 PPKPTD 203

>gi|2931|lcl|protein:vir:105460 Length: 409 # NCBI annotation: putative phage terminase large subunit B # Family: family:all:54 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529870;genbank:gi:90592610;genbank:GeneID:3974524
          Length = 409

 Score = 28.9 bits (63), Expect = 0.13, Method: Compositional matrix adjust.
 Identities = 34/141 (24%), Positives = 59/141 (41%), Gaps = 19/141 (13%)

Query: 59  SESVYTLIKGKIDAAGWTKEFDVTISSIRHKKTGAEFLFYGIARNLNEIKSTEGVDIL-- 116
           S S+YT +   I++      F +T+ + RH      +  +GI    +   S  GV  +
Sbjct: 72  SNSIYTNVISAIESY-----FGITMKTDRH----GHYHLFGIDIVPSYTGSIRGVGFIRG 122

Query: 117 ------WLEEAQYLTEEQWNVINPTIRREGSQIWLIWNPDQYTDFIYQNFVVN--PPADC 168
                 ++ EA   T + +  I      EG++I    NPD  T ++  +++ N  P A
Sbjct: 123 MTSYGAYVNEASLATHDVFQEILQRCSIEGARIICDTNPDIPTHWLKTDYIDNHDPKARI 182

Query: 169 LSKQINWTENPFLSDTMLKVI 189
            S      +N FLS   ++ I
Sbjct: 183 KSFTFTIDDNTFLSKDYVESI 203

>gi|7504|lcl|protein:vir:99915 Length: 478 # NCBI annotation: gp2 # Family: family:all:523 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655519;genbank:gi:109392289;genbank:GeneID:4157084
          Length = 478

 Score = 28.1 bits (61), Expect = 0.25, Method: Compositional matrix adjust.
 Identities = 18/44 (40%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 112 GVDILWLEEAQYLTEEQWNVINPTIRR-EGSQIWLIWNPDQYTD 154
           GV IL L+EAQ LT++  + + PT+   E   I L   P + TD
Sbjct: 158 GVGILVLDEAQRLTDKAMDDLIPTMNTVENPLILLTGTPPRPTD 201

>gi|5525|lcl|protein:vir:95311 Length: 491 # NCBI annotation: putative terminase large subunit # Family: family:all:144 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512256;genbank:gi:89152423;genbank:GeneID:3952979
          Length = 491

 Score = 27.3 bits (59), Expect = 0.43, Method: Compositional matrix adjust.
 Identities = 20/90 (22%), Positives = 43/90 (47%), Gaps = 11/90 (12%)

Query: 370 KTYEVVTYGANHPHDELISISSEHVPA--------KILDKLKI--ELASPHKDVDGMGKF 419
           +T+++V +G      ++++   E   +         +LD  +   +L++    V   GK
Sbjct: 378 RTWQLVPFGGASTDPQMLNKRGEMFNSCKTWLRLGGMLDDQETADDLSTAEYKVRVDGKI 437

Query: 420 KVESKKDMREKRGIKSPNIADAFIMAMIQP 449
            +E K+D++E+ G +SP   DA ++    P
Sbjct: 438 VIEPKEDIKERLG-RSPGKGDALLLTFAFP 466

>gi|16210|lcl|protein:vir:7319 Length: 491 # NCBI annotation: terminase large subunit # Family: family:all:144 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848210;genbank:gi:30387381;genbank:GeneID:2641874
          Length = 491

 Score = 26.9 bits (58), Expect = 0.48, Method: Compositional matrix adjust.
 Identities = 12/33 (36%), Positives = 20/33 (60%), Gaps = 1/33 (3%)

Query: 417 GKFKVESKKDMREKRGIKSPNIADAFIMAMIQP 449
           GK  +E K+D++E+ G +SP   DA ++    P
Sbjct: 435 GKIVIEPKEDIKERLG-RSPGKGDALLLTFAFP 466

>gi|12972|lcl|protein:vir:80678 Length: 503 # NCBI annotation: gp2 # Family: family:all:523 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285578;genbank:gi:148727084;genbank:GeneID:5247049
          Length = 503

 Score = 26.6 bits (57), Expect = 0.72, Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 32/71 (45%), Gaps = 9/71 (12%)

Query: 113 VDILWLEEAQYLTEEQWNVINPTIRREGS----QIWLIWNPDQYTDFIYQNFVVNPPADC 168
           VD L  +EAQ L++EQ   + PT+    S    QI+L   P    D    + V+
Sbjct: 174 VDDLVCDEAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLAD---GSVVLRLRGQA 230

Query: 169 L--SKQINWTE 177
           L   K+  WTE
Sbjct: 231 LGGGKRFAWTE 241

>gi|4240|lcl|protein:vir:94734 Length: 469 # NCBI annotation: large subunit terminase # Family: family:all:523 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996704;genbank:gi:45597419;genbank:GeneID:2767958
          Length = 469

 Score = 26.2 bits (56), Expect = 0.92, Method: Compositional matrix adjust.
 Identities = 12/26 (46%), Positives = 18/26 (69%)

Query: 111 EGVDILWLEEAQYLTEEQWNVINPTI 136
           EG DIL+++EAQ  T EQ + +  T+
Sbjct: 157 EGFDILFIDEAQEYTTEQESALKYTV 182

>gi|14207|lcl|protein:vir:8315 Length: 503 # NCBI annotation: gp32 # Family: family:all:523 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817883;genbank:gi:29566316;genbank:GeneID:1259511
          Length = 503

 Score = 25.0 bits (53), Expect = 1.7, Method: Compositional matrix adjust.
 Identities = 20/65 (30%), Positives = 30/65 (46%), Gaps = 5/65 (7%)

Query: 92  GAEFLFYGIARNLNEIKSTEGVDILWLEEAQYLTEEQWNVINPTIR--REGSQIWLIWNP 149
           G+  LF    R     +   GVD+L  +EAQ LT+     +  T+   R G  I+ +  P
Sbjct: 177 GSRILFGARERGFG--RGIPGVDVLMSDEAQILTQRAMQDMLATLNTSRLGLHIY-VGTP 233

Query: 150 DQYTD 154
            + TD
Sbjct: 234 PKPTD 238

>gi|18494|lcl|protein:vir:1636 Length: 469 # NCBI annotation: putative terminase # Family: family:all:523 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695057;genbank:gi:23455748;genbank:GeneID:955481
          Length = 469

 Score = 24.6 bits (52), Expect = 2.6, Method: Compositional matrix adjust.
 Identities = 12/26 (46%), Positives = 17/26 (65%)

Query: 111 EGVDILWLEEAQYLTEEQWNVINPTI 136
           EG DIL ++EAQ  T EQ + +  T+
Sbjct: 157 EGFDILVIDEAQEYTTEQESALKYTV 182

>gi|12378|lcl|protein:vir:79648 Length: 523 # NCBI annotation: TerL # Family: family:all:144 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285519;genbank:gi:148734502;genbank:GeneID:5220006
          Length = 523

 Score = 24.3 bits (51), Expect = 3.1, Method: Compositional matrix adjust.
 Identities = 14/50 (28%), Positives = 23/50 (46%), Gaps = 8/50 (16%)

Query: 376 TYGANHPHDELISISSEHVPAKILDKLKIELASPHKDVDGMGKFKVESKK 425
           TY  +HPHD+++          + D + IEL      VD M +    +K+
Sbjct: 482 TYDDSHPHDDIMD--------NLFDAVNIELNLADNAVDRMKRLAGLAKR 523

>gi|13482|lcl|protein:vir:9757 Length: 471 # NCBI annotation: putative terminase # Family: family:all:523 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795519;genbank:gi:28876285;genbank:GeneID:1257826
          Length = 471

 Score = 24.3 bits (51), Expect = 3.7, Method: Compositional matrix adjust.
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 111 EGVDILWLEEAQYLTEEQWNVINPTI 136
           EG D+L ++EAQ  T EQ + +  T+
Sbjct: 159 EGFDLLIIDEAQEYTSEQESALKYTV 184

>gi|19181|lcl|protein:vir:9572 Length: 459 # NCBI annotation: gp38 # Family: family:all:523 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862877;genbank:gi:32469469;genbank:GeneID:1461314
          Length = 459

 Score = 23.9 bits (50), Expect = 4.8, Method: Compositional matrix adjust.
 Identities = 11/26 (42%), Positives = 17/26 (65%)

Query: 111 EGVDILWLEEAQYLTEEQWNVINPTI 136
           EG D+L ++EAQ  T EQ + +  T+
Sbjct: 158 EGFDMLIIDEAQEYTTEQESALKYTV 183

  Database: capsid_neck_tail
    Posted date: Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database: 514

Lambda     K      H
   0.318    0.135    0.410

Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 226,221
Number of Sequences: 514
Number of extensions: 11000
Number of successful extensions: 90
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 36
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 30
Number of HSP's gapped (non-prelim): 52
length of query: 459
length of database: 206,069
effective HSP length: 75
effective length of query: 384
effective length of database: 167,519
effective search space: 64327296
effective search space used: 64327296
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 39 (19.6 bits)