BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_020481.1_cdsid_YP_007517700.1 [gene=I900_gp02] [protein=phage terminase large subunit] [protein_id=YP_007517700.1] [location=942..2213]
         (423 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gi|8126|lcl|protein:vir:102661 Length: 568 # NCBI annotation: te...   159   7e-41
gi|20113|lcl|protein:vir:108299 Length: 556 # NCBI annotation: h...   107   3e-25
gi|4674|lcl|protein:vir:105593 Length: 543 # NCBI annotation: te...    89   8e-20
gi|360|lcl|protein:vir:344 Length: 482 # NCBI annotation: termin...    88   2e-19
gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: t...    81   3e-17
gi|1431|lcl|protein:vir:93626 Length: 530 # NCBI annotation: Bce...    76   8e-16
gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: g...    62   9e-12
gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: t...    59   8e-11
gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: g...    58   2e-10
gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T...    57   5e-10
gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: ph...    51   2e-08
gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: te...    51   2e-08
gi|9070|lcl|protein:vir:102238 Length: 588 # NCBI annotation: gp...    49   1e-07
gi|8220|lcl|protein:vir:101493 Length: 588 # NCBI annotation: gp...    49   2e-07
gi|18781|lcl|protein:vir:7429 Length: 581 # NCBI annotation: gp6...    46   1e-06
gi|5321|lcl|protein:vir:99534 Length: 422 # NCBI annotation: put...    42   2e-05
gi|20547|lcl|protein:vir:105155 Length: 580 # NCBI annotation: c...    41   3e-05
gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: te...    41   3e-05
gi|15353|lcl|protein:vir:9921 Length: 425 # NCBI annotation: put...    38   2e-04
gi|27407|lcl|protein:vir:7202 Length: 610 # NCBI annotation: gp1...    36   7e-04
gi|27701|lcl|protein:vir:6893 Length: 611 # NCBI annotation: gp1...    36   7e-04
gi|22056|lcl|protein:vir:103455 Length: 610 # NCBI annotation: t...    36   7e-04
gi|15101|lcl|protein:vir:3867 Length: 563 # NCBI annotation: put...    35   0.001
gi|5208|lcl|protein:vir:106641 Length: 446 # NCBI annotation: OR...    35   0.002
gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: ter...    33   0.005
gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp...    33   0.006
gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: ter...    33   0.006
gi|10609|lcl|protein:vir:106508 Length: 492 # NCBI annotation: P...    30   0.038
gi|27798|lcl|protein:vir:8408 Length: 214 # NCBI annotation: gp3...    30   0.039
gi|8690|lcl|protein:vir:102145 Length: 581 # NCBI annotation: ph...    30   0.067
gi|14207|lcl|protein:vir:8315 Length: 503 # NCBI annotation: gp3...    29   0.12
gi|24596|lcl|protein:vir:98262 Length: 609 # NCBI annotation: gp...    28   0.16
gi|26461|lcl|protein:vir:573 Length: 589 # NCBI annotation: unkn...    28   0.17
gi|9829|lcl|protein:vir:103950 Length: 425 # NCBI annotation: ph...    28   0.23
gi|17691|lcl|protein:vir:9305 Length: 447 # NCBI annotation: lar...    28   0.25
gi|2779|lcl|protein:vir:99776 Length: 425 # NCBI annotation: lar...    28   0.26
gi|7301|lcl|protein:vir:96238 Length: 447 # NCBI annotation: ORF...    28   0.29
gi|14004|lcl|protein:vir:8183 Length: 506 # NCBI annotation: gp3...    28   0.29
gi|9590|lcl|protein:vir:97250 Length: 515 # NCBI annotation: hyp...    27   0.32
gi|7504|lcl|protein:vir:99915 Length: 478 # NCBI annotation: gp2...    27   0.56
gi|7643|lcl|protein:vir:96370 Length: 447 # NCBI annotation: ORF...    27   0.59
gi|11588|lcl|protein:vir:78831 Length: 447 # NCBI annotation: te...    27   0.59
gi|19608|lcl|protein:vir:4081 Length: 518 # NCBI annotation: ter...    26   0.72
gi|24325|lcl|protein:vir:100809 Length: 488 # NCBI annotation: h...    26   0.74
gi|8562|lcl|protein:vir:100097 Length: 571 # NCBI annotation: gp...    26   1.1
gi|13932|lcl|protein:vir:1429 Length: 570 # NCBI annotation: put...    25   1.2
gi|6886|lcl|protein:vir:106551 Length: 424 # NCBI annotation: pu...    25   1.6
gi|5794|lcl|protein:vir:98922 Length: 453 # NCBI annotation: ter...    25   2.3
gi|3859|lcl|protein:vir:107748 Length: 532 # NCBI annotation: gp...    23   8.0
gi|13199|lcl|protein:vir:81178 Length: 567 # NCBI annotation: pu...    23   9.0

>gi|8126|lcl|protein:vir:102661 Length: 568 # NCBI annotation: terminase # Family: family:all:147 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024418;genbank:gi:48696639;genbank:GeneID:2948128
          Length = 568

 Score = 159 bits (402), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 129/458 (28%), Positives = 213/458 (46%), Gaps = 94/458 (20%) Query: 27 VLVCHRRFGKTVCMINHLIRSALLSKNKNPRYAYIAPTFKQAKSIAWDYMKQFTAKIPYT 86 + V HRR GK + + HLI A+ S+ + Y +I P QA+ WD + + Sbjct: 87 IQVLHRRAGKDIGAL-HLI--AIASQLRVGNYKHILPYKTQARDAIWDGIDALGNRFIRN 143 Query: 87 KF--------NETELRVDLPNGSRITLLGSENSDGLRGIYLDGCVIDEYA--NVNSKLFP 136 F NE+ + V NGS L G + SD L G G V E A + N + F Sbjct: 144 AFPDEIVESINESRMLVRFTNGSTYQLQGGD-SDKLVGAGPVGIVYSESALMSPNVRTF- 201 Query: 137 EIIRPALSDRKGYCVFIGTPAGMNNNFYELYQHAQGADDWF------------NYKAKAS 184 +RP L + G+ + I TP G N FY+L HA+ +++W+ Y ++A Sbjct: 202 --LRPMLDETGGWELHITTPRG-KNWFYKLAMHAEKSEEWYYKYLTINDTWRWAYSSEAL 258 Query: 185 ET------------------------------KIVDEEELVK--------AKEVMGDKKY 206 +T + D E+ ++ A +M ++ Sbjct: 259 DTDTLQQAGTATLNDGHVIPVYESIPTELKYRNVADAEKAIERGVVKGMYAVRIMTERMV 318 Query: 207 Q-------------QEFECDWIANIEGAVYSDVLGKMEDQKQLTRVPYDPSLPVSTAWDL 253 Q QE+ CDW ++G+ Y D++ M + ++ + P++P+ PV D+ Sbjct: 319 QSLIDEGQDPFIVRQEYYCDWDVALQGSYYGDLMITMYNTGRIGKFPHNPNRPVYVHMDI 378 Query: 254 GVSDHSAIIFYQQ--LGRSVNIIDYHEERGQGLPYYVQVIKD----KDYVYKDHFAPHDI 307 G +D ++I F Q+ +G+ V IID+ + L +V I++ +DY PHD Sbjct: 379 GFNDSTSITFTQEGPMGQGV-IIDHLWGSNKSLLQWVYDIEEHANKRDYALGLIILPHDA 437 Query: 308 EVTDFGNGKTRREVAYQ------LGIRFKVVPKIPLEDGIHATTMTLPRCWIDTDHCKKL 361 + T+ +GKTR+E + G++F V+ + + GI AT L C ID +C+ L Sbjct: 438 DNTEVSHGKTRKEWVLEEGRFEERGMQFDVLERADKQAGIDATRGFLATCCIDETNCEYL 497 Query: 362 IDALRHYHRKYIDKNRMFRSKPVHDWSSHACDAMRYLS 399 I+AL+ + R+Y +KN+ +R+ VHDW+SH D +RYL+ Sbjct: 498 IEALKSFRREYDEKNQTYRNNAVHDWASHPADNVRYLA 535 >gi|20113|lcl|protein:vir:108299 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:144 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552286;genbank:gi:160700611;genbank:Ge neID:5758815 Length = 556 Score = 107 bits (267), Expect = 3e-25, Method: Compositional matrix adjust. 
Identities = 58/204 (28%), Positives = 105/204 (51%), Gaps = 18/204 (8%) Query: 219 EGAVYSDVLGKMEDQKQLTRVPYDPSLPVSTAWDLGVSDHSAIIFYQQLGRSVNIIDYHE 278 EG VY + ++ ++ + T +P +P+LPV T WDLG +D + Q G+ + +I + Sbjct: 332 EGVVYKKEIERLLEEGRFTHIPVEPALPVYTYWDLGRNDDMVLWLMQPHGKELRLIACYS 391 Query: 279 ERGQGLPYYVQVIKDKDYVYK----DHFAPHDIEVTDFGNGKTRREVAYQLGIRFKVVPK 334 R +G+ +Y+ +KD Y +H APHDI V D ++R +VA ++GI+FK++ + Sbjct: 392 NRDEGMEHYINWLKDFQAKYNIRFGEHLAPHDIAVHDLMTNESRIDVAKKMGIKFKLIER 451 Query: 335 IPLE-DGIHATTMTLPRCWIDTDHCKKLI-------------DALRHYHRKYIDKNRMFR 380 + + I+A PR WID C I L+ R++ N +F+ Sbjct: 452 CKSKRESINALKKLFPRIWIDKVRCDTDIAGNTGDLARKTGWKGLKALRREWDHNNEVFK 511 Query: 381 SKPVHDWSSHACDAMRYLSVGLQE 404 + W+++ CDA++ + + +E Sbjct: 512 DETGPKWATNFCDALQQMGLHYKE 535 >gi|4674|lcl|protein:vir:105593 Length: 543 # NCBI annotation: terminase large subunit # Family: family:all:147 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164303;genbank:gi:56692921;genbank:GeneID :3197204 Length = 543 Score = 89.4 bits (220), Expect = 8e-20, Method: Compositional matrix adjust. 
Identities = 55/185 (29%), Positives = 91/185 (49%), Gaps = 6/185 (3%) Query: 217 NIEGAVYSDVLGKMEDQKQLTRVPYDPSLPVSTAWDLGVSDHSAIIFYQQLGRSVNIIDY 276 + EG Y+ + + + +T+V +P++T WD+G SD AI F+Q+L IDY Sbjct: 330 STEGNWYAKDMATLRKRNGITKVLI-LDMPINTFWDIGRSDGCAIWFHQELHGEDRFIDY 388 Query: 277 HEERGQGLPYYVQVIKDKDYVYKDHFAPHDIE---VTDFGNGKTRREVAYQLGIRFKVVP 333 +E + L +YV+ ++D+ Y++ HF PHD E ++DF G +F +VP Sbjct: 389 YEAHNEDLRHYVKEMRDRGYLFGTHFLPHDAEHKRLSDFNRSTLEMLQDLMPGEQFAIVP 448 Query: 334 KIP-LEDGIHATTMTLPRCWIDTDHCKKLIDALRHYHRKYIDKNRMFRSKP-VHDWSSHA 391 +I L G+ T + ++D C K I L Y +K+ F ++P + S Sbjct: 449 RITELVTGVQQTRKHMKTAYLDETRCAKGIQRLEGYRKKFNRAENRFTNEPDKSNGCSEG 508 Query: 392 CDAMR 396 DA R Sbjct: 509 ADAFR 513 >gi|360|lcl|protein:vir:344 Length: 482 # NCBI annotation: terminase # Family: family:all:147 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203458;genbank:gi:15320614;genbank:GeneID :921717 Length = 482 Score = 87.8 bits (216), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 64/225 (28%), Positives = 110/225 (48%), Gaps = 23/225 (10%) Query: 198 KEVMG-DKKYQQEF-ECDWIANIEGAVYSDVLGKMEDQKQLTRVPYDPSLPVSTAWDLGV 255 ++V G D ++ E+ + + ++ +G++Y D+L +E + + ++ + + T WDLG Sbjct: 221 RQVAGKDAEWVAEYIDSKYPSHDKGSIYGDLLAALEARGGICTFEHE-TGSIFTIWDLGR 279 Query: 256 SDHSAIIFYQQLGRSVNIIDYHEERGQGLPYYVQVI----KDKDYVYKDHFAPHDIEVTD 311 +D ++I F + V+I+D++ G+ L +Y ++ K Y Y H PHD Sbjct: 280 ADSTSIWFMRLRTGGVDIVDHYRNNGEPLSHYFGLLDGWASAKGYRYLKHVLPHDARAKT 339 Query: 312 FGNGKTRREVAYQLGIRFK-----VVPKIPLEDGIHATTMTLPR-----CWIDTDHCKKL 361 TR V Q ++ V P++ LEDGI A L R D L Sbjct: 340 L---VTRSSVLEQFLAKYGPAAVVVGPQLSLEDGIAAARALLERDIRFHARCDVPQVAGL 396 Query: 362 ---IDALRHYHRKYIDKNRMFRSKPVHDWSSHACDAMRYLSVGLQ 403 ++ALR Y +Y +K + + +PVHDW+SH DA RY++ Q Sbjct: 397 ESGLEALRSYRYQYNEKLQTYSREPVHDWASHDADAFRYVATFAQ 441 >gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: terminase, large subunit # Family: family:all:147 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: 
genbank:acc:YP_006983;genbank:gi:46401884;genbank:GeneID :2777679 Length = 438 Score = 80.9 bits (198), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 95/391 (24%), Positives = 163/391 (41%), Gaps = 31/391 (7%) Query: 23 YRWNVLVCHRRFGKTVCMINHLIRSALLSKNKNPRYAYIAPTFKQAKSIAWDYMKQFTAK 82 +R+ RR GK+ L N + +AP + A +I W ++ K Sbjct: 54 HRFVTACVSRRVGKSFIAYTLGFLKLL---EPNVKVLVVAPNYSLA-NIGWSQIRGLIKK 109 Query: 83 --IPYTKFNETELRVDLPNGSRITLLGSENSDGLRGIYLDGCVIDEYA--NVNSKLFPEI 138 + + N + ++L NGS L + +D G D + DE A +V F Sbjct: 110 YGLQTERENAKDKEIELANGSLFKLASAAQADSAVGRSYDFIIFDEAAISDVGGDAFRVQ 169 Query: 139 IRPALSDRKGYCVFIGTPAGMNNNFYELYQHAQGADD----WFNYKAKASETKIVDEEEL 194 +RP L +FI TP G N F E Y A G DD W + + D ++ Sbjct: 170 LRPTLDKPNSKALFISTPRG-GNWFKEFY--AYGFDDTLPNWVSIHGTYRDNPRADLNDI 226 Query: 195 VKAKEVMGDKKYQQEFECDWIANIEGAVYSDVLGKMEDQKQLTRVPY----DPSLPVSTA 250 +A+ + ++QE+E D+ + EG ++ D ++ K L + + D + Sbjct: 227 EEARRTVSKNYFRQEYEADF-SVFEGQIF-DTFNAIDHVKDLKGMRHFFKDDEAFETLLG 284 Query: 251 WDLGVSDHSAI--IFYQQLGRSVNIIDYHEERGQGLPYYVQVIKDKDYVYKDHFAPHDIE 308 D+G D +A+ I Y + +++ +++ + + I+ YK D Sbjct: 285 IDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQHCIDRYK-----VDRI 339 Query: 309 VTDFGNGKTRREVAYQLGIRFKVVPKIPLEDGIHATTMTLPRCWIDTD-HCKKLIDALRH 367 D + R+++AY+ I K L DG+ + I D C LI AL++ Sbjct: 340 FVDSAAAQFRQDLAYEHEIASAPAKKSVL-DGLACLQALFQQGKIIVDASCSSLIHALQN 398 Query: 368 YHRKYID-KNRMFRSKPVHDWSSHACDAMRY 397 Y + + + ++ R KP HD +SH CDA+RY Sbjct: 399 YKWDFQEGEEKLSREKPRHDANSHLCDALRY 429 >gi|1431|lcl|protein:vir:93626 Length: 530 # NCBI annotation: Bcep22gp49 # Family: family:all:147 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944278;genbank:gi:38640355;genbank:GeneID :2658275 Length = 530 Score = 75.9 bits (185), Expect = 8e-16, Method: Compositional matrix adjust. 
Identities = 58/214 (27%), Positives = 101/214 (47%), Gaps = 9/214 (4%) Query: 217 NIEGAVYSDVLGKMEDQKQLT-RVPYDPSLPVSTAWDLGVSDHSAIIFYQQLGRSVNIID 275 + EG Y+ L Q ++ +P ++P T WD+G SD +AI Q++ I Sbjct: 315 STEGTYYAQQLAAARKQGRIKPSLPVLFNVPCFTFWDIGNSDGTAIWVLQRVEHEWRAIR 374 Query: 276 YHEERGQGLPYYVQVIKDKDYVYKDHFAPHDIEVTDFG--NGKTRREVAYQL--GIRFKV 331 + E G+ Y+V+ ++ V+ F PHD + G K+ +++ +L G+RF++ Sbjct: 375 FKEGWGEPYSYFVKWLQGLGLVWDTMFLPHDADHVRQGQTTNKSPKQMLEELMPGVRFEI 434 Query: 332 VPKI-PLEDGIHATTMTLPRCWIDTDHCKKLIDALRHYHRKYIDKNRMFRSKPVHDWS-S 389 VP+I + GI T P W D CK I + +Y +K+ + + + ++P S Sbjct: 435 VPRIDDVNWGIQQTRDAFPLLWFDETECKDGIIHIENYRKKWSVQQQRWMTEPDKTGGHS 494 Query: 390 HACDAMRYLSVGLQE--INDRQTAPQSVADNEYR 421 A DA+R + IN R+ A +S +R Sbjct: 495 EAADALRQFAQAYAGGLINVRKPAGKSKRPRSWR 528 >gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214662;genbank:gi:61806303;genbank:GeneID :3294591 Length = 550 Score = 62.4 bits (150), Expect = 9e-12, Method: Compositional matrix adjust. 
Identities = 52/227 (22%), Positives = 113/227 (49%), Gaps = 14/227 (6%) Query: 5 IPYTPRKHQALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALLSKNKNPR-YAYIAP 63 IP+ Q + K ++R+N+ R+ GK+ + +L+ L + N N A AP Sbjct: 56 IPFDMYNFQEDMVTKFHQHRFNIAKLPRQSGKSTIVTAYLLWYVLFNANVNVAILANKAP 115 Query: 64 TFKQAKS---IAWDYMKQFTAKIPYTKFNETELRVDLPNGSRITLLGSENSDGLRGIYLD 120 T ++ ++++ + ++ + +N+ L +L NGS+I L S ++ +RG+ + Sbjct: 116 TAREMLGRLQLSYENLPKWMQQ-GILGWNKGSL--ELENGSKI-LASSTSASAVRGMSFN 171 Query: 121 GCVIDEYA----NVNSKLFPEIIRPALSDRKGYCVFIGTPAGMNNNFYELYQHAQ-GADD 175 +DE+A ++ + F + S + + I TP GMN FY+L+ A+ GA++ Sbjct: 172 IIFLDEFAFVPNHIAEQFFASVYPTISSGKSTKVIIISTPHGMNQ-FYKLWHDAERGANN 230 Query: 176 WFNYKAKASETKIVDEEELVKAKEVMGDKKYQQEFECDWIANIEGAV 222 + + S+ D++ + E + +++ EFEC+++ +++ + Sbjct: 231 YVATEVHWSQVPGRDDKWKQQTIENTSEAQFRVEFECEFLGSVDTLI 277 >gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: terminase large subunit gp17 # Family: family:all:147 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195134;genbank:gi:58532911;uniprot:Q5GQN8 ;genbank:GeneID:3260486 Length = 548 Score = 59.3 bits (142), Expect = 8e-11, Method: Compositional matrix adjust. 
Identities = 50/234 (21%), Positives = 112/234 (47%), Gaps = 18/234 (7%) Query: 5 IPYTPRKHQALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALLSKNKN----PRYAY 60 +P+ Q L K + R+N+ R+ GK+ ++++L+ + + N N A Sbjct: 55 VPFKMWDFQEELIMKFHKNRFNIAKLPRQTGKSTTVVSYLLHYLIFNDNVNIGILANKAS 114 Query: 61 IAPTFKQAKSIAWDYMKQFTAK--IPYTKFNETELRVDLPNGSRITLLGSENSDGLRGIY 118 A + A++ + ++ + + + K N ++L NGS+I L S ++ +RG+ Sbjct: 115 TARDLLARLATAYENLPKWIQQGVVVWNKGN-----IELENGSKI-LAASTSASAVRGMS 168 Query: 119 LDGCVIDEYA----NVNSKLFPEIIRPALSDRKGYCVFIGTPAGMNNNFYELYQHA-QGA 173 + +DE+A ++ F + S + + I TP GMN+ FY+++ A G Sbjct: 169 FNIIFLDEFAFVPNHIADSFFASVYPTITSGKSTKVIIISTPQGMNH-FYKMWVDATNGR 227 Query: 174 DDWFNYKAKASETKIVDEEELVKAKEVMGDKKYQQEFECDWIANIEGAVYSDVL 227 + + ++ S+ DE+ + + ++++ QEFEC+++ +++ + + L Sbjct: 228 NGYTFHEVHWSQVPGRDEKWKEETIKNTSERQFTQEFECEFLGSVDTLIAASKL 281 >gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: gp123 # Family: family:all:147 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717790;genbank:gi:113200627;genbank:GeneI D:4239178 Length = 549 Score = 57.8 bits (138), Expect = 2e-10, Method: Compositional matrix adjust. 
Identities = 53/235 (22%), Positives = 111/235 (47%), Gaps = 14/235 (5%) Query: 5 IPYTPRKHQALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALLSKNKN----PRYAY 60 IP+ Q + +K + R+N+ R+ GK+ + ++L+ L + N N A Sbjct: 55 IPFDMYYFQEEMVQKFHDNRFNIAKLPRQSGKSTIVTSYLLWYVLFNANVNVAILANKAA 114 Query: 61 IAPTFKQAKSIAWDYMKQFTAKIPYTKFNETELRVDLPNGSRITLLGSENSDGLRGIYLD 120 A Q ++++ + ++ + ++N L +L NGS+I L S ++ +RG+ + Sbjct: 115 TAREMLQRLQLSYENLPKWLQQ-GILQWNRGSL--ELENGSKI-LAASTSASAVRGMSFN 170 Query: 121 GCVIDEYA----NVNSKLFPEIIRPALSDRKGYCVFIGTPAGMNNNFYELYQHAQ-GADD 175 +DE+A +V + F + S + + I TP GM N FY+L+ A+ A++ Sbjct: 171 VIFLDEFAFVPNHVADQFFSSVYPTISSGKSTKVIIISTPHGM-NMFYKLWHDAERKANE 229 Query: 176 WFNYKAKASETKIVDEEELVKAKEVMGDKKYQQEFECDWIANIEGAVYSDVLGKM 230 + + SE D + + +++++ EFEC+++ +++ + L M Sbjct: 230 YIPTEVHWSEVPGRDAAWKEQTIKNTSEQQFRVEFECEFLGSVDTLISPSKLRTM 284 >gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T4-like DNA packaging large subunit terminase # Family: family:all:147 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214360;genbank:gi:61806000;genbank:GeneID :3294466 Length = 547 Score = 56.6 bits (135), Expect = 5e-10, Method: Compositional matrix adjust. 
Identities = 51/229 (22%), Positives = 107/229 (46%), Gaps = 18/229 (7%) Query: 5 IPYTPRKHQALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALLSKNKNPRYAYIAPT 64 +P+ Q L + E R+N+ R+ GK+ I++L+ A+ + N N A +A Sbjct: 53 VPFNMYDFQEKLITRFHENRFNICKMPRQTGKSTTCISYLLHYAVFNDNVN--VAVLANK 110 Query: 65 FKQAK------SIAWDYMKQFTAKIPYTKFNETELRVDLPNGSRITLLGSENSDGLRGIY 118 A+ +A++ + ++ + +N+ L +L NGS+I+ S +S +RG Sbjct: 111 ASTARDLLGRLQLAYENLPRWMQQ-GIISWNKGSL--ELENGSKISA-NSTSSSAVRGGS 166 Query: 119 LDGCVIDEYA----NVNSKLFPEIIRPALSDRKGYCVFIGTPAGMNNNFYELYQHAQ-GA 173 + +DE+A ++ F + S + + + TP GM N+FY ++ ++ G Sbjct: 167 YNVIFLDEFAFIPNHIADDFFASVYPTITSGQSTKVIIVSTPRGM-NHFYRMWHDSEKGK 225 Query: 174 DDWFNYKAKASETKIVDEEELVKAKEVMGDKKYQQEFECDWIANIEGAV 222 ++ SE DEE + +++++ EFEC+++ ++ + Sbjct: 226 SEYVATDVHWSEVPGRDEEWKEQTIANTSEQQFKIEFECEFLGSVNTLI 274 >gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: phage terminase large subunit # Family: family:all:147 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468054;genbank:gi:157265496;genbank:Ge neID:5600564 Length = 485 Score = 51.2 bits (121), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 47/182 (25%), Positives = 76/182 (41%), Gaps = 24/182 (13%) Query: 5 IPYTPRKHQALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALLSKNKNPRYAYIAPT 64 + Y P Q +H+ ++ R V R+ GK+ + L + IAPT Sbjct: 14 LGYKPHHVQLAIHRSTAKRR--VACLGRQSGKSEAASVEAVFE--LFARPGSQGWIIAPT 69 Query: 65 FKQAKSI---AWDYMKQFTAKIPYTKFNETELR-----------VDLPNGSRITLL---- 106 + QA+ I + +++ P T+ R V+ P R+ Sbjct: 70 YDQAEIIFGRVVEKVERLAEVFPATEVQLQRRRLRLLVHHYDRPVNAPGAKRVATSEFRG 129 Query: 107 -GSENSDGLRGIYLDGCVIDEYANVNSKLFPEIIRPALSDRKGYCVFIGTPAGMNNNFYE 165 ++ D LRG LD ++DE A + ++ E I P LS R G+ + I TP G+ N FYE Sbjct: 130 KSADRPDNLRGATLDFVILDEAAMIPFSVWSEAIEPTLSVRDGWALIISTPKGL-NWFYE 188 Query: 166 LY 167 + Sbjct: 189 FF 190 >gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: terminase large subunit # Family: family:all:147 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467938;genbank:gi:157265379;genbank:Ge neID:5600506 Length = 485 Score = 51.2 bits (121), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 47/182 (25%), Positives = 76/182 (41%), Gaps = 24/182 (13%) Query: 5 IPYTPRKHQALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALLSKNKNPRYAYIAPT 64 + Y P Q +H+ ++ R V R+ GK+ + L + IAPT Sbjct: 14 LGYKPHHVQLAIHRSTAKRR--VACLGRQSGKSEAASVEAVFE--LFARPGSQGWIIAPT 69 Query: 65 FKQAKSI---AWDYMKQFTAKIPYTKFNETELR-----------VDLPNGSRITLL---- 106 + QA+ I + +++ P T+ R V+ P R+ Sbjct: 70 YDQAEIIFGRVVEKVERLAEVFPATEVQLQRRRLRLLVHHYDRPVNAPGAKRVATSEFRG 129 Query: 107 -GSENSDGLRGIYLDGCVIDEYANVNSKLFPEIIRPALSDRKGYCVFIGTPAGMNNNFYE 165 ++ D LRG LD ++DE A + ++ E I P LS R G+ + I TP G+ N FYE Sbjct: 130 KSADRPDNLRGATLDFVILDEAAMIPFSVWSEAIEPTLSVRDGWALIISTPKGL-NWFYE 188 Query: 166 LY 167 + Sbjct: 189 FF 190 >gi|9070|lcl|protein:vir:102238 Length: 588 # NCBI annotation: gp8 # Family: family:all:147 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655204;genbank:gi:109522784;genbank:GeneI D:4157477 Length = 588 Score = 48.5 bits (114), Expect = 1e-07, Method: Compositional matrix adjust. 
Identities = 43/179 (24%), Positives = 75/179 (41%), Gaps = 22/179 (12%) Query: 12 HQALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALL---------SKNKNPRYAYIA 62 H L + S R V C RRFGK+ L+ A L + NK Y + Sbjct: 39 HPGQLAIEESPARNKVATCGRRFGKSDLGGKRLVPPAFLAYYRQDLLRTTNKAMIYWIVG 98 Query: 63 PTFKQAKS---IAWDYMKQFTAKIPYTKFNE------TELRVDLPNGS-RITLLGSENSD 112 P + A+ + W+ + + +P+ K + + L G+ ++ ++ D Sbjct: 99 PEYSDAEKEFRVLWNTL--VSLGVPFDKPGSYYDAVGGNMHLSLWQGAYQVHAKSAKYPD 156 Query: 113 GLRGIYLDGCVIDEYANVNSKLFPEIIRPALSDRKGYCVFIGTPAGMNNNFYELYQHAQ 171 L G L G ++ E A ++ + +RP L D G+ + TP G N+F++ +Q Q Sbjct: 157 TLVGEGLSGVIMAEAAKQKPSVWTKHVRPTLGDFTGWSLHTSTPEG-KNHFHDKFQMGQ 214 >gi|8220|lcl|protein:vir:101493 Length: 588 # NCBI annotation: gp8 # Family: family:all:147 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655387;genbank:gi:109522575;genbank:GeneI D:4157565 Length = 588 Score = 48.5 bits (114), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 43/179 (24%), Positives = 75/179 (41%), Gaps = 22/179 (12%) Query: 12 HQALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALL---------SKNKNPRYAYIA 62 H L + S R V C RRFGK+ L+ A L + NK Y + Sbjct: 39 HPGQLAIEESPARNKVATCGRRFGKSDLGGKRLVPPAFLAYYRQDLLRTTNKAMIYWIVG 98 Query: 63 PTFKQAKS---IAWDYMKQFTAKIPYTKFNE------TELRVDLPNGS-RITLLGSENSD 112 P + A+ + W+ + + +P+ K + + L G+ ++ ++ D Sbjct: 99 PEYSDAEKEFRVLWNTL--VSLGVPFDKPGSYYDAVGGNMHLSLWQGAYQVHAKSAKYPD 156 Query: 113 GLRGIYLDGCVIDEYANVNSKLFPEIIRPALSDRKGYCVFIGTPAGMNNNFYELYQHAQ 171 L G L G ++ E A ++ + +RP L D G+ + TP G N+F++ +Q Q Sbjct: 157 TLVGEGLSGVIMAEAAKQKPSVWTKHVRPTLGDFTGWSLHTSTPEG-KNHFHDKFQMGQ 214 >gi|18781|lcl|protein:vir:7429 Length: 581 # NCBI annotation: gp6 # Family: family:all:147 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818544;genbank:gi:29566981;genbank:GeneID :1260231 Length = 581 Score = 45.8 bits (107), Expect = 1e-06, Method: Compositional matrix adjust. 
Identities = 40/161 (24%), Positives = 70/161 (43%), Gaps = 22/161 (13%) Query: 29 VCHRRFGKTVCMINHLIRSALLSK---------NKNPRYAYIAPTFKQAKS---IAWDYM 76 C RR GK+ + + I A+++K K + + P + A+ + W+ Sbjct: 54 TCGRRMGKSAGIAHEFIPEAMITKEMATTLLDDGKRREFWTVGPNYSDAEKPFRVFWNKC 113 Query: 77 KQFTAKIPYTK------FNETELRVDLPNGSRI-TLLGSENSDGLRGIYLDGCVIDEYAN 129 + IP+ K ++ V L +G+ I + S + L G L G ++E A Sbjct: 114 RAL--GIPFDKPGTYFDIKGGDMTVSLWDGAFIYSAKSSAVPERLVGEGLTGVHMEEAAK 171 Query: 130 VNSKLFPEIIRPALSDRKGYCVFIGTPAGMNNNFYELYQHA 170 ++ ++I P L D G+ F TP G N +Y+L+Q A Sbjct: 172 QKEVVWKQMIMPTLMDFGGWAKFTTTPEG-KNWYYDLHQKA 211 >gi|5321|lcl|protein:vir:99534 Length: 422 # NCBI annotation: putative protein # Family: family:all:54 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958532;genbank:gi:41179314;genbank:GeneID :2717172 Length = 422 Score = 42.0 bits (97), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 70/331 (21%), Positives = 140/331 (42%), Gaps = 35/331 (10%) Query: 83 IPYTKFNETELRVDLPNGSRITLLGSENSDGLRGIY-LDGCVIDEYANVN----SKLFPE 137 + Y N ++ + LPNG+ G ++ + ++ I L V++E + N ++L Sbjct: 91 LQYCHVNRSDKTIVLPNGAIFLFQGMDDPEKIKSIKGLSDVVMEEASEFNHNDYTQLTLR 150 Query: 138 IIRPALSDRKGYCVFIGTPAGMNNNFYELYQHAQGADDWFN---YKAKASETKIVDEEEL 194 + P R+ +C+F P N Y+ + D +++ + + +DE+ + Sbjct: 151 LREPKHKQRQIFCMF--NPVSKLNWTYQTWFDPSADYDRSRVAIHQSTYKDNRFLDEDNI 208 Query: 195 VKAKEVMG-DKKYQQEFECDWIANIEGAVYSDVLGKMEDQKQLTRVPYDPSLPVSTAW-- 251 +E+ + Y + + A ++ V+ + K+L P DP L + Sbjct: 209 RTIEELKNTNPAYYKIYTLGEFATLDKLVFPYF-----ETKRLN--PRDPKLLALNDYFG 261 Query: 252 -DLG-VSDHSAIIFYQ--QLGRSVNIIDYHEERGQGLPYYVQVIKDKDYVYKDHFAPHDI 307 D G ++D SA + + +++ ++D ++G QVIKD Y ++ Sbjct: 262 LDYGFINDPSAFMHIKLDMRNKTLYVMDEFVKKGLLNNQLAQVIKDMGY-------SKEV 314 Query: 308 EVTDFGNGKTRREVAYQLGIRFKVVPKIPLEDGIHATTMTLPRC-WIDTDHCKKLIDALR 366 D K+ E+ GI +++ P + D I L + W+ D C K I+ L+ Sbjct: 315 ITADSAEKKSIAEMKRD-GI-YRIRPALKGPDSIIQGIQFLQQFKWVVDDRCVKTIEELQ 372 Query: 367 HYHRKYIDKNRMFRSKPVHDWSSHACDAMRY 397 +Y K + ++P+ D 
+H DA+RY Sbjct: 373 NYTYVKDKKTDEYTNRPI-DAYNHCIDAIRY 402 >gi|20547|lcl|protein:vir:105155 Length: 580 # NCBI annotation: conserved phage-related protein # Family: family:all:4926 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398598;genbank:gi:80159854;genbank:GeneID :3772993 Length = 580 Score = 40.8 bits (94), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 51/237 (21%), Positives = 103/237 (43%), Gaps = 35/237 (14%) Query: 13 QALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALLSKNKNPRYAYIAPTFKQAKSIA 72 Q L+ + ++ ++ +L+C R GK+ + S +L K A + +QA+++ Sbjct: 70 QRLILRAMARNQYVMLICCRGLGKSWLSAVFFVASCILYKGLKCGIA--SGQGQQARNVI 127 Query: 73 WDYMKQFTAKIPYT--------KFNETELRVDLPNGS--RITLLGSENSDGLRGIYLDGC 122 +K AK P K + V+ NGS R +LG DG R Sbjct: 128 IQKVKGELAKNPSIAREIVFPIKTGADDCVVNFRNGSEIRAIVLGRNQGDGARSWRFHYL 187 Query: 123 VIDEYANVNSKLFPEIIRPALSDR-----------KGYCVFIGTPAGMNNNFYELYQH-- 169 ++DE V+ K+ I+ P + KG +FI + ++ Y+ +++ Sbjct: 188 LVDECRLVSDKVINTILIPMTKTKRAVAIHHNKREKGKVIFISSAYLKTSDLYKRFKYFC 247 Query: 170 ---AQGADDWF----NYKAKASETKIVDEEEL--VKAKEVMGDKKYQQEFECDWIAN 217 + GA+++F +Y+ E I D++++ + K M +++Q E+E ++ + Sbjct: 248 DKMSSGANNYFVCSLDYRV-GIEAGIFDQDDIDEERNKPDMTIEEFQYEYEGIFVGS 303 >gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: terminase large subunit # Family: family:all:4877 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429607;genbank:gi:156564098;genbank:Ge neID:5525534 Length = 635 Score = 40.8 bits (94), Expect = 3e-05, Method: Compositional matrix adjust. 
Identities = 43/167 (25%), Positives = 70/167 (41%), Gaps = 12/167 (7%) Query: 10 RKHQALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALLSKNKNPRYAY----IAPTF 65 R +Q + +++++ + VL RR GKT M ++ A NK P Y IAP Sbjct: 69 RDYQEPMLQEMADSKRTVLRLGRRLGKTETMCIMILWHAFTQPNKGPNNQYDILIIAPYE 128 Query: 66 KQAKSIAWDYMKQFTAKIPYTKFNETELRVDLPNGSRI--TLLGSENSDG---LRGIYLD 120 +Q I + + + ++LPNG+ I GS++ G RG D Sbjct: 129 EQVDLIFKRLSQLIDMSGDVNPSRDIDKHIELPNGTVIHGITAGSKSGSGAANTRGQRAD 188 Query: 121 GCVIDEYANVNSKLFPEI--IRPALSDRKGYCVFIGTPAGMNNNFYE 165 V+DE + I IR +R V TP+G +++Y+ Sbjct: 189 LIVLDEMDYMGESEITNIMNIRNEAPERIKMIV-ASTPSGRRDSYYK 234 >gi|15353|lcl|protein:vir:9921 Length: 425 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795683;genbank:gi:28876465;genbank:GeneID :1257982 Length = 425 Score = 37.7 bits (86), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 80/378 (21%), Positives = 153/378 (40%), Gaps = 35/378 (9%) Query: 35 GKTVCMINHLIRSALLSKNKNPRYAYIAPTFKQA--KSIAWDYMKQFT--AKIPYTKFNE 90 GK+ + +I AL K K+PR + S+ D M + + K N Sbjct: 45 GKSHGVFQKIILKALNPKFKHPRKILVLRKVGATVRDSVFADIMSNLSYFGILDKCKINM 104 Query: 91 TELRVDLPNGSRITLLGSENSDGLRGIY-LDGCVIDEYANVNSKLFPEIIRPALSDRKG- 148 + R+ LPNG+ G +N + ++ I + V++E + + ++ L D+K Sbjct: 105 SAFRITLPNGAEFIFKGMDNPEKIKSIKGISDVVMEEASEFTLDDYTQLTL-RLRDKKHL 163 Query: 149 ----YCVFIGTPAGMNNNFYELYQHAQGADDWFNYKAKASETKIVDEEELVKAKEVMG-D 203 Y +F P N Y+ + + + Y+ + + +D+ +E+ + Sbjct: 164 EKQIYLMF--NPVSKVNWVYKAF-FVKTPKNTVVYQTTYKDNRFLDDVTRENIEELANRN 220 Query: 204 KKYQQEFECDWIANIEGAVYSDVLGKMEDQKQLTRVPYDPSLPVSTAWDLG-VSDHSAII 262 + Y + + A ++ ++ ++ ++ +L+ LP D G ++D SA++ Sbjct: 221 EAYYKIYALGQFATLDKLIFPKYDKQILNKDKLSH------LPSFFGLDYGFINDPSALL 274 Query: 263 FYQ--QLGRSVNIIDYHEERGQGLPYYVQVIKDKDYVYKDHFAPHDIEVTDFGNGKTRRE 320 + + + I++ + + IKD Y A +I D K+ +E Sbjct: 275 HVKIDDANKKLYILEEYVRKNLTNDKIANAIKDLGY------AKEEIR-GDSAEKKSNQE 327 Query: 321 
VAYQLGI-RFKVVPKIPLEDGIHATTMTLPRCWIDTDHCKKLIDALRHYHRKYIDKNRMF 379 + LGI R V K P + L WI + C K I+ L +Y K K + Sbjct: 328 LR-NLGIPRMIDVTKGP-GTVMQGIQYLLQYDWIVDERCVKTIEELENYTWKKDKKTNEY 385 Query: 380 RSKPVHDWSSHACDAMRY 397 ++PV D +H DA+RY Sbjct: 386 TNEPV-DSYNHCIDAIRY 402 >gi|27407|lcl|protein:vir:7202 Length: 610 # NCBI annotation: gp17 terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049776;genbank:gi:9632591;genbank:GeneID: 1258647 Length = 610 Score = 36.2 bits (82), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 44/175 (25%), Positives = 74/175 (42%), Gaps = 27/175 (15%) Query: 10 RKHQALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALLSKN--------KNPRYAYI 61 R +Q + K +S R V R+ GKT + L +K+ K A + Sbjct: 140 RDYQRDMLKIMSSKRMTVCNLSRQLGKTTVVAIFLAHFVCFNKDKAVGILAHKGSMSAEV 199 Query: 62 APTFKQAKSIAWDYMKQFTAKIPYTKFNETELRVDLPNGSRITLLGSENSDGLRG----- 116 KQA + D+++ ++N+ + +D NGS I S + D +RG Sbjct: 200 LDRTKQAIELLPDFLQP-----GIVEWNKGSIELD--NGSSIGAYAS-SPDAVRGNSFAM 251 Query: 117 IYLDGCVIDEYANVNSKLFPEIIRPALSD-RKGYCVFIGTPAGMNNNFYELYQHA 170 IY+D C N + I+P +S R+ + TP G+ N+FY+++ A Sbjct: 252 IYIDECAF--IPNFHDSWLA--IQPVISSGRRSKIIITTTPNGL-NHFYDIWTAA 301 >gi|27701|lcl|protein:vir:6893 Length: 611 # NCBI annotation: gp17 terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861869;genbank:gi:32453660;genbank:GeneID :1494295 Length = 611 Score = 36.2 bits (82), Expect = 7e-04, Method: Compositional matrix adjust. 
Identities = 43/171 (25%), Positives = 76/171 (44%), Gaps = 19/171 (11%) Query: 10 RKHQALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALLSKN--------KNPRYAYI 61 R +Q + K +S R V R+ GKT + L +K+ K A + Sbjct: 140 RDYQRDMLKIMSSKRMTVCNLSRQLGKTTVVAIFLAHFVCFNKDKAVGILAHKGSMSAEV 199 Query: 62 APTFKQAKSIAWDYMKQFTAKIPYTKFNETELRVDLPNGSRITLLGSENSDGLRGIYLDG 121 KQA + D+++ ++N+ +++D NGS I S + D +RG Sbjct: 200 LDRTKQAIELLPDFLQP-----GIVEWNKGSIQLD--NGSSIGAYAS-SPDAVRGNSFAM 251 Query: 122 CVIDEYANVNSKLFPEI-IRPALSD-RKGYCVFIGTPAGMNNNFYELYQHA 170 IDE A + + + + I+P +S R+ + TP G+ N+FY+++ A Sbjct: 252 IYIDECAFIPNFIDSWLAIQPVISSGRRSKIIITTTPNGL-NHFYDIWTAA 301 >gi|22056|lcl|protein:vir:103455 Length: 610 # NCBI annotation: terminase subunit nuclease and ATPase # Family: family:all:147 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803107;genbank:gi:116326387;genbank:GeneI D:4405484 Length = 610 Score = 36.2 bits (82), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 44/175 (25%), Positives = 74/175 (42%), Gaps = 27/175 (15%) Query: 10 RKHQALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALLSKN--------KNPRYAYI 61 R +Q + K +S R V R+ GKT + L +K+ K A + Sbjct: 140 RDYQRDMLKIMSSKRMTVCNLSRQLGKTTVVAIFLAHFVCFNKDKAVGILAHKGSMSAEV 199 Query: 62 APTFKQAKSIAWDYMKQFTAKIPYTKFNETELRVDLPNGSRITLLGSENSDGLRG----- 116 KQA + D+++ ++N+ + +D NGS I S + D +RG Sbjct: 200 LDRTKQAIELLPDFLQP-----GIVEWNKGSIELD--NGSSIGAYAS-SPDAVRGNSFAM 251 Query: 117 IYLDGCVIDEYANVNSKLFPEIIRPALSD-RKGYCVFIGTPAGMNNNFYELYQHA 170 IY+D C N + I+P +S R+ + TP G+ N+FY+++ A Sbjct: 252 IYIDECAF--IPNFHDSWLA--IQPVISSGRRSKIIITTTPNGL-NHFYDIWTAA 301 >gi|15101|lcl|protein:vir:3867 Length: 563 # NCBI annotation: putative large terminase subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680484;swissprot:trembl:q8ltc3;genbank:gi :22296524;interpro:IPR005021;uniprot:Q8LTC3;genbank:Gene ID:951698 Length = 563 Score = 35.4 bits (80), Expect = 0.001, 
Method: Compositional matrix adjust. Identities = 45/179 (25%), Positives = 75/179 (41%), Gaps = 21/179 (11%) Query: 32 RRFGKTVCMINHLIRSALLSKN-KNPRYAYIAPTFKQAKSIAW----DYMKQFTAKIP-- 84 R+ GK++ + ++ L KN N R Y A ++ I + D ++ K P Sbjct: 99 RKNGKSLLISGVILYEFLFGKNPANKRQLYTAANDRKQAGIVFGMVKDRLRALMRKDPGI 158 Query: 85 --YTKFNETELRVDLPNGSRITLLGSENSDGLRGIYLDGCVIDEYANVNSKLFPEIIRPA 142 K EL V+L +GS I S ++ + G V+DEYAN + E + Sbjct: 159 KRMVKITRDEL-VNLDDGSTIRSF-SRDTGLVDGYEPHVAVVDEYANAKTTDMIETLASG 216 Query: 143 LSDRKGYCVFIGTPAGMNNN---FYELYQHA-------QGADDWFNYKAKASETKIVDE 191 Y FI + AG + N F + Y +A + A+ +F + A+ + VD+ Sbjct: 217 QVLLPSYLTFIISTAGFDMNVPMFQQNYPYAKKVLSGEEKAERYFAFIAEQDNVQEVDD 275 >gi|5208|lcl|protein:vir:106641 Length: 446 # NCBI annotation: ORF005 # Family: family:all:54 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239489;genbank:gi:66395220;genbank:GeneID :4555795 Length = 446 Score = 35.0 bits (79), Expect = 0.002, Method: Compositional matrix adjust. Identities = 65/320 (20%), Positives = 133/320 (41%), Gaps = 27/320 (8%) Query: 88 FNETELRVDLPNGSRITLLGSENSDGLRGIY-LDGCVIDEYANVNSKLFPEI---IRPAL 143 +N+T+ +V+LPNG+ G +N + ++ I + V++E + + ++ +R Sbjct: 118 WNKTDNKVELPNGAVFLFKGLDNPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERK 177 Query: 144 SDRKGYCVFIGTPAGMNNNFYELYQHAQGADDWFNYKAKASETKIVDEEELVKAKEVMGD 203 K + + +N + ++H + ++ ++ + K +DE + E++ + Sbjct: 178 HMNKQIFLMFNPVSKLNWVYKYFFEHGEPMENVMIRQSSYRDNKFLDEMTR-QNLELLAN 236 Query: 204 KK--YQQEFECDWIANIEGAVYSDVLGKMEDQKQLTRVPYDPSLPVSTAWDLG-VSDHSA 260 + Y + + A ++ V+ ++ K++ LP D G V+D SA Sbjct: 237 RNPAYYKIYALGEFATLDKLVFPKYEKRIISDKEVGH------LPSYFGLDFGYVNDPSA 290 Query: 261 IIFYQ--QLGRSVNIIDYHEERGQGLPYYVQVIKDKDYVYKDHFAPHDIEVTDFGNGKTR 318 I + + + +I + ++G QVI D Y K+ E KT Sbjct: 291 FIHVKIDNDNKKLYVISEYVKKGMLNNEIAQVINDLGYS-KEKITADSAEQKSIMEIKTN 349 Query: 319 REVAYQLGIRFKVVPKIPLEDGIHATTMTLPRCWIDTDH-CKKLIDALRHYHRKYIDKNR 377 GI ++VP + +D + A + + I D C K I+ +Y K Sbjct: 350 -------GID-RIVPAMKGKDSVMAGIQFVSQFDIVIDERCYKTIEEFDNYTWKKDKNTG 
401 Query: 378 MFRSKPVHDWSSHACDAMRY 397 + ++PV D +H DA+RY Sbjct: 402 EYYNEPV-DTYNHCIDALRY 420 >gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899601;genbank:gi:34419588;genbank:GeneID :2546011 Length = 600 Score = 33.5 bits (75), Expect = 0.005, Method: Compositional matrix adjust. Identities = 54/253 (21%), Positives = 106/253 (41%), Gaps = 41/253 (16%) Query: 5 IPYTPRKHQALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALLSKNKNPRYAYIAPT 64 I PR +Q + + R+++ + R+ GKT M L + +++K A Sbjct: 121 IKMVPRPYQKEMLEVADRSRFSIFLLPRQLGKTTIMGIFLAHYLVFNEDKE------AGI 174 Query: 65 FKQAKSIAWDYM---KQFTAKIP------YTKFNETELRVDLPNGSRITLLGSENSDGLR 115 S++ + + K +P ++N+ + D NG ++ S SD +R Sbjct: 175 LAHKGSMSMEVLERVKNVIENLPDFLQPGIEEWNKGNITFD--NGCKLGAYAS-GSDAVR 231 Query: 116 G-----IYLDGCV-IDEYANVNSKLFPEIIRPALSDRKGYCVFIGTPAGMNNNFYELYQH 169 G IY+D C + + + FP I S + V TP G+ N++++++ Sbjct: 232 GKSFSMIYVDECAFVPGFDDFWKATFPVIS----SGEESKVVLTSTPNGL-NHYHDMWNA 286 Query: 170 A-QGADDWFNYKA--KASETKI-----VDEEELVKAKEVMGD---KKYQQEFECDWIANI 218 A QG + Y +A + ++ D+ E K +E +G+ + + QE C+++ Sbjct: 287 AVQGISTFEPYTTTWRAVQNRLYKDGEFDDGEAFK-RETIGNTSREAFSQEHLCNFLGTA 345 Query: 219 EGAVYSDVLGKME 231 + L KM+ Sbjct: 346 GTLINGFKLSKMK 358 >gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469498;genbank:gi:157311455;genbank:Ge neID:5602125 Length = 607 Score = 33.1 bits (74), Expect = 0.006, Method: Compositional matrix adjust. 
Identities = 45/183 (24%), Positives = 75/183 (40%), Gaps = 20/183 (10%) Query: 10 RKHQALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALLSKNK--------NPRYAYI 61 R +Q + K + E R + R+ GKT + L +K+K + Sbjct: 138 RDYQKDMLKIMHENRMSAHKLSRQLGKTTAVAIFLAHYVCFNKDKAVGILAHKGSMAVEV 197 Query: 62 APTFKQAKSIAWDYMKQFTAKIPYTKFNETELRVDLPNGSRITLLGSENSDGLRGIYLDG 121 KQA + D+++ ++N+ + L NGS I S + D +RG Sbjct: 198 LERTKQAIELLPDFLQP-----GIVEWNKKS--IVLENGSSIGAYAS-SPDAVRGNSFSF 249 Query: 122 CVIDEYANVN--SKLFPEIIRPALSDRKGYCVFIGTPAGMNNNFYELYQHA-QGADDWFN 178 IDE A + + F I S R+ + TP G+N+ FY+++Q A G + Sbjct: 250 IYIDECAFIQNWTDCFLAIQPVISSGRESKMIMTTTPNGLNH-FYDIWQSAIDGKSGYVP 308 Query: 179 YKA 181 Y+A Sbjct: 309 YEA 311 >gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891724;genbank:gi:33620526;genbank:GeneID :1725332 Length = 607 Score = 33.1 bits (74), Expect = 0.006, Method: Compositional matrix adjust. Identities = 45/183 (24%), Positives = 75/183 (40%), Gaps = 20/183 (10%) Query: 10 RKHQALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALLSKNK--------NPRYAYI 61 R +Q + K + E R + R+ GKT + L +K+K + Sbjct: 138 RDYQKDMLKIMHENRMSAHKLSRQLGKTTAVAIFLAHYVCFNKDKAVGILAHKGSMAVEV 197 Query: 62 APTFKQAKSIAWDYMKQFTAKIPYTKFNETELRVDLPNGSRITLLGSENSDGLRGIYLDG 121 KQA + D+++ ++N+ + L NGS I S + D +RG Sbjct: 198 LERTKQAIELLPDFLQP-----GIVEWNKKS--IVLENGSSIGAYAS-SPDAVRGNSFSF 249 Query: 122 CVIDEYANVN--SKLFPEIIRPALSDRKGYCVFIGTPAGMNNNFYELYQHA-QGADDWFN 178 IDE A + + F I S R+ + TP G+N+ FY+++Q A G + Sbjct: 250 IYIDECAFIQNWTDCFLAIQPVISSGRESKMIMTTTPNGLNH-FYDIWQSAIDGKSGYVP 308 Query: 179 YKA 181 Y+A Sbjct: 309 YEA 311 >gi|10609|lcl|protein:vir:106508 Length: 492 # NCBI annotation: Pas60 # Family: family:all:144 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024846;genbank:gi:48697461;genbank:GeneID :2846165 Length = 492 Score = 30.4 bits (67), Expect = 0.038, Method: Compositional matrix adjust. 
Identities = 34/169 (20%), Positives = 72/169 (42%), Gaps = 22/169 (13%) Query: 81 AKIPYTKFNET-ELRVDLPNGSRITLLGSENSDGLRGIYLDGCV--IDEYANVNSKLFPE 137 + PY E +LR+ L +G+ T + S + + G + + + +DE V + + Sbjct: 117 GRAPYNPRTELLDLRLKLTHGA-ATAVASNQPERIEGAHAEELLYLLDEAKIVPPATW-D 174 Query: 138 IIRPALSDR------KGYCVFIGTPAGMNNNFYELYQHAQGADDWFNYKAKASETKIVDE 191 I A S+ Y + TP + FY++++ A G +DW+ T+ V Sbjct: 175 SIEGAFSNAGVDVADNAYAFAMSTPGAPSGRFYDIHRRAPGYEDWW--------TRHVTL 226 Query: 192 EELVKAKEVMGDKKYQQEFECDWIANIEGAVYSDVLGKMEDQKQLTRVP 240 EE + + + + + + W ++ ++ VLG+ + + +P Sbjct: 227 EEAIASGRI--SRAWADQRRSQWGSD-SAVFHNRVLGEFHASDEDSVIP 272 >gi|27798|lcl|protein:vir:8408 Length: 214 # NCBI annotation: gp3 # Family: family:all:30874 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818304;genbank:gi:29566740;genbank:GeneID :1260058 Length = 214 Score = 30.4 bits (67), Expect = 0.039, Method: Compositional matrix adjust. Identities = 29/132 (21%), Positives = 54/132 (40%), Gaps = 13/132 (9%) Query: 12 HQALLHKKISEYRWNVLVCHRRFGKTVCMINHLIRSALLSKNKNPRYAYIAPTFKQAKSI 71 HQ L+H+ I + + R+ GKT + + ++ R Y F++A Sbjct: 6 HQKLIHETIDKSSISAFAAPRQNGKTYAALAYALQYP-------GRVLYFGRGFREAGEA 58 Query: 72 AWDYMKQFTAKIPYT--KFNETELRVDLPNG---SRITLLGSENSDGLRGIYLDGCVIDE 126 K + P T K N+++L ++ G R+ + G RG+ D ++D+ Sbjct: 59 FAAATKLGANRGPGTILKTNKSQLSIETSLGGDFGRVNFMPYGRGSG-RGMGADLVILDD 117 Query: 127 YANVNSKLFPEI 138 V + + EI Sbjct: 118 AHEVEADVLAEI 129 >gi|8690|lcl|protein:vir:102145 Length: 581 # NCBI annotation: phage terminase, large subunit, putative # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699944;genbank:gi:110804033;genbank:GeneI D:4206689 Length = 581 Score = 29.6 bits (65), Expect = 0.067, Method: Compositional matrix adjust. 
Identities = 32/136 (23%), Positives = 55/136 (40%), Gaps = 7/136 (5%) Query: 32 RRFGKTVCMINHLIRSALLSKNKNPRYAYIAPTFKQAKSIAWDYMKQFTAKIP-----YT 86 R+ K++ I+ A K N + A KQA+ I W ++F P + Sbjct: 108 RKNAKSLLNSGIGIKLAAFDKYPNAQVYCTATKMKQAR-IVWKQARKFIEIEPDLREIFK 166 Query: 87 KFNETELRVDLPNGSRITLLGSENSDGLRGIYLDGCVIDEYANVNSKLFPEIIRPALSDR 146 + + L NG I LG ++ + G G +IDEY + + +++ ++ Sbjct: 167 IKDHDAIIESLINGGEIMALG-RDTGTIDGFDPHGGIIDEYHSHKTNQMVKLLEDGSVNQ 225 Query: 147 KGYCVFIGTPAGMNNN 162 + I T AG N N Sbjct: 226 AESLISIITTAGFNLN 241 >gi|14207|lcl|protein:vir:8315 Length: 503 # NCBI annotation: gp32 # Family: family:all:523 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817883;genbank:gi:29566316;genbank:GeneID :1259511 Length = 503 Score = 28.9 bits (63), Expect = 0.12, Method: Compositional matrix adjust. Identities = 26/101 (25%), Positives = 47/101 (46%), Gaps = 6/101 (5%) Query: 64 TFKQAKSIA-WDYMKQFTAKIPYTKFNETELRVDLPNGSRITLLGSENSDGLRGI-YLDG 121 TF+ ++ A + + F K+ +E V+ NGSRI L G+ RGI +D Sbjct: 143 TFQAVQAYAKRERVAPFIRKVTLGSGDEA---VEFANGSRI-LFGARERGFGRGIPGVDV 198 Query: 122 CVIDEYANVNSKLFPEIIRPALSDRKGYCVFIGTPAGMNNN 162 + DE + + +++ + R G +++GTP +N Sbjct: 199 LMSDEAQILTQRAMQDMLATLNTSRLGLHIYVGTPPKPTDN 239 >gi|24596|lcl|protein:vir:98262 Length: 609 # NCBI annotation: gp17 terminase subunit, nuclease and ATPase # Family: family:all:147 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239195;genbank:gi:66391670;genbank:GeneID :3416364 Length = 609 Score = 28.5 bits (62), Expect = 0.16, Method: Compositional matrix adjust. 
Identities = 28/100 (28%), Positives = 45/100 (45%), Gaps = 13/100 (13%) Query: 95 VDLPNGSRITLLGSENSDGLRG-----IYLDGCVIDEYANVNSKLFPEIIRPALSD-RKG 148 ++L N +I S + D +RG IY+D C N I+P +S RK Sbjct: 224 IELDNKCKIGAFAS-SPDAVRGNSFAMIYIDECAF--IPNFTDAWLA--IQPVISSGRKS 278 Query: 149 YCVFIGTPAGMNNNFYELYQHA-QGADDWFNYKAKASETK 187 + TP G+N+ FY+++ A +G + Y A + K Sbjct: 279 KILITTTPNGLNH-FYDIWNAAVEGKSGFVPYTAIWTSVK 317 >gi|26461|lcl|protein:vir:573 Length: 589 # NCBI annotation: unknown # Family: family:all:4926 # MgeID: mge:13 # MgeName: SPBc2 # Cross-refs: genbank:acc:NP_046608;genbank:gi:9630181;genbank:GeneID: 1261423 Length = 589 Score = 28.5 bits (62), Expect = 0.17, Method: Compositional matrix adjust. Identities = 15/58 (25%), Positives = 33/58 (56%), Gaps = 2/58 (3%) Query: 87 KFNETELRVDLPNGSRITLLGSENSDGLRGIYLDGCVIDEYANVNSKLFPEIIRPALS 144 K + + +V+ NGS I ++ S +DG R + ++DE+ V+ ++ +++R L+ Sbjct: 145 KTSTNDAKVEFHNGSWIKIVAS--NDGARSKRANLLIVDEFRMVDFEIISKVLRKFLT 200 >gi|9829|lcl|protein:vir:103950 Length: 425 # NCBI annotation: phage terminase large subunit # Family: family:all:54 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873987;genbank:gi:118430762;genbank:GeneI D:4525444 Length = 425 Score = 27.7 bits (60), Expect = 0.23, Method: Compositional matrix adjust. 
Identities = 59/317 (18%), Positives = 133/317 (41%), Gaps = 21/317 (6%) Query: 88 FNETELRVDLPNGSRITLLGSENSDGLRGIY-LDGCVIDEYANVNSKLFPEI---IRPAL 143 +N+T+ +V+LPNG+ G +N + ++ I + V++E + + ++ +R Sbjct: 96 WNKTDNKVELPNGAVFLFKGLDNPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERK 155 Query: 144 SDRKGYCVFIGTPAGMNNNFYELYQHAQGADDWFNYKAKASETKIVDEEELVKAKEVMGD 203 K + + +N + ++H + ++ ++ + K +DE + E++ + Sbjct: 156 HVNKQIFLMFNPVSKLNWVYKYFFEHGEPMENVMIRQSSYRDNKFLDEMTR-QNLELLAN 214 Query: 204 KK--YQQEFECDWIANIEGAVYSDVLGKMEDQKQLTRVPYDPSLPVSTAWDLGVSDHSAI 261 + Y + + A ++ V+ ++ ++ +L +P L D HS I Sbjct: 215 RNPAYYKIYALGEFATLDKLVFPKYEKRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKI 274 Query: 262 IFYQQLGRSVNIIDYHEERGQGLPYYVQVIKDKDYVYKDHFAPHDIEVTDFGNGKTRREV 321 + + + II+ + ++G VIK Y A +I D K+ E+ Sbjct: 275 DVKK---KKLYIIEEYVKQGMLNDEIANVIKQLGY------AKEEI-TADSAEQKSIAEL 324 Query: 322 AYQLGIRFKVVPKIPLEDGIHATTMTLPRCWIDTDH-CKKLIDALRHYHRKYIDKNRMFR 380 LG++ +++P + + L + I D C K I+ +Y + + Sbjct: 325 R-NLGLK-RILPTKKGKGSVVQGLQFLMQFEIIVDERCFKTIEEFDNYTWQKDKDTGEYT 382 Query: 381 SKPVHDWSSHACDAMRY 397 ++PV D +H D++RY Sbjct: 383 NEPV-DTYNHCIDSLRY 398 >gi|17691|lcl|protein:vir:9305 Length: 447 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803283;genbank:gi:29028593;genbank:GeneID :1258039 Length = 447 Score = 27.7 bits (60), Expect = 0.25, Method: Compositional matrix adjust. 
Identities = 59/317 (18%), Positives = 133/317 (41%), Gaps = 21/317 (6%) Query: 88 FNETELRVDLPNGSRITLLGSENSDGLRGIY-LDGCVIDEYANVNSKLFPEI---IRPAL 143 +N+T+ +V+LPNG+ G +N + ++ I + V++E + + ++ +R Sbjct: 118 WNKTDNKVELPNGAVFLFKGLDNPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERK 177 Query: 144 SDRKGYCVFIGTPAGMNNNFYELYQHAQGADDWFNYKAKASETKIVDEEELVKAKEVMGD 203 K + + +N + ++H + ++ ++ + K +DE + E++ + Sbjct: 178 HVNKQIFLMFNPVSKLNWVYKYFFEHGEPMENVMIRQSSYRDNKFLDEMTR-QNLELLAN 236 Query: 204 KK--YQQEFECDWIANIEGAVYSDVLGKMEDQKQLTRVPYDPSLPVSTAWDLGVSDHSAI 261 + Y + + A ++ V+ ++ ++ +L +P L D HS I Sbjct: 237 RNPAYYKIYALGEFATLDKLVFPKYEKRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKI 296 Query: 262 IFYQQLGRSVNIIDYHEERGQGLPYYVQVIKDKDYVYKDHFAPHDIEVTDFGNGKTRREV 321 + + + II+ + ++G VIK Y A +I D K+ E+ Sbjct: 297 DVKK---KKLYIIEEYVKQGMLNDEIANVIKQLGY------AKEEI-TADSAEQKSIAEL 346 Query: 322 AYQLGIRFKVVPKIPLEDGIHATTMTLPRCWIDTDH-CKKLIDALRHYHRKYIDKNRMFR 380 LG++ +++P + + L + I D C K I+ +Y + + Sbjct: 347 R-NLGLK-RILPTKKGKGSVVQGLQFLMQFEIIVDERCFKTIEEFDNYTWQKDKDTGEYT 404 Query: 381 SKPVHDWSSHACDAMRY 397 ++PV D +H D++RY Sbjct: 405 NEPV-DTYNHCIDSLRY 420 >gi|2779|lcl|protein:vir:99776 Length: 425 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004302;genbank:gi:122891756;genbank:Ge neID:4712331 Length = 425 Score = 27.7 bits (60), Expect = 0.26, Method: Compositional matrix adjust. 
Identities = 63/318 (19%), Positives = 133/318 (41%), Gaps = 23/318 (7%) Query: 88 FNETELRVDLPNGSRITLLGSENSDGLRGIY-LDGCVIDEYANVNSKLFPEIIRPALSDR 146 +N+T+ +V LPNG+ G +N + ++ I + V++E + + ++ L +R Sbjct: 96 WNKTDNKVGLPNGAVFLFKGLDNPEKIKSIKGISDIVMEEASEFTLNDYTQLTL-RLRER 154 Query: 147 K---GYCVFIGTPAGMNNNFYE-LYQHAQGADDWFNYKAKASETKIVDEEELVKAKEVMG 202 K I P N Y+ ++H + ++ ++ + K +DE + E++ Sbjct: 155 KHVNKQIFLIFNPVSKLNWVYKYFFEHGEPMENVMIRQSSYRDNKFLDEMTR-QNLELLA 213 Query: 203 DKK--YQQEFECDWIANIEGAVYSDVLGKMEDQKQLTRVPYDPSLPVSTAWDLGVSDHSA 260 ++ Y + + A ++ V+ ++ ++ +L +P L D HS Sbjct: 214 NRNPAYYKIYALGEFATLDKLVFPKYEKRLINKDELRHLPSYFGLDFGYVNDPSAFIHSK 273 Query: 261 IIFYQQLGRSVNIIDYHEERGQGLPYYVQVIKDKDYVYKDHFAPHDIEVTDFGNGKTRRE 320 I + + + II+ + ++G VIK Y A +I D K+ E Sbjct: 274 IDVKK---KKLYIIEEYVKQGMLNDEIANVIKQLGY------AKEEI-TADSAEQKSIAE 323 Query: 321 VAYQLGIRFKVVPKIPLEDGIHATTMTLPRCWIDTDH-CKKLIDALRHYHRKYIDKNRMF 379 + LG++ +++P + + L + I D C K I+ +Y + + Sbjct: 324 LR-NLGLK-RILPTKKGKGSVVQGLQFLMQFEIIVDERCFKTIEEFDNYTWQKDKDTGEY 381 Query: 380 RSKPVHDWSSHACDAMRY 397 ++PV D +H D++RY Sbjct: 382 TNEPV-DTYNHCIDSLRY 398 >gi|7301|lcl|protein:vir:96238 Length: 447 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239566;genbank:gi:66395301;genbank:GeneID :5132787 Length = 447 Score = 27.7 bits (60), Expect = 0.29, Method: Compositional matrix adjust. 
Identities = 58/317 (18%), Positives = 133/317 (41%), Gaps = 21/317 (6%) Query: 88 FNETELRVDLPNGSRITLLGSENSDGLRGIY-LDGCVIDEYANVNSKLFPEI---IRPAL 143 +N+T+ +V+LPNG+ G +N + ++ I + V++E + + ++ +R Sbjct: 118 WNKTDNKVELPNGAVFLFKGLDNPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERK 177 Query: 144 SDRKGYCVFIGTPAGMNNNFYELYQHAQGADDWFNYKAKASETKIVDEEELVKAKEVMGD 203 K + + +N + ++H + ++ ++ + K +DE + E++ + Sbjct: 178 HVNKQIFLMFNPVSKLNWVYKYFFEHGEPMENVMIRQSSYRDNKFLDEMTR-QNLELLAN 236 Query: 204 KK--YQQEFECDWIANIEGAVYSDVLGKMEDQKQLTRVPYDPSLPVSTAWDLGVSDHSAI 261 + Y + + A ++ V+ ++ ++ +L +P L D HS I Sbjct: 237 RNPAYYKIYALGEFATLDKLVFPKYEKRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKI 296 Query: 262 IFYQQLGRSVNIIDYHEERGQGLPYYVQVIKDKDYVYKDHFAPHDIEVTDFGNGKTRREV 321 + + + II+ + ++G VIK Y ++ A D K+ E+ Sbjct: 297 DVKK---KKLYIIEEYVKQGMLNDEIANVIKQLGYAREEITA-------DSAEQKSIAEL 346 Query: 322 AYQLGIRFKVVPKIPLEDGIHATTMTLPRCWIDTDH-CKKLIDALRHYHRKYIDKNRMFR 380 LG++ +++P + + L + I D C K I+ +Y + + Sbjct: 347 R-NLGLK-RILPTKKGKGSVVQGLQFLMQFEIIVDERCFKTIEEFDNYTWQKDKDTGEYT 404 Query: 381 SKPVHDWSSHACDAMRY 397 ++PV D +H D++RY Sbjct: 405 NEPV-DTYNHCIDSLRY 420 >gi|14004|lcl|protein:vir:8183 Length: 506 # NCBI annotation: gp3 # Family: family:all:523 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817976;genbank:gi:29566410;genbank:GeneID :2700964 Length = 506 Score = 27.7 bits (60), Expect = 0.29, Method: Compositional matrix adjust. 
Identities = 20/63 (31%), Positives = 29/63 (46%), Gaps = 2/63 (3%) Query: 95 VDLPNGSRITLLGSENSDGLRGIY-LDGCVIDEYANVNSKLFPEIIRPALSDRKGYCVFI 153 V NGS+I L G+ +S RG +D V DE N+ +++ G F+ Sbjct: 149 VHFANGSKI-LFGARSSGFGRGFSEVDIQVYDECQNLKDSALTDMLAAMNVSEIGLAFFM 207 Query: 154 GTP 156 GTP Sbjct: 208 GTP 210 >gi|9590|lcl|protein:vir:97250 Length: 515 # NCBI annotation: hypothetical protein ORF012 # Family: family:all:144 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294520;genbank:gi:149408241;genbank:Ge neID:5237115 Length = 515 Score = 27.3 bits (59), Expect = 0.32, Method: Compositional matrix adjust. Identities = 17/74 (22%), Positives = 28/74 (37%) Query: 32 RRFGKTVCMINHLIRSALLSKNKNPRYAYIAPTFKQAKSIAWDYMKQFTAKIPYTKFNET 91 R GKT C++ ++ R T+ Q + K F P K+N+ Sbjct: 73 RGPGKTDCLLMDFLQHVGKGYGSEWRGILFRQTYPQLSDVINKTNKWFKRIFPGAKYNKV 132 Query: 92 ELRVDLPNGSRITL 105 E + P+G + L Sbjct: 133 EHKWTFPDGEELLL 146 >gi|7504|lcl|protein:vir:99915 Length: 478 # NCBI annotation: gp2 # Family: family:all:523 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655519;genbank:gi:109392289;genbank:GeneI D:4157084 Length = 478 Score = 26.6 bits (57), Expect = 0.56, Method: Compositional matrix adjust. Identities = 18/60 (30%), Positives = 25/60 (41%) Query: 97 LPNGSRITLLGSENSDGLRGIYLDGCVIDEYANVNSKLFPEIIRPALSDRKGYCVFIGTP 156 L NGSRI EN GL + V+DE + K ++I + + GTP Sbjct: 137 LHNGSRILFGARENGFGLGFAGVGILVLDEAQRLTDKAMDDLIPTMNTVENPLILLTGTP 196 >gi|7643|lcl|protein:vir:96370 Length: 447 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239642;genbank:gi:66395379;genbank:GeneID :5132846 Length = 447 Score = 26.6 bits (57), Expect = 0.59, Method: Compositional matrix adjust. 
Identities = 58/317 (18%), Positives = 133/317 (41%), Gaps = 21/317 (6%) Query: 88 FNETELRVDLPNGSRITLLGSENSDGLRGIY-LDGCVIDEYANVNSKLFPEI---IRPAL 143 +N+T+ +V+LPNG+ G +N + ++ I + V++E + + ++ +R Sbjct: 118 WNKTDNKVELPNGAVFLFKGLDNPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERK 177 Query: 144 SDRKGYCVFIGTPAGMNNNFYELYQHAQGADDWFNYKAKASETKIVDEEELVKAKEVMGD 203 K + + +N + ++H + ++ ++ + K +DE + E++ + Sbjct: 178 HVNKQIFLMFNPVSKLNWVYKYFFEHGEPMENVMIRQSSYRDNKFLDEMTR-QNLELLAN 236 Query: 204 KK--YQQEFECDWIANIEGAVYSDVLGKMEDQKQLTRVPYDPSLPVSTAWDLGVSDHSAI 261 + Y + + + ++ V+ ++ ++ +L +P L D HS I Sbjct: 237 RNPAYYKIYALGEFSTLDKLVFPKYEKRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKI 296 Query: 262 IFYQQLGRSVNIIDYHEERGQGLPYYVQVIKDKDYVYKDHFAPHDIEVTDFGNGKTRREV 321 + + + II+ + ++G VIK Y A +I D K+ E+ Sbjct: 297 DVKK---KKLYIIEEYVKQGMLNDEIANVIKQLGY------AKEEI-TADSAEQKSIAEL 346 Query: 322 AYQLGIRFKVVPKIPLEDGIHATTMTLPRCWIDTDH-CKKLIDALRHYHRKYIDKNRMFR 380 LG++ +++P + + L + I D C K I+ +Y + + Sbjct: 347 R-NLGLK-RILPTKKGKGSVVQGLQFLMQFEIIVDERCFKTIEEFDNYTWQKDKDTGEYT 404 Query: 381 SKPVHDWSSHACDAMRY 397 ++PV D +H D++RY Sbjct: 405 NEPV-DTYNHCIDSLRY 420 >gi|11588|lcl|protein:vir:78831 Length: 447 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285355;genbank:gi:148717883;genbank:Ge neID:5246962 Length = 447 Score = 26.6 bits (57), Expect = 0.59, Method: Compositional matrix adjust. 
Identities = 58/317 (18%), Positives = 133/317 (41%), Gaps = 21/317 (6%) Query: 88 FNETELRVDLPNGSRITLLGSENSDGLRGIY-LDGCVIDEYANVNSKLFPEI---IRPAL 143 +N+T+ +V+LPNG+ G +N + ++ I + V++E + + ++ +R Sbjct: 118 WNKTDNKVELPNGAVFLFKGLDNPEKIKSIKGISDIVMEEASEFTLNDYTQLTLRLRERK 177 Query: 144 SDRKGYCVFIGTPAGMNNNFYELYQHAQGADDWFNYKAKASETKIVDEEELVKAKEVMGD 203 K + + +N + ++H + ++ ++ + K +DE + E++ + Sbjct: 178 HVNKQIFLMFNPVSKLNWVYKYFFEHGEPMENVMIRQSSYRDNKFLDEMTR-QNLELLAN 236 Query: 204 KK--YQQEFECDWIANIEGAVYSDVLGKMEDQKQLTRVPYDPSLPVSTAWDLGVSDHSAI 261 + Y + + + ++ V+ ++ ++ +L +P L D HS I Sbjct: 237 RNPAYYKIYALGEFSTLDKLVFPKYEKRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKI 296 Query: 262 IFYQQLGRSVNIIDYHEERGQGLPYYVQVIKDKDYVYKDHFAPHDIEVTDFGNGKTRREV 321 + + + II+ + ++G VIK Y A +I D K+ E+ Sbjct: 297 DVKK---KKLYIIEEYVKQGMLNDEIANVIKQLGY------AKEEI-TADSAEQKSIAEL 346 Query: 322 AYQLGIRFKVVPKIPLEDGIHATTMTLPRCWIDTDH-CKKLIDALRHYHRKYIDKNRMFR 380 LG++ +++P + + L + I D C K I+ +Y + + Sbjct: 347 R-NLGLK-RILPTKKGKGSVVQGLQFLMQFEIIVDERCFKTIEEFDNYTWQKDKDTGEYT 404 Query: 381 SKPVHDWSSHACDAMRY 397 ++PV D +H D++RY Sbjct: 405 NEPV-DTYNHCIDSLRY 420 >gi|19608|lcl|protein:vir:4081 Length: 518 # NCBI annotation: terminase # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043560;genbank:gi:9628694;genbank:GeneID: 1261154 Length = 518 Score = 26.2 bits (56), Expect = 0.72, Method: Compositional matrix adjust. 
Identities = 35/132 (26%), Positives = 54/132 (40%), Gaps = 13/132 (9%) Query: 100 GSRITLLGSENSDGLRGIYLDGCVIDEYANVNSKLFPEI-IRPALSDRKGYCVFIGTP-- 156 G+ I++ S N D L G +IDE+ P I IR L KG +FI T Sbjct: 166 GTEISIYAS-NEDTLDGGREQLVIIDEFGAFKKN--PLITIRQGLRKNKG-TLFISTTNN 221 Query: 157 ---AGMNNNFYELYQHAQGADD---WFNYKAKASETKIVDEEELVKAKEVMGDKKYQQEF 210 G ++ E ++ DD W Y A ++ D + +KA +G ++ Sbjct: 222 VIRGGAYDDELESWKEWVKDDDFSHWVFYYALDDYDEVKDSSKYIKANPALGYTLSLEDI 281 Query: 211 ECDWIANIEGAV 222 + D+I I V Sbjct: 282 QKDFIGAIGNPV 293 >gi|24325|lcl|protein:vir:100809 Length: 488 # NCBI annotation: hypothetical protein # Family: family:all:1430 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164748;genbank:gi:56693161;genbank:GeneID :3197442 Length = 488 Score = 26.2 bits (56), Expect = 0.74, Method: Compositional matrix adjust. Identities = 12/43 (27%), Positives = 20/43 (46%) Query: 89 NETELRVDLPNGSRITLLGSENSDGLRGIYLDGCVIDEYANVN 131 N +++ SR+ S L G+ +DG +DEY +N Sbjct: 24 NNDSVKMKQIRNSRMMFRSSSTGKALEGVDVDGLSLDEYDRLN 66 >gi|8562|lcl|protein:vir:100097 Length: 571 # NCBI annotation: gp2 # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945032;genbank:gi:38707892;genbank:GeneID :2744144 Length = 571 Score = 25.8 bits (55), Expect = 1.1, Method: Compositional matrix adjust. 
Identities = 33/140 (23%), Positives = 50/140 (35%), Gaps = 15/140 (10%) Query: 99 NGSRITLLGSENSDGLRGIYLDGC-VIDEYANVNSKLFPEIIRPALSDRKGYCVFIGTPA 157 +GSR L DG C ++DEY +S E + + R+ +FI T A Sbjct: 187 DGSRFEPLIGNPGDGAS----PSCSIVDEYHEHDSAALYETMLTGMGARRQPLMFIITTA 242 Query: 158 GMN---------NNFYELYQHAQGADDWFNYKAKASET-KIVDEEELVKAKEVMGDKKYQ 207 G N E+ + D+ F + E D L KA +G YQ Sbjct: 243 GANIEGPCFDKRRQVIEMLEGTVPDDELFGWIWTIDEGDDWTDPRVLAKANPNIGISVYQ 302 Query: 208 QEFECDWIANIEGAVYSDVL 227 E I+ A +++ Sbjct: 303 DYLESQQQRAIKSARFTNTF 322 >gi|13932|lcl|protein:vir:1429 Length: 570 # NCBI annotation: putative terminase (large subunit) # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536358;genbank:gi:17975163;genbank:GeneID :929161 Length = 570 Score = 25.4 bits (54), Expect = 1.2, Method: Compositional matrix adjust. Identities = 50/228 (21%), Positives = 77/228 (33%), Gaps = 28/228 (12%) Query: 18 KKISEYRWNVLVCHRRFGKTVCMINHLIRSALLSKNKNPRYAYIAPTFKQAKSIAWDYMK 77 ++ E W V R+ GK+V I +L A T KQA W+ + Sbjct: 105 RRFRESYWEV---PRKNGKSVTAAGVGIGMFVLDDEFGAEVYAGATTEKQA----WEVFR 157 Query: 78 --QFTAKIPYTKFNETELRVDLPN------GSRITLLGSENSDGLRGIYLDGCVIDEYAN 129 Q K + + V+ N GSR + DG ++DEY Sbjct: 158 PAQLMVKRSPMLIDSAGIEVNASNMNKPADGSRFEPIIGNPGDGASP---SCAIVDEYHE 214 Query: 130 VNSKLFPEIIRPALSDRKGYCVFIGTPAGMN---------NNFYELYQHAQGADDWFNYK 180 +S E + + R+ +FI T AG N E+ + D+ F + Sbjct: 215 HDSAALYETMLTGMGARRQPLMFIITTAGANIEGPCFDKRRQVIEMLEGTVPDDELFGWI 274 Query: 181 AKASET-KIVDEEELVKAKEVMGDKKYQQEFECDWIANIEGAVYSDVL 227 E D L KA +G YQ E I+ A +++ Sbjct: 275 WTIDEGDDWTDPRVLAKANPNIGISVYQDYLESQQQRAIKSARFTNTF 322 >gi|6886|lcl|protein:vir:106551 Length: 424 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958579;genbank:gi:41179257;genbank:GeneID :2717087 Length = 424 Score = 25.0 bits (53), Expect = 1.6, Method: 
Compositional matrix adjust.
 Identities = 13/19 (68%), Positives = 13/19 (68%), Gaps = 5/19 (26%)

Query: 382 KPV--HDWSSHACDAMRYL 398
           KPV  HD   HACDAMRY
Sbjct: 387 KPVKQHD---HACDAMRYF 402

>gi|5794|lcl|protein:vir:98922 Length: 453 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164412;genbank:gi:56694902;genbank:GeneID:3197313
          Length = 453

 Score = 24.6 bits (52), Expect = 2.3, Method: Compositional matrix adjust.
 Identities = 16/63 (25%), Positives = 29/63 (46%), Gaps = 5/63 (7%)

Query: 105 LLGSENSDGLRGIYLDGCVIDEYANVNSKLFPEIIRPALSDRKGY-CVFIGTPAGMNNNF 163
           ++G E+       YLD    D+   V  ++  +I R   +D   Y  +++G P G+  N
Sbjct: 192 MVGEEDYLVHESSYLD----DQLGFVTGQMLKDIERIKNNDHDYYRYIYLGEPVGLGTNV 247

Query: 164 YEL 166
           Y +
Sbjct: 248 YNM 250

>gi|3859|lcl|protein:vir:107748 Length: 532 # NCBI annotation: gp33 TerL # Family: family:all:54 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024878;genbank:gi:48697520;genbank:GeneID:2948367
          Length = 532

 Score = 22.7 bits (47), Expect = 8.0, Method: Compositional matrix adjust.
Identities = 28/139 (20%), Positives = 59/139 (42%), Gaps = 8/139 (5%) Query: 84 PYTKFNETELRVDLPNGSRITLLGSENSDGLRGIYLDGCVIDEYANVNSKLFPEIIRPAL 143 P+ + + +RV +P+ + + G + RG + +DE A++ + + + AL Sbjct: 165 PHDHTHSSHMRVIIPDTGAV-IRGEAGKNIGRGGRVSIQFVDEAAHLENA---QAVDTAL 220 Query: 144 SDRKGYCVFIGTPAGMNNNFYELYQHAQGADDWFNYKAKASETKIVDEEELVKAKEVMGD 203 + + I + G+NN F E + +++ + D+E K K+ Sbjct: 221 AATTNCRIDISSVNGLNNPFAEKRFSGRVKVKTMHWRDDPRK----DDEWYKKQKQKFNA 276 Query: 204 KKYQQEFECDWIANIEGAV 222 QE + D+ A+ EG + Sbjct: 277 LVVAQEIDIDYSASAEGVL 295 >gi|13199|lcl|protein:vir:81178 Length: 567 # NCBI annotation: putative terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285808;genbank:gi:148747729;genbank:Ge neID:5247221 Length = 567 Score = 22.7 bits (47), Expect = 9.0, Method: Compositional matrix adjust. Identities = 17/62 (27%), Positives = 25/62 (40%) Query: 99 NGSRITLLGSENSDGLRGIYLDGCVIDEYANVNSKLFPEIIRPALSDRKGYCVFIGTPAG 158 +GS I L E G VIDEY + +++ + R+ + I T AG Sbjct: 180 SGSIIQPLSKEARKTGDGKNPSLAVIDEYHTHETSEIYDVLVSGMVARQNPLIVIITTAG 239 Query: 159 MN 160 N Sbjct: 240 FN 241 Database: capsid_neck_tail Posted date: Nov 7, 2013 12:16 PM Number of letters in database: 206,069 Number of sequences in database: 514 Lambda K H 0.321 0.137 0.424 Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 201,214 Number of Sequences: 514 Number of extensions: 10107 Number of successful extensions: 90 Number of sequences better than 100.0: 54 Number of HSP's better than 100.0 without gapping: 39 Number of HSP's successfully gapped in prelim test: 15 Number of HSP's that attempted gapping in prelim test: 26 Number of HSP's gapped (non-prelim): 57 length of query: 423 length of database: 206,069 effective HSP length: 74 effective length of query: 349 effective length of database: 168,033 
effective search space: 58643517
effective search space used: 58643517
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 38 (19.2 bits)
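
The footer figures are mutually consistent, and the following minimal sketch re-derives them. It assumes the usual BLAST length correction (each sequence is shortened by the effective HSP length) and the standard bit-score relation E = (search space) x 2^(-bits); the function names are illustrative and are not part of BLAST. Because the report uses compositional matrix adjustment and prints rounded bit scores, the recomputed E-value should only be expected to agree approximately with the listed one.

```python
# Values copied from the statistics footer of this report.
QUERY_LENGTH = 423          # "length of query"
DB_LETTERS = 206_069        # "length of database"
DB_SEQUENCES = 514          # "Number of sequences in database"
EFFECTIVE_HSP_LENGTH = 74   # "effective HSP length"


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Product of the length-corrected query and database sizes.

    Assumption: every sequence (the query and each database entry)
    loses hsp_len residues to edge effects.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query * eff_db


def expect_from_bits(bit_score, search_space):
    """Approximate E-value for a normalized (bit) score."""
    return search_space * 2.0 ** (-bit_score)


space = effective_search_space(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print(space)  # 58643517, matching "effective search space"

# Hit gi|22056 is reported with 36.2 bits and Expect = 7e-04.
print(f"{expect_from_bits(36.2, space):.0e}")
```

The intermediate values are 423 - 74 = 349 for the query and 206,069 - 514 x 74 = 168,033 for the database, which match the "effective length" entries in the footer.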