BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_018835.1_cdsid_YP_006906086.1 [gene=NJ01_017]
[protein=terminase large subunit] [protein_id=YP_006906086.1]
[location=8631..10172]
         (513 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                    Score   E
Sequences producing significant alignments:                         (bits)  Value

gi|18891|lcl|protein:vir:8847 Length: 482 # NCBI annotation: ter...   318   1e-88
gi|13374|lcl|protein:vir:9262 Length: 517 # NCBI annotation: gp2...   232   1e-62
gi|155|lcl|protein:vir:77595 Length: 499 # NCBI annotation: term...   229   6e-62
gi|3300|lcl|protein:vir:100942 Length: 499 # NCBI annotation: Gp...   229   7e-62
gi|1219|lcl|protein:vir:105519 Length: 475 # NCBI annotation: ph...   223   4e-60
gi|2931|lcl|protein:vir:105460 Length: 409 # NCBI annotation: pu...    50   7e-08
gi|2172|lcl|protein:vir:100329 Length: 605 # NCBI annotation: te...    36   0.001
gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: t...    36   0.001
gi|17332|lcl|protein:vir:2014 Length: 590 # NCBI annotation: gpP...    33   0.005
gi|18685|lcl|protein:vir:5692 Length: 590 # NCBI annotation: gpP...    32   0.017
gi|16979|lcl|protein:vir:6059 Length: 590 # NCBI annotation: gpP...    32   0.017
gi|13020|lcl|protein:vir:80960 Length: 443 # NCBI annotation: Te...    32   0.019
gi|12899|lcl|protein:vir:80432 Length: 510 # NCBI annotation: Bc...    30   0.061
gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: ph...    30   0.069
gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: te...    30   0.069
gi|4395|lcl|protein:vir:98555 Length: 589 # NCBI annotation: gp3...    30   0.085
gi|9289|lcl|protein:vir:97165 Length: 431 # NCBI annotation: ORF...    30   0.094
gi|8126|lcl|protein:vir:102661 Length: 568 # NCBI annotation: te...    29   0.16
gi|3326|lcl|protein:vir:94525 Length: 440 # NCBI annotation: put...    28   0.20
gi|6719|lcl|protein:vir:99411 Length: 447 # NCBI annotation: hyp...    28   0.22
gi|5072|lcl|protein:vir:95130 Length: 531 # NCBI annotation: hyp...    28   0.32
gi|6886|lcl|protein:vir:106551 Length: 424 # NCBI annotation: pu...    27   0.46
gi|2589|lcl|protein:vir:94084 Length: 407 # NCBI annotation: ORF...    27   0.56
gi|3523|lcl|protein:vir:105897 Length: 407 # NCBI annotation: la...    27   0.56
gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: g...    27   0.80
gi|4548|lcl|protein:vir:94970 Length: 473 # NCBI annotation: put...    26   1.2
gi|1431|lcl|protein:vir:93626 Length: 530 # NCBI annotation: Bce...    25   1.5
gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T...    25   1.8
gi|5321|lcl|protein:vir:99534 Length: 422 # NCBI annotation: put...    25   1.8
gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: t...    25   2.8
gi|22578|lcl|protein:vir:95547 Length: 605 # NCBI annotation: OR...    25   2.8
gi|9399|lcl|protein:vir:99333 Length: 605 # NCBI annotation: hyp...    25   2.8
gi|19292|lcl|protein:vir:4781 Length: 436 # NCBI annotation: put...    24   4.5
gi|20547|lcl|protein:vir:105155 Length: 580 # NCBI annotation: c...    23   5.5
gi|13692|lcl|protein:vir:4897 Length: 411 # NCBI annotation: gp4...    23   6.2
gi|16780|lcl|protein:vir:2731 Length: 411 # NCBI annotation: hyp...    23   6.3
gi|8015|lcl|protein:vir:96495 Length: 195 # NCBI annotation: ter...    23   6.7
gi|1165|lcl|protein:vir:102941 Length: 421 # NCBI annotation: te...    23   7.2
gi|19929|lcl|protein:vir:4852 Length: 369 # NCBI annotation: put...
23 8.5 >gi|18891|lcl|protein:vir:8847 Length: 482 # NCBI annotation: terminase # Family: family:all:1730 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775255;genbank:gi:27476053;genbank:GeneID :2700601 Length = 482 Score = 318 bits (815), Expect = 1e-88, Method: Compositional matrix adjust. Identities = 188/478 (39%), Positives = 271/478 (56%), Gaps = 28/478 (5%) Query: 18 VDEKKALLNMLKERDQWRKYNRILSFKAYDFQKKFYAAGLKHRFRFLCAANRVGKSYSEA 77 +D ++ L N+++E + +KY R+ + Y +Q+KF AA NR GK+Y+ A Sbjct: 1 MDTQERLRNLVRELAERQKYFRMNQYTPYGWQEKFIAASSNCAQLLAMTGNRCGKTYTGA 60 Query: 78 YEFACHVTGRYPTWWTGYKFKRPILAWAVGITGDSTRKVLQKELFGTPIGKDTNLLGTGV 137 + ACH+TGRYP WWTG KF +P+ WA GI+ D+TR +LQ EL G K+ GTG+ Sbjct: 61 FIMACHLTGRYPEWWTGRKFDKPVNCWAAGISTDTTRDILQSELLGD--WKNPEAFGTGM 118 Query: 138 IPRDAIVIDTIERDGNK--LQIVQIKHQNERGEFDGLSTLEFRSTQQGEHTLMGATVDYI 195 IP++ IV T R+G +Q V ++H + GLS+L F+S + + MG +D I Sbjct: 119 IPKEDIV-KTERREGKPGCVQAVMVRHVS-----GGLSSLIFKSYEMSQDKFMGTAIDVI 172 Query: 196 WLDEEDPYESMAIFAQCVTRTLTTKGLVTITATPENGLTELVDKFMKGEGDESTGSLYFQ 255 WLDEE P + I+ QCVTRT TT G+V +T TPE+GLTE+V F++ D G + Sbjct: 173 WLDEECPKD---IYTQCVTRTATTGGIVYLTFTPEHGLTEIVKDFLQ---DLKPGQ-FLI 225 Query: 256 NASWWDAHADLGGHITDQDIKDMTEGIPAWQLEMRSKGMPLLGSGLIYDVSDDTIKCEPF 315 +ASW DA H++ + + + + MR++G+P+LGSG+++ + ++ CEPF Sbjct: 226 HASWEDA-----PHLSPEVKEQLLSVYSPAERRMRAEGIPMLGSGVVFPILEEKFVCEPF 280 Query: 316 EIPDTWKRVCAIDIGIDHPTAAVWTAYDANTDTIYVYDSYKEGGFTPVYHAPAINGR-GQ 374 +IPD + R+ ID+G DHP A A+DA D Y+YD E G T HA AI + G Sbjct: 281 DIPDHFHRIIGIDLGFDHPNAIACVAWDAEKDKYYLYDERSESGETLGMHADAIYLKGGH 340 Query: 375 WIPVILPHDADNTEKG-SGSSVAQFYK-NAGVNVQSETFYNKIGMDGKK-NFFVEPGITD 431 IPV++PHDA + SG K + +NV E F N G DGK VE G+ Sbjct: 341 QIPVVVPHDAFKHDGATSGRRFVDLLKDDHNLNVVYEPFSNPPGPDGKHGGNSVEFGVNW 400 Query: 432 IRERMMSGRFKVFNTAANAKLFEEKARYHRKVGKIIKEHDDLMDAMRYSACSVTHRGR 489 + RM +G KVFNT N +E YHRK GKI+ +DD++ A RY+ + R Sbjct: 401 MLTRMENGDLKVFNTCTN--FLKEMKMYHRKDGKIVDRNDDMISATRYALLMASRHAR 456 >gi|13374|lcl|protein:vir:9262 
Length: 517 # NCBI annotation: gp2 # Family: family:all:1730 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720326;genbank:gi:24371583;genbank:GeneID :955800 Length = 517 Score = 232 bits (591), Expect = 1e-62, Method: Compositional matrix adjust. Identities = 160/499 (32%), Positives = 246/499 (49%), Gaps = 47/499 (9%) Query: 7 IYEALTGNNLSVDEKKALLNMLKERD------------QWRKYNRILSFKAYDFQKKFYA 54 I AL + S E A+L+ L + + +R + + F Y Q++F Sbjct: 7 ISAALVSRSYSTVELDAILDNLSDEEQIELLELLEEEENYRNTHLLYEFTPYSKQREFID 66 Query: 55 AGLKHRFRFLCAANRVGKSYSEAYEFACHVTGRYPTW------------WTGYKFKRPIL 102 AG + R A N++GKS++ A E A H+TGRYP W G +F P++ Sbjct: 67 AGHDYPERCFMAGNQLGKSFTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVV 126 Query: 103 AWAVGITGDSTRKVLQKELFGTPIGKDTNLLGTGVIPRDAIVI-DTIERDGNKLQIVQIK 161 W G T ++ K Q+ L G ++ + G G IP++ I+ N + + +K Sbjct: 127 FWVGGETNETVTKTTQRILCGRI--EENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVK 184 Query: 162 HQNERGEFDGLSTLEFRSTQQGEHTLMGATVDYIWLDEEDPYESMAIFAQCVTRTLTTKG 221 H G DG+S F+ QG G T+ +W DEE PY +I+ + +TRT Sbjct: 185 HHTPEGVEDGISICYFKPYSQGRARWQGDTIHGVWFDEEPPY---SIYGEGLTRTNKYGQ 241 Query: 222 LVTITATPENGLTELVDKFMKGEGDESTGSLYFQNASWWDAHADLGGHITDQDIKDMTEG 281 +T TP G++++V KF+K + S N + +DA H TD+ + + Sbjct: 242 FSILTFTPLMGMSDVVTKFLKN----PSKSQKVVNMTIYDAE-----HYTDEQKEQIIAS 292 Query: 282 IPAWQLEMRSKGMPLLGSGLIYDVSDDTIKCEPFEIPDTWKRVCAIDIGIDHPTAAVWTA 341 P + E R++G+P +GSG I+ + ++TIKC+PFE PD + + A D G +HP A + Sbjct: 293 YPEHEREARARGIPTMGSGRIFQIPEETIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLW 352 Query: 342 YDANTDTIYVYDSYKEGGFTPVYHAPAINGRGQWIPVILPHDADNTEKGSGSSVAQFYKN 401 +D + D Y+ +K+ T V A+ IPV PHD EKG G + Y + Sbjct: 353 WDKDADVFYLARVWKKSENTAVQAWGAVKSWANKIPVAWPHDGHQHEKGGGEQLKTQYAD 412 Query: 402 AGVNVQSETFYNKIGMDGKKNFFVEPGITDIRERMMSGRFKVFNTAANAKLFEEKARYHR 461 AG ++ E + DG + VE GI ++R+ M+ GRFKVFNT FEE YHR Sbjct: 413 AGFSMLPE---HATFPDGGNS--VESGIGELRDLMLEGRFKVFNTC--EPFFEEFRLYHR 465 Query: 462 -KVGKIIKEHDDLMDAMRY 479 + GKI+K +DD++DA RY Sbjct: 466 DENGKIVKTNDDVLDATRY 484 
>gi|155|lcl|protein:vir:77595 Length: 499 # NCBI annotation: terminase large subunit # Family: family:all:1730 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063734;genbank:gi:51236724;genbank:GeneID :2944239 Length = 499 Score = 229 bits (584), Expect = 6e-62, Method: Compositional matrix adjust. Identities = 159/480 (33%), Positives = 248/480 (51%), Gaps = 35/480 (7%) Query: 14 NNLSVDEKKALLNMLKERDQWRKYNRILSFKAYDFQKKFYAAGLKHRFRFLCAANRVGKS 73 +NLS +E+ LL +L+E + +R + + F Y Q++F AG + R A N++GKS Sbjct: 8 DNLSDEEQIELLELLEEEENYRNTHLLYEFAPYSKQREFIDAGHDYPERCFMAGNQLGKS 67 Query: 74 YSEAYEFACHVTGRYPTW------------WTGYKFKRPILAWAVGITGDSTRKVLQKEL 121 ++ A E A H+TGRYP W G +F P++ W G T ++ K Q+ L Sbjct: 68 FTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWIGGETNETVTKTTQRIL 127 Query: 122 FGTPIGKDTNLLGTGVIPRDAIVI-DTIERDGNKLQIVQIKHQNERGEFDGLSTLEFRST 180 G ++ + G G IP++ I+ N + + +KH G DG+S F+ Sbjct: 128 CGRI--EENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKHHTADGVEDGISICYFKPY 185 Query: 181 QQGEHTLMGATVDYIWLDEEDPYESMAIFAQCVTRTLTTKGLVTITATPENGLTELVDKF 240 QG G T+ +W DEE PY +I+ + +TRT +T TP G++++V KF Sbjct: 186 SQGRARWQGDTIHGVWFDEEPPY---SIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTKF 242 Query: 241 MKGEGDESTGSLYFQNASWWDAHADLGGHITDQDIKDMTEGIPAWQLEMRSKGMPLLGSG 300 +K + S N + +DA H TD+ + + P + E R++G+P +GSG Sbjct: 243 LKN----PSKSQKVVNMTIYDAE-----HYTDEQKEQIIASYPEHEREARARGIPTMGSG 293 Query: 301 LIYDVSDDTIKCEPFEIPDTWKRVCAIDIGIDHPTAAVWTAYDANTDTIYVYDSYKEGGF 360 I+ + ++TIKC+PFE PD + + A D G +HP A + +D + D Y+ +K+ Sbjct: 294 RIFQIPEETIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSEN 353 Query: 361 TPVYHAPAINGRGQWIPVILPHDADNTEKGSGSSVAQFYKNAGVNVQSETFYNKIGMDGK 420 T V A+ IPV PHD EKG G + Y +AG ++ + + DG Sbjct: 354 TAVQAWGAVKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPD---HATFPDGG 410 Query: 421 KNFFVEPGITDIRERMMSGRFKVFNTAANAKLFEEKARYHR-KVGKIIKEHDDLMDAMRY 479 + VE GI+++R+ M+ GRFKVFNT FEE YHR + GKI+K +DD++DA RY Sbjct: 411 NS--VESGISELRDLMLEGRFKVFNTC--EPFFEEFRLYHRDENGKIVKTNDDVLDATRY 466 >gi|3300|lcl|protein:vir:100942 Length: 
499 # NCBI annotation: Gp2 # Family: family:all:1730 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006405;genbank:gi:46358697;genbank:GeneID :2777092 Length = 499 Score = 229 bits (583), Expect = 7e-62, Method: Compositional matrix adjust. Identities = 158/480 (32%), Positives = 247/480 (51%), Gaps = 35/480 (7%) Query: 14 NNLSVDEKKALLNMLKERDQWRKYNRILSFKAYDFQKKFYAAGLKHRFRFLCAANRVGKS 73 +NLS +E+ LL +L+E + +R + + F Y Q++F AG + R A N++GKS Sbjct: 8 DNLSDEEQIELLELLEEEENYRNTHLLYEFTPYSKQREFIDAGHDYPERCFMAGNQLGKS 67 Query: 74 YSEAYEFACHVTGRYPTW------------WTGYKFKRPILAWAVGITGDSTRKVLQKEL 121 ++ A E A H+TGRYP W G +F P++ W G T ++ K Q+ L Sbjct: 68 FTGAAEVAFHLTGRYPGTKGYPADGKYGGEWKGKRFYEPVVFWVGGETNETVTKTTQRIL 127 Query: 122 FGTPIGKDTNLLGTGVIPRDAIVI-DTIERDGNKLQIVQIKHQNERGEFDGLSTLEFRST 180 G ++ + G G IP++ I+ N + + +KH G DG+S F+ Sbjct: 128 CGRI--EENDEPGYGSIPKEDIISWKKSPFFPNLVDHLLVKHHTPEGVEDGISICYFKPY 185 Query: 181 QQGEHTLMGATVDYIWLDEEDPYESMAIFAQCVTRTLTTKGLVTITATPENGLTELVDKF 240 QG G T+ +W DEE PY +I+ + +TRT +T TP G++++V KF Sbjct: 186 SQGRARWQGDTIHGVWFDEEPPY---SIYGEGLTRTNKYGQFSILTFTPLMGMSDVVTKF 242 Query: 241 MKGEGDESTGSLYFQNASWWDAHADLGGHITDQDIKDMTEGIPAWQLEMRSKGMPLLGSG 300 +K + S N + +DA H TD+ + + P + E R++G+P +GSG Sbjct: 243 LKN----PSKSQKVVNMTIYDAE-----HYTDEQKEQIIASYPEHEREARARGIPTMGSG 293 Query: 301 LIYDVSDDTIKCEPFEIPDTWKRVCAIDIGIDHPTAAVWTAYDANTDTIYVYDSYKEGGF 360 I+ + ++TIKC+PFE PD + + A D G +HP A + +D + D Y+ +K+ Sbjct: 294 RIFQIPEETIKCQPFECPDHFYVIDAQDFGWNHPQAHIQLWWDKDADVFYLARVWKKSEN 353 Query: 361 TPVYHAPAINGRGQWIPVILPHDADNTEKGSGSSVAQFYKNAGVNVQSETFYNKIGMDGK 420 T V A+ IPV PHD EKG G + Y +AG ++ + + DG Sbjct: 354 TAVQAWGAVKSWANKIPVAWPHDGHQHEKGGGEQLKTQYADAGFSMLPD---HATFPDGG 410 Query: 421 KNFFVEPGITDIRERMMSGRFKVFNTAANAKLFEEKARYHR-KVGKIIKEHDDLMDAMRY 479 + VE GI+++R+ M+ GRFK FNT FEE YHR + GKI+K +DD++DA RY Sbjct: 411 NS--VESGISELRDLMLEGRFKAFNTC--EPFFEEFRLYHRDENGKIVKTNDDVLDATRY 466 >gi|1219|lcl|protein:vir:105519 Length: 475 # NCBI annotation: phage terminase large subunit # 
Family: family:all:1730 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516188;genbank:gi:89885991;genbank:GeneID :3964379 Length = 475 Score = 223 bits (568), Expect = 4e-60, Method: Compositional matrix adjust. Identities = 155/470 (32%), Positives = 235/470 (50%), Gaps = 44/470 (9%) Query: 20 EKKALLNMLKERDQWRKYNRILSFKAYDFQKKFYAAGLKHRFRFLCAANRVGKSYSEAYE 79 ++ +N + R+ +Y + YD+Q+KF ++ L AANRVGK+ + Y Sbjct: 11 QQVRFINEQRRREHACRYRHYYGTR-YDWQRKFIGLSAEYAQVALIAANRVGKTDTATYV 69 Query: 80 FACHVTGRYPTWWTGYKFKRPILAWAVGITGDSTRKVLQKELFGTPIGKDTNLLGTGVIP 139 A H G YP W+GY+F + W +G +G+ R +LQ L G K N G+IP Sbjct: 70 DAVHALGDYPEAWSGYRFSHAPVIWCLGYSGEKCRDLLQTPLLGR---KTDNGWQGGLIP 126 Query: 140 RDAIVIDTIERDG--NKLQIVQIKHQNERGEFDGLSTLEFRSTQQGEHTLMGATVDYIWL 197 + I DT G N ++ I+H + LS ++F S QG+H LMG VD+ + Sbjct: 127 GERIA-DTEAMTGTTNAVRTAYIRHVSGL-----LSKIQFWSYSQGQHALMGDCVDWFHI 180 Query: 198 DEE--DPYESMAIFAQCVTRTLT----TKGLVTITATPENGLTELVDKFMKGEGDESTGS 251 DEE DP I+ Q +TRT T G +T TPENG T+LV FM D + + Sbjct: 181 DEEPRDP----TIYPQVLTRTATGDRGKGGRGILTFTPENGRTDLVIGFM----DNPSPA 232 Query: 252 LYFQNASWWDAHADLGGHITDQDIKDMTEGIPAWQLEMRSKGMPLLGSGLIYDVSDDTIK 311 N W DA H++ + D+ PA Q +MR+KG+P+LG G IYD+ +D I Sbjct: 233 QTCINVGWDDA-----PHLSQKVKNDLLASFPAHQRDMRTKGIPMLGHGRIYDLGEDFIT 287 Query: 312 CEPFEIPDTWKRVCAIDIGIDHPTAAVWTAYDANTDTIYVYDSYKEGGFTPVYHAPAING 371 C+PF +P W + +D G DHP A + +D + YV +YK +P A++ Sbjct: 288 CDPFPVPAHWLVIDGMDFGWDHPQAHIQLVWDNENEMFYVTRAYKARQVSPAEAYSAVSI 347 Query: 372 RGQWIPVILPHDADNTEKGSGSSVAQFYKNAGVNVQSETFYNKIGMDGKKNFFVEPGITD 431 + +P P D TEKGSG +Y +AG + + DG ++ + Sbjct: 348 WAENVPTAWPSDGLMTEKGSGIQQKTYYDDAGFCMLRDP---AQWPDGSRS-------VE 397 Query: 432 IRERMMSGRFKVFNTAANAKLFEEKARYHR-KVGKIIKEHDDLMDAMRYS 480 + + M G+FKVF+ + F+E YHR + +I+K DD++DA+RY+ Sbjct: 398 LHDLMRRGKFKVFSGLRD--FFDEYNFYHRDEKSRIVKMRDDILDAVRYA 445 >gi|2931|lcl|protein:vir:105460 Length: 409 # NCBI annotation: putative phage terminase large subunit B # Family: family:all:54 # MgeID: mge:1502 # MgeName: KC5a # 
Cross-refs: genbank:acc:YP_529870;genbank:gi:90592610;genbank:GeneID :3974524 Length = 409 Score = 49.7 bits (117), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 51/204 (25%), Positives = 90/204 (44%), Gaps = 27/204 (13%) Query: 293 GMPLLGSGLIY-DVSDDTIKCEPFEIPDTWKRVCAIDIGIDHPTAAVWTAYDANTDTIYV 351 G + G G++Y D + DT+ +PD +D G +HP + D + +T + Sbjct: 219 GQWVTGDGIVYQDFNKDTMVIPKNRVPDGLDYYVGVDWGYEHPNPIILLGDDKDGNTYVL 278 Query: 352 YDSYKEGGFTPVYHAPAINGRGQWIPVILPHDADNTEKGSGSSVAQFYKNAGVNVQSETF 411 D ++ F W+ V A N + G ++ + +A + +E Sbjct: 279 EDYTQKHKFI-----------NYWVKV-----AQNLQTRFGRNLIFYADSARPDNVNEFQ 322 Query: 412 YNKIG-MDGKKNFFVEPGITDIRERMMSGRFKVFNTAANAKLFEEKARY--HRKVGKIIK 468 N + ++ KN V PGI + +M G+F V +TA++ L +E +Y G +K Sbjct: 323 SNGLNCINANKN--VLPGIECVARKMREGKFYVVDTASSG-LLDEIYQYAWDESTGLPLK 379 Query: 469 E----HDDLMDAMRYSACSVTHRG 488 E H+D +DA+RY+ S +G Sbjct: 380 ENDVRHNDRLDAIRYAIYSRNKKG 403 >gi|2172|lcl|protein:vir:100329 Length: 605 # NCBI annotation: terminase ATPase subunit # Family: family:all:169 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655470;genbank:gi:109289938;genbank:GeneI D:4157372 Length = 605 Score = 36.2 bits (82), Expect = 0.001, Method: Compositional matrix adjust. Identities = 29/94 (30%), Positives = 45/94 (47%), Gaps = 8/94 (8%) Query: 19 DEKKALLNMLKERDQWRKYNRILSFKAYDFQKKFYAAGLKHRFRFLCAANRVGKSYSEAY 78 D KK N + E N L + +QKK++ AGL HR R + + ++G +Y A+ Sbjct: 133 DRKKPEQNAISEEQAELLINGFLD-GMFHYQKKWHEAGLTHRIRNILKSRQIGATYYFAH 191 Query: 79 EFACH--VTGRYPTWWTGYK-----FKRPILAWA 105 E VTGR + + K F+ I+A+A Sbjct: 192 EALVDALVTGRNQIFISASKKQALQFRAYIVAYA 225 >gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: terminase, large subunit # Family: family:all:147 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006983;genbank:gi:46401884;genbank:GeneID :2777679 Length = 438 Score = 36.2 bits (82), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 18/56 (32%), Positives = 29/56 (51%) Query: 315 FEIPDTWKRVCAIDIGIDHPTAAVWTAYDANTDTIYVYDSYKEGGFTPVYHAPAIN 370 F+ + ++ + ID+G PTA + Y +TDT YV + Y++ T HA I Sbjct: 273 FKDDEAFETLLGIDVGYRDPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYIQ 328 Score = 25.4 bits (54), Expect = 1.6, Method: Compositional matrix adjust. Identities = 9/17 (52%), Positives = 12/17 (70%) Query: 58 KHRFRFLCAANRVGKSY 74 +HRF C + RVGKS+ Sbjct: 53 RHRFVTACVSRRVGKSF 69 >gi|17332|lcl|protein:vir:2014 Length: 590 # NCBI annotation: gpP # Family: family:all:169 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046758;genbank:gi:9630329;genbank:GeneID: 1261531 Length = 590 Score = 33.5 bits (75), Expect = 0.005, Method: Compositional matrix adjust. Identities = 24/87 (27%), Positives = 44/87 (50%), Gaps = 11/87 (12%) Query: 36 KYNRILSFKAYDFQKKFYAAGLKHRFRFLCAANRVGKSYSEAYEFACHV--TGRYPTWWT 93 K +I +++D+Q +Y AGL+HR R + + ++G ++ + E TG + + Sbjct: 136 KLEQIFFEQSFDYQLHWYRAGLEHRIRDILKSRQIGATFYFSREALLRALKTGHNQIFLS 195 Query: 94 -----GYKFKRPILAWA----VGITGD 111 Y F+ I+A+A V +TGD Sbjct: 196 ASKTQAYVFREYIIAFARLVDVDLTGD 222 >gi|18685|lcl|protein:vir:5692 Length: 590 # NCBI annotation: gpP # Family: family:all:169 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839851;genbank:gi:30065706;genbank:GeneID :1260600 Length = 590 Score = 32.0 bits (71), Expect = 0.017, Method: Compositional matrix adjust. 
Identities = 23/87 (26%), Positives = 44/87 (50%), Gaps = 11/87 (12%) Query: 36 KYNRILSFKAYDFQKKFYAAGLKHRFRFLCAANRVGKSYSEAYEFACHV--TGRYPTWWT 93 K +I +++++Q +Y AGL+HR R + + ++G ++ + E TG + + Sbjct: 136 KLEQIFFEQSFEYQLHWYRAGLEHRIRDILKSRQIGATFYFSREALLRALKTGHNQIFLS 195 Query: 94 -----GYKFKRPILAWA----VGITGD 111 Y F+ I+A+A V +TGD Sbjct: 196 ASKTQAYVFREYIIAFARLVDVDLTGD 222 >gi|16979|lcl|protein:vir:6059 Length: 590 # NCBI annotation: gpP # Family: family:all:169 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878200;genbank:gi:33438899;genbank:GeneID :1457734 Length = 590 Score = 32.0 bits (71), Expect = 0.017, Method: Compositional matrix adjust. Identities = 23/87 (26%), Positives = 44/87 (50%), Gaps = 11/87 (12%) Query: 36 KYNRILSFKAYDFQKKFYAAGLKHRFRFLCAANRVGKSYSEAYEFACHV--TGRYPTWWT 93 K +I +++++Q +Y AGL+HR R + + ++G ++ + E TG + + Sbjct: 136 KLEQIFFEQSFEYQLHWYRAGLEHRIRDILKSRQIGATFYFSREALLRALKTGHNQIFLS 195 Query: 94 -----GYKFKRPILAWA----VGITGD 111 Y F+ I+A+A V +TGD Sbjct: 196 ASKTQAYVFREYIIAFARLVDVDLTGD 222 >gi|13020|lcl|protein:vir:80960 Length: 443 # NCBI annotation: TerL # Family: family:all:54 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468388;genbank:gi:157324962;genbank:Ge neID:5601395 Length = 443 Score = 32.0 bits (71), Expect = 0.019, Method: Compositional matrix adjust. 
Identities = 25/106 (23%), Positives = 40/106 (37%), Gaps = 18/106 (16%) Query: 381 PHDADNTEKGSGSSVAQFYKNAGVNVQSETFYNKIGMDGKKNFFVEPGITDIRERMMSGR 440 P D + G Q+YK+ GV++ K+ M + + + + GR Sbjct: 336 PVDMQTVDSAEGGLRNQYYKDYGVSLHPVAKGKKVDM-----------VDFVCDLLAQGR 384 Query: 441 FKVFNTAANAKLFEEKARYHRKVG-------KIIKEHDDLMDAMRY 479 F + N EE +Y V ++IKE D DA +Y Sbjct: 385 FYYLDIPENQIFIEEHRKYQWDVKTVNTDKPEVIKEDDHTCDAFQY 430 >gi|12899|lcl|protein:vir:80432 Length: 510 # NCBI annotation: BcepGomrgp04 # Family: family:all:144 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210224;genbank:gi:146329916;genbank:Ge neID:5123541 Length = 510 Score = 30.0 bits (66), Expect = 0.061, Method: Compositional matrix adjust. Identities = 30/95 (31%), Positives = 46/95 (48%), Gaps = 12/95 (12%) Query: 296 LLGSGLIYDVSDDTIKCEP-FEIPDTWKRVCAIDIGIDHPTAAVWTAYDANTDTIYVYDS 354 ++ G I D+ + + +P F IP +W+ + D G HP W A +AN DT + + Sbjct: 270 VVAGGAIDDLWREEVHVKPRFNIPASWRVDRSFDWGSTHPFYVGWWA-EANGDTATITNP 328 Query: 355 YKEGGFTPVYHAPAINGRGQWIPVILPHDADNTEK 389 +G T Y PA RG +IL H+ TE+ Sbjct: 329 --DG--TETYWTPA---RGS---LILFHEWYGTEE 353 >gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: phage terminase large subunit # Family: family:all:147 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468054;genbank:gi:157265496;genbank:Ge neID:5600564 Length = 485 Score = 30.0 bits (66), Expect = 0.069, Method: Compositional matrix adjust. 
Identities = 22/74 (29%), Positives = 40/74 (54%), Gaps = 4/74 (5%) Query: 172 LSTLEFR-STQQGEHTLMGATVDYIWLDEEDPYESMAIFAQCVTRTLTTK-GLVTITATP 229 ++T EFR + L GAT+D++ LDE +++++ + TL+ + G I +TP Sbjct: 122 VATSEFRGKSADRPDNLRGATLDFVILDEA-AMIPFSVWSEAIEPTLSVRDGWALIISTP 180 Query: 230 ENGLTELVDKFMKG 243 + GL + F+ G Sbjct: 181 K-GLNWFYEFFLMG 193 >gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: terminase large subunit # Family: family:all:147 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467938;genbank:gi:157265379;genbank:Ge neID:5600506 Length = 485 Score = 30.0 bits (66), Expect = 0.069, Method: Compositional matrix adjust. Identities = 22/74 (29%), Positives = 40/74 (54%), Gaps = 4/74 (5%) Query: 172 LSTLEFR-STQQGEHTLMGATVDYIWLDEEDPYESMAIFAQCVTRTLTTK-GLVTITATP 229 ++T EFR + L GAT+D++ LDE +++++ + TL+ + G I +TP Sbjct: 122 VATSEFRGKSADRPDNLRGATLDFVILDEA-AMIPFSVWSEAIEPTLSVRDGWALIISTP 180 Query: 230 ENGLTELVDKFMKG 243 + GL + F+ G Sbjct: 181 K-GLNWFYEFFLMG 193 >gi|4395|lcl|protein:vir:98555 Length: 589 # NCBI annotation: gp3 # Family: family:all:169 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958058;genbank:gi:41057355;genbank:GeneID :2744226 Length = 589 Score = 29.6 bits (65), Expect = 0.085, Method: Compositional matrix adjust. Identities = 11/39 (28%), Positives = 24/39 (61%) Query: 36 KYNRILSFKAYDFQKKFYAAGLKHRFRFLCAANRVGKSY 74 K I +++++Q ++Y AGL HR R + + ++G ++ Sbjct: 136 KLEEIFFDQSFEYQLQWYRAGLAHRIRDILKSRQIGATF 174 >gi|9289|lcl|protein:vir:97165 Length: 431 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239721;genbank:gi:66394878;genbank:GeneID :5130898 Length = 431 Score = 29.6 bits (65), Expect = 0.094, Method: Compositional matrix adjust. 
Identities = 21/71 (29%), Positives = 32/71 (45%), Gaps = 10/71 (14%) Query: 297 LGSGLIYDVSDDTIKCEPFEIPDTWKRVCAIDIGIDH-----PTAAVWTAYDANTDTIYV 351 + GL++D K E F+ + +KR I G+D PT V T D +++ Sbjct: 232 VAEGLVFD----NFKVEDFDWFEEFKRTQEITHGMDFGFSQDPTTVVSTVVDLKNKKLFI 287 Query: 352 YDS-YKEGGFT 361 YD YK+ T Sbjct: 288 YDEHYKKAMLT 298 >gi|8126|lcl|protein:vir:102661 Length: 568 # NCBI annotation: terminase # Family: family:all:147 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024418;genbank:gi:48696639;genbank:GeneID :2948128 Length = 568 Score = 28.9 bits (63), Expect = 0.16, Method: Compositional matrix adjust. Identities = 12/26 (46%), Positives = 16/26 (61%) Query: 378 VILPHDADNTEKGSGSSVAQFYKNAG 403 +ILPHDADNTE G + ++ G Sbjct: 431 IILPHDADNTEVSHGKTRKEWVLEEG 456 >gi|3326|lcl|protein:vir:94525 Length: 440 # NCBI annotation: putative large subunit terminase # Family: family:all:54 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223885;genbank:gi:62327097;genbank:GeneID :5075541 Length = 440 Score = 28.5 bits (62), Expect = 0.20, Method: Compositional matrix adjust. Identities = 18/60 (30%), Positives = 29/60 (48%), Gaps = 2/60 (3%) Query: 297 LGSGLIYDVSDDTIKCEPFEIPDTWKRVCAIDIGIDH-PTAAVWTAYDANTDTIYVYDSY 355 + GL++D + EI + K V +D G H PTA + A D + +Y+YD + Sbjct: 243 IAEGLVFDGLFEQRDFSYDEIANLPKSV-GLDFGFKHDPTAGEFIAVDQDNRIVYIYDEF 301 >gi|6719|lcl|protein:vir:99411 Length: 447 # NCBI annotation: hypothetical protein # Family: family:all:32729 # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919076;genbank:gi:119757034;genbank:GeneI D:4606064 Length = 447 Score = 28.5 bits (62), Expect = 0.22, Method: Compositional matrix adjust. 
Identities = 21/74 (28%), Positives = 31/74 (41%), Gaps = 3/74 (4%) Query: 425 VEPGITDIRERMM---SGRFKVFNTAANAKLFEEKARYHRKVGKIIKEHDDLMDAMRYSA 481 ++ GI +R R+ GR V T +L +E Y K D +DA+RY+ Sbjct: 369 LDGGIDHVRSRLAMDDEGRPGVLVTDRCGELIQEFLSYKEDHVGTSKAQDHALDALRYAL 428 Query: 482 CSVTHRGRSKHDVS 495 + T R D S Sbjct: 429 FTHTPRDTGDSDSS 442 >gi|5072|lcl|protein:vir:95130 Length: 531 # NCBI annotation: hypothetical protein ORF006 # Family: family:all:144 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293413;genbank:gi:148912834;genbank:Ge neID:5228205 Length = 531 Score = 27.7 bits (60), Expect = 0.32, Method: Compositional matrix adjust. Identities = 29/108 (26%), Positives = 43/108 (39%), Gaps = 17/108 (15%) Query: 300 GLIYDVSDDTIKCEP-FEIPDTWKRVCAIDIGIDHPTAAVWTAYDANTDTIYVYDSYKEG 358 G I D+ I P F IP +W+ D G HP + W A T+ V E Sbjct: 290 GAIDDLWQSHIHVVPRFVIPPSWRIDRTYDDGSSHPFSVGWWAEADGTEATIVLSDGTEF 349 Query: 359 GFTPV---------YHAPAINGRGQWIPVILPHDADNTEKGSGSSVAQ 397 F P ++ A + +G++IP + K S S++AQ Sbjct: 350 TFCPQPGSLIQLFEWYGCAKDEKGEYIP-------NKGLKLSASNIAQ 390 >gi|6886|lcl|protein:vir:106551 Length: 424 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958579;genbank:gi:41179257;genbank:GeneID :2717087 Length = 424 Score = 27.3 bits (59), Expect = 0.46, Method: Compositional matrix adjust. 
Identities = 21/70 (30%), Positives = 30/70 (42%), Gaps = 9/70 (12%) Query: 425 VEPGITDIRERMMSGRFKVFNTAANAKLFEEKARY-------HRKVGKIIKEHDDLMDAM 477 V GI + M G+ K N LF+E A Y K +K+HD DAM Sbjct: 342 VLDGIRVTQTAMNEGKIKFSMNCPN--LFKELASYVWDDKAAEHGEDKPVKQHDHACDAM 399 Query: 478 RYSACSVTHR 487 RY ++ ++ Sbjct: 400 RYFVYTIIYK 409 >gi|2589|lcl|protein:vir:94084 Length: 407 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240228;genbank:gi:66395894;genbank:GeneID :5133253 Length = 407 Score = 26.9 bits (58), Expect = 0.56, Method: Compositional matrix adjust. Identities = 13/36 (36%), Positives = 22/36 (61%) Query: 456 KARYHRKVGKIIKEHDDLMDAMRYSACSVTHRGRSK 491 K +H G+ IKE DD++D++RY+ + T R + Sbjct: 369 KYVWHPTNGEPIKEFDDVLDSLRYAIYTHTKPERLR 404 >gi|3523|lcl|protein:vir:105897 Length: 407 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004370;genbank:gi:122891825;genbank:Ge neID:4712368 Length = 407 Score = 26.9 bits (58), Expect = 0.56, Method: Compositional matrix adjust. Identities = 13/36 (36%), Positives = 22/36 (61%) Query: 456 KARYHRKVGKIIKEHDDLMDAMRYSACSVTHRGRSK 491 K +H G+ IKE DD++D++RY+ + T R + Sbjct: 369 KYVWHPTNGEPIKEFDDVLDSLRYAIYTHTKPERLR 404 >gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214662;genbank:gi:61806303;genbank:GeneI D:3294591 Length = 550 Score = 26.6 bits (57), Expect = 0.80, Method: Compositional matrix adjust. 
Identities = 16/46 (34%), Positives = 21/46 (45%), Gaps = 7/46 (15%) Query: 35 RKYNRILS-------FKAYDFQKKFYAAGLKHRFRFLCAANRVGKS 73 RKY RI+S F Y+FQ+ +HRF + GKS Sbjct: 43 RKYIRIVSLDEGVIPFDMYNFQEDMVTKFHQHRFNIAKLPRQSGKS 88 >gi|4548|lcl|protein:vir:94970 Length: 473 # NCBI annotation: putative phage terminase large subunit # Family: family:all:144 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239275;genbank:gi:66392134;genbank:GeneID :5076615 Length = 473 Score = 25.8 bits (55), Expect = 1.2, Method: Compositional matrix adjust. Identities = 24/82 (29%), Positives = 33/82 (40%), Gaps = 12/82 (14%) Query: 261 DAHADLGGHITDQDIKDMTEGIPAWQLEMRSKGMPLLGSGLIYDVSDDTIKC-EPFEIPD 319 D A L G ++ M EG W++ + +G I D+ I PF+IP Sbjct: 220 DYRARLKGMGDSATVQAMLEG--DWEV---------VSAGGIADLWRSKIHVVHPFKIPH 268 Query: 320 TWKRVCAIDIGIDHPTAAVWTA 341 TWK D G P A + A Sbjct: 269 TWKIDRGYDYGSSKPAAYLLFA 290 >gi|1431|lcl|protein:vir:93626 Length: 530 # NCBI annotation: Bcep22gp49 # Family: family:all:147 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944278;genbank:gi:38640355;genbank:GeneID :2658275 Length = 530 Score = 25.4 bits (54), Expect = 1.5, Method: Compositional matrix adjust. Identities = 15/42 (35%), Positives = 22/42 (52%), Gaps = 2/42 (4%) Query: 355 YKEGGFTPV-YHAPAINGRGQ-WIPVILPHDADNTEKGSGSS 394 +KEG P Y + G G W + LPHDAD+ +G ++ Sbjct: 375 FKEGWGEPYSYFVKWLQGLGLVWDTMFLPHDADHVRQGQTTN 416 >gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T4-like DNA packaging large subunit terminase # Family: family:all:147 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214360;genbank:gi:61806000;genbank:GeneI D:3294466 Length = 547 Score = 25.4 bits (54), Expect = 1.8, Method: Compositional matrix adjust. 
Identities = 15/46 (32%), Positives = 21/46 (45%), Gaps = 7/46 (15%) Query: 35 RKYNRILS-------FKAYDFQKKFYAAGLKHRFRFLCAANRVGKS 73 R Y +I+S F YDFQ+K ++RF + GKS Sbjct: 40 RNYIKIVSLDEGLVPFNMYDFQEKLITRFHENRFNICKMPRQTGKS 85 >gi|5321|lcl|protein:vir:99534 Length: 422 # NCBI annotation: putative protein # Family: family:all:54 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958532;genbank:gi:41179314;genbank:GeneID :2717172 Length = 422 Score = 25.4 bits (54), Expect = 1.8, Method: Compositional matrix adjust. Identities = 21/66 (31%), Positives = 29/66 (43%), Gaps = 4/66 (6%) Query: 327 IDIG-IDHPTAAVWTAYDANTDTIYVYDSYKEGGFTPVYHAPAINGRGQWIPVILPHDAD 385 +D G I+ P+A + D T+YV D + + G A I G VI AD Sbjct: 262 LDYGFINDPSAFMHIKLDMRNKTLYVMDEFVKKGLLNNQLAQVIKDMGYSKEVIT---AD 318 Query: 386 NTEKGS 391 + EK S Sbjct: 319 SAEKKS 324 >gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: terminase large subunit gp17 # Family: family:all:147 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195134;genbank:gi:58532911;uniprot:Q5GQN 8;genbank:GeneID:3260486 Length = 548 Score = 24.6 bits (52), Expect = 2.8, Method: Compositional matrix adjust. Identities = 11/34 (32%), Positives = 18/34 (52%) Query: 40 ILSFKAYDFQKKFYAAGLKHRFRFLCAANRVGKS 73 ++ FK +DFQ++ K+RF + GKS Sbjct: 54 LVPFKMWDFQEELIMKFHKNRFNIAKLPRQTGKS 87 >gi|22578|lcl|protein:vir:95547 Length: 605 # NCBI annotation: ORF010 # Family: family:all:1430 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240892;genbank:gi:66394959;genbank:GeneID :5132488 Length = 605 Score = 24.6 bits (52), Expect = 2.8, Method: Compositional matrix adjust. 
Identities = 13/35 (37%), Positives = 21/35 (60%), Gaps = 1/35 (2%) Query: 173 STLEFRSTQQGEHTLMGATVDYIWLDEEDPYESMA 207 S+L FR++ + T+ G +DY+ LDE D +A Sbjct: 156 SSLFFRTSSKAS-TVEGVDIDYLSLDEYDRVNLLA 189 >gi|9399|lcl|protein:vir:99333 Length: 605 # NCBI annotation: hypothetical protein # Family: family:all:1430 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024465;genbank:gi:48696425;genbank:GeneID :2948061 Length = 605 Score = 24.6 bits (52), Expect = 2.8, Method: Compositional matrix adjust. Identities = 13/35 (37%), Positives = 21/35 (60%), Gaps = 1/35 (2%) Query: 173 STLEFRSTQQGEHTLMGATVDYIWLDEEDPYESMA 207 S+L FR++ + T+ G +DY+ LDE D +A Sbjct: 156 SSLFFRTSSKAS-TVEGVDIDYLSLDEYDRVNLLA 189 >gi|19292|lcl|protein:vir:4781 Length: 436 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150161;swissprot:trembl:q94m50;genbank:gi :15088772;goa:Q94M50;uniprot:Q94M50;genbank:GeneID:95597 6 Length = 436 Score = 23.9 bits (50), Expect = 4.5, Method: Compositional matrix adjust. Identities = 18/58 (31%), Positives = 23/58 (39%), Gaps = 7/58 (12%) Query: 429 ITDIRERMMSGRFKVFNTAANAKLFEEKA--RYHRKVGK-----IIKEHDDLMDAMRY 479 I + + GRF NT N EE R+ K K +IKE D D +Y Sbjct: 363 IDSFQSLLAQGRFYYLNTENNKIFIEEHKMYRWDEKTIKSDNPSVIKEDDHTCDTTQY 420 >gi|20547|lcl|protein:vir:105155 Length: 580 # NCBI annotation: conserved phage-related protein # Family: family:all:4926 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398598;genbank:gi:80159854;genbank:GeneID :3772993 Length = 580 Score = 23.5 bits (49), Expect = 5.5, Method: Compositional matrix adjust. 
Identities = 12/42 (28%), Positives = 20/42 (47%) Query: 40 ILSFKAYDFQKKFYAAGLKHRFRFLCAANRVGKSYSEAYEFA 81 +L K Y FQ+ A ++++ L +GKS+ A F Sbjct: 61 VLGLKLYLFQRLILRAMARNQYVMLICCRGLGKSWLSAVFFV 102 >gi|13692|lcl|protein:vir:4897 Length: 411 # NCBI annotation: gp411 # Family: family:all:54 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056675;genbank:gi:9635008;genbank:GeneID: 1262679 Length = 411 Score = 23.5 bits (49), Expect = 6.2, Method: Compositional matrix adjust. Identities = 12/37 (32%), Positives = 21/37 (56%), Gaps = 4/37 (10%) Query: 451 KLFEEKARYHRKVGKI----IKEHDDLMDAMRYSACS 483 + F+E +Y K +KE DD++D++RY+ S Sbjct: 355 RFFDEIYQYRWKENSTKDEPLKEFDDVLDSVRYAIYS 391 >gi|16780|lcl|protein:vir:2731 Length: 411 # NCBI annotation: hypothetical protein # Family: family:all:54 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695104;genbank:gi:23455873;genbank:GeneID :955606 Length = 411 Score = 23.5 bits (49), Expect = 6.3, Method: Compositional matrix adjust. Identities = 12/37 (32%), Positives = 21/37 (56%), Gaps = 4/37 (10%) Query: 451 KLFEEKARYHRKVGKI----IKEHDDLMDAMRYSACS 483 + F+E +Y K +KE DD++D++RY+ S Sbjct: 355 RFFDEIYQYRWKENSTKDEPLKEFDDVLDSVRYAIYS 391 >gi|8015|lcl|protein:vir:96495 Length: 195 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238487;genbank:gi:66391763;genbank:GeneID :5176917 Length = 195 Score = 23.5 bits (49), Expect = 6.7, Method: Compositional matrix adjust. 
Identities = 12/37 (32%), Positives = 21/37 (56%), Gaps = 4/37 (10%) Query: 451 KLFEEKARYHRKVGKI----IKEHDDLMDAMRYSACS 483 + F+E +Y K +KE DD++D++RY+ S Sbjct: 139 RFFDEIYQYRWKENSTKDEPLKEFDDVLDSVRYAIYS 175 >gi|1165|lcl|protein:vir:102941 Length: 421 # NCBI annotation: terminase, large subunit # Family: family:all:54 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945278;genbank:gi:39653713;goa:Q708N4;int erpro:IPR004921;interpro:IPR006437;uniprot:Q708N4;genban k:GeneID:2672855 Length = 421 Score = 23.1 bits (48), Expect = 7.2, Method: Compositional matrix adjust. Identities = 14/36 (38%), Positives = 23/36 (63%), Gaps = 1/36 (2%) Query: 449 NAKLFEEKARYHRKVGKIIKEHDDLMDAMRYSACSV 484 +A +++EKA + + K IK+ D MDA+RY +V Sbjct: 381 HAYVWDEKASANGE-DKPIKQFDHAMDALRYFCYTV 415 >gi|19929|lcl|protein:vir:4852 Length: 369 # NCBI annotation: putative terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049392;genbank:gi:9632420;genbank:GeneID: 1258503 Length = 369 Score = 23.1 bits (48), Expect = 8.5, Method: Compositional matrix adjust. 
Identities = 10/48 (20%), Positives = 18/48 (37%) Query: 57 LKHRFRFLCAANRVGKSYSEAYEFACHVTGRYPTWWTGYKFKRPILAW 104 +K ++ + Y Y F + PTW + + +LAW Sbjct: 310 MKSTYKIDVVDAIIDAFYDGMYAFEDYAITNNPTWKVEHMSQEAVLAW 357

  Database: capsid_neck_tail
    Posted date: Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database: 514

Lambda     K        H
   0.318    0.135    0.414

Lambda     K        H
   0.267    0.0410   0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 253,414
Number of Sequences: 514
Number of extensions: 12653
Number of successful extensions: 102
Number of sequences better than 100.0: 43
Number of HSP's better than 100.0 without gapping: 29
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 40
Number of HSP's gapped (non-prelim): 45
length of query: 513
length of database: 206,069
effective HSP length: 76
effective length of query: 437
effective length of database: 167,005
effective search space: 72981185
effective search space used: 72981185
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 39 (19.6 bits)
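
The length and search-space figures in the statistics block are related by simple arithmetic. The following is a minimal Python sketch, not part of the BLAST output. It assumes the usual convention that the effective HSP length is subtracted once from the query and once for every database sequence, and it uses the standard Karlin-Altschul relation E = (search space) * 2^(-bit score) to estimate an E-value from a bit score. The function names are hypothetical and chosen only for illustration.

```python
def effective_lengths(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length).

    Assumption: the effective HSP length is removed once from the query
    and once from each database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db


def expect_from_bits(bit_score, search_space):
    """Estimate an E-value from a bit score: E = search_space * 2**(-bit_score)."""
    return search_space * 2.0 ** (-bit_score)


# Values copied from the report: 513-letter query, 206,069 letters in
# 514 database sequences, effective HSP length 76.
eff_query, eff_db = effective_lengths(513, 206_069, 514, 76)
search_space = eff_query * eff_db
print(eff_query, eff_db, search_space)  # 437 167005 72981185

# Rough E-value estimates for a few bit scores listed in the hit table.
for bits in (318, 232, 36.2):
    print(bits, f"{expect_from_bits(bits, search_space):.0e}")
```

The three integers reproduce the "effective length of query", "effective length of database" and "effective search space" lines of the report. The E-value estimates should agree with the hit table only approximately, because the printed bit scores are rounded and the scores were produced with compositional matrix adjustment.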