BLASTP 2.2.18 [Mar-02-2008] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= lcl|NC_019916.1_cdsid_YP_007236688.1 [gene=G168_gp02] [protein=terminase large subunit] [protein_id=YP_007236688.1] [location=550..1845] (431 letters) Database: capsid_neck_tail 514 sequences; 206,069 total letters Searching...................................................done Score E Sequences producing significant alignments: (bits) Value gi|3326|lcl|protein:vir:94525 Length: 440 # NCBI annotation: put... 632 0.0 gi|9289|lcl|protein:vir:97165 Length: 431 # NCBI annotation: ORF... 363 e-102 gi|6093|lcl|protein:vir:95761 Length: 432 # NCBI annotation: ter... 345 5e-97 gi|17691|lcl|protein:vir:9305 Length: 447 # NCBI annotation: lar... 195 8e-52 gi|7643|lcl|protein:vir:96370 Length: 447 # NCBI annotation: ORF... 195 1e-51 gi|11588|lcl|protein:vir:78831 Length: 447 # NCBI annotation: te... 195 1e-51 gi|9829|lcl|protein:vir:103950 Length: 425 # NCBI annotation: ph... 195 1e-51 gi|7301|lcl|protein:vir:96238 Length: 447 # NCBI annotation: ORF... 194 1e-51 gi|2779|lcl|protein:vir:99776 Length: 425 # NCBI annotation: lar... 194 2e-51 gi|5321|lcl|protein:vir:99534 Length: 422 # NCBI annotation: put... 190 2e-50 gi|5208|lcl|protein:vir:106641 Length: 446 # NCBI annotation: OR... 190 3e-50 gi|5891|lcl|protein:vir:107098 Length: 421 # NCBI annotation: pu... 187 2e-49 gi|15353|lcl|protein:vir:9921 Length: 425 # NCBI annotation: put... 
183 3e-48 gi|10354|lcl|protein:vir:97448 Length: 421 # NCBI annotation: OR... 175 1e-45 gi|3182|lcl|protein:vir:94499 Length: 421 # NCBI annotation: ORF... 175 1e-45 gi|7569|lcl|protein:vir:96300 Length: 421 # NCBI annotation: ORF... 174 2e-45 gi|6284|lcl|protein:vir:95890 Length: 421 # NCBI annotation: ORF... 172 7e-45 gi|7076|lcl|protein:vir:96147 Length: 419 # NCBI annotation: ORF... 172 8e-45 gi|4983|lcl|protein:vir:95058 Length: 421 # NCBI annotation: ORF... 172 8e-45 gi|10483|lcl|protein:vir:105297 Length: 446 # NCBI annotation: p... 169 5e-44 gi|18074|lcl|protein:vir:5954 Length: 422 # NCBI annotation: hyp... 155 6e-40 gi|8730|lcl|protein:vir:96840 Length: 421 # NCBI annotation: ORF... 155 9e-40 gi|11736|lcl|protein:vir:79016 Length: 417 # NCBI annotation: pu... 143 4e-36 gi|13903|lcl|protein:vir:9870 Length: 273 # NCBI annotation: put... 102 8e-24 gi|13020|lcl|protein:vir:80960 Length: 443 # NCBI annotation: Te... 100 6e-23 gi|15836|lcl|protein:vir:37 Length: 443 # NCBI annotation: putat... 97 3e-22 gi|10799|lcl|protein:vir:78055 Length: 440 # NCBI annotation: Te... 94 3e-21 gi|8380|lcl|protein:vir:96782 Length: 408 # NCBI annotation: put... 81 2e-17 gi|2931|lcl|protein:vir:105460 Length: 409 # NCBI annotation: pu... 73 5e-15 gi|19292|lcl|protein:vir:4781 Length: 436 # NCBI annotation: put... 69 1e-13 gi|655|lcl|protein:vir:1588 Length: 447 # NCBI annotation: putat... 65 2e-12 gi|1165|lcl|protein:vir:102941 Length: 421 # NCBI annotation: te... 57 6e-10 gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: t... 55 2e-09 gi|2819|lcl|protein:vir:105705 Length: 439 # NCBI annotation: gp... 54 3e-09 gi|18891|lcl|protein:vir:8847 Length: 482 # NCBI annotation: ter... 52 9e-09 gi|6775|lcl|protein:vir:96067 Length: 460 # NCBI annotation: con... 47 3e-07 gi|6886|lcl|protein:vir:106551 Length: 424 # NCBI annotation: pu... 47 4e-07 gi|5794|lcl|protein:vir:98922 Length: 453 # NCBI annotation: ter... 
47 4e-07 gi|17761|lcl|protein:vir:171 Length: 470 # NCBI annotation: term... 47 6e-07 gi|5142|lcl|protein:vir:105417 Length: 470 # NCBI annotation: ge... 46 7e-07 gi|4261|lcl|protein:vir:94806 Length: 402 # NCBI annotation: ORF... 44 3e-06 gi|14951|lcl|protein:vir:1235 Length: 405 # NCBI annotation: sim... 44 3e-06 gi|1596|lcl|protein:vir:93748 Length: 403 # NCBI annotation: ORF... 44 3e-06 gi|9935|lcl|protein:vir:97273 Length: 402 # NCBI annotation: ORF... 41 2e-05 gi|13692|lcl|protein:vir:4897 Length: 411 # NCBI annotation: gp4... 41 2e-05 gi|16299|lcl|protein:vir:3027 Length: 375 # NCBI annotation: ter... 39 1e-04 gi|16780|lcl|protein:vir:2731 Length: 411 # NCBI annotation: hyp... 39 1e-04 gi|16025|lcl|protein:vir:9814 Length: 403 # NCBI annotation: put... 39 1e-04 gi|12899|lcl|protein:vir:80432 Length: 510 # NCBI annotation: Bc... 39 2e-04 gi|5072|lcl|protein:vir:95130 Length: 531 # NCBI annotation: hyp... 38 2e-04 gi|11622|lcl|protein:vir:78869 Length: 439 # NCBI annotation: Te... 38 2e-04 gi|4914|lcl|protein:vir:99572 Length: 459 # NCBI annotation: Ter... 36 0.001 gi|9590|lcl|protein:vir:97250 Length: 515 # NCBI annotation: hyp... 36 0.001 gi|8997|lcl|protein:vir:101640 Length: 432 # NCBI annotation: te... 35 0.002 gi|2589|lcl|protein:vir:94084 Length: 407 # NCBI annotation: ORF... 34 0.004 gi|3523|lcl|protein:vir:105897 Length: 407 # NCBI annotation: la... 34 0.004 gi|6719|lcl|protein:vir:99411 Length: 447 # NCBI annotation: hyp... 33 0.007 gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: te... 31 0.027 gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: ph... 31 0.028 gi|8015|lcl|protein:vir:96495 Length: 195 # NCBI annotation: ter... 28 0.18 gi|1299|lcl|protein:vir:105078 Length: 155 # NCBI annotation: ma... 28 0.27 gi|13269|lcl|protein:vir:81252 Length: 542 # NCBI annotation: gp... 27 0.35 gi|26461|lcl|protein:vir:573 Length: 589 # NCBI annotation: unkn... 26 0.94 gi|15201|lcl|protein:vir:7028 Length: 631 # NCBI annotation: lar... 
25 2.4 gi|4548|lcl|protein:vir:94970 Length: 473 # NCBI annotation: put... 24 3.5 gi|10070|lcl|protein:vir:99006 Length: 303 # NCBI annotation: gp... 23 6.3 gi|17621|lcl|protein:vir:816 Length: 568 # NCBI annotation: hypo... 23 6.6 gi|25680|lcl|protein:vir:10116 Length: 568 # NCBI annotation: hy... 23 6.6 gi|51|lcl|protein:vir:3295 Length: 568 # NCBI annotation: putati... 23 6.6 gi|26240|lcl|protein:vir:2763 Length: 568 # NCBI annotation: hyp... 23 6.6 gi|25849|lcl|protein:vir:9949 Length: 568 # NCBI annotation: hyp... 23 6.6 gi|1476|lcl|protein:vir:104436 Length: 568 # NCBI annotation: pu... 23 6.6 >gi|3326|lcl|protein:vir:94525 Length: 440 # NCBI annotation: putative large subunit terminase # Family: family:all:54 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223885;genbank:gi:62327097;genbank:GeneID :5075541 Length = 440 Score = 632 bits (1629), Expect = 0.0, Method: Compositional matrix adjust. Identities = 293/431 (67%), Positives = 348/431 (80%), Gaps = 1/431 (0%) Query: 1 MSGTVNINLAELLSPAYYPLFKDRSRYLVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWLV 60 M+ T+++N+ ++S AYYP+F R RYLVYKGSRGSGKSYA A KV++D + +PYVNWLV Sbjct: 11 MANTIDLNVPYIVSKAYYPMFNSRDRYLVYKGSRGSGKSYATAAKVIIDIMMYPYVNWLV 70 Query: 61 LRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKITS 120 RQYA T +DSTFAT++KVA + V +LF + SPLE+ YK TGQ++FFRGMD PLKITS Sbjct: 71 TRQYATTQKDSTFATIRKVAHSMGVLDLFKFTKSPLEITYKQTGQKVFFRGMDDPLKITS 130 Query: 121 ITPTTGQLCRAWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHFLKS 180 I P TG +CR W EE YELKSLD F+TVEES+RGEL P G+YQ+++TFNPWS+RH+LK Sbjct: 131 IQPVTGFICRRWCEEAYELKSLDAFDTVEESMRGELP-PGGFYQTVITFNPWSDRHWLKH 189 Query: 181 EFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGLVF 240 EFFD+ T+R+ A TTTYKDNDHLN Y+ SLKEMLVRNPNRARVAVLG+WG+AEGLVF Sbjct: 190 EFFDDKTKRNHSRAITTTYKDNDHLNADYVDSLKEMLVRNPNRARVAVLGEWGIAEGLVF 249 Query: 241 DGLFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLLTG 300 DGLFE+R I LP S+GLDFGFKHDPTA F AVDQ NR+VY+YDE YK LLT Sbjct: 250 
DGLFEQRDFSYDEIANLPKSVGLDFGFKHDPTAGEFIAVDQDNRIVYIYDEFYKQHLLTN 309 Query: 301 QIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQSYRF 360 QIAQ+L H A+G+PITADSA +I EL++ H VP I PSGKGKDSV+QGIQYMQSYRF Sbjct: 310 QIAQELAKHKAFGLPITADSAEQRMIVELSQQHRVPNIKPSGKGKDSVIQGIQYMQSYRF 369 Query: 361 VVDPRCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAIDALRYAMQKFMFVKNGQYMSYT 420 VV PR KGL+EE N YVYD DK GNWLN P DANNHAIDALRYA++K+MFV+ G YM+Y Sbjct: 370 VVHPRVKGLMEEFNTYVYDMDKEGNWLNKPKDANNHAIDALRYALEKYMFVRAGHYMNYQ 429 Query: 421 DRVAELKNLGL 431 +RV+ LKNLGL Sbjct: 430 ERVSTLKNLGL 440 >gi|9289|lcl|protein:vir:97165 Length: 431 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239721;genbank:gi:66394878;genbank:GeneID :5130898 Length = 431 Score = 363 bits (932), Expect = e-102, Method: Compositional matrix adjust. Identities = 182/434 (41%), Positives = 270/434 (62%), Gaps = 16/434 (3%) Query: 7 INLAELLSPAYYPLFKDRSRYLVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWLVLRQYAY 66 ++L + Y + +++ Y V KGSRGS KS AI ++ + + + N LV+R+++ Sbjct: 5 LDLKNKIGGGYNKFWHNKNFYRVVKGSRGSKKSKTTAINLIYRIMKYDWANILVVRRFSN 64 Query: 67 TNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKITSITPTTG 126 TN+ ST+ LK + L V +LF + S E+ YKPTGQ+I FRG+D PLKITSIT TG Sbjct: 65 TNKQSTYTDLKWATNQLGVAHLFKFNESLPEITYKPTGQKILFRGLDDPLKITSITVDTG 124 Query: 127 QLCRAWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHFLKSEFFDEA 186 LC AW+EE Y++++ F+TV ES+RG D P + Q +TFNPWSERH+LK FFDE Sbjct: 125 ILCWAWFEEAYQIETFAKFSTVVESIRGSYDSPEFFKQITVTFNPWSERHWLKPTFFDEE 184 Query: 187 TRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGLVFDGL--- 243 T+ + ++ TTTY+ N+ L++ I+ +++ ++NP RAR+ GDWGVAEGLVFD Sbjct: 185 TKLNNTFSDTTTYRVNEWLDKVDIERYEDLYIKNPRRARIVCDGDWGVAEGLVFDNFKVE 244 Query: 244 ----FEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLLT 299 FEE + Q + H G+DFGF DPT V T VD N+ +++YDE YK +LT Sbjct: 245 DFDWFEEF----KRTQEITH--GMDFGFSQDPTTVVSTVVDLKNKKLFIYDEHYKKAMLT 298 Query: 300 
GQIAQQLRAHMAYGIPITAD--SAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQS 357 I Q L + I AD + G +I+EL + G+ GI + KG ++++ GIQ++Q Sbjct: 299 DDIKQMLIKKGLGDVDIAADYGAGGDRVISEL-KSKGIKGIRKALKGANTILPGIQFIQG 357 Query: 358 YRFVVDPRCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAIDALRYAMQKFMFVKNGQYM 417 + ++ P C+ +EE N Y +D+D G WLN P+DANNH IDALRY+++K+ V+ + Sbjct: 358 FEVIIHPSCEHAIEEFNTYTFDQDNDGKWLNKPIDANNHIIDALRYSLEKYHIVRKKRKK 417 Query: 418 SYTDRVAELKNLGL 431 + + +K+LGL Sbjct: 418 NIESKTKVIKSLGL 431 >gi|6093|lcl|protein:vir:95761 Length: 432 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950582;genbank:gi:119953777;genbank:GeneI D:5076831 Length = 432 Score = 345 bits (886), Expect = 5e-97, Method: Compositional matrix adjust. Identities = 177/409 (43%), Positives = 257/409 (62%), Gaps = 4/409 (0%) Query: 2 SGTVNINLAELLSPAYYPLFKDRSRYLVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWLVL 61 + + +NL E++ Y ++ ++ Y V KG RGS KS A+ +V + + + N LV+ Sbjct: 4 TSQIKVNLPEIVGKGYGQFWRSKNFYRVVKGGRGSKKSKTTALYYIVAILKYNWANLLVV 63 Query: 62 RQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKITSI 121 R+++ TN+ ST+ LK A+ LNV +LF + S E+ K TGQ+I FRG+D PLKITSI Sbjct: 64 RRFSNTNKQSTYTDLKWAANRLNVSHLFKFNESLPEITVKATGQKILFRGLDDPLKITSI 123 Query: 122 TPTTGQLCRAWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHFLKSE 181 T TG L W EE Y++++ D F T+ ES+RG +D P + Q +TFNPWSERH+LKS Sbjct: 124 TVDTGLLSWLWLEEAYQVENQDKFETLVESIRGSIDAPDFFKQITVTFNPWSERHWLKSA 183 Query: 182 FFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGLVFD 241 FFDE TR+ V+A TTTY+ N+ L++ I +++ NP RA V GDWGVAEGLVF+ Sbjct: 184 FFDEDTRKKDVFADTTTYRVNEWLDQQDIDRYEDLWRTNPRRAAVVANGDWGVAEGLVFE 243 Query: 242 GLFEERIID-KQAIQRLPHSI-GLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLLT 299 +E + D I+R+ + GLDFGF HDPT AVD + +++Y E Y+H + T Sbjct: 244 N-YEVKDFDIVSTIKRIGETTAGLDFGFTHDPTTFPRLAVDLEKKELWIYAEHYEHAMTT 302 Query: 300 GQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQSYR 359 I + + ITADSA LIAEL + G+ ++PS KGK S+ GI +M+ ++ Sbjct: 
303 DDIFKMIVDADMQNAVITADSAEQRLIAEL-QAKGIRRLVPSIKGKGSINAGIDFMKQFK 361 Query: 360 FVVDPRCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAIDALRYAMQKF 408 + P C +EE + Y+Y +DK G WLN P+D+NNH IDA+RYA++++ Sbjct: 362 IYIHPSCIKTIEEFDTYIYKQDKDGKWLNEPIDSNNHIIDAIRYALERY 410 >gi|17691|lcl|protein:vir:9305 Length: 447 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803283;genbank:gi:29028593;genbank:GeneID :1258039 Length = 447 Score = 195 bits (496), Expect = 8e-52, Method: Compositional matrix adjust. Identities = 122/374 (32%), Positives = 204/374 (54%), Gaps = 13/374 (3%) Query: 59 LVLRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKI 118 L LR+ T +DS F +K + ++++ W + +V P G F+G+D+P KI Sbjct: 86 LWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKVEL-PNGAVFLFKGLDNPEKI 144 Query: 119 TSITPTTGQLCRAWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHFL 178 SI + + EE E +L+ + + LR + Q L FNP S+ +++ Sbjct: 145 KSIKGISDIVM----EEASEF-TLNDYTQLTLRLR---ERKHVNKQIFLMFNPVSKLNWV 196 Query: 179 KSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGL 238 FF+ V ++Y+DN L+E ++L+ + RNP ++ LG++ + L Sbjct: 197 YKYFFEHGEPMENVMIRQSSYRDNKFLDEMTRQNLELLANRNPAYYKIYALGEFATLDKL 256 Query: 239 VFDGLFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLL 298 VF +E+R+I+K ++ LP GLDFG+ +DP+A + + +D + +Y+ +E K G+L Sbjct: 257 VFPK-YEKRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKIDVKKKKLYIIEEYVKQGML 315 Query: 299 TGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQSY 358 +IA ++ ITADSA IAEL R G+ ILP+ KGK SVVQG+Q++ + Sbjct: 316 NDEIANVIKQLGYAKEEITADSAEQKSIAEL-RNLGLKRILPTKKGKGSVVQGLQFLMQF 374 Query: 359 RFVVDPRCKGLLEELNLYVYDRDK-AGNWLNTPVDANNHAIDALRYAMQKFMFVKNGQYM 417 +VD RC +EE + Y + +DK G + N PVD NH ID+LRY++++F + + Sbjct: 375 EIIVDERCFKTIEEFDNYTWQKDKDTGEYTNEPVDTYNHCIDSLRYSVERF-YRPVRKRT 433 Query: 418 SYTDRVAELKNLGL 431 + + +V +K+LGL Sbjct: 434 NLSSKVDTIKSLGL 447 >gi|7643|lcl|protein:vir:96370 Length: 447 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1613 # 
MgeName: 53 # Cross-refs: genbank:acc:YP_239642;genbank:gi:66395379;genbank:GeneID :5132846 Length = 447 Score = 195 bits (495), Expect = 1e-51, Method: Compositional matrix adjust. Identities = 122/374 (32%), Positives = 204/374 (54%), Gaps = 13/374 (3%) Query: 59 LVLRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKI 118 L LR+ T +DS F +K + ++++ W + +V P G F+G+D+P KI Sbjct: 86 LWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKVEL-PNGAVFLFKGLDNPEKI 144 Query: 119 TSITPTTGQLCRAWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHFL 178 SI + + EE E +L+ + + LR + Q L FNP S+ +++ Sbjct: 145 KSIKGISDIVM----EEASEF-TLNDYTQLTLRLR---ERKHVNKQIFLMFNPVSKLNWV 196 Query: 179 KSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGL 238 FF+ V ++Y+DN L+E ++L+ + RNP ++ LG++ + L Sbjct: 197 YKYFFEHGEPMENVMIRQSSYRDNKFLDEMTRQNLELLANRNPAYYKIYALGEFSTLDKL 256 Query: 239 VFDGLFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLL 298 VF +E+R+I+K ++ LP GLDFG+ +DP+A + + +D + +Y+ +E K G+L Sbjct: 257 VFPK-YEKRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKIDVKKKKLYIIEEYVKQGML 315 Query: 299 TGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQSY 358 +IA ++ ITADSA IAEL R G+ ILP+ KGK SVVQG+Q++ + Sbjct: 316 NDEIANVIKQLGYAKEEITADSAEQKSIAEL-RNLGLKRILPTKKGKGSVVQGLQFLMQF 374 Query: 359 RFVVDPRCKGLLEELNLYVYDRDK-AGNWLNTPVDANNHAIDALRYAMQKFMFVKNGQYM 417 +VD RC +EE + Y + +DK G + N PVD NH ID+LRY++++F + + Sbjct: 375 EIIVDERCFKTIEEFDNYTWQKDKDTGEYTNEPVDTYNHCIDSLRYSVERF-YRPVRKRT 433 Query: 418 SYTDRVAELKNLGL 431 + + +V +K+LGL Sbjct: 434 NVSSKVDTIKSLGL 447 >gi|11588|lcl|protein:vir:78831 Length: 447 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285355;genbank:gi:148717883;genbank:Ge neID:5246962 Length = 447 Score = 195 bits (495), Expect = 1e-51, Method: Compositional matrix adjust. 
Identities = 122/374 (32%), Positives = 204/374 (54%), Gaps = 13/374 (3%) Query: 59 LVLRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKI 118 L LR+ T +DS F +K + ++++ W + +V P G F+G+D+P KI Sbjct: 86 LWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKVEL-PNGAVFLFKGLDNPEKI 144 Query: 119 TSITPTTGQLCRAWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHFL 178 SI + + EE E +L+ + + LR + Q L FNP S+ +++ Sbjct: 145 KSIKGISDIVM----EEASEF-TLNDYTQLTLRLR---ERKHVNKQIFLMFNPVSKLNWV 196 Query: 179 KSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGL 238 FF+ V ++Y+DN L+E ++L+ + RNP ++ LG++ + L Sbjct: 197 YKYFFEHGEPMENVMIRQSSYRDNKFLDEMTRQNLELLANRNPAYYKIYALGEFSTLDKL 256 Query: 239 VFDGLFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLL 298 VF +E+R+I+K ++ LP GLDFG+ +DP+A + + +D + +Y+ +E K G+L Sbjct: 257 VFPK-YEKRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKIDVKKKKLYIIEEYVKQGML 315 Query: 299 TGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQSY 358 +IA ++ ITADSA IAEL R G+ ILP+ KGK SVVQG+Q++ + Sbjct: 316 NDEIANVIKQLGYAKEEITADSAEQKSIAEL-RNLGLKRILPTKKGKGSVVQGLQFLMQF 374 Query: 359 RFVVDPRCKGLLEELNLYVYDRDK-AGNWLNTPVDANNHAIDALRYAMQKFMFVKNGQYM 417 +VD RC +EE + Y + +DK G + N PVD NH ID+LRY++++F + + Sbjct: 375 EIIVDERCFKTIEEFDNYTWQKDKDTGEYTNEPVDTYNHCIDSLRYSVERF-YRPVRKRT 433 Query: 418 SYTDRVAELKNLGL 431 + + +V +K+LGL Sbjct: 434 NVSSKVDTIKSLGL 447 >gi|9829|lcl|protein:vir:103950 Length: 425 # NCBI annotation: phage terminase large subunit # Family: family:all:54 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873987;genbank:gi:118430762;genbank:GeneI D:4525444 Length = 425 Score = 195 bits (495), Expect = 1e-51, Method: Compositional matrix adjust. 
Identities = 122/374 (32%), Positives = 204/374 (54%), Gaps = 13/374 (3%) Query: 59 LVLRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKI 118 L LR+ T +DS F +K + ++++ W + +V P G F+G+D+P KI Sbjct: 64 LWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKVEL-PNGAVFLFKGLDNPEKI 122 Query: 119 TSITPTTGQLCRAWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHFL 178 SI + + EE E +L+ + + LR + Q L FNP S+ +++ Sbjct: 123 KSIKGISDIVM----EEASEF-TLNDYTQLTLRLR---ERKHVNKQIFLMFNPVSKLNWV 174 Query: 179 KSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGL 238 FF+ V ++Y+DN L+E ++L+ + RNP ++ LG++ + L Sbjct: 175 YKYFFEHGEPMENVMIRQSSYRDNKFLDEMTRQNLELLANRNPAYYKIYALGEFATLDKL 234 Query: 239 VFDGLFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLL 298 VF +E+R+I+K ++ LP GLDFG+ +DP+A + + +D + +Y+ +E K G+L Sbjct: 235 VFPK-YEKRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKIDVKKKKLYIIEEYVKQGML 293 Query: 299 TGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQSY 358 +IA ++ ITADSA IAEL R G+ ILP+ KGK SVVQG+Q++ + Sbjct: 294 NDEIANVIKQLGYAKEEITADSAEQKSIAEL-RNLGLKRILPTKKGKGSVVQGLQFLMQF 352 Query: 359 RFVVDPRCKGLLEELNLYVYDRDK-AGNWLNTPVDANNHAIDALRYAMQKFMFVKNGQYM 417 +VD RC +EE + Y + +DK G + N PVD NH ID+LRY++++F + + Sbjct: 353 EIIVDERCFKTIEEFDNYTWQKDKDTGEYTNEPVDTYNHCIDSLRYSVERF-YRPVRKRT 411 Query: 418 SYTDRVAELKNLGL 431 + + +V +K+LGL Sbjct: 412 NVSSKVDTIKSLGL 425 >gi|7301|lcl|protein:vir:96238 Length: 447 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239566;genbank:gi:66395301;genbank:GeneID :5132787 Length = 447 Score = 194 bits (494), Expect = 1e-51, Method: Compositional matrix adjust. 
Identities = 122/374 (32%), Positives = 204/374 (54%), Gaps = 13/374 (3%) Query: 59 LVLRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKI 118 L LR+ T +DS F +K + ++++ W + +V P G F+G+D+P KI Sbjct: 86 LWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKVEL-PNGAVFLFKGLDNPEKI 144 Query: 119 TSITPTTGQLCRAWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHFL 178 SI + + EE E +L+ + + LR + Q L FNP S+ +++ Sbjct: 145 KSIKGISDIVM----EEASEF-TLNDYTQLTLRLR---ERKHVNKQIFLMFNPVSKLNWV 196 Query: 179 KSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGL 238 FF+ V ++Y+DN L+E ++L+ + RNP ++ LG++ + L Sbjct: 197 YKYFFEHGEPMENVMIRQSSYRDNKFLDEMTRQNLELLANRNPAYYKIYALGEFATLDKL 256 Query: 239 VFDGLFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLL 298 VF +E+R+I+K ++ LP GLDFG+ +DP+A + + +D + +Y+ +E K G+L Sbjct: 257 VFPK-YEKRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKIDVKKKKLYIIEEYVKQGML 315 Query: 299 TGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQSY 358 +IA ++ ITADSA IAEL R G+ ILP+ KGK SVVQG+Q++ + Sbjct: 316 NDEIANVIKQLGYAREEITADSAEQKSIAEL-RNLGLKRILPTKKGKGSVVQGLQFLMQF 374 Query: 359 RFVVDPRCKGLLEELNLYVYDRDK-AGNWLNTPVDANNHAIDALRYAMQKFMFVKNGQYM 417 +VD RC +EE + Y + +DK G + N PVD NH ID+LRY++++F + + Sbjct: 375 EIIVDERCFKTIEEFDNYTWQKDKDTGEYTNEPVDTYNHCIDSLRYSVERF-YRPVRKRT 433 Query: 418 SYTDRVAELKNLGL 431 + + +V +K+LGL Sbjct: 434 NLSSKVDTIKSLGL 447 >gi|2779|lcl|protein:vir:99776 Length: 425 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004302;genbank:gi:122891756;genbank:Ge neID:4712331 Length = 425 Score = 194 bits (493), Expect = 2e-51, Method: Compositional matrix adjust. 
Identities = 122/374 (32%), Positives = 204/374 (54%), Gaps = 13/374 (3%) Query: 59 LVLRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKI 118 L LR+ T +DS F +K + ++++ W + +V P G F+G+D+P KI Sbjct: 64 LWLRKVQSTIKDSLFEDVKDCLINFGIWDMCLWNKTDNKVGL-PNGAVFLFKGLDNPEKI 122 Query: 119 TSITPTTGQLCRAWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHFL 178 SI + + EE E +L+ + + LR + Q L FNP S+ +++ Sbjct: 123 KSIKGISDIVM----EEASEF-TLNDYTQLTLRLR---ERKHVNKQIFLIFNPVSKLNWV 174 Query: 179 KSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGL 238 FF+ V ++Y+DN L+E ++L+ + RNP ++ LG++ + L Sbjct: 175 YKYFFEHGEPMENVMIRQSSYRDNKFLDEMTRQNLELLANRNPAYYKIYALGEFATLDKL 234 Query: 239 VFDGLFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLL 298 VF +E+R+I+K ++ LP GLDFG+ +DP+A + + +D + +Y+ +E K G+L Sbjct: 235 VFPK-YEKRLINKDELRHLPSYFGLDFGYVNDPSAFIHSKIDVKKKKLYIIEEYVKQGML 293 Query: 299 TGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQSY 358 +IA ++ ITADSA IAEL R G+ ILP+ KGK SVVQG+Q++ + Sbjct: 294 NDEIANVIKQLGYAKEEITADSAEQKSIAEL-RNLGLKRILPTKKGKGSVVQGLQFLMQF 352 Query: 359 RFVVDPRCKGLLEELNLYVYDRDK-AGNWLNTPVDANNHAIDALRYAMQKFMFVKNGQYM 417 +VD RC +EE + Y + +DK G + N PVD NH ID+LRY++++F + + Sbjct: 353 EIIVDERCFKTIEEFDNYTWQKDKDTGEYTNEPVDTYNHCIDSLRYSVERF-YRPVRKRT 411 Query: 418 SYTDRVAELKNLGL 431 + + +V +K+LGL Sbjct: 412 NVSSKVDTIKSLGL 425 >gi|5321|lcl|protein:vir:99534 Length: 422 # NCBI annotation: putative protein # Family: family:all:54 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958532;genbank:gi:41179314;genbank:GeneID :2717172 Length = 422 Score = 190 bits (483), Expect = 2e-50, Method: Compositional matrix adjust. 
Identities = 138/388 (35%), Positives = 207/388 (53%), Gaps = 22/388 (5%) Query: 29 VYKGSRGSGKSYAAAIKVLVDTIAHPYV--NWLVLRQYAYTNRDSTFATLKKVASDLNVY 86 V+ G SGKS+ KV++ ++ H V L LR+ T ++S F + + S N+ Sbjct: 32 VWYGGASSGKSHGVVQKVVLKSLQHWNVPRKVLWLRKVDRTVKNSIFTDVTECLSGWNIL 91 Query: 87 NLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKITSITPTTGQLCRAWYEECYELKSLDGFN 146 S +V P G F+GMD P KI SI L EE E D Sbjct: 92 QYCHVNRSDKTIVL-PNGAIFLFQGMDDPEKIKSIK----GLSDVVMEEASEFNHND--- 143 Query: 147 TVEESLRGELDDPSGYYQSILT-FNPWSERHFLKSEFFDEATR--RSGVYATTTTYKDND 203 + +LR L +P + I FNP S+ ++ +FD + RS V +TYKDN Sbjct: 144 YTQLTLR--LREPKHKQRQIFCMFNPVSKLNWTYQTWFDPSADYDRSRVAIHQSTYKDNR 201 Query: 204 HLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGLVFDGLFEERIIDKQ--AIQRLPHSI 261 L+E I++++E+ NP ++ LG++ + LVF FE + ++ + + L Sbjct: 202 FLDEDNIRTIEELKNTNPAYYKIYTLGEFATLDKLVF-PYFETKRLNPRDPKLLALNDYF 260 Query: 262 GLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLLTGQIAQQLRAHMAYGIP-ITADS 320 GLD+GF +DP+A + +D N+ +YV DE K GLL Q+AQ ++ M Y ITADS Sbjct: 261 GLDYGFINDPSAFMHIKLDMRNKTLYVMDEFVKKGLLNNQLAQVIK-DMGYSKEVITADS 319 Query: 321 AGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQSYRFVVDPRCKGLLEELNLYVYDR 380 A IAE+ R G+ I P+ KG DS++QGIQ++Q +++VVD RC +EEL Y Y + Sbjct: 320 AEKKSIAEMKR-DGIYRIRPALKGPDSIIQGIQFLQQFKWVVDDRCVKTIEELQNYTYVK 378 Query: 381 DKAGN-WLNTPVDANNHAIDALRYAMQK 407 DK + + N P+DA NH IDA+RYA+++ Sbjct: 379 DKKTDEYTNRPIDAYNHCIDAIRYAVEE 406 >gi|5208|lcl|protein:vir:106641 Length: 446 # NCBI annotation: ORF005 # Family: family:all:54 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239489;genbank:gi:66395220;genbank:GeneID :4555795 Length = 446 Score = 190 bits (483), Expect = 3e-50, Method: Compositional matrix adjust. 
Identities = 119/358 (33%), Positives = 189/358 (52%), Gaps = 12/358 (3%) Query: 59 LVLRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKI 118 L LR+ T +DS F +K + ++++ W + +V P G F+G+D+P KI Sbjct: 86 LWLRKVQSTIKDSLFEDVKGCLINFGIWDMCLWNKTDNKVEL-PNGAVFLFKGLDNPEKI 144 Query: 119 TSITPTTGQLCRAWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHFL 178 SI + + EE E +L+ + + LR + Q L FNP S+ +++ Sbjct: 145 KSIKGISDIVM----EEASEF-TLNDYTQLTLRLR---ERKHMNKQIFLMFNPVSKLNWV 196 Query: 179 KSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGL 238 FF+ V ++Y+DN L+E ++L+ + RNP ++ LG++ + L Sbjct: 197 YKYFFEHGEPMENVMIRQSSYRDNKFLDEMTRQNLELLANRNPAYYKIYALGEFATLDKL 256 Query: 239 VFDGLFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLL 298 VF +E+RII + + LP GLDFG+ +DP+A + +D N+ +YV E K G+L Sbjct: 257 VFPK-YEKRIISDKEVGHLPSYFGLDFGYVNDPSAFIHVKIDNDNKKLYVISEYVKKGML 315 Query: 299 TGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQSY 358 +IAQ + ITADSA I E+ + +G+ I+P+ KGKDSV+ GIQ++ + Sbjct: 316 NNEIAQVINDLGYSKEKITADSAEQKSIMEI-KTNGIDRIVPAMKGKDSVMAGIQFVSQF 374 Query: 359 RFVVDPRCKGLLEELNLYVYDRDK-AGNWLNTPVDANNHAIDALRYAMQKFMFVKNGQ 415 V+D RC +EE + Y + +DK G + N PVD NH IDALRYA++ K Q Sbjct: 375 DIVIDERCYKTIEEFDNYTWKKDKNTGEYYNEPVDTYNHCIDALRYAVEVLTIQKKHQ 432 >gi|5891|lcl|protein:vir:107098 Length: 421 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950600;genbank:gi:119953680;genbank:GeneI D:4643107 Length = 421 Score = 187 bits (475), Expect = 2e-49, Method: Compositional matrix adjust. 
Identities = 135/416 (32%), Positives = 211/416 (50%), Gaps = 15/416 (3%) Query: 4 TVNINLAELLSPAYYPLFK---DRSRY-LVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWL 59 T++INL+ELL ++ L+K DR + +V KG RGSGKS +I + + +P +N + Sbjct: 2 TISINLSELLPKHFHSLWKATKDREKLNIVAKGGRGSGKSSDISIIITQLIMRYP-MNAV 60 Query: 60 VLRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKIT 119 V+R+ T S F +K + V +LF K SP+E+ Y P G RI FRG +P ++ Sbjct: 61 VVRKTDNTLATSVFEQIKWAIEEQKVSHLFKVKVSPMEITYVPRGNRIIFRGAQNPERLK 120 Query: 120 SITPTTGQLCRAWYEECYELKSLDGFNTVEES-LRGELDDPSGYYQSILTFNPWSERHFL 178 S+ + W EE E K+ D T+ S LRGELDD +Y+ ++NP + Sbjct: 121 SLKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDD-GLFYKFFFSYNPPKRKQSW 179 Query: 179 KSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGL 238 ++ ++ + + + +TY DN +++ +I+ + RN R R +G+ + G+ Sbjct: 180 VNKKYETSFQPDNTFVHHSTYLDNPFISKQFIQEAESAKERNEQRYRWEYMGE-AIGSGV 238 Query: 239 V-FDGLFEERIIDK--QAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKH 295 V F+ L E+I D + + +++ DFG+ DP A V D+ R++Y DE Y Sbjct: 239 VPFNNLQIEKIPDDLYKTFDNIRNAV--DFGYATDPLAFVRWHYDKKKRIIYAVDEHYGV 296 Query: 296 GLLTGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYM 355 + + A L+ I ADSA IAEL + HG+ I KG DSV G Q++ Sbjct: 297 QISNREFANWLKRRGYQSDEIYADSAEPKSIAELKQEHGIKRIKGVKKGPDSVEHGEQWL 356 Query: 356 QSYR-FVVDP-RCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAIDALRYAMQKFM 409 V+DP R + E Y+ DK GN D +NH IDA RYA+++ M Sbjct: 357 DDLTAIVIDPNRTPNIAREFENIDYETDKDGNVKPRLEDKDNHTIDATRYALERDM 412 >gi|15353|lcl|protein:vir:9921 Length: 425 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795683;genbank:gi:28876465;genbank:GeneID :1257982 Length = 425 Score = 183 bits (465), Expect = 3e-48, Method: Compositional matrix adjust. 
Identities = 129/405 (31%), Positives = 204/405 (50%), Gaps = 26/405 (6%) Query: 29 VYKGSRGSGKSYAAAIKVLVDTI----AHPYVNWLVLRQYAYTNRDSTFATLKKVASDLN 84 V+ G SGKS+ K+++ + HP LVLR+ T RDS FA + S Sbjct: 37 VHYGGASSGKSHGVFQKIILKALNPKFKHPR-KILVLRKVGATVRDSVFADIMSNLSYFG 95 Query: 85 VYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKITSITPTTGQLCRAWYEECYELKSLDG 144 + + S + P G F+GMD+P KI SI + + EE E +LD Sbjct: 96 ILDKCKINMSAFRITL-PNGAEFIFKGMDNPEKIKSIKGISDVVM----EEASEF-TLDD 149 Query: 145 FNTVEESLRGELDDPSGYYQSILTFNPWSERHFLKSEFFDEATRRSGVYATTTTYKDNDH 204 + + LR D Q L FNP S+ +++ FF + + + VY TT YKDN Sbjct: 150 YTQLTLRLR---DKKHLEKQIYLMFNPVSKVNWVYKAFFVKTPKNTVVYQTT--YKDNRF 204 Query: 205 LNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGLVFDGLFEERIIDKQAIQRLPHSIGLD 264 L++ ++++E+ RN ++ LG + + L+F ++++I++K + LP GLD Sbjct: 205 LDDVTRENIEELANRNEAYYKIYALGQFATLDKLIFPK-YDKQILNKDKLSHLPSFFGLD 263 Query: 265 FGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLLTGQIAQQLRAHMAYGIPITADSAGSN 324 +GF +DP+A + +D N+ +Y+ +E + L +IA ++ I DSA Sbjct: 264 YGFINDPSALLHVKIDDANKKLYILEEYVRKNLTNDKIANAIKDLGYAKEEIRGDSAEKK 323 Query: 325 LIAELTRVHGVPGILPSGKGKDSVVQGIQYMQSYRFVVDPRCKGLLEELNLYVYDRDKAG 384 EL R G+P ++ KG +V+QGIQY+ Y ++VD RC +EEL Y + +DK Sbjct: 324 SNQEL-RNLGIPRMIDVTKGPGTVMQGIQYLLQYDWIVDERCVKTIEELENYTWKKDKKT 382 Query: 385 N-WLNTPVDANNHAIDALRYAMQKFMFVKNGQYMSYTDRVAELKN 428 N + N PVD+ NH IDA+RYA+Q ++ DR +KN Sbjct: 383 NEYTNEPVDSYNHCIDAIRYAVQDRIY-------QSADRSKRMKN 420 >gi|10354|lcl|protein:vir:97448 Length: 421 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240743;genbank:gi:66396415;genbank:GeneID :5133804 Length = 421 Score = 175 bits (443), Expect = 1e-45, Method: Compositional matrix adjust. 
Identities = 135/414 (32%), Positives = 208/414 (50%), Gaps = 11/414 (2%) Query: 4 TVNINLAELLSPAYYPLFK---DRSRY-LVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWL 59 T++INL++LL ++PL+K D+ +V KG RGSGKS +I + + +P +N + Sbjct: 2 TISINLSDLLPKHFHPLWKVTKDKEVLNVVAKGGRGSGKSSDISIIITQLIMRYP-MNAV 60 Query: 60 VLRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKIT 119 V+R+ T S F +K + V +LF K SP+E+ Y P G RI FRG +P ++ Sbjct: 61 VIRKTDNTLATSVFEQIKWAIEEQKVSHLFKVKVSPMEITYIPRGNRIIFRGAQNPERLK 120 Query: 120 SITPTTGQLCRAWYEECYELKSLDGFNTVEES-LRGELDDPSGYYQSILTFNPWSERHFL 178 S+ + AW EE E K+ D T+ S LRGELD+ +Y+ ++NP + Sbjct: 121 SLKDSRFPFSIAWIEELAEFKTEDEVTTITNSLLRGELDE-GLFYKFFFSYNPPKRKQSW 179 Query: 179 KSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGL 238 ++ ++ + + Y +TY +N +++ +I+ + RN R R +G+ + G+ Sbjct: 180 VNKKYESSFQADNTYVHHSTYLNNPFISKQFIQEAESAKKRNEQRYRWEYMGE-AIGSGV 238 Query: 239 V-FDGLFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGL 297 V F+ L E I +Q +DFG+ DP A V D+ RV+Y DE Y + Sbjct: 239 VPFNNLRIEEIPQRQYDTFDNIRNAVDFGYATDPLAFVRWHYDKKKRVIYAMDEYYGVQI 298 Query: 298 LTGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQS 357 + A L+ I ADSA IAEL + HG+ I KG DSV G Q++ Sbjct: 299 SNREFANWLKKKGYQSDEIFADSAEPKSIAELKQEHGIKKIKGVKKGADSVEFGEQWLDD 358 Query: 358 Y-RFVVDP-RCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAIDALRYAMQKFM 409 V+DP R + E Y+ DK GN D +NH IDA RYA+++ M Sbjct: 359 LDAIVIDPRRTPNIAREFENIDYETDKDGNVKPKLEDKDNHTIDATRYALERDM 412 >gi|3182|lcl|protein:vir:94499 Length: 421 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240671;genbank:gi:66396341;genbank:GeneID :5133763 Length = 421 Score = 175 bits (443), Expect = 1e-45, Method: Compositional matrix adjust. 
Identities = 135/414 (32%), Positives = 208/414 (50%), Gaps = 11/414 (2%) Query: 4 TVNINLAELLSPAYYPLFK---DRSRY-LVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWL 59 T++INL++LL ++PL+K D+ +V KG RGSGKS +I + + +P +N + Sbjct: 2 TISINLSDLLPKHFHPLWKVTKDKEVLNVVAKGGRGSGKSSDISIIITQLIMRYP-MNAV 60 Query: 60 VLRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKIT 119 V+R+ T S F +K + V +LF K SP+E+ Y P G RI FRG +P ++ Sbjct: 61 VIRKTDNTLATSVFEQIKWAIEEQKVSHLFKVKVSPMEITYIPRGNRIIFRGAQNPERLK 120 Query: 120 SITPTTGQLCRAWYEECYELKSLDGFNTVEES-LRGELDDPSGYYQSILTFNPWSERHFL 178 S+ + AW EE E K+ D T+ S LRGELD+ +Y+ ++NP + Sbjct: 121 SLKDSRFPFSIAWIEELAEFKTEDEVTTITNSLLRGELDE-GLFYKFFFSYNPPKRKQSW 179 Query: 179 KSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGL 238 ++ ++ + + Y +TY +N +++ +I+ + RN R R +G+ + G+ Sbjct: 180 VNKKYESSFQADNTYVHHSTYLNNPFISKQFIQEAESAKKRNEQRYRWEYMGE-AIGSGV 238 Query: 239 V-FDGLFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGL 297 V F+ L E I +Q +DFG+ DP A V D+ RV+Y DE Y + Sbjct: 239 VPFNNLRIEEIPQRQYDTFDNIRNAVDFGYATDPLAFVRWHYDKKKRVIYAMDEYYGVQI 298 Query: 298 LTGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQS 357 + A L+ I ADSA IAEL + HG+ I KG DSV G Q++ Sbjct: 299 SNREFANWLKKKGYQSDEIFADSAEPKSIAELKQEHGIKKIKGVKKGADSVEFGEQWLDD 358 Query: 358 Y-RFVVDP-RCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAIDALRYAMQKFM 409 V+DP R + E Y+ DK GN D +NH IDA RYA+++ M Sbjct: 359 LDAIVIDPRRTPNIAREFENIDYETDKDGNVKPKLEDKDNHTIDATRYALERDM 412 >gi|7569|lcl|protein:vir:96300 Length: 421 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240307;genbank:gi:66395973;genbank:GeneID :5133377 Length = 421 Score = 174 bits (440), Expect = 2e-45, Method: Compositional matrix adjust. 
Identities = 133/414 (32%), Positives = 207/414 (50%), Gaps = 11/414 (2%) Query: 4 TVNINLAELLSPAYYPLFK---DRSRY-LVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWL 59 T++INL++LL ++PL+K D+ ++ KG RGSGKS +I + + +P +N + Sbjct: 2 TISINLSDLLPKHFHPLWKATKDKDLLNIIAKGGRGSGKSSDISIIITQLIMRYP-MNAV 60 Query: 60 VLRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKIT 119 V+R+ T S F +K + V +LF K SP+E+ Y P G RI FRG +P ++ Sbjct: 61 VIRKTDNTLATSVFEQIKWAIEEQKVTHLFKVKVSPMEITYIPRGNRIIFRGAQNPERLK 120 Query: 120 SITPTTGQLCRAWYEECYELKSLDGFNTVEES-LRGELDDPSGYYQSILTFNPWSERHFL 178 S+ + AW EE E K+ D T+ S LRGELD+ +Y+ ++NP + Sbjct: 121 SLKDSRFPFSIAWIEELAEFKTEDEVTTITNSLLRGELDE-GLFYKFFFSYNPPKRKQSW 179 Query: 179 KSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGL 238 ++ ++ + + + +TY +N +++ +I+ + RN R R +G+ + G+ Sbjct: 180 VNKKYESSFQADNTFVHHSTYLNNPFISKQFIQEAESAKKRNEQRYRWEYMGE-AIGSGV 238 Query: 239 V-FDGLFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGL 297 V F+ L E I Q +DFG+ DP A V D+ RV+Y DE Y + Sbjct: 239 VPFNNLRIEEIPQGQYDTFDNIRNAVDFGYATDPLAFVRWHYDKKKRVIYAMDEHYGVQI 298 Query: 298 LTGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQS 357 + A L+ I ADSA IAEL + HG+ + KG DSV G Q++ Sbjct: 299 SNREFANWLKKKGYQSDEIFADSAEPKSIAELKQEHGIKKVKAVKKGADSVEYGEQWLDD 358 Query: 358 YR-FVVDP-RCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAIDALRYAMQKFM 409 V+DP R + E Y DK GN D +NHAIDA RYA+++ M Sbjct: 359 LEAIVIDPRRTPNIAREFENIDYQTDKDGNVKPKLEDKDNHAIDATRYALERDM 412 >gi|6284|lcl|protein:vir:95890 Length: 421 # NCBI annotation: ORF008 # Family: family:all:54 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240381;genbank:gi:66396048;genbank:GeneID :5133401 Length = 421 Score = 172 bits (436), Expect = 7e-45, Method: Compositional matrix adjust. 
Identities = 132/414 (31%), Positives = 204/414 (49%), Gaps = 11/414 (2%) Query: 4 TVNINLAELLSPAYYPLFKDRSRY----LVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWL 59 T++INL++LL + PL+K ++ KG RGSGKS +I + + +P +N + Sbjct: 2 TISINLSDLLPKHFRPLWKATKDKGILNIIAKGGRGSGKSSDISIIITQLIMRYP-MNAV 60 Query: 60 VLRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKIT 119 V+R+ T S F +K + V +LF K SP+E+ Y P G RI FRG +P ++ Sbjct: 61 VIRKTDNTLATSVFEQIKWAIEEQKVSHLFKVKVSPMEITYIPRGNRIIFRGAQNPERLK 120 Query: 120 SITPTTGQLCRAWYEECYELKSLDGFNTVEES-LRGELDDPSGYYQSILTFNPWSERHFL 178 S+ + AW EE E K+ D T+ S LRGELD+ +Y+ ++NP + Sbjct: 121 SLKDSRFPFSVAWIEELAEFKTEDEVTTITNSLLRGELDE-GLFYKFFFSYNPPKRKQSW 179 Query: 179 KSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGL 238 ++ ++ + + + +TY +N +++ +I+ + RN R R +G+ + G+ Sbjct: 180 VNKKYESSFQADNTFVHHSTYLNNPFISKQFIQEAESAKKRNEQRYRWEYMGE-AIGSGV 238 Query: 239 V-FDGLFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGL 297 V F+ L E I Q +DFG+ DP A V D+ RV+Y DE Y + Sbjct: 239 VPFNNLRIEEIPQGQYDTFDNIRNAVDFGYATDPLAFVRWHYDKKKRVIYAMDEHYGVQI 298 Query: 298 LTGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQS 357 + A L+ I ADSA IAEL + HG+ + KG DSV G Q++ Sbjct: 299 SNREFANWLKKKGYQSDEIFADSAEPKSIAELKQEHGIKKVKAVKKGADSVEYGEQWLDD 358 Query: 358 YR-FVVDP-RCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAIDALRYAMQKFM 409 V+DP R + E Y DK GN D +NHAIDA RYA+++ M Sbjct: 359 LEAIVIDPRRTPNIAREFENIDYQTDKDGNVKPKLEDKDNHAIDATRYALERDM 412 >gi|7076|lcl|protein:vir:96147 Length: 419 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240074;genbank:gi:66395738;genbank:GeneID :5133130 Length = 419 Score = 172 bits (436), Expect = 8e-45, Method: Compositional matrix adjust. 
Identities = 132/413 (31%), Positives = 203/413 (49%), Gaps = 11/413 (2%) Query: 5 VNINLAELLSPAYYPLF---KDRSRY-LVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWLV 60 + + L+EL+ ++ L+ KD+ + +V KG RGSGKS AI +++ + +P VN L+ Sbjct: 1 MRVKLSELIPEHFHSLWHAAKDKGKLNIVAKGGRGSGKSSDIAIIIVLLIMRYP-VNALI 59 Query: 61 LRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKITS 120 LR+ T S F +K + + V +LF K SP+E+ Y P G ++ FRG +P +I S Sbjct: 60 LRKIDNTLALSVFEQIKWAINVMGVSHLFKIKVSPMEITYVPRGNKMVFRGAQNPERIKS 119 Query: 121 ITPTTGQLCRAWYEECYELKSLDGFNTVEES-LRGELDDPSGYYQSILTFNPWSERHFLK 179 + AW EE E K+ D T+ S LRGELD+ +Y+ T+NP + Sbjct: 120 LKDAQFPYAIAWIEELAEFKTEDEVTTITNSLLRGELDN-GLFYKFFYTYNPPKRKQSWV 178 Query: 180 SEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGLV 239 ++ ++ + + + +TY +N + + +I+ K N R R LG+ + G+V Sbjct: 179 NKKYESSFQPDNTFVHHSTYLNNPFIAKEFIEEAKAAKAINELRYRWEYLGE-AIGSGVV 237 Query: 240 -FDGLFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLL 298 F+ L E I +Q +DFG+ DP A V D+ R++Y DE Y + Sbjct: 238 PFNNLRIETIPKEQFDTFDNIRNAVDFGYATDPLAFVRWHYDKKKRIIYAVDEHYGVQIS 297 Query: 299 TGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQSY 358 + A L+ I ADSA IAEL + H + I KG DSV G Q++ Sbjct: 298 NREFANWLKKKGYQSDEIYADSAEPKSIAELKQEHSIRRIKGVKKGPDSVEHGEQWLNDL 357 Query: 359 -RFVVDP-RCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAIDALRYAMQKFM 409 V+DP R + E Y DK GN D +NH IDA RYA+++ M Sbjct: 358 DAIVIDPTRTPNIAREFENIDYQTDKDGNVKPRLEDKDNHTIDATRYALERDM 410 >gi|4983|lcl|protein:vir:95058 Length: 421 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240816;genbank:gi:66394679;genbank:GeneID :5133852 Length = 421 Score = 172 bits (435), Expect = 8e-45, Method: Compositional matrix adjust. 
Identities = 133/414 (32%), Positives = 208/414 (50%), Gaps = 11/414 (2%) Query: 4 TVNINLAELLSPAYYPLFK---DRSRY-LVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWL 59 T++INL++LL ++PL+K D+ +V KG RGSGKS +I + + +P +N + Sbjct: 2 TISINLSDLLPMHFHPLWKATKDKEILNIVAKGGRGSGKSSDISIIITQLIMRYP-MNAV 60 Query: 60 VLRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKIT 119 V+R+ T S F +K + V +LF K SP+E+ Y P G RI FRG +P ++ Sbjct: 61 VIRKTDNTLATSVFEQIKWAIEEQKVSHLFKVKVSPMEITYIPRGNRIIFRGAQNPERLK 120 Query: 120 SITPTTGQLCRAWYEECYELKSLDGFNTVEES-LRGELDDPSGYYQSILTFNPWSERHFL 178 S+ + +W EE E K+ D T+ S LRGELD+ +Y+ ++NP + Sbjct: 121 SLKDSRFPFSISWIEELAEFKTEDEVTTITNSLLRGELDE-GLFYKFFFSYNPPKRKQSW 179 Query: 179 KSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGL 238 ++ ++ + + Y +TY +N +++ +I+ + RN R R +G+ + G+ Sbjct: 180 VNKKYESSFQADNTYVHHSTYLNNPFISKQFIQEAESAKKRNEQRYRWEYMGE-AIGSGV 238 Query: 239 V-FDGLFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGL 297 V F+ L E I +Q +DFG+ DP A V D+ RV+Y DE Y + Sbjct: 239 VPFNNLRIEEIPQRQYDTFDNIRNAVDFGYATDPLAFVRWHYDKKKRVIYAMDEHYGVQI 298 Query: 298 LTGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQS 357 + A L+ + ADSA IAEL + HG+ I KG DSV G Q++ Sbjct: 299 SNREFANWLKKKGYQSDEVFADSAEPKSIAELKQEHGIKKIKGVKKGADSVEFGEQWLDD 358 Query: 358 Y-RFVVDP-RCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAIDALRYAMQKFM 409 V+DP R + E Y+ DK GN D +NH IDA RYA+++ M Sbjct: 359 LDAIVIDPRRTPNIAREFENIDYETDKDGNVKPKLEDKDNHTIDATRYALERDM 412 >gi|10483|lcl|protein:vir:105297 Length: 446 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950664;genbank:gi:119967835;genbank:GeneI D:4643176 Length = 446 Score = 169 bits (428), Expect = 5e-44, Method: Compositional matrix adjust. 
Identities = 133/441 (30%), Positives = 210/441 (47%), Gaps = 41/441 (9%) Query: 5 VNINLAELLSPAYYPLFK---DRSRY-LVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWLV 60 ++I L+ELL ++ L+K DR + +V KG RGSGKS +I + + +P +N +V Sbjct: 2 ISIKLSELLPKHFHSLWKATKDREKLNIVAKGGRGSGKSSDISIIITQLIMRYP-MNAVV 60 Query: 61 LRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKITS 120 +R+ T S F +K + V +LF K SP+E+ Y P G RI FRG +P ++ S Sbjct: 61 VRKADNTLATSVFEQIKWAIEEQKVSHLFKVKVSPMEITYVPRGNRIIFRGAQNPERLKS 120 Query: 121 ITPTTGQLCRAWYEECYELKSLDGFNTVEES-LRGELDDPSGYYQSILTFNPWSERHFLK 179 + + W EE E K+ D T+ S LRGELDD +Y+ ++NP + Sbjct: 121 LKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDD-GLFYKFFFSYNPPKRKQSWV 179 Query: 180 SEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGLV 239 ++ ++ + + + +TY DN +++ +I+ + RN R R +G+ + G+V Sbjct: 180 NKKYETSFQPDNTFVHHSTYLDNPFISKQFIQEAESAKERNEQRYRWEYMGE-AIGSGVV 238 Query: 240 -FDGLFEERIIDK--QAIQRLPHSIGLDFGFKH--------------------------D 270 F+ L E+I D+ ++ + +++ DFG D Sbjct: 239 PFNNLQIEKIPDELYKSFDNIRNAV--DFGLTKTAPLHSDVYSKLGEHISGVRKKACATD 296 Query: 271 PTAAVFTAVDQVNRVVYVYDEVYKHGLLTGQIAQQLRAHMAYGIPITADSAGSNLIAELT 330 P A V D+ R++Y DE Y + + A L+ I ADSA IAEL Sbjct: 297 PLAFVRWHYDKKKRIIYAVDEHYGVQISNREFANWLKRRGYQSDEIYADSAEPKSIAELK 356 Query: 331 RVHGVPGILPSGKGKDSVVQGIQYMQSYR-FVVDP-RCKGLLEELNLYVYDRDKAGNWLN 388 + HG+ I KG DSV G Q++ V+DP R + E Y+ DK GN Sbjct: 357 QEHGIKRIKGVKKGPDSVEHGEQWLDDLTAIVIDPNRTPNIAREFENIDYETDKDGNVKP 416 Query: 389 TPVDANNHAIDALRYAMQKFM 409 D +NH IDA RYA+++ M Sbjct: 417 RLEDKDNHTIDATRYALERDM 437 >gi|18074|lcl|protein:vir:5954 Length: 422 # NCBI annotation: hypothetical protein # Family: family:all:54 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690654;genbank:geneid:6329138;genbank:gi: 22855048;goa:P54308;interpro:IPR006437;interpro:IPR00670 1;interpro:IPR011441;uniprot:P54308;genbank:GeneID:95525 4 Length = 422 Score = 155 bits (393), Expect = 6e-40, Method: Compositional matrix adjust. 
Identities = 126/413 (30%), Positives = 201/413 (48%), Gaps = 13/413 (3%) Query: 7 INLAELLSPAYYPLFK--DRSRYLVY--KGSRGSGKSYAAAIKVLVDTIAHPYVNWLVLR 62 + L+E +P + +++ +++L Y KG RGS KS A+ +++ + P + +LV+R Sbjct: 4 VRLSEKFTPHFLEVWRTVKAAQHLKYVLKGGRGSAKSTHIAMWIILLMMMMP-ITFLVIR 62 Query: 63 QYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKITSIT 122 + T S F LK+ L V +L+ SPL + Y P G I FRG D KI SI Sbjct: 63 RVYNTVEQSVFEQLKEAIDMLEVGHLWKVSKSPLRLTYIPRGNSIIFRGGDDVQKIKSIK 122 Query: 123 PTTGQLCRAWYEECYELKSLDGFNTVEES-LRGELDDPSGYYQSILTFNPWSERHFLKSE 181 + + W EE E K+ + + +E+S LR EL P Y ++NP + ++ Sbjct: 123 ASKFPVAGMWIEELAEFKTEEEVSVIEKSVLRAEL-PPGCRYIFFYSYNPPKRKQSWVNK 181 Query: 182 FFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGLV-F 240 F+ + + + +TY N L++++I+ +E+ RN + R LG+ + G+V F Sbjct: 182 VFNSSFLPANTFVDHSTYLQNPFLSKAFIEEAEEVKRRNELKYRHEYLGE-ALGSGVVPF 240 Query: 241 DGL-FEERIIDKQAIQRLPH-SIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLL 298 + L EE II + R + GLDFG+ DP A V D+ +Y DE+ H + Sbjct: 241 ENLQIEEGIITDAEVARFDNIRQGLDFGYGPDPLAFVRWHYDKRKNRIYAIDELVDHKVS 300 Query: 299 TGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQSY 358 + A +R + I ADS+ I L HG+ I + KG DSV G +++ Sbjct: 301 LKRTADFVRKNKYESARIIADSSEPRSIDALKLEHGINRIEGAKKGPDSVEHGERWLDEL 360 Query: 359 -RFVVDP-RCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAIDALRYAMQKFM 409 V+DP R + E Y DK G+ + D +NH IDA RYA ++ M Sbjct: 361 DAIVIDPLRTPNIAREFENIDYQTDKNGDPIPRLEDKDNHTIDATRYAFERDM 413 >gi|8730|lcl|protein:vir:96840 Length: 421 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240151;genbank:gi:66395816;genbank:GeneID :5133181 Length = 421 Score = 155 bits (392), Expect = 9e-40, Method: Compositional matrix adjust. 
Identities = 123/415 (29%), Positives = 196/415 (47%), Gaps = 13/415 (3%) Query: 4 TVNINLAELLSPAYYPLF---KDRSRYLVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWLV 60 T +INL+EL+ ++ L+ KD + V + ++ I +N +V Sbjct: 2 TTSINLSELIPEHFHDLWRATKDPNILNVVGKGGRGSGKSSDISIIITQLIMRYPMNAVV 61 Query: 61 LRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKITS 120 +R+ T S F +K V +LF K SP+E+ Y P G RI FRG +P ++ S Sbjct: 62 VRKTDNTLATSVFEQIKWAIEQQKVSHLFKVKVSPMEITYMPRGNRIIFRGAQNPERLKS 121 Query: 121 ITPTTGQLCRAWYEECYELKSLDGFNTVEES-LRGELDDPSGYYQSILTFNPWSERHFLK 179 + + W EE E K+ D T+ S LRGELD+ +Y+ ++NP + Sbjct: 122 LKDSRFPFSIMWIEELAEFKTEDEVTTITNSMLRGELDE-GLFYKFFFSYNPPKRKQSWV 180 Query: 180 SEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGLV 239 ++ ++ + + + +TY DN + + +I + RN R R LG+ + G+V Sbjct: 181 NKKYESSFQPDNTFVHHSTYLDNPFIAKQFIDEAEAAKERNELRYRWEYLGE-AIGSGVV 239 Query: 240 -FDGLFEERIIDK--QAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHG 296 F+ L E+I D+ ++ + +++ DFG+ DP A V D+ RV+Y DE Y Sbjct: 240 PFNNLQIEKIPDELFRSFDNIRNAV--DFGYATDPLAFVRWHYDKKKRVIYAVDEYYGVQ 297 Query: 297 LLTGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQ 356 + Q + L + I ADSA I EL + HG+ I KG DSV G Q++ Sbjct: 298 ISNRQFGKWLWSKGYQSDDIYADSAEPKSIDELRKEHGIKRIKGVKKGPDSVEYGEQWLN 357 Query: 357 SYR-FVVDP-RCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAIDALRYAMQKFM 409 V+DP R + E ++ DK GN D +NH IDA RYA+++ M Sbjct: 358 DLDAIVIDPNRTPNIAREFENIDFETDKDGNVKPKLEDKDNHTIDATRYALERDM 412 >gi|11736|lcl|protein:vir:79016 Length: 417 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110720;genbank:gi:134287337;genbank:Ge neID:4955190 Length = 417 Score = 143 bits (361), Expect = 4e-36, Method: Compositional matrix adjust. 
Identities = 120/411 (29%), Positives = 201/411 (48%), Gaps = 39/411 (9%) Query: 17 YYPLFKD----RSRYLVYKGSRGSGKSYAAAIKVLVDTIAHPY--VNWLVLRQYAYTNRD 70 + P FK+ + RY KGS GSGKS A ++ Y N LV+R+ T++ Sbjct: 11 FNPDFKEANFTKKRYRAMKGSAGSGKSVNVAQDYILKLGDKKYQGANLLVVRKSEATHKY 70 Query: 71 STFATLKKVASDLNVYNLFT---WKSS--PLEVVYKPTGQRIFFRGMDSPL---KITSIT 122 ST+A L + +Y WK++ PLE+ K TG I FRG++ K+ SI Sbjct: 71 STYAELTGAIN--RIYGKQADKYWKTTLNPLEIKSKVTGNSIIFRGVNDAKQREKLKSIN 128 Query: 123 PTTGQLCRAWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHFLKSEF 182 + G+L W EE EL D + +++ LRG L +P+ YYQ TFNP S H++K ++ Sbjct: 129 FSKGKLTWVWCEEATELMESD-IDILDDRLRGILTNPNLYYQMTFTFNPVSATHWIKRKY 187 Query: 183 FDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGLVFDG 242 FD + ++ +TY N ++E+Y + ++ ++P +V LG+WG G + Sbjct: 188 FD--YKNDDIFTHHSTYLQNRFIDEAYYRRMQMRKEQDPEGYKVYGLGEWGETGGAILKN 245 Query: 243 -LFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLLTGQ 301 + E + + + + DFGF H A V + + +Y+ +E+Y H + T + Sbjct: 246 YVIHEFPTESEYFDNM--RLSQDFGFNH---ANVVLRIGFKDGELYICNEIYAHEMDTSE 300 Query: 302 IAQQLRAHMAYGIPIT----ADSAGSNLIAELTRVHGVPGILPSG--KGKDSVVQGIQYM 355 I ++ + G+ T DSA + I ++ G G KG SV I Y+ Sbjct: 301 I---IKIANSIGLEKTLFMYCDSAEPDRI----KMWKSAGYKAKGVKKGPGSVKAQIDYL 353 Query: 356 QSYRFVVDPRCKGLLEELNLYVYDRD-KAGNWLNTPVDANNHAIDALRYAM 405 + R V P C ++E+ + + +D + G +L+ PV+ + A+ ALRY++ Sbjct: 354 KQLRIHVHPSCTNTIKEIQQWKWKQDERTGLYLDEPVEFMDDAMAALRYSI 404 >gi|13903|lcl|protein:vir:9870 Length: 273 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795632;genbank:gi:28876409;genbank:GeneID :1257943 Length = 273 Score = 102 bits (254), Expect = 8e-24, Method: Compositional matrix adjust. 
Identities = 81/261 (31%), Positives = 120/261 (45%), Gaps = 5/261 (1%) Query: 152 LRGELDDPSGYYQSILTFNPWSERHFLKSEFFDEATRRSGVYATTTTYKDNDHLNESYIK 211 +RGEL D +Y+ T+NP + ++ ++ + S + +TYKDN + + +I Sbjct: 1 MRGELGD-GLFYKFFYTYNPPKRKQSWVNKKYESQFQPSNTFVHASTYKDNPFIAKEFIA 59 Query: 212 SLKEMLVRNPNRARVAVLGDWGVAEGLV-FDGLFEERIIDKQAIQRLPHSIGLDFGFKHD 270 + R+ R R LG+ + G+V FD L ERI D+Q G+D+G+ D Sbjct: 60 EAEATRERSERRYRWEYLGE-AIGSGVVPFDNLRFERITDEQVADFDNIRNGIDYGYATD 118 Query: 271 PTAAVFTAVDQVNRVVYVYDEVYKHGLLTGQIAQQLRAHMAYGIPITADSAGSNLIAELT 330 P A V D+ +Y DE Y + Q+A+ L + A+SA AEL Sbjct: 119 PLAFVRWHYDKKKNGIYAIDEYYGQKISNRQLAKWLTTKGYQSDEMFAESAEPKSNAELK 178 Query: 331 RVHGVPGILPSGKGKDSVVQGIQYMQSYRFV-VDP-RCKGLLEELNLYVYDRDKAGNWLN 388 G+ I KG DSV G +++ F+ +DP R + E Y D+ GN Sbjct: 179 NEFGIKRIKGVKKGPDSVEFGERWLDDLDFICIDPKRTPNIAREFENIDYQVDRDGNPKP 238 Query: 389 TPVDANNHAIDALRYAMQKFM 409 D NHAIDA RYAM M Sbjct: 239 RLEDKVNHAIDATRYAMSDDM 259 >gi|13020|lcl|protein:vir:80960 Length: 443 # NCBI annotation: TerL # Family: family:all:54 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468388;genbank:gi:157324962;genbank:Ge neID:5601395 Length = 443 Score = 99.8 bits (247), Expect = 6e-23, Method: Compositional matrix adjust. 
Identities = 105/427 (24%), Positives = 187/427 (43%), Gaps = 41/427 (9%) Query: 7 INLAELLSPAYYPLFKDRSRYLVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWLVLRQYAY 66 IN+ ++++PA+Y L+ + +++ KG R S KS ++K++ +A+P N + LR+ A Sbjct: 15 INVTDMINPAFYDLWLSKHNHIIAKGGRSSMKSSVISLKLVEKKMANPMSNMVCLRKVAN 74 Query: 67 TNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKITSITPTTG 126 T S + +K ++ V + F + SP+E+++K G +F G D P K+ S+ G Sbjct: 75 TLYKSVYQQIKWALYEMGVADQFNFGKSPMEIIHKEWGTGFYFSGCDDPAKLKSMKIPVG 134 Query: 127 QLCRAWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSI-LTFNPWSERHFLKSEFFDE 185 + W+EE E + + VE++ E D P G +I ++FNP + +E+ D Sbjct: 135 YVSDLWFEELAEFSGVTDIDVVEDTFIRE-DLPQGQEVTIYMSFNPPRNPYEWVNEYVDS 193 Query: 186 ATRRSGVYATTTTYKDNDH--LNESYIKSLKEMLVRNPNRARVAVLGD-WGVAEGLVFDG 242 TTY D++ L++ IK +++ + + R LG+ G+ + + Sbjct: 194 KRSDDDYLIHHTTYLDDEKGFLSKQIIKKIEKYKKNDLDYYRWMYLGEVIGLGDNVYNMN 253 Query: 243 LFEERIIDKQAIQRLPHSIGLDFGFK--HDPTAAVFTAVD-QVNRVVYVYDEVYKHGL-- 297 LF+ +AI I +DF H +A A+ R V + D Y Sbjct: 254 LFQPL----KAIPADDRLILIDFAIDTGHQVSATTCLALGFTAKRNVILLDTYYYSPANQ 309 Query: 298 ----LTGQIAQQLRAHMA-----YGIPI---TADSAGSNLIAELTRVHGVPGILPSGKGK 345 +++LR M Y P+ T DSA L + + +GV + P KGK Sbjct: 310 VVKKAPSDYSKELREFMTKVVSKYNAPVDMQTVDSAEGGLRNQYYKDYGV-SLHPVAKGK 368 Query: 346 ---------DSVVQGIQYMQSYRFVVDPRCKGLLEELNLYVYDRDKAGNWLNTPVDANNH 396 D + QG + ++ P + +EE Y +D + ++H Sbjct: 369 KVDMVDFVCDLLAQG-----RFYYLDIPENQIFIEEHRKYQWDVKTVNTDKPEVIKEDDH 423 Query: 397 AIDALRY 403 DA +Y Sbjct: 424 TCDAFQY 430 >gi|15836|lcl|protein:vir:37 Length: 443 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463463;swissprot:trembl:q9t1c1;genbank:gi :16798785;goa:Q9T1C1;uniprot:Q9T1C1;genbank:GeneID:92238 4 Length = 443 Score = 97.4 bits (241), Expect = 3e-22, Method: Compositional matrix adjust. 
Identities = 57/198 (28%), Positives = 102/198 (51%), Gaps = 2/198 (1%) Query: 7 INLAELLSPAYYPLFKDRSRYLVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWLVLRQYAY 66 IN+ ++++PA+Y L+ + +++ KG R S KS ++K++ +A+P N + LR+ A Sbjct: 15 INVIDMINPAFYDLWLSKHNHIIAKGGRSSMKSSVISLKLVEKKMANPMSNMVCLRKVAN 74 Query: 67 TNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKITSITPTTG 126 T S + +K ++ V + F + SP+E+V+K G +F G D P K+ S+ G Sbjct: 75 TLYKSVYQQIKWALYEMGVADQFKFGKSPMEIVHKTWGTGFYFSGCDDPAKLKSMKIPVG 134 Query: 127 QLCRAWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSI-LTFNPWSERHFLKSEFFDE 185 + W+EE E + + VE++ E D P G +I ++FNP + +E+ D Sbjct: 135 YVSGLWFEELAEFSGVTDIDVVEDTFIRE-DLPQGQEVTIYMSFNPPRNPYEWVNEYVDS 193 Query: 186 ATRRSGVYATTTTYKDND 203 TTY D++ Sbjct: 194 KRSDDDYLIHHTTYLDDE 211 >gi|10799|lcl|protein:vir:78055 Length: 440 # NCBI annotation: TerL # Family: family:all:54 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468786;genbank:gi:157325367;genbank:GeneID:5601817 Length = 440 Score = 94.0 bits (232), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 105/424 (24%), Positives = 188/424 (44%), Gaps = 25/424 (5%) Query: 4 TVNINLAELLSPAYYPLFK-----DRSRYLVYKGSRGSGKSYAAAIKVLVDTIAHPYVNW 58 +++IN+ +L+SPA++ +++ ++ KG RGSGKS A+ V+ + + P N Sbjct: 17 SIDINIFDLMSPAFHNIYQRVLDNTAPSHVWMKGGRGSGKSSFVALMVVDEIMKDPQANA 76 Query: 59 LVLRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYK--PTG--QRIFFRGMDS 114 ++ R+ R + + L V + P+ ++YK TG Q+I F+G+ Sbjct: 77 VIFRKVDEGMRTTLLPQYQWAIDQLGVSGAWRTSLQPMMLLYKNPETGLEQQIRFKGVKD 136 Query: 115 PLKITSITPTTGQLCRAWYEECYELKSLDGFNTVEES-LRGELDDPSGYYQSILTFNPWS 173 P ++ + G YEE E +S + F+ V S +RGE +G ++ +NP Sbjct: 137 PKRVKASKFRVGYAKYLIYEEADEYESEEDFSIVNSSYMRGE---GTGDSRAFYLYNPPK 193 Query: 174 ERHFLKSEFFDEATRRSGVYATTTTY-----KDNDHLNESYIKSLKEMLVRNPNRARVAV 228 + + + D Y +T+ + L ++++S + + +NPNR Sbjct: 194 YKGHWLNNWVDVIRDEPSQYVHHSTFIPIALHHPEWLGSTWLESARLVRDKNPNRYEWEF 253 Query: 229 LGDWGVAEGLVFDGLFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYV 288 LG VF +E I P+ G D G+ DP+ + D+ VY+ Sbjct: 254 LGRNVNTGNEVFPNAVQEHITFDMIDGLRPYE-GFDEGYTADPSVWLRVFYDEQRDTVYI 312 Query: 289 YDEVYKHGLLTGQIAQQLR--AHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKD 346 DE+ T +A+ + +Y I + DSA ++ E+ R GV L K + Sbjct: 313 TDELVMKRYKTKALAKDILNVQEGSYNI-VRGDSANPRVLDEM-RDLGV-NALAVSKSPN 369 Query: 347 SVVQGIQYMQS-YRFVVDPRCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAIDALRYAM 405 SV G ++ + + V+D +C E + Y D GN + D +NH ID RYA+ Sbjct: 370 SVPHGTNWLANRIKIVIDFKCPNTWREFSSYALLPDGVGNRKHGFPDKDNHTIDTTRYAL 429 Query: 406 QKFM 409 ++ + Sbjct: 430 EEVI 433 >gi|8380|lcl|protein:vir:96782 Length: 408 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224236;genbank:gi:62362371;genbank:GeneID :3345721 Length = 408 Score = 80.9 bits (198), Expect = 2e-17, Method: Compositional matrix adjust. 
Identities = 56/151 (37%), Positives = 74/151 (49%), Gaps = 6/151 (3%) Query: 262 GLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLLTGQIAQQL--RAHMAYGIPITAD 319 GLDFGF DPTA V ++ + VY+ E K GL A L R + AD Sbjct: 246 GLDFGFSQDPTAGVKCWLNGND--VYIEKEAGKVGLEIDHTADYLIKRIDGIDDAKVYAD 303 Query: 320 SAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQSYRFVVDPRCKGLLEELNLYVYD 379 SA I+ L R G+P I K K SV G+++++S R +DP C ++E Y Y Sbjct: 304 SARPESISLLKRT-GIPRIEGVPKWKGSVEDGVEWLRSKRIFIDPECTETIKEFTYYSYK 362 Query: 380 RDK-AGNWLNTPVDANNHAIDALRYAMQKFM 409 D+ G N VDA NH IDA+RY + Sbjct: 363 TDRYTGEIKNQLVDAYNHYIDAIRYCFNDMI 393 >gi|2931|lcl|protein:vir:105460 Length: 409 # NCBI annotation: putative phage terminase large subunit B # Family: family:all:54 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529870;genbank:gi:90592610;genbank:GeneID :3974524 Length = 409 Score = 73.2 bits (178), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 63/249 (25%), Positives = 117/249 (46%), Gaps = 17/249 (6%) Query: 166 ILTFNPWSERHFLKSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRAR 225 I NP H+LK+++ D ++ + + T T DN L++ Y++S+K R R Sbjct: 156 ICDTNPDIPTHWLKTDYIDNHDPKARIKSFTFTIDDNTFLSKDYVESIKAATPRGMFYDR 215 Query: 226 VAVLGDWGVAEGLVFDGLFEERII--DKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVN 283 +LG W +G+V+ ++ ++ + L + +G+D+G++H P + D+ Sbjct: 216 -GILGQWVTGDGIVYQDFNKDTMVIPKNRVPDGLDYYVGVDWGYEH-PNPIILLGDDKDG 273 Query: 284 RVVYVYDEVYKHGLLT--GQIAQQLRAHMAYGIPITADSAGSNLIAE-----LTRVHGVP 336 + D KH + ++AQ L+ + ADSA + + E L ++ Sbjct: 274 NTYVLEDYTQKHKFINYWVKVAQNLQTRFGRNLIFYADSARPDNVNEFQSNGLNCINANK 333 Query: 337 GILPSGKGKDSVVQGIQYMQSYRFVVDPRCKGLLEELNLYVYDRDKAGNWLNTPVDANNH 396 +LP G + V + ++ + Y VVD GLL+E+ Y +D + G L +N Sbjct: 334 NVLP---GIECVARKMREGKFY--VVDTASSGLLDEIYQYAWD-ESTGLPLKENDVRHND 387 Query: 397 AIDALRYAM 405 +DA+RYA+ Sbjct: 388 RLDAIRYAI 396 >gi|19292|lcl|protein:vir:4781 Length: 436 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: 
genbank:acc:NP_150161;swissprot:trembl:q94m50;genbank:gi:15088772;goa:Q94M50;uniprot:Q94M50;genbank:GeneID:955976 Length = 436 Score = 68.9 bits (167), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 55/206 (26%), Positives = 94/206 (45%), Gaps = 12/206 (5%) Query: 5 VNINLAELLSPAYYPLFKDRSRYLVYKGSRGSGKSYAAAIKV----LVDTIAHPYVNWLV 60 + N+ + ++P + ++ Y V KG R S KS +K+ + IA N +V Sbjct: 1 MTFNVQKNINPHFKSVWISSLPYNVLKGGRNSFKSSVIVLKLAYMMIRYIIAGEAANIVV 60 Query: 61 LRQYAYTNRDSTFATLKKVASDLNVYNL---FTWKSSPLEVVYKPTGQRIFFRGMDSPLK 117 +R+ A T RDS F KV LN++ + FT SP ++V+K TG +F G D K Sbjct: 61 IRKVANTIRDSVF---NKVWWALNLFGIAEQFTKTVSPFKIVHKTTGSTFYFYGQDDFQK 117 Query: 118 ITSITPTTGQLCRAWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHF 177 + S G + WYEE E + F+ + + + + Q ++NP + Sbjct: 118 LKS--NDIGNIIPVWYEEAAEFNDQEDFDQSNVTFMRQKHPRAKFVQFFWSYNPPRNPYS 175 Query: 178 LKSEFFDEATRRSGVYATTTTYKDND 203 +E+F+ A ++TY D++ Sbjct: 176 WINEWFESIKTNKNYLAHSSTYLDDE 201 >gi|655|lcl|protein:vir:1588 Length: 447 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695170;swissprot:trembl:o03927;genbank:gi:23455799;goa:O03927;interpro:IPR006437;interpro:IPR006701;interpro:IPR011441;uniprot:O03927;genbank:GeneID:955565 Length = 447 Score = 65.1 bits (157), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 46/198 (23%), Positives = 90/198 (45%), Gaps = 6/198 (3%) Query: 7 INLAELLSPAYYPLFKDRSRYLVYKGSRGSGKSYAAAIKVLV----DTIAHPYVNWLVLR 62 I +++L++P + ++ Y+V G RGS KS ++K++ + H N + + Sbjct: 15 IKISDLINPHFKRMWTTDKPYIVANGGRGSFKSSVISLKLVTMVKKAIMQHRKANVIAVL 74 Query: 63 QYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKITSIT 122 D+ + ++ S L++ N F SPL + +K TG +F G D+P K+ S Sbjct: 75 ANKSDLHDTVYNQIQWALSMLDMDNEFIAYKSPLTIQHKRTGSSFYFYGADNPYKLKS-- 132 Query: 123 PTTGQLCRAWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHFLKSEF 182 G + WYEE +KS D F+ + + + + ++NP + +E+ Sbjct: 133 NIVGDVVAVWYEEAANMKSSDVFDQANPTFIRQKPEWLDQVKVFYSYNPPKNPYDWINEW 192 Query: 183 FDEATRRSGVYATTTTYK 200 D+ ++ T+ Y+ Sbjct: 193 IDKVSKDDNYLIDTSDYR 210 >gi|1165|lcl|protein:vir:102941 Length: 421 # NCBI annotation: terminase, large subunit # Family: family:all:54 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945278;genbank:gi:39653713;goa:Q708N4;interpro:IPR004921;interpro:IPR006437;uniprot:Q708N4;genbank:GeneID:2672855 Length = 421 Score = 56.6 bits (135), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 66/255 (25%), Positives = 114/255 (44%), Gaps = 38/255 (14%) Query: 170 NPWSERHFLKSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVL 229 NP H+ K + D+A ++ +Y DN L+E+ IK + R + Sbjct: 173 NPDGPYHWFKVNWIDKAETKNMLYLHFDM-DDNLSLSEN-IKKRYRSQYQGVFYQRY-IQ 229 Query: 230 GDWGVAEGLVFDGLFEERIIDKQAIQRLPHS------IGLDFGFKHDPTAAVFTAVDQVN 283 G W VAEG+V+D +F + DK + LP + +D+G + T + D + Sbjct: 230 GLWTVAEGIVYD-MFSK---DKHVVSTLPEMSKLGKYVSVDYG-TQNATVFLLWEKDIIG 284 Query: 284 RVVYVYDEVYKHGLLTGQIAQQLRAHMAYGIP----------ITADSAGSNLIAELT-RV 332 + Y+ E Y G + Q+ A A + I D + ++ IAEL R Sbjct: 285 KY-YLTREYYYSG--RDENVQKTNAEYADDLTAWLGDTNIDRIIIDPSAASFIAELKKRG 341 Query: 333 HGVPGILPSGKGKDSVVQGIQYMQSY----RFVVDPRCKGLLEELNLYVYDRDKAGNWLN 388 + + K +++V++GI+++ S + V C L+E + YV+D + N + Sbjct: 342 YKIK------KARNNVLEGIRFVGSMLGQEKIAVHESCVNTLKEFHAYVWDEKASANGED 395 Query: 389 TPVDANNHAIDALRY 403 P+ +HA+DALRY Sbjct: 396 KPIKQFDHAMDALRY 410 >gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: terminase, large subunit # Family: family:all:147 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006983;genbank:gi:46401884;genbank:GeneID :2777679 Length = 438 Score = 55.1 bits (131), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 59/228 (25%), Positives = 96/228 (42%), Gaps = 30/228 (13%) Query: 198 TYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGLVFDGL-----------FEE 246 TY+DN + + I+ + + +N R D+ V EG +FD Sbjct: 214 TYRDNPRADLNDIEEARRTVSKNYFRQEYE--ADFSVFEGQIFDTFNAIDHVKDLKGMRH 271 Query: 247 RIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLLTGQIAQQL 306 D +A + L +G+D G++ DPTA + YV +E + T Q A + Sbjct: 272 FFKDDEAFETL---LGIDVGYR-DPTAVLTIKYHYDTDTYYVLEEYQQAEKTTAQHAAYI 327 Query: 307 RAHM-AYGIP-ITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQSY----RF 360 + + Y + I DSA + +L H + S K SV+ G+ +Q+ + Sbjct: 328 QHCIDRYKVDRIFVDSAAAQFRQDLAYEHEIA----SAPAKKSVLDGLACLQALFQQGKI 383 Query: 361 VVDPRCKGLLEELNLYVYDRDKAGNWLNTPV---DANNHAIDALRYAM 405 +VD C L+ L Y +D + L+ DAN+H DALRY + Sbjct: 384 IVDASCSSLIHALQNYKWDFQEGEEKLSREKPRHDANSHLCDALRYGI 431 >gi|2819|lcl|protein:vir:105705 Length: 439 # NCBI annotation: gp2 # Family: family:all:54 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224140;genbank:gi:62362215;genbank:GeneID :3342458 Length = 439 Score = 54.3 bits (129), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 46/176 (26%), Positives = 77/176 (43%), Gaps = 29/176 (16%) Query: 262 GLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGL-----------LTGQIAQQLRAHM 310 G DFGF DP+ + + ++ +Y+ E Y +G+ T +QL+ Sbjct: 253 GADFGFAKDPSTLIRMFI--LDNNLYIEYEAYGNGVELDDMWKFYAGKTDATPKQLKDWK 310 Query: 311 ------------AYGIPITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQSY 358 A PI AD++ I+ + + G I + K + SV GI +++ + Sbjct: 311 VTDDTKFPGIPEARKWPIKADNSRPETISHI-KGQGF-NISAAQKWQGSVEDGITFLRGF 368 Query: 359 -RFVVDPRCKGLLEELNLYVYDRDK-AGNWLNTPVDANNHAIDALRYAMQKFMFVK 412 + ++ PRCK +E LY Y D+ G L D NNH D +RY + ++ K Sbjct: 369 KKIIIHPRCKETAKEARLYSYKTDRITGEVLPIIEDKNNHCWDGIRYGLDGYIKCK 424 >gi|18891|lcl|protein:vir:8847 Length: 482 # NCBI annotation: terminase # Family: family:all:1730 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775255;genbank:gi:27476053;genbank:GeneID :2700601 Length = 482 Score = 52.4 bits (124), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 67/264 (25%), Positives = 105/264 (39%), Gaps = 39/264 (14%) Query: 167 LTFNPWSERHFLKSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARV 226 LTF P + +F + + G + +++D HL+ + L + V +P R+ Sbjct: 199 LTFTPEHGLTEIVKDFLQDL--KPGQFLIHASWEDAPHLSPEVKEQL--LSVYSPAERRM 254 Query: 227 AVLGDWGVAEGLVFDGLFEERIIDKQAIQRLPHSI-GLDFGFKHDPTAAVFTAVDQVNRV 285 G + G+VF L E+ + + I H I G+D GF H P A A D Sbjct: 255 RAEGIPMLGSGVVFPILEEKFVCEPFDIPDHFHRIIGIDLGFDH-PNAIACVAWDAEKDK 313 Query: 286 VYVYDEVYKHGLLTGQIAQQLRAHMAYGIPIT---------ADSAGSNLIAELTRVHGV- 335 Y+YDE + G G A + + IP+ ++G + L H + Sbjct: 314 YYLYDERSESGETLGMHADAIYLKGGHQIPVVVPHDAFKHDGATSGRRFVDLLKDDHNLN 373 Query: 336 ---------PGILPSGK-GKDSVVQGIQY----MQSYRFVVDPRCKGLLEELNLYVYDRD 381 PG P GK G +SV G+ + M++ V C L+E+ +Y Sbjct: 374 VVYEPFSNPPG--PDGKHGGNSVEFGVNWMLTRMENGDLKVFNTCTNFLKEMKMYHRKDG 431 Query: 382 KAGNWLNTPVDANNHAIDALRYAM 405 K VD N+ I A RYA+ Sbjct: 432 KI-------VDRNDDMISATRYAL 448 >gi|6775|lcl|protein:vir:96067 Length: 460 # NCBI annotation: conserved hypothetical protein ORF004 # Family: family:all:54 # 
MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294421;genbank:gi:149408318;genbank:GeneID:5237186 Length = 460 Score = 47.4 bits (111), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 54/214 (25%), Positives = 99/214 (46%), Gaps = 19/214 (8%) Query: 13 LSPAYYPLFKDRSRYLVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWLVLRQYAYTNRDST 72 L+PA +++ R+RY V G R S KS+ A + V A+ + +L RQ+ +S Sbjct: 4 LNPALRAVWRTRARYKVIYGGRASSKSHDAG-GIAVYLAANYRLKFLCARQFQNRISESV 62 Query: 73 FATLK-KVA-SDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKITSITPTTGQLCR 130 + +K K+ S+ N +FT S + +K TG F G+ L + I T G + Sbjct: 63 YTLIKDKIENSEYNGEFIFTKNS----IKHKRTGSEFLFYGIARNL--SEIKSTEG-IDI 115 Query: 131 AWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHFLKSEFFDEATRRS 190 W EE + L + + + +E ++R E + + FNP F+ F + + + Sbjct: 116 LWLEEAHYL-TQEQWEVIEPTIRKE------NSEIWIIFNPNEVTDFVYQNFVVKPPKDA 168 Query: 191 GVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRA 224 + + +N L+E+ +K + E R+ ++A Sbjct: 169 --FVKMINWNENPFLSETMLKVIHEAYERDKDQA 200 >gi|6886|lcl|protein:vir:106551 Length: 424 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958579;genbank:gi:41179257;genbank:GeneID:2717087 Length = 424 Score = 47.0 bits (110), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 58/256 (22%), Positives = 109/256 (42%), Gaps = 25/256 (9%) Query: 170 NPWSERHFLKSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVL 229 NP H+ K + D+ + + T + DN L+ I + M + + + Sbjct: 167 NPSGPFHWFKLNWIDQMKDKRALRIHFTMH-DNPSLDSVTINRYERMY--SGVFYQRYIQ 223 Query: 230 GDWGVAEGLVFDGLFEERIIDKQAIQRLPHS-----IGLDFGFKHDPTAAVFTAVDQVNR 284 G W ++EG+++D F++ D + LP+ + D+G +PTA F + + Sbjct: 224 GLWVMSEGVIYDN-FDK---DTMVVNELPNHFEKYYVSCDYG-TLNPTA--FLLWGRNHG 276 Query: 285 VVYVYDEVYKHGLLTGQIAQQLRAHMAYGIPITADSAGSNLIAELTRVHGVPGILPSG-- 342 V Y+ E Y G T + Q+ + + + +I + + + +G Sbjct: 277 VWYLVKEYYYSGRTTSR--QKTDEEYCHDLKEFLGDIRAEMIIDPSAASFSTTLRQNGFK 334 Query: 343 --KGKDSVVQGIQYMQSY----RFVVDPRCKGLLEELNLYVYDRDKAGNWLNTPVDANNH 396 K K+ V+ GI+ Q+ + C L +EL YV+D A + + PV ++H Sbjct: 335 VRKAKNDVLDGIRVTQTAMNEGKIKFSMNCPNLFKELASYVWDDKAAEHGEDKPVKQHDH 394 Query: 397 AIDALRYAMQKFMFVK 412 A DA+RY + ++ K Sbjct: 395 ACDAMRYFVYTIIYKK 410 >gi|5794|lcl|protein:vir:98922 Length: 453 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164412;genbank:gi:56694902;genbank:GeneID :3197313 Length = 453 Score = 47.0 bits (110), Expect = 4e-07, Method: Compositional matrix adjust. 
Identities = 33/120 (27%), Positives = 55/120 (45%), Gaps = 4/120 (3%) Query: 5 VNINLAELLSPAYYPLFKDRSRYLVYKGSRGSGKSYAAAIKV----LVDTIAHPYVNWLV 60 V N+ E ++P + ++ Y + KG R S KS A+K+ L+ + N +V Sbjct: 9 VMFNVQENINPHFKEVWTSSKPYNILKGGRNSFKSSVIALKLVFMMLLYILKGEKANVVV 68 Query: 61 LRQYAYTNRDSTFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKITS 120 +R+ T RDS F ++ + F SP ++ +K TG +F G D K+ S Sbjct: 69 IRKVGNTIRDSVFNKIQWAIKLFGLTRRFKPTVSPFKITHKRTGSTFYFYGQDDFQKLKS 128 >gi|17761|lcl|protein:vir:171 Length: 470 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112076;genbank:gi:13559866;genbank:GeneID :921009 Length = 470 Score = 46.6 bits (109), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 57/226 (25%), Positives = 93/226 (41%), Gaps = 30/226 (13%) Query: 13 LSPAYYPLFKDRSRYLVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWLVLRQYAYTNRDST 72 ++P + P F + RY V KG RGSGKS+A A ++LV+ V L R+ + DS Sbjct: 4 INPIFEP-FIEAHRYKVAKGGRGSGKSWAIA-RLLVEAARRQPVRILCARELQNSISDSV 61 Query: 73 FATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGM-DSPLKITSITPTTGQLCRA 131 L+ F + S + + T F G+ ++P KI S+ +C Sbjct: 62 IRLLEDTIEREGYSAEFEIQRSMIR--HLGTNAEFMFYGIKNNPTKIKSLEGI--DIC-- 115 Query: 132 WYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHFLKSEFFDEATRR-- 189 W EE E + + ++ + ++R + + ++FNP D+ +R Sbjct: 116 WVEEA-EAVTKESWDILIPTIR------KPFSEIWVSFNP--------KNILDDTYQRFV 160 Query: 190 ----SGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGD 231 + T Y DN H E ++E RNP R LG+ Sbjct: 161 VNPPDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGE 206 >gi|5142|lcl|protein:vir:105417 Length: 470 # NCBI annotation: gene 2 protein # Family: family:all:54 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958178;genbank:gi:41057280;genbank:GeneID :2716664 Length = 470 Score = 46.2 bits (108), Expect = 7e-07, Method: Compositional matrix adjust. 
Identities = 57/226 (25%), Positives = 93/226 (41%), Gaps = 30/226 (13%) Query: 13 LSPAYYPLFKDRSRYLVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWLVLRQYAYTNRDST 72 ++P + P F + RY V KG RGSGKS+A A ++LV+ V L R+ + DS Sbjct: 4 INPIFEP-FIEAHRYKVAKGGRGSGKSWAIA-RLLVEAARRQPVRILCARELQNSISDSV 61 Query: 73 FATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGM-DSPLKITSITPTTGQLCRA 131 L+ F + S + + T F G+ ++P KI S+ +C Sbjct: 62 IRLLEDTIEREGYSAEFEIQRSMIR--HLGTNAEFMFYGIKNNPTKIKSLEGI--DIC-- 115 Query: 132 WYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHFLKSEFFDEATRR-- 189 W EE E + + ++ + ++R + + ++FNP D+ +R Sbjct: 116 WVEEA-EAVTKESWDILIPTIR------KPFSEIWVSFNP--------KNILDDTYQRFV 160 Query: 190 ----SGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGD 231 + T Y DN H E ++E RNP R LG+ Sbjct: 161 VNPPDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGE 206 >gi|4261|lcl|protein:vir:94806 Length: 402 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240530;genbank:gi:66396200;genbank:GeneID :5133586 Length = 402 Score = 44.3 bits (103), Expect = 3e-06, Method: Compositional matrix adjust. 
Identities = 60/261 (22%), Positives = 112/261 (42%), Gaps = 39/261 (14%) Query: 166 ILTFNPWSERHFLKSEFFDEATRRSG-----VYATTTTYKDNDHLNESYIKSLKEMLVRN 220 ++ NP + H +K ++ D++ +R + A T DN L+E YI+S+ + Sbjct: 148 LIDTNPENPMHPVKKDYIDKSGQRLSNGRLNIKAFQFTLFDNTFLDEEYIESI---IAST 204 Query: 221 PNRARVA--VLGDWGVAEGLVFDG-------LFEERIIDKQAIQRLPHSIGLDFGFKHDP 271 P + G W AEG+V+ + EE KQ ++ G+D+G++H Sbjct: 205 PTGMFTDRDIYGKWVSAEGVVYKDFKEKVHYITEEEFKTKQIKRKYA---GVDWGYEH-- 259 Query: 272 TAAVFTAVDQVNRVVYVYDE-VYKHGLLTGQIAQQLRAHMAYG-IPITADSAGSNLIAEL 329 ++ + + YV +E ++H + +A +G I D+A I Sbjct: 260 YGSIMVVAEDFDGNKYVIEEHAHRHKEIDDWVAIAKGVIKRHGDILFYCDTARPEHIERF 319 Query: 330 TRVHGVPGILPSGKGKDSVVQGIQYMQ-----SYRFVVDPRCKGLLEELNLYVYDRDKAG 384 R + + +V+ GI+ + + F++ + EE+ YV+ +D A Sbjct: 320 RREK-----IKARYADKAVIAGIEVISRLFKLNKIFIIKEKVSLFKEEIYNYVW-KDNA- 372 Query: 385 NWLNTPVDANNHAIDALRYAM 405 + PV N+ +DALRYA+ Sbjct: 373 ---DEPVKLNDDTLDALRYAV 390 >gi|14951|lcl|protein:vir:1235 Length: 405 # NCBI annotation: similar to phage O1205 ORF26 ( putative large subunit terminase) # Family: family:all:54 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510934;genbank:gi:17426268;genbank:GeneID :927381 Length = 405 Score = 44.3 bits (103), Expect = 3e-06, Method: Compositional matrix adjust. 
Identities = 60/261 (22%), Positives = 112/261 (42%), Gaps = 39/261 (14%) Query: 166 ILTFNPWSERHFLKSEFFDEATRRSG-----VYATTTTYKDNDHLNESYIKSLKEMLVRN 220 ++ NP + H +K ++ D++ +R + A T DN L+E YI+S+ + Sbjct: 151 LIDTNPENPMHPVKKDYIDKSGQRLSNGRLNIKAFQFTLFDNTFLDEEYIESI---IAST 207 Query: 221 PNRARVA--VLGDWGVAEGLVFDG-------LFEERIIDKQAIQRLPHSIGLDFGFKHDP 271 P + G W AEG+V+ + EE KQ ++ G+D+G++H Sbjct: 208 PTGMFTDRDIYGKWVSAEGVVYKDFKEKVHYIKEEEFKTKQIKRKYA---GVDWGYEH-- 262 Query: 272 TAAVFTAVDQVNRVVYVYDE-VYKHGLLTGQIAQQLRAHMAYG-IPITADSAGSNLIAEL 329 ++ + + YV +E ++H + +A +G I D+A I Sbjct: 263 YGSIMVVAEDFDGNKYVIEEHAHRHKEIDDWVAIAKGVIKRHGDILFYCDTARPEHIERF 322 Query: 330 TRVHGVPGILPSGKGKDSVVQGIQYMQ-----SYRFVVDPRCKGLLEELNLYVYDRDKAG 384 R + + +V+ GI+ + + F++ + EE+ YV+ +D A Sbjct: 323 RREK-----IKARYADKAVIAGIEVISRLFKLNKIFIIKEKVSLFKEEIYNYVW-KDNA- 375 Query: 385 NWLNTPVDANNHAIDALRYAM 405 + PV N+ +DALRYA+ Sbjct: 376 ---DEPVKLNDDTLDALRYAV 393 >gi|1596|lcl|protein:vir:93748 Length: 403 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240453;genbank:gi:66396122;genbank:GeneID :5133517 Length = 403 Score = 44.3 bits (103), Expect = 3e-06, Method: Compositional matrix adjust. 
Identities = 60/261 (22%), Positives = 112/261 (42%), Gaps = 39/261 (14%) Query: 166 ILTFNPWSERHFLKSEFFDEATRRSG-----VYATTTTYKDNDHLNESYIKSLKEMLVRN 220 ++ NP + H +K ++ D++ +R + A T DN L+E YI+S+ + Sbjct: 149 LIDTNPENPMHPVKKDYIDKSGQRLSNGRLNIKAFQFTLFDNTFLDEEYIESI---IAST 205 Query: 221 PNRARVA--VLGDWGVAEGLVFDG-------LFEERIIDKQAIQRLPHSIGLDFGFKHDP 271 P + G W AEG+V+ + EE KQ ++ G+D+G++H Sbjct: 206 PTGMFTDRDIYGKWVSAEGVVYKDFKEKVHYIKEEEFKTKQIKRKYA---GVDWGYEH-- 260 Query: 272 TAAVFTAVDQVNRVVYVYDE-VYKHGLLTGQIAQQLRAHMAYG-IPITADSAGSNLIAEL 329 ++ + + YV +E ++H + +A +G I D+A I Sbjct: 261 YGSIMVVAEDFDGNKYVIEEHAHRHKEIDDWVAIAKGVIKRHGDILFYCDTARPEHIERF 320 Query: 330 TRVHGVPGILPSGKGKDSVVQGIQYMQ-----SYRFVVDPRCKGLLEELNLYVYDRDKAG 384 R + + +V+ GI+ + + F++ + EE+ YV+ +D A Sbjct: 321 RREK-----IKARYADKAVIAGIEVISRLFKLNKIFIIKEKVSLFKEEIYNYVW-KDNA- 373 Query: 385 NWLNTPVDANNHAIDALRYAM 405 + PV N+ +DALRYA+ Sbjct: 374 ---DEPVKLNDDTLDALRYAV 391 >gi|9935|lcl|protein:vir:97273 Length: 402 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240605;genbank:gi:66396276;genbank:GeneID :5133629 Length = 402 Score = 41.2 bits (95), Expect = 2e-05, Method: Compositional matrix adjust. 
Identities = 59/261 (22%), Positives = 112/261 (42%), Gaps = 39/261 (14%) Query: 166 ILTFNPWSERHFLKSEFFDEATRRSG-----VYATTTTYKDNDHLNESYIKSLKEMLVRN 220 ++ NP + H +K ++ D++ +R + A T DN L+E YI+S+ + Sbjct: 148 LIDTNPENPMHPVKKDYIDKSGQRLSNGRLNIKAFQFTLFDNTFLDEEYIESI---IAST 204 Query: 221 PNRARVA--VLGDWGVAEGLVFDG-------LFEERIIDKQAIQRLPHSIGLDFGFKHDP 271 P + G W AEG+V+ + EE KQ ++ G+D+G++H Sbjct: 205 PTGMFTDRDIYGKWVSAEGVVYKDFKEKVHYITEEEFKTKQIKRKYA---GVDWGYEH-- 259 Query: 272 TAAVFTAVDQVNRVVYVYDE-VYKHGLLTGQIAQQLRAHMAYG-IPITADSAGSNLIAEL 329 ++ + + YV +E ++H + +A +G I D+A I Sbjct: 260 YGSIMVVAEDFDGNKYVIEEHAHRHKEIDDWVAIAKGVIKRHGDILFYCDTARPEHIERF 319 Query: 330 TRVHGVPGILPSGKGKDSVVQGIQYMQS-YRF----VVDPRCKGLLEELNLYVYDRDKAG 384 R + + +V+ GI+ + ++ ++ + EE+ YV+ +D A Sbjct: 320 RREK-----IKARYADKAVIAGIEVISRLFKLNKISIIKEKVSLFKEEIYNYVW-KDNA- 372 Query: 385 NWLNTPVDANNHAIDALRYAM 405 + PV N+ +DALRYA+ Sbjct: 373 ---DEPVKLNDDTLDALRYAV 390 >gi|13692|lcl|protein:vir:4897 Length: 411 # NCBI annotation: gp411 # Family: family:all:54 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056675;genbank:gi:9635008;genbank:GeneID: 1262679 Length = 411 Score = 41.2 bits (95), Expect = 2e-05, Method: Compositional matrix adjust. 
Identities = 61/263 (23%), Positives = 109/263 (41%), Gaps = 28/263 (10%) Query: 170 NPWSERHFLKSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVL 229 NP + H+L ++ + + + + DN L++ YI S+K + + R +L Sbjct: 160 NPDNPNHWLNRDYIGKNDGK--IIDFSFKLDDNTFLSKRYIDSIKAVTPKGKFYDR-DIL 216 Query: 230 GDWGVAEGLVFDGLFEERIIDKQAIQRLPHSI----GLDFGFKHDPTAAVFTAVDQVNRV 285 G W VAEG ++ ++ +I + LP G+D+G+ H ++ + V+ Sbjct: 217 GHWTVAEGAIY-ADYDSKI---HVVDELPEMKRYFGGIDWGYTH--YGSIVIVGEGVDNN 270 Query: 286 VYVYDEVYKHGLLTGQIAQQLRAHMA-YG-IPITADSAGSNLIAELTRVHGVPGILPSGK 343 Y+ D V +Q R YG IP ADSA +A G S Sbjct: 271 FYLVDGVRAQFKEIDWWVEQARKLTGIYGNIPFYADSARPEHVARFEN----EGFDISNA 326 Query: 344 GKDSVVQGIQ-----YMQSYRFVVDPRCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAI 398 K SV+ GI+ + + +V +E+ Y R K + + P+ + + Sbjct: 327 NK-SVIAGIELIAKLFKEQKLYVKRGFVPRFFDEIYQY---RWKENSTKDEPLKEFDDVL 382 Query: 399 DALRYAMQKFMFVKNGQYMSYTD 421 D++RYA+ + + + SY D Sbjct: 383 DSVRYAIYSDYVIGSTERASYDD 405 >gi|16299|lcl|protein:vir:3027 Length: 375 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438140;genbank:gi:16271803;genbank:GeneID :929285 Length = 375 Score = 38.9 bits (89), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 82/358 (22%), Positives = 149/358 (41%), Gaps = 74/358 (20%) Query: 98 VVYKPTG-QRIFFRG---MDSPLKITSITPTTGQLCRA------WYEECYELKSLDGFNT 147 ++ P G +R++++G ++S IT ++ + C + +EC+ T Sbjct: 41 LITTPKGNKRVYYKGGGKVNSVGAITGMSLGSVVFCEINLLHMDFIQECFR-------RT 93 Query: 148 VEESLRGELDDPSGYYQSILTFNPWSERHFLKSEFFDEATRRSGVYATTTTYKDNDHLNE 207 LR L D NP + +H + + FD R T T DN L Sbjct: 94 WAAKLRYHLAD----------LNPPAPQHPVIKDVFDVQNTR----WTHWTMDDNPILTA 139 Query: 208 SYIKSLKEMLVRNPNRARVAVLGDWGVAEGLVFDGLF--EERIIDKQAIQRLPHSIGLDF 265 +++ L +NP + VLG + +G+++ GLF E+ ++D + + D Sbjct: 140 ERKQNIINSLKKNPYLYKRDVLGQRVMPQGVIY-GLFDTEKNVLDALIGEPVEMYFCADG 198 Query: 266 GFKHDPTAA---VFTAVDQVNRVVYVYDEV---YKHGLLTGQI------AQQLRAHMAYG 313 G + D T+ + T V R+ + + V Y G TGQ+ A +L+ + + Sbjct: 199 G-QSDATSMSCNIVTRVRDNGRISFRLNRVAHYYHSGADTGQVKAMSTYALELKVFIDWC 257 Query: 314 IP--------ITADSAGSNLIAELTRVHGVPGILPSGKGKD--SVVQGIQY-MQSYRFVV 362 + + D A +L EL ++ GV + KD S +GI+ ++ + ++ Sbjct: 258 VKKYQMRYTEVFVDPACKSLREELHKL-GVFTLGAPNNSKDVSSKAKGIEVGIERGQNII 316 Query: 363 DPRCKGLL----EELNLY-------VYDRDKAGNWLNTPVDANNHAIDALRYAMQKFM 409 L+ EE + Y +Y RD G P+D +NHA+D RY++ F+ Sbjct: 317 SDGAFYLVNHSEEEYDHYHFLKEIGLYSRDDNGK----PIDKDNHAMDEFRYSVNVFV 370 >gi|16780|lcl|protein:vir:2731 Length: 411 # NCBI annotation: hypothetical protein # Family: family:all:54 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695104;genbank:gi:23455873;genbank:GeneID :955606 Length = 411 Score = 38.5 bits (88), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 61/263 (23%), Positives = 110/263 (41%), Gaps = 28/263 (10%) Query: 170 NPWSERHFLKSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVL 229 NP + H+L ++ + + + + DN L++ YI S+K + R +L Sbjct: 160 NPDNPNHWLNRDYIGKNDGK--IIDFSFKLDDNTFLSKRYIDSIKAATPKGKFYDR-DIL 216 Query: 230 GDWGVAEGLVFDGLFEERIIDKQAIQRLPHSI----GLDFGFKHDPTAAVFTAVDQVNRV 285 G W VAEG ++ ++ +I + LP G+D+G+ H ++ + V+ Sbjct: 217 GLWTVAEGAIY-ADYDSKI---HVVDELPEMKRYFGGIDWGYTH--YGSIVIVGEGVDNN 270 Query: 286 VYVYDEVYKHGLLTGQIAQQLRAHMA-YG-IPITADSAGSNLIAELTRVHGVPGILPSGK 343 Y+ D V +Q R YG IP ADSA +A G I+ + K Sbjct: 271 FYLVDGVAAQFKEIDWWVEQARKLTGIYGNIPFYADSARPEHVARFEN-EGFD-IMNANK 328 Query: 344 GKDSVVQGIQ-----YMQSYRFVVDPRCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAI 398 SV+ GI+ + + +V +E+ Y R K + + P+ + + Sbjct: 329 ---SVIAGIELIAKLFKEKKLYVKRGFVPRFFDEIYQY---RWKENSTKDEPLKEFDDVL 382 Query: 399 DALRYAMQKFMFVKNGQYMSYTD 421 D++RYA+ + + + SY D Sbjct: 383 DSVRYAIYSDYVIGSTERASYDD 405 >gi|16025|lcl|protein:vir:9814 Length: 403 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795576;genbank:gi:28876345;genbank:GeneID :1257867 Length = 403 Score = 38.5 bits (88), Expect = 1e-04, Method: Compositional matrix adjust. 
Identities = 82/358 (22%), Positives = 149/358 (41%), Gaps = 74/358 (20%) Query: 98 VVYKPTG-QRIFFRG---MDSPLKITSITPTTGQLCRA------WYEECYELKSLDGFNT 147 ++ P G +R++++G ++S IT ++ + C + +EC+ T Sbjct: 69 LITTPKGNKRVYYKGGGKVNSVGAITGMSLGSVVFCEINLLHMDFIQECFR-------RT 121 Query: 148 VEESLRGELDDPSGYYQSILTFNPWSERHFLKSEFFDEATRRSGVYATTTTYKDNDHLNE 207 LR L D NP + +H + + FD R T T DN L Sbjct: 122 WAAKLRYHLAD----------LNPPAPQHPVIKDVFDVQNTR----WTHWTMDDNPILTA 167 Query: 208 SYIKSLKEMLVRNPNRARVAVLGDWGVAEGLVFDGLF--EERIIDKQAIQRLPHSIGLDF 265 +++ L +NP + VLG + +G+++ GLF E+ ++D + + D Sbjct: 168 ERKQNIINSLKKNPYLYKRDVLGQRVMPQGVIY-GLFDTEKNVLDALIGEPVEMYFCADG 226 Query: 266 GFKHDPTAA---VFTAVDQVNRVVYVYDEV---YKHGLLTGQI------AQQLRAHMAYG 313 G + D T+ + T V R+ + + V Y G TGQ+ A +L+ + + Sbjct: 227 G-QSDATSMSCNIVTRVRDNGRISFRLNRVAHYYHSGADTGQVKAMSTYALELKVFIDWC 285 Query: 314 IP--------ITADSAGSNLIAELTRVHGVPGILPSGKGKD--SVVQGIQY-MQSYRFVV 362 + + D A +L EL ++ GV + KD S +GI+ ++ + ++ Sbjct: 286 VKKYQMRYTEVFVDPACKSLREELHKL-GVFTLGAPNNSKDVSSKAKGIEVGIERGQNII 344 Query: 363 DPRCKGLL----EELNLY-------VYDRDKAGNWLNTPVDANNHAIDALRYAMQKFM 409 L+ EE + Y +Y RD G P+D +NHA+D RY++ F+ Sbjct: 345 SDGAFYLVNHSEEEYDHYHFLKEIGLYSRDDNGK----PIDKDNHAMDEFRYSVNVFV 398 >gi|12899|lcl|protein:vir:80432 Length: 510 # NCBI annotation: BcepGomrgp04 # Family: family:all:144 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210224;genbank:gi:146329916;genbank:Ge neID:5123541 Length = 510 Score = 38.5 bits (88), Expect = 2e-04, Method: Compositional matrix adjust. 
Identities = 21/70 (30%), Positives = 36/70 (51%), Gaps = 3/70 (4%) Query: 198 TYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGLVFDGLFEERIIDKQAIQRL 257 +YK+N +L Y+ L+ +++PN+ + + GDW V G D L+ E + K + Sbjct: 236 SYKENIYLTPEYVAELES--IKDPNKRKAWLHGDWNVVAGGAIDDLWREEVHVKPRFN-I 292 Query: 258 PHSIGLDFGF 267 P S +D F Sbjct: 293 PASWRVDRSF 302 >gi|5072|lcl|protein:vir:95130 Length: 531 # NCBI annotation: hypothetical protein ORF006 # Family: family:all:144 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293413;genbank:gi:148912834;genbank:Ge neID:5228205 Length = 531 Score = 38.1 bits (87), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 25/85 (29%), Positives = 39/85 (45%), Gaps = 11/85 (12%) Query: 164 QSILTFNPWSERHFLKSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNR 223 + I +NP +E+ E T A +YK+N +L SYI L+ ++ PN Sbjct: 227 REIQIYNPATEK---------EETHVISQIAIFGSYKENPYLPASYIAELES--IKEPNL 275 Query: 224 ARVAVLGDWGVAEGLVFDGLFEERI 248 + + GDW V G D L++ I Sbjct: 276 RKAWLYGDWDVTAGGAIDDLWQSHI 300 >gi|11622|lcl|protein:vir:78869 Length: 439 # NCBI annotation: TerL # Family: family:all:54 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468842;genbank:gi:157325434;genbank:Ge neID:5601865 Length = 439 Score = 37.7 bits (86), Expect = 2e-04, Method: Compositional matrix adjust. 
Identities = 35/110 (31%), Positives = 46/110 (41%), Gaps = 22/110 (20%) Query: 315 PITADSAGSNLIAELTRVHGVPGILPSGKGKDSVVQ------GIQYMQS------YRFVV 362 P+ D A L EL +V GV D + + GI+ MQS Y V Sbjct: 329 PVFIDPACRWLREELEKV-GVDTAGADNNAHDVIGKAQGIEVGIERMQSLLSERRYLLVE 387 Query: 363 DPRCK----GLLEELNLYVYDRDKAGNWLNTPVDANNHAIDALRYAMQKF 408 P + L+E+ +YV D + PVD NNHA+D RYA F Sbjct: 388 QPNDQYDHYSWLQEIGMYVRDENSG-----KPVDKNNHAMDTSRYATNYF 432 >gi|4914|lcl|protein:vir:99572 Length: 459 # NCBI annotation: TerL-like protein # Family: family:all:54 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039811;genbank:gi:126011061;genbank:Ge neID:4818267 Length = 459 Score = 35.8 bits (81), Expect = 0.001, Method: Compositional matrix adjust. Identities = 51/214 (23%), Positives = 85/214 (39%), Gaps = 17/214 (7%) Query: 13 LSPAYYPLFKDRSRYLVYKGSRGSGKSYAAAIKVLVDTIAHPY-VNWLVLRQYAYTNRDS 71 L+PA + D++RY G R S KS+ AA + +A Y V +L RQ+ +S Sbjct: 4 LNPALRDFWLDKARYKALYGGRASSKSHDAAGFAVY--LARNYTVKFLCARQFQNKISES 61 Query: 72 TFATLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDSPLKITSITPTTGQLCRA 131 + +K F S + +K TG F G+ L I T G + Sbjct: 62 VYTLIKGKIDAAGWTKEFDVTISSIR--HKKTGAEFLFYGIARNL--NEIKSTEG-VDIL 116 Query: 132 WYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNPWSERHFLKSEFFDEATRRSG 191 W EE L + +N + ++R E Q L +NP F+ F + Sbjct: 117 WLEEAQYLTE-EQWNVINPTIRREGS------QIWLIWNPDQYTDFIYQNFV--VNPPAD 167 Query: 192 VYATTTTYKDNDHLNESYIKSLKEMLVRNPNRAR 225 + + +N L+++ +K + + R+P A Sbjct: 168 CLSKQINWTENPFLSDTMLKVIYDEYQRDPKLAE 201 >gi|9590|lcl|protein:vir:97250 Length: 515 # NCBI annotation: hypothetical protein ORF012 # Family: family:all:144 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294520;genbank:gi:149408241;genbank:Ge neID:5237115 Length = 515 Score = 35.8 bits (81), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 100/478 (20%), Positives = 182/478 (38%), Gaps = 107/478 (22%) Query: 18 YPLFKDRSRYLVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWLVLRQYAYTNRDSTF-ATL 76 +P+F+ ++Y+G+RG GK+ +L+D + H V + Y R F T Sbjct: 62 HPIFE-----VLYEGTRGPGKTDC----LLMDFLQH------VGKGYGSEWRGILFRQTY 106 Query: 77 KKVASDLNVYNLFTWKSSPL----EVVYK---PTGQRIFFRGMDSPLKITSIT----PTT 125 +++ +N N + + P +V +K P G+ + R M SP + P Sbjct: 107 PQLSDVINKTNKWFKRIFPGAKYNKVEHKWTFPDGEELLLRHMKSPEDYWNYHGHAYPWI 166 Query: 126 G--QLCRAWYEECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNP---WSERHF--- 177 G +LC ++CY + + + + + P Y + + P W + F Sbjct: 167 GWEELCNWADDKCYTV-MMSCCRSTKPGM------PRCYRATTNPYGPGHNWVKARFRLP 219 Query: 178 -LKSEFFDEATR-------RSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVL 229 ++ +A R R ++ + + H + YI ++ RNP+ + Sbjct: 220 HMRGRVILDAMRDGEREPPRVAIHGSIYENQILLHADPEYISKIR-AAARNPSELAAWLH 278 Query: 230 GDWGVAEGLVFDGLF--EERIIDKQAIQRLPHSIGLDFGF---KHDPTAAVFTAVD---- 280 G W + G +FD ++ + ++ + +P +D F P A ++ A Sbjct: 279 GSWDIIAGGMFDDIYRGDVHVVPSVPLSVIPKRWKIDRSFDWGSSKPFAVLWWAESNGEP 338 Query: 281 ---------QVNRVVYVYDEVYKHGLLTGQIAQQLRAHMAYGI--------------PIT 317 +V +Y+ E Y + + L + +A G+ P Sbjct: 339 FEWNGRVYGKVRGDLYLIQEWYGWNGTRNEGVRMLASEVAQGVKDREEDWALEGRVKPGP 398 Query: 318 ADSA-----GSNLIAELTRVHGVPGILPSGKGKDSVVQGIQYMQS-YRFVVDPRCKGLLE 371 ADS+ N IA GV P+ KG S QG + ++ + + P G E Sbjct: 399 ADSSIFDVENGNSIAVDMEKKGV-RWTPADKGPGSRKQGWEQIRKLLKGALPPAGGGPRE 457 Query: 372 ELNLYVYD--------------RDKAGNWLNTPVDANNHAIDALRYAM-QKFMFVKNG 414 LY++D DK + +NT +A +H DA+RY + +K VK G Sbjct: 458 VPGLYIFDWCQQTIETVPVLPRDDKDLDDVNT--EAEDHIGDAIRYRVRKKLRGVKQG 513 >gi|8997|lcl|protein:vir:101640 Length: 432 # NCBI annotation: terminase large subunit # Family: family:all:144 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112491;genbank:gi:53793591;uniprot:Q5ZGG2 ;genbank:GeneID:3101748 Length = 432 Score = 34.7 bits (78), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 84/389 (21%), Positives = 162/389 (41%), Gaps = 44/389 (11%) Query: 16 AYYPLFKDR-SRYLVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWLVLRQYAYTNRDSTFA 74 AYY KD+ ++ L+Y G+ G GKS + ++ A+P WL+ R +++T Sbjct: 11 AYY--LKDKETKELIYGGAAGGGKSALGILWLIEQCQAYPGTRWLMGRSKLKALKETTLN 68 Query: 75 TLKKVASDLNVYNLFTWKSSPLEVVYKPTGQRIFFRGMDS-PLKITSITPTTGQLCRAWY 133 T + AS L + + + + V+Y G I + + + P + + ++ A+ Sbjct: 69 TFFEQASILKITEQYNYNAQS-GVIYWNNGSEILLKDLYAYPSDQNFDSLGSLEISGAFI 127 Query: 134 EECYELKSLDGFNTVEESLRGELDDPSGYYQSILTFNP---WSERHF-LKSEFFDEATRR 189 +EC ++ + + V+ +R +L+ + + T NP W F LK + + Sbjct: 128 DECNQI-TYKAWQIVKSRIRYKLNQYGIEPKMLGTCNPAKNWVYAQFYLKDKNGTLDNDK 186 Query: 190 SGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVLGDWGVAEGLVFDGLFEERII 249 + A T DN HL SY+ SL L + N + G+W ++I Sbjct: 187 KFIQALPT---DNPHLPASYLTSL---LSLDENSKQRLYYGNWEYDND-------PAKLI 233 Query: 250 DKQAIQRLPHSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYD-----EVYKHGLLT-GQIA 303 D + IQ + + FG + D++ V+ V+ E++ + +IA Sbjct: 234 DYEKIQNCFTNTFIPFGEMYISADIARFGSDKM--VICVWSGFRVVEIFSMAKSSITEIA 291 Query: 304 QQLRAHMAYGIPITADSAGSNLIAE-------LTRVHGVPGILPSGKGKDSVVQGIQYMQ 356 + +R G+ I SN+I + + V G G + + + + Q +QY Q Sbjct: 292 EAVR-----GLSIKHKVPLSNVICDEDGVGGGVVDVLGCTGFINNSRAMEVDNQVVQY-Q 345 Query: 357 SYRFVVDPRCKGLLEELNLYVYDRDKAGN 385 + + + +++ NLY++ D N Sbjct: 346 NLKTQCYYKLAEVIQSNNLYIHSEDATVN 374 >gi|2589|lcl|protein:vir:94084 Length: 407 # NCBI annotation: ORF009 # Family: family:all:54 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240228;genbank:gi:66395894;genbank:GeneID :5133253 Length = 407 Score = 33.9 bits (76), Expect = 0.004, Method: Compositional matrix adjust. 
Identities = 51/244 (20%), Positives = 99/244 (40%), Gaps = 18/244 (7%) Query: 170 NPWSERHFLKSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVL 229 NP H+L ++ + ++G+ + DN+ LN+ Y +S+K R + Sbjct: 161 NPDHPEHWLLKDYIENTDPKAGILSHQFKLDDNNFLNDRYKESIKASTPSGMFYER-NIN 219 Query: 230 GDWGVAEGLVF-DGLFEERIIDKQAIQRLP---HSIGLDFGFKH-DPTAAVFTAVDQVNR 284 G W +G+V+ D E I + +P + G+D+G++H + +D Sbjct: 220 GMWVSGDGVVYADFDLNENTIKADELDDIPIKEYFAGVDWGYEHYGSIVLIGRGID--GN 277 Query: 285 VVYVYDEVYKHGLLTGQIAQQLRAHMAYG-IPITADSAGSNLIAELTRVHGVPGILPSGK 343 ++ + ++ + + YG I D+A I E R H + I + K Sbjct: 278 FYFIEEHAHQFKFIDDWVVIAKDIVSRYGNINFYCDTARPEYITEFRR-HRLRAI-NADK 335 Query: 344 GKDSVVQGIQ--YMQSYRFVVDPRCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAIDAL 401 K S V+ + + Q+ V+ +E+ YV+ P+ + +D+L Sbjct: 336 SKLSGVEEVAKLFKQNKLLVLYDNMDRFKQEVFKYVWHPTNG-----EPIKEFDDVLDSL 390 Query: 402 RYAM 405 RYA+ Sbjct: 391 RYAI 394 >gi|3523|lcl|protein:vir:105897 Length: 407 # NCBI annotation: large terminase # Family: family:all:54 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004370;genbank:gi:122891825;genbank:Ge neID:4712368 Length = 407 Score = 33.9 bits (76), Expect = 0.004, Method: Compositional matrix adjust. 
Identities = 51/244 (20%), Positives = 99/244 (40%), Gaps = 18/244 (7%) Query: 170 NPWSERHFLKSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLVRNPNRARVAVL 229 NP H+L ++ + ++G+ + DN+ LN+ Y +S+K R + Sbjct: 161 NPDHPEHWLLKDYIENTDPKAGILSHQFKLDDNNFLNDRYKESIKASTPSGMFYER-NIN 219 Query: 230 GDWGVAEGLVF-DGLFEERIIDKQAIQRLP---HSIGLDFGFKH-DPTAAVFTAVDQVNR 284 G W +G+V+ D E I + +P + G+D+G++H + +D Sbjct: 220 GMWVSGDGVVYADFDLNENTIKADELDDIPIKEYFAGVDWGYEHYGSIVLIGRGID--GN 277 Query: 285 VVYVYDEVYKHGLLTGQIAQQLRAHMAYG-IPITADSAGSNLIAELTRVHGVPGILPSGK 343 ++ + ++ + + YG I D+A I E R H + I + K Sbjct: 278 FYFIEEHAHQFKFIDDWVVIAKDIVSRYGNINFYCDTARPEYITEFRR-HRLRAI-NADK 335 Query: 344 GKDSVVQGIQ--YMQSYRFVVDPRCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAIDAL 401 K S V+ + + Q+ V+ +E+ YV+ P+ + +D+L Sbjct: 336 SKLSGVEEVAKLFKQNKLLVLYDNMDRFKQEVFKYVWHPTNG-----EPIKEFDDVLDSL 390 Query: 402 RYAM 405 RYA+ Sbjct: 391 RYAI 394 >gi|6719|lcl|protein:vir:99411 Length: 447 # NCBI annotation: hypothetical protein # Family: family:all:32729 # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919076;genbank:gi:119757034;genbank:GeneI D:4606064 Length = 447 Score = 33.1 bits (74), Expect = 0.007, Method: Compositional matrix adjust. 
Identities = 90/391 (23%), Positives = 143/391 (36%), Gaps = 70/391 (17%) Query: 59 LVLRQYAYTNRDSTFATLKKV--ASDLNVYN---------LFTWKSSPLEVVYKPTGQRI 107 LVL Q + +T++ K+ D N + + TW S+ +VY TG Sbjct: 64 LVLAQDYQKGKSTTYSVFFKILPGEDTNPFKDGDPENSPIVDTWHSNDKRLVYV-TGHVA 122 Query: 108 FFRGMDSPLKITSITPTTGQLCRAWYEECYELKSLDGFNTVEESL----RGELDDPSGYY 163 + G D + G+ CR W +E + E L R E+ + + Sbjct: 123 WLGGADKWNRFAG-----GEYCRIWCDEVGHYPPNTDLYDLHEMLITRQRTEIGPNTTLW 177 Query: 164 QSILT-FNPW---SERHF-LKSEFFDEATRRSGVYATTTTYKDNDHLNESYIKSLKEMLV 218 S FN + +ER E A + V A+T + N L + ++ Sbjct: 178 TSTGNGFNQFYDITERQVNADDEPLPWADQMEVVVAST---EHNTLLPPDGLDKIRRQF- 233 Query: 219 RNPNRARVAVLGDWGVAEGLVFDGL-----------FEERIIDKQAIQRLPHSIGLDFGF 267 + R + G + AEGLV+D +R+ D A+ G D G+ Sbjct: 234 KGTAREEQGLHGGFAAAEGLVYDAFTRQTHVRDADDVRDRLADDWAM------YGYDAGW 287 Query: 268 KHDPTAAVFTAVDQVNRVVYVYDEVYKHGLLTGQIAQQLRAHMAYGIPITADSAGSNLIA 327 +DP + + V V+D+ YK ++ A A P A + A Sbjct: 288 -NDPRVLLDIRKTHAGQFV-VWDQFYKSESHLAELVDPDDALPADVDPWLAGRPRGRVYA 345 Query: 328 ELTRVH---GVPGILPSGKGKDSVVQGIQYMQSYRFVVDP----------RCKGLLEELN 374 E H P+ K + S+ GI +++S R +D RC L++E Sbjct: 346 EHEPAHIEQFRKANWPAVKAEKSLDGGIDHVRS-RLAMDDEGRPGVLVTDRCGELIQEF- 403 Query: 375 LYVYDRDKAGNWLNTPVDANNHAIDALRYAM 405 Y D G A +HA+DALRYA+ Sbjct: 404 -LSYKEDHVGT-----SKAQDHALDALRYAL 428 >gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: terminase large subunit # Family: family:all:147 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467938;genbank:gi:157265379;genbank:Ge neID:5600506 Length = 485 Score = 31.2 bits (69), Expect = 0.027, Method: Compositional matrix adjust. 
Identities = 35/132 (26%), Positives = 58/132 (43%), Gaps = 8/132 (6%) Query: 259 HSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLLTGQIAQQLRAHMAYGIP-IT 317 + IG DFG D +VF+ +D ++ V E + Q+A+ YG + Sbjct: 289 YCIGADFGKNQD--YSVFSVLD-LDTGAIVCLERMNGATWSDQVARLKALSEDYGHAYVV 345 Query: 318 ADSAG-SNLIAELTRVHGV---PGILPSGKGKDSVVQGIQYMQSYRFVVDPRCKGLLEEL 373 AD+ G + IAE G+ P + S K+ ++ + + V P K +L+EL Sbjct: 346 ADTWGVGDAIAEELDAQGINYTPLPVKSSSVKEQLISNLALLMEKGQVAVPNDKTILDEL 405 Query: 374 NLYVYDRDKAGN 385 + Y R +GN Sbjct: 406 RNFRYYRTASGN 417 Score = 22.7 bits (47), Expect = 7.9, Method: Compositional matrix adjust. Identities = 11/32 (34%), Positives = 18/32 (56%), Gaps = 1/32 (3%) Query: 34 RGSGKSYAAAIKVLVDTIAHP-YVNWLVLRQY 64 R SGKS AA+++ + + A P W++ Y Sbjct: 39 RQSGKSEAASVEAVFELFARPGSQGWIIAPTY 70 >gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: phage terminase large subunit # Family: family:all:147 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468054;genbank:gi:157265496;genbank:Ge neID:5600564 Length = 485 Score = 31.2 bits (69), Expect = 0.028, Method: Compositional matrix adjust. Identities = 35/132 (26%), Positives = 58/132 (43%), Gaps = 8/132 (6%) Query: 259 HSIGLDFGFKHDPTAAVFTAVDQVNRVVYVYDEVYKHGLLTGQIAQQLRAHMAYGIP-IT 317 + IG DFG D +VF+ +D ++ V E + Q+A+ YG + Sbjct: 289 YCIGADFGKNQD--YSVFSVLD-LDTGAIVCLERMNGATWSDQVARLKALSEDYGHAYVV 345 Query: 318 ADSAG-SNLIAELTRVHGV---PGILPSGKGKDSVVQGIQYMQSYRFVVDPRCKGLLEEL 373 AD+ G + IAE G+ P + S K+ ++ + + V P K +L+EL Sbjct: 346 ADTWGVGDAIAEELDAQGINYTPLPVKSSSVKEQLISNLALLMEKGQVAVPNDKTILDEL 405 Query: 374 NLYVYDRDKAGN 385 + Y R +GN Sbjct: 406 RNFRYYRTASGN 417 Score = 22.7 bits (47), Expect = 7.9, Method: Compositional matrix adjust. 
Identities = 11/32 (34%), Positives = 18/32 (56%), Gaps = 1/32 (3%) Query: 34 RGSGKSYAAAIKVLVDTIAHP-YVNWLVLRQY 64 R SGKS AA+++ + + A P W++ Y Sbjct: 39 RQSGKSEAASVEAVFELFARPGSQGWIIAPTY 70 >gi|8015|lcl|protein:vir:96495 Length: 195 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238487;genbank:gi:66391763;genbank:GeneID :5176917 Length = 195 Score = 28.5 bits (62), Expect = 0.18, Method: Compositional matrix adjust. Identities = 45/207 (21%), Positives = 83/207 (40%), Gaps = 25/207 (12%) Query: 232 WGVAEGLVFDGLFEERIIDKQAIQRLPHSI----GLDFGFKHDPTAAVFTAVDQVNRVVY 287 W VAEG ++ ++ +I + LP G+D+G+ H ++ + V+ Y Sbjct: 3 WTVAEGAIY-ADYDSKI---HVVDELPEMKRCFGGIDWGYTH--YGSIVVVGEGVDGNFY 56 Query: 288 VYDEVYKHGLLTGQIAQQLR--AHMAYGIPITADSAGSNLIAELTRVHGVPGILPSGKGK 345 + D V +Q R + IP ADSA +A G S K Sbjct: 57 LLDGVAAQFKEIDWWVEQARKLTGIYRNIPFYADSARPEHVARFES----EGFDISNANK 112 Query: 346 DSVVQGIQ-----YMQSYRFVVDPRCKGLLEELNLYVYDRDKAGNWLNTPVDANNHAIDA 400 SV+ GI+ + + +V +E+ Y R K + + P+ + +D+ Sbjct: 113 -SVIAGIELIAKLFKEEKLYVKRGFVPRFFDEIYQY---RWKENSTKDEPLKEFDDVLDS 168 Query: 401 LRYAMQKFMFVKNGQYMSYTDRVAELK 427 +RYA+ + + + SY D ++ + Sbjct: 169 VRYAIYSDYVIGSTEQASYDDLLSMFR 195 >gi|1299|lcl|protein:vir:105078 Length: 155 # NCBI annotation: major tail shaft subunit # Family: family:all:11396 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006592;genbank:gi:46402098;genbank:GeneID :2777944 Length = 155 Score = 27.7 bits (60), Expect = 0.27, Method: Compositional matrix adjust. 
 Identities = 18/52 (34%), Positives = 23/52 (44%), Gaps = 2/52 (3%)

Query: 228 VLGDWGVAEGLVFDGLFEERIIDKQAIQRLPHSIGLDFGFKHDPTAAVFTAV 279
            LG  G   G V     +++   KQ+I  LP       GF  DP  A FTA+
Sbjct: 39  ALGAMGQTGGFVDCTTLKDK--QKQSISDLPDGPEKSLGFIDDPANASFTAL 88

>gi|13269|lcl|protein:vir:81252 Length: 542 # NCBI annotation: gp2, terminase # Family: family:all:523 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456732;genbank:gi:157168375;interpro:IPR005021;uniprot:Q9MBK3;genbank:GeneID:5580375
          Length = 542

 Score = 27.3 bits (59), Expect = 0.35, Method: Compositional matrix adjust.
 Identities = 14/52 (26%), Positives = 26/52 (50%)

Query: 2   SGTVNINLAELLSPAYYPLFKDRSRYLVYKGSRGSGKSYAAAIKVLVDTIAH 53
           +G+  I LA    P    +F    ++ V   +RG G+S++A + +L +   H
Sbjct: 151 NGSNQIELAYAPVPEALDVFGAMPKWFVVATNRGGGRSHSAELAMLDELREH 202

>gi|26461|lcl|protein:vir:573 Length: 589 # NCBI annotation: unknown # Family: family:all:4926 # MgeID: mge:13 # MgeName: SPBc2 # Cross-refs: genbank:acc:NP_046608;genbank:gi:9630181;genbank:GeneID:1261423
          Length = 589

 Score = 25.8 bits (55), Expect = 0.94, Method: Compositional matrix adjust.
 Identities = 10/30 (33%), Positives = 16/30 (53%)

Query: 25 SRYLVYKGSRGSGKSYAAAIKVLVDTIAHP 54
          + Y +Y  SRG GK++  ++   V  I  P
Sbjct: 77 NHYFMYLASRGQGKTWLTSVYCCVQAILFP 106

>gi|15201|lcl|protein:vir:7028 Length: 631 # NCBI annotation: large terminase subunit # Family: family:all:697 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853601;genbank:gi:31711683;genbank:GeneID:1481809
          Length = 631

 Score = 24.6 bits (52), Expect = 2.4, Method: Compositional matrix adjust.
 Identities = 11/41 (26%), Positives = 21/41 (51%)

Query: 25 SRYLVYKGSRGSGKSYAAAIKVLVDTIAHPYVNWLVLRQYA 65
          ++Y + +  RG  K+  AAI  +   I  P+   +++ Q A
Sbjct: 65 NKYRMVEAQRGQAKTTIAAIYAVFRIIHEPHKRIMIVSQTA 105

>gi|4548|lcl|protein:vir:94970 Length: 473 # NCBI annotation: putative phage terminase large subunit # Family: family:all:144 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239275;genbank:gi:66392134;genbank:GeneID:5076615
          Length = 473

 Score = 23.9 bits (50), Expect = 3.5, Method: Compositional matrix adjust.
 Identities = 12/41 (29%), Positives = 23/41 (56%), Gaps = 1/41 (2%)

Query: 25 SRYLVYKGSRGSGKSYAAAIKVLVDTIAHP-YVNWLVLRQY 64
          +R ++Y G+ G GKSY   +  +V ++  P  + +L  R +
Sbjct: 19 AREILYGGAAGGGKSYLLRVASIVYSLEIPGLITYLFRRTF 59

>gi|10070|lcl|protein:vir:99006 Length: 303 # NCBI annotation: gp35 # Family: family:all:698 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655900;genbank:gi:109521472;genbank:GeneID:4157971
          Length = 303

 Score = 23.1 bits (48), Expect = 6.3, Method: Compositional matrix adjust.
 Identities = 10/24 (41%), Positives = 12/24 (50%)

Query: 107 IFFRGMDSPLKITSITPTTGQLCR 130
           + F  MD+PL  T   P TG L
Sbjct: 23  VLFAKMDTPLLTTIEDPATGDLVE 46

>gi|17621|lcl|protein:vir:816 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050549;genbank:gi:9633446;genbank:GeneID:1262208
          Length = 568

 Score = 23.1 bits (48), Expect = 6.6, Method: Compositional matrix adjust.
 Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 1/49 (2%)

Query: 337 GILPSGKGKDSVVQGIQYMQSYRFVVDPRCKGLLEELNLYVYDRDKAGN 385
           G L   + K   + +G++ + +   +   R  G L E+N YVYD   + N
Sbjct: 479 GWLTTRQSKPVLTEGMKTLLN-NGISGIRWSGTLSEMNTYVYDAKGSMN 526

>gi|25680|lcl|protein:vir:10116 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859246;genbank:gi:32171173;genbank:GeneID:2653342
          Length = 568

 Score = 23.1 bits (48), Expect = 6.6, Method: Compositional matrix adjust.
 Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 1/49 (2%)

Query: 337 GILPSGKGKDSVVQGIQYMQSYRFVVDPRCKGLLEELNLYVYDRDKAGN 385
           G L   + K   + +G++ + +   +   R  G L E+N YVYD   + N
Sbjct: 479 GWLTTRQSKPVLTEGMKTLLN-NGISGIRWSGTLSEMNTYVYDAKGSMN 526

>gi|51|lcl|protein:vir:3295 Length: 568 # NCBI annotation: putative large subunit terminase # Family: family:all:147 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049511;genbank:gi:9632517;genbank:GeneID:1262002
          Length = 568

 Score = 23.1 bits (48), Expect = 6.6, Method: Compositional matrix adjust.
 Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 1/49 (2%)

Query: 337 GILPSGKGKDSVVQGIQYMQSYRFVVDPRCKGLLEELNLYVYDRDKAGN 385
           G L   + K   + +G++ + +   +   R  G L E+N YVYD   + N
Sbjct: 479 GWLTTRQSKPVLTEGMKTLLN-NGISGIRWSGTLSEMNTYVYDAKGSMN 526

>gi|26240|lcl|protein:vir:2763 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612880;genbank:gi:20065962;genbank:GeneID:935714
          Length = 568

 Score = 23.1 bits (48), Expect = 6.6, Method: Compositional matrix adjust.
 Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 1/49 (2%)

Query: 337 GILPSGKGKDSVVQGIQYMQSYRFVVDPRCKGLLEELNLYVYDRDKAGN 385
           G L   + K   + +G++ + +   +   R  G L E+N YVYD   + N
Sbjct: 479 GWLTTRQSKPVLTEGMKTLLN-NGISGIRWSGTLSEMNTYVYDAKGSMN 526

>gi|25849|lcl|protein:vir:9949 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859079;genbank:gi:32171001;genbank:GeneID:2653280
          Length = 568

 Score = 23.1 bits (48), Expect = 6.6, Method: Compositional matrix adjust.
 Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 1/49 (2%)

Query: 337 GILPSGKGKDSVVQGIQYMQSYRFVVDPRCKGLLEELNLYVYDRDKAGN 385
           G L   + K   + +G++ + +   +   R  G L E+N YVYD   + N
Sbjct: 479 GWLTTRQSKPVLTEGMKTLLN-NGISGIRWSGTLSEMNTYVYDAKGSMN 526

>gi|1476|lcl|protein:vir:104436 Length: 568 # NCBI annotation: putative terminase large subunit # Family: family:all:147 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794060;genbank:gi:116222005;genbank:GeneID:4397501
          Length = 568

 Score = 23.1 bits (48), Expect = 6.6, Method: Compositional matrix adjust.
 Identities = 14/49 (28%), Positives = 25/49 (51%), Gaps = 1/49 (2%)

Query: 337 GILPSGKGKDSVVQGIQYMQSYRFVVDPRCKGLLEELNLYVYDRDKAGN 385
           G L   + K   + +G++ + +   +   R  G L E+N YVYD   + N
Sbjct: 479 GWLTTRQSKPVLTEGMKTLLN-NGISGIRWSGTLSEMNTYVYDAKGSMN 526

  Database: capsid_neck_tail
    Posted date:  Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database:  514

Lambda     K      H
   0.320    0.136    0.408

Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 200,731
Number of Sequences: 514
Number of extensions: 9889
Number of successful extensions: 222
Number of sequences better than 100.0: 75
Number of HSP's better than 100.0 without gapping: 71
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 22
Number of HSP's gapped (non-prelim): 86
length of query: 431
length of database: 206,069
effective HSP length: 74
effective length of query: 357
effective length of database: 168,033
effective search space: 59987781
effective search space used: 59987781
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 38 (19.2 bits)
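
Note on the statistics above: the bit scores and E-values in this report can be roughly cross-checked from the footer parameters using the standard Karlin-Altschul relations, S' = (lambda*S - ln K) / ln 2 and E = N * 2^(-S'), where N is the effective search space. The sketch below is illustrative and is not part of the BLAST output. It assumes that the second "Lambda K H" row holds the gapped parameters and that the footer's "effective search space used" applies to every hit; the function names are hypothetical. Printed E-values will not match exactly, partly because the report rounds bit scores.

```python
import math

# Values copied from the statistics footer of this report.
SEARCH_SPACE = 59_987_781   # "effective search space used"
GAPPED_LAMBDA = 0.267       # second "Lambda K H" row (assumed to be the gapped values)
GAPPED_K = 0.0410


def bits_from_raw(raw_score: float, lam: float = GAPPED_LAMBDA, k: float = GAPPED_K) -> float:
    """Convert a raw alignment score S to a bit score S' = (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def evalue_from_bits(bit_score: float, search_space: float = SEARCH_SPACE) -> float:
    """Approximate the expected number of chance hits: E = N * 2**(-S')."""
    return search_space * 2.0 ** (-bit_score)


if __name__ == "__main__":
    # Raw scores 110 and 48 appear in parentheses after the bit scores in the hit list.
    for raw in (110, 48):
        bits = bits_from_raw(raw)
        print(f"raw={raw} bits={bits:.1f} E~{evalue_from_bits(bits):.1e}")
```

For the raw score 110 this should land close to the 47.0 bits and the 4e-07 E-value reported above; for the raw score 48 it should land close to 23.1 bits, with an E-value of the same order as the 6.3 to 6.6 reported.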