BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_018861.1_cdsid_YP_006908144.1 [gene=D302_gp082] [protein=putative phage DNA packaging protein] [protein_id=YP_006908144.1] [location=74590..75918]
         (442 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: ter...   166   4e-43
gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: t...   163   4e-42
gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: g...   162   8e-42
gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: g...   159   7e-41
gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T...   155   1e-39
gi|24596|lcl|protein:vir:98262 Length: 609 # NCBI annotation: gp...   153   3e-39
gi|20804|lcl|protein:vir:106290 Length: 633 # NCBI annotation: g...   153   4e-39
gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: ter...   153   5e-39
gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp...   150   3e-38
gi|28182|lcl|protein:vir:108053 Length: 611 # NCBI annotation: g...   142   8e-36
gi|23191|lcl|protein:vir:101186 Length: 613 # NCBI annotation: t...   139   8e-35
gi|22942|lcl|protein:vir:101803 Length: 613 # NCBI annotation: g...   138   1e-34
gi|27407|lcl|protein:vir:7202 Length: 610 # NCBI annotation: gp1...   134   2e-33
gi|22056|lcl|protein:vir:103455 Length: 610 # NCBI annotation: t...   134   2e-33
gi|27701|lcl|protein:vir:6893 Length: 611 # NCBI annotation: gp1...   131   2e-32
gi|21077|lcl|protein:vir:100538 Length: 612 # NCBI annotation: g...   127   4e-31
gi|16897|lcl|protein:vir:5867 Length: 508 # NCBI annotation: sim...    51   2e-08
gi|17621|lcl|protein:vir:816 Length: 568 # NCBI annotation: hypo...    35   0.002
gi|25680|lcl|protein:vir:10116 Length: 568 # NCBI annotation: hy...    35   0.002
gi|51|lcl|protein:vir:3295 Length: 568 # NCBI annotation: putati...    35   0.002
gi|26240|lcl|protein:vir:2763 Length: 568 # NCBI annotation: hyp...    35   0.002
gi|25849|lcl|protein:vir:9949 Length: 568 # NCBI annotation: hyp...    35   0.002
gi|1476|lcl|protein:vir:104436 Length: 568 # NCBI annotation: pu...    35   0.002
gi|16146|lcl|protein:vir:10462 Length: 586 # NCBI annotation: DN...    26   0.94
gi|3734|lcl|protein:vir:94555 Length: 585 # NCBI annotation: DNA...    25   1.3
gi|17058|lcl|protein:vir:5248 Length: 487 # NCBI annotation: put...    25   1.9
gi|8126|lcl|protein:vir:102661 Length: 568 # NCBI annotation: te...    25   2.5
gi|19292|lcl|protein:vir:4781 Length: 436 # NCBI annotation: put...    24   2.9
gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: te...    23   8.6

>gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899601;genbank:gi:34419588;genbank:GeneID:2546011
          Length = 600

 Score = 166 bits (421), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 112/368 (30%), Positives = 195/368 (52%), Gaps = 38/368 (10%) Query: 101 CAFIDKWSEFSNSVIPTISASKKSQIIAASTPVGLNHWYKMWSDAVEGKSSYKPFKVEWW 160 CAF+ + +F + P IS+ ++S+++ STP GLNH++ MW+ AV+G S+++P+ W Sbjct: 243 CAFVPGFDDFWKATFPVISSGEESKVVLTSTPNGLNHYHDMWNAAVQGISTFEPYTTTWR 302 Query: 161 KVPGR---------DENYKELMIKTLEGGIRTWNQEYACEFIGSSDTLVDMTVLSNIKFG 211 V R E +K I ++QE+ C F+G++ TL++ LS +K Sbjct: 303 AVQNRLYKDGEFDDGEAFKRETIGNTSR--EAFSQEHLCNFLGTAGTLINGFKLSKMKGI 360 Query: 212 NILREPNFGETIRVYEAPQEHHKYMVLADAAKG-AIDGFVFHVIDVTNIPFKQVASGKIS 270 +++++ + VY+ P+E HKY++ D ++G D H+IDVT+ PF+QVA + Sbjct: 361 DVVKD---SDGWCVYKKPEEGHKYILTVDTSEGRGQDYHALHMIDVTSYPFEQVAVFHDN 417 Query: 271 E-SYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQEP----DKKW 325 + S+L+ P I YNEA CE G V++ LF+ EYEN+ E ++ Sbjct: 418 KTSHLLLPAIIMKQAYRYNEAYVYCE-IASTGELVMNELFRDLEYENVIMEERASGGRRG 476 Query: 326 LGVRTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNGKYQAQNSKAHDDYVM 385 LG++ K ++ S +K IE ++L + PT+KE TF ++A+ HDD VM Sbjct: 477 LGLKPNKKTKAIGCSTLKDLIEKDQLKINHIPTLKEFHTFVEKGKSWEAEEG-FHDDLVM 535 Query: 386 ALSLL--------FVPLLDLNNIVDYDVFLNKINSDSEITDGDVKYLQMG-----FFDDG 432 +L+LL F ++ V YD+F +++ ++ D DV +L + + D Sbjct: 536 SLTLLAYLSTQDRFSDFVEKEYNVSYDIFKQEVH---DMMDDDVPFLMIADGIENYGTDF 592 Query: 433 TSSFYGIF 440 +++ +G+F Sbjct: 593 STNSFGMF 600 >gi|20243|lcl|protein:vir:106989 Length: 548 # NCBI annotation: terminase large subunit gp17 # Family: family:all:147 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195134;genbank:gi:58532911;uniprot:Q5GQN8 ;genbank:GeneID:3260486 Length = 548 Score = 163 bits (412), Expect = 4e-42, Method: Compositional matrix adjust. 
Identities = 111/351 (31%), Positives = 179/351 (50%), Gaps = 46/351 (13%) Query: 97 VSHNCAFIDKWS--------EFSNSVIPTISASKKSQIIAASTPVGLNHWYKMWSDAVEG 148 +S N F+D+++ F SV PTI++ K +++I STP G+NH+YKMW DA G Sbjct: 167 MSFNIIFLDEFAFVPNHIADSFFASVYPTITSGKSTKVIIISTPQGMNHFYKMWVDATNG 226 Query: 149 KSSYKPFKVEWWKVPGRDENYKELMIKTLEGGIRTWNQEYACEFIGSSDTLVDMTVLSNI 208 ++ Y +V W +VPGRDE +KE IK R + QE+ CEF+GS DTL+ + L + Sbjct: 227 RNGYTFHEVHWSQVPGRDEKWKEETIKNTSE--RQFTQEFECEFLGSVDTLIAASKLKAL 284 Query: 209 KFGNILREPNFGETIRVYEAPQEHHKYMVLADAAKG-AIDGFVFHVIDVTNIPFKQVASG 267 F + ++ + + +YE P+E +Y++ D ++G D F + D+T +P+K V Sbjct: 285 VFNDPIKR---NKGLDIYEEPKEKSEYLMTVDVSRGIGGDYSAFIIFDITTVPYKVVGKY 341 Query: 268 KISE-SYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQ------- 319 + +E ++ P I ++ R+YN A +CE N+ G V +L EY N+ Sbjct: 342 RNNEIKPMLFPNIINDLARSYNNAWVLCEVND-IGDQVASILNYDLEYPNVLMCAMRGRA 400 Query: 320 --------EPDKKWLGVRTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNGK 371 K LGV+ + + + +N+K +E +KLI D + EL TF Sbjct: 401 GQLVGQGFSGSKTQLGVKMSITVKKVGCANLKTIVEEDKLIFNDYDIINELTTFIQKKQS 460 Query: 372 YQAQNSKAHDDYVMALSLLFVPLLDLNNIVDYDVFLNKINSDSEITDGDVK 422 ++A + HDD VM + ++F L V D F E+TD D++ Sbjct: 461 FEA-DEGFHDDLVMCM-VIFAWL------VQQDYF-------KEMTDNDIR 496 >gi|22263|lcl|protein:vir:104536 Length: 550 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214662;genbank:gi:61806303;genbank:GeneID :3294591 Length = 550 Score = 162 bits (410), Expect = 8e-42, Method: Compositional matrix adjust. 
Identities = 107/320 (33%), Positives = 174/320 (54%), Gaps = 34/320 (10%) Query: 97 VSHNCAFIDKWS--------EFSNSVIPTISASKKSQIIAASTPVGLNHWYKMWSDAVEG 148 +S N F+D+++ +F SV PTIS+ K +++I STP G+N +YK+W DA G Sbjct: 168 MSFNIIFLDEFAFVPNHIAEQFFASVYPTISSGKSTKVIIISTPHGMNQFYKLWHDAERG 227 Query: 149 KSSYKPFKVEWWKVPGRDENYKELMIK-TLEGGIRTWNQEYACEFIGSSDTLVDMTVLSN 207 ++Y +V W +VPGRD+ +K+ I+ T E R E+ CEF+GS DTL+ + L Sbjct: 228 ANNYVATEVHWSQVPGRDDKWKQQTIENTSEAQFRV---EFECEFLGSVDTLITPSKLRI 284 Query: 208 IKFGNILREPNFGETIRVYEAPQEHHKYMVLADAAKGAIDGF-VFHVIDVTNIPFKQVAS 266 + + + ++E N G + VYE QE+H Y++ D ++G + + F VID T +P+K VA Sbjct: 285 MPYKDPIQE-NRG--LAVYEHVQENHNYIITVDVSRGVGNDYSAFCVIDTTTVPYKVVAR 341 Query: 267 GKISE-SYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQ------ 319 K ++ L+ P + ++ YN A +CE N+ G V D++ EYEN+ Sbjct: 342 YKNNQIKPLVFPNLIVDVATNYNGAYVLCEVND-IGGQVADIIQYDLEYENLLMVSMRGR 400 Query: 320 ---------EPDKKWLGVRTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNG 370 K LG++ + + + SN+K IE++KLI++D T+ EL TF Sbjct: 401 AGQQLGQGFSGKKTQLGIKMSTAVKQVGCSNLKALIEDDKLIVEDYDTIAELTTFIQKGQ 460 Query: 371 KYQAQNSKAHDDYVMALSLL 390 +QA++ +DD M L + Sbjct: 461 SFQAEDG-CNDDLAMCLVIF 479 >gi|23408|lcl|protein:vir:103169 Length: 549 # NCBI annotation: gp123 # Family: family:all:147 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717790;genbank:gi:113200627;genbank:GeneI D:4239178 Length = 549 Score = 159 bits (401), Expect = 7e-41, Method: Compositional matrix adjust. 
Identities = 112/369 (30%), Positives = 182/369 (49%), Gaps = 38/369 (10%) Query: 97 VSHNCAFIDKWS--------EFSNSVIPTISASKKSQIIAASTPVGLNHWYKMWSDAVEG 148 +S N F+D+++ +F +SV PTIS+ K +++I STP G+N +YK+W DA Sbjct: 167 MSFNVIFLDEFAFVPNHVADQFFSSVYPTISSGKSTKVIIISTPHGMNMFYKLWHDAERK 226 Query: 149 KSSYKPFKVEWWKVPGRDENYKELMIK-TLEGGIRTWNQEYACEFIGSSDTLVDMTVLSN 207 + Y P +V W +VPGRD +KE IK T E R E+ CEF+GS DTL+ + L Sbjct: 227 ANEYIPTEVHWSEVPGRDAAWKEQTIKNTSEQQFRV---EFECEFLGSVDTLISPSKLRT 283 Query: 208 IKFGNILREPNFGETIRVYEAPQEHHKYMVLADAAKGAIDGF-VFHVIDVTNIPFKQVAS 266 + +G+ + E N + +YE + H Y++ AD ++G + F VID T IP+K VA Sbjct: 284 MVYGDPIAEKN---GLSMYEKTIQGHTYVITADVSRGVSGDYSAFLVIDTTTIPYKLVAK 340 Query: 267 GKISE-SYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQ------ 319 + ++ ++ P I ++ R YN A + E N+ G V D++ EY+N+ Sbjct: 341 YRNNDIKPILFPNIIVDVARNYNHAFVLVEVND-VGGQVADIIQYDLEYDNLLMCAMRGR 399 Query: 320 ---------EPDKKWLGVRTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNG 370 K +G++ + + + SN+K +E++K +L D + EL TF Sbjct: 400 AGQQLGQGFSGKKTQMGIKMSSATKQVGCSNLKALLEDDKFLLNDYDCISELTTFIQKGQ 459 Query: 371 KYQAQNSKAHDDYVMALSLLFVPLLD--LNNIVDYDVFLNKINSDSEITDGDVKYLQMGF 428 +QA+ +DD M + + + + D DV + E + D+ GF Sbjct: 460 TFQAEEG-CNDDLAMCMVIFAWMAMQPYFKELHDNDVRQRIYDDQREAIEQDMA--PFGF 516 Query: 429 FDDGTSSFY 437 DDG Y Sbjct: 517 MDDGLGEEY 525 >gi|24012|lcl|protein:vir:104946 Length: 547 # NCBI annotation: T4-like DNA packaging large subunit terminase # Family: family:all:147 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214360;genbank:gi:61806000;genbank:GeneID :3294466 Length = 547 Score = 155 bits (391), Expect = 1e-39, Method: Compositional matrix adjust. 
Identities = 101/318 (31%), Positives = 164/318 (51%), Gaps = 32/318 (10%) Query: 98 SHNCAFIDKWS--------EFSNSVIPTISASKKSQIIAASTPVGLNHWYKMWSDAVEGK 149 S+N F+D+++ +F SV PTI++ + +++I STP G+NH+Y+MW D+ +GK Sbjct: 166 SYNVIFLDEFAFIPNHIADDFFASVYPTITSGQSTKVIIVSTPRGMNHFYRMWHDSEKGK 225 Query: 150 SSYKPFKVEWWKVPGRDENYKELMIKTLEGGIRTWNQEYACEFIGSSDTLVDMTVLSNIK 209 S Y V W +VPGRDE +KE I + + E+ CEF+GS +TL++ L N+ Sbjct: 226 SEYVATDVHWSEVPGRDEEWKEQTIANTSE--QQFKIEFECEFLGSVNTLINPAKLRNLV 283 Query: 210 FGNILREPNFGETIRVYEAPQEHHKYMVLADAAKGAIDGF-VFHVIDVTNIPFKQVASGK 268 + + N G + +YE P + H Y++ D A+G + + F V D T P+K VA + Sbjct: 284 Y-EAPKTRNAG--LDIYETPVKEHNYIITVDVARGLGNDYSAFIVFDTTEFPYKVVAKYR 340 Query: 269 ISE-SYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQEP------ 321 +E ++ P I ++ + YN A + E N+ G V +L EYEN+ Sbjct: 341 NNEIKPMLFPNIILDVAKGYNNAYLLIEVND-IGDQVASILQYDLEYENVLMASMRGRAG 399 Query: 322 ---------DKKWLGVRTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNGKY 372 K LGVR T + + SN+K +E++KL+ D + EL TF + + Sbjct: 400 QIVGQGFSGKKTQLGVRMTSAVKKLGCSNLKTMMEDDKLLTCDYEIISELTTFAQRHNSF 459 Query: 373 QAQNSKAHDDYVMALSLL 390 +A+ +DD M L + Sbjct: 460 EAEEG-CNDDLAMCLVIF 476 >gi|24596|lcl|protein:vir:98262 Length: 609 # NCBI annotation: gp17 terminase subunit, nuclease and ATPase # Family: family:all:147 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239195;genbank:gi:66391670;genbank:GeneID :3416364 Length = 609 Score = 153 bits (387), Expect = 3e-39, Method: Compositional matrix adjust. 
Identities = 106/304 (34%), Positives = 163/304 (53%), Gaps = 22/304 (7%) Query: 101 CAFIDKWSEFSNSVIPTISASKKSQIIAASTPVGLNHWYKMWSDAVEGKSSYKPFKVEWW 160 CAFI +++ ++ P IS+ +KS+I+ +TP GLNH+Y +W+ AVEGKS + P+ W Sbjct: 255 CAFIPNFTDAWLAIQPVISSGRKSKILITTTPNGLNHFYDIWNAAVEGKSGFVPYTAIWT 314 Query: 161 KVPGR----------DENYKELMIKTLEGGIR-TWNQEYACEFIGSSDTLVDMTVLSNIK 209 V R D+ Y K + G + + QE+ EF+G++ TL+ LS + Sbjct: 315 SVKERLYTDGDNGVFDDGYS-WSAKMIAGSSKEAFLQEHCAEFMGTNGTLISGWKLSKMS 373 Query: 210 FGNI-LREPNFGETIRVYEAPQEHHKYMVLADAAKG-AIDGFVFHVIDVTNIPFKQVASG 267 + +I E NF + Y+ P+E HKY+ + D A+G D H+ID+T +PF+QVA Sbjct: 374 WIDIDETETNFYQ----YKKPEEGHKYVAVLDPAEGRGQDYHAMHIIDITTLPFEQVAVY 429 Query: 268 KISE-SYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQEPDKKWL 326 + S+L+ P I L YNEA E N G SV LF EYEN+ + L Sbjct: 430 HSNRTSHLILPDILLRYLTMYNEAWIYIELN-STGHSVAKSLFSELEYENVICDSYND-L 487 Query: 327 GVRTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNGKYQAQNSKAHDDYVMA 386 G++ TK +++ S +K IE +KLI+ ++ T+ E TF + A+ HDD VM+ Sbjct: 488 GMKQTKRSKAIGCSTLKDLIEKDKLIINNKKTILEFRTFSEKGVSWAAEEG-FHDDLVMS 546 Query: 387 LSLL 390 L+ Sbjct: 547 LACF 550 >gi|20804|lcl|protein:vir:106290 Length: 633 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944105;genbank:gi:38640149;genbank:GeneID :2658038 Length = 633 Score = 153 bits (386), Expect = 4e-39, Method: Compositional matrix adjust. 
Identities = 107/356 (30%), Positives = 174/356 (48%), Gaps = 35/356 (9%) Query: 101 CAFIDKWSEFSNSVIPTISASKKSQIIAASTPVGLNHWYKMWSDAVEGKSSYKPFKVEWW 160 CAFI+ + + +++P IS+ ++S+II STP G+NHWY +W +++ +KP+ W Sbjct: 281 CAFIEGFEDTWKAILPVISSGRQSRIILTSTPNGINHWYDLWEVSLKSDKGFKPYTTTWI 340 Query: 161 KVPGR--------DENYKELMIKTLEGGIRTWNQEYACEFIGSSDTLVDMTVLSNIKFGN 212 V R D+ ++ + + + QE+ C F+G+S TL++ LS + + Sbjct: 341 TVKERLYDGSDAYDDGFEWASKQINSSSVEAFQQEHLCRFMGTSGTLINGFKLSKMTWKE 400 Query: 213 ILREPNFGETIRVYEAPQEHHKYMVLADAAKG-AIDGFVFHVIDVTNIPFKQVA---SGK 268 ++ + NF + E P E +KY+ D A+G D +IDVT+ P++QVA S K Sbjct: 401 VIADDNFYQI----EKPVEGNKYIATVDPAEGRGQDYSTIQIIDVTSYPYRQVAVYHSNK 456 Query: 269 ISESYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQEPDKKWLGV 328 IS L+ P + YN A E N G V LF EYEN+ + K LG+ Sbjct: 457 ISP--LLLPSVIMRYAMEYNNAWVYIELN-SIGNMVAKSLFIDLEYENVIVDSSKD-LGM 512 Query: 329 RTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNGKYQAQNSKAHDDYVMALS 388 + TK ++ S +K IE +KLI+ + T++E TF + AQ+ HDD VM+L Sbjct: 513 KQTKVTKAVGCSTLKDLIEKDKLIVSHKGTIQEFRTFVEKGVSWAAQDG-FHDDLVMSLC 571 Query: 389 LL--------FVPLLDLNNIVDYDVFLNKINSDSEITDGDVKYLQMGFFDDGTSSF 436 + F +D + DVF SE+ + + DDG +++ Sbjct: 572 IFAYLTTQERFGDFIDATRNIGADVF------QSEMEEMLEDFCVGAIIDDGINTY 621 >gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891724;genbank:gi:33620526;genbank:GeneID :1725332 Length = 607 Score = 153 bits (386), Expect = 5e-39, Method: Compositional matrix adjust. 
Identities = 97/300 (32%), Positives = 156/300 (52%), Gaps = 17/300 (5%) Query: 101 CAFIDKWSEFSNSVIPTISASKKSQIIAASTPVGLNHWYKMWSDAVEGKSSYKPFKVEWW 160 CAFI W++ ++ P IS+ ++S++I +TP GLNH+Y +W A++GKS Y P++ W Sbjct: 255 CAFIQNWTDCFLAIQPVISSGRESKMIMTTTPNGLNHFYDIWQSAIDGKSGYVPYEAVWH 314 Query: 161 KVPGR--------DENYKELMIKTLEGGIRTWNQEYACEFIGSSDTLVDMTVLSNIKFGN 212 V R D+ Y+ + + QE+ EF GSS TL+ T LS + F + Sbjct: 315 SVKERLYNKADIFDDGYEWSSQAIAGSSLEQFLQEHNAEFFGSSGTLIRATTLSRLSFID 374 Query: 213 ILREPNFGETIRVYEAPQEHHKYMVLADAAKG-AIDGFVFHVIDVTNIPFKQVAS-GKIS 270 ++ + F + +E P+E KY+ D ++G D +ID+T P+KQVA + Sbjct: 375 VVNDNGFYQ----FEKPKEGRKYVATLDCSEGRGQDYHALQIIDITEFPYKQVAVYHSNT 430 Query: 271 ESYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQEPDKKWLGVRT 330 S+ + P I + L YNE E N G S+ L EY+NI + LG++ Sbjct: 431 TSHFILPDIVFKYLMMYNECPVYIELN-STGVSIAKSLAMDLEYDNIICDSFID-LGMKQ 488 Query: 331 TKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNGKYQAQNSKAHDDYVMALSLL 390 +K +++ S +K IE +KLI+ + T++EL TF + A+ HDD VM+L + Sbjct: 489 SKRSKAMGCSALKDLIEKDKLIINHKGTIQELRTFSEKGVSWAAEEG-FHDDLVMSLVIF 547 >gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469498;genbank:gi:157311455;genbank:Ge neID:5602125 Length = 607 Score = 150 bits (379), Expect = 3e-38, Method: Compositional matrix adjust. 
Identities = 96/300 (32%), Positives = 155/300 (51%), Gaps = 17/300 (5%) Query: 101 CAFIDKWSEFSNSVIPTISASKKSQIIAASTPVGLNHWYKMWSDAVEGKSSYKPFKVEWW 160 CAFI W++ ++ P IS+ ++S++I +TP GLNH+Y +W A++GKS Y P++ W Sbjct: 255 CAFIQNWTDCFLAIQPVISSGRESKMIMTTTPNGLNHFYDIWQSAIDGKSGYVPYEAVWH 314 Query: 161 KVPGR--------DENYKELMIKTLEGGIRTWNQEYACEFIGSSDTLVDMTVLSNIKFGN 212 V R D+ Y+ + + QE+ EF GSS TL+ T LS + F + Sbjct: 315 SVKERLYNKADIFDDGYEWSSQAIAGSSLEQFLQEHNAEFFGSSGTLIRATTLSRLSFID 374 Query: 213 ILREPNFGETIRVYEAPQEHHKYMVLADAAKG-AIDGFVFHVIDVTNIPFKQVAS-GKIS 270 ++ + F + +E P+E KY+ D ++G D +ID+T P+K VA + Sbjct: 375 VVNDNGFYQ----FEKPKEGRKYVATLDCSEGRGQDYHALQIIDITEFPYKPVAVYHSNT 430 Query: 271 ESYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQEPDKKWLGVRT 330 S+ + P I + L YNE E N G S+ L EY+NI + LG++ Sbjct: 431 TSHFILPDIVFKYLMMYNECPVYIELN-STGVSIAKSLAMDLEYDNIICDSFID-LGMKQ 488 Query: 331 TKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNGKYQAQNSKAHDDYVMALSLL 390 +K +++ S +K IE +KLI+ + T++EL TF + A+ HDD VM+L + Sbjct: 489 SKRSKAMGCSALKDLIEKDKLIINHKGTIQELRTFSEKGVSWAAEEG-FHDDLVMSLVIF 547 >gi|28182|lcl|protein:vir:108053 Length: 611 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595293;genbank:gi:161622599;genbank:Ge neID:5783772 Length = 611 Score = 142 bits (358), Expect = 8e-36, Method: Compositional matrix adjust. 
Identities = 111/350 (31%), Positives = 175/350 (50%), Gaps = 21/350 (6%) Query: 101 CAFIDKWSEFSNSVIPTISASKKSQIIAASTPVGLNHWYKMWSDAVEGKSSYKPFKVEWW 160 CAFI + + ++ P IS+ ++S+II +TP GLNH+Y +W+ AVEGKS + P+ W Sbjct: 257 CAFIPNFLDSWLAIQPVISSGRRSKIIITTTPNGLNHFYDIWTAAVEGKSGFAPYTAIWN 316 Query: 161 KVPGRDENYKELMIKTLEGGIRT--------WNQEYACEFIGSSDTLVDMTVLSNIKFGN 212 V R N ++ E +T + QE+ EF G+S TL+ L+ + + Sbjct: 317 SVKERLYNDADIFDDGWEWSSQTISASSLAQFRQEHCAEFQGTSGTLISGMKLAIMDWKE 376 Query: 213 ILREPNFGETIRVYEAPQEHHKYMVLADAAKG-AIDGFVFHVIDVTNIPFKQVASGKISE 271 ++ P G R +E P HKY+ D ++G D H+IDVT ++QVA +E Sbjct: 377 VI--PENGYFYRFHE-PDPTHKYIASLDCSEGRGQDYHALHIIDVTTDEWEQVAVLHSNE 433 Query: 272 -SYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQEPDKKWLGVRT 330 S+++ P I Y L YNEA E N G SV L+ EYEN+ + + LG++ Sbjct: 434 ISHMILPDIVYKYLMEYNEAPVYIELN-STGVSVAKSLYMDLEYENVICDSMQD-LGMKQ 491 Query: 331 TKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNGKYQAQNSKAHDDYVMALSLL 390 T+ + S +K IE +KL L + T+ E TF + A++ HDD VM+L ++ Sbjct: 492 TRRTKPVGCSTLKDLIEKDKLKLNHKQTIMEFRTFSQNKLSWAAEDG-FHDDLVMSL-VI 549 Query: 391 FVPLLDLNNIVDY----DVFLNKINSDSEITDGDVKYLQMGFFDDGTSSF 436 F L D+ ++ L E+ D + +Y + F D G +S+ Sbjct: 550 FAWLTTQQKFADFIDRDEMRLASEVFSRELEDMNEEYNPVVFVDAGDNSY 599 >gi|23191|lcl|protein:vir:101186 Length: 613 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932508;genbank:gi:37651634;genbank:GeneID :2610679 Length = 613 Score = 139 bits (349), Expect = 8e-35, Method: Compositional matrix adjust. 
Identities = 109/368 (29%), Positives = 178/368 (48%), Gaps = 46/368 (12%) Query: 102 AFIDKWSEFSNSVIPTISASKKSQIIAASTPVGLNHWYKMWSDAVE-------GKSSYKP 154 AFI +++ ++ P IS+ ++S+I+ +TP GLNHWY +W+ A+ KS + P Sbjct: 249 AFIPNFTDAWMAIQPVISSGRRSKILMTTTPNGLNHWYDIWTAAITPNSSGDGSKSGFVP 308 Query: 155 FKVEWWKVPGR---------------DENYKELMIKTLEG-GIRTWNQEYACEFIGSSDT 198 + W V R D+ Y KT+ G + + QE+ F G+S T Sbjct: 309 YTATWSSVKERLYSDGKELSGSDSYFDDGY-SWSSKTIAGSALDAFQQEHNTAFQGTSGT 367 Query: 199 LVDMTVLSNIKFGNILREPNFGETIRVYEAPQEHHKYMVLADAAKG-AIDGFVFHVIDVT 257 L++ T LS + + +I + NF ++E P+E KY+ D+A+G D H+ D+T Sbjct: 368 LINGTKLSKLNWIDIPPQDNF----TMFEEPKEGRKYIATLDSAEGRGQDYHAMHIFDIT 423 Query: 258 NIPFKQVAS-GKISESYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYEN 316 P+KQVA + S+L+ P + L Y + E N G S+ L+ +YEN Sbjct: 424 EFPYKQVAVYHSNTTSHLILPDVLLKYLNMYFQPYIYIELN-STGVSIAKSLYSELDYEN 482 Query: 317 I----YQEPDKKWLGVRTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNGKY 372 + YQ+ LG++ TK +++ S +K IE +KLIL + ++ EL TF + Sbjct: 483 VICDSYQD-----LGLKQTKRSKAIGCSTLKDLIEKDKLILNHKKSIMELRTFSEKGVSW 537 Query: 373 QAQNSKAHDDYVMALSLLFVPLLDLNNIVDY----DVFLNKINSDSEITDGDVKYLQMGF 428 A+ HDD VM+L ++F L D+ D+ L E+ + + Y+ M Sbjct: 538 AAEEG-FHDDLVMSL-VIFAWLTTQERFSDFTENDDMRLANEVFRKEMEELNDDYMPMVI 595 Query: 429 FDDGTSSF 436 DDG +F Sbjct: 596 VDDGEDTF 603 >gi|22942|lcl|protein:vir:101803 Length: 613 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238880;genbank:gi:66391955;genbank:GeneID :3416630 Length = 613 Score = 138 bits (348), Expect = 1e-34, Method: Compositional matrix adjust. 
Identities = 109/368 (29%), Positives = 178/368 (48%), Gaps = 46/368 (12%) Query: 102 AFIDKWSEFSNSVIPTISASKKSQIIAASTPVGLNHWYKMWSDAVE-------GKSSYKP 154 AFI +++ ++ P IS+ ++S+I+ +TP GLNHWY +W+ A+ KS + P Sbjct: 249 AFIPNFTDAWMAIQPVISSGRRSKILMTTTPNGLNHWYDIWTAAITPNSSGDGSKSGFVP 308 Query: 155 FKVEWWKVPGR---------------DENYKELMIKTLEG-GIRTWNQEYACEFIGSSDT 198 + W V R D+ Y KT+ G + + QE+ F G+S T Sbjct: 309 YTATWSSVKERLYSDGKELSGSDSYFDDGY-SWSSKTIAGSALDAFQQEHNTAFQGTSGT 367 Query: 199 LVDMTVLSNIKFGNILREPNFGETIRVYEAPQEHHKYMVLADAAKG-AIDGFVFHVIDVT 257 L++ T LS + + +I + NF ++E P+E KY+ D+A+G D H+ D+T Sbjct: 368 LINGTKLSKLNWIDIPPQDNF----TMFEEPKEGRKYIATLDSAEGRGQDYHAMHIFDIT 423 Query: 258 NIPFKQVAS-GKISESYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYEN 316 P+KQVA + S+L+ P + L Y + E N G S+ L+ +YEN Sbjct: 424 EFPYKQVAVYHSNTTSHLILPDVLLKYLNMYFQPYIYIELN-STGVSIAKSLYSELDYEN 482 Query: 317 I----YQEPDKKWLGVRTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNGKY 372 + YQ+ LG++ TK +++ S +K IE +KLIL + ++ EL TF + Sbjct: 483 VICDSYQD-----LGLKQTKRSKAIGCSTLKDLIEKDKLILNHKKSIMELRTFSEKGVSW 537 Query: 373 QAQNSKAHDDYVMALSLLFVPLLDLNNIVDY----DVFLNKINSDSEITDGDVKYLQMGF 428 A+ HDD VM+L ++F L D+ D+ L E+ + + Y+ M Sbjct: 538 AAEEG-FHDDLVMSL-VIFAWLTTQERFSDFTENDDMRLANEVFRKEMEELNDDYMPMVI 595 Query: 429 FDDGTSSF 436 DDG +F Sbjct: 596 VDDGEDTF 603 >gi|27407|lcl|protein:vir:7202 Length: 610 # NCBI annotation: gp17 terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049776;genbank:gi:9632591;genbank:GeneID: 1258647 Length = 610 Score = 134 bits (337), Expect = 2e-33, Method: Compositional matrix adjust. 
Identities = 109/353 (30%), Positives = 172/353 (48%), Gaps = 25/353 (7%) Query: 101 CAFIDKWSEFSNSVIPTISASKKSQIIAASTPVGLNHWYKMWSDAVEGKSSYKPFKVEWW 160 CAFI + + ++ P IS+ ++S+II +TP GLNH+Y +W+ AVEGKS ++P+ W Sbjct: 257 CAFIPNFHDSWLAIQPVISSGRRSKIIITTTPNGLNHFYDIWTAAVEGKSGFEPYTAIWN 316 Query: 161 KVPGRDENYKELM-------IKTLEG-GIRTWNQEYACEFIGSSDTLVDMTVLSNIKFGN 212 V R N +++ I+T+ G + + QE+ F G+S TL+ L+ + F Sbjct: 317 SVKERLYNDEDIFDDGWQWSIQTINGSSLAQFRQEHTAAFEGTSGTLISGMKLAVMDFIE 376 Query: 213 ILREPNFGETIRVYEAPQEHHKYMVLADAAKG-AIDGFVFHVIDVTNIPFKQVA---SGK 268 + + + ++ P+ KY+ D ++G D H+IDVT+ ++QV S Sbjct: 377 VTPDDH---GFHQFKKPEPDRKYIATLDCSEGRGQDYHALHIIDVTDDVWEQVGVLHSNT 433 Query: 269 ISESYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQEPDKKWLGV 328 I S+L+ P I L YNE E N G SV L+ EYE + + LG+ Sbjct: 434 I--SHLILPDIVMRYLVEYNECPVYIELN-STGVSVAKSLYMDLEYEGVICDSYTD-LGM 489 Query: 329 RTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNGKYQAQNSKAHDDYVMALS 388 + TK ++ S +K IE +KLI+ T++E TF G A HDD VM+L Sbjct: 490 KQTKRTKAVGCSTLKDLIEKDKLIIHHRATIQEFRTFSE-KGVSWAAEEGYHDDLVMSL- 547 Query: 389 LLFVPLLDLNNIVDY----DVFLNKINSDSEITDGDVKYLQMGFFDDGTSSFY 437 ++F L + +DY D+ L E+ D Y + F D S+ Y Sbjct: 548 VIFGWLSTQSKFIDYADKDDMRLASEVFSKELQDMSDDYAPVIFVDSVHSAEY 600 >gi|22056|lcl|protein:vir:103455 Length: 610 # NCBI annotation: terminase subunit nuclease and ATPase # Family: family:all:147 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803107;genbank:gi:116326387;genbank:GeneI D:4405484 Length = 610 Score = 134 bits (337), Expect = 2e-33, Method: Compositional matrix adjust. 
Identities = 108/353 (30%), Positives = 173/353 (49%), Gaps = 25/353 (7%) Query: 101 CAFIDKWSEFSNSVIPTISASKKSQIIAASTPVGLNHWYKMWSDAVEGKSSYKPFKVEWW 160 CAFI + + ++ P IS+ ++S+II +TP GLNH+Y +W+ AVEGKS ++P+ W Sbjct: 257 CAFIPNFHDSWLAIQPVISSGRRSKIIITTTPNGLNHFYDIWTAAVEGKSGFEPYTAIWN 316 Query: 161 KVPGRDENYKELM-------IKTLEGG-IRTWNQEYACEFIGSSDTLVDMTVLSNIKFGN 212 V R N +++ I+T+ G + + QE+ F G+S TL+ L+ + F Sbjct: 317 SVKERLYNDEDIFDDGWQWSIQTINGSTLAQFRQEHTAAFEGTSGTLISGMKLAIMDFIE 376 Query: 213 ILREPNFGETIRVYEAPQEHHKYMVLADAAKG-AIDGFVFHVIDVTNIPFKQVA---SGK 268 + + + ++ P+ KY+ D ++G D H+IDVT+ ++QV S Sbjct: 377 VTPDDH---GFHRFKKPEPDRKYIATLDCSEGRGQDYHALHIIDVTDDVWEQVGVLHSNT 433 Query: 269 ISESYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQEPDKKWLGV 328 I S+L+ P I L YNE E N G SV L+ EYE + + LG+ Sbjct: 434 I--SHLILPDIVMRYLVEYNECPVYIELN-STGVSVAKSLYMDLEYEGVICDSYTD-LGM 489 Query: 329 RTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNGKYQAQNSKAHDDYVMALS 388 + TK ++ S +K IE +KLI+ T++E TF + A+ HDD VM+L Sbjct: 490 KQTKRTKAVGCSTLKDLIEKDKLIIHHRATIQEFRTFSEKGVSWAAEEG-YHDDLVMSL- 547 Query: 389 LLFVPLLDLNNIVDY----DVFLNKINSDSEITDGDVKYLQMGFFDDGTSSFY 437 ++F L + +DY D+ L E+ D Y + F D S+ Y Sbjct: 548 VIFGWLSTQSKFIDYADKDDMRLASEVFSKELQDMSDDYAPVIFVDSVHSAEY 600 >gi|27701|lcl|protein:vir:6893 Length: 611 # NCBI annotation: gp17 terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861869;genbank:gi:32453660;genbank:GeneID :1494295 Length = 611 Score = 131 bits (329), Expect = 2e-32, Method: Compositional matrix adjust. 
Identities = 103/351 (29%), Positives = 167/351 (47%), Gaps = 25/351 (7%) Query: 101 CAFIDKWSEFSNSVIPTISASKKSQIIAASTPVGLNHWYKMWSDAVEGKSSYKPFKVEWW 160 CAFI + + ++ P IS+ ++S+II +TP GLNH+Y +W+ AVEGKS ++P+ W Sbjct: 257 CAFIPNFIDSWLAIQPVISSGRRSKIIITTTPNGLNHFYDIWTAAVEGKSGFEPYTAIWN 316 Query: 161 KVPGR--------DENYKELMIKTLEGGIRTWNQEYACEFIGSSDTLVDMTVLSNIKFGN 212 V R D+ ++ + + QE+ F G+S TL+ L+ + + Sbjct: 317 SVKERLYNDEDIFDDGWQWSKQTISASSLTQFRQEHTAAFEGTSGTLISGMKLAILDYIE 376 Query: 213 ILREPNFGETIRVYEAPQEHHKYMVLADAAKG-AIDGFVFHVIDVTNIPFKQVA---SGK 268 + + + ++ P+E HKY+ D ++G D H+IDVT ++QV S Sbjct: 377 VTPDSH---GFHQFKKPEEGHKYIATLDCSEGRGQDYHAMHIIDVTTDKWEQVGVLHSNT 433 Query: 269 ISESYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQEPDKKWLGV 328 I S+L+ P I + L YNE E N G SV L+ EYEN+ + LG+ Sbjct: 434 I--SHLILPDIVFKYLMEYNECPIYIELN-STGVSVAKSLYMDLEYENVICDSMND-LGM 489 Query: 329 RTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNGKYQAQNSKAHDDYVMALS 388 + ++ + S +K IE +KL + T++E TF + A+ HDD VM L Sbjct: 490 KQSRRTKPVGCSTLKDLIEKDKLKINHRATIQEFRTFSEKGVSWAAEEG-YHDDLVMGL- 547 Query: 389 LLFVPLLDLNNIVDY----DVFLNKINSDSEITDGDVKYLQMGFFDDGTSS 435 ++F L DY D+ L E+ D + Y + F D ++S Sbjct: 548 VIFGWLSTQQKFADYADKDDMRLASEVFSRELQDMNDDYAPVIFVDCASNS 598 >gi|21077|lcl|protein:vir:100538 Length: 612 # NCBI annotation: gp17 terminase subunit # Family: family:all:147 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656379;genbank:gi:109290130;genbank:GeneI D:4156516 Length = 612 Score = 127 bits (318), Expect = 4e-31, Method: Compositional matrix adjust. 
Identities = 95/310 (30%), Positives = 149/310 (48%), Gaps = 31/310 (10%) Query: 102 AFIDKWSEFSNSVIPTISASKKSQIIAASTPVGLNHWYKMW-------SDAVEGKSSYKP 154 AFI +++ ++ P IS+ + S+I+ +TP GLNHWY +W SD KS + P Sbjct: 248 AFIPNFNDAWLAIQPVISSGRHSKILMTTTPNGLNHWYDIWTAAITPNSDGSGSKSGFVP 307 Query: 155 FKVEWWKVPGR---DENYKELMIKTLEGGI------------RTWNQEYACEFIGSSDTL 199 + W V R D + + I L I R + QE+ F G+S TL Sbjct: 308 YTATWSSVKERMYSDGSKTDGAIHILTTDILGQPRQSPVLALRAFQQEHNTAFQGTSGTL 367 Query: 200 VDMTVLSNIKFGNILREPNFGETIRVYEAPQEHHKYMVLADAAKG-AIDGFVFHVIDVTN 258 ++ LS + + + NF +++ P E HKY+ D+A+G D H+ D+T Sbjct: 368 INGFKLSKMTWKEVPASDNF----TMFKEPIEGHKYIATLDSAEGRGQDYHAMHIYDITE 423 Query: 259 IPFKQVAS-GKISESYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENI 317 P++QVA + S+L+ P + L Y + E N G S+ L+ EYENI Sbjct: 424 FPYEQVAVYHSNTTSHLILPDVLLKYLNMYYQPYIYIELN-ATGVSIAKSLYSELEYENI 482 Query: 318 YQEPDKKWLGVRTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNGKYQAQNS 377 + LG++ TK +++ S +K IE KL+L + T+ EL TF + A++ Sbjct: 483 ICDSYND-LGMKQTKRSKAIGCSTLKDLIEKEKLVLYHKGTIMELRTFSEKGVSWAAEDG 541 Query: 378 KAHDDYVMAL 387 HDD VM+L Sbjct: 542 -FHDDLVMSL 550 >gi|16897|lcl|protein:vir:5867 Length: 508 # NCBI annotation: similar to terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835653;genbank:gi:30044056 Length = 508 Score = 51.2 bits (121), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 71/303 (23%), Positives = 122/303 (40%), Gaps = 36/303 (11%) Query: 96 FVSHNCAFIDKWSEFSNSVIPTISASKKSQIIAASTPVGLNHWY-KMWSDAVEGKSSYKP 154 + AFI E SV T++ K I ST G+ +WY + A EGKS +K Sbjct: 153 LIVEEAAFISNMEELWASVQQTLATGGKC--IVNSTYNGVGNWYERTIRAAKEGKSEFKY 210 Query: 155 FKVEWWKVPGRDENYKELMIKTLEGGIRTWNQEYACEFIGSSDTLVDMTVLSNIKFGNIL 214 F ++W P RDE + E + L R + QE C GS + ++ ++ +F + Sbjct: 211 FGIKWSDHPERDEKWFEEQKRLLPP--RVFAQEILCIPQGSGENVIPFHLIREEEFIDPF 268 Query: 215 REPNFGETIRVYEAPQEHHKYMVLADAAKGAID-----GFVFHVIDVTNIPFKQVASGKI 269 G+ Y P Y + D A G + G +D + +QVA Sbjct: 269 VVKYGGDYWEWYRKP---GYYFISVDPASGRGEDRSAVGVQVLWVDPQTLTIEQVAEFAS 325 Query: 270 SESYLMAPPIFYNILRT-YNE----AMFVCENNEGAGTSVVDLLFQMYEYENIYQEPDKK 324 ++ L P+ +++ Y+E +F+ N G G +Y++ Y Sbjct: 326 DKTSL---PVMRQVIKQIYDEFKPQLIFIETNGIGMG---------LYQFMEAY---TPS 370 Query: 325 WLGVRTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFCNVNGKYQAQNSKAHDDYV 384 +G TT+ + + E+ +LIL+ + +++L V K + + +D Sbjct: 371 IVGYYTTQRKKVHGSDLLAKLYEDGRLILRSKRLLEQLQRTTWVKNKVE---TAGRNDLY 427 Query: 385 MAL 387 MAL Sbjct: 428 MAL 430 >gi|17621|lcl|protein:vir:816 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050549;genbank:gi:9633446;genbank:GeneID: 1262208 Length = 568 Score = 35.0 bits (79), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 52/191 (27%), Positives = 79/191 (41%), Gaps = 22/191 (11%) Query: 214 LREPNFGETIR-------VYEAPQEHHKYMVLADAAKGAIDGFVFHVIDVTNIPFKQVAS 266 LRE N E R V+E P +Y+ AD A+G G + V +QVA Sbjct: 352 LREGNKNELQRTLMNYLLVWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSNGEQVAH 411 Query: 267 --GKISESYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQEP--- 321 G + ++ L A + + R YN A FV G +V+ L ++Y IY E Sbjct: 412 WFGHL-DAELFA-HLISQVCRMYNNA-FVGPERNNHGHAVILKLRELYPTRYIYNEQHLD 468 Query: 322 -----DKKWLGVRTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFC-NVNGKYQAQ 375 D LG TT+ ++ MK + N ++ T+ E+ T+ + G AQ Sbjct: 469 QAYDDDTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQ 528 Query: 376 NSKAHDDYVMA 386 DD +M+ Sbjct: 529 EG-CFDDQLMS 538 >gi|25680|lcl|protein:vir:10116 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859246;genbank:gi:32171173;genbank:GeneID :2653342 Length = 568 Score = 35.0 bits (79), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 52/191 (27%), Positives = 79/191 (41%), Gaps = 22/191 (11%) Query: 214 LREPNFGETIR-------VYEAPQEHHKYMVLADAAKGAIDGFVFHVIDVTNIPFKQVAS 266 LRE N E R V+E P +Y+ AD A+G G + V +QVA Sbjct: 352 LREGNKNELQRTLMNYLLVWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSNGEQVAH 411 Query: 267 --GKISESYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQEP--- 321 G + ++ L A + + R YN A FV G +V+ L ++Y IY E Sbjct: 412 WFGHL-DAELFA-HLISQVCRMYNNA-FVGPERNNHGHAVILKLRELYPTRYIYNEQHLD 468 Query: 322 -----DKKWLGVRTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFC-NVNGKYQAQ 375 D LG TT+ ++ MK + N ++ T+ E+ T+ + G AQ Sbjct: 469 QAYDDDTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQ 528 Query: 376 NSKAHDDYVMA 386 DD +M+ Sbjct: 529 EG-CFDDQLMS 538 >gi|51|lcl|protein:vir:3295 Length: 568 # NCBI annotation: putative large subunit terminase # Family: family:all:147 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049511;genbank:gi:9632517;genbank:GeneID: 1262002 Length = 568 Score = 35.0 bits (79), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 52/191 (27%), Positives = 79/191 (41%), Gaps = 22/191 (11%) Query: 214 LREPNFGETIR-------VYEAPQEHHKYMVLADAAKGAIDGFVFHVIDVTNIPFKQVAS 266 LRE N E R V+E P +Y+ AD A+G G + V +QVA Sbjct: 352 LREGNKNELQRTLMNYLLVWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSNGEQVAH 411 Query: 267 --GKISESYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQEP--- 321 G + ++ L A + + R YN A FV G +V+ L ++Y IY E Sbjct: 412 WFGHL-DAELFA-HLISQVCRMYNNA-FVGPERNNHGHAVILKLRELYPTRYIYNEQHLD 468 Query: 322 -----DKKWLGVRTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFC-NVNGKYQAQ 375 D LG TT+ ++ MK + N ++ T+ E+ T+ + G AQ Sbjct: 469 QAYDDDTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQ 528 Query: 376 NSKAHDDYVMA 386 DD +M+ Sbjct: 529 EG-CFDDQLMS 538 >gi|26240|lcl|protein:vir:2763 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612880;genbank:gi:20065962;genbank:GeneID :935714 Length = 568 Score = 35.0 bits (79), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 52/191 (27%), Positives = 79/191 (41%), Gaps = 22/191 (11%) Query: 214 LREPNFGETIR-------VYEAPQEHHKYMVLADAAKGAIDGFVFHVIDVTNIPFKQVAS 266 LRE N E R V+E P +Y+ AD A+G G + V +QVA Sbjct: 352 LREGNKNELQRTLMNYLLVWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSNGEQVAH 411 Query: 267 --GKISESYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQEP--- 321 G + ++ L A + + R YN A FV G +V+ L ++Y IY E Sbjct: 412 WFGHL-DAELFA-HLISQVCRMYNNA-FVGPERNNHGHAVILKLRELYPTRYIYNEQHLD 468 Query: 322 -----DKKWLGVRTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFC-NVNGKYQAQ 375 D LG TT+ ++ MK + N ++ T+ E+ T+ + G AQ Sbjct: 469 QAYDDDTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQ 528 Query: 376 NSKAHDDYVMA 386 DD +M+ Sbjct: 529 EG-CFDDQLMS 538 >gi|25849|lcl|protein:vir:9949 Length: 568 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859079;genbank:gi:32171001;genbank:GeneID :2653280 Length = 568 Score = 35.0 bits (79), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 52/191 (27%), Positives = 79/191 (41%), Gaps = 22/191 (11%) Query: 214 LREPNFGETIR-------VYEAPQEHHKYMVLADAAKGAIDGFVFHVIDVTNIPFKQVAS 266 LRE N E R V+E P +Y+ AD A+G G + V +QVA Sbjct: 352 LREGNKNELQRTLMNYLLVWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSNGEQVAH 411 Query: 267 --GKISESYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQEP--- 321 G + ++ L A + + R YN A FV G +V+ L ++Y IY E Sbjct: 412 WFGHL-DAELFA-HLISQVCRMYNNA-FVGPERNNHGHAVILKLRELYPTRYIYNEQHLD 468 Query: 322 -----DKKWLGVRTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFC-NVNGKYQAQ 375 D LG TT+ ++ MK + N ++ T+ E+ T+ + G AQ Sbjct: 469 QAYDDDTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQ 528 Query: 376 NSKAHDDYVMA 386 DD +M+ Sbjct: 529 EG-CFDDQLMS 538 >gi|1476|lcl|protein:vir:104436 Length: 568 # NCBI annotation: putative terminase large subunit # Family: family:all:147 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794060;genbank:gi:116222005;genbank:GeneI D:4397501 Length = 568 Score = 35.0 bits (79), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 52/191 (27%), Positives = 79/191 (41%), Gaps = 22/191 (11%) Query: 214 LREPNFGETIR-------VYEAPQEHHKYMVLADAAKGAIDGFVFHVIDVTNIPFKQVAS 266 LRE N E R V+E P +Y+ AD A+G G + V +QVA Sbjct: 352 LREGNKNELQRTLMNYLLVWELPDPDEEYVCGADTAEGLEHGDRSSLDVVKRSNGEQVAH 411 Query: 267 --GKISESYLMAPPIFYNILRTYNEAMFVCENNEGAGTSVVDLLFQMYEYENIYQEP--- 321 G + ++ L A + + R YN A FV G +V+ L ++Y IY E Sbjct: 412 WFGHL-DAELFA-HLISQVCRMYNNA-FVGPERNNHGHAVILKLRELYPTRYIYNEQHLD 468 Query: 322 -----DKKWLGVRTTKSNRSKNLSNMKLFIENNKLILQDEPTVKELLTFC-NVNGKYQAQ 375 D LG TT+ ++ MK + N ++ T+ E+ T+ + G AQ Sbjct: 469 QAYDDDTPRLGWLTTRQSKPVLTEGMKTLLNNGISGIRWSGTLSEMNTYVYDAKGSMNAQ 528 Query: 376 NSKAHDDYVMA 386 DD +M+ Sbjct: 529 EG-CFDDQLMS 538 >gi|16146|lcl|protein:vir:10462 Length: 586 # NCBI annotation: DNA maturase B # Family: family:all:697 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848309;genbank:gi:30387500;genbank:GeneID :1733974 Length = 586 Score = 25.8 bits (55), Expect = 0.94, Method: Compositional matrix adjust. Identities = 16/75 (21%), Positives = 37/75 (49%), Gaps = 20/75 (26%) Query: 335 RSKNLSNMKL------FIENNKLILQDEPTVKELLTFCNVNGKY--------------QA 374 R++ + M++ ++ ++L+++DE + + +V+GKY + Sbjct: 448 RARGMKEMRICDTLEPVMQTHRLVIRDEVIRADYQSARDVDGKYDVKYSLFYQMTRITRE 507 Query: 375 QNSKAHDDYVMALSL 389 + + AHDD + AL+L Sbjct: 508 KGALAHDDRLDALAL 522 >gi|3734|lcl|protein:vir:94555 Length: 585 # NCBI annotation: DNA packaging protein B # Family: family:all:697 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919024;genbank:gi:119637788;genbank:GeneI D:5179315 Length = 585 Score = 25.4 bits (54), Expect = 1.3, Method: Compositional matrix adjust. 
Identities = 17/75 (22%), Positives = 38/75 (50%), Gaps = 20/75 (26%) Query: 335 RSKNLSNMKL------FIENNKLILQDEPTVKELLTFCNVNGKY--------------QA 374 R+K + M++ + ++KLI++DE ++ T +++GK+ + Sbjct: 448 RAKGMKEMRICDTIEPLMGSHKLIIRDEVIREDYQTSRDLDGKHDVRYSAFYQMTRMTRE 507 Query: 375 QNSKAHDDYVMALSL 389 + + AHDD + A++L Sbjct: 508 RGAVAHDDRLDAIAL 522 >gi|17058|lcl|protein:vir:5248 Length: 487 # NCBI annotation: putative terminase large subunit TerL # Family: family:all:144 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852753;genbank:gi:31544028;interpro:IPR00 4921;interpro:IPR006517;uniprot:Q7Y5U7;genbank:GeneID:27 53564 Length = 487 Score = 25.0 bits (53), Expect = 1.9, Method: Compositional matrix adjust. Identities = 15/52 (28%), Positives = 27/52 (51%), Gaps = 4/52 (7%) Query: 345 FIENNKLIL-QDEPTVKELLTFCNVNGKYQAQNSKAHDDYVMALSLLFVPLL 395 +IE+ ++L + P + + + C + A +S AHDD V AL + +L Sbjct: 430 YIESGYVMLPESAPWIADFINECEA---FTATDSHAHDDQVDALVMAISDIL 478 >gi|8126|lcl|protein:vir:102661 Length: 568 # NCBI annotation: terminase # Family: family:all:147 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024418;genbank:gi:48696639;genbank:GeneID :2948128 Length = 568 Score = 24.6 bits (52), Expect = 2.5, Method: Compositional matrix adjust. 
Identities = 35/121 (28%), Positives = 47/121 (38%), Gaps = 33/121 (27%) Query: 130 STPVGLNHWYKMWSDAVEGKSSYKPFKVEWWKVPGRDENYKELMIKTLEGGIRTWNQEYA 189 +TP G N +YK+ A+ + S EW+ YK L I TW Y+ Sbjct: 217 TTPRGKNWFYKL---AMHAEKSE-----EWY--------YKYLTIND------TWRWAYS 254 Query: 190 CEFIGSSDTLVDMTVLSNIKFGNILREPNFGETIRVYEAPQEHHKYMVLADAAKGAIDGF 249 E + +DTL + N G I VYE+ KY +ADA K G Sbjct: 255 SEAL-DTDTLQQAGTAT----------LNDGHVIPVYESIPTELKYRNVADAEKAIERGV 303 Query: 250 V 250 V Sbjct: 304 V 304 >gi|19292|lcl|protein:vir:4781 Length: 436 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150161;swissprot:trembl:q94m50;genbank:gi :15088772;goa:Q94M50;uniprot:Q94M50;genbank:GeneID:95597 6 Length = 436 Score = 24.3 bits (51), Expect = 2.9, Method: Compositional matrix adjust. Identities = 11/23 (47%), Positives = 15/23 (65%) Query: 338 NLSNMKLFIENNKLILQDEPTVK 360 N N K+FIE +K+ DE T+K Sbjct: 379 NTENNKIFIEEHKMYRWDEKTIK 401 >gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: terminase large subunit # Family: family:all:4877 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429607;genbank:gi:156564098;genbank:Ge neID:5525534 Length = 635 Score = 22.7 bits (47), Expect = 8.6, Method: Compositional matrix adjust. 
Identities = 12/35 (34%), Positives = 18/35 (51%)

Query: 108 SEFSNSVIPTISASKKSQIIAASTPVGLNHWYKMW 142
           SE +N +     A ++ ++I ASTP G    Y  W
Sbjct: 201 SEITNIMNIRNEAPERIKMIVASTPSGRRDSYYKW 235

  Database: capsid_neck_tail
    Posted date:  Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database:  514

Lambda     K      H
   0.318    0.136    0.405

Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 214,329
Number of Sequences: 514
Number of extensions: 10188
Number of successful extensions: 107
Number of sequences better than 100.0: 31
Number of HSP's better than 100.0 without gapping: 21
Number of HSP's successfully gapped in prelim test: 10
Number of HSP's that attempted gapping in prelim test: 25
Number of HSP's gapped (non-prelim): 32
length of query: 442
length of database: 206,069
effective HSP length: 74
effective length of query: 368
effective length of database: 168,033
effective search space: 61836144
effective search space used: 61836144
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 38 (19.2 bits)
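
Note on the statistics above (illustration only, not part of the BLAST output): the effective lengths, the effective search space, and the bit scores can be re-derived from the reported values. The Python sketch below assumes that the second Lambda/K pair (0.267, 0.0410) is the gapped parameter set and applies the standard Karlin-Altschul relations. The function names are hypothetical. E-values computed this way are only approximate for this report, because the search used compositional matrix adjustment and the printed bit scores are rounded.

```python
import math


def effective_lengths(query_len, db_len, num_seqs, hsp_len):
    """Subtract the effective HSP length from the query and from each database sequence."""
    eff_query = query_len - hsp_len
    eff_db = db_len - num_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def approx_evalue(bits, search_space):
    """Approximate E-value from a bit score: E = search_space * 2^(-S')."""
    return search_space * 2.0 ** (-bits)


# Values taken from the report footer; the gapped Lambda/K choice is an assumption.
eff_q, eff_db, space = effective_lengths(442, 206069, 514, 74)
print(eff_q, eff_db, space)                     # 368 168033 61836144
print(round(bit_score(79, 0.267, 0.0410), 1))   # 35.0, matching "35.0 bits (79)"
print(approx_evalue(35.0, space))               # about 0.0018; the report prints 0.002
```

The recomputed search space (368 x 168,033 = 61,836,144) agrees with the "effective search space" line, and raw scores 79 and 47 map to 35.0 and 22.7 bits as printed in the hit headers.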