BLASTP 2.2.18 [Mar-02-2008] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= lcl|Aclame:protein:vir:101493|NCBI_annot:gp8|genbank:acc:YP_655 387;genbank:gi:109522575;genbank:GeneID:4157565 (588 letters) Database: capsid_neck_tail 514 sequences; 206,069 total letters Searching...................................................done Score E Sequences producing significant alignments: (bits) Value gi|8220|lcl|protein:vir:101493 Length: 588 # NCBI annotation: gp... 1228 0.0 gi|9070|lcl|protein:vir:102238 Length: 588 # NCBI annotation: gp... 1225 0.0 gi|18781|lcl|protein:vir:7429 Length: 581 # NCBI annotation: gp6... 463 e-132 gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: te... 77 7e-16 gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: ph... 77 7e-16 gi|8126|lcl|protein:vir:102661 Length: 568 # NCBI annotation: te... 57 4e-10 gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: t... 50 9e-08 gi|1165|lcl|protein:vir:102941 Length: 421 # NCBI annotation: te... 35 0.003 gi|6886|lcl|protein:vir:106551 Length: 424 # NCBI annotation: pu... 33 0.007 gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: ter... 28 0.39 gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp... 28 0.39 gi|11160|lcl|protein:vir:78394 Length: 423 # NCBI annotation: hy... 27 0.52 gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: ter... 26 1.5 gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: te... 25 2.0 gi|5891|lcl|protein:vir:107098 Length: 421 # NCBI annotation: pu... 25 2.8 gi|8015|lcl|protein:vir:96495 Length: 195 # NCBI annotation: ter... 25 2.9 gi|22942|lcl|protein:vir:101803 Length: 613 # NCBI annotation: g... 25 3.3 gi|23191|lcl|protein:vir:101186 Length: 613 # NCBI annotation: t... 25 3.5 gi|21077|lcl|protein:vir:100538 Length: 612 # NCBI annotation: g... 25 3.5 gi|24596|lcl|protein:vir:98262 Length: 609 # NCBI annotation: gp... 24 4.7 gi|28182|lcl|protein:vir:108053 Length: 611 # NCBI annotation: g... 23 7.2 gi|27407|lcl|protein:vir:7202 Length: 610 # NCBI annotation: gp1... 23 7.3 gi|22056|lcl|protein:vir:103455 Length: 610 # NCBI annotation: t... 23 7.3 gi|22578|lcl|protein:vir:95547 Length: 605 # NCBI annotation: OR... 23 7.3 gi|9399|lcl|protein:vir:99333 Length: 605 # NCBI annotation: hyp... 23 7.3 gi|27701|lcl|protein:vir:6893 Length: 611 # NCBI annotation: gp1... 23 7.5 gi|15201|lcl|protein:vir:7028 Length: 631 # NCBI annotation: lar... 23 8.3 >gi|8220|lcl|protein:vir:101493 Length: 588 # NCBI annotation: gp8 # Family: family:all:147 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655387;genbank:gi:109522575;genbank:GeneI D:4157565 Length = 588 Score = 1228 bits (3178), Expect = 0.0, Method: Compositional matrix adjust. Identities = 588/588 (100%), Positives = 588/588 (100%) Query: 1 MTSTANHAAAQLDRWQIYDTEIEVTTERGRERRKVWDPHPGQLAIEESPARNKVATCGRR 60 MTSTANHAAAQLDRWQIYDTEIEVTTERGRERRKVWDPHPGQLAIEESPARNKVATCGRR Sbjct: 1 MTSTANHAAAQLDRWQIYDTEIEVTTERGRERRKVWDPHPGQLAIEESPARNKVATCGRR 60 Query: 61 FGKSDLGGKRLVPPAFLAYYRQDLLRTTNKAMIYWIVGPEYSDAEKEFRVLWNTLVSLGV 120 FGKSDLGGKRLVPPAFLAYYRQDLLRTTNKAMIYWIVGPEYSDAEKEFRVLWNTLVSLGV Sbjct: 61 FGKSDLGGKRLVPPAFLAYYRQDLLRTTNKAMIYWIVGPEYSDAEKEFRVLWNTLVSLGV 120 Query: 121 PFDKPGSYYDAVGGNMHLSLWQGAYQVHAKSAKYPDTLVGEGLSGVIMAEAAKQKPSVWT 180 PFDKPGSYYDAVGGNMHLSLWQGAYQVHAKSAKYPDTLVGEGLSGVIMAEAAKQKPSVWT Sbjct: 121 PFDKPGSYYDAVGGNMHLSLWQGAYQVHAKSAKYPDTLVGEGLSGVIMAEAAKQKPSVWT 180 Query: 181 KHVRPTLGDFTGWSLHTSTPEGKNHFHDKFQMGQDPNNPEWESWRMPSWANPYVYTRTGR 240 KHVRPTLGDFTGWSLHTSTPEGKNHFHDKFQMGQDPNNPEWESWRMPSWANPYVYTRTGR Sbjct: 181 KHVRPTLGDFTGWSLHTSTPEGKNHFHDKFQMGQDPNNPEWESWRMPSWANPYVYTRTGR 240 Query: 241 LIALGKLPPETPIPDHEYTLDRHVTIMQQLMRENPTVSPFKIVADHKLRVDQEVIELAAD 300 LIALGKLPPETPIPDHEYTLDRHVTIMQQLMRENPTVSPFKIVADHKLRVDQEVIELAAD Sbjct: 241 LIALGKLPPETPIPDHEYTLDRHVTIMQQLMRENPTVSPFKIVADHKLRVDQEVIELAAD 300 Query: 301 MSIESFNQEIGADFTEYVGKVFKDWDEEYHVADLVDYNVKQFGTYFNPNYETYAAADYGY 360 MSIESFNQEIGADFTEYVGKVFKDWDEEYHVADLVDYNVKQFGTYFNPNYETYAAADYGY Sbjct: 301 MSIESFNQEIGADFTEYVGKVFKDWDEEYHVADLVDYNVKQFGTYFNPNYETYAAADYGY 360 Query: 361 TNPNVWLVIQIGKWGEVNVLREIYMPGLTADAFADEIRRQMCNPPNLRIFYPDPADPMSS 420 TNPNVWLVIQIGKWGEVNVLREIYMPGLTADAFADEIRRQMCNPPNLRIFYPDPADPMSS Sbjct: 361 TNPNVWLVIQIGKWGEVNVLREIYMPGLTADAFADEIRRQMCNPPNLRIFYPDPADPMSS 420 Query: 421 ETLSQKLGIRAAGGTGGEKRIRLNLIRQFLKRGRNDPTDLTGMGWRPQLMFDRSCANARR 480 ETLSQKLGIRAAGGTGGEKRIRLNLIRQFLKRGRNDPTDLTGMGWRPQLMFDRSCANARR Sbjct: 421 ETLSQKLGIRAAGGTGGEKRIRLNLIRQFLKRGRNDPTDLTGMGWRPQLMFDRSCANARR 480 Query: 481 EMEAYRYPDSTDKPNPSSLIYEEPLKKDDHVPEALGRFFVGHYGEKALVNERAGTRISKA 540 EMEAYRYPDSTDKPNPSSLIYEEPLKKDDHVPEALGRFFVGHYGEKALVNERAGTRISKA Sbjct: 481 EMEAYRYPDSTDKPNPSSLIYEEPLKKDDHVPEALGRFFVGHYGEKALVNERAGTRISKA 540 Query: 541 NIEHQRTNRERRSTNAPKDIKYQRLTAGEPDAYQYKNPDVGWGKELLD 588 NIEHQRTNRERRSTNAPKDIKYQRLTAGEPDAYQYKNPDVGWGKELLD Sbjct: 541 NIEHQRTNRERRSTNAPKDIKYQRLTAGEPDAYQYKNPDVGWGKELLD 588 >gi|9070|lcl|protein:vir:102238 Length: 588 # NCBI annotation: gp8 # Family: family:all:147 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655204;genbank:gi:109522784;genbank:GeneI D:4157477 Length = 588 Score = 1225 bits (3169), Expect = 0.0, Method: Compositional matrix adjust. Identities = 587/588 (99%), Positives = 587/588 (99%) Query: 1 MTSTANHAAAQLDRWQIYDTEIEVTTERGRERRKVWDPHPGQLAIEESPARNKVATCGRR 60 MTSTANHAAAQLDRWQIYDTEIEVTTERGRERRKVWDPHPGQLAIEESPARNKVATCGRR Sbjct: 1 MTSTANHAAAQLDRWQIYDTEIEVTTERGRERRKVWDPHPGQLAIEESPARNKVATCGRR 60 Query: 61 FGKSDLGGKRLVPPAFLAYYRQDLLRTTNKAMIYWIVGPEYSDAEKEFRVLWNTLVSLGV 120 FGKSDLGGKRLVPPAFLAYYRQDLLRTTNKAMIYWIVGPEYSDAEKEFRVLWNTLVSLGV Sbjct: 61 FGKSDLGGKRLVPPAFLAYYRQDLLRTTNKAMIYWIVGPEYSDAEKEFRVLWNTLVSLGV 120 Query: 121 PFDKPGSYYDAVGGNMHLSLWQGAYQVHAKSAKYPDTLVGEGLSGVIMAEAAKQKPSVWT 180 PFDKPGSYYDAVGGNMHLSLWQGAYQVHAKSAKYPDTLVGEGLSGVIMAEAAKQKPSVWT Sbjct: 121 PFDKPGSYYDAVGGNMHLSLWQGAYQVHAKSAKYPDTLVGEGLSGVIMAEAAKQKPSVWT 180 Query: 181 KHVRPTLGDFTGWSLHTSTPEGKNHFHDKFQMGQDPNNPEWESWRMPSWANPYVYTRTGR 240 KHVRPTLGDFTGWSLHTSTPEGKNHFHDKFQMGQDPNNPEWESWRMPSWANPYVYTRTGR Sbjct: 181 KHVRPTLGDFTGWSLHTSTPEGKNHFHDKFQMGQDPNNPEWESWRMPSWANPYVYTRTGR 240 Query: 241 LIALGKLPPETPIPDHEYTLDRHVTIMQQLMRENPTVSPFKIVADHKLRVDQEVIELAAD 300 LIALGKLPPETPIPDHEYTLDRHVTIMQQLMRENPTVSPFKIVADHKLRVDQEVIELAAD Sbjct: 241 LIALGKLPPETPIPDHEYTLDRHVTIMQQLMRENPTVSPFKIVADHKLRVDQEVIELAAD 300 Query: 301 MSIESFNQEIGADFTEYVGKVFKDWDEEYHVADLVDYNVKQFGTYFNPNYETYAAADYGY 360 MSIESFNQEIGADFTEYVGKVFKDWDEEYHVADLVDYNVKQFGTYFNPNYETYAAADYGY Sbjct: 301 MSIESFNQEIGADFTEYVGKVFKDWDEEYHVADLVDYNVKQFGTYFNPNYETYAAADYGY 360 Query: 361 TNPNVWLVIQIGKWGEVNVLREIYMPGLTADAFADEIRRQMCNPPNLRIFYPDPADPMSS 420 TNPNVWLVIQIGKWGEVNVLREIYMPGLTADAFADEIRRQMCNPPNLRIFYPDPADPMSS Sbjct: 361 TNPNVWLVIQIGKWGEVNVLREIYMPGLTADAFADEIRRQMCNPPNLRIFYPDPADPMSS 420 Query: 421 ETLSQKLGIRAAGGTGGEKRIRLNLIRQFLKRGRNDPTDLTGMGWRPQLMFDRSCANARR 480 ETLSQKLGIRAAGGTGGEKRIRLNLIRQFLKRGRNDPTDLTGMGWRPQLMFDRSCANARR Sbjct: 421 ETLSQKLGIRAAGGTGGEKRIRLNLIRQFLKRGRNDPTDLTGMGWRPQLMFDRSCANARR 480 Query: 481 EMEAYRYPDSTDKPNPSSLIYEEPLKKDDHVPEALGRFFVGHYGEKALVNERAGTRISKA 540 EMEAYRYPDSTDKPNPSSLIYEEPLKKDDHVPEALGRFFVGHYGEKALVNERAGTRISKA Sbjct: 481 EMEAYRYPDSTDKPNPSSLIYEEPLKKDDHVPEALGRFFVGHYGEKALVNERAGTRISKA 540 Query: 541 NIEHQRTNRERRSTNAPKDIKYQRLTAGEPDAYQYKNPDVGWGKELLD 588 NIEHQRTNRERRSTNAPKDIKYQRLT GEPDAYQYKNPDVGWGKELLD Sbjct: 541 NIEHQRTNRERRSTNAPKDIKYQRLTDGEPDAYQYKNPDVGWGKELLD 588 >gi|18781|lcl|protein:vir:7429 Length: 581 # NCBI annotation: gp6 # Family: family:all:147 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818544;genbank:gi:29566981;genbank:GeneID :1260231 Length = 581 Score = 463 bits (1191), Expect = e-132, Method: Compositional matrix adjust. Identities = 254/556 (45%), Positives = 332/556 (59%), Gaps = 27/556 (4%) Query: 3 STANHAAAQLDRWQIYDTEIEVTTERGRERRKVWDPHPGQLAIEESPARNKVATCGRRFG 62 ST + +W D E+ + +R RKV+DPH GQL E A+ ATCGRR G Sbjct: 2 STEAEYVEYVSKWTYLDAEV-LPDKRSGGFRKVFDPHSGQLEFMEDDAQYLCATCGRRMG 60 Query: 63 KSDLGGKRLVPPAFLAYYRQDLLRTTNKAMIYWIVGPEYSDAEKEFRVLWNTLVSLGVPF 122 KS +P A + L K +W VGP YSDAEK FRV WN +LG+PF Sbjct: 61 KSAGIAHEFIPEAMITKEMATTLLDDGKRREFWTVGPNYSDAEKPFRVFWNKCRALGIPF 120 Query: 123 DKPGSYYDAVGGNMHLSLWQGAYQVHAKSAKYPDTLVGEGLSGVIMAEAAKQKPSVWTKH 182 DKPG+Y+D GG+M +SLW GA+ AKS+ P+ LVGEGL+GV M EAAKQK VW + Sbjct: 121 DKPGTYFDIKGGDMTVSLWDGAFIYSAKSSAVPERLVGEGLTGVHMEEAAKQKEVVWKQM 180 Query: 183 VRPTLGDFTGWSLHTSTPEGKNHFHDKFQMGQDPNNPEWESWRMPSWANPYVYTRTGRLI 242 + PTL DF GW+ T+TPEGKN ++D Q P+ W + R+PSW NP+VYT TGRLI Sbjct: 181 IMPTLMDFGGWAKFTTTPEGKNWYYDLHQKALRPSTLNWSAHRIPSWRNPHVYTETGRLI 240 Query: 243 ALGKLPPETPIPDHEYTLDRHVTIMQQLMRENPTVSPFKIVADHKLRVDQEVIELAADMS 302 A GKLP +TPIP HE+T+D HV + LM ENP + F+I +L++D V++LA D + Sbjct: 241 AAGKLPKDTPIPPHEFTIDAHVKRLMYLMAENPGYTSFEIAKSERLQIDSGVLQLANDQT 300 Query: 303 IESFNQEIGADFTEYVGKVFKDWDEEYHVADLVDYNVKQFGTYFNPNYETYAAADYGYTN 362 I FNQEI A+FT++VGKVFK++DE+ HV +LV YN Q ++ET AA DYGY N Sbjct: 301 IPEFNQEIAAEFTDFVGKVFKEYDEDTHVRELV-YNPSQ-------DWETIAAVDYGYRN 352 Query: 363 PNVWLVIQIGKWGEVNVLREIYMPGLTADAFADEIRRQMCNPPNLRIFYPDPADPMSS-- 420 PNVWL+IQIG WGE+N++ E+Y LT FA+EI R+ P L FY DPA P +S Sbjct: 353 PNVWLLIQIGPWGEINIVDELYQADLTPTEFANEILRRGLCPDTLHSFYADPAAPEASRT 412 Query: 421 -ETLSQKLGIRAAG--GTGGEKRIRLNLIRQFLKRGRNDPTDLTGMGW----------RP 467 ET+ ++ G RA TGG+ RLNLIR F + R +++ W RP Sbjct: 413 LETIFRQHGKRARSRPHTGGDIDNRLNLIR-FALKDRIVDAEMSAPSWFQAGASQDVRRP 471 Query: 468 QLMFDRSCANARREMEAYRYPDSTDKPNPSSLI-YEEPLKKDDHVPEALGRFFVGHYGEK 526 ++M C E YRYP + D+ +S YE P+K +DH PEA+GRF G Y Sbjct: 472 RMMISTRCPKTIFEFGEYRYPKTKDEQTETSTKRYETPMKLNDHTPEAIGRFLGGMYHAV 531 Query: 527 ALVNERAGTRISKANI 542 A GTR+++ Sbjct: 532 A-AQMGGGTRVTRGQF 546 >gi|10766|lcl|protein:vir:78016 Length: 485 # NCBI annotation: terminase large subunit # Family: family:all:147 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467938;genbank:gi:157265379;genbank:Ge neID:5600506 Length = 485 Score = 76.6 bits (187), Expect = 7e-16, Method: Compositional matrix adjust. Identities = 66/218 (30%), Positives = 91/218 (41%), Gaps = 35/218 (16%) Query: 36 WDPHPGQLAIEESPARNKVATCGRRFGKSDLGGKRLVPPAFLAYYRQDLLRTTNKAMIYW 95 + PH QLAI S A+ +VA GR+ GKS+ V F Q W Sbjct: 16 YKPHHVQLAIHRSTAKRRVACLGRQSGKSEAASVEAVFELFARPGSQG-----------W 64 Query: 96 IVGPEYSDAEKEFRVLWNTLVSLGVPFDKPGSYYDAVGGNMHLSLWQ------GAYQV-- 147 I+ P Y AE F + + L F + + + GA +V Sbjct: 65 IIAPTYDQAEIIFGRVVEKVERLAEVFPATEVQLQRRRLRLLVHHYDRPVNAPGAKRVAT 124 Query: 148 ---HAKSAKYPDTLVGEGLSGVIMAEAAKQKPSVWTKHVRPTLGDFTGWSLHTSTPEGKN 204 KSA PD L G L VI+ EAA SVW++ + PTL GW+L STP+G N Sbjct: 125 SEFRGKSADRPDNLRGATLDFVILDEAAMIPFSVWSEAIEPTLSVRDGWALIISTPKGLN 184 Query: 205 HFHDKFQM-------------GQDPNNPEWESWRMPSW 229 F++ F M G + +P++ES+ SW Sbjct: 185 WFYEFFLMGWRGGLKEGIPNSGVNQTHPDFESFHAASW 222 >gi|12256|lcl|protein:vir:79447 Length: 485 # NCBI annotation: phage terminase large subunit # Family: family:all:147 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468054;genbank:gi:157265496;genbank:Ge neID:5600564 Length = 485 Score = 76.6 bits (187), Expect = 7e-16, Method: Compositional matrix adjust. Identities = 66/218 (30%), Positives = 91/218 (41%), Gaps = 35/218 (16%) Query: 36 WDPHPGQLAIEESPARNKVATCGRRFGKSDLGGKRLVPPAFLAYYRQDLLRTTNKAMIYW 95 + PH QLAI S A+ +VA GR+ GKS+ V F Q W Sbjct: 16 YKPHHVQLAIHRSTAKRRVACLGRQSGKSEAASVEAVFELFARPGSQG-----------W 64 Query: 96 IVGPEYSDAEKEFRVLWNTLVSLGVPFDKPGSYYDAVGGNMHLSLWQ------GAYQV-- 147 I+ P Y AE F + + L F + + + GA +V Sbjct: 65 IIAPTYDQAEIIFGRVVEKVERLAEVFPATEVQLQRRRLRLLVHHYDRPVNAPGAKRVAT 124 Query: 148 ---HAKSAKYPDTLVGEGLSGVIMAEAAKQKPSVWTKHVRPTLGDFTGWSLHTSTPEGKN 204 KSA PD L G L VI+ EAA SVW++ + PTL GW+L STP+G N Sbjct: 125 SEFRGKSADRPDNLRGATLDFVILDEAAMIPFSVWSEAIEPTLSVRDGWALIISTPKGLN 184 Query: 205 HFHDKFQM-------------GQDPNNPEWESWRMPSW 229 F++ F M G + +P++ES+ SW Sbjct: 185 WFYEFFLMGWRGGLKEGIPNSGINQTHPDFESFHAASW 222 >gi|8126|lcl|protein:vir:102661 Length: 568 # NCBI annotation: terminase # Family: family:all:147 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024418;genbank:gi:48696639;genbank:GeneID :2948128 Length = 568 Score = 57.4 bits (137), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 93/379 (24%), Positives = 158/379 (41%), Gaps = 63/379 (16%) Query: 32 RRKVWDPHPGQLAIEESPA--RNKVATCGRRFGKSDLGGKRLVPPAFLAYYRQDLLRTTN 89 RRK + G+ E P+ + + RR GK D+G L+ A LR N Sbjct: 63 RRKNEALNKGEEFNEPEPSWRKKAIQVLHRRAGK-DIGALHLIAIA-------SQLRVGN 114 Query: 90 KAMIYWIVGPEYSDAEKEFRVLWNTLVSLGVPFDK---PGSYYDAVG-GNMHLSLWQGA- 144 Y + P + A +W+ + +LG F + P +++ M + G+ Sbjct: 115 ----YKHILPYKTQARD---AIWDGIDALGNRFIRNAFPDEIVESINESRMLVRFTNGST 167 Query: 145 YQVHAKSAKYPDTLVGEGLSGVIMAEAAKQKPSVWTKHVRPTLGDFTGWSLHTSTPEGKN 204 YQ+ + D LVG G G++ +E+A P+V T +RP L + GW LH +TP GKN Sbjct: 168 YQLQGGDS---DKLVGAGPVGIVYSESALMSPNVRT-FLRPMLDETGGWELHITTPRGKN 223 Query: 205 HFHDKFQMGQDPNNPEW--------ESWRMPSWANPYVYTRTGRLIALGKLPPETPIPDH 256 F+ K M + + EW ++WR ++++ + T T + L IP + Sbjct: 224 WFY-KLAMHAE-KSEEWYYKYLTINDTWRW-AYSSEALDTDTLQQAGTATLNDGHVIPVY 280 Query: 257 EYTLDRHVTIMQQLMRENPTVSPFKIVADHKLRVDQEVIE--LAADMSIESFNQEI---G 311 E PT ++ VAD + +++ V++ A + E Q + G Sbjct: 281 ESI---------------PTELKYRNVADAEKAIERGVVKGMYAVRIMTERMVQSLIDEG 325 Query: 312 ADFTEYVGKVFKDWD---EEYHVADLVD--YNVKQFGTY-FNPNYETYAAADYGYTNPNV 365 D + + DWD + + DL+ YN + G + NPN Y D G+ + Sbjct: 326 QDPFIVRQEYYCDWDVALQGSYYGDLMITMYNTGRIGKFPHNPNRPVYVHMDIGFNDSTS 385 Query: 366 WLVIQIGKWGEVNVLREIY 384 Q G G+ ++ ++ Sbjct: 386 ITFTQEGPMGQGVIIDHLW 404 >gi|21526|lcl|protein:vir:104262 Length: 438 # NCBI annotation: terminase, large subunit # Family: family:all:147 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006983;genbank:gi:46401884;genbank:GeneID :2777679 Length = 438 Score = 49.7 bits (117), Expect = 9e-08, Method: Compositional matrix adjust. Identities = 77/332 (23%), Positives = 116/332 (34%), Gaps = 85/332 (25%) Query: 44 AIEESPARNKVATCGRRFGKSDLGGKRLVPPAFLAYYRQDLLRTTNKAMIYWIVGPEYSD 103 A+E+ R A RR GKS F+AY L+ + +V P YS Sbjct: 48 ALEDPRHRFVTACVSRRVGKS-----------FIAY-TLGFLKLLEPNVKVLVVAPNYSL 95 Query: 104 AEKEFRVLWNTLVSLGVPFDKPGSYYDAVGGNMHLSLWQGAYQVHAKSAKYPDTLVGEGL 163 A + + + G+ ++ + + + L G+ A +A+ D+ VG Sbjct: 96 ANIGWSQIRGLIKKYGLQTERENA------KDKEIELANGSLFKLASAAQ-ADSAVGRSY 148 Query: 164 SGVIMAEAAKQKPS--VWTKHVRPTLGDFTGWSLHTSTPEGKNHFHDKFQMGQDPNNPEW 221 +I EAA + +RPTL +L STP G N F + + G D P W Sbjct: 149 DFIIFDEAAISDVGGDAFRVQLRPTLDKPNSKALFISTPRGGNWFKEFYAYGFDDTLPNW 208 Query: 222 ESWRMPSWANPYVYTRTGRLIALGKLPPETPIPDHEYTLDRHVTIMQQLMRENPTVSPFK 281 S + R+NP Sbjct: 209 VS-------------------------------------------IHGTYRDNP------ 219 Query: 282 IVADHKLRVDQEVIELA-ADMSIESFNQEIGADFTEYVGKVFKDWDEEYHVADLVDYNVK 340 R D IE A +S F QE ADF+ + G++F ++ HV DL K Sbjct: 220 -------RADLNDIEEARRTVSKNYFRQEYEADFSVFEGQIFDTFNAIDHVKDL-----K 267 Query: 341 QFGTYFNPN--YETYAAADYGYTNPNVWLVIQ 370 +F + +ET D GY +P L I+ Sbjct: 268 GMRHFFKDDEAFETLLGIDVGYRDPTAVLTIK 299 >gi|1165|lcl|protein:vir:102941 Length: 421 # NCBI annotation: terminase, large subunit # Family: family:all:54 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945278;genbank:gi:39653713;goa:Q708N4;int erpro:IPR004921;interpro:IPR006437;uniprot:Q708N4;genban k:GeneID:2672855 Length = 421 Score = 34.7 bits (78), Expect = 0.003, Method: Compositional matrix adjust. Identities = 39/178 (21%), Positives = 70/178 (39%), Gaps = 37/178 (20%) Query: 353 YAAADYGYTNPNVWLVIQIGKWGEVNVLREIYMPGL------TADAFADEIRRQMCNPPN 406 Y + DYG N V+L+ + G+ + RE Y G T +AD++ + + Sbjct: 262 YVSVDYGTQNATVFLLWEKDIIGKYYLTREYYYSGRDENVQKTNAEYADDLTAWLGDTNI 321 Query: 407 LRIFYPDPADPMSSETLSQKLGIRAAGGTGGEKRIRLNLIRQFLKRGRNDPTD----LTG 462 RI A +E +K G + +K+ RN+ + + Sbjct: 322 DRIIIDPSAASFIAEL--KKRGYK-------------------IKKARNNVLEGIRFVGS 360 Query: 463 MGWRPQLMFDRSCANARREMEAYRYPDSTDKPNPSSLIYEEPLKKDDHVPEALGRFFV 520 M + ++ SC N +E AY + + ++P+K+ DH +AL R+F Sbjct: 361 MLGQEKIAVHESCVNTLKEFHAYVWDEKASANGE-----DKPIKQFDHAMDAL-RYFC 412 >gi|6886|lcl|protein:vir:106551 Length: 424 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958579;genbank:gi:41179257;genbank:GeneID :2717087 Length = 424 Score = 33.5 bits (75), Expect = 0.007, Method: Compositional matrix adjust. Identities = 45/197 (22%), Positives = 77/197 (39%), Gaps = 65/197 (32%) Query: 348 PNY--ETYAAADYGYTNPNVWLVIQIGKWGEVN----VLREIYMPGLTAD---------- 391 PN+ + Y + DYG NP +L+ WG + +++E Y G T Sbjct: 248 PNHFEKYYVSCDYGTLNPTAFLL-----WGRNHGVWYLVKEYYYSGRTTSRQKTDEEYCH 302 Query: 392 ---AFADEIRRQMCNPPNLRIFYPDPADPMSSETLSQKLGIRAAGGTGGEKRIRLNLIRQ 448 F +IR +M DP+ S TL Q G + Sbjct: 303 DLKEFLGDIRAEMI---------IDPSAASFSTTLRQN-GFK------------------ 334 Query: 449 FLKRGRNDPTD-----LTGMGWRPQLMFDRSCANARREMEAYRYPDSTDKPNPSSLIYEE 503 +++ +ND D T M ++ F +C N +E+ +Y + D + ++ Sbjct: 335 -VRKAKNDVLDGIRVTQTAMN-EGKIKFSMNCPNLFKELASYVWDDKAAEHGE-----DK 387 Query: 504 PLKKDDHVPEALGRFFV 520 P+K+ DH +A+ R+FV Sbjct: 388 PVKQHDHACDAM-RYFV 403 >gi|27123|lcl|protein:vir:6593 Length: 607 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891724;genbank:gi:33620526;genbank:GeneID :1725332 Length = 607 Score = 27.7 bits (60), Expect = 0.39, Method: Compositional matrix adjust. Identities = 20/68 (29%), Positives = 31/68 (45%), Gaps = 6/68 (8%) Query: 152 AKYPDTLVGEGLSGVIMAEAAKQKPSVWTKH---VRPTLGD-FTGWSLHTSTPEGKNHFH 207 A PD + G S + + E A + WT ++P + + T+TP G NHF+ Sbjct: 236 ASSPDAVRGNSFSFIYIDECAFIQN--WTDCFLAIQPVISSGRESKMIMTTTPNGLNHFY 293 Query: 208 DKFQMGQD 215 D +Q D Sbjct: 294 DIWQSAID 301 >gi|25348|lcl|protein:vir:80989 Length: 607 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469498;genbank:gi:157311455;genbank:Ge neID:5602125 Length = 607 Score = 27.7 bits (60), Expect = 0.39, Method: Compositional matrix adjust. Identities = 20/68 (29%), Positives = 31/68 (45%), Gaps = 6/68 (8%) Query: 152 AKYPDTLVGEGLSGVIMAEAAKQKPSVWTKH---VRPTLGD-FTGWSLHTSTPEGKNHFH 207 A PD + G S + + E A + WT ++P + + T+TP G NHF+ Sbjct: 236 ASSPDAVRGNSFSFIYIDECAFIQN--WTDCFLAIQPVISSGRESKMIMTTTPNGLNHFY 293 Query: 208 DKFQMGQD 215 D +Q D Sbjct: 294 DIWQSAID 301 >gi|11160|lcl|protein:vir:78394 Length: 423 # NCBI annotation: hypothetical protein # Family: family:all:147 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110830;genbank:gi:134288591;genbank:Ge neID:5179657 Length = 423 Score = 27.3 bits (59), Expect = 0.52, Method: Compositional matrix adjust. Identities = 16/53 (30%), Positives = 30/53 (56%), Gaps = 6/53 (11%) Query: 198 STPEGKNHFHDKFQMGQDPNNPEWESWRMPSWANPYV---YTRTGRLIALGKL 247 +TPEG HD++ + + NP +E + P+ +NP++ Y ++ R G+L Sbjct: 166 TTPEGFRFVHDRWVVKK---NPGYEMIQAPTTSNPFLPEDYVQSLRDTYPGRL 215 >gi|26943|lcl|protein:vir:5662 Length: 600 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899601;genbank:gi:34419588;genbank:GeneID :2546011 Length = 600 Score = 25.8 bits (55), Expect = 1.5, Method: Compositional matrix adjust. Identities = 13/32 (40%), Positives = 16/32 (50%), Gaps = 3/32 (9%) Query: 197 TSTPEGKNHFHDKFQ---MGQDPNNPEWESWR 225 TSTP G NH+HD + G P +WR Sbjct: 271 TSTPNGLNHYHDMWNAAVQGISTFEPYTTTWR 302 >gi|24686|lcl|protein:vir:79769 Length: 635 # NCBI annotation: terminase large subunit # Family: family:all:4877 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429607;genbank:gi:156564098;genbank:Ge neID:5525534 Length = 635 Score = 25.4 bits (54), Expect = 2.0, Method: Compositional matrix adjust. Identities = 25/92 (27%), Positives = 41/92 (44%), Gaps = 12/92 (13%) Query: 239 GRLIALGKLPPETPIPDHEYTLDRHVTIMQQLMREN---PTVSPFKIVADHKLRVDQE-- 293 GR+ K+ P + D Y L++ + MRE T+ V L+V+ + Sbjct: 255 GRVTYNVKMKPWKTLADGSYELNQ----LGDKMREGNGWTTIHAPSTVNPELLKVNPDTG 310 Query: 294 ---VIELAADMSIESFNQEIGADFTEYVGKVF 322 + EL D++ F QE+ A+F E + VF Sbjct: 311 LTYIEELRLDLTEMRFIQEVMAEFGESISGVF 342 >gi|5891|lcl|protein:vir:107098 Length: 421 # NCBI annotation: putative large terminase subunit # Family: family:all:54 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950600;genbank:gi:119953680;genbank:GeneI D:4643107 Length = 421 Score = 25.0 bits (53), Expect = 2.8, Method: Compositional matrix adjust. Identities = 22/83 (26%), Positives = 38/83 (45%), Gaps = 5/83 (6%) Query: 355 AADYGY-TNPNVWLVIQIGKWGEV-NVLREIYMPGLTADAFADEIRRQMCNPPNLRIFYP 412 A D+GY T+P ++ K + + E Y ++ FA+ ++R+ + Y Sbjct: 263 AVDFGYATDPLAFVRWHYDKKKRIIYAVDEHYGVQISNREFANWLKRRGYQSDEI---YA 319 Query: 413 DPADPMSSETLSQKLGIRAAGGT 435 D A+P S L Q+ GI+ G Sbjct: 320 DSAEPKSIAELKQEHGIKRIKGV 342 >gi|8015|lcl|protein:vir:96495 Length: 195 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238487;genbank:gi:66391763;genbank:GeneID :5176917 Length = 195 Score = 24.6 bits (52), Expect = 2.9, Method: Compositional matrix adjust. Identities = 24/99 (24%), Positives = 41/99 (41%), Gaps = 12/99 (12%) Query: 319 GKVFKDWDEEYHVADLVDYNVKQFGTYFNPNYETYAAADYGYTNPNVWLVIQIGKWGEVN 378 G ++ D+D + HV D + + FG D+GYT+ +V+ G G Sbjct: 8 GAIYADYDSKIHVVDELPEMKRCFG-----------GIDWGYTHYGSIVVVGEGVDGNFY 56 Query: 379 VLREIYMPGLTADAFADEIRRQMCNPPNLRIFYPDPADP 417 +L + D + ++ R+ N+ FY D A P Sbjct: 57 LLDGVAAQFKEIDWWVEQARKLTGIYRNIP-FYADSARP 94 >gi|22942|lcl|protein:vir:101803 Length: 613 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238880;genbank:gi:66391955;genbank:GeneID :3416630 Length = 613 Score = 24.6 bits (52), Expect = 3.3, Method: Compositional matrix adjust. Identities = 10/24 (41%), Positives = 15/24 (62%) Query: 195 LHTSTPEGKNHFHDKFQMGQDPNN 218 L T+TP G NH++D + PN+ Sbjct: 274 LMTTTPNGLNHWYDIWTAAITPNS 297 >gi|23191|lcl|protein:vir:101186 Length: 613 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932508;genbank:gi:37651634;genbank:GeneID :2610679 Length = 613 Score = 24.6 bits (52), Expect = 3.5, Method: Compositional matrix adjust. Identities = 10/24 (41%), Positives = 15/24 (62%) Query: 195 LHTSTPEGKNHFHDKFQMGQDPNN 218 L T+TP G NH++D + PN+ Sbjct: 274 LMTTTPNGLNHWYDIWTAAITPNS 297 >gi|21077|lcl|protein:vir:100538 Length: 612 # NCBI annotation: gp17 terminase subunit # Family: family:all:147 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656379;genbank:gi:109290130;genbank:GeneI D:4156516 Length = 612 Score = 24.6 bits (52), Expect = 3.5, Method: Compositional matrix adjust. Identities = 10/24 (41%), Positives = 15/24 (62%) Query: 195 LHTSTPEGKNHFHDKFQMGQDPNN 218 L T+TP G NH++D + PN+ Sbjct: 273 LMTTTPNGLNHWYDIWTAAITPNS 296 >gi|24596|lcl|protein:vir:98262 Length: 609 # NCBI annotation: gp17 terminase subunit, nuclease and ATPase # Family: family:all:147 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239195;genbank:gi:66391670;genbank:GeneID :3416364 Length = 609 Score = 24.3 bits (51), Expect = 4.7, Method: Compositional matrix adjust. Identities = 9/14 (64%), Positives = 11/14 (78%) Query: 195 LHTSTPEGKNHFHD 208 L T+TP G NHF+D Sbjct: 281 LITTTPNGLNHFYD 294 >gi|28182|lcl|protein:vir:108053 Length: 611 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595293;genbank:gi:161622599;genbank:Ge neID:5783772 Length = 611 Score = 23.5 bits (49), Expect = 7.2, Method: Compositional matrix adjust. Identities = 8/12 (66%), Positives = 10/12 (83%) Query: 197 TSTPEGKNHFHD 208 T+TP G NHF+D Sbjct: 285 TTTPNGLNHFYD 296 >gi|27407|lcl|protein:vir:7202 Length: 610 # NCBI annotation: gp17 terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049776;genbank:gi:9632591;genbank:GeneID: 1258647 Length = 610 Score = 23.5 bits (49), Expect = 7.3, Method: Compositional matrix adjust. Identities = 12/33 (36%), Positives = 17/33 (51%), Gaps = 6/33 (18%) Query: 197 TSTPEGKNHFHDKF------QMGQDPNNPEWES 223 T+TP G NHF+D + + G +P W S Sbjct: 285 TTTPNGLNHFYDIWTAAVEGKSGFEPYTAIWNS 317 >gi|22056|lcl|protein:vir:103455 Length: 610 # NCBI annotation: terminase subunit nuclease and ATPase # Family: family:all:147 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803107;genbank:gi:116326387;genbank:GeneI D:4405484 Length = 610 Score = 23.5 bits (49), Expect = 7.3, Method: Compositional matrix adjust. Identities = 12/33 (36%), Positives = 17/33 (51%), Gaps = 6/33 (18%) Query: 197 TSTPEGKNHFHDKF------QMGQDPNNPEWES 223 T+TP G NHF+D + + G +P W S Sbjct: 285 TTTPNGLNHFYDIWTAAVEGKSGFEPYTAIWNS 317 >gi|22578|lcl|protein:vir:95547 Length: 605 # NCBI annotation: ORF010 # Family: family:all:1430 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240892;genbank:gi:66394959;genbank:GeneID :5132488 Length = 605 Score = 23.5 bits (49), Expect = 7.3, Method: Compositional matrix adjust. Identities = 15/52 (28%), Positives = 21/52 (40%) Query: 29 GRERRKVWDPHPGQLAIEESPARNKVATCGRRFGKSDLGGKRLVPPAFLAYY 80 R+R K P Q I NK R+ G S++G +V A + Y Sbjct: 54 NRDRSKAQAHRPWQTRIVNDTHPNKAVIKSRQLGLSEMGVMEMVHFADMHSY 105 >gi|9399|lcl|protein:vir:99333 Length: 605 # NCBI annotation: hypothetical protein # Family: family:all:1430 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024465;genbank:gi:48696425;genbank:GeneID :2948061 Length = 605 Score = 23.5 bits (49), Expect = 7.3, Method: Compositional matrix adjust. Identities = 15/52 (28%), Positives = 21/52 (40%) Query: 29 GRERRKVWDPHPGQLAIEESPARNKVATCGRRFGKSDLGGKRLVPPAFLAYY 80 R+R K P Q I NK R+ G S++G +V A + Y Sbjct: 54 NRDRSKAQAHRPWQTRIVNDTHPNKAVIKSRQLGLSEMGVMEMVHFADMHSY 105 >gi|27701|lcl|protein:vir:6893 Length: 611 # NCBI annotation: gp17 terminase DNA packaging enzyme, large subunit # Family: family:all:147 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861869;genbank:gi:32453660;genbank:GeneID :1494295 Length = 611 Score = 23.5 bits (49), Expect = 7.5, Method: Compositional matrix adjust. Identities = 8/12 (66%), Positives = 10/12 (83%) Query: 197 TSTPEGKNHFHD 208 T+TP G NHF+D Sbjct: 285 TTTPNGLNHFYD 296 >gi|15201|lcl|protein:vir:7028 Length: 631 # NCBI annotation: large terminase subunit # Family: family:all:697 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853601;genbank:gi:31711683;genbank:GeneID :1481809 Length = 631 Score = 23.5 bits (49), Expect = 8.3, Method: Compositional matrix adjust. Identities = 15/58 (25%), Positives = 23/58 (39%), Gaps = 2/58 (3%) Query: 103 DAEKEFRVL--WNTLVSLGVPFDKPGSYYDAVGGNMHLSLWQGAYQVHAKSAKYPDTL 158 D KEF + + ++ LG P Y + + +W G Y + A Y D L Sbjct: 197 DLTKEFESINQFGDIIYLGTPQSVNSIYNNLPARGYQIRIWPGRYPTLEQEACYGDFL 254 Database: capsid_neck_tail Posted date: Nov 7, 2013 12:16 PM Number of letters in database: 206,069 Number of sequences in database: 514 Lambda K H 0.317 0.135 0.422 Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 303,201 Number of Sequences: 514 Number of extensions: 14880 Number of successful extensions: 72 Number of sequences better than 100.0: 35 Number of HSP's better than 100.0 without gapping: 28 Number of HSP's successfully gapped in prelim test: 7 Number of HSP's that attempted gapping in prelim test: 30 Number of HSP's gapped (non-prelim): 42 length of query: 588 length of database: 206,069 effective HSP length: 77 effective length of query: 511 effective length of database: 166,491 effective search space: 85076901 effective search space used: 85076901 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 40 (20.0 bits)