BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|Aclame:protein:vir:94970|NCBI_annot:putative phage terminase
large subunit|genbank:acc:YP_239275;genbank:gi:66392134;genbank:GeneID:50766
15
         (473 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gi|4548|lcl|protein:vir:94970 Length: 473 # NCBI annotation: put...   984   0.0
gi|9590|lcl|protein:vir:97250 Length: 515 # NCBI annotation: hyp...   200   2e-53
gi|12899|lcl|protein:vir:80432 Length: 510 # NCBI annotation: Bc...   167   2e-43
gi|5072|lcl|protein:vir:95130 Length: 531 # NCBI annotation: hyp...   162   1e-41
gi|18252|lcl|protein:vir:4193 Length: 467 # NCBI annotation: put...   104   3e-24
gi|16077|lcl|protein:vir:4155 Length: 468 # NCBI annotation: lar...   101   2e-23
gi|5427|lcl|protein:vir:95253 Length: 533 # NCBI annotation: Ter...    45   2e-06
gi|6886|lcl|protein:vir:106551 Length: 424 # NCBI annotation: pu...    33   0.010
gi|20804|lcl|protein:vir:106290 Length: 633 # NCBI annotation: g...    27   0.50
gi|1219|lcl|protein:vir:105519 Length: 475 # NCBI annotation: ph...    26   0.81
gi|18891|lcl|protein:vir:8847 Length: 482 # NCBI annotation: ter...    26   0.89
gi|6571|lcl|protein:vir:104349 Length: 218 # NCBI annotation: pu...    25   1.7
gi|26461|lcl|protein:vir:573 Length: 589 # NCBI annotation: unkn...    25   2.1
gi|27798|lcl|protein:vir:8408 Length: 214 # NCBI annotation: gp3...    25   2.2
gi|2959|lcl|protein:vir:102079 Length: 565 # NCBI annotation: te...    24   3.7
gi|2476|lcl|protein:vir:102883 Length: 565 # NCBI annotation: ph...    24   3.8
gi|2423|lcl|protein:vir:107576 Length: 565 # NCBI annotation: ph...    24   3.9
gi|2370|lcl|protein:vir:105001 Length: 565 # NCBI annotation: pu...    24   3.9
gi|1931|lcl|protein:vir:99847 Length: 486 # NCBI annotation: lar...    24   4.3
gi|3559|lcl|protein:vir:101786 Length: 556 # NCBI annotation: gp...    23   5.2
gi|19384|lcl|protein:vir:7851 Length: 887 # NCBI annotation: gp8...    23   5.6
gi|3326|lcl|protein:vir:94525 Length: 440 # NCBI annotation: put...    23   7.3
gi|23191|lcl|protein:vir:101186 Length: 613 # NCBI annotation: t...    23   8.3
gi|22942|lcl|protein:vir:101803 Length: 613 # NCBI annotation: g...    23   8.4
gi|19767|lcl|protein:vir:6384 Length: 677 # NCBI annotation: ter...    23   9.2

>gi|4548|lcl|protein:vir:94970 Length: 473 # NCBI annotation: putative
           phage terminase large subunit # Family: family:all:144 #
           MgeID: mge:1538 # MgeName: Xp15 # Cross-refs:
           genbank:acc:YP_239275;genbank:gi:66392134;genbank:GeneID
           :5076615
          Length = 473

 Score = 984 bits (2543), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 473/473 (100%), Positives = 473/473 (100%) Query: 1 MATDFKLYPPQQRALITPAREILYGGAAGGGKSYLLRVASIVYSLEIPGLITYLFRRTFK 60 MATDFKLYPPQQRALITPAREILYGGAAGGGKSYLLRVASIVYSLEIPGLITYLFRRTFK Sbjct: 1 MATDFKLYPPQQRALITPAREILYGGAAGGGKSYLLRVASIVYSLEIPGLITYLFRRTFK 60 Query: 61 EVLSNHVYTPGGYLEMMKGLIDAGDVVYSKSDNSFTFYNGSRIQLAHSQFENDIYTHQGA 120 EVLSNHVYTPGGYLEMMKGLIDAGDVVYSKSDNSFTFYNGSRIQLAHSQFENDIYTHQGA Sbjct: 61 EVLSNHVYTPGGYLEMMKGLIDAGDVVYSKSDNSFTFYNGSRIQLAHSQFENDIYTHQGA 120 Query: 121 QIGFLIIDEATHFTPPMIRFIRSRVRLGSMIIPPKWKALFPRILYTANPGGVGHHYFKSN 180 QIGFLIIDEATHFTPPMIRFIRSRVRLGSMIIPPKWKALFPRILYTANPGGVGHHYFKSN Sbjct: 121 QIGFLIIDEATHFTPPMIRFIRSRVRLGSMIIPPKWKALFPRILYTANPGGVGHHYFKSN 180 Query: 181 FVDIGSGHVFQAPEDEGSMLREYIPAKLEDNKVMMETDPDYRARLKGMGDSATVQAMLEG 240 FVDIGSGHVFQAPEDEGSMLREYIPAKLEDNKVMMETDPDYRARLKGMGDSATVQAMLEG Sbjct: 181 FVDIGSGHVFQAPEDEGSMLREYIPAKLEDNKVMMETDPDYRARLKGMGDSATVQAMLEG 240 Query: 241 DWEVVSAGGIADLWRSKIHVVHPFKIPHTWKIDRGYDYGSSKPAAYLLFAESDGSEFRDQ 300 DWEVVSAGGIADLWRSKIHVVHPFKIPHTWKIDRGYDYGSSKPAAYLLFAESDGSEFRDQ Sbjct: 241 DWEVVSAGGIADLWRSKIHVVHPFKIPHTWKIDRGYDYGSSKPAAYLLFAESDGSEFRDQ 300 Query: 301 QGRVCWVPAGTVFVIGEDYIANKRQEGLRLTAIEQGRRMARYEAESGYQNRIQPGPADNA 360 QGRVCWVPAGTVFVIGEDYIANKRQEGLRLTAIEQGRRMARYEAESGYQNRIQPGPADNA Sbjct: 301 QGRVCWVPAGTVFVIGEDYIANKRQEGLRLTAIEQGRRMARYEAESGYQNRIQPGPADNA 360 Query: 361 IFSAEPGHRTVADDIGIHGVTFTRSNKNPGSRIEGLQLFRTRLKAATERPMENPGFFVFN 420 IFSAEPGHRTVADDIGIHGVTFTRSNKNPGSRIEGLQLFRTRLKAATERPMENPGFFVFN Sbjct: 361 IFSAEPGHRTVADDIGIHGVTFTRSNKNPGSRIEGLQLFRTRLKAATERPMENPGFFVFN 420 Query: 421 TCFNTIRTIPNLQNSPKNSEDLDTAGEDHIWDVIRYRLLKAAKQIKLIETEGH 473 TCFNTIRTIPNLQNSPKNSEDLDTAGEDHIWDVIRYRLLKAAKQIKLIETEGH Sbjct: 421 TCFNTIRTIPNLQNSPKNSEDLDTAGEDHIWDVIRYRLLKAAKQIKLIETEGH 473 >gi|9590|lcl|protein:vir:97250 Length: 515 # NCBI annotation: hypothetical protein ORF012 # Family: family:all:144 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294520;genbank:gi:149408241;genbank:Ge neID:5237115 Length = 515 Score = 200 bits (509), 
Expect = 2e-53, Method: Compositional matrix adjust. Identities = 144/470 (30%), Positives = 238/470 (50%), Gaps = 36/470 (7%) Query: 15 LITPAREILYGGAAGGGKSYLLRVASIV-----YSLEIPGLITYLFRRTFKEVLSNHVYT 69 + P E+LY G G GK+ L + + Y E G+ LFR+T+ ++ T Sbjct: 60 MAHPIFEVLYEGTRGPGKTDCLLMDFLQHVGKGYGSEWRGI---LFRQTYPQLSDVINKT 116 Query: 70 PGGYLEMMKGLIDAGDVVYSKSDNSFTFYNGSRIQLAHSQFENDIYTHQGAQIGFLIIDE 129 + + G Y+K ++ +TF +G + L H + D + + G ++ +E Sbjct: 117 NKWFKRIFPG------AKYNKVEHKWTFPDGEELLLRHMKSPEDYWNYHGHAYPWIGWEE 170 Query: 130 ATHFTPPMI-RFIRSRVRLGSMIIPPKWKALFPRILYTANPGGVGHHYFKSNF--VDIGS 186 ++ + S R +P ++A T NP G GH++ K+ F + Sbjct: 171 LCNWADDKCYTVMMSCCRSTKPGMPRCYRA-------TTNPYGPGHNWVKARFRLPHMRG 223 Query: 187 GHVFQAPED-EGSMLREYIPAKLEDNKVMMETDPDYRARLKGMG-DSATVQAMLEGDWEV 244 + A D E R I + +N++++ DP+Y ++++ + + + A L G W++ Sbjct: 224 RVILDAMRDGEREPPRVAIHGSIYENQILLHADPEYISKIRAAARNPSELAAWLHGSWDI 283 Query: 245 VSAGGIADLWRSKIHVVHPFK---IPHTWKIDRGYDYGSSKPAAYLLFAESDGSEFRDQQ 301 ++ G D++R +HVV IP WKIDR +D+GSSKP A L +AES+G F + Sbjct: 284 IAGGMFDDIYRGDVHVVPSVPLSVIPKRWKIDRSFDWGSSKPFAVLWWAESNGEPF-EWN 342 Query: 302 GRVCWVPAGTVFVIGEDYIAN-KRQEGLRLTAIEQGRRMARYEAESGYQNRIQPGPADNA 360 GRV G +++I E Y N R EG+R+ A E + + E + + R++PGPAD++ Sbjct: 343 GRVYGKVRGDLYLIQEWYGWNGTRNEGVRMLASEVAQGVKDREEDWALEGRVKPGPADSS 402 Query: 361 IFSAEPGHRTVADDIGIHGVTFTRSNKNPGSRIEGLQLFRTRLKAATER----PMENPGF 416 IF E G+ ++A D+ GV +T ++K PGSR +G + R LK A P E PG Sbjct: 403 IFDVENGN-SIAVDMEKKGVRWTPADKGPGSRKQGWEQIRKLLKGALPPAGGGPREVPGL 461 Query: 417 FVFNTCFNTIRTIPNLQNSPKNSEDLDTAGEDHIWDVIRYRLLKAAKQIK 466 ++F+ C TI T+P L K+ +D++T EDHI D IRYR+ K + +K Sbjct: 462 YIFDWCQQTIETVPVLPRDDKDLDDVNTEAEDHIGDAIRYRVRKKLRGVK 511 >gi|12899|lcl|protein:vir:80432 Length: 510 # NCBI annotation: BcepGomrgp04 # Family: family:all:144 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210224;genbank:gi:146329916;genbank:Ge neID:5123541 Length = 510 Score = 167 bits (424), Expect = 2e-43, Method: Compositional matrix adjust. 
Identities = 143/498 (28%), Positives = 229/498 (45%), Gaps = 49/498 (9%) Query: 9 PPQQRALIT-PAREILYGGAAGGGKS-----YLLRVASIVYSLEIPGLITYLFRRTFKEV 62 P Q A IT P +LY G G GK+ R + Y G+I F R +K Sbjct: 17 PGSQTAAITYPGHHLLYEGTRGPGKTDAQLMKFRRYVGLGYGRFWRGII---FDREYKN- 72 Query: 63 LSNHVYTPGGYLEMMKGLIDAGDVVYSKSDNSFTFYNGSRIQLAHSQFENDIYTHQGAQI 122 L + V + + + D SKSD + + G + + D + + G + Sbjct: 73 LDDLVSKSQRWFPLFE---DGAKFKASKSDYRWVWPTGEELLFRQIKKSTDYWNYHGQEF 129 Query: 123 GFLIIDEATHF-TPPMIRFIRSRVRLGSMIIPPKW--------KALFPR----ILYTANP 169 F+ +E + + TP + + S R S P W + L P + T NP Sbjct: 130 PFIGWNELSKYPTPDLYESMMSCNR--SSFRPEDWPYIDEHGNQCLLPEMPLMVFSTTNP 187 Query: 170 GGVGHHYFKSNFVDIGS-GHVFQAPED---EGSMLREYIPAK----LEDNKVMMETDPDY 221 G GH++ K F+DI G V + +D + RE + K + P+Y Sbjct: 188 YGPGHNWVKRQFIDIAPPGVVVKTTKDVFNPRTQKREPVTKTQVRLFGSYKENIYLTPEY 247 Query: 222 RARLKGMGDSATVQAMLEGDWEVVSAGGIADLWRSKIHVVHPFKIPHTWKIDRGYDYGSS 281 A L+ + D +A L GDW VV+ G I DLWR ++HV F IP +W++DR +D+GS+ Sbjct: 248 VAELESIKDPNKRKAWLHGDWNVVAGGAIDDLWREEVHVKPRFNIPASWRVDRSFDWGST 307 Query: 282 KPAAYLLFAESDG--SEFRDQQG-RVCWVPAGTVFVIGEDYIANKR---QEGLRLTAIEQ 335 P +AE++G + + G W PA ++ ++ + +GL+L A Sbjct: 308 HPFYVGWWAEANGDTATITNPDGTETYWTPARGSLILFHEWYGTEEIGTNKGLKLGAKAV 367 Query: 336 GRRMARYEAESGYQNRI----QPGPADNAIFSA-EPGHRTVADDIGIHGVTFTRSNKNPG 390 + + EA+ +NRI + GPAD I++ + T+A + +GV + ++K+ G Sbjct: 368 AKGIKEIEAQLWRENRILTPVRAGPADGQIYNVIQKDVDTIAKVMEDNGVMWKPADKSAG 427 Query: 391 SRIEGLQLFRTRLKAATERPMENPGFFVFNTCFNTIRTIPNLQNSPKNSEDLDTAGEDHI 450 +R GLQL R RL+A+ E E PG + + C T+P L N +D+DT EDH Sbjct: 428 ARTNGLQLLRDRLEASLE--GEGPGIYFMSHCTAVTSTLPVLPRDDDNLDDVDTEAEDHP 485 Query: 451 WDVIRYRLLKAAKQIKLI 468 +D +RYR L +A ++ + Sbjct: 486 YDGVRYRCLASANRLATV 503 >gi|5072|lcl|protein:vir:95130 Length: 531 # NCBI annotation: hypothetical protein ORF006 # Family: family:all:144 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293413;genbank:gi:148912834;genbank:Ge neID:5228205 Length = 531 Score = 162 bits (409), Expect = 1e-41, 
Method: Compositional matrix adjust. Identities = 125/348 (35%), Positives = 176/348 (50%), Gaps = 58/348 (16%) Query: 163 ILYTANPGGVGHHYFKSNFVDIGSGHVFQAPEDEGSMLRE----YIPA-KLEDNKVMMET 217 + T NP G GH++ K F+ I AP G+++R Y PA + E+ V+ + Sbjct: 197 VFSTTNPSGPGHNWVKRRFITI-------APR--GTVVRREIQIYNPATEKEETHVISQI 247 Query: 218 --------DP----DYRARLKGMGDSATVQAMLEGDWEVVSAGGIADLWRSKIHVVHPFK 265 +P Y A L+ + + +A L GDW+V + G I DLW+S IHVV F Sbjct: 248 AIFGSYKENPYLPASYIAELESIKEPNLRKAWLYGDWDVTAGGAIDDLWQSHIHVVPRFV 307 Query: 266 IPHTWKIDRGYDYGSSKPAAYLLFAESDGSE------------FRDQQGRVC----WVPA 309 IP +W+IDR YD GSS P + +AE+DG+E F Q G + W Sbjct: 308 IPPSWRIDRTYDDGSSHPFSVGWWAEADGTEATIVLSDGTEFTFCPQPGSLIQLFEWY-- 365 Query: 310 GTVFVIGEDYIANKRQEGLRLTA--IEQG--RRMARYEAESGYQNRIQPGPADNAIFSA- 364 G +YI NK GL+L+A I QG R A ++ PGPADN I Sbjct: 366 GCAKDEKGEYIPNK---GLKLSASNIAQGIIDREISLMANGWILSQPWPGPADNRIRQVI 422 Query: 365 EPGHRTVADDIGIHGVTFTRSNKNPGSRIEGLQLFRTRLKAATERPMENPGFFVFNTCFN 424 + T + GV +T S+K+PGSR+ GLQLFR RL+A+ R E PG + + C Sbjct: 423 DSELDTTEKLMSKKGVRWTESDKSPGSRVIGLQLFRDRLEASVNR--EGPGIYFMSNCVA 480 Query: 425 TIRTIPNLQNSPKNSEDLDTAGEDHIWDVIRYRLL----KAAKQIKLI 468 +I +P L K +D+DT EDH +D++RYR+L KAA + KL+ Sbjct: 481 SIDLLPTLPRDEKKIDDVDTNAEDHCYDMVRYRVLKGANKAAAKFKLV 528 >gi|18252|lcl|protein:vir:4193 Length: 467 # NCBI annotation: putative large terminase subunit # Family: family:all:144 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071818;genbank:gi:11863101;genbank:GeneID :1257603 Length = 467 Score = 104 bits (259), Expect = 3e-24, Method: Compositional matrix adjust. 
Identities = 70/241 (29%), Positives = 117/241 (48%), Gaps = 32/241 (13%) Query: 11 QQRALITPAREILYGGAAGGGKSYLLRVASIVYSLEIPGLITYLFRRTFKEVLSNHVYTP 70 Q + L++ RE+LYGGAAGGGKS L + ++ Y + + RRT+ E+ Sbjct: 63 QIKFLLSDEREVLYGGAAGGGKSVALLMGALQY-VHYSDYAALILRRTYPELSQE----- 116 Query: 71 GGYLEMMKGLIDAGDVVYSKSDNSFTFYNGSRIQLAHSQFENDIYTHQGAQIGFLIIDEA 130 GG ++M + D +++ +TF +G+ +Q H + E D Y +QG+ ++ DE Sbjct: 117 GGLIDMANDWLGGTDAEWNEQKKRWTFPSGAALQFGHMEHEKDRYRYQGSSYHYIAFDEL 176 Query: 131 THFTPPMIRFI-RSRVRLGSMIIPPKWKALFPRILYTANPGGVGHHYFKSNFVDIGSGHV 189 T F RF+ RS + IP +++A T+NPGG+GH + K+ F+ +G Sbjct: 177 TEFLESQYRFMFRSLRKEADDPIPLRFRA-------TSNPGGIGHEWVKTRFI---TGE- 225 Query: 190 FQAPEDEGSMLREYIPAKLEDNKVMMETDPDYRARLKGMGDSATVQAMLEGDWEVVSAGG 249 + +IP+ +N + + D M D T + + EGDW+V GG Sbjct: 226 -----------KTFIPSTWRENPYL---NRDEYEEALNMLDHVTRRQLKEGDWDVSIQGG 271 Query: 250 I 250 + Sbjct: 272 V 272 >gi|16077|lcl|protein:vir:4155 Length: 468 # NCBI annotation: large terminase subunit # Family: family:all:144 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046964;genbank:gi:9630534;genbank:GeneID: 1261708 Length = 468 Score = 101 bits (251), Expect = 2e-23, Method: Compositional matrix adjust. 
Identities = 70/244 (28%), Positives = 118/244 (48%), Gaps = 32/244 (13%) Query: 8 YPPQQRALITPAREILYGGAAGGGKSYLLRVASIVYSLEIPGLITYLFRRTFKEVLSNHV 67 + Q + L++ RE+LYGGAAGGGKS L + ++ Y + + RRT+ E+ Sbjct: 60 FHKQIKFLLSDEREVLYGGAAGGGKSVALLMGALQY-VHYSDYAALILRRTYPELSQE-- 116 Query: 68 YTPGGYLEMMKGLIDAGDVVYSKSDNSFTFYNGSRIQLAHSQFENDIYTHQGAQIGFLII 127 GG ++M + D +++ +TF +G+ +Q H + E D Y +QG+ ++ Sbjct: 117 ---GGLIDMANDWLGGTDAEWNEQKKRWTFPSGAALQFGHMEHEKDRYRYQGSSYHYIAF 173 Query: 128 DEATHFTPPMIRFI-RSRVRLGSMIIPPKWKALFPRILYTANPGGVGHHYFKSNFVDIGS 186 DE T F RF+ RS + + IP R+ T+NPGG+GH + K+ F+ + Sbjct: 174 DELTEFMETQYRFMFRSLRKEVNDHIP-------LRVRATSNPGGIGHEWVKTRFI---T 223 Query: 187 GHVFQAPEDEGSMLREYIPAKLEDNKVMMETDPDYRARLKGMGDSATVQAMLEGDWEVVS 246 G + +IP+ +N + +Y L M D T + + +GDW+V Sbjct: 224 GE------------KTFIPSTWRENPYL--NRDEYEEAL-NMLDHVTRRQLKDGDWDVTL 268 Query: 247 AGGI 250 GG+ Sbjct: 269 QGGV 272 >gi|5427|lcl|protein:vir:95253 Length: 533 # NCBI annotation: Terminase, large subunit # Family: family:all:144 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944884;genbank:gi:38707825;genbank:GeneID :2744038 Length = 533 Score = 44.7 bits (104), Expect = 2e-06, Method: Compositional matrix adjust. 
Identities = 42/139 (30%), Positives = 66/139 (47%), Gaps = 8/139 (5%) Query: 11 QQRALITPAREILYGGAAGGGKSYLLRVASIVYSLEIPGLITYLFRRTFKEVLSNHVYTP 70 Q+ L T A +LYGGAAG GK+ L + S+ + +E P FRR N Sbjct: 78 QEVFLNTNADLVLYGGAAGAGKTAALLMDSLRF-IEDPNYNAVYFRR-------NTTQLQ 129 Query: 71 GGYLEMMKGLIDAGDVVYSKSDNSFTFYNGSRIQLAHSQFENDIYTHQGAQIGFLIIDEA 130 GG K L + + + TF +G+ I+ + + E HQG + + DE Sbjct: 130 GGLWPAAKKLFGKFGGIPHEQKMTITFPSGATIKFTYLELEKHAEGHQGIEYSAIYFDEG 189 Query: 131 THFTPPMIRFIRSRVRLGS 149 THF+ I ++++R+R G+ Sbjct: 190 THFSASQISYLQTRLRSGA 208 >gi|6886|lcl|protein:vir:106551 Length: 424 # NCBI annotation: putative terminase large subunit # Family: family:all:54 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958579;genbank:gi:41179257;genbank:GeneID :2717087 Length = 424 Score = 32.7 bits (73), Expect = 0.010, Method: Compositional matrix adjust. Identities = 37/134 (27%), Positives = 58/134 (43%), Gaps = 19/134 (14%) Query: 162 RILYTANPGGVGHHYFKSNFVDIGSGHVFQAPEDEGSMLREYIPAKLEDNKVMMETDPDY 221 ++ + NP G H+FK N++D + LR I + DN + D Sbjct: 161 KMWFNCNPSG-PFHWFKLNWID---------QMKDKRALR--IHFTMHDNPSL---DSVT 205 Query: 222 RARLKGMGDSATVQAMLEGDWEVVSAGGIADLWRSKIHVVHPFKIP-HTWKIDRGYDYGS 280 R + M Q ++G W V+S G I D + VV+ ++P H K DYG+ Sbjct: 206 INRYERMYSGVFYQRYIQGLW-VMSEGVIYDNFDKDTMVVN--ELPNHFEKYYVSCDYGT 262 Query: 281 SKPAAYLLFAESDG 294 P A+LL+ + G Sbjct: 263 LNPTAFLLWGRNHG 276 >gi|20804|lcl|protein:vir:106290 Length: 633 # NCBI annotation: gp17 terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944105;genbank:gi:38640149;genbank:GeneID :2658038 Length = 633 Score = 26.9 bits (58), Expect = 0.50, Method: Compositional matrix adjust. 
Identities = 11/29 (37%), Positives = 15/29 (51%), Gaps = 8/29 (27%) Query: 156 WKALFP--------RILYTANPGGVGHHY 176 WKA+ P RI+ T+ P G+ H Y Sbjct: 291 WKAILPVISSGRQSRIILTSTPNGINHWY 319 >gi|1219|lcl|protein:vir:105519 Length: 475 # NCBI annotation: phage terminase large subunit # Family: family:all:1730 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516188;genbank:gi:89885991;genbank:GeneID :3964379 Length = 475 Score = 26.2 bits (56), Expect = 0.81, Method: Compositional matrix adjust. Identities = 15/54 (27%), Positives = 24/54 (44%), Gaps = 1/54 (1%) Query: 244 VVSAGGIADLWRSKIHVVHPFKIPHTWKIDRGYDYGSSKPAAYLLFAESDGSEF 297 ++ G I DL I PF +P W + G D+G P A++ + +E Sbjct: 272 MLGHGRIYDLGEDFI-TCDPFPVPAHWLVIDGMDFGWDHPQAHIQLVWDNENEM 324 >gi|18891|lcl|protein:vir:8847 Length: 482 # NCBI annotation: terminase # Family: family:all:1730 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775255;genbank:gi:27476053;genbank:GeneID :2700601 Length = 482 Score = 26.2 bits (56), Expect = 0.89, Method: Compositional matrix adjust. Identities = 17/56 (30%), Positives = 27/56 (48%), Gaps = 4/56 (7%) Query: 411 MENPGFFVFNTCFNTIRTIPNLQNSPKNSEDLDTAGEDHIWDVIRYRLLKAAKQIK 466 MEN VFNTC N ++ + K+ + +D D + RY LL A++ + Sbjct: 405 MENGDLKVFNTCTNFLKEMKMYHR--KDGKIVDR--NDDMISATRYALLMASRHAR 456 >gi|6571|lcl|protein:vir:104349 Length: 218 # NCBI annotation: putative major tail protein # Family: family:all:47 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398977;genbank:gi:81343961;genbank:GeneID :3778881 Length = 218 Score = 25.4 bits (54), Expect = 1.7, Method: Compositional matrix adjust. 
Identities = 14/33 (42%), Positives = 17/33 (51%) Query: 286 YLLFAESDGSEFRDQQGRVCWVPAGTVFVIGED 318 YLLF S + D+Q RV V +V V G D Sbjct: 46 YLLFTASASTLLADKQVRVTAVSGTSVTVEGID 78 >gi|26461|lcl|protein:vir:573 Length: 589 # NCBI annotation: unknown # Family: family:all:4926 # MgeID: mge:13 # MgeName: SPBc2 # Cross-refs: genbank:acc:NP_046608;genbank:gi:9630181;genbank:GeneID: 1261423 Length = 589 Score = 25.0 bits (53), Expect = 2.1, Method: Compositional matrix adjust. Identities = 30/118 (25%), Positives = 51/118 (43%), Gaps = 27/118 (22%) Query: 23 LYGGAAGGGKSYLLRVASIVYSLEIPGLITYLFRRTFKEVLSNHVYTPGGYLEMMKGLID 82 +Y + G GK++L V V ++ PG K V+++ T G E+++ + D Sbjct: 81 MYLASRGQGKTWLTSVYCCVQAILFPGT---------KIVIASG--TKGQAREVIEKIDD 129 Query: 83 -----------AGDVVYSKSDNSFTFYNGSRIQLAHSQFENDIYTHQGAQIGFLIIDE 129 D+ S +D F+NGS I++ S ND + + LI+DE Sbjct: 130 LRKESPNLRREIEDLKTSTNDAKVEFHNGSWIKIVAS---ND--GARSKRANLLIVDE 182 >gi|27798|lcl|protein:vir:8408 Length: 214 # NCBI annotation: gp3 # Family: family:all:30874 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818304;genbank:gi:29566740;genbank:GeneI D:1260058 Length = 214 Score = 25.0 bits (53), Expect = 2.2, Method: Compositional matrix adjust. Identities = 13/32 (40%), Positives = 20/32 (62%), Gaps = 5/32 (15%) Query: 31 GKSYLLRVASIVYSLEIPGLITYLFRRTFKEV 62 GK+Y A++ Y+L+ PG + Y F R F+E Sbjct: 29 GKTY----AALAYALQYPGRVLY-FGRGFREA 55 >gi|2959|lcl|protein:vir:102079 Length: 565 # NCBI annotation: terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512312;genbank:gi:89152481;genbank:GeneID :3953072 Length = 565 Score = 24.3 bits (51), Expect = 3.7, Method: Compositional matrix adjust. 
Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 1/48 (2%) Query: 373 DDIGIHGVTFTRSNKNPGSRIEGLQLFRTRLKAATERPMENPGFFVFN 420 DDI + ++N + EGL+ R+ LK A +RP + F N Sbjct: 278 DDIKDES-NWIKANPIVATYEEGLEGIRSDLKVALDRPEKMRAFLTKN 324 >gi|2476|lcl|protein:vir:102883 Length: 565 # NCBI annotation: phage terminase, large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338134;genbank:gi:77020208;genbank:GeneID :3703792 Length = 565 Score = 24.3 bits (51), Expect = 3.8, Method: Compositional matrix adjust. Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 1/48 (2%) Query: 373 DDIGIHGVTFTRSNKNPGSRIEGLQLFRTRLKAATERPMENPGFFVFN 420 DDI + ++N + EGL+ R+ LK A +RP + F N Sbjct: 278 DDIKDES-NWIKANPIVATYEEGLEGIRSDLKVALDRPEKMRAFLTKN 324 >gi|2423|lcl|protein:vir:107576 Length: 565 # NCBI annotation: phage terminase, large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338185;genbank:gi:77020155;genbank:GeneID :3703707 Length = 565 Score = 23.9 bits (50), Expect = 3.9, Method: Compositional matrix adjust. Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 1/48 (2%) Query: 373 DDIGIHGVTFTRSNKNPGSRIEGLQLFRTRLKAATERPMENPGFFVFN 420 DDI + ++N + EGL+ R+ LK A +RP + F N Sbjct: 278 DDIKDES-NWIKANPIVATYEEGLEGIRSDLKVALDRPEKMRAFLTKN 324 >gi|2370|lcl|protein:vir:105001 Length: 565 # NCBI annotation: putative phage terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459966;genbank:gi:85701381;genbank:GeneID :3882142 Length = 565 Score = 23.9 bits (50), Expect = 3.9, Method: Compositional matrix adjust. 
Identities = 15/48 (31%), Positives = 23/48 (47%), Gaps = 1/48 (2%) Query: 373 DDIGIHGVTFTRSNKNPGSRIEGLQLFRTRLKAATERPMENPGFFVFN 420 DDI + ++N + EGL+ R+ LK A +RP + F N Sbjct: 278 DDIKDES-NWIKANPIVATYEEGLEGIRSDLKVALDRPEKMRAFLTKN 324 >gi|1931|lcl|protein:vir:99847 Length: 486 # NCBI annotation: large terminase subunit # Family: family:all:460 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164067;genbank:gi:56692599;genbank:GeneID :3192575 Length = 486 Score = 23.9 bits (50), Expect = 4.3, Method: Compositional matrix adjust. Identities = 14/51 (27%), Positives = 22/51 (43%) Query: 204 IPAKLEDNKVMMETDPDYRARLKGMGDSATVQAMLEGDWEVVSAGGIADLW 254 I +ED+KV + DP RA L+ + T + E + G + W Sbjct: 380 IRGAMEDHKVRIPYDPKIRAALREVTKQTTAAGNIRFTAERTADGHADEFW 430 >gi|3559|lcl|protein:vir:101786 Length: 556 # NCBI annotation: gp9 # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654764;genbank:gi:109302762;genbank:GeneI D:4156221 Length = 556 Score = 23.5 bits (49), Expect = 5.2, Method: Compositional matrix adjust. Identities = 9/21 (42%), Positives = 15/21 (71%) Query: 68 YTPGGYLEMMKGLIDAGDVVY 88 + PGG+L++ +G + DVVY Sbjct: 398 FVPGGWLKVTEGDVLDFDVVY 418 >gi|19384|lcl|protein:vir:7851 Length: 887 # NCBI annotation: gp8 # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817458;genbank:gi:29565887;genbank:GeneID :1259165 Length = 887 Score = 23.5 bits (49), Expect = 5.6, Method: Compositional matrix adjust. 
Identities = 9/21 (42%), Positives = 15/21 (71%) Query: 68 YTPGGYLEMMKGLIDAGDVVY 88 + PGG+L++ +G + DVVY Sbjct: 729 FVPGGWLKVTEGDVLDFDVVY 749 >gi|3326|lcl|protein:vir:94525 Length: 440 # NCBI annotation: putative large subunit terminase # Family: family:all:54 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223885;genbank:gi:62327097;genbank:GeneI D:5075541 Length = 440 Score = 23.1 bits (48), Expect = 7.3, Method: Compositional matrix adjust. Identities = 11/40 (27%), Positives = 21/40 (52%), Gaps = 1/40 (2%) Query: 20 REILYGGAAGGGKSYLLRVASIVYSLEIPGLITYLFRRTF 59 R ++Y G+ G GKSY A ++ + + + +L R + Sbjct: 36 RYLVYKGSRGSGKSYAT-AAKVIIDIMMYPYVNWLVTRQY 74 >gi|23191|lcl|protein:vir:101186 Length: 613 # NCBI annotation: terminase DNA packaging enzyme large subunit # Family: family:all:147 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932508;genbank:gi:37651634;genbank:GeneID :2610679 Length = 613 Score = 23.1 bits (48), Expect = 8.3, Method: Compositional matrix adjust. Identities = 25/103 (24%), Positives = 38/103 (36%), Gaps = 14/103 (13%) Query: 74 LEMMKGLIDAGDVVYSKSDNSFTFYNGSRIQLAHSQFENDIYTHQGAQIGFLIIDEATHF 133 LE++ + G V ++K S T NG I F + +G + +DE Sbjct: 198 LELLPDFLQPGIVEWNKG--SITLGNGCAI----GAFSSSPDAVRGNSFALIYVDE---- 247 Query: 134 TPPMIRFIRSRVRLGSMIIPPKWKALFPRILYTANPGGVGHHY 176 + FI + I P +IL T P G+ H Y Sbjct: 248 ----VAFIPNFTDAWMAIQPVISSGRRSKILMTTTPNGLNHWY 286 >gi|22942|lcl|protein:vir:101803 Length: 613 # NCBI annotation: gp17 # Family: family:all:147 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238880;genbank:gi:66391955;genbank:GeneID :3416630 Length = 613 Score = 23.1 bits (48), Expect = 8.4, Method: Compositional matrix adjust. 
 Identities = 25/103 (24%), Positives = 38/103 (36%), Gaps = 14/103 (13%)

Query: 74  LEMMKGLIDAGDVVYSKSDNSFTFYNGSRIQLAHSQFENDIYTHQGAQIGFLIIDEATHF 133
           LE++   +  G V ++K   S T  NG  I      F +     +G     + +DE
Sbjct: 198 LELLPDFLQPGIVEWNKG--SITLGNGCAI----GAFSSSPDAVRGNSFALIYVDE---- 247

Query: 134 TPPMIRFIRSRVRLGSMIIPPKWKALFPRILYTANPGGVGHHY 176
               + FI +       I P        +IL T  P G+ H Y
Sbjct: 248 ----VAFIPNFTDAWMAIQPVISSGRRSKILMTTTPNGLNHWY 286

>gi|19767|lcl|protein:vir:6384 Length: 677 # NCBI annotation: terminase
           large subunit TerL # Family: family:all:140 # MgeID: mge:133 #
           MgeName: BcepNazgul # Cross-refs:
           genbank:acc:NP_918997;genbank:gi:34610172;genbank:gi:912
           14212;genbank:GeneID:2559603
          Length = 677

 Score = 22.7 bits (47), Expect = 9.2, Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 25/57 (43%), Gaps = 4/57 (7%)

Query: 244 VVSAGGIADLWRSKIHVVHPFKIPHTWKIDRGYDYGSSKPAAYLLFAESDGSEFRDQ 300
           V +  GIA      I+VV  F +  + ++D   D    KP AYL     D  E R Q
Sbjct: 413 VCNVFGIAPGNPYDIYVVDRFNVIKSQRVDHDGDREWVKPHAYL----EDWQEVRTQ 465

  Database: capsid_neck_tail
    Posted date:  Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database:  514

Lambda     K      H
   0.320    0.138    0.420

Lambda     K      H
   0.267   0.0410    0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 220,873
Number of Sequences: 514
Number of extensions: 10488
Number of successful extensions: 71
Number of sequences better than 100.0: 30
Number of HSP's better than 100.0 without gapping: 22
Number of HSP's successfully gapped in prelim test: 8
Number of HSP's that attempted gapping in prelim test: 26
Number of HSP's gapped (non-prelim): 32
length of query: 473
length of database: 206,069
effective HSP length: 75
effective length of query: 398
effective length of database: 167,519
effective search space: 66672562
effective search space used: 66672562
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 39 (19.6 bits)
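
The effective-length and search-space figures in the closing statistics can be cross-checked by hand. The Python sketch below is not part of the BLAST output; it is a minimal illustration that assumes the usual edge-effect correction (subtract the effective HSP length once from the query and once per database sequence) and the standard Karlin-Altschul conversion from raw score to bit score. The function names are hypothetical, and the choice of the second Lambda/K row (0.267, 0.0410) as the gapped parameters is an assumption, since the report does not label it. Because the printed parameters are rounded and compositional matrix adjustment rescales scores per subject, the bit scores are expected to agree with the report only approximately.

```python
import math

def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space).

    Assumes the effective HSP length is subtracted once from the query
    and once from every database sequence.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db

def bit_score(raw_score, lam=0.267, k=0.0410):
    """Karlin-Altschul bit score: S' = (lambda * S - ln K) / ln 2.

    lam and k default to the second Lambda/K row of the report,
    assumed here to be the gapped parameters.
    """
    return (lam * raw_score - math.log(k)) / math.log(2)

# Values copied from the statistics block of this report.
eff_q, eff_db, space = effective_search_space(
    query_len=473, db_letters=206_069, db_seqs=514, hsp_len=75
)
print(eff_q, eff_db, space)

# Raw scores 2543 and 104 are reported as 984 bits and 44.7 bits.
print(round(bit_score(2543)), round(bit_score(104), 1))
```

The first print reproduces the three reported values (398, 167,519 and 66672562), which suggests the report's search space follows this correction. The bit scores land close to the reported 984 and 44.7 but should be read as an approximation for the reasons given above.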