BLASTP 2.2.18 [Mar-02-2008] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Query= lcl|NC_018849.1_cdsid_YP_006906999.1 [gene=2] [protein=putative terminase large subunit] [protein_id=YP_006906999.1] [location=373..1884] (503 letters) Database: capsid_neck_tail 514 sequences; 206,069 total letters Searching...................................................done Score E Sequences producing significant alignments: (bits) Value gi|12972|lcl|protein:vir:80678 Length: 503 # NCBI annotation: gp... 947 0.0 gi|18494|lcl|protein:vir:1636 Length: 469 # NCBI annotation: put... 161 2e-41 gi|4240|lcl|protein:vir:94734 Length: 469 # NCBI annotation: lar... 159 6e-41 gi|13482|lcl|protein:vir:9757 Length: 471 # NCBI annotation: put... 158 1e-40 gi|19181|lcl|protein:vir:9572 Length: 459 # NCBI annotation: gp3... 147 4e-37 gi|14590|lcl|protein:vir:8098 Length: 515 # NCBI annotation: gp2... 120 3e-29 gi|14676|lcl|protein:vir:2499 Length: 474 # NCBI annotation: put... 119 9e-29 gi|10061|lcl|protein:vir:99071 Length: 537 # NCBI annotation: gp... 115 2e-27 gi|7504|lcl|protein:vir:99915 Length: 478 # NCBI annotation: gp2... 108 1e-25 gi|14207|lcl|protein:vir:8315 Length: 503 # NCBI annotation: gp3... 101 2e-23 gi|14004|lcl|protein:vir:8183 Length: 506 # NCBI annotation: gp3... 79 1e-16 gi|19975|lcl|protein:vir:108214 Length: 544 # NCBI annotation: g... 56 8e-10 gi|13269|lcl|protein:vir:81252 Length: 542 # NCBI annotation: gp... 42 1e-05 gi|27798|lcl|protein:vir:8408 Length: 214 # NCBI annotation: gp3... 41 4e-05 gi|10890|lcl|protein:vir:78144 Length: 530 # NCBI annotation: pr... 40 8e-05 gi|4914|lcl|protein:vir:99572 Length: 459 # NCBI annotation: Ter... 28 0.25 gi|5142|lcl|protein:vir:105417 Length: 470 # NCBI annotation: ge... 27 0.77 gi|17761|lcl|protein:vir:171 Length: 470 # NCBI annotation: term... 26 0.86 gi|5726|lcl|protein:vir:95379 Length: 573 # NCBI annotation: pha... 25 1.7 gi|12641|lcl|protein:vir:80117 Length: 572 # NCBI annotation: Ph... 25 1.7 gi|19315|lcl|protein:vir:4508 Length: 577 # NCBI annotation: lar... 23 5.8 gi|16642|lcl|protein:vir:9710 Length: 203 # NCBI annotation: hyp... 23 6.4 gi|17922|lcl|protein:vir:4452 Length: 577 # NCBI annotation: Ter... 23 8.3 gi|6927|lcl|protein:vir:106715 Length: 507 # NCBI annotation: gp... 23 9.1 gi|11323|lcl|protein:vir:78588 Length: 507 # NCBI annotation: Te... 23 9.3 gi|14466|lcl|protein:vir:3519 Length: 469 # NCBI annotation: P18... 23 9.5 >gi|12972|lcl|protein:vir:80678 Length: 503 # NCBI annotation: gp2 # Family: family:all:523 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285578;genbank:gi:148727084;genbank:Ge neID:5247049 Length = 503 Score = 947 bits (2449), Expect = 0.0, Method: Compositional matrix adjust. Identities = 480/503 (95%), Positives = 492/503 (97%) Query: 1 MSGVVGSQVPRHRVAAAYSVTAGGDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASG 60 MSGVVGSQVPRHRVAAAYSV+AGGDAGELGRAYGLTPDPWQQQVLDDWLAVG NGRLASG Sbjct: 1 MSGVVGSQVPRHRVAAAYSVSAGGDAGELGRAYGLTPDPWQQQVLDDWLAVGSNGRLASG 60 Query: 61 VCGVFVPRQNGKNAILEIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQFPD 120 VCGVFVPRQNGKNAILEIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQFPD Sbjct: 61 VCGVFVPRQNGKNAILEIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQFPD 120 Query: 121 LYRMVKSIRATNGQEAIVLHHPDCATFEKKCGCPGWGSVEFVARSRGSARGFTVDDLVCD 180 LYRMVKSIRATNGQEAIVLHHPDCATFEKKCGC GWGSVEFVARSRGSARGFTVDDLVCD Sbjct: 121 LYRMVKSIRATNGQEAIVLHHPDCATFEKKCGCSGWGSVEFVARSRGSARGFTVDDLVCD 180 Query: 181 EAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSGGKRFAWT 240 EAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQAL GGKRFAWT Sbjct: 181 EAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALGGGKRFAWT 240 Query: 241 EFSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRG 300 EFSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRG Sbjct: 241 EFSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRG 300 Query: 301 QSASSVIPADKWVQSAVGEASLVGGKVFGVSFSRSGDRVALAGAGRTDAGVHVEVIDGLS 360 QSA+SV+PADKW QSAV EASLVGGKVFGVSFSRSGDRVALAGAG+TDAGVHVEVIDGLS Sbjct: 301 QSAASVVPADKWAQSAVDEASLVGGKVFGVSFSRSGDRVALAGAGKTDAGVHVEVIDGLS 360 Query: 361 GTIVDGVGRLADWLAVRWGDTEKIMVAGSGAVLLQKALTDRGVPGRGVIVADTGVYVEAC 420 GTIVDGVGRLADWLAVRWGDT++IMVAGSGAVLLQKALTDRG+PGRGV+VADTGVYVEAC Sbjct: 361 GTIVDGVGRLADWLAVRWGDTDRIMVAGSGAVLLQKALTDRGIPGRGVVVADTGVYVEAC 420 Query: 421 QAFLEGVRSGSVSHPRADSRRDMLDIAVRSAVQKKKGSAWGWGSSFKDGSEVPLEAVSLA 480 QAFLEGVRSG +SHPRADSRRDMLDIAVRSAVQK+KGSAWGWGSSFKDGSEVPLEAVSLA Sbjct: 421 QAFLEGVRSGVISHPRADSRRDMLDIAVRSAVQKRKGSAWGWGSSFKDGSEVPLEAVSLA 480 Query: 481 YFGAKMAKAKRRERSGRKRVSVV 503 + GAK + RRERSGRKRVSVV Sbjct: 481 FLGAKRVRRGRRERSGRKRVSVV 503 >gi|18494|lcl|protein:vir:1636 Length: 469 # NCBI annotation: putative terminase # Family: family:all:523 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695057;genbank:gi:23455748;genbank:GeneID :955481 Length = 469 Score = 161 bits (407), Expect = 2e-41, Method: Compositional matrix adjust. Identities = 136/495 (27%), Positives = 236/495 (47%), Gaps = 42/495 (8%) Query: 6 GSQVPRHRVAAAYSVTAGGDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASGVCGVF 65 G+Q P V ++ T +A E+ PWQ+ +L + +A+ +G G Sbjct: 8 GNQYPTQSVILPFTETKYQEAIEIYEKSKHECYPWQKNLLKEVMAIDEDGLWTHQKFGYS 67 Query: 66 VPRQNGKNAILEIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQFPDLYRMV 125 +PR+NGK I+ I+EL+ + +QG ILHTAH + ++ ++ +L+ + E+ Sbjct: 68 IPRRNGKTEIVYILELW-SLVQGLSILHTAHRISTSHSSYEKLKKYLEDSGYVEG--EDF 124 Query: 126 KSIRATNGQEAIVLHHPDCATFEKKCGCPGWGSVEFVARSRGSARGFTVDDLVCDEAQEL 185 KSI+A GQE + L G ++F R+ G D LV DEAQE Sbjct: 125 KSIKA-KGQERLEL-------------IESGGVIQFRTRTSSGGLGEGFDILVIDEAQEY 170 Query: 186 SDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSGGKRFA-WTEFSI 244 + EQ AL TV+ S +P I GTPP P++ G+V R ++G +++ W E+S+ Sbjct: 171 TTEQESALKYTVT--DSDNPMTIMCGTPPTPVSSGTVFTNYRDNTIAGKAKYSGWAEWSV 228 Query: 245 PDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRGQSAS 304 D D DV + ++NP++G LN + E +RLG+W + + Sbjct: 229 EDVKDIHDVEAWY-----NSNPSMGYHLNERKIEAEL-GEDKLDHNVQRLGYWPK-YNQK 281 Query: 305 SVIPADKWVQSAVGEASLVGGKVF-GVSFSRSGDRVALAGAGRTDAG-VHVEVIDGLSGT 362 SVI +W V ++ GK+F G+ + G VA++ A +T +G V VE ID S Sbjct: 282 SVISEQEWNALKVNRLPVIKGKLFVGIKYGNDGANVAMSIAVKTLSGKVFVETIDCQS-- 339 Query: 363 IVDGVGRLADWLAVRWGDTEKIMVAG-SGAVLLQKALTDRGVPGRGVIVADTGVYVEACQ 421 I +G + ++L + D EK+++ G SG +L + D + + I+ + A Sbjct: 340 IRNGNQWIINFL--KKADVEKVVIDGQSGQSILTSEMKDFKL--KEPILPTVKEIINANS 395 Query: 422 AFLEGVRSGSVSHPRADSRRDMLDIAVRSAVQKKKGSA--WGWGSSFKDGSEVPLEAVSL 479 + +G+ + H S + L V + ++ G++ +G+ S F D +++ L Sbjct: 396 LWEQGIFQKNFCH----SGQPSLSTVVTNCDKRNIGTSGGFGYKSQFDDMDISLMDSALL 451 Query: 480 AYFGAKMAKAKRRER 494 A++ K K++++ Sbjct: 452 AHWACSNNKPKKKQQ 466 >gi|4240|lcl|protein:vir:94734 Length: 469 # NCBI annotation: large subunit terminase # Family: family:all:523 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996704;genbank:gi:45597419;genbank:GeneID :2767958 Length = 469 Score = 159 bits (403), Expect = 6e-41, Method: Compositional matrix adjust. Identities = 137/495 (27%), Positives = 234/495 (47%), Gaps = 42/495 (8%) Query: 6 GSQVPRHRVAAAYSVTAGGDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASGVCGVF 65 G+Q P V ++ T +A E+ PWQ+ +L + +A+ +G G Sbjct: 8 GNQYPTQSVILPFTETKYQEAIEIYEKSKHECYPWQKNLLKEIMAIDEDGLWTHQKFGYS 67 Query: 66 VPRQNGKNAILEIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQFPDLYRMV 125 +PR+NGK I+ I+EL+ A QG ILHTAH + ++ ++ +L+ + E+ Sbjct: 68 IPRRNGKTEIVYILELW-ALEQGLSILHTAHRISTSHSSYEKLKKYLEDSGYVEG--EDF 124 Query: 126 KSIRATNGQEAIVLHHPDCATFEKKCGCPGWGSVEFVARSRGSARGFTVDDLVCDEAQEL 185 KSI+A GQE + L G ++F R+ G D L DEAQE Sbjct: 125 KSIKA-KGQERLEL-------------IESGGVIQFRTRTSSGGLGEGFDILFIDEAQEY 170 Query: 186 SDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSGGKRFA-WTEFSI 244 + EQ AL TV+ S +P I GTPP P++ G+V R L+G +++ W E+S+ Sbjct: 171 TTEQESALKYTVT--DSDNPMTIMCGTPPTPVSSGTVFTNYRDNTLAGKAKYSGWAEWSV 228 Query: 245 PDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRGQSAS 304 D D DV + ++NP++G LN + E +RLG+W + + Sbjct: 229 EDVKDIHDVEAWY-----NSNPSMGYHLNERKIEAEL-GEDKLDHNVQRLGYWPK-YNQK 281 Query: 305 SVIPADKWVQSAVGEASLVGGKVF-GVSFSRSGDRVALAGAGRTDAG-VHVEVIDGLSGT 362 SVI +W V ++ GK+F G+ + G VA++ A +T +G V VE ID S Sbjct: 282 SVISEQEWNALKVNRLPVIKGKLFVGIKYGNDGANVAMSIAVKTLSGKVFVETIDCQS-- 339 Query: 363 IVDGVGRLADWLAVRWGDTEKIMVAG-SGAVLLQKALTDRGVPGRGVIVADTGVYVEACQ 421 I +G + ++L + D EK+++ G SG +L + D + + I+ + A Sbjct: 340 IRNGNQWIINFL--KKADVEKVVIDGQSGQSILTSEMKDFKL--KEPILPTVKEIINANS 395 Query: 422 AFLEGVRSGSVSHPRADSRRDMLDIAVRSAVQKKKGSA--WGWGSSFKDGSEVPLEAVSL 479 + +G+ + H S + L V + ++ G++ +G+ S F D +++ L Sbjct: 396 LWEQGIFQKNFCH----SGQPSLSTVVTNCDKRNIGTSGGFGYKSQFDDMDISLMDSALL 451 Query: 480 AYFGAKMAKAKRRER 494 A++ K K++++ Sbjct: 452 AHWACSNNKLKKKQQ 466 >gi|13482|lcl|protein:vir:9757 Length: 471 # NCBI annotation: putative terminase # Family: family:all:523 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795519;genbank:gi:28876285;genbank:GeneID :1257826 Length = 471 Score = 158 bits (399), Expect = 1e-40, Method: Compositional matrix adjust. Identities = 139/497 (27%), Positives = 240/497 (48%), Gaps = 42/497 (8%) Query: 5 VGSQVPRHRVAAAYSVTAGGDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASGVCGV 64 +G+Q P V ++ + +A + GL+ PWQ +L +A+ NG G Sbjct: 9 LGNQRPTQSVNLHFAKSLAHEAINYYKKTGLSCYPWQVNMLIPIMAIDENGLWVHQKYGY 68 Query: 65 FVPRQNGKNAILEIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQFPDLYRM 124 +PR+NGK ++ IVEL+ A +G +ILHTAH + ++ +F +++ + E + D Sbjct: 69 AIPRRNGKTEVVYIVELW-ALHKGLKILHTAHRISTSHASFEKVKKYLEMS-GYVDGEDF 126 Query: 125 VKSIRATNGQEAIVLHHPDCATFEKKCGCPGWGSVEFVARSRGSARGFTVDDLVCDEAQE 184 + + GQE I K G ++F R+ G D L+ DEAQE Sbjct: 127 ISN--KAKGQERIEF---------KASG----AVIQFRTRTSNGGLGEGFDLLIIDEAQE 171 Query: 185 LSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSGGKRFA-WTEFS 243 + EQ AL TV+ S +P I GTPP ++ G+V R L G KR++ W E+S Sbjct: 172 YTSEQESALKYTVT--DSDNPMTIMCGTPPTMVSTGTVFEAYRKDCLKGNKRYSGWAEWS 229 Query: 244 IPDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRGQSA 303 +P+ +DVS + +NP++G LN + E +RLG+W + Sbjct: 230 VPEMVKINDVSSWYI-----SNPSMGFHLNERKIEAEL-GEDEIDHNIQRLGYWPSF-NQ 282 Query: 304 SSVIPADKWVQSAVGEASLVGGKVF-GVSFSRSGDRVALAGAGRT-DAGVHVEVIDGLSG 361 SVI +W + V + + K+F G+ F + G+ V+L+ A RT + V VE ID LS Sbjct: 283 KSVISEKEWAKLKVEQVPELKSKLFVGIKFGQDGNNVSLSIAARTSENKVFVETIDCLS- 341 Query: 362 TIVDGVGRLADWLAVRWGDTEKIMVAG-SGAVLLQKALTDRGVPGRGVIVADTGVYVEAC 420 + +G + ++L + D K+++ G SG LL + + ++G+ + + + A Sbjct: 342 -VRNGTQWIINFL--KSADIAKVVIDGASGQELLAQEMKEQGL--KKPELPKVAEIITAN 396 Query: 421 QAFLEGVRSGSVSHPRADSRRDMLDIAVRSAVQKKKGSAWGWG-SSFKDGSEVPL-EAVS 478 + +G+ ++ H S + L V + +++ GS G+G S D ++ L ++ Sbjct: 397 MMWEQGIMQETICH----SDQPSLTAVVTNCEKRQIGSNGGFGYKSLYDDRDISLMDSAL 452 Query: 479 LAYFGAKMAKAKRRERS 495 LA++ K KR++R+ Sbjct: 453 LAHWICYTTKPKRKQRT 469 >gi|19181|lcl|protein:vir:9572 Length: 459 # NCBI annotation: gp38 # Family: family:all:523 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862877;genbank:gi:32469469;genbank:GeneID :1461314 Length = 459 Score = 147 bits (370), Expect = 4e-37, Method: Compositional matrix adjust. Identities = 139/490 (28%), Positives = 228/490 (46%), Gaps = 48/490 (9%) Query: 5 VGSQVPRHRVAAAY--SVTAGGDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASGVC 62 +G+Q P V Y + +A EL GL+ WQ+ +L +AV NG Sbjct: 6 LGNQNPTQSVILKYVKKNSKAKEAIELYERTGLSCYAWQKNLLLPMMAVDKNGLWVHQKF 65 Query: 63 GVFVPRQNGKNAILEIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQFPDLY 122 G +PR+NGK+ +L I E++ +G ILHTAH + ++ +F +++ + E + + D Sbjct: 66 GYSIPRRNGKSELLYIGEIW-GLHEGLNILHTAHRISTSHASFEKVKRYLE-KMGYVDG- 122 Query: 123 RMVKSIRATNGQEAIVLHHPDCATFEKKCGCPGWGSVEFVARSRGSARGFTVDDLVCDEA 182 SIRA GQE I L+ G ++F R+ G D L+ DEA Sbjct: 123 EDFNSIRA-KGQERIELYSTG-------------GVIQFRTRTSNGGLGEGFDMLIIDEA 168 Query: 183 QELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSG-GKRFAWTE 241 QE + EQ AL TV+ S +P I GTPP P++ G+V + R L G GK W E Sbjct: 169 QEYTTEQESALKYTVT--DSENPITIMCGTPPTPVSSGTVFTKYRETCLFGKGKYSGWAE 226 Query: 242 FSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRGQ 301 +S+ DE + DDV + ++NP++G LN + E +RLG+W Sbjct: 227 WSVSDEKEIDDVEAWY-----NSNPSMGYHLNERKIEAEL-GEDKLDHNIQRLGFWPT-Y 279 Query: 302 SASSVIPADKWVQSAVGEASLVGGKV-FGVSFSRSGDRVALAGAGRTDAG-VHVEVIDGL 359 + S I +W + + + + GK+ G+ + + G VA++ A RT+ G VE +D Sbjct: 280 NQKSAISETEWNELKMDDIPELSGKLSVGIKYGQDGTNVAMSIAARTNDGRFFVETVDCQ 339 Query: 360 SGTIVDGVGRLADWLA--VRWGDTEKIMVAG-SGAVLLQKALTDRGVPGRGVIVADTGVY 416 S V +W+ +R D +I++ G SG +L + L D + + VI+ Sbjct: 340 S------VRNGNEWMVAFLRQADVAQIVIDGASGQKILDEELKDYRI--KNVILPTVKEI 391 Query: 417 VEACQAFLEGVRSGSVSHPRADSRRDMLDIAVRSAVQKKKGSAWGWG--SSFKDGSEVPL 474 + A + +G+ ++ H S L + ++ GS G+G S F D + + Sbjct: 392 IVANALWEQGIYQKTICHAGQPS----LSKVATNCDKRNIGSNGGFGYRSHFDDMNISLM 447 Query: 475 EAVSLAYFGA 484 ++ LA++ Sbjct: 448 DSALLAHWAC 457 >gi|14590|lcl|protein:vir:8098 Length: 515 # NCBI annotation: gp2 # Family: family:all:523 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817679;genbank:gi:29566110;genbank:GeneID :1259304 Length = 515 Score = 120 bits (302), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 144/524 (27%), Positives = 216/524 (41%), Gaps = 73/524 (13%) Query: 10 PRHRVAAAYSVTAGGDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASGVCGVFVPRQ 69 P + AYS T G + +L G PDP Q+ L+ + NGR + V RQ Sbjct: 7 PAYANFPAYSETYGPEVADLCDLAGFPPDPEQELGLNALFGIDSNGRSTAFEFVVIAARQ 66 Query: 70 NGKNAILEIVELFKATIQGRRILH-TAHELKSARKAFMRLRSFFENERQFPDLYRMVKSI 128 N K + L I +R++ +AHE+ R+AF L + E+ P L + ++ Sbjct: 67 NLKTGFEKQAALGWLFITEQRLISWSAHEMTPTREAFNDLVNLIEST---PSLAKRLED- 122 Query: 129 RATNG------QEAIVLHHPDCATFEKKCGCPGWGSVEFVARSRGSARGFTVDDLVCDEA 182 TNG EAI L + CP V F AR+ RG T + ++ DE Sbjct: 123 GPTNGVFRGAGTEAIAL--------KPSKACPDGQRVIFKARTNSGGRGLTGNKVILDEG 174 Query: 183 QELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRL----RGQALSGGKRFA 238 L + +L+PT+SA P DPQ + +G+ AD V+ +L R + L+ KR Sbjct: 175 FALRHAHMGSLMPTLSAVP--DPQ-LLIGSS-ACHADSEVLHKLVKRGRSEELAPRKRLG 230 Query: 239 WTEFSIPDESDPDDVSRQW----------RKLAGDTNPALGRRLNFGTVSDEHESMSAAG 288 + EF P+ + DD + R+ NP GRR+ + + E +S+ A Sbjct: 231 YLEFCAPENACEDDECPHYVGYPGCAMDKREYIIMANPQAGRRITWEYLEGERDSLDPAE 290 Query: 289 FARERLGWWDR-GQSASSVIPADKWVQSAVGEASLVGGKVFGVSFSRSGDRVALAGAG-R 346 F RERLGW D+ + +I D W ++ FGV ++ A+ AG R Sbjct: 291 FGRERLGWHDKPAIEDAPLISKDGWATKMDPKSQPGPRLAFGVYVNKLQTAAAIGVAGYR 350 Query: 347 TDAGVHVEVIDGLSG---TIVDGVG-------RLAD-WLAVRWGDTEKIMVAGSGAVL-- 393 D +HV ++ G + G+ L D W WG ++ + +GA+L Sbjct: 351 EDGKIHVGIVPAARGGNVATLPGINWIPARMKELKDSWRPCGWGLDDR---SAAGALLPD 407 Query: 394 LQK---ALTDRGVPGRGVIVADTGVYVEACQAFLEGVRSGSVSH----PRADSRRDMLDI 446 L+K + D G G+ A AC F +S + H P ADS Sbjct: 408 LKKLGFEVGDEVTNGAGINNATPADVARACGTFYAKYQSDDLRHQGSKPLADS------- 460 Query: 447 AVRSAVQKKKGSAWGWG-SSFKDGSE--VPLEAVSLAYFGAKMA 487 V + + AW W KD V L AV+LA + A Sbjct: 461 -VTAGKMRDLADAWAWDRKDAKDAKSDIVQLMAVTLAVHALETA 503 >gi|14676|lcl|protein:vir:2499 Length: 474 # NCBI annotation: putative terminase gp4 # Family: family:all:523 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569740;genbank:gi:18496890;genbank:GeneID :932339 Length = 474 Score = 119 bits (298), Expect = 9e-29, Method: Compositional matrix adjust. Identities = 131/483 (27%), Positives = 206/483 (42%), Gaps = 43/483 (8%) Query: 7 SQVPRHRVAAAYSV-TAGGDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASGVCGVF 65 S+V RH +A V TA A GL D WQ + A +G A+ + + Sbjct: 9 SEVARHVIAPQGIVSTAWPSVRATCGAMGLGFDLWQDDLGKLICAKRDDGLYAADMFAMS 68 Query: 66 VPRQNGKNAIL-EIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQFPDLYRM 124 +PRQ GK +L +V ++ TAH ++A + F ++ + ++ P + Sbjct: 69 IPRQTGKTYLLGALVFALCIKTPNTTVIWTAHRTRTAAETFRSMQGLAKRDKIAPHIL-- 126 Query: 125 VKSIRATNGQEAIVLHHPDCATFEKKCGCPGWGS-VEFVARSRGSARGFT-VDDLVCDEA 182 ++ NG+EA++ + GS + F AR RG RGF VD L+ DEA Sbjct: 127 --NVHTGNGKEAVLFKN---------------GSRILFGARERGFGRGFAGVDVLIFDEA 169 Query: 183 QELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSGG-KRFAWTE 241 Q L++ ++ ++P +AAP +P + GTPP P G V +R AL+G + E Sbjct: 170 QILTENAMDDMVPATNAAP--NPLILLAGTPPKPTDPGEVFTVMRLDALAGDVDDVGYVE 227 Query: 242 FSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRGQ 301 S +++DPDD S QWRK+ NP+ R + + +++ F RE +G W + Sbjct: 228 ISADEDADPDDRS-QWRKM----NPSYPHRTSARAILRMRKALGDESFKREAMGIWPKVS 282 Query: 302 SASSVIPADKWVQ-SAVGEASLVGGKVFGVSFSRSGDRVALAGAGRTDAGVHVEVIDGLS 360 V+ + +W +G V S A D G HVE + + Sbjct: 283 VHQPVVKSGRWHDLFDLGPEDGEAPNALAVDMSHGLAISVGACWLMDDDGRHVEEV--WA 340 Query: 361 GTIVDGVGRLADWLAVRWGDTEKIMV-AGSGAVLLQKALTDRGVPGRGVIVADTGVYVEA 419 GT DW+A R G +++ + S A L L R V + AD + Sbjct: 341 GT---DTAAAVDWIAERAGRRIPVLIDSMSPAAALAPELKARKVKVKLTGAADMA---KG 394 Query: 420 CQAFLEGVRSGSVSHPRADSRRDMLDIAVRSAVQKKKGSAWGWGSSFKDGSEVPLEAVSL 479 C F GV + +++H + D L A + ++ G WGW PL AV+L Sbjct: 395 CGLFENGVNADTLTHGDQPALNDALAGARKRPIRDAGG--WGWDRRDPTCVIHPLVAVTL 452 Query: 480 AYF 482 A Sbjct: 453 ALL 455 >gi|10061|lcl|protein:vir:99071 Length: 537 # NCBI annotation: gp26 # Family: family:all:523 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655891;genbank:gi:109521463;genbank:GeneI D:4158036 Length = 537 Score = 115 bits (287), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 117/447 (26%), Positives = 193/447 (43%), Gaps = 50/447 (11%) Query: 12 HRVAAAYSVTAGGDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASGVCGVFVPRQNG 71 H Y+ + G A G DPWQ+ VL + L + G A+ + VPRQNG Sbjct: 36 HCFIPPYTTSLGDKAMWFLHQVGFELDPWQEFVLRNMLNLDAQGHWAASEALLLVPRQNG 95 Query: 72 KNAILEIVELFKATIQGRRI-LHTAHELKSARKAFMRLRSFFENERQFPDLYRMVKSIRA 130 K AI+E EL + ++ +HTA +AR++F RL++ EN + R R+ Sbjct: 96 KTAIIEARELVGLYVVCDKLCIHTAVLFNAARESFYRLKARIENNETLNKITRF----RS 151 Query: 131 TNGQEAIVLHHPDCATFEKKCGCPGWGSVEFVARSRGSARGFTVDDLVCDEAQELSDEQL 190 N +I + P + G G V ++AR ARGF+ D +V DEA L Sbjct: 152 GNDNMSIEV-KPKKESRHPNAG----GRVIYMARGTAVARGFSADVIVLDEAFALD---- 202 Query: 191 EALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSGGKRFAWTEFSIPDESDP 250 EA + + A S + L D + + +L + + + E+ D Sbjct: 203 EASIAAIDYATSARANPFIIYASSTGLEDSTELEKLHDRGMRQDPDMLFMEWCATTR-DL 261 Query: 251 DDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRGQSASSVIPAD 310 DD +R +NPALG R++ + E S F RERLG W+ + ++VIPAD Sbjct: 262 DDEENYYR-----SNPALGYRISIERIRKERNRHSDKTFGRERLGLWN-DNAFNAVIPAD 315 Query: 311 KW---------------VQSA-VGEASLVGGKVFGVSFSRSGDRVALAGAGRT-DAGVHV 353 +W V+ A G + +V V + + ++ AG+ D V + Sbjct: 316 QWKSLCLCHGTVHDEHRVEGAEAGWSRIVTPTVVAIDSAPDSSLTTISWAGKNQDGQVQI 375 Query: 354 EVIDGLSGTIVDGVGRLADWLAVRWGDTEKIMVAGSGAVLLQKALTD-RGVP---GRGVI 409 E++ S GVG +++A+ + D +++ AV++Q T + +P G+ Sbjct: 376 EILQEAS-----GVGWAVEFVAMLY-DPQRVETPPPLAVVVQAGATAGQLIPELEALGIE 429 Query: 410 VADTGVY--VEACQAFLEGVRSGSVSH 434 V G+ +AC+ F + ++H Sbjct: 430 VIPFGLRDACDACKYFYDRANDRRLAH 456 >gi|7504|lcl|protein:vir:99915 Length: 478 # NCBI annotation: gp2 # Family: family:all:523 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655519;genbank:gi:109392289;genbank:GeneI D:4157084 Length = 478 Score = 108 bits (271), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 142/513 (27%), Positives = 223/513 (43%), Gaps = 69/513 (13%) Query: 3 GVVGSQVPRHRVAAAYSVTAGGDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASGVC 62 G+VG+ PR R + + G T D WQ + LA+ G G A+ Sbjct: 18 GIVGTAWPRVR--------------DTCKNIGWTFDRWQDGLGRLILALDGTGLYAADTS 63 Query: 63 GVFVPRQNGKNAILEIVELFKATIQ-GRRILHTAHELKSARKAFMRLRSFFENERQFPDL 121 + +PRQ GK ++ + A + G ++ TAH K+A++ F +++ P + Sbjct: 64 VISIPRQVGKTYLIGCIVFALALLTPGLTVIWTAHRTKTAKETFGSMKAMCAT----PLV 119 Query: 122 YRMVKSIRATNGQEAIVLHHPDCATFEKKCGCPGWGS-VEFVARSRGSARGFT-VDDLVC 179 V+++ G E I LH+ GS + F AR G GF V LV Sbjct: 120 NAHVRNVSDARGDEGIYLHN---------------GSRILFGARENGFGLGFAGVGILVL 164 Query: 180 DEAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSGGKR-FA 238 DEAQ L+D+ ++ L+PT++ +P + GTPP P G V LR AL G Sbjct: 165 DEAQRLTDKAMDDLIPTMNTVE--NPLILLTGTPPRPTDSGEVFTMLRQDALDGESEGTL 222 Query: 239 WTEFSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWD 298 + EFS + + PDD + Q RK NP+ R + + ++++ F RE G WD Sbjct: 223 YVEFSADEGAHPDDRA-QLRK----ANPSYPHRTSERAIRRMRKNLTEESFLREAFGIWD 277 Query: 299 RGQSASSVIPADKWVQ-SAVGEASLVGGKVFGVSFSRSGDRVALAGAGRTDAG-VHVEVI 356 + V+ A +W + + G A+ V FGV S S R+ A D H E + Sbjct: 278 K-VVHRPVVTAARWRRLESTGPAAGVKPNGFGVDMSHS--RMVSVNAVWLDGDQAHTEEV 334 Query: 357 DGLSGTIVDGVGRLADWLAVRWG----DTEKIMVAGSGAVLLQKALTDRGVPGRGVIVAD 412 +G D W+A W T ++ + S A L L + GV V V Sbjct: 335 --WAGDDTDAA---VAWIADAWKRAGRRTVVVIDSESPAASLVVDLENAGV---NVYVTS 386 Query: 413 TGVYVEACQAFLEGVRSGSVSHPRADSRRDMLDIAVRSAVQKKKGSAWGWGSSFKDGSEV 472 AC A +++G+++H + + D V++ ++ A GWG ++ S Sbjct: 387 AANMAAACGAVENRLKAGTLTH---GGQMSVTDAVVKNGKRRPIRGAGGWGWDRRNPSSQ 443 Query: 473 PLEAV--SLAYFGA---KMAKAKRRERSGRKRV 500 +AV +LA +GA K A +RR +GR+ V Sbjct: 444 IHQAVAMTLALYGATKHKRATRQRRAETGREAV 476 >gi|14207|lcl|protein:vir:8315 Length: 503 # NCBI annotation: gp32 # Family: family:all:523 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817883;genbank:gi:29566316;genbank:GeneID :1259511 Length = 503 Score = 101 bits (252), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 114/414 (27%), Positives = 183/414 (44%), Gaps = 52/414 (12%) Query: 34 GLTPDPWQQQVLDDWLAVGGNGRLASGV--CGVFVPRQNGKNAILEIVELFKATIQ--GR 89 GL+ D WQ + LA +G LA V GV +PRQ GK L V LF ++ G Sbjct: 70 GLSLDRWQDGIAGLLLAYRPDGVLAHTVGGFGVSIPRQCGKTHTLTAV-LFGLCVEYPGV 128 Query: 90 RILHTAHELKSARKAFMRLRSFFENERQFPDLYRMVKSIRATNGQEAIVLHHPDCATFEK 149 + T+H +K+ + F ++++ + ER P ++ + +G EA+ + Sbjct: 129 LAIWTSHHVKTNTETFQAVQAYAKRERVAP----FIRKVTLGSGDEAVEFAN-------- 176 Query: 150 KCGCPGWGS-VEFVARSRGSARGFT-VDDLVCDEAQELSDEQLEALLPTVSAAPSGDPQQ 207 GS + F AR RG RG VD L+ DEAQ L+ ++ +L T++ + G Sbjct: 177 -------GSRILFGARERGFGRGIPGVDVLMSDEAQILTQRAMQDMLATLNTSRLG--LH 227 Query: 208 IFLGTPPGPLADGSVVLRLRGQALSG-GKRFAWTEFSIPDESDPDDVSRQWRKLAGDTNP 266 I++GTPP P + + +R +A +G W E D D DD+ QW + NP Sbjct: 228 IYVGTPPKPTDNSEMFSVMRREAETGEATDIVWIECGAEDTGDLDDIE-QWM----NANP 282 Query: 267 ALGRRLNFGTVSDEHESMSAAGFARERLGWWDRGQSASSVIPADKWVQSAVGEASLVGGK 326 + R ++ + GF RE LG WD +S+S + A W S +G+ + Sbjct: 283 SCPHRTPVVSIQRLRRRLDDDGFRREALGIWDASESSSFDLAA--W--SDLGDRGVDAPT 338 Query: 327 VFGVSFSRSGDR----VALAGAGRTDAG--VHVEVIDGLSGTIVDGVGRLADWLAVRWGD 380 + S DR + +AG TD G V + ++ + T V+ V +L V D Sbjct: 339 RAALVLDMSPDRRHCWIGVAGDVDTDDGEKVLLMAMETTAATAVEKVRQL-----VNERD 393 Query: 381 TEKIMVAGSGAVLLQKALTDRGVPGRGVIVADTGVYVEACQAFLEGVRSGSVSH 434 + + A L+ AL + + + + AD Q EG+++ SV H Sbjct: 394 IVDVAITNGAARALEPALVEAAIEYQRLSQADVAAAYSTLQ---EGIKNKSVCH 444 >gi|14004|lcl|protein:vir:8183 Length: 506 # NCBI annotation: gp3 # Family: family:all:523 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817976;genbank:gi:29566410;genbank:GeneID :2700964 Length = 506 Score = 78.6 bits (192), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 128/491 (26%), Positives = 199/491 (40%), Gaps = 56/491 (11%) Query: 34 GLTPDPWQQQVLDDWLAVGGNGRLASGVCGVF--VPRQNGKN-AILEIVELFKATIQGRR 90 G+ D WQ+ + L + +G LA V GV + RQ GK ++ + + G Sbjct: 44 GIVLDRWQEDIWYAALGLREDGTLACDVMGVTLSIARQAGKTWGVMVGLVAICLSRPGTL 103 Query: 91 ILHTAHELKSARKAFMRLRSFFENERQFPDLYRMVKSIRATNGQEAIVLHHPDCATFEKK 150 + ++H +++ + ++ E P + RA HP AT + + Sbjct: 104 AVWSSHHDRTSSQTLDKIAGIVERPEIRPKM-------RA---------QHPVVATDDNR 147 Query: 151 CGCPGWGS-VEFVARSRGSARGFT-VDDLVCDEAQELSDEQLEALLPTVSAAPSGDPQQI 208 GS + F ARS G RGF+ VD V DE Q L D L +L ++ + G Sbjct: 148 GVHFANGSKILFGARSSGFGRGFSEVDIQVYDECQNLKDSALTDMLAAMNVSEIG--LAF 205 Query: 209 FLGTPPGP----LADGSVVLRLRGQALSGGKRFA----WTEFSI--PDESDPD-DVSRQW 257 F+GTPP P L R R +AL+ K+ + EF+ P+ D D R W Sbjct: 206 FMGTPPRPQEVALGVHEAFKRRRDKALAPVKKRPFKGIYVEFAPESPETVVADIDAPRFW 265 Query: 258 RKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRGQSASSVIPADKWVQSAV 317 KLA + NP+ G R+ + E+MS RE G WD+ +V+P D+W A Sbjct: 266 EKLA-EVNPSFGFRVGKSAIERLVENMSPEDVRREVFGIWDKTNETLAVVPRDQWNNLAA 324 Query: 318 G-EASLVGGKVFGVSFSRSGDRVALAGAGRTDAGVHVEVIDGLSGTIVDGVGRLADWLAV 376 + +G++ +RSG + R HVE+ G + ++++ Sbjct: 325 DVDVDPEDVAAYGINATRSG-WYWITACWREGESAHVEIALGTQSEV-----EAMNFMSR 378 Query: 377 RWGDTEKIMVAGSGAVLLQKALTDRGVPGRGVIVADTGVYVEACQAF-LEGVRSGSVSHP 435 I +GA KAL ++ A T A A L V G +SH Sbjct: 379 HATKRTPIKHDSTGAA---KALGEKLKKLYFNASAYTQNEAGAGNALWLSLVEQGRLSH- 434 Query: 436 RADSRRDMLDIAVRSAVQKKKGSAWGW-----GSSFKDGSEVPLEAVSLAYFGAKMAKAK 490 D ++D L++AVR + ++ + S GW SF G + + A A+ Sbjct: 435 --DGQQD-LELAVRGSRRQDRTSG-GWMLVPRSESFDIGPAISMSGAVYAAMTARRPSGN 490 Query: 491 RRERSGRKRVS 501 R RKR S Sbjct: 491 SRATHRRKRHS 501 >gi|19975|lcl|protein:vir:108214 Length: 544 # NCBI annotation: gp5 # Family: family:all:523 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552334;genbank:gi:160700654;genbank:Ge neID:5758934 Length = 544 Score = 56.2 bits (134), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 91/407 (22%), Positives = 156/407 (38%), Gaps = 33/407 (8%) Query: 40 WQQQVLDDWLAVGGNGRLASGVCGVFVPRQNGKNAILEIVELFKATIQGRRILHTAHELK 99 WQ +A +G A + +PRQNGK ++ + ++ G +I++TA + Sbjct: 78 WQWDAARKIMATRPDGLWAHPDVCLIIPRQNGKTQLIALRIIYGLFFLGEKIVYTAQRWQ 137 Query: 100 SARKAFMRLRSFFENERQFPDLYRMVKSIRATNGQEAIVLHHPDCATFEKKCGCPGWGSV 159 + + + R+ E ++ P L R +K + + H + T GS+ Sbjct: 138 TVKDVYDRI---VEIIKRRPSLLRRLKPMPGVPDGYSEAGQHGEIYTTNG-------GSL 187 Query: 160 EFVARSRGSARGFTVDDL-VCDEAQELSDEQLEALLPTVSAAPSGDPQQIFLGTP--PGP 216 + R++ RG T DL + DEA ++ D + L AA +PQ I++ T Sbjct: 188 DMGPRTKAVGRGQTKIDLAIFDEAYDIKDVLVGGLTGAQKAA--TNPQTIYISTAAVASE 245 Query: 217 LADGSVVLRLRGQALSGGKRFAWTEFSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFGT 276 D V+ +R E+ P DD WR P+ G + Sbjct: 246 HPDCGVLAGMRRNGQRKEPDLYAAEWCAPPGMARDD-PEAWRLAC----PSFGITVRERD 300 Query: 277 VSDEHESMSA-----AGFARERLGW--WDRG-QSASSVIPADKWVQSAVGEASLVGGKVF 328 ++ E+ A A + + LGW W ++ +I D W V + +LVG Sbjct: 301 LAREYRMARANARLLAIYDADYLGWGEWPPDPENTEPIIDPDWWEALTVLQPALVGDICI 360 Query: 329 GVSFSRSGDRVALAGAGRT-DAGVHVEVIDGLSGTIVDGVGRLADWLAVRWGDTEKIMVA 387 + + +A RT D VHVEV + I L + L W I+ Sbjct: 361 AIERTLDTRYWCIAAGQRTIDGRVHVEVGYWRAANIGVVAAALLE-LVELWNPAAIIVDD 419 Query: 388 GSGAVLLQKALTDRGVPGRGVIVADTGVYVEACQAFLEGVRSGSVSH 434 S A + + ++G+ + A T Q F++ V + V+H Sbjct: 420 RSKAKPIVGVMFNQGI---EIETASTPKLAMYTQGFIDAVNAADVTH 463 >gi|13269|lcl|protein:vir:81252 Length: 542 # NCBI annotation: gp2, terminase # Family: family:all:523 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456732;genbank:gi:157168375;interpro:I PR005021;uniprot:Q9MBK3;genbank:GeneID:5580375 Length = 542 Score = 42.4 bits (98), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 62/227 (27%), Positives = 94/227 (41%), Gaps = 37/227 (16%) Query: 64 VFVPRQNGKNAILEIVELFKATIQG-RRILHTAHELKSARKAFMRLRSFFENERQFPDLY 122 + V RQNGK ++ I+ L+K I G I+ A +L A L + F + PDL Sbjct: 75 LLVGRQNGKTLVMVILGLWKLFIDGCSEIVTAAQDLSVAEAT---LSNAFMLAKANPDLN 131 Query: 123 R----------MVKSIRATNGQEAIVL-HHPDCATFEKKCGCPGWGSVEFVARSRGSARG 171 + MV +R NG I L + P + P W VA +RG R Sbjct: 132 QWLPWRMERGEMVPFMRTANGSNQIELAYAPVPEALDVFGAMPKWF---VVATNRGGGRS 188 Query: 172 FTVDDLVCDEAQELSDEQ-LEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLR--- 227 + + + DE +E +D Q A+ P V+ P Q++ + G + SVVLR + Sbjct: 189 HSAELAMLDELREHTDFQSWGAITPAVAERPRN---QVYGFSNAG--DEKSVVLRKQRNI 243 Query: 228 -----GQALSGGKRFAWTEFSIPDESDPDDVSRQWRKLAGDTNPALG 269 ++ + A E+S P+E D R NP+LG Sbjct: 244 CLKEISDGITDQSQLAIFEWSAPEECSIFD-----RDGWAAANPSLG 285 >gi|27798|lcl|protein:vir:8408 Length: 214 # NCBI annotation: gp3 # Family: family:all:30874 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818304;genbank:gi:29566740;genbank:GeneID :1260058 Length = 214 Score = 40.8 bits (94), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 61/245 (24%), Positives = 83/245 (33%), Gaps = 51/245 (20%) Query: 59 SGVCGVFVPRQNGKNAILEIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQF 118 S + PRQNGK L A R+L+ + A +AF Sbjct: 17 SSISAFAAPRQNGKT----YAALAYALQYPGRVLYFGRGFREAGEAFA------------ 60 Query: 119 PDLYRMVKSIRATNGQEAIVLHHPDCATFEKKCGCPGWGSVEFVARSRGSARGFTVDDLV 178 + A G I+ + + E G +G V F+ RGS RG D ++ Sbjct: 61 -----AATKLGANRGPGTILKTNKSQLSIETSLGGD-FGRVNFMPYGRGSGRGMGADLVI 114 Query: 179 CDEAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSGGKRFA 238 D+A E+ + L + P V G +A + L Q L G Sbjct: 115 LDDAHEVEADVLAEISPCVF-------------RTSGKIAGFGL---LHDQGLLG----- 153 Query: 239 WTEFSIPDESDPDDVSRQWRKLA-GDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWW 297 F + D R WR NPALG V E E + F RERLG Sbjct: 154 -HLFRVADG------KRVWRGAEIASANPALGHLFTLEQVEREREILPGEIFRRERLGLN 206 Query: 298 DRGQS 302 +G S Sbjct: 207 SKGSS 211 >gi|10890|lcl|protein:vir:78144 Length: 530 # NCBI annotation: probable terminase # Family: family:all:523 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294797;genbank:gi:149882818;genbank:Ge neID:5309172 Length = 530 Score = 39.7 bits (91), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 40/170 (23%), Positives = 65/170 (38%), Gaps = 35/170 (20%) Query: 39 PWQQQVLDDWLAVGGNGRLASGVCGVFVPRQNGKNAILEIVELFKATIQGRR-------- 90 PWQ+ +L L + GRL V V RQNGK I ++ + + R Sbjct: 43 PWQKWLLIHMLELDAFGRLRFRKALVIVGRQNGKTLIAAVLAAYWLYVDAGRWPDQLPEQ 102 Query: 91 ---ILHTAHELKSARKAFMRLRSFFENE--------RQFPDLYRMVKSIRATNGQEAIVL 139 ++ A +L A K + ++R + + + PDL R TNG+ + Sbjct: 103 DFIVVGAAQKLDIAMKPWRQVRRWGAPDDPKVGIARDRVPDLQAFTYPPRTTNGETELRT 162 Query: 140 HHPDCATFEKKCGCPGWGSVEFVARSRGSARGFTVDDLVCDEAQELSDEQ 189 H G ++ R+ ARG + L+ DE +E D + Sbjct: 163 H----------------GGAAYLPRTFEGARGQSAARLILDELREQYDYE 196 >gi|4914|lcl|protein:vir:99572 Length: 459 # NCBI annotation: TerL-like protein # Family: family:all:54 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039811;genbank:gi:126011061;genbank:Ge neID:4818267 Length = 459 Score = 28.1 bits (61), Expect = 0.25, Method: Compositional matrix adjust. Identities = 24/71 (33%), Positives = 33/71 (46%), Gaps = 9/71 (12%) Query: 174 VDDLVCDEAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLAD---GSVVLRLRGQA 230 VD L +EAQ L++EQ + PT+ S QI+L P D + V+ Sbjct: 113 VDILWLEEAQYLTEEQWNVINPTIRREGS----QIWLIWNPDQYTDFIYQNFVVNPPADC 168 Query: 231 LSGGKRFAWTE 241 LS K+ WTE Sbjct: 169 LS--KQINWTE 177 >gi|5142|lcl|protein:vir:105417 Length: 470 # NCBI annotation: gene 2 protein # Family: family:all:54 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958178;genbank:gi:41057280;genbank:GeneID :2716664 Length = 470 Score = 26.6 bits (57), Expect = 0.77, Method: Compositional matrix adjust. Identities = 32/144 (22%), Positives = 60/144 (41%), Gaps = 15/144 (10%) Query: 174 VDDLVCDEAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSG 233 +D +EA+ ++ E + L+PT+ S +I++ P + D + + ++ Sbjct: 112 IDICWVEEAEAVTKESWDILIPTIRKPFS----EIWVSFNPKNILDDT----YQRFVVNP 163 Query: 234 GKRFAWTEFSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFG---TVSD----EHESMSA 286 + D +V R + NP L R + G + SD + E + A Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 287 AGFARERLGWWDRGQSASSVIPAD 310 A A ++LGW +G S+ P+D Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSD 247 >gi|17761|lcl|protein:vir:171 Length: 470 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112076;genbank:gi:13559866;genbank:GeneID :921009 Length = 470 Score = 26.2 bits (56), Expect = 0.86, Method: Compositional matrix adjust. Identities = 32/144 (22%), Positives = 60/144 (41%), Gaps = 15/144 (10%) Query: 174 VDDLVCDEAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSG 233 +D +EA+ ++ E + L+PT+ S +I++ P + D + + ++ Sbjct: 112 IDICWVEEAEAVTKESWDILIPTIRKPFS----EIWVSFNPKNILDDT----YQRFVVNP 163 Query: 234 GKRFAWTEFSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFG---TVSD----EHESMSA 286 + D +V R + NP L R + G + SD + E + A Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 287 AGFARERLGWWDRGQSASSVIPAD 310 A A ++LGW +G S+ P+D Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSD 247 >gi|5726|lcl|protein:vir:95379 Length: 573 # NCBI annotation: phage terminase # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764473;genbank:gi:115334627;genbank:GeneI D:5179266 Length = 573 Score = 25.4 bits (54), Expect = 1.7, Method: Compositional matrix adjust. Identities = 12/39 (30%), Positives = 20/39 (51%), Gaps = 6/39 (15%) Query: 38 DPWQQQVLDDWLAVGGNG----RLASGVCGVFVPRQNGK 72 +PWQ+ ++ + L G R +F+PR+NGK Sbjct: 83 EPWQKFIIYNLLGFYLKGTKIRRFKEAF--IFIPRKNGK 119 >gi|12641|lcl|protein:vir:80117 Length: 572 # NCBI annotation: Phage terminase # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425601;genbank:gi:155042934;genbank:Ge neID:5469543 Length = 572 Score = 25.4 bits (54), Expect = 1.7, Method: Compositional matrix adjust. Identities = 12/39 (30%), Positives = 20/39 (51%), Gaps = 6/39 (15%) Query: 38 DPWQQQVLDDWLAVGGNG----RLASGVCGVFVPRQNGK 72 +PWQ+ ++ + L G R +F+PR+NGK Sbjct: 83 EPWQKFIIYNLLGFYLKGTKIRRFKEAF--IFIPRKNGK 119 >gi|19315|lcl|protein:vir:4508 Length: 577 # NCBI annotation: large terminase subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599034;genbank:gi:19548992;genbank:GeneID :935222 Length = 577 Score = 23.5 bits (49), Expect = 5.8, Method: Compositional matrix adjust. Identities = 18/51 (35%), Positives = 26/51 (50%), Gaps = 4/51 (7%) Query: 35 LTPDPWQQQVLD---DWLAVGGNGRLASGVCGVFVPRQNGKNAILEIVELF 82 +T +PWQ V+ W+ G R V +PR+NGK+AI V L+ Sbjct: 82 ITLEPWQLFVICCAFGWVNKGSRLRRFREV-YTEIPRKNGKSAISAGVALY 131 >gi|16642|lcl|protein:vir:9710 Length: 203 # NCBI annotation: hypothetical protein # Family: family:all:102 # ACLAME annotation(s): phi:0000082 - phage major tail protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795472;genbank:gi:28876219;genbank:GeneID :1257763 Length = 203 Score = 23.5 bits (49), Expect = 6.4, Method: Compositional matrix adjust. Identities = 14/46 (30%), Positives = 19/46 (41%) Query: 409 IVADTGVYVEACQAFLEGVRSGSVSHPRADSRRDMLDIAVRSAVQK 454 + AD G YV E V +D+R+D I V V+K Sbjct: 49 VSADDGPYVVLSGGITETKLEIEVLDLTSDARKDFFGITVEKGVEK 94 >gi|17922|lcl|protein:vir:4452 Length: 577 # NCBI annotation: Terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700375;genbank:gi:23505447;genbank:GeneID :955654 Length = 577 Score = 23.1 bits (48), Expect = 8.3, Method: Compositional matrix adjust. Identities = 18/54 (33%), Positives = 27/54 (50%), Gaps = 4/54 (7%) Query: 35 LTPDPWQQQVLD---DWLAVGGNGRLASGVCGVFVPRQNGKNAILEIVELFKAT 85 +T +PWQ ++ W+ G R V +PR+NGK+AI V L+ T Sbjct: 82 ITLEPWQLFIVCCAFGWVQKGTKLRRFREV-YTEIPRKNGKSAISAGVALYCFT 134 >gi|6927|lcl|protein:vir:106715 Length: 507 # NCBI annotation: gp19 # Family: family:all:144 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944327;genbank:gi:38638626;genbank:GeneID :2657344 Length = 507 Score = 23.1 bits (48), Expect = 9.1, Method: Compositional matrix adjust. Identities = 27/99 (27%), Positives = 44/99 (44%), Gaps = 21/99 (21%) Query: 167 GSARGFTVDDLVCDEAQELSDEQLEA------------LLPTVSAAPSGDPQQIFLGTPP 214 G GF++D + D+A + ++E L A +L T SG I +GT Sbjct: 180 GPLTGFSIDVGIIDDATKNAEEALSAVVQDGLENWYDSVLLTRLQQLSG---VILIGT-- 234 Query: 215 GPLADGSVVLRLRGQALSGGKRFAWTEFSIPDESDPDDV 253 P + ++ R+R + + G F T S P +DPD + Sbjct: 235 -PWSANDLLARVR-RKMEGQPNF--TLLSFPALNDPDQI 269 >gi|11323|lcl|protein:vir:78588 Length: 507 # NCBI annotation: TerL # Family: family:all:144 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294855;genbank:gi:149882918;genbank:Ge neID:5291059 Length = 507 Score = 22.7 bits (47), Expect = 9.3, Method: Compositional matrix adjust. Identities = 27/99 (27%), Positives = 44/99 (44%), Gaps = 21/99 (21%) Query: 167 GSARGFTVDDLVCDEAQELSDEQLEA------------LLPTVSAAPSGDPQQIFLGTPP 214 G GF++D + D+A + ++E L A +L T SG I +GT Sbjct: 180 GPLTGFSIDVGIIDDATKNAEEALSAVVQDGLENWYDSVLLTRLQQLSG---VILIGT-- 234 Query: 215 GPLADGSVVLRLRGQALSGGKRFAWTEFSIPDESDPDDV 253 P + ++ R+R + + G F T S P +DPD + Sbjct: 235 -PWSANDLLARVR-RKMEGQPNF--TLLSFPALNDPDQI 269 >gi|14466|lcl|protein:vir:3519 Length: 469 # NCBI annotation: P18 # Family: family:all:54 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050979;genbank:gi:9633565;genbank:GeneID: 1262312 Length = 469 Score = 22.7 bits (47), Expect = 9.5, Method: Compositional matrix adjust. Identities = 7/19 (36%), Positives = 16/19 (84%) Query: 179 CDEAQELSDEQLEALLPTV 197 +EA+ +S++ L++L+PT+ Sbjct: 106 VEEAETVSEKSLDSLIPTI 124 Database: capsid_neck_tail Posted date: Nov 7, 2013 12:16 PM Number of letters in database: 206,069 Number of sequences in database: 514 Lambda K H 0.318 0.134 0.407 Lambda K H 0.267 0.0410 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 226,578 Number of Sequences: 514 Number of extensions: 10501 Number of successful extensions: 82 Number of sequences better than 100.0: 31 Number of HSP's better than 100.0 without gapping: 21 Number of HSP's successfully gapped in prelim test: 10 Number of HSP's that attempted gapping in prelim test: 32 Number of HSP's gapped (non-prelim): 32 length of query: 503 length of database: 206,069 effective HSP length: 75 effective length of query: 428 effective length of database: 167,519 effective search space: 71698132 effective search space used: 71698132 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 39 (19.6 bits)