BLASTP 2.2.18 [Mar-02-2008]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_015453.1_cdsid_YP_004414707.1 [gene=PaP-PAS50_gp2] [protein=Gp2] [protein_id=YP_004414707.1] [location=378..1889]
         (503 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gi|12972|lcl|protein:vir:80678 Length: 503 # NCBI annotation: gp...   957   0.0
gi|13482|lcl|protein:vir:9757 Length: 471 # NCBI annotation: put...   159   7e-41
gi|18494|lcl|protein:vir:1636 Length: 469 # NCBI annotation: put...   156   5e-40
gi|4240|lcl|protein:vir:94734 Length: 469 # NCBI annotation: lar...   154   2e-39
gi|19181|lcl|protein:vir:9572 Length: 459 # NCBI annotation: gp3...   149   7e-38
gi|14590|lcl|protein:vir:8098 Length: 515 # NCBI annotation: gp2...   124   2e-30
gi|14676|lcl|protein:vir:2499 Length: 474 # NCBI annotation: put...   120   4e-29
gi|10061|lcl|protein:vir:99071 Length: 537 # NCBI annotation: gp...   114   3e-27
gi|7504|lcl|protein:vir:99915 Length: 478 # NCBI annotation: gp2...   105   2e-24
gi|14207|lcl|protein:vir:8315 Length: 503 # NCBI annotation: gp3...    97   5e-22
gi|14004|lcl|protein:vir:8183 Length: 506 # NCBI annotation: gp3...    82   1e-17
gi|19975|lcl|protein:vir:108214 Length: 544 # NCBI annotation: g...    54   5e-09
gi|13269|lcl|protein:vir:81252 Length: 542 # NCBI annotation: gp...    42   1e-05
gi|27798|lcl|protein:vir:8408 Length: 214 # NCBI annotation: gp3...    41   4e-05
gi|10890|lcl|protein:vir:78144 Length: 530 # NCBI annotation: pr...    40   8e-05
gi|4914|lcl|protein:vir:99572 Length: 459 # NCBI annotation: Ter...    28   0.27
gi|12641|lcl|protein:vir:80117 Length: 572 # NCBI annotation: Ph...    25   1.5
gi|5726|lcl|protein:vir:95379 Length: 573 # NCBI annotation: pha...    25   1.5
gi|5142|lcl|protein:vir:105417 Length: 470 # NCBI annotation: ge...    25   1.9
gi|17761|lcl|protein:vir:171 Length: 470 # NCBI annotation: term...    25   2.1
gi|19315|lcl|protein:vir:4508 Length: 577 # NCBI annotation: lar...    24   5.2
gi|17922|lcl|protein:vir:4452 Length: 577 # NCBI annotation: Ter...    23   6.9
gi|6927|lcl|protein:vir:106715 Length: 507 # NCBI annotation: gp...    23   8.1
gi|11323|lcl|protein:vir:78588 Length: 507 # NCBI annotation: Te...    23   8.2
gi|14466|lcl|protein:vir:3519 Length: 469 # NCBI annotation: P18...    23   9.6

>gi|12972|lcl|protein:vir:80678 Length: 503 # NCBI annotation: gp2 # Family: family:all:523 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285578;genbank:gi:148727084;genbank:GeneID:5247049
          Length = 503

 Score =  957 bits (2473), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 488/503 (97%), Positives = 494/503 (98%) Query: 1 MSGVVGSQVPRHRVAAAYSVSAGNDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASG 60 MSGVVGSQVPRHRVAAAYSVSAG DAGELGRAYGLTPDPWQQQVLDDWLAVG NGRLASG Sbjct: 1 MSGVVGSQVPRHRVAAAYSVSAGGDAGELGRAYGLTPDPWQQQVLDDWLAVGSNGRLASG 60 Query: 61 VCGVFVPRQNGKNAILEIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQFPD 120 VCGVFVPRQNGKNAILEIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQFPD Sbjct: 61 VCGVFVPRQNGKNAILEIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQFPD 120 Query: 121 LYRMVKSIRATNGQEAIVLHHPDCATFERKCGCPGWGSVEFVARSRGSARGFTVDDLVCD 180 LYRMVKSIRATNGQEAIVLHHPDCATFE+KCGC GWGSVEFVARSRGSARGFTVDDLVCD Sbjct: 121 LYRMVKSIRATNGQEAIVLHHPDCATFEKKCGCSGWGSVEFVARSRGSARGFTVDDLVCD 180 Query: 181 EAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSGGKRFAWT 240 EAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQAL GGKRFAWT Sbjct: 181 EAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALGGGKRFAWT 240 Query: 241 EFSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRG 300 EFSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRG Sbjct: 241 EFSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRG 300 Query: 301 QSAASVVPADKWAQSAVDEASLVGGKVFGVSFSRSGDRVALAGAGRTDAGVHVEVIDGLS 360 QSAASVVPADKWAQSAVDEASLVGGKVFGVSFSRSGDRVALAGAG+TDAGVHVEVIDGLS Sbjct: 301 QSAASVVPADKWAQSAVDEASLVGGKVFGVSFSRSGDRVALAGAGKTDAGVHVEVIDGLS 360 Query: 361 GTIVDGVGRLADWLAVRWGDTDRIMVAGSGAVLLQKALTDRGVPGRGVVVADTGVYVEAC 420 GTIVDGVGRLADWLAVRWGDTDRIMVAGSGAVLLQKALTDRG+PGRGVVVADTGVYVEAC Sbjct: 361 GTIVDGVGRLADWLAVRWGDTDRIMVAGSGAVLLQKALTDRGIPGRGVVVADTGVYVEAC 420 Query: 421 QAFLEGVRSGVVSHPHADSRRDMLDIAVRSAVQKRKGSAWGWGSSFKDGSEVPLEAVSLA 480 QAFLEGVRSGV+SHP ADSRRDMLDIAVRSAVQKRKGSAWGWGSSFKDGSEVPLEAVSLA Sbjct: 421 QAFLEGVRSGVISHPRADSRRDMLDIAVRSAVQKRKGSAWGWGSSFKDGSEVPLEAVSLA 480 Query: 481 YLGAKMAKAKRRERSGRKRVSVV 503 +LGAK + RRERSGRKRVSVV Sbjct: 481 FLGAKRVRRGRRERSGRKRVSVV 503 >gi|13482|lcl|protein:vir:9757 Length: 471 # NCBI annotation: putative terminase # Family: family:all:523 # MgeID: mge:175 # MgeName: 315.3 # 
Cross-refs: genbank:acc:NP_795519;genbank:gi:28876285;genbank:GeneID :1257826 Length = 471 Score = 159 bits (402), Expect = 7e-41, Method: Compositional matrix adjust. Identities = 137/497 (27%), Positives = 239/497 (48%), Gaps = 42/497 (8%) Query: 5 VGSQVPRHRVAAAYSVSAGNDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASGVCGV 64 +G+Q P V ++ S ++A + GL+ PWQ +L +A+ NG G Sbjct: 9 LGNQRPTQSVNLHFAKSLAHEAINYYKKTGLSCYPWQVNMLIPIMAIDENGLWVHQKYGY 68 Query: 65 FVPRQNGKNAILEIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQFPDLYRM 124 +PR+NGK ++ IVEL+ A +G +ILHTAH + ++ +F +++ + E + D Sbjct: 69 AIPRRNGKTEVVYIVELW-ALHKGLKILHTAHRISTSHASFEKVKKYLEMS-GYVDGEDF 126 Query: 125 VKSIRATNGQEAIVLHHPDCATFERKCGCPGWGSVEFVARSRGSARGFTVDDLVCDEAQE 184 + + GQE I ++F R+ G D L+ DEAQE Sbjct: 127 ISN--KAKGQERIEFKASGAV-------------IQFRTRTSNGGLGEGFDLLIIDEAQE 171 Query: 185 LSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSGGKRFA-WTEFS 243 + EQ AL TV+ S +P I GTPP ++ G+V R L G KR++ W E+S Sbjct: 172 YTSEQESALKYTVT--DSDNPMTIMCGTPPTMVSTGTVFEAYRKDCLKGNKRYSGWAEWS 229 Query: 244 IPDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRGQSA 303 +P+ +DVS + +NP++G LN + E +RLG+W + Sbjct: 230 VPEMVKINDVSSWYI-----SNPSMGFHLNERKIEAEL-GEDEIDHNIQRLGYWP-SFNQ 282 Query: 304 ASVVPADKWAQSAVDEASLVGGKVF-GVSFSRSGDRVALAGAGRT-DAGVHVEVIDGLSG 361 SV+ +WA+ V++ + K+F G+ F + G+ V+L+ A RT + V VE ID LS Sbjct: 283 KSVISEKEWAKLKVEQVPELKSKLFVGIKFGQDGNNVSLSIAARTSENKVFVETIDCLS- 341 Query: 362 TIVDGVGRLADWLAVRWGDTDRIMVAG-SGAVLLQKALTDRGVPGRGVVVADTGVYVEAC 420 + +G + ++L + D ++++ G SG LL + + ++G+ + + + A Sbjct: 342 -VRNGTQWIINFL--KSADIAKVVIDGASGQELLAQEMKEQGL--KKPELPKVAEIITAN 396 Query: 421 QAFLEGVRSGVVSHPHADSRRDMLDIAVRSAVQKRKGSAWGWG-SSFKDGSEVPL-EAVS 478 + +G+ + H S + L V + +++ GS G+G S D ++ L ++ Sbjct: 397 MMWEQGIMQETICH----SDQPSLTAVVTNCEKRQIGSNGGFGYKSLYDDRDISLMDSAL 452 Query: 479 LAYLGAKMAKAKRRERS 495 LA+ K KR++R+ Sbjct: 453 LAHWICYTTKPKRKQRT 469 >gi|18494|lcl|protein:vir:1636 Length: 469 # NCBI annotation: putative terminase # Family: family:all:523 # MgeID: mge:33 # 
MgeName: r1t # Cross-refs: genbank:acc:NP_695057;genbank:gi:23455748;genbank:GeneID :955481 Length = 469 Score = 156 bits (395), Expect = 5e-40, Method: Compositional matrix adjust. Identities = 131/495 (26%), Positives = 235/495 (47%), Gaps = 42/495 (8%) Query: 6 GSQVPRHRVAAAYSVSAGNDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASGVCGVF 65 G+Q P V ++ + +A E+ PWQ+ +L + +A+ +G G Sbjct: 8 GNQYPTQSVILPFTETKYQEAIEIYEKSKHECYPWQKNLLKEVMAIDEDGLWTHQKFGYS 67 Query: 66 VPRQNGKNAILEIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQFPDLYRMV 125 +PR+NGK I+ I+EL+ + +QG ILHTAH + ++ ++ +L+ + E+ Sbjct: 68 IPRRNGKTEIVYILELW-SLVQGLSILHTAHRISTSHSSYEKLKKYLEDSGYVEG--EDF 124 Query: 126 KSIRATNGQEAIVLHHPDCATFERKCGCPGWGSVEFVARSRGSARGFTVDDLVCDEAQEL 185 KSI+A GQE + L G ++F R+ G D LV DEAQE Sbjct: 125 KSIKA-KGQERLEL-------------IESGGVIQFRTRTSSGGLGEGFDILVIDEAQEY 170 Query: 186 SDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSGGKRFA-WTEFSI 244 + EQ AL TV+ S +P I GTPP P++ G+V R ++G +++ W E+S+ Sbjct: 171 TTEQESALKYTVT--DSDNPMTIMCGTPPTPVSSGTVFTNYRDNTIAGKAKYSGWAEWSV 228 Query: 245 PDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRGQSAA 304 D D DV + ++NP++G LN + E +RLG+W + + Sbjct: 229 EDVKDIHDVEAWY-----NSNPSMGYHLNERKIEAEL-GEDKLDHNVQRLGYWPK-YNQK 281 Query: 305 SVVPADKWAQSAVDEASLVGGKVF-GVSFSRSGDRVALAGAGRTDAG-VHVEVIDGLSGT 362 SV+ +W V+ ++ GK+F G+ + G VA++ A +T +G V VE ID S Sbjct: 282 SVISEQEWNALKVNRLPVIKGKLFVGIKYGNDGANVAMSIAVKTLSGKVFVETIDCQS-- 339 Query: 363 IVDGVGRLADWLAVRWGDTDRIMVAG-SGAVLLQKALTDRGVPGRGVVVADTGVYVEACQ 421 I +G + ++L + D +++++ G SG +L + D + + ++ + A Sbjct: 340 IRNGNQWIINFL--KKADVEKVVIDGQSGQSILTSEMKDFKL--KEPILPTVKEIINANS 395 Query: 422 AFLEGVRSGVVSHPHADSRRDMLDIAVRSAVQKRKGSA--WGWGSSFKDGSEVPLEAVSL 479 + +G+ H S + L V + ++ G++ +G+ S F D +++ L Sbjct: 396 LWEQGIFQKNFCH----SGQPSLSTVVTNCDKRNIGTSGGFGYKSQFDDMDISLMDSALL 451 Query: 480 AYLGAKMAKAKRRER 494 A+ K K++++ Sbjct: 452 AHWACSNNKPKKKQQ 466 >gi|4240|lcl|protein:vir:94734 Length: 469 # NCBI annotation: large subunit terminase # Family: family:all:523 # MgeID: 
mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996704;genbank:gi:45597419;genbank:GeneID :2767958 Length = 469 Score = 154 bits (390), Expect = 2e-39, Method: Compositional matrix adjust. Identities = 132/495 (26%), Positives = 233/495 (47%), Gaps = 42/495 (8%) Query: 6 GSQVPRHRVAAAYSVSAGNDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASGVCGVF 65 G+Q P V ++ + +A E+ PWQ+ +L + +A+ +G G Sbjct: 8 GNQYPTQSVILPFTETKYQEAIEIYEKSKHECYPWQKNLLKEIMAIDEDGLWTHQKFGYS 67 Query: 66 VPRQNGKNAILEIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQFPDLYRMV 125 +PR+NGK I+ I+EL+ A QG ILHTAH + ++ ++ +L+ + E+ Sbjct: 68 IPRRNGKTEIVYILELW-ALEQGLSILHTAHRISTSHSSYEKLKKYLEDSGYVEG--EDF 124 Query: 126 KSIRATNGQEAIVLHHPDCATFERKCGCPGWGSVEFVARSRGSARGFTVDDLVCDEAQEL 185 KSI+A GQE + L G ++F R+ G D L DEAQE Sbjct: 125 KSIKA-KGQERLEL-------------IESGGVIQFRTRTSSGGLGEGFDILFIDEAQEY 170 Query: 186 SDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSGGKRFA-WTEFSI 244 + EQ AL TV+ S +P I GTPP P++ G+V R L+G +++ W E+S+ Sbjct: 171 TTEQESALKYTVT--DSDNPMTIMCGTPPTPVSSGTVFTNYRDNTLAGKAKYSGWAEWSV 228 Query: 245 PDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRGQSAA 304 D D DV + ++NP++G LN + E +RLG+W + + Sbjct: 229 EDVKDIHDVEAWY-----NSNPSMGYHLNERKIEAEL-GEDKLDHNVQRLGYWPK-YNQK 281 Query: 305 SVVPADKWAQSAVDEASLVGGKVF-GVSFSRSGDRVALAGAGRTDAG-VHVEVIDGLSGT 362 SV+ +W V+ ++ GK+F G+ + G VA++ A +T +G V VE ID S Sbjct: 282 SVISEQEWNALKVNRLPVIKGKLFVGIKYGNDGANVAMSIAVKTLSGKVFVETIDCQS-- 339 Query: 363 IVDGVGRLADWLAVRWGDTDRIMVAG-SGAVLLQKALTDRGVPGRGVVVADTGVYVEACQ 421 I +G + ++L + D +++++ G SG +L + D + + ++ + A Sbjct: 340 IRNGNQWIINFL--KKADVEKVVIDGQSGQSILTSEMKDFKL--KEPILPTVKEIINANS 395 Query: 422 AFLEGVRSGVVSHPHADSRRDMLDIAVRSAVQKRKGSA--WGWGSSFKDGSEVPLEAVSL 479 + +G+ H S + L V + ++ G++ +G+ S F D +++ L Sbjct: 396 LWEQGIFQKNFCH----SGQPSLSTVVTNCDKRNIGTSGGFGYKSQFDDMDISLMDSALL 451 Query: 480 AYLGAKMAKAKRRER 494 A+ K K++++ Sbjct: 452 AHWACSNNKLKKKQQ 466 >gi|19181|lcl|protein:vir:9572 Length: 459 # NCBI annotation: gp38 # Family: family:all:523 # MgeID: mge:171 # 
MgeName: SM1 # Cross-refs: genbank:acc:NP_862877;genbank:gi:32469469;genbank:GeneID :1461314 Length = 459 Score = 149 bits (376), Expect = 7e-38, Method: Compositional matrix adjust. Identities = 139/487 (28%), Positives = 227/487 (46%), Gaps = 48/487 (9%) Query: 5 VGSQVPRHRVAAAY--SVSAGNDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASGVC 62 +G+Q P V Y S +A EL GL+ WQ+ +L +AV NG Sbjct: 6 LGNQNPTQSVILKYVKKNSKAKEAIELYERTGLSCYAWQKNLLLPMMAVDKNGLWVHQKF 65 Query: 63 GVFVPRQNGKNAILEIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQFPDLY 122 G +PR+NGK+ +L I E++ +G ILHTAH + ++ +F +++ + E + + D Sbjct: 66 GYSIPRRNGKSELLYIGEIW-GLHEGLNILHTAHRISTSHASFEKVKRYLE-KMGYVDG- 122 Query: 123 RMVKSIRATNGQEAIVLHHPDCATFERKCGCPGWGSVEFVARSRGSARGFTVDDLVCDEA 182 SIRA GQE I L+ G ++F R+ G D L+ DEA Sbjct: 123 EDFNSIRA-KGQERIELYSTG-------------GVIQFRTRTSNGGLGEGFDMLIIDEA 168 Query: 183 QELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSG-GKRFAWTE 241 QE + EQ AL TV+ S +P I GTPP P++ G+V + R L G GK W E Sbjct: 169 QEYTTEQESALKYTVT--DSENPITIMCGTPPTPVSSGTVFTKYRETCLFGKGKYSGWAE 226 Query: 242 FSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRGQ 301 +S+ DE + DDV + ++NP++G LN + E +RLG+W Sbjct: 227 WSVSDEKEIDDVEAWY-----NSNPSMGYHLNERKIEAEL-GEDKLDHNIQRLGFWPT-Y 279 Query: 302 SAASVVPADKWAQSAVDEASLVGGKV-FGVSFSRSGDRVALAGAGRTDAG-VHVEVIDGL 359 + S + +W + +D+ + GK+ G+ + + G VA++ A RT+ G VE +D Sbjct: 280 NQKSAISETEWNELKMDDIPELSGKLSVGIKYGQDGTNVAMSIAARTNDGRFFVETVDCQ 339 Query: 360 SGTIVDGVGRLADWLA--VRWGDTDRIMVAG-SGAVLLQKALTDRGVPGRGVVVADTGVY 416 S V +W+ +R D +I++ G SG +L + L D + + V++ Sbjct: 340 S------VRNGNEWMVAFLRQADVAQIVIDGASGQKILDEELKDYRI--KNVILPTVKEI 391 Query: 417 VEACQAFLEGVRSGVVSHPHADSRRDMLDIAVRSAVQKRKGSAWGWG--SSFKDGSEVPL 474 + A + +G+ + H S L + ++ GS G+G S F D + + Sbjct: 392 IVANALWEQGIYQKTICHAGQPS----LSKVATNCDKRNIGSNGGFGYRSHFDDMNISLM 447 Query: 475 EAVSLAY 481 ++ LA+ Sbjct: 448 DSALLAH 454 >gi|14590|lcl|protein:vir:8098 Length: 515 # NCBI annotation: gp2 # Family: family:all:523 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: 
genbank:acc:NP_817679;genbank:gi:29566110;genbank:GeneID :1259304 Length = 515 Score = 124 bits (312), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 148/518 (28%), Positives = 220/518 (42%), Gaps = 75/518 (14%) Query: 10 PRHRVAAAYSVSAGNDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASGVCGVFVPRQ 69 P + AYS + G + +L G PDP Q+ L+ + NGR + V RQ Sbjct: 7 PAYANFPAYSETYGPEVADLCDLAGFPPDPEQELGLNALFGIDSNGRSTAFEFVVIAARQ 66 Query: 70 NGKNAILEIVELFKATIQGRRILH-TAHELKSARKAFMRLRSFFENERQFPDLYRMVKSI 128 N K + L I +R++ +AHE+ R+AF L + E+ P L + ++ Sbjct: 67 NLKTGFEKQAALGWLFITEQRLISWSAHEMTPTREAFNDLVNLIEST---PSLAKRLED- 122 Query: 129 RATNG------QEAIVLHHPDCATFERKCGCPGWGSVEFVARSRGSARGFTVDDLVCDEA 182 TNG EAI L + CP V F AR+ RG T + ++ DE Sbjct: 123 GPTNGVFRGAGTEAIAL--------KPSKACPDGQRVIFKARTNSGGRGLTGNKVILDEG 174 Query: 183 QELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRL----RGQALSGGKRFA 238 L + +L+PT+SA P DPQ + +G+ AD V+ +L R + L+ KR Sbjct: 175 FALRHAHMGSLMPTLSAVP--DPQ-LLIGSS-ACHADSEVLHKLVKRGRSEELAPRKRLG 230 Query: 239 WTEFSIPDESDPDDVSRQW----------RKLAGDTNPALGRRLNFGTVSDEHESMSAAG 288 + EF P+ + DD + R+ NP GRR+ + + E +S+ A Sbjct: 231 YLEFCAPENACEDDECPHYVGYPGCAMDKREYIIMANPQAGRRITWEYLEGERDSLDPAE 290 Query: 289 FARERLGWWDR-GQSAASVVPADKWAQSAVDEASLVGGKV-FGVSFSRSGDRVALAGAG- 345 F RERLGW D+ A ++ D WA + +D S G ++ FGV ++ A+ AG Sbjct: 291 FGRERLGWHDKPAIEDAPLISKDGWA-TKMDPKSQPGPRLAFGVYVNKLQTAAAIGVAGY 349 Query: 346 RTDAGVHVEVIDGLSG---TIVDGVG-------RLAD-WLAVRWGDTDRIMVAGSGAVL- 393 R D +HV ++ G + G+ L D W WG DR + +GA+L Sbjct: 350 REDGKIHVGIVPAARGGNVATLPGINWIPARMKELKDSWRPCGWGLDDR---SAAGALLP 406 Query: 394 -LQK---ALTDRGVPGRGVVVADTGVYVEACQAFLEGVRSGVVSH----PHADSRRDMLD 445 L+K + D G G+ A AC F +S + H P ADS Sbjct: 407 DLKKLGFEVGDEVTNGAGINNATPADVARACGTFYAKYQSDDLRHQGSKPLADS------ 460 Query: 446 IAVRSAVQKRKGSAWGWG-SSFKDGSE--VPLEAVSLA 480 V + + AW W KD V L AV+LA Sbjct: 461 --VTAGKMRDLADAWAWDRKDAKDAKSDIVQLMAVTLA 496 >gi|14676|lcl|protein:vir:2499 Length: 474 # NCBI annotation: putative terminase gp4 # Family: 
family:all:523 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569740;genbank:gi:18496890;genbank:GeneID :932339 Length = 474 Score = 120 bits (301), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 134/484 (27%), Positives = 207/484 (42%), Gaps = 45/484 (9%) Query: 7 SQVPRHRVAAAYSVS-AGNDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASGVCGVF 65 S+V RH +A VS A A GL D WQ + A +G A+ + + Sbjct: 9 SEVARHVIAPQGIVSTAWPSVRATCGAMGLGFDLWQDDLGKLICAKRDDGLYAADMFAMS 68 Query: 66 VPRQNGKNAIL-EIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQFPDLYRM 124 +PRQ GK +L +V ++ TAH ++A + F ++ + ++ P + Sbjct: 69 IPRQTGKTYLLGALVFALCIKTPNTTVIWTAHRTRTAAETFRSMQGLAKRDKIAPHIL-- 126 Query: 125 VKSIRATNGQEAIVLHHPDCATFERKCGCPGWGS-VEFVARSRGSARGFT-VDDLVCDEA 182 ++ NG+EA++ + GS + F AR RG RGF VD L+ DEA Sbjct: 127 --NVHTGNGKEAVLFKN---------------GSRILFGARERGFGRGFAGVDVLIFDEA 169 Query: 183 QELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSGG-KRFAWTE 241 Q L++ ++ ++P +AAP +P + GTPP P G V +R AL+G + E Sbjct: 170 QILTENAMDDMVPATNAAP--NPLILLAGTPPKPTDPGEVFTVMRLDALAGDVDDVGYVE 227 Query: 242 FSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRGQ 301 S +++DPDD S QWRK+ NP+ R + + +++ F RE +G W + Sbjct: 228 ISADEDADPDDRS-QWRKM----NPSYPHRTSARAILRMRKALGDESFKREAMGIWPKVS 282 Query: 302 SAASVVPADKWAQSAVDEASLVGGKVFGVSFSRSGDRVALAGAG--RTDAGVHVEVIDGL 359 VV + +W D G ++ S GA D G HVE + Sbjct: 283 VHQPVVKSGRW-HDLFDLGPEDGEAPNALAVDMSHGLAISVGACWLMDDDGRHVEEV--W 339 Query: 360 SGTIVDGVGRLADWLAVRWGDTDRIMV-AGSGAVLLQKALTDRGVPGRGVVVADTGVYVE 418 +GT DW+A R G +++ + S A L L R V + AD + Sbjct: 340 AGT---DTAAAVDWIAERAGRRIPVLIDSMSPAAALAPELKARKVKVKLTGAAD---MAK 393 Query: 419 ACQAFLEGVRSGVVSHPHADSRRDMLDIAVRSAVQKRKGSAWGWGSSFKDGSEVPLEAVS 478 C F GV + ++H + D L A + + R WGW PL AV+ Sbjct: 394 GCGLFENGVNADTLTHGDQPALNDALAGARKRPI--RDAGGWGWDRRDPTCVIHPLVAVT 451 Query: 479 LAYL 482 LA L Sbjct: 452 LALL 455 >gi|10061|lcl|protein:vir:99071 Length: 537 # NCBI annotation: gp26 # Family: family:all:523 # MgeID: mge:1671 # MgeName: Wildcat # 
Cross-refs: genbank:acc:YP_655891;genbank:gi:109521463;genbank:GeneI D:4158036 Length = 537 Score = 114 bits (285), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 117/453 (25%), Positives = 191/453 (42%), Gaps = 62/453 (13%) Query: 12 HRVAAAYSVSAGNDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASGVCGVFVPRQNG 71 H Y+ S G+ A G DPWQ+ VL + L + G A+ + VPRQNG Sbjct: 36 HCFIPPYTTSLGDKAMWFLHQVGFELDPWQEFVLRNMLNLDAQGHWAASEALLLVPRQNG 95 Query: 72 KNAILEIVELFKATIQGRRI-LHTAHELKSARKAFMRLRSFFENERQFPDLYRMVKSIRA 130 K AI+E EL + ++ +HTA +AR++F RL++ EN + R R+ Sbjct: 96 KTAIIEARELVGLYVVCDKLCIHTAVLFNAARESFYRLKARIENNETLNKITRF----RS 151 Query: 131 TNGQEAIVL------HHPDCATFERKCGCPGWGSVEFVARSRGSARGFTVDDLVCDEAQE 184 N +I + HP+ G V ++AR ARGF+ D +V DEA Sbjct: 152 GNDNMSIEVKPKKESRHPNAG-----------GRVIYMARGTAVARGFSADVIVLDEAFA 200 Query: 185 LSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSGGKRFAWTEFSI 244 L EA + + A S + L D + + +L + + + E+ Sbjct: 201 LD----EASIAAIDYATSARANPFIIYASSTGLEDSTELEKLHDRGMRQDPDMLFMEWCA 256 Query: 245 PDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRGQSAA 304 D DD +R +NPALG R++ + E S F RERLG W+ + Sbjct: 257 TTR-DLDDEENYYR-----SNPALGYRISIERIRKERNRHSDKTFGRERLGLWN-DNAFN 309 Query: 305 SVVPADKWAQSAV------DEASLVGGK----------VFGVSFSRSGDRVALAGAGRT- 347 +V+PAD+W + DE + G + V + + ++ AG+ Sbjct: 310 AVIPADQWKSLCLCHGTVHDEHRVEGAEAGWSRIVTPTVVAIDSAPDSSLTTISWAGKNQ 369 Query: 348 DAGVHVEVIDGLSGTIVDGVGRLADWLAVRWGDTDRIMVAGSGAVLLQKALT-DRGVP-- 404 D V +E++ S GVG +++A+ + D R+ AV++Q T + +P Sbjct: 370 DGQVQIEILQEAS-----GVGWAVEFVAMLY-DPQRVETPPPLAVVVQAGATAGQLIPEL 423 Query: 405 -GRGVVVADTGVY--VEACQAFLEGVRSGVVSH 434 G+ V G+ +AC+ F + ++H Sbjct: 424 EALGIEVIPFGLRDACDACKYFYDRANDRRLAH 456 >gi|7504|lcl|protein:vir:99915 Length: 478 # NCBI annotation: gp2 # Family: family:all:523 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655519;genbank:gi:109392289;genbank:GeneI D:4157084 Length = 478 Score = 105 bits (261), Expect = 2e-24, Method: Compositional matrix adjust. 
Identities = 142/513 (27%), Positives = 220/513 (42%), Gaps = 69/513 (13%) Query: 3 GVVGSQVPRHRVAAAYSVSAGNDAGELGRAYGLTPDPWQQQVLDDWLAVGGNGRLASGVC 62 G+VG+ PR R + + G T D WQ + LA+ G G A+ Sbjct: 18 GIVGTAWPRVR--------------DTCKNIGWTFDRWQDGLGRLILALDGTGLYAADTS 63 Query: 63 GVFVPRQNGKNAILEIVELFKATIQ-GRRILHTAHELKSARKAFMRLRSFFENERQFPDL 121 + +PRQ GK ++ + A + G ++ TAH K+A++ F +++ P + Sbjct: 64 VISIPRQVGKTYLIGCIVFALALLTPGLTVIWTAHRTKTAKETFGSMKAMCAT----PLV 119 Query: 122 YRMVKSIRATNGQEAIVLHHPDCATFERKCGCPGWGS-VEFVARSRGSARGFT-VDDLVC 179 V+++ G E I LH+ GS + F AR G GF V LV Sbjct: 120 NAHVRNVSDARGDEGIYLHN---------------GSRILFGARENGFGLGFAGVGILVL 164 Query: 180 DEAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSGGKR-FA 238 DEAQ L+D+ ++ L+PT++ +P + GTPP P G V LR AL G Sbjct: 165 DEAQRLTDKAMDDLIPTMNTVE--NPLILLTGTPPRPTDSGEVFTMLRQDALDGESEGTL 222 Query: 239 WTEFSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWD 298 + EFS + + PDD + Q RK NP+ R + + ++++ F RE G WD Sbjct: 223 YVEFSADEGAHPDDRA-QLRK----ANPSYPHRTSERAIRRMRKNLTEESFLREAFGIWD 277 Query: 299 RGQSAASVVPADKWAQ-SAVDEASLVGGKVFGVSFSRSGDRVALAGAGRTDAG-VHVEVI 356 + VV A +W + + A+ V FGV S S R+ A D H E + Sbjct: 278 K-VVHRPVVTAARWRRLESTGPAAGVKPNGFGVDMSHS--RMVSVNAVWLDGDQAHTEEV 334 Query: 357 DGLSGTIVDGVGRLADWLAVRWGDTDR----IMVAGSGAVLLQKALTDRGVPGRGVVVAD 412 +G D W+A W R ++ + S A L L + GV V V Sbjct: 335 --WAGDDTDAA---VAWIADAWKRAGRRTVVVIDSESPAASLVVDLENAGV---NVYVTS 386 Query: 413 TGVYVEACQAFLEGVRSGVVSHPHADSRRDMLDIAVRSAVQKRKGSAWGWGSSFKDGSEV 472 AC A +++G ++H + + D V++ ++ A GWG ++ S Sbjct: 387 AANMAAACGAVENRLKAGTLTH---GGQMSVTDAVVKNGKRRPIRGAGGWGWDRRNPSSQ 443 Query: 473 PLEAV--SLAYLGA---KMAKAKRRERSGRKRV 500 +AV +LA GA K A +RR +GR+ V Sbjct: 444 IHQAVAMTLALYGATKHKRATRQRRAETGREAV 476 >gi|14207|lcl|protein:vir:8315 Length: 503 # NCBI annotation: gp32 # Family: family:all:523 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817883;genbank:gi:29566316;genbank:GeneID :1259511 Length = 503 Score = 97.1 bits (240), Expect = 5e-22, Method: 
Compositional matrix adjust. Identities = 112/411 (27%), Positives = 179/411 (43%), Gaps = 46/411 (11%) Query: 34 GLTPDPWQQQVLDDWLAVGGNGRLASGV--CGVFVPRQNGKNAILEIVELFKATIQ--GR 89 GL+ D WQ + LA +G LA V GV +PRQ GK L V LF ++ G Sbjct: 70 GLSLDRWQDGIAGLLLAYRPDGVLAHTVGGFGVSIPRQCGKTHTLTAV-LFGLCVEYPGV 128 Query: 90 RILHTAHELKSARKAFMRLRSFFENERQFPDLYRMVKSIRATNGQEAIVLHHPDCATFER 149 + T+H +K+ + F ++++ + ER P ++ + +G EA+ + Sbjct: 129 LAIWTSHHVKTNTETFQAVQAYAKRERVAP----FIRKVTLGSGDEAVEFAN-------- 176 Query: 150 KCGCPGWGS-VEFVARSRGSARGFT-VDDLVCDEAQELSDEQLEALLPTVSAAPSGDPQQ 207 GS + F AR RG RG VD L+ DEAQ L+ ++ +L T++ + G Sbjct: 177 -------GSRILFGARERGFGRGIPGVDVLMSDEAQILTQRAMQDMLATLNTSRLG--LH 227 Query: 208 IFLGTPPGPLADGSVVLRLRGQALSG-GKRFAWTEFSIPDESDPDDVSRQWRKLAGDTNP 266 I++GTPP P + + +R +A +G W E D D DD+ QW + NP Sbjct: 228 IYVGTPPKPTDNSEMFSVMRREAETGEATDIVWIECGAEDTGDLDDIE-QWM----NANP 282 Query: 267 ALGRRLNFGTVSDEHESMSAAGFARERLGWWDRGQSAASVVPA-DKWAQSAVDEASLVGG 325 + R ++ + GF RE LG WD +S++ + A VD A Sbjct: 283 SCPHRTPVVSIQRLRRRLDDDGFRREALGIWDASESSSFDLAAWSDLGDRGVD-APTRAA 341 Query: 326 KVFGVSFSRSGDRVALAGAGRTDAG--VHVEVIDGLSGTIVDGVGRLADWLAVRWGDTDR 383 V +S R + +AG TD G V + ++ + T V+ V +L V D Sbjct: 342 LVLDMSPDRRHCWIGVAGDVDTDDGEKVLLMAMETTAATAVEKVRQL-----VNERDIVD 396 Query: 384 IMVAGSGAVLLQKALTDRGVPGRGVVVADTGVYVEACQAFLEGVRSGVVSH 434 + + A L+ AL + + + + AD Q EG+++ V H Sbjct: 397 VAITNGAARALEPALVEAAIEYQRLSQADVAAAYSTLQ---EGIKNKSVCH 444 >gi|14004|lcl|protein:vir:8183 Length: 506 # NCBI annotation: gp3 # Family: family:all:523 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817976;genbank:gi:29566410;genbank:GeneID :2700964 Length = 506 Score = 82.4 bits (202), Expect = 1e-17, Method: Compositional matrix adjust. 
Identities = 130/491 (26%), Positives = 201/491 (40%), Gaps = 56/491 (11%) Query: 34 GLTPDPWQQQVLDDWLAVGGNGRLASGVCGVF--VPRQNGKN-AILEIVELFKATIQGRR 90 G+ D WQ+ + L + +G LA V GV + RQ GK ++ + + G Sbjct: 44 GIVLDRWQEDIWYAALGLREDGTLACDVMGVTLSIARQAGKTWGVMVGLVAICLSRPGTL 103 Query: 91 ILHTAHELKSARKAFMRLRSFFENERQFPDLYRMVKSIRATNGQEAIVLHHPDCATFERK 150 + ++H +++ + ++ E P + RA HP AT + + Sbjct: 104 AVWSSHHDRTSSQTLDKIAGIVERPEIRPKM-------RA---------QHPVVATDDNR 147 Query: 151 CGCPGWGS-VEFVARSRGSARGFT-VDDLVCDEAQELSDEQLEALLPTVSAAPSGDPQQI 208 GS + F ARS G RGF+ VD V DE Q L D L +L ++ + G Sbjct: 148 GVHFANGSKILFGARSSGFGRGFSEVDIQVYDECQNLKDSALTDMLAAMNVSEIG--LAF 205 Query: 209 FLGTPPGP----LADGSVVLRLRGQALSGGKRFA----WTEFSI--PDESDPD-DVSRQW 257 F+GTPP P L R R +AL+ K+ + EF+ P+ D D R W Sbjct: 206 FMGTPPRPQEVALGVHEAFKRRRDKALAPVKKRPFKGIYVEFAPESPETVVADIDAPRFW 265 Query: 258 RKLAGDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWWDRGQSAASVVPADKWAQSAV 317 KLA + NP+ G R+ + E+MS RE G WD+ +VVP D+W A Sbjct: 266 EKLA-EVNPSFGFRVGKSAIERLVENMSPEDVRREVFGIWDKTNETLAVVPRDQWNNLAA 324 Query: 318 D-EASLVGGKVFGVSFSRSGDRVALAGAGRTDAGVHVEVIDGLSGTIVDGVGRLADWLAV 376 D + +G++ +RSG + R HVE+ G + ++++ Sbjct: 325 DVDVDPEDVAAYGINATRSG-WYWITACWREGESAHVEIALGTQSEV-----EAMNFMSR 378 Query: 377 RWGDTDRIMVAGSGAVLLQKALTDRGVPGRGVVVADTGVYVEACQAF-LEGVRSGVVSHP 435 I +GA KAL ++ A T A A L V G +SH Sbjct: 379 HATKRTPIKHDSTGAA---KALGEKLKKLYFNASAYTQNEAGAGNALWLSLVEQGRLSH- 434 Query: 436 HADSRRDMLDIAVRSAVQKRKGSAWGW-----GSSFKDGSEVPLEAVSLAYLGAKMAKAK 490 D ++D L++AVR + ++ + S GW SF G + + A + A+ Sbjct: 435 --DGQQD-LELAVRGSRRQDRTSG-GWMLVPRSESFDIGPAISMSGAVYAAMTARRPSGN 490 Query: 491 RRERSGRKRVS 501 R RKR S Sbjct: 491 SRATHRRKRHS 501 >gi|19975|lcl|protein:vir:108214 Length: 544 # NCBI annotation: gp5 # Family: family:all:523 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552334;genbank:gi:160700654;genbank:Ge neID:5758934 Length = 544 Score = 53.5 bits (127), Expect = 5e-09, Method: Compositional matrix adjust. 
Identities = 90/407 (22%), Positives = 156/407 (38%), Gaps = 33/407 (8%) Query: 40 WQQQVLDDWLAVGGNGRLASGVCGVFVPRQNGKNAILEIVELFKATIQGRRILHTAHELK 99 WQ +A +G A + +PRQNGK ++ + ++ G +I++TA + Sbjct: 78 WQWDAARKIMATRPDGLWAHPDVCLIIPRQNGKTQLIALRIIYGLFFLGEKIVYTAQRWQ 137 Query: 100 SARKAFMRLRSFFENERQFPDLYRMVKSIRATNGQEAIVLHHPDCATFERKCGCPGWGSV 159 + + + R+ E ++ P L R +K + + H + T GS+ Sbjct: 138 TVKDVYDRI---VEIIKRRPSLLRRLKPMPGVPDGYSEAGQHGEIYTTNG-------GSL 187 Query: 160 EFVARSRGSARGFTVDDL-VCDEAQELSDEQLEALLPTVSAAPSGDPQQIFLGTP--PGP 216 + R++ RG T DL + DEA ++ D + L AA +PQ I++ T Sbjct: 188 DMGPRTKAVGRGQTKIDLAIFDEAYDIKDVLVGGLTGAQKAA--TNPQTIYISTAAVASE 245 Query: 217 LADGSVVLRLRGQALSGGKRFAWTEFSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFGT 276 D V+ +R E+ P DD WR P+ G + Sbjct: 246 HPDCGVLAGMRRNGQRKEPDLYAAEWCAPPGMARDD-PEAWRLAC----PSFGITVRERD 300 Query: 277 VSDEHESMSA-----AGFARERLGW--WDRG-QSAASVVPADKWAQSAVDEASLVGGKVF 328 ++ E+ A A + + LGW W ++ ++ D W V + +LVG Sbjct: 301 LAREYRMARANARLLAIYDADYLGWGEWPPDPENTEPIIDPDWWEALTVLQPALVGDICI 360 Query: 329 GVSFSRSGDRVALAGAGRT-DAGVHVEVIDGLSGTIVDGVGRLADWLAVRWGDTDRIMVA 387 + + +A RT D VHVEV + I L + L W I+ Sbjct: 361 AIERTLDTRYWCIAAGQRTIDGRVHVEVGYWRAANIGVVAAALLE-LVELWNPAAIIVDD 419 Query: 388 GSGAVLLQKALTDRGVPGRGVVVADTGVYVEACQAFLEGVRSGVVSH 434 S A + + ++G+ + A T Q F++ V + V+H Sbjct: 420 RSKAKPIVGVMFNQGI---EIETASTPKLAMYTQGFIDAVNAADVTH 463 >gi|13269|lcl|protein:vir:81252 Length: 542 # NCBI annotation: gp2, terminase # Family: family:all:523 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456732;genbank:gi:157168375;interpro:I PR005021;uniprot:Q9MBK3;genbank:GeneID:5580375 Length = 542 Score = 42.4 bits (98), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 62/227 (27%), Positives = 94/227 (41%), Gaps = 37/227 (16%) Query: 64 VFVPRQNGKNAILEIVELFKATIQG-RRILHTAHELKSARKAFMRLRSFFENERQFPDLY 122 + V RQNGK ++ I+ L+K I G I+ A +L A L + F + PDL Sbjct: 75 LLVGRQNGKTLVMVILGLWKLFIDGCSEIVTAAQDLSVAEAT---LSNAFMLAKANPDLN 131 Query: 123 R----------MVKSIRATNGQEAIVL-HHPDCATFERKCGCPGWGSVEFVARSRGSARG 171 + MV +R NG I L + P + P W VA +RG R Sbjct: 132 QWLPWRMERGEMVPFMRTANGSNQIELAYAPVPEALDVFGAMPKWF---VVATNRGGGRS 188 Query: 172 FTVDDLVCDEAQELSDEQ-LEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLR--- 227 + + + DE +E +D Q A+ P V+ P Q++ + G + SVVLR + Sbjct: 189 HSAELAMLDELREHTDFQSWGAITPAVAERPRN---QVYGFSNAG--DEKSVVLRKQRNI 243 Query: 228 -----GQALSGGKRFAWTEFSIPDESDPDDVSRQWRKLAGDTNPALG 269 ++ + A E+S P+E D R NP+LG Sbjct: 244 CLKEISDGITDQSQLAIFEWSAPEECSIFD-----RDGWAAANPSLG 285 >gi|27798|lcl|protein:vir:8408 Length: 214 # NCBI annotation: gp3 # Family: family:all:30874 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818304;genbank:gi:29566740;genbank:GeneID :1260058 Length = 214 Score = 40.8 bits (94), Expect = 4e-05, Method: Compositional matrix adjust. 
Identities = 61/245 (24%), Positives = 83/245 (33%), Gaps = 51/245 (20%) Query: 59 SGVCGVFVPRQNGKNAILEIVELFKATIQGRRILHTAHELKSARKAFMRLRSFFENERQF 118 S + PRQNGK L A R+L+ + A +AF Sbjct: 17 SSISAFAAPRQNGKT----YAALAYALQYPGRVLYFGRGFREAGEAFA------------ 60 Query: 119 PDLYRMVKSIRATNGQEAIVLHHPDCATFERKCGCPGWGSVEFVARSRGSARGFTVDDLV 178 + A G I+ + + E G +G V F+ RGS RG D ++ Sbjct: 61 -----AATKLGANRGPGTILKTNKSQLSIETSLGGD-FGRVNFMPYGRGSGRGMGADLVI 114 Query: 179 CDEAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSGGKRFA 238 D+A E+ + L + P V G +A + L Q L G Sbjct: 115 LDDAHEVEADVLAEISPCVF-------------RTSGKIAGFGL---LHDQGLLG----- 153 Query: 239 WTEFSIPDESDPDDVSRQWRKLA-GDTNPALGRRLNFGTVSDEHESMSAAGFARERLGWW 297 F + D R WR NPALG V E E + F RERLG Sbjct: 154 -HLFRVADG------KRVWRGAEIASANPALGHLFTLEQVEREREILPGEIFRRERLGLN 206 Query: 298 DRGQS 302 +G S Sbjct: 207 SKGSS 211 >gi|10890|lcl|protein:vir:78144 Length: 530 # NCBI annotation: probable terminase # Family: family:all:523 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294797;genbank:gi:149882818;genbank:Ge neID:5309172 Length = 530 Score = 39.7 bits (91), Expect = 8e-05, Method: Compositional matrix adjust. 
Identities = 40/170 (23%), Positives = 65/170 (38%), Gaps = 35/170 (20%) Query: 39 PWQQQVLDDWLAVGGNGRLASGVCGVFVPRQNGKNAILEIVELFKATIQGRR-------- 90 PWQ+ +L L + GRL V V RQNGK I ++ + + R Sbjct: 43 PWQKWLLIHMLELDAFGRLRFRKALVIVGRQNGKTLIAAVLAAYWLYVDAGRWPDQLPEQ 102 Query: 91 ---ILHTAHELKSARKAFMRLRSFFENE--------RQFPDLYRMVKSIRATNGQEAIVL 139 ++ A +L A K + ++R + + + PDL R TNG+ + Sbjct: 103 DFIVVGAAQKLDIAMKPWRQVRRWGAPDDPKVGIARDRVPDLQAFTYPPRTTNGETELRT 162 Query: 140 HHPDCATFERKCGCPGWGSVEFVARSRGSARGFTVDDLVCDEAQELSDEQ 189 H G ++ R+ ARG + L+ DE +E D + Sbjct: 163 H----------------GGAAYLPRTFEGARGQSAARLILDELREQYDYE 196 >gi|4914|lcl|protein:vir:99572 Length: 459 # NCBI annotation: TerL-like protein # Family: family:all:54 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039811;genbank:gi:126011061;genbank:Ge neID:4818267 Length = 459 Score = 28.1 bits (61), Expect = 0.27, Method: Compositional matrix adjust. Identities = 24/71 (33%), Positives = 33/71 (46%), Gaps = 9/71 (12%) Query: 174 VDDLVCDEAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLAD---GSVVLRLRGQA 230 VD L +EAQ L++EQ + PT+ S QI+L P D + V+ Sbjct: 113 VDILWLEEAQYLTEEQWNVINPTIRREGS----QIWLIWNPDQYTDFIYQNFVVNPPADC 168 Query: 231 LSGGKRFAWTE 241 LS K+ WTE Sbjct: 169 LS--KQINWTE 177 >gi|12641|lcl|protein:vir:80117 Length: 572 # NCBI annotation: Phage terminase # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425601;genbank:gi:155042934;genbank:Ge neID:5469543 Length = 572 Score = 25.4 bits (54), Expect = 1.5, Method: Compositional matrix adjust. 
Identities = 12/39 (30%), Positives = 20/39 (51%), Gaps = 6/39 (15%) Query: 38 DPWQQQVLDDWLAVGGNG----RLASGVCGVFVPRQNGK 72 +PWQ+ ++ + L G R +F+PR+NGK Sbjct: 83 EPWQKFIIYNLLGFYLKGTKIRRFKEAF--IFIPRKNGK 119 >gi|5726|lcl|protein:vir:95379 Length: 573 # NCBI annotation: phage terminase # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764473;genbank:gi:115334627;genbank:GeneI D:5179266 Length = 573 Score = 25.4 bits (54), Expect = 1.5, Method: Compositional matrix adjust. Identities = 12/39 (30%), Positives = 20/39 (51%), Gaps = 6/39 (15%) Query: 38 DPWQQQVLDDWLAVGGNG----RLASGVCGVFVPRQNGK 72 +PWQ+ ++ + L G R +F+PR+NGK Sbjct: 83 EPWQKFIIYNLLGFYLKGTKIRRFKEAF--IFIPRKNGK 119 >gi|5142|lcl|protein:vir:105417 Length: 470 # NCBI annotation: gene 2 protein # Family: family:all:54 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958178;genbank:gi:41057280;genbank:GeneID :2716664 Length = 470 Score = 25.0 bits (53), Expect = 1.9, Method: Compositional matrix adjust. Identities = 31/144 (21%), Positives = 60/144 (41%), Gaps = 15/144 (10%) Query: 174 VDDLVCDEAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSG 233 +D +EA+ ++ E + L+PT+ S +I++ P + D + + ++ Sbjct: 112 IDICWVEEAEAVTKESWDILIPTIRKPFS----EIWVSFNPKNILDDT----YQRFVVNP 163 Query: 234 GKRFAWTEFSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFG---TVSD----EHESMSA 286 + D +V R + NP L R + G + SD + E + A Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223 Query: 287 AGFARERLGWWDRGQSAASVVPAD 310 A A ++LGW +G ++ P+D Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSD 247 >gi|17761|lcl|protein:vir:171 Length: 470 # NCBI annotation: terminase large subunit # Family: family:all:54 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112076;genbank:gi:13559866;genbank:GeneID :921009 Length = 470 Score = 25.0 bits (53), Expect = 2.1, Method: Compositional matrix adjust. 
Identities = 31/144 (21%), Positives = 60/144 (41%), Gaps = 15/144 (10%)

Query: 174 VDDLVCDEAQELSDEQLEALLPTVSAAPSGDPQQIFLGTPPGPLADGSVVLRLRGQALSG 233
+D +EA+ ++ E + L+PT+ S +I++ P + D + + ++
Sbjct: 112 IDICWVEEAEAVTKESWDILIPTIRKPFS----EIWVSFNPKNILDDT----YQRFVVNP 163

Query: 234 GKRFAWTEFSIPDESDPDDVSRQWRKLAGDTNPALGRRLNFG---TVSD----EHESMSA 286
+ D +V R + NP L R + G + SD + E + A
Sbjct: 164 PDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEA 223

Query: 287 AGFARERLGWWDRGQSAASVVPAD 310
A A ++LGW +G ++ P+D
Sbjct: 224 ATDAHKKLGWKAKGAVVSAHDPSD 247

>gi|19315|lcl|protein:vir:4508 Length: 577 # NCBI annotation: large terminase subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599034;genbank:gi:19548992;genbank:GeneID:935222
Length = 577

Score = 23.9 bits (50), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 26/51 (50%), Gaps = 4/51 (7%)

Query: 35 LTPDPWQQQVLD---DWLAVGGNGRLASGVCGVFVPRQNGKNAILEIVELF 82
+T +PWQ V+ W+ G R V +PR+NGK+AI V L+
Sbjct: 82 ITLEPWQLFVICCAFGWVNKGSRLRRFREV-YTEIPRKNGKSAISAGVALY 131

>gi|17922|lcl|protein:vir:4452 Length: 577 # NCBI annotation: Terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700375;genbank:gi:23505447;genbank:GeneID:955654
Length = 577

Score = 23.5 bits (49), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 18/54 (33%), Positives = 27/54 (50%), Gaps = 4/54 (7%)

Query: 35 LTPDPWQQQVLD---DWLAVGGNGRLASGVCGVFVPRQNGKNAILEIVELFKAT 85
+T +PWQ ++ W+ G R V +PR+NGK+AI V L+ T
Sbjct: 82 ITLEPWQLFIVCCAFGWVQKGTKLRRFREVYTE-IPRKNGKSAISAGVALYCFT 134

>gi|6927|lcl|protein:vir:106715 Length: 507 # NCBI annotation: gp19 # Family: family:all:144 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944327;genbank:gi:38638626;genbank:GeneID:2657344
Length = 507

Score = 23.1 bits (48), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 44/99 (44%), Gaps = 21/99 (21%)

Query: 167 GSARGFTVDDLVCDEAQELSDEQLEA------------LLPTVSAAPSGDPQQIFLGTPP 214
G GF++D + D+A + ++E L A +L T SG I +GT
Sbjct: 180 GPLTGFSIDVGIIDDATKNAEEALSAVVQDGLENWYDSVLLTRLQQLSG---VILIGT-- 234

Query: 215 GPLADGSVVLRLRGQALSGGKRFAWTEFSIPDESDPDDV 253
P + ++ R+R + + G F T S P +DPD +
Sbjct: 235 -PWSANDLLARVR-RKMEGQPNF--TLLSFPALNDPDQI 269

>gi|11323|lcl|protein:vir:78588 Length: 507 # NCBI annotation: TerL # Family: family:all:144 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294855;genbank:gi:149882918;genbank:GeneID:5291059
Length = 507

Score = 23.1 bits (48), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 27/99 (27%), Positives = 44/99 (44%), Gaps = 21/99 (21%)

Query: 167 GSARGFTVDDLVCDEAQELSDEQLEA------------LLPTVSAAPSGDPQQIFLGTPP 214
G GF++D + D+A + ++E L A +L T SG I +GT
Sbjct: 180 GPLTGFSIDVGIIDDATKNAEEALSAVVQDGLENWYDSVLLTRLQQLSG---VILIGT-- 234

Query: 215 GPLADGSVVLRLRGQALSGGKRFAWTEFSIPDESDPDDV 253
P + ++ R+R + + G F T S P +DPD +
Sbjct: 235 -PWSANDLLARVR-RKMEGQPNF--TLLSFPALNDPDQI 269

>gi|14466|lcl|protein:vir:3519 Length: 469 # NCBI annotation: P18 # Family: family:all:54 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050979;genbank:gi:9633565;genbank:GeneID:1262312
Length = 469

Score = 22.7 bits (47), Expect = 9.6, Method: Compositional matrix adjust.
Identities = 7/19 (36%), Positives = 16/19 (84%)

Query: 179 CDEAQELSDEQLEALLPTV 197
+EA+ +S++ L++L+PT+
Sbjct: 106 VEEAETVSEKSLDSLIPTI 124

Database: capsid_neck_tail
Posted date: Nov 7, 2013 12:16 PM
Number of letters in database: 206,069
Number of sequences in database: 514

Lambda K H
0.318 0.134 0.407

Lambda K H
0.267 0.0410 0.140

Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 226,542
Number of Sequences: 514
Number of extensions: 10640
Number of successful extensions: 89
Number of sequences better than 100.0: 30
Number of HSP's better than 100.0 without gapping: 18
Number of HSP's successfully gapped in prelim test: 12
Number of HSP's that attempted gapping in prelim test: 41
Number of HSP's gapped (non-prelim): 31
length of query: 503
length of database: 206,069
effective HSP length: 75
effective length of query: 428
effective length of database: 167,519
effective search space: 71698132
effective search space used: 71698132
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 39 (19.6 bits)