BLASTP 2.2.18 [Mar-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= lcl|NC_019717.1_cdsid_YP_007112135.1 [gene=F845_gp02]
[protein=terminase large subunit gpA] [protein_id=YP_007112135.1]
[location=521..2443]
         (640 letters)

Database: capsid_neck_tail
           514 sequences; 206,069 total letters

Searching...................................................done

                                                                     Score  E
Sequences producing significant alignments:                         (bits) Value

gi|17272|lcl|protein:vir:387 Length: 640 # NCBI annotation: gp2 ...   1200  0.0
gi|15908|lcl|protein:vir:3418 Length: 641 # NCBI annotation: DNA...   1186  0.0
gi|5978|lcl|protein:vir:95489 Length: 659 # NCBI annotation: Put...    612  e-177
gi|19539|lcl|protein:vir:10320 Length: 627 # NCBI annotation: OR...    275  1e-75
gi|12290|lcl|protein:vir:79539 Length: 699 # NCBI annotation: pu...     93  1e-20
gi|19767|lcl|protein:vir:6384 Length: 677 # NCBI annotation: ter...     79  1e-16
gi|8305|lcl|protein:vir:96700 Length: 689 # NCBI annotation: put...     47  7e-07
gi|19080|lcl|protein:vir:1659 Length: 540 # NCBI annotation: ter...     28  0.30
gi|2310|lcl|protein:vir:93989 Length: 540 # NCBI annotation: put...     28  0.31
gi|6927|lcl|protein:vir:106715 Length: 507 # NCBI annotation: gp...     28  0.37
gi|4336|lcl|protein:vir:94893 Length: 540 # NCBI annotation: put...     28  0.48
gi|11323|lcl|protein:vir:78588 Length: 507 # NCBI annotation: Te...     26  1.1
gi|15547|lcl|protein:vir:856 Length: 540 # NCBI annotation: puta...
26 1.4 gi|692|lcl|protein:vir:3649 Length: 507 # NCBI annotation: gp18 ... 26 1.7 gi|1758|lcl|protein:vir:101556 Length: 507 # NCBI annotation: gp... 26 1.7 gi|4674|lcl|protein:vir:105593 Length: 543 # NCBI annotation: te... 26 1.7 gi|1848|lcl|protein:vir:93873 Length: 540 # NCBI annotation: put... 26 1.8 gi|6623|lcl|protein:vir:95971 Length: 301 # NCBI annotation: ORF... 23 9.2 gi|13401|lcl|protein:vir:1275 Length: 200 # NCBI annotation: hyp... 23 9.2 >gi|17272|lcl|protein:vir:387 Length: 640 # NCBI annotation: gp2 # Family: family:all:140 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046897;genbank:gi:9630466;genbank:GeneID: 1261641 Length = 640 Score = 1200 bits (3105), Expect = 0.0, Method: Compositional matrix adjust. Identities = 563/640 (87%), Positives = 604/640 (94%) Query: 1 MNISNSQVKGLQHSARAGLRSLYRPEPQTAVEWADENYYLPKESAYQEGRWETLPFQRAI 60 M IS SQV L+ + +AGL+SLYRPEP TAVEWAD +YYLPKESAYQEGRWETLPFQRAI Sbjct: 1 MIISTSQVANLRTAVKAGLKSLYRPEPMTAVEWADAHYYLPKESAYQEGRWETLPFQRAI 60 Query: 61 MNAMGNDYIREVNVVKSARVGYSKMLLGVYAYFIQHKQRNSLIWLPTDGDAENFMKSHVE 120 MNAMG+DYIR VNV+KSARVGYSKMLLGV AYFI+HKQRN L+WLPTDGDA+NFMKSHVE Sbjct: 61 MNAMGSDYIRIVNVIKSARVGYSKMLLGVIAYFIEHKQRNELLWLPTDGDADNFMKSHVE 120 Query: 121 PTIRDIPTLLALAPWYGKKHRDNTLSMKRFSNGRGFWCLGGKAAKNYREKSVDVAGYDEL 180 PTIRD+P+LL+LAPWYGKKHRDNTLSMKRF+NGRGFWCLGGKAAKNYREKSVDV GYDEL Sbjct: 121 PTIRDVPSLLSLAPWYGKKHRDNTLSMKRFTNGRGFWCLGGKAAKNYREKSVDVVGYDEL 180 Query: 181 AAFDDDIEKEGSPTFLGDKRIEGSVWPKSIRGSTPKVKGTCQIERAAKESEHFLRFHVPC 240 AAFD DIEKEGSPTFLGDKRIEGSVWPKSIRGSTPK++GTCQIERAAKES HF+RFHV C Sbjct: 181 AAFDADIEKEGSPTFLGDKRIEGSVWPKSIRGSTPKLRGTCQIERAAKESGHFMRFHVAC 240 Query: 241 PHCGEEQYLKFGDEETPFGFKWSPGEPSSVYYLCEHNACVIKQQELDFLEARYICDETGI 300 PHCGEEQYLKFGD +TPFGFKW P + +VYYLCEHNACVIKQ ELDF ARYIC+ TGI Sbjct: 241 PHCGEEQYLKFGDRDTPFGFKWEPEQAETVYYLCEHNACVIKQHELDFSNARYICELTGI 300 Query: 301 WTRDGLHWFASSGTEIEPPDSVTFHIWTAYSPFTTWVQIVKDWIKTKGDTGKRKTFVNTT 360 WTRDGL WF+SS EI+PP+SVTFHIWTAYSPFTTWVQIVKDW KTKGDTGKRKTFVNTT Sbjct: 
301 WTRDGLRWFSSSNAEIDPPESVTFHIWTAYSPFTTWVQIVKDWFKTKGDTGKRKTFVNTT 360 Query: 361 LGETWEPKIGERPDAEVLAERKEHFEASVPERVAYLTAGIDSQLDRYEMRVWGWGPGEES 420 LGETWE KIG+RPDA+VLAERKEHF+A+VPERVAYLTAGIDSQLDRYEMRVWGWGPGEES Sbjct: 361 LGETWEAKIGDRPDADVLAERKEHFDAAVPERVAYLTAGIDSQLDRYEMRVWGWGPGEES 420 Query: 421 WLIDKIIVMGRHDDESTLARVDEAINRTYKRQNGLEMVISRTCWDIGGIDPTIVYNRSKK 480 WLID+ I+MGRHDDESTLARVDEAIN+TY R+NG+EM ISR CWDIGGIDPTIVYNRSKK Sbjct: 421 WLIDRQIIMGRHDDESTLARVDEAINKTYTRRNGVEMSISRICWDIGGIDPTIVYNRSKK 480 Query: 481 HGLFRVIPIKGASVYGKPVANMPRKRNKSGVYLTEVGTDTAKEQIYNRFTLVAQRDEPLA 540 HGLFRVIPIKGASVYGKPVANMPRKRNK+GVYLTEVGTDTAKEQIYNRFTL+ + DEPLA Sbjct: 481 HGLFRVIPIKGASVYGKPVANMPRKRNKNGVYLTEVGTDTAKEQIYNRFTLIVEGDEPLA 540 Query: 541 GAVHFPNNPEIYDLTEAQQLTAEEQVEKWVDGKKKIVWDSKKRRNEALDCFVYALAALRI 600 GAVHFPNNP+IYDL+EAQQLTAEE VEKWVDGK+KI+WDSKKRRNEALDCFVYALAALRI Sbjct: 541 GAVHFPNNPDIYDLSEAQQLTAEELVEKWVDGKRKIIWDSKKRRNEALDCFVYALAALRI 600 Query: 601 SISRWQLNLDSLLASLLEEEGSRNNNKTLADYARALSGEE 640 SISRWQLNLDSLLASLLEEEG+RNNNKTLADYARALSGEE Sbjct: 601 SISRWQLNLDSLLASLLEEEGTRNNNKTLADYARALSGEE 640 >gi|15908|lcl|protein:vir:3418 Length: 641 # NCBI annotation: DNA packaging protein # Family: family:all:140 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040581;genbank:gi:9626245;genbank:GeneID: 2703524 Length = 641 Score = 1186 bits (3067), Expect = 0.0, Method: Compositional matrix adjust. 
Identities = 563/640 (87%), Positives = 606/640 (94%) Query: 1 MNISNSQVKGLQHSARAGLRSLYRPEPQTAVEWADENYYLPKESAYQEGRWETLPFQRAI 60 MNISNSQV L+H RAGLRSL+RPEPQTAVEWAD NYYLPKESAYQEGRWETLPFQRAI Sbjct: 1 MNISNSQVNRLRHFVRAGLRSLFRPEPQTAVEWADANYYLPKESAYQEGRWETLPFQRAI 60 Query: 61 MNAMGNDYIREVNVVKSARVGYSKMLLGVYAYFIQHKQRNSLIWLPTDGDAENFMKSHVE 120 MNAMG+DYIREVNVVKSARVGYSKMLLGVYAYFI+HKQRN+LIWLPTDGDAENFMK+HVE Sbjct: 61 MNAMGSDYIREVNVVKSARVGYSKMLLGVYAYFIEHKQRNTLIWLPTDGDAENFMKTHVE 120 Query: 121 PTIRDIPTLLALAPWYGKKHRDNTLSMKRFSNGRGFWCLGGKAAKNYREKSVDVAGYDEL 180 PTIRDIP+LLALAPWYGKKHRDNTL+MKRF+NGRGFWCLGGKAAKNYREKSVDVAGYDEL Sbjct: 121 PTIRDIPSLLALAPWYGKKHRDNTLTMKRFTNGRGFWCLGGKAAKNYREKSVDVAGYDEL 180 Query: 181 AAFDDDIEKEGSPTFLGDKRIEGSVWPKSIRGSTPKVKGTCQIERAAKESEHFLRFHVPC 240 AAFDDDIE+EGSPTFLGDKRIEGSVWPKSIRGSTPKV+GTCQIERAA ES HF+RFHV C Sbjct: 181 AAFDDDIEQEGSPTFLGDKRIEGSVWPKSIRGSTPKVRGTCQIERAASESPHFMRFHVAC 240 Query: 241 PHCGEEQYLKFGDEETPFGFKWSPGEPSSVYYLCEHNACVIKQQELDFLEARYICDETGI 300 PHCGEEQYLKFGD+ETPFG KW+P +PSSV+YLCEHNACVI+QQELDF +ARYIC++TGI Sbjct: 241 PHCGEEQYLKFGDKETPFGLKWTPDDPSSVFYLCEHNACVIRQQELDFTDARYICEKTGI 300 Query: 301 WTRDGLHWFASSGTEIEPPDSVTFHIWTAYSPFTTWVQIVKDWIKTKGDTGKRKTFVNTT 360 WTRDG+ WF+SSG EIEPPDSVTFHIWTAYSPFTTWVQIVKDW+KTKGDTGKRKTFVNTT Sbjct: 301 WTRDGILWFSSSGEEIEPPDSVTFHIWTAYSPFTTWVQIVKDWMKTKGDTGKRKTFVNTT 360 Query: 361 LGETWEPKIGERPDAEVLAERKEHFEASVPERVAYLTAGIDSQLDRYEMRVWGWGPGEES 420 LGETWE KIGERPDAEV+AERKEH+ A VP+RVAYLTAGIDSQLDRYEMRVWGWGPGEES Sbjct: 361 LGETWEAKIGERPDAEVMAERKEHYSAPVPDRVAYLTAGIDSQLDRYEMRVWGWGPGEES 420 Query: 421 WLIDKIIVMGRHDDESTLARVDEAINRTYKRQNGLEMVISRTCWDIGGIDPTIVYNRSKK 480 WLID+ I+MGRHDDE TL RVDEAIN+TY R+NG EM ISR CWD GGIDPTIVY RSKK Sbjct: 421 WLIDRQIIMGRHDDEQTLLRVDEAINKTYTRRNGAEMSISRICWDTGGIDPTIVYERSKK 480 Query: 481 HGLFRVIPIKGASVYGKPVANMPRKRNKSGVYLTEVGTDTAKEQIYNRFTLVAQRDEPLA 540 HGLFRVIPIKGASVYGKPVA+MPRKRNK+GVYLTE+GTDTAKEQIYNRFTL + DEPL Sbjct: 481 HGLFRVIPIKGASVYGKPVASMPRKRNKNGVYLTEIGTDTAKEQIYNRFTLTPEGDEPLP 540 Query: 541 
GAVHFPNNPEIYDLTEAQQLTAEEQVEKWVDGKKKIVWDSKKRRNEALDCFVYALAALRI 600 GAVHFPNNP+I+DLTEAQQLTAEEQVEKWVDG+KKI+WDSKKRRNEALDCFVYALAALRI Sbjct: 541 GAVHFPNNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKRRNEALDCFVYALAALRI 600 Query: 601 SISRWQLNLDSLLASLLEEEGSRNNNKTLADYARALSGEE 640 SISRWQL+L +LLASL EE+G+ N KTLADYARALSGE+ Sbjct: 601 SISRWQLDLSALLASLQEEDGAATNKKTLADYARALSGED 640 >gi|5978|lcl|protein:vir:95489 Length: 659 # NCBI annotation: Putative large subunit (GpA homolog) of DNA packaging dimer # Family: family:all:140 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293346;genbank:gi:148912767;genbank:Ge neID:5228141 Length = 659 Score = 612 bits (1578), Expect = e-177, Method: Compositional matrix adjust. Identities = 315/613 (51%), Positives = 403/613 (65%), Gaps = 13/613 (2%) Query: 11 LQHSARAGLRSLYRPEPQTAVEWA---DENYYLPKESAYQEGRWETLPFQRAIMNAMGND 67 L+ + GL+ LY+ P TAVEWA D+ +Y+ ES+Y EG+W+T PFQ AI+NAMGND Sbjct: 11 LRKAVDLGLQGLYKSPPMTAVEWAEDPDDGFYMSAESSYNEGKWKTAPFQVAILNAMGND 70 Query: 68 YIREVNVVKSARVGYSKMLLGVYAYFIQHKQRNSLIWLPTDGDAENFMKSHVEPTIRDIP 127 IR VN VKSAR+GY+KML+ Y IQHK+RN L+W PTD DAE KSHV IRD+P Sbjct: 71 LIRVVNFVKSARIGYTKMLMANIGYKIQHKRRNVLMWSPTDPDAEGISKSHVNGLIRDVP 130 Query: 128 TLLALAPWYGKKHRDNTLSMKRFSNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDDDI 187 LLALAPWYG+KH DNTL K F+N R W LGGKAA+NYRE+S D YDEL+ FD DI Sbjct: 131 VLLALAPWYGRKHSDNTLDTKVFANRRTLWTLGGKAARNYRERSADEVIYDELSKFDADI 190 Query: 188 EKEGSPTFLGDKRIEGSVWPKSIRGSTPKVKGTCQIERAAKESEHFLRFHVPCPHCGEEQ 247 E EGSPTFLGD+R+ G+V+PKSIRGSTP +G CQI +AA ES LR+++PCPHCG EQ Sbjct: 191 EGEGSPTFLGDQRLRGAVYPKSIRGSTPGTEGQCQITKAADESPRRLRYYIPCPHCGHEQ 250 Query: 248 YLKFGDEETPFGFKW---SPGEPSSVYYLCEHNACVIKQQELDFLEA----RYICDETGI 300 LK+G ++ FG K+ GE SSV+Y CE+ C + + + A R+ C+ +G+ Sbjct: 251 TLKWGGKDCAFGVKYIANDLGEASSVWYACENERCSGTFEHHEMVVASERGRWKCEVSGV 310 Query: 301 WTRDGLHWFASSGTEIEPPDSVTFHIWTAYSPFTTWVQIVKDWIKTKGDTGKRKTFVNTT 360 WTRD + WF I P SV F+ W YS +T+W+ ++ +W+K KGD K KTF NT Sbjct: 311 
WTRDAMEWFGPDDQPIRTPRSVAFYCWAVYSTWTSWLDLIDEWLKVKGDREKLKTFTNTI 370 Query: 361 LGETWEPKIGERPDAEVLAERKEHFEASVPERVAYLTAGIDSQLDRYEMRVWGWGPGEES 420 LGE W GER + + L R+E++ VP + L GID+Q DRYE RVW +G GEE+ Sbjct: 371 LGEVWVEDEGERVEWQTLYARRENY-PKVPPQALVLMGGIDTQDDRYEGRVWAFGLGEEA 429 Query: 421 WLIDKIIVMGRHDDESTLARVDEAINRTYKRQNGLEMVISRTCWDIGGIDPTIVYNRSKK 480 WL+ + I+ G E +V I+R + R +G+ M + R CWD GG V S K Sbjct: 430 WLVHRFILTGDPASEELRRKVGLEIHRQFTRADGVPMRVERWCWDAGGHYSDEVEAESIK 489 Query: 481 HGLFRVIPIKGASVYGKPVANMPRKRNKSGVYLTEVGTDTAKEQIYNRFTL-VAQRDEPL 539 HG+ V+P GAS YGKP+AN P KR K VY TE+GTD AKE IY+R + V +P Sbjct: 490 HGVHWVVPTFGASTYGKPIANFP-KRRKRKVYKTELGTDNAKELIYSRLRIDVPIPWQPT 548 Query: 540 AGAVHFPNNPEIYDLTEAQQLTAEEQVEKWVDGKKKIVWDSKKRRNEALDCFVYALAALR 599 G VHFP + +I D E +Q+TAE++ G + + WDS RRNEALDCFVYALAALR Sbjct: 549 PGCVHFPIDSDICDEDELKQITAEKKKSVMAKGVRVLRWDSGGRRNEALDCFVYALAALR 608 Query: 600 ISISRWQLNLDSL 612 IS R+ L+LD L Sbjct: 609 ISQQRFGLDLDQL 621 >gi|19539|lcl|protein:vir:10320 Length: 627 # NCBI annotation: ORF22 # Family: family:all:140 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758915;genbank:gi:27311189;genbank:GeneID :956138 Length = 627 Score = 275 bits (703), Expect = 1e-75, Method: Compositional matrix adjust. 
Identities = 190/616 (30%), Positives = 300/616 (48%), Gaps = 57/616 (9%) Query: 19 LRSLYRPEPQTAV-EWADENYYLPKESAYQEGRWETLPFQRAIMNAMGNDYIREVNVVKS 77 +RS + P P + +W D+NY LP+E + G+W+T PFQ I +AM + V V+KS Sbjct: 15 VRSTWTPPPNLTISQWGDKNYVLPEE--HGGGKWKTKPFQIGIADAMCDPEEERVTVMKS 72 Query: 78 ARVGYSKMLLGVYAYFIQHKQRNSLIWLPTDGDAENFMKSHVEPTIRDIPTLLALAPWYG 137 RVGY+K++ Y++ + L+ PT DAE F K + P +RD+P L G Sbjct: 73 MRVGYTKIVDLAIGYYMDADPCSMLVVQPTIDDAEGFSKDEIAPMLRDVPCL------QG 126 Query: 138 KKHRDNTLSMKRFSNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDDDIEKEGSPTFLG 197 K RD+ +K+ G +G + +R +V + +DE++A+ + K+G P G Sbjct: 127 KVQRDDDTLLKKVYPGGSLTLVGANSPTGFRRLTVRIVIFDEMSAYPANTGKDGDPVRQG 186 Query: 198 DKRIEGSVWPKSIRGSTPKVKGTCQIERAAKESEHFLRFHVPCPHCGEEQYLKFGDEETP 257 + R + K I GSTP + G C+IE+ S+ FHVPCPHCG + L++ + Sbjct: 187 EGRTFSAFNRKIIAGSTPTIAGVCRIEKEFIRSDQRY-FHVPCPHCGHKHILQWSN---- 241 Query: 258 FGFKWSPGEPSSVYYLCEHNAC-----------VIKQQELDFLEARYIC---DETGIWTR 303 F+W G+P +++C +C ++ E ++ C E W + Sbjct: 242 --FRWPEGQPELAHFVCP--SCKKDIEEGSKKEMVAAGEFRSIKPFTCCGHEQEPEAWDK 297 Query: 304 DGLHWFASSGTEIEPPDSVTFHIWTAYS--PFTTWVQIVKDWIKTKGDTGKRKTFVNTTL 361 G G E++ FHIW AYS P W ++ K W + K D ++ +VNT Sbjct: 298 KGRPICKHCG-EVKISGHAGFHIWAAYSDLPNAKWSKLAKYWEEVKDDPDEKVVYVNTIR 356 Query: 362 GETWEPKIGERPDAEVLAERKEHF----EASVPERVAYLTAGIDSQLDRYEMRVWGWGPG 417 GET++ E D + L +R+E + + VPE V + A +D+Q +R EM G G G Sbjct: 357 GETYKETETEV-DWKPLYDRREPYGDDHDGKVPEAVRIILATVDTQDNRLEMTTIGIGEG 415 Query: 418 EESWLIDKIIVMGRHDDESTLARVDEAINRTYKRQNGLEMVISRTCWDIGG--IDPTIVY 475 EE WL+++ + MG+ D+ TLA++ A++RTY G M I+ D+ G D + Y Sbjct: 416 EEVWLLNRKVFMGQPDNPETLAQLTRALDRTYTHACGFSMGITACAIDVQGHYYDTMLAY 475 Query: 476 NRSKKHGLFRVIPIKGASVYGKPVANMPRKRNKSGVYLTEVGTDTAKEQIYNRFTLVAQR 535 R + I+G + Y P P + N + L +G + K +I R Sbjct: 476 CAQHSD---RCVAIRGGNDYAAPAIKPPSRSNVYRIPLYTLGVNNIKNRIAKRLRF---- 528 Query: 536 DEPLAGAVHFPNNPEIYDLTEAQQLTAEEQVEKWVDG-KKKIVWDSKKRRNEALDCFVYA 594 P +H+P + E +++ +QLTAE V ++ +G ++ + K RNEA D VYA Sbjct: 529 
KYPGRFFIHWPKSNE-FEVDYFEQLTAETVVTEYKNGIPYRVFKNPTKARNEAWDLLVYA 587 Query: 595 LAALRISISRWQLNLD 610 A L W LN D Sbjct: 588 YALL------WILNPD 597 >gi|12290|lcl|protein:vir:79539 Length: 699 # NCBI annotation: putative large terminase subunit # Family: family:all:140 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272515;genbank:gi:148609384;genbank:Ge neID:5204375 Length = 699 Score = 92.8 bits (229), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 156/629 (24%), Positives = 243/629 (38%), Gaps = 125/629 (19%) Query: 52 ETLPFQRAIMNAMGNDYIREVNVVKSARVGYSKMLL-GVYAYFIQHKQRNSLIWLPTDGD 110 E P+ MN + + V V AR G + L+ G Y I + L+ T+ Sbjct: 48 ELTPYIIEPMNCLASREYDAVIFVGPARTGKTIGLIDGWIVYTIVCDPSDMLVVQMTEDK 107 Query: 111 AENFMKSHVEPTIRDIPTLLALAPWYGKKHRDNTLSMKRFSNGR----GFWCLGGKAAKN 166 A K ++ T R + A+ + DN + K F +G G+ + ++ + Sbjct: 108 AREHSKKRLDRTFR---SSAAVKKRMSPRRNDNNVHDKTFRDGSFLKIGWPSVNIMSSSD 164 Query: 167 YREKSVDVAGYDELAAFDDDIEKEGSPTFLGDKR-------------------IEGSVWP 207 YR V + YD F ++I+ EG L KR I S W Sbjct: 165 YR--FVALTDYDR---FPENIDSEGDGFSLASKRTTTFMSAGMTLVESSPGRDICDSKWR 219 Query: 208 KSIRGSTPKVKGTCQIERAAKESEHFLRFHVPCPHCGE------EQYLKFGDEETPFGFK 261 + P G + R++ PCPHCGE + + +E PF Sbjct: 220 RKSPHEAPPTTGILSLYNRGDRR----RWYWPCPHCGEYFQPAMDAMTGYRNEPDPFKAS 275 Query: 262 WSPGEPSSVYYLCEHNACVI---KQQELDFLEARYICDETGIWTRDGLHWFASSGTEIEP 318 + Y LC H + +I K++EL + G+W R+G + EP Sbjct: 276 ------EAAYLLCPHCSGIITAEKKREL---------NSAGVWLREGQVIDRNGNVSGEP 320 Query: 319 PDSVTFHIWT--AYSPFTTWVQIVKDWIKTKGD---TGKRKTF---VNTTLGETWEPKIG 370 S W + + TW Q+V + + + TG +T +NT G + P+ Sbjct: 321 RRSRIASFWMEGPAAAYQTWAQLVYKLLTAEQEYEATGSEETLRAVINTDWGLPYLPRAS 380 Query: 371 -ERPDAEVLAERKEHFEA-SVPERVAYLTAGIDSQLDRYE---MRVWGWGPGEESWLIDK 425 E+ +E+L +R E + SVP+ V +L A +D Q R+ ++V G+G E W+ID+ Sbjct: 381 MEQRKSELLEQRAEPVPSRSVPDGVNFLVAAVDVQAGRHRRFVVQVTGYGSRGERWIIDR 440 Query: 426 --IIVMGRHDDESTLARVDEA---------INRTYKRQNGL------EMVISRTCWDIGG 468 I R D + R+D A + + + L +M + D GG Sbjct: 441 
YNITQSLRGDSDGESQRIDPASYPEDWDVLLTDVFHKSWPLASDPSQQMRLMAMAVDSGG 500 Query: 469 IDPTI-----VYNRSKKHGLF-RVIPIKGASV---------YGKPVANMPRKRNKSG-VY 512 D + R ++ GL R+ KG S+ + R+ +G V Sbjct: 501 EDGVTDNAYKFWRRCRRDGLGKRIYLFKGDSIRRAKLITRTFPDNTGRTGRRAQAAGDVP 560 Query: 513 LTEVGTDTAKEQIYNRFTLVAQRDEPLAGAVHFPN--NPEIYDLTEAQQLTAEEQVEKWV 570 L + TD K+++ N RD P G VHFP+ YD +LT EE+ Sbjct: 561 LWLLQTDALKDRVNNAL----WRDSPGPGYVHFPDWLGSWFYD-----ELTYEERSS--- 608 Query: 571 DGKKKIVWDSKKR-RNEALDCFVYALAAL 598 DGK W R NEA D VYA A + Sbjct: 609 DGK----WSKPGRGANEAFDLMVYAEALV 633 >gi|19767|lcl|protein:vir:6384 Length: 677 # NCBI annotation: terminase large subunit TerL # Family: family:all:140 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918997;genbank:gi:34610172;genbank:gi:912 14212;genbank:GeneID:2559603 Length = 677 Score = 79.0 bits (193), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 148/654 (22%), Positives = 239/654 (36%), Gaps = 111/654 (16%) Query: 19 LRSLYRPEPQTAVEWADENYYLPKESAYQEGRW--ETLPFQRAIMNAM-GNDYIREVNVV 75 L ++RP + V A Y G W ET+P+ MN + D+ EV V Sbjct: 5 LADIFRPPERLTVSQAAAKYRKLNTPGAYVGDWFNETVPYMVEPMNTLTSRDFDAEV-FV 63 Query: 76 KSARVGYSK-MLLGVYAYFIQHKQRNSLIWLPTDGDAENFMKSHVEPTIRDIPTLLALAP 134 A+ G + +LL Y ++ + +++ PT+ A +F ++ R P + A+ Sbjct: 64 GPAQCGKTDGLLLNFVGYTVKVDPMDMILYCPTNSAARDFSVRRIDRMHRHSPEIGAMM- 122 Query: 135 WYGKKHRDNTLSMKRFSNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDDDIEKEGSPT 194 + RD + L + + + + + DD++ +G P Sbjct: 123 ---ARSRDTDNKFDKHYKDGTILTLSWPSVTEFAGRPIGRVCLTDYDRMPDDVDGDGEPF 179 Query: 195 FLGDKRI---------------------EGSVWPKSIRGSTPKVKGTCQIERAAKESEHF 233 L KR +G W S P KG + + Sbjct: 180 DLASKRTTTFGSFAMTLAESSPSREIEEDGRKWLASSPHEAPPCKGILGLYNRGDRRRRY 239 Query: 234 LRFHVPCPHCGE--EQYLKFGDEETPFGFKWSP-GEPSSVYYLCEHNA-CVIKQQELDFL 289 + CPHCG+ E K KW G+ S A C + E D Sbjct: 240 WK----CPHCGDWFEPTFKL--------LKWDDCGDAVSCADTVRMEAPCCGGRIEAD-- 285 Query: 290 EARYICDETGIWTRDGLHWFASSGTEIEPPDSVTFHIWT--AYSPFTTWVQIVKDWIKTK 347 R D G+W +DG A P S W + F +W ++V ++I + 
Sbjct: 286 -QRNDLDLWGVWLKDGESMTADDKRVGTPRRSRIASFWMNGVVAAFISWRKLVANYITAE 344 Query: 348 GD---TGKR---KTFVNTTLGETWEPKIGERPD-AEVLAERKEHF-EASVPERVAYLTAG 399 D TG + K F NT LGE + + E E L R E E VP+ V +L Sbjct: 345 EDYERTGSQESLKKFYNTDLGEPYFHRGNETVLLPETLKARAEVLREKHVPKAVRFLIGI 404 Query: 400 IDSQLDRYEMRVWGWGPGE--ESWLIDKIIVMGR----HDDESTLARVD----------- 442 D Q + + V+G PG + +++D+ V+ HD + + Sbjct: 405 CDVQKNMWVCNVFGIAPGNPYDIYVVDRFNVIKSQRVDHDGDREWVKPHAYLEDWQEVRT 464 Query: 443 EAINRTYKRQN--GLEMVISRTCWDIGGIDPT--IVYNRSKK------HGLFRVI----- 487 + + + Y + G M + T D GG + + YN ++ HG F +I Sbjct: 465 QVMEKMYPLDDDSGRVMQLKMTFCDSGGREGVTGMAYNFYRQLREEGLHGRFHLIKGEPK 524 Query: 488 PIKGASVYGKPVANMPRK--RNKSGVYLTEVGTDTAKEQIYNRFTLVAQRDEPLAGAVHF 545 P + G P AN K + V + + ++ K+ R +V P +G VHF Sbjct: 525 PGHPRTRVGYPDANHKDKWSAARGDVPVLFLNSNLLKDTALGRLEVVT----PGSGMVHF 580 Query: 546 PN-NPEIYDLTEAQQLTAEEQVEK-WVDGKKKIVWDSKKRRNEALDCFVYALAA 597 P P+ Y + QL +E + +K WV + +RNE+ D Y L A Sbjct: 581 PEWLPDSYYV----QLVSERRTDKGWV--------ATSVKRNESWDLLYYCLGA 622 >gi|8305|lcl|protein:vir:96700 Length: 689 # NCBI annotation: putative phage terminase large subunit # Family: family:all:140 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039815;genbank:gi:126010850;genbank:Ge neID:5076210 Length = 689 Score = 47.0 bits (110), Expect = 7e-07, Method: Compositional matrix adjust. 
Identities = 106/461 (22%), Positives = 162/461 (35%), Gaps = 74/461 (16%) Query: 11 LQHSARAGLRSLYRPEPQTAVEWADENYYLPKESAYQEGRW--ETLPFQRAIMNAMGNDY 68 L S R + P P+TA EWA EN +P S G + +T P+ I++A N Sbjct: 36 LTASLRLMAEMVKAPPPRTADEWARENRIMPPTSPI-PGPFNPDTNPYMIPIVSAFANPQ 94 Query: 69 IREVNVVKSARVGYSKMLLGVYAYFIQHKQRNSLIWLPTDGDAENFMKSHVEPTIRDIPT 128 V V ++G S + + + + + PT N + + VEP D Sbjct: 95 YNRVTFVMGTQMGKSVSMENLVGWRLDDDPTPIMYVAPT----SNLIDTTVEPKFMD--- 147 Query: 129 LLALAPWYGKKHRDN-TLSMKRFSNGRGF---WCLGGKAAKNYREKSVDVAGYDELAAFD 184 + A +K+ N + ++ G F W A + E + D AG + D Sbjct: 148 MFQQAESLARKYDWNRSTKYTKWVGGTKFRFAW------AGSPTELAADSAGLVLVDEVD 201 Query: 185 DDIEK-EGSPTFLGDKRIEGSVWPKSIRGSTPKVKGTCQIERAAKESEHFLRFH------ 237 + EG T + + R + V K +TP + E H+ R H Sbjct: 202 RIVNTGEGDTTEIIEARGDAYVDSKIGYTATPTHGKVERTEHPRTGLTHWARSHRDALSS 261 Query: 238 ---------------VPCPHCGEEQYLKFGDEETPFGFKWSPGEPS-----------SVY 271 VPCPHCG QY E W PG+ + Sbjct: 262 AIWRLWQSGTRHEWAVPCPHCG--QYFIPHSE-----LLWWPGKGTEEECTPDQAEKKAM 314 Query: 272 YLCEHNACVIKQQELDFLEARYICDETG-IWTRDGLHWFASSGTEIEPPDSVTFHIWT-- 328 C N C+I+ + + R + G T DG+ E + S F +W Sbjct: 315 LTCPRNGCMIEDKYRAAMNKRGVPVAPGQTVTPDGV-----IEGEADTAGSSHFSMWVSG 369 Query: 329 --AYSPFTTWVQIVKDWIKT--KGDTGKRKTFVNTTLGETWEPKIGERPD-AEVLAERKE 383 +++ ++ + K GD + NT GE + GE P EV A R Sbjct: 370 LCSFAAKKSYGFLAKKLAAALQSGDPETLQGVYNTGFGECYA-LTGEVPAWEEVKAMRWS 428 Query: 384 HFEASVPERVAYLTAGIDSQLDRYEMRVWGWGPGEESWLID 424 + V L +D Q +R V W PG S L++ Sbjct: 429 YSAGEVLPGAEKLICTVDVQKNRLVYVVRAWFPGMGSQLVE 469 >gi|19080|lcl|protein:vir:1659 Length: 540 # NCBI annotation: terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044948;genbank:gi:9629655;genbank:GeneID :1261297 Length = 540 Score = 28.1 bits (61), Expect = 0.30, Method: Compositional matrix adjust. 
Identities = 17/51 (33%), Positives = 27/51 (52%), Gaps = 4/51 (7%) Query: 9 KGLQHSARAGLRSLYRPEPQT-AVEWADENYYLPKESAYQEGRWETLPFQR 58 K +Q R + +YR + T A+EW ++N+YL + + E LP QR Sbjct: 24 KTIQKQIRIHNKYIYRYDRVTQAIEWIEDNFYLTTGNLM---KIELLPTQR 71 >gi|2310|lcl|protein:vir:93989 Length: 540 # NCBI annotation: putative terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764316;genbank:gi:115315630;genbank:Gene ID:5176576 Length = 540 Score = 28.1 bits (61), Expect = 0.31, Method: Compositional matrix adjust. Identities = 17/51 (33%), Positives = 27/51 (52%), Gaps = 4/51 (7%) Query: 9 KGLQHSARAGLRSLYRPEPQT-AVEWADENYYLPKESAYQEGRWETLPFQR 58 K +Q R + +YR + T A+EW ++N+YL + + E LP QR Sbjct: 24 KTIQKQIRIHNKYIYRYDRVTQAIEWIEDNFYLTTGNLM---KIELLPTQR 71 >gi|6927|lcl|protein:vir:106715 Length: 507 # NCBI annotation: gp19 # Family: family:all:144 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944327;genbank:gi:38638626;genbank:GeneID :2657344 Length = 507 Score = 28.1 bits (61), Expect = 0.37, Method: Compositional matrix adjust. Identities = 24/93 (25%), Positives = 38/93 (40%), Gaps = 16/93 (17%) Query: 382 KEHFE----ASVPERVAYLTAGIDSQL------DRYEMRVWGWGPGEESWLID----KII 427 +EH + A +P++ + D+ D + VWG E WLID K+ Sbjct: 320 REHLQYYHAADLPKQFVRVIMSCDATFKDGQASDFVFVGVWGKTADERVWLIDWRREKLA 379 Query: 428 VMGRHDDESTLARVDEAINRTY--KRQNGLEMV 458 M + L R A++R Y K NG ++ Sbjct: 380 FMATAQAIADLKRKHAAVSRVYIEKAANGAALI 412 >gi|4336|lcl|protein:vir:94893 Length: 540 # NCBI annotation: putative terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762513;genbank:gi:115304212;genbank:Gene ID:5141206 Length = 540 Score = 27.7 bits (60), Expect = 0.48, Method: Compositional matrix adjust. 
Identities = 17/51 (33%), Positives = 26/51 (50%), Gaps = 4/51 (7%) Query: 9 KGLQHSARAGLRSLYRPEPQT-AVEWADENYYLPKESAYQEGRWETLPFQR 58 K +Q R + +YR + T A+EW +N+YL + + E LP QR Sbjct: 24 KTIQKQIRIHNKYIYRYDRVTQAIEWIQDNFYLTTGNLM---KIELLPTQR 71 >gi|11323|lcl|protein:vir:78588 Length: 507 # NCBI annotation: TerL # Family: family:all:144 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294855;genbank:gi:149882918;genbank:Ge neID:5291059 Length = 507 Score = 26.2 bits (56), Expect = 1.1, Method: Compositional matrix adjust. Identities = 23/93 (24%), Positives = 38/93 (40%), Gaps = 16/93 (17%) Query: 382 KEHFE----ASVPERVAYLTAGIDSQL------DRYEMRVWGWGPGEESWLID----KII 427 +EH + A +P++ + D+ D + VWG E WLID K+ Sbjct: 320 REHLQYYHAADLPKQFVRVIMSCDATFKDGQASDFVFVGVWGKTADERVWLIDWRREKLA 379 Query: 428 VMGRHDDESTLARVDEAINRTYKRQ--NGLEMV 458 M + L R A++R Y + NG ++ Sbjct: 380 FMATAQAIADLKRKHAAVSRVYIEEAANGAALI 412 >gi|15547|lcl|protein:vir:856 Length: 540 # NCBI annotation: putative terminase subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047115;genbank:gi:9630568;genbank:GeneID :1261755 Length = 540 Score = 25.8 bits (55), Expect = 1.4, Method: Compositional matrix adjust. Identities = 12/33 (36%), Positives = 20/33 (60%), Gaps = 1/33 (3%) Query: 9 KGLQHSARAGLRSLYRPEPQT-AVEWADENYYL 40 K +Q R + +YR + T A+EW ++N+YL Sbjct: 24 KTIQKQIRIHKKYIYRYDRVTQAIEWIEDNFYL 56 >gi|692|lcl|protein:vir:3649 Length: 507 # NCBI annotation: gp18 # Family: family:all:144 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705644;genbank:gi:23752329;genbank:GeneID :955741 Length = 507 Score = 25.8 bits (55), Expect = 1.7, Method: Compositional matrix adjust. 
Identities = 17/54 (31%), Positives = 24/54 (44%), Gaps = 6/54 (11%) Query: 411 VWGWGPGEESWLID----KIIVMGRHDDESTLARVDEAINRTYKRQ--NGLEMV 458 VWG E WLID K+ M + L R A++R Y + NG ++ Sbjct: 359 VWGKTADERVWLIDWRREKLAFMATAQAIADLKRKHAAVSRVYIEEAANGAALI 412 >gi|1758|lcl|protein:vir:101556 Length: 507 # NCBI annotation: gp18 # Family: family:all:144 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958123;genbank:gi:41057669;genbank:GeneID :2716813 Length = 507 Score = 25.8 bits (55), Expect = 1.7, Method: Compositional matrix adjust. Identities = 17/54 (31%), Positives = 24/54 (44%), Gaps = 6/54 (11%) Query: 411 VWGWGPGEESWLID----KIIVMGRHDDESTLARVDEAINRTYKRQ--NGLEMV 458 VWG E WLID K+ M + L R A++R Y + NG ++ Sbjct: 359 VWGKTADERVWLIDWRREKLAFMATAQAIADLKRKHAAVSRVYIEEAANGAALI 412 >gi|4674|lcl|protein:vir:105593 Length: 543 # NCBI annotation: terminase large subunit # Family: family:all:147 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164303;genbank:gi:56692921;genbank:GeneID :3197204 Length = 543 Score = 25.8 bits (55), Expect = 1.7, Method: Compositional matrix adjust. 
Identities = 31/131 (23%), Positives = 49/131 (37%), Gaps = 9/131 (6%) Query: 448 TYKRQNGLEMVIS-----RTCWDIGGIDPTIVYNRSKKHGLFRVIPIKGASV--YGKPVA 500 T +++NG+ V+ T WDIG D ++ + HG R I A V Sbjct: 342 TLRKRNGITKVLILDMPINTFWDIGRSDGCAIWFHQELHGEDRFIDYYEAHNEDLRHYVK 401 Query: 501 NMPRKRNKSGVYLTEVGTDTAKEQIYNRFTLVAQRDEPLAGAVHFPNNPEIYDLTEAQQL 560 M + G + + + +NR TL +D L F P I +L Q Sbjct: 402 EMRDRGYLFGTHFLPHDAEHKRLSDFNRSTLEMLQD--LMPGEQFAIVPRITELVTGVQQ 459 Query: 561 TAEEQVEKWVD 571 T + ++D Sbjct: 460 TRKHMKTAYLD 470 >gi|1848|lcl|protein:vir:93873 Length: 540 # NCBI annotation: putative terminase large subunit # Family: family:all:35 # ACLAME annotation(s): phi:0000073 - phage terminase large subunit # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764262;genbank:gi:115315575;genbank:Gene ID:5141567 Length = 540 Score = 25.8 bits (55), Expect = 1.8, Method: Compositional matrix adjust. Identities = 12/34 (35%), Positives = 20/34 (58%), Gaps = 1/34 (2%) Query: 9 KGLQHSARAGLRSLYRPEPQT-AVEWADENYYLP 41 K +Q R + +YR + T A+EW ++N+YL Sbjct: 24 KTIQKQIRIHGKYIYRYDRVTQAIEWIEDNFYLT 57 >gi|6623|lcl|protein:vir:95971 Length: 301 # NCBI annotation: ORF014 # Family: family:all:102 # ACLAME annotation(s): phi:0000082 - phage major tail protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239807;genbank:gi:66395464;genbank:GeneID :5132888 Length = 301 Score = 23.1 bits (48), Expect = 9.2, Method: Compositional matrix adjust. Identities = 11/29 (37%), Positives = 17/29 (58%) Query: 568 KWVDGKKKIVWDSKKRRNEALDCFVYALA 596 + DGK+KI++DS K + + F LA Sbjct: 162 RIADGKRKIMFDSAKDGADEKEFFKVLLA 190 >gi|13401|lcl|protein:vir:1275 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:102 # ACLAME annotation(s): phi:0000082 - phage major tail protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690767;genbank:gi:22855007;genbank:GeneID :955222 Length = 200 Score = 23.1 bits (48), Expect = 9.2, Method: Compositional matrix adjust. 
 Identities = 13/31 (41%), Positives = 17/31 (54%), Gaps = 3/31 (9%)

Query: 375 AEVLAERKEHFEASVPERVAYLTAGIDSQLD 405
           AEVL + K+  E SVPE    L   +  +LD
Sbjct: 14  AEVLKDTKDELEFSVPEE---LPGAVSMKLD 41


  Database: capsid_neck_tail
    Posted date:  Nov 7, 2013 12:16 PM
  Number of letters in database: 206,069
  Number of sequences in database:  514

Lambda     K      H
   0.317    0.134    0.417

Lambda     K      H
   0.267   0.0410    0.140


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 312,426
Number of Sequences: 514
Number of extensions: 15039
Number of successful extensions: 67
Number of sequences better than 100.0: 22
Number of HSP's better than 100.0 without gapping: 6
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 43
Number of HSP's gapped (non-prelim): 23
length of query: 640
length of database: 206,069
effective HSP length: 77
effective length of query: 563
effective length of database: 166,491
effective search space: 93734433
effective search space used: 93734433
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 40 (20.0 bits)
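
Note on the statistics footer: the effective lengths and the effective search space reported above can be reproduced from the other footer values with simple arithmetic. The following is a minimal sketch, not part of the BLAST output itself. The function names `effective_lengths` and `bit_score` are hypothetical, and the sketch assumes that the second Lambda/K row (0.267, 0.0410) holds the gapped parameters and that bit scores follow the usual normalization S' = (lambda * S - ln K) / ln 2.

```python
import math


def effective_lengths(query_len, db_len, n_seqs, hsp_len):
    """Return (effective query length, effective database length, search space).

    Assumes the effective HSP length is subtracted once from the query
    and once per database sequence, which matches the footer values.
    """
    eff_query = query_len - hsp_len
    eff_db = db_len - n_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score (assumed normalization)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Values copied from the footer of this report.
eff_q, eff_db, space = effective_lengths(640, 206_069, 514, 77)
print(eff_q, eff_db, space)  # 563 166491 93734433

# Raw score 61 is reported as 28.1 bits in the alignments above.
print(round(bit_score(61, 0.267, 0.0410), 1))
```

Because Lambda and K are printed with limited precision, recomputed bit scores for very high-scoring hits (for example raw score 3105) can differ from the reported value by a fraction of a bit.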