Query lcl|NC_019515.1_cdsid_YP_007005931.1 [gene=F400_gp080] [protein=hypothetical protein] [protein_id=YP_007005931.1] [location=complement(50257..50901)] Match_columns 214 No_of_seqs 16 out of 20 Neff 3.3 Searched_HMMs 1612 Date Thu Nov 7 17:16:54 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_80 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_80_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:103305 Length: 245 72.3 0.011 7.1E-06 31.2 1.3 117 92-214 1-149 (245) 2 protein:vir:7020 Length: 246 # 42.0 0.81 0.0005 21.1 5.8 133 30-214 1-150 (246) 3 protein:vir:97033 Length: 245 36.9 1.1 0.00071 20.3 5.9 134 29-214 1-150 (245) 4 protein:vir:105646 Length: 245 36.9 1.1 0.00069 20.3 5.8 134 29-214 1-150 (245) 5 protein:vir:78741 Length: 197 21.2 0.36 0.00022 23.0 0.1 107 72-214 1-120 (197) 6 protein:vir:80215 Length: 211 15.3 4 0.0025 17.3 5.8 108 62-214 1-116 (211) 7 protein:vir:78929 Length: 184 14.6 1.2 0.00076 20.1 1.4 102 64-214 1-113 (184) 8 protein:vir:6325 Length: 184 # 13.4 4.6 0.0029 16.9 6.4 105 64-214 1-113 (184) 9 protein:vir:1542 Length: 196 # 8.7 7.5 0.0047 15.8 5.2 116 67-214 1-127 (196) 10 protein:vir:3365 Length: 196 # 8.5 7.8 0.0048 15.7 5.3 116 67-214 1-127 (196) No 1 >protein:vir:103305 Length: 245 # NCBI annotation: tail-like protein # Family: family:all:824 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039669;genbank:gi:125999998;genbank:GeneID:4818381 Probab=72.25 E-value=0.011 Score=31.22 Aligned_cols=117 Identities=19% Similarity=0.235 Sum_probs=58.7 Q ss_pred CCceEEEEccCCCCCCcccCCCCChhhhhh--ccchhhhhhcccceeEeecCccccccccc-------eEE----EEEEe Q lcl|NC_019515. 92 QGALQWEVGSGQSTWDDANPPAPNVTDKAL--ATPFFRKAIQLSDISFIDGANAVVSTVTN-------RIQ----IKVTF 158 (214) Q Consensus 92 ~pi~~~AVGtg~~~~~d~nppap~~~dT~L--~~E~fRkai~~sd~t~l~~~~av~~~~t~-------~l~----i~~~f 158 (214) -|| .|.||--++ |.-+...+-||+| ++-+.|. |..+-++.++.++......++ .++ ==|.| T Consensus 1 ~~~-~~~~~~~~~----~~~~~~~~~dteLdAVN~~L~a-IGEsPV~sld~~npdva~A~~IL~~v~~~vQ~llseGW~F 74 (245) T protein:vir:10 1 MPI-NWNVGPTPG----LASVNLDTVDTRLEAINLCLRA-VGYASIESEDSGDLDAADASKILATVGQRVQYNGGKGWWF 74 (245) T ss_pred Ccc-ccCCCCCCC----CcccccccccchHHHHHHHHHh-hcccccccccCCchhHHHHHHHHHHHHHHHHhhcCCCeeE Confidence 355 677877554 3334445567777 4556663 555666666555544322111 111 00001 Q ss_pred C-------cccCCccee-eeeee----ccC-cccc-cccCCCceEEEeeec-----CceecCCCcEEEEEEEEeC Q lcl|NC_019515. 159 L-------STEANGYLR-EFGIF----GGG-ADCK-VDVLGSGHMINRKTH-----GVIFKTSGMEITRTLRFTF 214 (214) Q Consensus 159 ~-------~~eanG~Lr-EaGLF----~~~-~~~t-~~~~~~G~MyNRvtF-----~pI~K~~d~~LTLtWeI~F 214 (214) - -..++|.|. -..-+ +.. ...+ .-+.+.|.||.|.+. .|+++..-++++++|.+-| T Consensus 75 Nte~~~~ltPd~~g~i~iP~n~L~v~~~~~~~~~~~~~v~RGgkLYD~~n~T~~F~~pv~~~~~~~v~iV~~~pF 149 (245) T protein:vir:10 75 NVEPNWQMTPDANGEILIPNNAIAAWQDVRYDDKKVLISIRGRKVYNMNTHSTDFSNSLNREGFFRMTFMLNLPF 149 (245) T ss_pred eecCCceeccCCCCceecCccchhhhcccccCCCccceEEcCCeeEecccCceeccCccccccceeEEEEeeCCh Confidence 0 011223211 11100 000 0011 134477899999854 5777777889999999999 No 2 >protein:vir:7020 Length: 246 # NCBI annotation: tail protein # Family: family:all:824 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853593;genbank:gi:31711675;genbank:GeneID:1481801 Probab=41.99 E-value=0.81 Score=21.09 Aligned_cols=133 Identities=14% Similarity=0.117 Sum_probs=65.6 Q ss_pred chhhhhhhhhhhhhhccceeeeeeEEEecccCceeehhhhhhHHHHHHHHHHHHHhhcCCCCCCceEEEEccCCCCCCcc Q lcl|NC_019515. 30 PKFIGQITDTIKFVKDGIYKVTEDITVNAKAGDVLELPVGYNTVVNEASKLIAALIKRHTGYQGALQWEVGSGQSTWDDA 109 (214) Q Consensus 30 ~~~~~~~~~~~~~~~~~~~~~~~dI~~~~~sG~vVe~~~~~N~Iv~sGr~LvA~Lf~~~~g~~pi~~~AVGtg~~~~~d~ 109 (214) -..|-|-+|. |.|-.|-.|...+. +-.+=|+|..+ =|-.||+.+..| T Consensus 1 ~~~~~~~~~~------~~~~~~~~~~~~~T------eLdAVN~~L~a------------IGEsPV~sld~~--------- 47 (246) T protein:vir:70 1 MPVIQQSSDV------GYIMSDASFSIIDS------KLEAVNLCMRA------------IGREGVDSLDSG--------- 47 (246) T ss_pred CCcccccccc------eEEeeccccccchh------hHHHHHHHHHh------------hCccccccccCC--------- Confidence 1234444443 34444444444332 33444555443 255677665432 Q ss_pred cCCCCChhhhhhccchhhhhhc---cccee--------EeecCccccccccceEEEEEEeCcccCCcceeeeeeeccCcc Q lcl|NC_019515. 110 NPPAPNVTDKALATPFFRKAIQ---LSDIS--------FIDGANAVVSTVTNRIQIKVTFLSTEANGYLREFGIFGGGAD 178 (214) Q Consensus 110 nppap~~~dT~L~~E~fRkai~---~sd~t--------~l~~~~av~~~~t~~l~i~~~f~~~eanG~LrEaGLF~~~~~ 178 (214) ||..|++... | .++-| .++ ...|. +.|+.+....-|.+.|++-.. |. ... T Consensus 48 n~d~~~a~~i-L-~~v~~-~vq~~lseGW~FNte~~~~ltPD~~g~I~iP~n~L~v~~~-------~~---------~~~ 108 (246) T protein:vir:70 48 DLDAEDASKM-L-DIVSQ-RFQYNKGGGWWFNREPNWRIVPDTNGEVNLPNNCLAVLQC-------YA---------LGE 108 (246) T ss_pred CccHHHHHHH-H-HHHHH-HHHHhccCCeeEeecCceeeccCCCCeEecCccceeeeec-------cC---------ccc Confidence 3334443322 2 22212 222 22343 444443333345555554331 11 001 Q ss_pred cc-cccCCCceEEEeee----cC-ceecCCCcEEEEEEEEeC Q lcl|NC_019515. 179 CK-VDVLGSGHMINRKT----HG-VIFKTSGMEITRTLRFTF 214 (214) Q Consensus 179 ~t-~~~~~~G~MyNRvt----F~-pI~K~~d~~LTLtWeI~F 214 (214) .+ .-+.+.|.||.|.. |. .++..+.++++++|.+-| T Consensus 109 ~~~~vv~RGgkLYD~~n~T~~F~~~~~~D~pv~v~IV~~~~F 150 (246) T protein:vir:70 109 RKVPMTMRAGKLYSTWNHTFDMRSHVNKDGAIRLTLLTYLPF 150 (246) T ss_pred CceeeEEcCCeeEeecccceecccccccCcceEEEEEecCCh Confidence 11 24567899999998 53 566677899999999999 No 3 >protein:vir:97033 Length: 245 # NCBI annotation: 32 # Family: family:all:824 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654133;genbank:gi:108862017;genbank:GeneID:5075982 Probab=36.90 E-value=1.1 Score=20.26 Aligned_cols=134 Identities=13% Similarity=0.087 Sum_probs=55.4 Q ss_pred CchhhhhhhhhhhhhhccceeeeeeEEEecccCceeehhhhhhHHHHHHHHHHHHHhhcCCCCCCceEEEEccCCCCCCc Q lcl|NC_019515. 29 KPKFIGQITDTIKFVKDGIYKVTEDITVNAKAGDVLELPVGYNTVVNEASKLIAALIKRHTGYQGALQWEVGSGQSTWDD 108 (214) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~~dI~~~~~sG~vVe~~~~~N~Iv~sGr~LvA~Lf~~~~g~~pi~~~AVGtg~~~~~d 108 (214) -| .|-|-.|.=....| .|-++.|= +-.+=|++.. .=|-.||+.+-=|+ T Consensus 1 ~~-~~~~~~~~~~~~~~-~~~~~~~T-----------eLdAVN~~L~------------aIGEsPV~sld~~~------- 48 (245) T protein:vir:97 1 MP-VIRQTSKLGHMMED-VAFQIIDS-----------KLEAVNLCMR------------AIGREGVDSLDSGD------- 48 (245) T ss_pred CC-ccccchhhhhhhhh-hhhhhhhh-----------hHHHHHHHHH------------hhCccccceecCCC------- Confidence 00 11111111000000 00011111 1112232211 12444555433222 Q ss_pred ccCCCCChhhhhhccchhh---hhhccccee--------EeecCccccccccceEEEEEEeCcccCCcceeeeeeeccCc Q lcl|NC_019515. 109 ANPPAPNVTDKALATPFFR---KAIQLSDIS--------FIDGANAVVSTVTNRIQIKVTFLSTEANGYLREFGIFGGGA 177 (214) Q Consensus 109 ~nppap~~~dT~L~~E~fR---kai~~sd~t--------~l~~~~av~~~~t~~l~i~~~f~~~eanG~LrEaGLF~~~~ 177 (214) |..|++ +.+....+ +.+....|. +.|+.+....-|.+.|++-.. |+.-+ T Consensus 49 --~~~~~v---a~al~~l~~~~r~vqseGW~FNte~~~~ltPD~~g~I~iP~n~L~v~~~-------~~~~~-------- 108 (245) T protein:vir:97 49 --LDAEDA---SKMIDIVSQRFQYNKGGGWWFNREPNWQLAPDTNGEVNLPNNCLAVLQC-------YALGE-------- 108 (245) T ss_pred --cchHHH---HHHHHHHHHHHHHHccCCeeEeecCCeeeccCCCCeEecCccceeeecc-------Ccccc-------- Confidence 233332 22222222 233333444 344433333345555543221 11100 Q ss_pred ccccccCCCceEEEeeec-----CceecCCCcEEEEEEEEeC Q lcl|NC_019515. 178 DCKVDVLGSGHMINRKTH-----GVIFKTSGMEITRTLRFTF 214 (214) Q Consensus 178 ~~t~~~~~~G~MyNRvtF-----~pI~K~~d~~LTLtWeI~F 214 (214) ....-+.+.|.||++.+. .||++...++++++|.+-| T Consensus 109 ~~~~~v~RggrLYD~~nhT~~F~~pi~~~~~~~v~Iv~~~pF 150 (245) T protein:vir:97 109 KKVPMTMRAGKLYSTWSHTFDMRKHVNANGMIRLTLLTLLPY 150 (245) T ss_pred ccceeEeccceEEeccccceecccccccCcceEEEEEeeCCh Confidence 001124567899999854 4899999999999999999 No 4 >protein:vir:105646 Length: 245 # NCBI annotation: putative tail tubular A protein # Family: family:all:824 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425010;genbank:gi:83571758;uniprot:Q2WC42;genbank:GeneID:3837287 Probab=36.85 E-value=1.1 Score=20.31 Aligned_cols=134 Identities=13% Similarity=0.088 Sum_probs=55.2 Q ss_pred CchhhhhhhhhhhhhhccceeeeeeEEEecccCceeehhhhhhHHHHHHHHHHHHHhhcCCCCCCceEEEEccCCCCCCc Q lcl|NC_019515. 29 KPKFIGQITDTIKFVKDGIYKVTEDITVNAKAGDVLELPVGYNTVVNEASKLIAALIKRHTGYQGALQWEVGSGQSTWDD 108 (214) Q Consensus 29 ~~~~~~~~~~~~~~~~~~~~~~~~dI~~~~~sG~vVe~~~~~N~Iv~sGr~LvA~Lf~~~~g~~pi~~~AVGtg~~~~~d 108 (214) -| .|-|-.|.=....| .|-++.|= +-.+=|++.. .=|-.||+.+-=|+ T Consensus 1 ~~-~~~~~~~~~~~~~~-~~~~~~~T-----------eLdAVN~~L~------------aIGEsPV~sld~~~------- 48 (245) T protein:vir:10 1 MP-VIRQTSKVGHMMED-VAFQIIDS-----------KLEAVNLCMR------------AIGREGVDSLDSGD------- 48 (245) T ss_pred CC-cccccchhhhhhhh-hhhhhhhh-----------hHHHHHHHHH------------hhCccccceecCCC------- Confidence 00 11111111000000 00011110 1112232211 12444555433222 Q ss_pred ccCCCCChhhhhhccchhh---hhhccccee--------EeecCccccccccceEEEEEEeCcccCCcceeeeeeeccCc Q lcl|NC_019515. 109 ANPPAPNVTDKALATPFFR---KAIQLSDIS--------FIDGANAVVSTVTNRIQIKVTFLSTEANGYLREFGIFGGGA 177 (214) Q Consensus 109 ~nppap~~~dT~L~~E~fR---kai~~sd~t--------~l~~~~av~~~~t~~l~i~~~f~~~eanG~LrEaGLF~~~~ 177 (214) |..|++ +.+....+ +.+....|. +.|+.+....-|.+.|++-.. |+.-+ T Consensus 49 --~~~~~v---a~al~~l~~~~r~vqseGW~FNte~~~~ltPD~~g~I~iP~n~L~v~~~-------~~~~~-------- 108 (245) T protein:vir:10 49 --LDAEDA---SKMIDIVSQRFQYNKGGGWWFNREPNWQIAPDTNGEVNLPNNCLAVLQC-------YALGE-------- 108 (245) T ss_pred --cchHHH---HHHHHHHHHHHHHHccCCeeEeecCCeeeccCCCCeEecCccceeeecc-------Ccccc-------- Confidence 233332 22222222 223333444 344433333345555543221 11100 Q ss_pred ccccccCCCceEEEeeec-----CceecCCCcEEEEEEEEeC Q lcl|NC_019515. 178 DCKVDVLGSGHMINRKTH-----GVIFKTSGMEITRTLRFTF 214 (214) Q Consensus 178 ~~t~~~~~~G~MyNRvtF-----~pI~K~~d~~LTLtWeI~F 214 (214) ....-+.+.|.||++.+. .||++...++++++|.+-| T Consensus 109 ~~~~~v~RggrLYD~~nhT~~F~~pi~~~~~~~v~Iv~~~pF 150 (245) T protein:vir:10 109 KKVPMTMRAGKLYSTWSHTFDMRKHVNANGMIRLTLLTLLPY 150 (245) T ss_pred ccceeEeccceEEeccccceecccccccCcceEEEEEeeCCh Confidence 001124567899999854 4899999999999999999 No 5 >protein:vir:78741 Length: 197 # NCBI annotation: tail tube A # Family: family:all:824 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285449;genbank:gi:148724483;genbank:GeneID:5220212 Probab=21.25 E-value=0.36 Score=23.04 Aligned_cols=107 Identities=17% Similarity=0.315 Sum_probs=50.8 Q ss_pred HHHHHHHHHHHH--HhhcCCCCCCceEEEEccCCCCCCcccCCCCChhhhhhccchhhh---hhcccce--------eEe Q lcl|NC_019515. 72 TVVNEASKLIAA--LIKRHTGYQGALQWEVGSGQSTWDDANPPAPNVTDKALATPFFRK---AIQLSDI--------SFI 138 (214) Q Consensus 72 ~Iv~sGr~LvA~--Lf~~~~g~~pi~~~AVGtg~~~~~d~nppap~~~dT~L~~E~fRk---ai~~sd~--------t~l 138 (214) |=++ -.+|=|= -|+ .=|-.||+.+-. .|| |.+.+..+.+. -+....| ++. T Consensus 1 m~~~-~teLdAVN~~L~-aIGEspV~sld~---------~np------dva~a~~iL~~v~~~vqseGW~FNte~~~~l~ 63 (197) T protein:vir:78 1 MASK-LTKLGAVNIVLT-NIGMAPVTLIDS---------NNP------MVATAQTILDEVSGSVQSEGWSYNTERAYPFI 63 (197) T ss_pred Cccc-hhHHHHHHHHHH-hhCCcccceeeC---------CCc------cHHHHHHHHHHHHHHHhhCCceEeecCCceec Confidence 1111 1122111 111 124456655421 222 22333332222 2232333 444 Q ss_pred ecCccccccccceEEEEEEeCcccCCcceeeeeeeccCcccccccCCCceEEEeeecCceecCCCcEEEEEEEEeC Q lcl|NC_019515. 139 DGANAVVSTVTNRIQIKVTFLSTEANGYLREFGIFGGGADCKVDVLGSGHMINRKTHGVIFKTSGMEITRTLRFTF 214 (214) Q Consensus 139 ~~~~av~~~~t~~l~i~~~f~~~eanG~LrEaGLF~~~~~~t~~~~~~G~MyNRvtF~pI~K~~d~~LTLtWeI~F 214 (214) |+.+....-|.+.+++... |. +-.+ -|.+.|.||.|.+..-..- +.++++++|-+-| T Consensus 64 pd~~g~I~~P~n~L~vd~~-------~~-~~~~----------~v~Rgg~LYD~~n~T~~F~-~pi~~~iv~~~~F 120 (197) T protein:vir:78 64 KDNTGRIAIPSNVLSLDCA-------ST-SKYD----------LIIRGGFLYDKAGHTDVFT-ENLELDVVWCFEF 120 (197) T ss_pred CCCCCeEecCccceEEecC-------CC-ceee----------EEEeCCeEEeccCCcEEeC-CceEEEEEeecCh Confidence 5444444446666654221 11 1111 2357889999999865553 6699999999999 No 6 >protein:vir:80215 Length: 211 # NCBI annotation: putative tail tubular protein A # Family: family:all:824 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522885;genbank:gi:158345178;genbank:GeneID:5687478 Probab=15.32 E-value=4 Score=17.30 Aligned_cols=108 Identities=9% Similarity=-0.010 Sum_probs=55.1 Q ss_pred ceeehhhhhhHHHHHHHHHHHHHhhcCCCCCCceEEEEccCCCCCCcccCCCCChhhhhhccchhhhhhcccce------ Q lcl|NC_019515. 62 DVLELPVGYNTVVNEASKLIAALIKRHTGYQGALQWEVGSGQSTWDDANPPAPNVTDKALATPFFRKAIQLSDI------ 135 (214) Q Consensus 62 ~vVe~~~~~N~Iv~sGr~LvA~Lf~~~~g~~pi~~~AVGtg~~~~~d~nppap~~~dT~L~~E~fRkai~~sd~------ 135 (214) +-..+-.+=|+|..+ =|-.||+.+-. .||.+..+-+ +..++.| .+....| T Consensus 1 ~~~teLdAVN~~L~a------------IGEsPV~sld~---------~npdva~a~~--iL~~v~r-~vqseGW~FNte~ 56 (211) T protein:vir:80 1 MQLTFLEAVNLVLRE------------LGETPVTSVDE---------TYPTLAQILP--AMEDARR-NTLAEGWWFNSFD 56 (211) T ss_pred CcchHHHHHHHHHHh------------hCccccccccC---------CchhHHHHHH--HHHHHHH-HHccCCeeEeecC Confidence 444444555555433 23446655432 1222222111 1223322 2332333 Q ss_pred --eEeecCccccccccceEEEEEEeCcccCCcceeeeeeeccCcccccccCCCceEEEeeecCceecCCCcEEEEEEEEe Q lcl|NC_019515. 136 --SFIDGANAVVSTVTNRIQIKVTFLSTEANGYLREFGIFGGGADCKVDVLGSGHMINRKTHGVIFKTSGMEITRTLRFT 213 (214) Q Consensus 136 --t~l~~~~av~~~~t~~l~i~~~f~~~eanG~LrEaGLF~~~~~~t~~~~~~G~MyNRvtF~pI~K~~d~~LTLtWeI~ 213 (214) ++.|+.+....-|.+.+++-.. |. +--+.+.|.||.|.+..-.. .+.++++++|-+- T Consensus 57 ~~~ltPd~~g~I~iP~n~L~v~~~-------~~-------------~~~~~Rgg~LYD~~n~T~~F-~~pi~v~iv~~~~ 115 (211) T protein:vir:80 57 DFTASPSPAGEVLLSEDTLAFYPD-------DV-------------EKFTWAGRYVRVTGTGSKVV-GAPVKGRVVLDIP 115 (211) T ss_pred CceeccCCCCeEecCccceEEeeC-------CC-------------eeeeeeCceEEeccCCcEee-CCceEEEEEeecC Confidence 4455444444446666654221 11 11234678999999986555 4669999999999 Q ss_pred C Q lcl|NC_019515. 214 F 214 (214) Q Consensus 214 F 214 (214) | T Consensus 116 F 116 (211) T protein:vir:80 116 Y 116 (211) T ss_pred h Confidence 9 No 7 >protein:vir:78929 Length: 184 # NCBI annotation: putative tail tubular protein A # Family: family:all:824 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522825;genbank:gi:158345060;genbank:GeneID:5687419 Probab=14.57 E-value=1.2 Score=20.11 Aligned_cols=102 Identities=10% Similarity=0.036 Sum_probs=53.2 Q ss_pred eehhhhhhHHHHHHHHHHHHHhhcCCCCCCceEEEEccCCCCCCcccCCCCChhhhhhccchhhh---hhccc------- Q lcl|NC_019515. 64 LELPVGYNTVVNEASKLIAALIKRHTGYQGALQWEVGSGQSTWDDANPPAPNVTDKALATPFFRK---AIQLS------- 133 (214) Q Consensus 64 Ve~~~~~N~Iv~sGr~LvA~Lf~~~~g~~pi~~~AVGtg~~~~~d~nppap~~~dT~L~~E~fRk---ai~~s------- 133 (214) +.+-.+=|+|+.+ =|-.||+.+.. .|| |.+.+..+.++ .+... T Consensus 1 ~teLdAVN~~L~a------------IGEspV~sld~---------~np------dva~a~~iL~~v~~~vqseGW~FNte 53 (184) T protein:vir:78 1 MLLLDAVNVILRK------------IGELPIPSMDE---------TYP------TMAIALPELEDQRIQLLTQGWWFNTW 53 (184) T ss_pred CchHHHHHHHHHh------------hCCcccccccC---------CCc------cHHHHHHHHHHHHHHHhhCCceEeec Confidence 3333333433221 23345554432 232 22333333222 22222 Q ss_pred -ceeEeecCccccccccceEEEEEEeCcccCCcceeeeeeeccCcccccccCCCceEEEeeecCceecCCCcEEEEEEEE Q lcl|NC_019515. 134 -DISFIDGANAVVSTVTNRIQIKVTFLSTEANGYLREFGIFGGGADCKVDVLGSGHMINRKTHGVIFKTSGMEITRTLRF 212 (214) Q Consensus 134 -d~t~l~~~~av~~~~t~~l~i~~~f~~~eanG~LrEaGLF~~~~~~t~~~~~~G~MyNRvtF~pI~K~~d~~LTLtWeI 212 (214) .+++.|+.+....-|.+.+++-. +|. --+-+.|.||.|.+..-.. ++.++++++|-+ T Consensus 54 ~~~~ltPd~~g~I~~P~n~L~i~~-------~~~--------------d~~~Rgg~lYD~~n~T~~F-~~~i~~~iv~~~ 111 (184) T protein:vir:78 54 WKHKLTPDPQGRINLPKDTLAFYP-------DSP--------------DLQWDGLGVRDANTGDDRI-GKSVEGRLVLSR 111 (184) T ss_pred CCeeeeecCCCeEEcCccceEeec-------CCc--------------eeEEcCcEEEeccCCcEEe-CCeeEEEEEeec Confidence 34555655555555666666532 110 0123578999999886555 588999999999 Q ss_pred eC Q lcl|NC_019515. 213 TF 214 (214) Q Consensus 213 ~F 214 (214) -| T Consensus 112 ~F 113 (184) T protein:vir:78 112 EW 113 (184) T ss_pred Ch Confidence 99 No 8 >protein:vir:6325 Length: 184 # NCBI annotation: tail tubular protein A # Family: family:all:824 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877472;genbank:gi:33300844;uniprot:Q7Y2D2;genbank:GeneID:1482614 Probab=13.44 E-value=4.6 Score=16.93 Aligned_cols=105 Identities=10% Similarity=-0.001 Sum_probs=53.6 Q ss_pred eehhhhhhHHHHHHHHHHHHHhhcCCCCCCceEEEEccCCCCCCcccCCCCChhhhhhccchhhhhhcccce-------- Q lcl|NC_019515. 64 LELPVGYNTVVNEASKLIAALIKRHTGYQGALQWEVGSGQSTWDDANPPAPNVTDKALATPFFRKAIQLSDI-------- 135 (214) Q Consensus 64 Ve~~~~~N~Iv~sGr~LvA~Lf~~~~g~~pi~~~AVGtg~~~~~d~nppap~~~dT~L~~E~fRkai~~sd~-------- 135 (214) +.+-.+=|+|..+ =|-.||+.+..| ||.+.++. ..| .++ |+.+....| T Consensus 1 ~teL~AVN~~L~a------------IGespV~sld~~---------npdva~a~-~iL-~~v-~~~vqs~GW~FNte~~~ 56 (184) T protein:vir:63 1 MLLLDAVNVILRK------------IGELPTLSMDET---------YPTMAIAL-PEL-EDQ-RIQLLTQGWWFNTWWRH 56 (184) T ss_pred CchHHHHHHHHHh------------hCccccceecCC---------CccHHHHH-HHH-HHH-HHHHhcCCceEeecCCc Confidence 4444454555443 244566654432 22222211 112 222 223333333 Q ss_pred eEeecCccccccccceEEEEEEeCcccCCcceeeeeeeccCcccccccCCCceEEEeeecCceecCCCcEEEEEEEEeC Q lcl|NC_019515. 136 SFIDGANAVVSTVTNRIQIKVTFLSTEANGYLREFGIFGGGADCKVDVLGSGHMINRKTHGVIFKTSGMEITRTLRFTF 214 (214) Q Consensus 136 t~l~~~~av~~~~t~~l~i~~~f~~~eanG~LrEaGLF~~~~~~t~~~~~~G~MyNRvtF~pI~K~~d~~LTLtWeI~F 214 (214) ++.|+.+....-|.+.+++-- +.+. -+-+.|.||.|.+..-.. ++.++++++|-+-| T Consensus 57 ~ltPd~~g~I~~P~n~L~v~~------~~~d---------------~~~Rgg~LyD~~n~t~~F-~~~i~v~iv~~~~F 113 (184) T protein:vir:63 57 KLTPDPTGRINLPKGTLAFYP------DSPD---------------LQWDGLGVRDANTGDDRI-GKPVEGRLVLSREW 113 (184) T ss_pred eeeecCCCeEEcCcceeeeec------CCCc---------------eEEcCCEEEeccCCcEEe-CCceEEEEEeecCh Confidence 344444444444555555421 0111 123578999999885555 58899999999999 No 9 >protein:vir:1542 Length: 196 # NCBI annotation: tail tubular protein A # Family: family:all:824 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052110;swissprot:trembl:q9t106;genbank:gi:9634036;uniprot:Q9T106;genbank:GeneID:1262371 Probab=8.72 E-value=7.5 Score=15.77 Aligned_cols=116 Identities=18% Similarity=0.254 Sum_probs=49.7 Q ss_pred hhhhhHHHHHHHHHHH--HHhhcCCCCCCceEEEEccCCCCCCcccCCCCChhhhhhccchhhhhhcccce--------e Q lcl|NC_019515. 67 PVGYNTVVNEASKLIA--ALIKRHTGYQGALQWEVGSGQSTWDDANPPAPNVTDKALATPFFRKAIQLSDI--------S 136 (214) Q Consensus 67 ~~~~N~Iv~sGr~LvA--~Lf~~~~g~~pi~~~AVGtg~~~~~d~nppap~~~dT~L~~E~fRkai~~sd~--------t 136 (214) -+-+-+=+.+.-+|=| .-|+ .=|-.||+.+.. +.||...++. ..| .++ |+.+....| + T Consensus 1 ~~~~~~~~~~~teLdAVN~~L~-aIGEspV~sld~--------~~npdva~a~-~iL-~~v-~~~vqseGW~FNte~~~~ 68 (196) T protein:vir:15 1 MRSYEMNIETAEELSAVNDILA-SIGEPPVSTLEG--------DANADVANAR-RVL-NKI-NRQIQSRGWTFNIEEGVT 68 (196) T ss_pred CCccccchhhhhhhHHHHHHHH-hcCccccccccC--------CCCccHHHHH-HHH-HHH-HHHHhhCCceEeecCCce Confidence 1111111222222222 1111 123345544322 1233222211 112 122 222333333 3 Q ss_pred EeecC-ccccccccceEEEEEEeCcccCCcceeeeeeeccCcccccccCCCceEEEeeecCceecCCCcEEEEEEEEeC Q lcl|NC_019515. 137 FIDGA-NAVVSTVTNRIQIKVTFLSTEANGYLREFGIFGGGADCKVDVLGSGHMINRKTHGVIFKTSGMEITRTLRFTF 214 (214) Q Consensus 137 ~l~~~-~av~~~~t~~l~i~~~f~~~eanG~LrEaGLF~~~~~~t~~~~~~G~MyNRvtF~pI~K~~d~~LTLtWeI~F 214 (214) +.|+. +....-|.+.+++... + . ...-+.+.|.||.|.+..-.. ++.++++++|-+-| T Consensus 69 ltPD~~~g~I~vP~n~L~v~~~-------~-----------~-~~~~v~Rgg~LYD~~n~T~~F-~~pi~v~iv~~~~F 127 (196) T protein:vir:15 69 LLPDAFSGMIPFSSDYLSVMAT-------S-----------G-QTQYINRGGYLYDRSAKTDRF-PSGVQVNLIRLREF 127 (196) T ss_pred eeecCCCCeEecCcceeEEecC-------C-----------C-ceeEEEcCCeEEeccCCcEEe-CCceEEEEEeecCh Confidence 44432 2222335555553321 1 0 112355788999999885543 35699999999999 No 10 >protein:vir:3365 Length: 196 # NCBI annotation: tail tubular protein A # Family: family:all:824 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523336;swissprot:trembl:q8w5u4;genbank:gi:17570827;uniprot:Q8W5U4;genbank:GeneID:927450 Probab=8.48 E-value=7.8 Score=15.70 Aligned_cols=116 Identities=18% Similarity=0.242 Sum_probs=49.6 Q ss_pred hhhhhHHHHHHHHHHH--HHhhcCCCCCCceEEEEccCCCCCCcccCCCCChhhhhhccchhhhhhccccee-------- Q lcl|NC_019515. 67 PVGYNTVVNEASKLIA--ALIKRHTGYQGALQWEVGSGQSTWDDANPPAPNVTDKALATPFFRKAIQLSDIS-------- 136 (214) Q Consensus 67 ~~~~N~Iv~sGr~LvA--~Lf~~~~g~~pi~~~AVGtg~~~~~d~nppap~~~dT~L~~E~fRkai~~sd~t-------- 136 (214) -+-+-+=+.+.-+|=| .-|+ .=|-.||+.+..- .||...++. ..| .++ |+.+....|. T Consensus 1 ~~~~~~~~~~~teLdAVN~~L~-aIGEspV~sld~~--------~npdva~a~-~iL-~~v-~~~vqseGW~FNte~~~~ 68 (196) T protein:vir:33 1 MRSYEMNIETAEELSAVNDILA-SIGEPPVSTLEGD--------ANADVANAR-RVL-NKI-NRQIQSRGWTFNIEEGVT 68 (196) T ss_pred CCccccchhhhhhhHHHHHHHH-hcCccccccccCC--------CCccHHHHH-HHH-HHH-HHHHhhCCceEeecCcee Confidence 1111111222222221 1111 1233455443221 122222211 112 121 2223333333 Q ss_pred EeecC-ccccccccceEEEEEEeCcccCCcceeeeeeeccCcccccccCCCceEEEeeecCceecCCCcEEEEEEEEeC Q lcl|NC_019515. 137 FIDGA-NAVVSTVTNRIQIKVTFLSTEANGYLREFGIFGGGADCKVDVLGSGHMINRKTHGVIFKTSGMEITRTLRFTF 214 (214) Q Consensus 137 ~l~~~-~av~~~~t~~l~i~~~f~~~eanG~LrEaGLF~~~~~~t~~~~~~G~MyNRvtF~pI~K~~d~~LTLtWeI~F 214 (214) +.|+. +....-|.+.+++... + . ...-|.+.|.||.|.+..-.. ++.++++++|-+-| T Consensus 69 ltPD~~~g~I~vP~n~L~v~~~-------~-----------~-~~~~v~Rgg~LYD~~n~T~~F-~~pi~v~iv~~~~F 127 (196) T protein:vir:33 69 LLPDAFSGMIPFSSDYLSVMAT-------S-----------G-QTQYVNRGGYLYDRSAKTDRF-PSGVQVNLIRLREF 127 (196) T ss_pred EeeeCCCCeEecCcceeEEecC-------C-----------C-ceeEEEcCCeEEeccCCcEEe-CCceEEEEEeecCh Confidence 44432 2233345555554321 1 0 112355788999999885543 35699999999999 Done!