Query lcl|Aclame:protein:vir:79389|NCBI_annot:gp056|genbank:acc:YP_001469055;genbank:gi:157311060;genbank:GeneID:5602048 Match_columns 324 No_of_seqs 22 out of 37 Neff 3.7 Searched_HMMs 1612 Date Mon Dec 2 09:22:49 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_128 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_128_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:122 Length: 293 # 100.0 1.9E-55 1.2E-58 320.5 14.9 264 38-322 1-293 (293) 2 protein:vir:5206 Length: 293 # 100.0 1.8E-54 1.1E-57 315.2 15.2 263 38-322 1-293 (293) 3 protein:vir:79392 Length: 240 100.0 6.7E-54 4.2E-57 312.1 13.8 214 64-324 1-214 (240) 4 protein:vir:18 Length: 287 # N 100.0 3.2E-50 2E-53 291.9 17.7 262 38-323 1-287 (287) 5 protein:vir:4733 Length: 194 # 99.9 1.1E-32 6.6E-36 195.9 6.3 194 47-323 1-194 (194) 6 protein:vir:9604 Length: 236 # 99.9 3.8E-27 2.3E-30 165.5 9.0 219 47-319 1-236 (236) 7 protein:vir:9444 Length: 251 # 98.5 1.3E-09 8.2E-13 69.3 9.2 219 40-320 1-251 (251) 8 protein:vir:9465 Length: 251 # 98.5 1.3E-09 8.2E-13 69.3 9.2 219 40-320 1-251 (251) 9 protein:vir:97357 Length: 251 98.5 1.3E-09 8.2E-13 69.3 9.2 219 40-320 1-251 (251) 10 protein:vir:9604 Length: 236 # 81.7 0.03 1.8E-05 29.0 6.4 212 62-324 1-232 (236) 11 protein:vir:18 Length: 287 # N 74.6 0.16 9.7E-05 25.0 12.7 244 56-324 1-283 (287) No 1 >protein:vir:122 Length: 293 # NCBI annotation: lower collar protein # Family: family:all:5217 # MgeID: mge:4 # MgeName: B103 # Cross-refs: genbank:acc:NP_690645;swissprot:sw:q37892;genbank:gi:22855159;uniprot:Q37892;genbank:GeneID:955372 Probab=100.00 E-value=1.9e-55 Score=320.53 Aligned_cols=264 Identities=14% Similarity=0.165 Sum_probs=172.6 Q ss_pred hhhhhhhhhhhhHHHhhhhccCCcCccccchhhccchhhHHHHHhhccccccccCccccchhhhHHHHHHHHHHhhhhhh Q lcl|Aclame:pro 38 LAHRRYETRMTTRDLLLGLSDYPDNRLYVPTEELLDEVFDQLIEITRIKPLVVFNDKEMDSQLTYKIMEDIFVLTEDREI 117 (324) Q Consensus 38 ~~~~~~~~~~~~~~~l~g~~~~~~n~~~~~~~e~i~k~~~~~~~~~~~~~~~~f~yp~~de~~~~~Fe~~fi~~fy~rEI 117 (324) +| - --|-.|-||++++-| ++. ++..|+|||+|++||+ |.||+|||+|||+||+||||||||||| T Consensus 1 ~~--~--~~~~~~~~~~~~~~~-~~~--~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 64 (293) T protein:vir:12 1 MA--S--YTMKLSTYIEMWSQY-ETG--LSMAEKIEKGRPKLFD---------FQYPIFDESYRKVFETHFIRNFYMREI 64 (293) T ss_pred Cc--c--eeehHHHHHHHHhhc-cCc--cchhhhhhhhhhhhhh---------ccCCcccchHHHHHHHHHHHHHHHHHh Confidence 11 1 246788999999933 333 5778889999999999 999999999999999999999999999 Q ss_pred ccccHHHHHHHHHHHHHHhchHHHHHHHHHHHHHhhccccccccccccccccccccccCcCc------------cccccc Q lcl|Aclame:pro 118 LKDTAGAWFRMFAATWHRYAELDEQHLRQIFQGIYRDHDSTSNAQSSSSGGSTSTSENTSKN------------QSTNDG 185 (324) Q Consensus 118 GfET~~~Fk~~L~~~l~~~Mp~y~~~~n~l~e~~~~~~D~~gNs~gssTg~~~s~se~~S~s------------qSts~g 185 (324) ||||+|+|||+|++||+++|| |||+|||+++++++|+.++....++.....++..... .+.+.+ T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~----~~n~~~~s~~~~t~~~nnT~~~~tsns~~sTd~~~nt~sd~st~~ng~s~ttts~ 140 (293) T protein:vir:12 65 GFETEGLFKFNLETWLIINMP----YFNKLFESELIKYDPLENTRLNTTGNKKNDTERNDNRDTTGSMKADGKSNTKTSD 140 (293) T ss_pred hccchhHHHHHHHHHHhhhcc----hhcchhhccccccCCCcccccccccCcccccCCCCCcCcccccccccccccCCCC Confidence 999999999999999999999 7799999999999999998755554422222221111 111111 Q ss_pred ccccccccc----cccCccCcccccCcccCCCcceeeecccccccccccccccccccccCccccceeccccccccceecc Q lcl|Aclame:pro 186 QSDSDSKSD----SASVGTTDSDAKNILSTMPQDKVFIKGDPLVQDVEYADNVTMNTSQGRDNSSQKSSNLSASHDWSWN 261 (324) Q Consensus 186 kT~ssSnT~----SSt~GtndS~nrti~Sdtpdsrl~g~td~~~~~ieYA~siskn~StskS~TtGksnS~TsS~s~s~~ 261 (324) .+..+.++. +...+++++.+.+...++|++.....+.++....+.+.+...+.+++...+++..+..+.+...+.+ T Consensus 141 ~t~~s~~t~s~~ntt~s~t~~stntt~stdt~~s~t~~tt~d~~~t~d~~Ttt~d~stts~~nTt~~~ntts~sNt~~tg 220 (293) T protein:vir:12 141 KTNATGSSKEDGKTTGSVTDDNFNRKIDSDQPDSRLNLTTNDGQGTLEYASAIEENNTNNKRNTTGTNNVTSSAESESTG 220 (293) T ss_pred CccccccccCccccCCcccCCCCCCCCCcCCCCCccccccCCCCCccCCcccccccccCCCcCCCCCCccCCCCCCCCCc Confidence 111111111 1222355567778888888888888888777777777666555544444443333222211111111 Q ss_pred ccccccccc------------cccccccccccc-ccCCCCcchhHHHHHHHhhhccchhhhHHHHHHHHHHHhh Q lcl|Aclame:pro 262 GSSSGSYGR------------NVGSNTSHSNNQ-SHSQGMSQSVYNTYRQWHDSSLDMTGGMYYQLVRAGLWSM 322 (324) Q Consensus 262 ~SsSds~gs------------n~gaN~a~ssnv-t~T~G~N~gV~qs~~~h~~s~~d~~~~~~~~~~~~~~~~~ 322 (324) .+.++.... ..++..+..++. ...+| ++.+.++++.+++..|.|.++||++|.+++|+-. T Consensus 221 sttsd~nTt~~sntTssd~tns~~nst~~~ns~stG~sg-s~S~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~ 293 (293) T protein:vir:12 221 SGTSDTVTTDNANTTTNDKLNSQINNVEDYIESKIGKSG-TQSYASLVQDYRAALLRIEKRIFDEMQELFMLVY 293 (293) T ss_pred ccCCccccccccccccccccCCCCccccccccccccccc-ccccccccchhhhHHHHHHHHHHHHHHHHhhhcC Confidence 111111000 011111111111 12222 5567788999999999999999999999998776 No 2 >protein:vir:5206 Length: 293 # NCBI annotation: lower collar protein # Family: family:all:5217 # MgeID: mge:116 # MgeName: PZA # Cross-refs: genbank:acc:NP_040729;genbank:gi:9626400;genbank:GeneID:1260971 Probab=100.00 E-value=1.8e-54 Score=315.23 Aligned_cols=263 Identities=13% Similarity=0.147 Sum_probs=165.7 Q ss_pred hhhhhhhhhhhhHHHhhhhccCCcCccccchhhccchhhHHHHHhhccccccccCccccchhhhHHHHHHHHHHhhhhhh Q lcl|Aclame:pro 38 LAHRRYETRMTTRDLLLGLSDYPDNRLYVPTEELLDEVFDQLIEITRIKPLVVFNDKEMDSQLTYKIMEDIFVLTEDREI 117 (324) Q Consensus 38 ~~~~~~~~~~~~~~~l~g~~~~~~n~~~~~~~e~i~k~~~~~~~~~~~~~~~~f~yp~~de~~~~~Fe~~fi~~fy~rEI 117 (324) +| - --|-.|-||+++++ .||+| ++.|+|||+|++||+ |+||||||+|||+||+||||||||||| T Consensus 1 ~~--~--~~~~~~~~~~~~~~-~~~~~--~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 64 (293) T protein:vir:52 1 MS--S--YTMQLRTYIEMWSQ-GETGL--STAEKIEKGRPKLFD---------FNYPIFDESYRTIFETHFIRNFYMREI 64 (293) T ss_pred Cc--c--eeehHhHHHhhhhh-cCCcc--cccchhhhhhhhhhc---------cCCCccchhHHHHHHHHHHHHHHHHHh Confidence 11 1 24778999999998 67743 556789999999999 999999999999999999999999999 Q ss_pred ccccHHHHHHHHHHHHHHhchHHHHHHHHHHHHHhhccccccccccccccccccccccCc----------Cccccccccc Q lcl|Aclame:pro 118 LKDTAGAWFRMFAATWHRYAELDEQHLRQIFQGIYRDHDSTSNAQSSSSGGSTSTSENTS----------KNQSTNDGQS 187 (324) Q Consensus 118 GfET~~~Fk~~L~~~l~~~Mp~y~~~~n~l~e~~~~~~D~~gNs~gssTg~~~s~se~~S----------~sqSts~gkT 187 (324) ||||+|+|||+|+.||+++|| |||++|+++.+.++|+.|+....+++....+.... .+.+.+++++ T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~----~~n~~~~s~~~nt~p~dNt~~~~ttn~~~sTs~d~st~tdss~t~d~~stTds~t 140 (293) T protein:vir:52 65 GFETEGLFKFHLETWLMINMP----YFNKLFESELIKYDPLENTRVGVKSNTKNDTDRNDNRDVKQDLTSNGTSSTDAKQ 140 (293) T ss_pred hccchHHHHHHHHHHHhhhcc----cccccccccccccCCcccccccccccccccccccCCcccCCccccCCccCCCCCc Confidence 999999999999999999999 66999999999999998886544443221111111 1111111111 Q ss_pred cccccc------ccccCccCcccccCcccCCCcceeeecccccccccccccccccccccCccccceeccccccccceecc Q lcl|Aclame:pro 188 DSDSKS------DSASVGTTDSDAKNILSTMPQDKVFIKGDPLVQDVEYADNVTMNTSQGRDNSSQKSSNLSASHDWSWN 261 (324) Q Consensus 188 ~ssSnT------~SSt~GtndS~nrti~Sdtpdsrl~g~td~~~~~ieYA~siskn~StskS~TtGksnS~TsS~s~s~~ 261 (324) .+.++. ...+..+.++.......+++++.......+.....+.+.+...+.++....++..+++.+...+.+.+ T Consensus 141 ~~~s~~tanst~ta~snsT~~s~t~~~ssdT~ns~~~~tssn~~~s~d~atst~d~nst~t~nsttsnnstt~~nt~~ts 220 (293) T protein:vir:52 141 NDTSKTTGNEKSSGSGSITDDNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTTDTTSNTTGTS 220 (293) T ss_pred CCCcccccCcccCCCcccCCCCCcCCccccCCcccccccccCCCCcCCcccccccccccCCcCCCccccccCCccCCccc Confidence 111111 11122234445555667777777766666665555555554444433332222222222111111000 Q ss_pred ------------cccccccccccccccccccccccCCCC--cchhHHHHHHHhhhccchhhhHHHHHHHHHHHhh Q lcl|Aclame:pro 262 ------------GSSSGSYGRNVGSNTSHSNNQSHSQGM--SQSVYNTYRQWHDSSLDMTGGMYYQLVRAGLWSM 322 (324) Q Consensus 262 ------------~SsSds~gsn~gaN~a~ssnvt~T~G~--N~gV~qs~~~h~~s~~d~~~~~~~~~~~~~~~~~ 322 (324) .+.++......++.. ....+..-|. ++..-+.+..||+..|+++.+||++|++++||-. T Consensus 221 ttt~~~nTTn~sntts~snsts~~nst--~~snt~~sg~~gSvS~a~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 293 (293) T protein:vir:52 221 TLDSDSKTSNKANTTSNDKLNSQINSV--EDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQELFMLVY 293 (293) T ss_pred CCCCcccccCCcccCcCCCCCCccccc--cccCccccccccCccchhhhhhHHHHHHhHHHHHHHHHHHHHhhcC Confidence 001111111111111 1122222233 4566789999999999999999999999998876 No 3 >protein:vir:79392 Length: 240 # NCBI annotation: lower collar # Family: family:all:5217 # MgeID: mge:1869 # MgeName: Av-1 # Cross-refs: genbank:acc:YP_001333665;genbank:gi:151266302;genbank:GeneID:5329873 Probab=100.00 E-value=6.7e-54 Score=312.09 Aligned_cols=214 Identities=17% Similarity=0.180 Sum_probs=153.0 Q ss_pred cccchhhccchhhHHHHHhhccccccccCccccchhhhHHHHHHHHHHhhhhhhccccHHHHHHHHHHHHHHhchHHHHH Q lcl|Aclame:pro 64 LYVPTEELLDEVFDQLIEITRIKPLVVFNDKEMDSQLTYKIMEDIFVLTEDREILKDTAGAWFRMFAATWHRYAELDEQH 143 (324) Q Consensus 64 ~~~~~~e~i~k~~~~~~~~~~~~~~~~f~yp~~de~~~~~Fe~~fi~~fy~rEIGfET~~~Fk~~L~~~l~~~Mp~y~~~ 143 (324) |-|.+--+ .+..|+.. +-.-.=+||+|||+||++||++|||+||||||||||+++|||+|++||+++|| | T Consensus 1 ~~~~~~~~-----~~~~~~~~-~~~~~~~YPIfDesYrk~FEt~Fir~FYmrEIGFETeg~Fkf~Le~wL~lnMP----y 70 (240) T protein:vir:79 1 MSVTTIML-----RDVVKLTN-DHIGLDNYPIFDESYRKTLNDRIKREYWLQEIAHETIDIFIWRMSLRMDLIMP----R 70 (240) T ss_pred CchhhhHH-----HHHHHhhc-ccccccccCccchHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHhhcc----h Confidence 21211111 11222211 11223479999999999999999999999999999999999999999999999 5 Q ss_pred HHHHHHHHhhccccccccccccccccccccccCcCcccccccccccccccccccCccCcccccCcccCCCcceeeecccc Q lcl|Aclame:pro 144 LRQIFQGIYRDHDSTSNAQSSSSGGSTSTSENTSKNQSTNDGQSDSDSKSDSASVGTTDSDAKNILSTMPQDKVFIKGDP 223 (324) Q Consensus 144 ~n~l~e~~~~~~D~~gNs~gssTg~~~s~se~~S~sqSts~gkT~ssSnT~SSt~GtndS~nrti~Sdtpdsrl~g~td~ 223 (324) ||++||+|+++|||+.|.+..++.+..+.++.+. ......+.++++++++|++++|+||+||++.. T Consensus 71 yNk~~esEl~~YdPLen~r~~s~T~~d~r~~~sG------------~~~etG~gs~tddn~kr~~~sDtPDtRL~~Dg-- 136 (240) T protein:vir:79 71 YNRMYLAELQNTDPLEGNRHYSRTGQDGRSQNSG------------INHQTGSGSGTNESKGRTVGSDTPQTRLAGDG-- 136 (240) T ss_pred hHHHHHHHhhccccccccccccccCCccceeecC------------ccccccccccccccccccccCCCcchhhhccc-- Confidence 6899999999999998777554432211111111 12223355678889999999999999998422 Q ss_pred cccccccccccccccccCccccceeccccccccceecccccccccccccccccccccccccCCCCcchhHHHHHHHhhhc Q lcl|Aclame:pro 224 LVQDVEYADNVTMNTSQGRDNSSQKSSNLSASHDWSWNGSSSGSYGRNVGSNTSHSNNQSHSQGMSQSVYNTYRQWHDSS 303 (324) Q Consensus 224 ~~~~ieYA~siskn~StskS~TtGksnS~TsS~s~s~~~SsSds~gsn~gaN~a~ssnvt~T~G~N~gV~qs~~~h~~s~ 303 (324) +||++++...+++++.+ .+.|++++++++++.+ +..+.++|+.-.+.-.+++||++. T Consensus 137 -----dYAS~isd~~t~~~s~s------~~dSdS~t~st~n~sn------------Nq~~~s~Gk~G~~syai~eyR~al 193 (240) T protein:vir:79 137 -----DYATSISDASTGGSSTS------RNESDSTSSSTSNYSN------------NQNSESWGYSGSKARAIAEYRSTL 193 (240) T ss_pred -----hhhhhhhhhhcCCcccc------cccccccccccccccc------------ccchhhhcccchHHHHHHHHHHHH Confidence 89999988666655554 3334433333333322 235566798888888999999999 Q ss_pred cchhhhHHHHHHHHHHHhhcC Q lcl|Aclame:pro 304 LDMTGGMYYQLVRAGLWSMFC 324 (324) Q Consensus 304 ~d~~~~~~~~~~~~~~~~~~~ 324 (324) |+++.+||++|.+++|+-.=. T Consensus 194 L~ve~~if~em~eLFM~vy~~ 214 (240) T protein:vir:79 194 LNVDDLVIRELSDLFMGIWDG 214 (240) T ss_pred HhHHHHHHHHHHHHhhhhccC Confidence 999999999999999875433 No 4 >protein:vir:18 Length: 287 # NCBI annotation: lower collar protein # Family: family:all:5217 # MgeID: mge:323 # MgeName: GA-1 # Cross-refs: genbank:acc:NP_073694;swissprot:trembl:q9fzw4;genbank:gi:12248118;uniprot:Q9FZW4;genbank:GeneID:919889 Probab=100.00 E-value=3.2e-50 Score=291.90 Aligned_cols=262 Identities=13% Similarity=0.152 Sum_probs=168.7 Q ss_pred hhhhhhhhhhhhHHHhhhhccCCcCccccchhhccchhhHHHHHhhccccccccCccccchhhhHHHHHHHHHHhhhhhh Q lcl|Aclame:pro 38 LAHRRYETRMTTRDLLLGLSDYPDNRLYVPTEELLDEVFDQLIEITRIKPLVVFNDKEMDSQLTYKIMEDIFVLTEDREI 117 (324) Q Consensus 38 ~~~~~~~~~~~~~~~l~g~~~~~~n~~~~~~~e~i~k~~~~~~~~~~~~~~~~f~yp~~de~~~~~Fe~~fi~~fy~rEI 117 (324) +| ---|-.|-+|++++++.++ +++.|+|||+|++||+ |+||+ |++|||+||++|||+|||||| T Consensus 1 ~~----~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~---------~~y~~-~~~~~~~~~~~~~~~~~~~~i 63 (287) T protein:vir:18 1 MA----SFTMPLREIVEWATQFDNK---LTRNEKIEEGRKKLFD---------FFYPI-ETDYKKEFETKFIKHFYFREI 63 (287) T ss_pred Cc----ceeehHHHHHHHHhhhccc---chhhHHHhhhhhhhhh---------hcCcc-chHHHHHHHHHHHHHHHHHHH Confidence 11 1246789999999999988 3456889999999999 99999 999999999999999999999 Q ss_pred ccccHHHHHHHHHHHHHHhchHHHHHHHHHHHHHhhcccccccccccccccccccccc----CcCccccccccccccccc Q lcl|Aclame:pro 118 LKDTAGAWFRMFAATWHRYAELDEQHLRQIFQGIYRDHDSTSNAQSSSSGGSTSTSEN----TSKNQSTNDGQSDSDSKS 193 (324) Q Consensus 118 GfET~~~Fk~~L~~~l~~~Mp~y~~~~n~l~e~~~~~~D~~gNs~gssTg~~~s~se~----~S~sqSts~gkT~ssSnT 193 (324) ||||+++|||+|+.||+++|| |||++||+++++.+|..++.....++.-..+.. ...+....++++.+.+++ T Consensus 64 ~~~~~~~~~~~~~~~~~~~mp----~~n~~~ese~vntN~~~nTdant~tNtD~nTt~ndn~dtdsnt~ad~ntntdtnT 139 (287) T protein:vir:18 64 GFETEGRFKFALEEWLNLNMP----YWNKIIESTHLDYNPLYNVDYKKDSDLIRNLDQVDNRVTDSKIENNGKASSESNV 139 (287) T ss_pred hhhhHHHHHHHHHHHHHhhcc----hhhhhHhhhhccCCccccccccCCCCcccCCCCCCCcccccCcccCCCcCCCCCC Confidence 999999999999999999999 899999999999999987754433221111111 122223333333333333 Q ss_pred ccccCc----------cCcccccCcccCCCcceeeecccccccccccccccccccccCccccceecccccccc------c Q lcl|Aclame:pro 194 DSASVG----------TTDSDAKNILSTMPQDKVFIKGDPLVQDVEYADNVTMNTSQGRDNSSQKSSNLSASH------D 257 (324) Q Consensus 194 ~SSt~G----------tndS~nrti~Sdtpdsrl~g~td~~~~~ieYA~siskn~StskS~TtGksnS~TsS~------s 257 (324) .+...+ -.+.-+.....+.|++...+.++.. +.+++..+....++....+.+.+++.+.+. + T Consensus 140 ntd~nTtantdtntD~NTt~n~~tnt~dN~d~ntd~ntd~n--t~d~~T~~s~tnsn~~~nTd~ntnsntdtnsd~ntts 217 (287) T protein:vir:18 140 ITSEKGEANSIQDADRNSTAKKKRMFEDTPDGRLDIVNDNN--IIQYATDLTQEDSTDSVKDKIKNDSSSKNDSTGNTTA 217 (287) T ss_pred CCCcCCCCCCccccccccccCCCcCcccCCCCccccccCCC--CccccccCCCccccCCCCCCcccCCCccccCCCcccc Confidence 222111 1112334455566676776666644 356666665555555444444433322111 1 Q ss_pred eeccccccccccccccccccc-----ccccccCCCCcchhHHHHHHHhhhccchhhhHHHHHHHHHHHhhc Q lcl|Aclame:pro 258 WSWNGSSSGSYGRNVGSNTSH-----SNNQSHSQGMSQSVYNTYRQWHDSSLDMTGGMYYQLVRAGLWSMF 323 (324) Q Consensus 258 ~s~~~SsSds~gsn~gaN~a~-----ssnvt~T~G~N~gV~qs~~~h~~s~~d~~~~~~~~~~~~~~~~~~ 323 (324) .+.....++..........+. .+++...+| .+.-.+++++||++.|.++++||+||++++||-.= T Consensus 218 n~~~nstsn~nsn~d~nSd~N~~~n~n~~s~~~~g-t~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (287) T protein:vir:18 218 EGKTNNITEGNNVDDFKSEKDEKQKLNDHIYGKQG-NVSYPQLIKEHREAILNVERMIFDQMEELFMFVYN 287 (287) T ss_pred CCCCCcccCCccCCCCCCCCCcccccccccccccc-ceeHHHHHHHHHHHHHhHHHHHHHHHHHHHhhhcC Confidence 111111111111111111111 122333333 55778999999999999999999999999987544 No 5 >protein:vir:4733 Length: 194 # NCBI annotation: collar protein # Family: family:all:28811 # MgeID: mge:103 # MgeName: Cp-1 # Cross-refs: genbank:acc:NP_044824;swissprot:trembl:q37996;genbank:gi:9629536;uniprot:Q37996;genbank:GeneID:1261240 Probab=99.95 E-value=1.1e-32 Score=195.87 Aligned_cols=194 Identities=22% Similarity=0.238 Sum_probs=160.9 Q ss_pred hhhHHHhhhhccCCcCccccchhhccchhhHHHHHhhccccccccCccccchhhhHHHHHHHHHHhhhhhhccccHHHHH Q lcl|Aclame:pro 47 MTTRDLLLGLSDYPDNRLYVPTEELLDEVFDQLIEITRIKPLVVFNDKEMDSQLTYKIMEDIFVLTEDREILKDTAGAWF 126 (324) Q Consensus 47 ~~~~~~l~g~~~~~~n~~~~~~~e~i~k~~~~~~~~~~~~~~~~f~yp~~de~~~~~Fe~~fi~~fy~rEIGfET~~~Fk 126 (324) ||+| |+||+ ++||++|+++..|||++|++||+ |++....+...|+++||+|||+||||.||+++|. T Consensus 1 mtgr--ldgla-vdengeflhyntiidqtynelfk-----------dmelvngvsdnfkkefckhfynreigletfarfq 66 (194) T protein:vir:47 1 MTGR--LDGLA-VDENGEFLHYNTIIDQTYNELFK-----------DMELVNGVSDNFKKEFCKHFYNREIGLETFARFQ 66 (194) T ss_pred Cccc--cccee-ecCCCceeehhhHHHhhHHHHHH-----------HHHHhhhhhhhHHHHHHHHHhcchhhhhHHHHHH Confidence 9999 99999 99999999999999999999999 9999999999999999999999999999999999 Q ss_pred HHHHHHHHHhchHHHHHHHHHHHHHhhccccccccccccccccccccccCcCcccccccccccccccccccCccCccccc Q lcl|Aclame:pro 127 RMFAATWHRYAELDEQHLRQIFQGIYRDHDSTSNAQSSSSGGSTSTSENTSKNQSTNDGQSDSDSKSDSASVGTTDSDAK 206 (324) Q Consensus 127 ~~L~~~l~~~Mp~y~~~~n~l~e~~~~~~D~~gNs~gssTg~~~s~se~~S~sqSts~gkT~ssSnT~SSt~GtndS~nr 206 (324) .+|++.|+ ..|.++|+||.| +|+......+||. |.++......+...- T Consensus 67 ialeevln---necfnlfkylae--------------------irnkaikdlnqsm---------nidtvgnqkadgqal 114 (194) T protein:vir:47 67 IALEEVLN---NECFNLFKYLAE--------------------IRNKAIKDLNQSM---------NIDTVGNQKADGQAL 114 (194) T ss_pred HHHHHHhh---hhHHHHHHHHHH--------------------HHhHHHHhhhhhc---------cccccccccccccee Confidence 99999999 789999999988 4444445666766 556666777888899 Q ss_pred CcccCCCcceeeecccccccccccccccccccccCccccceeccccccccceecccccccccccccccccccccccccCC Q lcl|Aclame:pro 207 NILSTMPQDKVFIKGDPLVQDVEYADNVTMNTSQGRDNSSQKSSNLSASHDWSWNGSSSGSYGRNVGSNTSHSNNQSHSQ 286 (324) Q Consensus 207 ti~Sdtpdsrl~g~td~~~~~ieYA~siskn~StskS~TtGksnS~TsS~s~s~~~SsSds~gsn~gaN~a~ssnvt~T~ 286 (324) +|...+|+.|..++..+..++|+||++..+|. +|++++|.|+..+|++|+.-.- . T Consensus 115 qianttpqerkeivfterygvieyadnlvenh--------qknnadtksnvsgwsgsslaer-----------------l 169 (194) T protein:vir:47 115 QIANTTPQERKEIVFTERYGVIEYADNLVENH--------QKNNADTKSNVSGWSGSSLAER-----------------L 169 (194) T ss_pred eeccCChhhhhhhhhhhhcchhHHHHHHHhhh--------hccccccccccccccchhHHHH-----------------H Confidence 99999999999999999999999999999999 8999999999888877654321 1 Q ss_pred CCcchhHHHHHHHhhhccchhhhHHHHHHHHHHHhhc Q lcl|Aclame:pro 287 GMSQSVYNTYRQWHDSSLDMTGGMYYQLVRAGLWSMF 323 (324) Q Consensus 287 G~N~gV~qs~~~h~~s~~d~~~~~~~~~~~~~~~~~~ 323 (324) -+|..+ -|.--.||.-|-++++-- | T Consensus 170 qrnael-----------kdiqfqifnicdklflqv-f 194 (194) T protein:vir:47 170 QRNAEL-----------KDIQFQIFNICDKLFLQV-F 194 (194) T ss_pred hhccch-----------hhhHHHHHHHHHHHHHhc-C Confidence 122222 233344666666554422 2 No 6 >protein:vir:9604 Length: 236 # NCBI annotation: hypothetical protein # Family: family:all:5254 # MgeID: mge:172 # MgeName: C1 # Cross-refs: genbank:acc:NP_852020;genbank:gi:31072022;genbank:GeneID:1489939 Probab=99.89 E-value=3.8e-27 Score=165.46 Aligned_cols=219 Identities=20% Similarity=0.243 Sum_probs=142.3 Q ss_pred hhhHHHhh------hhccC--CcCcccc--chhhccchhhHHHHHhhccccccccCcccc-chhhhHHHHHHHHHHhhhh Q lcl|Aclame:pro 47 MTTRDLLL------GLSDY--PDNRLYV--PTEELLDEVFDQLIEITRIKPLVVFNDKEM-DSQLTYKIMEDIFVLTEDR 115 (324) Q Consensus 47 ~~~~~~l~------g~~~~--~~n~~~~--~~~e~i~k~~~~~~~~~~~~~~~~f~yp~~-de~~~~~Fe~~fi~~fy~r 115 (324) |+--||+. |++.. |||++-+ --+. +.+++=.-..+-+.|-+-.+| -..++.+|+++|+.||||| T Consensus 1 m~l~~~i~~e~vk~g~~~f~~~~n~~~~~~d~~q-----~~~k~~~~d~dv~~vvne~if~g~~~kEdF~~~F~~yF~~r 75 (236) T protein:vir:96 1 MRLFELIYKEVVKNGYSPFRSPENRIVVFEDKAQ-----IETKIMMYDEDVQKVVNELIFTGSKVNEDFREEFVNYFFNR 75 (236) T ss_pred CchHHHHHHHHHhccchhhcCCCceEEEeechhH-----HHHHHHhhhHHHHHHHHHHhhccccchHHHHHHHHHHHHhc Confidence 77666552 44421 3443321 1111 111111000011111111111 2357889999999999999 Q ss_pred hhccccHHHHHHHHHHHHHHhch----HHHHHHHHHHHHHhhccccccccccccccccccccccCcCccccccccccccc Q lcl|Aclame:pro 116 EILKDTAGAWFRMFAATWHRYAE----LDEQHLRQIFQGIYRDHDSTSNAQSSSSGGSTSTSENTSKNQSTNDGQSDSDS 191 (324) Q Consensus 116 EIGfET~~~Fk~~L~~~l~~~Mp----~y~~~~n~l~e~~~~~~D~~gNs~gssTg~~~s~se~~S~sqSts~gkT~ssS 191 (324) ||||+|+.+|..+|..+|++.|| .||++|+.|.+..+.+ +.|++++..++.+ ++...| T Consensus 76 Ei~~qt~~aF~~~l~~~l~tke~~ln~iY~~s~e~ll~E~y~~--S~Ghs~~~t~n~D----------------~t~n~S 137 (236) T protein:vir:96 76 EPHWDSLYIFRAKLKGILKTKEAVLNMLYLKSTELLLGESMSK--SEGHSSNENRSRD----------------NSTNES 137 (236) T ss_pred cCCcccHHHHHHHHHHHhhhhhhhhhhhhccchhhhhhhhhhh--ccccccccccCcc----------------cccccc Confidence 99999999999999999999999 9999999999999888 5555444433211 122223 Q ss_pred ccccccCccCcccccCcccCCCcceeeecccccccccccccccccccccCccccceeccccccccceecccccccccccc Q lcl|Aclame:pro 192 KSDSASVGTTDSDAKNILSTMPQDKVFIKGDPLVQDVEYADNVTMNTSQGRDNSSQKSSNLSASHDWSWNGSSSGSYGRN 271 (324) Q Consensus 192 nT~SSt~GtndS~nrti~Sdtpdsrl~g~td~~~~~ieYA~siskn~StskS~TtGksnS~TsS~s~s~~~SsSds~gsn 271 (324) ++.+. ++..-.+.||+.+.+. ....++.||++- ..+++++-+++++.+++.+..++..+++|.+++=+ T Consensus 138 ng~~~--------~~nA~~~lPd~~~d~D--v~~snL~yAdN~--t~s~~~tvn~s~s~si~~s~~~~nnkgns~gtqf~ 205 (236) T protein:vir:96 138 NGENR--------GANAHSTNPDDVTDTD--LETANLSYADNL--DKSYNESVNVSHSKGISSSQGSSNNNSNSTNTQFN 205 (236) T ss_pred ccccc--------cCCccccCCcchhccc--cccccccccccc--ccccccccccccccccccccccccccccccccchh Confidence 33333 3344558899988554 445789999954 45668888888888888888776666665543322 Q ss_pred cccccccccccccCCCCcchhHHHHHHHhhhccc--hhhhHHHHHHHHHH Q lcl|Aclame:pro 272 VGSNTSHSNNQSHSQGMSQSVYNTYRQWHDSSLD--MTGGMYYQLVRAGL 319 (324) Q Consensus 272 ~gaN~a~ssnvt~T~G~N~gV~qs~~~h~~s~~d--~~~~~~~~~~~~~~ 319 (324) ..+ -+++..|+||+.+.|| |+.++||+ |+ T Consensus 206 ~~~---------------ld~~~~~kqkI~~e~D~klFs~lf~~----~~ 236 (236) T protein:vir:96 206 TKA---------------LEEYEAFKQKIFDELDIKLFSQLFYE----GY 236 (236) T ss_pred hHH---------------HHHHHHHHHHhhhhhcHHHhhhhhhc----CC Confidence 211 3899999999999999 88888886 44 No 7 >protein:vir:9444 Length: 251 # NCBI annotation: lower collar protein # Family: family:all:5254 # MgeID: mge:168 # MgeName: phiP68 # Cross-refs: genbank:acc:NP_817334;genbank:gi:29565761;genbank:GeneID:1258938 Probab=98.52 E-value=1.3e-09 Score=69.26 Aligned_cols=219 Identities=21% Similarity=0.305 Sum_probs=110.9 Q ss_pred hhhhhhhhhhHHHhhhhccCCcCccccchhhccchhhHHHHHhhccccccccCcc--------ccch------------- Q lcl|Aclame:pro 40 HRRYETRMTTRDLLLGLSDYPDNRLYVPTEELLDEVFDQLIEITRIKPLVVFNDK--------EMDS------------- 98 (324) Q Consensus 40 ~~~~~~~~~~~~~l~g~~~~~~n~~~~~~~e~i~k~~~~~~~~~~~~~~~~f~yp--------~~de------------- 98 (324) -.|| .||--||+ ..|+|.|++..|-.+-++ .+|+|. -||+ T Consensus 1 ~~~~--~M~L~d~I--------------~~E~iK~G~~~F~~dNkl---~~~dD~~Q~~~Kml~~D~DV~~iVNE~vF~G 61 (251) T protein:vir:94 1 MARY--TMTLYDFI--------------KSELIKKGFNEFVNDNKL---TFYDDEFQFMQKMLKFDKDVLAIVNEKVFKG 61 (251) T ss_pred Cccc--hhHHHHHH--------------HHHHHhccchhhhcCCce---EEecchHHHHHHHHhhhHHHHHHHHHHhhcc Confidence 3344 25544443 245555555555443222 222231 1233 Q ss_pred -hhhHH-----HHHHHHHHhhhhhhccccHHHHHHHHHHHHHHhchHHHHHHHHHHHHHhhccccccccccccccccccc Q lcl|Aclame:pro 99 -QLTYK-----IMEDIFVLTEDREILKDTAGAWFRMFAATWHRYAELDEQHLRQIFQGIYRDHDSTSNAQSSSSGGSTST 172 (324) Q Consensus 99 -~~~~~-----Fe~~fi~~fy~rEIGfET~~~Fk~~L~~~l~~~Mp~y~~~~n~l~e~~~~~~D~~gNs~gssTg~~~s~ 172 (324) +++.+ |.+.|..||.+|||-.+|..+|--+|-..+.-. +.|+|-++.+.-. ..+.-++|- +-+ T Consensus 62 ~~~~de~~~~~Fk~~F~~~F~~RE~~~~t~~~F~~q~~~v~~T~----E~~LN~~Y~SsE~--E~~~qSqG~-----~~H 130 (251) T protein:vir:94 62 FSLKDELSDLLFKKSFTIHFLDREINRQTVEAFGMQVITVCITH----EDYLNVVYSSSEV--EKYLQSQGF-----TEH 130 (251) T ss_pred cccchhhhhhHHHHHHHHHhhcccccHHHHHHHHHHHHHHhhhh----HHHHHhhhhhhHH--HHHHHhcCC-----ccc Confidence 24455 999999999999999999999999988877655 3466666543222 122222221 111 Q ss_pred cccCcCccccccccccccccccc--ccCccCcccccCcccCCCcceeeecccccccccccccccccccccCccccceecc Q lcl|Aclame:pro 173 SENTSKNQSTNDGQSDSDSKSDS--ASVGTTDSDAKNILSTMPQDKVFIKGDPLVQDVEYADNVTMNTSQGRDNSSQKSS 250 (324) Q Consensus 173 se~~S~sqSts~gkT~ssSnT~S--St~GtndS~nrti~Sdtpdsrl~g~td~~~~~ieYA~siskn~StskS~TtGksn 250 (324) ++..+ ..++-++|-.. -...++-+.++..-+..||+.+.+..+. .++.||++..-.. ||+- T Consensus 131 ~~~~~-------~~~D~tsNq~~~~~~~S~G~~~~~NA~~s~P~~~~~~D~d~--~~L~~ADN~~~~~--------~Kt~ 193 (251) T protein:vir:94 131 NEDTT-------SNTDETSNQNATSLDNSTGMTANRNAYVSLPQSEVNIDVDN--TTLRFADNNTIDN--------GKTV 193 (251) T ss_pred ccccc-------ccccccccccccccccccccccCccccccCccchhhccccc--ceeeecccccccc--------cccc Confidence 11100 00111111111 0112333455666778899988766554 4799998875333 3333 Q ss_pred ccccccceecccccccccccccccccccccccccCCCCcchhHHHHHHHhhhccchhhhHHHHHHHH---HHH Q lcl|Aclame:pro 251 NLSASHDWSWNGSSSGSYGRNVGSNTSHSNNQSHSQGMSQSVYNTYRQWHDSSLDMTGGMYYQLVRA---GLW 320 (324) Q Consensus 251 S~TsS~s~s~~~SsSds~gsn~gaN~a~ssnvt~T~G~N~gV~qs~~~h~~s~~d~~~~~~~~~~~~---~~~ 320 (324) ..+.-+++..+++++...+-+.|. ++. .|.-..-.|--||+-.-||+.+-.. +|| T Consensus 194 N~S~N~S~~~~~~~~~~~~N~~~~-------~~~--------~Q~~~~~id~~~~~rkKI~~E~D~K~F~Qi~ 251 (251) T protein:vir:94 194 NKSSNESNQNAKRNQNQKGNAKGT-------QFT--------KQYLIDNIDKAYDLRKKILNEFDKKCFLQIW 251 (251) T ss_pred ccccchhhhhhhhccccccccccc-------chh--------hhHhHHHHHHHHHHHHHHHHHHhHHHHHhcC Confidence 333223333333333332222221 122 2555555666777778888776543 444 No 8 >protein:vir:9465 Length: 251 # NCBI annotation: lower collar protein # Family: family:all:5254 # MgeID: mge:169 # MgeName: 44AHJD # Cross-refs: genbank:acc:NP_817312;genbank:gi:29565738;genbank:GeneID:1258928 Probab=98.52 E-value=1.3e-09 Score=69.26 Aligned_cols=219 Identities=21% Similarity=0.305 Sum_probs=110.9 Q ss_pred hhhhhhhhhhHHHhhhhccCCcCccccchhhccchhhHHHHHhhccccccccCcc--------ccch------------- Q lcl|Aclame:pro 40 HRRYETRMTTRDLLLGLSDYPDNRLYVPTEELLDEVFDQLIEITRIKPLVVFNDK--------EMDS------------- 98 (324) Q Consensus 40 ~~~~~~~~~~~~~l~g~~~~~~n~~~~~~~e~i~k~~~~~~~~~~~~~~~~f~yp--------~~de------------- 98 (324) -.|| .||--||+ ..|+|.|++..|-.+-++ .+|+|. -||+ T Consensus 1 ~~~~--~M~L~d~I--------------~~E~iK~G~~~F~~dNkl---~~~dD~~Q~~~Kml~~D~DV~~iVNE~vF~G 61 (251) T protein:vir:94 1 MARY--TMTLYDFI--------------KSELIKKGFNEFVNDNKL---TFYDDEFQFMQKMLKFDKDVLAIVNEKVFKG 61 (251) T ss_pred Cccc--hhHHHHHH--------------HHHHHhccchhhhcCCce---EEecchHHHHHHHHhhhHHHHHHHHHHhhcc Confidence 3344 25544443 245555555555443222 222231 1233 Q ss_pred -hhhHH-----HHHHHHHHhhhhhhccccHHHHHHHHHHHHHHhchHHHHHHHHHHHHHhhccccccccccccccccccc Q lcl|Aclame:pro 99 -QLTYK-----IMEDIFVLTEDREILKDTAGAWFRMFAATWHRYAELDEQHLRQIFQGIYRDHDSTSNAQSSSSGGSTST 172 (324) Q Consensus 99 -~~~~~-----Fe~~fi~~fy~rEIGfET~~~Fk~~L~~~l~~~Mp~y~~~~n~l~e~~~~~~D~~gNs~gssTg~~~s~ 172 (324) +++.+ |.+.|..||.+|||-.+|..+|--+|-..+.-. +.|+|-++.+.-. ..+.-++|- +-+ T Consensus 62 ~~~~de~~~~~Fk~~F~~~F~~RE~~~~t~~~F~~q~~~v~~T~----E~~LN~~Y~SsE~--E~~~qSqG~-----~~H 130 (251) T protein:vir:94 62 FSLKDELSDLLFKKSFTIHFLDREINRQTVEAFGMQVITVCITH----EDYLNVVYSSSEV--EKYLQSQGF-----TEH 130 (251) T ss_pred cccchhhhhhHHHHHHHHHhhcccccHHHHHHHHHHHHHHhhhh----HHHHHhhhhhhHH--HHHHHhcCC-----ccc Confidence 24455 999999999999999999999999988877655 3466666543222 122222221 111 Q ss_pred cccCcCccccccccccccccccc--ccCccCcccccCcccCCCcceeeecccccccccccccccccccccCccccceecc Q lcl|Aclame:pro 173 SENTSKNQSTNDGQSDSDSKSDS--ASVGTTDSDAKNILSTMPQDKVFIKGDPLVQDVEYADNVTMNTSQGRDNSSQKSS 250 (324) Q Consensus 173 se~~S~sqSts~gkT~ssSnT~S--St~GtndS~nrti~Sdtpdsrl~g~td~~~~~ieYA~siskn~StskS~TtGksn 250 (324) ++..+ ..++-++|-.. -...++-+.++..-+..||+.+.+..+. .++.||++..-.. ||+- T Consensus 131 ~~~~~-------~~~D~tsNq~~~~~~~S~G~~~~~NA~~s~P~~~~~~D~d~--~~L~~ADN~~~~~--------~Kt~ 193 (251) T protein:vir:94 131 NEDTT-------SNTDETSNQNATSLDNSTGMTANRNAYVSLPQSEVNIDVDN--TTLRFADNNTIDN--------GKTV 193 (251) T ss_pred ccccc-------ccccccccccccccccccccccCccccccCccchhhccccc--ceeeecccccccc--------cccc Confidence 11100 00111111111 0112333455666778899988766554 4799998875333 3333 Q ss_pred ccccccceecccccccccccccccccccccccccCCCCcchhHHHHHHHhhhccchhhhHHHHHHHH---HHH Q lcl|Aclame:pro 251 NLSASHDWSWNGSSSGSYGRNVGSNTSHSNNQSHSQGMSQSVYNTYRQWHDSSLDMTGGMYYQLVRA---GLW 320 (324) Q Consensus 251 S~TsS~s~s~~~SsSds~gsn~gaN~a~ssnvt~T~G~N~gV~qs~~~h~~s~~d~~~~~~~~~~~~---~~~ 320 (324) ..+.-+++..+++++...+-+.|. ++. .|.-..-.|--||+-.-||+.+-.. +|| T Consensus 194 N~S~N~S~~~~~~~~~~~~N~~~~-------~~~--------~Q~~~~~id~~~~~rkKI~~E~D~K~F~Qi~ 251 (251) T protein:vir:94 194 NKSSNESNQNAKRNQNQKGNAKGT-------QFT--------KQYLIDNIDKAYDLRKKILNEFDKKCFLQIW 251 (251) T ss_pred ccccchhhhhhhhccccccccccc-------chh--------hhHhHHHHHHHHHHHHHHHHHHhHHHHHhcC Confidence 333223333333333332222221 122 2555555666777778888776543 444 No 9 >protein:vir:97357 Length: 251 # NCBI annotation: ORF008 # Family: family:all:5254 # MgeID: mge:1669 # MgeName: 66 # Cross-refs: genbank:acc:YP_239466;genbank:gi:66395195;genbank:GeneID:5130532 Probab=98.52 E-value=1.3e-09 Score=69.26 Aligned_cols=219 Identities=21% Similarity=0.305 Sum_probs=110.9 Q ss_pred hhhhhhhhhhHHHhhhhccCCcCccccchhhccchhhHHHHHhhccccccccCcc--------ccch------------- Q lcl|Aclame:pro 40 HRRYETRMTTRDLLLGLSDYPDNRLYVPTEELLDEVFDQLIEITRIKPLVVFNDK--------EMDS------------- 98 (324) Q Consensus 40 ~~~~~~~~~~~~~l~g~~~~~~n~~~~~~~e~i~k~~~~~~~~~~~~~~~~f~yp--------~~de------------- 98 (324) -.|| .||--||+ ..|+|.|++..|-.+-++ .+|+|. -||+ T Consensus 1 ~~~~--~M~L~d~I--------------~~E~iK~G~~~F~~dNkl---~~~dD~~Q~~~Kml~~D~DV~~iVNE~vF~G 61 (251) T protein:vir:97 1 MARY--TMTLYDFI--------------KSELIKKGFNEFVNDNKL---TFYDDEFQFMQKMLKFDKDVLAIVNEKVFKG 61 (251) T ss_pred Cccc--hhHHHHHH--------------HHHHHhccchhhhcCCce---EEecchHHHHHHHHhhhHHHHHHHHHHhhcc Confidence 3344 25544443 245555555555443222 222231 1233 Q ss_pred -hhhHH-----HHHHHHHHhhhhhhccccHHHHHHHHHHHHHHhchHHHHHHHHHHHHHhhccccccccccccccccccc Q lcl|Aclame:pro 99 -QLTYK-----IMEDIFVLTEDREILKDTAGAWFRMFAATWHRYAELDEQHLRQIFQGIYRDHDSTSNAQSSSSGGSTST 172 (324) Q Consensus 99 -~~~~~-----Fe~~fi~~fy~rEIGfET~~~Fk~~L~~~l~~~Mp~y~~~~n~l~e~~~~~~D~~gNs~gssTg~~~s~ 172 (324) +++.+ |.+.|..||.+|||-.+|..+|--+|-..+.-. +.|+|-++.+.-. ..+.-++|- +-+ T Consensus 62 ~~~~de~~~~~Fk~~F~~~F~~RE~~~~t~~~F~~q~~~v~~T~----E~~LN~~Y~SsE~--E~~~qSqG~-----~~H 130 (251) T protein:vir:97 62 FSLKDELSDLLFKKSFTIHFLDREINRQTVEAFGMQVITVCITH----EDYLNVVYSSSEV--EKYLQSQGF-----TEH 130 (251) T ss_pred cccchhhhhhHHHHHHHHHhhcccccHHHHHHHHHHHHHHhhhh----HHHHHhhhhhhHH--HHHHHhcCC-----ccc Confidence 24455 999999999999999999999999988877655 3466666543222 122222221 111 Q ss_pred cccCcCccccccccccccccccc--ccCccCcccccCcccCCCcceeeecccccccccccccccccccccCccccceecc Q lcl|Aclame:pro 173 SENTSKNQSTNDGQSDSDSKSDS--ASVGTTDSDAKNILSTMPQDKVFIKGDPLVQDVEYADNVTMNTSQGRDNSSQKSS 250 (324) Q Consensus 173 se~~S~sqSts~gkT~ssSnT~S--St~GtndS~nrti~Sdtpdsrl~g~td~~~~~ieYA~siskn~StskS~TtGksn 250 (324) ++..+ ..++-++|-.. -...++-+.++..-+..||+.+.+..+. .++.||++..-.. ||+- T Consensus 131 ~~~~~-------~~~D~tsNq~~~~~~~S~G~~~~~NA~~s~P~~~~~~D~d~--~~L~~ADN~~~~~--------~Kt~ 193 (251) T protein:vir:97 131 NEDTT-------SNTDETSNQNATSLDNSTGMTANRNAYVSLPQSEVNIDVDN--TTLRFADNNTIDN--------GKTV 193 (251) T ss_pred ccccc-------ccccccccccccccccccccccCccccccCccchhhccccc--ceeeecccccccc--------cccc Confidence 11100 00111111111 0112333455666778899988766554 4799998875333 3333 Q ss_pred ccccccceecccccccccccccccccccccccccCCCCcchhHHHHHHHhhhccchhhhHHHHHHHH---HHH Q lcl|Aclame:pro 251 NLSASHDWSWNGSSSGSYGRNVGSNTSHSNNQSHSQGMSQSVYNTYRQWHDSSLDMTGGMYYQLVRA---GLW 320 (324) Q Consensus 251 S~TsS~s~s~~~SsSds~gsn~gaN~a~ssnvt~T~G~N~gV~qs~~~h~~s~~d~~~~~~~~~~~~---~~~ 320 (324) ..+.-+++..+++++...+-+.|. ++. .|.-..-.|--||+-.-||+.+-.. +|| T Consensus 194 N~S~N~S~~~~~~~~~~~~N~~~~-------~~~--------~Q~~~~~id~~~~~rkKI~~E~D~K~F~Qi~ 251 (251) T protein:vir:97 194 NKSSNESNQNAKRNQNQKGNAKGT-------QFT--------KQYLIDNIDKAYDLRKKILNEFDKKCFLQIW 251 (251) T ss_pred ccccchhhhhhhhccccccccccc-------chh--------hhHhHHHHHHHHHHHHHHHHHHhHHHHHhcC Confidence 333223333333333332222221 122 2555555666777778888776543 444 No 10 >protein:vir:9604 Length: 236 # NCBI annotation: hypothetical protein # Family: family:all:5254 # MgeID: mge:172 # MgeName: C1 # Cross-refs: genbank:acc:NP_852020;genbank:gi:31072022;genbank:GeneID:1489939 Probab=81.72 E-value=0.03 Score=28.96 Aligned_cols=212 Identities=18% Similarity=0.195 Sum_probs=82.6 Q ss_pred Cccc-cchhhccchhhHHHHHhhccccccccCcc--------ccchhhhHHHHHHHHHHhhhhhhccccHHHHHHHHHHH Q lcl|Aclame:pro 62 NRLY-VPTEELLDEVFDQLIEITRIKPLVVFNDK--------EMDSQLTYKIMEDIFVLTEDREILKDTAGAWFRMFAAT 132 (324) Q Consensus 62 n~~~-~~~~e~i~k~~~~~~~~~~~~~~~~f~yp--------~~de~~~~~Fe~~fi~~fy~rEIGfET~~~Fk~~L~~~ 132 (324) =+|| +=..|+|.+++..|-.- ---+.+|+|. -||+.+.+.-.+++++ ||.--..|+.++-++ T Consensus 1 m~l~~~i~~e~vk~g~~~f~~~--~n~~~~~~d~~q~~~k~~~~d~dv~~vvne~if~-------g~~~kEdF~~~F~~y 71 (236) T protein:vir:96 1 MRLFELIYKEVVKNGYSPFRSP--ENRIVVFEDKAQIETKIMMYDEDVQKVVNELIFT-------GSKVNEDFREEFVNY 71 (236) T ss_pred CchHHHHHHHHHhccchhhcCC--CceEEEeechhHHHHHHHhhhHHHHHHHHHHhhc-------cccchHHHHHHHHHH Confidence 1222 23456666666666540 0224556664 3677777777777776 444455699888887 Q ss_pred HHHhchHHHHHHHHHH----HHHhhcccccccccc-ccccccccccccCcCcccccccccccccccccccCccCcccccC Q lcl|Aclame:pro 133 WHRYAELDEQHLRQIF----QGIYRDHDSTSNAQS-SSSGGSTSTSENTSKNQSTNDGQSDSDSKSDSASVGTTDSDAKN 207 (324) Q Consensus 133 l~~~Mp~y~~~~n~l~----e~~~~~~D~~gNs~g-ssTg~~~s~se~~S~sqSts~gkT~ssSnT~SSt~GtndS~nrt 207 (324) +--.=| .|..=++| -..|+++.|..|-.+ +++ +..-.-.-.+|++|.+ .+..+++ ..+++|...+ T Consensus 72 F~~rEi--~~qt~~aF~~~l~~~l~tke~~ln~iY~~s~----e~ll~E~y~~S~Ghs~-~~t~n~D---~t~n~Sng~~ 141 (236) T protein:vir:96 72 FFNREP--HWDSLYIFRAKLKGILKTKEAVLNMLYLKST----ELLLGESMSKSEGHSS-NENRSRD---NSTNESNGEN 141 (236) T ss_pred HHhccC--CcccHHHHHHHHHHHhhhhhhhhhhhhccch----hhhhhhhhhhcccccc-ccccCcc---cccccccccc Confidence 764444 22222333 346778888777776 322 1111112233333221 1111111 1111111100 Q ss_pred cccCCCcceeeecccccccccccccccccccccCccccceeccccccccceecccccccccccccccccc------cccc Q lcl|Aclame:pro 208 ILSTMPQDKVFIKGDPLVQDVEYADNVTMNTSQGRDNSSQKSSNLSASHDWSWNGSSSGSYGRNVGSNTS------HSNN 281 (324) Q Consensus 208 i~Sdtpdsrl~g~td~~~~~ieYA~siskn~StskS~TtGksnS~TsS~s~s~~~SsSds~gsn~gaN~a------~ssn 281 (324) .+.. |++.-+.. .---+..++..+-+++...+-+.+.+.+++.+ ++++ T Consensus 142 ~~~n-------------------A~~~lPd~-------~~d~Dv~~snL~yAdN~t~s~~~tvn~s~s~si~~s~~~~nn 195 (236) T protein:vir:96 142 RGAN-------------------AHSTNPDD-------VTDTDLETANLSYADNLDKSYNESVNVSHSKGISSSQGSSNN 195 (236) T ss_pred ccCC-------------------ccccCCcc-------hhcccccccccccccccccccccccccccccccccccccccc Confidence 0000 11111111 00011111222222222222222222222222 2233 Q ss_pred cccCCCCcchhHHHHHHHhhhccchhhhHHHHHHHHHHHhhcC Q lcl|Aclame:pro 282 QSHSQGMSQSVYNTYRQWHDSSLDMTGGMYYQLVRAGLWSMFC 324 (324) Q Consensus 282 vt~T~G~N~gV~qs~~~h~~s~~d~~~~~~~~~~~~~~~~~~~ 324 (324) ..++-| +|+=++.-|--++.-.-||+.+- .-||+-.. T Consensus 196 kgns~g-----tqf~~~~ld~~~~~kqkI~~e~D-~klFs~lf 232 (236) T protein:vir:96 196 NSNSTN-----TQFNTKALEEYEAFKQKIFDELD-IKLFSQLF 232 (236) T ss_pred cccccc-----cchhhHHHHHHHHHHHHhhhhhc-HHHhhhhh Confidence 333322 23333433333333333333321 22454433 No 11 >protein:vir:18 Length: 287 # NCBI annotation: lower collar protein # Family: family:all:5217 # MgeID: mge:323 # MgeName: GA-1 # Cross-refs: genbank:acc:NP_073694;swissprot:trembl:q9fzw4;genbank:gi:12248118;uniprot:Q9FZW4;genbank:GeneID:919889 Probab=74.59 E-value=0.16 Score=25.00 Aligned_cols=244 Identities=12% Similarity=0.023 Sum_probs=73.7 Q ss_pred hccCCcC-ccccchhhccch--hhHHHHHhhccccccccCccccchhhhHHHHHHHHHHhhhhhhccccHHHHHHHHHHH Q lcl|Aclame:pro 56 LSDYPDN-RLYVPTEELLDE--VFDQLIEITRIKPLVVFNDKEMDSQLTYKIMEDIFVLTEDREILKDTAGAWFRMFAAT 132 (324) Q Consensus 56 ~~~~~~n-~~~~~~~e~i~k--~~~~~~~~~~~~~~~~f~yp~~de~~~~~Fe~~fi~~fy~rEIGfET~~~Fk~~L~~~ 132 (324) +|.|-=- |.||-+|.-++. ++.++||.+|+| .|| +++.|++.|.+.|.-|=| .. .| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~--~~y~~~~~~~~~~~~~~~---------~~--~~ 59 (287) T protein:vir:18 1 MASFTMPLREIVEWATQFDNKLTRNEKIEEGRKK--------LFD--FFYPIETDYKKEFETKFI---------KH--FY 59 (287) T ss_pred CcceeehHHHHHHHHhhhcccchhhHHHhhhhhh--------hhh--hcCccchHHHHHHHHHHH---------HH--HH Confidence 4444444 678888888876 467889988864 788 688889999988866533 21 24 Q ss_pred HHHhchHHHHHHHHHHHHHhhccccccccccccccc----cc---cccccCcCcccccccccccccccccccCccCc--- Q lcl|Aclame:pro 133 WHRYAELDEQHLRQIFQGIYRDHDSTSNAQSSSSGG----ST---STSENTSKNQSTNDGQSDSDSKSDSASVGTTD--- 202 (324) Q Consensus 133 l~~~Mp~y~~~~n~l~e~~~~~~D~~gNs~gssTg~----~~---s~se~~S~sqSts~gkT~ssSnT~SSt~Gtnd--- 202 (324) |+++-=.-+-.|..-.+--|+-.=|.-|....+.-- .. ........++...+..++..+++.+.++...+ T Consensus 60 ~~~i~~~~~~~~~~~~~~~~~~~mp~~n~~~ese~vntN~~~nTdant~tNtD~nTt~ndn~dtdsnt~ad~ntntdtnT 139 (287) T protein:vir:18 60 FREIGFETEGRFKFALEEWLNLNMPYWNKIIESTHLDYNPLYNVDYKKDSDLIRNLDQVDNRVTDSKIENNGKASSESNV 139 (287) T ss_pred HHHHhhhhHHHHHHHHHHHHHhhcchhhhhHhhhhccCCccccccccCCCCcccCCCCCCCcccccCcccCCCcCCCCCC Confidence 444431122233332333333333433432111110 00 00111222222222222222222221111111 Q ss_pred -ccccCcccCCCcceeeecccccc-----cccccccccccccccCcccccee----ccccccccc--eeccccccccccc Q lcl|Aclame:pro 203 -SDAKNILSTMPQDKVFIKGDPLV-----QDVEYADNVTMNTSQGRDNSSQK----SSNLSASHD--WSWNGSSSGSYGR 270 (324) Q Consensus 203 -S~nrti~Sdtpdsrl~g~td~~~-----~~ieYA~siskn~StskS~TtGk----snS~TsS~s--~s~~~SsSds~gs 270 (324) ....+-... +....+.+.... .++.....+.... .....++.. ++....+.. .+.+...++.... T Consensus 140 ntd~nTtant--dtntD~NTt~n~~tnt~dN~d~ntd~ntd~-nt~d~~T~~s~tnsn~~~nTd~ntnsntdtnsd~ntt 216 (287) T protein:vir:18 140 ITSEKGEANS--IQDADRNSTAKKKRMFEDTPDGRLDIVNDN-NIIQYATDLTQEDSTDSVKDKIKNDSSSKNDSTGNTT 216 (287) T ss_pred CCCcCCCCCC--ccccccccccCCCcCcccCCCCccccccCC-CCccccccCCCccccCCCCCCcccCCCccccCCCccc Confidence 011000000 001111111100 0111000000000 000000000 000001111 1111111111111 Q ss_pred cccc--ccc--cccccccCCC--Ccchh--------HHHHHHHhhhccchhhhHHHHHHHHHHHhhcC Q lcl|Aclame:pro 271 NVGS--NTS--HSNNQSHSQG--MSQSV--------YNTYRQWHDSSLDMTGGMYYQLVRAGLWSMFC 324 (324) Q Consensus 271 n~ga--N~a--~ssnvt~T~G--~N~gV--------~qs~~~h~~s~~d~~~~~~~~~~~~~~~~~~~ 324 (324) ..++ +.+ .++....+.. .+... +|+|.+-.- -|=-.+-=-+||+=-.|=.+|. T Consensus 217 sn~~~nstsn~nsn~d~nSd~N~~~n~n~~s~~~~gt~s~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 283 (287) T protein:vir:18 217 AEGKTNNITEGNNVDDFKSEKDEKQKLNDHIYGKQGNVSYPQLIK-EHREAILNVERMIFDQMEELFM 283 (287) T ss_pred cCCCCCcccCCccCCCCCCCCCccccccccccccccceeHHHHHH-HHHHHHHhHHHHHHHHHHHHHh Confidence 1111 111 1111111100 00011 122222111 0000111112333233333343 Done!