Query lcl|Aclame:protein:vir:5206|NCBI_annot:lower collar protein|genbank:acc:NP_040729;genbank:gi:9626400;genbank:GeneID:1260971 Match_columns 293 No_of_seqs 123 out of 213 Neff 6.8 Searched_HMMs 1612 Date Fri Nov 29 20:42:59 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_16 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_16_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:5206 Length: 293 # 100.0 4E-104 2E-107 587.4 24.4 293 1-293 1-293 (293) 2 protein:vir:122 Length: 293 # 100.0 1E-101 9E-105 573.4 23.2 293 1-293 1-293 (293) 3 protein:vir:18 Length: 287 # N 100.0 1.7E-94 1E-97 534.7 24.2 286 1-293 1-286 (287) 4 protein:vir:79392 Length: 240 100.0 1.1E-62 6.6E-66 360.4 15.1 212 1-293 1-212 (240) 5 protein:vir:9604 Length: 236 # 99.4 8.3E-16 5.2E-19 103.2 10.9 208 1-293 1-232 (236) 6 protein:vir:97357 Length: 251 98.4 1.2E-08 7.6E-12 64.0 11.0 220 1-293 1-251 (251) 7 protein:vir:9444 Length: 251 # 98.4 1.2E-08 7.6E-12 64.0 11.0 220 1-293 1-251 (251) 8 protein:vir:9465 Length: 251 # 98.4 1.2E-08 7.6E-12 64.0 11.0 220 1-293 1-251 (251) 9 protein:vir:5206 Length: 293 # 97.9 7.3E-06 4.5E-09 48.7 16.9 269 1-286 14-293 (293) 10 protein:vir:122 Length: 293 # 95.0 0.0029 1.8E-06 34.5 13.4 268 1-286 14-293 (293) 11 protein:vir:18 Length: 287 # N 93.9 0.006 3.7E-06 32.8 16.3 256 15-293 1-279 (287) 12 protein:vir:4733 Length: 194 # 93.2 0.0016 1E-06 35.9 7.5 179 4-293 1-194 (194) No 1 >protein:vir:5206 Length: 293 # NCBI annotation: lower collar protein # Family: family:all:5217 # MgeID: mge:116 # MgeName: PZA # Cross-refs: genbank:acc:NP_040729;genbank:gi:9626400;genbank:GeneID:1260971 Probab=100.00 E-value=4e-104 Score=587.45 Aligned_cols=293 Identities=100% Similarity=1.374 Sum_probs=243.4 Q ss_pred CCcchhhHhHHHhhhhhccccccccccccccccccccCCCcccchhhhHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHH Q lcl|Aclame:pro 1 MSSYTMQLRTYIEMWSQGETGLSTAEKIEKGRPKLFDFNYPIFDESYRTIFETHFIRNFYMREIGFETEGLFKFHLETWL 80 (293) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~f~~~~~~~~~~~~ig~~t~~~fk~~l~~~~ 80 (293) |||||||||+|||||+|||+||+|+||||+||||||||+||||||+|||+||+|||||||||||||||++||||+|+.|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (293) T protein:vir:52 1 MSSYTMQLRTYIEMWSQGETGLSTAEKIEKGRPKLFDFNYPIFDESYRTIFETHFIRNFYMREIGFETEGLFKFHLETWL 80 (293) T ss_pred CcceeehHhHHHhhhhhcCCcccccchhhhhhhhhhccCCCccchhHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhchHHHHHHhhhcccCCCccCCccCCCCCCCCCCCCCCCCCcCCcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCC Q lcl|Aclame:pro 81 MINMPYFNKLFESELIKYDPLENTRVGVKSNTKNDTDRNDNRDVKQDLTSNGTSSTDAKQNDTSKTTGNEKSSGSGSITD 160 (293) Q Consensus 81 ~~~m~~~~~~~e~e~~d~n~~~nt~~~s~s~s~~dts~~dn~~t~s~~t~n~ts~s~~n~~~~s~nt~n~s~~~~~~~t~ 160 (293) +++||||+++|++++.+++|.+++.....++...+++..++...++....++...+++.....+.+++++...+.+..+. T Consensus 81 ~~~~~~~n~~~~s~~~nt~p~dNt~~~~ttn~~~sTs~d~st~tdss~t~d~~stTds~t~~~s~~tanst~ta~snsT~ 160 (293) T protein:vir:52 81 MINMPYFNKLFESELIKYDPLENTRVGVKSNTKNDTDRNDNRDVKQDLTSNGTSSTDAKQNDTSKTTGNEKSSGSGSITD 160 (293) T ss_pred hhhcccccccccccccccCCcccccccccccccccccccCCcccCCccccCCccCCCCCcCCCcccccCcccCCCcccCC Confidence 99999999999999999999999999999999888888888888888888888777777777777777777777777776 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|Aclame:pro 161 DNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTTDTTSNTTGTSTLDSDSKTSNKANTTSNDKL 240 (293) Q Consensus 161 ~~~~s~~~~dtsn~~~~~t~sn~~~~~dn~s~~~~~ns~d~~n~~~~nts~~~s~~~~ts~~~s~n~ts~s~n~~~n~~~ 240 (293) +......+.+++++....++.+..+..+.++.....++....+....+.+....+..++.......++.+.++....... T Consensus 161 ~s~t~~~ssdT~ns~~~~tssn~~~s~d~atst~d~nst~t~nsttsnnstt~~nt~~tsttt~~~nTTn~sntts~sns 240 (293) T protein:vir:52 161 DNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTTDTTSNTTGTSTLDSDSKTSNKANTTSNDKL 240 (293) T ss_pred CCCcCCccccCCcccccccccCCCCcCCcccccccccccCCcCCCccccccCCccCCcccCCCCcccccCCcccCcCCCC Confidence 66666666677776666666666666665555555555554444444444433333333334444444455555556666 Q ss_pred CCCCCCCCCCcccccCCcCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 241 NSQINSVEDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQELFMLVY 293 (293) Q Consensus 241 ~s~~n~~~~~n~s~~gn~gt~Sss~~i~~~re~~~~~~~~i~~~~~~~fm~~~ 293 (293) ....+.....++...|+.++++.+++|++||++|||||++||+||++|||||| T Consensus 241 ts~~nst~~snt~~sg~~gSvS~a~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 293 (293) T protein:vir:52 241 NSQINSVEDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQELFMLVY 293 (293) T ss_pred CCccccccccCccccccccCccchhhhhhHHHHHHhHHHHHHHHHHHHHhhcC Confidence 66777777888888999999999999999999999999999999999999999 No 2 >protein:vir:122 Length: 293 # NCBI annotation: lower collar protein # Family: family:all:5217 # MgeID: mge:4 # MgeName: B103 # Cross-refs: genbank:acc:NP_690645;swissprot:sw:q37892;genbank:gi:22855159;uniprot:Q37892;genbank:GeneID:955372 Probab=100.00 E-value=1.4e-101 Score=573.43 Aligned_cols=293 Identities=67% Similarity=1.069 Sum_probs=208.0 Q ss_pred CCcchhhHhHHHhhhhhccccccccccccccccccccCCCcccchhhhHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHH Q lcl|Aclame:pro 1 MSSYTMQLRTYIEMWSQGETGLSTAEKIEKGRPKLFDFNYPIFDESYRTIFETHFIRNFYMREIGFETEGLFKFHLETWL 80 (293) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~f~~~~~~~~~~~~ig~~t~~~fk~~l~~~~ 80 (293) |||||||||+|||||+|||+||+++||||+||||||||.||||||+|||+||+|||||||||||||||++||||||+.|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (293) T protein:vir:12 1 MASYTMKLSTYIEMWSQYETGLSMAEKIEKGRPKLFDFQYPIFDESYRKVFETHFIRNFYMREIGFETEGLFKFNLETWL 80 (293) T ss_pred CcceeehHHHHHHHHhhccCccchhhhhhhhhhhhhhccCCcccchHHHHHHHHHHHHHHHHHhhccchhHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhchHHHHHHhhhcccCCCccCCccCCCCCCCCCCCCCCCCCcCCcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCC Q lcl|Aclame:pro 81 MINMPYFNKLFESELIKYDPLENTRVGVKSNTKNDTDRNDNRDVKQDLTSNGTSSTDAKQNDTSKTTGNEKSSGSGSITD 160 (293) Q Consensus 81 ~~~m~~~~~~~e~e~~d~n~~~nt~~~s~s~s~~dts~~dn~~t~s~~t~n~ts~s~~n~~~~s~nt~n~s~~~~~~~t~ 160 (293) +|+|||||+|||+++.+++|.+++.....++..+.+...++...++....++..........................++ T Consensus 81 ~~~~~~~n~~~~s~~~~t~~~nnT~~~~tsns~~sTd~~~nt~sd~st~~ng~s~ttts~~t~~s~~t~s~~ntt~s~t~ 160 (293) T protein:vir:12 81 IINMPYFNKLFESELIKYDPLENTRLNTTGNKKNDTERNDNRDTTGSMKADGKSNTKTSDKTNATGSSKEDGKTTGSVTD 160 (293) T ss_pred hhhcchhcchhhccccccCCCcccccccccCcccccCCCCCcCcccccccccccccCCCCCccccccccCccccCCcccC Confidence 99999999999999999999999999999999888888877776666555544322221111111111111111111222 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|Aclame:pro 161 DNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTTDTTSNTTGTSTLDSDSKTSNKANTTSNDKL 240 (293) Q Consensus 161 ~~~~s~~~~dtsn~~~~~t~sn~~~~~dn~s~~~~~ns~d~~n~~~~nts~~~s~~~~ts~~~s~n~ts~s~n~~~n~~~ 240 (293) +......+.+++++....++.++....+..+.....+....++...........+..+.....++..+....+....... T Consensus 161 ~stntt~stdt~~s~t~~tt~d~~~t~d~~Ttt~d~stts~~nTt~~~ntts~sNt~~tgsttsd~nTt~~sntTssd~t 240 (293) T protein:vir:12 161 DNFNRKIDSDQPDSRLNLTTNDGQGTLEYASAIEENNTNNKRNTTGTNNVTSSAESESTGSGTSDTVTTDNANTTTNDKL 240 (293) T ss_pred CCCCCCCCcCCCCCccccccCCCCCccCCcccccccccCCCcCCCCCCccCCCCCCCCCcccCCcccccccccccccccc Confidence 22223333444444444344443333333333222222222222222222222222222233333444444455555555 Q ss_pred CCCCCCCCCCcccccCCcCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 241 NSQINSVEDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQELFMLVY 293 (293) Q Consensus 241 ~s~~n~~~~~n~s~~gn~gt~Sss~~i~~~re~~~~~~~~i~~~~~~~fm~~~ 293 (293) +...+.....+....|.++..+.+.+|+.||++|++||++||++|+||||||| T Consensus 241 ns~~nst~~~ns~stG~sgs~S~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~ 293 (293) T protein:vir:12 241 NSQINNVEDYIESKIGKSGTQSYASLVQDYRAALLRIEKRIFDEMQELFMLVY 293 (293) T ss_pred CCCCcccccccccccccccccccccccchhhhHHHHHHHHHHHHHHHHhhhcC Confidence 56666667777888899999999999999999999999999999999999999 No 3 >protein:vir:18 Length: 287 # NCBI annotation: lower collar protein # Family: family:all:5217 # MgeID: mge:323 # MgeName: GA-1 # Cross-refs: genbank:acc:NP_073694;swissprot:trembl:q9fzw4;genbank:gi:12248118;uniprot:Q9FZW4;genbank:GeneID:919889 Probab=100.00 E-value=1.7e-94 Score=534.68 Aligned_cols=286 Identities=37% Similarity=0.655 Sum_probs=206.9 Q ss_pred CCcchhhHhHHHhhhhhccccccccccccccccccccCCCcccchhhhHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHH Q lcl|Aclame:pro 1 MSSYTMQLRTYIEMWSQGETGLSTAEKIEKGRPKLFDFNYPIFDESYRTIFETHFIRNFYMREIGFETEGLFKFHLETWL 80 (293) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~f~~~~~~~~~~~~ig~~t~~~fk~~l~~~~ 80 (293) |||||||||+|||||||+|++|+|+||||+||||||||+||| |++|||+||+||||+|||||||||||++|||+|+.|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~-~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 79 (287) T protein:vir:18 1 MASFTMPLREIVEWATQFDNKLTRNEKIEEGRKKLFDFFYPI-ETDYKKEFETKFIKHFYFREIGFETEGRFKFALEEWL 79 (287) T ss_pred CcceeehHHHHHHHHhhhcccchhhHHHhhhhhhhhhhcCcc-chHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHH Confidence 999999999999999999999999999999999999999999 9999999999999999999999999999999999999 Q ss_pred HhhchHHHHHHhhhcccCCCccCCccCCCCCCCCCCCCCCCCCcCCcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCC Q lcl|Aclame:pro 81 MINMPYFNKLFESELIKYDPLENTRVGVKSNTKNDTDRNDNRDVKQDLTSNGTSSTDAKQNDTSKTTGNEKSSGSGSITD 160 (293) Q Consensus 81 ~~~m~~~~~~~e~e~~d~n~~~nt~~~s~s~s~~dts~~dn~~t~s~~t~n~ts~s~~n~~~~s~nt~n~s~~~~~~~t~ 160 (293) +++||||+|++++|+.+.++..++.....++...++...++....+....++....+.++.......++.......+.+. T Consensus 80 ~~~mp~~n~~~ese~vntN~~~nTdant~tNtD~nTt~ndn~dtdsnt~ad~ntntdtnTntd~nTtantdtntD~NTt~ 159 (287) T protein:vir:18 80 NLNMPYWNKIIESTHLDYNPLYNVDYKKDSDLIRNLDQVDNRVTDSKIENNGKASSESNVITSEKGEANSIQDADRNSTA 159 (287) T ss_pred HhhcchhhhhHhhhhccCCccccccccCCCCcccCCCCCCCcccccCcccCCCcCCCCCCCCCcCCCCCCcccccccccc Confidence 99999999999999999999999999988888888888888887777777666665555544444333333332222222 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|Aclame:pro 161 DNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTTDTTSNTTGTSTLDSDSKTSNKANTTSNDKL 240 (293) Q Consensus 161 ~~~~s~~~~dtsn~~~~~t~sn~~~~~dn~s~~~~~ns~d~~n~~~~nts~~~s~~~~ts~~~s~n~ts~s~n~~~n~~~ 240 (293) +.. ....+.+++....... .+..+..+.....++....+.......+...+..++.. .........+...+... T Consensus 160 n~~--tnt~dN~d~ntd~ntd--~nt~d~~T~~s~tnsn~~~nTd~ntnsntdtnsd~ntt--sn~~~nstsn~nsn~d~ 233 (287) T protein:vir:18 160 KKK--RMFEDTPDGRLDIVND--NNIIQYATDLTQEDSTDSVKDKIKNDSSSKNDSTGNTT--AEGKTNNITEGNNVDDF 233 (287) T ss_pred CCC--cCcccCCCCccccccC--CCCccccccCCCccccCCCCCCcccCCCccccCCCccc--cCCCCCcccCCccCCCC Confidence 211 1111222221111111 11112222222222222222222222222222222222 12222233444455566 Q ss_pred CCCCCCCCCCcccccCCcCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 241 NSQINSVEDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQELFMLVY 293 (293) Q Consensus 241 ~s~~n~~~~~n~s~~gn~gt~Sss~~i~~~re~~~~~~~~i~~~~~~~fm~~~ 293 (293) ++..+...+.+.+..+..|++|+++||++|||||||||++||+||++|||||| T Consensus 234 nSd~N~~~n~n~~s~~~~gt~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 286 (287) T protein:vir:18 234 KSEKDEKQKLNDHIYGKQGNVSYPQLIKEHREAILNVERMIFDQMEELFMFVY 286 (287) T ss_pred CCCCCccccccccccccccceeHHHHHHHHHHHHHhHHHHHHHHHHHHHhhhc Confidence 66777788889999999999999999999999999999999999999999999 No 4 >protein:vir:79392 Length: 240 # NCBI annotation: lower collar # Family: family:all:5217 # MgeID: mge:1869 # MgeName: Av-1 # Cross-refs: genbank:acc:YP_001333665;genbank:gi:151266302;genbank:GeneID:5329873 Probab=100.00 E-value=1.1e-62 Score=360.36 Aligned_cols=212 Identities=29% Similarity=0.527 Sum_probs=153.8 Q ss_pred CCcchhhHhHHHhhhhhccccccccccccccccccccCCCcccchhhhHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHH Q lcl|Aclame:pro 1 MSSYTMQLRTYIEMWSQGETGLSTAEKIEKGRPKLFDFNYPIFDESYRTIFETHFIRNFYMREIGFETEGLFKFHLETWL 80 (293) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~f~~~~~~~~~~~~ig~~t~~~fk~~l~~~~ 80 (293) |+--||-||..|.. - ..|+=-=+||||||+|||+||+||||+||||||||||+++|||+|+.|| T Consensus 1 ~~~~~~~~~~~~~~--------------~--~~~~~~~~YPIfDesYrk~FEt~Fir~FYmrEIGFETeg~Fkf~Le~wL 64 (240) T protein:vir:79 1 MSVTTIMLRDVVKL--------------T--NDHIGLDNYPIFDESYRKTLNDRIKREYWLQEIAHETIDIFIWRMSLRM 64 (240) T ss_pred CchhhhHHHHHHHh--------------h--cccccccccCccchHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHH Confidence 99999999999821 1 2233335899999999999999999999999999999999999999999 Q ss_pred HhhchHHHHHHhhhcccCCCccCCccCCCCCCCCCCCCCCCCCcCCcCCCCCCCCCCCCcCCCCCCCCCCCCCCCCCCCC Q lcl|Aclame:pro 81 MINMPYFNKLFESELIKYDPLENTRVGVKSNTKNDTDRNDNRDVKQDLTSNGTSSTDAKQNDTSKTTGNEKSSGSGSITD 160 (293) Q Consensus 81 ~~~m~~~~~~~e~e~~d~n~~~nt~~~s~s~s~~dts~~dn~~t~s~~t~n~ts~s~~n~~~~s~nt~n~s~~~~~~~t~ 160 (293) +++|||||++|++|+++++|++++...+++. +++...+++....++.++.++ T Consensus 65 ~lnMPyyNk~~esEl~~YdPLen~r~~s~T~----------------------------~d~r~~~sG~~~etG~gs~td 116 (240) T protein:vir:79 65 DLIMPRYNRMYLAELQNTDPLEGNRHYSRTG----------------------------QDGRSQNSGINHQTGSGSGTN 116 (240) T ss_pred HhhcchhHHHHHHHhhccccccccccccccC----------------------------CccceeecCcccccccccccc Confidence 9999999999999999999999888444333 122222333444555566666 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|Aclame:pro 161 DNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTTDTTSNTTGTSTLDSDSKTSNKANTTSNDKL 240 (293) Q Consensus 161 ~~~~s~~~~dtsn~~~~~t~sn~~~~~dn~s~~~~~ns~d~~n~~~~nts~~~s~~~~ts~~~s~n~ts~s~n~~~n~~~ 240 (293) ++.....+.|+|+++.+++. +.+++..+...+.+.+... +++..+++.++. T Consensus 117 dn~kr~~~sDtPDtRL~~Dg-------dYAS~isd~~t~~~s~s~~----------------dSdS~t~st~n~------ 167 (240) T protein:vir:79 117 ESKGRTVGSDTPQTRLAGDG-------DYATSISDASTGGSSTSRN----------------ESDSTSSSTSNY------ 167 (240) T ss_pred ccccccccCCCcchhhhccc-------hhhhhhhhhhcCCcccccc----------------cccccccccccc------ Confidence 67778889999999988533 5555544322222211111 111111111111 Q ss_pred CCCCCCCCCCcccccCCcCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 241 NSQINSVEDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQELFMLVY 293 (293) Q Consensus 241 ~s~~n~~~~~n~s~~gn~gt~Sss~~i~~~re~~~~~~~~i~~~~~~~fm~~~ 293 (293) +.+.+..+.|.+|+++. +|++|||||||||++||+||+||||+|| T Consensus 168 ------snNq~~~s~Gk~G~~sy--ai~eyR~alL~ve~~if~em~eLFM~vy 212 (240) T protein:vir:79 168 ------SNNQNSESWGYSGSKAR--AIAEYRSTLLNVDDLVIRELSDLFMGIW 212 (240) T ss_pred ------ccccchhhhcccchHHH--HHHHHHHHHHhHHHHHHHHHHHHhhhhc Confidence 11123344688888887 4999999999999999999999999999 No 5 >protein:vir:9604 Length: 236 # NCBI annotation: hypothetical protein # Family: family:all:5254 # MgeID: mge:172 # MgeName: C1 # Cross-refs: genbank:acc:NP_852020;genbank:gi:31072022;genbank:GeneID:1489939 Probab=99.44 E-value=8.3e-16 Score=103.25 Aligned_cols=208 Identities=17% Similarity=0.260 Sum_probs=96.3 Q ss_pred CCcchhhHhHHHh-hhhh----------------cccccccccc--ccccccccccCCCcccchhhhHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MSSYTMQLRTYIE-MWSQ----------------GETGLSTAEK--IEKGRPKLFDFNYPIFDESYRTIFETHFIRNFYM 61 (293) Q Consensus 1 ~~~~~~~~~~~~~-~~~~----------------~~~~~~~~~~--~~~~~~~~f~~~~~~~~~~~~~~f~~~~~~~~~~ 61 (293) |-=|-+--.++|. +|+. +|.+|++-+| .-.-.+++|. --++|++|++.|+.|||| T Consensus 1 m~l~~~i~~e~vk~g~~~f~~~~n~~~~~~d~~q~~~k~~~~d~dv~~vvne~if~------g~~~kEdF~~~F~~yF~~ 74 (236) T protein:vir:96 1 MRLFELIYKEVVKNGYSPFRSPENRIVVFEDKAQIETKIMMYDEDVQKVVNELIFT------GSKVNEDFREEFVNYFFN 74 (236) T ss_pred CchHHHHHHHHHhccchhhcCCCceEEEeechhHHHHHHHhhhHHHHHHHHHHhhc------cccchHHHHHHHHHHHHh Confidence 7777776666663 3433 3334433222 1111222222 235788999999999999 Q ss_pred HHhcCCcHHHHHHHHHHHHHhhch----HHHHHHhhhcccCCCccCCccCCCCCCCCCCCCCCCCCcCCcCCCCCCCCCC Q lcl|Aclame:pro 62 REIGFETEGLFKFHLETWLMINMP----YFNKLFESELIKYDPLENTRVGVKSNTKNDTDRNDNRDVKQDLTSNGTSSTD 137 (293) Q Consensus 62 ~~ig~~t~~~fk~~l~~~~~~~m~----~~~~~~e~e~~d~n~~~nt~~~s~s~s~~dts~~dn~~t~s~~t~n~ts~s~ 137 (293) |||||+|+++|..+|...|+++|| .|+++||--| +.....+.+.+.+.+.+.+... T Consensus 75 rEi~~qt~~aF~~~l~~~l~tke~~ln~iY~~s~e~ll------~E~y~~S~Ghs~~~t~n~D~t~-------------- 134 (236) T protein:vir:96 75 REPHWDSLYIFRAKLKGILKTKEAVLNMLYLKSTELLL------GESMSKSEGHSSNENRSRDNST-------------- 134 (236) T ss_pred ccCCcccHHHHHHHHHHHhhhhhhhhhhhhccchhhhh------hhhhhhccccccccccCccccc-------------- Confidence 999999999999999999999999 7888888633 2222223222222111111000 Q ss_pred CCcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|Aclame:pro 138 AKQNDTSKTTGNEKSSGSGSITDDNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTTDTTSNTT 217 (293) Q Consensus 138 ~n~~~~s~nt~n~s~~~~~~~t~~~~~s~~~~dtsn~~~~~t~sn~~~~~dn~s~~~~~ns~d~~n~~~~nts~~~s~~~ 217 (293) +. ++.-+.+.++..+-|+...+.+... ...+.+. +.+.+.+...+-+. T Consensus 135 -----------n~-------Sng~~~~~nA~~~lPd~~~d~Dv~~--snL~yAd------------N~t~s~~~tvn~s~ 182 (236) T protein:vir:96 135 -----------NE-------SNGENRGANAHSTNPDDVTDTDLET--ANLSYAD------------NLDKSYNESVNVSH 182 (236) T ss_pred -----------cc-------cccccccCCccccCCcchhcccccc--ccccccc------------cccccccccccccc Confidence 00 0000011111122222211111100 0011110 00000000000000 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcccccCCcCccccHHHHHHHHHHHHHHHHHHHHHHH-HHHhhcC Q lcl|Aclame:pro 218 GTSTLDSDSKTSNKANTTSNDKLNSQINSVEDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQ-ELFMLVY 293 (293) Q Consensus 218 ~ts~~~s~n~ts~s~n~~~n~~~~s~~n~~~~~n~s~~gn~gt~Sss~~i~~~re~~~~~~~~i~~~~~-~~fm~~~ 293 (293) +.+..++...+ .+-|..+..+.-.++.+...-+.+.|+++.- .||-.+| T Consensus 183 s~si~~s~~~~---------------------------nnkgns~gtqf~~~~ld~~~~~kqkI~~e~D~klFs~lf 232 (236) T protein:vir:96 183 SKGISSSQGSS---------------------------NNNSNSTNTQFNTKALEEYEAFKQKIFDELDIKLFSQLF 232 (236) T ss_pred ccccccccccc---------------------------cccccccccchhhHHHHHHHHHHHHhhhhhcHHHhhhhh Confidence 01111111100 0111112222334556888889999999876 4777666 No 6 >protein:vir:97357 Length: 251 # NCBI annotation: ORF008 # Family: family:all:5254 # MgeID: mge:1669 # MgeName: 66 # Cross-refs: genbank:acc:YP_239466;genbank:gi:66395195;genbank:GeneID:5130532 Probab=98.36 E-value=1.2e-08 Score=63.97 Aligned_cols=220 Identities=22% Similarity=0.345 Sum_probs=93.9 Q ss_pred CCcchhhHhHHHh------hhh---------------hcccccccccc--ccccccccccCCCcccchhhhHH-----HH Q lcl|Aclame:pro 1 MSSYTMQLRTYIE------MWS---------------QGETGLSTAEK--IEKGRPKLFDFNYPIFDESYRTI-----FE 52 (293) Q Consensus 1 ~~~~~~~~~~~~~------~~~---------------~~~~~~~~~~~--~~~~~~~~f~~~~~~~~~~~~~~-----f~ 52 (293) ||-|||.|-.+|- +|+ |+|.+|++-+| .-.-.+++|. --+++.+ |. T Consensus 1 ~~~~~M~L~d~I~~E~iK~G~~~F~~dNkl~~~dD~~Q~~~Kml~~D~DV~~iVNE~vF~------G~~~~de~~~~~Fk 74 (251) T protein:vir:97 1 MARYTMTLYDFIKSELIKKGFNEFVNDNKLTFYDDEFQFMQKMLKFDKDVLAIVNEKVFK------GFSLKDELSDLLFK 74 (251) T ss_pred CccchhHHHHHHHHHHHhccchhhhcCCceEEecchHHHHHHHHhhhHHHHHHHHHHhhc------ccccchhhhhhHHH Confidence 9999999998873 232 22233322222 1111122222 1123444 99 Q ss_pred HHHHHHHHHHHhcCCcHHHHHHHHHHHHHhhchHHHHHHhhhccc-CCCccC-CccCCCCCCCCCCCCCCCCCcCCcCCC Q lcl|Aclame:pro 53 THFIRNFYMREIGFETEGLFKFHLETWLMINMPYFNKLFESELIK-YDPLEN-TRVGVKSNTKNDTDRNDNRDVKQDLTS 130 (293) Q Consensus 53 ~~~~~~~~~~~ig~~t~~~fk~~l~~~~~~~m~~~~~~~e~e~~d-~n~~~n-t~~~s~s~s~~dts~~dn~~t~s~~t~ 130 (293) +.|..||.-|||-..++..|--+|-.-|+-+..|-+-+|.+.--. ....++ +...++.++..+.+ +. T Consensus 75 ~~F~~~F~~RE~~~~t~~~F~~q~~~v~~T~E~~LN~~Y~SsE~E~~~qSqG~~~H~~~~~~~~D~t----------sN- 143 (251) T protein:vir:97 75 KSFTIHFLDREINRQTVEAFGMQVITVCITHEDYLNVVYSSSEVEKYLQSQGFTEHNEDTTSNTDET----------SN- 143 (251) T ss_pred HHHHHHhhcccccHHHHHHHHHHHHHHhhhhHHHHHhhhhhhHHHHHHHhcCCcccccccccccccc----------cc- Confidence 999999999999999999999999999999999999999754321 000000 00000000000000 00 Q ss_pred CCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|Aclame:pro 131 NGTSSTDAKQNDTSKTTGNEKSSGSGSITDDNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTT 210 (293) Q Consensus 131 n~ts~s~~n~~~~s~nt~n~s~~~~~~~t~~~~~s~~~~dtsn~~~~~t~sn~~~~~dn~s~~~~~ns~d~~n~~~~nts 210 (293) .+.+ +-+..++- +.+..+-..-|+...+.+-.+.. ...+. ++ +-.+...-+.+ T Consensus 144 ---------q~~~------~~~~S~G~----~~~~NA~~s~P~~~~~~D~d~~~--L~~AD----N~--~~~~~Kt~N~S 196 (251) T protein:vir:97 144 ---------QNAT------SLDNSTGM----TANRNAYVSLPQSEVNIDVDNTT--LRFAD----NN--TIDNGKTVNKS 196 (251) T ss_pred ---------cccc------cccccccc----ccCccccccCccchhhcccccce--eeecc----cc--ccccccccccc Confidence 0000 00000000 00000000000000000000000 00000 00 00000000000 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcccccCCcCccccHHHHHHHHHHHHHHHHHHHHHHH-HHH Q lcl|Aclame:pro 211 DTTSNTTGTSTLDSDSKTSNKANTTSNDKLNSQINSVEDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQ-ELF 289 (293) Q Consensus 211 ~~~s~~~~ts~~~s~n~ts~s~n~~~n~~~~s~~n~~~~~n~s~~gn~gt~Sss~~i~~~re~~~~~~~~i~~~~~-~~f 289 (293) . ..+...+..+.++.++. .|++-..+..-+.-+..+-+.+.||+|.- .|| T Consensus 197 ~----------N~S~~~~~~~~~~~~N~-------------------~~~~~~~Q~~~~~id~~~~~rkKI~~E~D~K~F 247 (251) T protein:vir:97 197 S----------NESNQNAKRNQNQKGNA-------------------KGTQFTKQYLIDNIDKAYDLRKKILNEFDKKCF 247 (251) T ss_pred c----------chhhhhhhhcccccccc-------------------cccchhhhHhHHHHHHHHHHHHHHHHHHhHHHH Confidence 0 00111111111111111 11222233344556667889999999998 699 Q ss_pred hhcC Q lcl|Aclame:pro 290 MLVY 293 (293) Q Consensus 290 m~~~ 293 (293) .++| T Consensus 248 ~Qi~ 251 (251) T protein:vir:97 248 LQIW 251 (251) T ss_pred HhcC Confidence 9999 No 7 >protein:vir:9444 Length: 251 # NCBI annotation: lower collar protein # Family: family:all:5254 # MgeID: mge:168 # MgeName: phiP68 # Cross-refs: genbank:acc:NP_817334;genbank:gi:29565761;genbank:GeneID:1258938 Probab=98.36 E-value=1.2e-08 Score=63.97 Aligned_cols=220 Identities=22% Similarity=0.345 Sum_probs=93.9 Q ss_pred CCcchhhHhHHHh------hhh---------------hcccccccccc--ccccccccccCCCcccchhhhHH-----HH Q lcl|Aclame:pro 1 MSSYTMQLRTYIE------MWS---------------QGETGLSTAEK--IEKGRPKLFDFNYPIFDESYRTI-----FE 52 (293) Q Consensus 1 ~~~~~~~~~~~~~------~~~---------------~~~~~~~~~~~--~~~~~~~~f~~~~~~~~~~~~~~-----f~ 52 (293) ||-|||.|-.+|- +|+ |+|.+|++-+| .-.-.+++|. --+++.+ |. T Consensus 1 ~~~~~M~L~d~I~~E~iK~G~~~F~~dNkl~~~dD~~Q~~~Kml~~D~DV~~iVNE~vF~------G~~~~de~~~~~Fk 74 (251) T protein:vir:94 1 MARYTMTLYDFIKSELIKKGFNEFVNDNKLTFYDDEFQFMQKMLKFDKDVLAIVNEKVFK------GFSLKDELSDLLFK 74 (251) T ss_pred CccchhHHHHHHHHHHHhccchhhhcCCceEEecchHHHHHHHHhhhHHHHHHHHHHhhc------ccccchhhhhhHHH Confidence 9999999998873 232 22233322222 1111122222 1123444 99 Q ss_pred HHHHHHHHHHHhcCCcHHHHHHHHHHHHHhhchHHHHHHhhhccc-CCCccC-CccCCCCCCCCCCCCCCCCCcCCcCCC Q lcl|Aclame:pro 53 THFIRNFYMREIGFETEGLFKFHLETWLMINMPYFNKLFESELIK-YDPLEN-TRVGVKSNTKNDTDRNDNRDVKQDLTS 130 (293) Q Consensus 53 ~~~~~~~~~~~ig~~t~~~fk~~l~~~~~~~m~~~~~~~e~e~~d-~n~~~n-t~~~s~s~s~~dts~~dn~~t~s~~t~ 130 (293) +.|..||.-|||-..++..|--+|-.-|+-+..|-+-+|.+.--. ....++ +...++.++..+.+ +. T Consensus 75 ~~F~~~F~~RE~~~~t~~~F~~q~~~v~~T~E~~LN~~Y~SsE~E~~~qSqG~~~H~~~~~~~~D~t----------sN- 143 (251) T protein:vir:94 75 KSFTIHFLDREINRQTVEAFGMQVITVCITHEDYLNVVYSSSEVEKYLQSQGFTEHNEDTTSNTDET----------SN- 143 (251) T ss_pred HHHHHHhhcccccHHHHHHHHHHHHHHhhhhHHHHHhhhhhhHHHHHHHhcCCcccccccccccccc----------cc- Confidence 999999999999999999999999999999999999999754321 000000 00000000000000 00 Q ss_pred CCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|Aclame:pro 131 NGTSSTDAKQNDTSKTTGNEKSSGSGSITDDNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTT 210 (293) Q Consensus 131 n~ts~s~~n~~~~s~nt~n~s~~~~~~~t~~~~~s~~~~dtsn~~~~~t~sn~~~~~dn~s~~~~~ns~d~~n~~~~nts 210 (293) .+.+ +-+..++- +.+..+-..-|+...+.+-.+.. ...+. ++ +-.+...-+.+ T Consensus 144 ---------q~~~------~~~~S~G~----~~~~NA~~s~P~~~~~~D~d~~~--L~~AD----N~--~~~~~Kt~N~S 196 (251) T protein:vir:94 144 ---------QNAT------SLDNSTGM----TANRNAYVSLPQSEVNIDVDNTT--LRFAD----NN--TIDNGKTVNKS 196 (251) T ss_pred ---------cccc------cccccccc----ccCccccccCccchhhcccccce--eeecc----cc--ccccccccccc Confidence 0000 00000000 00000000000000000000000 00000 00 00000000000 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcccccCCcCccccHHHHHHHHHHHHHHHHHHHHHHH-HHH Q lcl|Aclame:pro 211 DTTSNTTGTSTLDSDSKTSNKANTTSNDKLNSQINSVEDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQ-ELF 289 (293) Q Consensus 211 ~~~s~~~~ts~~~s~n~ts~s~n~~~n~~~~s~~n~~~~~n~s~~gn~gt~Sss~~i~~~re~~~~~~~~i~~~~~-~~f 289 (293) . ..+...+..+.++.++. .|++-..+..-+.-+..+-+.+.||+|.- .|| T Consensus 197 ~----------N~S~~~~~~~~~~~~N~-------------------~~~~~~~Q~~~~~id~~~~~rkKI~~E~D~K~F 247 (251) T protein:vir:94 197 S----------NESNQNAKRNQNQKGNA-------------------KGTQFTKQYLIDNIDKAYDLRKKILNEFDKKCF 247 (251) T ss_pred c----------chhhhhhhhcccccccc-------------------cccchhhhHhHHHHHHHHHHHHHHHHHHhHHHH Confidence 0 00111111111111111 11222233344556667889999999998 699 Q ss_pred hhcC Q lcl|Aclame:pro 290 MLVY 293 (293) Q Consensus 290 m~~~ 293 (293) .++| T Consensus 248 ~Qi~ 251 (251) T protein:vir:94 248 LQIW 251 (251) T ss_pred HhcC Confidence 9999 No 8 >protein:vir:9465 Length: 251 # NCBI annotation: lower collar protein # Family: family:all:5254 # MgeID: mge:169 # MgeName: 44AHJD # Cross-refs: genbank:acc:NP_817312;genbank:gi:29565738;genbank:GeneID:1258928 Probab=98.36 E-value=1.2e-08 Score=63.97 Aligned_cols=220 Identities=22% Similarity=0.345 Sum_probs=93.9 Q ss_pred CCcchhhHhHHHh------hhh---------------hcccccccccc--ccccccccccCCCcccchhhhHH-----HH Q lcl|Aclame:pro 1 MSSYTMQLRTYIE------MWS---------------QGETGLSTAEK--IEKGRPKLFDFNYPIFDESYRTI-----FE 52 (293) Q Consensus 1 ~~~~~~~~~~~~~------~~~---------------~~~~~~~~~~~--~~~~~~~~f~~~~~~~~~~~~~~-----f~ 52 (293) ||-|||.|-.+|- +|+ |+|.+|++-+| .-.-.+++|. --+++.+ |. T Consensus 1 ~~~~~M~L~d~I~~E~iK~G~~~F~~dNkl~~~dD~~Q~~~Kml~~D~DV~~iVNE~vF~------G~~~~de~~~~~Fk 74 (251) T protein:vir:94 1 MARYTMTLYDFIKSELIKKGFNEFVNDNKLTFYDDEFQFMQKMLKFDKDVLAIVNEKVFK------GFSLKDELSDLLFK 74 (251) T ss_pred CccchhHHHHHHHHHHHhccchhhhcCCceEEecchHHHHHHHHhhhHHHHHHHHHHhhc------ccccchhhhhhHHH Confidence 9999999998873 232 22233322222 1111122222 1123444 99 Q ss_pred HHHHHHHHHHHhcCCcHHHHHHHHHHHHHhhchHHHHHHhhhccc-CCCccC-CccCCCCCCCCCCCCCCCCCcCCcCCC Q lcl|Aclame:pro 53 THFIRNFYMREIGFETEGLFKFHLETWLMINMPYFNKLFESELIK-YDPLEN-TRVGVKSNTKNDTDRNDNRDVKQDLTS 130 (293) Q Consensus 53 ~~~~~~~~~~~ig~~t~~~fk~~l~~~~~~~m~~~~~~~e~e~~d-~n~~~n-t~~~s~s~s~~dts~~dn~~t~s~~t~ 130 (293) +.|..||.-|||-..++..|--+|-.-|+-+..|-+-+|.+.--. ....++ +...++.++..+.+ +. T Consensus 75 ~~F~~~F~~RE~~~~t~~~F~~q~~~v~~T~E~~LN~~Y~SsE~E~~~qSqG~~~H~~~~~~~~D~t----------sN- 143 (251) T protein:vir:94 75 KSFTIHFLDREINRQTVEAFGMQVITVCITHEDYLNVVYSSSEVEKYLQSQGFTEHNEDTTSNTDET----------SN- 143 (251) T ss_pred HHHHHHhhcccccHHHHHHHHHHHHHHhhhhHHHHHhhhhhhHHHHHHHhcCCcccccccccccccc----------cc- Confidence 999999999999999999999999999999999999999754321 000000 00000000000000 00 Q ss_pred CCCCCCCCCcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|Aclame:pro 131 NGTSSTDAKQNDTSKTTGNEKSSGSGSITDDNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTT 210 (293) Q Consensus 131 n~ts~s~~n~~~~s~nt~n~s~~~~~~~t~~~~~s~~~~dtsn~~~~~t~sn~~~~~dn~s~~~~~ns~d~~n~~~~nts 210 (293) .+.+ +-+..++- +.+..+-..-|+...+.+-.+.. ...+. ++ +-.+...-+.+ T Consensus 144 ---------q~~~------~~~~S~G~----~~~~NA~~s~P~~~~~~D~d~~~--L~~AD----N~--~~~~~Kt~N~S 196 (251) T protein:vir:94 144 ---------QNAT------SLDNSTGM----TANRNAYVSLPQSEVNIDVDNTT--LRFAD----NN--TIDNGKTVNKS 196 (251) T ss_pred ---------cccc------cccccccc----ccCccccccCccchhhcccccce--eeecc----cc--ccccccccccc Confidence 0000 00000000 00000000000000000000000 00000 00 00000000000 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCcccccCCcCccccHHHHHHHHHHHHHHHHHHHHHHH-HHH Q lcl|Aclame:pro 211 DTTSNTTGTSTLDSDSKTSNKANTTSNDKLNSQINSVEDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQ-ELF 289 (293) Q Consensus 211 ~~~s~~~~ts~~~s~n~ts~s~n~~~n~~~~s~~n~~~~~n~s~~gn~gt~Sss~~i~~~re~~~~~~~~i~~~~~-~~f 289 (293) . ..+...+..+.++.++. .|++-..+..-+.-+..+-+.+.||+|.- .|| T Consensus 197 ~----------N~S~~~~~~~~~~~~N~-------------------~~~~~~~Q~~~~~id~~~~~rkKI~~E~D~K~F 247 (251) T protein:vir:94 197 S----------NESNQNAKRNQNQKGNA-------------------KGTQFTKQYLIDNIDKAYDLRKKILNEFDKKCF 247 (251) T ss_pred c----------chhhhhhhhcccccccc-------------------cccchhhhHhHHHHHHHHHHHHHHHHHHhHHHH Confidence 0 00111111111111111 11222233344556667889999999998 699 Q ss_pred hhcC Q lcl|Aclame:pro 290 MLVY 293 (293) Q Consensus 290 m~~~ 293 (293) .++| T Consensus 248 ~Qi~ 251 (251) T protein:vir:94 248 LQIW 251 (251) T ss_pred HhcC Confidence 9999 No 9 >protein:vir:5206 Length: 293 # NCBI annotation: lower collar protein # Family: family:all:5217 # MgeID: mge:116 # MgeName: PZA # Cross-refs: genbank:acc:NP_040729;genbank:gi:9626400;genbank:GeneID:1260971 Probab=97.85 E-value=7.3e-06 Score=48.73 Aligned_cols=269 Identities=12% Similarity=0.119 Sum_probs=80.9 Q ss_pred CCc-chhhHhHHHhhhhhccccccccccccccccccccCCCc-ccchhhhHHHHHHHHHHHHHHHhcCCc--HHHHHHHH Q lcl|Aclame:pro 1 MSS-YTMQLRTYIEMWSQGETGLSTAEKIEKGRPKLFDFNYP-IFDESYRTIFETHFIRNFYMREIGFET--EGLFKFHL 76 (293) Q Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~-~~~~~~~~~f~~~~~~~~~~~~ig~~t--~~~fk~~l 76 (293) |+| |--+|.+- ..-|.+ .+|+=+=...|||=.|. +|+.-+++.| |+|+--|.-+|-=- .++|.-.+ T Consensus 14 ~~~~~~~~~~~~----~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~ 83 (293) T protein:vir:52 14 MWSQGETGLSTA----EKIEKG---RPKLFDFNYPIFDESYRTIFETHFIRNF---YMREIGFETEGLFKFHLETWLMIN 83 (293) T ss_pred hhhhcCCccccc----chhhhh---hhhhhccCCCccchhHHHHHHHHHHHHH---HHHHhhccchHHHHHHHHHHHhhh Confidence 221 21111111 111111 11222234455665553 3343333333 23433333333111 35676666 Q ss_pred HHHHHhhchHHHHHHhhhcccCC-CccCCccCCCCCCCCCCCCCC------CCCcCCcCCCCCCCCCCCCcCCCCCCCCC Q lcl|Aclame:pro 77 ETWLMINMPYFNKLFESELIKYD-PLENTRVGVKSNTKNDTDRND------NRDVKQDLTSNGTSSTDAKQNDTSKTTGN 149 (293) Q Consensus 77 ~~~~~~~m~~~~~~~e~e~~d~n-~~~nt~~~s~s~s~~dts~~d------n~~t~s~~t~n~ts~s~~n~~~~s~nt~n 149 (293) |=+.+.. |..++.+....+.. ....+...........+.... ...+.+..........++.....+..+.. T Consensus 84 ~~~~n~~--~~s~~~nt~p~dNt~~~~ttn~~~sTs~d~st~tdss~t~d~~stTds~t~~~s~~tanst~ta~snsT~~ 161 (293) T protein:vir:52 84 MPYFNKL--FESELIKYDPLENTRVGVKSNTKNDTDRNDNRDVKQDLTSNGTSSTDAKQNDTSKTTGNEKSSGSGSITDD 161 (293) T ss_pred ccccccc--ccccccccCCcccccccccccccccccccCCcccCCccccCCccCCCCCcCCCcccccCcccCCCcccCCC Confidence 6666543 44555555333322 222222111111111111111 11111111111111111111111111111 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|Aclame:pro 150 EKSSGSGSITDDNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTTDTTSNTTGTSTLDSDSKTS 229 (293) Q Consensus 150 ~s~~~~~~~t~~~~~s~~~~dtsn~~~~~t~sn~~~~~dn~s~~~~~ns~d~~n~~~~nts~~~s~~~~ts~~~s~n~ts 229 (293) .........+.++.......+........+.....+....+......++....+..... .....++..... +... T Consensus 162 s~t~~~ssdT~ns~~~~tssn~~~s~d~atst~d~nst~t~nsttsnnstt~~nt~~ts----ttt~~~nTTn~s-ntts 236 (293) T protein:vir:52 162 NFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTTDTTSNTTGTS----TLDSDSKTSNKA-NTTS 236 (293) T ss_pred CCcCCccccCCcccccccccCCCCcCCcccccccccccCCcCCCccccccCCccCCccc----CCCCcccccCCc-ccCc Confidence 11111111111111111111111110000000000111111111111111111111000 000111111111 1111 Q ss_pred CCCCCCCCCCCCCCCCCCCCCcccccCCcCccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 230 NKANTTSNDKLNSQINSVEDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQ 286 (293) Q Consensus 230 ~s~n~~~n~~~~s~~n~~~~~n~s~~gn~gt~Sss~~i~~~re~~~~~~~~i~~~~~ 286 (293) ........+..............+..--.....+-++|....+.+.+-.+.||-.++ T Consensus 237 ~snsts~~nst~~snt~~sg~~gSvS~a~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 293 (293) T protein:vir:52 237 NDKLNSQINSVEDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQELFMLVY 293 (293) T ss_pred CCCCCCccccccccCccccccccCccchhhhhhHHHHHHhHHHHHHHHHHHHHhhcC Confidence 111111111111111111122223334456668888999999999999999999888 No 10 >protein:vir:122 Length: 293 # NCBI annotation: lower collar protein # Family: family:all:5217 # MgeID: mge:4 # MgeName: B103 # Cross-refs: genbank:acc:NP_690645;swissprot:sw:q37892;genbank:gi:22855159;uniprot:Q37892;genbank:GeneID:955372 Probab=95.03 E-value=0.0029 Score=34.48 Aligned_cols=268 Identities=13% Similarity=0.130 Sum_probs=50.1 Q ss_pred CCc-chhhHhHHHhhhhhccccccccccccccccccccCCCc-ccchhhhHHHHHHHHHHHHHHHhcCCc--HHHHHHHH Q lcl|Aclame:pro 1 MSS-YTMQLRTYIEMWSQGETGLSTAEKIEKGRPKLFDFNYP-IFDESYRTIFETHFIRNFYMREIGFET--EGLFKFHL 76 (293) Q Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~-~~~~~~~~~f~~~~~~~~~~~~ig~~t--~~~fk~~l 76 (293) |+| |--+|..- -.-|.+ .+|+=+-+--|||=.|. +|+.-+++.| |+|.--|.-+|-=- .+.|..+- T Consensus 14 ~~~~~~~~~~~~----~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~ 83 (293) T protein:vir:12 14 MWSQYETGLSMA----EKIEKG---RPKLFDFQYPIFDESYRKVFETHFIRNF---YMREIGFETEGLFKFNLETWLIIN 83 (293) T ss_pred HHhhccCccchh----hhhhhh---hhhhhhccCCcccchHHHHHHHHHHHHH---HHHHhhccchhHHHHHHHHHHhhh Confidence 332 22233220 001111 12222223334554442 2233322222 22322222222111 12222222 Q ss_pred HHHHHhhchHHHHHHhhhcccCC--CccCCccCCCCCCCCCCCCCCCCCc--CCcCCCCCCCCCCCCcCCCCCCCCCCCC Q lcl|Aclame:pro 77 ETWLMINMPYFNKLFESELIKYD--PLENTRVGVKSNTKNDTDRNDNRDV--KQDLTSNGTSSTDAKQNDTSKTTGNEKS 152 (293) Q Consensus 77 ~~~~~~~m~~~~~~~e~e~~d~n--~~~nt~~~s~s~s~~dts~~dn~~t--~s~~t~n~ts~s~~n~~~~s~nt~n~s~ 152 (293) |=|. -..|----.......+ ................+....+... .+.....+.....+.........+.... T Consensus 84 ~~~~---n~~~~s~~~~t~~~nnT~~~~tsns~~sTd~~~nt~sd~st~~ng~s~ttts~~t~~s~~t~s~~ntt~s~t~ 160 (293) T protein:vir:12 84 MPYF---NKLFESELIKYDPLENTRLNTTGNKKNDTERNDNRDTTGSMKADGKSNTKTSDKTNATGSSKEDGKTTGSVTD 160 (293) T ss_pred cchh---cchhhccccccCCCcccccccccCcccccCCCCCcCcccccccccccccCCCCCccccccccCccccCCcccC Confidence 2222 2222221111111111 1111111111111111100000000 0001111111111111110000000000 Q ss_pred C----CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|Aclame:pro 153 S----GSGSITDDNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTTDTTSNTTGTSTLDSDSKT 228 (293) Q Consensus 153 ~----~~~~~t~~~~~s~~~~dtsn~~~~~t~sn~~~~~dn~s~~~~~ns~d~~n~~~~nts~~~s~~~~ts~~~s~n~t 228 (293) . ..+..+.+........+........+.....+...........+.....+.. ........+++........ T Consensus 161 ~stntt~stdt~~s~t~~tt~d~~~t~d~~Ttt~d~stts~~nTt~~~ntts~sNt~----~tgsttsd~nTt~~sntTs 236 (293) T protein:vir:12 161 DNFNRKIDSDQPDSRLNLTTNDGQGTLEYASAIEENNTNNKRNTTGTNNVTSSAESE----STGSGTSDTVTTDNANTTT 236 (293) T ss_pred CCCCCCCCcCCCCCccccccCCCCCccCCcccccccccCCCcCCCCCCccCCCCCCC----CCcccCCcccccccccccc Confidence 0 0000000000000000000000000000000000000000000000000000 0000000011111110000 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCcccccCCcCccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 229 SNKANTTSNDKLNSQINSVEDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQ 286 (293) Q Consensus 229 s~s~n~~~n~~~~s~~n~~~~~n~s~~gn~gt~Sss~~i~~~re~~~~~~~~i~~~~~ 286 (293) ....+...+.......... ........-.-...+-+.|..--.+|.+-.+.||--++ T Consensus 237 sd~tns~~nst~~~ns~st-G~sgs~S~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~ 293 (293) T protein:vir:12 237 NDKLNSQINNVEDYIESKI-GKSGTQSYASLVQDYRAALLRIEKRIFDEMQELFMLVY 293 (293) T ss_pred ccccCCCCccccccccccc-ccccccccccccchhhhHHHHHHHHHHHHHHHHhhhcC Confidence 0000111100000000000 00011111112223444455555556666666666555 No 11 >protein:vir:18 Length: 287 # NCBI annotation: lower collar protein # Family: family:all:5217 # MgeID: mge:323 # MgeName: GA-1 # Cross-refs: genbank:acc:NP_073694;swissprot:trembl:q9fzw4;genbank:gi:12248118;uniprot:Q9FZW4;genbank:GeneID:919889 Probab=93.92 E-value=0.006 Score=32.78 Aligned_cols=256 Identities=13% Similarity=0.129 Sum_probs=49.0 Q ss_pred hhhccccc----cccccccccccccccCCCcccchhhhHHHHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHhh------- Q lcl|Aclame:pro 15 WSQGETGL----STAEKIEKGRPKLFDFNYPIFDESYRTIFETHFIRNFYMREIGFETEGLFKFHLETWLMIN------- 83 (293) Q Consensus 15 ~~~~~~~~----~~~~~~~~~~~~~f~~~~~~~~~~~~~~f~~~~~~~~~~~~ig~~t~~~fk~~l~~~~~~~------- 83 (293) .+-|--|| -+...||.|- --.. .| ++.-+|-| --+| +| .+-||-.|+.++.|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~---~~~~-~~-~~~~~~~~----~~~y---~~----~~~~~~~~~~~~~~~~~~~~i~ 64 (287) T protein:vir:18 1 MASFTMPLREIVEWATQFDNKL---TRNE-KI-EEGRKKLF----DFFY---PI----ETDYKKEFETKFIKHFYFREIG 64 (287) T ss_pred CcceeehHHHHHHHHhhhcccc---hhhH-HH-hhhhhhhh----hhcC---cc----chHHHHHHHHHHHHHHHHHHHh Confidence 22333344 2233344442 1111 02 33333333 2111 22 444444444444332 Q ss_pred ------chHHHHHHhhhcccC-CC-ccCCccCCCCCCCCCCCCCCCCCcCCcCCCCCCCCCCCCcCCCCCCCCCCCCCCC Q lcl|Aclame:pro 84 ------MPYFNKLFESELIKY-DP-LENTRVGVKSNTKNDTDRNDNRDVKQDLTSNGTSSTDAKQNDTSKTTGNEKSSGS 155 (293) Q Consensus 84 ------m~~~~~~~e~e~~d~-n~-~~nt~~~s~s~s~~dts~~dn~~t~s~~t~n~ts~s~~n~~~~s~nt~n~s~~~~ 155 (293) ..|+-++|-.-.-.+ +. .+......+.. ..+..+.....+..+..+.....++................. T Consensus 65 ~~~~~~~~~~~~~~~~~~mp~~n~~~ese~vntN~~--~nTdant~tNtD~nTt~ndn~dtdsnt~ad~ntntdtnTntd 142 (287) T protein:vir:18 65 FETEGRFKFALEEWLNLNMPYWNKIIESTHLDYNPL--YNVDYKKDSDLIRNLDQVDNRVTDSKIENNGKASSESNVITS 142 (287) T ss_pred hhhHHHHHHHHHHHHHhhcchhhhhHhhhhccCCcc--ccccccCCCCcccCCCCCCCcccccCcccCCCcCCCCCCCCC Confidence 233333333222110 00 11111111111 111111111111222222111111111111111111110001 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|Aclame:pro 156 GSITDDNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTTDTTSNTTGTSTLDSDSKTSNKANTT 235 (293) Q Consensus 156 ~~~t~~~~~s~~~~dtsn~~~~~t~sn~~~~~dn~s~~~~~ns~d~~n~~~~nts~~~s~~~~ts~~~s~n~ts~s~n~~ 235 (293) ...+.+.... .+ .+...........+...........+..+.........+........+.......+........ T Consensus 143 ~nTtantdtn---tD-~NTt~n~~tnt~dN~d~ntd~ntd~nt~d~~T~~s~tnsn~~~nTd~ntnsntdtnsd~nttsn 218 (287) T protein:vir:18 143 EKGEANSIQD---AD-RNSTAKKKRMFEDTPDGRLDIVNDNNIIQYATDLTQEDSTDSVKDKIKNDSSSKNDSTGNTTAE 218 (287) T ss_pred cCCCCCCccc---cc-cccccCCCcCcccCCCCccccccCCCCccccccCCCccccCCCCCCcccCCCccccCCCccccC Confidence 1000000000 00 0000000000101111000000000110111110001111111111111111111111100011 Q ss_pred CCCCCCCCCCCCCCCcc----cccCCcCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 236 SNDKLNSQINSVEDYIE----DRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQELFMLVY 293 (293) Q Consensus 236 ~n~~~~s~~n~~~~~n~----s~~gn~gt~Sss~~i~~~re~~~~~~~~i~~~~~~~fm~~~ 293 (293) .........+.....+. ....+...... +.-+-|...+---+.-|+.--|-+|-.+= T Consensus 219 ~~~nstsn~nsn~d~nSd~N~~~n~n~~s~~~-~gt~s~~~~~~~~~~~~~~~~~~~~~~~~ 279 (287) T protein:vir:18 219 GKTNNITEGNNVDDFKSEKDEKQKLNDHIYGK-QGNVSYPQLIKEHREAILNVERMIFDQME 279 (287) T ss_pred CCCCcccCCccCCCCCCCCCcccccccccccc-ccceeHHHHHHHHHHHHHhHHHHHHHHHH Confidence 11111111111111111 11111111111 11112222222222222222222332221 No 12 >protein:vir:4733 Length: 194 # NCBI annotation: collar protein # Family: family:all:28811 # MgeID: mge:103 # MgeName: Cp-1 # Cross-refs: genbank:acc:NP_044824;swissprot:trembl:q37996;genbank:gi:9629536;uniprot:Q37996;genbank:GeneID:1261240 Probab=93.21 E-value=0.0016 Score=35.86 Aligned_cols=179 Identities=28% Similarity=0.394 Sum_probs=69.7 Q ss_pred chhhHhHHHhhhhhccccccccccccccccccccCCC-cccchhhhH-------------HHHHHHHHHHHHHHhcCCcH Q lcl|Aclame:pro 4 YTMQLRTYIEMWSQGETGLSTAEKIEKGRPKLFDFNY-PIFDESYRT-------------IFETHFIRNFYMREIGFETE 69 (293) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~-~~~~~~~~~-------------~f~~~~~~~~~~~~ig~~t~ 69 (293) .|=.|. +..- =|.|..- +| .|.|+-|.. -|.+.|.+|||.||||.||+ T Consensus 1 mtgrld----glav----------dengefl----hyntiidqtynelfkdmelvngvsdnfkkefckhfynreigletf 62 (194) T protein:vir:47 1 MTGRLD----GLAV----------DENGEFL----HYNTIIDQTYNELFKDMELVNGVSDNFKKEFCKHFYNREIGLETF 62 (194) T ss_pred Cccccc----ceee----------cCCCcee----ehhhHHHhhHHHHHHHHHHhhhhhhhHHHHHHHHHhcchhhhhHH Confidence 000111 0000 1333321 22 233444433 46788999999999999999 Q ss_pred HHHHHHHHHHHHhh-chHHHHHHhhhcccCCCccCCccCCCCCCCCCCCCCCCCCcCCcCCCCCCCCCCCCcCCCCCCCC Q lcl|Aclame:pro 70 GLFKFHLETWLMIN-MPYFNKLFESELIKYDPLENTRVGVKSNTKNDTDRNDNRDVKQDLTSNGTSSTDAKQNDTSKTTG 148 (293) Q Consensus 70 ~~fk~~l~~~~~~~-m~~~~~~~e~e~~d~n~~~nt~~~s~s~s~~dts~~dn~~t~s~~t~n~ts~s~~n~~~~s~nt~ 148 (293) ++|.++|+.-|+.- .-.|.-+-|+..--- .+-...-+.+ +-+++...+.. T Consensus 63 arfqialeevlnnecfnlfkylaeirnkai---------------kdlnqsmnid------------tvgnqkadgqa-- 113 (194) T protein:vir:47 63 ARFQIALEEVLNNECFNLFKYLAEIRNKAI---------------KDLNQSMNID------------TVGNQKADGQA-- 113 (194) T ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHhHHH---------------Hhhhhhcccc------------cccccccccce-- Confidence 99999999988642 222332333222000 0000000000 00001000000 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q lcl|Aclame:pro 149 NEKSSGSGSITDDNFKRDLNADTADDRLQLTTKDGEGVLEYASQIEEHNENKKRDTKTSNTTDTTSNTTGTSTLDSDSKT 228 (293) Q Consensus 149 n~s~~~~~~~t~~~~~s~~~~dtsn~~~~~t~sn~~~~~dn~s~~~~~ns~d~~n~~~~nts~~~s~~~~ts~~~s~n~t 228 (293) - ...+.+|..+...--..--+....+. +- -.+.. T Consensus 114 --l--------------qianttpqerkeivfterygvieyad----------------nl--------------venhq 147 (194) T protein:vir:47 114 --L--------------QIANTTPQERKEIVFTERYGVIEYAD----------------NL--------------VENHQ 147 (194) T ss_pred --e--------------eeccCChhhhhhhhhhhhcchhHHHH----------------HH--------------Hhhhh Confidence 0 00011222211110000000000000 00 00000 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCcccccCCcCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 229 SNKANTTSNDKLNSQINSVEDYIEDRVGKIGTQSYARLVMDYREALLRIEQRIFNEMQELFMLVY 293 (293) Q Consensus 229 s~s~n~~~n~~~~s~~n~~~~~n~s~~gn~gt~Sss~~i~~~re~~~~~~~~i~~~~~~~fm~~~ 293 (293) .+. .++ . ....|=+ .+|.++-++ ....|-.|.-+||.-|..||++|| T Consensus 148 knn--adt----k----------snvsgws-gsslaerlq-rnaelkdiqfqifnicdklflqvf 194 (194) T protein:vir:47 148 KNN--ADT----K----------SNVSGWS-GSSLAERLQ-RNAELKDIQFQIFNICDKLFLQVF 194 (194) T ss_pred ccc--ccc----c----------ccccccc-chhHHHHHh-hccchhhhHHHHHHHHHHHHHhcC Confidence 000 000 0 0001111 113333332 345567788899999999999999 Done!