Query lcl|NC_016655.1_cdsid_YP_005087331.1 [gene=RoPhREQ1_gp79] [protein=hypothetical protein] [protein_id=YP_005087331.1] [location=42994..46974] Match_columns 1326 No_of_seqs 1101 out of 2699 Neff 9.4 Searched_HMMs 1612 Date Thu Nov 7 13:40:58 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_79 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_79_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:104739 Length: 470 92.4 7.1E-05 4.4E-08 43.3 -1.1 149 1138-1326 1-173 (470) 2 protein:vir:103005 Length: 390 91.0 0.0005 3.1E-07 38.7 2.0 157 1133-1326 1-178 (390) 3 protein:vir:96783 Length: 488 63.2 0.012 7.6E-06 31.1 -0.5 162 1152-1326 1-210 (488) 4 protein:vir:122 Length: 293 # 52.2 0.068 4.2E-05 27.0 1.6 89 1205-1326 1-94 (293) 5 protein:vir:94956 Length: 452 50.3 0.0036 2.2E-06 34.0 -5.7 144 1161-1326 1-166 (452) 6 protein:vir:103955 Length: 324 41.0 0.16 0.0001 24.9 1.8 68 1256-1326 1-70 (324) 7 protein:vir:97148 Length: 324 34.6 0.34 0.00021 23.2 2.6 65 1256-1326 1-70 (324) 8 protein:vir:96223 Length: 324 31.0 0.43 0.00026 22.6 2.5 65 1256-1326 1-70 (324) 9 protein:vir:99749 Length: 324 27.3 0.52 0.00032 22.2 2.2 65 1256-1326 1-70 (324) 10 protein:vir:95449 Length: 584 27.0 0.16 0.0001 24.9 -0.6 121 1156-1326 1-137 (584) 11 protein:vir:99920 Length: 311 26.4 2 0.0012 19.0 5.5 194 1119-1326 1-275 (311) 12 protein:vir:18 Length: 287 # N 25.4 0.33 0.0002 23.3 0.8 86 1205-1326 1-93 (287) 13 protein:vir:9309 Length: 324 # 25.3 0.54 0.00033 22.1 2.0 69 1256-1326 1-70 (324) 14 protein:vir:106986 Length: 292 25.0 0.061 3.8E-05 27.2 -3.3 123 1138-1326 1-163 (292) 15 protein:vir:4997 Length: 397 # 24.7 1.5 0.00094 19.6 4.3 106 1210-1326 1-152 (397) 16 protein:vir:5206 Length: 293 # 21.3 0.58 0.00036 21.9 1.3 88 1205-1326 1-102 (293) 17 protein:vir:99072 Length: 479 20.7 0.69 0.00043 21.5 1.6 100 1206-1326 1-131 (479) No 1 >protein:vir:104739 Length: 470 # NCBI annotation: T4-like neck protein # Family: family:all:1104 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214353;genbank:gi:61805993;genbank:GeneID:3294259 Probab=92.43 E-value=7.1e-05 Score=43.34 Aligned_cols=149 Identities=8% Similarity=-0.031 Sum_probs=70.6 Q ss_pred ccceecccccccCCceEEEEEecccccccccccccceeEEecCccccccccCCc-CCEEEEEcCCeEEEEEEcCCCC--- Q lcl|NC_016655. 1138 LPGLKSNPPAFEPRSGTALYTDHAPIPLTNRKYRARTVSLGLGGDLFVSEWGPD-SPEFTYEAENYWLKDISNPNNN--- 1213 (1326) Q Consensus 1138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~--- 1213 (1326) ||-.+.++ .+|.++--| ++=|..|+-+++-..|-| |=++ +.|.|.= |.+..=+|+|-|-+.+|..|-. T Consensus 1 ~~~~~~~~---~~~~~~~~~--~~~~~~e~~~~~g~~~~~-~~~~--~~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~ 72 (470) T protein:vir:10 1 MALNPFFL---QGTSSEQRL--TQDLINEHLKIYGVEVTY-IPRK--YVNTKSIIEEVQSSKFDDNFAIEAYVNTYEGYG 72 (470) T ss_pred CcccceeE---cCCCchhHH--HHHHHHHHhHhccceEEE-echh--hcccccccccccccccccceeEEEEeecccCcC Confidence 33211111 122222110 111222221111111111 1122 2234443 3445567889999999998832 Q ss_pred --hhH-----------------HHHHHHHHHHHHHhhhhcCCccCCccceeecccccceeeecCCCCCHHHHHHHHHHHH Q lcl|NC_016655. 1214 --LRL-----------------KVAWDKTQVAKTNSATVFQPLGSDLPVVLSEGYKGDTFTLKLTPVNHEERAELRKMLT 1274 (1326) Q Consensus 1214 --~~~-----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1274 (1326) .+| |++||..-..||++|...++|-= |+ ++.++||. T Consensus 73 ~~~~~~~~fg~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~-----~~-------~~~~~p~e------------- 127 (470) T protein:vir:10 73 GQGDVLTKFGMSIRDEVTLTISKERFEDFIAPFMAGLDDGPGGNE-----EV-------ILATRPRE------------- 127 (470) T ss_pred CcceeeeecCcccceEEEEEECCccccccccchhhcccCcccccc-----cc-------cccCCccc------------- Confidence 111 46888888888888776555422 22 23333332 Q ss_pred hh-HhhcCCCCCceeEeeeccchhhhhhhcchhcccChHHHHHhhhhhccCCC Q lcl|NC_016655. 1275 SG-RTLFLQSDIDDAWWVRPVGDLIEDVLPTYNRQSNPLREITCQFVQVAPAE 1326 (1326) Q Consensus 1275 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1326 (1326) | -..||-.. +-|=.+. ||+=-|.|-+..+..|++.|+++.+...+ T Consensus 128 -gdl~~~p~~~--~~~~i~~----ve~~~p~~~~G~~~~~~it~~~f~ysge~ 173 (470) T protein:vir:10 128 -GDLVFFPLGS--RLFEVKF----VEHEDPFYQLGKNYVYQLKCELFEYEDEV 173 (470) T ss_pred -ccEEEecCCC--CEEEEEe----cCCCCcchhcCcceeEEeeeceeEecCCc Confidence 2 11133321 1111121 24556778888888899999987776655 No 2 >protein:vir:103005 Length: 390 # NCBI annotation: gp110 # Family: family:all:1104 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717777;genbank:gi:113200614;genbank:GeneID:4239009 Probab=90.95 E-value=0.0005 Score=38.70 Aligned_cols=157 Identities=15% Similarity=0.051 Sum_probs=54.2 Q ss_pred ceeecccceeccc--ccccCCceEEEEEeccc-------ccccccccccceeEEecCccccccccCCcC-CEEEEEcCCe Q lcl|NC_016655. 1133 SDWADLPGLKSNP--PAFEPRSGTALYTDHAP-------IPLTNRKYRARTVSLGLGGDLFVSEWGPDS-PEFTYEAENY 1202 (1326) Q Consensus 1133 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 1202 (1326) -.|.+-|+|.+.. -...+| .-|.+|.. |..|+=+++-..|-| |=++ +.|.|.== .+.-=+|++- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~q~l~~~lv~e~i~~~g~~~~y-~~r~--~~~~d~~~~e~~~~~f~~~ 74 (390) T protein:vir:10 1 MTYSNDPPNNCIQSDYTSSCR---LNLNGSAQEQTFMENLIVESIELYGQNVYY-LPRI--YVNRDTILNEVETSRFEQA 74 (390) T ss_pred CeecCCCcccceecceeeccE---EEEeccCchhHHHHHHHHHHhHhcCceEEE-echh--eeccccccccccccccccc Confidence 2233333333200 011111 12222222 222222222112111 2122 22333222 2344467777 Q ss_pred EEEEEEcCCCC-----hhHH----HHHHHHHHHHHHhhhhcCCccCCccceeecccccceeeecCCCCCHHHHHHHHHHH Q lcl|NC_016655. 1203 WLKDISNPNNN-----LRLK----VAWDKTQVAKTNSATVFQPLGSDLPVVLSEGYKGDTFTLKLTPVNHEERAELRKML 1273 (1326) Q Consensus 1203 ~~~~~~~~~~~-----~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1273 (1326) |-+.+|+.|-. .+|- +|= .|-..+.-+-+++||+|.....+ +.|.| + T Consensus 75 ~~~~~y~~~~~~~~~~~~~~skfg~~~-~de~~~~~~~~~~~~~~~~~~~~-----------------~~~~~------p 130 (390) T protein:vir:10 75 LSVRAYVNNVEGWEGQGDLLSKFGVRI-EDKTTFIFSRKKFTTAVDDNAVL-----------------NVEGR------P 130 (390) T ss_pred eEEEEEeechhccCCccceeeecCcee-cceEEEEECCcchhhhhCCcccc-----------------cccCC------C Confidence 88888877632 1110 000 00001111112222222111000 01111 1 Q ss_pred HhhHhh-cCCCCCce-eEeeeccchhhhhhhcchhcccChHHHHHhhhhhccCCC Q lcl|NC_016655. 1274 TSGRTL-FLQSDIDD-AWWVRPVGDLIEDVLPTYNRQSNPLREITCQFVQVAPAE 1326 (1326) Q Consensus 1274 ~~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1326 (1326) ..|..+ ||-...-+ -.|+++-- |-|.+.+|..|++.|++++..-.+ T Consensus 131 ~egdliy~p~~~~lfei~~ve~~~-------p~yq~G~nyt~~i~a~lf~ySge~ 178 (390) T protein:vir:10 131 NEGDLIWFPATRHLFEIKFVEAER-------PFYQLGKGYVWECQCELFEYSDED 178 (390) T ss_pred CCCceEEecCCCCEEEEEecCCCC-------CceEccCceeeeeEEeeeccCCcc Confidence 112111 22211111 23433322 557788899999999987654443 No 3 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=63.22 E-value=0.012 Score=31.08 Aligned_cols=162 Identities=14% Similarity=0.040 Sum_probs=76.4 Q ss_pred ceEEEEEecc----cccccccccccce-----e-EEecCc-----cccccc---cCCcC----------CEEEEEcCCeE Q lcl|NC_016655. 1152 SGTALYTDHA----PIPLTNRKYRART-----V-SLGLGG-----DLFVSE---WGPDS----------PEFTYEAENYW 1203 (1326) Q Consensus 1152 ~~~~~~~~~~----~~~~~~~~~~~~~-----~-~~~~~~-----~~~~~~---~~~~~----------~~~~~~~~~~~ 1203 (1326) -=-+||-||. |+....|.|++.. + .-|.|. +..+.+ ||+|+ +...-||++|. T Consensus 1 ~~~~~~~~~~~~~m~V~~~hp~y~a~~~~W~~~~d~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~ 80 (488) T protein:vir:96 1 MLKCLYIKHRGFFMLTPIYHPDYLVNAPQWLRNLDCVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLT 80 (488) T ss_pred CceeEEEeecceeecccccCHHHHHHhhhhhHhhhhhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhh Confidence 3347888988 8888787777622 1 112221 111222 22221 11222344444 Q ss_pred EEEEEcCCCChhHHHHHHHHHHHHHHhhhhcCC-ccCCccceeecccccceeeecCCCCCHHHHHHHHHHHHhhHh---- Q lcl|NC_016655. 1204 LKDISNPNNNLRLKVAWDKTQVAKTNSATVFQP-LGSDLPVVLSEGYKGDTFTLKLTPVNHEERAELRKMLTSGRT---- 1278 (1326) Q Consensus 1204 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 1278 (1326) +.-.+.+|-=+ +..++++..+=+|-| +-.+-|=+.+ ++-+-||++.-.-+.-=|..|.+.|..||. T Consensus 81 ~~rA~~~n~~~-------~tl~~l~G~vfrk~p~~~~~~~~~l~--~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilV 151 (488) T protein:vir:96 81 WRLANYVNIVN-------PTMNAITGAVMRREPEFDTMDNPVLI--GLRDNIDGKGNGIDQECKQALNALQWGSRCGWLV 151 (488) T ss_pred hhccccCchhH-------HHHHHhcchhhccCceeccCCcHHHH--HHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEE Confidence 43333333222 222233333334444 2333221111 112228999888888999999999999976 Q ss_pred hcCCCC-------------CceeEee-eccchhhhhhhcchhcccChHHHHHhhhhhcc-CCC Q lcl|NC_016655. 1279 LFLQSD-------------IDDAWWV-RPVGDLIEDVLPTYNRQSNPLREITCQFVQVA-PAE 1326 (1326) Q Consensus 1279 ~~~~~~-------------~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 1326 (1326) -||+.. ..+.+.. +=.+||.++|=....|--=+|||.. ++. +.. T Consensus 152 D~P~~~~T~ade~~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~----~~~D~~~ 210 (488) T protein:vir:96 152 RSHPESATMADWNKGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDY----QERDGGT 210 (488) T ss_pred ecCCCcCCHHHHHHhcCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEE----EeccCCC Confidence 355421 1122222 4468888766444344445566621 111 111 No 4 >protein:vir:122 Length: 293 # NCBI annotation: lower collar protein # Family: family:all:5217 # MgeID: mge:4 # MgeName: B103 # Cross-refs: genbank:acc:NP_690645;swissprot:sw:q37892;genbank:gi:22855159;uniprot:Q37892;genbank:GeneID:955372 Probab=52.24 E-value=0.068 Score=26.98 Aligned_cols=89 Identities=12% Similarity=0.083 Sum_probs=45.1 Q ss_pred EEEEcCCCChhHH--HHHHHHHHHHHHh---hhhcCCccCCccceeecccccceeeecCCCCCHHHHHHHHHHHHhhHhh Q lcl|NC_016655. 1205 KDISNPNNNLRLK--VAWDKTQVAKTNS---ATVFQPLGSDLPVVLSEGYKGDTFTLKLTPVNHEERAELRKMLTSGRTL 1279 (1326) Q Consensus 1205 ~~~~~~~~~~~~~--~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1279 (1326) +--|. .+|| .||=-.|...|.. +++-+|-. ||..-|.-.+.-|..|..+.- |+ T Consensus 1 ~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~--~~- 58 (293) T protein:vir:12 1 MASYT----MKLSTYIEMWSQYETGLSMAEKIEKGRPKL---------------FDFQYPIFDESYRKVFETHFI--RN- 58 (293) T ss_pred Cccee----ehHHHHHHHHhhccCccchhhhhhhhhhhh---------------hhccCCcccchHHHHHHHHHH--HH- Confidence 11111 2333 4555555444443 44444411 444444422223333322110 33 Q ss_pred cCCCCCceeEeeeccchhhhhhhcchhcccChHHHHHhhhhhccCCC Q lcl|NC_016655. 1280 FLQSDIDDAWWVRPVGDLIEDVLPTYNRQSNPLREITCQFVQVAPAE 1326 (1326) Q Consensus 1280 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1326 (1326) ||-|..|..-+.++ |+++.++||-+.-.|-.+.-+| T Consensus 59 ---------~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~n~~~~s~ 94 (293) T protein:vir:12 59 ---------FYMREIGFETEGLF--KFNLETWLIINMPYFNKLFESE 94 (293) T ss_pred ---------HHHHHhhccchhHH--HHHHHHHHhhhcchhcchhhcc Confidence 56666776666555 8888888888887777777666 No 5 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=50.31 E-value=0.0036 Score=33.98 Aligned_cols=144 Identities=15% Similarity=0.001 Sum_probs=63.7 Q ss_pred ccccccccccccceeEEecC-------------ccccccccCCcCCEEEEEcCCeEEEEEEcCCCChhHHHHHHHHHHHH Q lcl|NC_016655. 1161 APIPLTNRKYRARTVSLGLG-------------GDLFVSEWGPDSPEFTYEAENYWLKDISNPNNNLRLKVAWDKTQVAK 1227 (1326) Q Consensus 1161 ~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1227 (1326) +|+....|.|.+..-.+=|- +++.+.+|.+|.+. +| +.||-..+.+|--+ +..+++ T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~---~Y-~~rl~rA~~~n~~~-------~t~~~~ 69 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDD---MY-NAYKQRALFYSITS-------KTLSAL 69 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHH---HH-HHHHhhccCCchHH-------HHHHHH Confidence 55555555554432111111 11223333333220 01 11222222222111 112222 Q ss_pred HHhhhhcCCccCCccceeecccccceeeecCCCCCHHHHHHHHHHHHhhHhh----cCCCCCceeEee-----eccchhh Q lcl|NC_016655. 1228 TNSATVFQPLGSDLPVVLSEGYKGDTFTLKLTPVNHEERAELRKMLTSGRTL----FLQSDIDDAWWV-----RPVGDLI 1298 (1326) Q Consensus 1228 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~-----~~~~~~~ 1298 (1326) +..+=+|-|.- |++-+.++ + -||++.-.-+.-=|..|.+.|..||.. ||.. +..-+|+ +=.+||+ T Consensus 70 ~G~vf~k~p~~-~~p~~l~~-~---~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~-g~rPy~~~~~~~~Ii~W~~ 143 (452) T protein:vir:94 70 SGMVLDQPPVI-THPDAMSK-Y---FEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLT-GGDPYISVYTTENILNWEE 143 (452) T ss_pred hchhhcCCcee-cccHHHHH-H---HhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccC-CCceEEEEechhhhcCccc Confidence 22333333311 44433322 1 257776666678899999999999886 7764 4433332 4569999 Q ss_pred hhhhcchhcccChHHHHHhhhhhccCCC Q lcl|NC_016655. 1299 EDVLPTYNRQSNPLREITCQFVQVAPAE 1326 (1326) Q Consensus 1299 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1326 (1326) |++= .|.--.|||..|. +..+.| T Consensus 144 ~~~g---~l~~v~lre~~~~--~d~~d~ 166 (452) T protein:vir:94 144 DEDG---RLLMVVLREFYTV--RDTADR 166 (452) T ss_pred cccC---CeeEEEEEEEEEE--ecCCCc Confidence 8752 2444466764332 222222 No 6 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=41.04 E-value=0.16 Score=24.94 Aligned_cols=68 Identities=6% Similarity=-0.018 Sum_probs=34.2 Q ss_pred ecCCCCCHHHHHHHHHHHHhhHhhcCCCCCceeEe--eeccchhhhhhhcchhcccChHHHHHhhhhhccCCC Q lcl|NC_016655. 1256 LKLTPVNHEERAELRKMLTSGRTLFLQSDIDDAWW--VRPVGDLIEDVLPTYNRQSNPLREITCQFVQVAPAE 1326 (1326) Q Consensus 1256 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1326 (1326) +|.++.+++|+.+|.+-+..|+.|-.+....-..= .=|. ...++++ ..-++.++||.+ |.++.+.-.+ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~-~~~~~ii-~~~~~~s~l~~~-~~~~~~~~~~ 70 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLN-DFTTPIL-QEVMENSKIMQL-GKYEPMEGTE 70 (324) T ss_pred CCCchHHHHHHHHHHHHhhccceecccceeccCCCcceech-hHHHHHH-HHHHhhchhhhh-cceeeccCCc Confidence 67777777888888888887755433221100000 0000 0111222 244566677775 7766654333 No 7 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=34.63 E-value=0.34 Score=23.16 Aligned_cols=65 Identities=5% Similarity=-0.005 Sum_probs=33.2 Q ss_pred ecCCCCCHHHHHHHHHHHHhhHhhcCCCC-----CceeEeeeccchhhhhhhcchhcccChHHHHHhhhhhccCCC Q lcl|NC_016655. 1256 LKLTPVNHEERAELRKMLTSGRTLFLQSD-----IDDAWWVRPVGDLIEDVLPTYNRQSNPLREITCQFVQVAPAE 1326 (1326) Q Consensus 1256 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1326 (1326) .+-++.+++|+..|..-+..++.+-.+.. ..+.-=..-....|+ .-+..++||.+ |+++.+.-.+ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~-----~~~~~s~l~~~-~~~~~~~~~~ 70 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQ-----EVMENSKIMQL-GKYEPMEGTE 70 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHH-----HHHhhcchhhh-cceeeccCCc Confidence 45556666777777777777755433321 111100000123333 44667788886 7666544332 No 8 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=30.96 E-value=0.43 Score=22.62 Aligned_cols=65 Identities=5% Similarity=0.002 Sum_probs=32.5 Q ss_pred ecCCCCCHHHHHHHHHHHHhhHhhcCCCCCceeEeeeccchhh-----hhhhcchhcccChHHHHHhhhhhccCCC Q lcl|NC_016655. 1256 LKLTPVNHEERAELRKMLTSGRTLFLQSDIDDAWWVRPVGDLI-----EDVLPTYNRQSNPLREITCQFVQVAPAE 1326 (1326) Q Consensus 1256 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1326 (1326) +|.++.+++++.+|.+-+..++.+-++....-.+ -|-+| ++++ ..-++..+||.+ |..+.+.-.+ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~----~~~lip~~~~~~ii-~~~~~~s~l~~l-~~~~~~~~~~ 70 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEK----KDGTLLNDFTTPIL-QEVMENSKIMQL-GKYEPMEGTE 70 (324) T ss_pred CCcchhhhHHHHHHHHhhhhhhhcccccccccCC----CcceechhHHHHHH-HHHHhhchhhhh-cceeeccCCc Confidence 6666777777777888888886655543211110 11111 1111 134556666664 5555443222 No 9 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=27.27 E-value=0.52 Score=22.16 Aligned_cols=65 Identities=5% Similarity=0.010 Sum_probs=31.7 Q ss_pred ecCCCCCHHHHHHHHHHHHhhHhhcCCCCC-----ceeEeeeccchhhhhhhcchhcccChHHHHHhhhhhccCCC Q lcl|NC_016655. 1256 LKLTPVNHEERAELRKMLTSGRTLFLQSDI-----DDAWWVRPVGDLIEDVLPTYNRQSNPLREITCQFVQVAPAE 1326 (1326) Q Consensus 1256 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1326 (1326) .|-++.+++|+.+|..-+..+..|=.+..- .+.-=..-....|+ .-++.++||.+ |..+.+.-.+ T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~-----~~~~~s~l~~~-~~~~~~~~~~ 70 (324) T protein:vir:99 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQ-----EVMENSKIMRL-GKYEPMEGTE 70 (324) T ss_pred CCCchHhhHHHHHHHHHhhhhhhccccceeccCCCcceechhHHHHHHH-----HHHhhchhhhh-cceeeccCCc Confidence 455566666777787777777443222211 11000001122233 44666778775 7766554332 No 10 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=27.05 E-value=0.16 Score=24.93 Aligned_cols=121 Identities=15% Similarity=0.094 Sum_probs=50.2 Q ss_pred EEEecccccccccccccceeEEecCccccccccCCcCCEEEEEcCCeEEEEEEcCCCChhHHHHHHHHHHHHHHhhhhcC Q lcl|NC_016655. 1156 LYTDHAPIPLTNRKYRARTVSLGLGGDLFVSEWGPDSPEFTYEAENYWLKDISNPNNNLRLKVAWDKTQVAKTNSATVFQ 1235 (1326) Q Consensus 1156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1235 (1326) |=.|.+-+. + +.|-|..+-.|-.||.+|= |..++-.++| ++.++|+...+..+ T Consensus 1 ~~~~~~~~~---------~----------~~~~~~~~~~v~~~~~~~~-------~~r~~~~~~w-~el~~y~~a~~~~~ 53 (584) T protein:vir:95 1 MSVKVAELN---------S----------LLVRDSSAQWVAYLWDRFN-------NQRRQKIEEW-KELRNYVFATDTTT 53 (584) T ss_pred CCcchhhhh---------h----------hccccchHHHHHHHHHHHH-------hhhchhhccC-HHHHHHHHhhhhhh Confidence 112221110 0 2233444444455555441 1222223567 45666666655432 Q ss_pred CccCCccceeecccccceeeecCCCCCH----HHHHHHHHHHHhh-HhhcCCCCCceeEeeeccchhhhhhhc------- Q lcl|NC_016655. 1236 PLGSDLPVVLSEGYKGDTFTLKLTPVNH----EERAELRKMLTSG-RTLFLQSDIDDAWWVRPVGDLIEDVLP------- 1303 (1326) Q Consensus 1236 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~------- 1303 (1326) + -++|.||-|+ .=|++-..|..+= +.|||+.+ |++-||-.=+++=- T Consensus 54 -----------~------~~~~~~~r~~~~~~k~~~~~~~i~~~l~~~~Fp~~~-----w~~~v~~~~~~~~~~~~~ai~ 111 (584) T protein:vir:95 54 -----------T------SNQGLPWKNSTTLPKLCQIRDNLHSNYFSSLFPNDD-----WLRWVGYGKGDSTKTKAKAIQ 111 (584) T ss_pred -----------h------hhcccccccccchhHHHHHHHHHHHHHHHhhcCccc-----eeeeecCCCchhhHHHHHHHH Confidence 2 2445555553 2233322333332 78999964 78777755444311 Q ss_pred ----chhcccChHHHHHhhhhhccCCC Q lcl|NC_016655. 1304 ----TYNRQSNPLREITCQFVQVAPAE 1326 (1326) Q Consensus 1304 ----~~~~~~~~~~~~~~~~~~~~~~~ 1326 (1326) .+-.++ .+|++.-.|+.-++-- T Consensus 112 ~~i~dkl~e~-~~~~~~~~~i~d~~~~ 137 (584) T protein:vir:95 112 AYMSNKCRES-HFRTEVSKLIYDYIDY 137 (584) T ss_pred HHHhhhhhhc-cHHHHHHHHHHhhccC Confidence 111111 2233333333222211 No 11 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=26.38 E-value=2 Score=18.99 Aligned_cols=194 Identities=12% Similarity=0.009 Sum_probs=70.9 Q ss_pred ccccccccccccccceeecccceec--ccccccCCceEEEEEecccccccccccc----cceeE-EecCccccccccCCc Q lcl|NC_016655. 1119 MPQIQYADDDGTGYSDWADLPGLKS--NPPAFEPRSGTALYTDHAPIPLTNRKYR----ARTVS-LGLGGDLFVSEWGPD 1191 (1326) Q Consensus 1119 ~~~~~~a~~~~~g~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~-~~~~~~~~~~~~~~~ 1191 (1326) |- .. ++++ |+ -||..++ -....+.++-.+=+.++.|+....-+|- ...+. .|=|.+ +.|.|.+ T Consensus 1 Ma--t~-tt~~-g~----~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~--~~~~~~~ 70 (311) T protein:vir:99 1 MA--TF-GTGN-LK----NLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPKAEFVGEGQQ--KSSTTGE 70 (311) T ss_pred Cc--ee-cCCC-ce----eccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEEeecCcc--cccccce Confidence 11 11 1121 22 1343222 1122333444444444444432221110 01111 122333 4455555 Q ss_pred CCEEEEEcCCeEEEEEEcCCCChhHHHHH-H--HHHHHHHHhhhhc-----------------CC--ccCCccc----ee Q lcl|NC_016655. 1192 SPEFTYEAENYWLKDISNPNNNLRLKVAW-D--KTQVAKTNSATVF-----------------QP--LGSDLPV----VL 1245 (1326) Q Consensus 1192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~~~~~~~~~~~~~-----------------~~--~~~~~~~----~~ 1245 (1326) =..++++. +.+.+++|=+.+ |.+.+ | .+|.+||++.-.+ ++ +-|..|. .+ T Consensus 71 f~~v~l~~---~k~~~~~~iS~e-ll~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~ 146 (311) T protein:vir:99 71 FDFVTSTP---KKAQVTMRFNEE-VQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASK 146 (311) T ss_pred eeEEEEee---EEEEEeehhhHH-HhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccc Confidence 55566654 555667776654 44444 3 3466666532221 11 1222221 01 Q ss_pred ec----cccc---ceee----e-----cCC-----CCCHHHHHHHHHHHHh-hHhhcCC---CCCceeEeeeccchh--- Q lcl|NC_016655. 1246 SE----GYKG---DTFT----L-----KLT-----PVNHEERAELRKMLTS-GRTLFLQ---SDIDDAWWVRPVGDL--- 1297 (1326) Q Consensus 1246 ~~----~~~~---~~~~----~-----~~~-----~~~~~~~~~~~~~~~~-~~~~~~~---~~~~~~~~~~~~~~~--- 1297 (1326) .. ..+. .-|+ | ++- -.|+.-+..|++|.+. ||+||+. ....+.+|-+||..- T Consensus 147 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i 226 (311) T protein:vir:99 147 RVELTADTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGIDASVSDTV 226 (311) T ss_pred eeeccccccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecceeeEeeccc Confidence 10 0000 0000 0 011 1234666778887765 7999865 334578998887541 Q ss_pred -------hhh---hhcc--hhcccC---h-----HHHHHhhhhhccCCC Q lcl|NC_016655. 1298 -------IED---VLPT--YNRQSN---P-----LREITCQFVQVAPAE 1326 (1326) Q Consensus 1298 -------~~~---~~~~--~~~~~~---~-----~~~~~~~~~~~~~~~ 1326 (1326) .|. ++-. +.+.++ . .|++.-+..+.+-.+ T Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~ 275 (311) T protein:vir:99 227 NGGDEADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPD 275 (311) T ss_pred ccccccccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCC Confidence 010 0000 001111 0 011211111111000 No 12 >protein:vir:18 Length: 287 # NCBI annotation: lower collar protein # Family: family:all:5217 # MgeID: mge:323 # MgeName: GA-1 # Cross-refs: genbank:acc:NP_073694;swissprot:trembl:q9fzw4;genbank:gi:12248118;uniprot:Q9FZW4;genbank:GeneID:919889 Probab=25.38 E-value=0.33 Score=23.26 Aligned_cols=86 Identities=12% Similarity=0.100 Sum_probs=44.3 Q ss_pred EEEEcCCCChhHH--HHHHHHHHHHHH---hhhhcCCccCCccceeecccccceeeecCCCCCHHHHHHHHHHHHhh--H Q lcl|NC_016655. 1205 KDISNPNNNLRLK--VAWDKTQVAKTN---SATVFQPLGSDLPVVLSEGYKGDTFTLKLTPVNHEERAELRKMLTSG--R 1277 (1326) Q Consensus 1205 ~~~~~~~~~~~~~--~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 1277 (1326) +--|. .+|| .+|=-+|...|. ++++-+|-.=|| |-+ | -..|+|.|+.= | T Consensus 1 ~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~y~------~-----------~~~~~~~~~~~~~~ 56 (287) T protein:vir:18 1 MASFT----MPLREIVEWATQFDNKLTRNEKIEEGRKKLFDF---FYP------I-----------ETDYKKEFETKFIK 56 (287) T ss_pred Cccee----ehHHHHHHHHhhhcccchhhHHHhhhhhhhhhh---cCc------c-----------chHHHHHHHHHHHH Confidence 11111 2343 567666665553 344445522233 332 4 23566665543 3 Q ss_pred hhcCCCCCceeEeeeccchhhhhhhcchhcccChHHHHHhhhhhccCCC Q lcl|NC_016655. 1278 TLFLQSDIDDAWWVRPVGDLIEDVLPTYNRQSNPLREITCQFVQVAPAE 1326 (1326) Q Consensus 1278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1326 (1326) + ||-|+.|..++..+ |+++...|+-+.-||-.+--.| T Consensus 57 ~----------~~~~~i~~~~~~~~--~~~~~~~~~~~mp~~n~~~ese 93 (287) T protein:vir:18 57 H----------FYFREIGFETEGRF--KFALEEWLNLNMPYWNKIIEST 93 (287) T ss_pred H----------HHHHHHhhhhHHHH--HHHHHHHHHhhcchhhhhHhhh Confidence 3 45666775555444 6667777777776666655555 No 13 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=25.32 E-value=0.54 Score=22.06 Aligned_cols=69 Identities=4% Similarity=-0.029 Sum_probs=29.1 Q ss_pred ecCCCCCHHHHHHHHHHHHhhHhhcCCCCCceeEeeeccch-hhhhhhcchhcccChHHHHHhhhhhccCCC Q lcl|NC_016655. 1256 LKLTPVNHEERAELRKMLTSGRTLFLQSDIDDAWWVRPVGD-LIEDVLPTYNRQSNPLREITCQFVQVAPAE 1326 (1326) Q Consensus 1256 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1326 (1326) ++.-+.++.|..+|..-+..+|+|-++..-.-.+=..-+-. ..++++ ..-+..++||.+ |..+++.-.+ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii-~~~~~~s~l~~l-~~~~~~~~~~ 70 (324) T protein:vir:93 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPIL-QEVMENSKIMQL-GKYEPMEGTE 70 (324) T ss_pred CchhHHHHHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHH-HHHHhhchhhhh-cceeeccCCc Confidence 22222334444467777777777655432111110000001 111111 244566677775 6666543333 No 14 >protein:vir:106986 Length: 292 # NCBI annotation: neck protein gp14 # Family: family:all:1104 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195128;genbank:gi:58532905;uniprot:Q5GQV9;genbank:GeneID:3260483 Probab=25.03 E-value=0.061 Score=27.24 Aligned_cols=123 Identities=15% Similarity=0.180 Sum_probs=56.7 Q ss_pred ccceecccccccCCc------------------eEEEEEecccccccccccccceeEEecCccccccccCCcCCEEEEEc Q lcl|NC_016655. 1138 LPGLKSNPPAFEPRS------------------GTALYTDHAPIPLTNRKYRARTVSLGLGGDLFVSEWGPDSPEFTYEA 1199 (1326) Q Consensus 1138 ~~~~~~~~~~~~~~~------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1199 (1326) ||-...+..-+.||. --..|-+++=++ |. |. |...-=+| T Consensus 1 m~~npyfn~~~~~~~~eQ~L~~~LV~Esiq~~G~dvyYlpRe~~~-----------------d~-~~-----~E~~~skF 57 (292) T protein:vir:10 1 MPTSPYFPSYYSGYSGEQNLVQDLVDEQIKLFGTDIYYLPRTILR-----------------DN-TL-----DDVIYNKF 57 (292) T ss_pred CCcCccccccccCcCchhHHHHHHHHHHHHhcCceEEEechhhhc-----------------cc-cc-----cccccccc Confidence 432222111122222 223333333221 11 11 22233468 Q ss_pred CCeEEEEEEcCCCC-----hhH-----------------HHHHHHHHHHHHHhhhhcCCccCCccceeecccccceeeec Q lcl|NC_016655. 1200 ENYWLKDISNPNNN-----LRL-----------------KVAWDKTQVAKTNSATVFQPLGSDLPVVLSEGYKGDTFTLK 1257 (1326) Q Consensus 1200 ~~~~~~~~~~~~~~-----~~~-----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1257 (1326) .+-|-+.+|+.|-. .+| |+||+..--.+-+ --+++|.-||| |=+ T Consensus 58 ~~a~~ieaY~~~~eg~~g~~~~~sKFG~~~~De~t~~is~~~f~~~~~~~~~-~~~~~P~eGDL------------IYf- 123 (292) T protein:vir:10 58 ERQFQVEMLLQNVEGFGSPSEFISKFGLRITDEVRFIVSQRRWDEEAVNYDL-NVNGRPNEGDL------------LYF- 123 (292) T ss_pred ccceeEEEEeechhccCCCcceeeecCceecceEEEEEccchhhhhcCcccc-cccCCCccccE------------EEE- Confidence 88999999998832 111 4667652111111 11235555555 222 Q ss_pred CCCCCHHHHHHHHHHHHhhHhhcCCCCCceeEeeeccchhhhhhhcchhcccChHHHHHhhhhhccCCC Q lcl|NC_016655. 1258 LTPVNHEERAELRKMLTSGRTLFLQSDIDDAWWVRPVGDLIEDVLPTYNRQSNPLREITCQFVQVAPAE 1326 (1326) Q Consensus 1258 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1326 (1326) |-. +.||.=+ |+ |+--|-|.+..|..||+.|+..+.+-.+ T Consensus 124 -Pl~---------------~~lFEI~------~v-------e~~~PfyQ~gk~~~~~l~~~~F~Ys~E~ 163 (292) T protein:vir:10 124 -PLT---------------QDIYEIK------FV-------EREDPFYQLGKNYFYIMTAEIYEYGSDN 163 (292) T ss_pred -cCC---------------CcEEEEE------cc-------cCCCchhhhCCceEEEEEEEEEeecCce Confidence 111 1223221 21 2223558888899999999988776555 No 15 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=24.72 E-value=1.5 Score=19.60 Aligned_cols=106 Identities=13% Similarity=0.046 Sum_probs=35.2 Q ss_pred CCCChhHHHHHHHHH----------------------------------HHHHHhh----hhcCC-ccCCccceeecccc Q lcl|NC_016655. 1210 PNNNLRLKVAWDKTQ----------------------------------VAKTNSA----TVFQP-LGSDLPVVLSEGYK 1250 (1326) Q Consensus 1210 ~~~~~~~~~~~~~~~----------------------------------~~~~~~~----~~~~~-~~~~~~~~~~~~~~ 1250 (1326) =..-.|||++|++.- .+.++.+ +..+- ....++..+.+. T Consensus 1 Mk~~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 78 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKP-- 78 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc-- Confidence 111133333332110 0000111 11111 123333333321 Q ss_pred cceeeecCCCCCHHHHHHHHHHHHhhH-hhc------CCCCCceeEeeeccchhhhhhhcchhcccChHHHHHhhhhhcc Q lcl|NC_016655. 1251 GDTFTLKLTPVNHEERAELRKMLTSGR-TLF------LQSDIDDAWWVRPVGDLIEDVLPTYNRQSNPLREITCQFVQVA 1323 (1326) Q Consensus 1251 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1323 (1326) +.........+||..|.++|..+. ..+ ....+.|.-=..-....|+ .-+...+||++ |..+.+- T Consensus 79 ---~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~-----~~~~~~~l~~~-~~~~~~~ 149 (397) T protein:vir:49 79 ---LTKNEEEVKANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINT-----LVRQFDSLQEY-VNVENVT 149 (397) T ss_pred ---ccchhhHHHHHHHHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHH-----HHHhhhhHhhh-cceeecc Confidence 111112222478888999987761 110 0111111100000112233 44556677765 5544333 Q ss_pred CCC Q lcl|NC_016655. 1324 PAE 1326 (1326) Q Consensus 1324 ~~~ 1326 (1326) ... T Consensus 150 ~~~ 152 (397) T protein:vir:49 150 TLT 152 (397) T ss_pred CCc Confidence 222 No 16 >protein:vir:5206 Length: 293 # NCBI annotation: lower collar protein # Family: family:all:5217 # MgeID: mge:116 # MgeName: PZA # Cross-refs: genbank:acc:NP_040729;genbank:gi:9626400;genbank:GeneID:1260971 Probab=21.30 E-value=0.58 Score=21.89 Aligned_cols=88 Identities=13% Similarity=0.180 Sum_probs=38.7 Q ss_pred EEEEcCCCChhHH--HHHHHHHHHHH---HhhhhcCCccCCccceeecccccceeeecCCCCCHHHHHHHHH-HHHhhHh Q lcl|NC_016655. 1205 KDISNPNNNLRLK--VAWDKTQVAKT---NSATVFQPLGSDLPVVLSEGYKGDTFTLKLTPVNHEERAELRK-MLTSGRT 1278 (1326) Q Consensus 1205 ~~~~~~~~~~~~~--~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 1278 (1326) +--|. .+|| .||=-+|...| .++|+-+|-. ||.--|.-.+.-|..|.. || |+ T Consensus 1 ~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~---~~ 58 (293) T protein:vir:52 1 MSSYT----MQLRTYIEMWSQGETGLSTAEKIEKGRPKL---------------FDFNYPIFDESYRTIFETHFI---RN 58 (293) T ss_pred Cccee----ehHhHHHhhhhhcCCcccccchhhhhhhhh---------------hccCCCccchhHHHHHHHHHH---HH Confidence 11111 2333 45544465444 3344434411 343344433333444432 22 33 Q ss_pred hcCCCCCceeEeeeccchhhhhhhcchhcccChHHH--------HHhhhhhccCCC Q lcl|NC_016655. 1279 LFLQSDIDDAWWVRPVGDLIEDVLPTYNRQSNPLRE--------ITCQFVQVAPAE 1326 (1326) Q Consensus 1279 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~ 1326 (1326) ||-|..|..-+.++ |++++..||- +.||....-|++ T Consensus 59 ----------~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~n~~~~s~~~nt~p~d 102 (293) T protein:vir:52 59 ----------FYMREIGFETEGLF--KFHLETWLMINMPYFNKLFESELIKYDPLE 102 (293) T ss_pred ----------HHHHHhhccchHHH--HHHHHHHHhhhcccccccccccccccCCcc Confidence 34444444444333 4555555543 457777788888 No 17 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=20.69 E-value=0.69 Score=21.46 Aligned_cols=100 Identities=13% Similarity=0.152 Sum_probs=28.6 Q ss_pred EEEcCCCCh-----------hHHHHHHHHHHHHHHhhhhcCCccCCccceeecccccceeeecCCCCC-HHHHH------ Q lcl|NC_016655. 1206 DISNPNNNL-----------RLKVAWDKTQVAKTNSATVFQPLGSDLPVVLSEGYKGDTFTLKLTPVN-HEERA------ 1267 (1326) Q Consensus 1206 ~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~------ 1267 (1326) -|+.|...+ +|-++|++... -+++|++. .=|+ |. | ++.++.+ .+++. T Consensus 1 ~~~~p~~~l~~~~~~~~~~~~l~~~~~~~~~-r~~~~~~Y--Y~g~----~~-------i-~~~~~~~~~~~~~~~~~~~ 65 (479) T protein:vir:99 1 MIDLPDEDLSSEGLAKYLETKVFPKMNTECE-RLDDFEAW--TKNG----QE-------V-PDLATRHKNKEREVLQQLS 65 (479) T ss_pred CccCCcccCChhHHHHHHHHHHHHHHHHHhH-HHHHHHHH--HhcC----Cc-------c-cccccccCChhHHHHHHHh Confidence 567776541 12234544332 12222211 1121 11 1 1111111 12222 Q ss_pred --HHHHHH-Hhh-HhhcCCCCC---------ceeEeeeccchhhhhhhcchhcccChHHHHHhhhhhccCCC Q lcl|NC_016655. 1268 --ELRKML-TSG-RTLFLQSDI---------DDAWWVRPVGDLIEDVLPTYNRQSNPLREITCQFVQVAPAE 1326 (1326) Q Consensus 1268 --~~~~~~-~~~-~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1326 (1326) .|-|++ ++= .+|+|+... .-.||.+| ++|++.....+. +--.-|-|+-|-|.+ T Consensus 66 ~~n~~~~iVd~~~~~l~~~gf~~~d~~~~~~~~~i~~~N---~~d~~~~~~~~~---a~~~G~af~~v~~~~ 131 (479) T protein:vir:99 66 RKPWMGLMVNSFAQQLIVDGYRKTGTNENAKGWDTWRLN---QMDKQQFWLNRA---VLTFGYAFIKVTSGI 131 (479) T ss_pred hcCcHHHHHHHHHhhcccccccCCCchhhHHHHHHHHhc---ChhHHHHHHHHH---HhhcCceEEEEecCC Confidence 233322 222 345655321 11244333 122111100000 000114444443311 Done!