Query lcl|NC_019419.2_cdsid_YP_006990380.1 [gene=JL1_53] [protein=hypothetical protein] [protein_id=YP_006990380.1] [location=complement(37282..37812)] Match_columns 176 No_of_seqs 70 out of 74 Neff 5.5 Searched_HMMs 1612 Date Thu Nov 7 19:20:45 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_53 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_53_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:80389 Length: 172 100.0 3.1E-52 1.9E-55 303.0 15.5 157 1-176 1-172 (172) 2 protein:vir:78383 Length: 169 100.0 4.1E-51 2.5E-54 296.9 14.4 157 1-176 1-169 (169) 3 protein:vir:95004 Length: 169 100.0 4E-51 2.5E-54 296.9 13.9 158 1-176 1-169 (169) 4 protein:vir:95176 Length: 172 100.0 1.1E-50 6.8E-54 294.5 15.1 159 1-176 1-172 (172) 5 protein:vir:94955 Length: 170 100.0 6.6E-50 4.1E-53 290.2 14.5 155 1-176 1-170 (170) 6 protein:vir:97267 Length: 172 100.0 7E-46 4.3E-49 268.2 14.0 154 1-176 1-172 (172) 7 protein:vir:80967 Length: 131 98.5 1.9E-09 1.2E-12 68.3 10.1 126 1-176 1-127 (131) 8 protein:vir:43 Length: 131 # N 98.5 2.4E-09 1.5E-12 67.8 9.9 125 1-176 1-127 (131) 9 protein:vir:98900 Length: 132 98.4 6.7E-09 4.1E-12 65.4 10.2 126 1-176 1-128 (132) 10 protein:vir:9576 Length: 131 # 95.6 0.00063 3.9E-07 38.1 10.6 125 9-176 1-130 (131) 11 protein:vir:4788 Length: 130 # 95.3 0.00034 2.1E-07 39.6 8.3 122 1-176 1-125 (130) 12 protein:vir:99002 Length: 158 95.2 0.00053 3.3E-07 38.5 9.0 122 9-176 1-124 (158) 13 protein:vir:1640 Length: 132 # 94.5 0.0016 9.7E-07 36.0 9.9 126 1-176 1-131 (132) 14 protein:vir:2505 Length: 128 # 93.8 0.00088 5.5E-07 37.3 7.0 118 6-176 1-123 (128) 15 protein:vir:94761 Length: 132 93.7 0.0036 2.2E-06 34.0 10.1 126 9-176 1-131 (132) 16 protein:vir:9761 Length: 140 # 93.4 0.0037 2.3E-06 33.9 9.8 123 1-176 1-129 (140) 17 protein:vir:79701 Length: 144 92.6 0.0033 2E-06 34.2 8.3 130 9-176 1-140 (144) 18 protein:vir:9821 Length: 138 # 92.0 0.0026 1.6E-06 34.7 7.0 127 1-176 1-133 (138) 19 protein:vir:101652 Length: 188 90.6 0.014 8.4E-06 30.8 9.5 140 15-166 1-188 (188) 20 protein:vir:7857 Length: 188 # 90.6 0.014 8.4E-06 30.8 9.5 140 15-166 1-188 (188) 21 protein:vir:80320 Length: 188 81.6 0.084 5.2E-05 26.5 8.8 138 1-166 1-188 (188) 22 protein:vir:1435 Length: 188 # 79.5 0.1 6.5E-05 26.0 9.2 137 1-166 1-188 (188) 23 protein:vir:80036 Length: 111 77.9 0.054 3.4E-05 27.5 6.5 108 11-174 1-111 (111) 24 protein:vir:103283 Length: 125 74.2 0.07 4.3E-05 26.9 6.1 89 64-176 1-123 (125) 25 protein:vir:107702 Length: 136 71.6 0.19 0.00012 24.5 8.0 122 6-176 1-134 (136) 26 protein:vir:101559 Length: 158 71.4 0.13 7.8E-05 25.5 6.8 104 61-176 1-149 (158) 27 protein:vir:3639 Length: 158 # 71.4 0.13 7.8E-05 25.5 6.8 104 61-176 1-149 (158) 28 protein:vir:99848 Length: 172 68.9 0.026 1.6E-05 29.3 2.4 115 9-160 1-172 (172) 29 protein:vir:107756 Length: 147 62.0 0.34 0.00021 23.1 10.6 126 1-176 1-143 (147) 30 protein:vir:78595 Length: 158 61.3 0.35 0.00022 23.1 7.1 104 61-176 1-149 (158) 31 protein:vir:106739 Length: 158 61.3 0.35 0.00022 23.1 7.1 104 61-176 1-149 (158) 32 protein:vir:107864 Length: 150 52.9 0.54 0.00034 22.0 7.0 118 10-176 1-149 (150) 33 protein:vir:104344 Length: 132 51.1 0.59 0.00037 21.8 6.9 118 8-176 1-129 (132) 34 protein:vir:79640 Length: 134 48.6 0.66 0.00041 21.6 7.0 117 8-176 1-130 (134) 35 protein:vir:79074 Length: 150 47.5 0.7 0.00043 21.4 8.9 118 10-176 1-149 (150) 36 protein:vir:1993 Length: 141 # 47.1 0.71 0.00044 21.4 7.7 122 10-176 1-139 (141) 37 protein:vir:103846 Length: 138 45.4 0.77 0.00048 21.2 8.6 111 10-176 1-137 (138) 38 protein:vir:79253 Length: 138 43.3 0.85 0.00053 21.0 6.9 111 10-176 1-130 (138) 39 protein:vir:99222 Length: 138 43.3 0.85 0.00053 21.0 6.9 111 10-176 1-130 (138) 40 protein:vir:486 Length: 107 # 41.4 0.84 0.00052 21.0 5.8 102 11-166 1-107 (107) 41 protein:vir:5256 Length: 119 # 32.7 1.4 0.00087 19.8 10.4 109 1-174 1-119 (119) 42 protein:vir:96108 Length: 155 32.6 1.4 0.00087 19.8 7.2 98 78-176 1-152 (155) 43 protein:vir:99570 Length: 153 31.8 1.5 0.00091 19.7 10.7 128 1-176 1-150 (153) 44 protein:vir:4512 Length: 107 # 25.3 2.1 0.0013 18.8 6.1 105 11-166 1-107 (107) 45 protein:vir:98481 Length: 136 23.6 2.3 0.0014 18.6 7.1 112 1-176 1-121 (136) No 1 >protein:vir:80389 Length: 172 # NCBI annotation: BcepGomrgp09 # Family: family:all:703 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210229;genbank:gi:146329921;genbank:GeneID:5123498 Probab=100.00 E-value=3.1e-52 Score=303.00 Aligned_cols=157 Identities=24% Similarity=0.331 Sum_probs=136.5 Q ss_pred CcceeecC-----ccccccHHHHHHHHHhcCCccChhhhhHHHHHHHHHHhhh--cccceeCCCccccccccCCcccCcc Q lcl|NC_019419. 1 MPAFFIGV-----NTMYGDPQTFVDYAAARGVEVTLSDATRHLTVVNDFLNGI--NWIGEPADQTGIDAWPRINYQSDGK 73 (176) Q Consensus 1 ~~~~~~~~-----~~~Y~sva~adaY~a~rg~~~~~~~ke~aLi~As~~ld~~--~~~G~r~~~~Q~laWPR~G~~~~g~ 73 (176) |+...=.. -.+|+|+++|++||++||+++|+++||++|++|+||||++ +|+|+|++++|+|+|||+|+.+||. T Consensus 1 Malived~~g~~~anSYvt~~~a~aY~~~rg~~~~~d~~e~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~g~ 80 (172) T protein:vir:80 1 MALIVEDGTGKPDANTYAGADFVIAYAQARGVTVDADEAERLILEAMDYIESFRRRWKGERNTREQGLTWPRHDAVVDGF 80 (172) T ss_pred CeeEeeCCCCCccccccccHHHHHHHHHHcCCCcCHHHHHHHHHHHHHHHhhccCccccccCCccccccccccCcccCcc Confidence 65322111 2689999999999999999999999999999999999995 6999999999999999999999887 Q ss_pred cccccccccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEcCeeEEeecCCCC--------c Q lcl|NC_019419. 74 PVRDTLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETVGPITMEYDPATI--------G 145 (176) Q Consensus 74 ~~~~~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekVG~I~veY~~~~~--------~ 145 (176) ++ +++.||.+||+||||||+++++++++.+...+..||+||||+||+||+.+.+ + T Consensus 81 ~~-----------------~~~~IP~~v~~A~~elA~~~~~g~~~~~~~~~~~v~~ekVG~i~~eY~~~~~~~~~~~~~~ 143 (172) T protein:vir:80 81 VI-----------------PSDVIPKELQSAVAAAVIEQVNGFELQQSQDQWAVRIEKVDVIEVQYAAGGGGQSASANAP 143 (172) T ss_pred cc-----------------cccchhHHHHHHHHHHHHHHhcCCccCcCCCCceeeEEeccceEEeeecccCccccccccC Confidence 76 5578999999999999999999888888888888999999999999975433 2 Q ss_pred ccchHHHHHHHHhhhhccCCcccceeeeecC Q lcl|NC_019419. 146 SGVSFPWWDGLLGHWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 146 ~~~~~~~v~~lL~~~l~~~g~~~~~~~v~RG 176 (176) ..|+||+|++||+|||+++|+ .+++++|| T Consensus 144 ~~~~~~~v~~LL~p~l~~~gg--~~~~~vrg 172 (172) T protein:vir:80 144 MKPTFPKIDALLNPLLVGDGG--LFLVAVRG 172 (172) T ss_pred CccchHHHHHHHhhhhcCCCC--eeeeeecC Confidence 468999999999999987543 46889999 No 2 >protein:vir:78383 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110841;genbank:gi:134288602;genbank:GeneID:5179646 Probab=100.00 E-value=4.1e-51 Score=296.85 Aligned_cols=157 Identities=26% Similarity=0.276 Sum_probs=136.7 Q ss_pred Ccceeec------CccccccHHHHHHHHHhcCCccCh--hhhhHHHHHHHHHHhhh--cccceeCCCccccccccCCccc Q lcl|NC_019419. 1 MPAFFIG------VNTMYGDPQTFVDYAAARGVEVTL--SDATRHLTVVNDFLNGI--NWIGEPADQTGIDAWPRINYQS 70 (176) Q Consensus 1 ~~~~~~~------~~~~Y~sva~adaY~a~rg~~~~~--~~ke~aLi~As~~ld~~--~~~G~r~~~~Q~laWPR~G~~~ 70 (176) |+.. |. -..+|+|+++|++||++||+++|. ++||++|++|++|||++ +|+|+|++++|+|+|||+|+.+ T Consensus 1 Mali-V~~~~g~~~anSYvtv~~a~aY~~~rg~~~~~d~~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~ 79 (169) T protein:vir:78 1 MPLI-VETGQGIPNADSYVSLEDGRALAAKYGLELPEDDTAAEAALRNGAVYVGLFESQMCGRRVSANQALAFPRTGVTL 79 (169) T ss_pred CeeE-eeCCCCCccccccccHHHHHHHHHHcCCcCCCChHHHHHHHHHHHHHhhhccccceeeeCCcccccccccCCcee Confidence 7643 21 145899999999999999998874 45899999999999986 8999999999999999999999 Q ss_pred CcccccccccccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEc-CeeEEeecCCC-Ccccc Q lcl|NC_019419. 71 DGKPVRDTLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETV-GPITMEYDPAT-IGSGV 148 (176) Q Consensus 71 ~g~~~~~~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekV-G~I~veY~~~~-~~~~~ 148 (176) +|.+++ ++.||.+||+||||||+++++++++.+...+++|++|+| |+|||||+.++ .++.| T Consensus 80 ~g~~~~-----------------~~~IP~~v~~A~~elA~~~~~g~~~~~~~~~~~v~~e~v~G~i~veY~~~~~~~~~~ 142 (169) T protein:vir:78 80 HGFPQP-----------------SNVIPPLVIQAQVMAAVEYGAGTDVRGSTDGREVQTERVEGAVTVSYFKNGYSGGTV 142 (169) T ss_pred cccccc-----------------cccchHHHHHHHHHHHHHHhcCcccCCCCCcceeEEEEecCceeEeecCCCCCCCcc Confidence 988864 578999999999999999999988888888888998887 99999997764 45689 Q ss_pred hHHHHHHHHhhhhccCCcccceeeeecC Q lcl|NC_019419. 149 SFPWWDGLLGHWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 149 ~~~~v~~lL~~~l~~~g~~~~~~~v~RG 176 (176) +||++++||+|||+++ ++..+|+|+|| T Consensus 143 ~~~~~~~LL~p~l~~~-~g~~~i~~~rg 169 (169) T protein:vir:78 143 SITTADDALRPLLCGS-NNAYSFNVFRG 169 (169) T ss_pred cHHHHHHHhhhhcccC-CCcceeeeecC Confidence 9999999999999864 45578999999 No 3 >protein:vir:95004 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224025;genbank:gi:62327312;genbank:GeneID:5176832 Probab=100.00 E-value=4e-51 Score=296.91 Aligned_cols=158 Identities=26% Similarity=0.268 Sum_probs=136.1 Q ss_pred Ccceeec-----CccccccHHHHHHHHHhcCCccChh--hhhHHHHHHHHHHhhh--cccceeCCCccccccccCCcccC Q lcl|NC_019419. 1 MPAFFIG-----VNTMYGDPQTFVDYAAARGVEVTLS--DATRHLTVVNDFLNGI--NWIGEPADQTGIDAWPRINYQSD 71 (176) Q Consensus 1 ~~~~~~~-----~~~~Y~sva~adaY~a~rg~~~~~~--~ke~aLi~As~~ld~~--~~~G~r~~~~Q~laWPR~G~~~~ 71 (176) |+...=. -..+|+|+++|++||++||+++|++ +||++|++|++|||++ +|+|+|++++|+|+|||+|+.++ T Consensus 1 M~liv~~~~g~~~anSYvt~~ea~aY~~~rg~~~~~dd~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~ 80 (169) T protein:vir:95 1 MPLIVETGQGLPNADSYVSLEDGRALAAKYGLELPEDDIAAEASLRNGAVYVGLFESQMCGRRVSANQALAFPRTGIDLH 80 (169) T ss_pred CeeEEeCCCCCCcccccccHHHHHHHHHHcCCcCCCCHHHHHHHHHHHHHHhhccccccccccCCcchhhccccCCceec Confidence 6533211 1458999999999999999998865 5899999999999986 89999999999999999999988 Q ss_pred cccccccccccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEc-CeeEEeecCCCC-cccch Q lcl|NC_019419. 72 GKPVRDTLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETV-GPITMEYDPATI-GSGVS 149 (176) Q Consensus 72 g~~~~~~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekV-G~I~veY~~~~~-~~~~~ 149 (176) +.++ +++.||.+||+||||||++++++++..+...++.|++|++ |+||+||+.++. +++|+ T Consensus 81 g~~~-----------------~~~~IP~~V~~A~~elA~~~~~g~~~~~~~~~~~v~~e~v~G~i~veY~~~~~~~~~~~ 143 (169) T protein:vir:95 81 GFPQ-----------------PSNVIPSLVIQAQVMAAVEYGAGTDVRGSTDGREVQTERVEGAVTVSYFKNGYSGGTVS 143 (169) T ss_pred cccc-----------------ccccchHHHHHHHHHHHHHHHcCccccCCCCccceeeeeeccceeEeecCCCCcCcccc Confidence 8776 4578999999999999999999988888777778888766 999999976544 56899 Q ss_pred HHHHHHHHhhhhccCCcccceeeeecC Q lcl|NC_019419. 150 FPWWDGLLGHWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 150 ~~~v~~lL~~~l~~~g~~~~~~~v~RG 176 (176) ||++++||+|||+++ ++..+|+|+|| T Consensus 144 ~~a~~~LL~p~l~g~-~g~~~i~~~rg 169 (169) T protein:vir:95 144 ITAADDALRPLLCGS-NNAYSFNVFRG 169 (169) T ss_pred HHHHHHhhhhhcccC-CCcceeeeecC Confidence 999999999999864 45578999999 No 4 >protein:vir:95176 Length: 172 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:703 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293420;genbank:gi:148912841;genbank:GeneID:5228251 Probab=100.00 E-value=1.1e-50 Score=294.47 Aligned_cols=159 Identities=25% Similarity=0.322 Sum_probs=140.2 Q ss_pred Ccceeec-------CccccccHHHHHHHHHhcCCccCh--hhhhHHHHHHHHHHhh--hcccceeCCCccccccccCCcc Q lcl|NC_019419. 1 MPAFFIG-------VNTMYGDPQTFVDYAAARGVEVTL--SDATRHLTVVNDFLNG--INWIGEPADQTGIDAWPRINYQ 69 (176) Q Consensus 1 ~~~~~~~-------~~~~Y~sva~adaY~a~rg~~~~~--~~ke~aLi~As~~ld~--~~~~G~r~~~~Q~laWPR~G~~ 69 (176) ||-++|. --.+|+|+++|++||++||+.++. ++||++|++|++|||+ ++|+|+|++++|+|+|||+|+. T Consensus 1 ~~Malive~~~g~~~anSYvtv~ea~aY~~~rg~~~~~~~~~ke~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~ 80 (172) T protein:vir:95 1 MAITIVVEDGSGVTNANSYVSVADARIYASNRGVELPLDDDELAAMLIRSTDYLEAQACRFQGKPTSTTQALQWPRTGVF 80 (172) T ss_pred CceeEEEeCCCCCCcccccccHHHHHHHHHhcCCcCCCChHHHHHHHHHHHHHhhccCCceeeeecCCcccccCCcCCcc Confidence 7766665 346899999999999999887654 4589999999999996 6999999999999999999999 Q ss_pred cCcccccccccccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCc-ceeEEEEcCeeEEeecCCCC-ccc Q lcl|NC_019419. 70 SDGKPVRDTLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSG-KETIRETVGPITMEYDPATI-GSG 147 (176) Q Consensus 70 ~~g~~~~~~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~-~~v~rekVG~I~veY~~~~~-~~~ 147 (176) +++.++ +++.||++||+||||||+++++++++.+...+ ..||+||||+|||||+.+++ ++. T Consensus 81 ~~~~~v-----------------~~~~IP~~V~~A~~elA~~~~~~~~~~~~~~~~~~vk~~kVG~I~veY~~~~~~~~~ 143 (172) T protein:vir:95 81 LNEDEV-----------------PSNVIPKSLIAAQVQLTMAINAGFDLQPNVSPQDYVTREKVGPIETEYADPLSVGIM 143 (172) T ss_pred cCcccc-----------------cccchhHHHHHHHHHHHHHHHcCccccccCCcccceeEEeccceEEeeccCCCCCCc Confidence 888776 55789999999999999999998887776544 56999999999999977544 557 Q ss_pred chHHHHHHHHhhhhccCCcccceeeeecC Q lcl|NC_019419. 148 VSFPWWDGLLGHWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 148 ~~~~~v~~lL~~~l~~~g~~~~~~~v~RG 176 (176) |+||+|++||+|||+++|++.++|||+|= T Consensus 144 ~~~~~v~~LL~p~l~~~~~~~~~~r~~r~ 172 (172) T protein:vir:95 144 PTFTAANALLAPLFGECASNKFALRTIRV 172 (172) T ss_pred ccHHHHHHHHhhhhcccCCcceeeEEEeC Confidence 99999999999999999999999999999 No 5 >protein:vir:94955 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239280;genbank:gi:66392062;genbank:GeneID:5076600 Probab=100.00 E-value=6.6e-50 Score=290.20 Aligned_cols=155 Identities=20% Similarity=0.323 Sum_probs=136.5 Q ss_pred CcceeecCc------cccccHHHHHHHHHhcC-----CccChhhhhHHHHHHHHHHhh-hcccceeCCCccccccccCCc Q lcl|NC_019419. 1 MPAFFIGVN------TMYGDPQTFVDYAAARG-----VEVTLSDATRHLTVVNDFLNG-INWIGEPADQTGIDAWPRINY 68 (176) Q Consensus 1 ~~~~~~~~~------~~Y~sva~adaY~a~rg-----~~~~~~~ke~aLi~As~~ld~-~~~~G~r~~~~Q~laWPR~G~ 68 (176) || +|..+ .+|+|++||++||+.|+ ..+++++||++|++|+||||+ ++|+|+|++++|+|+|||+|+ T Consensus 1 m~--~i~~~~g~~~AnSYvtv~ea~aY~~~r~~~~~w~~~~~~~~e~aL~~A~dyId~~~~f~G~r~~~~Q~l~wPR~g~ 78 (170) T protein:vir:94 1 MP--TVDATPGSITANSYVTVAEANSYFDGSYGRPLWTSASEDEKASLVISASRYLDQMMAWIGAPTNPEQSMWWPCKNA 78 (170) T ss_pred Cc--eeecCCCCCcccceecHHHHHHHHHhhccccccCCCCHHHHHHHHHHHHHHhccccccccccCCcchhhcccccCc Confidence 76 34444 69999999999999985 467777899999999999998 799999999999999999999 Q ss_pred ccCcccccccccccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEcCeeEEeecCCCCcccc Q lcl|NC_019419. 69 QSDGKPVRDTLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETVGPITMEYDPATIGSGV 148 (176) Q Consensus 69 ~~~g~~~~~~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekVG~I~veY~~~~~~~~~ 148 (176) .+++..+ +++.||++||+||||||+++++++++.+..++ .+|+||||+|||||+.++ .+.+ T Consensus 79 ~~dg~~~-----------------~~~~IP~~V~~Aq~elA~~~~~~~~~~~~~~~-~v~~~kVG~i~veY~~~~-~~~~ 139 (170) T protein:vir:94 79 VIGGMTL-----------------SQVSIPVKVKIAVFELAYFMLESGAALSFADQ-TIDSVKVGTIRVEFTKNS-TDAG 139 (170) T ss_pred ccCcccc-----------------ccchhhHHHHHHHHHHHHHHHhCcccCccccc-ceeeEecceeEEEecCCC-CCCc Confidence 9888776 45789999999999999999999888776664 589999999999998544 4578 Q ss_pred hHHHHHHHHhhhhcc---CCcccceeeeecC Q lcl|NC_019419. 149 SFPWWDGLLGHWIDS---DGNAAGNFDVFRG 176 (176) Q Consensus 149 ~~~~v~~lL~~~l~~---~g~~~~~~~v~RG 176 (176) +|+.|++||+|||.+ ++++.++|+|+|| T Consensus 140 ~~~~v~~LL~p~l~~~~~g~~~~~~~~~~r~ 170 (170) T protein:vir:94 140 LPTFVEAMLSGFGSPVLYGSNAARSIDLVRA 170 (170) T ss_pred cHHHHHHHhhhhhccccccccccceeeeecC Confidence 899999999999875 8899999999999 No 6 >protein:vir:97267 Length: 172 # NCBI annotation: hypothetical protein ORF024 # Family: family:all:703 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294532;genbank:gi:149408253;genbank:GeneID:5237132 Probab=100.00 E-value=7e-46 Score=268.15 Aligned_cols=154 Identities=19% Similarity=0.220 Sum_probs=126.3 Q ss_pred CcceeecCc------cccccHHHHHHHHHhcCCccCh---hhhhHHHHHHHHHHhh-hccccee-CCCccccccccCCcc Q lcl|NC_019419. 1 MPAFFIGVN------TMYGDPQTFVDYAAARGVEVTL---SDATRHLTVVNDFLNG-INWIGEP-ADQTGIDAWPRINYQ 69 (176) Q Consensus 1 ~~~~~~~~~------~~Y~sva~adaY~a~rg~~~~~---~~ke~aLi~As~~ld~-~~~~G~r-~~~~Q~laWPR~G~~ 69 (176) |+-..++.+ .+|+|+++|++||+.||+++|+ ++||++|++|++|||+ ++|+|+| ++++|+|+|||+|+. T Consensus 1 m~liveD~t~~~~~AnSYvtv~~a~aY~~~rg~~~~a~~~~~ke~aLi~A~~yiD~~~~f~G~r~~~~~Q~l~WPRtg~~ 80 (172) T protein:vir:97 1 MALIVQDNTGAVAGANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAW 80 (172) T ss_pred CceEeeCCCCCCCCccccccHHHHHHHHHhcCcccCCCCcHHHHHHHHHHHHHHhhhhhcccCCCCCcchhhhcccCCCC Confidence 988887776 7899999999999999998774 4589999999999998 6999987 689999999999985 Q ss_pred cCcccccccccccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCC---c--ceeEEEEcCeeEEeecCCCC Q lcl|NC_019419. 70 SDGKPVRDTLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGS---G--KETIRETVGPITMEYDPATI 144 (176) Q Consensus 70 ~~g~~~~~~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~---~--~~v~rekVG~I~veY~~~~~ 144 (176) ++..+ ++|.||++||+||||||+++++++.....+. + ..+||+|||+|+++|...++ T Consensus 81 -d~~~~-----------------~~~~IP~~v~~A~~elA~~al~~~l~~d~~~~~~~~~v~~kr~kvg~i~~~y~~~~~ 142 (172) T protein:vir:97 81 -DRDRY-----------------YINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGG 142 (172) T ss_pred -CCccc-----------------ccccccHHHHHHHHHHHHHHHhcccccccccccccccceeeeeeecceeeEeeccCC Confidence 66665 5578999999999999999888865543322 2 24899999999999965433 Q ss_pred --cccchHHHHHHHHhhhhccCCcccceeeeecC Q lcl|NC_019419. 145 --GSGVSFPWWDGLLGHWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 145 --~~~~~~~~v~~lL~~~l~~~g~~~~~~~v~RG 176 (176) +..|+||+|++||+|+...+|++ +++|| T Consensus 143 ~~~~~p~~~~v~aLL~p~gl~~~~~----~~~r~ 172 (172) T protein:vir:97 143 AVFQMPKYPAADQKLVRAGLVRSGG----TLLRG 172 (172) T ss_pred CCCccccHHHHHHHHhhhccccCcc----eeccC Confidence 35899999999999962222222 79999 No 7 >protein:vir:80967 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468394;genbank:gi:157324968;genbank:GeneID:5601402 Probab=98.51 E-value=1.9e-09 Score=68.35 Aligned_cols=126 Identities=10% Similarity=0.113 Sum_probs=79.1 Q ss_pred CcceeecCccccccHHHHHHHHHhcCCccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCccccccccc Q lcl|NC_019419. 1 MPAFFIGVNTMYGDPQTFVDYAAARGVEVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRDTLT 80 (176) Q Consensus 1 ~~~~~~~~~~~Y~sva~adaY~a~rg~~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~~~ 80 (176) || |+|.++..... -|..+++++=+++|.+|+++||.+-| -| +++..+.+ T Consensus 1 M~---------Y~d~~~Y~~~y--~G~~i~e~~F~~l~~rAs~~ID~~T~-------------~r----i~~~~~d~--- 49 (131) T protein:vir:80 1 MP---------YTTLEFYTNEY--AGEHLEQDEFAKLLKHAERKIDSVTF-------------YR----IRKSGIEA--- 49 (131) T ss_pred CC---------CCCHHHHHHhh--CCCCCchhHHHHHHHHHHHHHHHHhc-------------cc----cccccccc--- Confidence 55 88887764322 46778888888999999999999755 11 11111111 Q ss_pred ccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEcCeeEEeecCCCCcccch-HHHHHHHHhh Q lcl|NC_019419. 81 DVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETVGPITMEYDPATIGSGVS-FPWWDGLLGH 159 (176) Q Consensus 81 ~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekVG~I~veY~~~~~~~~~~-~~~v~~lL~~ 159 (176) +.+.+|.+||.|+|+.|-.+...+..... ..+.+++++||..+|+|..++..+... -+.+...+.. T Consensus 50 ------------~~~~~~~~vk~A~c~q~e~~~~~g~~~~~-~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~ 116 (131) T protein:vir:80 50 ------------FSEFIQHQIQLATCNQIEYFKEAGGTSEL-AVSKPDNVSIGRTSISDSNFASTATSLNSGLVGSDVRS 116 (131) T ss_pred ------------CchhHHHHHHHHHHHHHHHHHHhhhhhhh-cccccCeeeeCceEEeeccccchhhhhhhhhhHHHHHH Confidence 22569999999999999876654433322 233479999999999997643322111 1224444455 Q ss_pred hhccCCcccceeeeecC Q lcl|NC_019419. 160 WIDSDGNAAGNFDVFRG 176 (176) Q Consensus 160 ~l~~~g~~~~~~~v~RG 176 (176) ||...|- +.|| T Consensus 117 ~L~~TGL------lyrG 127 (131) T protein:vir:80 117 YLAHTGL------LYNG 127 (131) T ss_pred HHhccCC------eecC Confidence 5544442 6677 No 8 >protein:vir:43 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463469;swissprot:trembl:q9t1b5;genbank:gi:16798791;uniprot:Q9T1B5;genbank:GeneID:922374 Probab=98.49 E-value=2.4e-09 Score=67.84 Aligned_cols=125 Identities=11% Similarity=0.162 Sum_probs=79.0 Q ss_pred CcceeecCccccccHHHHHH-HHHhcCCccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCcccccccc Q lcl|NC_019419. 1 MPAFFIGVNTMYGDPQTFVD-YAAARGVEVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRDTL 79 (176) Q Consensus 1 ~~~~~~~~~~~Y~sva~ada-Y~a~rg~~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~~ 79 (176) || |+|.++... | -|-.+++++=+++|.+|+++||.+-|- | +++..+. T Consensus 1 M~---------Y~d~~~Y~~~y---~g~~i~e~~F~~l~~rAs~~ID~~T~~-------------r----i~~~~~~--- 48 (131) T protein:vir:43 1 MP---------YTTLEFYNDEY---AGEHLEQDEFDKLLKHAERKIDSVTFY-------------R----IRKGGIE--- 48 (131) T ss_pred CC---------CCCHHHHHHhh---CCCCCCHhHHHHHHHHHHHHHHHHhcc-------------c----ccccCcc--- Confidence 54 888887643 4 466788877789999999999997551 1 1111110 Q ss_pred cccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEcCeeEEeecCCCCcccc-hHHHHHHHHh Q lcl|NC_019419. 80 TDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETVGPITMEYDPATIGSGV-SFPWWDGLLG 158 (176) Q Consensus 80 ~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekVG~I~veY~~~~~~~~~-~~~~v~~lL~ 158 (176) -+.+.+|.+||.|+|+.|-.+...+....... +.+++++||..+|+|..++..... .-+.+...+. T Consensus 49 ------------~~~~~~~~~vk~A~c~q~e~~~~~g~~s~~~~-~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~ 115 (131) T protein:vir:43 49 ------------SFSEFIQHQIQLATCNQIEYFKEAGGTSELAV-SKPDNVSIGRTSISDSNFASTATSLNSGLIGSDVR 115 (131) T ss_pred ------------ccchhhHHHHHHHHHHHHHHHHHhHHHhhhhc-cccCeeecCceEEeecccccchhhhchhhhHHHHH Confidence 12256899999999999987665443332222 347999999999999764332211 1123444555 Q ss_pred hhhccCCcccceeeeecC Q lcl|NC_019419. 159 HWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 159 ~~l~~~g~~~~~~~v~RG 176 (176) .||...|- +.|| T Consensus 116 ~~L~~TGL------lyrG 127 (131) T protein:vir:43 116 SYLAHTGL------LYNG 127 (131) T ss_pred HHHhccCC------eecC Confidence 55554442 6777 No 9 >protein:vir:98900 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:3882 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164420;genbank:gi:56694910;genbank:GeneID:3197290 Probab=98.39 E-value=6.7e-09 Score=65.41 Aligned_cols=126 Identities=13% Similarity=0.094 Sum_probs=78.2 Q ss_pred CcceeecCccccccHHHHHHHHHhcCCccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCccccccccc Q lcl|NC_019419. 1 MPAFFIGVNTMYGDPQTFVDYAAARGVEVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRDTLT 80 (176) Q Consensus 1 ~~~~~~~~~~~Y~sva~adaY~a~rg~~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~~~ 80 (176) || |+|.++...| .|-.+++++=+++|.+|+++||.+.|. ++++.-+. T Consensus 1 M~---------Y~t~~~Y~~~---~G~~i~e~~F~~l~~rAs~~ID~iT~~-----------------ri~~~~~~---- 47 (132) T protein:vir:98 1 MP---------YLTYEEFMDL---NGRDIDDKKFEKLLPKASAIIDGVTGH-----------------FYQKVDME---- 47 (132) T ss_pred CC---------CCCHHHHHhh---cCCCCCHHHHHHHHHHHHHHHHHHhcc-----------------cccCCCcc---- Confidence 44 9998887654 566788888899999999999986551 11111111 Q ss_pred ccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEcCeeEEeecCCCCc--ccchHHHHHHHHh Q lcl|NC_019419. 81 DVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETVGPITMEYDPATIG--SGVSFPWWDGLLG 158 (176) Q Consensus 81 ~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekVG~I~veY~~~~~~--~~~~~~~v~~lL~ 158 (176) -+...++.+||.|+|..+-++...+..........+++++||..+|+|.++... .....+.+..-+. T Consensus 48 -----------~d~~~~~~~vk~A~c~qiey~~~~G~~sae~~~~~~~S~svG~~Svs~~s~~~~~~~~~~~~~~~~~a~ 116 (132) T protein:vir:98 48 -----------KDNAWRVNQFKLALCAQIEYFDALGATTFEEINNSPQTFQAGRTSVSNASRYNPSGANESKPLVAEDVY 116 (132) T ss_pred -----------ccChHHHHHHHHHHHHHHHHHHhccchhhhhccCccceeeeCcEEEEeeccCCcccccccccchHHHHH Confidence 122457789999999999776654433332334458999999999999754221 1222222322233 Q ss_pred hhhccCCcccceeeeecC Q lcl|NC_019419. 159 HWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 159 ~~l~~~g~~~~~~~v~RG 176 (176) .||...|- +.|| T Consensus 117 ~~L~~tGL------LyrG 128 (132) T protein:vir:98 117 IYLQGTGL------LFQG 128 (132) T ss_pred HHHhhcCC------cccc Confidence 34443332 6677 No 10 >protein:vir:9576 Length: 131 # NCBI annotation: gp42 # Family: family:all:1271 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862881;genbank:gi:32469473;genbank:GeneID:1461318 Probab=95.57 E-value=0.00063 Score=38.15 Aligned_cols=125 Identities=11% Similarity=0.055 Sum_probs=73.2 Q ss_pred ccccccHHHHHHHHHhcCCccChhhh---hHHHHHHHHHHhhhcccceeCCCccccccccCCcccCcccccccccccccc Q lcl|NC_019419. 9 NTMYGDPQTFVDYAAARGVEVTLSDA---TRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRDTLTDVVAV 85 (176) Q Consensus 9 ~~~Y~sva~adaY~a~rg~~~~~~~k---e~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~~~~~~~~ 85 (176) -+.|+|++|+. +||..++.+++ +.+|-.|+++|.. .+|+.|..++... T Consensus 1 m~~fAtv~D~~----~rwr~Lt~~E~~ra~~LL~~As~~ir~--------------~~p~~~~~l~~~~----------- 51 (131) T protein:vir:95 1 MENFATVEDLK----KLWRALKFDEEKRAEALLEVVSHSLRV--------------EAKKVGKDLDGLV----------- 51 (131) T ss_pred CCccCCHHHHH----HHhcCCCHHHHHHHHHHHHHHHHHHHH--------------hhhhccCCccccc----------- Confidence 34488999985 56678887764 5677889999964 4565554333222 Q ss_pred cccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEcCee--EEeecCCCCcccchHHHHHHHHhhhhcc Q lcl|NC_019419. 86 VPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETVGPI--TMEYDPATIGSGVSFPWWDGLLGHWIDS 163 (176) Q Consensus 86 ~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekVG~I--~veY~~~~~~~~~~~~~v~~lL~~~l~~ 163 (176) .+.+..+.-+++-+|+...+.+..+. ...+..=.++..|+. +.+|...+. .. | +..-....|.. T Consensus 52 ------~~~~~~~~~~~~V~~~~V~Ral~~~~---~~~G~tq~S~TaG~ys~S~t~~~p~g--~l-y--lt~~e~~~LGl 117 (131) T protein:vir:95 52 ------ATDPSFTMVVKSVTVDVVARTLMTST---DQEPMTQVAESALGYSFSGSYLVPGG--GL-F--IKDSELKRLGL 117 (131) T ss_pred ------cCCccchHHHHHHHHHHHHHHhcCCC---CCCCceeeeeecccceeeeeeecCCC--Cc-e--eChHHHHHhCC Confidence 23345677899999999988775431 111222246788988 555654322 22 2 11112222333 Q ss_pred CCcccceeeeecC Q lcl|NC_019419. 164 DGNAAGNFDVFRG 176 (176) Q Consensus 164 ~g~~~~~~~v~RG 176 (176) +|.-.|.|.+.=- T Consensus 118 ~~~r~~~i~~~~~ 130 (131) T protein:vir:95 118 KKQRYGVIDIYGT 130 (131) T ss_pred CCCceeEEeeccC Confidence 4566777777644 No 11 >protein:vir:4788 Length: 130 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150168;swissprot:trembl:q94m43;genbank:gi:15088779;uniprot:Q94M43;genbank:GeneID:955972 Probab=95.32 E-value=0.00034 Score=39.58 Aligned_cols=122 Identities=12% Similarity=0.090 Sum_probs=71.0 Q ss_pred CcceeecCccccccHHHHHHHHHhcCCccChhhhhHHHHHHHHHHhhhc--ccceeCCCccccccccCCcccCccccccc Q lcl|NC_019419. 1 MPAFFIGVNTMYGDPQTFVDYAAARGVEVTLSDATRHLTVVNDFLNGIN--WIGEPADQTGIDAWPRINYQSDGKPVRDT 78 (176) Q Consensus 1 ~~~~~~~~~~~Y~sva~adaY~a~rg~~~~~~~ke~aLi~As~~ld~~~--~~G~r~~~~Q~laWPR~G~~~~g~~~~~~ 78 (176) || |.|.++.+.| +.+ +.++=+++|-+|++-||.+. |.=+..+-+ T Consensus 1 M~---------YlT~eey~el----~~~-~~~~F~kl~k~A~~~ID~~t~~~y~~~~~~~-------------------- 46 (130) T protein:vir:47 1 MT---------YLTQEEFDEL----DFD-EVTDFEKLAKRAKIAIDLYTNGIYQKDIDFE-------------------- 46 (130) T ss_pred CC---------CCchhhHhhc----CCC-ChhhHHHHHHHHHHHHHHHhcccccccCCcc-------------------- Confidence 55 9999998765 444 33457899999999999752 321110000 Q ss_pred ccccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEcCeeEEeecCCCCccc-chHHHHHHHH Q lcl|NC_019419. 79 LTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETVGPITMEYDPATIGSG-VSFPWWDGLL 157 (176) Q Consensus 79 ~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekVG~I~veY~~~~~~~~-~~~~~v~~lL 157 (176) -+.+.+=.+||.|.|.=..+....+ ..+......+.+.+||--++.|+.+..+.. ..+-.....+ T Consensus 47 -------------~~~~~r~~~vK~A~a~QieY~~~~G-~~s~~~~~~~~S~svGrtSis~~~~~~~~~~~~~~vs~da~ 112 (130) T protein:vir:47 47 -------------KEIAYRKSAVKLAMAFQIAYLDASG-IMSADDKQLANSVSIGRTSISYSTSQSTLAGQRFNLSMDAE 112 (130) T ss_pred -------------CcchHHHHHHHHHHHHHHHHHHHhc-cccchhccCcceeeecceeeecCcCccccccCCccccHHHH Confidence 0123455678888887665544332 233333556889999999999976543322 2332222222 Q ss_pred hhhhccCCcccceeeeecC Q lcl|NC_019419. 158 GHWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 158 ~~~l~~~g~~~~~~~v~RG 176 (176) .||...|. ++.|| T Consensus 113 -~~L~~tGL-----~Ly~G 125 (130) T protein:vir:47 113 -NALRQAGF-----SLVVG 125 (130) T ss_pred -HHHHhccc-----ccccC Confidence 34444443 46788 No 12 >protein:vir:99002 Length: 158 # NCBI annotation: gp31 # Family: family:all:32652 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655896;genbank:gi:109521468;genbank:GeneID:4157967 Probab=95.21 E-value=0.00053 Score=38.54 Aligned_cols=122 Identities=13% Similarity=0.192 Sum_probs=78.4 Q ss_pred ccccccHHHHHHHHHhcCCccChhhhhHHHHHHHHHHhhh-cccceeCCCccccccccCCcccCcccccccccccccccc Q lcl|NC_019419. 9 NTMYGDPQTFVDYAAARGVEVTLSDATRHLTVVNDFLNGI-NWIGEPADQTGIDAWPRINYQSDGKPVRDTLTDVVAVVP 87 (176) Q Consensus 9 ~~~Y~sva~adaY~a~rg~~~~~~~ke~aLi~As~~ld~~-~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~~~~~~~~~~ 87 (176) --.|+|++|++... -.++|++. ++...+|=.+||.. .|. -.-|..+||-. T Consensus 1 ~~alasvee~~trl---~~~lp~~~-~r~~a~a~~vLd~~S~~a----r~~~gr~W~~~--------------------- 51 (158) T protein:vir:99 1 MAALVSVEEFTTFL---RVPLPEEG-SEKYTQMEFLLTLASDWA----RELSCKPWLLP--------------------- 51 (158) T ss_pred CcceeeHhhhhhhh---cccCChhh-hHHHHHHHHHHHHHHHHH----HHhcCccCCCC--------------------- Confidence 23489999999988 34676554 23333333334321 110 12355678821 Q ss_pred cccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEcCeeEEeecCCCCcccchH-HHHHHHHhhhhccCCc Q lcl|NC_019419. 88 AGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETVGPITMEYDPATIGSGVSF-PWWDGLLGHWIDSDGN 166 (176) Q Consensus 88 ~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekVG~I~veY~~~~~~~~~~~-~~v~~lL~~~l~~~g~ 166 (176) +.+|.-|+.-|...|-+..+||+ .++.+.+|+-++.|........ -| ..=..+|+-|-++. + T Consensus 52 -------~daP~~vr~ivL~aa~R~~~NP~--------g~~~~~~G~~~~~~~~~g~~~~-ffT~~E~~~L~r~~~s~-G 114 (158) T protein:vir:99 52 -------ADAPVTARGIILAASRREWNNPK--------RVSYVVKGPQSATFMQSAYPPG-FFTDAEEAKLRSYGRST-G 114 (158) T ss_pred -------CcchhHHHHHHHHHHHHHHhcCC--------ceEEeeecchhhhcccccCCCc-ccCHHHHHHHHHhhccc-C Confidence 45888888888888888888762 4677889999999965432211 22 45567888887665 4 Q ss_pred ccceeeeecC Q lcl|NC_019419. 167 AAGNFDVFRG 176 (176) Q Consensus 167 ~~~~~~v~RG 176 (176) ++.++...|| T Consensus 115 G~~~~~ttR~ 124 (158) T protein:vir:99 115 NWGVIETYRD 124 (158) T ss_pred ceeEEEeecC Confidence 6799999999 No 13 >protein:vir:1640 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695061;genbank:gi:23455752;genbank:GeneID:955488 Probab=94.54 E-value=0.0016 Score=35.97 Aligned_cols=126 Identities=10% Similarity=0.033 Sum_probs=70.5 Q ss_pred CcceeecCccccccHHHHHHHHHhcCCccChhhh---hHHHHHHHHHHhhhcccceeCCCccccccccCCcccCcccccc Q lcl|NC_019419. 1 MPAFFIGVNTMYGDPQTFVDYAAARGVEVTLSDA---TRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRD 77 (176) Q Consensus 1 ~~~~~~~~~~~Y~sva~adaY~a~rg~~~~~~~k---e~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~ 77 (176) |+ .|+|++|+. +||.+++++++ +.+|-.|+++|.. .+|+.|..++...-++ T Consensus 1 m~--------~fAtv~Dv~----~r~r~L~~~E~~ra~~lL~dAs~~ir~--------------~~p~~~~~l~a~~~e~ 54 (132) T protein:vir:16 1 MN--------PFATVDDLT----MLWRPLKGDEKERAEKLLEIVSDSLRE--------------EADKVGRDLYAMIAEK 54 (132) T ss_pred CC--------ccCCHHHHH----HHhcCCCHhHHHHHHHHHHHHHHHHHH--------------hhhhhccccccccccc Confidence 44 488999985 66678888864 5677889999954 4555544333221111 Q ss_pred cccccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEcCee--EEeecCCCCcccchHHHHHH Q lcl|NC_019419. 78 TLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETVGPI--TMEYDPATIGSGVSFPWWDG 155 (176) Q Consensus 78 ~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekVG~I--~veY~~~~~~~~~~~~~v~~ 155 (176) .+..+.-++.-+|+...+.+.++.-.. |..=.++..|+. +.+|...+ +..-+ -+. T Consensus 55 ----------------~~~~~~~~~~V~~~~V~Ral~~~~~~~---G~tq~S~TaG~ys~S~t~~~p~--G~lyl--t~~ 111 (132) T protein:vir:16 55 ----------------PSYFASVVKSVTVDIVARTLMTSTDQE---PMTQTTESALGYSVSGSYLVPG--GGLFI--KNS 111 (132) T ss_pred ----------------cccchhHHHHHHHHHHHHHhcCCCCCC---CceeeeeeccchheeeeeecCC--Cccee--ChH Confidence 122344567778888888776542111 212245778988 56675432 22221 122 Q ss_pred HHhhhhccCCcccceeeeecC Q lcl|NC_019419. 156 LLGHWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 156 lL~~~l~~~g~~~~~~~v~RG 176 (176) .+. .|..++..+|+|.+.=- T Consensus 112 e~~-~LG~~~~r~~~i~~~~~ 131 (132) T protein:vir:16 112 ELS-RLGLKKQRFGVIDFYGN 131 (132) T ss_pred HHH-hhCCCCCceEEEeecCC Confidence 222 23335566777777644 No 14 >protein:vir:2505 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:28222 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569746;genbank:gi:18496896;genbank:GeneID:932265 Probab=93.79 E-value=0.00088 Score=37.34 Aligned_cols=118 Identities=14% Similarity=0.073 Sum_probs=76.3 Q ss_pred ecCccccccHHHHHHHHHhcC-CccChhhhh---HHHHHHHHHHhhhcccceeCCCccccccccCCcccCcccccccccc Q lcl|NC_019419. 6 IGVNTMYGDPQTFVDYAAARG-VEVTLSDAT---RHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRDTLTD 81 (176) Q Consensus 6 ~~~~~~Y~sva~adaY~a~rg-~~~~~~~ke---~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~~~~ 81 (176) ..--...+|+.+..+ +| ..+++++.+ .+|-.|+|-|.++-|.+ + T Consensus 1 ~~~~~alAtvdDv~~----~lrr~Lt~dE~~~a~~Ll~eAsdlI~g~l~~~---------------------~------- 48 (128) T protein:vir:25 1 MTECKALATSQDVKR----ALRRDLTEAEQTDLSELLAEATDLVVGYLHPY---------------------P------- 48 (128) T ss_pred CccchhccCHHHHHH----HhcCCCCHHHHHHHHHHHhcchheeeeecCCC---------------------C------- Confidence 122223455555443 34 467777643 45668999998875533 1 Q ss_pred cccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEcCeeEEeecCCCCcccchH-HHHHHHHhhh Q lcl|NC_019419. 82 VVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETVGPITMEYDPATIGSGVSF-PWWDGLLGHW 160 (176) Q Consensus 82 ~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekVG~I~veY~~~~~~~~~~~-~~v~~lL~~~ 160 (176) ..|.+|..|+.-+|..+..+++.++..... -.+..-|+++++|..+++++..-. ..-+.+|+|| T Consensus 49 -----------vp~~~p~~v~rVvA~ivarAltr~~~~~pe----~~S~TAgpfs~~ft~~~~~~g~yLTaa~k~~Lrp~ 113 (128) T protein:vir:25 49 -----------VPTPTPGPIKRVVASMVAAVLTRPTQILPE----TQSLTADGFGVTFTPGGNSPGPYLSAALKQRLRPY 113 (128) T ss_pred -----------CCCCCCchHHHHHHHHHHHHhhCCCccCCC----ceeeecccccccccCCCCCCCceEcHHHHhhcccc Confidence 126689999999999998888765444332 234467999999877766655433 5678899999 Q ss_pred hccCCcccceeeeecC Q lcl|NC_019419. 161 IDSDGNAAGNFDVFRG 176 (176) Q Consensus 161 l~~~g~~~~~~~v~RG 176 (176) -.+ .|+|.=| T Consensus 114 R~~------~~sV~l~ 123 (128) T protein:vir:25 114 RTG------MVAVEMG 123 (128) T ss_pred cce------eeEeecc Confidence 542 4667777 No 15 >protein:vir:94761 Length: 132 # NCBI annotation: unknown # Family: family:all:1271 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996708;genbank:gi:45597423;genbank:GeneID:2769029 Probab=93.66 E-value=0.0036 Score=34.00 Aligned_cols=126 Identities=10% Similarity=-0.015 Sum_probs=68.8 Q ss_pred ccccccHHHHHHHHHhcCCccChhhh---hHHHHHHHHHHhhhcccceeCCCccccccccCCcccCcccccccccccccc Q lcl|NC_019419. 9 NTMYGDPQTFVDYAAARGVEVTLSDA---TRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRDTLTDVVAV 85 (176) Q Consensus 9 ~~~Y~sva~adaY~a~rg~~~~~~~k---e~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~~~~~~~~ 85 (176) -+.|+|++|+.+ ||..++++++ +.+|-.|+++|.. .||+.|...+..-. T Consensus 1 m~~fAtv~Dl~~----r~r~L~~dE~~ra~~LL~dAs~~iR~--------------~~~~~~~~~~~~~~---------- 52 (132) T protein:vir:94 1 MNPFATVDDLTM----LWRPLKGDEKERAEKLLEIVSDTLRE--------------EADKVGRDLDVMIS---------- 52 (132) T ss_pred CCCcCCHHHHHH----HhccCChhHHHHHHHHHHHHHHHHHH--------------HHhhhccccccccC---------- Confidence 345889999874 7778888875 4567789999964 45555432221100 Q ss_pred cccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEcCee--EEeecCCCCcccchHHHHHHHHhhhhcc Q lcl|NC_019419. 86 VPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETVGPI--TMEYDPATIGSGVSFPWWDGLLGHWIDS 163 (176) Q Consensus 86 ~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekVG~I--~veY~~~~~~~~~~~~~v~~lL~~~l~~ 163 (176) ...|..+.-+|.-+|+...+.+..+.- ..+..=.++..|+. +.+|...+. . .| +..-....|.- T Consensus 53 ------~~~d~~~~~~k~V~~~~V~Ral~~~~~---~~g~tq~S~TaG~ys~S~T~~np~G--~-ly--lt~~e~~~LGl 118 (132) T protein:vir:94 53 ------EKPSYFSSVVKSVTVDIVARTLMTSTD---QEPMTQTTESALGYSVSGSYLVPGG--G-LF--IKNSELSRLGL 118 (132) T ss_pred ------CCCccchhHHHHHHHHHHHHHhcCCCC---CCCceeeeeecccceeeeeeecCCC--C-ce--eChHHHHhhCC Confidence 011334555677788888887755321 11211245778988 566754322 2 22 11112222333 Q ss_pred CCcccceeeeecC Q lcl|NC_019419. 164 DGNAAGNFDVFRG 176 (176) Q Consensus 164 ~g~~~~~~~v~RG 176 (176) ++.-.|.|.+.=- T Consensus 119 ~~~r~~~i~~~~~ 131 (132) T protein:vir:94 119 KKQRFGVIDFYGN 131 (132) T ss_pred CCCceEEEeecCC Confidence 4456677766533 No 16 >protein:vir:9761 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795523;genbank:gi:28876281;genbank:GeneID:1257822 Probab=93.40 E-value=0.0037 Score=33.90 Aligned_cols=123 Identities=11% Similarity=0.018 Sum_probs=67.0 Q ss_pred CcceeecCccccccHHHHHHHHHhcCCccChhhh---hHHHHHHHHHHhhhcccceeCCCccccccccCCcccCcccccc Q lcl|NC_019419. 1 MPAFFIGVNTMYGDPQTFVDYAAARGVEVTLSDA---TRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRD 77 (176) Q Consensus 1 ~~~~~~~~~~~Y~sva~adaY~a~rg~~~~~~~k---e~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~ 77 (176) |+ .|+|++|+. +||..++++++ +.+|-.|+++|.. .+|+.|..++-... T Consensus 1 m~--------~fATv~Dv~----~rwr~Lt~dE~~ra~~LL~dAS~~iR~--------------~~p~~g~~~~~~~~-- 52 (140) T protein:vir:97 1 MG--------NFATTDDVI----LLWRPLSVDELKRANALLKVVSDTLRM--------------EADKVGKDLDKTMV-- 52 (140) T ss_pred CC--------cCCCHHHHH----HHhcCCCHhHHHHHHHHHHHHHHHHHH--------------hhhhccCCcchhcc-- Confidence 44 488999987 56668887764 5677789999964 56666543221111 Q ss_pred cccccccccccccccccccccHHHHHHHHHHHHHHhcC-cCCCcCCCcceeEEEEcCee--EEeecCCCCcccchHHHHH Q lcl|NC_019419. 78 TLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADE-VDISPVGSGKETIRETVGPI--TMEYDPATIGSGVSFPWWD 154 (176) Q Consensus 78 ~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~-~~~~~~~~~~~v~rekVG~I--~veY~~~~~~~~~~~~~v~ 154 (176) ..+.-+.-++.-+|....+.+.. .+. .+..=.++..|+. +.+|...+. ..| +. T Consensus 53 ---------------~~~~~~~~~k~V~~~mV~Ral~~~~d~----~G~tq~S~TaG~ys~S~T~~np~G---~ly--lt 108 (140) T protein:vir:97 53 ---------------DKPYFVNVIKSVTVDIVARTLMTSTQG----EPMSQESQSALGYTWSGTYLVPGG---GLF--IK 108 (140) T ss_pred ---------------cCccchhHHHHHHHHHHHHHhcCCCCC----CcceeeeeeccchhheeeeecCCC---Cce--eC Confidence 11123444566777776665532 221 1222245678988 566754322 222 11 Q ss_pred HHHhhhhccCCcccceeeeecC Q lcl|NC_019419. 155 GLLGHWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 155 ~lL~~~l~~~g~~~~~~~v~RG 176 (176) .-....|..+|.-.|+|.+ =| T Consensus 109 ~~e~~~LGl~~~r~~~i~~-~g 129 (140) T protein:vir:97 109 DNELKRLGLKKQRYGGIEL-YG 129 (140) T ss_pred hHHHHHhCCCCCceeeecc-cC Confidence 1122233334566778877 45 No 17 >protein:vir:79701 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285884;genbank:gi:148750841;genbank:GeneID:5220403 Probab=92.62 E-value=0.0033 Score=34.20 Aligned_cols=130 Identities=18% Similarity=0.103 Sum_probs=66.9 Q ss_pred ccccccHHHHHHHHHhcCCccCh-hhhhHHHHHHHHHHhhhc--ccceeCCCccccccccCCcccCcccccccccccccc Q lcl|NC_019419. 9 NTMYGDPQTFVDYAAARGVEVTL-SDATRHLTVVNDFLNGIN--WIGEPADQTGIDAWPRINYQSDGKPVRDTLTDVVAV 85 (176) Q Consensus 9 ~~~Y~sva~adaY~a~rg~~~~~-~~ke~aLi~As~~ld~~~--~~G~r~~~~Q~laWPR~G~~~~g~~~~~~~~~~~~~ 85 (176) -++|.|-+|.+ .-|....+ ++=+++|-+|.+-||.+. |.+. + ..+ . T Consensus 1 ~~pYLTy~ef~----~lg~~~~~~d~F~kllk~A~~~ID~~T~y~~~~--------------y-----~~~--------~ 49 (144) T protein:vir:79 1 MKPYLTTSDFE----KLGYELKKPDNFGKLLKSATVLINQICSYYDPA--------------F-----AYH--------D 49 (144) T ss_pred CCcccchhhhh----hhCCCCcchhhhhhHHHHHHHHhhhhhhhhccc--------------c-----ccc--------c Confidence 56788877763 34554454 445899999999999852 2110 0 000 0 Q ss_pred ccccccccc-cccc---HHHHHHHHHHHHHHhcCcCCCc-CCCcceeEEEEcCeeEEeecCCCCcccc--hHHHHHHHHh Q lcl|NC_019419. 86 VPAGQIVDF-ASIP---IAVEQAVYRLAMLVADEVDISP-VGSGKETIRETVGPITMEYDPATIGSGV--SFPWWDGLLG 158 (176) Q Consensus 86 ~~~g~~i~~-d~IP---~~V~~A~~eLA~~~~~~~~~~~-~~~~~~v~rekVG~I~veY~~~~~~~~~--~~~~v~~lL~ 158 (176) +..+.-++. ..|| .+||.|.|.-..+....+.... ......+++..||-.+++|.+++..+.. ++ .|...-- T Consensus 50 i~~d~~~d~~~~~~~r~~~vKkA~a~QIeY~~~~G~~sa~e~~~~~~~S~svGrtsvs~~~~~~~s~t~~~~-~v~~~a~ 128 (144) T protein:vir:79 50 LEADSQADPDSYLFRQAMAFKKAVALEMLFLEDSGYSSAYDVAQGALNSFTVGHTSMSLNPSAGQNLTVGST-GVVKSAY 128 (144) T ss_pred ccccccccchhhhhHHHHHHHHHHHHHHHHHHHcCCcchhhhhcCccceeEecceEEeecCCCccccccccc-cccHHHH Confidence 001111111 2255 4568888876654443332222 1224568999999999999765443211 12 1222223 Q ss_pred hhhccCCcccceeeeecC Q lcl|NC_019419. 159 HWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 159 ~~l~~~g~~~~~~~v~RG 176 (176) .||...|. +.|| T Consensus 129 ~yL~~tGL------LYrG 140 (144) T protein:vir:79 129 DLLGRYGL------LFSG 140 (144) T ss_pred HHHhhcCc------cccc Confidence 33333332 4455 No 18 >protein:vir:9821 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795583;genbank:gi:28876338;genbank:GeneID:1257921 Probab=91.96 E-value=0.0026 Score=34.74 Aligned_cols=127 Identities=15% Similarity=0.111 Sum_probs=66.4 Q ss_pred CcceeecCccccccHHHHHHHHHhcCCccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCccccccccc Q lcl|NC_019419. 1 MPAFFIGVNTMYGDPQTFVDYAAARGVEVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRDTLT 80 (176) Q Consensus 1 ~~~~~~~~~~~Y~sva~adaY~a~rg~~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~~~ 80 (176) |- -+.-.|.|.+|.+. -+.+-+ ++=+.+|-+|++-||.+. +..+++..+-+ T Consensus 1 ~~----~~~M~YlT~eey~~----l~~~~~-~dF~kllk~As~~ID~~t-----------------~~~y~~~d~e~--- 51 (138) T protein:vir:98 1 ME----VVIIAFLTQKEFED----LGFDDV-EDFEKMEKRASHAVNLYC-----------------RNRYDYKDLKK--- 51 (138) T ss_pred Cc----cccccccchHHHhc----cCCCCh-hhHHHHHHHHHHHhhhhh-----------------ccccccccccc--- Confidence 11 12223999997754 355533 457899999999999741 11222222211 Q ss_pred ccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEcCeeEEeecCCCCcccc------hHHHHH Q lcl|NC_019419. 81 DVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETVGPITMEYDPATIGSGV------SFPWWD 154 (176) Q Consensus 81 ~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekVG~I~veY~~~~~~~~~------~~~~v~ 154 (176) +.+-+=.+||.|.|.=..++.+.+ ..+........+.+||-.++.|..++..+.. +|.... T Consensus 52 ------------d~~~r~~~vKkA~a~QIeY~~~~G-~ts~~d~~~~~s~svGrTSiS~~~~~~~~s~~~~~~~~~~~s~ 118 (138) T protein:vir:98 52 ------------EIALVQKAVKRAIAYQIAYLNDSG-VMTAEDKQSFAGISLGRTSISYTVGHGQGSQQKTLADRFNLCL 118 (138) T ss_pred ------------hhHHHHHHHHHHHHHHHHHHHHcC-CcchhhccCcCceEeeeeEeecccccccccccccccccccccH Confidence 112244567888776544443333 3333335567899999999988433222211 122222 Q ss_pred HHHhhhhccCCcccceeeeecC Q lcl|NC_019419. 155 GLLGHWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 155 ~lL~~~l~~~g~~~~~~~v~RG 176 (176) .-+. ||...|. +.|| T Consensus 119 ~A~~-~L~~tGL------LY~G 133 (138) T protein:vir:98 119 DAEN-ELLVVGL------GYTG 133 (138) T ss_pred HHHH-HHhhcCc------cccc Confidence 2122 4444443 5666 No 19 >protein:vir:101652 Length: 188 # NCBI annotation: gp15 # Family: family:all:11114 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654770;genbank:gi:109302768;genbank:GeneID:4156086 Probab=90.60 E-value=0.014 Score=30.83 Aligned_cols=140 Identities=14% Similarity=0.133 Sum_probs=67.1 Q ss_pred HHHHHHHHHhcCCccChhhhhHHHHHHHHHHhhh---cccceeC-----C-Ccccccccc------CC---cccCccccc Q lcl|NC_019419. 15 PQTFVDYAAARGVEVTLSDATRHLTVVNDFLNGI---NWIGEPA-----D-QTGIDAWPR------IN---YQSDGKPVR 76 (176) Q Consensus 15 va~adaY~a~rg~~~~~~~ke~aLi~As~~ld~~---~~~G~r~-----~-~~Q~laWPR------~G---~~~~g~~~~ 76 (176) ..+++..++ -..+++++.+-||..|+..+.++ +|.=... + +.-.+--|+ .. +.++|.+.. T Consensus 1 ~~~~~~la~--~~~~da~~v~lAL~~As~~VR~~~g~~i~~V~~dt~~id~~Gg~~~LP~~Pvv~i~~Ve~~~~~G~~~~ 78 (188) T protein:vir:10 1 MTFAQQLAD--AFPEDADDAATALSWAKSQVEGYCGRKFDLVEDDVAIVDPYCGSSLLPSIPVVEISKVEGYLPTGNGMD 78 (188) T ss_pred CchhhhHHH--hcCCCcchHHHHHHHHHHHHHHHhCCcceeeeeeeeeeecCCCceeeccCcceeeeEEEEEeeCCcccc Confidence 112222222 23445666777899888888753 2220000 0 000001133 11 112222211 Q ss_pred cccccccccccc--------c----------------------cccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcce Q lcl|NC_019419. 77 DTLTDVVAVVPA--------G----------------------QIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKE 126 (176) Q Consensus 77 ~~~~~~~~~~~~--------g----------------------~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~ 126 (176) -...+.+....+ | +-=--+.||.+|+...|++|-..+.|+. - T Consensus 79 ~v~~r~y~~~g~~~~l~~trg~pg~~~~r~~~WPw~p~~VtVTytHGy~evP~eiv~lv~d~A~~~~~np~--------~ 150 (188) T protein:vir:10 79 WVELTNYWFKRDTGLIFDTTGLPGSEWSTGHTWPWLPGSLRVTYTHGYNPVPDELIDVAIRLAREYQSNPE--------L 150 (188) T ss_pred cccccccccccceeeecccccCcccccccccccccCcceEEEEEecCCCcccHHHHHHHHHHHHHHhcCcc--------c Confidence 111110000000 0 0001357999999999999987776532 2 Q ss_pred eEEEEcCeeEEeecCCCCcccchHHHHHHHHhhhhccCCc Q lcl|NC_019419. 127 TIRETVGPITMEYDPATIGSGVSFPWWDGLLGHWIDSDGN 166 (176) Q Consensus 127 v~rekVG~I~veY~~~~~~~~~~~~~v~~lL~~~l~~~g~ 166 (176) ...++||++|++|+. .++.+. -+.=..+|+.|....-. T Consensus 151 L~q~~vG~~S~tfa~-~~~~sl-~~~~~~il~ry~l~~~~ 188 (188) T protein:vir:10 151 LVSKQVGEIERRFGS-VAGTSL-SKADQAILDRYVIATLA 188 (188) T ss_pred ceeeecCceeeeccc-ccCCcc-cchhHHhhccccccccC Confidence 467899999999985 222221 13345778888654322 No 20 >protein:vir:7857 Length: 188 # NCBI annotation: gp14 # Family: family:all:11114 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817464;genbank:gi:29565893;genbank:GeneID:1259086 Probab=90.60 E-value=0.014 Score=30.83 Aligned_cols=140 Identities=14% Similarity=0.133 Sum_probs=67.1 Q ss_pred HHHHHHHHHhcCCccChhhhhHHHHHHHHHHhhh---cccceeC-----C-Ccccccccc------CC---cccCccccc Q lcl|NC_019419. 15 PQTFVDYAAARGVEVTLSDATRHLTVVNDFLNGI---NWIGEPA-----D-QTGIDAWPR------IN---YQSDGKPVR 76 (176) Q Consensus 15 va~adaY~a~rg~~~~~~~ke~aLi~As~~ld~~---~~~G~r~-----~-~~Q~laWPR------~G---~~~~g~~~~ 76 (176) ..+++..++ -..+++++.+-||..|+..+.++ +|.=... + +.-.+--|+ .. +.++|.+.. T Consensus 1 ~~~~~~la~--~~~~da~~v~lAL~~As~~VR~~~g~~i~~V~~dt~~id~~Gg~~~LP~~Pvv~i~~Ve~~~~~G~~~~ 78 (188) T protein:vir:78 1 MTFAQQLAD--AFPEDADDAATALSWAKSQVEGYCGRKFDLVEDDVAIVDPYCGSSLLPSIPVVEISKVEGYLPTGNGMD 78 (188) T ss_pred CchhhhHHH--hcCCCcchHHHHHHHHHHHHHHHhCCcceeeeeeeeeeecCCCceeeccCcceeeeEEEEEeeCCcccc Confidence 112222222 23445666777899888888753 2220000 0 000001133 11 112222211 Q ss_pred cccccccccccc--------c----------------------cccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcce Q lcl|NC_019419. 77 DTLTDVVAVVPA--------G----------------------QIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKE 126 (176) Q Consensus 77 ~~~~~~~~~~~~--------g----------------------~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~ 126 (176) -...+.+....+ | +-=--+.||.+|+...|++|-..+.|+. - T Consensus 79 ~v~~r~y~~~g~~~~l~~trg~pg~~~~r~~~WPw~p~~VtVTytHGy~evP~eiv~lv~d~A~~~~~np~--------~ 150 (188) T protein:vir:78 79 WVELTNYWFKRDTGLIFDTTGLPGSEWSTGHTWPWLPGSLRVTYTHGYNPVPDELIDVAIRLAREYQSNPE--------L 150 (188) T ss_pred cccccccccccceeeecccccCcccccccccccccCcceEEEEEecCCCcccHHHHHHHHHHHHHHhcCcc--------c Confidence 111110000000 0 0001357999999999999987776532 2 Q ss_pred eEEEEcCeeEEeecCCCCcccchHHHHHHHHhhhhccCCc Q lcl|NC_019419. 127 TIRETVGPITMEYDPATIGSGVSFPWWDGLLGHWIDSDGN 166 (176) Q Consensus 127 v~rekVG~I~veY~~~~~~~~~~~~~v~~lL~~~l~~~g~ 166 (176) ...++||++|++|+. .++.+. -+.=..+|+.|....-. T Consensus 151 L~q~~vG~~S~tfa~-~~~~sl-~~~~~~il~ry~l~~~~ 188 (188) T protein:vir:78 151 LVSKQVGEIERRFGS-VAGTSL-SKADQAILDRYVIATLA 188 (188) T ss_pred ceeeecCceeeeccc-ccCCcc-cchhHHhhccccccccC Confidence 467899999999985 222221 13345778888654322 No 21 >protein:vir:80320 Length: 188 # NCBI annotation: gp8, conserved hypothetical protein # Family: family:all:501 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111087;genbank:gi:134288682;genbank:GeneID:4960567 Probab=81.63 E-value=0.084 Score=26.48 Aligned_cols=138 Identities=14% Similarity=-0.034 Sum_probs=68.3 Q ss_pred CcceeecCc--cccccHHHHHHHHHhcCCccChhh--hhHHHH-HHHHHHhhhcccceeC---CCccc-cccccCCcccC Q lcl|NC_019419. 1 MPAFFIGVN--TMYGDPQTFVDYAAARGVEVTLSD--ATRHLT-VVNDFLNGINWIGEPA---DQTGI-DAWPRINYQSD 71 (176) Q Consensus 1 ~~~~~~~~~--~~Y~sva~adaY~a~rg~~~~~~~--ke~aLi-~As~~ld~~~~~G~r~---~~~Q~-laWPR~G~~~~ 71 (176) |.+.+|-.. ..=+|+++++++..-- -+++| -+..|+ .|.++++++ .|+.- ...+- -.||+.++.++ T Consensus 1 M~~~~~~~ppa~ePVtL~e~K~hLRid---~~~eD~~l~~~lI~aA~~~~E~~--~gr~l~~qt~~~~~~~~~~~~i~Lp 75 (188) T protein:vir:80 1 MAAVLVEYLDDAEPLTFEEVAFQCRID---DDDERDFVERIVIPGARQAAESK--SGAAIRKARYVERLSGFPLAEISLS 75 (188) T ss_pred CCceeeccCCCCcccCHHHHHHHcCCC---CchhhHHHHHHHHHHHHHHHHHH--hCCeeeeeeEEEEecCCCCCceEec Confidence 888877432 3347999999988652 22223 234455 577788863 23211 11111 12444333322 Q ss_pred cccccccc---------c------cccccc--------------------------ccccccccccccHHHHHHHHHHHH Q lcl|NC_019419. 72 GKPVRDTL---------T------DVVAVV--------------------------PAGQIVDFASIPIAVEQAVYRLAM 110 (176) Q Consensus 72 g~~~~~~~---------~------~~~~~~--------------------------~~g~~i~~d~IP~~V~~A~~eLA~ 110 (176) -.++.+.. . ..++++ ..|+ .+.+|..+|+|...++. T Consensus 76 ~~PV~sV~sV~~~d~~G~~~~l~~~~y~l~~~~~~~~l~~~~~~~~p~~~~V~V~~~AG~---~~~vP~~ik~aill~va 152 (188) T protein:vir:80 76 VGQVIRVDSIEIRDASGATTTLDADAFELVQLGREALLVPEGQARWPFARAVTITYQAGV---DLARYPSVRTWMLLAAA 152 (188) T ss_pred ccccceeeEEEEEcCCCcEEeecccceEEeecCCCcEEEEecCCCCCCCceEEEEEEecc---cccChHHHHHHHHHHHH Confidence 11111100 0 001111 1121 14689999999999887 Q ss_pred HHhcCcCCCcCCCcceeEEEEcCeeEEeecCCCCcccchHHHHHHHHhhhhccCCc Q lcl|NC_019419. 111 LVADEVDISPVGSGKETIRETVGPITMEYDPATIGSGVSFPWWDGLLGHWIDSDGN 166 (176) Q Consensus 111 ~~~~~~~~~~~~~~~~v~rekVG~I~veY~~~~~~~~~~~~~v~~lL~~~l~~~g~ 166 (176) ....+- |.+ . .+. +.+..| ...+++||++|-.-.|+ T Consensus 153 ~~Ye~R-------------e~~---~--~g~-~~~~~P-~~~v~~Ll~pyRvp~~~ 188 (188) T protein:vir:80 153 WAYDHR-------------ELF---S--EGQ-PIGEMP-GGYADVLLNPITVPPRF 188 (188) T ss_pred HHHhcc-------------ccc---c--ccc-cccccc-HHHHHHHhhccCCCCCC Confidence 665431 110 0 000 111122 23589999999988776 No 22 >protein:vir:1435 Length: 188 # NCBI annotation: hypothetical protein # Family: family:all:501 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536364;genbank:gi:17975169;genbank:GeneID:929149 Probab=79.46 E-value=0.1 Score=25.97 Aligned_cols=137 Identities=11% Similarity=-0.043 Sum_probs=65.4 Q ss_pred CcceeecCc--cccccHHHHHHHHHhcCCccChhh--hhHHHH-HHHHHHhhhcccceeC---CC-ccccccccCCcccC Q lcl|NC_019419. 1 MPAFFIGVN--TMYGDPQTFVDYAAARGVEVTLSD--ATRHLT-VVNDFLNGINWIGEPA---DQ-TGIDAWPRINYQSD 71 (176) Q Consensus 1 ~~~~~~~~~--~~Y~sva~adaY~a~rg~~~~~~~--ke~aLi-~As~~ld~~~~~G~r~---~~-~Q~laWPR~G~~~~ 71 (176) |=+.+|-.. ..=+|++|+++|..-- -+++| -...|+ .|.++++++ .|+.- .. ..--.||+.+..++ T Consensus 1 m~~~~~~~ppa~epVtLae~K~~lrid---~~~eD~~l~~~li~aA~~~~E~~--tgr~l~~qt~~~~~~~~~~~~~~Lp 75 (188) T protein:vir:14 1 MAAVLVEYLDDAEPLTFEEVAFQCRID---DDDERDFVERVVIPGARQAAESK--AGAAIRKARYVEHLSGFPPAEVPLS 75 (188) T ss_pred CCceeeecCCCCCccCHHHHHHHcCCC---CchhHHHHHHHHHHHHHHHHHHH--hCCeeeeeeEEEEecCcCCCceEec Confidence 776666433 3456999999987542 22222 234455 567788863 33221 11 11123444333332 Q ss_pred ccccccccccc------------------------------------------ccccccccccccccccHHHHHHHHHHH Q lcl|NC_019419. 72 GKPVRDTLTDV------------------------------------------VAVVPAGQIVDFASIPIAVEQAVYRLA 109 (176) Q Consensus 72 g~~~~~~~~~~------------------------------------------~~~~~~g~~i~~d~IP~~V~~A~~eLA 109 (176) --++.+. +.+ -.....|+ .+.+|..+|+|...++ T Consensus 76 ~~Pv~sV-~sV~~~d~~g~~~~l~~~~y~l~~~~~~~~l~~~~~~~~p~~~~V~V~~~AG~---~~~vP~~ik~Aill~v 151 (188) T protein:vir:14 76 VGQVISV-DSIEIRDASGATTTLDAGAFELVQLGRETLLVPAGQARWPYARAVTIKYQAGI---DLARYPSVRSWMLLAA 151 (188) T ss_pred ccCccee-eEEEEEcCCCceEeecccceEEeecCCCcEEEEecCCCCCCCceEEEEEEecC---ccCchHHHHHHHHHHH Confidence 2222110 000 00111121 2457888888888887 Q ss_pred HHHhcCcCCCcCCCcceeEEEEcCeeEEeecCCCCcccchHHHHHHHHhhhhccCCc Q lcl|NC_019419. 110 MLVADEVDISPVGSGKETIRETVGPITMEYDPATIGSGVSFPWWDGLLGHWIDSDGN 166 (176) Q Consensus 110 ~~~~~~~~~~~~~~~~~v~rekVG~I~veY~~~~~~~~~~~~~v~~lL~~~l~~~g~ 166 (176) -....+-+ . +..+. ..+..| ...++.||++|-.--|+ T Consensus 152 a~~Y~~Re-------------~-----~~~g~-~~~~lP-~~~v~~Ll~pyRvP~~~ 188 (188) T protein:vir:14 152 AWAYDHRE-------------L-----YSDGQ-PMGEMP-GGYSDVLLNPITVPPRF 188 (188) T ss_pred HHHHhccc-------------c-----ccccc-cccccc-HHHHHHHhhccCCCCCC Confidence 76554311 0 00110 011122 23488999999876665 No 23 >protein:vir:80036 Length: 111 # NCBI annotation: gp10 # Family: family:all:3186 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468714;genbank:gi:157325294;genbank:GeneID:5601727 Probab=77.86 E-value=0.054 Score=27.53 Aligned_cols=108 Identities=19% Similarity=0.113 Sum_probs=59.7 Q ss_pred ccccHHHHHHHHHhcCCccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCccccccccccccccccccc Q lcl|NC_019419. 11 MYGDPQTFVDYAAARGVEVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRDTLTDVVAVVPAGQ 90 (176) Q Consensus 11 ~Y~sva~adaY~a~rg~~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~~~~~~~~~~~g~ 90 (176) |=-|++....-+..-. .+++|+-+.++-.|..-.+ . T Consensus 1 m~ttv~~vkl~a~~L~-~~sDDsl~~~I~dA~~e~~-------------a------------------------------ 36 (111) T protein:vir:80 1 MKTDVSKLKLTASSLA-SVSDDSLQVHIDDSYLEVQ-------------E------------------------------ 36 (111) T ss_pred CchhHHHHHHhhHhhc-CCChHHHHHHHHHHHHHhh-------------c------------------------------ Confidence 4555666665555433 4777776666665544432 2 Q ss_pred ccccccccHHHHH-HHHHHHHHHhcCcCCCcCCCcceeEEEEcCeeEEeecCCC-CcccchHHHHHHHHhhhhc-cCCcc Q lcl|NC_019419. 91 IVDFASIPIAVEQ-AVYRLAMLVADEVDISPVGSGKETIRETVGPITMEYDPAT-IGSGVSFPWWDGLLGHWID-SDGNA 167 (176) Q Consensus 91 ~i~~d~IP~~V~~-A~~eLA~~~~~~~~~~~~~~~~~v~rekVG~I~veY~~~~-~~~~~~~~~v~~lL~~~l~-~~g~~ 167 (176) ++-|++++. ||--||+.++.- .+..|++|+||.++-+|..-+ ...-.+-+|=+-+++=|=. .+|++ T Consensus 37 ----~gFp~~~~e~a~rYLa~HLat~-------~~~~v~sE~V~~Lk~~Y~~~~~~~~l~~s~wGq~Y~rL~k~~~~gs~ 105 (111) T protein:vir:80 37 ----KGFPEKFEERANRYLAAHLATL-------ANKNVKSEAVGSLKREYYEVKGDSGLLSTEYGQEYARLLKEANGGSG 105 (111) T ss_pred ----CCCChhHHHHHHHHHHHHHHHh-------cCCCCchhhhhhHHHHhhhcccccccccchhHHHHHHHHHHhcCCcc Confidence 334445554 444578766532 134589999999999997432 2233344666666655522 23333 Q ss_pred cceeeee Q lcl|NC_019419. 168 AGNFDVF 174 (176) Q Consensus 168 ~~~~~v~ 174 (176) ...+ |+ T Consensus 106 ~~~v-Vv 111 (111) T protein:vir:80 106 ISMV-VV 111 (111) T ss_pred ceee-eC Confidence 3323 44 No 24 >protein:vir:103283 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277463;genbank:gi:71834105;genbank:GeneID:3562394 Probab=74.17 E-value=0.07 Score=26.92 Aligned_cols=89 Identities=19% Similarity=0.127 Sum_probs=42.9 Q ss_pred ccCCcccCcccccccccccccccccccccccccccHHHHHHHHHHHH---------------------HHhcCcC----- Q lcl|NC_019419. 64 PRINYQSDGKPVRDTLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAM---------------------LVADEVD----- 117 (176) Q Consensus 64 PR~G~~~~g~~~~~~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~---------------------~~~~~~~----- 117 (176) =|.=+ =+...+|++|++|=+|+|- .++.-+. T Consensus 1 mR~l~-----------------------P~f~~vpdevi~~wid~A~lFVC~~~fg~~~~~Al~lytlHLm~~dga~k~e 57 (125) T protein:vir:10 1 MRTLY-----------------------PPLKSQPDDVLNAWIEVAKLFICLDKFGDKQVQALAFYTLHLLSQDIALKTE 57 (125) T ss_pred Ccccc-----------------------chhhccCHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhcccccccc Confidence 22111 1234579999988777652 1111111 Q ss_pred CCc-CCCcceeEEEE-cCeeEEeecCCCCccc----chHHHHHHHHhhhhccCCcccceee--eecC Q lcl|NC_019419. 118 ISP-VGSGKETIRET-VGPITMEYDPATIGSG----VSFPWWDGLLGHWIDSDGNAAGNFD--VFRG 176 (176) Q Consensus 118 ~~~-~~~~~~v~rek-VG~I~veY~~~~~~~~----~~~~~v~~lL~~~l~~~g~~~~~~~--v~RG 176 (176) ... .+..+++++-+ .|+++++|++.+.... -.-|| -.|+..|+.-.+++|+.+- +.|| T Consensus 58 ~~~~~~~s~r~~s~slsGE~Sit~~~~s~d~s~~~L~~T~w-Gk~~~~L~k~~~GgFaL~T~~~~~~ 123 (125) T protein:vir:10 58 NDSSQTSSERVKSYSLSGEYTISYDTSTAAASSSNLEESSW-GKLYIDLMRLKVGRWGLITSGGSRC 123 (125) T ss_pred cccccccccceeeeeeccceEeecccccccccccccccCch-HHHHHHHHHhcCCceeeeccccccC Confidence 111 12345688888 5999999987544321 01122 2444444443334444433 2233 No 25 >protein:vir:107702 Length: 136 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003900;genbank:gi:45686316;genbank:GeneID:2773042 Probab=71.60 E-value=0.19 Score=24.49 Aligned_cols=122 Identities=11% Similarity=0.072 Sum_probs=58.7 Q ss_pred ecCccccccHHHHHHHHHhcCCccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCcccccccccccccc Q lcl|NC_019419. 6 IGVNTMYGDPQTFVDYAAARGVEVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRDTLTDVVAV 85 (176) Q Consensus 6 ~~~~~~Y~sva~adaY~a~rg~~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~~~~~~~~ 85 (176) .+-+|.-..||.+..-+-+.. .+|++.-+..+--|..||....| T Consensus 1 ~~~~~~~~~ve~fR~l~PeF~-dvPde~i~~~~d~A~~~v~~~~~----------------------------------- 44 (136) T protein:vir:10 1 MNQETLIAVVEQMRKLVPALR-KVPDETLYAWVEMAELFVCQKTF----------------------------------- 44 (136) T ss_pred CCchHHHHHHHHHHHhccccc-cCCHHHHHHHHHHHHHhhcCCCC----------------------------------- Confidence 344443444554444443322 34766667777788888853222 Q ss_pred cccccccccccccHHHHHHHHHHHHHHhcCcCC------CcCCCcceeEE-EEcCeeEEeecCCCCcccc----hHHHHH Q lcl|NC_019419. 86 VPAGQIVDFASIPIAVEQAVYRLAMLVADEVDI------SPVGSGKETIR-ETVGPITMEYDPATIGSGV----SFPWWD 154 (176) Q Consensus 86 ~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~------~~~~~~~~v~r-ekVG~I~veY~~~~~~~~~----~~~~v~ 154 (176) .+...+|..-+++.++.-+.. ......+++++ ..+|+++|.|++.+....- .-||= T Consensus 45 ------------Gk~y~~al~lltAHLl~l~~~~~~~~~~~~~~s~rv~ssat~GevSVS~a~~s~~~s~~WL~~TpyG- 111 (136) T protein:vir:10 45 ------------KDAYVKALALYALHLAFLDGALKGEDEDLESYSRRVTSFSLSGEFSQTFGEVTKNQSGDMMLSTPWG- 111 (136) T ss_pred ------------hhHHHHHHHHHHHHHHhcccccccccccccccccceehheeccceeEeeccccCchhhHhhhcCHHH- Confidence 234556666666655521111 11122344555 5699999999865432211 11333 Q ss_pred HHHhhhhccCCcccceee-eecC Q lcl|NC_019419. 155 GLLGHWIDSDGNAAGNFD-VFRG 176 (176) Q Consensus 155 ~lL~~~l~~~g~~~~~~~-v~RG 176 (176) +++..|+.-.+.+|+-+- +.+| T Consensus 112 q~y~aL~k~~~gGf~l~t~~~~~ 134 (136) T protein:vir:10 112 KMFEQLKARRRGRFALMTGLRGG 134 (136) T ss_pred HHHHHHHhhcccchhhhhccccc Confidence 333334332333444443 3344 No 26 >protein:vir:101559 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958112;genbank:gi:41057658;genbank:GeneID:2716816 Probab=71.35 E-value=0.13 Score=25.52 Aligned_cols=104 Identities=18% Similarity=0.133 Sum_probs=47.4 Q ss_pred cccccCCcccCcccccccccccccccccccccccccccHHHHHHHHHHHH-HHhcCcCCC-------------------- Q lcl|NC_019419. 61 DAWPRINYQSDGKPVRDTLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAM-LVADEVDIS-------------------- 119 (176) Q Consensus 61 laWPR~G~~~~g~~~~~~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~-~~~~~~~~~-------------------- 119 (176) ++=|- -.++|++..-+..++.= ..+|...++.-++.|- .++++.+.. T Consensus 1 ~~~~~-----------~~v~Fd~a~FR~~fPeF-a~~pd~~i~~~~~~A~~~~l~n~~~s~~~~~~~r~~ll~LltAHll 68 (158) T protein:vir:10 1 MSTPP-----------YRITFDPAGFIAEYPEF-ATVPTPRLQAMFNQAQAALLDNTGGSPVTDDNVLRELFNMLVAHLL 68 (158) T ss_pred CCCCC-----------ceEEcChHHHHHhCccc-ccCCHHHHHHHHHhhhheeeCCcccccccChHHHHHHHHHHHHHHH Confidence 33332 12223222222222221 2267777766666662 222221111 Q ss_pred --------cCCCc--ceeEEEEcCeeEEeecCCCCcc--------cchH-HHHHHHHhhh-----hccCCcccceeeeec Q lcl|NC_019419. 120 --------PVGSG--KETIRETVGPITMEYDPATIGS--------GVSF-PWWDGLLGHW-----IDSDGNAAGNFDVFR 175 (176) Q Consensus 120 --------~~~~~--~~v~rekVG~I~veY~~~~~~~--------~~~~-~~v~~lL~~~-----l~~~g~~~~~~~v~R 175 (176) ....+ ++++++++|.|||.|+.+..+. .+.| --.-+|++.+ +.+++..-.-+|=++ T Consensus 69 ~L~~~~~~g~~~g~vG~vsSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fw~L~~~~~~Gg~v~Gg~pe~~~~r~~g 148 (158) T protein:vir:10 69 TLFSAAPTSANSRPPGRLSSATEGTVSSSFEYALPQGSAIAPWYNQTQYGAMFWMATAHYRSARYMVSGGSGIGTARAYG 148 (158) T ss_pred HHhhhhhcccccCcccceeeeeeCceeEEeeeccccCCchhhhhhcCHHHHHHHHHHHhhCccccccccCCcccceeccC Confidence 11112 5789999999999997543221 2223 1222444433 444444445566666 Q ss_pred C Q lcl|NC_019419. 176 G 176 (176) Q Consensus 176 G 176 (176) | T Consensus 149 ~ 149 (158) T protein:vir:10 149 Q 149 (158) T ss_pred c Confidence 6 No 27 >protein:vir:3639 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705634;genbank:gi:23752319;genbank:GeneID:955737 Probab=71.35 E-value=0.13 Score=25.52 Aligned_cols=104 Identities=18% Similarity=0.133 Sum_probs=47.4 Q ss_pred cccccCCcccCcccccccccccccccccccccccccccHHHHHHHHHHHH-HHhcCcCCC-------------------- Q lcl|NC_019419. 61 DAWPRINYQSDGKPVRDTLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAM-LVADEVDIS-------------------- 119 (176) Q Consensus 61 laWPR~G~~~~g~~~~~~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~-~~~~~~~~~-------------------- 119 (176) ++=|- -.++|++..-+..++.= ..+|...++.-++.|- .++++.+.. T Consensus 1 ~~~~~-----------~~v~Fd~a~FR~~fPeF-a~~pd~~i~~~~~~A~~~~l~n~~~s~~~~~~~r~~ll~LltAHll 68 (158) T protein:vir:36 1 MSTPP-----------YRITFDPAGFIAEYPEF-ATVPTPRLQAMFNQAQAALLDNTGGSPVTDDNVLRELFNMLVAHLL 68 (158) T ss_pred CCCCC-----------ceEEcChHHHHHhCccc-ccCCHHHHHHHHHhhhheeeCCcccccccChHHHHHHHHHHHHHHH Confidence 33332 12223222222222221 2267777766666662 222221111 Q ss_pred --------cCCCc--ceeEEEEcCeeEEeecCCCCcc--------cchH-HHHHHHHhhh-----hccCCcccceeeeec Q lcl|NC_019419. 120 --------PVGSG--KETIRETVGPITMEYDPATIGS--------GVSF-PWWDGLLGHW-----IDSDGNAAGNFDVFR 175 (176) Q Consensus 120 --------~~~~~--~~v~rekVG~I~veY~~~~~~~--------~~~~-~~v~~lL~~~-----l~~~g~~~~~~~v~R 175 (176) ....+ ++++++++|.|||.|+.+..+. .+.| --.-+|++.+ +.+++..-.-+|=++ T Consensus 69 ~L~~~~~~g~~~g~vG~vsSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fw~L~~~~~~Gg~v~Gg~pe~~~~r~~g 148 (158) T protein:vir:36 69 TLFSAAPTSANSRPPGRLSSATEGTVSSSFEYALPQGSAIAPWYNQTQYGAMFWMATAHYRSARYMVSGGSGIGTARAYG 148 (158) T ss_pred HHhhhhhcccccCcccceeeeeeCceeEEeeeccccCCchhhhhhcCHHHHHHHHHHHhhCccccccccCCcccceeccC Confidence 11112 5789999999999997543221 2223 1222444433 444444445566666 Q ss_pred C Q lcl|NC_019419. 176 G 176 (176) Q Consensus 176 G 176 (176) | T Consensus 149 ~ 149 (158) T protein:vir:36 149 Q 149 (158) T ss_pred c Confidence 6 No 28 >protein:vir:99848 Length: 172 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164077;genbank:gi:56692609;genbank:GeneID:3192576 Probab=68.86 E-value=0.026 Score=29.29 Aligned_cols=115 Identities=10% Similarity=0.083 Sum_probs=55.4 Q ss_pred ccccccHHHHHHHHHhc---------CC----------------------------ccChhhhhHHHHHHHHHHhhh-cc Q lcl|NC_019419. 9 NTMYGDPQTFVDYAAAR---------GV----------------------------EVTLSDATRHLTVVNDFLNGI-NW 50 (176) Q Consensus 9 ~~~Y~sva~adaY~a~r---------g~----------------------------~~~~~~ke~aLi~As~~ld~~-~~ 50 (176) -+||+|++++.+-+-.+ +. .+.+.--+.||..|+..||++ .- T Consensus 1 ~~mYaT~~dl~~r~g~~el~qLt~~~~~~~~~~~l~d~~~~~~~~~~~~~~~~~~g~~d~~~i~~Al~dA~aeIDgYL~~ 80 (172) T protein:vir:99 1 MAVYITLPELAERPGAEELSQAATPRPLQAVDSELLDALLRGLPVDRWTPEEIEVGHATVEVINSAVSDAQGYIDGFLQR 80 (172) T ss_pred CcccccHHHHHhhcCHHHHHHHhccccccCCHHHHHHHhhcchhhhhcccccccccccCHHHHHHHHHHHHHHHHHHHhc Confidence 78999999988776321 10 112222478999999999984 21 Q ss_pred cceeCCCccccccccCCcccCcccccccccccccccccccccccccccHHHHHHHHHHHHHHhcCcC----C-Cc-CCCc Q lcl|NC_019419. 51 IGEPADQTGIDAWPRINYQSDGKPVRDTLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVD----I-SP-VGSG 124 (176) Q Consensus 51 ~G~r~~~~Q~laWPR~G~~~~g~~~~~~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~----~-~~-~~~~ 124 (176) ++ +.+|...+|.-|+..||-+|.+.+-..- . .. .... T Consensus 81 R~-------------------------------------Y~lPL~~vP~~L~~~a~dIArY~L~~~r~~~~~~~e~v~~r 123 (172) T protein:vir:99 81 RG-------------------------------------YSLPLAKRYGVVTGWTRAIARYLLHQDRLGPGAEKDPIVRD 123 (172) T ss_pred cc-------------------------------------ccCCCcccchHHHHHHHHHHHHHHHhccCCcccCCHHHHHH Confidence 10 1245567999999999999996553211 0 11 1110 Q ss_pred --cee---EEEEcCeeEEeecCC---CCcccchH-----HHHHHHHhhh Q lcl|NC_019419. 125 --KET---IRETVGPITMEYDPA---TIGSGVSF-----PWWDGLLGHW 160 (176) Q Consensus 125 --~~v---~rekVG~I~veY~~~---~~~~~~~~-----~~v~~lL~~~ 160 (176) ..+ +...-|.++.-=... ..++.+.+ .+=..-|++| T Consensus 124 Y~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~v~~~~r~F~rd~L~gf 172 (172) T protein:vir:99 124 YRDALKFLQLIAEGKFSLGPDDPLTPPGGGVPQVLAPARTFSHDTLKDY 172 (172) T ss_pred HHHHHHHHHHHhcCccccCCCCCCCCCCCCceeeecCCCccChhhccCC Confidence 011 111124444321100 00111110 0011122222 No 29 >protein:vir:107756 Length: 147 # NCBI annotation: gp21 # Family: family:all:664 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024869;genbank:gi:48697511;genbank:GeneID:2948377 Probab=61.95 E-value=0.34 Score=23.13 Aligned_cols=126 Identities=13% Similarity=0.128 Sum_probs=61.0 Q ss_pred CcceeecCccccccHHHHHHHHHhcC--CccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCccccccc Q lcl|NC_019419. 1 MPAFFIGVNTMYGDPQTFVDYAAARG--VEVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRDT 78 (176) Q Consensus 1 ~~~~~~~~~~~Y~sva~adaY~a~rg--~~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~ 78 (176) |=..| |++++.+-+=+.. ..+|++.-+..|..|..+|+..+|. |.. + T Consensus 1 m~v~f--------d~~~Fr~~fPeFad~~~~pd~~i~~~l~~A~~~l~~~~~~-----------~~~-----~------- 49 (147) T protein:vir:10 1 MDHTL--------DITKFRALFPEFNNDVKYPDALLEQWYAVAGEYLGLTDYA-----------CGL-----N------- 49 (147) T ss_pred Cceec--------CHHHHHHhcccccCCccCCHHHHHHHHHHHHHhhccccCC-----------ccc-----C------- Confidence 43333 5777777665544 3578888889999999999876651 111 1 Q ss_pred ccccccccccccccccccccHHHHHHHHHHHHHHh--cCcCCCcCCCcceeEEEEcCeeEEeecCCCCc-------ccch Q lcl|NC_019419. 79 LTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVA--DEVDISPVGSGKETIRETVGPITMEYDPATIG-------SGVS 149 (176) Q Consensus 79 ~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~--~~~~~~~~~~~~~v~rekVG~I~veY~~~~~~-------~~~~ 149 (176) ...-.++.+-|+..++ +..........+++.++++|.|||.|+..... ..+. T Consensus 50 -------------------g~~~~~~l~Ll~AHll~l~~~~~~g~g~~G~v~Sas~G~VSVSy~~~~~~~~~~~w~~~T~ 110 (147) T protein:vir:10 50 -------------------GNTLDLALMQLTAHLMKSATILSSNKGAPMVMTSATIDKVSISTLAPPIKNGWQYWLSTTP 110 (147) T ss_pred -------------------hhhHHHHHHHHHHHHHHHHHhhccCCCcccceeeeeecceeeeeecCCCCCcchhhhhcCH Confidence 1122333333333221 11111222234578999999999999754221 1222 Q ss_pred HH-HHHHHHhhhhcc----CCcccc-eeeeecC Q lcl|NC_019419. 150 FP-WWDGLLGHWIDS----DGNAAG-NFDVFRG 176 (176) Q Consensus 150 ~~-~v~~lL~~~l~~----~g~~~~-~~~v~RG 176 (176) |- -.-+|++.+-.+ +|...+ -||=+=| T Consensus 111 YGq~y~~l~~~~~~Gg~vvgG~p~r~a~r~vgg 143 (147) T protein:vir:10 111 YGQMLWALLSMRSSGGFVYGGSPELSGYRRIGG 143 (147) T ss_pred HHHHHHHHHHhhCccceecCCCCccccccccCc Confidence 31 112333332111 111111 1222222 No 30 >protein:vir:78595 Length: 158 # NCBI annotation: BcepNY3gp07 # Family: family:all:664 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294844;genbank:gi:149882907;genbank:GeneID:5291066 Probab=61.33 E-value=0.35 Score=23.06 Aligned_cols=104 Identities=13% Similarity=0.060 Sum_probs=47.8 Q ss_pred cccccCCcccCcccccccccccccccccccccccccccHHHHHHHHHHHH-HHhcCc----------------------- Q lcl|NC_019419. 61 DAWPRINYQSDGKPVRDTLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAM-LVADEV----------------------- 116 (176) Q Consensus 61 laWPR~G~~~~g~~~~~~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~-~~~~~~----------------------- 116 (176) ++=|- -.++|++..-+..++.= ..+|...++.-++.|- .++++. T Consensus 1 ~~~~~-----------~~v~Fd~a~FR~~fPeF-a~~pd~~i~~~~~~A~~~~~~~~~~s~~~~~~~r~~ll~LltAHll 68 (158) T protein:vir:78 1 MSTPP-----------YRITFDPAGFIAEYPEF-ATVATPRLQAMFNQAQTALLDNTGGSPVTDDNVLRELFNMLVAHLL 68 (158) T ss_pred CCCCC-----------ceEEcChHHHHHhchhh-ccCCHHHHHHHHHHhhhhhcCCCccccccChhHHHHHHHHHHHHHH Confidence 33332 22233332222222221 2267777766666552 222210 Q ss_pred -----CCCc-C-CCcceeEEEEcCeeEEeecCCCCcc--------cchH-HHHHHHHhhh-----hccCCcccceeeeec Q lcl|NC_019419. 117 -----DISP-V-GSGKETIRETVGPITMEYDPATIGS--------GVSF-PWWDGLLGHW-----IDSDGNAAGNFDVFR 175 (176) Q Consensus 117 -----~~~~-~-~~~~~v~rekVG~I~veY~~~~~~~--------~~~~-~~v~~lL~~~-----l~~~g~~~~~~~v~R 175 (176) .... . ...+++++.++|.|||.|+.+..+. .+.| --.-+|++.+ +.+++..-.-+|=++ T Consensus 69 ~L~~~~~~~a~~g~~G~isSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fwal~~~~~~Ggy~~gg~pe~~~~r~~g 148 (158) T protein:vir:78 69 TLFGATPTSANSRPPGRLSSAAEGTVSSSFEFKLPEGSAIAPWYNQTQYGAMFWMATARYRSARYMVSGGSGIGTARAYG 148 (158) T ss_pred HHhHhhhccccCCcccceeeeeecceEEEEeecCccccchhHHhhcCHHHHHHHHHHHHhcccccccccCCcccceeecC Confidence 0001 1 1124789999999999997543221 2223 1122444333 445544555566666 Q ss_pred C Q lcl|NC_019419. 176 G 176 (176) Q Consensus 176 G 176 (176) | T Consensus 149 ~ 149 (158) T protein:vir:78 149 Q 149 (158) T ss_pred c Confidence 6 No 31 >protein:vir:106739 Length: 158 # NCBI annotation: gp08 # Family: family:all:664 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944316;genbank:gi:38638615;genbank:GeneID:2657368 Probab=61.33 E-value=0.35 Score=23.06 Aligned_cols=104 Identities=13% Similarity=0.060 Sum_probs=47.8 Q ss_pred cccccCCcccCcccccccccccccccccccccccccccHHHHHHHHHHHH-HHhcCc----------------------- Q lcl|NC_019419. 61 DAWPRINYQSDGKPVRDTLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAM-LVADEV----------------------- 116 (176) Q Consensus 61 laWPR~G~~~~g~~~~~~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~-~~~~~~----------------------- 116 (176) ++=|- -.++|++..-+..++.= ..+|...++.-++.|- .++++. T Consensus 1 ~~~~~-----------~~v~Fd~a~FR~~fPeF-a~~pd~~i~~~~~~A~~~~~~~~~~s~~~~~~~r~~ll~LltAHll 68 (158) T protein:vir:10 1 MSTPP-----------YRITFDPAGFIAEYPEF-ATVATPRLQAMFNQAQTALLDNTGGSPVTDDNVLRELFNMLVAHLL 68 (158) T ss_pred CCCCC-----------ceEEcChHHHHHhchhh-ccCCHHHHHHHHHHhhhhhcCCCccccccChhHHHHHHHHHHHHHH Confidence 33332 22233332222222221 2267777766666552 222210 Q ss_pred -----CCCc-C-CCcceeEEEEcCeeEEeecCCCCcc--------cchH-HHHHHHHhhh-----hccCCcccceeeeec Q lcl|NC_019419. 117 -----DISP-V-GSGKETIRETVGPITMEYDPATIGS--------GVSF-PWWDGLLGHW-----IDSDGNAAGNFDVFR 175 (176) Q Consensus 117 -----~~~~-~-~~~~~v~rekVG~I~veY~~~~~~~--------~~~~-~~v~~lL~~~-----l~~~g~~~~~~~v~R 175 (176) .... . ...+++++.++|.|||.|+.+..+. .+.| --.-+|++.+ +.+++..-.-+|=++ T Consensus 69 ~L~~~~~~~a~~g~~G~isSas~G~VSVS~d~~~~t~~~~~~W~~~T~YG~~fwal~~~~~~Ggy~~gg~pe~~~~r~~g 148 (158) T protein:vir:10 69 TLFGATPTSANSRPPGRLSSAAEGTVSSSFEFKLPEGSAIAPWYNQTQYGAMFWMATARYRSARYMVSGGSGIGTARAYG 148 (158) T ss_pred HHhHhhhccccCCcccceeeeeecceEEEEeecCccccchhHHhhcCHHHHHHHHHHHHhcccccccccCCcccceeecC Confidence 0001 1 1124789999999999997543221 2223 1122444333 445544555566666 Q ss_pred C Q lcl|NC_019419. 176 G 176 (176) Q Consensus 176 G 176 (176) | T Consensus 149 ~ 149 (158) T protein:vir:10 149 Q 149 (158) T ss_pred c Confidence 6 No 32 >protein:vir:107864 Length: 150 # NCBI annotation: gp36 # Family: family:all:348 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024709;genbank:gi:48696946;genbank:GeneID:2845952 Probab=52.90 E-value=0.54 Score=22.04 Aligned_cols=118 Identities=7% Similarity=0.059 Sum_probs=53.9 Q ss_pred cccccHHHHHHHHHhc--------CC---------ccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCc Q lcl|NC_019419. 10 TMYGDPQTFVDYAAAR--------GV---------EVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDG 72 (176) Q Consensus 10 ~~Y~sva~adaY~a~r--------g~---------~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g 72 (176) -.|+|.+++.+.+... -. ++.++--++||..|+..||++ .+. T Consensus 1 M~Y~T~~Dl~~r~ge~el~~Ltd~~~~g~~~~~~~~~d~~~i~~Al~dA~~eIDgY--L~~------------------- 59 (150) T protein:vir:10 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESSVRQAEEIVDAH--LRG------------------- 59 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccCccccchhhcCHHHHHHHHHHHHHHHHHH--Hhh------------------- Confidence 3499999999887432 11 122222478999999999984 111 Q ss_pred ccccccccccccccccccccccccccHHHHHHHHHHHHHHhcCc-----CCCc-CCCc--ceeE---EEEcCeeEEeecC Q lcl|NC_019419. 73 KPVRDTLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEV-----DISP-VGSG--KETI---RETVGPITMEYDP 141 (176) Q Consensus 73 ~~~~~~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~-----~~~~-~~~~--~~v~---rekVG~I~veY~~ 141 (176) .+.+|...+|..|+..||-+|.+.+-.. .... .... ..++ ...-|.++..=.. T Consensus 60 ----------------RY~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~ 123 (150) T protein:vir:10 60 ----------------RYNLPLSPVPTVIKDVTVNLARHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPS 123 (150) T ss_pred ----------------hccCCcccccHHHHHHHHHHHHHHHHhcccccCCCCHHHHHHHHHHHHHHHHHhcCcccCCCCC Confidence 1123446699999999999998655321 1111 1100 0111 1111544442211 Q ss_pred CC---CcccchHHHHHHHHhhhhccCCcccceeeeecC Q lcl|NC_019419. 142 AT---IGSGVSFPWWDGLLGHWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 142 ~~---~~~~~~~~~v~~lL~~~l~~~g~~~~~~~v~RG 176 (176) .. .++.+. +.++..-|+ =+=.|| T Consensus 124 ~~~~~~~~~~~-----------v~~~~r~f~-r~~l~g 149 (150) T protein:vir:10 124 GPATPEPGEMK-----------VRARRRQFD-ADLLER 149 (150) T ss_pred CCCCCCCceee-----------eecCCCccC-hhhccC Confidence 00 001110 111110000 011123 No 33 >protein:vir:104344 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398973;genbank:gi:81343957;genbank:GeneID:3778876 Probab=51.11 E-value=0.59 Score=21.84 Aligned_cols=118 Identities=15% Similarity=0.137 Sum_probs=49.3 Q ss_pred CccccccHHHHHHHHHhcCCccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCcccccccccccccccc Q lcl|NC_019419. 8 VNTMYGDPQTFVDYAAARGVEVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRDTLTDVVAVVP 87 (176) Q Consensus 8 ~~~~Y~sva~adaY~a~rg~~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~~~~~~~~~~ 87 (176) -|+ .=+|....-+-+ -..+|++.-+.-+--|-.|| T Consensus 1 ~~~--~~~e~~R~l~P~-f~kvpdevI~~wielA~lfV------------------------------------------ 35 (132) T protein:vir:10 1 MND--AILAFMRSLVPA-LKAVDDESINVWIDLARLYV------------------------------------------ 35 (132) T ss_pred Cch--HHHHHHHHhcch-hhcCChHHHHHHHHHHHHHH------------------------------------------ Confidence 000 001111000000 00223333233222333333 Q ss_pred cccccccccccHHHHHHHHHHHHHHhcCcC-CCcCCCcceeEEEEc------CeeEEeecCCCCccc--chHHHHHHHHh Q lcl|NC_019419. 88 AGQIVDFASIPIAVEQAVYRLAMLVADEVD-ISPVGSGKETIRETV------GPITMEYDPATIGSG--VSFPWWDGLLG 158 (176) Q Consensus 88 ~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~-~~~~~~~~~v~rekV------G~I~veY~~~~~~~~--~~~~~v~~lL~ 158 (176) ..+..+++...|.--+|+.++..+. ......+.+.-+++| |+++++|++.+.... ..-||- .|+. T Consensus 36 -----c~~~~g~~~~~AlaL~taHLm~~dga~k~en~~~~t~S~rvaS~Sl~Ge~Sisf~~~sa~~s~L~~tp~G-kl~~ 109 (132) T protein:vir:10 36 -----CADKFGNDADRAVGLYALHLMLSDGAFKGENEGLETYSRRMASYSLSGEFSITYDNQSAIQGDLSSSSWG-RMYK 109 (132) T ss_pred -----HhhcCchhHHHHHHHHHHHHhhccccccccccchhhhhhhhhhhcccCceeeecccccccccccccCcHH-HHHH Confidence 2234455556666556665554333 223333444555555 999999976443211 112444 6666 Q ss_pred hhhccCCccccee--eeecC Q lcl|NC_019419. 159 HWIDSDGNAAGNF--DVFRG 176 (176) Q Consensus 159 ~~l~~~g~~~~~~--~v~RG 176 (176) .|+.-.+++||-+ -.+|| T Consensus 110 ~L~k~~~GgfgL~t~~~~~~ 129 (132) T protein:vir:10 110 ALLRKKGGGFGLITSAAGGG 129 (132) T ss_pred HHHHhccCccccccccCcCC Confidence 6766444455533 23333 No 34 >protein:vir:79640 Length: 134 # NCBI annotation: gp38 # Family: family:all:5122 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285527;genbank:gi:148734510;genbank:GeneID:5219998 Probab=48.60 E-value=0.66 Score=21.56 Aligned_cols=117 Identities=13% Similarity=0.106 Sum_probs=55.4 Q ss_pred CccccccHHHHHHHHHhcCCccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCcccccccccccccccc Q lcl|NC_019419. 8 VNTMYGDPQTFVDYAAARGVEVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRDTLTDVVAVVP 87 (176) Q Consensus 8 ~~~~Y~sva~adaY~a~rg~~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~~~~~~~~~~ 87 (176) -|..= .+|.+..-+-+. ..+|++.-+..+..|..||..-. T Consensus 1 m~d~~-~ve~Fr~l~PeF-~~vpde~l~~~~~~A~~~i~~~~-------------------------------------- 40 (134) T protein:vir:79 1 MNDIE-ILEQIYKIAPAF-KKVDPELIQAWIELAKDFVCEKH-------------------------------------- 40 (134) T ss_pred CchHH-HHHHHHHhcccc-ccCCHHHHHHHHHHhhhhhcCCC-------------------------------------- Confidence 11111 133232222121 12577777788888888885322 Q ss_pred cccccccccccHHHHHHHHHHHHHHhcC------cCCCcCCCcceeEE-EEcCeeEEeecCCCCcc------cchHHHHH Q lcl|NC_019419. 88 AGQIVDFASIPIAVEQAVYRLAMLVADE------VDISPVGSGKETIR-ETVGPITMEYDPATIGS------GVSFPWWD 154 (176) Q Consensus 88 ~g~~i~~d~IP~~V~~A~~eLA~~~~~~------~~~~~~~~~~~v~r-ekVG~I~veY~~~~~~~------~~~~~~v~ 154 (176) ..+....|..-+++.++.- +........++|.. ...|+++|.|++.+... ++ ||= T Consensus 41 ---------~g~~~~~al~lltAHLl~l~~~~~~~g~~~~~~~grv~ssst~G~vSvS~a~ps~~~~~~Wl~~T--pYG- 108 (134) T protein:vir:79 41 ---------FKDKYFRAVALYTLHLMTLDGAMKQESESVESYSHRIASFSLTGEFSQTFSKVSDDTSGNTLRQT--PWG- 108 (134) T ss_pred ---------CChHHHHHHHHHHHHHHhhcccccccccccccccchhhhhhhhcceeeeccCcccchhHHHHhcC--HHH- Confidence 2233555655566554421 11111122234544 55899999998643322 22 233 Q ss_pred HHHhhhhccCCcccceeeeecC Q lcl|NC_019419. 155 GLLGHWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 155 ~lL~~~l~~~g~~~~~~~v~RG 176 (176) +++..|....+.+||-+.=.|| T Consensus 109 q~y~~L~k~~~GGf~~~t~~~~ 130 (134) T protein:vir:79 109 KMYEVLNKKKGGGFGLTTAFHR 130 (134) T ss_pred HHHHHHHHhhccchHhhhhccc Confidence 4555554433444554444444 No 35 >protein:vir:79074 Length: 150 # NCBI annotation: gp10, conserved hypothetical protein # Family: family:all:348 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111210;genbank:gi:134288797;genbank:GeneID:4960748 Probab=47.47 E-value=0.7 Score=21.43 Aligned_cols=118 Identities=8% Similarity=0.050 Sum_probs=54.4 Q ss_pred cccccHHHHHHHHHhc--------CC---------ccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCc Q lcl|NC_019419. 10 TMYGDPQTFVDYAAAR--------GV---------EVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDG 72 (176) Q Consensus 10 ~~Y~sva~adaY~a~r--------g~---------~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g 72 (176) -.|+|.+++.+.+.+. -. .+.++--++||..|+..||++ .+. T Consensus 1 M~Y~T~~Dl~~r~ge~~l~~Ltd~~~~~~~~~~~~~~d~~~i~~Al~dA~~~IDgy--L~~------------------- 59 (150) T protein:vir:79 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESAVRQAEEIVDAH--LRG------------------- 59 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccccccccccccCHHHHHHHHHHHHHHHHHH--Hhh------------------- Confidence 3499999999987432 11 122222478999999999984 111 Q ss_pred ccccccccccccccccccccccccccHHHHHHHHHHHHHHhcCc-----CCCcCCCc---ceeE---EEEcCeeEEeecC Q lcl|NC_019419. 73 KPVRDTLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEV-----DISPVGSG---KETI---RETVGPITMEYDP 141 (176) Q Consensus 73 ~~~~~~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~-----~~~~~~~~---~~v~---rekVG~I~veY~~ 141 (176) .+.+|...+|..|+..||-+|.+.+-.. ........ ..++ ...-|.++..=.. T Consensus 60 ----------------RY~lPl~~vP~~L~~~a~dIA~Y~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~ 123 (150) T protein:vir:79 60 ----------------RYNLPLSPVPTVIKDVTVNLARHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPS 123 (150) T ss_pred ----------------hccCCcccccHHHHHHHHHHHHHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCC Confidence 1223446799999999999998655321 11111100 0111 1111544442111 Q ss_pred C--C-CcccchHHHHHHHHhhhhccCCcccceeeeecC Q lcl|NC_019419. 142 A--T-IGSGVSFPWWDGLLGHWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 142 ~--~-~~~~~~~~~v~~lL~~~l~~~g~~~~~~~v~RG 176 (176) . . .++... +.++..-|+ =+=.|| T Consensus 124 ~~~~~~~~~~~-----------v~~~~r~f~-r~~l~g 149 (150) T protein:vir:79 124 GPATPEPGEMK-----------VRARRRQFD-ADLLER 149 (150) T ss_pred ccCCCCCCcee-----------eecCCCccC-hhhccC Confidence 0 0 001110 111111000 011233 No 36 >protein:vir:1993 Length: 141 # NCBI annotation: Hypothetical protein # Family: family:all:348 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050640;genbank:gi:9633527;genbank:GeneID:2636296 Probab=47.10 E-value=0.71 Score=21.39 Aligned_cols=122 Identities=11% Similarity=0.104 Sum_probs=57.6 Q ss_pred cccccHHHHHHHHHhc---------CC--ccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCccccccc Q lcl|NC_019419. 10 TMYGDPQTFVDYAAAR---------GV--EVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRDT 78 (176) Q Consensus 10 ~~Y~sva~adaY~a~r---------g~--~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~ 78 (176) -.|+|.+++.+.+-.. .. .++++--++||..|+..||++ .+.| T Consensus 1 M~Y~T~~Dl~~~~ge~~l~~Lt~d~~~~g~~d~~~i~~Al~dA~~eIdgy--L~~R------------------------ 54 (141) T protein:vir:19 1 MNYATVNDLCARYTRTRLDILTRPKTADGQPDDAVAEQALADASAFIDGY--LAAR------------------------ 54 (141) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcCCCCccccCHHHHHHHHHHHHHHHHHH--Hhhc------------------------ Confidence 3399999999887432 11 122222478999999999984 1111 Q ss_pred ccccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcC-CCc--cee---EEEEcCeeEEeecCCCCcccchHHH Q lcl|NC_019419. 79 LTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPV-GSG--KET---IRETVGPITMEYDPATIGSGVSFPW 152 (176) Q Consensus 79 ~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~-~~~--~~v---~rekVG~I~veY~~~~~~~~~~~~~ 152 (176) +.+|...+|.-|+..+|-+|.+.+.+...... ... ..+ +...-|.++.--........+.-. T Consensus 55 -----------Y~lPl~~~P~~L~~~a~dIA~Y~L~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~- 122 (141) T protein:vir:19 55 -----------FVLPLTVVPSLLKRQCCVVAWFYLNESQPTEQITATYRDTVRWLEQVRDGKTDPGVESRTAASPEGED- 122 (141) T ss_pred -----------ccCCccccchHHHHHHHHHHHHHHhcCCCChHHHHHHHHHHHHHHHHhcCccccCCCCCCCCCCCCCc- Confidence 12344679999999999999976643321110 000 001 111125554432111100000000 Q ss_pred HHHHHhhhhccCCcccceeeeecC Q lcl|NC_019419. 153 WDGLLGHWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 153 v~~lL~~~l~~~g~~~~~~~v~RG 176 (176) ...+.++..-|+- + .|| T Consensus 123 -----~~~~~~~~r~f~r-~-~~G 139 (141) T protein:vir:19 123 -----LVQVQSDPPVFSR-K-QKG 139 (141) T ss_pred -----eeEeecCCcccCc-c-ccc Confidence 0012222222221 1 355 No 37 >protein:vir:103846 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938245;genbank:gi:38229150;genbank:GeneID:2648161 Probab=45.41 E-value=0.77 Score=21.20 Aligned_cols=111 Identities=8% Similarity=0.070 Sum_probs=53.9 Q ss_pred cccccHHHHHHHHHh--------cCC----ccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCcccccc Q lcl|NC_019419. 10 TMYGDPQTFVDYAAA--------RGV----EVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRD 77 (176) Q Consensus 10 ~~Y~sva~adaY~a~--------rg~----~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~ 77 (176) -.|+|.+++.+.+.. +.. .+.++--++||..|+..||++ .+. T Consensus 1 M~Y~T~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~~eIdgy--L~~------------------------ 54 (138) T protein:vir:10 1 MSYCTQADLVEQYGEASIRQLSDRVNKPATTIDPAVVAQAIADADAEIDLH--LHA------------------------ 54 (138) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccCCCcCccCHHHHHHHHHHHHHHHHHH--Hhh------------------------ Confidence 349999999987532 222 122222478999999999984 111 Q ss_pred cccccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCc--CCCc--cee---EEEEcCeeEEeecCCCCcccchH Q lcl|NC_019419. 78 TLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISP--VGSG--KET---IRETVGPITMEYDPATIGSGVSF 150 (176) Q Consensus 78 ~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~--~~~~--~~v---~rekVG~I~veY~~~~~~~~~~~ 150 (176) .+.+|.+.+|.-|+..||-+|.+.+-...... .... ..+ +...-|.++.--...... T Consensus 55 -----------RY~lPl~~vP~~L~~~a~dIA~Y~L~~~~~~~e~~~~rY~~Ai~~L~~Ia~G~~~Lg~~~~~~~----- 118 (138) T protein:vir:10 55 -----------RYQLPLAQVPVVLKRVACVLAFANLHTQVKDDHPAILDAERKRKLLGGISSGKLSLALTSSGTP----- 118 (138) T ss_pred -----------cccCCccccchHHHHHHHHHHHHHHhcCCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCccc----- Confidence 12234467999999999999997664321110 0000 000 011114433322111000 Q ss_pred HHHHHHHhhhhccCCcccceeeee-------cC Q lcl|NC_019419. 151 PWWDGLLGHWIDSDGNAAGNFDVF-------RG 176 (176) Q Consensus 151 ~~v~~lL~~~l~~~g~~~~~~~v~-------RG 176 (176) ..+.+.+.|. |. T Consensus 119 --------------~~~~~~~~~~s~~r~Fg~d 137 (138) T protein:vir:10 119 --------------APIANTVQISSQRNDFGGT 137 (138) T ss_pred --------------CCCCCceeeecCCccCCCC Confidence 0011112221 11 No 38 >protein:vir:79253 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469165;genbank:gi:157835007;genbank:GeneID:5648834 Probab=43.30 E-value=0.85 Score=20.97 Aligned_cols=111 Identities=12% Similarity=0.075 Sum_probs=53.1 Q ss_pred cccccHHHHHHHHHhc--------CC----ccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCcccccc Q lcl|NC_019419. 10 TMYGDPQTFVDYAAAR--------GV----EVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRD 77 (176) Q Consensus 10 ~~Y~sva~adaY~a~r--------g~----~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~ 77 (176) -.|+|.+++.+-+.+. .. .++++--++||..|+..||++ .+. T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgY--L~~------------------------ 54 (138) T protein:vir:79 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLH--LHG------------------------ 54 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHH--Hhh------------------------ Confidence 3499999999865322 11 122222478999999999985 111 Q ss_pred cccccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcC--CCc--cee---EEEEcCeeEEeecCCCCcccchH Q lcl|NC_019419. 78 TLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPV--GSG--KET---IRETVGPITMEYDPATIGSGVSF 150 (176) Q Consensus 78 ~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~--~~~--~~v---~rekVG~I~veY~~~~~~~~~~~ 150 (176) .+.+|...+|.-|+..||-+|.+.+-+....+. ... ..+ +...-|.++.--..... T Consensus 55 -----------RY~lPl~~vP~~L~~~a~dIA~Y~L~~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~------ 117 (138) T protein:vir:79 55 -----------RYQLPLASVPTALKRIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGK------ 117 (138) T ss_pred -----------cccCCccccchHHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCc------ Confidence 112344679999999999999976643221110 000 000 00111333322111000 Q ss_pred HHHHHHHhhhhccCCcccceeeeecC Q lcl|NC_019419. 151 PWWDGLLGHWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 151 ~~v~~lL~~~l~~~g~~~~~~~v~RG 176 (176) ...+.+.+.+.=+ T Consensus 118 -------------~~~~~~~~~~~~~ 130 (138) T protein:vir:79 118 -------------PAPVANTVQISEG 130 (138) T ss_pred -------------CCCCCCceeeecC Confidence 0001112222211 No 39 >protein:vir:99222 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950460;genbank:gi:119953661;genbank:GeneID:4643082 Probab=43.30 E-value=0.85 Score=20.97 Aligned_cols=111 Identities=12% Similarity=0.075 Sum_probs=53.1 Q ss_pred cccccHHHHHHHHHhc--------CC----ccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCcccccc Q lcl|NC_019419. 10 TMYGDPQTFVDYAAAR--------GV----EVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRD 77 (176) Q Consensus 10 ~~Y~sva~adaY~a~r--------g~----~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~ 77 (176) -.|+|.+++.+-+.+. .. .++++--++||..|+..||++ .+. T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgY--L~~------------------------ 54 (138) T protein:vir:99 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLH--LHG------------------------ 54 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHH--Hhh------------------------ Confidence 3499999999865322 11 122222478999999999985 111 Q ss_pred cccccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcC--CCc--cee---EEEEcCeeEEeecCCCCcccchH Q lcl|NC_019419. 78 TLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPV--GSG--KET---IRETVGPITMEYDPATIGSGVSF 150 (176) Q Consensus 78 ~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~--~~~--~~v---~rekVG~I~veY~~~~~~~~~~~ 150 (176) .+.+|...+|.-|+..||-+|.+.+-+....+. ... ..+ +...-|.++.--..... T Consensus 55 -----------RY~lPl~~vP~~L~~~a~dIA~Y~L~~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~------ 117 (138) T protein:vir:99 55 -----------RYQLPLASVPTALKRIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGK------ 117 (138) T ss_pred -----------cccCCccccchHHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCc------ Confidence 112344679999999999999976643221110 000 000 00111333322111000 Q ss_pred HHHHHHHhhhhccCCcccceeeeecC Q lcl|NC_019419. 151 PWWDGLLGHWIDSDGNAAGNFDVFRG 176 (176) Q Consensus 151 ~~v~~lL~~~l~~~g~~~~~~~v~RG 176 (176) ...+.+.+.+.=+ T Consensus 118 -------------~~~~~~~~~~~~~ 130 (138) T protein:vir:99 118 -------------PAPVANTVQISEG 130 (138) T ss_pred -------------CCCCCCceeeecC Confidence 0001112222211 No 40 >protein:vir:486 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543093;swissprot:trembl:q8w626;genbank:gi:18249905;uniprot:Q8W626;genbank:GeneID:929692 Probab=41.41 E-value=0.84 Score=21.01 Aligned_cols=102 Identities=9% Similarity=-0.061 Sum_probs=55.2 Q ss_pred ccccHHHHHHHHHhcCCccChhh--hhHHHHHHHHHHhhhcccceeCCCcc---ccccccCCcccCcccccccccccccc Q lcl|NC_019419. 11 MYGDPQTFVDYAAARGVEVTLSD--ATRHLTVVNDFLNGINWIGEPADQTG---IDAWPRINYQSDGKPVRDTLTDVVAV 85 (176) Q Consensus 11 ~Y~sva~adaY~a~rg~~~~~~~--ke~aLi~As~~ld~~~~~G~r~~~~Q---~laWPR~G~~~~g~~~~~~~~~~~~~ 85 (176) |++|++++..|..--.- .++|+ -+..+..|.+++.+ |.|++....+ +-.||.. T Consensus 1 M~vtL~e~K~hLRid~D-~~ddD~li~~~i~aA~~~i~~--~~~r~l~~~~~~~~~~~~~~------------------- 58 (107) T protein:vir:48 1 MLLKEEEIKSHLRLDDG-LYSDGDFLKLLAQAVQKRTET--YLNRKLYAPEETIPEDDPDG------------------- 58 (107) T ss_pred CCCCHHHHHHHcCCCCC-CchhHHHHHHHHHHHHHHHHH--HhccccccccccccccCccc------------------- Confidence 99999999999864221 12233 24455566677753 5555443322 2233321 Q ss_pred cccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEcCeeEEeecCCCCcccchHHHHHHHHhhhhccCC Q lcl|NC_019419. 86 VPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETVGPITMEYDPATIGSGVSFPWWDGLLGHWIDSDG 165 (176) Q Consensus 86 ~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekVG~I~veY~~~~~~~~~~~~~v~~lL~~~l~~~g 165 (176) -.||..++.|+.-|+-....+- |.+..-+. ...| ..++.||.+|-.=+ T Consensus 59 ---------~~~~~~ik~Avlllv~~~Y~NR-------------e~v~~~~~-------~~iP--~~v~~LL~~yR~~~- 106 (107) T protein:vir:48 59 ---------MHLTDDVRLAMLMLVSHFYENR-------------STITDVEK-------LETP--MSFRWLAGPYRIVP- 106 (107) T ss_pred ---------cccchhHHHHHHHHHHHHHhhh-------------hhhccccc-------cccC--HHHHHHHHHhhccC- Confidence 2489999999998887555431 11110010 1112 13778888885432 Q ss_pred c Q lcl|NC_019419. 166 N 166 (176) Q Consensus 166 ~ 166 (176) . T Consensus 107 l 107 (107) T protein:vir:48 107 L 107 (107) T ss_pred C Confidence 2 No 41 >protein:vir:5256 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852761;genbank:gi:31544036;uniprot:Q7Y5T9;genbank:GeneID:2753553 Probab=32.70 E-value=1.4 Score=19.77 Aligned_cols=109 Identities=12% Similarity=0.108 Sum_probs=55.1 Q ss_pred CcceeecCccccccHHHHHHHHHhcCCccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCccccccccc Q lcl|NC_019419. 1 MPAFFIGVNTMYGDPQTFVDYAAARGVEVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRDTLT 80 (176) Q Consensus 1 ~~~~~~~~~~~Y~sva~adaY~a~rg~~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~~~ 80 (176) || |++++.+-+-+.. .+|++.-+..|-.|..+|+..+| | T Consensus 1 m~-----------t~~~Fr~~~PeF~-~~pd~~i~~~l~~A~~~l~~~~~----------------g------------- 39 (119) T protein:vir:52 1 MP-----------LTEDFLLRYTEFG-KTDAKRIGLFLSDAQAEVSKVQW----------------G------------- 39 (119) T ss_pred CC-----------cHHHHHHhhhhcc-CCCHHHHHHHHHHHHHhhCCcCC----------------c------------- Confidence 44 5567766665443 36888888999999999975444 1 Q ss_pred ccccccccccccccccccHHHHHHHHHHHHHHh--cCcCCCc-CCCcceeEEEEcCeeEEeecCCCCc-------ccchH Q lcl|NC_019419. 81 DVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVA--DEVDISP-VGSGKETIRETVGPITMEYDPATIG-------SGVSF 150 (176) Q Consensus 81 ~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~--~~~~~~~-~~~~~~v~rekVG~I~veY~~~~~~-------~~~~~ 150 (176) ..-.++.+.++...+ ....... ....++++++++|.|+|.|+..... ..+.| T Consensus 40 ------------------~~~~~~~~L~~AH~l~l~~~~~~~~g~~~g~v~S~s~G~vSvS~~~~~~~~~~~~w~~~T~Y 101 (119) T protein:vir:52 40 ------------------KLYDRGVMALTAHLLKLSADAEISGGAANRNLASESAGELSVSYTAPISANGSDDFYQLTAY 101 (119) T ss_pred ------------------hHHHHHHHHHHHHHHHhhhhhhccccccccceeeeeecceeeeeeccccCCcchhhhhcCHH Confidence 011223333444322 1111111 1223578999999999999754322 12222 Q ss_pred HHHHHHHhhhhccCCcccceeeee Q lcl|NC_019419. 151 PWWDGLLGHWIDSDGNAAGNFDVF 174 (176) Q Consensus 151 ~~v~~lL~~~l~~~g~~~~~~~v~ 174 (176) =. .+..|++--|. +| .|. T Consensus 102 --G~-~y~~L~r~~g~-Gg--~Va 119 (119) T protein:vir:52 102 --GQ-EYLRLRRLIGV-GV--MVA 119 (119) T ss_pred --HH-HHHHHHHHhcC-CC--cCC Confidence 21 22222221111 12 233 No 42 >protein:vir:96108 Length: 155 # NCBI annotation: hypothetical protein ORF025 # Family: family:all:664 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294442;genbank:gi:149408339;genbank:GeneID:5237227 Probab=32.64 E-value=1.4 Score=19.76 Aligned_cols=98 Identities=17% Similarity=0.082 Sum_probs=45.7 Q ss_pred ccccccccccccc--ccccccccHHHHHHHHHHHHHHhcCcCCC-c---------------------------------- Q lcl|NC_019419. 78 TLTDVVAVVPAGQ--IVDFASIPIAVEQAVYRLAMLVADEVDIS-P---------------------------------- 120 (176) Q Consensus 78 ~~~~~~~~~~~g~--~i~~d~IP~~V~~A~~eLA~~~~~~~~~~-~---------------------------------- 120 (176) ..++++..-+..+ +-+.+.+|...+++..+.|-..+++.... + T Consensus 1 ~v~fd~~~FR~~fPeFad~~~~Pd~~i~~~l~~A~~~l~~~~~~s~~~~g~~~~~~l~Ll~AH~l~L~~~~~~gaa~~g~ 80 (155) T protein:vir:96 1 MVIFDEQKFRTLFPEFADPASYPAVRLQLYFDIACEFISDRDSPYRILNGKALEACLYLLTAHLLSLSTMQVQGAAGGGV 80 (155) T ss_pred CcccCHHHHHHhCccccCcccCCHHHHHHHHHHHHHhhcCCCccccccChHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 2233333333333 33345678888888888876555422110 0 Q ss_pred ---CCCcceeEEEEcCeeEEeecCCCCcc-------cchHH-HHHHHHhhhhccCCc-----ccc-eeeeecC Q lcl|NC_019419. 121 ---VGSGKETIRETVGPITMEYDPATIGS-------GVSFP-WWDGLLGHWIDSDGN-----AAG-NFDVFRG 176 (176) Q Consensus 121 ---~~~~~~v~rekVG~I~veY~~~~~~~-------~~~~~-~v~~lL~~~l~~~g~-----~~~-~~~v~RG 176 (176) ....+.++++++|.|||.|+...... .+.|- -.=+|++.+- .+|+ -.+ -||=+=| T Consensus 81 ~~~g~~~G~vsSas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~f~~l~~~~~-~Gg~~vgG~per~~~r~vgg 152 (155) T protein:vir:96 81 TAGGTQGGFITSATVGEVSVAKLAPPAKNGWQWWLSGTPYGQELWALLSVKA-VGGFYIGGLPERRGFRKVGG 152 (155) T ss_pred ccccccccceeeceecceeeeeecCCCCCchhHHhhcCHHHHHHHHHHHHhc-ccccccCCCCccccccccCc Confidence 01124589999999999997643221 23231 1123333332 1221 111 1222222 No 43 >protein:vir:99570 Length: 153 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039799;genbank:gi:126011049;genbank:GeneID:4818265 Probab=31.77 E-value=1.5 Score=19.66 Aligned_cols=128 Identities=13% Similarity=0.053 Sum_probs=62.5 Q ss_pred CcceeecCccccccHHHHHHHHHhcC--CccChhhhhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCccccccc Q lcl|NC_019419. 1 MPAFFIGVNTMYGDPQTFVDYAAARG--VEVTLSDATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRDT 78 (176) Q Consensus 1 ~~~~~~~~~~~Y~sva~adaY~a~rg--~~~~~~~ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~ 78 (176) |...+. +++.+.+-+=+.. ..+|++.-+..|-.|..+|+..+| +.. T Consensus 1 m~~~~f-------d~~~Fr~~fPeFad~~~~Pd~~i~~~l~~A~~~l~~~~~-------------~~~------------ 48 (153) T protein:vir:99 1 MADPVY-------NDGLFRIMYPEFADQEKYPPEVIEIYYDTATLFITGSMF-------------PCA------------ 48 (153) T ss_pred CCcccC-------ChHHHHHhcccccCccccCHHHHHHHHHHHHHhhcCccc-------------ccc------------ Confidence 666552 4455555444333 357888889999999999965322 211 Q ss_pred ccccccccccccccccccccHHHHHHHHHHHHHHhc-------CcCCCcCCCcceeEEEEcCeeEEeecCCCCc------ Q lcl|NC_019419. 79 LTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVAD-------EVDISPVGSGKETIRETVGPITMEYDPATIG------ 145 (176) Q Consensus 79 ~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~-------~~~~~~~~~~~~v~rekVG~I~veY~~~~~~------ 145 (176) ..-.+...++.+-++..++. ..........+.++++++|.|||.|+..... T Consensus 49 ----------------~~~g~~~~~~l~Ll~AH~l~L~~~~~~~~~~a~~~~~G~vsSas~G~VSVSy~~~~~~~~~~~w 112 (153) T protein:vir:99 49 ----------------ALSGKQLVGALNMLTAHLMSLSMQRSQTALGATNDQGGYTLSATIGEVSVSKMAPPAKDGWEFW 112 (153) T ss_pred ----------------ccChHHHHHHHHHHHHHHHHHHhhhhcccccCCCccccceeeeeecceeeeeecCCCCCchhHh Confidence 01133445555555443321 1111222334678999999999999754332 Q ss_pred -ccchHH-HHHHHHhhhhcc----CCcccc-eeeeecC Q lcl|NC_019419. 146 -SGVSFP-WWDGLLGHWIDS----DGNAAG-NFDVFRG 176 (176) Q Consensus 146 -~~~~~~-~v~~lL~~~l~~----~g~~~~-~~~v~RG 176 (176) ..+.|- -.=+|++.+-.+ +|...+ -||=+=| T Consensus 113 ~~~T~YGq~fw~l~~~~~~Gg~v~gg~pe~~~~r~vgg 150 (153) T protein:vir:99 113 LAQTPYGQALWALLKMLSVGGFAIGGLPERTGFRKVGG 150 (153) T ss_pred hhcCHHHHHHHHHHHHhcccccccCCCCccccccccCc Confidence 122231 112333333111 111111 1222223 No 44 >protein:vir:4512 Length: 107 # NCBI annotation: unknown # Family: family:all:363 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599038;genbank:gi:19548996;genbank:GeneID:935218 Probab=25.28 E-value=2.1 Score=18.85 Aligned_cols=105 Identities=13% Similarity=-0.023 Sum_probs=56.2 Q ss_pred ccccHHHHHHHHHhcCCccChhh--hhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCccccccccccccccccc Q lcl|NC_019419. 11 MYGDPQTFVDYAAARGVEVTLSD--ATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKPVRDTLTDVVAVVPA 88 (176) Q Consensus 11 ~Y~sva~adaY~a~rg~~~~~~~--ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~~~~~~~~~~~~~~~ 88 (176) |++|++++..|..--+- .++|+ -+..+..|.+|+.. |.|++....+.. ||-.. . T Consensus 1 M~vtL~e~K~hLRId~D-~~ddD~lI~~~i~AA~~~i~~--~~~r~~~~~~~~-~~~~~-------------------~- 56 (107) T protein:vir:45 1 MLLKMEEIKLQLRLDDD-FSDEDELLELLGKAAQSRTEN--FLNRKLYATADD-RPADD-------------------P- 56 (107) T ss_pred CCCCHHHHHHHcCCCCC-CchhHHHHHHHHHHHHHHHHH--Hhcccccccccc-ccccc-------------------c- Confidence 99999999999864221 12223 24455567777764 667665544332 33211 0 Q ss_pred ccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEcCeeEEeecCCCCcccchHHHHHHHHhhhhccCCc Q lcl|NC_019419. 89 GQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETVGPITMEYDPATIGSGVSFPWWDGLLGHWIDSDGN 166 (176) Q Consensus 89 g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekVG~I~veY~~~~~~~~~~~~~v~~lL~~~l~~~g~ 166 (176) ..-.||..++.|+..|.-....+- +.+... +....| | .++.||.+|-.=+ . T Consensus 57 ----~~~~~~~~~~~AvLllv~~~Y~NR-------------e~~~~~-------~~~~lp-~-~v~~Ll~~~R~~~-~ 107 (107) T protein:vir:45 57 ----DGLVISDDVKLALLLLVSHFYENR-------------STVTDV-------EKMELP-M-SFNWLVAPYRLIP-L 107 (107) T ss_pred ----ccccCChhHHHHHHHHHHHHHhhh-------------hhcccc-------chhccc-h-HHHHHHHHHhhcC-C Confidence 012379999999998876544331 111000 001112 1 3677888874422 1 No 45 >protein:vir:98481 Length: 136 # NCBI annotation: ORF3 # Family: family:all:32440 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958281;genbank:gi:41057255;uniprot:Q38596;genbank:GeneID:2732865 Probab=23.56 E-value=2.3 Score=18.61 Aligned_cols=112 Identities=15% Similarity=0.118 Sum_probs=52.9 Q ss_pred CcceeecCccccccHHHHHHHHHhcCC-ccChhh-----hhHHHHHHHHHHhhhcccceeCCCccccccccCCcccCccc Q lcl|NC_019419. 1 MPAFFIGVNTMYGDPQTFVDYAAARGV-EVTLSD-----ATRHLTVVNDFLNGINWIGEPADQTGIDAWPRINYQSDGKP 74 (176) Q Consensus 1 ~~~~~~~~~~~Y~sva~adaY~a~rg~-~~~~~~-----ke~aLi~As~~ld~~~~~G~r~~~~Q~laWPR~G~~~~g~~ 74 (176) |++ |+|++|+.+ ||. .+++++ .+++|-.|++.|... ||.-+ T Consensus 1 M~~--------fAtv~Dl~~----rw~~~~~dee~~ra~~~~lL~dAS~~ir~~--------------~p~~~------- 47 (136) T protein:vir:98 1 MAA--------YATVEDYQA----RAAVTLPDGSPRRAQVEAYLDDASALMARH--------------IPTGH------- 47 (136) T ss_pred CCc--------cCCHHHHHH----HhccCCCCchhHHHHHHHHHHHHHHHHHHh--------------CCCCC------- Confidence 655 789999865 555 344443 245577899999653 44321 Q ss_pred ccccccccccccccccccccccccHHHHHHHHHHHHHHhcCcCCCcCCCcceeEEEEcCeeEEeecCCCCcccchH-HHH Q lcl|NC_019419. 75 VRDTLTDVVAVVPAGQIVDFASIPIAVEQAVYRLAMLVADEVDISPVGSGKETIRETVGPITMEYDPATIGSGVSF-PWW 153 (176) Q Consensus 75 ~~~~~~~~~~~~~~g~~i~~d~IP~~V~~A~~eLA~~~~~~~~~~~~~~~~~v~rekVG~I~veY~~~~~~~~~~~-~~v 153 (176) +.-|.-++.=+|....+.+.+++- ..+++.|..+-.+..+ +..-+ +-= T Consensus 48 --------------------~~~~~~~~~V~~~~V~R~~~np~G--------~~s~TaG~ys~s~t~~---G~Lylt~~E 96 (136) T protein:vir:98 48 --------------------TPDPGTLRAICVAVVRRVMANPGG--------YRQRTIGQYAETLGED---GGLYLTEDE 96 (136) T ss_pred --------------------CCChhHHHHHHHHHHHHHhhCCCC--------cccccchhHHHhhhcC---CCcccChHH Confidence 012455666677777676665442 2345566554333221 11111 111 Q ss_pred HHHHhhhhccCCc--ccceeeeecC Q lcl|NC_019419. 154 DGLLGHWIDSDGN--AAGNFDVFRG 176 (176) Q Consensus 154 ~~lL~~~l~~~g~--~~~~~~v~RG 176 (176) ..+|++=-..-|. +.++|..-+| T Consensus 97 ~~~Lg~~rqr~~~~d~a~si~~~~~ 121 (136) T protein:vir:98 97 KGQLQPPDQTAPDADAAYSLDLDPG 121 (136) T ss_pred HHHhCCCCCcccccccceecccCCC Confidence 1223221000011 2234555554 Done!