Query         lcl|NC_018836.1_cdsid_YP_006906193.1 [gene=phiHau3_16] [protein=hypothetical protein] [protein_id=YP_006906193.1] [location=9695..10198]
Match_columns 167
No_of_seqs    2 out of 5
Neff          1.4
Searched_HMMs 1612
Date          Thu Nov 7 13:27:30 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_13 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_13_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:1643 Length: 111 #  97.2 4.8E-06 3E-09    49.7   7.8  111 16-155    1-111 (111)
  2 protein:vir:9764 Length: 111 #  97.1 4.3E-06 2.7E-09  50.0   7.4  111 16-155    1-111 (111)
  3 protein:vir:94768 Length: 111   96.9 1E-05   6.4E-09  47.9   7.8  111 16-155    1-111 (111)
  4 protein:vir:9579 Length: 111 #  96.9 1.1E-05 6.8E-09  47.8   7.4  111 16-155    1-111 (111)
  5 protein:vir:98426 Length: 131   95.8 0.00011 6.7E-08  42.3   7.1  131 4-159     1-131 (131)
  6 protein:vir:7777 Length: 135 #  92.8 0.00048 3E-07    38.8   4.0  128 1-159     1-135 (135)
  7 protein:vir:99005 Length: 170   91.6 0.0024  1.5E-06  35.0   6.3  150 1-167     1-168 (170)
  8 protein:vir:8331 Length: 150 #  84.8 0.02    1.3E-05  29.9   6.7  138 1-166     11-150 (150)
  9 protein:vir:2348 Length: 148 #  81.8 0.0086  5.4E-06  31.9   3.4  129 1-159     1-148 (148)
 10 protein:vir:78292 Length: 148   81.5 0.0088  5.5E-06  31.8   3.4  125 1-159     1-148 (148)
 11 protein:vir:78481 Length: 148   81.5 0.0088  5.5E-06  31.8   3.4  125 1-159     1-148 (148)
 12 protein:vir:7994 Length: 134 #  77.5 0.033   2E-05    28.7   5.2  129 1-157     1-134 (134)
 13 protein:vir:104092 Length: 140  76.9 0.025   1.5E-05  29.4   4.4  131 1-162     1-140 (140)
 14 protein:vir:4231 Length: 139 #  75.5 0.024   1.5E-05  29.4   3.9  130 1-161     1-139 (139)
 15 protein:vir:105826 Length: 134  75.4 0.043   2.6E-05  28.1   5.2  129 1-157     1-134 (134)
 16 protein:vir:102609 Length: 134  75.4 0.043   2.6E-05  28.1   5.2  129 1-157     1-134 (134)
 17 protein:vir:2436 Length: 139 #  69.6 0.041   2.5E-05  28.2   3.7  130 1-161     1-139 (139)
 18 protein:vir:96894 Length: 140   62.2 0.34    0.00021
23.2 9.5 132 1-163 1-140 (140) 19 protein:vir:94096 Length: 141 58.3 0.42 0.00026 22.7 7.5 137 1-164 1-141 (141) 20 protein:vir:105892 Length: 141 58.3 0.42 0.00026 22.7 7.5 137 1-164 1-141 (141) 21 protein:vir:96260 Length: 141 58.3 0.42 0.00026 22.7 7.5 137 1-164 1-141 (141) 22 protein:vir:5979 Length: 134 # 47.8 0.69 0.00043 21.5 9.1 126 1-157 1-134 (134) 23 protein:vir:1892 Length: 121 # 40.2 0.98 0.00061 20.6 6.2 118 4-158 1-121 (121) 24 protein:vir:81066 Length: 118 40.2 0.98 0.00061 20.6 8.3 115 15-159 1-118 (118) 25 protein:vir:97070 Length: 118 39.9 1 0.00062 20.6 8.2 115 15-157 1-118 (118) 26 protein:vir:95961 Length: 145 34.7 1.3 0.00079 20.0 9.9 137 1-167 1-144 (145) 27 protein:vir:94794 Length: 145 34.6 1.3 0.00079 20.0 9.9 137 1-167 1-144 (145) 28 protein:vir:1244 Length: 145 # 34.2 1.3 0.00081 19.9 8.6 137 1-167 1-144 (145) 29 protein:vir:101655 Length: 134 33.8 1.3 0.00083 19.9 6.4 133 1-159 1-134 (134) 30 protein:vir:7860 Length: 134 # 33.8 1.3 0.00083 19.9 6.4 133 1-159 1-134 (134) 31 protein:vir:10368 Length: 118 33.1 1.4 0.00086 19.8 8.2 115 15-159 1-118 (118) 32 protein:vir:93602 Length: 114 31.3 1.5 0.00094 19.6 7.4 107 16-154 1-114 (114) 33 protein:vir:97325 Length: 145 30.6 1.6 0.00097 19.5 9.5 136 1-167 1-143 (145) 34 protein:vir:95111 Length: 145 29.2 1.7 0.001 19.4 9.9 137 1-167 1-144 (145) No 1 >protein:vir:1643 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695064;genbank:gi:23455755;genbank:GeneID:955492 Probab=97.16 E-value=4.8e-06 Score=49.73 Aligned_cols=111 Identities=19% Similarity=0.260 Sum_probs=85.7 Q ss_pred cHHHHHHHHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeeheeccCCCCchhhHHHHHHHHHH Q lcl|NC_018836. 
16 PVEDLLLAVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQSFCEDPDGDADAAILAEAVRVV 95 (167) Q Consensus 16 piedllla~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~~~edpdgd~da~~laeavr~~ 95 (167) =||-++..-|..-| ||.+.+=...+-.=.||.+.|-.+- .+.|.|++.++||+|++. +.+|+.||+.|+.+ T Consensus 1 miE~~i~~~L~~~l-~Vpv~~e~p~~~P~~FV~vErtGG~-----~~~~~~~~~lAVq~w~~S---~~eAa~La~~v~~~ 71 (111) T protein:vir:16 1 MIEIIIKNFLDTHL-SVSSFLEKKGEMPLSYILFEKTGSS-----KSNHLLSSTFAFQSYAPS---MYEAAKLNEQLKEV 71 (111) T ss_pred ChHHhHHHHHhhcC-CceeEeecCCCCCCceEEEEecCCc-----cccccccceEEEEecchh---HHHHHHHHHHHHHH Confidence 78999988888877 6766555555555569999998764 356999999999999975 67999999999999 Q ss_pred HHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccceeeceeeeeEee Q lcl|NC_018836. 96 LRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYDIEIR 155 (167) Q Consensus 96 l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~di~ir 155 (167) +.+. +++--|.++. .+|..+|-|=.+|--||-+-|||+.= T Consensus 72 l~~l-------~~~~~I~av~-------------~~s~ynf~d~~tk~~RYQav~~i~~~ 111 (111) T protein:vir:16 72 VERL-------IELNEISNVS-------------LNSDYNFTDTETKEYRYQAVFDINHY 111 (111) T ss_pred Hhhc-------cccccceeee-------------cCCCCcCCCCCCCCceEEEEEEEeeC Confidence 8443 2333344443 35667888888999999999999987 No 2 >protein:vir:9764 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795526;genbank:gi:28876278;genbank:GeneID:1257819 Probab=97.13 E-value=4.3e-06 Score=50.00 Aligned_cols=111 Identities=20% Similarity=0.316 Sum_probs=87.0 Q ss_pred cHHHHHHHHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeeheeccCCCCchhhHHHHHHHHHH Q lcl|NC_018836. 16 PVEDLLLAVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQSFCEDPDGDADAAILAEAVRVV 95 (167) Q Consensus 16 piedllla~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~~~edpdgd~da~~laeavr~~ 95 (167) =||-+++.-|...| ||.+..=.+.+-.=+||.+.|-.+ ..++|+|++.+|||+|++. 
+.+|+.||+.|+.+ T Consensus 1 mIE~~i~~yL~~~l-~vpv~~e~p~~~P~~FV~vEkTGG-----~~~~~~~~a~lAvQsyg~S---~~~AA~La~~V~~a 71 (111) T protein:vir:97 1 MIEVIIKKYLDEHL-DVPSFFEHQKDEPARFIILEKTSG-----AKQNHLLSSTFAFQSYAES---LYEAALLNDKVKQV 71 (111) T ss_pred ChhhhhhHHHhhhc-CceEEEeecCCCCCceEEEEeeCC-----ccccccccceEEEEecchh---HHHHHHHHHHHHHH Confidence 78989988888876 666555555555557999999876 5689999999999999974 78999999999988 Q ss_pred HHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccceeeceeeeeEee Q lcl|NC_018836. 96 LRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYDIEIR 155 (167) Q Consensus 96 l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~di~ir 155 (167) +.+. +++--+.++.+.|- -.|-|--++-.||-+-|||+.= T Consensus 72 ~~~l-------~~l~~i~~v~lns~-------------Ynf~d~~tk~yRYQa~~di~~~ 111 (111) T protein:vir:97 72 IEQL-------DVLPQVSGVHLNAD-------------YNFTDTATKRYRYQAVFDINHY 111 (111) T ss_pred hhhh-------ccCccceeeeeccc-------------ccCCCCCCCCccEEEEEEEeeC Confidence 7533 35556777877653 2345555788999999999987 No 3 >protein:vir:94768 Length: 111 # NCBI annotation: unknown # Family: family:all:1269 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996711;genbank:gi:45597426;genbank:GeneID:2769040 Probab=96.93 E-value=1e-05 Score=47.93 Aligned_cols=111 Identities=20% Similarity=0.273 Sum_probs=84.2 Q ss_pred cHHHHHHHHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeeheeccCCCCchhhHHHHHHHHHH Q lcl|NC_018836. 16 PVEDLLLAVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQSFCEDPDGDADAAILAEAVRVV 95 (167) Q Consensus 16 piedllla~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~~~edpdgd~da~~laeavr~~ 95 (167) =||-++..-|..-| ||.+..=...+-.=+||.+.|-.+- .+.|.|++.++||+|++. 
+.+|+.||+.|+.+ T Consensus 1 miE~~v~~~L~~~l-~vpv~~e~p~~~p~~FV~vErtGG~-----~~~~~~~~~lAVQ~~~~S---~~eAa~La~~v~~~ 71 (111) T protein:vir:94 1 MIEIIIKNFLDTHL-SVSSFLEKKGEMPLSYVLFEKTGSS-----KSNHLLSSTFAFQSYAPS---MYEAAKLNEQLKEV 71 (111) T ss_pred ChHHhHHHHHhhcC-CcceEeecCCCCCCceEEEEecCCc-----cccccccceEEEEecchh---HHHHHHHHHHHHHH Confidence 78999998888877 6665444444444559999998763 467899999999999975 57999999999998 Q ss_pred HHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccceeeceeeeeEee Q lcl|NC_018836. 96 LRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYDIEIR 155 (167) Q Consensus 96 l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~di~ir 155 (167) +.+. +++--|.++.+ ++...|-|=.+|--||-+-|||+.= T Consensus 72 ~~~l-------~~~~~i~~v~~-------------~s~Ynf~d~~tk~~RYQav~~i~~~ 111 (111) T protein:vir:94 72 VERL-------IELNEISNVSL-------------NSDYNFTDTETKEYRYQAVFDINHY 111 (111) T ss_pred Hhhc-------ccccccceeec-------------CCCcccCCCcCCCceEEEEEEEeeC Confidence 8543 23333444433 4556777778999999999999987 No 4 >protein:vir:9579 Length: 111 # NCBI annotation: gp45 # Family: family:all:1269 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862884;genbank:gi:32469476;genbank:GeneID:1461321 Probab=96.87 E-value=1.1e-05 Score=47.77 Aligned_cols=111 Identities=20% Similarity=0.274 Sum_probs=84.1 Q ss_pred cHHHHHHHHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeeheeccCCCCchhhHHHHHHHHHH Q lcl|NC_018836. 16 PVEDLLLAVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQSFCEDPDGDADAAILAEAVRVV 95 (167) Q Consensus 16 piedllla~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~~~edpdgd~da~~laeavr~~ 95 (167) =||.++..-|...| ||.+.+=.+.+--=+||.+.|-.+- .+.|.|++.++||+|++. 
+.+|+.||+.|+.+ T Consensus 1 miE~~v~~~L~~~l-~vpv~~~vp~~~P~~FV~vErtGG~-----~~~~~~~p~laVq~wg~S---~~~Aa~La~~v~~a 71 (111) T protein:vir:95 1 MIEIIINKYLDGHL-DVPSFFEHEAEAPDSFVIIQKTGGK-----ERNHSGSATFAFQSYAPT---MQKAAELNVKVKSA 71 (111) T ss_pred ChHHhHHHHhhhhc-CeeEEeecCCCCCCceEEEEeeCCc-----cccccccceEEEEecccc---HHHHHHHHHHHHHH Confidence 68888888887665 3656555555555689999998653 467889999999999975 68999999999988 Q ss_pred HHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccceeeceeeeeEee Q lcl|NC_018836. 96 LRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYDIEIR 155 (167) Q Consensus 96 l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~di~ir 155 (167) +.+. . +.++ |.++.+ .|+..|-|=.+|--||-+-|||++= T Consensus 72 ~~~l-~---~~~~---i~~v~~-------------~s~ynf~d~~tk~~RYQ~~~~i~~~ 111 (111) T protein:vir:95 72 VKGL-I---ELDS---ICGVHL-------------NSDYNFTDTETKQYRYQAVFDINYF 111 (111) T ss_pred Hhhh-h---cccc---cccccc-------------CCccccCCCCCCCceEEEEEEEEeC Confidence 7554 2 2222 333333 4677788888999999999999987 No 5 >protein:vir:98426 Length: 131 # NCBI annotation: ORF6 # Family: family:all:12105 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958284;genbank:gi:41057258;uniprot:Q38599;genbank:GeneID:2732810 Probab=95.77 E-value=0.00011 Score=42.33 Aligned_cols=131 Identities=25% Similarity=0.257 Sum_probs=89.8 Q ss_pred CchhHHHHhhcCcHHHHHHHHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeeheeccCCCCch Q lcl|NC_018836. 4 LPDEIKALAELSPVEDLLLAVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQSFCEDPDGDA 83 (167) Q Consensus 4 lp~~i~a~~e~spiedllla~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~~~edpdgd~ 83 (167) +|+-+-.=+| +=+-++|-..|-.-+-+|+|++=+-.+-.-.||.|.|-.-. 
.-+.-+|..+++||+|..+ ++ T Consensus 1 ~~~i~~pda~-~v~~~~lr~~l~a~~~~V~V~t~vP~~RP~rfV~VertgG~----~~~~~~Dr~~L~Vq~W~~t---~~ 72 (131) T protein:vir:98 1 MPPILMPDAV-AVIAGYLRAVLVARGVTVPVGSRVPSPRPARFVRIERIGGP----ANTVVTDRPRLDVHCWGSS---EE 72 (131) T ss_pred CCCccCCchh-HHHHHHHHHHHHhcCCceEecccCCCCCCceEEEEEecCCC----cCCccccceEEEEEecCCC---HH Confidence 4432211110 11233444444333456999999999999999999998322 1223479999999999976 56 Q ss_pred hhHHHHHHHHHHHHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccceeeceeeeeEeecCcC Q lcl|NC_018836. 84 DAAILAEAVRVVLRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYDIEIRKPRT 159 (167) Q Consensus 84 da~~laeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~di~ir~pr~ 159 (167) +|..||+.||..|-.+...+ |+. +-+|..-.||.-|-|=-+|.=||-.--|+++|---- T Consensus 73 ~A~~La~~vr~~ll~~~~~~------g~~-----------~~~~~e~~gpy~~PD~es~~~Ryq~tv~l~~r~~~~ 131 (131) T protein:vir:98 73 DAHDLMQLCRALLGAARGSH------GDT-----------VLARPATGGPQFLPDAETGAARWAFTLDITMRGHAL 131 (131) T ss_pred HHHHHHHHHHHHHhhccccc------chh-----------eeccccCCCCCcCCCCCCCCceeEEEEEEEeeeccC Confidence 89999999998776432211 322 345777778999999778999999999999987554 No 6 >protein:vir:7777 Length: 135 # NCBI annotation: gp22 # Family: family:all:2820 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817611;genbank:gi:29566041;genbank:GeneID:1259235 Probab=92.78 E-value=0.00048 Score=38.78 Aligned_cols=128 Identities=31% Similarity=0.455 Sum_probs=86.4 Q ss_pred CCCCchhHHHHhhcCcHHHHHHHHHHhhcCCcchhhhhhh--hccCCEEEEEecCccceecCcceecc----chheeehe Q lcl|NC_018836. 1 MAGLPDEIKALAELSPVEDLLLAVLRDGLPGIRVQSLIEK--DQHFPFVLVRRDPSFGLWAGDTRFTD----SARVVVQS 74 (167) Q Consensus 1 maglp~~i~a~~e~spiedllla~l~~~lp~i~~~sli~~--~q~fpfi~~rr~~s~g~wagd~rf~d----~a~~~i~~ 74 (167) |+-|| -|....+-.||..|||+++||-|+. -.+||.|.+||- |-.|--. 
-+--+|.- T Consensus 1 ~~~~P----------rvq~VV~PiLR~~L~~v~VgtWvediD~R~FPlinvRRv-------GG~R~p~~P~~~~~PViEm 63 (135) T protein:vir:77 1 MAQMP----------RVQAVVLPILRAALPDVKVGSWIEDIDYRTFPMVNVRRV-------GGPRHETRPDKLALPVIEM 63 (135) T ss_pred CCCCc----------hhHHHHHHHHhhhcCCceecccccccccccccceeeeec-------CCCCCCccchhhhcchhee Confidence 76666 3666788899999999999999986 468999999995 3333221 12222222 Q ss_pred eccCCCCchhhHHHHHHHHHHHHHhhhhhccc-ccccceeeeeeccCCccccccccccCceeeccccccceeeceeeeeE Q lcl|NC_018836. 75 FCEDPDGDADAAILAEAVRVVLRNAWLSQKVY-AGRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYDIE 153 (167) Q Consensus 75 ~~edpdgd~da~~laeavr~~l~~a~~~~~~~-~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~di~ 153 (167) -+-.-.|--....|-|-.--+|-+|-+-|+.- +|.=|-+.-+|.+ -|+..+=...||-.--..+- T Consensus 64 TaY~~~gL~etEqlYEdaL~vLYdAv~~Q~~TPaGYLhSi~ETmGA--------------tqfsS~f~dsWRvqGLI~LG 129 (135) T protein:vir:77 64 TAYGREGLVETEKLYEDALEALYDAVKHQTQTPAGYLHSIKETMGA--------------TQFSSLFQDSWRVQGLIQLG 129 (135) T ss_pred eeccccCccchHHHHHHHHHHHHHHHhhhccCcchhhhhHHHhhcc--------------ccCCcchhhhhHhhhhhhhc Confidence 22223444455566665556677777777655 4566777777754 35555556689998888889 Q ss_pred eecCcC Q lcl|NC_018836. 154 IRKPRT 159 (167) Q Consensus 154 ir~pr~ 159 (167) ||.||. T Consensus 130 vRpPr~ 135 (135) T protein:vir:77 130 VRPPRS 135 (135) T ss_pred ccCCCC Confidence 999998 No 7 >protein:vir:99005 Length: 170 # NCBI annotation: gp34 # Family: family:all:32655 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655899;genbank:gi:109521471;genbank:GeneID:4157970 Probab=91.59 E-value=0.0024 Score=34.98 Aligned_cols=150 Identities=23% Similarity=0.302 Sum_probs=101.6 Q ss_pred CCCC-chhHHHHhhcCcHHHHHHHHHHhhcCCcchhhhhhh--------------hccCCEEEEEecCccceecCcceec Q lcl|NC_018836. 1 MAGL-PDEIKALAELSPVEDLLLAVLRDGLPGIRVQSLIEK--------------DQHFPFVLVRRDPSFGLWAGDTRFT 65 (167) Q Consensus 1 magl-p~~i~a~~e~spiedllla~l~~~lp~i~~~sli~~--------------~q~fpfi~~rr~~s~g~wagd~rf~ 65 (167) |+-. 
|+--.- .++--|||||...+.+-+|..++++-|-- -|+=|-+-+-|-+. =+-+++-+ T Consensus 1 Ma~~lPDW~eg-da~l~v~dl~~q~~qkl~Pn~~v~~WipdDw~~~~~~~da~pt~~~~Ptl~~~R~~G---q~D~d~~~ 76 (170) T protein:vir:99 1 MADFLPDWWEG-PEYLDVEDLFAQHFQKLLPNVRVCHWIQPDWYIPTGFVDATPTYGTEPTLRLWRQPG---QRDDESTT 76 (170) T ss_pred CccccCCccCC-cHHHHHHHHHHHHHHHhCCCceeEeecCcccccccccccccccccccceEEEEecCC---ccchhhcc Confidence 8876 552222 45567999999999999999999999987 45667788888775 34566777 Q ss_pred cchheeeheeccCCCCchhhHHHHHHHHHHHHHhhhhhcccccccceeeeeeccCCccccccccccCceeecccccccee Q lcl|NC_018836. 66 DSARVVVQSFCEDPDGDADAAILAEAVRVVLRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWR 145 (167) Q Consensus 66 d~a~~~i~~~~edpdgd~da~~laeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwr 145 (167) |.+-+-+-.-.+. -+|+-.|-+-||+.|+--.+. -.++--|.+++ -+-+..| -|||+--.++..--- T Consensus 77 Da~~lq~~vvt~S---r~DS~~l~~fvr~im~a~~~g-~~~~~~~qvv~------i~sv~e~---~Gp~~iP~~~~D~r~ 143 (170) T protein:vir:99 77 DAPLLQFAAVTRS---HGDSIQLIEFVHTVMRALNNG-HKIKYNGQLVG------IKNVGLW---LGPQTIPEGPIDEFF 143 (170) T ss_pred chhhhhhhhhccC---hHHHHHHHHHHHHHHHhhhcC-CeeeeCCceEE------EEEeccc---cccccCCCCCccceE Confidence 7777666555553 468889999999877633222 22233333444 2334444 499998777766544 Q ss_pred eceeeeeEeecCcCCCCC---CCCC Q lcl|NC_018836. 146 YETQYDIEIRKPRTKPFP---LSTP 167 (167) Q Consensus 146 yet~~di~ir~pr~~pfp---lstp 167 (167) --.-|.|+||.||-+|-- |.|- T Consensus 144 V~atyevtv~~~r~~p~y~~~l~~~ 168 (170) T protein:vir:99 144 VPVTYKFTVAGKKLQPNYRKILDSL 168 (170) T ss_pred eeeEEEEEeecccCCchHHHHHHhh Confidence 456799999999998742 1111 No 8 >protein:vir:8331 Length: 150 # NCBI annotation: gp48 # Family: family:all:2795 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817899;genbank:gi:29566332;genbank:GeneID:1259527 Probab=84.79 E-value=0.02 Score=29.88 Aligned_cols=138 Identities=20% Similarity=0.271 Sum_probs=84.6 Q ss_pred CCCCchhHHHHhhcCc-HHHHHHHHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCc-ceeccchheeeheeccC Q lcl|NC_018836. 
1 MAGLPDEIKALAELSP-VEDLLLAVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGD-TRFTDSARVVVQSFCED 78 (167) Q Consensus 1 maglp~~i~a~~e~sp-iedllla~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd-~rf~d~a~~~i~~~~ed 78 (167) -.--|||-.-+.|-.| +|+++.+-|. |=.|+.+----+...||++++|.+.- |+ .--+|+|-|-|--||.| T Consensus 11 ~~~~~~~~~~~~~sapdae~~vv~wLs---p~~rvA~~R~~~dplPf~lv~rv~G~----d~pde~td~avvsv~~fg~~ 83 (150) T protein:vir:83 11 ETPEPPEPEILNEGPADAETFVVKWLG---EVYRAANTRRPGDPLPFLLIQQVAGK----ENLDESTADPVVQVDILCDK 83 (150) T ss_pred CCcccCCcccccCCCccHHHHHHHHhh---HHhhhhhcccCCCCCCeEEEEecCCC----CCcccccccceeeeeecccc Confidence 1122333333333333 6777776664 33455555555667999999999852 34 46789999999999999 Q ss_pred CCCchhhHHHHHHHHHHHHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccceeeceeeeeEeecCc Q lcl|NC_018836. 79 PDGDADAAILAEAVRVVLRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYDIEIRKPR 158 (167) Q Consensus 79 pdgd~da~~laeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~di~ir~pr 158 (167) -||++-|.-+|.-|---+. - +++.-.++-+|-.+ +.--||+| +.|+|= -|=||--||.+-. T Consensus 84 v~G~daA~~~ad~vH~RM~-~-l~r~tl~~Gtld~~-~v~~aP~~----------leY~dD--~vvrYt~RY~~G~---- 144 (150) T protein:vir:83 84 VDGEDAARDIKDRVHRRML-L-LGRYLEMDGTLDWM-KVFESPRR----------LEYTND--KVIRYTARYQFGQ---- 144 (150) T ss_pred ccchhhhhhhhhhHHHHHH-H-HhhhhccCCcchhh-hhhccccc----------cccCCC--eEEEeeeeeeccC---- Confidence 9999999999877643222 2 22455555555333 34445664 568775 6778888886521 Q ss_pred CCCCCCCC Q lcl|NC_018836. 159 TKPFPLST 166 (167) Q Consensus 159 ~~pfplst 166 (167) |+---. 
T Consensus 145 --~Y~~~~ 150 (150) T protein:vir:83 145 --TYEQIA 150 (150) T ss_pred --chhhcC Confidence 111111 No 9 >protein:vir:2348 Length: 148 # NCBI annotation: gp18 # Family: family:all:2820 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075285;genbank:gi:12657872;genbank:GeneID:920060 Probab=81.79 E-value=0.0086 Score=31.90 Aligned_cols=129 Identities=27% Similarity=0.350 Sum_probs=82.6 Q ss_pred CCCCchhHHHHhhcCcHHHHHHHHHHh------------hcCCcchhhhhhh--hccCCEEEEEecCccceecCcceecc Q lcl|NC_018836. 1 MAGLPDEIKALAELSPVEDLLLAVLRD------------GLPGIRVQSLIEK--DQHFPFVLVRRDPSFGLWAGDTRFTD 66 (167) Q Consensus 1 maglp~~i~a~~e~spiedllla~l~~------------~lp~i~~~sli~~--~q~fpfi~~rr~~s~g~wagd~rf~d 66 (167) |||- |--|....+-.||. .|+||++||-|+. -.+||.|.+||- |-.|--. T Consensus 1 ma~k---------~Prvq~VV~PiLR~~~~~~~~~~~vp~l~~v~VgtWvediD~R~FPlinvRRv-------GG~R~p~ 64 (148) T protein:vir:23 1 MAGK---------LPIVGEVVLPILRGHEDLSNPISTVPSLAGVHVGTWVEDIDSRTFPLITVRRV-------GGTRSPE 64 (148) T ss_pred CCcc---------cchhhhhhhhhhcccccccccccccccccCceecccccccccccccceeeeec-------CCCCCCc Confidence 6653 33455566677777 7999999999986 468999999995 3333221 Q ss_pred ----chheeeheeccCCCCchhhHHHHHHHHHHHHHhhhhhccc-ccccceeeeeeccCCccccccccccCceeeccccc Q lcl|NC_018836. 67 ----SARVVVQSFCEDPDGDADAAILAEAVRVVLRNAWLSQKVY-AGRGHITRVDMASAPRRATDWATATGPVQYADLPT 141 (167) Q Consensus 67 ----~a~~~i~~~~edpdgd~da~~laeavr~~l~~a~~~~~~~-~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~ 141 (167) -+.-+|.--+-.-.|--....|-|-.--+|-+|-+-|+.- +|.=|-+.-+|.++ |+..+=. T Consensus 65 ~P~~~~~PViEmTaY~~~gL~etEqlYEdaLevLYdAv~~Q~~TPaGYLhSi~ETmGAt--------------qfsS~f~ 130 (148) T protein:vir:23 65 HPTLFTQPVVEMTAYSAADLPTTEQMYEDALEVLYRAARLQTKTPAGYLHSVTETLGAS--------------HGPSPFD 130 (148) T ss_pred cchhhhcccceeeeccccCccchHHHHHHHHHHHHHHHhhhccCcchhhhhhhHhhccc--------------cCCcchh Confidence 1222222222223344445556555555666666666554 46667777777543 4555555 Q ss_pred cceeeceeeeeEeecCcC Q lcl|NC_018836. 
142 GVWRYETQYDIEIRKPRT 159 (167) Q Consensus 142 gvwryet~~di~ir~pr~ 159 (167) ..||-.--..+-||.||. T Consensus 131 dsWRvqGLI~LGvRpPr~ 148 (148) T protein:vir:23 131 RTWRVFGLIRLGIRPPKN 148 (148) T ss_pred hhhhhhhhhhhcccCCCC Confidence 689988888889999998 No 10 >protein:vir:78292 Length: 148 # NCBI annotation: gp19 # Family: family:all:2820 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491671;genbank:gi:157786495;genbank:GeneID:5625770 Probab=81.49 E-value=0.0088 Score=31.84 Aligned_cols=125 Identities=27% Similarity=0.359 Sum_probs=82.4 Q ss_pred CCCCchhHHHHhhcCcHHHHHHHHHHh------------hcCCcchhhhhhh--hccCCEEEEEecCccceecCcceecc Q lcl|NC_018836. 1 MAGLPDEIKALAELSPVEDLLLAVLRD------------GLPGIRVQSLIEK--DQHFPFVLVRRDPSFGLWAGDTRFTD 66 (167) Q Consensus 1 maglp~~i~a~~e~spiedllla~l~~------------~lp~i~~~sli~~--~q~fpfi~~rr~~s~g~wagd~rf~d 66 (167) |||- |--|....+-.||. .|+||++||-|+. -.+||.|.+||- |-.|--. T Consensus 1 ma~k---------~Prvq~VV~PiLR~~~~~~~~~~~vp~l~~v~VgtWvediD~R~FPlinvRRv-------GG~R~p~ 64 (148) T protein:vir:78 1 MAGK---------LPIVGEVVLPILRGHEDLSEPISTVPSLAGVHVGTWVEDIDSRTFPLITVRRV-------GGTRSPE 64 (148) T ss_pred CCcc---------cchhhhhhhhhhcccccccccccccccccCceecccccccccccccceeeeec-------CCCCCCc Confidence 6653 33455566667777 7999999999986 468999999995 3333221 Q ss_pred ----c----hheeeheeccCCCCchhhHHHHHHHHHHHHHhhhhhccc-ccccceeeeeeccCCccccccccccCceeec Q lcl|NC_018836. 67 ----S----ARVVVQSFCEDPDGDADAAILAEAVRVVLRNAWLSQKVY-AGRGHITRVDMASAPRRATDWATATGPVQYA 137 (167) Q Consensus 67 ----~----a~~~i~~~~edpdgd~da~~laeavr~~l~~a~~~~~~~-~~~gh~~~~~m~~~prr~~dwatatgpvqya 137 (167) - ..|..|+ -.|--....|-|-.--+|-+|-+-|+.- +|.=|-+.-+|.++ |+. 
T Consensus 65 ~P~~~~~PViEmTaY~----~~gL~etEqlYEdaLevLYdAv~~Q~~TPaGYLhSi~ETmGAt--------------qfs 126 (148) T protein:vir:78 65 HPTLFTQPVVEMTAYS----AADLPTTEQMYEDALEVLYRAARLQTKTPAGYLHSVTETLGAS--------------HGP 126 (148) T ss_pred cchhhhcchheeeecc----ccCCcchHHHHHHHHHHHHHHHhhhccCcchhhhhhhhhhccc--------------cCC Confidence 1 2334443 2344444556555555666666666554 46667777777543 455 Q ss_pred cccccceeeceeeeeEeecCcC Q lcl|NC_018836. 138 DLPTGVWRYETQYDIEIRKPRT 159 (167) Q Consensus 138 dlp~gvwryet~~di~ir~pr~ 159 (167) .+=...||-.--..+-||.||. T Consensus 127 S~f~dsWRvqGLI~LGvRpPr~ 148 (148) T protein:vir:78 127 SPFDRTWRVFGLIRLGIRPPKN 148 (148) T ss_pred cchhhhhhhhhhhhhcccCCCC Confidence 5555689988888889999998 No 11 >protein:vir:78481 Length: 148 # NCBI annotation: gp19 # Family: family:all:2820 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491590;genbank:gi:157786413;genbank:GeneID:5625633 Probab=81.49 E-value=0.0088 Score=31.84 Aligned_cols=125 Identities=27% Similarity=0.359 Sum_probs=82.4 Q ss_pred CCCCchhHHHHhhcCcHHHHHHHHHHh------------hcCCcchhhhhhh--hccCCEEEEEecCccceecCcceecc Q lcl|NC_018836. 1 MAGLPDEIKALAELSPVEDLLLAVLRD------------GLPGIRVQSLIEK--DQHFPFVLVRRDPSFGLWAGDTRFTD 66 (167) Q Consensus 1 maglp~~i~a~~e~spiedllla~l~~------------~lp~i~~~sli~~--~q~fpfi~~rr~~s~g~wagd~rf~d 66 (167) |||- |--|....+-.||. .|+||++||-|+. -.+||.|.+||- |-.|--. T Consensus 1 ma~k---------~Prvq~VV~PiLR~~~~~~~~~~~vp~l~~v~VgtWvediD~R~FPlinvRRv-------GG~R~p~ 64 (148) T protein:vir:78 1 MAGK---------LPIVGEVVLPILRGHEDLSEPISTVPSLAGVHVGTWVEDIDSRTFPLITVRRV-------GGTRSPE 64 (148) T ss_pred CCcc---------cchhhhhhhhhhcccccccccccccccccCceecccccccccccccceeeeec-------CCCCCCc Confidence 6653 33455566667777 7999999999986 468999999995 3333221 Q ss_pred ----c----hheeeheeccCCCCchhhHHHHHHHHHHHHHhhhhhccc-ccccceeeeeeccCCccccccccccCceeec Q lcl|NC_018836. 
67 ----S----ARVVVQSFCEDPDGDADAAILAEAVRVVLRNAWLSQKVY-AGRGHITRVDMASAPRRATDWATATGPVQYA 137 (167) Q Consensus 67 ----~----a~~~i~~~~edpdgd~da~~laeavr~~l~~a~~~~~~~-~~~gh~~~~~m~~~prr~~dwatatgpvqya 137 (167) - ..|..|+ -.|--....|-|-.--+|-+|-+-|+.- +|.=|-+.-+|.++ |+. T Consensus 65 ~P~~~~~PViEmTaY~----~~gL~etEqlYEdaLevLYdAv~~Q~~TPaGYLhSi~ETmGAt--------------qfs 126 (148) T protein:vir:78 65 HPTLFTQPVVEMTAYS----AADLPTTEQMYEDALEVLYRAARLQTKTPAGYLHSVTETLGAS--------------HGP 126 (148) T ss_pred cchhhhcchheeeecc----ccCCcchHHHHHHHHHHHHHHHhhhccCcchhhhhhhhhhccc--------------cCC Confidence 1 2334443 2344444556555555666666666554 46667777777543 455 Q ss_pred cccccceeeceeeeeEeecCcC Q lcl|NC_018836. 138 DLPTGVWRYETQYDIEIRKPRT 159 (167) Q Consensus 138 dlp~gvwryet~~di~ir~pr~ 159 (167) .+=...||-.--..+-||.||. T Consensus 127 S~f~dsWRvqGLI~LGvRpPr~ 148 (148) T protein:vir:78 127 SPFDRTWRVFGLIRLGIRPPKN 148 (148) T ss_pred cchhhhhhhhhhhhhcccCCCC Confidence 5555689988888889999998 No 12 >protein:vir:7994 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817348;genbank:gi:29565776;genbank:GeneID:1259015 Probab=77.52 E-value=0.033 Score=28.71 Aligned_cols=129 Identities=25% Similarity=0.269 Sum_probs=73.7 Q ss_pred CCCCchhHHHHhhcCcHHHHHHHHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeeheeccCCC Q lcl|NC_018836. 
1 MAGLPDEIKALAELSPVEDLLLAVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQSFCEDPD 80 (167) Q Consensus 1 maglp~~i~a~~e~spiedllla~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~~~edpd 80 (167) |+ .-+.-++|++|.+-|.- =-|+.+----+.-.||+.++|.+.=+ .+| --+|+|.|.|--|| + T Consensus 1 m~--------~~saP~~e~~vv~WLsp---~~~va~~R~~~~PLPf~~V~Rv~G~d--~~e-~~tD~avvsv~~fg---~ 63 (134) T protein:vir:79 1 MA--------TDSAPSIHRVLVAWLSP---LGKVSTRRLSGDPLPHRVVRRVDGRD--VPE-EGSDSAVVSVHTFA---A 63 (134) T ss_pred CC--------cccCCChheeeeeeccc---chhceeccCCCCCCCeEEEEEeCCCC--Ccc-ccccCceeEEEEee---C Confidence 32 23334555555444321 11222222335678999999998532 222 34699999999999 7 Q ss_pred CchhhHHHHHHHHHHHHHhhhh-hc-ccccccceeeeeec---cCCccccccccccCceeeccccccceeeceeeeeEee Q lcl|NC_018836. 81 GDADAAILAEAVRVVLRNAWLS-QK-VYAGRGHITRVDMA---SAPRRATDWATATGPVQYADLPTGVWRYETQYDIEIR 155 (167) Q Consensus 81 gd~da~~laeavr~~l~~a~~~-~~-~~~~~gh~~~~~m~---~~prr~~dwatatgpvqyadlp~gvwryet~~di~ir 155 (167) |++-|.-+|+-|---...--+. -+ ..-.-||..++|.. -+|+| +.|+|=| -+=||--||.+-.. T Consensus 64 ~~eaA~d~ad~vHrRM~kL~~~~~~~~~~~gG~~~~id~~~vl~~P~~----------~eY~dD~-~~vrytgRY~~g~~ 132 (134) T protein:vir:79 64 SDEAAENEAELTHQRMLELVVNPLTEIPVGGGVVARIDYARVLMKPVL----------VEYDDDG-HLVRHVGRYEIGVQ 132 (134) T ss_pred CHHHhhHHHHHHHHHHHHHhcccccceecCCceEEEeehhhhhcccee----------eeeCCCc-eEEEEeeeeeeccc Confidence 7888887777664332221111 11 33456999998864 35554 6788754 45577777765422 Q ss_pred cC Q lcl|NC_018836. 156 KP 157 (167) Q Consensus 156 ~p 157 (167) -- T Consensus 133 y~ 134 (134) T protein:vir:79 133 YI 134 (134) T ss_pred cC Confidence 11 No 13 >protein:vir:104092 Length: 140 # NCBI annotation: gp24 # Family: family:all:2820 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655603;genbank:gi:109392474;genbank:GeneID:4156960 Probab=76.94 E-value=0.025 Score=29.37 Aligned_cols=131 Identities=24% Similarity=0.357 Sum_probs=89.0 Q ss_pred CCCCchhHHHHhhcCcHHHHHHHHHHhhcC--Ccchhhhhhh--hccCCEEEEEecCccceecCcceec----cchheee Q lcl|NC_018836. 
1 MAGLPDEIKALAELSPVEDLLLAVLRDGLP--GIRVQSLIEK--DQHFPFVLVRRDPSFGLWAGDTRFT----DSARVVV 72 (167) Q Consensus 1 maglp~~i~a~~e~spiedllla~l~~~lp--~i~~~sli~~--~q~fpfi~~rr~~s~g~wagd~rf~----d~a~~~i 72 (167) |+-+|- |....+-.||.-++ |++++|-|+. -.+||.|.+||- |-.|-- --+.-+| T Consensus 1 m~~~Pr----------vq~VV~PiLR~~~~l~~v~v~tWv~diD~R~FPminvRRi-------GG~R~p~~P~~~~~PVi 63 (140) T protein:vir:10 1 MARMPR----------VQKVVAPILRNALTLDGVAITTWVPDVDYREFPMINIRRI-------GGIRNPKAPLLHTNPVI 63 (140) T ss_pred CCCCch----------hHHHHHhHhhcCCCccceeeeccccccccccccceeeeec-------CCCCCCccchhhhccee Confidence 777764 55667778888888 9999999986 468999999996 333321 1122233 Q ss_pred heeccCCCCchhhHHHHHHHHHHHHHhhhhhcccc-cccceeeeeeccCCccccccccccCceeeccccccceeeceeee Q lcl|NC_018836. 73 QSFCEDPDGDADAAILAEAVRVVLRNAWLSQKVYA-GRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYD 151 (167) Q Consensus 73 ~~~~edpdgd~da~~laeavr~~l~~a~~~~~~~~-~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~d 151 (167) .--+-.-.|--....|-|-.--+|-.|-..|..-| |.=|-+.-+|-+ -|+..|=...||-.--.. T Consensus 64 EmtaY~~~gLie~E~lYEdaLevLY~Av~~Q~qTPaGYLhSi~ETmGA--------------tqfsS~f~dsWRvqGLIr 129 (140) T protein:vir:10 64 EMSAYSTEGLIECEELYEDALEELYLAVQSQTQTPAGYLTSIFETMGA--------------TQFSSLYQDSWRVQGLIR 129 (140) T ss_pred eeeeccccCccchHHHHHHHHHHHHHHHhhhccCcchhhhhhhhhhcc--------------ccCCccchhhhhhhhhhh Confidence 22222334555556666666667778888887654 555666666644 466666667899988888 Q ss_pred eEeecCcCCCC Q lcl|NC_018836. 
152 IEIRKPRTKPF 162 (167) Q Consensus 152 i~ir~pr~~pf 162 (167) +-||+||.+-- T Consensus 130 LGvR~Pr~~t~ 140 (140) T protein:vir:10 130 LGVRRPRSNTS 140 (140) T ss_pred hcccCCCCCCC Confidence 99999998765 No 14 >protein:vir:4231 Length: 139 # NCBI annotation: predicted 15.7Kd protein # Family: family:all:2820 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039686;swissprot:sw:q05228;genbank:gi:9625452;uniprot:Q05228;genbank:GeneID:2942946 Probab=75.51 E-value=0.024 Score=29.45 Aligned_cols=130 Identities=27% Similarity=0.371 Sum_probs=89.8 Q ss_pred CCCCchhHHHHhhcCcHHHHHHHHHHh--hcCCcchhhhhhh--hccCCEEEEEecCccceecCcceecc----chheee Q lcl|NC_018836. 1 MAGLPDEIKALAELSPVEDLLLAVLRD--GLPGIRVQSLIEK--DQHFPFVLVRRDPSFGLWAGDTRFTD----SARVVV 72 (167) Q Consensus 1 maglp~~i~a~~e~spiedllla~l~~--~lp~i~~~sli~~--~q~fpfi~~rr~~s~g~wagd~rf~d----~a~~~i 72 (167) |+-+|- |....+-.||. .|+|++++|-|+. -.+||.|.+||- |-+|--. -+--+| T Consensus 1 m~~~Pr----------vq~Vv~PiLR~~P~l~~v~V~tWv~diD~R~FPminvRRi-------GG~R~p~~Pt~~~~PVi 63 (139) T protein:vir:42 1 MARMPR----------VQAVAAPILRSDPRLEGVTVTTWVPDVDFREFPMINLRRI-------GGTRNPNAPTLHTLPVV 63 (139) T ss_pred CCcCch----------hHHHHhhhhcCCccccCceeecccccCccccccceeeeec-------CCCCCCCCcchhcccee Confidence 777764 45566777888 8999999999986 468999999995 3344222 222233 Q ss_pred heeccCCCCchhhHHHHHHHHHHHHHhhhhhcccc-cccceeeeeeccCCccccccccccCceeeccccccceeeceeee Q lcl|NC_018836. 73 QSFCEDPDGDADAAILAEAVRVVLRNAWLSQKVYA-GRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYD 151 (167) Q Consensus 73 ~~~~edpdgd~da~~laeavr~~l~~a~~~~~~~~-~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~d 151 (167) .--+-.-+|--....|-|-.--+|-+|-..|..-| |.=|-+.-+|-+ -|+..+=...||-.--.. 
T Consensus 64 EmTaY~~~gLietE~lYEdaLevLYdAv~~q~qTPaGYLhSi~ETmGA--------------tqfsS~f~dsWRvqGLI~ 129 (139) T protein:vir:42 64 EMTAYTRDGLIETEELYETALEVLYDAVENGTQTPAGYLTSIFETMGA--------------TQFSSLYQDSWRIQGLIR 129 (139) T ss_pred eeeeccccCccchHHHHHHHHHHHHHHHhccccCcchhhhhhhhhhcc--------------ccCCccchhhhhhhhhhh Confidence 33333445666666777766677777777776554 555766666644 466666667899988888 Q ss_pred eEeecCcCCC Q lcl|NC_018836. 152 IEIRKPRTKP 161 (167) Q Consensus 152 i~ir~pr~~p 161 (167) +-||+||..- T Consensus 130 LGvR~Pr~t~ 139 (139) T protein:vir:42 130 LGVRRPRTTL 139 (139) T ss_pred hcccCCcccC Confidence 8999999865 No 15 >protein:vir:105826 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655771;genbank:gi:109522094;genbank:GeneID:4157634 Probab=75.42 E-value=0.043 Score=28.10 Aligned_cols=129 Identities=24% Similarity=0.261 Sum_probs=73.6 Q ss_pred CCCCchhHHHHhhcCcHHHHHHHHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeeheeccCCC Q lcl|NC_018836. 1 MAGLPDEIKALAELSPVEDLLLAVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQSFCEDPD 80 (167) Q Consensus 1 maglp~~i~a~~e~spiedllla~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~~~edpd 80 (167) |+ .-+.-++|++|.+-|.- =-|+.+----+.-.||+.++|.+.=+ .+| --+|+|.|.|--|| + T Consensus 1 m~--------~~saP~~e~~vv~WLsp---~~~va~~R~~~~PLPf~~V~Rv~G~d--~~e-~~tD~avvsv~~fg---~ 63 (134) T protein:vir:10 1 MA--------TDSAPSIHRVLVAWLSP---LGKVSTRRLSGDPLPHRVVRRVDGRD--VPE-EGSDVAVVSVHTFA---A 63 (134) T ss_pred CC--------cccCCChheeeeeeccc---chhceeccCCCCCCCeEEEEEeCCCC--Ccc-cccccceEEEEEee---C Confidence 32 23334555555444321 11222222335678999999998532 222 24699999999999 7 Q ss_pred CchhhHHHHHHHHHHHHHhhhh-hc-ccccccceeeeeec---cCCccccccccccCceeeccccccceeeceeeeeEee Q lcl|NC_018836. 
81 GDADAAILAEAVRVVLRNAWLS-QK-VYAGRGHITRVDMA---SAPRRATDWATATGPVQYADLPTGVWRYETQYDIEIR 155 (167) Q Consensus 81 gd~da~~laeavr~~l~~a~~~-~~-~~~~~gh~~~~~m~---~~prr~~dwatatgpvqyadlp~gvwryet~~di~ir 155 (167) |++-|.-+|+-|---...--+. -+ ..-.-||..++|.. -+|+| +.|+|=| -+=||--||.+-.. T Consensus 64 ~~eaA~d~ad~vHrRM~kL~~~~~~~~~~~gG~~~~id~~~v~~~P~~----------~eY~dD~-~~vrytgRY~~g~~ 132 (134) T protein:vir:10 64 SDEAAENEAELTHQRMLELVVNPLTEIPVGGGVVARIDYARVLMKPVL----------VEYDDDG-HLVRHVGRYEIGVQ 132 (134) T ss_pred CHHHhhHHHHHHHHHHHHHhcccccceecCCceEEEeehhhhhcccee----------eeeCCCc-eEEEEeeeeeeccc Confidence 7888887777664332221111 11 33456999998864 35654 6788754 45577777765422 Q ss_pred cC Q lcl|NC_018836. 156 KP 157 (167) Q Consensus 156 ~p 157 (167) -- T Consensus 133 y~ 134 (134) T protein:vir:10 133 YI 134 (134) T ss_pred cC Confidence 11 No 16 >protein:vir:102609 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655006;genbank:gi:109392196;genbank:GeneID:4157231 Probab=75.42 E-value=0.043 Score=28.10 Aligned_cols=129 Identities=24% Similarity=0.261 Sum_probs=73.6 Q ss_pred CCCCchhHHHHhhcCcHHHHHHHHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeeheeccCCC Q lcl|NC_018836. 1 MAGLPDEIKALAELSPVEDLLLAVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQSFCEDPD 80 (167) Q Consensus 1 maglp~~i~a~~e~spiedllla~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~~~edpd 80 (167) |+ .-+.-++|++|.+-|.- =-|+.+----+.-.||+.++|.+.=+ .+| --+|+|.|.|--|| + T Consensus 1 m~--------~~saP~~e~~vv~WLsp---~~~va~~R~~~~PLPf~~V~Rv~G~d--~~e-~~tD~avvsv~~fg---~ 63 (134) T protein:vir:10 1 MA--------TDSAPSIHRVLVAWLSP---LGKVSTRRLSGDPLPHRVVRRVDGRD--VPE-EGSDVAVVSVHTFA---A 63 (134) T ss_pred CC--------cccCCChheeeeeeccc---chhceeccCCCCCCCeEEEEEeCCCC--Ccc-cccccceEEEEEee---C Confidence 32 23334555555444321 11222222335678999999998532 222 24699999999999 7 Q ss_pred CchhhHHHHHHHHHHHHHhhhh-hc-ccccccceeeeeec---cCCccccccccccCceeeccccccceeeceeeeeEee Q lcl|NC_018836. 
81 GDADAAILAEAVRVVLRNAWLS-QK-VYAGRGHITRVDMA---SAPRRATDWATATGPVQYADLPTGVWRYETQYDIEIR 155 (167) Q Consensus 81 gd~da~~laeavr~~l~~a~~~-~~-~~~~~gh~~~~~m~---~~prr~~dwatatgpvqyadlp~gvwryet~~di~ir 155 (167) |++-|.-+|+-|---...--+. -+ ..-.-||..++|.. -+|+| +.|+|=| -+=||--||.+-.. T Consensus 64 ~~eaA~d~ad~vHrRM~kL~~~~~~~~~~~gG~~~~id~~~v~~~P~~----------~eY~dD~-~~vrytgRY~~g~~ 132 (134) T protein:vir:10 64 SDEAAENEAELTHQRMLELVVNPLTEIPVGGGVVARIDYARVLMKPVL----------VEYDDDG-HLVRHVGRYEIGVQ 132 (134) T ss_pred CHHHhhHHHHHHHHHHHHHhcccccceecCCceEEEeehhhhhcccee----------eeeCCCc-eEEEEeeeeeeccc Confidence 7888887777664332221111 11 33456999998864 35654 6788754 45577777765422 Q ss_pred cC Q lcl|NC_018836. 156 KP 157 (167) Q Consensus 156 ~p 157 (167) -- T Consensus 133 y~ 134 (134) T protein:vir:10 133 YI 134 (134) T ss_pred cC Confidence 11 No 17 >protein:vir:2436 Length: 139 # NCBI annotation: gp22 # Family: family:all:2820 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046838;genbank:gi:9630406;genbank:GeneID:1261583 Probab=69.61 E-value=0.041 Score=28.20 Aligned_cols=130 Identities=24% Similarity=0.385 Sum_probs=89.0 Q ss_pred CCCCchhHHHHhhcCcHHHHHHHHHHhh--cCCcchhhhhhh--hccCCEEEEEecCccceecCcceecc----chheee Q lcl|NC_018836. 1 MAGLPDEIKALAELSPVEDLLLAVLRDG--LPGIRVQSLIEK--DQHFPFVLVRRDPSFGLWAGDTRFTD----SARVVV 72 (167) Q Consensus 1 maglp~~i~a~~e~spiedllla~l~~~--lp~i~~~sli~~--~q~fpfi~~rr~~s~g~wagd~rf~d----~a~~~i 72 (167) |+-+|- |....+-.||.- |+||+++|-|+. -.+||.|.+||- |-.|--. -+--+| T Consensus 1 m~~~Pr----------vq~VV~PiLR~~P~la~v~v~tWv~diD~R~FPminvRRi-------GG~R~~~~P~~~~~PVi 63 (139) T protein:vir:24 1 MGAMPR----------VQSVVAPILREDPRLAGVTIVTWVPDIDFREFPMINIRRI-------GGIRNANAPKLHSLPVV 63 (139) T ss_pred CCCCch----------hhhHHhhhhhcCcccCCceeeeecccCccccccceeeeec-------CCCCCcccchhhcccee Confidence 777663 555677778887 999999999986 468999999996 3333221 122233 Q ss_pred heeccCCCCchhhHHHHHHHHHHHHHhhhhhcccc-cccceeeeeeccCCccccccccccCceeeccccccceeeceeee Q lcl|NC_018836. 
73 QSFCEDPDGDADAAILAEAVRVVLRNAWLSQKVYA-GRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYD 151 (167) Q Consensus 73 ~~~~edpdgd~da~~laeavr~~l~~a~~~~~~~~-~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~d 151 (167) .--+-.-+|--....|-|-.--+|-+|-+.|..-| |.=|-+.-+|-+ -|+..|=...||-.--.. T Consensus 64 EmtaY~~~gLie~E~lYEdaLevLYdAvk~q~qTPaGYL~Si~ETmGA--------------tqfsS~f~dsWRvqGLI~ 129 (139) T protein:vir:24 64 EMSAYSTDGLIECEELYETALEVLYDAVKNGTQTPAGYLSSIFETMGA--------------TQFSSLYQDSWRIQGLIR 129 (139) T ss_pred eeeeccccCccchHHHHHHHHHHHHHHHhccccCcchhhhhHhHhhcc--------------ccCCccchhhhhhhhhhh Confidence 32233345666667777766667777777776554 555766666644 466666667899988888 Q ss_pred eEeecCcCCC Q lcl|NC_018836. 152 IEIRKPRTKP 161 (167) Q Consensus 152 i~ir~pr~~p 161 (167) +-||.||..- T Consensus 130 LGvR~Pr~tt 139 (139) T protein:vir:24 130 LGVRTPRSTT 139 (139) T ss_pred hcccCCCCCC Confidence 8999999765 No 18 >protein:vir:96894 Length: 140 # NCBI annotation: ORF029 # Family: family:all:296 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240162;genbank:gi:66395835;genbank:GeneID:5133235 Probab=62.21 E-value=0.34 Score=23.16 Aligned_cols=132 Identities=12% Similarity=0.094 Sum_probs=73.0 Q ss_pred CC-CCchhHHHHhhcCcHHHHHHHHHH------hhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeeh Q lcl|NC_018836. 1 MA-GLPDEIKALAELSPVEDLLLAVLR------DGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQ 73 (167) Q Consensus 1 ma-glp~~i~a~~e~spiedllla~l~------~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~ 73 (167) |. ....++. .-+.+.|. ..++| ++-.-.-.+..||||.+-...+-- |..++-......+.|. 
T Consensus 1 Msms~~~aLq---------~Ai~a~L~ada~l~alvg~-~VyD~~P~~~~~Pyv~lG~~~~~~-~~~~~~~g~~~~~~i~ 69 (140) T protein:vir:96 1 MWVSVEPELT---------VQIYKRLKASPIINKFVGD-RVFDVVQEDAVYPYIVVGESNVTN-NESSTMMRETVGIVIH 69 (140) T ss_pred CCccHHHHHH---------HHHHHHhhcChhHHHhcCC-ccccCCccCCCCCEEEecCceeee-cCCCcccceEEEEEEE Confidence 43 2111111 11222211 22222 222223346789999986544332 3334444455566777 Q ss_pred eeccCCCCchhhHHHHHHHHHHHHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccc-eeeceeeee Q lcl|NC_018836. 74 SFCEDPDGDADAAILAEAVRVVLRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGV-WRYETQYDI 152 (167) Q Consensus 74 ~~~edpdgd~da~~laeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gv-wryet~~di 152 (167) .+.. -.|-+.|..++.||+.+|. +. ... ..+|++.+.+..+ |.-..|.|. |+.--+|.+ T Consensus 70 Vws~-~~g~~ea~~ia~av~~AL~-~~---l~l-~~~~lv~l~~~~~--------------~~~rd~dg~~~hgvl~~r~ 129 (140) T protein:vir:96 70 VYSQ-FATQYEAKQIISAIGYVLN-RP---IDI-ENYEFQFSRIDSQ--------------SVFPDIDRFTKHGTIRLLF 129 (140) T ss_pred EEEc-CCCHHHHHHHHHHHHHHhC-CC---ccC-CCCeEEEEEEeee--------------EEEecCCCceEEEEEEEEE Confidence 7764 4577889999999999994 43 333 3588888777655 222234443 666777888 Q ss_pred EeecCcCCCCC Q lcl|NC_018836. 153 EIRKPRTKPFP 163 (167) Q Consensus 153 ~ir~pr~~pfp 163 (167) +||+-..|--- T Consensus 130 ~v~~~~~~~~~ 140 (140) T protein:vir:96 130 KYRHIKKGEGV 140 (140) T ss_pred EEEeeccccCC Confidence 88876554333 No 19 >protein:vir:94096 Length: 141 # NCBI annotation: ORF031 # Family: family:all:296 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240240;genbank:gi:66395916;genbank:GeneID:5133265 Probab=58.30 E-value=0.42 Score=22.67 Aligned_cols=137 Identities=13% Similarity=0.174 Sum_probs=69.3 Q ss_pred CC-CCchhHHHHhhcCcHHHHHH--HHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeeheecc Q lcl|NC_018836. 1 MA-GLPDEIKALAELSPVEDLLL--AVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQSFCE 77 (167) Q Consensus 1 ma-glp~~i~a~~e~spiedlll--a~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~~~e 77 (167) |. 
+...++..+ |-.-|. ++|...+++ ++-.-.-.+..||||.+-...+- -|..++.......+.|..+.+ T Consensus 1 Msms~~~aLQ~A-----i~~~L~adaal~alvg~-rI~D~~P~~~~~PYv~lG~~~~~-~~~~~~~~g~~~~~ti~Vws~ 73 (141) T protein:vir:94 1 MWVSVEPELTNQ-----IYKRLISDPNINKLVDD-RVFDVVQDDAVYPYIVVGESNVT-NNESSATMRETVGIVIHVYSQ 73 (141) T ss_pred CccchhHHHHHH-----HHHHhhcChhhHhhcCC-ccccCCccCCCCCEEEeCCceee-ecCCCcccceEEEEEEEEEEc Confidence 43 222222211 011111 012222222 22223334678999998766553 355556666677777888875 Q ss_pred CCCCchhhHHHHHHHHHHHHHhhhhhcccccccceeeeeeccC-CccccccccccCceeeccccccceeeceeeeeEeec Q lcl|NC_018836. 78 DPDGDADAAILAEAVRVVLRNAWLSQKVYAGRGHITRVDMASA-PRRATDWATATGPVQYADLPTGVWRYETQYDIEIRK 156 (167) Q Consensus 78 dpdgd~da~~laeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~-prr~~dwatatgpvqyadlp~gvwryet~~di~ir~ 156 (167) . .|-+.|..++.||+.+|.+ .+ -...++++.+.+..+ -+|-+|=.|.-| --+|.+.|+. T Consensus 74 ~-~g~~eak~ia~av~~AL~~-~l----~l~~~~lv~l~~~~~~~~rd~dg~t~hg--------------vl~~ra~v~~ 133 (141) T protein:vir:94 74 F-ATQYEAKLILSAIGYVLNR-PI----EIDNYEFQFSRIDSQAVFPDIDRFTKHG--------------TIRLLFKYRH 133 (141) T ss_pred C-CCHHHHHHHHHHHHHHhcc-cc----cCCCceEEEEEEeeeeeeecCCCceEEE--------------EEEEEEEEEe Confidence 4 4778899999999999964 22 234567777777543 234444333322 2334444432 Q ss_pred CcCCCCCC Q lcl|NC_018836. 157 PRTKPFPL 164 (167) Q Consensus 157 pr~~pfpl 164 (167) -..|---. T Consensus 134 ~~~~~~~~ 141 (141) T protein:vir:94 134 KKKNEGVY 141 (141) T ss_pred ccccccCC Confidence 11111001 No 20 >protein:vir:105892 Length: 141 # NCBI annotation: tail protein # Family: family:all:296 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004380;genbank:gi:122891835;genbank:GeneID:4712363 Probab=58.30 E-value=0.42 Score=22.67 Aligned_cols=137 Identities=13% Similarity=0.174 Sum_probs=69.3 Q ss_pred CC-CCchhHHHHhhcCcHHHHHH--HHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeeheecc Q lcl|NC_018836. 
1 MA-GLPDEIKALAELSPVEDLLL--AVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQSFCE 77 (167) Q Consensus 1 ma-glp~~i~a~~e~spiedlll--a~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~~~e 77 (167) |. +...++..+ |-.-|. ++|...+++ ++-.-.-.+..||||.+-...+- -|..++.......+.|..+.+ T Consensus 1 Msms~~~aLQ~A-----i~~~L~adaal~alvg~-rI~D~~P~~~~~PYv~lG~~~~~-~~~~~~~~g~~~~~ti~Vws~ 73 (141) T protein:vir:10 1 MWVSVEPELTNQ-----IYKRLISDPNINKLVDD-RVFDVVQDDAVYPYIVVGESNVT-NNESSATMRETVGIVIHVYSQ 73 (141) T ss_pred CccchhHHHHHH-----HHHHhhcChhhHhhcCC-ccccCCccCCCCCEEEeCCceee-ecCCCcccceEEEEEEEEEEc Confidence 43 222222211 011111 012222222 22223334678999998766553 355556666677777888875 Q ss_pred CCCCchhhHHHHHHHHHHHHHhhhhhcccccccceeeeeeccC-CccccccccccCceeeccccccceeeceeeeeEeec Q lcl|NC_018836. 78 DPDGDADAAILAEAVRVVLRNAWLSQKVYAGRGHITRVDMASA-PRRATDWATATGPVQYADLPTGVWRYETQYDIEIRK 156 (167) Q Consensus 78 dpdgd~da~~laeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~-prr~~dwatatgpvqyadlp~gvwryet~~di~ir~ 156 (167) . .|-+.|..++.||+.+|.+ .+ -...++++.+.+..+ -+|-+|=.|.-| --+|.+.|+. T Consensus 74 ~-~g~~eak~ia~av~~AL~~-~l----~l~~~~lv~l~~~~~~~~rd~dg~t~hg--------------vl~~ra~v~~ 133 (141) T protein:vir:10 74 F-ATQYEAKLILSAIGYVLNR-PI----EIDNYEFQFSRIDSQAVFPDIDRFTKHG--------------TIRLLFKYRH 133 (141) T ss_pred C-CCHHHHHHHHHHHHHHhcc-cc----cCCCceEEEEEEeeeeeeecCCCceEEE--------------EEEEEEEEEe Confidence 4 4778899999999999964 22 234567777777543 234444333322 2334444432 Q ss_pred CcCCCCCC Q lcl|NC_018836. 157 PRTKPFPL 164 (167) Q Consensus 157 pr~~pfpl 164 (167) -..|---. 
T Consensus 134 ~~~~~~~~ 141 (141) T protein:vir:10 134 KKKNEGVY 141 (141) T ss_pred ccccccCC Confidence 11111001 No 21 >protein:vir:96260 Length: 141 # NCBI annotation: ORF027 # Family: family:all:296 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240317;genbank:gi:66395991;genbank:GeneID:5133337 Probab=58.30 E-value=0.42 Score=22.67 Aligned_cols=137 Identities=13% Similarity=0.174 Sum_probs=69.3 Q ss_pred CC-CCchhHHHHhhcCcHHHHHH--HHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeeheecc Q lcl|NC_018836. 1 MA-GLPDEIKALAELSPVEDLLL--AVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQSFCE 77 (167) Q Consensus 1 ma-glp~~i~a~~e~spiedlll--a~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~~~e 77 (167) |. +...++..+ |-.-|. ++|...+++ ++-.-.-.+..||||.+-...+- -|..++.......+.|..+.+ T Consensus 1 Msms~~~aLQ~A-----i~~~L~adaal~alvg~-rI~D~~P~~~~~PYv~lG~~~~~-~~~~~~~~g~~~~~ti~Vws~ 73 (141) T protein:vir:96 1 MWVSVEPELTNQ-----IYKRLISDPNINKLVDD-RVFDVVQDDAVYPYIVVGESNVT-NNESSATMRETVGIVIHVYSQ 73 (141) T ss_pred CccchhHHHHHH-----HHHHhhcChhhHhhcCC-ccccCCccCCCCCEEEeCCceee-ecCCCcccceEEEEEEEEEEc Confidence 43 222222211 011111 012222222 22223334678999998766553 355556666677777888875 Q ss_pred CCCCchhhHHHHHHHHHHHHHhhhhhcccccccceeeeeeccC-CccccccccccCceeeccccccceeeceeeeeEeec Q lcl|NC_018836. 78 DPDGDADAAILAEAVRVVLRNAWLSQKVYAGRGHITRVDMASA-PRRATDWATATGPVQYADLPTGVWRYETQYDIEIRK 156 (167) Q Consensus 78 dpdgd~da~~laeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~-prr~~dwatatgpvqyadlp~gvwryet~~di~ir~ 156 (167) . .|-+.|..++.||+.+|.+ .+ -...++++.+.+..+ -+|-+|=.|.-| --+|.+.|+. T Consensus 74 ~-~g~~eak~ia~av~~AL~~-~l----~l~~~~lv~l~~~~~~~~rd~dg~t~hg--------------vl~~ra~v~~ 133 (141) T protein:vir:96 74 F-ATQYEAKLILSAIGYVLNR-PI----EIDNYEFQFSRIDSQAVFPDIDRFTKHG--------------TIRLLFKYRH 133 (141) T ss_pred C-CCHHHHHHHHHHHHHHhcc-cc----cCCCceEEEEEEeeeeeeecCCCceEEE--------------EEEEEEEEEe Confidence 4 4778899999999999964 22 234567777777543 234444333322 2334444432 Q ss_pred CcCCCCCC Q lcl|NC_018836. 
157 PRTKPFPL 164 (167) Q Consensus 157 pr~~pfpl 164 (167) -..|---. T Consensus 134 ~~~~~~~~ 141 (141) T protein:vir:96 134 KKKNEGVY 141 (141) T ss_pred ccccccCC Confidence 11111001 No 22 >protein:vir:5979 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:296 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690679;genbank:geneid:6329147;genbank:gi:22855073;uniprot:O48448;genbank:GeneID:955319 Probab=47.78 E-value=0.69 Score=21.47 Aligned_cols=126 Identities=15% Similarity=0.156 Sum_probs=72.4 Q ss_pred CCCCchh-------HHHHhhcCcHHHHHHHHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeeh Q lcl|NC_018836. 1 MAGLPDE-------IKALAELSPVEDLLLAVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQ 73 (167) Q Consensus 1 maglp~~-------i~a~~e~spiedllla~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~ 73 (167) |.=--++ +.+|.+-..+-.|+ ..+-|..| .+..||||.+-...+-- |..++.......+.|. T Consensus 1 m~~~s~~~aLq~Ai~~~L~ad~~l~alv-g~I~D~~P---------~~~~~PYV~lG~~~~~d-~~~~~~~g~~~~~ti~ 69 (134) T protein:vir:59 1 MTWKLASRALQKATVENLESYQPLMEMV-NQVTESPG---------KDDPYPYVVIGDQSSTP-FETKSSFGENITMDFH 69 (134) T ss_pred CCccchhHHHHHHHHHHhhcChhHHHhh-hhhhcCCC---------CCCCCCEEEeCCceeee-cCCCcccceEEEEEEE Confidence 5433222 12333444444433 23444444 35689999996644432 3444555556666777 Q ss_pred eeccCCCCchhhHHHHHHHHHHHHHhhhhhcccccccceeeeeeccCC-ccccccccccCceeeccccccceeeceeeee Q lcl|NC_018836. 74 SFCEDPDGDADAAILAEAVRVVLRNAWLSQKVYAGRGHITRVDMASAP-RRATDWATATGPVQYADLPTGVWRYETQYDI 152 (167) Q Consensus 74 ~~~edpdgd~da~~laeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~p-rr~~dwatatgpvqyadlp~gvwryet~~di 152 (167) .+... |-..|..++.||+.+|.++.+.- ..|+++.+.+..+- +|.+|=.| |+-.-+|-. 
T Consensus 70 Vws~~--g~~ea~~ia~av~~aL~~~~L~l----~~~~lv~l~~~~~~~~rd~dg~~--------------~hg~l~fra 129 (134) T protein:vir:59 70 VWGGT--TRAEAQDISSRVLEALTYKPLMF----EGFTFVAKKLVLAQVITDTDGVT--------------KHGIIKVRF 129 (134) T ss_pred EEECC--ChHHHHHHHHHHHHHhcCCCccc----CCceEEEeEEeeeeEEecCCCce--------------EEEEEEEEE Confidence 77653 45779999999999999887543 34888888886542 34444222 333344444 Q ss_pred EeecC Q lcl|NC_018836. 153 EIRKP 157 (167) Q Consensus 153 ~ir~p 157 (167) .|..- T Consensus 130 ~ve~~ 134 (134) T protein:vir:59 130 TINNN 134 (134) T ss_pred EEecC Confidence 44443 No 23 >protein:vir:1892 Length: 121 # NCBI annotation: gp11 # Family: family:all:896 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037672;genbank:gi:9634130;genbank:GeneID:1262490 Probab=40.23 E-value=0.98 Score=20.63 Aligned_cols=118 Identities=22% Similarity=0.292 Sum_probs=63.2 Q ss_pred CchhHHHHhhcCcHHHHHHHHHH-hhcC-CcchhhhhhhhccCCEEEEEecC-ccceecCcceeccchheeeheeccCCC Q lcl|NC_018836. 4 LPDEIKALAELSPVEDLLLAVLR-DGLP-GIRVQSLIEKDQHFPFVLVRRDP-SFGLWAGDTRFTDSARVVVQSFCEDPD 80 (167) Q Consensus 4 lp~~i~a~~e~spiedllla~l~-~~lp-~i~~~sli~~~q~fpfi~~rr~~-s~g~wagd~rf~d~a~~~i~~~~edpd 80 (167) +=+.|-++-+-||---.||...- .-.| |+.. ....+|+|...+.. +-+.+....-=.|++++-|-.|+..+ T Consensus 1 m~~~i~~~l~~d~~v~allg~~~~Rvyp~~~aP-----~~~~~Pyiv~q~vsg~p~~~l~G~~~~~~~~vQIDvyA~t~- 74 (121) T protein:vir:18 1 MIAPIFSVCASSPEVTDLLGSNPVRIYPFGIQD-----DNVVYPYVVWQNITGSPENYIAQRPDADFFTLQVDAYADTV- 74 (121) T ss_pred CchHHHHHHhcChhhhhhhcCCCceeeeccCCC-----CcCcCCeEEEEEecCcccceecCCCCcceeEEEEEeecCCH- Confidence 23334444444443222221100 1123 3333 34578999988765 44455544445566777777887665 Q ss_pred CchhhHHHHHHHHHHHHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccceeeceeeeeEeecCc Q lcl|NC_018836. 81 GDADAAILAEAVRVVLRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYDIEIRKPR 158 (167) Q Consensus 81 gd~da~~laeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~di~ir~pr 158 (167) .+|..|++|||.+|.. .+++++..+. 
+ -|-.|+..|+ .|||..-.+| T Consensus 75 --~~A~~l~~avr~Ale~----------~~~~~~~~~~-------~----------ye~dT~lyR~--s~Dv~~~~~r 121 (121) T protein:vir:18 75 --DEVIAVATALRDAIEP----------HAHITRWGGQ-------E----------RDPETKRYRY--SFDVDWIVTR 121 (121) T ss_pred --HHHHHHHHHHHHHhhh----------cCcccCCCCC-------C----------Ccccccceee--eeEEEEeecC Confidence 4688999999998852 2333322221 1 1234566555 4677776666 No 24 >protein:vir:81066 Length: 118 # NCBI annotation: p13 # Family: family:all:3244 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285683;genbank:gi:148727191;genbank:GeneID:5247111 Probab=40.16 E-value=0.98 Score=20.62 Aligned_cols=115 Identities=14% Similarity=0.148 Sum_probs=65.7 Q ss_pred CcHHHHHHHHHHhhcCC-cchhhhhhhhccCCEEEEEecCcccee--cCcceeccchheeeheeccCCCCchhhHHHHHH Q lcl|NC_018836. 15 SPVEDLLLAVLRDGLPG-IRVQSLIEKDQHFPFVLVRRDPSFGLW--AGDTRFTDSARVVVQSFCEDPDGDADAAILAEA 91 (167) Q Consensus 15 spiedllla~l~~~lp~-i~~~sli~~~q~fpfi~~rr~~s~g~w--agd~rf~d~a~~~i~~~~edpdgd~da~~laea 91 (167) --+|+.|.++|..-+|| |-.+..=+....+|+|..-|...-... .|.+.=.++.++-|-.|+..+ .+|..|++| T Consensus 1 Ms~e~~l~a~L~~~~~~Rvyp~~aP~~~~~~Pyiv~q~vsg~p~~~l~G~~~~~~~~rvQIdvyA~t~---~~A~~l~~a 77 (118) T protein:vir:81 1 MSYGRVLKDLLDPVFSGRVYADIPPDSPPLDAYAIYQRVGGVPVYWQEGGMPEKVNARVQIQIWSRSK---QEAYLATVQ 77 (118) T ss_pred CchHHHHHHHHHhhcCCccccccCCCCCccCceEEEEecCCcccccccCCCCCccceeEEEEEeeCCH---HHHHHHHHH Confidence 44677777888777775 333322233445799998886543211 233333446778888888776 478899999 Q ss_pred HHHHHHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccceeeceeeeeEeecCcC Q lcl|NC_018836. 92 VRVVLRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYDIEIRKPRT 159 (167) Q Consensus 92 vr~~l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~di~ir~pr~ 159 (167) ||..|...- . + .+ | .+|+.=-|-.++.+|. 
.+|+.|=-+-+ T Consensus 78 v~~al~~~~----~------~-------~~-----~---~~~~d~ye~dt~l~r~--~~Df~iw~~~~ 118 (118) T protein:vir:81 78 VLRLVSEAP----D------M-------QV-----L---SQPIDDYVREIKLYGS--RVDVSMWYPIT 118 (118) T ss_pred HHHHhhhcc----c------e-------ee-----c---cCCccccccccCceeE--EEEEEEEecCC Confidence 998884321 0 0 01 1 1222222334666665 55666655555 No 25 >protein:vir:97070 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:3244 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453569;genbank:gi:84662604;genbank:GeneID:5142485 Probab=39.86 E-value=1 Score=20.59 Aligned_cols=115 Identities=12% Similarity=0.099 Sum_probs=64.9 Q ss_pred CcHHHHHHHHHHhhcCC-cchhhhhhhhccCCEEEEEecCcccee--cCcceeccchheeeheeccCCCCchhhHHHHHH Q lcl|NC_018836. 15 SPVEDLLLAVLRDGLPG-IRVQSLIEKDQHFPFVLVRRDPSFGLW--AGDTRFTDSARVVVQSFCEDPDGDADAAILAEA 91 (167) Q Consensus 15 spiedllla~l~~~lp~-i~~~sli~~~q~fpfi~~rr~~s~g~w--agd~rf~d~a~~~i~~~~edpdgd~da~~laea 91 (167) --+|+.|.++|+.-.|+ |-.+..=+....+|+|...|...-..+ .|...=.+..++-|-.|+..+ .+|..|+++ T Consensus 1 M~~e~~l~a~L~~~~~~Rvyp~~aP~~~~~~Pyiv~q~vsg~p~~~ldG~~~~~~~~rvQIdvyA~t~---~~A~~l~~a 77 (118) T protein:vir:97 1 MSYGRMLKDLLDPVFSGRVYADIPPDSPPLDAYAIYQRVGGVPVYWKEGGMPDKVNARVQVQIWSRSK---QEAYLATVQ 77 (118) T ss_pred CchHHHHHHHHhhhcCCccccccCCCCCCcCCEEEEEecCCcccccccCCCCCccceeEEEEEeeCCH---HHHHHHHHH Confidence 45777888888766553 433333234455799999887553222 244333456678888888776 478889999 Q ss_pred HHHHHHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccceeeceeeeeEeecC Q lcl|NC_018836. 92 VRVVLRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYDIEIRKP 157 (167) Q Consensus 92 vr~~l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~di~ir~p 157 (167) ||..|... +.. .|+. 
.|++=-|-.++.+|..--|-|.-+-- T Consensus 78 v~~al~~~----~~~-------------~~~~--------~~~~~ye~dt~lyr~~~Df~iw~~~~ 118 (118) T protein:vir:97 78 VLRIVSEA----NDM-------------QVLS--------QPIDDYVRELKLYGSRVDISMWYNLT 118 (118) T ss_pred HHHHhhcc----ccc-------------cccc--------CCcccccccCCceEEEEEEEEEeecC Confidence 98877432 111 1111 11222234566666654444433322 No 26 >protein:vir:95961 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240391;genbank:gi:66396072;genbank:GeneID:5133472 Probab=34.69 E-value=1.3 Score=20.00 Aligned_cols=137 Identities=12% Similarity=0.113 Sum_probs=74.9 Q ss_pred CC-CCchhHHH-----HhhcCcHHHHHHHHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeehe Q lcl|NC_018836. 1 MA-GLPDEIKA-----LAELSPVEDLLLAVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQS 74 (167) Q Consensus 1 ma-glp~~i~a-----~~e~spiedllla~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~ 74 (167) |. ...-++.. |..-..+ ...+.+ ++-.-.-.+..||||.+-...+- -|..++.......+.|.. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l--------~alvgg-rV~D~~P~~~~~PYv~lG~~~~~-d~~~~~~~g~~~~~ti~V 70 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNPII--------QKQLDG-RVFDCVQKDAVYPYIVVGETNVT-NKETTTSMVEDVGITLHV 70 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhH--------HHhhcc-ccccCCcCCCCCCEEEecCceee-ecCCCcccceEEEEEEEE Confidence 54 22222111 1111111 112222 22222334568999999766553 245555666667777877 Q ss_pred eccCCCCchhhHHHHHHHHHHHHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccc-eeeceeeeeE Q lcl|NC_018836. 75 FCEDPDGDADAAILAEAVRVVLRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGV-WRYETQYDIE 153 (167) Q Consensus 75 ~~edpdgd~da~~laeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gv-wryet~~di~ 153 (167) +.. -.|-+.|..++.||+.+|.+ . ...+ .+|++.+.+..+ |+...|.|. |+.--+|.+. 
T Consensus 71 ws~-~~g~~eak~ia~av~~aL~~-~---l~l~-~~~lv~l~~~~~--------------~~~rd~dg~~~hgvl~fra~ 130 (145) T protein:vir:95 71 YSQ-ARNRDEASQIIQFLGFVLNN-E---IEID-YYSFIKSRIDTQ--------------EVITDIDQYTKHGVIRLVFK 130 (145) T ss_pred EEc-CCCHHHHHHHHHHHHHHhcc-c---cCCC-CCeEEEeEEeee--------------eEeecCCCceEEEEEEEEEE Confidence 764 46888999999999999964 3 3334 488888887654 222335554 5566677777 Q ss_pred eecCcCCCCCCCCC Q lcl|NC_018836. 154 IRKPRTKPFPLSTP 167 (167) Q Consensus 154 ir~pr~~pfplstp 167 (167) |+.-..|----.-- T Consensus 131 ve~~~~~~~~~~~~ 144 (145) T protein:vir:95 131 YRHNTLQRSVTNGA 144 (145) T ss_pred EEecccccccccCC Confidence 76543332111111 No 27 >protein:vir:94794 Length: 145 # NCBI annotation: ORF028 # Family: family:all:296 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240542;genbank:gi:66396219;genbank:GeneID:5133574 Probab=34.63 E-value=1.3 Score=20.00 Aligned_cols=137 Identities=12% Similarity=0.111 Sum_probs=74.9 Q ss_pred CC-CCchhHHH-----HhhcCcHHHHHHHHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeehe Q lcl|NC_018836. 1 MA-GLPDEIKA-----LAELSPVEDLLLAVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQS 74 (167) Q Consensus 1 ma-glp~~i~a-----~~e~spiedllla~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~ 74 (167) |. ...-++.. |..-..+ ...+.+ ++-.-.-.+..||||.+-...+- -|..++.......+.|.. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l--------~alvgg-rV~D~~P~~~~~PYv~lG~~~~~-d~~~~~~~g~~~~~ti~V 70 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNPII--------QKQLDG-RVFDCVQKDAVYPYIVVGETNVT-NKETTTSMVEDVGITLHV 70 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhH--------HHhhcc-ccccCCcCCCCCCEEEecCceee-ecCCCcccceEEEEEEEE Confidence 54 22222111 1111111 112222 22222334568999999766553 245555666667777877 Q ss_pred eccCCCCchhhHHHHHHHHHHHHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccc-eeeceeeeeE Q lcl|NC_018836. 
75 FCEDPDGDADAAILAEAVRVVLRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGV-WRYETQYDIE 153 (167) Q Consensus 75 ~~edpdgd~da~~laeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gv-wryet~~di~ 153 (167) +.. -.|-+.|..++.||+.+|.+ . ...+ .+|++.+.+..+ |+...|.|. |+.--+|.+. T Consensus 71 ws~-~~g~~eak~ia~av~~aL~~-~---l~l~-~~~lv~l~~~~~--------------~~~rd~dg~~~hgvl~fra~ 130 (145) T protein:vir:94 71 YSQ-ARNRDEASQIIQFLGFVLNN-E---IEID-YYSFIKSRIDTQ--------------EVITDIDQYTKHGIIRLVFK 130 (145) T ss_pred EEc-CCCHHHHHHHHHHHHHHhcc-c---cCCC-CCeEEEeEEeee--------------eEeecCCCceEEEEEEEEEE Confidence 764 46888999999999999964 3 3334 488888887654 222335554 5566677777 Q ss_pred eecCcCCCCCCCCC Q lcl|NC_018836. 154 IRKPRTKPFPLSTP 167 (167) Q Consensus 154 ir~pr~~pfplstp 167 (167) |+.-..|----.-- T Consensus 131 ve~~~~~~~~~~~~ 144 (145) T protein:vir:94 131 YRHNTLQRSVTNGA 144 (145) T ss_pred EEecccccccccCC Confidence 76543332111111 No 28 >protein:vir:1244 Length: 145 # NCBI annotation: similar to phage Spp1 gp17 # Family: family:all:296 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510943;genbank:gi:17426277;genbank:GeneID:927402 Probab=34.22 E-value=1.3 Score=19.95 Aligned_cols=137 Identities=14% Similarity=0.148 Sum_probs=72.7 Q ss_pred CCCCchhHHHHhhcCcHHHHHHHHH------HhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeehe Q lcl|NC_018836. 1 MAGLPDEIKALAELSPVEDLLLAVL------RDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQS 74 (167) Q Consensus 1 maglp~~i~a~~e~spiedllla~l------~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~ 74 (167) |.--|.. +| -+-+.+.| ...+.+ ++-.-.-.+..||||.+-...+-- |..++.......+.|.. 
T Consensus 1 M~~s~~~--aL------q~ai~~~L~ad~~l~~lvg~-~vyD~~P~~~~~PyV~lG~~~~~~-~~t~~~~~~~~~lti~V 70 (145) T protein:vir:12 1 MWVSVER--YL------FNKVYNKLKSNPIIQKQLGG-RVFDCVQKDAVYPYIVVGETNVTN-KETTTSMVEDVGITLHV 70 (145) T ss_pred CcccHHH--HH------HHHHHHHhhcChhHHHhcCc-ccccCCccCCCCCEEEeccceeee-cCCCcccceEEEEEEEE Confidence 5433321 11 12222222 122222 222223345689999987655532 44445556666677777 Q ss_pred eccCCCCchhhHHHHHHHHHHHHHhhhhhcccccccceeeeeeccCCccccccccccCceeecccccc-ceeeceeeeeE Q lcl|NC_018836. 75 FCEDPDGDADAAILAEAVRVVLRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTG-VWRYETQYDIE 153 (167) Q Consensus 75 ~~edpdgd~da~~laeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~g-vwryet~~di~ 153 (167) |.. .+|-..+..++++|+.+|.+ .+ - ...++++.+.+..+- -..| |.| .|+---+|..+ T Consensus 71 ws~-~~gr~ea~~ia~ai~~aL~~-~l---~-l~~~~lv~l~~~~~~-~~rd-------------~d~~~~hgvl~~ra~ 130 (145) T protein:vir:12 71 YSQ-ARNRDEASQIIQFLGFVLNN-EI---E-IDYYSFIKSRIDTQE-VITD-------------IDQYTKHGIIRLVFK 130 (145) T ss_pred EEc-CccHHHHHHHHHHHHHHhcc-cc---C-CCCceEEEEEEeeEE-EEec-------------CCCceEEEEEEEEEE Confidence 765 55778899999999999864 22 2 234777777766431 1112 333 35555677777 Q ss_pred eecCcCCCCCCCCC Q lcl|NC_018836. 154 IRKPRTKPFPLSTP 167 (167) Q Consensus 154 ir~pr~~pfplstp 167 (167) ||.--.|----... T Consensus 131 i~~~~~~~~~~~~~ 144 (145) T protein:vir:12 131 YRHNTLQRSVTNGA 144 (145) T ss_pred EEeCCcccccccCC Confidence 77644332211111 No 29 >protein:vir:101655 Length: 134 # NCBI annotation: gp18 # Family: family:all:2795 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654773;genbank:gi:109302771;genbank:GeneID:4156089 Probab=33.77 E-value=1.3 Score=19.90 Aligned_cols=133 Identities=26% Similarity=0.372 Sum_probs=85.1 Q ss_pred CCCCchhHHHHhhcCc-HHHHHHHHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeeheeccCC Q lcl|NC_018836. 
1 MAGLPDEIKALAELSP-VEDLLLAVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQSFCEDP 79 (167) Q Consensus 1 maglp~~i~a~~e~sp-iedllla~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~~~edp 79 (167) |.-|. .-.| .|.|..|-|.-.+-+|.. ---.+..-|||++.|+|.= |...-.|-|-|.|.-|..|- T Consensus 1 mlpls-------rpnpnaeklvcaylspffenvas--hrwvdaptpfilvkrlpgg----gqgevsdcalmsikvfgkdv 67 (134) T protein:vir:10 1 MLPLS-------RPNPNAEKLVCAYLSPFFENVAS--HRWVDAPTPFILVKRLPGG----GQGEVSDCALMSIKVFGKDV 67 (134) T ss_pred CCCCC-------CCCCchhhhhhhhhhhHHhhhhc--cccccCCCceEEEeeCCCC----CCccccceeeeeeeeecccc Confidence 43221 1112 355666666555554432 2234566799999999974 45667899999999999887 Q ss_pred CCchhhHHHHHHHHHHHHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccceeeceeeeeEeecCcC Q lcl|NC_018836. 80 DGDADAAILAEAVRVVLRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYDIEIRKPRT 159 (167) Q Consensus 80 dgd~da~~laeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~di~ir~pr~ 159 (167) | .|+.||.-|..- ||.|...-.+.--||-.|+.+-.- .-+|-|- .|.|=-.. -|..||=|.+|---- T Consensus 68 d---eagdladevher-mrkwkpkdtvsygghsfginllev-edapfwl------dygddtee--cytarywvhlrvdyv 134 (134) T protein:vir:10 68 D---EAGDLADEVHER-MRKWKPKDTVSYGGHSFGINLLEV-EDAPFWL------DYGDDTEE--CYTARYWVHLRVDYV 134 (134) T ss_pred c---cccchHHHHHHH-HhccCcccccccCchhhcceeEee-cCCceee------ecCCCccc--eeeeeEEEEEEEecC Confidence 6 467788887654 677888888888899888776432 2334453 34443222 366677777665433 No 30 >protein:vir:7860 Length: 134 # NCBI annotation: gp17 # Family: family:all:2795 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817467;genbank:gi:29565896;genbank:GeneID:1259089 Probab=33.77 E-value=1.3 Score=19.90 Aligned_cols=133 Identities=26% Similarity=0.372 Sum_probs=85.1 Q ss_pred CCCCchhHHHHhhcCc-HHHHHHHHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeeheeccCC Q lcl|NC_018836. 
1 MAGLPDEIKALAELSP-VEDLLLAVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQSFCEDP 79 (167) Q Consensus 1 maglp~~i~a~~e~sp-iedllla~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~~~edp 79 (167) |.-|. .-.| .|.|..|-|.-.+-+|.. ---.+..-|||++.|+|.= |...-.|-|-|.|.-|..|- T Consensus 1 mlpls-------rpnpnaeklvcaylspffenvas--hrwvdaptpfilvkrlpgg----gqgevsdcalmsikvfgkdv 67 (134) T protein:vir:78 1 MLPLS-------RPNPNAEKLVCAYLSPFFENVAS--HRWVDAPTPFILVKRLPGG----GQGEVSDCALMSIKVFGKDV 67 (134) T ss_pred CCCCC-------CCCCchhhhhhhhhhhHHhhhhc--cccccCCCceEEEeeCCCC----CCccccceeeeeeeeecccc Confidence 43221 1112 355666666555554432 2234566799999999974 45667899999999999887 Q ss_pred CCchhhHHHHHHHHHHHHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccceeeceeeeeEeecCcC Q lcl|NC_018836. 80 DGDADAAILAEAVRVVLRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYDIEIRKPRT 159 (167) Q Consensus 80 dgd~da~~laeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~di~ir~pr~ 159 (167) | .|+.||.-|..- ||.|...-.+.--||-.|+.+-.- .-+|-|- .|.|=-.. -|..||=|.+|---- T Consensus 68 d---eagdladevher-mrkwkpkdtvsygghsfginllev-edapfwl------dygddtee--cytarywvhlrvdyv 134 (134) T protein:vir:78 68 D---EAGDLADEVHER-MRKWKPKDTVSYGGHSFGINLLEV-EDAPFWL------DYGDDTEE--CYTARYWVHLRVDYV 134 (134) T ss_pred c---cccchHHHHHHH-HhccCcccccccCchhhcceeEee-cCCceee------ecCCCccc--eeeeeEEEEEEEecC Confidence 6 467788887654 677888888888899888776432 2334453 34443222 366677777665433 No 31 >protein:vir:10368 Length: 118 # NCBI annotation: conserved phage protein # Family: family:all:3244 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858960;genbank:gi:32128425;genbank:GeneID:2648389 Probab=33.07 E-value=1.4 Score=19.82 Aligned_cols=115 Identities=14% Similarity=0.123 Sum_probs=64.1 Q ss_pred CcHHHHHHHHHHhhcCC-cchhhhhhhhccCCEEEEEecCcccee--cCcceeccchheeeheeccCCCCchhhHHHHHH Q lcl|NC_018836. 
15 SPVEDLLLAVLRDGLPG-IRVQSLIEKDQHFPFVLVRRDPSFGLW--AGDTRFTDSARVVVQSFCEDPDGDADAAILAEA 91 (167) Q Consensus 15 spiedllla~l~~~lp~-i~~~sli~~~q~fpfi~~rr~~s~g~w--agd~rf~d~a~~~i~~~~edpdgd~da~~laea 91 (167) --+|+.|.++|+.-.++ |-.+..=+....+|+|...|..+-... .|.+.=.+..++-|-.|+..+ .+|..|++| T Consensus 1 Ms~e~~l~a~L~~~~~~RVyp~~aP~~~~~~Pyiv~q~vsg~p~~~l~G~~~~~~~~rvQIdvyA~t~---~~A~~l~~a 77 (118) T protein:vir:10 1 MSYGRVLKDLLDPVFSGRVYADIPPDSPPLDAYAIYQRVGGVPVYWQEGGMPEKVNARVQIQIWSRSK---QEAYLATVQ 77 (118) T ss_pred CchHHHHHHHHhhhcCCccccccCCCCCCcCCEEEEEecCCcccccccCCCCccceeEEEEEEeeCCH---HHHHHHHHH Confidence 44677778888766664 333333344456799999886543221 233333455678888888776 478899999 Q ss_pred HHHHHHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccceeeceeeeeEeecCcC Q lcl|NC_018836. 92 VRVVLRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYDIEIRKPRT 159 (167) Q Consensus 92 vr~~l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~di~ir~pr~ 159 (167) ||..|... +.. .|.. .|++=-|-.++.+|.. +|+.|=--+| T Consensus 78 v~~al~~~----~~~-------------~~~~--------~~~d~ye~dt~l~r~~--~Df~vw~~~~ 118 (118) T protein:vir:10 78 VLRLVSEA----NDM-------------QVLS--------QPIDDYVREIKLYGSR--VDISMWYNLT 118 (118) T ss_pred HHHHhhhc----ccc-------------eecc--------CCCccccccCCceEEE--EEEEEeeecC Confidence 98877543 110 0110 1111123345665554 4444444444 No 32 >protein:vir:93602 Length: 114 # NCBI annotation: putative structural component # Family: family:all:896 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449300;genbank:gi:157166048;uniprot:Q6H9U1;genbank:GeneID:5580424 Probab=31.26 E-value=1.5 Score=19.60 Aligned_cols=107 Identities=14% Similarity=0.094 Sum_probs=65.8 Q ss_pred cHHHHHHHHHHhhcCC-----cchhhhhhhhccCCEEEEEecCc--cceecCcceeccchheeeheeccCCCCchhhHHH Q lcl|NC_018836. 
16 PVEDLLLAVLRDGLPG-----IRVQSLIEKDQHFPFVLVRRDPS--FGLWAGDTRFTDSARVVVQSFCEDPDGDADAAIL 88 (167) Q Consensus 16 piedllla~l~~~lp~-----i~~~sli~~~q~fpfi~~rr~~s--~g~wagd~rf~d~a~~~i~~~~edpdgd~da~~l 88 (167) =+|+-|.++|..-++| +..++.-.-+-.+|||.-.+-.+ +-.=.|. -.|..++-|-.|+..+ .+|..| T Consensus 1 M~e~~i~~lL~~~~~gRvyp~~~P~~~~~~~~~~Pyiv~q~vsg~p~~~l~gp--~~~~~~vQIDvyA~t~---~~A~~l 75 (114) T protein:vir:93 1 MTEADLYPHLAHLAGGQVYPYVVPLLDGRPSVALPWVVFSLISSVSADVMGGQ--AESSVSVQIDVYAGTV---TQARQI 75 (114) T ss_pred CchHHHHHHHHhhcCcccccccCCcccCcCCccCceEEEEeccCcccccccCc--cccceEEEEEeeeCCH---HHHHHH Confidence 3466666666654443 33333222223589998877532 2233453 3577888888888765 478899 Q ss_pred HHHHHHHHHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccceeeceeeeeEe Q lcl|NC_018836. 89 AEAVRVVLRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGVWRYETQYDIEI 154 (167) Q Consensus 89 aeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gvwryet~~di~i 154 (167) +++||..|. ..+++..... +=-|=.++..|.---|.|+| T Consensus 76 ~~~v~~Al~----------~~~~~~~~~~-----------------~~ye~dt~lyR~~~d~~v~~ 114 (114) T protein:vir:93 76 RQDAREAIM----------LLAPGSVSEM-----------------QDYIPENRCYRATLEFQVTV 114 (114) T ss_pred HHHHHHHHh----------hcCcEeecCC-----------------CcccccccceeeEEEEEEeC Confidence 999998885 2234432211 11256678888888888888 No 33 >protein:vir:97325 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240617;genbank:gi:66396297;genbank:GeneID:5133681 Probab=30.62 E-value=1.6 Score=19.52 Aligned_cols=136 Identities=13% Similarity=0.123 Sum_probs=75.3 Q ss_pred CC-CCchhHHH-----HhhcCcHHHHHHHHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeehe Q lcl|NC_018836. 1 MA-GLPDEIKA-----LAELSPVEDLLLAVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQS 74 (167) Q Consensus 1 ma-glp~~i~a-----~~e~spiedllla~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~ 74 (167) |. ...-++.. |..-..+ ...+.+ ++-.-.-.+..||||.+-...+- -|..++.......+.|.. 
T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~~l--------~alvgg-rV~D~~P~~a~~PYv~lG~~~~~-d~~~~~~~g~~~~~ti~V 70 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLII--------RKQLDG-RVFDCVQKDAVYPYIVVGETNVT-NKETTTSMVEDVGITLHV 70 (145) T ss_pred CcchHhHHHHHHHHHHhhcChhH--------HHhhcC-ceecCCccCCCCCEEEeCcceee-ecCCCcccceEEEEEEEE Confidence 54 22222111 1111111 112222 22222334678999999766553 245555666677778888 Q ss_pred eccCCCCchhhHHHHHHHHHHHHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccc-eeeceeeeeE Q lcl|NC_018836. 75 FCEDPDGDADAAILAEAVRVVLRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGV-WRYETQYDIE 153 (167) Q Consensus 75 ~~edpdgd~da~~laeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gv-wryet~~di~ 153 (167) +.. -+|-+.|..++.||+.+|.+ . ...+ .+|++.+.+..+- +...|.|. |+.--+|.+. T Consensus 71 ws~-~~g~~eak~ia~av~~aL~~-~---l~l~-~~~lv~l~~~~~~--------------~~rd~dg~~~hgvl~fra~ 130 (145) T protein:vir:97 71 YSQ-ARNRDEASQIIQFLGFVLNN-E---IEID-YYSFIKSRIDTQE--------------VITDIDQYTKHGIIRLVFK 130 (145) T ss_pred EEc-CCCHHHHHHHHHHHHHHhcc-c---cCCC-CCeEEEeEEeeee--------------EeecCCCceEEEEEEEEEE Confidence 874 56888899999999999964 3 3344 4888888776542 22235554 5566677777 Q ss_pred eecCcCCCCCCCCC Q lcl|NC_018836. 154 IRKPRTKPFPLSTP 167 (167) Q Consensus 154 ir~pr~~pfplstp 167 (167) |+.-.-| =|++.- T Consensus 131 ve~~~~~-~~~~~~ 143 (145) T protein:vir:97 131 YRHNTLQ-RSVTNG 143 (145) T ss_pred EecCcee-cccccC Confidence 7653322 122222 No 34 >protein:vir:95111 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240829;genbank:gi:66394699;genbank:GeneID:5133905 Probab=29.24 E-value=1.7 Score=19.35 Aligned_cols=137 Identities=12% Similarity=0.116 Sum_probs=74.8 Q ss_pred CC-CCchhHH-----HHhhcCcHHHHHHHHHHhhcCCcchhhhhhhhccCCEEEEEecCccceecCcceeccchheeehe Q lcl|NC_018836. 
1 MA-GLPDEIK-----ALAELSPVEDLLLAVLRDGLPGIRVQSLIEKDQHFPFVLVRRDPSFGLWAGDTRFTDSARVVVQS 74 (167) Q Consensus 1 ma-glp~~i~-----a~~e~spiedllla~l~~~lp~i~~~sli~~~q~fpfi~~rr~~s~g~wagd~rf~d~a~~~i~~ 74 (167) |. +..-++. +|..-+.+-. .+.+ ++-.-.-.+..||||.+-...+- -|..++.......+.|.. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~a--------lvgg-rV~D~~P~~a~~PYV~lG~~~~~-~~~~~~~~g~~~~~ti~V 70 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNSIIQK--------QLDG-RVFDCVQKDAVYPYIVVGETNVT-NKETTTSMVEDVGITLHV 70 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHH--------hhcC-ceecCCcCCCCCCEEEecCceee-ecCCCcccceEEEEEEEE Confidence 54 2222221 1112122222 2222 22222334568999999766553 245555666667777877 Q ss_pred eccCCCCchhhHHHHHHHHHHHHHhhhhhcccccccceeeeeeccCCccccccccccCceeeccccccc-eeeceeeeeE Q lcl|NC_018836. 75 FCEDPDGDADAAILAEAVRVVLRNAWLSQKVYAGRGHITRVDMASAPRRATDWATATGPVQYADLPTGV-WRYETQYDIE 153 (167) Q Consensus 75 ~~edpdgd~da~~laeavr~~l~~a~~~~~~~~~~gh~~~~~m~~~prr~~dwatatgpvqyadlp~gv-wryet~~di~ 153 (167) +.. -.|-+.|..++.||+.+|.+ . ...+ .+|++.+.+..+ |+-..|.|. |+.--+|.+. T Consensus 71 ws~-~~g~~eak~ia~av~~aL~~-~---l~l~-~~~lv~l~~~~~--------------~~~rd~dg~~~hgvl~~ra~ 130 (145) T protein:vir:95 71 YSQ-ARNRDEASQIIQFLGFVLNN-E---IEID-YYSFIKSRIDTQ--------------EVITDIDRYTKHGIIRLVFK 130 (145) T ss_pred EEc-CCCHHHHHHHHHHHHHHhcc-c---cCCC-CCeEEEeEEeee--------------eEeecCCCceEEEEEEEEEE Confidence 764 45888899999999999964 3 3344 488888887654 222235553 5566677777 Q ss_pred eecCcCCCCCCCCC Q lcl|NC_018836. 154 IRKPRTKPFPLSTP 167 (167) Q Consensus 154 ir~pr~~pfplstp 167 (167) |+.-..|----.-- T Consensus 131 ve~~~~~~~~~~~~ 144 (145) T protein:vir:95 131 YRHNTLQRSVTNGA 144 (145) T ss_pred EEecccccccccCC Confidence 76543332110001 Done!