Query lcl|NC_020866.1_cdsid_YP_007676423.1 [gene=RHVG_00044] [protein=hypothetical protein] [protein_id=YP_007676423.1] [location=26455..26883] Match_columns 142 No_of_seqs 109 out of 226 Neff 6.8 Searched_HMMs 1612 Date Thu Nov 7 17:26:02 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_44 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_44_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:79074 Length: 150 100.0 3.4E-53 2.1E-56 308.3 15.7 140 1-141 1-150 (150) 2 protein:vir:99848 Length: 172 100.0 5.6E-53 3.5E-56 307.1 15.7 140 1-141 1-172 (172) 3 protein:vir:107864 Length: 150 100.0 7.1E-53 4.4E-56 306.5 15.8 140 1-141 1-150 (150) 4 protein:vir:1993 Length: 141 # 100.0 1.7E-52 1.1E-55 304.4 16.1 140 1-142 1-141 (141) 5 protein:vir:79253 Length: 138 100.0 2.2E-51 1.4E-54 298.3 15.6 137 1-137 1-138 (138) 6 protein:vir:99222 Length: 138 100.0 2.2E-51 1.4E-54 298.3 15.6 137 1-137 1-138 (138) 7 protein:vir:103846 Length: 138 100.0 5.3E-51 3.3E-54 296.2 15.6 137 1-141 1-138 (138) 8 protein:vir:43 Length: 131 # N 96.1 1.1E-05 6.6E-09 47.9 2.8 113 1-142 1-118 (131) 9 protein:vir:80967 Length: 131 95.9 1.7E-05 1E-08 46.8 3.2 98 1-114 1-131 (131) 10 protein:vir:98481 Length: 136 95.7 0.0001 6.2E-08 42.5 6.8 117 1-142 1-135 (136) 11 protein:vir:98900 Length: 132 94.6 0.00085 5.3E-07 37.4 8.6 114 1-142 1-119 (132) 12 protein:vir:94761 Length: 132 94.0 0.0011 6.6E-07 36.9 7.9 102 1-142 1-116 (132) 13 protein:vir:9761 Length: 140 # 93.9 0.0005 3.1E-07 38.7 5.9 115 1-135 1-140 (140) 14 protein:vir:2505 Length: 128 # 92.5 0.00018 1.1E-07 41.1 1.4 112 1-142 5-118 (128) 15 protein:vir:2432 Length: 124 # 92.4 0.0014 8.8E-07 36.2 6.1 116 1-134 1-124 (124) 16 protein:vir:9576 Length: 131 # 91.8 0.0027 1.7E-06 34.7 6.8 102 1-142 1-115 (131) 17 protein:vir:1640 Length: 132 # 91.4 0.0015 9.2E-07 36.1 5.0 115 1-130 1-132 (132) 18 protein:vir:7773 Length: 123 # 90.8 0.004 2.5E-06 33.7 6.8 117 1-134 1-123 (123) 19 protein:vir:1329 Length: 122 # 89.6 0.0058 3.6E-06 32.9 6.7 113 1-134 1-122 (122) 20 protein:vir:78478 Length: 149 88.3 0.0042 2.6E-06 33.6 4.9 123 1-141 1-149 (149) 21 protein:vir:78254 Length: 149 88.3 0.0042 2.6E-06 33.6 4.9 123 1-141 1-149 (149) 22 protein:vir:6243 Length: 122 # 86.0 0.013 7.8E-06 31.0 6.2 113 1-134 1-122 (122) 23 protein:vir:94955 Length: 170 78.6 0.019 1.2E-05 30.0 4.2 118 1-142 14-152 (170) 24 protein:vir:4228 Length: 125 # 76.0 0.047 2.9E-05 27.8 5.6 116 1-134 1-125 (125) 25 protein:vir:80389 Length: 172 75.6 0.031 1.9E-05 28.9 4.5 120 1-142 15-159 (172) 26 protein:vir:104088 Length: 125 73.0 0.048 3E-05 27.8 4.9 116 1-134 1-125 (125) 27 protein:vir:99002 Length: 158 71.1 0.036 2.2E-05 28.5 3.7 113 1-142 1-133 (158) 28 protein:vir:1887 Length: 108 # 71.0 0.2 0.00013 24.4 8.9 97 1-118 6-108 (108) 29 protein:vir:192 Length: 108 # 71.0 0.2 0.00013 24.4 8.9 97 1-118 6-108 (108) 30 protein:vir:106583 Length: 105 65.1 0.29 0.00018 23.5 7.8 90 3-107 1-105 (105) 31 protein:vir:9821 Length: 138 # 58.8 0.13 8.2E-05 25.4 4.3 112 1-142 6-130 (138) 32 protein:vir:81159 Length: 95 # 58.1 0.42 0.00026 22.6 9.2 91 1-108 1-95 (95) 33 protein:vir:4788 Length: 130 # 56.1 0.12 7.2E-05 25.7 3.5 109 1-142 1-122 (130) 34 protein:vir:97267 Length: 172 54.3 0.067 4.1E-05 27.0 1.9 128 1-140 16-172 (172) 35 protein:vir:95176 Length: 172 53.6 0.24 0.00015 24.0 4.9 117 1-142 17-157 (172) 36 protein:vir:2345 Length: 125 # 52.6 0.45 0.00028 22.5 6.2 116 1-134 1-125 (125) 37 protein:vir:78383 Length: 169 40.0 0.99 0.00062 20.6 6.6 118 1-142 15-155 (169) 38 protein:vir:95004 Length: 169 39.4 1 0.00063 20.5 6.4 118 1-142 15-155 (169) 39 protein:vir:108221 Length: 150 36.5 1.2 0.00073 20.2 5.9 126 1-142 5-149 (150) 40 protein:vir:100103 Length: 120 30.1 1.6 0.001 19.5 8.0 94 1-111 5-120 (120) 41 protein:vir:100245 Length: 113 26.2 2 0.0012 19.0 8.7 93 1-111 1-113 (113) 42 protein:vir:102961 Length: 131 20.1 1.4 0.00086 19.8 3.1 108 9-142 1-121 (131) No 1 >protein:vir:79074 Length: 150 # NCBI annotation: gp10, conserved hypothetical protein # Family: family:all:348 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111210;genbank:gi:134288797;genbank:GeneID:4960748 Probab=100.00 E-value=3.4e-53 Score=308.27 Aligned_cols=140 Identities=26% Similarity=0.470 Sum_probs=130.2 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCC-----cccccCHHHHHHHHHHHHHHHHHHHhhhcCCCcccccHHHHHHHHHHH Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTP-----PAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPLVETPPQIPEIAISIA 75 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~-----~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl~~~p~~L~~~~~dIA 75 (142) |+|||.+||+++||+++|++|+|++.. +.+++|+++|++||++|+++|||||++||.|||+++|.+|+++||||| T Consensus 1 M~Y~T~~Dl~~r~ge~~l~~Ltd~~~~~~~~~~~~~~d~~~i~~Al~dA~~~IDgyL~~RY~lPl~~vP~~L~~~a~dIA 80 (150) T protein:vir:79 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESAVRQAEEIVDAHLRGRYNLPLSPVPTVIKDVTVNLA 80 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccccccccccccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHH Confidence 999999999999999999999987642 457899999999999999999999999999999999999999999999 Q ss_pred HHHHhcCCC-----ChHHHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccChhhhccc Q lcl|NC_020866. 76 IWKLHSFEP-----GDKIKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQENMKGF 141 (142) Q Consensus 76 ~Y~L~~~~~-----~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r~~l~g~ 141 (142) ||+||.+++ +|++++|||+|++||++|++||++||+++ .+.+++++++++++++|+|||++|||| T Consensus 81 ~Y~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~-~~~~~~~~~~~v~~~~r~f~r~~l~g~ 150 (150) T protein:vir:79 81 RHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPS-GPATPEPGEMKVRARRRQFDADLLERF 150 (150) T ss_pred HHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCC-ccCCCCCCceeeecCCCccChhhccCC Confidence 999998753 78999999999999999999999999876 455556788999999999999999999 No 2 >protein:vir:99848 Length: 172 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164077;genbank:gi:56692609;genbank:GeneID:3192576 Probab=100.00 E-value=5.6e-53 Score=307.05 Aligned_cols=140 Identities=29% Similarity=0.474 Sum_probs=130.0 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhCCCCC-------------------------cccccCHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_020866. 1 MP-YTSLDALTKKFGPDMLLGLTDRTTP-------------------------PAGVIDIDVVNDALTDTDAVIDGYLGT 54 (142) Q Consensus 1 M~-YaT~~Dl~~~~g~~el~~Ltd~~~~-------------------------~~~~~d~~~v~~Al~~A~~~id~YL~~ 54 (142) |. |||++||+++||++||+|||++++. .+|++|.++|++||++|+++|||||++ T Consensus 1 ~~mYaT~~dl~~r~g~~el~qLt~~~~~~~~~~~l~d~~~~~~~~~~~~~~~~~~g~~d~~~i~~Al~dA~aeIDgYL~~ 80 (172) T protein:vir:99 1 MAVYITLPELAERPGAEELSQAATPRPLQAVDSELLDALLRGLPVDRWTPEEIEVGHATVEVINSAVSDAQGYIDGFLQR 80 (172) T ss_pred CcccccHHHHHhhcCHHHHHHHhccccccCCHHHHHHHhhcchhhhhcccccccccccCHHHHHHHHHHHHHHHHHHHhc Confidence 99 9999999999999999999987742 457899999999999999999999999 Q ss_pred h-cCCCcccccHHHHHHHHHHHHHHHhcCC-----CChHHHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeec Q lcl|NC_020866. 55 R-YVLPLVETPPQIPEIAISIAIWKLHSFE-----PGDKIKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTD 128 (142) Q Consensus 55 R-Y~lPl~~~p~~L~~~~~dIA~Y~L~~~~-----~~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~ 128 (142) | |.|||+++|.+|+++|||||||+||+++ .+|++++|||+||+||++|++||++||++.+.+ +++++.++|++ T Consensus 81 R~Y~lPL~~vP~~L~~~a~dIArY~L~~~r~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~-~~~~~~~~v~~ 159 (172) T protein:vir:99 81 RGYSLPLAKRYGVVTGWTRAIARYLLHQDRLGPGAEKDPIVRDYRDALKFLQLIAEGKFSLGPDDPLT-PPGGGVPQVLA 159 (172) T ss_pred ccccCCCcccchHHHHHHHHHHHHHHHhccCCcccCCHHHHHHHHHHHHHHHHHhcCccccCCCCCCC-CCCCCceeeec Confidence 9 9999999999999999999999999875 378999999999999999999999999887654 45668899999 Q ss_pred CCCccChhhhccc Q lcl|NC_020866. 129 RERPLTQENMKGF 141 (142) Q Consensus 129 ~~r~F~r~~l~g~ 141 (142) ++|+|||++|||| T Consensus 160 ~~r~F~rd~L~gf 172 (172) T protein:vir:99 160 PARTFSHDTLKDY 172 (172) T ss_pred CCCccChhhccCC Confidence 9999999999999 No 3 >protein:vir:107864 Length: 150 # NCBI annotation: gp36 # Family: family:all:348 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024709;genbank:gi:48696946;genbank:GeneID:2845952 Probab=100.00 E-value=7.1e-53 Score=306.48 Aligned_cols=140 Identities=24% Similarity=0.437 Sum_probs=129.5 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCC-----cccccCHHHHHHHHHHHHHHHHHHHhhhcCCCcccccHHHHHHHHHHH Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTP-----PAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPLVETPPQIPEIAISIA 75 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~-----~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl~~~p~~L~~~~~dIA 75 (142) |+|||.+||+++||+++|++|+|++.. +.+++|+++|++||++|+++|||||++||.||++++|.+|+++||||| T Consensus 1 M~Y~T~~Dl~~r~ge~el~~Ltd~~~~g~~~~~~~~~d~~~i~~Al~dA~~eIDgYL~~RY~lPl~~vP~~L~~~a~dIA 80 (150) T protein:vir:10 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESSVRQAEEIVDAHLRGRYNLPLSPVPTVIKDVTVNLA 80 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccCccccchhhcCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHH Confidence 999999999999999999999987642 346899999999999999999999999999999999999999999999 Q ss_pred HHHHhcCC-----CChHHHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccChhhhccc Q lcl|NC_020866. 76 IWKLHSFE-----PGDKIKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQENMKGF 141 (142) Q Consensus 76 ~Y~L~~~~-----~~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r~~l~g~ 141 (142) ||+||.++ .+|++++|||+||+||++|++||++||+++. +.+++++++.+++++|+|||++|||| T Consensus 81 rY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~-~~~~~~~~~~v~~~~r~f~r~~l~gf 150 (150) T protein:vir:10 81 RHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPSG-PATPEPGEMKVRARRRQFDADLLERF 150 (150) T ss_pred HHHHHhcccccCCCCHHHHHHHHHHHHHHHHHhcCcccCCCCCC-CCCCCCceeeeecCCCccChhhccCC Confidence 99999865 3789999999999999999999999998764 45556788999999999999999999 No 4 >protein:vir:1993 Length: 141 # NCBI annotation: Hypothetical protein # Family: family:all:348 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050640;genbank:gi:9633527;genbank:GeneID:2636296 Probab=100.00 E-value=1.7e-52 Score=304.37 Aligned_cols=140 Identities=29% Similarity=0.476 Sum_probs=130.7 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCCcccccHHHHHHHHHHHHHHHh Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPLVETPPQIPEIAISIAIWKLH 80 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (142) |+|||.+||+++||+++|++|++++. ..|++|+++|++||++|+++|||||++||.||++++|.+|+++|||||+|+|+ T Consensus 1 M~Y~T~~Dl~~~~ge~~l~~Lt~d~~-~~g~~d~~~i~~Al~dA~~eIdgyL~~RY~lPl~~~P~~L~~~a~dIA~Y~L~ 79 (141) T protein:vir:19 1 MNYATVNDLCARYTRTRLDILTRPKT-ADGQPDDAVAEQALADASAFIDGYLAARFVLPLTVVPSLLKRQCCVVAWFYLN 79 (141) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcCCC-CccccCHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHh Confidence 99999999999999999999996544 35889999999999999999999999999999999999999999999999999 Q ss_pred cCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCcC-CCCCCCeeeeecCCCccChhhhcccC Q lcl|NC_020866. 81 SFEPGDKIKTDYRDALQALRDIAKGAIKLNATSVEP-ATTGDGGARMTDRERPLTQENMKGFI 142 (142) Q Consensus 81 ~~~~~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~-~~~~~~~~~~~~~~r~F~r~~l~g~~ 142 (142) +++++|++++|||+|++||++|++||++||+++.++ ++++.+.+++++++|+|+|+ +|||| T Consensus 80 ~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~~~~~~~~r~f~r~-~~G~~ 141 (141) T protein:vir:19 80 ESQPTEQITATYRDTVRWLEQVRDGKTDPGVESRTAASPEGEDLVQVQSDPPVFSRK-QKGFI 141 (141) T ss_pred cCCCChHHHHHHHHHHHHHHHHhcCccccCCCCCCCCCCCCCceeEeecCCcccCcc-cccCC Confidence 999999999999999999999999999999887654 34566789999999999996 79999 No 5 >protein:vir:79253 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469165;genbank:gi:157835007;genbank:GeneID:5648834 Probab=100.00 E-value=2.2e-51 Score=298.31 Aligned_cols=137 Identities=24% Similarity=0.363 Sum_probs=130.8 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCCcccccHHHHHHHHHHHHHHHh Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPLVETPPQIPEIAISIAIWKLH 80 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (142) |+|||.+||+++||+++|++|+|++.++++++|+++|++||++|+++|||||++||.||++++|++|+++|||||+|+|+ T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL~~RY~lPl~~vP~~L~~~a~dIA~Y~L~ 80 (138) T protein:vir:79 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLH 80 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHh Confidence 99999999999999999999999988888999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCh-HHHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccChhh Q lcl|NC_020866. 81 SFEPGD-KIKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQEN 137 (142) Q Consensus 81 ~~~~~e-~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r~~ 137 (142) +++.++ .+++|||+|++||++|++||++||+++.+++++++++++|++++|+||||- T Consensus 81 ~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~~~~~~~r~F~Rd~ 138 (138) T protein:vir:79 81 IVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEGRNDWGADW 138 (138) T ss_pred cCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcCCCCCCceeeecCCCCCCCCC Confidence 988765 599999999999999999999999998888888889999999999999654 No 6 >protein:vir:99222 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950460;genbank:gi:119953661;genbank:GeneID:4643082 Probab=100.00 E-value=2.2e-51 Score=298.31 Aligned_cols=137 Identities=24% Similarity=0.363 Sum_probs=130.8 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCCcccccHHHHHHHHHHHHHHHh Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPLVETPPQIPEIAISIAIWKLH 80 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (142) |+|||.+||+++||+++|++|+|++.++++++|+++|++||++|+++|||||++||.||++++|++|+++|||||+|+|+ T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL~~RY~lPl~~vP~~L~~~a~dIA~Y~L~ 80 (138) T protein:vir:99 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLH 80 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHh Confidence 99999999999999999999999988888999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCh-HHHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccChhh Q lcl|NC_020866. 81 SFEPGD-KIKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQEN 137 (142) Q Consensus 81 ~~~~~e-~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r~~ 137 (142) +++.++ .+++|||+|++||++|++||++||+++.+++++++++++|++++|+||||- T Consensus 81 ~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~~~~~~~r~F~Rd~ 138 (138) T protein:vir:99 81 IVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEGRNDWGADW 138 (138) T ss_pred cCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcCCCCCCceeeecCCCCCCCCC Confidence 988765 599999999999999999999999998888888889999999999999654 No 7 >protein:vir:103846 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938245;genbank:gi:38229150;genbank:GeneID:2648161 Probab=100.00 E-value=5.3e-51 Score=296.22 Aligned_cols=137 Identities=27% Similarity=0.387 Sum_probs=131.1 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCCcccccHHHHHHHHHHHHHHHh Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPLVETPPQIPEIAISIAIWKLH 80 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (142) |+|||.+||+++||+++|++|+|++.++.+++|+++|++||++|+++|||||++||.||++++|.+|+++|||||+|+|+ T Consensus 1 M~Y~T~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~~eIdgyL~~RY~lPl~~vP~~L~~~a~dIA~Y~L~ 80 (138) T protein:vir:10 1 MSYCTQADLVEQYGEASIRQLSDRVNKPATTIDPAVVAQAIADADAEIDLHLHARYQLPLAQVPVVLKRVACVLAFANLH 80 (138) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccCCCcCccCHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHh Confidence 99999999999999999999999988888999999999999999999999999999999999999999999999999999 Q ss_pred cCCC-ChHHHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccChhhhccc Q lcl|NC_020866. 81 SFEP-GDKIKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQENMKGF 141 (142) Q Consensus 81 ~~~~-~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r~~l~g~ 141 (142) +++. +|++++|||+|++||++|++||++||+++.+++++++++++|++++|+||| +| T Consensus 81 ~~~~~~e~~~~rY~~Ai~~L~~Ia~G~~~Lg~~~~~~~~~~~~~~~~~s~~r~Fg~----d~ 138 (138) T protein:vir:10 81 TQVKDDHPAILDAERKRKLLGGISSGKLSLALTSSGTPAPIANTVQISSQRNDFGG----TW 138 (138) T ss_pred cCCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCcccCCCCCceeeecCCccCCC----CC Confidence 8865 588999999999999999999999999998888888899999999999996 56 No 8 >protein:vir:43 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463469;swissprot:trembl:q9t1b5;genbank:gi:16798791;uniprot:Q9T1B5;genbank:GeneID:922374 Probab=96.07 E-value=1.1e-05 Score=47.86 Aligned_cols=113 Identities=14% Similarity=0.182 Sum_probs=59.8 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCC-c----ccccHHHHHHHHHHH Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLP-L----VETPPQIPEIAISIA 75 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lP-l----~~~p~~L~~~~~dIA 75 (142) |+|+|.+.+.+.||. ..+.++-.+..+..|+..||.+...|+.-= + +.+|..++..||..+ T Consensus 1 M~Y~d~~~Y~~~y~g--------------~~i~e~~F~~l~~rAs~~ID~~T~~ri~~~~~~~~~~~~~~~vk~A~c~q~ 66 (131) T protein:vir:43 1 MPYTTLEFYNDEYAG--------------EHLEQDEFDKLLKHAERKIDSVTFYRIRKGGIESFSEFIQHQIQLATCNQI 66 (131) T ss_pred CCCCCHHHHHHhhCC--------------CCCCHhHHHHHHHHHHHHHHHHhcccccccCccccchhhHHHHHHHHHHHH Confidence 999999999988853 235566789999999999999999998621 1 357788999999999 Q ss_pred HHHHhcCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccChhhhcccC Q lcl|NC_020866. 76 IWKLHSFEPGDKIKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQENMKGFI 142 (142) Q Consensus 76 ~Y~L~~~~~~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r~~l~g~~ 142 (142) -|.-.....+ +.+..-..-+.-|+.++....... ......+..-......|+ T Consensus 67 e~~~~~g~~s-------~~~~~~~~S~svG~~Svs~~~~~~--------~~~~~~~~~~~~~a~~~L 118 (131) T protein:vir:43 67 EYFKEAGGTS-------ELAVSKPDNVSIGRTSISDSNFAS--------TATSLNSGLIGSDVRSYL 118 (131) T ss_pred HHHHHhHHHh-------hhhccccCeeecCceEEeeccccc--------chhhhchhhhHHHHHHHH Confidence 8764321100 001000111122222221100000 000000001111222222 No 9 >protein:vir:80967 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468394;genbank:gi:157324968;genbank:GeneID:5601402 Probab=95.89 E-value=1.7e-05 Score=46.77 Aligned_cols=98 Identities=16% Similarity=0.097 Sum_probs=58.9 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCC-c----ccccHHHHHHHHHHH Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLP-L----VETPPQIPEIAISIA 75 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lP-l----~~~p~~L~~~~~dIA 75 (142) |+|+|.+.+.+.|+. +.+.++-....+..|+..||.+...|++-- + +.+|..++..||..+ T Consensus 1 M~Y~d~~~Y~~~y~G--------------~~i~e~~F~~l~~rAs~~ID~~T~~ri~~~~~d~~~~~~~~~vk~A~c~q~ 66 (131) T protein:vir:80 1 MPYTTLEFYTNEYAG--------------EHLEQDEFAKLLKHAERKIDSVTFYRIRKSGIEAFSEFIQHQIQLATCNQI 66 (131) T ss_pred CCCCCHHHHHHhhCC--------------CCCchhHHHHHHHHHHHHHHHHhcccccccccccCchhHHHHHHHHHHHHH Confidence 999999999988853 234556689999999999999999998632 1 357788999999999 Q ss_pred HHHHhcCCCCh-----------------------H-----HHHHHHHHHHHHHHHhcCcccCCCCCC Q lcl|NC_020866. 76 IWKLHSFEPGD-----------------------K-----IKTDYRDALQALRDIAKGAIKLNATSV 114 (142) Q Consensus 76 ~Y~L~~~~~~e-----------------------~-----v~~rY~~Ai~~L~~va~G~~~L~~~~~ 114 (142) -|.-.....++ . -...+++|+.||+. .|=.-=|+.-. T Consensus 67 e~~~~~g~~~~~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~--TGLlyrGV~~~ 131 (131) T protein:vir:80 67 EYFKEAGGTSELAVSKPDNVSIGRTSISDSNFASTATSLNSGLVGSDVRSYLAH--TGLLYNGVGVR 131 (131) T ss_pred HHHHHhhhhhhhcccccCeeeeCceEEeeccccchhhhhhhhhhHHHHHHHHhc--cCCeecCCCCC Confidence 87643211000 0 00123333333331 11111111111 No 10 >protein:vir:98481 Length: 136 # NCBI annotation: ORF3 # Family: family:all:32440 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958281;genbank:gi:41057255;uniprot:Q38596;genbank:GeneID:2732865 Probab=95.73 E-value=0.0001 Score=42.51 Aligned_cols=117 Identities=17% Similarity=0.271 Sum_probs=62.1 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCCcccccHHHHHHHHHHHHHHH Q lcl|NC_020866. 1 MP-YTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPLVETPPQIPEIAISIAIWKL 79 (142) Q Consensus 1 M~-YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl~~~p~~L~~~~~dIA~Y~L 79 (142) |+ |||.+|++.|++.. +++ .+.+.+.++.-|++||..|..++ +-+.++-|..++.++|++++=.+ T Consensus 1 M~~fAtv~Dl~~rw~~~----~~d------ee~~ra~~~~lL~dAS~~ir~~~----p~~~~~~~~~~~~V~~~~V~R~~ 66 (136) T protein:vir:98 1 MAAYATVEDYQARAAVT----LPD------GSPRRAQVEAYLDDASALMARHI----PTGHTPDPGTLRAICVAVVRRVM 66 (136) T ss_pred CCccCCHHHHHHHhccC----CCC------chhHHHHHHHHHHHHHHHHHHhC----CCCCCCChhHHHHHHHHHHHHHh Confidence 98 99999999998731 111 11223467778999999988775 44455568999999999997444 Q ss_pred hcCC--CChHHHHHHHHHHHHHHHHhcCcccC--------CCCCCcCCCCCCCeeeeec-------CCCccChhhhcccC Q lcl|NC_020866. 80 HSFE--PGDKIKTDYRDALQALRDIAKGAIKL--------NATSVEPATTGDGGARMTD-------RERPLTQENMKGFI 142 (142) Q Consensus 80 ~~~~--~~e~v~~rY~~Ai~~L~~va~G~~~L--------~~~~~~~~~~~~~~~~~~~-------~~r~F~r~~l~g~~ 142 (142) .... .++.. --|-+...+ .|.+-| |+.. +.-....+...+.- .+-+|+- ||- T Consensus 67 ~np~G~~s~Ta-G~ys~s~t~-----~G~Lylt~~E~~~Lg~~r-qr~~~~d~a~si~~~~~~~~~~~dp~~~----~~~ 135 (136) T protein:vir:98 67 ANPGGYRQRTI-GQYAETLGE-----DGGLYLTEDEKGQLQPPD-QTAPDADAAYSLDLDPGTRAWVDDPAGC----GWP 135 (136) T ss_pred hCCCCcccccc-hhHHHhhhc-----CCCcccChHHHHHhCCCC-CcccccccceecccCCCcCCcCCCCCCC----CCC Confidence 3211 11221 145555544 355322 2211 00001111122211 1223332 444 No 11 >protein:vir:98900 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:3882 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164420;genbank:gi:56694910;genbank:GeneID:3197290 Probab=94.60 E-value=0.00085 Score=37.42 Aligned_cols=114 Identities=13% Similarity=0.136 Sum_probs=59.9 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCC-c----ccccHHHHHHHHHHH Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLP-L----VETPPQIPEIAISIA 75 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lP-l----~~~p~~L~~~~~dIA 75 (142) |+|+|.+.+.+ ++. +.++++..+..+.+|+..||.+...||.-. + +.++..++..+|-.+ T Consensus 1 M~Y~t~~~Y~~-~~G--------------~~i~e~~F~~l~~rAs~~ID~iT~~ri~~~~~~~d~~~~~~~vk~A~c~qi 65 (132) T protein:vir:98 1 MPYLTYEEFMD-LNG--------------RDIDDKKFEKLLPKASAIIDGVTGHFYQKVDMEKDNAWRVNQFKLALCAQI 65 (132) T ss_pred CCCCCHHHHHh-hcC--------------CCCCHHHHHHHHHHHHHHHHHHhcccccCCCccccChHHHHHHHHHHHHHH Confidence 99999999876 431 235667799999999999999999998742 2 234456777887666 Q ss_pred HHHHhcCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccChhhhcccC Q lcl|NC_020866. 76 IWKLHSFEPGDKIKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQENMKGFI 142 (142) Q Consensus 76 ~Y~L~~~~~~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r~~l~g~~ 142 (142) -|.-..... .++.+-.-+.-+.-|+.++....... +.......+++ +. ....|+ T Consensus 66 ey~~~~G~~------sae~~~~~~~S~svG~~Svs~~s~~~-----~~~~~~~~~~~-~~-~a~~~L 119 (132) T protein:vir:98 66 EYFDALGAT------TFEEINNSPQTFQAGRTSVSNASRYN-----PSGANESKPLV-AE-DVYIYL 119 (132) T ss_pred HHHHhccch------hhhhccCccceeeeCcEEEEeeccCC-----cccccccccch-HH-HHHHHH Confidence 554211100 11222222344455555554211100 00001111111 11 123333 No 12 >protein:vir:94761 Length: 132 # NCBI annotation: unknown # Family: family:all:1271 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996708;genbank:gi:45597423;genbank:GeneID:2769029 Probab=94.01 E-value=0.0011 Score=36.90 Aligned_cols=102 Identities=9% Similarity=0.022 Sum_probs=55.2 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCCc------c-cccHHHHHHHH Q lcl|NC_020866. 1 MP-YTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPL------V-ETPPQIPEIAI 72 (142) Q Consensus 1 M~-YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl------~-~~p~~L~~~~~ 72 (142) |. |||.+|+..+++ +|+++ ..++++.-|++|+..|..=.-.++.-|. . ..+.+++++|| T Consensus 1 m~~fAtv~Dl~~r~r-----~L~~d--------E~~ra~~LL~dAs~~iR~~~~~~~~~~~~~~~~~~d~~~~~~k~V~~ 67 (132) T protein:vir:94 1 MNPFATVDDLTMLWR-----PLKGD--------EKERAEKLLEIVSDTLREEADKVGRDLDVMISEKPSYFSSVVKSVTV 67 (132) T ss_pred CCCcCCHHHHHHHhc-----cCChh--------HHHHHHHHHHHHHHHHHHHHhhhccccccccCCCCccchhHHHHHHH Confidence 87 999999999985 44432 1478999999999999766555444221 1 13578999999 Q ss_pred HHHHHHHhcCCCChHHHH------HHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccChhhhcccC Q lcl|NC_020866. 73 SIAIWKLHSFEPGDKIKT------DYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQENMKGFI 142 (142) Q Consensus 73 dIA~Y~L~~~~~~e~v~~------rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r~~l~g~~ 142 (142) ++++=-|......+.+.+ .|-+...| ...++.=-|++. .+..+ T Consensus 68 ~~V~Ral~~~~~~~g~tq~S~TaG~ys~S~T~--------------------------~np~G~lylt~~-e~~~L 116 (132) T protein:vir:94 68 DIVARTLMTSTDQEPMTQTTESALGYSVSGSY--------------------------LVPGGGLFIKNS-ELSRL 116 (132) T ss_pred HHHHHHhcCCCCCCCceeeeeecccceeeeee--------------------------ecCCCCceeChH-HHHhh Confidence 999866654322211100 11111111 000111112221 11111 No 13 >protein:vir:9761 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795523;genbank:gi:28876281;genbank:GeneID:1257822 Probab=93.93 E-value=0.0005 Score=38.69 Aligned_cols=115 Identities=13% Similarity=0.127 Sum_probs=63.2 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhh-cCCCc-----ccccHHHHHHHHH Q lcl|NC_020866. 1 MP-YTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTR-YVLPL-----VETPPQIPEIAIS 73 (142) Q Consensus 1 M~-YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~R-Y~lPl-----~~~p~~L~~~~~d 73 (142) |. |||.+|+..++. +|+++ ..++++.-|++|+..|...+-.. +.+|. ...+.+++.+||+ T Consensus 1 m~~fATv~Dv~~rwr-----~Lt~d--------E~~ra~~LL~dAS~~iR~~~p~~g~~~~~~~~~~~~~~~~~k~V~~~ 67 (140) T protein:vir:97 1 MGNFATTDDVILLWR-----PLSVD--------ELKRANALLKVVSDTLRMEADKVGKDLDKTMVDKPYFVNVIKSVTVD 67 (140) T ss_pred CCcCCCHHHHHHHhc-----CCCHh--------HHHHHHHHHHHHHHHHHHhhhhccCCcchhcccCccchhHHHHHHHH Confidence 88 999999999984 23322 14689999999999998877643 55553 2335789999999 Q ss_pred HHHHHHhcCCCChH------HHHHHHHHHHHHHHHhcCcccC--------CCCCCcCCCCCCCeeee----ecCCCccCh Q lcl|NC_020866. 74 IAIWKLHSFEPGDK------IKTDYRDALQALRDIAKGAIKL--------NATSVEPATTGDGGARM----TDRERPLTQ 135 (142) Q Consensus 74 IA~Y~L~~~~~~e~------v~~rY~~Ai~~L~~va~G~~~L--------~~~~~~~~~~~~~~~~~----~~~~r~F~r 135 (142) |.+=-|-....++. ---.|-+...|+ ...|.+-| |+.. ..-+.+.+ .-.+-.|+| T Consensus 68 mV~Ral~~~~d~~G~tq~S~TaG~ys~S~T~~--np~G~lylt~~e~~~LGl~~-----~r~~~i~~~g~~~~~~~~~~~ 140 (140) T protein:vir:97 68 IVARTLMTSTQGEPMSQESQSALGYTWSGTYL--VPGGGLFIKDNELKRLGLKK-----QRYGGIELYGEIKRDNDYFDR 140 (140) T ss_pred HHHHHhcCCCCCCcceeeeeeccchhheeeee--cCCCCceeChHHHHHhCCCC-----CceeeecccCccccCcccccC Confidence 98765543211111 122344443332 22333222 2110 00011111 123456766 No 14 >protein:vir:2505 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:28222 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569746;genbank:gi:18496896;genbank:GeneID:932265 Probab=92.53 E-value=0.00018 Score=41.06 Aligned_cols=112 Identities=13% Similarity=0.243 Sum_probs=59.5 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCCcccccHHHHHHHHHHHHHHHh Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPLVETPPQIPEIAISIAIWKLH 80 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (142) -+++|.+|+..+++. .|++. +...+..-|++|+..|++||. +|.+|- +.|..++++||.|+.=-|. T Consensus 5 ~alAtvdDv~~~lrr----~Lt~d--------E~~~a~~Ll~eAsdlI~g~l~-~~~vp~-~~p~~v~rVvA~ivarAlt 70 (128) T protein:vir:25 5 KALATSQDVKRALRR----DLTEA--------EQTDLSELLAEATDLVVGYLH-PYPVPT-PTPGPIKRVVASMVAAVLT 70 (128) T ss_pred hhccCHHHHHHHhcC----CCCHH--------HHHHHHHHHhcchheeeeecC-CCCCCC-CCCchHHHHHHHHHHHHhh Confidence 358999999999875 34432 345677789999999999997 666665 5677889999999876665 Q ss_pred cCC--CChHHHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccChhhhcccC Q lcl|NC_020866. 81 SFE--PGDKIKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQENMKGFI 142 (142) Q Consensus 81 ~~~--~~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r~~l~g~~ 142 (142) ... .++ .+-+.+|-++-+. +....+++.-.++..+..=|--.+|-+ T Consensus 71 r~~~~~pe------------~~S~TAgpfs~~f----t~~~~~~g~yLTaa~k~~Lrp~R~~~~ 118 (128) T protein:vir:25 71 RPTQILPE------------TQSLTADGFGVTF----TPGGNSPGPYLSAALKQRLRPYRTGMV 118 (128) T ss_pred CCCccCCC------------ceeeecccccccc----cCCCCCCCceEcHHHHhhcccccceee Confidence 432 111 1112223222111 111111222222211111000011111 No 15 >protein:vir:2432 Length: 124 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046834;genbank:gi:9630402;genbank:GeneID:1261576 Probab=92.44 E-value=0.0014 Score=36.19 Aligned_cols=116 Identities=13% Similarity=0.110 Sum_probs=62.2 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCC-----CcccccHHHHHHHHHHH Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVL-----PLVETPPQIPEIAISIA 75 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~l-----Pl~~~p~~L~~~~~dIA 75 (142) |+|||.+|+.++++. .|++ ...+.++.-|++|+..|-. |++- .....++.++.++|++. T Consensus 1 ~~~At~~Dv~~rw~r----~Lt~--------~E~~~ve~lL~dAs~~ir~----r~P~l~~~~~~~~~~~~v~~V~a~~V 64 (124) T protein:vir:24 1 MAYATADDVVTLWAK----EPEP--------EVMALIERRLEQVERMIRR----RIPDLDARVSSDIFRADLIDIEADAV 64 (124) T ss_pred CCCCCHHHHHHHhCC----CCCH--------HHHHHHHHHHHHHHHHHHh----cCCCcchhcCCCCChhhHHHHHHHHH Confidence 999999999999853 2222 1345688999999999874 5552 22345778999999987 Q ss_pred HHHHhcCCC-ChHHHHHHHHHHHHHHHHhcCcccCCCCCCcC--CCCCCCeeeeecCCCccC Q lcl|NC_020866. 76 IWKLHSFEP-GDKIKTDYRDALQALRDIAKGAIKLNATSVEP--ATTGDGGARMTDRERPLT 134 (142) Q Consensus 76 ~Y~L~~~~~-~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~--~~~~~~~~~~~~~~r~F~ 134 (142) .=-+....+ ..+-.-.|-+.+.+ ....|++-|.-.+-.. ..-..+...++.+.-.=+ T Consensus 65 ~R~~rnP~G~~s~T~G~Ys~sl~~--~~~~g~Lylt~~E~~~Lg~~r~~~~~~i~p~~~~~~ 124 (124) T protein:vir:24 65 LRLVRNPEGYLSETDGAYTYQLQA--DLSQGKLVILDEEWTTLGVNRLSRMSTLVPNIVMPT 124 (124) T ss_pred HHHhhCCCCceecccchhHHhhhh--cccCCceeeCHHHHHhhCcccccceeEeecceeeCC Confidence 755432111 11112456666665 4455665442111000 111122222222111111 No 16 >protein:vir:9576 Length: 131 # NCBI annotation: gp42 # Family: family:all:1271 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862881;genbank:gi:32469473;genbank:GeneID:1461318 Probab=91.79 E-value=0.0027 Score=34.70 Aligned_cols=102 Identities=10% Similarity=0.084 Sum_probs=55.8 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhc------CCCcccccHHHHHHHHH Q lcl|NC_020866. 1 MP-YTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRY------VLPLVETPPQIPEIAIS 73 (142) Q Consensus 1 M~-YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY------~lPl~~~p~~L~~~~~d 73 (142) |. |||.+|+..++. +|+++ ..++++.-|++|+..|..-+-... ..+-+..+..++.+||+ T Consensus 1 m~~fAtv~D~~~rwr-----~Lt~~--------E~~ra~~LL~~As~~ir~~~p~~~~~l~~~~~~~~~~~~~~~~V~~~ 67 (131) T protein:vir:95 1 MENFATVEDLKKLWR-----ALKFD--------EEKRAEALLEVVSHSLRVEAKKVGKDLDGLVATDPSFTMVVKSVTVD 67 (131) T ss_pred CCccCCHHHHHHHhc-----CCCHH--------HHHHHHHHHHHHHHHHHHhhhhccCCccccccCCccchHHHHHHHHH Confidence 87 999999999984 23322 245899999999999987654322 12234457899999999 Q ss_pred HHHHHHhcCCCChHHH------HHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccChhhhcccC Q lcl|NC_020866. 74 IAIWKLHSFEPGDKIK------TDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQENMKGFI 142 (142) Q Consensus 74 IA~Y~L~~~~~~e~v~------~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r~~l~g~~ 142 (142) +++.-|.....++.+. -.|-+...|+ ...|.+ -|+++. +..+ T Consensus 68 ~V~Ral~~~~~~~G~tq~S~TaG~ys~S~t~~--~p~g~l------------------------ylt~~e-~~~L 115 (131) T protein:vir:95 68 VVARTLMTSTDQEPMTQVAESALGYSFSGSYL--VPGGGL------------------------FIKDSE-LKRL 115 (131) T ss_pred HHHHHhcCCCCCCCceeeeeecccceeeeeee--cCCCCc------------------------eeChHH-HHHh Confidence 9998876443222110 0111111111 011111 111111 1111 No 17 >protein:vir:1640 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695061;genbank:gi:23455752;genbank:GeneID:955488 Probab=91.39 E-value=0.0015 Score=36.09 Aligned_cols=115 Identities=11% Similarity=0.114 Sum_probs=55.4 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcC-C---Cc---ccccHHHHHHHH Q lcl|NC_020866. 1 MP-YTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYV-L---PL---VETPPQIPEIAI 72 (142) Q Consensus 1 M~-YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~-l---Pl---~~~p~~L~~~~~ 72 (142) |. |||.+|+..+|+ .|+++ ..++++.-|++|+..|..=+-.+.. + +. ...+..++++|| T Consensus 1 m~~fAtv~Dv~~r~r-----~L~~~--------E~~ra~~lL~dAs~~ir~~~p~~~~~l~a~~~e~~~~~~~~~~~V~~ 67 (132) T protein:vir:16 1 MNPFATVDDLTMLWR-----PLKGD--------EKERAEKLLEIVSDSLREEADKVGRDLYAMIAEKPSYFASVVKSVTV 67 (132) T ss_pred CCccCCHHHHHHHhc-----CCCHh--------HHHHHHHHHHHHHHHHHHhhhhhccccccccccccccchhHHHHHHH Confidence 88 999999999985 34422 2468999999999999765543322 2 11 123567999999 Q ss_pred HHHHHHHhcCCCChHH------HHHHHHHHHHHHHHhcCcccCCCCCC---cCCCCCCCeeeeecCC Q lcl|NC_020866. 73 SIAIWKLHSFEPGDKI------KTDYRDALQALRDIAKGAIKLNATSV---EPATTGDGGARMTDRE 130 (142) Q Consensus 73 dIA~Y~L~~~~~~e~v------~~rY~~Ai~~L~~va~G~~~L~~~~~---~~~~~~~~~~~~~~~~ 130 (142) ++++=-|......+.+ --.|-+...|+ ...|.+-|.-.+- +-....-+.+-+.... T Consensus 68 ~~V~Ral~~~~~~~G~tq~S~TaG~ys~S~t~~--~p~G~lylt~~e~~~LG~~~~r~~~i~~~~~~ 132 (132) T protein:vir:16 68 DIVARTLMTSTDQEPMTQTTESALGYSVSGSYL--VPGGGLFIKNSELSRLGLKKQRFGVIDFYGND 132 (132) T ss_pred HHHHHHhcCCCCCCCceeeeeeccchheeeeee--cCCCcceeChHHHHhhCCCCCceEEEeecCCC Confidence 9988766543222211 11222222221 1122211100000 0000000001111111 No 18 >protein:vir:7773 Length: 123 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817607;genbank:gi:29566037;genbank:GeneID:1259231 Probab=90.83 E-value=0.004 Score=33.71 Aligned_cols=117 Identities=18% Similarity=0.259 Sum_probs=59.7 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCCc-cccc---HHHHHHHHHHHH Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPL-VETP---PQIPEIAISIAI 76 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl-~~~p---~~L~~~~~dIA~ 76 (142) |+|||.+|+.++++. .||+ ...+.++.-|++|+..|-.-+- .|+- ..-| +.++.++|++.. T Consensus 1 ~~~At~~Dv~ar~~r----~LT~--------~E~~~ve~lL~dAs~~ir~r~P---~l~~~a~d~~~~~~~~~V~~~~V~ 65 (123) T protein:vir:77 1 MPYATASDVTSRWAR----QPTD--------EETALINVRLADVERMIKRRIP---DLATKVTDPDYLEDLKQVEADAVL 65 (123) T ss_pred CCcCCHHHHHHHhCC----CCCH--------HHHHHHHHHHHHHHHHHHHhcc---CcccccCCcchhHHHHHHHHHHHH Confidence 999999999999853 2232 1345688999999999877332 2221 1233 678899998876 Q ss_pred HHHhcCCC-ChHHHHHHHHHHHHHHHHhcCcccCCCCCCcC-CCCCCCeeeeecCCCccC Q lcl|NC_020866. 77 WKLHSFEP-GDKIKTDYRDALQALRDIAKGAIKLNATSVEP-ATTGDGGARMTDRERPLT 134 (142) Q Consensus 77 Y~L~~~~~-~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~-~~~~~~~~~~~~~~r~F~ 134 (142) =-+....+ ..+-.-.|-+.+.+ ....|++-|.-.+-.. ..+.++...+...+..=+ T Consensus 66 R~~rnpeG~~s~T~G~ys~sl~~--a~~~g~Lylt~~E~~~Lg~~~~~~~~i~p~~~~~~ 123 (123) T protein:vir:77 66 RLVRNPEGYLSETDGNYTYMLRS--DLASGKLEIFPEEWEILGYRRSRMTVIVPNPVMPT 123 (123) T ss_pred HHhhCCCCceecccchhhhhhcc--cCCCCcceeCHHHHHhhcCCCCceeEEeeceecCC Confidence 54422111 01111356666554 4556665442111000 011111222222111111 No 19 >protein:vir:1329 Length: 122 # NCBI annotation: gp37 # Family: family:all:11657 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047928;swissprot:trembl:q9zxb0;genbank:gi:9631146;uniprot:Q9ZXB0;genbank:GeneID:2715909 Probab=89.65 E-value=0.0058 Score=32.86 Aligned_cols=113 Identities=19% Similarity=0.162 Sum_probs=68.7 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCCcccccHHHHHHHHHHHHHHHh Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPLVETPPQIPEIAISIAIWKLH 80 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (142) |+|+|.++|.+.-|-++-. ....+.+..||+-....+..|++...+..-.+.|..++-+...+||-+.- T Consensus 1 mayatieelraldglddsa-----------lfsdellsdaidfsvetveaycgrkwdtaedptpetirwcvrtlarqyvl 69 (122) T protein:vir:13 1 MAYATIEELRALDGLDDSA-----------LFSDELLSDAIDFSVETVEAYCGRKWDTAEDPTPETIRWCVRTLARQYVL 69 (122) T ss_pred CcchhhhhhhhhcCccchh-----------hhhhhhhhhhhhhhhhhhhhhhCcccCCcCCCChhHHHHHHHHHHHHHHH Confidence 9999999988766643322 34456788999999999999999999999999999998888899995543 Q ss_pred --cCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCe-------eeeecCCCccC Q lcl|NC_020866. 81 --SFEPGDKIKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGG-------ARMTDRERPLT 134 (142) Q Consensus 81 --~~~~~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~-------~~~~~~~r~F~ 134 (142) -.+.++. |+..- -.=|.+.|...+.+=.+.+-.+ .++. -|-.|- T Consensus 70 dhvsripdr-------alqlq--sefgsiqlaqaggnwrptslpevnaklnlyrvr-lpfifm 122 (122) T protein:vir:13 70 DHVSRIPDR-------ALQLQ--SEFGSIQLAQAGGNWRPTSLPEVNAKLNLYRVR-LPFIFM 122 (122) T ss_pred HHhhhcchh-------hhhhh--hcccceeeeccCCCcccCcccccccceeeeeee-cceeeC Confidence 2344442 22211 1235555533222211111111 1111 122232 No 20 >protein:vir:78478 Length: 149 # NCBI annotation: gp16 # Family: family:all:2817 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491587;genbank:gi:157786410;genbank:GeneID:5625630 Probab=88.31 E-value=0.0042 Score=33.60 Aligned_cols=123 Identities=19% Similarity=0.275 Sum_probs=60.7 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCCc-cccc---HHHHHHHHHHHH Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPL-VETP---PQIPEIAISIAI 76 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl-~~~p---~~L~~~~~dIA~ 76 (142) |+|||.+|+.++++. .||++ ..++++.-|++|+..|-.-+- .|+- .+.| +.++.++|++.+ T Consensus 1 ~afAtv~Dve~rw~r----~LT~e--------E~~~ae~lL~dAs~~IR~~iP---~La~~~~dp~~~a~v~~V~~~mV~ 65 (149) T protein:vir:78 1 MAYAEPSDVVARLGR----PLTDD--------EETQVETFLEDAEIEIRSRIP---DLDDKAEDEDYLKRVIKVEASAVT 65 (149) T ss_pred CCcCCHHHHHHHhCC----CCCHH--------HHHHHHHHHHHHHHHHHHhcc---ccccccCCcchhhHHHHHHHHHHH Confidence 999999999999853 22321 245799999999999877441 2221 1223 578899999887 Q ss_pred HHHhcCCC--ChHHHHHHHHHHHHHHHHhcCcccC--------CCCCC---------cCCCCCCCeeeeecCC-CccChh Q lcl|NC_020866. 77 WKLHSFEP--GDKIKTDYRDALQALRDIAKGAIKL--------NATSV---------EPATTGDGGARMTDRE-RPLTQE 136 (142) Q Consensus 77 Y~L~~~~~--~e~v~~rY~~Ai~~L~~va~G~~~L--------~~~~~---------~~~~~~~~~~~~~~~~-r~F~r~ 136 (142) =-+....+ ++. .-.|-+.+.+ ....|++-| |+... .+-..+..-..|.+-. ++|-.. T Consensus 66 R~~rnpeG~~S~T-~G~YS~slt~--~np~G~LylT~~E~a~LG~~r~~G~~~i~p~~~~~~~~~~~~~~~~~~~~~~~~ 142 (149) T protein:vir:78 66 RLIRNPDGYIGET-DGNYSYQLNW--RLNTGAIEITDKEWAQLGLSKNVGVLNVRPKTPLERSGEYPAFGSVEWQVFQQS 142 (149) T ss_pred HHhcCCCCeeeee-cchhhhhhhc--cCCCCceeeCHHHHHhhCCcccccceeecccCccccCCCCCcccceeeeeeecc Confidence 65532211 111 1355555544 223344322 21110 0000011112222221 344332 Q ss_pred h--hccc Q lcl|NC_020866. 137 N--MKGF 141 (142) Q Consensus 137 ~--l~g~ 141 (142) + --|| T Consensus 143 ~~~~~~~ 149 (149) T protein:vir:78 143 SPLYWGY 149 (149) T ss_pred CcccccC Confidence 2 1344 No 21 >protein:vir:78254 Length: 149 # NCBI annotation: gp16 # Family: family:all:2817 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491668;genbank:gi:157786492;genbank:GeneID:5625732 Probab=88.31 E-value=0.0042 Score=33.60 Aligned_cols=123 Identities=19% Similarity=0.275 Sum_probs=60.7 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCCc-cccc---HHHHHHHHHHHH Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPL-VETP---PQIPEIAISIAI 76 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl-~~~p---~~L~~~~~dIA~ 76 (142) |+|||.+|+.++++. .||++ ..++++.-|++|+..|-.-+- .|+- .+.| +.++.++|++.+ T Consensus 1 ~afAtv~Dve~rw~r----~LT~e--------E~~~ae~lL~dAs~~IR~~iP---~La~~~~dp~~~a~v~~V~~~mV~ 65 (149) T protein:vir:78 1 MAYAEPSDVVARLGR----PLTDD--------EETQVETFLEDAEIEIRSRIP---DLDDKAEDEDYLKRVIKVEASAVT 65 (149) T ss_pred CCcCCHHHHHHHhCC----CCCHH--------HHHHHHHHHHHHHHHHHHhcc---ccccccCCcchhhHHHHHHHHHHH Confidence 999999999999853 22321 245799999999999877441 2221 1223 578899999887 Q ss_pred HHHhcCCC--ChHHHHHHHHHHHHHHHHhcCcccC--------CCCCC---------cCCCCCCCeeeeecCC-CccChh Q lcl|NC_020866. 77 WKLHSFEP--GDKIKTDYRDALQALRDIAKGAIKL--------NATSV---------EPATTGDGGARMTDRE-RPLTQE 136 (142) Q Consensus 77 Y~L~~~~~--~e~v~~rY~~Ai~~L~~va~G~~~L--------~~~~~---------~~~~~~~~~~~~~~~~-r~F~r~ 136 (142) =-+....+ ++. .-.|-+.+.+ ....|++-| |+... .+-..+..-..|.+-. ++|-.. T Consensus 66 R~~rnpeG~~S~T-~G~YS~slt~--~np~G~LylT~~E~a~LG~~r~~G~~~i~p~~~~~~~~~~~~~~~~~~~~~~~~ 142 (149) T protein:vir:78 66 RLIRNPDGYIGET-DGNYSYQLNW--RLNTGAIEITDKEWAQLGLSKNVGVLNVRPKTPLERSGEYPAFGSVEWQVFQQS 142 (149) T ss_pred HHhcCCCCeeeee-cchhhhhhhc--cCCCCceeeCHHHHHhhCCcccccceeecccCccccCCCCCcccceeeeeeecc Confidence 65532211 111 1355555544 223344322 21110 0000011112222221 344332 Q ss_pred h--hccc Q lcl|NC_020866. 137 N--MKGF 141 (142) Q Consensus 137 ~--l~g~ 141 (142) + --|| T Consensus 143 ~~~~~~~ 149 (149) T protein:vir:78 143 SPLYWGY 149 (149) T ss_pred CcccccC Confidence 2 1344 No 22 >protein:vir:6243 Length: 122 # NCBI annotation: gp37 # Family: family:all:11657 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813697;swissprot:trembl:q859c0;genbank:gi:29366757;uniprot:Q859C0;genbank:GeneID:1258898 Probab=86.02 E-value=0.013 Score=30.99 Aligned_cols=113 Identities=19% Similarity=0.166 Sum_probs=66.9 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCCcccccHHHHHHHHHHHHHHHh Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPLVETPPQIPEIAISIAIWKLH 80 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (142) |+|+|.++|.+.-|-++ ......+.+..||+-....+.-|+++..+..-.+.|.+++-+...+||-+.- T Consensus 1 mayatieelralegidd-----------aslfpdellsdaidfsvetvevycgqkwdtaenptpevirwcvrtlarqyvl 69 (122) T protein:vir:62 1 MAYATIEELRALEGIDD-----------ASLFPDELLSDAIDFSVETVEVYCGQKWDTAENPTPEVIRWCVRTLARQYVL 69 (122) T ss_pred CccchhhhhHhhccccc-----------cccchhhhhhhhhhhhhhhhhhhcCcccCCcCCCchHHHHHHHHHHHHHHHH Confidence 99999998876544222 1234456788999999999999999999999999999988888889995543 Q ss_pred --cCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCc---CCCCCC----CeeeeecCCCccC Q lcl|NC_020866. 81 --SFEPGDKIKTDYRDALQALRDIAKGAIKLNATSVE---PATTGD----GGARMTDRERPLT 134 (142) Q Consensus 81 --~~~~~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~---~~~~~~----~~~~~~~~~r~F~ 134 (142) -.+.++. |+..- -.=|.+.|...+.. ++-+.- +..++. -|-.|- T Consensus 70 dhvsripdr-------alqlq--sefgsiqlaqaggtwrptslpevnaklnlyrvr-lpfifm 122 (122) T protein:vir:62 70 DHVSRIPDR-------ALQLQ--SEFGSIQLAQAGGTWRPTSLPEVNAKLNLYRVR-LPFIFM 122 (122) T ss_pred HHhhhcchh-------hhhhh--hcccceeeeccCCccccCcCcccccceeeeEee-cceeeC Confidence 2344442 22211 12255555322211 111111 111111 122232 No 23 >protein:vir:94955 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239280;genbank:gi:66392062;genbank:GeneID:5076600 Probab=78.64 E-value=0.019 Score=30.01 Aligned_cols=118 Identities=14% Similarity=0.163 Sum_probs=65.8 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHH---HHhhhc------------------CCC Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDG---YLGTRY------------------VLP 59 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~---YL~~RY------------------~lP 59 (142) =+|+|.+|..+-+...-. .+ .....|.+..+.||-.|+.-||+ |++.|- .+| T Consensus 14 nSYvtv~ea~aY~~~r~~---~~----~w~~~~~~~~e~aL~~A~dyId~~~~f~G~r~~~~Q~l~wPR~g~~~dg~~~~ 86 (170) T protein:vir:94 14 NSYVTVAEANSYFDGSYG---RP----LWTSASEDEKASLVISASRYLDQMMAWIGAPTNPEQSMWWPCKNAVIGGMTLS 86 (170) T ss_pred cceecHHHHHHHHHhhcc---cc----ccCCCCHHHHHHHHHHHHHHhccccccccccCCcchhhcccccCcccCccccc Confidence 569999998766544321 11 12346788899999999999996 333321 135 Q ss_pred cccccHHHHHHHHHHHHHHHhcCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccChhhhc Q lcl|NC_020866. 60 LVETPPQIPEIAISIAIWKLHSFEPGDKIKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQENMK 139 (142) Q Consensus 60 l~~~p~~L~~~~~dIA~Y~L~~~~~~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r~~l~ 139 (142) ...+|..++..||-+|.+.+.......... .+++ -++| |.++..-....+. .+.+ ..+ +.-|+ T Consensus 87 ~~~IP~~V~~Aq~elA~~~~~~~~~~~~~~----~~v~-~~kV--G~i~veY~~~~~~---~~~~-----~~v--~~LL~ 149 (170) T protein:vir:94 87 QVSIPVKVKIAVFELAYFMLESGAALSFAD----QTID-SVKV--GTIRVEFTKNSTD---AGLP-----TFV--EAMLS 149 (170) T ss_pred cchhhHHHHHHHHHHHHHHHhCcccCcccc----ccee-eEec--ceeEEEecCCCCC---CccH-----HHH--HHHhh Confidence 567899999999999998886543221111 1121 1333 5554432211110 0100 111 34467 Q ss_pred ccC Q lcl|NC_020866. 140 GFI 142 (142) Q Consensus 140 g~~ 142 (142) +|+ T Consensus 150 p~l 152 (170) T protein:vir:94 150 GFG 152 (170) T ss_pred hhh Confidence 777 No 24 >protein:vir:4228 Length: 125 # NCBI annotation: predicted 14.0Kd protein # Family: family:all:2817 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039683;swissprot:sw:q05225;genbank:gi:9625449;uniprot:Q05225;genbank:GeneID:2942926 Probab=76.04 E-value=0.047 Score=27.85 Aligned_cols=116 Identities=14% Similarity=0.142 Sum_probs=60.5 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCC-----cccccHHHHHHHHHHH Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLP-----LVETPPQIPEIAISIA 75 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lP-----l~~~p~~L~~~~~dIA 75 (142) |+|||.+|..++++. .|++ -....|+.=|++|+..|-..+= +|+ .+..++.++.++++.. T Consensus 1 m~~A~~eDV~a~w~r----~lt~--------~e~~~v~~~L~~Ae~~Ir~riP---dL~~r~~~~~~~~~~v~~Vea~aV 65 (125) T protein:vir:42 1 MAYATAEDVVTLWAK----EPEP--------EVMALIERRLQQIERMIKRRIP---DLDVKAAASATFRADLIDIEADAV 65 (125) T ss_pred CCcccHhHHHHHhCC----CCCh--------HHHHHHHHHHHHHHHHHHHhCC---CchhhhcccCcchhhHHHHHHHHH Confidence 999999999999863 2232 1356788889999998854431 122 3445677777777665 Q ss_pred HHHHhcCC-C-ChHHHHHHHHHHHHHHHHhcCcccCCCCCCc--CCCCCCCeeeeecCCCccC Q lcl|NC_020866. 76 IWKLHSFE-P-GDKIKTDYRDALQALRDIAKGAIKLNATSVE--PATTGDGGARMTDRERPLT 134 (142) Q Consensus 76 ~Y~L~~~~-~-~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~--~~~~~~~~~~~~~~~r~F~ 134 (142) += |..+. + ..+-.-.|-.-+.+ +.+.|++-+--++-+ .++...+.+.+....-.=| T Consensus 66 ~R-v~RNpeGy~s~T~G~Ys~~l~~--~~~~g~L~it~eEw~~L~p~~~~g~~~i~P~~~~~~ 125 (125) T protein:vir:42 66 LR-LVRNPEGYLSETDGAYTYQLQA--DLSQGKLTILDEEWEILGVNSQKRMAVIVPNVVMPT 125 (125) T ss_pred HH-HHhCCCccccccchhHHHhhhc--ccccCceeeCHHHHHhhCccccccceeecccceeCC Confidence 53 43321 1 01111345555544 667788765322111 1111223333332211111 No 25 >protein:vir:80389 Length: 172 # NCBI annotation: BcepGomrgp09 # Family: family:all:703 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210229;genbank:gi:146329921;genbank:GeneID:5123498 Probab=75.56 E-value=0.031 Score=28.88 Aligned_cols=120 Identities=15% Similarity=0.232 Sum_probs=60.6 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHH----HHhhhc------------------CC Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDG----YLGTRY------------------VL 58 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~----YL~~RY------------------~l 58 (142) =+|+|.+++.+.+... ..+++.+-.+.||..|+..||+ |.+.|- .+ T Consensus 15 nSYvt~~~a~aY~~~r------------g~~~~~d~~e~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~g~~~ 82 (172) T protein:vir:80 15 NTYAGADFVIAYAQAR------------GVTVDADEAERLILEAMDYIESFRRRWKGERNTREQGLTWPRHDAVVDGFVI 82 (172) T ss_pred cccccHHHHHHHHHHc------------CCCcCHHHHHHHHHHHHHHHhhccCccccccCCccccccccccCcccCcccc Confidence 5699999998776432 1133444579999999999999 333321 24 Q ss_pred CcccccHHHHHHHHHHHHHHHhcCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCC-C--ccCh Q lcl|NC_020866. 59 PLVETPPQIPEIAISIAIWKLHSFEPGDKIKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRE-R--PLTQ 135 (142) Q Consensus 59 Pl~~~p~~L~~~~~dIA~Y~L~~~~~~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~-r--~F~r 135 (142) |...+|..|+..||-+|.+.+.......... ..++ --++| |.++..-.. ....+..-..++. . ++=. T Consensus 83 ~~~~IP~~v~~A~~elA~~~~~g~~~~~~~~---~~~v-~~ekV--G~i~~eY~~----~~~~~~~~~~~~~~~~~~~v~ 152 (172) T protein:vir:80 83 PSDVIPKELQSAVAAAVIEQVNGFELQQSQD---QWAV-RIEKV--DVIEVQYAA----GGGGQSASANAPMKPTFPKID 152 (172) T ss_pred cccchhHHHHHHHHHHHHHHhcCCccCcCCC---Ccee-eEEec--cceEEeeec----ccCccccccccCCccchHHHH Confidence 5567899999999999976554322111111 0111 11122 332221110 0000101111111 1 1223 Q ss_pred hhhcccC Q lcl|NC_020866. 136 ENMKGFI 142 (142) Q Consensus 136 ~~l~g~~ 142 (142) .-|++|+ T Consensus 153 ~LL~p~l 159 (172) T protein:vir:80 153 ALLNPLL 159 (172) T ss_pred HHHhhhh Confidence 4567777 No 26 >protein:vir:104088 Length: 125 # NCBI annotation: gp20 # Family: family:all:2817 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655599;genbank:gi:109392470;genbank:GeneID:4156956 Probab=73.04 E-value=0.048 Score=27.80 Aligned_cols=116 Identities=11% Similarity=0.123 Sum_probs=58.2 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCC-----cccccHHHHHHHHHHH Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLP-----LVETPPQIPEIAISIA 75 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lP-----l~~~p~~L~~~~~dIA 75 (142) |+|||.+|+.++++. .|++ -..+.|+.=|++|+..|--.+= .|+ .+..+..++.++++.. T Consensus 1 ma~A~~~Dv~~~w~r----~lT~--------~E~~~v~~~L~~Ae~~Ir~riP---~L~~r~~a~~~~~~~v~~Vea~aV 65 (125) T protein:vir:10 1 MAYANAQDVVTLWAK----EPEP--------EVMELIERRLAQVERMIKRRIP---NLDLKVAADATFQADLIDIEADAV 65 (125) T ss_pred CCcCCHHHHHHHhCC----CCCH--------HHHHHHHHHHHHHHHHHHHhCC---ChhhhhhcCCCccccHHHHHHHHH Confidence 999999999999863 2232 1356788889999998854331 122 3345566666655544 Q ss_pred HHHHhcCC-C-ChHHHHHHHHHHHHHHHHhcCcccCCCCCCc--CCCCCCCeeeeecCCCccC Q lcl|NC_020866. 76 IWKLHSFE-P-GDKIKTDYRDALQALRDIAKGAIKLNATSVE--PATTGDGGARMTDRERPLT 134 (142) Q Consensus 76 ~Y~L~~~~-~-~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~--~~~~~~~~~~~~~~~r~F~ 134 (142) + ++..+. + ..+-.-.|-+-+.+ +.+.|++-|--++-. .+....+.+.++.+.-.=+ T Consensus 66 ~-Rv~rNPeGy~s~T~G~Ys~~l~~--~~~~g~L~it~~Ew~~Lg~~r~s~~~~i~p~~~~~~ 125 (125) T protein:vir:10 66 L-RLVRNPEGYISETDGAYTYQLQT--DLSQGRLTILDDEWTTLGVNRLSRMSVIAPNIVMPT 125 (125) T ss_pred H-HHhcCCCcccccccchhHHhhhc--ccccCceeeCHHHHHhhccccccceeeeecccccCC Confidence 3 344321 1 11111345444444 566677655311110 1111233333332211111 No 27 >protein:vir:99002 Length: 158 # NCBI annotation: gp31 # Family: family:all:32652 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655896;genbank:gi:109521468;genbank:GeneID:4157967 Probab=71.07 E-value=0.036 Score=28.51 Aligned_cols=113 Identities=15% Similarity=0.149 Sum_probs=62.4 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHH---HHHHHHHHHHHHhhhcCCCcccccHHHHHHHHHHHH Q lcl|NC_020866. 1 MP-YTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDA---LTDTDAVIDGYLGTRYVLPLVETPPQIPEIAISIAI 76 (142) Q Consensus 1 M~-YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~A---l~~A~~~id~YL~~RY~lPl~~~p~~L~~~~~dIA~ 76 (142) |+ |+|++||..++-. .-+..+ ..+..+| |++||++..-+=++...+| ..+|..++.+|..-|+ T Consensus 1 ~~alasvee~~trl~~---------~lp~~~---~r~~a~a~~vLd~~S~~ar~~~gr~W~~~-~daP~~vr~ivL~aa~ 67 (158) T protein:vir:99 1 MAALVSVEEFTTFLRV---------PLPEEG---SEKYTQMEFLLTLASDWARELSCKPWLLP-ADAPVTARGIILAASR 67 (158) T ss_pred CcceeeHhhhhhhhcc---------cCChhh---hHHHHHHHHHHHHHHHHHHHhcCccCCCC-CcchhHHHHHHHHHHH Confidence 87 9999999998721 111111 2234444 9999999887766666544 3689999999998887 Q ss_pred HHHhcC------CC-ChHH-------H--HHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccChhhhcc Q lcl|NC_020866. 77 WKLHSF------EP-GDKI-------K--TDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQENMKG 140 (142) Q Consensus 77 Y~L~~~------~~-~e~v-------~--~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r~~l~g 140 (142) =.+.+- +. +..+ . =-+++=++.|++...-+- |+-. .-+.++.-|. ..+ T Consensus 68 R~~~NP~g~~~~~~G~~~~~~~~~g~~~~ffT~~E~~~L~r~~~s~G--G~~~-----------~~ttR~d~~~---~~~ 131 (158) T protein:vir:99 68 REWNNPKRVSYVVKGPQSATFMQSAYPPGFFTDAEEAKLRSYGRSTG--NWGV-----------IETYRDDEEQ---LNG 131 (158) T ss_pred HHHhcCCceEEeeecchhhhcccccCCCcccCHHHHHHHHHhhcccC--ceeE-----------EEeecCcccc---CCc Confidence 555431 11 1111 0 012444667777632221 1111 1112333332 688 Q ss_pred cC Q lcl|NC_020866. 141 FI 142 (142) Q Consensus 141 ~~ 142 (142) || T Consensus 132 yv 133 (158) T protein:vir:99 132 YL 133 (158) T ss_pred ee Confidence 88 No 28 >protein:vir:1887 Length: 108 # NCBI annotation: gp6 # Family: family:all:6764 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037667;genbank:gi:9634125;genbank:GeneID:1262472 Probab=70.95 E-value=0.2 Score=24.39 Aligned_cols=97 Identities=6% Similarity=0.065 Sum_probs=52.7 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhh-cCCCcccccHHHHHHHHHHHHHHH Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTR-YVLPLVETPPQIPEIAISIAIWKL 79 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~R-Y~lPl~~~p~~L~~~~~dIA~Y~L 79 (142) |.++|+++++....- | ..-|.+.|+.-|..|.+.|-+|+... |..+ ..+|..++..+.-++-| + T Consensus 6 M~~vtLee~K~hLRi-------d------~dddD~lI~~~i~AA~~~v~~~~~~~~~~~~-~~~p~~ik~AiLllv~~-~ 70 (108) T protein:vir:18 6 LDVISLSLFKQQIEF-------E------EDDRDELITLYAQAAFDYCMRWCDEPAWKVA-ADIPAAVKGAVLLVFAD-M 70 (108) T ss_pred ccccCHHHHHHHcCC-------C------CCcchHHHHHHHHHHHHHHHHHhCCcccccc-cccchHHHHHHHHHHHH-H Confidence 999999999877441 1 12467889999999999999999755 3333 34666666555544443 4 Q ss_pred hcCC-C-Ch-HHHHHHHHHHHHHH--HHhcCcccCCCCCCcCCC Q lcl|NC_020866. 80 HSFE-P-GD-KIKTDYRDALQALR--DIAKGAIKLNATSVEPAT 118 (142) Q Consensus 80 ~~~~-~-~e-~v~~rY~~Ai~~L~--~va~G~~~L~~~~~~~~~ 118 (142) |.+| + ++ +.... .-+..+|. +-=.|+ |+.+... T Consensus 71 YenRE~~~~~~~~~~-~~~~~LL~pYR~~~g~-----~~~~~~~ 108 (108) T protein:vir:18 71 FEHRTAQSEVQLYEN-AAAERMMFIHRNWRGK-----AESEEGS 108 (108) T ss_pred Hhcccccccchhhhh-HHHHHHHHHHHhcCCC-----CCcccCC Confidence 4433 2 22 11111 01222222 122233 2221111 No 29 >protein:vir:192 Length: 108 # NCBI annotation: Gp6 # Family: family:all:6764 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037702;genbank:gi:9634167;genbank:GeneID:1262532 Probab=70.95 E-value=0.2 Score=24.39 Aligned_cols=97 Identities=6% Similarity=0.065 Sum_probs=52.7 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhh-cCCCcccccHHHHHHHHHHHHHHH Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTR-YVLPLVETPPQIPEIAISIAIWKL 79 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~R-Y~lPl~~~p~~L~~~~~dIA~Y~L 79 (142) |.++|+++++....- | ..-|.+.|+.-|..|.+.|-+|+... |..+ ..+|..++..+.-++-| + T Consensus 6 M~~vtLee~K~hLRi-------d------~dddD~lI~~~i~AA~~~v~~~~~~~~~~~~-~~~p~~ik~AiLllv~~-~ 70 (108) T protein:vir:19 6 LDVISLSLFKQQIEF-------E------EDDRDELITLYAQAAFDYCMRWCDEPAWKVA-ADIPAAVKGAVLLVFAD-M 70 (108) T ss_pred ccccCHHHHHHHcCC-------C------CCcchHHHHHHHHHHHHHHHHHhCCcccccc-cccchHHHHHHHHHHHH-H Confidence 999999999877441 1 12467889999999999999999755 3333 34666666555544443 4 Q ss_pred hcCC-C-Ch-HHHHHHHHHHHHHH--HHhcCcccCCCCCCcCCC Q lcl|NC_020866. 80 HSFE-P-GD-KIKTDYRDALQALR--DIAKGAIKLNATSVEPAT 118 (142) Q Consensus 80 ~~~~-~-~e-~v~~rY~~Ai~~L~--~va~G~~~L~~~~~~~~~ 118 (142) |.+| + ++ +.... .-+..+|. +-=.|+ |+.+... T Consensus 71 YenRE~~~~~~~~~~-~~~~~LL~pYR~~~g~-----~~~~~~~ 108 (108) T protein:vir:19 71 FEHRTAQSEVQLYEN-AAAERMMFIHRNWRGK-----AESEEGS 108 (108) T ss_pred Hhcccccccchhhhh-HHHHHHHHHHHhcCCC-----CCcccCC Confidence 4433 2 22 11111 01222222 122233 2221111 No 30 >protein:vir:106583 Length: 105 # NCBI annotation: putative protein # Family: family:all:6481 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958586;genbank:gi:41179246;genbank:GeneID:2717119 Probab=65.15 E-value=0.29 Score=23.55 Aligned_cols=90 Identities=14% Similarity=0.173 Sum_probs=56.6 Q ss_pred CCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCCcccccHHHHHHHHHHHHHHHhcC Q lcl|NC_020866. 3 YTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPLVETPPQIPEIAISIAIWKLHSF 82 (142) Q Consensus 3 YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~~~ 82 (142) +-..+.+...+-. |.-+ .++.+.+.+..-|++|.+.+=+|+.. ..+|+.|..+++++|..+.... T Consensus 1 ~~~~~~~~e~ik~--L~~~-------~d~~~DelL~~lieda~~~vl~y~nr------~~ip~~l~~~v~evav~~fNR~ 65 (105) T protein:vir:10 1 MLNVDQLTEIVSA--LSTR-------LENVNNALLTELVKESIAQVLDYTGQ------KKLVGSMDIYVKKLAVINYNRL 65 (105) T ss_pred CCchHHHHHHHHH--Hhcc-------CCCchhHHHHHHHHHHHHHHHHHcCC------cccchhHHHHHHHHHHHHhccc Confidence 2223333322211 1112 24567889999999999999999863 4778899999999888776543 Q ss_pred CC------C---------hHHHHHHHHHHHHHHHHhcCcc Q lcl|NC_020866. 83 EP------G---------DKIKTDYRDALQALRDIAKGAI 107 (142) Q Consensus 83 ~~------~---------e~v~~rY~~Ai~~L~~va~G~~ 107 (142) .. + +.+-+-|.+.|+--++-.-|+. T Consensus 66 G~EG~tS~SegGvS~sy~~~~~~~~~~~l~~yR~~~v~~~ 105 (105) T protein:vir:10 66 GIEGETQRSEGGITNYLETGIPKDIRQGLNSYRIAKVKKL 105 (105) T ss_pred CCcccceeecCCeeeeeeccCcHHHHHHHHHHhhhcccCC Confidence 21 1 1245566666665565555665 No 31 >protein:vir:9821 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795583;genbank:gi:28876338;genbank:GeneID:1257921 Probab=58.82 E-value=0.13 Score=25.41 Aligned_cols=112 Identities=13% Similarity=0.108 Sum_probs=52.3 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCC-Cccccc-----HHHHHHHHHH Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVL-PLVETP-----PQIPEIAISI 74 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~l-Pl~~~p-----~~L~~~~~dI 74 (142) |+|+|.+++.+.-+. +.+-.+.-+..|+..||.+..-+|.- -+.... .+=+.+|..| T Consensus 6 M~YlT~eey~~l~~~-----------------~~~dF~kllk~As~~ID~~t~~~y~~~d~e~d~~~r~~~vKkA~a~QI 68 (138) T protein:vir:98 6 IAFLTQKEFEDLGFD-----------------DVEDFEKMEKRASHAVNLYCRNRYDYKDLKKEIALVQKAVKRAIAYQI 68 (138) T ss_pred ccccchHHHhccCCC-----------------ChhhHHHHHHHHHHHhhhhhccccccccccchhHHHHHHHHHHHHHHH Confidence 999999987643221 11228889999999999999999873 222222 2223344444 Q ss_pred HHHHHhcCCCChHHHHHHHHHHHHHHHHhcCcccCCC--CCCcCCCCCCCeeeeecCCCccChh---hh--cccC Q lcl|NC_020866. 75 AIWKLHSFEPGDKIKTDYRDALQALRDIAKGAIKLNA--TSVEPATTGDGGARMTDRERPLTQE---NM--KGFI 142 (142) Q Consensus 75 A~Y~L~~~~~~e~v~~rY~~Ai~~L~~va~G~~~L~~--~~~~~~~~~~~~~~~~~~~r~F~r~---~l--~g~~ 142 (142) .......-...+. ..-+.-|.-|+.++.- ...+.+..++.. .+--++-+ -| -|++ T Consensus 69 eY~~~~G~ts~~d--------~~~~~s~svGrTSiS~~~~~~~~s~~~~~~-----~~~~~s~~A~~~L~~tGLL 130 (138) T protein:vir:98 69 AYLNDSGVMTAED--------KQSFAGISLGRTSISYTVGHGQGSQQKTLA-----DRFNLCLDAENELLVVGLG 130 (138) T ss_pred HHHHHcCCcchhh--------ccCcCceEeeeeEeeccccccccccccccc-----ccccccHHHHHHHhhcCcc Confidence 4444333211111 2334456666655531 111111111100 00011111 11 1222 No 32 >protein:vir:81159 Length: 95 # NCBI annotation: putative DNA packaging protein # Family: family:all:316 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285813;genbank:gi:148747734;genbank:GeneID:5247202 Probab=58.07 E-value=0.42 Score=22.64 Aligned_cols=91 Identities=13% Similarity=0.119 Sum_probs=61.5 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCCcccccHHHHHHHHHHHHHHHh Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPLVETPPQIPEIAISIAIWKLH 80 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (142) |.+.|+++++....- | ..-|.+.|+.-|+.|...|.+|++.++ ...|+.++..++-++-++=. T Consensus 1 Mm~vtLee~K~~LRI-------D------~d~dD~lI~~li~aA~~~i~~~~g~~~----~~~~~~~~~Avl~lv~~~Ye 63 (95) T protein:vir:81 1 MMIVTLEEVKNWLRV-------D------FSDDDALITTLINAAEEYLKNATGTTF----DATNHLAKIFCMTLIADWYE 63 (95) T ss_pred CCcCCHHHHHHHcCC-------C------CCcchHHHHHHHHHHHHHHHHhhcccc----ccCchHHHHHHHHHHHHHHh Confidence 999999998865331 1 123677899999999999999998654 34566666666666655544 Q ss_pred cCCC----ChHHHHHHHHHHHHHHHHhcCccc Q lcl|NC_020866. 81 SFEP----GDKIKTDYRDALQALRDIAKGAIK 108 (142) Q Consensus 81 ~~~~----~e~v~~rY~~Ai~~L~~va~G~~~ 108 (142) .|.. +.++..-.+.-|.-|+....|... T Consensus 64 NRe~~~~~~~~~p~~v~sll~~lr~~~~~~~~ 95 (95) T protein:vir:81 64 NRELVGRASDQVRPILQSILAQLTYAYGGETA 95 (95) T ss_pred hccccccccccccHHHHHHHHHhhhccccccC Confidence 4432 234666667777777766666643 No 33 >protein:vir:4788 Length: 130 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150168;swissprot:trembl:q94m43;genbank:gi:15088779;uniprot:Q94M43;genbank:GeneID:955972 Probab=56.06 E-value=0.12 Score=25.73 Aligned_cols=109 Identities=15% Similarity=0.214 Sum_probs=47.6 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCC--Cccccc----HHHH-HHHHH Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVL--PLVETP----PQIP-EIAIS 73 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~l--Pl~~~p----~~L~-~~~~d 73 (142) |+|+|.+++.+.-|+ +++-.+.-+..|+..||.+.+.+|.- -+.+.. ..++ .+|.. T Consensus 1 M~YlT~eey~el~~~-----------------~~~~F~kl~k~A~~~ID~~t~~~y~~~~~~~~~~~~r~~~vK~A~a~Q 63 (130) T protein:vir:47 1 MTYLTQEEFDELDFD-----------------EVTDFEKLAKRAKIAIDLYTNGIYQKDIDFEKEIAYRKSAVKLAMAFQ 63 (130) T ss_pred CCCCchhhHhhcCCC-----------------ChhhHHHHHHHHHHHHHHHhcccccccCCccCcchHHHHHHHHHHHHH Confidence 999999998754332 11228888999999999999988852 222222 2222 23333 Q ss_pred HHHHHHhcCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccChhh-----hccc-C Q lcl|NC_020866. 74 IAIWKLHSFEPGDKIKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQEN-----MKGF-I 142 (142) Q Consensus 74 IA~Y~L~~~~~~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r~~-----l~g~-~ 142 (142) |.......-...+. -+-..-|.-|+.++.-........ ..+.. ++-+- .-|+ + T Consensus 64 ieY~~~~G~~s~~~--------~~~~~S~svGrtSis~~~~~~~~~-------~~~~~-vs~da~~~L~~tGL~L 122 (130) T protein:vir:47 64 IAYLDASGIMSADD--------KQLANSVSIGRTSISYSTSQSTLA-------GQRFN-LSMDAENALRQAGFSL 122 (130) T ss_pred HHHHHHhccccchh--------ccCcceeeecceeeecCcCccccc-------cCCcc-ccHHHHHHHHhccccc Confidence 33332222111111 111222333443332111111100 11111 12221 1233 2 No 34 >protein:vir:97267 Length: 172 # NCBI annotation: hypothetical protein ORF024 # Family: family:all:703 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294532;genbank:gi:149408253;genbank:GeneID:5237132 Probab=54.35 E-value=0.067 Score=27.02 Aligned_cols=128 Identities=14% Similarity=0.134 Sum_probs=58.9 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHH---HHhhhc-C-----------------CC Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDG---YLGTRY-V-----------------LP 59 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~---YL~~RY-~-----------------lP 59 (142) =+|+|.+++...+...-. .+ ...+.+-.+++|..|+.-||+ |.+.|- . +| T Consensus 16 nSYvtv~~a~aY~~~rg~-~~--------~a~~~~~ke~aLi~A~~yiD~~~~f~G~r~~~~~Q~l~WPRtg~~d~~~~~ 86 (172) T protein:vir:97 16 NAYISVEEFKTYHTDRGN-SF--------AGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYY 86 (172) T ss_pred cccccHHHHHHHHHhcCc-cc--------CCCCcHHHHHHHHHHHHHHhhhhhcccCCCCCcchhhhcccCCCCCCcccc Confidence 569999999888755421 11 112334478899999999997 333341 1 23 Q ss_pred cccccHHHHHHHHHHHHHHHhcCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeec-----CCCccC Q lcl|NC_020866. 60 LVETPPQIPEIAISIAIWKLHSFEPGDKIKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTD-----RERPLT 134 (142) Q Consensus 60 l~~~p~~L~~~~~dIA~Y~L~~~~~~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~-----~~r~F~ 134 (142) ...+|.-|+..||-+|.+-|.....++....-=...+ ..|.+.=|.++..-.. ...+..+...+.. +|+-++ T Consensus 87 ~~~IP~~v~~A~~elA~~al~~~l~~d~~~~~~~~~v-~~kr~kvg~i~~~y~~--~~~~~~~~p~~~~v~aLL~p~gl~ 163 (172) T protein:vir:97 87 INDIPPEVKEACAEYALRALAAELNPDPERNASGVAV-LSKSEAVGPISESVTF--VGGAVFQMPKYPAADQKLVRAGLV 163 (172) T ss_pred cccccHHHHHHHHHHHHHHHhcccccccccccccccc-eeeeeeecceeeEeec--cCCCCCccccHHHHHHHHhhhccc Confidence 4567899999999999988765432211100000000 0011222333321100 0111111111110 011111 Q ss_pred hhh---hcc Q lcl|NC_020866. 135 QEN---MKG 140 (142) Q Consensus 135 r~~---l~g 140 (142) +.. +|| T Consensus 164 ~~~~~~~r~ 172 (172) T protein:vir:97 164 RSGGTLLRG 172 (172) T ss_pred cCcceeccC Confidence 111 233 No 35 >protein:vir:95176 Length: 172 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:703 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293420;genbank:gi:148912841;genbank:GeneID:5228251 Probab=53.65 E-value=0.24 Score=23.96 Aligned_cols=117 Identities=10% Similarity=0.092 Sum_probs=58.8 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHH----Hhhh------------------cCC Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGY----LGTR------------------YVL 58 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~Y----L~~R------------------Y~l 58 (142) =+|+|.+++.+.+...-. .-..|.+..+.||-.|+.-||+| ++.| -.+ T Consensus 17 nSYvtv~ea~aY~~~rg~----------~~~~~~~~ke~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~~~~v 86 (172) T protein:vir:95 17 NSYVSVADARIYASNRGV----------ELPLDDDELAAMLIRSTDYLEAQACRFQGKPTSTTQALQWPRTGVFLNEDEV 86 (172) T ss_pred cccccHHHHHHHHHhcCC----------cCCCChHHHHHHHHHHHHHhhccCCceeeeecCCcccccCCcCCcccCcccc Confidence 569999999887654311 11135667899999999999975 2221 113 Q ss_pred CcccccHHHHHHHHHHHHHHHhcCC--CChHHHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccChh Q lcl|NC_020866. 59 PLVETPPQIPEIAISIAIWKLHSFE--PGDKIKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQE 136 (142) Q Consensus 59 Pl~~~p~~L~~~~~dIA~Y~L~~~~--~~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r~ 136 (142) |...+|..|+..||-+|.+.+.... ++..-.++ + --++| |.++..-... ....+...+ +.=.. T Consensus 87 ~~~~IP~~V~~A~~elA~~~~~~~~~~~~~~~~~~----v-k~~kV--G~I~veY~~~---~~~~~~~~~-----~~v~~ 151 (172) T protein:vir:95 87 PSNVIPKSLIAAQVQLTMAINAGFDLQPNVSPQDY----V-TREKV--GPIETEYADP---LSVGIMPTF-----TAANA 151 (172) T ss_pred cccchhHHHHHHHHHHHHHHHcCccccccCCcccc----e-eEEec--cceEEeeccC---CCCCCcccH-----HHHHH Confidence 4567899999999999975544321 11111000 0 00112 3333211000 000000000 11223 Q ss_pred hhcccC Q lcl|NC_020866. 137 NMKGFI 142 (142) Q Consensus 137 ~l~g~~ 142 (142) -|++|+ T Consensus 152 LL~p~l 157 (172) T protein:vir:95 152 LLAPLF 157 (172) T ss_pred HHhhhh Confidence 356665 No 36 >protein:vir:2345 Length: 125 # NCBI annotation: gp15 # Family: family:all:2817 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075282;genbank:gi:12657869;genbank:GeneID:920134 Probab=52.62 E-value=0.45 Score=22.48 Aligned_cols=116 Identities=13% Similarity=0.098 Sum_probs=60.2 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcC-CC-----cccccHHHHHHHHH Q lcl|NC_020866. 1 MP-YTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYV-LP-----LVETPPQIPEIAIS 73 (142) Q Consensus 1 M~-YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~-lP-----l~~~p~~L~~~~~d 73 (142) |+ |+|.+|..++++. .|++ -....|+.-|++|+..|- .|++ |+ .+.-+..++.++++ T Consensus 1 ma~~A~~eDV~a~w~R----~lt~--------eE~~~V~~~L~~ae~~ir----rriPdL~~r~~~~~~~~~~v~~V~a~ 64 (125) T protein:vir:23 1 MATLATHEDVTAFWAR----TPTA--------EEIVLINRRLAQAERMLL----RAIPELLIKASSDPVFRAEVIDIEAE 64 (125) T ss_pred CCcccCHHHHHHHhCC----CCCH--------HHHHHHHHHHHHHHHHHH----HhcCChhhhhcCCCcchhhHHHHHHH Confidence 87 9999999999863 2232 135678889999999987 3333 32 34556778888777 Q ss_pred HHHHHHhcCCC-ChHHHHHHHHHHHHHHHHhcCcccCCCCCCcC-CCCCCCeeeeecCCCccC Q lcl|NC_020866. 74 IAIWKLHSFEP-GDKIKTDYRDALQALRDIAKGAIKLNATSVEP-ATTGDGGARMTDRERPLT 134 (142) Q Consensus 74 IA~Y~L~~~~~-~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~-~~~~~~~~~~~~~~r~F~ 134 (142) ..+=.+..... ..+-.-.|-.-+.+ +++.|++-+--++-+. .++-++..++...+..=+ T Consensus 65 ~V~Rv~rnPeGy~seT~g~Yt~~l~~--~~~~g~L~it~~E~a~Lg~~~s~~~vi~p~~~~p~ 125 (125) T protein:vir:23 65 AVLRLVRNHEGYLSETDGNYTYMLQA--QDPNRKLEILPEEWEVLGIVRSGLGILVPTVVLPS 125 (125) T ss_pred HHHHHhcCCCCccccccchhhhhhhc--cCCCCceeecHHHHHhhccccccceEEeeceecCC Confidence 65543321111 01111355555555 5667776553111111 111122333332222111 No 37 >protein:vir:78383 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110841;genbank:gi:134288602;genbank:GeneID:5179646 Probab=39.99 E-value=0.99 Score=20.60 Aligned_cols=118 Identities=14% Similarity=0.127 Sum_probs=59.4 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHH----Hhhhc------------------CC Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGY----LGTRY------------------VL 58 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~Y----L~~RY------------------~l 58 (142) =+|+|.+++.+.+.+.-. + -..|.+..+.||..|+..||+| ++.|- .+ T Consensus 15 nSYvtv~~a~aY~~~rg~---~-------~~~d~~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~g~~~ 84 (169) T protein:vir:78 15 DSYVSLEDGRALAAKYGL---E-------LPEDDTAAEAALRNGAVYVGLFESQMCGRRVSANQALAFPRTGVTLHGFPQ 84 (169) T ss_pred cccccHHHHHHHHHHcCC---c-------CCCChHHHHHHHHHHHHHhhhccccceeeeCCcccccccccCCceeccccc Confidence 459999999888654321 1 1135677999999999999974 33321 23 Q ss_pred CcccccHHHHHHHHHHHHHHHhcCCCChH-HHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccChhh Q lcl|NC_020866. 59 PLVETPPQIPEIAISIAIWKLHSFEPGDK-IKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQEN 137 (142) Q Consensus 59 Pl~~~p~~L~~~~~dIA~Y~L~~~~~~e~-v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r~~ 137 (142) |...+|.-++..||-+|.+.+.....+.. -.++. .++--.|.+...-.. ..+..+.+.+. .=..- T Consensus 85 ~~~~IP~~v~~A~~elA~~~~~g~~~~~~~~~~~v------~~e~v~G~i~veY~~---~~~~~~~~~~~-----~~~~L 150 (169) T protein:vir:78 85 PSNVIPPLVIQAQVMAAVEYGAGTDVRGSTDGREV------QTERVEGAVTVSYFK---NGYSGGTVSIT-----TADDA 150 (169) T ss_pred ccccchHHHHHHHHHHHHHHhcCcccCCCCCccee------EEEEecCceeEeecC---CCCCCCcccHH-----HHHHH Confidence 45678999999999999988764321111 00000 000001332221100 00001111110 01123 Q ss_pred hcccC Q lcl|NC_020866. 138 MKGFI 142 (142) Q Consensus 138 l~g~~ 142 (142) |+.|+ T Consensus 151 L~p~l 155 (169) T protein:vir:78 151 LRPLL 155 (169) T ss_pred hhhhc Confidence 45554 No 38 >protein:vir:95004 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224025;genbank:gi:62327312;genbank:GeneID:5176832 Probab=39.43 E-value=1 Score=20.54 Aligned_cols=118 Identities=13% Similarity=0.112 Sum_probs=61.8 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHH----Hhhhc------------------CC Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGY----LGTRY------------------VL 58 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~Y----L~~RY------------------~l 58 (142) =+|+|.+++.+.+...-. + -..|....+.+|-.|+.-||+| ++.|- .+ T Consensus 15 nSYvt~~ea~aY~~~rg~---~-------~~~dd~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~g~~~ 84 (169) T protein:vir:95 15 DSYVSLEDGRALAAKYGL---E-------LPEDDIAAEASLRNGAVYVGLFESQMCGRRVSANQALAFPRTGIDLHGFPQ 84 (169) T ss_pred cccccHHHHHHHHHHcCC---c-------CCCCHHHHHHHHHHHHHHhhccccccccccCCcchhhccccCCceeccccc Confidence 459999999888664311 1 1135667899999999999983 33321 14 Q ss_pred CcccccHHHHHHHHHHHHHHHhcCCCChHH-HHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccChhh Q lcl|NC_020866. 59 PLVETPPQIPEIAISIAIWKLHSFEPGDKI-KTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQEN 137 (142) Q Consensus 59 Pl~~~p~~L~~~~~dIA~Y~L~~~~~~e~v-~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r~~ 137 (142) |...+|..++..||-+|.+.+......... .++ + .+....|.++..-.. ..+..+.+.+.. =+.- T Consensus 85 ~~~~IP~~V~~A~~elA~~~~~g~~~~~~~~~~~----v--~~e~v~G~i~veY~~---~~~~~~~~~~~a-----~~~L 150 (169) T protein:vir:95 85 PSNVIPSLVIQAQVMAAVEYGAGTDVRGSTDGRE----V--QTERVEGAVTVSYFK---NGYSGGTVSITA-----ADDA 150 (169) T ss_pred ccccchHHHHHHHHHHHHHHHcCccccCCCCccc----e--eeeeeccceeEeecC---CCCcCccccHHH-----HHHh Confidence 567889999999999999998743211110 000 0 000112444332111 011111121111 1122 Q ss_pred hcccC Q lcl|NC_020866. 138 MKGFI 142 (142) Q Consensus 138 l~g~~ 142 (142) |+.|+ T Consensus 151 L~p~l 155 (169) T protein:vir:95 151 LRPLL 155 (169) T ss_pred hhhhc Confidence 34444 No 39 >protein:vir:108221 Length: 150 # NCBI annotation: gp11 # Family: family:all:28004 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552340;genbank:gi:160700660;genbank:GeneID:5758941 Probab=36.51 E-value=1.2 Score=20.21 Aligned_cols=126 Identities=11% Similarity=0.061 Sum_probs=67.9 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcC-CCcccccHHHHHHHHHHHHHHH Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYV-LPLVETPPQIPEIAISIAIWKL 79 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~-lPl~~~p~~L~~~~~dIA~Y~L 79 (142) -+|+++++|.++|. -|++.+ ..+.+.-|.+|+..|..=.- ++. -|++.-+.+.+.++|+|.+=-| T Consensus 5 ~pFadv~~lea~Wr-----pLt~~E--------~~~Ae~LL~~As~~IR~~~P-a~a~a~l~~dd~~A~~Vs~~vVk~Am 70 (150) T protein:vir:10 5 TPFIDVSQFEAMFR-----PLGDGE--------RLLAEVLLKAAAIRIRDRVA-AAGRAPLEPDDAMAILVSFEVTRDAM 70 (150) T ss_pred ccccchhhhHhhhc-----ccChhH--------HHHHHHHHHHHHHHHhhccc-ccCCCCCCCCcchhHHHHHHHHHHhc Confidence 34999999999997 455432 34566678889888876222 233 5677788899999999998776 Q ss_pred h--cCC---C-----ChHH--HHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeee----ecCC--CccChhhhccc Q lcl|NC_020866. 80 H--SFE---P-----GDKI--KTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARM----TDRE--RPLTQENMKGF 141 (142) Q Consensus 80 ~--~~~---~-----~e~v--~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~----~~~~--r~F~r~~l~g~ 141 (142) - .+. . .... .--|-+...-|.--..-|-.||++....+.-+.-.--| .+++ -+.+ +=.+| T Consensus 71 ~~~~e~~G~ss~S~T~G~rses~T~snPag~L~ft~~~k~lLGis~ta~P~~~~~~~df~~~~~~~~~~~~~~--~~~~~ 148 (150) T protein:vir:10 71 PPIPEMAGRTQYSITTDDRTEQATMATAAGLLDFNERHWSLLGISATAGPEYGGMGGDFGQLGRANPYPIVIG--SDADW 148 (150) T ss_pred cccccccccchhhhccccccccccccchhhhhhhhHHHHHHhCCCccCCccccCCCcchhhhcCCCCcceEec--CCccc Confidence 3 221 1 1111 11344444555555555666777643322221110001 0111 1111 11234 Q ss_pred C Q lcl|NC_020866. 142 I 142 (142) Q Consensus 142 ~ 142 (142) + T Consensus 149 ~ 149 (150) T protein:vir:10 149 L 149 (150) T ss_pred c Confidence 4 No 40 >protein:vir:100103 Length: 120 # NCBI annotation: gp7 # Family: family:all:363 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945037;genbank:gi:38707897;genbank:GeneID:2744150 Probab=30.06 E-value=1.6 Score=19.45 Aligned_cols=94 Identities=14% Similarity=0.134 Sum_probs=51.3 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhc-CCCc---------------cccc Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRY-VLPL---------------VETP 64 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY-~lPl---------------~~~p 64 (142) |+.+|+++++....- | ..-|.+.|+.-|.-|.+.|..|++..+ .... ..+| T Consensus 5 m~~vtL~e~K~hLRv-------d------~d~DD~lI~~~i~AA~~~v~~~~~r~l~~~~~~~~~~~~~~~~~~~~~~~~ 71 (120) T protein:vir:10 5 TPIVSLEVALAHLRE-------D------AGVADDLIKIYIGAATQSASDYVDRKLYANDAEMQAAVADATAGADPIVAN 71 (120) T ss_pred CCccCHHHHHHHcCC-------C------CCcchHHHHHHHHHHHHHHHHHhCCcccccccccchhhhccccccccccCC Confidence 999999998876542 1 124677899999999999999998763 2110 1256 Q ss_pred HHHHHHHHHHHHHHHhcCCCC-----h-HHHHHHHHHHHHHHHHhcCcccCCC Q lcl|NC_020866. 65 PQIPEIAISIAIWKLHSFEPG-----D-KIKTDYRDALQALRDIAKGAIKLNA 111 (142) Q Consensus 65 ~~L~~~~~dIA~Y~L~~~~~~-----e-~v~~rY~~Ai~~L~~va~G~~~L~~ 111 (142) +.++..++-++- ++|.+|.. + ...+--.-+-..|.... ...|+ T Consensus 72 ~~i~~AvLllvg-~~YenRe~~~~~~~~~~~~lP~~v~~Ll~~yR---~~~gv 120 (120) T protein:vir:10 72 DAIRAAILLTIG-KLYAFREDVVSGASASVTELPSGAKSLLFPYR---VGLGV 120 (120) T ss_pred HHHHHHHHHHHH-HHHhchhhhhhcccccccccCHHHHHHHHHhh---hccCC Confidence 666665554444 34443321 1 11111111223333222 12233 No 41 >protein:vir:100245 Length: 113 # NCBI annotation: gp74 # Family: family:all:363 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355410;genbank:gi:77864700;genbank:GeneID:3725967 Probab=26.19 E-value=2 Score=18.97 Aligned_cols=93 Identities=15% Similarity=0.041 Sum_probs=51.1 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhh-cCCCc---------------cccc Q lcl|NC_020866. 1 MPYTSLDALTKKFGPDMLLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTR-YVLPL---------------VETP 64 (142) Q Consensus 1 M~YaT~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~R-Y~lPl---------------~~~p 64 (142) |+++|+++++....-+ ..-|.+.|+.-|.-|++.+-.||+.+ |..+. ..+| T Consensus 1 M~~vtLee~K~hLRvd-------------~d~dD~lI~~li~AA~~~ve~~l~r~l~~~~~~~~~~~~~~~~~~~~~~~p 67 (113) T protein:vir:10 1 MALVELKLALGFVRAN-------------AGVEDDVVQMLLDAATQSAVDYLNRQVFETEDAMTTAIEAGTAGQNPMVVN 67 (113) T ss_pred CCCCCHHHHHHHcCCC-------------CCcchHHHHHHHHHHHHHHHHHhCccccccccccccccccccccccccccC Confidence 9999999988775421 12367889999999999999999876 33221 1356 Q ss_pred HHHHHHHHHHHHHHHhcCCC-ChHHHHHHH---HHHHHHHHHhcCcccCCC Q lcl|NC_020866. 65 PQIPEIAISIAIWKLHSFEP-GDKIKTDYR---DALQALRDIAKGAIKLNA 111 (142) Q Consensus 65 ~~L~~~~~dIA~Y~L~~~~~-~e~v~~rY~---~Ai~~L~~va~G~~~L~~ 111 (142) +.++..+.-++-|+=..|.+ ++. .-++ -+-.+|..... -.|+ T Consensus 68 ~~i~~AvLllv~~~Y~nRe~~~~~--~~~~lP~~v~~Ll~~yR~---~~g~ 113 (113) T protein:vir:10 68 AAIRAAILKITAELYANREDTAFG--PITELPLNARALLRPHRI---IPGV 113 (113) T ss_pred hHHHHHHHHHHHHHHhhhhhhchh--hhhccCHHHHHHHHHhhh---hcCC Confidence 66766666555443322322 211 1111 11122222211 1222 No 42 >protein:vir:102961 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:26777 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945287;genbank:gi:39653722;uniprot:Q708M5;genbank:GeneID:2672875 Probab=20.12 E-value=1.4 Score=19.79 Aligned_cols=108 Identities=12% Similarity=0.162 Sum_probs=47.9 Q ss_pred HHHhcCHHH-------HHHHhCCCCCcccccCHHHHHHHHHHHHHHHHHHHhhhcCCCcccccHHHHHHHHHHHHHHHhc Q lcl|NC_020866. 9 LTKKFGPDM-------LLGLTDRTTPPAGVIDIDVVNDALTDTDAVIDGYLGTRYVLPLVETPPQIPEIAISIAIWKLHS 81 (142) Q Consensus 9 l~~~~g~~e-------l~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~id~YL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~~ 81 (142) |+.+..+.. +.-+.+++ ..=|..+++-||+++...|-.|+. ...+|.-|.-.++++|.=-|.. T Consensus 1 ~~~~lkq~~~~~~~~~~l~~~~d~----~~kD~~vl~faie~v~~~IlnycN------ikeiP~~Le~v~~~maiDll~~ 70 (131) T protein:vir:10 1 MIQELKQDNTMYLISCVRKMRQDN----YFKDMEVLHYALTQAENEILNYIH------QDSVPGRLENVWIDMTNDLLDK 70 (131) T ss_pred Chhhhhhhhhhhhhhhhhcccccc----ccchHHHHHHHHHHHHHHHhhhcC------CcccchhhHHHHHHHHHHHHhh Confidence 444443321 11122221 233566899999999999999997 4466666655555444422211 Q ss_pred CC-CChHHHHHHHHHHHHHHHHhcCcccCCCCCCcCCCCCCCeeeeecCCCccCh-----hhhcccC Q lcl|NC_020866. 82 FE-PGDKIKTDYRDALQALRDIAKGAIKLNATSVEPATTGDGGARMTDRERPLTQ-----ENMKGFI 142 (142) Q Consensus 82 ~~-~~e~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~F~r-----~~l~g~~ 142 (142) .. .+.+-. ..++..--.+.|..|..+ +.+.++.-.|.| .-|++|. T Consensus 71 e~~~~~k~~-~i~~~~g~VsSI~eGDTs---------------Isf~s~t~~~qrl~~~~s~l~~Y~ 121 (131) T protein:vir:10 71 VKEQSVLAE-KAGADDFSVKSIKMGDTT---------------IEKVSPYEMIQRMKQVPSSLERYK 121 (131) T ss_pred hcccccccc-cccccccceeeeeeccee---------------eeccCCccHHHHHHHHHHHHhhhH Confidence 11 000000 000000001333334333 334434444444 2233333 Done!