Query         lcl|NC_018274.1_cdsid_YP_006560535.1 [gene=B614_gp38] [protein=hypothetical protein] [protein_id=YP_006560535.1] [location=22056..22472]
Match_columns 138
No_of_seqs    102 out of 230
Neff          6.8
Searched_HMMs 1612
Date          Thu Nov 7 13:01:02 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_38 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_38_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:79253 Length: 138  100.0 4.1E-54 2.6E-57  313.2  16.5  138    1-138  1-138 (138)
  2 protein:vir:99222 Length: 138  100.0 4.1E-54 2.6E-57  313.2  16.5  138    1-138  1-138 (138)
  3 protein:vir:103846 Length: 138 100.0 9.2E-54 5.7E-57  311.4  16.4  138    1-138  1-138 (138)
  4 protein:vir:79074 Length: 150  100.0 6.5E-49   4E-52  284.8  15.4  137    1-138  1-150 (150)
  5 protein:vir:107864 Length: 150 100.0 1.4E-48 8.8E-52  282.9  15.5  137    1-138  1-150 (150)
  6 protein:vir:99848 Length: 172  100.0 2.2E-48 1.4E-51  281.8  15.3  137    1-138  1-168 (172)
  7 protein:vir:1993 Length: 141 # 100.0 4.7E-48 2.9E-51  280.0  15.9  136    1-138  1-137 (141)
  8 protein:vir:98481 Length: 136   97.3 5.2E-06 3.2E-09   49.5   9.2  118    1-138  1-134 (136)
  9 protein:vir:43 Length: 131 # N  95.4 0.00054 3.3E-07   38.5   9.6   99    1-115  1-131 (131)
 10 protein:vir:2432 Length: 124 #  95.1 0.00049   3E-07   38.7   8.6  117    1-135  1-124 (124)
 11 protein:vir:80967 Length: 131   95.1 0.00077 4.8E-07   37.6   9.6   99    1-115  1-131 (131)
 12 protein:vir:2505 Length: 128 #  95.1 2.7E-05 1.7E-08   45.6   1.5  108    1-138  5-113 (128)
 13 protein:vir:78478 Length: 149   94.2 0.00063 3.9E-07   38.1   7.0  121    1-138  1-136 (149)
 14 protein:vir:78254 Length: 149   94.2 0.00063 3.9E-07   38.1   7.0  121    1-138  1-136 (149)
 15 protein:vir:94761 Length: 132   94.1 0.00082 5.1E-07   37.5   7.5  116    1-131  1-132 (132)
 16 protein:vir:98900 Length: 132   93.7  0.0021 1.3E-06   35.2   8.9  112    1-138  1-118 (132)
 17 protein:vir:9576 Length: 131 #  93.0  0.0018 1.1E-06   35.7   7.4  115    1-138  1-127 (131)
 18 protein:vir:7773 Length: 123 #  92.3  0.0029 1.8E-06   34.5   7.6  116    1-135  1-123 (123)
 19 protein:vir:1640 Length: 132 #  90.6  0.0046 2.8E-06   33.4   6.9  116    1-131  1-132 (132)
 20 protein:vir:9761 Length: 140 #  90.4  0.0062 3.9E-06   32.7   7.5  115    1-138  1-135 (140)
 21 protein:vir:4228 Length: 125 #  90.2  0.0042 2.6E-06   33.6   6.3  120    1-135  1-125 (125)
 22 protein:vir:104088 Length: 125  89.5  0.0035 2.2E-06   34.0   5.4  120    1-135  1-125 (125)
 23 protein:vir:1887 Length: 108 #  88.2   0.023 1.4E-05   29.6   8.8   98    1-124  6-108 (108)
 24 protein:vir:192 Length: 108 #   88.2   0.023 1.4E-05   29.6   8.8   98    1-124  6-108 (108)
 25 protein:vir:1329 Length: 122 #  86.2   0.011 6.8E-06   31.3   6.0  115    1-135  1-122 (122)
 26 protein:vir:6243 Length: 122 #  83.3   0.023 1.4E-05   29.6   6.4  115    1-135  1-122 (122)
 27 protein:vir:81159 Length: 95 #  80.9   0.061 3.8E-05   27.2   7.8   92    1-109  1-95 (95)
 28 protein:vir:9821 Length: 138 #  77.6   0.015 9.6E-06   30.5   3.4  108    1-138  6-123 (138)
 29 protein:vir:93592 Length: 108   74.6     0.1 6.3E-05   26.0   7.1   93    1-110  2-108 (108)
 30 protein:vir:2345 Length: 125 #  74.1   0.068 4.2E-05   27.0   6.0  117    1-135  1-125 (125)
 31 protein:vir:100245 Length: 113  68.7    0.23 0.00014   24.1   7.8   95    1-112  1-113 (113)
 32 protein:vir:100103 Length: 120  66.7    0.22 0.00014   24.2   7.1   95    1-112  5-120 (120)
 33 protein:vir:80389 Length: 172   64.7    0.28 0.00017   23.6   7.2  112    1-131  15-172 (172)
 34 protein:vir:106583 Length: 105  61.9    0.34 0.00021   23.1   7.7   91    3-108  1-105 (105)
 35 protein:vir:4788 Length: 130 #  56.6    0.17  0.0001   24.9   4.5  104    1-138  1-114 (130)
 36 protein:vir:94955 Length: 170   55.5   0.065   4E-05   27.1   2.1  121    1-138  14-156 (170)
 37 protein:vir:97267 Length: 172   47.8    0.23 0.00014   24.1   3.8  126    1-137  16-172 (172)
 38 protein:vir:79701 Length: 144   37.5    0.23 0.00014   24.1   2.1  112    1-138  1-130 (144)
 39 protein:vir:95176 Length: 172   33.4    0.33  0.0002   23.2   2.3  116    1-133  17-172 (172)
 40 protein:vir:5742 Length: 110 #  28.5     1.7  0.0011   19.3   8.7   96    1-108  1-110 (110)
 41 protein:vir:99002 Length: 158   28.1     1.8  0.0011   19.2   8.3  124    1-138  1-150 (158)
 42 protein:vir:94507 Length: 113   24.8     2.1  0.0013   18.9   5.0   98    1-128  1-113 (113)
 43 protein:vir:95004 Length: 169   21.0     2.7  0.0017   18.2   6.4  116    1-136  15-169 (169)
 44 protein:vir:78383 Length: 169   20.6     2.7  0.0017   18.2   6.8  116    1-136  15-169 (169)
 45 protein:vir:3970 Length: 110 #  20.5     1.8  0.0011   19.2   3.8   96    1-128  1-110 (110)

No 1
>protein:vir:79253 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469165;genbank:gi:157835007;genbank:GeneID:5648834
Probab=100.00 E-value=4.1e-54 Score=313.25 Aligned_cols=138 Identities=100% Similarity=1.431 Sum_probs=134.2

Q ss_pred             CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHHHHHhh
Q lcl|NC_018274.    1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLH   80 (138)
Q Consensus         1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~   80 (138)
                      |+|||.+||+++||+++|+||+|++.++++++|+++|++||++|+++|||||++||.||+.++|.+|+++|||||+|+||
T Consensus         1 M~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL~~RY~lPl~~vP~~L~~~a~dIA~Y~L~   80 (138)
T protein:vir:79    1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLH   80 (138)
T ss_pred             CCCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHh
Confidence            99999999999999999999999998888999999999999999999999999999999999999999999999999999

Q ss_pred             cCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCccCCCCCCeeEEecCCccCCCCC
Q lcl|NC_018274.
81 IVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEGRNDWGADW 138 (138) Q Consensus 81 ~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~f~rd~ 138 (138) +++.+++.+++|||+|++||++|++||++||+++.+++++++++++|++++|+||||| T Consensus 81 ~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~~~~~~~r~F~Rd~ 138 (138) T protein:vir:79 81 IVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEGRNDWGADW 138 (138) T ss_pred cCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcCCCCCCceeeecCCCCCCCCC Confidence 9988888899999999999999999999999998888888899999999999999999 No 2 >protein:vir:99222 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950460;genbank:gi:119953661;genbank:GeneID:4643082 Probab=100.00 E-value=4.1e-54 Score=313.25 Aligned_cols=138 Identities=100% Similarity=1.431 Sum_probs=134.2 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHHHHHhh Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLH 80 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (138) |+|||.+||+++||+++|+||+|++.++++++|+++|++||++|+++|||||++||.||+.++|.+|+++|||||+|+|| T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL~~RY~lPl~~vP~~L~~~a~dIA~Y~L~ 80 (138) T protein:vir:99 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLH 80 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHh Confidence 99999999999999999999999998888999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCccCCCCCCeeEEecCCccCCCCC Q lcl|NC_018274. 
81 IVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEGRNDWGADW 138 (138) Q Consensus 81 ~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~f~rd~ 138 (138) +++.+++.+++|||+|++||++|++||++||+++.+++++++++++|++++|+||||| T Consensus 81 ~~~~~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~~~~~~~r~F~Rd~ 138 (138) T protein:vir:99 81 IVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEGRNDWGADW 138 (138) T ss_pred cCCCCcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcCCCCCCceeeecCCCCCCCCC Confidence 9988888899999999999999999999999998888888899999999999999999 No 3 >protein:vir:103846 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938245;genbank:gi:38229150;genbank:GeneID:2648161 Probab=100.00 E-value=9.2e-54 Score=311.35 Aligned_cols=138 Identities=67% Similarity=1.058 Sum_probs=134.0 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHHHHHhh Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLH 80 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (138) |+|||.+||+++||+++|.||+|++.++.+++|+++|++||++|+++|||||++||.|||.++|.+|+++|||||+|+|| T Consensus 1 M~Y~T~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~~eIdgyL~~RY~lPl~~vP~~L~~~a~dIA~Y~L~ 80 (138) T protein:vir:10 1 MSYCTQADLVEQYGEASIRQLSDRVNKPATTIDPAVVAQAIADADAEIDLHLHARYQLPLAQVPVVLKRVACVLAFANLH 80 (138) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccCCCcCccCHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHh Confidence 99999999999999999999999988888899999999999999999999999999999999999999999999999999 Q ss_pred cCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCccCCCCCCeeEEecCCccCCCCC Q lcl|NC_018274. 
81 IVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEGRNDWGADW 138 (138) Q Consensus 81 ~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~f~rd~ 138 (138) +++.++|++++|||+|++||++|++||++||+++.+++++++++++|++++|+||||| T Consensus 81 ~~~~~~e~~~~rY~~Ai~~L~~Ia~G~~~Lg~~~~~~~~~~~~~~~~~s~~r~Fg~d~ 138 (138) T protein:vir:10 81 TQVKDDHPAILDAERKRKLLGGISSGKLSLALTSSGTPAPIANTVQISSQRNDFGGTW 138 (138) T ss_pred cCCCCChHHHHHHHHHHHHHHHHhcCcccCCCCCCcccCCCCCceeeecCCccCCCCC Confidence 8888888899999999999999999999999999888888889999999999999999 No 4 >protein:vir:79074 Length: 150 # NCBI annotation: gp10, conserved hypothetical protein # Family: family:all:348 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111210;genbank:gi:134288797;genbank:GeneID:4960748 Probab=100.00 E-value=6.5e-49 Score=284.77 Aligned_cols=137 Identities=32% Similarity=0.504 Sum_probs=124.7 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCc-----CcCccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHH Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNK-----PATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLA 75 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~-----~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl~~~p~~L~~~~~dIA 75 (138) |+|||.+||+++||+++|+||+|++.. ..+++|+++|++||++|+++|||||++||.|||.++|.+|+++||||| T Consensus 1 M~Y~T~~Dl~~r~ge~~l~~Ltd~~~~~~~~~~~~~~d~~~i~~Al~dA~~~IDgyL~~RY~lPl~~vP~~L~~~a~dIA 80 (150) T protein:vir:79 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESAVRQAEEIVDAHLRGRYNLPLSPVPTVIKDVTVNLA 80 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccccccccccccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHH Confidence 999999999999999999999988643 447899999999999999999999999999999999999999999999 Q ss_pred HHHhhcCCC----CCHHHHHHHHHHHHHHHHHhcCccccCCCCCccCCCCCCeeEEecCCccCCC----CC Q lcl|NC_018274. 76 YANLHIVLK----EENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEGRNDWGA----DW 138 (138) Q Consensus 76 ~Y~L~~~~~----~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~f~r----d~ 138 (138) +|+||.+++ .+|++++|||+|++||++|++||++||++. 
.++++++++++|++++|+||| +| T Consensus 81 ~Y~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~-~~~~~~~~~~~v~~~~r~f~r~~l~g~ 150 (150) T protein:vir:79 81 RHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPS-GPATPEPGEMKVRARRRQFDADLLERF 150 (150) T ss_pred HHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCC-ccCCCCCCceeeecCCCccChhhccCC Confidence 999998765 368899999999999999999999999876 444456678999999999996 45 No 5 >protein:vir:107864 Length: 150 # NCBI annotation: gp36 # Family: family:all:348 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024709;genbank:gi:48696946;genbank:GeneID:2845952 Probab=100.00 E-value=1.4e-48 Score=282.90 Aligned_cols=137 Identities=31% Similarity=0.498 Sum_probs=124.2 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCc-----CcCccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHH Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNK-----PATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLA 75 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~-----~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl~~~p~~L~~~~~dIA 75 (138) |+|||.+||+++||+++|+||+|++.. ..+++|+++|++||++|+++|||||++||.|||.++|.+|+++||||| T Consensus 1 M~Y~T~~Dl~~r~ge~el~~Ltd~~~~g~~~~~~~~~d~~~i~~Al~dA~~eIDgYL~~RY~lPl~~vP~~L~~~a~dIA 80 (150) T protein:vir:10 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESSVRQAEEIVDAHLRGRYNLPLSPVPTVIKDVTVNLA 80 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccCccccchhhcCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHH Confidence 999999999999999999999988643 346899999999999999999999999999999999999999999999 Q ss_pred HHHhhcCCC----CCHHHHHHHHHHHHHHHHHhcCccccCCCCCccCCCCCCeeEEecCCccCCCC----C Q lcl|NC_018274. 76 YANLHIVLK----EENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEGRNDWGAD----W 138 (138) Q Consensus 76 ~Y~L~~~~~----~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~f~rd----~ 138 (138) +|+||.+++ .+|++++|||+|++||++|++||++||++.. 
++.++++.++|++++|+|||| | T Consensus 81 rY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~-~~~~~~~~~~v~~~~r~f~r~~l~gf 150 (150) T protein:vir:10 81 RHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPSG-PATPEPGEMKVRARRRQFDADLLERF 150 (150) T ss_pred HHHHHhcccccCCCCHHHHHHHHHHHHHHHHHhcCcccCCCCCC-CCCCCCceeeeecCCCccChhhccCC Confidence 999998765 3688999999999999999999999998764 444556789999999999964 5 No 6 >protein:vir:99848 Length: 172 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164077;genbank:gi:56692609;genbank:GeneID:3192576 Probab=100.00 E-value=2.2e-48 Score=281.83 Aligned_cols=137 Identities=26% Similarity=0.313 Sum_probs=125.0 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhcCCCc-------------------------CcCccCHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018274. 1 MS-YCTLADLIEQYSEQKIREVSDRVNK-------------------------PATTIDTVIVDRAIADADSEIDLHLHG 54 (138) Q Consensus 1 M~-Y~T~~Dl~~~~g~~el~~Ltd~~~~-------------------------~~~~~d~~~v~~Al~~A~~~idgyL~~ 54 (138) |+ |||++||+++||++||+|||++++. .++++|.++|++||++|+++|||||++ T Consensus 1 ~~mYaT~~dl~~r~g~~el~qLt~~~~~~~~~~~l~d~~~~~~~~~~~~~~~~~~g~~d~~~i~~Al~dA~aeIDgYL~~ 80 (172) T protein:vir:99 1 MAVYITLPELAERPGAEELSQAATPRPLQAVDSELLDALLRGLPVDRWTPEEIEVGHATVEVINSAVSDAQGYIDGFLQR 80 (172) T ss_pred CcccccHHHHHhhcCHHHHHHHhccccccCCHHHHHHHhhcchhhhhcccccccccccCHHHHHHHHHHHHHHHHHHHhc Confidence 99 9999999999999999999998752 357899999999999999999999999 Q ss_pred h-ccCCcccccHHHHHHHHHHHHHHhhcCCC----CCHHHHHHHHHHHHHHHHHhcCccccCCCCCccCCCCCCeeEEec Q lcl|NC_018274. 
55 R-YQLPLASVPTALKRIACGLAYANLHIVLK----EENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISE 129 (138) Q Consensus 55 R-Y~lPl~~~p~~L~~~~~dIA~Y~L~~~~~----~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~ 129 (138) | |.|||+++|.+|+++|||||+|+||++++ .+|.+++|||+|++||++|++||++||++.+.+ +++++.++|++ T Consensus 81 R~Y~lPL~~vP~~L~~~a~dIArY~L~~~r~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~-~~~~~~~~v~~ 159 (172) T protein:vir:99 81 RGYSLPLAKRYGVVTGWTRAIARYLLHQDRLGPGAEKDPIVRDYRDALKFLQLIAEGKFSLGPDDPLT-PPGGGVPQVLA 159 (172) T ss_pred ccccCCCcccchHHHHHHHHHHHHHHHhccCCcccCCHHHHHHHHHHHHHHHHHhcCccccCCCCCCC-CCCCCceeeec Confidence 9 99999999999999999999999998764 368899999999999999999999999876554 45668899999 Q ss_pred CCccCCCCC Q lcl|NC_018274. 130 GRNDWGADW 138 (138) Q Consensus 130 ~~r~f~rd~ 138 (138) ++|+||||= T Consensus 160 ~~r~F~rd~ 168 (172) T protein:vir:99 160 PARTFSHDT 168 (172) T ss_pred CCCccChhh Confidence 999999755 No 7 >protein:vir:1993 Length: 141 # NCBI annotation: Hypothetical protein # Family: family:all:348 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050640;genbank:gi:9633527;genbank:GeneID:2636296 Probab=100.00 E-value=4.7e-48 Score=280.03 Aligned_cols=136 Identities=24% Similarity=0.364 Sum_probs=124.9 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHHHHHhh Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLH 80 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (138) |+|||.+||+++||+++|.||+++.. 
.++++|+++|++||++|++||||||++||.||+.++|.+|+++|||||+|+|+ T Consensus 1 M~Y~T~~Dl~~~~ge~~l~~Lt~d~~-~~g~~d~~~i~~Al~dA~~eIdgyL~~RY~lPl~~~P~~L~~~a~dIA~Y~L~ 79 (141) T protein:vir:19 1 MNYATVNDLCARYTRTRLDILTRPKT-ADGQPDDAVAEQALADASAFIDGYLAARFVLPLTVVPSLLKRQCCVVAWFYLN 79 (141) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcCCC-CccccCHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHh Confidence 99999999999999999999997654 35789999999999999999999999999999999999999999999999999 Q ss_pred cCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcc-CCCCCCeeEEecCCccCCCCC Q lcl|NC_018274. 81 IVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGK-PAPVANTVQISEGRNDWGADW 138 (138) Q Consensus 81 ~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~-~~~~~~~~~~~~~~r~f~rd~ 138 (138) ++++ ++++++|||+|++||++|++||++||++..+. ++++.+.++|++++|+||||= T Consensus 80 ~~~~-~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~~~~~~~~r~f~r~~ 137 (141) T protein:vir:19 80 ESQP-TEQITATYRDTVRWLEQVRDGKTDPGVESRTAASPEGEDLVQVQSDPPVFSRKQ 137 (141) T ss_pred cCCC-ChHHHHHHHHHHHHHHHHhcCccccCCCCCCCCCCCCCceeEeecCCcccCccc Confidence 8875 46799999999999999999999999887654 445678899999999999876 No 8 >protein:vir:98481 Length: 136 # NCBI annotation: ORF3 # Family: family:all:32440 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958281;genbank:gi:41057255;uniprot:Q38596;genbank:GeneID:2732865 Probab=97.28 E-value=5.2e-06 Score=49.54 Aligned_cols=118 Identities=15% Similarity=0.163 Sum_probs=69.2 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHHHHHh Q lcl|NC_018274. 1 MS-YCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANL 79 (138) Q Consensus 1 M~-Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl~~~p~~L~~~~~dIA~Y~L 79 (138) |. |+|.+|+.++++... .+ .+.+.+.++.-|++|+..|..+. 
.-+.++-|..++.++|++++=.+ T Consensus 1 M~~fAtv~Dl~~rw~~~~----~d------ee~~ra~~~~lL~dAS~~ir~~~----p~~~~~~~~~~~~V~~~~V~R~~ 66 (136) T protein:vir:98 1 MAAYATVEDYQARAAVTL----PD------GSPRRAQVEAYLDDASALMARHI----PTGHTPDPGTLRAICVAVVRRVM 66 (136) T ss_pred CCccCCHHHHHHHhccCC----CC------chhHHHHHHHHHHHHHHHHHHhC----CCCCCCChhHHHHHHHHHHHHHh Confidence 88 999999999987411 11 12223457778999999987764 33444558899999999998656 Q ss_pred hcCCCCCHHHHHHHHHHHHHHHHHhcCccc--------cCCCCCccCCCCCCeeEE-------ecCCccCCCCC Q lcl|NC_018274. 80 HIVLKEENPVYKTAEHLRKLLSGIANGKLS--------LALDADGKPAPVANTVQI-------SEGRNDWGADW 138 (138) Q Consensus 80 ~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~--------L~~~~~~~~~~~~~~~~~-------~~~~r~f~rd~ 138 (138) .+..+....-.--|-+.+.+ .|.+- ||+....-.. ..+...+ .-.+-.|+.|| T Consensus 67 ~np~G~~s~TaG~ys~s~t~-----~G~Lylt~~E~~~Lg~~rqr~~~-~d~a~si~~~~~~~~~~~dp~~~~~ 134 (136) T protein:vir:98 67 ANPGGYRQRTIGQYAETLGE-----DGGLYLTEDEKGQLQPPDQTAPD-ADAAYSLDLDPGTRAWVDDPAGCGW 134 (136) T ss_pred hCCCCcccccchhHHHhhhc-----CCCcccChHHHHHhCCCCCcccc-cccceecccCCCcCCcCCCCCCCCC Confidence 43333222122357776665 46643 3332211000 0111111 22456899999 No 9 >protein:vir:43 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463469;swissprot:trembl:q9t1b5;genbank:gi:16798791;uniprot:Q9T1B5;genbank:GeneID:922374 Probab=95.37 E-value=0.00054 Score=38.50 Aligned_cols=99 Identities=16% Similarity=0.164 Sum_probs=63.6 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccC-Cc----ccccHHHHHHHHHHH Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQL-PL----ASVPTALKRIACGLA 75 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~l-Pl----~~~p~~L~~~~~dIA 75 (138) |+|+|.+.+++.+|. 
..+.++-....+..|+..||.+...|+.- -+ ..+|..++.+||..+ T Consensus 1 M~Y~d~~~Y~~~y~g--------------~~i~e~~F~~l~~rAs~~ID~~T~~ri~~~~~~~~~~~~~~~vk~A~c~q~ 66 (131) T protein:vir:43 1 MPYTTLEFYNDEYAG--------------EHLEQDEFDKLLKHAERKIDSVTFYRIRKGGIESFSEFIQHQIQLATCNQI 66 (131) T ss_pred CCCCCHHHHHHhhCC--------------CCCCHhHHHHHHHHHHHHHHHHhcccccccCccccchhhHHHHHHHHHHHH Confidence 999999999988753 23455668899999999999999999752 11 356888999999999 Q ss_pred HHHhhcCC-------C----------------CCH----HHHHHHHHHHHHHHHHhcCccccCCCCC Q lcl|NC_018274. 76 YANLHIVL-------K----------------EEN----PVYKTAEHLRKLLSGIANGKLSLALDAD 115 (138) Q Consensus 76 ~Y~L~~~~-------~----------------~~~----~v~~rY~~Ai~~L~~va~G~~~L~~~~~ 115 (138) -|.-.... . ... .-..-++.|..||+. .|-+-=|+.-. T Consensus 67 e~~~~~g~~s~~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~--TGLlyrGV~~~ 131 (131) T protein:vir:43 67 EYFKEAGGTSELAVSKPDNVSIGRTSISDSNFASTATSLNSGLIGSDVRSYLAH--TGLLYNGVGVR 131 (131) T ss_pred HHHHHhHHHhhhhccccCeeecCceEEeecccccchhhhchhhhHHHHHHHHhc--cCCeecCCCCC Confidence 87632110 0 000 001136677777762 23322232222 No 10 >protein:vir:2432 Length: 124 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046834;genbank:gi:9630402;genbank:GeneID:1261576 Probab=95.11 E-value=0.00049 Score=38.74 Aligned_cols=117 Identities=10% Similarity=0.028 Sum_probs=65.8 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccC-----CcccccHHHHHHHHHHH Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQL-----PLASVPTALKRIACGLA 75 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~l-----Pl~~~p~~L~~~~~dIA 75 (138) |+|+|.+|+.++++.. |++ ...+.++.-|++|+..|-. |++- ....-|+.++.++|++. 
T Consensus 1 ~~~At~~Dv~~rw~r~----Lt~--------~E~~~ve~lL~dAs~~ir~----r~P~l~~~~~~~~~~~~v~~V~a~~V 64 (124) T protein:vir:24 1 MAYATADDVVTLWAKE----PEP--------EVMALIERRLEQVERMIRR----RIPDLDARVSSDIFRADLIDIEADAV 64 (124) T ss_pred CCCCCHHHHHHHhCCC----CCH--------HHHHHHHHHHHHHHHHHHh----cCCCcchhcCCCCChhhHHHHHHHHH Confidence 9999999999998531 221 1345688889999998864 5442 22244778999999988 Q ss_pred HHHhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCC--ccCCCCCCeeEEecCCccCC Q lcl|NC_018274. 76 YANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDAD--GKPAPVANTVQISEGRNDWG 135 (138) Q Consensus 76 ~Y~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~--~~~~~~~~~~~~~~~~r~f~ 135 (138) +=-+.+..+-...-.-.|-+.+.+ ....|++-|.-..- -.+....+.+.++...-.=+ T Consensus 65 ~R~~rnP~G~~s~T~G~Ys~sl~~--~~~~g~Lylt~~E~~~Lg~~r~~~~~~i~p~~~~~~ 124 (124) T protein:vir:24 65 LRLVRNPEGYLSETDGAYTYQLQA--DLSQGKLVILDEEWTTLGVNRLSRMSTLVPNIVMPT 124 (124) T ss_pred HHHhhCCCCceecccchhHHhhhh--cccCCceeeCHHHHHhhCcccccceeEeecceeeCC Confidence 776654333221122567777776 45567765532110 01111122223322211111 No 11 >protein:vir:80967 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468394;genbank:gi:157324968;genbank:GeneID:5601402 Probab=95.07 E-value=0.00077 Score=37.65 Aligned_cols=99 Identities=15% Similarity=0.145 Sum_probs=63.7 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCC-c----ccccHHHHHHHHHHH Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLP-L----ASVPTALKRIACGLA 75 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lP-l----~~~p~~L~~~~~dIA 75 (138) |+|+|.+.+.+.|+. 
..+.++-....+..|+..||.+...|+.-- + ..+|..++.+||..+ T Consensus 1 M~Y~d~~~Y~~~y~G--------------~~i~e~~F~~l~~rAs~~ID~~T~~ri~~~~~d~~~~~~~~~vk~A~c~q~ 66 (131) T protein:vir:80 1 MPYTTLEFYTNEYAG--------------EHLEQDEFAKLLKHAERKIDSVTFYRIRKSGIEAFSEFIQHQIQLATCNQI 66 (131) T ss_pred CCCCCHHHHHHhhCC--------------CCCchhHHHHHHHHHHHHHHHHhcccccccccccCchhHHHHHHHHHHHHH Confidence 999999999988753 234455688999999999999999997521 1 357888999999998 Q ss_pred HHHhhcCC-------C----------------CC----HHHHHHHHHHHHHHHHHhcCccccCCCCC Q lcl|NC_018274. 76 YANLHIVL-------K----------------EE----NPVYKTAEHLRKLLSGIANGKLSLALDAD 115 (138) Q Consensus 76 ~Y~L~~~~-------~----------------~~----~~v~~rY~~Ai~~L~~va~G~~~L~~~~~ 115 (138) -|.-.... . .. ..-...+++|+.||+. .|-+-=|+.-. T Consensus 67 e~~~~~g~~~~~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~--TGLlyrGV~~~ 131 (131) T protein:vir:80 67 EYFKEAGGTSELAVSKPDNVSIGRTSISDSNFASTATSLNSGLVGSDVRSYLAH--TGLLYNGVGVR 131 (131) T ss_pred HHHHHhhhhhhhcccccCeeeeCceEEeeccccchhhhhhhhhhHHHHHHHHhc--cCCeecCCCCC Confidence 87632110 0 00 0111246667777762 23322233222 No 12 >protein:vir:2505 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:28222 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569746;genbank:gi:18496896;genbank:GeneID:932265 Probab=95.05 E-value=2.7e-05 Score=45.61 Aligned_cols=108 Identities=14% Similarity=0.202 Sum_probs=68.1 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHHHHHhh Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLH 80 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (138) -+++|.+|+.++++. +|+++ +...+..-|++|+..|+|||. +|.+| ++.|..++++|+.|+.=-|. 
T Consensus 5 ~alAtvdDv~~~lrr----~Lt~d--------E~~~a~~Ll~eAsdlI~g~l~-~~~vp-~~~p~~v~rVvA~ivarAlt 70 (128) T protein:vir:25 5 KALATSQDVKRALRR----DLTEA--------EQTDLSELLAEATDLVVGYLH-PYPVP-TPTPGPIKRVVASMVAAVLT 70 (128) T ss_pred hhccCHHHHHHHhcC----CCCHH--------HHHHHHHHHhcchheeeeecC-CCCCC-CCCCchHHHHHHHHHHHHhh Confidence 358999999999865 34332 244566779999999999997 56665 46678899999999887775 Q ss_pred cCCC-CCHHHHHHHHHHHHHHHHHhcCccccCCCCCccCCCCCCeeEEecCCccCCCCC Q lcl|NC_018274. 81 IVLK-EENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEGRNDWGADW 138 (138) Q Consensus 81 ~~~~-~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~f~rd~ 138 (138) .... .++ ..-+.+|-++-+ -+..+.++++-.++..+..=|=| T Consensus 71 r~~~~~pe------------~~S~TAgpfs~~----ft~~~~~~g~yLTaa~k~~Lrp~ 113 (128) T protein:vir:25 71 RPTQILPE------------TQSLTADGFGVT----FTPGGNSPGPYLSAALKQRLRPY 113 (128) T ss_pred CCCccCCC------------ceeeeccccccc----ccCCCCCCCceEcHHHHhhcccc Confidence 4322 121 111233433222 23334445555666655555667 No 13 >protein:vir:78478 Length: 149 # NCBI annotation: gp16 # Family: family:all:2817 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491587;genbank:gi:157786410;genbank:GeneID:5625630 Probab=94.21 E-value=0.00063 Score=38.14 Aligned_cols=121 Identities=12% Similarity=0.173 Sum_probs=60.6 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCCc-cccc---HHHHHHHHHHHH Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPL-ASVP---TALKRIACGLAY 76 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl-~~~p---~~L~~~~~dIA~ 76 (138) |+|+|.+|+.++++.. 
||+ ...++++.-|++|+..|-.-+- .|+- .+.| +.++.++|++.+ T Consensus 1 ~afAtv~Dve~rw~r~----LT~--------eE~~~ae~lL~dAs~~IR~~iP---~La~~~~dp~~~a~v~~V~~~mV~ 65 (149) T protein:vir:78 1 MAYAEPSDVVARLGRP----LTD--------DEETQVETFLEDAEIEIRSRIP---DLDDKAEDEDYLKRVIKVEASAVT 65 (149) T ss_pred CCcCCHHHHHHHhCCC----CCH--------HHHHHHHHHHHHHHHHHHHhcc---ccccccCCcchhhHHHHHHHHHHH Confidence 9999999999998532 221 2245788999999999866331 1221 1223 567899999887 Q ss_pred HHhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCC--ccCCCCCCeeEEe-------cCC-ccCC-CCC Q lcl|NC_018274. 77 ANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDAD--GKPAPVANTVQIS-------EGR-NDWG-ADW 138 (138) Q Consensus 77 Y~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~--~~~~~~~~~~~~~-------~~~-r~f~-rd~ 138 (138) =-+.+..+-...-.-.|-+.+.+ ....|++-|.-..- -......+.+.+. +++ +-|+ -+| T Consensus 66 R~~rnpeG~~S~T~G~YS~slt~--~np~G~LylT~~E~a~LG~~r~~G~~~i~p~~~~~~~~~~~~~~~~~~ 136 (149) T protein:vir:78 66 RLIRNPDGYIGETDGNYSYQLNW--RLNTGAIEITDKEWAQLGLSKNVGVLNVRPKTPLERSGEYPAFGSVEW 136 (149) T ss_pred HHhcCCCCeeeeecchhhhhhhc--cCCCCceeeCHHHHHhhCCcccccceeecccCccccCCCCCcccceee Confidence 76644433211112456666665 23345443321100 0000001111111 111 1222 234 No 14 >protein:vir:78254 Length: 149 # NCBI annotation: gp16 # Family: family:all:2817 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491668;genbank:gi:157786492;genbank:GeneID:5625732 Probab=94.21 E-value=0.00063 Score=38.14 Aligned_cols=121 Identities=12% Similarity=0.173 Sum_probs=60.6 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCCc-cccc---HHHHHHHHHHHH Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPL-ASVP---TALKRIACGLAY 76 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl-~~~p---~~L~~~~~dIA~ 76 (138) |+|+|.+|+.++++.. 
||+ ...++++.-|++|+..|-.-+- .|+- .+.| +.++.++|++.+ T Consensus 1 ~afAtv~Dve~rw~r~----LT~--------eE~~~ae~lL~dAs~~IR~~iP---~La~~~~dp~~~a~v~~V~~~mV~ 65 (149) T protein:vir:78 1 MAYAEPSDVVARLGRP----LTD--------DEETQVETFLEDAEIEIRSRIP---DLDDKAEDEDYLKRVIKVEASAVT 65 (149) T ss_pred CCcCCHHHHHHHhCCC----CCH--------HHHHHHHHHHHHHHHHHHHhcc---ccccccCCcchhhHHHHHHHHHHH Confidence 9999999999998532 221 2245788999999999866331 1221 1223 567899999887 Q ss_pred HHhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCC--ccCCCCCCeeEEe-------cCC-ccCC-CCC Q lcl|NC_018274. 77 ANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDAD--GKPAPVANTVQIS-------EGR-NDWG-ADW 138 (138) Q Consensus 77 Y~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~--~~~~~~~~~~~~~-------~~~-r~f~-rd~ 138 (138) =-+.+..+-...-.-.|-+.+.+ ....|++-|.-..- -......+.+.+. +++ +-|+ -+| T Consensus 66 R~~rnpeG~~S~T~G~YS~slt~--~np~G~LylT~~E~a~LG~~r~~G~~~i~p~~~~~~~~~~~~~~~~~~ 136 (149) T protein:vir:78 66 RLIRNPDGYIGETDGNYSYQLNW--RLNTGAIEITDKEWAQLGLSKNVGVLNVRPKTPLERSGEYPAFGSVEW 136 (149) T ss_pred HHhcCCCCeeeeecchhhhhhhc--cCCCCceeeCHHHHHhhCCcccccceeecccCccccCCCCCcccceee Confidence 76644433211112456666665 23345443321100 0000001111111 111 1222 234 No 15 >protein:vir:94761 Length: 132 # NCBI annotation: unknown # Family: family:all:1271 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996708;genbank:gi:45597423;genbank:GeneID:2769029 Probab=94.12 E-value=0.00082 Score=37.49 Aligned_cols=116 Identities=9% Similarity=0.060 Sum_probs=57.2 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCCcc-------cccHHHHHHHH Q lcl|NC_018274. 1 MS-YCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLA-------SVPTALKRIAC 72 (138) Q Consensus 1 M~-Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl~-------~~p~~L~~~~~ 72 (138) |. |||.+|+..+++ +|++++ .++++.-|++|+..|..=.-.++.-+.. 
..+.+++++|| T Consensus 1 m~~fAtv~Dl~~r~r-----~L~~dE--------~~ra~~LL~dAs~~iR~~~~~~~~~~~~~~~~~~d~~~~~~k~V~~ 67 (132) T protein:vir:94 1 MNPFATVDDLTMLWR-----PLKGDE--------KERAEKLLEIVSDTLREEADKVGRDLDVMISEKPSYFSSVVKSVTV 67 (132) T ss_pred CCCcCCHHHHHHHhc-----cCChhH--------HHHHHHHHHHHHHHHHHHHhhhccccccccCCCCccchhHHHHHHH Confidence 87 999999999985 344321 4778899999999997655444432211 13578999999 Q ss_pred HHHHHHhhcCCCCC---H--HHHHHHHHHHHHHHHHhcCccccCCCC---CccCCCCCCeeEEecCC Q lcl|NC_018274. 73 GLAYANLHIVLKEE---N--PVYKTAEHLRKLLSGIANGKLSLALDA---DGKPAPVANTVQISEGR 131 (138) Q Consensus 73 dIA~Y~L~~~~~~~---~--~v~~rY~~Ai~~L~~va~G~~~L~~~~---~~~~~~~~~~~~~~~~~ 131 (138) ++++=-|-...... + .-.--|-+...|+ ...|.+-|.-.. -+-.-+..+.+-+.... T Consensus 68 ~~V~Ral~~~~~~~g~tq~S~TaG~ys~S~T~~--np~G~lylt~~e~~~LGl~~~r~~~i~~~~~~ 132 (132) T protein:vir:94 68 DIVARTLMTSTDQEPMTQTTESALGYSVSGSYL--VPGGGLFIKNSELSRLGLKKQRFGVIDFYGND 132 (132) T ss_pred HHHHHHhcCCCCCCCceeeeeecccceeeeeee--cCCCCceeChHHHHhhCCCCCceEEEeecCCC Confidence 99988775432111 0 0111222222221 112222111000 00000111111111111 No 16 >protein:vir:98900 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:3882 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164420;genbank:gi:56694910;genbank:GeneID:3197290 Probab=93.68 E-value=0.0021 Score=35.23 Aligned_cols=112 Identities=17% Similarity=0.127 Sum_probs=61.6 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCC-c----ccccHHHHHHHHHHH Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLP-L----ASVPTALKRIACGLA 75 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lP-l----~~~p~~L~~~~~dIA 75 (138) |+|+|.+.+.+..| ..++++..++.+..|+..||.+...||.-. 
+ ..++..++.++|..+ T Consensus 1 M~Y~t~~~Y~~~~G---------------~~i~e~~F~~l~~rAs~~ID~iT~~ri~~~~~~~d~~~~~~~vk~A~c~qi 65 (132) T protein:vir:98 1 MPYLTYEEFMDLNG---------------RDIDDKKFEKLLPKASAIIDGVTGHFYQKVDMEKDNAWRVNQFKLALCAQI 65 (132) T ss_pred CCCCCHHHHHhhcC---------------CCCCHHHHHHHHHHHHHHHHHHhcccccCCCccccChHHHHHHHHHHHHHH Confidence 99999999986333 245566789999999999999999998532 2 234456788888666 Q ss_pred HHHhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCccCCC-CCCeeEEecCCccCCCCC Q lcl|NC_018274. 76 YANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAP-VANTVQISEGRNDWGADW 138 (138) Q Consensus 76 ~Y~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~-~~~~~~~~~~~r~f~rd~ 138 (138) -|.-... .. .++.+..-++-+.-|+.++.......... ..+...+.. --++| T Consensus 66 ey~~~~G-~~------sae~~~~~~~S~svG~~Svs~~s~~~~~~~~~~~~~~~~----~a~~~ 118 (132) T protein:vir:98 66 EYFDALG-AT------TFEEINNSPQTFQAGRTSVSNASRYNPSGANESKPLVAE----DVYIY 118 (132) T ss_pred HHHHhcc-ch------hhhhccCccceeeeCcEEEEeeccCCcccccccccchHH----HHHHH Confidence 6542111 11 12222223555677777775422111111 111111100 00122 No 17 >protein:vir:9576 Length: 131 # NCBI annotation: gp42 # Family: family:all:1271 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862881;genbank:gi:32469473;genbank:GeneID:1461318 Probab=93.00 E-value=0.0018 Score=35.65 Aligned_cols=115 Identities=10% Similarity=0.046 Sum_probs=55.7 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhc------cCCcccccHHHHHHHHH Q lcl|NC_018274. 1 MS-YCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRY------QLPLASVPTALKRIACG 73 (138) Q Consensus 1 M~-Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY------~lPl~~~p~~L~~~~~d 73 (138) |. |||.+|+..++. . |+++ ..++++.-|++|+..|..-+-... 
..+-...+..++.+||+ T Consensus 1 m~~fAtv~D~~~rwr--~---Lt~~--------E~~ra~~LL~~As~~ir~~~p~~~~~l~~~~~~~~~~~~~~~~V~~~ 67 (131) T protein:vir:95 1 MENFATVEDLKKLWR--A---LKFD--------EEKRAEALLEVVSHSLRVEAKKVGKDLDGLVATDPSFTMVVKSVTVD 67 (131) T ss_pred CCccCCHHHHHHHhc--C---CCHH--------HHHHHHHHHHHHHHHHHHhhhhccCCccccccCCccchHHHHHHHHH Confidence 87 999999999984 2 3321 145788999999999876554321 11223446799999999 Q ss_pred HHHHHhhcCCCCCH-----HHHHHHHHHHHHHHHHhcCccccCCCCCccCCCCCCeeEEecCCccCCCCC Q lcl|NC_018274. 74 LAYANLHIVLKEEN-----PVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEGRNDWGADW 138 (138) Q Consensus 74 IA~Y~L~~~~~~~~-----~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~f~rd~ 138 (138) +++.-|-....... .-.--|-+...|+ ...|.+-|.-. .- ... --++.|.|+=|- T Consensus 68 ~V~Ral~~~~~~~G~tq~S~TaG~ys~S~t~~--~p~g~lylt~~--e~-----~~L-Gl~~~r~~~i~~ 127 (131) T protein:vir:95 68 VVARTLMTSTDQEPMTQVAESALGYSFSGSYL--VPGGGLFIKDS--EL-----KRL-GLKKQRYGVIDI 127 (131) T ss_pred HHHHHhcCCCCCCCceeeeeecccceeeeeee--cCCCCceeChH--HH-----HHh-CCCCCceeEEee Confidence 99998854321110 0011122222221 11122111000 00 000 000111111111 No 18 >protein:vir:7773 Length: 123 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817607;genbank:gi:29566037;genbank:GeneID:1259231 Probab=92.28 E-value=0.0029 Score=34.47 Aligned_cols=116 Identities=16% Similarity=0.194 Sum_probs=62.2 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCC-ccccc---HHHHHHHHHHHH Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLP-LASVP---TALKRIACGLAY 76 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lP-l~~~p---~~L~~~~~dIA~ 76 (138) |+|+|.+|+.++++. 
.|++ ...+.++.-|++|+..|-.-+= .++ ...-| +.++.++|++.+ T Consensus 1 ~~~At~~Dv~ar~~r----~LT~--------~E~~~ve~lL~dAs~~ir~r~P---~l~~~a~d~~~~~~~~~V~~~~V~ 65 (123) T protein:vir:77 1 MPYATASDVTSRWAR----QPTD--------EETALINVRLADVERMIKRRIP---DLATKVTDPDYLEDLKQVEADAVL 65 (123) T ss_pred CCcCCHHHHHHHhCC----CCCH--------HHHHHHHHHHHHHHHHHHHhcc---CcccccCCcchhHHHHHHHHHHHH Confidence 999999999999853 1221 2345688889999999866332 122 11233 678899998877 Q ss_pred HHhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCC---CccCCCCCCeeEEecCCccCC Q lcl|NC_018274. 77 ANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDA---DGKPAPVANTVQISEGRNDWG 135 (138) Q Consensus 77 Y~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~---~~~~~~~~~~~~~~~~~r~f~ 135 (138) =-+.+..+-...-.-.|-+.+.+ ....|++-|.-.. -+..- ++...++.....=+ T Consensus 66 R~~rnpeG~~s~T~G~ys~sl~~--a~~~g~Lylt~~E~~~Lg~~~--~~~~~i~p~~~~~~ 123 (123) T protein:vir:77 66 RLVRNPEGYLSETDGNYTYMLRS--DLASGKLEIFPEEWEILGYRR--SRMTVIVPNPVMPT 123 (123) T ss_pred HHhhCCCCceecccchhhhhhcc--cCCCCcceeCHHHHHhhcCCC--CceeEEeeceecCC Confidence 65543333211112467777664 5666776553211 01111 11122222211111 No 19 >protein:vir:1640 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695061;genbank:gi:23455752;genbank:GeneID:955488 Probab=90.64 E-value=0.0046 Score=33.41 Aligned_cols=116 Identities=10% Similarity=0.103 Sum_probs=60.0 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhc-cCC---c---ccccHHHHHHHH Q lcl|NC_018274. 1 MS-YCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRY-QLP---L---ASVPTALKRIAC 72 (138) Q Consensus 1 M~-Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY-~lP---l---~~~p~~L~~~~~ 72 (138) |. |||.+|+..+++ +|+++ ..++++.-|++|+..|..=+-.+. .++ . 
...+..++.+|| T Consensus 1 m~~fAtv~Dv~~r~r-----~L~~~--------E~~ra~~lL~dAs~~ir~~~p~~~~~l~a~~~e~~~~~~~~~~~V~~ 67 (132) T protein:vir:16 1 MNPFATVDDLTMLWR-----PLKGD--------EKERAEKLLEIVSDSLREEADKVGRDLYAMIAEKPSYFASVVKSVTV 67 (132) T ss_pred CCccCCHHHHHHHhc-----CCCHh--------HHHHHHHHHHHHHHHHHHhhhhhccccccccccccccchhHHHHHHH Confidence 87 999999999985 23322 146788999999999865543332 221 1 123567999999 Q ss_pred HHHHHHhhcCCCCC-----HHHHHHHHHHHHHHHHHhcCccccCCCC---CccCCCCCCeeEEecCC Q lcl|NC_018274. 73 GLAYANLHIVLKEE-----NPVYKTAEHLRKLLSGIANGKLSLALDA---DGKPAPVANTVQISEGR 131 (138) Q Consensus 73 dIA~Y~L~~~~~~~-----~~v~~rY~~Ai~~L~~va~G~~~L~~~~---~~~~~~~~~~~~~~~~~ 131 (138) ++++=-|-...... ..-.-.|-+...|+ ...|.+-|.-.. -+-.-..-+.+-+.... T Consensus 68 ~~V~Ral~~~~~~~G~tq~S~TaG~ys~S~t~~--~p~G~lylt~~e~~~LG~~~~r~~~i~~~~~~ 132 (132) T protein:vir:16 68 DIVARTLMTSTDQEPMTQTTESALGYSVSGSYL--VPGGGLFIKNSELSRLGLKKQRFGVIDFYGND 132 (132) T ss_pred HHHHHHhcCCCCCCCceeeeeeccchheeeeee--cCCCcceeChHHHHhhCCCCCceEEEeecCCC Confidence 99987776432211 11122344444443 223443331100 00011111222222222 No 20 >protein:vir:9761 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795523;genbank:gi:28876281;genbank:GeneID:1257822 Probab=90.41 E-value=0.0062 Score=32.67 Aligned_cols=115 Identities=11% Similarity=0.077 Sum_probs=58.5 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhh-ccCCcc-----cccHHHHHHHHH Q lcl|NC_018274. 1 MS-YCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGR-YQLPLA-----SVPTALKRIACG 73 (138) Q Consensus 1 M~-Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~R-Y~lPl~-----~~p~~L~~~~~d 73 (138) |. |||.+|+..+++. |++++ .++++.-|++|+..|...+-.. +.+|.. 
..+.+++.+||+ T Consensus 1 m~~fATv~Dv~~rwr~-----Lt~dE--------~~ra~~LL~dAS~~iR~~~p~~g~~~~~~~~~~~~~~~~~k~V~~~ 67 (140) T protein:vir:97 1 MGNFATTDDVILLWRP-----LSVDE--------LKRANALLKVVSDTLRMEADKVGKDLDKTMVDKPYFVNVIKSVTVD 67 (140) T ss_pred CCcCCCHHHHHHHhcC-----CCHhH--------HHHHHHHHHHHHHHHHHhhhhccCCcchhcccCccchhHHHHHHHH Confidence 87 9999999999852 33221 4678899999999998777533 445422 235688999999 Q ss_pred HHHHHhhcCCCC---CH--HHHHHHHHHHHHHHHHhcCccccCCCCCccCCCCCCeeEEecCCccCC--------CCC Q lcl|NC_018274. 74 LAYANLHIVLKE---EN--PVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEGRNDWG--------ADW 138 (138) Q Consensus 74 IA~Y~L~~~~~~---~~--~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~f~--------rd~ 138 (138) |++=-|-..... ++ .-.--|-+...|+ ...|.+-|.-. +-. ..- -.+.|.|+ ||= T Consensus 68 mV~Ral~~~~d~~G~tq~S~TaG~ys~S~T~~--np~G~lylt~~--e~~-----~LG-l~~~r~~~i~~~g~~~~~~ 135 (140) T protein:vir:97 68 IVARTLMTSTQGEPMSQESQSALGYTWSGTYL--VPGGGLFIKDN--ELK-----RLG-LKKQRYGGIELYGEIKRDN 135 (140) T ss_pred HHHHHhcCCCCCCcceeeeeeccchhheeeee--cCCCCceeChH--HHH-----HhC-CCCCceeeecccCccccCc Confidence 987766422111 00 1122333333332 11333222100 000 000 01122222 111 No 21 >protein:vir:4228 Length: 125 # NCBI annotation: predicted 14.0Kd protein # Family: family:all:2817 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039683;swissprot:sw:q05225;genbank:gi:9625449;uniprot:Q05225;genbank:GeneID:2942926 Probab=90.17 E-value=0.0042 Score=33.64 Aligned_cols=120 Identities=11% Similarity=0.052 Sum_probs=63.2 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhh---hccCCcccccHHHHHHHHHHHHH Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHG---RYQLPLASVPTALKRIACGLAYA 77 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~---RY~lPl~~~p~~L~~~~~dIA~Y 77 (138) |+|+|.+|+.++++.. |++ .....|+.-|++|+..|-..+=. |-. 
-...-++.++.++.+..+= T Consensus 1 m~~A~~eDV~a~w~r~----lt~--------~e~~~v~~~L~~Ae~~Ir~riPdL~~r~~-~~~~~~~~v~~Vea~aV~R 67 (125) T protein:vir:42 1 MAYATAEDVVTLWAKE----PEP--------EVMALIERRLQQIERMIKRRIPDLDVKAA-ASATFRADLIDIEADAVLR 67 (125) T ss_pred CCcccHhHHHHHhCCC----CCh--------HHHHHHHHHHHHHHHHHHHhCCCchhhhc-ccCcchhhHHHHHHHHHHH Confidence 9999999999998642 221 24567888899999887544311 000 0234467778887765554 Q ss_pred HhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCC--ccCCCCCCeeEEecCCccCC Q lcl|NC_018274. 78 NLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDAD--GKPAPVANTVQISEGRNDWG 135 (138) Q Consensus 78 ~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~--~~~~~~~~~~~~~~~~r~f~ 135 (138) ...+..+-...-...|-.-+.+ +.+.|++-|..+.- -.++...+.+.+....-.=+ T Consensus 68 v~RNpeGy~s~T~G~Ys~~l~~--~~~~g~L~it~eEw~~L~p~~~~g~~~i~P~~~~~~ 125 (125) T protein:vir:42 68 LVRNPEGYLSETDGAYTYQLQA--DLSQGKLTILDEEWEILGVNSQKRMAVIVPNVVMPT 125 (125) T ss_pred HHhCCCccccccchhHHHhhhc--ccccCceeeCHHHHHhhCccccccceeecccceeCC Confidence 3333322111111456555555 67788876642211 11212333333333221111 No 22 >protein:vir:104088 Length: 125 # NCBI annotation: gp20 # Family: family:all:2817 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655599;genbank:gi:109392470;genbank:GeneID:4156956 Probab=89.54 E-value=0.0035 Score=34.04 Aligned_cols=120 Identities=11% Similarity=0.067 Sum_probs=60.7 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhh---hccCCcccccHHHHHHHHHHHHH Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHG---RYQLPLASVPTALKRIACGLAYA 77 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~---RY~lPl~~~p~~L~~~~~dIA~Y 77 (138) |+|+|.+|+.++++.. |++ .....|+.-|++|+..|-..+=. |-. 
--...+..++.++.+..+= T Consensus 1 ma~A~~~Dv~~~w~r~----lT~--------~E~~~v~~~L~~Ae~~Ir~riP~L~~r~~-a~~~~~~~v~~Vea~aV~R 67 (125) T protein:vir:10 1 MAYANAQDVVTLWAKE----PEP--------EVMELIERRLAQVERMIKRRIPNLDLKVA-ADATFQADLIDIEADAVLR 67 (125) T ss_pred CCcCCHHHHHHHhCCC----CCH--------HHHHHHHHHHHHHHHHHHHhCCChhhhhh-cCCCccccHHHHHHHHHHH Confidence 9999999999998631 221 24567888899999987543310 000 0234456667666654443 Q ss_pred HhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCC--ccCCCCCCeeEEecCCccCC Q lcl|NC_018274. 78 NLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDAD--GKPAPVANTVQISEGRNDWG 135 (138) Q Consensus 78 ~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~--~~~~~~~~~~~~~~~~r~f~ 135 (138) ...+..+-...-...|-+-+.+ +.+.|++-|.-+.- -.+....+.+.++...-.=+ T Consensus 68 v~rNPeGy~s~T~G~Ys~~l~~--~~~~g~L~it~~Ew~~Lg~~r~s~~~~i~p~~~~~~ 125 (125) T protein:vir:10 68 LVRNPEGYISETDGAYTYQLQT--DLSQGRLTILDDEWTTLGVNRLSRMSVIAPNIVMPT 125 (125) T ss_pred HhcCCCcccccccchhHHhhhc--ccccCceeeCHHHHHhhccccccceeeeecccccCC Confidence 3322222111111345555555 66778876642210 12222233444433221112 No 23 >protein:vir:1887 Length: 108 # NCBI annotation: gp6 # Family: family:all:6764 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037667;genbank:gi:9634125;genbank:GeneID:1262472 Probab=88.16 E-value=0.023 Score=29.61 Aligned_cols=98 Identities=11% Similarity=0.094 Sum_probs=55.2 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhh-ccCCcccccHHHHHHHHHHHHHHh Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGR-YQLPLASVPTALKRIACGLAYANL 79 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~R-Y~lPl~~~p~~L~~~~~dIA~Y~L 79 (138) |.++|+++++....-+ ..-|.+.|+.-|..|.+.|-+|++.. 
|..+ ..+|..++..++-++ =++ T Consensus 6 M~~vtLee~K~hLRid-------------~dddD~lI~~~i~AA~~~v~~~~~~~~~~~~-~~~p~~ik~AiLllv-~~~ 70 (108) T protein:vir:18 6 LDVISLSLFKQQIEFE-------------EDDRDELITLYAQAAFDYCMRWCDEPAWKVA-ADIPAAVKGAVLLVF-ADM 70 (108) T ss_pred ccccCHHHHHHHcCCC-------------CCcchHHHHHHHHHHHHHHHHHhCCcccccc-cccchHHHHHHHHHH-HHH Confidence 9999999999875421 23467889999999999999999864 3333 356777776666444 445 Q ss_pred hcCC-CCCH-HHHHHHHHHHHHHH--HHhcCccccCCCCCccCCCCCCe Q lcl|NC_018274. 80 HIVL-KEEN-PVYKTAEHLRKLLS--GIANGKLSLALDADGKPAPVANT 124 (138) Q Consensus 80 ~~~~-~~~~-~v~~rY~~Ai~~L~--~va~G~~~L~~~~~~~~~~~~~~ 124 (138) |.+| +..+ +...- .-+..+|. +-=.|+ |.. +.|+ T Consensus 71 YenRE~~~~~~~~~~-~~~~~LL~pYR~~~g~-----~~~-----~~~~ 108 (108) T protein:vir:18 71 FEHRTAQSEVQLYEN-AAAERMMFIHRNWRGK-----AES-----EEGS 108 (108) T ss_pred Hhcccccccchhhhh-HHHHHHHHHHHhcCCC-----CCc-----ccCC Confidence 5444 3322 11111 12222332 222233 211 1111 No 24 >protein:vir:192 Length: 108 # NCBI annotation: Gp6 # Family: family:all:6764 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037702;genbank:gi:9634167;genbank:GeneID:1262532 Probab=88.16 E-value=0.023 Score=29.61 Aligned_cols=98 Identities=11% Similarity=0.094 Sum_probs=55.2 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhh-ccCCcccccHHHHHHHHHHHHHHh Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGR-YQLPLASVPTALKRIACGLAYANL 79 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~R-Y~lPl~~~p~~L~~~~~dIA~Y~L 79 (138) |.++|+++++....-+ ..-|.+.|+.-|..|.+.|-+|++.. 
|..+ ..+|..++..++-++ =++ T Consensus 6 M~~vtLee~K~hLRid-------------~dddD~lI~~~i~AA~~~v~~~~~~~~~~~~-~~~p~~ik~AiLllv-~~~ 70 (108) T protein:vir:19 6 LDVISLSLFKQQIEFE-------------EDDRDELITLYAQAAFDYCMRWCDEPAWKVA-ADIPAAVKGAVLLVF-ADM 70 (108) T ss_pred ccccCHHHHHHHcCCC-------------CCcchHHHHHHHHHHHHHHHHHhCCcccccc-cccchHHHHHHHHHH-HHH Confidence 9999999999875421 23467889999999999999999864 3333 356777776666444 445 Q ss_pred hcCC-CCCH-HHHHHHHHHHHHHH--HHhcCccccCCCCCccCCCCCCe Q lcl|NC_018274. 80 HIVL-KEEN-PVYKTAEHLRKLLS--GIANGKLSLALDADGKPAPVANT 124 (138) Q Consensus 80 ~~~~-~~~~-~v~~rY~~Ai~~L~--~va~G~~~L~~~~~~~~~~~~~~ 124 (138) |.+| +..+ +...- .-+..+|. +-=.|+ |.. +.|+ T Consensus 71 YenRE~~~~~~~~~~-~~~~~LL~pYR~~~g~-----~~~-----~~~~ 108 (108) T protein:vir:19 71 FEHRTAQSEVQLYEN-AAAERMMFIHRNWRGK-----AES-----EEGS 108 (108) T ss_pred Hhcccccccchhhhh-HHHHHHHHHHHhcCCC-----CCc-----ccCC Confidence 5444 3322 11111 12222332 222233 211 1111 No 25 >protein:vir:1329 Length: 122 # NCBI annotation: gp37 # Family: family:all:11657 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047928;swissprot:trembl:q9zxb0;genbank:gi:9631146;uniprot:Q9ZXB0;genbank:GeneID:2715909 Probab=86.25 E-value=0.011 Score=31.33 Aligned_cols=115 Identities=13% Similarity=0.114 Sum_probs=67.9 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHHHHHhh Q lcl|NC_018274. 
1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLH 80 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (138) |+|+|.++|.+.-|.+.- .-...+.+..||+...+.+..|.+..++..-.+.|+.++-+...+||-+.- T Consensus 1 mayatieelraldgldds-----------alfsdellsdaidfsvetveaycgrkwdtaedptpetirwcvrtlarqyvl 69 (122) T protein:vir:13 1 MAYATIEELRALDGLDDS-----------ALFSDELLSDAIDFSVETVEAYCGRKWDTAEDPTPETIRWCVRTLARQYVL 69 (122) T ss_pred CcchhhhhhhhhcCccch-----------hhhhhhhhhhhhhhhhhhhhhhhCcccCCcCCCChhHHHHHHHHHHHHHHH Confidence 999999999877664332 233456688999999999999999999999999999999988999997754 Q ss_pred cCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCC---ccC---C-CCCCeeEEecCCccCC Q lcl|NC_018274. 81 IVLKEENPVYKTAEHLRKLLSGIANGKLSLALDAD---GKP---A-PVANTVQISEGRNDWG 135 (138) Q Consensus 81 ~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~---~~~---~-~~~~~~~~~~~~r~f~ 135 (138) +.. .+--+.|+..-. .=|.+.|.-..+ .+. . +.-+...+.- +=.|- T Consensus 70 dhv------sripdralqlqs--efgsiqlaqaggnwrptslpevnaklnlyrvrl-pfifm 122 (122) T protein:vir:13 70 DHV------SRIPDRALQLQS--EFGSIQLAQAGGNWRPTSLPEVNAKLNLYRVRL-PFIFM 122 (122) T ss_pred HHh------hhcchhhhhhhh--cccceeeeccCCCcccCcccccccceeeeeeec-ceeeC Confidence 321 111122222211 125555521111 111 1 1111122221 22232 No 26 >protein:vir:6243 Length: 122 # NCBI annotation: gp37 # Family: family:all:11657 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813697;swissprot:trembl:q859c0;genbank:gi:29366757;uniprot:Q859C0;genbank:GeneID:1258898 Probab=83.33 E-value=0.023 Score=29.56 Aligned_cols=115 Identities=13% Similarity=0.124 Sum_probs=66.7 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHHHHHhh Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLH 80 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (138) |+|+|.++|.+.-|-+. 
+ .-.-.+.+..||+...+.+.-|.++.++..-++.|++++-+...+||-+.- T Consensus 1 mayatieelralegidd----------a-slfpdellsdaidfsvetvevycgqkwdtaenptpevirwcvrtlarqyvl 69 (122) T protein:vir:62 1 MAYATIEELRALEGIDD----------A-SLFPDELLSDAIDFSVETVEVYCGQKWDTAENPTPEVIRWCVRTLARQYVL 69 (122) T ss_pred CccchhhhhHhhccccc----------c-ccchhhhhhhhhhhhhhhhhhhcCcccCCcCCCchHHHHHHHHHHHHHHHH Confidence 99999998886544221 1 223345688999999999999999999999999999999988999997754 Q ss_pred cCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCc------cCC-CCCCeeEEecCCccCC Q lcl|NC_018274. 81 IVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADG------KPA-PVANTVQISEGRNDWG 135 (138) Q Consensus 81 ~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~------~~~-~~~~~~~~~~~~r~f~ 135 (138) +.- .+--+.|+..-. .=|.+.|.-..+. +.. +.-+...+.- +=.|- T Consensus 70 dhv------sripdralqlqs--efgsiqlaqaggtwrptslpevnaklnlyrvrl-pfifm 122 (122) T protein:vir:62 70 DHV------SRIPDRALQLQS--EFGSIQLAQAGGTWRPTSLPEVNAKLNLYRVRL-PFIFM 122 (122) T ss_pred HHh------hhcchhhhhhhh--cccceeeeccCCccccCcCcccccceeeeEeec-ceeeC Confidence 321 111122332211 1255555221110 111 1112222221 22233 No 27 >protein:vir:81159 Length: 95 # NCBI annotation: putative DNA packaging protein # Family: family:all:316 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285813;genbank:gi:148747734;genbank:GeneID:5247202 Probab=80.88 E-value=0.061 Score=27.25 Aligned_cols=92 Identities=13% Similarity=0.056 Sum_probs=63.0 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHHHHHhh Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLH 80 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (138) |.+.|+++++....- | ..-|.+.|+.-|..|...|.+|++.++ .+.|+.++..++-++-++=. 
T Consensus 1 Mm~vtLee~K~~LRI-------D------~d~dD~lI~~li~aA~~~i~~~~g~~~----~~~~~~~~~Avl~lv~~~Ye 63 (95) T protein:vir:81 1 MMIVTLEEVKNWLRV-------D------FSDDDALITTLINAAEEYLKNATGTTF----DATNHLAKIFCMTLIADWYE 63 (95) T ss_pred CCcCCHHHHHHHcCC-------C------CCcchHHHHHHHHHHHHHHHHhhcccc----ccCchHHHHHHHHHHHHHHh Confidence 999999999976432 1 233677899999999999999998654 34566777776666655544 Q ss_pred cCCCC---CHHHHHHHHHHHHHHHHHhcCccc Q lcl|NC_018274. 81 IVLKE---ENPVYKTAEHLRKLLSGIANGKLS 109 (138) Q Consensus 81 ~~~~~---~~~v~~rY~~Ai~~L~~va~G~~~ 109 (138) +|... ...+-.-.+.-|..|+..-.|.-. T Consensus 64 NRe~~~~~~~~~p~~v~sll~~lr~~~~~~~~ 95 (95) T protein:vir:81 64 NRELVGRASDQVRPILQSILAQLTYAYGGETA 95 (95) T ss_pred hccccccccccccHHHHHHHHHhhhccccccC Confidence 44332 123555667777777776666644 No 28 >protein:vir:9821 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795583;genbank:gi:28876338;genbank:GeneID:1257921 Probab=77.56 E-value=0.015 Score=30.51 Aligned_cols=108 Identities=19% Similarity=0.272 Sum_probs=54.3 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccC-Ccccc-----cHHHHHHHHHH Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQL-PLASV-----PTALKRIACGL 74 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~l-Pl~~~-----p~~L~~~~~dI 74 (138) |+|+|.+++...-+. +.+-.+.-+..|+..||.+..-+|.- -+... -.+=+.+|..| T Consensus 6 M~YlT~eey~~l~~~-----------------~~~dF~kllk~As~~ID~~t~~~y~~~d~e~d~~~r~~~vKkA~a~QI 68 (138) T protein:vir:98 6 IAFLTQKEFEDLGFD-----------------DVEDFEKMEKRASHAVNLYCRNRYDYKDLKKEIALVQKAVKRAIAYQI 68 (138) T ss_pred ccccchHHHhccCCC-----------------ChhhHHHHHHHHHHHhhhhhccccccccccchhHHHHHHHHHHHHHHH Confidence 999999988643221 11127888999999999999998853 23222 22334455555 Q ss_pred HHHHhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCC--CC-ccCC-CCCCeeEEecCCccCCCCC Q lcl|NC_018274. 
75 AYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALD--AD-GKPA-PVANTVQISEGRNDWGADW 138 (138) Q Consensus 75 A~Y~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~--~~-~~~~-~~~~~~~~~~~~r~f~rd~ 138 (138) ......+-....+ ..-+.-|.-|+.++... .+ +... +..+...++ ..-.|| T Consensus 69 eY~~~~G~ts~~d---------~~~~~s~svGrTSiS~~~~~~~~s~~~~~~~~~~~s----~~A~~~ 123 (138) T protein:vir:98 69 AYLNDSGVMTAED---------KQSFAGISLGRTSISYTVGHGQGSQQKTLADRFNLC----LDAENE 123 (138) T ss_pred HHHHHcCCcchhh---------ccCcCceEeeeeEeeccccccccccccccccccccc----HHHHHH Confidence 5555443211111 33455677787776411 11 1111 111111111 111223 No 29 >protein:vir:93592 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:1879 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449297;genbank:gi:157166045;uniprot:Q6H9U4;genbank:GeneID:5580414 Probab=74.55 E-value=0.1 Score=26.02 Aligned_cols=93 Identities=14% Similarity=0.062 Sum_probs=52.6 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhh-ccCC-------cccccHHHHHHHH Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGR-YQLP-------LASVPTALKRIAC 72 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~R-Y~lP-------l~~~p~~L~~~~~ 72 (138) |.++|+++++....-+ ..-|.+.|+.-|..|++.|-+||... +..+ -.++|..++..++ T Consensus 2 m~~vtLeevK~hLRId-------------~d~dD~li~~~i~aA~~~v~~~l~~~~~~~~~~~~~~~~~~~~~~i~~AvL 68 (108) T protein:vir:93 2 TALLTLEEIKAHLRVD-------------HDADDDMLMDKVRQATAVLLAYIQGSRDKVIREDGELIPGEALTRMKGAAM 68 (108) T ss_pred CcCCCHHHHHHHcCCC-------------CCcChHHHHHHHHHHHHHHHHHhccccccccccccccccccCChHHHHHHH Confidence 8899999999875421 12367789999999999999999653 2111 1245667777777 Q ss_pred HHHHHHhhcCCCCCHH-HHH-----HHHHHHHHHHHHhcCcccc Q lcl|NC_018274. 73 GLAYANLHIVLKEENP-VYK-----TAEHLRKLLSGIANGKLSL 110 (138) Q Consensus 73 dIA~Y~L~~~~~~~~~-v~~-----rY~~Ai~~L~~va~G~~~L 110 (138) -++-|+=-+|...++. +.. 
-.+.-+.-+++ =.+ | T Consensus 69 lLv~~~YenRe~~~~~~~~~~elP~~v~~Ll~~~R~---p~~-~ 108 (108) T protein:vir:93 69 RLTGMLYRNPDLAEREELLQGELPFSVSVLIYDLRC---PTV-L 108 (108) T ss_pred HHHHHHHhccccccccccccccCCHHHHHHHHHccc---ccc-C Confidence 6665554444433221 111 11222222222 111 1 No 30 >protein:vir:2345 Length: 125 # NCBI annotation: gp15 # Family: family:all:2817 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075282;genbank:gi:12657869;genbank:GeneID:920134 Probab=74.08 E-value=0.068 Score=26.97 Aligned_cols=117 Identities=10% Similarity=-0.032 Sum_probs=63.2 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhcc-CC-----cccccHHHHHHHHH Q lcl|NC_018274. 1 MS-YCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQ-LP-----LASVPTALKRIACG 73 (138) Q Consensus 1 M~-Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~-lP-----l~~~p~~L~~~~~d 73 (138) |+ |+|.+|+.++++.. |++ ...+.|+.-|++|+..|- .|++ |+ ...-+..++.++++ T Consensus 1 ma~~A~~eDV~a~w~R~----lt~--------eE~~~V~~~L~~ae~~ir----rriPdL~~r~~~~~~~~~~v~~V~a~ 64 (125) T protein:vir:23 1 MATLATHEDVTAFWART----PTA--------EEIVLINRRLAQAERMLL----RAIPELLIKASSDPVFRAEVIDIEAE 64 (125) T ss_pred CCcccCHHHHHHHhCCC----CCH--------HHHHHHHHHHHHHHHHHH----HhcCChhhhhcCCCcchhhHHHHHHH Confidence 87 99999999998632 221 245678888999999886 3332 21 23446778888887 Q ss_pred HHHHHhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCc-cCCCCCCeeEEecCCccCC Q lcl|NC_018274. 
74 LAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADG-KPAPVANTVQISEGRNDWG 135 (138) Q Consensus 74 IA~Y~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~-~~~~~~~~~~~~~~~r~f~ 135 (138) ..+=.+.+..+-...-.-.|-.-+.+ .++.|++-|..+.-+ =.++-++...+......=+ T Consensus 65 ~V~Rv~rnPeGy~seT~g~Yt~~l~~--~~~~g~L~it~~E~a~Lg~~~s~~~vi~p~~~~p~ 125 (125) T protein:vir:23 65 AVLRLVRNHEGYLSETDGNYTYMLQA--QDPNRKLEILPEEWEVLGIVRSGLGILVPTVVLPS 125 (125) T ss_pred HHHHHhcCCCCccccccchhhhhhhc--cCCCCceeecHHHHHhhccccccceEEeeceecCC Confidence 66554433332211112456666665 567788765322110 0111123333333322222 No 31 >protein:vir:100245 Length: 113 # NCBI annotation: gp74 # Family: family:all:363 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355410;genbank:gi:77864700;genbank:GeneID:3725967 Probab=68.75 E-value=0.23 Score=24.05 Aligned_cols=95 Identities=13% Similarity=0.054 Sum_probs=53.4 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhc-cCCc---------------cccc Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRY-QLPL---------------ASVP 64 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY-~lPl---------------~~~p 64 (138) |+++|+++++....-+ ...|.+.|+.-|..|++.+-.||+.++ ..+. ..+| T Consensus 1 M~~vtLee~K~hLRvd-------------~d~dD~lI~~li~AA~~~ve~~l~r~l~~~~~~~~~~~~~~~~~~~~~~~p 67 (113) T protein:vir:10 1 MALVELKLALGFVRAN-------------AGVEDDVVQMLLDAATQSAVDYLNRQVFETEDAMTTAIEAGTAGQNPMVVN 67 (113) T ss_pred CCCCCHHHHHHHcCCC-------------CCcchHHHHHHHHHHHHHHHHHhCccccccccccccccccccccccccccC Confidence 9999999999875521 123678899999999999999998763 2211 1357 Q ss_pred HHHHHHHHHHHHHHhhcCC-CCCH-HHHHHHHHHHHHHHHHhcCccccCC Q lcl|NC_018274. 65 TALKRIACGLAYANLHIVL-KEEN-PVYKTAEHLRKLLSGIANGKLSLAL 112 (138) Q Consensus 65 ~~L~~~~~dIA~Y~L~~~~-~~~~-~v~~rY~~Ai~~L~~va~G~~~L~~ 112 (138) +.++..+.-++- ++|..| .... ...+-=--+..+|...+. 
--|+ T Consensus 68 ~~i~~AvLllv~-~~Y~nRe~~~~~~~~~lP~~v~~Ll~~yR~---~~g~ 113 (113) T protein:vir:10 68 AAIRAAILKITA-ELYANREDTAFGPITELPLNARALLRPHRI---IPGV 113 (113) T ss_pred hHHHHHHHHHHH-HHHhhhhhhchhhhhccCHHHHHHHHHhhh---hcCC Confidence 777776664444 445433 3221 111110012222222221 1122 No 32 >protein:vir:100103 Length: 120 # NCBI annotation: gp7 # Family: family:all:363 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945037;genbank:gi:38707897;genbank:GeneID:2744150 Probab=66.67 E-value=0.22 Score=24.21 Aligned_cols=95 Identities=11% Similarity=0.055 Sum_probs=54.2 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhcc-CC---------------ccccc Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQ-LP---------------LASVP 64 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~-lP---------------l~~~p 64 (138) |+.+|+++++....-+ ..-|.+.|+.-|..|.+.|-.|++..+. .. ...+| T Consensus 5 m~~vtL~e~K~hLRvd-------------~d~DD~lI~~~i~AA~~~v~~~~~r~l~~~~~~~~~~~~~~~~~~~~~~~~ 71 (120) T protein:vir:10 5 TPIVSLEVALAHLRED-------------AGVADDLIKIYIGAATQSASDYVDRKLYANDAEMQAAVADATAGADPIVAN 71 (120) T ss_pred CCccCHHHHHHHcCCC-------------CCcchHHHHHHHHHHHHHHHHHhCCcccccccccchhhhccccccccccCC Confidence 9999999999875431 2346788999999999999999987641 10 11257 Q ss_pred HHHHHHHHHHHHHHhhcCCCC-----CHHHHHHHHHHHHHHHHHhcCccccCC Q lcl|NC_018274. 65 TALKRIACGLAYANLHIVLKE-----ENPVYKTAEHLRKLLSGIANGKLSLAL 112 (138) Q Consensus 65 ~~L~~~~~dIA~Y~L~~~~~~-----~~~v~~rY~~Ai~~L~~va~G~~~L~~ 112 (138) +.++..++-++ -++|..|.. .....+--.-+-..|...+. 
..|+ T Consensus 72 ~~i~~AvLllv-g~~YenRe~~~~~~~~~~~~lP~~v~~Ll~~yR~---~~gv 120 (120) T protein:vir:10 72 DAIRAAILLTI-GKLYAFREDVVSGASASVTELPSGAKSLLFPYRV---GLGV 120 (120) T ss_pred HHHHHHHHHHH-HHHHhchhhhhhcccccccccCHHHHHHHHHhhh---ccCC Confidence 77777666444 455544421 11111211123333433221 1222 No 33 >protein:vir:80389 Length: 172 # NCBI annotation: BcepGomrgp09 # Family: family:all:703 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210229;genbank:gi:146329921;genbank:GeneID:5123498 Probab=64.70 E-value=0.28 Score=23.62 Aligned_cols=112 Identities=13% Similarity=0.133 Sum_probs=55.6 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHH----Hhhhc------------------cC Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLH----LHGRY------------------QL 58 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgy----L~~RY------------------~l 58 (138) =+|+|.+++.+.+..+ ..+.+.+-.+.+|..|+.-||+| .+.|- .+ T Consensus 15 nSYvt~~~a~aY~~~r------------g~~~~~d~~e~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~g~~~ 82 (172) T protein:vir:80 15 NTYAGADFVIAYAQAR------------GVTVDADEAERLILEAMDYIESFRRRWKGERNTREQGLTWPRHDAVVDGFVI 82 (172) T ss_pred cccccHHHHHHHHHHc------------CCCcCHHHHHHHHHHHHHHHhhccCccccccCCccccccccccCcccCcccc Confidence 5799999998776432 12333445799999999999993 33321 24 Q ss_pred CcccccHHHHHHHHHHHHHHhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcc----CC--------------- Q lcl|NC_018274. 59 PLASVPTALKRIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGK----PA--------------- 119 (138) Q Consensus 59 Pl~~~p~~L~~~~~dIA~Y~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~----~~--------------- 119 (138) |-..+|..|+..||-+|.+.+......+. .. ..++ .-++| |.++..-..... 
.+ T Consensus 83 ~~~~IP~~v~~A~~elA~~~~~g~~~~~~-~~---~~~v-~~ekV--G~i~~eY~~~~~~~~~~~~~~~~~~~~~v~~LL 155 (172) T protein:vir:80 83 PSDVIPKELQSAVAAAVIEQVNGFELQQS-QD---QWAV-RIEKV--DVIEVQYAAGGGGQSASANAPMKPTFPKIDALL 155 (172) T ss_pred cccchhHHHHHHHHHHHHHHhcCCccCcC-CC---Ccee-eEEec--cceEEeeecccCccccccccCCccchHHHHHHH Confidence 56678999999999999765543211111 00 0011 11222 333321110000 00 Q ss_pred ----CCCCeeEE-ecCC Q lcl|NC_018274. 120 ----PVANTVQI-SEGR 131 (138) Q Consensus 120 ----~~~~~~~~-~~~~ 131 (138) .++++..+ .=++ T Consensus 156 ~p~l~~~gg~~~~~vrg 172 (172) T protein:vir:80 156 NPLLVGDGGLFLVAVRG 172 (172) T ss_pred hhhhcCCCCeeeeeecC Confidence 00011110 0111 No 34 >protein:vir:106583 Length: 105 # NCBI annotation: putative protein # Family: family:all:6481 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958586;genbank:gi:41179246;genbank:GeneID:2717119 Probab=61.91 E-value=0.34 Score=23.12 Aligned_cols=91 Identities=13% Similarity=0.065 Sum_probs=54.7 Q ss_pred CCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHHHHHhhcC Q lcl|NC_018274. 3 YCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLHIV 82 (138) Q Consensus 3 Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~~~ 82 (138) +...+.+...+ ..|.-. .++.+..+++..|++|.+.|=+|+.+ ..+|..|..+++++|..+.... T Consensus 1 ~~~~~~~~e~i-----k~L~~~----~d~~~DelL~~lieda~~~vl~y~nr------~~ip~~l~~~v~evav~~fNR~ 65 (105) T protein:vir:10 1 MLNVDQLTEIV-----SALSTR----LENVNNALLTELVKESIAQVLDYTGQ------KKLVGSMDIYVKKLAVINYNRL 65 (105) T ss_pred CCchHHHHHHH-----HHHhcc----CCCchhHHHHHHHHHHHHHHHHHcCC------cccchhHHHHHHHHHHHHhccc Confidence 22222232221 112211 23567789999999999999999874 3678899999999888775422 Q ss_pred C--CC------------CHHHHHHHHHHHHHHHHHhcCcc Q lcl|NC_018274. 83 L--KE------------ENPVYKTAEHLRKLLSGIANGKL 108 (138) Q Consensus 83 ~--~~------------~~~v~~rY~~Ai~~L~~va~G~~ 108 (138) . +. ....-+-|.+.|+--++-.-|+. 
T Consensus 66 G~EG~tS~SegGvS~sy~~~~~~~~~~~l~~yR~~~v~~~ 105 (105) T protein:vir:10 66 GIEGETQRSEGGITNYLETGIPKDIRQGLNSYRIAKVKKL 105 (105) T ss_pred CCcccceeecCCeeeeeeccCcHHHHHHHHHHhhhcccCC Confidence 1 11 11233456666666565555665 No 35 >protein:vir:4788 Length: 130 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150168;swissprot:trembl:q94m43;genbank:gi:15088779;uniprot:Q94M43;genbank:GeneID:955972 Probab=56.57 E-value=0.17 Score=24.86 Aligned_cols=104 Identities=19% Similarity=0.282 Sum_probs=49.2 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhcc--CCcc----cccHHHHH-HHHH Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQ--LPLA----SVPTALKR-IACG 73 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~--lPl~----~~p~~L~~-~~~d 73 (138) |+|+|.+++.+.-|. +++-.+.-+..|+..||.+.+.+|. .-+. .+-..++. +|.. T Consensus 1 M~YlT~eey~el~~~-----------------~~~~F~kl~k~A~~~ID~~t~~~y~~~~~~~~~~~~r~~~vK~A~a~Q 63 (130) T protein:vir:47 1 MTYLTQEEFDELDFD-----------------EVTDFEKLAKRAKIAIDLYTNGIYQKDIDFEKEIAYRKSAVKLAMAFQ 63 (130) T ss_pred CCCCchhhHhhcCCC-----------------ChhhHHHHHHHHHHHHHHHhcccccccCCccCcchHHHHHHHHHHHHH Confidence 999999999854332 1112778899999999999998884 2222 22223333 3334 Q ss_pred HHHHHhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCccCCCCCCeeEEecCCccCCCC---C Q lcl|NC_018274. 74 LAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEGRNDWGAD---W 138 (138) Q Consensus 74 IA~Y~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~f~rd---~ 138 (138) |.-....+-....+ ..-..-|.-|+.++.-...+....+. +. 
.++-| | T Consensus 64 ieY~~~~G~~s~~~---------~~~~~S~svGrtSis~~~~~~~~~~~-------~~-~vs~da~~~ 114 (130) T protein:vir:47 64 IAYLDASGIMSADD---------KQLANSVSIGRTSISYSTSQSTLAGQ-------RF-NLSMDAENA 114 (130) T ss_pred HHHHHHhccccchh---------ccCcceeeecceeeecCcCccccccC-------Cc-cccHHHHHH Confidence 43333322111111 22233455555555321111111111 11 11222 3 No 36 >protein:vir:94955 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239280;genbank:gi:66392062;genbank:GeneID:5076600 Probab=55.52 E-value=0.065 Score=27.10 Aligned_cols=121 Identities=14% Similarity=0.073 Sum_probs=63.7 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHH---Hhhh------------------ccCC Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLH---LHGR------------------YQLP 59 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgy---L~~R------------------Y~lP 59 (138) =+|+|.+|..+-+...-. . ......|.+..+.+|..|+.-||+- ++.| ..+| T Consensus 14 nSYvtv~ea~aY~~~r~~---~----~~w~~~~~~~~e~aL~~A~dyId~~~~f~G~r~~~~Q~l~wPR~g~~~dg~~~~ 86 (170) T protein:vir:94 14 NSYVTVAEANSYFDGSYG---R----PLWTSASEDEKASLVISASRYLDQMMAWIGAPTNPEQSMWWPCKNAVIGGMTLS 86 (170) T ss_pred cceecHHHHHHHHHhhcc---c----cccCCCCHHHHHHHHHHHHHHhccccccccccCCcchhhcccccCcccCccccc Confidence 679999999876544321 1 1123567888999999999999972 3322 1235 Q ss_pred cccccHHHHHHHHHHHHHHhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCccCCCCCCeeEEecC-CccCCCCC Q lcl|NC_018274. 60 LASVPTALKRIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEG-RNDWGADW 138 (138) Q Consensus 60 l~~~p~~L~~~~~dIA~Y~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~-~r~f~rd~ 138 (138) -..+|..|+..||-+|.+.+.+....+. . +.+++ -++| |.++..-....+....... 
+..= ..+.+.-| T Consensus 87 ~~~IP~~V~~Aq~elA~~~~~~~~~~~~--~---~~~v~-~~kV--G~i~veY~~~~~~~~~~~~--v~~LL~p~l~~~~ 156 (170) T protein:vir:94 87 QVSIPVKVKIAVFELAYFMLESGAALSF--A---DQTID-SVKV--GTIRVEFTKNSTDAGLPTF--VEAMLSGFGSPVL 156 (170) T ss_pred cchhhHHHHHHHHHHHHHHHhCcccCcc--c---cccee-eEec--ceeEEEecCCCCCCccHHH--HHHHhhhhhcccc Confidence 5678999999999999988854332221 1 11121 2334 6665543211111110000 0000 01111111 No 37 >protein:vir:97267 Length: 172 # NCBI annotation: hypothetical protein ORF024 # Family: family:all:703 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294532;genbank:gi:149408253;genbank:GeneID:5237132 Probab=47.82 E-value=0.23 Score=24.12 Aligned_cols=126 Identities=11% Similarity=0.112 Sum_probs=59.6 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHH---Hhhhc------------------cCC Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLH---LHGRY------------------QLP 59 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgy---L~~RY------------------~lP 59 (138) =+|.|.+++.+.+...-. .+ ...+.+-.+.+|..|+.-||+- .+.|= .+| T Consensus 16 nSYvtv~~a~aY~~~rg~-~~--------~a~~~~~ke~aLi~A~~yiD~~~~f~G~r~~~~~Q~l~WPRtg~~d~~~~~ 86 (172) T protein:vir:97 16 NAYISVEEFKTYHTDRGN-SF--------AGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYY 86 (172) T ss_pred cccccHHHHHHHHHhcCc-cc--------CCCCcHHHHHHHHHHHHHHhhhhhcccCCCCCcchhhhcccCCCCCCcccc Confidence 679999999987655421 11 1122334778999999999973 33331 124 Q ss_pred cccccHHHHHHHHHHHHHHhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCccCCCC-CCe---------eEEec Q lcl|NC_018274. 60 LASVPTALKRIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPV-ANT---------VQISE 129 (138) Q Consensus 60 l~~~p~~L~~~~~dIA~Y~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~-~~~---------~~~~~ 129 (138) ...+|.-|+..||-+|.+-|.....++...... ...-..|.+.-|.++..--..+..... ... .-... 
T Consensus 87 ~~~IP~~v~~A~~elA~~al~~~l~~d~~~~~~--~~~v~~kr~kvg~i~~~y~~~~~~~~~~p~~~~v~aLL~p~gl~~ 164 (172) T protein:vir:97 87 INDIPPEVKEACAEYALRALAAELNPDPERNAS--GVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVR 164 (172) T ss_pred cccccHHHHHHHHHHHHHHHhcccccccccccc--cccceeeeeeecceeeEeeccCCCCCccccHHHHHHHHhhhcccc Confidence 456799999999999998876543221100000 000001222224443321110000000 000 00111 Q ss_pred CCccCCCC Q lcl|NC_018274. 130 GRNDWGAD 137 (138) Q Consensus 130 ~~r~f~rd 137 (138) ++-.|-|- T Consensus 165 ~~~~~~r~ 172 (172) T protein:vir:97 165 SGGTLLRG 172 (172) T ss_pred CcceeccC Confidence 22233333 No 38 >protein:vir:79701 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285884;genbank:gi:148750841;genbank:GeneID:5220403 Probab=37.54 E-value=0.23 Score=24.05 Aligned_cols=112 Identities=13% Similarity=0.073 Sum_probs=57.2 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhh---ccC-----C----c-cccc-- Q lcl|NC_018274. 1 MS-YCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGR---YQL-----P----L-ASVP-- 64 (138) Q Consensus 1 M~-Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~R---Y~l-----P----l-~~~p-- 64 (138) |. |+|.+++...-| ..++.+..+.-+..|+..||.+..-. |.- + + ...+ T Consensus 1 ~~pYLTy~ef~~lg~---------------~~~~~d~F~kllk~A~~~ID~~T~y~~~~y~~~~i~~d~~~d~~~~~~~r 65 (144) T protein:vir:79 1 MKPYLTTSDFEKLGY---------------ELKKPDNFGKLLKSATVLINQICSYYDPAFAYHDLEADSQADPDSYLFRQ 65 (144) T ss_pred CCcccchhhhhhhCC---------------CCcchhhhhhHHHHHHHHhhhhhhhhccccccccccccccccchhhhhHH Confidence 76 999888854322 13345568888999999999987654 321 1 1 1111 Q ss_pred -HHHH-HHHHHHHHHHhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCccCCCCCCeeEEecCCccCCCCC Q lcl|NC_018274. 65 -TALK-RIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQISEGRNDWGADW 138 (138) Q Consensus 65 -~~L~-~~~~dIA~Y~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~~~~r~f~rd~ 138 (138) ..++ .+|..|.-+.-.+. 
...|+-+-.+++-+.-|+.++.....+.+....++..+. ..-.+| T Consensus 66 ~~~vKkA~a~QIeY~~~~G~-------~sa~e~~~~~~~S~svGrtsvs~~~~~~~s~t~~~~~v~----~~a~~y 130 (144) T protein:vir:79 66 AMAFKKAVALEMLFLEDSGY-------SSAYDVAQGALNSFTVGHTSMSLNPSAGQNLTVGSTGVV----KSAYDL 130 (144) T ss_pred HHHHHHHHHHHHHHHHHcCC-------cchhhhhcCccceeEecceEEeecCCCcccccccccccc----HHHHHH Confidence 2223 34444444333322 223444556777788888776543322222222111111 122344 No 39 >protein:vir:95176 Length: 172 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:703 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293420;genbank:gi:148912841;genbank:GeneID:5228251 Probab=33.44 E-value=0.33 Score=23.24 Aligned_cols=116 Identities=12% Similarity=0.059 Sum_probs=57.3 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHH----Hhhh------------------ccC Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLH----LHGR------------------YQL 58 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgy----L~~R------------------Y~l 58 (138) =+|+|.+++.+.+..+- . ....|.+..+.+|..|+.-||+| ++.| ..+ T Consensus 17 nSYvtv~ea~aY~~~rg--------~--~~~~~~~~ke~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~~~~v 86 (172) T protein:vir:95 17 NSYVSVADARIYASNRG--------V--ELPLDDDELAAMLIRSTDYLEAQACRFQGKPTSTTQALQWPRTGVFLNEDEV 86 (172) T ss_pred cccccHHHHHHHHHhcC--------C--cCCCChHHHHHHHHHHHHHhhccCCceeeeecCCcccccCCcCCcccCcccc Confidence 57999999998765431 0 12246667899999999999985 2221 123 Q ss_pred CcccccHHHHHHHHHHHHHHhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCccC------------------CC Q lcl|NC_018274. 59 PLASVPTALKRIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKP------------------AP 120 (138) Q Consensus 59 Pl~~~p~~L~~~~~dIA~Y~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~------------------~~ 120 (138) |-..+|..|+..||-+|.+.+....-. +...+ +..+ --++| |.++..-...+.. +. 
T Consensus 87 ~~~~IP~~V~~A~~elA~~~~~~~~~~--~~~~~-~~~v-k~~kV--G~I~veY~~~~~~~~~~~~~~v~~LL~p~l~~~ 160 (172) T protein:vir:95 87 PSNVIPKSLIAAQVQLTMAINAGFDLQ--PNVSP-QDYV-TREKV--GPIETEYADPLSVGIMPTFTAANALLAPLFGEC 160 (172) T ss_pred cccchhHHHHHHHHHHHHHHHcCcccc--ccCCc-ccce-eEEec--cceEEeeccCCCCCCcccHHHHHHHHhhhhccc Confidence 556789999999999997555432100 00000 0011 01112 4443321110000 11 Q ss_pred CCCeeEEecCCcc Q lcl|NC_018274. 121 VANTVQISEGRND 133 (138) Q Consensus 121 ~~~~~~~~~~~r~ 133 (138) +++++.|..- |+ T Consensus 161 ~~~~~~~r~~-r~ 172 (172) T protein:vir:95 161 ASNKFALRTI-RV 172 (172) T ss_pred CCcceeeEEE-eC Confidence 1122222110 11 No 40 >protein:vir:5742 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892053;genbank:gi:33770516;uniprot:Q7Y407;genbank:GeneID:2637465 Probab=28.55 E-value=1.7 Score=19.27 Aligned_cols=96 Identities=9% Similarity=-0.048 Sum_probs=51.7 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhh-ccC----Cccc-------ccHHHH Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGR-YQL----PLAS-------VPTALK 68 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~R-Y~l----Pl~~-------~p~~L~ 68 (138) |+.+|+++++.+..- .. ..+-+.+.++.-+..|.+.+..|+..+ |.. |-.+ .+.-++ T Consensus 1 m~mitLeeiK~hlRi------d~-----D~~~eD~lL~~y~~AA~~~~e~~~~rkLy~~~~~~~~~p~~~~gl~~~~di~ 69 (110) T protein:vir:57 1 MGMTSLSNVKTQLRL------EE-----DFTEHDDFIESLIDAAQRSIERTYYCVLVDSQEALEKLPEGVRGFLIEPDTQ 69 (110) T ss_pred CCCCCHHHHHHHcCC------CC-----CCChhHHHHHHHHHHHHHHHHHHhCCcccCCccccccCCCCCCccccCHHHH Confidence 999999999976442 11 123467789999999999999999887 532 2111 344455 Q ss_pred HHHHHHHHHHhhcCCCCCH-HHHHHHHHHHHHHHH-HhcCcc Q lcl|NC_018274. 69 RIACGLAYANLHIVLKEEN-PVYKTAEHLRKLLSG-IANGKL 108 (138) Q Consensus 69 ~~~~dIA~Y~L~~~~~~~~-~v~~rY~~Ai~~L~~-va~G~~ 108 (138) ..|.-++-++=-+|..... ....- --+.+||-. 
+.+-.+ T Consensus 70 ~A~Lllv~hwYeNREav~~~~~~~~-P~~v~~Ll~P~~~~~~ 110 (110) T protein:vir:57 70 LAARMMVAQWYLNPKGTSPDGDTPA-QLGVEYLLFPLMEHTV 110 (110) T ss_pred HHHHHHHHHHHhcccccccccccch-hHHHHHHHHHHHhhcC Confidence 5555433333323333211 01111 223333322 444443 No 41 >protein:vir:99002 Length: 158 # NCBI annotation: gp31 # Family: family:all:32652 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655896;genbank:gi:109521468;genbank:GeneID:4157967 Probab=28.14 E-value=1.8 Score=19.22 Aligned_cols=124 Identities=7% Similarity=0.010 Sum_probs=62.7 Q ss_pred CC-CCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHH---HHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHHH Q lcl|NC_018274. 1 MS-YCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRA---IADADSEIDLHLHGRYQLPLASVPTALKRIACGLAY 76 (138) Q Consensus 1 M~-Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~A---l~~A~~~idgyL~~RY~lPl~~~p~~L~~~~~dIA~ 76 (138) |+ |+|++||.+++-. .| + +--..+..+| |+++|++.--+=+++..+| ..+|..++.+|..-|+ T Consensus 1 ~~alasvee~~trl~~-~l--------p---~~~~r~~a~a~~vLd~~S~~ar~~~gr~W~~~-~daP~~vr~ivL~aa~ 67 (158) T protein:vir:99 1 MAALVSVEEFTTFLRV-PL--------P---EEGSEKYTQMEFLLTLASDWARELSCKPWLLP-ADAPVTARGIILAASR 67 (158) T ss_pred CcceeeHhhhhhhhcc-cC--------C---hhhhHHHHHHHHHHHHHHHHHHHhcCccCCCC-CcchhHHHHHHHHHHH Confidence 77 9999999999721 10 0 0012233344 8999988877766655544 4679999999998887 Q ss_pred HHhhcCCCC------CHHH-------H--HHHHHHHHHHHHHhcCc---cccCCCCCccCCCCCCeeEEecCC---ccCC Q lcl|NC_018274. 77 ANLHIVLKE------ENPV-------Y--KTAEHLRKLLSGIANGK---LSLALDADGKPAPVANTVQISEGR---NDWG 135 (138) Q Consensus 77 Y~L~~~~~~------~~~v-------~--~rY~~Ai~~L~~va~G~---~~L~~~~~~~~~~~~~~~~~~~~~---r~f~ 135 (138) =.+.+-.+- +..+ . =-+++-++.|++...-+ -+++.-- +..-...+-+-+...+ ++|. 
T Consensus 68 R~~~NP~g~~~~~~G~~~~~~~~~g~~~~ffT~~E~~~L~r~~~s~GG~~~~~ttR-~d~~~~~~yv~v~~~GdpfP~~~ 146 (158) T protein:vir:99 68 REWNNPKRVSYVVKGPQSATFMQSAYPPGFFTDAEEAKLRSYGRSTGNWGVIETYR-DDEEQLNGYLEVYPHGGLMPVYH 146 (158) T ss_pred HHHhcCCceEEeeecchhhhcccccCCCcccCHHHHHHHHHhhcccCceeEEEeec-CccccCCceecccCCCCcccccC Confidence 766433210 0000 0 12345566666663222 1221100 0111111222222222 4444 Q ss_pred -CCC Q lcl|NC_018274. 136 -ADW 138 (138) Q Consensus 136 -rd~ 138 (138) -|| T Consensus 147 ~~d~ 150 (158) T protein:vir:99 147 PDDI 150 (158) T ss_pred cccc Confidence 455 No 42 >protein:vir:94507 Length: 113 # NCBI annotation: putative DNA packaging protein # Family: family:all:372 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223891;genbank:gi:62327103;genbank:GeneID:5075523 Probab=24.77 E-value=2.1 Score=18.86 Aligned_cols=98 Identities=10% Similarity=0.073 Sum_probs=56.6 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHHHHHhh Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLH 80 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (138) |+ ++++++.+.|- +| .....+++..|.+|+..+-.||+ +|.+-...+|+-|.-+.+++|..+.. T Consensus 1 M~--~L~~vK~~lgi------~d-------~~~D~lL~~iI~~a~~~i~~~l~-~~~~~~~~iP~~l~~Iv~evavkryN 64 (113) T protein:vir:94 1 MA--LLDSIKLRIGI------ED-------TKQDDLLTDIISDVQARVLAYVN-QDGLVQSELPNGLDFVIKDVTIRIYN 64 (113) T ss_pred Cc--hHHHHHHHhCC------CC-------CchhhHHHHHHHHHHHHHHHHhC-CccchhhhhhhHHHHHHHHHHHHHhc Confidence 54 56677766653 22 11235799999999999999998 46555678899999999999988764 Q ss_pred cCC--CC----CHH----H-----HHHHHHHHHHHHHHhcCccccCCCCCccCCCCCCeeEEe Q lcl|NC_018274. 81 IVL--KE----ENP----V-----YKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQIS 128 (138) Q Consensus 81 ~~~--~~----~~~----v-----~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~ 128 (138) ... +. .+. . -+-|++-| ++-.++.-. 
...++.|- T Consensus 65 R~g~EG~~S~SeeG~S~sf~~~~df~~y~~~l---~~~~~~~~~-----------~~~g~rF~ 113 (113) T protein:vir:94 65 KIGDEGKESSSEGNVSNTWDTPADLSEYSDVL---DVYRKSYKR-----------RSAGMRFI 113 (113) T ss_pred ccCCccceeeecCceeeeecCccchhhHHHHH---HHHHhhccC-----------CCCCceeC Confidence 332 11 010 1 12333333 333333211 11122333 No 43 >protein:vir:95004 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224025;genbank:gi:62327312;genbank:GeneID:5176832 Probab=21.03 E-value=2.7 Score=18.25 Aligned_cols=116 Identities=12% Similarity=0.104 Sum_probs=60.4 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHH----Hhhh------------------ccC Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLH----LHGR------------------YQL 58 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgy----L~~R------------------Y~l 58 (138) =+|.|.+++.+.+..+-. ....|..-.+.+|..|+.-||+| .+.| ..+ T Consensus 15 nSYvt~~ea~aY~~~rg~----------~~~~dd~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~g~~~ 84 (169) T protein:vir:95 15 DSYVSLEDGRALAAKYGL----------ELPEDDIAAEASLRNGAVYVGLFESQMCGRRVSANQALAFPRTGIDLHGFPQ 84 (169) T ss_pred cccccHHHHHHHHHHcCC----------cCCCCHHHHHHHHHHHHHHhhccccccccccCCcchhhccccCCceeccccc Confidence 569999999987654311 11235667899999999999983 3331 124 Q ss_pred CcccccHHHHHHHHHHHHHHhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCccCC-----------------CC Q lcl|NC_018274. 59 PLASVPTALKRIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPA-----------------PV 121 (138) Q Consensus 59 Pl~~~p~~L~~~~~dIA~Y~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~-----------------~~ 121 (138) |...+|..++..||-+|.+.+.+.......-..+ + .+..-.|.++..-...+... 
.+ T Consensus 85 ~~~~IP~~V~~A~~elA~~~~~g~~~~~~~~~~~----v--~~e~v~G~i~veY~~~~~~~~~~~~~a~~~LL~p~l~g~ 158 (169) T protein:vir:95 85 PSNVIPSLVIQAQVMAAVEYGAGTDVRGSTDGRE----V--QTERVEGAVTVSYFKNGYSGGTVSITAADDALRPLLCGS 158 (169) T ss_pred ccccchHHHHHHHHHHHHHHHcCccccCCCCccc----e--eeeeeccceeEeecCCCCcCccccHHHHHHhhhhhcccC Confidence 5678899999999999999986432111100001 0 00011255444321111110 00 Q ss_pred CCeeEEecCCccCCC Q lcl|NC_018274. 122 ANTVQISEGRNDWGA 136 (138) Q Consensus 122 ~~~~~~~~~~r~f~r 136 (138) ++.+.| ++|.- T Consensus 159 ~g~~~i----~~~rg 169 (169) T protein:vir:95 159 NNAYSF----NVFRG 169 (169) T ss_pred CCccee----eeecC Confidence 111111 12211 No 44 >protein:vir:78383 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110841;genbank:gi:134288602;genbank:GeneID:5179646 Probab=20.58 E-value=2.7 Score=18.18 Aligned_cols=116 Identities=13% Similarity=0.100 Sum_probs=58.7 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHH----Hhhh------------------ccC Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLH----LHGR------------------YQL 58 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgy----L~~R------------------Y~l 58 (138) =+|+|.+++.+.+..+-. ....|....+.+|..|+.-||+| ++.| ..+ T Consensus 15 nSYvtv~~a~aY~~~rg~----------~~~~d~~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~g~~~ 84 (169) T protein:vir:78 15 DSYVSLEDGRALAAKYGL----------ELPEDDTAAEAALRNGAVYVGLFESQMCGRRVSANQALAFPRTGVTLHGFPQ 84 (169) T ss_pred cccccHHHHHHHHHHcCC----------cCCCChHHHHHHHHHHHHHhhhccccceeeeCCcccccccccCCceeccccc Confidence 469999999987654321 11235677999999999999974 3322 134 Q ss_pred CcccccHHHHHHHHHHHHHHhhcCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCccCC-----------------CC Q lcl|NC_018274. 
59 PLASVPTALKRIACGLAYANLHIVLKEENPVYKTAEHLRKLLSGIANGKLSLALDADGKPA-----------------PV 121 (138) Q Consensus 59 Pl~~~p~~L~~~~~dIA~Y~L~~~~~~~~~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~-----------------~~ 121 (138) |...+|.-++..||-+|.+.+......+..-.++. .++--.|.+...-...+... .+ T Consensus 85 ~~~~IP~~v~~A~~elA~~~~~g~~~~~~~~~~~v------~~e~v~G~i~veY~~~~~~~~~~~~~~~~~LL~p~l~~~ 158 (169) T protein:vir:78 85 PSNVIPPLVIQAQVMAAVEYGAGTDVRGSTDGREV------QTERVEGAVTVSYFKNGYSGGTVSITTADDALRPLLCGS 158 (169) T ss_pred ccccchHHHHHHHHHHHHHHhcCcccCCCCCccee------EEEEecCceeEeecCCCCCCCcccHHHHHHHhhhhcccC Confidence 55678999999999999988753321110000000 00000133333211111000 00 Q ss_pred CCeeEEecCCccCCC Q lcl|NC_018274. 122 ANTVQISEGRNDWGA 136 (138) Q Consensus 122 ~~~~~~~~~~r~f~r 136 (138) ++.+.| ++|.- T Consensus 159 ~g~~~i----~~~rg 169 (169) T protein:vir:78 159 NNAYSF----NVFRG 169 (169) T ss_pred CCccee----eeecC Confidence 111111 12211 No 45 >protein:vir:3970 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663678;genbank:gi:21716115;genbank:GeneID:951203 Probab=20.48 E-value=1.8 Score=19.23 Aligned_cols=96 Identities=9% Similarity=0.084 Sum_probs=53.8 Q ss_pred CCCCCHHHHHHhcCHHHHHHHhcCCCcCcCccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHHHHHhh Q lcl|NC_018274. 1 MSYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLH 80 (138) Q Consensus 1 M~Y~T~~Dl~~~~g~~el~~Ltd~~~~~~~~~d~~~v~~Al~~A~~~idgyL~~RY~lPl~~~p~~L~~~~~dIA~Y~L~ 80 (138) |+. +++++.+.|.. +...++..|.+|...|-.||... ...+|..|--+..++|..+.. 
T Consensus 1 M~i--L~~vK~~lgi~----------------~D~lL~~li~~a~~~i~~~l~~~----~~~iP~~l~~iv~evav~ryN 58 (110) T protein:vir:39 1 MAI--TDDLKKLLGGS----------------SDERLEVIEKRTRERLLLILSSN----IKEVPPELEYVVLDVSLKRFN 58 (110) T ss_pred Cch--HHHHHHhcCCC----------------hhHHHHHHHHHHHHHHHHHhCCC----hhhhhhHHHHHHHHHHHHHhc Confidence 654 67777776631 24579999999999999999842 346789999999998887754 Q ss_pred cCC--CC----CH--------HHHHHHHHHHHHHHHHhcCccccCCCCCccCCCCCCeeEEe Q lcl|NC_018274. 81 IVL--KE----EN--------PVYKTAEHLRKLLSGIANGKLSLALDADGKPAPVANTVQIS 128 (138) Q Consensus 81 ~~~--~~----~~--------~v~~rY~~Ai~~L~~va~G~~~L~~~~~~~~~~~~~~~~~~ 128 (138) ... +. .+ ..-+-|.+-|. +-..++..- ....-+.+.|- T Consensus 59 R~g~EG~~S~SeeG~S~sf~~~d~~~y~~~l~---~y~~~~~~~-------~~~~~g~~~f~ 110 (110) T protein:vir:39 59 RIGQEGMQSYSQEGLSMTFSESDFDEYADEIE---SWRKSKETE-------GDKKIGRFRLY 110 (110) T ss_pred cccccccceeecCCeeeeecccCcchhHHHHH---HHhhhcccc-------ccCcceeeeeC Confidence 221 11 01 11223333332 222222111 11122234444 Done!