Query lcl|NC_019515.1_cdsid_YP_007005947.1 [gene=F400_gp096] [protein=hypothetical protein] [protein_id=YP_007005947.1] [location=complement(65818..66300)] Match_columns 160 No_of_seqs 1 out of 4 Neff 1.0 Searched_HMMs 1612 Date Thu Nov 7 17:17:37 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_96 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_96_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:80362 Length: 140 97.9 7.1E-08 4.4E-11 59.8 6.4 133 1-160 1-137 (140) 2 protein:vir:100243 Length: 140 97.9 1.5E-07 9.4E-11 58.0 8.2 132 1-159 1-140 (140) 3 protein:vir:100075 Length: 140 97.9 1E-07 6.2E-11 59.0 7.1 133 1-160 1-137 (140) 4 protein:vir:1437 Length: 140 # 97.8 2.3E-07 1.4E-10 57.0 7.6 133 1-160 1-137 (140) 5 protein:vir:1273 Length: 127 # 97.7 8.7E-07 5.4E-10 53.8 9.3 126 1-156 1-127 (127) 6 protein:vir:194 Length: 149 # 97.5 7.9E-07 4.9E-10 54.0 6.8 133 1-160 2-148 (149) 7 protein:vir:93617 Length: 148 97.4 1.4E-06 8.7E-10 52.7 6.8 134 1-160 4-147 (148) 8 protein:vir:4347 Length: 164 # 97.3 1E-06 6.5E-10 53.4 5.9 139 1-160 1-155 (164) 9 protein:vir:94538 Length: 125 97.2 2.4E-06 1.5E-09 51.4 6.9 123 1-160 1-123 (125) 10 protein:vir:1386 Length: 149 # 97.2 4.4E-06 2.7E-09 50.0 7.8 139 1-158 1-149 (149) 11 protein:vir:1891 Length: 179 # 97.1 4.1E-06 2.5E-09 50.1 7.0 139 1-160 5-170 (179) 12 protein:vir:107568 Length: 146 97.1 5.8E-06 3.6E-09 49.3 7.5 141 1-159 1-146 (146) 13 protein:vir:105007 Length: 146 97.1 5.8E-06 3.6E-09 49.3 7.5 141 1-159 1-146 (146) 14 protein:vir:102085 Length: 146 97.1 5.8E-06 3.6E-09 49.3 7.5 141 1-159 1-146 (146) 15 protein:vir:102875 Length: 146 97.1 5.8E-06 3.6E-09 49.3 7.5 141 1-159 1-146 (146) 16 protein:vir:3873 Length: 128 # 97.0 8.7E-06 5.4E-09 48.3 8.1 127 1-156 1-128 (128) 17 protein:vir:1988 Length: 156 # 96.9 6.4E-05 3.9E-08 43.6 11.5 140 1-157 3-156 (156) 18 protein:vir:99833 Length: 190 96.7 6E-05 3.7E-08 43.7 10.4 139 1-159 2-190 (190) 19 protein:vir:105089 Length: 133 96.6 4.4E-05 2.7E-08 44.5 8.9 130 1-158 1-133 (133) 20 protein:vir:9930 Length: 108 # 96.4 1.7E-05 1E-08 46.8 5.6 108 6-157 1-108 (108) 21 protein:vir:9708 Length: 125 # 96.4 4.6E-05 2.8E-08 44.4 7.8 122 1-160 1-124 (125) 22 protein:vir:5745 Length: 135 # 96.3 8.2E-05 5.1E-08 43.0 9.0 130 1-160 1-135 (135) 23 protein:vir:2740 Length: 114 # 96.3 4.1E-05 2.6E-08 44.6 7.3 114 1-157 1-114 (114) 24 protein:vir:4906 Length: 114 # 96.3 4.1E-05 2.6E-08 44.6 7.3 114 1-157 1-114 (114) 25 protein:vir:95789 Length: 114 96.3 3.4E-05 2.1E-08 45.1 6.7 114 1-160 1-114 (114) 26 protein:vir:3617 Length: 112 # 96.1 5.8E-05 3.6E-08 43.8 7.0 112 1-156 1-112 (112) 27 protein:vir:102154 Length: 119 95.9 4.8E-05 3E-08 44.2 5.6 114 1-152 1-119 (119) 28 protein:vir:97088 Length: 157 95.7 0.00032 2E-07 39.7 9.4 133 1-159 1-157 (157) 29 protein:vir:98409 Length: 108 95.6 7.8E-05 4.9E-08 43.1 5.7 108 7-156 1-108 (108) 30 protein:vir:103917 Length: 115 95.6 0.00012 7.3E-08 42.1 6.6 115 7-156 1-115 (115) 31 protein:vir:96358 Length: 115 95.6 0.00012 7.3E-08 42.1 6.6 115 7-156 1-115 (115) 32 protein:vir:96225 Length: 115 95.6 0.00012 7.3E-08 42.1 6.6 115 7-156 1-115 (115) 33 protein:vir:9312 Length: 115 # 95.6 0.00012 7.3E-08 42.1 6.6 115 7-156 1-115 (115) 34 protein:vir:78858 Length: 115 95.6 0.00012 7.3E-08 42.1 6.6 115 7-156 1-115 (115) 35 protein:vir:97144 Length: 115 95.6 0.00012 7.3E-08 42.1 6.6 115 7-156 1-115 (115) 36 protein:vir:96486 Length: 112 95.6 0.00011 7.1E-08 42.2 6.5 112 1-152 1-112 (112) 37 protein:vir:99744 Length: 115 95.5 0.00012 7.5E-08 42.1 6.4 115 7-156 1-115 (115) 38 protein:vir:743 Length: 108 # 95.4 0.00015 9.1E-08 41.6 6.6 108 7-156 1-108 (108) 39 protein:vir:106623 Length: 115 95.4 0.00019 1.2E-07 41.0 7.1 114 7-156 1-115 (115) 40 protein:vir:99196 Length: 155 95.3 0.00032 2E-07 39.7 8.3 140 1-160 1-152 (155) 41 protein:vir:79091 Length: 175 94.7 0.0022 1.4E-06 35.1 11.2 142 1-160 1-169 (175) 42 protein:vir:103841 Length: 155 94.2 0.00093 5.8E-07 37.2 7.9 141 1-160 1-152 (155) 43 protein:vir:107757 Length: 189 94.1 0.0015 9.5E-07 36.0 8.9 90 69-160 1-93 (189) 44 protein:vir:79034 Length: 141 94.1 0.0007 4.4E-07 37.9 7.0 124 1-160 1-132 (141) 45 protein:vir:98342 Length: 125 93.9 0.00067 4.1E-07 38.0 6.5 121 1-160 1-125 (125) 46 protein:vir:79988 Length: 125 93.9 0.00067 4.1E-07 38.0 6.5 121 1-160 1-125 (125) 47 protein:vir:4704 Length: 125 # 93.9 0.00067 4.1E-07 38.0 6.5 121 1-160 1-125 (125) 48 protein:vir:81106 Length: 125 93.9 0.00067 4.1E-07 38.0 6.5 121 1-160 1-125 (125) 49 protein:vir:9414 Length: 125 # 93.9 0.00067 4.1E-07 38.0 6.5 121 1-160 1-125 (125) 50 protein:vir:107851 Length: 175 93.7 0.0034 2.1E-06 34.1 10.1 143 1-158 1-175 (175) 51 protein:vir:79225 Length: 155 93.6 0.0014 8.4E-07 36.3 7.7 140 1-160 1-152 (155) 52 protein:vir:3163 Length: 145 # 93.4 0.0068 4.2E-06 32.5 11.2 133 1-160 1-140 (145) 53 protein:vir:99546 Length: 200 93.1 0.0016 1E-06 35.9 7.4 95 64-160 1-143 (200) 54 protein:vir:101594 Length: 173 92.8 0.0015 9.6E-07 36.0 6.8 116 7-160 1-171 (173) 55 protein:vir:81147 Length: 126 92.5 0.0027 1.7E-06 34.6 7.7 124 1-159 1-126 (126) 56 protein:vir:5257 Length: 148 # 91.4 0.0055 3.4E-06 33.0 8.2 85 52-160 1-91 (148) 57 protein:vir:94490 Length: 137 91.4 0.002 1.2E-06 35.4 5.6 108 1-148 1-137 (137) 58 protein:vir:93738 Length: 137 91.4 0.002 1.2E-06 35.4 5.6 108 1-148 1-137 (137) 59 protein:vir:97427 Length: 137 91.4 0.002 1.2E-06 35.4 5.6 108 1-148 1-137 (137) 60 protein:vir:105916 Length: 149 91.0 0.0019 1.2E-06 35.4 5.3 108 1-148 13-149 (149) 61 protein:vir:96829 Length: 135 90.9 0.0034 2.1E-06 34.1 6.5 108 1-148 1-135 (135) 62 protein:vir:95894 Length: 137 89.3 0.004 2.5E-06 33.8 5.5 108 1-148 1-137 (137) 63 protein:vir:94108 Length: 149 88.6 0.0043 2.6E-06 33.6 5.2 108 1-148 13-149 (149) 64 protein:vir:5978 Length: 144 # 88.6 0.006 3.7E-06 32.8 5.9 114 1-157 4-144 (144) 65 protein:vir:107099 Length: 137 88.0 0.0044 2.7E-06 33.5 4.9 108 1-148 1-137 (137) 66 protein:vir:94654 Length: 142 85.5 0.025 1.6E-05 29.3 7.6 114 1-152 1-142 (142) 67 protein:vir:94796 Length: 137 84.5 0.017 1.1E-05 30.3 6.2 108 1-148 1-137 (137) 68 protein:vir:5257 Length: 148 # 82.2 0.022 1.3E-05 29.7 5.8 79 1-88 69-148 (148) 69 protein:vir:99546 Length: 200 82.1 0.018 1.1E-05 30.1 5.4 82 1-88 117-200 (200) 70 protein:vir:106570 Length: 182 82.0 0.031 1.9E-05 28.8 6.6 137 1-160 1-181 (182) 71 protein:vir:105330 Length: 137 81.8 0.021 1.3E-05 29.8 5.6 108 1-153 1-137 (137) 72 protein:vir:96121 Length: 137 81.3 0.012 7.5E-06 31.1 4.1 108 1-153 1-137 (137) 73 protein:vir:96105 Length: 193 79.1 0.086 5.3E-05 26.4 8.0 87 72-160 1-136 (193) 74 protein:vir:100887 Length: 139 78.2 0.056 3.5E-05 27.5 6.7 127 1-160 3-139 (139) 75 protein:vir:4956 Length: 153 # 77.5 0.071 4.4E-05 26.9 7.0 132 1-160 1-139 (153) 76 protein:vir:99528 Length: 92 # 77.3 0.014 8.8E-06 30.7 3.2 92 1-128 1-92 (92) 77 protein:vir:5703 Length: 150 # 76.9 0.13 8.1E-05 25.4 9.0 131 6-160 1-149 (150) 78 protein:vir:5000 Length: 141 # 76.8 0.081 5E-05 26.6 7.1 132 1-159 1-141 (141) 79 protein:vir:4859 Length: 140 # 76.6 0.079 4.9E-05 26.6 7.0 130 1-158 1-140 (140) 80 protein:vir:6071 Length: 150 # 74.7 0.16 9.6E-05 25.0 9.3 132 6-153 1-150 (150) 81 protein:vir:77650 Length: 155 72.7 0.038 2.3E-05 28.4 4.2 79 73-160 1-98 (155) 82 protein:vir:100223 Length: 139 69.8 0.18 0.00011 24.6 7.3 129 1-160 1-139 (139) 83 protein:vir:101563 Length: 155 69.8 0.055 3.4E-05 27.5 4.4 94 48-160 1-98 (155) 84 protein:vir:78607 Length: 155 68.7 0.1 6.2E-05 26.1 5.6 78 1-89 77-155 (155) 85 protein:vir:98557 Length: 149 68.7 0.23 0.00014 24.0 10.3 129 1-160 1-148 (149) 86 protein:vir:105467 Length: 144 67.8 0.2 0.00012 24.4 7.0 125 1-160 1-143 (144) 87 protein:vir:80037 Length: 199 67.3 0.081 5.1E-05 26.6 4.8 85 1-90 114-199 (199) 88 protein:vir:1838 Length: 149 # 67.2 0.26 0.00016 23.8 9.4 132 1-153 1-149 (149) 89 protein:vir:107757 Length: 189 65.3 0.13 8E-05 25.5 5.5 90 1-92 68-189 (189) 90 protein:vir:106728 Length: 155 65.0 0.13 8.4E-05 25.4 5.6 78 1-89 77-155 (155) 91 protein:vir:77650 Length: 155 62.7 0.18 0.00011 24.7 5.8 79 1-89 77-155 (155) 92 protein:vir:96105 Length: 193 62.2 0.17 0.00011 24.7 5.6 82 1-88 110-193 (193) 93 protein:vir:101563 Length: 155 61.5 0.19 0.00012 24.5 5.7 79 1-89 77-155 (155) 94 protein:vir:4833 Length: 140 # 61.2 0.31 0.00019 23.3 6.8 129 1-158 1-140 (140) 95 protein:vir:94069 Length: 168 59.4 0.24 0.00015 24.0 5.8 82 52-160 1-101 (168) 96 protein:vir:78077 Length: 141 55.2 0.1 6.3E-05 26.0 3.1 114 21-160 1-141 (141) 97 protein:vir:97327 Length: 116 54.0 0.14 9E-05 25.2 3.7 87 26-148 1-116 (116) 98 protein:vir:1243 Length: 116 # 54.0 0.14 9E-05 25.2 3.7 87 26-148 1-116 (116) 99 protein:vir:2026 Length: 150 # 51.8 0.57 0.00035 21.9 9.8 131 6-160 1-149 (150) 100 protein:vir:106728 Length: 155 50.8 0.26 0.00016 23.8 4.5 79 71-160 1-98 (155) 101 protein:vir:78607 Length: 155 50.2 0.25 0.00016 23.8 4.4 79 71-160 1-98 (155) 102 protein:vir:95062 Length: 116 42.2 0.4 0.00025 22.8 4.2 87 26-148 1-116 (116) 103 protein:vir:81067 Length: 119 40.2 0.86 0.00053 20.9 5.7 88 64-159 1-119 (119) 104 protein:vir:102963 Length: 163 36.9 1.1 0.00071 20.3 7.5 150 1-160 1-163 (163) 105 protein:vir:94069 Length: 168 36.1 0.94 0.00058 20.7 5.2 89 1-114 80-168 (168) 106 protein:vir:1164 Length: 156 # 35.6 1.2 0.00076 20.1 7.8 140 1-160 1-155 (156) 107 protein:vir:10367 Length: 119 35.5 1.2 0.00075 20.1 5.7 88 64-159 1-119 (119) 108 protein:vir:79179 Length: 155 35.3 1.2 0.00077 20.1 8.0 134 1-153 1-155 (155) 109 protein:vir:100312 Length: 152 32.0 1.5 0.0009 19.7 8.2 131 1-158 1-152 (152) 110 protein:vir:79115 Length: 148 25.4 2.1 0.0013 18.9 8.4 128 2-157 1-148 (148) No 1 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=97.89 E-value=7.1e-08 Score=59.78 Aligned_cols=133 Identities=17% Similarity=0.126 Sum_probs=81.8 Q ss_pred CCcccccCHHHHHHHHhhhccCCcc--hHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPE--YDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHK 78 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepe--yddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthk 78 (160) |.|=-.-.+|++.+-|.+|..+... -..+++..++.+.+.++... |. .|.. T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~a-------P~--------------------~tG~ 53 (140) T protein:vir:80 1 MSSIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRA-------PK--------------------KTGK 53 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CC--------------------Ccch Confidence 7764445567888888877544321 13455555555555555543 31 2345 Q ss_pred hhhhhhhhhcccCCCeEEEEEeccCCcccc--cccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 79 FVDSIKLVYEEDRGDGIMVFIGVDGGTTDT--GLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 79 fvdsiklvyeedrgdgimvfigvdggttdt--glsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiie 156 (160) .-+||+..+...+.-+.++-+|++.|+.-. +-+-..-+-|+||||++|||+-=|-..|.-.+-++++.+.+.+..-|. T Consensus 54 l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~ 133 (140) T protein:vir:80 54 LRRNIVSAALRQKDAPGLATAGVRVRTKGKADSPSNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAID 133 (140) T ss_pred hhhceeeeccccccccceeeeeeecccccccCCCCCcceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHH Confidence 566666555555554555566665443211 112234678999999999999444778888888888887777777666 Q ss_pred cccC Q lcl|NC_019515. 157 GDLK 160 (160) Q Consensus 157 gdlk 160 (160) --|. T Consensus 134 k~~~ 137 (140) T protein:vir:80 134 QALG 137 (140) T ss_pred HHhh Confidence 6555 No 2 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=97.88 E-value=1.5e-07 Score=57.97 Aligned_cols=132 Identities=14% Similarity=0.121 Sum_probs=78.2 Q ss_pred CCcccccCHHHHHHHHhhhccCCcc--hHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPE--YDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHK 78 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepe--yddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthk 78 (160) |.+--.-.+|++.+-|.+|.++.-. -+.+++..+..+++.++... | .+|.. T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~a-------p--------------------~~tG~ 53 (140) T protein:vir:10 1 MSSVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARA-------P--------------------KKTGK 53 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------C--------------------CChhh Confidence 7765566778888888888754421 24455555555555555543 3 23667 Q ss_pred hhhhhhhhhcccCCCeEEEEEeccCC--cccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHH----HHHHHHHH Q lcl|NC_019515. 79 FVDSIKLVYEEDRGDGIMVFIGVDGG--TTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIM----EEVSQRLL 152 (160) Q Consensus 79 fvdsiklvyeedrgdgimvfigvdgg--ttdtglsmqeladfiefgtskqparmpfhkswammeheim----eevsqrll 152 (160) +.+||+..+...+...-.+.+|+..+ .+-.+-+-.--+-|+||||++|||+-=+-..|.-.+-+++ +++.++|- T Consensus 54 l~~sI~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~ 133 (140) T protein:vir:10 54 LKRNIVTAALKQKDSPGIATAGVRVRTKGKADSPNNAFYWRFVELGTQFMKAEPFMRPAFDASIAQAEGAIRTEIARAID 133 (140) T ss_pred HHHhceecccccccccceeEEeeccccccccCCCCcccccceeccCcCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHH Confidence 78888877655433222233444322 1111223344578999999999998655666655554444 45555555 Q ss_pred HHhhccc Q lcl|NC_019515. 153 AIIEGDL 159 (160) Q Consensus 153 aiiegdl 159 (160) ..+.|-| T Consensus 134 k~~~~~~ 140 (140) T protein:vir:10 134 QVVGGGL 140 (140) T ss_pred HHhhcCC Confidence 6666666 No 3 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=97.88 E-value=1e-07 Score=58.97 Aligned_cols=133 Identities=17% Similarity=0.116 Sum_probs=81.6 Q ss_pred CCcccccCHHHHHHHHhhhccCCc--chHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEP--EYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHK 78 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnep--eyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthk 78 (160) |.+=-.-.+|++.+-|.+|.++.. .-..+++..++.+++.++... |. .|.. T Consensus 1 Ma~~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~a-------P~--------------------~tG~ 53 (140) T protein:vir:10 1 MSSIQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRA-------PK--------------------KTGK 53 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CC--------------------Chhh Confidence 776555567888888888865432 123344444444444444432 31 3567 Q ss_pred hhhhhhhhhcccCCCeEEEEEeccCCccc--ccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 79 FVDSIKLVYEEDRGDGIMVFIGVDGGTTD--TGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 79 fvdsiklvyeedrgdgimvfigvdggttd--tglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiie 156 (160) ..|||+..+...+..+..+.+|+..++-- .+-+...-+-|+|||||+|||+-=|-..|.-.+-++++.+.+.+..-|. T Consensus 54 l~~sI~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~ 133 (140) T protein:vir:10 54 LRRNIVSAALRQKDAPGLATAGVRVRTKGKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAID 133 (140) T ss_pred HHHhccccccccccccceEEeeeeeccccccCCCCccceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHH Confidence 78888888877777677777776533211 1123345678999999999999434667777776666665555544444 Q ss_pred cccC Q lcl|NC_019515. 157 GDLK 160 (160) Q Consensus 157 gdlk 160 (160) --+. T Consensus 134 k~~~ 137 (140) T protein:vir:10 134 RVLG 137 (140) T ss_pred HHhh Confidence 3333 No 4 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=97.78 E-value=2.3e-07 Score=57.03 Aligned_cols=133 Identities=17% Similarity=0.113 Sum_probs=81.4 Q ss_pred CCcccccCHHHHHHHHhhhccCCc-ch-HHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEP-EY-DDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHK 78 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnep-ey-ddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthk 78 (160) |.|--.-.+|++.+-|.+|.++.. +- ..+++..++.+.+.++... |. .|.. T Consensus 1 M~~~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~a-------P~--------------------~tG~ 53 (140) T protein:vir:14 1 MSSIQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRA-------PK--------------------KTGK 53 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CC--------------------Chhh Confidence 766444456777777777754432 22 4455666666666665543 31 2445 Q ss_pred hhhhhhhhhcccCCCeEEEEEeccCCccc--ccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 79 FVDSIKLVYEEDRGDGIMVFIGVDGGTTD--TGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 79 fvdsiklvyeedrgdgimvfigvdggttd--tglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiie 156 (160) .-+||+...........++.+|+.+|.-- .+-+-..-+-|+|||||+|||+-=|-..|...+-++++.+.+.+..-|. T Consensus 54 l~~sI~~~~~~~~~~~~~~~vg~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~ 133 (140) T protein:vir:14 54 LRRNIVSAALRQKDAPGLATAGVRVRTKGKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAID 133 (140) T ss_pred HHhhcccccccccccceeEEeeeeeccccccCCCCccceeeeeccccCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 66677655444444455666666443211 1123445678999999999999555788888888887777777666665 Q ss_pred cccC Q lcl|NC_019515. 157 GDLK 160 (160) Q Consensus 157 gdlk 160 (160) --+. T Consensus 134 k~~~ 137 (140) T protein:vir:14 134 RVLG 137 (140) T ss_pred HHhh Confidence 5555 No 5 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=97.67 E-value=8.7e-07 Score=53.80 Aligned_cols=126 Identities=20% Similarity=0.207 Sum_probs=74.3 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |++--.-.++.+.+-|.+|..+ -+.+.+.+-.+-|+.+++.+.- ..|.- ++ +|-... T Consensus 1 M~~~~i~Gl~el~~~l~~l~~~---~~~~~~~al~~~a~~v~~~~k~---~ap~~-----------~~------~tg~l~ 57 (127) T protein:vir:12 1 MADMSFDGIDDLTQYFEKIGGD---IEKVEPVALKAGGEIIAERQRS---HVNRS-----------DK------KQPHMQ 57 (127) T ss_pred CeeeeehhHHHHHHHHHHhhHH---HHHHHHHHHHHHHHHHHHHHHH---hCCCC-----------CC------ChhHHH Confidence 7775555678888888777643 2334444444444444443321 12321 11 244567 Q ss_pred hhhhhhh-cccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 81 DSIKLVY-EEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 81 dsiklvy-eedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiie 156 (160) |+|+.-= ..+++.+..|-||.+.++ --.+-|+|||||++|++-=+...|.-..-++++.+.+.+-.-|+ T Consensus 58 ~~I~~~~~k~~~~g~~~v~Vg~~~~~-------~~y~~f~E~GT~~~~a~Pf~~pa~~~~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 58 DNITVSNVRESKDGVRFVAVGPNKKV-------AYRGRFLEWGTSKMPPQPFIEKGGKEGEGPAVELMERILTAPIK 127 (127) T ss_pred HhhhccccccccCceeEEEEeeCCCC-------cceeeeeccCccCCCCCccchHhHHHHHHHHHHHHHHHHHHhcC Confidence 7775421 234455667778876543 23577999999999999445677777776666666655555554 No 6 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=97.48 E-value=7.9e-07 Score=54.03 Aligned_cols=133 Identities=22% Similarity=0.293 Sum_probs=68.8 Q ss_pred CCccc-ccCHHHHHHHHhhhccCCcc--hHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhh Q lcl|NC_019515. 1 MTSEL-YGDWDKFAQILHNLKDNEPE--YDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTH 77 (160) Q Consensus 1 mtsel-ygdwdkfaqilhnlkdnepe--yddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirth 77 (160) |..++ .-.+|++.+-|..|.++..+ -...++..++.|++.++... |. + |. T Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a-------P~-~-------------------~g 54 (149) T protein:vir:19 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRA-------PV-R-------------------TG 54 (149) T ss_pred cceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC-------CC-C-------------------ch Confidence 33222 12467777777777654332 23444445555555555432 32 2 22 Q ss_pred hhhhhhhhhhcccC-CCeEEEEEeccCCcccccccHH----------HHHHHHhhcCCCCccccchhhhHHHHHHHHHHH Q lcl|NC_019515. 78 KFVDSIKLVYEEDR-GDGIMVFIGVDGGTTDTGLSMQ----------ELADFIEFGTSKQPARMPFHKSWAMMEHEIMEE 146 (160) Q Consensus 78 kfvdsiklvyeedr-gdgimvfigvdggttdtglsmq----------eladfiefgtskqparmpfhkswammeheimee 146 (160) +.-+||+......+ +-++...+++.++..+.+.+-. --+-|+||||++|||+-=|-..|.-.+-++.+. T Consensus 55 ~l~~si~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PF~~pA~~~~k~~~~~~ 134 (149) T protein:vir:19 55 KLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASV 134 (149) T ss_pred hhhhhccccccccccccceeecccccccccccccccceeecCCCCccceeeeeccCCCCCCCCcchhHHHHHHHHHHHHH Confidence 33344433333222 2223333444333333222111 125699999999999944466777777777776 Q ss_pred HHHHHHHHhhcccC Q lcl|NC_019515. 147 VSQRLLAIIEGDLK 160 (160) Q Consensus 147 vsqrllaiiegdlk 160 (160) +...|-..|.--++ T Consensus 135 ~~~~l~~~l~k~~~ 148 (149) T protein:vir:19 135 AIARMNQAIDEVLS 148 (149) T ss_pred HHHHHHHHHHHHhc Confidence 66666666655555 No 7 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=97.35 E-value=1.4e-06 Score=52.67 Aligned_cols=134 Identities=18% Similarity=0.254 Sum_probs=71.8 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |+-++= .+|++.+-|..|... .-+.+.++.-.+-|+.+++.+... -|. + |.... T Consensus 4 ~~~~i~-Gldel~~~l~~L~~~--~~~~~~~~Al~~~a~~v~~~ak~~---aP~-~-------------------~g~l~ 57 (148) T protein:vir:93 4 TLLDFS-GLEDISRDLQLLSGA--ENNRVLREATRAGANVLKEEVVSR---APV-R-------------------RGKLR 57 (148) T ss_pred eeeeeh-hHHHHHHHHHHhHHH--HHHHHHHHHHHHHHHHHHHHHHhh---CCC-C-------------------cchhh Confidence 333332 378888888888432 122344555444455555444332 132 1 23333 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCccccccc----------HHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHH Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLS----------MQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQR 150 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtgls----------mqeladfiefgtskqparmpfhkswammeheimeevsqr 150 (160) +||+.-....+.-+....|++-++..+++-+ -.--+-|+|||||+|||+-=+-..|.-.+-++.+.+.++ T Consensus 58 ~~i~~~~~~~~~g~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~pa~PFl~pA~~~~k~~~~~~~~~~ 137 (148) T protein:vir:93 58 RNVVVLSRRSRDGGMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIAR 137 (148) T ss_pred hhceeccccccCCceeeeeeecccccccccccceeecCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHH Confidence 3333332222222333333333322222111 112356999999999999555678888877777777777 Q ss_pred HHHHhhcccC Q lcl|NC_019515. 151 LLAIIEGDLK 160 (160) Q Consensus 151 llaiiegdlk 160 (160) +-.-|.--|+ T Consensus 138 ~~~~i~k~~~ 147 (148) T protein:vir:93 138 MNRAIDEVLR 147 (148) T ss_pred HHHHHHHHhc Confidence 7777776666 No 8 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=97.33 E-value=1e-06 Score=53.37 Aligned_cols=139 Identities=20% Similarity=0.169 Sum_probs=76.6 Q ss_pred CC----cccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhh Q lcl|NC_019515. 1 MT----SELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRT 76 (160) Q Consensus 1 mt----selygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirt 76 (160) |. -++=| +|.+.+-|..|.+.- -+.++|..-.+-|+-|++.+.. ..|.+|+.+.- + T Consensus 1 Ma~~~~~~i~G-l~eL~~~l~~L~~~~--~~k~~r~Al~~aa~~v~~~ak~---~ap~~~~~~~~---------~----- 60 (164) T protein:vir:43 1 MADTVEFSITG-LDSLLGKLDSVTDDV--KRRGGRAALRKAAMIVVQAAKQ---GAEKVDDPGTG---------R----- 60 (164) T ss_pred CCcceEEeeec-HHHHHHHHHHhHHHH--HHHHHHHHHHHHHHHHHHHHHH---hCCcccCCCcc---------c----- Confidence 43 34433 778888787775421 1234555555555555554432 23665554321 1 Q ss_pred hhhhhhhhhhhc---ccCCCeEEEEEeccCCcccccccHH---------HHHHHHhhcCCCCccccchhhhHHHHHHHHH Q lcl|NC_019515. 77 HKFVDSIKLVYE---EDRGDGIMVFIGVDGGTTDTGLSMQ---------ELADFIEFGTSKQPARMPFHKSWAMMEHEIM 144 (160) Q Consensus 77 hkfvdsiklvye---edrgdgimvfigvdggttdtglsmq---------eladfiefgtskqparmpfhkswammeheim 144 (160) +.-+||..... .-+..++...+|+.+|+.....+-. =-+-|+|||||++|++-=+-..|.--+-++. T Consensus 61 -~l~~~i~~~~~~~~~~~~~~~~~~vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT~km~a~PFlrPA~~~~k~~~~ 139 (164) T protein:vir:43 61 -SISDNIALRWNGRLFKRTGDLGFRIGVLHGAVLPKKGERSDKTANAPTPHWRLLEFGTEDMRAQPFMRSALADNIAEVT 139 (164) T ss_pred -hhhhhhhhhcccCccccccceeEEecccccccccccccccccCCCCCcceEEEeecCCCCCCCCcchhhhHHHhHHHHH Confidence 12223333221 1223345556666666543322211 1356999999999999767788887777777 Q ss_pred HHHHHHHHHHhhcccC Q lcl|NC_019515. 145 EEVSQRLLAIIEGDLK 160 (160) Q Consensus 145 eevsqrllaiiegdlk 160 (160) +.+.+.|-.-|.--|+ T Consensus 140 ~~~~~~l~~~i~ka~~ 155 (164) T protein:vir:43 140 STFVSEYEKGIDRAIK 155 (164) T ss_pred HHHHHHHHHHHHHHHH Confidence 7666666665554444 No 9 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=97.24 E-value=2.4e-06 Score=51.41 Aligned_cols=123 Identities=13% Similarity=0.099 Sum_probs=62.0 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |.+..==+|+-+.++..+|+.-...-.+.++..-.+.++.|.+-... ..| .+|-..- T Consensus 1 Ma~~~~i~~~Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~---~ap--------------------~~tG~L~ 57 (125) T protein:vir:94 1 MANDFNIKFKGVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKG---LAR--------------------VDTGYMR 57 (125) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHh---hCC--------------------CCChhhh Confidence 55543334444444444443321111111111222223333332211 112 1344566 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIEGDLK 160 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiiegdlk 160 (160) +||+.--....+.|+-+-||.. + +-|-|+||||+++|++-=|..+|....-++.+.+.+ .+..-+| T Consensus 58 ~sI~~~~~~~~~~~~~~~v~~~---~-------~Ya~~vEfGT~~~~a~Pfl~pa~~~~~~~~~~~l~~----~l~~a~k 123 (125) T protein:vir:94 58 NNIQQDEVKEEHGVVTGRYVAR---A-------DYSSYNEYGTYRMSAQPFMAPSVAAMTPFFYKAVRD----ALNKAAK 123 (125) T ss_pred hhceecceeccCCcEEEEeeCC---C-------CccceeecccccCCCCcccchhHHHHHHHHHHHHHH----HHHHHhc Confidence 7776543344556666666542 2 357899999999999965677787776665555444 4444555 No 10 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=97.18 E-value=4.4e-06 Score=49.95 Aligned_cols=139 Identities=15% Similarity=0.268 Sum_probs=73.5 Q ss_pred CCcc---cccCHHHHHHHHhhhcc---CCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeeh Q lcl|NC_019515. 1 MTSE---LYGDWDKFAQILHNLKD---NEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILI 74 (160) Q Consensus 1 mtse---lygdwdkfaqilhnlkd---nepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrili 74 (160) |.+. -.-.++.+.+-|.+|.. .+---...++..++.+++.++.. .|..++.. ..++.. . T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~-------aP~~~~~~-----~~~~~~---~ 65 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPL-------IHISDDNS-----KSGRKG---S 65 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHh-------CCccCCcc-----cccccc---c Confidence 6652 23456778777777732 22222345555555555555544 35554421 111111 1 Q ss_pred hh-hhhhhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019515. 75 RT-HKFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLA 153 (160) Q Consensus 75 rt-hkfvdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrlla 153 (160) |+ -..-|+|+.-=-..++...+|-||.+.+.. |-.-.+-|+|||||+||++-=|-..|..-+-++.+-+.+.|.. T Consensus 66 ~~~~~~~d~i~~~~~~~~~g~~~~~VG~~~~~~----~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k 141 (149) T protein:vir:13 66 RPPGHAANNIPEPKIRKKKGNLQCVVGWEKSDN----TPFYYMKMEEWGTSERPPHHAFGKTNKILKRVYDNIAQKKYDN 141 (149) T ss_pred cccchhhhcceecccccccceeEEEeeccCCCC----CccceeeeeccCccCCCCCccchHHHHHHHHHHHHHHHHHHHH Confidence 11 123455544222234445567788765432 1234788999999999999434566766666555544444444 Q ss_pred Hhh---cc Q lcl|NC_019515. 154 IIE---GD 158 (160) Q Consensus 154 iie---gd 158 (160) .|. || T Consensus 142 ~i~~~lG~ 149 (149) T protein:vir:13 142 FVKEKLGD 149 (149) T ss_pred HHHHHhcC Confidence 443 56 No 11 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=97.11 E-value=4.1e-06 Score=50.15 Aligned_cols=139 Identities=17% Similarity=0.270 Sum_probs=80.4 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |+-++=| .|++.+-|..|.+. .-+.++|..-.+-|+-|++.+.- .-|.+|++.. +-+.. T Consensus 5 ~~~~i~G-l~eL~~~l~~L~~~--~~~k~~r~Al~~aa~~v~~~ak~---~ap~~~~~~~---------------~~~l~ 63 (179) T protein:vir:18 5 VEVSLTG-LESLLGKMEAVSEV--TRNKAGRFALRKAANIIRDRARS---NASRVDDPLT---------------KEAIH 63 (179) T ss_pred EEEEeec-HHHHHHHHHHhHHH--HHHHHHHHHHHHHHHHHHHHHHH---hCCccccccc---------------hhhhh Confidence 3334444 67777777777432 11345566666666666655543 3455544321 12233 Q ss_pred hhhhhhhcc---cCCCeEEEEEeccCCcccccc------------cHH------------HHHHHHhhcCCCCccccchh Q lcl|NC_019515. 81 DSIKLVYEE---DRGDGIMVFIGVDGGTTDTGL------------SMQ------------ELADFIEFGTSKQPARMPFH 133 (160) Q Consensus 81 dsiklvyee---drgdgimvfigvdggttdtgl------------smq------------eladfiefgtskqparmpfh 133 (160) ++|...... .+...+.+-+||-+|+..... .+. =-+-|+|||||++||+-=|- T Consensus 64 ~~i~~~~~~~~~~~~g~~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlr 143 (179) T protein:vir:18 64 KNIVASFSSKQFRRTGDLAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILR 143 (179) T ss_pred hheeecccccccccccceeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccch Confidence 444443332 333445667777777653221 110 01359999999999997777 Q ss_pred hhHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_019515. 134 KSWAMMEHEIMEEVSQRLLAIIEGDLK 160 (160) Q Consensus 134 kswammeheimeevsqrllaiiegdlk 160 (160) ..|.--.-++.+.+.++|..-|.--|| T Consensus 144 PA~~~~~~~a~~~i~~~l~~~i~k~lk 170 (179) T protein:vir:18 144 PAMNGVDNDVINVFSTEMGKAIDRAIR 170 (179) T ss_pred hhHHhhHHHHHHHHHHHHHHHHHHHHH Confidence 888877777777777777776666666 No 12 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=97.07 E-value=5.8e-06 Score=49.29 Aligned_cols=141 Identities=21% Similarity=0.214 Sum_probs=79.4 Q ss_pred CCc----ccccCHHHHHHHHhhhccCCc-chHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehh Q lcl|NC_019515. 1 MTS----ELYGDWDKFAQILHNLKDNEP-EYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIR 75 (160) Q Consensus 1 mts----elygdwdkfaqilhnlkdnep-eyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilir 75 (160) |.+ ++= .+|.|.+-|..|.++.. .-..+++.-++.|++.++..+ |..+..+-. +...+.- . T Consensus 1 Ma~~~~~~i~-Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~a-------p~~~~~~~~-----~~~~~~~-~ 66 (146) T protein:vir:10 1 MADGIDLDLL-GFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERA-------PRSPSPKKR-----SKSEPWR-T 66 (146) T ss_pred CCCceeeeeh-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCccccccc-----ccccccc-c Confidence 443 343 46888888888865321 123344444445555554443 443332211 0000000 0 Q ss_pred hhhhhhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019515. 76 THKFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAII 155 (160) Q Consensus 76 thkfvdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaii 155 (160) +-...|+|+..--..++.+.++-+|++.+..+.. --+-|+||||++||++-=+...|.-.+-++.+.+.+.|-.-| T Consensus 67 ~~~~~~~i~~~~~~~~~g~~~~~vg~~~~~~~~~----~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l 142 (146) T protein:vir:10 67 GQHGADQIKVTKAKLEGGIKTVKIGLNKADRSPW----FYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEM 142 (146) T ss_pred cccccccceeccccccccceeEEeeeccCCCCCc----ceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH Confidence 0112334443333345667788888866533322 246799999999999965678888888888877777777666 Q ss_pred hccc Q lcl|NC_019515. 156 EGDL 159 (160) Q Consensus 156 egdl 159 (160) .-.| T Consensus 143 ~ka~ 146 (146) T protein:vir:10 143 RLDL 146 (146) T ss_pred hhcC Confidence 6666 No 13 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=97.07 E-value=5.8e-06 Score=49.29 Aligned_cols=141 Identities=21% Similarity=0.214 Sum_probs=79.4 Q ss_pred CCc----ccccCHHHHHHHHhhhccCCc-chHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehh Q lcl|NC_019515. 1 MTS----ELYGDWDKFAQILHNLKDNEP-EYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIR 75 (160) Q Consensus 1 mts----elygdwdkfaqilhnlkdnep-eyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilir 75 (160) |.+ ++= .+|.|.+-|..|.++.. .-..+++.-++.|++.++..+ |..+..+-. +...+.- . T Consensus 1 Ma~~~~~~i~-Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~a-------p~~~~~~~~-----~~~~~~~-~ 66 (146) T protein:vir:10 1 MADGIDLDLL-GFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERA-------PRSPSPKKR-----SKSEPWR-T 66 (146) T ss_pred CCCceeeeeh-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCccccccc-----ccccccc-c Confidence 443 343 46888888888865321 123344444445555554443 443332211 0000000 0 Q ss_pred hhhhhhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019515. 76 THKFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAII 155 (160) Q Consensus 76 thkfvdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaii 155 (160) +-...|+|+..--..++.+.++-+|++.+..+.. --+-|+||||++||++-=+...|.-.+-++.+.+.+.|-.-| T Consensus 67 ~~~~~~~i~~~~~~~~~g~~~~~vg~~~~~~~~~----~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l 142 (146) T protein:vir:10 67 GQHGADQIKVTKAKLEGGIKTVKIGLNKADRSPW----FYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEM 142 (146) T ss_pred cccccccceeccccccccceeEEeeeccCCCCCc----ceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH Confidence 0112334443333345667788888866533322 246799999999999965678888888888877777777666 Q ss_pred hccc Q lcl|NC_019515. 156 EGDL 159 (160) Q Consensus 156 egdl 159 (160) .-.| T Consensus 143 ~ka~ 146 (146) T protein:vir:10 143 RLDL 146 (146) T ss_pred hhcC Confidence 6666 No 14 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=97.07 E-value=5.8e-06 Score=49.29 Aligned_cols=141 Identities=21% Similarity=0.214 Sum_probs=79.4 Q ss_pred CCc----ccccCHHHHHHHHhhhccCCc-chHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehh Q lcl|NC_019515. 1 MTS----ELYGDWDKFAQILHNLKDNEP-EYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIR 75 (160) Q Consensus 1 mts----elygdwdkfaqilhnlkdnep-eyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilir 75 (160) |.+ ++= .+|.|.+-|..|.++.. .-..+++.-++.|++.++..+ |..+..+-. +...+.- . T Consensus 1 Ma~~~~~~i~-Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~a-------p~~~~~~~~-----~~~~~~~-~ 66 (146) T protein:vir:10 1 MADGIDLDLL-GFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERA-------PRSPSPKKR-----SKSEPWR-T 66 (146) T ss_pred CCCceeeeeh-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCccccccc-----ccccccc-c Confidence 443 343 46888888888865321 123344444445555554443 443332211 0000000 0 Q ss_pred hhhhhhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019515. 76 THKFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAII 155 (160) Q Consensus 76 thkfvdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaii 155 (160) +-...|+|+..--..++.+.++-+|++.+..+.. --+-|+||||++||++-=+...|.-.+-++.+.+.+.|-.-| T Consensus 67 ~~~~~~~i~~~~~~~~~g~~~~~vg~~~~~~~~~----~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l 142 (146) T protein:vir:10 67 GQHGADQIKVTKAKLEGGIKTVKIGLNKADRSPW----FYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEM 142 (146) T ss_pred cccccccceeccccccccceeEEeeeccCCCCCc----ceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH Confidence 0112334443333345667788888866533322 246799999999999965678888888888877777777666 Q ss_pred hccc Q lcl|NC_019515. 156 EGDL 159 (160) Q Consensus 156 egdl 159 (160) .-.| T Consensus 143 ~ka~ 146 (146) T protein:vir:10 143 RLDL 146 (146) T ss_pred hhcC Confidence 6666 No 15 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=97.07 E-value=5.8e-06 Score=49.29 Aligned_cols=141 Identities=21% Similarity=0.214 Sum_probs=79.4 Q ss_pred CCc----ccccCHHHHHHHHhhhccCCc-chHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehh Q lcl|NC_019515. 1 MTS----ELYGDWDKFAQILHNLKDNEP-EYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIR 75 (160) Q Consensus 1 mts----elygdwdkfaqilhnlkdnep-eyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilir 75 (160) |.+ ++= .+|.|.+-|..|.++.. .-..+++.-++.|++.++..+ |..+..+-. +...+.- . T Consensus 1 Ma~~~~~~i~-Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~a-------p~~~~~~~~-----~~~~~~~-~ 66 (146) T protein:vir:10 1 MADGIDLDLL-GFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERA-------PRSPSPKKR-----SKSEPWR-T 66 (146) T ss_pred CCCceeeeeh-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCccccccc-----ccccccc-c Confidence 443 343 46888888888865321 123344444445555554443 443332211 0000000 0 Q ss_pred hhhhhhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019515. 76 THKFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAII 155 (160) Q Consensus 76 thkfvdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaii 155 (160) +-...|+|+..--..++.+.++-+|++.+..+.. --+-|+||||++||++-=+...|.-.+-++.+.+.+.|-.-| T Consensus 67 ~~~~~~~i~~~~~~~~~g~~~~~vg~~~~~~~~~----~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l 142 (146) T protein:vir:10 67 GQHGADQIKVTKAKLEGGIKTVKIGLNKADRSPW----FYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEM 142 (146) T ss_pred cccccccceeccccccccceeEEeeeccCCCCCc----ceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH Confidence 0112334443333345667788888866533322 246799999999999965678888888888877777777666 Q ss_pred hccc Q lcl|NC_019515. 156 EGDL 159 (160) Q Consensus 156 egdl 159 (160) .-.| T Consensus 143 ~ka~ 146 (146) T protein:vir:10 143 RLDL 146 (146) T ss_pred hhcC Confidence 6666 No 16 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=97.02 E-value=8.7e-06 Score=48.33 Aligned_cols=127 Identities=21% Similarity=0.173 Sum_probs=67.9 Q ss_pred CCcccccCHHHHHHHHhhhccC-CcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDN-EPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdn-epeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkf 79 (160) |..++-| .+.+.+-|.+|... +-.-.+.++..++.+++.+++-. |. +.|.. ..+-.. T Consensus 1 m~v~i~G-l~el~~~l~~l~~~~~k~~~~al~~ga~~~~~~~k~~a-------p~----------~~~~~----~~~~h~ 58 (128) T protein:vir:38 1 MGVKVTG-DAELLANLNKLQFGVAKEARAAVRDGAQKFADKLKSNT-------PE----------WDGET----DMSGHL 58 (128) T ss_pred Cccchhh-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CC----------cCCCC----cccchh Confidence 8888665 45566666666432 22344555555666666665432 32 11210 111224 Q ss_pred hhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 80 VDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 80 vdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiie 156 (160) -|+|+.---.+.+....+.||.+.++. =.+-|+|||||++|++-=+-+.|.-.+-++.+.+.+.|-.-|- T Consensus 59 ~d~I~~~~~k~~~g~~~~~VG~~k~~~-------~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 59 RDDIKLSSVRETSGLTEVDVGYGKDTG-------WRAHFPNSGTSMQDPQHFIEETQEIMRPVVIAAFLSHLKEGGM 128 (128) T ss_pred hhhhccccccccCceeEEEeeecCCCc-------eEEeeeccCccCCCCCcchhHHHHHhHHHHHHHHHHHHHhhcC Confidence 456654222223333557788765432 2478999999999999444566666665555544444432111 No 17 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=96.85 E-value=6.4e-05 Score=43.59 Aligned_cols=140 Identities=22% Similarity=0.288 Sum_probs=92.5 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccc-cC----Ccccchhhhhhhhhcccc-ceeeh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQE-ID----MPALDDEYLADKVSEGYD-SRILI 74 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqe-id----mpalddeyladkvsegyd-srili 74 (160) |.-++=.|++.+.+.|+.|.... +-..+.+.+|..+.+.+++-++-|. -| .++|...|++.|-..|+. .++|+ T Consensus 3 ~~i~~~~d~~~l~~~L~~l~~~~-~~~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~~~~~L~ 81 (156) T protein:vir:19 3 LDMNVAVDVRRIQLALDELGTVT-RDRAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFVPGSILT 81 (156) T ss_pred EEEEEeecHHHHHHHHHHHHhhh-ccHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCCCCcchh Confidence 55566789999999999985432 2236889999999998888886553 33 678999999999888876 46999 Q ss_pred hhhhhhhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCC--------CccccchhhhHHHHHHHHHHH Q lcl|NC_019515. 75 RTHKFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSK--------QPARMPFHKSWAMMEHEIMEE 146 (160) Q Consensus 75 rthkfvdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtsk--------qparmpfhkswammeheimee 146 (160) +|....+||+-.+. +++. .||.+ ...|-.-+||..+ -||| ||---=.-.+.+|.+. T Consensus 82 ~tg~L~~Si~~~~~---~~~v--~vGt~----------~~yA~vHqfG~~~~~~~~~~~iPaR-pfLG~s~~d~~~I~~~ 145 (156) T protein:vir:19 82 LHGDLARSITTDYG---QDYA--LIGSP----------KIYAAIHQWGGTPDMAPRPAGVPAR-PYMGLDKTGEQEIFDA 145 (156) T ss_pred hhHHHHHHhhheec---CCEE--EEecc----------hhhhHHhhcCcccccCCCccccCCc-cccCCCHHHHHHHHHH Confidence 99999999986553 4444 34432 3679999999875 4555 2211111223334333 Q ss_pred HHHHHHHHhhc Q lcl|NC_019515. 147 VSQRLLAIIEG 157 (160) Q Consensus 147 vsqrllaiieg 157 (160) |..-|-.++.- T Consensus 146 i~~~l~~~~~~ 156 (156) T protein:vir:19 146 IRKRVSAALRQ 156 (156) T ss_pred HHHHHHHHhhC Confidence 33333333322 No 18 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=96.71 E-value=6e-05 Score=43.74 Aligned_cols=139 Identities=19% Similarity=0.272 Sum_probs=88.6 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccc-cC---Ccccchhhhhhhhhccccceeehhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQE-ID---MPALDDEYLADKVSEGYDSRILIRT 76 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqe-id---mpalddeyladkvsegydsrilirt 76 (160) |.-.+=.|++.+.+.|+.|.++-..-.++.+.+|+.+....++-|+-|. -| .+.+...|++.|-..| ..+|.+| T Consensus 2 ~~i~i~~d~~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~--~~~L~~t 79 (190) T protein:vir:99 2 AGITLEWDGRRALDVLNAGSAALGDPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNR--DKILTLD 79 (190) T ss_pred ceeEEEecHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCC--Cccceec Confidence 4444666999999999988777555568999999999999999887653 33 5678888888775554 6899999 Q ss_pred hhhhhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCC------------------------------ Q lcl|NC_019515. 77 HKFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQ------------------------------ 126 (160) Q Consensus 77 hkfvdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskq------------------------------ 126 (160) -...+||+-.+. +|+++| |.+ ..-|..-+||...+ T Consensus 80 g~L~~Si~~~~~---~~~v~v--Gtn----------~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 144 (190) T protein:vir:99 80 GHLRNLLRYQLD---GSELLF--GSD----------RPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVPRRRSN 144 (190) T ss_pred HHHHHHHhheec---CcEEEE--ecC----------cchhhhhhcCCcccccccchhhhhhhhhhhhhhhcccccccccc Confidence 999999996554 345444 432 24466667775433 Q ss_pred --------------ccc--cchhhhHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_019515. 127 --------------PAR--MPFHKSWAMMEHEIMEEVSQRLLAIIEGDL 159 (160) Q Consensus 127 --------------par--mpfhkswammeheimeevsqrllaiiegdl 159 (160) ||| |+|. .-.+.+|.+-+..-|-.+++.-- T Consensus 145 ~~~~~~~~~~~v~IPaRpfLG~s---~~d~~~I~~~i~~~l~~~~~~~~ 190 (190) T protein:vir:99 145 FAQDVQIGPYTIQMPARPWLGTS---SQDDDTILQRVERYLQRALRERA 190 (190) T ss_pred cchhcccccceeeecCcccCCCC---HHHHHHHHHHHHHHHHHHHhhcC Confidence 444 2222 12233333333333333333322 No 19 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=96.59 E-value=4.4e-05 Score=44.45 Aligned_cols=130 Identities=17% Similarity=0.196 Sum_probs=68.8 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |.+--.-.++.+.+-|..|....- ..+.++.-.+-|+.|++.+.-. .|.-+... + -..- T Consensus 1 M~~~~i~Gl~el~~~l~~L~~~~~--~k~~~~Al~~~a~~i~~~ak~~---ap~~~~~~----------~------~~~~ 59 (133) T protein:vir:10 1 MIRMEVKGLDELERQLTALGEKVA--TKVLRDAGREALKVVEEDMKQH---AGFDETST----------G------QHMR 59 (133) T ss_pred CeeEeeehHHHHHHHHHHhHHHHH--HHHHHHHHHHHHHHHHHHHHHh---CCCCCCcc----------h------hhhh Confidence 555444557888888888754321 2344444444444444433221 23222110 0 1133 Q ss_pred hhhhhhhccc--CCCe-EEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019515. 81 DSIKLVYEED--RGDG-IMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIEG 157 (160) Q Consensus 81 dsiklvyeed--rgdg-imvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiieg 157 (160) +||++-..-. ++.| ++|.+|-+-++. --+-|+|||||+||++-=+-..|.-.+-++.+.+.+.|..-|+. T Consensus 60 ~~I~v~~~~~~~~~~~~~~v~vg~~~~~~-------~y~~f~E~GT~k~~a~PF~~pA~~~~~~~~~~~~~~~~~~~l~K 132 (133) T protein:vir:10 60 DSIKIRSSTRKAQGNAVVTLRVGPSKQHH-------MKVLAQEFGTVKQVADPFIRPALDYNVQTVLRVLTVEIRNGIQN 132 (133) T ss_pred hcccccccccccCccceEEEEecCCCCcc-------ceEeeeccCCCCCCCCccchHHHHHhHHHHHHHHHHHHHHHhhc Confidence 5565432222 2223 445554322111 13459999999999995556788877777777776666666655 Q ss_pred c Q lcl|NC_019515. 158 D 158 (160) Q Consensus 158 d 158 (160) - T Consensus 133 ~ 133 (133) T protein:vir:10 133 R 133 (133) T ss_pred C Confidence 4 No 20 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=96.41 E-value=1.7e-05 Score=46.78 Aligned_cols=108 Identities=13% Similarity=0.080 Sum_probs=57.0 Q ss_pred ccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhhhhhhh Q lcl|NC_019515. 6 YGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFVDSIKL 85 (160) Q Consensus 6 ygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfvdsikl 85 (160) .-.-|++.+-|.++ ...-++.++..-.+.|+.|.+.+.. ..| .+|-..-+||+. T Consensus 1 i~Gld~l~~~l~~~---~~~~~~~v~~al~~~a~~i~~~ak~---~aP--------------------v~TG~Lr~sI~~ 54 (108) T protein:vir:99 1 MRGLDRFLRSVERK---QKSVRIAVDKELSKSAARIERQAKI---LAP--------------------VDTGWLRAQIYS 54 (108) T ss_pred CchHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHh---cCC--------------------cCchhhhcceee Confidence 33345555555444 3345555666666677776665432 123 135556677764 Q ss_pred hhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019515. 86 VYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIEG 157 (160) Q Consensus 86 vyeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiieg 157 (160) ... ++..+-|+. ..+-|-|+||||+++||+-=|-.++....-++.+++.. ++.- T Consensus 55 ~~~----~~~~~~v~~----------~~~Ya~~vE~GT~~m~a~Pf~~pa~~~~~~~~~~~i~~----~lrk 108 (108) T protein:vir:99 55 EQQ----RLLHYRVVS----------PALYSIYLELGTRKMEAQSFLDPALRKEWPVLMANIKK----MFKR 108 (108) T ss_pred eec----CcEEEEeec----------CcccchhcccCccccCCCcchhhhHHHHHHHHHHHHHH----HhcC Confidence 332 223333332 23579999999999999833344554443333333332 2222 No 21 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=96.38 E-value=4.6e-05 Score=44.38 Aligned_cols=122 Identities=15% Similarity=0.107 Sum_probs=62.2 Q ss_pred CCcccccCHHHHHHHHhhhccC-CcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDN-EPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdn-epeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkf 79 (160) |.+ ..+.+.+-|.+|-.. +..-...++.-++.++++++.-. |.- .|.. .... T Consensus 1 mv~----Gl~el~~~l~~l~~~~~~~~~~al~~ga~~~~~~~k~~a-------p~~----------~~~~------~~hl 53 (125) T protein:vir:97 1 MTK----GLDEILANLTKLEVKAPKTAKAAVTEVAKEFEKALKANT-------PVY----------EVET------DERL 53 (125) T ss_pred Cch----hHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhC-------CcC----------CCCc------hhhH Confidence 654 346666666666321 22233444444444555554432 322 1211 1135 Q ss_pred hhhhhhhh-cccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_019515. 80 VDSIKLVY-EEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIEGD 158 (160) Q Consensus 80 vdsiklvy-eedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiiegd 158 (160) -|||+.-= ..++.....+-||.+-++. -.+-|+|||||+||++-=+-+.|.-.+-+ +.+++...+.-- T Consensus 54 ~d~I~~~~~k~~~~g~~~~~VG~~k~~~-------~y~~f~E~GT~k~~~~pF~~pa~~~~k~~----~~~~~~~~~~~~ 122 (125) T protein:vir:97 54 QEDTVISGFKGANVGIVSKEIGYGKATG-------WRAHYPNDGTIYQRGQDFKERTINQMTPK----AKQLYAEKVKEG 122 (125) T ss_pred HhhhhcccccccccCceEEEEeecCCCc-------eeEeeeccCccCCCcCccchHhHHHhHHH----HHHHHHHHHHHH Confidence 67776421 1233333456788764432 35889999999999993334555544433 344444444444 Q ss_pred cC Q lcl|NC_019515. 159 LK 160 (160) Q Consensus 159 lk 160 (160) |+ T Consensus 123 L~ 124 (125) T protein:vir:97 123 LG 124 (125) T ss_pred hc Confidence 55 No 22 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=96.33 E-value=8.2e-05 Score=42.98 Aligned_cols=130 Identities=15% Similarity=0.138 Sum_probs=63.0 Q ss_pred CCcc-cccCHHHHHHHHhhhccCC-cch-HHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhh Q lcl|NC_019515. 1 MTSE-LYGDWDKFAQILHNLKDNE-PEY-DDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTH 77 (160) Q Consensus 1 mtse-lygdwdkfaqilhnlkdne-pey-ddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirth 77 (160) |-.+ -.-..+.+.+-|.+|..+. ..- ..+++..++-+.+.++.. .|.-+.. .+- T Consensus 1 M~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~-------ap~~~~~----------------~~g 57 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQN-------AGYDNSS----------------TNA 57 (135) T ss_pred CceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHh-------CCCCCCC----------------chh Confidence 3333 3344778888888876542 111 233444444444444433 2322110 112 Q ss_pred hhhhhhhhhh-cccCCCe-EEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019515. 78 KFVDSIKLVY-EEDRGDG-IMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAII 155 (160) Q Consensus 78 kfvdsiklvy-eedrgdg-imvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaii 155 (160) ...+||+..- ..++|++ +.|.+|...+.- --+-|+|||||+||++-=+-..|.-.+-++.+.+.+.|-.-| T Consensus 58 ~l~~~I~i~~~k~~~~~~~v~v~vg~~~~~~-------~~~~f~E~GT~~~~a~PF~~pa~~~~~~~~~~~~~~~~~~~l 130 (135) T protein:vir:57 58 HMRDSIKIRSSRGKAGSTVVVLRVGPTRSHY-------MKALAQEFGTIKQVAKPFIRPALDYNKMQVLRILTVEIRDGL 130 (135) T ss_pred hHHhhcccccccccccceeEEEEecCCCCcc-------eeEeecccCCCCCCCCcchhHhHHHhHHHHHHHHHHHHHHHH Confidence 2334443321 1223333 344444332211 125578999999999944456777766666655555544443 Q ss_pred hcccC Q lcl|NC_019515. 156 EGDLK 160 (160) Q Consensus 156 egdlk 160 (160) .--.+ T Consensus 131 ~ka~r 135 (135) T protein:vir:57 131 STLSR 135 (135) T ss_pred HHhcC Confidence 33333 No 23 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=96.32 E-value=4.1e-05 Score=44.61 Aligned_cols=114 Identities=21% Similarity=0.240 Sum_probs=62.8 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |++=-+-..|++.+-|.++.. .-.-.++++..+.++++.+++... . ..++ +|-..- T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~-~~~v~~~~~~~~~~~~~~~~~~a~-------~----------~~p~------~TG~Lr 56 (114) T protein:vir:27 1 MATIEFEGLDEMAQSLLKNAS-PEKRSKVLRKYGSKLKEAAVNRAQ-------F----------NKGY------STGATR 56 (114) T ss_pred CeeeeeehHHHHHHHHHHhcC-HHHHHHHHHHHHHHHHHHHHHhcc-------c----------CCCC------Cchhhh Confidence 887444446787777766532 222355666667777766654321 0 1122 456667 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIEG 157 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiieg 157 (160) +||+.-. .++|.- ||.. .+-|-|+||||+++|||-=+...|.-. -+++.++|..+++- T Consensus 57 ~sI~~~~---~~~~~~--V~~~----------~~Ya~~vEfGT~km~a~Pfl~PA~~~~----~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 57 RSITLQV---ESDKAT--VEAL----------TSYSGYLEVGTRKMEAQPFMKPALDEV----APKMVEELAKWDET 114 (114) T ss_pred hceeeee---cCCeeE--ecCC----------CCccceecccccccCCCCchhhhHHHH----HHHHHHHHHHHhcC Confidence 7876533 233432 2221 245789999999999983334455433 33344444444444 No 24 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=96.32 E-value=4.1e-05 Score=44.61 Aligned_cols=114 Identities=21% Similarity=0.240 Sum_probs=62.8 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |++=-+-..|++.+-|.++.. .-.-.++++..+.++++.+++... . ..++ +|-..- T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~-~~~v~~~~~~~~~~~~~~~~~~a~-------~----------~~p~------~TG~Lr 56 (114) T protein:vir:49 1 MATIEFEGLDEMAQSLLKNAS-PEKRSKVLRKYGSKLKEAAVNRAQ-------F----------NKGY------STGATR 56 (114) T ss_pred CeeeeeehHHHHHHHHHHhcC-HHHHHHHHHHHHHHHHHHHHHhcc-------c----------CCCC------Cchhhh Confidence 887444446787777766532 222355666667777766654321 0 1122 456667 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIEG 157 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiieg 157 (160) +||+.-. .++|.- ||.. .+-|-|+||||+++|||-=+...|.-. -+++.++|..+++- T Consensus 57 ~sI~~~~---~~~~~~--V~~~----------~~Ya~~vEfGT~km~a~Pfl~PA~~~~----~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 57 RSITLQV---ESDKAT--VEAL----------TSYSGYLEVGTRKMEAQPFMKPALDEV----APKMVEELAKWDET 114 (114) T ss_pred hceeeee---cCCeeE--ecCC----------CCccceecccccccCCCCchhhhHHHH----HHHHHHHHHHHhcC Confidence 7876533 233432 2221 245789999999999983334455433 33344444444444 No 25 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=96.31 E-value=3.4e-05 Score=45.08 Aligned_cols=114 Identities=18% Similarity=0.266 Sum_probs=64.9 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |+-++=| -|++.+-|.++.+.- .+.++..-.+.|..+.+.+... .|. +|-..- T Consensus 1 msi~i~G-ld~l~~~l~~~~~~~---~~~v~~al~~~a~~i~~~ak~~---aPv--------------------~TG~Lr 53 (114) T protein:vir:95 1 MAIKWQG-IEKLVATISNAQPKA---VEQSLQVLKNNGEKGKRIAKQL---APK--------------------DTEFLK 53 (114) T ss_pred Ceeeeeh-HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHh---CCc--------------------Cchhhh Confidence 8877766 577887777776432 2333444444455444433221 231 244455 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIEGDLK 160 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiiegdlk 160 (160) +||+. +.+|.-..||.. .+-|-|+||||+++||+-=+-.+|....- ++.++|...+.+.+| T Consensus 54 ~sI~~-----~~~g~~~~V~~~----------~~Ya~yvE~GT~~~~aqPfl~pa~~~~~~----~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 54 DHITT-----SYPGMEAHIHGE----------AGYDGYQEYGTRFQPGTPHFRPMMEQIQP----QFQKDMTDVMKGAFK 114 (114) T ss_pred hceee-----ecCceEEEeecC----------CCccceeecCccccCCCccchhhHHHHHH----HHHHHHHHHHHhhcC Confidence 66653 233433334321 13578999999999999444556654443 445566666777777 No 26 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=96.10 E-value=5.8e-05 Score=43.83 Aligned_cols=112 Identities=17% Similarity=0.263 Sum_probs=59.0 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |...+ +|+.+.+++.+|+++. .++.++.+..+.+..|.+.+.. ..| .+|-..- T Consensus 1 M~~~i--~i~Gld~l~~~L~~~~--~~~~~~~al~~~~~~i~~~ak~---~aP--------------------vdTG~Lr 53 (112) T protein:vir:36 1 MKSSL--SFKGIDQLVKHLDKAA--SLKGVQQVVKSNTSNMTANMQK---LVP--------------------VDTGYMK 53 (112) T ss_pred Cceee--eehhHHHHHHHHHhhh--hHHHHHHHHHHHHHHHHHHHHH---hCC--------------------CCchhhh Confidence 54432 3334444455555432 2355666666666666654432 123 1233445 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiie 156 (160) +||+.-. ..+|+.+-||.. .+-|-|+||||+++||+-=+-.+|...+-++.+ ++-.++. T Consensus 54 ~si~~~~---~~~~~~~~V~~~----------~~Ya~~vE~GT~k~~a~Pfl~pa~~~~~~~~~~----~i~~~lr 112 (112) T protein:vir:36 54 RSIKMEL---TEGGFSGQAGPH----------TDYSAYVEYGTRFQSAQPFVKPAYNEQKGVFIK----DLERLLK 112 (112) T ss_pred hceeeee---cCCceEEEeecC----------CCccceeeccccccCCCcchhhhHHHHHHHHHH----HHHHHcC Confidence 6665322 334555555532 235889999999999994444566655444333 3333333 No 27 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=95.87 E-value=4.8e-05 Score=44.25 Aligned_cols=114 Identities=21% Similarity=0.248 Sum_probs=55.0 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |.+=-.-..|.+-|-|.++- -.-+.+-+..-.+.++-|++-++.. .|- +.|--.+ T Consensus 1 Ma~iel~G~del~~~l~~~g---~~~~~ie~kAlk~g~e~I~~~~~~n---~P~----------~tg~lkk--------- 55 (119) T protein:vir:10 1 MASLEIEGFEEFEKFISEDM---VLDESTKRKGIKAGITKIGKAIEKN---SPI----------KSGRLSK--------- 55 (119) T ss_pred CceeehhhHHHHHHHHHhhh---hhhHHHHHHHHHHHhHHHHHHHhhc---CCc----------ccCCcce--------- Confidence 65533333555555553332 2223333333344455555544332 232 1111111 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchh-----hhHHHHHHHHHHHHHHHHH Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFH-----KSWAMMEHEIMEEVSQRLL 152 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfh-----kswammeheimeevsqrll 152 (160) |+-+- .-+| ++-+|.|- |-.=-.-|.||||||+||+-||- .+|.-.-+.+.+|+-..+- T Consensus 56 --ik~~~---kk~g-~~~VG~~k-------s~~fy~kF~EFGTSkm~a~~pF~~~a~~~~~~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 56 --VKIRV---KNTG-LATEGTAS-------SSEFYDIFQNFGTSEQKAHVGYFDRAVDETTNEAVEEVAEIIFRKMR 119 (119) T ss_pred --eeeee---ecCc-eeEeccCC-------cchhhhhhccccccccCCCCCccccccccChHHHHHHHHHHHHHhcC Confidence 22221 1223 57777743 33345679999999999998874 3454444444444433332 No 28 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=95.69 E-value=0.00032 Score=39.74 Aligned_cols=133 Identities=16% Similarity=0.177 Sum_probs=83.1 Q ss_pred CCccc-ccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 1 MTSEL-YGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 1 mtsel-ygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkf 79 (160) |.-+. =-|.+.+.|.|..|++ +-..+.|..-++-|+-+++-+.-. .| .+|-+. T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~---~~~~v~R~A~~~ga~vv~dear~~---aP--------------------~~tG~L 54 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVE---HSSDVVRTMTYESAVAVRESAKAF---VN--------------------DETGKL 54 (157) T ss_pred CeeEeecccHHHHHHHHHHhHH---HHHHHHHHHHHHHHHHHHHHHHHh---CC--------------------CCcchh Confidence 43333 2355667777777753 455678888888777777654311 12 157889 Q ss_pred hhhhhhhhcccC-CCeEEE-EEeccCCcccccc------cH----------HHHHHHHhhcCCC-Cccc----cchhhhH Q lcl|NC_019515. 80 VDSIKLVYEEDR-GDGIMV-FIGVDGGTTDTGL------SM----------QELADFIEFGTSK-QPAR----MPFHKSW 136 (160) Q Consensus 80 vdsiklvyeedr-gdgimv-figvdggttdtgl------sm----------qeladfiefgtsk-qpar----mpfhksw 136 (160) -+||...|..++ ++|+.+ .||+..++.--|- +- .=...|.||||++ -||+ --|...= T Consensus 55 kksI~~~~~~~~s~~g~~~~~Vg~~~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k 134 (157) T protein:vir:97 55 RNNLYVAYSPEESVEGIQTYAVSWRKKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVA 134 (157) T ss_pred hhheeeeeccccCCCceEEEEEeecCCccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhH Confidence 999998886544 578754 4899876532221 00 0023445555554 3433 1244555 Q ss_pred HHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_019515. 137 AMMEHEIMEEVSQRLLAIIEGDL 159 (160) Q Consensus 137 ammeheimeevsqrllaiiegdl 159 (160) .-+...+..++.|++.....||- T Consensus 135 ~~a~~~~~~~l~k~I~e~l~g~~ 157 (157) T protein:vir:97 135 MQIPDIARAAGAKKYAELQRGDT 157 (157) T ss_pred HHHHHHHHHHHHHHHHHHhcCCC Confidence 66777888889999999999999 No 29 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=95.59 E-value=7.8e-05 Score=43.10 Aligned_cols=108 Identities=13% Similarity=0.212 Sum_probs=55.9 Q ss_pred cCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhhhhhhhh Q lcl|NC_019515. 7 GDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLV 86 (160) Q Consensus 7 gdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfvdsiklv 86 (160) =.|+-+-+++..|+++.- .+.++.+-.+.|..|.+..... .| .+|-..-+||+.- T Consensus 1 i~i~Gld~l~~~l~~~~~--~~~~~~al~~~a~~i~~~ak~~---ap--------------------vdTG~Lr~si~~~ 55 (108) T protein:vir:98 1 MKITGIDALQKKLRKNAT--LNDVKHVVKRNTVSMNKNMQNL---AP--------------------VDTGNMKRSITSE 55 (108) T ss_pred CcchhHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHh---CC--------------------CCchhhHhhceee Confidence 455555566666654422 2334445455555444433211 12 1345566777643 Q ss_pred hcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 87 YEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 87 yeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiie 156 (160) .. .+|.-+-||.. .+-|-|.||||+++||+-=|-.+|....-.+++++ -.++. T Consensus 56 ~~---~~~~~~~V~~~----------~~Ya~~vE~GT~~m~aqPFl~pa~~~~~~~~~~~i----~~~lr 108 (108) T protein:vir:98 56 FT---DGGLTGTTIPH----------TDYAGYVEYGTRFQAAQPFVKPAFDVQKKIFTNDL----ERLTK 108 (108) T ss_pred ee---cCceEEEeecC----------CCccceeeccccccCCCcchhhHHHHHHHHHHHHH----HHHcC Confidence 32 23343444432 13588999999999999444566765554444433 33333 No 30 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=95.56 E-value=0.00012 Score=42.13 Aligned_cols=115 Identities=17% Similarity=0.202 Sum_probs=56.7 Q ss_pred cCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhhhhhhhh Q lcl|NC_019515. 7 GDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLV 86 (160) Q Consensus 7 gdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfvdsiklv 86 (160) =.|+-+..++..|+.-.....+.++.+-.+-++.|.+..... .|+ .|.. -.+|...-+||+.- T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~---a~~------------~~~~--p~~TG~Lr~sI~~~ 63 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLK---ARE------------VMNK--GYWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---ccc------------cCCC--CCCchhhhhcceee Confidence 345444455555544333344444444444444444333211 111 1111 13567777887642 Q ss_pred hcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 87 YEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 87 yeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiie 156 (160) . .+|..+-||. + .+-|-|+||||+++|||-=|-..|....-++. +++-.++. T Consensus 64 --~--~g~~~~~v~~---~-------~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~----~~i~~~~k 115 (115) T protein:vir:10 64 --K--TGDLQYTITS---H-------AAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTV----EELKALFE 115 (115) T ss_pred --e--cCceEEEeec---C-------ccchhhhcccccccCCCCchhhhHHHHHHHHH----HHHHHHhC Confidence 1 2233333332 2 24688999999999999656666654443333 33333333 No 31 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=95.56 E-value=0.00012 Score=42.13 Aligned_cols=115 Identities=17% Similarity=0.202 Sum_probs=56.7 Q ss_pred cCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhhhhhhhh Q lcl|NC_019515. 7 GDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLV 86 (160) Q Consensus 7 gdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfvdsiklv 86 (160) =.|+-+..++..|+.-.....+.++.+-.+-++.|.+..... .|+ .|.. -.+|...-+||+.- T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~---a~~------------~~~~--p~~TG~Lr~sI~~~ 63 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLK---ARE------------VMNK--GYWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---ccc------------cCCC--CCCchhhhhcceee Confidence 345444455555544333344444444444444444333211 111 1111 13567777887642 Q ss_pred hcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 87 YEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 87 yeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiie 156 (160) . .+|..+-||. + .+-|-|+||||+++|||-=|-..|....-++. +++-.++. T Consensus 64 --~--~g~~~~~v~~---~-------~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~----~~i~~~~k 115 (115) T protein:vir:96 64 --K--TGDLQYTITS---H-------AAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTV----EELKALFE 115 (115) T ss_pred --e--cCceEEEeec---C-------ccchhhhcccccccCCCCchhhhHHHHHHHHH----HHHHHHhC Confidence 1 2233333332 2 24688999999999999656666654443333 33333333 No 32 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=95.56 E-value=0.00012 Score=42.13 Aligned_cols=115 Identities=17% Similarity=0.202 Sum_probs=56.7 Q ss_pred cCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhhhhhhhh Q lcl|NC_019515. 7 GDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLV 86 (160) Q Consensus 7 gdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfvdsiklv 86 (160) =.|+-+..++..|+.-.....+.++.+-.+-++.|.+..... .|+ .|.. -.+|...-+||+.- T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~---a~~------------~~~~--p~~TG~Lr~sI~~~ 63 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLK---ARE------------VMNK--GYWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---ccc------------cCCC--CCCchhhhhcceee Confidence 345444455555544333344444444444444444333211 111 1111 13567777887642 Q ss_pred hcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 87 YEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 87 yeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiie 156 (160) . .+|..+-||. + .+-|-|+||||+++|||-=|-..|....-++. +++-.++. T Consensus 64 --~--~g~~~~~v~~---~-------~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~----~~i~~~~k 115 (115) T protein:vir:96 64 --K--TGDLQYTITS---H-------AAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTV----EELKALFE 115 (115) T ss_pred --e--cCceEEEeec---C-------ccchhhhcccccccCCCCchhhhHHHHHHHHH----HHHHHHhC Confidence 1 2233333332 2 24688999999999999656666654443333 33333333 No 33 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=95.56 E-value=0.00012 Score=42.13 Aligned_cols=115 Identities=17% Similarity=0.202 Sum_probs=56.7 Q ss_pred cCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhhhhhhhh Q lcl|NC_019515. 7 GDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLV 86 (160) Q Consensus 7 gdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfvdsiklv 86 (160) =.|+-+..++..|+.-.....+.++.+-.+-++.|.+..... .|+ .|.. -.+|...-+||+.- T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~---a~~------------~~~~--p~~TG~Lr~sI~~~ 63 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLK---ARE------------VMNK--GYWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---ccc------------cCCC--CCCchhhhhcceee Confidence 345444455555544333344444444444444444333211 111 1111 13567777887642 Q ss_pred hcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 87 YEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 87 yeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiie 156 (160) . .+|..+-||. + .+-|-|+||||+++|||-=|-..|....-++. +++-.++. T Consensus 64 --~--~g~~~~~v~~---~-------~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~----~~i~~~~k 115 (115) T protein:vir:93 64 --K--TGDLQYTITS---H-------AAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTV----EELKALFE 115 (115) T ss_pred --e--cCceEEEeec---C-------ccchhhhcccccccCCCCchhhhHHHHHHHHH----HHHHHHhC Confidence 1 2233333332 2 24688999999999999656666654443333 33333333 No 34 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=95.56 E-value=0.00012 Score=42.13 Aligned_cols=115 Identities=17% Similarity=0.202 Sum_probs=56.7 Q ss_pred cCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhhhhhhhh Q lcl|NC_019515. 7 GDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLV 86 (160) Q Consensus 7 gdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfvdsiklv 86 (160) =.|+-+..++..|+.-.....+.++.+-.+-++.|.+..... .|+ .|.. -.+|...-+||+.- T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~---a~~------------~~~~--p~~TG~Lr~sI~~~ 63 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLK---ARE------------VMNK--GYWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---ccc------------cCCC--CCCchhhhhcceee Confidence 345444455555544333344444444444444444333211 111 1111 13567777887642 Q ss_pred hcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 87 YEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 87 yeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiie 156 (160) . .+|..+-||. + .+-|-|+||||+++|||-=|-..|....-++. +++-.++. T Consensus 64 --~--~g~~~~~v~~---~-------~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~----~~i~~~~k 115 (115) T protein:vir:78 64 --K--TGDLQYTITS---H-------AAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTV----EELKALFE 115 (115) T ss_pred --e--cCceEEEeec---C-------ccchhhhcccccccCCCCchhhhHHHHHHHHH----HHHHHHhC Confidence 1 2233333332 2 24688999999999999656666654443333 33333333 No 35 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=95.56 E-value=0.00012 Score=42.13 Aligned_cols=115 Identities=17% Similarity=0.202 Sum_probs=56.7 Q ss_pred cCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhhhhhhhh Q lcl|NC_019515. 7 GDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLV 86 (160) Q Consensus 7 gdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfvdsiklv 86 (160) =.|+-+..++..|+.-.....+.++.+-.+-++.|.+..... .|+ .|.. -.+|...-+||+.- T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~---a~~------------~~~~--p~~TG~Lr~sI~~~ 63 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLK---ARE------------VMNK--GYWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---ccc------------cCCC--CCCchhhhhcceee Confidence 345444455555544333344444444444444444333211 111 1111 13567777887642 Q ss_pred hcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 87 YEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 87 yeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiie 156 (160) . .+|..+-||. + .+-|-|+||||+++|||-=|-..|....-++. +++-.++. T Consensus 64 --~--~g~~~~~v~~---~-------~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~----~~i~~~~k 115 (115) T protein:vir:97 64 --K--TGDLQYTITS---H-------AAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTV----EELKALFE 115 (115) T ss_pred --e--cCceEEEeec---C-------ccchhhhcccccccCCCCchhhhHHHHHHHHH----HHHHHHhC Confidence 1 2233333332 2 24688999999999999656666654443333 33333333 No 36 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=95.56 E-value=0.00011 Score=42.21 Aligned_cols=112 Identities=22% Similarity=0.283 Sum_probs=59.1 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |++=-+-.-|++.+-|.++.. ......+++.-+.++++.+.+... +++ ++ +|-..- T Consensus 1 Ma~i~i~Gld~L~~~l~~~~~-~~~v~~~v~~~~~~~~~~~~~~a~------------~~a-----pv------dTG~Lr 56 (112) T protein:vir:96 1 MATIEFEGLDEMAQSLLKNAS-SERRSKVLRKYGAKLKEAAVSKAQ------------FKK-----GY------STGATR 56 (112) T ss_pred CceeeehHHHHHHHHHHhhcC-HHHHHHHHHHHHHHHHHHHHHHhh------------hcC-----CC------Cchhhh Confidence 888444445666666655432 123345555555555554443221 111 12 344555 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHH Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLL 152 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrll 152 (160) +||+. + ..|.-+.||.. .+-|-|+||||+++||+-=|...|....-.+.+++ .||- T Consensus 57 ~sI~~---~--~~~~~~~v~~~----------~~Ya~~vE~GTr~m~AqPF~~PA~~~~~~~~~~~l-~~L~ 112 (112) T protein:vir:96 57 RSITL---E--AGSDRAVVEAL----------TNYSGYLEVGTRKMEAQPFMRPALDQVVPEMVEEM-AKWE 112 (112) T ss_pred hceee---e--cCceEEEecCC----------CCccceeccCccccCCCCchhhhHHHHHHHHHHHH-HhcC Confidence 66753 2 22333333321 24678999999999999666677766555544443 2333 No 37 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=95.48 E-value=0.00012 Score=42.08 Aligned_cols=115 Identities=17% Similarity=0.251 Sum_probs=56.4 Q ss_pred cCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhhhhhhhh Q lcl|NC_019515. 7 GDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLV 86 (160) Q Consensus 7 gdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfvdsiklv 86 (160) =.|+-+..++..|++......+.++.+-.+.++.|.+.+... .|+.. ..++ +|...-+||+.. T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~---a~~~~--------~~p~------~TG~Lr~SI~~~ 63 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLK---AREVM--------NKGY------WTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hcccc--------CCCC------cchhhhhceeee Confidence 344444444445544434444444444444444444333221 11100 0122 456667777543 Q ss_pred hcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 87 YEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 87 yeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiie 156 (160) ..+|..+-||.+ .+-|-|+||||+++||+-=|...|....-+ +-++|-.++. T Consensus 64 ----~~g~~~~~V~~~----------~~Ya~~vE~GT~~m~a~PFl~PA~~~~k~~----~~~~l~~~~k 115 (115) T protein:vir:99 64 ----KTVDLQYTITSH----------AAYSGFLEFGTRYMEAEPFMWPVYEVIRKS----TVEELKTLFE 115 (115) T ss_pred ----ecCcEEEEecCC----------ccccccccccccccCCCCcchhhHHHHHHH----HHHHHHHHhC Confidence 223455544432 235789999999999995556666544333 3344444444 No 38 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=95.41 E-value=0.00015 Score=41.61 Aligned_cols=108 Identities=16% Similarity=0.243 Sum_probs=57.9 Q ss_pred cCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhhhhhhhh Q lcl|NC_019515. 7 GDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLV 86 (160) Q Consensus 7 gdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfvdsiklv 86 (160) =+|+-+.+++.+|+.+. -.+.++.+-++.|..|.+.+.. -.| .+|-...+||+.- T Consensus 1 i~i~Gld~l~~~l~~~~--~~~~~~~al~~~a~~i~~~ak~---~aP--------------------v~TG~Lr~si~~~ 55 (108) T protein:vir:74 1 MKITGIDALQKKLRKNA--TLDDVKHVVKSNTASMNKNMQN---LAP--------------------VDTGNMKRSITSE 55 (108) T ss_pred CcchhHHHHHHHHHHhh--hHHHHHHHHHHHHHHHHHHHHH---hCC--------------------CCchhhhccceee Confidence 56777777777776543 2344455555556655554422 112 1344556677643 Q ss_pred hcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 87 YEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 87 yeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllaiie 156 (160) +. .+|.-+-||. ..+-|-|+||||+++||+-=+-..|...+.++.++ |-.++. T Consensus 56 ~~---~~~~~~~V~~----------~~~Ya~~vE~GT~km~aqpf~~pa~~~~~~~~~~~----i~~~~k 108 (108) T protein:vir:74 56 FT---DGGLSGTTGP----------HTDYAGYVEYGTRFQSAQPFVKPAFNIQKKVFTND----LERLTK 108 (108) T ss_pred ee---cCceEEEeec----------CCCcccceeccccccCCCcchhhHHHHHHHHHHHH----HHHHcC Confidence 22 2333333331 12468999999999999855555555444443333 333333 No 39 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=95.36 E-value=0.00019 Score=40.99 Aligned_cols=114 Identities=13% Similarity=0.212 Sum_probs=54.6 Q ss_pred cCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhhhhhhhh Q lcl|NC_019515. 7 GDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLV 86 (160) Q Consensus 7 gdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfvdsiklv 86 (160) =.|+-+..+...|++-.....+.++.+-.+-++.|.+.... ..|+.. ..+ .+|...-+||+.- T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~---~a~~~~--------~~p------v~TG~Lr~sI~~~ 63 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVS---NAKEVM--------NKG------YWTGNLASLIEVK 63 (115) T ss_pred CeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hhcccc--------CCC------Ccchhhhhceeee Confidence 34544555555554433333333333333333344333321 122110 012 2466677787642 Q ss_pred hcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchh-hhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 87 YEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFH-KSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 87 yeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfh-kswammeheimeevsqrllaiie 156 (160) ..+|.-+-++.+ .+-|-|+|||||++||| ||- ..|... -+.+-++|-.+++ T Consensus 64 ----~~g~~~~~v~~~----------~~Ya~~vEfGT~km~a~-PFl~PA~~~~----k~~~~~~i~~~i~ 115 (115) T protein:vir:10 64 ----KIGDLHYRVIST----------AHYSGFLEFGTRYMEPA-PFMFPTYQTL----KKSTINDLKRLLS 115 (115) T ss_pred ----ecCcEEEEeeCC----------CccchheecccccCCCC-CchhhhHHHH----HHHHHHHHHHHhC Confidence 223332333222 23678999999999999 664 444433 3344455555666 No 40 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=95.35 E-value=0.00032 Score=39.71 Aligned_cols=140 Identities=21% Similarity=0.268 Sum_probs=92.6 Q ss_pred CCcc--cccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccc-eeehhhh Q lcl|NC_019515. 1 MTSE--LYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDS-RILIRTH 77 (160) Q Consensus 1 mtse--lygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegyds-rilirth 77 (160) |+-. +=.|++.+.+.|+.|..+--.-.++.+.+|+.+.+.+++-++-+---.+.|...|++.|-..|+.. .||++|. T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~~tg 80 (155) T protein:vir:99 1 MTTRIDVELDDQEVRQRLALLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQVTN 80 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcchhch Confidence 5542 345778888888888766655678888899888888888775443346788899999998888765 5999999 Q ss_pred hhhhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCC-------Cccc--cchhhhHHHHHHHHHHHHH Q lcl|NC_019515. 78 KFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSK-------QPAR--MPFHKSWAMMEHEIMEEVS 148 (160) Q Consensus 78 kfvdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtsk-------qpar--mpfhkswammeheimeevs 148 (160) ...+||+..+. ++++.| |.+ ...|..-+||+.. -||| |++-.. .++-.|.- T Consensus 81 ~L~~Si~~~~~---~~~v~v--Gtn----------~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~-----~~l~~e~~ 140 (155) T protein:vir:99 81 ALARSVTTWAD---RNEAGI--GSN----------LVYAAIHQFGGDAGRGHQVEIPARRYLPFDEN-----GQLAAGAR 140 (155) T ss_pred hhhhhhhceec---CCEEEE--ecC----------ccchhhhhcccccCCCCccccCCccccCCCCc-----cccchHHH Confidence 99999885543 445444 321 2468889999873 4665 222221 12223444 Q ss_pred HHHHHHhhcccC Q lcl|NC_019515. 149 QRLLAIIEGDLK 160 (160) Q Consensus 149 qrllaiiegdlk 160 (160) +.++.+|+--|+ T Consensus 141 ~~I~~~i~~~l~ 152 (155) T protein:vir:99 141 QSILEIVLTALS 152 (155) T ss_pred HHHHHHHHHHHh Confidence 444555555555 No 41 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=94.74 E-value=0.0022 Score=35.13 Aligned_cols=142 Identities=20% Similarity=0.214 Sum_probs=90.3 Q ss_pred CCc--ccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhccc-ccCCcccchhhhhhhhhcc---------- Q lcl|NC_019515. 1 MTS--ELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQ-EIDMPALDDEYLADKVSEG---------- 67 (160) Q Consensus 1 mts--elygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegq-eidmpalddeyladkvseg---------- 67 (160) |+. ++=-|++.+.+-|+.|...--.-..+.+.+|+.+....++-++.| .-|-++|..-+++.+.-.| T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~~ 80 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccch Confidence 553 344567888888887766554455788888888888888766544 3578889999988776544 Q ss_pred -------ccceeehhhhhhhhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCC-------CCccccchh Q lcl|NC_019515. 68 -------YDSRILIRTHKFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTS-------KQPARMPFH 133 (160) Q Consensus 68 -------ydsrilirthkfvdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgts-------kqparmpfh 133 (160) ....||++|....+||.-.+.. ++.. ||. . ..-|-+-.||+. +-||| ||- T Consensus 81 ~~~~~~~~~~~~L~~tG~L~~Si~~~~~~---~~v~--vGt----n------~~YAaiHqfGg~~~~~~~v~IPAR-PfL 144 (175) T protein:vir:79 81 AAASRRKAGLMILQDSGQMAASTATDSGE---DYSV--IGS----N------KEYAAIQHFGGQAGRGLKVTIPGR-AWL 144 (175) T ss_pred hhHhhhccCCCcceechhhhhhhhheecC---CEEE--Eec----C------cchhhHhhcccccCCCcccccCcc-ccc Confidence 3577999999999999976643 3433 332 2 257888899975 57888 442 Q ss_pred hhHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_019515. 134 KSWAMMEHEIMEEVSQRLLAIIEGDLK 160 (160) Q Consensus 134 kswammeheimeevsqrllaiiegdlk 160 (160) --= -+.|+-.|+.+.++.+|+--|+ T Consensus 145 G~s--~~de~~~~~~~~I~~~i~~~l~ 169 (175) T protein:vir:79 145 PVT--ADGELQPEAVEPVLNTILRHLM 169 (175) T ss_pred CCC--cccchhHHHHHHHHHHHHHHHH Confidence 110 0122223333334444433333 No 42 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=94.19 E-value=0.00093 Score=37.20 Aligned_cols=141 Identities=18% Similarity=0.212 Sum_probs=96.3 Q ss_pred CC--cccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccc-eeehhhh Q lcl|NC_019515. 1 MT--SELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDS-RILIRTH 77 (160) Q Consensus 1 mt--selygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegyds-rilirth 77 (160) |. -++-.|+..+.+.|+.|.+.--.-..+.+.+|..+...+++-++-+----|.|..-|++.+...|+.+ ++|+.|. T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~~~~~~L~~tG 80 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAAVTDTLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGRGAHPILQVTN 80 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccCCCCCccccch Confidence 55 24567888899999888766544567888888888888888776443346788999999998888866 6999999 Q ss_pred hhhhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCC-------Cccccchhh-hHHHHHHHHHHHHHH Q lcl|NC_019515. 78 KFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSK-------QPARMPFHK-SWAMMEHEIMEEVSQ 149 (160) Q Consensus 78 kfvdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtsk-------qparmpfhk-swammeheimeevsq 149 (160) ...+||+-.+ .+|+++| |.+ ...|..-+||+.. -||| ||-- | -..|+-.|+.+ T Consensus 81 ~L~~Si~~~~---~~~~v~v--Gtn----------~~YA~iHqfGg~~~~~~~~~iPAR-PfLG~s---~~~e~~~ei~~ 141 (155) T protein:vir:10 81 ALARSITTRA---DRDQAQI--GSN----------LSYAAIQQLGGQAGRGRKVTIPAR-PYLPVL---RNGQLKPSARD 141 (155) T ss_pred hhhhhhhcee---cCCEEEE--ecC----------cchhhhhhcccccCCCCccccCCc-cccCCC---ccccchHHHHH Confidence 9999988554 3455544 321 2468888999753 4666 3321 1 02244455555 Q ss_pred HHHHHhhcccC Q lcl|NC_019515. 150 RLLAIIEGDLK 160 (160) Q Consensus 150 rllaiiegdlk 160 (160) .++.++..-|+ T Consensus 142 ~I~~~i~~~l~ 152 (155) T protein:vir:10 142 AVLDVLLAALS 152 (155) T ss_pred HHHHHHHHHHh Confidence 66666665565 No 43 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=94.11 E-value=0.0015 Score=36.01 Aligned_cols=90 Identities=17% Similarity=0.160 Sum_probs=69.6 Q ss_pred cceeehhhhhhhhhhhhhhcccCCCeEEEEEeccC-CcccccccHHHHHHHHhhcCCC--CccccchhhhHHHHHHHHHH Q lcl|NC_019515. 69 DSRILIRTHKFVDSIKLVYEEDRGDGIMVFIGVDG-GTTDTGLSMQELADFIEFGTSK--QPARMPFHKSWAMMEHEIME 145 (160) Q Consensus 69 dsrilirthkfvdsiklvyeedrgdgimvfigvdg-gttdtglsmqeladfiefgtsk--qparmpfhkswammeheime 145 (160) =|..+-+.-++.+.++-++.+..+ --|.+|+-. .+-+.|.++..+|-.-|||+.. -|+|-=+...++.-..++.+ T Consensus 1 M~~~i~~~~~~~~~L~~~lk~l~~--k~V~VGi~~~~~y~dG~~vA~Ia~~~E~G~p~~~IP~RPFlr~t~~~~~~~~~~ 78 (189) T protein:vir:10 1 MGRVIRKQGPARVKLNAFIKGMND--YSVRIGWFSTAKYPDGTPTAYVASIHEFGAPSRGIPARSFIRPTIAAQQAAWSQ 78 (189) T ss_pred CcceeccCcHHHHHHHHHHHHhhC--CeEEEEecCCCCCCCcccHHHHHHHHHhcCcCCCCCCchhhhHHHHHHHHHHHH Confidence 334444466777777777766543 456667654 4557899999999999999954 69997778888888899999 Q ss_pred HHHHHHHHHhhcccC Q lcl|NC_019515. 146 EVSQRLLAIIEGDLK 160 (160) Q Consensus 146 evsqrllaiiegdlk 160 (160) .+.+.+.+++.|+.. T Consensus 79 ~l~~~~~~vl~G~~~ 93 (189) T protein:vir:10 79 QMRFYAKQIVVGQMN 93 (189) T ss_pred HHHHHHHHHHhCCCC Confidence 999999999999877 No 44 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=94.09 E-value=0.0007 Score=37.87 Aligned_cols=124 Identities=15% Similarity=0.241 Sum_probs=59.7 Q ss_pred CCcccccCHHHHHHHHhhhccC-CcchHHHHHHHHHHHHHHHHHHh-cccccCCcccchhhhhhhhhccccceeehhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDN-EPEYDDVIRSVGQKIAEKIREMI-EGQEIDMPALDDEYLADKVSEGYDSRILIRTHK 78 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdn-epeyddvirsvgqkiaekiremi-egqeidmpalddeyladkvsegydsrilirthk 78 (160) |.+.-==|+.-|-++..+|+.- ..+...+++.+..++|..+.+.+ ....+ +|-+ T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPV------------------------dTG~ 56 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPV------------------------DTGF 56 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCC------------------------cchh Confidence 5554333444444444555432 33566677777777766655433 22222 2333 Q ss_pred hhhhhhhh-----hc-ccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHH Q lcl|NC_019515. 79 FVDSIKLV-----YE-EDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLL 152 (160) Q Consensus 79 fvdsiklv-----ye-edrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrll 152 (160) +-.|++.- .+ ...|++..|-|+ +.-+-|-|+|+|++.+|.+ ||-+---|++.. ++++...+- T Consensus 57 Lr~sw~~~~~~~~~~~~~~g~~~~v~v~----------n~~~YA~~VE~Ghr~~~~~-gfV~G~fml~~s-~~~~~~~~~ 124 (141) T protein:vir:79 57 LRQGWNGVAYARSLPVYKQGNNYIIEVV----------NPTEYASYVNFGHRTKDGK-GWVKGQHFLTIS-EMELQSQVD 124 (141) T ss_pred hcccccccccccccceeecCCeeEEEEe----------cCCcchhhhhcceeecCCc-ceeCCchhHHHH-HHHHHHHHH Confidence 33343210 11 113455544443 2236799999999999886 554443344332 223333333 Q ss_pred HHhhcccC Q lcl|NC_019515. 153 AIIEGDLK 160 (160) Q Consensus 153 aiiegdlk 160 (160) .+++.-|+ T Consensus 125 ~~~~~~l~ 132 (141) T protein:vir:79 125 KIIEKKLL 132 (141) T ss_pred HHHHHHHH Confidence 33333333 No 45 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=93.87 E-value=0.00067 Score=38.00 Aligned_cols=121 Identities=17% Similarity=0.193 Sum_probs=58.3 Q ss_pred CCcccccCHHHHHHHHhhhccCCc-chHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEP-EYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnep-eyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkf 79 (160) |.-+. +.+.+.+.|.+|..... .-....+..++-++|.+++-. |. +.++ + .. T Consensus 1 M~v~v--~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~a-------P~----------~~~~-------~-hl 53 (125) T protein:vir:98 1 MGARI--ESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNT-------PF----------ANTK-------K-HA 53 (125) T ss_pred CeeEe--eHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CC----------CCCC-------c-hh Confidence 65554 33556666666543211 112233334444444444332 32 1111 1 25 Q ss_pred hhhhhhh-hcccCCCeE-EEEEeccCCcccccccHHHHHHHHhhcCCCCccccch-hhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 80 VDSIKLV-YEEDRGDGI-MVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPF-HKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 80 vdsiklv-yeedrgdgi-mvfigvdggttdtglsmqeladfiefgtskqparmpf-hkswammeheimeevsqrllaiie 156 (160) .|+|++- -..++++|. .|-+|. +-+++. .+-|+|||||+||++ || -+.|.-.+-|+.+-+.+.|..+. T Consensus 54 ~d~I~vs~~k~~~~~g~~~v~VG~---~k~~~~----~a~F~E~GT~k~~a~-pF~~~a~~~~~~ev~~~~~~~lrk~~- 124 (125) T protein:vir:98 54 RDHIAVSNVKTDRHTSEKIVTIGY---AKGVSH----RIHATEFGTMYQKPQ-LFITKTEKQGKNKVLKTMLDTAKRLQ- 124 (125) T ss_pred hhheeecccccccccceEEEEecc---CCCCce----EEEeccCCccCCCCC-chhhHHHHHhHHHHHHHHHHHHHHHh- Confidence 5666542 123445544 344543 333442 467999999999999 55 56676666554433332222222 Q ss_pred cccC Q lcl|NC_019515. 157 GDLK 160 (160) Q Consensus 157 gdlk 160 (160) | T Consensus 125 ---k 125 (125) T protein:vir:98 125 ---K 125 (125) T ss_pred ---C Confidence 2 No 46 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=93.87 E-value=0.00067 Score=38.00 Aligned_cols=121 Identities=17% Similarity=0.193 Sum_probs=58.3 Q ss_pred CCcccccCHHHHHHHHhhhccCCc-chHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEP-EYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnep-eyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkf 79 (160) |.-+. +.+.+.+.|.+|..... .-....+..++-++|.+++-. |. +.++ + .. T Consensus 1 M~v~v--~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~a-------P~----------~~~~-------~-hl 53 (125) T protein:vir:79 1 MGARI--ESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNT-------PF----------ANTK-------K-HA 53 (125) T ss_pred CeeEe--eHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CC----------CCCC-------c-hh Confidence 65554 33556666666543211 112233334444444444332 32 1111 1 25 Q ss_pred hhhhhhh-hcccCCCeE-EEEEeccCCcccccccHHHHHHHHhhcCCCCccccch-hhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 80 VDSIKLV-YEEDRGDGI-MVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPF-HKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 80 vdsiklv-yeedrgdgi-mvfigvdggttdtglsmqeladfiefgtskqparmpf-hkswammeheimeevsqrllaiie 156 (160) .|+|++- -..++++|. .|-+|. +-+++. .+-|+|||||+||++ || -+.|.-.+-|+.+-+.+.|..+. T Consensus 54 ~d~I~vs~~k~~~~~g~~~v~VG~---~k~~~~----~a~F~E~GT~k~~a~-pF~~~a~~~~~~ev~~~~~~~lrk~~- 124 (125) T protein:vir:79 54 RDHIAVSNVKTDRHTSEKIVTIGY---AKGVSH----RIHATEFGTMYQKPQ-LFITKTEKQGKNKVLKTMLDTAKRLQ- 124 (125) T ss_pred hhheeecccccccccceEEEEecc---CCCCce----EEEeccCCccCCCCC-chhhHHHHHhHHHHHHHHHHHHHHHh- Confidence 5666542 123445544 344543 333442 467999999999999 55 56676666554433332222222 Q ss_pred cccC Q lcl|NC_019515. 157 GDLK 160 (160) Q Consensus 157 gdlk 160 (160) | T Consensus 125 ---k 125 (125) T protein:vir:79 125 ---K 125 (125) T ss_pred ---C Confidence 2 No 47 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=93.87 E-value=0.00067 Score=38.00 Aligned_cols=121 Identities=17% Similarity=0.193 Sum_probs=58.3 Q ss_pred CCcccccCHHHHHHHHhhhccCCc-chHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEP-EYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnep-eyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkf 79 (160) |.-+. +.+.+.+.|.+|..... .-....+..++-++|.+++-. |. +.++ + .. T Consensus 1 M~v~v--~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~a-------P~----------~~~~-------~-hl 53 (125) T protein:vir:47 1 MGARI--ESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNT-------PF----------ANTK-------K-HA 53 (125) T ss_pred CeeEe--eHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CC----------CCCC-------c-hh Confidence 65554 33556666666543211 112233334444444444332 32 1111 1 25 Q ss_pred hhhhhhh-hcccCCCeE-EEEEeccCCcccccccHHHHHHHHhhcCCCCccccch-hhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 80 VDSIKLV-YEEDRGDGI-MVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPF-HKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 80 vdsiklv-yeedrgdgi-mvfigvdggttdtglsmqeladfiefgtskqparmpf-hkswammeheimeevsqrllaiie 156 (160) .|+|++- -..++++|. .|-+|. +-+++. .+-|+|||||+||++ || -+.|.-.+-|+.+-+.+.|..+. T Consensus 54 ~d~I~vs~~k~~~~~g~~~v~VG~---~k~~~~----~a~F~E~GT~k~~a~-pF~~~a~~~~~~ev~~~~~~~lrk~~- 124 (125) T protein:vir:47 54 RDHIAVSNVKTDRHTSEKIVTIGY---AKGVSH----RIHATEFGTMYQKPQ-LFITKTEKQGKNKVLKTMLDTAKRLQ- 124 (125) T ss_pred hhheeecccccccccceEEEEecc---CCCCce----EEEeccCCccCCCCC-chhhHHHHHhHHHHHHHHHHHHHHHh- Confidence 5666542 123445544 344543 333442 467999999999999 55 56676666554433332222222 Q ss_pred cccC Q lcl|NC_019515. 157 GDLK 160 (160) Q Consensus 157 gdlk 160 (160) | T Consensus 125 ---k 125 (125) T protein:vir:47 125 ---K 125 (125) T ss_pred ---C Confidence 2 No 48 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=93.87 E-value=0.00067 Score=38.00 Aligned_cols=121 Identities=17% Similarity=0.193 Sum_probs=58.3 Q ss_pred CCcccccCHHHHHHHHhhhccCCc-chHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEP-EYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnep-eyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkf 79 (160) |.-+. +.+.+.+.|.+|..... .-....+..++-++|.+++-. |. +.++ + .. T Consensus 1 M~v~v--~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~a-------P~----------~~~~-------~-hl 53 (125) T protein:vir:81 1 MGARI--ESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNT-------PF----------ANTK-------K-HA 53 (125) T ss_pred CeeEe--eHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CC----------CCCC-------c-hh Confidence 65554 33556666666543211 112233334444444444332 32 1111 1 25 Q ss_pred hhhhhhh-hcccCCCeE-EEEEeccCCcccccccHHHHHHHHhhcCCCCccccch-hhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 80 VDSIKLV-YEEDRGDGI-MVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPF-HKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 80 vdsiklv-yeedrgdgi-mvfigvdggttdtglsmqeladfiefgtskqparmpf-hkswammeheimeevsqrllaiie 156 (160) .|+|++- -..++++|. .|-+|. +-+++. .+-|+|||||+||++ || -+.|.-.+-|+.+-+.+.|..+. T Consensus 54 ~d~I~vs~~k~~~~~g~~~v~VG~---~k~~~~----~a~F~E~GT~k~~a~-pF~~~a~~~~~~ev~~~~~~~lrk~~- 124 (125) T protein:vir:81 54 RDHIAVSNVKTDRHTSEKIVTIGY---AKGVSH----RIHATEFGTMYQKPQ-LFITKTEKQGKNKVLKTMLDTAKRLQ- 124 (125) T ss_pred hhheeecccccccccceEEEEecc---CCCCce----EEEeccCCccCCCCC-chhhHHHHHhHHHHHHHHHHHHHHHh- Confidence 5666542 123445544 344543 333442 467999999999999 55 56676666554433332222222 Q ss_pred cccC Q lcl|NC_019515. 157 GDLK 160 (160) Q Consensus 157 gdlk 160 (160) | T Consensus 125 ---k 125 (125) T protein:vir:81 125 ---K 125 (125) T ss_pred ---C Confidence 2 No 49 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=93.87 E-value=0.00067 Score=38.00 Aligned_cols=121 Identities=17% Similarity=0.193 Sum_probs=58.3 Q ss_pred CCcccccCHHHHHHHHhhhccCCc-chHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEP-EYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnep-eyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkf 79 (160) |.-+. +.+.+.+.|.+|..... .-....+..++-++|.+++-. |. +.++ + .. T Consensus 1 M~v~v--~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~a-------P~----------~~~~-------~-hl 53 (125) T protein:vir:94 1 MGARI--ESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNT-------PF----------ANTK-------K-HA 53 (125) T ss_pred CeeEe--eHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CC----------CCCC-------c-hh Confidence 65554 33556666666543211 112233334444444444332 32 1111 1 25 Q ss_pred hhhhhhh-hcccCCCeE-EEEEeccCCcccccccHHHHHHHHhhcCCCCccccch-hhhHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019515. 80 VDSIKLV-YEEDRGDGI-MVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPF-HKSWAMMEHEIMEEVSQRLLAIIE 156 (160) Q Consensus 80 vdsiklv-yeedrgdgi-mvfigvdggttdtglsmqeladfiefgtskqparmpf-hkswammeheimeevsqrllaiie 156 (160) .|+|++- -..++++|. .|-+|. +-+++. .+-|+|||||+||++ || -+.|.-.+-|+.+-+.+.|..+. T Consensus 54 ~d~I~vs~~k~~~~~g~~~v~VG~---~k~~~~----~a~F~E~GT~k~~a~-pF~~~a~~~~~~ev~~~~~~~lrk~~- 124 (125) T protein:vir:94 54 RDHIAVSNVKTDRHTSEKIVTIGY---AKGVSH----RIHATEFGTMYQKPQ-LFITKTEKQGKNKVLKTMLDTAKRLQ- 124 (125) T ss_pred hhheeecccccccccceEEEEecc---CCCCce----EEEeccCCccCCCCC-chhhHHHHHhHHHHHHHHHHHHHHHh- Confidence 5666542 123445544 344543 333442 467999999999999 55 56676666554433332222222 Q ss_pred cccC Q lcl|NC_019515. 157 GDLK 160 (160) Q Consensus 157 gdlk 160 (160) | T Consensus 125 ---k 125 (125) T protein:vir:94 125 ---K 125 (125) T ss_pred ---C Confidence 2 No 50 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=93.74 E-value=0.0034 Score=34.13 Aligned_cols=143 Identities=17% Similarity=0.232 Sum_probs=93.6 Q ss_pred CC--cccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhccc-ccCCcccchhhhhhhhhcc---------- Q lcl|NC_019515. 1 MT--SELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQ-EIDMPALDDEYLADKVSEG---------- 67 (160) Q Consensus 1 mt--selygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegq-eidmpalddeyladkvseg---------- 67 (160) |+ -++=-|++.+.+.|+.|..+--.-..+.+.+|+.+....++-++-| .-|-+++.+-+.+.+-..| T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~~~ 80 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhhhh Confidence 55 3566688999999998876654445788999999988888877544 3577778887777654443 Q ss_pred -------ccceeehhhhhhhhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCC-------CCccc--cc Q lcl|NC_019515. 68 -------YDSRILIRTHKFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTS-------KQPAR--MP 131 (160) Q Consensus 68 -------ydsrilirthkfvdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgts-------kqpar--mp 131 (160) ...+||++|....+||.-.+. ++++. ||.+ ..-|-.-.||+. +-||| ++ T Consensus 81 ~~~~~~~~~~~~L~~tG~L~~Si~~~~~---~~~v~--vGtn----------~~YAaiHqfGg~~~~~~~v~iPaRpfLG 145 (175) T protein:vir:10 81 AAASRRKAGLMILQDSGQMAASVSTDHD---DNSAV--IGSN----------KEYAAIHQFGGQAGRGLKVTIPARPWLP 145 (175) T ss_pred hhhhhhccCCCcceechhhhhhhheeec---CCEEE--EecC----------hhhhhhhhcccccCCCCccccCCccccC Confidence 367799999999999996654 33333 3321 356888889987 56887 44 Q ss_pred hhhh-HHH--HHHHHHHHHHHHHHHHhhcc Q lcl|NC_019515. 132 FHKS-WAM--MEHEIMEEVSQRLLAIIEGD 158 (160) Q Consensus 132 fhks-wam--meheimeevsqrllaiiegd 158 (160) |... ++. ...+|++.+...|-..+.+- T Consensus 146 ~s~~d~~~~e~~~~Il~~~~~~l~~~~~~~ 175 (175) T protein:vir:10 146 VTADGELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred CCcccccchHHHHHHHHHHHHHHHHHhccC Confidence 4322 211 12445555555444444333 No 51 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=93.58 E-value=0.0014 Score=36.31 Aligned_cols=140 Identities=20% Similarity=0.280 Sum_probs=93.0 Q ss_pred CCc--ccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccc-eeehhhh Q lcl|NC_019515. 1 MTS--ELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDS-RILIRTH 77 (160) Q Consensus 1 mts--elygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegyds-rilirth 77 (160) |.- ++=.|++.+.+.|+.|...--.-..+.+.+|..+.+.+++-++-+--.-+.|.+.|++.+-..|+.. +||.+|. T Consensus 1 M~~~i~i~~d~~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~~tG 80 (155) T protein:vir:79 1 MTTRIDVELDDQEVRQRLAVLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQVTN 80 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccccch Confidence 443 2344678888888888765545567888899888888888886444456789999999998888744 6999999 Q ss_pred hhhhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCC-------Cccc--cchhhhHHHHHHHHHHHHH Q lcl|NC_019515. 78 KFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSK-------QPAR--MPFHKSWAMMEHEIMEEVS 148 (160) Q Consensus 78 kfvdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtsk-------qpar--mpfhkswammeheimeevs 148 (160) ...+||.-.+. ++++.| | |. ...|..-+||+.. -||| +++... .|+..|+- T Consensus 81 ~L~~Si~~~~~---~~~v~v--G----t~------~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~-----~~l~~~~~ 140 (155) T protein:vir:79 81 ALARSVTTWAD---RNEAGI--G----SN------LVYAAIHQFGGDAGRGHQVEIPARRYLPFDEN-----GQLAAGAR 140 (155) T ss_pred hhhhhhhceec---CCEEEE--e----cC------chhhhhhhcccccCCCCccccCCccccCCCCc-----cccchHHH Confidence 99999874432 344444 3 21 3578889999864 4665 222211 12333444 Q ss_pred HHHHHHhhcccC Q lcl|NC_019515. 149 QRLLAIIEGDLK 160 (160) Q Consensus 149 qrllaiiegdlk 160 (160) +.++.+|+--|+ T Consensus 141 ~~I~~~i~~~l~ 152 (155) T protein:vir:79 141 QSILEVVLTALS 152 (155) T ss_pred HHHHHHHHHHHH Confidence 555556655555 No 52 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=93.40 E-value=0.0068 Score=32.47 Aligned_cols=133 Identities=21% Similarity=0.272 Sum_probs=77.5 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcc-cccC---Ccccchhhhhhhhhccccceeehhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEG-QEID---MPALDDEYLADKVSEGYDSRILIRT 76 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremieg-qeid---mpalddeyladkvsegydsrilirt 76 (160) |- +.=.++++ .+..|.. ...+.++.++..+.+.+.+.++. +.-| -+.|.+.|++.|-+ +.+|++| T Consensus 1 ~i-~~~~~i~~---~l~~l~~---~~~~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~----~~~L~~t 69 (145) T protein:vir:31 1 MV-EDENNIPE---AREAIQD---GLTDGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGS----DTPLIDN 69 (145) T ss_pred Cc-ccHHHHHH---HHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcC----CCCCccC Confidence 32 22233443 3333322 23456777788888777777653 3322 56888999988743 5799999 Q ss_pred hhhhhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCC--Cccccch-hhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019515. 77 HKFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSK--QPARMPF-HKSWAMMEHEIMEEVSQRLLA 153 (160) Q Consensus 77 hkfvdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtsk--qparmpf-hkswammeheimeevsqrlla 153 (160) ....+||.-...-+.. +-.+.|| |. ..-|-+-+||+.| .||| || +-+=.--+.++ ...+.. T Consensus 70 G~L~~Si~~~~~~~~~-~~~a~vG----tn------~~YA~~hqfG~~~~~IPaR-PfLG~~~~~~~~~~----~~ii~~ 133 (145) T protein:vir:31 70 SRLLTDINAASMMDRA-NRMAVIG----TN------LDYAEHHEFGAPEAGIPAR-PIFGPAGAYASQQA----PDVIGD 133 (145) T ss_pred HHHHHHHHHHhhhccc-CceeEec----CC------chhhhhhccCCcccccCCC-CccCCCccchHHHH----HHHHHH Confidence 9999999876554432 2222333 22 2578899999987 9999 55 33322222222 223334 Q ss_pred HhhcccC Q lcl|NC_019515. 154 IIEGDLK 160 (160) Q Consensus 154 iiegdlk 160 (160) +|+..|| T Consensus 134 ~i~~~L~ 140 (145) T protein:vir:31 134 EIDTNLE 140 (145) T ss_pred HHHHHhh Confidence 4444555 No 53 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=93.12 E-value=0.0016 Score=35.88 Aligned_cols=95 Identities=18% Similarity=0.242 Sum_probs=68.2 Q ss_pred hhccccceeehhhhhhhhhhhhhhcccCCCeEEEEEeccCCcc-------cccccHHHHHHHHhhcCCC----------- Q lcl|NC_019515. 64 VSEGYDSRILIRTHKFVDSIKLVYEEDRGDGIMVFIGVDGGTT-------DTGLSMQELADFIEFGTSK----------- 125 (160) Q Consensus 64 vsegydsrilirthkfvdsiklvyeedrgdgimvfigvdggtt-------dtglsmqeladfiefgtsk----------- 125 (160) ...|....+-++--+-++.+.--. +.-++-.|-+|+-.+.+ +.|++++.+|-.-|||+.- T Consensus 1 ~~~~~~~~~k~~~~~~~~~~~~~l--~~l~~~~v~vGi~~~~~y~~~~~~~dG~~va~IA~~~EfG~~i~~p~~~~~~~~ 78 (200) T protein:vir:99 1 MKKGFSKSNSVAAPLKHFQMLKQF--DALKGKTVQAGWFETDRYPAKEGETIGPLVAKIARQLEFGGVINHPGGTKYIKD 78 (200) T ss_pred CCcCcceeeeeecchHHHHHHHHH--HHhhCCeEEEEEcCCCCcCCcccccccchHHHHHhHHHcCCeeccCCCcccccc Confidence 455666666665433222222111 22355688899965542 4689999999999999763 Q ss_pred ------------------------------CccccchhhhHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_019515. 126 ------------------------------QPARMPFHKSWAMMEHEIMEEVSQRLLAIIEGDLK 160 (160) Q Consensus 126 ------------------------------qparmpfhkswammeheimeevsqrllaiiegdlk 160 (160) -|+|--+...++.-..++.+.+.+.+.+++.|++. T Consensus 79 ~~~~g~~~g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~l~g~~~ 143 (200) T protein:vir:99 79 AIVDGRYVGTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQAQIARQLLDGTIN 143 (200) T ss_pred ccccccccccccccccccceeeeeccccccCCCcchhhHHHHHHHHHHHHHHHHHHHHHHhCCCC Confidence 38887788888888889999999999999999877 No 54 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=92.78 E-value=0.0015 Score=36.00 Aligned_cols=116 Identities=16% Similarity=0.188 Sum_probs=61.5 Q ss_pred cCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhhhhhhhh Q lcl|NC_019515. 7 GDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLV 86 (160) Q Consensus 7 gdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfvdsiklv 86 (160) =.|.-+.+++..|+.....-+++++..-.+.|+.|.+....- .| .+|-..-+||+.. T Consensus 1 i~i~Gld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~~ak~~---aP--------------------v~TG~Lr~sI~~~ 57 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDIDKNINATTEEAANFIEDRAKTL---AP--------------------KNFGKLAQSISTS 57 (173) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---CC--------------------cCchhhhhcceee Confidence 234444444444444444456777777777777777665431 23 2344555666543 Q ss_pred hcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCC----------------------------------------- Q lcl|NC_019515. 87 YEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSK----------------------------------------- 125 (160) Q Consensus 87 yeedrgdgimvfigvdggttdtglsmqeladfiefgtsk----------------------------------------- 125 (160) -.. .+.++- .+|. +-.+-+-|+||||++ T Consensus 58 ~~~-~~~~~~--~~v~--------~~~~Ya~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 126 (173) T protein:vir:10 58 DLK-AKDLIS--KKIT--------VNELYGAYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDE 126 (173) T ss_pred eec-cCceeE--EeeC--------CCcccchhhhcccccccCCCchhhhhhcccccccccccccccccccccccccccch Confidence 211 222321 1221 223568899999986 Q ss_pred --------------CccccchhhhHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_019515. 126 --------------QPARMPFHKSWAMMEHEIMEEVSQRLLAIIEGDLK 160 (160) Q Consensus 126 --------------qparmpfhkswammeheimeevsqrllaiiegdlk 160 (160) |||+-=|--+|-.++.++.+.+.++| ...|+ T Consensus 127 ~~~~~~~~~~~~~G~~aqPFl~PA~~~~~~~~~~~i~~~i----~~~lr 171 (173) T protein:vir:10 127 KAAYPIFAKILGAGINPQPFLYPAWIEGKKQYLKDLENLL----KTYNK 171 (173) T ss_pred hcccceeeEeecCCCCCCccchhHHHHhHHHHHHHHHHHH----HHHhh Confidence 44443345567777766665555544 44444 No 55 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=92.47 E-value=0.0027 Score=34.65 Aligned_cols=124 Identities=17% Similarity=0.225 Sum_probs=74.1 Q ss_pred CCcccccCHHHHH-HHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFA-QILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 1 mtselygdwdkfa-qilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkf 79 (160) |++= +-|.++ .|.+.|.+---+-.++++..-+++|+.+++.|.- .-| .||-++ T Consensus 1 Ma~i---~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~---~aP--------------------~rTG~y 54 (126) T protein:vir:81 1 MANI---TIDRLADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQA---LAP--------------------KRTGEY 54 (126) T ss_pred Cccc---chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---hCC--------------------cccchh Confidence 6653 334443 2445555433344555566666666666655432 112 268888 Q ss_pred hhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCC-ccccchhhhHHHHHHHHHHHHHHHHHHHhhcc Q lcl|NC_019515. 80 VDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQ-PARMPFHKSWAMMEHEIMEEVSQRLLAIIEGD 158 (160) Q Consensus 80 vdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskq-parmpfhkswammeheimeevsqrllaiiegd 158 (160) .+|++.-...+.|.+-.|..+-. --.|+-++|||+.+. .-|+|-..--+--|....+++.|++-.+|+|+ T Consensus 55 ~ksw~vk~~~~~g~~~~vv~~~~---------~~~l~HLLEfGha~r~gGrV~a~Phi~Pa~e~~~~~~~~~i~~~l~~g 125 (126) T protein:vir:81 55 ARTFTITKEDGYGTTKRIIWNKK---------HYRRVHLLEFGHAKVNGGRVKEYPHLRPAYDKHGARLPDELKRVIENG 125 (126) T ss_pred hccccccccccCCcceEEEeccC---------CCCceeeeecceecCCCCccCCCcchHHHHHHHHHHHHHHHHHHhhcC Confidence 88888776666665544444322 123567799999983 34454333344457778888888888888887 Q ss_pred c Q lcl|NC_019515. 159 L 159 (160) Q Consensus 159 l 159 (160) = T Consensus 126 g 126 (126) T protein:vir:81 126 G 126 (126) T ss_pred C Confidence 7 No 56 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=91.42 E-value=0.0055 Score=32.96 Aligned_cols=85 Identities=18% Similarity=0.335 Sum_probs=47.9 Q ss_pred CcccchhhhhhhhhccccceeehhhhhhhhhhhhhhcccCCCeEEEEEec-----cCCcccccccHHHHHHHHhhcCCCC Q lcl|NC_019515. 52 MPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLVYEEDRGDGIMVFIGV-----DGGTTDTGLSMQELADFIEFGTSKQ 126 (160) Q Consensus 52 mpalddeyladkvsegydsrilirthkfvdsiklvyeedrgdgimvfigv-----dggttdtglsmqeladfiefgtskq 126 (160) |+. ++.- ..-+ .+.+.-...+-. +--|-+|+ .+++-+.|+|+.++|-.-|||+... T Consensus 1 M~~--------~~k~--------~~~~-~~~l~~~l~~l~--~~~v~VGi~~~~~~~~~~~~g~~vA~ia~~~E~G~~~I 61 (148) T protein:vir:52 1 MAV--------TVTA--------NFSA-AKQLIEQMKSLK--EKAVYVGFPAEFDEKVKGSENFNLASLAAVLEFGNEHI 61 (148) T ss_pred Ccc--------cccc--------ccHH-HHHHHHHHHHhh--CCeEEEEeecCcCCCCCCCCCCCHHHHHHHHhcCCCCC Confidence 111 1111 0001 122222222222 34566666 4556778999999999999999999 Q ss_pred ccccchhhh-HHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_019515. 127 PARMPFHKS-WAMMEHEIMEEVSQRLLAIIEGDLK 160 (160) Q Consensus 127 parmpfhks-wammeheimeevsqrllaiiegdlk 160 (160) |+| ||-.. ++.-. ++..+.+-++++|.+. T Consensus 62 P~R-pflr~t~~~~~----~~~~~~~~~~~~~~~~ 91 (148) T protein:vir:52 62 PAR-PFLRQTLEENQ----EKYTALFIQWFDQGVP 91 (148) T ss_pred CCc-chhHHHHHHHH----HHHHHHHHHHHHcCCC Confidence 999 66543 33322 4445555566666666 No 57 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=91.35 E-value=0.002 Score=35.44 Aligned_cols=108 Identities=20% Similarity=0.267 Sum_probs=63.2 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |..-.+| .|+|.+.|.++.+ +..+.++....+.|+.+.+.+.. -.| .+|-..- T Consensus 1 Ma~~~~g-~~~l~~~l~~~~~---~~~~~~~~~~~~~a~~i~~~ak~---~aP--------------------vdTG~Lr 53 (137) T protein:vir:94 1 MAKVKYG-NWDLVKELENYER---DMERWVKRGIAKTTAKIHNTIIS---LMP--------------------VDTGYLR 53 (137) T ss_pred CchhHHh-HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH---hCC--------------------ccccchh Confidence 9888886 6678888887754 45666666666666666554432 122 2455666 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcC-----------------------------CCCccccc Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGT-----------------------------SKQPARMP 131 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgt-----------------------------skqparmp 131 (160) +||+.-++. +|.-+-||.. .+-|-|+|||| +.+||+-- T Consensus 54 ~SI~~~~~~---~~~~~~V~~~----------~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PF 120 (137) T protein:vir:94 54 ESVTMDFKD---SGFTGVINIG----------SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPF 120 (137) T ss_pred ccceeEeec---CceEEEEecC----------CCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcc Confidence 777754433 3333333321 24678999999 45677644 Q ss_pred hhhhHHHHHHHHHHHHH Q lcl|NC_019515. 132 FHKSWAMMEHEIMEEVS 148 (160) Q Consensus 132 fhkswammeheimeevs 148 (160) +-.++.-.+-.|...++ T Consensus 121 l~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 121 WEPAIDAGRAFFNKYFS 137 (137) T ss_pred hHHHHHHHHHHHHHhhC Confidence 44555555545544444 No 58 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=91.35 E-value=0.002 Score=35.44 Aligned_cols=108 Identities=20% Similarity=0.267 Sum_probs=63.2 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |..-.+| .|+|.+.|.++.+ +..+.++....+.|+.+.+.+.. -.| .+|-..- T Consensus 1 Ma~~~~g-~~~l~~~l~~~~~---~~~~~~~~~~~~~a~~i~~~ak~---~aP--------------------vdTG~Lr 53 (137) T protein:vir:93 1 MAKVKYG-NWDLVKELENYER---DMERWVKRGIAKTTAKIHNTIIS---LMP--------------------VDTGYLR 53 (137) T ss_pred CchhHHh-HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH---hCC--------------------ccccchh Confidence 9888886 6678888887754 45666666666666666554432 122 2455666 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcC-----------------------------CCCccccc Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGT-----------------------------SKQPARMP 131 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgt-----------------------------skqparmp 131 (160) +||+.-++. +|.-+-||.. .+-|-|+|||| +.+||+-- T Consensus 54 ~SI~~~~~~---~~~~~~V~~~----------~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PF 120 (137) T protein:vir:93 54 ESVTMDFKD---SGFTGVINIG----------SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPF 120 (137) T ss_pred ccceeEeec---CceEEEEecC----------CCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcc Confidence 777754433 3333333321 24678999999 45677644 Q ss_pred hhhhHHHHHHHHHHHHH Q lcl|NC_019515. 132 FHKSWAMMEHEIMEEVS 148 (160) Q Consensus 132 fhkswammeheimeevs 148 (160) +-.++.-.+-.|...++ T Consensus 121 l~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:93 121 WEPAIDAGRAFFNKYFS 137 (137) T ss_pred hHHHHHHHHHHHHHhhC Confidence 44555555545544444 No 59 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=91.35 E-value=0.002 Score=35.44 Aligned_cols=108 Identities=20% Similarity=0.267 Sum_probs=63.2 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |..-.+| .|+|.+.|.++.+ +..+.++....+.|+.+.+.+.. -.| .+|-..- T Consensus 1 Ma~~~~g-~~~l~~~l~~~~~---~~~~~~~~~~~~~a~~i~~~ak~---~aP--------------------vdTG~Lr 53 (137) T protein:vir:97 1 MAKVKYG-NWDLVKELENYER---DMERWVKRGIAKTTAKIHNTIIS---LMP--------------------VDTGYLR 53 (137) T ss_pred CchhHHh-HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH---hCC--------------------ccccchh Confidence 9888886 6678888887754 45666666666666666554432 122 2455666 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcC-----------------------------CCCccccc Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGT-----------------------------SKQPARMP 131 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgt-----------------------------skqparmp 131 (160) +||+.-++. +|.-+-||.. .+-|-|+|||| +.+||+-- T Consensus 54 ~SI~~~~~~---~~~~~~V~~~----------~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PF 120 (137) T protein:vir:97 54 ESVTMDFKD---SGFTGVINIG----------SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPF 120 (137) T ss_pred ccceeEeec---CceEEEEecC----------CCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcc Confidence 777754433 3333333321 24678999999 45677644 Q ss_pred hhhhHHHHHHHHHHHHH Q lcl|NC_019515. 132 FHKSWAMMEHEIMEEVS 148 (160) Q Consensus 132 fhkswammeheimeevs 148 (160) +-.++.-.+-.|...++ T Consensus 121 l~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:97 121 WEPAIDAGRAFFNKYFS 137 (137) T ss_pred hHHHHHHHHHHHHHhhC Confidence 44555555545544444 No 60 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=91.04 E-value=0.0019 Score=35.44 Aligned_cols=108 Identities=19% Similarity=0.263 Sum_probs=60.9 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |.+-.|| .|+|.+-|.++.+ +.++.++..-++.|+.+.+.... ..| .+|-..- T Consensus 13 Ma~v~~G-ld~l~~~l~~~~~---~~~~~~~~~l~~~a~~v~~~ak~---~aP--------------------vdTG~L~ 65 (149) T protein:vir:10 13 MAKVKYG-ADSMVVELDKFDK---KIEEWVKKGIAKTTTKIYNTAVA---LAP--------------------VDLGFLE 65 (149) T ss_pred hHHHHHH-HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH---hCC--------------------cccchhh Confidence 7665564 6777776665543 56667766666666666655421 123 2455666 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCC-----------------------------CCccccc Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTS-----------------------------KQPARMP 131 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgts-----------------------------kqparmp 131 (160) +||+... .++|+-.-||.+ .+-|-|+||||. ++|++-- T Consensus 66 ~SI~~~~---~~~g~~~~V~~~----------~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PF 132 (149) T protein:vir:10 66 ESIDFKY---FDGGLSSVISVG----------ADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPF 132 (149) T ss_pred ccceEEe---cCCcEEEEEecC----------CCcccccccCccccccCCcccccccccceeeccccceecCCCCCCCcc Confidence 7776433 334544444322 246789999984 3555533 Q ss_pred hhhhHHHHHHHHHHHHH Q lcl|NC_019515. 132 FHKSWAMMEHEIMEEVS 148 (160) Q Consensus 132 fhkswammeheimeevs 148 (160) +-.+|...+-+|.+.++ T Consensus 133 l~pA~~~~k~~i~~~i~ 149 (149) T protein:vir:10 133 WNPAIDAGRKTFEQYFS 149 (149) T ss_pred hhHHHHHHHHHHHHhhC Confidence 44555555554444443 No 61 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=90.90 E-value=0.0034 Score=34.12 Aligned_cols=108 Identities=16% Similarity=0.167 Sum_probs=64.5 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |.+-.+ ..|+|.+-|.++. .+.+.+++...++.|+.+.+.+.. -.| ++|-... T Consensus 1 Ma~~~~-Gl~~l~~~l~~~~---~~~~~~~~~al~~~a~~v~~~ak~---~ap--------------------vdTG~Lr 53 (135) T protein:vir:96 1 MAKVKY-GADSIVVDLEKYS---KDMEKWVKKGITKTTLKIYNTAIH---LMP--------------------VDTGFLR 53 (135) T ss_pred Cchhhh-hHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH---hCC--------------------ccchhhh Confidence 888767 5778887777765 356778888888888877765421 112 3566777 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCC---------------------------CCccccchh Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTS---------------------------KQPARMPFH 133 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgts---------------------------kqparmpfh 133 (160) +||+... .++|+..-||. -.+-|-|+||||. .+|++--|- T Consensus 54 ~SI~~~~---~~~g~~~~V~~----------~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~ 120 (135) T protein:vir:96 54 QSTTVDF---ENGGFTGVVKI----------GSNYAVYVNYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWE 120 (135) T ss_pred cceeEEe---ecCcEEEEEec----------CCCccchhhcccccccCCCccccccccccccCCcceeecCCcCCCcchh Confidence 7887543 34444444431 1247889999993 355554444 Q ss_pred hhHHHHHHHHHHHHH Q lcl|NC_019515. 134 KSWAMMEHEIMEEVS 148 (160) Q Consensus 134 kswammeheimeevs 148 (160) .++...+-+|.+.++ T Consensus 121 ~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 121 PAIDAGRQTFEQYFS 135 (135) T ss_pred HHHHHHHHHHHHhcC Confidence 555544444333332 No 62 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=89.25 E-value=0.004 Score=33.76 Aligned_cols=108 Identities=20% Similarity=0.255 Sum_probs=63.1 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |.+..+| ++.|.+-|.++.+ +..++++....+.|+.+.+.+.. ..| .+|-..- T Consensus 1 Ma~~~~G-~~~l~~~l~~~~~---~~~~~~~~~~~~~a~~v~~~ak~---~aP--------------------v~TG~L~ 53 (137) T protein:vir:95 1 MAKVKYG-NWDLVKELENYER---DMERWVKRGIAKTTAKIHNTIIS---LMP--------------------VDTGYLR 53 (137) T ss_pred CchhHHh-HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH---hCC--------------------ccchhhh Confidence 9988886 5678887777653 55667777777777766665421 112 1355556 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcC-----------------------------CCCccccc Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGT-----------------------------SKQPARMP 131 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgt-----------------------------skqparmp 131 (160) +||+...+. +|.-+-||. ..+-|-|+|||| +.+|++-- T Consensus 54 ~Si~~~~~~---~~~~~~V~~----------~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PF 120 (137) T protein:vir:95 54 ESVTMDFKD---GGFTGVINI----------GSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPF 120 (137) T ss_pred cCeeeEeeC---CceEEEEec----------CCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcc Confidence 677643332 233333331 124678999999 45677644 Q ss_pred hhhhHHHHHHHHHHHHH Q lcl|NC_019515. 132 FHKSWAMMEHEIMEEVS 148 (160) Q Consensus 132 fhkswammeheimeevs 148 (160) +-.+|.-.+-+|...+| T Consensus 121 l~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 121 WEPAIDAGRAFFNKYFS 137 (137) T ss_pred hHHHHHHHHHHHHHhhC Confidence 55566555555544444 No 63 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=88.62 E-value=0.0043 Score=33.59 Aligned_cols=108 Identities=19% Similarity=0.263 Sum_probs=58.9 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |.+-.|| .|+|.+-|.++.+ +.++.++..-++.|+.+.+.+.. -.| .+|-..- T Consensus 13 Ma~~~~G-ld~l~~~L~~~~~---~~~~~~~~al~~~a~~v~~~ak~---~aP--------------------vdTG~Lr 65 (149) T protein:vir:94 13 MAKVKYG-ADSMVVELDKFDK---KIEEWVKKGIAKTTTKIYNTAVA---LAP--------------------VDLGFLE 65 (149) T ss_pred HHHHHHH-HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH---hCC--------------------cccchhh Confidence 7776665 6777776665543 45666666666666666554421 122 1355556 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCC-----------------------------CCccccc Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTS-----------------------------KQPARMP 131 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgts-----------------------------kqparmp 131 (160) +||+... .++|+-.-||.+ .+-|-|+||||. .+|++-- T Consensus 66 ~SI~~~~---~~~g~~~~V~~~----------~~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PF 132 (149) T protein:vir:94 66 ESIDFKY---FDGGLSSVISVG----------ADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPF 132 (149) T ss_pred cCeeEEe---eCCcEEEEEecC----------CCcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcc Confidence 6776432 244544444321 246889999983 3556533 Q ss_pred hhhhHHHHHHHHHHHHH Q lcl|NC_019515. 132 FHKSWAMMEHEIMEEVS 148 (160) Q Consensus 132 fhkswammeheimeevs 148 (160) +-.+|...+-+|.+.++ T Consensus 133 l~pA~~~~~~~i~~~i~ 149 (149) T protein:vir:94 133 WNPAIDAGRKTFEQYFS 149 (149) T ss_pred hHHHHHHHHHHHHHhhC Confidence 44555544444433333 No 64 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=88.55 E-value=0.006 Score=32.77 Aligned_cols=114 Identities=23% Similarity=0.258 Sum_probs=64.7 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |+.- .|+.-+.++..+|+.....-.+.++...++.|+.+.+..... .| ++|.... T Consensus 4 ms~~--i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~---ap--------------------v~TG~Lr 58 (144) T protein:vir:59 4 MSVR--IDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASL---AP--------------------VDEGNLK 58 (144) T ss_pred ceee--ehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---CC--------------------ccchhhh Confidence 5432 355556666666665555556666666666666665544311 11 3578888 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCC---------------------------CCccccchh Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTS---------------------------KQPARMPFH 133 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgts---------------------------kqparmpfh 133 (160) +||+..+. ++|+-+-||.. .+-|-|+||||. .+|++--|. T Consensus 59 ~SI~~~~~---~~g~~~~V~~~----------~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~ 125 (144) T protein:vir:59 59 NSIQIDYK---NNGLTAEITVG----------AEYAIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFW 125 (144) T ss_pred cCeeEEee---cCcEEEEEecC----------CCccchhhcCccccccCCCccccccccccccccceecCCCCCCCcchh Confidence 89886542 34444444331 357889999983 466665556 Q ss_pred hhHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019515. 134 KSWAMMEHEIMEEVSQRLLAIIEG 157 (160) Q Consensus 134 kswammeheimeevsqrllaiieg 157 (160) .+|...+-.+++++ -. +-| T Consensus 126 pA~~~~~~~~~~~i----~~-~~g 144 (144) T protein:vir:59 126 PAVEEGGEYFEREM----RR-LRG 144 (144) T ss_pred HHHHHHHHHHHHHH----HH-hcC Confidence 67766555554432 22 233 No 65 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=88.00 E-value=0.0044 Score=33.50 Aligned_cols=108 Identities=19% Similarity=0.239 Sum_probs=55.7 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |.+-.+| +|+|.+-|.++.+ +.++.++..-++.|+.+.+.... ..| .+|-..- T Consensus 1 Ma~~~~G-l~~l~~~l~~~~~---~~~~~~~~al~~~a~~i~~~ak~---~aP--------------------vdTG~Lr 53 (137) T protein:vir:10 1 MAKVKYG-NWELVKELEDFEK---ETIRWAKKGIAKTTTIIHNSIVS---NMP--------------------VDTGYLR 53 (137) T ss_pred CchhHhh-HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH---hCC--------------------cCcchhh Confidence 8887775 5777776666643 44555555555555555443321 133 1455566 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCC-----------------------------Cccccc Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSK-----------------------------QPARMP 131 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtsk-----------------------------qparmp 131 (160) +||+...+. +|+-+.||.+ .+-|-|+||||.. +|++-- T Consensus 54 ~SI~~~~~~---~~~~~~V~~~----------~~Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PF 120 (137) T protein:vir:10 54 ESVSMDFKK---GGLTGVINIG----------SEYAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPF 120 (137) T ss_pred cCeeEEeeC---CcEEEEEecC----------CCcccccccCccccccCCCccccccccceeeccccceeccCCCCCCcc Confidence 777654432 3443344321 1357788998743 555533 Q ss_pred hhhhHHHHHHHHHHHHH Q lcl|NC_019515. 132 FHKSWAMMEHEIMEEVS 148 (160) Q Consensus 132 fhkswammeheimeevs 148 (160) +-.+|.-.+-+|...++ T Consensus 121 l~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 121 WEPAIDEGRAFFNKYFS 137 (137) T ss_pred hhHHHHHHHHHHHHhcC Confidence 34444433333333333 No 66 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=85.51 E-value=0.025 Score=29.35 Aligned_cols=114 Identities=19% Similarity=0.186 Sum_probs=58.8 Q ss_pred CCccccc-CHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 1 MTSELYG-DWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 1 mtselyg-dwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkf 79 (160) |.+--|- ++++|+..|..+.+ ..+.+++..-...|+.+..... ...| .+|-.. T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~---~~~~~~~~~l~~~a~~i~~~ak---~~aP--------------------v~TG~L 54 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALD---RLTGAAREATEAAANDMVNMAK---GLCP--------------------VDTGRL 54 (142) T ss_pred CceeEEEecHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHH---HhCC--------------------ccchhh Confidence 5554444 89999988877755 3445555555555555544321 0111 356777 Q ss_pred hhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCC---------------------------Cccccch Q lcl|NC_019515. 80 VDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSK---------------------------QPARMPF 132 (160) Q Consensus 80 vdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtsk---------------------------qparmpf 132 (160) -+||+... +..|.++.+-||. -.+-|-|.||||.. +|++--| T Consensus 55 r~SI~~~~-~~~g~~~~~~v~~----------~~~YA~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl 123 (142) T protein:vir:94 55 RSSIQAVP-SGGRFSFSVTIGT----------NVTYAADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFM 123 (142) T ss_pred hccceeee-ccCCceEEEEEec----------CcccchhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcch Confidence 78887543 3345555554432 12578899999964 2332223 Q ss_pred hhhHHHHHHHHHHHHHHHHH Q lcl|NC_019515. 133 HKSWAMMEHEIMEEVSQRLL 152 (160) Q Consensus 133 hkswammeheimeevsqrll 152 (160) -.++.-.+-+| ++.-++|- T Consensus 124 ~~A~~~~~~~i-~~~~~~~~ 142 (142) T protein:vir:94 124 RPAIAAASTFL-RNHAKGIR 142 (142) T ss_pred hHHHHHHHHHH-HHHHHhcC Confidence 33333322222 22222222 No 67 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=84.53 E-value=0.017 Score=30.28 Aligned_cols=108 Identities=20% Similarity=0.246 Sum_probs=61.1 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |.+-.+| +|+|.+-|.++.+ +.+..++..-++.|+.+.+.+.. -.|- +|-..- T Consensus 1 Ma~~~~G-~~~l~~~L~~~~~---~~~~~~~~al~~~a~~v~~~ak~---~aPv--------------------dTG~Lr 53 (137) T protein:vir:94 1 MAKVKYG-NWDLVKELENYER---DIERWVKRGIAKTTVKIHNTIIS---LMPV--------------------DTGYLR 53 (137) T ss_pred CchhHHh-HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH---hCCc--------------------Ccchhh Confidence 8887774 5566666666543 44555555555556555543321 1221 355566 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcC-----------------------------CCCccccc Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGT-----------------------------SKQPARMP 131 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgt-----------------------------skqparmp 131 (160) +||+...+. +|.-+-||.+ .+-|-|+|||| +.+||+-- T Consensus 54 ~SI~~~~~~---~~~~~~V~~~----------~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PF 120 (137) T protein:vir:94 54 ESVTMDFKD---GGFTGVINIG----------SEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPF 120 (137) T ss_pred cCceeEeec---CcEEEEEecC----------CCcccccccCccccccCCCcccccccccceeccCCceeecCCcCCCcc Confidence 777654432 3333334321 24678999995 45777655 Q ss_pred hhhhHHHHHHHHHHHHH Q lcl|NC_019515. 132 FHKSWAMMEHEIMEEVS 148 (160) Q Consensus 132 fhkswammeheimeevs 148 (160) +-.+|.-.+-+|.+.+| T Consensus 121 l~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 121 WEPAIDAGRVFFNKYFS 137 (137) T ss_pred hHHHHHHHHHHHHHhhC Confidence 55666666666665555 No 68 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=82.20 E-value=0.022 Score=29.71 Aligned_cols=79 Identities=14% Similarity=0.096 Sum_probs=58.5 Q ss_pred CC-cccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 1 MT-SELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 1 mt-selygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkf 79 (160) .| .+-...| .+.+.+.-...-.-+.....+|+.++..|++.|.. ...|.+.+.-++.| |.| .=||-|... T Consensus 69 ~t~~~~~~~~---~~~~~~~~~~~~~~~~~L~~~G~~~~~~ik~~I~~--~~~ppna~sTi~~K---g~~-~PLidTG~l 139 (148) T protein:vir:52 69 QTLEENQEKY---TALFIQWFDQGVPAAQIYERLSVMAQGDVQMNIVK--GEWVANAKSTIRRK---KSS-KPLIDTGKM 139 (148) T ss_pred HHHHHHHHHH---HHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhc--CCCCCCcHHHHHhc---CCC-CchhHHHHH Confidence 12 2222334 33344444455667899999999999999999964 56899999999977 444 569999999 Q ss_pred hhhhhhhhc Q lcl|NC_019515. 80 VDSIKLVYE 88 (160) Q Consensus 80 vdsiklvye 88 (160) .+||.-+-| T Consensus 140 ~~SIty~V~ 148 (148) T protein:vir:52 140 RQSVRGIVK 148 (148) T ss_pred HHHhhhhcC Confidence 999998877 No 69 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=82.15 E-value=0.018 Score=30.12 Aligned_cols=82 Identities=20% Similarity=0.215 Sum_probs=63.0 Q ss_pred CC-cccccCHHH-HHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhh Q lcl|NC_019515. 1 MT-SELYGDWDK-FAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHK 78 (160) Q Consensus 1 mt-selygdwdk-faqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthk 78 (160) .| .+--..|.+ +.+.+.++-.++-..+.....+|+.++..|++.|.. ..-|.+.+.-++.| |. ++=||-|.. T Consensus 117 ~t~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~~G~~~~~~ik~~I~~--~~~ppna~sTi~~K---g~-~~PLidTG~ 190 (200) T protein:vir:99 117 LAWATFNKDKVKIQAQIARQLLDGTINPEQALAQIGLALEGCIVRSIKS--GPWAANSPATIRAK---GF-DKPLIDTAH 190 (200) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc--CCCCCChHHHHHHh---CC-CCchHHHHH Confidence 11 222345554 445556666677778999999999999999999964 55799999999987 44 467999999 Q ss_pred hhhhhhhhhc Q lcl|NC_019515. 79 FVDSIKLVYE 88 (160) Q Consensus 79 fvdsiklvye 88 (160) ..+||.-+-| T Consensus 191 l~~SIty~Ve 200 (200) T protein:vir:99 191 MWQTVSSKVS 200 (200) T ss_pred HHhHhccccC Confidence 9999998887 No 70 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=81.98 E-value=0.031 Score=28.83 Aligned_cols=137 Identities=17% Similarity=0.132 Sum_probs=60.3 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |.+--+-..|++.+-|..+.+ .-.+.++..-.++++++.+.++.+-=-.-..| |-..- T Consensus 1 m~~v~i~Gld~L~~kl~~~~~---~~~~~v~~a~~~~~~~~a~~v~~~ak~~~Pvd-------------------tG~Lr 58 (182) T protein:vir:10 1 MIEVELKGVNELRAKLKKLPD---IMAKATANAQENAIEQAEAYAVDELQSSIKYS-------------------TGELT 58 (182) T ss_pred CeEEEEecHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCC-------------------chhhh Confidence 666555567777666655532 22333333334444444333322110011123 33334 Q ss_pred hhhhhhhcccCCCeEEEEEecc--------CCcccc------cc----cH--HH-----HHHHHh--------------- Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVD--------GGTTDT------GL----SM--QE-----LADFIE--------------- 120 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvd--------ggttdt------gl----sm--qe-----ladfie--------------- 120 (160) +||+.-.. ..++++...|+.+ -||--. |+ +. .. -++.++ T Consensus 59 ~SI~~~~~-~~~~~~~g~V~~~~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~ 137 (182) T protein:vir:10 59 RSFKHEVK-VDGDEVIGRWWNSSMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKIN 137 (182) T ss_pred hceeeeee-ecCCeEEEEeecCCCccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeec Confidence 44542221 2244444444433 111000 00 00 00 011222 Q ss_pred ----hcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_019515. 121 ----FGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIEGDLK 160 (160) Q Consensus 121 ----fgtskqparmpfhkswammeheimeevsqrllaiiegdlk 160 (160) ++|+.||||-=|-.+|..++.++.+.+.+++-.-+.--+- T Consensus 138 ~~~~~~t~G~~aqPFl~pA~~~~~~~i~~~i~~~i~~~l~~~~g 181 (182) T protein:vir:10 138 GKYFYRTTGQPARQFMTPAANKMAKEAPEIIKRSIDQELHDKLG 181 (182) T ss_pred CceEeecCCCCCCcchHHHHHHhHHHHHHHHHHHHHHHHHHhhc Confidence 3467889987778889888888777666554433322111 No 71 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=81.81 E-value=0.021 Score=29.78 Aligned_cols=108 Identities=19% Similarity=0.258 Sum_probs=52.2 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |++-.+|. |+|..-|.++.+ +.++.++..-++.|+.|.+.... ..| .+|-..- T Consensus 1 Ma~~~~G~-~~l~~~l~~~~~---~~~~~~~~al~~~a~~i~~~ak~---~aP--------------------v~TG~Lr 53 (137) T protein:vir:10 1 MAKVKYGN-WDLVKELEEFEK---ETIRWAKKGIAKTTTIIHNSIVS---NMP--------------------VDTGYLR 53 (137) T ss_pred CccchhCH-HHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH---hCC--------------------cCcchhh Confidence 98877776 567777766654 33444444444444444433221 123 2455566 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCC-----------------------------Cccccc Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSK-----------------------------QPARMP 131 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtsk-----------------------------qparmp 131 (160) +||+...+ .+|+-.-||.+ .+-|-|+||||.. ||++-- T Consensus 54 ~SI~~~~~---~~~~~~~V~~~----------~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pf 120 (137) T protein:vir:10 54 ESVSMDFK---KGGLTGVINIG----------SEYAVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPF 120 (137) T ss_pred cCeeeEec---CCcEEEEEecC----------CccccccccCccccccCCCcccccccceeeeccccccccCCCCCCCcc Confidence 77764332 23433333321 2468899999843 444422 Q ss_pred hhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019515. 132 FHKSWAMMEHEIMEEVSQRLLA 153 (160) Q Consensus 132 fhkswammeheimeevsqrlla 153 (160) |-.+|.-.+-.|. +.|| T Consensus 121 l~pA~~~~~~~i~-----k~i~ 137 (137) T protein:vir:10 121 WEPAIDEGRAFFN-----KYFS 137 (137) T ss_pred hhHHHHHHHHHHH-----HhhC Confidence 2333332222222 2222 No 72 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=81.33 E-value=0.012 Score=31.11 Aligned_cols=108 Identities=20% Similarity=0.227 Sum_probs=54.7 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |.+-.+|. |+|.+-|.++. ...+++++..-++.|+.+.+...- -.|. +|-..- T Consensus 1 Ma~~~~G~-~~l~~~l~~~~---~~~~~~~~~~l~~~a~~~~~~ak~---~~pv--------------------dTG~L~ 53 (137) T protein:vir:96 1 MAKVKYGN-WDLVAELEDYR---DEMEEWVKKGILKTTLAIYNTAVA---LAPV--------------------DLGFLK 53 (137) T ss_pred CchhHhhH-HHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH---hCCc--------------------Cccchh Confidence 88877755 56666565553 445666666666666655443321 1221 233444 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcC-----------------------------CCCccccc Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGT-----------------------------SKQPARMP 131 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgt-----------------------------skqparmp 131 (160) +||+.... .+|.-+.||.. .+-|-|+|||| +.+|++-- T Consensus 54 ~Si~~~~~---~~g~~~~V~~~----------~~YA~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pF 120 (137) T protein:vir:96 54 ESIDFKVT---DGGFSSVISVG----------AEYAIYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPF 120 (137) T ss_pred cCceeEee---cCceEEEEecC----------CCcccccccCccccccCCCccccccccceeeccCcceeecCCCCCCcc Confidence 56654222 23433444432 24688999999 34556533 Q ss_pred hhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019515. 132 FHKSWAMMEHEIMEEVSQRLLA 153 (160) Q Consensus 132 fhkswammeheimeevsqrlla 153 (160) +-.++.-.+-.|. ++++ T Consensus 121 l~pA~~~~~~~i~-----k~i~ 137 (137) T protein:vir:96 121 WNPAIDEGRKVFN-----RYFS 137 (137) T ss_pred hhHHHHHHHHHHH-----HhhC Confidence 3444433333333 2233 No 73 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=79.11 E-value=0.086 Score=26.43 Aligned_cols=87 Identities=15% Similarity=0.287 Sum_probs=57.4 Q ss_pred eehhhh-hhhhhhhhhhcccCCCeEEEEEeccCCc-------ccccccHHHHHHHHhhcCCC------------------ Q lcl|NC_019515. 72 ILIRTH-KFVDSIKLVYEEDRGDGIMVFIGVDGGT-------TDTGLSMQELADFIEFGTSK------------------ 125 (160) Q Consensus 72 ilirth-kfvdsiklvyeedrgdgimvfigvdggt-------tdtglsmqeladfiefgtsk------------------ 125 (160) +-+|.- +.++.+.--.++ -.+-.|-+|+-.+. ...|.+++.+|-.-|||..- T Consensus 1 m~~~~~~~~~~~~~~~l~~--l~~~~v~vGi~~~~~~~~~~~~~~G~~va~iAai~EfG~~I~~~~~~~~~~~~~~~g~~ 78 (193) T protein:vir:96 1 MSLRRDSELIAAHLQMLRA--MRGRSVSAGWYSTARYPDKAGGSVGIQVARIARLNEYGGTIDHPGGTRYIRDAIVRGRF 78 (193) T ss_pred CeeccchHHHHHHHHHHHH--hcCCeEEEEEcCCCCCCCcccccccchHHHHHhHHHcCCccccCccceeeeeccccccc Confidence 223322 123333222222 34566777885432 33589999999999999762 Q ss_pred -----------------------CccccchhhhHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_019515. 126 -----------------------QPARMPFHKSWAMMEHEIMEEVSQRLLAIIEGDLK 160 (160) Q Consensus 126 -----------------------qparmpfhkswammeheimeevsqrllaiiegdlk 160 (160) -|+|-=+..++..-..++.+-+.+.+-++++|+.. T Consensus 79 ~~~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~~~~~~~~~g~~~ 136 (193) T protein:vir:96 79 VGVRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQNRIAMRLARGQIT 136 (193) T ss_pred cccceeccCcceeeEeecceeccCCCcchhhhhHHHHHHHHHHHHHHHHHHHHhCCCC Confidence 37776566777877888888888888889999876 No 74 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=78.23 E-value=0.056 Score=27.46 Aligned_cols=127 Identities=17% Similarity=0.233 Sum_probs=54.3 Q ss_pred CCcccccCHHHHHHHHhhhccCC---cch-HHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNE---PEY-DDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRT 76 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdne---pey-ddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirt 76 (160) |+ + -|-++|.||.... ++- ..+.+..+..+++++.+- .|.-. |-.. ..+..+ . T Consensus 3 ~~-~------~lee~l~~i~kl~~~~~~~~~ki~kaGA~v~~e~L~~~-------tp~~~--~~~~--~~~~~~-----~ 59 (139) T protein:vir:10 3 MD-E------ALGQWLKQVSKAAELSISDQEKITKAGADVYAKKLAET-------TKEKH--PNTK--GDGGKY-----G 59 (139) T ss_pred HH-H------HHHHHHHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHh-------ccccc--CcCC--CCCCCC-----c Confidence 22 2 2334455543332 222 223344445556666543 33211 0000 000000 1 Q ss_pred hhhhhhhhhhh-cccC-CCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019515. 77 HKFVDSIKLVY-EEDR-GDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAI 154 (160) Q Consensus 77 hkfvdsiklvy-eedr-gdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswammeheimeevsqrllai 154 (160) | .-|+|+.-= ..|. .+|.. -+|.|- .+ -.|-|+||||++||+.-=.-+.-.-++.|+.+...+-+-.+ T Consensus 60 H-laD~I~~s~~~~dg~~~g~~-~VG~~k----~~----~~A~f~n~GT~k~~~~hFie~t~~e~~~evl~a~~~~~k~~ 129 (139) T protein:vir:10 60 H-LSEDIRSAAGDIDGDHNGSS-TVGFHN----KA----HIARFLNDGTKYIRADHFVDNARDDAKDAVFAAEAEKYQAM 129 (139) T ss_pred c-hhhcceecCcccccccceee-eeCCCC----Cc----ceEeecccCccccCCCchHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 456655422 1222 24443 466652 12 25899999999999984333444444555444333333333 Q ss_pred hh---c-ccC Q lcl|NC_019515. 155 IE---G-DLK 160 (160) Q Consensus 155 ie---g-dlk 160 (160) |. | |=| T Consensus 130 l~~~~~~~~~ 139 (139) T protein:vir:10 130 IAKANGGGDK 139 (139) T ss_pred HhhcCCCCCC Confidence 32 2 222 No 75 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=77.52 E-value=0.071 Score=26.89 Aligned_cols=132 Identities=16% Similarity=0.215 Sum_probs=54.7 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHH-HHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQK-IAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqk-iaekiremiegqeidmpalddeyladkvsegydsrilirthkf 79 (160) |.+.-.| -+.|.+-|..|-...++-..-|-..|-+ +++++.+.- |. ..|+.|=.-..-.. T Consensus 1 M~~~~~g-lee~~~~lekL~~~~~~~~~katkAGA~v~~e~L~~~t-------p~-----------~h~~~~kt~~~~Hl 61 (153) T protein:vir:49 1 MTGLDEA-LEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVT-------RE-----------KHYSKKKDLKYGHM 61 (153) T ss_pred CccHHHH-HHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhc-------cc-----------cCCCCCCCCCCCcc Confidence 7663311 2333444444444444444344444443 444454432 21 11222100000124 Q ss_pred hhhhhhhhcccCCCeE---EEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhh-HH--HHHHHHHHHHHHHHHH Q lcl|NC_019515. 80 VDSIKLVYEEDRGDGI---MVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKS-WA--MMEHEIMEEVSQRLLA 153 (160) Q Consensus 80 vdsiklvyeedrgdgi---mvfigvdggttdtglsmqeladfiefgtskqparmpfhks-wa--mmeheimeevsqrlla 153 (160) -|+|..- .-+-||. -+.+|.+..+ -.-+|-|+||||++||+. ||-.- -. -.+-+|.+....-+-. T Consensus 62 aD~I~~s--~~~idG~~dG~s~VG~~~~~------~a~~a~f~n~GT~km~~~-hFie~tr~e~~~k~~vl~A~~~~~~~ 132 (153) T protein:vir:49 62 ADGLAVQ--STNADGRKNGVSTVGWKNNY------HAQNARRLNDGTKKYRAD-HFITNVQNDSTVKNKVLLAEKEEYEK 132 (153) T ss_pred cccceec--cccccccccceeeecccCCc------cceeeeecccCcccCCCC-hhhHHHHHHhhHHHHHHHHHHHHHHH Confidence 5565532 1121221 2345555432 124689999999999976 56421 11 1123444433333333 Q ss_pred HhhcccC Q lcl|NC_019515. 154 IIEGDLK 160 (160) Q Consensus 154 iiegdlk 160 (160) ||.--+- T Consensus 133 il~~~~~ 139 (153) T protein:vir:49 133 LIRRKGG 139 (153) T ss_pred HHHhcCC Confidence 3322211 No 76 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=77.33 E-value=0.014 Score=30.71 Aligned_cols=92 Identities=21% Similarity=0.373 Sum_probs=49.0 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) |.---| .|+-..++++.|+.+... ++ ++.|-++.+.+|..-... .+| +| |-..- T Consensus 1 Ma~~~i-~~~Gld~L~~~L~~~~~~-~~-v~~vv~~~~~~l~~~ak~---~ap--------------~d------TG~lr 54 (92) T protein:vir:99 1 MADYSI-SWDGLDALDEALANQQNM-NT-VKKVVKKHTANLMTATQQ---AVP--------------VD------TGHLK 54 (92) T ss_pred CCceee-EeehHHHHHHHHHhhccH-HH-HHHHHHHHHHHHHHHHHH---hCC--------------CC------ccccc Confidence 655222 566566666666655432 22 344444444444322211 123 22 33344 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCcc Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPA 128 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskqpa 128 (160) +||++-++. +|.-.-|++.|.+++ -+-|+||||.+.+| T Consensus 55 rSI~~~~~~---~g~~~~v~~~gp~a~-------Ya~YvE~GTR~M~A 92 (92) T protein:vir:99 55 QSAQIQISR---DGFTGSVTYGGGLVN-------YAAYVEFGTRFMDS 92 (92) T ss_pred eeeeEEeec---CCeeEEEEeccCccc-------cccccccceeecCC Confidence 567644433 344455566555555 67799999999999 No 77 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=76.86 E-value=0.13 Score=25.43 Aligned_cols=131 Identities=16% Similarity=0.228 Sum_probs=74.6 Q ss_pred ccCHHHHHHHHhhhccC-Cc-chHHHHHHHHHHHHHHHHHHhcccc-c---CCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 6 YGDWDKFAQILHNLKDN-EP-EYDDVIRSVGQKIAEKIREMIEGQE-I---DMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 6 ygdwdkfaqilhnlkdn-ep-eyddvirsvgqkiaekiremiegqe-i---dmpalddeyladkvsegydsrilirthkf 79 (160) --+.+.+.+.|+.+-++ +| .-..+.|.+|+.+....++-|+.|. . -.|.+...|+..|- |...++++.+-.- T Consensus 1 m~~~~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~--~~~~~~l~~~~~l 78 (150) T protein:vir:57 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKT--GRVKRKMFAKLIT 78 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhc--cCCCcccchhhhh Confidence 23445555555554333 22 3466889999999999999888774 2 25778888887774 5567777776555 Q ss_pred hhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCC----------Cccc--cchhhhHHHHHHHHHHHH Q lcl|NC_019515. 80 VDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSK----------QPAR--MPFHKSWAMMEHEIMEEV 147 (160) Q Consensus 80 vdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtsk----------qpar--mpfhkswammeheimeev 147 (160) -.||+..+. .|+..|- .-.| |....|..--||.+. -||| |+|.. --|.||++-+ T Consensus 79 ~~sl~~~~~---~~~a~vg--~~~G------~~~~yAaiHQfG~~~r~~~~~~~~~iPaRp~LG~s~---~d~~~i~~~i 144 (150) T protein:vir:57 79 SRFLHIRAS---PEQASME--FYGG------KSPKIASVHQFGLSEETRKDGKKIDYPARPLLGFTG---EDVQMIEEII 144 (150) T ss_pred ccceeeeee---CcEEEEE--eecC------CchhhhhhhhccccccccCCCceeecCCcccCCCCH---HHHHHHHHHH Confidence 566665444 3444442 2122 235778888898664 3665 23321 1233333322 Q ss_pred HHHHHHHhhcccC Q lcl|NC_019515. 148 SQRLLAIIEGDLK 160 (160) Q Consensus 148 sqrllaiiegdlk 160 (160) ..-|. T Consensus 145 --------~~~l~ 149 (150) T protein:vir:57 145 --------LAHLD 149 (150) T ss_pred --------HHHHh Confidence 22222 No 78 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=76.82 E-value=0.081 Score=26.57 Aligned_cols=132 Identities=16% Similarity=0.228 Sum_probs=60.1 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHH-HHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQK-IAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqk-iaekiremiegqeidmpalddeyladkvsegydsrilirthkf 79 (160) |.+- .---+.|..-|.+|-...++-...|-..|-+ +++++.+.-.-.- |+.+=--..-.. T Consensus 1 M~~~-~~gl~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~h------------------y~~~~~~~~~Hl 61 (141) T protein:vir:50 1 MVGL-AEALDEWLKTVASIGNLTPAEQVEITTAGAKVFKKELEEVTREKH------------------YSRKKNPKFGHM 61 (141) T ss_pred CccH-HHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHhcccCC------------------CCCCCCCCCCcc Confidence 7662 2113444445555554444444444445544 4555554432110 111100000124 Q ss_pred hhhhhhh--hcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchh-hhH--HHHHHHHHH---HHHHHH Q lcl|NC_019515. 80 VDSIKLV--YEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFH-KSW--AMMEHEIME---EVSQRL 151 (160) Q Consensus 80 vdsiklv--yeedrgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfh-ksw--ammeheime---evsqrl 151 (160) .|||+.- +.....||.. .+|.+..+ -.-+|-|++|||++||+. ||- +.- +-...+|.+ ++-+++ T Consensus 62 aD~I~~~~~~~DG~~dg~s-~VG~~~~~------~~~~A~f~n~GT~k~~~~-hFve~~~~~a~~k~~Vl~A~~~~~k~~ 133 (141) T protein:vir:50 62 ADGLAIQSTNADGRKNGVS-TVGWKNNY------HAQNARRLNDGTKKYRAD-HFVTNVQNDSTVQKKVLLEKKRNTKNS 133 (141) T ss_pred ccceeeccCccccccCCee-eeccCCCc------cceeeeccccCccccCCC-chhHHHHHhhhhHHHHHHHHHHHHHHH Confidence 5665431 2222224443 57775432 235789999999999985 443 222 111234433 344444 Q ss_pred HHHhhccc Q lcl|NC_019515. 152 LAIIEGDL 159 (160) Q Consensus 152 laiiegdl 159 (160) |.--.||= T Consensus 134 l~~~~~~~ 141 (141) T protein:vir:50 134 LEEKEGCD 141 (141) T ss_pred HHhccCCC Confidence 54445555 No 79 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=76.62 E-value=0.079 Score=26.61 Aligned_cols=130 Identities=17% Similarity=0.218 Sum_probs=60.7 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHH-HHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhh-h Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQ-KIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTH-K 78 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgq-kiaekiremiegqeidmpalddeyladkvsegydsrilirth-k 78 (160) |.+-- ---+.|.+-|.+|-...++-...|-..|- .+++++.+.-.-.- |..|=- ..+ - T Consensus 1 M~~~~-d~l~e~~~~lekl~~~~~~~~~katkAGA~v~~~~L~~~tp~~h------------------~~~~~t-~~~~H 60 (140) T protein:vir:48 1 MTGLD-EALEGWLKTVASIGDLTPAEQAKITTAGAKVFKEELAEVTRQKH------------------YSNKKH-LKYGH 60 (140) T ss_pred CccHH-HHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhccccC------------------CCCCCC-CCCCc Confidence 76622 11334444444554444444444444554 44555655432100 111100 001 1 Q ss_pred hhhhhhhh-hcccC-CCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchh-hhHHHH--HHHH---HHHHHHH Q lcl|NC_019515. 79 FVDSIKLV-YEEDR-GDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFH-KSWAMM--EHEI---MEEVSQR 150 (160) Q Consensus 79 fvdsiklv-yeedr-gdgimvfigvdggttdtglsmqeladfiefgtskqparmpfh-kswamm--ehei---meevsqr 150 (160) ..|||+.- +..|. .||.. .+|.+..+. .-+|-|+++||++||+. ||- +.-.-+ .-+| |.++-++ T Consensus 61 laD~I~~~~~~iDg~~~g~s-~VG~~kk~~------a~~A~f~n~GT~k~~~~-hFve~~~~e~~~k~~vl~A~~~~~~~ 132 (140) T protein:vir:48 61 MADGLSVQSTNVDGRKNGVS-TVGWVNRYH------AQNARRLNDGTKKYRAD-HFVTNVQNDSAVQTKVLLAEKEEYEK 132 (140) T ss_pred chhceeecccccccccCcee-eeccCCCcc------eeeeeccccCccccCCC-chhHHHHHhhhhHHHHHHHHHHHHHH Confidence 45666542 11121 23333 466654322 35789999999999986 443 222111 2233 4455666 Q ss_pred HHHHhhcc Q lcl|NC_019515. 151 LLAIIEGD 158 (160) Q Consensus 151 llaiiegd 158 (160) +|.---|| T Consensus 133 ~l~~~~~~ 140 (140) T protein:vir:48 133 LIRKKGGE 140 (140) T ss_pred HHHhhcCC Confidence 66666666 No 80 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=74.67 E-value=0.16 Score=25.02 Aligned_cols=132 Identities=17% Similarity=0.236 Sum_probs=74.1 Q ss_pred ccCHHHHHHHHhhhccC-Cc-chHHHHHHHHHHHHHHHHHHhcccc-cC---Ccccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 6 YGDWDKFAQILHNLKDN-EP-EYDDVIRSVGQKIAEKIREMIEGQE-ID---MPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 6 ygdwdkfaqilhnlkdn-ep-eyddvirsvgqkiaekiremiegqe-id---mpalddeyladkvsegydsrilirthkf 79 (160) --|.+.+.+.|+.+-.+ +| .-..+.|.+|+.+....++-|+.|. .| .+.+...|++.|- |-..++++++-.- T Consensus 1 ~~~~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~--~~~~~~l~~~~~l 78 (150) T protein:vir:60 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKT--GRVKRKMFAKLIT 78 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhh--cCCCccchhhhhh Confidence 33445555545543332 22 3456889999999998888888774 32 6677788887764 5556777776655 Q ss_pred hhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCC----------ccc--cchhhhHHHHHHHHHHHH Q lcl|NC_019515. 80 VDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQ----------PAR--MPFHKSWAMMEHEIMEEV 147 (160) Q Consensus 80 vdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskq----------par--mpfhkswammeheimeev 147 (160) -.||+..+. +++..|- .=.| |....|-.--||.+.. ||| ++|-. --|.+|++-+ T Consensus 79 ~~sl~~~~~---~~~a~vg--~~~G------t~~~yAaiHQfG~~~~~~~~~~~~~iPaRp~LG~s~---~d~~~i~~~i 144 (150) T protein:vir:60 79 SRFLHIRAS---PEQASME--FYGG------KSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTG---EDVQMIEEII 144 (150) T ss_pred cceeeeeee---CcEEEEE--eeCC------CchhhhhhhhccccccccCCCCceecCCcccCCCCH---HHHHHHHHHH Confidence 666665544 3444442 2123 2357888889996633 555 22322 1233333322 Q ss_pred HHHHHH Q lcl|NC_019515. 148 SQRLLA 153 (160) Q Consensus 148 sqrlla 153 (160) ...|-. T Consensus 145 ~~~l~r 150 (150) T protein:vir:60 145 LAHLDR 150 (150) T ss_pred HHHHhC Confidence 222211 No 81 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=72.72 E-value=0.038 Score=28.38 Aligned_cols=79 Identities=27% Similarity=0.288 Sum_probs=43.7 Q ss_pred ehhhhhhhhhhhhhhcccCCCeEEEEEeccCCcc-------------------cccccHHHHHHHHhhcCCCCccccchh Q lcl|NC_019515. 73 LIRTHKFVDSIKLVYEEDRGDGIMVFIGVDGGTT-------------------DTGLSMQELADFIEFGTSKQPARMPFH 133 (160) Q Consensus 73 lirthkfvdsiklvyeedrgdgimvfigvdggtt-------------------dtglsmqeladfiefgtskqparmpfh 133 (160) .=...+.+++ + .++=.+..|-+|+=.|++ ..|+++..+|-+.|||+...|+|-=+. T Consensus 1 m~~~r~~l~~---~--~~~l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~G~pva~ia~~~e~G~~~IP~RPFlr 75 (155) T protein:vir:77 1 MSVTRRGLTL---P--KDRYRSMSVKAGVLAGATYPDESGKKLADGSILKKDPRAGLPVAMIAMALNYGTSKLPARPFME 75 (155) T ss_pred CcchHHHHHH---H--HHHHhcCceEEeecCCCCCccccchhhhhhhhccccccccccHhhhhhhhhcCCCCCCCCchhh Confidence 0001111111 1 222223345566633321 258999999999999999999996666 Q ss_pred hhHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_019515. 134 KSWAMMEHEIMEEVSQRLLAIIEGDLK 160 (160) Q Consensus 134 kswammeheimeevsqrllaiiegdlk 160 (160) ..++.-.-++.+.+.+. +.+.+. T Consensus 76 ~t~~~~~~~~~~~l~~~----~~~~~~ 98 (155) T protein:vir:77 76 KTIADRSAEWIKGLTVM----MTMGYD 98 (155) T ss_pred HHHHHHHHHHHHHHHHH----HHccCc Confidence 66665555555554443 333333 No 82 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=69.84 E-value=0.18 Score=24.61 Aligned_cols=129 Identities=19% Similarity=0.284 Sum_probs=53.9 Q ss_pred CCc-ccccCHHHHHHHHhhhccC---CcchHHHH-HHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehh Q lcl|NC_019515. 1 MTS-ELYGDWDKFAQILHNLKDN---EPEYDDVI-RSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIR 75 (160) Q Consensus 1 mts-elygdwdkfaqilhnlkdn---epeyddvi-rsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilir 75 (160) |.- +-+.+| |.+|... .++-...| +..+..+++++++-. |.-. .+.+=+++= - T Consensus 1 ~~~~~~l~e~------l~~lekl~~~~~~~~~k~tkaGA~v~~~~L~~~t-------p~~~-------~~~~~~~~~--~ 58 (139) T protein:vir:10 1 MDMDEALGQW------LKQVSKAAQLSVSDQEKITKAGADVYAKELAETT-------KEKH-------PNTKGDGGK--Y 58 (139) T ss_pred CCHHHHHHHH------HHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHhc-------cccc-------ccCCCCCCC--C Confidence 211 233333 4444322 23333334 333455666665432 3110 000001000 0 Q ss_pred hhhhhhhhhhhhcccCCCeEEEEEeccCCcccccccH-HHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019515. 76 THKFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSM-QELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAI 154 (160) Q Consensus 76 thkfvdsiklvyeedrgdgimvfigvdggttdtglsm-qeladfiefgtskqparmpfhkswammeheimeevsqrllai 154 (160) +| ..|+|.. ..-+-||. -+|+++.|..- --+|-|+++||++||+.-=.-+.-.-++-|+.+...+-+-.+ T Consensus 59 ~H-laD~I~~--~~~~idg~------~~g~~~VG~~~~~~~Ahf~n~GT~~~~~~hFie~t~~e~~~ev~~a~~~~~ke~ 129 (139) T protein:vir:10 59 GH-LSEDISS--AAGDIDGD------HNGSSTVGFHNKAHIARFLNDGTKNIRADHFVDNARDDAKDAVFAAEAEKYQAM 129 (139) T ss_pred Cc-cccccee--cCcccccc------ccccceeCCCCCceeeeeeccCccccCCCchHHHHHHHHHHHHHHHHHHHHHHH Confidence 12 4455543 22222221 12333344321 125889999999999874334444445555554444444444 Q ss_pred hh---c-ccC Q lcl|NC_019515. 155 IE---G-DLK 160 (160) Q Consensus 155 ie---g-dlk 160 (160) |. | |-| T Consensus 130 l~~~~~~~~~ 139 (139) T protein:vir:10 130 IAKANGGDSK 139 (139) T ss_pred HhhcCCCCCC Confidence 43 2 333 No 83 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=69.75 E-value=0.055 Score=27.48 Aligned_cols=94 Identities=19% Similarity=0.176 Sum_probs=44.8 Q ss_pred cccCCcccchhhhhhhhhccccceeehhhhhhhhhhhhhhccc---CCCeEEEEEeccCC-cccccccHHHHHHHHhhcC Q lcl|NC_019515. 48 QEIDMPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLVYEED---RGDGIMVFIGVDGG-TTDTGLSMQELADFIEFGT 123 (160) Q Consensus 48 qeidmpalddeyladkvsegydsrilirthkfvdsiklvyeed---rgdgimvfigvdgg-ttdtglsmqeladfiefgt 123 (160) -.|+--.|+. +.+.++ +.+ +++- +++++ .++|..+.+|...+ ..-.|.++..+|-+.|||| T Consensus 1 m~v~r~~L~~--~~~~l~-~~~--V~VG----------i~~~a~y~d~~g~~~~~g~~~~~~~~~G~pva~ia~~~e~G~ 65 (155) T protein:vir:10 1 MSVTRRGLTL--PKDRYK-SMS--VKAG----------VLAGATYPDESGKKLADGTILKKDPRAGLPVAMIAMALNYGT 65 (155) T ss_pred CcchHHHHHH--HHHHhh-CCe--eEEe----------ecCCCCCCccccchhhhhhhhccccccCcchhhhhhhhhcCC Confidence 1111111111 111111 110 0000 01111 12233443333222 1135899999999999999 Q ss_pred CCCccccchhhhHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_019515. 124 SKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIEGDLK 160 (160) Q Consensus 124 skqparmpfhkswammeheimeevsqrllaiiegdlk 160 (160) ...|+|-=|...++.-..++.+.+.+.+ .+.+. T Consensus 66 ~~IP~RPFlr~t~~~~~~~~~~~l~~~~----~~~~~ 98 (155) T protein:vir:10 66 SKLPARPFMEKTIADRSAEWIKGLTVMM----TMGYD 98 (155) T ss_pred CCCCCcchhHHHHHHHHHHHHHHHHHHH----HcCCC Confidence 9999996666667666666665555443 33333 No 84 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=68.71 E-value=0.1 Score=26.08 Aligned_cols=78 Identities=15% Similarity=0.327 Sum_probs=56.3 Q ss_pred CCcccccCHHH-HHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 1 MTSELYGDWDK-FAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 1 mtselygdwdk-faqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkf 79 (160) .-++-...|-+ |++.+ .+.-.-+.....+|+.++..|++.|.. ++ |.+.+.-.+.| |+| +=||-|... T Consensus 77 t~~~~~~~~~~~l~~~~----~~~~~~~~~L~~~G~~~~~~Ik~~I~~--~~-~pna~~Ti~~K---g~~-kPLidTG~l 145 (155) T protein:vir:78 77 TITDRSAEWIKGLTVMM----TMGYDAEVAMGQIGQAMKDDIKTTISE--WP-ADNSADWAGKK---GFN-HGLIWTSHL 145 (155) T ss_pred HHHHHHHHHHHHHHHHH----HcCCCHHHHHHHHHHHHHHHHHHHHhc--CC-CCCcHHHHHhc---CCC-CchhHHHHH Confidence 22333444532 33333 234567889999999999999999964 66 66777777765 554 689999999 Q ss_pred hhhhhhhhcc Q lcl|NC_019515. 80 VDSIKLVYEE 89 (160) Q Consensus 80 vdsiklvyee 89 (160) .+||.-+-+| T Consensus 146 ~~SIty~V~~ 155 (155) T protein:vir:78 146 LNSVEQEIVK 155 (155) T ss_pred HHhhhhhccC Confidence 9999988888 No 85 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=68.70 E-value=0.23 Score=24.05 Aligned_cols=129 Identities=20% Similarity=0.285 Sum_probs=75.1 Q ss_pred CCcccccCHHHHHHHHhhhccC-Cc-chHHHHHHHHHHHHHHHHHHhcccc-cC---Ccccchhhhhhhhhccccceeeh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDN-EP-EYDDVIRSVGQKIAEKIREMIEGQE-ID---MPALDDEYLADKVSEGYDSRILI 74 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdn-ep-eyddvirsvgqkiaekiremiegqe-id---mpalddeyladkvsegydsrili 74 (160) |. |-+.+.+.|+.|-.+ +| .-..+.|.+|+.+....++-++.|. .| .+.+...|+..|- |--.++++ T Consensus 1 m~-----d~~~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~--~~~~~~l~ 73 (149) T protein:vir:98 1 MS-----ELTALQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKK--GRIRREMF 73 (149) T ss_pred Cc-----hHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhcc--CCCCcccc Confidence 43 345555555544222 23 3456889999999999998888774 32 5667777765543 44567888 Q ss_pred hhhhhhhhhhhhhcccCCCeEEE-EEeccCCcccccccHHHHHHHHhhcCCC----------Cccc--cchhhhHHHHHH Q lcl|NC_019515. 75 RTHKFVDSIKLVYEEDRGDGIMV-FIGVDGGTTDTGLSMQELADFIEFGTSK----------QPAR--MPFHKSWAMMEH 141 (160) Q Consensus 75 rthkfvdsiklvyeedrgdgimv-figvdggttdtglsmqeladfiefgtsk----------qpar--mpfhkswammeh 141 (160) ++-..-.||+..+.. +++.| |+| |....|-.-.||..- -||| ++|-. T Consensus 74 ~~g~l~~sl~~~~~~---~~~~V~~~G----------s~~~yAa~HQfG~~~r~~~~~~~~~iPaRp~LG~s~------- 133 (149) T protein:vir:98 74 ARLRTNRFMKAKGSD---SAAVVEFTG----------RVQRMARVHQYGLKDRPNRHSRDVQYAARPLLGFTR------- 133 (149) T ss_pred hhhhhhhhhhheecC---CeeEEEecC----------cchHHhhHhhccccccccCCCcceeccccccCCCCH------- Confidence 776667777765543 44444 443 234788888999753 3555 23321 Q ss_pred HHHHHHHHHHHHHhhcccC Q lcl|NC_019515. 142 EIMEEVSQRLLAIIEGDLK 160 (160) Q Consensus 142 eimeevsqrllaiiegdlk 160 (160) +-.+.++.+|.--|. T Consensus 134 ----~d~~~i~~~i~~~l~ 148 (149) T protein:vir:98 134 ----DDEQMIEDIIIRHLG 148 (149) T ss_pred ----HHHHHHHHHHHHHhh Confidence 122333333333333 No 86 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=67.78 E-value=0.2 Score=24.45 Aligned_cols=125 Identities=18% Similarity=0.300 Sum_probs=52.5 Q ss_pred CCccccc-CHHHHHHHHhhhccC--CcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhh Q lcl|NC_019515. 1 MTSELYG-DWDKFAQILHNLKDN--EPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTH 77 (160) Q Consensus 1 mtselyg-dwdkfaqilhnlkdn--epeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirth 77 (160) |... | |++.|.+++.+|+.. ......++....+++|.++..-+.-. .| ++|- T Consensus 1 Ms~~--~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~---tP--------------------VdTG 55 (144) T protein:vir:10 1 MSLG--HVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEAN---TP--------------------VKQG 55 (144) T ss_pred CCCC--CccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHh---CC--------------------CCcc Confidence 4321 2 444445555555432 22234455555555555543322111 12 1344 Q ss_pred hhhhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccc-cc-----hhhhH----HHHHH---HHH Q lcl|NC_019515. 78 KFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPAR-MP-----FHKSW----AMMEH---EIM 144 (160) Q Consensus 78 kfvdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskqpar-mp-----fhksw----ammeh---eim 144 (160) ..-.|++.-=-+-.++|.-+-|| +.-+-|-|+|||+..+|.| .| -++.| -|++. ++. T Consensus 56 ~Lr~S~~~~~~~~~~~~~~~~V~----------n~~~YA~~VE~Ghr~~~G~~v~~~~~~~~~g~V~G~~~~~~a~~~~~ 125 (144) T protein:vir:10 56 NLRRSWTAEGPTYGCGGWTIKLI----------NNAEYASYVESGHRQTPGRYVPVLKKRLVRDWVPGQFYMKKSIPQIQ 125 (144) T ss_pred hhccceeecceeeecCeeEEEEe----------cCCCcccccccceeecCCcccccCCCccccceecCccchHHHHHHHH Confidence 45555543111123455544443 2335699999999988753 11 12223 23332 222 Q ss_pred HHHHHHHHHHhhc--ccC Q lcl|NC_019515. 145 EEVSQRLLAIIEG--DLK 160 (160) Q Consensus 145 eevsqrllaiieg--dlk 160 (160) .++-+.|-..+++ ||+ T Consensus 126 ~~~~~~l~k~l~~l~d~~ 143 (144) T protein:vir:10 126 RQLPQLVTEGLWGLKDLF 143 (144) T ss_pred HHHHHHHHHHHHHHhhhc Confidence 2222222222222 333 No 87 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=67.33 E-value=0.081 Score=26.55 Aligned_cols=85 Identities=24% Similarity=0.321 Sum_probs=60.1 Q ss_pred CCcccccCHHH-HHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 1 MTSELYGDWDK-FAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 1 mtselygdwdk-faqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkf 79 (160) .-.+-...|.+ +.+.+.+.-.+.-.-+.....+|+.++..|+..|. ....|.+...-++. ..|.| +=||-|... T Consensus 114 t~~~~~~~~~~~~~~~~~~vl~g~~~a~~~L~~~G~~~~~~Ik~~I~--~~~~ppna~~Tia~--rKg~~-kPLidTG~l 188 (199) T protein:vir:80 114 TFDEKSNKWGELFEGWIDDVIHGKLSAEQVYNRLGAKIVDDIQMKIV--EIQTPAKSAATLAR--NPRKN-NPLIVTGKM 188 (199) T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHHHHHHHh--ccCCCCCCHHHHHH--hcCCC-CchHHHHHH Confidence 12233344543 44555555455567899999999999999999994 45679998887763 13554 569999999 Q ss_pred hhhhhhhhccc Q lcl|NC_019515. 80 VDSIKLVYEED 90 (160) Q Consensus 80 vdsiklvyeed 90 (160) .+||.-+-.+. T Consensus 189 ~~SIty~V~~~ 199 (199) T protein:vir:80 189 KNSVTWKVMKS 199 (199) T ss_pred HhhcceeeeeC Confidence 99998766554 No 88 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=67.23 E-value=0.26 Score=23.84 Aligned_cols=132 Identities=19% Similarity=0.239 Sum_probs=69.2 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccc-cC---Ccccchhhhhhhhhccccceeehhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQE-ID---MPALDDEYLADKVSEGYDSRILIRT 76 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqe-id---mpalddeyladkvsegydsrilirt 76 (160) |. ++=-.=+.+...|.+|....+ ..+.|.+|+.+....++-++.|. .| .+.+...|+.. ..|-.++++.++ T Consensus 1 m~-~~~~~~~~l~~ll~~L~~~~~--~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~--~~g~~~~~~~~~ 75 (149) T protein:vir:18 1 MS-ELTALQERLAGLIASLSPAAR--RKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRS--KKGRIKREMFAK 75 (149) T ss_pred Cc-hHHHHHHHHHHHHHhcCCchH--HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhh--ccCcccchhhhh Confidence 44 332222344555666654443 46899999999999999888774 22 45566666543 335566666554 Q ss_pred hhhhhhhhhhhcccCCCeEEE-EEeccCCcccccccHHHHHHHHhhcCCC----------Cccc--cchhhhHHHHHHHH Q lcl|NC_019515. 77 HKFVDSIKLVYEEDRGDGIMV-FIGVDGGTTDTGLSMQELADFIEFGTSK----------QPAR--MPFHKSWAMMEHEI 143 (160) Q Consensus 77 hkfvdsiklvyeedrgdgimv-figvdggttdtglsmqeladfiefgtsk----------qpar--mpfhkswammehei 143 (160) -..-.+++..+. .++..| |+|.+ ...|..--||... -||| |+|-. --|.|| T Consensus 76 l~~~~~l~~~~~---~~~~~v~~~Gtn----------~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s~---~d~~~I 139 (149) T protein:vir:18 76 LRTSRFMKAKGS---DSAAVVEFTGKV----------QRMARVHQYGLKDRPNRNSRDVQYEARPLLGFTR---DDEQMI 139 (149) T ss_pred hhhhhhhheeec---CceeEEEecccc----------hhhhhhhhccccccccCCCccccccccccCCCCH---HHHHHH Confidence 333334443333 334333 44433 3577778888662 3555 23321 123333 Q ss_pred HHHHHHHHHH Q lcl|NC_019515. 144 MEEVSQRLLA 153 (160) Q Consensus 144 meevsqrlla 153 (160) ++.+..-|-. T Consensus 140 ~~~i~~~l~~ 149 (149) T protein:vir:18 140 EDVIISHLGK 149 (149) T ss_pred HHHHHHHHhC Confidence 3333222222 No 89 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=65.29 E-value=0.13 Score=25.47 Aligned_cols=90 Identities=7% Similarity=0.100 Sum_probs=64.8 Q ss_pred CCcccccCHHH-HHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhc------------- Q lcl|NC_019515. 1 MTSELYGDWDK-FAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSE------------- 66 (160) Q Consensus 1 mtselygdwdk-faqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvse------------- 66 (160) ..++-...|.+ +.+.+.+.-++.-.-+.+...+|+.++..|++.|.- .+.|.+.+.-++.|=.. T Consensus 68 t~~~~~~~~~~~l~~~~~~vl~G~~~~~~~L~~~G~~a~~~Ik~~I~~--~~~ppna~sTi~~Kg~~~~~~~~~~~~~~~ 145 (189) T protein:vir:10 68 TIAAQQAAWSQQMRFYAKQIVVGQMNVEQALEGLAIVARGDVDATLAR--LKDPPLSPLTIYIRKFIKDGGVIHGYKDIM 145 (189) T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc--CCCCCCcHHHHHHhcccCcccchhhhhhhh Confidence 22344555644 555666655555678999999999999999999954 56799999888776422 Q ss_pred ------------------cccceeehhhhhhhhhhhhhhcccCC Q lcl|NC_019515. 67 ------------------GYDSRILIRTHKFVDSIKLVYEEDRG 92 (160) Q Consensus 67 ------------------gydsrilirthkfvdsiklvyeedrg 92 (160) +-.++=||-|....+||.-+-.+... T Consensus 146 ~~~~~~~~~~~~~~~~~~~~s~kPLidTG~l~~SIty~V~~k~~ 189 (189) T protein:vir:10 146 RLRSEMQQEQAKGTLNLSGVSTDPLDFTGYMRATLSYTVTKEKS 189 (189) T ss_pred hhhhhhhhhhhhccccccccCCCchhhHHHHHhhcceeeeecCC Confidence 23467899999999999866544333 No 90 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=64.98 E-value=0.13 Score=25.35 Aligned_cols=78 Identities=17% Similarity=0.344 Sum_probs=55.5 Q ss_pred CCcccccCHHH-HHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 1 MTSELYGDWDK-FAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 1 mtselygdwdk-faqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkf 79 (160) .-++-...|-+ +++.+ .+.-.-+.....+|+.++..|+..|. .++ |.+...-.+.| |.| .=||-|... T Consensus 77 t~~~~~~~~~~~l~~~~----~~~~~~~~~L~~lG~~~~~~Ik~~I~--~~~-~pna~~Ti~~K---G~~-kPLidTG~l 145 (155) T protein:vir:10 77 TIADRSAEWIKGLTVMM----TMGYDAEVAMGQIGQAMKDDIKTTIS--EWP-ADNSADWAGKK---GFN-HGLIWTSHL 145 (155) T ss_pred HHHHHHHHHHHHHHHHH----HcCCCHHHHHHHHHHHHHHHHHHHHh--cCC-CCCcHHHHHhc---CCC-CchhHHHHH Confidence 22333344532 33333 33456778899999999999999996 466 66777777765 555 569999999 Q ss_pred hhhhhhhhcc Q lcl|NC_019515. 80 VDSIKLVYEE 89 (160) Q Consensus 80 vdsiklvyee 89 (160) .+||.-+-.| T Consensus 146 ~~SIty~Vv~ 155 (155) T protein:vir:10 146 LNSVEQEIVK 155 (155) T ss_pred HHhhhhhccC Confidence 9999987777 No 91 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=62.66 E-value=0.18 Score=24.68 Aligned_cols=79 Identities=15% Similarity=0.267 Sum_probs=53.7 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) .-++-...|- +.+.+.-..+-.-+...+.+|+.++..|++.|.. ...|.+. .-.+. .|.| .=||-|.... T Consensus 77 t~~~~~~~~~---~~l~~~~~~~~~~~~~L~~lG~~~~~~Iq~~I~~--~~~p~~~-~Ti~~---KG~d-~PLidTG~l~ 146 (155) T protein:vir:77 77 TIADRSAEWI---KGLTVMMTMGYDAEVAMGQIGQAMKDDIKTTISE--WPADNNA-DWAGK---KGFN-HGLIWTSHLL 146 (155) T ss_pred HHHHHHHHHH---HHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHhc--CCCCCCh-HHHHh---cCCC-CchhHHHHHH Confidence 2233344443 2333333345567889999999999999999974 4567653 34444 4665 6799999999 Q ss_pred hhhhhhhcc Q lcl|NC_019515. 81 DSIKLVYEE 89 (160) Q Consensus 81 dsiklvyee 89 (160) +||.-+-.| T Consensus 147 ~SIty~Vv~ 155 (155) T protein:vir:77 147 NSIEQEIVK 155 (155) T ss_pred HhhhhhccC Confidence 999987777 No 92 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=62.16 E-value=0.17 Score=24.74 Aligned_cols=82 Identities=20% Similarity=0.195 Sum_probs=59.3 Q ss_pred CCccc-ccCHHH-HHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhh Q lcl|NC_019515. 1 MTSEL-YGDWDK-FAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHK 78 (160) Q Consensus 1 mtsel-ygdwdk-faqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthk 78 (160) .|.+- -..|.+ +.+.+.++..++-..+.+...+|+.++..|++-|.- ..-|.+...-++.| |. +.=||-|-. T Consensus 110 ~t~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~~G~~~~~~ik~~I~~--~~~ppna~~Ti~~K---G~-~~PLidTG~ 183 (193) T protein:vir:96 110 YAWNLFSADRAAIQNRIAMRLARGQITPDQALAQIGLALEGYIARSIRT--GPWVANSASTVRRK---GF-NRPLVDTAH 183 (193) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc--CCCCCCcHHHHHHh---CC-CCchhHHHH Confidence 22111 122333 334556666777778999999999999999999954 55699999999977 44 467999999 Q ss_pred hhhhhhhhhc Q lcl|NC_019515. 79 FVDSIKLVYE 88 (160) Q Consensus 79 fvdsiklvye 88 (160) ..+||.-+-- T Consensus 184 l~~SIty~Vv 193 (193) T protein:vir:96 184 MLQSISSRVT 193 (193) T ss_pred HHhhhcceeC Confidence 9999974332 No 93 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=61.47 E-value=0.19 Score=24.53 Aligned_cols=79 Identities=15% Similarity=0.267 Sum_probs=53.0 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) .-++-...|- +.|.+.-...-.-+...+.+|+.++..|++.|.. ...|.+. .-.+. .|+| .=||-|.... T Consensus 77 t~~~~~~~~~---~~l~~~~~~~~~~~~~L~~~G~~~~~~Ik~~I~~--~~~p~~~-~Ti~~---KG~~-~PLidTG~l~ 146 (155) T protein:vir:10 77 TIADRSAEWI---KGLTVMMTMGYDAEVAMGQIGQAMKDDIKTTISE--WPADNNA-DWAGK---KGFN-HGLIWTSHLL 146 (155) T ss_pred HHHHHHHHHH---HHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhc--CCCCCCh-HHHHh---cCCC-CchHHHHHHH Confidence 1223333343 2333333345566889999999999999999974 4567653 34443 4665 6799999999 Q ss_pred hhhhhhhcc Q lcl|NC_019515. 81 DSIKLVYEE 89 (160) Q Consensus 81 dsiklvyee 89 (160) +||.-+-.+ T Consensus 147 ~Sity~Vv~ 155 (155) T protein:vir:10 147 NSIEQEIVK 155 (155) T ss_pred HhhhhhccC Confidence 999987666 No 94 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=61.21 E-value=0.31 Score=23.35 Aligned_cols=129 Identities=17% Similarity=0.269 Sum_probs=62.1 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHH-HHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhh-- Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQK-IAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTH-- 77 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqk-iaekiremiegqeidmpalddeyladkvsegydsrilirth-- 77 (160) |.+-- ---+.|.+-+.+|....++-...|-..|-+ +++++.+-- |. ..|+.| .|- T Consensus 1 M~~~~-d~l~e~~~~v~kl~~~~~~~~~katkAGAkv~~~~L~~~t-------p~-----------~h~~~r---~t~~~ 58 (140) T protein:vir:48 1 MTGLD-EALEGWLKTVASIGDLTPAEQAKITTAGAKVFKKELAEVT-------RE-----------KHYSKK---KDLKY 58 (140) T ss_pred CccHH-HHHHHHHHHHHHhccCCHHHHHHHHHHhHHHHHHHHHHhc-------cc-----------CCCCCC---CCCCC Confidence 66522 113333444455555455555555555544 344444322 11 112221 111 Q ss_pred -hhhhhhhhh-hccc-CCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCCccccchhhhHH--HHHHHHH---HHHHH Q lcl|NC_019515. 78 -KFVDSIKLV-YEED-RGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQPARMPFHKSWA--MMEHEIM---EEVSQ 149 (160) Q Consensus 78 -kfvdsiklv-yeed-rgdgimvfigvdggttdtglsmqeladfiefgtskqparmpfhkswa--mmeheim---eevsq 149 (160) -..|+|+.- +.-| ..||.. .+|.|..+.. -+|-|+++||+++|+.-=.-+.-. -.+-++. .++-+ T Consensus 59 ~HlaD~I~~~~~~idg~~dG~s-~VG~~k~~~a------~~a~f~NdGT~k~~~~hFve~t~~e~~~~~~vl~A~~~~y~ 131 (140) T protein:vir:48 59 GHMADGLAVQSTNVDGRKNGVA-TVGWKNNYHA------QNARRLNDGTKKYRADHFVTNVQNDSAVRDKVLLAEKEEYE 131 (140) T ss_pred Ccccccceecccccccccccce-eecccCCCce------eEEeecccCccccCCCchHHHHHHhhhhHHHHHHHHHHHHH Confidence 145666543 2211 124444 5787764322 368999999999998642222221 1233443 34456 Q ss_pred HHHHHhhcc Q lcl|NC_019515. 150 RLLAIIEGD 158 (160) Q Consensus 150 rllaiiegd 158 (160) ++|.---|| T Consensus 132 ~~l~kk~~~ 140 (140) T protein:vir:48 132 KLIRKKGGE 140 (140) T ss_pred HHHHhhcCC Confidence 666655677 No 95 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=59.38 E-value=0.24 Score=24.03 Aligned_cols=82 Identities=16% Similarity=0.180 Sum_probs=44.0 Q ss_pred Ccccchhhhhhhhhccccceeehhhhhhhhhhhhhhccc-CCCeEEEEEecc------------------CCcccccccH Q lcl|NC_019515. 52 MPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLVYEED-RGDGIMVFIGVD------------------GGTTDTGLSM 112 (160) Q Consensus 52 mpalddeyladkvsegydsrilirthkfvdsiklvyeed-rgdgimvfigvd------------------ggttdtglsm 112 (160) |-......+ +..-++. .-.+..|-+|+- ...-+.|+++ T Consensus 1 ~~~~~~~g~-----------------------~~~~~~~~~l~~~~v~vG~l~~a~yp~G~~~~~~~~~~~~~~~~g~~v 57 (168) T protein:vir:94 1 MTTIARKGV-----------------------KMPPHLEAQFQSGEVKAGVLSGSTYPQMTYTDQRTGKQIEDARGGMPV 57 (168) T ss_pred Cccccchhh-----------------------hhhHHHHHhhhccceeeeccccCcccccccchhhcccccccccccccH Confidence 222222111 1111100 001223444431 2344778999 Q ss_pred HHHHHHHhhcCCCCccccchhhhHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_019515. 113 QELADFIEFGTSKQPARMPFHKSWAMMEHEIMEEVSQRLLAIIEGDLK 160 (160) Q Consensus 113 qeladfiefgtskqparmpfhkswammeheimeevsqrllaiiegdlk 160 (160) ..+|-.-|||+...|+|-=|....+. -.++..+.+-.+++|++. T Consensus 58 a~Ia~~~E~G~~~IP~RPFlr~t~~~----~~~~~~~~~~~~~~~~~~ 101 (168) T protein:vir:94 58 AVIAQALEYGHGQNHPRPFMQQTYAA----QYRAWSRDLTLTLKAGAA 101 (168) T ss_pred HHHHHHHhcCCCCCCCchhhHHHHHH----HHHHHHHHHHHHHhcCCC Confidence 99999999999999999544444332 234555566666777666 No 96 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=55.16 E-value=0.1 Score=26.02 Aligned_cols=114 Identities=19% Similarity=0.269 Sum_probs=46.1 Q ss_pred cCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhcccc-ceeehhhhhhhhhhhhhhcccCCCeEEEEE Q lcl|NC_019515. 21 DNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYD-SRILIRTHKFVDSIKLVYEEDRGDGIMVFI 99 (160) Q Consensus 21 dnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegyd-srilirthkfvdsiklvyeedrgdgimvfi 99 (160) -|+-+|++-.+.+-.++-.++.+-++--.+.+ +..+.|+.- ..-=++|...-.||+.-+-. .|..+.| T Consensus 1 ~~~~~f~~~~~~~~~~~~k~~~~~~~~~a~~~--------~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~---~g~~~~V 69 (141) T protein:vir:78 1 MNEFEFDSNIPKARKLIEKKVLQALEDIGEHM--------TTELAEGGHGVTSNNDTGEYAQKSGYKVRK---SSKEVIV 69 (141) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhhhhccccccchhhcceeeeeec---CCcEEEE Confidence 56666666555554444444433322110100 011111110 00114566666777644322 2333445 Q ss_pred eccCCcccccccHHHHHHHHhhcCCC--------------------------CccccchhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019515. 100 GVDGGTTDTGLSMQELADFIEFGTSK--------------------------QPARMPFHKSWAMMEHEIMEEVSQRLLA 153 (160) Q Consensus 100 gvdggttdtglsmqeladfiefgtsk--------------------------qparmpfhkswammeheimeevsqrlla 153 (160) |.+ .+-|-|+||||+. |||+-=+-++....+-+| .+.-++++. T Consensus 70 ~~~----------~~YA~yVE~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i-~~~i~~~~~ 138 (141) T protein:vir:78 70 GNS----------SDYAIYYEFGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKV-RVFTERALR 138 (141) T ss_pred ecC----------CCccceeecCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHH-HHHHHHHhh Confidence 431 3478899999943 555422223333322222 112222221 Q ss_pred HhhcccC Q lcl|NC_019515. 154 IIEGDLK 160 (160) Q Consensus 154 iiegdlk 160 (160) .|. T Consensus 139 ----~l~ 141 (141) T protein:vir:78 139 ----GIN 141 (141) T ss_pred ----ccC Confidence 122 No 97 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=54.01 E-value=0.14 Score=25.19 Aligned_cols=87 Identities=20% Similarity=0.240 Sum_probs=48.2 Q ss_pred hHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhhhhhhhhhcccCCCeEEEEEeccCCc Q lcl|NC_019515. 26 YDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLVYEEDRGDGIMVFIGVDGGT 105 (160) Q Consensus 26 yddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfvdsiklvyeedrgdgimvfigvdggt 105 (160) -+.+++.+-.+.++.+.+.+.- -.| .+|...-+||+...+. +|+-.-||. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~---~aP--------------------v~TG~Lr~SI~~~~~~---~~~~~~V~~---- 50 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIIS---LMP--------------------VDTGYLRESVTMDFKD---GGFTGVINI---- 50 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHH---hCC--------------------cCcccccccceEEeec---CcEEEEEec---- Confidence 4555665555666555544321 122 2466666777654432 344333431 Q ss_pred ccccccHHHHHHHHhhc-----------------------------CCCCccccchhhhHHHHHHHHHHHHH Q lcl|NC_019515. 106 TDTGLSMQELADFIEFG-----------------------------TSKQPARMPFHKSWAMMEHEIMEEVS 148 (160) Q Consensus 106 tdtglsmqeladfiefg-----------------------------tskqparmpfhkswammeheimeevs 148 (160) -.+-|-|+||| |..+||+-=|-.+|.-..-.|...+| T Consensus 51 ------~~~YA~yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 51 ------GSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ------CCCcccccccCCcccccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 13578899999 66678774445566555555555554 No 98 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=54.01 E-value=0.14 Score=25.19 Aligned_cols=87 Identities=20% Similarity=0.240 Sum_probs=48.2 Q ss_pred hHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhhhhhhhhhcccCCCeEEEEEeccCCc Q lcl|NC_019515. 26 YDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLVYEEDRGDGIMVFIGVDGGT 105 (160) Q Consensus 26 yddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfvdsiklvyeedrgdgimvfigvdggt 105 (160) -+.+++.+-.+.++.+.+.+.- -.| .+|...-+||+...+. +|+-.-||. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~---~aP--------------------v~TG~Lr~SI~~~~~~---~~~~~~V~~---- 50 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIIS---LMP--------------------VDTGYLRESVTMDFKD---GGFTGVINI---- 50 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHH---hCC--------------------cCcccccccceEEeec---CcEEEEEec---- Confidence 4555665555666555544321 122 2466666777654432 344333431 Q ss_pred ccccccHHHHHHHHhhc-----------------------------CCCCccccchhhhHHHHHHHHHHHHH Q lcl|NC_019515. 106 TDTGLSMQELADFIEFG-----------------------------TSKQPARMPFHKSWAMMEHEIMEEVS 148 (160) Q Consensus 106 tdtglsmqeladfiefg-----------------------------tskqparmpfhkswammeheimeevs 148 (160) -.+-|-|+||| |..+||+-=|-.+|.-..-.|...+| T Consensus 51 ------~~~YA~yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 51 ------GSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ------CCCcccccccCCcccccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 13578899999 66678774445566555555555554 No 99 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=51.77 E-value=0.57 Score=21.91 Aligned_cols=131 Identities=17% Similarity=0.260 Sum_probs=77.1 Q ss_pred ccCHHHHHHHHhhhccC-Cc-chHHHHHHHHHHHHHHHHHHhcccccC----Ccccchhhhhhhhhccccceeehhhhhh Q lcl|NC_019515. 6 YGDWDKFAQILHNLKDN-EP-EYDDVIRSVGQKIAEKIREMIEGQEID----MPALDDEYLADKVSEGYDSRILIRTHKF 79 (160) Q Consensus 6 ygdwdkfaqilhnlkdn-ep-eyddvirsvgqkiaekiremiegqeid----mpalddeyladkvsegydsrilirthkf 79 (160) --|-+.+.+.|+.|-.+ +| .-..+.|.+|+.+....++-++.|.=- .+.+...|+..|- |..+++|.++-.. T Consensus 1 ~~~~~~l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~--g~~~~~l~~~~~l 78 (150) T protein:vir:20 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKT--GRVKRKMFAKLIT 78 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhc--cCCCccccchhhh Confidence 34556666666655433 33 345678999999999999988877521 4566777877663 5667888888777 Q ss_pred hhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCC----------Cccc--cchhhhHHHHHHHHHHHH Q lcl|NC_019515. 80 VDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSK----------QPAR--MPFHKSWAMMEHEIMEEV 147 (160) Q Consensus 80 vdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtsk----------qpar--mpfhkswammeheimeev 147 (160) -.||+..+. .++..| |+-.| |....|-.--||... -||| ++|-. --|.||. T Consensus 79 ~~sl~~~~~---~~~~~v--g~~~G------s~~~yAa~HQfG~~~~~~~~~~~~~iPaRp~LG~s~---~d~~~i~--- 141 (150) T protein:vir:20 79 SRFLHIRAS---PEQASM--EFYGG------KSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTG---EDVQMIE--- 141 (150) T ss_pred hhhhheeec---CcEEEE--EeeCC------cchhhhhhhhcccccccccCCCceeccccccCCCCH---HHHHHHH--- Confidence 777775443 344443 33233 345678888899653 3555 23332 1122222 Q ss_pred HHHHHHHhhcccC Q lcl|NC_019515. 148 SQRLLAIIEGDLK 160 (160) Q Consensus 148 sqrllaiiegdlk 160 (160) .+|.--|+ T Consensus 142 -----~~i~~~l~ 149 (150) T protein:vir:20 142 -----EIILAHLE 149 (150) T ss_pred -----HHHHHHHh Confidence 22222222 No 100 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=50.83 E-value=0.26 Score=23.84 Aligned_cols=79 Identities=25% Similarity=0.299 Sum_probs=42.3 Q ss_pred eeehhhhhhhhhhhhhhcccCCCeEEEEEeccCCc-------------------ccccccHHHHHHHHhhcCCCCccccc Q lcl|NC_019515. 71 RILIRTHKFVDSIKLVYEEDRGDGIMVFIGVDGGT-------------------TDTGLSMQELADFIEFGTSKQPARMP 131 (160) Q Consensus 71 rilirthkfvdsiklvyeedrgdgimvfigvdggt-------------------tdtglsmqeladfiefgtskqparmp 131 (160) --+.|.+ ++-++.+-+ +.-|-+|+=.|+ -..|+++..+|-.-|||+...|+|-= T Consensus 1 m~v~~k~-----L~~~~~~l~--~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~~~IP~RPF 73 (155) T protein:vir:10 1 MSVTRRG-----LTLPKDRYR--SMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGTSKLPARPF 73 (155) T ss_pred CcchHHH-----HHHHHHHHh--CCeeEEeecCCCCCccccchhhhhhhhcccccccCCcHHHHHHHHhcCCCCCCCcch Confidence 0011111 222222222 223445543332 12489999999999999999999965 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_019515. 132 FHKSWAMMEHEIMEEVSQRLLAIIEGDLK 160 (160) Q Consensus 132 fhkswammeheimeevsqrllaiiegdlk 160 (160) |...++.-.-++.+.+.+. +.+.+. T Consensus 74 lr~t~~~~~~~~~~~l~~~----~~~~~~ 98 (155) T protein:vir:10 74 MEKTIADRSAEWIKGLTVM----MTMGYD 98 (155) T ss_pred hHHHHHHHHHHHHHHHHHH----HHcCCC Confidence 5666665554554444433 333333 No 101 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=50.18 E-value=0.25 Score=23.85 Aligned_cols=79 Identities=24% Similarity=0.293 Sum_probs=42.8 Q ss_pred eeehhhhhhhhhhhhhhcccCCCeEEEEEeccCCc-------------------ccccccHHHHHHHHhhcCCCCccccc Q lcl|NC_019515. 71 RILIRTHKFVDSIKLVYEEDRGDGIMVFIGVDGGT-------------------TDTGLSMQELADFIEFGTSKQPARMP 131 (160) Q Consensus 71 rilirthkfvdsiklvyeedrgdgimvfigvdggt-------------------tdtglsmqeladfiefgtskqparmp 131 (160) --+.|.+ ++-++.+-+ +.-|-+|+=.|+ ...|+++..+|-.-|||+...|+|-= T Consensus 1 m~v~~k~-----L~~~~~~l~--~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~~~IP~RPF 73 (155) T protein:vir:78 1 MSVTRRG-----LTLPKDRYR--SMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGTSKLPARPF 73 (155) T ss_pred CcchHHH-----HHHHHHHHh--CCeeEEeecCCCCCCcccchhhhhhhhcccccccCCcHHHHHHhhhcCCCCCCCcch Confidence 0111111 222222222 233445553332 12489999999999999999999966 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHhhcccC Q lcl|NC_019515. 132 FHKSWAMMEHEIMEEVSQRLLAIIEGDLK 160 (160) Q Consensus 132 fhkswammeheimeevsqrllaiiegdlk 160 (160) |...++.-.-++.+.+.+.+ .+.+. T Consensus 74 lr~t~~~~~~~~~~~l~~~~----~~~~~ 98 (155) T protein:vir:78 74 MEKTITDRSAEWIKGLTVMM----TMGYD 98 (155) T ss_pred hhHHHHHHHHHHHHHHHHHH----HcCCC Confidence 66666655555555444433 33333 No 102 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=42.22 E-value=0.4 Score=22.76 Aligned_cols=87 Identities=20% Similarity=0.238 Sum_probs=44.7 Q ss_pred hHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhhhhhhhhhcccCCCeEEEEEeccCCc Q lcl|NC_019515. 26 YDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFVDSIKLVYEEDRGDGIMVFIGVDGGT 105 (160) Q Consensus 26 yddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfvdsiklvyeedrgdgimvfigvdggt 105 (160) -+.+++..-.+.++.+.+...- ..|. +|...-+||+...+. +|+-.-||. + T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~---~apv--------------------~TG~Lr~SI~~~~~~---~~~~~~V~~---~ 51 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIIS---LMPV--------------------DTGYLRESVTMDFKD---GGFTGVINI---G 51 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHh---hCCc--------------------cccccccceeEEeec---CcEEEEEec---C Confidence 4455555555555555443311 1221 355666677654433 343333332 1 Q ss_pred ccccccHHHHHHHHhhc-----------------------------CCCCccccchhhhHHHHHHHHHHHHH Q lcl|NC_019515. 106 TDTGLSMQELADFIEFG-----------------------------TSKQPARMPFHKSWAMMEHEIMEEVS 148 (160) Q Consensus 106 tdtglsmqeladfiefg-----------------------------tskqparmpfhkswammeheimeevs 148 (160) .+-|-|+||| |..|||+-=|-.+|.--.-.|+..+| T Consensus 52 -------~~Ya~yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 52 -------SEYAIYVNYGTGIYATGAGGSRAKNIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred -------CCccceeecCccccccCCCccccccccceeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 2467899999 55677774344555555555555555 No 103 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=40.17 E-value=0.86 Score=20.94 Aligned_cols=88 Identities=17% Similarity=0.314 Sum_probs=59.0 Q ss_pred hhccccceeehhhhhhhhhhhhhhcc-cCCCeEEEE-EeccCCcccccccHHHHHHHHhhc------------------- Q lcl|NC_019515. 64 VSEGYDSRILIRTHKFVDSIKLVYEE-DRGDGIMVF-IGVDGGTTDTGLSMQELADFIEFG------------------- 122 (160) Q Consensus 64 vsegydsrilirthkfvdsiklvyee-drgdgimvf-igvdggttdtglsmqeladfiefg------------------- 122 (160) |.+.---|+=.+|-+.-+||...|.. +..||.-++ +|..-.+.--| -++||| T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhg-------hlvE~Ghw~~~~~~~~~dG~w~~~~ 73 (119) T protein:vir:81 1 MRESAKAFVNDETGKLRSNLYVAYSPEESTNGVQTYAVSWRKKAAPHG-------HLLEFGHWQTHAAYKGKDGEWYSSS 73 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCeEEEEeeccCCcCCcc-------cccccceeeeeeeeeccCceeeecC Confidence 33333445666788999999999865 556786655 33333333222 356899 Q ss_pred -----CCCCccccchh-----hhHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_019515. 123 -----TSKQPARMPFH-----KSWAMMEHEIMEEVSQRLLAIIEGDL 159 (160) Q Consensus 123 -----tskqparmpfh-----kswammeheimeevsqrllaiiegdl 159 (160) +++-||+ ||- ..-+-++...+....||+..+..|+- T Consensus 74 ~~l~~~~~vPa~-pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:81 74 VKLVNPKWIPAR-PFLRPGYDSVAMQIPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred ccccCceecCCC-CccchhHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 7777776 343 23445777788889999999999988 No 104 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=36.93 E-value=1.1 Score=20.26 Aligned_cols=150 Identities=14% Similarity=0.138 Sum_probs=66.2 Q ss_pred CCccc-ccCHHHHHHHHhhhccC---CcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhh Q lcl|NC_019515. 1 MTSEL-YGDWDKFAQILHNLKDN---EPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRT 76 (160) Q Consensus 1 mtsel-ygdwdkfaqilhnlkdn---epeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirt 76 (160) |.+.. +-++++|++-|..+-.. +..-+++.+.+|.++..++.+..--...+.+.-----...++..-.-|-.-..| T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~t 80 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQG 80 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcccccccc Confidence 76654 45677777777654221 223466666666666555555443222222210000000011111111111234 Q ss_pred hhhhhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCCC-----ccccchhhhHHHHHHHHHHHHHHHH Q lcl|NC_019515. 77 HKFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSKQ-----PARMPFHKSWAMMEHEIMEEVSQRL 151 (160) Q Consensus 77 hkfvdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtskq-----parmpfhkswammeheimeevsqrl 151 (160) -..-.|.+.-=-...|++..|-|. +..+-|-++|+|--.. |.+-.+.+|=..++.++-+.+-+.| T Consensus 81 G~lr~swk~~~~~k~~~~~~v~v~----------N~~~YA~~VE~GHR~~~gGfV~G~fml~~s~~~~~~~~~~~~e~~l 150 (163) T protein:vir:10 81 GTLQKGWSKSRIEVSGRTYKQKVY----------NKVYYAPHVEYGHKTVNGGFVPGQFFLHKTVEDTKSDMEKRVRDKY 150 (163) T ss_pred chhhccceecceeecCCceEEEEE----------ecCCccchhhcceeecCCceeccchhhHHHHHHHHHHHHHHHHHHH Confidence 444444443111123333333221 3345689999996544 4445556665555555544444444 Q ss_pred HHHhh----cccC Q lcl|NC_019515. 152 LAIIE----GDLK 160 (160) Q Consensus 152 laiie----gdlk 160 (160) -.++. |.-| T Consensus 151 ~~~l~k~~~~~~~ 163 (163) T protein:vir:10 151 DGFMRKVVLGNGK 163 (163) T ss_pred HHHHHHhhcCCCC Confidence 44433 3333 No 105 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=36.05 E-value=0.94 Score=20.73 Aligned_cols=89 Identities=25% Similarity=0.339 Sum_probs=57.8 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccCCcccchhhhhhhhhccccceeehhhhhhh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEIDMPALDDEYLADKVSEGYDSRILIRTHKFV 80 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeidmpalddeyladkvsegydsrilirthkfv 80 (160) ..++-...| .+.+.++-...-.-+.+...+|+.++..|+..|.. +. |.+...-++.| |.| +=||-|-... T Consensus 80 t~~~~~~~~---~~~~~~~~~~~~~~~~~L~~lG~~~~~~Ik~~I~~--~~-ppna~sTi~~K---G~~-~PLiDTG~l~ 149 (168) T protein:vir:94 80 TYAAQYRAW---SRDLTLTLKAGAAADTALRTVGQRMAEDIQDTIRN--WP-ADNSPEWAAIK---GFN-AGLRQTGVLL 149 (168) T ss_pred HHHHHHHHH---HHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHhhc--CC-CCccHHHHHhc---CCC-CchhHHHHHH Confidence 122333334 34444444455667899999999999999999964 55 88999999876 554 5699999999 Q ss_pred hhhhhhhcccCCCeEEEEEeccCCcccccccHHH Q lcl|NC_019515. 81 DSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQE 114 (160) Q Consensus 81 dsiklvyeedrgdgimvfigvdggttdtglsmqe 114 (160) +||.-+--+|.. .|.+ ..| T Consensus 150 ~SIty~Vv~d~~----------~~~~-----~~~ 168 (168) T protein:vir:94 150 NAIDSAVIIDGE----------HGEA-----PRE 168 (168) T ss_pred hhcceeeeecCC----------CCCC-----CCC Confidence 999864322211 1111 111 No 106 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=35.64 E-value=1.2 Score=20.11 Aligned_cols=140 Identities=16% Similarity=0.200 Sum_probs=69.3 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccC-----Ccccchhhhhhhhhccccceeehh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEID-----MPALDDEYLADKVSEGYDSRILIR 75 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeid-----mpalddeyladkvsegydsrilir 75 (160) |+.++=--=+.+.++|.+|+...+ .++.|++|..+....++-|+.|. + .+.+...++..+-..+.-...+.+ T Consensus 1 m~~~~~~l~~~L~~ll~~L~~~~~--~~l~r~Ig~~l~~~t~~Rf~~q~-~PdG~~W~p~~~~~~~~~~~~~~~~~~m~~ 77 (156) T protein:vir:11 1 MADSLEALEDWAGPILRALEPGPR--AALARSLARDLRRSQQKRVMAQR-NPDGSAYEPRKKRELRGKQGRIRRKIKMFQ 77 (156) T ss_pred CchhHHHHHHHHHHHHHhcCCcch--HHHHHHHHHHHHHHHHHHHHhhc-CCCCCCCcccchHHHhhhccccccchhhhh Confidence 776554444556666777765544 56899999999988888888775 3 234555555444332222222221 Q ss_pred hhhhhhhhhhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCC----------CccccchhhhHHHHHHHHHH Q lcl|NC_019515. 76 THKFVDSIKLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSK----------QPARMPFHKSWAMMEHEIME 145 (160) Q Consensus 76 thkfvdsiklvyeedrgdgimvfigvdggttdtglsmqeladfiefgtsk----------qparmpfhkswammeheime 145 (160) .-...-+|+..+ ..|+..| |+- | |....|-.--||..- -||| ||-- +..+-.+ T Consensus 78 ~l~~~~~l~~~~---~~~~a~v--g~~-G------s~~~yA~iHQfG~~~~~~~~~~~v~iPaR-p~LG----~s~~d~~ 140 (156) T protein:vir:11 78 KLRTVRYLRAKG---DAQAITV--SFA-G------RIARIARVHQYGLRDRAEPGAPEVSYAQR-LLLG----FDSSDME 140 (156) T ss_pred hhhhhheeeeee---cCcEEEE--Eec-C------CchhhhhhhcccccccccCCCCccccccc-ccCC----CCHHHHH Confidence 111111233222 3445544 221 2 234677777888762 3555 2211 1112223 Q ss_pred HHHHHHHHHhhcccC Q lcl|NC_019515. 146 EVSQRLLAIIEGDLK 160 (160) Q Consensus 146 evsqrllaiiegdlk 160 (160) ||..-++..+.+..- T Consensus 141 ~i~~~i~~~l~~~~~ 155 (156) T protein:vir:11 141 TIQNGILAHIDANSP 155 (156) T ss_pred HHHHHHHHHHhhcCC Confidence 333333333444444 No 107 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=35.49 E-value=1.2 Score=20.14 Aligned_cols=88 Identities=17% Similarity=0.315 Sum_probs=58.3 Q ss_pred hhccccceeehhhhhhhhhhhhhhcc-cCCCeEEEE-EeccCCcccccccHHHHHHHHhhc------------------- Q lcl|NC_019515. 64 VSEGYDSRILIRTHKFVDSIKLVYEE-DRGDGIMVF-IGVDGGTTDTGLSMQELADFIEFG------------------- 122 (160) Q Consensus 64 vsegydsrilirthkfvdsiklvyee-drgdgimvf-igvdggttdtglsmqeladfiefg------------------- 122 (160) |.+.---|+=-+|-+.-+||...|.. +..||.-++ +|..-.+.--| -++||| T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPhg-------hlvE~Ghw~~~~~~~~~dG~w~~~~ 73 (119) T protein:vir:10 1 MRESAKAFVNDETGKLRSNLYVAYSTEESTNGVQTYAVSWRKKAAPHG-------HLLEFGHWQTHAAYKGKDGEWYSSS 73 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCEEEEEeecCCCcCCcc-------cccccceeeeeeeeeccCceeeecC Confidence 33333445666788999999999864 556786655 33333333222 356888 Q ss_pred -----CCCCccccchh-----hhHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_019515. 123 -----TSKQPARMPFH-----KSWAMMEHEIMEEVSQRLLAIIEGDL 159 (160) Q Consensus 123 -----tskqparmpfh-----kswammeheimeevsqrllaiiegdl 159 (160) +++-||+ ||- ..-+-++...+....||+..+..|+- T Consensus 74 ~~l~~~~~vPa~-pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:10 74 VKLVNPKWIPAR-PFLRPGYDSVAMQIPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred ccccCceecCCC-CccchhHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 6666665 332 33445777788889999999999988 No 108 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=35.35 E-value=1.2 Score=20.08 Aligned_cols=134 Identities=11% Similarity=0.171 Sum_probs=60.1 Q ss_pred CCcccccCHHHHHHHHhhhccC--CcchHHHHHHHHHHHHHHHHHHhcccccC-----Ccccchhhhhhhhhccccceee Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDN--EPEYDDVIRSVGQKIAEKIREMIEGQEID-----MPALDDEYLADKVSEGYDSRIL 73 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdn--epeyddvirsvgqkiaekiremiegqeid-----mpalddeyladkvsegydsril 73 (160) |+.+ ...+.+.|..|-.+ .+.-..+.|++|+.+....++-|+.|. + .+++...+...+..... -. T Consensus 1 m~~~----~~~l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~-~PDG~~W~prk~~~~~~~~~~~~---g~ 72 (155) T protein:vir:79 1 MTDD----LQALERWAGGLLAKLSPAARRQLLRELGRDLRRAQQSRVAAQR-NPDGSAYEPRKVKAGGKRLREKA---GR 72 (155) T ss_pred CchH----HHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhc-CCCCCCCcccchhhhhhhhhccc---Cc Confidence 7764 34444444443332 123356899999999888888887665 3 23344444333322110 11 Q ss_pred hhhhhhhhhh----hhhhcccCCCeEEEEEeccCCcccccccHHHHHHHHhhcCCC----------CccccchhhhHHHH Q lcl|NC_019515. 74 IRTHKFVDSI----KLVYEEDRGDGIMVFIGVDGGTTDTGLSMQELADFIEFGTSK----------QPARMPFHKSWAMM 139 (160) Q Consensus 74 irthkfvdsi----klvyeedrgdgimvfigvdggttdtglsmqeladfiefgtsk----------qparmpfhkswamm 139 (160) ++.+...+++ .|.++- ..|+..| |. .| |....|..--||... -||| ||---=.-- T Consensus 73 ~~~~~m~~~l~~a~~l~~~~-~~d~a~V--g~-~G------s~~~yAaiHQfG~~~r~~~~~~~v~iPaR-p~LGls~~d 141 (155) T protein:vir:79 73 VKREAMFRKLRTARYLRIDV-DSTGLAI--GF-DE------RLSRIARVHQEGQKAPVEPGGPLAQYPVR-VVLGFSDAD 141 (155) T ss_pred ccchhhhhhhhhhheeeeee-cCcEEEE--Ee-cC------cchhhhhhhhcCCcccCCCCCcccccccc-cccCCCHHH Confidence 1111111111 122332 3466554 33 22 335577888888653 3555 332111112 Q ss_pred HHHHHHHHHHHHHH Q lcl|NC_019515. 140 EHEIMEEVSQRLLA 153 (160) Q Consensus 140 eheimeevsqrlla 153 (160) +.||++-+..-|-. T Consensus 142 ~~~I~~~i~~~l~r 155 (155) T protein:vir:79 142 RELVRDRLLRELTR 155 (155) T ss_pred HHHHHHHHHHHhhC Confidence 23333322222211 No 109 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=31.97 E-value=1.5 Score=19.68 Aligned_cols=131 Identities=17% Similarity=0.215 Sum_probs=64.9 Q ss_pred CCcccccCHHHHHHHHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccC-----Ccccchhhhhhhhhccccceeehh Q lcl|NC_019515. 1 MTSELYGDWDKFAQILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEID-----MPALDDEYLADKVSEGYDSRILIR 75 (160) Q Consensus 1 mtselygdwdkfaqilhnlkdnepeyddvirsvgqkiaekiremiegqeid-----mpalddeyladkvsegydsrilir 75 (160) |+.++=--=..++.+|.+|+...+ ..+.|.+|..+....++-|+.|. + .+.+...|. ..+..++ T Consensus 1 M~~~~~~~~~~L~~ll~~L~~~~r--~~l~~~Ig~~l~~~t~~Rf~~q~-~PDG~pW~p~k~~~~--------~~k~~~~ 69 (152) T protein:vir:10 1 MSEPIEQVKTAFDSLLNNISKPRR--RLMYQQIGRELARSQRRRIKAQQ-NPDGSAYEPRKKPKK--------GVKSKIK 69 (152) T ss_pred CchHHHHHHHHHHHHHHhcCcchH--HHHHHHHHHHHHHHHHHHHHhcc-CCCCCCCchhhhhhh--------hhccccc Confidence 877644333445666667765443 46899999999999999998876 3 222222222 2233444 Q ss_pred hhhhhhhhh----hhhcccCCCeEEE-EEeccCCcccccccHHHHHHHHhhcCC-----------CCccccchhhhHHHH Q lcl|NC_019515. 76 THKFVDSIK----LVYEEDRGDGIMV-FIGVDGGTTDTGLSMQELADFIEFGTS-----------KQPARMPFHKSWAMM 139 (160) Q Consensus 76 thkfvdsik----lvyeedrgdgimv-figvdggttdtglsmqeladfiefgts-----------kqparmpfhkswamm 139 (160) +.+-..++. +.|. -..|+..| |+|. ....|..--||-. +-||| ||---=.-- T Consensus 70 ~~~m~~~L~~a~~l~~~-a~~~~~~Vg~~Gt----------~~~yAaiHQfG~~~r~~~~~~~~v~iPaR-p~LG~s~~d 137 (152) T protein:vir:10 70 SGKMFDKITQPRFMRLR-LESEGVSLGYEGG----------DAVIARIHQQGLIGRVRKDWDLKVKYASR-ELLGFTDDD 137 (152) T ss_pred chhHHHhhhhcceeeee-ecCcEEEEEecCC----------chhhhhhhccCccccccCCCCcceecccc-ccCCCCHHH Confidence 444443332 2232 23456555 4432 2256666667732 23555 332111222 Q ss_pred HHHHHHHHHHHHHHHhhcc Q lcl|NC_019515. 140 EHEIMEEVSQRLLAIIEGD 158 (160) Q Consensus 140 eheimeevsqrllaiiegd 158 (160) +.||.+-+. ..+.+- T Consensus 138 ~~~I~~~i~----~~l~~a 152 (152) T protein:vir:10 138 LQMIEDYMI----NILAGS 152 (152) T ss_pred HHHHHHHHH----HHHhcC Confidence 333333322 222232 No 110 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=25.45 E-value=2.1 Score=18.87 Aligned_cols=128 Identities=18% Similarity=0.209 Sum_probs=62.2 Q ss_pred CcccccCHHHHHH----HHhhhccCCcchHHHHHHHHHHHHHHHHHHhcccccC-----Ccccchhhhhhhhhcccccee Q lcl|NC_019515. 2 TSELYGDWDKFAQ----ILHNLKDNEPEYDDVIRSVGQKIAEKIREMIEGQEID-----MPALDDEYLADKVSEGYDSRI 72 (160) Q Consensus 2 tselygdwdkfaq----ilhnlkdnepeyddvirsvgqkiaekiremiegqeid-----mpalddeyladkvsegydsri 72 (160) -++ .+.+.+ +|.+|.+... ..+.|.+|+.+....++-|+.|. | .+++..-|.+.+ |-..+. T Consensus 1 m~~----~~~l~~~L~~ll~~l~~~~~--~~l~r~Ig~~l~~st~~Rf~~q~-~PDG~~W~p~s~~~~~~~---g~~~~~ 70 (148) T protein:vir:79 1 MSE----SRELEAWLAGMLTKLDAPAR--RMLARAVAAELRRRQAARIAEQR-NPDGSPYVPRKPQLRHRA---GRIRRA 70 (148) T ss_pred Ccc----HHHHHHHHHHHHHhcCChhH--HHHHHHHHHHHHHHHHHHHHhhc-CCCCCcCcccchHHHhhc---cccccc Confidence 222 344444 4445543322 46889999999998888888776 3 234444444333 344444 Q ss_pred ehhhhhhhhhhhhhhcccCCCeEEE-EEeccCCcccccccHHHHHHHHhhcCCC----------CccccchhhhHHHHHH Q lcl|NC_019515. 73 LIRTHKFVDSIKLVYEEDRGDGIMV-FIGVDGGTTDTGLSMQELADFIEFGTSK----------QPARMPFHKSWAMMEH 141 (160) Q Consensus 73 lirthkfvdsiklvyeedrgdgimv-figvdggttdtglsmqeladfiefgtsk----------qparmpfhkswammeh 141 (160) +..+-..-.+++..+. .++..| |+|. ....|-.--||-.. -||| ||---=.--|. T Consensus 71 ~~~~l~~~~~l~~~~~---~~~~~v~~~Gt----------~~~yAaiHQfG~~~r~~~~~~~v~iPaR-p~LG~s~~d~~ 136 (148) T protein:vir:79 71 MFMRLRLARYMKTQAD---ANTAVVTFAGN----------AQRIATVHQFGLRDRVNKAGLTAQYPAR-ELLGMDGVDME 136 (148) T ss_pred ccchhhhhhheeeeee---CCeeeEEeecc----------chhhhhhhhcCccccccCCCCccccCcc-cccCCCHHHHH Confidence 4333222333433332 334443 3332 23567777788443 3665 23211122344 Q ss_pred HHHHHHHHHHHHHhhc Q lcl|NC_019515. 142 EIMEEVSQRLLAIIEG 157 (160) Q Consensus 142 eimeevsqrllaiieg 157 (160) ||++-+...| .| T Consensus 137 ~i~~~i~~~l----~~ 148 (148) T protein:vir:79 137 HITNLLLLHL----GA 148 (148) T ss_pred HHHHHHHHHh----cC Confidence 4444443333 33 Done!