Query lcl|NC_015294.1_cdsid_YP_004327201.1 [gene=PsPhPAKP1_gp056] [protein=hypothetical protein] [protein_id=YP_004327201.1] [location=31865..32344]
Match_columns 159
No_of_seqs 111 out of 191
Neff 6.5
Searched_HMMs 1612
Date Thu Nov 7 14:16:11 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_56 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_56_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:96105 Length: 193 100.0 1.3E-53 8.2E-57 310.5 16.8 148 7-156 1-193 (193)
2 protein:vir:107757 Length: 189 100.0 3.8E-53 2.4E-56 308.0 16.9 150 5-159 1-188 (189)
3 protein:vir:99546 Length: 200 100.0 5.3E-53 3.3E-56 307.2 16.6 152 3-156 1-200 (200)
4 protein:vir:5257 Length: 148 # 100.0 1.5E-52 9.1E-56 304.8 17.1 146 5-156 1-148 (148)
5 protein:vir:78607 Length: 155 100.0 1.1E-49 6.6E-53 289.1 16.1 140 9-157 1-155 (155)
6 protein:vir:106728 Length: 155 100.0 1.1E-49 7.1E-53 288.9 16.1 140 9-157 1-155 (155)
7 protein:vir:94069 Length: 168 100.0 4.9E-49 3E-52 285.4 14.7 146 7-159 1-161 (168)
8 protein:vir:80037 Length: 199 100.0 5E-49 3.1E-52 285.4 14.6 143 7-158 1-199 (199)
9 protein:vir:101563 Length: 155 100.0 1.6E-48 9.8E-52 282.7 15.6 140 9-157 1-155 (155)
10 protein:vir:77650 Length: 155 100.0 4.7E-48 2.9E-51 280.1 15.6 140 9-157 1-155 (155)
11 protein:vir:95260 Length: 160 100.0 1.7E-45 1.1E-48 266.0 16.0 148 3-159 1-156 (160)
12 protein:vir:3163 Length: 145 # 98.6 2E-10 1.3E-13 73.7 7.0 77 73-159 1-84 (145)
13 protein:vir:4347 Length: 164 # 98.4 3.6E-10 2.2E-13 72.4 2.8 103 3-115 1-164 (164)
14 protein:vir:79225 Length: 155 98.4 8.9E-10 5.5E-13 70.2 4.8 86 70-159 1-94 (155)
15 protein:vir:79091 Length: 175 98.4 9.9E-10 6.1E-13 70.0 5.0 86 70-159 1-111 (175)
16 protein:vir:99833 Length: 190 98.3 9.5E-10 5.9E-13 70.1 4.2 84 70-159 1-94 (190)
17 protein:vir:103841 Length: 155 98.3 1.4E-09 8.4E-13 69.2 4.6 86 64-159 1-94 (155)
18 protein:vir:99196 Length: 155 98.3 1.9E-09 1.2E-12 68.4 4.9 86 70-159 1-94 (155)
19 protein:vir:102875 Length: 146 98.3 3.6E-09 2.2E-12 66.9 6.0 85 3-105 1-146 (146)
20 protein:vir:107568 Length: 146 98.3 3.6E-09 2.2E-12 66.9 6.0 85 3-105 1-146 (146)
21 protein:vir:105007 Length: 146 98.3 3.6E-09 2.2E-12 66.9 6.0 85 3-105 1-146 (146)
22 protein:vir:102085 Length: 146 98.3 3.6E-09 2.2E-12 66.9 6.0 85 3-105 1-146 (146)
23 protein:vir:1386 Length: 149 # 98.2 4.8E-09 2.9E-12 66.2 6.0 87 3-122 1-149 (149)
24 protein:vir:1988 Length: 156 # 98.2 3E-09 1.9E-12 67.3 4.8 85 70-159 1-98 (156)
25 protein:vir:94538 Length: 125 98.2 2.8E-09 1.7E-12 67.5 4.4 92 3-98 1-125 (125)
26 protein:vir:1891 Length: 179 # 98.2 1.9E-09 1.2E-12 68.4 3.0 103 3-139 1-179 (179)
27 protein:vir:107851 Length: 175 98.0 9E-09 5.6E-12 64.7 3.8 86 70-159 1-111 (175)
28 protein:vir:194 Length: 149 # 98.0 2.6E-08 1.6E-11 62.1 5.4 94 2-107 1-149 (149)
29 protein:vir:93617 Length: 148 98.0 1.7E-08 1.1E-11 63.2 4.1 94 2-107 1-148 (148)
30 protein:vir:1437 Length: 140 # 97.9 5.4E-08 3.4E-11 60.4 6.5 90 1-99 1-140 (140)
31 protein:vir:100243 Length: 140 97.8 1.2E-07 7.7E-11 58.5 7.1 90 1-99 1-140 (140)
32 protein:vir:95789 Length: 114 97.8 6.6E-08 4.1E-11 59.9 5.6 85 7-96 1-114 (114)
33 protein:vir:100075 Length: 140 97.8 9.9E-08 6.2E-11 59.0 6.3 90 1-99 1-140 (140)
34 protein:vir:4704 Length: 125 # 97.8 6.9E-08 4.3E-11 59.8 5.1 77 7-96 1-125 (125)
35 protein:vir:81106 Length: 125 97.8 6.9E-08 4.3E-11 59.8 5.1 77 7-96 1-125 (125)
36 protein:vir:79988 Length: 125 97.8 6.9E-08 4.3E-11 59.8 5.1 77 7-96 1-125 (125)
37 protein:vir:98342 Length: 125 97.8 6.9E-08 4.3E-11 59.8 5.1 77 7-96 1-125 (125)
38 protein:vir:9414 Length: 125 # 97.8 6.9E-08 4.3E-11 59.8 5.1 77 7-96 1-125 (125)
39 protein:vir:1273 Length: 127 # 97.8 3.1E-08 1.9E-11 61.8 3.1 79 1-96 41-127 (127)
40 protein:vir:96358 Length: 115 97.8 3.6E-08 2.2E-11 61.4 3.1 80 7-92 1-115 (115)
41 protein:vir:9312 Length: 115 # 97.8 3.6E-08 2.2E-11 61.4 3.1 80 7-92 1-115 (115)
42 protein:vir:97144 Length: 115 97.8 3.6E-08 2.2E-11 61.4 3.1 80 7-92 1-115 (115)
43 protein:vir:96225 Length: 115 97.8 3.6E-08 2.2E-11 61.4 3.1 80 7-92 1-115 (115)
44 protein:vir:103917 Length: 115 97.8 3.6E-08 2.2E-11 61.4 3.1 80 7-92 1-115 (115)
45 protein:vir:78858 Length: 115 97.8 3.6E-08 2.2E-11 61.4 3.1 80 7-92 1-115 (115)
46 protein:vir:80362 Length: 140 97.8 1.5E-07 9.2E-11 58.0 6.4 90 1-99 1-140 (140)
47 protein:vir:106623 Length: 115 97.8 2.9E-08 1.8E-11 61.9 2.5 80 7-92 1-115 (115)
48 protein:vir:97088 Length: 157 97.6 4.4E-08 2.7E-11 60.9 1.8 92 3-99 1-157 (157)
49 protein:vir:3617 Length: 112 # 97.6 5.7E-08 3.5E-11 60.3 2.2 84 3-92 1-112 (112)
50 protein:vir:105089 Length: 133 97.6 9.9E-08 6.2E-11 59.0 2.7 89 1-98 1-133 (133)
51 protein:vir:2740 Length: 114 # 97.5 1.6E-07 1E-10 57.8 3.7 84 1-93 1-114 (114)
52 protein:vir:4906 Length: 114 # 97.5 1.6E-07 1E-10 57.8 3.7 84 1-93 1-114 (114)
53 protein:vir:99744 Length: 115 97.5 1.4E-07 8.8E-11 58.1 3.0 80 7-92 1-115 (115)
54 protein:vir:9708 Length: 125 # 97.5 2E-07 1.2E-10 57.3 3.6 81 1-97 37-125 (125)
55 protein:vir:9930 Length: 108 # 97.5 2E-07 1.3E-10 57.3 3.5 84 4-93 1-108 (108)
56 protein:vir:5978 Length: 144 # 97.5 3.3E-07 2.1E-10 56.1 4.4 87 4-92 1-144 (144)
57 protein:vir:96486 Length: 112 97.4 1.4E-07 8.7E-11 58.1 1.9 82 1-95 1-112 (112)
58 protein:vir:3873 Length: 128 # 97.3 3.7E-07 2.3E-10 55.8 2.9 83 1-96 40-128 (128)
59 protein:vir:5745 Length: 135 # 97.2 1.6E-06 1E-09 52.3 6.0 92 3-110 1-135 (135)
60 protein:vir:79091 Length: 175 97.2 3.4E-06 2.1E-09 50.6 6.9 95 3-98 1-175 (175)
61 protein:vir:98409 Length: 108 97.1 3.5E-07 2.2E-10 55.9 1.2 80 2-92 1-108 (108)
62 protein:vir:98557 Length: 149 97.1 4.2E-06 2.6E-09 50.0 6.9 81 74-159 1-90 (149)
63 protein:vir:743 Length: 108 # 96.9 9.5E-07 5.9E-10 53.6 2.0 80 2-92 1-108 (108)
64 protein:vir:107851 Length: 175 96.8 1.2E-05 7.4E-09 47.6 7.3 95 3-98 1-175 (175)
65 protein:vir:94654 Length: 142 96.7 5.8E-06 3.6E-09 49.3 4.9 86 4-91 1-142 (142)
66 protein:vir:2026 Length: 150 # 96.5 1.7E-05 1E-08 46.8 6.1 81 74-159 1-90 (150)
67 protein:vir:2026 Length: 150 # 96.5 6.7E-06 4.2E-09 48.9 3.6 79 1-93 53-150 (150)
68 protein:vir:1988 Length: 156 # 96.4 1.5E-05 9.4E-09 47.0 5.5 77 1-97 68-156 (156)
69 protein:vir:106570 Length: 182 96.4 8.3E-06 5.1E-09 48.4 3.8 96 1-101 1-182 (182)
70 protein:vir:101594 Length: 173 96.4 1.7E-05 1E-08 46.8 5.5 86 7-108 1-173 (173)
71 protein:vir:103841 Length: 155 96.3 1E-05 6.4E-09 47.9 3.9 80 1-99 55-155 (155)
72 protein:vir:6071 Length: 150 # 96.3 1.1E-05 6.7E-09 47.8 3.9 80 1-93 53-150 (150)
73 protein:vir:100312 Length: 152 96.2 8.3E-06 5.1E-09 48.5 2.8 79 1-94 54-152 (152)
74 protein:vir:5703 Length: 150 # 96.1 1.6E-05 1E-08 46.8 4.0 80 1-93 53-150 (150)
75 protein:vir:6071 Length: 150 # 96.1 3.1E-05 1.9E-08 45.3 5.5 81 74-159 1-90 (150)
76 protein:vir:3163 Length: 145 # 95.9 5.2E-05 3.2E-08 44.1 6.1 82 1-102 50-145 (145)
77 protein:vir:5703 Length: 150 # 95.9 4.6E-05 2.9E-08 44.3 5.6 81 74-159 1-90 (150)
78 protein:vir:79179 Length: 155 95.8 2.7E-05 1.7E-08 45.6 3.9 79 1-93 54-155 (155)
79 protein:vir:99833 Length: 190 95.8 4.6E-05 2.9E-08 44.3 5.1 79 1-99 58-190 (190)
80 protein:vir:78077 Length: 141 95.4 2.9E-05 1.8E-08 45.4 2.9 90 4-100 1-141 (141)
81 protein:vir:98557 Length: 149 95.4 6.6E-05 4.1E-08 43.5 4.7 79 1-93 53-149 (149)
82 protein:vir:96829 Length: 135 95.4 2.1E-05 1.3E-08 46.2 1.8 85 1-88 1-135 (135)
83 protein:vir:79115 Length: 148 95.3 4.9E-05 3E-08 44.2 3.7 78 1-97 53-148 (148)
84 protein:vir:93738 Length: 137 95.3 2.7E-05 1.7E-08 45.6 2.2 82 3-88 1-137 (137)
85 protein:vir:97427 Length: 137 95.3 2.7E-05 1.7E-08 45.6 2.2 82 3-88 1-137 (137)
86 protein:vir:94490 Length: 137 95.3 2.7E-05 1.7E-08 45.6 2.2 82 3-88 1-137 (137)
87 protein:vir:96121 Length: 137 95.0 4.5E-05 2.8E-08 44.4 2.6 83 1-88 1-137 (137)
88 protein:vir:1838 Length: 149 # 94.9 0.00013 8.2E-08 41.8 4.9 79 1-93 53-149 (149)
89 protein:vir:95894 Length: 137 94.7 4.9E-05 3E-08 44.2 2.0 82 3-88 1-137 (137)
90 protein:vir:105330 Length: 137 94.7 2.3E-05 1.4E-08 46.1 0.2 85 1-88 1-137 (137)
91 protein:vir:94796 Length: 137 94.7 0.00013 8.1E-08 41.9 4.3 85 1-88 1-137 (137)
92 protein:vir:8669 Length: 142 # 94.7 8.2E-05 5.1E-08 43.0 3.2 86 2-88 1-142 (142)
93 protein:vir:99101 Length: 142 94.7 8.2E-05 5.1E-08 43.0 3.2 86 2-88 1-142 (142)
94 protein:vir:4956 Length: 153 # 94.6 3.6E-05 2.2E-08 44.9 1.2 103 1-138 44-153 (153)
95 protein:vir:94108 Length: 149 94.6 0.00011 6.9E-08 42.3 3.7 83 1-88 13-149 (149)
96 protein:vir:79225 Length: 155 94.5 0.00013 8.2E-08 41.9 4.0 80 1-98 55-155 (155)
97 protein:vir:102154 Length: 119 94.4 2.6E-05 1.6E-08 45.7 -0.1 83 1-96 1-119 (119)
98 protein:vir:79115 Length: 148 94.3 0.00041 2.5E-07 39.2 6.3 81 74-159 1-89 (148)
99 protein:vir:97327 Length: 116 94.3 5.8E-05 3.6E-08 43.8 1.5 77 3-88 1-116 (116)
100 protein:vir:1243 Length: 116 # 94.3 5.8E-05 3.6E-08 43.8 1.5 77 3-88 1-116 (116)
101 protein:vir:4833 Length: 140 # 94.2 0.00015 9E-08 41.6 3.5 88 1-99 35-140 (140)
102 protein:vir:105916 Length: 149 94.1 5.8E-05 3.6E-08 43.8 1.2 85 1-88 13-149 (149)
103 protein:vir:95062 Length: 116 94.1 5.7E-05 3.6E-08 43.8 1.2 77 3-88 1-116 (116)
104 protein:vir:99196 Length: 155 94.1 0.00017 1.1E-07 41.2 3.8 78 1-99 55-155 (155)
105 protein:vir:1838 Length: 149 # 94.0 0.00048 3E-07 38.8 6.0 81 74-159 1-90 (149)
106 protein:vir:107099 Length: 137 94.0 8.8E-05 5.5E-08 42.8 1.9 78 1-88 1-137 (137)
107 protein:vir:1164 Length: 156 # 93.7 0.00027 1.7E-07 40.1 4.0 83 1-102 54-156 (156)
108 protein:vir:81067 Length: 119 93.7 0.00019 1.2E-07 41.0 3.2 84 3-99 1-119 (119)
109 protein:vir:5000 Length: 141 # 93.6 0.00021 1.3E-07 40.8 3.3 91 1-102 44-141 (141)
110 protein:vir:10367 Length: 119 93.5 0.00022 1.4E-07 40.6 3.1 84 3-99 1-119 (119)
111 protein:vir:100887 Length: 139 93.4 0.0002 1.3E-07 40.8 2.8 88 1-103 44-139 (139)
112 protein:vir:4859 Length: 140 # 93.1 0.00025 1.6E-07 40.3 3.0 90 1-102 44-140 (140)
113 protein:vir:3787 Length: 231 # 92.8 0.001 6.5E-07 36.9 5.9 83 1-100 59-231 (231)
114 protein:vir:106041 Length: 137 92.7 0.00011 6.9E-08 42.3 0.3 87 1-100 1-137 (137)
115 protein:vir:81147 Length: 126 92.5 0.0014 8.5E-07 36.3 6.1 88 4-99 1-126 (126)
116 protein:vir:97427 Length: 137 91.3 0.00035 2.2E-07 39.5 1.4 64 70-159 1-64 (137)
117 protein:vir:94490 Length: 137 91.3 0.00035 2.2E-07 39.5 1.4 64 70-159 1-64 (137)
118 protein:vir:93738 Length: 137 91.3 0.00035 2.2E-07 39.5 1.4 64 70-159 1-64 (137)
119 protein:vir:100223 Length: 139 88.2 0.0015 9.2E-07 36.1 2.4 89 1-104 44-139 (139)
120 protein:vir:1243 Length: 116 # 87.6 0.0016 9.7E-07 36.0 2.1 43 101-159 1-43 (116)
121 protein:vir:97327 Length: 116 87.6 0.0016 9.7E-07 36.0 2.1 43 101-159 1-43 (116)
122 protein:vir:79179 Length: 155 87.4 0.006 3.7E-06 32.8 5.2 82 70-159 1-96 (155)
123 protein:vir:95062 Length: 116 87.4 0.0017 1E-06 35.8 2.2 43 101-159 1-43 (116)
124 protein:vir:79034 Length: 141 87.0 0.0031 1.9E-06 34.4 3.4 95 1-101 1-141 (141)
125 protein:vir:95894 Length: 137 86.6 0.0031 1.9E-06 34.3 3.2 64 70-159 1-64 (137)
126 protein:vir:98860 Length: 230 86.6 0.0054 3.3E-06 33.0 4.5 82 1-100 61-230 (230)
127 protein:vir:96829 Length: 135 86.4 0.0029 1.8E-06 34.5 2.9 61 70-159 1-64 (135)
128 protein:vir:3750 Length: 227 # 85.9 0.0057 3.5E-06 32.9 4.3 80 1-99 59-227 (227)
129 protein:vir:94796 Length: 137 85.6 0.0036 2.2E-06 34.0 3.0 64 70-159 1-64 (137)
130 protein:vir:106570 Length: 182 85.3 0.0027 1.7E-06 34.6 2.2 69 69-159 1-69 (182)
131 protein:vir:106506 Length: 137 84.9 0.00053 3.3E-07 38.6 -1.9 85 3-97 1-137 (137)
132 protein:vir:96121 Length: 137 84.7 0.0041 2.5E-06 33.7 2.9 64 70-159 1-64 (137)
133 protein:vir:78755 Length: 228 83.5 0.015 9E-06 30.7 5.4 92 1-135 55-228 (228)
134 protein:vir:1164 Length: 156 # 83.3 0.019 1.2E-05 30.1 5.9 82 70-159 1-93 (156)
135 protein:vir:100312 Length: 152 82.3 0.023 1.4E-05 29.6 5.9 82 70-159 1-91 (152)
136 protein:vir:96358 Length: 115 81.7 0.0033 2E-06 34.2 1.1 68 75-159 1-68 (115)
137 protein:vir:9312 Length: 115 # 81.7 0.0033 2E-06 34.2 1.1 68 75-159 1-68 (115)
138 protein:vir:96225 Length: 115 81.7 0.0033 2E-06 34.2 1.1 68 75-159 1-68 (115)
139 protein:vir:97144 Length: 115 81.7 0.0033 2E-06 34.2 1.1 68 75-159 1-68 (115)
140 protein:vir:78858 Length: 115 81.7 0.0033 2E-06 34.2 1.1 68 75-159 1-68 (115)
141 protein:vir:103917 Length: 115 81.7 0.0033 2E-06 34.2 1.1 68 75-159 1-68 (115)
142 protein:vir:9930 Length: 108 # 81.6 0.0047 2.9E-06 33.4 1.9 60 71-159 1-60 (108)
143 protein:vir:966 Length: 123 # 80.5 0.014 8.8E-06 30.7 4.2 85 3-93 1-123 (123)
144 protein:vir:5978 Length: 144 # 79.9 0.0081 5E-06 32.1 2.6 69 65-159 1-69 (144)
145 protein:vir:101594 Length: 173 79.7 0.011 7E-06 31.3 3.3 62 75-159 1-62 (173)
146 protein:vir:105330 Length: 137 78.8 0.0046 2.8E-06 33.4 0.9 64 70-159 1-64 (137)
147 protein:vir:107099 Length: 137 76.8 0.0066 4.1E-06 32.5 1.1 64 70-159 1-64 (137)
148 protein:vir:97982 Length: 140 75.9 0.0028 1.8E-06 34.5 -1.1 87 1-100 1-140 (140)
149 protein:vir:107545 Length: 140 75.9 0.0028 1.8E-06 34.5 -1.1 87 1-100 1-140 (140)
150 protein:vir:78077 Length: 141 71.5 0.013 8.2E-06 30.9 1.4 63 74-159 1-64 (141)
151 protein:vir:102963 Length: 163 71.3 0.056 3.5E-05 27.4 4.8 91 7-100 1-163 (163)
152 protein:vir:9879 Length: 127 # 70.2 0.0085 5.2E-06 31.9 0.1 89 1-93 14-127 (127)
153 protein:vir:100652 Length: 134 65.5 0.063 3.9E-05 27.1 3.9 78 5-94 1-134 (134)
154 protein:vir:3848 Length: 159 # 62.0 0.062 3.8E-05 27.2 3.1 88 1-101 57-159 (159)
155 protein:vir:105467 Length: 144 60.8 0.11 7E-05 25.8 4.3 66 70-159 1-69 (144)
156 protein:vir:6246 Length: 143 # 60.5 0.02 1.2E-05 29.9 0.1 97 1-107 1-143 (143)
157 protein:vir:99528 Length: 92 # 57.0 0.046 2.8E-05 27.9 1.5 61 4-66 1-92 (92)
158 protein:vir:1332 Length: 143 # 54.2 0.031 1.9E-05 28.9 0.1 97 1-107 1-143 (143)
159 protein:vir:80116 Length: 127 54.1 0.061 3.8E-05 27.3 1.7 87 4-100 1-127 (127)
160 protein:vir:98636 Length: 138 53.4 0.32 0.0002 23.3 5.5 84 1-97 5-138 (138)
161 protein:vir:102441 Length: 137 49.2 0.064 4E-05 27.1 1.0 60 64-159 1-60 (137)
162 protein:vir:9647 Length: 132 # 47.5 0.42 0.00026 22.7 5.2 82 3-97 1-132 (132)
163 protein:vir:95372 Length: 124 42.6 0.062 3.9E-05 27.2 -0.2 84 4-97 1-124 (124)
164 protein:vir:104347 Length: 145 36.0 1.2 0.00074 20.2 7.1 90 3-105 1-145 (145)
165 protein:vir:94994 Length: 131 35.2 1 0.00065 20.5 5.3 75 1-95 33-131 (131)
166 protein:vir:101302 Length: 134 32.6 0.65 0.0004 21.6 3.8 78 5-94 1-134 (134)
167 protein:vir:9513 Length: 134 # 32.6 0.65 0.0004 21.6 3.8 78 5-94 1-134 (134)
168 protein:vir:78380 Length: 131 31.9 1.3 0.00079 20.0 5.2 76 1-95 33-131 (131)
169 protein:vir:7412 Length: 168 # 30.7 0.94 0.00058 20.7 4.3 98 1-105 29-168 (168)
170 protein:vir:96288 Length: 100 30.0 0.21 0.00013 24.3 0.6 75 79-159 1-76 (100)
171 protein:vir:94944 Length: 121 25.4 1.2 0.00071 20.2 3.8 78 1-78 2-121 (121)
172 protein:vir:1028 Length: 168 # 25.4 0.72 0.00045 21.4 2.7 98 1-105 29-168 (168)
173 protein:vir:102338 Length: 116 24.0 1.3 0.00082 19.9 3.8 87 3-96 1-116 (116)

No 1
>protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224
Probab=100.00 E-value=1.3e-53 Score=310.49 Aligned_cols=148 Identities=25% Similarity=0.422 Sum_probs=139.8

Q ss_pred ceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCC---CCCCCHHHHHHHHhcCC----------------------
Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSE---NGNLPVAQVAAYNEFGT---------------------- 61 (159)
Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~~~---~~G~~~A~iA~~~E~G~---------------------- 61 (159)
|+++.+.+.|++++++|++|++++|+|||++++.|+++ ++|+++|+||+|||||.
T Consensus 1 m~~~~~~~~~~~~~~~l~~l~~~~v~vGi~~~~~~~~~~~~~~G~~va~iAai~EfG~~I~~~~~~~~~~~~~~~g~~~~ 80 (193) T protein:vir:96 1 MSLRRDSELIAAHLQMLRAMRGRSVSAGWYSTARYPDKAGGSVGIQVARIARLNEYGGTIDHPGGTRYIRDAIVRGRFVG 80 (193) T ss_pred CeeccchHHHHHHHHHHHHhcCCeEEEEEcCCCCCCCcccccccchHHHHHhHHHcCCccccCccceeeeeccccccccc Confidence 88899999999999999999999999999999998763 35899999999999994 Q ss_pred -------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_015294. 62 -------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDD 122 (159) Q Consensus 62 -------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~ 122 (159) ++||||||||++++++. +.|.+.+++++.+++.|+.+++++|+.+|..++++||.+|++ T Consensus 81 ~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~--~~~~~~~~~~~~~~~~g~~~~~~~l~~~G~~~~~~ik~~I~~ 158 (193) T protein:vir:96 81 VRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLFS--ADRAAIQNRIAMRLARGQITPDQALAQIGLALEGYIARSIRT 158 (193) T ss_pred cceeccCcceeeEeecceeccCCCcchhhhhHHHHH--HHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc Confidence 37999999999999985 568999999999999999999999999999999999999999 Q ss_pred C-CCCCcHHHHHhcCCCCCchhHHHHHhhhhhhhc Q lcl|NC_015294. 123 Y-PGSNSPAWAAYKGFNDPLFHTGKMLESVKFQIH 156 (159) Q Consensus 123 ~-~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty~V~ 156 (159) + +|||||+||++||||+||||||+|++||+|+|+ T Consensus 159 ~~~ppna~~Ti~~KG~~~PLidTG~l~~SIty~Vv 193 (193) T protein:vir:96 159 GPWVANSASTVRRKGFNRPLVDTAHMLQSISSRVT 193 (193) T ss_pred CCCCCCcHHHHHHhCCCCchhHHHHHHhhhcceeC Confidence 8 589999999999999999999999999999999 No 2 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=100.00 E-value=3.8e-53 Score=307.96 Aligned_cols=150 Identities=19% Similarity=0.346 Sum_probs=139.3 Q ss_pred CCceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCC--CCCCCchhhHHHHHHHHHHH Q lcl|NC_015294. 
5 ASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTT--RNPTRPFMAPTFEEFTSQFH 82 (159) Q Consensus 5 ~~~~~k~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~--~IP~RpFlr~~~~~~~~~~~ 82 (159) .+++|+...+.+++|.+.|++|++++|+||||++++|+ ||+++|+||+|||||++ +||||||||++++++. ++ T Consensus 1 M~~~i~~~~~~~~~L~~~lk~l~~k~V~VGi~~~~~y~---dG~~vA~Ia~~~E~G~p~~~IP~RPFlr~t~~~~~--~~ 75 (189) T protein:vir:10 1 MGRVIRKQGPARVKLNAFIKGMNDYSVRIGWFSTAKYP---DGTPTAYVASIHEFGAPSRGIPARSFIRPTIAAQQ--AA 75 (189) T ss_pred CcceeccCcHHHHHHHHHHHHhhCCeEEEEecCCCCCC---CcccHHHHHHHHHhcCcCCCCCCchhhhHHHHHHH--HH Confidence 34456767788899999999999999999999998774 89999999999999996 7999999999999985 56 Q ss_pred HHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcC-CCCCcHHHHHhcCC------------------------ Q lcl|NC_015294. 83 YARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDY-PGSNSPAWAAYKGF------------------------ 137 (159) Q Consensus 83 ~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~-~~Pna~~Ti~~KG~------------------------ 137 (159) |.+.+++++.+++.|+.+++++|+.+|+.++++||.+|+++ +|||||+||++||+ T Consensus 76 ~~~~l~~~~~~vl~G~~~~~~~L~~~G~~a~~~Ik~~I~~~~~ppna~sTi~~Kg~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (189) T protein:vir:10 76 WSQQMRFYAKQIVVGQMNVEQALEGLAIVARGDVDATLARLKDPPLSPLTIYIRKFIKDGGVIHGYKDIMRLRSEMQQEQ 155 (189) T ss_pred HHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHHhcccCcccchhhhhhhhhhhhhhhhhh Confidence 89999999999999999999999999999999999999998 58999999999994 Q ss_pred -----------CCCchhHHHHHhhhhhhhcccC Q lcl|NC_015294. 
138 -----------NDPLFHTGKMLESVKFQIHRRQ 159 (159) Q Consensus 138 -----------~~PLiDTG~L~~SIty~V~~k~ 159 (159) ++||||||+|++||||+|++|+ T Consensus 156 ~~~~~~~~~~s~kPLidTG~l~~SIty~V~~k~ 188 (189) T protein:vir:10 156 AKGTLNLSGVSTDPLDFTGYMRATLSYTVTKEK 188 (189) T ss_pred hhccccccccCCCchhhHHHHHhhcceeeeecC Confidence 6999999999999999999999 No 3 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=100.00 E-value=5.3e-53 Score=307.19 Aligned_cols=152 Identities=28% Similarity=0.415 Sum_probs=137.1 Q ss_pred ccCCc--eeeeh-HHHHHHHHHHHHHhhCCEEEEEecccccCCC---CCCCCCHHHHHHHHhcCC--------------- Q lcl|NC_015294. 3 ILASF--SFKTD-RRRLTSLIKRVEALDGTTVEVGFFPEDRYGS---ENGNLPVAQVAAYNEFGT--------------- 61 (159) Q Consensus 3 m~~~~--~~k~~-~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~~---~~~G~~~A~iA~~~E~G~--------------- 61 (159) |.+|+ ++|.. .+++++++++|++|++++|+|||+++++|++ ++||+++|+||+|||||+ T Consensus 1 ~~~~~~~~~k~~~~~~~~~~~~~l~~l~~~~v~vGi~~~~~y~~~~~~~dG~~va~IA~~~EfG~~i~~p~~~~~~~~~~ 80 (200) T protein:vir:99 1 MKKGFSKSNSVAAPLKHFQMLKQFDALKGKTVQAGWFETDRYPAKEGETIGPLVAKIARQLEFGGVINHPGGTKYIKDAI 80 (200) T ss_pred CCcCcceeeeeecchHHHHHHHHHHHhhCCeEEEEEcCCCCcCCcccccccchHHHHHhHHHcCCeeccCCCcccccccc Confidence 44444 22222 3579999999999999999999999999874 458999999999999994 Q ss_pred --------------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHH Q lcl|NC_015294. 62 --------------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQ 115 (159) Q Consensus 62 --------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~ 115 (159) ++||||||||++++++. 
+.|.+.+++.+.+++.|+.+++++|+.+|..++++ T Consensus 81 ~~g~~~g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~--~~~~~~~~~~~~~~l~g~~~~~~~L~~~G~~~~~~ 158 (200) T protein:vir:99 81 VDGRYVGTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATFN--KDKVKIQAQIARQLLDGTINPEQALAQIGLALEGC 158 (200) T ss_pred ccccccccccccccccceeeeeccccccCCCcchhhHHHHHHH--HHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHH Confidence 38999999999999985 56899999999999999999999999999999999 Q ss_pred HHHHhhcC-CCCCcHHHHHhcCCCCCchhHHHHHhhhhhhhc Q lcl|NC_015294. 116 MQVNIDDY-PGSNSPAWAAYKGFNDPLFHTGKMLESVKFQIH 156 (159) Q Consensus 116 i~~~I~~~-~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty~V~ 156 (159) ||.+|+++ +|||||+||++||||+||||||+|++||+|+|+ T Consensus 159 ik~~I~~~~~ppna~sTi~~Kg~~~PLidTG~l~~SIty~Ve 200 (200) T protein:vir:99 159 IVRSIKSGPWAANSPATIRAKGFDKPLIDTAHMWQTVSSKVS 200 (200) T ss_pred HHHHHhcCCCCCChHHHHHHhCCCCchHHHHHHHhHhccccC Confidence 99999998 589999999999999999999999999999999 No 4 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=100.00 E-value=1.5e-52 Score=304.76 Aligned_cols=146 Identities=28% Similarity=0.369 Sum_probs=131.0 Q ss_pred CCceeeehHHHHHHHHHHHHHhhCCEEEEEeccccc-CCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHH Q lcl|NC_015294. 5 ASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDR-YGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHY 83 (159) Q Consensus 5 ~~~~~k~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~-~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~ 83 (159) .++++|.+.+++++++++|++|++++|+||||++.. ...++||+++|+||+|||||+++||+|||||+++++++. 
+| T Consensus 1 M~~~~k~~~~~~~~l~~~l~~l~~~~v~VGi~~~~~~~~~~~~g~~vA~ia~~~E~G~~~IP~Rpflr~t~~~~~~--~~ 78 (148) T protein:vir:52 1 MAVTVTANFSAAKQLIEQMKSLKEKAVYVGFPAEFDEKVKGSENFNLASLAAVLEFGNEHIPARPFLRQTLEENQE--KY 78 (148) T ss_pred CccccccccHHHHHHHHHHHHhhCCeEEEEeecCcCCCCCCCCCCCHHHHHHHHhcCCCCCCCcchhHHHHHHHHH--HH Confidence 455678888999999999999999999999996532 234679999999999999999999999999999999864 46 Q ss_pred HHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcC-CCCCcHHHHHhcCCCCCchhHHHHHhhhhhhhc Q lcl|NC_015294. 84 ARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDY-PGSNSPAWAAYKGFNDPLFHTGKMLESVKFQIH 156 (159) Q Consensus 84 ~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~-~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty~V~ 156 (159) .++ +.+++.|+.+++++|+.+|+.++++||.+|+++ +|||||+||++||||+||||||+|++||+|+|+ T Consensus 79 ~~~----~~~~~~~~~~~~~~L~~~G~~~~~~ik~~I~~~~~ppna~sTi~~Kg~~~PLidTG~l~~SIty~V~ 148 (148) T protein:vir:52 79 TAL----FIQWFDQGVPAAQIYERLSVMAQGDVQMNIVKGEWVANAKSTIRRKKSSKPLIDTGKMRQSVRGIVK 148 (148) T ss_pred HHH----HHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHhcCCCCchhHHHHHHHHhhhhcC Confidence 554 456777899999999999999999999999998 589999999999999999999999999999999 No 5 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=100.00 E-value=1.1e-49 Score=289.09 Aligned_cols=140 Identities=28% Similarity=0.531 Sum_probs=124.6 Q ss_pred eeehHHHHHHHHHHHHHhhCCEEEEEecccccCCC---------------CCCCCCHHHHHHHHhcCCCCCCCCchhhHH Q lcl|NC_015294. 
9 FKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGS---------------ENGNLPVAQVAAYNEFGTTRNPTRPFMAPT 73 (159) Q Consensus 9 ~k~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~~---------------~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~ 73 (159) .|.++++|+.++++| ++++|+|||++++.|++ +.+|+++|+||+|||||+.+||||||||++ T Consensus 1 m~v~~k~L~~~~~~l---~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~~~IP~RPFlr~t 77 (155) T protein:vir:78 1 MSVTRRGLTLPKDRY---RSMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGTSKLPARPFMEKT 77 (155) T ss_pred CcchHHHHHHHHHHH---hCCeeEEeecCCCCCCcccchhhhhhhhcccccccCCcHHHHHHhhhcCCCCCCCcchhhHH Confidence 345578888887665 57899999999999986 446999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhh Q lcl|NC_015294. 74 FEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKF 153 (159) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty 153 (159) ++++.. +|.+.+ .+++.++.+++++|+.+|+.++++||.+|+++.|||||+||++||||+||||||+|++||+| T Consensus 78 ~~~~~~--~~~~~l----~~~~~~~~~~~~~L~~~G~~~~~~Ik~~I~~~~~pna~~Ti~~Kg~~kPLidTG~l~~SIty 151 (155) T protein:vir:78 78 ITDRSA--EWIKGL----TVMMTMGYDAEVAMGQIGQAMKDDIKTTISEWPADNSADWAGKKGFNHGLIWTSHLLNSVEQ 151 (155) T ss_pred HHHHHH--HHHHHH----HHHHHcCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCchhHHHHHHHhhhh Confidence 999865 466554 45566788999999999999999999999999999999999999999999999999999999 Q ss_pred hhcc Q lcl|NC_015294. 
154 QIHR 157 (159) Q Consensus 154 ~V~~ 157 (159) +|++ T Consensus 152 ~V~~ 155 (155) T protein:vir:78 152 EIVK 155 (155) T ss_pred hccC Confidence 9999 No 6 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=100.00 E-value=1.1e-49 Score=288.89 Aligned_cols=140 Identities=28% Similarity=0.531 Sum_probs=124.7 Q ss_pred eeehHHHHHHHHHHHHHhhCCEEEEEecccccCCC---------------CCCCCCHHHHHHHHhcCCCCCCCCchhhHH Q lcl|NC_015294. 9 FKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGS---------------ENGNLPVAQVAAYNEFGTTRNPTRPFMAPT 73 (159) Q Consensus 9 ~k~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~~---------------~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~ 73 (159) .|.++++|+.++++| ++++|+|||++++.|++ +.+|+++|+||+|||||+.+||+|||||++ T Consensus 1 m~v~~k~L~~~~~~l---~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~~~IP~RPFlr~t 77 (155) T protein:vir:10 1 MSVTRRGLTLPKDRY---RSMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGTSKLPARPFMEKT 77 (155) T ss_pred CcchHHHHHHHHHHH---hCCeeEEeecCCCCCccccchhhhhhhhcccccccCCcHHHHHHHHhcCCCCCCCcchhHHH Confidence 345578888887665 57899999999999986 446999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhh Q lcl|NC_015294. 74 FEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKF 153 (159) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty 153 (159) +++++. 
+|.+.+ .+++.++.+++++|+.+|..++++||.+|+++.|||||+||++||||+||||||+|++||+| T Consensus 78 ~~~~~~--~~~~~l----~~~~~~~~~~~~~L~~lG~~~~~~Ik~~I~~~~~pna~~Ti~~KG~~kPLidTG~l~~SIty 151 (155) T protein:vir:10 78 IADRSA--EWIKGL----TVMMTMGYDAEVAMGQIGQAMKDDIKTTISEWPADNSADWAGKKGFNHGLIWTSHLLNSVEQ 151 (155) T ss_pred HHHHHH--HHHHHH----HHHHHcCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCcHHHHHhcCCCCchhHHHHHHHhhhh Confidence 999865 466554 45566788999999999999999999999999999999999999999999999999999999 Q ss_pred hhcc Q lcl|NC_015294. 154 QIHR 157 (159) Q Consensus 154 ~V~~ 157 (159) +|++ T Consensus 152 ~Vv~ 155 (155) T protein:vir:10 152 EIVK 155 (155) T ss_pred hccC Confidence 9999 No 7 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=100.00 E-value=4.9e-49 Score=285.44 Aligned_cols=146 Identities=29% Similarity=0.511 Sum_probs=128.7 Q ss_pred ceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCC--------------CCCCCCHHHHHHHHhcCCCCCCCCchhhH Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGS--------------ENGNLPVAQVAAYNEFGTTRNPTRPFMAP 72 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~~--------------~~~G~~~A~iA~~~E~G~~~IP~RpFlr~ 72 (159) |+++ ..++++...+.+..|.+..|+|||+++++|++ +++|+++|+||+|||||+.+||+|||||+ T Consensus 1 ~~~~-~~~g~~~~~~~~~~l~~~~v~vG~l~~a~yp~G~~~~~~~~~~~~~~~~g~~va~Ia~~~E~G~~~IP~RPFlr~ 79 (168) T protein:vir:94 1 MTTI-ARKGVKMPPHLEAQFQSGEVKAGVLSGSTYPQMTYTDQRTGKQIEDARGGMPVAVIAQALEYGHGQNHPRPFMQQ 79 (168) T ss_pred Cccc-cchhhhhhHHHHHhhhccceeeeccccCcccccccchhhcccccccccccccHHHHHHHHhcCCCCCCCchhhHH Confidence 3322 25668888888899999999999999998863 35789999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhh Q lcl|NC_015294. 
73 TFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVK 152 (159) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~SIt 152 (159) ++++++. +|.+ .+.+++.|+.+++++|+.+|..++++||.+|+++.|||||+||++||||+||||||+|++||+ T Consensus 80 t~~~~~~--~~~~----~~~~~~~~~~~~~~~L~~lG~~~~~~Ik~~I~~~~ppna~sTi~~KG~~~PLiDTG~l~~SIt 153 (168) T protein:vir:94 80 TYAAQYR--AWSR----DLTLTLKAGAAADTALRTVGQRMAEDIQDTIRNWPADNSPEWAAIKGFNAGLRQTGVLLNAID 153 (168) T ss_pred HHHHHHH--HHHH----HHHHHHhcCCCHHHHHHHHHHHHHHHHHHHhhcCCCCccHHHHHhcCCCCchhHHHHHHhhcc Confidence 9998864 4554 556778889999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccc-C Q lcl|NC_015294. 153 FQIHRR-Q 159 (159) Q Consensus 153 y~V~~k-~ 159 (159) |+|++. + T Consensus 154 y~Vv~d~~ 161 (168) T protein:vir:94 154 SAVIIDGE 161 (168) T ss_pred eeeeecCC Confidence 988844 4 No 8 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=100.00 E-value=5e-49 Score=285.38 Aligned_cols=143 Identities=25% Similarity=0.478 Sum_probs=131.4 Q ss_pred ceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcC-------------------------- Q lcl|NC_015294. 
7 FSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFG-------------------------- 60 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G-------------------------- 60 (159) |+++.+.+.+++++++|++|++++|+|||+.+ ||.++++||.+|||| T Consensus 1 m~vt~~~~~~~~~~~~l~~L~~k~v~vGi~~~-------d~~~~~~Ia~~~E~Ga~I~~~~~~l~Ip~~~a~~~k~~~~~ 73 (199) T protein:vir:80 1 MKVTTDKSTMNKAIRELDQLDRYSLQIGLFGE-------DDSFIQMIAGVHEFGLTIRPKGKYLTIPTPEAGDRRARDIP 73 (199) T ss_pred CcccccHHHHHHHHHHHHHhcCCEEEEEEecC-------CCcchhheeehhhcCCeeecCCceeeecchhhhcccccccC Confidence 77888999999999999999999999999953 566777777777777 Q ss_pred --------------------------C--CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHH Q lcl|NC_015294. 61 --------------------------T--TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMV 112 (159) Q Consensus 61 --------------------------~--~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~ 112 (159) + .+||+|||||+++++++ ++|.+++++++.+++.|+.+++++|+.+|+.+ T Consensus 74 ~~~~p~g~~~~~~~~~~~~~~~~e~g~~~~~IP~RPFlr~t~~~~~--~~~~~~~~~~~~~vl~g~~~a~~~L~~~G~~~ 151 (199) T protein:vir:80 74 GLFKPKGKNILAVAGPDGKLTVMFYLKTEVNIPERSFLRSTFDEKS--NKWGELFEGWIDDVIHGKLSAEQVYNRLGAKI 151 (199) T ss_pred cccccCCcceeeeeccccceeeeeeccccccCCCCchhHHHHHHHH--HHHHHHHHHHHHHHHhCCCcHHHHHHHHHHHH Confidence 2 27999999999999985 56899999999999999999999999999999 Q ss_pred HHHHHHHhhcC-CCCCcHHHHH-hcCCCCCchhHHHHHhhhhhhhccc Q lcl|NC_015294. 
113 AEQMQVNIDDY-PGSNSPAWAA-YKGFNDPLFHTGKMLESVKFQIHRR 158 (159) Q Consensus 113 ~~~i~~~I~~~-~~Pna~~Ti~-~KG~~~PLiDTG~L~~SIty~V~~k 158 (159) +++||.+|+++ +|||||+||+ +||||+||||||+|++||+|+|++- T Consensus 152 ~~~Ik~~I~~~~~ppna~~Tia~rKg~~kPLidTG~l~~SIty~V~~~ 199 (199) T protein:vir:80 152 VDDIQMKIVEIQTPAKSAATLARNPRKNNPLIVTGKMKNSVTWKVMKS 199 (199) T ss_pred HHHHHHHHhccCCCCCCHHHHHHhcCCCCchHHHHHHHhhcceeeeeC Confidence 99999999987 6899999997 8999999999999999999999999 No 9 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=100.00 E-value=1.6e-48 Score=282.65 Aligned_cols=140 Identities=27% Similarity=0.542 Sum_probs=123.8 Q ss_pred eeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCC---------------CCCCHHHHHHHHhcCCCCCCCCchhhHH Q lcl|NC_015294. 9 FKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSEN---------------GNLPVAQVAAYNEFGTTRNPTRPFMAPT 73 (159) Q Consensus 9 ~k~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~~~~---------------~G~~~A~iA~~~E~G~~~IP~RpFlr~~ 73 (159) .+.++++|+.++++|+ +++|+||||+++.|+++. +|+++|+||+|||||+.+||||||||++ T Consensus 1 m~v~r~~L~~~~~~l~---~~~V~VGi~~~a~y~d~~g~~~~~g~~~~~~~~~G~pva~ia~~~e~G~~~IP~RPFlr~t 77 (155) T protein:vir:10 1 MSVTRRGLTLPKDRYK---SMSVKAGVLAGATYPDESGKKLADGTILKKDPRAGLPVAMIAMALNYGTSKLPARPFMEKT 77 (155) T ss_pred CcchHHHHHHHHHHhh---CCeeEEeecCCCCCCccccchhhhhhhhccccccCcchhhhhhhhhcCCCCCCCcchhHHH Confidence 2444778888877665 678999999999998644 3999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhh Q lcl|NC_015294. 74 FEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKF 153 (159) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty 153 (159) ++++.. 
+|.+.+ .+++.++.+++++|+.+|..++++||.+|+++.+||+|+||++||||+||||||+|++||+| T Consensus 78 ~~~~~~--~~~~~l----~~~~~~~~~~~~~L~~~G~~~~~~Ik~~I~~~~~p~~~~Ti~~KG~~~PLidTG~l~~Sity 151 (155) T protein:vir:10 78 IADRSA--EWIKGL----TVMMTMGYDAEVAMGQIGQAMKDDIKTTISEWPADNNADWAGKKGFNHGLIWTSHLLNSIEQ 151 (155) T ss_pred HHHHHH--HHHHHH----HHHHHcCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCChHHHHhcCCCCchHHHHHHHHhhhh Confidence 999865 455544 45667788999999999999999999999999888999999999999999999999999999 Q ss_pred hhcc Q lcl|NC_015294. 154 QIHR 157 (159) Q Consensus 154 ~V~~ 157 (159) +|++ T Consensus 152 ~Vv~ 155 (155) T protein:vir:10 152 EIVK 155 (155) T ss_pred hccC Confidence 9999 No 10 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=100.00 E-value=4.7e-48 Score=280.05 Aligned_cols=140 Identities=26% Similarity=0.528 Sum_probs=122.4 Q ss_pred eeehHHHHHHHHHHHHHhhCCEEEEEecccccCCC---------------CCCCCCHHHHHHHHhcCCCCCCCCchhhHH Q lcl|NC_015294. 9 FKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGS---------------ENGNLPVAQVAAYNEFGTTRNPTRPFMAPT 73 (159) Q Consensus 9 ~k~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~~---------------~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~ 73 (159) .+.++.+|+.++++| ++++|+|||++++.|++ +++|+++|+||+|||||+.+||||||||++ T Consensus 1 m~~~r~~l~~~~~~l---~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~G~pva~ia~~~e~G~~~IP~RPFlr~t 77 (155) T protein:vir:77 1 MSVTRRGLTLPKDRY---RSMSVKAGVLAGATYPDESGKKLADGSILKKDPRAGLPVAMIAMALNYGTSKLPARPFMEKT 77 (155) T ss_pred CcchHHHHHHHHHHH---hcCceEEeecCCCCCccccchhhhhhhhccccccccccHhhhhhhhhcCCCCCCCCchhhHH Confidence 234466677776654 57889999999999886 445999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhh Q lcl|NC_015294. 
74 FEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKF 153 (159) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty 153 (159) ++++.. +|.+.+. +++.++.+++++|+.+|..++++||.+|+++.+||+|+||++||||+||||||+|++||+| T Consensus 78 ~~~~~~--~~~~~l~----~~~~~~~~~~~~L~~lG~~~~~~Iq~~I~~~~~p~~~~Ti~~KG~d~PLidTG~l~~SIty 151 (155) T protein:vir:77 78 IADRSA--EWIKGLT----VMMTMGYDAEVAMGQIGQAMKDDIKTTISEWPADNNADWAGKKGFNHGLIWTSHLLNSIEQ 151 (155) T ss_pred HHHHHH--HHHHHHH----HHHHccCcHHHHHHHHHHHHHHHHHHHHhcCCCCCChHHHHhcCCCCchhHHHHHHHhhhh Confidence 999865 4665554 4556678999999999999999999999999888999999999999999999999999999 Q ss_pred hhcc Q lcl|NC_015294. 154 QIHR 157 (159) Q Consensus 154 ~V~~ 157 (159) +|++ T Consensus 152 ~Vv~ 155 (155) T protein:vir:77 152 EIVK 155 (155) T ss_pred hccC Confidence 9999 No 11 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=100.00 E-value=1.7e-45 Score=265.99 Aligned_cols=148 Identities=16% Similarity=0.265 Sum_probs=122.3 Q ss_pred ccCCceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHH--- Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTS--- 79 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~--- 79 (159) |++ +.+.+++++|.+++++|.++.|+||||+++. .++||+++++||+|||||+.+||+|||||++++.... 
T Consensus 1 ~~~----~~~~~G~~~L~~~~k~l~~~~V~VGi~~d~g--~~~dG~sv~~vA~~~EfG~~~iPaRPf~R~tfe~~~~~~~ 74 (160) T protein:vir:95 1 MVK----RVIHPARAKLVGAMKNLQTANAQVGYFQEQG--QHSSGFSYPALMYLQEVIGVPSASGKVYRRLFEITMMLNK 74 (160) T ss_pred Cce----eechHhHHHHHHHHHHHhCCeeEEeeccccc--cCCCCccHHHHHhhhhcCcccCCCcchhHHHHHHHHHHHH Confidence 333 3567889999999999999999999999873 3469999999999999999999999999999974222 Q ss_pred HHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcC-----CCCCcHHHHHhcCCCCCchhHHHHHhhhhhh Q lcl|NC_015294. 80 QFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDY-----PGSNSPAWAAYKGFNDPLFHTGKMLESVKFQ 154 (159) Q Consensus 80 ~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~-----~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty~ 154 (159) +..+.+..++...++..|+.++ .+.+|+.++++|+.+|.+. ||||||+||++||||+||||||+|++||+|+ T Consensus 75 ~~~~~~~~~~i~~~~~~g~~~~---~~~LG~~~~~~ik~~I~~~~~p~~w~pNap~Ti~~Kgs~~PLiDTg~l~~Si~y~ 151 (160) T protein:vir:95 75 QTLLEQTKKNLYKQLSSLNTDP---SNTLEAFAKNAQKAIKRGFGNSAILPPNAPSTVKKKGFNAPLVETGDLRDNLAYK 151 (160) T ss_pred HHHHHHHHHHHHHHHhhcchhH---HHHHHHHHHHHHHHHHhhcCCccCCCCCcHHHHHhcCCCCcchhhHHHhhhhhhe Confidence 2223444555556666666443 3559999999999999763 5699999999999999999999999999999 Q ss_pred hcccC Q lcl|NC_015294. 155 IHRRQ 159 (159) Q Consensus 155 V~~k~ 159 (159) |.++. T Consensus 152 v~~~~ 156 (160) T protein:vir:95 152 ISTKK 156 (160) T ss_pred eeccc Confidence 99999 No 12 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=98.60 E-value=2e-10 Score=73.72 Aligned_cols=77 Identities=13% Similarity=0.162 Sum_probs=55.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhc-----C--CCCCcHHHHHhcCCCCCchhHH Q lcl|NC_015294. 73 TFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDD-----Y--PGSNSPAWAAYKGFNDPLFHTG 145 (159) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~-----~--~~Pna~~Ti~~KG~~~PLiDTG 145 (159) .+.. ...+.+.++ ++.. 
+....|..+|......++..|.+ | |+|+||+|+++||.++||+||| T Consensus 1 ~i~~---~~~i~~~l~----~l~~---~~~~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~~~~L~~tG 70 (145) T protein:vir:31 1 MVED---ENNIPEARE----AIQD---GLTDGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGSDTPLIDNS 70 (145) T ss_pred Cccc---HHHHHHHHH----HHHH---HHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcCCCCCccCH Confidence 1211 112333333 3322 23457888999999999988863 2 5699999999999999999999 Q ss_pred HHHhhhhhhhcccC Q lcl|NC_015294. 146 KMLESVKFQIHRRQ 159 (159) Q Consensus 146 ~L~~SIty~V~~k~ 159 (159) .|++||+|.+.... T Consensus 71 ~L~~Si~~~~~~~~ 84 (145) T protein:vir:31 71 RLLTDINAASMMDR 84 (145) T ss_pred HHHHHHHHHhhhcc Confidence 99999999985332 No 13 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=98.37 E-value=3.6e-10 Score=72.37 Aligned_cols=103 Identities=17% Similarity=0.280 Sum_probs=53.6 Q ss_pred ccCCceeeehHHHHHHHHHHHHHhhCC----------------------------------------------------- Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLIKRVEALDGT----------------------------------------------------- 29 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~~l~~l~~~----------------------------------------------------- 29 (159) |..+|+++. .+|++|.+.|+.|... T Consensus 1 Ma~~~~~~i--~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~ 78 (164) T protein:vir:43 1 MADTVEFSI--TGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRT 78 (164) T ss_pred CCcceEEee--ecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccc Confidence 667776664 4555555555544211 Q ss_pred ---EEEEEecccccCC-----CCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCH Q lcl|NC_015294. 
30 ---TVEVGFFPEDRYG-----SENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQT 101 (159) Q Consensus 30 ---~v~VGi~~~~~~~-----~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 101 (159) ...||+..+.... .....-+.+.++.++||||.++||||||||+++++.. .+.+.+.+.+... . T Consensus 79 ~~~~~~vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT~km~a~PFlrPA~~~~k~--~~~~~~~~~l~~~------i 150 (164) T protein:vir:43 79 GDLGFRIGVLHGAVLPKKGERSDKTANAPTPHWRLLEFGTEDMRAQPFMRSALADNIA--EVTSTFVSEYEKG------I 150 (164) T ss_pred cceeEEecccccccccccccccccCCCCCcceEEEeecCCCCCCCCcchhhhHHHhHH--HHHHHHHHHHHHH------H Confidence 1122221111000 0001112345788999999999999999999998753 3444444433332 2 Q ss_pred HHHHHHHHHHHHHH Q lcl|NC_015294. 102 NTLLKKLGKMVAEQ 115 (159) Q Consensus 102 ~~~l~~iG~~~~~~ 115 (159) +.+|.+.+..++.- T Consensus 151 ~ka~~k~~~~~~~~ 164 (164) T protein:vir:43 151 DRAIKRAAKKAAQG 164 (164) T ss_pred HHHHHHHHhhhccC Confidence 34444444443333 No 14 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=98.37 E-value=8.9e-10 Score=70.20 Aligned_cols=86 Identities=16% Similarity=0.113 Sum_probs=60.0 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhc-C--CCCCcHHHHHhc-----CCCCCc Q lcl|NC_015294. 70 MAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDD-Y--PGSNSPAWAAYK-----GFNDPL 141 (159) Q Consensus 70 lr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~-~--~~Pna~~Ti~~K-----G~~~PL 141 (159) |-..++-.-....+. +.+.++.....+...+|..||..+...++..|.. 
| |+|+||+|+++| +..++| T Consensus 1 M~~~i~i~~d~~~~~----~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL 76 (155) T protein:vir:79 1 MTTRIDVELDDQEVR----QRLAVLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPIL 76 (155) T ss_pred CceEEEEEechHHHH----HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCcc Confidence 321111000001233 3344443333477899999999999999999963 4 679999999866 346899 Q ss_pred hhHHHHHhhhhhhhcccC Q lcl|NC_015294. 142 FHTGKMLESVKFQIHRRQ 159 (159) Q Consensus 142 iDTG~L~~SIty~V~~k~ 159 (159) +|||.|++||+|.+.... T Consensus 77 ~~tG~L~~Si~~~~~~~~ 94 (155) T protein:vir:79 77 QVTNALARSVTTWADRNE 94 (155) T ss_pred ccchhhhhhhhceecCCE Confidence 999999999999998877 No 15 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=98.36 E-value=9.9e-10 Score=69.96 Aligned_cols=86 Identities=14% Similarity=0.093 Sum_probs=60.5 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcC----CCCCcHHHHHhc---------- Q lcl|NC_015294. 70 MAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDY----PGSNSPAWAAYK---------- 135 (159) Q Consensus 70 lr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~----~~Pna~~Ti~~K---------- 135 (159) |-..++ .+-. ...+.+.+.++.....+...+|..||..+...++..|.+. |+|+||+|+++| T Consensus 1 Ms~~i~-i~~d---~~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~ 76 (175) T protein:vir:79 1 MSDFVN-FQID---DSALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKN 76 (175) T ss_pred CceEEE-EEec---hHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhcccccccccc Confidence 221110 1000 0123344444444445788999999999999999999753 569999998643 Q ss_pred -----------CCCCCchhHHHHHhhhhhhhcccC Q lcl|NC_015294. 
136 -----------GFNDPLFHTGKMLESVKFQIHRRQ 159 (159) Q Consensus 136 -----------G~~~PLiDTG~L~~SIty~V~~k~ 159 (159) +..++|+|||.|++||+|.+.... T Consensus 77 ~~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~ 111 (175) T protein:vir:79 77 GELTAAASRRKAGLMILQDSGQMAASTATDSGEDY 111 (175) T ss_pred ccchhhHhhhccCCCcceechhhhhhhhheecCCE Confidence 457899999999999999998877 No 16 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.33 E-value=9.5e-10 Score=70.06 Aligned_cols=84 Identities=14% Similarity=0.207 Sum_probs=59.7 Q ss_pred hhHHHHHHH-HHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhc-----C--CCCCcHHHHHhc--CCCC Q lcl|NC_015294. 70 MAPTFEEFT-SQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDD-----Y--PGSNSPAWAAYK--GFND 139 (159) Q Consensus 70 lr~~~~~~~-~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~-----~--~~Pna~~Ti~~K--G~~~ 139 (159) |- ++. .+ .-..+.+.+..++..+ .+...++..||..+...+++.|++ | |+|++|+|+++| +..+ T Consensus 1 M~-~i~-i~~d~~~~~~~L~~l~~~~----~~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~~~ 74 (190) T protein:vir:99 1 MA-GIT-LEWDGRRALDVLNAGSAAL----GDPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNRDK 74 (190) T ss_pred Cc-eeE-EEecHHHHHHHHHHHHHHh----hhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCCCc Confidence 21 110 00 0012333344444433 367899999999999999999964 2 569999999766 5679 Q ss_pred CchhHHHHHhhhhhhhcccC Q lcl|NC_015294. 140 PLFHTGKMLESVKFQIHRRQ 159 (159) Q Consensus 140 PLiDTG~L~~SIty~V~~k~ 159 (159) +|+|||.|++||+|.+.... 
T Consensus 75 ~L~~tg~L~~Si~~~~~~~~ 94 (190) T protein:vir:99 75 ILTLDGHLRNLLRYQLDGSE 94 (190) T ss_pred cceecHHHHHHHhheecCcE Confidence 99999999999999998877 No 17 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=98.31 E-value=1.4e-09 Score=69.20 Aligned_cols=86 Identities=14% Similarity=0.084 Sum_probs=60.3 Q ss_pred CCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhc-C--CCCCcHHHHHh-----c Q lcl|NC_015294. 64 NPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDD-Y--PGSNSPAWAAY-----K 135 (159) Q Consensus 64 IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~-~--~~Pna~~Ti~~-----K 135 (159) .- .++.-.++.. .+. +.+.++.....+...+|..||..+...++..|.. + |+|+||.|+++ + T Consensus 1 Ms--~~i~i~~~~~----~~~----~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~ 70 (155) T protein:vir:10 1 MA--NRIELELVDR----EVQ----ERLAALYAAVTDTLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGR 70 (155) T ss_pred CC--ceEEEEechH----HHH----HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccC Confidence 11 1122222211 123 3444443333477899999999999999999963 3 67999999864 3 Q ss_pred CCCCCchhHHHHHhhhhhhhcccC Q lcl|NC_015294. 136 GFNDPLFHTGKMLESVKFQIHRRQ 159 (159) Q Consensus 136 G~~~PLiDTG~L~~SIty~V~~k~ 159 (159) |..++|+|||.|++||+|.+.... 
T Consensus 71 ~~~~~L~~tG~L~~Si~~~~~~~~ 94 (155) T protein:vir:10 71 GAHPILQVTNALARSITTRADRDQ 94 (155) T ss_pred CCCCccccchhhhhhhhceecCCE Confidence 567899999999999999998877 No 18 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=98.28 E-value=1.9e-09 Score=68.41 Aligned_cols=86 Identities=14% Similarity=0.124 Sum_probs=60.0 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhh-cC--CCCCcHHHHHhc-----CCCCCc Q lcl|NC_015294. 70 MAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNID-DY--PGSNSPAWAAYK-----GFNDPL 141 (159) Q Consensus 70 lr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~-~~--~~Pna~~Ti~~K-----G~~~PL 141 (159) |-.-++ .+. .. +.+.+.+.++.....+...+|..||..+...++..|. +| |+|+||+|+++| +..++| T Consensus 1 Ms~~i~-i~~--d~-~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL 76 (155) T protein:vir:99 1 MTTRID-VEL--DD-QEVRQRLALLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPIL 76 (155) T ss_pred CceEEE-EEe--ch-HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcc Confidence 321111 100 00 1233444444433446789999999999999999996 34 569999999866 246799 Q ss_pred hhHHHHHhhhhhhhcccC Q lcl|NC_015294. 142 FHTGKMLESVKFQIHRRQ 159 (159) Q Consensus 142 iDTG~L~~SIty~V~~k~ 159 (159) +|||.|++||+|.+.+.. 
T Consensus 77 ~~tg~L~~Si~~~~~~~~ 94 (155) T protein:vir:99 77 QVTNALARSVTTWADRNE 94 (155) T ss_pred hhchhhhhhhhceecCCE Confidence 999999999999998777 No 19 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=98.26 E-value=3.6e-09 Score=66.87 Aligned_cols=85 Identities=20% Similarity=0.378 Sum_probs=48.0 Q ss_pred ccCCceeeehHHHHHHHHHHHHHhhCC----------------------------------------------------- Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLIKRVEALDGT----------------------------------------------------- 29 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~~l~~l~~~----------------------------------------------------- 29 (159) |..+++++. .+|++|++.|+.|... T Consensus 1 Ma~~~~~~i--~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~ 78 (146) T protein:vir:10 1 MADGIDLDL--LGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTK 78 (146) T ss_pred CCCceeeee--hhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecc Confidence 556666553 3444444444433111 Q ss_pred --------EEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCH Q lcl|NC_015294. 30 --------TVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQT 101 (159) Q Consensus 30 --------~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 101 (159) .+.||+-.. ++ +.+.++.+.||||.+.||+|||+|+++++.. 
.+.+.+.+.+.+.+.= T Consensus 79 ~~~~~g~~~~~vg~~~~-------~~-~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~--~~~~~~~~~l~~~l~k---- 144 (146) T protein:vir:10 79 AKLEGGIKTVKIGLNKA-------DR-SPWFYLKFHEWGTSKMPAHPFIEPGFNASKA--EAVRAMTDILKNEMRL---- 144 (146) T ss_pred ccccccceeEEeeeccC-------CC-CCcceeeeeccCCCCCCCCcchhHHHHHhHH--HHHHHHHHHHHHHHhh---- Confidence 112222110 11 2356888899999999999999999998753 4555555555544322 Q ss_pred HHHH Q lcl|NC_015294. 102 NTLL 105 (159) Q Consensus 102 ~~~l 105 (159) +| T Consensus 145 --a~ 146 (146) T protein:vir:10 145 --DL 146 (146) T ss_pred --cC Confidence 22 No 20 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=98.26 E-value=3.6e-09 Score=66.87 Aligned_cols=85 Identities=20% Similarity=0.378 Sum_probs=48.0 Q ss_pred ccCCceeeehHHHHHHHHHHHHHhhCC----------------------------------------------------- Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLIKRVEALDGT----------------------------------------------------- 29 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~~l~~l~~~----------------------------------------------------- 29 (159) |..+++++. .+|++|++.|+.|... T Consensus 1 Ma~~~~~~i--~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~ 78 (146) T protein:vir:10 1 MADGIDLDL--LGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTK 78 (146) T ss_pred CCCceeeee--hhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecc Confidence 556666553 3444444444433111 Q ss_pred --------EEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCH Q lcl|NC_015294. 30 --------TVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQT 101 (159) Q Consensus 30 --------~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 101 (159) .+.||+-.. 
++ +.+.++.+.||||.+.||+|||+|+++++.. .+.+.+.+.+.+.+.= T Consensus 79 ~~~~~g~~~~~vg~~~~-------~~-~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~--~~~~~~~~~l~~~l~k---- 144 (146) T protein:vir:10 79 AKLEGGIKTVKIGLNKA-------DR-SPWFYLKFHEWGTSKMPAHPFIEPGFNASKA--EAVRAMTDILKNEMRL---- 144 (146) T ss_pred ccccccceeEEeeeccC-------CC-CCcceeeeeccCCCCCCCCcchhHHHHHhHH--HHHHHHHHHHHHHHhh---- Confidence 112222110 11 2356888899999999999999999998753 4555555555544322 Q ss_pred HHHH Q lcl|NC_015294. 102 NTLL 105 (159) Q Consensus 102 ~~~l 105 (159) +| T Consensus 145 --a~ 146 (146) T protein:vir:10 145 --DL 146 (146) T ss_pred --cC Confidence 22 No 21 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=98.26 E-value=3.6e-09 Score=66.87 Aligned_cols=85 Identities=20% Similarity=0.378 Sum_probs=48.0 Q ss_pred ccCCceeeehHHHHHHHHHHHHHhhCC----------------------------------------------------- Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLIKRVEALDGT----------------------------------------------------- 29 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~~l~~l~~~----------------------------------------------------- 29 (159) |..+++++. .+|++|++.|+.|... T Consensus 1 Ma~~~~~~i--~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~ 78 (146) T protein:vir:10 1 MADGIDLDL--LGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTK 78 (146) T ss_pred CCCceeeee--hhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecc Confidence 556666553 3444444444433111 Q ss_pred --------EEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCH Q lcl|NC_015294. 30 --------TVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQT 101 (159) Q Consensus 30 --------~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 101 (159) .+.||+-.. 
++ +.+.++.+.||||.+.||+|||+|+++++.. .+.+.+.+.+.+.+.= T Consensus 79 ~~~~~g~~~~~vg~~~~-------~~-~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~--~~~~~~~~~l~~~l~k---- 144 (146) T protein:vir:10 79 AKLEGGIKTVKIGLNKA-------DR-SPWFYLKFHEWGTSKMPAHPFIEPGFNASKA--EAVRAMTDILKNEMRL---- 144 (146) T ss_pred ccccccceeEEeeeccC-------CC-CCcceeeeeccCCCCCCCCcchhHHHHHhHH--HHHHHHHHHHHHHHhh---- Confidence 112222110 11 2356888899999999999999999998753 4555555555544322 Q ss_pred HHHH Q lcl|NC_015294. 102 NTLL 105 (159) Q Consensus 102 ~~~l 105 (159) +| T Consensus 145 --a~ 146 (146) T protein:vir:10 145 --DL 146 (146) T ss_pred --cC Confidence 22 No 22 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=98.26 E-value=3.6e-09 Score=66.87 Aligned_cols=85 Identities=20% Similarity=0.378 Sum_probs=48.0 Q ss_pred ccCCceeeehHHHHHHHHHHHHHhhCC----------------------------------------------------- Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLIKRVEALDGT----------------------------------------------------- 29 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~~l~~l~~~----------------------------------------------------- 29 (159) |..+++++. .+|++|++.|+.|... T Consensus 1 Ma~~~~~~i--~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~ 78 (146) T protein:vir:10 1 MADGIDLDL--LGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTK 78 (146) T ss_pred CCCceeeee--hhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecc Confidence 556666553 3444444444433111 Q ss_pred --------EEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCH Q lcl|NC_015294. 30 --------TVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQT 101 (159) Q Consensus 30 --------~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 101 (159) .+.||+-.. 
++ +.+.++.+.||||.+.||+|||+|+++++.. .+.+.+.+.+.+.+.= T Consensus 79 ~~~~~g~~~~~vg~~~~-------~~-~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~--~~~~~~~~~l~~~l~k---- 144 (146) T protein:vir:10 79 AKLEGGIKTVKIGLNKA-------DR-SPWFYLKFHEWGTSKMPAHPFIEPGFNASKA--EAVRAMTDILKNEMRL---- 144 (146) T ss_pred ccccccceeEEeeeccC-------CC-CCcceeeeeccCCCCCCCCcchhHHHHHhHH--HHHHHHHHHHHHHHhh---- Confidence 112222110 11 2356888899999999999999999998753 4555555555544322 Q ss_pred HHHH Q lcl|NC_015294. 102 NTLL 105 (159) Q Consensus 102 ~~~l 105 (159) +| T Consensus 145 --a~ 146 (146) T protein:vir:10 145 --DL 146 (146) T ss_pred --cC Confidence 22 No 23 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=98.23 E-value=4.8e-09 Score=66.21 Aligned_cols=87 Identities=21% Similarity=0.360 Sum_probs=47.9 Q ss_pred ccCCceeeehHHHHHHHHHHHHHhhC-C---------------------------------------------------- Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLIKRVEALDG-T---------------------------------------------------- 29 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~~l~~l~~-~---------------------------------------------------- 29 (159) |..+++++. .+|++|++.|+.|.. . T Consensus 1 Ma~~~~~~i--~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~ 78 (149) T protein:vir:13 1 MSDGWEIKF--EGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEP 78 (149) T ss_pred CCceeEEEe--ecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceec Confidence 666766663 445555555544421 0 Q ss_pred ---------EEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_015294. 
30 ---------TVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQ 100 (159) Q Consensus 30 ---------~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 100 (159) .+.||+..+ ++ +.+.++.+.||||.+.||+|||||++++... .+.+.+.+.+.+.+.- T Consensus 79 ~~~~~~g~~~~~VG~~~~-------~~-~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~--~~~~~~~~~l~k~i~~--- 145 (149) T protein:vir:13 79 KIRKKKGNLQCVVGWEKS-------DN-TPFYYMKMEEWGTSERPPHHAFGKTNKILKR--VYDNIAQKKYDNFVKE--- 145 (149) T ss_pred ccccccceeEEEeeccCC-------CC-CccceeeeeccCccCCCCCccchHHHHHHHH--HHHHHHHHHHHHHHHH--- Confidence 123333211 11 2356888899999999999999999988753 3444444433332221 Q ss_pred HHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_015294. 101 TNTLLKKLGKMVAEQMQVNIDD 122 (159) Q Consensus 101 ~~~~l~~iG~~~~~~i~~~I~~ 122 (159) .+.+ T Consensus 146 ------------------~lG~ 149 (149) T protein:vir:13 146 ------------------KLGD 149 (149) T ss_pred ------------------HhcC Confidence 1111 No 24 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=98.22 E-value=3e-09 Score=67.30 Aligned_cols=85 Identities=8% Similarity=0.083 Sum_probs=58.4 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhc------C--CCCCcHHHHHhcC----- Q lcl|NC_015294. 70 MAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDD------Y--PGSNSPAWAAYKG----- 136 (159) Q Consensus 70 lr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~------~--~~Pna~~Ti~~KG----- 136 (159) |...++-......+.+ .+.++.. ..+...++..||..+...+++.|.+ | |+|++|+|+++|. 
T Consensus 1 ms~~i~~~~d~~~l~~----~L~~l~~-~~~~~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~ 75 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQL----ALDELGT-VTRDRAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFV 75 (156) T ss_pred CeEEEEEeecHHHHHH----HHHHHHh-hhccHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCC Confidence 4333221101112333 3333322 2234579999999999999999963 4 5699999999873 Q ss_pred CCCCchhHHHHHhhhhhhhcccC Q lcl|NC_015294. 137 FNDPLFHTGKMLESVKFQIHRRQ 159 (159) Q Consensus 137 ~~~PLiDTG~L~~SIty~V~~k~ 159 (159) ..+||+|||.|++||+|.+.... T Consensus 76 ~~~~L~~tg~L~~Si~~~~~~~~ 98 (156) T protein:vir:19 76 PGSILTLHGDLARSITTDYGQDY 98 (156) T ss_pred CCcchhhhHHHHHHhhheecCCE Confidence 35899999999999999998877 No 25 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=98.21 E-value=2.8e-09 Score=67.47 Aligned_cols=92 Identities=22% Similarity=0.455 Sum_probs=56.7 Q ss_pred ccCCceeeehHHHHHHHHHHHHHhhCC------------------------EEEEEecccccC----CCCCCC-----CC Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLIKRVEALDGT------------------------TVEVGFFPEDRY----GSENGN-----LP 49 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~~l~~l~~~------------------------~v~VGi~~~~~~----~~~~~G-----~~ 49 (159) |-+.++|+. .+|++|.+.|+.+.+. -+.-|.+.++=. ...++| -+ T Consensus 1 Ma~~~~i~~--~Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~~~~~~~~v~~ 78 (125) T protein:vir:94 1 MANDFNIKF--KGVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVKEEHGVVTGRYVA 78 (125) T ss_pred CCCceeeee--hhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecceeccCCcEEEEeeC Confidence 666677775 3566666666554221 011222211100 001111 23 Q ss_pred HHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_015294. 
50 VAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDG 98 (159) Q Consensus 50 ~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~ 98 (159) .+.+|.+.||||...|+||||+|+++++. ..+.+.++..+..++.-. T Consensus 79 ~~~Ya~~vEfGT~~~~a~Pfl~pa~~~~~--~~~~~~l~~~l~~a~k~~ 125 (125) T protein:vir:94 79 RADYSSYNEYGTYRMSAQPFMAPSVAAMT--PFFYKAVRDALNKAAKFS 125 (125) T ss_pred CCCccceeecccccCCCCcccchhHHHHH--HHHHHHHHHHHHHHhccC Confidence 46789999999999999999999999875 457777888887777654 No 26 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=98.19 E-value=1.9e-09 Score=68.45 Aligned_cols=103 Identities=19% Similarity=0.290 Sum_probs=50.1 Q ss_pred ccCCceeeehHHHHHHHHHHHHHhhCC----------------------------------------------------- Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLIKRVEALDGT----------------------------------------------------- 29 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~~l~~l~~~----------------------------------------------------- 29 (159) |..+|+++. .+|++|.+.|+.|.+. T Consensus 1 Ma~~~~~~i--~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~ 78 (179) T protein:vir:18 1 MADSVEVSL--TGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRT 78 (179) T ss_pred CCceEEEEe--ecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccc Confidence 556776664 4455555555444211 Q ss_pred ---EEEEEecccccC--------------------CCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_015294. 30 ---TVEVGFFPEDRY--------------------GSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARL 86 (159) Q Consensus 30 ---~v~VGi~~~~~~--------------------~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~ 86 (159) .+.||+..+... +....+-..+.++.+.||||.++||||||||+++++.. 
T Consensus 79 g~~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~~------- 151 (179) T protein:vir:18 79 GDLAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVDN------- 151 (179) T ss_pred cceeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccchhhHHhhHH------- Confidence 122333211100 00001122355677889999999999999999987642 Q ss_pred HHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCC Q lcl|NC_015294. 87 MKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFND 139 (159) Q Consensus 87 ~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~ 139 (159) ++++.+...+...|++.+.... .||-+- T Consensus 152 ----------------~a~~~i~~~l~~~i~k~lk~~~---------~~~~~~ 179 (179) T protein:vir:18 152 ----------------DVINVFSTEMGKAIDRAIRLAM---------KKGTTA 179 (179) T ss_pred ----------------HHHHHHHHHHHHHHHHHHHhhc---------ccCCCC Confidence 2333334444444444443211 011000 No 27 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=98.03 E-value=9e-09 Score=64.68 Aligned_cols=86 Identities=15% Similarity=0.122 Sum_probs=61.3 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcC----CCCCcHHHHHhc---------- Q lcl|NC_015294. 70 MAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDY----PGSNSPAWAAYK---------- 135 (159) Q Consensus 70 lr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~----~~Pna~~Ti~~K---------- 135 (159) |--.+ +.+.. ...+.+.+.++.....+...+|..||..+....+..|.+. 
|.|.+|+|+++| T Consensus 1 Ms~~i-~i~~~---~~~l~~~L~~l~~~~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~ 76 (175) T protein:vir:10 1 MSDFV-NFQID---DSALRTRLLQLEQAGHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKN 76 (175) T ss_pred CceeE-EEEec---HHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhh Confidence 22111 11100 1224455555555455788999999999999999999753 468999998632 Q ss_pred -----------CCCCCchhHHHHHhhhhhhhcccC Q lcl|NC_015294. 136 -----------GFNDPLFHTGKMLESVKFQIHRRQ 159 (159) Q Consensus 136 -----------G~~~PLiDTG~L~~SIty~V~~k~ 159 (159) +..++|+|||.|++||+|.+.+.. T Consensus 77 ~~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~ 111 (175) T protein:vir:10 77 GELTAAASRRKAGLMILQDSGQMAASVSTDHDDNS 111 (175) T ss_pred hhhhhhhhhhccCCCcceechhhhhhhheeecCCE Confidence 457899999999999999998777 No 28 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=97.97 E-value=2.6e-08 Score=62.13 Aligned_cols=94 Identities=24% Similarity=0.351 Sum_probs=44.4 Q ss_pred cccCCceeeehHHHHHHHHHHHHHhhCC----------------------------------EEE--------------- Q lcl|NC_015294. 2 IILASFSFKTDRRRLTSLIKRVEALDGT----------------------------------TVE--------------- 32 (159) Q Consensus 2 ~m~~~~~~k~~~~~l~~l~~~l~~l~~~----------------------------------~v~--------------- 32 (159) ||...|+|+ +|++|.+.|+.|.+. .+. T Consensus 1 mm~~~~~i~----Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~ 76 (149) T protein:vir:19 1 MIETSLDFS----GLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSG 76 (149) T ss_pred Ccceeeehh----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeec Confidence 444344443 444444444443211 000 Q ss_pred EEecccc--cCCC----CCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHH Q lcl|NC_015294. 
33 VGFFPED--RYGS----ENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLK 106 (159) Q Consensus 33 VGi~~~~--~~~~----~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~ 106 (159) |++.... .... ...+-+.+.++.+.||||.++||+|||+|+++++.. .+.+.+.+.+.+.+. .++. T Consensus 77 v~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PF~~pA~~~~k~--~~~~~~~~~l~~~l~------k~~~ 148 (149) T protein:vir:19 77 VHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREE--EAASVAIARMNQAID------EVLS 148 (149) T ss_pred ccccccccccccccceeecCCCCccceeeeeccCCCCCCCCcchhHHHHHHHH--HHHHHHHHHHHHHHH------HHhc Confidence 0000000 0000 000113356788999999999999999999998743 344444444433221 2222 Q ss_pred H Q lcl|NC_015294. 107 K 107 (159) Q Consensus 107 ~ 107 (159) + T Consensus 149 k 149 (149) T protein:vir:19 149 K 149 (149) T ss_pred C Confidence 2 No 29 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=97.95 E-value=1.7e-08 Score=63.15 Aligned_cols=94 Identities=21% Similarity=0.357 Sum_probs=44.4 Q ss_pred cccCCceeeehHHHHHHHHHHHHHhhCC----------------------------------EEEE-------------- Q lcl|NC_015294. 2 IILASFSFKTDRRRLTSLIKRVEALDGT----------------------------------TVEV-------------- 33 (159) Q Consensus 2 ~m~~~~~~k~~~~~l~~l~~~l~~l~~~----------------------------------~v~V-------------- 33 (159) ||...++|+ +|++|.+.|+.|.+. 
.+.+ T Consensus 1 mm~~~~~i~----Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v 76 (148) T protein:vir:93 1 MIETLLDFS----GLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGV 76 (148) T ss_pred Ccceeeeeh----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeee Confidence 444444444 344444444333211 0111 Q ss_pred Eeccccc--CCC----CCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHH Q lcl|NC_015294. 34 GFFPEDR--YGS----ENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKK 107 (159) Q Consensus 34 Gi~~~~~--~~~----~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~ 107 (159) ++..... ... ...+...+.++.+.||||.+.||||||+|+++++.. .+.+.+.+.+.+.+ +.+|.+ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~pa~PFl~pA~~~~k~--~~~~~~~~~~~~~i------~k~~~k 148 (148) T protein:vir:93 77 HIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSE--QAAQVAIARMNRAI------DEVLRR 148 (148) T ss_pred eecccccccccccceeecCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHH--HHHHHHHHHHHHHH------HHHhcC Confidence 1100000 000 001223466888999999999999999999998743 34444444333322 122222 No 30 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=97.93 E-value=5.4e-08 Score=60.43 Aligned_cols=90 Identities=22% Similarity=0.273 Sum_probs=51.0 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHhh------------------------------------------------CCEEE Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEALD------------------------------------------------GTTVE 32 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l~------------------------------------------------~~~v~ 32 (159) |+ .++++ +|++|.+.|+.|. ...+. 
T Consensus 1 M~---~~~i~----Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~ 73 (140) T protein:vir:14 1 MS---SIQII----GLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLAT 73 (140) T ss_pred Cc---eeeeh----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccceeEE Confidence 33 23333 2333333333221 11233 Q ss_pred EEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHH--HHHHHHHHHHHHHHHhCCC Q lcl|NC_015294. 33 VGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQ--FHYARLMKSTFENVLRDGR 99 (159) Q Consensus 33 VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~--~~~~~~~~~~~~~~~~g~~ 99 (159) ||+..+... ..++-+.+.++.+.||||.++||||||+|+++++..+ ..+.+.++..+.+++.|.. T Consensus 74 vg~~~~~~~--~~~~~~~~~y~~f~E~GT~~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:14 74 AGVRVRTKG--KADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred eeeeecccc--ccCCCCccceeeeeccccCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 333222110 0122345678899999999999999999999887432 2345666677777888876 No 31 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=97.84 E-value=1.2e-07 Score=58.45 Aligned_cols=90 Identities=20% Similarity=0.191 Sum_probs=49.4 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHhhC------------------------------------------------CEEE Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEALDG------------------------------------------------TTVE 32 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l~~------------------------------------------------~~v~ 32 (159) |+ .++++. |++|.+.|+.|.. ..+. 
T Consensus 1 Ma---~~~i~G----ld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~ 73 (140) T protein:vir:10 1 MS---SVQILG----LADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDSPGIAT 73 (140) T ss_pred Cc---eeeehh----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceecccccccccceeE Confidence 33 233332 2222222222110 1223 Q ss_pred EEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHH--HHHHHHHHHHHHHHHhCCC Q lcl|NC_015294. 33 VGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQ--FHYARLMKSTFENVLRDGR 99 (159) Q Consensus 33 VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~--~~~~~~~~~~~~~~~~g~~ 99 (159) ||+..+.... ..+.+.+.++.+.||||.+.||+|||+|+++++..+ ..+.+.+++.+.+++.|+. T Consensus 74 ~~~~~~~~~~--~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 74 AGVRVRTKGK--ADSPNNAFYWRFVELGTQFMKAEPFMRPAFDASIAQAEGAIRTEIARAIDQVVGGGL 140 (140) T ss_pred Eeeccccccc--cCCCCcccccceeccCcCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 3332211110 123456778999999999999999999999887432 2234555566677788877 No 32 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=97.84 E-value=6.6e-08 Score=59.95 Aligned_cols=85 Identities=15% Similarity=0.345 Sum_probs=49.4 Q ss_pred ceeeehHHHHHHHHHHHHHhhCC------------------------EEEEEecccccCCCCCCC-----CCHHHHHHHH Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDGT------------------------TVEVGFFPEDRYGSENGN-----LPVAQVAAYN 57 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~~------------------------~v~VGi~~~~~~~~~~~G-----~~~A~iA~~~ 57 (159) |+++. .+|++|.+.|+.+.+. -|.-|.+.++-. ...+| .+.+.+|.+. 
T Consensus 1 msi~i--~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~-~~~~g~~~~V~~~~~Ya~yv 77 (114) T protein:vir:95 1 MAIKW--QGIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHIT-TSYPGMEAHIHGEAGYDGYQ 77 (114) T ss_pred Ceeee--ehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcee-eecCceEEEeecCCCcccee Confidence 44444 2455555555443321 011122211100 00111 2346789999 Q ss_pred hcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015294. 58 EFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLR 96 (159) Q Consensus 58 E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~ 96 (159) ||||...|+||||+|+++++.. .+.+.++..+...+. T Consensus 78 E~GT~~~~aqPfl~pa~~~~~~--~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 78 EYGTRFQPGTPHFRPMMEQIQP--QFQKDMTDVMKGAFK 114 (114) T ss_pred ecCccccCCCccchhhHHHHHH--HHHHHHHHHHHhhcC Confidence 9999999999999999998753 466667776666665 No 33 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=97.83 E-value=9.9e-08 Score=58.97 Aligned_cols=90 Identities=22% Similarity=0.275 Sum_probs=48.5 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHhhC------------------------------------------------CEEE Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEALDG------------------------------------------------TTVE 32 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l~~------------------------------------------------~~v~ 32 (159) |+ .++++ +|++|.+.|+.|.+ ..+. T Consensus 1 Ma---~~~i~----Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~ 73 (140) T protein:vir:10 1 MS---SIQII----GLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLAT 73 (140) T ss_pred Cc---eeeeh----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccccccceEE Confidence 33 23333 23333322222211 1223 Q ss_pred EEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHH--HHHHHHHHHHHHHHHhCCC Q lcl|NC_015294. 
33 VGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQ--FHYARLMKSTFENVLRDGR 99 (159) Q Consensus 33 VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~--~~~~~~~~~~~~~~~~g~~ 99 (159) ||+...... ..++.+.+.++.+.||||.++||+|||+|+++++..+ ..+.+.+++.+.+++.|.. T Consensus 74 ~g~~~~~~~--~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 74 AGVRVRTKG--KADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred eeeeecccc--ccCCCCccceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 333211100 0112345678999999999999999999999987542 2234455566677777776 No 34 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=97.80 E-value=6.9e-08 Score=59.83 Aligned_cols=77 Identities=16% Similarity=0.177 Sum_probs=47.2 Q ss_pred ceeeehHHHHHHHHHHHHHhhC------------------------------------------------CEEEEEeccc Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDG------------------------------------------------TTVEVGFFPE 38 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~------------------------------------------------~~v~VGi~~~ 38 (159) |+|+.+.++|...++.|..... ..|.||+. T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~-- 78 (125) T protein:vir:47 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYA-- 78 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccC-- Confidence 4455544444444433322110 01222221 Q ss_pred ccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015294. 39 DRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLR 96 (159) Q Consensus 39 ~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~ 96 (159) -+.+.+|.+.||||.++||+||+|+++++.. .++.+.+...+.++.. 
T Consensus 79 ---------k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~--~ev~~~~~~~lrk~~k 125 (125) T protein:vir:47 79 ---------KGVSHRIHATEFGTMYQKPQLFITKTEKQGK--NKVLKTMLDTAKRLQK 125 (125) T ss_pred ---------CCCceEEEeccCCccCCCCCchhhHHHHHhH--HHHHHHHHHHHHHHhC Confidence 1123466789999999999999999999875 4567788888888766 No 35 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=97.80 E-value=6.9e-08 Score=59.83 Aligned_cols=77 Identities=16% Similarity=0.177 Sum_probs=47.2 Q ss_pred ceeeehHHHHHHHHHHHHHhhC------------------------------------------------CEEEEEeccc Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDG------------------------------------------------TTVEVGFFPE 38 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~------------------------------------------------~~v~VGi~~~ 38 (159) |+|+.+.++|...++.|..... ..|.||+. T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~-- 78 (125) T protein:vir:81 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYA-- 78 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccC-- Confidence 4455544444444433322110 01222221 Q ss_pred ccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015294. 39 DRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLR 96 (159) Q Consensus 39 ~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~ 96 (159) -+.+.+|.+.||||.++||+||+|+++++.. .++.+.+...+.++.. 
T Consensus 79 ---------k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~--~ev~~~~~~~lrk~~k 125 (125) T protein:vir:81 79 ---------KGVSHRIHATEFGTMYQKPQLFITKTEKQGK--NKVLKTMLDTAKRLQK 125 (125) T ss_pred ---------CCCceEEEeccCCccCCCCCchhhHHHHHhH--HHHHHHHHHHHHHHhC Confidence 1123466789999999999999999999875 4567788888888766 No 36 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=97.80 E-value=6.9e-08 Score=59.83 Aligned_cols=77 Identities=16% Similarity=0.177 Sum_probs=47.2 Q ss_pred ceeeehHHHHHHHHHHHHHhhC------------------------------------------------CEEEEEeccc Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDG------------------------------------------------TTVEVGFFPE 38 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~------------------------------------------------~~v~VGi~~~ 38 (159) |+|+.+.++|...++.|..... ..|.||+. T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~-- 78 (125) T protein:vir:79 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYA-- 78 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccC-- Confidence 4455544444444433322110 01222221 Q ss_pred ccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015294. 39 DRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLR 96 (159) Q Consensus 39 ~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~ 96 (159) -+.+.+|.+.||||.++||+||+|+++++.. .++.+.+...+.++.. 
T Consensus 79 ---------k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~--~ev~~~~~~~lrk~~k 125 (125) T protein:vir:79 79 ---------KGVSHRIHATEFGTMYQKPQLFITKTEKQGK--NKVLKTMLDTAKRLQK 125 (125) T ss_pred ---------CCCceEEEeccCCccCCCCCchhhHHHHHhH--HHHHHHHHHHHHHHhC Confidence 1123466789999999999999999999875 4567788888888766 No 37 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=97.80 E-value=6.9e-08 Score=59.83 Aligned_cols=77 Identities=16% Similarity=0.177 Sum_probs=47.2 Q ss_pred ceeeehHHHHHHHHHHHHHhhC------------------------------------------------CEEEEEeccc Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDG------------------------------------------------TTVEVGFFPE 38 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~------------------------------------------------~~v~VGi~~~ 38 (159) |+|+.+.++|...++.|..... ..|.||+. T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~-- 78 (125) T protein:vir:98 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYA-- 78 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccC-- Confidence 4455544444444433322110 01222221 Q ss_pred ccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015294. 39 DRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLR 96 (159) Q Consensus 39 ~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~ 96 (159) -+.+.+|.+.||||.++||+||+|+++++.. .++.+.+...+.++.. 
T Consensus 79 ---------k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~--~ev~~~~~~~lrk~~k 125 (125) T protein:vir:98 79 ---------KGVSHRIHATEFGTMYQKPQLFITKTEKQGK--NKVLKTMLDTAKRLQK 125 (125) T ss_pred ---------CCCceEEEeccCCccCCCCCchhhHHHHHhH--HHHHHHHHHHHHHHhC Confidence 1123466789999999999999999999875 4567788888888766 No 38 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=97.80 E-value=6.9e-08 Score=59.83 Aligned_cols=77 Identities=16% Similarity=0.177 Sum_probs=47.2 Q ss_pred ceeeehHHHHHHHHHHHHHhhC------------------------------------------------CEEEEEeccc Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDG------------------------------------------------TTVEVGFFPE 38 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~------------------------------------------------~~v~VGi~~~ 38 (159) |+|+.+.++|...++.|..... ..|.||+. T Consensus 1 M~v~v~~~~L~~~l~~l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~-- 78 (125) T protein:vir:94 1 MGARIESNNIEQGLKNAVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYA-- 78 (125) T ss_pred CeeEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccC-- Confidence 4455544444444433322110 01222221 Q ss_pred ccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015294. 39 DRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLR 96 (159) Q Consensus 39 ~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~ 96 (159) -+.+.+|.+.||||.++||+||+|+++++.. .++.+.+...+.++.. 
T Consensus 79 ---------k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~--~ev~~~~~~~lrk~~k 125 (125) T protein:vir:94 79 ---------KGVSHRIHATEFGTMYQKPQLFITKTEKQGK--NKVLKTMLDTAKRLQK 125 (125) T ss_pred ---------CCCceEEEeccCCccCCCCCchhhHHHHHhH--HHHHHHHHHHHHHHhC Confidence 1123466789999999999999999999875 4567788888888766 No 39 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=97.79 E-value=3.1e-08 Score=61.75 Aligned_cols=79 Identities=20% Similarity=0.183 Sum_probs=47.4 Q ss_pred CcccCC-ceeeehHHHHHHHHHHHH-------HhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhH Q lcl|NC_015294. 1 MIILAS-FSFKTDRRRLTSLIKRVE-------ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAP 72 (159) Q Consensus 1 m~m~~~-~~~k~~~~~l~~l~~~l~-------~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~ 72 (159) .+.... ..-+. .. .+.+.+. .-....|.||+-.+ .+.++.+.||||.+.||||||+| T Consensus 41 ~~k~~ap~~~~~-tg---~l~~~I~~~~~k~~~~g~~~v~Vg~~~~-----------~~~y~~f~E~GT~~~~a~Pf~~p 105 (127) T protein:vir:12 41 RQRSHVNRSDKK-QP---HMQDNITVSNVRESKDGVRFVAVGPNKK-----------VAYRGRFLEWGTSKMPPQPFIEK 105 (127) T ss_pred HHHHhCCCCCCC-hh---HHHHhhhccccccccCceeEEEEeeCCC-----------CcceeeeeccCccCCCCCccchH Confidence 110000 00011 11 2222221 11334677886422 35688899999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015294. 73 TFEEFTSQFHYARLMKSTFENVLR 96 (159) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~ 96 (159) +++++.. .+.+.+.+.+.+.+. 
T Consensus 106 a~~~~~~--~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 106 GGKEGEG--PAVELMERILTAPIK 127 (127) T ss_pred hHHHHHH--HHHHHHHHHHHHhcC Confidence 9998754 467777777777776 No 40 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=97.77 E-value=3.6e-08 Score=61.39 Aligned_cols=80 Identities=21% Similarity=0.336 Sum_probs=40.8 Q ss_pred ceeeehHHHHHHHHHHHHHhhC------------------------------CEEEEEecccccCCCCCCC-----CCHH Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDG------------------------------TTVEVGFFPEDRYGSENGN-----LPVA 51 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~------------------------------~~v~VGi~~~~~~~~~~~G-----~~~A 51 (159) |++ .+|++|.+.|+.+.+ .-+.-|.+.++=...-++| .+.+ T Consensus 1 i~~----~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~ 76 (115) T protein:vir:96 1 MNI----DGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHA 76 (115) T ss_pred Ccc----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCc Confidence 222 222333332222111 0111222222100000122 2346 Q ss_pred HHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 52 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 92 (159) Q Consensus 52 ~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 92 (159) .+|.+.||||...|+||||+|+++.+. 
..+.+.++++++ T Consensus 77 ~Ya~~vE~GT~km~a~Pfl~PA~~~~~--~~~~~~i~~~~k 115 (115) T protein:vir:96 77 AYSGFLEFGTRYMEAEPFMWPVYEVIR--KSTVEELKALFE 115 (115) T ss_pred cchhhhcccccccCCCCchhhhHHHHH--HHHHHHHHHHhC Confidence 799999999999999999999998774 345555555555 No 41 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=97.77 E-value=3.6e-08 Score=61.39 Aligned_cols=80 Identities=21% Similarity=0.336 Sum_probs=40.8 Q ss_pred ceeeehHHHHHHHHHHHHHhhC------------------------------CEEEEEecccccCCCCCCC-----CCHH Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDG------------------------------TTVEVGFFPEDRYGSENGN-----LPVA 51 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~------------------------------~~v~VGi~~~~~~~~~~~G-----~~~A 51 (159) |++ .+|++|.+.|+.+.+ .-+.-|.+.++=...-++| .+.+ T Consensus 1 i~~----~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~ 76 (115) T protein:vir:93 1 MNI----DGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHA 76 (115) T ss_pred Ccc----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCc Confidence 222 222333332222111 0111222222100000122 2346 Q ss_pred HHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 52 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 92 (159) Q Consensus 52 ~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 92 (159) .+|.+.||||...|+||||+|+++.+. 
..+.+.++++++ T Consensus 77 ~Ya~~vE~GT~km~a~Pfl~PA~~~~~--~~~~~~i~~~~k 115 (115) T protein:vir:93 77 AYSGFLEFGTRYMEAEPFMWPVYEVIR--KSTVEELKALFE 115 (115) T ss_pred cchhhhcccccccCCCCchhhhHHHHH--HHHHHHHHHHhC Confidence 799999999999999999999998774 345555555555 No 42 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=97.77 E-value=3.6e-08 Score=61.39 Aligned_cols=80 Identities=21% Similarity=0.336 Sum_probs=40.8 Q ss_pred ceeeehHHHHHHHHHHHHHhhC------------------------------CEEEEEecccccCCCCCCC-----CCHH Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDG------------------------------TTVEVGFFPEDRYGSENGN-----LPVA 51 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~------------------------------~~v~VGi~~~~~~~~~~~G-----~~~A 51 (159) |++ .+|++|.+.|+.+.+ .-+.-|.+.++=...-++| .+.+ T Consensus 1 i~~----~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~ 76 (115) T protein:vir:97 1 MNI----DGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHA 76 (115) T ss_pred Ccc----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCc Confidence 222 222333332222111 0111222222100000122 2346 Q ss_pred HHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 52 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 92 (159) Q Consensus 52 ~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 92 (159) .+|.+.||||...|+||||+|+++.+. 
..+.+.++++++ T Consensus 77 ~Ya~~vE~GT~km~a~Pfl~PA~~~~~--~~~~~~i~~~~k 115 (115) T protein:vir:97 77 AYSGFLEFGTRYMEAEPFMWPVYEVIR--KSTVEELKALFE 115 (115) T ss_pred cchhhhcccccccCCCCchhhhHHHHH--HHHHHHHHHHhC Confidence 799999999999999999999998774 345555555555 No 43 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=97.77 E-value=3.6e-08 Score=61.39 Aligned_cols=80 Identities=21% Similarity=0.336 Sum_probs=40.8 Q ss_pred ceeeehHHHHHHHHHHHHHhhC------------------------------CEEEEEecccccCCCCCCC-----CCHH Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDG------------------------------TTVEVGFFPEDRYGSENGN-----LPVA 51 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~------------------------------~~v~VGi~~~~~~~~~~~G-----~~~A 51 (159) |++ .+|++|.+.|+.+.+ .-+.-|.+.++=...-++| .+.+ T Consensus 1 i~~----~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~ 76 (115) T protein:vir:96 1 MNI----DGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHA 76 (115) T ss_pred Ccc----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCc Confidence 222 222333332222111 0111222222100000122 2346 Q ss_pred HHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 52 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 92 (159) Q Consensus 52 ~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 92 (159) .+|.+.||||...|+||||+|+++.+. 
..+.+.++++++ T Consensus 77 ~Ya~~vE~GT~km~a~Pfl~PA~~~~~--~~~~~~i~~~~k 115 (115) T protein:vir:96 77 AYSGFLEFGTRYMEAEPFMWPVYEVIR--KSTVEELKALFE 115 (115) T ss_pred cchhhhcccccccCCCCchhhhHHHHH--HHHHHHHHHHhC Confidence 799999999999999999999998774 345555555555 No 44 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=97.77 E-value=3.6e-08 Score=61.39 Aligned_cols=80 Identities=21% Similarity=0.336 Sum_probs=40.8 Q ss_pred ceeeehHHHHHHHHHHHHHhhC------------------------------CEEEEEecccccCCCCCCC-----CCHH Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDG------------------------------TTVEVGFFPEDRYGSENGN-----LPVA 51 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~------------------------------~~v~VGi~~~~~~~~~~~G-----~~~A 51 (159) |++ .+|++|.+.|+.+.+ .-+.-|.+.++=...-++| .+.+ T Consensus 1 i~~----~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~ 76 (115) T protein:vir:10 1 MNI----DGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHA 76 (115) T ss_pred Ccc----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCc Confidence 222 222333332222111 0111222222100000122 2346 Q ss_pred HHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 52 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 92 (159) Q Consensus 52 ~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 92 (159) .+|.+.||||...|+||||+|+++.+. 
..+.+.++++++ T Consensus 77 ~Ya~~vE~GT~km~a~Pfl~PA~~~~~--~~~~~~i~~~~k 115 (115) T protein:vir:10 77 AYSGFLEFGTRYMEAEPFMWPVYEVIR--KSTVEELKALFE 115 (115) T ss_pred cchhhhcccccccCCCCchhhhHHHHH--HHHHHHHHHHhC Confidence 799999999999999999999998774 345555555555 No 45 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=97.77 E-value=3.6e-08 Score=61.39 Aligned_cols=80 Identities=21% Similarity=0.336 Sum_probs=40.8 Q ss_pred ceeeehHHHHHHHHHHHHHhhC------------------------------CEEEEEecccccCCCCCCC-----CCHH Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDG------------------------------TTVEVGFFPEDRYGSENGN-----LPVA 51 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~------------------------------~~v~VGi~~~~~~~~~~~G-----~~~A 51 (159) |++ .+|++|.+.|+.+.+ .-+.-|.+.++=...-++| .+.+ T Consensus 1 i~~----~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~ 76 (115) T protein:vir:78 1 MNI----DGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHA 76 (115) T ss_pred Ccc----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCc Confidence 222 222333332222111 0111222222100000122 2346 Q ss_pred HHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 52 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 92 (159) Q Consensus 52 ~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 92 (159) .+|.+.||||...|+||||+|+++.+. 
..+.+.++++++ T Consensus 77 ~Ya~~vE~GT~km~a~Pfl~PA~~~~~--~~~~~~i~~~~k 115 (115) T protein:vir:78 77 AYSGFLEFGTRYMEAEPFMWPVYEVIR--KSTVEELKALFE 115 (115) T ss_pred cchhhhcccccccCCCCchhhhHHHHH--HHHHHHHHHHhC Confidence 799999999999999999999998774 345555555555 No 46 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=97.76 E-value=1.5e-07 Score=58.02 Aligned_cols=90 Identities=21% Similarity=0.289 Sum_probs=49.1 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHhhC------------------------------------------------CEEE Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEALDG------------------------------------------------TTVE 32 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l~~------------------------------------------------~~v~ 32 (159) |+ .|.++ +|++|++.|+.|.. ..+. T Consensus 1 Ma---~~~i~----Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~ 73 (140) T protein:vir:80 1 MS---SIQIV----GLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLAT 73 (140) T ss_pred Cc---eeeeh----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceee Confidence 33 34444 23333333322210 0112 Q ss_pred EEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHH--HHHHHHHHHHHHHHHhCCC Q lcl|NC_015294. 33 VGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQ--FHYARLMKSTFENVLRDGR 99 (159) Q Consensus 33 VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~--~~~~~~~~~~~~~~~~g~~ 99 (159) ||+-.+... ..++.+.+.++.+.||||.++||||||+|+++++..+ ..+.+.+++.+.+++.|.. 
T Consensus 74 ~~~~~~~~~--~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:80 74 AGVRVRTKG--KADSPSNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDQALGGRR 140 (140) T ss_pred eeeeccccc--ccCCCCCcceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 222211110 0123345778999999999999999999999987532 2234555566677777776 No 47 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=97.76 E-value=2.9e-08 Score=61.93 Aligned_cols=80 Identities=20% Similarity=0.308 Sum_probs=39.5 Q ss_pred ceeeehHHHHHHHHHHHHHhhC--------------------------C----EEEEEecccccCCCCCCC-----CCHH Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDG--------------------------T----TVEVGFFPEDRYGSENGN-----LPVA 51 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~--------------------------~----~v~VGi~~~~~~~~~~~G-----~~~A 51 (159) |+++ +|++|.+.|+.+.+ . -|.-|.+.++-....++| .+.+ T Consensus 1 i~i~----Gld~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~~g~~~~~v~~~~ 76 (115) T protein:vir:10 1 MQSK----GLKKLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKKIGDLHYRVISTA 76 (115) T ss_pred Ceeh----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEeeCCC Confidence 2222 22222222222111 0 011122222100011111 2346 Q ss_pred HHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 52 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 92 (159) Q Consensus 52 ~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 92 (159) .+|.+.||||...|+||||+|+++.+.. .+.+.+++++. 
T Consensus 77 ~Ya~~vEfGT~km~a~PFl~PA~~~~k~--~~~~~i~~~i~ 115 (115) T protein:vir:10 77 HYSGFLEFGTRYMEPAPFMFPTYQTLKK--STINDLKRLLS 115 (115) T ss_pred ccchheecccccCCCCCchhhhHHHHHH--HHHHHHHHHhC Confidence 7999999999999999999999987643 34444444443 No 48 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=97.64 E-value=4.4e-08 Score=60.91 Aligned_cols=92 Identities=17% Similarity=0.293 Sum_probs=43.0 Q ss_pred ccCCceeeehHHHHHHHHHHHHHhhCC------------------------------EEEEEecccccCCCCCCCCC--- Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLIKRVEALDGT------------------------------TVEVGFFPEDRYGSENGNLP--- 49 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~~l~~l~~~------------------------------~v~VGi~~~~~~~~~~~G~~--- 49 (159) |+..|. ..|.++|...++.|.+..++ .+.+-...+. ..+|.. T Consensus 1 m~~~~~-~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~----s~~g~~~~~ 75 (157) T protein:vir:97 1 MKFSIR-SVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEE----SVEGIQTYA 75 (157) T ss_pred CeeEee-cccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeecccc----CCCceEEEE Confidence 222220 22333444444443321111 1111111110 001111 Q ss_pred ------HHHHHHHHhcC------------------------CCCCCCCchhhHHHHHHHHHH--HHHHHHHHHHHHHHhC Q lcl|NC_015294. 50 ------VAQVAAYNEFG------------------------TTRNPTRPFMAPTFEEFTSQF--HYARLMKSTFENVLRD 97 (159) Q Consensus 50 ------~A~iA~~~E~G------------------------~~~IP~RpFlr~~~~~~~~~~--~~~~~~~~~~~~~~~g 97 (159) .+-++.+.||| +..+||||||||+++....+. 
.+.+.+.+.|.+++.| T Consensus 76 Vg~~~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I~e~l~g 155 (157) T protein:vir:97 76 VSWRKKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKYAELQRG 155 (157) T ss_pred EeecCCccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHHHHHHHHHHHHHHHHHHHhcC Confidence 12345566888 235999999999999875431 1123355677888888 Q ss_pred CC Q lcl|NC_015294. 98 GR 99 (159) Q Consensus 98 ~~ 99 (159) +. T Consensus 156 ~~ 157 (157) T protein:vir:97 156 DT 157 (157) T ss_pred CC Confidence 75 No 49 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=97.62 E-value=5.7e-08 Score=60.29 Aligned_cols=84 Identities=23% Similarity=0.464 Sum_probs=44.0 Q ss_pred ccCCceeeehHHHHHHHHHHHHHhhCC----------------------EEEEEecccccCC-CCCCC-----CCHHHHH Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLIKRVEALDGT----------------------TVEVGFFPEDRYG-SENGN-----LPVAQVA 54 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~~l~~l~~~----------------------~v~VGi~~~~~~~-~~~~G-----~~~A~iA 54 (159) |...+.++ +|++|++.|+.+... -|.-|-+.++-.. ..++| .+.+.+| T Consensus 1 M~~~i~i~----Gld~l~~~L~~~~~~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya 76 (112) T protein:vir:36 1 MKSSLSFK----GIDQLVKHLDKAASLKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMELTEGGFSGQAGPHTDYS 76 (112) T ss_pred Cceeeeeh----hHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeeecCCceEEEeecCCCcc Confidence 33344444 344444444332111 0111211111000 01122 2347789 Q ss_pred HHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 55 AYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 92 (159) Q Consensus 55 ~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 92 (159) .+.||||...|+||||+|+++.+.. 
.+.+.+++.++ T Consensus 77 ~~vE~GT~k~~a~Pfl~pa~~~~~~--~~~~~i~~~lr 112 (112) T protein:vir:36 77 AYVEYGTRFQSAQPFVKPAYNEQKG--VFIKDLERLLK 112 (112) T ss_pred ceeeccccccCCCcchhhhHHHHHH--HHHHHHHHHcC Confidence 9999999999999999999987743 34444444444 No 50 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=97.55 E-value=9.9e-08 Score=58.98 Aligned_cols=89 Identities=20% Similarity=0.296 Sum_probs=42.2 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHhhCC----------------------------------EEEEEec--ccccCCCC Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEALDGT----------------------------------TVEVGFF--PEDRYGSE 44 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l~~~----------------------------------~v~VGi~--~~~~~~~~ 44 (159) |+-.. |+ +|++|.+.|+.|... .+...|. ........ T Consensus 1 M~~~~---i~----Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~ 73 (133) T protein:vir:10 1 MIRME---VK----GLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQG 73 (133) T ss_pred CeeEe---ee----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCc Confidence 33222 22 222222222222110 0111110 00000000 Q ss_pred C------CCCCHH--HHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_015294. 45 N------GNLPVA--QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDG 98 (159) Q Consensus 45 ~------~G~~~A--~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~ 98 (159) . -|.+-. .++.+.||||.+.||||||+|+++++.. .+.+.+.+.+.+.+.-. 
T Consensus 74 ~~~~~v~vg~~~~~~~y~~f~E~GT~k~~a~PF~~pA~~~~~~--~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 74 NAVVTLRVGPSKQHHMKVLAQEFGTVKQVADPFIRPALDYNVQ--TVLRVLTVEIRNGIQNR 133 (133) T ss_pred cceEEEEecCCCCccceEeeeccCCCCCCCCccchHHHHHhHH--HHHHHHHHHHHHHhhcC Confidence 0 011112 2344559999999999999999998753 46677777777777665 No 51 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=97.53 E-value=1.6e-07 Score=57.78 Aligned_cols=84 Identities=18% Similarity=0.300 Sum_probs=42.2 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHhhCC----------------------EEEEEecccccC-----CCCCCC---CCH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEALDGT----------------------TVEVGFFPEDRY-----GSENGN---LPV 50 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l~~~----------------------~v~VGi~~~~~~-----~~~~~G---~~~ 50 (159) |+. |++ .+|++|.+.|+.+.+. ...+++..+.-. ...++| .+. T Consensus 1 Ma~---i~~----~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~ 73 (114) T protein:vir:27 1 MAT---IEF----EGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDKATVEAL 73 (114) T ss_pred Cee---eee----ehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCeeEecCC Confidence 332 222 2344444444332110 001111111100 001122 234 Q ss_pred HHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 51 AQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 93 (159) Q Consensus 51 A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 93 (159) +.+|.++||||...||||||||+++.+.. 
.+.+.+++.++- T Consensus 74 ~~Ya~~vEfGT~km~a~Pfl~PA~~~~~~--~~~~~l~~l~k~ 114 (114) T protein:vir:27 74 TSYSGYLEVGTRKMEAQPFMKPALDEVAP--KMVEELAKWDET 114 (114) T ss_pred CCccceecccccccCCCCchhhhHHHHHH--HHHHHHHHHhcC Confidence 67999999999999999999999997753 344444444432 No 52 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=97.53 E-value=1.6e-07 Score=57.78 Aligned_cols=84 Identities=18% Similarity=0.300 Sum_probs=42.2 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHhhCC----------------------EEEEEecccccC-----CCCCCC---CCH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEALDGT----------------------TVEVGFFPEDRY-----GSENGN---LPV 50 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l~~~----------------------~v~VGi~~~~~~-----~~~~~G---~~~ 50 (159) |+. |++ .+|++|.+.|+.+.+. ...+++..+.-. ...++| .+. T Consensus 1 Ma~---i~~----~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~ 73 (114) T protein:vir:49 1 MAT---IEF----EGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDKATVEAL 73 (114) T ss_pred Cee---eee----ehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCeeEecCC Confidence 332 222 2344444444332110 001111111100 001122 234 Q ss_pred HHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 51 AQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 93 (159) Q Consensus 51 A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 93 (159) +.+|.++||||...||||||||+++.+.. 
.+.+.+++.++- T Consensus 74 ~~Ya~~vEfGT~km~a~Pfl~PA~~~~~~--~~~~~l~~l~k~ 114 (114) T protein:vir:49 74 TSYSGYLEVGTRKMEAQPFMKPALDEVAP--KMVEELAKWDET 114 (114) T ss_pred CCccceecccccccCCCCchhhhHHHHHH--HHHHHHHHHhcC Confidence 67999999999999999999999997753 344444444432 No 53 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=97.50 E-value=1.4e-07 Score=58.12 Aligned_cols=80 Identities=20% Similarity=0.314 Sum_probs=40.1 Q ss_pred ceeeehHHHHHHHHHHHHHhhC--------------------------C----EEEEEecccccCCCCCCC-----CCHH Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDG--------------------------T----TVEVGFFPEDRYGSENGN-----LPVA 51 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~--------------------------~----~v~VGi~~~~~~~~~~~G-----~~~A 51 (159) |+++ +|++|.+.|+.+.+ . -+.-|.+..+=...-++| .+.+ T Consensus 1 i~i~----Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~~g~~~~~V~~~~ 76 (115) T protein:vir:99 1 MNID----GLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTVDLQYTITSHA 76 (115) T ss_pred Ccch----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEecCCc Confidence 2222 22222222222110 0 011122211100000122 2347 Q ss_pred HHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 52 QVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 92 (159) Q Consensus 52 ~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 92 (159) .+|.+.||||...|+||||+|+++.+.. 
.+.+.++++++ T Consensus 77 ~Ya~~vE~GT~~m~a~PFl~PA~~~~k~--~~~~~l~~~~k 115 (115) T protein:vir:99 77 AYSGFLEFGTRYMEAEPFMWPVYEVIRK--STVEELKTLFE 115 (115) T ss_pred cccccccccccccCCCCcchhhHHHHHH--HHHHHHHHHhC Confidence 7999999999999999999999997743 34555555444 No 54 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=97.49 E-value=2e-07 Score=57.34 Aligned_cols=81 Identities=16% Similarity=0.109 Sum_probs=45.8 Q ss_pred CcccC-CceeeehHHHHHHHHHHHH-------HhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhH Q lcl|NC_015294. 1 MIILA-SFSFKTDRRRLTSLIKRVE-------ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAP 72 (159) Q Consensus 1 m~m~~-~~~~k~~~~~l~~l~~~l~-------~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~ 72 (159) .+... -+.-.....+ +.+.+. ......+.|||..+ .+.++.+.||||.+.||+|||++ T Consensus 37 ~~k~~ap~~~~~~~~h---l~d~I~~~~~k~~~~g~~~~~VG~~k~-----------~~~y~~f~E~GT~k~~~~pF~~p 102 (125) T protein:vir:97 37 ALKANTPVYEVETDER---LQEDTVISGFKGANVGIVSKEIGYGKA-----------TGWRAHYPNDGTIYQRGQDFKER 102 (125) T ss_pred HHHHhCCcCCCCchhh---HHhhhhcccccccccCceEEEEeecCC-----------CceeEeeeccCccCCCcCccchH Confidence 11000 0000011111 222221 12333678888432 25689999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_015294. 73 TFEEFTSQFHYARLMKSTFENVLRD 97 (159) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~g 97 (159) ++++.. 
.++.+.+.+.+.+.+.= T Consensus 103 a~~~~k--~~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 103 TINQMT--PKAKQLYAEKVKEGLGL 125 (125) T ss_pred hHHHhH--HHHHHHHHHHHHHHhcC Confidence 999874 34666666666665432 No 55 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=97.47 E-value=2e-07 Score=57.29 Aligned_cols=84 Identities=15% Similarity=0.246 Sum_probs=40.5 Q ss_pred cCCceeeehH---------------HHHHHH----HHHHHHhhCCEEEEEeccccc-CCCCCC----CCCHHHHHHHHhc Q lcl|NC_015294. 4 LASFSFKTDR---------------RRLTSL----IKRVEALDGTTVEVGFFPEDR-YGSENG----NLPVAQVAAYNEF 59 (159) Q Consensus 4 ~~~~~~k~~~---------------~~l~~l----~~~l~~l~~~~v~VGi~~~~~-~~~~~~----G~~~A~iA~~~E~ 59 (159) +.|+ ..-. +.+.+. .+..+.+.. |.-|.+.++= ....++ =.+.+.+|.+.|| T Consensus 1 i~Gl--d~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aP--v~TG~Lr~sI~~~~~~~~~~~v~~~~~Ya~~vE~ 76 (108) T protein:vir:99 1 MRGL--DRFLRSVERKQKSVRIAVDKELSKSAARIERQAKILAP--VDTGWLRAQIYSEQQRLLHYRVVSPALYSIYLEL 76 (108) T ss_pred CchH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--cCchhhhcceeeeecCcEEEEeecCcccchhccc Confidence 1111 0000 111111 111111111 1122221110 000001 1234789999999 Q ss_pred CCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 60 GTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 93 (159) Q Consensus 60 G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 93 (159) ||...|+||||+|+++.++. 
.+.+.+++.+++ T Consensus 77 GT~~m~a~Pf~~pa~~~~~~--~~~~~i~~~lrk 108 (108) T protein:vir:99 77 GTRKMEAQSFLDPALRKEWP--VLMANIKKMFKR 108 (108) T ss_pred CccccCCCcchhhhHHHHHH--HHHHHHHHHhcC Confidence 99999999999999998753 456666666665 No 56 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=97.45 E-value=3.3e-07 Score=56.09 Aligned_cols=87 Identities=25% Similarity=0.345 Sum_probs=46.7 Q ss_pred cCCceeeehHHHHHHHHHHHHHhhCC------------------------EEEEEeccccc-CCCCCCC-----CCHHHH Q lcl|NC_015294. 4 LASFSFKTDRRRLTSLIKRVEALDGT------------------------TVEVGFFPEDR-YGSENGN-----LPVAQV 53 (159) Q Consensus 4 ~~~~~~k~~~~~l~~l~~~l~~l~~~------------------------~v~VGi~~~~~-~~~~~~G-----~~~A~i 53 (159) ++.++++.+.+.++++.+.|+.+.+. -|.-|-+...= .....+| .+.+.+ T Consensus 1 m~~ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~~~g~~~~V~~~~~Y 80 (144) T protein:vir:59 1 MALMSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYKNNGLTAEITVGAEY 80 (144) T ss_pred CCcceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEeecCcEEEEEecCCCc Confidence 44446666666555554444332111 01112111110 0001122 335789 Q ss_pred HHHHhcCC---------------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 54 AAYNEFGT---------------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 92 (159) Q Consensus 54 A~~~E~G~---------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 92 (159) |.+.|||| .++||||||+++++.++ ..+.+.+++++. 
T Consensus 81 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~--~~~~~~i~~~~g 144 (144) T protein:vir:59 81 AIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGG--EYFEREMRRLRG 144 (144) T ss_pred cchhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHHH--HHHHHHHHHhcC Confidence 99999997 35899999999998864 345555555444 No 57 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=97.41 E-value=1.4e-07 Score=58.14 Aligned_cols=82 Identities=18% Similarity=0.330 Sum_probs=40.5 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHhhCC----------------------EEEEEecccccCC---CCCCCC-----CH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEALDGT----------------------TVEVGFFPEDRYG---SENGNL-----PV 50 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l~~~----------------------~v~VGi~~~~~~~---~~~~G~-----~~ 50 (159) |+ .|++ .+|++|.+.|+.+.+. .-..++..+...+ -..+|. +. T Consensus 1 Ma---~i~i----~Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~~~~~~~~v~~~ 73 (112) T protein:vir:96 1 MA---TIEF----EGLDEMAQSLLKNASSERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLEAGSDRAVVEAL 73 (112) T ss_pred Cc---eeee----hHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeeecCceEEEecCC Confidence 32 2333 2344444444332110 0111121111100 011222 33 Q ss_pred HHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 51 AQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVL 95 (159) Q Consensus 51 A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~ 95 (159) +.+|.+.||||...|+||||+|+++.... .+.+.+ +++. 
T Consensus 74 ~~Ya~~vE~GTr~m~AqPF~~PA~~~~~~--~~~~~l----~~L~ 112 (112) T protein:vir:96 74 TNYSGYLEVGTRKMEAQPFMRPALDQVVP--EMVEEM----AKWE 112 (112) T ss_pred CCccceeccCccccCCCCchhhhHHHHHH--HHHHHH----HhcC Confidence 67999999999999999999999987643 233333 3332 No 58 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=97.28 E-value=3.7e-07 Score=55.83 Aligned_cols=83 Identities=14% Similarity=0.110 Sum_probs=44.2 Q ss_pred CcccCCceeeehHHHHHHHHHHHH------HhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVE------ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPTF 74 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~------~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~~ 74 (159) -+.............-..+.+.+. .-....+.|||..+ .+.++.+.||||.+.||+|||++++ T Consensus 40 ~~k~~ap~~~~~~~~~~h~~d~I~~~~~k~~~g~~~~~VG~~k~-----------~~~y~~f~E~GT~k~~a~pF~~pa~ 108 (128) T protein:vir:38 40 KLKSNTPEWDGETDMSGHLRDDIKLSSVRETSGLTEVDVGYGKD-----------TGWRAHFPNSGTSMQDPQHFIEETQ 108 (128) T ss_pred HHHHhCCCcCCCCcccchhhhhhccccccccCceeEEEeeecCC-----------CceEEeeeccCccCCCCCcchhHHH Confidence 000000000000000011222211 11234688998432 1457899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015294. 75 EEFTSQFHYARLMKSTFENVLR 96 (159) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~ 96 (159) ++... 
++.+.+.+.+.+.+- T Consensus 109 ~~~~~--~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 109 EIMRP--VVIAAFLSHLKEGGM 128 (128) T ss_pred HHhHH--HHHHHHHHHHHhhcC Confidence 98753 455666665555433 No 59 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=97.24 E-value=1.6e-06 Score=52.34 Aligned_cols=92 Identities=16% Similarity=0.296 Sum_probs=38.8 Q ss_pred ccCCceeeehHHHHHHHHHHHHHhhCC-----------------------EEEE------Eeccccc---CCCCCC---- Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLIKRVEALDGT-----------------------TVEV------GFFPEDR---YGSENG---- 46 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~~l~~l~~~-----------------------~v~V------Gi~~~~~---~~~~~~---- 46 (159) |+..++|+ +|++|.+.|+.|... .+-| |-..+.- .....+ T Consensus 1 M~~~~~i~----Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~~~~~~~ 76 (135) T protein:vir:57 1 MIPEIEIS----GLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRGKAGSTV 76 (135) T ss_pred Cceeeeeh----hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhccccccccccccee Confidence 44444443 333333333332211 0000 1000000 000001 Q ss_pred -----CCCHHH--HHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHH Q lcl|NC_015294. 47 -----NLPVAQ--VAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGK 110 (159) Q Consensus 47 -----G~~~A~--iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~ 110 (159) |.+-.. ++.+.||||.+.||||||+|+++++.. .+.+.+.+.+. ..|++++. 
T Consensus 77 v~v~vg~~~~~~~~~~f~E~GT~~~~a~PF~~pa~~~~~~--~~~~~~~~~~~----------~~l~ka~r 135 (135) T protein:vir:57 77 VVLRVGPTRSHYMKALAQEFGTIKQVAKPFIRPALDYNKM--QVLRILTVEIR----------DGLSTLSR 135 (135) T ss_pred EEEEecCCCCcceeEeecccCCCCCCCCcchhHhHHHhHH--HHHHHHHHHHH----------HHHHHhcC Confidence 111122 233349999999999999999988753 23333333332 22333333 No 60 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=97.15 E-value=3.4e-06 Score=50.58 Aligned_cols=95 Identities=14% Similarity=0.157 Sum_probs=47.7 Q ss_pred ccCCceeeehHHHHHHHHHHHHHh-hCCE---EEEEe---------cccc------------------------------ Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLIKRVEAL-DGTT---VEVGF---------FPED------------------------------ 39 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~~l~~l-~~~~---v~VGi---------~~~~------------------------------ 39 (159) |...++|+.+...+.+.+++|... .+.. -.||- |+.. T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~~ 80 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccch Confidence 555578887766554444443221 1110 00010 0000 Q ss_pred -----------------------cCCCCCC----CCCHHHHHHHHhcCCC-------CCCCCchhhHHHHHH---HHHHH Q lcl|NC_015294. 40 -----------------------RYGSENG----NLPVAQVAAYNEFGTT-------RNPTRPFMAPTFEEF---TSQFH 82 (159) Q Consensus 40 -----------------------~~~~~~~----G~~~A~iA~~~E~G~~-------~IP~RpFlr~~~~~~---~~~~~ 82 (159) +|...++ |++ ..||++|+||.. +||+||||--.-+.. ...+. 
T Consensus 81 ~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGtn-~~YAaiHqfGg~~~~~~~v~IPARPfLG~s~~de~~~~~~~~ 159 (175) T protein:vir:79 81 AAASRRKAGLMILQDSGQMAASTATDSGEDYSVIGSN-KEYAAIQHFGGQAGRGLKVTIPGRAWLPVTADGELQPEAVEP 159 (175) T ss_pred hhHhhhccCCCcceechhhhhhhhheecCCEEEEecC-cchhhHhhcccccCCCcccccCcccccCCCcccchhHHHHHH Confidence 0000111 332 468999999963 899999995322211 11234 Q ss_pred HHHHHHHHHHHHHhCC Q lcl|NC_015294. 83 YARLMKSTFENVLRDG 98 (159) Q Consensus 83 ~~~~~~~~~~~~~~g~ 98 (159) +.+.+...+..++.+. T Consensus 160 I~~~i~~~l~~a~~~~ 175 (175) T protein:vir:79 160 VLNTILRHLMDAANRR 175 (175) T ss_pred HHHHHHHHHHHHhccC Confidence 6666667777776665 No 61 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=97.11 E-value=3.5e-07 Score=55.94 Aligned_cols=80 Identities=19% Similarity=0.369 Sum_probs=38.6 Q ss_pred cccCCceeeehHHHHHHHHHHHHHhhCC----------------------EEEEEecccccC-CCCCCC-----CCHHHH Q lcl|NC_015294. 2 IILASFSFKTDRRRLTSLIKRVEALDGT----------------------TVEVGFFPEDRY-GSENGN-----LPVAQV 53 (159) Q Consensus 2 ~m~~~~~~k~~~~~l~~l~~~l~~l~~~----------------------~v~VGi~~~~~~-~~~~~G-----~~~A~i 53 (159) |.. .+|++|.+.|+.+... -|.-|-+..+-. ...++| .+.+.+ T Consensus 1 i~i---------~Gld~l~~~l~~~~~~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~~~~~~~~V~~~~~Y 71 (108) T protein:vir:98 1 MKI---------TGIDALQKKLRKNATLNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFTDGGLTGTTIPHTDY 71 (108) T ss_pred Ccc---------hhHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeeecCceEEEeecCCCc Confidence 111 2222222222211000 011111111100 001112 233678 Q ss_pred HHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 54 AAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 92 (159) Q Consensus 54 A~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 92 (159) |.+.||||...|+||||+|+++.... 
.+.+.+++.++ T Consensus 72 a~~vE~GT~~m~aqPFl~pa~~~~~~--~~~~~i~~~lr 108 (108) T protein:vir:98 72 AGYVEYGTRFQAAQPFVKPAFDVQKK--IFTNDLERLTK 108 (108) T ss_pred cceeeccccccCCCcchhhHHHHHHH--HHHHHHHHHcC Confidence 99999999999999999999987643 34444444444 No 62 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=97.09 E-value=4.2e-06 Score=50.04 Aligned_cols=81 Identities=10% Similarity=0.030 Sum_probs=59.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhc-----C--CCCCcHHHHHhcCC--CCCchhH Q lcl|NC_015294. 74 FEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDD-----Y--PGSNSPAWAAYKGF--NDPLFHT 144 (159) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~-----~--~~Pna~~Ti~~KG~--~~PLiDT 144 (159) +++.. .+.+.+..++.++ . ..+....|..||..+....+..|.+ | |+|+++.|+++|+. .+||+++ T Consensus 1 m~d~~---~l~~~L~~ll~~L-~-~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~ 75 (149) T protein:vir:98 1 MSELT---ALQERLTGLIASL-S-PAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGRIRREMFAR 75 (149) T ss_pred CchHH---HHHHHHHHHHHhc-C-chhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCCCCcccchh Confidence 33321 2333344444443 1 1245678999999999999999964 2 56899999998874 5899999 Q ss_pred HHHHhhhhhhhcccC Q lcl|NC_015294. 145 GKMLESVKFQIHRRQ 159 (159) Q Consensus 145 G~L~~SIty~V~~k~ 159 (159) |.|.+||++.+.... T Consensus 76 g~l~~sl~~~~~~~~ 90 (149) T protein:vir:98 76 LRTNRFMKAKGSDSA 90 (149) T ss_pred hhhhhhhhheecCCe Confidence 999999999998888 No 63 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=96.91 E-value=9.5e-07 Score=53.60 Aligned_cols=80 Identities=19% Similarity=0.347 Sum_probs=38.5 Q ss_pred cccCCceeeehHHHHHHHHHHHHHhhCC----------------------EEEEEecccccCCC-CCCC-----CCHHHH Q lcl|NC_015294. 
2 IILASFSFKTDRRRLTSLIKRVEALDGT----------------------TVEVGFFPEDRYGS-ENGN-----LPVAQV 53 (159) Q Consensus 2 ~m~~~~~~k~~~~~l~~l~~~l~~l~~~----------------------~v~VGi~~~~~~~~-~~~G-----~~~A~i 53 (159) |-. .+|++|.+.|+.+... -|.=|.+.++-... ..+| .+.+.+ T Consensus 1 i~i---------~Gld~l~~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~si~~~~~~~~~~~~V~~~~~Y 71 (108) T protein:vir:74 1 MKI---------TGIDALQKKLRKNATLDDVKHVVKSNTASMNKNMQNLAPVDTGNMKRSITSEFTDGGLSGTTGPHTDY 71 (108) T ss_pred Ccc---------hhHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhccceeeeecCceEEEeecCCCc Confidence 111 2222222222211000 01111111110000 1122 234678 Q ss_pred HHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 54 AAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFE 92 (159) Q Consensus 54 A~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 92 (159) |.+.||||...|+||||+|+++.+.. .+.+.+++.++ T Consensus 72 a~~vE~GT~km~aqpf~~pa~~~~~~--~~~~~i~~~~k 108 (108) T protein:vir:74 72 AGYVEYGTRFQSAQPFVKPAFNIQKK--VFTNDLERLTK 108 (108) T ss_pred ccceeccccccCCCcchhhHHHHHHH--HHHHHHHHHcC Confidence 99999999999999999999987743 34444444444 No 64 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=96.82 E-value=1.2e-05 Score=47.57 Aligned_cols=95 Identities=14% Similarity=0.203 Sum_probs=48.1 Q ss_pred ccCCceeeehHHHHHHHHHHHHHhh-CC-EE--EEEe---------cccccCC--------------------------- Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLIKRVEALD-GT-TV--EVGF---------FPEDRYG--------------------------- 42 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~~l~~l~-~~-~v--~VGi---------~~~~~~~--------------------------- 42 (159) |...++|+.+...+...+.+|.... +. 
.+ .||- |+....| T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~~~ 80 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhhhh Confidence 5555788888776666555553221 10 00 0000 0000000 Q ss_pred --------------------------CCCC----CCCHHHHHHHHhcCCC-------CCCCCchhhHHHHHH---HHHHH Q lcl|NC_015294. 43 --------------------------SENG----NLPVAQVAAYNEFGTT-------RNPTRPFMAPTFEEF---TSQFH 82 (159) Q Consensus 43 --------------------------~~~~----G~~~A~iA~~~E~G~~-------~IP~RpFlr~~~~~~---~~~~~ 82 (159) ..++ |++ ..||++|+||.. +||+||||--.-+.. ..... T Consensus 81 ~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGtn-~~YAaiHqfGg~~~~~~~v~iPaRpfLG~s~~d~~~~e~~~~ 159 (175) T protein:vir:10 81 AAASRRKAGLMILQDSGQMAASVSTDHDDNSAVIGSN-KEYAAIHQFGGQAGRGLKVTIPARPWLPVTADGELQPEAVEP 159 (175) T ss_pred hhhhhhccCCCcceechhhhhhhheeecCCEEEEecC-hhhhhhhhcccccCCCCccccCCccccCCCcccccchHHHHH Confidence 0011 333 357999999964 899999995332111 11234 Q ss_pred HHHHHHHHHHHHHhCC Q lcl|NC_015294. 83 YARLMKSTFENVLRDG 98 (159) Q Consensus 83 ~~~~~~~~~~~~~~g~ 98 (159) |.+.+...+..++.+. T Consensus 160 Il~~~~~~l~~~~~~~ 175 (175) T protein:vir:10 160 VLNTILRHLMDAANRR 175 (175) T ss_pred HHHHHHHHHHHHhccC Confidence 5566666666666655 No 65 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=96.73 E-value=5.8e-06 Score=49.31 Aligned_cols=86 Identities=16% Similarity=0.233 Sum_probs=44.4 Q ss_pred cCCceeeehHHHHHHHHHHHHH----h-----------------hCCEEEEEeccccc-CCCCCCC-------CCHHHHH Q lcl|NC_015294. 
4 LASFSFKTDRRRLTSLIKRVEA----L-----------------DGTTVEVGFFPEDR-YGSENGN-------LPVAQVA 54 (159) Q Consensus 4 ~~~~~~k~~~~~l~~l~~~l~~----l-----------------~~~~v~VGi~~~~~-~~~~~~G-------~~~A~iA 54 (159) |++++++.+.++|.+.++.+.. . ...-|.=|-+..+= ...+.+| .+.+.+| T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~g~~~~~~v~~~~~YA 80 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSGGRFSFSVTIGTNVTYA 80 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeccCCceEEEEEecCcccc Confidence 6677777666555443332211 0 00012222222110 0001111 2447899 Q ss_pred HHHhcCCC---------------------------CCCCCchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 55 AYNEFGTT---------------------------RNPTRPFMAPTFEEFTSQFHYARLMKSTF 91 (159) Q Consensus 55 ~~~E~G~~---------------------------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 91 (159) .++||||. ++||||||++++++.. ..+.+.++.+- T Consensus 81 ~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~--~~i~~~~~~~~ 142 (142) T protein:vir:94 81 ADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAAS--TFLRNHAKGIR 142 (142) T ss_pred hhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHH--HHHHHHHHhcC Confidence 99999962 4889999999998764 33444444333 No 66 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=96.49 E-value=1.7e-05 Score=46.75 Aligned_cols=81 Identities=12% Similarity=0.109 Sum_probs=57.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhc-----C--CCCCcHHHHHhcC--CCCCchhH Q lcl|NC_015294. 74 FEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDD-----Y--PGSNSPAWAAYKG--FNDPLFHT 144 (159) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~-----~--~~Pna~~Ti~~KG--~~~PLiDT 144 (159) +++.. .+...+..++.++- ..+....|..||..+....++.|.+ | |+|+++.|+++|. 
..++|.++ T Consensus 1 ~~~~~---~l~~~L~~ll~~l~--~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~~~~l~~~ 75 (150) T protein:vir:20 1 MNEFK---RFEDRLTGLIESLS--PSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRVKRKMFAK 75 (150) T ss_pred CchHH---HHHHHHHHHHHhcC--ChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCCCccccch Confidence 33321 23333334443321 1234678999999999999999963 3 5699999998664 36799999 Q ss_pred HHHHhhhhhhhcccC Q lcl|NC_015294. 145 GKMLESVKFQIHRRQ 159 (159) Q Consensus 145 G~L~~SIty~V~~k~ 159 (159) |.|..||+|.+.... T Consensus 76 ~~l~~sl~~~~~~~~ 90 (150) T protein:vir:20 76 LITSRFLHIRASPEQ 90 (150) T ss_pred hhhhhhhheeecCcE Confidence 999999999988777 No 67 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=96.45 E-value=6.7e-06 Score=48.95 Aligned_cols=79 Identities=13% Similarity=0.205 Sum_probs=40.3 Q ss_pred Cc-cc-------CCceeeehHHHHHHHHHHHH-HhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCC---------- Q lcl|NC_015294. 1 MI-IL-------ASFSFKTDRRRLTSLIKRVE-ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGT---------- 61 (159) Q Consensus 1 m~-m~-------~~~~~k~~~~~l~~l~~~l~-~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~---------- 61 (159) .. -. .+-.-+.-...+ .+...|. ..+...+.|||..+. ++.||++|.||- T Consensus 53 W~p~k~~~~~~k~g~~~~~l~~~~-~l~~sl~~~~~~~~~~vg~~~Gs----------~~~yAa~HQfG~~~~~~~~~~~ 121 (150) T protein:vir:20 53 YAPRQQQSVRKKTGRVKRKMFAKL-ITSRFLHIRASPEQASMEFYGGK----------SPKIASVHQFGLSEENRKDGKK 121 (150) T ss_pred CcccchHHHHHhccCCCccccchh-hhhhhhheeecCcEEEEEeeCCc----------chhhhhhhhcccccccccCCCc Confidence 11 00 000000000000 1222332 235678999986443 467999999992 Q ss_pred CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 62 TRNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 93 (159) Q Consensus 62 ~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 93 (159) .+||+||||- +.+. 
.+.++.+.+...+.+ T Consensus 122 ~~iPaRp~LG--~s~~-d~~~i~~~i~~~l~k 150 (150) T protein:vir:20 122 IDYPARPLLG--FTGE-DVQMIEEIILAHLER 150 (150) T ss_pred eeccccccCC--CCHH-HHHHHHHHHHHHHhC Confidence 3799999995 4332 233455555555554 No 68 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=96.43 E-value=1.5e-05 Score=47.00 Aligned_cols=77 Identities=18% Similarity=0.210 Sum_probs=42.1 Q ss_pred CcccCC---ceeeehHHHHHHHHHHHHH-hhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCC--------CCCCCc Q lcl|NC_015294. 1 MIILAS---FSFKTDRRRLTSLIKRVEA-LDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTT--------RNPTRP 68 (159) Q Consensus 1 m~m~~~---~~~k~~~~~l~~l~~~l~~-l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~--------~IP~Rp 68 (159) .=-..+ -.+=.+.- .|...|.. .+...|.||.. ..||++|+||.. +||+|| T Consensus 68 ~r~~~~~~~~~~L~~tg---~L~~Si~~~~~~~~v~vGt~--------------~~yA~vHqfG~~~~~~~~~~~iPaRp 130 (156) T protein:vir:19 68 WRQDHGFVPGSILTLHG---DLARSITTDYGQDYALIGSP--------------KIYAAIHQWGGTPDMAPRPAGVPARP 130 (156) T ss_pred HhhccCCCCCcchhhhH---HHHHHhhheecCCEEEEecc--------------hhhhHHhhcCcccccCCCccccCCcc Confidence 000000 00001111 23333321 24557777752 458999999953 799999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_015294. 69 FMAPTFEEFTSQFHYARLMKSTFENVLRD 97 (159) Q Consensus 69 Flr~~~~~~~~~~~~~~~~~~~~~~~~~g 97 (159) ||- +.+. 
.+..+.+.+...+..++.- T Consensus 131 fLG--~s~~-d~~~I~~~i~~~l~~~~~~ 156 (156) T protein:vir:19 131 YMG--LDKT-GEQEIFDAIRKRVSAALRQ 156 (156) T ss_pred ccC--CCHH-HHHHHHHHHHHHHHHHhhC Confidence 994 4433 2456777777777777655 No 69 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=96.39 E-value=8.3e-06 Score=48.44 Aligned_cols=96 Identities=23% Similarity=0.355 Sum_probs=38.9 Q ss_pred CcccCCceeeehH---HHHHHHHH--------HH----HHhh-------C--CEEEEEeccccc---CCCCCCC-----C Q lcl|NC_015294. 1 MIILASFSFKTDR---RRLTSLIK--------RV----EALD-------G--TTVEVGFFPEDR---YGSENGN-----L 48 (159) Q Consensus 1 m~m~~~~~~k~~~---~~l~~l~~--------~l----~~l~-------~--~~v~VGi~~~~~---~~~~~~G-----~ 48 (159) ||.. .|+.-+ ++|+++-+ .+ +... + .-|.-|-+..+= ....+++ . T Consensus 1 m~~v---~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~~~~~~g~V~ 77 (182) T protein:vir:10 1 MIEV---ELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVDGDEVIGRWW 77 (182) T ss_pred CeEE---EEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeecCCeEEEEee Confidence 3332 333222 22222110 11 0000 0 011122111110 0000010 1 Q ss_pred CHHHHHHHHhcCC------------------------------------------------------CCCCCCchhhHHH Q lcl|NC_015294. 49 PVAQVAAYNEFGT------------------------------------------------------TRNPTRPFMAPTF 74 (159) Q Consensus 49 ~~A~iA~~~E~G~------------------------------------------------------~~IP~RpFlr~~~ 74 (159) +.+.+|.+.|||| .+.||||||+|++ T Consensus 78 ~~~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~pA~ 157 (182) T protein:vir:10 78 NSSMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMTPAA 157 (182) T ss_pred cCCCccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchHHHH Confidence 2244666666664 4579999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCH Q lcl|NC_015294. 
75 EEFTSQFHYARLMKSTFENVLRDGRQT 101 (159) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~g~~~~ 101 (159) ++++. .+.+.+++.+...+.-.... T Consensus 158 ~~~~~--~i~~~i~~~i~~~l~~~~g~ 182 (182) T protein:vir:10 158 NKMAK--EAPEIIKRSIDQELHDKLGG 182 (182) T ss_pred HHhHH--HHHHHHHHHHHHHHHHhhcC Confidence 88754 46666665555443322111 No 70 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=96.39 E-value=1.7e-05 Score=46.76 Aligned_cols=86 Identities=15% Similarity=0.264 Sum_probs=38.1 Q ss_pred ceeeehHHHHHHHHHHHHHhhCC------------------------EEEEEeccccc-C--CCCCCC-----CCHHHHH Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDGT------------------------TVEVGFFPEDR-Y--GSENGN-----LPVAQVA 54 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~~------------------------~v~VGi~~~~~-~--~~~~~G-----~~~A~iA 54 (159) |++ .+|++|.+.|+.|.+. -|.=|-+.++= . ...+++ .+.+.+| T Consensus 1 i~i----~Gld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~~~~v~~~~~Ya 76 (173) T protein:vir:10 1 MAV----KGVAEVIAELRKIGKDIDKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKAKDLISKKITVNELYG 76 (173) T ss_pred Ccc----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeeccCceeEEeeCCCcccc Confidence 221 1222222222222110 01111111000 0 000111 2456788 Q ss_pred HHHhcCCC-------------------------------------------------------CCCCCchhhHHHHHHHH Q lcl|NC_015294. 55 AYNEFGTT-------------------------------------------------------RNPTRPFMAPTFEEFTS 79 (159) Q Consensus 55 ~~~E~G~~-------------------------------------------------------~IP~RpFlr~~~~~~~~ 79 (159) .+.||||. +.||||||+|+++++.. 
T Consensus 77 ~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqPFl~PA~~~~~~ 156 (173) T protein:vir:10 77 AYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQPFLYPAWIEGKK 156 (173) T ss_pred hhhhcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEeecCCCCCCccchhHHHHhHH Confidence 99999962 48999999999988753 Q ss_pred HHHHHHHHHHHHHHHHhCCCCHHHHHHHH Q lcl|NC_015294. 80 QFHYARLMKSTFENVLRDGRQTNTLLKKL 108 (159) Q Consensus 80 ~~~~~~~~~~~~~~~~~g~~~~~~~l~~i 108 (159) .+.+.+++.+...+.- + T Consensus 157 --~~~~~i~~~i~~~lrk----------~ 173 (173) T protein:vir:10 157 --QYLKDLENLLKTYNKK----------I 173 (173) T ss_pred --HHHHHHHHHHHHHhhc----------C Confidence 3445555544443321 1 No 71 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=96.30 E-value=1e-05 Score=47.92 Aligned_cols=80 Identities=15% Similarity=0.214 Sum_probs=39.2 Q ss_pred CcccC------------CceeeehHHHHHHHHHHHHH-hhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCC------ Q lcl|NC_015294. 1 MIILA------------SFSFKTDRRRLTSLIKRVEA-LDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGT------ 61 (159) Q Consensus 1 m~m~~------------~~~~k~~~~~l~~l~~~l~~-l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~------ 61 (159) -.-.. +-++-.+.- .|...|.. .....|.||-. ..||++|+||. T Consensus 55 ~plsp~t~~~r~k~g~~~~~~L~~tG---~L~~Si~~~~~~~~v~vGtn--------------~~YA~iHqfGg~~~~~~ 117 (155) T protein:vir:10 55 PQLSPVTVAARAAKGRGAHPILQVTN---ALARSITTRADRDQAQIGSN--------------LSYAAIQQLGGQAGRGR 117 (155) T ss_pred CCCCccchHHHHhccCCCCCccccch---hhhhhhhceecCCEEEEecC--------------cchhhhhhcccccCCCC Confidence 11000 000011111 22333321 24556777642 34899999995 Q ss_pred -CCCCCCchhhHHHHHH-HHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_015294. 62 -TRNPTRPFMAPTFEEF-TSQFHYARLMKSTFENVLRDGR 99 (159) Q Consensus 62 -~~IP~RpFlr~~~~~~-~~~~~~~~~~~~~~~~~~~g~~ 99 (159) .+||+||||- +++. +-+.++.+.+...+...+..+. 
T Consensus 118 ~~~iPARPfLG--~s~~~e~~~ei~~~I~~~i~~~l~~~r 155 (155) T protein:vir:10 118 KVTIPARPYLP--VLRNGQLKPSARDAVLDVLLAALSQGR 155 (155) T ss_pred ccccCCccccC--CCccccchHHHHHHHHHHHHHHHhhcC Confidence 3799999995 3211 1122355556666666665454 No 72 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=96.30 E-value=1.1e-05 Score=47.83 Aligned_cols=80 Identities=14% Similarity=0.237 Sum_probs=38.3 Q ss_pred CcccCCcee--e---ehHHHHH--HHHHHH-HHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCC----------C Q lcl|NC_015294. 1 MIILASFSF--K---TDRRRLT--SLIKRV-EALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGT----------T 62 (159) Q Consensus 1 m~m~~~~~~--k---~~~~~l~--~l~~~l-~~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~----------~ 62 (159) ..=.+--+. + .....+. .+...| -..+...+.|||..+. ++.||++|.||. . T Consensus 53 W~p~~~~~~~~k~~~~~~~l~~~~~l~~sl~~~~~~~~a~vg~~~Gt----------~~~yAaiHQfG~~~~~~~~~~~~ 122 (150) T protein:vir:60 53 YAPRQQQSARKKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGGK----------SPKIASVHQFGLSEENRKDGKKI 122 (150) T ss_pred CcccChHHHHHhhcCCCccchhhhhhcceeeeeeeCcEEEEEeeCCC----------chhhhhhhhccccccccCCCCce Confidence 111100000 0 0000000 011112 1234567888886442 467999999993 3 Q ss_pred CCCCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 63 RNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 93 (159) Q Consensus 63 ~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 93 (159) +||+||||- +.+. 
.+.++.+.+...+.+ T Consensus 123 ~iPaRp~LG--~s~~-d~~~i~~~i~~~l~r 150 (150) T protein:vir:60 123 DYPARPLLG--FTGE-DVQMIEEIILAHLDR 150 (150) T ss_pred ecCCcccCC--CCHH-HHHHHHHHHHHHHhC Confidence 799999996 4332 233455555555544 No 73 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=96.20 E-value=8.3e-06 Score=48.45 Aligned_cols=79 Identities=13% Similarity=0.153 Sum_probs=37.7 Q ss_pred CcccCC------ceeeehHHHHHHHHHH--HH-HhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcC----------- Q lcl|NC_015294. 1 MIILAS------FSFKTDRRRLTSLIKR--VE-ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFG----------- 60 (159) Q Consensus 1 m~m~~~------~~~k~~~~~l~~l~~~--l~-~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G----------- 60 (159) ..-.+. -.++ +...+.+|... |+ ..+...+.|||... +..||++|.|| T Consensus 54 W~p~k~~~~~~k~~~~-~~~m~~~L~~a~~l~~~a~~~~~~Vg~~Gt-----------~~~yAaiHQfG~~~r~~~~~~~ 121 (152) T protein:vir:10 54 YEPRKKPKKGVKSKIK-SGKMFDKITQPRFMRLRLESEGVSLGYEGG-----------DAVIARIHQQGLIGRVRKDWDL 121 (152) T ss_pred Cchhhhhhhhhccccc-chhHHHhhhhcceeeeeecCcEEEEEecCC-----------chhhhhhhccCccccccCCCCc Confidence 111100 0000 11112222211 11 13456788988632 36799999999 Q ss_pred CCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 61 TTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENV 94 (159) Q Consensus 61 ~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~ 94 (159) ..+||+||||--+-++ ..++.+.+...+... 
T Consensus 122 ~v~iPaRp~LG~s~~d---~~~I~~~i~~~l~~a 152 (152) T protein:vir:10 122 KVKYASRELLGFTDDD---LQMIEDYMINILAGS 152 (152) T ss_pred ceeccccccCCCCHHH---HHHHHHHHHHHHhcC Confidence 3479999999533222 223444444433332 No 74 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=96.09 E-value=1.6e-05 Score=46.82 Aligned_cols=80 Identities=14% Similarity=0.245 Sum_probs=38.1 Q ss_pred CcccCCcee--e---ehHHHHHH--HHHHH-HHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCC----------C Q lcl|NC_015294. 1 MIILASFSF--K---TDRRRLTS--LIKRV-EALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGT----------T 62 (159) Q Consensus 1 m~m~~~~~~--k---~~~~~l~~--l~~~l-~~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~----------~ 62 (159) ..=.+--+. + .....+.. +...| -..+...+.|||..+. +..||++|.||. + T Consensus 53 W~p~k~~~~~~k~~~~~~~l~~~~~l~~sl~~~~~~~~a~vg~~~G~----------~~~yAaiHQfG~~~r~~~~~~~~ 122 (150) T protein:vir:57 53 YAPRQQQSARKKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGGK----------SPKIASVHQFGLSEETRKDGKKI 122 (150) T ss_pred CcccChHHHHHhccCCCcccchhhhhccceeeeeeCcEEEEEeecCC----------chhhhhhhhccccccccCCCcee Confidence 110000000 0 00000000 11111 1234567888886432 467999999993 3 Q ss_pred CCCCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 63 RNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 93 (159) Q Consensus 63 ~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 93 (159) +||+||||- +.+. 
.+.++.+.+...+.+ T Consensus 123 ~iPaRp~LG--~s~~-d~~~i~~~i~~~l~r 150 (150) T protein:vir:57 123 DYPARPLLG--FTGE-DVQMIEEIILAHLDR 150 (150) T ss_pred ecCCcccCC--CCHH-HHHHHHHHHHHHHhC Confidence 799999995 4332 233455555555544 No 75 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=96.09 E-value=3.1e-05 Score=45.28 Aligned_cols=81 Identities=14% Similarity=0.097 Sum_probs=58.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhc-----C--CCCCcHHHHHhcCC--CCCchhH Q lcl|NC_015294. 74 FEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDD-----Y--PGSNSPAWAAYKGF--NDPLFHT 144 (159) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~-----~--~~Pna~~Ti~~KG~--~~PLiDT 144 (159) +++.. .+...+...+.++. ..+....+..||..+....++.|.+ | |+|+++.|+++|+. .++|+++ T Consensus 1 ~~~~~---~l~~~L~~~l~~L~--~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~ 75 (150) T protein:vir:60 1 MNEFK---RFEDRLTGLIESLS--PSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAK 75 (150) T ss_pred CchHH---HHHHHHHHHHHhcC--ChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCCCccchhh Confidence 33331 12333334444431 1234678999999999999999963 3 56999999998754 5899999 Q ss_pred HHHHhhhhhhhcccC Q lcl|NC_015294. 145 GKMLESVKFQIHRRQ 159 (159) Q Consensus 145 G~L~~SIty~V~~k~ 159 (159) |.|..||++.+.... T Consensus 76 ~~l~~sl~~~~~~~~ 90 (150) T protein:vir:60 76 LITSRFLHIRASPEQ 90 (150) T ss_pred hhhcceeeeeeeCcE Confidence 999999999998887 No 76 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=95.94 E-value=5.2e-05 Score=44.06 Aligned_cols=82 Identities=17% Similarity=0.193 Sum_probs=37.9 Q ss_pred Cc-------ccCCceeeehHHHHHHHHHHHHH-----hhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCC--CCCC Q lcl|NC_015294. 
1 MI-------ILASFSFKTDRRRLTSLIKRVEA-----LDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTT--RNPT 66 (159) Q Consensus 1 m~-------m~~~~~~k~~~~~l~~l~~~l~~-----l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~--~IP~ 66 (159) -. ...+=++-.+.- .|...+.. -++..+.||- +..+|++|+||+. +||+ T Consensus 50 ~pLs~st~a~k~~~~~L~~tG---~L~~Si~~~~~~~~~~~~a~vGt--------------n~~YA~~hqfG~~~~~IPa 112 (145) T protein:vir:31 50 EPLKESTIRAKGSDTPLIDNS---RLLTDINAASMMDRANRMAVIGT--------------NLDYAEHHEFGAPEAGIPA 112 (145) T ss_pred cccChHHHHHhcCCCCCccCH---HHHHHHHHHhhhcccCceeEecC--------------CchhhhhhccCCcccccCC Confidence 11 111101111111 22222221 1233344442 2459999999985 7999 Q ss_pred CchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHH Q lcl|NC_015294. 67 RPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTN 102 (159) Q Consensus 67 RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 102 (159) ||||-...... ++.+.+.+...+..-+.|- -.+ T Consensus 113 RPfLG~~~~~~--~~~~~~ii~~~i~~~L~~~-~~~ 145 (145) T protein:vir:31 113 RPIFGPAGAYA--SQQAPDVIGDEIDTNLEGA-VID 145 (145) T ss_pred CCccCCCccch--HHHHHHHHHHHHHHHhhhh-ccC Confidence 99997654322 2234455555555544442 111 No 77 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=95.90 E-value=4.6e-05 Score=44.34 Aligned_cols=81 Identities=14% Similarity=0.097 Sum_probs=58.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhc-----C--CCCCcHHHHHhcCC--CCCchhH Q lcl|NC_015294. 74 FEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDD-----Y--PGSNSPAWAAYKGF--NDPLFHT 144 (159) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~-----~--~~Pna~~Ti~~KG~--~~PLiDT 144 (159) +++.. .+...+..++.++. ..+...+|..||..+....++.|.+ | |+|+++.|+++|+. 
.++|+.+ T Consensus 1 m~~~~---~l~~~L~~~l~~L~--~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~~~~~l~~~ 75 (150) T protein:vir:57 1 MNEFK---RFEDRLTGLIESLS--PSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAK 75 (150) T ss_pred CchHH---HHHHHHHHHHHhcC--ChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccCCCcccchh Confidence 33321 12233333333321 1234679999999999999999963 3 56999999987753 5899999 Q ss_pred HHHHhhhhhhhcccC Q lcl|NC_015294. 145 GKMLESVKFQIHRRQ 159 (159) Q Consensus 145 G~L~~SIty~V~~k~ 159 (159) |.|..||+|.+.... T Consensus 76 ~~l~~sl~~~~~~~~ 90 (150) T protein:vir:57 76 LITSRFLHIRASPEQ 90 (150) T ss_pred hhhccceeeeeeCcE Confidence 999999999998887 No 78 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=95.79 E-value=2.7e-05 Score=45.62 Aligned_cols=79 Identities=13% Similarity=0.177 Sum_probs=36.9 Q ss_pred CcccCC----ceeeeh-----HH-HHHHHH--HHHH-HhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCC------ Q lcl|NC_015294. 1 MIILAS----FSFKTD-----RR-RLTSLI--KRVE-ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGT------ 61 (159) Q Consensus 1 m~m~~~----~~~k~~-----~~-~l~~l~--~~l~-~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~------ 61 (159) ..=.+- -..+.. .. .+..+. +.|+ ..+...+.|||.. +++.||++|.||. T Consensus 54 W~prk~~~~~~~~~~~~g~~~~~~m~~~l~~a~~l~~~~~~d~a~Vg~~G-----------s~~~yAaiHQfG~~~r~~~ 122 (155) T protein:vir:79 54 YEPRKVKAGGKRLREKAGRVKREAMFRKLRTARYLRIDVDSTGLAIGFDE-----------RLSRIARVHQEGQKAPVEP 122 (155) T ss_pred CcccchhhhhhhhhcccCcccchhhhhhhhhhheeeeeecCcEEEEEecC-----------cchhhhhhhhcCCcccCCC Confidence 111100 000000 00 011110 1111 1245567788732 2467999999993 Q ss_pred ----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 
62 ----TRNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 93 (159) Q Consensus 62 ----~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 93 (159) ++||+||||--+-++ +.++.+.+...+.+ T Consensus 123 ~~~~v~iPaRp~LGls~~d---~~~I~~~i~~~l~r 155 (155) T protein:vir:79 123 GGPLAQYPVRVVLGFSDAD---RELVRDRLLRELTR 155 (155) T ss_pred CCcccccccccccCCCHHH---HHHHHHHHHHHhhC Confidence 379999999533322 33455555555544 No 79 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=95.75 E-value=4.6e-05 Score=44.35 Aligned_cols=79 Identities=19% Similarity=0.221 Sum_probs=43.1 Q ss_pred Cccc---------CCceeeehHHHHHHHHHHHH-HhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCC--------- Q lcl|NC_015294. 1 MIIL---------ASFSFKTDRRRLTSLIKRVE-ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGT--------- 61 (159) Q Consensus 1 m~m~---------~~~~~k~~~~~l~~l~~~l~-~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~--------- 61 (159) -.-. .+-++=.+.- .|.+.+. ...+..|.||.. ..+|++|+||. T Consensus 58 ~p~~~~t~~rk~~~~~~~L~~tg---~L~~Si~~~~~~~~v~vGtn--------------~~yA~iHq~Gg~i~~~~~~~ 120 (190) T protein:vir:99 58 QPLSPAYLRRKRKNRDKILTLDG---HLRNLLRYQLDGSELLFGSD--------------RPYAAIHHFGGTIQRQARSS 120 (190) T ss_pred ccccHHHHHHhhcCCCccceecH---HHHHHHhheecCcEEEEecC--------------cchhhhhhcCCcccccccch Confidence 1100 0000111111 2333333 234557777742 44799999992 Q ss_pred -----------------------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_015294. 62 -----------------------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGR 99 (159) Q Consensus 62 -----------------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 99 (159) ++||+||||- +.+. .+.++.+.+...+..++.... 
T Consensus 121 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~v~IPaRpfLG--~s~~-d~~~I~~~i~~~l~~~~~~~~ 190 (190) T protein:vir:99 121 TVYFRQNERTGEVGREFVPRRRSNFAQDVQIGPYTIQMPARPWLG--TSSQ-DDDTILQRVERYLQRALRERA 190 (190) T ss_pred hhhhhhhhhhhhhhcccccccccccchhcccccceeeecCcccCC--CCHH-HHHHHHHHHHHHHHHHHhhcC Confidence 3689999995 3332 245677777777777776654 No 80 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=95.44 E-value=2.9e-05 Score=45.44 Aligned_cols=90 Identities=16% Similarity=0.335 Sum_probs=44.9 Q ss_pred cCCceeeehHHHH-----HHHHHHHHHh---------h-----CCEEEEEeccccc-CCCCCCC-----CCHHHHHHHHh Q lcl|NC_015294. 4 LASFSFKTDRRRL-----TSLIKRVEAL---------D-----GTTVEVGFFPEDR-YGSENGN-----LPVAQVAAYNE 58 (159) Q Consensus 4 ~~~~~~k~~~~~l-----~~l~~~l~~l---------~-----~~~v~VGi~~~~~-~~~~~~G-----~~~A~iA~~~E 58 (159) +.-+.|..+.+.+ +.+.+.+++. . ..-|.=|-+..+- +..-.+| .+.+.+|.+.| T Consensus 1 ~~~~~f~~~~~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~~g~~~~V~~~~~YA~yVE 80 (141) T protein:vir:78 1 MNEFEFDSNIPKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRKSSKEVIVGNSSDYAIYYE 80 (141) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeecCCcEEEEecCCCccceee Confidence 3333444333322 1122222221 0 0123333333221 1000112 24578999999 Q ss_pred cCC--------------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_015294. 59 FGT--------------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQ 100 (159) Q Consensus 59 ~G~--------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 100 (159) ||| .+.|+||||+++++++.. 
.+.+.+++.+..+ + T Consensus 81 ~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~--~i~~~i~~~~~~l-----~ 141 (141) T protein:vir:78 81 FGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQD--KVRVFTERALRGI-----N 141 (141) T ss_pred cCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhHH--HHHHHHHHHhhcc-----C Confidence 997 348999999999988743 4666666655544 3 No 81 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=95.42 E-value=6.6e-05 Score=43.51 Aligned_cols=79 Identities=13% Similarity=0.200 Sum_probs=38.6 Q ss_pred CcccCCceee-----ehHHHHH--HHHHHHH-HhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCC----------C Q lcl|NC_015294. 1 MIILASFSFK-----TDRRRLT--SLIKRVE-ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGT----------T 62 (159) Q Consensus 1 m~m~~~~~~k-----~~~~~l~--~l~~~l~-~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~----------~ 62 (159) ..=.+.-+.+ .....+. .+...|. ......+.|||... +..||++|+||. + T Consensus 53 W~p~~~~~~~~k~~~~~~~l~~~g~l~~sl~~~~~~~~~~V~~~Gs-----------~~~yAa~HQfG~~~r~~~~~~~~ 121 (149) T protein:vir:98 53 YAARKRQSVRSKKGRIRREMFARLRTNRFMKAKGSDSAAVVEFTGR-----------VQRMARVHQYGLKDRPNRHSRDV 121 (149) T ss_pred CcccchHHHHhccCCCCcccchhhhhhhhhhheecCCeeEEEecCc-----------chHHhhHhhccccccccCCCcce Confidence 1111110000 0000001 1122221 23556788888532 367999999994 3 Q ss_pred CCCCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 63 RNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 93 (159) Q Consensus 63 ~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 93 (159) +||+||||- +.+. 
.+.++.+.+...+.+ T Consensus 122 ~iPaRp~LG--~s~~-d~~~i~~~i~~~l~~ 149 (149) T protein:vir:98 122 QYAARPLLG--FTRD-DEQMIEDIIIRHLGK 149 (149) T ss_pred eccccccCC--CCHH-HHHHHHHHHHHHhhC Confidence 799999995 4322 233455555555444 No 82 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=95.35 E-value=2.1e-05 Score=46.19 Aligned_cols=85 Identities=20% Similarity=0.312 Sum_probs=39.7 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHh-----------------hCCEEEEEecccc-cCCCCCCC-----CCHHHHHHHH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEAL-----------------DGTTVEVGFFPED-RYGSENGN-----LPVAQVAAYN 57 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l-----------------~~~~v~VGi~~~~-~~~~~~~G-----~~~A~iA~~~ 57 (159) |.... ..+..-.+.|+++.+.++.- ...-|.-|-+.++ ....+++| .+.+.+|.+. T Consensus 1 Ma~~~-~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~v 79 (135) T protein:vir:96 1 MAKVK-YGADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFENGGFTGVVKIGSNYAVYV 79 (135) T ss_pred Cchhh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEeecCcEEEEEecCCCccchh Confidence 44321 12222223333322222110 0001222322222 01011222 3457799999 Q ss_pred hcCC---------------------------CCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 58 EFGT---------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 88 (159) Q Consensus 58 E~G~---------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 88 (159) |||| +++|+||||++++++.+. .+.+.+. 
T Consensus 80 e~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~--~~~~~i~ 135 (135) T protein:vir:96 80 NYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQ--TFEQYFS 135 (135) T ss_pred hcccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHHH--HHHHhcC Confidence 9997 458999999999987643 3444333 No 83 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=95.32 E-value=4.9e-05 Score=44.23 Aligned_cols=78 Identities=19% Similarity=0.262 Sum_probs=34.1 Q ss_pred CcccCC------ceeeehH-HHHHHHHHHH-HHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcC----------CC Q lcl|NC_015294. 1 MIILAS------FSFKTDR-RRLTSLIKRV-EALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFG----------TT 62 (159) Q Consensus 1 m~m~~~------~~~k~~~-~~l~~l~~~l-~~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G----------~~ 62 (159) ..=.+- -..+... ..| .+...| .......+.|||. |+ +..||++|.|| ++ T Consensus 53 W~p~s~~~~~~~g~~~~~~~~~l-~~~~~l~~~~~~~~~~v~~~----------Gt-~~~yAaiHQfG~~~r~~~~~~~v 120 (148) T protein:vir:79 53 YVPRKPQLRHRAGRIRRAMFMRL-RLARYMKTQADANTAVVTFA----------GN-AQRIATVHQFGLRDRVNKAGLTA 120 (148) T ss_pred CcccchHHHhhcccccccccchh-hhhhheeeeeeCCeeeEEee----------cc-chhhhhhhhcCccccccCCCCcc Confidence 110000 0000000 000 001111 1123446777763 22 36799999999 34 Q ss_pred CCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_015294. 63 RNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRD 97 (159) Q Consensus 63 ~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g 97 (159) +||+||||- +.+. 
...++ ...+..-+.| T Consensus 121 ~iPaRp~LG--~s~~-d~~~i----~~~i~~~l~~ 148 (148) T protein:vir:79 121 QYPARELLG--MDGV-DMEHI----TNLLLLHLGA 148 (148) T ss_pred ccCcccccC--CCHH-HHHHH----HHHHHHHhcC Confidence 799999995 4332 12223 3444444444 No 84 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=95.29 E-value=2.7e-05 Score=45.59 Aligned_cols=82 Identities=15% Similarity=0.259 Sum_probs=37.8 Q ss_pred ccCCc-eeeehHHHHHHHHHHH-------------------HHhhCCEEEEEecccc-cCCCCCCC-----CCHHHHHHH Q lcl|NC_015294. 3 ILASF-SFKTDRRRLTSLIKRV-------------------EALDGTTVEVGFFPED-RYGSENGN-----LPVAQVAAY 56 (159) Q Consensus 3 m~~~~-~~k~~~~~l~~l~~~l-------------------~~l~~~~v~VGi~~~~-~~~~~~~G-----~~~A~iA~~ 56 (159) |.+.+ .+..-.+.|+++.+.+ +.+. -|.-|-+..+ ......+| .+.+.+|.+ T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~a--PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ 78 (137) T protein:vir:93 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLM--PVDTGYLRESVTMDFKDSGFTGVINIGSEYAIY 78 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--CccccchhccceeEeecCceEEEEecCCCcccc Confidence 33322 1111122222222221 1111 1222222221 01011122 345779999 Q ss_pred HhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 57 NEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 88 (159) Q Consensus 57 ~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 88 (159) .|||| .+.|+||||++++++.+. .+.+.+. 
T Consensus 79 vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~--~~~~~l~ 137 (137) T protein:vir:93 79 VNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRA--FFNKYFS 137 (137) T ss_pred cccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHH--HHHHhhC Confidence 99997 358999999999988753 3443333 No 85 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=95.29 E-value=2.7e-05 Score=45.59 Aligned_cols=82 Identities=15% Similarity=0.259 Sum_probs=37.8 Q ss_pred ccCCc-eeeehHHHHHHHHHHH-------------------HHhhCCEEEEEecccc-cCCCCCCC-----CCHHHHHHH Q lcl|NC_015294. 3 ILASF-SFKTDRRRLTSLIKRV-------------------EALDGTTVEVGFFPED-RYGSENGN-----LPVAQVAAY 56 (159) Q Consensus 3 m~~~~-~~k~~~~~l~~l~~~l-------------------~~l~~~~v~VGi~~~~-~~~~~~~G-----~~~A~iA~~ 56 (159) |.+.+ .+..-.+.|+++.+.+ +.+. -|.-|-+..+ ......+| .+.+.+|.+ T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~a--PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ 78 (137) T protein:vir:97 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLM--PVDTGYLRESVTMDFKDSGFTGVINIGSEYAIY 78 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--CccccchhccceeEeecCceEEEEecCCCcccc Confidence 33322 1111122222222221 1111 1222222221 01011122 345779999 Q ss_pred HhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 57 NEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 88 (159) Q Consensus 57 ~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 88 (159) .|||| .+.|+||||++++++.+. .+.+.+. 
T Consensus 79 vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~--~~~~~l~ 137 (137) T protein:vir:97 79 VNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRA--FFNKYFS 137 (137) T ss_pred cccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHH--HHHHhhC Confidence 99997 358999999999988753 3443333 No 86 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=95.29 E-value=2.7e-05 Score=45.59 Aligned_cols=82 Identities=15% Similarity=0.259 Sum_probs=37.8 Q ss_pred ccCCc-eeeehHHHHHHHHHHH-------------------HHhhCCEEEEEecccc-cCCCCCCC-----CCHHHHHHH Q lcl|NC_015294. 3 ILASF-SFKTDRRRLTSLIKRV-------------------EALDGTTVEVGFFPED-RYGSENGN-----LPVAQVAAY 56 (159) Q Consensus 3 m~~~~-~~k~~~~~l~~l~~~l-------------------~~l~~~~v~VGi~~~~-~~~~~~~G-----~~~A~iA~~ 56 (159) |.+.+ .+..-.+.|+++.+.+ +.+. -|.-|-+..+ ......+| .+.+.+|.+ T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~a--PvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~ 78 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLM--PVDTGYLRESVTMDFKDSGFTGVINIGSEYAIY 78 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--CccccchhccceeEeecCceEEEEecCCCcccc Confidence 33322 1111122222222221 1111 1222222221 01011122 345779999 Q ss_pred HhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 57 NEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 88 (159) Q Consensus 57 ~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 88 (159) .|||| .+.|+||||++++++.+. .+.+.+. 
T Consensus 79 vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~--~~~~~l~ 137 (137) T protein:vir:94 79 VNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRA--FFNKYFS 137 (137) T ss_pred cccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHH--HHHHhhC Confidence 99997 358999999999988753 3443333 No 87 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=94.99 E-value=4.5e-05 Score=44.39 Aligned_cols=83 Identities=27% Similarity=0.390 Sum_probs=36.7 Q ss_pred CcccCCceeeehHHHHHHHHHHHH-------------------HhhCCEEEEEeccccc-CCCCCC-----CCCHHHHHH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVE-------------------ALDGTTVEVGFFPEDR-YGSENG-----NLPVAQVAA 55 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~-------------------~l~~~~v~VGi~~~~~-~~~~~~-----G~~~A~iA~ 55 (159) |.... .....-.+.|+++.+.++ .+.. |.-|-+...- .....+ ..+.+.+|. T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~p--vdTG~L~~Si~~~~~~~g~~~~V~~~~~YA~ 77 (137) T protein:vir:96 1 MAKVK-YGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAP--VDLGFLKESIDFKVTDGGFSSVISVGAEYAI 77 (137) T ss_pred CchhH-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCccchhcCceeEeecCceEEEEecCCCccc Confidence 33322 011111222222221111 1111 1112221110 000111 123467999 Q ss_pred HHhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 56 YNEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 88 (159) Q Consensus 56 ~~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 88 (159) +.|||| +++|+||||++++++.+. .+.+.+. 
T Consensus 78 yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~--~i~k~i~ 137 (137) T protein:vir:96 78 YVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRK--VFNRYFS 137 (137) T ss_pred ccccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHHH--HHHHhhC Confidence 999997 458999999999987753 3444333 No 88 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=94.89 E-value=0.00013 Score=41.85 Aligned_cols=79 Identities=14% Similarity=0.229 Sum_probs=36.5 Q ss_pred CcccCCceee-----ehHHHHHHH--HHHHH-HhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCC----------C Q lcl|NC_015294. 1 MIILASFSFK-----TDRRRLTSL--IKRVE-ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGT----------T 62 (159) Q Consensus 1 m~m~~~~~~k-----~~~~~l~~l--~~~l~-~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~----------~ 62 (159) ..=.+--+.+ .....+..+ ...|+ ......+.|||... +..||++|+||. + T Consensus 53 W~p~~~~~~~~~~g~~~~~~~~~l~~~~~l~~~~~~~~~~v~~~Gt-----------n~~yAaiHQfG~~~r~~~~~~~v 121 (149) T protein:vir:18 53 YAARKRQPVRSKKGRIKREMFAKLRTSRFMKAKGSDSAAVVEFTGK-----------VQRMARVHQYGLKDRPNRNSRDV 121 (149) T ss_pred CcccchhhhhhccCcccchhhhhhhhhhhhheeecCceeEEEeccc-----------chhhhhhhhccccccccCCCccc Confidence 1111110000 001111111 11111 12344677776422 367999999994 3 Q ss_pred CCCCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 63 RNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 93 (159) Q Consensus 63 ~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 93 (159) +||+||||- +.+. 
.+.++.+.+...+.+ T Consensus 122 ~iPaRp~LG--~s~~-d~~~I~~~i~~~l~~ 149 (149) T protein:vir:18 122 QYEARPLLG--FTRD-DEQMIEDVIISHLGK 149 (149) T ss_pred cccccccCC--CCHH-HHHHHHHHHHHHHhC Confidence 799999995 4322 223455555454444 No 89 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=94.70 E-value=4.9e-05 Score=44.21 Aligned_cols=82 Identities=16% Similarity=0.277 Sum_probs=37.3 Q ss_pred ccCCc-eeeehHHHHHHHHHHH-------------------HHhhCCEEEEEeccccc-CCCCCCC-----CCHHHHHHH Q lcl|NC_015294. 3 ILASF-SFKTDRRRLTSLIKRV-------------------EALDGTTVEVGFFPEDR-YGSENGN-----LPVAQVAAY 56 (159) Q Consensus 3 m~~~~-~~k~~~~~l~~l~~~l-------------------~~l~~~~v~VGi~~~~~-~~~~~~G-----~~~A~iA~~ 56 (159) |.+.+ .+..-.+.|+++-+.+ +.+. -|.-|-+..+- .....+| .+.+.+|.+ T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~a--Pv~TG~L~~Si~~~~~~~~~~~~V~~~~~YA~~ 78 (137) T protein:vir:95 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLM--PVDTGYLRESVTMDFKDGGFTGVINIGSEYAIY 78 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--CccchhhhcCeeeEeeCCceEEEEecCCCcccc Confidence 33322 1111112222222111 1111 12223222210 1001122 345779999 Q ss_pred HhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 57 NEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 88 (159) Q Consensus 57 ~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 88 (159) .|||| .+.|+||||++++++.+. .+.+.+. 
T Consensus 79 vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~--~i~k~l~ 137 (137) T protein:vir:95 79 VNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRA--FFNKYFS 137 (137) T ss_pred cccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHH--HHHHhhC Confidence 99997 358999999999987643 3443333 No 90 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=94.69 E-value=2.3e-05 Score=46.06 Aligned_cols=85 Identities=18% Similarity=0.284 Sum_probs=38.3 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHh---------------hC--CEEEEEeccccc-CCCCCCC-----CCHHHHHHHH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEAL---------------DG--TTVEVGFFPEDR-YGSENGN-----LPVAQVAAYN 57 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l---------------~~--~~v~VGi~~~~~-~~~~~~G-----~~~A~iA~~~ 57 (159) |+... ..+..-.+.|+++.+.++.. .+ .-|.-|-+..+= .....+| .+.+.+|.+. T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~v 79 (137) T protein:vir:10 1 MAKVK-YGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEYAVYV 79 (137) T ss_pred Cccch-hCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEecCCcEEEEEecCCcccccc Confidence 44432 12222222332222222110 00 012222222210 0001122 2346799999 Q ss_pred hcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 58 EFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 88 (159) Q Consensus 58 E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 88 (159) |||| +++||||||+++++++.. .+.+.+. 
T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~--~i~k~i~ 137 (137) T protein:vir:10 80 NYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRA--FFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHH--HHHHhhC Confidence 9996 258999999999988743 3443333 No 91 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=94.67 E-value=0.00013 Score=41.89 Aligned_cols=85 Identities=16% Similarity=0.301 Sum_probs=37.9 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHh--------h---------CCEEEEEeccccc-CCCCCCC-----CCHHHHHHHH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEAL--------D---------GTTVEVGFFPEDR-YGSENGN-----LPVAQVAAYN 57 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l--------~---------~~~v~VGi~~~~~-~~~~~~G-----~~~A~iA~~~ 57 (159) |+... ..+..-.+.|+++.+.++.. . ..-|.-|-+...= .....+| .+.+.+|.+. T Consensus 1 Ma~~~-~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~v 79 (137) T protein:vir:94 1 MAKVK-YGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYV 79 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeecCcEEEEEecCCCccccc Confidence 44332 12222223333322222110 0 0011112111110 0001111 2346799999 Q ss_pred hcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 58 EFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 88 (159) Q Consensus 58 E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 88 (159) |||| .++|+||||++++++.+. .+.+.+. 
T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~--~~~~~l~ 137 (137) T protein:vir:94 80 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRV--FFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHH--HHHHhhC Confidence 9994 368999999999988753 3443333 No 92 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=94.67 E-value=8.2e-05 Score=42.99 Aligned_cols=86 Identities=15% Similarity=0.107 Sum_probs=40.4 Q ss_pred cccCCceeeehHHHHHHHHHHHHH---------------hhC--CEEEEEecccccC---CCC------CCC-CCHHHHH Q lcl|NC_015294. 2 IILASFSFKTDRRRLTSLIKRVEA---------------LDG--TTVEVGFFPEDRY---GSE------NGN-LPVAQVA 54 (159) Q Consensus 2 ~m~~~~~~k~~~~~l~~l~~~l~~---------------l~~--~~v~VGi~~~~~~---~~~------~~G-~~~A~iA 54 (159) ||...+.+...++.|..+.++++. ..+ .-|.=|.+..+=. ..+ ..| .+.+.+| T Consensus 1 m~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA 80 (142) T protein:vir:86 1 MVQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYA 80 (142) T ss_pred CceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCcccc Confidence 444445555444443333322211 111 0122232221100 000 011 2457899 Q ss_pred HHHhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 55 AYNEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 88 (159) Q Consensus 55 ~~~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 88 (159) .++|||| ++.||||||+++++++..+. 
-...++ T Consensus 81 ~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~-~~~~~r 142 (142) T protein:vir:86 81 AAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRD-RRIRVR 142 (142) T ss_pred ceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhh-hhhccC Confidence 9999996 24669999999998875431 111111 No 93 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=94.67 E-value=8.2e-05 Score=42.99 Aligned_cols=86 Identities=15% Similarity=0.107 Sum_probs=40.4 Q ss_pred cccCCceeeehHHHHHHHHHHHHH---------------hhC--CEEEEEecccccC---CCC------CCC-CCHHHHH Q lcl|NC_015294. 2 IILASFSFKTDRRRLTSLIKRVEA---------------LDG--TTVEVGFFPEDRY---GSE------NGN-LPVAQVA 54 (159) Q Consensus 2 ~m~~~~~~k~~~~~l~~l~~~l~~---------------l~~--~~v~VGi~~~~~~---~~~------~~G-~~~A~iA 54 (159) ||...+.+...++.|..+.++++. ..+ .-|.=|.+..+=. ..+ ..| .+.+.+| T Consensus 1 m~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA 80 (142) T protein:vir:99 1 MVQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYA 80 (142) T ss_pred CceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCcccc Confidence 444445555444443333322211 111 0122232221100 000 011 2457899 Q ss_pred HHHhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 55 AYNEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 88 (159) Q Consensus 55 ~~~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 88 (159) .++|||| ++.||||||+++++++..+. 
-...++ T Consensus 81 ~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~-~~~~~r 142 (142) T protein:vir:99 81 AAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRD-RRIRVR 142 (142) T ss_pred ceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhh-hhhccC Confidence 9999996 24669999999998875431 111111 No 94 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=94.64 E-value=3.6e-05 Score=44.93 Aligned_cols=103 Identities=17% Similarity=0.099 Sum_probs=48.6 Q ss_pred CcccCCceeeehHHHHHHHHHHH-------HHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRV-------EALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPT 73 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l-------~~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~ 73 (159) ..-.+...-+.+ .+...+...+ .....-.+.|||.... .+.+|-+.|+||.++|+.||++.+ T Consensus 44 ~tp~~h~~~~kt-~~~~HlaD~I~~s~~~idG~~dG~s~VG~~~~~----------~a~~a~f~n~GT~km~~~hFie~t 112 (153) T protein:vir:49 44 VTREKHYSKKKD-LKYGHMADGLAVQSTNADGRKNGVSTVGWKNNY----------HAQNARRLNDGTKKYRADHFITNV 112 (153) T ss_pred hccccCCCCCCC-CCCCcccccceeccccccccccceeeecccCCc----------cceeeeecccCcccCCCChhhHHH Confidence 000000000000 0000111111 1112346789996432 367999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCC Q lcl|NC_015294. 74 FEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFN 138 (159) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~ 138 (159) .++...+.++.+.+...+.+++...... -+|.+..+-|... 
T Consensus 113 r~e~~~k~~vl~A~~~~~~~il~~~~~~------------------------~~~~~~~~~~~~~ 153 (153) T protein:vir:49 113 QNDSTVKNKVLLAEKEEYEKLIRRKGGV------------------------YLSASNFKTKRAT 153 (153) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHhcCCe------------------------eeeccccccccCC Confidence 9876444455555555555555433210 0111111111100 No 95 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=94.58 E-value=0.00011 Score=42.26 Aligned_cols=83 Identities=24% Similarity=0.412 Sum_probs=37.6 Q ss_pred CcccCCceeeehHHHHHHHHHHHH-------------------HhhCCEEEEEeccccc-CCCCCCC-----CCHHHHHH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVE-------------------ALDGTTVEVGFFPEDR-YGSENGN-----LPVAQVAA 55 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~-------------------~l~~~~v~VGi~~~~~-~~~~~~G-----~~~A~iA~ 55 (159) |+... ..+..-.+.|+++.+.++ .+. -|.-|-+...= .....+| .+.+.+|. T Consensus 13 Ma~~~-~Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~a--PvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~ 89 (149) T protein:vir:94 13 MAKVK-YGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALA--PVDLGFLEESIDFKYFDGGLSSVISVGADYAI 89 (149) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--CcccchhhcCeeEEeeCCcEEEEEecCCCccc Confidence 54422 011222222222222211 111 11122222110 0011122 23467999 Q ss_pred HHhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 56 YNEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 88 (159) Q Consensus 56 ~~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 88 (159) +.|||| ++.||||||++++++.+. .+.+.+. 
T Consensus 90 ~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~--~i~~~i~ 149 (149) T protein:vir:94 90 YVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRK--TFEQYFS 149 (149) T ss_pred ccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHHH--HHHHhhC Confidence 999996 458999999999987643 3444333 No 96 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=94.51 E-value=0.00013 Score=41.85 Aligned_cols=80 Identities=15% Similarity=0.143 Sum_probs=32.5 Q ss_pred CcccC------------CceeeehHHHHHHHHHHHHH-hhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCC------ Q lcl|NC_015294. 1 MIILA------------SFSFKTDRRRLTSLIKRVEA-LDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGT------ 61 (159) Q Consensus 1 m~m~~------------~~~~k~~~~~l~~l~~~l~~-l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~------ 61 (159) ..... +-++=.+.- .|...|.. .++..|.||- +..+|++|+||. T Consensus 55 ~pls~~t~~~r~~~g~~~~~iL~~tG---~L~~Si~~~~~~~~v~vGt--------------~~~YA~iHqfGg~~~~~~ 117 (155) T protein:vir:79 55 PQLSPATVAAREAKGRGPHPILQVTN---ALARSVTTWADRNEAGIGS--------------NLVYAAIHQFGGDAGRGH 117 (155) T ss_pred CCCCHHHHHHHhccCCCCCCccccch---hhhhhhhceecCCEEEEec--------------CchhhhhhhcccccCCCC Confidence 11000 000001111 12333321 2344555553 245899999995 Q ss_pred -CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHH-hCC Q lcl|NC_015294. 62 -TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVL-RDG 98 (159) Q Consensus 62 -~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~-~g~ 98 (159) ++||+||||--.-+.....+ ..+.+...+...+ .|. 
T Consensus 118 ~v~iPaRpfLG~s~~~~l~~~-~~~~I~~~i~~~l~r~r 155 (155) T protein:vir:79 118 QVEIPARRYLPFDENGQLAAG-ARQSILEVVLTALSRNR 155 (155) T ss_pred ccccCCccccCCCCccccchH-HHHHHHHHHHHHHHhcC Confidence 38999999942221111011 1122223333333 333 No 97 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=94.43 E-value=2.6e-05 Score=45.71 Aligned_cols=83 Identities=17% Similarity=0.212 Sum_probs=40.0 Q ss_pred CcccCCcee---------------eehHHHHHH----HHHHHHHhhC----------------CEEEEEecccccCCCCC Q lcl|NC_015294. 1 MIILASFSF---------------KTDRRRLTS----LIKRVEALDG----------------TTVEVGFFPEDRYGSEN 45 (159) Q Consensus 1 m~m~~~~~~---------------k~~~~~l~~----l~~~l~~l~~----------------~~v~VGi~~~~~~~~~~ 45 (159) |..+---.+ +..++.|.+ +.++++.-.. -.+.||+. T Consensus 1 Ma~iel~G~del~~~l~~~g~~~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkkik~~~kk~g~~~VG~~--------- 71 (119) T protein:vir:10 1 MASLEIEGFEEFEKFISEDMVLDESTKRKGIKAGITKIGKAIEKNSPIKSGRLSKVKIRVKNTGLATEGTA--------- 71 (119) T ss_pred CceeehhhHHHHHHHHHhhhhhhHHHHHHHHHHHhHHHHHHHhhcCCcccCCcceeeeeeecCceeEeccC--------- Confidence 111100000 011111111 2222211110 13444442 Q ss_pred CCCCHHHHHHHHhcCCCCCCCC-chhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015294. 46 GNLPVAQVAAYNEFGTTRNPTR-PFMAPTFEEFTSQFHYARLMKSTFENVLR 96 (159) Q Consensus 46 ~G~~~A~iA~~~E~G~~~IP~R-pFlr~~~~~~~~~~~~~~~~~~~~~~~~~ 96 (159) .+-+-++-.+||||...|+| |||.+++++... 
+..+.+...+..=++ T Consensus 72 --ks~~fy~kF~EFGTSkm~a~~pF~~~a~~~~~~--eA~~~~~~el~~~~r 119 (119) T protein:vir:10 72 --SSSEFYDIFQNFGTSEQKAHVGYFDRAVDETTN--EAVEEVAEIIFRKMR 119 (119) T ss_pred --CcchhhhhhccccccccCCCCCccccccccChH--HHHHHHHHHHHHhcC Confidence 23467999999999999999 999999987743 333444443333222 No 98 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=94.33 E-value=0.00041 Score=39.15 Aligned_cols=81 Identities=10% Similarity=0.094 Sum_probs=57.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhc-----C--CCCCcHHHHHhcCC-CCCchhHH Q lcl|NC_015294. 74 FEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDD-----Y--PGSNSPAWAAYKGF-NDPLFHTG 145 (159) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~-----~--~~Pna~~Ti~~KG~-~~PLiDTG 145 (159) +++. .++.+.+..++..+- ..+...++..||..+....++.|.+ | |+|+++.|.++||. .++|.+++ T Consensus 1 m~~~---~~l~~~L~~ll~~l~--~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~~~~~~~~l 75 (148) T protein:vir:79 1 MSES---RELEAWLAGMLTKLD--APARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRIRRAMFMRL 75 (148) T ss_pred CccH---HHHHHHHHHHHHhcC--ChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccccccccchh Confidence 3222 123333444444421 1233578999999999999999964 3 45899999999986 47899999 Q ss_pred HHHhhhhhhhcccC Q lcl|NC_015294. 146 KMLESVKFQIHRRQ 159 (159) Q Consensus 146 ~L~~SIty~V~~k~ 159 (159) .+..++++.+.... 
T Consensus 76 ~~~~~l~~~~~~~~ 89 (148) T protein:vir:79 76 RLARYMKTQADANT 89 (148) T ss_pred hhhhheeeeeeCCe Confidence 99999988887766 No 99 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=94.30 E-value=5.8e-05 Score=43.83 Aligned_cols=77 Identities=17% Similarity=0.380 Sum_probs=36.9 Q ss_pred ccCCceeeehHHHHHH----HHHHHHHhhCCEEEEEeccccc-CCCCCCCC-----CHHHHHHHHhcC------------ Q lcl|NC_015294. 3 ILASFSFKTDRRRLTS----LIKRVEALDGTTVEVGFFPEDR-YGSENGNL-----PVAQVAAYNEFG------------ 60 (159) Q Consensus 3 m~~~~~~k~~~~~l~~----l~~~l~~l~~~~v~VGi~~~~~-~~~~~~G~-----~~A~iA~~~E~G------------ 60 (159) |...+ ++.+.+ +.+..+.+.. |.=|-+...= +...++|+ +.+.+|.+.||| T Consensus 1 v~~~v-----~~~~~~~~~~i~~~ak~~aP--v~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~ 73 (116) T protein:vir:97 1 MERWV-----KRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSR 73 (116) T ss_pred ChHHH-----HHHHHHHHHHHHHHHHHhCC--cCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCccc Confidence 11111 122222 2223333322 2223222211 11112222 357799999999 Q ss_pred -----------------CCCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 61 -----------------TTRNPTRPFMAPTFEEFTSQFHYARLMK 88 (159) Q Consensus 61 -----------------~~~IP~RpFlr~~~~~~~~~~~~~~~~~ 88 (159) +.++|+||||++++++.+. .+.+.+. 
T Consensus 74 ~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~--~i~k~i~ 116 (116) T protein:vir:97 74 AKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRA--FFNKYFS 116 (116) T ss_pred ccccceeeecCCceeeecCCcCCCcchHHHHHHHHH--HHHHhhC Confidence 4569999999999987643 2333222 No 100 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=94.30 E-value=5.8e-05 Score=43.83 Aligned_cols=77 Identities=17% Similarity=0.380 Sum_probs=36.9 Q ss_pred ccCCceeeehHHHHHH----HHHHHHHhhCCEEEEEeccccc-CCCCCCCC-----CHHHHHHHHhcC------------ Q lcl|NC_015294. 3 ILASFSFKTDRRRLTS----LIKRVEALDGTTVEVGFFPEDR-YGSENGNL-----PVAQVAAYNEFG------------ 60 (159) Q Consensus 3 m~~~~~~k~~~~~l~~----l~~~l~~l~~~~v~VGi~~~~~-~~~~~~G~-----~~A~iA~~~E~G------------ 60 (159) |...+ ++.+.+ +.+..+.+.. |.=|-+...= +...++|+ +.+.+|.+.||| T Consensus 1 v~~~v-----~~~~~~~~~~i~~~ak~~aP--v~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~ 73 (116) T protein:vir:12 1 MERWV-----KRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSR 73 (116) T ss_pred ChHHH-----HHHHHHHHHHHHHHHHHhCC--cCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCccc Confidence 11111 122222 2223333322 2223222211 11112222 357799999999 Q ss_pred -----------------CCCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 61 -----------------TTRNPTRPFMAPTFEEFTSQFHYARLMK 88 (159) Q Consensus 61 -----------------~~~IP~RpFlr~~~~~~~~~~~~~~~~~ 88 (159) +.++|+||||++++++.+. .+.+.+. 
T Consensus 74 ~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~--~i~k~i~ 116 (116) T protein:vir:12 74 AKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRA--FFNKYFS 116 (116) T ss_pred ccccceeeecCCceeeecCCcCCCcchHHHHHHHHH--HHHHhhC Confidence 4569999999999987643 2333222 No 101 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=94.20 E-value=0.00015 Score=41.62 Aligned_cols=88 Identities=16% Similarity=0.177 Sum_probs=49.1 Q ss_pred CcccCCc---------eeeehHHHHHHHHHHH-------HHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCC Q lcl|NC_015294. 1 MIILASF---------SFKTDRRRLTSLIKRV-------EALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRN 64 (159) Q Consensus 1 m~m~~~~---------~~k~~~~~l~~l~~~l-------~~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~I 64 (159) ......+ +-+.+. ....+...+ .......+.|||... ..|.+|.+.++||..+ T Consensus 35 kv~~~~L~~~tp~~h~~~r~t~-~~~HlaD~I~~~~~~idg~~dG~s~VG~~k~----------~~a~~a~f~NdGT~k~ 103 (140) T protein:vir:48 35 KVFKKELAEVTREKHYSKKKDL-KYGHMADGLAVQSTNVDGRKNGVATVGWKNN----------YHAQNARRLNDGTKKY 103 (140) T ss_pred HHHHHHHHHhcccCCCCCCCCC-CCCcccccceecccccccccccceeecccCC----------CceeEEeecccCcccc Confidence 0000000 000000 000111111 112233567999632 2378999999999999 Q ss_pred CCCchhhHHHHHHHHHHHHHHHHHHHHHHHH--hCCC Q lcl|NC_015294. 65 PTRPFMAPTFEEFTSQFHYARLMKSTFENVL--RDGR 99 (159) Q Consensus 65 P~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~--~g~~ 99 (159) |+.+|+..+.++...+.++.+.+...+.+++ .|+. 
T Consensus 104 ~~~hFve~t~~e~~~~~~vl~A~~~~y~~~l~kk~~~ 140 (140) T protein:vir:48 104 RADHFVTNVQNDSAVRDKVLLAEKEEYEKLIRKKGGE 140 (140) T ss_pred CCCchHHHHHHhhhhHHHHHHHHHHHHHHHHHhhcCC Confidence 9999999999876555567777777777776 3332 No 102 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=94.15 E-value=5.8e-05 Score=43.82 Aligned_cols=85 Identities=21% Similarity=0.361 Sum_probs=38.3 Q ss_pred CcccCCceeeehHHHHHHHHHHHHH---------------hhC--CEEEEEeccccc-CCCCCCC-----CCHHHHHHHH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEA---------------LDG--TTVEVGFFPEDR-YGSENGN-----LPVAQVAAYN 57 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~---------------l~~--~~v~VGi~~~~~-~~~~~~G-----~~~A~iA~~~ 57 (159) |+... ..+..-.+.|+++.+.++. ..+ .-|.-|.+...= .....+| .+.+.+|.+. T Consensus 13 Ma~v~-~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~~~~g~~~~V~~~~~YA~~v 91 (149) T protein:vir:10 13 MAKVK-YGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYAIYV 91 (149) T ss_pred hHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEecCCcEEEEEecCCCccccc Confidence 54431 0112222223222222110 000 011122222210 0011122 2346799999 Q ss_pred hcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 58 EFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 88 (159) Q Consensus 58 E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 88 (159) |||| +++|||||||+++++.+. .+.+.+. 
T Consensus 92 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~--~i~~~i~ 149 (149) T protein:vir:10 92 EYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRK--TFEQYFS 149 (149) T ss_pred ccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHH--HHHHhhC Confidence 9996 457999999999988753 3444333 No 103 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=94.13 E-value=5.7e-05 Score=43.83 Aligned_cols=77 Identities=17% Similarity=0.364 Sum_probs=36.2 Q ss_pred ccCCceeeehHHHHHHHHH----HHHHhhCCEEEEEeccccc-CCCCCCCC-----CHHHHHHHHhcC------------ Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLIK----RVEALDGTTVEVGFFPEDR-YGSENGNL-----PVAQVAAYNEFG------------ 60 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~----~l~~l~~~~v~VGi~~~~~-~~~~~~G~-----~~A~iA~~~E~G------------ 60 (159) |...+ ++.+.+... ..+.+.. |.-|-+...- +...++|+ +.+.+|.+.||| T Consensus 1 v~~~v-----~~~~~~~~~~i~~~ak~~ap--v~TG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~yvE~GTg~~~~~~~~~~ 73 (116) T protein:vir:95 1 MERWV-----KRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSR 73 (116) T ss_pred ChHHH-----HHHHHHHHHHHHHHHHhhCC--ccccccccceeEEeecCcEEEEEecCCCccceeecCccccccCCCccc Confidence 11111 122223222 2333322 2222222211 10111221 357799999999 Q ss_pred -----------------CCCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 61 -----------------TTRNPTRPFMAPTFEEFTSQFHYARLMK 88 (159) Q Consensus 61 -----------------~~~IP~RpFlr~~~~~~~~~~~~~~~~~ 88 (159) +.++||||||++++++++. .+.+.+. 
T Consensus 74 ~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~--~i~k~is 116 (116) T protein:vir:95 74 AKNIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRA--FFNKYFS 116 (116) T ss_pred cccccceeecCccceeeCCCCCCCcchHHHHHHHHH--HHHHhhC Confidence 4468999999999987643 2332222 No 104 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=94.13 E-value=0.00017 Score=41.23 Aligned_cols=78 Identities=14% Similarity=0.187 Sum_probs=32.9 Q ss_pred CcccC------------CceeeehHHHHHHHHHHHHH-hhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCC------ Q lcl|NC_015294. 1 MIILA------------SFSFKTDRRRLTSLIKRVEA-LDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGT------ 61 (159) Q Consensus 1 m~m~~------------~~~~k~~~~~l~~l~~~l~~-l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~------ 61 (159) ..... +-.+-.+.. .|...|.. .+...|.||- +..+|++|+||. T Consensus 55 ~pls~~t~~~r~~~g~~~~~iL~~tg---~L~~Si~~~~~~~~v~vGt--------------n~~YA~iHqfGg~~~~~~ 117 (155) T protein:vir:99 55 PQLSPVTVAAREAKGRGPHPILQVTN---ALARSVTTWADRNEAGIGS--------------NLVYAAIHQFGGDAGRGH 117 (155) T ss_pred CCCChHHHHHHhccCCCCCCcchhch---hhhhhhhceecCCEEEEec--------------CccchhhhhcccccCCCC Confidence 11000 000001111 12333321 2445666653 134799999995 Q ss_pred -CCCCCCchhhHHHHH---HHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_015294. 62 -TRNPTRPFMAPTFEE---FTSQFHYARLMKSTFENVLRDGR 99 (159) Q Consensus 62 -~~IP~RpFlr~~~~~---~~~~~~~~~~~~~~~~~~~~g~~ 99 (159) ++||+||||--.-+. .+.+..+.+.+...+. -+. 
T Consensus 118 ~v~iPaRpfLG~s~~~~l~~e~~~~I~~~i~~~l~----~~~ 155 (155) T protein:vir:99 118 QVEIPARRYLPFDENGQLAAGARQSILEIVLTALS----RNR 155 (155) T ss_pred ccccCCccccCCCCccccchHHHHHHHHHHHHHHh----ccC Confidence 389999999422111 1112233333333332 232 No 105 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=94.01 E-value=0.00048 Score=38.76 Aligned_cols=81 Identities=10% Similarity=0.029 Sum_probs=56.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhc-----C--CCCCcHHHHHhcCC--CCCchhH Q lcl|NC_015294. 74 FEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDD-----Y--PGSNSPAWAAYKGF--NDPLFHT 144 (159) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~-----~--~~Pna~~Ti~~KG~--~~PLiDT 144 (159) +++.. ...+.+..++.++-. .....+|..||..+....++.|.+ | |+|+++.|++.|.. .++|..+ T Consensus 1 m~~~~---~~~~~l~~ll~~L~~--~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~~~~~~~~~ 75 (149) T protein:vir:18 1 MSELT---ALQERLAGLIASLSP--AARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGRIKREMFAK 75 (149) T ss_pred CchHH---HHHHHHHHHHHhcCC--chHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCcccchhhhh Confidence 33221 122333444444321 234679999999999999999964 2 56999999987653 5789999 Q ss_pred HHHHhhhhhhhcccC Q lcl|NC_015294. 145 GKMLESVKFQIHRRQ 159 (159) Q Consensus 145 G~L~~SIty~V~~k~ 159 (159) +.+..++++.+.... 
T Consensus 76 l~~~~~l~~~~~~~~ 90 (149) T protein:vir:18 76 LRTSRFMKAKGSDSA 90 (149) T ss_pred hhhhhhhheeecCce Confidence 999999998887777 No 106 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=93.98 E-value=8.8e-05 Score=42.82 Aligned_cols=78 Identities=21% Similarity=0.377 Sum_probs=35.2 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHhhCC------------------------EEEEEeccccc-CCCCCCC-----CCH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEALDGT------------------------TVEVGFFPEDR-YGSENGN-----LPV 50 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l~~~------------------------~v~VGi~~~~~-~~~~~~G-----~~~ 50 (159) |... . .+|++|.+.|+.+.+. -|.-|-+.+.= .....+| .+. T Consensus 1 Ma~~--~------~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~ 72 (137) T protein:vir:10 1 MAKV--K------YGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIG 72 (137) T ss_pred Cchh--H------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeCCcEEEEEecC Confidence 3322 1 1222222222221110 11112111110 0000111 234 Q ss_pred HHHHHHHhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 51 AQVAAYNEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMK 88 (159) Q Consensus 51 A~iA~~~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~ 88 (159) +.+|.+.|||| .++|+||||+++++++.. .+.+.+. 
T Consensus 73 ~~Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~--~i~k~i~ 137 (137) T protein:vir:10 73 SEYAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRA--FFNKYFS 137 (137) T ss_pred CCcccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHH--HHHHhcC Confidence 67899999995 247999999999988643 3443333 No 107 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=93.69 E-value=0.00027 Score=40.12 Aligned_cols=83 Identities=11% Similarity=0.135 Sum_probs=37.7 Q ss_pred CcccCCc-------eeeehHHHHHHHHH--HHH-HhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCC--------- Q lcl|NC_015294. 1 MIILASF-------SFKTDRRRLTSLIK--RVE-ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGT--------- 61 (159) Q Consensus 1 m~m~~~~-------~~k~~~~~l~~l~~--~l~-~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~--------- 61 (159) ..-.+-- .++.....+..+.. .|+ ..+...+.|||.. ++..||++|.||. T Consensus 54 W~p~~~~~~~~~~~~~~~~~~m~~~l~~~~~l~~~~~~~~a~vg~~G-----------s~~~yA~iHQfG~~~~~~~~~~ 122 (156) T protein:vir:11 54 YEPRKKRELRGKQGRIRRKIKMFQKLRTVRYLRAKGDAQAITVSFAG-----------RIARIARVHQYGLRDRAEPGAP 122 (156) T ss_pred CcccchHHHhhhccccccchhhhhhhhhhheeeeeecCcEEEEEecC-----------CchhhhhhhcccccccccCCCC Confidence 1111100 00111111111111 111 1245678888742 2367999999994 Q ss_pred -CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHH Q lcl|NC_015294. 62 -TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTN 102 (159) Q Consensus 62 -~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 102 (159) ++||+||||- +.+. .+.++. ..+...+.+. 
++- T Consensus 123 ~v~iPaRp~LG--~s~~-d~~~i~----~~i~~~l~~~-~~~ 156 (156) T protein:vir:11 123 EVSYAQRLLLG--FDSS-DMETIQ----NGILAHIDAN-SPI 156 (156) T ss_pred cccccccccCC--CCHH-HHHHHH----HHHHHHHhhc-CCC Confidence 2799999995 4322 122333 3444444443 332 No 108 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=93.68 E-value=0.00019 Score=40.98 Aligned_cols=84 Identities=19% Similarity=0.304 Sum_probs=39.1 Q ss_pred ccCCc--eeeehHHHHHHHH-HHHHH----hhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcC--------------- Q lcl|NC_015294. 3 ILASF--SFKTDRRRLTSLI-KRVEA----LDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFG--------------- 60 (159) Q Consensus 3 m~~~~--~~k~~~~~l~~l~-~~l~~----l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G--------------- 60 (159) +.-.. .+....=.|.+-+ ..... -....-.|||-.-+. --+.+.||| T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkA-----------PhghlvE~Ghw~~~~~~~~~dG~w 69 (119) T protein:vir:81 1 MRESAKAFVNDETGKLRSNLYVAYSPEESTNGVQTYAVSWRKKAA-----------PHGHLLEFGHWQTHAAYKGKDGEW 69 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCeEEEEeeccCCcC-----------CcccccccceeeeeeeeeccCcee Confidence 10000 0111111121111 11111 011233455543321 123445888 Q ss_pred ---------CCCCCCCchhhHHHHHHHHHHHHHHHHHHH----HHHHHhCCC Q lcl|NC_015294. 61 ---------TTRNPTRPFMAPTFEEFTSQFHYARLMKST----FENVLRDGR 99 (159) Q Consensus 61 ---------~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~----~~~~~~g~~ 99 (159) ...+|+|||||+++|....+ ....+.+. +..++.|.. 
T Consensus 70 ~~~~~~l~~~~~vPa~pFlRpA~da~~~~--a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:81 70 YSSSVKLVNPKWIPARPFLRPGYDSVAMQ--IPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred eecCccccCceecCCCCccchhHHHHHHH--HHHHHHHHHHHHHHHHhccCC Confidence 34799999999999976543 34444444 677777775 No 109 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=93.64 E-value=0.00021 Score=40.79 Aligned_cols=91 Identities=16% Similarity=0.115 Sum_probs=50.1 Q ss_pred CcccCCceeeehHHHHHHHHHH-------HHHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKR-------VEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPT 73 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~-------l~~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~ 73 (159) .--.+-..-+.+. ....+... +....+-.+.|||.... -+.+|.+.|+||.++|+-||+..+ T Consensus 44 ~tp~~hy~~~~~~-~~~HlaD~I~~~~~~~DG~~dg~s~VG~~~~~----------~~~~A~f~n~GT~k~~~~hFve~~ 112 (141) T protein:vir:50 44 VTREKHYSRKKNP-KFGHMADGLAIQSTNADGRKNGVSTVGWKNNY----------HAQNARRLNDGTKKYRADHFVTNV 112 (141) T ss_pred hcccCCCCCCCCC-CCCccccceeeccCccccccCCeeeeccCCCc----------cceeeeccccCccccCCCchhHHH Confidence 1100000000000 00011111 11122346789996321 378999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHH Q lcl|NC_015294. 
74 FEEFTSQFHYARLMKSTFENVLRDGRQTN 102 (159) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 102 (159) ..+...+.++.+.+...+++++.-...-+ T Consensus 113 ~~~a~~k~~Vl~A~~~~~k~~l~~~~~~~ 141 (141) T protein:vir:50 113 QNDSTVQKKVLLEKKRNTKNSLEEKEGCD 141 (141) T ss_pred HHhhhhHHHHHHHHHHHHHHHHHhccCCC Confidence 97654445677777777777765432212 No 110 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=93.47 E-value=0.00022 Score=40.64 Aligned_cols=84 Identities=19% Similarity=0.305 Sum_probs=39.0 Q ss_pred ccCCc--eeeehHHHHHHHH-HHHHH----hhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcC--------------- Q lcl|NC_015294. 3 ILASF--SFKTDRRRLTSLI-KRVEA----LDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFG--------------- 60 (159) Q Consensus 3 m~~~~--~~k~~~~~l~~l~-~~l~~----l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G--------------- 60 (159) +.-.. .+....=.|.+-+ ..... -....-.|||-.-+. --+.+.||| T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkA-----------PhghlvE~Ghw~~~~~~~~~dG~w 69 (119) T protein:vir:10 1 MRESAKAFVNDETGKLRSNLYVAYSTEESTNGVQTYAVSWRKKAA-----------PHGHLLEFGHWQTHAAYKGKDGEW 69 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCEEEEEeecCCCcC-----------CcccccccceeeeeeeeeccCcee Confidence 10000 0111111122111 11111 011233455543321 133445898 Q ss_pred ---------CCCCCCCchhhHHHHHHHHHHHHHHHHHHH----HHHHHhCCC Q lcl|NC_015294. 61 ---------TTRNPTRPFMAPTFEEFTSQFHYARLMKST----FENVLRDGR 99 (159) Q Consensus 61 ---------~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~----~~~~~~g~~ 99 (159) ...+|+|||||+++|....+ ....+.+. +..++.|.. 
T Consensus 70 ~~~~~~l~~~~~vPa~pFlRpA~da~~~~--a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:10 70 YSSSVKLVNPKWIPARPFLRPGYDSVAMQ--IPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred eecCccccCceecCCCCccchhHHHHHHH--HHHHHHHHHHHHHHHHhccCC Confidence 22699999999999976543 34444444 677777775 No 111 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=93.39 E-value=0.0002 Score=40.84 Aligned_cols=88 Identities=13% Similarity=0.126 Sum_probs=48.1 Q ss_pred CcccCCceeeehHHHHHHHHHHHH-------HhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVE-------ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPT 73 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~-------~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~ 73 (159) +.- +..+...+..+-..+...+. ......+.|||... +.+|.+.||||.++||.||+..+ T Consensus 44 tp~-~~~~~~~~~~~~~HlaD~I~~s~~~~dg~~~g~~~VG~~k~------------~~~A~f~n~GT~k~~~~hFie~t 110 (139) T protein:vir:10 44 TKE-KHPNTKGDGGKYGHLSEDIRSAAGDIDGDHNGSSTVGFHNK------------AHIARFLNDGTKYIRADHFVDNA 110 (139) T ss_pred ccc-ccCcCCCCCCCCcchhhcceecCcccccccceeeeeCCCCC------------cceEeecccCccccCCCchHHHH Confidence 221 11111111101111222221 11233467888421 45789999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCC-HHH Q lcl|NC_015294. 74 FEEFTSQFHYARLMKSTFENVLRDGRQ-TNT 103 (159) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~g~~~-~~~ 103 (159) ..+.+ .++.+.+...+++++..... -+. 
T Consensus 111 ~~e~~--~evl~a~~~~~k~~l~~~~~~~~~ 139 (139) T protein:vir:10 111 RDDAK--DAVFAAEAEKYQAMIAKANGGGDK 139 (139) T ss_pred HHHHH--HHHHHHHHHHHHHHHhhcCCCCCC Confidence 98754 46777777777777654321 112 No 112 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=93.13 E-value=0.00025 Score=40.29 Aligned_cols=90 Identities=16% Similarity=0.148 Sum_probs=49.4 Q ss_pred CcccCCceeeehHHHHHHHHHHH-------HHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRV-------EALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPT 73 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l-------~~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~ 73 (159) .--.+--.-+.+ .....+...+ .......+.|||... ..+.+|.+.++||.++|+-||+..+ T Consensus 44 ~tp~~h~~~~~t-~~~~HlaD~I~~~~~~iDg~~~g~s~VG~~kk----------~~a~~A~f~n~GT~k~~~~hFve~~ 112 (140) T protein:vir:48 44 VTRQKHYSNKKH-LKYGHMADGLSVQSTNVDGRKNGVSTVGWVNR----------YHAQNARRLNDGTKKYRADHFVTNV 112 (140) T ss_pred hccccCCCCCCC-CCCCcchhceeecccccccccCceeeeccCCC----------cceeeeeccccCccccCCCchhHHH Confidence 000000000000 0000111111 112244678999532 2378999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHH Q lcl|NC_015294. 74 FEEFTSQFHYARLMKSTFENVLRDGRQTN 102 (159) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 102 (159) .++...+.++.+.+...+.+++.-. 
..+ T Consensus 113 ~~e~~~k~~vl~A~~~~~~~~l~~~-~~~ 140 (140) T protein:vir:48 113 QNDSAVQTKVLLAEKEEYEKLIRKK-GGE 140 (140) T ss_pred HHhhhhHHHHHHHHHHHHHHHHHhh-cCC Confidence 9876545566676667777766532 222 No 113 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=92.82 E-value=0.001 Score=36.93 Aligned_cols=83 Identities=13% Similarity=0.197 Sum_probs=45.1 Q ss_pred CcccCCc--eeeehHHHHHHHHHHHH--HhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcC---------------- Q lcl|NC_015294. 1 MIILASF--SFKTDRRRLTSLIKRVE--ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFG---------------- 60 (159) Q Consensus 1 m~m~~~~--~~k~~~~~l~~l~~~l~--~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G---------------- 60 (159) .-=-+.- ..+ ..+.|.++.+.+. ..++..+.++++.+ .++.||++|.|| T Consensus 59 w~pRK~~~~k~k-~~rm~~kL~~~~~~~~~~~~~~~~~~~~g----------~~~~IA~vHQ~G~~~rv~~~~~~~~~~~ 127 (231) T protein:vir:37 59 WEKRKPVDGEIK-NKRLLKKVLRYASILAEERGKGRIYYKNP----------LTGEIAQKQQDGFTEHFRVFATDKNKNG 127 (231) T ss_pred Cchhcccccchh-hHHHHHHhHHhhccccccCCceEEeeecc----------hHHHHHHHhhcCcccccchhhhhhccCC Confidence 1111000 000 1123344443332 22333345555433 257899999999 Q ss_pred ----------------------------------------------------------------------CCCCCCCchh Q lcl|NC_015294. 61 ----------------------------------------------------------------------TTRNPTRPFM 70 (159) Q Consensus 61 ----------------------------------------------------------------------~~~IP~RpFl 70 (159) .+.+|+|||| T Consensus 128 ~~~~pATr~QAk~Lr~lGy~v~~~k~k~~k~~~rkps~kwI~~~ls~~qAgliIR~L~~k~~~~~~k~~W~I~~paR~FL 207 (231) T protein:vir:37 128 SGNDRATIRQAQKLRSLGYRKRNGKNRQGKTKYRLYTIKEIRERLTRTWASMEIRRLENKVNAGNGKTNWEIHVPARPFL 207 (231) T ss_pred CCCCCCCHHHHHHHHHhcccccCCCCCCCCCCcCcCCHHHHHHhhhhHHHHHHHHHHhcccccccCcceeeeecCccccc Confidence 1358999998 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_015294. 
71 APTFEEFTSQFHYARLMKSTFENVLRDGRQ 100 (159) Q Consensus 71 r~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 100 (159) -..-+ ++.+++...+.+++.|... T Consensus 208 G~~~~------e~~~~l~~~l~~i~~~~~~ 231 (231) T protein:vir:37 208 DTREK------ENVDILREITLKFLSGEYK 231 (231) T ss_pred CCCHH------HHHHHHHHHHHHHhcccCC Confidence 64432 3567788888888888776 No 114 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=92.67 E-value=0.00011 Score=42.27 Aligned_cols=87 Identities=14% Similarity=0.098 Sum_probs=38.8 Q ss_pred CcccCCceeeehH----------HHHHHHHHHHHHhhCCE--EEEEeccccc-C-CCCCC-------CCCHHHHHHHHhc Q lcl|NC_015294. 1 MIILASFSFKTDR----------RRLTSLIKRVEALDGTT--VEVGFFPEDR-Y-GSENG-------NLPVAQVAAYNEF 59 (159) Q Consensus 1 m~m~~~~~~k~~~----------~~l~~l~~~l~~l~~~~--v~VGi~~~~~-~-~~~~~-------G~~~A~iA~~~E~ 59 (159) |.++..+.+.... +.+.++...++...+.. |.-|-+..+= . ...++ -.+++.+|.++|| T Consensus 1 m~~s~~i~i~~~~l~~~v~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~ve~ 80 (137) T protein:vir:10 1 MPVTARIHINEPELERQTGAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTYRPFHVGGGVEDNVDYAAPVHE 80 (137) T ss_pred CCeeEEEeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeeccccceEEEEEecCCCceeeeee Confidence 6666665543221 11222222222222211 2223332220 0 00011 1245789999999 Q ss_pred CC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_015294. 60 GT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQ 100 (159) Q Consensus 60 G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 100 (159) |+ +++||||||++++++...... 
-| .++ T Consensus 81 GT~ph~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~~~~~~-------ri------~~~ 137 (137) T protein:vir:10 81 GSRPHRITARHANALHFFWHGREVFRKSVWHPGVRPRPFLRNAARRVVAADP-------DI------HMT 137 (137) T ss_pred cCCCceeecccCceeeeeeCCceEEeeeeecCCCCCCchHHHHHHHHhhccc-------cc------cCC Confidence 95 145599999999986421111 01 111 No 115 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=92.50 E-value=0.0014 Score=36.27 Aligned_cols=88 Identities=16% Similarity=0.257 Sum_probs=40.6 Q ss_pred cCCceeeehHHH----HHHHH-------------------HHHHHhhCC---EEEEEecccccCCCCCCC-------CCH Q lcl|NC_015294. 4 LASFSFKTDRRR----LTSLI-------------------KRVEALDGT---TVEVGFFPEDRYGSENGN-------LPV 50 (159) Q Consensus 4 ~~~~~~k~~~~~----l~~l~-------------------~~l~~l~~~---~v~VGi~~~~~~~~~~~G-------~~~ 50 (159) |+.|++..-.+. |+.+. +.+++...+ ...=||-..... +.++ .+- T Consensus 1 Ma~i~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~--~~g~~~~vv~~~~~ 78 (126) T protein:vir:81 1 MANITIDRLADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKED--GYGTTKRIIWNKKH 78 (126) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccchhhccccccccc--cCCcceEEEeccCC Confidence 233444432121 22111 111221111 111112111110 0011 111 Q ss_pred HHHHHHHhcCCC-----CCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_015294. 51 AQVAAYNEFGTT-----RNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGR 99 (159) Q Consensus 51 A~iA~~~E~G~~-----~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 99 (159) ..++-+.|||+. .+|+||||+|+++... +.+.+.++.++.|+. 
T Consensus 79 ~~l~HLLEfGha~r~gGrV~a~Phi~Pa~e~~~------~~~~~~i~~~l~~gg 126 (126) T protein:vir:81 79 YRRVHLLEFGHAKVNGGRVKEYPHLRPAYDKHG------ARLPDELKRVIENGG 126 (126) T ss_pred CCceeeeecceecCCCCccCCCcchHHHHHHHH------HHHHHHHHHHhhcCC Confidence 345677899976 3899999999987653 335667777777775 No 116 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=91.27 E-value=0.00035 Score=39.55 Aligned_cols=64 Identities=16% Similarity=0.161 Sum_probs=30.0 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_015294. 70 MAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 149 (159) Q Consensus 70 lr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~ 149 (159) |-..+ .+-+++.+.++..-..+. ....++++..+..+++.++.. .| +|||.|++ T Consensus 1 Ma~~~---~g~~~l~~~l~~~~~~~~---~~~~~~~~~~a~~i~~~ak~~-------------------aP-vdTG~Lr~ 54 (137) T protein:vir:97 1 MAKVK---YGNWDLVKELENYERDME---RWVKRGIAKTTAKIHNTIISL-------------------MP-VDTGYLRE 54 (137) T ss_pred CchhH---HhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh-------------------CC-ccccchhc Confidence 33222 112223333333222221 123344444444444433331 22 69999999 Q ss_pred hhhhhhcccC Q lcl|NC_015294. 150 SVKFQIHRRQ 159 (159) Q Consensus 150 SIty~V~~k~ 159 (159) ||++++.... T Consensus 55 SI~~~~~~~~ 64 (137) T protein:vir:97 55 SVTMDFKDSG 64 (137) T ss_pred cceeEeecCc Confidence 9999887654 No 117 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=91.27 E-value=0.00035 Score=39.55 Aligned_cols=64 Identities=16% Similarity=0.161 Sum_probs=30.0 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_015294. 
70 MAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 149 (159) Q Consensus 70 lr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~ 149 (159) |-..+ .+-+++.+.++..-..+. ....++++..+..+++.++.. .| +|||.|++ T Consensus 1 Ma~~~---~g~~~l~~~l~~~~~~~~---~~~~~~~~~~a~~i~~~ak~~-------------------aP-vdTG~Lr~ 54 (137) T protein:vir:94 1 MAKVK---YGNWDLVKELENYERDME---RWVKRGIAKTTAKIHNTIISL-------------------MP-VDTGYLRE 54 (137) T ss_pred CchhH---HhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh-------------------CC-ccccchhc Confidence 33222 112223333333222221 123344444444444433331 22 69999999 Q ss_pred hhhhhhcccC Q lcl|NC_015294. 150 SVKFQIHRRQ 159 (159) Q Consensus 150 SIty~V~~k~ 159 (159) ||++++.... T Consensus 55 SI~~~~~~~~ 64 (137) T protein:vir:94 55 SVTMDFKDSG 64 (137) T ss_pred cceeEeecCc Confidence 9999887654 No 118 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=91.27 E-value=0.00035 Score=39.55 Aligned_cols=64 Identities=16% Similarity=0.161 Sum_probs=30.0 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_015294. 70 MAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 149 (159) Q Consensus 70 lr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~ 149 (159) |-..+ .+-+++.+.++..-..+. ....++++..+..+++.++.. .| +|||.|++ T Consensus 1 Ma~~~---~g~~~l~~~l~~~~~~~~---~~~~~~~~~~a~~i~~~ak~~-------------------aP-vdTG~Lr~ 54 (137) T protein:vir:93 1 MAKVK---YGNWDLVKELENYERDME---RWVKRGIAKTTAKIHNTIISL-------------------MP-VDTGYLRE 54 (137) T ss_pred CchhH---HhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh-------------------CC-ccccchhc Confidence 33222 112223333333222221 123344444444444433331 22 69999999 Q ss_pred hhhhhhcccC Q lcl|NC_015294. 
150 SVKFQIHRRQ 159 (159) Q Consensus 150 SIty~V~~k~ 159 (159) ||++++.... T Consensus 55 SI~~~~~~~~ 64 (137) T protein:vir:93 55 SVTMDFKDSG 64 (137) T ss_pred cceeEeecCc Confidence 9999887654 No 119 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=88.16 E-value=0.0015 Score=36.10 Aligned_cols=89 Identities=12% Similarity=0.137 Sum_probs=47.1 Q ss_pred CcccCCceeeehHHHHHHHHHH-------HHHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhHH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKR-------VEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAPT 73 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~-------l~~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~~ 73 (159) ..-. ..+..-+..+...+... +.....-.+.|||.-. +.+|.+-|+||.++|+.+|+..+ T Consensus 44 tp~~-~~~~~~~~~~~~HlaD~I~~~~~~idg~~~g~~~VG~~~~------------~~~Ahf~n~GT~~~~~~hFie~t 110 (139) T protein:vir:10 44 TKEK-HPNTKGDGGKYGHLSEDISSAAGDIDGDHNGSSTVGFHNK------------AHIARFLNDGTKNIRADHFVDNA 110 (139) T ss_pred cccc-cccCCCCCCCCCcccccceecCccccccccccceeCCCCC------------ceeeeeeccCccccCCCchHHHH Confidence 1100 00000000000011111 1111234577888411 45788999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHH Q lcl|NC_015294. 74 FEEFTSQFHYARLMKSTFENVLRDGRQTNTL 104 (159) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~ 104 (159) ..+. 
+.++.+.+.+.++.++.....-+.- T Consensus 111 ~~e~--~~ev~~a~~~~~ke~l~~~~~~~~~ 139 (139) T protein:vir:10 111 RDDA--KDAVFAAEAEKYQAMIAKANGGDSK 139 (139) T ss_pred HHHH--HHHHHHHHHHHHHHHHhhcCCCCCC Confidence 9875 3467777777777776543211111 No 120 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=87.57 E-value=0.0016 Score=35.96 Aligned_cols=43 Identities=16% Similarity=0.129 Sum_probs=25.0 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhhhcccC Q lcl|NC_015294. 101 TNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQIHRRQ 159 (159) Q Consensus 101 ~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty~V~~k~ 159 (159) ++.++...-..+...++..+....| +|||.|++||++++.... T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aP----------------v~TG~Lr~SI~~~~~~~~ 43 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMP----------------VDTGYLRESVTMDFKDGG 43 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCC----------------cCcccccccceEEeecCc Confidence 3333333333333444444433222 699999999999987765 No 121 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=87.57 E-value=0.0016 Score=35.96 Aligned_cols=43 Identities=16% Similarity=0.129 Sum_probs=25.0 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhhhcccC Q lcl|NC_015294. 101 TNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQIHRRQ 159 (159) Q Consensus 101 ~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty~V~~k~ 159 (159) ++.++...-..+...++..+....| +|||.|++||++++.... 
T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aP----------------v~TG~Lr~SI~~~~~~~~ 43 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMP----------------VDTGYLRESVTMDFKDGG 43 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCC----------------cCcccccccceEEeecCc Confidence 3333333333333444444433222 699999999999987765 No 122 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=87.42 E-value=0.006 Score=32.77 Aligned_cols=82 Identities=11% Similarity=0.185 Sum_probs=55.8 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhc-----C--CCCCcHHHHHhc-----C- Q lcl|NC_015294. 70 MAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDD-----Y--PGSNSPAWAAYK-----G- 136 (159) Q Consensus 70 lr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~-----~--~~Pna~~Ti~~K-----G- 136 (159) |...+. .+.+.+..++..+- ..+....+..||..+....+..|.+ | |+|+++.|..++ | T Consensus 1 m~~~~~------~l~~~l~~ll~~l~--~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~ 72 (155) T protein:vir:79 1 MTDDLQ------ALERWAGGLLAKLS--PAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGR 72 (155) T ss_pred CchHHH------HHHHHHHHHHHhcC--ChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCc Confidence 333222 23344444444431 1244679999999999999999964 3 568998886543 3 Q ss_pred -CCCCchhHHHHHhhhhhhhcccC Q lcl|NC_015294. 137 -FNDPLFHTGKMLESVKFQIHRRQ 159 (159) Q Consensus 137 -~~~PLiDTG~L~~SIty~V~~k~ 159 (159) ...+|.+++.+..+|+|.+.... 
T Consensus 73 ~~~~~m~~~l~~a~~l~~~~~~d~ 96 (155) T protein:vir:79 73 VKREAMFRKLRTARYLRIDVDSTG 96 (155) T ss_pred ccchhhhhhhhhhheeeeeecCcE Confidence 24678999999999999998777 No 123 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=87.42 E-value=0.0017 Score=35.80 Aligned_cols=43 Identities=19% Similarity=0.196 Sum_probs=25.0 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhhhcccC Q lcl|NC_015294. 101 TNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQIHRRQ 159 (159) Q Consensus 101 ~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty~V~~k~ 159 (159) +++++...-..+...|+...... .| +|||.|++||++.+.... T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~---------------ap-v~TG~Lr~SI~~~~~~~~ 43 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISL---------------MP-VDTGYLRESVTMDFKDGG 43 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhh---------------CC-ccccccccceeEEeecCc Confidence 23333333333334444444332 34 699999999999987766 No 124 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=86.96 E-value=0.0031 Score=34.35 Aligned_cols=95 Identities=18% Similarity=0.296 Sum_probs=47.9 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHhhCC-------------------------EEEEEecccc-----cC---C--CCC Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEALDGT-------------------------TVEVGFFPED-----RY---G--SEN 45 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l~~~-------------------------~v~VGi~~~~-----~~---~--~~~ 45 (159) |..+.++. .++|+++.+.|+.+.+. -|.-|-+... 
.+ + ..+ T Consensus 1 M~~~~~~d----~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~g 76 (141) T protein:vir:79 1 MARWGSVD----FREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQG 76 (141) T ss_pred CCCCccCc----HHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccceeecC Confidence 55555543 34555555555433221 1112222111 00 0 011 Q ss_pred CC-----CCHHHHHHHHhcCCCCCCCCchhhHHH------HHHHHHHHHHHHHHHHHHHHHhCCCCH Q lcl|NC_015294. 46 GN-----LPVAQVAAYNEFGTTRNPTRPFMAPTF------EEFTSQFHYARLMKSTFENVLRDGRQT 101 (159) Q Consensus 46 ~G-----~~~A~iA~~~E~G~~~IP~RpFlr~~~------~~~~~~~~~~~~~~~~~~~~~~g~~~~ 101 (159) ++ .+++.+|-+-|||+...|+|||....+ ++. +..+.+.+++.+..++.+-.++ T Consensus 77 ~~~~v~v~n~~~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~--~~~~~~~~~~~l~~~l~~~~~~ 141 (141) T protein:vir:79 77 NNYIIEVVNPTEYASYVNFGHRTKDGKGWVKGQHFLTISEMEL--QSQVDKIIEKKLLILLKGVFDA 141 (141) T ss_pred CeeEEEEecCCcchhhhhcceeecCCcceeCCchhHHHHHHHH--HHHHHHHHHHHHHHHHHHhhcC Confidence 22 245789999999998888888766544 332 2335555566665555554444 No 125 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=86.62 E-value=0.0031 Score=34.33 Aligned_cols=64 Identities=16% Similarity=0.167 Sum_probs=29.6 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_015294. 70 MAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 149 (159) Q Consensus 70 lr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~ 149 (159) |-..+ .+-+++.+.+++.-..+ .....++++..+..+++.++.. 
.| +|||.|++ T Consensus 1 Ma~~~---~G~~~l~~~l~~~~~~~---~~~~~~~~~~~a~~v~~~ak~~-------------------aP-v~TG~L~~ 54 (137) T protein:vir:95 1 MAKVK---YGNWDLVKELENYERDM---ERWVKRGIAKTTAKIHNTIISL-------------------MP-VDTGYLRE 54 (137) T ss_pred CchhH---HhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHh-------------------CC-ccchhhhc Confidence 32221 11122222232222221 1123344444444444444322 23 59999999 Q ss_pred hhhhhhcccC Q lcl|NC_015294. 150 SVKFQIHRRQ 159 (159) Q Consensus 150 SIty~V~~k~ 159 (159) ||++++.... T Consensus 55 Si~~~~~~~~ 64 (137) T protein:vir:95 55 SVTMDFKDGG 64 (137) T ss_pred CeeeEeeCCc Confidence 9999887665 No 126 >protein:vir:98860 Length: 230 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654736;genbank:gi:109302921;genbank:GeneID:4156065 Probab=86.57 E-value=0.0054 Score=33.03 Aligned_cols=82 Identities=13% Similarity=0.036 Sum_probs=34.1 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCC------------------- Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGT------------------- 61 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~------------------- 61 (159) +-=-+.-.-| -..+|.+++.-.+.-++....++|+.+. .+.||++|.||- T Consensus 61 w~pRKr~k~K-Ml~~L~k~l~~~~~~~~~~~v~~~~~~~----------~~rIA~vHq~G~~~~~~~~~~~~r~~~~~~~ 129 (230) T protein:vir:98 61 WKPRKNGNAK-MLRRIAKTLKFTSADREIKRVCTISRNA----------QRRSQKEHQRGAKITNLKSVILRKSRAGTAK 129 (230) T ss_pred ChhhhhhhHH-HHhhhHHHHHHhhcccccceeeeecccc----------hhhhhhhhhccchhhhhhhhhhhhhcCCCCc Confidence 1100000000 0113444444443333344555665443 256999999991 Q ss_pred ---------------------------------------------------------------------CCCCCCchhhH Q lcl|NC_015294. 62 ---------------------------------------------------------------------TRNPTRPFMAP 72 (159) Q Consensus 62 ---------------------------------------------------------------------~~IP~RpFlr~ 72 (159) +.+|+||||-. 
T Consensus 130 ~paTr~QAk~Lr~lGy~v~~g~~~~~~k~~kkps~kwI~~nls~~qAgliIR~L~~k~~k~~~~~t~W~I~~PaR~FLG~ 209 (230) T protein:vir:98 130 DPATMRQAKKLRDLGYTVPNGTTKSGKKRYRRPSAREIVATLSRAKASLLIRYFQEKEERQGKRLTKWIIPTEKRPFLDE 209 (230) T ss_pred ccccHHHHHHHHHcCCccCCCCCCcCCCCCCCCCHHHHHHhhhHHHHHHHHHHHhccccccccCccceeeecCcccccCC Confidence 24677777754 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_015294. 73 TFEEFTSQFHYARLMKSTFENVLRDGRQ 100 (159) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 100 (159) .-+ ++.+++...+.++-.-. . T Consensus 210 ~~~------e~~~~l~~~l~~i~~~~-~ 230 (230) T protein:vir:98 210 RDK------ENAEILKEFILKFSGIE-K 230 (230) T ss_pred ChH------HHHHHHHHHHHHhcccc-C Confidence 322 23333444433321111 1 No 127 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=86.39 E-value=0.0029 Score=34.50 Aligned_cols=61 Identities=11% Similarity=0.055 Sum_probs=30.0 Q ss_pred hhH---HHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHH Q lcl|NC_015294. 70 MAP---TFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGK 146 (159) Q Consensus 70 lr~---~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~ 146 (159) |-. ++++ +.+.+++.-..+ +...+++|...+..+++.++.. .| +|||. T Consensus 1 Ma~~~~Gl~~------l~~~l~~~~~~~---~~~~~~al~~~a~~v~~~ak~~-------------------ap-vdTG~ 51 (135) T protein:vir:96 1 MAKVKYGADS------IVVDLEKYSKDM---EKWVKKGITKTTLKIYNTAIHL-------------------MP-VDTGF 51 (135) T ss_pred CchhhhhHHH------HHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHh-------------------CC-ccchh Confidence 211 2222 222233322222 1123445555555444444332 12 79999 Q ss_pred HHhhhhhhhcccC Q lcl|NC_015294. 147 MLESVKFQIHRRQ 159 (159) Q Consensus 147 L~~SIty~V~~k~ 159 (159) |++||+++|.... 
T Consensus 52 Lr~SI~~~~~~~g 64 (135) T protein:vir:96 52 LRQSTTVDFENGG 64 (135) T ss_pred hhcceeEEeecCc Confidence 9999999887655 No 128 >protein:vir:3750 Length: 227 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043491;genbank:gi:9628626;genbank:GeneID:1261131 Probab=85.89 E-value=0.0057 Score=32.88 Aligned_cols=80 Identities=13% Similarity=0.145 Sum_probs=30.8 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCC------------------- Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGT------------------- 61 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~------------------- 61 (159) ..=-+.-.-| -..+|.++++. + .......|||..+. ++.||++|.||- T Consensus 59 ~~pRKr~k~K-M~~kL~k~l~~-~-~~~~~a~v~f~~g~----------~~~IA~vHq~G~~~~v~~~~~~~~~~~~~~~ 125 (227) T protein:vir:37 59 WKKRKNGTAK-MLRRIAKLANS-K-AEKAQGTLFYKQKR----------TGEIAQEHQEGIPHLFKKTEFTGKNKGGIGA 125 (227) T ss_pred CchhcchhHH-HHhhhHHHcce-e-ecccceEEEecCcc----------hHHHHHHhhcCcccccchhhhhhhhcCCccc Confidence 1000000000 00123333322 2 23445668885432 477999999991 Q ss_pred ----------------------------------------------------------------------CCCCCCchhh Q lcl|NC_015294. 62 ----------------------------------------------------------------------TRNPTRPFMA 71 (159) Q Consensus 62 ----------------------------------------------------------------------~~IP~RpFlr 71 (159) +.+|+||||- T Consensus 126 ~paTr~QAk~Lr~lGy~v~~~k~k~~k~~~rkps~kwI~~nls~~qAgliIR~L~~k~~~~~~~~k~~W~I~~PaR~FLG 205 (227) T protein:vir:37 126 DPCTLRQAKKLKDLGYTVANGKTKNGKAKRRKPTLSEIRSTLSRAKASLIIRKLEEKNGMNPSRHLTQWIIPTEKRSFLD 205 (227) T ss_pred cCCCHHHHHHHHHhcccccCCCCCCcCCccccCCHHHHHHhhhHHHHHHHHHHHhcccccccccCccceeeecCcccccC Confidence 1244444443 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_015294. 72 PTFEEFTSQFHYARLMKSTFENVLRDGR 99 (159) Q Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 99 (159) .+-+ ++.+.+...+.++..-.. 
T Consensus 206 ~~~~------e~~~~l~r~l~~~~~~~~ 227 (227) T protein:vir:37 206 TREE------ENAKIILAEIQKYTQKQQ 227 (227) T ss_pred CCHH------HHHHHHHHHHHHHhhhcC Confidence 2211 122223333332221111 No 129 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=85.63 E-value=0.0036 Score=33.99 Aligned_cols=64 Identities=16% Similarity=0.158 Sum_probs=30.8 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_015294. 70 MAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 149 (159) Q Consensus 70 lr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~ 149 (159) |-... .+-+.+.+.+++....+. ....++|+..+..+++.+|.. .| +|||.|++ T Consensus 1 Ma~~~---~G~~~l~~~L~~~~~~~~---~~~~~al~~~a~~v~~~ak~~-------------------aP-vdTG~Lr~ 54 (137) T protein:vir:94 1 MAKVK---YGNWDLVKELENYERDIE---RWVKRGIAKTTVKIHNTIISL-------------------MP-VDTGYLRE 54 (137) T ss_pred CchhH---HhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh-------------------CC-cCcchhhc Confidence 21110 111222333333333221 123444555454444444432 23 59999999 Q ss_pred hhhhhhcccC Q lcl|NC_015294. 150 SVKFQIHRRQ 159 (159) Q Consensus 150 SIty~V~~k~ 159 (159) ||++++.... T Consensus 55 SI~~~~~~~~ 64 (137) T protein:vir:94 55 SVTMDFKDGG 64 (137) T ss_pred CceeEeecCc Confidence 9999887665 No 130 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=85.27 E-value=0.0027 Score=34.64 Aligned_cols=69 Identities=9% Similarity=0.070 Sum_probs=30.9 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHH Q lcl|NC_015294. 
69 FMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKML 148 (159) Q Consensus 69 Flr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~ 148 (159) -++-. ..+-+.+.+.++..-..+. ...+.++..+...++..|+...... -| +|||.|+ T Consensus 1 m~~v~---i~Gld~L~~kl~~~~~~~~---~~v~~a~~~~~~~~a~~v~~~ak~~---------------~P-vdtG~Lr 58 (182) T protein:vir:10 1 MIEVE---LKGVNELRAKLKKLPDIMA---KATANAQENAIEQAEAYAVDELQSS---------------IK-YSTGELT 58 (182) T ss_pred CeEEE---EecHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhh---------------CC-CCchhhh Confidence 11111 1111112222222211110 1123344444444444444444322 24 7999999 Q ss_pred hhhhhhhcccC Q lcl|NC_015294. 149 ESVKFQIHRRQ 159 (159) Q Consensus 149 ~SIty~V~~k~ 159 (159) +||+++|..+. T Consensus 59 ~SI~~~~~~~~ 69 (182) T protein:vir:10 59 RSFKHEVKVDG 69 (182) T ss_pred hceeeeeeecC Confidence 99999987766 No 131 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=84.90 E-value=0.00053 Score=38.56 Aligned_cols=85 Identities=19% Similarity=0.294 Sum_probs=37.4 Q ss_pred ccCCceeeehHHHHHHHH------------HHHHHhhCC--EEEEEeccccc-C-CCCCCC-------CCHHHHHHHHhc Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLI------------KRVEALDGT--TVEVGFFPEDR-Y-GSENGN-------LPVAQVAAYNEF 59 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~------------~~l~~l~~~--~v~VGi~~~~~-~-~~~~~G-------~~~A~iA~~~E~ 59 (159) |..+ +++.+...|++++ ..++...+. 
-|.-|=+..+= + ...++| .+++.+|.++|| T Consensus 1 ~~~~-~~~l~~~~l~~~~~~~~~~~~~~~a~~ve~~ak~~aPv~TG~Lr~SI~~~~~~~~g~~v~~~V~~~~~YA~~ve~ 79 (137) T protein:vir:10 1 MVAH-TLRIERAQLHGLGMDEARKAVNRVVRRTFTRSQILAPVDTGYLRASGRLVLGRERGAVVIGSVEYTARYAAAVHN 79 (137) T ss_pred Cccc-ccccChhhHhhHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhccceeeeeeccccEEEEEecCCcccceeeec Confidence 2222 3344444433322 112111111 01111111110 0 000111 246789999999 Q ss_pred CC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_015294. 60 GT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRD 97 (159) Q Consensus 60 G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g 97 (159) || +++|+||||++++++...++ ++.-.+ | T Consensus 80 GT~ph~I~pk~~kaL~f~~~G~~vf~k~V~hPG~k~~PfL~~Al~~~~~~~--------~~~~~~-~ 137 (137) T protein:vir:10 80 GRRALTIRAKGNGRLKFTVEGRTVYARSVHQPARAGRPYLSQALREVAPQE--------GFRVTI-G 137 (137) T ss_pred CCCCceeecCCCccceeecCCeeEeccceecCCCCCChhhHHHHHHhhccc--------ceeEee-C Confidence 95 24669999999998775432 222111 1 No 132 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=84.67 E-value=0.0041 Score=33.69 Aligned_cols=64 Identities=13% Similarity=0.122 Sum_probs=31.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_015294. 70 MAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 149 (159) Q Consensus 70 lr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~ 149 (159) |-... .+-+++.+.++..-..+. ...+++|...+..+++.+|.. 
.| +|||.|++ T Consensus 1 Ma~~~---~G~~~l~~~l~~~~~~~~---~~~~~~l~~~a~~~~~~ak~~-------------------~p-vdTG~L~~ 54 (137) T protein:vir:96 1 MAKVK---YGNWDLVAELEDYRDEME---EWVKKGILKTTLAIYNTAVAL-------------------AP-VDLGFLKE 54 (137) T ss_pred CchhH---hhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh-------------------CC-cCccchhc Confidence 22111 011222333333222221 123455555555555544432 23 69999999 Q ss_pred hhhhhhcccC Q lcl|NC_015294. 150 SVKFQIHRRQ 159 (159) Q Consensus 150 SIty~V~~k~ 159 (159) ||+++|.... T Consensus 55 Si~~~~~~~g 64 (137) T protein:vir:96 55 SIDFKVTDGG 64 (137) T ss_pred CceeEeecCc Confidence 9999887665 No 133 >protein:vir:78755 Length: 228 # NCBI annotation: putative tail completion protein # Family: family:all:743 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285651;genbank:gi:148727157;genbank:GeneID:5220102 Probab=83.54 E-value=0.015 Score=30.66 Aligned_cols=92 Identities=17% Similarity=0.152 Sum_probs=38.9 Q ss_pred CcccCCceeeehHHHHHHHHHHHH--HhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCC----------------- Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVE--ALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGT----------------- 61 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~--~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~----------------- 61 (159) ..--+. .....|.+|.+.|+ ......+.|||..+.. ...++.||++|.||- T Consensus 55 ~~pRKr----~krKMl~~L~k~Lk~~~~~~~~a~v~f~~~~~------~~~~~rIA~vHq~G~~~~v~~~~~~~~~~~r~ 124 (228) T protein:vir:78 55 WAPRKR----GKRKMLRGLPKLLQIREPRQDMAELGFTKGTM------SAHAGVIANTHQKGHTYKVTAASRRRIAPSDV 124 (228) T ss_pred Chhhhh----hHHHHHhhhHHhhhhhcccccceEEEeecCcc------cchHHHHHHHHhcCcccccccchhhhhhcccC Confidence 111110 01122223333333 2334578999964321 124788999999991 Q ss_pred ---------------------------------------------------------------CCCCCCchhhHHHHHHH Q lcl|NC_015294. 
62 ---------------------------------------------------------------TRNPTRPFMAPTFEEFT 78 (159) Q Consensus 62 ---------------------------------------------------------------~~IP~RpFlr~~~~~~~ 78 (159) +.+|+||||-..-+ T Consensus 125 ~~~~paTr~QAk~Lr~lGy~~~~~~~k~~rkps~kwI~~nls~gqAgliir~L~~k~~k~~W~I~~PaR~FLG~s~~--- 201 (228) T protein:vir:78 125 GKNKQASKAQARKLRELGFKRPGKRKRAYRSASLGWITANLNYAQAGLLIKKLKDEPVKESWEIQLPARPFLGANAR--- 201 (228) T ss_pred CCCCCCCHHHHHHHHHhhccccCCcCCCcccCCHHHHHHHhhHHHHHHHHHHHhCCCCccceeeecCcccccCCCHH--- Confidence 34666666632221 Q ss_pred HHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhc Q lcl|NC_015294. 79 SQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYK 135 (159) Q Consensus 79 ~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~K 135 (159) ++.+.+...+..+--|. ++ + +.-|+.| T Consensus 202 ---e~~~~l~~~l~~i~~g~-~~-------------------------~-~qd~~~~ 228 (228) T protein:vir:78 202 ---QRQQAFALRPESIDYGW-DV-------------------------N-KQDMKGK 228 (228) T ss_pred ---HHHHHHHHHHHhcccCC-Cc-------------------------c-hhhccCC Confidence 12333333333332221 11 1 1112222 No 134 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=83.35 E-value=0.019 Score=30.06 Aligned_cols=82 Identities=10% Similarity=0.022 Sum_probs=49.5 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhc-----C--CCCCcHHHHHhcCC----C Q lcl|NC_015294. 70 MAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDD-----Y--PGSNSPAWAAYKGF----N 138 (159) Q Consensus 70 lr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~-----~--~~Pna~~Ti~~KG~----~ 138 (159) |..-+. .+.+.+..++.++- . .+....|..||..+....+..|.+ | |+|+++.|++.|.. . 
T Consensus 1 m~~~~~------~l~~~L~~ll~~L~-~-~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~~~ 72 (156) T protein:vir:11 1 MADSLE------ALEDWAGPILRALE-P-GPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQGRIRRK 72 (156) T ss_pred CchhHH------HHHHHHHHHHHhcC-C-cchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhccccccc Confidence 433222 23344444444432 1 245779999999999999999964 3 56899999987632 2 Q ss_pred CCchhHHHHHhhhhhhhcccC Q lcl|NC_015294. 139 DPLFHTGKMLESVKFQIHRRQ 159 (159) Q Consensus 139 ~PLiDTG~L~~SIty~V~~k~ 159 (159) .++.....+..+|.+.+...+ T Consensus 73 ~~m~~~l~~~~~l~~~~~~~~ 93 (156) T protein:vir:11 73 IKMFQKLRTVRYLRAKGDAQA 93 (156) T ss_pred hhhhhhhhhhheeeeeecCcE Confidence 334333333444666665555 No 135 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=82.29 E-value=0.023 Score=29.60 Aligned_cols=82 Identities=11% Similarity=0.137 Sum_probs=47.3 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhc-----C--CCCCcHHHHHhcCCCCCch Q lcl|NC_015294. 70 MAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDD-----Y--PGSNSPAWAAYKGFNDPLF 142 (159) Q Consensus 70 lr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~-----~--~~Pna~~Ti~~KG~~~PLi 142 (159) |...+.+ +.+.+..++..+- ..+...+|..||..+....++.|.+ | |+|+++.+..+|+..+-.. T Consensus 1 M~~~~~~------~~~~L~~ll~~L~--~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~k~~~~~~~ 72 (152) T protein:vir:10 1 MSEPIEQ------VKTAFDSLLNNIS--KPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGVKSKIKSGK 72 (152) T ss_pred CchHHHH------HHHHHHHHHHhcC--cchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhhcccccchh Confidence 4333322 2233333333321 1244679999999999999999974 3 4588888776666444333 Q ss_pred hHHHHHhh--hhhhhcccC Q lcl|NC_015294. 143 HTGKMLES--VKFQIHRRQ 159 (159) Q Consensus 143 DTG~L~~S--Ity~V~~k~ 159 (159) ....|+.| ++|+..... 
T Consensus 73 m~~~L~~a~~l~~~a~~~~ 91 (152) T protein:vir:10 73 MFDKITQPRFMRLRLESEG 91 (152) T ss_pred HHHhhhhcceeeeeecCcE Confidence 34444443 455554444 No 136 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=81.68 E-value=0.0033 Score=34.22 Aligned_cols=68 Identities=9% Similarity=0.169 Sum_probs=29.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhh Q lcl|NC_015294. 75 EEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQ 154 (159) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty~ 154 (159) -+..+-+++.+.++++-..+ ......++..-|..++...|..-. ..++.| +|||.|++||++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~-------------~~~~~p-~~TG~Lr~sI~~~ 63 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR-------------EVMNKG-YWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc-------------ccCCCC-CCchhhhhcceee Confidence 11112122222222221111 112344555555544444443211 122334 7999999999987 Q ss_pred hcccC Q lcl|NC_015294. 155 IHRRQ 159 (159) Q Consensus 155 V~~k~ 159 (159) ....- T Consensus 64 ~~g~~ 68 (115) T protein:vir:96 64 KTGDL 68 (115) T ss_pred ecCce Confidence 44322 No 137 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=81.68 E-value=0.0033 Score=34.22 Aligned_cols=68 Identities=9% Similarity=0.169 Sum_probs=29.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhh Q lcl|NC_015294. 
75 EEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQ 154 (159) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty~ 154 (159) -+..+-+++.+.++++-..+ ......++..-|..++...|..-. ..++.| +|||.|++||++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~-------------~~~~~p-~~TG~Lr~sI~~~ 63 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR-------------EVMNKG-YWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc-------------ccCCCC-CCchhhhhcceee Confidence 11112122222222221111 112344555555544444443211 122334 7999999999987 Q ss_pred hcccC Q lcl|NC_015294. 155 IHRRQ 159 (159) Q Consensus 155 V~~k~ 159 (159) ....- T Consensus 64 ~~g~~ 68 (115) T protein:vir:93 64 KTGDL 68 (115) T ss_pred ecCce Confidence 44322 No 138 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=81.68 E-value=0.0033 Score=34.22 Aligned_cols=68 Identities=9% Similarity=0.169 Sum_probs=29.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhh Q lcl|NC_015294. 75 EEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQ 154 (159) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty~ 154 (159) -+..+-+++.+.++++-..+ ......++..-|..++...|..-. ..++.| +|||.|++||++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~-------------~~~~~p-~~TG~Lr~sI~~~ 63 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR-------------EVMNKG-YWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc-------------ccCCCC-CCchhhhhcceee Confidence 11112122222222221111 112344555555544444443211 122334 7999999999987 Q ss_pred hcccC Q lcl|NC_015294. 
155 IHRRQ 159 (159) Q Consensus 155 V~~k~ 159 (159) ....- T Consensus 64 ~~g~~ 68 (115) T protein:vir:96 64 KTGDL 68 (115) T ss_pred ecCce Confidence 44322 No 139 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=81.68 E-value=0.0033 Score=34.22 Aligned_cols=68 Identities=9% Similarity=0.169 Sum_probs=29.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhh Q lcl|NC_015294. 75 EEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQ 154 (159) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty~ 154 (159) -+..+-+++.+.++++-..+ ......++..-|..++...|..-. ..++.| +|||.|++||++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~-------------~~~~~p-~~TG~Lr~sI~~~ 63 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR-------------EVMNKG-YWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc-------------ccCCCC-CCchhhhhcceee Confidence 11112122222222221111 112344555555544444443211 122334 7999999999987 Q ss_pred hcccC Q lcl|NC_015294. 155 IHRRQ 159 (159) Q Consensus 155 V~~k~ 159 (159) ....- T Consensus 64 ~~g~~ 68 (115) T protein:vir:97 64 KTGDL 68 (115) T ss_pred ecCce Confidence 44322 No 140 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=81.68 E-value=0.0033 Score=34.22 Aligned_cols=68 Identities=9% Similarity=0.169 Sum_probs=29.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhh Q lcl|NC_015294. 
75 EEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQ 154 (159) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty~ 154 (159) -+..+-+++.+.++++-..+ ......++..-|..++...|..-. ..++.| +|||.|++||++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~-------------~~~~~p-~~TG~Lr~sI~~~ 63 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR-------------EVMNKG-YWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc-------------ccCCCC-CCchhhhhcceee Confidence 11112122222222221111 112344555555544444443211 122334 7999999999987 Q ss_pred hcccC Q lcl|NC_015294. 155 IHRRQ 159 (159) Q Consensus 155 V~~k~ 159 (159) ....- T Consensus 64 ~~g~~ 68 (115) T protein:vir:78 64 KTGDL 68 (115) T ss_pred ecCce Confidence 44322 No 141 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=81.68 E-value=0.0033 Score=34.22 Aligned_cols=68 Identities=9% Similarity=0.169 Sum_probs=29.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhh Q lcl|NC_015294. 75 EEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQ 154 (159) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty~ 154 (159) -+..+-+++.+.++++-..+ ......++..-|..++...|..-. ..++.| +|||.|++||++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~-------------~~~~~p-~~TG~Lr~sI~~~ 63 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR-------------EVMNKG-YWTGNLSRNIRYK 63 (115) T ss_pred CcchhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc-------------ccCCCC-CCchhhhhcceee Confidence 11112122222222221111 112344555555544444443211 122334 7999999999987 Q ss_pred hcccC Q lcl|NC_015294. 
155 IHRRQ 159 (159) Q Consensus 155 V~~k~ 159 (159) ....- T Consensus 64 ~~g~~ 68 (115) T protein:vir:10 64 KTGDL 68 (115) T ss_pred ecCce Confidence 44322 No 142 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=81.65 E-value=0.0047 Score=33.36 Aligned_cols=60 Identities=17% Similarity=0.226 Sum_probs=28.2 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhh Q lcl|NC_015294. 71 APTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLES 150 (159) Q Consensus 71 r~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~S 150 (159) -.++++. .+.+++....+ ....+.+|...+..++..+|. . .| +|||.|++| T Consensus 1 i~Gld~l------~~~l~~~~~~~---~~~v~~al~~~a~~i~~~ak~----~---------------aP-v~TG~Lr~s 51 (108) T protein:vir:99 1 MRGLDRF------LRSVERKQKSV---RIAVDKELSKSAARIERQAKI----L---------------AP-VDTGWLRAQ 51 (108) T ss_pred CchHHHH------HHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh----c---------------CC-cCchhhhcc Confidence 2344332 22222222221 112234454444444443332 1 22 799999999 Q ss_pred hhhhhcccC Q lcl|NC_015294. 151 VKFQIHRRQ 159 (159) Q Consensus 151 Ity~V~~k~ 159 (159) |++.+.+.- T Consensus 52 I~~~~~~~~ 60 (108) T protein:vir:99 52 IYSEQQRLL 60 (108) T ss_pred eeeeecCcE Confidence 988765432 No 143 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=80.48 E-value=0.014 Score=30.71 Aligned_cols=85 Identities=13% Similarity=0.216 Sum_probs=37.2 Q ss_pred ccCCceeeehHHHHHHHHH-----------------------HHHHhhCC---EEEEEecccccCCCCCCC-------CC Q lcl|NC_015294. 
3 ILASFSFKTDRRRLTSLIK-----------------------RVEALDGT---TVEVGFFPEDRYGSENGN-------LP 49 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~-----------------------~l~~l~~~---~v~VGi~~~~~~~~~~~G-------~~ 49 (159) |.+.+++..-.+.+.+-++ .|+....+ ...=||-... ..+| .+ T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~----~~~~~~~v~~~~~ 76 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQK----LKNGDQVIYQKAP 76 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccccceeeee----cCCeeEEEEEecC Confidence 5555554433222221111 11111100 0000111000 0011 01 Q ss_pred HHHHHHHHhcCCC-----CCCCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 50 VAQVAAYNEFGTT-----RNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 93 (159) Q Consensus 50 ~A~iA~~~E~G~~-----~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 93 (159) --.|+-+.|||.. .+|+|||++|+.+... ..+.+.++..+.+ T Consensus 77 ~y~l~HLLE~GHa~r~GGrV~a~phI~paee~~~--~~l~~~i~r~l~~ 123 (123) T protein:vir:96 77 TYRLTHLLENGHAKRNGGRVSPKVHIAPVEEELV--SNYISRVEKRLSQ 123 (123) T ss_pred CcceEEeeecceeecCCceeCcchhhhHHHHHHH--HHHHHHHHHHhcC Confidence 1236777899943 6999999999987643 2344444444444 No 144 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=79.94 E-value=0.0081 Score=32.06 Aligned_cols=69 Identities=13% Similarity=0.137 Sum_probs=31.7 Q ss_pred CCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhH Q lcl|NC_015294. 65 PTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHT 144 (159) Q Consensus 65 P~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDT 144 (159) =+|..++ ++-. ....+.+.+++.-..+. ..++++|...+..+++.++.. . 
| +|| T Consensus 1 m~~ms~~--i~~~-g~~~l~~~l~~~~~~~~---~~v~~~l~~~a~~i~~~ak~~----a---------------p-v~T 54 (144) T protein:vir:59 1 MALMSVR--IDPS-WRRIMSRNVRTFSGHVL---TQVEQVIIKTAEKIAGLAASL----A---------------P-VDE 54 (144) T ss_pred CCcceee--ehhH-HHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh----C---------------C-ccc Confidence 3333332 2111 11122222323222221 123445555555444444422 1 2 589 Q ss_pred HHHHhhhhhhhcccC Q lcl|NC_015294. 145 GKMLESVKFQIHRRQ 159 (159) Q Consensus 145 G~L~~SIty~V~~k~ 159 (159) |.|++||++++.... T Consensus 55 G~Lr~SI~~~~~~~g 69 (144) T protein:vir:59 55 GNLKNSIQIDYKNNG 69 (144) T ss_pred hhhhcCeeEEeecCc Confidence 999999999886554 No 145 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=79.66 E-value=0.011 Score=31.27 Aligned_cols=62 Identities=6% Similarity=0.017 Sum_probs=31.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhhhh Q lcl|NC_015294. 75 EEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVKFQ 154 (159) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~SIty~ 154 (159) -+.++-+++.+.+++.-+. ++.++...-..+...|+......-| +|||.|++||.+. T Consensus 1 i~i~Gld~L~~~L~~l~~~-------~~~~~~~a~~~~a~~i~~~ak~~aP----------------v~TG~Lr~sI~~~ 57 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKD-------IDKNINATTEEAANFIEDRAKTLAP----------------KNFGKLAQSISTS 57 (173) T ss_pred CcchhHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHhCC----------------cCchhhhhcceee Confidence 1223333333333332222 2233444444444444444433222 7999999999988 Q ss_pred hcccC Q lcl|NC_015294. 
155 IHRRQ 159 (159) Q Consensus 155 V~~k~ 159 (159) +.+++ T Consensus 58 ~~~~~ 62 (173) T protein:vir:10 58 DLKAK 62 (173) T ss_pred eeccC Confidence 77666 No 146 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=78.80 E-value=0.0046 Score=33.41 Aligned_cols=64 Identities=17% Similarity=0.280 Sum_probs=34.0 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_015294. 70 MAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 149 (159) Q Consensus 70 lr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~ 149 (159) |-... .+-+++.+.++..-.++. ..++.+|+..+..+++.+|... | +|||.|++ T Consensus 1 Ma~~~---~G~~~l~~~l~~~~~~~~---~~~~~al~~~a~~i~~~ak~~a-------------------P-v~TG~Lr~ 54 (137) T protein:vir:10 1 MAKVK---YGNWDLVKELEEFEKETI---RWAKKGIAKTTTIIHNSIVSNM-------------------P-VDTGYLRE 54 (137) T ss_pred Cccch---hCHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhC-------------------C-cCcchhhc Confidence 22210 011223333333333221 1345666666666666655432 2 59999999 Q ss_pred hhhhhhcccC Q lcl|NC_015294. 150 SVKFQIHRRQ 159 (159) Q Consensus 150 SIty~V~~k~ 159 (159) ||++++.... T Consensus 55 SI~~~~~~~~ 64 (137) T protein:vir:10 55 SVSMDFKKGG 64 (137) T ss_pred CeeeEecCCc Confidence 9999887665 No 147 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=76.77 E-value=0.0066 Score=32.55 Aligned_cols=64 Identities=17% Similarity=0.243 Sum_probs=32.6 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHh Q lcl|NC_015294. 
70 MAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLE 149 (159) Q Consensus 70 lr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~ 149 (159) |-..+ .+-+++.+.++..-.++ ...+..+|+..+..+++.+|... | +|||.|++ T Consensus 1 Ma~~~---~Gl~~l~~~l~~~~~~~---~~~~~~al~~~a~~i~~~ak~~a-------------------P-vdTG~Lr~ 54 (137) T protein:vir:10 1 MAKVK---YGNWELVKELEDFEKET---IRWAKKGIAKTTTIIHNSIVSNM-------------------P-VDTGYLRE 54 (137) T ss_pred CchhH---hhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhC-------------------C-cCcchhhc Confidence 22110 01112222333322222 11345666666666666655532 2 59999999 Q ss_pred hhhhhhcccC Q lcl|NC_015294. 150 SVKFQIHRRQ 159 (159) Q Consensus 150 SIty~V~~k~ 159 (159) ||++++.... T Consensus 55 SI~~~~~~~~ 64 (137) T protein:vir:10 55 SVSMDFKKGG 64 (137) T ss_pred CeeEEeeCCc Confidence 9998876554 No 148 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=75.91 E-value=0.0028 Score=34.54 Aligned_cols=87 Identities=17% Similarity=0.149 Sum_probs=40.1 Q ss_pred Cccc-CCceeeehHHHHHHHH------------HHHHHhhCC--EEEEEecccccCC-CCCCC--------CCHHHHHHH Q lcl|NC_015294. 1 MIIL-ASFSFKTDRRRLTSLI------------KRVEALDGT--TVEVGFFPEDRYG-SENGN--------LPVAQVAAY 56 (159) Q Consensus 1 m~m~-~~~~~k~~~~~l~~l~------------~~l~~l~~~--~v~VGi~~~~~~~-~~~~G--------~~~A~iA~~ 56 (159) |... ..+++..+.+.+++.. ..++...+. -|.-|-+...=.. 
..++| .+.+.+|.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~ 80 (140) T protein:vir:97 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAP 80 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhh Confidence 4433 3345555544443321 111111111 1333333222110 01111 245889999 Q ss_pred HhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_015294. 57 NEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQ 100 (159) Q Consensus 57 ~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 100 (159) +|||| ++.+|||||++++++..... .-|.. + T Consensus 81 Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~-------~~i~~------~ 140 (140) T protein:vir:97 81 VHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTND-------PRVRM------T 140 (140) T ss_pred hccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhh-------hhccC------C Confidence 99995 24669999999998742211 11111 1 No 149 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=75.91 E-value=0.0028 Score=34.54 Aligned_cols=87 Identities=17% Similarity=0.149 Sum_probs=40.1 Q ss_pred Cccc-CCceeeehHHHHHHHH------------HHHHHhhCC--EEEEEecccccCC-CCCCC--------CCHHHHHHH Q lcl|NC_015294. 1 MIIL-ASFSFKTDRRRLTSLI------------KRVEALDGT--TVEVGFFPEDRYG-SENGN--------LPVAQVAAY 56 (159) Q Consensus 1 m~m~-~~~~~k~~~~~l~~l~------------~~l~~l~~~--~v~VGi~~~~~~~-~~~~G--------~~~A~iA~~ 56 (159) |... ..+++..+.+.+++.. ..++...+. -|.-|-+...=.. 
..++| .+.+.+|.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~ 80 (140) T protein:vir:10 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAP 80 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhh Confidence 4433 3345555544443321 111111111 1333333222110 01111 245889999 Q ss_pred HhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_015294. 57 NEFGT-----------------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQ 100 (159) Q Consensus 57 ~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 100 (159) +|||| ++.+|||||++++++..... .-|.. + T Consensus 81 Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~-------~~i~~------~ 140 (140) T protein:vir:10 81 VHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTND-------PRVRM------T 140 (140) T ss_pred hccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhh-------hhccC------C Confidence 99995 24669999999998742211 11111 1 No 150 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=71.48 E-value=0.013 Score=30.88 Aligned_cols=63 Identities=10% Similarity=0.182 Sum_probs=27.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHH-HHHHhhcCCCCCcHHHHHhcCCCCCchhHHHHHhhhh Q lcl|NC_015294. 74 FEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQ-MQVNIDDYPGSNSPAWAAYKGFNDPLFHTGKMLESVK 152 (159) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~-i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~L~~SIt 152 (159) +++.+.+. -+......+. ....+++..++...... |+..... 
..| +|||.|++||+ T Consensus 1 ~~~~~f~~----~~~~~~~~~~---k~~~~~~~~~a~~~~~~~ie~~ak~---------------~~p-vdtG~L~~SI~ 57 (141) T protein:vir:78 1 MNEFEFDS----NIPKARKLIE---KKVLQALEDIGEHMTTELAEGGHGV---------------TSN-NDTGEYAQKSG 57 (141) T ss_pred CcchhHHH----HHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhhhh---------------ccc-cccchhhccee Confidence 33222221 1122211111 12233344443332221 1111111 123 79999999999 Q ss_pred hhhcccC Q lcl|NC_015294. 153 FQIHRRQ 159 (159) Q Consensus 153 y~V~~k~ 159 (159) |+|.... T Consensus 58 ~~v~~~g 64 (141) T protein:vir:78 58 YKVRKSS 64 (141) T ss_pred eeeecCC Confidence 9986555 No 151 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=71.27 E-value=0.056 Score=27.44 Aligned_cols=91 Identities=15% Similarity=0.173 Sum_probs=49.6 Q ss_pred ceeeehHHHHHHHHHHHHHhhCC--------------------------------------------------------- Q lcl|NC_015294. 7 FSFKTDRRRLTSLIKRVEALDGT--------------------------------------------------------- 29 (159) Q Consensus 7 ~~~k~~~~~l~~l~~~l~~l~~~--------------------------------------------------------- 29 (159) |+...|.+.|+++.++|+.+... T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~t 80 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQG 80 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcccccccc Confidence 34445555666655555433110 Q ss_pred -EEEEEecccccCCCCCCC-----CCHHHHHHHHhcCCC-----CCCCCchhhHHHHHHHHHHHHHHHHHHH----HHHH Q lcl|NC_015294. 30 -TVEVGFFPEDRYGSENGN-----LPVAQVAAYNEFGTT-----RNPTRPFMAPTFEEFTSQFHYARLMKST----FENV 94 (159) Q Consensus 30 -~v~VGi~~~~~~~~~~~G-----~~~A~iA~~~E~G~~-----~IP~RpFlr~~~~~~~~~~~~~~~~~~~----~~~~ 94 (159) .++=||-.+..+ ...++ .+.+.+|-+-|||.. -+|-+.+|+.+.++... .+.+.+++. 
+..+ T Consensus 81 G~lr~swk~~~~~-k~~~~~~v~v~N~~~YA~~VE~GHR~~~gGfV~G~fml~~s~~~~~~--~~~~~~e~~l~~~l~k~ 157 (163) T protein:vir:10 81 GTLQKGWSKSRIE-VSGRTYKQKVYNKVYYAPHVEYGHKTVNGGFVPGQFFLHKTVEDTKS--DMEKRVRDKYDGFMRKV 157 (163) T ss_pred chhhccceeccee-ecCCceEEEEEecCCccchhhcceeecCCceeccchhhHHHHHHHHH--HHHHHHHHHHHHHHHHh Confidence 111122222111 11222 256788999999964 48999999999877653 355555544 4555 Q ss_pred HhCCCC Q lcl|NC_015294. 95 LRDGRQ 100 (159) Q Consensus 95 ~~g~~~ 100 (159) +.|... T Consensus 158 ~~~~~~ 163 (163) T protein:vir:10 158 VLGNGK 163 (163) T ss_pred hcCCCC Confidence 556544 No 152 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=70.20 E-value=0.0085 Score=31.95 Aligned_cols=89 Identities=13% Similarity=0.165 Sum_probs=43.6 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHhhCCEEEEEe---cccccCC-----CCCCCCC--------HHHHHHHHhcCCC-- Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGF---FPEDRYG-----SENGNLP--------VAQVAAYNEFGTT-- 62 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l~~~~v~VGi---~~~~~~~-----~~~~G~~--------~A~iA~~~E~G~~-- 62 (159) |.+.- |. +.-+++..++..+.+...+..|.+=+ ..+..-+ -.++|++ .+++|-+.|||+. T Consensus 14 ~s~~d-vk-~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g~~~~vgp~g~t~dYapyvEyGTR~m 91 (127) T protein:vir:98 14 MSEKR-WD-RVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNSSKDVITGNFGYIKDYAPHVEYGHRIV 91 (127) T ss_pred hhHHH-HH-HHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEecCCceEEeccCcccccccceeecceeee Confidence 21111 10 01123334444444443322221001 1111111 1123432 4789999999986 Q ss_pred -------CCCCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 63 -------RNPTRPFMAPTFEEFTSQFHYARLMKSTFEN 93 (159) Q Consensus 63 -------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 93 (159) -.|+-|||.|+|+..+ ..+-+-++.+++. 
T Consensus 92 ~~~~~~gf~~aqp~l~paf~~Qk--~iF~~DL~~l~k~ 127 (127) T protein:vir:98 92 RNGKQVGYANGTKYLFNNVKKQR--EIYRQDMLNELRR 127 (127) T ss_pred ecccccccccCccccccchHHHh--HHHHHHHHHHhcC Confidence 3789999999998764 3455555555555 No 153 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=65.50 E-value=0.063 Score=27.15 Aligned_cols=78 Identities=17% Similarity=0.283 Sum_probs=38.6 Q ss_pred CCceeeehHHHHHHH------------------------HHHHHH-h----------------------hCCEEEEEecc Q lcl|NC_015294. 5 ASFSFKTDRRRLTSL------------------------IKRVEA-L----------------------DGTTVEVGFFP 37 (159) Q Consensus 5 ~~~~~k~~~~~l~~l------------------------~~~l~~-l----------------------~~~~v~VGi~~ 37 (159) ..|.++..+.-+..| .+.++. + ..+.|+|||.. T Consensus 1 MsvevkGv~eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWIKGKRTVTIRWRG 80 (134) T ss_pred CeEEeecHHHHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeecCCceEEEEEEEc Confidence 112233322211111 112221 0 11467777744 Q ss_pred cc-cCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhH--------HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 38 ED-RYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAP--------TFEEFTSQFHYARLMKSTFENV 94 (159) Q Consensus 38 ~~-~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~--------~~~~~~~~~~~~~~~~~~~~~~ 94 (159) +. 
.| -|--+||||..+-...+|++| +++..+ ..+.+.++..++++ T Consensus 81 ~~~R~----------~ivHLnE~Gyt~~r~Gk~i~PrG~G~i~~a~~~~e--~~~~~~ik~eL~kl 134 (134) T protein:vir:10 81 PFERF----------RIVHLIENGHVEKKSGKFVKPKAMGGINRAIRQGQ--NKYFETLKRELKKL 134 (134) T ss_pred CCcee----------eEEEeeecceeecCCCCeeccchhhHHHHHHHhhh--HHHHHHHHHHHhcC Confidence 32 22 255678999887788888888 555442 33455555555554 No 154 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=62.00 E-value=0.062 Score=27.22 Aligned_cols=88 Identities=15% Similarity=0.153 Sum_probs=51.7 Q ss_pred CcccCCceeee--hHHHHHHHHHH--------HHHhhCCEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCCCCCC--- Q lcl|NC_015294. 1 MIILASFSFKT--DRRRLTSLIKR--------VEALDGTTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTRNPTR--- 67 (159) Q Consensus 1 m~m~~~~~~k~--~~~~l~~l~~~--------l~~l~~~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~IP~R--- 67 (159) +....+ ..|. .......+... +.....-.+.|||-.. ..+.||.+.+.||...|+. T Consensus 57 ~~~~~~-~~k~~~~~~~~~HlaD~I~~~~~~~iDg~~dG~s~VGw~~~----------~~a~~a~f~NdGT~~m~~k~~~ 125 (159) T protein:vir:38 57 RSAGHA-NAKHHNRNRKTKHLQDSITYKPGYTADKLHTGDTDVGFEGK----------YYDFLAKIVNNGQHHMSPKRYK 125 (159) T ss_pred cccccc-cccccCcCcCCCccccceeeecCccccccccceeeecccCC----------ccceEeeecccCccccCCCCcc Confidence 110000 0000 00000111111 2222344788999632 2368999999999999997 Q ss_pred --chhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCH Q lcl|NC_015294. 68 --PFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQT 101 (159) Q Consensus 68 --pFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 101 (159) +|+..+..+. 
+.++.+.+...+..++...-+- T Consensus 126 gdHFvekt~~~~--k~~Vl~A~~~~~~~il~~~~~~ 159 (159) T protein:vir:38 126 NMHFLDKAQQEA--KKSVAEAELKAYKEVMNHDSDK 159 (159) T ss_pred CChhHHHHHHHH--HHHHHHHHHHHHHHHhhcccCC Confidence 6999988764 4567788888888888876554 No 155 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=60.80 E-value=0.11 Score=25.78 Aligned_cols=66 Identities=9% Similarity=0.148 Sum_probs=30.3 Q ss_pred hhH-HHHHHHHHHHHHHHHHHHHHHHHhCC--CCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchhHHH Q lcl|NC_015294. 70 MAP-TFEEFTSQFHYARLMKSTFENVLRDG--RQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFHTGK 146 (159) Q Consensus 70 lr~-~~~~~~~~~~~~~~~~~~~~~~~~g~--~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiDTG~ 146 (159) |-. .++ .+ .+.++++++-...-.+. ...+.+|+.+|..+...||.. . | +|||. T Consensus 1 Ms~~~id-~~---gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~----t---------------P-VdTG~ 56 (144) T protein:vir:10 1 MSLGHVD-DA---QFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEAN----T---------------P-VKQGN 56 (144) T ss_pred CCCCCcc-HH---HHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHh----C---------------C-CCcch Confidence 332 122 11 12233333222221221 123455555555555554432 2 2 79999 Q ss_pred HHhhhhhhhcccC Q lcl|NC_015294. 
147 MLESVKFQIHRRQ 159 (159) Q Consensus 147 L~~SIty~V~~k~ 159 (159) |++|++..-..++ T Consensus 57 Lr~S~~~~~~~~~ 69 (144) T protein:vir:10 57 LRRSWTAEGPTYG 69 (144) T ss_pred hccceeecceeee Confidence 9999987644443 No 156 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=60.55 E-value=0.02 Score=29.94 Aligned_cols=97 Identities=15% Similarity=0.110 Sum_probs=57.6 Q ss_pred CcccCCceeeehHHHHHHHH------------HHHHHhhCCEEEEEecccccC-CC----------CCC----------- Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLI------------KRVEALDGTTVEVGFFPEDRY-GS----------ENG----------- 46 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~------------~~l~~l~~~~v~VGi~~~~~~-~~----------~~~----------- 46 (159) |...+--.|+.+ ++.++. +.|+.-++....|+++.-.+- |. +.. T Consensus 1 ma~~~~~~vrV~--Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aa 78 (143) T protein:vir:62 1 MAQRSAYTIRVD--GLREFQRNVRTLRDKELNKAVREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTA 78 (143) T ss_pred CCcccchheehH--HHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccCcchhhccccccc Confidence 888777777654 223333 333333334445555432210 00 001 Q ss_pred ---------CC-CHHHHHHHHhcCCC--CCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHH Q lcl|NC_015294. 47 ---------NL-PVAQVAAYNEFGTT--RNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKK 107 (159) Q Consensus 47 ---------G~-~~A~iA~~~E~G~~--~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~ 107 (159) |- .-.-+|.+-+||+. +|-++-||..++...+ +.|.+.++..|.++++-. |+. 
T Consensus 79 T~raa~VrAG~~krVPYA~~I~~G~r~r~Isp~rFl~~a~a~te--~~~~r~Ye~~i~~vl~k~------l~s 143 (143) T protein:vir:62 79 SAKGAVIKAGSASRVPYAAAIHFGYRARNISPNRFLFRAMARKS--DVVAATYERRIAAVVEKY------LES 143 (143) T ss_pred cccceeeeeCCcCCCCcccccccCcccccccchhhhhhhhhccC--HHHHHHHHHHHHHHHHHH------hcC Confidence 11 12246778899997 8889999999998775 568899999998877543 222 No 157 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=57.02 E-value=0.046 Score=27.94 Aligned_cols=61 Identities=21% Similarity=0.274 Sum_probs=30.1 Q ss_pred cCCceeeehHHHHHHHHHHHHHhhC----------------------CEEEEEecccccCCC-CCCC--------CCHHH Q lcl|NC_015294. 4 LASFSFKTDRRRLTSLIKRVEALDG----------------------TTVEVGFFPEDRYGS-ENGN--------LPVAQ 52 (159) Q Consensus 4 ~~~~~~k~~~~~l~~l~~~l~~l~~----------------------~~v~VGi~~~~~~~~-~~~G--------~~~A~ 52 (159) ++.++++ -++|++|.+.|+.... .-|.-|+....-.-. .++| -+.+. T Consensus 1 Ma~~~i~--~~Gld~L~~~L~~~~~~~~v~~vv~~~~~~l~~~ak~~ap~dTG~lrrSI~~~~~~~g~~~~v~~~gp~a~ 78 (92) T protein:vir:99 1 MADYSIS--WDGLDALDEALANQQNMNTVKKVVKKHTANLMTATQQAVPVDTGHLKQSAQIQISRDGFTGSVTYGGGLVN 78 (92) T ss_pred CCceeeE--eehHHHHHHHHHhhccHHHHHHHHHHHHHHHHHHHHHhCCCCccccceeeeEEeecCCeeEEEEeccCccc Confidence 2234444 2455556555543211 012222222111100 1223 24588 Q ss_pred HHHHHhcCCCCCCC Q lcl|NC_015294. 
53 VAAYNEFGTTRNPT 66 (159) Q Consensus 53 iA~~~E~G~~~IP~ 66 (159) +|-+.||||...++ T Consensus 79 Ya~YvE~GTR~M~A 92 (92) T protein:vir:99 79 YAAYVEFGTRFMDS 92 (92) T ss_pred cccccccceeecCC Confidence 99999999999988 No 158 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=54.20 E-value=0.031 Score=28.87 Aligned_cols=97 Identities=14% Similarity=0.117 Sum_probs=56.4 Q ss_pred CcccCCceeeehHHHHHHHHH------------HHHHhhCCEEEEEecccccC-C------CC----CC----------- Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIK------------RVEALDGTTVEVGFFPEDRY-G------SE----NG----------- 46 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~------------~l~~l~~~~v~VGi~~~~~~-~------~~----~~----------- 46 (159) |...+--.|+.+ ++.++.. .|+.-++....|+++.-.+- | .. -. T Consensus 1 ma~~~~~~vkV~--Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~L~~Sir~aa 78 (143) T protein:vir:13 1 MAQRSAYTIQVD--GLRQFQRNVRALRDKELNKAVREANKASGEVLIPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTA 78 (143) T ss_pred CCcccchheehH--HHHHHHHHHHHhhCCcchHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccchhhccccccc Confidence 887777777654 2333333 33333333344554432210 0 00 00 Q ss_pred ---------C-CCHHHHHHHHhcCCC--CCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHH Q lcl|NC_015294. 47 ---------N-LPVAQVAAYNEFGTT--RNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKK 107 (159) Q Consensus 47 ---------G-~~~A~iA~~~E~G~~--~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~ 107 (159) | -.-.-+|.+-+||+. +|-++-||+.++...+ +.|.+.++..|.++++-. |+. 
T Consensus 79 T~raa~VrAGr~arVPYA~~I~~G~r~r~Is~~rFl~~a~a~te--~~~~r~Ye~~i~~vl~k~------l~s 143 (143) T protein:vir:13 79 SAKGAVIKAGSAARVPYAAAIHFGYRKRNISANRFLYRAMARKS--DVVAATYERRIAAVVEKY------LES 143 (143) T ss_pred cccceeeeecCcCCCCcccccccCCcccccchhhhhhhhhhccC--HHHHHHHHHHHHHHHHHH------hcC Confidence 1 000135678899997 8889999999998775 568899999998877543 222 No 159 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=54.11 E-value=0.061 Score=27.25 Aligned_cols=87 Identities=14% Similarity=0.310 Sum_probs=40.0 Q ss_pred cCCceeeehHHHHHHHH------------HHHHHhhC---CEE----------EEE-----ecccccCCCCCCCC----- Q lcl|NC_015294. 4 LASFSFKTDRRRLTSLI------------KRVEALDG---TTV----------EVG-----FFPEDRYGSENGNL----- 48 (159) Q Consensus 4 ~~~~~~k~~~~~l~~l~------------~~l~~l~~---~~v----------~VG-----i~~~~~~~~~~~G~----- 48 (159) |+.|++..--+.+.+-+ +.++...+ ..+ +-| |-....+ ++. T Consensus 1 M~~i~id~La~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~----~~~~v~nk 76 (127) T protein:vir:80 1 MANIKIDRLGDEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTP----GGWVIHNK 76 (127) T ss_pred CccccHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeecc----CceeEeec Confidence 22244432211111111 11100000 011 111 2111110 110 Q ss_pred CHHHHHHHHhcCCC-----CCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_015294. 49 PVAQVAAYNEFGTT-----RNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQ 100 (159) Q Consensus 49 ~~A~iA~~~E~G~~-----~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 100 (159) +.-+|+-+.|||.- ..++|||++|+.+.. 
.+.+.+.+...+.|+.- T Consensus 77 ~~yqLtHLLE~GHAkr~GGRV~a~pHI~paee~~------~~~l~~~i~~~l~~~~~ 127 (127) T protein:vir:80 77 TEYRLAHLLEYGHATVDGGRVPETPHIRPVEDWL------EKEFEDRVERAIKNESR 127 (127) T ss_pred CCcceeehhhcceeccCCcccCCccchhhHHHHH------HHHHHHHHHHHhcCCCC Confidence 12357889999963 799999999998643 34456677777776644 No 160 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=53.37 E-value=0.32 Score=23.31 Aligned_cols=84 Identities=13% Similarity=0.220 Sum_probs=46.0 Q ss_pred CcccCCceeeehHHHHHHHHHHH-----HHh------------------------------------------hCCEEEE Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRV-----EAL------------------------------------------DGTTVEV 33 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l-----~~l------------------------------------------~~~~v~V 33 (159) ..|.++-.+|....-+..|.++| +.+ ..++|+| T Consensus 5 ~~~~~~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~~~~G~r~V~i 84 (138) T protein:vir:98 5 VSMSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKL 84 (138) T ss_pred ecccccccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCeeecCCceEEEE Confidence 22333333443322222222111 000 1235666 Q ss_pred EecccccCCCCCCCCCHHHHHHHHhcCCC-CCCCCc--hhhHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_015294. 34 GFFPEDRYGSENGNLPVAQVAAYNEFGTT-RNPTRP--FMAPTFEEFTSQFHYARLMKSTFENVLRD 97 (159) Q Consensus 34 Gi~~~~~~~~~~~G~~~A~iA~~~E~G~~-~IP~Rp--Flr~~~~~~~~~~~~~~~~~~~~~~~~~g 97 (159) ||... +|+ |=-.||||.. 
.|-||- +++.+++..+ ..+.+.++..+++.+.| T Consensus 85 gW~Gp-R~~----------ivHLNE~GyGk~i~PrG~G~I~ka~~~se--~~y~~~vk~el~k~l~~ 138 (138) T protein:vir:98 85 GFTTP-RWN----------IVHLQELEYGWKHNRRGVGVIRRYSDILE--TIYPRGIRDKLKRGFDG 138 (138) T ss_pred eeecC-eee----------EEeeecccccCCcCCCcchHHHHHHHhhh--HHHHHHHHHHHHHHhcC Confidence 66432 332 5567899976 455665 6888887654 45778899999999998 No 161 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=49.22 E-value=0.064 Score=27.12 Aligned_cols=60 Identities=12% Similarity=0.048 Sum_probs=22.8 Q ss_pred CCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhcCCCCCchh Q lcl|NC_015294. 64 NPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYKGFNDPLFH 143 (159) Q Consensus 64 IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~KG~~~PLiD 143 (159) ++..- .-..+...+.+++|..+...++..-.. +...-.-..-+| T Consensus 1 ~~~~~---------------------------~~~~~~~~~~~~~~~v~r~~l~~~a~~---------v~~~Ak~~aPv~ 44 (137) T protein:vir:10 1 MTVTA---------------------------RYERNPVGEARQFQVIARRRLSRITRG---------TANQARADVPVK 44 (137) T ss_pred CeeEE---------------------------EeccCchhHHHHHHHHHHHHHHHHHHH---------HHHHHHhcCCcc Confidence 11111 111122222223333332222221110 000000022369 Q ss_pred HHHHHhhhhhhhcccC Q lcl|NC_015294. 144 TGKMLESVKFQIHRRQ 159 (159) Q Consensus 144 TG~L~~SIty~V~~k~ 159 (159) ||.|++||.+.+.... 
T Consensus 45 tG~Lr~SI~~~~~~~~ 60 (137) T protein:vir:10 45 TGNLGRSIREDPIVVA 60 (137) T ss_pred chhhhcCceeeeeecc Confidence 9999999998765444 No 162 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=47.47 E-value=0.42 Score=22.65 Aligned_cols=82 Identities=13% Similarity=0.213 Sum_probs=44.5 Q ss_pred ccCCceeeehHHHHHHHHHHH-----HHhh------------------------------------------CCEEEEEe Q lcl|NC_015294. 3 ILASFSFKTDRRRLTSLIKRV-----EALD------------------------------------------GTTVEVGF 35 (159) Q Consensus 3 m~~~~~~k~~~~~l~~l~~~l-----~~l~------------------------------------------~~~v~VGi 35 (159) |.++-.+|..+.-|..|.++| ..+. -+.|+||| T Consensus 1 ~~~~aevkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s~~~~~~G~r~V~VgW 80 (132) T protein:vir:96 1 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGF 80 (132) T ss_pred CCccccccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeecCeeecCCceEEEecc Confidence 333334443332222222211 1111 12455555 Q ss_pred cccccCCCCCCCCCHHHHHHHHhcCCC-CCCCCc--hhhHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_015294. 36 FPEDRYGSENGNLPVAQVAAYNEFGTT-RNPTRP--FMAPTFEEFTSQFHYARLMKSTFENVLRD 97 (159) Q Consensus 36 ~~~~~~~~~~~G~~~A~iA~~~E~G~~-~IP~Rp--Flr~~~~~~~~~~~~~~~~~~~~~~~~~g 97 (159) .. ..|+ |=-+||||.. 
.|-||- +++.+++..+ ..+.+.++..+++.+.| T Consensus 81 ~G-pR~~----------ivHLNE~GyGk~~~PrG~G~I~~a~~~se--~~~~~~~~~elkk~l~~ 132 (132) T protein:vir:96 81 TT-PRWN----------IVHLQELEYGWKHNRRGVGVIRRYSDILE--TIYPRGIRDKLKRGFDG 132 (132) T ss_pred cC-Ccee----------EEeeecccccCCcCCCcchHHHHHHHhhh--hHHHHHHHHHHHHHhcC Confidence 32 1222 4457899985 455555 6888887654 45678888999999988 No 163 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=42.60 E-value=0.062 Score=27.19 Aligned_cols=84 Identities=17% Similarity=0.304 Sum_probs=36.9 Q ss_pred cCCceeeehHHHHHHHHH------------HHH--------HhhC-----CEEEEE-----ecccccCCCCCCCC----- Q lcl|NC_015294. 4 LASFSFKTDRRRLTSLIK------------RVE--------ALDG-----TTVEVG-----FFPEDRYGSENGNL----- 48 (159) Q Consensus 4 ~~~~~~k~~~~~l~~l~~------------~l~--------~l~~-----~~v~VG-----i~~~~~~~~~~~G~----- 48 (159) |+.|++..--+.+.+-++ .++ .|.+ --.+-| |-.... .+|. T Consensus 1 M~~i~id~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~----~e~~~V~nk 76 (124) T protein:vir:95 1 MAKIKIGRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRV----PNGWVIHNK 76 (124) T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeee----cCceeEEEc Confidence 333444332222221111 110 0110 001112 111111 0111 Q ss_pred CHHHHHHHHhcCCC-----CCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_015294. 49 PVAQVAAYNEFGTT-----RNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRD 97 (159) Q Consensus 49 ~~A~iA~~~E~G~~-----~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g 97 (159) .--+|+-+.|||.- ..++|||++|+.+... +.+.+.+...+.. 
T Consensus 77 ~~yqLtHLLE~GHAkr~GGRV~a~pHI~paee~~~------~~l~~~i~~~l~~ 124 (124) T protein:vir:95 77 TEYRLAHLLEYGHATVDGGRVPGTPHIRPIEDWLE------KEFEDRVEKAIKQ 124 (124) T ss_pred CCCceeeeeecceeccCCcccCCccchhHHHHHHH------HHHHHHHHHHhcC Confidence 11347888899953 7999999999986542 3344555555554 No 164 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=36.03 E-value=1.2 Score=20.16 Aligned_cols=90 Identities=9% Similarity=0.107 Sum_probs=42.5 Q ss_pred ccCC----ceeeehHHHH--------HHHH--------HHHHH---------hhCCEEEEEeccccc-CCCCCCCC---- Q lcl|NC_015294. 3 ILAS----FSFKTDRRRL--------TSLI--------KRVEA---------LDGTTVEVGFFPEDR-YGSENGNL---- 48 (159) Q Consensus 3 m~~~----~~~k~~~~~l--------~~l~--------~~l~~---------l~~~~v~VGi~~~~~-~~~~~~G~---- 48 (159) |-++ +++..+-+.| +.+. .++.. -.+..|.+|-+.... .+.+.+|. T Consensus 1 ~~~~m~~~~sF~~~i~~~~~~ve~~~~~v~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~G~~t~~ 80 (145) T protein:vir:10 1 MARNIGSVVTFEKSIADWIDRAEDGFGIVVSNTVIKTANAIVDLSPVDTGRFKANWQISANSPAQQSLNEYDQTGGQTKT 80 (145) T ss_pred CCCcccchhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeecccccccccccCCCCccchh Confidence 3344 3333222221 1111 11111 123455555443221 11112232 Q ss_pred ---------------------CHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHH Q lcl|NC_015294. 49 ---------------------PVAQVAAYNEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLL 105 (159) Q Consensus 49 ---------------------~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l 105 (159) +++-+|.-.|||+.+-+|+.|.|.++.+ |.+++++.+.++-. 
++ T Consensus 81 ~~~~~~~~i~~~k~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~~~~~------~~~~v~~~~~e~k~-------~~ 145 (145) T protein:vir:10 81 YLARQARAVANSKATSVIYITNRLDYAADLEYGASNQAPAGVLGVVQAR------LGRYFQEAVEEARR-------AI 145 (145) T ss_pred hHHHHHHHhhcccccceEEEeeCchhhhHhhccccCCCcchHHHHHHHH------HHHHHHHHHHHhhc-------cC Confidence 3345777889999999999999998854 44444444443311 11 No 165 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=35.16 E-value=1 Score=20.49 Aligned_cols=75 Identities=15% Similarity=0.136 Sum_probs=37.6 Q ss_pred CcccCC----------------cee----eehHHH---HHHHHHHHHHhhC-CEEEEEecccccCCCCCCCCCHHHHHHH Q lcl|NC_015294. 1 MIILAS----------------FSF----KTDRRR---LTSLIKRVEALDG-TTVEVGFFPEDRYGSENGNLPVAQVAAY 56 (159) Q Consensus 1 m~m~~~----------------~~~----k~~~~~---l~~l~~~l~~l~~-~~v~VGi~~~~~~~~~~~G~~~A~iA~~ 56 (159) |+..+- +.. ..|..+ +..+...+..+.. ..+.+ .+++-+|.- T Consensus 33 iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t~~~~~~~i~~~~~g~~iyi--------------~Nn~pYA~~ 98 (131) T protein:vir:94 33 IIKASPVDTGRFRMNWMASGSTPADGTTDATDKSGNTATGNATSFVLNAADWHTFTL--------------TNNLPYAQR 98 (131) T ss_pred HHHhCCCchhhhhccchhccccccccccCCCCCCchhhHHHHHHHHhhccccceEEE--------------eeCchhhhh Confidence 111111 110 111111 1111111111111 11111 123568889 Q ss_pred HhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 
57 NEFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVL 95 (159) Q Consensus 57 ~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~ 95 (159) .|||+.+-+|+.|.|.++.+ |.+++++.+..+- T Consensus 99 LEyG~S~QAP~g~v~~~~~~------~~~~v~~~~~e~k 131 (131) T protein:vir:94 99 LEYGWSQQAPQGFVRVNVSR------FQQLLNEEASKVK 131 (131) T ss_pred hhccccCCCcchHHHHHHHH------HHHHHHHHHHhcC Confidence 99999999999999988843 6677777777663 No 166 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=32.58 E-value=0.65 Score=21.60 Aligned_cols=78 Identities=19% Similarity=0.322 Sum_probs=35.1 Q ss_pred CCceeeehHHHHHHH------------------------HHHHHH-----------------------hhCCEEEEEecc Q lcl|NC_015294. 5 ASFSFKTDRRRLTSL------------------------IKRVEA-----------------------LDGTTVEVGFFP 37 (159) Q Consensus 5 ~~~~~k~~~~~l~~l------------------------~~~l~~-----------------------l~~~~v~VGi~~ 37 (159) ..|.++..+.-+..| ++.++. -..+.|+|||.. T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRG 80 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEc Confidence 112223222111111 111210 112358888854 Q ss_pred cc-cCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhH--------HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 38 ED-RYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAP--------TFEEFTSQFHYARLMKSTFENV 94 (159) Q Consensus 38 ~~-~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~--------~~~~~~~~~~~~~~~~~~~~~~ 94 (159) +. 
.| -|--+||||...--..+|++| +++..+ ..+.+.++..++++ T Consensus 81 ~~~R~----------~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e--~~~~~~ik~eL~kl 134 (134) T protein:vir:10 81 SKDRY----------KIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQ--NKYFETLKRELKKL 134 (134) T ss_pred CCcee----------EEEEeecccceecccCCccCcchhhHHHHHHHhhh--HHHHHHHHHHHhcC Confidence 32 22 266688999654434455555 565443 33455555555554 No 167 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=32.58 E-value=0.65 Score=21.60 Aligned_cols=78 Identities=19% Similarity=0.322 Sum_probs=35.1 Q ss_pred CCceeeehHHHHHHH------------------------HHHHHH-----------------------hhCCEEEEEecc Q lcl|NC_015294. 5 ASFSFKTDRRRLTSL------------------------IKRVEA-----------------------LDGTTVEVGFFP 37 (159) Q Consensus 5 ~~~~~k~~~~~l~~l------------------------~~~l~~-----------------------l~~~~v~VGi~~ 37 (159) ..|.++..+.-+..| ++.++. -..+.|+|||.. T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRG 80 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEc Confidence 112223222111111 111210 112358888854 Q ss_pred cc-cCCCCCCCCCHHHHHHHHhcCCCCCCCCchhhH--------HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 38 ED-RYGSENGNLPVAQVAAYNEFGTTRNPTRPFMAP--------TFEEFTSQFHYARLMKSTFENV 94 (159) Q Consensus 38 ~~-~~~~~~~G~~~A~iA~~~E~G~~~IP~RpFlr~--------~~~~~~~~~~~~~~~~~~~~~~ 94 (159) +. 
.| -|--+||||...--..+|++| +++..+ ..+.+.++..++++ T Consensus 81 ~~~R~----------~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e--~~~~~~ik~eL~kl 134 (134) T protein:vir:95 81 SKDRY----------KIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQ--NKYFETLKRELKKL 134 (134) T ss_pred CCcee----------EEEEeecccceecccCCccCcchhhHHHHHHHhhh--HHHHHHHHHHHhcC Confidence 32 22 266688999654434455555 565443 33455555555554 No 168 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=31.86 E-value=1.3 Score=20.01 Aligned_cols=76 Identities=13% Similarity=0.172 Sum_probs=38.8 Q ss_pred CcccCCceeeehHHHHHHHHHHHHHhhCCEEEEEeccccc-CCCCCCC----------------------CCHHHHHHHH Q lcl|NC_015294. 1 MIILASFSFKTDRRRLTSLIKRVEALDGTTVEVGFFPEDR-YGSENGN----------------------LPVAQVAAYN 57 (159) Q Consensus 1 m~m~~~~~~k~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~-~~~~~~G----------------------~~~A~iA~~~ 57 (159) |+..+-| |.- +++ .+..|.+|-+.... .+.+.+| .+++-+|.-. T Consensus 33 iv~~sPV----dTG-------r~R--anw~vs~~~~~~~~~~~~d~~g~~t~~~~~~~i~~~~~g~~iyi~Nn~pYA~~L 99 (131) T protein:vir:78 33 IIKASPV----DTG-------RFR--MNWMASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTFTLTNNLPYAQRL 99 (131) T ss_pred HHHhCCC----chh-------hhc--cccceecccccccccCCCCCCchhhHHHHHHHHhhccCCceEEEeeCchhhhHh Confidence 2222221 000 011 12223333222110 0000111 1245688889 Q ss_pred hcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015294. 
58 EFGTTRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVL 95 (159) Q Consensus 58 E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~ 95 (159) |||+.+-+|+.|.|.++.+ |.+++++.+..+- T Consensus 100 EyG~S~QAP~G~v~~~~~~------~~~~v~~~~~e~k 131 (131) T protein:vir:78 100 EYGWSQQAPQGFVRVNVSR------FQQLLNEEASKVK 131 (131) T ss_pred hccccCCCcchHHHHHHHH------HHHHHHHHHHhcC Confidence 9999999999999988843 6677777777663 No 169 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=30.68 E-value=0.94 Score=20.73 Aligned_cols=98 Identities=11% Similarity=0.127 Sum_probs=46.8 Q ss_pred CcccCCcee----------------eehHHHHHHHHH-------HHHHhhCCEEEEEecccccCCCCCCCCC-HHHHHHH Q lcl|NC_015294. 1 MIILASFSF----------------KTDRRRLTSLIK-------RVEALDGTTVEVGFFPEDRYGSENGNLP-VAQVAAY 56 (159) Q Consensus 1 m~m~~~~~~----------------k~~~~~l~~l~~-------~l~~l~~~~v~VGi~~~~~~~~~~~G~~-~A~iA~~ 56 (159) =|-..|..+ +- ......|.. .+....+-+..|||-.. .++|+. =|+||.+ T Consensus 29 kITkAGAkv~~~~L~~~t~~kHy~~k~-t~~~~HLaDsI~~~~~niDg~~dG~s~VGf~~k-----~~~~~~~kA~iAr~ 102 (168) T protein:vir:74 29 EVTKAGAKVFEQALAYEVRNRHYRHRD-TGEDPHLADSIVMKNKNIDGVKDGQSVVGWERS-----TEKGTHTKGYIANI 102 (168) T ss_pred HHHHhhhHHHHHHHHHHhHHhhcccCC-CcccchhhhheeecccccCcccCCceeeccccc-----ccccccchhhhhhh Confidence 000000000 00 000001111 11223455788999532 235544 5899999 Q ss_pred HhcCC------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHH Q lcl|NC_015294. 57 NEFGT------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLL 105 (159) Q Consensus 57 ~E~G~------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l 105 (159) .+-|+ +.||.=.|+..+-.+...+.++.+.....+.+++.....-.. 
| T Consensus 103 lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~~~~~k~~V~~Ae~~~y~eIl~~k~~~~~-~ 168 (168) T protein:vir:74 103 INNGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRMNLIVQQGILKAEAEAMRKIINRKKKENN-L 168 (168) T ss_pred hcccccccccccccccccccccccccccchhHHHHHhhhhhHHHHHHHHHHHHHHHHHhhcCCCC-C Confidence 99998 468999999888766444445544444444544432211111 1 No 170 >protein:vir:96288 Length: 100 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240315;genbank:gi:66396010;genbank:GeneID:5133365 Probab=29.98 E-value=0.21 Score=24.31 Aligned_cols=75 Identities=17% Similarity=0.176 Sum_probs=43.8 Q ss_pred HHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHhhcCCCCCcHHHHHhc-CCCCCchhHHHHHhhhhhhhcc Q lcl|NC_015294. 79 SQFHYARLMKSTFENVLRDGRQTNTLLKKLGKMVAEQMQVNIDDYPGSNSPAWAAYK-GFNDPLFHTGKMLESVKFQIHR 157 (159) Q Consensus 79 ~~~~~~~~~~~~~~~~~~g~~~~~~~l~~iG~~~~~~i~~~I~~~~~Pna~~Ti~~K-G~~~PLiDTG~L~~SIty~V~~ 157 (159) -+..+-.+-+-...++--|..+.-..++..-......+++-|.+.. .-|... -+..| ||||.|++||+..-.+ T Consensus 1 ~~~~~~~~~~~~makvkyG~~dmvk~~~~f~~~i~~~vk~~IakTa-----~~I~~~Avs~AP-VD~G~Lk~SI~~dyk~ 74 (100) T protein:vir:96 1 MKLNYYDLSRCHMAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTT-----TKIYNTAVALAP-VDLGFLEESIDFKYFD 74 (100) T ss_pred CcccccccchhhhhhheechHHHHHHHhcchHHHHHHHHHHHHHHH-----HHHHhhHHhhcc-ccccccceeeeeeeec Confidence 1112233444555666667777777777777777777777764311 000000 12244 9999999999877555 Q ss_pred cC Q lcl|NC_015294. 
158 RQ 159 (159) Q Consensus 158 k~ 159 (159) .- T Consensus 75 GG 76 (100) T protein:vir:96 75 GG 76 (100) T ss_pred CC Confidence 54 No 171 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=25.44 E-value=1.2 Score=20.25 Aligned_cols=78 Identities=17% Similarity=0.086 Sum_probs=36.5 Q ss_pred CcccCCceeeehHHHH----HHHHH--------HHHH---------hhCCEEEEEecccc-cCCCCCCCC---------- Q lcl|NC_015294. 1 MIILASFSFKTDRRRL----TSLIK--------RVEA---------LDGTTVEVGFFPED-RYGSENGNL---------- 48 (159) Q Consensus 1 m~m~~~~~~k~~~~~l----~~l~~--------~l~~---------l~~~~v~VGi~~~~-~~~~~~~G~---------- 48 (159) |.|.....+...-.++ +...+ .+.. -.+..|.+|-+... ....+.+|. T Consensus 2 ~~~sf~~~i~~~~~~ve~~~~~~~r~~~~~~~~~vv~~sPVdtGrfRanw~vs~~~p~~~~~~~~dp~g~~t~~~~~~~~ 81 (121) T protein:vir:94 2 ISMKFNVNLSRLRSNLREEAKKKAIRIAQEIVNGVIARSPVLAGDYRSSWNVSEGSMEFKFNNGGNPANPTPAPAIVVSS 81 (121) T ss_pred ccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCchhhhhccccccccCcccccCCCCCCCcchhHHHHHHHH Confidence 3333333332222222 11111 1111 12334444433311 111111221 Q ss_pred ----------CHHHHHHHHhcCCCCCCCCchhhHHHHHHH Q lcl|NC_015294. 
49 ----------PVAQVAAYNEFGTTRNPTRPFMAPTFEEFT 78 (159) Q Consensus 49 ----------~~A~iA~~~E~G~~~IP~RpFlr~~~~~~~ 78 (159) +++-+|.-.|||+..-+|+.|.|.++.+.+ T Consensus 82 ~~~~~~iyi~NnlpYA~~LE~G~S~QAP~G~v~~t~~~~q 121 (121) T protein:vir:94 82 NVALPHFYITNGAPYAQQLEKGSSTQAPLGIVRVTLASLR 121 (121) T ss_pred hhccceEEEeeCcchhhhhhcccCCCCcchHHHHHHHhhC Confidence 234567778999999999999999987653 No 172 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=25.40 E-value=0.72 Score=21.35 Aligned_cols=98 Identities=10% Similarity=0.107 Sum_probs=47.3 Q ss_pred CcccCCce----------------eeehHHHHHHHHH-------HHHHhhCCEEEEEecccccCCCCCCCCC-HHHHHHH Q lcl|NC_015294. 1 MIILASFS----------------FKTDRRRLTSLIK-------RVEALDGTTVEVGFFPEDRYGSENGNLP-VAQVAAY 56 (159) Q Consensus 1 m~m~~~~~----------------~k~~~~~l~~l~~-------~l~~l~~~~v~VGi~~~~~~~~~~~G~~-~A~iA~~ 56 (159) =|-..|.. -+- ......|.. .+....+-+..|||-.. .++|+. -|+||.| T Consensus 29 kITkAGAkv~~~~L~~~tk~kHy~~k~-t~~~~HLaDsI~~~~~niDg~~dG~s~VGf~~k-----~~~~~~~ka~iAr~ 102 (168) T protein:vir:10 29 EVTKAGAKVFEQALAYEVRNRHYRHRD-TGEDPHLADSIVMKNKNIDGVKDGQSVVGWERS-----TEKGTHTKGYIANI 102 (168) T ss_pred HHhHhhhHHHHHHHHHHhhHhhhccCC-CCccchhhhhheecccccccccCCceeecccCc-----cccccccchheeee Confidence 00000000 000 000001111 12223455788999632 235555 6999999 Q ss_pred HhcCC------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHH Q lcl|NC_015294. 57 NEFGT------------------TRNPTRPFMAPTFEEFTSQFHYARLMKSTFENVLRDGRQTNTLL 105 (159) Q Consensus 57 ~E~G~------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l 105 (159) .+-|+ +.||.=.|+..+-.+...+.++.+.....+.+++.....-.. 
| T Consensus 103 lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~d~a~k~~V~~Ae~~~y~eIl~~k~~~~~-~ 168 (168) T protein:vir:10 103 INNGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRKNPIVQQGILKAEAEAMRKIINRKKKESN-L 168 (168) T ss_pred ccccccccccccccccccccccccccccchhHHHhhhchhhhHHHHHHHHHHHHHHHHhhcCCCC-C Confidence 99998 368999999888766444445544444444555432211111 1 No 173 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=23.97 E-value=1.3 Score=19.93 Aligned_cols=87 Identities=13% Similarity=0.121 Sum_probs=45.5 Q ss_pred ccCCceeeehHHHHHH----HHHHHHHhhC------CEEEEEecccccCCCCCCCCCHHHHHHHHhcCCCC--------- Q lcl|NC_015294. 3 ILASFSFKTDRRRLTS----LIKRVEALDG------TTVEVGFFPEDRYGSENGNLPVAQVAAYNEFGTTR--------- 63 (159) Q Consensus 3 m~~~~~~k~~~~~l~~----l~~~l~~l~~------~~v~VGi~~~~~~~~~~~G~~~A~iA~~~E~G~~~--------- 63 (159) |..-+ .+-+.+ +++..++... ..++=+|--+..+...+.=.+.+.+|-+-|||... T Consensus 1 l~~~~-----~~~~~~~a~~l~~~vk~rTPv~~~d~G~LR~sW~~g~v~k~~~~v~N~~eYA~~VE~GHRq~~g~g~~~~ 75 (116) T protein:vir:10 1 MSKNL-----RRAKNNIGNKLLRKVKPKTPVAKIDGGTARKSWKYKELNLFDGVVSNNVEYIHHLEYGHRTRQGTGTSEN 75 (116) T ss_pred CchHH-----HHHHHHHHHHHHHHHHhhCCCCcCCCcccccCceeeeeeccCceeecCCcccccccCCceeeCCcceecc Confidence 11111 122233 3333333322 22333343333332222235678899999999642 Q ss_pred ----------CCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015294. 64 ----------NPTRPFMAPTFEEFTSQFHYARLMKSTFENVLR 96 (159) Q Consensus 64 ----------IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~ 96 (159) .|-+=+|+.+..+.+. .+.+.+++.+..++. T Consensus 76 ~~gkrlk~~~V~G~fml~~s~~e~~~--~~~~~~~~~~~~~l~ 116 (116) T protein:vir:10 76 YRPKPNGISFVPGVFMLARSVDEMSS--IIDDELNQIIIDFWN 116 (116) T ss_pred cccccccCCccCceehHHHHHHHHHH--HHHHHHHHHHHHhcC Confidence 3455577777776643 477777887777776 Done!