Query lcl|NC_020079.1_cdsid_YP_007348523.1 [gene=G377_gp111] [protein=hypothetical protein] [protein_id=YP_007348523.1] [location=70512..71000] Match_columns 162 No_of_seqs 112 out of 194 Neff 6.4 Searched_HMMs 1612 Date Thu Nov 7 17:54:43 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_155 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_155_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:107757 Length: 189 100.0 8.6E-56 5.3E-59 322.5 18.5 158 1-160 1-189 (189) 2 protein:vir:99546 Length: 200 100.0 3.9E-53 2.4E-56 307.9 16.7 151 1-156 5-200 (200) 3 protein:vir:96105 Length: 193 100.0 5.7E-52 3.5E-55 301.5 16.9 148 1-156 1-193 (193) 4 protein:vir:5257 Length: 148 # 100.0 5.9E-51 3.7E-54 296.0 16.6 144 1-156 1-148 (148) 5 protein:vir:80037 Length: 199 100.0 3.2E-48 2E-51 280.9 15.7 145 3-158 1-199 (199) 6 protein:vir:106728 Length: 155 100.0 7.7E-47 4.7E-50 273.4 15.3 138 5-157 1-155 (155) 7 protein:vir:78607 Length: 155 100.0 7.8E-47 4.8E-50 273.4 15.3 138 1-157 1-155 (155) 8 protein:vir:94069 Length: 168 100.0 3.4E-46 2.1E-49 269.9 15.7 147 1-162 1-163 (168) 9 protein:vir:77650 Length: 155 100.0 1.7E-45 1.1E-48 266.0 14.6 138 1-157 1-155 (155) 10 protein:vir:101563 Length: 155 100.0 1.7E-45 1E-48 266.1 14.6 138 1-157 1-155 (155) 11 protein:vir:95260 Length: 160 100.0 1.5E-44 9.2E-48 260.9 16.4 150 4-162 1-159 (160) 12 protein:vir:103841 Length: 155 98.7 4.2E-11 2.6E-14 77.5 5.0 93 67-162 1-97 (155) 13 protein:vir:79225 Length: 155 98.6 1.1E-10 6.9E-14 75.1 5.6 93 67-162 1-97 (155) 14 protein:vir:99196 Length: 155 98.5 2.3E-10 1.4E-13 73.5 5.8 93 67-162 1-97 (155) 15 protein:vir:79091 Length: 175 98.5 2.6E-10 1.6E-13 73.1 4.9 93 67-162 1-114 (175) 16 protein:vir:1988 Length: 156 # 98.5 2.3E-10 1.5E-13 73.4 4.4 92 67-162 1-101 (156) 17 protein:vir:99833 Length: 190 98.5 3E-10 1.9E-13 72.8 4.7 91 66-162 1-97 (190) 18 protein:vir:3163 Length: 145 # 98.4 5.8E-10 3.6E-13 71.2 5.6 81 71-162 1-95 (145) 19 protein:vir:97088 Length: 157 98.3 2.3E-09 1.4E-12 67.9 5.9 93 1-94 1-157 (157) 20 protein:vir:4347 Length: 164 # 98.2 3.7E-09 2.3E-12 66.8 5.7 101 1-109 1-164 (164) 21 protein:vir:107851 Length: 175 98.2 5.3E-09 3.3E-12 66.0 5.0 93 67-162 1-118 (175) 22 protein:vir:1891 Length: 179 # 98.1 4.1E-09 2.6E-12 66.5 3.0 101 1-109 1-179 (179) 23 protein:vir:93617 Length: 148 97.9 1E-08 6.3E-12 64.4 2.7 91 1-101 2-148 (148) 24 protein:vir:80362 Length: 140 97.9 1.8E-08 1.1E-11 63.0 4.0 92 1-94 19-140 (140) 25 protein:vir:3163 Length: 145 # 97.9 9.8E-09 6.1E-12 64.5 1.8 80 1-96 59-145 (145) 26 protein:vir:100243 Length: 140 97.9 3E-08 1.8E-11 61.9 4.1 91 1-94 43-140 (140) 27 protein:vir:1437 Length: 140 # 97.9 3.2E-08 2E-11 61.7 4.2 92 1-94 19-140 (140) 28 protein:vir:100075 Length: 140 97.9 2.7E-08 1.7E-11 62.1 3.6 88 1-94 43-140 (140) 29 protein:vir:1273 Length: 127 # 97.8 2.1E-08 1.3E-11 62.7 2.6 79 1-91 42-127 (127) 30 protein:vir:94538 Length: 125 97.8 4.8E-08 3E-11 60.7 4.3 90 1-93 1-125 (125) 31 protein:vir:95789 Length: 114 97.8 5.5E-08 3.4E-11 60.4 4.5 84 1-91 1-114 (114) 32 protein:vir:3617 Length: 112 # 97.8 3.2E-08 2E-11 61.7 3.2 82 1-87 1-112 (112) 33 protein:vir:194 Length: 149 # 97.8 1.8E-08 1.1E-11 63.1 1.7 91 1-101 2-149 (149) 34 protein:vir:1386 Length: 149 # 97.7 6.2E-08 3.9E-11 60.1 3.5 88 1-96 47-149 (149) 35 protein:vir:102085 Length: 146 97.7 1E-07 6.2E-11 59.0 4.3 86 1-99 45-146 (146) 36 protein:vir:107568 Length: 146 97.7 1E-07 6.2E-11 59.0 4.3 86 1-99 45-146 (146) 37 protein:vir:105007 Length: 146 97.7 1E-07 6.2E-11 59.0 4.3 86 1-99 45-146 (146) 38 protein:vir:102875 Length: 146 97.7 1E-07 6.2E-11 59.0 4.3 86 1-99 45-146 (146) 39 protein:vir:5745 Length: 135 # 97.6 2.5E-07 1.6E-10 56.8 5.5 90 1-104 1-135 (135) 40 protein:vir:3873 Length: 128 # 97.6 7.5E-08 4.7E-11 59.6 2.6 80 1-91 41-128 (128) 41 protein:vir:9708 Length: 125 # 97.6 1.2E-07 7.3E-11 58.6 3.6 81 1-92 38-125 (125) 42 protein:vir:98557 Length: 149 97.6 5.3E-07 3.3E-10 55.0 7.0 87 71-162 1-99 (149) 43 protein:vir:9930 Length: 108 # 97.6 1.4E-07 8.8E-11 58.1 3.6 77 10-88 1-108 (108) 44 protein:vir:9414 Length: 125 # 97.6 7.3E-08 4.5E-11 59.7 1.9 73 1-91 53-125 (125) 45 protein:vir:4704 Length: 125 # 97.6 7.3E-08 4.5E-11 59.7 1.9 73 1-91 53-125 (125) 46 protein:vir:81106 Length: 125 97.6 7.3E-08 4.5E-11 59.7 1.9 73 1-91 53-125 (125) 47 protein:vir:98342 Length: 125 97.6 7.3E-08 4.5E-11 59.7 1.9 73 1-91 53-125 (125) 48 protein:vir:79988 Length: 125 97.6 7.3E-08 4.5E-11 59.7 1.9 73 1-91 53-125 (125) 49 protein:vir:97144 Length: 115 97.5 1.7E-07 1.1E-10 57.7 3.7 85 1-87 20-115 (115) 50 protein:vir:96225 Length: 115 97.5 1.7E-07 1.1E-10 57.7 3.7 85 1-87 20-115 (115) 51 protein:vir:96358 Length: 115 97.5 1.7E-07 1.1E-10 57.7 3.7 85 1-87 20-115 (115) 52 protein:vir:9312 Length: 115 # 97.5 1.7E-07 1.1E-10 57.7 3.7 85 1-87 20-115 (115) 53 protein:vir:103917 Length: 115 97.5 1.7E-07 1.1E-10 57.7 3.7 85 1-87 20-115 (115) 54 protein:vir:78858 Length: 115 97.5 1.7E-07 1.1E-10 57.7 3.7 85 1-87 20-115 (115) 55 protein:vir:2026 Length: 150 # 97.5 7.6E-07 4.7E-10 54.1 7.1 87 72-162 1-100 (150) 56 protein:vir:105089 Length: 133 97.4 2.2E-07 1.4E-10 57.1 3.1 84 1-93 43-133 (133) 57 protein:vir:106623 Length: 115 97.4 4.1E-07 2.5E-10 55.6 4.4 85 1-87 16-115 (115) 58 protein:vir:2740 Length: 114 # 97.3 2.1E-07 1.3E-10 57.2 2.1 84 1-88 19-114 (114) 59 protein:vir:4906 Length: 114 # 97.3 2.1E-07 1.3E-10 57.2 2.1 84 1-88 19-114 (114) 60 protein:vir:98557 Length: 149 97.2 6.9E-07 4.3E-10 54.3 3.9 78 1-88 53-149 (149) 61 protein:vir:5978 Length: 144 # 97.2 5.5E-07 3.4E-10 54.9 3.3 84 1-87 4-144 (144) 62 protein:vir:99744 Length: 115 97.2 7.3E-07 4.5E-10 54.2 4.0 85 1-87 16-115 (115) 63 protein:vir:5703 Length: 150 # 97.2 3.1E-06 1.9E-09 50.8 7.4 87 72-162 1-100 (150) 64 protein:vir:6071 Length: 150 # 97.2 3.2E-06 2E-09 50.7 7.4 87 72-162 1-101 (150) 65 protein:vir:96486 Length: 112 97.2 6.3E-07 3.9E-10 54.6 3.0 84 1-86 16-112 (112) 66 protein:vir:1988 Length: 156 # 97.1 2.2E-06 1.4E-09 51.6 5.7 75 1-92 75-156 (156) 67 protein:vir:743 Length: 108 # 97.1 6.5E-07 4.1E-10 54.5 2.4 83 1-87 13-108 (108) 68 protein:vir:103841 Length: 155 97.0 8.7E-07 5.4E-10 53.8 2.8 77 1-93 71-155 (155) 69 protein:vir:1838 Length: 149 # 97.0 2.1E-06 1.3E-09 51.7 4.7 76 1-88 63-149 (149) 70 protein:vir:99833 Length: 190 97.0 2.3E-06 1.4E-09 51.5 4.9 77 1-94 71-190 (190) 71 protein:vir:79091 Length: 175 96.9 1.5E-06 9.5E-10 52.5 3.3 77 1-93 65-175 (175) 72 protein:vir:6071 Length: 150 # 96.9 2.4E-06 1.5E-09 51.4 4.1 79 1-88 53-150 (150) 73 protein:vir:2026 Length: 150 # 96.9 2.6E-06 1.6E-09 51.2 4.1 79 1-88 63-150 (150) 74 protein:vir:98409 Length: 108 96.9 9E-07 5.6E-10 53.7 1.6 82 1-87 13-108 (108) 75 protein:vir:5703 Length: 150 # 96.8 3.1E-06 1.9E-09 50.8 4.1 79 1-88 53-150 (150) 76 protein:vir:100312 Length: 152 96.8 2.4E-06 1.5E-09 51.3 3.3 79 1-89 54-152 (152) 77 protein:vir:107851 Length: 175 96.7 4E-06 2.5E-09 50.2 3.8 77 1-93 65-175 (175) 78 protein:vir:79179 Length: 155 96.5 7E-06 4.4E-09 48.8 4.1 78 1-88 54-155 (155) 79 protein:vir:105330 Length: 137 96.5 3.9E-06 2.4E-09 50.2 2.8 78 1-83 1-137 (137) 80 protein:vir:101594 Length: 173 96.4 1.7E-05 1.1E-08 46.7 5.9 93 1-93 16-173 (173) 81 protein:vir:79115 Length: 148 96.4 7.6E-06 4.7E-09 48.7 3.6 78 1-92 53-148 (148) 82 protein:vir:81067 Length: 119 96.4 6.9E-06 4.3E-09 48.9 3.3 84 1-94 5-119 (119) 83 protein:vir:10367 Length: 119 96.3 7.7E-06 4.8E-09 48.6 3.4 84 1-94 5-119 (119) 84 protein:vir:94654 Length: 142 96.3 2.8E-06 1.8E-09 51.0 0.8 84 1-86 4-142 (142) 85 protein:vir:106570 Length: 182 96.2 2.5E-05 1.5E-08 45.9 5.6 94 1-104 19-182 (182) 86 protein:vir:107099 Length: 137 96.2 7.9E-06 4.9E-09 48.6 2.7 78 1-83 1-137 (137) 87 protein:vir:1838 Length: 149 # 96.2 5.4E-05 3.3E-08 44.0 7.3 87 71-162 1-93 (149) 88 protein:vir:93738 Length: 137 96.1 1E-05 6.2E-09 48.0 2.7 78 1-83 1-137 (137) 89 protein:vir:97427 Length: 137 96.1 1E-05 6.2E-09 48.0 2.7 78 1-83 1-137 (137) 90 protein:vir:94490 Length: 137 96.1 1E-05 6.2E-09 48.0 2.7 78 1-83 1-137 (137) 91 protein:vir:99196 Length: 155 96.1 1.2E-05 7.5E-09 47.6 3.1 74 1-93 71-155 (155) 92 protein:vir:79225 Length: 155 95.9 1.5E-05 9.4E-09 47.0 3.1 74 1-93 71-155 (155) 93 protein:vir:95894 Length: 137 95.8 1.7E-05 1.1E-08 46.7 2.8 78 1-83 1-137 (137) 94 protein:vir:78077 Length: 141 95.7 9.5E-06 5.9E-09 48.1 1.2 89 1-90 11-141 (141) 95 protein:vir:79179 Length: 155 95.7 0.00012 7.2E-08 42.1 7.0 91 71-162 1-99 (155) 96 protein:vir:94108 Length: 149 95.6 4E-05 2.5E-08 44.7 4.2 81 1-83 30-149 (149) 97 protein:vir:1164 Length: 156 # 95.5 4.8E-05 3E-08 44.3 4.3 82 1-96 64-156 (156) 98 protein:vir:96829 Length: 135 95.3 1.6E-05 9.8E-09 46.9 1.1 78 1-83 1-135 (135) 99 protein:vir:79115 Length: 148 95.3 0.0002 1.2E-07 40.9 7.1 86 72-162 1-92 (148) 100 protein:vir:96121 Length: 137 95.3 2.9E-05 1.8E-08 45.5 2.4 81 1-83 18-137 (137) 101 protein:vir:94796 Length: 137 95.2 6.8E-05 4.2E-08 43.4 4.2 81 1-83 18-137 (137) 102 protein:vir:105916 Length: 149 95.2 7.4E-05 4.6E-08 43.2 4.2 81 1-83 30-149 (149) 103 protein:vir:97327 Length: 116 95.1 3.2E-05 2E-08 45.2 2.1 81 1-83 1-116 (116) 104 protein:vir:1243 Length: 116 # 95.1 3.2E-05 2E-08 45.2 2.1 81 1-83 1-116 (116) 105 protein:vir:95062 Length: 116 94.9 3.5E-05 2.1E-08 45.0 1.8 81 1-83 1-116 (116) 106 protein:vir:100887 Length: 139 94.6 6.1E-05 3.8E-08 43.7 2.4 77 1-97 61-139 (139) 107 protein:vir:966 Length: 123 # 94.4 0.00017 1.1E-07 41.2 4.4 88 1-88 1-123 (123) 108 protein:vir:5000 Length: 141 # 94.3 0.00012 7.4E-08 42.1 3.3 79 1-97 61-141 (141) 109 protein:vir:102154 Length: 119 94.2 4.7E-05 2.9E-08 44.3 0.8 77 1-91 42-119 (119) 110 protein:vir:1164 Length: 156 # 94.1 0.00085 5.3E-07 37.4 7.4 90 71-162 1-96 (156) 111 protein:vir:4833 Length: 140 # 93.9 0.00011 7E-08 42.2 2.4 76 1-94 61-140 (140) 112 protein:vir:81147 Length: 126 93.8 0.00024 1.5E-07 40.5 3.8 90 1-93 20-126 (126) 113 protein:vir:4859 Length: 140 # 93.5 0.00012 7.2E-08 42.2 1.7 78 1-96 61-140 (140) 114 protein:vir:100223 Length: 139 93.0 0.00021 1.3E-07 40.7 2.4 77 1-97 61-139 (139) 115 protein:vir:99101 Length: 142 92.6 0.00019 1.2E-07 41.0 1.5 84 1-84 2-142 (142) 116 protein:vir:8669 Length: 142 # 92.6 0.00019 1.2E-07 41.0 1.5 84 1-84 2-142 (142) 117 protein:vir:93738 Length: 137 92.2 0.00053 3.3E-07 38.6 3.5 67 67-162 1-67 (137) 118 protein:vir:94490 Length: 137 92.2 0.00053 3.3E-07 38.6 3.5 67 67-162 1-67 (137) 119 protein:vir:97427 Length: 137 92.2 0.00053 3.3E-07 38.6 3.5 67 67-162 1-67 (137) 120 protein:vir:4956 Length: 153 # 92.2 0.00039 2.4E-07 39.3 2.7 87 1-126 61-153 (153) 121 protein:vir:3787 Length: 231 # 90.2 0.0012 7.4E-07 36.6 3.3 83 1-95 59-231 (231) 122 protein:vir:96121 Length: 137 90.1 0.0013 7.9E-07 36.5 3.4 67 67-162 1-67 (137) 123 protein:vir:95894 Length: 137 89.7 0.0015 9.4E-07 36.0 3.5 67 67-162 1-67 (137) 124 protein:vir:94796 Length: 137 89.2 0.0017 1E-06 35.8 3.4 67 67-162 1-67 (137) 125 protein:vir:96829 Length: 135 88.9 0.002 1.2E-06 35.4 3.5 67 67-162 1-67 (135) 126 protein:vir:100312 Length: 152 88.4 0.011 6.6E-06 31.4 7.2 82 71-162 1-100 (152) 127 protein:vir:105330 Length: 137 88.3 0.0021 1.3E-06 35.3 3.2 67 67-162 1-67 (137) 128 protein:vir:107099 Length: 137 88.2 0.0024 1.5E-06 35.0 3.5 67 67-162 1-67 (137) 129 protein:vir:5978 Length: 144 # 87.5 0.004 2.5E-06 33.7 4.3 72 62-162 1-72 (144) 130 protein:vir:94654 Length: 142 86.5 0.0035 2.2E-06 34.1 3.4 68 67-162 1-69 (142) 131 protein:vir:9930 Length: 108 # 86.1 0.0038 2.3E-06 33.9 3.3 62 90-162 1-63 (108) 132 protein:vir:106570 Length: 182 85.8 0.0013 7.8E-07 36.5 0.6 72 67-162 1-72 (182) 133 protein:vir:95062 Length: 116 84.0 0.0045 2.8E-06 33.4 2.8 46 82-162 1-46 (116) 134 protein:vir:3848 Length: 159 # 83.8 0.0034 2.1E-06 34.1 2.0 79 1-96 76-159 (159) 135 protein:vir:106041 Length: 137 83.7 0.0013 8.4E-07 36.3 -0.2 81 1-81 1-137 (137) 136 protein:vir:78077 Length: 141 83.4 0.0068 4.2E-06 32.5 3.5 66 67-162 1-67 (141) 137 protein:vir:105467 Length: 144 81.9 0.019 1.2E-05 30.1 5.3 91 1-94 1-144 (144) 138 protein:vir:79034 Length: 141 81.5 0.021 1.3E-05 29.8 5.4 94 1-95 1-141 (141) 139 protein:vir:98636 Length: 138 81.0 0.011 6.8E-06 31.3 3.7 82 1-92 32-138 (138) 140 protein:vir:101594 Length: 173 79.6 0.027 1.7E-05 29.2 5.4 65 70-162 1-66 (173) 141 protein:vir:100652 Length: 134 79.3 0.009 5.6E-06 31.8 2.7 74 1-89 1-134 (134) 142 protein:vir:9879 Length: 127 # 79.2 0.0051 3.2E-06 33.2 1.3 85 1-88 16-127 (127) 143 protein:vir:9312 Length: 115 # 79.0 0.014 8.7E-06 30.8 3.6 71 69-162 1-71 (115) 144 protein:vir:96225 Length: 115 79.0 0.014 8.7E-06 30.8 3.6 71 69-162 1-71 (115) 145 protein:vir:78858 Length: 115 79.0 0.014 8.7E-06 30.8 3.6 71 69-162 1-71 (115) 146 protein:vir:97144 Length: 115 79.0 0.014 8.7E-06 30.8 3.6 71 69-162 1-71 (115) 147 protein:vir:103917 Length: 115 79.0 0.014 8.7E-06 30.8 3.6 71 69-162 1-71 (115) 148 protein:vir:96358 Length: 115 79.0 0.014 8.7E-06 30.8 3.6 71 69-162 1-71 (115) 149 protein:vir:106506 Length: 137 78.8 0.018 1.1E-05 30.2 4.1 62 60-162 1-62 (137) 150 protein:vir:78755 Length: 228 78.1 0.028 1.7E-05 29.1 5.0 93 1-104 55-228 (228) 151 protein:vir:102963 Length: 163 77.5 0.024 1.5E-05 29.5 4.5 92 1-95 1-163 (163) 152 protein:vir:102441 Length: 137 77.1 0.0041 2.5E-06 33.7 0.1 74 1-82 22-137 (137) 153 protein:vir:98409 Length: 108 76.8 0.024 1.5E-05 29.5 4.2 63 66-162 1-63 (108) 154 protein:vir:105916 Length: 149 76.7 0.028 1.8E-05 29.1 4.6 79 28-162 1-79 (149) 155 protein:vir:9647 Length: 132 # 75.6 0.02 1.3E-05 29.9 3.5 82 1-92 45-132 (132) 156 protein:vir:6246 Length: 143 # 67.5 0.025 1.6E-05 29.3 2.1 86 1-101 33-143 (143) 157 protein:vir:1332 Length: 143 # 64.6 0.033 2E-05 28.7 2.1 84 1-101 33-143 (143) 158 protein:vir:80116 Length: 127 64.0 0.035 2.2E-05 28.6 2.1 90 1-94 1-127 (127) 159 protein:vir:93898 Length: 133 63.2 0.06 3.8E-05 27.3 3.3 75 1-88 1-133 (133) 160 protein:vir:101302 Length: 134 58.9 0.11 6.5E-05 25.9 3.8 76 1-89 1-134 (134) 161 protein:vir:9513 Length: 134 # 58.9 0.11 6.5E-05 25.9 3.8 76 1-89 1-134 (134) 162 protein:vir:97982 Length: 140 57.8 0.081 5E-05 26.6 3.0 66 82-162 1-66 (140) 163 protein:vir:107545 Length: 140 57.8 0.081 5E-05 26.6 3.0 66 82-162 1-66 (140) 164 protein:vir:102338 Length: 116 57.4 0.16 9.8E-05 25.0 4.5 91 1-91 1-116 (116) 165 protein:vir:99528 Length: 92 # 56.2 0.046 2.8E-05 27.9 1.4 66 67-162 1-68 (92) 166 protein:vir:95372 Length: 124 56.2 0.039 2.4E-05 28.3 1.0 86 1-88 1-124 (124) 167 protein:vir:94419 Length: 133 54.4 0.11 6.7E-05 25.9 3.1 75 1-88 1-133 (133) 168 protein:vir:96973 Length: 133 54.4 0.11 6.7E-05 25.9 3.1 75 1-88 1-133 (133) 169 protein:vir:9363 Length: 133 # 54.4 0.11 6.7E-05 25.9 3.1 75 1-88 1-133 (133) 170 protein:vir:78644 Length: 133 54.4 0.11 6.7E-05 25.9 3.1 75 1-88 1-133 (133) 171 protein:vir:487 Length: 187 # 47.6 0.23 0.00014 24.1 3.7 90 1-162 1-90 (187) 172 protein:vir:4460 Length: 170 # 42.4 0.067 4.2E-05 27.0 -0.0 76 61-162 1-77 (170) 173 protein:vir:7412 Length: 168 # 38.2 0.33 0.0002 23.2 3.1 89 1-99 62-168 (168) 174 protein:vir:96012 Length: 133 37.1 0.39 0.00024 22.8 3.3 82 1-90 23-133 (133) 175 protein:vir:1028 Length: 168 # 36.9 0.19 0.00012 24.5 1.6 88 1-99 62-168 (168) 176 protein:vir:78335 Length: 133 33.1 0.4 0.00025 22.8 2.7 82 1-90 24-133 (133) 177 protein:vir:6216 Length: 125 # 32.3 0.67 0.00042 21.5 3.8 78 1-90 41-125 (125) 178 protein:vir:3994 Length: 168 # 20.6 0.65 0.0004 21.6 1.4 88 1-99 62-168 (168) No 1 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=100.00 E-value=8.6e-56 Score=322.48 Aligned_cols=158 Identities=22% Similarity=0.289 Sum_probs=147.9 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHHHHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVISQNNIAK 80 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~ 80 (162) |.+.|+..+ +.+++|.+.|++|++++|+||||++++|||| +++|+||+|||||+|..+||||||||+|+++++++|.+ T Consensus 1 M~~~i~~~~-~~~~~L~~~lk~l~~k~V~VGi~~~~~y~dG-~~vA~Ia~~~E~G~p~~~IP~RPFlr~t~~~~~~~~~~ 78 (189) T protein:vir:10 1 MGRVIRKQG-PARVKLNAFIKGMNDYSVRIGWFSTAKYPDG-TPTAYVASIHEFGAPSRGIPARSFIRPTIAAQQAAWSQ 78 (189) T ss_pred CcceeccCc-HHHHHHHHHHHHhhCCeEEEEecCCCCCCCc-ccHHHHHHHHHhcCcCCCCCCchhhhHHHHHHHHHHHH Confidence 999999654 5689999999999999999999999999865 99999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccc-hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccC------------------------- Q lcl|NC_020079. 81 KMKQVFANYLMHN-VGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKG------------------------- 134 (162) Q Consensus 81 ~~~~~~~~~l~G~-~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~------------------------- 134 (162) ++++.+.+++.|+ +++++|+.+|+.++++||.+|.++.+|||||+||++|+..+ T Consensus 79 ~l~~~~~~vl~G~~~~~~~L~~~G~~a~~~Ik~~I~~~~~ppna~sTi~~Kg~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (189) T protein:vir:10 79 QMRFYAKQIVVGQMNVEQALEGLAIVARGDVDATLARLKDPPLSPLTIYIRKFIKDGGVIHGYKDIMRLRSEMQQEQAKG 158 (189) T ss_pred HHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHHhcccCcccchhhhhhhhhhhhhhhhhhhhc Confidence 9999999999998 56899999999999999999999999999999999998542 Q ss_pred -----CCCCCchhhHHHHHhhceeeeecCCC Q lcl|NC_020079. 135 -----NYSNHILIDTAHMINAIETKITKSKS 160 (162) Q Consensus 135 -----~~~~~PLidTG~l~~SIty~V~~~~~ 160 (162) ..|++||||||+|++||||+|+++++ T Consensus 159 ~~~~~~~s~kPLidTG~l~~SIty~V~~k~~ 189 (189) T protein:vir:10 159 TLNLSGVSTDPLDFTGYMRATLSYTVTKEKS 189 (189) T ss_pred cccccccCCCchhhHHHHHhhcceeeeecCC Confidence 34789999999999999999999999 No 2 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=100.00 E-value=3.9e-53 Score=307.92 Aligned_cols=151 Identities=23% Similarity=0.283 Sum_probs=138.5 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCC-----CCCCHHHHHHHHhcCCc------------------ Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPE-----SDLTIPAIAAIQQYGNE------------------ 57 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d-----~g~~~A~iA~~~E~G~~------------------ 57 (162) |..++.+.+.+++++++++|++|++++|+|||+++++||+ +|+++|+||+|||||++ T Consensus 5 ~~~~~k~~~~~~~~~~~~~l~~l~~~~v~vGi~~~~~y~~~~~~~dG~~va~IA~~~EfG~~i~~p~~~~~~~~~~~~g~ 84 (200) T protein:vir:99 5 FSKSNSVAAPLKHFQMLKQFDALKGKTVQAGWFETDRYPAKEGETIGPLVAKIARQLEFGGVINHPGGTKYIKDAIVDGR 84 (200) T ss_pred cceeeeeecchHHHHHHHHHHHhhCCeEEEEEcCCCCcCCcccccccchHHHHHhHHHcCCeeccCCCcccccccccccc Confidence 3334444455689999999999999999999999999873 57999999999999965 Q ss_pred ---------------------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccc-hHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 58 ---------------------TNNIPARPFITDGAVISQNNIAKKMKQVFANYLMHN-VGLAVFEPIARASREGIAQAIA 115 (162) Q Consensus 58 ---------------------~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~-~~~~~l~~iG~~~~~~i~~~I~ 115 (162) +++||||||||+|+++++++|.+++++.+.+++.|+ +++++|+.+|..++++||.+|. T Consensus 85 ~~g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~~G~~~~~~ik~~I~ 164 (200) T protein:vir:99 85 YVGTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQAQIARQLLDGTINPEQALAQIGLALEGCIVRSIK 164 (200) T ss_pred ccccccccccccceeeeeccccccCCCcchhhHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHh Confidence 458999999999999999999999999999999998 5689999999999999999999 Q ss_pred hcCCCCCCHHHHHHhhccCCCCCCchhhHHHHHhhceeeee Q lcl|NC_020079. 116 MQRYRPLSPVTIKIRQDKGNYSNHILIDTAHMINAIETKIT 156 (162) Q Consensus 116 ~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~~SIty~V~ 156 (162) ++.+|||||+||++|+ ||+||||||+|++||||+|. T Consensus 165 ~~~~ppna~sTi~~Kg-----~~~PLidTG~l~~SIty~Ve 200 (200) T protein:vir:99 165 SGPWAANSPATIRAKG-----FDKPLIDTAHMWQTVSSKVS 200 (200) T ss_pred cCCCCCChHHHHHHhC-----CCCchHHHHHHHhHhccccC Confidence 9999999999999887 89999999999999999999 No 3 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=100.00 E-value=5.7e-52 Score=301.52 Aligned_cols=148 Identities=22% Similarity=0.326 Sum_probs=136.7 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCC-----CCCCHHHHHHHHhcCCc------------------ Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPE-----SDLTIPAIAAIQQYGNE------------------ 57 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d-----~g~~~A~iA~~~E~G~~------------------ 57 (162) |+.+ .+.+++++++++|++|++++|+|||+++++|+| .|+++|+||+|||||++ T Consensus 1 m~~~---~~~~~~~~~~~~l~~l~~~~v~vGi~~~~~~~~~~~~~~G~~va~iAai~EfG~~I~~~~~~~~~~~~~~~g~ 77 (193) T protein:vir:96 1 MSLR---RDSELIAAHLQMLRAMRGRSVSAGWYSTARYPDKAGGSVGIQVARIARLNEYGGTIDHPGGTRYIRDAIVRGR 77 (193) T ss_pred Ceec---cchHHHHHHHHHHHHhcCCeEEEEEcCCCCCCCcccccccchHHHHHhHHHcCCccccCccceeeeecccccc Confidence 4433 455679999999999999999999999998876 26899999999999965 Q ss_pred ---------------------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccc-hHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 58 ---------------------TNNIPARPFITDGAVISQNNIAKKMKQVFANYLMHN-VGLAVFEPIARASREGIAQAIA 115 (162) Q Consensus 58 ---------------------~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~-~~~~~l~~iG~~~~~~i~~~I~ 115 (162) +++||||||||+|+++++++|.+.+++++.+++.|+ +++++|+.+|..++++||.+|. T Consensus 78 ~~~~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~~G~~~~~~ik~~I~ 157 (193) T protein:vir:96 78 FVGVRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQNRIAMRLARGQITPDQALAQIGLALEGYIARSIR 157 (193) T ss_pred ccccceeccCcceeeEeecceeccCCCcchhhhhHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHh Confidence 347999999999999999999999999999999998 5689999999999999999999 Q ss_pred hcCCCCCCHHHHHHhhccCCCCCCchhhHHHHHhhceeeee Q lcl|NC_020079. 116 MQRYRPLSPVTIKIRQDKGNYSNHILIDTAHMINAIETKIT 156 (162) Q Consensus 116 ~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~~SIty~V~ 156 (162) ++.+|||||+||++|+ ||+||||||+|++||+|+|+ T Consensus 158 ~~~~ppna~~Ti~~KG-----~~~PLidTG~l~~SIty~Vv 193 (193) T protein:vir:96 158 TGPWVANSASTVRRKG-----FNRPLVDTAHMLQSISSRVT 193 (193) T ss_pred cCCCCCCcHHHHHHhC-----CCCchhHHHHHHhhhcceeC Confidence 9999999999999887 89999999999999999999 No 4 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=100.00 E-value=5.9e-51 Score=295.96 Aligned_cols=144 Identities=21% Similarity=0.368 Sum_probs=130.4 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecC----CCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLT----NRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVISQN 76 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~----~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~ 76 (162) |+++++. +..++++++++|++|++++|+||||+ +..|+ +|+++|+||||||||++ +||+|||||+|++++++ T Consensus 1 M~~~~k~-~~~~~~~l~~~l~~l~~~~v~VGi~~~~~~~~~~~-~g~~vA~ia~~~E~G~~--~IP~Rpflr~t~~~~~~ 76 (148) T protein:vir:52 1 MAVTVTA-NFSAAKQLIEQMKSLKEKAVYVGFPAEFDEKVKGS-ENFNLASLAAVLEFGNE--HIPARPFLRQTLEENQE 76 (148) T ss_pred Ccccccc-ccHHHHHHHHHHHHhhCCeEEEEeecCcCCCCCCC-CCCCHHHHHHHHhcCCC--CCCCcchhHHHHHHHHH Confidence 9999884 45679999999999999999999995 34454 46999999999999975 89999999999999999 Q ss_pred HHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHHHHhhceeeee Q lcl|NC_020079. 77 NIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAHMINAIETKIT 156 (162) Q Consensus 77 ~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~~SIty~V~ 156 (162) +|.+++++.+.+ +-+++++|+.+|..++++||.+|.++.+|||||+||++|+ ||+||||||+|++||+|+|+ T Consensus 77 ~~~~~~~~~~~~---~~~~~~~L~~~G~~~~~~ik~~I~~~~~ppna~sTi~~Kg-----~~~PLidTG~l~~SIty~V~ 148 (148) T protein:vir:52 77 KYTALFIQWFDQ---GVPAAQIYERLSVMAQGDVQMNIVKGEWVANAKSTIRRKK-----SSKPLIDTGKMRQSVRGIVK 148 (148) T ss_pred HHHHHHHHHHHc---CCCHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHhcC-----CCCchhHHHHHHHHhhhhcC Confidence 999988776654 4467899999999999999999999999999999999887 89999999999999999999 No 5 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=100.00 E-value=3.2e-48 Score=280.94 Aligned_cols=145 Identities=20% Similarity=0.289 Sum_probs=128.2 Q ss_pred CcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCC-------------------------- Q lcl|NC_020079. 3 SEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGN-------------------------- 56 (162) Q Consensus 3 ~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~-------------------------- 56 (162) .+|| .|.+.+++++++|++|++++|+|||+.++ |.++++||.+||||+ T Consensus 1 m~vt-~~~~~~~~~~~~l~~L~~k~v~vGi~~~d-----~~~~~~Ia~~~E~Ga~I~~~~~~l~Ip~~~a~~~k~~~~~~ 74 (199) T protein:vir:80 1 MKVT-TDKSTMNKAIRELDQLDRYSLQIGLFGED-----DSFIQMIAGVHEFGLTIRPKGKYLTIPTPEAGDRRARDIPG 74 (199) T ss_pred Cccc-ccHHHHHHHHHHHHHhcCCEEEEEEecCC-----CcchhheeehhhcCCeeecCCceeeecchhhhcccccccCc Confidence 3444 45577999999999999999999999643 356677777766663 Q ss_pred --------------------------cCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccc-hHHHHHHHHHHHHHHH Q lcl|NC_020079. 57 --------------------------ETNNIPARPFITDGAVISQNNIAKKMKQVFANYLMHN-VGLAVFEPIARASREG 109 (162) Q Consensus 57 --------------------------~~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~-~~~~~l~~iG~~~~~~ 109 (162) ++++||+|||||+|+++++++|.+++++++.++++|+ +++++|+.+|..++++ T Consensus 75 ~~~p~g~~~~~~~~~~~~~~~~e~g~~~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~vl~g~~~a~~~L~~~G~~~~~~ 154 (199) T protein:vir:80 75 LFKPKGKNILAVAGPDGKLTVMFYLKTEVNIPERSFLRSTFDEKSNKWGELFEGWIDDVIHGKLSAEQVYNRLGAKIVDD 154 (199) T ss_pred ccccCCcceeeeeccccceeeeeeccccccCCCCchhHHHHHHHHHHHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHH Confidence 4468999999999999999999999999999999998 4689999999999999 Q ss_pred HHHHHHhcCCCCCCHHHHH-HhhccCCCCCCchhhHHHHHhhceeeeecC Q lcl|NC_020079. 110 IAQAIAMQRYRPLSPVTIK-IRQDKGNYSNHILIDTAHMINAIETKITKS 158 (162) Q Consensus 110 i~~~I~~~~~ppnsp~Ti~-~k~~k~~~~~~PLidTG~l~~SIty~V~~~ 158 (162) ||.+|.++.||||||+||+ +|+ ||+||||||+|++||+|+|++. T Consensus 155 Ik~~I~~~~~ppna~~Tia~rKg-----~~kPLidTG~l~~SIty~V~~~ 199 (199) T protein:vir:80 155 IQMKIVEIQTPAKSAATLARNPR-----KNNPLIVTGKMKNSVTWKVMKS 199 (199) T ss_pred HHHHHhccCCCCCCHHHHHHhcC-----CCCchHHHHHHHhhcceeeeeC Confidence 9999999999999999997 454 8999999999999999999999 No 6 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=100.00 E-value=7.7e-47 Score=273.41 Aligned_cols=138 Identities=22% Similarity=0.280 Sum_probs=121.1 Q ss_pred ccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCC-----------------CCCCHHHHHHHHhcCCcCCCCCCcchh Q lcl|NC_020079. 5 ILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPE-----------------SDLTIPAIAAIQQYGNETNNIPARPFI 67 (162) Q Consensus 5 i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d-----------------~g~~~A~iA~~~E~G~~~~~IP~RpFl 67 (162) |++.. .+|+.++ .+|++++|+|||+++++||| +|+++|+||+|||||+ .+||+|||| T Consensus 1 m~v~~-k~L~~~~---~~l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~--~~IP~RPFl 74 (155) T protein:vir:10 1 MSVTR-RGLTLPK---DRYRSMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGT--SKLPARPFM 74 (155) T ss_pred CcchH-HHHHHHH---HHHhCCeeEEeecCCCCCccccchhhhhhhhcccccccCCcHHHHHHHHhcCC--CCCCCcchh Confidence 33222 2355554 45678999999999999998 3799999999999996 799999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHHH Q lcl|NC_020079. 68 TDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAHM 147 (162) Q Consensus 68 r~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l 147 (162) |+|+++++++|.+.+++.+.+ +.+++++|+.+|..++++||.+|+++. |||||+||++|+ ||+||||||+| T Consensus 75 r~t~~~~~~~~~~~l~~~~~~---~~~~~~~L~~lG~~~~~~Ik~~I~~~~-~pna~~Ti~~KG-----~~kPLidTG~l 145 (155) T protein:vir:10 75 EKTIADRSAEWIKGLTVMMTM---GYDAEVAMGQIGQAMKDDIKTTISEWP-ADNSADWAGKKG-----FNHGLIWTSHL 145 (155) T ss_pred HHHHHHHHHHHHHHHHHHHHc---CCCHHHHHHHHHHHHHHHHHHHHhcCC-CCCcHHHHHhcC-----CCCchhHHHHH Confidence 999999999999998877765 556789999999999999999999985 999999998777 99999999999 Q ss_pred Hhhceeeeec Q lcl|NC_020079. 148 INAIETKITK 157 (162) Q Consensus 148 ~~SIty~V~~ 157 (162) ++||+|+|++ T Consensus 146 ~~SIty~Vv~ 155 (155) T protein:vir:10 146 LNSVEQEIVK 155 (155) T ss_pred HHhhhhhccC Confidence 9999999999 No 7 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=100.00 E-value=7.8e-47 Score=273.36 Aligned_cols=138 Identities=22% Similarity=0.299 Sum_probs=121.3 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCC-----------------CCCCHHHHHHHHhcCCcCCCCCC Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPE-----------------SDLTIPAIAAIQQYGNETNNIPA 63 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d-----------------~g~~~A~iA~~~E~G~~~~~IP~ 63 (162) |+ |.+ .+|+.++ .+|++++|+|||+++++||| +|+++|+||+|||||+ .+||| T Consensus 1 m~--v~~---k~L~~~~---~~l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~--~~IP~ 70 (155) T protein:vir:78 1 MS--VTR---RGLTLPK---DRYRSMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGT--SKLPA 70 (155) T ss_pred Cc--chH---HHHHHHH---HHHhCCeeEEeecCCCCCCcccchhhhhhhhcccccccCCcHHHHHHhhhcCC--CCCCC Confidence 32 222 2355554 45678999999999999998 3799999999999995 69999 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhh Q lcl|NC_020079. 64 RPFITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILID 143 (162) Q Consensus 64 RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLid 143 (162) |||||+|+++++++|.+.+++.+.+ +.+++++|+.+|+.++++||.+|.++. |||||+||++|+ ||+|||| T Consensus 71 RPFlr~t~~~~~~~~~~~l~~~~~~---~~~~~~~L~~~G~~~~~~Ik~~I~~~~-~pna~~Ti~~Kg-----~~kPLid 141 (155) T protein:vir:78 71 RPFMEKTITDRSAEWIKGLTVMMTM---GYDAEVAMGQIGQAMKDDIKTTISEWP-ADNSADWAGKKG-----FNHGLIW 141 (155) T ss_pred cchhhHHHHHHHHHHHHHHHHHHHc---CCCHHHHHHHHHHHHHHHHHHHHhcCC-CCCcHHHHHhcC-----CCCchhH Confidence 9999999999999999998887765 556789999999999999999999985 999999998877 9999999 Q ss_pred HHHHHhhceeeeec Q lcl|NC_020079. 144 TAHMINAIETKITK 157 (162) Q Consensus 144 TG~l~~SIty~V~~ 157 (162) ||+|++||+|+|++ T Consensus 142 TG~l~~SIty~V~~ 155 (155) T protein:vir:78 142 TSHLLNSVEQEIVK 155 (155) T ss_pred HHHHHHhhhhhccC Confidence 99999999999999 No 8 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=100.00 E-value=3.4e-46 Score=269.88 Aligned_cols=147 Identities=16% Similarity=0.221 Sum_probs=127.1 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCC----------------CCCCHHHHHHHHhcCCcCCCCCCc Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPE----------------SDLTIPAIAAIQQYGNETNNIPAR 64 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d----------------~g~~~A~iA~~~E~G~~~~~IP~R 64 (162) |+. ....+++..++.+.+|.++.|+|||+++.+||+ +|+++|+||+|||||+ .+||+| T Consensus 1 ~~~----~~~~g~~~~~~~~~~l~~~~v~vG~l~~a~yp~G~~~~~~~~~~~~~~~~g~~va~Ia~~~E~G~--~~IP~R 74 (168) T protein:vir:94 1 MTT----IARKGVKMPPHLEAQFQSGEVKAGVLSGSTYPQMTYTDQRTGKQIEDARGGMPVAVIAQALEYGH--GQNHPR 74 (168) T ss_pred Ccc----ccchhhhhhHHHHHhhhccceeeeccccCcccccccchhhcccccccccccccHHHHHHHHhcCC--CCCCCc Confidence 443 333458899999999999999999999988874 5679999999999996 699999 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhH Q lcl|NC_020079. 65 PFITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDT 144 (162) Q Consensus 65 pFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidT 144 (162) ||||+|+++++++|.+.+++.+.+ +-+++++|+.+|..++++||.+|.++. |||||+||++|+ ||+||||| T Consensus 75 PFlr~t~~~~~~~~~~~~~~~~~~---~~~~~~~L~~lG~~~~~~Ik~~I~~~~-ppna~sTi~~KG-----~~~PLiDT 145 (168) T protein:vir:94 75 PFMQQTYAAQYRAWSRDLTLTLKA---GAAADTALRTVGQRMAEDIQDTIRNWP-ADNSPEWAAIKG-----FNAGLRQT 145 (168) T ss_pred hhhHHHHHHHHHHHHHHHHHHHhc---CCCHHHHHHHHHHHHHHHHHHHhhcCC-CCccHHHHHhcC-----CCCchhHH Confidence 999999999999999888766554 346789999999999999999999985 999999999777 99999999 Q ss_pred HHHHhhceeeeecCCCCC Q lcl|NC_020079. 145 AHMINAIETKITKSKSKK 162 (162) Q Consensus 145 G~l~~SIty~V~~~~~~~ 162 (162) |+|++||+|+|++++--- T Consensus 146 G~l~~SIty~Vv~d~~~~ 163 (168) T protein:vir:94 146 GVLLNAIDSAVIIDGEHG 163 (168) T ss_pred HHHHhhcceeeeecCCCC Confidence 999999999888543222 No 9 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=100.00 E-value=1.7e-45 Score=266.03 Aligned_cols=138 Identities=22% Similarity=0.297 Sum_probs=119.9 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCC-----------------CCCCHHHHHHHHhcCCcCCCCCC Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPE-----------------SDLTIPAIAAIQQYGNETNNIPA 63 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d-----------------~g~~~A~iA~~~E~G~~~~~IP~ 63 (162) |+ +.+ .+|+.+ +.+|++++|+|||+++++||| +|+++|+||+|||||+ .+||| T Consensus 1 m~--~~r---~~l~~~---~~~l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~G~pva~ia~~~e~G~--~~IP~ 70 (155) T protein:vir:77 1 MS--VTR---RGLTLP---KDRYRSMSVKAGVLAGATYPDESGKKLADGSILKKDPRAGLPVAMIAMALNYGT--SKLPA 70 (155) T ss_pred Cc--chH---HHHHHH---HHHHhcCceEEeecCCCCCccccchhhhhhhhccccccccccHhhhhhhhhcCC--CCCCC Confidence 32 221 234444 445678999999999999998 3799999999999996 69999 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhh Q lcl|NC_020079. 64 RPFITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILID 143 (162) Q Consensus 64 RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLid 143 (162) |||||+|+++++++|.+.+.+++.. +.+++++|+.+|..++++||.+|+++.+| |+|+||++|+ ||+|||| T Consensus 71 RPFlr~t~~~~~~~~~~~l~~~~~~---~~~~~~~L~~lG~~~~~~Iq~~I~~~~~p-~~~~Ti~~KG-----~d~PLid 141 (155) T protein:vir:77 71 RPFMEKTIADRSAEWIKGLTVMMTM---GYDAEVAMGQIGQAMKDDIKTTISEWPAD-NNADWAGKKG-----FNHGLIW 141 (155) T ss_pred CchhhHHHHHHHHHHHHHHHHHHHc---cCcHHHHHHHHHHHHHHHHHHHHhcCCCC-CChHHHHhcC-----CCCchhH Confidence 9999999999999999999887765 55678999999999999999999999986 5789998877 9999999 Q ss_pred HHHHHhhceeeeec Q lcl|NC_020079. 144 TAHMINAIETKITK 157 (162) Q Consensus 144 TG~l~~SIty~V~~ 157 (162) ||+|++||+|+|++ T Consensus 142 TG~l~~SIty~Vv~ 155 (155) T protein:vir:77 142 TSHLLNSIEQEIVK 155 (155) T ss_pred HHHHHHhhhhhccC Confidence 99999999999999 No 10 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=100.00 E-value=1.7e-45 Score=266.08 Aligned_cols=138 Identities=22% Similarity=0.288 Sum_probs=119.9 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCC-----------------CCCCHHHHHHHHhcCCcCCCCCC Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPE-----------------SDLTIPAIAAIQQYGNETNNIPA 63 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d-----------------~g~~~A~iA~~~E~G~~~~~IP~ 63 (162) |+. .+ .+|++++ +.|++++|+||||++++|+| +|+++|+||+|||||+ .+||| T Consensus 1 m~v--~r---~~L~~~~---~~l~~~~V~VGi~~~a~y~d~~g~~~~~g~~~~~~~~~G~pva~ia~~~e~G~--~~IP~ 70 (155) T protein:vir:10 1 MSV--TR---RGLTLPK---DRYKSMSVKAGVLAGATYPDESGKKLADGTILKKDPRAGLPVAMIAMALNYGT--SKLPA 70 (155) T ss_pred Ccc--hH---HHHHHHH---HHhhCCeeEEeecCCCCCCccccchhhhhhhhccccccCcchhhhhhhhhcCC--CCCCC Confidence 433 22 2356555 45567899999999999998 3799999999999996 69999 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhh Q lcl|NC_020079. 64 RPFITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILID 143 (162) Q Consensus 64 RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLid 143 (162) |||||+|+++++++|.+.+++.+.+ +-+++++|+.+|..++++||.+|.++.+| |+|+||++|+ ||+|||| T Consensus 71 RPFlr~t~~~~~~~~~~~l~~~~~~---~~~~~~~L~~~G~~~~~~Ik~~I~~~~~p-~~~~Ti~~KG-----~~~PLid 141 (155) T protein:vir:10 71 RPFMEKTIADRSAEWIKGLTVMMTM---GYDAEVAMGQIGQAMKDDIKTTISEWPAD-NNADWAGKKG-----FNHGLIW 141 (155) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHc---CCCHHHHHHHHHHHHHHHHHHHHhcCCCC-CChHHHHhcC-----CCCchHH Confidence 9999999999999999998877765 45678999999999999999999999986 6789998776 9999999 Q ss_pred HHHHHhhceeeeec Q lcl|NC_020079. 144 TAHMINAIETKITK 157 (162) Q Consensus 144 TG~l~~SIty~V~~ 157 (162) ||+|++||+|+|++ T Consensus 142 TG~l~~Sity~Vv~ 155 (155) T protein:vir:10 142 TSHLLNSIEQEIVK 155 (155) T ss_pred HHHHHHhhhhhccC Confidence 99999999999999 No 11 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=100.00 E-value=1.5e-44 Score=260.88 Aligned_cols=150 Identities=15% Similarity=0.175 Sum_probs=124.8 Q ss_pred cccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHH-----HHHHH Q lcl|NC_020079. 4 EILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVI-----SQNNI 78 (162) Q Consensus 4 ~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~-----~~~~~ 78 (162) -|.+.+..++++|.++|+.|+++.|+||||+++.++++|+++++||+|||||. .+||+|||||++|+. ++..+ T Consensus 1 ~~~~~~~~G~~~L~~~~k~l~~~~V~VGi~~d~g~~~dG~sv~~vA~~~EfG~--~~iPaRPf~R~tfe~~~~~~~~~~~ 78 (160) T protein:vir:95 1 MVKRVIHPARAKLVGAMKNLQTANAQVGYFQEQGQHSSGFSYPALMYLQEVIG--VPSASGKVYRRLFEITMMLNKQTLL 78 (160) T ss_pred CceeechHhHHHHHHHHHHHhCCeeEEeeccccccCCCCccHHHHHhhhhcCc--ccCCCcchhHHHHHHHHHHHHHHHH Confidence 23344556799999999999999999999999855566799999999999995 689999999999973 33333 Q ss_pred HHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHh----cCCCCCCHHHHHHhhccCCCCCCchhhHHHHHhhceee Q lcl|NC_020079. 79 AKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAM----QRYRPLSPVTIKIRQDKGNYSNHILIDTAHMINAIETK 154 (162) Q Consensus 79 ~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~----~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~~SIty~ 154 (162) .+..+....++..|.++ .++.+|+.++++|+.+|.+ +.||||||+||++|+ ||+||||||+|++||+|+ T Consensus 79 ~~~~~~i~~~~~~g~~~--~~~~LG~~~~~~ik~~I~~~~~p~~w~pNap~Ti~~Kg-----s~~PLiDTg~l~~Si~y~ 151 (160) T protein:vir:95 79 EQTKKNLYKQLSSLNTD--PSNTLEAFAKNAQKAIKRGFGNSAILPPNAPSTVKKKG-----FNAPLVETGDLRDNLAYK 151 (160) T ss_pred HHHHHHHHHHHhhcchh--HHHHHHHHHHHHHHHHHhhcCCccCCCCCcHHHHHhcC-----CCCcchhhHHHhhhhhhe Confidence 34444455666666654 3456999999999999987 357899999999998 999999999999999999 Q ss_pred eecCCCCC Q lcl|NC_020079. 155 ITKSKSKK 162 (162) Q Consensus 155 V~~~~~~~ 162 (162) |++++|-+ T Consensus 152 v~~~~~~~ 159 (160) T protein:vir:95 152 ISTKKGIK 159 (160) T ss_pred eecccccC Confidence 99999999 No 12 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=98.67 E-value=4.2e-11 Score=77.47 Aligned_cols=93 Identities=19% Similarity=0.288 Sum_probs=67.9 Q ss_pred hhHHH--HHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhc--CCCCCCHHHHHHhhccCCCCCCchh Q lcl|NC_020079. 67 ITDGA--VISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQ--RYRPLSPVTIKIRQDKGNYSNHILI 142 (162) Q Consensus 67 lr~~~--~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~--~~ppnsp~Ti~~k~~k~~~~~~PLi 142 (162) |-..+ .-+...+.+.+.++... ..+...+|..||..+...+++.|... .|+|+||+|+++|.++++++.++|+ T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~---~~~~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~~~~~~L~ 77 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAA---VTDTLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGRGAHPILQ 77 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHH---hhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccCCCCCccc Confidence 32111 11223455555555444 23556889999999999999999653 6999999999998877888889999 Q ss_pred hHHHHHhhceeeeecCCCCC Q lcl|NC_020079. 143 DTAHMINAIETKITKSKSKK 162 (162) Q Consensus 143 dTG~l~~SIty~V~~~~~~~ 162 (162) |||.|++||+|.+....-.= T Consensus 78 ~tG~L~~Si~~~~~~~~v~v 97 (155) T protein:vir:10 78 VTNALARSITTRADRDQAQI 97 (155) T ss_pred cchhhhhhhhceecCCEEEE Confidence 99999999999975543221 No 13 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=98.61 E-value=1.1e-10 Score=75.14 Aligned_cols=93 Identities=18% Similarity=0.224 Sum_probs=68.1 Q ss_pred hhHHH--HHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHh-c-CCCCCCHHHHHHhhccCCCCCCchh Q lcl|NC_020079. 67 ITDGA--VISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAM-Q-RYRPLSPVTIKIRQDKGNYSNHILI 142 (162) Q Consensus 67 lr~~~--~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~-~-~~ppnsp~Ti~~k~~k~~~~~~PLi 142 (162) |-..+ .-+.+.+.+.+.++... ..+...+|..||..+...+++.|.. | .|+|+||+|+++|..+++...++|+ T Consensus 1 M~~~i~i~~d~~~~~~~L~~l~~~---~~d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~ 77 (155) T protein:vir:79 1 MTTRIDVELDDQEVRQRLAVLMRS---VTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQ 77 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHH---hhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccc Confidence 32211 11224455555555544 2456788999999999999999965 3 6899999999999877777889999 Q ss_pred hHHHHHhhceeeeecCCCCC Q lcl|NC_020079. 143 DTAHMINAIETKITKSKSKK 162 (162) Q Consensus 143 dTG~l~~SIty~V~~~~~~~ 162 (162) |||.|++||+|++......= T Consensus 78 ~tG~L~~Si~~~~~~~~v~v 97 (155) T protein:vir:79 78 VTNALARSVTTWADRNEAGI 97 (155) T ss_pred cchhhhhhhhceecCCEEEE Confidence 99999999999976543221 No 14 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=98.55 E-value=2.3e-10 Score=73.45 Aligned_cols=93 Identities=19% Similarity=0.239 Sum_probs=68.0 Q ss_pred hhHH--HHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHh-c-CCCCCCHHHHHHhhccCCCCCCchh Q lcl|NC_020079. 67 ITDG--AVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAM-Q-RYRPLSPVTIKIRQDKGNYSNHILI 142 (162) Q Consensus 67 lr~~--~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~-~-~~ppnsp~Ti~~k~~k~~~~~~PLi 142 (162) |-.- +.-+.+.+.+.+.++... ..+...+|..||..+...+++.|.. | .|+|+||+|+++|..++....++|+ T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~---~~d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~ 77 (155) T protein:vir:99 1 MTTRIDVELDDQEVRQRLALLMRS---VTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQ 77 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHH---hhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcch Confidence 3211 111234555556555554 2456789999999999999999964 3 6899999999999877777788999 Q ss_pred hHHHHHhhceeeeecCCCCC Q lcl|NC_020079. 143 DTAHMINAIETKITKSKSKK 162 (162) Q Consensus 143 dTG~l~~SIty~V~~~~~~~ 162 (162) |||.|++||+|.+....-.= T Consensus 78 ~tg~L~~Si~~~~~~~~v~v 97 (155) T protein:vir:99 78 VTNALARSVTTWADRNEAGI 97 (155) T ss_pred hchhhhhhhhceecCCEEEE Confidence 99999999999975543221 No 15 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=98.50 E-value=2.6e-10 Score=73.14 Aligned_cols=93 Identities=20% Similarity=0.202 Sum_probs=66.2 Q ss_pred hhH--HHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhc---CCCCCCHHHHHHhhcc-------- Q lcl|NC_020079. 67 ITD--GAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQ---RYRPLSPVTIKIRQDK-------- 133 (162) Q Consensus 67 lr~--~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~---~~ppnsp~Ti~~k~~k-------- 133 (162) |-. .+.-+-+.+.+.+.++... +.+...+|..||..+...+++.|.+. .|+|+||+|+++|... T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~---~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~ 77 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQA---GHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNG 77 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHH---hcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccc Confidence 322 1111224556666655554 35667899999999999999999775 5789999999877432 Q ss_pred --------CCCCCCchhhHHHHHhhceeeeecCCCCC Q lcl|NC_020079. 134 --------GNYSNHILIDTAHMINAIETKITKSKSKK 162 (162) Q Consensus 134 --------~~~~~~PLidTG~l~~SIty~V~~~~~~~ 162 (162) ++.+.+||+|||.|++||+|.+....-.= T Consensus 78 ~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~v 114 (175) T protein:vir:79 78 ELTAAASRRKAGLMILQDSGQMAASTATDSGEDYSVI 114 (175) T ss_pred cchhhHhhhccCCCcceechhhhhhhhheecCCEEEE Confidence 24568899999999999999986553311 No 16 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=98.49 E-value=2.3e-10 Score=73.38 Aligned_cols=92 Identities=16% Similarity=0.118 Sum_probs=65.4 Q ss_pred hhHHH--HHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHh------c-CCCCCCHHHHHHhhccCCCC Q lcl|NC_020079. 67 ITDGA--VISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAM------Q-RYRPLSPVTIKIRQDKGNYS 137 (162) Q Consensus 67 lr~~~--~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~------~-~~ppnsp~Ti~~k~~k~~~~ 137 (162) |...+ ....+.+.+.+.++.. . + ....+|..||..+...+++.|.+ | .|+|++|+|+++|...+... T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~~-~--~-~~~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~~ 76 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELGT-V--T-RDRAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFVP 76 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHHh-h--h-ccHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCCC Confidence 43222 2233455555544322 1 1 22468999999999999999975 3 58899999999988777677 Q ss_pred CCchhhHHHHHhhceeeeecCCCCC Q lcl|NC_020079. 138 NHILIDTAHMINAIETKITKSKSKK 162 (162) Q Consensus 138 ~~PLidTG~l~~SIty~V~~~~~~~ 162 (162) .+||+|||.|++||+|.+......= T Consensus 77 ~~~L~~tg~L~~Si~~~~~~~~v~v 101 (156) T protein:vir:19 77 GSILTLHGDLARSITTDYGQDYALI 101 (156) T ss_pred CcchhhhHHHHHHhhheecCCEEEE Confidence 8999999999999999875543221 No 17 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.47 E-value=3e-10 Score=72.79 Aligned_cols=91 Identities=13% Similarity=0.107 Sum_probs=64.8 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhc------CCCCCCHHHHHHhhccCCCCCC Q lcl|NC_020079. 66 FITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQ------RYRPLSPVTIKIRQDKGNYSNH 139 (162) Q Consensus 66 Flr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppnsp~Ti~~k~~k~~~~~~ 139 (162) -+.-.+.-+-+++.+.+..++... .+...+|..||..+...+++.|... .|+|++|+|+++|.. ...+ T Consensus 1 M~~i~i~~d~~~~~~~L~~l~~~~---~~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~---~~~~ 74 (190) T protein:vir:99 1 MAGITLEWDGRRALDVLNAGSAAL---GDPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRK---NRDK 74 (190) T ss_pred CceeEEEecHHHHHHHHHHHHHHh---hhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhc---CCCc Confidence 222222223345566666666553 3556789999999999999999774 488999999987653 2468 Q ss_pred chhhHHHHHhhceeeeecCCCCC Q lcl|NC_020079. 140 ILIDTAHMINAIETKITKSKSKK 162 (162) Q Consensus 140 PLidTG~l~~SIty~V~~~~~~~ 162 (162) ||+|||.|++||+|.+....-.= T Consensus 75 ~L~~tg~L~~Si~~~~~~~~v~v 97 (190) T protein:vir:99 75 ILTLDGHLRNLLRYQLDGSELLF 97 (190) T ss_pred cceecHHHHHHHhheecCcEEEE Confidence 99999999999999986543221 No 18 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=98.45 E-value=5.8e-10 Score=71.23 Aligned_cols=81 Identities=17% Similarity=0.242 Sum_probs=57.3 Q ss_pred HHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhc------CCCCCCHHHHHHhhccCCCCCCchhhH Q lcl|NC_020079. 71 AVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQ------RYRPLSPVTIKIRQDKGNYSNHILIDT 144 (162) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppnsp~Ti~~k~~k~~~~~~PLidT 144 (162) +.+....+.+.+++...+ ....|..+|...+..+++.|.+. .|+|+||+|+++|+ +++||+|| T Consensus 1 ~i~~~~~i~~~l~~l~~~------~~~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~-----~~~~L~~t 69 (145) T protein:vir:31 1 MVEDENNIPEAREAIQDG------LTDGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKG-----SDTPLIDN 69 (145) T ss_pred CcccHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhc-----CCCCCccC Confidence 444444455555444332 23457778988999999999652 48999999999887 57899999 Q ss_pred HHHHhhceeeeecC--------CCCC Q lcl|NC_020079. 145 AHMINAIETKITKS--------KSKK 162 (162) Q Consensus 145 G~l~~SIty~V~~~--------~~~~ 162 (162) |.|++||+|.|... |+.. T Consensus 70 G~L~~Si~~~~~~~~~~~~a~vGtn~ 95 (145) T protein:vir:31 70 SRLLTDINAASMMDRANRMAVIGTNL 95 (145) T ss_pred HHHHHHHHHHhhhcccCceeEecCCc Confidence 99999999987532 2222 No 19 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=98.31 E-value=2.3e-09 Score=67.94 Aligned_cols=93 Identities=15% Similarity=0.168 Sum_probs=56.3 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCe------------------------------EEEeecCCCCCCCCCC------- Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQ------------------------------IEAGFLTNRRHPESDL------- 43 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~------------------------------v~VGi~~~~~~~d~g~------- 43 (162) |++.|+.-|.++|...++.|.+..++. +.+-...+ ...++.. T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~-~s~~g~~~~~Vg~~ 79 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPE-ESVEGIQTYAVSWR 79 (157) T ss_pred CeeEeecccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccc-cCCCceEEEEEeec Confidence 999998788777777777665443221 11111110 0001100 Q ss_pred -CHHHHHHHHhcCCc----------------------CCCCCCcchhhHHHHHHHHHHHHHH----HHHHHHHhccch Q lcl|NC_020079. 44 -TIPAIAAIQQYGNE----------------------TNNIPARPFITDGAVISQNNIAKKM----KQVFANYLMHNV 94 (162) Q Consensus 44 -~~A~iA~~~E~G~~----------------------~~~IP~RpFlr~~~~~~~~~~~~~~----~~~~~~~l~G~~ 94 (162) ..+-++.+.|||+. +..+||||||||+|+..+++..+.+ .+.+.+++.|.. T Consensus 80 ~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I~e~l~g~~ 157 (157) T protein:vir:97 80 KKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKYAELQRGDT 157 (157) T ss_pred CCccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHHHHHHHHHHHHHHHHHHHhcCCC Confidence 01344556788832 2458999999999999998877665 456777777766 No 20 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=98.24 E-value=3.7e-09 Score=66.81 Aligned_cols=101 Identities=12% Similarity=0.230 Sum_probs=58.9 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCC------------------------------------------------------ Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQV------------------------------------------------------ 26 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~------------------------------------------------------ 26 (162) |..+|+.+= .+|++|.+.|++|... T Consensus 1 Ma~~~~~~i-~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~ 79 (164) T protein:vir:43 1 MADTVEFSI-TGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTG 79 (164) T ss_pred CCcceEEee-ecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCcccccc Confidence 887765321 2467777766666321 Q ss_pred --eEEEeecCCCCCC-------CCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccchHHH Q lcl|NC_020079. 27 --QIEAGFLTNRRHP-------ESDLTIPAIAAIQQYGNETNNIPARPFITDGAVISQNNIAKKMKQVFANYLMHNVGLA 97 (162) Q Consensus 27 --~v~VGi~~~~~~~-------d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~ 97 (162) ...||+..+.... .++-..+.++.++|||+ .+.||||||||++++++++..+.+...+...| ++ T Consensus 80 ~~~~~vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT--~km~a~PFlrPA~~~~k~~~~~~~~~~l~~~i-----~k 152 (164) T protein:vir:43 80 DLGFRIGVLHGAVLPKKGERSDKTANAPTPHWRLLEFGT--EDMRAQPFMRSALADNIAEVTSTFVSEYEKGI-----DR 152 (164) T ss_pred ceeEEecccccccccccccccccCCCCCcceEEEeecCC--CCCCCCcchhhhHHHhHHHHHHHHHHHHHHHH-----HH Confidence 0112221111000 00112246788999996 48999999999999999998888877776533 24 Q ss_pred HHHHHHHHHHHH Q lcl|NC_020079. 98 VFEPIARASREG 109 (162) Q Consensus 98 ~l~~iG~~~~~~ 109 (162) +|.+.+..++.- T Consensus 153 a~~k~~~~~~~~ 164 (164) T protein:vir:43 153 AIKRAAKKAAQG 164 (164) T ss_pred HHHHHHhhhccC Confidence 444444433333 No 21 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=98.16 E-value=5.3e-09 Score=65.96 Aligned_cols=93 Identities=22% Similarity=0.229 Sum_probs=64.9 Q ss_pred hhHH--HHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhc---CCCCCCHHHHHHhhcc-------- Q lcl|NC_020079. 67 ITDG--AVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQ---RYRPLSPVTIKIRQDK-------- 133 (162) Q Consensus 67 lr~~--~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~---~~ppnsp~Ti~~k~~k-------- 133 (162) |--. +.-.-+++.+.+.++... +.+...+|..||..++...++.|.+. .|.|++|+|++.|..+ T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~---~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~ 77 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQA---GHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNG 77 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHH---hccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhh Confidence 3221 111224455555555554 34567899999999999999999774 5779999999876422 Q ss_pred --------CCCCCCchhhHHHHHhhceeeeecCCCC----C Q lcl|NC_020079. 134 --------GNYSNHILIDTAHMINAIETKITKSKSK----K 162 (162) Q Consensus 134 --------~~~~~~PLidTG~l~~SIty~V~~~~~~----~ 162 (162) ++.+.++|+|||.|++||+|.+.+..-. + T Consensus 78 ~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGtn~ 118 (175) T protein:vir:10 78 ELTAAASRRKAGLMILQDSGQMAASVSTDHDDNSAVIGSNK 118 (175) T ss_pred hhhhhhhhhccCCCcceechhhhhhhheeecCCEEEEecCh Confidence 3456789999999999999998554321 1 No 22 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=98.09 E-value=4.1e-09 Score=66.55 Aligned_cols=101 Identities=13% Similarity=0.176 Sum_probs=52.7 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCC------------------------------------------------------ Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQV------------------------------------------------------ 26 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~------------------------------------------------------ 26 (162) |..+|+.+- .+|++|.+.|+.|... T Consensus 1 Ma~~~~~~i-~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g 79 (179) T protein:vir:18 1 MADSVEVSL-TGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTG 79 (179) T ss_pred CCceEEEEe-ecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeeccccccccccc Confidence 887665321 2466666666655321 Q ss_pred --eEEEeecCCCCC----------------------CCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_020079. 27 --QIEAGFLTNRRH----------------------PESDLTIPAIAAIQQYGNETNNIPARPFITDGAVISQNNIAKKM 82 (162) Q Consensus 27 --~v~VGi~~~~~~----------------------~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~~~ 82 (162) .+.||+..+... ...+-..+.++.+.|||+ .+.||||||||++++++++..+.+ T Consensus 80 ~~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT--~kmpa~PFlrPA~~~~~~~a~~~i 157 (179) T protein:vir:18 80 DLAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGT--EHTSARPILRPAMNGVDNDVINVF 157 (179) T ss_pred ceeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCC--CCCCCCccchhhHHhhHHHHHHHH Confidence 012333211100 000112356667889996 489999999999999998777666 Q ss_pred HHHHHHHhccchHHHHHHHHHHHHHHH Q lcl|NC_020079. 83 KQVFANYLMHNVGLAVFEPIARASREG 109 (162) Q Consensus 83 ~~~~~~~l~G~~~~~~l~~iG~~~~~~ 109 (162) ...+.+.| +++|.+-+...... T Consensus 158 ~~~l~~~i-----~k~lk~~~~~~~~~ 179 (179) T protein:vir:18 158 STEMGKAI-----DRAIRLAMKKGTTA 179 (179) T ss_pred HHHHHHHH-----HHHHHhhcccCCCC Confidence 65554422 12222211111111 No 23 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=97.94 E-value=1e-08 Score=64.41 Aligned_cols=91 Identities=13% Similarity=0.143 Sum_probs=54.1 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCC---eE-------------------------------------------E--Eee Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQV---QI-------------------------------------------E--AGF 32 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~---~v-------------------------------------------~--VGi 32 (162) |..++.+.+ |++|++.|+.|... .+ . |++ T Consensus 2 m~~~~~i~G---ldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~ 78 (148) T protein:vir:93 2 IETLLDFSG---LEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHI 78 (148) T ss_pred cceeeeehh---HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeee Confidence 777777664 56666665555321 00 0 010 Q ss_pred cCCC--CCC------CCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHH Q lcl|NC_020079. 33 LTNR--RHP------ESDLTIPAIAAIQQYGNETNNIPARPFITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEP 101 (162) Q Consensus 33 ~~~~--~~~------d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~ 101 (162) .... ... ..+-..+.++.+.|||+ .+.||||||+|+++++++++.+.+...+...|. .+|.+ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT--~~~pa~PFl~pA~~~~k~~~~~~~~~~~~~~i~-----k~~~k 148 (148) T protein:vir:93 79 RGVNPDTGNSDNTMKADNPRNAFYWRFVEMGT--VNMPPHPFVRPAFDVRSEQAAQVAIARMNRAID-----EVLRR 148 (148) T ss_pred cccccccccccceeecCCCCCcceeeeeccCC--CCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHH-----HHhcC Confidence 0000 000 01123367788899996 589999999999999999888888777766432 22333 No 24 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=97.94 E-value=1.8e-08 Score=62.99 Aligned_cols=92 Identities=10% Similarity=0.085 Sum_probs=50.4 Q ss_pred CCCccccc----ch-hHHHHHHHHHHHhh---------------------CCeEEEeecCCCCCCCCCCCHHHHHHHHhc Q lcl|NC_020079. 1 MESEILPG----DD-TDWETIIKKMMDLE---------------------QVQIEAGFLTNRRHPESDLTIPAIAAIQQY 54 (162) Q Consensus 1 M~~~i~~~----~~-~~l~~l~~~l~~l~---------------------~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~ 54 (162) |...+..+ -- ...+.+.+.++.+. ...+.||+..+.....++.+.+.++.+.|| T Consensus 19 l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~ 98 (140) T protein:vir:80 19 LAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRTKGKADSPSNAFYWRFDEF 98 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceeeeeeecccccccCCCCCcceeeeecc Confidence 21111100 00 01112222222221 123455655433222233456889999999 Q ss_pred CCcCCCCCCcchhhHHHHHHHHHHHHHHHH----HHHHHhccch Q lcl|NC_020079. 55 GNETNNIPARPFITDGAVISQNNIAKKMKQ----VFANYLMHNV 94 (162) Q Consensus 55 G~~~~~IP~RpFlr~~~~~~~~~~~~~~~~----~~~~~l~G~~ 94 (162) |+ .+.||+|||+|+++.+++++.+.++. .+..++.|.. T Consensus 99 GT--~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:80 99 GT--QHMKAQPFMRPAFDASIGEAEGAIRTELARAIDQALGGRR 140 (140) T ss_pred CC--CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 95 58999999999999999887665554 4555666655 No 25 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=97.89 E-value=9.8e-09 Score=64.49 Aligned_cols=80 Identities=14% Similarity=0.155 Sum_probs=50.3 Q ss_pred CCC--cccccchhHHHHHHHHHHHh-----hCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHH Q lcl|NC_020079. 1 MES--EILPGDDTDWETIIKKMMDL-----EQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVI 73 (162) Q Consensus 1 M~~--~i~~~~~~~l~~l~~~l~~l-----~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~ 73 (162) .+. ++-++. -.|...|..- ....+.||- +..+|+||+||....+||+||||-.+... T Consensus 59 ~k~~~~~L~~t----G~L~~Si~~~~~~~~~~~~a~vGt------------n~~YA~~hqfG~~~~~IPaRPfLG~~~~~ 122 (145) T protein:vir:31 59 AKGSDTPLIDN----SRLLTDINAASMMDRANRMAVIGT------------NLDYAEHHEFGAPEAGIPARPIFGPAGAY 122 (145) T ss_pred HhcCCCCCccC----HHHHHHHHHHhhhcccCceeEecC------------CchhhhhhccCCcccccCCCCccCCCccc Confidence 111 111111 1333333321 233455542 24689999999988899999999887776 Q ss_pred HHHHHHHHHHHHHHHHhccchHH Q lcl|NC_020079. 74 SQNNIAKKMKQVFANYLMHNVGL 96 (162) Q Consensus 74 ~~~~~~~~~~~~~~~~l~G~~~~ 96 (162) .++++.+.+...+.+-|.|...+ T Consensus 123 ~~~~~~~ii~~~i~~~L~~~~~~ 145 (145) T protein:vir:31 123 ASQQAPDVIGDEIDTNLEGAVID 145 (145) T ss_pred hHHHHHHHHHHHHHHHhhhhccC Confidence 67778788888887777776555 No 26 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=97.87 E-value=3e-08 Score=61.87 Aligned_cols=91 Identities=9% Similarity=0.035 Sum_probs=49.6 Q ss_pred CCCcccccchhHHHHHHH-HHH--HhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIK-KMM--DLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVISQNN 77 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~-~l~--~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~ 77 (162) |+..+-. ++..|.+-+. ... .-....+.||+..+....-++.+.+.++.+.|||+ .+.||+|||+|++++++++ T Consensus 43 ak~~ap~-~tG~l~~sI~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT--~~~~a~PFl~pA~~~~~~~ 119 (140) T protein:vir:10 43 ARARAPK-KTGKLKRNIVTAALKQKDSPGIATAGVRVRTKGKADSPNNAFYWRFVELGT--QFMKAEPFMRPAFDASIAQ 119 (140) T ss_pred HHHhCCC-ChhhHHHhceecccccccccceeEEeeccccccccCCCCcccccceeccCc--CCCCCCcchhhhHHHHHHH Confidence 2222211 1111111110 000 00112355665543322112345678999999996 5899999999999999887 Q ss_pred HHHHHHHH----HHHHhccch Q lcl|NC_020079. 78 IAKKMKQV----FANYLMHNV 94 (162) Q Consensus 78 ~~~~~~~~----~~~~l~G~~ 94 (162) +.+.+... +.+++.|+. T Consensus 120 ~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 120 AEGAIRTEIARAIDQVVGGGL 140 (140) T ss_pred HHHHHHHHHHHHHHHHhhcCC Confidence 76665554 456666665 No 27 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=97.86 E-value=3.2e-08 Score=61.68 Aligned_cols=92 Identities=10% Similarity=0.079 Sum_probs=50.0 Q ss_pred CCCccccc----ch-hHHHHHHHHHHHh---------------------hCCeEEEeecCCCCCCCCCCCHHHHHHHHhc Q lcl|NC_020079. 1 MESEILPG----DD-TDWETIIKKMMDL---------------------EQVQIEAGFLTNRRHPESDLTIPAIAAIQQY 54 (162) Q Consensus 1 M~~~i~~~----~~-~~l~~l~~~l~~l---------------------~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~ 54 (162) |...+..+ -- ...+.+.+.++.. ....+.||+..+.....++-+.+.++.+.|| T Consensus 19 l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~vg~~~~~~~~~~~~~~~~y~~f~E~ 98 (140) T protein:vir:14 19 LAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRTKGKADSPNNAFYWRFDEF 98 (140) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccceeEEeeeeeccccccCCCCccceeeeecc Confidence 11111100 00 0011111111111 1234567765443222222355788999999 Q ss_pred CCcCCCCCCcchhhHHHHHHHHHHHHHH----HHHHHHHhccch Q lcl|NC_020079. 55 GNETNNIPARPFITDGAVISQNNIAKKM----KQVFANYLMHNV 94 (162) Q Consensus 55 G~~~~~IP~RpFlr~~~~~~~~~~~~~~----~~~~~~~l~G~~ 94 (162) |+ .+.||||||+|+++.+++++.+.+ ++.+..++.|.. T Consensus 99 GT--~~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:14 99 GT--QHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred cc--CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 95 689999999999999887766554 455566666666 No 28 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=97.85 E-value=2.7e-08 Score=62.09 Aligned_cols=88 Identities=9% Similarity=0.100 Sum_probs=49.5 Q ss_pred CCCcccccchhHHHHHHHHHHH------hhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMD------LEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVIS 74 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~------l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~ 74 (162) ++..+- .++-.+ .+.|.. -....+.||+.......-++-+.+.++.+.|||+ .+.||+|||+|+++++ T Consensus 43 ak~~aP-~~tG~l---~~sI~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~y~~f~E~GT--~~~~a~PFl~pA~~~~ 116 (140) T protein:vir:10 43 ARKRAP-KKTGKL---RRNIVSAALRQKDAPGLATAGVRVRTKGKADSPNNAFYWRFDEFGT--QHMKAQPFMRPAFDAS 116 (140) T ss_pred HHHhCC-CChhhH---HHhccccccccccccceEEeeeeeccccccCCCCccceeeeeccCC--CCCCCCcchhhhHHHH Confidence 222221 111111 111110 0123456666533221112235578899999996 5899999999999999 Q ss_pred HHHHHHHHH----HHHHHHhccch Q lcl|NC_020079. 75 QNNIAKKMK----QVFANYLMHNV 94 (162) Q Consensus 75 ~~~~~~~~~----~~~~~~l~G~~ 94 (162) ++++.+.+. +.+..++.|.. T Consensus 117 ~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 117 IGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred HHHHHHHHHHHHHHHHHHHhhccC Confidence 987766555 45556666666 No 29 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=97.83 E-value=2.1e-08 Score=62.71 Aligned_cols=79 Identities=13% Similarity=0.228 Sum_probs=53.5 Q ss_pred CCCcccccchhHHHHHHHHHHH-------hhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMD-------LEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVI 73 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~-------l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~ 73 (162) |+...-.+... -..+.+.|.. -....|.||+-. +.+.++.+.|||+ .+.||||||+|++++ T Consensus 42 ~k~~ap~~~~~-tg~l~~~I~~~~~k~~~~g~~~v~Vg~~~---------~~~~y~~f~E~GT--~~~~a~Pf~~pa~~~ 109 (127) T protein:vir:12 42 QRSHVNRSDKK-QPHMQDNITVSNVRESKDGVRFVAVGPNK---------KVAYRGRFLEWGT--SKMPPQPFIEKGGKE 109 (127) T ss_pred HHHhCCCCCCC-hhHHHHhhhccccccccCceeEEEEeeCC---------CCcceeeeeccCc--cCCCCCccchHhHHH Confidence 44333322110 1223333321 123467888733 2357788999996 578999999999999 Q ss_pred HHHHHHHHHHHHHHHHhc Q lcl|NC_020079. 74 SQNNIAKKMKQVFANYLM 91 (162) Q Consensus 74 ~~~~~~~~~~~~~~~~l~ 91 (162) +++++.+.+.+.+...|. T Consensus 110 ~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 110 GEGPAVELMERILTAPIK 127 (127) T ss_pred HHHHHHHHHHHHHHHhcC Confidence 999999999999999888 No 30 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=97.80 E-value=4.8e-08 Score=60.69 Aligned_cols=90 Identities=10% Similarity=0.233 Sum_probs=59.2 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCe------------------------EEEeecCCC------CCCCCC-----CCH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQ------------------------IEAGFLTNR------RHPESD-----LTI 45 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~------------------------v~VGi~~~~------~~~d~g-----~~~ 45 (162) |..+++++- .++++|.+.|+.+.+.. +.-|.+.++ ...+++ -+. T Consensus 1 Ma~~~~i~~-~Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~~~~~~~~v~~~ 79 (125) T protein:vir:94 1 MANDFNIKF-KGVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVKEEHGVVTGRYVAR 79 (125) T ss_pred CCCceeeee-hhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecceeccCCcEEEEeeCC Confidence 888766443 25777777766653310 111211111 001111 144 Q ss_pred HHHHHHHhcCCcCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_020079. 46 PAIAAIQQYGNETNNIPARPFITDGAVISQNNIAKKMKQVFANYLMHN 93 (162) Q Consensus 46 A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~ 93 (162) +.+|.+.|||+ ...|+||||+|++++++.++.+.++..+..++.-- T Consensus 80 ~~Ya~~vEfGT--~~~~a~Pfl~pa~~~~~~~~~~~l~~~l~~a~k~~ 125 (125) T protein:vir:94 80 ADYSSYNEYGT--YRMSAQPFMAPSVAAMTPFFYKAVRDALNKAAKFS 125 (125) T ss_pred CCccceeeccc--ccCCCCcccchhHHHHHHHHHHHHHHHHHHHhccC Confidence 67899999996 57899999999999999999999999998877443 No 31 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=97.80 E-value=5.5e-08 Score=60.37 Aligned_cols=84 Identities=15% Similarity=0.206 Sum_probs=54.4 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCC------------------------eEEEeecCCC-CCCCCC-----CCHHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQV------------------------QIEAGFLTNR-RHPESD-----LTIPAIAA 50 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~------------------------~v~VGi~~~~-~~~d~g-----~~~A~iA~ 50 (162) |+.+|. ++++|.+.|+.+.+. -|.-|.+..+ ...++| .+.+.+|. T Consensus 1 msi~i~-----Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~g~~~~V~~~~~Ya~ 75 (114) T protein:vir:95 1 MAIKWQ-----GIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHITTSYPGMEAHIHGEAGYDG 75 (114) T ss_pred Ceeeee-----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhceeeecCceEEEeecCCCccc Confidence 776664 345555555444321 0111222111 000111 14467899 Q ss_pred HHhcCCcCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020079. 51 IQQYGNETNNIPARPFITDGAVISQNNIAKKMKQVFANYLM 91 (162) Q Consensus 51 ~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~ 91 (162) +.|||+ ...|++|||+|+++.++.++.+.++..++.-+. T Consensus 76 yvE~GT--~~~~aqPfl~pa~~~~~~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 76 YQEYGT--RFQPGTPHFRPMMEQIQPQFQKDMTDVMKGAFK 114 (114) T ss_pred eeecCc--cccCCCccchhhHHHHHHHHHHHHHHHHHhhcC Confidence 999996 578999999999999999999999999998777 No 32 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=97.79 E-value=3.2e-08 Score=61.69 Aligned_cols=82 Identities=15% Similarity=0.303 Sum_probs=53.9 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCe----------------------EEEeecCCC---CCCCCC-----CCHHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQ----------------------IEAGFLTNR---RHPESD-----LTIPAIAA 50 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~----------------------v~VGi~~~~---~~~d~g-----~~~A~iA~ 50 (162) |+.+|.... +++|++.|.++.... |.-|-+..+ ...+++ .+.+.+|. T Consensus 1 M~~~i~i~G---ld~l~~~L~~~~~~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~ 77 (112) T protein:vir:36 1 MKSSLSFKG---IDQLVKHLDKAASLKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMELTEGGFSGQAGPHTDYSA 77 (112) T ss_pred Cceeeeehh---HHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeeecCCceEEEeecCCCccc Confidence 888887664 566666655443221 111111111 011222 14577899 Q ss_pred HHhcCCcCCCCCCcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 51 IQQYGNETNNIPARPFITDGAVISQNNIAKKMKQVFA 87 (162) Q Consensus 51 ~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 87 (162) +.|||+ ...|+||||+|+++.++.++.+.+++.++ T Consensus 78 ~vE~GT--~k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 78 YVEYGT--RFQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred eeeccc--cccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 999996 57899999999999999999888888887 No 33 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=97.79 E-value=1.8e-08 Score=63.07 Aligned_cols=91 Identities=13% Similarity=0.161 Sum_probs=52.4 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCC---eE-----E-----------EeecCC-------------------------- Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQV---QI-----E-----------AGFLTN-------------------------- 35 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~---~v-----~-----------VGi~~~-------------------------- 35 (162) |..+|..++ |++|++.|+.|... .+ . --.|.. T Consensus 2 m~~~~~i~G---l~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~ 78 (149) T protein:vir:19 2 IETSLDFSG---LNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) T ss_pred cceeeehhh---HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeeccc Confidence 777777664 55555555544321 00 0 000000 Q ss_pred ----CC--CC------CCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHH Q lcl|NC_020079. 36 ----RR--HP------ESDLTIPAIAAIQQYGNETNNIPARPFITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEP 101 (162) Q Consensus 36 ----~~--~~------d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~ 101 (162) .. .. ..+-+.+.++.+.|||+ .+.||+|||+|+++++++++.+.+...+.+.|. ++|.+ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT--~~~~a~PF~~pA~~~~k~~~~~~~~~~l~~~l~-----k~~~k 149 (149) T protein:vir:19 79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVELGT--ANMPAHPFVRPAYDTREEEAASVAIARMNQAID-----EVLSK 149 (149) T ss_pred ccccccccccccceeecCCCCccceeeeeccCC--CCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH-----HHhcC Confidence 00 00 00112356788899995 689999999999999999888777777765431 22222 No 34 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=97.71 E-value=6.2e-08 Score=60.09 Aligned_cols=88 Identities=15% Similarity=0.190 Sum_probs=48.9 Q ss_pred CCCcccccchh------------HHHHHHH--HHHH-hhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcc Q lcl|NC_020079. 1 MESEILPGDDT------------DWETIIK--KMMD-LEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARP 65 (162) Q Consensus 1 M~~~i~~~~~~------------~l~~l~~--~l~~-l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~Rp 65 (162) |+..+-+.++. ++..-+. .++. -....+.||+..+.. +.+.++.+.|||+ .+.||+| T Consensus 47 ~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~~~~~g~~~~~VG~~~~~~------~~~~y~~f~E~GT--~k~~a~p 118 (149) T protein:vir:13 47 VAPLIHISDDNSKSGRKGSRPPGHAANNIPEPKIRKKKGNLQCVVGWEKSDN------TPFYYMKMEEWGT--SERPPHH 118 (149) T ss_pred HHHhCCccCCccccccccccccchhhhcceecccccccceeEEEeeccCCCC------CccceeeeeccCc--cCCCCCc Confidence 22222111100 1110000 1111 122357899865321 2367889999996 5789999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhccchHH Q lcl|NC_020079. 66 FITDGAVISQNNIAKKMKQVFANYLMHNVGL 96 (162) Q Consensus 66 Flr~~~~~~~~~~~~~~~~~~~~~l~G~~~~ 96 (162) ||||+++++++++.+.+...+.+.|.-...+ T Consensus 119 F~~pa~~~~~~~~~~~~~~~l~k~i~~~lG~ 149 (149) T protein:vir:13 119 AFGKTNKILKRVYDNIAQKKYDNFVKEKLGD 149 (149) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 9999999999998877777665544221111 No 35 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=97.68 E-value=1e-07 Score=58.96 Aligned_cols=86 Identities=15% Similarity=0.234 Sum_probs=49.4 Q ss_pred CCCcccccch-------------hHHHHHH--HHH-HHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCc Q lcl|NC_020079. 1 MESEILPGDD-------------TDWETII--KKM-MDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPAR 64 (162) Q Consensus 1 M~~~i~~~~~-------------~~l~~l~--~~l-~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~R 64 (162) |+..+-+... ..+..-+ ... ..-+...+.||+-... + +.+.++.+.|||+ .+.||+ T Consensus 45 ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g~~~~~vg~~~~~----~--~~~~y~~f~E~GT--~~~~a~ 116 (146) T protein:vir:10 45 IAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGGIKTVKIGLNKAD----R--SPWFYLKFHEWGT--SKMPAH 116 (146) T ss_pred HHHhCCCccccccccccccccccccccccceeccccccccceeEEeeeccCC----C--CCcceeeeeccCC--CCCCCC Confidence 2222211000 0000000 001 1123456788874422 1 2367899999996 588999 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHH Q lcl|NC_020079. 65 PFITDGAVISQNNIAKKMKQVFANYLMHNVGLAVF 99 (162) Q Consensus 65 pFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l 99 (162) |||+|+++.+++++.+.+...+.+.|. .+| T Consensus 117 PFl~pa~~~~k~~~~~~~~~~l~~~l~-----ka~ 146 (146) T protein:vir:10 117 PFIEPGFNASKAEAVRAMTDILKNEMR-----LDL 146 (146) T ss_pred cchhHHHHHhHHHHHHHHHHHHHHHHh-----hcC Confidence 999999999999988888887776542 112 No 36 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=97.68 E-value=1e-07 Score=58.96 Aligned_cols=86 Identities=15% Similarity=0.234 Sum_probs=49.4 Q ss_pred CCCcccccch-------------hHHHHHH--HHH-HHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCc Q lcl|NC_020079. 1 MESEILPGDD-------------TDWETII--KKM-MDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPAR 64 (162) Q Consensus 1 M~~~i~~~~~-------------~~l~~l~--~~l-~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~R 64 (162) |+..+-+... ..+..-+ ... ..-+...+.||+-... + +.+.++.+.|||+ .+.||+ T Consensus 45 ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g~~~~~vg~~~~~----~--~~~~y~~f~E~GT--~~~~a~ 116 (146) T protein:vir:10 45 IAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGGIKTVKIGLNKAD----R--SPWFYLKFHEWGT--SKMPAH 116 (146) T ss_pred HHHhCCCccccccccccccccccccccccceeccccccccceeEEeeeccCC----C--CCcceeeeeccCC--CCCCCC Confidence 2222211000 0000000 001 1123456788874422 1 2367899999996 588999 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHH Q lcl|NC_020079. 65 PFITDGAVISQNNIAKKMKQVFANYLMHNVGLAVF 99 (162) Q Consensus 65 pFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l 99 (162) |||+|+++.+++++.+.+...+.+.|. .+| T Consensus 117 PFl~pa~~~~k~~~~~~~~~~l~~~l~-----ka~ 146 (146) T protein:vir:10 117 PFIEPGFNASKAEAVRAMTDILKNEMR-----LDL 146 (146) T ss_pred cchhHHHHHhHHHHHHHHHHHHHHHHh-----hcC Confidence 999999999999988888887776542 112 No 37 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=97.68 E-value=1e-07 Score=58.96 Aligned_cols=86 Identities=15% Similarity=0.234 Sum_probs=49.4 Q ss_pred CCCcccccch-------------hHHHHHH--HHH-HHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCc Q lcl|NC_020079. 1 MESEILPGDD-------------TDWETII--KKM-MDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPAR 64 (162) Q Consensus 1 M~~~i~~~~~-------------~~l~~l~--~~l-~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~R 64 (162) |+..+-+... ..+..-+ ... ..-+...+.||+-... + +.+.++.+.|||+ .+.||+ T Consensus 45 ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g~~~~~vg~~~~~----~--~~~~y~~f~E~GT--~~~~a~ 116 (146) T protein:vir:10 45 IAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGGIKTVKIGLNKAD----R--SPWFYLKFHEWGT--SKMPAH 116 (146) T ss_pred HHHhCCCccccccccccccccccccccccceeccccccccceeEEeeeccCC----C--CCcceeeeeccCC--CCCCCC Confidence 2222211000 0000000 001 1123456788874422 1 2367899999996 588999 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHH Q lcl|NC_020079. 65 PFITDGAVISQNNIAKKMKQVFANYLMHNVGLAVF 99 (162) Q Consensus 65 pFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l 99 (162) |||+|+++.+++++.+.+...+.+.|. .+| T Consensus 117 PFl~pa~~~~k~~~~~~~~~~l~~~l~-----ka~ 146 (146) T protein:vir:10 117 PFIEPGFNASKAEAVRAMTDILKNEMR-----LDL 146 (146) T ss_pred cchhHHHHHhHHHHHHHHHHHHHHHHh-----hcC Confidence 999999999999988888887776542 112 No 38 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=97.68 E-value=1e-07 Score=58.96 Aligned_cols=86 Identities=15% Similarity=0.234 Sum_probs=49.4 Q ss_pred CCCcccccch-------------hHHHHHH--HHH-HHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCc Q lcl|NC_020079. 1 MESEILPGDD-------------TDWETII--KKM-MDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPAR 64 (162) Q Consensus 1 M~~~i~~~~~-------------~~l~~l~--~~l-~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~R 64 (162) |+..+-+... ..+..-+ ... ..-+...+.||+-... + +.+.++.+.|||+ .+.||+ T Consensus 45 ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~g~~~~~vg~~~~~----~--~~~~y~~f~E~GT--~~~~a~ 116 (146) T protein:vir:10 45 IAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEGGIKTVKIGLNKAD----R--SPWFYLKFHEWGT--SKMPAH 116 (146) T ss_pred HHHhCCCccccccccccccccccccccccceeccccccccceeEEeeeccCC----C--CCcceeeeeccCC--CCCCCC Confidence 2222211000 0000000 001 1123456788874422 1 2367899999996 588999 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHH Q lcl|NC_020079. 65 PFITDGAVISQNNIAKKMKQVFANYLMHNVGLAVF 99 (162) Q Consensus 65 pFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l 99 (162) |||+|+++.+++++.+.+...+.+.|. .+| T Consensus 117 PFl~pa~~~~k~~~~~~~~~~l~~~l~-----ka~ 146 (146) T protein:vir:10 117 PFIEPGFNASKAEAVRAMTDILKNEMR-----LDL 146 (146) T ss_pred cchhHHHHHhHHHHHHHHHHHHHHHHh-----hcC Confidence 999999999999988888887776542 112 No 39 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=97.60 E-value=2.5e-07 Score=56.77 Aligned_cols=90 Identities=17% Similarity=0.247 Sum_probs=48.9 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCC---e-----EE-------------EeecCCC--------------CCCCC---- Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQV---Q-----IE-------------AGFLTNR--------------RHPES---- 41 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~---~-----v~-------------VGi~~~~--------------~~~d~---- 41 (162) |..++...+ |++|++.|+.|... . +. +.+.... +..++ T Consensus 1 M~~~~~i~G---l~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~~~~~~~v 77 (135) T protein:vir:57 1 MIPEIEISG---LQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRGKAGSTVV 77 (135) T ss_pred Cceeeeehh---HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhcccccccccccceeE Confidence 888877664 55555555554321 0 00 1010000 00000 Q ss_pred ----CCCHHH--HHHHHhcCCcCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHH Q lcl|NC_020079. 42 ----DLTIPA--IAAIQQYGNETNNIPARPFITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIAR 104 (162) Q Consensus 42 ----g~~~A~--iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~ 104 (162) |.+-.. ++.+.|||+ .+.||+|||+|+++++++++.+.+...+... |++++. T Consensus 78 ~v~vg~~~~~~~~~~f~E~GT--~~~~a~PF~~pa~~~~~~~~~~~~~~~~~~~---------l~ka~r 135 (135) T protein:vir:57 78 VLRVGPTRSHYMKALAQEFGT--IKQVAKPFIRPALDYNKMQVLRILTVEIRDG---------LSTLSR 135 (135) T ss_pred EEEecCCCCcceeEeecccCC--CCCCCCcchhHhHHHhHHHHHHHHHHHHHHH---------HHHhcC Confidence 001112 233349996 5789999999999999998888777776653 233332 No 40 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=97.60 E-value=7.5e-08 Score=59.63 Aligned_cols=80 Identities=11% Similarity=0.078 Sum_probs=49.7 Q ss_pred CCCcccccc--hhHHHHHHHHHH------HhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHH Q lcl|NC_020079. 1 MESEILPGD--DTDWETIIKKMM------DLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAV 72 (162) Q Consensus 1 M~~~i~~~~--~~~l~~l~~~l~------~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~ 72 (162) |+..+-..+ ...-..+...+. .-....+.||+..+ .+.++.+.|||+ .+.||+|||+++++ T Consensus 41 ~k~~ap~~~~~~~~~~h~~d~I~~~~~k~~~g~~~~~VG~~k~---------~~~y~~f~E~GT--~k~~a~pF~~pa~~ 109 (128) T protein:vir:38 41 LKSNTPEWDGETDMSGHLRDDIKLSSVRETSGLTEVDVGYGKD---------TGWRAHFPNSGT--SMQDPQHFIEETQE 109 (128) T ss_pred HHHhCCCcCCCCcccchhhhhhccccccccCceeEEEeeecCC---------CceEEeeeccCc--cCCCCCcchhHHHH Confidence 322221110 000011111211 11224578888322 246789999996 58899999999999 Q ss_pred HHHHHHHHHHHHHHHHHhc Q lcl|NC_020079. 73 ISQNNIAKKMKQVFANYLM 91 (162) Q Consensus 73 ~~~~~~~~~~~~~~~~~l~ 91 (162) ++++++.+.+.+.+++.|. T Consensus 110 ~~~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 110 IMRPVVIAAFLSHLKEGGM 128 (128) T ss_pred HhHHHHHHHHHHHHHhhcC Confidence 9999999999999888555 No 41 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=97.59 E-value=1.2e-07 Score=58.57 Aligned_cols=81 Identities=10% Similarity=-0.050 Sum_probs=52.6 Q ss_pred CCCcccccchhHHHHHHHHHHH-------hhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMD-------LEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVI 73 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~-------l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~ 73 (162) |+..+-..+...-..+.+.+.- .....+.|||..+ .+.++.+.|||+ .+.||+|||++++++ T Consensus 38 ~k~~ap~~~~~~~~hl~d~I~~~~~k~~~~g~~~~~VG~~k~---------~~~y~~f~E~GT--~k~~~~pF~~pa~~~ 106 (125) T protein:vir:97 38 LKANTPVYEVETDERLQEDTVISGFKGANVGIVSKEIGYGKA---------TGWRAHYPNDGT--IYQRGQDFKERTINQ 106 (125) T ss_pred HHHhCCcCCCCchhhHHhhhhcccccccccCceEEEEeecCC---------CceeEeeeccCc--cCCCcCccchHhHHH Confidence 4443332221111122222221 1223678888432 257899999996 588999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcc Q lcl|NC_020079. 74 SQNNIAKKMKQVFANYLMH 92 (162) Q Consensus 74 ~~~~~~~~~~~~~~~~l~G 92 (162) +++++.+.+...+.+.|.= T Consensus 107 ~k~~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 107 MTPKAKQLYAEKVKEGLGL 125 (125) T ss_pred hHHHHHHHHHHHHHHHhcC Confidence 9999999999999886633 No 42 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=97.58 E-value=5.3e-07 Score=54.99 Aligned_cols=87 Identities=13% Similarity=0.133 Sum_probs=63.1 Q ss_pred HHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhc------CCCCCCHHHHHHhhccCCCCCCchhhH Q lcl|NC_020079. 71 AVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQ------RYRPLSPVTIKIRQDKGNYSNHILIDT 144 (162) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppnsp~Ti~~k~~k~~~~~~PLidT 144 (162) +++ -.++...+..++.+ |.......+|..||..+....++.|.+. .|+|+++.|++.|+.+ ..+||+++ T Consensus 1 m~d-~~~l~~~L~~ll~~-L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~---~~~~l~~~ 75 (149) T protein:vir:98 1 MSE-LTALQERLTGLIAS-LSPAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGR---IRREMFAR 75 (149) T ss_pred Cch-HHHHHHHHHHHHHh-cCchhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCC---CCcccchh Confidence 232 23444555555544 2334456789999999999999999763 5899999998766633 46799999 Q ss_pred HHHHhhceeeeecCCCC------C Q lcl|NC_020079. 145 AHMINAIETKITKSKSK------K 162 (162) Q Consensus 145 G~l~~SIty~V~~~~~~------~ 162 (162) |.|.+||+|.+...+.. - T Consensus 76 g~l~~sl~~~~~~~~~~V~~~Gs~ 99 (149) T protein:vir:98 76 LRTNRFMKAKGSDSAAVVEFTGRV 99 (149) T ss_pred hhhhhhhhheecCCeeEEEecCcc Confidence 99999999988877532 2 No 43 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=97.55 E-value=1.4e-07 Score=58.13 Aligned_cols=77 Identities=9% Similarity=0.232 Sum_probs=44.9 Q ss_pred hhHHHHHHHHHHHhhCC------------------------eEEEeecCCC---CCCCC----CCCHHHHHHHHhcCCcC Q lcl|NC_020079. 10 DTDWETIIKKMMDLEQV------------------------QIEAGFLTNR---RHPES----DLTIPAIAAIQQYGNET 58 (162) Q Consensus 10 ~~~l~~l~~~l~~l~~~------------------------~v~VGi~~~~---~~~d~----g~~~A~iA~~~E~G~~~ 58 (162) ..++++|.+.|+.+... -|.-|.+..+ ...++ -.+.+.+|.+.|||+ T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~v~~~~~Ya~~vE~GT-- 78 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQQRLLHYRVVSPALYSIYLELGT-- 78 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeeecCcEEEEeecCcccchhcccCc-- Confidence 11222222222222110 0111211111 00000 024478899999996 Q ss_pred CCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 59 NNIPARPFITDGAVISQNNIAKKMKQVFAN 88 (162) Q Consensus 59 ~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (162) ...|+||||+|+++.++..+.+.+++.++. T Consensus 79 ~~m~a~Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 79 RKMEAQSFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred cccCCCcchhhhHHHHHHHHHHHHHHHhcC Confidence 578999999999999999999999999888 No 44 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=97.55 E-value=7.3e-08 Score=59.72 Aligned_cols=73 Identities=14% Similarity=0.136 Sum_probs=49.0 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHHHHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVISQNNIAK 80 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~ 80 (162) |+-.|.+... +.-..-....|.||+..+. +.+|...|||+ .+.||+||||+++++++++..+ T Consensus 53 l~d~I~vs~~-------k~~~~~g~~~v~VG~~k~~---------~~~a~F~E~GT--~k~~a~pF~~~a~~~~~~ev~~ 114 (125) T protein:vir:94 53 ARDHIAVSNV-------KTDRHTSEKIVTIGYAKGV---------SHRIHATEFGT--MYQKPQLFITKTEKQGKNKVLK 114 (125) T ss_pred hhhheeeccc-------ccccccceEEEEeccCCCC---------ceEEEeccCCc--cCCCCCchhhHHHHHhHHHHHH Confidence 3333332210 0000012235667763321 35677899995 6899999999999999999999 Q ss_pred HHHHHHHHHhc Q lcl|NC_020079. 81 KMKQVFANYLM 91 (162) Q Consensus 81 ~~~~~~~~~l~ 91 (162) .+...++.+.. T Consensus 115 ~~~~~lrk~~k 125 (125) T protein:vir:94 115 TMLDTAKRLQK 125 (125) T ss_pred HHHHHHHHHhC Confidence 99999999877 No 45 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=97.55 E-value=7.3e-08 Score=59.72 Aligned_cols=73 Identities=14% Similarity=0.136 Sum_probs=49.0 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHHHHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVISQNNIAK 80 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~ 80 (162) |+-.|.+... +.-..-....|.||+..+. +.+|...|||+ .+.||+||||+++++++++..+ T Consensus 53 l~d~I~vs~~-------k~~~~~g~~~v~VG~~k~~---------~~~a~F~E~GT--~k~~a~pF~~~a~~~~~~ev~~ 114 (125) T protein:vir:47 53 ARDHIAVSNV-------KTDRHTSEKIVTIGYAKGV---------SHRIHATEFGT--MYQKPQLFITKTEKQGKNKVLK 114 (125) T ss_pred hhhheeeccc-------ccccccceEEEEeccCCCC---------ceEEEeccCCc--cCCCCCchhhHHHHHhHHHHHH Confidence 3333332210 0000012235667763321 35677899995 6899999999999999999999 Q ss_pred HHHHHHHHHhc Q lcl|NC_020079. 81 KMKQVFANYLM 91 (162) Q Consensus 81 ~~~~~~~~~l~ 91 (162) .+...++.+.. T Consensus 115 ~~~~~lrk~~k 125 (125) T protein:vir:47 115 TMLDTAKRLQK 125 (125) T ss_pred HHHHHHHHHhC Confidence 99999999877 No 46 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=97.55 E-value=7.3e-08 Score=59.72 Aligned_cols=73 Identities=14% Similarity=0.136 Sum_probs=49.0 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHHHHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVISQNNIAK 80 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~ 80 (162) |+-.|.+... +.-..-....|.||+..+. +.+|...|||+ .+.||+||||+++++++++..+ T Consensus 53 l~d~I~vs~~-------k~~~~~g~~~v~VG~~k~~---------~~~a~F~E~GT--~k~~a~pF~~~a~~~~~~ev~~ 114 (125) T protein:vir:81 53 ARDHIAVSNV-------KTDRHTSEKIVTIGYAKGV---------SHRIHATEFGT--MYQKPQLFITKTEKQGKNKVLK 114 (125) T ss_pred hhhheeeccc-------ccccccceEEEEeccCCCC---------ceEEEeccCCc--cCCCCCchhhHHHHHhHHHHHH Confidence 3333332210 0000012235667763321 35677899995 6899999999999999999999 Q ss_pred HHHHHHHHHhc Q lcl|NC_020079. 81 KMKQVFANYLM 91 (162) Q Consensus 81 ~~~~~~~~~l~ 91 (162) .+...++.+.. T Consensus 115 ~~~~~lrk~~k 125 (125) T protein:vir:81 115 TMLDTAKRLQK 125 (125) T ss_pred HHHHHHHHHhC Confidence 99999999877 No 47 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=97.55 E-value=7.3e-08 Score=59.72 Aligned_cols=73 Identities=14% Similarity=0.136 Sum_probs=49.0 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHHHHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVISQNNIAK 80 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~ 80 (162) |+-.|.+... +.-..-....|.||+..+. +.+|...|||+ .+.||+||||+++++++++..+ T Consensus 53 l~d~I~vs~~-------k~~~~~g~~~v~VG~~k~~---------~~~a~F~E~GT--~k~~a~pF~~~a~~~~~~ev~~ 114 (125) T protein:vir:98 53 ARDHIAVSNV-------KTDRHTSEKIVTIGYAKGV---------SHRIHATEFGT--MYQKPQLFITKTEKQGKNKVLK 114 (125) T ss_pred hhhheeeccc-------ccccccceEEEEeccCCCC---------ceEEEeccCCc--cCCCCCchhhHHHHHhHHHHHH Confidence 3333332210 0000012235667763321 35677899995 6899999999999999999999 Q ss_pred HHHHHHHHHhc Q lcl|NC_020079. 81 KMKQVFANYLM 91 (162) Q Consensus 81 ~~~~~~~~~l~ 91 (162) .+...++.+.. T Consensus 115 ~~~~~lrk~~k 125 (125) T protein:vir:98 115 TMLDTAKRLQK 125 (125) T ss_pred HHHHHHHHHhC Confidence 99999999877 No 48 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=97.55 E-value=7.3e-08 Score=59.72 Aligned_cols=73 Identities=14% Similarity=0.136 Sum_probs=49.0 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHHHHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVISQNNIAK 80 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~ 80 (162) |+-.|.+... +.-..-....|.||+..+. +.+|...|||+ .+.||+||||+++++++++..+ T Consensus 53 l~d~I~vs~~-------k~~~~~g~~~v~VG~~k~~---------~~~a~F~E~GT--~k~~a~pF~~~a~~~~~~ev~~ 114 (125) T protein:vir:79 53 ARDHIAVSNV-------KTDRHTSEKIVTIGYAKGV---------SHRIHATEFGT--MYQKPQLFITKTEKQGKNKVLK 114 (125) T ss_pred hhhheeeccc-------ccccccceEEEEeccCCCC---------ceEEEeccCCc--cCCCCCchhhHHHHHhHHHHHH Confidence 3333332210 0000012235667763321 35677899995 6899999999999999999999 Q ss_pred HHHHHHHHHhc Q lcl|NC_020079. 81 KMKQVFANYLM 91 (162) Q Consensus 81 ~~~~~~~~~l~ 91 (162) .+...++.+.. T Consensus 115 ~~~~~lrk~~k 125 (125) T protein:vir:79 115 TMLDTAKRLQK 125 (125) T ss_pred HHHHHHHHHhC Confidence 99999999877 No 49 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=97.53 E-value=1.7e-07 Score=57.68 Aligned_cols=85 Identities=12% Similarity=0.181 Sum_probs=46.2 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCe----EEEeecCCC--CCCCCC-----CCHHHHHHHHhcCCcCCCCCCcchhhH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQ----IEAGFLTNR--RHPESD-----LTIPAIAAIQQYGNETNNIPARPFITD 69 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~----v~VGi~~~~--~~~d~g-----~~~A~iA~~~E~G~~~~~IP~RpFlr~ 69 (162) +.-.+...=......+....+...... +.-|.+.++ ..-+++ .+.+++|.+.|||+ ...|+||||+| T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT--~km~a~Pfl~P 97 (115) T protein:vir:97 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGT--RYMEAEPFMWP 97 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchhhhcccc--cccCCCCchhh Confidence 111111000011233444443333110 111111110 000111 13367899999995 68999999999 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 70 GAVISQNNIAKKMKQVFA 87 (162) Q Consensus 70 ~~~~~~~~~~~~~~~~~~ 87 (162) +++.++..+.+.++++++ T Consensus 98 A~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 98 VYEVIRKSTVEELKALFE 115 (115) T ss_pred hHHHHHHHHHHHHHHHhC Confidence 999999999999998888 No 50 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=97.53 E-value=1.7e-07 Score=57.68 Aligned_cols=85 Identities=12% Similarity=0.181 Sum_probs=46.2 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCe----EEEeecCCC--CCCCCC-----CCHHHHHHHHhcCCcCCCCCCcchhhH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQ----IEAGFLTNR--RHPESD-----LTIPAIAAIQQYGNETNNIPARPFITD 69 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~----v~VGi~~~~--~~~d~g-----~~~A~iA~~~E~G~~~~~IP~RpFlr~ 69 (162) +.-.+...=......+....+...... +.-|.+.++ ..-+++ .+.+++|.+.|||+ ...|+||||+| T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT--~km~a~Pfl~P 97 (115) T protein:vir:96 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGT--RYMEAEPFMWP 97 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchhhhcccc--cccCCCCchhh Confidence 111111000011233444443333110 111111110 000111 13367899999995 68999999999 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 70 GAVISQNNIAKKMKQVFA 87 (162) Q Consensus 70 ~~~~~~~~~~~~~~~~~~ 87 (162) +++.++..+.+.++++++ T Consensus 98 A~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 98 VYEVIRKSTVEELKALFE 115 (115) T ss_pred hHHHHHHHHHHHHHHHhC Confidence 999999999999998888 No 51 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=97.53 E-value=1.7e-07 Score=57.68 Aligned_cols=85 Identities=12% Similarity=0.181 Sum_probs=46.2 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCe----EEEeecCCC--CCCCCC-----CCHHHHHHHHhcCCcCCCCCCcchhhH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQ----IEAGFLTNR--RHPESD-----LTIPAIAAIQQYGNETNNIPARPFITD 69 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~----v~VGi~~~~--~~~d~g-----~~~A~iA~~~E~G~~~~~IP~RpFlr~ 69 (162) +.-.+...=......+....+...... +.-|.+.++ ..-+++ .+.+++|.+.|||+ ...|+||||+| T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT--~km~a~Pfl~P 97 (115) T protein:vir:96 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGT--RYMEAEPFMWP 97 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchhhhcccc--cccCCCCchhh Confidence 111111000011233444443333110 111111110 000111 13367899999995 68999999999 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 70 GAVISQNNIAKKMKQVFA 87 (162) Q Consensus 70 ~~~~~~~~~~~~~~~~~~ 87 (162) +++.++..+.+.++++++ T Consensus 98 A~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 98 VYEVIRKSTVEELKALFE 115 (115) T ss_pred hHHHHHHHHHHHHHHHhC Confidence 999999999999998888 No 52 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=97.53 E-value=1.7e-07 Score=57.68 Aligned_cols=85 Identities=12% Similarity=0.181 Sum_probs=46.2 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCe----EEEeecCCC--CCCCCC-----CCHHHHHHHHhcCCcCCCCCCcchhhH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQ----IEAGFLTNR--RHPESD-----LTIPAIAAIQQYGNETNNIPARPFITD 69 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~----v~VGi~~~~--~~~d~g-----~~~A~iA~~~E~G~~~~~IP~RpFlr~ 69 (162) +.-.+...=......+....+...... +.-|.+.++ ..-+++ .+.+++|.+.|||+ ...|+||||+| T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT--~km~a~Pfl~P 97 (115) T protein:vir:93 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGT--RYMEAEPFMWP 97 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchhhhcccc--cccCCCCchhh Confidence 111111000011233444443333110 111111110 000111 13367899999995 68999999999 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 70 GAVISQNNIAKKMKQVFA 87 (162) Q Consensus 70 ~~~~~~~~~~~~~~~~~~ 87 (162) +++.++..+.+.++++++ T Consensus 98 A~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 98 VYEVIRKSTVEELKALFE 115 (115) T ss_pred hHHHHHHHHHHHHHHHhC Confidence 999999999999998888 No 53 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=97.53 E-value=1.7e-07 Score=57.68 Aligned_cols=85 Identities=12% Similarity=0.181 Sum_probs=46.2 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCe----EEEeecCCC--CCCCCC-----CCHHHHHHHHhcCCcCCCCCCcchhhH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQ----IEAGFLTNR--RHPESD-----LTIPAIAAIQQYGNETNNIPARPFITD 69 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~----v~VGi~~~~--~~~d~g-----~~~A~iA~~~E~G~~~~~IP~RpFlr~ 69 (162) +.-.+...=......+....+...... +.-|.+.++ ..-+++ .+.+++|.+.|||+ ...|+||||+| T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT--~km~a~Pfl~P 97 (115) T protein:vir:10 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGT--RYMEAEPFMWP 97 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchhhhcccc--cccCCCCchhh Confidence 111111000011233444443333110 111111110 000111 13367899999995 68999999999 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 70 GAVISQNNIAKKMKQVFA 87 (162) Q Consensus 70 ~~~~~~~~~~~~~~~~~~ 87 (162) +++.++..+.+.++++++ T Consensus 98 A~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 98 VYEVIRKSTVEELKALFE 115 (115) T ss_pred hHHHHHHHHHHHHHHHhC Confidence 999999999999998888 No 54 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=97.53 E-value=1.7e-07 Score=57.68 Aligned_cols=85 Identities=12% Similarity=0.181 Sum_probs=46.2 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCe----EEEeecCCC--CCCCCC-----CCHHHHHHHHhcCCcCCCCCCcchhhH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQ----IEAGFLTNR--RHPESD-----LTIPAIAAIQQYGNETNNIPARPFITD 69 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~----v~VGi~~~~--~~~d~g-----~~~A~iA~~~E~G~~~~~IP~RpFlr~ 69 (162) +.-.+...=......+....+...... +.-|.+.++ ..-+++ .+.+++|.+.|||+ ...|+||||+| T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT--~km~a~Pfl~P 97 (115) T protein:vir:78 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGT--RYMEAEPFMWP 97 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchhhhcccc--cccCCCCchhh Confidence 111111000011233444443333110 111111110 000111 13367899999995 68999999999 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 70 GAVISQNNIAKKMKQVFA 87 (162) Q Consensus 70 ~~~~~~~~~~~~~~~~~~ 87 (162) +++.++..+.+.++++++ T Consensus 98 A~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 98 VYEVIRKSTVEELKALFE 115 (115) T ss_pred hHHHHHHHHHHHHHHHhC Confidence 999999999999998888 No 55 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=97.52 E-value=7.6e-07 Score=54.14 Aligned_cols=87 Identities=7% Similarity=0.154 Sum_probs=62.2 Q ss_pred HHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhc------CCCCCCHHHHHHhhccCCCCCCchhhHH Q lcl|NC_020079. 72 VISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQ------RYRPLSPVTIKIRQDKGNYSNHILIDTA 145 (162) Q Consensus 72 ~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppnsp~Ti~~k~~k~~~~~~PLidTG 145 (162) .+.-+++...+...+.+ +...+...+|..||..+....++.|... .|+|+++.|++.|..+ ..++|+++| T Consensus 1 ~~~~~~l~~~L~~ll~~-l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~---~~~~l~~~~ 76 (150) T protein:vir:20 1 MNEFKRFEDRLTGLIES-LSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGR---VKRKMFAKL 76 (150) T ss_pred CchHHHHHHHHHHHHHh-cCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccC---CCccccchh Confidence 22234455555555544 3334446789999999999999999764 5899999998766533 356899999 Q ss_pred HHHhhceeeeecCCCC-------C Q lcl|NC_020079. 146 HMINAIETKITKSKSK-------K 162 (162) Q Consensus 146 ~l~~SIty~V~~~~~~-------~ 162 (162) .|.+||+|.+...... - T Consensus 77 ~l~~sl~~~~~~~~~~vg~~~Gs~ 100 (150) T protein:vir:20 77 ITSRFLHIRASPEQASMEFYGGKS 100 (150) T ss_pred hhhhhhheeecCcEEEEEeeCCcc Confidence 9999999988654422 1 No 56 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=97.42 E-value=2.2e-07 Score=57.10 Aligned_cols=84 Identities=10% Similarity=0.094 Sum_probs=44.8 Q ss_pred CCCcccccchhHHHHHHHHHH-----Hhh--CCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMM-----DLE--QVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVI 73 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~-----~l~--~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~ 73 (162) |+..+-.++...-..+...+. ... ...|.|++-.+. +...++.+.|||+ .+.||||||+|+++. T Consensus 43 ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~~~~~~v~vg~~~-------~~~~y~~f~E~GT--~k~~a~PF~~pA~~~ 113 (133) T protein:vir:10 43 MKQHAGFDETSTGQHMRDSIKIRSSTRKAQGNAVVTLRVGPSK-------QHHMKVLAQEFGT--VKQVADPFIRPALDY 113 (133) T ss_pred HHHhCCCCCCcchhhhhhcccccccccccCccceEEEEecCCC-------CccceEeeeccCC--CCCCCCccchHHHHH Confidence 222221111110001111110 000 111222221110 1123344559996 578999999999999 Q ss_pred HHHHHHHHHHHHHHHHhccc Q lcl|NC_020079. 74 SQNNIAKKMKQVFANYLMHN 93 (162) Q Consensus 74 ~~~~~~~~~~~~~~~~l~G~ 93 (162) +++++.+.+.+.+.+.|.-+ T Consensus 114 ~~~~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 114 NVQTVLRVLTVEIRNGIQNR 133 (133) T ss_pred hHHHHHHHHHHHHHHHhhcC Confidence 99999999999999988777 No 57 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=97.41 E-value=4.1e-07 Score=55.63 Aligned_cols=85 Identities=11% Similarity=0.164 Sum_probs=44.6 Q ss_pred CCCcccccc----hhHHHHHHHHHHHhhCCe----EEEeecCCC--CCCCCC-----CCHHHHHHHHhcCCcCCCCCCcc Q lcl|NC_020079. 1 MESEILPGD----DTDWETIIKKMMDLEQVQ----IEAGFLTNR--RHPESD-----LTIPAIAAIQQYGNETNNIPARP 65 (162) Q Consensus 1 M~~~i~~~~----~~~l~~l~~~l~~l~~~~----v~VGi~~~~--~~~d~g-----~~~A~iA~~~E~G~~~~~IP~Rp 65 (162) |.-.+...- ......+....+.+.... |.-|.+..+ ...+|+ .+.+.+|.+.|||+ ...|+|| T Consensus 16 ~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vEfGT--~km~a~P 93 (115) T protein:vir:10 16 MHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKKIGDLHYRVISTAHYSGFLEFGT--RYMEPAP 93 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEeeCCCccchheeccc--ccCCCCC Confidence 211111000 011223333333322110 111111100 000111 13467899999995 6899999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 66 FITDGAVISQNNIAKKMKQVFA 87 (162) Q Consensus 66 Flr~~~~~~~~~~~~~~~~~~~ 87 (162) ||+|+++.++..+.+.+++++. T Consensus 94 Fl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 94 FMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred chhhhHHHHHHHHHHHHHHHhC Confidence 9999999999999999888887 No 58 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=97.34 E-value=2.1e-07 Score=57.24 Aligned_cols=84 Identities=10% Similarity=0.103 Sum_probs=44.9 Q ss_pred CCCc--ccccchhHHHHHHHHHHHhhCCeEEEeecCCC-------CCCCCC---CCHHHHHHHHhcCCcCCCCCCcchhh Q lcl|NC_020079. 1 MESE--ILPGDDTDWETIIKKMMDLEQVQIEAGFLTNR-------RHPESD---LTIPAIAAIQQYGNETNNIPARPFIT 68 (162) Q Consensus 1 M~~~--i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~-------~~~d~g---~~~A~iA~~~E~G~~~~~IP~RpFlr 68 (162) +... +..--.....++.+.+.+... ..+++..|. ...+++ .+.+.+|.++|||+ ...|+||||+ T Consensus 19 ~~~~~~v~~~~~~~~~~~~~~~~~~a~--~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~Ya~~vEfGT--~km~a~Pfl~ 94 (114) T protein:vir:27 19 NASPEKRSKVLRKYGSKLKEAAVNRAQ--FNKGYSTGATRRSITLQVESDKATVEALTSYSGYLEVGT--RKMEAQPFMK 94 (114) T ss_pred hcCHHHHHHHHHHHHHHHHHHHHHhcc--cCCCCCchhhhhceeeeecCCeeEecCCCCccceecccc--cccCCCCchh Confidence 3211 100000112222222222221 111111110 001111 13467899999995 6899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 69 DGAVISQNNIAKKMKQVFAN 88 (162) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~ 88 (162) |+++.++.++.+.+++.++- T Consensus 95 PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 95 PALDEVAPKMVEELAKWDET 114 (114) T ss_pred hhHHHHHHHHHHHHHHHhcC Confidence 99999999999888888875 No 59 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=97.34 E-value=2.1e-07 Score=57.24 Aligned_cols=84 Identities=10% Similarity=0.103 Sum_probs=44.9 Q ss_pred CCCc--ccccchhHHHHHHHHHHHhhCCeEEEeecCCC-------CCCCCC---CCHHHHHHHHhcCCcCCCCCCcchhh Q lcl|NC_020079. 1 MESE--ILPGDDTDWETIIKKMMDLEQVQIEAGFLTNR-------RHPESD---LTIPAIAAIQQYGNETNNIPARPFIT 68 (162) Q Consensus 1 M~~~--i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~-------~~~d~g---~~~A~iA~~~E~G~~~~~IP~RpFlr 68 (162) +... +..--.....++.+.+.+... ..+++..|. ...+++ .+.+.+|.++|||+ ...|+||||+ T Consensus 19 ~~~~~~v~~~~~~~~~~~~~~~~~~a~--~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~Ya~~vEfGT--~km~a~Pfl~ 94 (114) T protein:vir:49 19 NASPEKRSKVLRKYGSKLKEAAVNRAQ--FNKGYSTGATRRSITLQVESDKATVEALTSYSGYLEVGT--RKMEAQPFMK 94 (114) T ss_pred hcCHHHHHHHHHHHHHHHHHHHHHhcc--cCCCCCchhhhhceeeeecCCeeEecCCCCccceecccc--cccCCCCchh Confidence 3211 100000112222222222221 111111110 001111 13467899999995 6899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 69 DGAVISQNNIAKKMKQVFAN 88 (162) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~ 88 (162) |+++.++.++.+.+++.++- T Consensus 95 PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 95 PALDEVAPKMVEELAKWDET 114 (114) T ss_pred hhHHHHHHHHHHHHHHHhcC Confidence 99999999999888888875 No 60 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=97.23 E-value=6.9e-07 Score=54.35 Aligned_cols=78 Identities=13% Similarity=0.155 Sum_probs=41.7 Q ss_pred CCCc----ccccchh----HHH--HHHHHHH-HhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc--------CCCC Q lcl|NC_020079. 1 MESE----ILPGDDT----DWE--TIIKKMM-DLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE--------TNNI 61 (162) Q Consensus 1 M~~~----i~~~~~~----~l~--~l~~~l~-~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~--------~~~I 61 (162) ..+. +..+..+ -+. .+...|. ......+.|||.+ ++..||++|+||.. .++| T Consensus 53 W~p~~~~~~~~k~~~~~~~l~~~g~l~~sl~~~~~~~~~~V~~~G---------s~~~yAa~HQfG~~~r~~~~~~~~~i 123 (149) T protein:vir:98 53 YAARKRQSVRSKKGRIRREMFARLRTNRFMKAKGSDSAAVVEFTG---------RVQRMARVHQYGLKDRPNRHSRDVQY 123 (149) T ss_pred CcccchHHHHhccCCCCcccchhhhhhhhhhheecCCeeEEEecC---------cchHHhhHhhccccccccCCCcceec Confidence 1111 0000000 011 1122221 2344578888853 34789999999953 3479 Q ss_pred CCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 62 PARPFITDGAVISQNNIAKKMKQVFAN 88 (162) Q Consensus 62 P~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (162) |+||||--+ ++..+++.+.+...+.. T Consensus 124 PaRp~LG~s-~~d~~~i~~~i~~~l~~ 149 (149) T protein:vir:98 124 AARPLLGFT-RDDEQMIEDIIIRHLGK 149 (149) T ss_pred cccccCCCC-HHHHHHHHHHHHHHhhC Confidence 999999433 34456666666666655 No 61 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=97.23 E-value=5.5e-07 Score=54.92 Aligned_cols=84 Identities=17% Similarity=0.264 Sum_probs=47.5 Q ss_pred CCCcccccchhHHHHHHHHHHHhhC------------------------CeEEEeecCCC---CCCCCC-----CCHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQ------------------------VQIEAGFLTNR---RHPESD-----LTIPAI 48 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~------------------------~~v~VGi~~~~---~~~d~g-----~~~A~i 48 (162) |+.+|..+.. +++.+.|+++.+ .-|.-|-+..+ ...+++ .+.+.+ T Consensus 4 ms~~i~~~g~---~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~~~g~~~~V~~~~~Y 80 (144) T protein:vir:59 4 MSVRIDPSWR---RIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYKNNGLTAEITVGAEY 80 (144) T ss_pred ceeeehhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEeecCcEEEEEecCCCc Confidence 6666544332 222221111111 00111211111 001111 245789 Q ss_pred HHHHhcCCcC-------------------------CCCCCcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 49 AAIQQYGNET-------------------------NNIPARPFITDGAVISQNNIAKKMKQVFA 87 (162) Q Consensus 49 A~~~E~G~~~-------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 87 (162) |.+.|||+.. ..+|+||||+++++.+++.+.+.+++++- T Consensus 81 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~~~~~i~~~~g 144 (144) T protein:vir:59 81 AIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGGEYFEREMRRLRG 144 (144) T ss_pred cchhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHHHHHHHHHHHHhcC Confidence 9999999721 35899999999999999999999888875 No 62 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=97.23 E-value=7.3e-07 Score=54.23 Aligned_cols=85 Identities=15% Similarity=0.215 Sum_probs=46.0 Q ss_pred CCCcccccch----hHHHHHHHHHHHhhCCe----EEEeecCCC--CCCCCC-----CCHHHHHHHHhcCCcCCCCCCcc Q lcl|NC_020079. 1 MESEILPGDD----TDWETIIKKMMDLEQVQ----IEAGFLTNR--RHPESD-----LTIPAIAAIQQYGNETNNIPARP 65 (162) Q Consensus 1 M~~~i~~~~~----~~l~~l~~~l~~l~~~~----v~VGi~~~~--~~~d~g-----~~~A~iA~~~E~G~~~~~IP~Rp 65 (162) |.-.+...-. ....++....+.+.... +.-|.+..+ ..-+++ .+.+.+|.+.|||+ ...|+|| T Consensus 16 ~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~~g~~~~~V~~~~~Ya~~vE~GT--~~m~a~P 93 (115) T protein:vir:99 16 MKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTVDLQYTITSHAAYSGFLEFGT--RYMEAEP 93 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEecCCccccccccccc--cccCCCC Confidence 2222111000 11233333333332110 111111100 000111 13467899999995 6899999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 66 FITDGAVISQNNIAKKMKQVFA 87 (162) Q Consensus 66 Flr~~~~~~~~~~~~~~~~~~~ 87 (162) ||+|+++.++..+.+.++++++ T Consensus 94 Fl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 94 FMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred cchhhHHHHHHHHHHHHHHHhC Confidence 9999999999999999998888 No 63 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=97.22 E-value=3.1e-06 Score=50.76 Aligned_cols=87 Identities=7% Similarity=0.139 Sum_probs=63.0 Q ss_pred HHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhc------CCCCCCHHHHHHhhccCCCCCCchhhHH Q lcl|NC_020079. 72 VISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQ------RYRPLSPVTIKIRQDKGNYSNHILIDTA 145 (162) Q Consensus 72 ~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppnsp~Ti~~k~~k~~~~~~PLidTG 145 (162) .+.-+++...+...+.+ |...+...+|..||..+....++.|.+. .|+|+++.|+..|..+ ..++|+.+| T Consensus 1 m~~~~~l~~~L~~~l~~-L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~---~~~~l~~~~ 76 (150) T protein:vir:57 1 MNEFKRFEDRLTGLIES-LSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGR---VKRKMFAKL 76 (150) T ss_pred CchHHHHHHHHHHHHHh-cCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccC---CCcccchhh Confidence 22224444455555544 2333446689999999999999999764 6899999998777533 357899999 Q ss_pred HHHhhceeeeecCCCC-------C Q lcl|NC_020079. 146 HMINAIETKITKSKSK-------K 162 (162) Q Consensus 146 ~l~~SIty~V~~~~~~-------~ 162 (162) .|..||+|.+...... = T Consensus 77 ~l~~sl~~~~~~~~a~vg~~~G~~ 100 (150) T protein:vir:57 77 ITSRFLHIRASPEQASMEFYGGKS 100 (150) T ss_pred hhccceeeeeeCcEEEEEeecCCc Confidence 9999999988877532 1 No 64 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=97.22 E-value=3.2e-06 Score=50.71 Aligned_cols=87 Identities=7% Similarity=0.137 Sum_probs=63.2 Q ss_pred HHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhc------CCCCCCHHHHHHhhccCCCCCCchhhHH Q lcl|NC_020079. 72 VISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQ------RYRPLSPVTIKIRQDKGNYSNHILIDTA 145 (162) Q Consensus 72 ~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppnsp~Ti~~k~~k~~~~~~PLidTG 145 (162) .+.-+++...+...+.+ |.......+|..||..+....++.|.+. .|+|+++.|+++|..+ ..++|+++| T Consensus 1 ~~~~~~l~~~L~~~l~~-L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~---~~~~l~~~~ 76 (150) T protein:vir:60 1 MNEFKRFEDRLTGLIES-LSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGR---VKRKMFAKL 76 (150) T ss_pred CchHHHHHHHHHHHHHh-cCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcC---CCccchhhh Confidence 23334445555555555 3334446689999999999999999764 6899999998877633 357899999 Q ss_pred HHHhhceeeeecCCCC--------C Q lcl|NC_020079. 146 HMINAIETKITKSKSK--------K 162 (162) Q Consensus 146 ~l~~SIty~V~~~~~~--------~ 162 (162) .|..||+|.+...+.. . T Consensus 77 ~l~~sl~~~~~~~~a~vg~~~Gt~~ 101 (150) T protein:vir:60 77 ITSRFLHIRASPEQASMEFYGGKSP 101 (150) T ss_pred hhcceeeeeeeCcEEEEEeeCCCch Confidence 9999999988866432 1 No 65 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=97.16 E-value=6.3e-07 Score=54.59 Aligned_cols=84 Identities=8% Similarity=0.095 Sum_probs=43.1 Q ss_pred CCCcccccc-hhHHHH----HHHHHHHhhCC--eEEEeecCCC-CCCCCCC-----CHHHHHHHHhcCCcCCCCCCcchh Q lcl|NC_020079. 1 MESEILPGD-DTDWET----IIKKMMDLEQV--QIEAGFLTNR-RHPESDL-----TIPAIAAIQQYGNETNNIPARPFI 67 (162) Q Consensus 1 M~~~i~~~~-~~~l~~----l~~~l~~l~~~--~v~VGi~~~~-~~~d~g~-----~~A~iA~~~E~G~~~~~IP~RpFl 67 (162) |+..-...+ ..-+.+ +...+.+..+. -|.-|.+..+ ...+++. +.+.+|.+.|||+ ...|+|||| T Consensus 16 l~~~~~~~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~~~~~~~~v~~~~~Ya~~vE~GT--r~m~AqPF~ 93 (112) T protein:vir:96 16 LLKNASSERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLEAGSDRAVVEALTNYSGYLEVGT--RKMEAQPFM 93 (112) T ss_pred HHhhcCHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeeecCceEEEecCCCCccceeccCc--cccCCCCch Confidence 321111111 011222 22222221110 1111111111 0111221 3367899999995 689999999 Q ss_pred hHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 68 TDGAVISQNNIAKKMKQVF 86 (162) Q Consensus 68 r~~~~~~~~~~~~~~~~~~ 86 (162) +|+++.++..+.+.++++- T Consensus 94 ~PA~~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 94 RPALDQVVPEMVEEMAKWE 112 (112) T ss_pred hhhHHHHHHHHHHHHHhcC Confidence 9999999999888888766 No 66 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=97.12 E-value=2.2e-06 Score=51.55 Aligned_cols=75 Identities=20% Similarity=0.295 Sum_probs=44.0 Q ss_pred CCCcccccchhHHHHHHHHHHH-hhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcC------CCCCCcchhhHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMD-LEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNET------NNIPARPFITDGAVI 73 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~-l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~------~~IP~RpFlr~~~~~ 73 (162) ...++-.+. -+|...|.. .....|.||.. ..+|++|+||... ++||+||||--+ ++ T Consensus 75 ~~~~~L~~t----g~L~~Si~~~~~~~~v~vGt~------------~~yA~vHqfG~~~~~~~~~~~iPaRpfLG~s-~~ 137 (156) T protein:vir:19 75 VPGSILTLH----GDLARSITTDYGQDYALIGSP------------KIYAAIHQWGGTPDMAPRPAGVPARPYMGLD-KT 137 (156) T ss_pred CCCcchhhh----HHHHHHhhheecCCEEEEecc------------hhhhHHhhcCcccccCCCccccCCccccCCC-HH Confidence 112222111 123333322 24456777752 4689999999643 479999999533 45 Q ss_pred HHHHHHHHHHHHHHHHhcc Q lcl|NC_020079. 74 SQNNIAKKMKQVFANYLMH 92 (162) Q Consensus 74 ~~~~~~~~~~~~~~~~l~G 92 (162) ..+++.+.+...+..+++- T Consensus 138 d~~~I~~~i~~~l~~~~~~ 156 (156) T protein:vir:19 138 GEQEIFDAIRKRVSAALRQ 156 (156) T ss_pred HHHHHHHHHHHHHHHHhhC Confidence 5677777777777766644 No 67 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=97.08 E-value=6.5e-07 Score=54.49 Aligned_cols=83 Identities=14% Similarity=0.253 Sum_probs=42.4 Q ss_pred CCCcccccch-hHHH----HHHHHHHHhhCCeEEEeecCCC---CCCCCC-----CCHHHHHHHHhcCCcCCCCCCcchh Q lcl|NC_020079. 1 MESEILPGDD-TDWE----TIIKKMMDLEQVQIEAGFLTNR---RHPESD-----LTIPAIAAIQQYGNETNNIPARPFI 67 (162) Q Consensus 1 M~~~i~~~~~-~~l~----~l~~~l~~l~~~~v~VGi~~~~---~~~d~g-----~~~A~iA~~~E~G~~~~~IP~RpFl 67 (162) +.-....... ..+. .+.+.++.+.. |.-|.+..+ ...+++ .+.+.+|.+.|||+ ...|+|||| T Consensus 13 l~~~~~~~~~~~al~~~a~~i~~~ak~~aP--v~TG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~GT--~km~aqpf~ 88 (108) T protein:vir:74 13 LRKNATLDDVKHVVKSNTASMNKNMQNLAP--VDTGNMKRSITSEFTDGGLSGTTGPHTDYAGYVEYGT--RFQSAQPFV 88 (108) T ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHhCC--CCchhhhccceeeeecCceEEEeecCCCcccceeccc--cccCCCcch Confidence 1100000000 0111 11122222211 111111110 001111 13356899999996 578999999 Q ss_pred hHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 68 TDGAVISQNNIAKKMKQVFA 87 (162) Q Consensus 68 r~~~~~~~~~~~~~~~~~~~ 87 (162) +|+++.++.++.+.+++.++ T Consensus 89 ~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 89 KPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred hhHHHHHHHHHHHHHHHHcC Confidence 99999999999988888887 No 68 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=97.05 E-value=8.7e-07 Score=53.82 Aligned_cols=77 Identities=22% Similarity=0.242 Sum_probs=43.5 Q ss_pred CCCcccccchhHHHHHHHHHHH-hhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc-----CCCCCCcchhhHH-HHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMD-LEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE-----TNNIPARPFITDG-AVI 73 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~-l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~-----~~~IP~RpFlr~~-~~~ 73 (162) ...++..+. -.|...|.. .....|.||- +..+|++|+||.. ..+||+||||--. -++ T Consensus 71 ~~~~~L~~t----G~L~~Si~~~~~~~~v~vGt------------n~~YA~iHqfGg~~~~~~~~~iPARPfLG~s~~~e 134 (155) T protein:vir:10 71 GAHPILQVT----NALARSITTRADRDQAQIGS------------NLSYAAIQQLGGQAGRGRKVTIPARPYLPVLRNGQ 134 (155) T ss_pred CCCCccccc----hhhhhhhhceecCCEEEEec------------CcchhhhhhcccccCCCCccccCCccccCCCcccc Confidence 222222221 123333332 2445677774 1357999999963 4579999999422 123 Q ss_pred HHHHHHHHHHHHHHHHh-ccc Q lcl|NC_020079. 74 SQNNIAKKMKQVFANYL-MHN 93 (162) Q Consensus 74 ~~~~~~~~~~~~~~~~l-~G~ 93 (162) -+.++.+.+...+...| .|+ T Consensus 135 ~~~ei~~~I~~~i~~~l~~~r 155 (155) T protein:vir:10 135 LKPSARDAVLDVLLAALSQGR 155 (155) T ss_pred chHHHHHHHHHHHHHHHhhcC Confidence 34566666666666666 466 No 69 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=97.02 E-value=2.1e-06 Score=51.69 Aligned_cols=76 Identities=13% Similarity=0.164 Sum_probs=40.4 Q ss_pred CCCcccccchhHHHHH--HHHHHH-hhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc--------CCCCCCcchhhH Q lcl|NC_020079. 1 MESEILPGDDTDWETI--IKKMMD-LEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE--------TNNIPARPFITD 69 (162) Q Consensus 1 M~~~i~~~~~~~l~~l--~~~l~~-l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~--------~~~IP~RpFlr~ 69 (162) ++.+..... .+..+ -+.|.. .....+.||+.+ ++..||++|.||.. .++||+||||-- T Consensus 63 ~~~g~~~~~--~~~~l~~~~~l~~~~~~~~~~v~~~G---------tn~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~ 131 (149) T protein:vir:18 63 SKKGRIKRE--MFAKLRTSRFMKAKGSDSAAVVEFTG---------KVQRMARVHQYGLKDRPNRNSRDVQYEARPLLGF 131 (149) T ss_pred hccCcccch--hhhhhhhhhhhheeecCceeEEEecc---------cchhhhhhhhccccccccCCCccccccccccCCC Confidence 322221111 01111 111211 123457777642 34689999999953 247999999954 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 70 GAVISQNNIAKKMKQVFAN 88 (162) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~ 88 (162) + ++...++.+.+...+.. T Consensus 132 s-~~d~~~I~~~i~~~l~~ 149 (149) T protein:vir:18 132 T-RDDEQMIEDVIISHLGK 149 (149) T ss_pred C-HHHHHHHHHHHHHHHhC Confidence 4 33446666666666655 No 70 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=97.02 E-value=2.3e-06 Score=51.48 Aligned_cols=77 Identities=21% Similarity=0.330 Sum_probs=45.9 Q ss_pred CCCcccccchhHHHHHHHHHHH-hhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc---------------------- Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMD-LEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE---------------------- 57 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~-l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~---------------------- 57 (162) ...++..+. -.|.+.|.. .....|.||. +..+|++|+||.. T Consensus 71 ~~~~~L~~t----g~L~~Si~~~~~~~~v~vGt------------n~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~ 134 (190) T protein:vir:99 71 NRDKILTLD----GHLRNLLRYQLDGSELLFGS------------DRPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVG 134 (190) T ss_pred CCCccceec----HHHHHHHhheecCcEEEEec------------CcchhhhhhcCCcccccccchhhhhhhhhhhhhhh Confidence 232322211 133444432 3445677764 2466899999942 Q ss_pred --------------------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccch Q lcl|NC_020079. 58 --------------------TNNIPARPFITDGAVISQNNIAKKMKQVFANYLMHNV 94 (162) Q Consensus 58 --------------------~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~ 94 (162) .++||+||||--+ ++..+++.+.+...+..++.... T Consensus 135 ~~~~~~~~~~~~~~~~~~~~~v~IPaRpfLG~s-~~d~~~I~~~i~~~l~~~~~~~~ 190 (190) T protein:vir:99 135 REFVPRRRSNFAQDVQIGPYTIQMPARPWLGTS-SQDDDTILQRVERYLQRALRERA 190 (190) T ss_pred cccccccccccchhcccccceeeecCcccCCCC-HHHHHHHHHHHHHHHHHHHhhcC Confidence 2468999999544 44567777777777777766555 No 71 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=96.94 E-value=1.5e-06 Score=52.47 Aligned_cols=77 Identities=17% Similarity=0.171 Sum_probs=44.0 Q ss_pred CCCc-----------------------ccccchhHHHHHHHHHHHh-hCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCC Q lcl|NC_020079. 1 MESE-----------------------ILPGDDTDWETIIKKMMDL-EQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGN 56 (162) Q Consensus 1 M~~~-----------------------i~~~~~~~l~~l~~~l~~l-~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~ 56 (162) ++.+ +.++. -.|...|..- ....|.||-. ..+|++|+||. T Consensus 65 ~r~~~~~~~~~~~~~~~~~~~~~~~~~~L~~t----G~L~~Si~~~~~~~~v~vGtn------------~~YAaiHqfGg 128 (175) T protein:vir:79 65 MRVGGKKAYKKNGELTAAASRRKAGLMILQDS----GQMAASTATDSGEDYSVIGSN------------KEYAAIQHFGG 128 (175) T ss_pred hhccccccccccccchhhHhhhccCCCcceec----hhhhhhhhheecCCEEEEecC------------cchhhHhhccc Confidence 1111 11110 1233333222 3446667652 36799999997 Q ss_pred cC-----CCCCCcchhhHHHHH-----HHHHHHHHHHHHHHHHhccc Q lcl|NC_020079. 57 ET-----NNIPARPFITDGAVI-----SQNNIAKKMKQVFANYLMHN 93 (162) Q Consensus 57 ~~-----~~IP~RpFlr~~~~~-----~~~~~~~~~~~~~~~~l~G~ 93 (162) .. .+||+||||--+-+. ..+.+.+.+...+..++.++ T Consensus 129 ~~~~~~~v~IPARPfLG~s~~de~~~~~~~~I~~~i~~~l~~a~~~~ 175 (175) T protein:vir:79 129 QAGRGLKVTIPGRAWLPVTADGELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred ccCCCcccccCcccccCCCcccchhHHHHHHHHHHHHHHHHHHhccC Confidence 53 479999999533322 14667777777777777777 No 72 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=96.90 E-value=2.4e-06 Score=51.40 Aligned_cols=79 Identities=15% Similarity=0.193 Sum_probs=40.7 Q ss_pred CCCc----ccccchh----HHH--HHHHHHH-HhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc--------CCCC Q lcl|NC_020079. 1 MESE----ILPGDDT----DWE--TIIKKMM-DLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE--------TNNI 61 (162) Q Consensus 1 M~~~----i~~~~~~----~l~--~l~~~l~-~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~--------~~~I 61 (162) ..+. +..+..+ .+. .+...|. ..+...+.|||..| +++.||++|.||.. ..+| T Consensus 53 W~p~~~~~~~~k~~~~~~~l~~~~~l~~sl~~~~~~~~a~vg~~~G--------t~~~yAaiHQfG~~~~~~~~~~~~~i 124 (150) T protein:vir:60 53 YAPRQQQSARKKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGG--------KSPKIASVHQFGLSEENRKDGKKIDY 124 (150) T ss_pred CcccChHHHHHhhcCCCccchhhhhhcceeeeeeeCcEEEEEeeCC--------CchhhhhhhhccccccccCCCCceec Confidence 2111 0000000 011 0111121 22345678887643 44789999999953 3479 Q ss_pred CCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 62 PARPFITDGAVISQNNIAKKMKQVFAN 88 (162) Q Consensus 62 P~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (162) |+||||--+-+ ..+++.+.+...+.. T Consensus 125 PaRp~LG~s~~-d~~~i~~~i~~~l~r 150 (150) T protein:vir:60 125 PARPLLGFTGE-DVQMIEEIILAHLDR 150 (150) T ss_pred CCcccCCCCHH-HHHHHHHHHHHHHhC Confidence 99999955432 345566666666655 No 73 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=96.89 E-value=2.6e-06 Score=51.23 Aligned_cols=79 Identities=15% Similarity=0.151 Sum_probs=42.5 Q ss_pred CCCcccccchhHHHHHHHHHH-HhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc--------CCCCCCcchhhHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMM-DLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE--------TNNIPARPFITDGA 71 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~-~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~--------~~~IP~RpFlr~~~ 71 (162) .+.+-..+-...--.+...|. ..+...+.|||..| +++.||++|.||.. ..+||+||||--+- T Consensus 63 ~k~g~~~~~l~~~~~l~~sl~~~~~~~~~~vg~~~G--------s~~~yAa~HQfG~~~~~~~~~~~~~iPaRp~LG~s~ 134 (150) T protein:vir:20 63 KKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGG--------KSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTG 134 (150) T ss_pred HhccCCCccccchhhhhhhhheeecCcEEEEEeeCC--------cchhhhhhhhcccccccccCCCceeccccccCCCCH Confidence 111100000000001222332 23456788998643 44689999999943 34799999995443 Q ss_pred HHHHHHHHHHHHHHHHH Q lcl|NC_020079. 72 VISQNNIAKKMKQVFAN 88 (162) Q Consensus 72 ~~~~~~~~~~~~~~~~~ 88 (162) +..+++.+.+...+.. T Consensus 135 -~d~~~i~~~i~~~l~k 150 (150) T protein:vir:20 135 -EDVQMIEEIILAHLER 150 (150) T ss_pred -HHHHHHHHHHHHHHhC Confidence 3345566666666655 No 74 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=96.89 E-value=9e-07 Score=53.73 Aligned_cols=82 Identities=13% Similarity=0.265 Sum_probs=41.5 Q ss_pred CCCcccccchhHHHHHHH----HHHHhhCC--eEEEeecCCC---CCCCCC-----CCHHHHHHHHhcCCcCCCCCCcch Q lcl|NC_020079. 1 MESEILPGDDTDWETIIK----KMMDLEQV--QIEAGFLTNR---RHPESD-----LTIPAIAAIQQYGNETNNIPARPF 66 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~----~l~~l~~~--~v~VGi~~~~---~~~d~g-----~~~A~iA~~~E~G~~~~~IP~RpF 66 (162) ++.... ...+++.++ .+.+..+. -|.-|-+..+ ...+++ .+.+.+|.+.|||+ ...|+||| T Consensus 13 l~~~~~---~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~GT--~~m~aqPF 87 (108) T protein:vir:98 13 LRKNAT---LNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFTDGGLTGTTIPHTDYAGYVEYGT--RFQAAQPF 87 (108) T ss_pred HHHhhh---HHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeeecCceEEEeecCCCccceeeccc--cccCCCcc Confidence 110000 001111111 11111110 0111111100 000111 13357899999996 57899999 Q ss_pred hhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 67 ITDGAVISQNNIAKKMKQVFA 87 (162) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~ 87 (162) |+|+++.++..+.+.+++.++ T Consensus 88 l~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 88 VKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred hhhHHHHHHHHHHHHHHHHcC Confidence 999999999999998888887 No 75 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=96.82 E-value=3.1e-06 Score=50.76 Aligned_cols=79 Identities=15% Similarity=0.201 Sum_probs=40.5 Q ss_pred CCCc----ccc----cchhHHHH--HHHHH-HHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc--------CCCC Q lcl|NC_020079. 1 MESE----ILP----GDDTDWET--IIKKM-MDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE--------TNNI 61 (162) Q Consensus 1 M~~~----i~~----~~~~~l~~--l~~~l-~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~--------~~~I 61 (162) ..+. +.. .....+.. +...| -..+...+.|||..| ++..||++|.||.. .++| T Consensus 53 W~p~k~~~~~~k~~~~~~~l~~~~~l~~sl~~~~~~~~a~vg~~~G--------~~~~yAaiHQfG~~~r~~~~~~~~~i 124 (150) T protein:vir:57 53 YAPRQQQSARKKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGG--------KSPKIASVHQFGLSEETRKDGKKIDY 124 (150) T ss_pred CcccChHHHHHhccCCCcccchhhhhccceeeeeeCcEEEEEeecC--------CchhhhhhhhccccccccCCCceeec Confidence 1110 000 00000110 11112 122345678887543 44789999999953 3469 Q ss_pred CCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 62 PARPFITDGAVISQNNIAKKMKQVFAN 88 (162) Q Consensus 62 P~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (162) |+||||--+- +...++.+.+...+.+ T Consensus 125 PaRp~LG~s~-~d~~~i~~~i~~~l~r 150 (150) T protein:vir:57 125 PARPLLGFTG-EDVQMIEEIILAHLDR 150 (150) T ss_pred CCcccCCCCH-HHHHHHHHHHHHHHhC Confidence 9999995442 3345566666666655 No 76 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=96.79 E-value=2.4e-06 Score=51.35 Aligned_cols=79 Identities=19% Similarity=0.183 Sum_probs=41.6 Q ss_pred CCC--------cccccchhHHHHHHHH--H-HHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc---------CCC Q lcl|NC_020079. 1 MES--------EILPGDDTDWETIIKK--M-MDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE---------TNN 60 (162) Q Consensus 1 M~~--------~i~~~~~~~l~~l~~~--l-~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~---------~~~ 60 (162) ..+ +-..+....+..|... | -......+.|||.+ ++..||++|.||.. .++ T Consensus 54 W~p~k~~~~~~k~~~~~~~m~~~L~~a~~l~~~a~~~~~~Vg~~G---------t~~~yAaiHQfG~~~r~~~~~~~~v~ 124 (152) T protein:vir:10 54 YEPRKKPKKGVKSKIKSGKMFDKITQPRFMRLRLESEGVSLGYEG---------GDAVIARIHQQGLIGRVRKDWDLKVK 124 (152) T ss_pred CchhhhhhhhhcccccchhHHHhhhhcceeeeeecCcEEEEEecC---------CchhhhhhhccCccccccCCCCccee Confidence 211 1111111112222221 1 12344578899863 34689999999953 357 Q ss_pred CCCcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 61 IPARPFITDGAVISQNNIAKKMKQVFANY 89 (162) Q Consensus 61 IP~RpFlr~~~~~~~~~~~~~~~~~~~~~ 89 (162) ||+||||--+-+ ..+++.+.+...+... T Consensus 125 iPaRp~LG~s~~-d~~~I~~~i~~~l~~a 152 (152) T protein:vir:10 125 YASRELLGFTDD-DLQMIEDYMINILAGS 152 (152) T ss_pred ccccccCCCCHH-HHHHHHHHHHHHHhcC Confidence 999999955432 3345555555555443 No 77 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=96.68 E-value=4e-06 Score=50.15 Aligned_cols=77 Identities=19% Similarity=0.242 Sum_probs=42.0 Q ss_pred CC-----------------------CcccccchhHHHHHHHHHHH-hhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCC Q lcl|NC_020079. 1 ME-----------------------SEILPGDDTDWETIIKKMMD-LEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGN 56 (162) Q Consensus 1 M~-----------------------~~i~~~~~~~l~~l~~~l~~-l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~ 56 (162) .+ .++.++. -.|...|.. .+...|.||- +..+|++|+||. T Consensus 65 ~r~~~g~~~~k~~~~~~~~~~~~~~~~~L~~t----G~L~~Si~~~~~~~~v~vGt------------n~~YAaiHqfGg 128 (175) T protein:vir:10 65 MRVGGKKAYKKNGELTAAASRRKAGLMILQDS----GQMAASVSTDHDDNSAVIGS------------NKEYAAIHQFGG 128 (175) T ss_pred hhhcccccchhhhhhhhhhhhhccCCCcceec----hhhhhhhheeecCCEEEEec------------Chhhhhhhhccc Confidence 11 1111111 122333321 1334555554 236799999996 Q ss_pred c-----CCCCCCcchhhHHHHH-----HHHHHHHHHHHHHHHHhccc Q lcl|NC_020079. 57 E-----TNNIPARPFITDGAVI-----SQNNIAKKMKQVFANYLMHN 93 (162) Q Consensus 57 ~-----~~~IP~RpFlr~~~~~-----~~~~~~~~~~~~~~~~l~G~ 93 (162) . .++||+||||--+-+. ..++|.+.+...+...+.++ T Consensus 129 ~~~~~~~v~iPaRpfLG~s~~d~~~~e~~~~Il~~~~~~l~~~~~~~ 175 (175) T protein:vir:10 129 QAGRGLKVTIPARPWLPVTADGELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred ccCCCCccccCCccccCCCcccccchHHHHHHHHHHHHHHHHHhccC Confidence 4 3489999999543221 23566777777777777777 No 78 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=96.52 E-value=7e-06 Score=48.84 Aligned_cols=78 Identities=15% Similarity=0.301 Sum_probs=40.2 Q ss_pred CCCcc------cccc-hhHH--HHHHHHH-------HHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc------- Q lcl|NC_020079. 1 MESEI------LPGD-DTDW--ETIIKKM-------MDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE------- 57 (162) Q Consensus 1 M~~~i------~~~~-~~~l--~~l~~~l-------~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~------- 57 (162) ..+.- ...+ ...+ .-+...+ -..+...+.|||.+ +++.||++|.||.. T Consensus 54 W~prk~~~~~~~~~~~~g~~~~~~m~~~l~~a~~l~~~~~~d~a~Vg~~G---------s~~~yAaiHQfG~~~r~~~~~ 124 (155) T protein:vir:79 54 YEPRKVKAGGKRLREKAGRVKREAMFRKLRTARYLRIDVDSTGLAIGFDE---------RLSRIARVHQEGQKAPVEPGG 124 (155) T ss_pred CcccchhhhhhhhhcccCcccchhhhhhhhhhheeeeeecCcEEEEEecC---------cchhhhhhhhcCCcccCCCCC Confidence 22110 0000 0000 0011111 11234567788732 45789999999953 Q ss_pred -CCCCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 58 -TNNIPARPFITDGAVISQNNIAKKMKQVFAN 88 (162) Q Consensus 58 -~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (162) .++||+||||--+-+ ..+++.+.+...+.. T Consensus 125 ~~v~iPaRp~LGls~~-d~~~I~~~i~~~l~r 155 (155) T protein:vir:79 125 PLAQYPVRVVLGFSDA-DRELVRDRLLRELTR 155 (155) T ss_pred cccccccccccCCCHH-HHHHHHHHHHHHhhC Confidence 347999999954433 345666666666655 No 79 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=96.52 E-value=3.9e-06 Score=50.21 Aligned_cols=78 Identities=17% Similarity=0.233 Sum_probs=41.8 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCC------------------------eEEEeecCCC---CCCCCC-----CCHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQV------------------------QIEAGFLTNR---RHPESD-----LTIPAI 48 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~------------------------~v~VGi~~~~---~~~d~g-----~~~A~i 48 (162) |..-.. ++++|.+.|+.+.+. -|.-|-+..+ ...+++ .+.+.+ T Consensus 1 Ma~~~~-----G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~Y 75 (137) T protein:vir:10 1 MAKVKY-----GNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEY 75 (137) T ss_pred Cccchh-----CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEecCCcEEEEEecCCcc Confidence 444221 122222222221110 0122222211 001111 134788 Q ss_pred HHHHhcCCc---------------------------CCCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_020079. 49 AAIQQYGNE---------------------------TNNIPARPFITDGAVISQNNIAKKMK 83 (162) Q Consensus 49 A~~~E~G~~---------------------------~~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (162) |.+.|||+. +.+.|+||||++++++++.++.+.+. T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 76 AVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred ccccccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 999999962 23589999999999999999988888 No 80 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=96.44 E-value=1.7e-05 Score=46.69 Aligned_cols=93 Identities=13% Similarity=0.162 Sum_probs=44.1 Q ss_pred CCCccc---ccch-hHHHHHHHHHHHhhCC-------eEEEeecCCC-CCCCCCCCHHHHHHHHhcCCcC---------- Q lcl|NC_020079. 1 MESEIL---PGDD-TDWETIIKKMMDLEQV-------QIEAGFLTNR-RHPESDLTIPAIAAIQQYGNET---------- 58 (162) Q Consensus 1 M~~~i~---~~~~-~~l~~l~~~l~~l~~~-------~v~VGi~~~~-~~~d~g~~~A~iA~~~E~G~~~---------- 58 (162) |...+. ..-. .....+.+..+.+.-. .+.+-..... .-.-+..+.+.+|.+.|||+.. T Consensus 16 l~~~~~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~~~~v~~~~~Ya~fvEfGT~~m~a~P~~~~~ 95 (173) T protein:vir:10 16 IGKDIDKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKAKDLISKKITVNELYGAYMEFGTGAKVSVPKEFAD 95 (173) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeeccCceeEEeeCCCcccchhhhcccccccCCCchhhh Confidence 211111 0000 0111112222222110 1111110000 0000012446778888888621 Q ss_pred -------------------------------------------CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_020079. 59 -------------------------------------------NNIPARPFITDGAVISQNNIAKKMKQVFANYLMHN 93 (162) Q Consensus 59 -------------------------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~ 93 (162) -..||||||+|+++++++++.+.+++.+...+.-- T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqPFl~PA~~~~~~~~~~~i~~~i~~~lrk~ 173 (173) T protein:vir:10 96 MAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQPFLYPAWIEGKKQYLKDLENLLKTYNKKI 173 (173) T ss_pred hhcccccccccccccccccccccccccccchhcccceeeEeecCCCCCCccchhHHHHhHHHHHHHHHHHHHHHhhcC Confidence 13799999999999999998888888877655433 No 81 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=96.39 E-value=7.6e-06 Score=48.66 Aligned_cols=78 Identities=17% Similarity=0.217 Sum_probs=35.7 Q ss_pred CCC---ccc-ccch---hHHHH--HHHHHH-HhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc--------CCCCC Q lcl|NC_020079. 1 MES---EIL-PGDD---TDWET--IIKKMM-DLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE--------TNNIP 62 (162) Q Consensus 1 M~~---~i~-~~~~---~~l~~--l~~~l~-~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~--------~~~IP 62 (162) ..+ .-. .+.. ..+.. +...|. ......+.|||.. ++..||++|.||.. .++|| T Consensus 53 W~p~s~~~~~~~g~~~~~~~~~l~~~~~l~~~~~~~~~~v~~~G---------t~~~yAaiHQfG~~~r~~~~~~~v~iP 123 (148) T protein:vir:79 53 YVPRKPQLRHRAGRIRRAMFMRLRLARYMKTQADANTAVVTFAG---------NAQRIATVHQFGLRDRVNKAGLTAQYP 123 (148) T ss_pred CcccchHHHhhcccccccccchhhhhhheeeeeeCCeeeEEeec---------cchhhhhhhhcCccccccCCCCccccC Confidence 111 000 0000 00000 011111 1123356677632 34689999999943 45799 Q ss_pred CcchhhHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_020079. 63 ARPFITDGAVISQNNIAKKMKQVFANYLMH 92 (162) Q Consensus 63 ~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G 92 (162) +||||--+-+ ..+++.+.+... |.| T Consensus 124 aRp~LG~s~~-d~~~i~~~i~~~----l~~ 148 (148) T protein:vir:79 124 ARELLGMDGV-DMEHITNLLLLH----LGA 148 (148) T ss_pred cccccCCCHH-HHHHHHHHHHHH----hcC Confidence 9999954422 334444444444 344 No 82 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=96.38 E-value=6.9e-06 Score=48.86 Aligned_cols=84 Identities=14% Similarity=0.172 Sum_probs=49.2 Q ss_pred CCCcccccchhHHH-HHHHHHHHhhC----CeEEEeecCCCCCCCCCCCHHHHHHHHhcC-------------------- Q lcl|NC_020079. 1 MESEILPGDDTDWE-TIIKKMMDLEQ----VQIEAGFLTNRRHPESDLTIPAIAAIQQYG-------------------- 55 (162) Q Consensus 1 M~~~i~~~~~~~l~-~l~~~l~~l~~----~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G-------------------- 55 (162) |+..+-... -.|. .|.....+-.+ ..-.|||-.-+. --+.+.||| T Consensus 5 akarv~~~~-G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkA---------PhghlvE~Ghw~~~~~~~~~dG~w~~~~~ 74 (119) T protein:vir:81 5 AKAFVNDET-GKLRSNLYVAYSPEESTNGVQTYAVSWRKKAA---------PHGHLLEFGHWQTHAAYKGKDGEWYSSSV 74 (119) T ss_pred cccccCCCc-cchhhhheeeeccccCCCCeEEEEeeccCCcC---------CcccccccceeeeeeeeeccCceeeecCc Confidence 555543221 1233 22233222211 234577765432 123344787 Q ss_pred --CcCCCCCCcchhhHHHHHHHHHHHHHHHHH----HHHHhccch Q lcl|NC_020079. 56 --NETNNIPARPFITDGAVISQNNIAKKMKQV----FANYLMHNV 94 (162) Q Consensus 56 --~~~~~IP~RpFlr~~~~~~~~~~~~~~~~~----~~~~l~G~~ 94 (162) .....+|+|||||++|+....+..+.+.+. +.+++.|+. T Consensus 75 ~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:81 75 KLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred cccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 123569999999999998888877777766 777888877 No 83 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=96.34 E-value=7.7e-06 Score=48.62 Aligned_cols=84 Identities=14% Similarity=0.188 Sum_probs=49.3 Q ss_pred CCCcccccchhHHH-HHHHHHHHhhC----CeEEEeecCCCCCCCCCCCHHHHHHHHhcCC------------------- Q lcl|NC_020079. 1 MESEILPGDDTDWE-TIIKKMMDLEQ----VQIEAGFLTNRRHPESDLTIPAIAAIQQYGN------------------- 56 (162) Q Consensus 1 M~~~i~~~~~~~l~-~l~~~l~~l~~----~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~------------------- 56 (162) |+..+-... -.|. .|.....+-.+ ..-.|||-.-+. --+.+.|||. T Consensus 5 akarv~~~~-G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkA---------PhghlvE~Ghw~~~~~~~~~dG~w~~~~~ 74 (119) T protein:vir:10 5 AKAFVNDET-GKLRSNLYVAYSTEESTNGVQTYAVSWRKKAA---------PHGHLLEFGHWQTHAAYKGKDGEWYSSSV 74 (119) T ss_pred cccccCCCc-cchhhhheeeeccccCCCCEEEEEeecCCCcC---------CcccccccceeeeeeeeeccCceeeecCc Confidence 555543221 1233 22233322211 234577765332 1234458881 Q ss_pred ---cCCCCCCcchhhHHHHHHHHHHHHHHHHH----HHHHhccch Q lcl|NC_020079. 57 ---ETNNIPARPFITDGAVISQNNIAKKMKQV----FANYLMHNV 94 (162) Q Consensus 57 ---~~~~IP~RpFlr~~~~~~~~~~~~~~~~~----~~~~l~G~~ 94 (162) ....+|+|||||++|+....+..+.+.+. +.+++.|+. T Consensus 75 ~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:10 75 KLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred cccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 13469999999999998888887777766 777888877 No 84 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=96.30 E-value=2.8e-06 Score=51.00 Aligned_cols=84 Identities=14% Similarity=0.152 Sum_probs=42.5 Q ss_pred CCCcccccch-hHHHHHHH-------------------HHHHhhCCeEEEeecCCC----CCCCC------CCCHHHHHH Q lcl|NC_020079. 1 MESEILPGDD-TDWETIIK-------------------KMMDLEQVQIEAGFLTNR----RHPES------DLTIPAIAA 50 (162) Q Consensus 1 M~~~i~~~~~-~~l~~l~~-------------------~l~~l~~~~v~VGi~~~~----~~~d~------g~~~A~iA~ 50 (162) |..++..++. +.|+++.+ ..+.+.- |.=|-+..+ ...++ -.+.+.+|. T Consensus 4 ~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aP--v~TG~Lr~SI~~~~~~~g~~~~~~v~~~~~YA~ 81 (142) T protein:vir:94 4 LNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCP--VDTGRLRSSIQAVPSGGRFSFSVTIGTNVTYAA 81 (142) T ss_pred eEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--ccchhhhccceeeeccCCceEEEEEecCcccch Confidence 4433322211 11222211 1111111 122222110 00001 014478999 Q ss_pred HHhcCCcCC-------------------------CCCCcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 51 IQQYGNETN-------------------------NIPARPFITDGAVISQNNIAKKMKQVF 86 (162) Q Consensus 51 ~~E~G~~~~-------------------------~IP~RpFlr~~~~~~~~~~~~~~~~~~ 86 (162) ++|||+... .+||||||+++++++++.+.+.++..- T Consensus 82 ~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~~~~~~~ 142 (142) T protein:vir:94 82 DVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRNHAKGIR 142 (142) T ss_pred hhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHHHHHHHHHHhcC Confidence 999997321 378999999999999888877776654 No 85 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=96.23 E-value=2.5e-05 Score=45.86 Aligned_cols=94 Identities=17% Similarity=0.260 Sum_probs=41.5 Q ss_pred CCCcccccchhHHHHHH--------HHHHHhhCCeEEEeecCCC-----CCCCCC-----CCHHHHHHHHhcCCc----- Q lcl|NC_020079. 1 MESEILPGDDTDWETII--------KKMMDLEQVQIEAGFLTNR-----RHPESD-----LTIPAIAAIQQYGNE----- 57 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~--------~~l~~l~~~~v~VGi~~~~-----~~~d~g-----~~~A~iA~~~E~G~~----- 57 (162) |...+...-.+.+.+.. ..++.+.. |.-|-+..+ ...+++ .+.+.+|.+.|||+. T Consensus 19 ~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~P--vdtG~Lr~SI~~~~~~~~~~~~g~V~~~~~ya~yvE~GTG~~~~~ 96 (182) T protein:vir:10 19 LPDIMAKATANAQENAIEQAEAYAVDELQSSIK--YSTGELTRSFKHEVKVDGDEVIGRWWNSSMVAVFREFGTGLVGER 96 (182) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCC--CCchhhhhceeeeeeecCCeEEEEeecCCCccceeecCccccccc Confidence 32222211111121111 11222211 111111100 000000 122456666666641 Q ss_pred -----------------------------------------------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020079. 58 -----------------------------------------------TNNIPARPFITDGAVISQNNIAKKMKQVFANYL 90 (162) Q Consensus 58 -----------------------------------------------~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l 90 (162) +.+.||||||+|++.++++++.+.+++.+...+ T Consensus 97 ~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~pA~~~~~~~i~~~i~~~i~~~l 176 (182) T protein:vir:10 97 SHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMTPAANKMAKEAPEIIKRSIDQEL 176 (182) T ss_pred CccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchHHHHHHhHHHHHHHHHHHHHHHH Confidence 124699999999999999998888887666644 Q ss_pred ccchHHHHHHHHHH Q lcl|NC_020079. 91 MHNVGLAVFEPIAR 104 (162) Q Consensus 91 ~G~~~~~~l~~iG~ 104 (162) .-. +|- T Consensus 177 ~~~--------~g~ 182 (182) T protein:vir:10 177 HDK--------LGG 182 (182) T ss_pred HHh--------hcC Confidence 211 111 No 86 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=96.19 E-value=7.9e-06 Score=48.56 Aligned_cols=78 Identities=17% Similarity=0.228 Sum_probs=43.7 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCC------------------------eEEEeecCCC---CCCCCC-----CCHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQV------------------------QIEAGFLTNR---RHPESD-----LTIPAI 48 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~------------------------~v~VGi~~~~---~~~d~g-----~~~A~i 48 (162) |+..+. ++++|.+.|+.+.+. -|.-|-+..+ ....++ .+.+.+ T Consensus 1 Ma~~~~-----Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y 75 (137) T protein:vir:10 1 MAKVKY-----GNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEY 75 (137) T ss_pred CchhHh-----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeCCcEEEEEecCCCc Confidence 655421 334444444333221 0111211111 001111 134678 Q ss_pred HHHHhcCCcC---------------------------CCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_020079. 49 AAIQQYGNET---------------------------NNIPARPFITDGAVISQNNIAKKMK 83 (162) Q Consensus 49 A~~~E~G~~~---------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (162) |.+.|||+.. ...|+||||++++++++.++.+.+. T Consensus 76 a~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 76 AVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred ccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Confidence 9999999622 2479999999999999999988887 No 87 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=96.19 E-value=5.4e-05 Score=43.98 Aligned_cols=87 Identities=13% Similarity=0.133 Sum_probs=59.4 Q ss_pred HHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhc------CCCCCCHHHHHHhhccCCCCCCchhhH Q lcl|NC_020079. 71 AVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQ------RYRPLSPVTIKIRQDKGNYSNHILIDT 144 (162) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppnsp~Ti~~k~~k~~~~~~PLidT 144 (162) +++ -+++.+.+...+.+ |.......+|..||..+....++.|... .|+|+++.|+..|..+ ..+||..+ T Consensus 1 m~~-~~~~~~~l~~ll~~-L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~---~~~~~~~~ 75 (149) T protein:vir:18 1 MSE-LTALQERLAGLIAS-LSPAARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGR---IKREMFAK 75 (149) T ss_pred Cch-HHHHHHHHHHHHHh-cCCchHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCc---ccchhhhh Confidence 222 24444444444444 2233345689999999999999999763 5889999997654422 35789999 Q ss_pred HHHHhhceeeeecCCCCC Q lcl|NC_020079. 145 AHMINAIETKITKSKSKK 162 (162) Q Consensus 145 G~l~~SIty~V~~~~~~~ 162 (162) +.+.+++++.+...+..= T Consensus 76 l~~~~~l~~~~~~~~~~v 93 (149) T protein:vir:18 76 LRTSRFMKAKGSDSAAVV 93 (149) T ss_pred hhhhhhhheeecCceeEE Confidence 999999988777665332 No 88 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=96.06 E-value=1e-05 Score=48.00 Aligned_cols=78 Identities=17% Similarity=0.223 Sum_probs=45.0 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCC------------------------eEEEeecCCC---CCCCCC-----CCHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQV------------------------QIEAGFLTNR---RHPESD-----LTIPAI 48 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~------------------------~v~VGi~~~~---~~~d~g-----~~~A~i 48 (162) |...+. ++++|.+.|+++.+. -|.-|-+..+ ...+++ .+.+.+ T Consensus 1 Ma~~~~-----g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y 75 (137) T protein:vir:93 1 MAKVKY-----GNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEY 75 (137) T ss_pred CchhHH-----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCc Confidence 666543 233333333222111 1122222211 001121 245788 Q ss_pred HHHHhcCCcC---------------------------CCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_020079. 49 AAIQQYGNET---------------------------NNIPARPFITDGAVISQNNIAKKMK 83 (162) Q Consensus 49 A~~~E~G~~~---------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (162) |.+.|||+.. ...|+||||++++++++..+.+.|. T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:93 76 AIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9999999721 3579999999999999999998888 No 89 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=96.06 E-value=1e-05 Score=48.00 Aligned_cols=78 Identities=17% Similarity=0.223 Sum_probs=45.0 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCC------------------------eEEEeecCCC---CCCCCC-----CCHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQV------------------------QIEAGFLTNR---RHPESD-----LTIPAI 48 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~------------------------~v~VGi~~~~---~~~d~g-----~~~A~i 48 (162) |...+. ++++|.+.|+++.+. -|.-|-+..+ ...+++ .+.+.+ T Consensus 1 Ma~~~~-----g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y 75 (137) T protein:vir:97 1 MAKVKY-----GNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEY 75 (137) T ss_pred CchhHH-----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCc Confidence 666543 233333333222111 1122222211 001121 245788 Q ss_pred HHHHhcCCcC---------------------------CCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_020079. 49 AAIQQYGNET---------------------------NNIPARPFITDGAVISQNNIAKKMK 83 (162) Q Consensus 49 A~~~E~G~~~---------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (162) |.+.|||+.. ...|+||||++++++++..+.+.|. T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:97 76 AIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9999999721 3579999999999999999998888 No 90 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=96.06 E-value=1e-05 Score=48.00 Aligned_cols=78 Identities=17% Similarity=0.223 Sum_probs=45.0 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCC------------------------eEEEeecCCC---CCCCCC-----CCHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQV------------------------QIEAGFLTNR---RHPESD-----LTIPAI 48 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~------------------------~v~VGi~~~~---~~~d~g-----~~~A~i 48 (162) |...+. ++++|.+.|+++.+. -|.-|-+..+ ...+++ .+.+.+ T Consensus 1 Ma~~~~-----g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Y 75 (137) T protein:vir:94 1 MAKVKY-----GNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEY 75 (137) T ss_pred CchhHH-----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCc Confidence 666543 233333333222111 1122222211 001121 245788 Q ss_pred HHHHhcCCcC---------------------------CCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_020079. 49 AAIQQYGNET---------------------------NNIPARPFITDGAVISQNNIAKKMK 83 (162) Q Consensus 49 A~~~E~G~~~---------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (162) |.+.|||+.. ...|+||||++++++++..+.+.|. T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 76 AIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9999999721 3579999999999999999998888 No 91 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=96.06 E-value=1.2e-05 Score=47.55 Aligned_cols=74 Identities=18% Similarity=0.258 Sum_probs=38.5 Q ss_pred CCCcccccchhHHHHHHHHHHH-hhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc-----CCCCCCcchhhHHHH-- Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMD-LEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE-----TNNIPARPFITDGAV-- 72 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~-l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~-----~~~IP~RpFlr~~~~-- 72 (162) ...++..+. -.|...|.. .....|.||- +..+|++|+||.. .++||+||||--+-+ T Consensus 71 ~~~~iL~~t----g~L~~Si~~~~~~~~v~vGt------------n~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~ 134 (155) T protein:vir:99 71 GPHPILQVT----NALARSVTTWADRNEAGIGS------------NLVYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQ 134 (155) T ss_pred CCCCcchhc----hhhhhhhhceecCCEEEEec------------CccchhhhhcccccCCCCccccCCccccCCCCccc Confidence 233333221 123333322 2445677763 1356999999964 458999999943221 Q ss_pred ---HHHHHHHHHHHHHHHHHhccc Q lcl|NC_020079. 73 ---ISQNNIAKKMKQVFANYLMHN 93 (162) Q Consensus 73 ---~~~~~~~~~~~~~~~~~l~G~ 93 (162) +-.+++.+.+...+.. ++ T Consensus 135 l~~e~~~~I~~~i~~~l~~---~~ 155 (155) T protein:vir:99 135 LAAGARQSILEIVLTALSR---NR 155 (155) T ss_pred cchHHHHHHHHHHHHHHhc---cC Confidence 2234444444444443 44 No 92 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=95.94 E-value=1.5e-05 Score=47.01 Aligned_cols=74 Identities=18% Similarity=0.250 Sum_probs=37.4 Q ss_pred CCCcccccchhHHHHHHHHHHH-hhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc-----CCCCCCcchhhHHHH-- Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMD-LEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE-----TNNIPARPFITDGAV-- 72 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~-l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~-----~~~IP~RpFlr~~~~-- 72 (162) ...++..+. -.|...|.. .....|.||- +..+|++|+||.. .++||+||||--.-+ T Consensus 71 ~~~~iL~~t----G~L~~Si~~~~~~~~v~vGt------------~~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~ 134 (155) T protein:vir:79 71 GPHPILQVT----NALARSVTTWADRNEAGIGS------------NLVYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQ 134 (155) T ss_pred CCCCccccc----hhhhhhhhceecCCEEEEec------------CchhhhhhhcccccCCCCccccCCccccCCCCccc Confidence 222232221 123333322 2344566653 2467999999964 458999999943222 Q ss_pred ---HHHHHHHHHHHHHHHHHhccc Q lcl|NC_020079. 73 ---ISQNNIAKKMKQVFANYLMHN 93 (162) Q Consensus 73 ---~~~~~~~~~~~~~~~~~l~G~ 93 (162) +-.+++.+.+...+. +|+ T Consensus 135 l~~~~~~~I~~~i~~~l~---r~r 155 (155) T protein:vir:79 135 LAAGARQSILEVVLTALS---RNR 155 (155) T ss_pred cchHHHHHHHHHHHHHHH---hcC Confidence 122334444444443 466 No 93 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=95.79 E-value=1.7e-05 Score=46.73 Aligned_cols=78 Identities=15% Similarity=0.216 Sum_probs=45.2 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCC------------------------eEEEeecCCC-C--CCCCC-----CCHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQV------------------------QIEAGFLTNR-R--HPESD-----LTIPAI 48 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~------------------------~v~VGi~~~~-~--~~d~g-----~~~A~i 48 (162) |...+. ++++|.+.|+++.+. -|.-|-+..+ + ..+++ .+.+.+ T Consensus 1 Ma~~~~-----G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~~~~~~~~V~~~~~Y 75 (137) T protein:vir:95 1 MAKVKY-----GNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEY 75 (137) T ss_pred CchhHH-----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEeeCCceEEEEecCCCc Confidence 766553 233333333322211 1122222211 0 01111 245788 Q ss_pred HHHHhcCCcC---------------------------CCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_020079. 49 AAIQQYGNET---------------------------NNIPARPFITDGAVISQNNIAKKMK 83 (162) Q Consensus 49 A~~~E~G~~~---------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (162) |.+.|||+.. ...|+||||++++++++.++.+.+. T Consensus 76 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 76 AIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9999999722 3579999999999999999998888 No 94 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=95.72 E-value=9.5e-06 Score=48.11 Aligned_cols=89 Identities=8% Similarity=-0.018 Sum_probs=45.5 Q ss_pred CCCcccccchhHHHHHHHH-H-------HHhhC--CeEEEeecCCCC-C--CCCCC-----CHHHHHHHHhcCCcC---- Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKK-M-------MDLEQ--VQIEAGFLTNRR-H--PESDL-----TIPAIAAIQQYGNET---- 58 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~-l-------~~l~~--~~v~VGi~~~~~-~--~d~g~-----~~A~iA~~~E~G~~~---- 58 (162) ++...... ..-.+.+.+. + +...+ .-|.=|-+..+- + ..+|. +.+.+|.+.|||+.. T Consensus 11 ~~~~~~~~-k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~~g~~~~V~~~~~YA~yVE~GTG~~~~~ 89 (141) T protein:vir:78 11 PKARKLIE-KKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRKSSKEVIVGNSSDYAIYYEFGTGEKSER 89 (141) T ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeecCCcEEEEecCCCccceeecCCcccccC Confidence 11111110 0001111110 1 11111 112333332210 1 01111 457899999999732 Q ss_pred --------------------CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020079. 59 --------------------NNIPARPFITDGAVISQNNIAKKMKQVFANYL 90 (162) Q Consensus 59 --------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l 90 (162) ..-|+||||++++.++++++.+.+++.+..+- T Consensus 90 ~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i~~~i~~~~~~l~ 141 (141) T protein:vir:78 90 GGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKVRVFTERALRGIN 141 (141) T ss_pred CCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHHHHHHHHHhhccC Confidence 23699999999999999999998888887642 No 95 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=95.70 E-value=0.00012 Score=42.14 Aligned_cols=91 Identities=9% Similarity=0.080 Sum_probs=64.5 Q ss_pred HHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhc------CCCCCCHHHHHHhhccC--CCCCCchh Q lcl|NC_020079. 71 AVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQ------RYRPLSPVTIKIRQDKG--NYSNHILI 142 (162) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppnsp~Ti~~k~~k~--~~~~~PLi 142 (162) ++++.+++.+.+..++..+ ...+...+|..||..+....++.|... .|+|+++.|...+...+ .....+|. T Consensus 1 m~~~~~~l~~~l~~ll~~l-~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~~~~~~m~ 79 (155) T protein:vir:79 1 MTDDLQALERWAGGLLAKL-SPAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGRVKREAMF 79 (155) T ss_pred CchHHHHHHHHHHHHHHhc-CChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCcccchhhh Confidence 4556666666666666552 333445689999999999999999763 57788888876554433 33467899 Q ss_pred hHHHHHhhceeeeecCCCCC Q lcl|NC_020079. 143 DTAHMINAIETKITKSKSKK 162 (162) Q Consensus 143 dTG~l~~SIty~V~~~~~~~ 162 (162) +.+.+.++|+|.+......= T Consensus 80 ~~l~~a~~l~~~~~~d~a~V 99 (155) T protein:vir:79 80 RKLRTARYLRIDVDSTGLAI 99 (155) T ss_pred hhhhhhheeeeeecCcEEEE Confidence 99999999999887554321 No 96 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=95.60 E-value=4e-05 Score=44.68 Aligned_cols=81 Identities=14% Similarity=0.200 Sum_probs=40.1 Q ss_pred CCCcccccch----hHHHHHHHHHHHhhCCeEEEeecCCC---CCCCCC-----CCHHHHHHHHhcCCcC---------- Q lcl|NC_020079. 1 MESEILPGDD----TDWETIIKKMMDLEQVQIEAGFLTNR---RHPESD-----LTIPAIAAIQQYGNET---------- 58 (162) Q Consensus 1 M~~~i~~~~~----~~l~~l~~~l~~l~~~~v~VGi~~~~---~~~d~g-----~~~A~iA~~~E~G~~~---------- 58 (162) +...+...-. .....+....+.+.. |.-|-+..+ ...+++ .+.+.+|.+.|||+.. T Consensus 30 ~~~~~~~~~~~al~~~a~~v~~~ak~~aP--vdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~VE~GT~~~~~~~~~~~~ 107 (149) T protein:vir:94 30 FDKKIEEWVKKGIAKTTTKIYNTAVALAP--VDLGFLEESIDFKYFDGGLSSVISVGADYAIYVEYGTGIYATGPGGSRA 107 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cccchhhcCeeEEeeCCcEEEEEecCCCcccccccCccccccCCCcccc Confidence 2221110000 111222222232222 111211110 000011 1336789999999632 Q ss_pred -----------------CCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_020079. 59 -----------------NNIPARPFITDGAVISQNNIAKKMK 83 (162) Q Consensus 59 -----------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (162) ...||||||+|++++++..+.+.|. T Consensus 108 ~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i~~~i~ 149 (149) T protein:vir:94 108 TKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred ccccceeecCccceecCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 2478999999999999998888887 No 97 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=95.53 E-value=4.8e-05 Score=44.28 Aligned_cols=82 Identities=13% Similarity=0.144 Sum_probs=39.6 Q ss_pred CCCcccccchhHHHHHHHH--HH-HhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc--------CCCCCCcchhhH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKK--MM-DLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE--------TNNIPARPFITD 69 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~--l~-~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~--------~~~IP~RpFlr~ 69 (162) .+.+-..+....+..+... |. ..+...+.|||.+ ++..||++|.||.. .++||+||||-- T Consensus 64 ~~~~~~~~~~~m~~~l~~~~~l~~~~~~~~a~vg~~G---------s~~~yA~iHQfG~~~~~~~~~~~v~iPaRp~LG~ 134 (156) T protein:vir:11 64 GKQGRIRRKIKMFQKLRTVRYLRAKGDAQAITVSFAG---------RIARIARVHQYGLRDRAEPGAPEVSYAQRLLLGF 134 (156) T ss_pred hhccccccchhhhhhhhhhheeeeeecCcEEEEEecC---------CchhhhhhhcccccccccCCCCcccccccccCCC Confidence 1111111110111111111 11 1234567788742 44688999999964 347999999954 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccchHH Q lcl|NC_020079. 70 GAVISQNNIAKKMKQVFANYLMHNVGL 96 (162) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~l~G~~~~ 96 (162) +- +..+++.+.+...+.. ...- T Consensus 135 s~-~d~~~i~~~i~~~l~~----~~~~ 156 (156) T protein:vir:11 135 DS-SDMETIQNGILAHIDA----NSPI 156 (156) T ss_pred CH-HHHHHHHHHHHHHHhh----cCCC Confidence 42 2334455555444443 2211 No 98 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=95.33 E-value=1.6e-05 Score=46.90 Aligned_cols=78 Identities=13% Similarity=0.208 Sum_probs=43.9 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCC------------------------eEEEeecCCC-C--CCCCC-----CCHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQV------------------------QIEAGFLTNR-R--HPESD-----LTIPAI 48 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~------------------------~v~VGi~~~~-~--~~d~g-----~~~A~i 48 (162) |..... ++++|.+.|+++.+. -|.-|-+..+ . ..+++ .+.+.+ T Consensus 1 Ma~~~~-----Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~~~g~~~~V~~~~~Y 75 (135) T protein:vir:96 1 MAKVKY-----GADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFENGGFTGVVKIGSNY 75 (135) T ss_pred Cchhhh-----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEeecCcEEEEEecCCCc Confidence 665321 233333333322111 1222322221 0 01122 245788 Q ss_pred HHHHhcCCcC-------------------------CCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_020079. 49 AAIQQYGNET-------------------------NNIPARPFITDGAVISQNNIAKKMK 83 (162) Q Consensus 49 A~~~E~G~~~-------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (162) |.+.|||+.. ..+|+||||++++++++..+.+.+. T Consensus 76 A~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 76 AVYVNYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQTFEQYFS 135 (135) T ss_pred cchhhcccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHHHHHHHhcC Confidence 9999999621 3489999999999999999888887 No 99 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=95.33 E-value=0.0002 Score=40.87 Aligned_cols=86 Identities=13% Similarity=0.113 Sum_probs=57.1 Q ss_pred HHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhc------CCCCCCHHHHHHhhccCCCCCCchhhHH Q lcl|NC_020079. 72 VISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQ------RYRPLSPVTIKIRQDKGNYSNHILIDTA 145 (162) Q Consensus 72 ~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppnsp~Ti~~k~~k~~~~~~PLidTG 145 (162) .++-+++.+.+...+.. +....-..+|..||..+....++.|.+. .|+|+++.|.+.++.. .+||.+++ T Consensus 1 m~~~~~l~~~L~~ll~~-l~~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~----~~~~~~~l 75 (148) T protein:vir:79 1 MSESRELEAWLAGMLTK-LDAPARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRI----RRAMFMRL 75 (148) T ss_pred CccHHHHHHHHHHHHHh-cCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccc----cccccchh Confidence 22235555555555555 2222335689999999999999999763 4778899887655532 36788888 Q ss_pred HHHhhceeeeecCCCCC Q lcl|NC_020079. 146 HMINAIETKITKSKSKK 162 (162) Q Consensus 146 ~l~~SIty~V~~~~~~~ 162 (162) .+..++++.+......- T Consensus 76 ~~~~~l~~~~~~~~~~v 92 (148) T protein:vir:79 76 RLARYMKTQADANTAVV 92 (148) T ss_pred hhhhheeeeeeCCeeeE Confidence 88888877765544332 No 100 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=95.31 E-value=2.9e-05 Score=45.47 Aligned_cols=81 Identities=14% Similarity=0.148 Sum_probs=40.6 Q ss_pred CCCcccccchhHHH----HHHHHHHHhhCCeEEEeecCCC--------CCCCCCCCHHHHHHHHhcCCcC---------- Q lcl|NC_020079. 1 MESEILPGDDTDWE----TIIKKMMDLEQVQIEAGFLTNR--------RHPESDLTIPAIAAIQQYGNET---------- 58 (162) Q Consensus 1 M~~~i~~~~~~~l~----~l~~~l~~l~~~~v~VGi~~~~--------~~~d~g~~~A~iA~~~E~G~~~---------- 58 (162) |...+...-...+. .+....+.+.. |.-|-+..+ ...-.-.+.+.+|.+.|||+.. T Consensus 18 ~~~~~~~~~~~~l~~~a~~~~~~ak~~~p--vdTG~L~~Si~~~~~~~g~~~~V~~~~~YA~yvE~GT~~~~~~~~~~~~ 95 (137) T protein:vir:96 18 YRDEMEEWVKKGILKTTLAIYNTAVALAP--VDLGFLKESIDFKVTDGGFSSVISVGAEYAIYVEFGTGIYATGPGGSRA 95 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCccchhcCceeEeecCceEEEEecCCCcccccccCccccccCCCcccc Confidence 22222111111122 22222222222 111111110 0000011336789999999622 Q ss_pred -----------------CCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_020079. 59 -----------------NNIPARPFITDGAVISQNNIAKKMK 83 (162) Q Consensus 59 -----------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (162) ..+|+||||++++++++..+.+.+. T Consensus 96 ~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:96 96 RKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRKVFNRYFS 137 (137) T ss_pred ccccceeeccCcceeecCCCCCCcchhHHHHHHHHHHHHhhC Confidence 3489999999999999999988888 No 101 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=95.23 E-value=6.8e-05 Score=43.45 Aligned_cols=81 Identities=17% Similarity=0.199 Sum_probs=40.7 Q ss_pred CCCcccccch----hHHHHHHHHHHHhhCCeEEEeecCCC---CCCCCC-----CCHHHHHHHHhcCCc----------- Q lcl|NC_020079. 1 MESEILPGDD----TDWETIIKKMMDLEQVQIEAGFLTNR---RHPESD-----LTIPAIAAIQQYGNE----------- 57 (162) Q Consensus 1 M~~~i~~~~~----~~l~~l~~~l~~l~~~~v~VGi~~~~---~~~d~g-----~~~A~iA~~~E~G~~----------- 57 (162) +...+...-. .....+....+.+.. |.-|-+..+ ...+++ .+.+.+|.+.|||+. T Consensus 18 ~~~~~~~~~~~al~~~a~~v~~~ak~~aP--vdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~ 95 (137) T protein:vir:94 18 YERDIERWVKRGIAKTTVKIHNTIISLMP--VDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRA 95 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCcchhhcCceeEeecCcEEEEEecCCCcccccccCccccccCCCcccc Confidence 2211110000 011222222333322 111111110 000011 133678999999952 Q ss_pred ----------------CCCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_020079. 58 ----------------TNNIPARPFITDGAVISQNNIAKKMK 83 (162) Q Consensus 58 ----------------~~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (162) +..+|+||||++++++++..+.+.|. T Consensus 96 ~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 96 KKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRVFFNKYFS 137 (137) T ss_pred cccccceeccCCceeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 23689999999999999999998888 No 102 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=95.16 E-value=7.4e-05 Score=43.24 Aligned_cols=81 Identities=14% Similarity=0.197 Sum_probs=40.3 Q ss_pred CCCcccccc----hhHHHHHHHHHHHhhCCeEEEeecCCC---CCCCCC-----CCHHHHHHHHhcCCcC---------- Q lcl|NC_020079. 1 MESEILPGD----DTDWETIIKKMMDLEQVQIEAGFLTNR---RHPESD-----LTIPAIAAIQQYGNET---------- 58 (162) Q Consensus 1 M~~~i~~~~----~~~l~~l~~~l~~l~~~~v~VGi~~~~---~~~d~g-----~~~A~iA~~~E~G~~~---------- 58 (162) +...+...- ......+....+.+.. |.-|.+..+ ...+++ .+.+.+|...|||+.. T Consensus 30 ~~~~~~~~~~~~l~~~a~~v~~~ak~~aP--vdTG~L~~SI~~~~~~~g~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~ 107 (149) T protein:vir:10 30 FDKKIEEWVKKGIAKTTTKIYNTAVALAP--VDLGFLEESIDFKYFDGGLSSVISVGADYAIYVEYGTGIYATGPGGSRA 107 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cccchhhccceEEecCCcEEEEEecCCCcccccccCccccccCCccccc Confidence 111111100 0111122222222222 111221111 000011 1336789999999632 Q ss_pred -----------------CCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_020079. 59 -----------------NNIPARPFITDGAVISQNNIAKKMK 83 (162) Q Consensus 59 -----------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (162) ...||||||++++++++..+.+.|. T Consensus 108 ~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i~~~i~ 149 (149) T protein:vir:10 108 TKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred ccccceeeccccceecCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 3478999999999999999988887 No 103 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=95.11 E-value=3.2e-05 Score=45.21 Aligned_cols=81 Identities=19% Similarity=0.261 Sum_probs=42.5 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCC---CCCCCCC-----CHHHHHHHHhcCCc--------------- Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNR---RHPESDL-----TIPAIAAIQQYGNE--------------- 57 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~---~~~d~g~-----~~A~iA~~~E~G~~--------------- 57 (162) |+--+..-=.+.-..+...++.+.. |.-|-+..+ ...++++ +.+++|.+.|||+. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aP--v~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~~ 78 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIP 78 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCC--cCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCcccccccc Confidence 3322221111112223333333322 112222111 0011111 45789999999932 Q ss_pred ------------CCCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_020079. 58 ------------TNNIPARPFITDGAVISQNNIAKKMK 83 (162) Q Consensus 58 ------------~~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (162) +...|+||||++++++++..+.+.|. T Consensus 79 ~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 79 WSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 23589999999999999998887777 No 104 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=95.11 E-value=3.2e-05 Score=45.21 Aligned_cols=81 Identities=19% Similarity=0.261 Sum_probs=42.5 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCC---CCCCCCC-----CHHHHHHHHhcCCc--------------- Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNR---RHPESDL-----TIPAIAAIQQYGNE--------------- 57 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~---~~~d~g~-----~~A~iA~~~E~G~~--------------- 57 (162) |+--+..-=.+.-..+...++.+.. |.-|-+..+ ...++++ +.+++|.+.|||+. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aP--v~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~~ 78 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIP 78 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCC--cCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCcccccccc Confidence 3322221111112223333333322 112222111 0011111 45789999999932 Q ss_pred ------------CCCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_020079. 58 ------------TNNIPARPFITDGAVISQNNIAKKMK 83 (162) Q Consensus 58 ------------~~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (162) +...|+||||++++++++..+.+.|. T Consensus 79 ~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 79 WSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 23589999999999999998887777 No 105 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=94.94 E-value=3.5e-05 Score=45.05 Aligned_cols=81 Identities=19% Similarity=0.261 Sum_probs=41.8 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCC---CCCCCCC-----CHHHHHHHHhcCCc--------------- Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNR---RHPESDL-----TIPAIAAIQQYGNE--------------- 57 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~---~~~d~g~-----~~A~iA~~~E~G~~--------------- 57 (162) |.--+..-=.+....+...++.+.. |.-|-+..+ ...++++ +.+++|.+.|||+. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~ap--v~TG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~yvE~GTg~~~~~~~~~~~~~~~ 78 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKNIP 78 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhCC--ccccccccceeEEeecCcEEEEEecCCCccceeecCccccccCCCcccccccc Confidence 2222221111112222223333322 122222111 0011111 45778999999942 Q ss_pred ------------CCCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_020079. 58 ------------TNNIPARPFITDGAVISQNNIAKKMK 83 (162) Q Consensus 58 ------------~~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (162) +...|+||||++++++++..+.+.|. T Consensus 79 ~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 79 WSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ceeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 23589999999999999998888777 No 106 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=94.65 E-value=6.1e-05 Score=43.70 Aligned_cols=77 Identities=13% Similarity=0.179 Sum_probs=53.8 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHHHHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVISQNNIAK 80 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~ 80 (162) |+-.|.....+ +.......+.|||... +.+|-+-|||+ .++||.||+..|..+.++++.+ T Consensus 61 laD~I~~s~~~--------~dg~~~g~~~VG~~k~----------~~~A~f~n~GT--~k~~~~hFie~t~~e~~~evl~ 120 (139) T protein:vir:10 61 LSEDIRSAAGD--------IDGDHNGSSTVGFHNK----------AHIARFLNDGT--KYIRADHFVDNARDDAKDAVFA 120 (139) T ss_pred hhhcceecCcc--------cccccceeeeeCCCCC----------cceEeecccCc--cccCCCchHHHHHHHHHHHHHH Confidence 55555433311 1111234467899421 46789999995 7899999999999999999999 Q ss_pred HHHHHHHHHhccch--HHH Q lcl|NC_020079. 81 KMKQVFANYLMHNV--GLA 97 (162) Q Consensus 81 ~~~~~~~~~l~G~~--~~~ 97 (162) .+...+..+|...- .+. T Consensus 121 a~~~~~k~~l~~~~~~~~~ 139 (139) T protein:vir:10 121 AEAEKYQAMIAKANGGGDK 139 (139) T ss_pred HHHHHHHHHHhhcCCCCCC Confidence 99999999886542 233 No 107 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=94.42 E-value=0.00017 Score=41.19 Aligned_cols=88 Identities=10% Similarity=0.144 Sum_probs=50.0 Q ss_pred CCCcccccchhH-HHHHHHHHHH------------hhC---CeEEEeecCCC-CCC--------CCC-------CCHHHH Q lcl|NC_020079. 1 MESEILPGDDTD-WETIIKKMMD------------LEQ---VQIEAGFLTNR-RHP--------ESD-------LTIPAI 48 (162) Q Consensus 1 M~~~i~~~~~~~-l~~l~~~l~~------------l~~---~~v~VGi~~~~-~~~--------d~g-------~~~A~i 48 (162) |..+|++++... +.+-++++.+ .++ ..++-+-|... .|. .++ .+--.| T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~~~~~~~~v~~~~~~y~l 80 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQKLKNGDQVIYQKAPTYRL 80 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccccceeeeecCCeeEEEEEecCCcce Confidence 999999887522 2222222111 100 11111111110 000 000 011246 Q ss_pred HHHHhcCCcCCC---CCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 49 AAIQQYGNETNN---IPARPFITDGAVISQNNIAKKMKQVFAN 88 (162) Q Consensus 49 A~~~E~G~~~~~---IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (162) +-+.|||...++ +|+||||+|+.+...+.+.+.++..+.+ T Consensus 81 ~HLLE~GHa~r~GGrV~a~phI~paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 81 THLLENGHAKRNGGRVSPKVHIAPVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred EEeeecceeecCCceeCcchhhhHHHHHHHHHHHHHHHHHhcC Confidence 777799954443 8999999999999999999999999988 No 108 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=94.31 E-value=0.00012 Score=42.10 Aligned_cols=79 Identities=18% Similarity=0.092 Sum_probs=51.9 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHH--HHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVIS--QNNI 78 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~--~~~~ 78 (162) |+-.|+....+ +.......+.|||... ..+++|-+.++|+ .++|+-||+..+..+. +.++ T Consensus 61 laD~I~~~~~~--------~DG~~dg~s~VG~~~~--------~~~~~A~f~n~GT--~k~~~~hFve~~~~~a~~k~~V 122 (141) T protein:vir:50 61 MADGLAIQSTN--------ADGRKNGVSTVGWKNN--------YHAQNARRLNDGT--KKYRADHFVTNVQNDSTVQKKV 122 (141) T ss_pred cccceeeccCc--------cccccCCeeeeccCCC--------ccceeeeccccCc--cccCCCchhHHHHHhhhhHHHH Confidence 65555543311 1112234567999532 2369999999995 6899999999999865 6778 Q ss_pred HHHHHHHHHHHhccchHHH Q lcl|NC_020079. 79 AKKMKQVFANYLMHNVGLA 97 (162) Q Consensus 79 ~~~~~~~~~~~l~G~~~~~ 97 (162) .+.+...++++|+-+-... T Consensus 123 l~A~~~~~k~~l~~~~~~~ 141 (141) T protein:vir:50 123 LLEKKRNTKNSLEEKEGCD 141 (141) T ss_pred HHHHHHHHHHHHHhccCCC Confidence 8888888888773321111 No 109 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=94.20 E-value=4.7e-05 Score=44.31 Aligned_cols=77 Identities=8% Similarity=0.091 Sum_probs=48.0 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCc-chhhHHHHHHHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPAR-PFITDGAVISQNNIA 79 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~R-pFlr~~~~~~~~~~~ 79 (162) |...+-.. +.++.++...++.- .-+.||+. -+-+.++-.+|||+. ..|+| |||.++++++.++.. T Consensus 42 ~~~n~P~~-tg~lkkik~~~kk~--g~~~VG~~---------ks~~fy~kF~EFGTS--km~a~~pF~~~a~~~~~~eA~ 107 (119) T protein:vir:10 42 IEKNSPIK-SGRLSKVKIRVKNT--GLATEGTA---------SSSEFYDIFQNFGTS--EQKAHVGYFDRAVDETTNEAV 107 (119) T ss_pred HhhcCCcc-cCCcceeeeeeecC--ceeEeccC---------Ccchhhhhhcccccc--ccCCCCCccccccccChHHHH Confidence 22222111 11122222221111 13666662 144689999999964 68999 999999999999999 Q ss_pred HHHHHHHHHHhc Q lcl|NC_020079. 80 KKMKQVFANYLM 91 (162) Q Consensus 80 ~~~~~~~~~~l~ 91 (162) ..+...+..=++ T Consensus 108 ~~~~~el~~~~r 119 (119) T protein:vir:10 108 EEVAEIIFRKMR 119 (119) T ss_pred HHHHHHHHHhcC Confidence 888888877555 No 110 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=94.06 E-value=0.00085 Score=37.42 Aligned_cols=90 Identities=12% Similarity=0.107 Sum_probs=55.9 Q ss_pred HHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhc------CCCCCCHHHHHHhhccCCCCCCchhhH Q lcl|NC_020079. 71 AVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQ------RYRPLSPVTIKIRQDKGNYSNHILIDT 144 (162) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppnsp~Ti~~k~~k~~~~~~PLidT 144 (162) +++...++.+.+..++.+ +.......+|..||..+....++.|... .|+|+++.|++.|....+. ..+|... T Consensus 1 m~~~~~~l~~~L~~ll~~-L~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~~-~~~m~~~ 78 (156) T protein:vir:11 1 MADSLEALEDWAGPILRA-LEPGPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQGRIRR-KIKMFQK 78 (156) T ss_pred CchhHHHHHHHHHHHHHh-cCCcchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhcccccc-chhhhhh Confidence 555566666666666655 2333345689999999999999999763 5788999998766532221 2344444 Q ss_pred HHHHhhceeeeecCCCCC Q lcl|NC_020079. 145 AHMINAIETKITKSKSKK 162 (162) Q Consensus 145 G~l~~SIty~V~~~~~~~ 162 (162) ..+..+|++.+...+..= T Consensus 79 l~~~~~l~~~~~~~~a~v 96 (156) T protein:vir:11 79 LRTVRYLRAKGDAQAITV 96 (156) T ss_pred hhhhheeeeeecCcEEEE Confidence 444445666654332221 No 111 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=93.91 E-value=0.00011 Score=42.21 Aligned_cols=76 Identities=13% Similarity=0.102 Sum_probs=51.0 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHH--HHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVIS--QNNI 78 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~--~~~~ 78 (162) |+-.|+....+ +.......+.|||... ..|++|-+.++|+ ..+|+.+|+..|.++. ++++ T Consensus 61 laD~I~~~~~~--------idg~~dG~s~VG~~k~--------~~a~~a~f~NdGT--~k~~~~hFve~t~~e~~~~~~v 122 (140) T protein:vir:48 61 MADGLAVQSTN--------VDGRKNGVATVGWKNN--------YHAQNARRLNDGT--KKYRADHFVTNVQNDSAVRDKV 122 (140) T ss_pred ccccceecccc--------cccccccceeecccCC--------CceeEEeecccCc--cccCCCchHHHHHHhhhhHHHH Confidence 44444432211 1111233456888531 2378999999995 6899999999999865 6788 Q ss_pred HHHHHHHHHHHh--ccch Q lcl|NC_020079. 79 AKKMKQVFANYL--MHNV 94 (162) Q Consensus 79 ~~~~~~~~~~~l--~G~~ 94 (162) .+.+...+.++| .|.+ T Consensus 123 l~A~~~~y~~~l~kk~~~ 140 (140) T protein:vir:48 123 LLAEKEEYEKLIRKKGGE 140 (140) T ss_pred HHHHHHHHHHHHHhhcCC Confidence 888888888887 4555 No 112 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=93.75 E-value=0.00024 Score=40.46 Aligned_cols=90 Identities=11% Similarity=0.181 Sum_probs=45.3 Q ss_pred CCCcccccch----hHHHHHHHHHHHhhCC---eEEEeecCCCCCCCCCC-------CHHHHHHHHhcCCcCCC---CCC Q lcl|NC_020079. 1 MESEILPGDD----TDWETIIKKMMDLEQV---QIEAGFLTNRRHPESDL-------TIPAIAAIQQYGNETNN---IPA 63 (162) Q Consensus 1 M~~~i~~~~~----~~l~~l~~~l~~l~~~---~v~VGi~~~~~~~d~g~-------~~A~iA~~~E~G~~~~~---IP~ 63 (162) |.-.+...=. ..-+.+.+.+++.... ...=+|-.....+.++. +-..++-+.|||.-..+ +|+ T Consensus 20 y~~~v~~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~~~g~~~~vv~~~~~~~l~HLLEfGha~r~gGrV~a 99 (126) T protein:vir:81 20 YTDDVAEGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKEDGYGTTKRIIWNKKHYRRVHLLEFGHAKVNGGRVKE 99 (126) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccchhhccccccccccCCcceEEEeccCCCCceeeeecceecCCCCccCC Confidence 2222110000 1123334444443331 11112211111111110 11355678999964332 799 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_020079. 64 RPFITDGAVISQNNIAKKMKQVFANYLMHN 93 (162) Q Consensus 64 RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~ 93 (162) ||||+|+++...+++.+.++..+.. |+ T Consensus 100 ~Phi~Pa~e~~~~~~~~~i~~~l~~---gg 126 (126) T protein:vir:81 100 YPHLRPAYDKHGARLPDELKRVIEN---GG 126 (126) T ss_pred CcchHHHHHHHHHHHHHHHHHHhhc---CC Confidence 9999999999888888888888775 55 No 113 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=93.51 E-value=0.00012 Score=42.17 Aligned_cols=78 Identities=14% Similarity=0.110 Sum_probs=51.4 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHH--HHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVIS--QNNI 78 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~--~~~~ 78 (162) |+-.|+..+.+ +.......+.|||... ..+++|-+.++|+ .++|+-||+..+.++. +.++ T Consensus 61 laD~I~~~~~~--------iDg~~~g~s~VG~~kk--------~~a~~A~f~n~GT--~k~~~~hFve~~~~e~~~k~~v 122 (140) T protein:vir:48 61 MADGLSVQSTN--------VDGRKNGVSTVGWVNR--------YHAQNARRLNDGT--KKYRADHFVTNVQNDSAVQTKV 122 (140) T ss_pred chhceeecccc--------cccccCceeeeccCCC--------cceeeeeccccCc--cccCCCchhHHHHHhhhhHHHH Confidence 55554433211 1111234567888531 3379999999995 7899999999999976 6778 Q ss_pred HHHHHHHHHHHhccchHH Q lcl|NC_020079. 79 AKKMKQVFANYLMHNVGL 96 (162) Q Consensus 79 ~~~~~~~~~~~l~G~~~~ 96 (162) .+.+...+.++|+-.-.+ T Consensus 123 l~A~~~~~~~~l~~~~~~ 140 (140) T protein:vir:48 123 LLAEKEEYEKLIRKKGGE 140 (140) T ss_pred HHHHHHHHHHHHHhhcCC Confidence 888888888877432222 No 114 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=93.01 E-value=0.00021 Score=40.70 Aligned_cols=77 Identities=14% Similarity=0.209 Sum_probs=53.7 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHHHHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVISQNNIAK 80 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~ 80 (162) |+-.|+.... .+.......+.|||.. . +.+|-+-|+|+ .++|+.+|+..|..+.++++.+ T Consensus 61 laD~I~~~~~--------~idg~~~g~~~VG~~~---------~-~~~Ahf~n~GT--~~~~~~hFie~t~~e~~~ev~~ 120 (139) T protein:vir:10 61 LSEDISSAAG--------DIDGDHNGSSTVGFHN---------K-AHIARFLNDGT--KNIRADHFVDNARDDAKDAVFA 120 (139) T ss_pred ccccceecCc--------cccccccccceeCCCC---------C-ceeeeeeccCc--cccCCCchHHHHHHHHHHHHHH Confidence 5555543321 1111223457899942 1 46788999994 7899999999999999999999 Q ss_pred HHHHHHHHHhccchH--HH Q lcl|NC_020079. 81 KMKQVFANYLMHNVG--LA 97 (162) Q Consensus 81 ~~~~~~~~~l~G~~~--~~ 97 (162) .+...+..+|...-. +. T Consensus 121 a~~~~~ke~l~~~~~~~~~ 139 (139) T protein:vir:10 121 AEAEKYQAMIAKANGGDSK 139 (139) T ss_pred HHHHHHHHHHhhcCCCCCC Confidence 999999999855321 22 No 115 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=92.55 E-value=0.00019 Score=40.96 Aligned_cols=84 Identities=11% Similarity=0.080 Sum_probs=41.3 Q ss_pred CCCcccccch-hHHHHHHHH---------------HHHhhCC--eEEEeecCCC--------CCC---CC-CCCHHHHHH Q lcl|NC_020079. 1 MESEILPGDD-TDWETIIKK---------------MMDLEQV--QIEAGFLTNR--------RHP---ES-DLTIPAIAA 50 (162) Q Consensus 1 M~~~i~~~~~-~~l~~l~~~---------------l~~l~~~--~v~VGi~~~~--------~~~---d~-g~~~A~iA~ 50 (162) |..++..+.. ..++.+.++ ++...+. -|.-|.+..+ ..+ .. -.+.+.+|. T Consensus 2 ~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA~ 81 (142) T protein:vir:99 2 VQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYAA 81 (142) T ss_pred ceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCccccc Confidence 5666654432 222222221 1111110 1222322211 000 01 125688999 Q ss_pred HHhcCCcC------------------------C---CCCCcchhhHHHHHHHHHHHHHHHH Q lcl|NC_020079. 51 IQQYGNET------------------------N---NIPARPFITDGAVISQNNIAKKMKQ 84 (162) Q Consensus 51 ~~E~G~~~------------------------~---~IP~RpFlr~~~~~~~~~~~~~~~~ 84 (162) ++|||+.. + ..||||||+++++.+.++....... T Consensus 82 ~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:99 82 AVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred eeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 99999731 1 2459999999999988765443333 No 116 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=92.55 E-value=0.00019 Score=40.96 Aligned_cols=84 Identities=11% Similarity=0.080 Sum_probs=41.3 Q ss_pred CCCcccccch-hHHHHHHHH---------------HHHhhCC--eEEEeecCCC--------CCC---CC-CCCHHHHHH Q lcl|NC_020079. 1 MESEILPGDD-TDWETIIKK---------------MMDLEQV--QIEAGFLTNR--------RHP---ES-DLTIPAIAA 50 (162) Q Consensus 1 M~~~i~~~~~-~~l~~l~~~---------------l~~l~~~--~v~VGi~~~~--------~~~---d~-g~~~A~iA~ 50 (162) |..++..+.. ..++.+.++ ++...+. -|.-|.+..+ ..+ .. -.+.+.+|. T Consensus 2 ~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA~ 81 (142) T protein:vir:86 2 VQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYAA 81 (142) T ss_pred ceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCccccc Confidence 5666654432 222222221 1111110 1222322211 000 01 125688999 Q ss_pred HHhcCCcC------------------------C---CCCCcchhhHHHHHHHHHHHHHHHH Q lcl|NC_020079. 51 IQQYGNET------------------------N---NIPARPFITDGAVISQNNIAKKMKQ 84 (162) Q Consensus 51 ~~E~G~~~------------------------~---~IP~RpFlr~~~~~~~~~~~~~~~~ 84 (162) ++|||+.. + ..||||||+++++.+.++....... T Consensus 82 ~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:86 82 AVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred eeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 99999731 1 2459999999999988765443333 No 117 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=92.23 E-value=0.00053 Score=38.55 Aligned_cols=67 Identities=7% Similarity=0.019 Sum_probs=30.2 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHH Q lcl|NC_020079. 67 ITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAH 146 (162) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~ 146 (162) |-.++ ...+++.+.++.+-.++ .....++|+..+..+++.++. ..| +|||. T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~--~~~~~~~~~~~a~~i~~~ak~-------------------------~aP-vdTG~ 51 (137) T protein:vir:93 1 MAKVK-YGNWDLVKELENYERDM--ERWVKRGIAKTTAKIHNTIIS-------------------------LMP-VDTGY 51 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH-------------------------hCC-ccccc Confidence 32222 13333333333322221 011123333333333332221 234 69999 Q ss_pred HHhhceeeeecCCCCC Q lcl|NC_020079. 147 MINAIETKITKSKSKK 162 (162) Q Consensus 147 l~~SIty~V~~~~~~~ 162 (162) |++||++++..++-+- T Consensus 52 Lr~SI~~~~~~~~~~~ 67 (137) T protein:vir:93 52 LRESVTMDFKDSGFTG 67 (137) T ss_pred hhccceeEeecCceEE Confidence 9999999987655222 No 118 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=92.23 E-value=0.00053 Score=38.55 Aligned_cols=67 Identities=7% Similarity=0.019 Sum_probs=30.2 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHH Q lcl|NC_020079. 67 ITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAH 146 (162) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~ 146 (162) |-.++ ...+++.+.++.+-.++ .....++|+..+..+++.++. ..| +|||. T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~--~~~~~~~~~~~a~~i~~~ak~-------------------------~aP-vdTG~ 51 (137) T protein:vir:94 1 MAKVK-YGNWDLVKELENYERDM--ERWVKRGIAKTTAKIHNTIIS-------------------------LMP-VDTGY 51 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH-------------------------hCC-ccccc Confidence 32222 13333333333322221 011123333333333332221 234 69999 Q ss_pred HHhhceeeeecCCCCC Q lcl|NC_020079. 147 MINAIETKITKSKSKK 162 (162) Q Consensus 147 l~~SIty~V~~~~~~~ 162 (162) |++||++++..++-+- T Consensus 52 Lr~SI~~~~~~~~~~~ 67 (137) T protein:vir:94 52 LRESVTMDFKDSGFTG 67 (137) T ss_pred hhccceeEeecCceEE Confidence 9999999987655222 No 119 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=92.23 E-value=0.00053 Score=38.55 Aligned_cols=67 Identities=7% Similarity=0.019 Sum_probs=30.2 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHH Q lcl|NC_020079. 67 ITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAH 146 (162) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~ 146 (162) |-.++ ...+++.+.++.+-.++ .....++|+..+..+++.++. ..| +|||. T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~--~~~~~~~~~~~a~~i~~~ak~-------------------------~aP-vdTG~ 51 (137) T protein:vir:97 1 MAKVK-YGNWDLVKELENYERDM--ERWVKRGIAKTTAKIHNTIIS-------------------------LMP-VDTGY 51 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH-------------------------hCC-ccccc Confidence 32222 13333333333322221 011123333333333332221 234 69999 Q ss_pred HHhhceeeeecCCCCC Q lcl|NC_020079. 147 MINAIETKITKSKSKK 162 (162) Q Consensus 147 l~~SIty~V~~~~~~~ 162 (162) |++||++++..++-+- T Consensus 52 Lr~SI~~~~~~~~~~~ 67 (137) T protein:vir:97 52 LRESVTMDFKDSGFTG 67 (137) T ss_pred hhccceeEeecCceEE Confidence 9999999987655222 No 120 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=92.23 E-value=0.00039 Score=39.30 Aligned_cols=87 Identities=17% Similarity=0.159 Sum_probs=50.5 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHH--HHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVIS--QNNI 78 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~--~~~~ 78 (162) |+-.|+....+ +.......+.|||... ..+.+|-+.|+|+ .++|+.||++.+.++. ++++ T Consensus 61 laD~I~~s~~~--------idG~~dG~s~VG~~~~--------~~a~~a~f~n~GT--~km~~~hFie~tr~e~~~k~~v 122 (153) T protein:vir:49 61 MADGLAVQSTN--------ADGRKNGVSTVGWKNN--------YHAQNARRLNDGT--KKYRADHFITNVQNDSTVKNKV 122 (153) T ss_pred ccccceecccc--------ccccccceeeecccCC--------ccceeeeecccCc--ccCCCChhhHHHHHHhhHHHHH Confidence 44444432210 1111234567888642 2368899999995 7899999999999876 5678 Q ss_pred HHHHHHHHHHHhccchH----HHHHHHHHHHHHHHHHHHHHhcCCCCCCHHH Q lcl|NC_020079. 79 AKKMKQVFANYLMHNVG----LAVFEPIARASREGIAQAIAMQRYRPLSPVT 126 (162) Q Consensus 79 ~~~~~~~~~~~l~G~~~----~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~T 126 (162) .+.+...+..+|+-+.. ..-+.. ..+ | T Consensus 123 l~A~~~~~~~il~~~~~~~~~~~~~~~--~~~-------------------~ 153 (153) T protein:vir:49 123 LLAEKEEYEKLIRRKGGVYLSASNFKT--KRA-------------------T 153 (153) T ss_pred HHHHHHHHHHHHHhcCCeeeecccccc--ccC-------------------C Confidence 87777777777743321 000000 000 0 No 121 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=90.17 E-value=0.0012 Score=36.61 Aligned_cols=83 Identities=16% Similarity=0.167 Sum_probs=40.9 Q ss_pred CCCcc----cccchhHHHHHHHHHH--HhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc----------------- Q lcl|NC_020079. 1 MESEI----LPGDDTDWETIIKKMM--DLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE----------------- 57 (162) Q Consensus 1 M~~~i----~~~~~~~l~~l~~~l~--~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~----------------- 57 (162) ..+.= ..+....+.+|.+.+. ..+...+.++++.+ .++.||++|.||-. T Consensus 59 w~pRK~~~~k~k~~rm~~kL~~~~~~~~~~~~~~~~~~~~g--------~~~~IA~vHQ~G~~~rv~~~~~~~~~~~~~~ 130 (231) T protein:vir:37 59 WEKRKPVDGEIKNKRLLKKVLRYASILAEERGKGRIYYKNP--------LTGEIAQKQQDGFTEHFRVFATDKNKNGSGN 130 (231) T ss_pred CchhcccccchhhHHHHHHhHHhhccccccCCceEEeeecc--------hHHHHHHHhhcCcccccchhhhhhccCCCCC Confidence 32221 0011112444444332 22333344554432 46799999999820 Q ss_pred -------------------------------------------------------------------CCCCCCcchhhHH Q lcl|NC_020079. 58 -------------------------------------------------------------------TNNIPARPFITDG 70 (162) Q Consensus 58 -------------------------------------------------------------------~~~IP~RpFlr~~ 70 (162) ++.+|+||||-.. T Consensus 131 ~pATr~QAk~Lr~lGy~v~~~k~k~~k~~~rkps~kwI~~~ls~~qAgliIR~L~~k~~~~~~k~~W~I~~paR~FLG~~ 210 (231) T protein:vir:37 131 DRATIRQAQKLRSLGYRKRNGKNRQGKTKYRLYTIKEIRERLTRTWASMEIRRLENKVNAGNGKTNWEIHVPARPFLDTR 210 (231) T ss_pred CCCCHHHHHHHHHhcccccCCCCCCCCCCcCcCCHHHHHHhhhhHHHHHHHHHHhcccccccCcceeeeecCcccccCCC Confidence 0236788887544 Q ss_pred HHHHHHHHHHHHHHHHHHHhccchH Q lcl|NC_020079. 71 AVISQNNIAKKMKQVFANYLMHNVG 95 (162) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~l~G~~~ 95 (162) - +++.+++...+.+++.|..- T Consensus 211 ~----~e~~~~l~~~l~~i~~~~~~ 231 (231) T protein:vir:37 211 E----KENVDILREITLKFLSGEYK 231 (231) T ss_pred H----HHHHHHHHHHHHHHhcccCC Confidence 3 44455556666666666543 No 122 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=90.13 E-value=0.0013 Score=36.47 Aligned_cols=67 Identities=7% Similarity=-0.013 Sum_probs=30.5 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHH Q lcl|NC_020079. 67 ITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAH 146 (162) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~ 146 (162) |-..+ ...+++.+.|+..-..+. ....++|...+..+++.+|. ..| +|||. T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~--~~~~~~l~~~a~~~~~~ak~-------------------------~~p-vdTG~ 51 (137) T protein:vir:96 1 MAKVK-YGNWDLVAELEDYRDEME--EWVKKGILKTTLAIYNTAVA-------------------------LAP-VDLGF 51 (137) T ss_pred CchhH-hhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH-------------------------hCC-cCccc Confidence 22211 122333333333322210 11223344344333333221 234 68999 Q ss_pred HHhhceeeeecCCCCC Q lcl|NC_020079. 147 MINAIETKITKSKSKK 162 (162) Q Consensus 147 l~~SIty~V~~~~~~~ 162 (162) |++||+++|..++..- T Consensus 52 L~~Si~~~~~~~g~~~ 67 (137) T protein:vir:96 52 LKESIDFKVTDGGFSS 67 (137) T ss_pred hhcCceeEeecCceEE Confidence 9999999987665322 No 123 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=89.71 E-value=0.0015 Score=36.04 Aligned_cols=67 Identities=7% Similarity=0.020 Sum_probs=29.0 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHH Q lcl|NC_020079. 67 ITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAH 146 (162) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~ 146 (162) |-..+ ...+++.+.++++-..+ .+...++|+..+..+++.++ ...| +|||. T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~--~~~~~~~~~~~a~~v~~~ak-------------------------~~aP-v~TG~ 51 (137) T protein:vir:95 1 MAKVK-YGNWDLVKELENYERDM--ERWVKRGIAKTTAKIHNTII-------------------------SLMP-VDTGY 51 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH-------------------------HhCC-ccchh Confidence 32222 12333333333322221 01112233333333333222 1234 58999 Q ss_pred HHhhceeeeecCCCCC Q lcl|NC_020079. 147 MINAIETKITKSKSKK 162 (162) Q Consensus 147 l~~SIty~V~~~~~~~ 162 (162) |++||+++|..++-+= T Consensus 52 L~~Si~~~~~~~~~~~ 67 (137) T protein:vir:95 52 LRESVTMDFKDGGFTG 67 (137) T ss_pred hhcCeeeEeeCCceEE Confidence 9999999886653211 No 124 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=89.19 E-value=0.0017 Score=35.79 Aligned_cols=67 Identities=6% Similarity=-0.015 Sum_probs=29.9 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHH Q lcl|NC_020079. 67 ITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAH 146 (162) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~ 146 (162) |-... ...+++.+.|+.+-..+- ....++|+..+..+++.++. ..| +|||. T Consensus 1 Ma~~~-~G~~~l~~~L~~~~~~~~--~~~~~al~~~a~~v~~~ak~-------------------------~aP-vdTG~ 51 (137) T protein:vir:94 1 MAKVK-YGNWDLVKELENYERDIE--RWVKRGIAKTTVKIHNTIIS-------------------------LMP-VDTGY 51 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH-------------------------hCC-cCcch Confidence 21111 123333333333333221 11223333333333333321 234 58999 Q ss_pred HHhhceeeeecCCCCC Q lcl|NC_020079. 147 MINAIETKITKSKSKK 162 (162) Q Consensus 147 l~~SIty~V~~~~~~~ 162 (162) |++||++++..++.+= T Consensus 52 Lr~SI~~~~~~~~~~~ 67 (137) T protein:vir:94 52 LRESVTMDFKDGGFTG 67 (137) T ss_pred hhcCceeEeecCcEEE Confidence 9999999886654221 No 125 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=88.87 E-value=0.002 Score=35.41 Aligned_cols=67 Identities=4% Similarity=-0.031 Sum_probs=29.4 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHH Q lcl|NC_020079. 67 ITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAH 146 (162) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~ 146 (162) |-.. ....+++.+.++++-.++ ....+++|...+..+++.++. ..| +|||. T Consensus 1 Ma~~-~~Gl~~l~~~l~~~~~~~--~~~~~~al~~~a~~v~~~ak~-------------------------~ap-vdTG~ 51 (135) T protein:vir:96 1 MAKV-KYGADSIVVDLEKYSKDM--EKWVKKGITKTTLKIYNTAIH-------------------------LMP-VDTGF 51 (135) T ss_pred Cchh-hhhHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH-------------------------hCC-ccchh Confidence 2111 012233333333332221 011123333333333333221 234 79999 Q ss_pred HHhhceeeeecCCCCC Q lcl|NC_020079. 147 MINAIETKITKSKSKK 162 (162) Q Consensus 147 l~~SIty~V~~~~~~~ 162 (162) |++||+++|.+++-.- T Consensus 52 Lr~SI~~~~~~~g~~~ 67 (135) T protein:vir:96 52 LRQSTTVDFENGGFTG 67 (135) T ss_pred hhcceeEEeecCcEEE Confidence 9999999886554222 No 126 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=88.45 E-value=0.011 Score=31.41 Aligned_cols=82 Identities=10% Similarity=0.188 Sum_probs=45.3 Q ss_pred HHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcC------CCCCCHHHHHHhhccCCCCCCchhhH Q lcl|NC_020079. 71 AVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQR------YRPLSPVTIKIRQDKGNYSNHILIDT 144 (162) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~------~ppnsp~Ti~~k~~k~~~~~~PLidT 144 (162) ++++-.++.+.+...+.+. ....-..+|..||..+....++.|.... |+|+++.+...| .++++ T Consensus 1 M~~~~~~~~~~L~~ll~~L-~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~k---------~~~~~ 70 (152) T protein:vir:10 1 MSEPIEQVKTAFDSLLNNI-SKPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGVK---------SKIKS 70 (152) T ss_pred CchHHHHHHHHHHHHHHhc-CcchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhhc---------ccccc Confidence 4445555555555555542 2223356899999999999999998864 444454442211 23333 Q ss_pred HH----HHhh--ceeeeecCCCC------C Q lcl|NC_020079. 145 AH----MINA--IETKITKSKSK------K 162 (162) Q Consensus 145 G~----l~~S--Ity~V~~~~~~------~ 162 (162) +. |+.| ++|+....+.. - T Consensus 71 ~~m~~~L~~a~~l~~~a~~~~~~Vg~~Gt~ 100 (152) T protein:vir:10 71 GKMFDKITQPRFMRLRLESEGVSLGYEGGD 100 (152) T ss_pred hhHHHhhhhcceeeeeecCcEEEEEecCCc Confidence 44 3333 55554333221 1 No 127 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=88.26 E-value=0.0021 Score=35.28 Aligned_cols=67 Identities=9% Similarity=0.103 Sum_probs=31.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHH Q lcl|NC_020079. 67 ITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAH 146 (162) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~ 146 (162) |-... ..-+++.+.++..-.++- ...+++|+..+..+++.+|. ..| +|||. T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~--~~~~~al~~~a~~i~~~ak~-------------------------~aP-v~TG~ 51 (137) T protein:vir:10 1 MAKVK-YGNWDLVKELEEFEKETI--RWAKKGIAKTTTIIHNSIVS-------------------------NMP-VDTGY 51 (137) T ss_pred Cccch-hCHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH-------------------------hCC-cCcch Confidence 22111 122333333333322220 11234455555444444443 223 58999 Q ss_pred HHhhceeeeecCCCCC Q lcl|NC_020079. 147 MINAIETKITKSKSKK 162 (162) Q Consensus 147 l~~SIty~V~~~~~~~ 162 (162) |++||++++..++.+- T Consensus 52 Lr~SI~~~~~~~~~~~ 67 (137) T protein:vir:10 52 LRESVSMDFKKGGLTG 67 (137) T ss_pred hhcCeeeEecCCcEEE Confidence 9999999887655222 No 128 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=88.17 E-value=0.0024 Score=34.97 Aligned_cols=67 Identities=9% Similarity=0.068 Sum_probs=29.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHH Q lcl|NC_020079. 67 ITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAH 146 (162) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~ 146 (162) |-..+ ...+++.+.++..-.++ .+...++|+..+..+++.+|. ..| +|||. T Consensus 1 Ma~~~-~Gl~~l~~~l~~~~~~~--~~~~~~al~~~a~~i~~~ak~-------------------------~aP-vdTG~ 51 (137) T protein:vir:10 1 MAKVK-YGNWELVKELEDFEKET--IRWAKKGIAKTTTIIHNSIVS-------------------------NMP-VDTGY 51 (137) T ss_pred CchhH-hhHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH-------------------------hCC-cCcch Confidence 21111 12223333332222211 111233444444444444333 234 58999 Q ss_pred HHhhceeeeecCCCCC Q lcl|NC_020079. 147 MINAIETKITKSKSKK 162 (162) Q Consensus 147 l~~SIty~V~~~~~~~ 162 (162) |++||++++..++..- T Consensus 52 Lr~SI~~~~~~~~~~~ 67 (137) T protein:vir:10 52 LRESVSMDFKKGGLTG 67 (137) T ss_pred hhcCeeEEeeCCcEEE Confidence 9999999876544222 No 129 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=87.52 E-value=0.004 Score=33.75 Aligned_cols=72 Identities=10% Similarity=-0.021 Sum_probs=29.8 Q ss_pred CCcchhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCch Q lcl|NC_020079. 62 PARPFITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHIL 141 (162) Q Consensus 62 P~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PL 141 (162) =+|..++-.+ +..+++.+.+++.-..+ .+..+++|...+..+++.++. ..| T Consensus 1 m~~ms~~i~~-~g~~~l~~~l~~~~~~~--~~~v~~~l~~~a~~i~~~ak~-------------------------~ap- 51 (144) T protein:vir:59 1 MALMSVRIDP-SWRRIMSRNVRTFSGHV--LTQVEQVIIKTAEKIAGLAAS-------------------------LAP- 51 (144) T ss_pred CCcceeeehh-HHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH-------------------------hCC- Confidence 1233221110 11122222222222221 011233444444433333331 223 Q ss_pred hhHHHHHhhceeeeecCCCCC Q lcl|NC_020079. 142 IDTAHMINAIETKITKSKSKK 162 (162) Q Consensus 142 idTG~l~~SIty~V~~~~~~~ 162 (162) +|||.|++||++++..++.+- T Consensus 52 v~TG~Lr~SI~~~~~~~g~~~ 72 (144) T protein:vir:59 52 VDEGNLKNSIQIDYKNNGLTA 72 (144) T ss_pred ccchhhhcCeeEEeecCcEEE Confidence 589999999999986554221 No 130 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=86.49 E-value=0.0035 Score=34.07 Aligned_cols=68 Identities=7% Similarity=0.044 Sum_probs=32.9 Q ss_pred hhHH-HHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHH Q lcl|NC_020079. 67 ITDG-AVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTA 145 (162) Q Consensus 67 lr~~-~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG 145 (162) |-.. +.-+.+++.+.++.....+ ....+.+|+..+..+++.++ ...| +||| T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~--~~~~~~~l~~~a~~i~~~ak-------------------------~~aP-v~TG 52 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRL--TGAAREATEAAANDMVNMAK-------------------------GLCP-VDTG 52 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH-------------------------HhCC-ccch Confidence 2111 1112344444444443332 11223444444444433321 1344 6999 Q ss_pred HHHhhceeeeecCCCCC Q lcl|NC_020079. 146 HMINAIETKITKSKSKK 162 (162) Q Consensus 146 ~l~~SIty~V~~~~~~~ 162 (162) .|++||+++|...+..= T Consensus 53 ~Lr~SI~~~~~~~g~~~ 69 (142) T protein:vir:94 53 RLRSSIQAVPSGGRFSF 69 (142) T ss_pred hhhccceeeeccCCceE Confidence 99999999887765432 No 131 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=86.06 E-value=0.0038 Score=33.88 Aligned_cols=62 Identities=10% Similarity=0.061 Sum_probs=27.6 Q ss_pred hccchH-HHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHHHHhhceeeeecCCCCC Q lcl|NC_020079. 90 LMHNVG-LAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAHMINAIETKITKSKSKK 162 (162) Q Consensus 90 l~G~~~-~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~~SIty~V~~~~~~~ 162 (162) +.|-+- .+.|+.++..+...++..+...-. +..-..|. ..| +|||.|++||++.+.++.... T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~----~i~~~ak~------~aP-v~TG~Lr~sI~~~~~~~~~~~ 63 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAVDKELSKSAA----RIERQAKI------LAP-VDTGWLRAQIYSEQQRLLHYR 63 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHh------cCC-cCchhhhcceeeeecCcEEEE Confidence 333321 233444444444444444432110 00001111 345 799999999998764322111 No 132 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=85.84 E-value=0.0013 Score=36.48 Aligned_cols=72 Identities=8% Similarity=0.054 Sum_probs=30.1 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHH Q lcl|NC_020079. 67 ITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAH 146 (162) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~ 146 (162) |-.-=-...+++. +-|+.+...+.+.+++.+.....- ....+.... + +..| +|||. T Consensus 1 m~~v~i~Gld~L~-----------------~kl~~~~~~~~~~v~~a~~~~~~~--~a~~v~~~a-k---~~~P-vdtG~ 56 (182) T protein:vir:10 1 MIEVELKGVNELR-----------------AKLKKLPDIMAKATANAQENAIEQ--AEAYAVDEL-Q---SSIK-YSTGE 56 (182) T ss_pred CeEEEEecHHHHH-----------------HHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHH-H---hhCC-CCchh Confidence 1000001112222 224444444444443333221000 011111111 0 2346 79999 Q ss_pred HHhhceeeeecCCCCC Q lcl|NC_020079. 147 MINAIETKITKSKSKK 162 (162) Q Consensus 147 l~~SIty~V~~~~~~~ 162 (162) |++||+++|..+++.- T Consensus 57 Lr~SI~~~~~~~~~~~ 72 (182) T protein:vir:10 57 LTRSFKHEVKVDGDEV 72 (182) T ss_pred hhhceeeeeeecCCeE Confidence 9999999988765432 No 133 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=84.04 E-value=0.0045 Score=33.43 Aligned_cols=46 Identities=11% Similarity=0.021 Sum_probs=23.9 Q ss_pred HHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHHHHhhceeeeecCCCC Q lcl|NC_020079. 82 MKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAHMINAIETKITKSKSK 161 (162) Q Consensus 82 ~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~~SIty~V~~~~~~ 161 (162) +++. .+++++..+..+++.+| ...| +|||.|++||++++..++-+ T Consensus 1 v~~~---------v~~~~~~~~~~i~~~ak-------------------------~~ap-v~TG~Lr~SI~~~~~~~~~~ 45 (116) T protein:vir:95 1 MERW---------VKRGIAKTTAKIHNTII-------------------------SLMP-VDTGYLRESVTMDFKDGGFT 45 (116) T ss_pred ChHH---------HHHHHHHHHHHHHHHHH-------------------------hhCC-ccccccccceeEEeecCcEE Confidence 1111 12333333333333322 1345 68999999999998765422 Q ss_pred C Q lcl|NC_020079. 162 K 162 (162) Q Consensus 162 ~ 162 (162) = T Consensus 46 ~ 46 (116) T protein:vir:95 46 G 46 (116) T ss_pred E Confidence 1 No 134 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=83.81 E-value=0.0034 Score=34.12 Aligned_cols=79 Identities=13% Similarity=0.225 Sum_probs=56.8 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCc-----chhhHHHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPAR-----PFITDGAVISQ 75 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~R-----pFlr~~~~~~~ 75 (162) |+-.|+.... ..+.......+.|||... ..+.+|-+.+.|+ .+.|+. +|+..+..+.+ T Consensus 76 laD~I~~~~~-------~~iDg~~dG~s~VGw~~~--------~~a~~a~f~NdGT--~~m~~k~~~gdHFvekt~~~~k 138 (159) T protein:vir:38 76 LQDSITYKPG-------YTADKLHTGDTDVGFEGK--------YYDFLAKIVNNGQ--HHMSPKRYKNMHFLDKAQQEAK 138 (159) T ss_pred cccceeeecC-------ccccccccceeeecccCC--------ccceEeeecccCc--cccCCCCccCChhHHHHHHHHH Confidence 5555543321 122233445788999642 2368999999996 567876 69999999999 Q ss_pred HHHHHHHHHHHHHHhccchHH Q lcl|NC_020079. 76 NNIAKKMKQVFANYLMHNVGL 96 (162) Q Consensus 76 ~~~~~~~~~~~~~~l~G~~~~ 96 (162) .++.+.+...+..+|+...-. T Consensus 139 ~~Vl~A~~~~~~~il~~~~~~ 159 (159) T protein:vir:38 139 KSVAEAELKAYKEVMNHDSDK 159 (159) T ss_pred HHHHHHHHHHHHHHhhcccCC Confidence 999999999999999776433 No 135 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=83.69 E-value=0.0013 Score=36.32 Aligned_cols=81 Identities=16% Similarity=0.174 Sum_probs=37.1 Q ss_pred CCCccccc-chhHHHH--------HHH----HHHHhhCCe--EEEeecCCC-CC---CCC-------CCCHHHHHHHHhc Q lcl|NC_020079. 1 MESEILPG-DDTDWET--------IIK----KMMDLEQVQ--IEAGFLTNR-RH---PES-------DLTIPAIAAIQQY 54 (162) Q Consensus 1 M~~~i~~~-~~~~l~~--------l~~----~l~~l~~~~--v~VGi~~~~-~~---~d~-------g~~~A~iA~~~E~ 54 (162) |.++.... |...+.+ .++ .++...+.. |.-|-+..+ ++ .++ -.+++.+|.++|| T Consensus 1 m~~s~~i~i~~~~l~~~v~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~ve~ 80 (137) T protein:vir:10 1 MPVTARIHINEPELERQTGAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTYRPFHVGGGVEDNVDYAAPVHE 80 (137) T ss_pred CCeeEEEeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeeccccceEEEEEecCCCceeeeee Confidence 66665433 2122211 111 122222211 222333221 00 011 1255789999999 Q ss_pred CCc------------------------CCCCC---CcchhhHHHHHH---HHHHHHH Q lcl|NC_020079. 55 GNE------------------------TNNIP---ARPFITDGAVIS---QNNIAKK 81 (162) Q Consensus 55 G~~------------------------~~~IP---~RpFlr~~~~~~---~~~~~~~ 81 (162) |+. .+++| +||||++++++. ...+.-. T Consensus 81 GT~ph~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~~~~~~ri~~~ 137 (137) T protein:vir:10 81 GSRPHRITARHANALHFFWHGREVFRKSVWHPGVRPRPFLRNAARRVVAADPDIHMT 137 (137) T ss_pred cCCCceeecccCceeeeeeCCceEEeeeeecCCCCCCchHHHHHHHHhhccccccCC Confidence 962 12345 999999999874 2333211 No 136 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=83.38 E-value=0.0068 Score=32.47 Aligned_cols=66 Identities=12% Similarity=0.037 Sum_probs=25.7 Q ss_pred hhH-HHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHH Q lcl|NC_020079. 67 ITD-GAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTA 145 (162) Q Consensus 67 lr~-~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG 145 (162) |-. =|+.+.+.+.+.+...+. +.+..++...+.. .|.... + ...| +||| T Consensus 1 ~~~~~f~~~~~~~~~~~~k~~~---------~~~~~~a~~~~~~----------------~ie~~a-k---~~~p-vdtG 50 (141) T protein:vir:78 1 MNEFEFDSNIPKARKLIEKKVL---------QALEDIGEHMTTE----------------LAEGGH-G---VTSN-NDTG 50 (141) T ss_pred CcchhHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHH----------------HHHHhh-h---hccc-cccc Confidence 100 011122222222221111 1122221111111 111111 1 2345 8999 Q ss_pred HHHhhceeeeecCCCCC Q lcl|NC_020079. 146 HMINAIETKITKSKSKK 162 (162) Q Consensus 146 ~l~~SIty~V~~~~~~~ 162 (162) .|++||+|+|...+.+= T Consensus 51 ~L~~SI~~~v~~~g~~~ 67 (141) T protein:vir:78 51 EYAQKSGYKVRKSSKEV 67 (141) T ss_pred hhhcceeeeeecCCcEE Confidence 99999999987655432 No 137 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=81.91 E-value=0.019 Score=30.05 Aligned_cols=91 Identities=8% Similarity=0.090 Sum_probs=57.3 Q ss_pred CCC-cccccchhHHHHHHHHHHHhhCC--------------------------eEEEeec-----CCCC-CCCCC----- Q lcl|NC_020079. 1 MES-EILPGDDTDWETIIKKMMDLEQV--------------------------QIEAGFL-----TNRR-HPESD----- 42 (162) Q Consensus 1 M~~-~i~~~~~~~l~~l~~~l~~l~~~--------------------------~v~VGi~-----~~~~-~~d~g----- 42 (162) |+. +|.. .+|+++.+.|+++... -|.-|-+ .+.. +.+++ T Consensus 1 Ms~~~id~---~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVdTG~Lr~S~~~~~~~~~~~~~~~~V 77 (144) T protein:vir:10 1 MSLGHVDD---AQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVKQGNLRRSWTAEGPTYGCGGWTIKL 77 (144) T ss_pred CCCCCccH---HHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCCcchhccceeecceeeecCeeEEEE Confidence 664 4443 3466666666543221 1122222 1111 11222 Q ss_pred CCHHHHHHHHhcCCcC---------------CCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccch Q lcl|NC_020079. 43 LTIPAIAAIQQYGNET---------------NNIPARPFITDGAVISQNNIAKKMKQVFANYLMHNV 94 (162) Q Consensus 43 ~~~A~iA~~~E~G~~~---------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~ 94 (162) .+.+.+|-+-|||.-. ..+|.++||+.++.+.+..+.+.+++.+..++.=.. T Consensus 78 ~n~~~YA~~VE~Ghr~~~G~~v~~~~~~~~~g~V~G~~~~~~a~~~~~~~~~~~l~k~l~~l~d~~~ 144 (144) T protein:vir:10 78 INNAEYASYVESGHRQTPGRYVPVLKKRLVRDWVPGQFYMKKSIPQIQRQLPQLVTEGLWGLKDLFE 144 (144) T ss_pred ecCCCcccccccceeecCCcccccCCCccccceecCccchHHHHHHHHHHHHHHHHHHHHHHhhhcC Confidence 2668889999999632 247899999999999999999999999988764444 No 138 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=81.49 E-value=0.021 Score=29.79 Aligned_cols=94 Identities=18% Similarity=0.237 Sum_probs=48.1 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCe-------------------------EEEeecC-----CC------CC-CCCC- Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQ-------------------------IEAGFLT-----NR------RH-PESD- 42 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~-------------------------v~VGi~~-----~~------~~-~d~g- 42 (162) |++--. -|.++|+++.+.|+.+.... |.-|-+- +. .+ .+++ T Consensus 1 M~~~~~-~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~g~~~ 79 (141) T protein:vir:79 1 MARWGS-VDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQGNNY 79 (141) T ss_pred CCCCcc-CcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccceeecCCee Confidence 766322 23345666666665443321 1112221 10 01 1111 Q ss_pred ----CCHHHHHHHHhcCCcCC----CCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccc-hH Q lcl|NC_020079. 43 ----LTIPAIAAIQQYGNETN----NIPARPFITDGAVISQNNIAKKMKQVFANYLMHN-VG 95 (162) Q Consensus 43 ----~~~A~iA~~~E~G~~~~----~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~-~~ 95 (162) .+++.+|-.-|||.-.. -+|++.+|+.++.+.+..+.+.+++.+.++|.+- ++ T Consensus 80 ~v~v~n~~~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~~~~~~ 141 (141) T protein:vir:79 80 IIEVVNPTEYASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLKGVFDA 141 (141) T ss_pred EEEEecCCcchhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 25578999999996322 1344445666667777777766666666655432 11 No 139 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=81.03 E-value=0.011 Score=31.35 Aligned_cols=82 Identities=11% Similarity=0.020 Sum_probs=51.5 Q ss_pred CCCcccccchhHHHHHHHH----HHH-------------------hhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKK----MMD-------------------LEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE 57 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~----l~~-------------------l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~ 57 (162) |.--....=...-+.+.+. +.. -..+.|+|||.+ .+| .|--+||||. T Consensus 32 ~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~~~~G~r~V~igW~G-pR~--------~ivHLNE~Gy- 101 (138) T protein:vir:98 32 VNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGFTT-PRW--------NIVHLQELEY- 101 (138) T ss_pred hhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCeeecCCceEEEEeeec-Cee--------eEEeeecccc- Confidence 3222111100111112222 210 013578889864 344 4667899998 Q ss_pred CCCCCCcc--hhhHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_020079. 58 TNNIPARP--FITDGAVISQNNIAKKMKQVFANYLMH 92 (162) Q Consensus 58 ~~~IP~Rp--Flr~~~~~~~~~~~~~~~~~~~~~l~G 92 (162) ...|-||- +++.+++..+..+.+.++..++..|+| T Consensus 102 Gk~i~PrG~G~I~ka~~~se~~y~~~vk~el~k~l~~ 138 (138) T protein:vir:98 102 GWKHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFDG 138 (138) T ss_pred cCCcCCCcchHHHHHHHhhhHHHHHHHHHHHHHHhcC Confidence 45676776 699999999999999999999999999 No 140 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=79.60 E-value=0.027 Score=29.16 Aligned_cols=65 Identities=14% Similarity=0.035 Sum_probs=29.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHHHHh Q lcl|NC_020079. 70 GAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAHMIN 149 (162) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~~ 149 (162) ---+..+++.+.|++.-..+ .....++|...+..+++.++. ..| +|||.|++ T Consensus 1 i~i~Gld~L~~~L~~l~~~~--~~~~~~a~~~~a~~i~~~ak~-------------------------~aP-v~TG~Lr~ 52 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDI--DKNINATTEEAANFIEDRAKT-------------------------LAP-KNFGKLAQ 52 (173) T ss_pred CcchhHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH-------------------------hCC-cCchhhhh Confidence 01122344444444432221 111233343333333333322 223 68999999 Q ss_pred hceeeeecCCCC-C Q lcl|NC_020079. 150 AIETKITKSKSK-K 162 (162) Q Consensus 150 SIty~V~~~~~~-~ 162 (162) ||.+.+..+++. . T Consensus 53 sI~~~~~~~~~~~~ 66 (173) T protein:vir:10 53 SISTSDLKAKDLIS 66 (173) T ss_pred cceeeeeccCceeE Confidence 999887655432 2 No 141 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=79.28 E-value=0.009 Score=31.79 Aligned_cols=74 Identities=11% Similarity=0.197 Sum_probs=42.2 Q ss_pred CCCcccccchhHHHHHHHHHHHh-------------------------h--------------------------CCeEE Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDL-------------------------E--------------------------QVQIE 29 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l-------------------------~--------------------------~~~v~ 29 (162) |++.|+-. ++|++.|++. . .+.|+ T Consensus 1 MsvevkGv-----~eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~~G~r~V~ 75 (134) T protein:vir:10 1 MSVKVTGD-----KALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWIKGKRTVT 75 (134) T ss_pred CeEEeecH-----HHHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeecCCceEEE Confidence 66666522 2222222221 0 02466 Q ss_pred EeecCC-CCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhH--------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 30 AGFLTN-RRHPESDLTIPAIAAIQQYGNETNNIPARPFITD--------GAVISQNNIAKKMKQVFANY 89 (162) Q Consensus 30 VGi~~~-~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~--------~~~~~~~~~~~~~~~~~~~~ 89 (162) |||-+. .+| -|--+||||.. .-...+|++| +++..+..+.+.++..++.. T Consensus 76 vgW~G~~~R~--------~ivHLnE~Gyt--~~r~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 76 IRWRGPFERF--------RIVHLIENGHV--EKKSGKFVKPKAMGGINRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred EEEEcCCcee--------eEEEeeeccee--ecCCCCeeccchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 666332 122 34567899963 1134556666 88888888888888888885 No 142 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=79.18 E-value=0.0051 Score=33.16 Aligned_cols=85 Identities=8% Similarity=0.081 Sum_probs=50.6 Q ss_pred CCC--cccccchhHHHHHHHHHHHhhCCeEEEeec---CCCC-------CCCCCC--------CHHHHHHHHhcCCcCC- Q lcl|NC_020079. 1 MES--EILPGDDTDWETIIKKMMDLEQVQIEAGFL---TNRR-------HPESDL--------TIPAIAAIQQYGNETN- 59 (162) Q Consensus 1 M~~--~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~---~~~~-------~~d~g~--------~~A~iA~~~E~G~~~~- 59 (162) |.- +|..++ ..+|..+++...+..|.+=+. -|.. -.++|+ ..+++|-..|||+--. T Consensus 16 ~~dvk~VVkkN---~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g~~~~vgp~g~t~dYapyvEyGTR~m~ 92 (127) T protein:vir:98 16 EKRWDRVANKN---LTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNSSKDVITGNFGYIKDYAPHVEYGHRIVR 92 (127) T ss_pred HHHHHHHHhhh---hHHHHHHHHhccCCceeccccccCcccceeeeEEEEecCCceEEeccCcccccccceeecceeeee Confidence 221 133333 455666665554332211111 0100 011222 2478899999997422 Q ss_pred ------CCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 60 ------NIPARPFITDGAVISQNNIAKKMKQVFAN 88 (162) Q Consensus 60 ------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (162) -.|+-|||.|+|+..+..|.+-++..++. T Consensus 93 ~~~~~gf~~aqp~l~paf~~Qk~iF~~DL~~l~k~ 127 (127) T protein:vir:98 93 NGKQVGYANGTKYLFNNVKKQREIYRQDMLNELRR 127 (127) T ss_pred cccccccccCccccccchHHHhHHHHHHHHHHhcC Confidence 27899999999999999999999998887 No 143 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=79.02 E-value=0.014 Score=30.75 Aligned_cols=71 Identities=10% Similarity=-0.014 Sum_probs=28.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHHHH Q lcl|NC_020079. 69 DGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAHMI 148 (162) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~ 148 (162) -. -+..+++.+.+++.-..+ .+....++..-|..++..++.. ++ ..++.| +|||.|+ T Consensus 1 i~-~~Gld~l~~~l~~~~~~~--~~~v~~a~~~~~~~i~~~a~~~---------a~----------~~~~~p-~~TG~Lr 57 (115) T protein:vir:93 1 MN-IDGLDALLNQFHDMKTNI--DDDVDDILQENAKEYVVRAKLK---------AR----------EVMNKG-YWTGNLS 57 (115) T ss_pred Cc-chhHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHh---------cc----------ccCCCC-CCchhhh Confidence 00 011222222222221111 0112334444444444444332 11 112334 8999999 Q ss_pred hhceeeeecCCCCC Q lcl|NC_020079. 149 NAIETKITKSKSKK 162 (162) Q Consensus 149 ~SIty~V~~~~~~~ 162 (162) +||++...++..-. T Consensus 58 ~sI~~~~~g~~~~~ 71 (115) T protein:vir:93 58 RNIRYKKTGDLQYT 71 (115) T ss_pred hcceeeecCceEEE Confidence 99998754221111 No 144 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=79.02 E-value=0.014 Score=30.75 Aligned_cols=71 Identities=10% Similarity=-0.014 Sum_probs=28.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHHHH Q lcl|NC_020079. 69 DGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAHMI 148 (162) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~ 148 (162) -. -+..+++.+.+++.-..+ .+....++..-|..++..++.. ++ ..++.| +|||.|+ T Consensus 1 i~-~~Gld~l~~~l~~~~~~~--~~~v~~a~~~~~~~i~~~a~~~---------a~----------~~~~~p-~~TG~Lr 57 (115) T protein:vir:96 1 MN-IDGLDALLNQFHDMKTNI--DDDVDDILQENAKEYVVRAKLK---------AR----------EVMNKG-YWTGNLS 57 (115) T ss_pred Cc-chhHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHh---------cc----------ccCCCC-CCchhhh Confidence 00 011222222222221111 0112334444444444444332 11 112334 8999999 Q ss_pred hhceeeeecCCCCC Q lcl|NC_020079. 149 NAIETKITKSKSKK 162 (162) Q Consensus 149 ~SIty~V~~~~~~~ 162 (162) +||++...++..-. T Consensus 58 ~sI~~~~~g~~~~~ 71 (115) T protein:vir:96 58 RNIRYKKTGDLQYT 71 (115) T ss_pred hcceeeecCceEEE Confidence 99998754221111 No 145 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=79.02 E-value=0.014 Score=30.75 Aligned_cols=71 Identities=10% Similarity=-0.014 Sum_probs=28.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHHHH Q lcl|NC_020079. 69 DGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAHMI 148 (162) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~ 148 (162) -. -+..+++.+.+++.-..+ .+....++..-|..++..++.. ++ ..++.| +|||.|+ T Consensus 1 i~-~~Gld~l~~~l~~~~~~~--~~~v~~a~~~~~~~i~~~a~~~---------a~----------~~~~~p-~~TG~Lr 57 (115) T protein:vir:78 1 MN-IDGLDALLNQFHDMKTNI--DDDVDDILQENAKEYVVRAKLK---------AR----------EVMNKG-YWTGNLS 57 (115) T ss_pred Cc-chhHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHh---------cc----------ccCCCC-CCchhhh Confidence 00 011222222222221111 0112334444444444444332 11 112334 8999999 Q ss_pred hhceeeeecCCCCC Q lcl|NC_020079. 149 NAIETKITKSKSKK 162 (162) Q Consensus 149 ~SIty~V~~~~~~~ 162 (162) +||++...++..-. T Consensus 58 ~sI~~~~~g~~~~~ 71 (115) T protein:vir:78 58 RNIRYKKTGDLQYT 71 (115) T ss_pred hcceeeecCceEEE Confidence 99998754221111 No 146 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=79.02 E-value=0.014 Score=30.75 Aligned_cols=71 Identities=10% Similarity=-0.014 Sum_probs=28.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHHHH Q lcl|NC_020079. 69 DGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAHMI 148 (162) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~ 148 (162) -. -+..+++.+.+++.-..+ .+....++..-|..++..++.. ++ ..++.| +|||.|+ T Consensus 1 i~-~~Gld~l~~~l~~~~~~~--~~~v~~a~~~~~~~i~~~a~~~---------a~----------~~~~~p-~~TG~Lr 57 (115) T protein:vir:97 1 MN-IDGLDALLNQFHDMKTNI--DDDVDDILQENAKEYVVRAKLK---------AR----------EVMNKG-YWTGNLS 57 (115) T ss_pred Cc-chhHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHh---------cc----------ccCCCC-CCchhhh Confidence 00 011222222222221111 0112334444444444444332 11 112334 8999999 Q ss_pred hhceeeeecCCCCC Q lcl|NC_020079. 149 NAIETKITKSKSKK 162 (162) Q Consensus 149 ~SIty~V~~~~~~~ 162 (162) +||++...++..-. T Consensus 58 ~sI~~~~~g~~~~~ 71 (115) T protein:vir:97 58 RNIRYKKTGDLQYT 71 (115) T ss_pred hcceeeecCceEEE Confidence 99998754221111 No 147 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=79.02 E-value=0.014 Score=30.75 Aligned_cols=71 Identities=10% Similarity=-0.014 Sum_probs=28.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHHHH Q lcl|NC_020079. 69 DGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAHMI 148 (162) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~ 148 (162) -. -+..+++.+.+++.-..+ .+....++..-|..++..++.. ++ ..++.| +|||.|+ T Consensus 1 i~-~~Gld~l~~~l~~~~~~~--~~~v~~a~~~~~~~i~~~a~~~---------a~----------~~~~~p-~~TG~Lr 57 (115) T protein:vir:10 1 MN-IDGLDALLNQFHDMKTNI--DDDVDDILQENAKEYVVRAKLK---------AR----------EVMNKG-YWTGNLS 57 (115) T ss_pred Cc-chhHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHh---------cc----------ccCCCC-CCchhhh Confidence 00 011222222222221111 0112334444444444444332 11 112334 8999999 Q ss_pred hhceeeeecCCCCC Q lcl|NC_020079. 149 NAIETKITKSKSKK 162 (162) Q Consensus 149 ~SIty~V~~~~~~~ 162 (162) +||++...++..-. T Consensus 58 ~sI~~~~~g~~~~~ 71 (115) T protein:vir:10 58 RNIRYKKTGDLQYT 71 (115) T ss_pred hcceeeecCceEEE Confidence 99998754221111 No 148 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=79.02 E-value=0.014 Score=30.75 Aligned_cols=71 Identities=10% Similarity=-0.014 Sum_probs=28.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHHHH Q lcl|NC_020079. 69 DGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAHMI 148 (162) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~ 148 (162) -. -+..+++.+.+++.-..+ .+....++..-|..++..++.. ++ ..++.| +|||.|+ T Consensus 1 i~-~~Gld~l~~~l~~~~~~~--~~~v~~a~~~~~~~i~~~a~~~---------a~----------~~~~~p-~~TG~Lr 57 (115) T protein:vir:96 1 MN-IDGLDALLNQFHDMKTNI--DDDVDDILQENAKEYVVRAKLK---------AR----------EVMNKG-YWTGNLS 57 (115) T ss_pred Cc-chhHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHh---------cc----------ccCCCC-CCchhhh Confidence 00 011222222222221111 0112334444444444444332 11 112334 8999999 Q ss_pred hhceeeeecCCCCC Q lcl|NC_020079. 149 NAIETKITKSKSKK 162 (162) Q Consensus 149 ~SIty~V~~~~~~~ 162 (162) +||++...++..-. T Consensus 58 ~sI~~~~~g~~~~~ 71 (115) T protein:vir:96 58 RNIRYKKTGDLQYT 71 (115) T ss_pred hcceeeecCceEEE Confidence 99998754221111 No 149 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=78.84 E-value=0.018 Score=30.17 Aligned_cols=62 Identities=6% Similarity=-0.011 Sum_probs=27.3 Q ss_pred CCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCC Q lcl|NC_020079. 60 NIPARPFITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNH 139 (162) Q Consensus 60 ~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~ 139 (162) -|-+|+-| +...+.+++ |...+++++.++...+... |. +. T Consensus 1 ~~~~~~~l------~~~~l~~~~---------~~~~~~~~~~~a~~ve~~a-------------------k~------~a 40 (137) T protein:vir:10 1 MVAHTLRI------ERAQLHGLG---------MDEARKAVNRVVRRTFTRS-------------------QI------LA 40 (137) T ss_pred Cccccccc------ChhhHhhHH---------HHHHHHHHHHHHHHHHHHH-------------------Hh------cC Confidence 01111111 111111111 1112223333333332221 21 34 Q ss_pred chhhHHHHHhhceeeeecCCCCC Q lcl|NC_020079. 140 ILIDTAHMINAIETKITKSKSKK 162 (162) Q Consensus 140 PLidTG~l~~SIty~V~~~~~~~ 162 (162) | +|||+|++||++.+.+..|.. T Consensus 41 P-v~TG~Lr~SI~~~~~~~~g~~ 62 (137) T protein:vir:10 41 P-VDTGYLRASGRLVLGRERGAV 62 (137) T ss_pred C-cCchhhhccceeeeeeccccE Confidence 4 899999999999998776665 No 150 >protein:vir:78755 Length: 228 # NCBI annotation: putative tail completion protein # Family: family:all:743 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285651;genbank:gi:148727157;genbank:GeneID:5220102 Probab=78.12 E-value=0.028 Score=29.11 Aligned_cols=93 Identities=16% Similarity=0.092 Sum_probs=45.7 Q ss_pred CCCcccccchhHHHHHHHHHH--HhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc--------------------- Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMM--DLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE--------------------- 57 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~--~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~--------------------- 57 (162) ..+.=..+. ..+..|-+.|+ +.....+.|||..+.. ...++.||++|.||-. T Consensus 55 ~~pRKr~kr-KMl~~L~k~Lk~~~~~~~~a~v~f~~~~~----~~~~~rIA~vHq~G~~~~v~~~~~~~~~~~r~~~~~p 129 (228) T protein:vir:78 55 WAPRKRGKR-KMLRGLPKLLQIREPRQDMAELGFTKGTM----SAHAGVIANTHQKGHTYKVTAASRRRIAPSDVGKNKQ 129 (228) T ss_pred ChhhhhhHH-HHHhhhHHhhhhhcccccceEEEeecCcc----cchHHHHHHHHhcCcccccccchhhhhhcccCCCCCC Confidence 222211110 01223333332 2233568899865432 1357899999999921 Q ss_pred ---------------------------------------------------------CCCCCCcchhhHHHHHHHHHHHH Q lcl|NC_020079. 58 ---------------------------------------------------------TNNIPARPFITDGAVISQNNIAK 80 (162) Q Consensus 58 ---------------------------------------------------------~~~IP~RpFlr~~~~~~~~~~~~ 80 (162) ++.+|+||||-.+-+ ++.+ T Consensus 130 aTr~QAk~Lr~lGy~~~~~~~k~~rkps~kwI~~nls~gqAgliir~L~~k~~k~~W~I~~PaR~FLG~s~~----e~~~ 205 (228) T protein:vir:78 130 ASKAQARKLRELGFKRPGKRKRAYRSASLGWITANLNYAQAGLLIKKLKDEPVKESWEIQLPARPFLGANAR----QRQQ 205 (228) T ss_pred CCHHHHHHHHHhhccccCCcCCCcccCCHHHHHHHhhHHHHHHHHHHHhCCCCccceeeecCcccccCCCHH----HHHH Confidence 123688888844333 4555 Q ss_pred HHHHHHHHHhccchHH-HHHHHHHH Q lcl|NC_020079. 81 KMKQVFANYLMHNVGL-AVFEPIAR 104 (162) Q Consensus 81 ~~~~~~~~~l~G~~~~-~~l~~iG~ 104 (162) ++...+.++-.|.++. |=+. |. T Consensus 206 ~l~~~l~~i~~g~~~~~qd~~--~~ 228 (228) T protein:vir:78 206 AFALRPESIDYGWDVNKQDMK--GK 228 (228) T ss_pred HHHHHHHhcccCCCcchhhcc--CC Confidence 5555566655565541 1111 01 No 151 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=77.54 E-value=0.024 Score=29.46 Aligned_cols=92 Identities=12% Similarity=0.215 Sum_probs=52.2 Q ss_pred CCCcccccchhHHHHHHHHHHHhhC------------------------------------------------------- Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQ------------------------------------------------------- 25 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~------------------------------------------------------- 25 (162) |+.++--.+ |+++.+.|..+.. T Consensus 1 m~~~~d~~~---l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~ 77 (163) T protein:vir:10 1 MSGGFDYRS---FAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHG 77 (163) T ss_pred CCCccCHHH---HHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhccccc Confidence 666654332 3333333322211 Q ss_pred ---CeEEEeecCCCCCCCCC------CCHHHHHHHHhcCCcCC---CCCCcchhhHHHHHHHHHHHHHHHHHHHHHhc-- Q lcl|NC_020079. 26 ---VQIEAGFLTNRRHPESD------LTIPAIAAIQQYGNETN---NIPARPFITDGAVISQNNIAKKMKQVFANYLM-- 91 (162) Q Consensus 26 ---~~v~VGi~~~~~~~d~g------~~~A~iA~~~E~G~~~~---~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~-- 91 (162) ..++=||-.+..+..++ .+.+.+|-+-|||.-.. -+|.+.+|+.+.++.+..+.+.+++.+..++. T Consensus 78 k~tG~lr~swk~~~~~k~~~~~~v~v~N~~~YA~~VE~GHR~~~gGfV~G~fml~~s~~~~~~~~~~~~e~~l~~~l~k~ 157 (163) T protein:vir:10 78 KQGGTLQKGWSKSRIEVSGRTYKQKVYNKVYYAPHVEYGHKTVNGGFVPGQFFLHKTVEDTKSDMEKRVRDKYDGFMRKV 157 (163) T ss_pred cccchhhccceecceeecCCceEEEEEecCCccchhhcceeecCCceeccchhhHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 11222222222211121 25678899999996443 38999999999999998888877777666554 Q ss_pred --cchH Q lcl|NC_020079. 92 --HNVG 95 (162) Q Consensus 92 --G~~~ 95 (162) |+.- T Consensus 158 ~~~~~~ 163 (163) T protein:vir:10 158 VLGNGK 163 (163) T ss_pred hcCCCC Confidence 4332 No 152 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=77.13 E-value=0.0041 Score=33.70 Aligned_cols=74 Identities=15% Similarity=0.179 Sum_probs=34.7 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCe--EEEeecCC-----------CCCCCCC-CCHHHHHHHHhcCCcC-------- Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQ--IEAGFLTN-----------RRHPESD-LTIPAIAAIQQYGNET-------- 58 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~--v~VGi~~~-----------~~~~d~g-~~~A~iA~~~E~G~~~-------- 58 (162) |+.-+ .+....+....+.. |.-|-+.. ..+-... .+.+.+|.++|||+.. T Consensus 22 ~r~~l--------~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~V~~~~~YA~~ve~GT~ph~I~Pk~~ 93 (137) T protein:vir:10 22 ARRRL--------SRITRGTANQARADVPVKTGNLGRSIREDPIVVAGPLRLDSGVTAHADYARYVHDGTRAHVIRPRRP 93 (137) T ss_pred HHHHH--------HHHHHHHHHHHHhcCCccchhhhcCceeeeeeccccceEEEEecCCCccceeeecCCCCceeecccc Confidence 22222 22222222211110 11111110 0000000 1447889999999621 Q ss_pred -----------------C---CCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_020079. 59 -----------------N---NIPARPFITDGAVISQNNIAKKM 82 (162) Q Consensus 59 -----------------~---~IP~RpFlr~~~~~~~~~~~~~~ 82 (162) + .+|+||||+++++++..+-...- T Consensus 94 k~~l~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~~~~~~~~~ 137 (137) T protein:vir:10 94 GGVLRFTVGGRVVYARRVNHPGTRARPFLRNAAERVVARETATS 137 (137) T ss_pred ceeeeEeeCCeeEecceeecCCCCCCchHHHHHHHhhhhhcccC Confidence 1 25699999999999987655443 No 153 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=76.78 E-value=0.024 Score=29.45 Aligned_cols=63 Identities=16% Similarity=0.163 Sum_probs=32.7 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHH Q lcl|NC_020079. 66 FITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTA 145 (162) Q Consensus 66 Flr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG 145 (162) +=-.++ +++.+.++... .....+.+|...+..+++.++. ..| +||| T Consensus 1 i~i~Gl----d~l~~~l~~~~----~~~~~~~al~~~a~~i~~~ak~-------------------------~ap-vdTG 46 (108) T protein:vir:98 1 MKITGI----DALQKKLRKNA----TLNDVKHVVKRNTVSMNKNMQN-------------------------LAP-VDTG 46 (108) T ss_pred CcchhH----HHHHHHHHHhh----hHHHHHHHHHHHHHHHHHHHHH-------------------------hCC-CCch Confidence 111122 33333333221 1122345666666666655542 123 6999 Q ss_pred HHHhhceeeeecCCCCC Q lcl|NC_020079. 146 HMINAIETKITKSKSKK 162 (162) Q Consensus 146 ~l~~SIty~V~~~~~~~ 162 (162) .|++||+..+.+++.+- T Consensus 47 ~Lr~si~~~~~~~~~~~ 63 (108) T protein:vir:98 47 NMKRSITSEFTDGGLTG 63 (108) T ss_pred hhHhhceeeeecCceEE Confidence 99999998876655332 No 154 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=76.74 E-value=0.028 Score=29.07 Aligned_cols=79 Identities=6% Similarity=0.032 Sum_probs=34.9 Q ss_pred EEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHH Q lcl|NC_020079. 28 IEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASR 107 (162) Q Consensus 28 v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~ 107 (162) ++ +|-|= -.|-.|-.. ....+++.+.+++.-.++ .....++|+..+..++ T Consensus 1 ~~---------------------~~~~~------~~~~~Ma~v-~~Gld~l~~~l~~~~~~~--~~~~~~~l~~~a~~v~ 50 (149) T protein:vir:10 1 MK---------------------LNYYD------LSRCHMAKV-KYGADSMVVELDKFDKKI--EEWVKKGIAKTTTKIY 50 (149) T ss_pred Ce---------------------eeeec------cchhhhHHH-HHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHH Confidence 11 11111 123333221 112234444444333322 1122344555555444 Q ss_pred HHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHHHHhhceeeeecCCCCC Q lcl|NC_020079. 108 EGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAHMINAIETKITKSKSKK 162 (162) Q Consensus 108 ~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~~SIty~V~~~~~~~ 162 (162) +.++. ..| +|||.|++||+++|..++-+- T Consensus 51 ~~ak~-------------------------~aP-vdTG~L~~SI~~~~~~~g~~~ 79 (149) T protein:vir:10 51 NTAVA-------------------------LAP-VDLGFLEESIDFKYFDGGLSS 79 (149) T ss_pred HHHHH-------------------------hCC-cccchhhccceEEecCCcEEE Confidence 44331 233 689999999999886553211 No 155 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=75.62 E-value=0.02 Score=29.86 Aligned_cols=82 Identities=15% Similarity=0.088 Sum_probs=49.9 Q ss_pred CCCccccc-ch-hHHHHHHH-HHH-HhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcc--hhhHHHHHH Q lcl|NC_020079. 1 MESEILPG-DD-TDWETIIK-KMM-DLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARP--FITDGAVIS 74 (162) Q Consensus 1 M~~~i~~~-~~-~~l~~l~~-~l~-~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~Rp--Flr~~~~~~ 74 (162) ++..+..- |. ...+++.. ... .-.-+.|+|||.. .+| .|--+||||. ...|-||- +++.+++.. T Consensus 45 lK~~~~~f~DTG~t~dev~~s~~~~~~G~r~V~VgW~G-pR~--------~ivHLNE~Gy-Gk~~~PrG~G~I~~a~~~s 114 (132) T protein:vir:96 45 FKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGFTT-PRW--------NIVHLQELEY-GWKHNRRGVGVIRRYSDIL 114 (132) T ss_pred HHHhhhhhhhcchhhcceeecCeeecCCceEEEecccC-Cce--------eEEeeecccc-cCCcCCCcchHHHHHHHhh Confidence 11111110 00 00111100 000 0123579999953 344 4667899998 45676676 699999999 Q ss_pred HHHHHHHHHHHHHHHhcc Q lcl|NC_020079. 75 QNNIAKKMKQVFANYLMH 92 (162) Q Consensus 75 ~~~~~~~~~~~~~~~l~G 92 (162) +..+.+.++..+++.|.| T Consensus 115 e~~~~~~~~~elkk~l~~ 132 (132) T protein:vir:96 115 ETIYPRGIRDKLKRGFDG 132 (132) T ss_pred hhHHHHHHHHHHHHHhcC Confidence 999999999999999999 No 156 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=67.54 E-value=0.025 Score=29.32 Aligned_cols=86 Identities=15% Similarity=0.167 Sum_probs=47.4 Q ss_pred CC------------------Ccccccchh----HHHHHHHHHHHhhC---CeEEEeecCCCCCCCCCCCHHHHHHHHhcC Q lcl|NC_020079. 1 ME------------------SEILPGDDT----DWETIIKKMMDLEQ---VQIEAGFLTNRRHPESDLTIPAIAAIQQYG 55 (162) Q Consensus 1 M~------------------~~i~~~~~~----~l~~l~~~l~~l~~---~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G 55 (162) |+ +-..++... .--+|-..|+--+. ..|++|=- -.+ -+|.+-+|| T Consensus 33 lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aaT~raa~VrAG~~---------krV-PYA~~I~~G 102 (143) T protein:vir:62 33 VREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTASAKGAVIKAGSA---------SRV-PYAAAIHFG 102 (143) T ss_pred HHHHHHHHHHHHHHHHHhhcCCcccccccccccCcchhhccccccccccceeeeeCCc---------CCC-CcccccccC Confidence 11 111000000 00111122211111 12222210 122 246778999 Q ss_pred CcCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHH Q lcl|NC_020079. 56 NETNNIPARPFITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEP 101 (162) Q Consensus 56 ~~~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~ 101 (162) ++..+|-++-||..++....+.|.+..++.++++|+-. |+. T Consensus 103 ~r~r~Isp~rFl~~a~a~te~~~~r~Ye~~i~~vl~k~-----l~s 143 (143) T protein:vir:62 103 YRARNISPNRFLFRAMARKSDVVAATYERRIAAVVEKY-----LES 143 (143) T ss_pred cccccccchhhhhhhhhccCHHHHHHHHHHHHHHHHHH-----hcC Confidence 99999999999999999999999999999998887433 221 No 157 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=64.61 E-value=0.033 Score=28.74 Aligned_cols=84 Identities=17% Similarity=0.168 Sum_probs=47.8 Q ss_pred CC------------------------CcccccchhHHHHHHHHHHHhhC---CeEEEeecCCCCCCCCCCCHHHHHHHHh Q lcl|NC_020079. 1 ME------------------------SEILPGDDTDWETIIKKMMDLEQ---VQIEAGFLTNRRHPESDLTIPAIAAIQQ 53 (162) Q Consensus 1 M~------------------------~~i~~~~~~~l~~l~~~l~~l~~---~~v~VGi~~~~~~~d~g~~~A~iA~~~E 53 (162) |+ ++.++.-- --+|-..|+--+. ..|++|= . -.+ -+|.+-+ T Consensus 33 lk~a~~~aa~v~~~~ar~~tP~g~~~p~~srr~r--~G~L~~Sir~aaT~raa~VrAGr----~-----arV-PYA~~I~ 100 (143) T protein:vir:13 33 VREANKASGEVLIPQAKHESPDGHRDPKSSKRYR--PGKLDKSIKVTASAKGAVIKAGS----A-----ARV-PYAAAIH 100 (143) T ss_pred HHHHHHHHHHHHHHHHHhhcCCcccccccccccc--cchhhccccccccccceeeeecC----c-----CCC-Ccccccc Confidence 11 11111000 0112222221111 1233331 1 112 2366789 Q ss_pred cCCcCCCCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccchHHHHHHH Q lcl|NC_020079. 54 YGNETNNIPARPFITDGAVISQNNIAKKMKQVFANYLMHNVGLAVFEP 101 (162) Q Consensus 54 ~G~~~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~l~~ 101 (162) ||++..+|-++-|+..++....+.|.+..++.++++|+-. |+. T Consensus 101 ~G~r~r~Is~~rFl~~a~a~te~~~~r~Ye~~i~~vl~k~-----l~s 143 (143) T protein:vir:13 101 FGYRKRNISANRFLYRAMARKSDVVAATYERRIAAVVEKY-----LES 143 (143) T ss_pred cCCcccccchhhhhhhhhhccCHHHHHHHHHHHHHHHHHH-----hcC Confidence 9999999999999999999999999999999999887433 221 No 158 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=63.97 E-value=0.035 Score=28.58 Aligned_cols=90 Identities=18% Similarity=0.225 Sum_probs=47.0 Q ss_pred CCCcccccchhH-HHHHH------------HHHHHhhCC---eE----------EEeecC-----CCCCCCCC---CCHH Q lcl|NC_020079. 1 MESEILPGDDTD-WETII------------KKMMDLEQV---QI----------EAGFLT-----NRRHPESD---LTIP 46 (162) Q Consensus 1 M~~~i~~~~~~~-l~~l~------------~~l~~l~~~---~v----------~VGi~~-----~~~~~d~g---~~~A 46 (162) |.. |.+++... +.+-| +.+.+..+. .| +-|-+. .....+.- -..- T Consensus 1 M~~-i~id~La~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~~~~~v~nk~~y 79 (127) T protein:vir:80 1 MAN-IKIDRLGDEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTPGGWVIHNKTEY 79 (127) T ss_pred Ccc-ccHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeeccCceeEeecCCc Confidence 765 66665421 21111 111111110 11 122111 11111000 0123 Q ss_pred HHHHHHhcCCcCCC---CCCcchhhHHHHHHHHHHHHHHHHHHHHHhccch Q lcl|NC_020079. 47 AIAAIQQYGNETNN---IPARPFITDGAVISQNNIAKKMKQVFANYLMHNV 94 (162) Q Consensus 47 ~iA~~~E~G~~~~~---IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~ 94 (162) +++-+.|||.-..+ +++||||+|+.+....++.+.++..+.. |.. T Consensus 80 qLtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~---~~~ 127 (127) T protein:vir:80 80 RLAHLLEYGHATVDGGRVPETPHIRPVEDWLEKEFEDRVERAIKN---ESR 127 (127) T ss_pred ceeehhhcceeccCCcccCCccchhhHHHHHHHHHHHHHHHHhcC---CCC Confidence 67889999965443 7999999999998888888888877776 333 No 159 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=63.23 E-value=0.06 Score=27.26 Aligned_cols=75 Identities=17% Similarity=0.266 Sum_probs=43.1 Q ss_pred CCCcccccchhHHHHHHHHHHHh--------------------------------------------------hC---Ce Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDL--------------------------------------------------EQ---VQ 27 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l--------------------------------------------------~~---~~ 27 (162) |++.++. +++|++.|+.. .+ +. T Consensus 1 msvevkG-----v~eilk~le~k~G~~~~~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rt 75 (133) T protein:vir:93 1 MSVEIKG-----IPEVLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERA 75 (133) T ss_pred CeEEEec-----HHHHHHHHHHhhCHhhhHhhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceE Confidence 6666552 22333222211 01 23 Q ss_pred EEEeecCC-CCCCCCCCCHHHHHHHHhcCCcCC--CCCCcch--hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 28 IEAGFLTN-RRHPESDLTIPAIAAIQQYGNETN--NIPARPF--ITDGAVISQNNIAKKMKQVFAN 88 (162) Q Consensus 28 v~VGi~~~-~~~~d~g~~~A~iA~~~E~G~~~~--~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 88 (162) |+|||.+. .+| -|--+||||.... .|-||-| ++.+++..+..+.+.++..++. T Consensus 76 V~i~W~gp~~R~--------~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 76 VLIEWVGPMNRK--------NIIHLNEHGYTRDGKKYTPRGFGVIAKTLAANERKYREIIKKELAR 133 (133) T ss_pred EEEEeecCCCce--------eEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 56666442 222 3556789996322 2557775 8888888888888888877776 No 160 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=58.89 E-value=0.11 Score=25.94 Aligned_cols=76 Identities=13% Similarity=0.199 Sum_probs=42.1 Q ss_pred CCCcccccchhHHHHHHHHHHHh---------------------------------------------------hCCeEE Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDL---------------------------------------------------EQVQIE 29 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l---------------------------------------------------~~~~v~ 29 (162) |++.++- +++|++.|++. ..+.|+ T Consensus 1 msvevkG-----v~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~ 75 (134) T protein:vir:10 1 MSVKVIG-----DKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTIT 75 (134) T ss_pred CeEEEec-----HHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEE Confidence 5555552 22222222111 012477 Q ss_pred EeecCC-CCCCCCCCCHHHHHHHHhcCCcCC----CCCCcc--hhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 30 AGFLTN-RRHPESDLTIPAIAAIQQYGNETN----NIPARP--FITDGAVISQNNIAKKMKQVFANY 89 (162) Q Consensus 30 VGi~~~-~~~~d~g~~~A~iA~~~E~G~~~~----~IP~Rp--Flr~~~~~~~~~~~~~~~~~~~~~ 89 (162) |||-+. .+| -|--+||||.... .|-||- -++.+++..+..+.+.++..++.. T Consensus 76 vgW~G~~~R~--------~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 76 VHWRGSKDRY--------KIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred EEEEcCCcee--------EEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 777432 222 3567889996331 233333 355588888888888888888885 No 161 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=58.89 E-value=0.11 Score=25.94 Aligned_cols=76 Identities=13% Similarity=0.199 Sum_probs=42.1 Q ss_pred CCCcccccchhHHHHHHHHHHHh---------------------------------------------------hCCeEE Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDL---------------------------------------------------EQVQIE 29 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l---------------------------------------------------~~~~v~ 29 (162) |++.++- +++|++.|++. ..+.|+ T Consensus 1 msvevkG-----v~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~ 75 (134) T protein:vir:95 1 MSVKVIG-----DKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTIT 75 (134) T ss_pred CeEEEec-----HHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEE Confidence 5555552 22222222111 012477 Q ss_pred EeecCC-CCCCCCCCCHHHHHHHHhcCCcCC----CCCCcc--hhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 30 AGFLTN-RRHPESDLTIPAIAAIQQYGNETN----NIPARP--FITDGAVISQNNIAKKMKQVFANY 89 (162) Q Consensus 30 VGi~~~-~~~~d~g~~~A~iA~~~E~G~~~~----~IP~Rp--Flr~~~~~~~~~~~~~~~~~~~~~ 89 (162) |||-+. .+| -|--+||||.... .|-||- -++.+++..+..+.+.++..++.. T Consensus 76 vgW~G~~~R~--------~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:95 76 VHWRGSKDRY--------KIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred EEEEcCCcee--------EEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 777432 222 3567889996331 233333 355588888888888888888885 No 162 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=57.84 E-value=0.081 Score=26.58 Aligned_cols=66 Identities=11% Similarity=-0.066 Sum_probs=27.1 Q ss_pred HHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHHHHhhceeeeecCCCC Q lcl|NC_020079. 82 MKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAHMINAIETKITKSKSK 161 (162) Q Consensus 82 ~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~~SIty~V~~~~~~ 161 (162) |.+.-..+--+-+.+.+-..++..+...++..-.. ..-..|. ..| +|||.|++||++.+.+.++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------v~~~ak~------~aP-vdtG~Lr~SI~~~~~~~~~~ 65 (140) T protein:vir:97 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRR--------IANQSRV------AVP-VRTGNLGRTIGELPQVYTPF 65 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHh------cCC-ccchhhhccceeeeeeCCCc Confidence 11111110001111122222333333333221110 0011122 344 69999999999988877654 Q ss_pred C Q lcl|NC_020079. 162 K 162 (162) Q Consensus 162 ~ 162 (162) - T Consensus 66 ~ 66 (140) T protein:vir:97 66 R 66 (140) T ss_pred e Confidence 4 No 163 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=57.84 E-value=0.081 Score=26.58 Aligned_cols=66 Identities=11% Similarity=-0.066 Sum_probs=27.1 Q ss_pred HHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHHHHhhceeeeecCCCC Q lcl|NC_020079. 82 MKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAHMINAIETKITKSKSK 161 (162) Q Consensus 82 ~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~~SIty~V~~~~~~ 161 (162) |.+.-..+--+-+.+.+-..++..+...++..-.. ..-..|. ..| +|||.|++||++.+.+.++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------v~~~ak~------~aP-vdtG~Lr~SI~~~~~~~~~~ 65 (140) T protein:vir:10 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRR--------IANQSRV------AVP-VRTGNLGRTIGELPQVYTPF 65 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHh------cCC-ccchhhhccceeeeeeCCCc Confidence 11111110001111122222333333333221110 0011122 344 69999999999988877654 Q ss_pred C Q lcl|NC_020079. 162 K 162 (162) Q Consensus 162 ~ 162 (162) - T Consensus 66 ~ 66 (140) T protein:vir:10 66 R 66 (140) T ss_pred e Confidence 4 No 164 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=57.44 E-value=0.16 Score=24.97 Aligned_cols=91 Identities=9% Similarity=0.104 Sum_probs=52.4 Q ss_pred CCCcccccchhHHHHHHHHHHHhhC------CeEEEeecCCCCC--CCCCCCHHHHHHHHhcCCcCC------------- Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQ------VQIEAGFLTNRRH--PESDLTIPAIAAIQQYGNETN------------- 59 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~------~~v~VGi~~~~~~--~d~g~~~A~iA~~~E~G~~~~------------- 59 (162) |..-+...-..-..++++..++..- ..++=+|--+..+ .+.=.+.+.+|-.-|||.-.. T Consensus 1 l~~~~~~~~~~~a~~l~~~vk~rTPv~~~d~G~LR~sW~~g~v~k~~~~v~N~~eYA~~VE~GHRq~~g~g~~~~~~gkr 80 (116) T protein:vir:10 1 MSKNLRRAKNNIGNKLLRKVKPKTPVAKIDGGTARKSWKYKELNLFDGVVSNNVEYIHHLEYGHRTRQGTGTSENYRPKP 80 (116) T ss_pred CchHHHHHHHHHHHHHHHHHHhhCCCCcCCCcccccCceeeeeeccCceeecCCcccccccCCceeeCCcceeccccccc Confidence 3322221110111222222332221 2233333332221 122247789999999996432 Q ss_pred ----CCCCcchhhHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020079. 60 ----NIPARPFITDGAVISQNNIAKKMKQVFANYLM 91 (162) Q Consensus 60 ----~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~ 91 (162) -+|-+-+|+.+..+.+..+.+.+++.+.++++ T Consensus 81 lk~~~V~G~fml~~s~~e~~~~~~~~~~~~~~~~l~ 116 (116) T protein:vir:10 81 NGISFVPGVFMLARSVDEMSSIIDDELNQIIIDFWN 116 (116) T ss_pred ccCCccCceehHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 35777799999999999999999999999988 No 165 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=56.22 E-value=0.046 Score=27.92 Aligned_cols=66 Identities=12% Similarity=0.086 Sum_probs=31.8 Q ss_pred hhHH-HH-HHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhH Q lcl|NC_020079. 67 ITDG-AV-ISQNNIAKKMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDT 144 (162) Q Consensus 67 lr~~-~~-~~~~~~~~~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidT 144 (162) |-.. +. +-.+++.+.++. .....+.++++...|..++..+|. ..| +|| T Consensus 1 Ma~~~i~~~Gld~L~~~L~~----~~~~~~v~~vv~~~~~~l~~~ak~-------------------------~ap-~dT 50 (92) T protein:vir:99 1 MADYSISWDGLDALDEALAN----QQNMNTVKKVVKKHTANLMTATQQ-------------------------AVP-VDT 50 (92) T ss_pred CCceeeEeehHHHHHHHHHh----hccHHHHHHHHHHHHHHHHHHHHH-------------------------hCC-CCc Confidence 1110 00 011222222222 112223345555555555544443 123 699 Q ss_pred HHHHhhceeeeecCCCCC Q lcl|NC_020079. 145 AHMINAIETKITKSKSKK 162 (162) Q Consensus 145 G~l~~SIty~V~~~~~~~ 162 (162) |.|++||+..+.+++-.- T Consensus 51 G~lrrSI~~~~~~~g~~~ 68 (92) T protein:vir:99 51 GHLKQSAQIQISRDGFTG 68 (92) T ss_pred cccceeeeEEeecCCeeE Confidence 999999998887776333 No 166 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=56.17 E-value=0.039 Score=28.28 Aligned_cols=86 Identities=20% Similarity=0.282 Sum_probs=46.8 Q ss_pred CCCcccccchhH-HHHHHHH------------HHHhhC-------------CeEEEeecC-----CCCCCCCCC----CH Q lcl|NC_020079. 1 MESEILPGDDTD-WETIIKK------------MMDLEQ-------------VQIEAGFLT-----NRRHPESDL----TI 45 (162) Q Consensus 1 M~~~i~~~~~~~-l~~l~~~------------l~~l~~-------------~~v~VGi~~-----~~~~~d~g~----~~ 45 (162) |.. |++++... +.+-++. +++..+ --..-|-+. .... ++.+ .- T Consensus 1 M~~-i~id~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~-e~~~V~nk~~ 78 (124) T protein:vir:95 1 MAK-IKIGRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVP-NGWVIHNKTE 78 (124) T ss_pred Ccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeec-CceeEEEcCC Confidence 654 66665421 2221111 111111 111222211 1111 1100 12 Q ss_pred HHHHHHHhcCCcCC---CCCCcchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 46 PAIAAIQQYGNETN---NIPARPFITDGAVISQNNIAKKMKQVFAN 88 (162) Q Consensus 46 A~iA~~~E~G~~~~---~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (162) -+++-+.|||.-.. .+++||||+|+.+.....+.+.++..+.+ T Consensus 79 yqLtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~ 124 (124) T protein:vir:95 79 YRLAHLLEYGHATVDGGRVPGTPHIRPIEDWLEKEFEDRVEKAIKQ 124 (124) T ss_pred CceeeeeecceeccCCcccCCccchhHHHHHHHHHHHHHHHHHhcC Confidence 35788889996444 37999999999999999999999888887 No 167 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=54.39 E-value=0.11 Score=25.88 Aligned_cols=75 Identities=17% Similarity=0.268 Sum_probs=42.5 Q ss_pred CCCcccccchhHHHHHHHHHHH-hh----------------------------------------------------CCe Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMD-LE----------------------------------------------------QVQ 27 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~-l~----------------------------------------------------~~~ 27 (162) |++.++. +++|++.|+. |+ .+. T Consensus 1 msvevkG-----v~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rt 75 (133) T protein:vir:94 1 MSVEIKG-----IPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERA 75 (133) T ss_pred CeEEEec-----HHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCccee Confidence 6666552 2233332222 00 123 Q ss_pred EEEeecCC-CCCCCCCCCHHHHHHHHhcCCcCCC--CCCcch--hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 28 IEAGFLTN-RRHPESDLTIPAIAAIQQYGNETNN--IPARPF--ITDGAVISQNNIAKKMKQVFAN 88 (162) Q Consensus 28 v~VGi~~~-~~~~d~g~~~A~iA~~~E~G~~~~~--IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 88 (162) |+|||.+. .+| -|--+||||..... |-||-| ++.+++..+..+.+.++..++. T Consensus 76 V~i~W~gp~~R~--------~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:94 76 VLIEWVGPMNRK--------NIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred EEEEeecCCCce--------eEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 55666442 222 35567899963222 457775 7888888888888888877776 No 168 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=54.39 E-value=0.11 Score=25.88 Aligned_cols=75 Identities=17% Similarity=0.268 Sum_probs=42.5 Q ss_pred CCCcccccchhHHHHHHHHHHH-hh----------------------------------------------------CCe Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMD-LE----------------------------------------------------QVQ 27 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~-l~----------------------------------------------------~~~ 27 (162) |++.++. +++|++.|+. |+ .+. T Consensus 1 msvevkG-----v~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rt 75 (133) T protein:vir:96 1 MSVEIKG-----IPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERA 75 (133) T ss_pred CeEEEec-----HHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCccee Confidence 6666552 2233332222 00 123 Q ss_pred EEEeecCC-CCCCCCCCCHHHHHHHHhcCCcCCC--CCCcch--hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 28 IEAGFLTN-RRHPESDLTIPAIAAIQQYGNETNN--IPARPF--ITDGAVISQNNIAKKMKQVFAN 88 (162) Q Consensus 28 v~VGi~~~-~~~~d~g~~~A~iA~~~E~G~~~~~--IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 88 (162) |+|||.+. .+| -|--+||||..... |-||-| ++.+++..+..+.+.++..++. T Consensus 76 V~i~W~gp~~R~--------~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:96 76 VLIEWVGPMNRK--------NIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred EEEEeecCCCce--------eEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 55666442 222 35567899963222 457775 7888888888888888877776 No 169 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=54.39 E-value=0.11 Score=25.88 Aligned_cols=75 Identities=17% Similarity=0.268 Sum_probs=42.5 Q ss_pred CCCcccccchhHHHHHHHHHHH-hh----------------------------------------------------CCe Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMD-LE----------------------------------------------------QVQ 27 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~-l~----------------------------------------------------~~~ 27 (162) |++.++. +++|++.|+. |+ .+. T Consensus 1 msvevkG-----v~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rt 75 (133) T protein:vir:93 1 MSVEIKG-----IPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERA 75 (133) T ss_pred CeEEEec-----HHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCccee Confidence 6666552 2233332222 00 123 Q ss_pred EEEeecCC-CCCCCCCCCHHHHHHHHhcCCcCCC--CCCcch--hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 28 IEAGFLTN-RRHPESDLTIPAIAAIQQYGNETNN--IPARPF--ITDGAVISQNNIAKKMKQVFAN 88 (162) Q Consensus 28 v~VGi~~~-~~~~d~g~~~A~iA~~~E~G~~~~~--IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 88 (162) |+|||.+. .+| -|--+||||..... |-||-| ++.+++..+..+.+.++..++. T Consensus 76 V~i~W~gp~~R~--------~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 76 VLIEWVGPMNRK--------NIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred EEEEeecCCCce--------eEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 55666442 222 35567899963222 457775 7888888888888888877776 No 170 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=54.39 E-value=0.11 Score=25.88 Aligned_cols=75 Identities=17% Similarity=0.268 Sum_probs=42.5 Q ss_pred CCCcccccchhHHHHHHHHHHH-hh----------------------------------------------------CCe Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMD-LE----------------------------------------------------QVQ 27 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~-l~----------------------------------------------------~~~ 27 (162) |++.++. +++|++.|+. |+ .+. T Consensus 1 msvevkG-----v~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rt 75 (133) T protein:vir:78 1 MSVEIKG-----IPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERA 75 (133) T ss_pred CeEEEec-----HHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCccee Confidence 6666552 2233332222 00 123 Q ss_pred EEEeecCC-CCCCCCCCCHHHHHHHHhcCCcCCC--CCCcch--hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020079. 28 IEAGFLTN-RRHPESDLTIPAIAAIQQYGNETNN--IPARPF--ITDGAVISQNNIAKKMKQVFAN 88 (162) Q Consensus 28 v~VGi~~~-~~~~d~g~~~A~iA~~~E~G~~~~~--IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 88 (162) |+|||.+. .+| -|--+||||..... |-||-| ++.+++..+..+.+.++..++. T Consensus 76 V~i~W~gp~~R~--------~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:78 76 VLIEWVGPMNRK--------NIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred EEEEeecCCCce--------eEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 55666442 222 35567899963222 457775 7888888888888888877776 No 171 >protein:vir:487 Length: 187 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543094;swissprot:trembl:q8w625;genbank:gi:18249906;uniprot:Q8W625;genbank:GeneID:929690 Probab=47.56 E-value=0.23 Score=24.10 Aligned_cols=90 Identities=11% Similarity=0.135 Sum_probs=44.0 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCcCCCCCCcchhhHHHHHHHHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNETNNIPARPFITDGAVISQNNIAK 80 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~~~~IP~RpFlr~~~~~~~~~~~~ 80 (162) |+.-|-++ ||--.+++.+++|-- .+-|.+=-| +.. T Consensus 1 ~~~~~~~~-------------------------------~~~nam~~~~~lHvd----F~qp~~~~F------nr~---- 35 (187) T protein:vir:48 1 MKNCVQRD-------------------------------DGVNAMNQTAFLHVD----FKQPKELEF------NRA---- 35 (187) T ss_pred Cccccccc-------------------------------cchhhhhhccceeEe----eecCCceee------cHH---- Confidence 22211111 221233444444411 112322111 111 Q ss_pred HHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCCchhhHHHHHhhceeeeecCCC Q lcl|NC_020079. 81 KMKQVFANYLMHNVGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNHILIDTAHMINAIETKITKSKS 160 (162) Q Consensus 81 ~~~~~~~~~l~G~~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~PLidTG~l~~SIty~V~~~~~ 160 (162) .+ ..++..+|+....+.|.-+-. || +.+..+.|-..||.|..||+|.|-+.-+ T Consensus 36 ri-------------RraF~~iGq~h~r~ArrLvm~-------------RG-rs~pge~P~~qTGrLa~SIgy~Vpkat~ 88 (187) T protein:vir:48 36 RL-------------RRAFVQIGRVYMRDARRLVIK-------------RG-RSGPGENPGYQTGRLARSIGYYVPKKTT 88 (187) T ss_pred HH-------------HHHHHHHhHHHHHHHHHHHHh-------------cc-cCCCCCCCcchhhhhhhhhhhccccccC Confidence 11 234555666666666644321 12 2233589999999999999999887666 Q ss_pred CC Q lcl|NC_020079. 161 KK 162 (162) Q Consensus 161 ~~ 162 (162) .+ T Consensus 89 ~R 90 (187) T protein:vir:48 89 RR 90 (187) T ss_pred CC Confidence 66 No 172 >protein:vir:4460 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700383;genbank:gi:23505455;genbank:GeneID:955662 Probab=42.43 E-value=0.067 Score=27.01 Aligned_cols=76 Identities=11% Similarity=0.208 Sum_probs=41.4 Q ss_pred CCCcchhhHHHHHHHHHHHHHHHHHHHHHhccc-hHHHHHHHHHHHHHHHHHHHHHhcCCCCCCHHHHHHhhccCCCCCC Q lcl|NC_020079. 61 IPARPFITDGAVISQNNIAKKMKQVFANYLMHN-VGLAVFEPIARASREGIAQAIAMQRYRPLSPVTIKIRQDKGNYSNH 139 (162) Q Consensus 61 IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~-~~~~~l~~iG~~~~~~i~~~I~~~~~ppnsp~Ti~~k~~k~~~~~~ 139 (162) .+.-+||---|+.-.+ +...+ ....++..+|+....+.|.-+-. ++ +.+..++ T Consensus 1 M~~~~~lHvdF~qp~~------------~~Fnr~r~RraF~~iGq~h~r~Arrlvm~-------------RG-rs~pGe~ 54 (170) T protein:vir:44 1 MPQKAYLHVDFVQPEE------------LVFNRARMRRAFVKIGQVHMRDARRLVMK-------------RG-RSKPGEN 54 (170) T ss_pred CCCCceeEEeeecCCc------------eeecHHHHHHHHHHHhHHHHHHHHHHHHH-------------hc-CCCCCCC Confidence 1111122111111100 00001 12345677888877777754421 22 3344689 Q ss_pred chhhHHHHHhhceeeeecCCCCC Q lcl|NC_020079. 140 ILIDTAHMINAIETKITKSKSKK 162 (162) Q Consensus 140 PLidTG~l~~SIty~V~~~~~~~ 162 (162) |-..||.|..||.|.|-..-.++ T Consensus 55 P~~~TGrLa~SIgy~Vpras~~r 77 (170) T protein:vir:44 55 PSYRTGQLARSIGYYVPRASKKR 77 (170) T ss_pred CcchhhhhhhhhhhccccccCCC Confidence 99999999999999988776665 No 173 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=38.19 E-value=0.33 Score=23.25 Aligned_cols=89 Identities=16% Similarity=0.161 Sum_probs=51.7 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCCHHHHHHHHhcCCc----------------CCCCCCc Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLTIPAIAAIQQYGNE----------------TNNIPAR 64 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~~A~iA~~~E~G~~----------------~~~IP~R 64 (162) |+-+|+..+. .+.........|||.. .|.++--.=|+||.|.+-|+. .+.||.= T Consensus 62 LaDsI~~~~~--------niDg~~dG~s~VGf~~--k~~~~~~~kA~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gD 131 (168) T protein:vir:74 62 LADSIVMKNK--------NIDGVKDGQSVVGWER--STEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHAD 131 (168) T ss_pred hhhheeeccc--------ccCcccCCceeecccc--cccccccchhhhhhhhcccccccccccccccccccccccccccc Confidence 4444443321 1222344567888854 232221235899999999962 2468999 Q ss_pred chhhHHHHH--HHHHHHHHHHHHHHHHhccchHHHHH Q lcl|NC_020079. 65 PFITDGAVI--SQNNIAKKMKQVFANYLMHNVGLAVF 99 (162) Q Consensus 65 pFlr~~~~~--~~~~~~~~~~~~~~~~l~G~~~~~~l 99 (162) .|+..+-.+ -++++.+.-...+..+|.-+-.+--| T Consensus 132 HFvd~~r~~~~~k~~V~~Ae~~~y~eIl~~k~~~~~~ 168 (168) T protein:vir:74 132 HFIEETRMNLIVQQGILKAEAEAMRKIINRKKKENNL 168 (168) T ss_pred hhHHHHHhhhhhHHHHHHHHHHHHHHHHHhhcCCCCC Confidence 999998887 45777776666666666332111111 No 174 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=37.12 E-value=0.39 Score=22.82 Aligned_cols=82 Identities=13% Similarity=0.118 Sum_probs=46.8 Q ss_pred CCCccc----ccchhHHHHHHHHHHHh-------------------hCCeEEEeecCC-CCCCCCCCCHHHHHHHHhcCC Q lcl|NC_020079. 1 MESEIL----PGDDTDWETIIKKMMDL-------------------EQVQIEAGFLTN-RRHPESDLTIPAIAAIQQYGN 56 (162) Q Consensus 1 M~~~i~----~~~~~~l~~l~~~l~~l-------------------~~~~v~VGi~~~-~~~~d~g~~~A~iA~~~E~G~ 56 (162) |.--+. .....-.+.+.+++... ..++|+|||-+. .+| -|--+||||. T Consensus 23 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~rtV~i~W~gp~~R~--------~iVHLNE~G~ 94 (133) T protein:vir:96 23 LMRITDRALTEAGEVVLEAIRTNLKYFRDTGAEYGEVKLSKPTWENGKRTIRVYWEGEKHRY--------SIVHLNEKGF 94 (133) T ss_pred HHHHhhHHHHHHHHHHHHHHHHhhHHHhhccceeeeEEecCceecCCceEEEEEeecCCCce--------eeEeeecccc Confidence 322211 11100011222222211 124688888653 343 4567889993 Q ss_pred cC---CCCCCcch--hhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020079. 57 ET---NNIPARPF--ITDGAVISQNNIAKKMKQVFANYL 90 (162) Q Consensus 57 ~~---~~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~~l 90 (162) -+ ..|-||-| ++.+++..+..+.+.++..++..| T Consensus 95 ytr~Gk~i~PrG~G~I~~al~~se~~y~~~vk~el~kll 133 (133) T protein:vir:96 95 YAKDGKFIRPKGMGAIDKALRASRDKFFKVYAEEVSKLL 133 (133) T ss_pred eecCCceeccchhhHHHHHHHhhhHHHHHHHHHHHHHhC Confidence 22 23667875 888999999999999999998887 No 175 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=36.87 E-value=0.19 Score=24.52 Aligned_cols=88 Identities=16% Similarity=0.162 Sum_probs=53.5 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCC-HHHHHHHHhcCCc----------------CCCCCC Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLT-IPAIAAIQQYGNE----------------TNNIPA 63 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~-~A~iA~~~E~G~~----------------~~~IP~ 63 (162) |+-+|+..+. .+......+..|||.... ++|+. -|+||.|.+-|+. .+.||. T Consensus 62 LaDsI~~~~~--------niDg~~dG~s~VGf~~k~---~~~~~~ka~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~g 130 (168) T protein:vir:10 62 LADSIVMKNK--------NIDGVKDGQSVVGWERST---EKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHA 130 (168) T ss_pred hhhhheeccc--------ccccccCCceeecccCcc---ccccccchheeeecccccccccccccccccccccccccccc Confidence 5555553331 233334567889996432 23333 6999999999962 245899 Q ss_pred cchhhHHHHH--HHHHHHHHHHHHHHHHhccchHHHHH Q lcl|NC_020079. 64 RPFITDGAVI--SQNNIAKKMKQVFANYLMHNVGLAVF 99 (162) Q Consensus 64 RpFlr~~~~~--~~~~~~~~~~~~~~~~l~G~~~~~~l 99 (162) =.|+..+-.+ .++++.+.....+..+|.-+-.+.-| T Consensus 131 DHFvd~~r~d~a~k~~V~~Ae~~~y~eIl~~k~~~~~~ 168 (168) T protein:vir:10 131 DHFIEETRKNPIVQQGILKAEAEAMRKIINRKKKESNL 168 (168) T ss_pred chhHHHhhhchhhhHHHHHHHHHHHHHHHHhhcCCCCC Confidence 9999998886 36777777666666666332211111 No 176 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=33.07 E-value=0.4 Score=22.78 Aligned_cols=82 Identities=15% Similarity=0.181 Sum_probs=47.6 Q ss_pred CCCcccccchhHHHHHHHHHHH----h-------------------hCCeEEEeecCC-CCCCCCCCCHHHHHHHHhcCC Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMD----L-------------------EQVQIEAGFLTN-RRHPESDLTIPAIAAIQQYGN 56 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~----l-------------------~~~~v~VGi~~~-~~~~d~g~~~A~iA~~~E~G~ 56 (162) |.--+...=...-+.+.+.|+. . ..+.|+|||.+. .+| -|--+||||. T Consensus 24 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~G~r~V~i~W~gp~~R~--------~iVHLNE~GY 95 (133) T protein:vir:78 24 LPQLVDPALIAGATLVAKTLKSEFVQFKDTGASIDEINIEKPSYDKGVRSIKIDWKGPKDRY--------KIIHLNEYGY 95 (133) T ss_pred HHHhhhHHHHHHHHHHHHHHHHhhcchhcccceeeeEEecCeeeeCCceEEEEEEecCCCce--------eEEEeeccce Confidence 3322211100112222222222 0 124688888653 344 4567889996 Q ss_pred cCC--CCCCcch--hhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_020079. 57 ETN--NIPARPF--ITDGAVISQNNIAKKMKQVFANYL 90 (162) Q Consensus 57 ~~~--~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~~l 90 (162) ... .|-||-| ++.+++..+..+.+.++.-+...| T Consensus 96 tr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~el~k~l 133 (133) T protein:vir:78 96 TRNGKKITPAGTGSVARSLRISERAYRAIVQKKIGDKL 133 (133) T ss_pred ecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHHHhhC Confidence 322 2567775 888899988888888888888877 No 177 >protein:vir:6216 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:10886 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852596;genbank:gi:31415856;genbank:GeneID:1489214 Probab=32.27 E-value=0.67 Score=21.52 Aligned_cols=78 Identities=12% Similarity=0.148 Sum_probs=42.6 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCC---CCCCCCCCHHHHHHHHhcCCcCC----CCCCcchhhHHHHH Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNR---RHPESDLTIPAIAAIQQYGNETN----NIPARPFITDGAVI 73 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~---~~~d~g~~~A~iA~~~E~G~~~~----~IP~RpFlr~~~~~ 73 (162) |+++|-..-.++---+. -+++|=+-++. ..+ .-|-+=+.-|.|+... .|-+|.|...||+. T Consensus 41 L~P~Ip~Sl~kkk~Hlr--------D~lkVvvk~d~V~V~Fe----d~a~yW~f~EnGt~~~~~~g~vkaqhf~~~Tf~~ 108 (125) T protein:vir:62 41 LKPKINVSNKNKRTHLR--------DSLKVVVKDDRVSVEFK----DEAWYWYLVEHGHKKAKGKGRVKGKHFVQNTFDA 108 (125) T ss_pred hccccChhhhhhhhhcc--------eeeeEEeeCCeEEEEEc----chhhhhhhhhccccccccccccchhhhhhccHHh Confidence 33333211000000000 02333333221 111 2256666778886432 27889999999999 Q ss_pred HHHHHHHHHHHHHHHHh Q lcl|NC_020079. 74 SQNNIAKKMKQVFANYL 90 (162) Q Consensus 74 ~~~~~~~~~~~~~~~~l 90 (162) ++++|.+.|.+-+..-+ T Consensus 109 nk~kI~~iM~kki~d~m 125 (125) T protein:vir:62 109 EGDKIADIMAQKIINRM 125 (125) T ss_pred hHHHHHHHHHHHHHhhC Confidence 99999999887776655 No 178 >protein:vir:3994 Length: 168 # NCBI annotation: unknown # Family: family:all:1029 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116502;genbank:gi:14251135;genbank:GeneID:921309 Probab=20.56 E-value=0.65 Score=21.60 Aligned_cols=88 Identities=16% Similarity=0.162 Sum_probs=50.3 Q ss_pred CCCcccccchhHHHHHHHHHHHhhCCeEEEeecCCCCCCCCCCC-HHHHHHHHhcCCc----------------CCCCCC Q lcl|NC_020079. 1 MESEILPGDDTDWETIIKKMMDLEQVQIEAGFLTNRRHPESDLT-IPAIAAIQQYGNE----------------TNNIPA 63 (162) Q Consensus 1 M~~~i~~~~~~~l~~l~~~l~~l~~~~v~VGi~~~~~~~d~g~~-~A~iA~~~E~G~~----------------~~~IP~ 63 (162) |+-+|+..+. .+.......-.|||.... ++|+. -|+||.+-+-|+. ++.||. T Consensus 62 LADsI~~~~~--------niDg~~dG~StVGw~~k~---~~~~~~~a~iAr~lNDGTrf~~~~~~~~~~y~~~g~v~i~g 130 (168) T protein:vir:39 62 LADSIVMKNK--------NIDGVKDGQSVVGWERST---EKGTHTKGYIANIINNGSRFPQFTTRSGRKYKNPGEVAVHA 130 (168) T ss_pred chhheeeccc--------ccCcccCCceeccccCcc---ccccccchhheehhccccccchhhhhcccccccccceeecc Confidence 4444443321 112223334567775421 22333 6899999999962 246889 Q ss_pred cchhhHHHHHH--HHHHHHHHHHHHHHHhccchHHHHH Q lcl|NC_020079. 64 RPFITDGAVIS--QNNIAKKMKQVFANYLMHNVGLAVF 99 (162) Q Consensus 64 RpFlr~~~~~~--~~~~~~~~~~~~~~~l~G~~~~~~l 99 (162) =+|+..+-.+. ++++.+.....+..+|.-+-.+.-| T Consensus 131 DHFvd~~r~~~a~k~aV~~Ae~e~~~eil~~k~~~~~~ 168 (168) T protein:vir:39 131 DHFIEETRKNPIVQQGILKAEAEAMRKIINRKKKENNL 168 (168) T ss_pred cchhHHHhhhhhhhHHHHHHHHHHHHHHHHhcCCCCCC Confidence 99999988864 6777777767777766433221112 Done!