Query lcl|NC_019457.1_cdsid_YP_007003087.1 [gene=F353_gp47] [protein=hypothetical protein] [protein_id=YP_007003087.1] [location=30680..31150] Match_columns 156 No_of_seqs 119 out of 197 Neff 6.3 Searched_HMMs 1612 Date Thu Nov 7 17:21:59 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_47 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_47_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:5257 Length: 148 # 100.0 6.1E-54 3.8E-57 312.3 16.9 146 2-148 1-148 (148) 2 protein:vir:99546 Length: 200 100.0 5.1E-52 3.1E-55 301.8 16.9 146 1-148 6-200 (200) 3 protein:vir:107757 Length: 189 100.0 6.6E-52 4.1E-55 301.2 16.9 148 2-152 1-189 (189) 4 protein:vir:96105 Length: 193 100.0 3.2E-51 2E-54 297.4 16.6 144 4-148 1-193 (193) 5 protein:vir:78607 Length: 155 100.0 1.8E-50 1.1E-53 293.3 14.9 139 2-149 1-155 (155) 6 protein:vir:106728 Length: 155 100.0 2.2E-50 1.4E-53 292.8 15.0 139 2-149 1-155 (155) 7 protein:vir:94069 Length: 168 100.0 7.1E-50 4.4E-53 290.0 16.1 150 1-156 1-166 (168) 8 protein:vir:101563 Length: 155 100.0 1.4E-49 8.9E-53 288.4 15.1 139 2-149 1-155 (155) 9 protein:vir:77650 Length: 155 100.0 4E-49 2.5E-52 285.9 14.1 139 2-149 1-155 (155) 10 protein:vir:80037 Length: 199 100.0 2.5E-46 1.5E-49 270.6 14.6 140 2-150 1-199 (199) 11 protein:vir:95260 Length: 160 100.0 5.6E-44 3.5E-47 257.7 14.9 146 1-156 1-158 (160) 12 protein:vir:99833 Length: 190 98.9 2.3E-12 1.4E-15 84.4 4.6 91 66-156 1-99 (190) 13 protein:vir:3163 Length: 145 # 98.8 1.5E-11 9.3E-15 79.9 6.0 83 71-156 1-93 (145) 14 protein:vir:79091 Length: 175 98.8 1.4E-11 8.5E-15 80.1 5.2 90 67-156 1-116 (175) 15 protein:vir:103841 Length: 155 98.8 1.1E-11 6.9E-15 80.6 4.6 90 67-156 1-99 (155) 16 protein:vir:79225 Length: 155 98.7 1.9E-11 1.2E-14 79.4 5.1 90 67-156 1-99 (155) 17 protein:vir:99196 Length: 155 98.7 3.5E-11 2.2E-14 77.9 5.2 90 67-156 1-99 (155) 18 protein:vir:1988 Length: 156 # 98.6 6.7E-11 4.1E-14 76.4 5.3 89 67-156 1-103 (156) 19 protein:vir:4347 Length: 164 # 98.6 1.1E-10 7E-14 75.1 5.3 104 1-106 4-164 (164) 20 protein:vir:100243 Length: 140 98.5 8.3E-10 5.1E-13 70.4 6.7 89 1-90 1-140 (140) 21 protein:vir:107851 Length: 175 98.5 3.8E-10 2.4E-13 72.2 4.7 90 67-156 1-116 (175) 22 protein:vir:93617 Length: 148 98.4 5.1E-10 3.2E-13 71.5 4.4 96 1-98 1-148 (148) 23 protein:vir:80362 Length: 140 98.4 1.1E-09 6.8E-13 69.7 6.1 89 1-90 1-140 (140) 24 protein:vir:1437 Length: 140 # 98.4 1.4E-09 8.7E-13 69.1 6.3 89 1-90 1-140 (140) 25 protein:vir:100075 Length: 140 98.4 1.4E-09 8.9E-13 69.1 5.9 89 1-90 1-140 (140) 26 protein:vir:1891 Length: 179 # 98.3 5.6E-10 3.5E-13 71.3 3.3 104 1-131 4-179 (179) 27 protein:vir:1273 Length: 127 # 98.3 2.4E-09 1.5E-12 67.8 6.6 81 1-91 1-127 (127) 28 protein:vir:2740 Length: 114 # 98.3 1.1E-09 6.9E-13 69.7 3.8 88 1-88 1-114 (114) 29 protein:vir:4906 Length: 114 # 98.3 1.1E-09 6.9E-13 69.7 3.8 88 1-88 1-114 (114) 30 protein:vir:95789 Length: 114 98.3 1.3E-09 7.8E-13 69.4 4.0 90 2-91 1-114 (114) 31 protein:vir:3873 Length: 128 # 98.2 5E-09 3.1E-12 66.1 5.0 80 2-91 1-128 (128) 32 protein:vir:105089 Length: 133 98.2 2.2E-09 1.4E-12 68.0 3.0 93 1-93 1-133 (133) 33 protein:vir:102085 Length: 146 98.2 3.9E-09 2.4E-12 66.7 4.3 87 1-96 4-146 (146) 34 protein:vir:102875 Length: 146 98.2 3.9E-09 2.4E-12 66.7 4.3 87 1-96 4-146 (146) 35 protein:vir:107568 Length: 146 98.2 3.9E-09 2.4E-12 66.7 4.3 87 1-96 4-146 (146) 36 protein:vir:105007 Length: 146 98.2 3.9E-09 2.4E-12 66.7 4.3 87 1-96 4-146 (146) 37 protein:vir:103917 Length: 115 98.2 4.5E-09 2.8E-12 66.4 4.5 84 4-87 1-115 (115) 38 protein:vir:96225 Length: 115 98.2 4.5E-09 2.8E-12 66.4 4.5 84 4-87 1-115 (115) 39 protein:vir:78858 Length: 115 98.2 4.5E-09 2.8E-12 66.4 4.5 84 4-87 1-115 (115) 40 protein:vir:97144 Length: 115 98.2 4.5E-09 2.8E-12 66.4 4.5 84 4-87 1-115 (115) 41 protein:vir:9312 Length: 115 # 98.2 4.5E-09 2.8E-12 66.4 4.5 84 4-87 1-115 (115) 42 protein:vir:96358 Length: 115 98.2 4.5E-09 2.8E-12 66.4 4.5 84 4-87 1-115 (115) 43 protein:vir:194 Length: 149 # 98.2 3.5E-09 2.1E-12 67.0 3.9 96 1-98 1-149 (149) 44 protein:vir:106623 Length: 115 98.2 4E-09 2.5E-12 66.6 4.2 84 4-87 1-115 (115) 45 protein:vir:96486 Length: 112 98.1 2.6E-09 1.6E-12 67.6 2.9 86 1-86 1-112 (112) 46 protein:vir:5745 Length: 135 # 98.0 5.3E-08 3.3E-11 60.5 7.2 95 1-101 2-135 (135) 47 protein:vir:99744 Length: 115 98.0 1.5E-08 9.3E-12 63.5 3.9 84 4-87 1-115 (115) 48 protein:vir:98557 Length: 149 97.9 8.2E-08 5.1E-11 59.4 6.8 85 71-156 1-95 (149) 49 protein:vir:1386 Length: 149 # 97.9 8.5E-09 5.3E-12 64.8 1.2 89 1-113 46-149 (149) 50 protein:vir:9708 Length: 125 # 97.9 1.5E-08 9E-12 63.5 2.5 82 1-92 37-125 (125) 51 protein:vir:3617 Length: 112 # 97.8 3E-08 1.9E-11 61.8 3.8 87 1-87 2-112 (112) 52 protein:vir:106570 Length: 182 97.8 5.2E-08 3.2E-11 60.5 4.9 96 1-101 1-182 (182) 53 protein:vir:2026 Length: 150 # 97.8 1.4E-07 8.5E-11 58.2 6.8 85 72-156 1-95 (150) 54 protein:vir:97088 Length: 157 97.8 7.1E-08 4.4E-11 59.8 4.9 86 2-90 1-157 (157) 55 protein:vir:9930 Length: 108 # 97.8 6.6E-08 4.1E-11 59.9 4.6 83 6-88 1-108 (108) 56 protein:vir:6071 Length: 150 # 97.7 2.5E-07 1.5E-10 56.8 6.8 85 72-156 1-95 (150) 57 protein:vir:5703 Length: 150 # 97.6 3.5E-07 2.2E-10 56.0 6.8 85 72-156 1-95 (150) 58 protein:vir:79091 Length: 175 97.6 3.7E-07 2.3E-10 55.8 6.2 87 1-89 4-175 (175) 59 protein:vir:94538 Length: 125 97.6 7.8E-08 4.9E-11 59.5 2.2 93 1-93 19-125 (125) 60 protein:vir:743 Length: 108 # 97.5 5.1E-07 3.2E-10 55.1 6.5 84 4-87 1-108 (108) 61 protein:vir:5978 Length: 144 # 97.4 2E-07 1.3E-10 57.3 3.2 87 1-87 1-144 (144) 62 protein:vir:9414 Length: 125 # 97.4 1.8E-07 1.1E-10 57.6 2.5 81 1-91 21-125 (125) 63 protein:vir:98342 Length: 125 97.4 1.8E-07 1.1E-10 57.6 2.5 81 1-91 21-125 (125) 64 protein:vir:81106 Length: 125 97.4 1.8E-07 1.1E-10 57.6 2.5 81 1-91 21-125 (125) 65 protein:vir:4704 Length: 125 # 97.4 1.8E-07 1.1E-10 57.6 2.5 81 1-91 21-125 (125) 66 protein:vir:79988 Length: 125 97.4 1.8E-07 1.1E-10 57.6 2.5 81 1-91 21-125 (125) 67 protein:vir:98409 Length: 108 97.4 3.1E-07 1.9E-10 56.3 3.7 87 1-87 10-108 (108) 68 protein:vir:102154 Length: 119 97.3 8E-07 5E-10 54.0 4.8 81 1-91 1-119 (119) 69 protein:vir:3163 Length: 145 # 97.1 1.1E-06 6.9E-10 53.2 4.1 80 1-93 52-145 (145) 70 protein:vir:94654 Length: 142 97.0 2.3E-06 1.4E-09 51.5 5.0 86 1-86 1-142 (142) 71 protein:vir:101594 Length: 173 97.0 3E-06 1.9E-09 50.8 5.6 90 4-99 1-173 (173) 72 protein:vir:100312 Length: 152 97.0 1.2E-06 7.3E-10 53.1 3.3 77 1-89 63-152 (152) 73 protein:vir:1838 Length: 149 # 97.0 2.3E-06 1.4E-09 51.5 4.7 75 1-88 63-149 (149) 74 protein:vir:96121 Length: 137 97.0 1.1E-06 6.9E-10 53.2 2.7 82 1-83 1-137 (137) 75 protein:vir:98557 Length: 149 96.9 2.8E-06 1.7E-09 51.0 4.6 76 1-88 53-149 (149) 76 protein:vir:94796 Length: 137 96.9 1.2E-06 7.6E-10 53.0 2.7 82 1-83 1-137 (137) 77 protein:vir:96829 Length: 135 96.9 1.8E-06 1.1E-09 52.1 3.1 82 1-83 1-135 (135) 78 protein:vir:79115 Length: 148 96.8 8.9E-06 5.5E-09 48.3 6.7 85 72-156 1-94 (148) 79 protein:vir:93738 Length: 137 96.8 1.6E-06 1E-09 52.3 2.6 82 1-83 1-137 (137) 80 protein:vir:94490 Length: 137 96.8 1.6E-06 1E-09 52.3 2.6 82 1-83 1-137 (137) 81 protein:vir:97427 Length: 137 96.8 1.6E-06 1E-09 52.3 2.6 82 1-83 1-137 (137) 82 protein:vir:94108 Length: 149 96.8 4.4E-06 2.7E-09 49.9 4.8 82 1-83 13-149 (149) 83 protein:vir:107851 Length: 175 96.7 1.9E-06 1.2E-09 52.0 2.4 76 1-89 76-175 (175) 84 protein:vir:1838 Length: 149 # 96.7 1.3E-05 8.2E-09 47.3 7.0 85 71-156 1-95 (149) 85 protein:vir:105916 Length: 149 96.7 1.3E-06 8.2E-10 52.8 1.4 82 1-83 13-149 (149) 86 protein:vir:81147 Length: 126 96.7 3.8E-06 2.3E-09 50.3 3.9 89 1-90 1-126 (126) 87 protein:vir:99833 Length: 190 96.7 7.4E-06 4.6E-09 48.7 5.3 73 1-90 38-190 (190) 88 protein:vir:95894 Length: 137 96.7 2.1E-06 1.3E-09 51.7 2.2 82 1-83 1-137 (137) 89 protein:vir:103841 Length: 155 96.7 4.5E-06 2.8E-09 49.9 4.0 77 1-90 57-155 (155) 90 protein:vir:107099 Length: 137 96.7 3.5E-06 2.2E-09 50.5 3.3 82 1-83 1-137 (137) 91 protein:vir:1164 Length: 156 # 96.6 4.4E-06 2.7E-09 49.9 3.2 81 1-92 64-156 (156) 92 protein:vir:105330 Length: 137 96.6 3.4E-06 2.1E-09 50.6 2.6 82 1-83 1-137 (137) 93 protein:vir:1988 Length: 156 # 96.5 7.6E-06 4.7E-09 48.7 4.4 69 1-88 76-156 (156) 94 protein:vir:99196 Length: 155 96.5 4.6E-06 2.9E-09 49.8 2.9 73 1-90 39-155 (155) 95 protein:vir:79115 Length: 148 96.4 5.4E-06 3.4E-09 49.5 3.0 77 1-88 34-148 (148) 96 protein:vir:79179 Length: 155 96.3 2.7E-05 1.7E-08 45.6 6.4 86 71-156 1-101 (155) 97 protein:vir:2026 Length: 150 # 96.3 9.9E-06 6.1E-09 48.0 3.9 77 1-88 53-150 (150) 98 protein:vir:6071 Length: 150 # 96.1 7.1E-06 4.4E-09 48.8 2.2 78 1-88 53-150 (150) 99 protein:vir:100312 Length: 152 96.1 6E-05 3.8E-08 43.7 7.3 86 71-156 1-96 (152) 100 protein:vir:5703 Length: 150 # 96.1 8.4E-06 5.2E-09 48.4 2.5 78 1-88 53-150 (150) 101 protein:vir:78077 Length: 141 96.1 6.9E-06 4.3E-09 48.9 1.8 90 1-90 1-141 (141) 102 protein:vir:79225 Length: 155 95.9 2.6E-05 1.6E-08 45.7 4.3 77 1-90 39-155 (155) 103 protein:vir:79179 Length: 155 95.9 2.2E-05 1.4E-08 46.1 3.7 77 1-88 54-155 (155) 104 protein:vir:97327 Length: 116 95.8 1.4E-05 8.6E-09 47.2 2.2 79 2-83 1-116 (116) 105 protein:vir:1243 Length: 116 # 95.8 1.4E-05 8.6E-09 47.2 2.2 79 2-83 1-116 (116) 106 protein:vir:95062 Length: 116 95.7 1.3E-05 8.1E-09 47.4 1.9 79 2-83 1-116 (116) 107 protein:vir:99101 Length: 142 95.5 1.9E-05 1.2E-08 46.4 2.1 84 1-84 1-142 (142) 108 protein:vir:8669 Length: 142 # 95.5 1.9E-05 1.2E-08 46.4 2.1 84 1-84 1-142 (142) 109 protein:vir:1164 Length: 156 # 95.3 0.00021 1.3E-07 40.7 7.1 86 71-156 1-98 (156) 110 protein:vir:94490 Length: 137 94.5 6.7E-05 4.1E-08 43.5 2.3 68 67-156 1-69 (137) 111 protein:vir:97427 Length: 137 94.5 6.7E-05 4.1E-08 43.5 2.3 68 67-156 1-69 (137) 112 protein:vir:93738 Length: 137 94.5 6.7E-05 4.1E-08 43.5 2.3 68 67-156 1-69 (137) 113 protein:vir:81067 Length: 119 93.4 0.00024 1.5E-07 40.4 3.3 80 1-90 4-119 (119) 114 protein:vir:10367 Length: 119 93.3 0.00027 1.7E-07 40.1 3.3 80 1-90 4-119 (119) 115 protein:vir:100887 Length: 139 93.2 0.00034 2.1E-07 39.6 3.9 83 1-94 40-139 (139) 116 protein:vir:96829 Length: 135 91.8 0.00039 2.4E-07 39.3 2.3 68 67-156 1-69 (135) 117 protein:vir:3787 Length: 231 # 91.7 0.0015 9.3E-07 36.1 5.4 81 1-91 59-231 (231) 118 protein:vir:5000 Length: 141 # 91.6 0.00071 4.4E-07 37.8 3.4 82 1-91 41-141 (141) 119 protein:vir:95894 Length: 137 90.6 0.00046 2.8E-07 38.9 1.4 68 67-156 1-69 (137) 120 protein:vir:80116 Length: 127 89.9 0.0021 1.3E-06 35.2 4.5 91 1-91 1-127 (127) 121 protein:vir:105467 Length: 144 89.4 0.001 6.5E-07 36.9 2.4 94 1-94 3-144 (144) 122 protein:vir:95062 Length: 116 89.2 0.0012 7.1E-07 36.7 2.5 48 92-156 1-48 (116) 123 protein:vir:94796 Length: 137 89.2 0.00095 5.9E-07 37.2 2.0 68 67-156 1-69 (137) 124 protein:vir:3750 Length: 227 # 89.0 0.004 2.5E-06 33.7 5.3 101 1-111 59-227 (227) 125 protein:vir:4956 Length: 153 # 89.0 0.0018 1.1E-06 35.6 3.4 96 1-130 44-153 (153) 126 protein:vir:96121 Length: 137 88.6 0.00076 4.7E-07 37.7 1.1 68 67-156 1-69 (137) 127 protein:vir:1243 Length: 116 # 88.6 0.0012 7.2E-07 36.7 2.1 48 92-156 1-48 (116) 128 protein:vir:97327 Length: 116 88.6 0.0012 7.2E-07 36.7 2.1 48 92-156 1-48 (116) 129 protein:vir:100223 Length: 139 88.4 0.0011 6.9E-07 36.8 1.9 82 1-93 48-139 (139) 130 protein:vir:966 Length: 123 # 88.3 0.0035 2.2E-06 34.0 4.5 88 1-88 1-123 (123) 131 protein:vir:4859 Length: 140 # 87.9 0.0015 9.1E-07 36.1 2.1 82 1-91 41-140 (140) 132 protein:vir:107099 Length: 137 87.5 0.0011 6.8E-07 36.8 1.2 68 67-156 1-69 (137) 133 protein:vir:4833 Length: 140 # 87.5 0.0016 9.8E-07 35.9 2.1 82 1-91 41-140 (140) 134 protein:vir:95372 Length: 124 87.3 0.0023 1.4E-06 35.0 2.9 88 1-88 1-124 (124) 135 protein:vir:105330 Length: 137 87.2 0.001 6.2E-07 37.0 0.8 68 67-156 1-69 (137) 136 protein:vir:2740 Length: 114 # 86.5 0.0033 2E-06 34.2 3.3 72 67-156 1-72 (114) 137 protein:vir:4906 Length: 114 # 86.5 0.0033 2E-06 34.2 3.3 72 67-156 1-72 (114) 138 protein:vir:102441 Length: 137 86.4 0.00086 5.4E-07 37.4 0.0 81 2-82 1-137 (137) 139 protein:vir:97982 Length: 140 86.0 0.0013 8.3E-07 36.3 0.8 79 1-81 1-140 (140) 140 protein:vir:107545 Length: 140 86.0 0.0013 8.3E-07 36.3 0.8 79 1-81 1-140 (140) 141 protein:vir:98409 Length: 108 85.8 0.0046 2.8E-06 33.4 3.7 65 66-156 1-65 (108) 142 protein:vir:9930 Length: 108 # 85.6 0.0039 2.4E-06 33.8 3.2 65 68-156 1-66 (108) 143 protein:vir:94654 Length: 142 84.6 0.0029 1.8E-06 34.5 2.0 69 67-156 1-73 (142) 144 protein:vir:100652 Length: 134 83.5 0.0046 2.9E-06 33.4 2.6 79 2-89 1-134 (134) 145 protein:vir:9513 Length: 134 # 82.8 0.0097 6E-06 31.6 4.1 80 2-89 1-134 (134) 146 protein:vir:101302 Length: 134 82.8 0.0097 6E-06 31.6 4.1 80 2-89 1-134 (134) 147 protein:vir:5978 Length: 144 # 81.8 0.0077 4.8E-06 32.2 3.2 73 62-156 1-74 (144) 148 protein:vir:3617 Length: 112 # 81.6 0.0066 4.1E-06 32.5 2.7 68 67-156 1-71 (112) 149 protein:vir:96486 Length: 112 79.3 0.016 1E-05 30.4 4.1 72 67-156 1-72 (112) 150 protein:vir:78755 Length: 228 78.3 0.028 1.7E-05 29.1 5.1 121 1-127 55-228 (228) 151 protein:vir:105916 Length: 149 77.7 0.017 1.1E-05 30.3 3.7 80 29-156 1-81 (149) 152 protein:vir:95789 Length: 114 75.4 0.016 9.7E-06 30.5 2.8 68 67-156 1-69 (114) 153 protein:vir:743 Length: 108 # 74.7 0.024 1.5E-05 29.5 3.7 65 66-156 1-65 (108) 154 protein:vir:96225 Length: 115 73.8 0.012 7.4E-06 31.1 1.8 71 67-156 1-74 (115) 155 protein:vir:103917 Length: 115 73.8 0.012 7.4E-06 31.1 1.8 71 67-156 1-74 (115) 156 protein:vir:78858 Length: 115 73.8 0.012 7.4E-06 31.1 1.8 71 67-156 1-74 (115) 157 protein:vir:96358 Length: 115 73.8 0.012 7.4E-06 31.1 1.8 71 67-156 1-74 (115) 158 protein:vir:97144 Length: 115 73.8 0.012 7.4E-06 31.1 1.8 71 67-156 1-74 (115) 159 protein:vir:9312 Length: 115 # 73.8 0.012 7.4E-06 31.1 1.8 71 67-156 1-74 (115) 160 protein:vir:94108 Length: 149 72.0 0.0089 5.5E-06 31.8 0.6 80 29-156 1-81 (149) 161 protein:vir:98636 Length: 138 71.6 0.028 1.8E-05 29.1 3.3 82 1-92 32-138 (138) 162 protein:vir:106506 Length: 137 69.8 0.034 2.1E-05 28.6 3.3 64 60-156 1-67 (137) 163 protein:vir:96973 Length: 133 68.3 0.068 4.2E-05 27.0 4.6 78 2-88 1-133 (133) 164 protein:vir:9363 Length: 133 # 68.3 0.068 4.2E-05 27.0 4.6 78 2-88 1-133 (133) 165 protein:vir:78644 Length: 133 68.3 0.068 4.2E-05 27.0 4.6 78 2-88 1-133 (133) 166 protein:vir:94419 Length: 133 68.3 0.068 4.2E-05 27.0 4.6 78 2-88 1-133 (133) 167 protein:vir:9879 Length: 127 # 67.5 0.022 1.4E-05 29.6 1.8 83 6-88 1-127 (127) 168 protein:vir:79034 Length: 141 67.2 0.017 1.1E-05 30.3 1.1 92 1-92 4-141 (141) 169 protein:vir:78335 Length: 133 64.8 0.048 3E-05 27.8 3.1 80 2-90 1-133 (133) 170 protein:vir:78077 Length: 141 63.7 0.028 1.8E-05 29.1 1.6 67 67-156 1-71 (141) 171 protein:vir:9647 Length: 132 # 61.7 0.058 3.6E-05 27.4 2.9 78 1-88 26-132 (132) 172 protein:vir:99528 Length: 92 # 58.5 0.052 3.3E-05 27.6 2.1 68 67-156 1-75 (92) 173 protein:vir:94994 Length: 131 55.1 0.38 0.00023 22.9 6.2 73 1-86 49-131 (131) 174 protein:vir:106041 Length: 137 51.9 0.1 6.2E-05 26.1 2.5 63 67-156 1-68 (137) 175 protein:vir:3848 Length: 159 # 46.1 0.33 0.00021 23.2 4.4 88 67-156 1-100 (159) 176 protein:vir:104347 Length: 145 45.8 0.5 0.00031 22.2 5.3 76 1-86 57-145 (145) 177 protein:vir:103280 Length: 142 44.7 0.48 0.0003 22.3 5.1 79 1-90 54-142 (142) 178 protein:vir:78380 Length: 131 44.2 0.82 0.00051 21.1 6.3 73 1-86 49-131 (131) 179 protein:vir:93898 Length: 133 43.6 0.23 0.00015 24.0 3.2 78 2-88 1-133 (133) 180 protein:vir:96774 Length: 152 37.6 0.86 0.00054 20.9 5.3 72 1-89 73-152 (152) 181 protein:vir:102963 Length: 163 37.2 0.23 0.00014 24.1 2.0 90 2-91 1-163 (163) 182 protein:vir:96012 Length: 133 36.8 0.35 0.00022 23.1 3.0 81 1-90 23-133 (133) 183 protein:vir:97190 Length: 148 35.8 1.2 0.00075 20.1 5.9 77 1-92 53-148 (148) 184 protein:vir:2688 Length: 123 # 31.4 0.58 0.00036 21.9 3.3 79 1-88 14-123 (123) 185 protein:vir:107703 Length: 147 29.2 1.7 0.001 19.4 6.6 79 1-96 55-147 (147) 186 protein:vir:6216 Length: 125 # 29.1 0.53 0.00033 22.1 2.7 85 1-90 18-125 (125) 187 protein:vir:79638 Length: 146 24.0 1.9 0.0012 19.1 4.7 90 1-90 1-146 (146) 188 protein:vir:4460 Length: 170 # 23.3 0.26 0.00016 23.8 -0.2 78 61-156 1-80 (170) 189 protein:vir:6246 Length: 143 # 20.4 0.15 9.2E-05 25.1 -2.2 91 1-98 6-143 (143) No 1 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=100.00 E-value=6.1e-54 Score=312.31 Aligned_cols=146 Identities=29% Similarity=0.408 Sum_probs=137.3 Q ss_pred eeeeecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCC--CCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHH Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKALNKLNSDEFVTVGIHEADN--ARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEIL 79 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~--~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~ 79 (156) |+++++.+...+++|.+.|++|+++ .|+||||++.. ..+++|++||+||+|||||+++||+|||||+|+++++++|. T Consensus 1 M~~~~k~~~~~~~~l~~~l~~l~~~-~v~VGi~~~~~~~~~~~~g~~vA~ia~~~E~G~~~IP~Rpflr~t~~~~~~~~~ 79 (148) T protein:vir:52 1 MAVTVTANFSAAKQLIEQMKSLKEK-AVYVGFPAEFDEKVKGSENFNLASLAAVLEFGNEHIPARPFLRQTLEENQEKYT 79 (148) T ss_pred CccccccccHHHHHHHHHHHHhhCC-eEEEEeecCcCCCCCCCCCCCHHHHHHHHhcCCCCCCCcchhHHHHHHHHHHHH Confidence 9999999888899999999999765 59999995432 22356999999999999999999999999999999999999 Q ss_pred HHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeec Q lcl|NC_019457. 80 DTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTYNIQ 148 (156) Q Consensus 80 ~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty~V~ 148 (156) +++++++.++.+++++|+++|+.++++||++|+++.||||||+||++||||+||||||+|++||+|+|+ T Consensus 80 ~~~~~~~~~~~~~~~~L~~~G~~~~~~ik~~I~~~~~ppna~sTi~~Kg~~~PLidTG~l~~SIty~V~ 148 (148) T protein:vir:52 80 ALFIQWFDQGVPAAQIYERLSVMAQGDVQMNIVKGEWVANAKSTIRRKKSSKPLIDTGKMRQSVRGIVK 148 (148) T ss_pred HHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHhcCCCCchhHHHHHHHHhhhhcC Confidence 999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=100.00 E-value=5.1e-52 Score=301.80 Aligned_cols=146 Identities=25% Similarity=0.307 Sum_probs=133.6 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCC----CCCCCHHHHHHHHhcCCC----------------- Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLNSDEFVTVGIHEADNARP----EGVLTNAQLGAIQHFGND----------------- 59 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~----~~g~~~A~ia~~~E~G~~----------------- 59 (156) +|+++++++++ ++++.+.|++|.++ .|+|||+++++|++ ++|+++|+||+|||||+. T Consensus 6 ~~~~k~~~~~~-~~~~~~~l~~l~~~-~v~vGi~~~~~y~~~~~~~dG~~va~IA~~~EfG~~i~~p~~~~~~~~~~~~g 83 (200) T protein:vir:99 6 SKSNSVAAPLK-HFQMLKQFDALKGK-TVQAGWFETDRYPAKEGETIGPLVAKIARQLEFGGVINHPGGTKYIKDAIVDG 83 (200) T ss_pred ceeeeeecchH-HHHHHHHHHHhhCC-eEEEEEcCCCCcCCcccccccchHHHHHhHHHcCCeeccCCCccccccccccc Confidence 89999999876 56677888998665 59999999999874 468999999999999943 Q ss_pred ------------------------CCCCchhhhHHHHHHHHHHHHHH----HHHHhccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019457. 60 ------------------------RIPARPWLDVGVASVNDEILDTI----AASLEDGEDISQLLNRVGVVAVAGVQNYI 111 (156) Q Consensus 60 ------------------------~IP~RpFlr~~~~~~~~~~~~~~----~~~~~~~~~~~~~l~~iG~~~~~~i~~~I 111 (156) +||||||||+|+++++++|.+.+ ++++.|+.+++++|+++|+.++++||++| T Consensus 84 ~~~g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~~G~~~~~~ik~~I 163 (200) T protein:vir:99 84 RYVGTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQAQIARQLLDGTINPEQALAQIGLALEGCIVRSI 163 (200) T ss_pred cccccccccccccceeeeeccccccCCCcchhhHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHH Confidence 79999999999999999998765 55668899999999999999999999999 Q ss_pred HcCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeec Q lcl|NC_019457. 112 DELRSPANAPSTVERKGADNPLVDTGEMKQSVTYNIQ 148 (156) Q Consensus 112 ~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty~V~ 148 (156) +++.||||||+||++||||+||||||+|++||+|+|+ T Consensus 164 ~~~~~ppna~sTi~~Kg~~~PLidTG~l~~SIty~Ve 200 (200) T protein:vir:99 164 KSGPWAANSPATIRAKGFDKPLIDTAHMWQTVSSKVS 200 (200) T ss_pred hcCCCCCChHHHHHHhCCCCchHHHHHHHhHhccccC Confidence 9999999999999999999999999999999999999 No 3 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=100.00 E-value=6.6e-52 Score=301.18 Aligned_cols=148 Identities=20% Similarity=0.315 Sum_probs=137.4 Q ss_pred eeeeecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCC--CCCCchhhhHHHHHHHHHHH Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGND--RIPARPWLDVGVASVNDEIL 79 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~--~IP~RpFlr~~~~~~~~~~~ 79 (156) |+++++..+..+++|.+.+++|+++ .|+||||++++|+ +|+++|+||+|||||++ +||||||||+|+++++++|. T Consensus 1 M~~~i~~~~~~~~~L~~~lk~l~~k-~V~VGi~~~~~y~--dG~~vA~Ia~~~E~G~p~~~IP~RPFlr~t~~~~~~~~~ 77 (189) T protein:vir:10 1 MGRVIRKQGPARVKLNAFIKGMNDY-SVRIGWFSTAKYP--DGTPTAYVASIHEFGAPSRGIPARSFIRPTIAAQQAAWS 77 (189) T ss_pred CcceeccCcHHHHHHHHHHHHhhCC-eEEEEecCCCCCC--CcccHHHHHHHHHhcCcCCCCCCchhhhHHHHHHHHHHH Confidence 9999999999999999999999655 5999999999885 49999999999999996 69999999999999999998 Q ss_pred HHHH----HHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCC-------------------------- Q lcl|NC_019457. 80 DTIA----ASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGA-------------------------- 129 (156) Q Consensus 80 ~~~~----~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~-------------------------- 129 (156) +.++ +++.|+++++++|+.+|+.++++||.+|+++.||||||+||++||+ T Consensus 78 ~~l~~~~~~vl~G~~~~~~~L~~~G~~a~~~Ik~~I~~~~~ppna~sTi~~Kg~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (189) T protein:vir:10 78 QQMRFYAKQIVVGQMNVEQALEGLAIVARGDVDATLARLKDPPLSPLTIYIRKFIKDGGVIHGYKDIMRLRSEMQQEQAK 157 (189) T ss_pred HHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHHhcccCcccchhhhhhhhhhhhhhhhhhhh Confidence 8654 5667899999999999999999999999999999999999999994 Q ss_pred ---------CCchhHHHHHHhhceeeeccccc Q lcl|NC_019457. 130 ---------DNPLVDTGEMKQSVTYNIQTGRP 152 (156) Q Consensus 130 ---------~~PLidTG~L~~SIty~V~~~k~ 152 (156) ++||||||+|++||||+|++++- T Consensus 158 ~~~~~~~~s~kPLidTG~l~~SIty~V~~k~~ 189 (189) T protein:vir:10 158 GTLNLSGVSTDPLDFTGYMRATLSYTVTKEKS 189 (189) T ss_pred ccccccccCCCchhhHHHHHhhcceeeeecCC Confidence 79999999999999999999888 No 4 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=100.00 E-value=3.2e-51 Score=297.42 Aligned_cols=144 Identities=21% Similarity=0.260 Sum_probs=130.3 Q ss_pred eeecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCC----CCCCHHHHHHHHhcCCC-------------------- Q lcl|NC_019457. 4 ITVPNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPE----GVLTNAQLGAIQHFGND-------------------- 59 (156) Q Consensus 4 v~~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~----~g~~~A~ia~~~E~G~~-------------------- 59 (156) |+++...+.+++|.+.|++|+++ .|+|||++++.|+++ .|+++|+||+|||||+. T Consensus 1 m~~~~~~~~~~~~~~~l~~l~~~-~v~vGi~~~~~~~~~~~~~~G~~va~iAai~EfG~~I~~~~~~~~~~~~~~~g~~~ 79 (193) T protein:vir:96 1 MSLRRDSELIAAHLQMLRAMRGR-SVSAGWYSTARYPDKAGGSVGIQVARIARLNEYGGTIDHPGGTRYIRDAIVRGRFV 79 (193) T ss_pred CeeccchHHHHHHHHHHHHhcCC-eEEEEEcCCCCCCCcccccccchHHHHHhHHHcCCccccCccceeeeecccccccc Confidence 55556667789999999999655 599999999988763 38999999999999953 Q ss_pred ---------------------CCCCchhhhHHHHHHHHHHHHHH----HHHHhccccHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_019457. 60 ---------------------RIPARPWLDVGVASVNDEILDTI----AASLEDGEDISQLLNRVGVVAVAGVQNYIDEL 114 (156) Q Consensus 60 ---------------------~IP~RpFlr~~~~~~~~~~~~~~----~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~ 114 (156) +||||||||+|+++++++|.+.+ ++++.|+.+++++|+++|+.++++||++|+++ T Consensus 80 ~~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~~G~~~~~~ik~~I~~~ 159 (193) T protein:vir:96 80 GVRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQNRIAMRLARGQITPDQALAQIGLALEGYIARSIRTG 159 (193) T ss_pred ccceeccCcceeeEeecceeccCCCcchhhhhHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcC Confidence 79999999999999999998765 45667899999999999999999999999999 Q ss_pred CCCCCcHHHHHhcCCCCchhHHHHHHhhceeeec Q lcl|NC_019457. 115 RSPANAPSTVERKGADNPLVDTGEMKQSVTYNIQ 148 (156) Q Consensus 115 ~~ppns~~Ti~~KG~~~PLidTG~L~~SIty~V~ 148 (156) .||||||+||++||||+||||||+|++||+|+|. T Consensus 160 ~~ppna~~Ti~~KG~~~PLidTG~l~~SIty~Vv 193 (193) T protein:vir:96 160 PWVANSASTVRRKGFNRPLVDTAHMLQSISSRVT 193 (193) T ss_pred CCCCCcHHHHHHhCCCCchhHHHHHHhhhcceeC Confidence 9999999999999999999999999999999999 No 5 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=100.00 E-value=1.8e-50 Score=293.29 Aligned_cols=139 Identities=19% Similarity=0.299 Sum_probs=126.4 Q ss_pred eeeeecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCC----------------CCCCHHHHHHHHhcCCCCCCCch Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPE----------------GVLTNAQLGAIQHFGNDRIPARP 65 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~----------------~g~~~A~ia~~~E~G~~~IP~Rp 65 (156) |+|.++..+ ++ +++|+++ .|+|||+++++|+|. +|+++|+||+|||||+.+||||| T Consensus 1 m~v~~k~L~----~~---~~~l~~~-~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~~~IP~RP 72 (155) T protein:vir:78 1 MSVTRRGLT----LP---KDRYRSM-SVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGTSKLPARP 72 (155) T ss_pred CcchHHHHH----HH---HHHHhCC-eeEEeecCCCCCCcccchhhhhhhhcccccccCCcHHHHHHhhhcCCCCCCCcc Confidence 777766633 33 3445444 599999999999862 38999999999999999999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 66 WLDVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 66 Flr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) |||+|+++++++|.+.+++++.++.+++++|+++|+.++++||.+|+++. |||||+||++||||+||||||+|++||+| T Consensus 73 Flr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~~G~~~~~~Ik~~I~~~~-~pna~~Ti~~Kg~~kPLidTG~l~~SIty 151 (155) T protein:vir:78 73 FMEKTITDRSAEWIKGLTVMMTMGYDAEVAMGQIGQAMKDDIKTTISEWP-ADNSADWAGKKGFNHGLIWTSHLLNSVEQ 151 (155) T ss_pred hhhHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhcCC-CCCcHHHHHhcCCCCchhHHHHHHHhhhh Confidence 99999999999999999999999999999999999999999999999996 99999999999999999999999999999 Q ss_pred eecc Q lcl|NC_019457. 146 NIQT 149 (156) Q Consensus 146 ~V~~ 149 (156) +|++ T Consensus 152 ~V~~ 155 (155) T protein:vir:78 152 EIVK 155 (155) T ss_pred hccC Confidence 9998 No 6 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=100.00 E-value=2.2e-50 Score=292.82 Aligned_cols=139 Identities=19% Similarity=0.304 Sum_probs=126.3 Q ss_pred eeeeecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCC----------------CCCCHHHHHHHHhcCCCCCCCch Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPE----------------GVLTNAQLGAIQHFGNDRIPARP 65 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~----------------~g~~~A~ia~~~E~G~~~IP~Rp 65 (156) |+|.++..+ ++ +++|+++ .|+|||+++++|+|. +|+++|+||+|||||+.+||||| T Consensus 1 m~v~~k~L~----~~---~~~l~~~-~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~~~IP~RP 72 (155) T protein:vir:10 1 MSVTRRGLT----LP---KDRYRSM-SVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGTSKLPARP 72 (155) T ss_pred CcchHHHHH----HH---HHHHhCC-eeEEeecCCCCCccccchhhhhhhhcccccccCCcHHHHHHHHhcCCCCCCCcc Confidence 777766533 33 3445444 599999999999862 38999999999999999999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 66 WLDVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 66 Flr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) |||+|+++++++|.+.+++++.++.+++++|+++|+.++++||.+|+++. |||||+||++||||+||||||+|++||+| T Consensus 73 Flr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~lG~~~~~~Ik~~I~~~~-~pna~~Ti~~KG~~kPLidTG~l~~SIty 151 (155) T protein:vir:10 73 FMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQIGQAMKDDIKTTISEWP-ADNSADWAGKKGFNHGLIWTSHLLNSVEQ 151 (155) T ss_pred hhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhcCC-CCCcHHHHHhcCCCCchhHHHHHHHhhhh Confidence 99999999999999999999999999999999999999999999999986 99999999999999999999999999999 Q ss_pred eecc Q lcl|NC_019457. 146 NIQT 149 (156) Q Consensus 146 ~V~~ 149 (156) +|++ T Consensus 152 ~Vv~ 155 (155) T protein:vir:10 152 EIVK 155 (155) T ss_pred hccC Confidence 9998 No 7 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=100.00 E-value=7.1e-50 Score=290.03 Aligned_cols=150 Identities=18% Similarity=0.243 Sum_probs=133.8 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCC---------------CCCCCHHHHHHHHhcCCCCCCCch Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLNSDEFVTVGIHEADNARP---------------EGVLTNAQLGAIQHFGNDRIPARP 65 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~---------------~~g~~~A~ia~~~E~G~~~IP~Rp 65 (156) |-.++.++ +..+.+.+.+|.++ .|+|||+++++|++ ++|+++|+||+|||||+.+||+|| T Consensus 1 ~~~~~~~g----~~~~~~~~~~l~~~-~v~vG~l~~a~yp~G~~~~~~~~~~~~~~~~g~~va~Ia~~~E~G~~~IP~RP 75 (168) T protein:vir:94 1 MTTIARKG----VKMPPHLEAQFQSG-EVKAGVLSGSTYPQMTYTDQRTGKQIEDARGGMPVAVIAQALEYGHGQNHPRP 75 (168) T ss_pred Cccccchh----hhhhHHHHHhhhcc-ceeeeccccCcccccccchhhcccccccccccccHHHHHHHHhcCCCCCCCch Confidence 33333333 55666667777655 59999999998864 468899999999999999999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 66 WLDVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 66 Flr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) |||+|+++++++|.+.+++++.++.+++++|+.+|+.++++||.+|+++. |||||+||++||||+||||||+|++||+| T Consensus 76 Flr~t~~~~~~~~~~~~~~~~~~~~~~~~~L~~lG~~~~~~Ik~~I~~~~-ppna~sTi~~KG~~~PLiDTG~l~~SIty 154 (168) T protein:vir:94 76 FMQQTYAAQYRAWSRDLTLTLKAGAAADTALRTVGQRMAEDIQDTIRNWP-ADNSPEWAAIKGFNAGLRQTGVLLNAIDS 154 (168) T ss_pred hhHHHHHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHhhcCC-CCccHHHHHhcCCCCchhHHHHHHhhcce Confidence 99999999999999999999999999999999999999999999999985 99999999999999999999999999999 Q ss_pred eec-ccccccCC Q lcl|NC_019457. 146 NIQ-TGRPSEGL 156 (156) Q Consensus 146 ~V~-~~k~~~g~ 156 (156) +|+ +|.+.|.- T Consensus 155 ~Vv~d~~~~~~~ 166 (168) T protein:vir:94 155 AVIIDGEHGEAP 166 (168) T ss_pred eeeecCCCCCCC Confidence 766 88888877 No 8 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=100.00 E-value=1.4e-49 Score=288.36 Aligned_cols=139 Identities=19% Similarity=0.298 Sum_probs=125.6 Q ss_pred eeeeecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCCC----------------CCCHHHHHHHHhcCCCCCCCch Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPEG----------------VLTNAQLGAIQHFGNDRIPARP 65 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~----------------g~~~A~ia~~~E~G~~~IP~Rp 65 (156) |+|.++. |++ .+++|.++ .|+||||++++|+|+. |+++|+||+|||||+.+||||| T Consensus 1 m~v~r~~----L~~---~~~~l~~~-~V~VGi~~~a~y~d~~g~~~~~g~~~~~~~~~G~pva~ia~~~e~G~~~IP~RP 72 (155) T protein:vir:10 1 MSVTRRG----LTL---PKDRYKSM-SVKAGVLAGATYPDESGKKLADGTILKKDPRAGLPVAMIAMALNYGTSKLPARP 72 (155) T ss_pred CcchHHH----HHH---HHHHhhCC-eeEEeecCCCCCCccccchhhhhhhhccccccCcchhhhhhhhhcCCCCCCCcc Confidence 7776654 333 34455544 5999999999998733 7999999999999999999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 66 WLDVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 66 Flr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) |||+|+++++++|.+.+++++.++.+++++|+.+|+.++++||++|+++.+| |+|+||++||||+||||||+|++||+| T Consensus 73 Flr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~~G~~~~~~Ik~~I~~~~~p-~~~~Ti~~KG~~~PLidTG~l~~Sity 151 (155) T protein:vir:10 73 FMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQIGQAMKDDIKTTISEWPAD-NNADWAGKKGFNHGLIWTSHLLNSIEQ 151 (155) T ss_pred hhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhcCCCC-CChHHHHhcCCCCchHHHHHHHHhhhh Confidence 9999999999999999999999999999999999999999999999999986 678999999999999999999999999 Q ss_pred eecc Q lcl|NC_019457. 146 NIQT 149 (156) Q Consensus 146 ~V~~ 149 (156) +|++ T Consensus 152 ~Vv~ 155 (155) T protein:vir:10 152 EIVK 155 (155) T ss_pred hccC Confidence 9998 No 9 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=100.00 E-value=4e-49 Score=285.95 Aligned_cols=139 Identities=19% Similarity=0.300 Sum_probs=124.7 Q ss_pred eeeeecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCC----------------CCCCHHHHHHHHhcCCCCCCCch Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPE----------------GVLTNAQLGAIQHFGNDRIPARP 65 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~----------------~g~~~A~ia~~~E~G~~~IP~Rp 65 (156) |++.++.. +++ +.+|+++ .|+|||+++++|+|. +|+++|+||+|||||+.+||||| T Consensus 1 m~~~r~~l----~~~---~~~l~~~-~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~G~pva~ia~~~e~G~~~IP~RP 72 (155) T protein:vir:77 1 MSVTRRGL----TLP---KDRYRSM-SVKAGVLAGATYPDESGKKLADGSILKKDPRAGLPVAMIAMALNYGTSKLPARP 72 (155) T ss_pred CcchHHHH----HHH---HHHHhcC-ceEEeecCCCCCccccchhhhhhhhccccccccccHhhhhhhhhcCCCCCCCCc Confidence 77666642 333 3344444 599999999998762 48999999999999999999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 66 WLDVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 66 Flr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) |||+|+++++++|.+.+++++.++.+++++|+.+|+.++++||++|+++.+| |+|+||++||||+||||||+|++||+| T Consensus 73 Flr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~lG~~~~~~Iq~~I~~~~~p-~~~~Ti~~KG~d~PLidTG~l~~SIty 151 (155) T protein:vir:77 73 FMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQIGQAMKDDIKTTISEWPAD-NNADWAGKKGFNHGLIWTSHLLNSIEQ 151 (155) T ss_pred hhhHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHhcCCCC-CChHHHHhcCCCCchhHHHHHHHhhhh Confidence 9999999999999999999999999999999999999999999999999986 578999999999999999999999999 Q ss_pred eecc Q lcl|NC_019457. 146 NIQT 149 (156) Q Consensus 146 ~V~~ 149 (156) +|++ T Consensus 152 ~Vv~ 155 (155) T protein:vir:77 152 EIVK 155 (155) T ss_pred hccC Confidence 9998 No 10 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=100.00 E-value=2.5e-46 Score=270.64 Aligned_cols=140 Identities=24% Similarity=0.391 Sum_probs=121.8 Q ss_pred eeeeecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCC----------------------- Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGN----------------------- 58 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~----------------------- 58 (156) |+|+ ...+-+++|.+.|++|++ +.|+|||+.++ |..+++||.+||||+ T Consensus 1 m~vt--~~~~~~~~~~~~l~~L~~-k~v~vGi~~~d------~~~~~~Ia~~~E~Ga~I~~~~~~l~Ip~~~a~~~k~~~ 71 (199) T protein:vir:80 1 MKVT--TDKSTMNKAIRELDQLDR-YSLQIGLFGED------DSFIQMIAGVHEFGLTIRPKGKYLTIPTPEAGDRRARD 71 (199) T ss_pred Cccc--ccHHHHHHHHHHHHHhcC-CEEEEEEecCC------CcchhheeehhhcCCeeecCCceeeecchhhhcccccc Confidence 6666 344557899999999965 56999999633 556777777777772 Q ss_pred -------------------------------CCCCCchhhhHHHHHHHHHHHHHHH----HHHhccccHHHHHHHHHHHH Q lcl|NC_019457. 59 -------------------------------DRIPARPWLDVGVASVNDEILDTIA----ASLEDGEDISQLLNRVGVVA 103 (156) Q Consensus 59 -------------------------------~~IP~RpFlr~~~~~~~~~~~~~~~----~~~~~~~~~~~~l~~iG~~~ 103 (156) .+||+|||||+|+++++++|.+.++ +++.|+.+++++|+++|+.+ T Consensus 72 ~~~~~~p~g~~~~~~~~~~~~~~~~e~g~~~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~vl~g~~~a~~~L~~~G~~~ 151 (199) T protein:vir:80 72 IPGLFKPKGKNILAVAGPDGKLTVMFYLKTEVNIPERSFLRSTFDEKSNKWGELFEGWIDDVIHGKLSAEQVYNRLGAKI 151 (199) T ss_pred cCcccccCCcceeeeeccccceeeeeeccccccCCCCchhHHHHHHHHHHHHHHHHHHHHHHHhCCCcHHHHHHHHHHHH Confidence 1699999999999999999988765 45678999999999999999 Q ss_pred HHHHHHHHHcCCCCCCcHHHHH-hcCCCCchhHHHHHHhhceeeeccc Q lcl|NC_019457. 104 VAGVQNYIDELRSPANAPSTVE-RKGADNPLVDTGEMKQSVTYNIQTG 150 (156) Q Consensus 104 ~~~i~~~I~~~~~ppns~~Ti~-~KG~~~PLidTG~L~~SIty~V~~~ 150 (156) +++||.+|+++.||||||+||+ |||||+||||||+|++||+|+|++- T Consensus 152 ~~~Ik~~I~~~~~ppna~~Tia~rKg~~kPLidTG~l~~SIty~V~~~ 199 (199) T protein:vir:80 152 VDDIQMKIVEIQTPAKSAATLARNPRKNNPLIVTGKMKNSVTWKVMKS 199 (199) T ss_pred HHHHHHHHhccCCCCCCHHHHHHhcCCCCchHHHHHHHhhcceeeeeC Confidence 9999999999999999999997 8999999999999999999999998 No 11 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=100.00 E-value=5.6e-44 Score=257.72 Aligned_cols=146 Identities=22% Similarity=0.331 Sum_probs=115.7 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHH----HHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVAS----VND 76 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~----~~~ 76 (156) |||..+ .+++++|.+++++|++++ |+||||+++.+++ +|+++++||+|||||+.+||+|||||++|+. +.. T Consensus 1 ~~~~~~---~~G~~~L~~~~k~l~~~~-V~VGi~~d~g~~~-dG~sv~~vA~~~EfG~~~iPaRPf~R~tfe~~~~~~~~ 75 (160) T protein:vir:95 1 MVKRVI---HPARAKLVGAMKNLQTAN-AQVGYFQEQGQHS-SGFSYPALMYLQEVIGVPSASGKVYRRLFEITMMLNKQ 75 (160) T ss_pred Cceeec---hHhHHHHHHHHHHHhCCe-eEEeeccccccCC-CCccHHHHHhhhhcCcccCCCcchhHHHHHHHHHHHHH Confidence 887443 446789999999997664 9999999986664 6999999999999999999999999999974 233 Q ss_pred HHHHHHHH----HHhccccHHHHHHHHHHHHHHHHHHHHHcC----CCCCCcHHHHHhcCCCCchhHHHHHHhhceeeec Q lcl|NC_019457. 77 EILDTIAA----SLEDGEDISQLLNRVGVVAVAGVQNYIDEL----RSPANAPSTVERKGADNPLVDTGEMKQSVTYNIQ 148 (156) Q Consensus 77 ~~~~~~~~----~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~----~~ppns~~Ti~~KG~~~PLidTG~L~~SIty~V~ 148 (156) .+.+++.. .+..+. ....+.+|+.++++|+.+|.+. .||||||+||++||||+||||||+|++||+|+|. T Consensus 76 ~~~~~~~~~i~~~~~~g~--~~~~~~LG~~~~~~ik~~I~~~~~p~~w~pNap~Ti~~Kgs~~PLiDTg~l~~Si~y~v~ 153 (160) T protein:vir:95 76 TLLEQTKKNLYKQLSSLN--TDPSNTLEAFAKNAQKAIKRGFGNSAILPPNAPSTVKKKGFNAPLVETGDLRDNLAYKIS 153 (160) T ss_pred HHHHHHHHHHHHHHhhcc--hhHHHHHHHHHHHHHHHHHhhcCCccCCCCCcHHHHHhcCCCCcchhhHHHhhhhhheee Confidence 33333222 222222 2233559999999999999884 5789999999999999999999999999999998 Q ss_pred ccccccCC Q lcl|NC_019457. 149 TGRPSEGL 156 (156) Q Consensus 149 ~~k~~~g~ 156 (156) ++. || T Consensus 154 ~~~---~~ 158 (160) T protein:vir:95 154 TKK---GI 158 (160) T ss_pred ccc---cc Confidence 653 44 No 12 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.89 E-value=2.3e-12 Score=84.36 Aligned_cols=91 Identities=18% Similarity=0.190 Sum_probs=72.2 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcC------CCCCCcHHHHHhc--CCCCchhHHH Q lcl|NC_019457. 66 WLDVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDEL------RSPANAPSTVERK--GADNPLVDTG 137 (156) Q Consensus 66 Flr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppns~~Ti~~K--G~~~PLidTG 137 (156) -+.-.+.-+-.++.+.|...+....+...++..||..+...+++.|... .|+|++++|+++| +..++|+||| T Consensus 1 M~~i~i~~d~~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~~~~L~~tg 80 (190) T protein:vir:99 1 MAGITLEWDGRRALDVLNAGSAALGDPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNRDKILTLDG 80 (190) T ss_pred CceeEEEecHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCCCccceecH Confidence 1111122223456666666666555788999999999999999999875 5789999999766 5679999999 Q ss_pred HHHhhceeeecccccccCC Q lcl|NC_019457. 138 EMKQSVTYNIQTGRPSEGL 156 (156) Q Consensus 138 ~L~~SIty~V~~~k~~~g~ 156 (156) .|++||+|++....+++|- T Consensus 81 ~L~~Si~~~~~~~~v~vGt 99 (190) T protein:vir:99 81 HLRNLLRYQLDGSELLFGS 99 (190) T ss_pred HHHHHHhheecCcEEEEec Confidence 9999999999999999997 No 13 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=98.79 E-value=1.5e-11 Score=79.94 Aligned_cols=83 Identities=16% Similarity=0.234 Sum_probs=63.3 Q ss_pred HHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcC------CCCCCcHHHHHhcCCCCchhHHHHHHhhce Q lcl|NC_019457. 71 VASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDEL------RSPANAPSTVERKGADNPLVDTGEMKQSVT 144 (156) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppns~~Ti~~KG~~~PLidTG~L~~SIt 144 (156) +-+....+.+.+++... +....|..+|..++..+++.|.+. .|+|+||+|+++|+.++||+|||.|++||+ T Consensus 1 ~i~~~~~i~~~l~~l~~---~~~~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~~~~L~~tG~L~~Si~ 77 (145) T protein:vir:31 1 MVEDENNIPEAREAIQD---GLTDGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGSDTPLIDNSRLLTDIN 77 (145) T ss_pred CcccHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcCCCCCccCHHHHHHHH Confidence 44444455555554432 234578889999999999999863 588999999999999999999999999999 Q ss_pred eeec----ccccccCC Q lcl|NC_019457. 145 YNIQ----TGRPSEGL 156 (156) Q Consensus 145 y~V~----~~k~~~g~ 156 (156) |.+. +.-++.|- T Consensus 78 ~~~~~~~~~~~a~vGt 93 (145) T protein:vir:31 78 AASMMDRANRMAVIGT 93 (145) T ss_pred HHhhhcccCceeEecC Confidence 9984 33366666 No 14 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=98.77 E-value=1.4e-11 Score=80.14 Aligned_cols=90 Identities=12% Similarity=0.118 Sum_probs=73.6 Q ss_pred hhH--HHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcC---CCCCCcHHHHHhc-------------- Q lcl|NC_019457. 67 LDV--GVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDEL---RSPANAPSTVERK-------------- 127 (156) Q Consensus 67 lr~--~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~---~~ppns~~Ti~~K-------------- 127 (156) |-. .+.-+-+++.+.|.+......+...+|..||..+...+++.|.+. .|+|+||+|+++| T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~~ 80 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccch Confidence 322 121123567778888777777889999999999999999999876 5779999998643 Q ss_pred -------CCCCchhHHHHHHhhceeeecccccccCC Q lcl|NC_019457. 128 -------GADNPLVDTGEMKQSVTYNIQTGRPSEGL 156 (156) Q Consensus 128 -------G~~~PLidTG~L~~SIty~V~~~k~~~g~ 156 (156) +..++|+|||.|++||+|.+....+++|- T Consensus 81 ~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGt 116 (175) T protein:vir:79 81 AAASRRKAGLMILQDSGQMAASTATDSGEDYSVIGS 116 (175) T ss_pred hhHhhhccCCCcceechhhhhhhhheecCCEEEEec Confidence 46789999999999999999999999998 No 15 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=98.77 E-value=1.1e-11 Score=80.62 Aligned_cols=90 Identities=20% Similarity=0.217 Sum_probs=72.1 Q ss_pred hhHH--HHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcC--CCCCCcHHHHHh-----cCCCCchhHHH Q lcl|NC_019457. 67 LDVG--VASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDEL--RSPANAPSTVER-----KGADNPLVDTG 137 (156) Q Consensus 67 lr~~--~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~--~~ppns~~Ti~~-----KG~~~PLidTG 137 (156) |-.. +.-+...+.+.|.+......+...+|+.||..+...+++.|... .|+|+||+|+++ +|..++|+||| T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~~~~~~L~~tG 80 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAAVTDTLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGRGAHPILQVTN 80 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccCCCCCccccch Confidence 3221 12233456667766666656789999999999999999999754 789999999864 35678999999 Q ss_pred HHHhhceeeecccccccCC Q lcl|NC_019457. 138 EMKQSVTYNIQTGRPSEGL 156 (156) Q Consensus 138 ~L~~SIty~V~~~k~~~g~ 156 (156) .|++||+|.+....|.+|- T Consensus 81 ~L~~Si~~~~~~~~v~vGt 99 (155) T protein:vir:10 81 ALARSITTRADRDQAQIGS 99 (155) T ss_pred hhhhhhhceecCCEEEEec Confidence 9999999999999999998 No 16 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=98.74 E-value=1.9e-11 Score=79.38 Aligned_cols=90 Identities=18% Similarity=0.187 Sum_probs=72.3 Q ss_pred hhHH--HHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcC--CCCCCcHHHHHhc-----CCCCchhHHH Q lcl|NC_019457. 67 LDVG--VASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDEL--RSPANAPSTVERK-----GADNPLVDTG 137 (156) Q Consensus 67 lr~~--~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~--~~ppns~~Ti~~K-----G~~~PLidTG 137 (156) |-.. +.-+.+++.+.|.+......+...+|..||..+...+++.|... .|+|+||+|+++| +..++|+||| T Consensus 1 M~~~i~i~~d~~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~~tG 80 (155) T protein:vir:79 1 MTTRIDVELDDQEVRQRLAVLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQVTN 80 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccccch Confidence 3221 11123456677777666666789999999999999999999754 7899999999865 3568999999 Q ss_pred HHHhhceeeecccccccCC Q lcl|NC_019457. 138 EMKQSVTYNIQTGRPSEGL 156 (156) Q Consensus 138 ~L~~SIty~V~~~k~~~g~ 156 (156) .|++||+|++....+.+|- T Consensus 81 ~L~~Si~~~~~~~~v~vGt 99 (155) T protein:vir:79 81 ALARSVTTWADRNEAGIGS 99 (155) T ss_pred hhhhhhhceecCCEEEEec Confidence 9999999999999999997 No 17 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=98.69 E-value=3.5e-11 Score=77.90 Aligned_cols=90 Identities=18% Similarity=0.174 Sum_probs=72.4 Q ss_pred hhHH--HHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcC--CCCCCcHHHHHhc-----CCCCchhHHH Q lcl|NC_019457. 67 LDVG--VASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDEL--RSPANAPSTVERK-----GADNPLVDTG 137 (156) Q Consensus 67 lr~~--~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~--~~ppns~~Ti~~K-----G~~~PLidTG 137 (156) |..- +.-+.+++.+.|.+......+...+|..||..+...+++.|... .|+|+||+|+++| +..++|+||| T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~~tg 80 (155) T protein:vir:99 1 MTTRIDVELDDQEVRQRLALLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQVTN 80 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcchhch Confidence 3221 11233566777777776666789999999999999999999643 7889999999865 3467999999 Q ss_pred HHHhhceeeecccccccCC Q lcl|NC_019457. 138 EMKQSVTYNIQTGRPSEGL 156 (156) Q Consensus 138 ~L~~SIty~V~~~k~~~g~ 156 (156) .|++||+|.+....+++|- T Consensus 81 ~L~~Si~~~~~~~~v~vGt 99 (155) T protein:vir:99 81 ALARSVTTWADRNEAGIGS 99 (155) T ss_pred hhhhhhhceecCCEEEEec Confidence 9999999999999999997 No 18 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=98.64 E-value=6.7e-11 Score=76.38 Aligned_cols=89 Identities=11% Similarity=0.037 Sum_probs=68.9 Q ss_pred hhHH--HHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcC-------CCCCCcHHHHHhcC-----CCCc Q lcl|NC_019457. 67 LDVG--VASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDEL-------RSPANAPSTVERKG-----ADNP 132 (156) Q Consensus 67 lr~~--~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~-------~~ppns~~Ti~~KG-----~~~P 132 (156) |... +....+.+.+.|.+.. ...+...++..||..+...+++.|... .|+|++|+|+++|. ..+| T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~-~~~~~~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~~~~~ 79 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELG-TVTRDRAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFVPGSI 79 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHH-hhhccHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCCCCcc Confidence 3322 2223345666665543 333445799999999999999999853 58899999999873 3689 Q ss_pred hhHHHHHHhhceeeecccccccCC Q lcl|NC_019457. 133 LVDTGEMKQSVTYNIQTGRPSEGL 156 (156) Q Consensus 133 LidTG~L~~SIty~V~~~k~~~g~ 156 (156) |+|||.|++||+|.+....+++|- T Consensus 80 L~~tg~L~~Si~~~~~~~~v~vGt 103 (156) T protein:vir:19 80 LTLHGDLARSITTDYGQDYALIGS 103 (156) T ss_pred hhhhHHHHHHhhheecCCEEEEec Confidence 999999999999999999999998 No 19 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=98.59 E-value=1.1e-10 Score=75.11 Aligned_cols=104 Identities=14% Similarity=0.276 Sum_probs=59.1 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHhcC---------------------------------------------------CcE Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLNSD---------------------------------------------------EFV 29 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~~~---------------------------------------------------~~V 29 (156) .|.+++++++++.++|.+.-.++..+ ... T Consensus 4 ~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~~~~ 83 (164) T protein:vir:43 4 TVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGDLGF 83 (164) T ss_pred ceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccccceeE Confidence 45678887776655554332221100 001 Q ss_pred EEEeccCCCCCC------CCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHH Q lcl|NC_019457. 30 TVGIHEADNARP------EGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVA 103 (156) Q Consensus 30 ~VGi~~~~~~~~------~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~ 103 (156) .||+..+..... .++-+.+.++.++||||.++||||||||+++++++++.+.+.+.+...+ +.+|.+.+... T Consensus 84 ~vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT~km~a~PFlrPA~~~~k~~~~~~~~~~l~~~i--~ka~~k~~~~~ 161 (164) T protein:vir:43 84 RIGVLHGAVLPKKGERSDKTANAPTPHWRLLEFGTEDMRAQPFMRSALADNIAEVTSTFVSEYEKGI--DRAIKRAAKKA 161 (164) T ss_pred EecccccccccccccccccCCCCCcceEEEeecCCCCCCCCcchhhhHHHhHHHHHHHHHHHHHHHH--HHHHHHHHhhh Confidence 222222211110 0112335678899999999999999999999999988877776665443 24444444333 Q ss_pred HHH Q lcl|NC_019457. 104 VAG 106 (156) Q Consensus 104 ~~~ 106 (156) +.- T Consensus 162 ~~~ 164 (164) T protein:vir:43 162 AQG 164 (164) T ss_pred ccC Confidence 322 No 20 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=98.46 E-value=8.3e-10 Score=70.38 Aligned_cols=89 Identities=12% Similarity=0.203 Sum_probs=55.4 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHhc-------------------------------------------CCcEEEEeccCC Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLNS-------------------------------------------DEFVTVGIHEAD 37 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~~-------------------------------------------~~~V~VGi~~~~ 37 (156) |.++++++.++..++|.+.-.+... ...+.+|+..+. T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~~~~~~~~ 80 (140) T protein:vir:10 1 MSSVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDSPGIATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceecccccccccceeEEeecccc Confidence 9999999887766555432211100 001222222111 Q ss_pred CCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHH--------HHHHhccc Q lcl|NC_019457. 38 NARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTI--------AASLEDGE 90 (156) Q Consensus 38 ~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~--------~~~~~~~~ 90 (156) ... .++.+.+.++.+.||||.+.||+|||+|+++++++++.+.+ .+++.+|. T Consensus 81 ~~~-~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 81 KGK-ADSPNNAFYWRFVELGTQFMKAEPFMRPAFDASIAQAEGAIRTEIARAIDQVVGGGL 140 (140) T ss_pred ccc-cCCCCcccccceeccCcCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 100 01345678999999999999999999999999987776544 45556666 No 21 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=98.45 E-value=3.8e-10 Score=72.22 Aligned_cols=90 Identities=14% Similarity=0.126 Sum_probs=73.5 Q ss_pred hhHH--HHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcC---CCCCCcHHHHHh--------------- Q lcl|NC_019457. 67 LDVG--VASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDEL---RSPANAPSTVER--------------- 126 (156) Q Consensus 67 lr~~--~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~---~~ppns~~Ti~~--------------- 126 (156) |--. +.-..+++.+.|.+......+...+|..||..++..+++.|.+. .|+|.+|+|+++ T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~~~ 80 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhhhh Confidence 3221 11123567788888887777889999999999999999999876 567999999863 Q ss_pred ------cCCCCchhHHHHHHhhceeeecccccccCC Q lcl|NC_019457. 127 ------KGADNPLVDTGEMKQSVTYNIQTGRPSEGL 156 (156) Q Consensus 127 ------KG~~~PLidTG~L~~SIty~V~~~k~~~g~ 156 (156) ++..++|+|||.|++||+|.+.+..+++|- T Consensus 81 ~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGt 116 (175) T protein:vir:10 81 AAASRRKAGLMILQDSGQMAASVSTDHDDNSAVIGS 116 (175) T ss_pred hhhhhhccCCCcceechhhhhhhheeecCCEEEEec Confidence 346789999999999999999999999998 No 22 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=98.40 E-value=5.1e-10 Score=71.51 Aligned_cols=96 Identities=16% Similarity=0.221 Sum_probs=52.1 Q ss_pred Ceeeeec--CcHHHHHHHHHHHHHHhcC-----------------------------CcEEEE------------e--cc Q lcl|NC_019457. 1 MIKITVP--NFDAVRDELTKALNKLNSD-----------------------------EFVTVG------------I--HE 35 (156) Q Consensus 1 m~~v~~~--~~~~~~~~l~~~l~~l~~~-----------------------------~~V~VG------------i--~~ 35 (156) ||+++++ ++++..++|.++-.+...+ ..+.+. + .. T Consensus 1 mm~~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~~~ 80 (148) T protein:vir:93 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHIRG 80 (148) T ss_pred CcceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeeecc Confidence 8888666 5454444433221111000 001111 1 00 Q ss_pred CCCC-------CCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHhccccHHHHHHH Q lcl|NC_019457. 36 ADNA-------RPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAASLEDGEDISQLLNR 98 (156) Q Consensus 36 ~~~~-------~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 98 (156) .... ....+.+.+.++.+.||||.++||||||+|+++++++++.+.+.+.+...++ .+|.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~pa~PFl~pA~~~~k~~~~~~~~~~~~~~i~--k~~~k 148 (148) T protein:vir:93 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAID--EVLRR 148 (148) T ss_pred cccccccccceeecCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHH--HHhcC Confidence 0000 0001234567889999999999999999999999998888777666543321 22222 No 23 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=98.40 E-value=1.1e-09 Score=69.71 Aligned_cols=89 Identities=13% Similarity=0.239 Sum_probs=54.8 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHhcC-----------------------------CcEE--------------EEeccCC Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLNSD-----------------------------EFVT--------------VGIHEAD 37 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~~~-----------------------------~~V~--------------VGi~~~~ 37 (156) |.++++++.++..++|.+.-.+...+ ..+. ||+..+. T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~~~~~~~~ 80 (140) T protein:vir:80 1 MSSIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceeeeeeeccc Confidence 99999998777665554432111100 1111 2222111 Q ss_pred CCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHH--------hccc Q lcl|NC_019457. 38 NARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAASL--------EDGE 90 (156) Q Consensus 38 ~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~--------~~~~ 90 (156) .. ..++.+.+.++.+.||||.++||+|||+|+++.+++++.+.+++.+ .|.. T Consensus 81 ~~-~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:80 81 KG-KADSPSNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDQALGGRR 140 (140) T ss_pred cc-ccCCCCCcceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 10 0123456789999999999999999999999999988766655443 3333 No 24 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=98.38 E-value=1.4e-09 Score=69.12 Aligned_cols=89 Identities=17% Similarity=0.279 Sum_probs=54.4 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHhc-------------------------------------------CCcEEEEeccCC Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLNS-------------------------------------------DEFVTVGIHEAD 37 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~~-------------------------------------------~~~V~VGi~~~~ 37 (156) |.++++++.++..++|.+.-.+... ...+.||+..+. T Consensus 1 M~~~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~vg~~~~~ 80 (140) T protein:vir:14 1 MSSIQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccceeEEeeeeecc Confidence 9999999877766555542111100 011223332221 Q ss_pred CCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHH--------Hhccc Q lcl|NC_019457. 38 NARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAAS--------LEDGE 90 (156) Q Consensus 38 ~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~--------~~~~~ 90 (156) ... .++-+.+.++.+.||||.++||||||+|+++++++++.+.+.+. +.|.. T Consensus 81 ~~~-~~~~~~~~y~~f~E~GT~~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:14 81 KGK-ADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred ccc-cCCCCccceeeeeccccCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 111 11234578899999999999999999999999988776655443 33333 No 25 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=98.36 E-value=1.4e-09 Score=69.06 Aligned_cols=89 Identities=17% Similarity=0.268 Sum_probs=53.2 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHhcC-------------------------------------------CcEEEEeccCC Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLNSD-------------------------------------------EFVTVGIHEAD 37 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~~~-------------------------------------------~~V~VGi~~~~ 37 (156) |.++++++.++..+.|.+.-.+...+ ..+.||+.... T Consensus 1 Ma~~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~~g~~~~~ 80 (140) T protein:vir:10 1 MSSIQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccccccceEEeeeeecc Confidence 99999998877665554322111000 01122221111 Q ss_pred CCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHH--------HHhccc Q lcl|NC_019457. 38 NARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAA--------SLEDGE 90 (156) Q Consensus 38 ~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~--------~~~~~~ 90 (156) .. ..++-+.+.++.+.||||.++||+|||+|+++++++++.+.+.+ ++.|.. T Consensus 81 ~~-~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 81 KG-KADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred cc-ccCCCCccceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 00 01123457789999999999999999999999999877665544 333433 No 26 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=98.35 E-value=5.6e-10 Score=71.31 Aligned_cols=104 Identities=12% Similarity=0.254 Sum_probs=51.8 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHhcC------------------C---------------------------------cE Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLNSD------------------E---------------------------------FV 29 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~~~------------------~---------------------------------~V 29 (156) .|++++++++++.++|.++-.++..+ . .+ T Consensus 4 ~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~~~~ 83 (179) T protein:vir:18 4 SVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGDLAF 83 (179) T ss_pred eEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccccceeE Confidence 45677777766555444322111100 0 02 Q ss_pred EEEeccCCCCC---------------------CCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 30 TVGIHEADNAR---------------------PEGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 30 ~VGi~~~~~~~---------------------~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (156) .||+..+.... ...+-..+.++.+.||||.++||||||||+++++++++.+.|...+. T Consensus 84 ~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~~~a~~~i~~~l~- 162 (179) T protein:vir:18 84 RVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVDNDVINVFSTEMG- 162 (179) T ss_pred eeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccchhhHHhhHHHHHHHHHHHHH- Confidence 23332221100 00112345677889999999999999999999998776655543332 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCC Q lcl|NC_019457. 89 GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADN 131 (156) Q Consensus 89 ~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~ 131 (156) ..|+..+..+.. ||-+- T Consensus 163 ----------------~~i~k~lk~~~~----------~~~~~ 179 (179) T protein:vir:18 163 ----------------KAIDRAIRLAMK----------KGTTA 179 (179) T ss_pred ----------------HHHHHHHHhhcc----------cCCCC Confidence 222222221110 11000 No 27 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=98.34 E-value=2.4e-09 Score=67.82 Aligned_cols=81 Identities=17% Similarity=0.327 Sum_probs=55.5 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHh----------------------c------------------------CCcEEEEec Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLN----------------------S------------------------DEFVTVGIH 34 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~----------------------~------------------------~~~V~VGi~ 34 (156) |..+++++++.+.+.|.+.-.+.. . ...|.||+- T Consensus 1 M~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~~k~~ap~~~~~tg~l~~~I~~~~~k~~~~g~~~v~Vg~~ 80 (127) T protein:vir:12 1 MADMSFDGIDDLTQYFEKIGGDIEKVEPVALKAGGEIIAERQRSHVNRSDKKQPHMQDNITVSNVRESKDGVRFVAVGPN 80 (127) T ss_pred CeeeeehhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCChhHHHHhhhccccccccCceeEEEEeeC Confidence 888888887766555543211110 0 001112211 Q ss_pred cCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019457. 35 EADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAASLEDGED 91 (156) Q Consensus 35 ~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~ 91 (156) -+.+.++.+.||||.+.||||||+|+++++++++.+.+.+.+...+- T Consensus 81 ----------~~~~~y~~f~E~GT~~~~a~Pf~~pa~~~~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 81 ----------KKVAYRGRFLEWGTSKMPPQPFIEKGGKEGEGPAVELMERILTAPIK 127 (127) T ss_pred ----------CCCcceeeeeccCccCCCCCccchHhHHHHHHHHHHHHHHHHHHhcC Confidence 23467889999999999999999999999999998888877766555 No 28 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=98.29 E-value=1.1e-09 Score=69.66 Aligned_cols=88 Identities=16% Similarity=0.244 Sum_probs=57.4 Q ss_pred CeeeeecCcHHHHHHHHHH-----HHH--------Hh----cCCcEEEEeccCCCC------CCCCC---CCHHHHHHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKA-----LNK--------LN----SDEFVTVGIHEADNA------RPEGV---LTNAQLGAIQ 54 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~-----l~~--------l~----~~~~V~VGi~~~~~~------~~~~g---~~~A~ia~~~ 54 (156) |.+|++++.++..++|.+. +++ +. ......+++..|.-. ..++| .+.+.+|.++ T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~Ya~~v 80 (114) T protein:vir:27 1 MATIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDKATVEALTSYSGYL 80 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCeeEecCCCCcccee Confidence 9999999887766666432 111 00 000001111111100 01111 3457899999 Q ss_pred hcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 55 HFGNDRIPARPWLDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 55 E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (156) ||||...||||||||+++.++.++.+.|++.+.- T Consensus 81 EfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 81 EVGTRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred cccccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 9999999999999999999999999999988877 No 29 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=98.29 E-value=1.1e-09 Score=69.66 Aligned_cols=88 Identities=16% Similarity=0.244 Sum_probs=57.4 Q ss_pred CeeeeecCcHHHHHHHHHH-----HHH--------Hh----cCCcEEEEeccCCCC------CCCCC---CCHHHHHHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKA-----LNK--------LN----SDEFVTVGIHEADNA------RPEGV---LTNAQLGAIQ 54 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~-----l~~--------l~----~~~~V~VGi~~~~~~------~~~~g---~~~A~ia~~~ 54 (156) |.+|++++.++..++|.+. +++ +. ......+++..|.-. ..++| .+.+.+|.++ T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~Ya~~v 80 (114) T protein:vir:49 1 MATIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDKATVEALTSYSGYL 80 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCeeEecCCCCcccee Confidence 9999999887766666432 111 00 000001111111100 01111 3457899999 Q ss_pred hcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 55 HFGNDRIPARPWLDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 55 E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (156) ||||...||||||||+++.++.++.+.|++.+.- T Consensus 81 EfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 81 EVGTRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred cccccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 9999999999999999999999999999988877 No 30 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=98.29 E-value=1.3e-09 Score=69.37 Aligned_cols=90 Identities=9% Similarity=0.083 Sum_probs=59.5 Q ss_pred eeeeecCcHHHHHHHHHHHHHHhc-------------------CCcEEEEeccCCCCCCCCC-----CCHHHHHHHHhcC Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKALNKLNS-------------------DEFVTVGIHEADNARPEGV-----LTNAQLGAIQHFG 57 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l~~l~~-------------------~~~V~VGi~~~~~~~~~~g-----~~~A~ia~~~E~G 57 (156) |++++++.+...+.|.+...+... .-.|.-|.+..+-....+| .+.+.+|.+.||| T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~g~~~~V~~~~~Ya~yvE~G 80 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHITTSYPGMEAHIHGEAGYDGYQEYG 80 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhceeeecCceEEEeecCCCccceeecC Confidence 999998877665555433322110 0011123332221111111 2457789999999 Q ss_pred CCCCCCchhhhHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019457. 58 NDRIPARPWLDVGVASVNDEILDTIAASLEDGED 91 (156) Q Consensus 58 ~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~ 91 (156) |...|+||||+|+++.++.++.+.|++.+.+++- T Consensus 81 T~~~~aqPfl~pa~~~~~~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 81 TRFQPGTPHFRPMMEQIQPQFQKDMTDVMKGAFK 114 (114) T ss_pred ccccCCCccchhhHHHHHHHHHHHHHHHHHhhcC Confidence 9999999999999999999999988888877766 No 31 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=98.17 E-value=5e-09 Score=66.11 Aligned_cols=80 Identities=11% Similarity=0.111 Sum_probs=53.5 Q ss_pred eeeeecCcHHHHHHHHHHHHHHhc------------------------------------------------CCcEEEEe Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKALNKLNS------------------------------------------------DEFVTVGI 33 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l~~l~~------------------------------------------------~~~V~VGi 33 (156) |++++++++...+.|.+.-.+... ...+.||+ T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~~~h~~d~I~~~~~k~~~g~~~~~VG~ 80 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGVAKEARAAVRDGAQKFADKLKSNTPEWDGETDMSGHLRDDIKLSSVRETSGLTEVDVGY 80 (128) T ss_pred CccchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCCCcccchhhhhhccccccccCceeEEEeee Confidence 788887766655554433211100 00122222 Q ss_pred ccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019457. 34 HEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAASLEDGED 91 (156) Q Consensus 34 ~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~ 91 (156) . -+.+.++.+.||||.+.||+|||+++++++++++.+.+.+.+..++= T Consensus 81 ~----------k~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 81 G----------KDTGWRAHFPNSGTSMQDPQHFIEETQEIMRPVVIAAFLSHLKEGGM 128 (128) T ss_pred c----------CCCceEEeeeccCccCCCCCcchhHHHHHhHHHHHHHHHHHHHhhcC Confidence 1 12356789999999999999999999999999999988887765543 No 32 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=98.17 E-value=2.2e-09 Score=68.03 Aligned_cols=93 Identities=18% Similarity=0.299 Sum_probs=52.5 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHhcC---Cc-----------E--EEEeccCCCC-------------CCC--C------ Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLNSD---EF-----------V--TVGIHEADNA-------------RPE--G------ 43 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~~~---~~-----------V--~VGi~~~~~~-------------~~~--~------ 43 (156) ||++++++.+...++|.+.-.+...+ +. + .+.+-.++.+ ..+ + T Consensus 1 M~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~~~~~~v~ 80 (133) T protein:vir:10 1 MIRMEVKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQGNAVVTLR 80 (133) T ss_pred CeeEeeehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCccceEEEE Confidence 99999999877655444322111000 00 0 0111111100 000 0 Q ss_pred -CC--CHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHhccccHH Q lcl|NC_019457. 44 -VL--TNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAASLEDGEDIS 93 (156) Q Consensus 44 -g~--~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~ 93 (156) |. +...++.+.||||.+.||||||+|+++++++++.+.+.+.+...++-. T Consensus 81 vg~~~~~~~y~~f~E~GT~k~~a~PF~~pA~~~~~~~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 81 VGPSKQHHMKVLAQEFGTVKQVADPFIRPALDYNVQTVLRVLTVEIRNGIQNR 133 (133) T ss_pred ecCCCCccceEeeeccCCCCCCCCccchHHHHHhHHHHHHHHHHHHHHHhhcC Confidence 11 112234455999999999999999999999988888777665544433 No 33 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=98.17 E-value=3.9e-09 Score=66.70 Aligned_cols=87 Identities=18% Similarity=0.296 Sum_probs=52.0 Q ss_pred CeeeeecCcHHHHHHHHHHHHH----------------------Hhc--------------------------------- Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNK----------------------LNS--------------------------------- 25 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~----------------------l~~--------------------------------- 25 (156) .|+++++++++..++|.++-.+ ... T Consensus 4 ~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~ 83 (146) T protein:vir:10 4 GIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEG 83 (146) T ss_pred ceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccccccc Confidence 4567788877665555422111 000 Q ss_pred -CCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHhccccHHHHH Q lcl|NC_019457. 26 -DEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAASLEDGEDISQLL 96 (156) Q Consensus 26 -~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~~~l 96 (156) ...|.||+-.. .-+.+.++.+.||||.+.||+|||+|+++++++++.+.+.+.+...+ ..+| T Consensus 84 g~~~~~vg~~~~-------~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l--~ka~ 146 (146) T protein:vir:10 84 GIKTVKIGLNKA-------DRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEM--RLDL 146 (146) T ss_pred cceeEEeeeccC-------CCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH--hhcC Confidence 00111222110 01346788999999999999999999999999888877766654433 1122 No 34 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=98.17 E-value=3.9e-09 Score=66.70 Aligned_cols=87 Identities=18% Similarity=0.296 Sum_probs=52.0 Q ss_pred CeeeeecCcHHHHHHHHHHHHH----------------------Hhc--------------------------------- Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNK----------------------LNS--------------------------------- 25 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~----------------------l~~--------------------------------- 25 (156) .|+++++++++..++|.++-.+ ... T Consensus 4 ~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~ 83 (146) T protein:vir:10 4 GIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEG 83 (146) T ss_pred ceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccccccc Confidence 4567788877665555422111 000 Q ss_pred -CCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHhccccHHHHH Q lcl|NC_019457. 26 -DEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAASLEDGEDISQLL 96 (156) Q Consensus 26 -~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~~~l 96 (156) ...|.||+-.. .-+.+.++.+.||||.+.||+|||+|+++++++++.+.+.+.+...+ ..+| T Consensus 84 g~~~~~vg~~~~-------~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l--~ka~ 146 (146) T protein:vir:10 84 GIKTVKIGLNKA-------DRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEM--RLDL 146 (146) T ss_pred cceeEEeeeccC-------CCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH--hhcC Confidence 00111222110 01346788999999999999999999999999888877766654433 1122 No 35 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=98.17 E-value=3.9e-09 Score=66.70 Aligned_cols=87 Identities=18% Similarity=0.296 Sum_probs=52.0 Q ss_pred CeeeeecCcHHHHHHHHHHHHH----------------------Hhc--------------------------------- Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNK----------------------LNS--------------------------------- 25 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~----------------------l~~--------------------------------- 25 (156) .|+++++++++..++|.++-.+ ... T Consensus 4 ~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~ 83 (146) T protein:vir:10 4 GIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEG 83 (146) T ss_pred ceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccccccc Confidence 4567788877665555422111 000 Q ss_pred -CCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHhccccHHHHH Q lcl|NC_019457. 26 -DEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAASLEDGEDISQLL 96 (156) Q Consensus 26 -~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~~~l 96 (156) ...|.||+-.. .-+.+.++.+.||||.+.||+|||+|+++++++++.+.+.+.+...+ ..+| T Consensus 84 g~~~~~vg~~~~-------~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l--~ka~ 146 (146) T protein:vir:10 84 GIKTVKIGLNKA-------DRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEM--RLDL 146 (146) T ss_pred cceeEEeeeccC-------CCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH--hhcC Confidence 00111222110 01346788999999999999999999999999888877766654433 1122 No 36 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=98.17 E-value=3.9e-09 Score=66.70 Aligned_cols=87 Identities=18% Similarity=0.296 Sum_probs=52.0 Q ss_pred CeeeeecCcHHHHHHHHHHHHH----------------------Hhc--------------------------------- Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNK----------------------LNS--------------------------------- 25 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~----------------------l~~--------------------------------- 25 (156) .|+++++++++..++|.++-.+ ... T Consensus 4 ~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~ 83 (146) T protein:vir:10 4 GIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKLEG 83 (146) T ss_pred ceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccccccc Confidence 4567788877665555422111 000 Q ss_pred -CCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHhccccHHHHH Q lcl|NC_019457. 26 -DEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAASLEDGEDISQLL 96 (156) Q Consensus 26 -~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~~~l 96 (156) ...|.||+-.. .-+.+.++.+.||||.+.||+|||+|+++++++++.+.+.+.+...+ ..+| T Consensus 84 g~~~~~vg~~~~-------~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l--~ka~ 146 (146) T protein:vir:10 84 GIKTVKIGLNKA-------DRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEM--RLDL 146 (146) T ss_pred cceeEEeeeccC-------CCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH--hhcC Confidence 00111222110 01346788999999999999999999999999888877766654433 1122 No 37 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=98.16 E-value=4.5e-09 Score=66.36 Aligned_cols=84 Identities=13% Similarity=0.245 Sum_probs=52.4 Q ss_pred eeecCcHHHHHHHHHHHHHHh---------------------cC----CcEEEEeccCCCC-CCCCC-----CCHHHHHH Q lcl|NC_019457. 4 ITVPNFDAVRDELTKALNKLN---------------------SD----EFVTVGIHEADNA-RPEGV-----LTNAQLGA 52 (156) Q Consensus 4 v~~~~~~~~~~~l~~~l~~l~---------------------~~----~~V~VGi~~~~~~-~~~~g-----~~~A~ia~ 52 (156) |++++.+...++|.+.-.+.. .. ..|.-|.+.++-. ..++| .+.+++|. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 666666665554432111100 00 0111222222210 11122 24578999 Q ss_pred HHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019457. 53 IQHFGNDRIPARPWLDVGVASVNDEILDTIAASLE 87 (156) Q Consensus 53 ~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 87 (156) +.||||...|+||||+|+++.++.++.+.|++++. T Consensus 81 ~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 81 FLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999999999999999999999999999999998 No 38 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=98.16 E-value=4.5e-09 Score=66.36 Aligned_cols=84 Identities=13% Similarity=0.245 Sum_probs=52.4 Q ss_pred eeecCcHHHHHHHHHHHHHHh---------------------cC----CcEEEEeccCCCC-CCCCC-----CCHHHHHH Q lcl|NC_019457. 4 ITVPNFDAVRDELTKALNKLN---------------------SD----EFVTVGIHEADNA-RPEGV-----LTNAQLGA 52 (156) Q Consensus 4 v~~~~~~~~~~~l~~~l~~l~---------------------~~----~~V~VGi~~~~~~-~~~~g-----~~~A~ia~ 52 (156) |++++.+...++|.+.-.+.. .. ..|.-|.+.++-. ..++| .+.+++|. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 666666665554432111100 00 0111222222210 11122 24578999 Q ss_pred HHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019457. 53 IQHFGNDRIPARPWLDVGVASVNDEILDTIAASLE 87 (156) Q Consensus 53 ~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 87 (156) +.||||...|+||||+|+++.++.++.+.|++++. T Consensus 81 ~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 81 FLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999999999999999999999999999999998 No 39 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=98.16 E-value=4.5e-09 Score=66.36 Aligned_cols=84 Identities=13% Similarity=0.245 Sum_probs=52.4 Q ss_pred eeecCcHHHHHHHHHHHHHHh---------------------cC----CcEEEEeccCCCC-CCCCC-----CCHHHHHH Q lcl|NC_019457. 4 ITVPNFDAVRDELTKALNKLN---------------------SD----EFVTVGIHEADNA-RPEGV-----LTNAQLGA 52 (156) Q Consensus 4 v~~~~~~~~~~~l~~~l~~l~---------------------~~----~~V~VGi~~~~~~-~~~~g-----~~~A~ia~ 52 (156) |++++.+...++|.+.-.+.. .. ..|.-|.+.++-. ..++| .+.+++|. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 666666665554432111100 00 0111222222210 11122 24578999 Q ss_pred HHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019457. 53 IQHFGNDRIPARPWLDVGVASVNDEILDTIAASLE 87 (156) Q Consensus 53 ~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 87 (156) +.||||...|+||||+|+++.++.++.+.|++++. T Consensus 81 ~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 81 FLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999999999999999999999999999999998 No 40 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=98.16 E-value=4.5e-09 Score=66.36 Aligned_cols=84 Identities=13% Similarity=0.245 Sum_probs=52.4 Q ss_pred eeecCcHHHHHHHHHHHHHHh---------------------cC----CcEEEEeccCCCC-CCCCC-----CCHHHHHH Q lcl|NC_019457. 4 ITVPNFDAVRDELTKALNKLN---------------------SD----EFVTVGIHEADNA-RPEGV-----LTNAQLGA 52 (156) Q Consensus 4 v~~~~~~~~~~~l~~~l~~l~---------------------~~----~~V~VGi~~~~~~-~~~~g-----~~~A~ia~ 52 (156) |++++.+...++|.+.-.+.. .. ..|.-|.+.++-. ..++| .+.+++|. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 666666665554432111100 00 0111222222210 11122 24578999 Q ss_pred HHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019457. 53 IQHFGNDRIPARPWLDVGVASVNDEILDTIAASLE 87 (156) Q Consensus 53 ~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 87 (156) +.||||...|+||||+|+++.++.++.+.|++++. T Consensus 81 ~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 81 FLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999999999999999999999999999999998 No 41 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=98.16 E-value=4.5e-09 Score=66.36 Aligned_cols=84 Identities=13% Similarity=0.245 Sum_probs=52.4 Q ss_pred eeecCcHHHHHHHHHHHHHHh---------------------cC----CcEEEEeccCCCC-CCCCC-----CCHHHHHH Q lcl|NC_019457. 4 ITVPNFDAVRDELTKALNKLN---------------------SD----EFVTVGIHEADNA-RPEGV-----LTNAQLGA 52 (156) Q Consensus 4 v~~~~~~~~~~~l~~~l~~l~---------------------~~----~~V~VGi~~~~~~-~~~~g-----~~~A~ia~ 52 (156) |++++.+...++|.+.-.+.. .. ..|.-|.+.++-. ..++| .+.+++|. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 666666665554432111100 00 0111222222210 11122 24578999 Q ss_pred HHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019457. 53 IQHFGNDRIPARPWLDVGVASVNDEILDTIAASLE 87 (156) Q Consensus 53 ~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 87 (156) +.||||...|+||||+|+++.++.++.+.|++++. T Consensus 81 ~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 81 FLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999999999999999999999999999999998 No 42 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=98.16 E-value=4.5e-09 Score=66.36 Aligned_cols=84 Identities=13% Similarity=0.245 Sum_probs=52.4 Q ss_pred eeecCcHHHHHHHHHHHHHHh---------------------cC----CcEEEEeccCCCC-CCCCC-----CCHHHHHH Q lcl|NC_019457. 4 ITVPNFDAVRDELTKALNKLN---------------------SD----EFVTVGIHEADNA-RPEGV-----LTNAQLGA 52 (156) Q Consensus 4 v~~~~~~~~~~~l~~~l~~l~---------------------~~----~~V~VGi~~~~~~-~~~~g-----~~~A~ia~ 52 (156) |++++.+...++|.+.-.+.. .. ..|.-|.+.++-. ..++| .+.+++|. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 666666665554432111100 00 0111222222210 11122 24578999 Q ss_pred HHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019457. 53 IQHFGNDRIPARPWLDVGVASVNDEILDTIAASLE 87 (156) Q Consensus 53 ~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 87 (156) +.||||...|+||||+|+++.++.++.+.|++++. T Consensus 81 ~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 81 FLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999999999999999999999999999999998 No 43 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=98.16 E-value=3.5e-09 Score=66.97 Aligned_cols=96 Identities=19% Similarity=0.306 Sum_probs=51.0 Q ss_pred Ceeeeec--CcHHHHHHHHHH---HH-HHhcC-------------------------CcEE---------------EEec Q lcl|NC_019457. 1 MIKITVP--NFDAVRDELTKA---LN-KLNSD-------------------------EFVT---------------VGIH 34 (156) Q Consensus 1 m~~v~~~--~~~~~~~~l~~~---l~-~l~~~-------------------------~~V~---------------VGi~ 34 (156) ||.++++ ++++..++|.+. +. +.... ..+. |++. T Consensus 1 mm~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~~~ 80 (149) T protein:vir:19 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) T ss_pred CcceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeeccccc Confidence 9988777 445543333322 11 11000 0000 0000 Q ss_pred cCCC--CCC-----CCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHhccccHHHHHHH Q lcl|NC_019457. 35 EADN--ARP-----EGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAASLEDGEDISQLLNR 98 (156) Q Consensus 35 ~~~~--~~~-----~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 98 (156) .... ... ..+-+.+.++.+.||||.++||+|||+|+++++++++.+.+.+.+...+ +.++.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PF~~pA~~~~k~~~~~~~~~~l~~~l--~k~~~k 149 (149) T protein:vir:19 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAI--DEVLSK 149 (149) T ss_pred ccccccccccceeecCCCCccceeeeeccCCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHH--HHHhcC Confidence 0000 000 0012346788999999999999999999999999988877766554322 122222 No 44 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=98.16 E-value=4e-09 Score=66.61 Aligned_cols=84 Identities=10% Similarity=0.140 Sum_probs=51.6 Q ss_pred eeecCcHHHHHHHHHHHHH----------------------Hhc---CCcEEEEeccCCCCC-CCCC-----CCHHHHHH Q lcl|NC_019457. 4 ITVPNFDAVRDELTKALNK----------------------LNS---DEFVTVGIHEADNAR-PEGV-----LTNAQLGA 52 (156) Q Consensus 4 v~~~~~~~~~~~l~~~l~~----------------------l~~---~~~V~VGi~~~~~~~-~~~g-----~~~A~ia~ 52 (156) |++++.+...++|.+.-++ +.. ...|.-|.+..+-.. .+++ .+.+.+|. T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKKIGDLHYRVISTAHYSG 80 (115) T ss_pred CeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEeeCCCccch Confidence 6666665544444321110 000 000111222222100 0111 34577999 Q ss_pred HHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019457. 53 IQHFGNDRIPARPWLDVGVASVNDEILDTIAASLE 87 (156) Q Consensus 53 ~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 87 (156) +.||||...|+||||+|+++.++.++.+.|++++. T Consensus 81 ~vEfGT~km~a~PFl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 81 FLEFGTRYMEPAPFMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred heecccccCCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 99999999999999999999999999999999998 No 45 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=98.14 E-value=2.6e-09 Score=67.65 Aligned_cols=86 Identities=17% Similarity=0.265 Sum_probs=55.3 Q ss_pred CeeeeecCcHHHHHHHHHH--HHH-----------Hh----cC----CcEEEEeccCCCCCCCCC-----CCHHHHHHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKA--LNK-----------LN----SD----EFVTVGIHEADNARPEGV-----LTNAQLGAIQ 54 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~--l~~-----------l~----~~----~~V~VGi~~~~~~~~~~g-----~~~A~ia~~~ 54 (156) |.+|++++.++..++|.+. ..+ +. .. .-|.-|....+-...++| .+.+.+|.+. T Consensus 1 Ma~i~i~Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~~~~~~~~v~~~~~Ya~~v 80 (112) T protein:vir:96 1 MATIEFEGLDEMAQSLLKNASSERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLEAGSDRAVVEALTNYSGYL 80 (112) T ss_pred CceeeehHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeeecCceEEEecCCCCcccee Confidence 9999999887766655432 111 00 00 011112222211111223 2347899999 Q ss_pred hcCCCCCCCchhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019457. 55 HFGNDRIPARPWLDVGVASVNDEILDTIAASL 86 (156) Q Consensus 55 E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~ 86 (156) ||||...|+||||+|+++.++.++.+.|++.- T Consensus 81 E~GTr~m~AqPF~~PA~~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 81 EVGTRKMEAQPFMRPALDQVVPEMVEEMAKWE 112 (112) T ss_pred ccCccccCCCCchhhhHHHHHHHHHHHHHhcC Confidence 99999999999999999999999988887665 No 46 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=97.98 E-value=5.3e-08 Score=60.49 Aligned_cols=95 Identities=14% Similarity=0.241 Sum_probs=51.3 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHhcC------------------CcEEE------EeccCCC----CCCCC--------- Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLNSD------------------EFVTV------GIHEADN----ARPEG--------- 43 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~~~------------------~~V~V------Gi~~~~~----~~~~~--------- 43 (156) |+++++++.+...++|.+.-.+...+ ..+-| |-..++- ...+. T Consensus 2 ~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~~~~~~~v~v~v 81 (135) T protein:vir:57 2 IPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRGKAGSTVVVLRV 81 (135) T ss_pred ceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhcccccccccccceeEEEEe Confidence 88888888777665554332211100 00000 1100000 00000 Q ss_pred CCCHHH--HHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHH Q lcl|NC_019457. 44 VLTNAQ--LGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAASLEDGEDISQLLNRVGV 101 (156) Q Consensus 44 g~~~A~--ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~ 101 (156) |.+... ++.+.||||.+.||||||+|+++++++++.+.+.+.+... |++++. T Consensus 82 g~~~~~~~~~~f~E~GT~~~~a~PF~~pa~~~~~~~~~~~~~~~~~~~------l~ka~r 135 (135) T protein:vir:57 82 GPTRSHYMKALAQEFGTIKQVAKPFIRPALDYNKMQVLRILTVEIRDG------LSTLSR 135 (135) T ss_pred cCCCCcceeEeecccCCCCCCCCcchhHhHHHhHHHHHHHHHHHHHHH------HHHhcC Confidence 111122 2344599999999999999999999999888877666432 223333 No 47 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=97.96 E-value=1.5e-08 Score=63.49 Aligned_cols=84 Identities=12% Similarity=0.196 Sum_probs=52.2 Q ss_pred eeecCcHHHHHHHHHH-----------HHH----Hh------cC----CcEEEEeccCCCC-CCCCC-----CCHHHHHH Q lcl|NC_019457. 4 ITVPNFDAVRDELTKA-----------LNK----LN------SD----EFVTVGIHEADNA-RPEGV-----LTNAQLGA 52 (156) Q Consensus 4 v~~~~~~~~~~~l~~~-----------l~~----l~------~~----~~V~VGi~~~~~~-~~~~g-----~~~A~ia~ 52 (156) |++++.+...++|.+. +++ +. .. ..|.-|.+..+-. ..++| .+.+.+|. T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~~g~~~~~V~~~~~Ya~ 80 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTVDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEecCCccccc Confidence 6666666655444321 110 00 00 0111122222210 01122 34578999 Q ss_pred HHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019457. 53 IQHFGNDRIPARPWLDVGVASVNDEILDTIAASLE 87 (156) Q Consensus 53 ~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 87 (156) +.||||...|+||||+|+++.++.++.+.|+++++ T Consensus 81 ~vE~GT~~m~a~PFl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 81 FLEFGTRYMEAEPFMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred cccccccccCCCCcchhhHHHHHHHHHHHHHHHhC Confidence 99999999999999999999999999999999998 No 48 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=97.89 E-value=8.2e-08 Score=59.44 Aligned_cols=85 Identities=8% Similarity=0.035 Sum_probs=66.3 Q ss_pred HHHHHHHHHHHHHHHHhc--cccHHHHHHHHHHHHHHHHHHHHHcC------CCCCCcHHHHHhcCC--CCchhHHHHHH Q lcl|NC_019457. 71 VASVNDEILDTIAASLED--GEDISQLLNRVGVVAVAGVQNYIDEL------RSPANAPSTVERKGA--DNPLVDTGEMK 140 (156) Q Consensus 71 ~~~~~~~~~~~~~~~~~~--~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppns~~Ti~~KG~--~~PLidTG~L~ 140 (156) +++ -.++.+.|...+.. ..+....|..||..+....++.|.+. .|+|+++.|+++|+. .+||+++|.|. T Consensus 1 m~d-~~~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~g~l~ 79 (149) T protein:vir:98 1 MSE-LTALQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGRIRREMFARLRTN 79 (149) T ss_pred Cch-HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCCCCcccchhhhhh Confidence 222 23444455554433 23457799999999999999999874 688999999998874 58999999999 Q ss_pred hhceeeecccccccCC Q lcl|NC_019457. 141 QSVTYNIQTGRPSEGL 156 (156) Q Consensus 141 ~SIty~V~~~k~~~g~ 156 (156) +||+|.+....+.+|. T Consensus 80 ~sl~~~~~~~~~~V~~ 95 (149) T protein:vir:98 80 RFMKAKGSDSAAVVEF 95 (149) T ss_pred hhhhheecCCeeEEEe Confidence 9999999999998854 No 49 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=97.88 E-value=8.5e-09 Score=64.83 Aligned_cols=89 Identities=10% Similarity=0.129 Sum_probs=45.0 Q ss_pred Ceeeeec--------------CcHHHHHHHHH-HHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCch Q lcl|NC_019457. 1 MIKITVP--------------NFDAVRDELTK-ALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARP 65 (156) Q Consensus 1 m~~v~~~--------------~~~~~~~~l~~-~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~Rp 65 (156) .++-++. ......+.+.- .+..-.....|.||+..+.+ +.+.++.+.||||.+.||+| T Consensus 46 ~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~~~~~g~~~~~VG~~~~~~-------~~~~y~~f~E~GT~k~~a~p 118 (149) T protein:vir:13 46 TVAPLIHISDDNSKSGRKGSRPPGHAANNIPEPKIRKKKGNLQCVVGWEKSDN-------TPFYYMKMEEWGTSERPPHH 118 (149) T ss_pred HHHHhCCccCCccccccccccccchhhhcceecccccccceeEEEeeccCCCC-------CccceeeeeccCccCCCCCc Confidence 0000000 00000111100 00111112246788764322 24678999999999999999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_019457. 66 WLDVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDE 113 (156) Q Consensus 66 Flr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~ 113 (156) ||||+++++++++.+.+...+... |++.+.+ T Consensus 119 F~~pa~~~~~~~~~~~~~~~l~k~-----------------i~~~lG~ 149 (149) T protein:vir:13 119 AFGKTNKILKRVYDNIAQKKYDNF-----------------VKEKLGD 149 (149) T ss_pred cchHHHHHHHHHHHHHHHHHHHHH-----------------HHHHhcC Confidence 999999999988876665443211 1111111 No 50 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=97.87 E-value=1.5e-08 Score=63.55 Aligned_cols=82 Identities=10% Similarity=0.104 Sum_probs=49.6 Q ss_pred CeeeeecCc-HHHHHHHHHHHHH--H----hcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHH Q lcl|NC_019457. 1 MIKITVPNF-DAVRDELTKALNK--L----NSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVAS 73 (156) Q Consensus 1 m~~v~~~~~-~~~~~~l~~~l~~--l----~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~ 73 (156) .++-++... ...-..|.+.+.- . .+...|.|||..+ .+.++.+.||||.+.||+|||++++++ T Consensus 37 ~~k~~ap~~~~~~~~hl~d~I~~~~~k~~~~g~~~~~VG~~k~----------~~~y~~f~E~GT~k~~~~pF~~pa~~~ 106 (125) T protein:vir:97 37 ALKANTPVYEVETDERLQEDTVISGFKGANVGIVSKEIGYGKA----------TGWRAHYPNDGTIYQRGQDFKERTINQ 106 (125) T ss_pred HHHHhCCcCCCCchhhHHhhhhcccccccccCceEEEEeecCC----------CceeEeeeccCccCCCcCccchHhHHH Confidence 111111100 0000112222211 0 0112456776322 356899999999999999999999999 Q ss_pred HHHHHHHHHHHHHhccccH Q lcl|NC_019457. 74 VNDEILDTIAASLEDGEDI 92 (156) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~ 92 (156) +++++.+.+.+.+...+.. T Consensus 107 ~k~~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 107 MTPKAKQLYAEKVKEGLGL 125 (125) T ss_pred hHHHHHHHHHHHHHHHhcC Confidence 9999999999888776665 No 51 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=97.85 E-value=3e-08 Score=61.82 Aligned_cols=87 Identities=14% Similarity=0.172 Sum_probs=51.4 Q ss_pred CeeeeecCcHHHHHHHHH---------HHHHHh--------cCCcEEEEeccCCCC--CCCCC-----CCHHHHHHHHhc Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTK---------ALNKLN--------SDEFVTVGIHEADNA--RPEGV-----LTNAQLGAIQHF 56 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~---------~l~~l~--------~~~~V~VGi~~~~~~--~~~~g-----~~~A~ia~~~E~ 56 (156) -+++++++.+...+.|.+ ++++.. ..-.|.-|-+..+-. ..++| -+.+.+|.+.|| T Consensus 2 ~~~i~i~Gld~l~~~L~~~~~~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~ 81 (112) T protein:vir:36 2 KSSLSFKGIDQLVKHLDKAASLKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMELTEGGFSGQAGPHTDYSAYVEY 81 (112) T ss_pred ceeeeehhHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeeecCCceEEEeecCCCccceeec Confidence 334444555543333321 111100 000112222222211 11122 345788999999 Q ss_pred CCCCCCCchhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019457. 57 GNDRIPARPWLDVGVASVNDEILDTIAASLE 87 (156) Q Consensus 57 G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 87 (156) ||...|+||||+|+++.++.++.+.|++.++ T Consensus 82 GT~k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 82 GTRFQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred cccccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 9999999999999999999999999999998 No 52 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=97.83 E-value=5.2e-08 Score=60.54 Aligned_cols=96 Identities=17% Similarity=0.265 Sum_probs=50.6 Q ss_pred CeeeeecCcHHHHHHHHHH-----------HHHHh------------cCCcEEEEeccCCCC----CCCCC-----CCHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKA-----------LNKLN------------SDEFVTVGIHEADNA----RPEGV-----LTNA 48 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~-----------l~~l~------------~~~~V~VGi~~~~~~----~~~~g-----~~~A 48 (156) ||++++++.++..++|.+. +.+.. ..-.|.-|-+..+-. .++++ .+.+ T Consensus 1 m~~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~~~~~~g~V~~~~ 80 (182) T protein:vir:10 1 MIEVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVDGDEVIGRWWNSS 80 (182) T ss_pred CeEEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeecCCeEEEEeecCC Confidence 9999999988765555331 11100 000122222222210 01111 2335 Q ss_pred HHHHHHhcCC------------------------------------------------------CCCCCchhhhHHHHHH Q lcl|NC_019457. 49 QLGAIQHFGN------------------------------------------------------DRIPARPWLDVGVASV 74 (156) Q Consensus 49 ~ia~~~E~G~------------------------------------------------------~~IP~RpFlr~~~~~~ 74 (156) .+|.+.|||| ...||||||+|++.++ T Consensus 81 ~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~pA~~~~ 160 (182) T protein:vir:10 81 MVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMTPAANKM 160 (182) T ss_pred CccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchHHHHHHh Confidence 5777777775 2479999999999999 Q ss_pred HHHHHHHHHHHHhccccHHHHHHHHHH Q lcl|NC_019457. 75 NDEILDTIAASLEDGEDISQLLNRVGV 101 (156) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~l~~iG~ 101 (156) ++++.+.|++.+...+. +.+|- T Consensus 161 ~~~i~~~i~~~i~~~l~-----~~~g~ 182 (182) T protein:vir:10 161 AKEAPEIIKRSIDQELH-----DKLGG 182 (182) T ss_pred HHHHHHHHHHHHHHHHH-----HhhcC Confidence 98887776655432110 00010 No 53 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=97.81 E-value=1.4e-07 Score=58.21 Aligned_cols=85 Identities=8% Similarity=0.026 Sum_probs=66.5 Q ss_pred HHHHHHHHHHHHHHHhc--cccHHHHHHHHHHHHHHHHHHHHHcC------CCCCCcHHHHHhcC--CCCchhHHHHHHh Q lcl|NC_019457. 72 ASVNDEILDTIAASLED--GEDISQLLNRVGVVAVAGVQNYIDEL------RSPANAPSTVERKG--ADNPLVDTGEMKQ 141 (156) Q Consensus 72 ~~~~~~~~~~~~~~~~~--~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppns~~Ti~~KG--~~~PLidTG~L~~ 141 (156) -+.-+++...+...+.. ..+....|..||..+....++.|... .|+|+++.|+++|. ..++|+++|.|.. T Consensus 1 ~~~~~~l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~~~~l~~~~~l~~ 80 (150) T protein:vir:20 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCCCccccchhhhhh Confidence 22334455555555543 22446789999999999999999875 68899999998664 4679999999999 Q ss_pred hceeeecccccccCC Q lcl|NC_019457. 142 SVTYNIQTGRPSEGL 156 (156) Q Consensus 142 SIty~V~~~k~~~g~ 156 (156) ||+|++....+++|. T Consensus 81 sl~~~~~~~~~~vg~ 95 (150) T protein:vir:20 81 FLHIRASPEQASMEF 95 (150) T ss_pred hhheeecCcEEEEEe Confidence 999999999998865 No 54 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=97.79 E-value=7.1e-08 Score=59.77 Aligned_cols=86 Identities=21% Similarity=0.307 Sum_probs=40.9 Q ss_pred eeeeecCcH-----HHHHHHHHHHHHH-------------------hc------CCcEEEEeccCCCCCCCCCC------ Q lcl|NC_019457. 2 IKITVPNFD-----AVRDELTKALNKL-------------------NS------DEFVTVGIHEADNARPEGVL------ 45 (156) Q Consensus 2 ~~v~~~~~~-----~~~~~l~~~l~~l-------------------~~------~~~V~VGi~~~~~~~~~~g~------ 45 (156) |+|++.+.+ +.++.|.+...+. .. ++.+.+-..... ...|. T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~---s~~g~~~~~Vg 77 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEE---SVEGIQTYAVS 77 (157) T ss_pred CeeEeecccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeecccc---CCCceEEEEEe Confidence 777775433 2233322211111 00 011222111110 00011 Q ss_pred ---CHHHHHHHHhcCC------------------------CCCCCchhhhHHHHHHHHHHHHHHH--------HHHhccc Q lcl|NC_019457. 46 ---TNAQLGAIQHFGN------------------------DRIPARPWLDVGVASVNDEILDTIA--------ASLEDGE 90 (156) Q Consensus 46 ---~~A~ia~~~E~G~------------------------~~IP~RpFlr~~~~~~~~~~~~~~~--------~~~~~~~ 90 (156) ..+-++.+.|||+ ..+||||||||+|+..+++..+.+. +++.|.. T Consensus 78 ~~~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I~e~l~g~~ 157 (157) T protein:vir:97 78 WRKKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKYAELQRGDT 157 (157) T ss_pred ecCCccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHHHHHHHHHHHHHHHHHHHhcCCC Confidence 1233456678882 3499999999999999887766543 3333433 No 55 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=97.78 E-value=6.6e-08 Score=59.95 Aligned_cols=83 Identities=13% Similarity=0.144 Sum_probs=48.2 Q ss_pred ecCcHHHHHHHHHHHHH-----------Hh--------cCCcEEEEeccCCCC--CCCC----CCCHHHHHHHHhcCCCC Q lcl|NC_019457. 6 VPNFDAVRDELTKALNK-----------LN--------SDEFVTVGIHEADNA--RPEG----VLTNAQLGAIQHFGNDR 60 (156) Q Consensus 6 ~~~~~~~~~~l~~~l~~-----------l~--------~~~~V~VGi~~~~~~--~~~~----g~~~A~ia~~~E~G~~~ 60 (156) +++.++..+.|.+...+ .. ...-|.-|.+..+-. ..++ -.+.+.+|.+.||||.. T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~v~~~~~Ya~~vE~GT~~ 80 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQQRLLHYRVVSPALYSIYLELGTRK 80 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeeecCcEEEEeecCcccchhcccCccc Confidence 33333322222211111 10 000112222222200 0011 13457899999999999 Q ss_pred CCCchhhhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 61 IPARPWLDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 61 IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (156) .|+||||+|+++.++.++.+.|++.++. T Consensus 81 m~a~Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 81 MEAQSFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred cCCCcchhhhHHHHHHHHHHHHHHHhcC Confidence 9999999999999999999999988887 No 56 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=97.70 E-value=2.5e-07 Score=56.82 Aligned_cols=85 Identities=7% Similarity=0.015 Sum_probs=66.6 Q ss_pred HHHHHHHHHHHHHHHhc--cccHHHHHHHHHHHHHHHHHHHHHcC------CCCCCcHHHHHhcCC--CCchhHHHHHHh Q lcl|NC_019457. 72 ASVNDEILDTIAASLED--GEDISQLLNRVGVVAVAGVQNYIDEL------RSPANAPSTVERKGA--DNPLVDTGEMKQ 141 (156) Q Consensus 72 ~~~~~~~~~~~~~~~~~--~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppns~~Ti~~KG~--~~PLidTG~L~~ 141 (156) .+.-+++...|...+.. ..+....|..||..+....++.|.+. .|+|+++.|+++|+. .++|+++|.|.. T Consensus 1 ~~~~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~~~l~~ 80 (150) T protein:vir:60 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCCCccchhhhhhcc Confidence 23333444444444443 23457799999999999999999875 788999999988754 589999999999 Q ss_pred hceeeecccccccCC Q lcl|NC_019457. 142 SVTYNIQTGRPSEGL 156 (156) Q Consensus 142 SIty~V~~~k~~~g~ 156 (156) ||+|++....+++|+ T Consensus 81 sl~~~~~~~~a~vg~ 95 (150) T protein:vir:60 81 FLHIRASPEQASMEF 95 (150) T ss_pred eeeeeeeCcEEEEEe Confidence 999999999999865 No 57 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=97.64 E-value=3.5e-07 Score=55.97 Aligned_cols=85 Identities=7% Similarity=0.015 Sum_probs=66.8 Q ss_pred HHHHHHHHHHHHHHHhc--cccHHHHHHHHHHHHHHHHHHHHHcC------CCCCCcHHHHHhcCC--CCchhHHHHHHh Q lcl|NC_019457. 72 ASVNDEILDTIAASLED--GEDISQLLNRVGVVAVAGVQNYIDEL------RSPANAPSTVERKGA--DNPLVDTGEMKQ 141 (156) Q Consensus 72 ~~~~~~~~~~~~~~~~~--~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppns~~Ti~~KG~--~~PLidTG~L~~ 141 (156) .++-+++...+...+.. ..+...+|..||..+....++.|... +|+|+++.|+++|+. .++|+.+|.|.. T Consensus 1 m~~~~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~~~~~l~~~~~l~~ 80 (150) T protein:vir:57 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSR 80 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccCCCcccchhhhhcc Confidence 22234444455555443 23457799999999999999999875 789999999987753 589999999999 Q ss_pred hceeeecccccccCC Q lcl|NC_019457. 142 SVTYNIQTGRPSEGL 156 (156) Q Consensus 142 SIty~V~~~k~~~g~ 156 (156) ||+|++....+++|+ T Consensus 81 sl~~~~~~~~a~vg~ 95 (150) T protein:vir:57 81 FLHIRASPEQASMEF 95 (150) T ss_pred ceeeeeeCcEEEEEe Confidence 999999999998865 No 58 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=97.58 E-value=3.7e-07 Score=55.83 Aligned_cols=87 Identities=22% Similarity=0.358 Sum_probs=43.8 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHhcCCcE--EEE----------eccC-------------------------------- Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLNSDEFV--TVG----------IHEA-------------------------------- 36 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~~~~~V--~VG----------i~~~-------------------------------- 36 (156) |+.+++++ ..+.++|.+..+.+.....+ .|| |-.+ T Consensus 4 ~i~i~~d~-~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~~~~ 82 (175) T protein:vir:79 4 FVNFQIDD-SALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELTAA 82 (175) T ss_pred EEEEEech-HHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccchhh Confidence 77777765 44555555544433211100 000 0000 Q ss_pred -------------------C-CC--CCCC---CCCHHHHHHHHhcCCC-------CCCCchhhhHHH---------HHHH Q lcl|NC_019457. 37 -------------------D-NA--RPEG---VLTNAQLGAIQHFGND-------RIPARPWLDVGV---------ASVN 75 (156) Q Consensus 37 -------------------~-~~--~~~~---g~~~A~ia~~~E~G~~-------~IP~RpFlr~~~---------~~~~ 75 (156) + ++ .++. |+ +..||++|+||.. +||+||||=-.- +... T Consensus 83 ~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGt-n~~YAaiHqfGg~~~~~~~v~IPARPfLG~s~~de~~~~~~~~I~ 161 (175) T protein:vir:79 83 ASRRKAGLMILQDSGQMAASTATDSGEDYSVIGS-NKEYAAIQHFGGQAGRGLKVTIPGRAWLPVTADGELQPEAVEPVL 161 (175) T ss_pred HhhhccCCCcceechhhhhhhhheecCCEEEEec-CcchhhHhhcccccCCCcccccCcccccCCCcccchhHHHHHHHH Confidence 0 00 0011 33 3468999999974 799999994321 2234 Q ss_pred HHHHHHHHHHHhcc Q lcl|NC_019457. 76 DEILDTIAASLEDG 89 (156) Q Consensus 76 ~~~~~~~~~~~~~~ 89 (156) +.+.++|+.++.+. T Consensus 162 ~~i~~~l~~a~~~~ 175 (175) T protein:vir:79 162 NTILRHLMDAANRR 175 (175) T ss_pred HHHHHHHHHHhccC Confidence 44455666666666 No 59 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=97.56 E-value=7.8e-08 Score=59.54 Aligned_cols=93 Identities=15% Similarity=0.233 Sum_probs=46.0 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHhc----CCcEEEEeccCCC-----CCCCCC-----CCHHHHHHHHhcCCCCCCCchh Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLNS----DEFVTVGIHEADN-----ARPEGV-----LTNAQLGAIQHFGNDRIPARPW 66 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~~----~~~V~VGi~~~~~-----~~~~~g-----~~~A~ia~~~E~G~~~IP~RpF 66 (156) |-++.-...+.+.+++.+..+.+.. .-.+.-|.+.++- ....+| -+.+.+|.+.||||...|+||| T Consensus 19 L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~~~~~~~~v~~~~~Ya~~vEfGT~~~~a~Pf 98 (125) T protein:vir:94 19 FDISRKELVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVKEEHGVVTGRYVARADYSSYNEYGTYRMSAQPF 98 (125) T ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecceeccCCcEEEEeeCCCCccceeecccccCCCCcc Confidence 1111001111112222222222111 0001122222110 001111 2346789999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHhccccHH Q lcl|NC_019457. 67 LDVGVASVNDEILDTIAASLEDGEDIS 93 (156) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~~~~~~ 93 (156) |+|+++.++.++.+.|++.+....--. T Consensus 99 l~pa~~~~~~~~~~~l~~~l~~a~k~~ 125 (125) T protein:vir:94 99 MAPSVAAMTPFFYKAVRDALNKAAKFS 125 (125) T ss_pred cchhHHHHHHHHHHHHHHHHHHHhccC Confidence 999999999998888777765432111 No 60 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=97.54 E-value=5.1e-07 Score=55.09 Aligned_cols=84 Identities=12% Similarity=0.195 Sum_probs=50.6 Q ss_pred eeecCcHHHHHHHHHH---------HHHHh--------cCCcEEEEeccCCCCC--CCCC-----CCHHHHHHHHhcCCC Q lcl|NC_019457. 4 ITVPNFDAVRDELTKA---------LNKLN--------SDEFVTVGIHEADNAR--PEGV-----LTNAQLGAIQHFGND 59 (156) Q Consensus 4 v~~~~~~~~~~~l~~~---------l~~l~--------~~~~V~VGi~~~~~~~--~~~g-----~~~A~ia~~~E~G~~ 59 (156) |++++.++..+.|.+. |++.. ..-.|.-|.+..+-.. .++| .+.+.+|.+.||||. T Consensus 1 i~i~Gld~l~~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~GT~ 80 (108) T protein:vir:74 1 MKITGIDALQKKLRKNATLDDVKHVVKSNTASMNKNMQNLAPVDTGNMKRSITSEFTDGGLSGTTGPHTDYAGYVEYGTR 80 (108) T ss_pred CcchhHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhccceeeeecCceEEEeecCCCcccceecccc Confidence 5555555544444322 11100 0001111222221110 1112 244678999999999 Q ss_pred CCCCchhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019457. 60 RIPARPWLDVGVASVNDEILDTIAASLE 87 (156) Q Consensus 60 ~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 87 (156) ..|+||||+|+++.++.++.+.|++.++ T Consensus 81 km~aqpf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 81 FQSAQPFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred ccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 9999999999999999999999999888 No 61 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=97.45 E-value=2e-07 Score=57.26 Aligned_cols=87 Identities=13% Similarity=0.106 Sum_probs=48.3 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHhc-----------------------CCcEEEEeccCCCC--CCCCC-----CCHHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLNS-----------------------DEFVTVGIHEADNA--RPEGV-----LTNAQL 50 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~~-----------------------~~~V~VGi~~~~~~--~~~~g-----~~~A~i 50 (156) |.+++++...++++.|.+.|+++.. .-.|.-|-+..+-. ...+| -+.+.+ T Consensus 1 m~~ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~~~g~~~~V~~~~~Y 80 (144) T protein:vir:59 1 MALMSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYKNNGLTAEITVGAEY 80 (144) T ss_pred CCcceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEeecCcEEEEEecCCCc Confidence 4444443333333333222211110 00111122222110 01112 345789 Q ss_pred HHHHhcCC---------------------------CCCCCchhhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019457. 51 GAIQHFGN---------------------------DRIPARPWLDVGVASVNDEILDTIAASLE 87 (156) Q Consensus 51 a~~~E~G~---------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 87 (156) |.+.|||| ..+||||||+++++.+++.+.+.|++++- T Consensus 81 A~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~~~~~i~~~~g 144 (144) T protein:vir:59 81 AIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGGEYFEREMRRLRG 144 (144) T ss_pred cchhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHHHHHHHHHHHHhcC Confidence 99999997 24899999999999999999998888776 No 62 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=97.42 E-value=1.8e-07 Score=57.59 Aligned_cols=81 Identities=12% Similarity=0.073 Sum_probs=43.2 Q ss_pred CeeeeecCcHHHHHH-----------------HHHHHHH-----H--hcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhc Q lcl|NC_019457. 1 MIKITVPNFDAVRDE-----------------LTKALNK-----L--NSDEFVTVGIHEADNARPEGVLTNAQLGAIQHF 56 (156) Q Consensus 1 m~~v~~~~~~~~~~~-----------------l~~~l~~-----l--~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~ 56 (156) .-+......+++-+- |...+.- - .+...|.||+..+ .+.+|.+.|| T Consensus 21 ~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~----------~~~~a~F~E~ 90 (125) T protein:vir:94 21 MNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG----------VSHRIHATEF 90 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC----------CceEEEeccC Confidence 000000000011111 1111100 0 0112355664322 1256789999 Q ss_pred CCCCCCCchhhhHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019457. 57 GNDRIPARPWLDVGVASVNDEILDTIAASLEDGED 91 (156) Q Consensus 57 G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~ 91 (156) ||.++||+||+|+|++++++++.+.+.+.+..-.- T Consensus 91 GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:94 91 GTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 99999999999999999999998887776643222 No 63 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=97.42 E-value=1.8e-07 Score=57.59 Aligned_cols=81 Identities=12% Similarity=0.073 Sum_probs=43.2 Q ss_pred CeeeeecCcHHHHHH-----------------HHHHHHH-----H--hcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhc Q lcl|NC_019457. 1 MIKITVPNFDAVRDE-----------------LTKALNK-----L--NSDEFVTVGIHEADNARPEGVLTNAQLGAIQHF 56 (156) Q Consensus 1 m~~v~~~~~~~~~~~-----------------l~~~l~~-----l--~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~ 56 (156) .-+......+++-+- |...+.- - .+...|.||+..+ .+.+|.+.|| T Consensus 21 ~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~----------~~~~a~F~E~ 90 (125) T protein:vir:98 21 MNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG----------VSHRIHATEF 90 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC----------CceEEEeccC Confidence 000000000011111 1111100 0 0112355664322 1256789999 Q ss_pred CCCCCCCchhhhHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019457. 57 GNDRIPARPWLDVGVASVNDEILDTIAASLEDGED 91 (156) Q Consensus 57 G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~ 91 (156) ||.++||+||+|+|++++++++.+.+.+.+..-.- T Consensus 91 GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:98 91 GTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 99999999999999999999998887776643222 No 64 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=97.42 E-value=1.8e-07 Score=57.59 Aligned_cols=81 Identities=12% Similarity=0.073 Sum_probs=43.2 Q ss_pred CeeeeecCcHHHHHH-----------------HHHHHHH-----H--hcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhc Q lcl|NC_019457. 1 MIKITVPNFDAVRDE-----------------LTKALNK-----L--NSDEFVTVGIHEADNARPEGVLTNAQLGAIQHF 56 (156) Q Consensus 1 m~~v~~~~~~~~~~~-----------------l~~~l~~-----l--~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~ 56 (156) .-+......+++-+- |...+.- - .+...|.||+..+ .+.+|.+.|| T Consensus 21 ~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~----------~~~~a~F~E~ 90 (125) T protein:vir:81 21 MNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG----------VSHRIHATEF 90 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC----------CceEEEeccC Confidence 000000000011111 1111100 0 0112355664322 1256789999 Q ss_pred CCCCCCCchhhhHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019457. 57 GNDRIPARPWLDVGVASVNDEILDTIAASLEDGED 91 (156) Q Consensus 57 G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~ 91 (156) ||.++||+||+|+|++++++++.+.+.+.+..-.- T Consensus 91 GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:81 91 GTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 99999999999999999999998887776643222 No 65 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=97.42 E-value=1.8e-07 Score=57.59 Aligned_cols=81 Identities=12% Similarity=0.073 Sum_probs=43.2 Q ss_pred CeeeeecCcHHHHHH-----------------HHHHHHH-----H--hcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhc Q lcl|NC_019457. 1 MIKITVPNFDAVRDE-----------------LTKALNK-----L--NSDEFVTVGIHEADNARPEGVLTNAQLGAIQHF 56 (156) Q Consensus 1 m~~v~~~~~~~~~~~-----------------l~~~l~~-----l--~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~ 56 (156) .-+......+++-+- |...+.- - .+...|.||+..+ .+.+|.+.|| T Consensus 21 ~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~----------~~~~a~F~E~ 90 (125) T protein:vir:47 21 MNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG----------VSHRIHATEF 90 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC----------CceEEEeccC Confidence 000000000011111 1111100 0 0112355664322 1256789999 Q ss_pred CCCCCCCchhhhHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019457. 57 GNDRIPARPWLDVGVASVNDEILDTIAASLEDGED 91 (156) Q Consensus 57 G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~ 91 (156) ||.++||+||+|+|++++++++.+.+.+.+..-.- T Consensus 91 GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:47 91 GTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 99999999999999999999998887776643222 No 66 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=97.42 E-value=1.8e-07 Score=57.59 Aligned_cols=81 Identities=12% Similarity=0.073 Sum_probs=43.2 Q ss_pred CeeeeecCcHHHHHH-----------------HHHHHHH-----H--hcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhc Q lcl|NC_019457. 1 MIKITVPNFDAVRDE-----------------LTKALNK-----L--NSDEFVTVGIHEADNARPEGVLTNAQLGAIQHF 56 (156) Q Consensus 1 m~~v~~~~~~~~~~~-----------------l~~~l~~-----l--~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~ 56 (156) .-+......+++-+- |...+.- - .+...|.||+..+ .+.+|.+.|| T Consensus 21 ~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~----------~~~~a~F~E~ 90 (125) T protein:vir:79 21 MNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKG----------VSHRIHATEF 90 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCC----------CceEEEeccC Confidence 000000000011111 1111100 0 0112355664322 1256789999 Q ss_pred CCCCCCCchhhhHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019457. 57 GNDRIPARPWLDVGVASVNDEILDTIAASLEDGED 91 (156) Q Consensus 57 G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~ 91 (156) ||.++||+||+|+|++++++++.+.+.+.+..-.- T Consensus 91 GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:79 91 GTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred CccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 99999999999999999999998887776643222 No 67 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=97.41 E-value=3.1e-07 Score=56.30 Aligned_cols=87 Identities=10% Similarity=0.176 Sum_probs=46.1 Q ss_pred Ceeeeec-CcHHHHHHHHHHHHHHhc----CCcEEEEeccCCCCC--CCCC-----CCHHHHHHHHhcCCCCCCCchhhh Q lcl|NC_019457. 1 MIKITVP-NFDAVRDELTKALNKLNS----DEFVTVGIHEADNAR--PEGV-----LTNAQLGAIQHFGNDRIPARPWLD 68 (156) Q Consensus 1 m~~v~~~-~~~~~~~~l~~~l~~l~~----~~~V~VGi~~~~~~~--~~~g-----~~~A~ia~~~E~G~~~IP~RpFlr 68 (156) +-.+... ......+.+.+....+.. .-.|.-|-+..+-.. .++| .+.+.+|.+.||||...|+||||+ T Consensus 10 ~~~l~~~~~~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~GT~~m~aqPFl~ 89 (108) T protein:vir:98 10 QKKLRKNATLNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFTDGGLTGTTIPHTDYAGYVEYGTRFQAAQPFVK 89 (108) T ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeeecCceEEEeecCCCccceeeccccccCCCcchh Confidence 0000000 011111222222222110 001111222111110 0111 234678999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHh Q lcl|NC_019457. 69 VGVASVNDEILDTIAASLE 87 (156) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~ 87 (156) |+++.++.++.+.|++.++ T Consensus 90 pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 90 PAFDVQKKIFTNDLERLTK 108 (108) T ss_pred hHHHHHHHHHHHHHHHHcC Confidence 9999999999999999998 No 68 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=97.29 E-value=8e-07 Score=54.01 Aligned_cols=81 Identities=15% Similarity=0.313 Sum_probs=56.9 Q ss_pred CeeeeecCcHHHHHHHHHHH-----------H--------HHhcC------------------CcEEEEeccCCCCCCCC Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKAL-----------N--------KLNSD------------------EFVTVGIHEADNARPEG 43 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l-----------~--------~l~~~------------------~~V~VGi~~~~~~~~~~ 43 (156) |.++++.+.+...+.|.+.. . ++... ..|.||+ T Consensus 1 Ma~iel~G~del~~~l~~~g~~~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkkik~~~kk~g~~~VG~---------- 70 (119) T protein:vir:10 1 MASLEIEGFEEFEKFISEDMVLDESTKRKGIKAGITKIGKAIEKNSPIKSGRLSKVKIRVKNTGLATEGT---------- 70 (119) T ss_pred CceeehhhHHHHHHHHHhhhhhhHHHHHHHHHHHhHHHHHHHhhcCCcccCCcceeeeeeecCceeEecc---------- Confidence 88888888777666552211 1 11000 0112222 Q ss_pred CCCHHHHHHHHhcCCCCCCCc-hhhhHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019457. 44 VLTNAQLGAIQHFGNDRIPAR-PWLDVGVASVNDEILDTIAASLEDGED 91 (156) Q Consensus 44 g~~~A~ia~~~E~G~~~IP~R-pFlr~~~~~~~~~~~~~~~~~~~~~~~ 91 (156) +.+-+.++-++||||...|+| |||.+++++..++..+.++..+..++- T Consensus 71 ~ks~~fy~kF~EFGTSkm~a~~pF~~~a~~~~~~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 71 ASSSEFYDIFQNFGTSEQKAHVGYFDRAVDETTNEAVEEVAEIIFRKMR 119 (119) T ss_pred CCcchhhhhhccccccccCCCCCccccccccChHHHHHHHHHHHHHhcC Confidence 235578999999999999999 999999999999999999888866555 No 69 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=97.13 E-value=1.1e-06 Score=53.24 Aligned_cols=80 Identities=21% Similarity=0.324 Sum_probs=35.3 Q ss_pred Ceeee---------ecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCC--CCCCchhhhH Q lcl|NC_019457. 1 MIKIT---------VPNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGND--RIPARPWLDV 69 (156) Q Consensus 1 m~~v~---------~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~--~IP~RpFlr~ 69 (156) +-.-+ ..+....++.|...+..-.....|.|| ++..+|++|+||+. +||+||||-. T Consensus 52 Ls~st~a~k~~~~~L~~tG~L~~Si~~~~~~~~~~~~a~vG-------------tn~~YA~~hqfG~~~~~IPaRPfLG~ 118 (145) T protein:vir:31 52 LKESTIRAKGSDTPLIDNSRLLTDINAASMMDRANRMAVIG-------------TNLDYAEHHEFGAPEAGIPARPIFGP 118 (145) T ss_pred cChHHHHHhcCCCCCccCHHHHHHHHHHhhhcccCceeEec-------------CCchhhhhhccCCcccccCCCCccCC Confidence 10000 000111112222211100011123333 34568999999985 5999999977 Q ss_pred HHHHHHHHHHHHHHHHHhc---cccHH Q lcl|NC_019457. 70 GVASVNDEILDTIAASLED---GEDIS 93 (156) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~---~~~~~ 93 (156) +.....+++.+.+...+.. +.-.+ T Consensus 119 ~~~~~~~~~~~ii~~~i~~~L~~~~~~ 145 (145) T protein:vir:31 119 AGAYASQQAPDVIGDEIDTNLEGAVID 145 (145) T ss_pred CccchHHHHHHHHHHHHHHHhhhhccC Confidence 6555455555444443322 11111 No 70 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=97.04 E-value=2.3e-06 Score=51.52 Aligned_cols=86 Identities=19% Similarity=0.197 Sum_probs=48.4 Q ss_pred Ceeeeec-CcHHHHHHHHHHHHHHh-------------------cCCcEEEEeccCCCC--CCCCC-------CCHHHHH Q lcl|NC_019457. 1 MIKITVP-NFDAVRDELTKALNKLN-------------------SDEFVTVGIHEADNA--RPEGV-------LTNAQLG 51 (156) Q Consensus 1 m~~v~~~-~~~~~~~~l~~~l~~l~-------------------~~~~V~VGi~~~~~~--~~~~g-------~~~A~ia 51 (156) |.+++++ +.+.+.++|.+..+++. ..-.|.-|-+..+-. ...+| .+.+.+| T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~g~~~~~~v~~~~~YA 80 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSGGRFSFSVTIGTNVTYA 80 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeccCCceEEEEEecCcccc Confidence 8877766 44433333332221110 000122232222110 01111 2457899 Q ss_pred HHHhcCCC---------------------------CCCCchhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019457. 52 AIQHFGND---------------------------RIPARPWLDVGVASVNDEILDTIAASL 86 (156) Q Consensus 52 ~~~E~G~~---------------------------~IP~RpFlr~~~~~~~~~~~~~~~~~~ 86 (156) .++|||+. .+||||||+++++++++++.+.++++= T Consensus 81 ~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~~~~~~~ 142 (142) T protein:vir:94 81 ADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRNHAKGIR 142 (142) T ss_pred hhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHHHHHHHHHHhcC Confidence 99999973 278999999999999988887777655 No 71 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=97.02 E-value=3e-06 Score=50.85 Aligned_cols=90 Identities=18% Similarity=0.237 Sum_probs=47.6 Q ss_pred eeecCcHHHHHHHHHHHHHHh-------------------cCCcEEEEeccCCC----CCCCCC-----CCHHHHHHHHh Q lcl|NC_019457. 4 ITVPNFDAVRDELTKALNKLN-------------------SDEFVTVGIHEADN----ARPEGV-----LTNAQLGAIQH 55 (156) Q Consensus 4 v~~~~~~~~~~~l~~~l~~l~-------------------~~~~V~VGi~~~~~----~~~~~g-----~~~A~ia~~~E 55 (156) |++++.++..++|.+.-+.+. ..--|.-|-+..+- ..++++ .+.+.+|.+.| T Consensus 1 i~i~Gld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~~~~v~~~~~Ya~fvE 80 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDIDKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKAKDLISKKITVNELYGAYME 80 (173) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeeccCceeEEeeCCCcccchhhh Confidence 555555544433333211100 00001111111110 001111 25578899999 Q ss_pred cCCC-------------------------------------------------------CCCCchhhhHHHHHHHHHHHH Q lcl|NC_019457. 56 FGND-------------------------------------------------------RIPARPWLDVGVASVNDEILD 80 (156) Q Consensus 56 ~G~~-------------------------------------------------------~IP~RpFlr~~~~~~~~~~~~ 80 (156) |||. ..||||||+|+++++++++.+ T Consensus 81 fGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqPFl~PA~~~~~~~~~~ 160 (173) T protein:vir:10 81 FGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQPFLYPAWIEGKKQYLK 160 (173) T ss_pred cccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEeecCCCCCCccchhHHHHhHHHHHH Confidence 9973 389999999999999998888 Q ss_pred HHHHHHhccccHHHHHHHH Q lcl|NC_019457. 81 TIAASLEDGEDISQLLNRV 99 (156) Q Consensus 81 ~~~~~~~~~~~~~~~l~~i 99 (156) .|++.+.. .|..| T Consensus 161 ~i~~~i~~------~lrk~ 173 (173) T protein:vir:10 161 DLENLLKT------YNKKI 173 (173) T ss_pred HHHHHHHH------HhhcC Confidence 88776643 22222 No 72 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=97.02 E-value=1.2e-06 Score=53.10 Aligned_cols=77 Identities=18% Similarity=0.173 Sum_probs=40.3 Q ss_pred CeeeeecCcHHHHHHHHHH--HHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCC-----------CCCCCchhh Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKA--LNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGN-----------DRIPARPWL 67 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~--l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~-----------~~IP~RpFl 67 (156) =-+..++ ...++.+|... |.--.+...+.|||.+ ++..||++|.||. .+||+|||| T Consensus 63 ~~k~~~~-~~~m~~~L~~a~~l~~~a~~~~~~Vg~~G----------t~~~yAaiHQfG~~~r~~~~~~~~v~iPaRp~L 131 (152) T protein:vir:10 63 GVKSKIK-SGKMFDKITQPRFMRLRLESEGVSLGYEG----------GDAVIARIHQQGLIGRVRKDWDLKVKYASRELL 131 (152) T ss_pred hhccccc-chhHHHhhhhcceeeeeecCcEEEEEecC----------CchhhhhhhccCccccccCCCCcceeccccccC Confidence 0011111 11223333221 1111233468888863 3567999999993 469999999 Q ss_pred hHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019457. 68 DVGVASVNDEILDTIAASLEDG 89 (156) Q Consensus 68 r~~~~~~~~~~~~~~~~~~~~~ 89 (156) =-+ ++...++.+.+.+.|.+. T Consensus 132 G~s-~~d~~~I~~~i~~~l~~a 152 (152) T protein:vir:10 132 GFT-DDDLQMIEDYMINILAGS 152 (152) T ss_pred CCC-HHHHHHHHHHHHHHHhcC Confidence 433 223455555555555554 No 73 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=96.99 E-value=2.3e-06 Score=51.52 Aligned_cols=75 Identities=19% Similarity=0.220 Sum_probs=38.8 Q ss_pred CeeeeecCcHHHHHHH--HHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCC----------CCCCchhhh Q lcl|NC_019457. 1 MIKITVPNFDAVRDEL--TKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGND----------RIPARPWLD 68 (156) Q Consensus 1 m~~v~~~~~~~~~~~l--~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~----------~IP~RpFlr 68 (156) .-+ ... ....+..+ ...|.-......+.||+.+ ++..||++|.||.. +||+||||= T Consensus 63 ~~~-g~~-~~~~~~~l~~~~~l~~~~~~~~~~v~~~G----------tn~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG 130 (149) T protein:vir:18 63 SKK-GRI-KREMFAKLRTSRFMKAKGSDSAAVVEFTG----------KVQRMARVHQYGLKDRPNRNSRDVQYEARPLLG 130 (149) T ss_pred hcc-Ccc-cchhhhhhhhhhhhheeecCceeEEEecc----------cchhhhhhhhccccccccCCCccccccccccCC Confidence 111 000 01112222 2222222233346677652 35679999999953 699999994 Q ss_pred HHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 69 VGVASVNDEILDTIAASLED 88 (156) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~ 88 (156) -+ ++...++.+.+...|.. T Consensus 131 ~s-~~d~~~I~~~i~~~l~~ 149 (149) T protein:vir:18 131 FT-RDDEQMIEDVIISHLGK 149 (149) T ss_pred CC-HHHHHHHHHHHHHHHhC Confidence 32 33455666666555554 No 74 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=96.97 E-value=1.1e-06 Score=53.22 Aligned_cols=82 Identities=15% Similarity=0.144 Sum_probs=47.7 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHh-------------------cCCcEEEEeccCCCCC--CCCC-----CCHHHHHHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLN-------------------SDEFVTVGIHEADNAR--PEGV-----LTNAQLGAIQ 54 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~-------------------~~~~V~VGi~~~~~~~--~~~g-----~~~A~ia~~~ 54 (156) |.++. ++.++..+.|.+.-+++. ..-.|.-|-+..+-.. ..+| -+.+.+|.+. T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~pvdTG~L~~Si~~~~~~~g~~~~V~~~~~YA~yv 79 (137) T protein:vir:96 1 MAKVK-YGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAPVDLGFLKESIDFKVTDGGFSSVISVGAEYAIYV 79 (137) T ss_pred CchhH-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCccchhcCceeEeecCceEEEEecCCCccccc Confidence 88876 455554444433221110 0001112322222100 1111 2447899999 Q ss_pred hcCC-----------------------------CCCCCchhhhHHHHHHHHHHHHHHH Q lcl|NC_019457. 55 HFGN-----------------------------DRIPARPWLDVGVASVNDEILDTIA 83 (156) Q Consensus 55 E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (156) |||| +.+|+||||+++++++++.+.+.|. T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:96 80 EFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRKVFNRYFS 137 (137) T ss_pred ccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHHHHHHHhhC Confidence 9997 3489999999999999999998888 No 75 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=96.93 E-value=2.8e-06 Score=51.04 Aligned_cols=76 Identities=17% Similarity=0.195 Sum_probs=39.4 Q ss_pred Ceee-------eec-CcHHHHH--HHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCC----------C Q lcl|NC_019457. 1 MIKI-------TVP-NFDAVRD--ELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGND----------R 60 (156) Q Consensus 1 m~~v-------~~~-~~~~~~~--~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~----------~ 60 (156) .-.. ... .....+. .+...|.-......|.|||.+ +++.||++|+||.. + T Consensus 53 W~p~~~~~~~~k~~~~~~~l~~~g~l~~sl~~~~~~~~~~V~~~G----------s~~~yAa~HQfG~~~r~~~~~~~~~ 122 (149) T protein:vir:98 53 YAARKRQSVRSKKGRIRREMFARLRTNRFMKAKGSDSAAVVEFTG----------RVQRMARVHQYGLKDRPNRHSRDVQ 122 (149) T ss_pred CcccchHHHHhccCCCCcccchhhhhhhhhhheecCCeeEEEecC----------cchHHhhHhhccccccccCCCccee Confidence 0000 000 0000001 122222222234457888763 35689999999953 6 Q ss_pred CCCchhhhHHH-HHHHHHHHHHHHHHHhc Q lcl|NC_019457. 61 IPARPWLDVGV-ASVNDEILDTIAASLED 88 (156) Q Consensus 61 IP~RpFlr~~~-~~~~~~~~~~~~~~~~~ 88 (156) ||+|||| ++ ++..+++.+.+...|.. T Consensus 123 iPaRp~L--G~s~~d~~~i~~~i~~~l~~ 149 (149) T protein:vir:98 123 YAARPLL--GFTRDDEQMIEDIIIRHLGK 149 (149) T ss_pred ccccccC--CCCHHHHHHHHHHHHHHhhC Confidence 9999999 44 33456666666555554 No 76 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=96.93 E-value=1.2e-06 Score=53.00 Aligned_cols=82 Identities=13% Similarity=0.101 Sum_probs=47.1 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHh-------------------cCCcEEEEeccCCC-C-CCCCC-----CCHHHHHHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLN-------------------SDEFVTVGIHEADN-A-RPEGV-----LTNAQLGAIQ 54 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~-------------------~~~~V~VGi~~~~~-~-~~~~g-----~~~A~ia~~~ 54 (156) |.++.. +.++..+.|.+.-.++. ..-.|.-|-+..+- . ..++| .+.+.+|.+. T Consensus 1 Ma~~~~-G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~v 79 (137) T protein:vir:94 1 MAKVKY-GNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYV 79 (137) T ss_pred CchhHH-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeecCcEEEEEecCCCccccc Confidence 888753 44444444432222211 00011112222221 0 01111 2457899999 Q ss_pred hcCC-----------------------------CCCCCchhhhHHHHHHHHHHHHHHH Q lcl|NC_019457. 55 HFGN-----------------------------DRIPARPWLDVGVASVNDEILDTIA 83 (156) Q Consensus 55 E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (156) |||| ..+|+||||+++++++++++.+.|. T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 80 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRVFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 9995 2489999999999999999999888 No 77 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=96.87 E-value=1.8e-06 Score=52.13 Aligned_cols=82 Identities=12% Similarity=0.172 Sum_probs=48.2 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHh-------------------cCCcEEEEeccCCCCC--CCCC-----CCHHHHHHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLN-------------------SDEFVTVGIHEADNAR--PEGV-----LTNAQLGAIQ 54 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~-------------------~~~~V~VGi~~~~~~~--~~~g-----~~~A~ia~~~ 54 (156) |.++.. +.++..+.|.+.-+++. ..-.|.-|-+..+-.. .++| -+.+.+|.+. T Consensus 1 Ma~~~~-Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~v 79 (135) T protein:vir:96 1 MAKVKY-GADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFENGGFTGVVKIGSNYAVYV 79 (135) T ss_pred Cchhhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEeecCcEEEEEecCCCccchh Confidence 777654 44444443333222111 0001222333333110 1122 3567899999 Q ss_pred hcCC---------------------------CCCCCchhhhHHHHHHHHHHHHHHH Q lcl|NC_019457. 55 HFGN---------------------------DRIPARPWLDVGVASVNDEILDTIA 83 (156) Q Consensus 55 E~G~---------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (156) |||| ..+|+||||+++++++++++.+.|. T Consensus 80 e~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 80 NYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQTFEQYFS 135 (135) T ss_pred hcccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHHHHHHHhcC Confidence 9997 2489999999999999999988888 No 78 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=96.83 E-value=8.9e-06 Score=48.26 Aligned_cols=85 Identities=13% Similarity=0.028 Sum_probs=65.9 Q ss_pred HHHHHHHHHHHHHHHhcc--ccHHHHHHHHHHHHHHHHHHHHHcC------CCCCCcHHHHHhcCCC-CchhHHHHHHhh Q lcl|NC_019457. 72 ASVNDEILDTIAASLEDG--EDISQLLNRVGVVAVAGVQNYIDEL------RSPANAPSTVERKGAD-NPLVDTGEMKQS 142 (156) Q Consensus 72 ~~~~~~~~~~~~~~~~~~--~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppns~~Ti~~KG~~-~PLidTG~L~~S 142 (156) .++-+++.+.+...+..- .+...+|..||..+....++.|... .|+|+|+.|.++||.. ++|.+++.+..+ T Consensus 1 m~~~~~l~~~L~~ll~~l~~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~~~~~~~~l~~~~~ 80 (148) T protein:vir:79 1 MSESRELEAWLAGMLTKLDAPARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRIRRAMFMRLRLARY 80 (148) T ss_pred CccHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccccccccchhhhhhh Confidence 222355555555555442 2335789999999999999999863 4779999999999864 699999999999 Q ss_pred ceeeecccccccCC Q lcl|NC_019457. 143 VTYNIQTGRPSEGL 156 (156) Q Consensus 143 Ity~V~~~k~~~g~ 156 (156) +++.+....+++|+ T Consensus 81 l~~~~~~~~~~v~~ 94 (148) T protein:vir:79 81 MKTQADANTAVVTF 94 (148) T ss_pred eeeeeeCCeeeEEe Confidence 99999888888854 No 79 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=96.82 E-value=1.6e-06 Score=52.33 Aligned_cols=82 Identities=12% Similarity=0.076 Sum_probs=48.3 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHh-------------------cCCcEEEEeccCCCC--CCCCC-----CCHHHHHHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLN-------------------SDEFVTVGIHEADNA--RPEGV-----LTNAQLGAIQ 54 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~-------------------~~~~V~VGi~~~~~~--~~~~g-----~~~A~ia~~~ 54 (156) |.++ +++.++..+.|.+.-+++. ..-.|.-|-+..+-. ..++| .+.+.+|.+. T Consensus 1 Ma~~-~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~v 79 (137) T protein:vir:93 1 MAKV-KYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYV 79 (137) T ss_pred Cchh-HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCccccc Confidence 6655 4555554444433222210 001122233332211 11122 3457899999 Q ss_pred hcCC-----------------------------CCCCCchhhhHHHHHHHHHHHHHHH Q lcl|NC_019457. 55 HFGN-----------------------------DRIPARPWLDVGVASVNDEILDTIA 83 (156) Q Consensus 55 E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (156) |||+ ..+|+||||++++++++..+.+.|. T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:93 80 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9998 2479999999999999999999988 No 80 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=96.82 E-value=1.6e-06 Score=52.33 Aligned_cols=82 Identities=12% Similarity=0.076 Sum_probs=48.3 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHh-------------------cCCcEEEEeccCCCC--CCCCC-----CCHHHHHHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLN-------------------SDEFVTVGIHEADNA--RPEGV-----LTNAQLGAIQ 54 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~-------------------~~~~V~VGi~~~~~~--~~~~g-----~~~A~ia~~~ 54 (156) |.++ +++.++..+.|.+.-+++. ..-.|.-|-+..+-. ..++| .+.+.+|.+. T Consensus 1 Ma~~-~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~v 79 (137) T protein:vir:94 1 MAKV-KYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYV 79 (137) T ss_pred Cchh-HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCccccc Confidence 6655 4555554444433222210 001122233332211 11122 3457899999 Q ss_pred hcCC-----------------------------CCCCCchhhhHHHHHHHHHHHHHHH Q lcl|NC_019457. 55 HFGN-----------------------------DRIPARPWLDVGVASVNDEILDTIA 83 (156) Q Consensus 55 E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (156) |||+ ..+|+||||++++++++..+.+.|. T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 80 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9998 2479999999999999999999988 No 81 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=96.82 E-value=1.6e-06 Score=52.33 Aligned_cols=82 Identities=12% Similarity=0.076 Sum_probs=48.3 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHh-------------------cCCcEEEEeccCCCC--CCCCC-----CCHHHHHHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLN-------------------SDEFVTVGIHEADNA--RPEGV-----LTNAQLGAIQ 54 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~-------------------~~~~V~VGi~~~~~~--~~~~g-----~~~A~ia~~~ 54 (156) |.++ +++.++..+.|.+.-+++. ..-.|.-|-+..+-. ..++| .+.+.+|.+. T Consensus 1 Ma~~-~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~v 79 (137) T protein:vir:97 1 MAKV-KYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYV 79 (137) T ss_pred Cchh-HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCccccc Confidence 6655 4555554444433222210 001122233332211 11122 3457899999 Q ss_pred hcCC-----------------------------CCCCCchhhhHHHHHHHHHHHHHHH Q lcl|NC_019457. 55 HFGN-----------------------------DRIPARPWLDVGVASVNDEILDTIA 83 (156) Q Consensus 55 E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (156) |||+ ..+|+||||++++++++..+.+.|. T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:97 80 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9998 2479999999999999999999988 No 82 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=96.79 E-value=4.4e-06 Score=49.94 Aligned_cols=82 Identities=17% Similarity=0.206 Sum_probs=45.0 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHh-------------------cCCcEEEEeccCCCC--CCCCC-----CCHHHHHHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLN-------------------SDEFVTVGIHEADNA--RPEGV-----LTNAQLGAIQ 54 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~-------------------~~~~V~VGi~~~~~~--~~~~g-----~~~A~ia~~~ 54 (156) |.++.. +.++..++|.+.-+++. ..-.|.-|-+..+-. ...+| -+.+.+|.+. T Consensus 13 Ma~~~~-Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~V 91 (149) T protein:vir:94 13 MAKVKY-GADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYAIYV 91 (149) T ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEeeCCcEEEEEecCCCccccc Confidence 777642 33333333222211110 000111233322211 11112 2447899999 Q ss_pred hcCCC-----------------------------CCCCchhhhHHHHHHHHHHHHHHH Q lcl|NC_019457. 55 HFGND-----------------------------RIPARPWLDVGVASVNDEILDTIA 83 (156) Q Consensus 55 E~G~~-----------------------------~IP~RpFlr~~~~~~~~~~~~~~~ 83 (156) ||||. .+||||||+++++++++++.+.|. T Consensus 92 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i~~~i~ 149 (149) T protein:vir:94 92 EYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred ccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 99972 379999999999999999988888 No 83 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=96.74 E-value=1.9e-06 Score=51.95 Aligned_cols=76 Identities=26% Similarity=0.437 Sum_probs=35.4 Q ss_pred Ceeee-----ecCcHHHHH---HHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCC-------CCCCch Q lcl|NC_019457. 1 MIKIT-----VPNFDAVRD---ELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGND-------RIPARP 65 (156) Q Consensus 1 m~~v~-----~~~~~~~~~---~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~-------~IP~Rp 65 (156) .++.. .......+. .|..-+.--.....|.|| ++..+|++|+||.. +||+|| T Consensus 76 ~~~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vG-------------tn~~YAaiHqfGg~~~~~~~v~iPaRp 142 (175) T protein:vir:10 76 NGELTAAASRRKAGLMILQDSGQMAASVSTDHDDNSAVIG-------------SNKEYAAIHQFGGQAGRGLKVTIPARP 142 (175) T ss_pred hhhhhhhhhhhccCCCcceechhhhhhhheeecCCEEEEe-------------cChhhhhhhhcccccCCCCccccCCcc Confidence 00000 000000000 111111111122234443 23467999999975 799999 Q ss_pred hhhHH---------HHHHHHHHHHHHHHHHhcc Q lcl|NC_019457. 66 WLDVG---------VASVNDEILDTIAASLEDG 89 (156) Q Consensus 66 Flr~~---------~~~~~~~~~~~~~~~~~~~ 89 (156) ||=-. .++..+.+.++|..++.+. T Consensus 143 fLG~s~~d~~~~e~~~~Il~~~~~~l~~~~~~~ 175 (175) T protein:vir:10 143 WLPVTADGELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred ccCCCcccccchHHHHHHHHHHHHHHHHHhccC Confidence 99432 1233455556666677666 No 84 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=96.74 E-value=1.3e-05 Score=47.34 Aligned_cols=85 Identities=7% Similarity=0.032 Sum_probs=63.1 Q ss_pred HHHHHHHHHHHHHHHHhc--cccHHHHHHHHHHHHHHHHHHHHHcC------CCCCCcHHHHHhcCC--CCchhHHHHHH Q lcl|NC_019457. 71 VASVNDEILDTIAASLED--GEDISQLLNRVGVVAVAGVQNYIDEL------RSPANAPSTVERKGA--DNPLVDTGEMK 140 (156) Q Consensus 71 ~~~~~~~~~~~~~~~~~~--~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppns~~Ti~~KG~--~~PLidTG~L~ 140 (156) +++ -+++.+.+...+.. ......+|..||..+...+++.|... .|+|+++.|++.|.. .++|..++.+. T Consensus 1 m~~-~~~~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~~~~~~~~~l~~~ 79 (149) T protein:vir:18 1 MSE-LTALQERLAGLIASLSPAARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGRIKREMFAKLRTS 79 (149) T ss_pred Cch-HHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCcccchhhhhhhhh Confidence 222 23333444444432 12346799999999999999999874 688999999987653 46999999999 Q ss_pred hhceeeecccccccCC Q lcl|NC_019457. 141 QSVTYNIQTGRPSEGL 156 (156) Q Consensus 141 ~SIty~V~~~k~~~g~ 156 (156) .++++.+....+.+|. T Consensus 80 ~~l~~~~~~~~~~v~~ 95 (149) T protein:vir:18 80 RFMKAKGSDSAAVVEF 95 (149) T ss_pred hhhheeecCceeEEEe Confidence 9999999888888754 No 85 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=96.72 E-value=1.3e-06 Score=52.83 Aligned_cols=82 Identities=17% Similarity=0.206 Sum_probs=45.7 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHh-------------------cCCcEEEEeccCCCC--CCCCC-----CCHHHHHHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLN-------------------SDEFVTVGIHEADNA--RPEGV-----LTNAQLGAIQ 54 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~-------------------~~~~V~VGi~~~~~~--~~~~g-----~~~A~ia~~~ 54 (156) |.++.. +.++..+.|.+.-+++. ..-.|.-|.+..+-. ...+| .+.+.+|.+. T Consensus 13 Ma~v~~-Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~~~~g~~~~V~~~~~YA~~v 91 (149) T protein:vir:10 13 MAKVKY-GADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYAIYV 91 (149) T ss_pred hHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEecCCcEEEEEecCCCccccc Confidence 777632 33333333322211110 000112233332211 11112 2447899999 Q ss_pred hcCCC-----------------------------CCCCchhhhHHHHHHHHHHHHHHH Q lcl|NC_019457. 55 HFGND-----------------------------RIPARPWLDVGVASVNDEILDTIA 83 (156) Q Consensus 55 E~G~~-----------------------------~IP~RpFlr~~~~~~~~~~~~~~~ 83 (156) ||||. .+||||||+++++++++++.+.|. T Consensus 92 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i~~~i~ 149 (149) T protein:vir:10 92 EYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred ccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 99972 379999999999999999999888 No 86 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=96.71 E-value=3.8e-06 Score=50.31 Aligned_cols=89 Identities=13% Similarity=0.218 Sum_probs=53.3 Q ss_pred CeeeeecCcHH-HHHHHHHH-----------HH-----------HHhcCC--cEEEEeccCCCCCCCCC-------CCHH Q lcl|NC_019457. 1 MIKITVPNFDA-VRDELTKA-----------LN-----------KLNSDE--FVTVGIHEADNARPEGV-------LTNA 48 (156) Q Consensus 1 m~~v~~~~~~~-~~~~l~~~-----------l~-----------~l~~~~--~V~VGi~~~~~~~~~~g-------~~~A 48 (156) |-+|++++..+ +.+.|... ++ ..+.++ ...=+|-..... +.++ -+-. T Consensus 1 Ma~i~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~-~~g~~~~vv~~~~~~ 79 (126) T protein:vir:81 1 MANITIDRLADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKED-GYGTTKRIIWNKKHY 79 (126) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccchhhccccccccc-cCCcceEEEeccCCC Confidence 99999998644 32323211 11 111110 011122211111 1111 1113 Q ss_pred HHHHHHhcCCCC-----CCCchhhhHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_019457. 49 QLGAIQHFGNDR-----IPARPWLDVGVASVNDEILDTIAASLEDGE 90 (156) Q Consensus 49 ~ia~~~E~G~~~-----IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 90 (156) .++-+.|||+-. +|+||||+|+++...+++.+.++.+|.||. T Consensus 80 ~l~HLLEfGha~r~gGrV~a~Phi~Pa~e~~~~~~~~~i~~~l~~gg 126 (126) T protein:vir:81 80 RRVHLLEFGHAKVNGGRVKEYPHLRPAYDKHGARLPDELKRVIENGG 126 (126) T ss_pred CceeeeecceecCCCCccCCCcchHHHHHHHHHHHHHHHHHHhhcCC Confidence 457789999863 899999999999999999999999999877 No 87 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=96.69 E-value=7.4e-06 Score=48.72 Aligned_cols=73 Identities=22% Similarity=0.393 Sum_probs=34.9 Q ss_pred Ceee---------------------------------eecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCH Q lcl|NC_019457. 1 MIKI---------------------------------TVPNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPEGVLTN 47 (156) Q Consensus 1 m~~v---------------------------------~~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~ 47 (156) |... ...+.. .|...+.--.....|.||. + T Consensus 38 l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~~~~L~~tg----~L~~Si~~~~~~~~v~vGt-------------n 100 (190) T protein:vir:99 38 LLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNRDKILTLDG----HLRNLLRYQLDGSELLFGS-------------D 100 (190) T ss_pred HHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCCCccceecH----HHHHHHhheecCcEEEEec-------------C Confidence 0000 000111 2333333222344577762 3 Q ss_pred HHHHHHHhcCC--------------------------------------------CCCCCchhhhHH---HHHHHHHHHH Q lcl|NC_019457. 48 AQLGAIQHFGN--------------------------------------------DRIPARPWLDVG---VASVNDEILD 80 (156) Q Consensus 48 A~ia~~~E~G~--------------------------------------------~~IP~RpFlr~~---~~~~~~~~~~ 80 (156) ..+|++|+||. .+||+||||--. -.+..+.+.+ T Consensus 101 ~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~v~IPaRpfLG~s~~d~~~I~~~i~~ 180 (190) T protein:vir:99 101 RPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVPRRRSNFAQDVQIGPYTIQMPARPWLGTSSQDDDTILQRVER 180 (190) T ss_pred cchhhhhhcCCcccccccchhhhhhhhhhhhhhhcccccccccccchhcccccceeeecCcccCCCCHHHHHHHHHHHHH Confidence 45799999993 258999999332 2222333444 Q ss_pred HHHHHHhccc Q lcl|NC_019457. 81 TIAASLEDGE 90 (156) Q Consensus 81 ~~~~~~~~~~ 90 (156) .|+.++.... T Consensus 181 ~l~~~~~~~~ 190 (190) T protein:vir:99 181 YLQRALRERA 190 (190) T ss_pred HHHHHHhhcC Confidence 5555555444 No 88 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=96.67 E-value=2.1e-06 Score=51.66 Aligned_cols=82 Identities=13% Similarity=0.101 Sum_probs=47.6 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHh-------------------cCCcEEEEeccCCCC--CCCCC-----CCHHHHHHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLN-------------------SDEFVTVGIHEADNA--RPEGV-----LTNAQLGAIQ 54 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~-------------------~~~~V~VGi~~~~~~--~~~~g-----~~~A~ia~~~ 54 (156) |.++ +++.+...+.|.+.-.++. ..-.|.-|-+..+-. ..++| .+.+.+|.+. T Consensus 1 Ma~~-~~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~~~~~~~~V~~~~~YA~~v 79 (137) T protein:vir:95 1 MAKV-KYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYV 79 (137) T ss_pred Cchh-HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEeeCCceEEEEecCCCccccc Confidence 6554 4455544444432221110 001122233333211 11122 3457899999 Q ss_pred hcCC-----------------------------CCCCCchhhhHHHHHHHHHHHHHHH Q lcl|NC_019457. 55 HFGN-----------------------------DRIPARPWLDVGVASVNDEILDTIA 83 (156) Q Consensus 55 E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (156) |||+ ..+|+||||+++++++++++.+.|. T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 80 NYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9998 2479999999999999999999888 No 89 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=96.66 E-value=4.5e-06 Score=49.87 Aligned_cols=77 Identities=30% Similarity=0.534 Sum_probs=36.7 Q ss_pred Ceeeeec-------CcHHHH---HHHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCC-------CCCC Q lcl|NC_019457. 1 MIKITVP-------NFDAVR---DELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGND-------RIPA 63 (156) Q Consensus 1 m~~v~~~-------~~~~~~---~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~-------~IP~ 63 (156) .-.-+.. .....| -.|..-+.--.....|.|| ++..+|++|+||.. +||+ T Consensus 57 lsp~t~~~r~k~g~~~~~~L~~tG~L~~Si~~~~~~~~v~vG-------------tn~~YA~iHqfGg~~~~~~~~~iPA 123 (155) T protein:vir:10 57 LSPVTVAARAAKGRGAHPILQVTNALARSITTRADRDQAQIG-------------SNLSYAAIQQLGGQAGRGRKVTIPA 123 (155) T ss_pred CCccchHHHHhccCCCCCccccchhhhhhhhceecCCEEEEe-------------cCcchhhhhhcccccCCCCccccCC Confidence 1100000 000000 0122222222233446665 23457999999973 6999 Q ss_pred chhhhHH-HH----HHHHHHHHHHHHHHhccc Q lcl|NC_019457. 64 RPWLDVG-VA----SVNDEILDTIAASLEDGE 90 (156) Q Consensus 64 RpFlr~~-~~----~~~~~~~~~~~~~~~~~~ 90 (156) ||||=-. -+ +..+.+.+.+.+.|..+. T Consensus 124 RPfLG~s~~~e~~~ei~~~I~~~i~~~l~~~r 155 (155) T protein:vir:10 124 RPYLPVLRNGQLKPSARDAVLDVLLAALSQGR 155 (155) T ss_pred ccccCCCccccchHHHHHHHHHHHHHHHhhcC Confidence 9999321 11 233555566667776555 No 90 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=96.66 E-value=3.5e-06 Score=50.52 Aligned_cols=82 Identities=12% Similarity=0.109 Sum_probs=44.1 Q ss_pred CeeeeecCcHHHHHHHHHHHHHH-----------h----c----CCcEEEEeccCCCC--CCCCC-----CCHHHHHHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKL-----------N----S----DEFVTVGIHEADNA--RPEGV-----LTNAQLGAIQ 54 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l-----------~----~----~~~V~VGi~~~~~~--~~~~g-----~~~A~ia~~~ 54 (156) |.++. .+.+++.++|.+.-+++ . . .-.|.-|-+..+-. ...+| .+.+.+|.+. T Consensus 1 Ma~~~-~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~~v 79 (137) T protein:vir:10 1 MAKVK-YGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEYAVYV 79 (137) T ss_pred CchhH-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeCCcEEEEEecCCCccccc Confidence 77763 24333333222211110 0 0 00111122222210 01111 2446789999 Q ss_pred hcCCC-----------------------------CCCCchhhhHHHHHHHHHHHHHHH Q lcl|NC_019457. 55 HFGND-----------------------------RIPARPWLDVGVASVNDEILDTIA 83 (156) Q Consensus 55 E~G~~-----------------------------~IP~RpFlr~~~~~~~~~~~~~~~ 83 (156) ||||. .+|+||||+++++++++++.+.|. T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 80 NYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Confidence 99962 379999999999999999998888 No 91 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=96.55 E-value=4.4e-06 Score=49.94 Aligned_cols=81 Identities=16% Similarity=0.176 Sum_probs=41.9 Q ss_pred CeeeeecCcHHHHHHHHHH--HHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCC----------CCCCchhhh Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKA--LNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGND----------RIPARPWLD 68 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~--l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~----------~IP~RpFlr 68 (156) .-+-.......++..|... |..-.+...+.|||.+ ++..||++|.||.. +||+||||- T Consensus 64 ~~~~~~~~~~~m~~~l~~~~~l~~~~~~~~a~vg~~G----------s~~~yA~iHQfG~~~~~~~~~~~v~iPaRp~LG 133 (156) T protein:vir:11 64 GKQGRIRRKIKMFQKLRTVRYLRAKGDAQAITVSFAG----------RIARIARVHQYGLRDRAEPGAPEVSYAQRLLLG 133 (156) T ss_pred hhccccccchhhhhhhhhhheeeeeecCcEEEEEecC----------CchhhhhhhcccccccccCCCCcccccccccCC Confidence 1110111111122222111 1111133457777752 35678999999964 699999994 Q ss_pred HHHHHHHHHHHHHHHHHHhccccH Q lcl|NC_019457. 69 VGVASVNDEILDTIAASLEDGEDI 92 (156) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~ 92 (156) -+ .+..+++.+.+.+.+.+..-. T Consensus 134 ~s-~~d~~~i~~~i~~~l~~~~~~ 156 (156) T protein:vir:11 134 FD-SSDMETIQNGILAHIDANSPI 156 (156) T ss_pred CC-HHHHHHHHHHHHHHHhhcCCC Confidence 33 234566667776666664433 No 92 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=96.55 E-value=3.4e-06 Score=50.56 Aligned_cols=82 Identities=13% Similarity=0.111 Sum_probs=46.0 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHh-------------------cCCcEEEEeccCCCC--CCCCC-----CCHHHHHHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLN-------------------SDEFVTVGIHEADNA--RPEGV-----LTNAQLGAIQ 54 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~-------------------~~~~V~VGi~~~~~~--~~~~g-----~~~A~ia~~~ 54 (156) |.++.. +.+...++|.+.-+++. ..-.|.-|-+..+-. ..++| .+.+.+|.+. T Consensus 1 Ma~~~~-G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~v 79 (137) T protein:vir:10 1 MAKVKY-GNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEYAVYV 79 (137) T ss_pred Cccchh-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEecCCcEEEEEecCCcccccc Confidence 777753 44333333322111110 000122233332210 01122 2446899999 Q ss_pred hcCCC-----------------------------CCCCchhhhHHHHHHHHHHHHHHH Q lcl|NC_019457. 55 HFGND-----------------------------RIPARPWLDVGVASVNDEILDTIA 83 (156) Q Consensus 55 E~G~~-----------------------------~IP~RpFlr~~~~~~~~~~~~~~~ 83 (156) ||||. .+||||||+++++++++++.+.|. T Consensus 80 E~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 80 NYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred ccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 99962 389999999999999999999888 No 93 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=96.53 E-value=7.6e-06 Score=48.66 Aligned_cols=69 Identities=22% Similarity=0.477 Sum_probs=34.7 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCC--------CCCCchhhhHHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGND--------RIPARPWLDVGVA 72 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~--------~IP~RpFlr~~~~ 72 (156) .-++ ..+.. .|...+.--.+...|.||. +..||++|+||.. +||+||||--+ + T Consensus 76 ~~~~-L~~tg----~L~~Si~~~~~~~~v~vGt-------------~~~yA~vHqfG~~~~~~~~~~~iPaRpfLG~s-~ 136 (156) T protein:vir:19 76 PGSI-LTLHG----DLARSITTDYGQDYALIGS-------------PKIYAAIHQWGGTPDMAPRPAGVPARPYMGLD-K 136 (156) T ss_pred CCcc-hhhhH----HHHHHhhheecCCEEEEec-------------chhhhHHhhcCcccccCCCccccCCccccCCC-H Confidence 1111 11111 2332232222344567762 3468999999964 69999999432 2 Q ss_pred HHHHHHHHH----HHHHHhc Q lcl|NC_019457. 73 SVNDEILDT----IAASLED 88 (156) Q Consensus 73 ~~~~~~~~~----~~~~~~~ 88 (156) +..+++.+. |..++.. T Consensus 137 ~d~~~I~~~i~~~l~~~~~~ 156 (156) T protein:vir:19 137 TGEQEIFDAIRKRVSAALRQ 156 (156) T ss_pred HHHHHHHHHHHHHHHHHhhC Confidence 334444443 4444444 No 94 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=96.48 E-value=4.6e-06 Score=49.82 Aligned_cols=73 Identities=23% Similarity=0.431 Sum_probs=38.1 Q ss_pred Ceee--------------------------------eecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHH Q lcl|NC_019457. 1 MIKI--------------------------------TVPNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPEGVLTNA 48 (156) Q Consensus 1 m~~v--------------------------------~~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A 48 (156) |... ...+.. .|...+.--.....|.|| ++. T Consensus 39 l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~~tg----~L~~Si~~~~~~~~v~vG-------------tn~ 101 (155) T protein:vir:99 39 LLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQVTN----ALARSVTTWADRNEAGIG-------------SNL 101 (155) T ss_pred HHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcchhch----hhhhhhhceecCCEEEEe-------------cCc Confidence 0000 000111 122222222233345555 234 Q ss_pred HHHHHHhcCCC-------CCCCchhhhHHH-----HHHHHHHHHHHHHHHhccc Q lcl|NC_019457. 49 QLGAIQHFGND-------RIPARPWLDVGV-----ASVNDEILDTIAASLEDGE 90 (156) Q Consensus 49 ~ia~~~E~G~~-------~IP~RpFlr~~~-----~~~~~~~~~~~~~~~~~~~ 90 (156) .+|++|+||.. +||+||||=-.- .+..+++.+.+.+.|..+. T Consensus 102 ~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~e~~~~I~~~i~~~l~~~~ 155 (155) T protein:vir:99 102 VYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAAGARQSILEIVLTALSRNR 155 (155) T ss_pred cchhhhhcccccCCCCccccCCccccCCCCccccchHHHHHHHHHHHHHHhccC Confidence 57999999964 799999993221 2345667777777776665 No 95 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=96.44 E-value=5.4e-06 Score=49.45 Aligned_cols=77 Identities=13% Similarity=0.200 Sum_probs=38.8 Q ss_pred CeeeeecCcHH-------HHHH---------------------HHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHH Q lcl|NC_019457. 1 MIKITVPNFDA-------VRDE---------------------LTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGA 52 (156) Q Consensus 1 m~~v~~~~~~~-------~~~~---------------------l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~ 52 (156) |.+.+...+.. -|.. +...|........+.|||. | ++..||+ T Consensus 34 l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~~~~~~~~l~~~~~l~~~~~~~~~~v~~~---------G-t~~~yAa 103 (148) T protein:vir:79 34 LRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRIRRAMFMRLRLARYMKTQADANTAVVTFA---------G-NAQRIAT 103 (148) T ss_pred HHHHHHHHHHhhcCCCCCcCcccchHHHhhcccccccccchhhhhhheeeeeeCCeeeEEee---------c-cchhhhh Confidence 11110000000 0110 1111111112234556653 2 3568999 Q ss_pred HHhcCC----------CCCCCchhhhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 53 IQHFGN----------DRIPARPWLDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 53 ~~E~G~----------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (156) +|.||. ++||+||||=-+ .+...++.+.+...|.+ T Consensus 104 iHQfG~~~r~~~~~~~v~iPaRp~LG~s-~~d~~~i~~~i~~~l~~ 148 (148) T protein:vir:79 104 VHQFGLRDRVNKAGLTAQYPARELLGMD-GVDMEHITNLLLLHLGA 148 (148) T ss_pred hhhcCccccccCCCCccccCcccccCCC-HHHHHHHHHHHHHHhcC Confidence 999993 369999999433 23456677777777777 No 96 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=96.35 E-value=2.7e-05 Score=45.64 Aligned_cols=86 Identities=9% Similarity=0.072 Sum_probs=66.5 Q ss_pred HHHHHHHHHHHHHHHHhc--cccHHHHHHHHHHHHHHHHHHHHHcC------CCCCCcHHHHHhc-----C--CCCchhH Q lcl|NC_019457. 71 VASVNDEILDTIAASLED--GEDISQLLNRVGVVAVAGVQNYIDEL------RSPANAPSTVERK-----G--ADNPLVD 135 (156) Q Consensus 71 ~~~~~~~~~~~~~~~~~~--~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppns~~Ti~~K-----G--~~~PLid 135 (156) +.++-.++.+.+...+.. ..+...+|..||..+....++.|... .|+|+++.|..++ | ...+|.+ T Consensus 1 m~~~~~~l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~~~~~~m~~ 80 (155) T protein:vir:79 1 MTDDLQALERWAGGLLAKLSPAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGRVKREAMFR 80 (155) T ss_pred CchHHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCcccchhhhh Confidence 555666666666666643 23456799999999999999999874 5778888886533 3 2467999 Q ss_pred HHHHHhhceeeecccccccCC Q lcl|NC_019457. 136 TGEMKQSVTYNIQTGRPSEGL 156 (156) Q Consensus 136 TG~L~~SIty~V~~~k~~~g~ 156 (156) ++.+-.+|+|.+....+++|+ T Consensus 81 ~l~~a~~l~~~~~~d~a~Vg~ 101 (155) T protein:vir:79 81 KLRTARYLRIDVDSTGLAIGF 101 (155) T ss_pred hhhhhheeeeeecCcEEEEEe Confidence 999999999999999999965 No 97 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=96.33 E-value=9.9e-06 Score=48.03 Aligned_cols=77 Identities=19% Similarity=0.214 Sum_probs=39.7 Q ss_pred Ce-------eeeecC-cHHHHH--HHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCC----------CC Q lcl|NC_019457. 1 MI-------KITVPN-FDAVRD--ELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGN----------DR 60 (156) Q Consensus 1 m~-------~v~~~~-~~~~~~--~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~----------~~ 60 (156) .. +..... ...... .+...|.--.+...+.|||..| +++.||++|.||. .+ T Consensus 53 W~p~k~~~~~~k~g~~~~~l~~~~~l~~sl~~~~~~~~~~vg~~~G---------s~~~yAa~HQfG~~~~~~~~~~~~~ 123 (150) T protein:vir:20 53 YAPRQQQSVRKKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGG---------KSPKIASVHQFGLSEENRKDGKKID 123 (150) T ss_pred CcccchHHHHHhccCCCccccchhhhhhhhheeecCcEEEEEeeCC---------cchhhhhhhhcccccccccCCCcee Confidence 00 000000 000001 1222232223345678888754 3567999999994 36 Q ss_pred CCCchhhhHHHHH-HHHHHHHHHHHHHhc Q lcl|NC_019457. 61 IPARPWLDVGVAS-VNDEILDTIAASLED 88 (156) Q Consensus 61 IP~RpFlr~~~~~-~~~~~~~~~~~~~~~ 88 (156) ||+||||= +.+ ..+++.+.+.+.+.. T Consensus 124 iPaRp~LG--~s~~d~~~i~~~i~~~l~k 150 (150) T protein:vir:20 124 YPARPLLG--FTGEDVQMIEEIILAHLER 150 (150) T ss_pred ccccccCC--CCHHHHHHHHHHHHHHHhC Confidence 99999994 443 345555555555554 No 98 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=96.14 E-value=7.1e-06 Score=48.82 Aligned_cols=78 Identities=18% Similarity=0.213 Sum_probs=37.8 Q ss_pred Ceeee-------e-cCcHHHHHH--HHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCC----------C Q lcl|NC_019457. 1 MIKIT-------V-PNFDAVRDE--LTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGND----------R 60 (156) Q Consensus 1 m~~v~-------~-~~~~~~~~~--l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~----------~ 60 (156) .-..+ . ......+.. +...|.--.+...+.|||..| +++.||++|.||.. + T Consensus 53 W~p~~~~~~~~k~~~~~~~l~~~~~l~~sl~~~~~~~~a~vg~~~G---------t~~~yAaiHQfG~~~~~~~~~~~~~ 123 (150) T protein:vir:60 53 YAPRQQQSARKKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGG---------KSPKIASVHQFGLSEENRKDGKKID 123 (150) T ss_pred CcccChHHHHHhhcCCCccchhhhhhcceeeeeeeCcEEEEEeeCC---------CchhhhhhhhccccccccCCCCcee Confidence 10000 0 000001111 111111111234577787643 35689999999942 6 Q ss_pred CCCchhhhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 61 IPARPWLDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 61 IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (156) ||+||||=-+ ++...++.+.+...+.. T Consensus 124 iPaRp~LG~s-~~d~~~i~~~i~~~l~r 150 (150) T protein:vir:60 124 YPARPLLGFT-GEDVQMIEEIILAHLDR 150 (150) T ss_pred cCCcccCCCC-HHHHHHHHHHHHHHHhC Confidence 9999999433 22345555555555544 No 99 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=96.13 E-value=6e-05 Score=43.71 Aligned_cols=86 Identities=8% Similarity=0.047 Sum_probs=62.0 Q ss_pred HHHHHHHHHHHHHHHHhcc--ccHHHHHHHHHHHHHHHHHHHHHcC------CCCCCcHHHHHhcCCCCchhHHHHHHhh Q lcl|NC_019457. 71 VASVNDEILDTIAASLEDG--EDISQLLNRVGVVAVAGVQNYIDEL------RSPANAPSTVERKGADNPLVDTGEMKQS 142 (156) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~--~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppns~~Ti~~KG~~~PLidTG~L~~S 142 (156) ++++-.++.+.|...+..- .+...+|..||..+....++.|... .|+|+++.+..+|+..+-......|+.| T Consensus 1 M~~~~~~~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~k~~~~~~~m~~~L~~a 80 (152) T protein:vir:10 1 MSEPIEQVKTAFDSLLNNISKPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGVKSKIKSGKMFDKITQP 80 (152) T ss_pred CchHHHHHHHHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhhcccccchhHHHhhhhc Confidence 5555555666666555542 3446799999999999999999976 5778888887777755544445556554 Q ss_pred --ceeeecccccccCC Q lcl|NC_019457. 143 --VTYNIQTGRPSEGL 156 (156) Q Consensus 143 --Ity~V~~~k~~~g~ 156 (156) ++|+.....+++|. T Consensus 81 ~~l~~~a~~~~~~Vg~ 96 (152) T protein:vir:10 81 RFMRLRLESEGVSLGY 96 (152) T ss_pred ceeeeeecCcEEEEEe Confidence 78888888888865 No 100 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=96.11 E-value=8.4e-06 Score=48.40 Aligned_cols=78 Identities=18% Similarity=0.214 Sum_probs=38.0 Q ss_pred Ceee-------ee-cCcHHHHHH--HHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCC----------C Q lcl|NC_019457. 1 MIKI-------TV-PNFDAVRDE--LTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGND----------R 60 (156) Q Consensus 1 m~~v-------~~-~~~~~~~~~--l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~----------~ 60 (156) .-.. .. ......+.. +...|.--.+...+.|||..| ++..||++|.||.. + T Consensus 53 W~p~k~~~~~~k~~~~~~~l~~~~~l~~sl~~~~~~~~a~vg~~~G---------~~~~yAaiHQfG~~~r~~~~~~~~~ 123 (150) T protein:vir:57 53 YAPRQQQSARKKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGG---------KSPKIASVHQFGLSEETRKDGKKID 123 (150) T ss_pred CcccChHHHHHhccCCCcccchhhhhccceeeeeeCcEEEEEeecC---------CchhhhhhhhccccccccCCCceee Confidence 1000 00 000001111 111111112334567777643 35689999999942 6 Q ss_pred CCCchhhhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 61 IPARPWLDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 61 IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (156) ||+||||=-+ ++...++.+.+...|.. T Consensus 124 iPaRp~LG~s-~~d~~~i~~~i~~~l~r 150 (150) T protein:vir:57 124 YPARPLLGFT-GEDVQMIEEIILAHLDR 150 (150) T ss_pred cCCcccCCCC-HHHHHHHHHHHHHHHhC Confidence 9999999433 22345555555555554 No 101 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=96.07 E-value=6.9e-06 Score=48.89 Aligned_cols=90 Identities=10% Similarity=0.033 Sum_probs=50.0 Q ss_pred CeeeeecC---------cHHHHHHHHHH-HHH----Hh--c--CCcEEEEeccCCCCCC--CCC-----CCHHHHHHHHh Q lcl|NC_019457. 1 MIKITVPN---------FDAVRDELTKA-LNK----LN--S--DEFVTVGIHEADNARP--EGV-----LTNAQLGAIQH 55 (156) Q Consensus 1 m~~v~~~~---------~~~~~~~l~~~-l~~----l~--~--~~~V~VGi~~~~~~~~--~~g-----~~~A~ia~~~E 55 (156) |-.+..+. .+...+.+.+. ++. +. . .-.|.-|-+..+-..+ .+| -+.+.+|.+.| T Consensus 1 ~~~~~f~~~~~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~~g~~~~V~~~~~YA~yVE 80 (141) T protein:vir:78 1 MNEFEFDSNIPKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRKSSKEVIVGNSSDYAIYYE 80 (141) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeecCCcEEEEecCCCccceee Confidence 33332221 12222222221 110 00 0 0123334444332111 122 25678999999 Q ss_pred cCCC--------------------------CCCCchhhhHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_019457. 56 FGND--------------------------RIPARPWLDVGVASVNDEILDTIAASLEDGE 90 (156) Q Consensus 56 ~G~~--------------------------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 90 (156) |||. ..|+||||++++.++++++.+.|++.+.|=. T Consensus 81 ~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i~~~i~~~~~~l~ 141 (141) T protein:vir:78 81 FGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKVRVFTERALRGIN 141 (141) T ss_pred cCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHHHHHHHHHhhccC Confidence 9983 3799999999999999999999999887633 No 102 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=95.90 E-value=2.6e-05 Score=45.70 Aligned_cols=77 Identities=25% Similarity=0.448 Sum_probs=37.0 Q ss_pred CeeeeecCc-------------------------HHHH---HHHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHH Q lcl|NC_019457. 1 MIKITVPNF-------------------------DAVR---DELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGA 52 (156) Q Consensus 1 m~~v~~~~~-------------------------~~~~---~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~ 52 (156) |...+...+ ...| -.|..-+.--.....|.|| ++..+|+ T Consensus 39 l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~~tG~L~~Si~~~~~~~~v~vG-------------t~~~YA~ 105 (155) T protein:vir:79 39 LLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQVTNALARSVTTWADRNEAGIG-------------SNLVYAA 105 (155) T ss_pred HHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccccchhhhhhhhceecCCEEEEe-------------cCchhhh Confidence 000000000 0000 0122222221223345554 2345899 Q ss_pred HHhcCCC-------CCCCchhhhHHH-----HHHHHHHHHHHHHHHhccc Q lcl|NC_019457. 53 IQHFGND-------RIPARPWLDVGV-----ASVNDEILDTIAASLEDGE 90 (156) Q Consensus 53 ~~E~G~~-------~IP~RpFlr~~~-----~~~~~~~~~~~~~~~~~~~ 90 (156) +|+||.. +||+||||=-.- .+-.+++.+.+.+.|..+. T Consensus 106 iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~~~~~~I~~~i~~~l~r~r 155 (155) T protein:vir:79 106 IHQFGGDAGRGHQVEIPARRYLPFDENGQLAAGARQSILEVVLTALSRNR 155 (155) T ss_pred hhhcccccCCCCccccCCccccCCCCccccchHHHHHHHHHHHHHHHhcC Confidence 9999964 799999993221 2334567777777775555 No 103 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=95.87 E-value=2.2e-05 Score=46.13 Aligned_cols=77 Identities=14% Similarity=0.198 Sum_probs=36.8 Q ss_pred Cee---------eee-cC---cHHHHHHHH--HHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCC------ Q lcl|NC_019457. 1 MIK---------ITV-PN---FDAVRDELT--KALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGND------ 59 (156) Q Consensus 1 m~~---------v~~-~~---~~~~~~~l~--~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~------ 59 (156) .-. ... ++ ...++..+. +.|.--.....|.|||.+ +++.||++|.||.. T Consensus 54 W~prk~~~~~~~~~~~~g~~~~~~m~~~l~~a~~l~~~~~~d~a~Vg~~G----------s~~~yAaiHQfG~~~r~~~~ 123 (155) T protein:vir:79 54 YEPRKVKAGGKRLREKAGRVKREAMFRKLRTARYLRIDVDSTGLAIGFDE----------RLSRIARVHQEGQKAPVEPG 123 (155) T ss_pred CcccchhhhhhhhhcccCcccchhhhhhhhhhheeeeeecCcEEEEEecC----------cchhhhhhhhcCCcccCCCC Confidence 110 000 00 000111111 101111123346777642 45779999999943 Q ss_pred ----CCCCchhhhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 60 ----RIPARPWLDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 60 ----~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (156) +||+||||=-+ ++...++.+.+...|.. T Consensus 124 ~~~v~iPaRp~LGls-~~d~~~I~~~i~~~l~r 155 (155) T protein:vir:79 124 GPLAQYPVRVVLGFS-DADRELVRDRLLRELTR 155 (155) T ss_pred CcccccccccccCCC-HHHHHHHHHHHHHHhhC Confidence 69999999332 23345565555555544 No 104 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=95.76 E-value=1.4e-05 Score=47.21 Aligned_cols=79 Identities=10% Similarity=0.133 Sum_probs=41.0 Q ss_pred eeeeec-CcHHHHHHHHHHHHHHhcCCcEEEEeccCCCC--CCCCC-----CCHHHHHHHHhcC---------------- Q lcl|NC_019457. 2 IKITVP-NFDAVRDELTKALNKLNSDEFVTVGIHEADNA--RPEGV-----LTNAQLGAIQHFG---------------- 57 (156) Q Consensus 2 ~~v~~~-~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~--~~~~g-----~~~A~ia~~~E~G---------------- 57 (156) |+--++ ...+.-..+....+.+. .|.-|-+..+-. ..++| -+.+.+|.+.||| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~a---Pv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~ 77 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLM---PVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKI 77 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhC---CcCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCccccccc Confidence 111111 11111111222222221 122233322210 01122 2457799999999 Q ss_pred -------------CCCCCCchhhhHHHHHHHHHHHHHHH Q lcl|NC_019457. 58 -------------NDRIPARPWLDVGVASVNDEILDTIA 83 (156) Q Consensus 58 -------------~~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (156) +..+|+||||++++++++..+.+.|. T Consensus 78 ~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 78 PWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 33589999999999999999888887 No 105 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=95.76 E-value=1.4e-05 Score=47.21 Aligned_cols=79 Identities=10% Similarity=0.133 Sum_probs=41.0 Q ss_pred eeeeec-CcHHHHHHHHHHHHHHhcCCcEEEEeccCCCC--CCCCC-----CCHHHHHHHHhcC---------------- Q lcl|NC_019457. 2 IKITVP-NFDAVRDELTKALNKLNSDEFVTVGIHEADNA--RPEGV-----LTNAQLGAIQHFG---------------- 57 (156) Q Consensus 2 ~~v~~~-~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~--~~~~g-----~~~A~ia~~~E~G---------------- 57 (156) |+--++ ...+.-..+....+.+. .|.-|-+..+-. ..++| -+.+.+|.+.||| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~a---Pv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~ 77 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLM---PVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKI 77 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhC---CcCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCccccccc Confidence 111111 11111111222222221 122233322210 01122 2457799999999 Q ss_pred -------------CCCCCCchhhhHHHHHHHHHHHHHHH Q lcl|NC_019457. 58 -------------NDRIPARPWLDVGVASVNDEILDTIA 83 (156) Q Consensus 58 -------------~~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (156) +..+|+||||++++++++..+.+.|. T Consensus 78 ~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 78 PWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 33589999999999999999888887 No 106 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=95.72 E-value=1.3e-05 Score=47.36 Aligned_cols=79 Identities=10% Similarity=0.133 Sum_probs=40.4 Q ss_pred eeeeec-CcHHHHHHHHHHHHHHhcCCcEEEEeccCCCC--CCCCC-----CCHHHHHHHHhcCC--------------- Q lcl|NC_019457. 2 IKITVP-NFDAVRDELTKALNKLNSDEFVTVGIHEADNA--RPEGV-----LTNAQLGAIQHFGN--------------- 58 (156) Q Consensus 2 ~~v~~~-~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~--~~~~g-----~~~A~ia~~~E~G~--------------- 58 (156) |.--++ ...+.-..+....+.+. .|.-|-+..+-. ..++| -+.+.+|.+.|||| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~a---pv~TG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~yvE~GTg~~~~~~~~~~~~~~ 77 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLM---PVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKNI 77 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhC---CccccccccceeEEeecCcEEEEEecCCCccceeecCccccccCCCccccccc Confidence 111111 11111111222222221 122233322210 01112 24577899999993 Q ss_pred --------------CCCCCchhhhHHHHHHHHHHHHHHH Q lcl|NC_019457. 59 --------------DRIPARPWLDVGVASVNDEILDTIA 83 (156) Q Consensus 59 --------------~~IP~RpFlr~~~~~~~~~~~~~~~ 83 (156) ..+||||||++++++++..+.+.|. T Consensus 78 ~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 78 PWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred cceeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 3589999999999999998888887 No 107 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=95.51 E-value=1.9e-05 Score=46.43 Aligned_cols=84 Identities=19% Similarity=0.175 Sum_probs=44.0 Q ss_pred Ceeeeec--CcHHHHHHHHHHHHHH--------h--------cCCcEEEEeccCCCCC------CC----CC-CCHHHHH Q lcl|NC_019457. 1 MIKITVP--NFDAVRDELTKALNKL--------N--------SDEFVTVGIHEADNAR------PE----GV-LTNAQLG 51 (156) Q Consensus 1 m~~v~~~--~~~~~~~~l~~~l~~l--------~--------~~~~V~VGi~~~~~~~------~~----~g-~~~A~ia 51 (156) ||.+++. +++..++.+.+.++.. . ..--|.-|-+..+-.. .. .+ .+++.+| T Consensus 1 m~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA 80 (142) T protein:vir:99 1 MVQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYA 80 (142) T ss_pred CceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCcccc Confidence 7777655 4444444443322111 0 0011222433332110 00 11 2567899 Q ss_pred HHHhcCCC-----------------------------CCCCchhhhHHHHHHHHHHHHHHHH Q lcl|NC_019457. 52 AIQHFGND-----------------------------RIPARPWLDVGVASVNDEILDTIAA 84 (156) Q Consensus 52 ~~~E~G~~-----------------------------~IP~RpFlr~~~~~~~~~~~~~~~~ 84 (156) .++||||. ..||||||+++++++.++-....-. T Consensus 81 ~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:99 81 AAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred ceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 99999973 2569999999999987664332222 No 108 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=95.51 E-value=1.9e-05 Score=46.43 Aligned_cols=84 Identities=19% Similarity=0.175 Sum_probs=44.0 Q ss_pred Ceeeeec--CcHHHHHHHHHHHHHH--------h--------cCCcEEEEeccCCCCC------CC----CC-CCHHHHH Q lcl|NC_019457. 1 MIKITVP--NFDAVRDELTKALNKL--------N--------SDEFVTVGIHEADNAR------PE----GV-LTNAQLG 51 (156) Q Consensus 1 m~~v~~~--~~~~~~~~l~~~l~~l--------~--------~~~~V~VGi~~~~~~~------~~----~g-~~~A~ia 51 (156) ||.+++. +++..++.+.+.++.. . ..--|.-|-+..+-.. .. .+ .+++.+| T Consensus 1 m~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA 80 (142) T protein:vir:86 1 MVQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYA 80 (142) T ss_pred CceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCcccc Confidence 7777655 4444444443322111 0 0011222433332110 00 11 2567899 Q ss_pred HHHhcCCC-----------------------------CCCCchhhhHHHHHHHHHHHHHHHH Q lcl|NC_019457. 52 AIQHFGND-----------------------------RIPARPWLDVGVASVNDEILDTIAA 84 (156) Q Consensus 52 ~~~E~G~~-----------------------------~IP~RpFlr~~~~~~~~~~~~~~~~ 84 (156) .++||||. ..||||||+++++++.++-....-. T Consensus 81 ~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:86 81 AAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred ceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 99999973 2569999999999987664332222 No 109 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=95.28 E-value=0.00021 Score=40.71 Aligned_cols=86 Identities=7% Similarity=-0.047 Sum_probs=61.0 Q ss_pred HHHHHHHHHHHHHHHHhc--cccHHHHHHHHHHHHHHHHHHHHHcC------CCCCCcHHHHHhcCC----CCchhHHHH Q lcl|NC_019457. 71 VASVNDEILDTIAASLED--GEDISQLLNRVGVVAVAGVQNYIDEL------RSPANAPSTVERKGA----DNPLVDTGE 138 (156) Q Consensus 71 ~~~~~~~~~~~~~~~~~~--~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppns~~Ti~~KG~----~~PLidTG~ 138 (156) +++.-.++.+.+...+.. ..+...+|..||..+....++.|... .|+|+++.|++.|.. ..+|..... T Consensus 1 m~~~~~~l~~~L~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~~~~~m~~~l~ 80 (156) T protein:vir:11 1 MADSLEALEDWAGPILRALEPGPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQGRIRRKIKMFQKLR 80 (156) T ss_pred CchhHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhccccccchhhhhhhh Confidence 555556666666655543 23457799999999999999999874 678999999986532 234444444 Q ss_pred HHhhceeeecccccccCC Q lcl|NC_019457. 139 MKQSVTYNIQTGRPSEGL 156 (156) Q Consensus 139 L~~SIty~V~~~k~~~g~ 156 (156) +..+|++.+....+++|+ T Consensus 81 ~~~~l~~~~~~~~a~vg~ 98 (156) T protein:vir:11 81 TVRYLRAKGDAQAITVSF 98 (156) T ss_pred hhheeeeeecCcEEEEEe Confidence 455688888888888854 No 110 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=94.52 E-value=6.7e-05 Score=43.48 Aligned_cols=68 Identities=15% Similarity=0.151 Sum_probs=32.3 Q ss_pred hhHHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 67 LDVGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) |-..+ ...+++.+.|+++-.. .....++++..+..+++.++.. .| +|||.|++||++ T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~--------------------aP-vdTG~Lr~SI~~ 58 (137) T protein:vir:94 1 MAKVK-YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISL--------------------MP-VDTGYLRESVTM 58 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--------------------CC-ccccchhcccee Confidence 32222 1233333333322110 1123334444444444433331 12 589999999999 Q ss_pred eecccccccCC Q lcl|NC_019457. 146 NIQTGRPSEGL 156 (156) Q Consensus 146 ~V~~~k~~~g~ 156 (156) ++.++.++.-| T Consensus 59 ~~~~~~~~~~V 69 (137) T protein:vir:94 59 DFKDSGFTGVI 69 (137) T ss_pred EeecCceEEEE Confidence 99877544333 No 111 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=94.52 E-value=6.7e-05 Score=43.48 Aligned_cols=68 Identities=15% Similarity=0.151 Sum_probs=32.3 Q ss_pred hhHHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 67 LDVGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) |-..+ ...+++.+.|+++-.. .....++++..+..+++.++.. .| +|||.|++||++ T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~--------------------aP-vdTG~Lr~SI~~ 58 (137) T protein:vir:97 1 MAKVK-YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISL--------------------MP-VDTGYLRESVTM 58 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--------------------CC-ccccchhcccee Confidence 32222 1233333333322110 1123334444444444433331 12 589999999999 Q ss_pred eecccccccCC Q lcl|NC_019457. 146 NIQTGRPSEGL 156 (156) Q Consensus 146 ~V~~~k~~~g~ 156 (156) ++.++.++.-| T Consensus 59 ~~~~~~~~~~V 69 (137) T protein:vir:97 59 DFKDSGFTGVI 69 (137) T ss_pred EeecCceEEEE Confidence 99877544333 No 112 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=94.52 E-value=6.7e-05 Score=43.48 Aligned_cols=68 Identities=15% Similarity=0.151 Sum_probs=32.3 Q ss_pred hhHHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 67 LDVGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) |-..+ ...+++.+.|+++-.. .....++++..+..+++.++.. .| +|||.|++||++ T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~--------------------aP-vdTG~Lr~SI~~ 58 (137) T protein:vir:93 1 MAKVK-YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISL--------------------MP-VDTGYLRESVTM 58 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--------------------CC-ccccchhcccee Confidence 32222 1233333333322110 1123334444444444433331 12 589999999999 Q ss_pred eecccccccCC Q lcl|NC_019457. 146 NIQTGRPSEGL 156 (156) Q Consensus 146 ~V~~~k~~~g~ 156 (156) ++.++.++.-| T Consensus 59 ~~~~~~~~~~V 69 (137) T protein:vir:93 59 DFKDSGFTGVI 69 (137) T ss_pred EeecCceEEEE Confidence 99877544333 No 113 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=93.41 E-value=0.00024 Score=40.44 Aligned_cols=80 Identities=26% Similarity=0.305 Sum_probs=38.2 Q ss_pred CeeeeecCcH-HHHHHHHHHHHH-Hh--cCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcC------------------- Q lcl|NC_019457. 1 MIKITVPNFD-AVRDELTKALNK-LN--SDEFVTVGIHEADNARPEGVLTNAQLGAIQHFG------------------- 57 (156) Q Consensus 1 m~~v~~~~~~-~~~~~l~~~l~~-l~--~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G------------------- 57 (156) =++-.+.... ...++|.....+ -+ +..+-.|||-.-+.++ +.+.||| T Consensus 4 eakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPh----------ghlvE~Ghw~~~~~~~~~dG~w~~~~ 73 (119) T protein:vir:81 4 SAKAFVNDETGKLRSNLYVAYSPEESTNGVQTYAVSWRKKAAPH----------GHLLEFGHWQTHAAYKGKDGEWYSSS 73 (119) T ss_pred ccccccCCCccchhhhheeeeccccCCCCeEEEEeeccCCcCCc----------ccccccceeeeeeeeeccCceeeecC Confidence 1111111111 111222111111 11 1122355666544432 3556888 Q ss_pred -----CCCCCCchhhhHHHHHHHHHHHHH--------HHHHHhccc Q lcl|NC_019457. 58 -----NDRIPARPWLDVGVASVNDEILDT--------IAASLEDGE 90 (156) Q Consensus 58 -----~~~IP~RpFlr~~~~~~~~~~~~~--------~~~~~~~~~ 90 (156) ...+|+|||||++|+....+..+. +.+++.|.. T Consensus 74 ~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:81 74 VKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred ccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 235999999999999877665544 444555544 No 114 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=93.27 E-value=0.00027 Score=40.15 Aligned_cols=80 Identities=26% Similarity=0.308 Sum_probs=38.1 Q ss_pred CeeeeecCcH-HHHHHHHHHHHH-Hh--cCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcC------------------- Q lcl|NC_019457. 1 MIKITVPNFD-AVRDELTKALNK-LN--SDEFVTVGIHEADNARPEGVLTNAQLGAIQHFG------------------- 57 (156) Q Consensus 1 m~~v~~~~~~-~~~~~l~~~l~~-l~--~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G------------------- 57 (156) =++-.+.... ...++|.....+ -+ +..+-.|||-.-+.++ +-+.||| T Consensus 4 eakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkAPh----------ghlvE~Ghw~~~~~~~~~dG~w~~~~ 73 (119) T protein:vir:10 4 SAKAFVNDETGKLRSNLYVAYSTEESTNGVQTYAVSWRKKAAPH----------GHLLEFGHWQTHAAYKGKDGEWYSSS 73 (119) T ss_pred ccccccCCCccchhhhheeeeccccCCCCEEEEEeecCCCcCCc----------ccccccceeeeeeeeeccCceeeecC Confidence 1111111111 111222111111 11 1122355666544432 3556888 Q ss_pred -----CCCCCCchhhhHHHHHHHHHHHHH--------HHHHHhccc Q lcl|NC_019457. 58 -----NDRIPARPWLDVGVASVNDEILDT--------IAASLEDGE 90 (156) Q Consensus 58 -----~~~IP~RpFlr~~~~~~~~~~~~~--------~~~~~~~~~ 90 (156) ...+|+|||||++|+....+..+. +.+++.|.. T Consensus 74 ~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:10 74 VKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred ccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 225999999999999877665544 444555544 No 115 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=93.25 E-value=0.00034 Score=39.57 Aligned_cols=83 Identities=16% Similarity=0.251 Sum_probs=41.9 Q ss_pred Ceeeeec-------Cc---HHHHHHHHHHHHHHh--cCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhh Q lcl|NC_019457. 1 MIKITVP-------NF---DAVRDELTKALNKLN--SDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLD 68 (156) Q Consensus 1 m~~v~~~-------~~---~~~~~~l~~~l~~l~--~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr 68 (156) |=..+-. +. +-+.+.|.-.--... ....+.|||... +.+|.+.||||.++||.||+. T Consensus 40 L~~~tp~~~~~~~~~~~~~~HlaD~I~~s~~~~dg~~~g~~~VG~~k~-----------~~~A~f~n~GT~k~~~~hFie 108 (139) T protein:vir:10 40 LAETTKEKHPNTKGDGGKYGHLSEDIRSAAGDIDGDHNGSSTVGFHNK-----------AHIARFLNDGTKYIRADHFVD 108 (139) T ss_pred HHHhcccccCcCCCCCCCCcchhhcceecCcccccccceeeeeCCCCC-----------cceEeecccCccccCCCchHH Confidence 0000000 00 001111110000000 112356888421 357899999999999999999 Q ss_pred HHHHHHHHHHHHHHHH----HHhc-cccHHH Q lcl|NC_019457. 69 VGVASVNDEILDTIAA----SLED-GEDISQ 94 (156) Q Consensus 69 ~~~~~~~~~~~~~~~~----~~~~-~~~~~~ 94 (156) .|..+.++++.+.+.+ +|.. +.+.+. T Consensus 109 ~t~~e~~~evl~a~~~~~k~~l~~~~~~~~~ 139 (139) T protein:vir:10 109 NARDDAKDAVFAAEAEKYQAMIAKANGGGDK 139 (139) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCCCCC Confidence 9999998887665544 4432 111122 No 116 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=91.78 E-value=0.00039 Score=39.26 Aligned_cols=68 Identities=18% Similarity=0.192 Sum_probs=32.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 67 LDVGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) |-.. ...-+++.+.|+++-.. ....+++|...+..+++.++.. + | +|||.|++||++ T Consensus 1 Ma~~-~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~---------a-----------p-vdTG~Lr~SI~~ 58 (135) T protein:vir:96 1 MAKV-KYGADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHL---------M-----------P-VDTGFLRQSTTV 58 (135) T ss_pred Cchh-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---------C-----------C-ccchhhhcceeE Confidence 2110 01122333333322111 1123445555555555444332 1 2 699999999999 Q ss_pred eecccccccCC Q lcl|NC_019457. 146 NIQTGRPSEGL 156 (156) Q Consensus 146 ~V~~~k~~~g~ 156 (156) +|.++.++--| T Consensus 59 ~~~~~g~~~~V 69 (135) T protein:vir:96 59 DFENGGFTGVV 69 (135) T ss_pred EeecCcEEEEE Confidence 98876544323 No 117 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=91.72 E-value=0.0015 Score=36.06 Aligned_cols=81 Identities=15% Similarity=0.115 Sum_probs=36.9 Q ss_pred Cee-----eeecCcHHHHHHHHHHHHHHh-cCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCC--------------- Q lcl|NC_019457. 1 MIK-----ITVPNFDAVRDELTKALNKLN-SDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGND--------------- 59 (156) Q Consensus 1 m~~-----v~~~~~~~~~~~l~~~l~~l~-~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~--------------- 59 (156) .-. -+.+ ...++.+|.+.+.-.. ....+.++++.+ .++.||++|.||-. T Consensus 59 w~pRK~~~~k~k-~~rm~~kL~~~~~~~~~~~~~~~~~~~~g---------~~~~IA~vHQ~G~~~rv~~~~~~~~~~~~ 128 (231) T protein:vir:37 59 WEKRKPVDGEIK-NKRLLKKVLRYASILAEERGKGRIYYKNP---------LTGEIAQKQQDGFTEHFRVFATDKNKNGS 128 (231) T ss_pred Cchhcccccchh-hHHHHHHhHHhhccccccCCceEEeeecc---------hHHHHHHHhhcCcccccchhhhhhccCCC Confidence 111 0000 0123344443332111 112234444432 36789999999921 Q ss_pred -----------------------------------------------------------------------CCCCchhhh Q lcl|NC_019457. 60 -----------------------------------------------------------------------RIPARPWLD 68 (156) Q Consensus 60 -----------------------------------------------------------------------~IP~RpFlr 68 (156) ++|+||||- T Consensus 129 ~~~pATr~QAk~Lr~lGy~v~~~k~k~~k~~~rkps~kwI~~~ls~~qAgliIR~L~~k~~~~~~k~~W~I~~paR~FLG 208 (231) T protein:vir:37 129 GNDRATIRQAQKLRSLGYRKRNGKNRQGKTKYRLYTIKEIRERLTRTWASMEIRRLENKVNAGNGKTNWEIHVPARPFLD 208 (231) T ss_pred CCCCCCHHHHHHHHHhcccccCCCCCCCCCCcCcCCHHHHHHhhhhHHHHHHHHHHhcccccccCcceeeeecCcccccC Confidence 255566655 Q ss_pred HHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019457. 69 VGVASVNDEILDTIAASLEDGED 91 (156) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~ 91 (156) ..-++...-+...|.+++.+... T Consensus 209 ~~~~e~~~~l~~~l~~i~~~~~~ 231 (231) T protein:vir:37 209 TREKENVDILREITLKFLSGEYK 231 (231) T ss_pred CCHHHHHHHHHHHHHHHhcccCC Confidence 55555555555555555555444 No 118 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=91.58 E-value=0.00071 Score=37.84 Aligned_cols=82 Identities=21% Similarity=0.234 Sum_probs=41.7 Q ss_pred Ceeeee--------c-CcHHHHHHHHHHHHHHh--cCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhH Q lcl|NC_019457. 1 MIKITV--------P-NFDAVRDELTKALNKLN--SDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDV 69 (156) Q Consensus 1 m~~v~~--------~-~~~~~~~~l~~~l~~l~--~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~ 69 (156) |=.++- + ..+-+.+.|.-.=.... ....+.|||... ..+++|.+.++||.++|+-||+.. T Consensus 41 L~~~tp~~hy~~~~~~~~~HlaD~I~~~~~~~DG~~dg~s~VG~~~~---------~~~~~A~f~n~GT~k~~~~hFve~ 111 (141) T protein:vir:50 41 LEEVTREKHYSRKKNPKFGHMADGLAIQSTNADGRKNGVSTVGWKNN---------YHAQNARRLNDGTKKYRADHFVTN 111 (141) T ss_pred HHHhcccCCCCCCCCCCCCccccceeeccCccccccCCeeeeccCCC---------ccceeeeccccCccccCCCchhHH Confidence 000000 0 00000011100000000 123567898632 246899999999999999999999 Q ss_pred HHHHH--HHHHHHH----HHHHHh--cccc Q lcl|NC_019457. 70 GVASV--NDEILDT----IAASLE--DGED 91 (156) Q Consensus 70 ~~~~~--~~~~~~~----~~~~~~--~~~~ 91 (156) +..+. ++++.+. ++++|. |+.+ T Consensus 112 ~~~~a~~k~~Vl~A~~~~~k~~l~~~~~~~ 141 (141) T protein:vir:50 112 VQNDSTVQKKVLLEKKRNTKNSLEEKEGCD 141 (141) T ss_pred HHHhhhhHHHHHHHHHHHHHHHHHhccCCC Confidence 99764 4455443 445553 3444 No 119 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=90.57 E-value=0.00046 Score=38.90 Aligned_cols=68 Identities=16% Similarity=0.181 Sum_probs=31.0 Q ss_pred hhHHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 67 LDVGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) |-..+ ...+++.+.|+++-.. .....++++..+..+++.++.. .| +|||.|++||++ T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~--------------------aP-v~TG~L~~Si~~ 58 (137) T protein:vir:95 1 MAKVK-YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISL--------------------MP-VDTGYLRESVTM 58 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--------------------CC-ccchhhhcCeee Confidence 32221 1223333333322110 1122334444444444433322 12 489999999999 Q ss_pred eecccccccCC Q lcl|NC_019457. 146 NIQTGRPSEGL 156 (156) Q Consensus 146 ~V~~~k~~~g~ 156 (156) ++.++.++--+ T Consensus 59 ~~~~~~~~~~V 69 (137) T protein:vir:95 59 DFKDGGFTGVI 69 (137) T ss_pred EeeCCceEEEE Confidence 99876443222 No 120 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=89.94 E-value=0.0021 Score=35.22 Aligned_cols=91 Identities=15% Similarity=0.359 Sum_probs=54.0 Q ss_pred CeeeeecCcHH------------HHHHHHHHHHHHhcC--CcEEEEeccCC-----CCCC-------CCC-----CCHHH Q lcl|NC_019457. 1 MIKITVPNFDA------------VRDELTKALNKLNSD--EFVTVGIHEAD-----NARP-------EGV-----LTNAQ 49 (156) Q Consensus 1 m~~v~~~~~~~------------~~~~l~~~l~~l~~~--~~V~VGi~~~~-----~~~~-------~~g-----~~~A~ 49 (156) |-+|+++++.+ +-+.+.+.+++.... +.++-.+.+.+ .|.. .++ -+.-+ T Consensus 1 M~~i~id~La~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~~~~~v~nk~~yq 80 (127) T protein:vir:80 1 MANIKIDRLGDEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTPGGWVIHNKTEYR 80 (127) T ss_pred CccccHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeeccCceeEeecCCcc Confidence 88888887732 222333333222110 01111111111 1100 011 12236 Q ss_pred HHHHHhcCCC-----CCCCchhhhHHHHHHHHHHHHHHHHHHhcccc Q lcl|NC_019457. 50 LGAIQHFGND-----RIPARPWLDVGVASVNDEILDTIAASLEDGED 91 (156) Q Consensus 50 ia~~~E~G~~-----~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~ 91 (156) ++-+.|||.- +.++|||++|+.+....++.+.++.++.|+.- T Consensus 81 LtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~~~~ 127 (127) T protein:vir:80 81 LAHLLEYGHATVDGGRVPETPHIRPVEDWLEKEFEDRVERAIKNESR 127 (127) T ss_pred eeehhhcceeccCCcccCCccchhhHHHHHHHHHHHHHHHHhcCCCC Confidence 7899999963 69999999999999999999999999988665 No 121 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=89.44 E-value=0.001 Score=36.93 Aligned_cols=94 Identities=11% Similarity=0.233 Sum_probs=55.1 Q ss_pred CeeeeecCcHHHHHHHHHHHHH-------------H--------hcCCcEEEEecc-----CCCCCCCCC-----CCHHH Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNK-------------L--------NSDEFVTVGIHE-----ADNARPEGV-----LTNAQ 49 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~-------------l--------~~~~~V~VGi~~-----~~~~~~~~g-----~~~A~ 49 (156) |-.+..+++++..+.|.+.... + ...-.|.-|-+. +..+...++ .+++. T Consensus 3 ~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVdTG~Lr~S~~~~~~~~~~~~~~~~V~n~~~ 82 (144) T protein:vir:10 3 LGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVKQGNLRRSWTAEGPTYGCGGWTIKLINNAE 82 (144) T ss_pred CCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCCcchhccceeecceeeecCeeEEEEecCCC Confidence 4445555555555544433211 0 000012223222 212222233 36788 Q ss_pred HHHHHhcCCC-----------------CCCCchhhhHHHHHHHHHHHHHHHHHHhccccHHH Q lcl|NC_019457. 50 LGAIQHFGND-----------------RIPARPWLDVGVASVNDEILDTIAASLEDGEDISQ 94 (156) Q Consensus 50 ia~~~E~G~~-----------------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~~ 94 (156) +|-+-|||+. -+|-++||+.++.+.+..+.+.+.+.+.+=.|.-+ T Consensus 83 YA~~VE~Ghr~~~G~~v~~~~~~~~~g~V~G~~~~~~a~~~~~~~~~~~l~k~l~~l~d~~~ 144 (144) T protein:vir:10 83 YASYVESGHRQTPGRYVPVLKKRLVRDWVPGQFYMKKSIPQIQRQLPQLVTEGLWGLKDLFE 144 (144) T ss_pred cccccccceeecCCcccccCCCccccceecCccchHHHHHHHHHHHHHHHHHHHHHHhhhcC Confidence 9999999973 47899999999999999999888888876544433 No 122 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=89.21 E-value=0.0012 Score=36.69 Aligned_cols=48 Identities=29% Similarity=0.387 Sum_probs=28.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeecccccccCC Q lcl|NC_019457. 92 ISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTYNIQTGRPSEGL 156 (156) Q Consensus 92 ~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty~V~~~k~~~g~ 156 (156) .++++.+.-......|+..+... .| +|||.|++||++++.++.+.-.+ T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~----------------ap-v~TG~Lr~SI~~~~~~~~~~~~V 48 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISL----------------MP-VDTGYLRESVTMDFKDGGFTGVI 48 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhh----------------CC-ccccccccceeEEeecCcEEEEE Confidence 44444444444444444444321 23 58999999999999876544333 No 123 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=89.19 E-value=0.00095 Score=37.16 Aligned_cols=68 Identities=18% Similarity=0.204 Sum_probs=32.2 Q ss_pred hhHHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 67 LDVGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) |-.- ....+++.+.|+++-.. ....+++|+..+..+++.++.. .| +|||.|++||++ T Consensus 1 Ma~~-~~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~--------------------aP-vdTG~Lr~SI~~ 58 (137) T protein:vir:94 1 MAKV-KYGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISL--------------------MP-VDTGYLRESVTM 58 (137) T ss_pred Cchh-HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--------------------CC-cCcchhhcCcee Confidence 2211 01223333333322211 1123444555454444444432 12 489999999999 Q ss_pred eecccccccCC Q lcl|NC_019457. 146 NIQTGRPSEGL 156 (156) Q Consensus 146 ~V~~~k~~~g~ 156 (156) ++.++.++.-+ T Consensus 59 ~~~~~~~~~~V 69 (137) T protein:vir:94 59 DFKDGGFTGVI 69 (137) T ss_pred EeecCcEEEEE Confidence 99876543322 No 124 >protein:vir:3750 Length: 227 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043491;genbank:gi:9628626;genbank:GeneID:1261131 Probab=89.02 E-value=0.004 Score=33.74 Aligned_cols=101 Identities=9% Similarity=-0.005 Sum_probs=37.8 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCC--------------------- Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGND--------------------- 59 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~--------------------- 59 (156) +-. ......+++.+|.+.|.--.......|+|..+ -++.||++|.||.. T Consensus 59 ~~p-RKr~k~KM~~kL~k~l~~~~~~~~a~v~f~~g---------~~~~IA~vHq~G~~~~v~~~~~~~~~~~~~~~~pa 128 (227) T protein:vir:37 59 WKK-RKNGTAKMLRRIAKLANSKAEKAQGTLFYKQK---------RTGEIAQEHQEGIPHLFKKTEFTGKNKGGIGADPC 128 (227) T ss_pred Cch-hcchhHHHHhhhHHHcceeecccceEEEecCc---------chHHHHHHhhcCcccccchhhhhhhhcCCccccCC Confidence 111 11122233444444443333333456777643 25689999999932 Q ss_pred -------------CCCCc-------hhhhHHHHHHHHHHH----HHHHHHHhc-----------------------cccH Q lcl|NC_019457. 60 -------------RIPAR-------PWLDVGVASVNDEIL----DTIAASLED-----------------------GEDI 92 (156) Q Consensus 60 -------------~IP~R-------pFlr~~~~~~~~~~~----~~~~~~~~~-----------------------~~~~ 92 (156) .+|.. -+=+|++....+.+. -++-..|.+ |.+. T Consensus 129 Tr~QAk~Lr~lGy~v~~~k~k~~k~~~rkps~kwI~~nls~~qAgliIR~L~~k~~~~~~~~k~~W~I~~PaR~FLG~~~ 208 (227) T protein:vir:37 129 TLRQAKKLKDLGYTVANGKTKNGKAKRRKPTLSEIRSTLSRAKASLIIRKLEEKNGMNPSRHLTQWIIPTEKRSFLDTRE 208 (227) T ss_pred CHHHHHHHHHhcccccCCCCCCcCCccccCCHHHHHHhhhHHHHHHHHHHHhcccccccccCccceeeecCcccccCCCH Confidence 12210 011222222211111 111122221 2233 Q ss_pred HHHHHHHHHHHHHHHHHHH Q lcl|NC_019457. 93 SQLLNRVGVVAVAGVQNYI 111 (156) Q Consensus 93 ~~~l~~iG~~~~~~i~~~I 111 (156) +++.+.|...+...-|+.= T Consensus 209 ~e~~~~l~r~l~~~~~~~~ 227 (227) T protein:vir:37 209 EENAKIILAEIQKYTQKQQ 227 (227) T ss_pred HHHHHHHHHHHHHHhhhcC Confidence 4444444433333332221 No 125 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=88.99 E-value=0.0018 Score=35.61 Aligned_cols=96 Identities=14% Similarity=0.122 Sum_probs=42.2 Q ss_pred Ceeeee-----c-CcHHHHHHHHHHHHHHh--cCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHH Q lcl|NC_019457. 1 MIKITV-----P-NFDAVRDELTKALNKLN--SDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVA 72 (156) Q Consensus 1 m~~v~~-----~-~~~~~~~~l~~~l~~l~--~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~ 72 (156) .....- + ..+-+.|.|.-.-.... ....+.|||... ..+.+|.+.|+||.++|+.||++.+.. T Consensus 44 ~tp~~h~~~~kt~~~~HlaD~I~~s~~~idG~~dG~s~VG~~~~---------~~a~~a~f~n~GT~km~~~hFie~tr~ 114 (153) T protein:vir:49 44 VTREKHYSKKKDLKYGHMADGLAVQSTNADGRKNGVSTVGWKNN---------YHAQNARRLNDGTKKYRADHFITNVQN 114 (153) T ss_pred hccccCCCCCCCCCCCcccccceeccccccccccceeeecccCC---------ccceeeeecccCcccCCCChhhHHHHH Confidence 000000 0 00001111110000000 123567998742 246789999999999999999999998 Q ss_pred HH--HHHHHH----HHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCC Q lcl|NC_019457. 73 SV--NDEILD----TIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGAD 130 (156) Q Consensus 73 ~~--~~~~~~----~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~ 130 (156) +. +.++.+ .++++|.... ---+|.+..+-|..+ T Consensus 115 e~~~k~~vl~A~~~~~~~il~~~~-------------------------~~~~~~~~~~~~~~~ 153 (153) T protein:vir:49 115 DSTVKNKVLLAEKEEYEKLIRRKG-------------------------GVYLSASNFKTKRAT 153 (153) T ss_pred HhhHHHHHHHHHHHHHHHHHHhcC-------------------------CeeeeccccccccCC Confidence 75 344443 2333332210 001111111111111 No 126 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=88.65 E-value=0.00076 Score=37.69 Aligned_cols=68 Identities=15% Similarity=0.189 Sum_probs=32.6 Q ss_pred hhHHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 67 LDVGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) |-.-. ...+++.+.|+++-.. ....+++|...+..+++.+|... | +|||.|++||++ T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~--------------------p-vdTG~L~~Si~~ 58 (137) T protein:vir:96 1 MAKVK-YGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALA--------------------P-VDLGFLKESIDF 58 (137) T ss_pred CchhH-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--------------------C-cCccchhcCcee Confidence 22211 1223333333322111 11234455555555555544321 2 589999999999 Q ss_pred eecccccccCC Q lcl|NC_019457. 146 NIQTGRPSEGL 156 (156) Q Consensus 146 ~V~~~k~~~g~ 156 (156) +|..+..+.-+ T Consensus 59 ~~~~~g~~~~V 69 (137) T protein:vir:96 59 KVTDGGFSSVI 69 (137) T ss_pred EeecCceEEEE Confidence 98765433222 No 127 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=88.61 E-value=0.0012 Score=36.68 Aligned_cols=48 Identities=29% Similarity=0.398 Sum_probs=27.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeecccccccCC Q lcl|NC_019457. 92 ISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTYNIQTGRPSEGL 156 (156) Q Consensus 92 ~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty~V~~~k~~~g~ 156 (156) .++++.+.-......++..+... +| +|||.|++||++++.++.+.-.+ T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~-----aP------------v~TG~Lr~SI~~~~~~~~~~~~V 48 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISL-----MP------------VDTGYLRESVTMDFKDGGFTGVI 48 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHh-----CC------------cCcccccccceEEeecCcEEEEE Confidence 34444444344444444444331 11 58999999999999876544333 No 128 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=88.61 E-value=0.0012 Score=36.68 Aligned_cols=48 Identities=29% Similarity=0.398 Sum_probs=27.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeecccccccCC Q lcl|NC_019457. 92 ISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTYNIQTGRPSEGL 156 (156) Q Consensus 92 ~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty~V~~~k~~~g~ 156 (156) .++++.+.-......++..+... +| +|||.|++||++++.++.+.-.+ T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~-----aP------------v~TG~Lr~SI~~~~~~~~~~~~V 48 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISL-----MP------------VDTGYLRESVTMDFKDGGFTGVI 48 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHh-----CC------------cCcccccccceEEeecCcEEEEE Confidence 34444444344444444444331 11 58999999999999876544333 No 129 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=88.44 E-value=0.0011 Score=36.79 Aligned_cols=82 Identities=17% Similarity=0.272 Sum_probs=42.1 Q ss_pred CeeeeecCc--HHHHHHHHHHHHHHh--cCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHH Q lcl|NC_019457. 1 MIKITVPNF--DAVRDELTKALNKLN--SDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVND 76 (156) Q Consensus 1 m~~v~~~~~--~~~~~~l~~~l~~l~--~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~ 76 (156) ..+...+.. +-+.+.|.-.-.... ....+.|||.. -+.+|.+-|+||.++||.+|+..|..+.++ T Consensus 48 ~~~~~~~~~~~~HlaD~I~~~~~~idg~~~g~~~VG~~~-----------~~~~Ahf~n~GT~~~~~~hFie~t~~e~~~ 116 (139) T protein:vir:10 48 HPNTKGDGGKYGHLSEDISSAAGDIDGDHNGSSTVGFHN-----------KAHIARFLNDGTKNIRADHFVDNARDDAKD 116 (139) T ss_pred cccCCCCCCCCCcccccceecCccccccccccceeCCCC-----------CceeeeeeccCccccCCCchHHHHHHHHHH Confidence 111000000 000111100000000 12246788842 135789999999999999999999999888 Q ss_pred HHHHHHHH----HHhc--cccHH Q lcl|NC_019457. 77 EILDTIAA----SLED--GEDIS 93 (156) Q Consensus 77 ~~~~~~~~----~~~~--~~~~~ 93 (156) ++.+.+.+ +|.. +.+-. T Consensus 117 ev~~a~~~~~ke~l~~~~~~~~~ 139 (139) T protein:vir:10 117 AVFAAEAEKYQAMIAKANGGDSK 139 (139) T ss_pred HHHHHHHHHHHHHHhhcCCCCCC Confidence 87665544 4432 22212 No 130 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=88.28 E-value=0.0035 Score=34.03 Aligned_cols=88 Identities=8% Similarity=0.156 Sum_probs=45.9 Q ss_pred Cee-eeecCcHHHH------------HHHHHHHHHHhc--CCcEEEEeccCC-CC-------CCCCC-------CCHHHH Q lcl|NC_019457. 1 MIK-ITVPNFDAVR------------DELTKALNKLNS--DEFVTVGIHEAD-NA-------RPEGV-------LTNAQL 50 (156) Q Consensus 1 m~~-v~~~~~~~~~------------~~l~~~l~~l~~--~~~V~VGi~~~~-~~-------~~~~g-------~~~A~i 50 (156) |-+ |++++..+.. +.+.+.+++.+. .+.++-+-|... .| .+.+| .+--.| T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~~~~~~~~v~~~~~~y~l 80 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQKLKNGDQVIYQKAPTYRL 80 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccccceeeeecCCeeEEEEEecCCcce Confidence 554 4555543211 112222222110 001111111100 00 00011 111246 Q ss_pred HHHHhcCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 51 GAIQHFGN-----DRIPARPWLDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 51 a~~~E~G~-----~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (156) +-+.|||. ..+|+||||+|+.+...+++.+.++..+.. T Consensus 81 ~HLLE~GHa~r~GGrV~a~phI~paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 81 THLLENGHAKRNGGRVSPKVHIAPVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred EEeeecceeecCCceeCcchhhhHHHHHHHHHHHHHHHHHhcC Confidence 78889994 469999999999999999999999988877 No 131 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=87.86 E-value=0.0015 Score=36.11 Aligned_cols=82 Identities=15% Similarity=0.202 Sum_probs=41.3 Q ss_pred Ceeee-------ecC--cHHHHHHHHHHHHHHh--cCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhH Q lcl|NC_019457. 1 MIKIT-------VPN--FDAVRDELTKALNKLN--SDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDV 69 (156) Q Consensus 1 m~~v~-------~~~--~~~~~~~l~~~l~~l~--~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~ 69 (156) |-.++ .+. .+-+.+.|.-.=.... ....+.|||... +.+++|.+.++||.++|+-||+.. T Consensus 41 L~~~tp~~h~~~~~t~~~~HlaD~I~~~~~~iDg~~~g~s~VG~~kk---------~~a~~A~f~n~GT~k~~~~hFve~ 111 (140) T protein:vir:48 41 LAEVTRQKHYSNKKHLKYGHMADGLSVQSTNVDGRKNGVSTVGWVNR---------YHAQNARRLNDGTKKYRADHFVTN 111 (140) T ss_pred HHHhccccCCCCCCCCCCCcchhceeecccccccccCceeeeccCCC---------cceeeeeccccCccccCCCchhHH Confidence 00000 000 0001111110000000 123567898632 346899999999999999999999 Q ss_pred HHHHH--HHHHHHH----HHHHHhc-ccc Q lcl|NC_019457. 70 GVASV--NDEILDT----IAASLED-GED 91 (156) Q Consensus 70 ~~~~~--~~~~~~~----~~~~~~~-~~~ 91 (156) +.++. +.++.+. ++++|.. +.+ T Consensus 112 ~~~e~~~k~~vl~A~~~~~~~~l~~~~~~ 140 (140) T protein:vir:48 112 VQNDSAVQTKVLLAEKEEYEKLIRKKGGE 140 (140) T ss_pred HHHhhhhHHHHHHHHHHHHHHHHHhhcCC Confidence 99875 4455443 3344422 222 No 132 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=87.53 E-value=0.0011 Score=36.83 Aligned_cols=68 Identities=15% Similarity=0.190 Sum_probs=35.9 Q ss_pred hhHHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 67 LDVGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) |-..+ ..-+++.+.|++.-.. ....+++|+..+..+++.+|... | +|||.|++||++ T Consensus 1 Ma~~~-~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~a---------P------------vdTG~Lr~SI~~ 58 (137) T protein:vir:10 1 MAKVK-YGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNM---------P------------VDTGYLRESVSM 58 (137) T ss_pred CchhH-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC---------C------------cCcchhhcCeeE Confidence 22210 1222333333222111 12345667777777666666542 1 489999999999 Q ss_pred eecccccccCC Q lcl|NC_019457. 146 NIQTGRPSEGL 156 (156) Q Consensus 146 ~V~~~k~~~g~ 156 (156) ++..+.++.-+ T Consensus 59 ~~~~~~~~~~V 69 (137) T protein:vir:10 59 DFKKGGLTGVI 69 (137) T ss_pred EeeCCcEEEEE Confidence 98766543333 No 133 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=87.47 E-value=0.0016 Score=35.93 Aligned_cols=82 Identities=16% Similarity=0.214 Sum_probs=41.3 Q ss_pred Ceeeee-------cCc--HHHHHHHHHHHHHHh--cCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhH Q lcl|NC_019457. 1 MIKITV-------PNF--DAVRDELTKALNKLN--SDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDV 69 (156) Q Consensus 1 m~~v~~-------~~~--~~~~~~l~~~l~~l~--~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~ 69 (156) |-.++- +.. +=+.+.|.-.-..+. ....+.|||... ..|++|.+.++||..+|+.||+.. T Consensus 41 L~~~tp~~h~~~r~t~~~~HlaD~I~~~~~~idg~~dG~s~VG~~k~---------~~a~~a~f~NdGT~k~~~~hFve~ 111 (140) T protein:vir:48 41 LAEVTREKHYSKKKDLKYGHMADGLAVQSTNVDGRKNGVATVGWKNN---------YHAQNARRLNDGTKKYRADHFVTN 111 (140) T ss_pred HHHhcccCCCCCCCCCCCCcccccceecccccccccccceeecccCC---------CceeEEeecccCccccCCCchHHH Confidence 100000 000 001111110000000 123467888742 246899999999999999999999 Q ss_pred HHHHH--HHHHHHHH----HHHHh-cccc Q lcl|NC_019457. 70 GVASV--NDEILDTI----AASLE-DGED 91 (156) Q Consensus 70 ~~~~~--~~~~~~~~----~~~~~-~~~~ 91 (156) |.++. ++++.+.. +++|. .+.+ T Consensus 112 t~~e~~~~~~vl~A~~~~y~~~l~kk~~~ 140 (140) T protein:vir:48 112 VQNDSAVRDKVLLAEKEEYEKLIRKKGGE 140 (140) T ss_pred HHHhhhhHHHHHHHHHHHHHHHHHhhcCC Confidence 99864 45555444 44442 1222 No 134 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=87.33 E-value=0.0023 Score=35.04 Aligned_cols=88 Identities=16% Similarity=0.327 Sum_probs=50.7 Q ss_pred CeeeeecCcHH------------HHHHHHHHHHHHhcC--CcEEEEeccCC-----CCCC-------CCC-----CCHHH Q lcl|NC_019457. 1 MIKITVPNFDA------------VRDELTKALNKLNSD--EFVTVGIHEAD-----NARP-------EGV-----LTNAQ 49 (156) Q Consensus 1 m~~v~~~~~~~------------~~~~l~~~l~~l~~~--~~V~VGi~~~~-----~~~~-------~~g-----~~~A~ 49 (156) |-+|+++++.+ +-+.+.+.+++.... ..|+-.+++.+ .|.. .++ -+--+ T Consensus 1 M~~i~id~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~e~~~V~nk~~yq 80 (124) T protein:vir:95 1 MAKIKIGRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVPNGWVIHNKTEYR 80 (124) T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeecCceeEEEcCCCc Confidence 99998888733 222222222111100 00111111100 1100 011 11235 Q ss_pred HHHHHhcCC-----CCCCCchhhhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 50 LGAIQHFGN-----DRIPARPWLDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 50 ia~~~E~G~-----~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (156) ++-+.|||. ++.++|||++|+.+....++.+.++.++.+ T Consensus 81 LtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~ 124 (124) T protein:vir:95 81 LAHLLEYGHATVDGGRVPGTPHIRPIEDWLEKEFEDRVEKAIKQ 124 (124) T ss_pred eeeeeecceeccCCcccCCccchhHHHHHHHHHHHHHHHHHhcC Confidence 788999996 369999999999999999999999999988 No 135 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=87.17 E-value=0.001 Score=37.02 Aligned_cols=68 Identities=13% Similarity=0.187 Sum_probs=35.5 Q ss_pred hhHHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 67 LDVGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) |-... ..-+++.+.|+++-.. ....+++|+..+..+++.+|... | +|||.|++||++ T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~a--------------------P-v~TG~Lr~SI~~ 58 (137) T protein:vir:10 1 MAKVK-YGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNM--------------------P-VDTGYLRESVSM 58 (137) T ss_pred Cccch-hCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--------------------C-cCcchhhcCeee Confidence 22210 1223333333322111 12345667777766666666532 1 589999999999 Q ss_pred eecccccccCC Q lcl|NC_019457. 146 NIQTGRPSEGL 156 (156) Q Consensus 146 ~V~~~k~~~g~ 156 (156) ++..+.++--+ T Consensus 59 ~~~~~~~~~~V 69 (137) T protein:vir:10 59 DFKKGGLTGVI 69 (137) T ss_pred EecCCcEEEEE Confidence 98766433222 No 136 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=86.47 E-value=0.0033 Score=34.19 Aligned_cols=72 Identities=17% Similarity=0.207 Sum_probs=42.9 Q ss_pred hhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhceee Q lcl|NC_019457. 67 LDVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTYN 146 (156) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty~ 146 (156) |-+-=-+--+++.+.|++. ......+.++...|..++..++.... .+ .| +|||.|++||+.. T Consensus 1 Ma~i~~~Gld~l~~~L~~~-~~~~~v~~~~~~~~~~~~~~~~~~a~-----~~-----------~p-~~TG~Lr~sI~~~ 62 (114) T protein:vir:27 1 MATIEFEGLDEMAQSLLKN-ASPEKRSKVLRKYGSKLKEAAVNRAQ-----FN-----------KG-YSTGATRRSITLQ 62 (114) T ss_pred CeeeeeehHHHHHHHHHHh-cCHHHHHHHHHHHHHHHHHHHHHhcc-----cC-----------CC-CCchhhhhceeee Confidence 3220001124444444432 22344567777777777776665431 11 12 6899999999999 Q ss_pred ecccccccCC Q lcl|NC_019457. 147 IQTGRPSEGL 156 (156) Q Consensus 147 V~~~k~~~g~ 156 (156) +.++..++|. T Consensus 63 ~~~~~~~V~~ 72 (114) T protein:vir:27 63 VESDKATVEA 72 (114) T ss_pred ecCCeeEecC Confidence 9999888887 No 137 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=86.47 E-value=0.0033 Score=34.19 Aligned_cols=72 Identities=17% Similarity=0.207 Sum_probs=42.9 Q ss_pred hhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhceee Q lcl|NC_019457. 67 LDVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTYN 146 (156) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty~ 146 (156) |-+-=-+--+++.+.|++. ......+.++...|..++..++.... .+ .| +|||.|++||+.. T Consensus 1 Ma~i~~~Gld~l~~~L~~~-~~~~~v~~~~~~~~~~~~~~~~~~a~-----~~-----------~p-~~TG~Lr~sI~~~ 62 (114) T protein:vir:49 1 MATIEFEGLDEMAQSLLKN-ASPEKRSKVLRKYGSKLKEAAVNRAQ-----FN-----------KG-YSTGATRRSITLQ 62 (114) T ss_pred CeeeeeehHHHHHHHHHHh-cCHHHHHHHHHHHHHHHHHHHHHhcc-----cC-----------CC-CCchhhhhceeee Confidence 3220001124444444432 22344567777777777776665431 11 12 6899999999999 Q ss_pred ecccccccCC Q lcl|NC_019457. 147 IQTGRPSEGL 156 (156) Q Consensus 147 V~~~k~~~g~ 156 (156) +.++..++|. T Consensus 63 ~~~~~~~V~~ 72 (114) T protein:vir:49 63 VESDKATVEA 72 (114) T ss_pred ecCCeeEecC Confidence 9999888887 No 138 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=86.39 E-value=0.00086 Score=37.38 Aligned_cols=81 Identities=19% Similarity=0.152 Sum_probs=36.7 Q ss_pred eeeeec--CcHHHH-----HHHHHHHHHH-------hc-CCcEEEEeccCCCCC----C------CC-CCCHHHHHHHHh Q lcl|NC_019457. 2 IKITVP--NFDAVR-----DELTKALNKL-------NS-DEFVTVGIHEADNAR----P------EG-VLTNAQLGAIQH 55 (156) Q Consensus 2 ~~v~~~--~~~~~~-----~~l~~~l~~l-------~~-~~~V~VGi~~~~~~~----~------~~-g~~~A~ia~~~E 55 (156) |.++.. .+.+.+ ..+++.+++. .. .-.|.-|-+..+-.. + .. -.+.+.+|.++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~V~~~~~YA~~ve 80 (137) T protein:vir:10 1 MTVTARYERNPVGEARQFQVIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIVVAGPLRLDSGVTAHADYARYVH 80 (137) T ss_pred CeeEEEeccCchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeeeccccceEEEEecCCCccceeee Confidence 333322 221111 1111111111 10 001222222222110 0 00 124578999999 Q ss_pred cCCC------------------------------CCCCchhhhHHHHHHHHHHHHHH Q lcl|NC_019457. 56 FGND------------------------------RIPARPWLDVGVASVNDEILDTI 82 (156) Q Consensus 56 ~G~~------------------------------~IP~RpFlr~~~~~~~~~~~~~~ 82 (156) |||. .+|+||||+++++++..+-...- T Consensus 81 ~GT~ph~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~~~~~~~~~ 137 (137) T protein:vir:10 81 DGTRAHVIRPRRPGGVLRFTVGGRVVYARRVNHPGTRARPFLRNAAERVVARETATS 137 (137) T ss_pred cCCCCceeeccccceeeeEeeCCeeEecceeecCCCCCCchHHHHHHHhhhhhcccC Confidence 9962 26799999999999876543322 No 139 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=85.97 E-value=0.0013 Score=36.33 Aligned_cols=79 Identities=19% Similarity=0.172 Sum_probs=37.6 Q ss_pred Ceee------eecCcHHHHH-----HHHHHHHH----Hhc----CCcEEEEeccCCCCC--CCCC--------CCHHHHH Q lcl|NC_019457. 1 MIKI------TVPNFDAVRD-----ELTKALNK----LNS----DEFVTVGIHEADNAR--PEGV--------LTNAQLG 51 (156) Q Consensus 1 m~~v------~~~~~~~~~~-----~l~~~l~~----l~~----~~~V~VGi~~~~~~~--~~~g--------~~~A~ia 51 (156) |+++ +++ ...++ .+.+.+++ +.+ .-.|.-|-+..+-.. .++| .+++.+| T Consensus 1 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA 78 (140) T protein:vir:97 1 MATIRARARIEID--EAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYA 78 (140) T ss_pred CeeeeeeeeeeeC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccch Confidence 4433 333 11111 11111111 110 111333444333221 1111 3568899 Q ss_pred HHHhcCCC-----------------------------CCCCchhhhHHHHHH---HHHHHHH Q lcl|NC_019457. 52 AIQHFGND-----------------------------RIPARPWLDVGVASV---NDEILDT 81 (156) Q Consensus 52 ~~~E~G~~-----------------------------~IP~RpFlr~~~~~~---~~~~~~~ 81 (156) .+.||||. ..+|||||++++++. ...+..- T Consensus 79 ~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:97 79 APVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred hhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 99999973 256999999999874 3444333 No 140 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=85.97 E-value=0.0013 Score=36.33 Aligned_cols=79 Identities=19% Similarity=0.172 Sum_probs=37.6 Q ss_pred Ceee------eecCcHHHHH-----HHHHHHHH----Hhc----CCcEEEEeccCCCCC--CCCC--------CCHHHHH Q lcl|NC_019457. 1 MIKI------TVPNFDAVRD-----ELTKALNK----LNS----DEFVTVGIHEADNAR--PEGV--------LTNAQLG 51 (156) Q Consensus 1 m~~v------~~~~~~~~~~-----~l~~~l~~----l~~----~~~V~VGi~~~~~~~--~~~g--------~~~A~ia 51 (156) |+++ +++ ...++ .+.+.+++ +.+ .-.|.-|-+..+-.. .++| .+++.+| T Consensus 1 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA 78 (140) T protein:vir:10 1 MATIRARARIEID--EAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYA 78 (140) T ss_pred CeeeeeeeeeeeC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccch Confidence 4433 333 11111 11111111 110 111333444333221 1111 3568899 Q ss_pred HHHhcCCC-----------------------------CCCCchhhhHHHHHH---HHHHHHH Q lcl|NC_019457. 52 AIQHFGND-----------------------------RIPARPWLDVGVASV---NDEILDT 81 (156) Q Consensus 52 ~~~E~G~~-----------------------------~IP~RpFlr~~~~~~---~~~~~~~ 81 (156) .+.||||. ..+|||||++++++. ...+..- T Consensus 79 ~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:10 79 APVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred hhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 99999973 256999999999874 3444333 No 141 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=85.84 E-value=0.0046 Score=33.40 Aligned_cols=65 Identities=28% Similarity=0.393 Sum_probs=37.5 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 66 WLDVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 66 Flr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) +=-.++ +++.+.|++.. .....+.+|...+..+++.+|.. + | +|||.|++||++ T Consensus 1 i~i~Gl----d~l~~~l~~~~-~~~~~~~al~~~a~~i~~~ak~~---------a-----------p-vdTG~Lr~si~~ 54 (108) T protein:vir:98 1 MKITGI----DALQKKLRKNA-TLNDVKHVVKRNTVSMNKNMQNL---------A-----------P-VDTGNMKRSITS 54 (108) T ss_pred CcchhH----HHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHh---------C-----------C-CCchhhHhhcee Confidence 111233 33333443321 12235667777777777766642 1 2 589999999999 Q ss_pred eecccccccCC Q lcl|NC_019457. 146 NIQTGRPSEGL 156 (156) Q Consensus 146 ~V~~~k~~~g~ 156 (156) .+.++.++.-+ T Consensus 55 ~~~~~~~~~~V 65 (108) T protein:vir:98 55 EFTDGGLTGTT 65 (108) T ss_pred eeecCceEEEe Confidence 99876654333 No 142 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=85.59 E-value=0.0039 Score=33.82 Aligned_cols=65 Identities=12% Similarity=0.028 Sum_probs=30.5 Q ss_pred hHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeee Q lcl|NC_019457. 68 DVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTYNI 147 (156) Q Consensus 68 r~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty~V 147 (156) -.++++-..++.+.-..+ ....+++|...+..++..+|. + .| +|||.|++||++.+ T Consensus 1 i~Gld~l~~~l~~~~~~~---~~~v~~al~~~a~~i~~~ak~---------~-----------aP-v~TG~Lr~sI~~~~ 56 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSV---RIAVDKELSKSAARIERQAKI---------L-----------AP-VDTGWLRAQIYSEQ 56 (108) T ss_pred CchHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh---------c-----------CC-cCchhhhcceeeee Confidence 233444333332222111 111234444444444443332 1 22 68999999999888 Q ss_pred ccc-ccccCC Q lcl|NC_019457. 148 QTG-RPSEGL 156 (156) Q Consensus 148 ~~~-k~~~g~ 156 (156) .++ .+++|- T Consensus 57 ~~~~~~~v~~ 66 (108) T protein:vir:99 57 QRLLHYRVVS 66 (108) T ss_pred cCcEEEEeec Confidence 643 333333 No 143 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=84.60 E-value=0.0029 Score=34.52 Aligned_cols=69 Identities=19% Similarity=0.207 Sum_probs=35.1 Q ss_pred hhH-HHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhce Q lcl|NC_019457. 67 LDV-GVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVT 144 (156) Q Consensus 67 lr~-~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIt 144 (156) |-. .+.-+.+++.+.+++.... ....+.+|+..+..+++.++.. + | +|||.|++||+ T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~---------a-----------P-v~TG~Lr~SI~ 59 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGL---------C-----------P-VDTGRLRSSIQ 59 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---------C-----------C-ccchhhhccce Confidence 211 1111234444444444332 1234566666666655554332 1 2 68999999999 Q ss_pred eeecccccc--cCC Q lcl|NC_019457. 145 YNIQTGRPS--EGL 156 (156) Q Consensus 145 y~V~~~k~~--~g~ 156 (156) ++|...... ..+ T Consensus 60 ~~~~~~g~~~~~~v 73 (142) T protein:vir:94 60 AVPSGGRFSFSVTI 73 (142) T ss_pred eeeccCCceEEEEE Confidence 998754322 112 No 144 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=83.51 E-value=0.0046 Score=33.40 Aligned_cols=79 Identities=15% Similarity=0.204 Sum_probs=48.6 Q ss_pred eeeeecCcHHHHHHHHHHH------------------------HH-H---------------------hcCCcEEEEecc Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKAL------------------------NK-L---------------------NSDEFVTVGIHE 35 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l------------------------~~-l---------------------~~~~~V~VGi~~ 35 (156) |+|++++.+..++.|.+.+ +. + .+.+.|+|||-+ T Consensus 1 MsvevkGv~eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWIKGKRTVTIRWRG 80 (134) T ss_pred CeEEeecHHHHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeecCCceEEEEEEEc Confidence 8888888888777766541 10 0 011346666643 Q ss_pred CC-CCCCCCCCCHHHHHHHHhcCCCCCCCchhhhH--------HHHHHHHHHHHHHHHHHhcc Q lcl|NC_019457. 36 AD-NARPEGVLTNAQLGAIQHFGNDRIPARPWLDV--------GVASVNDEILDTIAASLEDG 89 (156) Q Consensus 36 ~~-~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~--------~~~~~~~~~~~~~~~~~~~~ 89 (156) .. .| -|--+||||..+-...+|++| +++..+..+.+.++.-+..= T Consensus 81 ~~~R~---------~ivHLnE~Gyt~~r~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 81 PFERF---------RIVHLIENGHVEKKSGKFVKPKAMGGINRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred CCcee---------eEEEeeecceeecCCCCeeccchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 22 22 245679999987777777777 77777777766665544332 No 145 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=82.78 E-value=0.0097 Score=31.62 Aligned_cols=80 Identities=16% Similarity=0.284 Sum_probs=45.5 Q ss_pred eeeeecCcHHHHHHHHHHH------------------------H-HH---------------------hcCCcEEEEecc Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKAL------------------------N-KL---------------------NSDEFVTVGIHE 35 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l------------------------~-~l---------------------~~~~~V~VGi~~ 35 (156) |+|.+++.+..++.|.+.+ + .+ .+.+.|+|||-+ T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRG 80 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEc Confidence 8888888887776665541 1 00 112346677643 Q ss_pred CCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhH--------HHHHHHHHHHHHHHHHHhcc Q lcl|NC_019457. 36 ADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDV--------GVASVNDEILDTIAASLEDG 89 (156) Q Consensus 36 ~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~--------~~~~~~~~~~~~~~~~~~~~ 89 (156) ..+ + --|--+||||..+--..+|++| +++..+..+.+.++.-+..= T Consensus 81 ~~~-R-------~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:95 81 SKD-R-------YKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred CCc-e-------eEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 221 1 1255789999654333444444 77777777766665544332 No 146 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=82.78 E-value=0.0097 Score=31.62 Aligned_cols=80 Identities=16% Similarity=0.284 Sum_probs=45.5 Q ss_pred eeeeecCcHHHHHHHHHHH------------------------H-HH---------------------hcCCcEEEEecc Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKAL------------------------N-KL---------------------NSDEFVTVGIHE 35 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l------------------------~-~l---------------------~~~~~V~VGi~~ 35 (156) |+|.+++.+..++.|.+.+ + .+ .+.+.|+|||-+ T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRG 80 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEc Confidence 8888888887776665541 1 00 112346677643 Q ss_pred CCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhH--------HHHHHHHHHHHHHHHHHhcc Q lcl|NC_019457. 36 ADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDV--------GVASVNDEILDTIAASLEDG 89 (156) Q Consensus 36 ~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~--------~~~~~~~~~~~~~~~~~~~~ 89 (156) ..+ + --|--+||||..+--..+|++| +++..+..+.+.++.-+..= T Consensus 81 ~~~-R-------~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 81 SKD-R-------YKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred CCc-e-------eEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 221 1 1255789999654333444444 77777777766665544332 No 147 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=81.76 E-value=0.0077 Score=32.17 Aligned_cols=73 Identities=12% Similarity=0.099 Sum_probs=33.4 Q ss_pred CCchhhhHHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHH Q lcl|NC_019457. 62 PARPWLDVGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMK 140 (156) Q Consensus 62 P~RpFlr~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~ 140 (156) =+|..++-.+ +-.+++.+.+++.-.. ....+++|...+..+++.++.. + =+|||.|+ T Consensus 1 m~~ms~~i~~-~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~---------a------------pv~TG~Lr 58 (144) T protein:vir:59 1 MALMSVRIDP-SWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASL---------A------------PVDEGNLK 58 (144) T ss_pred CCcceeeehh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---------C------------Cccchhhh Confidence 2222221100 1112233222222111 1123455555555555544432 1 15899999 Q ss_pred hhceeeecccccccCC Q lcl|NC_019457. 141 QSVTYNIQTGRPSEGL 156 (156) Q Consensus 141 ~SIty~V~~~k~~~g~ 156 (156) +||++++..+.++.-| T Consensus 59 ~SI~~~~~~~g~~~~V 74 (144) T protein:vir:59 59 NSIQIDYKNNGLTAEI 74 (144) T ss_pred cCeeEEeecCcEEEEE Confidence 9999998766543222 No 148 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=81.62 E-value=0.0066 Score=32.53 Aligned_cols=68 Identities=21% Similarity=0.318 Sum_probs=39.5 Q ss_pred hhHHHHH-HHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 67 LDVGVAS-VNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 67 lr~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) |+.++.- --+++.+.|.+.. .....+++|...+..++..++.. +| +|||.|++||+. T Consensus 1 M~~~i~i~Gld~l~~~L~~~~-~~~~~~~al~~~~~~i~~~ak~~---------aP------------vdTG~Lr~si~~ 58 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAA-SLKGVQQVVKSNTSNMTANMQKL---------VP------------VDTGYMKRSIKM 58 (112) T ss_pred CceeeeehhHHHHHHHHHhhh-hHHHHHHHHHHHHHHHHHHHHHh---------CC------------CCchhhhhceee Confidence 6665542 2355555555422 12235666666666666665532 11 689999999998 Q ss_pred eeccccc--ccCC Q lcl|NC_019457. 146 NIQTGRP--SEGL 156 (156) Q Consensus 146 ~V~~~k~--~~g~ 156 (156) .+.++.. ++|- T Consensus 59 ~~~~~~~~~~V~~ 71 (112) T protein:vir:36 59 ELTEGGFSGQAGP 71 (112) T ss_pred eecCCceEEEeec Confidence 8876543 3333 No 149 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=79.29 E-value=0.016 Score=30.39 Aligned_cols=72 Identities=15% Similarity=0.139 Sum_probs=40.0 Q ss_pred hhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhceee Q lcl|NC_019457. 67 LDVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTYN 146 (156) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty~ 146 (156) |-+---+--+++.+.|++.. .....++++...+...+..++..... ++ | +|||.|++||+++ T Consensus 1 Ma~i~i~Gld~L~~~l~~~~-~~~~v~~~v~~~~~~~~~~~~~~a~~-----~a-----------p-vdTG~Lr~sI~~~ 62 (112) T protein:vir:96 1 MATIEFEGLDEMAQSLLKNA-SSERRSKVLRKYGAKLKEAAVSKAQF-----KK-----------G-YSTGATRRSITLE 62 (112) T ss_pred CceeeehHHHHHHHHHHhhc-CHHHHHHHHHHHHHHHHHHHHHHhhh-----cC-----------C-CCchhhhhceeee Confidence 22200011233444443222 12345677777777777777765532 11 2 6999999999987 Q ss_pred ecccccccCC Q lcl|NC_019457. 147 IQTGRPSEGL 156 (156) Q Consensus 147 V~~~k~~~g~ 156 (156) ..+..+.+|- T Consensus 63 ~~~~~~~v~~ 72 (112) T protein:vir:96 63 AGSDRAVVEA 72 (112) T ss_pred cCceEEEecC Confidence 6655555555 No 150 >protein:vir:78755 Length: 228 # NCBI annotation: putative tail completion protein # Family: family:all:743 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285651;genbank:gi:148727157;genbank:GeneID:5220102 Probab=78.34 E-value=0.028 Score=29.09 Aligned_cols=121 Identities=8% Similarity=0.053 Sum_probs=38.0 Q ss_pred CeeeeecCcHHHHHHHHHHHHHHh-cCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCC----------CCCchhh-- Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKLN-SDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDR----------IPARPWL-- 67 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l~-~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~----------IP~RpFl-- 67 (156) .-. +-..-.+++..|.+.|+-.+ ....+.|||..+... ..++.||++|.||... -|.||=. T Consensus 55 ~~p-RKr~krKMl~~L~k~Lk~~~~~~~~a~v~f~~~~~~-----~~~~rIA~vHq~G~~~~v~~~~~~~~~~~r~~~~~ 128 (228) T protein:vir:78 55 WAP-RKRGKRKMLRGLPKLLQIREPRQDMAELGFTKGTMS-----AHAGVIANTHQKGHTYKVTAASRRRIAPSDVGKNK 128 (228) T ss_pred Chh-hhhhHHHHHhhhHHhhhhhcccccceEEEeecCccc-----chHHHHHHHHhcCcccccccchhhhhhcccCCCCC Confidence 100 00111223344445444222 223578888764321 2578999999999310 1111110 Q ss_pred --------------------------hHHHHHHHHHH----HHHHHHHHhcc---------ccHHHHHHHHHHHHHHHHH Q lcl|NC_019457. 68 --------------------------DVGVASVNDEI----LDTIAASLEDG---------EDISQLLNRVGVVAVAGVQ 108 (156) Q Consensus 68 --------------------------r~~~~~~~~~~----~~~~~~~~~~~---------~~~~~~l~~iG~~~~~~i~ 108 (156) +|++....+.+ .-++-..|.+. .-+..+|-.=.......+. T Consensus 129 paTr~QAk~Lr~lGy~~~~~~~k~~rkps~kwI~~nls~gqAgliir~L~~k~~k~~W~I~~PaR~FLG~s~~e~~~~l~ 208 (228) T protein:vir:78 129 QASKAQARKLRELGFKRPGKRKRAYRSASLGWITANLNYAQAGLLIKKLKDEPVKESWEIQLPARPFLGANARQRQQAFA 208 (228) T ss_pred CCCHHHHHHHHHhhccccCCcCCCcccCCHHHHHHHhhHHHHHHHHHHHhCCCCccceeeecCcccccCCCHHHHHHHHH Confidence 00000000000 00111112210 0011111111111111111 Q ss_pred HHHHcCCC-CCCcHHHHHhc Q lcl|NC_019457. 109 NYIDELRS-PANAPSTVERK 127 (156) Q Consensus 109 ~~I~~~~~-ppns~~Ti~~K 127 (156) ..+....| ..-.+.-|+.| T Consensus 209 ~~l~~i~~g~~~~~qd~~~~ 228 (228) T protein:vir:78 209 LRPESIDYGWDVNKQDMKGK 228 (228) T ss_pred HHHHhcccCCCcchhhccCC Confidence 11111111 12223334444 No 151 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=77.73 E-value=0.017 Score=30.26 Aligned_cols=80 Identities=13% Similarity=0.163 Sum_probs=34.6 Q ss_pred EEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHH Q lcl|NC_019457. 29 VTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGV 107 (156) Q Consensus 29 V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i 107 (156) .+|-.. +-|--.+|.+ +| +++ ++.+.|++.-.. ....+++++..+..+++.+ T Consensus 1 ~~~~~~------~~~~~~Ma~v----~~-------------Gld----~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~a 53 (149) T protein:vir:10 1 MKLNYY------DLSRCHMAKV----KY-------------GAD----SMVVELDKFDKKIEEWVKKGIAKTTTKIYNTA 53 (149) T ss_pred Ceeeee------ccchhhhHHH----HH-------------HHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111 1010112111 11 222 222222222111 1133455555555555554 Q ss_pred HHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeecccccccCC Q lcl|NC_019457. 108 QNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTYNIQTGRPSEGL 156 (156) Q Consensus 108 ~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty~V~~~k~~~g~ 156 (156) +... | +|||.|++||++++.++.++..+ T Consensus 54 k~~a--------------------P-vdTG~L~~SI~~~~~~~g~~~~V 81 (149) T protein:vir:10 54 VALA--------------------P-VDLGFLEESIDFKYFDGGLSSVI 81 (149) T ss_pred HHhC--------------------C-cccchhhccceEEecCCcEEEEE Confidence 4321 1 68999999999999876554333 No 152 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=75.37 E-value=0.016 Score=30.49 Aligned_cols=68 Identities=15% Similarity=0.063 Sum_probs=38.1 Q ss_pred hhHHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 67 LDVGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) |.-.++ --+++.+.|++.-.. ....+.+|...|..++..+|... | +|||.|++||+. T Consensus 1 msi~i~-Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~a------P---------------v~TG~Lr~sI~~ 58 (114) T protein:vir:95 1 MAIKWQ-GIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLA------P---------------KDTEFLKDHITT 58 (114) T ss_pred Ceeeee-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC------C---------------cCchhhhhceee Confidence 443332 123444444332211 12245667777777666665542 1 689999999998 Q ss_pred eecccccccCC Q lcl|NC_019457. 146 NIQTGRPSEGL 156 (156) Q Consensus 146 ~V~~~k~~~g~ 156 (156) +..+-..++|. T Consensus 59 ~~~g~~~~V~~ 69 (114) T protein:vir:95 59 SYPGMEAHIHG 69 (114) T ss_pred ecCceEEEeec Confidence 87655555555 No 153 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=74.67 E-value=0.024 Score=29.45 Aligned_cols=65 Identities=25% Similarity=0.347 Sum_probs=37.2 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_019457. 66 WLDVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTY 145 (156) Q Consensus 66 Flr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty 145 (156) +=-.++ +++.+.|++.. .....+.+|...|..++.++|..- | +|||.|++||++ T Consensus 1 i~i~Gl----d~l~~~l~~~~-~~~~~~~al~~~a~~i~~~ak~~a--------------------P-v~TG~Lr~si~~ 54 (108) T protein:vir:74 1 MKITGI----DALQKKLRKNA-TLDDVKHVVKSNTASMNKNMQNLA--------------------P-VDTGNMKRSITS 54 (108) T ss_pred CcchhH----HHHHHHHHHhh-hHHHHHHHHHHHHHHHHHHHHHhC--------------------C-CCchhhhcccee Confidence 111122 33344443321 123456777777777777665421 2 589999999999 Q ss_pred eecccccccCC Q lcl|NC_019457. 146 NIQTGRPSEGL 156 (156) Q Consensus 146 ~V~~~k~~~g~ 156 (156) ++.++..+.-+ T Consensus 55 ~~~~~~~~~~V 65 (108) T protein:vir:74 55 EFTDGGLSGTT 65 (108) T ss_pred eeecCceEEEe Confidence 99876543333 No 154 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=73.79 E-value=0.012 Score=31.13 Aligned_cols=71 Identities=13% Similarity=0.204 Sum_probs=32.5 Q ss_pred hh-HHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhce Q lcl|NC_019457. 67 LD-VGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVT 144 (156) Q Consensus 67 lr-~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIt 144 (156) +. .++ +++.+.|++.-.. ......++..-|..++...+..-. ..++.| +|||.|++||+ T Consensus 1 i~~~Gl----d~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~~TG~Lr~sI~ 61 (115) T protein:vir:96 1 MNIDGL----DALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-YWTGNLSRNIR 61 (115) T ss_pred CcchhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-CCchhhhhcce Confidence 11 112 2222222211100 112345555555555555544321 123333 79999999999 Q ss_pred eeeccc-ccccCC Q lcl|NC_019457. 145 YNIQTG-RPSEGL 156 (156) Q Consensus 145 y~V~~~-k~~~g~ 156 (156) +...++ ..++|- T Consensus 62 ~~~~g~~~~~v~~ 74 (115) T protein:vir:96 62 YKKTGDLQYTITS 74 (115) T ss_pred eeecCceEEEeec Confidence 986543 233333 No 155 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=73.79 E-value=0.012 Score=31.13 Aligned_cols=71 Identities=13% Similarity=0.204 Sum_probs=32.5 Q ss_pred hh-HHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhce Q lcl|NC_019457. 67 LD-VGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVT 144 (156) Q Consensus 67 lr-~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIt 144 (156) +. .++ +++.+.|++.-.. ......++..-|..++...+..-. ..++.| +|||.|++||+ T Consensus 1 i~~~Gl----d~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~~TG~Lr~sI~ 61 (115) T protein:vir:10 1 MNIDGL----DALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-YWTGNLSRNIR 61 (115) T ss_pred CcchhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-CCchhhhhcce Confidence 11 112 2222222211100 112345555555555555544321 123333 79999999999 Q ss_pred eeeccc-ccccCC Q lcl|NC_019457. 145 YNIQTG-RPSEGL 156 (156) Q Consensus 145 y~V~~~-k~~~g~ 156 (156) +...++ ..++|- T Consensus 62 ~~~~g~~~~~v~~ 74 (115) T protein:vir:10 62 YKKTGDLQYTITS 74 (115) T ss_pred eeecCceEEEeec Confidence 986543 233333 No 156 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=73.79 E-value=0.012 Score=31.13 Aligned_cols=71 Identities=13% Similarity=0.204 Sum_probs=32.5 Q ss_pred hh-HHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhce Q lcl|NC_019457. 67 LD-VGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVT 144 (156) Q Consensus 67 lr-~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIt 144 (156) +. .++ +++.+.|++.-.. ......++..-|..++...+..-. ..++.| +|||.|++||+ T Consensus 1 i~~~Gl----d~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~~TG~Lr~sI~ 61 (115) T protein:vir:78 1 MNIDGL----DALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-YWTGNLSRNIR 61 (115) T ss_pred CcchhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-CCchhhhhcce Confidence 11 112 2222222211100 112345555555555555544321 123333 79999999999 Q ss_pred eeeccc-ccccCC Q lcl|NC_019457. 145 YNIQTG-RPSEGL 156 (156) Q Consensus 145 y~V~~~-k~~~g~ 156 (156) +...++ ..++|- T Consensus 62 ~~~~g~~~~~v~~ 74 (115) T protein:vir:78 62 YKKTGDLQYTITS 74 (115) T ss_pred eeecCceEEEeec Confidence 986543 233333 No 157 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=73.79 E-value=0.012 Score=31.13 Aligned_cols=71 Identities=13% Similarity=0.204 Sum_probs=32.5 Q ss_pred hh-HHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhce Q lcl|NC_019457. 67 LD-VGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVT 144 (156) Q Consensus 67 lr-~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIt 144 (156) +. .++ +++.+.|++.-.. ......++..-|..++...+..-. ..++.| +|||.|++||+ T Consensus 1 i~~~Gl----d~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~~TG~Lr~sI~ 61 (115) T protein:vir:96 1 MNIDGL----DALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-YWTGNLSRNIR 61 (115) T ss_pred CcchhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-CCchhhhhcce Confidence 11 112 2222222211100 112345555555555555544321 123333 79999999999 Q ss_pred eeeccc-ccccCC Q lcl|NC_019457. 145 YNIQTG-RPSEGL 156 (156) Q Consensus 145 y~V~~~-k~~~g~ 156 (156) +...++ ..++|- T Consensus 62 ~~~~g~~~~~v~~ 74 (115) T protein:vir:96 62 YKKTGDLQYTITS 74 (115) T ss_pred eeecCceEEEeec Confidence 986543 233333 No 158 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=73.79 E-value=0.012 Score=31.13 Aligned_cols=71 Identities=13% Similarity=0.204 Sum_probs=32.5 Q ss_pred hh-HHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhce Q lcl|NC_019457. 67 LD-VGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVT 144 (156) Q Consensus 67 lr-~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIt 144 (156) +. .++ +++.+.|++.-.. ......++..-|..++...+..-. ..++.| +|||.|++||+ T Consensus 1 i~~~Gl----d~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~~TG~Lr~sI~ 61 (115) T protein:vir:97 1 MNIDGL----DALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-YWTGNLSRNIR 61 (115) T ss_pred CcchhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-CCchhhhhcce Confidence 11 112 2222222211100 112345555555555555544321 123333 79999999999 Q ss_pred eeeccc-ccccCC Q lcl|NC_019457. 145 YNIQTG-RPSEGL 156 (156) Q Consensus 145 y~V~~~-k~~~g~ 156 (156) +...++ ..++|- T Consensus 62 ~~~~g~~~~~v~~ 74 (115) T protein:vir:97 62 YKKTGDLQYTITS 74 (115) T ss_pred eeecCceEEEeec Confidence 986543 233333 No 159 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=73.79 E-value=0.012 Score=31.13 Aligned_cols=71 Identities=13% Similarity=0.204 Sum_probs=32.5 Q ss_pred hh-HHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhce Q lcl|NC_019457. 67 LD-VGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVT 144 (156) Q Consensus 67 lr-~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIt 144 (156) +. .++ +++.+.|++.-.. ......++..-|..++...+..-. ..++.| +|||.|++||+ T Consensus 1 i~~~Gl----d~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~~TG~Lr~sI~ 61 (115) T protein:vir:93 1 MNIDGL----DALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-YWTGNLSRNIR 61 (115) T ss_pred CcchhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-CCchhhhhcce Confidence 11 112 2222222211100 112345555555555555544321 123333 79999999999 Q ss_pred eeeccc-ccccCC Q lcl|NC_019457. 145 YNIQTG-RPSEGL 156 (156) Q Consensus 145 y~V~~~-k~~~g~ 156 (156) +...++ ..++|- T Consensus 62 ~~~~g~~~~~v~~ 74 (115) T protein:vir:93 62 YKKTGDLQYTITS 74 (115) T ss_pred eeecCceEEEeec Confidence 986543 233333 No 160 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=71.95 E-value=0.0089 Score=31.84 Aligned_cols=80 Identities=13% Similarity=0.121 Sum_probs=34.5 Q ss_pred EEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHHhc-cccHHHHHHHHHHHHHHHH Q lcl|NC_019457. 29 VTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAASLED-GEDISQLLNRVGVVAVAGV 107 (156) Q Consensus 29 V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i 107 (156) .+|-... -.|-.|-.. ..--+++.+.|+++-.. ....+++|...+..+++.+ T Consensus 1 ~~~~~~~--------------------------~~~~~Ma~~-~~Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~a 53 (149) T protein:vir:94 1 MKLSYYD--------------------------LSRCHMAKV-KYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTA 53 (149) T ss_pred Ceeeeee--------------------------cchhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111111 112222110 01112222222221111 1133455555555555544 Q ss_pred HHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeecccccccCC Q lcl|NC_019457. 108 QNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVTYNIQTGRPSEGL 156 (156) Q Consensus 108 ~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIty~V~~~k~~~g~ 156 (156) +... | +|||.|++||++++.++.++-.+ T Consensus 54 k~~a--------------------P-vdTG~Lr~SI~~~~~~~g~~~~V 81 (149) T protein:vir:94 54 VALA--------------------P-VDLGFLEESIDFKYFDGGLSSVI 81 (149) T ss_pred HHhC--------------------C-cccchhhcCeeEEeeCCcEEEEE Confidence 4221 2 58999999999998876554333 No 161 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=71.60 E-value=0.028 Score=29.07 Aligned_cols=82 Identities=12% Similarity=0.086 Sum_probs=42.9 Q ss_pred CeeeeecCcHHHHHHHHHHHHHH----------------------hcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCC Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKL----------------------NSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGN 58 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l----------------------~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~ 58 (156) |.++.-+.....-+.+.+.|+.- .+.+.|+|||.+ ..|+ |=-+||||. T Consensus 32 ~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~~~~G~r~V~igW~G-pR~~---------ivHLNE~Gy 101 (138) T protein:vir:98 32 VNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGFTT-PRWN---------IVHLQELEY 101 (138) T ss_pred hhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCeeecCCceEEEEeeec-Ceee---------EEeeecccc Confidence 44433333222222233333221 113467777753 3442 457899998 Q ss_pred CC-CCCchh--hhHHHHHHHHHHHHHHHHHHhccccH Q lcl|NC_019457. 59 DR-IPARPW--LDVGVASVNDEILDTIAASLEDGEDI 92 (156) Q Consensus 59 ~~-IP~RpF--lr~~~~~~~~~~~~~~~~~~~~~~~~ 92 (156) .. |-||-| ++.+++..+..+.+.++..+....+. T Consensus 102 Gk~i~PrG~G~I~ka~~~se~~y~~~vk~el~k~l~~ 138 (138) T protein:vir:98 102 GWKHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFDG 138 (138) T ss_pred cCCcCCCcchHHHHHHHhhhHHHHHHHHHHHHHHhcC Confidence 64 667765 88888887777766554444332222 No 162 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=69.80 E-value=0.034 Score=28.64 Aligned_cols=64 Identities=19% Similarity=0.124 Sum_probs=30.1 Q ss_pred CCCCchhhhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHH Q lcl|NC_019457. 60 RIPARPWLDVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEM 139 (156) Q Consensus 60 ~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L 139 (156) -|-.|+- -++..+.+++ +...+++++.++..++...|. |+ | +|||+| T Consensus 1 ~~~~~~~------l~~~~l~~~~------~~~~~~~~~~~a~~ve~~ak~---------~a-----------P-v~TG~L 47 (137) T protein:vir:10 1 MVAHTLR------IERAQLHGLG------MDEARKAVNRVVRRTFTRSQI---------LA-----------P-VDTGYL 47 (137) T ss_pred Ccccccc------cChhhHhhHH------HHHHHHHHHHHHHHHHHHHHh---------cC-----------C-cCchhh Confidence 0111111 1112222222 223455566666655555443 11 2 789999 Q ss_pred Hhhceeeecccc---cccCC Q lcl|NC_019457. 140 KQSVTYNIQTGR---PSEGL 156 (156) Q Consensus 140 ~~SIty~V~~~k---~~~g~ 156 (156) ++||++.+.+.. +.-++ T Consensus 48 r~SI~~~~~~~~g~~v~~~V 67 (137) T protein:vir:10 48 RASGRLVLGRERGAVVIGSV 67 (137) T ss_pred hccceeeeeeccccEEEEEe Confidence 999999885322 22122 No 163 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=68.26 E-value=0.068 Score=27.00 Aligned_cols=78 Identities=13% Similarity=0.208 Sum_probs=49.2 Q ss_pred eeeeecCcHHHHHHHHHHHHH-----H--------------------h-----------------------cCCcEEEEe Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKALNK-----L--------------------N-----------------------SDEFVTVGI 33 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l~~-----l--------------------~-----------------------~~~~V~VGi 33 (156) |+|.+++.+..++.|.+.+-+ + . ..++|+||| T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:96 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 888888888777666553210 0 0 013355565 Q ss_pred ccCC-CCCCCCCCCHHHHHHHHhcCCCC----CCCchh--hhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 34 HEAD-NARPEGVLTNAQLGAIQHFGNDR----IPARPW--LDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 34 ~~~~-~~~~~~g~~~A~ia~~~E~G~~~----IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 88 (156) .+.. .| .|=-+||||... |-||-| ++.+++..+..+.+.++.-+.. T Consensus 81 ~gp~~R~---------~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:96 81 VGPMNRK---------NIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ecCCCce---------eEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 4321 22 245689999643 678888 7888888888887777766655 No 164 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=68.26 E-value=0.068 Score=27.00 Aligned_cols=78 Identities=13% Similarity=0.208 Sum_probs=49.2 Q ss_pred eeeeecCcHHHHHHHHHHHHH-----H--------------------h-----------------------cCCcEEEEe Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKALNK-----L--------------------N-----------------------SDEFVTVGI 33 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l~~-----l--------------------~-----------------------~~~~V~VGi 33 (156) |+|.+++.+..++.|.+.+-+ + . ..++|+||| T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:93 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 888888888777666553210 0 0 013355565 Q ss_pred ccCC-CCCCCCCCCHHHHHHHHhcCCCC----CCCchh--hhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 34 HEAD-NARPEGVLTNAQLGAIQHFGNDR----IPARPW--LDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 34 ~~~~-~~~~~~g~~~A~ia~~~E~G~~~----IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 88 (156) .+.. .| .|=-+||||... |-||-| ++.+++..+..+.+.++.-+.. T Consensus 81 ~gp~~R~---------~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 81 VGPMNRK---------NIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ecCCCce---------eEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 4321 22 245689999643 678888 7888888888887777766655 No 165 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=68.26 E-value=0.068 Score=27.00 Aligned_cols=78 Identities=13% Similarity=0.208 Sum_probs=49.2 Q ss_pred eeeeecCcHHHHHHHHHHHHH-----H--------------------h-----------------------cCCcEEEEe Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKALNK-----L--------------------N-----------------------SDEFVTVGI 33 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l~~-----l--------------------~-----------------------~~~~V~VGi 33 (156) |+|.+++.+..++.|.+.+-+ + . ..++|+||| T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:78 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 888888888777666553210 0 0 013355565 Q ss_pred ccCC-CCCCCCCCCHHHHHHHHhcCCCC----CCCchh--hhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 34 HEAD-NARPEGVLTNAQLGAIQHFGNDR----IPARPW--LDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 34 ~~~~-~~~~~~g~~~A~ia~~~E~G~~~----IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 88 (156) .+.. .| .|=-+||||... |-||-| ++.+++..+..+.+.++.-+.. T Consensus 81 ~gp~~R~---------~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:78 81 VGPMNRK---------NIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ecCCCce---------eEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 4321 22 245689999643 678888 7888888888887777766655 No 166 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=68.26 E-value=0.068 Score=27.00 Aligned_cols=78 Identities=13% Similarity=0.208 Sum_probs=49.2 Q ss_pred eeeeecCcHHHHHHHHHHHHH-----H--------------------h-----------------------cCCcEEEEe Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKALNK-----L--------------------N-----------------------SDEFVTVGI 33 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l~~-----l--------------------~-----------------------~~~~V~VGi 33 (156) |+|.+++.+..++.|.+.+-+ + . ..++|+||| T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:94 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEe Confidence 888888888777666553210 0 0 013355565 Q ss_pred ccCC-CCCCCCCCCHHHHHHHHhcCCCC----CCCchh--hhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 34 HEAD-NARPEGVLTNAQLGAIQHFGNDR----IPARPW--LDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 34 ~~~~-~~~~~~g~~~A~ia~~~E~G~~~----IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 88 (156) .+.. .| .|=-+||||... |-||-| ++.+++..+..+.+.++.-+.. T Consensus 81 ~gp~~R~---------~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:94 81 VGPMNRK---------NIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ecCCCce---------eEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 4321 22 245689999643 678888 7888888888887777766655 No 167 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=67.47 E-value=0.022 Score=29.62 Aligned_cols=83 Identities=10% Similarity=0.081 Sum_probs=45.1 Q ss_pred ecCcHHHHHHHHH------------HHHHHh----cCCcEEE-Eecc----CCCCC------CCCCCC--------HHHH Q lcl|NC_019457. 6 VPNFDAVRDELTK------------ALNKLN----SDEFVTV-GIHE----ADNAR------PEGVLT--------NAQL 50 (156) Q Consensus 6 ~~~~~~~~~~l~~------------~l~~l~----~~~~V~V-Gi~~----~~~~~------~~~g~~--------~A~i 50 (156) +.+.++..++|++ ...+|. .....-| |+.. |..-+ .++|++ .+++ T Consensus 1 i~G~~~L~~~Lk~~s~~dvk~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g~~~~vgp~g~t~dY 80 (127) T protein:vir:98 1 MTGMPALEVKLRSMSEKRWDRVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNSSKDVITGNFGYIKDY 80 (127) T ss_pred CcChHHHHHHHHHhhHHHHHHHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEecCCceEEeccCcccccc Confidence 2332222222221 111111 1000112 1211 11111 123332 4789 Q ss_pred HHHHhcCCC---------CCCCchhhhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 51 GAIQHFGND---------RIPARPWLDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 51 a~~~E~G~~---------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 88 (156) |-..|||+. -.|+-|||.|+|+.++..+.+-|...++. T Consensus 81 apyvEyGTR~m~~~~~~gf~~aqp~l~paf~~Qk~iF~~DL~~l~k~ 127 (127) T protein:vir:98 81 APHVEYGHRIVRNGKQVGYANGTKYLFNNVKKQREIYRQDMLNELRR 127 (127) T ss_pred cceeecceeeeecccccccccCccccccchHHHhHHHHHHHHHHhcC Confidence 999999987 37899999999999999998888777766 No 168 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=67.17 E-value=0.017 Score=30.26 Aligned_cols=92 Identities=13% Similarity=0.183 Sum_probs=39.1 Q ss_pred CeeeeecCcHHHHHHHHHHHH-HH-----------h--------cCCcEEEEeccCC-----C------CCCCCC----- Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALN-KL-----------N--------SDEFVTVGIHEAD-----N------ARPEGV----- 44 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~-~l-----------~--------~~~~V~VGi~~~~-----~------~~~~~g----- 44 (156) |-++..+++++..++|.++.. ++ . ..-.|.-|-+..+ . +...++ T Consensus 4 ~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~g~~~~v~v 83 (141) T protein:vir:79 4 WGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQGNNYIIEV 83 (141) T ss_pred CccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccceeecCCeeEEEE Confidence 333444444443333322111 00 0 0001222322211 1 111111 Q ss_pred CCHHHHHHHHhcCCCCCCCchhhhHH------HHHHHHHHHHHH----HHHHhccccH Q lcl|NC_019457. 45 LTNAQLGAIQHFGNDRIPARPWLDVG------VASVNDEILDTI----AASLEDGEDI 92 (156) Q Consensus 45 ~~~A~ia~~~E~G~~~IP~RpFlr~~------~~~~~~~~~~~~----~~~~~~~~~~ 92 (156) .+++.+|-+-|||+...|+|||.... +.+.+..+.+.+ .+.|.+=.++ T Consensus 84 ~n~~~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~~~~~~ 141 (141) T protein:vir:79 84 VNPTEYASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLKGVFDA 141 (141) T ss_pred ecCCcchhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 25578999999999887777776554 444444443333 2333332222 No 169 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=64.77 E-value=0.048 Score=27.81 Aligned_cols=80 Identities=6% Similarity=0.135 Sum_probs=48.6 Q ss_pred eeeeecCcHHHHHHHHHHH------------------------HH----------------------HhcCCcEEEEecc Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKAL------------------------NK----------------------LNSDEFVTVGIHE 35 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l------------------------~~----------------------l~~~~~V~VGi~~ 35 (156) |+|.+++.+..++.|.+.+ +. -.+.+.|+|||.+ T Consensus 1 msvevkGv~eilk~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~G~r~V~i~W~g 80 (133) T protein:vir:78 1 MSVEVTGVEELERQLVSLFGRENLPQLVDPALIAGATLVAKTLKSEFVQFKDTGASIDEINIEKPSYDKGVRSIKIDWKG 80 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHHhhcchhcccceeeeEEecCeeeeCCceEEEEEEec Confidence 8888888887776665422 11 0112345666654 Q ss_pred CC-CCCCCCCCCHHHHHHHHhcCCC----CCCCchh--hhHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_019457. 36 AD-NARPEGVLTNAQLGAIQHFGND----RIPARPW--LDVGVASVNDEILDTIAASLEDGE 90 (156) Q Consensus 36 ~~-~~~~~~g~~~A~ia~~~E~G~~----~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~~~ 90 (156) .. .| .|=-+||||.. -|-||-| ++.+++..+..+.+.++.-+..-. T Consensus 81 p~~R~---------~iVHLNE~GYtr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~el~k~l 133 (133) T protein:vir:78 81 PKDRY---------KIIHLNEYGYTRNGKKITPAGTGSVARSLRISERAYRAIVQKKIGDKL 133 (133) T ss_pred CCCce---------eEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHHHhhC Confidence 22 22 24468999964 2778888 777888777777666655554444 No 170 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=63.66 E-value=0.028 Score=29.06 Aligned_cols=67 Identities=18% Similarity=0.172 Sum_probs=30.4 Q ss_pred hhH-HHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHH-HHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhce Q lcl|NC_019457. 67 LDV-GVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAG-VQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVT 144 (156) Q Consensus 67 lr~-~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~-i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIt 144 (156) |-. =|+.+..++.+.+.+.+ .+++..++...+.. |+.... ...| +|||.|++||+ T Consensus 1 ~~~~~f~~~~~~~~~~~~k~~------~~~~~~~a~~~~~~~ie~~ak----------------~~~p-vdtG~L~~SI~ 57 (141) T protein:vir:78 1 MNEFEFDSNIPKARKLIEKKV------LQALEDIGEHMTTELAEGGHG----------------VTSN-NDTGEYAQKSG 57 (141) T ss_pred CcchhHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHhhh----------------hccc-cccchhhccee Confidence 211 23444455544444332 22233333322221 111111 1123 79999999999 Q ss_pred eeeccc--ccccCC Q lcl|NC_019457. 145 YNIQTG--RPSEGL 156 (156) Q Consensus 145 y~V~~~--k~~~g~ 156 (156) |+|..+ ++.+|- T Consensus 58 ~~v~~~g~~~~V~~ 71 (141) T protein:vir:78 58 YKVRKSSKEVIVGN 71 (141) T ss_pred eeeecCCcEEEEec Confidence 998533 333333 No 171 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=61.73 E-value=0.058 Score=27.36 Aligned_cols=78 Identities=8% Similarity=0.080 Sum_probs=39.5 Q ss_pred CeeeeecCcHHH----HHHHHHHHHH------------------HhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCC Q lcl|NC_019457. 1 MIKITVPNFDAV----RDELTKALNK------------------LNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGN 58 (156) Q Consensus 1 m~~v~~~~~~~~----~~~l~~~l~~------------------l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~ 58 (156) |.++.-+..... .+.|++++.- ....+.|+|||. +..| .|=-+||||. T Consensus 26 v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s~~~~~~G~r~V~VgW~-GpR~---------~ivHLNE~Gy 95 (132) T protein:vir:96 26 VNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGFT-TPRW---------NIVHLQELEY 95 (132) T ss_pred HHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeecCeeecCCceEEEeccc-CCce---------eEEeeecccc Confidence 322222221111 1222222210 011345677775 3343 2457899998 Q ss_pred CC-CCCchh--hhHHHHHHHHHH----HHHHHHHHhc Q lcl|NC_019457. 59 DR-IPARPW--LDVGVASVNDEI----LDTIAASLED 88 (156) Q Consensus 59 ~~-IP~RpF--lr~~~~~~~~~~----~~~~~~~~~~ 88 (156) .. |-||-| ++.+++..+..+ ...|++.|.| T Consensus 96 Gk~~~PrG~G~I~~a~~~se~~~~~~~~~elkk~l~~ 132 (132) T protein:vir:96 96 GWKHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFDG 132 (132) T ss_pred cCCcCCCcchHHHHHHHhhhhHHHHHHHHHHHHHhcC Confidence 64 667766 888888877544 4555666666 No 172 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=58.48 E-value=0.052 Score=27.60 Aligned_cols=68 Identities=19% Similarity=0.270 Sum_probs=36.4 Q ss_pred hhHH-HH-HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhce Q lcl|NC_019457. 67 LDVG-VA-SVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVT 144 (156) Q Consensus 67 lr~~-~~-~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIt 144 (156) |-.. +. +-.+++.+.|++.- ...+.++++...|..++...|..- | +|||.|++||+ T Consensus 1 Ma~~~i~~~Gld~L~~~L~~~~-~~~~v~~vv~~~~~~l~~~ak~~a---------p------------~dTG~lrrSI~ 58 (92) T protein:vir:99 1 MADYSISWDGLDALDEALANQQ-NMNTVKKVVKKHTANLMTATQQAV---------P------------VDTGHLKQSAQ 58 (92) T ss_pred CCceeeEeehHHHHHHHHHhhc-cHHHHHHHHHHHHHHHHHHHHHhC---------C------------CCccccceeee Confidence 2110 00 01234444443321 124466777777777776666532 1 68999999999 Q ss_pred eeecccc----cccC-C Q lcl|NC_019457. 145 YNIQTGR----PSEG-L 156 (156) Q Consensus 145 y~V~~~k----~~~g-~ 156 (156) ..+.++. +..+ = T Consensus 59 ~~~~~~g~~~~v~~~gp 75 (92) T protein:vir:99 59 IQISRDGFTGSVTYGGG 75 (92) T ss_pred EEeecCCeeEEEEeccC Confidence 8887653 2221 1 No 173 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=55.11 E-value=0.38 Score=22.91 Aligned_cols=73 Identities=8% Similarity=0.017 Sum_probs=36.9 Q ss_pred Ceeee----------ecCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHH Q lcl|NC_019457. 1 MIKIT----------VPNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVG 70 (156) Q Consensus 1 m~~v~----------~~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~ 70 (156) |+++. -+.+...+..+...+..+.....+ ++ .+++-+|...|||+.+-+|+.|.|.+ T Consensus 49 ~vs~~~~~~~~~~~~d~~g~~t~~~~~~~i~~~~~g~~i---yi----------~Nn~pYA~~LEyG~S~QAP~g~v~~~ 115 (131) T protein:vir:94 49 MASGSTPADGTTDATDKSGNTATGNATSFVLNAADWHTF---TL----------TNNLPYAQRLEYGWSQQAPQGFVRVN 115 (131) T ss_pred hhccccccccccCCCCCCchhhHHHHHHHHhhccccceE---EE----------eeCchhhhhhhccccCCCcchHHHHH Confidence 22221 001111112222222222112222 11 23556899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_019457. 71 VASVNDEILDTIAASL 86 (156) Q Consensus 71 ~~~~~~~~~~~~~~~~ 86 (156) +.+...-+.+..+++= T Consensus 116 ~~~~~~~v~~~~~e~k 131 (131) T protein:vir:94 116 VSRFQQLLNEEASKVK 131 (131) T ss_pred HHHHHHHHHHHHHhcC Confidence 8876654444333222 No 174 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=51.85 E-value=0.1 Score=26.06 Aligned_cols=63 Identities=17% Similarity=0.160 Sum_probs=26.3 Q ss_pred hhHHHH--HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHcCCCCCCcHHHHHhcCCCCchhHHHHHHhhce Q lcl|NC_019457. 67 LDVGVA--SVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNYIDELRSPANAPSTVERKGADNPLVDTGEMKQSVT 144 (156) Q Consensus 67 lr~~~~--~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppns~~Ti~~KG~~~PLidTG~L~~SIt 144 (156) |-.++. -+...+.+.+... .+..|+.++..+++..|. +-.+|||.|++||+ T Consensus 1 m~~s~~i~i~~~~l~~~v~~~------~k~~l~~~a~~i~~~ak~---------------------~aPv~tG~Lr~SI~ 53 (137) T protein:vir:10 1 MPVTARIHINEPELERQTGAI------FRGKHRSITRRIATQARA---------------------DVPVRTGNLGRGIQ 53 (137) T ss_pred CCeeEEEeeCHHHHHHHHHHH------HHHHHHHHHHHHHHHHHH---------------------hCCcccchhhcCce Confidence 111111 0111111111111 233344443333333221 11368999999999 Q ss_pred eeeccccc---ccCC Q lcl|NC_019457. 145 YNIQTGRP---SEGL 156 (156) Q Consensus 145 y~V~~~k~---~~g~ 156 (156) +++.++.- +..+ T Consensus 54 ~~~~~~~~~~~~~~v 68 (137) T protein:vir:10 54 EMPQTYRPFHVGGGV 68 (137) T ss_pred eeeeccccceEEEEE Confidence 98865532 2222 No 175 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=46.12 E-value=0.33 Score=23.20 Aligned_cols=88 Identities=13% Similarity=0.203 Sum_probs=42.5 Q ss_pred hhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHH-HHHHHHHHcCCCCCC----cHHHHHhcCCCCchhHHHHHHh Q lcl|NC_019457. 67 LDVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVAV-AGVQNYIDELRSPAN----APSTVERKGADNPLVDTGEMKQ 141 (156) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~-~~i~~~I~~~~~ppn----s~~Ti~~KG~~~PLidTG~L~~ 141 (156) |-.-++..-++|.+.+.+...-...........|..+- ..++..-..... .+ ++-....|-. .-=..+|+|++ T Consensus 1 mm~~~~~~l~~~l~~v~k~~~~~~~~k~kiTkAGAkv~~e~L~~~Tp~~h~-~~~k~~~~~~~~~k~~-~~~~~~~HlaD 78 (159) T protein:vir:38 1 MANDMGEFYNNWVNEVEKGMKLSVEDKAKITGEGAEAFSTVLHDHTPRSNE-IYRRGRSAGHANAKHH-NRNRKTKHLQD 78 (159) T ss_pred CcchHHHHHHHHHHHHHHhcCCCHHHHHHHHHHhHHHHHHHHHHhcccCCC-cccccccccccccccc-CcCcCCCcccc Confidence 55557767777777776633333333334444444332 222222211111 11 1111111100 01234679999 Q ss_pred hceee-------ecccccccCC Q lcl|NC_019457. 142 SVTYN-------IQTGRPSEGL 156 (156) Q Consensus 142 SIty~-------V~~~k~~~g~ 156 (156) ||+|. +.+|-+.+|. T Consensus 79 ~I~~~~~~~iDg~~dG~s~VGw 100 (159) T protein:vir:38 79 SITYKPGYTADKLHTGDTDVGF 100 (159) T ss_pred ceeeecCccccccccceeeecc Confidence 99985 3578888998 No 176 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=45.76 E-value=0.5 Score=22.25 Aligned_cols=76 Identities=11% Similarity=0.084 Sum_probs=37.3 Q ss_pred Ceeee------ec----CcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHH Q lcl|NC_019457. 1 MIKIT------VP----NFDAVRDELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVG 70 (156) Q Consensus 1 m~~v~------~~----~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~ 70 (156) ++++. .. ++......+.+....+.+.+.=.+=++ .+++-+|...|||+..-+|..|.|.+ T Consensus 57 ~vs~~~~~~~~~~~~d~~G~~t~~~~~~~~~~i~~~k~g~~iyi----------~Nn~pYA~~LEyG~S~QAP~G~v~~~ 126 (145) T protein:vir:10 57 QISANSPAQQSLNEYDQTGGQTKTYLARQARAVANSKATSVIYI----------TNRLDYAADLEYGASNQAPAGVLGVV 126 (145) T ss_pred ceeecccccccccccCCCCccchhhHHHHHHHhhcccccceEEE----------eeCchhhhHhhccccCCCcchHHHHH Confidence 33221 11 111111222222222211110011111 14556789999999999999999999 Q ss_pred HHHHHHHH---HHHHHHHH Q lcl|NC_019457. 71 VASVNDEI---LDTIAASL 86 (156) Q Consensus 71 ~~~~~~~~---~~~~~~~~ 86 (156) +.+...-+ .+.+++++ T Consensus 127 ~~~~~~~v~~~~~e~k~~~ 145 (145) T protein:vir:10 127 QARLGRYFQEAVEEARRAI 145 (145) T ss_pred HHHHHHHHHHHHHHhhccC Confidence 98876333 33444445 No 177 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=44.69 E-value=0.48 Score=22.32 Aligned_cols=79 Identities=13% Similarity=0.101 Sum_probs=36.9 Q ss_pred Ceeeee----------cCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHH Q lcl|NC_019457. 1 MIKITV----------PNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVG 70 (156) Q Consensus 1 m~~v~~----------~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~ 70 (156) ++++.. ++.....+.....+..+.+.+.-.+-++ .+++-+|.-.|||+..-.|+.|.|.+ T Consensus 54 ~vs~~~~~~~~~~~~d~~G~~t~~~~~~~~~~i~~~~~g~~iyi----------~Nn~pYA~~LEyG~S~QAP~G~v~~a 123 (142) T protein:vir:10 54 QATGNSPAAQSLNNYDPDGNETRNSLRRQIYALARDANTNVIYI----------SNRLDYAQGLEFGSSNQAPSGVLGVV 123 (142) T ss_pred eeeecCcccccccCcCCCCccchhhHHHHHHHhhhccccceEEE----------eeCcchhhhhhccccCCCcchHHHHH Confidence 222211 1111112222222222221111011122 13456789999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhccc Q lcl|NC_019457. 71 VASVNDEILDTIAASLEDGE 90 (156) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~ 90 (156) +.+...-+.+..++ +++.+ T Consensus 124 ~q~~~~~v~~a~~e-~~~~~ 142 (142) T protein:vir:10 124 QKRLGRYFAEAVQE-AKRAL 142 (142) T ss_pred HHHHHHHHHHHHHH-hhccC Confidence 88765443332221 12222 No 178 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=44.20 E-value=0.82 Score=21.07 Aligned_cols=73 Identities=7% Similarity=-0.001 Sum_probs=35.6 Q ss_pred Ceeee------ec----CcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHH Q lcl|NC_019457. 1 MIKIT------VP----NFDAVRDELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVG 70 (156) Q Consensus 1 m~~v~------~~----~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~ 70 (156) ++++. .. .+..-+......+..+.....+ ++ .+++-+|.-.|||+.+-+|+.|.|.+ T Consensus 49 ~vs~~~~~~~~~~~~d~~g~~t~~~~~~~i~~~~~g~~i---yi----------~Nn~pYA~~LEyG~S~QAP~G~v~~~ 115 (131) T protein:vir:78 49 MASGGTPADGTTDATDKAGTTATSNAANFVLNAADWHTF---TL----------TNNLPYAQRLEYGWSQQAPQGFVRVN 115 (131) T ss_pred ceecccccccccCCCCCCchhhHHHHHHHHhhccCCceE---EE----------eeCchhhhHhhccccCCCcchHHHHH Confidence 22221 00 0011111111111111111111 11 14566899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_019457. 71 VASVNDEILDTIAASL 86 (156) Q Consensus 71 ~~~~~~~~~~~~~~~~ 86 (156) +.+...-+.+..+++= T Consensus 116 ~~~~~~~v~~~~~e~k 131 (131) T protein:vir:78 116 VSRFQQLLNEEASKVK 131 (131) T ss_pred HHHHHHHHHHHHHhcC Confidence 8876654444333222 No 179 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=43.59 E-value=0.23 Score=24.03 Aligned_cols=78 Identities=13% Similarity=0.208 Sum_probs=50.0 Q ss_pred eeeeecCcHHHHHHHHHHHHH-------------------------Hh--------------------c---CCcEEEEe Q lcl|NC_019457. 2 IKITVPNFDAVRDELTKALNK-------------------------LN--------------------S---DEFVTVGI 33 (156) Q Consensus 2 ~~v~~~~~~~~~~~l~~~l~~-------------------------l~--------------------~---~~~V~VGi 33 (156) |+|.+++.+..++.|.+.+-. +. . .++|+||| T Consensus 1 msvevkGv~eilk~le~k~G~~~~~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W 80 (133) T protein:vir:93 1 MSVEIKGIPEVLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEW 80 (133) T ss_pred CeEEEecHHHHHHHHHHhhCHhhhHhhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceEEEEEe Confidence 888888888777666544210 00 0 13455666 Q ss_pred ccCC-CCCCCCCCCHHHHHHHHhcCCCC----CCCchh--hhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 34 HEAD-NARPEGVLTNAQLGAIQHFGNDR----IPARPW--LDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 34 ~~~~-~~~~~~g~~~A~ia~~~E~G~~~----IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 88 (156) .+.. .| .|=-+||||... |-||-| ++.+++..+..+.+.++.-|.. T Consensus 81 ~gp~~R~---------~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 81 VGPMNRK---------NIIHLNEHGYTRDGKKYTPRGFGVIAKTLAANERKYREIIKKELAR 133 (133) T ss_pred ecCCCce---------eEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 5322 22 245689999643 678888 8888888888888777766665 No 180 >protein:vir:96774 Length: 152 # NCBI annotation: hypothetical phage protein # Family: family:all:448 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224253;genbank:gi:62362388;genbank:GeneID:3345713 Probab=37.61 E-value=0.86 Score=20.94 Aligned_cols=72 Identities=10% Similarity=0.140 Sum_probs=34.8 Q ss_pred Ceeeee--------cCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchhhhHHHH Q lcl|NC_019457. 1 MIKITV--------PNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPWLDVGVA 72 (156) Q Consensus 1 m~~v~~--------~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpFlr~~~~ 72 (156) ++++.. .+....+..+...+..+....++. + .+++-+|.-.|||+..-+|.-|.|.++. T Consensus 73 ~vS~~~p~~~~~~~~~~~~t~~~~~~~i~~~~~g~~iy---i----------~NnlPYA~~LEyG~S~QAP~G~vr~t~~ 139 (152) T protein:vir:96 73 RVSISKITSFEKGISSQSSIMMDLQSDIAKFKIGETLF---M----------TNPLPYATSIEYGHSSQAPNGVYRPAVR 139 (152) T ss_pred eeeecCCCcccccCCCCCchHHHHHHHHhhccccceEE---E----------eeCchhhhHhhccccCCCCchHHHHHHH Confidence 222111 000011111111111111111111 1 2345578889999999999999999887 Q ss_pred HHHHHHHHHHHHHHhcc Q lcl|NC_019457. 73 SVNDEILDTIAASLEDG 89 (156) Q Consensus 73 ~~~~~~~~~~~~~~~~~ 89 (156) +. .+.+++++++. T Consensus 140 ~~----~~~v~ea~~~~ 152 (152) T protein:vir:96 140 RL----VKFLNTELKAK 152 (152) T ss_pred HH----HHHHHHHhccC Confidence 54 44555555555 No 181 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=37.17 E-value=0.23 Score=24.13 Aligned_cols=90 Identities=8% Similarity=0.106 Sum_probs=43.6 Q ss_pred eeeeec--Cc--------------------HHHHHHHHHHHHHHhcC--------------------------------- Q lcl|NC_019457. 2 IKITVP--NF--------------------DAVRDELTKALNKLNSD--------------------------------- 26 (156) Q Consensus 2 ~~v~~~--~~--------------------~~~~~~l~~~l~~l~~~--------------------------------- 26 (156) |+..++ ++ .+.++.+...|.+...+ T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~t 80 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQG 80 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcccccccc Confidence 332222 11 11111111111111011 Q ss_pred CcEEEEeccCCCCCCCCC-----CCHHHHHHHHhcCCC-----CCCCchhhhHHHHHHHHHHHHHHH--------HHHhc Q lcl|NC_019457. 27 EFVTVGIHEADNARPEGV-----LTNAQLGAIQHFGND-----RIPARPWLDVGVASVNDEILDTIA--------ASLED 88 (156) Q Consensus 27 ~~V~VGi~~~~~~~~~~g-----~~~A~ia~~~E~G~~-----~IP~RpFlr~~~~~~~~~~~~~~~--------~~~~~ 88 (156) ..+.=||-.+..+...++ .+.+.+|-+-|||+. -+|-+.+|+.+.++.+.++.+.++ +.+.| T Consensus 81 G~lr~swk~~~~~k~~~~~~v~v~N~~~YA~~VE~GHR~~~gGfV~G~fml~~s~~~~~~~~~~~~e~~l~~~l~k~~~~ 160 (163) T protein:vir:10 81 GTLQKGWSKSRIEVSGRTYKQKVYNKVYYAPHVEYGHKTVNGGFVPGQFFLHKTVEDTKSDMEKRVRDKYDGFMRKVVLG 160 (163) T ss_pred chhhccceecceeecCCceEEEEEecCCccchhhcceeecCCceeccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 112223333333222222 366788999999975 389999999999988776655443 34444 Q ss_pred ccc Q lcl|NC_019457. 89 GED 91 (156) Q Consensus 89 ~~~ 91 (156) ... T Consensus 161 ~~~ 163 (163) T protein:vir:10 161 NGK 163 (163) T ss_pred CCC Confidence 443 No 182 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=36.84 E-value=0.35 Score=23.10 Aligned_cols=81 Identities=12% Similarity=0.159 Sum_probs=43.6 Q ss_pred CeeeeecCcHHHHHHHHHHHHH----H------------------hcCCcEEEEeccCC-CCCCCCCCCHHHHHHHHhcC Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNK----L------------------NSDEFVTVGIHEAD-NARPEGVLTNAQLGAIQHFG 57 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~----l------------------~~~~~V~VGi~~~~-~~~~~~g~~~A~ia~~~E~G 57 (156) |.++.-+.....-+.+.+.++. . .+.++|+|||-+.. .| .|=-+|||| T Consensus 23 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~rtV~i~W~gp~~R~---------~iVHLNE~G 93 (133) T protein:vir:96 23 LMRITDRALTEAGEVVLEAIRTNLKYFRDTGAEYGEVKLSKPTWENGKRTIRVYWEGEKHRY---------SIVHLNEKG 93 (133) T ss_pred HHHHhhHHHHHHHHHHHHHHHHhhHHHhhccceeeeEEecCceecCCceEEEEEeecCCCce---------eeEeeeccc Confidence 5555444333333333333322 1 12245777776432 23 245789999 Q ss_pred CC-----CCCCchh--hhHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_019457. 58 ND-----RIPARPW--LDVGVASVNDEILDTIAASLEDGE 90 (156) Q Consensus 58 ~~-----~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~~~ 90 (156) .- .|-||-| ++.+++..+..+.+.++.-+..-+ T Consensus 94 ~ytr~Gk~i~PrG~G~I~~al~~se~~y~~~vk~el~kll 133 (133) T protein:vir:96 94 FYAKDGKFIRPKGMGAIDKALRASRDKFFKVYAEEVSKLL 133 (133) T ss_pred ceecCCceeccchhhHHHHHHHhhhHHHHHHHHHHHHHhC Confidence 32 4888888 777788777666655554443322 No 183 >protein:vir:97190 Length: 148 # NCBI annotation: hypothetical protein ORF030 # Family: family:all:448 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294538;genbank:gi:149408259;genbank:GeneID:5237055 Probab=35.85 E-value=1.2 Score=20.14 Aligned_cols=77 Identities=12% Similarity=0.048 Sum_probs=37.8 Q ss_pred Ceeeee-------------------cCcHHHHHHHHHHHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCC Q lcl|NC_019457. 1 MIKITV-------------------PNFDAVRDELTKALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRI 61 (156) Q Consensus 1 m~~v~~-------------------~~~~~~~~~l~~~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~I 61 (156) ++++.. .+....+..+...+..+.....+ ++ .+++-+|.-.|||+..- T Consensus 53 ~vs~~~p~~~~~~~~dp~~~G~~~~~~~~~~i~~~~~vi~~~k~g~~i---yi----------~NnlpYA~~LEyG~S~Q 119 (148) T protein:vir:97 53 IAAIGSAPSSVIDAYSPGEAGSTEAANTQAAIDQAESVIRGYNYGEEI---HI----------TNNLPYIQRLNDGYSAQ 119 (148) T ss_pred heeecccccccccccCCCCCCcccccchhHHHHHHHHHhhccCCCceE---EE----------eecchhhhHhhccccCC Confidence 333211 11112222222222222111222 11 13556899999999999 Q ss_pred CCchhhhHHHHHHHHHHHHHHHHHHhccccH Q lcl|NC_019457. 62 PARPWLDVGVASVNDEILDTIAASLEDGEDI 92 (156) Q Consensus 62 P~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~ 92 (156) .|+.|.|.++.+...-+.+ .+++++.-.. T Consensus 120 AP~G~v~~t~~~~~~~v~~--~~~~~~~~~~ 148 (148) T protein:vir:97 120 APANFVEQAVLEAVQVVQF--GRVVDGDPGS 148 (148) T ss_pred CcchHHHHHHHHHHHHHHh--hhhhcCCCCC Confidence 9999999998765544432 2333332111 No 184 >protein:vir:2688 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075507;genbank:gi:12719436;genbank:GeneID:920156 Probab=31.41 E-value=0.58 Score=21.89 Aligned_cols=79 Identities=15% Similarity=0.195 Sum_probs=47.7 Q ss_pred CeeeeecCcHHHHHHHHHHHHHH----h------------------c--CCcEEEEeccC-CCCCCCCCCCHHHHHHHHh Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKALNKL----N------------------S--DEFVTVGIHEA-DNARPEGVLTNAQLGAIQH 55 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l~~l----~------------------~--~~~V~VGi~~~-~~~~~~~g~~~A~ia~~~E 55 (156) |.++.-+.....-+.+.+.|+.- . + .++|+|||-+. +.| .|=-+|| T Consensus 14 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~~~rtV~i~W~gp~~R~---------~iVHLNE 84 (123) T protein:vir:26 14 MQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPMNRK---------NIIHLNE 84 (123) T ss_pred HHHhhhHHHHHHHHHHHHHHHHhhHHhhhccceeeeEEecCeeeccCCccceEEEEeecCCCce---------eeEeeec Confidence 66665444444333344333321 0 1 14577777643 233 2557899 Q ss_pred cCCCC----CCCchh--hhHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019457. 56 FGNDR----IPARPW--LDVGVASVNDEILDTIAASLED 88 (156) Q Consensus 56 ~G~~~----IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 88 (156) ||... |-||-| ++.+++..+..+.+.++.-|.. T Consensus 85 ~GYtr~Gk~i~PRG~G~i~~a~~~se~~y~~~vk~eL~k 123 (123) T protein:vir:26 85 HGYTRDGKKYTPRGFGVIAKTLAANERKYREIIKKELAR 123 (123) T ss_pred cceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 99643 678888 8888888888887777766665 No 185 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=29.21 E-value=1.7 Score=19.35 Aligned_cols=79 Identities=11% Similarity=0.044 Sum_probs=35.5 Q ss_pred CeeeeecC----------cHHHHHHHHH----HHHHHhcCCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCCCCCchh Q lcl|NC_019457. 1 MIKITVPN----------FDAVRDELTK----ALNKLNSDEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDRIPARPW 66 (156) Q Consensus 1 m~~v~~~~----------~~~~~~~l~~----~l~~l~~~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~IP~RpF 66 (156) ++++..-. ....+..... .+.+......+ ++ .+++-+|...|||+.+-+|+.| T Consensus 55 ~vs~~~~~~~~~~~~dp~g~~t~a~~~~~~~~~~~~~~~~~~i---yi----------~Nn~pYA~~LEyG~S~QAP~G~ 121 (147) T protein:vir:10 55 QITFNEIPNHALNRYDKTGGVVRGEEQAKTYGMFSRGGAITSV---HF----------SNMLIYANALEYGHSQQAPSGV 121 (147) T ss_pred ceeecCccccccCCcCCCccchhhhhhHHHHHHhhhccCcceE---EE----------eeCcchhhhhhccccCCCCchH Confidence 33221110 0000111111 11111111112 11 1345678899999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHhccccHHHHH Q lcl|NC_019457. 67 LDVGVASVNDEILDTIAASLEDGEDISQLL 96 (156) Q Consensus 67 lr~~~~~~~~~~~~~~~~~~~~~~~~~~~l 96 (156) .|.++.+...-+.+.+.++=+ .+.+| T Consensus 122 V~~t~q~~~~~v~~~~~e~k~----~~~~~ 147 (147) T protein:vir:10 122 VGLVALRLRSYMADAIKQARR----QQNAL 147 (147) T ss_pred HHHHHHHHHHHHHHHHHHHHh----hhccC Confidence 999887665444333322211 11111 No 186 >protein:vir:6216 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:10886 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852596;genbank:gi:31415856;genbank:GeneID:1489214 Probab=29.12 E-value=0.53 Score=22.07 Aligned_cols=85 Identities=14% Similarity=0.214 Sum_probs=46.7 Q ss_pred CeeeeecCcHHHHHHHHHHH----HHHh--------c-----CCcEEEEeccCCCCCCCCCCCHHHHHHHHhcCCCC--- Q lcl|NC_019457. 1 MIKITVPNFDAVRDELTKAL----NKLN--------S-----DEFVTVGIHEADNARPEGVLTNAQLGAIQHFGNDR--- 60 (156) Q Consensus 1 m~~v~~~~~~~~~~~l~~~l----~~l~--------~-----~~~V~VGi~~~~~~~~~~g~~~A~ia~~~E~G~~~--- 60 (156) +.+|.-+-.. +.|.++. ++|- . +..++|=+-.+.-.-. =-+-|-+=++-|.|+.+ T Consensus 18 l~kVd~kvs~---e~L~eAA~~f~~KL~P~Ip~Sl~kkk~HlrD~lkVvvk~d~V~V~--Fed~a~yW~f~EnGt~~~~~ 92 (125) T protein:vir:62 18 LLRVNKKVSL---DALDEAAKYFASKLKPKINVSNKNKRTHLRDSLKVVVKDDRVSVE--FKDEAWYWYLVEHGHKKAKG 92 (125) T ss_pred hhhhhhhhhH---HHHHHHHHHHHHhhccccChhhhhhhhhcceeeeEEeeCCeEEEE--Ecchhhhhhhhhcccccccc Confidence 4444422222 2222221 1111 0 1123443333221100 01235566778999875 Q ss_pred ---CCCchhhhHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_019457. 61 ---IPARPWLDVGVASVNDEILDTIAASLEDGE 90 (156) Q Consensus 61 ---IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 90 (156) |-+|-|...||+.+++++.+.+.+-+...+ T Consensus 93 ~g~vkaqhf~~~Tf~~nk~kI~~iM~kki~d~m 125 (125) T protein:vir:62 93 KGRVKGKHFVQNTFDAEGDKIADIMAQKIINRM 125 (125) T ss_pred ccccchhhhhhccHHhhHHHHHHHHHHHHHhhC Confidence 799999999999999999999877666555 No 187 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=24.03 E-value=1.9 Score=19.09 Aligned_cols=90 Identities=10% Similarity=0.084 Sum_probs=39.4 Q ss_pred CeeeeecCcHHHHHH----------------HHHHHHHHhcCCc-----------EEEEeccCCC--CCC-CCC------ Q lcl|NC_019457. 1 MIKITVPNFDAVRDE----------------LTKALNKLNSDEF-----------VTVGIHEADN--ARP-EGV------ 44 (156) Q Consensus 1 m~~v~~~~~~~~~~~----------------l~~~l~~l~~~~~-----------V~VGi~~~~~--~~~-~~g------ 44 (156) |..-.+.++.+-..+ -.+.++++-..-- |.+|-+.... .+| +++ T Consensus 1 ma~~~~~sFa~~i~~~~~~ve~~~~~~~r~~a~~i~~~vv~~sPVDTGr~Ranw~vs~~~~~~~~~~~~dp~G~~t~~~~ 80 (146) T protein:vir:79 1 MADYSIREFHGNVDKWIEQVESGLNDVIQIFGEKVHGALVDIAPVDTGRFKANMQITANKPPLYALNQYDPDGEKIKAEG 80 (146) T ss_pred CCcchhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCcccccccCCCCCCcccHHHH Confidence 333222111111111 1111122211111 2232222110 011 111 Q ss_pred -------------------CCHHHHHHHHhcCCCCCCCchhhhHHHHHHHHHHHHHHHHHH-hccc Q lcl|NC_019457. 45 -------------------LTNAQLGAIQHFGNDRIPARPWLDVGVASVNDEILDTIAASL-EDGE 90 (156) Q Consensus 45 -------------------~~~A~ia~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~-~~~~ 90 (156) .+++-+|.-.|||+.+-.|+.|.|.++.+...-+.+...++= +.++ T Consensus 81 ~~~i~~~~~g~~~~~~iyi~NnlpYA~~LEyG~S~QAP~G~v~~~~~~~~~~v~~a~~e~k~~~~l 146 (146) T protein:vir:79 81 RRTLYALLHGGGAIKSIYFSNMLIYANALEYGHSKQAPAGVFGIVAIRLRSYMAEAIREARKKNAL 146 (146) T ss_pred HHHHHHHHhcccccceeEEeeCchhhhhhhccccCCCcchHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 145567888999999999999999999876654444332221 1122 No 188 >protein:vir:4460 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700383;genbank:gi:23505455;genbank:GeneID:955662 Probab=23.28 E-value=0.26 Score=23.77 Aligned_cols=78 Identities=21% Similarity=0.417 Sum_probs=44.4 Q ss_pred CCCchhhhHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHH-HHcCCCCCCcHHHHHhcCCCCchhHHHHH Q lcl|NC_019457. 61 IPARPWLDVGVASVNDEILDTIAASLEDGEDISQLLNRVGVVAVAGVQNY-IDELRSPANAPSTVERKGADNPLVDTGEM 139 (156) Q Consensus 61 IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~-I~~~~~ppns~~Ti~~KG~~~PLidTG~L 139 (156) ++.-+||---|+. .+...-+..-...++..||+.-..+-+.- +..|.. +-..+|-..||.| T Consensus 1 M~~~~~lHvdF~q--------p~~~~Fnr~r~RraF~~iGq~h~r~Arrlvm~RGrs----------~pGe~P~~~TGrL 62 (170) T protein:vir:44 1 MPQKAYLHVDFVQ--------PEELVFNRARMRRAFVKIGQVHMRDARRLVMKRGRS----------KPGENPSYRTGQL 62 (170) T ss_pred CCCCceeEEeeec--------CCceeecHHHHHHHHHHHhHHHHHHHHHHHHHhcCC----------CCCCCCcchhhhh Confidence 2222222211111 11111222334667888999888887744 444443 3356899999999 Q ss_pred Hhhceeeeccc-ccccCC Q lcl|NC_019457. 140 KQSVTYNIQTG-RPSEGL 156 (156) Q Consensus 140 ~~SIty~V~~~-k~~~g~ 156 (156) ..||+|.|-.. +---|+ T Consensus 63 a~SIgy~Vpras~~rpG~ 80 (170) T protein:vir:44 63 ARSIGYYVPRASKKRPGL 80 (170) T ss_pred hhhhhhccccccCCCCce Confidence 99999999644 333355 No 189 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=20.39 E-value=0.15 Score=25.14 Aligned_cols=91 Identities=20% Similarity=0.277 Sum_probs=41.7 Q ss_pred CeeeeecCc------------HHHHHHHHHHHHHHhcCCcEEEEec-------cCC-------CCCC------------- Q lcl|NC_019457. 1 MIKITVPNF------------DAVRDELTKALNKLNSDEFVTVGIH-------EAD-------NARP------------- 41 (156) Q Consensus 1 m~~v~~~~~------------~~~~~~l~~~l~~l~~~~~V~VGi~-------~~~-------~~~~------------- 41 (156) .-.|++++. ..+.+.|++.+++.. .|+++ .++ .|++ T Consensus 6 ~~~vrV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa-----~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:62 6 AYTIRVDGLREFQRNVRTLRDKELNKAVREANKASG-----EVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTASA 80 (143) T ss_pred chheehHHHHHHHHHHHHhhCCchhHHHHHHHHHHH-----HHHHHHHHhhcCCcccccccccccCcchhhccccccccc Confidence 111222221 112233333333321 12222 111 0110 Q ss_pred ------CCCCCHHHHHHHHhcCCC--CCCCchhhhHHHHHHHHHHHHHHHHHHhccccHHHHHHH Q lcl|NC_019457. 42 ------EGVLTNAQLGAIQHFGND--RIPARPWLDVGVASVNDEILDTIAASLEDGEDISQLLNR 98 (156) Q Consensus 42 ------~~g~~~A~ia~~~E~G~~--~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 98 (156) -|+-.-.-+|.+-+||+. +|-++-|+..++...+++|.+..+.-+..- .++.|+. T Consensus 81 raa~VrAG~~krVPYA~~I~~G~r~r~Isp~rFl~~a~a~te~~~~r~Ye~~i~~v--l~k~l~s 143 (143) T protein:vir:62 81 KGAVIKAGSASRVPYAAAIHFGYRARNISPNRFLFRAMARKSDVVAATYERRIAAV--VEKYLES 143 (143) T ss_pred cceeeeeCCcCCCCcccccccCcccccccchhhhhhhhhccCHHHHHHHHHHHHHH--HHHHhcC Confidence 011122345778889987 688999999999999998876544333211 1112221 Done!