Query lcl|NC_019544.1_cdsid_YP_007010928.1 [gene=F494_gp07] [protein=hypothetical protein] [protein_id=YP_007010928.1] [location=6809..7315] Match_columns 168 No_of_seqs 105 out of 193 Neff 6.6 Searched_HMMs 1612 Date Thu Nov 7 16:22:44 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_7 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_7_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:99546 Length: 200 100.0 3.7E-60 2.3E-63 346.4 17.3 168 1-168 7-200 (200) 2 protein:vir:80037 Length: 199 100.0 5E-59 3.1E-62 340.2 15.8 167 1-168 1-197 (199) 3 protein:vir:96105 Length: 193 100.0 2.9E-57 1.8E-60 330.5 16.7 167 1-168 1-193 (193) 4 protein:vir:107757 Length: 189 100.0 9.5E-53 5.9E-56 305.8 16.4 145 1-168 1-185 (189) 5 protein:vir:5257 Length: 148 # 100.0 4.6E-52 2.9E-55 302.0 16.3 139 1-168 1-148 (148) 6 protein:vir:78607 Length: 155 100.0 1.5E-47 9.2E-51 277.3 14.3 132 1-168 1-154 (155) 7 protein:vir:106728 Length: 155 100.0 3.7E-47 2.3E-50 275.1 14.3 132 1-168 1-154 (155) 8 protein:vir:94069 Length: 168 100.0 4.9E-46 3E-49 269.0 14.0 136 1-168 1-157 (168) 9 protein:vir:77650 Length: 155 100.0 9.4E-46 5.8E-49 267.4 13.9 132 1-168 1-154 (155) 10 protein:vir:101563 Length: 155 100.0 1.5E-45 9.1E-49 266.4 14.2 132 1-168 1-154 (155) 11 protein:vir:95260 Length: 160 100.0 1.3E-43 8.4E-47 255.6 15.0 139 2-168 1-153 (160) 12 protein:vir:3163 Length: 145 # 98.8 2.1E-11 1.3E-14 79.1 6.7 75 87-168 1-81 (145) 13 protein:vir:99833 Length: 190 98.6 1.1E-10 6.7E-14 75.2 5.2 83 68-168 1-91 (190) 14 protein:vir:79225 Length: 155 98.5 1.1E-10 7E-14 75.1 3.8 84 47-168 1-91 (155) 15 protein:vir:79091 Length: 175 98.5 1.4E-10 8.6E-14 74.6 4.3 84 69-168 1-108 (175) 16 protein:vir:1988 Length: 156 # 98.5 6.9E-10 4.3E-13 70.8 8.0 83 1-168 1-95 (156) 17 protein:vir:103841 Length: 155 98.5 1.7E-10 1E-13 74.2 3.9 84 47-168 1-91 (155) 18 protein:vir:99196 Length: 155 98.4 3.7E-10 2.3E-13 72.3 4.0 84 69-168 1-91 (155) 19 protein:vir:107851 Length: 175 98.3 1.2E-09 7.2E-13 69.6 3.9 84 69-168 1-108 (175) 20 protein:vir:93617 Length: 148 98.1 3.2E-09 2E-12 67.1 2.5 111 1-118 2-148 (148) 21 protein:vir:102085 Length: 146 98.1 1.7E-08 1.1E-11 63.1 5.9 107 1-116 5-146 (146) 22 protein:vir:102875 Length: 146 98.1 1.7E-08 1.1E-11 63.1 5.9 107 1-116 5-146 (146) 23 protein:vir:105007 Length: 146 98.1 1.7E-08 1.1E-11 63.1 5.9 107 1-116 5-146 (146) 24 protein:vir:107568 Length: 146 98.1 1.7E-08 1.1E-11 63.1 5.9 107 1-116 5-146 (146) 25 protein:vir:97088 Length: 157 98.0 7.6E-09 4.7E-12 65.1 3.6 105 1-110 1-157 (157) 26 protein:vir:95789 Length: 114 98.0 1.8E-08 1.1E-11 63.0 5.5 96 1-107 1-114 (114) 27 protein:vir:99833 Length: 190 98.0 2.5E-08 1.5E-11 62.3 6.1 99 1-110 71-190 (190) 28 protein:vir:1386 Length: 149 # 98.0 3E-08 1.9E-11 61.8 6.1 112 1-121 1-149 (149) 29 protein:vir:3873 Length: 128 # 98.0 1.8E-08 1.1E-11 63.1 4.6 101 1-107 1-128 (128) 30 protein:vir:4347 Length: 164 # 98.0 1.8E-08 1.1E-11 63.0 4.3 120 1-126 1-164 (164) 31 protein:vir:98557 Length: 149 98.0 2.3E-08 1.4E-11 62.4 4.8 84 1-104 54-149 (149) 32 protein:vir:94538 Length: 125 97.9 4.5E-08 2.8E-11 60.9 6.1 99 1-109 5-125 (125) 33 protein:vir:194 Length: 149 # 97.9 1.5E-08 9.4E-12 63.4 2.9 111 1-118 2-149 (149) 34 protein:vir:3617 Length: 112 # 97.9 5.4E-08 3.4E-11 60.4 5.6 102 1-103 1-112 (112) 35 protein:vir:1891 Length: 179 # 97.8 1.6E-08 9.7E-12 63.4 2.1 120 1-126 1-179 (179) 36 protein:vir:1988 Length: 156 # 97.8 7.1E-08 4.4E-11 59.8 5.5 80 1-108 76-156 (156) 37 protein:vir:1838 Length: 149 # 97.8 6E-08 3.7E-11 60.2 4.8 84 1-104 63-149 (149) 38 protein:vir:1437 Length: 140 # 97.8 1.3E-07 8.1E-11 58.3 6.6 107 1-110 1-140 (140) 39 protein:vir:100075 Length: 140 97.8 1.5E-07 9.3E-11 58.0 6.5 107 1-110 1-140 (140) 40 protein:vir:106570 Length: 182 97.8 4.7E-08 2.9E-11 60.8 3.5 112 1-112 2-182 (182) 41 protein:vir:100312 Length: 152 97.7 6.6E-08 4.1E-11 60.0 4.0 86 1-105 64-152 (152) 42 protein:vir:80362 Length: 140 97.7 2.4E-07 1.5E-10 56.8 6.6 107 1-110 1-140 (140) 43 protein:vir:1273 Length: 127 # 97.6 1.2E-07 7.4E-11 58.5 4.0 78 1-107 42-127 (127) 44 protein:vir:6071 Length: 150 # 97.6 8.6E-08 5.4E-11 59.3 3.2 85 1-104 63-150 (150) 45 protein:vir:101594 Length: 173 97.6 3.2E-07 2E-10 56.2 6.2 104 3-109 1-173 (173) 46 protein:vir:5978 Length: 144 # 97.6 2.8E-07 1.7E-10 56.5 5.8 99 1-103 4-144 (144) 47 protein:vir:5703 Length: 150 # 97.6 1E-07 6.4E-11 58.9 3.4 85 1-104 63-150 (150) 48 protein:vir:5745 Length: 135 # 97.6 3.6E-07 2.2E-10 55.9 6.2 110 1-121 1-135 (135) 49 protein:vir:2026 Length: 150 # 97.5 1.8E-07 1.1E-10 57.5 4.0 85 1-104 63-150 (150) 50 protein:vir:98557 Length: 149 97.5 7.2E-07 4.5E-10 54.3 7.2 79 87-168 1-87 (149) 51 protein:vir:2740 Length: 114 # 97.5 2.5E-07 1.5E-10 56.8 4.4 99 1-104 1-114 (114) 52 protein:vir:4906 Length: 114 # 97.5 2.5E-07 1.5E-10 56.8 4.4 99 1-104 1-114 (114) 53 protein:vir:100243 Length: 140 97.5 4.5E-07 2.8E-10 55.4 5.6 82 1-110 43-140 (140) 54 protein:vir:103841 Length: 155 97.4 3.4E-07 2.1E-10 56.0 4.5 83 1-110 71-155 (155) 55 protein:vir:79091 Length: 175 97.4 5.4E-07 3.3E-10 55.0 5.4 82 1-109 1-175 (175) 56 protein:vir:79179 Length: 155 97.4 1.6E-07 1E-10 57.8 2.6 84 1-104 55-155 (155) 57 protein:vir:81106 Length: 125 97.4 1.7E-07 1.1E-10 57.7 2.6 78 1-107 17-125 (125) 58 protein:vir:79988 Length: 125 97.4 1.7E-07 1.1E-10 57.7 2.6 78 1-107 17-125 (125) 59 protein:vir:4704 Length: 125 # 97.4 1.7E-07 1.1E-10 57.7 2.6 78 1-107 17-125 (125) 60 protein:vir:98342 Length: 125 97.4 1.7E-07 1.1E-10 57.7 2.6 78 1-107 17-125 (125) 61 protein:vir:9414 Length: 125 # 97.4 1.7E-07 1.1E-10 57.7 2.6 78 1-107 17-125 (125) 62 protein:vir:3163 Length: 145 # 97.4 4.3E-07 2.6E-10 55.5 4.5 82 1-113 52-145 (145) 63 protein:vir:105089 Length: 133 97.3 4.6E-07 2.9E-10 55.3 4.0 82 1-109 19-133 (133) 64 protein:vir:79115 Length: 148 97.3 3.7E-07 2.3E-10 55.9 3.0 84 1-108 54-148 (148) 65 protein:vir:9930 Length: 108 # 97.3 5.9E-07 3.7E-10 54.7 4.1 71 8-104 1-108 (108) 66 protein:vir:103917 Length: 115 97.2 8E-07 4.9E-10 54.0 3.9 71 1-103 16-115 (115) 67 protein:vir:96358 Length: 115 97.2 8E-07 4.9E-10 54.0 3.9 71 1-103 16-115 (115) 68 protein:vir:96225 Length: 115 97.2 8E-07 4.9E-10 54.0 3.9 71 1-103 16-115 (115) 69 protein:vir:78858 Length: 115 97.2 8E-07 4.9E-10 54.0 3.9 71 1-103 16-115 (115) 70 protein:vir:9312 Length: 115 # 97.2 8E-07 4.9E-10 54.0 3.9 71 1-103 16-115 (115) 71 protein:vir:97144 Length: 115 97.2 8E-07 4.9E-10 54.0 3.9 71 1-103 16-115 (115) 72 protein:vir:106623 Length: 115 97.1 1E-06 6.5E-10 53.4 4.1 78 1-103 16-115 (115) 73 protein:vir:9708 Length: 125 # 97.1 6E-07 3.7E-10 54.7 2.8 79 1-108 38-125 (125) 74 protein:vir:99744 Length: 115 97.1 9E-07 5.6E-10 53.7 3.0 78 1-103 16-115 (115) 75 protein:vir:2026 Length: 150 # 97.1 4.3E-06 2.7E-09 50.0 6.7 79 87-168 1-87 (150) 76 protein:vir:107851 Length: 175 97.0 2E-06 1.3E-09 51.8 4.4 84 1-109 77-175 (175) 77 protein:vir:1164 Length: 156 # 97.0 1.2E-06 7.5E-10 53.0 2.9 88 1-113 64-156 (156) 78 protein:vir:107099 Length: 137 97.0 1.9E-06 1.2E-09 51.9 4.0 91 1-99 1-137 (137) 79 protein:vir:6071 Length: 150 # 97.0 5.4E-06 3.3E-09 49.5 6.4 79 87-168 1-87 (150) 80 protein:vir:93738 Length: 137 96.9 1.3E-06 8.2E-10 52.8 3.0 91 1-99 1-137 (137) 81 protein:vir:97427 Length: 137 96.9 1.3E-06 8.2E-10 52.8 3.0 91 1-99 1-137 (137) 82 protein:vir:94490 Length: 137 96.9 1.3E-06 8.2E-10 52.8 3.0 91 1-99 1-137 (137) 83 protein:vir:94654 Length: 142 96.9 9.4E-07 5.8E-10 53.6 2.1 100 1-102 4-142 (142) 84 protein:vir:99196 Length: 155 96.9 1.6E-06 9.7E-10 52.4 3.2 78 1-109 71-155 (155) 85 protein:vir:96486 Length: 112 96.9 2.1E-06 1.3E-09 51.7 3.5 70 1-102 19-112 (112) 86 protein:vir:78077 Length: 141 96.9 3.3E-06 2E-09 50.6 4.5 92 1-111 32-141 (141) 87 protein:vir:5703 Length: 150 # 96.9 7.9E-06 4.9E-09 48.5 6.6 79 87-168 1-87 (150) 88 protein:vir:98409 Length: 108 96.8 1.5E-06 9.6E-10 52.4 2.3 75 3-103 1-108 (108) 89 protein:vir:743 Length: 108 # 96.7 4.1E-06 2.6E-09 50.1 4.2 71 1-103 16-108 (108) 90 protein:vir:106041 Length: 137 96.7 1.7E-06 1.1E-09 52.2 1.9 95 1-111 1-137 (137) 91 protein:vir:95894 Length: 137 96.6 3.4E-06 2.1E-09 50.5 3.2 92 1-99 1-137 (137) 92 protein:vir:96121 Length: 137 96.4 2.7E-06 1.7E-09 51.1 1.1 93 1-99 1-137 (137) 93 protein:vir:94796 Length: 137 96.4 3.3E-06 2.1E-09 50.6 1.5 96 1-99 18-137 (137) 94 protein:vir:105330 Length: 137 96.3 2.9E-06 1.8E-09 51.0 0.7 91 1-99 1-137 (137) 95 protein:vir:99101 Length: 142 96.2 6.6E-06 4.1E-09 49.0 2.4 93 1-100 22-142 (142) 96 protein:vir:8669 Length: 142 # 96.2 6.6E-06 4.1E-09 49.0 2.4 93 1-100 22-142 (142) 97 protein:vir:79225 Length: 155 96.2 1.2E-05 7.2E-09 47.7 3.8 78 1-109 71-155 (155) 98 protein:vir:96829 Length: 135 96.1 5.2E-06 3.2E-09 49.6 1.2 93 1-99 1-135 (135) 99 protein:vir:81067 Length: 119 96.1 2.1E-05 1.3E-08 46.2 4.5 99 1-110 5-119 (119) 100 protein:vir:10367 Length: 119 96.1 2.2E-05 1.4E-08 46.1 4.6 101 1-110 5-119 (119) 101 protein:vir:95062 Length: 116 95.7 2.5E-05 1.5E-08 45.8 3.4 92 1-99 1-116 (116) 102 protein:vir:105916 Length: 149 95.6 1.8E-05 1.1E-08 46.6 2.3 96 1-99 30-149 (149) 103 protein:vir:97327 Length: 116 95.6 3.7E-05 2.3E-08 44.9 4.0 92 1-99 1-116 (116) 104 protein:vir:1243 Length: 116 # 95.6 3.7E-05 2.3E-08 44.9 4.0 92 1-99 1-116 (116) 105 protein:vir:107545 Length: 140 95.5 5E-06 3.1E-09 49.7 -1.0 95 1-97 1-140 (140) 106 protein:vir:97982 Length: 140 95.5 5E-06 3.1E-09 49.7 -1.0 95 1-97 1-140 (140) 107 protein:vir:3787 Length: 231 # 95.5 6.4E-05 3.9E-08 43.6 5.1 104 1-111 65-231 (231) 108 protein:vir:79115 Length: 148 95.4 0.00014 8.9E-08 41.6 6.6 79 87-168 1-86 (148) 109 protein:vir:1838 Length: 149 # 95.4 0.00015 9.2E-08 41.6 6.4 79 87-168 1-87 (149) 110 protein:vir:94108 Length: 149 95.2 1.8E-05 1.1E-08 46.6 1.0 96 1-99 30-149 (149) 111 protein:vir:102441 Length: 137 94.7 2.5E-05 1.5E-08 45.8 0.5 96 1-98 1-137 (137) 112 protein:vir:106506 Length: 137 94.3 1.2E-05 7.7E-09 47.5 -2.2 99 1-111 1-137 (137) 113 protein:vir:102154 Length: 119 94.0 4.9E-05 3.1E-08 44.2 0.5 76 1-107 19-119 (119) 114 protein:vir:93738 Length: 137 93.7 0.00014 8.4E-08 41.8 2.3 61 83-168 1-61 (137) 115 protein:vir:94490 Length: 137 93.7 0.00014 8.4E-08 41.8 2.3 61 83-168 1-61 (137) 116 protein:vir:97427 Length: 137 93.7 0.00014 8.4E-08 41.8 2.3 61 83-168 1-61 (137) 117 protein:vir:100887 Length: 139 93.3 0.00019 1.2E-07 41.0 2.6 84 1-114 44-139 (139) 118 protein:vir:100312 Length: 152 93.1 0.0013 8.3E-07 36.3 6.9 80 87-168 1-88 (152) 119 protein:vir:79179 Length: 155 93.1 0.001 6.4E-07 36.9 6.2 80 87-168 1-93 (155) 120 protein:vir:81147 Length: 126 91.2 0.00053 3.3E-07 38.5 2.4 84 1-110 1-126 (126) 121 protein:vir:100223 Length: 139 91.1 0.00048 3E-07 38.8 2.0 85 1-115 44-139 (139) 122 protein:vir:96121 Length: 137 91.0 0.0005 3.1E-07 38.7 2.0 61 83-168 1-61 (137) 123 protein:vir:100652 Length: 134 91.0 0.0012 7.3E-07 36.6 4.1 75 1-105 1-134 (134) 124 protein:vir:5000 Length: 141 # 90.4 0.0013 8.2E-07 36.4 3.8 79 1-113 61-141 (141) 125 protein:vir:1164 Length: 156 # 90.4 0.0043 2.7E-06 33.6 6.6 80 87-168 1-90 (156) 126 protein:vir:95062 Length: 116 89.3 0.0011 6.8E-07 36.8 2.5 40 112-168 1-40 (116) 127 protein:vir:107099 Length: 137 89.1 0.0035 2.2E-06 34.1 5.0 61 83-168 1-61 (137) 128 protein:vir:96829 Length: 135 89.1 0.0036 2.3E-06 34.0 5.1 61 69-168 1-61 (135) 129 protein:vir:97327 Length: 116 89.1 0.00098 6.1E-07 37.1 2.0 40 98-168 1-40 (116) 130 protein:vir:1243 Length: 116 # 89.1 0.00098 6.1E-07 37.1 2.0 40 98-168 1-40 (116) 131 protein:vir:105330 Length: 137 88.9 0.0026 1.6E-06 34.8 4.2 61 83-168 1-61 (137) 132 protein:vir:4956 Length: 153 # 88.1 0.0014 8.9E-07 36.2 2.3 89 1-150 61-153 (153) 133 protein:vir:5978 Length: 144 # 88.1 0.0027 1.6E-06 34.7 3.7 66 78-168 1-66 (144) 134 protein:vir:78755 Length: 228 87.8 0.0034 2.1E-06 34.1 4.1 113 1-118 55-228 (228) 135 protein:vir:3750 Length: 227 # 87.7 0.0028 1.7E-06 34.6 3.6 101 1-110 59-227 (227) 136 protein:vir:106570 Length: 182 87.5 0.0028 1.8E-06 34.5 3.5 66 69-168 1-66 (182) 137 protein:vir:4859 Length: 140 # 87.5 0.0029 1.8E-06 34.5 3.5 78 1-113 61-140 (140) 138 protein:vir:95894 Length: 137 87.1 0.0018 1.1E-06 35.6 2.2 61 83-168 1-61 (137) 139 protein:vir:966 Length: 123 # 86.8 0.0029 1.8E-06 34.5 3.1 93 1-104 1-123 (123) 140 protein:vir:4833 Length: 140 # 86.7 0.0042 2.6E-06 33.6 4.0 76 1-110 61-140 (140) 141 protein:vir:9513 Length: 134 # 86.5 0.0048 3E-06 33.3 4.2 75 1-105 1-134 (134) 142 protein:vir:101302 Length: 134 86.5 0.0048 3E-06 33.3 4.2 75 1-105 1-134 (134) 143 protein:vir:9930 Length: 108 # 85.7 0.0033 2E-06 34.2 2.8 57 84-168 1-57 (108) 144 protein:vir:94654 Length: 142 85.7 0.0035 2.2E-06 34.0 3.0 63 57-168 1-63 (142) 145 protein:vir:98860 Length: 230 85.1 0.004 2.5E-06 33.7 3.0 103 1-114 61-230 (230) 146 protein:vir:94796 Length: 137 85.1 0.0027 1.7E-06 34.7 2.1 61 83-168 1-61 (137) 147 protein:vir:9879 Length: 127 # 84.2 0.004 2.5E-06 33.7 2.6 87 1-104 16-127 (127) 148 protein:vir:105916 Length: 149 83.9 0.021 1.3E-05 29.8 6.4 73 55-168 1-73 (149) 149 protein:vir:94108 Length: 149 81.3 0.03 1.9E-05 28.9 6.2 73 55-168 1-73 (149) 150 protein:vir:78077 Length: 141 81.0 0.015 9.5E-06 30.5 4.5 60 82-168 1-61 (141) 151 protein:vir:3617 Length: 112 # 81.0 0.007 4.3E-06 32.4 2.6 60 83-168 1-61 (112) 152 protein:vir:105467 Length: 144 79.8 0.017 1.1E-05 30.3 4.4 102 1-110 1-144 (144) 153 protein:vir:106506 Length: 137 79.1 0.011 6.7E-06 31.4 3.0 56 76-168 1-56 (137) 154 protein:vir:99744 Length: 115 76.0 0.05 3.1E-05 27.7 5.8 65 75-168 1-65 (115) 155 protein:vir:79034 Length: 141 75.9 0.02 1.2E-05 29.9 3.5 93 1-112 1-141 (141) 156 protein:vir:95789 Length: 114 75.4 0.023 1.5E-05 29.5 3.8 61 83-168 1-61 (114) 157 protein:vir:9312 Length: 115 # 74.3 0.011 6.7E-06 31.4 1.7 65 75-168 1-65 (115) 158 protein:vir:96225 Length: 115 74.3 0.011 6.7E-06 31.4 1.7 65 75-168 1-65 (115) 159 protein:vir:97144 Length: 115 74.3 0.011 6.7E-06 31.4 1.7 65 75-168 1-65 (115) 160 protein:vir:103917 Length: 115 74.3 0.011 6.7E-06 31.4 1.7 65 75-168 1-65 (115) 161 protein:vir:78858 Length: 115 74.3 0.011 6.7E-06 31.4 1.7 65 75-168 1-65 (115) 162 protein:vir:96358 Length: 115 74.3 0.011 6.7E-06 31.4 1.7 65 75-168 1-65 (115) 163 protein:vir:3848 Length: 159 # 72.2 0.031 2E-05 28.8 3.7 84 1-112 62-159 (159) 164 protein:vir:93898 Length: 133 71.7 0.054 3.4E-05 27.5 4.8 78 1-104 1-133 (133) 165 protein:vir:106623 Length: 115 70.0 0.097 6E-05 26.1 5.8 65 75-168 1-65 (115) 166 protein:vir:9647 Length: 132 # 68.4 0.095 5.9E-05 26.2 5.4 79 1-108 1-132 (132) 167 protein:vir:78335 Length: 133 66.2 0.12 7.4E-05 25.6 5.5 80 1-106 1-133 (133) 168 protein:vir:96973 Length: 133 59.0 0.15 9.2E-05 25.1 4.7 78 1-104 1-133 (133) 169 protein:vir:94419 Length: 133 59.0 0.15 9.2E-05 25.1 4.7 78 1-104 1-133 (133) 170 protein:vir:9363 Length: 133 # 59.0 0.15 9.2E-05 25.1 4.7 78 1-104 1-133 (133) 171 protein:vir:78644 Length: 133 59.0 0.15 9.2E-05 25.1 4.7 78 1-104 1-133 (133) 172 protein:vir:99528 Length: 92 # 54.6 0.03 1.9E-05 28.9 0.1 61 78-168 1-62 (92) 173 protein:vir:1332 Length: 143 # 53.3 0.056 3.5E-05 27.4 1.4 89 1-118 21-143 (143) 174 protein:vir:6246 Length: 143 # 51.0 0.057 3.5E-05 27.4 1.0 89 1-118 21-143 (143) 175 protein:vir:98636 Length: 138 48.4 0.2 0.00012 24.4 3.5 78 1-108 32-138 (138) 176 protein:vir:102963 Length: 163 44.2 0.4 0.00025 22.7 4.6 90 1-111 1-163 (163) 177 protein:vir:4096 Length: 140 # 36.6 0.53 0.00033 22.1 3.9 107 1-113 1-140 (140) 178 protein:vir:4460 Length: 170 # 35.3 0.15 9.1E-05 25.2 0.7 70 69-168 1-71 (170) 179 protein:vir:96288 Length: 100 31.3 0.11 6.9E-05 25.8 -0.7 72 90-168 1-73 (100) 180 protein:vir:7412 Length: 168 # 23.1 1.6 0.00097 19.5 4.1 102 1-116 62-168 (168) 181 protein:vir:78380 Length: 131 22.8 1.2 0.00077 20.1 3.5 53 85-168 1-53 (131) 182 protein:vir:94944 Length: 121 21.8 2.3 0.0014 18.6 4.7 56 82-168 1-56 (121) 183 protein:vir:80425 Length: 134 21.3 1.5 0.00091 19.7 3.5 53 85-168 1-53 (134) No 1 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=100.00 E-value=3.7e-60 Score=346.44 Aligned_cols=168 Identities=22% Similarity=0.331 Sum_probs=156.2 Q ss_pred CcceecccchHHHHHHHHHHhhCCeEEEEeec----------CCCchHHHHHhhhhcCceeccCccccccccc------- Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQIKVGLFG----------KDDSELVMIGAVHEYGAEIPVTPKMRAWFAA------- 63 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v~VGi~~----------~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~------- 63 (168) |++++++.+++++++++|++|++++|+|||++ +||+++|+||+|||||++|+++.+.+++... T Consensus 7 ~~~k~~~~~~~~~~~~~l~~l~~~~v~vGi~~~~~y~~~~~~~dG~~va~IA~~~EfG~~i~~p~~~~~~~~~~~~g~~~ 86 (200) T protein:vir:99 7 KSNSVAAPLKHFQMLKQFDALKGKTVQAGWFETDRYPAKEGETIGPLVAKIARQLEFGGVINHPGGTKYIKDAIVDGRYV 86 (200) T ss_pred eeeeeecchHHHHHHHHHHHhhCCeEEEEEcCCCCcCCcccccccchHHHHHhHHHcCCeeccCCCcccccccccccccc Confidence 88888998999999999999999999999985 3689999999999999999987766554322 Q ss_pred ---------cchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019544. 64 ---------NGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL 134 (168) Q Consensus 64 ---------~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~ 134 (168) -++.++++.++++||||||||+|+++++++|.+++++.+.+++.|+.+++++|+.+|..++++||++|+++ T Consensus 87 g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~~G~~~~~~ik~~I~~~ 166 (200) T protein:vir:99 87 GTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQAQIARQLLDGTINPEQALAQIGLALEGCIVRSIKSG 166 (200) T ss_pred ccccccccccceeeeeccccccCCCcchhhHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcC Confidence 23456888999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCChHHHHHhcCCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 135 KDPPNSQMTIERKGSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 135 ~~ppnsp~Ti~~KG~~~PLiDTG~L~~SIty~V~ 168 (168) .+|||||+||++||||+||||||+|++||||+|| T Consensus 167 ~~ppna~sTi~~Kg~~~PLidTG~l~~SIty~Ve 200 (200) T protein:vir:99 167 PWAANSPATIRAKGFDKPLIDTAHMWQTVSSKVS 200 (200) T ss_pred CCCCChHHHHHHhCCCCchHHHHHHHhHhccccC Confidence 9999999999999999999999999999999999 No 2 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=100.00 E-value=5e-59 Score=340.22 Aligned_cols=167 Identities=35% Similarity=0.535 Sum_probs=153.0 Q ss_pred CcceecccchHHHHHHHHHHhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCcccccc----------------c--- Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAW----------------F--- 61 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~----------------~--- 61 (168) |+|+ ++.+.+++++++|++|++++|+||||++||+++++||.+||||++|+++++..+. + T Consensus 1 m~vt-~~~~~~~~~~~~l~~L~~k~v~vGi~~~d~~~~~~Ia~~~E~Ga~I~~~~~~l~Ip~~~a~~~k~~~~~~~~~p~ 79 (199) T protein:vir:80 1 MKVT-TDKSTMNKAIRELDQLDRYSLQIGLFGEDDSFIQMIAGVHEFGLTIRPKGKYLTIPTPEAGDRRARDIPGLFKPK 79 (199) T ss_pred Cccc-ccHHHHHHHHHHHHHhcCCEEEEEEecCCCcchhheeehhhcCCeeecCCceeeecchhhhcccccccCcccccC Confidence 9988 6678899999999999999999999999999999999999999999987653221 0 Q ss_pred ----------cccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 62 ----------AANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKM 131 (168) Q Consensus 62 ----------~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I 131 (168) ...+..++++.++++||+|||||+|+++++++|.+++++++.++++|+.+++++|+++|+.++++||.+| T Consensus 80 g~~~~~~~~~~~~~~~~e~g~~~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~vl~g~~~a~~~L~~~G~~~~~~Ik~~I 159 (199) T protein:vir:80 80 GKNILAVAGPDGKLTVMFYLKTEVNIPERSFLRSTFDEKSNKWGELFEGWIDDVIHGKLSAEQVYNRLGAKIVDDIQMKI 159 (199) T ss_pred CcceeeeeccccceeeeeeccccccCCCCchhHHHHHHHHHHHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHHHHHHH Confidence 0112346788899999999999999999999999999999999999999999999999999999999999 Q ss_pred HhCCCCCChHHHHH-hcCCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 132 RDLKDPPNSQMTIE-RKGSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 132 ~~~~~ppnsp~Ti~-~KG~~~PLiDTG~L~~SIty~V~ 168 (168) .++.+|||||+||+ |||||+||||||+|++||+|+|. T Consensus 160 ~~~~~ppna~~Tia~rKg~~kPLidTG~l~~SIty~V~ 197 (199) T protein:vir:80 160 VEIQTPAKSAATLARNPRKNNPLIVTGKMKNSVTWKVM 197 (199) T ss_pred hccCCCCCCHHHHHHhcCCCCchHHHHHHHhhcceeee Confidence 99999999999997 89999999999999999999999 No 3 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=100.00 E-value=2.9e-57 Score=330.51 Aligned_cols=167 Identities=24% Similarity=0.384 Sum_probs=150.7 Q ss_pred CcceecccchHHHHHHHHHHhhCCeEEEEeecC----------CCchHHHHHhhhhcCceeccCccccccccc------- Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQIKVGLFGK----------DDSELVMIGAVHEYGAEIPVTPKMRAWFAA------- 63 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v~VGi~~~----------~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~------- 63 (168) |+|+ .+.+.+++++++|++|++++|+||||++ +|+++|+||+|||||++|+++...++.... T Consensus 1 m~~~-~~~~~~~~~~~~l~~l~~~~v~vGi~~~~~~~~~~~~~~G~~va~iAai~EfG~~I~~~~~~~~~~~~~~~g~~~ 79 (193) T protein:vir:96 1 MSLR-RDSELIAAHLQMLRAMRGRSVSAGWYSTARYPDKAGGSVGIQVARIARLNEYGGTIDHPGGTRYIRDAIVRGRFV 79 (193) T ss_pred Ceec-cchHHHHHHHHHHHHhcCCeEEEEEcCCCCCCCcccccccchHHHHHhHHHcCCccccCccceeeeecccccccc Confidence 7777 4456799999999999999999999964 388999999999999999877654432111 Q ss_pred ---------cchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019544. 64 ---------NGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL 134 (168) Q Consensus 64 ---------~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~ 134 (168) .+..++++.++++||||||||+|+++++++|.+.+++.+.+++.|+.+++++|+++|..++++||++|+++ T Consensus 80 ~~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~~G~~~~~~ik~~I~~~ 159 (193) T protein:vir:96 80 GVRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQNRIAMRLARGQITPDQALAQIGLALEGYIARSIRTG 159 (193) T ss_pred ccceeccCcceeeEeecceeccCCCcchhhhhHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcC Confidence 13455788899999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCChHHHHHhcCCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 135 KDPPNSQMTIERKGSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 135 ~~ppnsp~Ti~~KG~~~PLiDTG~L~~SIty~V~ 168 (168) .+|||||+||++||||+||||||+|++||||+|= T Consensus 160 ~~ppna~~Ti~~KG~~~PLidTG~l~~SIty~Vv 193 (193) T protein:vir:96 160 PWVANSASTVRRKGFNRPLVDTAHMLQSISSRVT 193 (193) T ss_pred CCCCCcHHHHHHhCCCCchhHHHHHHhhhcceeC Confidence 9999999999999999999999999999999999 No 4 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=100.00 E-value=9.5e-53 Score=305.79 Aligned_cols=145 Identities=23% Similarity=0.400 Sum_probs=137.2 Q ss_pred Ccceeccc-chHHHHHHHHHHhhCCeEEEEeec----CCCchHHHHHhhhhcCceeccCccccccccccchhhhccccee Q lcl|NC_019544. 1 MKVTIKDT-NNIDKITRNLQQLGGKQIKVGLFG----KDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVI 75 (168) Q Consensus 1 M~v~i~~~-~~~~~~~~~l~~l~~~~v~VGi~~----~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i 75 (168) |+++|+.. +.+++|.+.|++|++++|+||||+ +||.++|+||+|||||+ ++. T Consensus 1 M~~~i~~~~~~~~~L~~~lk~l~~k~V~VGi~~~~~y~dG~~vA~Ia~~~E~G~-----------------------p~~ 57 (189) T protein:vir:10 1 MGRVIRKQGPARVKLNAFIKGMNDYSVRIGWFSTAKYPDGTPTAYVASIHEFGA-----------------------PSR 57 (189) T ss_pred CcceeccCcHHHHHHHHHHHHhhCCeEEEEecCCCCCCCcccHHHHHHHHHhcC-----------------------cCC Confidence 99999854 567889999999999999999996 47999999999999995 456 Q ss_pred ccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcC------- Q lcl|NC_019544. 76 KIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKG------- 148 (168) Q Consensus 76 ~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG------- 148 (168) +||||||||+|+++++++|.+++++.+.++++|+.+++++|+.+|+.++++||.+|+++.+|||||+||++|| T Consensus 58 ~IP~RPFlr~t~~~~~~~~~~~l~~~~~~vl~G~~~~~~~L~~~G~~a~~~Ik~~I~~~~~ppna~sTi~~Kg~~~~~~~ 137 (189) T protein:vir:10 58 GIPARSFIRPTIAAQQAAWSQQMRFYAKQIVVGQMNVEQALEGLAIVARGDVDATLARLKDPPLSPLTIYIRKFIKDGGV 137 (189) T ss_pred CCCCchhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHHhcccCcccc Confidence 8999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ----------------------------CCCcchhHHHHHhHhccccC Q lcl|NC_019544. 149 ----------------------------SDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 149 ----------------------------~~~PLiDTG~L~~SIty~V~ 168 (168) |++||||||+|++||||+|. T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~kPLidTG~l~~SIty~V~ 185 (189) T protein:vir:10 138 IHGYKDIMRLRSEMQQEQAKGTLNLSGVSTDPLDFTGYMRATLSYTVT 185 (189) T ss_pred hhhhhhhhhhhhhhhhhhhhccccccccCCCchhhHHHHHhhcceeee Confidence 47999999999999999999 No 5 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=100.00 E-value=4.6e-52 Score=302.03 Aligned_cols=139 Identities=27% Similarity=0.414 Sum_probs=130.1 Q ss_pred Ccceeccc-chHHHHHHHHHHhhCCeEEEEeec--------CCCchHHHHHhhhhcCceeccCccccccccccchhhhcc Q lcl|NC_019544. 1 MKVTIKDT-NNIDKITRNLQQLGGKQIKVGLFG--------KDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKE 71 (168) Q Consensus 1 M~v~i~~~-~~~~~~~~~l~~l~~~~v~VGi~~--------~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~ 71 (168) |+++++.+ .++++++++|++|++++|+||||+ +||.++|+||+|||||. T Consensus 1 M~~~~k~~~~~~~~l~~~l~~l~~~~v~VGi~~~~~~~~~~~~g~~vA~ia~~~E~G~---------------------- 58 (148) T protein:vir:52 1 MAVTVTANFSAAKQLIEQMKSLKEKAVYVGFPAEFDEKVKGSENFNLASLAAVLEFGN---------------------- 58 (148) T ss_pred CccccccccHHHHHHHHHHHHhhCCeEEEEeecCcCCCCCCCCCCCHHHHHHHHhcCC---------------------- Confidence 99999875 479999999999999999999984 36899999999999994 Q ss_pred cceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCC Q lcl|NC_019544. 72 TTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDN 151 (168) Q Consensus 72 ~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~ 151 (168) .+||+|||||+|+++++++|.+++. +++.|+.+++++|+.+|+.++++||++|.++.+|||||+||++||||+ T Consensus 59 ---~~IP~Rpflr~t~~~~~~~~~~~~~----~~~~~~~~~~~~L~~~G~~~~~~ik~~I~~~~~ppna~sTi~~Kg~~~ 131 (148) T protein:vir:52 59 ---EHIPARPFLRQTLEENQEKYTALFI----QWFDQGVPAAQIYERLSVMAQGDVQMNIVKGEWVANAKSTIRRKKSSK 131 (148) T ss_pred ---CCCCCcchhHHHHHHHHHHHHHHHH----HHHHcCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHhcCCCC Confidence 4899999999999999999998765 455679999999999999999999999999999999999999999999 Q ss_pred cchhHHHHHhHhccccC Q lcl|NC_019544. 152 PLIDTGRLVGSIRHTVE 168 (168) Q Consensus 152 PLiDTG~L~~SIty~V~ 168 (168) ||||||+|++||||+|| T Consensus 132 PLidTG~l~~SIty~V~ 148 (148) T protein:vir:52 132 PLIDTGKMRQSVRGIVK 148 (148) T ss_pred chhHHHHHHHHhhhhcC Confidence 99999999999999999 No 6 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=100.00 E-value=1.5e-47 Score=277.32 Aligned_cols=132 Identities=20% Similarity=0.311 Sum_probs=117.5 Q ss_pred CcceecccchHHHHHHHHHHhhCCeEEEEeecC----------------------CCchHHHHHhhhhcCceeccCcccc Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQIKVGLFGK----------------------DDSELVMIGAVHEYGAEIPVTPKMR 58 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v~VGi~~~----------------------~g~~~a~iA~~~E~G~~i~~~~~~~ 58 (168) |++..+. |+++ +++|++++|+||||++ +|.++|+||++|||| T Consensus 1 m~v~~k~---L~~~---~~~l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G---------- 64 (155) T protein:vir:78 1 MSVTRRG---LTLP---KDRYRSMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYG---------- 64 (155) T ss_pred CcchHHH---HHHH---HHHHhCCeeEEeecCCCCCCcccchhhhhhhhcccccccCCcHHHHHHhhhcC---------- Confidence 7776553 5554 4556889999999975 278999999999999 Q ss_pred ccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCC Q lcl|NC_019544. 59 AWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPP 138 (168) Q Consensus 59 ~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~pp 138 (168) +++||||||||+|+++++++|.+.+++.+ .+..+++++|+++|+.++++||++|+++. || T Consensus 65 ---------------~~~IP~RPFlr~t~~~~~~~~~~~l~~~~----~~~~~~~~~L~~~G~~~~~~Ik~~I~~~~-~p 124 (155) T protein:vir:78 65 ---------------TSKLPARPFMEKTITDRSAEWIKGLTVMM----TMGYDAEVAMGQIGQAMKDDIKTTISEWP-AD 124 (155) T ss_pred ---------------CCCCCCcchhhHHHHHHHHHHHHHHHHHH----HcCCCHHHHHHHHHHHHHHHHHHHHhcCC-CC Confidence 36899999999999999999998876555 45889999999999999999999999986 99 Q ss_pred ChHHHHHhcCCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 139 NSQMTIERKGSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 139 nsp~Ti~~KG~~~PLiDTG~L~~SIty~V~ 168 (168) |||+||++||||+||||||+|++||+|+|+ T Consensus 125 na~~Ti~~Kg~~kPLidTG~l~~SIty~V~ 154 (155) T protein:vir:78 125 NSADWAGKKGFNHGLIWTSHLLNSVEQEIV 154 (155) T ss_pred CcHHHHHhcCCCCchhHHHHHHHhhhhhcc Confidence 999999999999999999999999999999 No 7 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=100.00 E-value=3.7e-47 Score=275.13 Aligned_cols=132 Identities=20% Similarity=0.310 Sum_probs=117.4 Q ss_pred CcceecccchHHHHHHHHHHhhCCeEEEEeecC----------------------CCchHHHHHhhhhcCceeccCcccc Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQIKVGLFGK----------------------DDSELVMIGAVHEYGAEIPVTPKMR 58 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v~VGi~~~----------------------~g~~~a~iA~~~E~G~~i~~~~~~~ 58 (168) |++..+. |+++ +++|++++|+||||++ +|.++|+||++|||| T Consensus 1 m~v~~k~---L~~~---~~~l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G---------- 64 (155) T protein:vir:10 1 MSVTRRG---LTLP---KDRYRSMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYG---------- 64 (155) T ss_pred CcchHHH---HHHH---HHHHhCCeeEEeecCCCCCccccchhhhhhhhcccccccCCcHHHHHHHHhcC---------- Confidence 7776553 5554 4556889999999975 278999999999999 Q ss_pred ccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCC Q lcl|NC_019544. 59 AWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPP 138 (168) Q Consensus 59 ~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~pp 138 (168) +++||+|||||+|+++++++|.+.+++.+ .+..+++++|+++|+.++++||++|+++. || T Consensus 65 ---------------~~~IP~RPFlr~t~~~~~~~~~~~l~~~~----~~~~~~~~~L~~lG~~~~~~Ik~~I~~~~-~p 124 (155) T protein:vir:10 65 ---------------TSKLPARPFMEKTIADRSAEWIKGLTVMM----TMGYDAEVAMGQIGQAMKDDIKTTISEWP-AD 124 (155) T ss_pred ---------------CCCCCCcchhHHHHHHHHHHHHHHHHHHH----HcCCCHHHHHHHHHHHHHHHHHHHHhcCC-CC Confidence 36899999999999999999998876554 56889999999999999999999999986 99 Q ss_pred ChHHHHHhcCCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 139 NSQMTIERKGSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 139 nsp~Ti~~KG~~~PLiDTG~L~~SIty~V~ 168 (168) |||+||++||||+||||||+|++||+|+|. T Consensus 125 na~~Ti~~KG~~kPLidTG~l~~SIty~Vv 154 (155) T protein:vir:10 125 NSADWAGKKGFNHGLIWTSHLLNSVEQEIV 154 (155) T ss_pred CcHHHHHhcCCCCchhHHHHHHHhhhhhcc Confidence 999999999999999999999999999999 No 8 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=100.00 E-value=4.9e-46 Score=268.99 Aligned_cols=136 Identities=18% Similarity=0.271 Sum_probs=121.6 Q ss_pred CcceecccchHHHHHHHHHHhhCCeEEEEeecC---------------------CCchHHHHHhhhhcCceeccCccccc Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQIKVGLFGK---------------------DDSELVMIGAVHEYGAEIPVTPKMRA 59 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v~VGi~~~---------------------~g~~~a~iA~~~E~G~~i~~~~~~~~ 59 (168) |+...+ .+++...+.+++|++++|+|||+++ +|.++|+||++||||. T Consensus 1 ~~~~~~--~g~~~~~~~~~~l~~~~v~vG~l~~a~yp~G~~~~~~~~~~~~~~~~g~~va~Ia~~~E~G~---------- 68 (168) T protein:vir:94 1 MTTIAR--KGVKMPPHLEAQFQSGEVKAGVLSGSTYPQMTYTDQRTGKQIEDARGGMPVAVIAQALEYGH---------- 68 (168) T ss_pred Cccccc--hhhhhhHHHHHhhhccceeeeccccCcccccccchhhcccccccccccccHHHHHHHHhcCC---------- Confidence 665533 4688888999999999999999652 4568999999999994 Q ss_pred cccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCC Q lcl|NC_019544. 60 WFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPN 139 (168) Q Consensus 60 ~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppn 139 (168) ++||+|||||+|+++++++|.+.++ +++.|+.+++++|+.+|+.++++||.+|+++. ||| T Consensus 69 ---------------~~IP~RPFlr~t~~~~~~~~~~~~~----~~~~~~~~~~~~L~~lG~~~~~~Ik~~I~~~~-ppn 128 (168) T protein:vir:94 69 ---------------GQNHPRPFMQQTYAAQYRAWSRDLT----LTLKAGAAADTALRTVGQRMAEDIQDTIRNWP-ADN 128 (168) T ss_pred ---------------CCCCCchhhHHHHHHHHHHHHHHHH----HHHhcCCCHHHHHHHHHHHHHHHHHHHhhcCC-CCc Confidence 5899999999999999999988664 56678999999999999999999999999985 999 Q ss_pred hHHHHHhcCCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 140 SQMTIERKGSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 140 sp~Ti~~KG~~~PLiDTG~L~~SIty~V~ 168 (168) ||+||++||||+||||||+|++||+|+|- T Consensus 129 a~sTi~~KG~~~PLiDTG~l~~SIty~Vv 157 (168) T protein:vir:94 129 SPEWAAIKGFNAGLRQTGVLLNAIDSAVI 157 (168) T ss_pred cHHHHHhcCCCCchhHHHHHHhhcceeee Confidence 99999999999999999999999999665 No 9 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=100.00 E-value=9.4e-46 Score=267.44 Aligned_cols=132 Identities=21% Similarity=0.326 Sum_probs=116.3 Q ss_pred CcceecccchHHHHHHHHHHhhCCeEEEEeecC----C------------------CchHHHHHhhhhcCceeccCcccc Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQIKVGLFGK----D------------------DSELVMIGAVHEYGAEIPVTPKMR 58 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v~VGi~~~----~------------------g~~~a~iA~~~E~G~~i~~~~~~~ 58 (168) |++..+. |+. .+++|++++|+|||+++ | |.++|+||+|||||. T Consensus 1 m~~~r~~---l~~---~~~~l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~G~pva~ia~~~e~G~--------- 65 (155) T protein:vir:77 1 MSVTRRG---LTL---PKDRYRSMSVKAGVLAGATYPDESGKKLADGSILKKDPRAGLPVAMIAMALNYGT--------- 65 (155) T ss_pred CcchHHH---HHH---HHHHHhcCceEEeecCCCCCccccchhhhhhhhccccccccccHhhhhhhhhcCC--------- Confidence 7777552 444 34557889999999874 2 789999999999993 Q ss_pred ccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCC Q lcl|NC_019544. 59 AWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPP 138 (168) Q Consensus 59 ~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~pp 138 (168) ++||||||||+|+++++++|.+.+.+.+ .+..+++++|+.+|..++++||++|+++.+| T Consensus 66 ----------------~~IP~RPFlr~t~~~~~~~~~~~l~~~~----~~~~~~~~~L~~lG~~~~~~Iq~~I~~~~~p- 124 (155) T protein:vir:77 66 ----------------SKLPARPFMEKTIADRSAEWIKGLTVMM----TMGYDAEVAMGQIGQAMKDDIKTTISEWPAD- 124 (155) T ss_pred ----------------CCCCCCchhhHHHHHHHHHHHHHHHHHH----HccCcHHHHHHHHHHHHHHHHHHHHhcCCCC- Confidence 6899999999999999999998877655 4578999999999999999999999999986 Q ss_pred ChHHHHHhcCCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 139 NSQMTIERKGSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 139 nsp~Ti~~KG~~~PLiDTG~L~~SIty~V~ 168 (168) |+|+||++||||+||||||+|++||||+|. T Consensus 125 ~~~~Ti~~KG~d~PLidTG~l~~SIty~Vv 154 (155) T protein:vir:77 125 NNADWAGKKGFNHGLIWTSHLLNSIEQEIV 154 (155) T ss_pred CChHHHHhcCCCCchhHHHHHHHhhhhhcc Confidence 678999999999999999999999999999 No 10 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=100.00 E-value=1.5e-45 Score=266.38 Aligned_cols=132 Identities=21% Similarity=0.327 Sum_probs=115.7 Q ss_pred CcceecccchHHHHHHHHHHhhCCeEEEEeecC----C------------------CchHHHHHhhhhcCceeccCcccc Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQIKVGLFGK----D------------------DSELVMIGAVHEYGAEIPVTPKMR 58 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v~VGi~~~----~------------------g~~~a~iA~~~E~G~~i~~~~~~~ 58 (168) |+|.++ +|+++++ +|++++|+||||++ | |.++|.||+|||||. T Consensus 1 m~v~r~---~L~~~~~---~l~~~~V~VGi~~~a~y~d~~g~~~~~g~~~~~~~~~G~pva~ia~~~e~G~--------- 65 (155) T protein:vir:10 1 MSVTRR---GLTLPKD---RYKSMSVKAGVLAGATYPDESGKKLADGTILKKDPRAGLPVAMIAMALNYGT--------- 65 (155) T ss_pred CcchHH---HHHHHHH---HhhCCeeEEeecCCCCCCccccchhhhhhhhccccccCcchhhhhhhhhcCC--------- Confidence 888765 3565554 55778899999864 2 788999999999993 Q ss_pred ccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCC Q lcl|NC_019544. 59 AWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPP 138 (168) Q Consensus 59 ~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~pp 138 (168) ++||||||||+|+++++++|.+.+++. +.+..+++++|+.+|..++++||++|+++.+| T Consensus 66 ----------------~~IP~RPFlr~t~~~~~~~~~~~l~~~----~~~~~~~~~~L~~~G~~~~~~Ik~~I~~~~~p- 124 (155) T protein:vir:10 66 ----------------SKLPARPFMEKTIADRSAEWIKGLTVM----MTMGYDAEVAMGQIGQAMKDDIKTTISEWPAD- 124 (155) T ss_pred ----------------CCCCCcchhHHHHHHHHHHHHHHHHHH----HHcCCCHHHHHHHHHHHHHHHHHHHHhcCCCC- Confidence 689999999999999999999877655 45688999999999999999999999999976 Q ss_pred ChHHHHHhcCCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 139 NSQMTIERKGSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 139 nsp~Ti~~KG~~~PLiDTG~L~~SIty~V~ 168 (168) |+|+||++||||+||||||+|++||+|+|. T Consensus 125 ~~~~Ti~~KG~~~PLidTG~l~~Sity~Vv 154 (155) T protein:vir:10 125 NNADWAGKKGFNHGLIWTSHLLNSIEQEIV 154 (155) T ss_pred CChHHHHhcCCCCchHHHHHHHHhhhhhcc Confidence 678999999999999999999999999888 No 11 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=100.00 E-value=1.3e-43 Score=255.61 Aligned_cols=139 Identities=17% Similarity=0.233 Sum_probs=114.8 Q ss_pred cceecccchHHHHHHHHHHhhCCeEEEEeecC-----CCchHHHHHhhhhcCceeccCccccccccccchhhhcccceec Q lcl|NC_019544. 2 KVTIKDTNNIDKITRNLQQLGGKQIKVGLFGK-----DDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIK 76 (168) Q Consensus 2 ~v~i~~~~~~~~~~~~l~~l~~~~v~VGi~~~-----~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~ 76 (168) -|+.....++++|.++|++|+++.|+||||+| ||.++++||+|||||. ++ T Consensus 1 ~~~~~~~~G~~~L~~~~k~l~~~~V~VGi~~d~g~~~dG~sv~~vA~~~EfG~-------------------------~~ 55 (160) T protein:vir:95 1 MVKRVIHPARAKLVGAMKNLQTANAQVGYFQEQGQHSSGFSYPALMYLQEVIG-------------------------VP 55 (160) T ss_pred CceeechHhHHHHHHHHHHHhCCeeEEeeccccccCCCCccHHHHHhhhhcCc-------------------------cc Confidence 23333456789999999999999999999974 6889999999999994 58 Q ss_pred cCCCchhHHHHHH----HHHHHHHH-HHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC----CCCCChHHHHHhc Q lcl|NC_019544. 77 IPERSWLRSGYDE----NIDKIAKK-IEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL----KDPPNSQMTIERK 147 (168) Q Consensus 77 IP~RpFlr~~~~~----~~~~~~~~-~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~----~~ppnsp~Ti~~K 147 (168) ||+|||||++|+. +...+.++ ..+...++..|+.++ .+.+|+.++++||.+|.+. .||||||+||++| T Consensus 56 iPaRPf~R~tfe~~~~~~~~~~~~~~~~~i~~~~~~g~~~~---~~~LG~~~~~~ik~~I~~~~~p~~w~pNap~Ti~~K 132 (160) T protein:vir:95 56 SASGKVYRRLFEITMMLNKQTLLEQTKKNLYKQLSSLNTDP---SNTLEAFAKNAQKAIKRGFGNSAILPPNAPSTVKKK 132 (160) T ss_pred CCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhH---HHHHHHHHHHHHHHHHhhcCCccCCCCCcHHHHHhc Confidence 9999999999973 33334344 444556666666554 4559999999999999884 4789999999999 Q ss_pred CCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 148 GSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 148 G~~~PLiDTG~L~~SIty~V~ 168 (168) |||+||||||+|++||+|+|. T Consensus 133 gs~~PLiDTg~l~~Si~y~v~ 153 (160) T protein:vir:95 133 GFNAPLVETGDLRDNLAYKIS 153 (160) T ss_pred CCCCcchhhHHHhhhhhheee Confidence 999999999999999999999 No 12 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=98.79 E-value=2.1e-11 Score=79.08 Aligned_cols=75 Identities=25% Similarity=0.243 Sum_probs=58.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCChHHHHHhcCCCCcchhHHHHH Q lcl|NC_019544. 87 YDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL------KDPPNSQMTIERKGSDNPLIDTGRLV 160 (168) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~------~~ppnsp~Ti~~KG~~~PLiDTG~L~ 160 (168) +-+....+.+.++.... .....|..+|..++..+++.+.+. .|+|+||+|+++|++++||+|||.|+ T Consensus 1 ~i~~~~~i~~~l~~l~~-------~~~~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~~~~L~~tG~L~ 73 (145) T protein:vir:31 1 MVEDENNIPEAREAIQD-------GLTDGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGSDTPLIDNSRLL 73 (145) T ss_pred CcccHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcCCCCCccCHHHH Confidence 33334444444443322 234568889999999999999863 48899999999999999999999999 Q ss_pred hHhccccC Q lcl|NC_019544. 161 GSIRHTVE 168 (168) Q Consensus 161 ~SIty~V~ 168 (168) +||+|.+. T Consensus 74 ~Si~~~~~ 81 (145) T protein:vir:31 74 TDINAASM 81 (145) T ss_pred HHHHHHhh Confidence 99999875 No 13 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.59 E-value=1.1e-10 Score=75.21 Aligned_cols=83 Identities=14% Similarity=0.201 Sum_probs=59.8 Q ss_pred hhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCChH Q lcl|NC_019544. 68 LRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL------KDPPNSQ 141 (168) Q Consensus 68 ~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~------~~ppnsp 141 (168) |. . -++.|- -.++.+.+...+..+ .+...++..||..+...+++.|.+. .|+|++| T Consensus 1 M~-~-i~i~~d------------~~~~~~~L~~l~~~~----~~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~ 62 (190) T protein:vir:99 1 MA-G-ITLEWD------------GRRALDVLNAGSAAL----GDPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSP 62 (190) T ss_pred Cc-e-eEEEec------------HHHHHHHHHHHHHHh----hhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccH Confidence 10 0 112221 123334444444432 2568899999999999999999886 4789999 Q ss_pred HHHHhc--CCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 142 MTIERK--GSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 142 ~Ti~~K--G~~~PLiDTG~L~~SIty~V~ 168 (168) +|+++| ++.++|.|||.|++||+|.+. T Consensus 63 ~t~~rk~~~~~~~L~~tg~L~~Si~~~~~ 91 (190) T protein:vir:99 63 AYLRRKRKNRDKILTLDGHLRNLLRYQLD 91 (190) T ss_pred HHHHHhhcCCCccceecHHHHHHHhheec Confidence 999765 577999999999999999999 No 14 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=98.53 E-value=1.1e-10 Score=75.10 Aligned_cols=84 Identities=14% Similarity=0.149 Sum_probs=59.2 Q ss_pred cCceeccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHH Q lcl|NC_019544. 47 YGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGL 126 (168) Q Consensus 47 ~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ 126 (168) .|. .-++.+- ...+.+.+.+.... -.+...+|..||..+... T Consensus 1 M~~----------------------~i~i~~d------------~~~~~~~L~~l~~~----~~d~~~l~~~ig~~l~~~ 42 (155) T protein:vir:79 1 MTT----------------------RIDVELD------------DQEVRQRLAVLMRS----VTDTLPVMRGIAAELLAE 42 (155) T ss_pred Cce----------------------EEEEEec------------hHHHHHHHHHHHHH----hhhHHHHHHHHHHHHHHH Confidence 111 1112221 12333333333333 226788999999999999 Q ss_pred HHHHHHhC--CCCCChHHHHHhc-----CCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 127 IQKKMRDL--KDPPNSQMTIERK-----GSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 127 ik~~I~~~--~~ppnsp~Ti~~K-----G~~~PLiDTG~L~~SIty~V~ 168 (168) +++.|... .|+|+||+|+++| +..++|+|||.|++||+|.+. T Consensus 43 ~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~~tG~L~~Si~~~~~ 91 (155) T protein:vir:79 43 TEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQVTNALARSVTTWAD 91 (155) T ss_pred HHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccccchhhhhhhhceec Confidence 99999664 5899999999875 356899999999999999998 No 15 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=98.53 E-value=1.4e-10 Score=74.63 Aligned_cols=84 Identities=12% Similarity=0.086 Sum_probs=59.5 Q ss_pred hcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCC---CCCChHHHHH Q lcl|NC_019544. 69 RKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLK---DPPNSQMTIE 145 (168) Q Consensus 69 ~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~---~ppnsp~Ti~ 145 (168) +...-+| .++. +.+.+.+.+.... -.+...+|..||..+...+++.|.+.. |+|+||+|++ T Consensus 1 Ms~~i~i----------~~d~--~~~~~~L~~l~~~----~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~ 64 (175) T protein:vir:79 1 MSDFVNF----------QIDD--SALRTRLLQLEQA----GHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIH 64 (175) T ss_pred CceEEEE----------Eech--HHHHHHHHHHHHH----hcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHH Confidence 0000111 1221 2344444444433 337788999999999999999998873 7899999986 Q ss_pred hc---------------------CCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 146 RK---------------------GSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 146 ~K---------------------G~~~PLiDTG~L~~SIty~V~ 168 (168) +| ++.++|+|||.|++||+|.+. T Consensus 65 ~r~~~~~~~~~~~~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~ 108 (175) T protein:vir:79 65 MRVGGKKAYKKNGELTAAASRRKAGLMILQDSGQMAASTATDSG 108 (175) T ss_pred hhccccccccccccchhhHhhhccCCCcceechhhhhhhhheec Confidence 43 467899999999999999999 No 16 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=98.53 E-value=6.9e-10 Score=70.80 Aligned_cols=83 Identities=11% Similarity=0.108 Sum_probs=55.8 Q ss_pred CcceecccchHHHHHHHHHHhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCCC Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPER 80 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~R 80 (168) |++. T Consensus 1 ms~~---------------------------------------------------------------------------- 4 (156) T protein:vir:19 1 MSLD---------------------------------------------------------------------------- 4 (156) T ss_pred CeEE---------------------------------------------------------------------------- Confidence 0000 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC-------CCCCChHHHHHhcC----- Q lcl|NC_019544. 81 SWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL-------KDPPNSQMTIERKG----- 148 (168) Q Consensus 81 pFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~-------~~ppnsp~Ti~~KG----- 148 (168) + .+....+.+.+.+.+.. . ......+|..||..+...+++.|.+. .|+|+||+|+++|. T Consensus 5 --i--~~~~d~~~l~~~L~~l~-~----~~~~~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~ 75 (156) T protein:vir:19 5 --M--NVAVDVRRIQLALDELG-T----VTRDRAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFV 75 (156) T ss_pred --E--EEeecHHHHHHHHHHHH-h----hhccHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCC Confidence 0 11111123333333321 1 22335789999999999999999863 47899999999873 Q ss_pred CCCcchhHHHHHhHhccccC Q lcl|NC_019544. 149 SDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 149 ~~~PLiDTG~L~~SIty~V~ 168 (168) ..+||+|||.|++||+|.+. T Consensus 76 ~~~~L~~tg~L~~Si~~~~~ 95 (156) T protein:vir:19 76 PGSILTLHGDLARSITTDYG 95 (156) T ss_pred CCcchhhhHHHHHHhhheec Confidence 36899999999999999998 No 17 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=98.50 E-value=1.7e-10 Score=74.21 Aligned_cols=84 Identities=15% Similarity=0.152 Sum_probs=58.5 Q ss_pred cCceeccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHH Q lcl|NC_019544. 47 YGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGL 126 (168) Q Consensus 47 ~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ 126 (168) .+. ++.-.++ ...+.+.+.+.... -.+...+|..||..+... T Consensus 1 Ms~--------------------------------~i~i~~~--~~~~~~~L~~l~~~----~~~~~~l~~~ig~~l~~~ 42 (155) T protein:vir:10 1 MAN--------------------------------RIELELV--DREVQERLAALYAA----VTDTLPLMRGIAAELLAE 42 (155) T ss_pred CCc--------------------------------eEEEEec--hHHHHHHHHHHHHH----hhhHHHHHHHHHHHHHHH Confidence 110 1111222 12333333333332 236788999999999999 Q ss_pred HHHHHHhC--CCCCChHHHHHh-----cCCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 127 IQKKMRDL--KDPPNSQMTIER-----KGSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 127 ik~~I~~~--~~ppnsp~Ti~~-----KG~~~PLiDTG~L~~SIty~V~ 168 (168) +++.|... .|+|+||.|+++ +|+.++|+|||.|++||+|.+. T Consensus 43 ~~~rF~p~G~~W~plsp~t~~~r~k~g~~~~~~L~~tG~L~~Si~~~~~ 91 (155) T protein:vir:10 43 TEFAFMDEGPGWPQLSPVTVAARAAKGRGAHPILQVTNALARSITTRAD 91 (155) T ss_pred HHHHHhhcCCCCCCCCccchHHHHhccCCCCCccccchhhhhhhhceec Confidence 99999664 689999999864 3567899999999999999998 No 18 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=98.42 E-value=3.7e-10 Score=72.28 Aligned_cols=84 Identities=15% Similarity=0.155 Sum_probs=58.7 Q ss_pred hcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC--CCCCChHHHHHh Q lcl|NC_019544. 69 RKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL--KDPPNSQMTIER 146 (168) Q Consensus 69 ~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~--~~ppnsp~Ti~~ 146 (168) +...-++ .++ ...+.+.+.+.... -.+...+|..||..+...+++.|... .|+|+||+|+++ T Consensus 1 Ms~~i~i----------~~d--~~~~~~~L~~l~~~----~~d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~ 64 (155) T protein:vir:99 1 MTTRIDV----------ELD--DQEVRQRLALLMRS----VTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAA 64 (155) T ss_pred CceEEEE----------Eec--hHHHHHHHHHHHHH----hhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHH Confidence 0001111 111 12333444444333 23578899999999999999999653 589999999987 Q ss_pred c-----CCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 147 K-----GSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 147 K-----G~~~PLiDTG~L~~SIty~V~ 168 (168) | +..++|+|||.|++||+|.+. T Consensus 65 r~~~g~~~~~iL~~tg~L~~Si~~~~~ 91 (155) T protein:vir:99 65 REAKGRGPHPILQVTNALARSVTTWAD 91 (155) T ss_pred HhccCCCCCCcchhchhhhhhhhceec Confidence 5 346799999999999999998 No 19 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=98.29 E-value=1.2e-09 Score=69.57 Aligned_cols=84 Identities=14% Similarity=0.136 Sum_probs=58.8 Q ss_pred hcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC---CCCCChHHHHH Q lcl|NC_019544. 69 RKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL---KDPPNSQMTIE 145 (168) Q Consensus 69 ~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~---~~ppnsp~Ti~ 145 (168) +...-++.+ + ..++.+. +.++.....+...+|..||..++...++.|.+. +|.|++|+|++ T Consensus 1 Ms~~i~i~~----------~--~~~l~~~----L~~l~~~~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~ 64 (175) T protein:vir:10 1 MSDFVNFQI----------D--DSALRTR----LLQLEQAGHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIH 64 (175) T ss_pred CceeEEEEe----------c--HHHHHHH----HHHHHHHhccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhh Confidence 111111221 1 1233333 333333334678899999999999999999886 36799999986 Q ss_pred h---------------------cCCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 146 R---------------------KGSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 146 ~---------------------KG~~~PLiDTG~L~~SIty~V~ 168 (168) + ++..++|+|||.|++||+|.+. T Consensus 65 ~r~~~g~~~~k~~~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~ 108 (175) T protein:vir:10 65 MRVGGKKAYKKNGELTAAASRRKAGLMILQDSGQMAASVSTDHD 108 (175) T ss_pred hhhcccccchhhhhhhhhhhhhccCCCcceechhhhhhhheeec Confidence 3 3467899999999999999998 No 20 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=98.09 E-value=3.2e-09 Score=67.14 Aligned_cols=111 Identities=15% Similarity=0.165 Sum_probs=56.3 Q ss_pred CcceecccchHHHHHHHHHHhhCCeE-EE------------------EeecCCCchHHHHHhh---hhcCc-eeccC--- Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQI-KV------------------GLFGKDDSELVMIGAV---HEYGA-EIPVT--- 54 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v-~V------------------Gi~~~~g~~~a~iA~~---~E~G~-~i~~~--- 54 (168) |+++++- .++++|++.|++|..... +| -.|.++|.---.|..- ...|. .+.+. T Consensus 2 m~~~~~i-~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~~~ 80 (148) T protein:vir:93 2 IETLLDF-SGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHIRG 80 (148) T ss_pred cceeeee-hhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeeecc Confidence 7777663 457888888888753211 11 1111111100000000 00000 00000 Q ss_pred ----cc-c----c-ccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHH Q lcl|NC_019544. 55 ----PK-M----R-AWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDA 118 (168) Q Consensus 55 ----~~-~----~-~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~ 118 (168) .. . . +......+..|.++++.+.||||||||+++++++++.+.+.+.+.+.+ +.+|.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~pa~PFl~pA~~~~k~~~~~~~~~~~~~~i------~k~~~k 148 (148) T protein:vir:93 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAI------DEVLRR 148 (148) T ss_pred cccccccccceeecCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH------HHHhcC Confidence 00 0 0 000111234567778899999999999999999988888877776633 334444 No 21 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=98.06 E-value=1.7e-08 Score=63.13 Aligned_cols=107 Identities=10% Similarity=0.178 Sum_probs=49.1 Q ss_pred CcceecccchHHHHHHHHHHhhCCe-------EEEEe-----------ecCCCchHHHHH-hhhhcC--c---ee-ccCc Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQ-------IKVGL-----------FGKDDSELVMIG-AVHEYG--A---EI-PVTP 55 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~-------v~VGi-----------~~~~g~~~a~iA-~~~E~G--~---~i-~~~~ 55 (168) |++++++ +++|++.|+.|.... +..|- |.++|..--.+. ..-..| . .+ +... T Consensus 5 ~~~~i~G---l~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~ 81 (146) T protein:vir:10 5 IDLDLLG---FDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKL 81 (146) T ss_pred eeeeehh---HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccccc Confidence 5555554 566666666654321 00010 101110000000 000000 0 00 0000 Q ss_pred cc--cc----ccc----ccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHH Q lcl|NC_019544. 56 KM--RA----WFA----ANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFM 116 (168) Q Consensus 56 ~~--~~----~~~----~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l 116 (168) .. .. +-. .-.+..|.++.+.+.||||||+|+++++++++.+.+...+.+.+.=. | T Consensus 82 ~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka------~ 146 (146) T protein:vir:10 82 EGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLD------L 146 (146) T ss_pred cccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhc------C Confidence 00 00 000 01133456667789999999999999999998888888777755322 2 No 22 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=98.06 E-value=1.7e-08 Score=63.13 Aligned_cols=107 Identities=10% Similarity=0.178 Sum_probs=49.1 Q ss_pred CcceecccchHHHHHHHHHHhhCCe-------EEEEe-----------ecCCCchHHHHH-hhhhcC--c---ee-ccCc Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQ-------IKVGL-----------FGKDDSELVMIG-AVHEYG--A---EI-PVTP 55 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~-------v~VGi-----------~~~~g~~~a~iA-~~~E~G--~---~i-~~~~ 55 (168) |++++++ +++|++.|+.|.... +..|- |.++|..--.+. ..-..| . .+ +... T Consensus 5 ~~~~i~G---l~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~ 81 (146) T protein:vir:10 5 IDLDLLG---FDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKL 81 (146) T ss_pred eeeeehh---HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccccc Confidence 5555554 566666666654321 00010 101110000000 000000 0 00 0000 Q ss_pred cc--cc----ccc----ccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHH Q lcl|NC_019544. 56 KM--RA----WFA----ANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFM 116 (168) Q Consensus 56 ~~--~~----~~~----~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l 116 (168) .. .. +-. .-.+..|.++.+.+.||||||+|+++++++++.+.+...+.+.+.=. | T Consensus 82 ~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka------~ 146 (146) T protein:vir:10 82 EGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLD------L 146 (146) T ss_pred cccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhc------C Confidence 00 00 000 01133456667789999999999999999998888888777755322 2 No 23 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=98.06 E-value=1.7e-08 Score=63.13 Aligned_cols=107 Identities=10% Similarity=0.178 Sum_probs=49.1 Q ss_pred CcceecccchHHHHHHHHHHhhCCe-------EEEEe-----------ecCCCchHHHHH-hhhhcC--c---ee-ccCc Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQ-------IKVGL-----------FGKDDSELVMIG-AVHEYG--A---EI-PVTP 55 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~-------v~VGi-----------~~~~g~~~a~iA-~~~E~G--~---~i-~~~~ 55 (168) |++++++ +++|++.|+.|.... +..|- |.++|..--.+. ..-..| . .+ +... T Consensus 5 ~~~~i~G---l~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~ 81 (146) T protein:vir:10 5 IDLDLLG---FDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKL 81 (146) T ss_pred eeeeehh---HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccccc Confidence 5555554 566666666654321 00010 101110000000 000000 0 00 0000 Q ss_pred cc--cc----ccc----ccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHH Q lcl|NC_019544. 56 KM--RA----WFA----ANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFM 116 (168) Q Consensus 56 ~~--~~----~~~----~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l 116 (168) .. .. +-. .-.+..|.++.+.+.||||||+|+++++++++.+.+...+.+.+.=. | T Consensus 82 ~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka------~ 146 (146) T protein:vir:10 82 EGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLD------L 146 (146) T ss_pred cccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhc------C Confidence 00 00 000 01133456667789999999999999999998888888777755322 2 No 24 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=98.06 E-value=1.7e-08 Score=63.13 Aligned_cols=107 Identities=10% Similarity=0.178 Sum_probs=49.1 Q ss_pred CcceecccchHHHHHHHHHHhhCCe-------EEEEe-----------ecCCCchHHHHH-hhhhcC--c---ee-ccCc Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQ-------IKVGL-----------FGKDDSELVMIG-AVHEYG--A---EI-PVTP 55 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~-------v~VGi-----------~~~~g~~~a~iA-~~~E~G--~---~i-~~~~ 55 (168) |++++++ +++|++.|+.|.... +..|- |.++|..--.+. ..-..| . .+ +... T Consensus 5 ~~~~i~G---l~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~ 81 (146) T protein:vir:10 5 IDLDLLG---FDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAKL 81 (146) T ss_pred eeeeehh---HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceeccccc Confidence 5555554 566666666654321 00010 101110000000 000000 0 00 0000 Q ss_pred cc--cc----ccc----ccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHH Q lcl|NC_019544. 56 KM--RA----WFA----ANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFM 116 (168) Q Consensus 56 ~~--~~----~~~----~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l 116 (168) .. .. +-. .-.+..|.++.+.+.||||||+|+++++++++.+.+...+.+.+.=. | T Consensus 82 ~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka------~ 146 (146) T protein:vir:10 82 EGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLD------L 146 (146) T ss_pred cccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhc------C Confidence 00 00 000 01133456667789999999999999999998888888777755322 2 No 25 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=98.04 E-value=7.6e-09 Score=65.10 Aligned_cols=105 Identities=17% Similarity=0.221 Sum_probs=54.2 Q ss_pred Ccceeccc--chHHHHHHHHHHhhCC------------------------------eEEEEeecC---CCc--------- Q lcl|NC_019544. 1 MKVTIKDT--NNIDKITRNLQQLGGK------------------------------QIKVGLFGK---DDS--------- 36 (168) Q Consensus 1 M~v~i~~~--~~~~~~~~~l~~l~~~------------------------------~v~VGi~~~---~g~--------- 36 (168) ||+++.+. +++...++.|.+..++ .+.+-...+ +|. T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s~~g~~~~~Vg~~~ 80 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEESVEGIQTYAVSWRK 80 (157) T ss_pred CeeEeecccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccCCCceEEEEEeecC Confidence 99998643 3444444444322211 111111111 111 Q ss_pred hHHHHHhhhhcCceecc----CccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHH----HHHHHHHHHhc Q lcl|NC_019544. 37 ELVMIGAVHEYGAEIPV----TPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKK----IEKMVPDVIEG 108 (168) Q Consensus 37 ~~a~iA~~~E~G~~i~~----~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~----~~~~~~~~l~G 108 (168) .-+-++.+.|||...+. .++..|+-.. ... ..+..+||||||||+|+..+++..+. +.+.+.+++.| T Consensus 81 ~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~----~~~-~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I~e~l~g 155 (157) T protein:vir:97 81 KAAPHGHLLEFGHWQTHAAYRDKDGQWYSSK----VKL-VNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKYAELQRG 155 (157) T ss_pred CccceeeeeecCcccccccccCCcccccccc----ccc-CCCCcCCCCcccchHHHHhHHHHHHHHHHHHHHHHHHHhcC Confidence 12344555688843211 1111111000 011 12457999999999999988777666 45678888888 Q ss_pred cC Q lcl|NC_019544. 109 NV 110 (168) Q Consensus 109 ~~ 110 (168) +. T Consensus 156 ~~ 157 (157) T protein:vir:97 156 DT 157 (157) T ss_pred CC Confidence 85 No 26 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=98.03 E-value=1.8e-08 Score=63.00 Aligned_cols=96 Identities=13% Similarity=0.071 Sum_probs=52.3 Q ss_pred CcceecccchHHHHHHHHHHhhCC---eEE---------------EEeecCCCchHHHHHhhhhcCceeccCcccccccc Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGK---QIK---------------VGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFA 62 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~---~v~---------------VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~ 62 (168) |++++++ ++++.+.|+.+.+. .|. -..|-++|.--..| .+...+....... T Consensus 1 msi~i~G---ld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI--------~~~~~g~~~~V~~ 69 (114) T protein:vir:95 1 MAIKWQG---IEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHI--------TTSYPGMEAHIHG 69 (114) T ss_pred Ceeeeeh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhce--------eeecCceEEEeec Confidence 9998875 66666666655431 110 00111111100000 0000000000011 Q ss_pred ccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019544. 63 ANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIE 107 (168) Q Consensus 63 ~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~ 107 (168) ...+..+.++++...|+||||+|++++++.++.+.++..+++-+. T Consensus 70 ~~~Ya~yvE~GT~~~~aqPfl~pa~~~~~~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 70 EAGYDGYQEYGTRFQPGTPHFRPMMEQIQPQFQKDMTDVMKGAFK 114 (114) T ss_pred CCCccceeecCccccCCCccchhhHHHHHHHHHHHHHHHHHhhcC Confidence 123344445556689999999999999999999999999988776 No 27 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.02 E-value=2.5e-08 Score=62.30 Aligned_cols=99 Identities=17% Similarity=0.343 Sum_probs=61.0 Q ss_pred Cccee-cccchHHHHHHHHHH-hhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccc--------------- Q lcl|NC_019544. 1 MKVTI-KDTNNIDKITRNLQQ-LGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAA--------------- 63 (168) Q Consensus 1 M~v~i-~~~~~~~~~~~~l~~-l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~--------------- 63 (168) .+-++ .++ -.|.+.|.. .....|.||- ...||++|+||++|.++++....... T Consensus 71 ~~~~~L~~t---g~L~~Si~~~~~~~~v~vGt-------n~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~ 140 (190) T protein:vir:99 71 NRDKILTLD---GHLRNLLRYQLDGSELLFGS-------DRPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVPR 140 (190) T ss_pred CCCccceec---HHHHHHHhheecCcEEEEec-------CcchhhhhhcCCcccccccchhhhhhhhhhhhhhhcccccc Confidence 11111 111 234555543 3556788874 25689999999999887655432210 Q ss_pred c----chhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019544. 64 N----GYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNV 110 (168) Q Consensus 64 ~----g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~ 110 (168) . +.......++++||+||||--+ ++..+++.+.+...+..++...+ T Consensus 141 ~~~~~~~~~~~~~~~v~IPaRpfLG~s-~~d~~~I~~~i~~~l~~~~~~~~ 190 (190) T protein:vir:99 141 RRSNFAQDVQIGPYTIQMPARPWLGTS-SQDDDTILQRVERYLQRALRERA 190 (190) T ss_pred cccccchhcccccceeeecCcccCCCC-HHHHHHHHHHHHHHHHHHHhhcC Confidence 0 1112334568999999999544 45568888888888888887665 No 28 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=97.99 E-value=3e-08 Score=61.83 Aligned_cols=112 Identities=15% Similarity=0.090 Sum_probs=52.4 Q ss_pred Ccceec-ccchHHHHHHHHHHhhC-CeE--------EEEe-----------ecCCCchHHHHHhhhhcC-----cee-cc Q lcl|NC_019544. 1 MKVTIK-DTNNIDKITRNLQQLGG-KQI--------KVGL-----------FGKDDSELVMIGAVHEYG-----AEI-PV 53 (168) Q Consensus 1 M~v~i~-~~~~~~~~~~~l~~l~~-~~v--------~VGi-----------~~~~g~~~a~iA~~~E~G-----~~i-~~ 53 (168) |+=.++ ...+|++|++.|++|.+ ..+ ..|- |.+++..-.....+...| ..+ ++ T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~ 80 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEPKI 80 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceeccc Confidence 874433 23568888888888843 111 1111 111100000000000000 000 00 Q ss_pred Cc-ccc-----cccc----ccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHH Q lcl|NC_019544. 54 TP-KMR-----AWFA----ANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGM 121 (168) Q Consensus 54 ~~-~~~-----~~~~----~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~ 121 (168) .. ++. ++.. ...+..|.++.+.+.||+||||++++++++++.+.+...+.+.+.-. ||. T Consensus 81 ~~~~g~~~~~VG~~~~~~~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~~~---------lG~ 149 (149) T protein:vir:13 81 RKKKGNLQCVVGWEKSDNTPFYYMKMEEWGTSERPPHHAFGKTNKILKRVYDNIAQKKYDNFVKEK---------LGD 149 (149) T ss_pred ccccceeEEEeeccCCCCCccceeeeeccCccCCCCCccchHHHHHHHHHHHHHHHHHHHHHHHHH---------hcC Confidence 00 000 0000 01244566778899999999999999999888877766555544321 111 No 29 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=97.98 E-value=1.8e-08 Score=63.11 Aligned_cols=101 Identities=11% Similarity=-0.005 Sum_probs=53.8 Q ss_pred CcceecccchHHHHHHHHHHhhCCe-------EEEEe-----------ecCCCch-----HHH---HHhhhhcCceeccC Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQ-------IKVGL-----------FGKDDSE-----LVM---IGAVHEYGAEIPVT 54 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~-------v~VGi-----------~~~~g~~-----~a~---iA~~~E~G~~i~~~ 54 (168) |++.+++ +++|++.|+.|.... +..|- |.++|.. ++. +..+...+....+ T Consensus 1 m~v~i~G---l~el~~~l~~l~~~~~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~~~h~~d~I~~~~~k~~~g~~~~- 76 (128) T protein:vir:38 1 MGVKVTG---DAELLANLNKLQFGVAKEARAAVRDGAQKFADKLKSNTPEWDGETDMSGHLRDDIKLSSVRETSGLTEV- 76 (128) T ss_pred Cccchhh---HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCCCcccchhhhhhccccccccCceeEE- Confidence 9998875 566666666654211 11110 1111110 000 0000000000000 Q ss_pred ccccccc-cccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019544. 55 PKMRAWF-AANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIE 107 (168) Q Consensus 55 ~~~~~~~-~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~ 107 (168) ..++. ....+..|.+..+.+.||+||||++++++++++.+.+.+.+++.+- T Consensus 77 --~VG~~k~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 77 --DVGYGKDTGWRAHFPNSGTSMQDPQHFIEETQEIMRPVVIAAFLSHLKEGGM 128 (128) T ss_pred --EeeecCCCceEEeeeccCccCCCCCcchhHHHHHhHHHHHHHHHHHHHhhcC Confidence 00000 1123455677788999999999999999999999999988888554 No 30 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=97.96 E-value=1.8e-08 Score=63.01 Aligned_cols=120 Identities=14% Similarity=0.034 Sum_probs=59.7 Q ss_pred Ccceec-ccchHHHHHHHHHHhhCCeE-EE-------E-----------eec-C---CCchHHHHHhhhhcCce------ Q lcl|NC_019544. 1 MKVTIK-DTNNIDKITRNLQQLGGKQI-KV-------G-----------LFG-K---DDSELVMIGAVHEYGAE------ 50 (168) Q Consensus 1 M~v~i~-~~~~~~~~~~~l~~l~~~~v-~V-------G-----------i~~-~---~g~~~a~iA~~~E~G~~------ 50 (168) |+-.++ .-.+|++|.+.|++|....- ++ | .|. + ++..+..--.+.+-+.. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~ 80 (164) T protein:vir:43 1 MADTVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGD 80 (164) T ss_pred CCcceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccccc Confidence 553332 12357788888877753210 00 0 010 0 11111100000000000 Q ss_pred ----eccCccc-------ccccccc---chhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHH Q lcl|NC_019544. 51 ----IPVTPKM-------RAWFAAN---GYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFM 116 (168) Q Consensus 51 ----i~~~~~~-------~~~~~~~---g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l 116 (168) +.+..+. +..-..+ .+..|.++.+.+.||||||||++++++++..+.+.+.+.+. .+.+| T Consensus 81 ~~~~vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT~km~a~PFlrPA~~~~k~~~~~~~~~~l~~~------i~ka~ 154 (164) T protein:vir:43 81 LGFRIGVLHGAVLPKKGERSDKTANAPTPHWRLLEFGTEDMRAQPFMRSALADNIAEVTSTFVSEYEKG------IDRAI 154 (164) T ss_pred eeEEecccccccccccccccccCCCCCcceEEEeecCCCCCCCCcchhhhHHHhHHHHHHHHHHHHHHH------HHHHH Confidence 0000000 0000011 23446777888999999999999999999888888777763 35666 Q ss_pred HHHHHHHHHH Q lcl|NC_019544. 117 DAIGMEFAGL 126 (168) Q Consensus 117 ~~iG~~~~~~ 126 (168) .+.+..++.- T Consensus 155 ~k~~~~~~~~ 164 (164) T protein:vir:43 155 KRAAKKAAQG 164 (164) T ss_pred HHHHhhhccC Confidence 6666655554 No 31 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=97.95 E-value=2.3e-08 Score=62.45 Aligned_cols=84 Identities=13% Similarity=0.161 Sum_probs=48.7 Q ss_pred Ccce---------ecccchHH--HHHHHHH-HhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhh Q lcl|NC_019544. 1 MKVT---------IKDTNNID--KITRNLQ-QLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPL 68 (168) Q Consensus 1 M~v~---------i~~~~~~~--~~~~~l~-~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~ 68 (168) -..+ .....-+. .+...|. ......+.|||.+. ...||++|.||+++++.++ T Consensus 54 ~p~~~~~~~~k~~~~~~~l~~~g~l~~sl~~~~~~~~~~V~~~Gs----~~~yAa~HQfG~~~r~~~~------------ 117 (149) T protein:vir:98 54 AARKRQSVRSKKGRIRREMFARLRTNRFMKAKGSDSAAVVEFTGR----VQRMARVHQYGLKDRPNRH------------ 117 (149) T ss_pred cccchHHHHhccCCCCcccchhhhhhhhhhheecCCeeEEEecCc----chHHhhHhhccccccccCC------------ Confidence 0000 00000011 1122222 23556799998743 3589999999998876543 Q ss_pred hcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 69 RKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPD 104 (168) Q Consensus 69 ~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 104 (168) .+.++||+||||--+ ++.++++.+.+...+.+ T Consensus 118 ---~~~~~iPaRp~LG~s-~~d~~~i~~~i~~~l~~ 149 (149) T protein:vir:98 118 ---SRDVQYAARPLLGFT-RDDEQMIEDIIIRHLGK 149 (149) T ss_pred ---CcceeccccccCCCC-HHHHHHHHHHHHHHhhC Confidence 357899999999433 34456666666666665 No 32 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=97.93 E-value=4.5e-08 Score=60.86 Aligned_cols=99 Identities=9% Similarity=0.020 Sum_probs=51.4 Q ss_pred CcceecccchHHHHHHHHHHhhCCe---EEEE---------------eecCCCchHHHHHh--h--hhcCceeccCcccc Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQ---IKVG---------------LFGKDDSELVMIGA--V--HEYGAEIPVTPKMR 58 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~---v~VG---------------i~~~~g~~~a~iA~--~--~E~G~~i~~~~~~~ 58 (168) |++.+++ +++|.+.|+++.... |.-. .|-++|.----|.. + -.-|.++.+ T Consensus 5 ~~i~~~G---ld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~~~~~~~~v----- 76 (125) T protein:vir:94 5 FNIKFKG---VDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVKEEHGVVTGRY----- 76 (125) T ss_pred eeeeehh---HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecceeccCCcEEEEe----- Confidence 6666654 666666666553311 1000 01111210000000 0 000000000 Q ss_pred ccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019544. 59 AWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGN 109 (168) Q Consensus 59 ~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~ 109 (168) -....+..+.++++...|+||||+|+++++++++.+.+++.+.+++.-. T Consensus 77 --~~~~~Ya~~vEfGT~~~~a~Pfl~pa~~~~~~~~~~~l~~~l~~a~k~~ 125 (125) T protein:vir:94 77 --VARADYSSYNEYGTYRMSAQPFMAPSVAAMTPFFYKAVRDALNKAAKFS 125 (125) T ss_pred --eCCCCccceeecccccCCCCcccchhHHHHHHHHHHHHHHHHHHHhccC Confidence 0111233445555678999999999999999999999999998877544 No 33 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=97.90 E-value=1.5e-08 Score=63.45 Aligned_cols=111 Identities=12% Similarity=0.127 Sum_probs=55.9 Q ss_pred CcceecccchHHHHHHHHHHhhCCeE-EE---E---------------eecCCCchHHHHHh-----hhhcC--ceeccC Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQI-KV---G---------------LFGKDDSELVMIGA-----VHEYG--AEIPVT 54 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v-~V---G---------------i~~~~g~~~a~iA~-----~~E~G--~~i~~~ 54 (168) |+++++- .+|++|++.|+.|..... ++ . .|.++|.--..|.. -.+-+ ..+.+. T Consensus 2 m~~~~~i-~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~~~ 80 (149) T protein:vir:19 2 IETSLDF-SGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) T ss_pred cceeeeh-hhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeeccccc Confidence 7777763 358888888888754211 11 0 11112210000000 00000 000000 Q ss_pred c------cccccc-----cccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHH Q lcl|NC_019544. 55 P------KMRAWF-----AANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDA 118 (168) Q Consensus 55 ~------~~~~~~-----~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~ 118 (168) . ....+. ....+..|.+..+.+.||||||||+++++++++.+.+...+.+.+ +.++.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PF~~pA~~~~k~~~~~~~~~~l~~~l------~k~~~k 149 (149) T protein:vir:19 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAI------DEVLSK 149 (149) T ss_pred ccccccccccceeecCCCCccceeeeeccCCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHH------HHHhcC Confidence 0 000000 011234567778899999999999999999988887777776633 334444 No 34 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=97.87 E-value=5.4e-08 Score=60.42 Aligned_cols=102 Identities=14% Similarity=0.203 Sum_probs=52.4 Q ss_pred CcceecccchHHHHHHHHHHhhCCeEEEEeecCCCchHHHHHhhh---hcC-----ceeccCc-ccccc-ccccchhhhc Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQIKVGLFGKDDSELVMIGAVH---EYG-----AEIPVTP-KMRAW-FAANGYPLRK 70 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v~VGi~~~~g~~~a~iA~~~---E~G-----~~i~~~~-~~~~~-~~~~g~~~~~ 70 (168) |+++++- .+++++++.|+++......--...+.+..+..-|.-+ .-| +...... +..+. .....+..+. T Consensus 1 M~~~i~i-~Gld~l~~~L~~~~~~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~v 79 (112) T protein:vir:36 1 MKSSLSF-KGIDQLVKHLDKAASLKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMELTEGGFSGQAGPHTDYSAYV 79 (112) T ss_pred Cceeeee-hhHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeeecCCceEEEeecCCCcccee Confidence 8888774 4588888888776442110000000011111100000 000 0000000 00000 1112344556 Q ss_pred ccceeccCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 71 ETTVIKIPERSWLRSGYDENIDKIAKKIEKMVP 103 (168) Q Consensus 71 ~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 103 (168) ++++...|+||||||+++.++.++.+.+++.++ T Consensus 80 E~GT~k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 80 EYGTRFQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred eccccccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 666778999999999999999998888887777 No 35 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=97.84 E-value=1.6e-08 Score=63.38 Aligned_cols=120 Identities=9% Similarity=0.016 Sum_probs=49.9 Q ss_pred Ccceec-ccchHHHHHHHHHHhhCCeEE-EEe--ecCCCchHHHHHhh-------------------------------- Q lcl|NC_019544. 1 MKVTIK-DTNNIDKITRNLQQLGGKQIK-VGL--FGKDDSELVMIGAV-------------------------------- 44 (168) Q Consensus 1 M~v~i~-~~~~~~~~~~~l~~l~~~~v~-VGi--~~~~g~~~a~iA~~-------------------------------- 44 (168) |.-+++ ...+|++|.+.|++|....-. +-- ....+..++.-|.- T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~ 80 (179) T protein:vir:18 1 MADSVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGD 80 (179) T ss_pred CCceEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccccc Confidence 542222 123477777777776531100 000 00001111111110 Q ss_pred ------hhcCceecc-------Cccccccc----------cccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 45 ------HEYGAEIPV-------TPKMRAWF----------AANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKM 101 (168) Q Consensus 45 ------~E~G~~i~~-------~~~~~~~~----------~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~ 101 (168) .+.|+.... .+...... ....+..|.++.+.+.||||||||++++++++..+.+... T Consensus 81 ~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~~~a~~~i~~~ 160 (179) T protein:vir:18 81 LAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVDNDVINVFSTE 160 (179) T ss_pred eeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccchhhHHhhHHHHHHHHHHH Confidence 011110000 00000000 0012234667778899999999999999887766666555 Q ss_pred HHHHHhccCcHHHHHHHHHHHHHHH Q lcl|NC_019544. 102 VPDVIEGNVNPRLFMDAIGMEFAGL 126 (168) Q Consensus 102 ~~~~l~G~~~~~~~l~~iG~~~~~~ 126 (168) +.+.| +.+|.+.+...... T Consensus 161 l~~~i------~k~lk~~~~~~~~~ 179 (179) T protein:vir:18 161 MGKAI------DRAIRLAMKKGTTA 179 (179) T ss_pred HHHHH------HHHHHhhcccCCCC Confidence 54422 22332222222211 No 36 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=97.82 E-value=7.1e-08 Score=59.79 Aligned_cols=80 Identities=16% Similarity=0.303 Sum_probs=49.2 Q ss_pred CcceecccchHHHHHHHHHH-hhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCC Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQ-LGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPE 79 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~-l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~ 79 (168) ..-...++. .|...|.. .....|.||.. ..||++|+||+.+...+ +.++||+ T Consensus 76 ~~~~L~~tg---~L~~Si~~~~~~~~v~vGt~-------~~yA~vHqfG~~~~~~~-----------------~~~~iPa 128 (156) T protein:vir:19 76 PGSILTLHG---DLARSITTDYGQDYALIGSP-------KIYAAIHQWGGTPDMAP-----------------RPAGVPA 128 (156) T ss_pred CCcchhhhH---HHHHHhhheecCCEEEEecc-------hhhhHHhhcCcccccCC-----------------CccccCC Confidence 111112212 23334433 34567888752 46899999999876433 3578999 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019544. 80 RSWLRSGYDENIDKIAKKIEKMVPDVIEG 108 (168) Q Consensus 80 RpFlr~~~~~~~~~~~~~~~~~~~~~l~G 108 (168) ||||--+ ++.++++.+.+...+..++.= T Consensus 129 RpfLG~s-~~d~~~I~~~i~~~l~~~~~~ 156 (156) T protein:vir:19 129 RPYMGLD-KTGEQEIFDAIRKRVSAALRQ 156 (156) T ss_pred ccccCCC-HHHHHHHHHHHHHHHHHHhhC Confidence 9999533 355677777777777766653 No 37 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=97.80 E-value=6e-08 Score=60.16 Aligned_cols=84 Identities=15% Similarity=0.164 Sum_probs=47.8 Q ss_pred CcceecccchHHHH--HHHHHH-hhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceecc Q lcl|NC_019544. 1 MKVTIKDTNNIDKI--TRNLQQ-LGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKI 77 (168) Q Consensus 1 M~v~i~~~~~~~~~--~~~l~~-l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~I 77 (168) ++-.......+.++ -..|.. .....+.||+.+ + ...||++|.||+++++.++ .+.++| T Consensus 63 ~~~g~~~~~~~~~l~~~~~l~~~~~~~~~~v~~~G---t-n~~yAaiHQfG~~~r~~~~---------------~~~v~i 123 (149) T protein:vir:18 63 SKKGRIKREMFAKLRTSRFMKAKGSDSAAVVEFTG---K-VQRMARVHQYGLKDRPNRN---------------SRDVQY 123 (149) T ss_pred hccCcccchhhhhhhhhhhhheeecCceeEEEecc---c-chhhhhhhhccccccccCC---------------Cccccc Confidence 22111111111111 111211 234568888763 3 4579999999998876543 357899 Q ss_pred CCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 78 PERSWLRSGYDENIDKIAKKIEKMVPD 104 (168) Q Consensus 78 P~RpFlr~~~~~~~~~~~~~~~~~~~~ 104 (168) |+||||--+ ++...++.+.+...+.+ T Consensus 124 PaRp~LG~s-~~d~~~I~~~i~~~l~~ 149 (149) T protein:vir:18 124 EARPLLGFT-RDDEQMIEDVIISHLGK 149 (149) T ss_pred cccccCCCC-HHHHHHHHHHHHHHHhC Confidence 999999544 34456666666666665 No 38 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=97.80 E-value=1.3e-07 Score=58.33 Aligned_cols=107 Identities=16% Similarity=0.153 Sum_probs=50.4 Q ss_pred Cc-ceecccchHHHHHHHHHHhhCCe-EEEE---e---------------ecCCCchHHHHHhh----hhcCceecc--- Q lcl|NC_019544. 1 MK-VTIKDTNNIDKITRNLQQLGGKQ-IKVG---L---------------FGKDDSELVMIGAV----HEYGAEIPV--- 53 (168) Q Consensus 1 M~-v~i~~~~~~~~~~~~l~~l~~~~-v~VG---i---------------~~~~g~~~a~iA~~----~E~G~~i~~--- 53 (168) |. ++++ ++++|++.|+.|.... -++. + |.++|.--..|-.- .+....+.+ T Consensus 1 M~~~~i~---Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~vg~~ 77 (140) T protein:vir:14 1 MSSIQII---GLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVR 77 (140) T ss_pred Cceeeeh---hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccceeEEeeee Confidence 66 4444 4777777777765321 1111 0 11122100000000 000000000 Q ss_pred -Cccccccc-cccchhhhcccceeccCCCchhHHHHHHHHHHHHHHH----HHHHHHHHhccC Q lcl|NC_019544. 54 -TPKMRAWF-AANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKI----EKMVPDVIEGNV 110 (168) Q Consensus 54 -~~~~~~~~-~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~----~~~~~~~l~G~~ 110 (168) ..+..... ....+..|.++.+.+.||||||+|+++++++++.+.+ ++.+.+++.|.. T Consensus 78 ~~~~~~~~~~~~~~y~~f~E~GT~~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:14 78 VRTKGKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred eccccccCCCCccceeeeeccccCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 00000000 0012334566778899999999999999887766555 555666777765 No 39 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=97.77 E-value=1.5e-07 Score=57.98 Aligned_cols=107 Identities=18% Similarity=0.187 Sum_probs=49.8 Q ss_pred Cc-ceecccchHHHHHHHHHHhhCCeE-EE-------Ee-----------ecCCCchHHHHHhhh----hcCcee----c Q lcl|NC_019544. 1 MK-VTIKDTNNIDKITRNLQQLGGKQI-KV-------GL-----------FGKDDSELVMIGAVH----EYGAEI----P 52 (168) Q Consensus 1 M~-v~i~~~~~~~~~~~~l~~l~~~~v-~V-------Gi-----------~~~~g~~~a~iA~~~----E~G~~i----~ 52 (168) |. ++++ ++++|++.|+.|..... ++ |- |.++|.--..|-.-- +.+... . T Consensus 1 Ma~~~i~---Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~~g~~ 77 (140) T protein:vir:10 1 MSSIQII---GLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVR 77 (140) T ss_pred Cceeeeh---hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccccccceEEeeee Confidence 76 4444 46777777766643110 00 00 111221000000000 000000 0 Q ss_pred cCccccccc-cccchhhhcccceeccCCCchhHHHHHHHHHHHHHHH----HHHHHHHHhccC Q lcl|NC_019544. 53 VTPKMRAWF-AANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKI----EKMVPDVIEGNV 110 (168) Q Consensus 53 ~~~~~~~~~-~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~----~~~~~~~l~G~~ 110 (168) ...+..... ..-.+..|.++.+.+.||+|||+|+++++++++.+.+ ++.+.+++.|.- T Consensus 78 ~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 78 VRTKGKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred eccccccCCCCccceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 000000000 0012344566677899999999999999997776655 555666777765 No 40 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=97.76 E-value=4.7e-08 Score=60.76 Aligned_cols=112 Identities=11% Similarity=0.230 Sum_probs=55.5 Q ss_pred CcceecccchHHHHHHHHH---------Hh-------hC---CeEEEEeecCCC--------------c-------hHHH Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQ---------QL-------GG---KQIKVGLFGKDD--------------S-------ELVM 40 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~---------~l-------~~---~~v~VGi~~~~g--------------~-------~~a~ 40 (168) |++++++-+.|.+-++++. .+ .. ...+-.+|-++| . ..+. T Consensus 2 ~~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~~~~~~g~V~~~~~ 81 (182) T protein:vir:10 2 IEVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVDGDEVIGRWWNSSM 81 (182) T ss_pred eEEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeecCCeEEEEeecCCC Confidence 7788876554332222110 00 00 000111121111 1 2267 Q ss_pred HHhhhhcCceeccC--------------ccccccccccch--------hhh--cc-----cceeccCCCchhHHHHHHHH Q lcl|NC_019544. 41 IGAVHEYGAEIPVT--------------PKMRAWFAANGY--------PLR--KE-----TTVIKIPERSWLRSGYDENI 91 (168) Q Consensus 41 iA~~~E~G~~i~~~--------------~~~~~~~~~~g~--------~~~--~~-----~~~i~IP~RpFlr~~~~~~~ 91 (168) +|.++|||+..... .+..|+...... .++ .. ..+-..||||||+|++++++ T Consensus 82 ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~pA~~~~~ 161 (182) T protein:vir:10 82 VAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMTPAANKMA 161 (182) T ss_pred ccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchHHHHHHhH Confidence 99999999853211 111122111110 000 00 12347899999999999999 Q ss_pred HHHHHHHHHHHHHHHhccCcH Q lcl|NC_019544. 92 DKIAKKIEKMVPDVIEGNVNP 112 (168) Q Consensus 92 ~~~~~~~~~~~~~~l~G~~~~ 112 (168) +++.+.+++.+.+++.-.+-. T Consensus 162 ~~i~~~i~~~i~~~l~~~~g~ 182 (182) T protein:vir:10 162 KEAPEIIKRSIDQELHDKLGG 182 (182) T ss_pred HHHHHHHHHHHHHHHHHhhcC Confidence 999888887776655432221 No 41 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=97.74 E-value=6.6e-08 Score=59.96 Aligned_cols=86 Identities=21% Similarity=0.282 Sum_probs=47.1 Q ss_pred CcceecccchHHHHHHH--HH-HhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceecc Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRN--LQ-QLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKI 77 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~--l~-~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~I 77 (168) -+-.++....+.++..+ |. +.....+.|||.+. ...||++|.||.++++..+ ....++| T Consensus 64 ~k~~~~~~~m~~~L~~a~~l~~~a~~~~~~Vg~~Gt----~~~yAaiHQfG~~~r~~~~--------------~~~~v~i 125 (152) T protein:vir:10 64 VKSKIKSGKMFDKITQPRFMRLRLESEGVSLGYEGG----DAVIARIHQQGLIGRVRKD--------------WDLKVKY 125 (152) T ss_pred hcccccchhHHHhhhhcceeeeeecCcEEEEEecCC----chhhhhhhccCccccccCC--------------CCcceec Confidence 11111111112222211 11 23456789999743 4589999999998765432 1336799 Q ss_pred CCCchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 78 PERSWLRSGYDENIDKIAKKIEKMVPDV 105 (168) Q Consensus 78 P~RpFlr~~~~~~~~~~~~~~~~~~~~~ 105 (168) |+||||--+- +...++.+.+...+..+ T Consensus 126 PaRp~LG~s~-~d~~~I~~~i~~~l~~a 152 (152) T protein:vir:10 126 ASRELLGFTD-DDLQMIEDYMINILAGS 152 (152) T ss_pred cccccCCCCH-HHHHHHHHHHHHHHhcC Confidence 9999995442 23355555555555443 No 42 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=97.69 E-value=2.4e-07 Score=56.85 Aligned_cols=107 Identities=15% Similarity=0.174 Sum_probs=49.9 Q ss_pred Cc-ceecccchHHHHHHHHHHhhCCe-EEE-------Ee-----------ecCCCchHHHHHh----hhhcCceecc--- Q lcl|NC_019544. 1 MK-VTIKDTNNIDKITRNLQQLGGKQ-IKV-------GL-----------FGKDDSELVMIGA----VHEYGAEIPV--- 53 (168) Q Consensus 1 M~-v~i~~~~~~~~~~~~l~~l~~~~-v~V-------Gi-----------~~~~g~~~a~iA~----~~E~G~~i~~--- 53 (168) |. ++++ ++++|++.|+.|.... -++ |- |.++|.--..|-. -.+++....+ T Consensus 1 Ma~~~i~---Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~~~~~ 77 (140) T protein:vir:80 1 MSSIQIV---GLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVR 77 (140) T ss_pred Cceeeeh---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceeeeeee Confidence 77 4444 4677777776664321 011 10 1112211000000 0000000000 Q ss_pred -Ccccc-ccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHH----HHHHHHHhccC Q lcl|NC_019544. 54 -TPKMR-AWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIE----KMVPDVIEGNV 110 (168) Q Consensus 54 -~~~~~-~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~----~~~~~~l~G~~ 110 (168) ..... .....-.+..|.++.+.+.||||||+|+++++++++.+.++ +.+.+++.|.. T Consensus 78 ~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:80 78 VRTKGKADSPSNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDQALGGRR 140 (140) T ss_pred cccccccCCCCCcceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 00000 00001123445666678999999999999999877766654 45566666665 No 43 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=97.62 E-value=1.2e-07 Score=58.54 Aligned_cols=78 Identities=17% Similarity=0.323 Sum_probs=50.4 Q ss_pred Ccceec-ccchHHHHHHHHHH-------hhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhccc Q lcl|NC_019544. 1 MKVTIK-DTNNIDKITRNLQQ-------LGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKET 72 (168) Q Consensus 1 M~v~i~-~~~~~~~~~~~l~~-------l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~ 72 (168) |+-... +...-..+.+.+.. -....|.||+-. ..+.|+.+.|||+ T Consensus 42 ~k~~ap~~~~~tg~l~~~I~~~~~k~~~~g~~~v~Vg~~~----~~~~y~~f~E~GT----------------------- 94 (127) T protein:vir:12 42 QRSHVNRSDKKQPHMQDNITVSNVRESKDGVRFVAVGPNK----KVAYRGRFLEWGT----------------------- 94 (127) T ss_pred HHHhCCCCCCChhHHHHhhhccccccccCceeEEEEeeCC----CCcceeeeeccCc----------------------- Confidence 221111 10001123333321 122367778632 3467788889994 Q ss_pred ceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019544. 73 TVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIE 107 (168) Q Consensus 73 ~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~ 107 (168) .+.||||||+++++++++++.+.+.+.+.+.+. T Consensus 95 --~~~~a~Pf~~pa~~~~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 95 --SKMPPQPFIEKGGKEGEGPAVELMERILTAPIK 127 (127) T ss_pred --cCCCCCccchHhHHHHHHHHHHHHHHHHHHhcC Confidence 578999999999999999999999999999887 No 44 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=97.62 E-value=8.6e-08 Score=59.31 Aligned_cols=85 Identities=20% Similarity=0.226 Sum_probs=47.6 Q ss_pred CcceecccchHHH--HHHHH-HHhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceecc Q lcl|NC_019544. 1 MKVTIKDTNNIDK--ITRNL-QQLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKI 77 (168) Q Consensus 1 M~v~i~~~~~~~~--~~~~l-~~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~I 77 (168) .+-.......+.+ +...| -......+.|||..+ ..+.||++|.||.++++.++ .+.++| T Consensus 63 ~k~~~~~~~l~~~~~l~~sl~~~~~~~~a~vg~~~G---t~~~yAaiHQfG~~~~~~~~---------------~~~~~i 124 (150) T protein:vir:60 63 KKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGG---KSPKIASVHQFGLSEENRKD---------------GKKIDY 124 (150) T ss_pred HhhcCCCccchhhhhhcceeeeeeeCcEEEEEeeCC---CchhhhhhhhccccccccCC---------------CCceec Confidence 1100000000111 11112 123456788888643 24689999999998765442 357899 Q ss_pred CCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 78 PERSWLRSGYDENIDKIAKKIEKMVPD 104 (168) Q Consensus 78 P~RpFlr~~~~~~~~~~~~~~~~~~~~ 104 (168) |+||||--+- +..+++.+.+...+.+ T Consensus 125 PaRp~LG~s~-~d~~~i~~~i~~~l~r 150 (150) T protein:vir:60 125 PARPLLGFTG-EDVQMIEEIILAHLDR 150 (150) T ss_pred CCcccCCCCH-HHHHHHHHHHHHHHhC Confidence 9999995543 3345666666666655 No 45 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=97.61 E-value=3.2e-07 Score=56.16 Aligned_cols=104 Identities=20% Similarity=0.309 Sum_probs=52.7 Q ss_pred ceecccchHHHHHHHHHHhhC---------------------------------CeEEEEeecCCC------chHHHHHh Q lcl|NC_019544. 3 VTIKDTNNIDKITRNLQQLGG---------------------------------KQIKVGLFGKDD------SELVMIGA 43 (168) Q Consensus 3 v~i~~~~~~~~~~~~l~~l~~---------------------------------~~v~VGi~~~~g------~~~a~iA~ 43 (168) |++++ +++|++.|+.|.. .++++-...+.| ..-+.||. T Consensus 1 i~i~G---ld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~~~~v~~~~~Ya~ 77 (173) T protein:vir:10 1 MAVKG---VAEVIAELRKIGKDIDKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKAKDLISKKITVNELYGA 77 (173) T ss_pred Ccchh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeeccCceeEEeeCCCcccch Confidence 33332 3333333333211 122222111111 13478999 Q ss_pred hhhcCceec-cCcccccccccc-----------------------------chhhhcccceeccCCCchhHHHHHHHHHH Q lcl|NC_019544. 44 VHEYGAEIP-VTPKMRAWFAAN-----------------------------GYPLRKETTVIKIPERSWLRSGYDENIDK 93 (168) Q Consensus 44 ~~E~G~~i~-~~~~~~~~~~~~-----------------------------g~~~~~~~~~i~IP~RpFlr~~~~~~~~~ 93 (168) +.|||+... ..|++......+ +...+....+-..||||||+|++++++++ T Consensus 78 fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqPFl~PA~~~~~~~ 157 (173) T protein:vir:10 78 YMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQPFLYPAWIEGKKQ 157 (173) T ss_pred hhhcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEeecCCCCCCccchhHHHHhHHH Confidence 999998641 112211110000 11112222334689999999999999999 Q ss_pred HHHHHHHHHHHHHhcc Q lcl|NC_019544. 94 IAKKIEKMVPDVIEGN 109 (168) Q Consensus 94 ~~~~~~~~~~~~l~G~ 109 (168) +.+.+++.+.+.+.-= T Consensus 158 ~~~~i~~~i~~~lrk~ 173 (173) T protein:vir:10 158 YLKDLENLLKTYNKKI 173 (173) T ss_pred HHHHHHHHHHHHhhcC Confidence 8888888777654421 No 46 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=97.60 E-value=2.8e-07 Score=56.52 Aligned_cols=99 Identities=14% Similarity=0.189 Sum_probs=51.8 Q ss_pred CcceecccchHHHHHHHHHHhhC---------------------------------CeEEEEeecCCC-----chHHHHH Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGG---------------------------------KQIKVGLFGKDD-----SELVMIG 42 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~---------------------------------~~v~VGi~~~~g-----~~~a~iA 42 (168) |++++.. ..++++.+.|+++.. .++.+-+- .+| ...+.|| T Consensus 4 ms~~i~~-~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~-~~g~~~~V~~~~~YA 81 (144) T protein:vir:59 4 MSVRIDP-SWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYK-NNGLTAEITVGAEYA 81 (144) T ss_pred ceeeehh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEee-cCcEEEEEecCCCcc Confidence 6665531 122222222111110 12222221 122 1347899 Q ss_pred hhhhcCceeccC-c---cccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 43 AVHEYGAEIPVT-P---KMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVP 103 (168) Q Consensus 43 ~~~E~G~~i~~~-~---~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 103 (168) .+.|||+..... + +..+.........+. .+..+||||||+++++.+++.+.+.+++.+. T Consensus 82 ~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~--~t~g~~a~Pfl~pA~~~~~~~~~~~i~~~~g 144 (144) T protein:vir:59 82 IYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYV--RTQGAPAQPFFWPAVEEGGEYFEREMRRLRG 144 (144) T ss_pred chhhcCccccccCCCcccccccccccccccee--cCCCCCCCcchhHHHHHHHHHHHHHHHHhcC Confidence 999999854321 1 111111111111111 2347999999999999999999998888875 No 47 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=97.60 E-value=1e-07 Score=58.88 Aligned_cols=85 Identities=20% Similarity=0.235 Sum_probs=47.9 Q ss_pred CcceecccchHHH--HHHHH-HHhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceecc Q lcl|NC_019544. 1 MKVTIKDTNNIDK--ITRNL-QQLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKI 77 (168) Q Consensus 1 M~v~i~~~~~~~~--~~~~l-~~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~I 77 (168) .+-.......+.+ +...| -..+...+.|||..+ ....||++|.||.++++.++ .+.++| T Consensus 63 ~k~~~~~~~l~~~~~l~~sl~~~~~~~~a~vg~~~G---~~~~yAaiHQfG~~~r~~~~---------------~~~~~i 124 (150) T protein:vir:57 63 KKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGG---KSPKIASVHQFGLSEETRKD---------------GKKIDY 124 (150) T ss_pred HhccCCCcccchhhhhccceeeeeeCcEEEEEeecC---CchhhhhhhhccccccccCC---------------Cceeec Confidence 1110000000111 11112 123456788888643 25689999999998765442 357899 Q ss_pred CCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 78 PERSWLRSGYDENIDKIAKKIEKMVPD 104 (168) Q Consensus 78 P~RpFlr~~~~~~~~~~~~~~~~~~~~ 104 (168) |+||||--+- +...++.+.+...+.+ T Consensus 125 PaRp~LG~s~-~d~~~i~~~i~~~l~r 150 (150) T protein:vir:57 125 PARPLLGFTG-EDVQMIEEIILAHLDR 150 (150) T ss_pred CCcccCCCCH-HHHHHHHHHHHHHHhC Confidence 9999995442 3346666666666655 No 48 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=97.59 E-value=3.6e-07 Score=55.91 Aligned_cols=110 Identities=14% Similarity=0.112 Sum_probs=54.9 Q ss_pred CcceecccchHHHHHHHHHHhhCCeE-EE--EeecCCCchHHHHHh----hhh---cCc---eeccC----cccccc--- Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQI-KV--GLFGKDDSELVMIGA----VHE---YGA---EIPVT----PKMRAW--- 60 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v-~V--Gi~~~~g~~~a~iA~----~~E---~G~---~i~~~----~~~~~~--- 60 (168) |+++++ ..++++|++.|++|....- ++ ......+..+..-+- +.+ .|. .|.+. .....+ T Consensus 1 M~~~~~-i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~~~~~~~v~v 79 (135) T protein:vir:57 1 MIPEIE-ISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRGKAGSTVVVL 79 (135) T ss_pred Cceeee-ehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhcccccccccccceeEEE Confidence 888877 3568888888888754210 11 111000000000000 000 000 01000 000000 Q ss_pred ---cccc--chhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHH Q lcl|NC_019544. 61 ---FAAN--GYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGM 121 (168) Q Consensus 61 ---~~~~--g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~ 121 (168) .... .+..|.++++.+.||||||+|+++++++++.+.+.+.+.+. |++++. T Consensus 80 ~vg~~~~~~~~~~f~E~GT~~~~a~PF~~pa~~~~~~~~~~~~~~~~~~~----------l~ka~r 135 (135) T protein:vir:57 80 RVGPTRSHYMKALAQEFGTIKQVAKPFIRPALDYNKMQVLRILTVEIRDG----------LSTLSR 135 (135) T ss_pred EecCCCCcceeEeecccCCCCCCCCcchhHhHHHhHHHHHHHHHHHHHHH----------HHHhcC Confidence 0011 12345578889999999999999999998888777766553 333333 No 49 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=97.54 E-value=1.8e-07 Score=57.51 Aligned_cols=85 Identities=19% Similarity=0.219 Sum_probs=49.0 Q ss_pred CcceecccchHH--HHHHHHH-HhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceecc Q lcl|NC_019544. 1 MKVTIKDTNNID--KITRNLQ-QLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKI 77 (168) Q Consensus 1 M~v~i~~~~~~~--~~~~~l~-~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~I 77 (168) .+-.......+. .+...|. +.+...+.|||..+ ..+.||++|.||.++++.++ .+.++| T Consensus 63 ~k~g~~~~~l~~~~~l~~sl~~~~~~~~~~vg~~~G---s~~~yAa~HQfG~~~~~~~~---------------~~~~~i 124 (150) T protein:vir:20 63 KKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGG---KSPKIASVHQFGLSEENRKD---------------GKKIDY 124 (150) T ss_pred HhccCCCccccchhhhhhhhheeecCcEEEEEeeCC---cchhhhhhhhcccccccccC---------------CCceec Confidence 110000000000 1222332 23556899998643 24579999999998765442 357899 Q ss_pred CCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 78 PERSWLRSGYDENIDKIAKKIEKMVPD 104 (168) Q Consensus 78 P~RpFlr~~~~~~~~~~~~~~~~~~~~ 104 (168) |+||||--+- +..+++.+.+...+.+ T Consensus 125 PaRp~LG~s~-~d~~~i~~~i~~~l~k 150 (150) T protein:vir:20 125 PARPLLGFTG-EDVQMIEEIILAHLER 150 (150) T ss_pred cccccCCCCH-HHHHHHHHHHHHHHhC Confidence 9999995442 3346666666666665 No 50 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=97.54 E-value=7.2e-07 Score=54.26 Aligned_cols=79 Identities=9% Similarity=0.177 Sum_probs=58.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCChHHHHHhcCC--CCcchhHHH Q lcl|NC_019544. 87 YDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL------KDPPNSQMTIERKGS--DNPLIDTGR 158 (168) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~------~~ppnsp~Ti~~KG~--~~PLiDTG~ 158 (168) +++ -.++...+...+.++ . ..+....|..||..+....++.|.+. .|+|+++.|+++|+. .+||+++|. T Consensus 1 m~d-~~~l~~~L~~ll~~L-~-~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~g~ 77 (149) T protein:vir:98 1 MSE-LTALQERLTGLIASL-S-PAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGRIRREMFARLR 77 (149) T ss_pred Cch-HHHHHHHHHHHHHhc-C-chhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCCCCcccchhhh Confidence 222 123333333333322 1 12456789999999999999999885 488999999998874 589999999 Q ss_pred HHhHhccccC Q lcl|NC_019544. 159 LVGSIRHTVE 168 (168) Q Consensus 159 L~~SIty~V~ 168 (168) |.+||+|.+. T Consensus 78 l~~sl~~~~~ 87 (149) T protein:vir:98 78 TNRFMKAKGS 87 (149) T ss_pred hhhhhhheec Confidence 9999999988 No 51 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=97.52 E-value=2.5e-07 Score=56.81 Aligned_cols=99 Identities=16% Similarity=0.167 Sum_probs=48.3 Q ss_pred Cc-ceecccchHHHHHHHHHHhhCC-eEEEEeecCCCchHHHHHhhhhcCceec-------------cCccccccccccc Q lcl|NC_019544. 1 MK-VTIKDTNNIDKITRNLQQLGGK-QIKVGLFGKDDSELVMIGAVHEYGAEIP-------------VTPKMRAWFAANG 65 (168) Q Consensus 1 M~-v~i~~~~~~~~~~~~l~~l~~~-~v~VGi~~~~g~~~a~iA~~~E~G~~i~-------------~~~~~~~~~~~~g 65 (168) |. |++ .++++|++.|+++.+. .|+=-+ ...+..++.-+. .+-....+ ...+......... T Consensus 1 Ma~i~~---~Gld~l~~~L~~~~~~~~v~~~~-~~~~~~~~~~~~-~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~ 75 (114) T protein:vir:27 1 MATIEF---EGLDEMAQSLLKNASPEKRSKVL-RKYGSKLKEAAV-NRAQFNKGYSTGATRRSITLQVESDKATVEALTS 75 (114) T ss_pred Ceeeee---ehHHHHHHHHHHhcCHHHHHHHH-HHHHHHHHHHHH-HhcccCCCCCchhhhhceeeeecCCeeEecCCCC Confidence 66 333 3577888887766431 110000 001111111100 00000000 0000000001112 Q ss_pred hhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 66 YPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPD 104 (168) Q Consensus 66 ~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 104 (168) +..+.++++...||||||||+++.++.++.+.+++.++- T Consensus 76 Ya~~vEfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 76 YSGYLEVGTRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred ccceecccccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 445566667889999999999999999988888777764 No 52 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=97.52 E-value=2.5e-07 Score=56.81 Aligned_cols=99 Identities=16% Similarity=0.167 Sum_probs=48.3 Q ss_pred Cc-ceecccchHHHHHHHHHHhhCC-eEEEEeecCCCchHHHHHhhhhcCceec-------------cCccccccccccc Q lcl|NC_019544. 1 MK-VTIKDTNNIDKITRNLQQLGGK-QIKVGLFGKDDSELVMIGAVHEYGAEIP-------------VTPKMRAWFAANG 65 (168) Q Consensus 1 M~-v~i~~~~~~~~~~~~l~~l~~~-~v~VGi~~~~g~~~a~iA~~~E~G~~i~-------------~~~~~~~~~~~~g 65 (168) |. |++ .++++|++.|+++.+. .|+=-+ ...+..++.-+. .+-....+ ...+......... T Consensus 1 Ma~i~~---~Gld~l~~~L~~~~~~~~v~~~~-~~~~~~~~~~~~-~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~ 75 (114) T protein:vir:49 1 MATIEF---EGLDEMAQSLLKNASPEKRSKVL-RKYGSKLKEAAV-NRAQFNKGYSTGATRRSITLQVESDKATVEALTS 75 (114) T ss_pred Ceeeee---ehHHHHHHHHHHhcCHHHHHHHH-HHHHHHHHHHHH-HhcccCCCCCchhhhhceeeeecCCeeEecCCCC Confidence 66 333 3577888887766431 110000 001111111100 00000000 0000000001112 Q ss_pred hhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 66 YPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPD 104 (168) Q Consensus 66 ~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 104 (168) +..+.++++...||||||||+++.++.++.+.+++.++- T Consensus 76 Ya~~vEfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 76 YSGYLEVGTRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred ccceecccccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 445566667889999999999999999988888777764 No 53 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=97.49 E-value=4.5e-07 Score=55.38 Aligned_cols=82 Identities=16% Similarity=0.282 Sum_probs=45.3 Q ss_pred Ccceec-ccchHHHHHHHHHH------hhCCeEEEEeecC-----CCchHHHHHhhhhcCceeccCccccccccccchhh Q lcl|NC_019544. 1 MKVTIK-DTNNIDKITRNLQQ------LGGKQIKVGLFGK-----DDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPL 68 (168) Q Consensus 1 M~v~i~-~~~~~~~~~~~l~~------l~~~~v~VGi~~~-----~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~ 68 (168) ++-..- ++..+ .+.+.. -....+.+|+... .+..-+.|+.+.|||+ T Consensus 43 ak~~ap~~tG~l---~~sI~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT------------------- 100 (140) T protein:vir:10 43 ARARAPKKTGKL---KRNIVTAALKQKDSPGIATAGVRVRTKGKADSPNNAFYWRFVELGT------------------- 100 (140) T ss_pred HHHhCCCChhhH---HHhceecccccccccceeEEeeccccccccCCCCcccccceeccCc------------------- Confidence 111110 11111 111110 0112345554321 1233466777788884 Q ss_pred hcccceeccCCCchhHHHHHHHHHHHHHHH----HHHHHHHHhccC Q lcl|NC_019544. 69 RKETTVIKIPERSWLRSGYDENIDKIAKKI----EKMVPDVIEGNV 110 (168) Q Consensus 69 ~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~----~~~~~~~l~G~~ 110 (168) .+.||+|||||+++++++++.+.+ ++.+.+++.|++ T Consensus 101 ------~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 101 ------QFMKAEPFMRPAFDASIAQAEGAIRTEIARAIDQVVGGGL 140 (140) T ss_pred ------CCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 578999999999999887766555 555577888887 No 54 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=97.45 E-value=3.4e-07 Score=56.02 Aligned_cols=83 Identities=12% Similarity=0.181 Sum_probs=46.4 Q ss_pred CcceecccchHHHHHHHHHH-hhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCC Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQ-LGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPE 79 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~-l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~ 79 (168) ..-++-..+ -.|..+|.. .....|.||- ...||++|+||+.+.. .+.++||+ T Consensus 71 ~~~~~L~~t--G~L~~Si~~~~~~~~v~vGt-------n~~YA~iHqfGg~~~~------------------~~~~~iPA 123 (155) T protein:vir:10 71 GAHPILQVT--NALARSITTRADRDQAQIGS-------NLSYAAIQQLGGQAGR------------------GRKVTIPA 123 (155) T ss_pred CCCCccccc--hhhhhhhhceecCCEEEEec-------CcchhhhhhcccccCC------------------CCccccCC Confidence 111111111 124444433 2456788874 2458999999987532 24579999 Q ss_pred CchhHHH-HHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019544. 80 RSWLRSG-YDENIDKIAKKIEKMVPDVIEGNV 110 (168) Q Consensus 80 RpFlr~~-~~~~~~~~~~~~~~~~~~~l~G~~ 110 (168) ||||--. -++-+.++.+.+...+.+.+.-+. T Consensus 124 RPfLG~s~~~e~~~ei~~~I~~~i~~~l~~~r 155 (155) T protein:vir:10 124 RPYLPVLRNGQLKPSARDAVLDVLLAALSQGR 155 (155) T ss_pred ccccCCCccccchHHHHHHHHHHHHHHHhhcC Confidence 9999422 122345666777777776664333 No 55 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=97.44 E-value=5.4e-07 Score=54.95 Aligned_cols=82 Identities=18% Similarity=0.305 Sum_probs=44.9 Q ss_pred Ccce--ecccchHHHHHHHHHHhhC------------------------------------------------------- Q lcl|NC_019544. 1 MKVT--IKDTNNIDKITRNLQQLGG------------------------------------------------------- 23 (168) Q Consensus 1 M~v~--i~~~~~~~~~~~~l~~l~~------------------------------------------------------- 23 (168) |+.. |+.++ +.+.++|++|.. T Consensus 1 Ms~~i~i~~d~--~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~ 78 (175) T protein:vir:79 1 MSDFVNFQIDD--SALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGE 78 (175) T ss_pred CceEEEEEech--HHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhcccccccccccc Confidence 7643 33111 223333322211 Q ss_pred -------------------------------CeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhccc Q lcl|NC_019544. 24 -------------------------------KQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKET 72 (168) Q Consensus 24 -------------------------------~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~ 72 (168) ..|.| |++ ..||++|+||+.+. .. T Consensus 79 ~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~v------Gtn-~~YAaiHqfGg~~~------------------~~ 133 (175) T protein:vir:79 79 LTAAASRRKAGLMILQDSGQMAASTATDSGEDYSVI------GSN-KEYAAIQHFGGQAG------------------RG 133 (175) T ss_pred chhhHhhhccCCCcceechhhhhhhhheecCCEEEE------ecC-cchhhHhhcccccC------------------CC Confidence 11111 222 35799999997532 12 Q ss_pred ceeccCCCchhHHHHHHH-----HHHHHHHHHHHHHHHHhcc Q lcl|NC_019544. 73 TVIKIPERSWLRSGYDEN-----IDKIAKKIEKMVPDVIEGN 109 (168) Q Consensus 73 ~~i~IP~RpFlr~~~~~~-----~~~~~~~~~~~~~~~l~G~ 109 (168) +.++||+||||--+-++. .+.+.+.+...+..++.++ T Consensus 134 ~~v~IPARPfLG~s~~de~~~~~~~~I~~~i~~~l~~a~~~~ 175 (175) T protein:vir:79 134 LKVTIPGRAWLPVTADGELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred cccccCcccccCCCcccchhHHHHHHHHHHHHHHHHHHhccC Confidence 467999999995433322 4667777777777777777 No 56 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=97.44 E-value=1.6e-07 Score=57.81 Aligned_cols=84 Identities=17% Similarity=0.344 Sum_probs=47.0 Q ss_pred Cc-------ceecc-cchH-HH-HHHHH------H-HhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccc Q lcl|NC_019544. 1 MK-------VTIKD-TNNI-DK-ITRNL------Q-QLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAA 63 (168) Q Consensus 1 M~-------v~i~~-~~~~-~~-~~~~l------~-~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~ 63 (168) -. .+.+. ...+ .. +...| + ..+...+.|||. |+ .+.||++|.||.++++.++ T Consensus 55 ~prk~~~~~~~~~~~~g~~~~~~m~~~l~~a~~l~~~~~~d~a~Vg~~---Gs-~~~yAaiHQfG~~~r~~~~------- 123 (155) T protein:vir:79 55 EPRKVKAGGKRLREKAGRVKREAMFRKLRTARYLRIDVDSTGLAIGFD---ER-LSRIARVHQEGQKAPVEPG------- 123 (155) T ss_pred cccchhhhhhhhhcccCcccchhhhhhhhhhheeeeeecCcEEEEEec---Cc-chhhhhhhhcCCcccCCCC------- Confidence 00 00010 0001 00 11222 1 134467888874 33 5779999999998765432 Q ss_pred cchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 64 NGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPD 104 (168) Q Consensus 64 ~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 104 (168) .+.++||+||||--+- +..+++.+.+...+.+ T Consensus 124 --------~~~v~iPaRp~LGls~-~d~~~I~~~i~~~l~r 155 (155) T protein:vir:79 124 --------GPLAQYPVRVVLGFSD-ADRELVRDRLLRELTR 155 (155) T ss_pred --------CcccccccccccCCCH-HHHHHHHHHHHHHhhC Confidence 3578999999995443 3446666666666665 No 57 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=97.43 E-value=1.7e-07 Score=57.68 Aligned_cols=78 Identities=9% Similarity=0.106 Sum_probs=44.4 Q ss_pred Ccceec-----------------------ccchHHHHHHHH-----HHh---hCCeEEEEeecCCCchHHHHHhhhhcCc Q lcl|NC_019544. 1 MKVTIK-----------------------DTNNIDKITRNL-----QQL---GGKQIKVGLFGKDDSELVMIGAVHEYGA 49 (168) Q Consensus 1 M~v~i~-----------------------~~~~~~~~~~~l-----~~l---~~~~v~VGi~~~~g~~~a~iA~~~E~G~ 49 (168) |..... -...-..+.+.+ +.- ....|.||+..++ +.+|...||| T Consensus 17 l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~----~~~a~F~E~G- 91 (125) T protein:vir:81 17 AVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKGV----SHRIHATEFG- 91 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCCC----ceEEEeccCC- Confidence 111110 000000011111 000 1123555553322 2455566666 Q ss_pred eeccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019544. 50 EIPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIE 107 (168) Q Consensus 50 ~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~ 107 (168) +++.||+||+|+++++++++..+.+...++++.. T Consensus 92 ------------------------T~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:81 92 ------------------------TMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred ------------------------ccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 5789999999999999999999999999998876 No 58 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=97.43 E-value=1.7e-07 Score=57.68 Aligned_cols=78 Identities=9% Similarity=0.106 Sum_probs=44.4 Q ss_pred Ccceec-----------------------ccchHHHHHHHH-----HHh---hCCeEEEEeecCCCchHHHHHhhhhcCc Q lcl|NC_019544. 1 MKVTIK-----------------------DTNNIDKITRNL-----QQL---GGKQIKVGLFGKDDSELVMIGAVHEYGA 49 (168) Q Consensus 1 M~v~i~-----------------------~~~~~~~~~~~l-----~~l---~~~~v~VGi~~~~g~~~a~iA~~~E~G~ 49 (168) |..... -...-..+.+.+ +.- ....|.||+..++ +.+|...||| T Consensus 17 l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~----~~~a~F~E~G- 91 (125) T protein:vir:79 17 AVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKGV----SHRIHATEFG- 91 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCCC----ceEEEeccCC- Confidence 111110 000000011111 000 1123555553322 2455566666 Q ss_pred eeccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019544. 50 EIPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIE 107 (168) Q Consensus 50 ~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~ 107 (168) +++.||+||+|+++++++++..+.+...++++.. T Consensus 92 ------------------------T~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:79 92 ------------------------TMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred ------------------------ccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 5789999999999999999999999999998876 No 59 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=97.43 E-value=1.7e-07 Score=57.68 Aligned_cols=78 Identities=9% Similarity=0.106 Sum_probs=44.4 Q ss_pred Ccceec-----------------------ccchHHHHHHHH-----HHh---hCCeEEEEeecCCCchHHHHHhhhhcCc Q lcl|NC_019544. 1 MKVTIK-----------------------DTNNIDKITRNL-----QQL---GGKQIKVGLFGKDDSELVMIGAVHEYGA 49 (168) Q Consensus 1 M~v~i~-----------------------~~~~~~~~~~~l-----~~l---~~~~v~VGi~~~~g~~~a~iA~~~E~G~ 49 (168) |..... -...-..+.+.+ +.- ....|.||+..++ +.+|...||| T Consensus 17 l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~----~~~a~F~E~G- 91 (125) T protein:vir:47 17 AVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKGV----SHRIHATEFG- 91 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCCC----ceEEEeccCC- Confidence 111110 000000011111 000 1123555553322 2455566666 Q ss_pred eeccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019544. 50 EIPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIE 107 (168) Q Consensus 50 ~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~ 107 (168) +++.||+||+|+++++++++..+.+...++++.. T Consensus 92 ------------------------T~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:47 92 ------------------------TMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred ------------------------ccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 5789999999999999999999999999998876 No 60 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=97.43 E-value=1.7e-07 Score=57.68 Aligned_cols=78 Identities=9% Similarity=0.106 Sum_probs=44.4 Q ss_pred Ccceec-----------------------ccchHHHHHHHH-----HHh---hCCeEEEEeecCCCchHHHHHhhhhcCc Q lcl|NC_019544. 1 MKVTIK-----------------------DTNNIDKITRNL-----QQL---GGKQIKVGLFGKDDSELVMIGAVHEYGA 49 (168) Q Consensus 1 M~v~i~-----------------------~~~~~~~~~~~l-----~~l---~~~~v~VGi~~~~g~~~a~iA~~~E~G~ 49 (168) |..... -...-..+.+.+ +.- ....|.||+..++ +.+|...||| T Consensus 17 l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~----~~~a~F~E~G- 91 (125) T protein:vir:98 17 AVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKGV----SHRIHATEFG- 91 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCCC----ceEEEeccCC- Confidence 111110 000000011111 000 1123555553322 2455566666 Q ss_pred eeccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019544. 50 EIPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIE 107 (168) Q Consensus 50 ~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~ 107 (168) +++.||+||+|+++++++++..+.+...++++.. T Consensus 92 ------------------------T~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:98 92 ------------------------TMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred ------------------------ccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 5789999999999999999999999999998876 No 61 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=97.43 E-value=1.7e-07 Score=57.68 Aligned_cols=78 Identities=9% Similarity=0.106 Sum_probs=44.4 Q ss_pred Ccceec-----------------------ccchHHHHHHHH-----HHh---hCCeEEEEeecCCCchHHHHHhhhhcCc Q lcl|NC_019544. 1 MKVTIK-----------------------DTNNIDKITRNL-----QQL---GGKQIKVGLFGKDDSELVMIGAVHEYGA 49 (168) Q Consensus 1 M~v~i~-----------------------~~~~~~~~~~~l-----~~l---~~~~v~VGi~~~~g~~~a~iA~~~E~G~ 49 (168) |..... -...-..+.+.+ +.- ....|.||+..++ +.+|...||| T Consensus 17 l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k~~----~~~a~F~E~G- 91 (125) T protein:vir:94 17 AVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAKGV----SHRIHATEFG- 91 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCCCC----ceEEEeccCC- Confidence 111110 000000011111 000 1123555553322 2455566666 Q ss_pred eeccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019544. 50 EIPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIE 107 (168) Q Consensus 50 ~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~ 107 (168) +++.||+||+|+++++++++..+.+...++++.. T Consensus 92 ------------------------T~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:94 92 ------------------------TMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred ------------------------ccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 5789999999999999999999999999998876 No 62 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=97.41 E-value=4.3e-07 Score=55.51 Aligned_cols=82 Identities=20% Similarity=0.268 Sum_probs=46.6 Q ss_pred Ccc-ee--ccc----chHHHHHHHHHH-----hhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhh Q lcl|NC_019544. 1 MKV-TI--KDT----NNIDKITRNLQQ-----LGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPL 68 (168) Q Consensus 1 M~v-~i--~~~----~~~~~~~~~l~~-----l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~ 68 (168) .+- ++ +.. ..--.|..+|.. -.+..+.||- ...||++|+||+ T Consensus 52 Ls~st~a~k~~~~~L~~tG~L~~Si~~~~~~~~~~~~a~vGt-------n~~YA~~hqfG~------------------- 105 (145) T protein:vir:31 52 LKESTIRAKGSDTPLIDNSRLLTDINAASMMDRANRMAVIGT-------NLDYAEHHEFGA------------------- 105 (145) T ss_pred cChHHHHHhcCCCCCccCHHHHHHHHHHhhhcccCceeEecC-------CchhhhhhccCC------------------- Confidence 110 00 000 000134444432 1233455552 236899999996 Q ss_pred hcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHH Q lcl|NC_019544. 69 RKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPR 113 (168) Q Consensus 69 ~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~ 113 (168) ..++||+||||-.+....++++.+.+...+.+-|.|-. ++ T Consensus 106 ----~~~~IPaRPfLG~~~~~~~~~~~~ii~~~i~~~L~~~~-~~ 145 (145) T protein:vir:31 106 ----PEAGIPARPIFGPAGAYASQQAPDVIGDEIDTNLEGAV-ID 145 (145) T ss_pred ----cccccCCCCccCCCccchHHHHHHHHHHHHHHHhhhhc-cC Confidence 35789999999887776667777777777777666642 11 No 63 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=97.34 E-value=4.6e-07 Score=55.31 Aligned_cols=82 Identities=13% Similarity=0.249 Sum_probs=42.5 Q ss_pred Cccee--cc-cchH----HHHHHHHHHhhC--------------------------CeEEEEeecCCCchHHHHHhhhhc Q lcl|NC_019544. 1 MKVTI--KD-TNNI----DKITRNLQQLGG--------------------------KQIKVGLFGKDDSELVMIGAVHEY 47 (168) Q Consensus 1 M~v~i--~~-~~~~----~~~~~~l~~l~~--------------------------~~v~VGi~~~~g~~~a~iA~~~E~ 47 (168) |+-.+ +. ...+ .-+.+.++.... ..|.|++-.++ .-..++.+.|| T Consensus 19 L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~~~~~~v~vg~~~--~~~~y~~f~E~ 96 (133) T protein:vir:10 19 LGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQGNAVVTLRVGPSK--QHHMKVLAQEF 96 (133) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCccceEEEEecCCC--CccceEeeecc Confidence 22111 00 0011 112222222110 11112221111 11122333355 Q ss_pred CceeccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019544. 48 GAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGN 109 (168) Q Consensus 48 G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~ 109 (168) | +.+.||||||+|++++++++..+.+.+.+.+.|.-+ T Consensus 97 G-------------------------T~k~~a~PF~~pA~~~~~~~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 97 G-------------------------TVKQVADPFIRPALDYNVQTVLRVLTVEIRNGIQNR 133 (133) T ss_pred C-------------------------CCCCCCCccchHHHHHhHHHHHHHHHHHHHHHhhcC Confidence 5 568899999999999999999999999999988877 No 64 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=97.29 E-value=3.7e-07 Score=55.85 Aligned_cols=84 Identities=14% Similarity=0.174 Sum_probs=42.3 Q ss_pred Ccce-----ecc---cchHHH--HHHHHH-HhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhh Q lcl|NC_019544. 1 MKVT-----IKD---TNNIDK--ITRNLQ-QLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLR 69 (168) Q Consensus 1 M~v~-----i~~---~~~~~~--~~~~l~-~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~ 69 (168) -..+ .+. .+.+.. +...|. ......+.|||. |+ ...||++|.||.++++.++ T Consensus 54 ~p~s~~~~~~~g~~~~~~~~~l~~~~~l~~~~~~~~~~v~~~---Gt-~~~yAaiHQfG~~~r~~~~------------- 116 (148) T protein:vir:79 54 VPRKPQLRHRAGRIRRAMFMRLRLARYMKTQADANTAVVTFA---GN-AQRIATVHQFGLRDRVNKA------------- 116 (148) T ss_pred cccchHHHhhcccccccccchhhhhhheeeeeeCCeeeEEee---cc-chhhhhhhhcCccccccCC------------- Confidence 0000 000 000000 011121 123446788874 33 3689999999998765432 Q ss_pred cccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019544. 70 KETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEG 108 (168) Q Consensus 70 ~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G 108 (168) .+.++||+||||--+-+ ..+++.+.+...+ .| T Consensus 117 --~~~v~iPaRp~LG~s~~-d~~~i~~~i~~~l----~~ 148 (148) T protein:vir:79 117 --GLTAQYPARELLGMDGV-DMEHITNLLLLHL----GA 148 (148) T ss_pred --CCccccCcccccCCCHH-HHHHHHHHHHHHh----cC Confidence 35789999999954422 3344444433333 33 No 65 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=97.29 E-value=5.9e-07 Score=54.72 Aligned_cols=71 Identities=17% Similarity=0.189 Sum_probs=41.8 Q ss_pred cchHHHHHHHHHHhhCC---------------------------------eEEEEeecC----CCchHHHHHhhhhcCce Q lcl|NC_019544. 8 TNNIDKITRNLQQLGGK---------------------------------QIKVGLFGK----DDSELVMIGAVHEYGAE 50 (168) Q Consensus 8 ~~~~~~~~~~l~~l~~~---------------------------------~v~VGi~~~----~g~~~a~iA~~~E~G~~ 50 (168) ..++++|.+.|+++... ++.+-+.++ .+ +-+.||.+.||| T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~v~-~~~~Ya~~vE~G-- 77 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQQRLLHYRVV-SPALYSIYLELG-- 77 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeeecCcEEEEee-cCcccchhcccC-- Confidence 22334444433333210 111111100 01 124566666666 Q ss_pred eccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 51 IPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPD 104 (168) Q Consensus 51 i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 104 (168) +...|+||||+|+++.++.++.+.+++.+++ T Consensus 78 -----------------------T~~m~a~Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 78 -----------------------TRKMEAQSFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred -----------------------ccccCCCcchhhhHHHHHHHHHHHHHHHhcC Confidence 4578999999999999999999999998887 No 66 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=97.19 E-value=8e-07 Score=54.02 Aligned_cols=71 Identities=11% Similarity=0.192 Sum_probs=41.2 Q ss_pred Ccce----ecc--cchHHHHHHHHHHhh-----------------------CCeEEEEeecCCCchHHHHHhhhhcCcee Q lcl|NC_019544. 1 MKVT----IKD--TNNIDKITRNLQQLG-----------------------GKQIKVGLFGKDDSELVMIGAVHEYGAEI 51 (168) Q Consensus 1 M~v~----i~~--~~~~~~~~~~l~~l~-----------------------~~~v~VGi~~~~g~~~a~iA~~~E~G~~i 51 (168) ++-. ++. ......+.++.++.. +.++.||- -+.||.+.|||+ T Consensus 16 ~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~-------~~~Ya~~vE~GT-- 86 (115) T protein:vir:10 16 MKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITS-------HAAYSGFLEFGT-- 86 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeec-------Cccchhhhcccc-- Confidence 1100 000 011122333333322 11222221 245788888884 Q ss_pred ccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 52 PVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVP 103 (168) Q Consensus 52 ~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 103 (168) ...||||||+|+++.++.++.+.+++.++ T Consensus 87 -----------------------~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 87 -----------------------RYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -----------------------cccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 47899999999999999999999888887 No 67 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=97.19 E-value=8e-07 Score=54.02 Aligned_cols=71 Identities=11% Similarity=0.192 Sum_probs=41.2 Q ss_pred Ccce----ecc--cchHHHHHHHHHHhh-----------------------CCeEEEEeecCCCchHHHHHhhhhcCcee Q lcl|NC_019544. 1 MKVT----IKD--TNNIDKITRNLQQLG-----------------------GKQIKVGLFGKDDSELVMIGAVHEYGAEI 51 (168) Q Consensus 1 M~v~----i~~--~~~~~~~~~~l~~l~-----------------------~~~v~VGi~~~~g~~~a~iA~~~E~G~~i 51 (168) ++-. ++. ......+.++.++.. +.++.||- -+.||.+.|||+ T Consensus 16 ~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~-------~~~Ya~~vE~GT-- 86 (115) T protein:vir:96 16 MKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITS-------HAAYSGFLEFGT-- 86 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeec-------Cccchhhhcccc-- Confidence 1100 000 011122333333322 11222221 245788888884 Q ss_pred ccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 52 PVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVP 103 (168) Q Consensus 52 ~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 103 (168) ...||||||+|+++.++.++.+.+++.++ T Consensus 87 -----------------------~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 87 -----------------------RYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -----------------------cccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 47899999999999999999999888887 No 68 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=97.19 E-value=8e-07 Score=54.02 Aligned_cols=71 Identities=11% Similarity=0.192 Sum_probs=41.2 Q ss_pred Ccce----ecc--cchHHHHHHHHHHhh-----------------------CCeEEEEeecCCCchHHHHHhhhhcCcee Q lcl|NC_019544. 1 MKVT----IKD--TNNIDKITRNLQQLG-----------------------GKQIKVGLFGKDDSELVMIGAVHEYGAEI 51 (168) Q Consensus 1 M~v~----i~~--~~~~~~~~~~l~~l~-----------------------~~~v~VGi~~~~g~~~a~iA~~~E~G~~i 51 (168) ++-. ++. ......+.++.++.. +.++.||- -+.||.+.|||+ T Consensus 16 ~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~-------~~~Ya~~vE~GT-- 86 (115) T protein:vir:96 16 MKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITS-------HAAYSGFLEFGT-- 86 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeec-------Cccchhhhcccc-- Confidence 1100 000 011122333333322 11222221 245788888884 Q ss_pred ccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 52 PVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVP 103 (168) Q Consensus 52 ~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 103 (168) ...||||||+|+++.++.++.+.+++.++ T Consensus 87 -----------------------~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 87 -----------------------RYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -----------------------cccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 47899999999999999999999888887 No 69 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=97.19 E-value=8e-07 Score=54.02 Aligned_cols=71 Identities=11% Similarity=0.192 Sum_probs=41.2 Q ss_pred Ccce----ecc--cchHHHHHHHHHHhh-----------------------CCeEEEEeecCCCchHHHHHhhhhcCcee Q lcl|NC_019544. 1 MKVT----IKD--TNNIDKITRNLQQLG-----------------------GKQIKVGLFGKDDSELVMIGAVHEYGAEI 51 (168) Q Consensus 1 M~v~----i~~--~~~~~~~~~~l~~l~-----------------------~~~v~VGi~~~~g~~~a~iA~~~E~G~~i 51 (168) ++-. ++. ......+.++.++.. +.++.||- -+.||.+.|||+ T Consensus 16 ~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~-------~~~Ya~~vE~GT-- 86 (115) T protein:vir:78 16 MKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITS-------HAAYSGFLEFGT-- 86 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeec-------Cccchhhhcccc-- Confidence 1100 000 011122333333322 11222221 245788888884 Q ss_pred ccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 52 PVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVP 103 (168) Q Consensus 52 ~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 103 (168) ...||||||+|+++.++.++.+.+++.++ T Consensus 87 -----------------------~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 87 -----------------------RYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -----------------------cccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 47899999999999999999999888887 No 70 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=97.19 E-value=8e-07 Score=54.02 Aligned_cols=71 Identities=11% Similarity=0.192 Sum_probs=41.2 Q ss_pred Ccce----ecc--cchHHHHHHHHHHhh-----------------------CCeEEEEeecCCCchHHHHHhhhhcCcee Q lcl|NC_019544. 1 MKVT----IKD--TNNIDKITRNLQQLG-----------------------GKQIKVGLFGKDDSELVMIGAVHEYGAEI 51 (168) Q Consensus 1 M~v~----i~~--~~~~~~~~~~l~~l~-----------------------~~~v~VGi~~~~g~~~a~iA~~~E~G~~i 51 (168) ++-. ++. ......+.++.++.. +.++.||- -+.||.+.|||+ T Consensus 16 ~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~-------~~~Ya~~vE~GT-- 86 (115) T protein:vir:93 16 MKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITS-------HAAYSGFLEFGT-- 86 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeec-------Cccchhhhcccc-- Confidence 1100 000 011122333333322 11222221 245788888884 Q ss_pred ccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 52 PVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVP 103 (168) Q Consensus 52 ~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 103 (168) ...||||||+|+++.++.++.+.+++.++ T Consensus 87 -----------------------~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 87 -----------------------RYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -----------------------cccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 47899999999999999999999888887 No 71 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=97.19 E-value=8e-07 Score=54.02 Aligned_cols=71 Identities=11% Similarity=0.192 Sum_probs=41.2 Q ss_pred Ccce----ecc--cchHHHHHHHHHHhh-----------------------CCeEEEEeecCCCchHHHHHhhhhcCcee Q lcl|NC_019544. 1 MKVT----IKD--TNNIDKITRNLQQLG-----------------------GKQIKVGLFGKDDSELVMIGAVHEYGAEI 51 (168) Q Consensus 1 M~v~----i~~--~~~~~~~~~~l~~l~-----------------------~~~v~VGi~~~~g~~~a~iA~~~E~G~~i 51 (168) ++-. ++. ......+.++.++.. +.++.||- -+.||.+.|||+ T Consensus 16 ~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~-------~~~Ya~~vE~GT-- 86 (115) T protein:vir:97 16 MKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITS-------HAAYSGFLEFGT-- 86 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeec-------Cccchhhhcccc-- Confidence 1100 000 011122333333322 11222221 245788888884 Q ss_pred ccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 52 PVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVP 103 (168) Q Consensus 52 ~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 103 (168) ...||||||+|+++.++.++.+.+++.++ T Consensus 87 -----------------------~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 87 -----------------------RYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred -----------------------cccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 47899999999999999999999888887 No 72 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=97.15 E-value=1e-06 Score=53.38 Aligned_cols=78 Identities=13% Similarity=0.154 Sum_probs=39.2 Q ss_pred Ccceecc--c----chHHHHHHHHHHhhCCe----EEEEeec-------CCC-----chHHHHHhhhhcCceeccCcccc Q lcl|NC_019544. 1 MKVTIKD--T----NNIDKITRNLQQLGGKQ----IKVGLFG-------KDD-----SELVMIGAVHEYGAEIPVTPKMR 58 (168) Q Consensus 1 M~v~i~~--~----~~~~~~~~~l~~l~~~~----v~VGi~~-------~~g-----~~~a~iA~~~E~G~~i~~~~~~~ 58 (168) |+=.+.+ . .....+....+++.... |.-|-+. ++| ..-+.||.+.|||+ T Consensus 16 ~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vEfGT--------- 86 (115) T protein:vir:10 16 MHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKKIGDLHYRVISTAHYSGFLEFGT--------- 86 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEeeCCCccchheeccc--------- Confidence 1100000 0 01112222222222100 0001110 000 01245777777774 Q ss_pred ccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 59 AWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVP 103 (168) Q Consensus 59 ~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 103 (168) ...|+||||||+++.++..+.+.+++++. T Consensus 87 ----------------~km~a~PFl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 87 ----------------RYMEPAPFMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred ----------------ccCCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 57899999999999999999888888776 No 73 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=97.15 E-value=6e-07 Score=54.70 Aligned_cols=79 Identities=9% Similarity=0.047 Sum_probs=49.8 Q ss_pred Cccee--cccchHHHHHHHHHH-------hhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcc Q lcl|NC_019544. 1 MKVTI--KDTNNIDKITRNLQQ-------LGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKE 71 (168) Q Consensus 1 M~v~i--~~~~~~~~~~~~l~~-------l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~ 71 (168) |+-.+ .....-..+.+.+.. .....+.||+..+ -+.|+.+.|||+ T Consensus 38 ~k~~ap~~~~~~~~hl~d~I~~~~~k~~~~g~~~~~VG~~k~----~~~y~~f~E~GT---------------------- 91 (125) T protein:vir:97 38 LKANTPVYEVETDERLQEDTVISGFKGANVGIVSKEIGYGKA----TGWRAHYPNDGT---------------------- 91 (125) T ss_pred HHHhCCcCCCCchhhHHhhhhcccccccccCceEEEEeecCC----CceeEeeeccCc---------------------- Confidence 22111 111111113333211 1223678887432 357788889994 Q ss_pred cceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019544. 72 TTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEG 108 (168) Q Consensus 72 ~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G 108 (168) .+.||||||+++++++++++.+.+.+.+.+.|.= T Consensus 92 ---~k~~~~pF~~pa~~~~k~~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 92 ---IYQRGQDFKERTINQMTPKAKQLYAEKVKEGLGL 125 (125) T ss_pred ---cCCCcCccchHhHHHhHHHHHHHHHHHHHHHhcC Confidence 5789999999999999999999999988886632 No 74 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=97.06 E-value=9e-07 Score=53.72 Aligned_cols=78 Identities=13% Similarity=0.204 Sum_probs=39.7 Q ss_pred Ccceecc--cc----hHHHHHHHHHHhhCCe----EEEEeec-------CCC-----chHHHHHhhhhcCceeccCcccc Q lcl|NC_019544. 1 MKVTIKD--TN----NIDKITRNLQQLGGKQ----IKVGLFG-------KDD-----SELVMIGAVHEYGAEIPVTPKMR 58 (168) Q Consensus 1 M~v~i~~--~~----~~~~~~~~l~~l~~~~----v~VGi~~-------~~g-----~~~a~iA~~~E~G~~i~~~~~~~ 58 (168) |+-.+.+ .. ...++....+.+.... +.-|.+. ++| .+-+.||.+.|||+ T Consensus 16 ~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~~g~~~~~V~~~~~Ya~~vE~GT--------- 86 (115) T protein:vir:99 16 MKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTVDLQYTITSHAAYSGFLEFGT--------- 86 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEecCCccccccccccc--------- Confidence 2111110 00 0112222222222100 0011110 000 01245677777774 Q ss_pred ccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 59 AWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVP 103 (168) Q Consensus 59 ~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 103 (168) ...||||||+|+++.++..+.+.++++++ T Consensus 87 ----------------~~m~a~PFl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 87 ----------------RYMEAEPFMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred ----------------cccCCCCcchhhHHHHHHHHHHHHHHHhC Confidence 57899999999999999999998888887 No 75 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=97.06 E-value=4.3e-06 Score=50.00 Aligned_cols=79 Identities=8% Similarity=0.156 Sum_probs=56.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCChHHHHHhcC--CCCcchhHHH Q lcl|NC_019544. 87 YDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL------KDPPNSQMTIERKG--SDNPLIDTGR 158 (168) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~------~~ppnsp~Ti~~KG--~~~PLiDTG~ 158 (168) +++ -+++...+...+.++ . ..+-...|..||..+....++.|.+. .|+|+++.|+++|. ..++|.++|. T Consensus 1 ~~~-~~~l~~~L~~ll~~l-~-~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~~~~l~~~~~ 77 (150) T protein:vir:20 1 MNE-FKRFEDRLTGLIESL-S-PSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRVKRKMFAKLI 77 (150) T ss_pred Cch-HHHHHHHHHHHHHhc-C-ChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCCCccccchhh Confidence 222 133333333333321 1 11346689999999999999999886 58899999998764 3579999999 Q ss_pred HHhHhccccC Q lcl|NC_019544. 159 LVGSIRHTVE 168 (168) Q Consensus 159 L~~SIty~V~ 168 (168) |.+||+|.+- T Consensus 78 l~~sl~~~~~ 87 (150) T protein:vir:20 78 TSRFLHIRAS 87 (150) T ss_pred hhhhhheeec Confidence 9999999987 No 76 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=97.00 E-value=2e-06 Score=51.79 Aligned_cols=84 Identities=17% Similarity=0.306 Sum_probs=45.4 Q ss_pred Ccce-----ecccch-H---HHHHHHHHH-hhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhc Q lcl|NC_019544. 1 MKVT-----IKDTNN-I---DKITRNLQQ-LGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRK 70 (168) Q Consensus 1 M~v~-----i~~~~~-~---~~~~~~l~~-l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~ 70 (168) +... .+.... | -.|...|.. .....|.||- + ..||++|+||+.+.. T Consensus 77 ~~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGt------n-~~YAaiHqfGg~~~~----------------- 132 (175) T protein:vir:10 77 GELTAAASRRKAGLMILQDSGQMAASVSTDHDDNSAVIGS------N-KEYAAIHQFGGQAGR----------------- 132 (175) T ss_pred hhhhhhhhhhccCCCcceechhhhhhhheeecCCEEEEec------C-hhhhhhhhcccccCC----------------- Confidence 1100 000000 0 123333332 2344566654 2 368999999986431 Q ss_pred ccceeccCCCchhHHHHHH-----HHHHHHHHHHHHHHHHHhcc Q lcl|NC_019544. 71 ETTVIKIPERSWLRSGYDE-----NIDKIAKKIEKMVPDVIEGN 109 (168) Q Consensus 71 ~~~~i~IP~RpFlr~~~~~-----~~~~~~~~~~~~~~~~l~G~ 109 (168) .+.++||+||||--+-+. ..++|.+.+...+..++.++ T Consensus 133 -~~~v~iPaRpfLG~s~~d~~~~e~~~~Il~~~~~~l~~~~~~~ 175 (175) T protein:vir:10 133 -GLKVTIPARPWLPVTADGELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred -CCccccCCccccCCCcccccchHHHHHHHHHHHHHHHHHhccC Confidence 246799999999543222 23566777777777777767 No 77 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=96.96 E-value=1.2e-06 Score=53.02 Aligned_cols=88 Identities=22% Similarity=0.325 Sum_probs=45.3 Q ss_pred Ccce-ec-ccchHHHHHHH--HH-HhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhccccee Q lcl|NC_019544. 1 MKVT-IK-DTNNIDKITRN--LQ-QLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVI 75 (168) Q Consensus 1 M~v~-i~-~~~~~~~~~~~--l~-~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i 75 (168) .+-. .+ ....+..+..+ |. ..+...+.|||.+ ....||++|.||.++++.++ .+.+ T Consensus 64 ~~~~~~~~~~~m~~~l~~~~~l~~~~~~~~a~vg~~G----s~~~yA~iHQfG~~~~~~~~---------------~~~v 124 (156) T protein:vir:11 64 GKQGRIRRKIKMFQKLRTVRYLRAKGDAQAITVSFAG----RIARIARVHQYGLRDRAEPG---------------APEV 124 (156) T ss_pred hhccccccchhhhhhhhhhheeeeeecCcEEEEEecC----CchhhhhhhcccccccccCC---------------CCcc Confidence 1100 00 00001111111 11 1245678899863 34578999999998765543 3578 Q ss_pred ccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHH Q lcl|NC_019544. 76 KIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPR 113 (168) Q Consensus 76 ~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~ 113 (168) +||+||||--+- +..+++.+.+...+. +. ++- T Consensus 125 ~iPaRp~LG~s~-~d~~~i~~~i~~~l~----~~-~~~ 156 (156) T protein:vir:11 125 SYAQRLLLGFDS-SDMETIQNGILAHID----AN-SPI 156 (156) T ss_pred cccccccCCCCH-HHHHHHHHHHHHHHh----hc-CCC Confidence 999999994442 233444444444433 33 332 No 78 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=96.96 E-value=1.9e-06 Score=51.90 Aligned_cols=91 Identities=12% Similarity=0.072 Sum_probs=49.4 Q ss_pred CcceecccchHHHHHHHHHHhhC---------------------------------CeEEEEeecCCC-----chHHHHH Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGG---------------------------------KQIKVGLFGKDD-----SELVMIG 42 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~---------------------------------~~v~VGi~~~~g-----~~~a~iA 42 (168) |+-... ++++|.+.|+.+.. .++.+-+.. +| ..-+.+| T Consensus 1 Ma~~~~---Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~-~~~~~~V~~~~~Ya 76 (137) T protein:vir:10 1 MAKVKY---GNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKK-GGLTGVINIGSEYA 76 (137) T ss_pred CchhHh---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeC-CcEEEEEecCCCcc Confidence 765543 34444444433322 112221111 11 1236799 Q ss_pred hhhhcCceeccCc-ccc-------ccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_019544. 43 AVHEYGAEIPVTP-KMR-------AWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIE 99 (168) Q Consensus 43 ~~~E~G~~i~~~~-~~~-------~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~ 99 (168) .+.|||+.+.... ..+ .+....+. ...+..+|+|||||++++++++++.+.+. T Consensus 77 ~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~----~~~t~g~~a~PFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 77 VYVNYGTGIYAVGPGGSRAKNIPWCYKDADGH----WHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred cccccCccccccCCCccccccccceeeccccc----eeccCCCCCCcchhHHHHHHHHHHHHhcC Confidence 9999998653221 111 11111111 12345789999999999999999888777 No 79 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=96.95 E-value=5.4e-06 Score=49.48 Aligned_cols=79 Identities=8% Similarity=0.145 Sum_probs=57.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCChHHHHHhcCC--CCcchhHHH Q lcl|NC_019544. 87 YDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL------KDPPNSQMTIERKGS--DNPLIDTGR 158 (168) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~------~~ppnsp~Ti~~KG~--~~PLiDTG~ 158 (168) +++ -+++...+...+.++ . ..+....|..||..+....++.|.+. .|+|+++.|+++|+. .++|+++|. T Consensus 1 ~~~-~~~l~~~L~~~l~~L-~-~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~~~ 77 (150) T protein:vir:60 1 MNE-FKRFEDRLTGLIESL-S-PSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLI 77 (150) T ss_pred Cch-HHHHHHHHHHHHHhc-C-ChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCCCccchhhhh Confidence 222 223333333333331 1 12346789999999999999999886 588999999998754 579999999 Q ss_pred HHhHhccccC Q lcl|NC_019544. 159 LVGSIRHTVE 168 (168) Q Consensus 159 L~~SIty~V~ 168 (168) |..||+|.+. T Consensus 78 l~~sl~~~~~ 87 (150) T protein:vir:60 78 TSRFLHIRAS 87 (150) T ss_pred hcceeeeeee Confidence 9999999888 No 80 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=96.95 E-value=1.3e-06 Score=52.82 Aligned_cols=91 Identities=13% Similarity=0.064 Sum_probs=50.0 Q ss_pred CcceecccchHHHHHHHHHHhhC---------------------------------CeEEEEeecCCC-----chHHHHH Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGG---------------------------------KQIKVGLFGKDD-----SELVMIG 42 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~---------------------------------~~v~VGi~~~~g-----~~~a~iA 42 (168) |.-.+++ +++|.+.|+++.. .++.+-+- ++| ...+.|| T Consensus 1 Ma~~~~g---~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~-~~~~~~~V~~~~~YA 76 (137) T protein:vir:93 1 MAKVKYG---NWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK-DSGFTGVINIGSEYA 76 (137) T ss_pred CchhHHh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEee-cCceEEEEecCCCcc Confidence 7777653 3344444333211 01111111 111 1347799 Q ss_pred hhhhcCceeccCccc--------cccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_019544. 43 AVHEYGAEIPVTPKM--------RAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIE 99 (168) Q Consensus 43 ~~~E~G~~i~~~~~~--------~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~ 99 (168) .+.|||+.+...... +.+....+.. ..+...|+||||+++++++++.+.+.+. T Consensus 77 ~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~----~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:93 77 IYVNYGTGIYATGAGGSRAKKIPWSYKDANGKW----HTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cccccCccccccCCCcccccccccceeccCcce----eecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999999865432211 1111111211 1234789999999999999999988877 No 81 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=96.95 E-value=1.3e-06 Score=52.82 Aligned_cols=91 Identities=13% Similarity=0.064 Sum_probs=50.0 Q ss_pred CcceecccchHHHHHHHHHHhhC---------------------------------CeEEEEeecCCC-----chHHHHH Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGG---------------------------------KQIKVGLFGKDD-----SELVMIG 42 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~---------------------------------~~v~VGi~~~~g-----~~~a~iA 42 (168) |.-.+++ +++|.+.|+++.. .++.+-+- ++| ...+.|| T Consensus 1 Ma~~~~g---~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~-~~~~~~~V~~~~~YA 76 (137) T protein:vir:97 1 MAKVKYG---NWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK-DSGFTGVINIGSEYA 76 (137) T ss_pred CchhHHh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEee-cCceEEEEecCCCcc Confidence 7777653 3344444333211 01111111 111 1347799 Q ss_pred hhhhcCceeccCccc--------cccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_019544. 43 AVHEYGAEIPVTPKM--------RAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIE 99 (168) Q Consensus 43 ~~~E~G~~i~~~~~~--------~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~ 99 (168) .+.|||+.+...... +.+....+.. ..+...|+||||+++++++++.+.+.+. T Consensus 77 ~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~----~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:97 77 IYVNYGTGIYATGAGGSRAKKIPWSYKDANGKW----HTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cccccCccccccCCCcccccccccceeccCcce----eecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999999865432211 1111111211 1234789999999999999999988877 No 82 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=96.95 E-value=1.3e-06 Score=52.82 Aligned_cols=91 Identities=13% Similarity=0.064 Sum_probs=50.0 Q ss_pred CcceecccchHHHHHHHHHHhhC---------------------------------CeEEEEeecCCC-----chHHHHH Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGG---------------------------------KQIKVGLFGKDD-----SELVMIG 42 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~---------------------------------~~v~VGi~~~~g-----~~~a~iA 42 (168) |.-.+++ +++|.+.|+++.. .++.+-+- ++| ...+.|| T Consensus 1 Ma~~~~g---~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~-~~~~~~~V~~~~~YA 76 (137) T protein:vir:94 1 MAKVKYG---NWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK-DSGFTGVINIGSEYA 76 (137) T ss_pred CchhHHh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEee-cCceEEEEecCCCcc Confidence 7777653 3344444333211 01111111 111 1347799 Q ss_pred hhhhcCceeccCccc--------cccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_019544. 43 AVHEYGAEIPVTPKM--------RAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIE 99 (168) Q Consensus 43 ~~~E~G~~i~~~~~~--------~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~ 99 (168) .+.|||+.+...... +.+....+.. ..+...|+||||+++++++++.+.+.+. T Consensus 77 ~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~----~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 77 IYVNYGTGIYATGAGGSRAKKIPWSYKDANGKW----HTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cccccCccccccCCCcccccccccceeccCcce----eecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999999865432211 1111111211 1234789999999999999999988877 No 83 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=96.94 E-value=9.4e-07 Score=53.64 Aligned_cols=100 Identities=12% Similarity=0.093 Sum_probs=46.5 Q ss_pred Ccceeccc---chHHHHHHH-------------------HHHhhC-------CeEEEEeecCCCc-------hHHHHHhh Q lcl|NC_019544. 1 MKVTIKDT---NNIDKITRN-------------------LQQLGG-------KQIKVGLFGKDDS-------ELVMIGAV 44 (168) Q Consensus 1 M~v~i~~~---~~~~~~~~~-------------------l~~l~~-------~~v~VGi~~~~g~-------~~a~iA~~ 44 (168) |.+++..+ +.++++.+. .+.+.- .++.+-+- .+|. ..+.||.+ T Consensus 4 ~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~-~~g~~~~~~v~~~~~YA~~ 82 (142) T protein:vir:94 4 LNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPS-GGRFSFSVTIGTNVTYAAD 82 (142) T ss_pred eEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeec-cCCceEEEEEecCcccchh Confidence 55443321 122221111 111110 11111111 1111 23689999 Q ss_pred hhcCceecc-Ccccccc--ccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 45 HEYGAEIPV-TPKMRAW--FAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMV 102 (168) Q Consensus 45 ~E~G~~i~~-~~~~~~~--~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~ 102 (168) +|||+.... .|+.... +...+. .......-.+||||||+++++++++++.+.++..- T Consensus 83 vE~Gt~~~~i~pk~~k~l~~~~~~~-~~~~v~~pG~~~~pfl~~A~~~~~~~i~~~~~~~~ 142 (142) T protein:vir:94 83 VEYGTAPHVIVPKDKKALYWPGAAH-PVAKVNHPGTRAQPFMRPAIAAASTFLRNHAKGIR 142 (142) T ss_pred hhccCCCceeccCCCccceecccce-eeeeeeecCCCCCcchhHHHHHHHHHHHHHHHhcC Confidence 999985321 1222222 221111 11122223589999999999999888877666543 No 84 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=96.93 E-value=1.6e-06 Score=52.42 Aligned_cols=78 Identities=17% Similarity=0.302 Sum_probs=40.5 Q ss_pred Cccee-cccchHHHHHHHHHH-hhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccC Q lcl|NC_019544. 1 MKVTI-KDTNNIDKITRNLQQ-LGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIP 78 (168) Q Consensus 1 M~v~i-~~~~~~~~~~~~l~~-l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP 78 (168) ..-++ .++. .|..+|.. .....|.||- + ..||++|+||+.+.. .+.++|| T Consensus 71 ~~~~iL~~tg---~L~~Si~~~~~~~~v~vGt------n-~~YA~iHqfGg~~~~------------------~~~v~iP 122 (155) T protein:vir:99 71 GPHPILQVTN---ALARSVTTWADRNEAGIGS------N-LVYAAIHQFGGDAGR------------------GHQVEIP 122 (155) T ss_pred CCCCcchhch---hhhhhhhceecCCEEEEec------C-ccchhhhhcccccCC------------------CCccccC Confidence 11111 1112 23444433 2445677764 2 457999999987532 2468999 Q ss_pred CCchhHHHHH-----HHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019544. 79 ERSWLRSGYD-----ENIDKIAKKIEKMVPDVIEGN 109 (168) Q Consensus 79 ~RpFlr~~~~-----~~~~~~~~~~~~~~~~~l~G~ 109 (168) +||||--+-+ +-.+++.+.+...+.+ ++ T Consensus 123 aRpfLG~s~~~~l~~e~~~~I~~~i~~~l~~---~~ 155 (155) T protein:vir:99 123 ARRYLPFDENGQLAAGARQSILEIVLTALSR---NR 155 (155) T ss_pred CccccCCCCccccchHHHHHHHHHHHHHHhc---cC Confidence 9999943221 2234444444444443 22 No 85 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=96.86 E-value=2.1e-06 Score=51.67 Aligned_cols=70 Identities=14% Similarity=0.249 Sum_probs=37.9 Q ss_pred CcceecccchH----HHHHHHHHHh--------------------hCCeEEEEeecCCCchHHHHHhhhhcCceeccCcc Q lcl|NC_019544. 1 MKVTIKDTNNI----DKITRNLQQL--------------------GGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPK 56 (168) Q Consensus 1 M~v~i~~~~~~----~~~~~~l~~l--------------------~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~ 56 (168) ++-.-+....+ .++...+..- .+.++.|| +-+.||.+.|||+ T Consensus 19 ~~~~~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~~~~~~~~v~-------~~~~Ya~~vE~GT------- 84 (112) T protein:vir:96 19 NASSERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLEAGSDRAVVE-------ALTNYSGYLEVGT------- 84 (112) T ss_pred hcCHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeeecCceEEEec-------CCCCccceeccCc------- Confidence 32110101111 1111111111 11122222 1245778888884 Q ss_pred ccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 57 MRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMV 102 (168) Q Consensus 57 ~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~ 102 (168) ...|+||||+|+++.++..+.+.+++.- T Consensus 85 ------------------r~m~AqPF~~PA~~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 85 ------------------RKMEAQPFMRPALDQVVPEMVEEMAKWE 112 (112) T ss_pred ------------------cccCCCCchhhhHHHHHHHHHHHHHhcC Confidence 4789999999999999999888777764 No 86 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=96.86 E-value=3.3e-06 Score=50.64 Aligned_cols=92 Identities=15% Similarity=0.142 Sum_probs=50.6 Q ss_pred Cccee----------cccchHHHHHHHHHH---hhCCeEEEEeecCCCchHHHHHhhhhcCceeccCcc-----cccccc Q lcl|NC_019544. 1 MKVTI----------KDTNNIDKITRNLQQ---LGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPK-----MRAWFA 62 (168) Q Consensus 1 M~v~i----------~~~~~~~~~~~~l~~---l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~-----~~~~~~ 62 (168) |...+ .++.. |..++.. -.+.++.||. -+.||.+.|||+.+....+ .++|.. T Consensus 32 ~~~~~ie~~ak~~~pvdtG~---L~~SI~~~v~~~g~~~~V~~-------~~~YA~yVE~GTG~~~~~~~grk~~w~y~~ 101 (141) T protein:vir:78 32 MTTELAEGGHGVTSNNDTGE---YAQKSGYKVRKSSKEVIVGN-------SSDYAIYYEFGTGEKSERGGGKAGGWFYMD 101 (141) T ss_pred HHHHHHHHhhhhccccccch---hhcceeeeeecCCcEEEEec-------CCCccceeecCCcccccCCCCCcCcceeec Confidence 10000 00111 1111111 1233444442 3568999999987644321 223333 Q ss_pred ccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|NC_019544. 63 ANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVN 111 (168) Q Consensus 63 ~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~ 111 (168) .+|...+ +..-|+||||+++++++++++.+.+++.+..+ + T Consensus 102 ~~g~~~~----t~G~~aqpFl~~A~~~~~~~i~~~i~~~~~~l-----~ 141 (141) T protein:vir:78 102 KKGHWHF----TRGSQASKRMRYTFRDEQDKVRVFTERALRGI-----N 141 (141) T ss_pred CCCeeEe----ccCCCCchhhhhhHHhhHHHHHHHHHHHhhcc-----C Confidence 3444322 33689999999999999999999888887653 3 No 87 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=96.85 E-value=7.9e-06 Score=48.55 Aligned_cols=79 Identities=8% Similarity=0.128 Sum_probs=57.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCChHHHHHhcCC--CCcchhHHH Q lcl|NC_019544. 87 YDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL------KDPPNSQMTIERKGS--DNPLIDTGR 158 (168) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~------~~ppnsp~Ti~~KG~--~~PLiDTG~ 158 (168) +++. +++...+...+.++ . ..+...+|..||..+....++.|.+. .|+|+++.|+++|+. .++|+++|. T Consensus 1 m~~~-~~l~~~L~~~l~~L-~-~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~~~~~l~~~~~ 77 (150) T protein:vir:57 1 MNEF-KRFEDRLTGLIESL-S-PSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLI 77 (150) T ss_pred CchH-HHHHHHHHHHHHhc-C-ChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccCCCcccchhhh Confidence 2221 23333333333321 1 12346789999999999999999886 688999999987753 579999999 Q ss_pred HHhHhccccC Q lcl|NC_019544. 159 LVGSIRHTVE 168 (168) Q Consensus 159 L~~SIty~V~ 168 (168) |..||+|.+. T Consensus 78 l~~sl~~~~~ 87 (150) T protein:vir:57 78 TSRFLHIRAS 87 (150) T ss_pred hccceeeeee Confidence 9999999888 No 88 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=96.80 E-value=1.5e-06 Score=52.44 Aligned_cols=75 Identities=13% Similarity=0.247 Sum_probs=37.9 Q ss_pred ceecccchHHHH----------HHHH-----------HHhhC-------CeEEEEeecCCC-----chHHHHHhhhhcCc Q lcl|NC_019544. 3 VTIKDTNNIDKI----------TRNL-----------QQLGG-------KQIKVGLFGKDD-----SELVMIGAVHEYGA 49 (168) Q Consensus 3 v~i~~~~~~~~~----------~~~l-----------~~l~~-------~~v~VGi~~~~g-----~~~a~iA~~~E~G~ 49 (168) |++++.+.|.+- .++| +.+.. .++.+-+-. +| .+-+.||.+.|||+ T Consensus 1 i~i~Gld~l~~~l~~~~~~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~~-~~~~~~V~~~~~Ya~~vE~GT 79 (108) T protein:vir:98 1 MKITGIDALQKKLRKNATLNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFTD-GGLTGTTIPHTDYAGYVEYGT 79 (108) T ss_pred CcchhHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeeec-CceEEEeecCCCccceeeccc Confidence 222221111111 1111 11100 011111100 00 01234666677774 Q ss_pred eeccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 50 EIPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVP 103 (168) Q Consensus 50 ~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 103 (168) ...|+||||+|+++.+++++.+.+++.++ T Consensus 80 -------------------------~~m~aqPFl~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 80 -------------------------RFQAAQPFVKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred -------------------------cccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 47899999999999999999988888877 No 89 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=96.74 E-value=4.1e-06 Score=50.11 Aligned_cols=71 Identities=13% Similarity=0.233 Sum_probs=38.9 Q ss_pred CcceecccchH----HHHHHHHHHhh------------------CCeEEEEeecCCCchHHHHHhhhhcCceeccCcccc Q lcl|NC_019544. 1 MKVTIKDTNNI----DKITRNLQQLG------------------GKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMR 58 (168) Q Consensus 1 M~v~i~~~~~~----~~~~~~l~~l~------------------~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~ 58 (168) |...-.-...+ ..+.++.+.+. +..+.|| +-+.||.+.|||+ T Consensus 16 ~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~si~~~~~~~~~~~~V~-------~~~~Ya~~vE~GT--------- 79 (108) T protein:vir:74 16 NATLDDVKHVVKSNTASMNKNMQNLAPVDTGNMKRSITSEFTDGGLSGTTG-------PHTDYAGYVEYGT--------- 79 (108) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhccceeeeecCceEEEee-------cCCCcccceeccc--------- Confidence 22110000111 11111222211 1122222 1244677778884 Q ss_pred ccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 59 AWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVP 103 (168) Q Consensus 59 ~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 103 (168) ...|+||||||+++.++.++.+.+++.++ T Consensus 80 ----------------~km~aqpf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 80 ----------------RFQSAQPFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred ----------------cccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 46899999999999999999988887777 No 90 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=96.70 E-value=1.7e-06 Score=52.19 Aligned_cols=95 Identities=18% Similarity=0.151 Sum_probs=45.3 Q ss_pred Ccceeccc---chHHHHH----H-HHHHhh-------C-----------CeEEEEeecCCC-------chHHHHHhhhhc Q lcl|NC_019544. 1 MKVTIKDT---NNIDKIT----R-NLQQLG-------G-----------KQIKVGLFGKDD-------SELVMIGAVHEY 47 (168) Q Consensus 1 M~v~i~~~---~~~~~~~----~-~l~~l~-------~-----------~~v~VGi~~~~g-------~~~a~iA~~~E~ 47 (168) |+|+.+-. ..+.+.+ + .|++.. . .++..-+..+.+ ...+.||.++|| T Consensus 1 m~~s~~i~i~~~~l~~~v~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~ve~ 80 (137) T protein:vir:10 1 MPVTARIHINEPELERQTGAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTYRPFHVGGGVEDNVDYAAPVHE 80 (137) T ss_pred CCeeEEEeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeeccccceEEEEEecCCCceeeeee Confidence 88886532 2221111 1 111111 1 111111111111 134789999999 Q ss_pred Cc---eeccCccccccccccchhhhcccceeccC---CCchhHHHHHHH---HHHHHHHHHHHHHHHHhccCc Q lcl|NC_019544. 48 GA---EIPVTPKMRAWFAANGYPLRKETTVIKIP---ERSWLRSGYDEN---IDKIAKKIEKMVPDVIEGNVN 111 (168) Q Consensus 48 G~---~i~~~~~~~~~~~~~g~~~~~~~~~i~IP---~RpFlr~~~~~~---~~~~~~~~~~~~~~~l~G~~~ 111 (168) |+ .|.+..+....+...|...|.. +++.| |||||++++++. ..++.- + T Consensus 81 GT~ph~I~pk~~k~l~f~~~G~~v~~k--~v~hpG~~a~Pfl~~A~~~~~~~~~ri~~--------------~ 137 (137) T protein:vir:10 81 GSRPHRITARHANALHFFWHGREVFRK--SVWHPGVRPRPFLRNAARRVVAADPDIHM--------------T 137 (137) T ss_pred cCCCceeecccCceeeeeeCCceEEee--eeecCCCCCCchHHHHHHHHhhccccccC--------------C Confidence 98 3544433333334445544443 45556 999999999874 222211 1 No 91 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=96.65 E-value=3.4e-06 Score=50.53 Aligned_cols=92 Identities=14% Similarity=0.087 Sum_probs=50.0 Q ss_pred CcceecccchHHHHHHHHHHhhCC------------------------eEEEEee--------cCCC-----chHHHHHh Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGK------------------------QIKVGLF--------GKDD-----SELVMIGA 43 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~------------------------~v~VGi~--------~~~g-----~~~a~iA~ 43 (168) |+..+++ +++|.+.|+++... -|.-|-+ .++| ...+.||. T Consensus 1 Ma~~~~G---~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~~~~~~~~V~~~~~YA~ 77 (137) T protein:vir:95 1 MAKVKYG---NWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAI 77 (137) T ss_pred CchhHHh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEeeCCceEEEEecCCCccc Confidence 8777653 34444444332210 0111111 0111 13477999 Q ss_pred hhhcCceeccCcccc--------ccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_019544. 44 VHEYGAEIPVTPKMR--------AWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIE 99 (168) Q Consensus 44 ~~E~G~~i~~~~~~~--------~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~ 99 (168) +.|||+.+....+.. .+....+.. ..+...|+||||+++++++++++.+.+. T Consensus 78 ~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~----~~t~g~~a~PFl~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 78 YVNYGTGIYATGAGGSRAKKIPWSYKDANGKW----HTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred ccccCccccccCCCcccccccccceeccCcce----eecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 999998654332211 111111211 1234789999999999999999988877 No 92 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=96.39 E-value=2.7e-06 Score=51.15 Aligned_cols=93 Identities=13% Similarity=0.070 Sum_probs=47.6 Q ss_pred CcceecccchHHHHHHHHHHhhC---------------------------------CeEEEEeecCCC-----chHHHHH Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGG---------------------------------KQIKVGLFGKDD-----SELVMIG 42 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~---------------------------------~~v~VGi~~~~g-----~~~a~iA 42 (168) |.-.+++ ++++.+.|+.+.. .++.+-+- .+| ..-+.|| T Consensus 1 Ma~~~~G---~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~pvdTG~L~~Si~~~~~-~~g~~~~V~~~~~YA 76 (137) T protein:vir:96 1 MAKVKYG---NWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAPVDLGFLKESIDFKVT-DGGFSSVISVGAEYA 76 (137) T ss_pred CchhHhh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCccchhcCceeEee-cCceEEEEecCCCcc Confidence 7665442 3333333322111 01111111 111 1236899 Q ss_pred hhhhcCceeccCcccc------ccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_019544. 43 AVHEYGAEIPVTPKMR------AWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIE 99 (168) Q Consensus 43 ~~~E~G~~i~~~~~~~------~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~ 99 (168) .+.|||+......+.. .++........ ..+..+|+||||+++++++++.+.+.+. T Consensus 77 ~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~--~~t~g~~a~pFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:96 77 IYVEFGTGIYATGPGGSRARKLPWTYKGDDGEW--HTTYGQQAQPFWNPAIDEGRKVFNRYFS 137 (137) T ss_pred cccccCccccccCCCccccccccceeeccCcce--eecCCCCCCcchhHHHHHHHHHHHHhhC Confidence 9999998653321111 01111111111 1345789999999999999999888777 No 93 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=96.37 E-value=3.3e-06 Score=50.63 Aligned_cols=96 Identities=15% Similarity=0.095 Sum_probs=43.1 Q ss_pred Ccceec--ccchHH----HHHHHHHHhhC-------CeEEEEeecCCC-----chHHHHHhhhhcCceeccCcccc---- Q lcl|NC_019544. 1 MKVTIK--DTNNID----KITRNLQQLGG-------KQIKVGLFGKDD-----SELVMIGAVHEYGAEIPVTPKMR---- 58 (168) Q Consensus 1 M~v~i~--~~~~~~----~~~~~l~~l~~-------~~v~VGi~~~~g-----~~~a~iA~~~E~G~~i~~~~~~~---- 58 (168) +.-.+. ....+. .+....+.+.. .++.+-+- ++| ..-+.||.+.|||+.+....... T Consensus 18 ~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~-~~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~ 96 (137) T protein:vir:94 18 YERDIERWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFK-DGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAK 96 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEee-cCcEEEEEecCCCcccccccCccccccCCCccccc Confidence 110000 000010 11111111111 01111111 111 12367999999997654322111 Q ss_pred --ccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_019544. 59 --AWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIE 99 (168) Q Consensus 59 --~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~ 99 (168) .++...... ....+..+|+||||+++++++++++.+.+. T Consensus 97 ~~~~~~~~~~~--~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 97 KIPWSYKDANG--KWHTTKGQHAQPFWEPAIDAGRVFFNKYFS 137 (137) T ss_pred ccccceeccCC--ceeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 111111111 111345789999999999999999988877 No 94 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=96.28 E-value=2.9e-06 Score=50.98 Aligned_cols=91 Identities=14% Similarity=0.115 Sum_probs=47.3 Q ss_pred CcceecccchHHHHHHHHHHhhC---------------------------------CeEEEEeecCCC-----chHHHHH Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGG---------------------------------KQIKVGLFGKDD-----SELVMIG 42 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~---------------------------------~~v~VGi~~~~g-----~~~a~iA 42 (168) |.-... ++++|.+.|+.+.. .++.+-+- ++| ..-+.|| T Consensus 1 Ma~~~~---G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~-~~~~~~~V~~~~~YA 76 (137) T protein:vir:10 1 MAKVKY---GNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFK-KGGLTGVINIGSEYA 76 (137) T ss_pred Cccchh---CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEec-CCcEEEEEecCCccc Confidence 543322 23333333322211 11221111 112 1237799 Q ss_pred hhhhcCceeccC-cccc-------ccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_019544. 43 AVHEYGAEIPVT-PKMR-------AWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIE 99 (168) Q Consensus 43 ~~~E~G~~i~~~-~~~~-------~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~ 99 (168) .+.|||+.+... +..+ .+....|.. ..+..+||||||++++++++.++.+.+. T Consensus 77 ~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~----~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 77 VYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHW----HTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred cccccCccccccCCCcccccccceeeecccccc----ccCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 999999755321 1111 111222221 2345799999999999999999888777 No 95 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=96.23 E-value=6.6e-06 Score=49.00 Aligned_cols=93 Identities=12% Similarity=0.104 Sum_probs=43.6 Q ss_pred Ccceeccc--chHHHHHHHHHHhh---------CCe-----------EEEEeecCCCchHHHHHhhhhcCce---eccCc Q lcl|NC_019544. 1 MKVTIKDT--NNIDKITRNLQQLG---------GKQ-----------IKVGLFGKDDSELVMIGAVHEYGAE---IPVTP 55 (168) Q Consensus 1 M~v~i~~~--~~~~~~~~~l~~l~---------~~~-----------v~VGi~~~~g~~~a~iA~~~E~G~~---i~~~~ 55 (168) +.-.++.. .....+....+++. +.. |.+|+. ..+.||.++|||+. |.+.. T Consensus 22 ~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~-----~~a~YA~~ve~GT~ph~i~pk~ 96 (142) T protein:vir:99 22 VGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVT-----AHAKYAAAVHEGTRPHVIRAKH 96 (142) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEec-----cCccccceeccCCccceecccc Confidence 11111000 00011111112211 111 222221 34779999999984 44333 Q ss_pred cccccccccchhhhcccceecc---CCCchhHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 56 KMRAWFAANGYPLRKETTVIKI---PERSWLRSGYDENIDKIAKKIEK 100 (168) Q Consensus 56 ~~~~~~~~~g~~~~~~~~~i~I---P~RpFlr~~~~~~~~~~~~~~~~ 100 (168) +....+...|...+.. +++. ||||||+++++.+.++-.+.... T Consensus 97 ~~al~f~~~g~~~~~k--~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:99 97 AQALHFWWRGREVFVR--QVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred CceeeEecCCceeeee--eeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 3323334445544443 4454 49999999999988764443333 No 96 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=96.23 E-value=6.6e-06 Score=49.00 Aligned_cols=93 Identities=12% Similarity=0.104 Sum_probs=43.6 Q ss_pred Ccceeccc--chHHHHHHHHHHhh---------CCe-----------EEEEeecCCCchHHHHHhhhhcCce---eccCc Q lcl|NC_019544. 1 MKVTIKDT--NNIDKITRNLQQLG---------GKQ-----------IKVGLFGKDDSELVMIGAVHEYGAE---IPVTP 55 (168) Q Consensus 1 M~v~i~~~--~~~~~~~~~l~~l~---------~~~-----------v~VGi~~~~g~~~a~iA~~~E~G~~---i~~~~ 55 (168) +.-.++.. .....+....+++. +.. |.+|+. ..+.||.++|||+. |.+.. T Consensus 22 ~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~-----~~a~YA~~ve~GT~ph~i~pk~ 96 (142) T protein:vir:86 22 VGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVT-----AHAKYAAAVHEGTRPHVIRAKH 96 (142) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEec-----cCccccceeccCCccceecccc Confidence 11111000 00011111112211 111 222221 34779999999984 44333 Q ss_pred cccccccccchhhhcccceecc---CCCchhHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 56 KMRAWFAANGYPLRKETTVIKI---PERSWLRSGYDENIDKIAKKIEK 100 (168) Q Consensus 56 ~~~~~~~~~g~~~~~~~~~i~I---P~RpFlr~~~~~~~~~~~~~~~~ 100 (168) +....+...|...+.. +++. ||||||+++++.+.++-.+.... T Consensus 97 ~~al~f~~~g~~~~~k--~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:86 97 AQALHFWWRGREVFVR--QVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred CceeeEecCCceeeee--eeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 3323334445544443 4454 49999999999988764443333 No 97 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=96.23 E-value=1.2e-05 Score=47.66 Aligned_cols=78 Identities=17% Similarity=0.307 Sum_probs=39.3 Q ss_pred Cccee-cccchHHHHHHHHHH-hhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccC Q lcl|NC_019544. 1 MKVTI-KDTNNIDKITRNLQQ-LGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIP 78 (168) Q Consensus 1 M~v~i-~~~~~~~~~~~~l~~-l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP 78 (168) ..-++ .++. .|..+|.. .....|.||- + ..||++|+||+.+.. .+.++|| T Consensus 71 ~~~~iL~~tG---~L~~Si~~~~~~~~v~vGt------~-~~YA~iHqfGg~~~~------------------~~~v~iP 122 (155) T protein:vir:79 71 GPHPILQVTN---ALARSVTTWADRNEAGIGS------N-LVYAAIHQFGGDAGR------------------GHQVEIP 122 (155) T ss_pred CCCCccccch---hhhhhhhceecCCEEEEec------C-chhhhhhhcccccCC------------------CCccccC Confidence 11111 1111 24444432 2445677753 2 458999999986532 2468999 Q ss_pred CCchhHHHHHH-----HHHHHHHHHHHHHHHHHhcc Q lcl|NC_019544. 79 ERSWLRSGYDE-----NIDKIAKKIEKMVPDVIEGN 109 (168) Q Consensus 79 ~RpFlr~~~~~-----~~~~~~~~~~~~~~~~l~G~ 109 (168) +||||--+-+. -.+++.+.+...+. .|+ T Consensus 123 aRpfLG~s~~~~l~~~~~~~I~~~i~~~l~---r~r 155 (155) T protein:vir:79 123 ARRYLPFDENGQLAAGARQSILEVVLTALS---RNR 155 (155) T ss_pred CccccCCCCccccchHHHHHHHHHHHHHHH---hcC Confidence 99999433221 12334444443333 344 No 98 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=96.09 E-value=5.2e-06 Score=49.57 Aligned_cols=93 Identities=13% Similarity=0.108 Sum_probs=47.4 Q ss_pred CcceecccchHHHHHHHHHHhhC---------------------------------CeEEEEeecCCC-----chHHHHH Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGG---------------------------------KQIKVGLFGKDD-----SELVMIG 42 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~---------------------------------~~v~VGi~~~~g-----~~~a~iA 42 (168) |+.... +++++.+.|+++.. .++.+=+ ..+| ...+.|| T Consensus 1 Ma~~~~---Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~-~~~g~~~~V~~~~~YA 76 (135) T protein:vir:96 1 MAKVKY---GADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDF-ENGGFTGVVKIGSNYA 76 (135) T ss_pred Cchhhh---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEe-ecCcEEEEEecCCCcc Confidence 664322 33333333332211 1111111 1111 1347799 Q ss_pred hhhhcCceeccCcc----ccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_019544. 43 AVHEYGAEIPVTPK----MRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIE 99 (168) Q Consensus 43 ~~~E~G~~i~~~~~----~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~ 99 (168) .+.|||+....... ...++..+....+ ..+..+|+||||+++++++++++.+.+. T Consensus 77 ~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~--~~~~~~~a~pfl~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 77 VYVNYGTGIYATKGSRAHKIPWTYKDPNGKW--HTTYGQMPQPFWEPAIDAGRQTFEQYFS 135 (135) T ss_pred chhhcccccccCCCccccccccccccCCcce--eecCCcCCCcchhHHHHHHHHHHHHhcC Confidence 99999985532211 0111111111111 1235899999999999999998887776 No 99 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=96.08 E-value=2.1e-05 Score=46.23 Aligned_cols=99 Identities=21% Similarity=0.254 Sum_probs=51.6 Q ss_pred CcceecccchHHHHHHHH----HH---hhCC-eEEEEeecCCCchHHHHHhhhhcCcee----ccCccccccccccchhh Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNL----QQ---LGGK-QIKVGLFGKDDSELVMIGAVHEYGAEI----PVTPKMRAWFAANGYPL 68 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l----~~---l~~~-~v~VGi~~~~g~~~a~iA~~~E~G~~i----~~~~~~~~~~~~~g~~~ 68 (168) |+-.+....+ +|.++| .+ -++. .-.|||-. .-|-.+.+-|||--. ...+++.|. .++. T Consensus 5 akarv~~~~G--~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~----rkAPhghlvE~Ghw~~~~~~~~~dG~w~--~~~~-- 74 (119) T protein:vir:81 5 AKAFVNDETG--KLRSNLYVAYSPEESTNGVQTYAVSWRK----KAAPHGHLLEFGHWQTHAAYKGKDGEWY--SSSV-- 74 (119) T ss_pred cccccCCCcc--chhhhheeeeccccCCCCeEEEEeeccC----CcCCcccccccceeeeeeeeeccCceee--ecCc-- Confidence 5544433322 233333 11 1221 23455532 122333445788321 111222221 1221 Q ss_pred hcccceeccCCCchhHHHHHHHHHHHHHHHHHH----HHHHHhccC Q lcl|NC_019544. 69 RKETTVIKIPERSWLRSGYDENIDKIAKKIEKM----VPDVIEGNV 110 (168) Q Consensus 69 ~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~----~~~~l~G~~ 110 (168) .+.....+|+|||||++|+....+....+.+. +.+++.|+. T Consensus 75 -~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:81 75 -KLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred -cccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 23456689999999999998887777766655 888898886 No 100 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=96.06 E-value=2.2e-05 Score=46.07 Aligned_cols=101 Identities=21% Similarity=0.254 Sum_probs=51.4 Q ss_pred CcceecccchHHHHHHHH----HH---hhCC-eEEEEeecCCCchHHHHHhhhhcCcee--ccCccccccccccchhhhc Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNL----QQ---LGGK-QIKVGLFGKDDSELVMIGAVHEYGAEI--PVTPKMRAWFAANGYPLRK 70 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l----~~---l~~~-~v~VGi~~~~g~~~a~iA~~~E~G~~i--~~~~~~~~~~~~~g~~~~~ 70 (168) |+-.+....+ +|.++| .+ -++. .-.|||-. .-|-.+.+-|||--. .+.....+....++. . T Consensus 5 akarv~~~~G--~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~----rkAPhghlvE~Ghw~~~~~~~~~dG~w~~~~~---~ 75 (119) T protein:vir:10 5 AKAFVNDETG--KLRSNLYVAYSTEESTNGVQTYAVSWRK----KAAPHGHLLEFGHWQTHAAYKGKDGEWYSSSV---K 75 (119) T ss_pred cccccCCCcc--chhhhheeeeccccCCCCEEEEEeecCC----CcCCcccccccceeeeeeeeeccCceeeecCc---c Confidence 5544443322 233333 11 1221 23455532 122333445777321 111111111111221 2 Q ss_pred ccceeccCCCchhHHHHHHHHHHHHHHHHHH----HHHHHhccC Q lcl|NC_019544. 71 ETTVIKIPERSWLRSGYDENIDKIAKKIEKM----VPDVIEGNV 110 (168) Q Consensus 71 ~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~----~~~~l~G~~ 110 (168) +.....+|+|||||++|+....+....+.+. +.+++.|+. T Consensus 76 l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:10 76 LVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred ccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 3455689999999999998887777766665 888898886 No 101 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=95.71 E-value=2.5e-05 Score=45.82 Aligned_cols=92 Identities=17% Similarity=0.125 Sum_probs=47.1 Q ss_pred CcceecccchHHH----HHHHHHHhhC-------CeEEEEeecCCC-----chHHHHHhhhhcCceeccCcc------c- Q lcl|NC_019544. 1 MKVTIKDTNNIDK----ITRNLQQLGG-------KQIKVGLFGKDD-----SELVMIGAVHEYGAEIPVTPK------M- 57 (168) Q Consensus 1 M~v~i~~~~~~~~----~~~~l~~l~~-------~~v~VGi~~~~g-----~~~a~iA~~~E~G~~i~~~~~------~- 57 (168) |.--++ +.+++ +....+.+.. .++.+-+-. +| ...+.||.+.|||+.+....+ . T Consensus 1 v~~~v~--~~~~~~~~~i~~~ak~~apv~TG~Lr~SI~~~~~~-~~~~~~V~~~~~Ya~yvE~GTg~~~~~~~~~~~~~~ 77 (116) T protein:vir:95 1 MERWVK--RGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKD-GGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKNI 77 (116) T ss_pred ChHHHH--HHHHHHHHHHHHHHHhhCCccccccccceeEEeec-CcEEEEEecCCCccceeecCccccccCCCccccccc Confidence 222221 12222 2222222221 122222211 11 134779999999987643211 1 Q ss_pred -cccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_019544. 58 -RAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIE 99 (168) Q Consensus 58 -~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~ 99 (168) +.+....|.. ..+...||||||+++++++++.+.+.+. T Consensus 78 ~~~~~~~~g~~----~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 78 PWSYKDANGKW----HTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred cceeecCccce----eeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 1122222222 2345799999999999999998887766 No 102 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=95.60 E-value=1.8e-05 Score=46.59 Aligned_cols=96 Identities=18% Similarity=0.166 Sum_probs=43.0 Q ss_pred Ccceec--ccchHH----HHHHHHHHhhC-------CeEEEEeecCCC-----chHHHHHhhhhcCceeccC-cccc--- Q lcl|NC_019544. 1 MKVTIK--DTNNID----KITRNLQQLGG-------KQIKVGLFGKDD-----SELVMIGAVHEYGAEIPVT-PKMR--- 58 (168) Q Consensus 1 M~v~i~--~~~~~~----~~~~~l~~l~~-------~~v~VGi~~~~g-----~~~a~iA~~~E~G~~i~~~-~~~~--- 58 (168) +.-.+. -...+. .+....+.+.. .++.+-+. ++| ..-+.+|.+.|||+.+... +..+ T Consensus 30 ~~~~~~~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~~-~~g~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~ 108 (149) T protein:vir:10 30 FDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYF-DGGLSSVISVGADYAIYVEYGTGIYATGPGGSRAT 108 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEec-CCcEEEEEecCCCcccccccCccccccCCcccccc Confidence 100000 000010 11111111111 01111111 111 1236799999999865321 1111 Q ss_pred --ccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_019544. 59 --AWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIE 99 (168) Q Consensus 59 --~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~ 99 (168) .++.......+ ..+...|||||||++++++++++.+.+. T Consensus 109 ~~~~~~~~~~~~~--~~t~g~~a~PFl~pA~~~~k~~i~~~i~ 149 (149) T protein:vir:10 109 KIPWSFKGDDGEW--YTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred cccceeeccccce--ecCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 11111111111 2345889999999999999999888777 No 103 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=95.59 E-value=3.7e-05 Score=44.86 Aligned_cols=92 Identities=17% Similarity=0.122 Sum_probs=47.0 Q ss_pred CcceecccchHHH----HHHHHHHhhC-------CeEEEEeecCCC-----chHHHHHhhhhcCceeccCccc------- Q lcl|NC_019544. 1 MKVTIKDTNNIDK----ITRNLQQLGG-------KQIKVGLFGKDD-----SELVMIGAVHEYGAEIPVTPKM------- 57 (168) Q Consensus 1 M~v~i~~~~~~~~----~~~~l~~l~~-------~~v~VGi~~~~g-----~~~a~iA~~~E~G~~i~~~~~~------- 57 (168) |.--++ +.+++ +.+..+.+.. .++.+-+- ++| ...+.||.+.|||+.+....+. T Consensus 1 v~~~v~--~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~-~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~ 77 (116) T protein:vir:97 1 MERWVK--RGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK-DGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKI 77 (116) T ss_pred ChHHHH--HHHHHHHHHHHHHHHHhCCcCcccccccceEEee-cCcEEEEEecCCCcccccccCCcccccCCCccccccc Confidence 222222 12222 2222232221 12222111 111 1347799999999876432211 Q ss_pred -cccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_019544. 58 -RAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIE 99 (168) Q Consensus 58 -~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~ 99 (168) +.+....|.. ..+..+|+||||++++++++..+.+.+. T Consensus 78 ~~~~~~~~g~~----~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 78 PWSYKDANGKW----HTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ceeeecCCcee----eecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 1222222222 1245799999999999999998877666 No 104 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=95.59 E-value=3.7e-05 Score=44.86 Aligned_cols=92 Identities=17% Similarity=0.122 Sum_probs=47.0 Q ss_pred CcceecccchHHH----HHHHHHHhhC-------CeEEEEeecCCC-----chHHHHHhhhhcCceeccCccc------- Q lcl|NC_019544. 1 MKVTIKDTNNIDK----ITRNLQQLGG-------KQIKVGLFGKDD-----SELVMIGAVHEYGAEIPVTPKM------- 57 (168) Q Consensus 1 M~v~i~~~~~~~~----~~~~l~~l~~-------~~v~VGi~~~~g-----~~~a~iA~~~E~G~~i~~~~~~------- 57 (168) |.--++ +.+++ +.+..+.+.. .++.+-+- ++| ...+.||.+.|||+.+....+. T Consensus 1 v~~~v~--~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~-~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~ 77 (116) T protein:vir:12 1 MERWVK--RGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFK-DGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKI 77 (116) T ss_pred ChHHHH--HHHHHHHHHHHHHHHHhCCcCcccccccceEEee-cCcEEEEEecCCCcccccccCCcccccCCCccccccc Confidence 222222 12222 2222232221 12222111 111 1347799999999876432211 Q ss_pred -cccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_019544. 58 -RAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIE 99 (168) Q Consensus 58 -~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~ 99 (168) +.+....|.. ..+..+|+||||++++++++..+.+.+. T Consensus 78 ~~~~~~~~g~~----~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 78 PWSYKDANGKW----HTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ceeeecCCcee----eecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 1222222222 1245799999999999999998877666 No 105 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=95.54 E-value=5e-06 Score=49.67 Aligned_cols=95 Identities=12% Similarity=0.047 Sum_probs=41.8 Q ss_pred Cc-----ceeccc-chH--------HHHHHH----H----HHhhC-------CeEEEEeecCC-------CchHHHHHhh Q lcl|NC_019544. 1 MK-----VTIKDT-NNI--------DKITRN----L----QQLGG-------KQIKVGLFGKD-------DSELVMIGAV 44 (168) Q Consensus 1 M~-----v~i~~~-~~~--------~~~~~~----l----~~l~~-------~~v~VGi~~~~-------g~~~a~iA~~ 44 (168) |. ++++.+ ..+ ++++++ + +.+.- .++......+. -...+.||.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~ 80 (140) T protein:vir:10 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAP 80 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhh Confidence 22 111111 111 111111 1 11100 11221111111 1234889999 Q ss_pred hhcCce---eccCccccccccccchhhhcccceec---cCCCchhHHHHHHH---HHHHHHH Q lcl|NC_019544. 45 HEYGAE---IPVTPKMRAWFAANGYPLRKETTVIK---IPERSWLRSGYDEN---IDKIAKK 97 (168) Q Consensus 45 ~E~G~~---i~~~~~~~~~~~~~g~~~~~~~~~i~---IP~RpFlr~~~~~~---~~~~~~~ 97 (168) +|||+. |.+..+...++..+|...|.. .++ .++||||++++++. ..++..- T Consensus 81 Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k--~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:10 81 VHEGSRPHAIRARNAQYLHFWWHGREMFRK--SVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred hccCCCCceeecCCCccceeecCCCEEEee--eeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 999984 333333333334455544333 445 45999999999874 3333222 No 106 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=95.54 E-value=5e-06 Score=49.67 Aligned_cols=95 Identities=12% Similarity=0.047 Sum_probs=41.8 Q ss_pred Cc-----ceeccc-chH--------HHHHHH----H----HHhhC-------CeEEEEeecCC-------CchHHHHHhh Q lcl|NC_019544. 1 MK-----VTIKDT-NNI--------DKITRN----L----QQLGG-------KQIKVGLFGKD-------DSELVMIGAV 44 (168) Q Consensus 1 M~-----v~i~~~-~~~--------~~~~~~----l----~~l~~-------~~v~VGi~~~~-------g~~~a~iA~~ 44 (168) |. ++++.+ ..+ ++++++ + +.+.- .++......+. -...+.||.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~ 80 (140) T protein:vir:97 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAP 80 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhh Confidence 22 111111 111 111111 1 11100 11221111111 1234889999 Q ss_pred hhcCce---eccCccccccccccchhhhcccceec---cCCCchhHHHHHHH---HHHHHHH Q lcl|NC_019544. 45 HEYGAE---IPVTPKMRAWFAANGYPLRKETTVIK---IPERSWLRSGYDEN---IDKIAKK 97 (168) Q Consensus 45 ~E~G~~---i~~~~~~~~~~~~~g~~~~~~~~~i~---IP~RpFlr~~~~~~---~~~~~~~ 97 (168) +|||+. |.+..+...++..+|...|.. .++ .++||||++++++. ..++..- T Consensus 81 Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k--~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:97 81 VHEGSRPHAIRARNAQYLHFWWHGREMFRK--SVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred hccCCCCceeecCCCccceeecCCCEEEee--eeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 999984 333333333334455544333 445 45999999999874 3333222 No 107 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=95.54 E-value=6.4e-05 Score=43.59 Aligned_cols=104 Identities=13% Similarity=0.148 Sum_probs=54.7 Q ss_pred CcceecccchHHHHHHHHH--HhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCcc--------------cc------ Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQ--QLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPK--------------MR------ 58 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~--~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~--------------~~------ 58 (168) -+-+.+-...+.++.+.+. ...+....++++. ..+..||++|.||.+.++... ++ T Consensus 65 ~~~k~k~~rm~~kL~~~~~~~~~~~~~~~~~~~~---g~~~~IA~vHQ~G~~~rv~~~~~~~~~~~~~~~pATr~QAk~L 141 (231) T protein:vir:37 65 VDGEIKNKRLLKKVLRYASILAEERGKGRIYYKN---PLTGEIAQKQQDGFTEHFRVFATDKNKNGSGNDRATIRQAQKL 141 (231) T ss_pred cccchhhHHHHHHhHHhhccccccCCceEEeeec---chHHHHHHHhhcCcccccchhhhhhccCCCCCCCCCHHHHHHH Confidence 2211111112444444332 2233334444442 368899999999997655311 00 Q ss_pred ---ccccccc---------------------------hhhhc-----------ccceeccCCCchhHHHHHHHHHHHHHH Q lcl|NC_019544. 59 ---AWFAANG---------------------------YPLRK-----------ETTVIKIPERSWLRSGYDENIDKIAKK 97 (168) Q Consensus 59 ---~~~~~~g---------------------------~~~~~-----------~~~~i~IP~RpFlr~~~~~~~~~~~~~ 97 (168) +|....| ...+. .+-+|.+|+||||-..- ++..+. T Consensus 142 r~lGy~v~~~k~k~~k~~~rkps~kwI~~~ls~~qAgliIR~L~~k~~~~~~k~~W~I~~paR~FLG~~~----~e~~~~ 217 (231) T protein:vir:37 142 RSLGYRKRNGKNRQGKTKYRLYTIKEIRERLTRTWASMEIRRLENKVNAGNGKTNWEIHVPARPFLDTRE----KENVDI 217 (231) T ss_pred HHhcccccCCCCCCCCCCcCcCCHHHHHHhhhhHHHHHHHHHHhcccccccCcceeeeecCcccccCCCH----HHHHHH Confidence 1111111 00111 12348999999996554 455667 Q ss_pred HHHHHHHHHhccCc Q lcl|NC_019544. 98 IEKMVPDVIEGNVN 111 (168) Q Consensus 98 ~~~~~~~~l~G~~~ 111 (168) +...+.+++.|... T Consensus 218 l~~~l~~i~~~~~~ 231 (231) T protein:vir:37 218 LREITLKFLSGEYK 231 (231) T ss_pred HHHHHHHHhcccCC Confidence 77778888888766 No 108 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=95.43 E-value=0.00014 Score=41.64 Aligned_cols=79 Identities=13% Similarity=0.210 Sum_probs=55.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCChHHHHHhcCC-CCcchhHHHH Q lcl|NC_019544. 87 YDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL------KDPPNSQMTIERKGS-DNPLIDTGRL 159 (168) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~------~~ppnsp~Ti~~KG~-~~PLiDTG~L 159 (168) +++ -+++.+.+...+..+ . ..+...+|..||..+....++.|.+. .|+|+|+.|.++||. .+++.+++.+ T Consensus 1 m~~-~~~l~~~L~~ll~~l-~-~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~~~~~~~~l~~ 77 (148) T protein:vir:79 1 MSE-SRELEAWLAGMLTKL-D-APARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRIRRAMFMRLRL 77 (148) T ss_pred Ccc-HHHHHHHHHHHHHhc-C-ChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccccccccchhhh Confidence 222 233333333333331 1 12335789999999999999999874 367889999988886 4689999999 Q ss_pred HhHhccccC Q lcl|NC_019544. 160 VGSIRHTVE 168 (168) Q Consensus 160 ~~SIty~V~ 168 (168) ..++++.+. T Consensus 78 ~~~l~~~~~ 86 (148) T protein:vir:79 78 ARYMKTQAD 86 (148) T ss_pred hhheeeeee Confidence 999988877 No 109 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=95.35 E-value=0.00015 Score=41.58 Aligned_cols=79 Identities=8% Similarity=0.157 Sum_probs=54.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCChHHHHHhcCC--CCcchhHHH Q lcl|NC_019544. 87 YDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL------KDPPNSQMTIERKGS--DNPLIDTGR 158 (168) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~------~~ppnsp~Ti~~KG~--~~PLiDTG~ 158 (168) +++ -+++.+.+...+.++- ....+.+|..||..+....++.|... .|+|+++.|++.|.+ .++|..++. T Consensus 1 m~~-~~~~~~~l~~ll~~L~--~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~~~~~~~~~l~ 77 (149) T protein:vir:18 1 MSE-LTALQERLAGLIASLS--PAARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGRIKREMFAKLR 77 (149) T ss_pred Cch-HHHHHHHHHHHHHhcC--CchHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCcccchhhhhhh Confidence 222 1222233333333211 11346789999999999999999885 478999999987653 468999999 Q ss_pred HHhHhccccC Q lcl|NC_019544. 159 LVGSIRHTVE 168 (168) Q Consensus 159 L~~SIty~V~ 168 (168) +.+++++.+. T Consensus 78 ~~~~l~~~~~ 87 (149) T protein:vir:18 78 TSRFMKAKGS 87 (149) T ss_pred hhhhhheeec Confidence 9999988777 No 110 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=95.22 E-value=1.8e-05 Score=46.62 Aligned_cols=96 Identities=18% Similarity=0.162 Sum_probs=42.7 Q ss_pred Ccceec--ccchH----HHHHHHHHHhhC-------CeEEEEeecCCC-----chHHHHHhhhhcCceeccC-ccccc-- Q lcl|NC_019544. 1 MKVTIK--DTNNI----DKITRNLQQLGG-------KQIKVGLFGKDD-----SELVMIGAVHEYGAEIPVT-PKMRA-- 59 (168) Q Consensus 1 M~v~i~--~~~~~----~~~~~~l~~l~~-------~~v~VGi~~~~g-----~~~a~iA~~~E~G~~i~~~-~~~~~-- 59 (168) +.-.+. ....+ ..+....+.+.. .++.+-+.. +| ..-+.||.+.|||+..... +..+. T Consensus 30 ~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~-~g~~~~V~~~~~YA~~VE~GT~~~~~~~~~~~~~ 108 (149) T protein:vir:94 30 FDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFD-GGLSSVISVGADYAIYVEYGTGIYATGPGGSRAT 108 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEeeC-CcEEEEEecCCCcccccccCccccccCCCccccc Confidence 100000 00000 011111111111 112221111 11 1236799999999865321 11111 Q ss_pred ---cccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHH Q lcl|NC_019544. 60 ---WFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIE 99 (168) Q Consensus 60 ---~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~ 99 (168) ++.......+ ..+...||||||++++++++.++.+.+. T Consensus 109 ~~~~~~~~~~~~~--~~~~g~~a~PFl~pA~~~~~~~i~~~i~ 149 (149) T protein:vir:94 109 KIPWSFKGDDGEW--YTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred cccceeecCccce--ecCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 1111111111 2245789999999999999998888776 No 111 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=94.73 E-value=2.5e-05 Score=45.83 Aligned_cols=96 Identities=16% Similarity=0.185 Sum_probs=44.8 Q ss_pred Ccceecccch-------HHHHHHH-HHHh-------hC-----------CeEEEE--eecCCC------chHHHHHhhhh Q lcl|NC_019544. 1 MKVTIKDTNN-------IDKITRN-LQQL-------GG-----------KQIKVG--LFGKDD------SELVMIGAVHE 46 (168) Q Consensus 1 M~v~i~~~~~-------~~~~~~~-l~~l-------~~-----------~~v~VG--i~~~~g------~~~a~iA~~~E 46 (168) |-++++...+ +..++++ |+.. .+ .++... .-+..+ ..-+.||.++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~V~~~~~YA~~ve 80 (137) T protein:vir:10 1 MTVTARYERNPVGEARQFQVIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIVVAGPLRLDSGVTAHADYARYVH 80 (137) T ss_pred CeeEEEeccCchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeeeccccceEEEEecCCCccceeee Confidence 6655543321 1111111 1111 00 111111 111000 02378999999 Q ss_pred cCce---eccCc-cccccccccchhhhccccee---ccCCCchhHHHHHHHHHHHHHHH Q lcl|NC_019544. 47 YGAE---IPVTP-KMRAWFAANGYPLRKETTVI---KIPERSWLRSGYDENIDKIAKKI 98 (168) Q Consensus 47 ~G~~---i~~~~-~~~~~~~~~g~~~~~~~~~i---~IP~RpFlr~~~~~~~~~~~~~~ 98 (168) ||+. |.+.. +....+...|..+|.. ++ .+|+||||+++++++..+-...- T Consensus 81 ~GT~ph~I~Pk~~k~~l~~~~~g~~vf~k--~V~hPG~~a~PfL~~A~~~~~~~~~~~~ 137 (137) T protein:vir:10 81 DGTRAHVIRPRRPGGVLRFTVGGRVVYAR--RVNHPGTRARPFLRNAAERVVARETATS 137 (137) T ss_pred cCCCCceeeccccceeeeEeeCCeeEecc--eeecCCCCCCchHHHHHHHhhhhhcccC Confidence 9974 44332 2232333344444432 33 46699999999999887654322 No 112 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=94.27 E-value=1.2e-05 Score=47.48 Aligned_cols=99 Identities=15% Similarity=0.150 Sum_probs=42.4 Q ss_pred Ccce-ec-ccchH--------HHHHH--------HHHHhhC-------CeEEEEeecCCC-------chHHHHHhhhhcC Q lcl|NC_019544. 1 MKVT-IK-DTNNI--------DKITR--------NLQQLGG-------KQIKVGLFGKDD-------SELVMIGAVHEYG 48 (168) Q Consensus 1 M~v~-i~-~~~~~--------~~~~~--------~l~~l~~-------~~v~VGi~~~~g-------~~~a~iA~~~E~G 48 (168) |-.- ++ +...+ ++.++ +.+.+.- .++.+.+-.++| ...+.||.++||| T Consensus 1 ~~~~~~~l~~~~l~~~~~~~~~~~~~~~a~~ve~~ak~~aPv~TG~Lr~SI~~~~~~~~g~~v~~~V~~~~~YA~~ve~G 80 (137) T protein:vir:10 1 MVAHTLRIERAQLHGLGMDEARKAVNRVVRRTFTRSQILAPVDTGYLRASGRLVLGRERGAVVIGSVEYTARYAAAVHNG 80 (137) T ss_pred CcccccccChhhHhhHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhccceeeeeeccccEEEEEecCCcccceeeecC Confidence 2100 00 00011 11111 1111100 122222211111 1457899999999 Q ss_pred c---eeccCccccccccccchhhhcccceeccC---CCchhHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|NC_019544. 49 A---EIPVTPKMRAWFAANGYPLRKETTVIKIP---ERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVN 111 (168) Q Consensus 49 ~---~i~~~~~~~~~~~~~g~~~~~~~~~i~IP---~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~ 111 (168) + .|+++.+....+...|...|.. .++.| +||||++++++....- + +.+.+. T Consensus 81 T~ph~I~pk~~kaL~f~~~G~~vf~k--~V~hPG~k~~PfL~~Al~~~~~~~------~----~~~~~~ 137 (137) T protein:vir:10 81 RRALTIRAKGNGRLKFTVEGRTVYAR--SVHQPARAGRPYLSQALREVAPQE------G----FRVTIG 137 (137) T ss_pred CCCceeecCCCccceeecCCeeEecc--ceecCCCCCChhhHHHHHHhhccc------c----eeEeeC Confidence 8 4554444333334445543333 45545 9999999999865431 1 111111 No 113 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=94.00 E-value=4.9e-05 Score=44.20 Aligned_cols=76 Identities=9% Similarity=0.187 Sum_probs=42.5 Q ss_pred Ccceec------------------------ccchHHHHHHHHHHhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCcc Q lcl|NC_019544. 1 MKVTIK------------------------DTNNIDKITRNLQQLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPK 56 (168) Q Consensus 1 M~v~i~------------------------~~~~~~~~~~~l~~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~ 56 (168) |....+ ++.++.++..+++.- .-+.||+. .+-+-|+-.+|||+ T Consensus 19 ~g~~~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkkik~~~kk~--g~~~VG~~----ks~~fy~kF~EFGT------- 85 (119) T protein:vir:10 19 DMVLDESTKRKGIKAGITKIGKAIEKNSPIKSGRLSKVKIRVKNT--GLATEGTA----SSSEFYDIFQNFGT------- 85 (119) T ss_pred hhhhhHHHHHHHHHHHhHHHHHHHhhcCCcccCCcceeeeeeecC--ceeEeccC----Ccchhhhhhccccc------- Confidence 221111 111122211111111 12444442 23345666777774 Q ss_pred ccccccccchhhhcccceeccCCC-chhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019544. 57 MRAWFAANGYPLRKETTVIKIPER-SWLRSGYDENIDKIAKKIEKMVPDVIE 107 (168) Q Consensus 57 ~~~~~~~~g~~~~~~~~~i~IP~R-pFlr~~~~~~~~~~~~~~~~~~~~~l~ 107 (168) -..|+| ||+.+++++..++....+...+.+=++ T Consensus 86 ------------------Skm~a~~pF~~~a~~~~~~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 86 ------------------SEQKAHVGYFDRAVDETTNEAVEEVAEIIFRKMR 119 (119) T ss_pred ------------------cccCCCCCccccccccChHHHHHHHHHHHHHhcC Confidence 578999 999999999999988888777766444 No 114 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=93.66 E-value=0.00014 Score=41.78 Aligned_cols=61 Identities=15% Similarity=0.052 Sum_probs=33.0 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHhH Q lcl|NC_019544. 83 LRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVGS 162 (168) Q Consensus 83 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~S 162 (168) |-.++ ...+++.+.++..-.++- ...+++++..+..+++.+|. ++| +|||.|++| T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~~---~~~~~~~~~~a~~i~~~ak~---------~aP------------vdTG~Lr~S 55 (137) T protein:vir:93 1 MAKVK-YGNWDLVKELENYERDME---RWVKRGIAKTTAKIHNTIIS---------LMP------------VDTGYLRES 55 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHH---------hCC------------ccccchhcc Confidence 33322 234444444444333321 12345555555555554443 122 599999999 Q ss_pred hccccC Q lcl|NC_019544. 163 IRHTVE 168 (168) Q Consensus 163 Ity~V~ 168 (168) |+++++ T Consensus 56 I~~~~~ 61 (137) T protein:vir:93 56 VTMDFK 61 (137) T ss_pred ceeEee Confidence 999998 No 115 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=93.66 E-value=0.00014 Score=41.78 Aligned_cols=61 Identities=15% Similarity=0.052 Sum_probs=33.0 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHhH Q lcl|NC_019544. 83 LRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVGS 162 (168) Q Consensus 83 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~S 162 (168) |-.++ ...+++.+.++..-.++- ...+++++..+..+++.+|. ++| +|||.|++| T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~~---~~~~~~~~~~a~~i~~~ak~---------~aP------------vdTG~Lr~S 55 (137) T protein:vir:94 1 MAKVK-YGNWDLVKELENYERDME---RWVKRGIAKTTAKIHNTIIS---------LMP------------VDTGYLRES 55 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHH---------hCC------------ccccchhcc Confidence 33322 234444444444333321 12345555555555554443 122 599999999 Q ss_pred hccccC Q lcl|NC_019544. 163 IRHTVE 168 (168) Q Consensus 163 Ity~V~ 168 (168) |+++++ T Consensus 56 I~~~~~ 61 (137) T protein:vir:94 56 VTMDFK 61 (137) T ss_pred ceeEee Confidence 999998 No 116 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=93.66 E-value=0.00014 Score=41.78 Aligned_cols=61 Identities=15% Similarity=0.052 Sum_probs=33.0 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHhH Q lcl|NC_019544. 83 LRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVGS 162 (168) Q Consensus 83 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~S 162 (168) |-.++ ...+++.+.++..-.++- ...+++++..+..+++.+|. ++| +|||.|++| T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~~---~~~~~~~~~~a~~i~~~ak~---------~aP------------vdTG~Lr~S 55 (137) T protein:vir:97 1 MAKVK-YGNWDLVKELENYERDME---RWVKRGIAKTTAKIHNTIIS---------LMP------------VDTGYLRES 55 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHH---------hCC------------ccccchhcc Confidence 33322 234444444444333321 12345555555555554443 122 599999999 Q ss_pred hccccC Q lcl|NC_019544. 163 IRHTVE 168 (168) Q Consensus 163 Ity~V~ 168 (168) |+++++ T Consensus 56 I~~~~~ 61 (137) T protein:vir:97 56 VTMDFK 61 (137) T ss_pred ceeEee Confidence 999998 No 117 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=93.34 E-value=0.00019 Score=40.95 Aligned_cols=84 Identities=13% Similarity=0.128 Sum_probs=48.6 Q ss_pred Ccce----ecccchHHHHHHHH-------HHhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhh Q lcl|NC_019544. 1 MKVT----IKDTNNIDKITRNL-------QQLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLR 69 (168) Q Consensus 1 M~v~----i~~~~~~~~~~~~l-------~~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~ 69 (168) +... -++...-.-+...+ +.-....+.|||.-. +.+|.+-||| T Consensus 44 tp~~~~~~~~~~~~~~HlaD~I~~s~~~~dg~~~g~~~VG~~k~-----~~~A~f~n~G--------------------- 97 (139) T protein:vir:10 44 TKEKHPNTKGDGGKYGHLSEDIRSAAGDIDGDHNGSSTVGFHNK-----AHIARFLNDG--------------------- 97 (139) T ss_pred cccccCcCCCCCCCCcchhhcceecCcccccccceeeeeCCCCC-----cceEeecccC--------------------- Confidence 1100 00000000011111 111123456777321 4566777777 Q ss_pred cccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccC-cHHH Q lcl|NC_019544. 70 KETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNV-NPRL 114 (168) Q Consensus 70 ~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~-~~~~ 114 (168) ++++|+.||+..|.++.++++.+.+...++++|+... +-+. T Consensus 98 ----T~k~~~~hFie~t~~e~~~evl~a~~~~~k~~l~~~~~~~~~ 139 (139) T protein:vir:10 98 ----TKYIRADHFVDNARDDAKDAVFAAEAEKYQAMIAKANGGGDK 139 (139) T ss_pred ----ccccCCCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCC Confidence 5789999999999999999999999999999888642 2222 No 118 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=93.07 E-value=0.0013 Score=36.32 Aligned_cols=80 Identities=15% Similarity=0.249 Sum_probs=48.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCChHHHHHhcCCCCcchhHHHHH Q lcl|NC_019544. 87 YDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL------KDPPNSQMTIERKGSDNPLIDTGRLV 160 (168) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~------~~ppnsp~Ti~~KG~~~PLiDTG~L~ 160 (168) ++++-.++...+...+.++- ..+...+|..||..+....++.|.+. .|+|+++.+..+|+..+.......|+ T Consensus 1 M~~~~~~~~~~L~~ll~~L~--~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~k~~~~~~~m~~~L~ 78 (152) T protein:vir:10 1 MSEPIEQVKTAFDSLLNNIS--KPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGVKSKIKSGKMFDKIT 78 (152) T ss_pred CchHHHHHHHHHHHHHHhcC--cchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhhcccccchhHHHhhh Confidence 44444444444444444321 12446799999999999999999987 35666666666665544444444455 Q ss_pred hH--hccccC Q lcl|NC_019544. 161 GS--IRHTVE 168 (168) Q Consensus 161 ~S--Ity~V~ 168 (168) .| ++|..- T Consensus 79 ~a~~l~~~a~ 88 (152) T protein:vir:10 79 QPRFMRLRLE 88 (152) T ss_pred hcceeeeeec Confidence 54 444433 No 119 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=93.07 E-value=0.001 Score=36.94 Aligned_cols=80 Identities=10% Similarity=0.216 Sum_probs=55.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCChHHHHHhc-----C--CCCcc Q lcl|NC_019544. 87 YDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL------KDPPNSQMTIERK-----G--SDNPL 153 (168) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~------~~ppnsp~Ti~~K-----G--~~~PL 153 (168) ++++-.++.+.+...+..+- ..+....|..||..+....++.|... .|+|+++.|..++ | ...+| T Consensus 1 m~~~~~~l~~~l~~ll~~l~--~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~~~~~~m 78 (155) T protein:vir:79 1 MTDDLQALERWAGGLLAKLS--PAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGRVKREAM 78 (155) T ss_pred CchHHHHHHHHHHHHHHhcC--ChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCcccchhh Confidence 44444455554444444321 12346789999999999999999885 4677788776543 3 24678 Q ss_pred hhHHHHHhHhccccC Q lcl|NC_019544. 154 IDTGRLVGSIRHTVE 168 (168) Q Consensus 154 iDTG~L~~SIty~V~ 168 (168) .+.+.+-++|+|.+- T Consensus 79 ~~~l~~a~~l~~~~~ 93 (155) T protein:vir:79 79 FRKLRTARYLRIDVD 93 (155) T ss_pred hhhhhhhheeeeeec Confidence 999999999999888 No 120 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=91.25 E-value=0.00053 Score=38.53 Aligned_cols=84 Identities=15% Similarity=0.237 Sum_probs=36.6 Q ss_pred Ccc-eecccchHHHHHHHHHHhhC------------------CeEEEEeecCCCc-----------------------hH Q lcl|NC_019544. 1 MKV-TIKDTNNIDKITRNLQQLGG------------------KQIKVGLFGKDDS-----------------------EL 38 (168) Q Consensus 1 M~v-~i~~~~~~~~~~~~l~~l~~------------------~~v~VGi~~~~g~-----------------------~~ 38 (168) |+- ++.+. -+.|.++|+++.. .+++.-.|..+|. +- T Consensus 1 Ma~i~id~l--a~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~~~g~~~~vv~~~~~ 78 (126) T protein:vir:81 1 MANITIDRL--ADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKEDGYGTTKRIIWNKKH 78 (126) T ss_pred CcccchhhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccchhhccccccccccCCcceEEEeccCC Confidence 663 23221 1334444433221 0111111211110 11 Q ss_pred HHHHhhhhcCceeccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019544. 39 VMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNV 110 (168) Q Consensus 39 a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~ 110 (168) ..++-+.|||.-. ..+ ...|+||||+|+++...+++.+.++..+ .|+. T Consensus 79 ~~l~HLLEfGha~-----------------r~g---GrV~a~Phi~Pa~e~~~~~~~~~i~~~l----~~gg 126 (126) T protein:vir:81 79 YRRVHLLEFGHAK-----------------VNG---GRVKEYPHLRPAYDKHGARLPDELKRVI----ENGG 126 (126) T ss_pred CCceeeeecceec-----------------CCC---CccCCCcchHHHHHHHHHHHHHHHHHHh----hcCC Confidence 2223334444211 000 1379999999999887777666555554 4554 No 121 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=91.09 E-value=0.00048 Score=38.80 Aligned_cols=85 Identities=14% Similarity=0.148 Sum_probs=47.2 Q ss_pred Ccc----eeccc---chHHH-HHH---HHHHhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhh Q lcl|NC_019544. 1 MKV----TIKDT---NNIDK-ITR---NLQQLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLR 69 (168) Q Consensus 1 M~v----~i~~~---~~~~~-~~~---~l~~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~ 69 (168) ... +-++. ..+.. +.- .++....-.+.|||.- . +.+|.+-|+| T Consensus 44 tp~~~~~~~~~~~~~~HlaD~I~~~~~~idg~~~g~~~VG~~~----~-~~~Ahf~n~G--------------------- 97 (139) T protein:vir:10 44 TKEKHPNTKGDGGKYGHLSEDISSAAGDIDGDHNGSSTVGFHN----K-AHIARFLNDG--------------------- 97 (139) T ss_pred cccccccCCCCCCCCCcccccceecCccccccccccceeCCCC----C-ceeeeeeccC--------------------- Confidence 000 00000 01110 000 0111112346677731 1 3455666666 Q ss_pred cccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHH Q lcl|NC_019544. 70 KETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLF 115 (168) Q Consensus 70 ~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~ 115 (168) ++++|+.+|+..|.++.++++.+.+...++++|....--++- T Consensus 98 ----T~~~~~~hFie~t~~e~~~ev~~a~~~~~ke~l~~~~~~~~~ 139 (139) T protein:vir:10 98 ----TKNIRADHFVDNARDDAKDAVFAAEAEKYQAMIAKANGGDSK 139 (139) T ss_pred ----ccccCCCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCC Confidence 578999999999999999999999999999998864321111 No 122 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=91.02 E-value=0.0005 Score=38.68 Aligned_cols=61 Identities=13% Similarity=0.036 Sum_probs=33.7 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHhH Q lcl|NC_019544. 83 LRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVGS 162 (168) Q Consensus 83 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~S 162 (168) |-..+ ...+++.+.++..-.++. ...+++|...+..+++.+|.. + | +|||.|++| T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~---~~~~~~l~~~a~~~~~~ak~~---------~-----------p-vdTG~L~~S 55 (137) T protein:vir:96 1 MAKVK-YGNWDLVAELEDYRDEME---EWVKKGILKTTLAIYNTAVAL---------A-----------P-VDLGFLKES 55 (137) T ss_pred CchhH-hhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh---------C-----------C-cCccchhcC Confidence 22211 233444444444333321 134556666666666655532 1 2 589999999 Q ss_pred hccccC Q lcl|NC_019544. 163 IRHTVE 168 (168) Q Consensus 163 Ity~V~ 168 (168) |+++|+ T Consensus 56 i~~~~~ 61 (137) T protein:vir:96 56 IDFKVT 61 (137) T ss_pred ceeEee Confidence 999988 No 123 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=91.00 E-value=0.0012 Score=36.64 Aligned_cols=75 Identities=13% Similarity=0.237 Sum_probs=41.7 Q ss_pred CcceecccchHHHHHHHHHHh---------hC------------------------------------------CeEEEE Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQL---------GG------------------------------------------KQIKVG 29 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l---------~~------------------------------------------~~v~VG 29 (168) |||.+++. +++++.|+.. .+ +.|+|| T Consensus 1 MsvevkGv---~eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~~G~r~V~vg 77 (134) T protein:vir:10 1 MSVKVTGD---KALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWIKGKRTVTIR 77 (134) T ss_pred CeEEeecH---HHHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeecCCceEEEEE Confidence 99998864 3444444332 11 234444 Q ss_pred eecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCCCchhHH--------HHHHHHHHHHHHHHHH Q lcl|NC_019544. 30 LFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRS--------GYDENIDKIAKKIEKM 101 (168) Q Consensus 30 i~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~--------~~~~~~~~~~~~~~~~ 101 (168) |-++- .-..|-.+||||.+ .-...+|++| ++++.+..+.+.++.. T Consensus 78 W~G~~--~R~~ivHLnE~Gyt-------------------------~~r~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~e 130 (134) T protein:vir:10 78 WRGPF--ERFRIVHLIENGHV-------------------------EKKSGKFVKPKAMGGINRAIRQGQNKYFETLKRE 130 (134) T ss_pred EEcCC--ceeeEEEeeeccee-------------------------ecCCCCeeccchhhHHHHHHHhhhHHHHHHHHHH Confidence 43221 11223344555532 2245556666 8888888888888888 Q ss_pred HHHH Q lcl|NC_019544. 102 VPDV 105 (168) Q Consensus 102 ~~~~ 105 (168) +.++ T Consensus 131 L~kl 134 (134) T protein:vir:10 131 LKKL 134 (134) T ss_pred HhcC Confidence 8776 No 124 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=90.42 E-value=0.0013 Score=36.36 Aligned_cols=79 Identities=10% Similarity=0.027 Sum_probs=48.2 Q ss_pred CcceecccchHHHHHHHHHHhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCCC Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPER 80 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~R 80 (168) |.=.|...+. .++-.....+.|||... .-+.+|.+.+||+ +++|+. T Consensus 61 laD~I~~~~~------~~DG~~dg~s~VG~~~~---~~~~~A~f~n~GT-------------------------~k~~~~ 106 (141) T protein:vir:50 61 MADGLAIQST------NADGRKNGVSTVGWKNN---YHAQNARRLNDGT-------------------------KKYRAD 106 (141) T ss_pred cccceeeccC------ccccccCCeeeeccCCC---ccceeeeccccCc-------------------------cccCCC Confidence 3322111000 01111223567888532 2467888888884 689999 Q ss_pred chhHHHHHHH--HHHHHHHHHHHHHHHHhccCcHH Q lcl|NC_019544. 81 SWLRSGYDEN--IDKIAKKIEKMVPDVIEGNVNPR 113 (168) Q Consensus 81 pFlr~~~~~~--~~~~~~~~~~~~~~~l~G~~~~~ 113 (168) ||+..+.++. ++++.+.+...++++|+-..--+ T Consensus 107 hFve~~~~~a~~k~~Vl~A~~~~~k~~l~~~~~~~ 141 (141) T protein:vir:50 107 HFVTNVQNDSTVQKKVLLEKKRNTKNSLEEKEGCD 141 (141) T ss_pred chhHHHHHhhhhHHHHHHHHHHHHHHHHHhccCCC Confidence 9999999864 57788888888888776432212 No 125 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=90.36 E-value=0.0043 Score=33.56 Aligned_cols=80 Identities=10% Similarity=0.146 Sum_probs=50.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCChHHHHHhcCC--CC--cchhH Q lcl|NC_019544. 87 YDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL------KDPPNSQMTIERKGS--DN--PLIDT 156 (168) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~------~~ppnsp~Ti~~KG~--~~--PLiDT 156 (168) +++.-.++.+.+...+.++ . ..+....|..||..+....++.|... .|+|+++.|++.|.. .+ ++... T Consensus 1 m~~~~~~l~~~L~~ll~~L-~-~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~~~~~m~~~ 78 (156) T protein:vir:11 1 MADSLEALEDWAGPILRAL-E-PGPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQGRIRRKIKMFQK 78 (156) T ss_pred CchhHHHHHHHHHHHHHhc-C-CcchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhccccccchhhhhh Confidence 4555555555555555442 1 12457799999999999999999874 477899999987532 22 33333 Q ss_pred HHHHhHhccccC Q lcl|NC_019544. 157 GRLVGSIRHTVE 168 (168) Q Consensus 157 G~L~~SIty~V~ 168 (168) ..+-.+|++.+- T Consensus 79 l~~~~~l~~~~~ 90 (156) T protein:vir:11 79 LRTVRYLRAKGD 90 (156) T ss_pred hhhhheeeeeec Confidence 333333555544 No 126 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=89.31 E-value=0.0011 Score=36.81 Aligned_cols=40 Identities=18% Similarity=0.173 Sum_probs=23.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 112 PRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 112 ~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~SIty~V~ 168 (168) +++++.+.-......|+..... ++ | +|||.|++||++.++ T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~-----~a-----------p-v~TG~Lr~SI~~~~~ 40 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIIS-----LM-----------P-VDTGYLRESVTMDFK 40 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHh-----hC-----------C-ccccccccceeEEee Confidence 3333333333334444444433 22 3 589999999999998 No 127 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=89.08 E-value=0.0035 Score=34.06 Aligned_cols=61 Identities=15% Similarity=0.106 Sum_probs=34.3 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHhH Q lcl|NC_019544. 83 LRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVGS 162 (168) Q Consensus 83 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~S 162 (168) |-.. ...-+++.+.++..-.++. ..++.+|+..+..+++.+|.. +| +|||.|++| T Consensus 1 Ma~~-~~Gl~~l~~~l~~~~~~~~---~~~~~al~~~a~~i~~~ak~~---------aP------------vdTG~Lr~S 55 (137) T protein:vir:10 1 MAKV-KYGNWELVKELEDFEKETI---RWAKKGIAKTTTIIHNSIVSN---------MP------------VDTGYLRES 55 (137) T ss_pred Cchh-HhhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh---------CC------------cCcchhhcC Confidence 2111 1122333333333222221 144667777777777777664 22 599999999 Q ss_pred hccccC Q lcl|NC_019544. 163 IRHTVE 168 (168) Q Consensus 163 Ity~V~ 168 (168) |+++++ T Consensus 56 I~~~~~ 61 (137) T protein:vir:10 56 VSMDFK 61 (137) T ss_pred eeEEee Confidence 999888 No 128 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=89.07 E-value=0.0036 Score=33.96 Aligned_cols=61 Identities=20% Similarity=0.089 Sum_probs=30.7 Q ss_pred hcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcC Q lcl|NC_019544. 69 RKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKG 148 (168) Q Consensus 69 ~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG 148 (168) +. . +.+ ++++ +.+.+++.-.++- ...+++|...+..+++.++.. +| T Consensus 1 Ma---~-------~~~-Gl~~----l~~~l~~~~~~~~---~~~~~al~~~a~~v~~~ak~~---------ap------- 46 (135) T protein:vir:96 1 MA---K-------VKY-GADS----IVVDLEKYSKDME---KWVKKGITKTTLKIYNTAIHL---------MP------- 46 (135) T ss_pred Cc---h-------hhh-hHHH----HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh---------CC------- Confidence 00 0 001 3333 3333333322211 133555665555555554432 22 Q ss_pred CCCcchhHHHHHhHhccccC Q lcl|NC_019544. 149 SDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 149 ~~~PLiDTG~L~~SIty~V~ 168 (168) +|||.|++||+++|+ T Consensus 47 -----vdTG~Lr~SI~~~~~ 61 (135) T protein:vir:96 47 -----VDTGFLRQSTTVDFE 61 (135) T ss_pred -----ccchhhhcceeEEee Confidence 799999999999988 No 129 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=89.05 E-value=0.00098 Score=37.08 Aligned_cols=40 Identities=18% Similarity=0.149 Sum_probs=23.3 Q ss_pred HHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 98 IEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 98 ~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~SIty~V~ 168 (168) +++ -++++++..+..+++.+|. ++| +|||.|++||+++++ T Consensus 1 v~~----------~v~~~~~~~~~~i~~~ak~---------~aP------------v~TG~Lr~SI~~~~~ 40 (116) T protein:vir:97 1 MER----------WVKRGIAKTTAKIHNTIIS---------LMP------------VDTGYLRESVTMDFK 40 (116) T ss_pred ChH----------HHHHHHHHHHHHHHHHHHH---------hCC------------cCcccccccceEEee Confidence 111 1233444444444444433 222 589999999999998 No 130 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=89.05 E-value=0.00098 Score=37.08 Aligned_cols=40 Identities=18% Similarity=0.149 Sum_probs=23.3 Q ss_pred HHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 98 IEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 98 ~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~SIty~V~ 168 (168) +++ -++++++..+..+++.+|. ++| +|||.|++||+++++ T Consensus 1 v~~----------~v~~~~~~~~~~i~~~ak~---------~aP------------v~TG~Lr~SI~~~~~ 40 (116) T protein:vir:12 1 MER----------WVKRGIAKTTAKIHNTIIS---------LMP------------VDTGYLRESVTMDFK 40 (116) T ss_pred ChH----------HHHHHHHHHHHHHHHHHHH---------hCC------------cCcccccccceEEee Confidence 111 1233444444444444433 222 589999999999998 No 131 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=88.91 E-value=0.0026 Score=34.79 Aligned_cols=61 Identities=15% Similarity=0.092 Sum_probs=34.6 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHhH Q lcl|NC_019544. 83 LRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVGS 162 (168) Q Consensus 83 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~S 162 (168) |-... ..-+++.+.++..-.++- -.++++|+..+..+++.+|.. +| +|||.|++| T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~---~~~~~al~~~a~~i~~~ak~~---------aP------------v~TG~Lr~S 55 (137) T protein:vir:10 1 MAKVK-YGNWDLVKELEEFEKETI---RWAKKGIAKTTTIIHNSIVSN---------MP------------VDTGYLRES 55 (137) T ss_pred Cccch-hCHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh---------CC------------cCcchhhcC Confidence 11110 122333333333332221 144667777777777766654 22 599999999 Q ss_pred hccccC Q lcl|NC_019544. 163 IRHTVE 168 (168) Q Consensus 163 Ity~V~ 168 (168) |+++++ T Consensus 56 I~~~~~ 61 (137) T protein:vir:10 56 VSMDFK 61 (137) T ss_pred eeeEec Confidence 999988 No 132 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=88.14 E-value=0.0014 Score=36.18 Aligned_cols=89 Identities=12% Similarity=0.068 Sum_probs=48.3 Q ss_pred Cc--ceecccchHHHHHHHHHHhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccC Q lcl|NC_019544. 1 MK--VTIKDTNNIDKITRNLQQLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIP 78 (168) Q Consensus 1 M~--v~i~~~~~~~~~~~~l~~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP 78 (168) |. +.+++.+ .+......+.|||.... -+.+|.+.|+|+ +++| T Consensus 61 laD~I~~s~~~--------idG~~dG~s~VG~~~~~---~a~~a~f~n~GT-------------------------~km~ 104 (153) T protein:vir:49 61 MADGLAVQSTN--------ADGRKNGVSTVGWKNNY---HAQNARRLNDGT-------------------------KKYR 104 (153) T ss_pred ccccceecccc--------ccccccceeeecccCCc---cceeeeecccCc-------------------------ccCC Confidence 22 1111100 01111235689997432 467888899994 6899 Q ss_pred CCchhHHHHHHH--HHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCC Q lcl|NC_019544. 79 ERSWLRSGYDEN--IDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSD 150 (168) Q Consensus 79 ~RpFlr~~~~~~--~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~ 150 (168) +.||++.+.++. ++++.+.+...+.++|+-+... . +|.+..+-|.-. T Consensus 105 ~~hFie~tr~e~~~k~~vl~A~~~~~~~il~~~~~~---------~----------------~~~~~~~~~~~~ 153 (153) T protein:vir:49 105 ADHFITNVQNDSTVKNKVLLAEKEEYEKLIRRKGGV---------Y----------------LSASNFKTKRAT 153 (153) T ss_pred CChhhHHHHHHhhHHHHHHHHHHHHHHHHHHhcCCe---------e----------------eeccccccccCC Confidence 999999999876 5677777777777777654321 0 000000000000 No 133 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=88.13 E-value=0.0027 Score=34.71 Aligned_cols=66 Identities=15% Similarity=0.156 Sum_probs=30.9 Q ss_pred CCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHH Q lcl|NC_019544. 78 PERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTG 157 (168) Q Consensus 78 P~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG 157 (168) =+|..++-.+ +..+++.+.+++.-..+. -.++++|...+..+++.+|.. +| +||| T Consensus 1 m~~ms~~i~~-~g~~~l~~~l~~~~~~~~---~~v~~~l~~~a~~i~~~ak~~---------ap------------v~TG 55 (144) T protein:vir:59 1 MALMSVRIDP-SWRRIMSRNVRTFSGHVL---TQVEQVIIKTAEKIAGLAASL---------AP------------VDEG 55 (144) T ss_pred CCcceeeehh-HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh---------CC------------ccch Confidence 1222221111 112223332322222211 023455555555555555432 22 5899 Q ss_pred HHHhHhccccC Q lcl|NC_019544. 158 RLVGSIRHTVE 168 (168) Q Consensus 158 ~L~~SIty~V~ 168 (168) .|++||+++++ T Consensus 56 ~Lr~SI~~~~~ 66 (144) T protein:vir:59 56 NLKNSIQIDYK 66 (144) T ss_pred hhhcCeeEEee Confidence 99999999988 No 134 >protein:vir:78755 Length: 228 # NCBI annotation: putative tail completion protein # Family: family:all:743 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285651;genbank:gi:148727157;genbank:GeneID:5220102 Probab=87.83 E-value=0.0034 Score=34.12 Aligned_cols=113 Identities=17% Similarity=0.216 Sum_probs=54.0 Q ss_pred Ccceeccc----chHHHHHHHHHHhhCCeEEEEeecCC-CchHHHHHhhhhcCceeccCcccc---cc------------ Q lcl|NC_019544. 1 MKVTIKDT----NNIDKITRNLQQLGGKQIKVGLFGKD-DSELVMIGAVHEYGAEIPVTPKMR---AW------------ 60 (168) Q Consensus 1 M~v~i~~~----~~~~~~~~~l~~l~~~~v~VGi~~~~-g~~~a~iA~~~E~G~~i~~~~~~~---~~------------ 60 (168) ..-..... .+|.++++- ......++.|||.... ...+..||++|.||...++.+... +. T Consensus 55 ~~pRKr~krKMl~~L~k~Lk~-~~~~~~~a~v~f~~~~~~~~~~rIA~vHq~G~~~~v~~~~~~~~~~~r~~~~~paTr~ 133 (228) T protein:vir:78 55 WAPRKRGKRKMLRGLPKLLQI-REPRQDMAELGFTKGTMSAHAGVIANTHQKGHTYKVTAASRRRIAPSDVGKNKQASKA 133 (228) T ss_pred ChhhhhhHHHHHhhhHHhhhh-hcccccceEEEeecCcccchHHHHHHHHhcCcccccccchhhhhhcccCCCCCCCCHH Confidence 11110100 112333322 1234457999997532 246899999999998776553211 00 Q ss_pred ----ccccchhh-----------------------------h-------cccceeccCCCchhHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 61 ----FAANGYPL-----------------------------R-------KETTVIKIPERSWLRSGYDENIDKIAKKIEK 100 (168) Q Consensus 61 ----~~~~g~~~-----------------------------~-------~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~ 100 (168) |..-|+.. + ..+-+|.+|+||||-.+-++ +.+.+.. T Consensus 134 QAk~Lr~lGy~~~~~~~k~~rkps~kwI~~nls~gqAgliir~L~~k~~k~~W~I~~PaR~FLG~s~~e----~~~~l~~ 209 (228) T protein:vir:78 134 QARKLRELGFKRPGKRKRAYRSASLGWITANLNYAQAGLLIKKLKDEPVKESWEIQLPARPFLGANARQ----RQQAFAL 209 (228) T ss_pred HHHHHHHhhccccCCcCCCcccCCHHHHHHHhhHHHHHHHHHHHhCCCCccceeeecCcccccCCCHHH----HHHHHHH Confidence 00111110 1 11246899999999555443 3444444 Q ss_pred HHHHHHhc-cCcHHHHHHH Q lcl|NC_019544. 101 MVPDVIEG-NVNPRLFMDA 118 (168) Q Consensus 101 ~~~~~l~G-~~~~~~~l~~ 118 (168) .+..+--| +..+++.=.+ T Consensus 210 ~l~~i~~g~~~~~qd~~~~ 228 (228) T protein:vir:78 210 RPESIDYGWDVNKQDMKGK 228 (228) T ss_pred HHHhcccCCCcchhhccCC Confidence 44444334 3334333222 No 135 >protein:vir:3750 Length: 227 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043491;genbank:gi:9628626;genbank:GeneID:1261131 Probab=87.71 E-value=0.0028 Score=34.56 Aligned_cols=101 Identities=20% Similarity=0.156 Sum_probs=48.0 Q ss_pred Ccceeccc----chHHHHHHHHHHhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCcc---------------cc--- Q lcl|NC_019544. 1 MKVTIKDT----NNIDKITRNLQQLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPK---------------MR--- 58 (168) Q Consensus 1 M~v~i~~~----~~~~~~~~~l~~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~---------------~~--- 58 (168) ..-..... .+|.++++. ........|+|..+ ....||++|.||..+.+... ++ T Consensus 59 ~~pRKr~k~KM~~kL~k~l~~--~~~~~~a~v~f~~g---~~~~IA~vHq~G~~~~v~~~~~~~~~~~~~~~~paTr~QA 133 (227) T protein:vir:37 59 WKKRKNGTAKMLRRIAKLANS--KAEKAQGTLFYKQK---RTGEIAQEHQEGIPHLFKKTEFTGKNKGGIGADPCTLRQA 133 (227) T ss_pred CchhcchhHHHHhhhHHHcce--eecccceEEEecCc---chHHHHHHhhcCcccccchhhhhhhhcCCccccCCCHHHH Confidence 11111000 012222221 13444566887532 47889999999998865321 00 Q ss_pred ------cccccc---------------------------chhh---h----------cccceeccCCCchhHHHHHHHHH Q lcl|NC_019544. 59 ------AWFAAN---------------------------GYPL---R----------KETTVIKIPERSWLRSGYDENID 92 (168) Q Consensus 59 ------~~~~~~---------------------------g~~~---~----------~~~~~i~IP~RpFlr~~~~~~~~ 92 (168) +|-... |... . ..+-+|.+|+||||-.+-+++.. T Consensus 134 k~Lr~lGy~v~~~k~k~~k~~~rkps~kwI~~nls~~qAgliIR~L~~k~~~~~~~~k~~W~I~~PaR~FLG~~~~e~~~ 213 (227) T protein:vir:37 134 KKLKDLGYTVANGKTKNGKAKRRKPTLSEIRSTLSRAKASLIIRKLEEKNGMNPSRHLTQWIIPTEKRSFLDTREEENAK 213 (227) T ss_pred HHHHHhcccccCCCCCCcCCccccCCHHHHHHhhhHHHHHHHHHHHhcccccccccCccceeeecCcccccCCCHHHHHH Confidence 111110 0000 1 11246899999999876655544 Q ss_pred HHHHHHHHHHHHHHhccC Q lcl|NC_019544. 93 KIAKKIEKMVPDVIEGNV 110 (168) Q Consensus 93 ~~~~~~~~~~~~~l~G~~ 110 (168) -+...+.+.-.+ +. T Consensus 214 ~l~r~l~~~~~~----~~ 227 (227) T protein:vir:37 214 IILAEIQKYTQK----QQ 227 (227) T ss_pred HHHHHHHHHhhh----cC Confidence 444444433322 22 No 136 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=87.54 E-value=0.0028 Score=34.54 Aligned_cols=66 Identities=15% Similarity=0.183 Sum_probs=28.8 Q ss_pred hcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcC Q lcl|NC_019544. 69 RKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKG 148 (168) Q Consensus 69 ~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG 148 (168) +. ++.| .++++ +.+.++..-..+. -..+.++.++...++..++...... T Consensus 1 m~---~v~i-------~Gld~----L~~kl~~~~~~~~---~~v~~a~~~~~~~~a~~v~~~ak~~-------------- 49 (182) T protein:vir:10 1 MI---EVEL-------KGVNE----LRAKLKKLPDIMA---KATANAQENAIEQAEAYAVDELQSS-------------- 49 (182) T ss_pred Ce---EEEE-------ecHHH----HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhh-------------- Confidence 00 0000 02222 2222222211110 0123344444444455554444321 Q ss_pred CCCcchhHHHHHhHhccccC Q lcl|NC_019544. 149 SDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 149 ~~~PLiDTG~L~~SIty~V~ 168 (168) -| +|||.|++||+++|. T Consensus 50 --~P-vdtG~Lr~SI~~~~~ 66 (182) T protein:vir:10 50 --IK-YSTGELTRSFKHEVK 66 (182) T ss_pred --CC-CCchhhhhceeeeee Confidence 24 699999999998877 No 137 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=87.50 E-value=0.0029 Score=34.52 Aligned_cols=78 Identities=10% Similarity=0.057 Sum_probs=47.2 Q ss_pred CcceecccchHHHHHHHHHHhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCCC Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPER 80 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~R 80 (168) |.=.|...+ ..++......+.|||... .-+.+|.+.++|+ .++|+- T Consensus 61 laD~I~~~~------~~iDg~~~g~s~VG~~kk---~~a~~A~f~n~GT-------------------------~k~~~~ 106 (140) T protein:vir:48 61 MADGLSVQS------TNVDGRKNGVSTVGWVNR---YHAQNARRLNDGT-------------------------KKYRAD 106 (140) T ss_pred chhceeecc------cccccccCceeeeccCCC---cceeeeeccccCc-------------------------cccCCC Confidence 221111000 001111233567888532 3467888888884 689999 Q ss_pred chhHHHHHHH--HHHHHHHHHHHHHHHHhccCcHH Q lcl|NC_019544. 81 SWLRSGYDEN--IDKIAKKIEKMVPDVIEGNVNPR 113 (168) Q Consensus 81 pFlr~~~~~~--~~~~~~~~~~~~~~~l~G~~~~~ 113 (168) ||+..+.++. +.++.+.....++++|+-. ..+ T Consensus 107 hFve~~~~e~~~k~~vl~A~~~~~~~~l~~~-~~~ 140 (140) T protein:vir:48 107 HFVTNVQNDSAVQTKVLLAEKEEYEKLIRKK-GGE 140 (140) T ss_pred chhHHHHHhhhhHHHHHHHHHHHHHHHHHhh-cCC Confidence 9999999976 6678887777777777643 222 No 138 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=87.11 E-value=0.0018 Score=35.59 Aligned_cols=61 Identities=16% Similarity=0.099 Sum_probs=31.9 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHhH Q lcl|NC_019544. 83 LRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVGS 162 (168) Q Consensus 83 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~S 162 (168) |-..+ ...+++.+.+++.-.++- -..+++++..+..+++.+|.. + | +|||.|++| T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~---~~~~~~~~~~a~~v~~~ak~~---------a-----------P-v~TG~L~~S 55 (137) T protein:vir:95 1 MAKVK-YGNWDLVKELENYERDME---RWVKRGIAKTTAKIHNTIISL---------M-----------P-VDTGYLRES 55 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh---------C-----------C-ccchhhhcC Confidence 32222 233444444443332221 133445555555555544432 2 2 489999999 Q ss_pred hccccC Q lcl|NC_019544. 163 IRHTVE 168 (168) Q Consensus 163 Ity~V~ 168 (168) |+++++ T Consensus 56 i~~~~~ 61 (137) T protein:vir:95 56 VTMDFK 61 (137) T ss_pred eeeEee Confidence 999988 No 139 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=86.80 E-value=0.0029 Score=34.49 Aligned_cols=93 Identities=11% Similarity=0.090 Sum_probs=45.9 Q ss_pred CcceecccchHHHHHHHHHHhhCC------------------eEEEEeecCCCchHHHHHhhhhcCceeccCcccccc-- Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGK------------------QIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAW-- 60 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~------------------~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~-- 60 (168) |+.+|+-++--+.+.+.|++.... .++-+-|..+|. . .-||.+...+..... T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~-y-------aksW~~k~~~~~~~~v~ 72 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPKRTGD-Y-------AKNWTSQKLKNGDQVIY 72 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccc-c-------ccceeeeecCCeeEEEE Confidence 988776544334445555443321 111121211111 0 011222211111000 Q ss_pred ----------ccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 61 ----------FAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPD 104 (168) Q Consensus 61 ----------~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 104 (168) +-..|++.+.+ .-.|+||||+|+.+...+.+.+.++..+.+ T Consensus 73 ~~~~~y~l~HLLE~GHa~r~G---GrV~a~phI~paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 73 QKAPTYRLTHLLENGHAKRNG---GRVSPKVHIAPVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred EecCCcceEEeeecceeecCC---ceeCcchhhhHHHHHHHHHHHHHHHHHhcC Confidence 11122222222 246999999999999999999988888887 No 140 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=86.69 E-value=0.0042 Score=33.60 Aligned_cols=76 Identities=13% Similarity=0.083 Sum_probs=45.7 Q ss_pred CcceecccchHHHHHHHHHHhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCCC Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPER 80 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~R 80 (168) |.=.|...+ ..++-.......|||... .-+.+|.+.++|+ ..+|+. T Consensus 61 laD~I~~~~------~~idg~~dG~s~VG~~k~---~~a~~a~f~NdGT-------------------------~k~~~~ 106 (140) T protein:vir:48 61 MADGLAVQS------TNVDGRKNGVATVGWKNN---YHAQNARRLNDGT-------------------------KKYRAD 106 (140) T ss_pred ccccceecc------cccccccccceeecccCC---CceeEEeecccCc-------------------------cccCCC Confidence 221111000 001111122456888532 2467778888884 589999 Q ss_pred chhHHHHHHH--HHHHHHHHHHHHHHHHh--ccC Q lcl|NC_019544. 81 SWLRSGYDEN--IDKIAKKIEKMVPDVIE--GNV 110 (168) Q Consensus 81 pFlr~~~~~~--~~~~~~~~~~~~~~~l~--G~~ 110 (168) ||+..|.++. ++++.+.....++++++ |+- T Consensus 107 hFve~t~~e~~~~~~vl~A~~~~y~~~l~kk~~~ 140 (140) T protein:vir:48 107 HFVTNVQNDSAVRDKVLLAEKEEYEKLIRKKGGE 140 (140) T ss_pred chHHHHHHhhhhHHHHHHHHHHHHHHHHHhhcCC Confidence 9999999865 67888888888888773 332 No 141 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=86.54 E-value=0.0048 Score=33.31 Aligned_cols=75 Identities=16% Similarity=0.266 Sum_probs=40.3 Q ss_pred CcceecccchHHHHHHHHHHh---------hC------------------------------------------CeEEEE Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQL---------GG------------------------------------------KQIKVG 29 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l---------~~------------------------------------------~~v~VG 29 (168) |+|.+++. +++++.|++. .+ +.|+|| T Consensus 1 msvevkGv---~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vg 77 (134) T protein:vir:95 1 MSVKVIGD---KALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVH 77 (134) T ss_pred CeEEEecH---HHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEE Confidence 88888863 3344433322 00 235555 Q ss_pred eecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCCCchh--------HHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 30 LFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSWL--------RSGYDENIDKIAKKIEKM 101 (168) Q Consensus 30 i~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFl--------r~~~~~~~~~~~~~~~~~ 101 (168) |-++- .-..|-.+||||.+-+ -..+|+ +.+++..+..+.+.++.. T Consensus 78 W~G~~--~R~~iiHLNE~Gytr~-------------------------~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~e 130 (134) T protein:vir:95 78 WRGSK--DRYKIVHLIEYGHVQK-------------------------GTGKFIKPKAMGGVNRAIRQGQNKYFETLKRE 130 (134) T ss_pred EEcCC--ceeEEEEeecccceec-------------------------ccCCccCcchhhHHHHHHHhhhHHHHHHHHHH Confidence 54321 1223344566663210 012344 448888888888888888 Q ss_pred HHHH Q lcl|NC_019544. 102 VPDV 105 (168) Q Consensus 102 ~~~~ 105 (168) +.++ T Consensus 131 L~kl 134 (134) T protein:vir:95 131 LKKL 134 (134) T ss_pred HhcC Confidence 8776 No 142 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=86.54 E-value=0.0048 Score=33.31 Aligned_cols=75 Identities=16% Similarity=0.266 Sum_probs=40.3 Q ss_pred CcceecccchHHHHHHHHHHh---------hC------------------------------------------CeEEEE Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQL---------GG------------------------------------------KQIKVG 29 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l---------~~------------------------------------------~~v~VG 29 (168) |+|.+++. +++++.|++. .+ +.|+|| T Consensus 1 msvevkGv---~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vg 77 (134) T protein:vir:10 1 MSVKVIGD---KALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVH 77 (134) T ss_pred CeEEEecH---HHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEE Confidence 88888863 3344433322 00 235555 Q ss_pred eecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCCCchh--------HHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 30 LFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSWL--------RSGYDENIDKIAKKIEKM 101 (168) Q Consensus 30 i~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFl--------r~~~~~~~~~~~~~~~~~ 101 (168) |-++- .-..|-.+||||.+-+ -..+|+ +.+++..+..+.+.++.. T Consensus 78 W~G~~--~R~~iiHLNE~Gytr~-------------------------~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~e 130 (134) T protein:vir:10 78 WRGSK--DRYKIVHLIEYGHVQK-------------------------GTGKFIKPKAMGGVNRAIRQGQNKYFETLKRE 130 (134) T ss_pred EEcCC--ceeEEEEeecccceec-------------------------ccCCccCcchhhHHHHHHHhhhHHHHHHHHHH Confidence 54321 1223344566663210 012344 448888888888888888 Q ss_pred HHHH Q lcl|NC_019544. 102 VPDV 105 (168) Q Consensus 102 ~~~~ 105 (168) +.++ T Consensus 131 L~kl 134 (134) T protein:vir:10 131 LKKL 134 (134) T ss_pred HhcC Confidence 8776 No 143 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=85.73 E-value=0.0033 Score=34.22 Aligned_cols=57 Identities=14% Similarity=0.159 Sum_probs=29.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHhHh Q lcl|NC_019544. 84 RSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVGSI 163 (168) Q Consensus 84 r~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~SI 163 (168) =.++++-.+.+.+ ....+ +...+++|...+..++..+|. ++| +|||.|++|| T Consensus 1 i~Gld~l~~~l~~----~~~~~---~~~v~~al~~~a~~i~~~ak~---------~aP------------v~TG~Lr~sI 52 (108) T protein:vir:99 1 MRGLDRFLRSVER----KQKSV---RIAVDKELSKSAARIERQAKI---------LAP------------VDTGWLRAQI 52 (108) T ss_pred CchHHHHHHHHHH----HHHHH---HHHHHHHHHHHHHHHHHHHHh---------cCC------------cCchhhhcce Confidence 2344443333322 22211 112345555555555544433 222 7999999999 Q ss_pred ccccC Q lcl|NC_019544. 164 RHTVE 168 (168) Q Consensus 164 ty~V~ 168 (168) ++.+. T Consensus 53 ~~~~~ 57 (108) T protein:vir:99 53 YSEQQ 57 (108) T ss_pred eeeec Confidence 98877 No 144 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=85.68 E-value=0.0035 Score=34.03 Aligned_cols=63 Identities=19% Similarity=0.224 Sum_probs=31.5 Q ss_pred ccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019544. 57 MRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKD 136 (168) Q Consensus 57 ~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ 136 (168) |. .-+ +|- +.+++.+.++.....+- ..++.+|+..+..+++.++.. T Consensus 1 Ma-------------~~~--------~~~----~~~~l~~~l~~~~~~~~---~~~~~~l~~~a~~i~~~ak~~------ 46 (142) T protein:vir:94 1 MA-------------GLN--------YRV----NSTEFQGALRAALDRLT---GAAREATEAAANDMVNMAKGL------ 46 (142) T ss_pred Cc-------------eeE--------EEe----cHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh------ Confidence 00 000 111 22333333333333221 134556666665555554332 Q ss_pred CCChHHHHHhcCCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 137 PPNSQMTIERKGSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 137 ppnsp~Ti~~KG~~~PLiDTG~L~~SIty~V~ 168 (168) +| +|||.|++||+++|+ T Consensus 47 ---aP------------v~TG~Lr~SI~~~~~ 63 (142) T protein:vir:94 47 ---CP------------VDTGRLRSSIQAVPS 63 (142) T ss_pred ---CC------------ccchhhhccceeeec Confidence 22 689999999999998 No 145 >protein:vir:98860 Length: 230 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654736;genbank:gi:109302921;genbank:GeneID:4156065 Probab=85.13 E-value=0.004 Score=33.74 Aligned_cols=103 Identities=17% Similarity=0.108 Sum_probs=45.3 Q ss_pred Ccceecc----cchHHHHHHHHHHhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCcc---------------cc--- Q lcl|NC_019544. 1 MKVTIKD----TNNIDKITRNLQQLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPK---------------MR--- 58 (168) Q Consensus 1 M~v~i~~----~~~~~~~~~~l~~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~---------------~~--- 58 (168) +.-.... ..++.++++-...-+.....++++.. ....||++|.||..+.+... ++ T Consensus 61 w~pRKr~k~KMl~~L~k~l~~~~~~~~~~~v~~~~~~---~~~rIA~vHq~G~~~~~~~~~~~~r~~~~~~~~paTr~QA 137 (230) T protein:vir:98 61 WKPRKNGNAKMLRRIAKTLKFTSADREIKRVCTISRN---AQRRSQKEHQRGAKITNLKSVILRKSRAGTAKDPATMRQA 137 (230) T ss_pred ChhhhhhhHHHHhhhHHHHHHhhcccccceeeeeccc---chhhhhhhhhccchhhhhhhhhhhhhcCCCCcccccHHHH Confidence 1111110 01233333333322233344555532 45779999999998744310 00 Q ss_pred ------ccccccc---------------------------hhhhc------------ccceeccCCCchhHHHHHHHHHH Q lcl|NC_019544. 59 ------AWFAANG---------------------------YPLRK------------ETTVIKIPERSWLRSGYDENIDK 93 (168) Q Consensus 59 ------~~~~~~g---------------------------~~~~~------------~~~~i~IP~RpFlr~~~~~~~~~ 93 (168) +|-...| ...+. .+-+|.+|+||||-.+-+++..- T Consensus 138 k~Lr~lGy~v~~g~~~~~~k~~kkps~kwI~~nls~~qAgliIR~L~~k~~k~~~~~t~W~I~~PaR~FLG~~~~e~~~~ 217 (230) T protein:vir:98 138 KKLRDLGYTVPNGTTKSGKKRYRRPSAREIVATLSRAKASLLIRYFQEKEERQGKRLTKWIIPTEKRPFLDERDKENAEI 217 (230) T ss_pred HHHHHcCCccCCCCCCcCCCCCCCCCHHHHHHhhhHHHHHHHHHHHhccccccccCccceeeecCcccccCCChHHHHHH Confidence 1111111 00111 12468999999998775555444 Q ss_pred HHHHHHHHHHHHHhccCcHHH Q lcl|NC_019544. 94 IAKKIEKMVPDVIEGNVNPRL 114 (168) Q Consensus 94 ~~~~~~~~~~~~l~G~~~~~~ 114 (168) +..++.+ +.+.- . T Consensus 218 l~~~l~~-i~~~~-------~ 230 (230) T protein:vir:98 218 LKEFILK-FSGIE-------K 230 (230) T ss_pred HHHHHHH-hcccc-------C Confidence 4433332 21111 1 No 146 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=85.09 E-value=0.0027 Score=34.66 Aligned_cols=61 Identities=16% Similarity=0.134 Sum_probs=33.1 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHhH Q lcl|NC_019544. 83 LRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVGS 162 (168) Q Consensus 83 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~S 162 (168) |-.. ....+++.+.+++.-.++- ...+++|+..+..+++.+|.. + | +|||.|++| T Consensus 1 Ma~~-~~G~~~l~~~L~~~~~~~~---~~~~~al~~~a~~v~~~ak~~---------a-----------P-vdTG~Lr~S 55 (137) T protein:vir:94 1 MAKV-KYGNWDLVKELENYERDIE---RWVKRGIAKTTVKIHNTIISL---------M-----------P-VDTGYLRES 55 (137) T ss_pred Cchh-HHhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh---------C-----------C-cCcchhhcC Confidence 2111 1133444444444443321 134555666555555555532 2 2 489999999 Q ss_pred hccccC Q lcl|NC_019544. 163 IRHTVE 168 (168) Q Consensus 163 Ity~V~ 168 (168) |+++++ T Consensus 56 I~~~~~ 61 (137) T protein:vir:94 56 VTMDFK 61 (137) T ss_pred ceeEee Confidence 999988 No 147 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=84.23 E-value=0.004 Score=33.74 Aligned_cols=87 Identities=13% Similarity=0.103 Sum_probs=51.7 Q ss_pred Cc-ceecccchHHHHHHHHHHhhCCeE-------EEEee--------cC--------CCchHHHHHhhhhcCceeccCcc Q lcl|NC_019544. 1 MK-VTIKDTNNIDKITRNLQQLGGKQI-------KVGLF--------GK--------DDSELVMIGAVHEYGAEIPVTPK 56 (168) Q Consensus 1 M~-v~i~~~~~~~~~~~~l~~l~~~~v-------~VGi~--------~~--------~g~~~a~iA~~~E~G~~i~~~~~ 56 (168) |. |..-...+..++..+.++..+..| .-|-. .+ .+.-+..||-+.|||..+--.. T Consensus 16 ~~dvk~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g~~~~vgp~g~t~dYapyvEyGTR~m~~~- 94 (127) T protein:vir:98 16 EKRWDRVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNSSKDVITGNFGYIKDYAPHVEYGHRIVRNG- 94 (127) T ss_pred HHHHHHHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEecCCceEEeccCcccccccceeecceeeeecc- Confidence 22 111112234566666666544333 11110 11 1223578999999997531101 Q ss_pred ccccccccchhhhcccceec-cCCCchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 57 MRAWFAANGYPLRKETTVIK-IPERSWLRSGYDENIDKIAKKIEKMVPD 104 (168) Q Consensus 57 ~~~~~~~~g~~~~~~~~~i~-IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 104 (168) ..|. .|+-|||.|+|+.++..+.+.++..+++ T Consensus 95 ----------------~~~gf~~aqp~l~paf~~Qk~iF~~DL~~l~k~ 127 (127) T protein:vir:98 95 ----------------KQVGYANGTKYLFNNVKKQREIYRQDMLNELRR 127 (127) T ss_pred ----------------cccccccCccccccchHHHhHHHHHHHHHHhcC Confidence 1122 5899999999999999999999999887 No 148 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=83.90 E-value=0.021 Score=29.82 Aligned_cols=73 Identities=12% Similarity=0.044 Sum_probs=35.3 Q ss_pred ccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019544. 55 PKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL 134 (168) Q Consensus 55 ~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~ 134 (168) -+..+|+ -.|-.|=.. ...-+++.+.+++.-.++- ...+++++..+..+++.+|.. T Consensus 1 ~~~~~~~----------------~~~~~Ma~v-~~Gld~l~~~l~~~~~~~~---~~~~~~l~~~a~~v~~~ak~~---- 56 (149) T protein:vir:10 1 MKLNYYD----------------LSRCHMAKV-KYGADSMVVELDKFDKKIE---EWVKKGIAKTTTKIYNTAVAL---- 56 (149) T ss_pred Ceeeeec----------------cchhhhHHH-HHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh---- Confidence 0111111 124444221 1233344444444333221 134555555555555555432 Q ss_pred CCCCChHHHHHhcCCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 135 KDPPNSQMTIERKGSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 135 ~~ppnsp~Ti~~KG~~~PLiDTG~L~~SIty~V~ 168 (168) +| +|||.|++||+++|+ T Consensus 57 -----aP------------vdTG~L~~SI~~~~~ 73 (149) T protein:vir:10 57 -----AP------------VDLGFLEESIDFKYF 73 (149) T ss_pred -----CC------------cccchhhccceEEec Confidence 12 699999999999988 No 149 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=81.31 E-value=0.03 Score=28.93 Aligned_cols=73 Identities=12% Similarity=0.052 Sum_probs=35.9 Q ss_pred ccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_019544. 55 PKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDL 134 (168) Q Consensus 55 ~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~ 134 (168) -+..+|+ -.|-.|=. +..--+++.+.+++.-.++- -..++++...+..+++.+|.. T Consensus 1 ~~~~~~~----------------~~~~~Ma~-~~~Gld~l~~~L~~~~~~~~---~~~~~al~~~a~~v~~~ak~~---- 56 (149) T protein:vir:94 1 MKLSYYD----------------LSRCHMAK-VKYGADSMVVELDKFDKKIE---EWVKKGIAKTTTKIYNTAVAL---- 56 (149) T ss_pred Ceeeeee----------------cchhhHHH-HHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHh---- Confidence 1111111 12444422 11233344444444333321 134555666666555555432 Q ss_pred CCCCChHHHHHhcCCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 135 KDPPNSQMTIERKGSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 135 ~~ppnsp~Ti~~KG~~~PLiDTG~L~~SIty~V~ 168 (168) +| +|||.|++||+++|+ T Consensus 57 -----aP------------vdTG~Lr~SI~~~~~ 73 (149) T protein:vir:94 57 -----AP------------VDLGFLEESIDFKYF 73 (149) T ss_pred -----CC------------cccchhhcCeeEEee Confidence 22 689999999999988 No 150 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=81.01 E-value=0.015 Score=30.53 Aligned_cols=60 Identities=27% Similarity=0.212 Sum_probs=26.6 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHH-HHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHH Q lcl|NC_019544. 82 WLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGL-IQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLV 160 (168) Q Consensus 82 Flr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~-ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~ 160 (168) -=.-=|+.+.+++...+...+. +++..++...+.. |+.... -++ | +|||.|+ T Consensus 1 ~~~~~f~~~~~~~~~~~~k~~~----------~~~~~~a~~~~~~~ie~~ak-----~~~-----------p-vdtG~L~ 53 (141) T protein:vir:78 1 MNEFEFDSNIPKARKLIEKKVL----------QALEDIGEHMTTELAEGGHG-----VTS-----------N-NDTGEYA 53 (141) T ss_pred CcchhHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHhhh-----hcc-----------c-cccchhh Confidence 0001233444444443333332 2233333222211 111111 122 3 7999999 Q ss_pred hHhccccC Q lcl|NC_019544. 161 GSIRHTVE 168 (168) Q Consensus 161 ~SIty~V~ 168 (168) +||+|+|. T Consensus 54 ~SI~~~v~ 61 (141) T protein:vir:78 54 QKSGYKVR 61 (141) T ss_pred cceeeeee Confidence 99999987 No 151 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=80.97 E-value=0.007 Score=32.41 Aligned_cols=60 Identities=20% Similarity=0.316 Sum_probs=34.5 Q ss_pred hHHHHH-HHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHh Q lcl|NC_019544. 83 LRSGYD-ENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVG 161 (168) Q Consensus 83 lr~~~~-~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~ 161 (168) |+.+++ +.-+++.+.+.+... ....+.+|...+..+++++|. |+| +|||.|++ T Consensus 1 M~~~i~i~Gld~l~~~L~~~~~-----~~~~~~al~~~~~~i~~~ak~---------~aP------------vdTG~Lr~ 54 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAAS-----LKGVQQVVKSNTSNMTANMQK---------LVP------------VDTGYMKR 54 (112) T ss_pred CceeeeehhHHHHHHHHHhhhh-----HHHHHHHHHHHHHHHHHHHHH---------hCC------------CCchhhhh Confidence 554444 223444444443211 123466666666666666653 222 69999999 Q ss_pred HhccccC Q lcl|NC_019544. 162 SIRHTVE 168 (168) Q Consensus 162 SIty~V~ 168 (168) ||+..++ T Consensus 55 si~~~~~ 61 (112) T protein:vir:36 55 SIKMELT 61 (112) T ss_pred ceeeeec Confidence 9998777 No 152 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=79.85 E-value=0.017 Score=30.26 Aligned_cols=102 Identities=9% Similarity=0.042 Sum_probs=59.8 Q ss_pred CcceecccchHHHHHHHHHHhhCCe--------------------------EEEEee---------c--CCC-----chH Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQ--------------------------IKVGLF---------G--KDD-----SEL 38 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~--------------------------v~VGi~---------~--~~g-----~~~ 38 (168) |++.=-+..+++++.+.|+++.... |.-|-+ . +++ .+. T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVdTG~Lr~S~~~~~~~~~~~~~~~~V~n~ 80 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVKQGNLRRSWTAEGPTYGCGGWTIKLINN 80 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCCcchhccceeecceeeecCeeEEEEecC Confidence 8765334456777777776543210 111111 0 111 145 Q ss_pred HHHHhhhhcCceeccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019544. 39 VMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNV 110 (168) Q Consensus 39 a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~ 110 (168) +.||.+-|||..+... .+.+..+... ..--+|-++||+.++++.+..+.+.+++.+..+++=.- T Consensus 81 ~~YA~~VE~Ghr~~~G----~~v~~~~~~~----~~g~V~G~~~~~~a~~~~~~~~~~~l~k~l~~l~d~~~ 144 (144) T protein:vir:10 81 AEYASYVESGHRQTPG----RYVPVLKKRL----VRDWVPGQFYMKKSIPQIQRQLPQLVTEGLWGLKDLFE 144 (144) T ss_pred CCcccccccceeecCC----cccccCCCcc----ccceecCccchHHHHHHHHHHHHHHHHHHHHHHhhhcC Confidence 7788899999754211 1222222211 12347899999999999999999999999988764332 No 153 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=79.05 E-value=0.011 Score=31.36 Aligned_cols=56 Identities=14% Similarity=0.029 Sum_probs=29.2 Q ss_pred ccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchh Q lcl|NC_019544. 76 KIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLID 155 (168) Q Consensus 76 ~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiD 155 (168) -|-+|+-|. ...+.+++.+ .++++++.++...++..|. |+| +| T Consensus 1 ~~~~~~~l~------~~~l~~~~~~----------~~~~~~~~~a~~ve~~ak~---------~aP------------v~ 43 (137) T protein:vir:10 1 MVAHTLRIE------RAQLHGLGMD----------EARKAVNRVVRRTFTRSQI---------LAP------------VD 43 (137) T ss_pred CcccccccC------hhhHhhHHHH----------HHHHHHHHHHHHHHHHHHh---------cCC------------cC Confidence 112222222 2222222211 3355566666665555443 222 79 Q ss_pred HHHHHhHhccccC Q lcl|NC_019544. 156 TGRLVGSIRHTVE 168 (168) Q Consensus 156 TG~L~~SIty~V~ 168 (168) ||+|++||++.+. T Consensus 44 TG~Lr~SI~~~~~ 56 (137) T protein:vir:10 44 TGYLRASGRLVLG 56 (137) T ss_pred chhhhccceeeee Confidence 9999999999886 No 154 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=75.96 E-value=0.05 Score=27.71 Aligned_cols=65 Identities=18% Similarity=0.213 Sum_probs=36.0 Q ss_pred eccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcch Q lcl|NC_019544. 75 IKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLI 154 (168) Q Consensus 75 i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLi 154 (168) |.| .++ +++.+.+++.-.++. -.++.++...|..+++.+|..-. ...+.| + T Consensus 1 i~i-------~Gl----d~L~~~l~~~~~~~~---~~v~~av~~~~~~i~~~a~~~a~--------------~~~~~p-~ 51 (115) T protein:vir:99 1 MNI-------DGL----DALLNQFHDMKTNID---DDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-Y 51 (115) T ss_pred Ccc-------hhH----HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhhc--------------cccCCC-C Confidence 000 123 333333333322221 13467778888888777776421 122234 5 Q ss_pred hHHHHHhHhccccC Q lcl|NC_019544. 155 DTGRLVGSIRHTVE 168 (168) Q Consensus 155 DTG~L~~SIty~V~ 168 (168) |||.|++||++... T Consensus 52 ~TG~Lr~SI~~~~~ 65 (115) T protein:vir:99 52 WTGNLSRNIRYKKT 65 (115) T ss_pred cchhhhhceeeeec Confidence 89999999998877 No 155 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=75.89 E-value=0.02 Score=29.94 Aligned_cols=93 Identities=13% Similarity=0.165 Sum_probs=52.0 Q ss_pred Cccee-cccchHHHHHHHHHHhhCCeEE-------------------------EEee-----------------cCCC-- Q lcl|NC_019544. 1 MKVTI-KDTNNIDKITRNLQQLGGKQIK-------------------------VGLF-----------------GKDD-- 35 (168) Q Consensus 1 M~v~i-~~~~~~~~~~~~l~~l~~~~v~-------------------------VGi~-----------------~~~g-- 35 (168) |+-.- -+..+|+++.+.|+.+....+. -|-+ .+++ T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~g~~~~ 80 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQGNNYI 80 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccceeecCCeeE Confidence 76532 2345667777766554332111 0110 0000 Q ss_pred ---chHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcH Q lcl|NC_019544. 36 ---SELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNP 112 (168) Q Consensus 36 ---~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~ 112 (168) .+.+.||.+-|||..+.. +.-=+|.+.+|+.+.++.+..+.+.+++.+.+++.+=.++ T Consensus 81 v~v~n~~~YA~~VE~Ghr~~~-------------------~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~~~~~~ 141 (141) T protein:vir:79 81 IEVVNPTEYASYVNFGHRTKD-------------------GKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLKGVFDA 141 (141) T ss_pred EEEecCCcchhhhhcceeecC-------------------CcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 022456666666643321 0113466777788888888888888888888888776666 No 156 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=75.40 E-value=0.023 Score=29.52 Aligned_cols=61 Identities=16% Similarity=0.223 Sum_probs=35.6 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHhH Q lcl|NC_019544. 83 LRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVGS 162 (168) Q Consensus 83 lr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~S 162 (168) |.-.++ .-+++.+.+++.-..+.. ..+.++...|..+++++|..- | +|||.|++| T Consensus 1 msi~i~-Gld~l~~~l~~~~~~~~~---~v~~al~~~a~~i~~~ak~~a---------P------------v~TG~Lr~s 55 (114) T protein:vir:95 1 MAIKWQ-GIEKLVATISNAQPKAVE---QSLQVLKNNGEKGKRIAKQLA---------P------------KDTEFLKDH 55 (114) T ss_pred Ceeeee-hHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhC---------C------------cCchhhhhc Confidence 332222 234444444444433321 346677777777777766642 2 689999999 Q ss_pred hccccC Q lcl|NC_019544. 163 IRHTVE 168 (168) Q Consensus 163 Ity~V~ 168 (168) |+.+.. T Consensus 56 I~~~~~ 61 (114) T protein:vir:95 56 ITTSYP 61 (114) T ss_pred eeeecC Confidence 987666 No 157 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=74.33 E-value=0.011 Score=31.36 Aligned_cols=65 Identities=17% Similarity=0.211 Sum_probs=32.2 Q ss_pred eccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcch Q lcl|NC_019544. 75 IKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLI 154 (168) Q Consensus 75 i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLi 154 (168) +. =.++ +++.+.+++.-.++ .-..+.++..-|..+++.+|..-. ..++.| + T Consensus 1 i~-------~~Gl----d~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~ 51 (115) T protein:vir:93 1 MN-------IDGL----DALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-Y 51 (115) T ss_pred Cc-------chhH----HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-C Confidence 00 0122 33333332222221 112355666666666666554321 123333 7 Q ss_pred hHHHHHhHhccccC Q lcl|NC_019544. 155 DTGRLVGSIRHTVE 168 (168) Q Consensus 155 DTG~L~~SIty~V~ 168 (168) |||.|++||++... T Consensus 52 ~TG~Lr~sI~~~~~ 65 (115) T protein:vir:93 52 WTGNLSRNIRYKKT 65 (115) T ss_pred Cchhhhhcceeeec Confidence 99999999998866 No 158 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=74.33 E-value=0.011 Score=31.36 Aligned_cols=65 Identities=17% Similarity=0.211 Sum_probs=32.2 Q ss_pred eccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcch Q lcl|NC_019544. 75 IKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLI 154 (168) Q Consensus 75 i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLi 154 (168) +. =.++ +++.+.+++.-.++ .-..+.++..-|..+++.+|..-. ..++.| + T Consensus 1 i~-------~~Gl----d~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~ 51 (115) T protein:vir:96 1 MN-------IDGL----DALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-Y 51 (115) T ss_pred Cc-------chhH----HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-C Confidence 00 0122 33333332222221 112355666666666666554321 123333 7 Q ss_pred hHHHHHhHhccccC Q lcl|NC_019544. 155 DTGRLVGSIRHTVE 168 (168) Q Consensus 155 DTG~L~~SIty~V~ 168 (168) |||.|++||++... T Consensus 52 ~TG~Lr~sI~~~~~ 65 (115) T protein:vir:96 52 WTGNLSRNIRYKKT 65 (115) T ss_pred Cchhhhhcceeeec Confidence 99999999998866 No 159 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=74.33 E-value=0.011 Score=31.36 Aligned_cols=65 Identities=17% Similarity=0.211 Sum_probs=32.2 Q ss_pred eccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcch Q lcl|NC_019544. 75 IKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLI 154 (168) Q Consensus 75 i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLi 154 (168) +. =.++ +++.+.+++.-.++ .-..+.++..-|..+++.+|..-. ..++.| + T Consensus 1 i~-------~~Gl----d~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~ 51 (115) T protein:vir:97 1 MN-------IDGL----DALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-Y 51 (115) T ss_pred Cc-------chhH----HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-C Confidence 00 0122 33333332222221 112355666666666666554321 123333 7 Q ss_pred hHHHHHhHhccccC Q lcl|NC_019544. 155 DTGRLVGSIRHTVE 168 (168) Q Consensus 155 DTG~L~~SIty~V~ 168 (168) |||.|++||++... T Consensus 52 ~TG~Lr~sI~~~~~ 65 (115) T protein:vir:97 52 WTGNLSRNIRYKKT 65 (115) T ss_pred Cchhhhhcceeeec Confidence 99999999998866 No 160 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=74.33 E-value=0.011 Score=31.36 Aligned_cols=65 Identities=17% Similarity=0.211 Sum_probs=32.2 Q ss_pred eccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcch Q lcl|NC_019544. 75 IKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLI 154 (168) Q Consensus 75 i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLi 154 (168) +. =.++ +++.+.+++.-.++ .-..+.++..-|..+++.+|..-. ..++.| + T Consensus 1 i~-------~~Gl----d~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~ 51 (115) T protein:vir:10 1 MN-------IDGL----DALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-Y 51 (115) T ss_pred Cc-------chhH----HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-C Confidence 00 0122 33333332222221 112355666666666666554321 123333 7 Q ss_pred hHHHHHhHhccccC Q lcl|NC_019544. 155 DTGRLVGSIRHTVE 168 (168) Q Consensus 155 DTG~L~~SIty~V~ 168 (168) |||.|++||++... T Consensus 52 ~TG~Lr~sI~~~~~ 65 (115) T protein:vir:10 52 WTGNLSRNIRYKKT 65 (115) T ss_pred Cchhhhhcceeeec Confidence 99999999998866 No 161 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=74.33 E-value=0.011 Score=31.36 Aligned_cols=65 Identities=17% Similarity=0.211 Sum_probs=32.2 Q ss_pred eccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcch Q lcl|NC_019544. 75 IKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLI 154 (168) Q Consensus 75 i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLi 154 (168) +. =.++ +++.+.+++.-.++ .-..+.++..-|..+++.+|..-. ..++.| + T Consensus 1 i~-------~~Gl----d~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~ 51 (115) T protein:vir:78 1 MN-------IDGL----DALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-Y 51 (115) T ss_pred Cc-------chhH----HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-C Confidence 00 0122 33333332222221 112355666666666666554321 123333 7 Q ss_pred hHHHHHhHhccccC Q lcl|NC_019544. 155 DTGRLVGSIRHTVE 168 (168) Q Consensus 155 DTG~L~~SIty~V~ 168 (168) |||.|++||++... T Consensus 52 ~TG~Lr~sI~~~~~ 65 (115) T protein:vir:78 52 WTGNLSRNIRYKKT 65 (115) T ss_pred Cchhhhhcceeeec Confidence 99999999998866 No 162 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=74.33 E-value=0.011 Score=31.36 Aligned_cols=65 Identities=17% Similarity=0.211 Sum_probs=32.2 Q ss_pred eccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcch Q lcl|NC_019544. 75 IKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLI 154 (168) Q Consensus 75 i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLi 154 (168) +. =.++ +++.+.+++.-.++ .-..+.++..-|..+++.+|..-. ..++.| + T Consensus 1 i~-------~~Gl----d~l~~~l~~~~~~~---~~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~ 51 (115) T protein:vir:96 1 MN-------IDGL----DALLNQFHDMKTNI---DDDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-Y 51 (115) T ss_pred Cc-------chhH----HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-C Confidence 00 0122 33333332222221 112355666666666666554321 123333 7 Q ss_pred hHHHHHhHhccccC Q lcl|NC_019544. 155 DTGRLVGSIRHTVE 168 (168) Q Consensus 155 DTG~L~~SIty~V~ 168 (168) |||.|++||++... T Consensus 52 ~TG~Lr~sI~~~~~ 65 (115) T protein:vir:96 52 WTGNLSRNIRYKKT 65 (115) T ss_pred Cchhhhhcceeeec Confidence 99999999998866 No 163 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=72.20 E-value=0.031 Score=28.82 Aligned_cols=84 Identities=13% Similarity=0.224 Sum_probs=52.5 Q ss_pred Ccceeccc-chHHHHHH--------HHHHhhCCeEEEEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcc Q lcl|NC_019544. 1 MKVTIKDT-NNIDKITR--------NLQQLGGKQIKVGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKE 71 (168) Q Consensus 1 M~v~i~~~-~~~~~~~~--------~l~~l~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~ 71 (168) +..+--.. ....-+.. .++-...-++.|||...+ -+.||.+.+.|+ T Consensus 62 ~~~k~~~~~~~~~HlaD~I~~~~~~~iDg~~dG~s~VGw~~~~---~a~~a~f~NdGT---------------------- 116 (159) T protein:vir:38 62 ANAKHHNRNRKTKHLQDSITYKPGYTADKLHTGDTDVGFEGKY---YDFLAKIVNNGQ---------------------- 116 (159) T ss_pred ccccccCcCcCCCccccceeeecCccccccccceeeecccCCc---cceEeeecccCc---------------------- Confidence 11110000 00001111 122223346889996443 357888888885 Q ss_pred cceeccCCC-----chhHHHHHHHHHHHHHHHHHHHHHHHhccCcH Q lcl|NC_019544. 72 TTVIKIPER-----SWLRSGYDENIDKIAKKIEKMVPDVIEGNVNP 112 (168) Q Consensus 72 ~~~i~IP~R-----pFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~ 112 (168) +..|+. +|+..+..+.++++.+.+...+.++|+-+-+- T Consensus 117 ---~~m~~k~~~gdHFvekt~~~~k~~Vl~A~~~~~~~il~~~~~~ 159 (159) T protein:vir:38 117 ---HHMSPKRYKNMHFLDKAQQEAKKSVAEAELKAYKEVMNHDSDK 159 (159) T ss_pred ---cccCCCCccCChhHHHHHHHHHHHHHHHHHHHHHHHhhcccCC Confidence 356665 79999999999999999999999999887554 No 164 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=71.66 E-value=0.054 Score=27.52 Aligned_cols=78 Identities=21% Similarity=0.261 Sum_probs=39.8 Q ss_pred CcceecccchHHHHHHHHHHh--------------------------------------------------hC---CeEE Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQL--------------------------------------------------GG---KQIK 27 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l--------------------------------------------------~~---~~v~ 27 (168) |+|.+++. +++++.|+.. .+ +.|+ T Consensus 1 msvevkGv---~eilk~le~k~G~~~~~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~ 77 (133) T protein:vir:93 1 MSVEIKGI---PEVLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVL 77 (133) T ss_pred CeEEEecH---HHHHHHHHHhhCHhhhHhhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceEEE Confidence 99888864 3344333221 11 2345 Q ss_pred EEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCCCch--hHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 28 VGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSW--LRSGYDENIDKIAKKIEKMVPD 104 (168) Q Consensus 28 VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 104 (168) |||.++- .-..|-.+||||.+- .++ .|-||-| ++.+++..+..+.+.++.-+.+ T Consensus 78 i~W~gp~--~R~~iVHLNE~Gytr----~Gk-----------------~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 78 IEWVGPM--NRKNIIHLNEHGYTR----DGK-----------------KYTPRGFGVIAKTLAANERKYREIIKKELAR 133 (133) T ss_pred EEeecCC--CceeEEEeeccceec----CCC-----------------eEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 5554321 122333456666321 011 2233433 6777888887777777766665 No 165 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=70.02 E-value=0.097 Score=26.15 Aligned_cols=65 Identities=15% Similarity=0.163 Sum_probs=33.2 Q ss_pred eccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcch Q lcl|NC_019544. 75 IKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLI 154 (168) Q Consensus 75 i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLi 154 (168) |.| .+++ ++.+.++..-+++ .-.++.++..-|..+++.+|..- | +....| + T Consensus 1 i~i-------~Gld----~L~~~l~~~~~~~---~~~~~~al~~~~~~i~~~a~~~a---------~-----~~~~~p-v 51 (115) T protein:vir:10 1 MQS-------KGLK----KLMNHLKVMHDDI---EDDVDDILKNNAKEGVGIAVSNA---------K-----EVMNKG-Y 51 (115) T ss_pred Cee-------hhHH----HHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhh---------c-----cccCCC-C Confidence 000 1333 3333333222221 11346667777777777766532 2 122334 7 Q ss_pred hHHHHHhHhccccC Q lcl|NC_019544. 155 DTGRLVGSIRHTVE 168 (168) Q Consensus 155 DTG~L~~SIty~V~ 168 (168) |||.|++||+.... T Consensus 52 ~TG~Lr~sI~~~~~ 65 (115) T protein:vir:10 52 WTGNLASLIEVKKI 65 (115) T ss_pred cchhhhhceeeeec Confidence 99999999986654 No 166 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=68.37 E-value=0.095 Score=26.19 Aligned_cols=79 Identities=18% Similarity=0.185 Sum_probs=45.1 Q ss_pred CcceecccchHHHHHHHHHH-hh--------------------------------------------------CCeEEEE Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQ-LG--------------------------------------------------GKQIKVG 29 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~-l~--------------------------------------------------~~~v~VG 29 (168) |+-.-. --+++++++.|++ |. -+.|+|| T Consensus 1 ~~~~ae-vkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s~~~~~~G~r~V~Vg 79 (132) T protein:vir:96 1 MSGFAN-LKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLG 79 (132) T ss_pred CCcccc-ccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeecCeeecCCceEEEec Confidence 432211 1134444444433 21 1345555 Q ss_pred eecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCCCc--hhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019544. 30 LFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERS--WLRSGYDENIDKIAKKIEKMVPDVIE 107 (168) Q Consensus 30 i~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~Rp--Flr~~~~~~~~~~~~~~~~~~~~~l~ 107 (168) |- |. --.|-.+||||.. ..|-||- +++.+++..+..+.+.++.-+.+.|+ T Consensus 80 W~---Gp-R~~ivHLNE~GyG------------------------k~~~PrG~G~I~~a~~~se~~~~~~~~~elkk~l~ 131 (132) T protein:vir:96 80 FT---TP-RWNIVHLQELEYG------------------------WKHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFD 131 (132) T ss_pred cc---CC-ceeEEeeeccccc------------------------CCcCCCcchHHHHHHHhhhhHHHHHHHHHHHHHhc Confidence 53 21 3344455666641 1233332 58999999999999999999999999 Q ss_pred c Q lcl|NC_019544. 108 G 108 (168) Q Consensus 108 G 108 (168) | T Consensus 132 ~ 132 (132) T protein:vir:96 132 G 132 (132) T ss_pred C Confidence 9 No 167 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=66.20 E-value=0.12 Score=25.64 Aligned_cols=80 Identities=20% Similarity=0.309 Sum_probs=43.3 Q ss_pred CcceecccchHHHHHHHHHH-hh--------C------------------------------------------CeEEEE Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQ-LG--------G------------------------------------------KQIKVG 29 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~-l~--------~------------------------------------------~~v~VG 29 (168) |+|.+++. +++++.|+. |. . ++|+|| T Consensus 1 msvevkGv---~eilk~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~G~r~V~i~ 77 (133) T protein:vir:78 1 MSVEVTGV---EELERQLVSLFGRENLPQLVDPALIAGATLVAKTLKSEFVQFKDTGASIDEINIEKPSYDKGVRSIKID 77 (133) T ss_pred CeEEEecH---HHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHHhhcchhcccceeeeEEecCeeeeCCceEEEEE Confidence 99988864 334443322 10 0 245555 Q ss_pred eecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCCCch--hHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 30 LFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSW--LRSGYDENIDKIAKKIEKMVPDVI 106 (168) Q Consensus 30 i~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~~l 106 (168) |.++- .-..|-.+||||.+ + .++ .|-||-| ++.+++..+..+.+.++.-+.+.| T Consensus 78 W~gp~--~R~~iVHLNE~GYt-r---~Gk-----------------~i~PrG~G~i~~a~~~se~~y~~~vk~el~k~l 133 (133) T protein:vir:78 78 WKGPK--DRYKIIHLNEYGYT-R---NGK-----------------KITPAGTGSVARSLRISERAYRAIVQKKIGDKL 133 (133) T ss_pred EecCC--CceeEEEeecccee-c---CCC-----------------eEccchhhHHHHHHHhhhHHHHHHHHHHHHhhC Confidence 54321 12233445677642 1 111 2333433 777888888888888888887766 No 168 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=59.00 E-value=0.15 Score=25.12 Aligned_cols=78 Identities=19% Similarity=0.230 Sum_probs=38.9 Q ss_pred CcceecccchHHHHHHHHHH-h-------------------------------------------------hC---CeEE Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQ-L-------------------------------------------------GG---KQIK 27 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~-l-------------------------------------------------~~---~~v~ 27 (168) |+|.+++. +++++.|+. | ++ +.|+ T Consensus 1 msvevkGv---~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~ 77 (133) T protein:vir:96 1 MSVEIKGI---PEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVL 77 (133) T ss_pred CeEEEecH---HHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEE Confidence 99888864 334433322 1 01 2345 Q ss_pred EEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCCCch--hHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 28 VGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSW--LRSGYDENIDKIAKKIEKMVPD 104 (168) Q Consensus 28 VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 104 (168) |||.++- .-..|-.+||||.+- .++ .|-||-| ++.+++.-+..+.+.++.-+.+ T Consensus 78 i~W~gp~--~R~~iVHLNE~Gytr----~Gk-----------------~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:96 78 IEWVGPM--NRKNIIHLNEHGYTR----DGK-----------------KYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred EEeecCC--CceeEEEeeccceec----CCC-----------------eEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 5554321 122333456666321 011 1233433 6777777777777777766665 No 169 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=59.00 E-value=0.15 Score=25.12 Aligned_cols=78 Identities=19% Similarity=0.230 Sum_probs=38.9 Q ss_pred CcceecccchHHHHHHHHHH-h-------------------------------------------------hC---CeEE Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQ-L-------------------------------------------------GG---KQIK 27 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~-l-------------------------------------------------~~---~~v~ 27 (168) |+|.+++. +++++.|+. | ++ +.|+ T Consensus 1 msvevkGv---~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~ 77 (133) T protein:vir:94 1 MSVEIKGI---PEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVL 77 (133) T ss_pred CeEEEecH---HHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEE Confidence 99888864 334433322 1 01 2345 Q ss_pred EEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCCCch--hHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 28 VGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSW--LRSGYDENIDKIAKKIEKMVPD 104 (168) Q Consensus 28 VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 104 (168) |||.++- .-..|-.+||||.+- .++ .|-||-| ++.+++.-+..+.+.++.-+.+ T Consensus 78 i~W~gp~--~R~~iVHLNE~Gytr----~Gk-----------------~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:94 78 IEWVGPM--NRKNIIHLNEHGYTR----DGK-----------------KYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred EEeecCC--CceeEEEeeccceec----CCC-----------------eEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 5554321 122333456666321 011 1233433 6777777777777777766665 No 170 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=59.00 E-value=0.15 Score=25.12 Aligned_cols=78 Identities=19% Similarity=0.230 Sum_probs=38.9 Q ss_pred CcceecccchHHHHHHHHHH-h-------------------------------------------------hC---CeEE Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQ-L-------------------------------------------------GG---KQIK 27 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~-l-------------------------------------------------~~---~~v~ 27 (168) |+|.+++. +++++.|+. | ++ +.|+ T Consensus 1 msvevkGv---~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~ 77 (133) T protein:vir:93 1 MSVEIKGI---PEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVL 77 (133) T ss_pred CeEEEecH---HHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEE Confidence 99888864 334433322 1 01 2345 Q ss_pred EEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCCCch--hHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 28 VGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSW--LRSGYDENIDKIAKKIEKMVPD 104 (168) Q Consensus 28 VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 104 (168) |||.++- .-..|-.+||||.+- .++ .|-||-| ++.+++.-+..+.+.++.-+.+ T Consensus 78 i~W~gp~--~R~~iVHLNE~Gytr----~Gk-----------------~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 78 IEWVGPM--NRKNIIHLNEHGYTR----DGK-----------------KYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred EEeecCC--CceeEEEeeccceec----CCC-----------------eEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 5554321 122333456666321 011 1233433 6777777777777777766665 No 171 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=59.00 E-value=0.15 Score=25.12 Aligned_cols=78 Identities=19% Similarity=0.230 Sum_probs=38.9 Q ss_pred CcceecccchHHHHHHHHHH-h-------------------------------------------------hC---CeEE Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQ-L-------------------------------------------------GG---KQIK 27 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~-l-------------------------------------------------~~---~~v~ 27 (168) |+|.+++. +++++.|+. | ++ +.|+ T Consensus 1 msvevkGv---~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~ 77 (133) T protein:vir:78 1 MSVEIKGI---PEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVL 77 (133) T ss_pred CeEEEecH---HHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEE Confidence 99888864 334433322 1 01 2345 Q ss_pred EEeecCCCchHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCCCch--hHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019544. 28 VGLFGKDDSELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSW--LRSGYDENIDKIAKKIEKMVPD 104 (168) Q Consensus 28 VGi~~~~g~~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 104 (168) |||.++- .-..|-.+||||.+- .++ .|-||-| ++.+++.-+..+.+.++.-+.+ T Consensus 78 i~W~gp~--~R~~iVHLNE~Gytr----~Gk-----------------~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:78 78 IEWVGPM--NRKNIIHLNEHGYTR----DGK-----------------KYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred EEeecCC--CceeEEEeeccceec----CCC-----------------eEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 5554321 122333456666321 011 1233433 6777777777777777766665 No 172 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=54.63 E-value=0.03 Score=28.92 Aligned_cols=61 Identities=15% Similarity=0.146 Sum_probs=32.6 Q ss_pred CCC-chhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhH Q lcl|NC_019544. 78 PER-SWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDT 156 (168) Q Consensus 78 P~R-pFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDT 156 (168) =+| .|==.++ +++.+.+++... ...+++++...|..++..+|.. +| +|| T Consensus 1 Ma~~~i~~~Gl----d~L~~~L~~~~~-----~~~v~~vv~~~~~~l~~~ak~~---------ap------------~dT 50 (92) T protein:vir:99 1 MADYSISWDGL----DALDEALANQQN-----MNTVKKVVKKHTANLMTATQQA---------VP------------VDT 50 (92) T ss_pred CCceeeEeehH----HHHHHHHHhhcc-----HHHHHHHHHHHHHHHHHHHHHh---------CC------------CCc Confidence 011 0000022 333333332211 1245677777777777666652 22 799 Q ss_pred HHHHhHhccccC Q lcl|NC_019544. 157 GRLVGSIRHTVE 168 (168) Q Consensus 157 G~L~~SIty~V~ 168 (168) |.|++||+..++ T Consensus 51 G~lrrSI~~~~~ 62 (92) T protein:vir:99 51 GHLKQSAQIQIS 62 (92) T ss_pred cccceeeeEEee Confidence 999999997777 No 173 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=53.34 E-value=0.056 Score=27.44 Aligned_cols=89 Identities=18% Similarity=0.243 Sum_probs=50.6 Q ss_pred Ccceec-cc-chHHH--------HHHHHHHhhC------------------CeEEEEeecC-----CCch-HHHHHhhhh Q lcl|NC_019544. 1 MKVTIK-DT-NNIDK--------ITRNLQQLGG------------------KQIKVGLFGK-----DDSE-LVMIGAVHE 46 (168) Q Consensus 1 M~v~i~-~~-~~~~~--------~~~~l~~l~~------------------~~v~VGi~~~-----~g~~-~a~iA~~~E 46 (168) |+=.+. |. ..+.. ++.+++++.- .+|+++=-+. .|.. -.-||.+-+ T Consensus 21 mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~L~~Sir~aaT~raa~VrAGr~arVPYA~~I~ 100 (143) T protein:vir:13 21 VRALRDKELNKAVREANKASGEVLIPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTASAKGAVIKAGSAARVPYAAAIH 100 (143) T ss_pred HHHhhCCcchHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccchhhccccccccccceeeeecCcCCCCcccccc Confidence 443321 11 11222 2223332211 1233332211 1322 245677778 Q ss_pred cCceeccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHH Q lcl|NC_019544. 47 YGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDA 118 (168) Q Consensus 47 ~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~ 118 (168) ||+ +.-+|-++-||..++...++.|.+..++.++++++-.+ +. T Consensus 101 ~G~-----------------------r~r~Is~~rFl~~a~a~te~~~~r~Ye~~i~~vl~k~l------~s 143 (143) T protein:vir:13 101 FGY-----------------------RKRNISANRFLYRAMARKSDVVAATYERRIAAVVEKYL------ES 143 (143) T ss_pred cCC-----------------------cccccchhhhhhhhhhccCHHHHHHHHHHHHHHHHHHh------cC Confidence 886 45578899999999999999999999999999876443 22 No 174 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=50.99 E-value=0.057 Score=27.41 Aligned_cols=89 Identities=18% Similarity=0.235 Sum_probs=51.6 Q ss_pred Ccceecc-c-chHHH--------HHHHHHHhhC------------------CeEEEEeecC-----CCc-hHHHHHhhhh Q lcl|NC_019544. 1 MKVTIKD-T-NNIDK--------ITRNLQQLGG------------------KQIKVGLFGK-----DDS-ELVMIGAVHE 46 (168) Q Consensus 1 M~v~i~~-~-~~~~~--------~~~~l~~l~~------------------~~v~VGi~~~-----~g~-~~a~iA~~~E 46 (168) |.-.+.. . ..+.. ++.+++++.- .+|+++=-+. .|. .-.-||.+-+ T Consensus 21 mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aaT~raa~VrAG~~krVPYA~~I~ 100 (143) T protein:vir:62 21 VRTLRDKELNKAVREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTASAKGAVIKAGSASRVPYAAAIH 100 (143) T ss_pred HHHhhCCchhHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccCcchhhccccccccccceeeeeCCcCCCCcccccc Confidence 4433211 1 11222 2223332211 1233322211 233 3456677778 Q ss_pred cCceeccCccccccccccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHH Q lcl|NC_019544. 47 YGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDA 118 (168) Q Consensus 47 ~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~ 118 (168) ||+ +.-+|-++-||..++...++.|.+..++.++++++-.+ +. T Consensus 101 ~G~-----------------------r~r~Isp~rFl~~a~a~te~~~~r~Ye~~i~~vl~k~l------~s 143 (143) T protein:vir:62 101 FGY-----------------------RARNISPNRFLFRAMARKSDVVAATYERRIAAVVEKYL------ES 143 (143) T ss_pred cCc-----------------------ccccccchhhhhhhhhccCHHHHHHHHHHHHHHHHHHh------cC Confidence 886 45578899999999999999999999999999876443 22 No 175 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=48.44 E-value=0.2 Score=24.44 Aligned_cols=78 Identities=14% Similarity=0.148 Sum_probs=47.3 Q ss_pred CcceecccchH----HHHHHHHHH----------------------h-hCCeEEEEeecCCCchHHHHHhhhhcCceecc Q lcl|NC_019544. 1 MKVTIKDTNNI----DKITRNLQQ----------------------L-GGKQIKVGLFGKDDSELVMIGAVHEYGAEIPV 53 (168) Q Consensus 1 M~v~i~~~~~~----~~~~~~l~~----------------------l-~~~~v~VGi~~~~g~~~a~iA~~~E~G~~i~~ 53 (168) |.-.+ +..| +.+.++|+. - .-++|+|||.+. -..|-.+||||..= T Consensus 32 ~~ri~--nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~~~~G~r~V~igW~Gp----R~~ivHLNE~GyGk-- 103 (138) T protein:vir:98 32 VNRVV--NRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGFTTP----RWNIVHLQELEYGW-- 103 (138) T ss_pred hhhhh--hHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCeeecCCceEEEEeeecC----eeeEEeeecccccC-- Confidence 22111 1111 122222211 0 125788888643 55566778999521 Q ss_pred CccccccccccchhhhcccceeccCCCc--hhHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019544. 54 TPKMRAWFAANGYPLRKETTVIKIPERS--WLRSGYDENIDKIAKKIEKMVPDVIEG 108 (168) Q Consensus 54 ~~~~~~~~~~~g~~~~~~~~~i~IP~Rp--Flr~~~~~~~~~~~~~~~~~~~~~l~G 108 (168) .|-||- +++.+++..+..+.+.++.-+.+.|+| T Consensus 104 ----------------------~i~PrG~G~I~ka~~~se~~y~~~vk~el~k~l~~ 138 (138) T protein:vir:98 104 ----------------------KHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFDG 138 (138) T ss_pred ----------------------CcCCCcchHHHHHHHhhhHHHHHHHHHHHHHHhcC Confidence 222332 589999999999999999999999999 No 176 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=44.25 E-value=0.4 Score=22.75 Aligned_cols=90 Identities=16% Similarity=0.243 Sum_probs=47.6 Q ss_pred CcceecccchHHHHHHHHHHhhCCe-E-----------------------EEEe-------------------------- Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQ-I-----------------------KVGL-------------------------- 30 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~-v-----------------------~VGi-------------------------- 30 (168) |+..+. ...|+++.++|..+.... + =||. T Consensus 1 m~~~~d-~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~ 79 (163) T protein:vir:10 1 MSGGFD-YRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQ 79 (163) T ss_pred CCCccC-HHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhccccccc Confidence 777754 445666666665442210 0 0110 Q ss_pred ---e-----c------CCCc-----hHHHHHhhhhcCceeccCccccccccccchhhhcccceeccCCCchhHHHHHHHH Q lcl|NC_019544. 31 ---F-----G------KDDS-----ELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKIPERSWLRSGYDENI 91 (168) Q Consensus 31 ---~-----~------~~g~-----~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~ 91 (168) + . +++. +.+.+|.+-|||=.+. . -.=+|-+.+|+.+.++.+ T Consensus 80 tG~lr~swk~~~~~k~~~~~~v~v~N~~~YA~~VE~GHR~~-----------------~---gGfV~G~fml~~s~~~~~ 139 (163) T protein:vir:10 80 GGTLQKGWSKSRIEVSGRTYKQKVYNKVYYAPHVEYGHKTV-----------------N---GGFVPGQFFLHKTVEDTK 139 (163) T ss_pred cchhhccceecceeecCCceEEEEEecCCccchhhcceeec-----------------C---CceeccchhhHHHHHHHH Confidence 0 0 0000 1233444444443221 1 124799999999999999 Q ss_pred HHHHHHHHHHHHHHH----hccCc Q lcl|NC_019544. 92 DKIAKKIEKMVPDVI----EGNVN 111 (168) Q Consensus 92 ~~~~~~~~~~~~~~l----~G~~~ 111 (168) ..+.+.+++.+..++ +|+.. T Consensus 140 ~~~~~~~e~~l~~~l~k~~~~~~~ 163 (163) T protein:vir:10 140 SDMEKRVRDKYDGFMRKVVLGNGK 163 (163) T ss_pred HHHHHHHHHHHHHHHHHhhcCCCC Confidence 888777776665554 45444 No 177 >protein:vir:4096 Length: 140 # NCBI annotation: Gp9 protein # Family: family:all:28682 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510990;swissprot:trembl:q8w600;genbank:gi:17488512;uniprot:Q8W600;genbank:GeneID:1260318 Probab=36.58 E-value=0.53 Score=22.11 Aligned_cols=107 Identities=16% Similarity=0.264 Sum_probs=56.7 Q ss_pred Ccceec-ccchHHHHHHHHHHhhCCeEEE--EeecCCCchHHH-----------------------------HHhhhhcC Q lcl|NC_019544. 1 MKVTIK-DTNNIDKITRNLQQLGGKQIKV--GLFGKDDSELVM-----------------------------IGAVHEYG 48 (168) Q Consensus 1 M~v~i~-~~~~~~~~~~~l~~l~~~~v~V--Gi~~~~g~~~a~-----------------------------iA~~~E~G 48 (168) |.-+-. +.++++++.+.+.++.++.=.+ =.+..+|.++|. .-..-..| T Consensus 1 m~~~~sld~s~~e~L~~~i~r~P~ksE~~IN~~L~tkg~~~~~~~I~~~iPvS~~~k~~~RnK~HAK~s~pl~~~~~NLg 80 (140) T protein:vir:40 1 MCAKWSLEFSDVERLSNLISQIPNKSEAIINKTLETKAVPLVKLNIEKRINLSKNWKGQLLNKNHAQSSGPFNVKMGNLG 80 (140) T ss_pred CCcceecchhhHHHHHHHHHhccchHHHHHHHHHHhhhhHHHHhhhhhccCcCccchhhhccccchhhhhhhhhhhhhcc Confidence 887654 4567888888888776642110 001112222221 11122345 Q ss_pred ceeccCccccccc-cccchhhhcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHH Q lcl|NC_019544. 49 AEIPVTPKMRAWF-AANGYPLRKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPR 113 (168) Q Consensus 49 ~~i~~~~~~~~~~-~~~g~~~~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~ 113 (168) .+|.++++-.|.. +..|.. ++ -.+|.| ||+.+++...+++.+.+.+++.++++--+..+ T Consensus 81 f~i~~k~kf~YLvfPD~G~G----~s-n~~~q~-FmerGl~~~t~~i~E~L~~~l~k~in~~Lgg~ 140 (140) T protein:vir:40 81 FELLTKPKFNYLIFPDQGIG----KH-NKTKQD-FMQLGVEESSQEIVEMLEQAVFKEINDTLGGK 140 (140) T ss_pred eeEeecCcccccccccccCC----CC-CcchHH-HHHhccccchhHHHHHHHHHHHHHHHHhhcCC Confidence 5555544433332 222321 22 246766 99999999999988888777766554221111 No 178 >protein:vir:4460 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700383;genbank:gi:23505455;genbank:GeneID:955662 Probab=35.28 E-value=0.15 Score=25.17 Aligned_cols=70 Identities=19% Similarity=0.220 Sum_probs=39.7 Q ss_pred hcccceeccCCCchhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHH-HhCCCCCChHHHHHhc Q lcl|NC_019544. 69 RKETTVIKIPERSWLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKM-RDLKDPPNSQMTIERK 147 (168) Q Consensus 69 ~~~~~~i~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I-~~~~~ppnsp~Ti~~K 147 (168) +..+. ||---|++-.. +...+.-...++..+|+.-..+.++-+ ..|.. + T Consensus 1 M~~~~--------~lHvdF~qp~~------------~~Fnr~r~RraF~~iGq~h~r~Arrlvm~RGrs----------~ 50 (170) T protein:vir:44 1 MPQKA--------YLHVDFVQPEE------------LVFNRARMRRAFVKIGQVHMRDARRLVMKRGRS----------K 50 (170) T ss_pred CCCCc--------eeEEeeecCCc------------eeecHHHHHHHHHHHhHHHHHHHHHHHHHhcCC----------C Confidence 11111 11111211111 111122345678888888888888543 33421 3 Q ss_pred CCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 148 GSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 148 G~~~PLiDTG~L~~SIty~V~ 168 (168) .+.+|-..||.|..||+|.|- T Consensus 51 pGe~P~~~TGrLa~SIgy~Vp 71 (170) T protein:vir:44 51 PGENPSYRTGQLARSIGYYVP 71 (170) T ss_pred CCCCCcchhhhhhhhhhhccc Confidence 356899999999999999998 No 179 >protein:vir:96288 Length: 100 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240315;genbank:gi:66396010;genbank:GeneID:5133365 Probab=31.25 E-value=0.11 Score=25.81 Aligned_cols=72 Identities=15% Similarity=0.054 Sum_probs=33.4 Q ss_pred HHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhc-CCCCcchhHHHHHhHhccccC Q lcl|NC_019544. 90 NIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERK-GSDNPLIDTGRLVGSIRHTVE 168 (168) Q Consensus 90 ~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~K-G~~~PLiDTG~L~~SIty~V~ 168 (168) .+-.+-.+-+-.+.++--|..+....++..-......+++-|.+-- .-|... -+..| ||||.|+|||+..-+ T Consensus 1 ~~~~~~~~~~~~makvkyG~~dmvk~~~~f~~~i~~~vk~~IakTa------~~I~~~Avs~AP-VD~G~Lk~SI~~dyk 73 (100) T protein:vir:96 1 MKLNYYDLSRCHMAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTT------TKIYNTAVALAP-VDLGFLEESIDFKYF 73 (100) T ss_pred CcccccccchhhhhhheechHHHHHHHhcchHHHHHHHHHHHHHHH------HHHHhhHHhhcc-ccccccceeeeeeee Confidence 0111111222233344444444445555555555555555443210 000000 12234 999999999998866 No 180 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=23.11 E-value=1.6 Score=19.51 Aligned_cols=102 Identities=13% Similarity=0.113 Sum_probs=53.6 Q ss_pred CcceecccchHHHHHHHHHHhhCCeEEEEeecC--CCc-hHHHHHhhhhcCceeccCccccccccccchhhhcccceecc Q lcl|NC_019544. 1 MKVTIKDTNNIDKITRNLQQLGGKQIKVGLFGK--DDS-ELVMIGAVHEYGAEIPVTPKMRAWFAANGYPLRKETTVIKI 77 (168) Q Consensus 1 M~v~i~~~~~~~~~~~~l~~l~~~~v~VGi~~~--~g~-~~a~iA~~~E~G~~i~~~~~~~~~~~~~g~~~~~~~~~i~I 77 (168) |+=.|...+. .+.....-+..|||... +|. .=|+||.+.+.|+..++--.+ + ...+.....+.| T Consensus 62 LaDsI~~~~~------niDg~~dG~s~VGf~~k~~~~~~~kA~iAr~lNDGTk~~~~~~~------~-~~~~~~~g~v~i 128 (168) T protein:vir:74 62 LADSIVMKNK------NIDGVKDGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTR------S-GRKYKKPGEVAV 128 (168) T ss_pred hhhheeeccc------ccCcccCCceeecccccccccccchhhhhhhhcccccccccccc------c-cccccccccccc Confidence 3211111100 11223345678999653 342 369999999999754211000 0 001122346789 Q ss_pred CCCchhHHHHHH--HHHHHHHHHHHHHHHHHhccCcHHHHH Q lcl|NC_019544. 78 PERSWLRSGYDE--NIDKIAKKIEKMVPDVIEGNVNPRLFM 116 (168) Q Consensus 78 P~RpFlr~~~~~--~~~~~~~~~~~~~~~~l~G~~~~~~~l 116 (168) |.=+|+..+-.+ -++++.+.-...+.++|.-+.--.. | T Consensus 129 ~gDHFvd~~r~~~~~k~~V~~Ae~~~y~eIl~~k~~~~~-~ 168 (168) T protein:vir:74 129 HADHFIEETRMNLIVQQGILKAEAEAMRKIINRKKKENN-L 168 (168) T ss_pred ccchhHHHHHhhhhhHHHHHHHHHHHHHHHHHhhcCCCC-C Confidence 999999998887 4566666666666666553211111 1 No 181 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=22.81 E-value=1.2 Score=20.06 Aligned_cols=53 Identities=19% Similarity=0.309 Sum_probs=27.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHhHhc Q lcl|NC_019544. 85 SGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVGSIR 164 (168) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~SIt 164 (168) ..|...-.+|.+..+..+.. ++..+...+.+.|+. ..| +|||++|.|.. T Consensus 1 msf~~~i~~~~~~ve~~~~~----------~~r~~a~~~~~~iv~--------------------~sP-VdTGr~Ranw~ 49 (131) T protein:vir:78 1 MSFALDVSKFVEKAKKNPEK----------VIRQVSIKLFSAIIK--------------------ASP-VDTGRFRMNWM 49 (131) T ss_pred CCcCcCHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHH--------------------hCC-Cchhhhccccc Confidence 35555556666665555544 334444444444432 123 57888877765 Q ss_pred cccC Q lcl|NC_019544. 165 HTVE 168 (168) Q Consensus 165 y~V~ 168 (168) .-+. T Consensus 50 vs~~ 53 (131) T protein:vir:78 50 ASGG 53 (131) T ss_pred eecc Confidence 5544 No 182 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=21.77 E-value=2.3 Score=18.61 Aligned_cols=56 Identities=11% Similarity=0.160 Sum_probs=29.2 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHh Q lcl|NC_019544. 82 WLRSGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVG 161 (168) Q Consensus 82 Flr~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~ 161 (168) .+-..|...-.+|.+.+++.+.. .+..+...+.+.|+. ..| ||||++|. T Consensus 1 ~~~~sf~~~i~~~~~~ve~~~~~----------~~r~~~~~~~~~vv~--------------------~sP-VdtGrfRa 49 (121) T protein:vir:94 1 MISMKFNVNLSRLRSNLREEAKK----------KAIRIAQEIVNGVIA--------------------RSP-VLAGDYRS 49 (121) T ss_pred CccchhhccHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHH--------------------hcC-Cchhhhhc Confidence 45556666666776666655443 344444444444442 122 56666666 Q ss_pred HhccccC Q lcl|NC_019544. 162 SIRHTVE 168 (168) Q Consensus 162 SIty~V~ 168 (168) |-..-+- T Consensus 50 nw~vs~~ 56 (121) T protein:vir:94 50 SWNVSEG 56 (121) T ss_pred ccccccc Confidence 5443332 No 183 >protein:vir:80425 Length: 134 # NCBI annotation: BcepGomrgp15 # Family: family:all:448 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210235;genbank:gi:146329927;genbank:GeneID:5123534 Probab=21.30 E-value=1.5 Score=19.67 Aligned_cols=53 Identities=21% Similarity=0.369 Sum_probs=26.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCChHHHHHhcCCCCcchhHHHHHhHhc Q lcl|NC_019544. 85 SGYDENIDKIAKKIEKMVPDVIEGNVNPRLFMDAIGMEFAGLIQKKMRDLKDPPNSQMTIERKGSDNPLIDTGRLVGSIR 164 (168) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~l~G~~~~~~~l~~iG~~~~~~ik~~I~~~~~ppnsp~Ti~~KG~~~PLiDTG~L~~SIt 164 (168) ..|...-.+|.+.++..+.. .+..+...+.+.++.. .| +|||++|.|-. T Consensus 1 msF~~~i~~~~~~ve~~~~~----------~~r~~a~~~~~~vv~~--------------------sP-VdTGr~Ranw~ 49 (134) T protein:vir:80 1 MSYTDRFNVIAKGIEDNVDN----------LVKNVALAIGSNVIAD--------------------TP-ILTGQARRNWQ 49 (134) T ss_pred CCcccCHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHh--------------------CC-Ccchhhhcccc Confidence 45556666666655555443 3444444444444331 12 36666666553 Q ss_pred cccC Q lcl|NC_019544. 165 HTVE 168 (168) Q Consensus 165 y~V~ 168 (168) .-+- T Consensus 50 vs~~ 53 (134) T protein:vir:80 50 TELN 53 (134) T ss_pred eeec Confidence 3332 Done!