Query lcl|NC_019527.1_cdsid_YP_007007751.1 [gene=F394_gp62] [protein=hypothetical protein] [protein_id=YP_007007751.1] [location=29574..30146] Match_columns 190 No_of_seqs 106 out of 172 Neff 5.9 Searched_HMMs 1612 Date Thu Nov 7 18:09:50 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_62 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_62_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:107757 Length: 189 100.0 5.1E-60 3.1E-63 345.7 18.6 180 1-181 1-189 (189) 2 protein:vir:99546 Length: 200 100.0 5.3E-56 3.3E-59 323.6 16.0 146 1-177 7-200 (200) 3 protein:vir:5257 Length: 148 # 100.0 1.8E-53 1.1E-56 309.7 16.2 142 1-177 1-148 (148) 4 protein:vir:96105 Length: 193 100.0 3.9E-53 2.4E-56 307.9 16.6 145 2-177 1-193 (193) 5 protein:vir:80037 Length: 199 100.0 3E-50 1.8E-53 292.1 15.9 144 1-179 1-199 (199) 6 protein:vir:78607 Length: 155 100.0 5.3E-49 3.3E-52 285.3 15.2 137 1-178 1-155 (155) 7 protein:vir:106728 Length: 155 100.0 7.1E-49 4.4E-52 284.5 15.2 137 1-178 1-155 (155) 8 protein:vir:94069 Length: 168 100.0 4.7E-48 2.9E-51 280.1 15.2 150 1-187 1-168 (168) 9 protein:vir:77650 Length: 155 100.0 8.6E-48 5.3E-51 278.6 14.9 137 1-178 1-155 (155) 10 protein:vir:101563 Length: 155 100.0 2.6E-47 1.6E-50 276.0 15.3 137 2-178 1-155 (155) 11 protein:vir:95260 Length: 160 100.0 1.2E-43 7.3E-47 255.9 14.5 147 1-184 1-160 (160) 12 protein:vir:99833 Length: 190 98.8 1.6E-11 9.7E-15 79.8 6.1 97 63-190 1-105 (190) 13 protein:vir:79091 Length: 175 98.8 3.7E-11 2.3E-14 77.8 7.1 115 64-190 1-122 (175) 14 protein:vir:103841 Length: 155 98.8 2.3E-11 1.4E-14 78.9 5.9 101 56-190 1-105 (155) 15 protein:vir:99196 Length: 155 98.7 4.8E-11 3E-14 77.1 6.9 99 64-190 1-105 (155) 16 protein:vir:79225 Length: 155 98.7 4.6E-11 2.8E-14 77.3 6.5 99 64-190 1-105 (155) 17 protein:vir:3163 Length: 145 # 98.7 3.8E-11 2.4E-14 77.7 4.8 88 68-190 1-99 (145) 18 protein:vir:1988 Length: 156 # 98.7 9.1E-11 5.6E-14 75.6 6.6 98 64-190 1-109 (156) 19 protein:vir:107851 Length: 175 98.5 7.5E-10 4.7E-13 70.6 6.5 115 64-190 1-122 (175) 20 protein:vir:4347 Length: 164 # 98.2 3E-09 1.8E-12 67.3 4.3 98 1-104 1-164 (164) 21 protein:vir:1891 Length: 179 # 98.0 5.9E-09 3.6E-12 65.7 2.2 98 1-104 1-179 (179) 22 protein:vir:102085 Length: 146 97.9 3.3E-08 2E-11 61.6 5.6 83 1-94 1-146 (146) 23 protein:vir:105007 Length: 146 97.9 3.3E-08 2E-11 61.6 5.6 83 1-94 1-146 (146) 24 protein:vir:102875 Length: 146 97.9 3.3E-08 2E-11 61.6 5.6 83 1-94 1-146 (146) 25 protein:vir:107568 Length: 146 97.9 3.3E-08 2E-11 61.6 5.6 83 1-94 1-146 (146) 26 protein:vir:80362 Length: 140 97.9 4.3E-08 2.7E-11 61.0 5.3 92 1-100 1-140 (140) 27 protein:vir:100243 Length: 140 97.8 6.9E-08 4.3E-11 59.8 5.8 92 1-113 1-140 (140) 28 protein:vir:93617 Length: 148 97.8 2.5E-08 1.5E-11 62.3 3.0 90 1-96 1-148 (148) 29 protein:vir:95789 Length: 114 97.8 7.8E-08 4.8E-11 59.6 5.7 85 1-88 1-114 (114) 30 protein:vir:2026 Length: 150 # 97.8 3.3E-07 2.1E-10 56.1 9.0 92 69-190 1-105 (150) 31 protein:vir:106623 Length: 115 97.8 8.2E-08 5.1E-11 59.4 5.2 80 2-84 1-115 (115) 32 protein:vir:103917 Length: 115 97.8 4.3E-08 2.7E-11 61.0 3.7 80 2-84 1-115 (115) 33 protein:vir:9312 Length: 115 # 97.8 4.3E-08 2.7E-11 61.0 3.7 80 2-84 1-115 (115) 34 protein:vir:78858 Length: 115 97.8 4.3E-08 2.7E-11 61.0 3.7 80 2-84 1-115 (115) 35 protein:vir:97144 Length: 115 97.8 4.3E-08 2.7E-11 61.0 3.7 80 2-84 1-115 (115) 36 protein:vir:96358 Length: 115 97.8 4.3E-08 2.7E-11 61.0 3.7 80 2-84 1-115 (115) 37 protein:vir:96225 Length: 115 97.8 4.3E-08 2.7E-11 61.0 3.7 80 2-84 1-115 (115) 38 protein:vir:1437 Length: 140 # 97.8 6.9E-08 4.3E-11 59.8 4.8 92 1-100 1-140 (140) 39 protein:vir:3873 Length: 128 # 97.7 5.2E-08 3.2E-11 60.5 3.7 77 1-88 44-128 (128) 40 protein:vir:1386 Length: 149 # 97.7 8.2E-08 5.1E-11 59.4 4.4 81 1-89 50-149 (149) 41 protein:vir:98557 Length: 149 97.7 4.9E-07 3E-10 55.2 8.7 92 68-190 1-104 (149) 42 protein:vir:100075 Length: 140 97.7 2.4E-07 1.5E-10 56.9 6.9 85 1-88 1-140 (140) 43 protein:vir:4906 Length: 114 # 97.6 3.2E-08 2E-11 61.6 0.7 82 1-85 1-114 (114) 44 protein:vir:2740 Length: 114 # 97.6 3.2E-08 2E-11 61.6 0.7 82 1-85 1-114 (114) 45 protein:vir:1273 Length: 127 # 97.6 2.2E-07 1.3E-10 57.1 4.9 77 1-88 45-127 (127) 46 protein:vir:99744 Length: 115 97.6 1.3E-07 7.9E-11 58.4 3.3 80 2-84 1-115 (115) 47 protein:vir:5703 Length: 150 # 97.5 1.8E-06 1.1E-09 52.1 9.0 92 69-190 1-105 (150) 48 protein:vir:105330 Length: 137 97.4 1.6E-07 1E-10 57.8 2.4 80 1-80 1-137 (137) 49 protein:vir:6071 Length: 150 # 97.4 2.7E-06 1.7E-09 51.1 9.0 92 69-190 1-105 (150) 50 protein:vir:96486 Length: 112 97.4 1.5E-07 9.1E-11 58.0 2.1 80 1-83 1-112 (112) 51 protein:vir:94538 Length: 125 97.4 1.9E-07 1.2E-10 57.4 2.7 86 1-89 1-125 (125) 52 protein:vir:94796 Length: 137 97.4 2.9E-07 1.8E-10 56.4 3.7 80 1-80 1-137 (137) 53 protein:vir:9930 Length: 108 # 97.4 4.7E-07 2.9E-10 55.3 4.5 80 1-85 8-108 (108) 54 protein:vir:9708 Length: 125 # 97.4 4.7E-07 2.9E-10 55.3 4.5 77 1-88 41-125 (125) 55 protein:vir:3617 Length: 112 # 97.4 5.5E-08 3.4E-11 60.4 -0.7 79 1-84 1-112 (112) 56 protein:vir:106570 Length: 182 97.4 1.1E-07 6.8E-11 58.7 0.8 93 1-99 1-182 (182) 57 protein:vir:94654 Length: 142 97.4 5.8E-07 3.6E-10 54.8 4.8 83 1-84 1-142 (142) 58 protein:vir:96121 Length: 137 97.3 1.8E-07 1.1E-10 57.5 1.8 80 1-80 1-137 (137) 59 protein:vir:194 Length: 149 # 97.3 1.7E-07 1.1E-10 57.7 1.6 90 1-96 21-149 (149) 60 protein:vir:94108 Length: 149 97.3 3.4E-07 2.1E-10 56.0 2.7 80 1-80 13-149 (149) 61 protein:vir:1988 Length: 156 # 97.3 2.1E-06 1.3E-09 51.7 6.9 74 1-96 76-156 (156) 62 protein:vir:98409 Length: 108 97.2 4.6E-07 2.9E-10 55.3 2.9 80 2-84 1-108 (108) 63 protein:vir:97088 Length: 157 97.2 7.4E-07 4.6E-10 54.2 3.7 79 1-88 46-157 (157) 64 protein:vir:107099 Length: 137 97.2 7.7E-07 4.8E-10 54.1 3.7 80 1-80 1-137 (137) 65 protein:vir:100312 Length: 152 97.2 1.4E-06 8.5E-10 52.7 5.0 77 1-86 65-152 (152) 66 protein:vir:81147 Length: 126 97.2 8.9E-07 5.5E-10 53.8 3.8 87 1-87 1-126 (126) 67 protein:vir:97427 Length: 137 97.1 1.1E-07 6.5E-11 58.8 -1.5 80 1-80 1-137 (137) 68 protein:vir:94490 Length: 137 97.1 1.1E-07 6.5E-11 58.8 -1.5 80 1-80 1-137 (137) 69 protein:vir:93738 Length: 137 97.1 1.1E-07 6.5E-11 58.8 -1.5 80 1-80 1-137 (137) 70 protein:vir:5745 Length: 135 # 97.1 1.4E-06 8.7E-10 52.7 4.7 83 1-99 47-135 (135) 71 protein:vir:105916 Length: 149 97.1 5.3E-07 3.3E-10 55.0 2.2 80 1-80 13-149 (149) 72 protein:vir:3163 Length: 145 # 97.1 5.3E-07 3.3E-10 55.0 2.2 77 1-89 52-145 (145) 73 protein:vir:95894 Length: 137 97.1 9.7E-07 6E-10 53.5 3.6 80 1-80 1-137 (137) 74 protein:vir:96829 Length: 135 97.1 1.1E-06 6.7E-10 53.3 3.5 80 1-80 1-135 (135) 75 protein:vir:4704 Length: 125 # 97.0 7.3E-07 4.6E-10 54.2 2.3 74 1-88 17-125 (125) 76 protein:vir:81106 Length: 125 97.0 7.3E-07 4.6E-10 54.2 2.3 74 1-88 17-125 (125) 77 protein:vir:9414 Length: 125 # 97.0 7.3E-07 4.6E-10 54.2 2.3 74 1-88 17-125 (125) 78 protein:vir:98342 Length: 125 97.0 7.3E-07 4.6E-10 54.2 2.3 74 1-88 17-125 (125) 79 protein:vir:79988 Length: 125 97.0 7.3E-07 4.6E-10 54.2 2.3 74 1-88 17-125 (125) 80 protein:vir:743 Length: 108 # 97.0 7E-07 4.4E-10 54.3 1.9 78 2-84 1-108 (108) 81 protein:vir:103841 Length: 155 96.8 3E-06 1.8E-09 50.9 4.1 73 1-87 71-155 (155) 82 protein:vir:105089 Length: 133 96.8 3.2E-06 2E-09 50.7 3.7 81 1-91 16-133 (133) 83 protein:vir:101594 Length: 173 96.7 5.3E-06 3.3E-09 49.5 4.5 87 1-97 13-173 (173) 84 protein:vir:5978 Length: 144 # 96.7 1.7E-06 1.1E-09 52.2 1.8 82 1-86 16-144 (144) 85 protein:vir:79225 Length: 155 96.6 9.2E-06 5.7E-09 48.2 5.1 73 1-87 73-155 (155) 86 protein:vir:99833 Length: 190 96.6 1.4E-05 8.4E-09 47.3 6.0 76 1-98 73-190 (190) 87 protein:vir:99196 Length: 155 96.6 8.9E-06 5.5E-09 48.3 4.9 73 1-87 73-155 (155) 88 protein:vir:78077 Length: 141 96.5 3.2E-06 2E-09 50.7 2.2 87 1-89 10-141 (141) 89 protein:vir:5703 Length: 150 # 96.5 7.9E-06 4.9E-09 48.6 4.3 77 1-85 53-150 (150) 90 protein:vir:6071 Length: 150 # 96.5 8.2E-06 5.1E-09 48.5 4.4 77 1-85 53-150 (150) 91 protein:vir:107851 Length: 175 96.5 7.2E-06 4.5E-09 48.8 4.0 75 1-86 83-175 (175) 92 protein:vir:79091 Length: 175 96.4 4.4E-06 2.7E-09 49.9 2.4 75 1-86 83-175 (175) 93 protein:vir:98557 Length: 149 96.2 2.5E-05 1.5E-08 45.8 5.7 76 1-85 53-149 (149) 94 protein:vir:79179 Length: 155 96.2 7.6E-05 4.7E-08 43.2 8.0 98 68-190 1-110 (155) 95 protein:vir:1838 Length: 149 # 96.1 7.2E-05 4.4E-08 43.3 7.6 92 68-190 1-104 (149) 96 protein:vir:2026 Length: 150 # 96.1 2.8E-05 1.7E-08 45.6 5.3 77 1-85 53-150 (150) 97 protein:vir:79179 Length: 155 96.1 1.7E-05 1.1E-08 46.7 3.9 76 1-85 54-155 (155) 98 protein:vir:99101 Length: 142 95.9 8.5E-06 5.3E-09 48.4 1.6 81 1-81 1-142 (142) 99 protein:vir:8669 Length: 142 # 95.9 8.5E-06 5.3E-09 48.4 1.6 81 1-81 1-142 (142) 100 protein:vir:79115 Length: 148 95.8 0.00016 1E-07 41.3 8.1 91 69-190 1-103 (148) 101 protein:vir:79115 Length: 148 95.7 3.3E-05 2.1E-08 45.1 4.2 76 1-85 53-148 (148) 102 protein:vir:1243 Length: 116 # 95.5 8.2E-06 5.1E-09 48.5 0.1 77 1-80 1-116 (116) 103 protein:vir:97327 Length: 116 95.5 8.2E-06 5.1E-09 48.5 0.1 77 1-80 1-116 (116) 104 protein:vir:1838 Length: 149 # 95.5 8.6E-05 5.3E-08 42.9 5.6 76 1-85 53-149 (149) 105 protein:vir:1164 Length: 156 # 95.4 0.00033 2E-07 39.7 8.6 95 68-190 1-107 (156) 106 protein:vir:95062 Length: 116 95.4 9.1E-06 5.6E-09 48.2 -0.1 75 4-80 1-116 (116) 107 protein:vir:1164 Length: 156 # 95.2 6.4E-05 4E-08 43.6 3.9 80 1-91 54-156 (156) 108 protein:vir:102154 Length: 119 95.1 2.1E-05 1.3E-08 46.3 1.0 77 1-88 19-119 (119) 109 protein:vir:96829 Length: 135 94.8 0.00016 9.9E-08 41.4 5.2 74 64-190 1-77 (135) 110 protein:vir:95062 Length: 116 94.3 0.00015 9E-08 41.6 3.8 53 90-190 1-56 (116) 111 protein:vir:966 Length: 123 # 93.8 0.0005 3.1E-07 38.7 5.7 85 1-85 1-123 (123) 112 protein:vir:80116 Length: 127 93.3 0.00022 1.3E-07 40.7 2.9 88 1-88 1-127 (127) 113 protein:vir:1243 Length: 116 # 93.2 0.00032 2E-07 39.8 3.6 53 90-190 1-56 (116) 114 protein:vir:97327 Length: 116 93.2 0.00032 2E-07 39.8 3.6 53 90-190 1-56 (116) 115 protein:vir:93738 Length: 137 93.2 0.00075 4.7E-07 37.7 5.7 74 64-190 1-77 (137) 116 protein:vir:97427 Length: 137 93.2 0.00075 4.7E-07 37.7 5.7 74 64-190 1-77 (137) 117 protein:vir:94490 Length: 137 93.2 0.00075 4.7E-07 37.7 5.7 74 64-190 1-77 (137) 118 protein:vir:10367 Length: 119 92.7 0.0004 2.5E-07 39.2 3.5 75 4-88 1-119 (119) 119 protein:vir:100312 Length: 152 92.7 0.0024 1.5E-06 34.9 7.7 91 68-190 1-105 (152) 120 protein:vir:96121 Length: 137 92.5 0.0008 5E-07 37.6 4.9 74 64-190 1-77 (137) 121 protein:vir:94654 Length: 142 92.5 0.00032 2E-07 39.7 2.7 75 64-190 1-81 (142) 122 protein:vir:94796 Length: 137 92.0 0.0011 6.9E-07 36.8 5.0 74 64-190 1-77 (137) 123 protein:vir:5978 Length: 144 # 92.0 0.00074 4.6E-07 37.8 4.0 79 59-190 1-82 (144) 124 protein:vir:100887 Length: 139 91.9 0.00054 3.4E-07 38.5 3.2 74 1-92 61-139 (139) 125 protein:vir:95894 Length: 137 91.6 0.0015 9.5E-07 36.0 5.3 74 64-190 1-77 (137) 126 protein:vir:95372 Length: 124 91.5 0.00032 2E-07 39.8 1.4 84 1-85 1-124 (124) 127 protein:vir:81067 Length: 119 91.1 0.00087 5.4E-07 37.4 3.5 77 4-88 1-119 (119) 128 protein:vir:9930 Length: 108 # 90.6 0.0013 8.1E-07 36.4 3.9 70 65-190 1-72 (108) 129 protein:vir:105330 Length: 137 90.6 0.0014 8.8E-07 36.2 4.1 74 64-190 1-77 (137) 130 protein:vir:107099 Length: 137 90.5 0.0021 1.3E-06 35.3 5.0 74 64-190 1-77 (137) 131 protein:vir:5000 Length: 141 # 90.2 0.0011 6.5E-07 36.9 3.0 73 1-89 61-141 (141) 132 protein:vir:78077 Length: 141 89.7 0.0024 1.5E-06 35.0 4.6 73 64-190 1-77 (141) 133 protein:vir:107545 Length: 140 88.8 0.00048 3E-07 38.8 0.1 78 1-78 7-140 (140) 134 protein:vir:97982 Length: 140 88.8 0.00048 3E-07 38.8 0.1 78 1-78 7-140 (140) 135 protein:vir:94108 Length: 149 88.7 0.0038 2.4E-06 33.9 5.0 86 23-190 1-89 (149) 136 protein:vir:99744 Length: 115 88.6 0.0046 2.8E-06 33.4 5.3 78 63-190 1-80 (115) 137 protein:vir:106623 Length: 115 88.4 0.0052 3.2E-06 33.1 5.5 78 63-190 1-80 (115) 138 protein:vir:106570 Length: 182 88.4 0.0016 1E-06 35.8 2.8 75 83-190 1-84 (182) 139 protein:vir:96358 Length: 115 88.2 0.0051 3.2E-06 33.2 5.3 78 63-190 1-80 (115) 140 protein:vir:96225 Length: 115 88.2 0.0051 3.2E-06 33.2 5.3 78 63-190 1-80 (115) 141 protein:vir:97144 Length: 115 88.2 0.0051 3.2E-06 33.2 5.3 78 63-190 1-80 (115) 142 protein:vir:9312 Length: 115 # 88.2 0.0051 3.2E-06 33.2 5.3 78 63-190 1-80 (115) 143 protein:vir:78858 Length: 115 88.2 0.0051 3.2E-06 33.2 5.3 78 63-190 1-80 (115) 144 protein:vir:103917 Length: 115 88.2 0.0051 3.2E-06 33.2 5.3 78 63-190 1-80 (115) 145 protein:vir:100223 Length: 139 87.8 0.0018 1.1E-06 35.7 2.5 74 1-92 61-139 (139) 146 protein:vir:4906 Length: 114 # 87.0 0.003 1.9E-06 34.4 3.4 77 64-190 1-78 (114) 147 protein:vir:2740 Length: 114 # 87.0 0.003 1.9E-06 34.4 3.4 77 64-190 1-78 (114) 148 protein:vir:102441 Length: 137 86.6 0.00086 5.4E-07 37.4 0.1 79 1-79 14-137 (137) 149 protein:vir:4859 Length: 140 # 85.6 0.0036 2.2E-06 34.0 3.0 73 1-89 61-140 (140) 150 protein:vir:4956 Length: 153 # 85.5 0.0027 1.7E-06 34.6 2.3 86 1-128 61-153 (153) 151 protein:vir:4833 Length: 140 # 85.3 0.003 1.8E-06 34.4 2.4 71 1-87 61-140 (140) 152 protein:vir:3787 Length: 231 # 84.2 0.014 8.6E-06 30.8 5.6 81 1-89 59-231 (231) 153 protein:vir:79034 Length: 141 84.2 0.0065 4E-06 32.6 3.8 90 1-90 1-141 (141) 154 protein:vir:743 Length: 108 # 83.8 0.011 6.7E-06 31.4 4.8 70 63-190 1-73 (108) 155 protein:vir:100652 Length: 134 83.6 0.0077 4.8E-06 32.2 3.9 77 1-86 1-134 (134) 156 protein:vir:105916 Length: 149 83.2 0.013 8.2E-06 30.9 5.0 86 45-190 1-89 (149) 157 protein:vir:9879 Length: 127 # 82.6 0.0055 3.4E-06 33.0 2.7 81 4-85 1-127 (127) 158 protein:vir:101594 Length: 173 82.5 0.014 8.6E-06 30.8 4.8 71 68-190 1-77 (173) 159 protein:vir:106506 Length: 137 81.8 0.015 9.3E-06 30.6 4.7 69 57-190 1-75 (137) 160 protein:vir:3617 Length: 112 # 81.0 0.016 9.7E-06 30.5 4.6 73 64-190 1-83 (112) 161 protein:vir:106041 Length: 137 80.3 0.0027 1.7E-06 34.6 0.2 70 64-190 1-76 (137) 162 protein:vir:105467 Length: 144 80.2 0.033 2.1E-05 28.7 6.1 75 64-190 1-91 (144) 163 protein:vir:98409 Length: 108 79.5 0.023 1.5E-05 29.5 5.0 70 63-190 1-73 (108) 164 protein:vir:95789 Length: 114 74.4 0.017 1.1E-05 30.3 2.8 73 64-190 1-75 (114) 165 protein:vir:9647 Length: 132 # 73.4 0.053 3.3E-05 27.6 5.2 79 1-90 4-132 (132) 166 protein:vir:78755 Length: 228 72.9 0.035 2.2E-05 28.6 4.1 91 1-96 55-228 (228) 167 protein:vir:101302 Length: 134 72.5 0.038 2.4E-05 28.4 4.2 77 1-86 1-134 (134) 168 protein:vir:9513 Length: 134 # 72.5 0.038 2.4E-05 28.4 4.2 77 1-86 1-134 (134) 169 protein:vir:102963 Length: 163 65.3 0.11 6.9E-05 25.8 5.2 102 64-190 1-117 (163) 170 protein:vir:99528 Length: 92 # 63.2 0.072 4.5E-05 26.8 3.7 74 59-190 1-81 (92) 171 protein:vir:3873 Length: 128 # 61.7 0.13 7.9E-05 25.5 4.8 80 64-190 1-82 (128) 172 protein:vir:102963 Length: 163 54.4 0.071 4.4E-05 26.9 2.1 89 1-89 20-163 (163) 173 protein:vir:1332 Length: 143 # 52.8 0.057 3.5E-05 27.4 1.3 91 1-96 21-143 (143) 174 protein:vir:7412 Length: 168 # 49.5 0.25 0.00016 23.9 4.3 82 1-90 62-168 (168) 175 protein:vir:98636 Length: 138 47.8 0.28 0.00017 23.6 4.3 79 1-90 10-138 (138) 176 protein:vir:3848 Length: 159 # 41.4 0.21 0.00013 24.3 2.5 75 1-90 76-159 (159) 177 protein:vir:6246 Length: 143 # 36.6 0.26 0.00016 23.8 2.2 91 1-96 21-143 (143) 178 protein:vir:95372 Length: 124 35.6 1.2 0.00076 20.1 6.0 80 63-190 1-96 (124) 179 protein:vir:1028 Length: 168 # 25.4 0.98 0.00061 20.6 3.4 82 1-90 62-168 (168) 180 protein:vir:2688 Length: 123 # 25.0 1.1 0.00068 20.4 3.6 66 75-190 1-76 (123) 181 protein:vir:104347 Length: 145 20.8 2.7 0.0017 18.2 5.3 69 1-83 63-145 (145) 182 protein:vir:94944 Length: 121 20.0 1.7 0.001 19.4 3.5 69 1-72 1-121 (121) No 1 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=100.00 E-value=5.1e-60 Score=345.66 Aligned_cols=180 Identities=27% Similarity=0.425 Sum_probs=161.5 Q ss_pred CC-CCchhhH-HHHHHHHHHhhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHH Q lcl|NC_019527. 1 MA-TLTGGDK-LAKILADIGGKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKR 78 (190) Q Consensus 1 Ma-~i~~~d~-l~~il~~l~~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~ 78 (190) |+ .|+++++ ++++.+.|++|++++|+||||++++||||++||+||+|||||+|++ +||||||||+||++++++|.++ T Consensus 1 M~~~i~~~~~~~~~L~~~lk~l~~k~V~VGi~~~~~y~dG~~vA~Ia~~~E~G~p~~-~IP~RPFlr~t~~~~~~~~~~~ 79 (189) T protein:vir:10 1 MGRVIRKQGPARVKLNAFIKGMNDYSVRIGWFSTAKYPDGTPTAYVASIHEFGAPSR-GIPARSFIRPTIAAQQAAWSQQ 79 (189) T ss_pred CcceeccCcHHHHHHHHHHHHhhCCeEEEEecCCCCCCCcccHHHHHHHHHhcCcCC-CCCCchhhhHHHHHHHHHHHHH Confidence 98 7776665 6777788999999999999999999999999999999999999987 4999999999999999999999 Q ss_pred HHHHHhh---cCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhhHHH----hhhcc Q lcl|NC_019527. 79 LGDAIKH---YDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAAQEL----VEEGF 151 (190) Q Consensus 79 l~~~i~~---g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~~~~----~~~g~ 151 (190) +++++.+ |.++++++|+.+|++++++||.+|+++.||||||+||++|+|+++.+.....+........ ..+++ T Consensus 80 l~~~~~~vl~G~~~~~~~L~~~G~~a~~~Ik~~I~~~~~ppna~sTi~~Kg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (189) T protein:vir:10 80 MRFYAKQIVVGQMNVEQALEGLAIVARGDVDATLARLKDPPLSPLTIYIRKFIKDGGVIHGYKDIMRLRSEMQQEQAKGT 159 (189) T ss_pred HHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHHhcccCcccchhhhhhhhhhhhhhhhhhhhcc Confidence 9887754 7899999999999999999999999999999999999999999888776666665444444 44777 Q ss_pred cccccCccCchHHHHHHHhhcceeeecCce Q lcl|NC_019527. 152 QGAGGSQAKPLVWTGHMLNSITYQVDGGAT 181 (190) Q Consensus 152 ~~~~~~s~kPLIDTG~L~~SIty~V~~g~~ 181 (190) ...+..|++||||||+|++||||+|++-.+ T Consensus 160 ~~~~~~s~kPLidTG~l~~SIty~V~~k~~ 189 (189) T protein:vir:10 160 LNLSGVSTDPLDFTGYMRATLSYTVTKEKS 189 (189) T ss_pred ccccccCCCchhhHHHHHhhcceeeeecCC Confidence 888899999999999999999999999888 No 2 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=100.00 E-value=5.3e-56 Score=323.62 Aligned_cols=146 Identities=25% Similarity=0.293 Sum_probs=137.2 Q ss_pred CC-CCchhhHHHHHHHHHHhhcCCEEEEEecCCCCCC------CCccHHHHHHHhhcCcc-------------------- Q lcl|NC_019527. 1 MA-TLTGGDKLAKILADIGGKAQGSVDVGFMSGATYP------DGTPVAQVAFWNEFGHG-------------------- 53 (190) Q Consensus 1 Ma-~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~------dG~~vA~iA~~~EfG~~-------------------- 53 (190) |+ +++++++|++++++|++|++++|+|||+++++|+ ||++||+||+|||||++ T Consensus 7 ~~~k~~~~~~~~~~~~~l~~l~~~~v~vGi~~~~~y~~~~~~~dG~~va~IA~~~EfG~~i~~p~~~~~~~~~~~~g~~~ 86 (200) T protein:vir:99 7 KSNSVAAPLKHFQMLKQFDALKGKTVQAGWFETDRYPAKEGETIGPLVAKIARQLEFGGVINHPGGTKYIKDAIVDGRYV 86 (200) T ss_pred eeeeeecchHHHHHHHHHHHhhCCeEEEEEcCCCCcCCcccccccchHHHHHhHHHcCCeeccCCCcccccccccccccc Confidence 66 8899999999999999999999999999999886 78999999999999964 Q ss_pred ------------------ccCCCCCCchhhHHHHHHHHHHHHHHHHHHh---hcCCcHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019527. 54 ------------------GRFPAPPRPFFRNMVNEKSSEWPKRLGDAIK---HYDGDGRKALASMGEMIGGDLGSSIIST 112 (190) Q Consensus 54 ------------------~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~---~g~~~~~~aL~~iG~~a~~~Iq~~I~~~ 112 (190) +.++||||||||+|+++++++|.+.+++.+. .|+++++++|+.+|+.++++||++|+++ T Consensus 87 g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~~G~~~~~~ik~~I~~~ 166 (200) T protein:vir:99 87 GTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQAQIARQLLDGTINPEQALAQIGLALEGCIVRSIKSG 166 (200) T ss_pred ccccccccccceeeeeccccccCCCcchhhHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcC Confidence 3457999999999999999999999987764 5888999999999999999999999999 Q ss_pred CCCCCChHHHHHhccccccccchhhhhhhhhHHHhhhcccccccCccCchHHHHHHHhhcceeee Q lcl|NC_019527. 113 NEPALSKTTLMLRSIYGNNPQEIRARDVLAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVD 177 (190) Q Consensus 113 ~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~ 177 (190) .||||||+||++|+ +|+||||||+|++||||+|+ T Consensus 167 ~~ppna~sTi~~Kg-------------------------------~~~PLidTG~l~~SIty~Ve 200 (200) T protein:vir:99 167 PWAANSPATIRAKG-------------------------------FDKPLIDTAHMWQTVSSKVS 200 (200) T ss_pred CCCCChHHHHHHhC-------------------------------CCCchHHHHHHHhHhccccC Confidence 99999999999998 69999999999999999999 No 3 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=100.00 E-value=1.8e-53 Score=309.72 Aligned_cols=142 Identities=24% Similarity=0.339 Sum_probs=132.5 Q ss_pred CC-CCchh-hHHHHHHHHHHhhcCCEEEEEecC----CCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHH Q lcl|NC_019527. 1 MA-TLTGG-DKLAKILADIGGKAQGSVDVGFMS----GATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSE 74 (190) Q Consensus 1 Ma-~i~~~-d~l~~il~~l~~l~~~~V~VGi~~----~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~ 74 (190) |+ +++.. +++++++++|++|++++|+||||+ +..|+||++||+||||||||++ +||||||||+||++++++ T Consensus 1 M~~~~k~~~~~~~~l~~~l~~l~~~~v~VGi~~~~~~~~~~~~g~~vA~ia~~~E~G~~---~IP~Rpflr~t~~~~~~~ 77 (148) T protein:vir:52 1 MAVTVTANFSAAKQLIEQMKSLKEKAVYVGFPAEFDEKVKGSENFNLASLAAVLEFGNE---HIPARPFLRQTLEENQEK 77 (148) T ss_pred CccccccccHHHHHHHHHHHHhhCCeEEEEeecCcCCCCCCCCCCCHHHHHHHHhcCCC---CCCCcchhHHHHHHHHHH Confidence 88 66654 479999999999999999999994 5678999999999999999975 599999999999999999 Q ss_pred HHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhhHHHhhhccccc Q lcl|NC_019527. 75 WPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAAQELVEEGFQGA 154 (190) Q Consensus 75 ~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 154 (190) |.+.+++.+.+ ..+++++|+.+|+.++++||++|+++.+|||||+||++|+ T Consensus 78 ~~~~~~~~~~~-~~~~~~~L~~~G~~~~~~ik~~I~~~~~ppna~sTi~~Kg---------------------------- 128 (148) T protein:vir:52 78 YTALFIQWFDQ-GVPAAQIYERLSVMAQGDVQMNIVKGEWVANAKSTIRRKK---------------------------- 128 (148) T ss_pred HHHHHHHHHHc-CCCHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHhcC---------------------------- Confidence 99999988764 5899999999999999999999999999999999999988 Q ss_pred ccCccCchHHHHHHHhhcceeee Q lcl|NC_019527. 155 GGSQAKPLVWTGHMLNSITYQVD 177 (190) Q Consensus 155 ~~~s~kPLIDTG~L~~SIty~V~ 177 (190) |++||||||+|++||||+|+ T Consensus 129 ---~~~PLidTG~l~~SIty~V~ 148 (148) T protein:vir:52 129 ---SSKPLIDTGKMRQSVRGIVK 148 (148) T ss_pred ---CCCchhHHHHHHHHhhhhcC Confidence 69999999999999999999 No 4 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=100.00 E-value=3.9e-53 Score=307.90 Aligned_cols=145 Identities=32% Similarity=0.377 Sum_probs=132.7 Q ss_pred CCCc-hhhHHHHHHHHHHhhcCCEEEEEecCCCCCCC------CccHHHHHHHhhcCcc--------------------- Q lcl|NC_019527. 2 ATLT-GGDKLAKILADIGGKAQGSVDVGFMSGATYPD------GTPVAQVAFWNEFGHG--------------------- 53 (190) Q Consensus 2 a~i~-~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~d------G~~vA~iA~~~EfG~~--------------------- 53 (190) ++|+ +.++|++++++|++|++++|+|||+++++|+| |++||+||+|||||++ T Consensus 1 m~~~~~~~~~~~~~~~l~~l~~~~v~vGi~~~~~~~~~~~~~~G~~va~iAai~EfG~~I~~~~~~~~~~~~~~~g~~~~ 80 (193) T protein:vir:96 1 MSLRRDSELIAAHLQMLRAMRGRSVSAGWYSTARYPDKAGGSVGIQVARIARLNEYGGTIDHPGGTRYIRDAIVRGRFVG 80 (193) T ss_pred CeeccchHHHHHHHHHHHHhcCCeEEEEEcCCCCCCCcccccccchHHHHHhHHHcCCccccCccceeeeeccccccccc Confidence 4555 44579999999999999999999999998876 8999999999999964 Q ss_pred -----------------ccCCCCCCchhhHHHHHHHHHHHHHHHHHHh---hcCCcHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_019527. 54 -----------------GRFPAPPRPFFRNMVNEKSSEWPKRLGDAIK---HYDGDGRKALASMGEMIGGDLGSSIISTN 113 (190) Q Consensus 54 -----------------~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~---~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~ 113 (190) +.++||||||||+++++++++|.+.+++++. .|+++++++|+.+|+.++++||++|+++. T Consensus 81 ~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~~G~~~~~~ik~~I~~~~ 160 (193) T protein:vir:96 81 VRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQNRIAMRLARGQITPDQALAQIGLALEGYIARSIRTGP 160 (193) T ss_pred cceeccCcceeeEeecceeccCCCcchhhhhHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCC Confidence 2347999999999999999999998887654 58899999999999999999999999999 Q ss_pred CCCCChHHHHHhccccccccchhhhhhhhhHHHhhhcccccccCccCchHHHHHHHhhcceeee Q lcl|NC_019527. 114 EPALSKTTLMLRSIYGNNPQEIRARDVLAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVD 177 (190) Q Consensus 114 ~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~ 177 (190) +|||||+||++|+ ||+||||||+|++||||+|. T Consensus 161 ~ppna~~Ti~~KG-------------------------------~~~PLidTG~l~~SIty~Vv 193 (193) T protein:vir:96 161 WVANSASTVRRKG-------------------------------FNRPLVDTAHMLQSISSRVT 193 (193) T ss_pred CCCCcHHHHHHhC-------------------------------CCCchhHHHHHHhhhcceeC Confidence 9999999999998 69999999999999999999 No 5 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=100.00 E-value=3e-50 Score=292.12 Aligned_cols=144 Identities=26% Similarity=0.393 Sum_probs=129.4 Q ss_pred CCCCchhhHHHHHHHHHHhhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCcc--------------------------- Q lcl|NC_019527. 1 MATLTGGDKLAKILADIGGKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHG--------------------------- 53 (190) Q Consensus 1 Ma~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~--------------------------- 53 (190) |.-.++.+++++++++|++|++++|+|||++ +||.++++||.+||||+. T Consensus 1 m~vt~~~~~~~~~~~~l~~L~~k~v~vGi~~----~d~~~~~~Ia~~~E~Ga~I~~~~~~l~Ip~~~a~~~k~~~~~~~~ 76 (199) T protein:vir:80 1 MKVTTDKSTMNKAIRELDQLDRYSLQIGLFG----EDDSFIQMIAGVHEFGLTIRPKGKYLTIPTPEAGDRRARDIPGLF 76 (199) T ss_pred CcccccHHHHHHHHHHHHHhcCCEEEEEEec----CCCcchhheeehhhcCCeeecCCceeeecchhhhcccccccCccc Confidence 5444667899999999999999999999994 678889999999999942 Q ss_pred ------------------------ccCCCCCCchhhHHHHHHHHHHHHHHHHHHhh---cCCcHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 54 ------------------------GRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKH---YDGDGRKALASMGEMIGGDLG 106 (190) Q Consensus 54 ------------------------~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~---g~~~~~~aL~~iG~~a~~~Iq 106 (190) ..++||||||||+|+++++++|.+++++++.+ |+.+++++|+.+|+.++++|| T Consensus 77 ~p~g~~~~~~~~~~~~~~~~e~g~~~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~vl~g~~~a~~~L~~~G~~~~~~Ik 156 (199) T protein:vir:80 77 KPKGKNILAVAGPDGKLTVMFYLKTEVNIPERSFLRSTFDEKSNKWGELFEGWIDDVIHGKLSAEQVYNRLGAKIVDDIQ 156 (199) T ss_pred ccCCcceeeeeccccceeeeeeccccccCCCCchhHHHHHHHHHHHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHHHH Confidence 11469999999999999999999999987754 789999999999999999999 Q ss_pred HHHhccCCCCCChHHHH-HhccccccccchhhhhhhhhHHHhhhcccccccCccCchHHHHHHHhhcceeeecC Q lcl|NC_019527. 107 SSIISTNEPALSKTTLM-LRSIYGNNPQEIRARDVLAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGG 179 (190) Q Consensus 107 ~~I~~~~~pPnap~Ti~-~K~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g 179 (190) .+|+++.||||||+||+ +|+ +++||||||+|++||||+|+|- T Consensus 157 ~~I~~~~~ppna~~Tia~rKg-------------------------------~~kPLidTG~l~~SIty~V~~~ 199 (199) T protein:vir:80 157 MKIVEIQTPAKSAATLARNPR-------------------------------KNNPLIVTGKMKNSVTWKVMKS 199 (199) T ss_pred HHHhccCCCCCCHHHHHHhcC-------------------------------CCCchHHHHHHHhhcceeeeeC Confidence 99999999999999998 455 6999999999999999999998 No 6 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=100.00 E-value=5.3e-49 Score=285.27 Aligned_cols=137 Identities=28% Similarity=0.440 Sum_probs=122.6 Q ss_pred CCCCchhhHHHHHHHHHHhhcCCEEEEEecCCCCCCC------------------CccHHHHHHHhhcCccccCCCCCCc Q lcl|NC_019527. 1 MATLTGGDKLAKILADIGGKAQGSVDVGFMSGATYPD------------------GTPVAQVAFWNEFGHGGRFPAPPRP 62 (190) Q Consensus 1 Ma~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~d------------------G~~vA~iA~~~EfG~~~~~~IP~RP 62 (190) |.-.+. .|++++++ |++++|+|||+++++||| |+|+|+||+|||||+ ++||||| T Consensus 1 m~v~~k--~L~~~~~~---l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~---~~IP~RP 72 (155) T protein:vir:78 1 MSVTRR--GLTLPKDR---YRSMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGT---SKLPARP 72 (155) T ss_pred CcchHH--HHHHHHHH---HhCCeeEEeecCCCCCCcccchhhhhhhhcccccccCCcHHHHHHhhhcCC---CCCCCcc Confidence 433332 26666554 578999999999999998 899999999999996 4699999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 63 FFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 63 Flr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) |||+||++++++|.+.+++++.++ .+++++|+.+|++++++||.+|+++. |||||+||++|+ T Consensus 73 Flr~t~~~~~~~~~~~l~~~~~~~-~~~~~~L~~~G~~~~~~Ik~~I~~~~-~pna~~Ti~~Kg---------------- 134 (155) T protein:vir:78 73 FMEKTITDRSAEWIKGLTVMMTMG-YDAEVAMGQIGQAMKDDIKTTISEWP-ADNSADWAGKKG---------------- 134 (155) T ss_pred hhhHHHHHHHHHHHHHHHHHHHcC-CCHHHHHHHHHHHHHHHHHHHHhcCC-CCCcHHHHHhcC---------------- Confidence 999999999999999999988654 79999999999999999999999996 999999999988 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeeec Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDG 178 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~ 178 (190) +|+||||||+|++||||+|.. T Consensus 135 ---------------~~kPLidTG~l~~SIty~V~~ 155 (155) T protein:vir:78 135 ---------------FNHGLIWTSHLLNSVEQEIVK 155 (155) T ss_pred ---------------CCCchhHHHHHHHhhhhhccC Confidence 699999999999999999998 No 7 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=100.00 E-value=7.1e-49 Score=284.54 Aligned_cols=137 Identities=28% Similarity=0.439 Sum_probs=122.4 Q ss_pred CCCCchhhHHHHHHHHHHhhcCCEEEEEecCCCCCCC------------------CccHHHHHHHhhcCccccCCCCCCc Q lcl|NC_019527. 1 MATLTGGDKLAKILADIGGKAQGSVDVGFMSGATYPD------------------GTPVAQVAFWNEFGHGGRFPAPPRP 62 (190) Q Consensus 1 Ma~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~d------------------G~~vA~iA~~~EfG~~~~~~IP~RP 62 (190) |.-.+. .|++++++ |++++|+|||+++++||| |+|+|+||+|||||+ ++||||| T Consensus 1 m~v~~k--~L~~~~~~---l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~---~~IP~RP 72 (155) T protein:vir:10 1 MSVTRR--GLTLPKDR---YRSMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGT---SKLPARP 72 (155) T ss_pred CcchHH--HHHHHHHH---HhCCeeEEeecCCCCCccccchhhhhhhhcccccccCCcHHHHHHHHhcCC---CCCCCcc Confidence 433332 26666554 578999999999999998 899999999999996 4699999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 63 FFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 63 Flr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) |||+||++++++|.+.+++++.++ .+++++|+.+|+.++++||.+|+++. |||||+||++|+ T Consensus 73 Flr~t~~~~~~~~~~~l~~~~~~~-~~~~~~L~~lG~~~~~~Ik~~I~~~~-~pna~~Ti~~KG---------------- 134 (155) T protein:vir:10 73 FMEKTIADRSAEWIKGLTVMMTMG-YDAEVAMGQIGQAMKDDIKTTISEWP-ADNSADWAGKKG---------------- 134 (155) T ss_pred hhHHHHHHHHHHHHHHHHHHHHcC-CCHHHHHHHHHHHHHHHHHHHHhcCC-CCCcHHHHHhcC---------------- Confidence 999999999999999999988654 79999999999999999999999996 999999999988 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeeec Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDG 178 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~ 178 (190) +|+||||||+|++||||+|.. T Consensus 135 ---------------~~kPLidTG~l~~SIty~Vv~ 155 (155) T protein:vir:10 135 ---------------FNHGLIWTSHLLNSVEQEIVK 155 (155) T ss_pred ---------------CCCchhHHHHHHHhhhhhccC Confidence 699999999999999999988 No 8 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=100.00 E-value=4.7e-48 Score=280.06 Aligned_cols=150 Identities=25% Similarity=0.336 Sum_probs=128.7 Q ss_pred CCCCchhhHHHHHHHHHHhhcCCEEEEEecCCCCCCCC-----------------ccHHHHHHHhhcCccccCCCCCCch Q lcl|NC_019527. 1 MATLTGGDKLAKILADIGGKAQGSVDVGFMSGATYPDG-----------------TPVAQVAFWNEFGHGGRFPAPPRPF 63 (190) Q Consensus 1 Ma~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~dG-----------------~~vA~iA~~~EfG~~~~~~IP~RPF 63 (190) |.+++. ..++..++.+..|.+++|+|||+++++|+|| ++||+||+|||||+ ++|||||| T Consensus 1 ~~~~~~-~g~~~~~~~~~~l~~~~v~vG~l~~a~yp~G~~~~~~~~~~~~~~~~g~~va~Ia~~~E~G~---~~IP~RPF 76 (168) T protein:vir:94 1 MTTIAR-KGVKMPPHLEAQFQSGEVKAGVLSGSTYPQMTYTDQRTGKQIEDARGGMPVAVIAQALEYGH---GQNHPRPF 76 (168) T ss_pred Cccccc-hhhhhhHHHHHhhhccceeeeccccCcccccccchhhcccccccccccccHHHHHHHHhcCC---CCCCCchh Confidence 887754 3377778888888999999999999988765 49999999999996 46999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhh Q lcl|NC_019527. 64 FRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAA 143 (190) Q Consensus 64 lr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~ 143 (190) ||+||++++++|.+.+++++.+ .+|++++|+.+|+.++++||.+|+++. |||||+||++|+ T Consensus 77 lr~t~~~~~~~~~~~~~~~~~~-~~~~~~~L~~lG~~~~~~Ik~~I~~~~-ppna~sTi~~KG----------------- 137 (168) T protein:vir:94 77 MQQTYAAQYRAWSRDLTLTLKA-GAAADTALRTVGQRMAEDIQDTIRNWP-ADNSPEWAAIKG----------------- 137 (168) T ss_pred hHHHHHHHHHHHHHHHHHHHhc-CCCHHHHHHHHHHHHHHHHHHHhhcCC-CCccHHHHHhcC----------------- Confidence 9999999999999999987754 689999999999999999999999995 999999999988 Q ss_pred HHHhhhcccccccCccCchHHHHHHHhhcceeee-cCceeEEeec Q lcl|NC_019527. 144 QELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVD-GGATIKVKVN 187 (190) Q Consensus 144 ~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~-~g~~~~~~~~ 187 (190) |++||||||+|++||||+|. ||.-=.---. T Consensus 138 --------------~~~PLiDTG~l~~SIty~Vv~d~~~~~~~~~ 168 (168) T protein:vir:94 138 --------------FNAGLRQTGVLLNAIDSAVIIDGEHGEAPRE 168 (168) T ss_pred --------------CCCchhHHHHHHhhcceeeeecCCCCCCCCC Confidence 69999999999999999554 7642110000 No 9 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=100.00 E-value=8.6e-48 Score=278.62 Aligned_cols=137 Identities=29% Similarity=0.463 Sum_probs=121.7 Q ss_pred CCCCchhhHHHHHHHHHHhhcCCEEEEEecCCCCCCC------------------CccHHHHHHHhhcCccccCCCCCCc Q lcl|NC_019527. 1 MATLTGGDKLAKILADIGGKAQGSVDVGFMSGATYPD------------------GTPVAQVAFWNEFGHGGRFPAPPRP 62 (190) Q Consensus 1 Ma~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~d------------------G~~vA~iA~~~EfG~~~~~~IP~RP 62 (190) |...+.+ |+++++ .|++++|+|||+++++||| |+|+|+||+|||||+ ++||||| T Consensus 1 m~~~r~~--l~~~~~---~l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~G~pva~ia~~~e~G~---~~IP~RP 72 (155) T protein:vir:77 1 MSVTRRG--LTLPKD---RYRSMSVKAGVLAGATYPDESGKKLADGSILKKDPRAGLPVAMIAMALNYGT---SKLPARP 72 (155) T ss_pred CcchHHH--HHHHHH---HHhcCceEEeecCCCCCccccchhhhhhhhccccccccccHhhhhhhhhcCC---CCCCCCc Confidence 5544432 555444 4578999999999999998 799999999999996 4699999 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 63 FFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 63 Flr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) |||+||++++++|.+.+.+++.+ .++++++|+.+|+.++++||++|+++.+| |+|+||++|+ T Consensus 73 Flr~t~~~~~~~~~~~l~~~~~~-~~~~~~~L~~lG~~~~~~Iq~~I~~~~~p-~~~~Ti~~KG---------------- 134 (155) T protein:vir:77 73 FMEKTIADRSAEWIKGLTVMMTM-GYDAEVAMGQIGQAMKDDIKTTISEWPAD-NNADWAGKKG---------------- 134 (155) T ss_pred hhhHHHHHHHHHHHHHHHHHHHc-cCcHHHHHHHHHHHHHHHHHHHHhcCCCC-CChHHHHhcC---------------- Confidence 99999999999999999998865 47999999999999999999999999987 5679999988 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeeec Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDG 178 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~ 178 (190) ||+||||||+|++||||+|.. T Consensus 135 ---------------~d~PLidTG~l~~SIty~Vv~ 155 (155) T protein:vir:77 135 ---------------FNHGLIWTSHLLNSIEQEIVK 155 (155) T ss_pred ---------------CCCchhHHHHHHHhhhhhccC Confidence 699999999999999999988 No 10 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=100.00 E-value=2.6e-47 Score=276.02 Aligned_cols=137 Identities=28% Similarity=0.449 Sum_probs=121.5 Q ss_pred CCCchhhHHHHHHHHHHhhcCCEEEEEecCCCCCCC------------------CccHHHHHHHhhcCccccCCCCCCch Q lcl|NC_019527. 2 ATLTGGDKLAKILADIGGKAQGSVDVGFMSGATYPD------------------GTPVAQVAFWNEFGHGGRFPAPPRPF 63 (190) Q Consensus 2 a~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~d------------------G~~vA~iA~~~EfG~~~~~~IP~RPF 63 (190) ++|.. .+|++++++ |++++|+||||++++|+| |+|+|+||+|||||+ ++|||||| T Consensus 1 m~v~r-~~L~~~~~~---l~~~~V~VGi~~~a~y~d~~g~~~~~g~~~~~~~~~G~pva~ia~~~e~G~---~~IP~RPF 73 (155) T protein:vir:10 1 MSVTR-RGLTLPKDR---YKSMSVKAGVLAGATYPDESGKKLADGTILKKDPRAGLPVAMIAMALNYGT---SKLPARPF 73 (155) T ss_pred CcchH-HHHHHHHHH---hhCCeeEEeecCCCCCCccccchhhhhhhhccccccCcchhhhhhhhhcCC---CCCCCcch Confidence 44433 346666554 466899999999999998 899999999999996 36999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhh Q lcl|NC_019527. 64 FRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAA 143 (190) Q Consensus 64 lr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~ 143 (190) ||+||++++++|.+.+++++.+ .++++++|+.+|+.++++||++|+++.+| |+|+||++|+ T Consensus 74 lr~t~~~~~~~~~~~l~~~~~~-~~~~~~~L~~~G~~~~~~Ik~~I~~~~~p-~~~~Ti~~KG----------------- 134 (155) T protein:vir:10 74 MEKTIADRSAEWIKGLTVMMTM-GYDAEVAMGQIGQAMKDDIKTTISEWPAD-NNADWAGKKG----------------- 134 (155) T ss_pred hHHHHHHHHHHHHHHHHHHHHc-CCCHHHHHHHHHHHHHHHHHHHHhcCCCC-CChHHHHhcC----------------- Confidence 9999999999999999998865 47999999999999999999999999986 5789999988 Q ss_pred HHHhhhcccccccCccCchHHHHHHHhhcceeeec Q lcl|NC_019527. 144 QELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDG 178 (190) Q Consensus 144 ~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~ 178 (190) +|+||||||+|++||||+|.. T Consensus 135 --------------~~~PLidTG~l~~Sity~Vv~ 155 (155) T protein:vir:10 135 --------------FNHGLIWTSHLLNSIEQEIVK 155 (155) T ss_pred --------------CCCchHHHHHHHHhhhhhccC Confidence 699999999999999999988 No 11 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=100.00 E-value=1.2e-43 Score=255.94 Aligned_cols=147 Identities=14% Similarity=0.203 Sum_probs=117.6 Q ss_pred CCCCchhhHHHHHHHHHHhhcCCEEEEEecCCC-CCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHH----HHHH Q lcl|NC_019527. 1 MATLTGGDKLAKILADIGGKAQGSVDVGFMSGA-TYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEK----SSEW 75 (190) Q Consensus 1 Ma~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~-~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~----~~~~ 75 (190) |.+--.+..++++.+.|++|.++.|+|||++++ .|+||+|+++||+|||||. .+||+|||||++|+.. ...+ T Consensus 1 ~~~~~~~~G~~~L~~~~k~l~~~~V~VGi~~d~g~~~dG~sv~~vA~~~EfG~---~~iPaRPf~R~tfe~~~~~~~~~~ 77 (160) T protein:vir:95 1 MVKRVIHPARAKLVGAMKNLQTANAQVGYFQEQGQHSSGFSYPALMYLQEVIG---VPSASGKVYRRLFEITMMLNKQTL 77 (160) T ss_pred CceeechHhHHHHHHHHHHHhCCeeEEeeccccccCCCCccHHHHHhhhhcCc---ccCCCcchhHHHHHHHHHHHHHHH Confidence 876555566777777788889999999999988 6889999999999999996 3699999999999843 2333 Q ss_pred HHHHHH----HHhhcCCcHHHHHHHHHHHHHHHHHHHHhcc----CCCCCChHHHHHhccccccccchhhhhhhhhHHHh Q lcl|NC_019527. 76 PKRLGD----AIKHYDGDGRKALASMGEMIGGDLGSSIIST----NEPALSKTTLMLRSIYGNNPQEIRARDVLAAQELV 147 (190) Q Consensus 76 ~~~l~~----~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~----~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~~~~~ 147 (190) .+++.. .+..|... +.+.+|+.++++|+.+|++. .||||||+||++|| T Consensus 78 ~~~~~~~i~~~~~~g~~~---~~~~LG~~~~~~ik~~I~~~~~p~~w~pNap~Ti~~Kg--------------------- 133 (160) T protein:vir:95 78 LEQTKKNLYKQLSSLNTD---PSNTLEAFAKNAQKAIKRGFGNSAILPPNAPSTVKKKG--------------------- 133 (160) T ss_pred HHHHHHHHHHHHhhcchh---HHHHHHHHHHHHHHHHHhhcCCccCCCCCcHHHHHhcC--------------------- Confidence 333322 22334322 44569999999999999984 47899999999999 Q ss_pred hhcccccccCccCchHHHHHHHhhcceeeecCceeEE Q lcl|NC_019527. 148 EEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKV 184 (190) Q Consensus 148 ~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~ 184 (190) ||+||||||+|++||+|+|.+.+-++- T Consensus 134 ----------s~~PLiDTg~l~~Si~y~v~~~~~~~~ 160 (160) T protein:vir:95 134 ----------FNAPLVETGDLRDNLAYKISTKKGIKK 160 (160) T ss_pred ----------CCCcchhhHHHhhhhhheeecccccCC Confidence 799999999999999999988766554 No 12 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.79 E-value=1.6e-11 Score=79.82 Aligned_cols=97 Identities=20% Similarity=0.214 Sum_probs=70.5 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhcc------CCCCCChHHHHHhccccccccchh Q lcl|NC_019527. 63 FFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIIST------NEPALSKTTLMLRSIYGNNPQEIR 136 (190) Q Consensus 63 Flr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~------~~pPnap~Ti~~K~~~~~~~~~~~ 136 (190) -+.-.+.-.-.++.+.|...+... .+.+.+|..||..+...+++.|++. .|+|++|+|+++|.+. T Consensus 1 M~~i~i~~d~~~~~~~L~~l~~~~-~~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~-------- 71 (190) T protein:vir:99 1 MAGITLEWDGRRALDVLNAGSAAL-GDPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKN-------- 71 (190) T ss_pred CceeEEEecHHHHHHHHHHHHHHh-hhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcC-------- Confidence 222112222345556666655444 3778999999999999999999986 4889999999887643 Q ss_pred hhhhhhhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEeec--cCC Q lcl|NC_019527. 137 ARDVLAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKVN--YGR 190 (190) Q Consensus 137 ~~~~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~~--~~~ 190 (190) ..+||+|||.|++||+|.+. +..+.|+.| |.. T Consensus 72 ---------------------~~~~L~~tg~L~~Si~~~~~-~~~v~vGtn~~yA~ 105 (190) T protein:vir:99 72 ---------------------RDKILTLDGHLRNLLRYQLD-GSELLFGSDRPYAA 105 (190) T ss_pred ---------------------CCccceecHHHHHHHhheec-CcEEEEecCcchhh Confidence 36899999999999999985 456666644 433 No 13 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=98.76 E-value=3.7e-11 Score=77.80 Aligned_cols=115 Identities=14% Similarity=0.049 Sum_probs=77.1 Q ss_pred hhHH--HHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhcc---CCCCCChHHHHHhccccccccchhhh Q lcl|NC_019527. 64 FRNM--VNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIIST---NEPALSKTTLMLRSIYGNNPQEIRAR 138 (190) Q Consensus 64 lr~~--~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~---~~pPnap~Ti~~K~~~~~~~~~~~~~ 138 (190) |-.. |.-.-+++.+.|.+.+... .+...+|..||..+...+++.|.+. +|+|++|+|+++|.+.++.......+ T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~-~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~ 79 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQAG-HQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGEL 79 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHHh-cCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccc Confidence 2221 1112245667777765544 3788999999999999999999986 48999999999887655432211111 Q ss_pred hhhhhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEeec--cCC Q lcl|NC_019527. 139 DVLAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKVN--YGR 190 (190) Q Consensus 139 ~~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~~--~~~ 190 (190) ... ..++ ..+.+||+|||.|++||+|.+ +...+.|+-| |++ T Consensus 80 ~~~----~~~~------~~~~~~L~~tG~L~~Si~~~~-~~~~v~vGtn~~YAa 122 (175) T protein:vir:79 80 TAA----ASRR------KAGLMILQDSGQMAASTATDS-GEDYSVIGSNKEYAA 122 (175) T ss_pred hhh----Hhhh------ccCCCcceechhhhhhhhhee-cCCEEEEecCcchhh Confidence 111 1111 125899999999999999998 4667777655 444 No 14 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=98.75 E-value=2.3e-11 Score=78.89 Aligned_cols=101 Identities=17% Similarity=0.157 Sum_probs=72.8 Q ss_pred CCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhcc--CCCCCChHHHHHhcccccccc Q lcl|NC_019527. 56 FPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIIST--NEPALSKTTLMLRSIYGNNPQ 133 (190) Q Consensus 56 ~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~--~~pPnap~Ti~~K~~~~~~~~ 133 (190) +. .++.-.+ +..++.+.|.+..... .+...+|..||..+...+++.|... .|+||||+|+++|.++++. T Consensus 1 Ms----~~i~i~~--~~~~~~~~L~~l~~~~-~~~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~~-- 71 (155) T protein:vir:10 1 MA----NRIELEL--VDREVQERLAALYAAV-TDTLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGRG-- 71 (155) T ss_pred CC----ceEEEEe--chHHHHHHHHHHHHHh-hhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccCC-- Confidence 10 1122222 2345556666544433 3788999999999999999999764 6999999999887754432 Q ss_pred chhhhhhhhhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEeec--cCC Q lcl|NC_019527. 134 EIRARDVLAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKVN--YGR 190 (190) Q Consensus 134 ~~~~~~~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~~--~~~ 190 (190) +.++|+|||.|++||+|.+ ++..+.|+.| |.. T Consensus 72 ------------------------~~~~L~~tG~L~~Si~~~~-~~~~v~vGtn~~YA~ 105 (155) T protein:vir:10 72 ------------------------AHPILQVTNALARSITTRA-DRDQAQIGSNLSYAA 105 (155) T ss_pred ------------------------CCCccccchhhhhhhhcee-cCCEEEEecCcchhh Confidence 4689999999999999998 5677888765 433 No 15 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=98.73 E-value=4.8e-11 Score=77.15 Aligned_cols=99 Identities=17% Similarity=0.172 Sum_probs=72.9 Q ss_pred hhHH--HHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhcc--CCCCCChHHHHHhccccccccchhhhh Q lcl|NC_019527. 64 FRNM--VNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIIST--NEPALSKTTLMLRSIYGNNPQEIRARD 139 (190) Q Consensus 64 lr~~--~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~--~~pPnap~Ti~~K~~~~~~~~~~~~~~ 139 (190) |-.. +.-+..++.+.|.+..... .+...+|..||..+...+++.|... .|+|++|+|+++|.+++.. T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~-~d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~-------- 71 (155) T protein:vir:99 1 MTTRIDVELDDQEVRQRLALLMRSV-TDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRG-------- 71 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHh-hhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCC-------- Confidence 2211 1112245666666655444 3788999999999999999999764 5899999999998764432 Q ss_pred hhhhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEeec--cCC Q lcl|NC_019527. 140 VLAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKVN--YGR 190 (190) Q Consensus 140 ~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~~--~~~ 190 (190) ..++|+|||.|++||+|.+ ++.++.|+.| |.. T Consensus 72 ------------------~~~iL~~tg~L~~Si~~~~-~~~~v~vGtn~~YA~ 105 (155) T protein:vir:99 72 ------------------PHPILQVTNALARSVTTWA-DRNEAGIGSNLVYAA 105 (155) T ss_pred ------------------CCCcchhchhhhhhhhcee-cCCEEEEecCccchh Confidence 3679999999999999998 5778888855 444 No 16 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=98.72 E-value=4.6e-11 Score=77.26 Aligned_cols=99 Identities=17% Similarity=0.193 Sum_probs=72.7 Q ss_pred hhHHH--HHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhcc--CCCCCChHHHHHhccccccccchhhhh Q lcl|NC_019527. 64 FRNMV--NEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIIST--NEPALSKTTLMLRSIYGNNPQEIRARD 139 (190) Q Consensus 64 lr~~~--~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~--~~pPnap~Ti~~K~~~~~~~~~~~~~~ 139 (190) |-..+ .-..+++.+.|.+..... .+...+|..||..+...+++.|... .|+|+||+|+++|.++++. T Consensus 1 M~~~i~i~~d~~~~~~~L~~l~~~~-~d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~-------- 71 (155) T protein:vir:79 1 MTTRIDVELDDQEVRQRLAVLMRSV-TDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRG-------- 71 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHh-hhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCC-------- Confidence 22111 112245556666655444 3788999999999999999999764 6999999999998765432 Q ss_pred hhhhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEeec--cCC Q lcl|NC_019527. 140 VLAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKVN--YGR 190 (190) Q Consensus 140 ~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~~--~~~ 190 (190) ..++|+|||.|++||+|.+. +..+.|+.| |.. T Consensus 72 ------------------~~~iL~~tG~L~~Si~~~~~-~~~v~vGt~~~YA~ 105 (155) T protein:vir:79 72 ------------------PHPILQVTNALARSVTTWAD-RNEAGIGSNLVYAA 105 (155) T ss_pred ------------------CCCccccchhhhhhhhceec-CCEEEEecCchhhh Confidence 36899999999999999984 677888755 443 No 17 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=98.67 E-value=3.8e-11 Score=77.70 Aligned_cols=88 Identities=14% Similarity=0.171 Sum_probs=63.1 Q ss_pred HHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhcc------CCCCCChHHHHHhccccccccchhhhhhh Q lcl|NC_019527. 68 VNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIIST------NEPALSKTTLMLRSIYGNNPQEIRARDVL 141 (190) Q Consensus 68 ~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~------~~pPnap~Ti~~K~~~~~~~~~~~~~~~~ 141 (190) +-+....+.+.+.+.. . +....|..+|......+++.+.+. .|+||||+|+++|+ T Consensus 1 ~i~~~~~i~~~l~~l~-~---~~~~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~--------------- 61 (145) T protein:vir:31 1 MVEDENNIPEAREAIQ-D---GLTDGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKG--------------- 61 (145) T ss_pred CcccHHHHHHHHHHHH-H---HHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhc--------------- Confidence 3333344444444432 1 334568889999999999999873 48999999999887 Q ss_pred hhHHHhhhcccccccCccCchHHHHHHHhhcceeee---cCceeEEeec--cCC Q lcl|NC_019527. 142 AAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVD---GGATIKVKVN--YGR 190 (190) Q Consensus 142 ~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~---~g~~~~~~~~--~~~ 190 (190) +.+||+|||.|++||+|.|. ++.+..|+-| |++ T Consensus 62 ----------------~~~~L~~tG~L~~Si~~~~~~~~~~~~a~vGtn~~YA~ 99 (145) T protein:vir:31 62 ----------------SDTPLIDNSRLLTDINAASMMDRANRMAVIGTNLDYAE 99 (145) T ss_pred ----------------CCCCCccCHHHHHHHHHHhhhcccCceeEecCCchhhh Confidence 47899999999999999873 5555556543 555 No 18 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=98.66 E-value=9.1e-11 Score=75.64 Aligned_cols=98 Identities=16% Similarity=0.054 Sum_probs=69.9 Q ss_pred hhHH--HHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhcc-------CCCCCChHHHHHhccccccccc Q lcl|NC_019527. 64 FRNM--VNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIIST-------NEPALSKTTLMLRSIYGNNPQE 134 (190) Q Consensus 64 lr~~--~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~-------~~pPnap~Ti~~K~~~~~~~~~ 134 (190) |... +.-..+++.+.|.+.... .+...+|..||......+++.|.+. .|+|++|+|+++|.+.+.. T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~~~--~~~~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~--- 75 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELGTV--TRDRAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFV--- 75 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHHhh--hccHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCC--- Confidence 3322 222334555556553322 2445799999999999999999864 4889999999988754322 Q ss_pred hhhhhhhhhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEeec--cCC Q lcl|NC_019527. 135 IRARDVLAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKVN--YGR 190 (190) Q Consensus 135 ~~~~~~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~~--~~~ 190 (190) ..+||+|||.|++||+|.+ ++..+.|+.| |.+ T Consensus 76 -----------------------~~~~L~~tg~L~~Si~~~~-~~~~v~vGt~~~yA~ 109 (156) T protein:vir:19 76 -----------------------PGSILTLHGDLARSITTDY-GQDYALIGSPKIYAA 109 (156) T ss_pred -----------------------CCcchhhhHHHHHHhhhee-cCCEEEEecchhhhH Confidence 4789999999999999998 4667777765 333 No 19 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=98.46 E-value=7.5e-10 Score=70.60 Aligned_cols=115 Identities=14% Similarity=0.062 Sum_probs=75.8 Q ss_pred hhHH--HHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhcc---CCCCCChHHHHHhccccccccchhhh Q lcl|NC_019527. 64 FRNM--VNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIIST---NEPALSKTTLMLRSIYGNNPQEIRAR 138 (190) Q Consensus 64 lr~~--~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~---~~pPnap~Ti~~K~~~~~~~~~~~~~ 138 (190) |--. +.-..+++.+.|.+....+ .+...+|..||......+++.|.+. +|+|++|+|++.|.++++.......+ T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~~-~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~~~ 79 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQAG-HQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGEL 79 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHHh-ccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhhhh Confidence 2211 1112355666676655444 3788999999999999999999886 58999999999886554322111111 Q ss_pred hhhhhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEeec--cCC Q lcl|NC_019527. 139 DVLAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKVN--YGR 190 (190) Q Consensus 139 ~~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~~--~~~ 190 (190) .... . ....+.++|+|||.|++||+|.+ ++..+.|+-| |.+ T Consensus 80 ~~~~-----~-----~~~~~~~~L~~tG~L~~Si~~~~-~~~~v~vGtn~~YAa 122 (175) T protein:vir:10 80 TAAA-----S-----RRKAGLMILQDSGQMAASVSTDH-DDNSAVIGSNKEYAA 122 (175) T ss_pred hhhh-----h-----hhccCCCcceechhhhhhhheee-cCCEEEEecChhhhh Confidence 1111 0 01125789999999999999999 5666767654 544 No 20 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=98.20 E-value=3e-09 Score=67.33 Aligned_cols=98 Identities=19% Similarity=0.286 Sum_probs=55.8 Q ss_pred CC-----CCchhhHHHHHHHHHHhh-cC---------------------------------------------------- Q lcl|NC_019527. 1 MA-----TLTGGDKLAKILADIGGK-AQ---------------------------------------------------- 22 (190) Q Consensus 1 Ma-----~i~~~d~l~~il~~l~~l-~~---------------------------------------------------- 22 (190) || +|+|-|.|.+-|++|..- .+ T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~ 80 (164) T protein:vir:43 1 MADTVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGD 80 (164) T ss_pred CCcceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccccc Confidence 76 234555554444443211 00 Q ss_pred CEEEEEecCCCCCC--------CCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHH Q lcl|NC_019527. 23 GSVDVGFMSGATYP--------DGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKAL 94 (190) Q Consensus 23 ~~V~VGi~~~~~~~--------dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL 94 (190) ....||+..+.... .+-.-+.++.++|||+. ..||||||||++++++++..+.+...|.. -.+++| T Consensus 81 ~~~~vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT~---km~a~PFlrPA~~~~k~~~~~~~~~~l~~---~i~ka~ 154 (164) T protein:vir:43 81 LGFRIGVLHGAVLPKKGERSDKTANAPTPHWRLLEFGTE---DMRAQPFMRSALADNIAEVTSTFVSEYEK---GIDRAI 154 (164) T ss_pred eeEEecccccccccccccccccCCCCCcceEEEeecCCC---CCCCCcchhhhHHHhHHHHHHHHHHHHHH---HHHHHH Confidence 11223332221100 00123567889999985 48999999999999999988887776644 245566 Q ss_pred HHHHHHHHHH Q lcl|NC_019527. 95 ASMGEMIGGD 104 (190) Q Consensus 95 ~~iG~~a~~~ 104 (190) .+.+..++.. T Consensus 155 ~k~~~~~~~~ 164 (164) T protein:vir:43 155 KRAAKKAAQG 164 (164) T ss_pred HHHHhhhccC Confidence 6655544443 No 21 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=97.99 E-value=5.9e-09 Score=65.72 Aligned_cols=98 Identities=18% Similarity=0.171 Sum_probs=48.7 Q ss_pred CC-CC----chhhHHHHHHHHHH-hhcC---------------------------------------------------- Q lcl|NC_019527. 1 MA-TL----TGGDKLAKILADIG-GKAQ---------------------------------------------------- 22 (190) Q Consensus 1 Ma-~i----~~~d~l~~il~~l~-~l~~---------------------------------------------------- 22 (190) || +| +|-|.|.+-|++|. +..+ T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~ 80 (179) T protein:vir:18 1 MADSVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGD 80 (179) T ss_pred CCceEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccccc Confidence 77 34 36666555444442 1100 Q ss_pred CEEEEEecCCCCC---------------------C--CCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHH Q lcl|NC_019527. 23 GSVDVGFMSGATY---------------------P--DGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRL 79 (190) Q Consensus 23 ~~V~VGi~~~~~~---------------------~--dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l 79 (190) ..+.||+..+... + .+-..+.++.+.|||+. +.||||||||++++++++..+.+ T Consensus 81 ~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~---kmpa~PFlrPA~~~~~~~a~~~i 157 (179) T protein:vir:18 81 LAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTE---HTSARPILRPAMNGVDNDVINVF 157 (179) T ss_pred eeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCC---CCCCCccchhhHHhhHHHHHHHH Confidence 0134444322110 0 01123567778899985 48999999999999998877666 Q ss_pred HHHHhhcCCcHHHHHHHHHHHHHHH Q lcl|NC_019527. 80 GDAIKHYDGDGRKALASMGEMIGGD 104 (190) Q Consensus 80 ~~~i~~g~~~~~~aL~~iG~~a~~~ 104 (190) ...|... .+++|.+.+...... T Consensus 158 ~~~l~~~---i~k~lk~~~~~~~~~ 179 (179) T protein:vir:18 158 STEMGKA---IDRAIRLAMKKGTTA 179 (179) T ss_pred HHHHHHH---HHHHHHhhcccCCCC Confidence 5544221 111121111110000 No 22 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=97.95 E-value=3.3e-08 Score=61.62 Aligned_cols=83 Identities=28% Similarity=0.392 Sum_probs=49.1 Q ss_pred CCC-----CchhhHHHHHHHHHH-----------------------hhc------------------------------- Q lcl|NC_019527. 1 MAT-----LTGGDKLAKILADIG-----------------------GKA------------------------------- 21 (190) Q Consensus 1 Ma~-----i~~~d~l~~il~~l~-----------------------~l~------------------------------- 21 (190) ||+ |+|-|.|.+-|++|. .+. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 663 556555544443321 111 Q ss_pred ----CCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHH Q lcl|NC_019527. 22 ----QGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKAL 94 (190) Q Consensus 22 ----~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL 94 (190) ...+.||+-.. + -.-+.++.+.|||+. ..||+|||+|+++.+++++.+.+.+.+... .+.+| T Consensus 81 ~~~g~~~~~vg~~~~----~-~~~~~y~~f~E~GT~---~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~---l~ka~ 146 (146) T protein:vir:10 81 LEGGIKTVKIGLNKA----D-RSPWFYLKFHEWGTS---KMPAHPFIEPGFNASKAEAVRAMTDILKNE---MRLDL 146 (146) T ss_pred ccccceeEEeeeccC----C-CCCcceeeeeccCCC---CCCCCcchhHHHHHhHHHHHHHHHHHHHHH---HhhcC Confidence 01222333111 1 123678889999985 479999999999999999888887766432 23333 No 23 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=97.95 E-value=3.3e-08 Score=61.62 Aligned_cols=83 Identities=28% Similarity=0.392 Sum_probs=49.1 Q ss_pred CCC-----CchhhHHHHHHHHHH-----------------------hhc------------------------------- Q lcl|NC_019527. 1 MAT-----LTGGDKLAKILADIG-----------------------GKA------------------------------- 21 (190) Q Consensus 1 Ma~-----i~~~d~l~~il~~l~-----------------------~l~------------------------------- 21 (190) ||+ |+|-|.|.+-|++|. .+. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 663 556555544443321 111 Q ss_pred ----CCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHH Q lcl|NC_019527. 22 ----QGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKAL 94 (190) Q Consensus 22 ----~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL 94 (190) ...+.||+-.. + -.-+.++.+.|||+. ..||+|||+|+++.+++++.+.+.+.+... .+.+| T Consensus 81 ~~~g~~~~~vg~~~~----~-~~~~~y~~f~E~GT~---~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~---l~ka~ 146 (146) T protein:vir:10 81 LEGGIKTVKIGLNKA----D-RSPWFYLKFHEWGTS---KMPAHPFIEPGFNASKAEAVRAMTDILKNE---MRLDL 146 (146) T ss_pred ccccceeEEeeeccC----C-CCCcceeeeeccCCC---CCCCCcchhHHHHHhHHHHHHHHHHHHHHH---HhhcC Confidence 01222333111 1 123678889999985 479999999999999999888887766432 23333 No 24 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=97.95 E-value=3.3e-08 Score=61.62 Aligned_cols=83 Identities=28% Similarity=0.392 Sum_probs=49.1 Q ss_pred CCC-----CchhhHHHHHHHHHH-----------------------hhc------------------------------- Q lcl|NC_019527. 1 MAT-----LTGGDKLAKILADIG-----------------------GKA------------------------------- 21 (190) Q Consensus 1 Ma~-----i~~~d~l~~il~~l~-----------------------~l~------------------------------- 21 (190) ||+ |+|-|.|.+-|++|. .+. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 663 556555544443321 111 Q ss_pred ----CCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHH Q lcl|NC_019527. 22 ----QGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKAL 94 (190) Q Consensus 22 ----~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL 94 (190) ...+.||+-.. + -.-+.++.+.|||+. ..||+|||+|+++.+++++.+.+.+.+... .+.+| T Consensus 81 ~~~g~~~~~vg~~~~----~-~~~~~y~~f~E~GT~---~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~---l~ka~ 146 (146) T protein:vir:10 81 LEGGIKTVKIGLNKA----D-RSPWFYLKFHEWGTS---KMPAHPFIEPGFNASKAEAVRAMTDILKNE---MRLDL 146 (146) T ss_pred ccccceeEEeeeccC----C-CCCcceeeeeccCCC---CCCCCcchhHHHHHhHHHHHHHHHHHHHHH---HhhcC Confidence 01222333111 1 123678889999985 479999999999999999888887766432 23333 No 25 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=97.95 E-value=3.3e-08 Score=61.62 Aligned_cols=83 Identities=28% Similarity=0.392 Sum_probs=49.1 Q ss_pred CCC-----CchhhHHHHHHHHHH-----------------------hhc------------------------------- Q lcl|NC_019527. 1 MAT-----LTGGDKLAKILADIG-----------------------GKA------------------------------- 21 (190) Q Consensus 1 Ma~-----i~~~d~l~~il~~l~-----------------------~l~------------------------------- 21 (190) ||+ |+|-|.|.+-|++|. .+. T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 663 556555544443321 111 Q ss_pred ----CCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHH Q lcl|NC_019527. 22 ----QGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKAL 94 (190) Q Consensus 22 ----~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL 94 (190) ...+.||+-.. + -.-+.++.+.|||+. ..||+|||+|+++.+++++.+.+.+.+... .+.+| T Consensus 81 ~~~g~~~~~vg~~~~----~-~~~~~y~~f~E~GT~---~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~---l~ka~ 146 (146) T protein:vir:10 81 LEGGIKTVKIGLNKA----D-RSPWFYLKFHEWGTS---KMPAHPFIEPGFNASKAEAVRAMTDILKNE---MRLDL 146 (146) T ss_pred ccccceeEEeeeccC----C-CCCcceeeeeccCCC---CCCCCcchhHHHHHhHHHHHHHHHHHHHHH---HhhcC Confidence 01222333111 1 123678889999985 479999999999999999888887766432 23333 No 26 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=97.89 E-value=4.3e-08 Score=60.96 Aligned_cols=92 Identities=17% Similarity=0.213 Sum_probs=50.9 Q ss_pred CCCC--chhhHHHHHHHHHHhh-cC--------------------------------------------CEEEEEecCCC Q lcl|NC_019527. 1 MATL--TGGDKLAKILADIGGK-AQ--------------------------------------------GSVDVGFMSGA 33 (190) Q Consensus 1 Ma~i--~~~d~l~~il~~l~~l-~~--------------------------------------------~~V~VGi~~~~ 33 (190) ||+| +|-|.|.+-|+.|... .. ..+.||+..+. T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~~~~~~~~ 80 (140) T protein:vir:80 1 MSSIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceeeeeeeccc Confidence 8844 5666655444444211 10 01222222111 Q ss_pred CC-CCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHH Q lcl|NC_019527. 34 TY-PDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEM 100 (190) Q Consensus 34 ~~-~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~ 100 (190) .. -.+.+.+.++.+.|||+. ..||+|||+|+++.+++++.+.+...+... .+++|..- . T Consensus 81 ~~~~~~~~~~~y~~f~E~GT~---~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~---l~k~~~~~--~ 140 (140) T protein:vir:80 81 KGKADSPSNAFYWRFDEFGTQ---HMKAQPFMRPAFDASIGEAEGAIRTELARA---IDQALGGR--R 140 (140) T ss_pred ccccCCCCCcceeeeeccCCC---CCCCCcchhhhHHHHHHHHHHHHHHHHHHH---HHHHhhcc--C Confidence 10 112345788999999974 489999999999999999888877655321 11111100 0 No 27 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=97.85 E-value=6.9e-08 Score=59.84 Aligned_cols=92 Identities=14% Similarity=0.128 Sum_probs=49.3 Q ss_pred CCCC--chhhHHHHHHHHHHhh-c--------------------------------------------CCEEEEEecCCC Q lcl|NC_019527. 1 MATL--TGGDKLAKILADIGGK-A--------------------------------------------QGSVDVGFMSGA 33 (190) Q Consensus 1 Ma~i--~~~d~l~~il~~l~~l-~--------------------------------------------~~~V~VGi~~~~ 33 (190) ||+| +|-|.|.+-|+.|... . ...+.+|+..+. T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~~~~~~~~ 80 (140) T protein:vir:10 1 MSSVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDSPGIATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceecccccccccceeEEeecccc Confidence 8844 5656554444433211 0 011222322111 Q ss_pred C-CCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_019527. 34 T-YPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIIST 112 (190) Q Consensus 34 ~-~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~ 112 (190) . ...+.+.+.++.+.|||+. ..||+|||+|+++++++++.+.+.+.+... .+++ +..| T Consensus 81 ~~~~~~~~~~~y~~f~E~GT~---~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~---l~k~---------------~~~~ 139 (140) T protein:vir:10 81 KGKADSPNNAFYWRFVELGTQ---FMKAEPFMRPAFDASIAQAEGAIRTEIARA---IDQV---------------VGGG 139 (140) T ss_pred ccccCCCCcccccceeccCcC---CCCCCcchhhhHHHHHHHHHHHHHHHHHHH---HHHH---------------hhcC Confidence 1 0123356889999999974 479999999999999988877776654320 0000 0000 Q ss_pred C Q lcl|NC_019527. 113 N 113 (190) Q Consensus 113 ~ 113 (190) - T Consensus 140 ~ 140 (140) T protein:vir:10 140 L 140 (140) T ss_pred C Confidence 0 No 28 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=97.82 E-value=2.5e-08 Score=62.29 Aligned_cols=90 Identities=19% Similarity=0.221 Sum_probs=50.2 Q ss_pred CCCC----chhhHHHHHHHHHHh-hcCC------------------------------------------EEE--EEecC Q lcl|NC_019527. 1 MATL----TGGDKLAKILADIGG-KAQG------------------------------------------SVD--VGFMS 31 (190) Q Consensus 1 Ma~i----~~~d~l~~il~~l~~-l~~~------------------------------------------~V~--VGi~~ 31 (190) ||+| +|-|.|.+-|++|.. ...+ .+. |++.. T Consensus 1 mm~~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~~~ 80 (148) T protein:vir:93 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHIRG 80 (148) T ss_pred CcceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeeecc Confidence 6644 455655444444421 1110 011 11110 Q ss_pred CC--CC-------CCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHH Q lcl|NC_019527. 32 GA--TY-------PDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALAS 96 (190) Q Consensus 32 ~~--~~-------~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~ 96 (190) .. .. -.+...+.|+.+.|||+. ..||||||+|+++++++++.+.+.+.+.. ..+.+|.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~---~~pa~PFl~pA~~~~k~~~~~~~~~~~~~---~i~k~~~k 148 (148) T protein:vir:93 81 VNPDTGNSDNTMKADNPRNAFYWRFVEMGTV---NMPPHPFVRPAFDVRSEQAAQVAIARMNR---AIDEVLRR 148 (148) T ss_pred cccccccccceeecCCCCCcceeeeeccCCC---CCCCCcchhHHHHHhHHHHHHHHHHHHHH---HHHHHhcC Confidence 00 00 011234678889999974 48999999999999999988888776643 23344443 No 29 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=97.82 E-value=7.8e-08 Score=59.56 Aligned_cols=85 Identities=26% Similarity=0.339 Sum_probs=54.5 Q ss_pred CC-CCchhhHHHHHHHHHHhhcCCE---------------------EEEEecCCC--CCCCC-----ccHHHHHHHhhcC Q lcl|NC_019527. 1 MA-TLTGGDKLAKILADIGGKAQGS---------------------VDVGFMSGA--TYPDG-----TPVAQVAFWNEFG 51 (190) Q Consensus 1 Ma-~i~~~d~l~~il~~l~~l~~~~---------------------V~VGi~~~~--~~~dG-----~~vA~iA~~~EfG 51 (190) |+ ++.|-|.|.+-|+.+....... |.-|.+.+. ...+| .+.+.+|.+.||| T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~g~~~~V~~~~~Ya~yvE~G 80 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHITTSYPGMEAHIHGEAGYDGYQEYG 80 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhceeeecCceEEEeecCCCccceeecC Confidence 88 8888777666555443211111 111222111 01122 2457889999999 Q ss_pred ccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019527. 52 HGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDG 88 (190) Q Consensus 52 ~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~ 88 (190) +. ..|++|||+|+++.++.++.+.|.+.+..+-- T Consensus 81 T~---~~~aqPfl~pa~~~~~~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 81 TR---FQPGTPHFRPMMEQIQPQFQKDMTDVMKGAFK 114 (114) T ss_pred cc---ccCCCccchhhHHHHHHHHHHHHHHHHHhhcC Confidence 84 47999999999999999999988887765422 No 30 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=97.81 E-value=3.3e-07 Score=56.08 Aligned_cols=92 Identities=5% Similarity=0.034 Sum_probs=65.5 Q ss_pred HHHHHHHHHHHHHHHhhc-CCcHHHHHHHHHHHHHHHHHHHHhcc------CCCCCChHHHHHhccccccccchhhhhhh Q lcl|NC_019527. 69 NEKSSEWPKRLGDAIKHY-DGDGRKALASMGEMIGGDLGSSIIST------NEPALSKTTLMLRSIYGNNPQEIRARDVL 141 (190) Q Consensus 69 ~~~~~~~~~~l~~~i~~g-~~~~~~aL~~iG~~a~~~Iq~~I~~~------~~pPnap~Ti~~K~~~~~~~~~~~~~~~~ 141 (190) -+.-.++...|...+... ..+-+..|..||..+....++.|.+. .|+|+++.|+..|.+++ T Consensus 1 ~~~~~~l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~------------ 68 (150) T protein:vir:20 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRV------------ 68 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCC------------ Confidence 222344445555555432 12456789999999999999999986 69999999998877532 Q ss_pred hhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEeec------cCC Q lcl|NC_019527. 142 AAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKVN------YGR 190 (190) Q Consensus 142 ~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~~------~~~ 190 (190) .++|+++|.|..||+|.+. ....-|++. |++ T Consensus 69 -----------------~~~l~~~~~l~~sl~~~~~-~~~~~vg~~~Gs~~~yAa 105 (150) T protein:vir:20 69 -----------------KRKMFAKLITSRFLHIRAS-PEQASMEFYGGKSPKIAS 105 (150) T ss_pred -----------------Cccccchhhhhhhhheeec-CcEEEEEeeCCcchhhhh Confidence 5789999999999999886 334444333 333 No 31 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=97.78 E-value=8.2e-08 Score=59.45 Aligned_cols=80 Identities=18% Similarity=0.139 Sum_probs=47.7 Q ss_pred CCCchhhHHHHHHHHHH-----------------------hhcCC----EEEEEecCCC---CCCCC-----ccHHHHHH Q lcl|NC_019527. 2 ATLTGGDKLAKILADIG-----------------------GKAQG----SVDVGFMSGA---TYPDG-----TPVAQVAF 46 (190) Q Consensus 2 a~i~~~d~l~~il~~l~-----------------------~l~~~----~V~VGi~~~~---~~~dG-----~~vA~iA~ 46 (190) .++.|-|.|.+.|+++. .+... -|.-|-+.+. ...+| .+.+.+|. T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKKIGDLHYRVISTAHYSG 80 (115) T ss_pred CeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEeeCCCccch Confidence 33444444433333221 11110 1111222211 11122 24578999 Q ss_pred HhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 47 WNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIK 84 (190) Q Consensus 47 ~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~ 84 (190) +.|||+- ..|+||||+|+++.++.++.+.|+++|. T Consensus 81 ~vEfGT~---km~a~PFl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 81 FLEFGTR---YMEPAPFMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred heecccc---cCCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 9999984 4799999999999999999999999886 No 32 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=97.78 E-value=4.3e-08 Score=60.96 Aligned_cols=80 Identities=20% Similarity=0.175 Sum_probs=47.9 Q ss_pred CCCchhhHHHHHHHHH-----------------------HhhcCC----EEEEEecCCC---CCCCC-----ccHHHHHH Q lcl|NC_019527. 2 ATLTGGDKLAKILADI-----------------------GGKAQG----SVDVGFMSGA---TYPDG-----TPVAQVAF 46 (190) Q Consensus 2 a~i~~~d~l~~il~~l-----------------------~~l~~~----~V~VGi~~~~---~~~dG-----~~vA~iA~ 46 (190) .++.|-|.|.+-|+++ ++++.. -|.-|.+.+. ..++| .+.+++|. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 3444444433333222 111110 1112222211 11222 24578999 Q ss_pred HhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 47 WNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIK 84 (190) Q Consensus 47 ~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~ 84 (190) +.|||+- ..||||||+|+++.++.++.+.|+++++ T Consensus 81 ~vE~GT~---km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 81 FLEFGTR---YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhccccc---ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 9999984 4799999999999999999999999988 No 33 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=97.78 E-value=4.3e-08 Score=60.96 Aligned_cols=80 Identities=20% Similarity=0.175 Sum_probs=47.9 Q ss_pred CCCchhhHHHHHHHHH-----------------------HhhcCC----EEEEEecCCC---CCCCC-----ccHHHHHH Q lcl|NC_019527. 2 ATLTGGDKLAKILADI-----------------------GGKAQG----SVDVGFMSGA---TYPDG-----TPVAQVAF 46 (190) Q Consensus 2 a~i~~~d~l~~il~~l-----------------------~~l~~~----~V~VGi~~~~---~~~dG-----~~vA~iA~ 46 (190) .++.|-|.|.+-|+++ ++++.. -|.-|.+.+. ..++| .+.+++|. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 3444444433333222 111110 1112222211 11222 24578999 Q ss_pred HhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 47 WNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIK 84 (190) Q Consensus 47 ~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~ 84 (190) +.|||+- ..||||||+|+++.++.++.+.|+++++ T Consensus 81 ~vE~GT~---km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 81 FLEFGTR---YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhccccc---ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 9999984 4799999999999999999999999988 No 34 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=97.78 E-value=4.3e-08 Score=60.96 Aligned_cols=80 Identities=20% Similarity=0.175 Sum_probs=47.9 Q ss_pred CCCchhhHHHHHHHHH-----------------------HhhcCC----EEEEEecCCC---CCCCC-----ccHHHHHH Q lcl|NC_019527. 2 ATLTGGDKLAKILADI-----------------------GGKAQG----SVDVGFMSGA---TYPDG-----TPVAQVAF 46 (190) Q Consensus 2 a~i~~~d~l~~il~~l-----------------------~~l~~~----~V~VGi~~~~---~~~dG-----~~vA~iA~ 46 (190) .++.|-|.|.+-|+++ ++++.. -|.-|.+.+. ..++| .+.+++|. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 3444444433333222 111110 1112222211 11222 24578999 Q ss_pred HhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 47 WNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIK 84 (190) Q Consensus 47 ~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~ 84 (190) +.|||+- ..||||||+|+++.++.++.+.|+++++ T Consensus 81 ~vE~GT~---km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 81 FLEFGTR---YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhccccc---ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 9999984 4799999999999999999999999988 No 35 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=97.78 E-value=4.3e-08 Score=60.96 Aligned_cols=80 Identities=20% Similarity=0.175 Sum_probs=47.9 Q ss_pred CCCchhhHHHHHHHHH-----------------------HhhcCC----EEEEEecCCC---CCCCC-----ccHHHHHH Q lcl|NC_019527. 2 ATLTGGDKLAKILADI-----------------------GGKAQG----SVDVGFMSGA---TYPDG-----TPVAQVAF 46 (190) Q Consensus 2 a~i~~~d~l~~il~~l-----------------------~~l~~~----~V~VGi~~~~---~~~dG-----~~vA~iA~ 46 (190) .++.|-|.|.+-|+++ ++++.. -|.-|.+.+. ..++| .+.+++|. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 3444444433333222 111110 1112222211 11222 24578999 Q ss_pred HhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 47 WNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIK 84 (190) Q Consensus 47 ~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~ 84 (190) +.|||+- ..||||||+|+++.++.++.+.|+++++ T Consensus 81 ~vE~GT~---km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 81 FLEFGTR---YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhccccc---ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 9999984 4799999999999999999999999988 No 36 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=97.78 E-value=4.3e-08 Score=60.96 Aligned_cols=80 Identities=20% Similarity=0.175 Sum_probs=47.9 Q ss_pred CCCchhhHHHHHHHHH-----------------------HhhcCC----EEEEEecCCC---CCCCC-----ccHHHHHH Q lcl|NC_019527. 2 ATLTGGDKLAKILADI-----------------------GGKAQG----SVDVGFMSGA---TYPDG-----TPVAQVAF 46 (190) Q Consensus 2 a~i~~~d~l~~il~~l-----------------------~~l~~~----~V~VGi~~~~---~~~dG-----~~vA~iA~ 46 (190) .++.|-|.|.+-|+++ ++++.. -|.-|.+.+. ..++| .+.+++|. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 3444444433333222 111110 1112222211 11222 24578999 Q ss_pred HhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 47 WNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIK 84 (190) Q Consensus 47 ~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~ 84 (190) +.|||+- ..||||||+|+++.++.++.+.|+++++ T Consensus 81 ~vE~GT~---km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 81 FLEFGTR---YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhccccc---ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 9999984 4799999999999999999999999988 No 37 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=97.78 E-value=4.3e-08 Score=60.96 Aligned_cols=80 Identities=20% Similarity=0.175 Sum_probs=47.9 Q ss_pred CCCchhhHHHHHHHHH-----------------------HhhcCC----EEEEEecCCC---CCCCC-----ccHHHHHH Q lcl|NC_019527. 2 ATLTGGDKLAKILADI-----------------------GGKAQG----SVDVGFMSGA---TYPDG-----TPVAQVAF 46 (190) Q Consensus 2 a~i~~~d~l~~il~~l-----------------------~~l~~~----~V~VGi~~~~---~~~dG-----~~vA~iA~ 46 (190) .++.|-|.|.+-|+++ ++++.. -|.-|.+.+. ..++| .+.+++|. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchh Confidence 3444444433333222 111110 1112222211 11222 24578999 Q ss_pred HhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 47 WNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIK 84 (190) Q Consensus 47 ~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~ 84 (190) +.|||+- ..||||||+|+++.++.++.+.|+++++ T Consensus 81 ~vE~GT~---km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 81 FLEFGTR---YMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred hhccccc---ccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 9999984 4799999999999999999999999988 No 38 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=97.78 E-value=6.9e-08 Score=59.84 Aligned_cols=92 Identities=16% Similarity=0.208 Sum_probs=49.7 Q ss_pred CCCC--chhhHHHHHHHHHHhhc---------------------------------------------CCEEEEEecCCC Q lcl|NC_019527. 1 MATL--TGGDKLAKILADIGGKA---------------------------------------------QGSVDVGFMSGA 33 (190) Q Consensus 1 Ma~i--~~~d~l~~il~~l~~l~---------------------------------------------~~~V~VGi~~~~ 33 (190) |++| +|-|.|.+-|+.|.... ...+.||+..+. T Consensus 1 M~~~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~vg~~~~~ 80 (140) T protein:vir:14 1 MSSIQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccceeEEeeeeecc Confidence 8844 56565544444332110 112233332211 Q ss_pred C-CCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHH Q lcl|NC_019527. 34 T-YPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEM 100 (190) Q Consensus 34 ~-~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~ 100 (190) . .-++-+.+.++.+.|||+. .+||+|||+|+++.+++++.+.+.+.+... .+++|..- . T Consensus 81 ~~~~~~~~~~~y~~f~E~GT~---~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~---l~k~~~~~--~ 140 (140) T protein:vir:14 81 KGKADSPNNAFYWRFDEFGTQ---HMKAQPFMRPAFDASIGEAEGAIRTELARA---IDRVLGGR--R 140 (140) T ss_pred ccccCCCCccceeeeeccccC---CCCCCcchhHHHHHHHHHHHHHHHHHHHHH---HHHHhhcc--C Confidence 1 1112345778899999984 489999999999999988887777655321 01111000 0 No 39 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=97.75 E-value=5.2e-08 Score=60.52 Aligned_cols=77 Identities=14% Similarity=0.119 Sum_probs=49.4 Q ss_pred CCCCchhh-----HHHHHHH--HHH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHH Q lcl|NC_019527. 1 MATLTGGD-----KLAKILA--DIG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKS 72 (190) Q Consensus 1 Ma~i~~~d-----~l~~il~--~l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~ 72 (190) -+....++ ++..-+. ..+ .-....+.||+..+ .+.++.+.|||+. ..||+|||+|++++.+ T Consensus 44 ~ap~~~~~~~~~~h~~d~I~~~~~k~~~g~~~~~VG~~k~--------~~~y~~f~E~GT~---k~~a~pF~~pa~~~~~ 112 (128) T protein:vir:38 44 NTPEWDGETDMSGHLRDDIKLSSVRETSGLTEVDVGYGKD--------TGWRAHFPNSGTS---MQDPQHFIEETQEIMR 112 (128) T ss_pred hCCCcCCCCcccchhhhhhccccccccCceeEEEeeecCC--------CceEEeeeccCcc---CCCCCcchhHHHHHhH Confidence 11111111 1222111 111 11335688998322 3678899999974 4799999999999999 Q ss_pred HHHHHHHHHHHhhcCC Q lcl|NC_019527. 73 SEWPKRLGDAIKHYDG 88 (190) Q Consensus 73 ~~~~~~l~~~i~~g~~ 88 (190) +++.+.+.+.+..+.+ T Consensus 113 ~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 113 PVVIAAFLSHLKEGGM 128 (128) T ss_pred HHHHHHHHHHHHhhcC Confidence 9999999888877766 No 40 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=97.72 E-value=8.2e-08 Score=59.43 Aligned_cols=81 Identities=16% Similarity=0.166 Sum_probs=47.0 Q ss_pred CCCCc------------hhhHHHHHH--HHHH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhh Q lcl|NC_019527. 1 MATLT------------GGDKLAKIL--ADIG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFR 65 (190) Q Consensus 1 Ma~i~------------~~d~l~~il--~~l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr 65 (190) .+-.. .+.++..-+ ..++ .-....|.||+..+.. +.+.++.+.|||+. ..||+|||| T Consensus 50 ~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~~~~~g~~~~~VG~~~~~~-----~~~~y~~f~E~GT~---k~~a~pF~~ 121 (149) T protein:vir:13 50 LIHISDDNSKSGRKGSRPPGHAANNIPEPKIRKKKGNLQCVVGWEKSDN-----TPFYYMKMEEWGTS---ERPPHHAFG 121 (149) T ss_pred hCCccCCccccccccccccchhhhcceecccccccceeEEEeeccCCCC-----CccceeeeeccCcc---CCCCCccch Confidence 11000 011121111 1121 1234468999864332 23688999999985 479999999 Q ss_pred HHHHHHHHHHHHHHHH----HHhhcCCc Q lcl|NC_019527. 66 NMVNEKSSEWPKRLGD----AIKHYDGD 89 (190) Q Consensus 66 ~~~~~~~~~~~~~l~~----~i~~g~~~ 89 (190) |+++++++++.+.+.+ +|+...+| T Consensus 122 pa~~~~~~~~~~~~~~~l~k~i~~~lG~ 149 (149) T protein:vir:13 122 KTNKILKRVYDNIAQKKYDNFVKEKLGD 149 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 9999999888766654 45443334 No 41 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=97.72 E-value=4.9e-07 Score=55.17 Aligned_cols=92 Identities=11% Similarity=0.068 Sum_probs=65.5 Q ss_pred HHHHHHHHHHHHHHHHhhc-CCcHHHHHHHHHHHHHHHHHHHHhcc------CCCCCChHHHHHhccccccccchhhhhh Q lcl|NC_019527. 68 VNEKSSEWPKRLGDAIKHY-DGDGRKALASMGEMIGGDLGSSIIST------NEPALSKTTLMLRSIYGNNPQEIRARDV 140 (190) Q Consensus 68 ~~~~~~~~~~~l~~~i~~g-~~~~~~aL~~iG~~a~~~Iq~~I~~~------~~pPnap~Ti~~K~~~~~~~~~~~~~~~ 140 (190) +++ -.++...|..++..- ..+....|..||..+....++.|++. .|+|+++.|++.|+.+ T Consensus 1 m~d-~~~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~------------ 67 (149) T protein:vir:98 1 MSE-LTALQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGR------------ 67 (149) T ss_pred Cch-HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCC------------ Confidence 222 234445555555332 12457789999999999999999985 5999999999887753 Q ss_pred hhhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEee-----ccCC Q lcl|NC_019527. 141 LAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKV-----NYGR 190 (190) Q Consensus 141 ~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~-----~~~~ 190 (190) ..+||+++|.|.+||+|.+.+. .+.|.+ -|++ T Consensus 68 -----------------~~~~l~~~g~l~~sl~~~~~~~-~~~V~~~Gs~~~yAa 104 (149) T protein:vir:98 68 -----------------IRREMFARLRTNRFMKAKGSDS-AAVVEFTGRVQRMAR 104 (149) T ss_pred -----------------CCcccchhhhhhhhhhheecCC-eeEEEecCcchHHhh Confidence 3679999999999999987644 555532 2322 No 42 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=97.72 E-value=2.4e-07 Score=56.89 Aligned_cols=85 Identities=16% Similarity=0.220 Sum_probs=48.4 Q ss_pred CCCC--chhhHHHHHHHHHHhh-c--------------------------------------------CCEEEEEecCCC Q lcl|NC_019527. 1 MATL--TGGDKLAKILADIGGK-A--------------------------------------------QGSVDVGFMSGA 33 (190) Q Consensus 1 Ma~i--~~~d~l~~il~~l~~l-~--------------------------------------------~~~V~VGi~~~~ 33 (190) |++| +|-|.|.+-|+.|... . ...+.||+.... T Consensus 1 Ma~~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~~g~~~~~ 80 (140) T protein:vir:10 1 MSSIQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRT 80 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccccccceEEeeeeecc Confidence 8844 4556554444333211 0 011222222111 Q ss_pred C-CCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhh-------cCC Q lcl|NC_019527. 34 T-YPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKH-------YDG 88 (190) Q Consensus 34 ~-~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~-------g~~ 88 (190) . .-++.+.+.++.+.|||+. .+||+|||+|+++++++++.+.+.+.+.. |.. T Consensus 81 ~~~~~~~~~~~y~~f~E~GT~---~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 81 KGKADSPNNAFYWRFDEFGTQ---HMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred ccccCCCCccceeeeeccCCC---CCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 0 0011244778899999984 48999999999999999888877765532 322 No 43 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=97.61 E-value=3.2e-08 Score=61.65 Aligned_cols=82 Identities=21% Similarity=0.319 Sum_probs=50.8 Q ss_pred CCCC--chhhHHHHHHHHHH--------------hh-----cCCEEEEEecCCC-------C-CCCC---ccHHHHHHHh Q lcl|NC_019527. 1 MATL--TGGDKLAKILADIG--------------GK-----AQGSVDVGFMSGA-------T-YPDG---TPVAQVAFWN 48 (190) Q Consensus 1 Ma~i--~~~d~l~~il~~l~--------------~l-----~~~~V~VGi~~~~-------~-~~dG---~~vA~iA~~~ 48 (190) |++| .|-|.|.+-|+++. .+ ......+++..|. . .++| .+.+.+|.++ T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~Ya~~v 80 (114) T protein:vir:49 1 MATIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDKATVEALTSYSGYL 80 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCeeEecCCCCcccee Confidence 8854 46665544333220 00 0001111222111 0 1223 3457899999 Q ss_pred hcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019527. 49 EFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKH 85 (190) Q Consensus 49 EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~ 85 (190) |||+- ..||||||||+++.++.++.+.|++.++. T Consensus 81 EfGT~---km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 81 EVGTR---KMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred ccccc---ccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 99974 47999999999999999999999998876 No 44 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=97.61 E-value=3.2e-08 Score=61.65 Aligned_cols=82 Identities=21% Similarity=0.319 Sum_probs=50.8 Q ss_pred CCCC--chhhHHHHHHHHHH--------------hh-----cCCEEEEEecCCC-------C-CCCC---ccHHHHHHHh Q lcl|NC_019527. 1 MATL--TGGDKLAKILADIG--------------GK-----AQGSVDVGFMSGA-------T-YPDG---TPVAQVAFWN 48 (190) Q Consensus 1 Ma~i--~~~d~l~~il~~l~--------------~l-----~~~~V~VGi~~~~-------~-~~dG---~~vA~iA~~~ 48 (190) |++| .|-|.|.+-|+++. .+ ......+++..|. . .++| .+.+.+|.++ T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~Ya~~v 80 (114) T protein:vir:27 1 MATIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDKATVEALTSYSGYL 80 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCeeEecCCCCcccee Confidence 8854 46665544333220 00 0001111222111 0 1223 3457899999 Q ss_pred hcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019527. 49 EFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKH 85 (190) Q Consensus 49 EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~ 85 (190) |||+- ..||||||||+++.++.++.+.|++.++. T Consensus 81 EfGT~---km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 81 EVGTR---KMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred ccccc---ccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 99974 47999999999999999999999998876 No 45 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=97.58 E-value=2.2e-07 Score=57.12 Aligned_cols=77 Identities=16% Similarity=0.140 Sum_probs=47.8 Q ss_pred CCCCc--hhhHHHHHHHH--HH--hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHH Q lcl|NC_019527. 1 MATLT--GGDKLAKILAD--IG--GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSE 74 (190) Q Consensus 1 Ma~i~--~~d~l~~il~~--l~--~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~ 74 (190) .+... .+.+|..-+.. .+ ......|.||+-. +.+.++.+.|||+. ..||||||+|+++.++++ T Consensus 45 ~ap~~~~~tg~l~~~I~~~~~k~~~~g~~~v~Vg~~~--------~~~~y~~f~E~GT~---~~~a~Pf~~pa~~~~~~~ 113 (127) T protein:vir:12 45 HVNRSDKKQPHMQDNITVSNVRESKDGVRFVAVGPNK--------KVAYRGRFLEWGTS---KMPPQPFIEKGGKEGEGP 113 (127) T ss_pred hCCCCCCChhHHHHhhhccccccccCceeEEEEeeCC--------CCcceeeeeccCcc---CCCCCccchHhHHHHHHH Confidence 12111 12233222210 10 1133467788632 24778899999984 479999999999999999 Q ss_pred HHHHHHHHHhhcCC Q lcl|NC_019527. 75 WPKRLGDAIKHYDG 88 (190) Q Consensus 75 ~~~~l~~~i~~g~~ 88 (190) +.+.+.+.+...-- T Consensus 114 ~~~~~~~~~~~~lk 127 (127) T protein:vir:12 114 AVELMERILTAPIK 127 (127) T ss_pred HHHHHHHHHHHhcC Confidence 98888877755422 No 46 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=97.55 E-value=1.3e-07 Score=58.38 Aligned_cols=80 Identities=20% Similarity=0.164 Sum_probs=47.9 Q ss_pred CCCchhhHHHHHHHHHH-----------------------hhcCC----EEEEEecCCC---CCCCC-----ccHHHHHH Q lcl|NC_019527. 2 ATLTGGDKLAKILADIG-----------------------GKAQG----SVDVGFMSGA---TYPDG-----TPVAQVAF 46 (190) Q Consensus 2 a~i~~~d~l~~il~~l~-----------------------~l~~~----~V~VGi~~~~---~~~dG-----~~vA~iA~ 46 (190) .+|.|-|+|.+-|+++. .++.. -|.-|.+... ..++| .+.+.+|. T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~~g~~~~~V~~~~~Ya~ 80 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTVDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEecCCccccc Confidence 44444444433332221 11110 1111222111 11122 24588999 Q ss_pred HhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 47 WNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIK 84 (190) Q Consensus 47 ~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~ 84 (190) +.|||+- ..||||||+|+++.++..+.+.|+++++ T Consensus 81 ~vE~GT~---~m~a~PFl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 81 FLEFGTR---YMEAEPFMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred ccccccc---ccCCCCcchhhHHHHHHHHHHHHHHHhC Confidence 9999984 4799999999999999999999999888 No 47 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=97.50 E-value=1.8e-06 Score=52.08 Aligned_cols=92 Identities=5% Similarity=0.024 Sum_probs=64.9 Q ss_pred HHHHHHHHHHHHHHHhhc-CCcHHHHHHHHHHHHHHHHHHHHhcc------CCCCCChHHHHHhccccccccchhhhhhh Q lcl|NC_019527. 69 NEKSSEWPKRLGDAIKHY-DGDGRKALASMGEMIGGDLGSSIIST------NEPALSKTTLMLRSIYGNNPQEIRARDVL 141 (190) Q Consensus 69 ~~~~~~~~~~l~~~i~~g-~~~~~~aL~~iG~~a~~~Iq~~I~~~------~~pPnap~Ti~~K~~~~~~~~~~~~~~~~ 141 (190) -+.-+++...|...+..- ..+.+.+|..||.......++.|.+. .|+|+++.|+..|..++ T Consensus 1 m~~~~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~~------------ 68 (150) T protein:vir:57 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRV------------ 68 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccCC------------ Confidence 222244444455555432 23457789999999999999999986 79999999998887542 Q ss_pred hhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEe------eccCC Q lcl|NC_019527. 142 AAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVK------VNYGR 190 (190) Q Consensus 142 ~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~------~~~~~ 190 (190) .++|+++|.|..||+|.+.. ...-|+ +.|++ T Consensus 69 -----------------~~~l~~~~~l~~sl~~~~~~-~~a~vg~~~G~~~~yAa 105 (150) T protein:vir:57 69 -----------------KRKMFAKLITSRFLHIRASP-EQASMEFYGGKSPKIAS 105 (150) T ss_pred -----------------CcccchhhhhccceeeeeeC-cEEEEEeecCCchhhhh Confidence 57899999999999998763 333332 22443 No 48 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=97.43 E-value=1.6e-07 Score=57.80 Aligned_cols=80 Identities=21% Similarity=0.352 Sum_probs=48.8 Q ss_pred CCCCc-hhhHHHHHHHHHHhhc---------------------CCEEEEEecCCC----CCCCC-----ccHHHHHHHhh Q lcl|NC_019527. 1 MATLT-GGDKLAKILADIGGKA---------------------QGSVDVGFMSGA----TYPDG-----TPVAQVAFWNE 49 (190) Q Consensus 1 Ma~i~-~~d~l~~il~~l~~l~---------------------~~~V~VGi~~~~----~~~dG-----~~vA~iA~~~E 49 (190) ||++. +.|.|.+-|+++.... ..-|.-|-+..+ ...+| .+.+.+|.+.| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:10 1 MAKVKYGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEYAVYVN 80 (137) T ss_pred CccchhCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEecCCcEEEEEecCCccccccc Confidence 99885 6666544443322110 001222332221 11233 24578999999 Q ss_pred cCccc--------------------------cCCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_019527. 50 FGHGG--------------------------RFPAPPRPFFRNMVNEKSSEWPKRLG 80 (190) Q Consensus 50 fG~~~--------------------------~~~IP~RPFlr~~~~~~~~~~~~~l~ 80 (190) ||+.. ..++||||||+|+++++++++.+.|. T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 81 YGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 99721 12489999999999999999999888 No 49 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=97.42 E-value=2.7e-06 Score=51.15 Aligned_cols=92 Identities=5% Similarity=0.024 Sum_probs=63.9 Q ss_pred HHHHHHHHHHHHHHHhhc-CCcHHHHHHHHHHHHHHHHHHHHhcc------CCCCCChHHHHHhccccccccchhhhhhh Q lcl|NC_019527. 69 NEKSSEWPKRLGDAIKHY-DGDGRKALASMGEMIGGDLGSSIIST------NEPALSKTTLMLRSIYGNNPQEIRARDVL 141 (190) Q Consensus 69 ~~~~~~~~~~l~~~i~~g-~~~~~~aL~~iG~~a~~~Iq~~I~~~------~~pPnap~Ti~~K~~~~~~~~~~~~~~~~ 141 (190) -+.-.++...|...+..- ..+.+..|..||.......++.|.+. .|+|+++.|+++|..++ T Consensus 1 ~~~~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~------------ 68 (150) T protein:vir:60 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRV------------ 68 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCC------------ Confidence 222333344444444332 12457789999999999999999986 69999999998887532 Q ss_pred hhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEe------eccCC Q lcl|NC_019527. 142 AAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVK------VNYGR 190 (190) Q Consensus 142 ~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~------~~~~~ 190 (190) .++|+++|.|..||+|.+. +...-|+ +.|++ T Consensus 69 -----------------~~~l~~~~~l~~sl~~~~~-~~~a~vg~~~Gt~~~yAa 105 (150) T protein:vir:60 69 -----------------KRKMFAKLITSRFLHIRAS-PEQASMEFYGGKSPKIAS 105 (150) T ss_pred -----------------Cccchhhhhhcceeeeeee-CcEEEEEeeCCCchhhhh Confidence 5789999999999999886 2333332 22444 No 50 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=97.42 E-value=1.5e-07 Score=58.04 Aligned_cols=80 Identities=21% Similarity=0.329 Sum_probs=47.6 Q ss_pred CCCC--chhhHHHHHHHHHHh--------------h-----cCCEEEEEecCCC-----C-CCCC-----ccHHHHHHHh Q lcl|NC_019527. 1 MATL--TGGDKLAKILADIGG--------------K-----AQGSVDVGFMSGA-----T-YPDG-----TPVAQVAFWN 48 (190) Q Consensus 1 Ma~i--~~~d~l~~il~~l~~--------------l-----~~~~V~VGi~~~~-----~-~~dG-----~~vA~iA~~~ 48 (190) ||+| .|-|+|.+-|+.+.. + ....-..++..|. + ..+| .+.+.+|.+. T Consensus 1 Ma~i~i~Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~~~~~~~~v~~~~~Ya~~v 80 (112) T protein:vir:96 1 MATIEFEGLDEMAQSLLKNASSERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLEAGSDRAVVEALTNYSGYL 80 (112) T ss_pred CceeeehHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeeecCceEEEecCCCCcccee Confidence 8855 466655443332210 0 0001112222211 0 1123 2457899999 Q ss_pred hcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_019527. 49 EFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAI 83 (190) Q Consensus 49 EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i 83 (190) |||+- ..|+||||+|+++.++.++.+.|++.- T Consensus 81 E~GTr---~m~AqPF~~PA~~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 81 EVGTR---KMEAQPFMRPALDQVVPEMVEEMAKWE 112 (112) T ss_pred ccCcc---ccCCCCchhhhHHHHHHHHHHHHHhcC Confidence 99974 489999999999999999888888743 No 51 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=97.42 E-value=1.9e-07 Score=57.42 Aligned_cols=86 Identities=23% Similarity=0.261 Sum_probs=50.0 Q ss_pred CC-----CCchhhHHHHHHHHHHhhcCCE---------------------EEEEecCCC------CC-CCC-----ccHH Q lcl|NC_019527. 1 MA-----TLTGGDKLAKILADIGGKAQGS---------------------VDVGFMSGA------TY-PDG-----TPVA 42 (190) Q Consensus 1 Ma-----~i~~~d~l~~il~~l~~l~~~~---------------------V~VGi~~~~------~~-~dG-----~~vA 42 (190) || .++|.|.|.+-|+++....... +.-|-+.+. .. .+| -+.+ T Consensus 1 Ma~~~~i~~~Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~~~~~~~~v~~~~ 80 (125) T protein:vir:94 1 MANDFNIKFKGVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVKEEHGVVTGRYVARA 80 (125) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecceeccCCcEEEEeeCCC Confidence 66 3446666544444332110000 111222111 01 112 2457 Q ss_pred HHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhc-CCc Q lcl|NC_019527. 43 QVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHY-DGD 89 (190) Q Consensus 43 ~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g-~~~ 89 (190) .+|.+.|||+- ..|+||||+|+++.++.++.+.|++.|... ..+ T Consensus 81 ~Ya~~vEfGT~---~~~a~Pfl~pa~~~~~~~~~~~l~~~l~~a~k~~ 125 (125) T protein:vir:94 81 DYSSYNEYGTY---RMSAQPFMAPSVAAMTPFFYKAVRDALNKAAKFS 125 (125) T ss_pred Cccceeecccc---cCCCCcccchhHHHHHHHHHHHHHHHHHHHhccC Confidence 88999999974 479999999999999999988888877532 223 No 52 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=97.41 E-value=2.9e-07 Score=56.44 Aligned_cols=80 Identities=20% Similarity=0.304 Sum_probs=48.6 Q ss_pred CCCCc-hhhHHHHHHHHHHhhc---------------------CCEEEEEecCCC----CCCCC-----ccHHHHHHHhh Q lcl|NC_019527. 1 MATLT-GGDKLAKILADIGGKA---------------------QGSVDVGFMSGA----TYPDG-----TPVAQVAFWNE 49 (190) Q Consensus 1 Ma~i~-~~d~l~~il~~l~~l~---------------------~~~V~VGi~~~~----~~~dG-----~~vA~iA~~~E 49 (190) ||+++ +.|+|.+-|+++.... ..-|.-|-+..+ ...+| .+.+.+|.+.| T Consensus 1 Ma~~~~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVN 80 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeecCcEEEEEecCCCcccccc Confidence 99886 6666554444322110 001111222111 01122 24578999999 Q ss_pred cCccc--------------------------cCCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_019527. 50 FGHGG--------------------------RFPAPPRPFFRNMVNEKSSEWPKRLG 80 (190) Q Consensus 50 fG~~~--------------------------~~~IP~RPFlr~~~~~~~~~~~~~l~ 80 (190) ||+.. ..++||||||+|+++++++++.+.|. T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 81 YGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRVFFNKYFS 137 (137) T ss_pred cCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 99621 12589999999999999999999888 No 53 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=97.38 E-value=4.7e-07 Score=55.29 Aligned_cols=80 Identities=15% Similarity=0.182 Sum_probs=45.2 Q ss_pred CCCCchh---------hHHHHHHHH----HHhhcCCEEEEEecCCC---CCCCC-----ccHHHHHHHhhcCccccCCCC Q lcl|NC_019527. 1 MATLTGG---------DKLAKILAD----IGGKAQGSVDVGFMSGA---TYPDG-----TPVAQVAFWNEFGHGGRFPAP 59 (190) Q Consensus 1 Ma~i~~~---------d~l~~il~~----l~~l~~~~V~VGi~~~~---~~~dG-----~~vA~iA~~~EfG~~~~~~IP 59 (190) +..+..- +.+.+.... .+.+.. |.-|.+.+. ...++ .+.+.+|.+.|||+. ..| T Consensus 8 ~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aP--v~TG~Lr~sI~~~~~~~~~~~v~~~~~Ya~~vE~GT~---~m~ 82 (108) T protein:vir:99 8 LRSVERKQKSVRIAVDKELSKSAARIERQAKILAP--VDTGWLRAQIYSEQQRLLHYRVVSPALYSIYLELGTR---KME 82 (108) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--cCchhhhcceeeeecCcEEEEeecCcccchhcccCcc---ccC Confidence 1111100 112221111 222211 112222211 01111 245789999999985 479 Q ss_pred CCchhhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019527. 60 PRPFFRNMVNEKSSEWPKRLGDAIKH 85 (190) Q Consensus 60 ~RPFlr~~~~~~~~~~~~~l~~~i~~ 85 (190) +||||+|+++.++.++.+.|++.|+. T Consensus 83 a~Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 83 AQSFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred CCcchhhhHHHHHHHHHHHHHHHhcC Confidence 99999999999999999999998877 No 54 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=97.38 E-value=4.7e-07 Score=55.29 Aligned_cols=77 Identities=10% Similarity=0.053 Sum_probs=47.2 Q ss_pred CCCCchhhHHHHHHHHHH-------hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHH Q lcl|NC_019527. 1 MATLTGGDKLAKILADIG-------GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSS 73 (190) Q Consensus 1 Ma~i~~~d~l~~il~~l~-------~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~ 73 (190) -+-...+..-..+.++|. ......+.||+..+ .+.++.+.|||+. ..||+|||+|++++.++ T Consensus 41 ~ap~~~~~~~~hl~d~I~~~~~k~~~~g~~~~~VG~~k~--------~~~y~~f~E~GT~---k~~~~pF~~pa~~~~k~ 109 (125) T protein:vir:97 41 NTPVYEVETDERLQEDTVISGFKGANVGIVSKEIGYGKA--------TGWRAHYPNDGTI---YQRGQDFKERTINQMTP 109 (125) T ss_pred hCCcCCCCchhhHHhhhhcccccccccCceEEEEeecCC--------CceeEeeeccCcc---CCCcCccchHhHHHhHH Confidence 111111110011222221 12334678887422 3688999999984 47999999999999999 Q ss_pred HHHHHHHHHHhhc-CC Q lcl|NC_019527. 74 EWPKRLGDAIKHY-DG 88 (190) Q Consensus 74 ~~~~~l~~~i~~g-~~ 88 (190) ++.+.+.+.+... .. T Consensus 110 ~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 110 KAKQLYAEKVKEGLGL 125 (125) T ss_pred HHHHHHHHHHHHHhcC Confidence 9988888777543 12 No 55 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=97.38 E-value=5.5e-08 Score=60.37 Aligned_cols=79 Identities=27% Similarity=0.421 Sum_probs=47.1 Q ss_pred CC---CCchhhHHHHHHHHH---------------------HhhcCCEEEEEecCCC----CCCCC-----ccHHHHHHH Q lcl|NC_019527. 1 MA---TLTGGDKLAKILADI---------------------GGKAQGSVDVGFMSGA----TYPDG-----TPVAQVAFW 47 (190) Q Consensus 1 Ma---~i~~~d~l~~il~~l---------------------~~l~~~~V~VGi~~~~----~~~dG-----~~vA~iA~~ 47 (190) |+ ++.|-|.|.+-|+++ +.+. -|.-|-+.+. ..++| .+.+.+|.+ T Consensus 1 M~~~i~i~Gld~l~~~L~~~~~~~~~~~al~~~~~~i~~~ak~~a--PvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~ 78 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAASLKGVQQVVKSNTSNMTANMQKLV--PVDTGYMKRSIKMELTEGGFSGQAGPHTDYSAY 78 (112) T ss_pred CceeeeehhHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhC--CCCchhhhhceeeeecCCceEEEeecCCCccce Confidence 55 334444332222111 1111 0111111111 11233 245889999 Q ss_pred hhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 48 NEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIK 84 (190) Q Consensus 48 ~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~ 84 (190) .|||+. ..|+||||+|+++.++.++.+.|++.|+ T Consensus 79 vE~GT~---k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 79 VEYGTR---FQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred eecccc---ccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 999984 4799999999999999999999999887 No 56 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=97.37 E-value=1.1e-07 Score=58.73 Aligned_cols=93 Identities=24% Similarity=0.312 Sum_probs=51.1 Q ss_pred CCC--CchhhHHHHHHHHHHh---------h----------------cCCEEEEEecCCC-----C-CCCC-----ccHH Q lcl|NC_019527. 1 MAT--LTGGDKLAKILADIGG---------K----------------AQGSVDVGFMSGA-----T-YPDG-----TPVA 42 (190) Q Consensus 1 Ma~--i~~~d~l~~il~~l~~---------l----------------~~~~V~VGi~~~~-----~-~~dG-----~~vA 42 (190) |++ |+|.|+|.+-|+++.. + ..--|.-|-+..+ . .+++ .+.+ T Consensus 1 m~~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~~~~~~g~V~~~~ 80 (182) T protein:vir:10 1 MIEVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVDGDEVIGRWWNSS 80 (182) T ss_pred CeEEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeecCCeEEEEeecCC Confidence 884 4688876555543311 0 0001222222111 0 0111 1347 Q ss_pred HHHHHhhcCccc---------------------------------------------------cCCCCCCchhhHHHHHH Q lcl|NC_019527. 43 QVAFWNEFGHGG---------------------------------------------------RFPAPPRPFFRNMVNEK 71 (190) Q Consensus 43 ~iA~~~EfG~~~---------------------------------------------------~~~IP~RPFlr~~~~~~ 71 (190) .+|.+.|||+.. ..+.||||||+|+++++ T Consensus 81 ~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~pA~~~~ 160 (182) T protein:vir:10 81 MVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMTPAANKM 160 (182) T ss_pred CccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchHHHHHHh Confidence 789999999720 02579999999999999 Q ss_pred HHHHHHHHHHHHhhcCCcHHHHHHHHHH Q lcl|NC_019527. 72 SSEWPKRLGDAIKHYDGDGRKALASMGE 99 (190) Q Consensus 72 ~~~~~~~l~~~i~~g~~~~~~aL~~iG~ 99 (190) ++++.+.|.++|... +++.| |- T Consensus 161 ~~~i~~~i~~~i~~~---l~~~~---g~ 182 (182) T protein:vir:10 161 AKEAPEIIKRSIDQE---LHDKL---GG 182 (182) T ss_pred HHHHHHHHHHHHHHH---HHHhh---cC Confidence 999998888766431 00000 00 No 57 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=97.36 E-value=5.8e-07 Score=54.77 Aligned_cols=83 Identities=18% Similarity=0.175 Sum_probs=47.3 Q ss_pred CCCCc---hhhHHHHHHHHHHh----h-----------------cCCEEEEEecCCC----CCCCC-------ccHHHHH Q lcl|NC_019527. 1 MATLT---GGDKLAKILADIGG----K-----------------AQGSVDVGFMSGA----TYPDG-------TPVAQVA 45 (190) Q Consensus 1 Ma~i~---~~d~l~~il~~l~~----l-----------------~~~~V~VGi~~~~----~~~dG-------~~vA~iA 45 (190) ||+|+ +.++|.+.|+.+.+ . ...-|.-|-+..+ ...+| .+.+.+| T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~g~~~~~~v~~~~~YA 80 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSGGRFSFSVTIGTNVTYA 80 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeccCCceEEEEEecCcccc Confidence 99774 55544443332211 0 0111222322221 01122 1458899 Q ss_pred HHhhcCcccc------------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 46 FWNEFGHGGR------------------------FPAPPRPFFRNMVNEKSSEWPKRLGDAIK 84 (190) Q Consensus 46 ~~~EfG~~~~------------------------~~IP~RPFlr~~~~~~~~~~~~~l~~~i~ 84 (190) .++|||+... -++||||||+|+++++++++.+.+++ |. T Consensus 81 ~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~~~~~-~~ 142 (142) T protein:vir:94 81 ADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRNHAKG-IR 142 (142) T ss_pred hhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHHHHHHHHHHh-cC Confidence 9999997311 14789999999999999888777665 44 No 58 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=97.35 E-value=1.8e-07 Score=57.53 Aligned_cols=80 Identities=25% Similarity=0.457 Sum_probs=48.4 Q ss_pred CCCCc-hhhHHHHHHHHHHhhcC---------------------CEEEEEecCCCC----CCCC-----ccHHHHHHHhh Q lcl|NC_019527. 1 MATLT-GGDKLAKILADIGGKAQ---------------------GSVDVGFMSGAT----YPDG-----TPVAQVAFWNE 49 (190) Q Consensus 1 Ma~i~-~~d~l~~il~~l~~l~~---------------------~~V~VGi~~~~~----~~dG-----~~vA~iA~~~E 49 (190) ||++. |.|+|.+-|+.+..... .-|.-|-+...- ..+| .+.+.+|.+.| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~pvdTG~L~~Si~~~~~~~g~~~~V~~~~~YA~yvE 80 (137) T protein:vir:96 1 MAKVKYGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAPVDLGFLKESIDFKVTDGGFSSVISVGAEYAIYVE 80 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCccchhcCceeEeecCceEEEEecCCCcccccc Confidence 99886 66665444433221100 011122222110 1123 24578999999 Q ss_pred cCccc--------------------------cCCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_019527. 50 FGHGG--------------------------RFPAPPRPFFRNMVNEKSSEWPKRLG 80 (190) Q Consensus 50 fG~~~--------------------------~~~IP~RPFlr~~~~~~~~~~~~~l~ 80 (190) ||+.. ..++||||||+|+++++++.+.+.|. T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:96 81 FGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRKVFNRYFS 137 (137) T ss_pred cCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHHHHHHHhhC Confidence 99721 12489999999999999999988888 No 59 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=97.34 E-value=1.7e-07 Score=57.68 Aligned_cols=90 Identities=14% Similarity=0.129 Sum_probs=44.3 Q ss_pred CC-CCch-h--h----HHHHHHHHHHhhc----------------------CCEEEEEecCCCCC---------CCCccH Q lcl|NC_019527. 1 MA-TLTG-G--D----KLAKILADIGGKA----------------------QGSVDVGFMSGATY---------PDGTPV 41 (190) Q Consensus 1 Ma-~i~~-~--d----~l~~il~~l~~l~----------------------~~~V~VGi~~~~~~---------~dG~~v 41 (190) |. .+.. . . ....+.+.++... .....|++...... -.+-.- T Consensus 21 l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 100 (149) T protein:vir:19 21 LSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRN 100 (149) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeecccccccccccccccceeecCCCCc Confidence 21 0000 0 0 0111111121111 01112222111000 001234 Q ss_pred HHHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHH Q lcl|NC_019527. 42 AQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALAS 96 (190) Q Consensus 42 A~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~ 96 (190) +.++.+.|||+. +.||+|||+|+++++++++.+.+...|.. ..+++|.+ T Consensus 101 ~~y~~f~E~GT~---~~~a~PF~~pA~~~~k~~~~~~~~~~l~~---~l~k~~~k 149 (149) T protein:vir:19 101 AFYWRFVELGTA---NMPAHPFVRPAYDTREEEAASVAIARMNQ---AIDEVLSK 149 (149) T ss_pred cceeeeeccCCC---CCCCCcchhHHHHHHHHHHHHHHHHHHHH---HHHHHhcC Confidence 778889999984 48999999999999999888888776643 23344443 No 60 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=97.29 E-value=3.4e-07 Score=56.03 Aligned_cols=80 Identities=24% Similarity=0.503 Sum_probs=48.0 Q ss_pred CCCCc-hhhHHHHHHHHHHhhc---------------------CCEEEEEecCCC----CCCCC-----ccHHHHHHHhh Q lcl|NC_019527. 1 MATLT-GGDKLAKILADIGGKA---------------------QGSVDVGFMSGA----TYPDG-----TPVAQVAFWNE 49 (190) Q Consensus 1 Ma~i~-~~d~l~~il~~l~~l~---------------------~~~V~VGi~~~~----~~~dG-----~~vA~iA~~~E 49 (190) ||++. |.|+|.+-|+++.... ..-|.-|-+... ...+| .+.+.+|.+.| T Consensus 13 Ma~~~~Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~VE 92 (149) T protein:vir:94 13 MAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYAIYVE 92 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEeeCCcEEEEEecCCCcccccc Confidence 99875 6565544444332110 011122322221 01233 24578999999 Q ss_pred cCccc--------------------------cCCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_019527. 50 FGHGG--------------------------RFPAPPRPFFRNMVNEKSSEWPKRLG 80 (190) Q Consensus 50 fG~~~--------------------------~~~IP~RPFlr~~~~~~~~~~~~~l~ 80 (190) ||+.. .-+.||||||+|+++++++++.+.|. T Consensus 93 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i~~~i~ 149 (149) T protein:vir:94 93 YGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred cCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 99721 12479999999999999999888877 No 61 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=97.27 E-value=2.1e-06 Score=51.70 Aligned_cols=74 Identities=19% Similarity=0.145 Sum_probs=40.0 Q ss_pred CC-CCchhhHHHHHHHHHH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCcc-----ccCCCCCCchhhHHHHHHHH Q lcl|NC_019527. 1 MA-TLTGGDKLAKILADIG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHG-----GRFPAPPRPFFRNMVNEKSS 73 (190) Q Consensus 1 Ma-~i~~~d~l~~il~~l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~-----~~~~IP~RPFlr~~~~~~~~ 73 (190) .. .+.+..+|. .+|. ......|.||. +..||++|+||.. ..+.||+||||--+ ++..+ T Consensus 76 ~~~~L~~tg~L~---~Si~~~~~~~~v~vGt-----------~~~yA~vHqfG~~~~~~~~~~~iPaRpfLG~s-~~d~~ 140 (156) T protein:vir:19 76 PGSILTLHGDLA---RSITTDYGQDYALIGS-----------PKIYAAIHQWGGTPDMAPRPAGVPARPYMGLD-KTGEQ 140 (156) T ss_pred CCcchhhhHHHH---HHhhheecCCEEEEec-----------chhhhHHhhcCcccccCCCccccCCccccCCC-HHHHH Confidence 11 333333333 3443 23567888886 2578999999974 23469999999433 23344 Q ss_pred HHHHHHHHHHhhcCCcHHHHHHH Q lcl|NC_019527. 74 EWPKRLGDAIKHYDGDGRKALAS 96 (190) Q Consensus 74 ~~~~~l~~~i~~g~~~~~~aL~~ 96 (190) ++.+.+...|... |.+ T Consensus 141 ~I~~~i~~~l~~~-------~~~ 156 (156) T protein:vir:19 141 EIFDAIRKRVSAA-------LRQ 156 (156) T ss_pred HHHHHHHHHHHHH-------hhC Confidence 4444444333210 000 No 62 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=97.23 E-value=4.6e-07 Score=55.30 Aligned_cols=80 Identities=21% Similarity=0.351 Sum_probs=44.1 Q ss_pred CCCchhhH-------------HHHHHH----HHHhhcCC--EEEEEecCCC---C-CCCC-----ccHHHHHHHhhcCcc Q lcl|NC_019527. 2 ATLTGGDK-------------LAKILA----DIGGKAQG--SVDVGFMSGA---T-YPDG-----TPVAQVAFWNEFGHG 53 (190) Q Consensus 2 a~i~~~d~-------------l~~il~----~l~~l~~~--~V~VGi~~~~---~-~~dG-----~~vA~iA~~~EfG~~ 53 (190) .+|.|-|. +++.++ .+.+.... -|.-|-+... . ..+| .+.+.+|.+.|||+. T Consensus 1 i~i~Gld~l~~~l~~~~~~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~GT~ 80 (108) T protein:vir:98 1 MKITGIDALQKKLRKNATLNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFTDGGLTGTTIPHTDYAGYVEYGTR 80 (108) T ss_pred CcchhHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeeecCceEEEeecCCCccceeecccc Confidence 22222222 222221 11111100 0111111110 0 1122 134778999999984 Q ss_pred ccCCCCCCchhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 54 GRFPAPPRPFFRNMVNEKSSEWPKRLGDAIK 84 (190) Q Consensus 54 ~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~ 84 (190) ..|+||||+|+++..+.++.+.|++.|+ T Consensus 81 ---~m~aqPFl~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 81 ---FQAAQPFVKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred ---ccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 4799999999999999999999999887 No 63 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=97.20 E-value=7.4e-07 Score=54.20 Aligned_cols=79 Identities=20% Similarity=0.158 Sum_probs=37.2 Q ss_pred CCCCchhhHHHHHHH-HH-Hhh--cC-CEEEEEecCCCCCCCCccHHHHHHHhhcCccc--------------------- Q lcl|NC_019527. 1 MATLTGGDKLAKILA-DI-GGK--AQ-GSVDVGFMSGATYPDGTPVAQVAFWNEFGHGG--------------------- 54 (190) Q Consensus 1 Ma~i~~~d~l~~il~-~l-~~l--~~-~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~--------------------- 54 (190) -+-.+. -+|.+-+. .. +.- .+ ....|||-.. -+-++.+.|||+.. T Consensus 46 ~aP~~t-G~LkksI~~~~~~~~s~~g~~~~~Vg~~~~--------~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t 116 (157) T protein:vir:97 46 FVNDET-GKLRNNLYVAYSPEESVEGIQTYAVSWRKK--------AAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVN 116 (157) T ss_pred hCCCCc-chhhhheeeeeccccCCCceEEEEEeecCC--------ccceeeeeecCcccccccccCCcccccccccccCC Confidence 111111 11111110 00 000 01 1223444322 13445566777311 Q ss_pred cCCCCCCchhhHHHHHHHHHHHHHHHHHHh-------hcCC Q lcl|NC_019527. 55 RFPAPPRPFFRNMVNEKSSEWPKRLGDAIK-------HYDG 88 (190) Q Consensus 55 ~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~-------~g~~ 88 (190) ...+||||||||+|+..+++..+.+.+.|. .|+. T Consensus 117 ~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I~e~l~g~~ 157 (157) T protein:vir:97 117 PKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKYAELQRGDT 157 (157) T ss_pred CCcCCCCcccchHHHHhHHHHHHHHHHHHHHHHHHHhcCCC Confidence 124899999999999999988877655432 2432 No 64 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=97.19 E-value=7.7e-07 Score=54.10 Aligned_cols=80 Identities=23% Similarity=0.363 Sum_probs=47.8 Q ss_pred CCCCc-hhhHHHHHHHHHHhhc---------------------CCEEEEEecCCC----CCCCC-----ccHHHHHHHhh Q lcl|NC_019527. 1 MATLT-GGDKLAKILADIGGKA---------------------QGSVDVGFMSGA----TYPDG-----TPVAQVAFWNE 49 (190) Q Consensus 1 Ma~i~-~~d~l~~il~~l~~l~---------------------~~~V~VGi~~~~----~~~dG-----~~vA~iA~~~E 49 (190) ||++. |-|+|.+-|+++.... ..-|.-|-+... ...+| .+.+.+|.+.| T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~~vE 80 (137) T protein:vir:10 1 MAKVKYGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEYAVYVN 80 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeCCcEEEEEecCCCcccccc Confidence 99875 6666544444332100 001111222211 01123 24477999999 Q ss_pred cCccc--------------------------cCCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_019527. 50 FGHGG--------------------------RFPAPPRPFFRNMVNEKSSEWPKRLG 80 (190) Q Consensus 50 fG~~~--------------------------~~~IP~RPFlr~~~~~~~~~~~~~l~ 80 (190) ||+.. .-++||||||+|+++++++++.+.|. T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 81 YGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Confidence 99621 01479999999999999999998888 No 65 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=97.17 E-value=1.4e-06 Score=52.73 Aligned_cols=77 Identities=13% Similarity=0.117 Sum_probs=41.8 Q ss_pred CCCCchhhHHHHHHHH--HH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCcc--------ccCCCCCCchhhHHHH Q lcl|NC_019527. 1 MATLTGGDKLAKILAD--IG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHG--------GRFPAPPRPFFRNMVN 69 (190) Q Consensus 1 Ma~i~~~d~l~~il~~--l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~--------~~~~IP~RPFlr~~~~ 69 (190) -..++.+.-|.++... |. ..+...+.|||.+. +..||++|.||.. ..+.||+||||--+= T Consensus 65 k~~~~~~~m~~~L~~a~~l~~~a~~~~~~Vg~~Gt--------~~~yAaiHQfG~~~r~~~~~~~~v~iPaRp~LG~s~- 135 (152) T protein:vir:10 65 KSKIKSGKMFDKITQPRFMRLRLESEGVSLGYEGG--------DAVIARIHQQGLIGRVRKDWDLKVKYASRELLGFTD- 135 (152) T ss_pred cccccchhHHHhhhhcceeeeeecCcEEEEEecCC--------chhhhhhhccCccccccCCCCcceeccccccCCCCH- Confidence 1112222223332221 11 23567889998632 4689999999953 245699999994442 Q ss_pred HHHHHHHHHHHHHHhhc Q lcl|NC_019527. 70 EKSSEWPKRLGDAIKHY 86 (190) Q Consensus 70 ~~~~~~~~~l~~~i~~g 86 (190) +...++.+.+...|..- T Consensus 136 ~d~~~I~~~i~~~l~~a 152 (152) T protein:vir:10 136 DDLQMIEDYMINILAGS 152 (152) T ss_pred HHHHHHHHHHHHHHhcC Confidence 23344555555544322 No 66 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=97.16 E-value=8.9e-07 Score=53.76 Aligned_cols=87 Identities=20% Similarity=0.191 Sum_probs=49.2 Q ss_pred CCCCchhh---HHHHHHH-----------------------HHHhhcCC---EEEEEecCCCCCCCCc--------cHHH Q lcl|NC_019527. 1 MATLTGGD---KLAKILA-----------------------DIGGKAQG---SVDVGFMSGATYPDGT--------PVAQ 43 (190) Q Consensus 1 Ma~i~~~d---~l~~il~-----------------------~l~~l~~~---~V~VGi~~~~~~~dG~--------~vA~ 43 (190) ||+|+=.+ .+.+-|+ .|++.+.+ ...=+|-.....++|. +-.. T Consensus 1 Ma~i~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~~~g~~~~vv~~~~~~~ 80 (126) T protein:vir:81 1 MANITIDRLADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKEDGYGTTKRIIWNKKHYR 80 (126) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccchhhccccccccccCCcceEEEeccCCCC Confidence 99876332 2211111 22221111 1111121111222221 1144 Q ss_pred HHHHhhcCccccCC--CCCCchhhHHHHHHHHHHHHHHHHHHhhcC Q lcl|NC_019527. 44 VAFWNEFGHGGRFP--APPRPFFRNMVNEKSSEWPKRLGDAIKHYD 87 (190) Q Consensus 44 iA~~~EfG~~~~~~--IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~ 87 (190) ++.+.|||+-..++ .|+||||+|+++....++.+.++++|..|. T Consensus 81 l~HLLEfGha~r~gGrV~a~Phi~Pa~e~~~~~~~~~i~~~l~~gg 126 (126) T protein:vir:81 81 RVHLLEFGHAKVNGGRVKEYPHLRPAYDKHGARLPDELKRVIENGG 126 (126) T ss_pred ceeeeecceecCCCCccCCCcchHHHHHHHHHHHHHHHHHHhhcCC Confidence 57789999754322 699999999999999999999999998764 No 67 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=97.13 E-value=1.1e-07 Score=58.84 Aligned_cols=80 Identities=20% Similarity=0.318 Sum_probs=49.3 Q ss_pred CCCCc-hhhHHHHHHHHHHhhc---------------------CCEEEEEecCCC---C-CCCC-----ccHHHHHHHhh Q lcl|NC_019527. 1 MATLT-GGDKLAKILADIGGKA---------------------QGSVDVGFMSGA---T-YPDG-----TPVAQVAFWNE 49 (190) Q Consensus 1 Ma~i~-~~d~l~~il~~l~~l~---------------------~~~V~VGi~~~~---~-~~dG-----~~vA~iA~~~E 49 (190) ||++. +.|+|.+.|+++.... ..-|.-|-+..+ . ..+| .+.+.+|.+.| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:97 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVN 80 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCcccccc Confidence 99664 6666655554332110 011222332221 0 1122 24678999999 Q ss_pred cCcccc--------------------------CCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_019527. 50 FGHGGR--------------------------FPAPPRPFFRNMVNEKSSEWPKRLG 80 (190) Q Consensus 50 fG~~~~--------------------------~~IP~RPFlr~~~~~~~~~~~~~l~ 80 (190) ||+... .+.||||||+|+++.+++++.+.|. T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:97 81 YGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 998210 2479999999999999999999888 No 68 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=97.13 E-value=1.1e-07 Score=58.84 Aligned_cols=80 Identities=20% Similarity=0.318 Sum_probs=49.3 Q ss_pred CCCCc-hhhHHHHHHHHHHhhc---------------------CCEEEEEecCCC---C-CCCC-----ccHHHHHHHhh Q lcl|NC_019527. 1 MATLT-GGDKLAKILADIGGKA---------------------QGSVDVGFMSGA---T-YPDG-----TPVAQVAFWNE 49 (190) Q Consensus 1 Ma~i~-~~d~l~~il~~l~~l~---------------------~~~V~VGi~~~~---~-~~dG-----~~vA~iA~~~E 49 (190) ||++. +.|+|.+.|+++.... ..-|.-|-+..+ . ..+| .+.+.+|.+.| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVN 80 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCcccccc Confidence 99664 6666655554332110 011222332221 0 1122 24678999999 Q ss_pred cCcccc--------------------------CCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_019527. 50 FGHGGR--------------------------FPAPPRPFFRNMVNEKSSEWPKRLG 80 (190) Q Consensus 50 fG~~~~--------------------------~~IP~RPFlr~~~~~~~~~~~~~l~ 80 (190) ||+... .+.||||||+|+++.+++++.+.|. T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 81 YGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 998210 2479999999999999999999888 No 69 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=97.13 E-value=1.1e-07 Score=58.84 Aligned_cols=80 Identities=20% Similarity=0.318 Sum_probs=49.3 Q ss_pred CCCCc-hhhHHHHHHHHHHhhc---------------------CCEEEEEecCCC---C-CCCC-----ccHHHHHHHhh Q lcl|NC_019527. 1 MATLT-GGDKLAKILADIGGKA---------------------QGSVDVGFMSGA---T-YPDG-----TPVAQVAFWNE 49 (190) Q Consensus 1 Ma~i~-~~d~l~~il~~l~~l~---------------------~~~V~VGi~~~~---~-~~dG-----~~vA~iA~~~E 49 (190) ||++. +.|+|.+.|+++.... ..-|.-|-+..+ . ..+| .+.+.+|.+.| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:93 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVN 80 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCcccccc Confidence 99664 6666655554332110 011222332221 0 1122 24678999999 Q ss_pred cCcccc--------------------------CCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_019527. 50 FGHGGR--------------------------FPAPPRPFFRNMVNEKSSEWPKRLG 80 (190) Q Consensus 50 fG~~~~--------------------------~~IP~RPFlr~~~~~~~~~~~~~l~ 80 (190) ||+... .+.||||||+|+++.+++++.+.|. T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:93 81 YGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 998210 2479999999999999999999888 No 70 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=97.13 E-value=1.4e-06 Score=52.66 Aligned_cols=83 Identities=16% Similarity=0.182 Sum_probs=39.8 Q ss_pred CCCC---chhhHHHHHHH--HHHh-hcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHH Q lcl|NC_019527. 1 MATL---TGGDKLAKILA--DIGG-KAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSE 74 (190) Q Consensus 1 Ma~i---~~~d~l~~il~--~l~~-l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~ 74 (190) .+-+ +...+|..-+. ..+. -....|.|++-.+. +...++.+.|||+. ..||+|||+|+++.++++ T Consensus 47 ~ap~~~~~~~g~l~~~I~i~~~k~~~~~~~v~v~vg~~~------~~~~~~~f~E~GT~---~~~a~PF~~pa~~~~~~~ 117 (135) T protein:vir:57 47 NAGYDNSSTNAHMRDSIKIRSSRGKAGSTVVVLRVGPTR------SHYMKALAQEFGTI---KQVAKPFIRPALDYNKMQ 117 (135) T ss_pred hCCCCCCCchhhHHhhcccccccccccceeEEEEecCCC------CcceeEeecccCCC---CCCCCcchhHhHHHhHHH Confidence 1111 11111111110 0000 01122333331111 11233444599984 479999999999999999 Q ss_pred HHHHHHHHHhhcCCcHHHHHHHHHH Q lcl|NC_019527. 75 WPKRLGDAIKHYDGDGRKALASMGE 99 (190) Q Consensus 75 ~~~~l~~~i~~g~~~~~~aL~~iG~ 99 (190) +.+.+.+.+.. .|++++. T Consensus 118 ~~~~~~~~~~~-------~l~ka~r 135 (135) T protein:vir:57 118 VLRILTVEIRD-------GLSTLSR 135 (135) T ss_pred HHHHHHHHHHH-------HHHHhcC Confidence 88887776643 2333333 No 71 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=97.12 E-value=5.3e-07 Score=55.00 Aligned_cols=80 Identities=24% Similarity=0.503 Sum_probs=48.7 Q ss_pred CCCCc-hhhHHHHHHHHHHhhc---------------------CCEEEEEecCCC----CCCCC-----ccHHHHHHHhh Q lcl|NC_019527. 1 MATLT-GGDKLAKILADIGGKA---------------------QGSVDVGFMSGA----TYPDG-----TPVAQVAFWNE 49 (190) Q Consensus 1 Ma~i~-~~d~l~~il~~l~~l~---------------------~~~V~VGi~~~~----~~~dG-----~~vA~iA~~~E 49 (190) ||++. |.|+|.+-|+++.... ..-|.-|.+... ...+| .+.+.+|.+.| T Consensus 13 Ma~v~~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~~~~g~~~~V~~~~~YA~~vE 92 (149) T protein:vir:10 13 MAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYAIYVE 92 (149) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEecCCcEEEEEecCCCcccccc Confidence 99885 6666555444331110 011222333222 01233 24578999999 Q ss_pred cCccc--------------------------cCCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_019527. 50 FGHGG--------------------------RFPAPPRPFFRNMVNEKSSEWPKRLG 80 (190) Q Consensus 50 fG~~~--------------------------~~~IP~RPFlr~~~~~~~~~~~~~l~ 80 (190) ||+.. .-+.||||||+|+++++++++.+.|. T Consensus 93 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i~~~i~ 149 (149) T protein:vir:10 93 YGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred cCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 99721 12479999999999999999988887 No 72 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=97.11 E-value=5.3e-07 Score=54.97 Aligned_cols=77 Identities=30% Similarity=0.412 Sum_probs=41.6 Q ss_pred CC--CC--chhhH-H---HHHHHHHHh-----hcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHH Q lcl|NC_019527. 1 MA--TL--TGGDK-L---AKILADIGG-----KAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNM 67 (190) Q Consensus 1 Ma--~i--~~~d~-l---~~il~~l~~-----l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~ 67 (190) .+ ++ +++++ | -.+..+|.. -.+..|.||- +..||+||+||+.. ++||+||||-+. T Consensus 52 Ls~st~a~k~~~~~L~~tG~L~~Si~~~~~~~~~~~~a~vGt-----------n~~YA~~hqfG~~~-~~IPaRPfLG~~ 119 (145) T protein:vir:31 52 LKESTIRAKGSDTPLIDNSRLLTDINAASMMDRANRMAVIGT-----------NLDYAEHHEFGAPE-AGIPARPIFGPA 119 (145) T ss_pred cChHHHHHhcCCCCCccCHHHHHHHHHHhhhcccCceeEecC-----------CchhhhhhccCCcc-cccCCCCccCCC Confidence 21 00 01111 0 123334432 1344566664 35789999999864 469999999877 Q ss_pred HHHHHHHHHHHHHHHHh----hcCCc Q lcl|NC_019527. 68 VNEKSSEWPKRLGDAIK----HYDGD 89 (190) Q Consensus 68 ~~~~~~~~~~~l~~~i~----~g~~~ 89 (190) .+..++++.+.+.+.+. .-..| T Consensus 120 ~~~~~~~~~~ii~~~i~~~L~~~~~~ 145 (145) T protein:vir:31 120 GAYASQQAPDVIGDEIDTNLEGAVID 145 (145) T ss_pred ccchHHHHHHHHHHHHHHHhhhhccC Confidence 66555556555555443 22233 No 73 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=97.10 E-value=9.7e-07 Score=53.54 Aligned_cols=80 Identities=21% Similarity=0.318 Sum_probs=48.8 Q ss_pred CCCC-chhhHHHHHHHHHHhhc---------------------CCEEEEEecCCC-C---CCCC-----ccHHHHHHHhh Q lcl|NC_019527. 1 MATL-TGGDKLAKILADIGGKA---------------------QGSVDVGFMSGA-T---YPDG-----TPVAQVAFWNE 49 (190) Q Consensus 1 Ma~i-~~~d~l~~il~~l~~l~---------------------~~~V~VGi~~~~-~---~~dG-----~~vA~iA~~~E 49 (190) ||++ ++.|.|.+-|+++.... ..-|.-|-+..+ + ..+| .+.+.+|.+.| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:95 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVN 80 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEeeCCceEEEEecCCCcccccc Confidence 9965 46666655554332211 011122322221 0 1122 24578999999 Q ss_pred cCcccc--------------------------CCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_019527. 50 FGHGGR--------------------------FPAPPRPFFRNMVNEKSSEWPKRLG 80 (190) Q Consensus 50 fG~~~~--------------------------~~IP~RPFlr~~~~~~~~~~~~~l~ 80 (190) ||+... .+.||||||+|+++.+++++.+.|. T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 81 YGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 997210 2479999999999999999999888 No 74 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=97.06 E-value=1.1e-06 Score=53.28 Aligned_cols=80 Identities=20% Similarity=0.381 Sum_probs=48.2 Q ss_pred CCCCc-hhhHHHHHHHHHHhh---------------------cCCEEEEEecCCC----CCCCC-----ccHHHHHHHhh Q lcl|NC_019527. 1 MATLT-GGDKLAKILADIGGK---------------------AQGSVDVGFMSGA----TYPDG-----TPVAQVAFWNE 49 (190) Q Consensus 1 Ma~i~-~~d~l~~il~~l~~l---------------------~~~~V~VGi~~~~----~~~dG-----~~vA~iA~~~E 49 (190) ||++. |.|+|.+-|+++... ...-|.-|-+..+ ...+| .+.+.+|.+.| T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~ve 80 (135) T protein:vir:96 1 MAKVKYGADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFENGGFTGVVKIGSNYAVYVN 80 (135) T ss_pred CchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEeecCcEEEEEecCCCccchhh Confidence 99775 666554443322110 0111222333222 11233 24678999999 Q ss_pred cCccc------------------------cCCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_019527. 50 FGHGG------------------------RFPAPPRPFFRNMVNEKSSEWPKRLG 80 (190) Q Consensus 50 fG~~~------------------------~~~IP~RPFlr~~~~~~~~~~~~~l~ 80 (190) ||+.. ...+||||||+|++++.++++.+.|. T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 81 YGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQTFEQYFS 135 (135) T ss_pred cccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHHHHHHHhcC Confidence 99721 02489999999999999999888777 No 75 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=97.03 E-value=7.3e-07 Score=54.22 Aligned_cols=74 Identities=18% Similarity=0.245 Sum_probs=42.6 Q ss_pred CC--------------------------CC-chhhHHHHHHHHH-----H---hhcCCEEEEEecCCCCCCCCccHHHHH Q lcl|NC_019527. 1 MA--------------------------TL-TGGDKLAKILADI-----G---GKAQGSVDVGFMSGATYPDGTPVAQVA 45 (190) Q Consensus 1 Ma--------------------------~i-~~~d~l~~il~~l-----~---~l~~~~V~VGi~~~~~~~dG~~vA~iA 45 (190) |. -. +++.+ +.+.| + .-....|.||+..+ .+.+| T Consensus 17 l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~h---l~d~I~vs~~k~~~~~g~~~v~VG~~k~--------~~~~a 85 (125) T protein:vir:47 17 AVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKH---ARDHIAVSNVKTDRHTSEKIVTIGYAKG--------VSHRI 85 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCch---hhhheeecccccccccceEEEEeccCCC--------CceEE Confidence 00 00 00001 11111 1 11233566665322 24677 Q ss_pred HHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019527. 46 FWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDG 88 (190) Q Consensus 46 ~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~ 88 (190) .+.|||+. .+||+||||+++++..+++.+.+...++.-.- T Consensus 86 ~F~E~GT~---k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:47 86 HATEFGTM---YQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred EeccCCcc---CCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 89999984 57999999999999999988887766643211 No 76 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=97.03 E-value=7.3e-07 Score=54.22 Aligned_cols=74 Identities=18% Similarity=0.245 Sum_probs=42.6 Q ss_pred CC--------------------------CC-chhhHHHHHHHHH-----H---hhcCCEEEEEecCCCCCCCCccHHHHH Q lcl|NC_019527. 1 MA--------------------------TL-TGGDKLAKILADI-----G---GKAQGSVDVGFMSGATYPDGTPVAQVA 45 (190) Q Consensus 1 Ma--------------------------~i-~~~d~l~~il~~l-----~---~l~~~~V~VGi~~~~~~~dG~~vA~iA 45 (190) |. -. +++.+ +.+.| + .-....|.||+..+ .+.+| T Consensus 17 l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~h---l~d~I~vs~~k~~~~~g~~~v~VG~~k~--------~~~~a 85 (125) T protein:vir:81 17 AVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKH---ARDHIAVSNVKTDRHTSEKIVTIGYAKG--------VSHRI 85 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCch---hhhheeecccccccccceEEEEeccCCC--------CceEE Confidence 00 00 00001 11111 1 11233566665322 24677 Q ss_pred HHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019527. 46 FWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDG 88 (190) Q Consensus 46 ~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~ 88 (190) .+.|||+. .+||+||||+++++..+++.+.+...++.-.- T Consensus 86 ~F~E~GT~---k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:81 86 HATEFGTM---YQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred EeccCCcc---CCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 89999984 57999999999999999988887766643211 No 77 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=97.03 E-value=7.3e-07 Score=54.22 Aligned_cols=74 Identities=18% Similarity=0.245 Sum_probs=42.6 Q ss_pred CC--------------------------CC-chhhHHHHHHHHH-----H---hhcCCEEEEEecCCCCCCCCccHHHHH Q lcl|NC_019527. 1 MA--------------------------TL-TGGDKLAKILADI-----G---GKAQGSVDVGFMSGATYPDGTPVAQVA 45 (190) Q Consensus 1 Ma--------------------------~i-~~~d~l~~il~~l-----~---~l~~~~V~VGi~~~~~~~dG~~vA~iA 45 (190) |. -. +++.+ +.+.| + .-....|.||+..+ .+.+| T Consensus 17 l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~h---l~d~I~vs~~k~~~~~g~~~v~VG~~k~--------~~~~a 85 (125) T protein:vir:94 17 AVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKH---ARDHIAVSNVKTDRHTSEKIVTIGYAKG--------VSHRI 85 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCch---hhhheeecccccccccceEEEEeccCCC--------CceEE Confidence 00 00 00001 11111 1 11233566665322 24677 Q ss_pred HHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019527. 46 FWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDG 88 (190) Q Consensus 46 ~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~ 88 (190) .+.|||+. .+||+||||+++++..+++.+.+...++.-.- T Consensus 86 ~F~E~GT~---k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:94 86 HATEFGTM---YQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred EeccCCcc---CCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 89999984 57999999999999999988887766643211 No 78 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=97.03 E-value=7.3e-07 Score=54.22 Aligned_cols=74 Identities=18% Similarity=0.245 Sum_probs=42.6 Q ss_pred CC--------------------------CC-chhhHHHHHHHHH-----H---hhcCCEEEEEecCCCCCCCCccHHHHH Q lcl|NC_019527. 1 MA--------------------------TL-TGGDKLAKILADI-----G---GKAQGSVDVGFMSGATYPDGTPVAQVA 45 (190) Q Consensus 1 Ma--------------------------~i-~~~d~l~~il~~l-----~---~l~~~~V~VGi~~~~~~~dG~~vA~iA 45 (190) |. -. +++.+ +.+.| + .-....|.||+..+ .+.+| T Consensus 17 l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~h---l~d~I~vs~~k~~~~~g~~~v~VG~~k~--------~~~~a 85 (125) T protein:vir:98 17 AVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKH---ARDHIAVSNVKTDRHTSEKIVTIGYAKG--------VSHRI 85 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCch---hhhheeecccccccccceEEEEeccCCC--------CceEE Confidence 00 00 00001 11111 1 11233566665322 24677 Q ss_pred HHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019527. 46 FWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDG 88 (190) Q Consensus 46 ~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~ 88 (190) .+.|||+. .+||+||||+++++..+++.+.+...++.-.- T Consensus 86 ~F~E~GT~---k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:98 86 HATEFGTM---YQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred EeccCCcc---CCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 89999984 57999999999999999988887766643211 No 79 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=97.03 E-value=7.3e-07 Score=54.22 Aligned_cols=74 Identities=18% Similarity=0.245 Sum_probs=42.6 Q ss_pred CC--------------------------CC-chhhHHHHHHHHH-----H---hhcCCEEEEEecCCCCCCCCccHHHHH Q lcl|NC_019527. 1 MA--------------------------TL-TGGDKLAKILADI-----G---GKAQGSVDVGFMSGATYPDGTPVAQVA 45 (190) Q Consensus 1 Ma--------------------------~i-~~~d~l~~il~~l-----~---~l~~~~V~VGi~~~~~~~dG~~vA~iA 45 (190) |. -. +++.+ +.+.| + .-....|.||+..+ .+.+| T Consensus 17 l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~h---l~d~I~vs~~k~~~~~g~~~v~VG~~k~--------~~~~a 85 (125) T protein:vir:79 17 AVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKH---ARDHIAVSNVKTDRHTSEKIVTIGYAKG--------VSHRI 85 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCch---hhhheeecccccccccceEEEEeccCCC--------CceEE Confidence 00 00 00001 11111 1 11233566665322 24677 Q ss_pred HHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019527. 46 FWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDG 88 (190) Q Consensus 46 ~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~ 88 (190) .+.|||+. .+||+||||+++++..+++.+.+...++.-.- T Consensus 86 ~F~E~GT~---k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:79 86 HATEFGTM---YQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred EeccCCcc---CCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 89999984 57999999999999999988887766643211 No 80 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=97.00 E-value=7e-07 Score=54.32 Aligned_cols=78 Identities=23% Similarity=0.360 Sum_probs=44.6 Q ss_pred CCCchhhH-------------HHHHHH--------HHHhhcCCEEEEEecCCC---C-CCCC-----ccHHHHHHHhhcC Q lcl|NC_019527. 2 ATLTGGDK-------------LAKILA--------DIGGKAQGSVDVGFMSGA---T-YPDG-----TPVAQVAFWNEFG 51 (190) Q Consensus 2 a~i~~~d~-------------l~~il~--------~l~~l~~~~V~VGi~~~~---~-~~dG-----~~vA~iA~~~EfG 51 (190) .+|+|-|. +.+.++ +++.+. -|.-|-+... . ..+| .+.+.+|.+.||| T Consensus 1 i~i~Gld~l~~~l~~~~~~~~~~~al~~~a~~i~~~ak~~a--Pv~TG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~G 78 (108) T protein:vir:74 1 MKITGIDALQKKLRKNATLDDVKHVVKSNTASMNKNMQNLA--PVDTGNMKRSITSEFTDGGLSGTTGPHTDYAGYVEYG 78 (108) T ss_pred CcchhHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhC--CCCchhhhccceeeeecCceEEEeecCCCcccceecc Confidence 22222222 222221 111111 1111211111 1 1122 2457799999999 Q ss_pred ccccCCCCCCchhhHHHHHHHHHHHHHHHHHHh Q lcl|NC_019527. 52 HGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIK 84 (190) Q Consensus 52 ~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~ 84 (190) +- ..|+||||+|+++.++.++.+.|++.++ T Consensus 79 T~---km~aqpf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 79 TR---FQSAQPFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred cc---ccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 74 4799999999999999999999999887 No 81 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=96.84 E-value=3e-06 Score=50.89 Aligned_cols=73 Identities=15% Similarity=0.167 Sum_probs=42.9 Q ss_pred CC--CCchhhHHHHHHHHHH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCcc----ccCCCCCCchhhHH-HH--- Q lcl|NC_019527. 1 MA--TLTGGDKLAKILADIG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHG----GRFPAPPRPFFRNM-VN--- 69 (190) Q Consensus 1 Ma--~i~~~d~l~~il~~l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~----~~~~IP~RPFlr~~-~~--- 69 (190) .. .+..... +..+|. ......|.||. +..||++|+||.. ..+.||+||||--. -+ T Consensus 71 ~~~~~L~~tG~---L~~Si~~~~~~~~v~vGt-----------n~~YA~iHqfGg~~~~~~~~~iPARPfLG~s~~~e~~ 136 (155) T protein:vir:10 71 GAHPILQVTNA---LARSITTRADRDQAQIGS-----------NLSYAAIQQLGGQAGRGRKVTIPARPYLPVLRNGQLK 136 (155) T ss_pred CCCCccccchh---hhhhhhceecCCEEEEec-----------CcchhhhhhcccccCCCCccccCCccccCCCccccch Confidence 11 1122222 333443 23567888875 2568999999973 34579999999522 11 Q ss_pred -HHHHHHHHHHHHHHhhcC Q lcl|NC_019527. 70 -EKSSEWPKRLGDAIKHYD 87 (190) Q Consensus 70 -~~~~~~~~~l~~~i~~g~ 87 (190) +..+.+.+.+.+.+..|. T Consensus 137 ~ei~~~I~~~i~~~l~~~r 155 (155) T protein:vir:10 137 PSARDAVLDVLLAALSQGR 155 (155) T ss_pred HHHHHHHHHHHHHHHhhcC Confidence 234556666667777776 No 82 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=96.75 E-value=3.2e-06 Score=50.68 Aligned_cols=81 Identities=17% Similarity=0.197 Sum_probs=37.4 Q ss_pred CCCCch--hhHH-HH--------HHHHHHhhc--------------------------CCEEEEEecCCCCCCCCccHHH Q lcl|NC_019527. 1 MATLTG--GDKL-AK--------ILADIGGKA--------------------------QGSVDVGFMSGATYPDGTPVAQ 43 (190) Q Consensus 1 Ma~i~~--~d~l-~~--------il~~l~~l~--------------------------~~~V~VGi~~~~~~~dG~~vA~ 43 (190) +..+.. ..++ .+ +.+.++... ...|.|.+-.+ -+-.. T Consensus 16 l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~~~~~~v~vg~~------~~~~~ 89 (133) T protein:vir:10 16 LTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQGNAVVTLRVGPS------KQHHM 89 (133) T ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCccceEEEEecCC------CCccc Confidence 111110 0010 11 111222111 11111111100 01123 Q ss_pred HHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHH Q lcl|NC_019527. 44 VAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGR 91 (190) Q Consensus 44 iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~ 91 (190) ++.+.|||+. ..||||||+|+++.+++++.+.+.+.+... ++-. T Consensus 90 y~~f~E~GT~---k~~a~PF~~pA~~~~~~~~~~~~~~~~~~~-l~K~ 133 (133) T protein:vir:10 90 KVLAQEFGTV---KQVADPFIRPALDYNVQTVLRVLTVEIRNG-IQNR 133 (133) T ss_pred eEeeeccCCC---CCCCCccchHHHHHhHHHHHHHHHHHHHHH-hhcC Confidence 4445599985 479999999999999998888777665432 0111 No 83 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=96.69 E-value=5.3e-06 Score=49.51 Aligned_cols=87 Identities=20% Similarity=0.238 Sum_probs=43.2 Q ss_pred CCCCc-hhh-HHHH----HH----HHHHhhc-------CCEEEEEecCCCCCCCC-----ccHHHHHHHhhcCccc---- Q lcl|NC_019527. 1 MATLT-GGD-KLAK----IL----ADIGGKA-------QGSVDVGFMSGATYPDG-----TPVAQVAFWNEFGHGG---- 54 (190) Q Consensus 1 Ma~i~-~~d-~l~~----il----~~l~~l~-------~~~V~VGi~~~~~~~dG-----~~vA~iA~~~EfG~~~---- 54 (190) |.++. ..+ .+.+ .. +..+.+. ..++.+-...+ .++ .+.+.+|.+.|||+.. T Consensus 13 L~~l~~~~~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~---~~~~~~~v~~~~~Ya~fvEfGT~~m~a~ 89 (173) T protein:vir:10 13 LRKIGKDIDKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKA---KDLISKKITVNELYGAYMEFGTGAKVSV 89 (173) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeecc---CceeEEeeCCCcccchhhhcccccccCC Confidence 11110 011 1111 11 1112211 01111111100 011 2457889999999721 Q ss_pred ------------------------------------------------cCCCCCCchhhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019527. 55 ------------------------------------------------RFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHY 86 (190) Q Consensus 55 ------------------------------------------------~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g 86 (190) .-+.||||||+|+++.+++++.+.|.+.|..- T Consensus 90 P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqPFl~PA~~~~~~~~~~~i~~~i~~~ 169 (173) T protein:vir:10 90 PKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQPFLYPAWIEGKKQYLKDLENLLKTY 169 (173) T ss_pred CchhhhhhcccccccccccccccccccccccccccchhcccceeeEeecCCCCCCccchhHHHHhHHHHHHHHHHHHHHH Confidence 01489999999999999999999888877532 Q ss_pred CCcHHHHHHHH Q lcl|NC_019527. 87 DGDGRKALASM 97 (190) Q Consensus 87 ~~~~~~aL~~i 97 (190) |..| T Consensus 170 -------lrk~ 173 (173) T protein:vir:10 170 -------NKKI 173 (173) T ss_pred -------hhcC Confidence 1211 No 84 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=96.68 E-value=1.7e-06 Score=52.18 Aligned_cols=82 Identities=23% Similarity=0.345 Sum_probs=44.8 Q ss_pred CC-CCch-----hhHHHHHHHH--------HHhhcCCEEEEEecCCC----CCCCC-----ccHHHHHHHhhcCccc--- Q lcl|NC_019527. 1 MA-TLTG-----GDKLAKILAD--------IGGKAQGSVDVGFMSGA----TYPDG-----TPVAQVAFWNEFGHGG--- 54 (190) Q Consensus 1 Ma-~i~~-----~d~l~~il~~--------l~~l~~~~V~VGi~~~~----~~~dG-----~~vA~iA~~~EfG~~~--- 54 (190) |+ .+.. .+.+.+.+.+ .+.+. -|.-|-+... ...+| .+.+.+|.+.|||+.. T Consensus 16 l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~a--pv~TG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~vE~GT~~~~~ 93 (144) T protein:vir:59 16 MSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLA--PVDEGNLKNSIQIDYKNNGLTAEITVGAEYAIYVEYGTGIYAV 93 (144) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--CccchhhhcCeeEEeecCcEEEEEecCCCccchhhcCcccccc Confidence 11 1110 1122222221 12221 1122222211 01233 2457899999999721 Q ss_pred ---------------------cCCCCCCchhhHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019527. 55 ---------------------RFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHY 86 (190) Q Consensus 55 ---------------------~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g 86 (190) .-++||||||+|+++.+++.+.+.|++++ | T Consensus 94 ~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~~~~~i~~~~--g 144 (144) T protein:vir:59 94 DGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGGEYFEREMRRLR--G 144 (144) T ss_pred CCCccccccccccccccceecCCCCCCCcchhHHHHHHHHHHHHHHHHhc--C Confidence 01489999999999999999999998876 3 No 85 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=96.58 E-value=9.2e-06 Score=48.19 Aligned_cols=73 Identities=11% Similarity=0.091 Sum_probs=44.2 Q ss_pred CCCCchhhHHHHHHHHHH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCcc----ccCCCCCCchhhHHH-----HH Q lcl|NC_019527. 1 MATLTGGDKLAKILADIG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHG----GRFPAPPRPFFRNMV-----NE 70 (190) Q Consensus 1 Ma~i~~~d~l~~il~~l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~----~~~~IP~RPFlr~~~-----~~ 70 (190) ...+.+... +..+|. ....-.|.||- +..||++|+||.. ..++||+||||=-.- .+ T Consensus 73 ~~iL~~tG~---L~~Si~~~~~~~~v~vGt-----------~~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~~ 138 (155) T protein:vir:79 73 HPILQVTNA---LARSVTTWADRNEAGIGS-----------NLVYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAAG 138 (155) T ss_pred CCccccchh---hhhhhhceecCCEEEEec-----------CchhhhhhhcccccCCCCccccCCccccCCCCccccchH Confidence 122233222 233443 22456677764 3568999999974 356799999994332 23 Q ss_pred HHHHHHHHHHHHHhhcC Q lcl|NC_019527. 71 KSSEWPKRLGDAIKHYD 87 (190) Q Consensus 71 ~~~~~~~~l~~~i~~g~ 87 (190) ..+++.+.+.+.+..|. T Consensus 139 ~~~~I~~~i~~~l~r~r 155 (155) T protein:vir:79 139 ARQSILEVVLTALSRNR 155 (155) T ss_pred HHHHHHHHHHHHHHhcC Confidence 45667777777787776 No 86 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=96.57 E-value=1.4e-05 Score=47.27 Aligned_cols=76 Identities=14% Similarity=0.134 Sum_probs=37.5 Q ss_pred CCCCchhhHHHHHHHHHH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccc------------------------- Q lcl|NC_019527. 1 MATLTGGDKLAKILADIG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGG------------------------- 54 (190) Q Consensus 1 Ma~i~~~d~l~~il~~l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~------------------------- 54 (190) =..+.+..+ +.+.|. ......|.||. +..||.+|+||..- T Consensus 73 ~~~L~~tg~---L~~Si~~~~~~~~v~vGt-----------n~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~ 138 (190) T protein:vir:99 73 DKILTLDGH---LRNLLRYQLDGSELLFGS-----------DRPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFV 138 (190) T ss_pred CccceecHH---HHHHHhheecCcEEEEec-----------CcchhhhhhcCCcccccccchhhhhhhhhhhhhhhcccc Confidence 001122223 334454 33566888875 35679999999521 Q ss_pred ----------------cCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHH Q lcl|NC_019527. 55 ----------------RFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMG 98 (190) Q Consensus 55 ----------------~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG 98 (190) .++||+||||--. ++...+|.+.+.+.+... |+... T Consensus 139 ~~~~~~~~~~~~~~~~~v~IPaRpfLG~s-~~d~~~I~~~i~~~l~~~-------~~~~~ 190 (190) T protein:vir:99 139 PRRRSNFAQDVQIGPYTIQMPARPWLGTS-SQDDDTILQRVERYLQRA-------LRERA 190 (190) T ss_pred cccccccchhcccccceeeecCcccCCCC-HHHHHHHHHHHHHHHHHH-------HhhcC Confidence 2469999999433 233344444444333210 00000 No 87 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=96.56 E-value=8.9e-06 Score=48.28 Aligned_cols=73 Identities=12% Similarity=0.095 Sum_probs=44.3 Q ss_pred CCCCchhhHHHHHHHHHH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCcc----ccCCCCCCchhhHHH-----HH Q lcl|NC_019527. 1 MATLTGGDKLAKILADIG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHG----GRFPAPPRPFFRNMV-----NE 70 (190) Q Consensus 1 Ma~i~~~d~l~~il~~l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~----~~~~IP~RPFlr~~~-----~~ 70 (190) ...+.....| ..+|. .-....|.||. +..||++|+||.. ..++||+||||=-.- .+ T Consensus 73 ~~iL~~tg~L---~~Si~~~~~~~~v~vGt-----------n~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~e 138 (155) T protein:vir:99 73 HPILQVTNAL---ARSVTTWADRNEAGIGS-----------NLVYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAAG 138 (155) T ss_pred CCcchhchhh---hhhhhceecCCEEEEec-----------CccchhhhhcccccCCCCccccCCccccCCCCccccchH Confidence 1122322222 33443 22556777764 2457999999973 356799999994332 24 Q ss_pred HHHHHHHHHHHHHhhcC Q lcl|NC_019527. 71 KSSEWPKRLGDAIKHYD 87 (190) Q Consensus 71 ~~~~~~~~l~~~i~~g~ 87 (190) ..+++.+.+.+.+..+. T Consensus 139 ~~~~I~~~i~~~l~~~~ 155 (155) T protein:vir:99 139 ARQSILEIVLTALSRNR 155 (155) T ss_pred HHHHHHHHHHHHHhccC Confidence 45667777777777665 No 88 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=96.50 E-value=3.2e-06 Score=50.70 Aligned_cols=87 Identities=16% Similarity=0.208 Sum_probs=46.4 Q ss_pred CCCCc-hh-hHHHHHHHH---------HHhh--cCCEEEEEecCCCC-C---CCCc-----cHHHHHHHhhcCccc---- Q lcl|NC_019527. 1 MATLT-GG-DKLAKILAD---------IGGK--AQGSVDVGFMSGAT-Y---PDGT-----PVAQVAFWNEFGHGG---- 54 (190) Q Consensus 1 Ma~i~-~~-d~l~~il~~---------l~~l--~~~~V~VGi~~~~~-~---~dG~-----~vA~iA~~~EfG~~~---- 54 (190) |.++. .. +++.+.+++ +++. ..--|.-|-+..+- + .+|. +.+.+|.+.|||+.. T Consensus 10 ~~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~~g~~~~V~~~~~YA~yVE~GTG~~~~~ 89 (141) T protein:vir:78 10 IPKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRKSSKEVIVGNSSDYAIYYEFGTGEKSER 89 (141) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeecCCcEEEEecCCCccceeecCCcccccC Confidence 22211 00 111111111 1111 11123333333221 1 1221 457899999999731 Q ss_pred -------------------cCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCc Q lcl|NC_019527. 55 -------------------RFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGD 89 (190) Q Consensus 55 -------------------~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~ 89 (190) ..+-||||||+++++++++++.+.|.+.|..- | T Consensus 90 ~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i~~~i~~~~~~l--~ 141 (141) T protein:vir:78 90 GGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKVRVFTERALRGI--N 141 (141) T ss_pred CCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHHHHHHHHHhhcc--C Confidence 01369999999999999999999999988543 3 No 89 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=96.50 E-value=7.9e-06 Score=48.56 Aligned_cols=77 Identities=16% Similarity=0.175 Sum_probs=41.8 Q ss_pred CCCCc----------hhh-HHHH--HHHHHH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccc-------cCCCC Q lcl|NC_019527. 1 MATLT----------GGD-KLAK--ILADIG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGG-------RFPAP 59 (190) Q Consensus 1 Ma~i~----------~~d-~l~~--il~~l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~-------~~~IP 59 (190) .+-.+ +.. -+.. +...|. ..+...+.|||..+. +..||++|.||... ++.|| T Consensus 53 W~p~k~~~~~~k~~~~~~~l~~~~~l~~sl~~~~~~~~a~vg~~~G~-------~~~yAaiHQfG~~~r~~~~~~~~~iP 125 (150) T protein:vir:57 53 YAPRQQQSARKKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGGK-------SPKIASVHQFGLSEETRKDGKKIDYP 125 (150) T ss_pred CcccChHHHHHhccCCCcccchhhhhccceeeeeeCcEEEEEeecCC-------chhhhhhhhccccccccCCCceeecC Confidence 11111 000 0000 011121 235667888886442 47899999999643 34699 Q ss_pred CCchhhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019527. 60 PRPFFRNMVNEKSSEWPKRLGDAIKH 85 (190) Q Consensus 60 ~RPFlr~~~~~~~~~~~~~l~~~i~~ 85 (190) +||||=-+- +...++.+.+...+.. T Consensus 126 aRp~LG~s~-~d~~~i~~~i~~~l~r 150 (150) T protein:vir:57 126 ARPLLGFTG-EDVQMIEEIILAHLDR 150 (150) T ss_pred CcccCCCCH-HHHHHHHHHHHHHHhC Confidence 999996553 3345555555555544 No 90 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=96.50 E-value=8.2e-06 Score=48.47 Aligned_cols=77 Identities=16% Similarity=0.210 Sum_probs=41.9 Q ss_pred CCCCch-------h----hHHHH--HHHHHH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccc-------cCCCC Q lcl|NC_019527. 1 MATLTG-------G----DKLAK--ILADIG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGG-------RFPAP 59 (190) Q Consensus 1 Ma~i~~-------~----d~l~~--il~~l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~-------~~~IP 59 (190) .+..+. + .-+.. +...|. ..+...+.|||..+. ++.||++|.||... ++.|| T Consensus 53 W~p~~~~~~~~k~~~~~~~l~~~~~l~~sl~~~~~~~~a~vg~~~Gt-------~~~yAaiHQfG~~~~~~~~~~~~~iP 125 (150) T protein:vir:60 53 YAPRQQQSARKKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGGK-------SPKIASVHQFGLSEENRKDGKKIDYP 125 (150) T ss_pred CcccChHHHHHhhcCCCccchhhhhhcceeeeeeeCcEEEEEeeCCC-------chhhhhhhhccccccccCCCCceecC Confidence 221110 0 00110 011121 235677888886442 47999999999632 34699 Q ss_pred CCchhhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019527. 60 PRPFFRNMVNEKSSEWPKRLGDAIKH 85 (190) Q Consensus 60 ~RPFlr~~~~~~~~~~~~~l~~~i~~ 85 (190) +||||=-+-+ ...++.+.+...|.. T Consensus 126 aRp~LG~s~~-d~~~i~~~i~~~l~r 150 (150) T protein:vir:60 126 ARPLLGFTGE-DVQMIEEIILAHLDR 150 (150) T ss_pred CcccCCCCHH-HHHHHHHHHHHHHhC Confidence 9999965533 345555555555544 No 91 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=96.48 E-value=7.2e-06 Score=48.78 Aligned_cols=75 Identities=20% Similarity=0.260 Sum_probs=39.9 Q ss_pred CCCCchhhH-H---HHHHHHHH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCcc----ccCCCCCCchhhHH---- Q lcl|NC_019527. 1 MATLTGGDK-L---AKILADIG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHG----GRFPAPPRPFFRNM---- 67 (190) Q Consensus 1 Ma~i~~~d~-l---~~il~~l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~----~~~~IP~RPFlr~~---- 67 (190) +..-..+.+ | -.+..+|. ......|.||- +..||++|+||.. ..++||+||||=-. T Consensus 83 ~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGt-----------n~~YAaiHqfGg~~~~~~~v~iPaRpfLG~s~~d~ 151 (175) T protein:vir:10 83 ASRRKAGLMILQDSGQMAASVSTDHDDNSAVIGS-----------NKEYAAIHQFGGQAGRGLKVTIPARPWLPVTADGE 151 (175) T ss_pred hhhhccCCCcceechhhhhhhheeecCCEEEEec-----------ChhhhhhhhcccccCCCCccccCCccccCCCcccc Confidence 000011111 0 11333443 22455677765 3568999999974 34579999999643 Q ss_pred -----HHHHHHHHHHHHHHHHhhc Q lcl|NC_019527. 68 -----VNEKSSEWPKRLGDAIKHY 86 (190) Q Consensus 68 -----~~~~~~~~~~~l~~~i~~g 86 (190) +++..+.|.+.|+.++..- T Consensus 152 ~~~e~~~~Il~~~~~~l~~~~~~~ 175 (175) T protein:vir:10 152 LQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred cchHHHHHHHHHHHHHHHHHhccC Confidence 2334455555666655432 No 92 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=96.41 E-value=4.4e-06 Score=49.95 Aligned_cols=75 Identities=15% Similarity=0.150 Sum_probs=39.0 Q ss_pred CCCCchhhHH----HHHHHHHHh-hcCCEEEEEecCCCCCCCCccHHHHHHHhhcCcc----ccCCCCCCchhhHHH--- Q lcl|NC_019527. 1 MATLTGGDKL----AKILADIGG-KAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHG----GRFPAPPRPFFRNMV--- 68 (190) Q Consensus 1 Ma~i~~~d~l----~~il~~l~~-l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~----~~~~IP~RPFlr~~~--- 68 (190) +....++.++ -.+..+|.. .....|.||- +..||++|+||.. ..++||+||||--.- T Consensus 83 ~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGt-----------n~~YAaiHqfGg~~~~~~~v~IPARPfLG~s~~de 151 (175) T protein:vir:79 83 ASRRKAGLMILQDSGQMAASTATDSGEDYSVIGS-----------NKEYAAIQHFGGQAGRGLKVTIPGRAWLPVTADGE 151 (175) T ss_pred HhhhccCCCcceechhhhhhhhheecCCEEEEec-----------CcchhhHhhcccccCCCcccccCcccccCCCcccc Confidence 1111111110 123334432 2556777766 2568999999974 345799999996432 Q ss_pred ------HHHHHHHHHHHHHHHhhc Q lcl|NC_019527. 69 ------NEKSSEWPKRLGDAIKHY 86 (190) Q Consensus 69 ------~~~~~~~~~~l~~~i~~g 86 (190) ++..+.+.+.|+.++..- T Consensus 152 ~~~~~~~~I~~~i~~~l~~a~~~~ 175 (175) T protein:vir:79 152 LQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred hhHHHHHHHHHHHHHHHHHHhccC Confidence 223344444555555322 No 93 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=96.25 E-value=2.5e-05 Score=45.82 Aligned_cols=76 Identities=13% Similarity=0.153 Sum_probs=42.2 Q ss_pred CCCCc-------hh----hHHH--HHHHHHH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCcc-------ccCCCC Q lcl|NC_019527. 1 MATLT-------GG----DKLA--KILADIG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHG-------GRFPAP 59 (190) Q Consensus 1 Ma~i~-------~~----d~l~--~il~~l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~-------~~~~IP 59 (190) .+..+ .+ .-+. .+...|. ......|.|||.+. +..||++|.||.. .++.|| T Consensus 53 W~p~~~~~~~~k~~~~~~~l~~~g~l~~sl~~~~~~~~~~V~~~Gs--------~~~yAa~HQfG~~~r~~~~~~~~~iP 124 (149) T protein:vir:98 53 YAARKRQSVRSKKGRIRREMFARLRTNRFMKAKGSDSAAVVEFTGR--------VQRMARVHQYGLKDRPNRHSRDVQYA 124 (149) T ss_pred CcccchHHHHhccCCCCcccchhhhhhhhhhheecCCeeEEEecCc--------chHHhhHhhccccccccCCCcceecc Confidence 11110 00 0011 1112332 23566888988632 4789999999963 245799 Q ss_pred CCchhhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019527. 60 PRPFFRNMVNEKSSEWPKRLGDAIKH 85 (190) Q Consensus 60 ~RPFlr~~~~~~~~~~~~~l~~~i~~ 85 (190) +||||=-+ ++...++.+.+.+.+.. T Consensus 125 aRp~LG~s-~~d~~~i~~~i~~~l~~ 149 (149) T protein:vir:98 125 ARPLLGFT-RDDEQMIEDIIIRHLGK 149 (149) T ss_pred ccccCCCC-HHHHHHHHHHHHHHhhC Confidence 99999533 33345566666665644 No 94 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=96.17 E-value=7.6e-05 Score=43.17 Aligned_cols=98 Identities=9% Similarity=0.069 Sum_probs=67.5 Q ss_pred HHHHHHHHHHHHHHHHhhc-CCcHHHHHHHHHHHHHHHHHHHHhcc------CCCCCChHHHHHhccccccccchhhhhh Q lcl|NC_019527. 68 VNEKSSEWPKRLGDAIKHY-DGDGRKALASMGEMIGGDLGSSIIST------NEPALSKTTLMLRSIYGNNPQEIRARDV 140 (190) Q Consensus 68 ~~~~~~~~~~~l~~~i~~g-~~~~~~aL~~iG~~a~~~Iq~~I~~~------~~pPnap~Ti~~K~~~~~~~~~~~~~~~ 140 (190) +++...++.+.|...+... ..+....|..||.......+..|... .|+|+++.|...+.+.+.. T Consensus 1 m~~~~~~l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g--------- 71 (155) T protein:vir:79 1 MTDDLQALERWAGGLLAKLSPAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAG--------- 71 (155) T ss_pred CchHHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccC--------- Confidence 5556666666666665432 23456799999999999999999985 5788888887665532211 Q ss_pred hhhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEE---eec--cCC Q lcl|NC_019527. 141 LAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKV---KVN--YGR 190 (190) Q Consensus 141 ~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~---~~~--~~~ 190 (190) ....++|.+++.+-++|+|.+. ....-| +.| |++ T Consensus 72 ---------------~~~~~~m~~~l~~a~~l~~~~~-~d~a~Vg~~Gs~~~yAa 110 (155) T protein:vir:79 72 ---------------RVKREAMFRKLRTARYLRIDVD-STGLAIGFDERLSRIAR 110 (155) T ss_pred ---------------cccchhhhhhhhhhheeeeeec-CcEEEEEecCcchhhhh Confidence 1135689999999999999985 344445 322 544 No 95 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=96.11 E-value=7.2e-05 Score=43.31 Aligned_cols=92 Identities=12% Similarity=0.059 Sum_probs=61.1 Q ss_pred HHHHHHHHHHHHHHHHhhc-CCcHHHHHHHHHHHHHHHHHHHHhcc------CCCCCChHHHHHhccccccccchhhhhh Q lcl|NC_019527. 68 VNEKSSEWPKRLGDAIKHY-DGDGRKALASMGEMIGGDLGSSIIST------NEPALSKTTLMLRSIYGNNPQEIRARDV 140 (190) Q Consensus 68 ~~~~~~~~~~~l~~~i~~g-~~~~~~aL~~iG~~a~~~Iq~~I~~~------~~pPnap~Ti~~K~~~~~~~~~~~~~~~ 140 (190) +++ -.++.+.|...+..- ..+.+.+|..||..+...+++.|... .|+|+++.|+..|..+ T Consensus 1 m~~-~~~~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~------------ 67 (149) T protein:vir:18 1 MSE-LTALQERLAGLIASLSPAARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGR------------ 67 (149) T ss_pred Cch-HHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCc------------ Confidence 222 233334444444332 12346799999999999999999985 5999999998766532 Q ss_pred hhhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEE---ee--ccCC Q lcl|NC_019527. 141 LAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKV---KV--NYGR 190 (190) Q Consensus 141 ~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~---~~--~~~~ 190 (190) ..+||..++.+.+++++.+..- ...| +. -|++ T Consensus 68 -----------------~~~~~~~~l~~~~~l~~~~~~~-~~~v~~~Gtn~~yAa 104 (149) T protein:vir:18 68 -----------------IKREMFAKLRTSRFMKAKGSDS-AAVVEFTGKVQRMAR 104 (149) T ss_pred -----------------ccchhhhhhhhhhhhheeecCc-eeEEEecccchhhhh Confidence 2578999999999998876633 3343 22 2333 No 96 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=96.10 E-value=2.8e-05 Score=45.56 Aligned_cols=77 Identities=13% Similarity=0.167 Sum_probs=42.9 Q ss_pred CCCCc---------hhhH-H-H--HHHHHHH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCcc-------ccCCCC Q lcl|NC_019527. 1 MATLT---------GGDK-L-A--KILADIG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHG-------GRFPAP 59 (190) Q Consensus 1 Ma~i~---------~~d~-l-~--~il~~l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~-------~~~~IP 59 (190) .+-.+ .+.+ + . .+...|. ..+...+.|||..+. ++.||++|.||.. .++.|| T Consensus 53 W~p~k~~~~~~k~g~~~~~l~~~~~l~~sl~~~~~~~~~~vg~~~Gs-------~~~yAa~HQfG~~~~~~~~~~~~~iP 125 (150) T protein:vir:20 53 YAPRQQQSVRKKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGGK-------SPKIASVHQFGLSEENRKDGKKIDYP 125 (150) T ss_pred CcccchHHHHHhccCCCccccchhhhhhhhheeecCcEEEEEeeCCc-------chhhhhhhhcccccccccCCCceecc Confidence 11000 0000 0 0 1122342 335778999986543 4789999999953 234699 Q ss_pred CCchhhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019527. 60 PRPFFRNMVNEKSSEWPKRLGDAIKH 85 (190) Q Consensus 60 ~RPFlr~~~~~~~~~~~~~l~~~i~~ 85 (190) +||||=-+-+ ...++.+.+.+.+.. T Consensus 126 aRp~LG~s~~-d~~~i~~~i~~~l~k 150 (150) T protein:vir:20 126 ARPLLGFTGE-DVQMIEEIILAHLER 150 (150) T ss_pred ccccCCCCHH-HHHHHHHHHHHHHhC Confidence 9999965533 345555555555544 No 97 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=96.05 E-value=1.7e-05 Score=46.68 Aligned_cols=76 Identities=11% Similarity=0.122 Sum_probs=40.1 Q ss_pred CCCCc----------hhhHH--HHHHHH------HH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccc------- Q lcl|NC_019527. 1 MATLT----------GGDKL--AKILAD------IG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGG------- 54 (190) Q Consensus 1 Ma~i~----------~~d~l--~~il~~------l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~------- 54 (190) .+-.+ ...++ .-++.. |. ..+...+.|||.+ +++.||++|.||... T Consensus 54 W~prk~~~~~~~~~~~~g~~~~~~m~~~l~~a~~l~~~~~~d~a~Vg~~G--------s~~~yAaiHQfG~~~r~~~~~~ 125 (155) T protein:vir:79 54 YEPRKVKAGGKRLREKAGRVKREAMFRKLRTARYLRIDVDSTGLAIGFDE--------RLSRIARVHQEGQKAPVEPGGP 125 (155) T ss_pred CcccchhhhhhhhhcccCcccchhhhhhhhhhheeeeeecCcEEEEEecC--------cchhhhhhhhcCCcccCCCCCc Confidence 11100 00001 001112 21 2355678888742 257899999999642 Q ss_pred cCCCCCCchhhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019527. 55 RFPAPPRPFFRNMVNEKSSEWPKRLGDAIKH 85 (190) Q Consensus 55 ~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~ 85 (190) +++||+||||--+-+ ...++.+.+...|.. T Consensus 126 ~v~iPaRp~LGls~~-d~~~I~~~i~~~l~r 155 (155) T protein:vir:79 126 LAQYPVRVVLGFSDA-DRELVRDRLLRELTR 155 (155) T ss_pred ccccccccccCCCHH-HHHHHHHHHHHHhhC Confidence 457999999944422 345555555555544 No 98 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=95.89 E-value=8.5e-06 Score=48.39 Aligned_cols=81 Identities=20% Similarity=0.168 Sum_probs=38.8 Q ss_pred CCCCc----hhhH-H-----------HHHH----HHHHhhc--CCEEEEEecCCC--------CCCC---C--ccHHHHH Q lcl|NC_019527. 1 MATLT----GGDK-L-----------AKIL----ADIGGKA--QGSVDVGFMSGA--------TYPD---G--TPVAQVA 45 (190) Q Consensus 1 Ma~i~----~~d~-l-----------~~il----~~l~~l~--~~~V~VGi~~~~--------~~~d---G--~~vA~iA 45 (190) |++++ +.+. + ++.+ ..+.... ..-|.-|-+..+ ..+. + .+.+.+| T Consensus 1 m~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA 80 (142) T protein:vir:99 1 MVQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYA 80 (142) T ss_pred CceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCcccc Confidence 66443 2221 1 1111 1111111 111222333221 0111 1 2468899 Q ss_pred HHhhcCccc-----------------------cC---CCCCCchhhHHHHHHHHHHHHHHHH Q lcl|NC_019527. 46 FWNEFGHGG-----------------------RF---PAPPRPFFRNMVNEKSSEWPKRLGD 81 (190) Q Consensus 46 ~~~EfG~~~-----------------------~~---~IP~RPFlr~~~~~~~~~~~~~l~~ 81 (190) .++|||+.- .+ +.||||||+++++.+..+-.....+ T Consensus 81 ~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:99 81 AAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred ceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 999999831 01 4669999999999888764443333 No 99 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=95.89 E-value=8.5e-06 Score=48.39 Aligned_cols=81 Identities=20% Similarity=0.168 Sum_probs=38.8 Q ss_pred CCCCc----hhhH-H-----------HHHH----HHHHhhc--CCEEEEEecCCC--------CCCC---C--ccHHHHH Q lcl|NC_019527. 1 MATLT----GGDK-L-----------AKIL----ADIGGKA--QGSVDVGFMSGA--------TYPD---G--TPVAQVA 45 (190) Q Consensus 1 Ma~i~----~~d~-l-----------~~il----~~l~~l~--~~~V~VGi~~~~--------~~~d---G--~~vA~iA 45 (190) |++++ +.+. + ++.+ ..+.... ..-|.-|-+..+ ..+. + .+.+.+| T Consensus 1 m~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA 80 (142) T protein:vir:86 1 MVQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYA 80 (142) T ss_pred CceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCcccc Confidence 66443 2221 1 1111 1111111 111222333221 0111 1 2468899 Q ss_pred HHhhcCccc-----------------------cC---CCCCCchhhHHHHHHHHHHHHHHHH Q lcl|NC_019527. 46 FWNEFGHGG-----------------------RF---PAPPRPFFRNMVNEKSSEWPKRLGD 81 (190) Q Consensus 46 ~~~EfG~~~-----------------------~~---~IP~RPFlr~~~~~~~~~~~~~l~~ 81 (190) .++|||+.- .+ +.||||||+++++.+..+-.....+ T Consensus 81 ~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:86 81 AAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred ceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 999999831 01 4669999999999888764443333 No 100 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=95.77 E-value=0.00016 Score=41.33 Aligned_cols=91 Identities=10% Similarity=0.021 Sum_probs=61.8 Q ss_pred HHHHHHHHHHHHHHHhhc-CCcHHHHHHHHHHHHHHHHHHHHhcc------CCCCCChHHHHHhccccccccchhhhhhh Q lcl|NC_019527. 69 NEKSSEWPKRLGDAIKHY-DGDGRKALASMGEMIGGDLGSSIIST------NEPALSKTTLMLRSIYGNNPQEIRARDVL 141 (190) Q Consensus 69 ~~~~~~~~~~l~~~i~~g-~~~~~~aL~~iG~~a~~~Iq~~I~~~------~~pPnap~Ti~~K~~~~~~~~~~~~~~~~ 141 (190) -+.-+++.+.|...+... ..+-..+|..||..+....++.|++. .|+|+++.|.++|++. T Consensus 1 m~~~~~l~~~L~~ll~~l~~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~------------- 67 (148) T protein:vir:79 1 MSESRELEAWLAGMLTKLDAPARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRI------------- 67 (148) T ss_pred CccHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccc------------- Confidence 222455555566655443 12346789999999999999999984 4789999998777642 Q ss_pred hhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEee-----ccCC Q lcl|NC_019527. 142 AAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKV-----NYGR 190 (190) Q Consensus 142 ~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~-----~~~~ 190 (190) .+||.+++.+..++++.+. ..+..|.. .|++ T Consensus 68 -----------------~~~~~~~l~~~~~l~~~~~-~~~~~v~~~Gt~~~yAa 103 (148) T protein:vir:79 68 -----------------RRAMFMRLRLARYMKTQAD-ANTAVVTFAGNAQRIAT 103 (148) T ss_pred -----------------cccccchhhhhhheeeeee-CCeeeEEeeccchhhhh Confidence 4678888888888887774 44555532 2333 No 101 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=95.73 E-value=3.3e-05 Score=45.14 Aligned_cols=76 Identities=14% Similarity=0.121 Sum_probs=40.0 Q ss_pred CCCC------chhh---HHHHHH---HHHH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccc-------cCCCCC Q lcl|NC_019527. 1 MATL------TGGD---KLAKIL---ADIG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGG-------RFPAPP 60 (190) Q Consensus 1 Ma~i------~~~d---~l~~il---~~l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~-------~~~IP~ 60 (190) .... +++. .+-..+ ..|. ......+.|||. |+ +..||++|.||... ++.||+ T Consensus 53 W~p~s~~~~~~~g~~~~~~~~~l~~~~~l~~~~~~~~~~v~~~-------Gt-~~~yAaiHQfG~~~r~~~~~~~v~iPa 124 (148) T protein:vir:79 53 YVPRKPQLRHRAGRIRRAMFMRLRLARYMKTQADANTAVVTFA-------GN-AQRIATVHQFGLRDRVNKAGLTAQYPA 124 (148) T ss_pred CcccchHHHhhcccccccccchhhhhhheeeeeeCCeeeEEee-------cc-chhhhhhhhcCccccccCCCCccccCc Confidence 1100 0000 000111 1121 123446777763 22 47899999999542 456999 Q ss_pred CchhhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019527. 61 RPFFRNMVNEKSSEWPKRLGDAIKH 85 (190) Q Consensus 61 RPFlr~~~~~~~~~~~~~l~~~i~~ 85 (190) ||||=-+ ++...++.+.+...|.+ T Consensus 125 Rp~LG~s-~~d~~~i~~~i~~~l~~ 148 (148) T protein:vir:79 125 RELLGMD-GVDMEHITNLLLLHLGA 148 (148) T ss_pred ccccCCC-HHHHHHHHHHHHHHhcC Confidence 9999644 23455666666666644 No 102 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=95.51 E-value=8.2e-06 Score=48.47 Aligned_cols=77 Identities=18% Similarity=0.281 Sum_probs=40.7 Q ss_pred CCCCchhhHHHHHH----HHHHhhcCCEEEEEecCCCC----CCCCc-----cHHHHHHHhhcCccc------------- Q lcl|NC_019527. 1 MATLTGGDKLAKIL----ADIGGKAQGSVDVGFMSGAT----YPDGT-----PVAQVAFWNEFGHGG------------- 54 (190) Q Consensus 1 Ma~i~~~d~l~~il----~~l~~l~~~~V~VGi~~~~~----~~dG~-----~vA~iA~~~EfG~~~------------- 54 (190) |-.+- -+.+.+.- +.++.+. -|.-|-+...- ..+|. +.+.+|.+.|||+.. T Consensus 1 v~~~v-~~~~~~~~~~i~~~ak~~a--Pv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~ 77 (116) T protein:vir:12 1 MERWV-KRGIAKTTAKIHNTIISLM--PVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKI 77 (116) T ss_pred ChHHH-HHHHHHHHHHHHHHHHHhC--CcCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCccccccc Confidence 11110 01121111 1222221 11222222110 12231 467899999999521 Q ss_pred -------------cCCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_019527. 55 -------------RFPAPPRPFFRNMVNEKSSEWPKRLG 80 (190) Q Consensus 55 -------------~~~IP~RPFlr~~~~~~~~~~~~~l~ 80 (190) ..++||||||+|++++++..+.+.|. T Consensus 78 ~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 78 PWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 11589999999999999999888887 No 103 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=95.51 E-value=8.2e-06 Score=48.47 Aligned_cols=77 Identities=18% Similarity=0.281 Sum_probs=40.7 Q ss_pred CCCCchhhHHHHHH----HHHHhhcCCEEEEEecCCCC----CCCCc-----cHHHHHHHhhcCccc------------- Q lcl|NC_019527. 1 MATLTGGDKLAKIL----ADIGGKAQGSVDVGFMSGAT----YPDGT-----PVAQVAFWNEFGHGG------------- 54 (190) Q Consensus 1 Ma~i~~~d~l~~il----~~l~~l~~~~V~VGi~~~~~----~~dG~-----~vA~iA~~~EfG~~~------------- 54 (190) |-.+- -+.+.+.- +.++.+. -|.-|-+...- ..+|. +.+.+|.+.|||+.. T Consensus 1 v~~~v-~~~~~~~~~~i~~~ak~~a--Pv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~ 77 (116) T protein:vir:97 1 MERWV-KRGIAKTTAKIHNTIISLM--PVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKI 77 (116) T ss_pred ChHHH-HHHHHHHHHHHHHHHHHhC--CcCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCccccccc Confidence 11110 01121111 1222221 11222222110 12231 467899999999521 Q ss_pred -------------cCCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_019527. 55 -------------RFPAPPRPFFRNMVNEKSSEWPKRLG 80 (190) Q Consensus 55 -------------~~~IP~RPFlr~~~~~~~~~~~~~l~ 80 (190) ..++||||||+|++++++..+.+.|. T Consensus 78 ~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 78 PWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 11589999999999999999888887 No 104 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=95.51 E-value=8.6e-05 Score=42.88 Aligned_cols=76 Identities=13% Similarity=0.194 Sum_probs=40.2 Q ss_pred CCC-----C--chhhHHHHHHHH------HH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCcc-------ccCCCC Q lcl|NC_019527. 1 MAT-----L--TGGDKLAKILAD------IG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHG-------GRFPAP 59 (190) Q Consensus 1 Ma~-----i--~~~d~l~~il~~------l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~-------~~~~IP 59 (190) .+. . +.+.....++.. |. ......+.|||.+ + +..||++|.||.. .++.|| T Consensus 53 W~p~~~~~~~~~~g~~~~~~~~~l~~~~~l~~~~~~~~~~v~~~G-------t-n~~yAaiHQfG~~~r~~~~~~~v~iP 124 (149) T protein:vir:18 53 YAARKRQPVRSKKGRIKREMFAKLRTSRFMKAKGSDSAAVVEFTG-------K-VQRMARVHQYGLKDRPNRNSRDVQYE 124 (149) T ss_pred CcccchhhhhhccCcccchhhhhhhhhhhhheeecCceeEEEecc-------c-chhhhhhhhccccccccCCCcccccc Confidence 110 0 111111111111 21 1234567777752 2 4689999999964 234699 Q ss_pred CCchhhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019527. 60 PRPFFRNMVNEKSSEWPKRLGDAIKH 85 (190) Q Consensus 60 ~RPFlr~~~~~~~~~~~~~l~~~i~~ 85 (190) +||||=-+ ++...++.+.+.+.+.. T Consensus 125 aRp~LG~s-~~d~~~I~~~i~~~l~~ 149 (149) T protein:vir:18 125 ARPLLGFT-RDDEQMIEDVIISHLGK 149 (149) T ss_pred ccccCCCC-HHHHHHHHHHHHHHHhC Confidence 99999654 33445666666666644 No 105 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=95.45 E-value=0.00033 Score=39.70 Aligned_cols=95 Identities=9% Similarity=0.036 Sum_probs=59.4 Q ss_pred HHHHHHHHHHHHHHHHhhc-CCcHHHHHHHHHHHHHHHHHHHHhcc------CCCCCChHHHHHhccccccccchhhhhh Q lcl|NC_019527. 68 VNEKSSEWPKRLGDAIKHY-DGDGRKALASMGEMIGGDLGSSIIST------NEPALSKTTLMLRSIYGNNPQEIRARDV 140 (190) Q Consensus 68 ~~~~~~~~~~~l~~~i~~g-~~~~~~aL~~iG~~a~~~Iq~~I~~~------~~pPnap~Ti~~K~~~~~~~~~~~~~~~ 140 (190) +++...++.+.|...+... ..+.+..|..||..+....++.|... .|+|+++.|++.|..+.. T Consensus 1 m~~~~~~l~~~L~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~---------- 70 (156) T protein:vir:11 1 MADSLEALEDWAGPILRALEPGPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQGRIR---------- 70 (156) T ss_pred CchhHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhccccc---------- Confidence 5556666666666666443 23457799999999999999999984 588999999977764321 Q ss_pred hhhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEE---eec--cCC Q lcl|NC_019527. 141 LAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKV---KVN--YGR 190 (190) Q Consensus 141 ~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~---~~~--~~~ 190 (190) ...++.....+..+|++.+. .....| +.| |++ T Consensus 71 -----------------~~~~m~~~l~~~~~l~~~~~-~~~a~vg~~Gs~~~yA~ 107 (156) T protein:vir:11 71 -----------------RKIKMFQKLRTVRYLRAKGD-AQAITVSFAGRIARIAR 107 (156) T ss_pred -----------------cchhhhhhhhhhheeeeeec-CcEEEEEecCCchhhhh Confidence 12234333334444666553 333444 222 444 No 106 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=95.39 E-value=9.1e-06 Score=48.22 Aligned_cols=75 Identities=19% Similarity=0.292 Sum_probs=40.8 Q ss_pred Cchh--hHHHHHHH----HHHhhcCCEEEEEecCCCC----CCCC-----ccHHHHHHHhhcCccc-------------- Q lcl|NC_019527. 4 LTGG--DKLAKILA----DIGGKAQGSVDVGFMSGAT----YPDG-----TPVAQVAFWNEFGHGG-------------- 54 (190) Q Consensus 4 i~~~--d~l~~il~----~l~~l~~~~V~VGi~~~~~----~~dG-----~~vA~iA~~~EfG~~~-------------- 54 (190) |... +.+.+.-. ..+.+. -|.-|-+...- ..+| .+.+.+|.+.|||+.. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~a--pv~TG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~yvE~GTg~~~~~~~~~~~~~~~ 78 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLM--PVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKNIP 78 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhC--CccccccccceeEEeecCcEEEEEecCCCccceeecCccccccCCCcccccccc Confidence 2211 11222222 222221 12222222110 1122 1457899999999521 Q ss_pred ------------cCCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_019527. 55 ------------RFPAPPRPFFRNMVNEKSSEWPKRLG 80 (190) Q Consensus 55 ------------~~~IP~RPFlr~~~~~~~~~~~~~l~ 80 (190) ..+.||||||+|++++++..+.+.|. T Consensus 79 ~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 79 WSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ceeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 11589999999999999999888888 No 107 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=95.17 E-value=6.4e-05 Score=43.56 Aligned_cols=80 Identities=10% Similarity=0.081 Sum_probs=42.8 Q ss_pred CCCCc---------hhhHHHHHHHH------HH-hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCcc-------ccCC Q lcl|NC_019527. 1 MATLT---------GGDKLAKILAD------IG-GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHG-------GRFP 57 (190) Q Consensus 1 Ma~i~---------~~d~l~~il~~------l~-~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~-------~~~~ 57 (190) .+..+ ..++-..++.. |. ..+...+.|||.+ ++..||++|.||.. .++. T Consensus 54 W~p~~~~~~~~~~~~~~~~~~m~~~l~~~~~l~~~~~~~~a~vg~~G--------s~~~yA~iHQfG~~~~~~~~~~~v~ 125 (156) T protein:vir:11 54 YEPRKKRELRGKQGRIRRKIKMFQKLRTVRYLRAKGDAQAITVSFAG--------RIARIARVHQYGLRDRAEPGAPEVS 125 (156) T ss_pred CcccchHHHhhhccccccchhhhhhhhhhheeeeeecCcEEEEEecC--------CchhhhhhhcccccccccCCCCccc Confidence 11000 00111111111 21 2256678888853 24789999999964 2456 Q ss_pred CCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHH Q lcl|NC_019527. 58 APPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGR 91 (190) Q Consensus 58 IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~ 91 (190) ||+||||--+- +...++.+.+.+.+... ++- T Consensus 126 iPaRp~LG~s~-~d~~~i~~~i~~~l~~~--~~~ 156 (156) T protein:vir:11 126 YAQRLLLGFDS-SDMETIQNGILAHIDAN--SPI 156 (156) T ss_pred ccccccCCCCH-HHHHHHHHHHHHHHhhc--CCC Confidence 99999995442 34556666666666432 111 No 108 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=95.09 E-value=2.1e-05 Score=46.26 Aligned_cols=77 Identities=14% Similarity=0.197 Sum_probs=45.2 Q ss_pred CCCCchh---hHHHHHH----HHHHhh----------------cCCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCC Q lcl|NC_019527. 1 MATLTGG---DKLAKIL----ADIGGK----------------AQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFP 57 (190) Q Consensus 1 Ma~i~~~---d~l~~il----~~l~~l----------------~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~ 57 (190) |...-.- ..|.+.. ++++.. ...-|.||+. .+-+.++-.+|||+. . T Consensus 19 ~g~~~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkkik~~~kk~g~~~VG~~--------ks~~fy~kF~EFGTS---k 87 (119) T protein:vir:10 19 DMVLDESTKRKGIKAGITKIGKAIEKNSPIKSGRLSKVKIRVKNTGLATEGTA--------SSSEFYDIFQNFGTS---E 87 (119) T ss_pred hhhhhHHHHHHHHHHHhHHHHHHHhhcCCcccCCcceeeeeeecCceeEeccC--------Ccchhhhhhcccccc---c Confidence 3311110 1122211 122111 1113455552 245789999999985 3 Q ss_pred CCCC-chhhHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019527. 58 APPR-PFFRNMVNEKSSEWPKRLGDAIKHYDG 88 (190) Q Consensus 58 IP~R-PFlr~~~~~~~~~~~~~l~~~i~~g~~ 88 (190) .|+| |||.+++++..++....+.+-|..... T Consensus 88 m~a~~pF~~~a~~~~~~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 88 QKAHVGYFDRAVDETTNEAVEEVAEIIFRKMR 119 (119) T ss_pred cCCCCCccccccccChHHHHHHHHHHHHHhcC Confidence 7999 999999999999988888776654422 No 109 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=94.84 E-value=0.00016 Score=41.40 Aligned_cols=74 Identities=22% Similarity=0.142 Sum_probs=32.9 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhh Q lcl|NC_019527. 64 FRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAA 143 (190) Q Consensus 64 lr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~ 143 (190) |-.. .-.-+++.+.|+++-..-+-..+++|...+..+++.++. T Consensus 1 Ma~~-~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~------------------------------------ 43 (135) T protein:vir:96 1 MAKV-KYGADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIH------------------------------------ 43 (135) T ss_pred Cchh-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------------------------ Confidence 1110 001122333333222111112334444444433333211 Q ss_pred HHHhhhcccccccCccCchHHHHHHHhhcceeeecCc-eeEEe--eccCC Q lcl|NC_019527. 144 QELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA-TIKVK--VNYGR 190 (190) Q Consensus 144 ~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~-~~~~~--~~~~~ 190 (190) .-| +|||.|++||+++|+++. +..|+ ++|.. T Consensus 44 ---------------~ap-vdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~ 77 (135) T protein:vir:96 44 ---------------LMP-VDTGFLRQSTTVDFENGGFTGVVKIGSNYAV 77 (135) T ss_pred ---------------hCC-ccchhhhcceeEEeecCcEEEEEecCCCccc Confidence 224 699999999999988765 44443 33433 No 110 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=94.33 E-value=0.00015 Score=41.63 Aligned_cols=53 Identities=23% Similarity=0.222 Sum_probs=28.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhhHHHhhhcccccccCccCchHHHHHHH Q lcl|NC_019527. 90 GRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAAQELVEEGFQGAGGSQAKPLVWTGHML 169 (190) Q Consensus 90 ~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~ 169 (190) +++++...-..+...|++.+.. ..| +|||.|+ T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~-----------------------------------------------~ap-v~TG~Lr 32 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIIS-----------------------------------------------LMP-VDTGYLR 32 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHh-----------------------------------------------hCC-ccccccc Confidence 3333333333333333333322 235 7999999 Q ss_pred hhcceeeecCc-eeEEe--eccCC Q lcl|NC_019527. 170 NSITYQVDGGA-TIKVK--VNYGR 190 (190) Q Consensus 170 ~SIty~V~~g~-~~~~~--~~~~~ 190 (190) +||++.+.++. +..|. +.|+. T Consensus 33 ~SI~~~~~~~~~~~~V~~~~~Ya~ 56 (116) T protein:vir:95 33 ESVTMDFKDGGFTGVINIGSEYAI 56 (116) T ss_pred cceeEEeecCcEEEEEecCCCccc Confidence 99999998875 33333 33544 No 111 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=93.80 E-value=0.0005 Score=38.70 Aligned_cols=85 Identities=19% Similarity=0.264 Sum_probs=47.0 Q ss_pred CCC-Cchhh------------------HHHHHHHHHHhhcCCEEEEEecCCC----------CCCCCc-------cHHHH Q lcl|NC_019527. 1 MAT-LTGGD------------------KLAKILADIGGKAQGSVDVGFMSGA----------TYPDGT-------PVAQV 44 (190) Q Consensus 1 Ma~-i~~~d------------------~l~~il~~l~~l~~~~V~VGi~~~~----------~~~dG~-------~vA~i 44 (190) ||+ |+-.+ .++++++++....-+.++-+-|... ...+|. +--+| T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~~~~~~~~v~~~~~~y~l 80 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQKLKNGDQVIYQKAPTYRL 80 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccccceeeeecCCeeEEEEEecCCcce Confidence 774 33211 1222222221111112222222111 111221 12346 Q ss_pred HHHhhcCccccCC--CCCCchhhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019527. 45 AFWNEFGHGGRFP--APPRPFFRNMVNEKSSEWPKRLGDAIKH 85 (190) Q Consensus 45 A~~~EfG~~~~~~--IP~RPFlr~~~~~~~~~~~~~l~~~i~~ 85 (190) +.+.|||+..+++ .|+||||+|+.+...+.+.+.+++.|.. T Consensus 81 ~HLLE~GHa~r~GGrV~a~phI~paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 81 THLLENGHAKRNGGRVSPKVHIAPVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred EEeeecceeecCCceeCcchhhhHHHHHHHHHHHHHHHHHhcC Confidence 7888999765543 6999999999999999999999998876 No 112 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=93.33 E-value=0.00022 Score=40.68 Aligned_cols=88 Identities=22% Similarity=0.292 Sum_probs=50.9 Q ss_pred CCCCchhhH---HHHHHH---------------HHHhhcCCEEEEEecCCC-----CCC---------CCc-----cHHH Q lcl|NC_019527. 1 MATLTGGDK---LAKILA---------------DIGGKAQGSVDVGFMSGA-----TYP---------DGT-----PVAQ 43 (190) Q Consensus 1 Ma~i~~~d~---l~~il~---------------~l~~l~~~~V~VGi~~~~-----~~~---------dG~-----~vA~ 43 (190) ||+|+=.+= +.+-|+ ++......+|+-.+.+.+ .|- ++. .--+ T Consensus 1 M~~i~id~La~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~~~~~v~nk~~yq 80 (127) T protein:vir:80 1 MANIKIDRLGDEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTPGGWVIHNKTEYR 80 (127) T ss_pred CccccHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeeccCceeEeecCCcc Confidence 888764321 221111 111111112222222111 011 121 1245 Q ss_pred HHHHhhcCccccCC--CCCCchhhHHHHHHHHHHHHHHHHHHhhcCC Q lcl|NC_019527. 44 VAFWNEFGHGGRFP--APPRPFFRNMVNEKSSEWPKRLGDAIKHYDG 88 (190) Q Consensus 44 iA~~~EfG~~~~~~--IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~ 88 (190) ++.+.|||+..+++ .++||||+|..+....++.+.++++|.+|.. T Consensus 81 LtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~~~~ 127 (127) T protein:vir:80 81 LAHLLEYGHATVDGGRVPETPHIRPVEDWLEKEFEDRVERAIKNESR 127 (127) T ss_pred eeehhhcceeccCCcccCCccchhhHHHHHHHHHHHHHHHHhcCCCC Confidence 78999999866543 6999999999999999999999999987644 No 113 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=93.24 E-value=0.00032 Score=39.78 Aligned_cols=53 Identities=23% Similarity=0.248 Sum_probs=28.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhhHHHhhhcccccccCccCchHHHHHHH Q lcl|NC_019527. 90 GRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAAQELVEEGFQGAGGSQAKPLVWTGHML 169 (190) Q Consensus 90 ~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~ 169 (190) +++++..+-..+...|++.+.. ..| +|||.|+ T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~-----------------------------------------------~aP-v~TG~Lr 32 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIIS-----------------------------------------------LMP-VDTGYLR 32 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHH-----------------------------------------------hCC-cCccccc Confidence 3333333333333333333322 123 5899999 Q ss_pred hhcceeeecCc-eeEEee--ccCC Q lcl|NC_019527. 170 NSITYQVDGGA-TIKVKV--NYGR 190 (190) Q Consensus 170 ~SIty~V~~g~-~~~~~~--~~~~ 190 (190) +||++.|.++. +..|.. .|+. T Consensus 33 ~SI~~~~~~~~~~~~V~~~~~YA~ 56 (116) T protein:vir:12 33 ESVTMDFKDGGFTGVINIGSEYAI 56 (116) T ss_pred ccceEEeecCcEEEEEecCCCccc Confidence 99999998875 444442 3444 No 114 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=93.24 E-value=0.00032 Score=39.78 Aligned_cols=53 Identities=23% Similarity=0.248 Sum_probs=28.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhhHHHhhhcccccccCccCchHHHHHHH Q lcl|NC_019527. 90 GRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAAQELVEEGFQGAGGSQAKPLVWTGHML 169 (190) Q Consensus 90 ~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~ 169 (190) +++++..+-..+...|++.+.. ..| +|||.|+ T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~-----------------------------------------------~aP-v~TG~Lr 32 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIIS-----------------------------------------------LMP-VDTGYLR 32 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHH-----------------------------------------------hCC-cCccccc Confidence 3333333333333333333322 123 5899999 Q ss_pred hhcceeeecCc-eeEEee--ccCC Q lcl|NC_019527. 170 NSITYQVDGGA-TIKVKV--NYGR 190 (190) Q Consensus 170 ~SIty~V~~g~-~~~~~~--~~~~ 190 (190) +||++.|.++. +..|.. .|+. T Consensus 33 ~SI~~~~~~~~~~~~V~~~~~YA~ 56 (116) T protein:vir:97 33 ESVTMDFKDGGFTGVINIGSEYAI 56 (116) T ss_pred ccceEEeecCcEEEEEecCCCccc Confidence 99999998875 444442 3444 No 115 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=93.21 E-value=0.00075 Score=37.70 Aligned_cols=74 Identities=18% Similarity=0.177 Sum_probs=32.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhh Q lcl|NC_019527. 64 FRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAA 143 (190) Q Consensus 64 lr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~ 143 (190) |-..+ ...+++.+.|+.+-..-.-..+++|+..+..+++.++. T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~------------------------------------ 43 (137) T protein:vir:93 1 MAKVK-YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIIS------------------------------------ 43 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------------------------ Confidence 22222 12223333332221111112233333333333332221 Q ss_pred HHHhhhcccccccCccCchHHHHHHHhhcceeeecCc-eeEEeec--cCC Q lcl|NC_019527. 144 QELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA-TIKVKVN--YGR 190 (190) Q Consensus 144 ~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~-~~~~~~~--~~~ 190 (190) .-| +|||.|++||++++.+|. +..|..+ |+. T Consensus 44 ---------------~aP-vdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~ 77 (137) T protein:vir:93 44 ---------------LMP-VDTGYLRESVTMDFKDSGFTGVINIGSEYAI 77 (137) T ss_pred ---------------hCC-ccccchhccceeEeecCceEEEEecCCCccc Confidence 223 599999999999998776 4444432 333 No 116 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=93.21 E-value=0.00075 Score=37.70 Aligned_cols=74 Identities=18% Similarity=0.177 Sum_probs=32.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhh Q lcl|NC_019527. 64 FRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAA 143 (190) Q Consensus 64 lr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~ 143 (190) |-..+ ...+++.+.|+.+-..-.-..+++|+..+..+++.++. T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~------------------------------------ 43 (137) T protein:vir:97 1 MAKVK-YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIIS------------------------------------ 43 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------------------------ Confidence 22222 12223333332221111112233333333333332221 Q ss_pred HHHhhhcccccccCccCchHHHHHHHhhcceeeecCc-eeEEeec--cCC Q lcl|NC_019527. 144 QELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA-TIKVKVN--YGR 190 (190) Q Consensus 144 ~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~-~~~~~~~--~~~ 190 (190) .-| +|||.|++||++++.+|. +..|..+ |+. T Consensus 44 ---------------~aP-vdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~ 77 (137) T protein:vir:97 44 ---------------LMP-VDTGYLRESVTMDFKDSGFTGVINIGSEYAI 77 (137) T ss_pred ---------------hCC-ccccchhccceeEeecCceEEEEecCCCccc Confidence 223 599999999999998776 4444432 333 No 117 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=93.21 E-value=0.00075 Score=37.70 Aligned_cols=74 Identities=18% Similarity=0.177 Sum_probs=32.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhh Q lcl|NC_019527. 64 FRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAA 143 (190) Q Consensus 64 lr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~ 143 (190) |-..+ ...+++.+.|+.+-..-.-..+++|+..+..+++.++. T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~------------------------------------ 43 (137) T protein:vir:94 1 MAKVK-YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIIS------------------------------------ 43 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------------------------ Confidence 22222 12223333332221111112233333333333332221 Q ss_pred HHHhhhcccccccCccCchHHHHHHHhhcceeeecCc-eeEEeec--cCC Q lcl|NC_019527. 144 QELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA-TIKVKVN--YGR 190 (190) Q Consensus 144 ~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~-~~~~~~~--~~~ 190 (190) .-| +|||.|++||++++.+|. +..|..+ |+. T Consensus 44 ---------------~aP-vdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~ 77 (137) T protein:vir:94 44 ---------------LMP-VDTGYLRESVTMDFKDSGFTGVINIGSEYAI 77 (137) T ss_pred ---------------hCC-ccccchhccceeEeecCceEEEEecCCCccc Confidence 223 599999999999998776 4444432 333 No 118 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=92.73 E-value=0.0004 Score=39.20 Aligned_cols=75 Identities=21% Similarity=0.259 Sum_probs=36.3 Q ss_pred CchhhH------HHHHHHHH----Hh---h-cCCEEEEEecCCCCCCCCccHHHHHHHhhcCc----------------- Q lcl|NC_019527. 4 LTGGDK------LAKILADI----GG---K-AQGSVDVGFMSGATYPDGTPVAQVAFWNEFGH----------------- 52 (190) Q Consensus 4 i~~~d~------l~~il~~l----~~---l-~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~----------------- 52 (190) +++..+ --.+-++| .. - ....-.|||-.-. |--+.+.|||. T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rk--------APhghlvE~Ghw~~~~~~~~~dG~w~~~ 72 (119) T protein:vir:10 1 MRESAKAFVNDETGKLRSNLYVAYSTEESTNGVQTYAVSWRKKA--------APHGHLLEFGHWQTHAAYKGKDGEWYSS 72 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCEEEEEeecCCCc--------CCcccccccceeeeeeeeeccCceeeec Confidence 222211 11111122 00 0 1112233433211 23344568982 Q ss_pred ------cccCCCCCCchhhHHHHHHHHHHHHHHHHHHh-------hcCC Q lcl|NC_019527. 53 ------GGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIK-------HYDG 88 (190) Q Consensus 53 ------~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~-------~g~~ 88 (190) |. .+|+||||||+|+....+..+.+.+.+. .|+- T Consensus 73 ~~~l~~~~--~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:10 73 SVKLVNPK--WIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred CccccCce--ecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 22 3899999999999988887777665432 2322 No 119 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=92.69 E-value=0.0024 Score=34.92 Aligned_cols=91 Identities=9% Similarity=0.080 Sum_probs=50.8 Q ss_pred HHHHHHHHHHHHHHHHhhc-CCcHHHHHHHHHHHHHHHHHHHHhcc------CCCCCChHHHHHhccccccccchhhhhh Q lcl|NC_019527. 68 VNEKSSEWPKRLGDAIKHY-DGDGRKALASMGEMIGGDLGSSIIST------NEPALSKTTLMLRSIYGNNPQEIRARDV 140 (190) Q Consensus 68 ~~~~~~~~~~~l~~~i~~g-~~~~~~aL~~iG~~a~~~Iq~~I~~~------~~pPnap~Ti~~K~~~~~~~~~~~~~~~ 140 (190) +++.-.++...|...+... ..+.+.+|..||..+....++.|... .|+|+++.+...|+. T Consensus 1 M~~~~~~~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~k~~------------- 67 (152) T protein:vir:10 1 MSEPIEQVKTAFDSLLNNISKPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGVKSK------------- 67 (152) T ss_pred CchHHHHHHHHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhhccc------------- Confidence 5555555555566555432 13456799999999999999999997 466666655433331 Q ss_pred hhhHHHhhhcccccccCccCchHHHHHHHhh--cceeeecCceeEEe---e--ccCC Q lcl|NC_019527. 141 LAAQELVEEGFQGAGGSQAKPLVWTGHMLNS--ITYQVDGGATIKVK---V--NYGR 190 (190) Q Consensus 141 ~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~S--Ity~V~~g~~~~~~---~--~~~~ 190 (190) .+-......|+.| ++|... .....|+ . -|++ T Consensus 68 ------------------~~~~~m~~~L~~a~~l~~~a~-~~~~~Vg~~Gt~~~yAa 105 (152) T protein:vir:10 68 ------------------IKSGKMFDKITQPRFMRLRLE-SEGVSLGYEGGDAVIAR 105 (152) T ss_pred ------------------ccchhHHHhhhhcceeeeeec-CcEEEEEecCCchhhhh Confidence 1112222334444 455543 2334442 2 2443 No 120 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=92.55 E-value=0.0008 Score=37.56 Aligned_cols=74 Identities=18% Similarity=0.177 Sum_probs=34.6 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhh Q lcl|NC_019527. 64 FRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAA 143 (190) Q Consensus 64 lr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~ 143 (190) |-.-+ ...+++.+.|+++-..-+-.++++|...+..++..+|. T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~------------------------------------ 43 (137) T protein:vir:96 1 MAKVK-YGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVA------------------------------------ 43 (137) T ss_pred CchhH-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------------------------ Confidence 22211 12233333333322111113344455554444443321 Q ss_pred HHHhhhcccccccCccCchHHHHHHHhhcceeeecCc-eeEEee--ccCC Q lcl|NC_019527. 144 QELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA-TIKVKV--NYGR 190 (190) Q Consensus 144 ~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~-~~~~~~--~~~~ 190 (190) .-| +|||.|++||+++|.++. +..|+. +|+. T Consensus 44 ---------------~~p-vdTG~L~~Si~~~~~~~g~~~~V~~~~~YA~ 77 (137) T protein:vir:96 44 ---------------LAP-VDLGFLKESIDFKVTDGGFSSVISVGAEYAI 77 (137) T ss_pred ---------------hCC-cCccchhcCceeEeecCceEEEEecCCCccc Confidence 123 589999999999866554 455543 3444 No 121 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=92.49 E-value=0.00032 Score=39.72 Aligned_cols=75 Identities=23% Similarity=0.296 Sum_probs=34.6 Q ss_pred hhH-HHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 64 FRN-MVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 64 lr~-~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) |-. .+.-..+++.+.|+.....-+..++.+|+..+...++.++ T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak------------------------------------ 44 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAK------------------------------------ 44 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------------------------ Confidence 110 0011223333333333222111234444444444333321 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeeecCc---eeEEe--eccCC Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA---TIKVK--VNYGR 190 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~---~~~~~--~~~~~ 190 (190) ..-| +|||.|++||+++|.+.. ++.|+ ++|+. T Consensus 45 ---------------~~aP-v~TG~Lr~SI~~~~~~~g~~~~~~v~~~~~YA~ 81 (142) T protein:vir:94 45 ---------------GLCP-VDTGRLRSSIQAVPSGGRFSFSVTIGTNVTYAA 81 (142) T ss_pred ---------------HhCC-ccchhhhccceeeeccCCceEEEEEecCcccch Confidence 1234 699999999999887643 33333 56665 No 122 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=92.04 E-value=0.0011 Score=36.77 Aligned_cols=74 Identities=19% Similarity=0.178 Sum_probs=33.6 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhh Q lcl|NC_019527. 64 FRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAA 143 (190) Q Consensus 64 lr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~ 143 (190) |-.-. ...+++.+.|+++-..-+-..+++|+..+..+++.+|. T Consensus 1 Ma~~~-~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~------------------------------------ 43 (137) T protein:vir:94 1 MAKVK-YGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIIS------------------------------------ 43 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------------------------ Confidence 21110 12233333333322211112334444444433333321 Q ss_pred HHHhhhcccccccCccCchHHHHHHHhhcceeeecCc-eeEEe--eccCC Q lcl|NC_019527. 144 QELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA-TIKVK--VNYGR 190 (190) Q Consensus 144 ~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~-~~~~~--~~~~~ 190 (190) .-| +|||.|++||++.+.++. +..|+ +.|+. T Consensus 44 ---------------~aP-vdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~ 77 (137) T protein:vir:94 44 ---------------LMP-VDTGYLRESVTMDFKDGGFTGVINIGSEYAI 77 (137) T ss_pred ---------------hCC-cCcchhhcCceeEeecCcEEEEEecCCCccc Confidence 223 589999999999988765 34433 23333 No 123 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=92.01 E-value=0.00074 Score=37.75 Aligned_cols=79 Identities=18% Similarity=0.120 Sum_probs=33.5 Q ss_pred CCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhh Q lcl|NC_019527. 59 PPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRAR 138 (190) Q Consensus 59 P~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~ 138 (190) =+|..++-.+ +...++.+.+...-..-.-.++++|...+..+++.++. T Consensus 1 m~~ms~~i~~-~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~------------------------------- 48 (144) T protein:vir:59 1 MALMSVRIDP-SWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAAS------------------------------- 48 (144) T ss_pred CCcceeeehh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------------------- Confidence 1333322111 11112222222211111112334444444433333321 Q ss_pred hhhhhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCc-eeEEe--eccCC Q lcl|NC_019527. 139 DVLAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA-TIKVK--VNYGR 190 (190) Q Consensus 139 ~~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~-~~~~~--~~~~~ 190 (190) .-| +|||.|++||++++.++. +..|+ ++|+. T Consensus 49 --------------------~ap-v~TG~Lr~SI~~~~~~~g~~~~V~~~~~YA~ 82 (144) T protein:vir:59 49 --------------------LAP-VDEGNLKNSIQIDYKNNGLTAEITVGAEYAI 82 (144) T ss_pred --------------------hCC-ccchhhhcCeeEEeecCcEEEEEecCCCccc Confidence 112 689999999999987665 44444 33433 No 124 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=91.94 E-value=0.00054 Score=38.49 Aligned_cols=74 Identities=22% Similarity=0.318 Sum_probs=43.6 Q ss_pred CC-CCchhhHHHHHHHHHHhhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHH Q lcl|NC_019527. 1 MA-TLTGGDKLAKILADIGGKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRL 79 (190) Q Consensus 1 Ma-~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l 79 (190) |+ +|.-.+ ..+.......+.|||... +.+|.+-|||+- .+||.||+..+.++.++++-+.+ T Consensus 61 laD~I~~s~------~~~dg~~~g~~~VG~~k~---------~~~A~f~n~GT~---k~~~~hFie~t~~e~~~evl~a~ 122 (139) T protein:vir:10 61 LSEDIRSAA------GDIDGDHNGSSTVGFHNK---------AHIARFLNDGTK---YIRADHFVDNARDDAKDAVFAAE 122 (139) T ss_pred hhhcceecC------cccccccceeeeeCCCCC---------cceEeecccCcc---ccCCCchHHHHHHHHHHHHHHHH Confidence 32 111100 012222344567888421 578899999973 58999999999999988877666 Q ss_pred HHHH----hhcCCcHHH Q lcl|NC_019527. 80 GDAI----KHYDGDGRK 92 (190) Q Consensus 80 ~~~i----~~g~~~~~~ 92 (190) .+.+ ....++.+. T Consensus 123 ~~~~k~~l~~~~~~~~~ 139 (139) T protein:vir:10 123 AEKYQAMIAKANGGGDK 139 (139) T ss_pred HHHHHHHHhhcCCCCCC Confidence 5544 322222222 No 125 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=91.55 E-value=0.0015 Score=36.01 Aligned_cols=74 Identities=19% Similarity=0.188 Sum_probs=32.4 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhh Q lcl|NC_019527. 64 FRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAA 143 (190) Q Consensus 64 lr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~ 143 (190) |-..+ ...+++.+.|+++-+.-.-..+++|+..+..+++.++. T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~------------------------------------ 43 (137) T protein:vir:95 1 MAKVK-YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIIS------------------------------------ 43 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------------------------ Confidence 22222 12223333332221111112233333333333333211 Q ss_pred HHHhhhcccccccCccCchHHHHHHHhhcceeeecCc-eeEEe--eccCC Q lcl|NC_019527. 144 QELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA-TIKVK--VNYGR 190 (190) Q Consensus 144 ~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~-~~~~~--~~~~~ 190 (190) .-| +|||.|++||+++|.++. +..|+ +.|+. T Consensus 44 ---------------~aP-v~TG~L~~Si~~~~~~~~~~~~V~~~~~YA~ 77 (137) T protein:vir:95 44 ---------------LMP-VDTGYLRESVTMDFKDGGFTGVINIGSEYAI 77 (137) T ss_pred ---------------hCC-ccchhhhcCeeeEeeCCceEEEEecCCCccc Confidence 223 589999999999998764 33333 33433 No 126 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=91.45 E-value=0.00032 Score=39.76 Aligned_cols=84 Identities=26% Similarity=0.351 Sum_probs=49.1 Q ss_pred CCCCchhh---HHHHHHHHHHhh---------------cCCE----------EEEEecC-----CCCCCCCc-----cHH Q lcl|NC_019527. 1 MATLTGGD---KLAKILADIGGK---------------AQGS----------VDVGFMS-----GATYPDGT-----PVA 42 (190) Q Consensus 1 Ma~i~~~d---~l~~il~~l~~l---------------~~~~----------V~VGi~~-----~~~~~dG~-----~vA 42 (190) ||+|+=.+ .+.+.|+...+. .-.. +.-|-.. ..+ .+|. .-- T Consensus 1 M~~i~id~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~-~e~~~V~nk~~y 79 (124) T protein:vir:95 1 MAKIKIGRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRV-PNGWVIHNKTEY 79 (124) T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeee-cCceeEEEcCCC Confidence 99876432 122222111111 0001 1222221 111 1221 124 Q ss_pred HHHHHhhcCccccCC--CCCCchhhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019527. 43 QVAFWNEFGHGGRFP--APPRPFFRNMVNEKSSEWPKRLGDAIKH 85 (190) Q Consensus 43 ~iA~~~EfG~~~~~~--IP~RPFlr~~~~~~~~~~~~~l~~~i~~ 85 (190) +++.+.|||+..+++ .++||||+|..+....++.+.++++|.+ T Consensus 80 qLtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~ 124 (124) T protein:vir:95 80 RLAHLLEYGHATVDGGRVPGTPHIRPIEDWLEKEFEDRVEKAIKQ 124 (124) T ss_pred ceeeeeecceeccCCcccCCccchhHHHHHHHHHHHHHHHHHhcC Confidence 578899999865543 6999999999999999999999999977 No 127 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=91.15 E-value=0.00087 Score=37.36 Aligned_cols=77 Identities=21% Similarity=0.229 Sum_probs=36.3 Q ss_pred CchhhH------HHHHHHHH----Hh---h-cCCEEEEEecCCCCCCCCccHHHHHHHhhcCcc---------------- Q lcl|NC_019527. 4 LTGGDK------LAKILADI----GG---K-AQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHG---------------- 53 (190) Q Consensus 4 i~~~d~------l~~il~~l----~~---l-~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~---------------- 53 (190) +++..+ --.+-++| .. - ....-.|||-.-. |--+.+.|||.- T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rk--------APhghlvE~Ghw~~~~~~~~~dG~w~~~ 72 (119) T protein:vir:81 1 MRESAKAFVNDETGKLRSNLYVAYSPEESTNGVQTYAVSWRKKA--------APHGHLLEFGHWQTHAAYKGKDGEWYSS 72 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCCeEEEEeeccCCc--------CCcccccccceeeeeeeeeccCceeeec Confidence 222211 11111122 00 0 1112233433211 223445688820 Q ss_pred -----ccCCCCCCchhhHHHHHHHHHHHHHHHHHHh-------hcCC Q lcl|NC_019527. 54 -----GRFPAPPRPFFRNMVNEKSSEWPKRLGDAIK-------HYDG 88 (190) Q Consensus 54 -----~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~-------~g~~ 88 (190) ....+|+||||||+|+....+..+.+.+.+. .|+- T Consensus 73 ~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:81 73 SVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred CccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 1124899999999999988887776665432 2322 No 128 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=90.59 E-value=0.0013 Score=36.40 Aligned_cols=70 Identities=14% Similarity=0.055 Sum_probs=34.7 Q ss_pred hHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhhH Q lcl|NC_019527. 65 RNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAAQ 144 (190) Q Consensus 65 r~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~~ 144 (190) -.++++-..++.+... .+. -.++++|...+..++.++| . T Consensus 1 i~Gld~l~~~l~~~~~-~~~---~~v~~al~~~a~~i~~~ak-------------------~------------------ 39 (108) T protein:vir:99 1 MRGLDRFLRSVERKQK-SVR---IAVDKELSKSAARIERQAK-------------------I------------------ 39 (108) T ss_pred CchHHHHHHHHHHHHH-HHH---HHHHHHHHHHHHHHHHHHH-------------------h------------------ Confidence 3344333333322111 111 1223444443333333221 1 Q ss_pred HHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEe--eccCC Q lcl|NC_019527. 145 ELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVK--VNYGR 190 (190) Q Consensus 145 ~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~--~~~~~ 190 (190) ..| +|||.|++||++.+.++.+..|. ..|.. T Consensus 40 --------------~aP-v~TG~Lr~sI~~~~~~~~~~~v~~~~~Ya~ 72 (108) T protein:vir:99 40 --------------LAP-VDTGWLRAQIYSEQQRLLHYRVVSPALYSI 72 (108) T ss_pred --------------cCC-cCchhhhcceeeeecCcEEEEeecCcccch Confidence 234 68999999999999888777764 33444 No 129 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=90.58 E-value=0.0014 Score=36.20 Aligned_cols=74 Identities=20% Similarity=0.201 Sum_probs=34.2 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhh Q lcl|NC_019527. 64 FRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAA 143 (190) Q Consensus 64 lr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~ 143 (190) |-... ..-+++.+.|...-..-.-.++++|+..+..++..+|.. T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~----------------------------------- 44 (137) T protein:vir:10 1 MAKVK-YGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSN----------------------------------- 44 (137) T ss_pred Cccch-hCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----------------------------------- Confidence 21110 011222222222111111133455555554444444321 Q ss_pred HHHhhhcccccccCccCchHHHHHHHhhcceeeecCc-eeEEe--eccCC Q lcl|NC_019527. 144 QELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA-TIKVK--VNYGR 190 (190) Q Consensus 144 ~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~-~~~~~--~~~~~ 190 (190) -| +|||.|++||++++.++. +..|. +.|+. T Consensus 45 ----------------aP-v~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~ 77 (137) T protein:vir:10 45 ----------------MP-VDTGYLRESVSMDFKKGGLTGVINIGSEYAV 77 (137) T ss_pred ----------------CC-cCcchhhcCeeeEecCCcEEEEEecCCcccc Confidence 23 489999999999988765 33333 44444 No 130 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=90.51 E-value=0.0021 Score=35.25 Aligned_cols=74 Identities=23% Similarity=0.206 Sum_probs=34.1 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhh Q lcl|NC_019527. 64 FRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAA 143 (190) Q Consensus 64 lr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~ 143 (190) |-..+ ..-+++.+.|+..-++-.-.++++|+..+..+++.+|.. T Consensus 1 Ma~~~-~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~----------------------------------- 44 (137) T protein:vir:10 1 MAKVK-YGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSN----------------------------------- 44 (137) T ss_pred CchhH-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----------------------------------- Confidence 21110 011222222222111111133455555555555544332 Q ss_pred HHHhhhcccccccCccCchHHHHHHHhhcceeeecCc-eeEE--eeccCC Q lcl|NC_019527. 144 QELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA-TIKV--KVNYGR 190 (190) Q Consensus 144 ~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~-~~~~--~~~~~~ 190 (190) -| +|||.|++||++.+.++. +.-| .+.|+. T Consensus 45 ----------------aP-vdTG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~ 77 (137) T protein:vir:10 45 ----------------MP-VDTGYLRESVSMDFKKGGLTGVINIGSEYAV 77 (137) T ss_pred ----------------CC-cCcchhhcCeeEEeeCCcEEEEEecCCCccc Confidence 23 489999999999987664 3333 334444 No 131 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=90.18 E-value=0.0011 Score=36.91 Aligned_cols=73 Identities=21% Similarity=0.194 Sum_probs=42.3 Q ss_pred CC-CCchhhHHHHHHHHHHhhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHH--HHHHHH Q lcl|NC_019527. 1 MA-TLTGGDKLAKILADIGGKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEK--SSEWPK 77 (190) Q Consensus 1 Ma-~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~--~~~~~~ 77 (190) |+ +|.-.+ ..+.......+.|||.... -+++|.+.+||+. .+|+-||+..+..+. +.++-+ T Consensus 61 laD~I~~~~------~~~DG~~dg~s~VG~~~~~-------~~~~A~f~n~GT~---k~~~~hFve~~~~~a~~k~~Vl~ 124 (141) T protein:vir:50 61 MADGLAIQS------TNADGRKNGVSTVGWKNNY-------HAQNARRLNDGTK---KYRADHFVTNVQNDSTVQKKVLL 124 (141) T ss_pred cccceeecc------CccccccCCeeeeccCCCc-------cceeeeccccCcc---ccCCCchhHHHHHhhhhHHHHHH Confidence 44 232100 1122234456789996422 3899999999974 589999999999764 344444 Q ss_pred HH----HHHHhh-cCCc Q lcl|NC_019527. 78 RL----GDAIKH-YDGD 89 (190) Q Consensus 78 ~l----~~~i~~-g~~~ 89 (190) .+ ++.|.. |.-| T Consensus 125 A~~~~~k~~l~~~~~~~ 141 (141) T protein:vir:50 125 EKKRNTKNSLEEKEGCD 141 (141) T ss_pred HHHHHHHHHHHhccCCC Confidence 33 344432 3224 No 132 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=89.71 E-value=0.0024 Score=34.98 Aligned_cols=73 Identities=16% Similarity=0.165 Sum_probs=31.3 Q ss_pred hhH-HHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 64 FRN-MVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 64 lr~-~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) |-. =|+.+..++.+.+.+.+ .+++..+++..+..+ |+.... T Consensus 1 ~~~~~f~~~~~~~~~~~~k~~-------~~~~~~~a~~~~~~~----------------ie~~ak--------------- 42 (141) T protein:vir:78 1 MNEFEFDSNIPKARKLIEKKV-------LQALEDIGEHMTTEL----------------AEGGHG--------------- 42 (141) T ss_pred CcchhHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHH----------------HHHhhh--------------- Confidence 111 12233333333333222 222333222211111 111111 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeee-cCceeEEee--ccCC Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVD-GGATIKVKV--NYGR 190 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~-~g~~~~~~~--~~~~ 190 (190) ..-| +|||.|++||+|+|. +|.++.|+. +|+- T Consensus 43 ---------------~~~p-vdtG~L~~SI~~~v~~~g~~~~V~~~~~YA~ 77 (141) T protein:vir:78 43 ---------------VTSN-NDTGEYAQKSGYKVRKSSKEVIVGNSSDYAI 77 (141) T ss_pred ---------------hccc-cccchhhcceeeeeecCCcEEEEecCCCccc Confidence 1234 899999999999986 555555543 2332 No 133 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=88.80 E-value=0.00048 Score=38.77 Aligned_cols=78 Identities=17% Similarity=0.135 Sum_probs=37.1 Q ss_pred CCCCch---------hhHHHHHHHHHH----hhc--CCEEEEEecCCCC----CCCC--------ccHHHHHHHhhcCcc Q lcl|NC_019527. 1 MATLTG---------GDKLAKILADIG----GKA--QGSVDVGFMSGAT----YPDG--------TPVAQVAFWNEFGHG 53 (190) Q Consensus 1 Ma~i~~---------~d~l~~il~~l~----~l~--~~~V~VGi~~~~~----~~dG--------~~vA~iA~~~EfG~~ 53 (190) +++++- .+.+++++++.. ... .--|.-|-+..+- .++| .+.+.+|.++|||+. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~Ve~GT~ 86 (140) T protein:vir:10 7 RARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAPVHEGSR 86 (140) T ss_pred eeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhhhccCCC Confidence 333321 122333332221 111 1122333333221 0111 245899999999982 Q ss_pred c-----------------------cC---CCCCCchhhHHHHHH---HHHHHHH Q lcl|NC_019527. 54 G-----------------------RF---PAPPRPFFRNMVNEK---SSEWPKR 78 (190) Q Consensus 54 ~-----------------------~~---~IP~RPFlr~~~~~~---~~~~~~~ 78 (190) - ++ +.+|||||++++++. .+++... T Consensus 87 ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:10 87 PHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred CceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 1 11 466999999999874 3433332 No 134 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=88.80 E-value=0.00048 Score=38.77 Aligned_cols=78 Identities=17% Similarity=0.135 Sum_probs=37.1 Q ss_pred CCCCch---------hhHHHHHHHHHH----hhc--CCEEEEEecCCCC----CCCC--------ccHHHHHHHhhcCcc Q lcl|NC_019527. 1 MATLTG---------GDKLAKILADIG----GKA--QGSVDVGFMSGAT----YPDG--------TPVAQVAFWNEFGHG 53 (190) Q Consensus 1 Ma~i~~---------~d~l~~il~~l~----~l~--~~~V~VGi~~~~~----~~dG--------~~vA~iA~~~EfG~~ 53 (190) +++++- .+.+++++++.. ... .--|.-|-+..+- .++| .+.+.+|.++|||+. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~Ve~GT~ 86 (140) T protein:vir:97 7 RARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAPVHEGSR 86 (140) T ss_pred eeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhhhccCCC Confidence 333321 122333332221 111 1122333333221 0111 245899999999982 Q ss_pred c-----------------------cC---CCCCCchhhHHHHHH---HHHHHHH Q lcl|NC_019527. 54 G-----------------------RF---PAPPRPFFRNMVNEK---SSEWPKR 78 (190) Q Consensus 54 ~-----------------------~~---~IP~RPFlr~~~~~~---~~~~~~~ 78 (190) - ++ +.+|||||++++++. .+++... T Consensus 87 ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:97 87 PHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred CceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 1 11 466999999999874 3433332 No 135 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=88.69 E-value=0.0038 Score=33.86 Aligned_cols=86 Identities=15% Similarity=0.137 Sum_probs=36.2 Q ss_pred CEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHH Q lcl|NC_019527. 23 GSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIG 102 (190) Q Consensus 23 ~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~ 102 (190) .+|.. | --.|-.|-.. ..--+++.+.|++.-..-.-..+++|...+..++ T Consensus 1 ~~~~~----------------------~-------~~~~~~Ma~~-~~Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~ 50 (149) T protein:vir:94 1 MKLSY----------------------Y-------DLSRCHMAKV-KYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIY 50 (149) T ss_pred Ceeee----------------------e-------ecchhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111 1 0133333221 1122233333333221111123444554444444 Q ss_pred HHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCc-e Q lcl|NC_019527. 103 GDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA-T 181 (190) Q Consensus 103 ~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~-~ 181 (190) +.++. .-| +|||.|++||++.|.++. + T Consensus 51 ~~ak~---------------------------------------------------~aP-vdTG~Lr~SI~~~~~~~g~~ 78 (149) T protein:vir:94 51 NTAVA---------------------------------------------------LAP-VDLGFLEESIDFKYFDGGLS 78 (149) T ss_pred HHHHH---------------------------------------------------hCC-cccchhhcCeeEEeeCCcEE Confidence 43321 224 689999999999887654 3 Q ss_pred eEEe--eccCC Q lcl|NC_019527. 182 IKVK--VNYGR 190 (190) Q Consensus 182 ~~~~--~~~~~ 190 (190) ..|. ++|+. T Consensus 79 ~~V~~~~~YA~ 89 (149) T protein:vir:94 79 SVISVGADYAI 89 (149) T ss_pred EEEecCCCccc Confidence 3333 33433 No 136 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=88.61 E-value=0.0046 Score=33.41 Aligned_cols=78 Identities=14% Similarity=0.138 Sum_probs=39.9 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 63 FFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 63 Flr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) .=-.++ +++.+.|++.-....-.++.+|...|...++++|..-.. . T Consensus 1 i~i~Gl----d~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~a~~--------------~---------------- 46 (115) T protein:vir:99 1 MNIDGL----DALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKARE--------------V---------------- 46 (115) T ss_pred CcchhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--------------c---------------- Confidence 001122 233333322211111234566666666666655442110 0 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEee--ccCC Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKV--NYGR 190 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~--~~~~ 190 (190) .+.| +|||.|++||++...+|-+..|.. .|+. T Consensus 47 ---------------~~~p-~~TG~Lr~SI~~~~~g~~~~~V~~~~~Ya~ 80 (115) T protein:vir:99 47 ---------------MNKG-YWTGNLSRNIRYKKTVDLQYTITSHAAYSG 80 (115) T ss_pred ---------------cCCC-CcchhhhhceeeeecCcEEEEecCCccccc Confidence 1334 589999999999998887766653 3444 No 137 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=88.44 E-value=0.0052 Score=33.12 Aligned_cols=78 Identities=18% Similarity=0.153 Sum_probs=37.5 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 63 FFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 63 Flr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) -=-.++ +++.+.|+..=....-.++.+|...|...+.+++.. +|. . T Consensus 1 i~i~Gl----d~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~~---------a~~-----~---------------- 46 (115) T protein:vir:10 1 MQSKGL----KKLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVSN---------AKE-----V---------------- 46 (115) T ss_pred CeehhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---------hcc-----c---------------- Confidence 000122 222222222111111133455555555555444332 110 0 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEeec--cCC Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKVN--YGR 190 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~~--~~~ 190 (190) ...| +|||.|++||+....+|-+..|..+ |+. T Consensus 47 ---------------~~~p-v~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:10 47 ---------------MNKG-YWTGNLASLIEVKKIGDLHYRVISTAHYSG 80 (115) T ss_pred ---------------cCCC-CcchhhhhceeeeecCcEEEEeeCCCccch Confidence 1334 6999999999999887777777543 554 No 138 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=88.40 E-value=0.0016 Score=35.84 Aligned_cols=75 Identities=11% Similarity=0.006 Sum_probs=30.7 Q ss_pred Hhh----cCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhhHHHhhhcccccccCc Q lcl|NC_019527. 83 IKH----YDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAAQELVEEGFQGAGGSQ 158 (190) Q Consensus 83 i~~----g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~s 158 (190) |.. |.-...+-|..+...+.+.|+..+.+.-.- ....+....+ + T Consensus 1 m~~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~--~a~~v~~~ak------------------------------~ 48 (182) T protein:vir:10 1 MIEVELKGVNELRAKLKKLPDIMAKATANAQENAIEQ--AEAYAVDELQ------------------------------S 48 (182) T ss_pred CeEEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHH------------------------------h Confidence 221 111122334444444444444433321100 0011111110 1 Q ss_pred cCchHHHHHHHhhcceeeec-C--ceeEEee--ccCC Q lcl|NC_019527. 159 AKPLVWTGHMLNSITYQVDG-G--ATIKVKV--NYGR 190 (190) Q Consensus 159 ~kPLIDTG~L~~SIty~V~~-g--~~~~~~~--~~~~ 190 (190) --| +|||.|++||+++|.. | -+..|.. +|+. T Consensus 49 ~~P-vdtG~Lr~SI~~~~~~~~~~~~g~V~~~~~ya~ 84 (182) T protein:vir:10 49 SIK-YSTGELTRSFKHEVKVDGDEVIGRWWNSSMVAV 84 (182) T ss_pred hCC-CCchhhhhceeeeeeecCCeEEEEeecCCCccc Confidence 235 7999999999986652 2 2333333 3544 No 139 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=88.16 E-value=0.0051 Score=33.16 Aligned_cols=78 Identities=15% Similarity=0.183 Sum_probs=34.6 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 63 FFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 63 Flr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) .=-.++++ +.+.|++.=....-..+.+|...|...+..+|..-. T Consensus 1 i~~~Gld~----l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~-------------------------------- 44 (115) T protein:vir:96 1 MNIDGLDA----LLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAR-------------------------------- 44 (115) T ss_pred CcchhHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------------------------- Confidence 00011111 111111110111112233444444333333322110 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEee--ccCC Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKV--NYGR 190 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~--~~~~ 190 (190) ...+.| +|||.|++||++...+|.+..|.. .|++ T Consensus 45 -------------~~~~~p-~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:96 45 -------------EVMNKG-YWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred -------------ccCCCC-CCchhhhhcceeeecCceEEEeecCccchh Confidence 001333 799999999999998887766654 4555 No 140 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=88.16 E-value=0.0051 Score=33.16 Aligned_cols=78 Identities=15% Similarity=0.183 Sum_probs=34.6 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 63 FFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 63 Flr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) .=-.++++ +.+.|++.=....-..+.+|...|...+..+|..-. T Consensus 1 i~~~Gld~----l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~-------------------------------- 44 (115) T protein:vir:96 1 MNIDGLDA----LLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAR-------------------------------- 44 (115) T ss_pred CcchhHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------------------------- Confidence 00011111 111111110111112233444444333333322110 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEee--ccCC Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKV--NYGR 190 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~--~~~~ 190 (190) ...+.| +|||.|++||++...+|.+..|.. .|++ T Consensus 45 -------------~~~~~p-~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:96 45 -------------EVMNKG-YWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred -------------ccCCCC-CCchhhhhcceeeecCceEEEeecCccchh Confidence 001333 799999999999998887766654 4555 No 141 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=88.16 E-value=0.0051 Score=33.16 Aligned_cols=78 Identities=15% Similarity=0.183 Sum_probs=34.6 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 63 FFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 63 Flr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) .=-.++++ +.+.|++.=....-..+.+|...|...+..+|..-. T Consensus 1 i~~~Gld~----l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~-------------------------------- 44 (115) T protein:vir:97 1 MNIDGLDA----LLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAR-------------------------------- 44 (115) T ss_pred CcchhHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------------------------- Confidence 00011111 111111110111112233444444333333322110 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEee--ccCC Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKV--NYGR 190 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~--~~~~ 190 (190) ...+.| +|||.|++||++...+|.+..|.. .|++ T Consensus 45 -------------~~~~~p-~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:97 45 -------------EVMNKG-YWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred -------------ccCCCC-CCchhhhhcceeeecCceEEEeecCccchh Confidence 001333 799999999999998887766654 4555 No 142 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=88.16 E-value=0.0051 Score=33.16 Aligned_cols=78 Identities=15% Similarity=0.183 Sum_probs=34.6 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 63 FFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 63 Flr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) .=-.++++ +.+.|++.=....-..+.+|...|...+..+|..-. T Consensus 1 i~~~Gld~----l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~-------------------------------- 44 (115) T protein:vir:93 1 MNIDGLDA----LLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAR-------------------------------- 44 (115) T ss_pred CcchhHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------------------------- Confidence 00011111 111111110111112233444444333333322110 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEee--ccCC Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKV--NYGR 190 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~--~~~~ 190 (190) ...+.| +|||.|++||++...+|.+..|.. .|++ T Consensus 45 -------------~~~~~p-~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:93 45 -------------EVMNKG-YWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred -------------ccCCCC-CCchhhhhcceeeecCceEEEeecCccchh Confidence 001333 799999999999998887766654 4555 No 143 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=88.16 E-value=0.0051 Score=33.16 Aligned_cols=78 Identities=15% Similarity=0.183 Sum_probs=34.6 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 63 FFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 63 Flr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) .=-.++++ +.+.|++.=....-..+.+|...|...+..+|..-. T Consensus 1 i~~~Gld~----l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~-------------------------------- 44 (115) T protein:vir:78 1 MNIDGLDA----LLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAR-------------------------------- 44 (115) T ss_pred CcchhHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------------------------- Confidence 00011111 111111110111112233444444333333322110 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEee--ccCC Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKV--NYGR 190 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~--~~~~ 190 (190) ...+.| +|||.|++||++...+|.+..|.. .|++ T Consensus 45 -------------~~~~~p-~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:78 45 -------------EVMNKG-YWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred -------------ccCCCC-CCchhhhhcceeeecCceEEEeecCccchh Confidence 001333 799999999999998887766654 4555 No 144 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=88.16 E-value=0.0051 Score=33.16 Aligned_cols=78 Identities=15% Similarity=0.183 Sum_probs=34.6 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 63 FFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 63 Flr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) .=-.++++ +.+.|++.=....-..+.+|...|...+..+|..-. T Consensus 1 i~~~Gld~----l~~~l~~~~~~~~~~v~~a~~~~~~~i~~~a~~~a~-------------------------------- 44 (115) T protein:vir:10 1 MNIDGLDA----LLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAR-------------------------------- 44 (115) T ss_pred CcchhHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------------------------- Confidence 00011111 111111110111112233444444333333322110 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEee--ccCC Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVKV--NYGR 190 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~~--~~~~ 190 (190) ...+.| +|||.|++||++...+|.+..|.. .|++ T Consensus 45 -------------~~~~~p-~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:10 45 -------------EVMNKG-YWTGNLSRNIRYKKTGDLQYTITSHAAYSG 80 (115) T ss_pred -------------ccCCCC-CCchhhhhcceeeecCceEEEeecCccchh Confidence 001333 799999999999998887766654 4555 No 145 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=87.76 E-value=0.0018 Score=35.69 Aligned_cols=74 Identities=20% Similarity=0.304 Sum_probs=43.7 Q ss_pred CC-CCchhhHHHHHHHHHHhhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHH Q lcl|NC_019527. 1 MA-TLTGGDKLAKILADIGGKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRL 79 (190) Q Consensus 1 Ma-~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l 79 (190) |+ .|.-.. ..+.......+.|||.- -+.+|.+-|+|+- .+||.||+..+.++.++++.+.+ T Consensus 61 laD~I~~~~------~~idg~~~g~~~VG~~~---------~~~~Ahf~n~GT~---~~~~~hFie~t~~e~~~ev~~a~ 122 (139) T protein:vir:10 61 LSEDISSAA------GDIDGDHNGSSTVGFHN---------KAHIARFLNDGTK---NIRADHFVDNARDDAKDAVFAAE 122 (139) T ss_pred ccccceecC------ccccccccccceeCCCC---------CceeeeeeccCcc---ccCCCchHHHHHHHHHHHHHHHH Confidence 33 221100 12222234557788841 1567899999973 58999999999999988877666 Q ss_pred HHHH----hhcCCcHHH Q lcl|NC_019527. 80 GDAI----KHYDGDGRK 92 (190) Q Consensus 80 ~~~i----~~g~~~~~~ 92 (190) .+.+ ....++-+. T Consensus 123 ~~~~ke~l~~~~~~~~~ 139 (139) T protein:vir:10 123 AEKYQAMIAKANGGDSK 139 (139) T ss_pred HHHHHHHHhhcCCCCCC Confidence 5544 221122222 No 146 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=87.01 E-value=0.003 Score=34.39 Aligned_cols=77 Identities=21% Similarity=0.157 Sum_probs=38.6 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhh Q lcl|NC_019527. 64 FRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAA 143 (190) Q Consensus 64 lr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~ 143 (190) |-.-=-+--+++.+.|++.. ....++.+|...|...+..+++... T Consensus 1 Ma~i~~~Gld~l~~~L~~~~--~~~~v~~~~~~~~~~~~~~~~~~a~--------------------------------- 45 (114) T protein:vir:49 1 MATIEFEGLDEMAQSLLKNA--SPEKRSKVLRKYGSKLKEAAVNRAQ--------------------------------- 45 (114) T ss_pred CeeeeeehHHHHHHHHHHhc--CHHHHHHHHHHHHHHHHHHHHHhcc--------------------------------- Confidence 22100011133333333211 1224456666666555555543210 Q ss_pred HHHhhhcccccccCccCchHHHHHHHhhcceeeecCc-eeEEeeccCC Q lcl|NC_019527. 144 QELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA-TIKVKVNYGR 190 (190) Q Consensus 144 ~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~-~~~~~~~~~~ 190 (190) ...| +|||.|++||+..+.+|. +|-....|++ T Consensus 46 --------------~~~p-~~TG~Lr~sI~~~~~~~~~~V~~~~~Ya~ 78 (114) T protein:vir:49 46 --------------FNKG-YSTGATRRSITLQVESDKATVEALTSYSG 78 (114) T ss_pred --------------cCCC-CCchhhhhceeeeecCCeeEecCCCCccc Confidence 1234 699999999999887654 4444456766 No 147 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=87.01 E-value=0.003 Score=34.39 Aligned_cols=77 Identities=21% Similarity=0.157 Sum_probs=38.6 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhh Q lcl|NC_019527. 64 FRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAA 143 (190) Q Consensus 64 lr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~ 143 (190) |-.-=-+--+++.+.|++.. ....++.+|...|...+..+++... T Consensus 1 Ma~i~~~Gld~l~~~L~~~~--~~~~v~~~~~~~~~~~~~~~~~~a~--------------------------------- 45 (114) T protein:vir:27 1 MATIEFEGLDEMAQSLLKNA--SPEKRSKVLRKYGSKLKEAAVNRAQ--------------------------------- 45 (114) T ss_pred CeeeeeehHHHHHHHHHHhc--CHHHHHHHHHHHHHHHHHHHHHhcc--------------------------------- Confidence 22100011133333333211 1224456666666555555543210 Q ss_pred HHHhhhcccccccCccCchHHHHHHHhhcceeeecCc-eeEEeeccCC Q lcl|NC_019527. 144 QELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA-TIKVKVNYGR 190 (190) Q Consensus 144 ~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~-~~~~~~~~~~ 190 (190) ...| +|||.|++||+..+.+|. +|-....|++ T Consensus 46 --------------~~~p-~~TG~Lr~sI~~~~~~~~~~V~~~~~Ya~ 78 (114) T protein:vir:27 46 --------------FNKG-YSTGATRRSITLQVESDKATVEALTSYSG 78 (114) T ss_pred --------------cCCC-CCchhhhhceeeeecCCeeEecCCCCccc Confidence 1234 699999999999887654 4444456766 No 148 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=86.61 E-value=0.00086 Score=37.38 Aligned_cols=79 Identities=22% Similarity=0.250 Sum_probs=36.4 Q ss_pred CC-CCch--hhHHHHHHHHHHhhcCC--EEEEEecCCC-----------CCCCC--ccHHHHHHHhhcCccc-------- Q lcl|NC_019527. 1 MA-TLTG--GDKLAKILADIGGKAQG--SVDVGFMSGA-----------TYPDG--TPVAQVAFWNEFGHGG-------- 54 (190) Q Consensus 1 Ma-~i~~--~d~l~~il~~l~~l~~~--~V~VGi~~~~-----------~~~dG--~~vA~iA~~~EfG~~~-------- 54 (190) |+ .+.. -+.+.++-..+....+. -|.-|=+... .+-++ .+.+.+|.++|||+.. T Consensus 14 ~~~~~~~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~V~~~~~YA~~ve~GT~ph~I~Pk~~ 93 (137) T protein:vir:10 14 EARQFQVIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIVVAGPLRLDSGVTAHADYARYVHDGTRAHVIRPRRP 93 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeeeccccceEEEEecCCCccceeeecCCCCceeecccc Confidence 11 1000 01122222222211111 1111111111 11111 1458889999999720 Q ss_pred ----------------cC---CCCCCchhhHHHHHHHHHHHHHH Q lcl|NC_019527. 55 ----------------RF---PAPPRPFFRNMVNEKSSEWPKRL 79 (190) Q Consensus 55 ----------------~~---~IP~RPFlr~~~~~~~~~~~~~l 79 (190) .+ ++||||||+++++++..+-+..- T Consensus 94 k~~l~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~~~~~~~~~ 137 (137) T protein:vir:10 94 GGVLRFTVGGRVVYARRVNHPGTRARPFLRNAAERVVARETATS 137 (137) T ss_pred ceeeeEeeCCeeEecceeecCCCCCCchHHHHHHHhhhhhcccC Confidence 01 47799999999999987765544 No 149 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=85.57 E-value=0.0036 Score=33.99 Aligned_cols=73 Identities=22% Similarity=0.251 Sum_probs=41.3 Q ss_pred CC-CCchhhHHHHHHHHHHhhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHH--HHHHHH Q lcl|NC_019527. 1 MA-TLTGGDKLAKILADIGGKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEK--SSEWPK 77 (190) Q Consensus 1 Ma-~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~--~~~~~~ 77 (190) |+ +|+-.+ ..+.......+.|||... ..+++|.+.++|+. .+|+-||+..+.++. +.++-+ T Consensus 61 laD~I~~~~------~~iDg~~~g~s~VG~~kk-------~~a~~A~f~n~GT~---k~~~~hFve~~~~e~~~k~~vl~ 124 (140) T protein:vir:48 61 MADGLSVQS------TNVDGRKNGVSTVGWVNR-------YHAQNARRLNDGTK---KYRADHFVTNVQNDSAVQTKVLL 124 (140) T ss_pred chhceeecc------cccccccCceeeeccCCC-------cceeeeeccccCcc---ccCCCchhHHHHHhhhhHHHHHH Confidence 43 222100 122223345677888532 24899999999984 589999999999875 334443 Q ss_pred HHH----HHHhhcCCc Q lcl|NC_019527. 78 RLG----DAIKHYDGD 89 (190) Q Consensus 78 ~l~----~~i~~g~~~ 89 (190) ... +.|..-..+ T Consensus 125 A~~~~~~~~l~~~~~~ 140 (140) T protein:vir:48 125 AEKEEYEKLIRKKGGE 140 (140) T ss_pred HHHHHHHHHHHhhcCC Confidence 332 333221112 No 150 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=85.52 E-value=0.0027 Score=34.65 Aligned_cols=86 Identities=21% Similarity=0.184 Sum_probs=40.8 Q ss_pred CC-CCchhhHHHHHHHHHHhhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHH--HHHHHH Q lcl|NC_019527. 1 MA-TLTGGDKLAKILADIGGKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEK--SSEWPK 77 (190) Q Consensus 1 Ma-~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~--~~~~~~ 77 (190) |+ +|.-. -..+.......+.|||.... .+++|.+.|+|+. .+||.||+..+.++. +.++-+ T Consensus 61 laD~I~~s------~~~idG~~dG~s~VG~~~~~-------~a~~a~f~n~GT~---km~~~hFie~tr~e~~~k~~vl~ 124 (153) T protein:vir:49 61 MADGLAVQ------STNADGRKNGVSTVGWKNNY-------HAQNARRLNDGTK---KYRADHFITNVQNDSTVKNKVLL 124 (153) T ss_pred ccccceec------cccccccccceeeecccCCc-------cceeeeecccCcc---cCCCChhhHHHHHHhhHHHHHHH Confidence 33 22110 01122223446778886432 4799999999974 589999999999875 334443 Q ss_pred HH----HHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccc Q lcl|NC_019527. 78 RL----GDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIY 128 (190) Q Consensus 78 ~l----~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~ 128 (190) .+ ++.|.. -|- --||.+..+-|..- T Consensus 125 A~~~~~~~il~~-----------~~~---------------~~~~~~~~~~~~~~ 153 (153) T protein:vir:49 125 AEKEEYEKLIRR-----------KGG---------------VYLSASNFKTKRAT 153 (153) T ss_pred HHHHHHHHHHHh-----------cCC---------------eeeeccccccccCC Confidence 22 233321 000 00000000000000 No 151 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=85.26 E-value=0.003 Score=34.44 Aligned_cols=71 Identities=21% Similarity=0.184 Sum_probs=41.1 Q ss_pred CC-CCchhhHHHHHHHHHHhhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHHHH--HHHHHH Q lcl|NC_019527. 1 MA-TLTGGDKLAKILADIGGKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEK--SSEWPK 77 (190) Q Consensus 1 Ma-~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~--~~~~~~ 77 (190) |+ +|.-. -..+.......+.|||... ..|++|.+.++|+. .+|+.||+..+.++. ++++-+ T Consensus 61 laD~I~~~------~~~idg~~dG~s~VG~~k~-------~~a~~a~f~NdGT~---k~~~~hFve~t~~e~~~~~~vl~ 124 (140) T protein:vir:48 61 MADGLAVQ------STNVDGRKNGVATVGWKNN-------YHAQNARRLNDGTK---KYRADHFVTNVQNDSAVRDKVLL 124 (140) T ss_pred ccccceec------ccccccccccceeecccCC-------CceeEEeecccCcc---ccCCCchHHHHHHhhhhHHHHHH Confidence 33 22210 0122223344667888632 24899999999984 589999999999864 444444 Q ss_pred HH----HHHHh--hcC Q lcl|NC_019527. 78 RL----GDAIK--HYD 87 (190) Q Consensus 78 ~l----~~~i~--~g~ 87 (190) .. ++.|. .|+ T Consensus 125 A~~~~y~~~l~kk~~~ 140 (140) T protein:vir:48 125 AEKEEYEKLIRKKGGE 140 (140) T ss_pred HHHHHHHHHHHhhcCC Confidence 33 33342 233 No 152 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=84.24 E-value=0.014 Score=30.78 Aligned_cols=81 Identities=14% Similarity=0.138 Sum_probs=40.1 Q ss_pred CC-------CCchhhHHHHHHHHHH--hhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccc----------------- Q lcl|NC_019527. 1 MA-------TLTGGDKLAKILADIG--GKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGG----------------- 54 (190) Q Consensus 1 Ma-------~i~~~d~l~~il~~l~--~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~----------------- 54 (190) .+ +.+....|.++.+.+. ...+..+.++++.+ .++.||++|.||-.+ T Consensus 59 w~pRK~~~~k~k~~rm~~kL~~~~~~~~~~~~~~~~~~~~g-------~~~~IA~vHQ~G~~~rv~~~~~~~~~~~~~~~ 131 (231) T protein:vir:37 59 WEKRKPVDGEIKNKRLLKKVLRYASILAEERGKGRIYYKNP-------LTGEIAQKQQDGFTEHFRVFATDKNKNGSGND 131 (231) T ss_pred CchhcccccchhhHHHHHHhHHhhccccccCCceEEeeecc-------hHHHHHHHhhcCcccccchhhhhhccCCCCCC Confidence 11 1111112333433322 22333344444432 368999999999310 Q ss_pred ------------------------------------------------------------------cCCCCCCchhhHHH Q lcl|NC_019527. 55 ------------------------------------------------------------------RFPAPPRPFFRNMV 68 (190) Q Consensus 55 ------------------------------------------------------------------~~~IP~RPFlr~~~ 68 (190) .+.+|+||||-..- T Consensus 132 pATr~QAk~Lr~lGy~v~~~k~k~~k~~~rkps~kwI~~~ls~~qAgliIR~L~~k~~~~~~k~~W~I~~paR~FLG~~~ 211 (231) T protein:vir:37 132 RATIRQAQKLRSLGYRKRNGKNRQGKTKYRLYTIKEIRERLTRTWASMEIRRLENKVNAGNGKTNWEIHVPARPFLDTRE 211 (231) T ss_pred CCCHHHHHHHHHhcccccCCCCCCCCCCcCcCCHHHHHHhhhhHHHHHHHHHHhcccccccCcceeeeecCcccccCCCH Confidence 03467777777666 Q ss_pred HHHHHHHHHHHHHHHhhcCCc Q lcl|NC_019527. 69 NEKSSEWPKRLGDAIKHYDGD 89 (190) Q Consensus 69 ~~~~~~~~~~l~~~i~~g~~~ 89 (190) ++...-+...|.+ |.+|... T Consensus 212 ~e~~~~l~~~l~~-i~~~~~~ 231 (231) T protein:vir:37 212 KENVDILREITLK-FLSGEYK 231 (231) T ss_pred HHHHHHHHHHHHH-HhcccCC Confidence 6665555555554 3344333 No 153 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=84.23 E-value=0.0065 Score=32.59 Aligned_cols=90 Identities=18% Similarity=0.152 Sum_probs=44.8 Q ss_pred CCCCchhh--HHHHHHHHHHhhcCCE-------------------------EEEEecCC-----------CC--CCCC-- Q lcl|NC_019527. 1 MATLTGGD--KLAKILADIGGKAQGS-------------------------VDVGFMSG-----------AT--YPDG-- 38 (190) Q Consensus 1 Ma~i~~~d--~l~~il~~l~~l~~~~-------------------------V~VGi~~~-----------~~--~~dG-- 38 (190) ||++.+.| .|+++.++|+.+.... |.-|-+-. .. ..++ T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~g~~~~ 80 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQGNNYI 80 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccceeecCCeeE Confidence 88775433 3444444443322211 11122211 01 1122 Q ss_pred ---ccHHHHHHHhhcCccccC---CCCCCchhhHHHHHHHHHHHHHHHHHHhh---cCCcH Q lcl|NC_019527. 39 ---TPVAQVAFWNEFGHGGRF---PAPPRPFFRNMVNEKSSEWPKRLGDAIKH---YDGDG 90 (190) Q Consensus 39 ---~~vA~iA~~~EfG~~~~~---~IP~RPFlr~~~~~~~~~~~~~l~~~i~~---g~~~~ 90 (190) .+++.+|.+-|||+-... -+|++.+|+.++++.+..+.+.+++.|.. +-.|+ T Consensus 81 v~v~n~~~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~~~~~~ 141 (141) T protein:vir:79 81 IEVVNPTEYASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLKGVFDA 141 (141) T ss_pred EEEecCCcchhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 256789999999974211 13555556666677777776666665532 33344 No 154 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=83.76 E-value=0.011 Score=31.36 Aligned_cols=70 Identities=20% Similarity=0.226 Sum_probs=36.8 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 63 FFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 63 Flr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) +=-.++ +++.+.|.+... ....+.+|...|...+.++|. T Consensus 1 i~i~Gl----d~l~~~l~~~~~--~~~~~~al~~~a~~i~~~ak~----------------------------------- 39 (108) T protein:vir:74 1 MKITGI----DALQKKLRKNAT--LDDVKHVVKSNTASMNKNMQN----------------------------------- 39 (108) T ss_pred CcchhH----HHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHH----------------------------------- Confidence 111222 333333333211 123456666666655555432 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeeecC-ceeEEe--eccCC Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGG-ATIKVK--VNYGR 190 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g-~~~~~~--~~~~~ 190 (190) .-| +|||.|++||++.+++| -+..|. ..|+. T Consensus 40 ----------------~aP-v~TG~Lr~si~~~~~~~~~~~~V~~~~~Ya~ 73 (108) T protein:vir:74 40 ----------------LAP-VDTGNMKRSITSEFTDGGLSGTTGPHTDYAG 73 (108) T ss_pred ----------------hCC-CCchhhhccceeeeecCceEEEeecCCCccc Confidence 113 48999999999999865 344443 44555 No 155 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=83.58 E-value=0.0077 Score=32.17 Aligned_cols=77 Identities=17% Similarity=0.273 Sum_probs=41.1 Q ss_pred CC-CCchhhHHHHHHH-------------------------HHH-hh-----c-----------------CCEEEEEecC Q lcl|NC_019527. 1 MA-TLTGGDKLAKILA-------------------------DIG-GK-----A-----------------QGSVDVGFMS 31 (190) Q Consensus 1 Ma-~i~~~d~l~~il~-------------------------~l~-~l-----~-----------------~~~V~VGi~~ 31 (190) |+ .++|.+.|.+-|+ .|+ ++ + .+.|+|||-+ T Consensus 1 MsvevkGv~eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWIKGKRTVTIRWRG 80 (134) T ss_pred CeEEeecHHHHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeecCCceEEEEEEEc Confidence 66 6666554322221 222 11 1 1455566543 Q ss_pred CCCCCCCccHHHHHHHhhcCccccCCCCCCchhhH--------HHHHHHHHHHHHHHHHHhhc Q lcl|NC_019527. 32 GATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRN--------MVNEKSSEWPKRLGDAIKHY 86 (190) Q Consensus 32 ~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~--------~~~~~~~~~~~~l~~~i~~g 86 (190) +... --|-.+||||..+ -...+|++| +++..+..+.+.++.-|..- T Consensus 81 ~~~R------~~ivHLnE~Gyt~---~r~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 81 PFER------FRIVHLIENGHVE---KKSGKFVKPKAMGGINRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred CCce------eeEEEeeecceee---cCCCCeeccchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 2210 2355679999743 245677777 77777777777776655443 No 156 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=83.18 E-value=0.013 Score=30.89 Aligned_cols=86 Identities=16% Similarity=0.134 Sum_probs=37.1 Q ss_pred HHHhhcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHH Q lcl|NC_019527. 45 AFWNEFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLML 124 (190) Q Consensus 45 A~~~EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~ 124 (190) -.+|-|= -.|-.|-.. ..--+++.+.|++.-..-.-..+++|+..+..+++.++. T Consensus 1 ~~~~~~~-------~~~~~Ma~v-~~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~----------------- 55 (149) T protein:vir:10 1 MKLNYYD-------LSRCHMAKV-KYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVA----------------- 55 (149) T ss_pred Ceeeeec-------cchhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------------- Confidence 1111111 133333221 112233333333322211112344455444444443321 Q ss_pred hccccccccchhhhhhhhhHHHhhhcccccccCccCchHHHHHHHhhcceeeecCc-eeEEe--eccCC Q lcl|NC_019527. 125 RSIYGNNPQEIRARDVLAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA-TIKVK--VNYGR 190 (190) Q Consensus 125 K~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~-~~~~~--~~~~~ 190 (190) .-| +|||.|++||++.|.++. +..|+ +.|+. T Consensus 56 ----------------------------------~aP-vdTG~L~~SI~~~~~~~g~~~~V~~~~~YA~ 89 (149) T protein:vir:10 56 ----------------------------------LAP-VDLGFLEESIDFKYFDGGLSSVISVGADYAI 89 (149) T ss_pred ----------------------------------hCC-cccchhhccceEEecCCcEEEEEecCCCccc Confidence 123 589999999999887653 44444 33433 No 157 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=82.65 E-value=0.0055 Score=32.96 Aligned_cols=81 Identities=15% Similarity=0.176 Sum_probs=47.7 Q ss_pred CchhhHHHHHHH--------------------HHHhhcCCEEEEEecC----CC--------CCCCCc--------cHHH Q lcl|NC_019527. 4 LTGGDKLAKILA--------------------DIGGKAQGSVDVGFMS----GA--------TYPDGT--------PVAQ 43 (190) Q Consensus 4 i~~~d~l~~il~--------------------~l~~l~~~~V~VGi~~----~~--------~~~dG~--------~vA~ 43 (190) |.|-|.|.+.|+ +.+...+..|. |+.. +. -.++|. ..++ T Consensus 1 i~G~~~L~~~Lk~~s~~dvk~VVkkN~ael~~r~q~~~~~pv~-~~~k~~dTG~lkRSi~l~~~~~g~~~~vgp~g~t~d 79 (127) T protein:vir:98 1 MTGMPALEVKLRSMSEKRWDRVANKNLTEMFNRAARPPGTPIG-KNTKRHKSGELLRSRRLKKVNSSKDVITGNFGYIKD 79 (127) T ss_pred CcChHHHHHHHHHhhHHHHHHHHhhhhHHHHHHHHhccCCcee-ccccccCcccceeeeEEEEecCCceEEeccCccccc Confidence 444444433332 22222222221 1111 11 012332 2588 Q ss_pred HHHHhhcCccc-----cCC-CCCCchhhHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019527. 44 VAFWNEFGHGG-----RFP-APPRPFFRNMVNEKSSEWPKRLGDAIKH 85 (190) Q Consensus 44 iA~~~EfG~~~-----~~~-IP~RPFlr~~~~~~~~~~~~~l~~~i~~ 85 (190) +|-+.|||+-- .++ .|+-|||.|+|+.++..+.+.|+..++. T Consensus 80 YapyvEyGTR~m~~~~~~gf~~aqp~l~paf~~Qk~iF~~DL~~l~k~ 127 (127) T protein:vir:98 80 YAPHVEYGHRIVRNGKQVGYANGTKYLFNNVKKQREIYRQDMLNELRR 127 (127) T ss_pred ccceeecceeeeecccccccccCccccccchHHHhHHHHHHHHHHhcC Confidence 99999999731 111 6899999999999999999999998877 No 158 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=82.49 E-value=0.014 Score=30.77 Aligned_cols=71 Identities=18% Similarity=0.203 Sum_probs=33.1 Q ss_pred HH-HHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhhHHH Q lcl|NC_019527. 68 VN-EKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAAQEL 146 (190) Q Consensus 68 ~~-~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~~~~ 146 (190) ++ +-.+++.+.|++.-...+...+.+|...+..+++ .+.. T Consensus 1 i~i~Gld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~----~ak~----------------------------------- 41 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDIDKNINATTEEAANFIED----RAKT----------------------------------- 41 (173) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHH----------------------------------- Confidence 11 1223333333332111111223333333333333 3222 Q ss_pred hhhcccccccCccCchHHHHHHHhhccee-eecCceeEEeec----cCC Q lcl|NC_019527. 147 VEEGFQGAGGSQAKPLVWTGHMLNSITYQ-VDGGATIKVKVN----YGR 190 (190) Q Consensus 147 ~~~g~~~~~~~s~kPLIDTG~L~~SIty~-V~~g~~~~~~~~----~~~ 190 (190) .-| +|||.|++||.+. ++++.++.+.|+ |++ T Consensus 42 ------------~aP-v~TG~Lr~sI~~~~~~~~~~~~~~v~~~~~Ya~ 77 (173) T protein:vir:10 42 ------------LAP-KNFGKLAQSISTSDLKAKDLISKKITVNELYGA 77 (173) T ss_pred ------------hCC-cCchhhhhcceeeeeccCceeEEeeCCCcccch Confidence 123 5899999999886 566777776654 444 No 159 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=81.76 E-value=0.015 Score=30.58 Aligned_cols=69 Identities=17% Similarity=0.050 Sum_probs=29.8 Q ss_pred CCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchh Q lcl|NC_019527. 57 PAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIR 136 (190) Q Consensus 57 ~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~ 136 (190) -|-.|+-|. ...+..++.+ .++++|+.++...++..| . T Consensus 1 ~~~~~~~l~------~~~l~~~~~~-------~~~~~~~~~a~~ve~~ak-------------------~---------- 38 (137) T protein:vir:10 1 MVAHTLRIE------RAQLHGLGMD-------EARKAVNRVVRRTFTRSQ-------------------I---------- 38 (137) T ss_pred CcccccccC------hhhHhhHHHH-------HHHHHHHHHHHHHHHHHH-------------------h---------- Confidence 011111111 1111111111 223334433333333221 1 Q ss_pred hhhhhhhHHHhhhcccccccCccCchHHHHHHHhhcceeee--cCceeEEe----eccCC Q lcl|NC_019527. 137 ARDVLAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVD--GGATIKVK----VNYGR 190 (190) Q Consensus 137 ~~~~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~--~g~~~~~~----~~~~~ 190 (190) .-| +|||+|++||++.+. +|..+... ++|+. T Consensus 39 ----------------------~aP-v~TG~Lr~SI~~~~~~~~g~~v~~~V~~~~~YA~ 75 (137) T protein:vir:10 39 ----------------------LAP-VDTGYLRASGRLVLGRERGAVVIGSVEYTARYAA 75 (137) T ss_pred ----------------------cCC-cCchhhhccceeeeeeccccEEEEEecCCcccce Confidence 223 899999999999875 34444433 45776 No 160 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=81.04 E-value=0.016 Score=30.49 Aligned_cols=73 Identities=19% Similarity=0.231 Sum_probs=39.7 Q ss_pred hhHHHH-HHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 64 FRNMVN-EKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 64 lr~~~~-~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) |+-+|+ +..+++.+.|.+... ....+.+|...+...+.+++. T Consensus 1 M~~~i~i~Gld~l~~~L~~~~~--~~~~~~al~~~~~~i~~~ak~----------------------------------- 43 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAAS--LKGVQQVVKSNTSNMTANMQK----------------------------------- 43 (112) T ss_pred CceeeeehhHHHHHHHHHhhhh--HHHHHHHHHHHHHHHHHHHHH----------------------------------- Confidence 665554 224555555544211 112455555555554444421 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeeecC-ceeEEeec--------cCC Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGG-ATIKVKVN--------YGR 190 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g-~~~~~~~~--------~~~ 190 (190) .-| +|||.|++||+..+.+| -++.|+-| ||- T Consensus 44 ----------------~aP-vdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~GT 83 (112) T protein:vir:36 44 ----------------LVP-VDTGYMKRSIKMELTEGGFSGQAGPHTDYSAYVEYGT 83 (112) T ss_pred ----------------hCC-CCchhhhhceeeeecCCceEEEeecCCCccceeeccc Confidence 112 58999999999888754 47776532 332 No 161 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=80.25 E-value=0.0027 Score=34.63 Aligned_cols=70 Identities=16% Similarity=0.118 Sum_probs=29.2 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhh Q lcl|NC_019527. 64 FRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAA 143 (190) Q Consensus 64 lr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~ 143 (190) |-.++ .-+++...++..++..+...+.+.... +....+ T Consensus 1 m~~s~----------------~i~i~~~~l~~~v~~~~k~~l~~~a~~----------i~~~ak---------------- 38 (137) T protein:vir:10 1 MPVTA----------------RIHINEPELERQTGAIFRGKHRSITRR----------IATQAR---------------- 38 (137) T ss_pred CCeeE----------------EEeeCHHHHHHHHHHHHHHHHHHHHHH----------HHHHHH---------------- Confidence 10000 012233333333433333333222111 111110 Q ss_pred HHHhhhcccccccCccCchHHHHHHHhhcceeeecCc--e--eEEe--eccCC Q lcl|NC_019527. 144 QELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA--T--IKVK--VNYGR 190 (190) Q Consensus 144 ~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~--~--~~~~--~~~~~ 190 (190) ..-| +|||.|++||++.+.++. + +.|+ +.|+. T Consensus 39 --------------~~aP-v~tG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~ 76 (137) T protein:vir:10 39 --------------ADVP-VRTGNLGRGIQEMPQTYRPFHVGGGVEDNVDYAA 76 (137) T ss_pred --------------HhCC-cccchhhcCceeeeeccccceEEEEEecCCCcee Confidence 0223 799999999999875543 2 3332 34655 No 162 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=80.20 E-value=0.033 Score=28.70 Aligned_cols=75 Identities=24% Similarity=0.237 Sum_probs=40.4 Q ss_pred hhH-HHH-HHHHHHHHHHHHHHhhcCC--cHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhh Q lcl|NC_019527. 64 FRN-MVN-EKSSEWPKRLGDAIKHYDG--DGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARD 139 (190) Q Consensus 64 lr~-~~~-~~~~~~~~~l~~~i~~g~~--~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~ 139 (190) |-. +|+ +.-+++.+.|.++...+.. ..+.+|+.+|..+...|+.. T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~------------------------------- 49 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEAN------------------------------- 49 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHh------------------------------- Confidence 332 333 2336666666665544321 24555666666655544221 Q ss_pred hhhhHHHhhhcccccccCccCchHHHHHHHhhccee--ee--cCceeEEe--------eccCC Q lcl|NC_019527. 140 VLAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQ--VD--GGATIKVK--------VNYGR 190 (190) Q Consensus 140 ~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~--V~--~g~~~~~~--------~~~~~ 190 (190) .| +|||.|++|++.. .. +|-++.|. ||||. T Consensus 50 --------------------tP-VdTG~Lr~S~~~~~~~~~~~~~~~~V~n~~~YA~~VE~Gh 91 (144) T protein:vir:10 50 --------------------TP-VKQGNLRRSWTAEGPTYGCGGWTIKLINNAEYASYVESGH 91 (144) T ss_pred --------------------CC-CCcchhccceeecceeeecCeeEEEEecCCCcccccccce Confidence 23 7899999999863 23 33344443 56664 No 163 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=79.46 E-value=0.023 Score=29.52 Aligned_cols=70 Identities=20% Similarity=0.223 Sum_probs=36.1 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 63 FFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 63 Flr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) +=-.+++ ++.+.|++... ....+.+|...+...++++|. T Consensus 1 i~i~Gld----~l~~~l~~~~~--~~~~~~al~~~a~~i~~~ak~----------------------------------- 39 (108) T protein:vir:98 1 MKITGID----ALQKKLRKNAT--LNDVKHVVKRNTVSMNKNMQN----------------------------------- 39 (108) T ss_pred CcchhHH----HHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHH----------------------------------- Confidence 2223333 33333333211 113455666666655555432 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeeecCc-eeEEe--eccCC Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGA-TIKVK--VNYGR 190 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~-~~~~~--~~~~~ 190 (190) .-| +|||.|++||++.+++|. +..|. ..|+. T Consensus 40 ----------------~ap-vdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~ 73 (108) T protein:vir:98 40 ----------------LAP-VDTGNMKRSITSEFTDGGLTGTTIPHTDYAG 73 (108) T ss_pred ----------------hCC-CCchhhHhhceeeeecCceEEEeecCCCccc Confidence 123 589999999999998653 44443 34544 No 164 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=74.41 E-value=0.017 Score=30.28 Aligned_cols=73 Identities=14% Similarity=0.052 Sum_probs=34.0 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhh Q lcl|NC_019527. 64 FRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAA 143 (190) Q Consensus 64 lr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~ 143 (190) |.-.++ ..+++.+.|.+.-....-..+.+|...|...+.++|.. T Consensus 1 msi~i~-Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~----------------------------------- 44 (114) T protein:vir:95 1 MAIKWQ-GIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQL----------------------------------- 44 (114) T ss_pred Ceeeee-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----------------------------------- Confidence 332222 12333333333222111123445555544444443221 Q ss_pred HHHhhhcccccccCccCchHHHHHHHhhcceeeecCceeEEe--eccCC Q lcl|NC_019527. 144 QELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGATIKVK--VNYGR 190 (190) Q Consensus 144 ~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~~~~~--~~~~~ 190 (190) -| +|||.|++||+... +|-+..|. ..|+. T Consensus 45 ----------------aP-v~TG~Lr~sI~~~~-~g~~~~V~~~~~Ya~ 75 (114) T protein:vir:95 45 ----------------AP-KDTEFLKDHITTSY-PGMEAHIHGEAGYDG 75 (114) T ss_pred ----------------CC-cCchhhhhceeeec-CceEEEeecCCCccc Confidence 12 58999999998765 34444443 44555 No 165 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=73.38 E-value=0.053 Score=27.60 Aligned_cols=79 Identities=22% Similarity=0.330 Sum_probs=39.6 Q ss_pred CCCCchhhHHHHHHHH-HH-----hhc------------------------------------------CCEEEEEecCC Q lcl|NC_019527. 1 MATLTGGDKLAKILAD-IG-----GKA------------------------------------------QGSVDVGFMSG 32 (190) Q Consensus 1 Ma~i~~~d~l~~il~~-l~-----~l~------------------------------------------~~~V~VGi~~~ 32 (190) ||.++|.+.+.+-|++ |. ... -+.|+|||. + T Consensus 4 ~aevkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s~~~~~~G~r~V~VgW~-G 82 (132) T protein:vir:96 4 FANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGFT-T 82 (132) T ss_pred cccccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeecCeeecCCceEEEeccc-C Confidence 6677776644322221 11 000 123444443 1 Q ss_pred CCCCCCccHHHHHHHhhcCccccCCCCCCc--hhhHHHHHHHHHHHHHHHHHHhhcCCcH Q lcl|NC_019527. 33 ATYPDGTPVAQVAFWNEFGHGGRFPAPPRP--FFRNMVNEKSSEWPKRLGDAIKHYDGDG 90 (190) Q Consensus 33 ~~~~dG~~vA~iA~~~EfG~~~~~~IP~RP--Flr~~~~~~~~~~~~~l~~~i~~g~~~~ 90 (190) ++ -.|-.+||||...+ |=||- +++.+++..+..+...++.-|... .+. T Consensus 83 pR-------~~ivHLNE~GyGk~--~~PrG~G~I~~a~~~se~~~~~~~~~elkk~-l~~ 132 (132) T protein:vir:96 83 PR-------WNIVHLQELEYGWK--HNRRGVGVIRRYSDILETIYPRGIRDKLKRG-FDG 132 (132) T ss_pred Cc-------eeEEeeecccccCC--cCCCcchHHHHHHHhhhhHHHHHHHHHHHHH-hcC Confidence 12 23556899998543 55665 688888888855555554444321 011 No 166 >protein:vir:78755 Length: 228 # NCBI annotation: putative tail completion protein # Family: family:all:743 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285651;genbank:gi:148727157;genbank:GeneID:5220102 Probab=72.89 E-value=0.035 Score=28.58 Aligned_cols=91 Identities=21% Similarity=0.316 Sum_probs=40.0 Q ss_pred CCCCchh-----hHHHHHHHHHHhhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccc--------------------- Q lcl|NC_019527. 1 MATLTGG-----DKLAKILADIGGKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGG--------------------- 54 (190) Q Consensus 1 Ma~i~~~-----d~l~~il~~l~~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~--------------------- 54 (190) .+.=+.+ -+|.+.|+- .+.....+.|||..+.. ...++.||++|.||-.. T Consensus 55 ~~pRKr~krKMl~~L~k~Lk~-~~~~~~~a~v~f~~~~~---~~~~~rIA~vHq~G~~~~v~~~~~~~~~~~r~~~~~pa 130 (228) T protein:vir:78 55 WAPRKRGKRKMLRGLPKLLQI-REPRQDMAELGFTKGTM---SAHAGVIANTHQKGHTYKVTAASRRRIAPSDVGKNKQA 130 (228) T ss_pred ChhhhhhHHHHHhhhHHhhhh-hcccccceEEEeecCcc---cchHHHHHHHHhcCcccccccchhhhhhcccCCCCCCC Confidence 1100000 012233322 23345578999965432 12579999999999410 Q ss_pred --------------------------------------------------------cCCCCCCchhhHHHHHHHHHHHHH Q lcl|NC_019527. 55 --------------------------------------------------------RFPAPPRPFFRNMVNEKSSEWPKR 78 (190) Q Consensus 55 --------------------------------------------------------~~~IP~RPFlr~~~~~~~~~~~~~ 78 (190) .+.+|+||||-..-++....+... T Consensus 131 Tr~QAk~Lr~lGy~~~~~~~k~~rkps~kwI~~nls~gqAgliir~L~~k~~k~~W~I~~PaR~FLG~s~~e~~~~l~~~ 210 (228) T protein:vir:78 131 SKAQARKLRELGFKRPGKRKRAYRSASLGWITANLNYAQAGLLIKKLKDEPVKESWEIQLPARPFLGANARQRQQAFALR 210 (228) T ss_pred CHHHHHHHHHhhccccCCcCCCcccCCHHHHHHHhhHHHHHHHHHHHhCCCCccceeeecCcccccCCCHHHHHHHHHHH Confidence 134677777754444333333332 Q ss_pred HHHHHh-hcCCcHHHHHHH Q lcl|NC_019527. 79 LGDAIK-HYDGDGRKALAS 96 (190) Q Consensus 79 l~~~i~-~g~~~~~~aL~~ 96 (190) |.. |. .++..+++.=.. T Consensus 211 l~~-i~~g~~~~~qd~~~~ 228 (228) T protein:vir:78 211 PES-IDYGWDVNKQDMKGK 228 (228) T ss_pred HHh-cccCCCcchhhccCC Confidence 222 21 122222211111 No 167 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=72.45 E-value=0.038 Score=28.35 Aligned_cols=77 Identities=14% Similarity=0.345 Sum_probs=37.9 Q ss_pred CC-CCchhhHHHH-------------------------HHHHHH----h-------------------hcCCEEEEEecC Q lcl|NC_019527. 1 MA-TLTGGDKLAK-------------------------ILADIG----G-------------------KAQGSVDVGFMS 31 (190) Q Consensus 1 Ma-~i~~~d~l~~-------------------------il~~l~----~-------------------l~~~~V~VGi~~ 31 (190) |+ .++|.+.|.+ +++.++ . -..+.|+|||-+ T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRG 80 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEc Confidence 33 3344332211 111111 0 112357777744 Q ss_pred CCCCCCCccHHHHHHHhhcCccccCCCCCCchhhH--------HHHHHHHHHHHHHHHHHhhc Q lcl|NC_019527. 32 GATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRN--------MVNEKSSEWPKRLGDAIKHY 86 (190) Q Consensus 32 ~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~--------~~~~~~~~~~~~l~~~i~~g 86 (190) +... --|-.+||||.... -..+|++| +++..+..+.+.++.-|..- T Consensus 81 ~~~R------~~iiHLNE~Gytr~---~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 81 SKDR------YKIVHLIEYGHVQK---GTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred CCce------eEEEEeecccceec---ccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 3211 23667899997542 12344444 77777777777776655443 No 168 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=72.45 E-value=0.038 Score=28.35 Aligned_cols=77 Identities=14% Similarity=0.345 Sum_probs=37.9 Q ss_pred CC-CCchhhHHHH-------------------------HHHHHH----h-------------------hcCCEEEEEecC Q lcl|NC_019527. 1 MA-TLTGGDKLAK-------------------------ILADIG----G-------------------KAQGSVDVGFMS 31 (190) Q Consensus 1 Ma-~i~~~d~l~~-------------------------il~~l~----~-------------------l~~~~V~VGi~~ 31 (190) |+ .++|.+.|.+ +++.++ . -..+.|+|||-+ T Consensus 1 msvevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRG 80 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEc Confidence 33 3344332211 111111 0 112357777744 Q ss_pred CCCCCCCccHHHHHHHhhcCccccCCCCCCchhhH--------HHHHHHHHHHHHHHHHHhhc Q lcl|NC_019527. 32 GATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRN--------MVNEKSSEWPKRLGDAIKHY 86 (190) Q Consensus 32 ~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~--------~~~~~~~~~~~~l~~~i~~g 86 (190) +... --|-.+||||.... -..+|++| +++..+..+.+.++.-|..- T Consensus 81 ~~~R------~~iiHLNE~Gytr~---~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:95 81 SKDR------YKIVHLIEYGHVQK---GTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred CCce------eEEEEeecccceec---ccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 3211 23667899997542 12344444 77777777777776655443 No 169 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=65.30 E-value=0.11 Score=25.81 Aligned_cols=102 Identities=14% Similarity=0.076 Sum_probs=53.6 Q ss_pred hhHHHHHH-HHHHHHHHHHHHhhcCCc--HHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhh Q lcl|NC_019527. 64 FRNMVNEK-SSEWPKRLGDAIKHYDGD--GRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDV 140 (190) Q Consensus 64 lr~~~~~~-~~~~~~~l~~~i~~g~~~--~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~ 140 (190) |-.+|+.. -+++.+.|.+.+..+.+. .+++|+.+|......|+...=-+..+-.-.......+. .. T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k------~~----- 69 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGK------HV----- 69 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccc------hh----- Confidence 87788754 377777777766555443 36778888877777666532212111111111111110 00 Q ss_pred hhhHHHhhhcccccccCccCchHHHHHHHhhcce--eeecCceeEEe----------eccCC Q lcl|NC_019527. 141 LAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITY--QVDGGATIKVK----------VNYGR 190 (190) Q Consensus 141 ~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty--~V~~g~~~~~~----------~~~~~ 190 (190) . ...|..| .|||.|++|.+. .-+.|.+..|. |+||. T Consensus 70 -------k------~~~~~~~-k~tG~lr~swk~~~~~k~~~~~~v~v~N~~~YA~~VE~GH 117 (163) T protein:vir:10 70 -------K------FWASAHG-KQGGTLQKGWSKSRIEVSGRTYKQKVYNKVYYAPHVEYGH 117 (163) T ss_pred -------h------hhccccc-cccchhhccceecceeecCCceEEEEEecCCccchhhcce Confidence 0 0012233 689999999766 34556666555 34554 No 170 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=63.25 E-value=0.072 Score=26.85 Aligned_cols=74 Identities=22% Similarity=0.172 Sum_probs=33.9 Q ss_pred CCC-chhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhh Q lcl|NC_019527. 59 PPR-PFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRA 137 (190) Q Consensus 59 P~R-PFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~ 137 (190) =+| .|=-.+ .+++.+.|++.. -.-++++++...|..++...|. T Consensus 1 Ma~~~i~~~G----ld~L~~~L~~~~--~~~~v~~vv~~~~~~l~~~ak~------------------------------ 44 (92) T protein:vir:99 1 MADYSISWDG----LDALDEALANQQ--NMNTVKKVVKKHTANLMTATQQ------------------------------ 44 (92) T ss_pred CCceeeEeeh----HHHHHHHHHhhc--cHHHHHHHHHHHHHHHHHHHHH------------------------------ Confidence 011 000011 123333332211 0123444455444444433322 Q ss_pred hhhhhhHHHhhhcccccccCccCchHHHHHHHhhcceeee-cCceeEEe-----eccCC Q lcl|NC_019527. 138 RDVLAAQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVD-GGATIKVK-----VNYGR 190 (190) Q Consensus 138 ~~~~~~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~-~g~~~~~~-----~~~~~ 190 (190) .-| +|||.|++||+..+. ||-|.+|. .+|.. T Consensus 45 ---------------------~ap-~dTG~lrrSI~~~~~~~g~~~~v~~~gp~a~Ya~ 81 (92) T protein:vir:99 45 ---------------------AVP-VDTGHLKQSAQIQISRDGFTGSVTYGGGLVNYAA 81 (92) T ss_pred ---------------------hCC-CCccccceeeeEEeecCCeeEEEEeccCcccccc Confidence 113 589999999996655 45577776 44555 No 171 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=61.67 E-value=0.13 Score=25.49 Aligned_cols=80 Identities=19% Similarity=0.220 Sum_probs=38.8 Q ss_pred hhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhh Q lcl|NC_019527. 64 FRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAA 143 (190) Q Consensus 64 lr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~ 143 (190) |---|. --++|.+.|.+.-....-..+.+|...+...+..++. |+|. T Consensus 1 m~v~i~-Gl~el~~~l~~l~~~~~k~~~~al~~ga~~~~~~~k~---------~ap~----------------------- 47 (128) T protein:vir:38 1 MGVKVT-GDAELLANLNKLQFGVAKEARAAVRDGAQKFADKLKS---------NTPE----------------------- 47 (128) T ss_pred Cccchh-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH---------hCCC----------------------- Confidence 221111 2244555555432222223355666666666665553 2221 Q ss_pred HHHhhhcccccccCccCchHHHHHHHhhccee-e-ecCceeEEeeccCC Q lcl|NC_019527. 144 QELVEEGFQGAGGSQAKPLVWTGHMLNSITYQ-V-DGGATIKVKVNYGR 190 (190) Q Consensus 144 ~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~-V-~~g~~~~~~~~~~~ 190 (190) ++..+.++|+|.++|.+. + ..+.+..+.|-|+. T Consensus 48 --------------~~~~~~~~~h~~d~I~~~~~k~~~g~~~~~VG~~k 82 (128) T protein:vir:38 48 --------------WDGETDMSGHLRDDIKLSSVRETSGLTEVDVGYGK 82 (128) T ss_pred --------------cCCCCcccchhhhhhccccccccCceeEEEeeecC Confidence 234566789999999772 2 23334445554444 No 172 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=54.38 E-value=0.071 Score=26.89 Aligned_cols=89 Identities=19% Similarity=0.236 Sum_probs=47.9 Q ss_pred CCCCchhhH-----HH----HHHHHHHhh------------------------------cCCEEEEEecCCCCCC--CC- Q lcl|NC_019527. 1 MATLTGGDK-----LA----KILADIGGK------------------------------AQGSVDVGFMSGATYP--DG- 38 (190) Q Consensus 1 Ma~i~~~d~-----l~----~il~~l~~l------------------------------~~~~V~VGi~~~~~~~--dG- 38 (190) |+.-..-++ +. ++++.++.. .+..++=||-.+..+. |+ T Consensus 20 ~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~tG~lr~swk~~~~~k~~~~~ 99 (163) T protein:vir:10 20 NANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQGGTLQKGWSKSRIEVSGRTY 99 (163) T ss_pred HhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhccccccccchhhccceecceeecCCce Confidence 443221111 11 222222221 1112333333333222 22 Q ss_pred ----ccHHHHHHHhhcCccccC--CCCCCchhhHHHHHHHHHHHHHHHHHHh-------hcCCc Q lcl|NC_019527. 39 ----TPVAQVAFWNEFGHGGRF--PAPPRPFFRNMVNEKSSEWPKRLGDAIK-------HYDGD 89 (190) Q Consensus 39 ----~~vA~iA~~~EfG~~~~~--~IP~RPFlr~~~~~~~~~~~~~l~~~i~-------~g~~~ 89 (190) .+.+.+|.+-|||+-..+ -+|-+++|+.++++.+.++.+.+++.+. .|.+. T Consensus 100 ~v~v~N~~~YA~~VE~GHR~~~gGfV~G~fml~~s~~~~~~~~~~~~e~~l~~~l~k~~~~~~~ 163 (163) T protein:vir:10 100 KQKVYNKVYYAPHVEYGHKTVNGGFVPGQFFLHKTVEDTKSDMEKRVRDKYDGFMRKVVLGNGK 163 (163) T ss_pred EEEEEecCCccchhhcceeecCCceeccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC Confidence 256888999999974322 2799999999999999888888776553 24333 No 173 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=52.81 E-value=0.057 Score=27.42 Aligned_cols=91 Identities=20% Similarity=0.134 Sum_probs=49.8 Q ss_pred CCCCchh---hHHHHHHHHHHhh----cC----------------------CEEEEEecCCC-CCCCC--ccHHHHHHHh Q lcl|NC_019527. 1 MATLTGG---DKLAKILADIGGK----AQ----------------------GSVDVGFMSGA-TYPDG--TPVAQVAFWN 48 (190) Q Consensus 1 Ma~i~~~---d~l~~il~~l~~l----~~----------------------~~V~VGi~~~~-~~~dG--~~vA~iA~~~ 48 (190) |-+..+- +.|+...++..+. .. .+|+++=-+-+ .-.-| ..|. +|.+- T Consensus 21 mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~L~~Sir~aaT~raa~VrAGr~arVP-YA~~I 99 (143) T protein:vir:13 21 VRALRDKELNKAVREANKASGEVLIPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTASAKGAVIKAGSAARVP-YAAAI 99 (143) T ss_pred HHHhhCCcchHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccchhhccccccccccceeeeecCcCCCC-ccccc Confidence 5555443 3344444332221 11 12222211100 00011 1233 45566 Q ss_pred hcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHH Q lcl|NC_019527. 49 EFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALAS 96 (190) Q Consensus 49 EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~ 96 (190) +||++.++ |-++=||..++...+++|.+..++.|.+ ..++-|+. T Consensus 100 ~~G~r~r~-Is~~rFl~~a~a~te~~~~r~Ye~~i~~---vl~k~l~s 143 (143) T protein:vir:13 100 HFGYRKRN-ISANRFLYRAMARKSDVVAATYERRIAA---VVEKYLES 143 (143) T ss_pred ccCCcccc-cchhhhhhhhhhccCHHHHHHHHHHHHH---HHHHHhcC Confidence 99998874 8999999999999999999988877754 23444443 No 174 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=49.55 E-value=0.25 Score=23.87 Aligned_cols=82 Identities=17% Similarity=0.208 Sum_probs=44.9 Q ss_pred CC-CCchhhHHHHHHHHHHhhcCCEEEEEecCCCCCCCCcc-HHHHHHHhhcCcc---------------ccCCCCCCch Q lcl|NC_019527. 1 MA-TLTGGDKLAKILADIGGKAQGSVDVGFMSGATYPDGTP-VAQVAFWNEFGHG---------------GRFPAPPRPF 63 (190) Q Consensus 1 Ma-~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~dG~~-vA~iA~~~EfG~~---------------~~~~IP~RPF 63 (190) || .|+..+ ..+.........|||.. .|++|+. =|+||.|..-|+. ..+.||.=+| T Consensus 62 LaDsI~~~~------~niDg~~dG~s~VGf~~--k~~~~~~~kA~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHF 133 (168) T protein:vir:74 62 LADSIVMKN------KNIDGVKDGQSVVGWER--STEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHF 133 (168) T ss_pred hhhheeecc------cccCcccCCceeecccc--cccccccchhhhhhhhcccccccccccccccccccccccccccchh Confidence 44 222100 13444566778899953 4556654 6999999999972 1245899999 Q ss_pred hhHHHHHH--HHHHHHH----HHHHHhh--cCCcH Q lcl|NC_019527. 64 FRNMVNEK--SSEWPKR----LGDAIKH--YDGDG 90 (190) Q Consensus 64 lr~~~~~~--~~~~~~~----l~~~i~~--g~~~~ 90 (190) +..+-.+. ++++-+. +++.|.. ++.+. T Consensus 134 vd~~r~~~~~k~~V~~Ae~~~y~eIl~~k~~~~~~ 168 (168) T protein:vir:74 134 IEETRMNLIVQQGILKAEAEAMRKIINRKKKENNL 168 (168) T ss_pred HHHHHhhhhhHHHHHHHHHHHHHHHHHhhcCCCCC Confidence 98876652 2332221 2222321 22222 No 175 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=47.85 E-value=0.28 Score=23.62 Aligned_cols=79 Identities=20% Similarity=0.340 Sum_probs=42.2 Q ss_pred CCCCchhhHHHHHHHH-------------------------HHh-h-----c-----------------CCEEEEEecCC Q lcl|NC_019527. 1 MATLTGGDKLAKILAD-------------------------IGG-K-----A-----------------QGSVDVGFMSG 32 (190) Q Consensus 1 Ma~i~~~d~l~~il~~-------------------------l~~-l-----~-----------------~~~V~VGi~~~ 32 (190) ||.++|.+.+.+-|++ |+. + + -+.|+|||.+ T Consensus 10 ~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~~~~G~r~V~igW~G- 88 (138) T protein:vir:98 10 FANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKLGFTT- 88 (138) T ss_pred cccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCeeecCCceEEEEeeec- Confidence 6777776543222221 100 0 0 2344455532 Q ss_pred CCCCCCccHHHHHHHhhcCccccCCCCCCc--hhhHHHHHHHHHHHHHHHHHHhhcCCcH Q lcl|NC_019527. 33 ATYPDGTPVAQVAFWNEFGHGGRFPAPPRP--FFRNMVNEKSSEWPKRLGDAIKHYDGDG 90 (190) Q Consensus 33 ~~~~dG~~vA~iA~~~EfG~~~~~~IP~RP--Flr~~~~~~~~~~~~~l~~~i~~g~~~~ 90 (190) ++| .|-.+||||...+ |=||- +++.+++..+..+...++.-|... .+. T Consensus 89 pR~-------~ivHLNE~GyGk~--i~PrG~G~I~ka~~~se~~y~~~vk~el~k~-l~~ 138 (138) T protein:vir:98 89 PRW-------NIVHLQELEYGWK--HNRRGVGVIRRYSDILETIYPRGIRDKLKRG-FDG 138 (138) T ss_pred Cee-------eEEeeecccccCC--cCCCcchHHHHHHHhhhHHHHHHHHHHHHHH-hcC Confidence 232 3556899998543 55665 688888888887777776544321 011 No 176 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=41.40 E-value=0.21 Score=24.32 Aligned_cols=75 Identities=20% Similarity=0.177 Sum_probs=44.7 Q ss_pred CC-CCchhhHHHHHHHHHHhhcCCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCC-----chhhHHHHHHHHH Q lcl|NC_019527. 1 MA-TLTGGDKLAKILADIGGKAQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPR-----PFFRNMVNEKSSE 74 (190) Q Consensus 1 Ma-~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~R-----PFlr~~~~~~~~~ 74 (190) |+ +|.--+. ..+.......+.|||... .-+.||.+...|+. ..||. +|+..+..+.+.+ T Consensus 76 laD~I~~~~~-----~~iDg~~dG~s~VGw~~~-------~~a~~a~f~NdGT~---~m~~k~~~gdHFvekt~~~~k~~ 140 (159) T protein:vir:38 76 LQDSITYKPG-----YTADKLHTGDTDVGFEGK-------YYDFLAKIVNNGQH---HMSPKRYKNMHFLDKAQQEAKKS 140 (159) T ss_pred cccceeeecC-----ccccccccceeeecccCC-------ccceEeeecccCcc---ccCCCCccCChhHHHHHHHHHHH Confidence 33 2210000 134444556899999642 23799999999984 36776 6999999988877 Q ss_pred HHHHHHHHHh---hcCCcH Q lcl|NC_019527. 75 WPKRLGDAIK---HYDGDG 90 (190) Q Consensus 75 ~~~~l~~~i~---~g~~~~ 90 (190) +-+.+...+. .-+-|- T Consensus 141 Vl~A~~~~~~~il~~~~~~ 159 (159) T protein:vir:38 141 VAEAELKAYKEVMNHDSDK 159 (159) T ss_pred HHHHHHHHHHHHhhcccCC Confidence 7655544332 222122 No 177 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=36.64 E-value=0.26 Score=23.83 Aligned_cols=91 Identities=21% Similarity=0.146 Sum_probs=50.1 Q ss_pred CCCCchh---hHHHHHHHHHHhhcC--------------------------CEEEEEecCCC-CCCCC--ccHHHHHHHh Q lcl|NC_019527. 1 MATLTGG---DKLAKILADIGGKAQ--------------------------GSVDVGFMSGA-TYPDG--TPVAQVAFWN 48 (190) Q Consensus 1 Ma~i~~~---d~l~~il~~l~~l~~--------------------------~~V~VGi~~~~-~~~dG--~~vA~iA~~~ 48 (190) |-+..+- +.|+...++..+..- .+|+|+=-+-+ .-.-| ..|. +|.+- T Consensus 21 mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aaT~raa~VrAG~~krVP-YA~~I 99 (143) T protein:vir:62 21 VRTLRDKELNKAVREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTASAKGAVIKAGSASRVP-YAAAI 99 (143) T ss_pred HHHhhCCchhHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccCcchhhccccccccccceeeeeCCcCCCC-ccccc Confidence 6655444 334444433322111 11222111100 00012 1333 45666 Q ss_pred hcCccccCCCCCCchhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHH Q lcl|NC_019527. 49 EFGHGGRFPAPPRPFFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALAS 96 (190) Q Consensus 49 EfG~~~~~~IP~RPFlr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~ 96 (190) +||++.++ |-|+=||..++...+++|.+..++.|.+ ..++-|+. T Consensus 100 ~~G~r~r~-Isp~rFl~~a~a~te~~~~r~Ye~~i~~---vl~k~l~s 143 (143) T protein:vir:62 100 HFGYRARN-ISPNRFLFRAMARKSDVVAATYERRIAA---VVEKYLES 143 (143) T ss_pred ccCccccc-ccchhhhhhhhhccCHHHHHHHHHHHHH---HHHHHhcC Confidence 99998874 8899999999999999999988877754 23444443 No 178 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=35.56 E-value=1.2 Score=20.10 Aligned_cols=80 Identities=19% Similarity=0.162 Sum_probs=42.3 Q ss_pred hhhHHHHHHHHHHHHHHHHHHhhcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhh Q lcl|NC_019527. 63 FFRNMVNEKSSEWPKRLGDAIKHYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLA 142 (190) Q Consensus 63 Flr~~~~~~~~~~~~~l~~~i~~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~ 142 (190) .=.-.+++-.+++.+.|+.....-.-+++++++.++..+++.++..|... || T Consensus 1 M~~i~id~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~t-----sp----------------------- 52 (124) T protein:vir:95 1 MAKIKIGRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEV-----GL----------------------- 52 (124) T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhc-----Cc----------------------- Confidence 11234555666666666665544444678889999999999988888652 11 Q ss_pred hHHHhhhcccccccCccCchHHHHHHHhhcceeeecCce-eE---------------EeeccCC Q lcl|NC_019527. 143 AQELVEEGFQGAGGSQAKPLVWTGHMLNSITYQVDGGAT-IK---------------VKVNYGR 190 (190) Q Consensus 143 ~~~~~~~g~~~~~~~s~kPLIDTG~L~~SIty~V~~g~~-~~---------------~~~~~~~ 190 (190) ..||.|..|.+-......+ |. .+.|=|| T Consensus 53 --------------------krTG~YaK~W~~kk~~e~~~V~nk~~yqLtHLLE~GHAkr~GGR 96 (124) T protein:vir:95 53 --------------------VQTGDYMRGWTRKRVPNGWVIHNKTEYRLAHLLEYGHATVDGGR 96 (124) T ss_pred --------------------ccccchhccceeeeecCceeEEEcCCCceeeeeecceeccCCcc Confidence 2334444444433222221 11 1122233 No 179 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=25.44 E-value=0.98 Score=20.63 Aligned_cols=82 Identities=17% Similarity=0.232 Sum_probs=43.8 Q ss_pred CC-CCchhhHHHHHHHHHHhhcCCEEEEEecCCCCCCCCcc-HHHHHHHhhcCcc---------------ccCCCCCCch Q lcl|NC_019527. 1 MA-TLTGGDKLAKILADIGGKAQGSVDVGFMSGATYPDGTP-VAQVAFWNEFGHG---------------GRFPAPPRPF 63 (190) Q Consensus 1 Ma-~i~~~d~l~~il~~l~~l~~~~V~VGi~~~~~~~dG~~-vA~iA~~~EfG~~---------------~~~~IP~RPF 63 (190) || .|+--+ ..+.........|||.... +.|+. -|+||.|..-|+. ..+.||.=+| T Consensus 62 LaDsI~~~~------~niDg~~dG~s~VGf~~k~--~~~~~~ka~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHF 133 (168) T protein:vir:10 62 LADSIVMKN------KNIDGVKDGQSVVGWERST--EKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHF 133 (168) T ss_pred hhhhheecc------cccccccCCceeecccCcc--ccccccchheeeeccccccccccccccccccccccccccccchh Confidence 43 222100 1344456678889996421 23443 6999999999972 1245899999 Q ss_pred hhHHHHHH--HHHHHHH----HHHHHhh--cCCcH Q lcl|NC_019527. 64 FRNMVNEK--SSEWPKR----LGDAIKH--YDGDG 90 (190) Q Consensus 64 lr~~~~~~--~~~~~~~----l~~~i~~--g~~~~ 90 (190) +..+-.+. ++++-+. +++.|.. ++.+. T Consensus 134 vd~~r~d~a~k~~V~~Ae~~~y~eIl~~k~~~~~~ 168 (168) T protein:vir:10 134 IEETRKNPIVQQGILKAEAEAMRKIINRKKKESNL 168 (168) T ss_pred HHHhhhchhhhHHHHHHHHHHHHHHHHhhcCCCCC Confidence 98877652 2332221 2222321 22232 No 180 >protein:vir:2688 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075507;genbank:gi:12719436;genbank:GeneID:920156 Probab=24.98 E-value=1.1 Score=20.37 Aligned_cols=66 Identities=18% Similarity=0.151 Sum_probs=40.2 Q ss_pred HHHHHHHHHh--hcCCcHHHHHHHHHHHHHHHHHHHHhccCCCCCChHHHHHhccccccccchhhhhhhhhHHHhhhccc Q lcl|NC_019527. 75 WPKRLGDAIK--HYDGDGRKALASMGEMIGGDLGSSIISTNEPALSKTTLMLRSIYGNNPQEIRARDVLAAQELVEEGFQ 152 (190) Q Consensus 75 ~~~~l~~~i~--~g~~~~~~aL~~iG~~a~~~Iq~~I~~~~~pPnap~Ti~~K~~~~~~~~~~~~~~~~~~~~~~~~g~~ 152 (190) +.+.|+..+= +-..-.+.||...|...+..+|..+.. T Consensus 1 ilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~----------------------------------------- 39 (123) T protein:vir:26 1 MLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFES----------------------------------------- 39 (123) T ss_pred ChhhHHHhcCHHHHHHhhhHHHHHHHHHHHHHHHHhhHH----------------------------------------- Confidence 3333333220 111245789999999999999887744 Q ss_pred ccccCccCchHHHHHHHhhccee-e--ecC---ceeEEeeccC--C Q lcl|NC_019527. 153 GAGGSQAKPLVWTGHMLNSITYQ-V--DGG---ATIKVKVNYG--R 190 (190) Q Consensus 153 ~~~~~s~kPLIDTG~L~~SIty~-V--~~g---~~~~~~~~~~--~ 190 (190) ..|||.-..+++.. + .+| +||+|.-.=- | T Consensus 40 ---------fkDTGatidev~~s~p~~~~g~~~rtV~i~W~gp~~R 76 (123) T protein:vir:26 40 ---------FKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPMNR 76 (123) T ss_pred ---------hhhccceeeeEEecCeeeccCCccceEEEEeecCCCc Confidence 67899888888764 2 234 5666654321 3 No 181 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=20.81 E-value=2.7 Score=18.21 Aligned_cols=69 Identities=17% Similarity=0.114 Sum_probs=34.4 Q ss_pred CCCC------chhh----HHHHHHHHHHhh-cCCEEEEEecCCCCCCCCccHHHHHHHhhcCccccCCCCCCchhhHHHH Q lcl|NC_019527. 1 MATL------TGGD----KLAKILADIGGK-AQGSVDVGFMSGATYPDGTPVAQVAFWNEFGHGGRFPAPPRPFFRNMVN 69 (190) Q Consensus 1 Ma~i------~~~d----~l~~il~~l~~l-~~~~V~VGi~~~~~~~dG~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~ 69 (190) |+.- .+|. .+......|..+ .+..+.++ +++-||...|||+- .-+|..|.|.++. T Consensus 63 ~~~~~~~~~d~~G~~t~~~~~~~~~~i~~~k~g~~iyi~-----------Nn~pYA~~LEyG~S---~QAP~G~v~~~~~ 128 (145) T protein:vir:10 63 PAQQSLNEYDQTGGQTKTYLARQARAVANSKATSVIYIT-----------NRLDYAADLEYGAS---NQAPAGVLGVVQA 128 (145) T ss_pred cccccccccCCCCccchhhHHHHHHHhhcccccceEEEe-----------eCchhhhHhhcccc---CCCcchHHHHHHH Confidence 2211 1111 122222222221 11222221 35778888899973 4799999999998 Q ss_pred HHHH---HHHHHHHHHH Q lcl|NC_019527. 70 EKSS---EWPKRLGDAI 83 (190) Q Consensus 70 ~~~~---~~~~~l~~~i 83 (190) +... +..+.++++| T Consensus 129 ~~~~~v~~~~~e~k~~~ 145 (145) T protein:vir:10 129 RLGRYFQEAVEEARRAI 145 (145) T ss_pred HHHHHHHHHHHHhhccC Confidence 7653 3333334444 No 182 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=20.02 E-value=1.7 Score=19.38 Aligned_cols=69 Identities=20% Similarity=0.175 Sum_probs=36.7 Q ss_pred CCCCchhhHHHHHHHHH----------------Hhh------------cCCEEEEEecCCC----CCCCC---------- Q lcl|NC_019527. 1 MATLTGGDKLAKILADI----------------GGK------------AQGSVDVGFMSGA----TYPDG---------- 38 (190) Q Consensus 1 Ma~i~~~d~l~~il~~l----------------~~l------------~~~~V~VGi~~~~----~~~dG---------- 38 (190) |+.|+=...+.+..+++ .++ ++..|.+|-|... ..|.| T Consensus 1 ~~~~sf~~~i~~~~~~ve~~~~~~~r~~~~~~~~~vv~~sPVdtGrfRanw~vs~~~p~~~~~~~~dp~g~~t~~~~~~~ 80 (121) T protein:vir:94 1 MISMKFNVNLSRLRSNLREEAKKKAIRIAQEIVNGVIARSPVLAGDYRSSWNVSEGSMEFKFNNGGNPANPTPAPAIVVS 80 (121) T ss_pred CccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCchhhhhccccccccCcccccCCCCCCCcchhHHHHHHH Confidence 77665333222211111 111 2334444433311 11222 Q ss_pred ----------ccHHHHHHHhhcCccccCCCCCCchhhHHHHHHH Q lcl|NC_019527. 39 ----------TPVAQVAFWNEFGHGGRFPAPPRPFFRNMVNEKS 72 (190) Q Consensus 39 ----------~~vA~iA~~~EfG~~~~~~IP~RPFlr~~~~~~~ 72 (190) ++++-||.-.|||+. .-+|+.|.|.++.+-+ T Consensus 81 ~~~~~~~iyi~NnlpYA~~LE~G~S---~QAP~G~v~~t~~~~q 121 (121) T protein:vir:94 81 SNVALPHFYITNGAPYAQQLEKGSS---TQAPLGIVRVTLASLR 121 (121) T ss_pred HhhccceEEEeeCcchhhhhhcccC---CCCcchHHHHHHHhhC Confidence 134556778899974 4699999999999888 Done!