Query lcl|NC_020201.1_cdsid_YP_007392681.1 [gene=H378_gp219] [protein=hypothetical protein] [protein_id=YP_007392681.1] [location=121558..122019] Match_columns 153 No_of_seqs 111 out of 182 Neff 6.4 Searched_HMMs 1612 Date Thu Nov 7 16:30:11 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_219 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_219_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:107757 Length: 189 100.0 2.3E-53 1.4E-56 309.1 17.9 150 1-152 1-189 (189) 2 protein:vir:5257 Length: 148 # 100.0 4.2E-53 2.6E-56 307.7 16.7 143 1-148 1-148 (148) 3 protein:vir:99546 Length: 200 100.0 1.3E-52 8E-56 305.1 16.4 145 1-148 7-200 (200) 4 protein:vir:96105 Length: 193 100.0 6.4E-52 3.9E-55 301.3 16.9 144 1-148 1-193 (193) 5 protein:vir:80037 Length: 199 100.0 7.1E-49 4.4E-52 284.5 15.8 143 1-150 1-199 (199) 6 protein:vir:106728 Length: 155 100.0 9.1E-48 5.6E-51 278.5 15.6 136 1-149 1-155 (155) 7 protein:vir:78607 Length: 155 100.0 9.3E-48 5.7E-51 278.4 15.6 136 1-149 1-155 (155) 8 protein:vir:94069 Length: 168 100.0 3.4E-47 2.1E-50 275.3 15.8 144 1-153 1-162 (168) 9 protein:vir:101563 Length: 155 100.0 1E-46 6.5E-50 272.7 14.9 136 1-149 1-155 (155) 10 protein:vir:77650 Length: 155 100.0 3E-46 1.9E-49 270.1 15.2 136 1-149 1-155 (155) 11 protein:vir:95260 Length: 160 100.0 6.5E-46 4E-49 268.3 16.3 149 1-153 1-158 (160) 12 protein:vir:4347 Length: 164 # 98.7 3.2E-11 2E-14 78.1 4.8 99 1-104 1-164 (164) 13 protein:vir:1891 Length: 179 # 98.6 2.9E-11 1.8E-14 78.3 2.1 99 1-104 1-179 (179) 14 protein:vir:3163 Length: 145 # 98.5 4.4E-10 2.7E-13 71.9 6.6 82 66-153 1-87 (145) 15 protein:vir:1386 Length: 149 # 98.5 4.1E-10 2.5E-13 72.0 4.9 87 1-99 1-149 (149) 16 protein:vir:102875 Length: 146 98.4 6.1E-10 3.8E-13 71.1 5.0 85 1-94 1-146 (146) 17 protein:vir:105007 Length: 146 98.4 6.1E-10 3.8E-13 71.1 5.0 85 1-94 1-146 (146) 18 protein:vir:102085 Length: 146 98.4 6.1E-10 3.8E-13 71.1 5.0 85 1-94 1-146 (146) 19 protein:vir:107568 Length: 146 98.4 6.1E-10 3.8E-13 71.1 5.0 85 1-94 1-146 (146) 20 protein:vir:97088 Length: 157 98.4 1.4E-09 8.5E-13 69.2 6.3 89 1-89 1-157 (157) 21 protein:vir:93617 Length: 148 98.3 5.4E-10 3.4E-13 71.4 3.2 89 1-96 2-148 (148) 22 protein:vir:94538 Length: 125 98.3 1.7E-09 1E-12 68.7 5.6 88 1-88 1-125 (125) 23 protein:vir:79225 Length: 155 98.3 9.7E-10 6E-13 70.0 3.9 89 62-153 1-96 (155) 24 protein:vir:95789 Length: 114 98.3 1.7E-09 1E-12 68.7 4.4 82 1-86 1-114 (114) 25 protein:vir:99833 Length: 190 98.3 2.1E-09 1.3E-12 68.2 4.9 90 61-153 1-96 (190) 26 protein:vir:99196 Length: 155 98.2 1.8E-09 1.1E-12 68.5 4.0 89 62-153 1-96 (155) 27 protein:vir:103841 Length: 155 98.2 1.8E-09 1.1E-12 68.6 3.6 89 62-153 1-96 (155) 28 protein:vir:3617 Length: 112 # 98.2 1.2E-09 7.7E-13 69.4 2.6 80 1-82 1-112 (112) 29 protein:vir:194 Length: 149 # 98.2 3.3E-09 2.1E-12 67.1 4.3 89 1-96 2-149 (149) 30 protein:vir:1273 Length: 127 # 98.1 4E-09 2.5E-12 66.6 3.5 79 1-86 42-127 (127) 31 protein:vir:79091 Length: 175 98.0 1.1E-08 6.7E-12 64.3 4.1 89 62-153 1-113 (175) 32 protein:vir:78858 Length: 115 98.0 9E-09 5.6E-12 64.7 3.2 82 1-82 20-115 (115) 33 protein:vir:96225 Length: 115 98.0 9E-09 5.6E-12 64.7 3.2 82 1-82 20-115 (115) 34 protein:vir:103917 Length: 115 98.0 9E-09 5.6E-12 64.7 3.2 82 1-82 20-115 (115) 35 protein:vir:9312 Length: 115 # 98.0 9E-09 5.6E-12 64.7 3.2 82 1-82 20-115 (115) 36 protein:vir:97144 Length: 115 98.0 9E-09 5.6E-12 64.7 3.2 82 1-82 20-115 (115) 37 protein:vir:96358 Length: 115 98.0 9E-09 5.6E-12 64.7 3.2 82 1-82 20-115 (115) 38 protein:vir:1988 Length: 156 # 97.9 1.9E-08 1.2E-11 62.9 4.2 88 62-153 1-100 (156) 39 protein:vir:9708 Length: 125 # 97.9 1E-08 6.4E-12 64.4 2.5 80 1-87 38-125 (125) 40 protein:vir:106623 Length: 115 97.9 1.3E-08 8.2E-12 63.8 3.0 82 1-82 16-115 (115) 41 protein:vir:9930 Length: 108 # 97.9 1.7E-08 1E-11 63.2 3.3 75 9-83 1-108 (108) 42 protein:vir:3873 Length: 128 # 97.9 1.2E-08 7.1E-12 64.1 2.3 79 1-86 41-128 (128) 43 protein:vir:80362 Length: 140 97.9 2.5E-08 1.5E-11 62.3 3.9 89 1-89 19-140 (140) 44 protein:vir:105089 Length: 133 97.8 1.5E-08 9.1E-12 63.5 2.1 81 1-88 19-133 (133) 45 protein:vir:99744 Length: 115 97.8 2E-08 1.2E-11 62.8 2.6 82 1-82 16-115 (115) 46 protein:vir:100243 Length: 140 97.8 3.6E-08 2.2E-11 61.4 4.0 89 1-89 19-140 (140) 47 protein:vir:9414 Length: 125 # 97.8 1.4E-08 8.7E-12 63.6 1.7 73 1-86 53-125 (125) 48 protein:vir:98342 Length: 125 97.8 1.4E-08 8.7E-12 63.6 1.7 73 1-86 53-125 (125) 49 protein:vir:81106 Length: 125 97.8 1.4E-08 8.7E-12 63.6 1.7 73 1-86 53-125 (125) 50 protein:vir:4704 Length: 125 # 97.8 1.4E-08 8.7E-12 63.6 1.7 73 1-86 53-125 (125) 51 protein:vir:79988 Length: 125 97.8 1.4E-08 8.7E-12 63.6 1.7 73 1-86 53-125 (125) 52 protein:vir:2740 Length: 114 # 97.8 2E-08 1.2E-11 62.8 2.1 82 1-83 19-114 (114) 53 protein:vir:4906 Length: 114 # 97.8 2E-08 1.2E-11 62.8 2.1 82 1-83 19-114 (114) 54 protein:vir:100075 Length: 140 97.8 3.4E-08 2.1E-11 61.5 3.4 89 1-89 19-140 (140) 55 protein:vir:1437 Length: 140 # 97.8 5.5E-08 3.4E-11 60.4 4.1 89 1-89 19-140 (140) 56 protein:vir:96486 Length: 112 97.7 3.6E-08 2.2E-11 61.4 1.9 80 1-81 19-112 (112) 57 protein:vir:5745 Length: 135 # 97.6 7.6E-08 4.7E-11 59.6 3.0 85 1-99 20-135 (135) 58 protein:vir:107851 Length: 175 97.6 1.2E-07 7.4E-11 58.5 3.9 89 62-153 1-113 (175) 59 protein:vir:5978 Length: 144 # 97.6 1E-07 6.4E-11 58.9 3.3 80 1-82 4-144 (144) 60 protein:vir:98409 Length: 108 97.6 4.8E-08 3E-11 60.7 1.3 80 1-82 13-108 (108) 61 protein:vir:743 Length: 108 # 97.6 7.8E-08 4.8E-11 59.6 2.3 79 1-82 16-108 (108) 62 protein:vir:3163 Length: 145 # 97.5 8.8E-08 5.4E-11 59.3 2.1 78 1-91 52-145 (145) 63 protein:vir:79091 Length: 175 97.3 9.7E-07 6E-10 53.6 5.8 87 1-88 1-175 (175) 64 protein:vir:1988 Length: 156 # 97.3 1.3E-06 8.3E-10 52.8 6.0 73 1-87 75-156 (156) 65 protein:vir:98557 Length: 149 97.2 3.3E-06 2E-09 50.7 7.1 86 66-153 1-92 (149) 66 protein:vir:101594 Length: 173 97.1 9.1E-07 5.6E-10 53.7 3.4 86 1-88 16-173 (173) 67 protein:vir:100312 Length: 152 97.1 1.7E-06 1.1E-09 52.2 4.6 75 1-84 64-152 (152) 68 protein:vir:97427 Length: 137 97.0 6.7E-07 4.2E-10 54.4 2.1 74 1-78 1-137 (137) 69 protein:vir:93738 Length: 137 97.0 6.7E-07 4.2E-10 54.4 2.1 74 1-78 1-137 (137) 70 protein:vir:94490 Length: 137 97.0 6.7E-07 4.2E-10 54.4 2.1 74 1-78 1-137 (137) 71 protein:vir:107099 Length: 137 97.0 7.8E-07 4.8E-10 54.1 2.2 74 1-78 1-137 (137) 72 protein:vir:99833 Length: 190 96.9 4.1E-06 2.6E-09 50.1 5.6 75 1-89 71-190 (190) 73 protein:vir:98557 Length: 149 96.8 2.5E-06 1.6E-09 51.3 3.7 73 1-83 64-149 (149) 74 protein:vir:2026 Length: 150 # 96.8 2.6E-06 1.6E-09 51.2 3.7 75 1-83 63-150 (150) 75 protein:vir:94654 Length: 142 96.8 1.1E-06 7E-10 53.2 1.7 81 1-81 4-142 (142) 76 protein:vir:6071 Length: 150 # 96.8 2.6E-06 1.6E-09 51.2 3.5 75 1-83 63-150 (150) 77 protein:vir:5703 Length: 150 # 96.7 3.4E-06 2.1E-09 50.5 3.6 75 1-83 63-150 (150) 78 protein:vir:107851 Length: 175 96.6 6.2E-06 3.9E-09 49.1 4.6 75 1-88 65-175 (175) 79 protein:vir:1838 Length: 149 # 96.6 7.7E-06 4.8E-09 48.6 5.1 74 1-83 63-149 (149) 80 protein:vir:78077 Length: 141 96.6 2.4E-06 1.5E-09 51.4 2.3 85 1-85 14-141 (141) 81 protein:vir:103841 Length: 155 96.6 6.7E-06 4.1E-09 49.0 4.4 75 1-88 71-155 (155) 82 protein:vir:102154 Length: 119 96.5 1.5E-06 9.5E-10 52.5 0.5 79 1-86 20-119 (119) 83 protein:vir:106570 Length: 182 96.5 2.3E-06 1.5E-09 51.5 1.4 86 1-87 2-182 (182) 84 protein:vir:96121 Length: 137 96.5 3.2E-06 2E-09 50.7 2.0 76 1-78 18-137 (137) 85 protein:vir:95894 Length: 137 96.4 3.7E-06 2.3E-09 50.4 1.9 76 1-78 18-137 (137) 86 protein:vir:94108 Length: 149 96.4 2.1E-06 1.3E-09 51.8 0.4 74 1-78 13-149 (149) 87 protein:vir:2026 Length: 150 # 96.4 3.3E-05 2.1E-08 45.1 7.0 86 66-153 1-99 (150) 88 protein:vir:79225 Length: 155 96.3 1.2E-05 7.3E-09 47.6 4.0 75 1-88 71-155 (155) 89 protein:vir:81067 Length: 119 96.2 8.6E-06 5.3E-09 48.4 2.9 82 1-89 5-119 (119) 90 protein:vir:94796 Length: 137 96.2 2.1E-06 1.3E-09 51.7 -0.4 78 1-78 1-137 (137) 91 protein:vir:105916 Length: 149 96.2 3.1E-06 1.9E-09 50.8 0.4 74 1-78 13-149 (149) 92 protein:vir:1164 Length: 156 # 96.2 1.3E-05 8.3E-09 47.3 3.8 79 1-91 65-156 (156) 93 protein:vir:10367 Length: 119 96.1 9.6E-06 6E-09 48.1 2.9 82 1-89 5-119 (119) 94 protein:vir:105330 Length: 137 96.1 3.9E-06 2.4E-09 50.2 0.5 74 1-78 1-137 (137) 95 protein:vir:79179 Length: 155 96.0 1.8E-05 1.1E-08 46.6 4.0 75 1-83 55-155 (155) 96 protein:vir:6071 Length: 150 # 96.0 7.4E-05 4.6E-08 43.2 7.1 86 66-153 1-92 (150) 97 protein:vir:99196 Length: 155 95.9 1.8E-05 1.1E-08 46.5 3.6 72 1-88 71-155 (155) 98 protein:vir:100887 Length: 139 95.9 9.4E-06 5.9E-09 48.1 1.9 77 1-92 61-139 (139) 99 protein:vir:79115 Length: 148 95.9 2.1E-05 1.3E-08 46.3 3.6 75 1-87 56-148 (148) 100 protein:vir:96829 Length: 135 95.9 5.9E-06 3.6E-09 49.3 0.5 74 1-78 1-135 (135) 101 protein:vir:5703 Length: 150 # 95.8 0.00011 6.9E-08 42.3 7.1 86 66-153 1-99 (150) 102 protein:vir:97327 Length: 116 95.7 7.3E-06 4.5E-09 48.7 0.3 76 1-78 1-116 (116) 103 protein:vir:1243 Length: 116 # 95.7 7.3E-06 4.5E-09 48.7 0.3 76 1-78 1-116 (116) 104 protein:vir:95062 Length: 116 95.7 6.3E-06 3.9E-09 49.1 -0.1 76 1-78 1-116 (116) 105 protein:vir:5000 Length: 141 # 95.3 1.9E-05 1.2E-08 46.4 1.3 79 1-92 61-141 (141) 106 protein:vir:4859 Length: 140 # 95.2 2.5E-05 1.5E-08 45.8 1.7 78 1-91 61-140 (140) 107 protein:vir:79115 Length: 148 94.8 0.00042 2.6E-07 39.1 7.3 86 66-153 1-91 (148) 108 protein:vir:100223 Length: 139 94.7 3.8E-05 2.3E-08 44.8 1.3 77 1-92 61-139 (139) 109 protein:vir:4956 Length: 153 # 94.6 3E-05 1.9E-08 45.3 0.7 96 1-123 25-153 (153) 110 protein:vir:4833 Length: 140 # 94.3 5.4E-05 3.4E-08 44.0 1.3 83 1-91 25-140 (140) 111 protein:vir:99101 Length: 142 94.0 9E-05 5.6E-08 42.8 1.9 77 1-79 22-142 (142) 112 protein:vir:8669 Length: 142 # 94.0 9E-05 5.6E-08 42.8 1.9 77 1-79 22-142 (142) 113 protein:vir:1838 Length: 149 # 93.7 0.0013 7.8E-07 36.5 7.8 86 66-153 1-92 (149) 114 protein:vir:79034 Length: 141 93.6 0.00017 1.1E-07 41.2 2.8 90 1-90 1-141 (141) 115 protein:vir:81147 Length: 126 93.6 0.00015 9.3E-08 41.5 2.4 83 1-88 9-126 (126) 116 protein:vir:3787 Length: 231 # 92.1 0.0013 8.2E-07 36.4 5.6 79 1-90 65-231 (231) 117 protein:vir:966 Length: 123 # 91.2 0.0011 6.8E-07 36.8 4.1 81 1-83 1-123 (123) 118 protein:vir:105467 Length: 144 90.4 0.0022 1.3E-06 35.2 4.9 88 1-89 1-144 (144) 119 protein:vir:107545 Length: 140 90.2 0.00024 1.5E-07 40.5 -0.5 76 1-76 6-140 (140) 120 protein:vir:97982 Length: 140 90.2 0.00024 1.5E-07 40.5 -0.5 76 1-76 6-140 (140) 121 protein:vir:106570 Length: 182 90.0 0.0018 1.1E-06 35.7 4.1 71 62-153 1-71 (182) 122 protein:vir:79179 Length: 155 89.8 0.0048 2.9E-06 33.3 6.3 87 66-153 1-98 (155) 123 protein:vir:3750 Length: 227 # 88.3 0.005 3.1E-06 33.2 5.3 79 1-109 59-227 (227) 124 protein:vir:102441 Length: 137 88.2 0.00026 1.6E-07 40.2 -1.8 77 1-77 1-137 (137) 125 protein:vir:100312 Length: 152 88.1 0.011 7.1E-06 31.2 7.2 87 66-153 1-93 (152) 126 protein:vir:94490 Length: 137 87.9 0.0015 9.2E-07 36.1 2.2 66 62-153 1-66 (137) 127 protein:vir:93738 Length: 137 87.9 0.0015 9.2E-07 36.1 2.2 66 62-153 1-66 (137) 128 protein:vir:97427 Length: 137 87.9 0.0015 9.2E-07 36.1 2.2 66 62-153 1-66 (137) 129 protein:vir:78755 Length: 228 87.5 0.0079 4.9E-06 32.1 5.9 92 1-127 55-228 (228) 130 protein:vir:9879 Length: 127 # 87.3 0.0011 6.9E-07 36.8 1.1 83 1-83 16-127 (127) 131 protein:vir:96121 Length: 137 87.3 0.0017 1.1E-06 35.7 2.2 66 62-153 1-66 (137) 132 protein:vir:95894 Length: 137 86.6 0.0021 1.3E-06 35.3 2.2 66 62-153 1-66 (137) 133 protein:vir:3848 Length: 159 # 86.1 0.0023 1.4E-06 35.1 2.2 79 1-91 76-159 (159) 134 protein:vir:9930 Length: 108 # 85.9 0.0034 2.1E-06 34.1 3.0 61 63-153 1-61 (108) 135 protein:vir:100652 Length: 134 85.8 0.0034 2.1E-06 34.1 3.0 75 1-84 1-134 (134) 136 protein:vir:97327 Length: 116 85.5 0.0035 2.2E-06 34.0 2.9 45 77-153 1-45 (116) 137 protein:vir:1243 Length: 116 # 85.5 0.0035 2.2E-06 34.0 2.9 45 77-153 1-45 (116) 138 protein:vir:101594 Length: 173 85.4 0.014 8.5E-06 30.8 6.1 64 62-153 1-64 (173) 139 protein:vir:94796 Length: 137 85.1 0.0027 1.7E-06 34.7 2.1 66 62-153 1-66 (137) 140 protein:vir:106506 Length: 137 84.8 0.0077 4.8E-06 32.2 4.4 61 55-153 1-61 (137) 141 protein:vir:96225 Length: 115 84.5 0.0021 1.3E-06 35.2 1.3 68 61-153 1-68 (115) 142 protein:vir:78858 Length: 115 84.5 0.0021 1.3E-06 35.2 1.3 68 61-153 1-68 (115) 143 protein:vir:96358 Length: 115 84.5 0.0021 1.3E-06 35.2 1.3 68 61-153 1-68 (115) 144 protein:vir:103917 Length: 115 84.5 0.0021 1.3E-06 35.2 1.3 68 61-153 1-68 (115) 145 protein:vir:9312 Length: 115 # 84.5 0.0021 1.3E-06 35.2 1.3 68 61-153 1-68 (115) 146 protein:vir:97144 Length: 115 84.5 0.0021 1.3E-06 35.2 1.3 68 61-153 1-68 (115) 147 protein:vir:102963 Length: 163 84.4 0.0069 4.3E-06 32.4 4.0 88 1-90 1-163 (163) 148 protein:vir:94654 Length: 142 84.1 0.004 2.5E-06 33.7 2.6 67 62-153 1-68 (142) 149 protein:vir:107099 Length: 137 83.7 0.0038 2.4E-06 33.9 2.2 66 62-153 1-66 (137) 150 protein:vir:1164 Length: 156 # 83.4 0.03 1.8E-05 28.9 7.0 87 66-153 1-101 (156) 151 protein:vir:105330 Length: 137 83.3 0.0037 2.3E-06 33.9 2.0 66 62-153 1-66 (137) 152 protein:vir:5978 Length: 144 # 82.7 0.012 7.4E-06 31.1 4.6 71 57-153 1-71 (144) 153 protein:vir:78077 Length: 141 82.0 0.0066 4.1E-06 32.5 2.9 64 62-153 1-66 (141) 154 protein:vir:96829 Length: 135 82.0 0.0048 3E-06 33.3 2.1 66 62-153 1-66 (135) 155 protein:vir:95062 Length: 116 80.0 0.0095 5.9E-06 31.7 3.0 45 90-153 1-45 (116) 156 protein:vir:9513 Length: 134 # 78.8 0.01 6.2E-06 31.5 2.8 75 1-84 1-134 (134) 157 protein:vir:101302 Length: 134 78.8 0.01 6.2E-06 31.5 2.8 75 1-84 1-134 (134) 158 protein:vir:97982 Length: 140 67.1 0.021 1.3E-05 29.8 1.5 65 77-153 1-65 (140) 159 protein:vir:107545 Length: 140 67.1 0.021 1.3E-05 29.8 1.5 65 77-153 1-65 (140) 160 protein:vir:7412 Length: 168 # 66.1 0.035 2.2E-05 28.6 2.6 93 1-94 22-168 (168) 161 protein:vir:9879 Length: 127 # 65.7 0.043 2.7E-05 28.1 3.0 60 85-153 1-68 (127) 162 protein:vir:9647 Length: 132 # 64.7 0.099 6.1E-05 26.1 4.8 78 1-87 1-132 (132) 163 protein:vir:98636 Length: 138 63.7 0.098 6.1E-05 26.1 4.6 78 1-87 7-138 (138) 164 protein:vir:95372 Length: 124 58.3 0.022 1.4E-05 29.7 -0.0 83 1-83 1-124 (124) 165 protein:vir:106041 Length: 137 56.1 0.065 4E-05 27.1 2.2 60 62-153 1-62 (137) 166 protein:vir:99528 Length: 92 # 53.9 0.06 3.7E-05 27.3 1.6 57 1-58 1-92 (92) 167 protein:vir:103280 Length: 142 53.1 0.54 0.00033 22.1 6.8 74 1-84 60-142 (142) 168 protein:vir:80116 Length: 127 53.0 0.093 5.8E-05 26.2 2.5 83 1-89 1-127 (127) 169 protein:vir:1028 Length: 168 # 48.3 0.071 4.4E-05 26.9 1.1 86 1-94 62-168 (168) 170 protein:vir:6246 Length: 143 # 47.9 0.091 5.6E-05 26.3 1.6 91 1-96 1-143 (143) 171 protein:vir:104347 Length: 145 47.7 0.38 0.00023 22.9 5.0 77 1-84 63-145 (145) 172 protein:vir:93898 Length: 133 47.0 0.19 0.00012 24.6 3.2 73 1-83 1-133 (133) 173 protein:vir:1332 Length: 143 # 44.3 0.11 6.6E-05 25.9 1.4 91 1-96 1-143 (143) 174 protein:vir:6216 Length: 125 # 42.8 0.33 0.0002 23.2 3.8 85 1-88 19-125 (125) 175 protein:vir:102338 Length: 116 37.9 0.6 0.00037 21.8 4.5 83 1-86 1-116 (116) 176 protein:vir:4096 Length: 140 # 37.5 0.22 0.00014 24.2 2.0 86 1-88 1-140 (140) 177 protein:vir:9363 Length: 133 # 34.8 0.41 0.00025 22.7 3.0 73 1-83 1-133 (133) 178 protein:vir:94419 Length: 133 34.8 0.41 0.00025 22.7 3.0 73 1-83 1-133 (133) 179 protein:vir:78644 Length: 133 34.8 0.41 0.00025 22.7 3.0 73 1-83 1-133 (133) 180 protein:vir:96973 Length: 133 34.8 0.41 0.00025 22.7 3.0 73 1-83 1-133 (133) 181 protein:vir:94994 Length: 131 34.4 0.78 0.00049 21.2 4.5 71 1-85 55-131 (131) 182 protein:vir:78380 Length: 131 32.6 0.87 0.00054 20.9 4.5 71 1-85 55-131 (131) 183 protein:vir:3994 Length: 168 # 27.2 0.26 0.00016 23.8 0.6 91 1-94 22-168 (168) 184 protein:vir:78335 Length: 133 26.0 0.95 0.00059 20.7 3.4 76 1-85 1-133 (133) 185 protein:vir:96774 Length: 152 22.5 1.2 0.00074 20.2 3.3 74 1-88 79-152 (152) No 1 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=100.00 E-value=2.3e-53 Score=309.14 Aligned_cols=150 Identities=19% Similarity=0.288 Sum_probs=142.4 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCC-CCCCHHHHHHHHhcCC--CCCCCCchhhHHHHHHHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHY-SGLNMATLAAIHEEGW--NNLPERNFMFSTSMHFQEGLRKHI 77 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~-dG~~va~iA~~~E~G~--~~IP~RpFlr~~~~~~~~~~~~~~ 77 (153) |++.|+...+.+++|.+.|++|++++|+|||+++.+| ||+++|+||+|||||+ .+||||||||+|+++++++|.+.+ T Consensus 1 M~~~i~~~~~~~~~L~~~lk~l~~k~V~VGi~~~~~y~dG~~vA~Ia~~~E~G~p~~~IP~RPFlr~t~~~~~~~~~~~l 80 (189) T protein:vir:10 1 MGRVIRKQGPARVKLNAFIKGMNDYSVRIGWFSTAKYPDGTPTAYVASIHEFGAPSRGIPARSFIRPTIAAQQAAWSQQM 80 (189) T ss_pred CcceeccCcHHHHHHHHHHHHhhCCeEEEEecCCCCCCCcccHHHHHHHHHhcCcCCCCCCchhhhHHHHHHHHHHHHHH Confidence 9999998889999999999999999999999988755 7999999999999999 479999999999999999999999 Q ss_pred HHHHHHHHcCC-CHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCC--------------------------- Q lcl|NC_020201. 78 KRMHNGIIQGR-GFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGF--------------------------- 129 (153) Q Consensus 78 ~~~~~~~~~G~-~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~--------------------------- 129 (153) +++++++++|+ +++++|+.+|+.++++||.+|.++. .|||||+||++||+ T Consensus 81 ~~~~~~vl~G~~~~~~~L~~~G~~a~~~Ik~~I~~~~--~ppna~sTi~~Kg~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (189) T protein:vir:10 81 RFYAKQIVVGQMNVEQALEGLAIVARGDVDATLARLK--DPPLSPLTIYIRKFIKDGGVIHGYKDIMRLRSEMQQEQAKG 158 (189) T ss_pred HHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCC--CCCCcHHHHHHhcccCcccchhhhhhhhhhhhhhhhhhhhc Confidence 99999999996 9999999999999999999999955 58999999999994 Q ss_pred --------CCcchhHHHHHhhcceeeeeCCC Q lcl|NC_020201. 130 --------DDAMIHYGDLSSAATYKIVKYQG 152 (153) Q Consensus 130 --------~~PLidTG~L~~Sity~V~~~~g 152 (153) ++||||||+|++||||+|++++. T Consensus 159 ~~~~~~~s~kPLidTG~l~~SIty~V~~k~~ 189 (189) T protein:vir:10 159 TLNLSGVSTDPLDFTGYMRATLSYTVTKEKS 189 (189) T ss_pred cccccccCCCchhhHHHHHhhcceeeeecCC Confidence 79999999999999999999999 No 2 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=100.00 E-value=4.2e-53 Score=307.74 Aligned_cols=143 Identities=24% Similarity=0.286 Sum_probs=132.5 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCc-----CCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDE-----KHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRK 75 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~-----~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~ 75 (153) |++++|.+.+++++++++|++|++++|+|||+++ .++||+++|+||||||||+.+||+|||||+|+++++++|.+ T Consensus 1 M~~~~k~~~~~~~~l~~~l~~l~~~~v~VGi~~~~~~~~~~~~g~~vA~ia~~~E~G~~~IP~Rpflr~t~~~~~~~~~~ 80 (148) T protein:vir:52 1 MAVTVTANFSAAKQLIEQMKSLKEKAVYVGFPAEFDEKVKGSENFNLASLAAVLEFGNEHIPARPFLRQTLEENQEKYTA 80 (148) T ss_pred CccccccccHHHHHHHHHHHHhhCCeEEEEeecCcCCCCCCCCCCCHHHHHHHHhcCCCCCCCcchhHHHHHHHHHHHHH Confidence 9999999999999999999999999999999843 34589999999999999999999999999999999999998 Q ss_pred HHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHhhcceeee Q lcl|NC_020201. 76 HIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSSAATYKIV 148 (153) Q Consensus 76 ~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~Sity~V~ 148 (153) .+++++. .+.+++++|+.+|+.++++||.+|.++. .|||||+||++||||+||||||+|++||+|+|+ T Consensus 81 ~~~~~~~---~~~~~~~~L~~~G~~~~~~ik~~I~~~~--~ppna~sTi~~Kg~~~PLidTG~l~~SIty~V~ 148 (148) T protein:vir:52 81 LFIQWFD---QGVPAAQIYERLSVMAQGDVQMNIVKGE--WVANAKSTIRRKKSSKPLIDTGKMRQSVRGIVK 148 (148) T ss_pred HHHHHHH---cCCCHHHHHHHHHHHHHHHHHHHHhcCC--CCCCcHHHHHhcCCCCchhHHHHHHHHhhhhcC Confidence 8776443 3458999999999999999999999854 688999999999999999999999999999998 No 3 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=100.00 E-value=1.3e-52 Score=305.06 Aligned_cols=145 Identities=21% Similarity=0.275 Sum_probs=136.8 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCC-------CCCCHHHHHHHHhcCC-------------------- Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHY-------SGLNMATLAAIHEEGW-------------------- 53 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~-------dG~~va~iA~~~E~G~-------------------- 53 (153) |+++++.+ +++++++++|++|++++|+|||+++++| ||+++|+||+|||||+ T Consensus 7 ~~~k~~~~-~~~~~~~~~l~~l~~~~v~vGi~~~~~y~~~~~~~dG~~va~IA~~~EfG~~i~~p~~~~~~~~~~~~g~~ 85 (200) T protein:vir:99 7 KSNSVAAP-LKHFQMLKQFDALKGKTVQAGWFETDRYPAKEGETIGPLVAKIARQLEFGGVINHPGGTKYIKDAIVDGRY 85 (200) T ss_pred eeeeeecc-hHHHHHHHHHHHhhCCeEEEEEcCCCCcCCcccccccchHHHHHhHHHcCCeeccCCCccccccccccccc Confidence 99999876 6899999999999999999999988755 6899999999999994 Q ss_pred ---------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCC-CHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020201. 54 ---------------------NNLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGR-GFSSYLTKIGKDAADSIRFTIST 111 (153) Q Consensus 54 ---------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~-~~~~~L~~iG~~~~~~i~~~I~~ 111 (153) .+||||||||+|+++++++|.+++++++..++.|+ +++++|+.+|+.++++||.+|++ T Consensus 86 ~g~rfv~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~~G~~~~~~ik~~I~~ 165 (200) T protein:vir:99 86 VGTRFVHKSFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQAQIARQLLDGTINPEQALAQIGLALEGCIVRSIKS 165 (200) T ss_pred cccccccccccceeeeeccccccCCCcchhhHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc Confidence 37999999999999999999999999999999997 99999999999999999999998 Q ss_pred cCCCCCCccHHHHHhcCCCCcchhHHHHHhhcceeee Q lcl|NC_020201. 112 GSFSNPKVSKDWASYKGFDDAMIHYGDLSSAATYKIV 148 (153) Q Consensus 112 ~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~Sity~V~ 148 (153) +. .|||||+||++||||+||||||+|++||||+|. T Consensus 166 ~~--~ppna~sTi~~Kg~~~PLidTG~l~~SIty~Ve 200 (200) T protein:vir:99 166 GP--WAANSPATIRAKGFDKPLIDTAHMWQTVSSKVS 200 (200) T ss_pred CC--CCCChHHHHHHhCCCCchHHHHHHHhHhccccC Confidence 65 589999999999999999999999999999998 No 4 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=100.00 E-value=6.4e-52 Score=301.26 Aligned_cols=144 Identities=21% Similarity=0.319 Sum_probs=134.0 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCC-------CCCCHHHHHHHHhcCCC------------------- Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHY-------SGLNMATLAAIHEEGWN------------------- 54 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~-------dG~~va~iA~~~E~G~~------------------- 54 (153) |+++ .+.++|++++++|++|++++|+|||++++.| +|+++|+||+|||||.. T Consensus 1 m~~~--~~~~~~~~~~~~l~~l~~~~v~vGi~~~~~~~~~~~~~~G~~va~iAai~EfG~~I~~~~~~~~~~~~~~~g~~ 78 (193) T protein:vir:96 1 MSLR--RDSELIAAHLQMLRAMRGRSVSAGWYSTARYPDKAGGSVGIQVARIARLNEYGGTIDHPGGTRYIRDAIVRGRF 78 (193) T ss_pred Ceec--cchHHHHHHHHHHHHhcCCeEEEEEcCCCCCCCcccccccchHHHHHhHHHcCCccccCccceeeeeccccccc Confidence 5554 6778999999999999999999999987644 38999999999999943 Q ss_pred ----------------------CCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCC-CHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_020201. 55 ----------------------NLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGR-GFSSYLTKIGKDAADSIRFTIST 111 (153) Q Consensus 55 ----------------------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~-~~~~~L~~iG~~~~~~i~~~I~~ 111 (153) +||||||||+++++++++|.+++++++..++.|. +++++|+.+|+.++++||.+|++ T Consensus 79 ~~~~~~k~~~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~~G~~~~~~ik~~I~~ 158 (193) T protein:vir:96 79 VGVRFVRNDFPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQNRIAMRLARGQITPDQALAQIGLALEGYIARSIRT 158 (193) T ss_pred cccceeccCcceeeEeecceeccCCCcchhhhhHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc Confidence 7999999999999999999999999999999996 99999999999999999999998 Q ss_pred cCCCCCCccHHHHHhcCCCCcchhHHHHHhhcceeee Q lcl|NC_020201. 112 GSFSNPKVSKDWASYKGFDDAMIHYGDLSSAATYKIV 148 (153) Q Consensus 112 ~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~Sity~V~ 148 (153) +. .|||||+||++||||+||||||+|++||||+|+ T Consensus 159 ~~--~ppna~~Ti~~KG~~~PLidTG~l~~SIty~Vv 193 (193) T protein:vir:96 159 GP--WVANSASTVRRKGFNRPLVDTAHMLQSISSRVT 193 (193) T ss_pred CC--CCCCcHHHHHHhCCCCchhHHHHHHhhhcceeC Confidence 55 589999999999999999999999999999999 No 5 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=100.00 E-value=7.1e-49 Score=284.54 Aligned_cols=143 Identities=22% Similarity=0.361 Sum_probs=131.8 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCC--------------------------- Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGW--------------------------- 53 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~--------------------------- 53 (153) |++. .|.+.+++++++|++|++++|+|||+.+ ||.++++||.+||||+ T Consensus 1 m~vt--~~~~~~~~~~~~l~~L~~k~v~vGi~~~---d~~~~~~Ia~~~E~Ga~I~~~~~~l~Ip~~~a~~~k~~~~~~~ 75 (199) T protein:vir:80 1 MKVT--TDKSTMNKAIRELDQLDRYSLQIGLFGE---DDSFIQMIAGVHEFGLTIRPKGKYLTIPTPEAGDRRARDIPGL 75 (199) T ss_pred Cccc--ccHHHHHHHHHHHHHhcCCEEEEEEecC---CCcchhheeehhhcCCeeecCCceeeecchhhhcccccccCcc Confidence 6554 6778899999999999999999999964 6888999999999992 Q ss_pred ---------------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCC-CHHHHHHHHHHHHHHHH Q lcl|NC_020201. 54 ---------------------------NNLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGR-GFSSYLTKIGKDAADSI 105 (153) Q Consensus 54 ---------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~-~~~~~L~~iG~~~~~~i 105 (153) .+||+|||||+|+++++++|.+.++++++++++|+ +++++|+.+|+.++++| T Consensus 76 ~~p~g~~~~~~~~~~~~~~~~e~g~~~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~vl~g~~~a~~~L~~~G~~~~~~I 155 (199) T protein:vir:80 76 FKPKGKNILAVAGPDGKLTVMFYLKTEVNIPERSFLRSTFDEKSNKWGELFEGWIDDVIHGKLSAEQVYNRLGAKIVDDI 155 (199) T ss_pred cccCCcceeeeeccccceeeeeeccccccCCCCchhHHHHHHHHHHHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHHH Confidence 16999999999999999999999999999999996 99999999999999999 Q ss_pred HHHHhccCCCCCCccHHHHH-hcCCCCcchhHHHHHhhcceeeeeC Q lcl|NC_020201. 106 RFTISTGSFSNPKVSKDWAS-YKGFDDAMIHYGDLSSAATYKIVKY 150 (153) Q Consensus 106 ~~~I~~~~~~~p~ns~~Ti~-~KG~~~PLidTG~L~~Sity~V~~~ 150 (153) |.+|.++. .|||||+|++ |||||+||||||+|++||+|+|++. T Consensus 156 k~~I~~~~--~ppna~~Tia~rKg~~kPLidTG~l~~SIty~V~~~ 199 (199) T protein:vir:80 156 QMKIVEIQ--TPAKSAATLARNPRKNNPLIVTGKMKNSVTWKVMKS 199 (199) T ss_pred HHHHhccC--CCCCCHHHHHHhcCCCCchHHHHHHHhhcceeeeeC Confidence 99999865 5899999996 8999999999999999999999999 No 6 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=100.00 E-value=9.1e-48 Score=278.48 Aligned_cols=136 Identities=28% Similarity=0.353 Sum_probs=119.7 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCC-C------------------CCCHHHHHHHHhcCCCCCCCCch Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHY-S------------------GLNMATLAAIHEEGWNNLPERNF 61 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~-d------------------G~~va~iA~~~E~G~~~IP~RpF 61 (153) |++ +.++|++++++ |++++|+|||++++.| | |+|+|+||+|||||+.+|||||| T Consensus 1 m~v----~~k~L~~~~~~---l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~~~IP~RPF 73 (155) T protein:vir:10 1 MSV----TRRGLTLPKDR---YRSMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGTSKLPARPF 73 (155) T ss_pred Ccc----hHHHHHHHHHH---HhCCeeEEeecCCCCCccccchhhhhhhhcccccccCCcHHHHHHHHhcCCCCCCCcch Confidence 544 44567777654 5679999999998543 3 89999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 62 MFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 62 lr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) ||+|+++++++|.+.+++++ ..+.+++++|+++|+.++++||.+|++ + +|||||+|+++||||+||||||+|++ T Consensus 74 lr~t~~~~~~~~~~~l~~~~---~~~~~~~~~L~~lG~~~~~~Ik~~I~~--~-~~pna~~Ti~~KG~~kPLidTG~l~~ 147 (155) T protein:vir:10 74 MEKTIADRSAEWIKGLTVMM---TMGYDAEVAMGQIGQAMKDDIKTTISE--W-PADNSADWAGKKGFNHGLIWTSHLLN 147 (155) T ss_pred hHHHHHHHHHHHHHHHHHHH---HcCCCHHHHHHHHHHHHHHHHHHHHhc--C-CCCCcHHHHHhcCCCCchhHHHHHHH Confidence 99999999999998877654 345689999999999999999999987 4 48899999999999999999999999 Q ss_pred hcceeeee Q lcl|NC_020201. 142 AATYKIVK 149 (153) Q Consensus 142 Sity~V~~ 149 (153) ||+|+|++ T Consensus 148 SIty~Vv~ 155 (155) T protein:vir:10 148 SVEQEIVK 155 (155) T ss_pred hhhhhccC Confidence 99999999 No 7 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=100.00 E-value=9.3e-48 Score=278.44 Aligned_cols=136 Identities=28% Similarity=0.356 Sum_probs=119.8 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCC-C------------------CCCHHHHHHHHhcCCCCCCCCch Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHY-S------------------GLNMATLAAIHEEGWNNLPERNF 61 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~-d------------------G~~va~iA~~~E~G~~~IP~RpF 61 (153) |++ +.++|++++++ |++++|+|||++++.| | |+|+|+||+|||||+.+|||||| T Consensus 1 m~v----~~k~L~~~~~~---l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G~~~IP~RPF 73 (155) T protein:vir:78 1 MSV----TRRGLTLPKDR---YRSMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYGTSKLPARPF 73 (155) T ss_pred Ccc----hHHHHHHHHHH---HhCCeeEEeecCCCCCCcccchhhhhhhhcccccccCCcHHHHHHhhhcCCCCCCCcch Confidence 544 44567777654 5679999999998543 3 89999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 62 MFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 62 lr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) ||+|+++++++|.+.+++++. .+.+++++|+++|+.++++||.+|++ + +|||||+|+++||||+||||||+|++ T Consensus 74 lr~t~~~~~~~~~~~l~~~~~---~~~~~~~~L~~~G~~~~~~Ik~~I~~--~-~~pna~~Ti~~Kg~~kPLidTG~l~~ 147 (155) T protein:vir:78 74 MEKTITDRSAEWIKGLTVMMT---MGYDAEVAMGQIGQAMKDDIKTTISE--W-PADNSADWAGKKGFNHGLIWTSHLLN 147 (155) T ss_pred hhHHHHHHHHHHHHHHHHHHH---cCCCHHHHHHHHHHHHHHHHHHHHhc--C-CCCCcHHHHHhcCCCCchhHHHHHHH Confidence 999999999999988776543 45689999999999999999999987 4 37899999999999999999999999 Q ss_pred hcceeeee Q lcl|NC_020201. 142 AATYKIVK 149 (153) Q Consensus 142 Sity~V~~ 149 (153) ||+|+|++ T Consensus 148 SIty~V~~ 155 (155) T protein:vir:78 148 SVEQEIVK 155 (155) T ss_pred hhhhhccC Confidence 99999999 No 8 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=100.00 E-value=3.4e-47 Score=275.32 Aligned_cols=144 Identities=22% Similarity=0.290 Sum_probs=125.4 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCC------------------CCCCHHHHHHHHhcCCCCCCCCchh Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHY------------------SGLNMATLAAIHEEGWNNLPERNFM 62 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~------------------dG~~va~iA~~~E~G~~~IP~RpFl 62 (153) |+..-+. +++...+.+..|.+..|+|||+++..| +|+++|+||+|||||+.+||+|||| T Consensus 1 ~~~~~~~---g~~~~~~~~~~l~~~~v~vG~l~~a~yp~G~~~~~~~~~~~~~~~~g~~va~Ia~~~E~G~~~IP~RPFl 77 (168) T protein:vir:94 1 MTTIARK---GVKMPPHLEAQFQSGEVKAGVLSGSTYPQMTYTDQRTGKQIEDARGGMPVAVIAQALEYGHGQNHPRPFM 77 (168) T ss_pred Cccccch---hhhhhHHHHHhhhccceeeeccccCcccccccchhhcccccccccccccHHHHHHHHhcCCCCCCCchhh Confidence 8776654 467777778888889999999876433 4579999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHhh Q lcl|NC_020201. 63 FSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSSA 142 (153) Q Consensus 63 r~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~S 142 (153) |+|+++++++|.+.++++++ .+.+++++|+.+|+.++++||.+|.+ + +|||||+||++||||+||||||+|++| T Consensus 78 r~t~~~~~~~~~~~~~~~~~---~~~~~~~~L~~lG~~~~~~Ik~~I~~--~-~ppna~sTi~~KG~~~PLiDTG~l~~S 151 (168) T protein:vir:94 78 QQTYAAQYRAWSRDLTLTLK---AGAAADTALRTVGQRMAEDIQDTIRN--W-PADNSPEWAAIKGFNAGLRQTGVLLNA 151 (168) T ss_pred HHHHHHHHHHHHHHHHHHHh---cCCCHHHHHHHHHHHHHHHHHHHhhc--C-CCCccHHHHHhcCCCCchhHHHHHHhh Confidence 99999999999988776443 24599999999999999999999987 5 489999999999999999999999999 Q ss_pred cceeeeeCCCC Q lcl|NC_020201. 143 ATYKIVKYQGK 153 (153) Q Consensus 143 ity~V~~~~gk 153 (153) |+|+|++++-- T Consensus 152 Ity~Vv~d~~~ 162 (168) T protein:vir:94 152 IDSAVIIDGEH 162 (168) T ss_pred cceeeeecCCC Confidence 99999976544 No 9 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=100.00 E-value=1e-46 Score=272.67 Aligned_cols=136 Identities=27% Similarity=0.329 Sum_probs=118.4 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCC-C------------------CCCHHHHHHHHhcCCCCCCCCch Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHY-S------------------GLNMATLAAIHEEGWNNLPERNF 61 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~-d------------------G~~va~iA~~~E~G~~~IP~RpF 61 (153) |++. .++|++++++ |++++|+|||+++++| | |+|+|+||+|||||+.+|||||| T Consensus 1 m~v~----r~~L~~~~~~---l~~~~V~VGi~~~a~y~d~~g~~~~~g~~~~~~~~~G~pva~ia~~~e~G~~~IP~RPF 73 (155) T protein:vir:10 1 MSVT----RRGLTLPKDR---YKSMSVKAGVLAGATYPDESGKKLADGTILKKDPRAGLPVAMIAMALNYGTSKLPARPF 73 (155) T ss_pred Ccch----HHHHHHHHHH---hhCCeeEEeecCCCCCCccccchhhhhhhhccccccCcchhhhhhhhhcCCCCCCCcch Confidence 6554 4567777765 4568899999987654 4 89999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 62 MFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 62 lr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) ||+|+++++++|.+.+++++ ..+.+++++|+.+|+.++++||.+|.++. +| |+++|+++||||+||||||+|++ T Consensus 74 lr~t~~~~~~~~~~~l~~~~---~~~~~~~~~L~~~G~~~~~~Ik~~I~~~~--~p-~~~~Ti~~KG~~~PLidTG~l~~ 147 (155) T protein:vir:10 74 MEKTIADRSAEWIKGLTVMM---TMGYDAEVAMGQIGQAMKDDIKTTISEWP--AD-NNADWAGKKGFNHGLIWTSHLLN 147 (155) T ss_pred hHHHHHHHHHHHHHHHHHHH---HcCCCHHHHHHHHHHHHHHHHHHHHhcCC--CC-CChHHHHhcCCCCchHHHHHHHH Confidence 99999999999998877654 34568999999999999999999999854 44 57899999999999999999999 Q ss_pred hcceeeee Q lcl|NC_020201. 142 AATYKIVK 149 (153) Q Consensus 142 Sity~V~~ 149 (153) ||+|+|++ T Consensus 148 Sity~Vv~ 155 (155) T protein:vir:10 148 SIEQEIVK 155 (155) T ss_pred hhhhhccC Confidence 99999999 No 10 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=100.00 E-value=3e-46 Score=270.14 Aligned_cols=136 Identities=27% Similarity=0.334 Sum_probs=118.0 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCC-C------------------CCCHHHHHHHHhcCCCCCCCCch Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHY-S------------------GLNMATLAAIHEEGWNNLPERNF 61 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~-d------------------G~~va~iA~~~E~G~~~IP~RpF 61 (153) |++.- .+|+.++++ |.+++|+|||++++.| | |+|+|+||+|||||+.+|||||| T Consensus 1 m~~~r----~~l~~~~~~---l~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~G~pva~ia~~~e~G~~~IP~RPF 73 (155) T protein:vir:77 1 MSVTR----RGLTLPKDR---YRSMSVKAGVLAGATYPDESGKKLADGSILKKDPRAGLPVAMIAMALNYGTSKLPARPF 73 (155) T ss_pred CcchH----HHHHHHHHH---HhcCceEEeecCCCCCccccchhhhhhhhccccccccccHhhhhhhhhcCCCCCCCCch Confidence 66544 456666554 5678999999988543 3 79999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 62 MFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 62 lr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) ||+|+++++++|.+.+++++ ..+.+++++|+.+|+.++++||.+|+++. +| |+++|+++||||+||||||+|++ T Consensus 74 lr~t~~~~~~~~~~~l~~~~---~~~~~~~~~L~~lG~~~~~~Iq~~I~~~~--~p-~~~~Ti~~KG~d~PLidTG~l~~ 147 (155) T protein:vir:77 74 MEKTIADRSAEWIKGLTVMM---TMGYDAEVAMGQIGQAMKDDIKTTISEWP--AD-NNADWAGKKGFNHGLIWTSHLLN 147 (155) T ss_pred hhHHHHHHHHHHHHHHHHHH---HccCcHHHHHHHHHHHHHHHHHHHHhcCC--CC-CChHHHHhcCCCCchhHHHHHHH Confidence 99999999999998887654 34568999999999999999999999865 44 57899999999999999999999 Q ss_pred hcceeeee Q lcl|NC_020201. 142 AATYKIVK 149 (153) Q Consensus 142 Sity~V~~ 149 (153) ||+|+|++ T Consensus 148 SIty~Vv~ 155 (155) T protein:vir:77 148 SIEQEIVK 155 (155) T ss_pred hhhhhccC Confidence 99999999 No 11 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=100.00 E-value=6.5e-46 Score=268.32 Aligned_cols=149 Identities=17% Similarity=0.168 Sum_probs=126.9 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcC--CCCCCCHHHHHHHHhcCCCCCCCCchhhHHHH-----HHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEK--HYSGLNMATLAAIHEEGWNNLPERNFMFSTSM-----HFQEGL 73 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~--~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~-----~~~~~~ 73 (153) |=.++ +..++++|.++|++|+++.|+||||+++ ++||+|+++||+|||||+.+||+|||||++++ +++..+ T Consensus 1 ~~~~~--~~~G~~~L~~~~k~l~~~~V~VGi~~d~g~~~dG~sv~~vA~~~EfG~~~iPaRPf~R~tfe~~~~~~~~~~~ 78 (160) T protein:vir:95 1 MVKRV--IHPARAKLVGAMKNLQTANAQVGYFQEQGQHSSGFSYPALMYLQEVIGVPSASGKVYRRLFEITMMLNKQTLL 78 (160) T ss_pred Cceee--chHhHHHHHHHHHHHhCCeeEEeeccccccCCCCccHHHHHhhhhcCcccCCCcchhHHHHHHHHHHHHHHHH Confidence 44333 5678999999999999999999999865 77999999999999999999999999999997 445556 Q ss_pred HHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccC--CCCCCccHHHHHhcCCCCcchhHHHHHhhcceeeeeCC Q lcl|NC_020201. 74 RKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGS--FSNPKVSKDWASYKGFDDAMIHYGDLSSAATYKIVKYQ 151 (153) Q Consensus 74 ~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~--~~~p~ns~~Ti~~KG~~~PLidTG~L~~Sity~V~~~~ 151 (153) .++.++....+..|++ ...+.+|+.++++|+..|.+.. -.+|||||+||++||||+||||||+|++||+|+|++++ T Consensus 79 ~~~~~~i~~~~~~g~~--~~~~~LG~~~~~~ik~~I~~~~~p~~w~pNap~Ti~~Kgs~~PLiDTg~l~~Si~y~v~~~~ 156 (160) T protein:vir:95 79 EQTKKNLYKQLSSLNT--DPSNTLEAFAKNAQKAIKRGFGNSAILPPNAPSTVKKKGFNAPLVETGDLRDNLAYKISTKK 156 (160) T ss_pred HHHHHHHHHHHhhcch--hHHHHHHHHHHHHHHHHHhhcCCccCCCCCcHHHHHhcCCCCcchhhHHHhhhhhheeeccc Confidence 6666667778888874 2445699999999999998721 12458999999999999999999999999999999999 Q ss_pred CC Q lcl|NC_020201. 152 GK 153 (153) Q Consensus 152 gk 153 (153) |= T Consensus 157 ~~ 158 (160) T protein:vir:95 157 GI 158 (160) T ss_pred cc Confidence 97 No 12 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=98.69 E-value=3.2e-11 Score=78.13 Aligned_cols=99 Identities=14% Similarity=0.119 Sum_probs=65.6 Q ss_pred CCccccccHHHHHHHHHHHHHhhCC---EE-------------------------------------------------- Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEK---EV-------------------------------------------------- 27 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~---~v-------------------------------------------------- 27 (153) |+.+++.+..+|++|.++|+.|... .+ T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~ 80 (164) T protein:vir:43 1 MADTVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGD 80 (164) T ss_pred CCcceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccccc Confidence 9999998889999999998888432 11 Q ss_pred ---EEeecCcC-CC--------CCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHH Q lcl|NC_020201. 28 ---EYGFYDEK-HY--------SGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLT 95 (153) Q Consensus 28 ---~VGi~~~~-~~--------dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~ 95 (153) .||+.... .+ .+-+.+.++.++||||.+.||||||||+++++++++.+.+.+-+... ++.+|. T Consensus 81 ~~~~vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT~km~a~PFlrPA~~~~k~~~~~~~~~~l~~~-----i~ka~~ 155 (164) T protein:vir:43 81 LGFRIGVLHGAVLPKKGERSDKTANAPTPHWRLLEFGTEDMRAQPFMRSALADNIAEVTSTFVSEYEKG-----IDRAIK 155 (164) T ss_pred eeEEecccccccccccccccccCCCCCcceEEEeecCCCCCCCCcchhhhHHHhHHHHHHHHHHHHHHH-----HHHHHH Confidence 11111000 00 00112457889999999999999999999999998877777655433 335555 Q ss_pred HHHHHHHHH Q lcl|NC_020201. 96 KIGKDAADS 104 (153) Q Consensus 96 ~iG~~~~~~ 104 (153) +.+..++.- T Consensus 156 k~~~~~~~~ 164 (164) T protein:vir:43 156 RAAKKAAQG 164 (164) T ss_pred HHHhhhccC Confidence 555544444 No 13 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=98.59 E-value=2.9e-11 Score=78.32 Aligned_cols=99 Identities=10% Similarity=0.065 Sum_probs=57.5 Q ss_pred CCccccccHHHHHHHHHHHHHhhCC---E--------------------------------------------------- Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEK---E--------------------------------------------------- 26 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~---~--------------------------------------------------- 26 (153) |+.+++.+..+|++|.+.|+.|... . T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~ 80 (179) T protein:vir:18 1 MADSVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGD 80 (179) T ss_pred CCceEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccccc Confidence 9998988888999998888877422 0 Q ss_pred --EEEeecCcCC------------------------CCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 27 --VEYGFYDEKH------------------------YSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRM 80 (153) Q Consensus 27 --v~VGi~~~~~------------------------~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~ 80 (153) +.||+..+.. ..+-..+.++.+.||||.+.||||||||+++++++++.+.+.+. T Consensus 81 ~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~~~a~~~i~~~ 160 (179) T protein:vir:18 81 LAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVDNDVINVFSTE 160 (179) T ss_pred eeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccchhhHHhhHHHHHHHHHHH Confidence 1222211000 00111245667789999999999999999999987766555543 Q ss_pred HHHHHcCCCHHHHHHHHHHHHHHH Q lcl|NC_020201. 81 HNGIIQGRGFSSYLTKIGKDAADS 104 (153) Q Consensus 81 ~~~~~~G~~~~~~L~~iG~~~~~~ 104 (153) +...+ +.+|...+...... T Consensus 161 l~~~i-----~k~lk~~~~~~~~~ 179 (179) T protein:vir:18 161 MGKAI-----DRAIRLAMKKGTTA 179 (179) T ss_pred HHHHH-----HHHHHhhcccCCCC Confidence 33211 11221111111111 No 14 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=98.52 E-value=4.4e-10 Score=71.90 Aligned_cols=82 Identities=11% Similarity=0.132 Sum_probs=59.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccC----CCCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 66 SMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGS----FSNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~----~~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) +.+....+.+.++++.. .....|..+|..++..+++.|.++. -+++|+|++|+++|+.++||+|||.|++ T Consensus 1 ~i~~~~~i~~~l~~l~~------~~~~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~~~~L~~tG~L~~ 74 (145) T protein:vir:31 1 MVEDENNIPEAREAIQD------GLTDGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGSDTPLIDNSRLLT 74 (145) T ss_pred CcccHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcCCCCCccCHHHHH Confidence 44444445555554433 2345688899999999999997532 1357899999999999999999999999 Q ss_pred hcceeeeeC-CCC Q lcl|NC_020201. 142 AATYKIVKY-QGK 153 (153) Q Consensus 142 Sity~V~~~-~gk 153 (153) ||+|.+... .+. T Consensus 75 Si~~~~~~~~~~~ 87 (145) T protein:vir:31 75 DINAASMMDRANR 87 (145) T ss_pred HHHHHhhhcccCc Confidence 999987533 333 No 15 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=98.45 E-value=4.1e-10 Score=72.04 Aligned_cols=87 Identities=13% Similarity=0.164 Sum_probs=58.9 Q ss_pred CCccccccHHHHHHHHHHHHHhhC-CE----------------------------------------------------- Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAE-KE----------------------------------------------------- 26 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~-~~----------------------------------------------------- 26 (153) |+-+|+.+..+|++|+++|+.|.. .. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~~~~~~~~~~~d~i~~~~~ 80 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGRKGSRPPGHAANNIPEPKI 80 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccccccccccchhhhcceeccc Confidence 999888888889999998888842 11 Q ss_pred --------EEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHH Q lcl|NC_020201. 27 --------VEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIG 98 (153) Q Consensus 27 --------v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG 98 (153) +.||+..+ ++ +.+.++.+.||||.+.||+||||++++++++++.+.+.+-+...+.- + +| T Consensus 81 ~~~~g~~~~~VG~~~~---~~-~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~~-----~---lG 148 (149) T protein:vir:13 81 RKKKGNLQCVVGWEKS---DN-TPFYYMKMEEWGTSERPPHHAFGKTNKILKRVYDNIAQKKYDNFVKE-----K---LG 148 (149) T ss_pred ccccceeEEEeeccCC---CC-CccceeeeeccCccCCCCCccchHHHHHHHHHHHHHHHHHHHHHHHH-----H---hc Confidence 11221110 11 23567888999999999999999999999988877776655433321 0 11 Q ss_pred H Q lcl|NC_020201. 99 K 99 (153) Q Consensus 99 ~ 99 (153) . T Consensus 149 ~ 149 (149) T protein:vir:13 149 D 149 (149) T ss_pred C Confidence 1 No 16 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=98.42 E-value=6.1e-10 Score=71.12 Aligned_cols=85 Identities=15% Similarity=0.187 Sum_probs=57.6 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCE------------------------------------------------------ Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKE------------------------------------------------------ 26 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~------------------------------------------------------ 26 (153) |+..++.+..+|++|++.|+.|.... T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 88777777778888888777663220 Q ss_pred -------EEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHH Q lcl|NC_020201. 27 -------VEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYL 94 (153) Q Consensus 27 -------v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L 94 (153) +.||+-.. + -+-+.++.+.||||.+.||+|||+|+++.+++++.+.+.+.+...+ +.+| T Consensus 81 ~~~g~~~~~vg~~~~---~-~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l-----~ka~ 146 (146) T protein:vir:10 81 LEGGIKTVKIGLNKA---D-RSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEM-----RLDL 146 (146) T ss_pred ccccceeEEeeeccC---C-CCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH-----hhcC Confidence 11111000 0 1235788889999999999999999999999998887777665433 1222 No 17 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=98.42 E-value=6.1e-10 Score=71.12 Aligned_cols=85 Identities=15% Similarity=0.187 Sum_probs=57.6 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCE------------------------------------------------------ Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKE------------------------------------------------------ 26 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~------------------------------------------------------ 26 (153) |+..++.+..+|++|++.|+.|.... T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 88777777778888888777663220 Q ss_pred -------EEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHH Q lcl|NC_020201. 27 -------VEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYL 94 (153) Q Consensus 27 -------v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L 94 (153) +.||+-.. + -+-+.++.+.||||.+.||+|||+|+++.+++++.+.+.+.+...+ +.+| T Consensus 81 ~~~g~~~~~vg~~~~---~-~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l-----~ka~ 146 (146) T protein:vir:10 81 LEGGIKTVKIGLNKA---D-RSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEM-----RLDL 146 (146) T ss_pred ccccceeEEeeeccC---C-CCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH-----hhcC Confidence 11111000 0 1235788889999999999999999999999998887777665433 1222 No 18 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=98.42 E-value=6.1e-10 Score=71.12 Aligned_cols=85 Identities=15% Similarity=0.187 Sum_probs=57.6 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCE------------------------------------------------------ Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKE------------------------------------------------------ 26 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~------------------------------------------------------ 26 (153) |+..++.+..+|++|++.|+.|.... T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 88777777778888888777663220 Q ss_pred -------EEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHH Q lcl|NC_020201. 27 -------VEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYL 94 (153) Q Consensus 27 -------v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L 94 (153) +.||+-.. + -+-+.++.+.||||.+.||+|||+|+++.+++++.+.+.+.+...+ +.+| T Consensus 81 ~~~g~~~~~vg~~~~---~-~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l-----~ka~ 146 (146) T protein:vir:10 81 LEGGIKTVKIGLNKA---D-RSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEM-----RLDL 146 (146) T ss_pred ccccceeEEeeeccC---C-CCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH-----hhcC Confidence 11111000 0 1235788889999999999999999999999998887777665433 1222 No 19 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=98.42 E-value=6.1e-10 Score=71.12 Aligned_cols=85 Identities=15% Similarity=0.187 Sum_probs=57.6 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCE------------------------------------------------------ Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKE------------------------------------------------------ 26 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~------------------------------------------------------ 26 (153) |+..++.+..+|++|++.|+.|.... T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 80 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPWRTGQHGADQIKVTKAK 80 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccccccccccccceecccc Confidence 88777777778888888777663220 Q ss_pred -------EEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHH Q lcl|NC_020201. 27 -------VEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYL 94 (153) Q Consensus 27 -------v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L 94 (153) +.||+-.. + -+-+.++.+.||||.+.||+|||+|+++.+++++.+.+.+.+...+ +.+| T Consensus 81 ~~~g~~~~~vg~~~~---~-~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l-----~ka~ 146 (146) T protein:vir:10 81 LEGGIKTVKIGLNKA---D-RSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEM-----RLDL 146 (146) T ss_pred ccccceeEEeeeccC---C-CCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH-----hhcC Confidence 11111000 0 1235788889999999999999999999999998887777665433 1222 No 20 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=98.38 E-value=1.4e-09 Score=69.18 Aligned_cols=89 Identities=13% Similarity=0.158 Sum_probs=55.8 Q ss_pred CCccc-cccHHHHHHHHHHHHHhhCCE------------------------------EEEeecCcCCCCCCC-------- Q lcl|NC_020201. 1 MAKKS-STDISELKRYFSQLSDLAEKE------------------------------VEYGFYDEKHYSGLN-------- 41 (153) Q Consensus 1 M~~~i-~~~~~~l~~~~~~l~~l~~~~------------------------------v~VGi~~~~~~dG~~-------- 41 (153) |++.+ +.|+++|...++.|.+..++- +.+-...+...+|.. T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s~~g~~~~~Vg~~~ 80 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEESVEGIQTYAVSWRK 80 (157) T ss_pred CeeEeecccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccCCCceEEEEEeecC Confidence 99999 578787777777665443221 111111111112211 Q ss_pred -HHHHHHHHhcC------------------------CCCCCCCchhhHHHHHHHHHHHHHHH----HHHHHHHcCCC Q lcl|NC_020201. 42 -MATLAAIHEEG------------------------WNNLPERNFMFSTSMHFQEGLRKHIK----RMHNGIIQGRG 89 (153) Q Consensus 42 -va~iA~~~E~G------------------------~~~IP~RpFlr~~~~~~~~~~~~~~~----~~~~~~~~G~~ 89 (153) -+-++.+.||| +..+||||||||+++..+++..+.+. +.+..++.|.+ T Consensus 81 ~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I~e~l~g~~ 157 (157) T protein:vir:97 81 KAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKYAELQRGDT 157 (157) T ss_pred CccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHHHHHHHHHHHHHHHHHHHhcCCC Confidence 13445667888 24599999999999999888776654 45667788877 No 21 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=98.35 E-value=5.4e-10 Score=71.37 Aligned_cols=89 Identities=17% Similarity=0.039 Sum_probs=54.2 Q ss_pred CCccccccHHHHHHHHHHHHHhhCC---EE-----E----------------------------------------Eeec Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEK---EV-----E----------------------------------------YGFY 32 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~---~v-----~----------------------------------------VGi~ 32 (153) |+++++. .+|++|+++|+.|... .+ . |++. T Consensus 2 m~~~~~i--~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~g~~~~~v~~~ 79 (148) T protein:vir:93 2 IETLLDF--SGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRRSRDGGMESGVHIR 79 (148) T ss_pred cceeeee--hhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceeccccccCCceeeeeeec Confidence 6665554 4677777777766321 11 0 0000 Q ss_pred CcC---C-------CCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH Q lcl|NC_020201. 33 DEK---H-------YSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTK 96 (153) Q Consensus 33 ~~~---~-------~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~ 96 (153) ... . ..+...+.++.+.||||.+.||||||+|+++++++++.+.+.+.+...+ +.+|.+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~pa~PFl~pA~~~~k~~~~~~~~~~~~~~i-----~k~~~k 148 (148) T protein:vir:93 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAI-----DEVLRR 148 (148) T ss_pred ccccccccccceeecCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH-----HHHhcC Confidence 000 0 0112335778889999999999999999999999888777776655332 233333 No 22 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=98.33 E-value=1.7e-09 Score=68.72 Aligned_cols=88 Identities=13% Similarity=0.161 Sum_probs=60.4 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCE------------------------EEEeecCcC--------CCCC-----CCHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKE------------------------VEYGFYDEK--------HYSG-----LNMA 43 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~------------------------v~VGi~~~~--------~~dG-----~~va 43 (153) |+..++.+..+|++|.+.|+.+.+.. +.-|-+.+. ..+| -+.+ T Consensus 1 Ma~~~~i~~~Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~~~~~~~~v~~~~ 80 (125) T protein:vir:94 1 MANDFNIKFKGVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVKEEHGVVTGRYVARA 80 (125) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecceeccCCcEEEEeeCCC Confidence 77776666667888888777664320 122221110 0011 2346 Q ss_pred HHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_020201. 44 TLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGR 88 (153) Q Consensus 44 ~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~ 88 (153) .+|.+.||||.+.|+||||+|+++.++..+.+.+++.++.++.-- T Consensus 81 ~Ya~~vEfGT~~~~a~Pfl~pa~~~~~~~~~~~l~~~l~~a~k~~ 125 (125) T protein:vir:94 81 DYSSYNEYGTYRMSAQPFMAPSVAAMTPFFYKAVRDALNKAAKFS 125 (125) T ss_pred CccceeecccccCCCCcccchhHHHHHHHHHHHHHHHHHHHhccC Confidence 889999999999999999999999999999888888777655322 No 23 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=98.31 E-value=9.7e-10 Score=69.99 Aligned_cols=89 Identities=10% Similarity=0.076 Sum_probs=63.7 Q ss_pred hhHHHH--HHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhc-----CCCCcch Q lcl|NC_020201. 62 MFSTSM--HFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYK-----GFDDAMI 134 (153) Q Consensus 62 lr~~~~--~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~K-----G~~~PLi 134 (153) |-..++ -+.+.+.+.+.++...+ .+...+|..||..+...+++.|.....+++|+||+|+++| +..++|+ T Consensus 1 M~~~i~i~~d~~~~~~~L~~l~~~~---~d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~r~~~g~~~~~iL~ 77 (155) T protein:vir:79 1 MTTRIDVELDDQEVRQRLAVLMRSV---TDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAAREAKGRGPHPILQ 77 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHh---hhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHHHhccCCCCCCccc Confidence 433221 12234555555554433 3677999999999999999999765556789999998765 3568999 Q ss_pred hHHHHHhhcceeeeeCCCC Q lcl|NC_020201. 135 HYGDLSSAATYKIVKYQGK 153 (153) Q Consensus 135 dTG~L~~Sity~V~~~~gk 153 (153) |||.|++||+|++-+..-. T Consensus 78 ~tG~L~~Si~~~~~~~~v~ 96 (155) T protein:vir:79 78 VTNALARSVTTWADRNEAG 96 (155) T ss_pred cchhhhhhhhceecCCEEE Confidence 9999999999987544333 No 24 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=98.28 E-value=1.7e-09 Score=68.69 Aligned_cols=82 Identities=18% Similarity=0.272 Sum_probs=55.7 Q ss_pred CCccccccHHHHHHHHHHHHHhhCC---E---------------------EEEeecCcC---CCCC-----CCHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEK---E---------------------VEYGFYDEK---HYSG-----LNMATLAAI 48 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~---~---------------------v~VGi~~~~---~~dG-----~~va~iA~~ 48 (153) |+++++ +++++.+.|+.+.+. . |.-|.+... ..+| .+.+.+|.+ T Consensus 1 msi~i~----Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~g~~~~V~~~~~Ya~y 76 (114) T protein:vir:95 1 MAIKWQ----GIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHITTSYPGMEAHIHGEAGYDGY 76 (114) T ss_pred Ceeeee----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhceeeecCceEEEeecCCCccce Confidence 988885 355555555544321 1 112221110 0122 234788999 Q ss_pred HhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020201. 49 HEEGWNNLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQ 86 (153) Q Consensus 49 ~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~ 86 (153) .||||...|+||||+|+++.++.++.+.++..++.-+. T Consensus 77 vE~GT~~~~aqPfl~pa~~~~~~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 77 QEYGTRFQPGTPHFRPMMEQIQPQFQKDMTDVMKGAFK 114 (114) T ss_pred eecCccccCCCccchhhHHHHHHHHHHHHHHHHHhhcC Confidence 99999999999999999999999999888888876655 No 25 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.27 E-value=2.1e-09 Score=68.16 Aligned_cols=90 Identities=13% Similarity=0.002 Sum_probs=62.4 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCC----CCCCccHHHHHhc--CCCCcch Q lcl|NC_020201. 61 FMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSF----SNPKVSKDWASYK--GFDDAMI 134 (153) Q Consensus 61 Flr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~----~~p~ns~~Ti~~K--G~~~PLi 134 (153) -+.-.+.-.-+++.+.+..++..+ .+...+|..||..+...+++.|.++.. +++|++++|+++| +..++|+ T Consensus 1 M~~i~i~~d~~~~~~~L~~l~~~~---~~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~rk~~~~~~~L~ 77 (190) T protein:vir:99 1 MAGITLEWDGRRALDVLNAGSAAL---GDPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRRKRKNRDKILT 77 (190) T ss_pred CceeEEEecHHHHHHHHHHHHHHh---hhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHHhhcCCCccce Confidence 221112222334555555555443 357789999999999999999987532 3578999998765 5679999 Q ss_pred hHHHHHhhcceeeeeCCCC Q lcl|NC_020201. 135 HYGDLSSAATYKIVKYQGK 153 (153) Q Consensus 135 dTG~L~~Sity~V~~~~gk 153 (153) |||.|++||+|++....=. T Consensus 78 ~tg~L~~Si~~~~~~~~v~ 96 (190) T protein:vir:99 78 LDGHLRNLLRYQLDGSELL 96 (190) T ss_pred ecHHHHHHHhheecCcEEE Confidence 9999999999987433222 No 26 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=98.25 E-value=1.8e-09 Score=68.50 Aligned_cols=89 Identities=10% Similarity=0.066 Sum_probs=63.6 Q ss_pred hhHHHH--HHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhc-----CCCCcch Q lcl|NC_020201. 62 MFSTSM--HFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYK-----GFDDAMI 134 (153) Q Consensus 62 lr~~~~--~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~K-----G~~~PLi 134 (153) |-..++ -+.+.+.+.+.++...+ .+...+|..||..+...+++.|.....+++|+||.|+++| +..++|+ T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~---~d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~r~~~g~~~~~iL~ 77 (155) T protein:vir:99 1 MTTRIDVELDDQEVRQRLALLMRSV---TDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAAREAKGRGPHPILQ 77 (155) T ss_pred CceEEEEEechHHHHHHHHHHHHHh---hhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHHHhccCCCCCCcch Confidence 432221 12244555555554433 3578999999999999999999755456789999998765 3467999 Q ss_pred hHHHHHhhcceeeeeCCCC Q lcl|NC_020201. 135 HYGDLSSAATYKIVKYQGK 153 (153) Q Consensus 135 dTG~L~~Sity~V~~~~gk 153 (153) |||.|++||+|.+.+..-. T Consensus 78 ~tg~L~~Si~~~~~~~~v~ 96 (155) T protein:vir:99 78 VTNALARSVTTWADRNEAG 96 (155) T ss_pred hchhhhhhhhceecCCEEE Confidence 9999999999987554333 No 27 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=98.23 E-value=1.8e-09 Score=68.59 Aligned_cols=89 Identities=10% Similarity=0.104 Sum_probs=62.4 Q ss_pred hhHHH--HHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHh-----cCCCCcch Q lcl|NC_020201. 62 MFSTS--MHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASY-----KGFDDAMI 134 (153) Q Consensus 62 lr~~~--~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~-----KG~~~PLi 134 (153) |-..+ .-+...+.+.+.++...+ .+...+|..||..+...+++.|.....+++|+||.|+++ +|..++|+ T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~~---~~~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~~~~~~L~ 77 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAAV---TDTLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGRGAHPILQ 77 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHHh---hhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccCCCCCccc Confidence 33222 112234555555544433 357799999999999999999975445678999999754 35678999 Q ss_pred hHHHHHhhcceeeeeCCCC Q lcl|NC_020201. 135 HYGDLSSAATYKIVKYQGK 153 (153) Q Consensus 135 dTG~L~~Sity~V~~~~gk 153 (153) |||.|++||+|.+....-. T Consensus 78 ~tG~L~~Si~~~~~~~~v~ 96 (155) T protein:vir:10 78 VTNALARSITTRADRDQAQ 96 (155) T ss_pred cchhhhhhhhceecCCEEE Confidence 9999999999987544333 No 28 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=98.22 E-value=1.2e-09 Score=69.41 Aligned_cols=80 Identities=16% Similarity=0.171 Sum_probs=56.4 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCE----------------------EEEeecCcC-----CCCC-----CCHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKE----------------------VEYGFYDEK-----HYSG-----LNMATLAAI 48 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~----------------------v~VGi~~~~-----~~dG-----~~va~iA~~ 48 (153) |+.+|+.+ +|++++++|+++.... |.-|-+... ..+| .+.+.+|.+ T Consensus 1 M~~~i~i~--Gld~l~~~L~~~~~~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~ 78 (112) T protein:vir:36 1 MKSSLSFK--GIDQLVKHLDKAASLKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMELTEGGFSGQAGPHTDYSAY 78 (112) T ss_pred Cceeeeeh--hHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeeecCCceEEEeecCCCccce Confidence 88888654 6777777777653321 111211110 1123 244789999 Q ss_pred HhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 49 HEEGWNNLPERNFMFSTSMHFQEGLRKHIKRMHN 82 (153) Q Consensus 49 ~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 82 (153) .||||...|+||||+|+++.++.++.+.++++++ T Consensus 79 vE~GT~k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 79 VEYGTRFQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred eeccccccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 9999999999999999999999998888888777 No 29 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=98.19 E-value=3.3e-09 Score=67.08 Aligned_cols=89 Identities=19% Similarity=0.126 Sum_probs=53.4 Q ss_pred CCccccccHHHHHHHHHHHHHhhCC---EE-----EEe-----------------------------------------e Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEK---EV-----EYG-----------------------------------------F 31 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~---~v-----~VG-----------------------------------------i 31 (153) |.++++. .+|++|++.|+.|... .+ ..| + T Consensus 2 m~~~~~i--~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~~ 79 (149) T protein:vir:19 2 IETSLDF--SGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) T ss_pred cceeeeh--hhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeecccc Confidence 6666654 3677777766666321 00 000 0 Q ss_pred cCcCC---C-------CCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH Q lcl|NC_020201. 32 YDEKH---Y-------SGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTK 96 (153) Q Consensus 32 ~~~~~---~-------dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~ 96 (153) ..... . .+-+.+.++.+.||||.+.||+|||+|+++++++++.+.+.+.+...+ +.+|.+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PF~~pA~~~~k~~~~~~~~~~l~~~l-----~k~~~k 149 (149) T protein:vir:19 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAI-----DEVLSK 149 (149) T ss_pred cccccccccccceeecCCCCccceeeeeccCCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHH-----HHHhcC Confidence 00000 0 011235678899999999999999999999999988777776655432 233333 No 30 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=98.12 E-value=4e-09 Score=66.60 Aligned_cols=79 Identities=11% Similarity=0.063 Sum_probs=50.5 Q ss_pred CCccccccHHHHHHHHHHHHH-------hhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSD-------LAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGL 73 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~-------l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~ 73 (153) |...+..+...=..+.+.+.. -+...|.||+..+ .+.++.+.||||.+.||||||+++++.+++++ T Consensus 42 ~k~~ap~~~~~tg~l~~~I~~~~~k~~~~g~~~v~Vg~~~~-------~~~y~~f~E~GT~~~~a~Pf~~pa~~~~~~~~ 114 (127) T protein:vir:12 42 QRSHVNRSDKKQPHMQDNITVSNVRESKDGVRFVAVGPNKK-------VAYRGRFLEWGTSKMPPQPFIEKGGKEGEGPA 114 (127) T ss_pred HHHhCCCCCCChhHHHHhhhccccccccCceeEEEEeeCCC-------CcceeeeeccCccCCCCCccchHhHHHHHHHH Confidence 211111110000122222211 1234677776432 36788999999999999999999999999999 Q ss_pred HHHHHHHHHHHHc Q lcl|NC_020201. 74 RKHIKRMHNGIIQ 86 (153) Q Consensus 74 ~~~~~~~~~~~~~ 86 (153) .+.+++.+...+. T Consensus 115 ~~~~~~~~~~~lk 127 (127) T protein:vir:12 115 VELMERILTAPIK 127 (127) T ss_pred HHHHHHHHHHhcC Confidence 9999888876665 No 31 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=98.02 E-value=1.1e-08 Score=64.28 Aligned_cols=89 Identities=9% Similarity=-0.005 Sum_probs=60.7 Q ss_pred hhHHH--HHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccC-CCCCCccHHHHHhc----------- Q lcl|NC_020201. 62 MFSTS--MHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGS-FSNPKVSKDWASYK----------- 127 (153) Q Consensus 62 lr~~~--~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~-~~~p~ns~~Ti~~K----------- 127 (153) |-..+ .-.-+.+.+.+.++... +.+...+|..||..+...+++.|.++. -.++|+||+|+++| T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~---~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~ 77 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQA---GHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNG 77 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHH---hcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccc Confidence 33211 11123455555544433 346788999999999999999998641 13468999997542 Q ss_pred ----------CCCCcchhHHHHHhhcceeeeeCCCC Q lcl|NC_020201. 128 ----------GFDDAMIHYGDLSSAATYKIVKYQGK 153 (153) Q Consensus 128 ----------G~~~PLidTG~L~~Sity~V~~~~gk 153 (153) +..++|+|||.|++||+|.+.+..=. T Consensus 78 ~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~ 113 (175) T protein:vir:79 78 ELTAAASRRKAGLMILQDSGQMAASTATDSGEDYSV 113 (175) T ss_pred cchhhHhhhccCCCcceechhhhhhhhheecCCEEE Confidence 46789999999999999987544322 No 32 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=97.99 E-value=9e-09 Score=64.69 Aligned_cols=82 Identities=12% Similarity=0.136 Sum_probs=47.7 Q ss_pred CCccccccH-HHHHHHHHHHHHhhCC----EEEEeecCcC----CCCC-----CCHHHHHHHHhcCCCCCCCCchhhHHH Q lcl|NC_020201. 1 MAKKSSTDI-SELKRYFSQLSDLAEK----EVEYGFYDEK----HYSG-----LNMATLAAIHEEGWNNLPERNFMFSTS 66 (153) Q Consensus 1 M~~~i~~~~-~~l~~~~~~l~~l~~~----~v~VGi~~~~----~~dG-----~~va~iA~~~E~G~~~IP~RpFlr~~~ 66 (153) +.-.++.-. +....+.+..+.+... -+.-|.+... ..+| .+.+.+|.+.||||...|+||||+|++ T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~ 99 (115) T protein:vir:78 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVY 99 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchhhhcccccccCCCCchhhhH Confidence 111111111 1123344444443321 1122221110 0112 123689999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_020201. 67 MHFQEGLRKHIKRMHN 82 (153) Q Consensus 67 ~~~~~~~~~~~~~~~~ 82 (153) +.++..+.+.++++++ T Consensus 100 ~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 100 EVIRKSTVEELKALFE 115 (115) T ss_pred HHHHHHHHHHHHHHhC Confidence 9999999999988877 No 33 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=97.99 E-value=9e-09 Score=64.69 Aligned_cols=82 Identities=12% Similarity=0.136 Sum_probs=47.7 Q ss_pred CCccccccH-HHHHHHHHHHHHhhCC----EEEEeecCcC----CCCC-----CCHHHHHHHHhcCCCCCCCCchhhHHH Q lcl|NC_020201. 1 MAKKSSTDI-SELKRYFSQLSDLAEK----EVEYGFYDEK----HYSG-----LNMATLAAIHEEGWNNLPERNFMFSTS 66 (153) Q Consensus 1 M~~~i~~~~-~~l~~~~~~l~~l~~~----~v~VGi~~~~----~~dG-----~~va~iA~~~E~G~~~IP~RpFlr~~~ 66 (153) +.-.++.-. +....+.+..+.+... -+.-|.+... ..+| .+.+.+|.+.||||...|+||||+|++ T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~ 99 (115) T protein:vir:96 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVY 99 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchhhhcccccccCCCCchhhhH Confidence 111111111 1123344444443321 1122221110 0112 123689999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_020201. 67 MHFQEGLRKHIKRMHN 82 (153) Q Consensus 67 ~~~~~~~~~~~~~~~~ 82 (153) +.++..+.+.++++++ T Consensus 100 ~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 100 EVIRKSTVEELKALFE 115 (115) T ss_pred HHHHHHHHHHHHHHhC Confidence 9999999999988877 No 34 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=97.99 E-value=9e-09 Score=64.69 Aligned_cols=82 Identities=12% Similarity=0.136 Sum_probs=47.7 Q ss_pred CCccccccH-HHHHHHHHHHHHhhCC----EEEEeecCcC----CCCC-----CCHHHHHHHHhcCCCCCCCCchhhHHH Q lcl|NC_020201. 1 MAKKSSTDI-SELKRYFSQLSDLAEK----EVEYGFYDEK----HYSG-----LNMATLAAIHEEGWNNLPERNFMFSTS 66 (153) Q Consensus 1 M~~~i~~~~-~~l~~~~~~l~~l~~~----~v~VGi~~~~----~~dG-----~~va~iA~~~E~G~~~IP~RpFlr~~~ 66 (153) +.-.++.-. +....+.+..+.+... -+.-|.+... ..+| .+.+.+|.+.||||...|+||||+|++ T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~ 99 (115) T protein:vir:10 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVY 99 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchhhhcccccccCCCCchhhhH Confidence 111111111 1123344444443321 1122221110 0112 123689999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_020201. 67 MHFQEGLRKHIKRMHN 82 (153) Q Consensus 67 ~~~~~~~~~~~~~~~~ 82 (153) +.++..+.+.++++++ T Consensus 100 ~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 100 EVIRKSTVEELKALFE 115 (115) T ss_pred HHHHHHHHHHHHHHhC Confidence 9999999999988877 No 35 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=97.99 E-value=9e-09 Score=64.69 Aligned_cols=82 Identities=12% Similarity=0.136 Sum_probs=47.7 Q ss_pred CCccccccH-HHHHHHHHHHHHhhCC----EEEEeecCcC----CCCC-----CCHHHHHHHHhcCCCCCCCCchhhHHH Q lcl|NC_020201. 1 MAKKSSTDI-SELKRYFSQLSDLAEK----EVEYGFYDEK----HYSG-----LNMATLAAIHEEGWNNLPERNFMFSTS 66 (153) Q Consensus 1 M~~~i~~~~-~~l~~~~~~l~~l~~~----~v~VGi~~~~----~~dG-----~~va~iA~~~E~G~~~IP~RpFlr~~~ 66 (153) +.-.++.-. +....+.+..+.+... -+.-|.+... ..+| .+.+.+|.+.||||...|+||||+|++ T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~ 99 (115) T protein:vir:93 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVY 99 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchhhhcccccccCCCCchhhhH Confidence 111111111 1123344444443321 1122221110 0112 123689999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_020201. 67 MHFQEGLRKHIKRMHN 82 (153) Q Consensus 67 ~~~~~~~~~~~~~~~~ 82 (153) +.++..+.+.++++++ T Consensus 100 ~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 100 EVIRKSTVEELKALFE 115 (115) T ss_pred HHHHHHHHHHHHHHhC Confidence 9999999999988877 No 36 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=97.99 E-value=9e-09 Score=64.69 Aligned_cols=82 Identities=12% Similarity=0.136 Sum_probs=47.7 Q ss_pred CCccccccH-HHHHHHHHHHHHhhCC----EEEEeecCcC----CCCC-----CCHHHHHHHHhcCCCCCCCCchhhHHH Q lcl|NC_020201. 1 MAKKSSTDI-SELKRYFSQLSDLAEK----EVEYGFYDEK----HYSG-----LNMATLAAIHEEGWNNLPERNFMFSTS 66 (153) Q Consensus 1 M~~~i~~~~-~~l~~~~~~l~~l~~~----~v~VGi~~~~----~~dG-----~~va~iA~~~E~G~~~IP~RpFlr~~~ 66 (153) +.-.++.-. +....+.+..+.+... -+.-|.+... ..+| .+.+.+|.+.||||...|+||||+|++ T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~ 99 (115) T protein:vir:97 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVY 99 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchhhhcccccccCCCCchhhhH Confidence 111111111 1123344444443321 1122221110 0112 123689999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_020201. 67 MHFQEGLRKHIKRMHN 82 (153) Q Consensus 67 ~~~~~~~~~~~~~~~~ 82 (153) +.++..+.+.++++++ T Consensus 100 ~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 100 EVIRKSTVEELKALFE 115 (115) T ss_pred HHHHHHHHHHHHHHhC Confidence 9999999999988877 No 37 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=97.99 E-value=9e-09 Score=64.69 Aligned_cols=82 Identities=12% Similarity=0.136 Sum_probs=47.7 Q ss_pred CCccccccH-HHHHHHHHHHHHhhCC----EEEEeecCcC----CCCC-----CCHHHHHHHHhcCCCCCCCCchhhHHH Q lcl|NC_020201. 1 MAKKSSTDI-SELKRYFSQLSDLAEK----EVEYGFYDEK----HYSG-----LNMATLAAIHEEGWNNLPERNFMFSTS 66 (153) Q Consensus 1 M~~~i~~~~-~~l~~~~~~l~~l~~~----~v~VGi~~~~----~~dG-----~~va~iA~~~E~G~~~IP~RpFlr~~~ 66 (153) +.-.++.-. +....+.+..+.+... -+.-|.+... ..+| .+.+.+|.+.||||...|+||||+|++ T Consensus 20 ~~~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vE~GT~km~a~Pfl~PA~ 99 (115) T protein:vir:96 20 IDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTGDLQYTITSHAAYSGFLEFGTRYMEAEPFMWPVY 99 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeeecCceEEEeecCccchhhhcccccccCCCCchhhhH Confidence 111111111 1123344444443321 1122221110 0112 123689999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_020201. 67 MHFQEGLRKHIKRMHN 82 (153) Q Consensus 67 ~~~~~~~~~~~~~~~~ 82 (153) +.++..+.+.++++++ T Consensus 100 ~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 100 EVIRKSTVEELKALFE 115 (115) T ss_pred HHHHHHHHHHHHHHhC Confidence 9999999999988877 No 38 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=97.94 E-value=1.9e-08 Score=62.87 Aligned_cols=88 Identities=9% Similarity=0.021 Sum_probs=59.8 Q ss_pred hhHHHH--HHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccC-----CCCCCccHHHHHhcC-----C Q lcl|NC_020201. 62 MFSTSM--HFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGS-----FSNPKVSKDWASYKG-----F 129 (153) Q Consensus 62 lr~~~~--~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~-----~~~p~ns~~Ti~~KG-----~ 129 (153) |...++ ...+.+.+.+.++.. .. ....+|..||..+...+++.|.+.. -+++|++|+|+++|. . T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~~-~~---~~~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~~ 76 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELGT-VT---RDRAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFVP 76 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHHh-hh---ccHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCCC Confidence 432221 122345555544322 11 2347899999999999999998642 245689999998763 3 Q ss_pred CCcchhHHHHHhhcceeeeeCCCC Q lcl|NC_020201. 130 DDAMIHYGDLSSAATYKIVKYQGK 153 (153) Q Consensus 130 ~~PLidTG~L~~Sity~V~~~~gk 153 (153) .+||+|||.|++||+|.+-...-. T Consensus 77 ~~~L~~tg~L~~Si~~~~~~~~v~ 100 (156) T protein:vir:19 77 GSILTLHGDLARSITTDYGQDYAL 100 (156) T ss_pred CcchhhhHHHHHHhhheecCCEEE Confidence 689999999999999987544333 No 39 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=97.93 E-value=1e-08 Score=64.39 Aligned_cols=80 Identities=10% Similarity=-0.007 Sum_probs=50.1 Q ss_pred CCccccccHHH-HHHHHHHHH-------HhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISE-LKRYFSQLS-------DLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEG 72 (153) Q Consensus 1 M~~~i~~~~~~-l~~~~~~l~-------~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~ 72 (153) |...+-.+... -..+.+.+. ......+.||+..+ .+.++.+.||||.+.||+|||++++++.+++ T Consensus 38 ~k~~ap~~~~~~~~hl~d~I~~~~~k~~~~g~~~~~VG~~k~-------~~~y~~f~E~GT~k~~~~pF~~pa~~~~k~~ 110 (125) T protein:vir:97 38 LKANTPVYEVETDERLQEDTVISGFKGANVGIVSKEIGYGKA-------TGWRAHYPNDGTIYQRGQDFKERTINQMTPK 110 (125) T ss_pred HHHhCCcCCCCchhhHHhhhhcccccccccCceEEEEeecCC-------CceeEeeeccCccCCCcCccchHhHHHhHHH Confidence 11111111000 011222221 11223678888533 2678999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHcC Q lcl|NC_020201. 73 LRKHIKRMHNGIIQG 87 (153) Q Consensus 73 ~~~~~~~~~~~~~~G 87 (153) +.+.+.+.++..+.= T Consensus 111 ~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 111 AKQLYAEKVKEGLGL 125 (125) T ss_pred HHHHHHHHHHHHhcC Confidence 988888877655433 No 40 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=97.92 E-value=1.3e-08 Score=63.78 Aligned_cols=82 Identities=17% Similarity=0.182 Sum_probs=46.5 Q ss_pred CCccccccH-----HHHHHHHHHHHHhhCC----EEEEeecCcC----CCCC-----CCHHHHHHHHhcCCCCCCCCchh Q lcl|NC_020201. 1 MAKKSSTDI-----SELKRYFSQLSDLAEK----EVEYGFYDEK----HYSG-----LNMATLAAIHEEGWNNLPERNFM 62 (153) Q Consensus 1 M~~~i~~~~-----~~l~~~~~~l~~l~~~----~v~VGi~~~~----~~dG-----~~va~iA~~~E~G~~~IP~RpFl 62 (153) |+-.+.... +....+.+..+.+... .|.-|-+... ..+| .+.+.+|.+.||||...|+|||| T Consensus 16 ~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~~vEfGT~km~a~PFl 95 (115) T protein:vir:10 16 MHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKKIGDLHYRVISTAHYSGFLEFGTRYMEPAPFM 95 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEeeCCCccchheecccccCCCCCch Confidence 222211110 1112333333333211 1112221110 0111 13367999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 63 FSTSMHFQEGLRKHIKRMHN 82 (153) Q Consensus 63 r~~~~~~~~~~~~~~~~~~~ 82 (153) +|+++.++..+.+.++++++ T Consensus 96 ~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 96 FPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred hhhHHHHHHHHHHHHHHHhC Confidence 99999999999888888777 No 41 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=97.91 E-value=1.7e-08 Score=63.21 Aligned_cols=75 Identities=16% Similarity=0.208 Sum_probs=46.5 Q ss_pred HHHHHHHHHHHHHhhC------------------------CEEEEeecCcC----CCCC-----CCHHHHHHHHhcCCCC Q lcl|NC_020201. 9 ISELKRYFSQLSDLAE------------------------KEVEYGFYDEK----HYSG-----LNMATLAAIHEEGWNN 55 (153) Q Consensus 9 ~~~l~~~~~~l~~l~~------------------------~~v~VGi~~~~----~~dG-----~~va~iA~~~E~G~~~ 55 (153) ..+|++|.+.|+.+.. .-|.-|.+... ..++ .+.+.+|.+.||||.. T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~v~~~~~Ya~~vE~GT~~ 80 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQQRLLHYRVVSPALYSIYLELGTRK 80 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeeecCcEEEEeecCcccchhcccCccc Confidence 2233333333222211 01222222110 0011 2347899999999999 Q ss_pred CCCCchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 56 LPERNFMFSTSMHFQEGLRKHIKRMHNG 83 (153) Q Consensus 56 IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 83 (153) .|+||||+|+++.++..+.+.+++.++. T Consensus 81 m~a~Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 81 MEAQSFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred cCCCcchhhhHHHHHHHHHHHHHHHhcC Confidence 9999999999999999999888888776 No 42 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=97.90 E-value=1.2e-08 Score=64.11 Aligned_cols=79 Identities=9% Similarity=0.023 Sum_probs=47.0 Q ss_pred CCccccc---cHHHHHHHHHHH-----H-HhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHH Q lcl|NC_020201. 1 MAKKSST---DISELKRYFSQL-----S-DLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQE 71 (153) Q Consensus 1 M~~~i~~---~~~~l~~~~~~l-----~-~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~ 71 (153) |+-.+.. +...-..+.+.+ . .-....+.||+..+ .+.++.+.||||.+.||+|||+++++++++ T Consensus 41 ~k~~ap~~~~~~~~~~h~~d~I~~~~~k~~~g~~~~~VG~~k~-------~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~ 113 (128) T protein:vir:38 41 LKSNTPEWDGETDMSGHLRDDIKLSSVRETSGLTEVDVGYGKD-------TGWRAHFPNSGTSMQDPQHFIEETQEIMRP 113 (128) T ss_pred HHHhCCCcCCCCcccchhhhhhccccccccCceeEEEeeecCC-------CceEEeeeccCccCCCCCcchhHHHHHhHH Confidence 1111100 000000111111 1 11223578887433 257899999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHc Q lcl|NC_020201. 72 GLRKHIKRMHNGIIQ 86 (153) Q Consensus 72 ~~~~~~~~~~~~~~~ 86 (153) ++.+.+.+.++..+- T Consensus 114 ~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 114 VVIAAFLSHLKEGGM 128 (128) T ss_pred HHHHHHHHHHHhhcC Confidence 998888876653322 No 43 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=97.88 E-value=2.5e-08 Score=62.29 Aligned_cols=89 Identities=17% Similarity=0.056 Sum_probs=49.1 Q ss_pred CCcccccc--H----HHHHHHHHHHHHhh---------------------CCEEEEeecCcCC--CCCCCHHHHHHHHhc Q lcl|NC_020201. 1 MAKKSSTD--I----SELKRYFSQLSDLA---------------------EKEVEYGFYDEKH--YSGLNMATLAAIHEE 51 (153) Q Consensus 1 M~~~i~~~--~----~~l~~~~~~l~~l~---------------------~~~v~VGi~~~~~--~dG~~va~iA~~~E~ 51 (153) |+...... . ...+.+.+.++.+. ...+.||+..... -++.+.+.++.+.|| T Consensus 19 l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~ 98 (140) T protein:vir:80 19 LAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRTKGKADSPSNAFYWRFDEF 98 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceeeeeeecccccccCCCCCcceeeeecc Confidence 22111000 0 11122222222221 1234445432211 123455789999999 Q ss_pred CCCCCCCCchhhHHHHHHHHHHHHHHHHH----HHHHHcCCC Q lcl|NC_020201. 52 GWNNLPERNFMFSTSMHFQEGLRKHIKRM----HNGIIQGRG 89 (153) Q Consensus 52 G~~~IP~RpFlr~~~~~~~~~~~~~~~~~----~~~~~~G~~ 89 (153) ||.+.||+|||+|+++.+++++.+.+++. +..++.|.- T Consensus 99 GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:80 99 GTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDQALGGRR 140 (140) T ss_pred CCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 99999999999999999988876666554 445555532 No 44 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=97.85 E-value=1.5e-08 Score=63.53 Aligned_cols=81 Identities=14% Similarity=0.003 Sum_probs=44.7 Q ss_pred CCcccc--cc----HHHHHHHHHHHHHhhC----------------------------CEEEEeecCcCCCCCCCHHHHH Q lcl|NC_020201. 1 MAKKSS--TD----ISELKRYFSQLSDLAE----------------------------KEVEYGFYDEKHYSGLNMATLA 46 (153) Q Consensus 1 M~~~i~--~~----~~~l~~~~~~l~~l~~----------------------------~~v~VGi~~~~~~dG~~va~iA 46 (153) |.-++. .. ..+.+-+.+.++.... ..|.||.-. +...++ T Consensus 19 L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~~~~~~v~vg~~~-------~~~~y~ 91 (133) T protein:vir:10 19 LGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQGNAVVTLRVGPSK-------QHHMKV 91 (133) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCccceEEEEecCCC-------CccceE Confidence 111110 00 0111112222222211 112222111 112344 Q ss_pred HHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_020201. 47 AIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGR 88 (153) Q Consensus 47 ~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~ 88 (153) .+.||||.+.||||||+|+++.+++++.+.+.+.+...+..+ T Consensus 92 ~f~E~GT~k~~a~PF~~pA~~~~~~~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 92 LAQEFGTVKQVADPFIRPALDYNVQTVLRVLTVEIRNGIQNR 133 (133) T ss_pred eeeccCCCCCCCCccchHHHHHhHHHHHHHHHHHHHHHhhcC Confidence 566999999999999999999999999988888887767666 No 45 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=97.83 E-value=2e-08 Score=62.83 Aligned_cols=82 Identities=15% Similarity=0.171 Sum_probs=47.4 Q ss_pred CCccccccH-----HHHHHHHHHHHHhhCC----EEEEeecCcC----CCCC-----CCHHHHHHHHhcCCCCCCCCchh Q lcl|NC_020201. 1 MAKKSSTDI-----SELKRYFSQLSDLAEK----EVEYGFYDEK----HYSG-----LNMATLAAIHEEGWNNLPERNFM 62 (153) Q Consensus 1 M~~~i~~~~-----~~l~~~~~~l~~l~~~----~v~VGi~~~~----~~dG-----~~va~iA~~~E~G~~~IP~RpFl 62 (153) |+-.+.... +...++....+.+... -+.-|.+... ..+| .+.+.+|.+.||||...|+|||| T Consensus 16 ~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~~g~~~~~V~~~~~Ya~~vE~GT~~m~a~PFl 95 (115) T protein:vir:99 16 MKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTVDLQYTITSHAAYSGFLEFGTRYMEAEPFM 95 (115) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEecCCccccccccccccccCCCCcc Confidence 332222111 1112333333333211 1122221110 0112 13368999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 63 FSTSMHFQEGLRKHIKRMHN 82 (153) Q Consensus 63 r~~~~~~~~~~~~~~~~~~~ 82 (153) +|+++.++..+.+.++++++ T Consensus 96 ~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 96 WPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred hhhHHHHHHHHHHHHHHHhC Confidence 99999999999999888877 No 46 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=97.83 E-value=3.6e-08 Score=61.39 Aligned_cols=89 Identities=15% Similarity=0.003 Sum_probs=47.0 Q ss_pred CCcccccc--H----HHHHHHHHHHHHh---------------------hCCEEEEeecCc--CCCCCCCHHHHHHHHhc Q lcl|NC_020201. 1 MAKKSSTD--I----SELKRYFSQLSDL---------------------AEKEVEYGFYDE--KHYSGLNMATLAAIHEE 51 (153) Q Consensus 1 M~~~i~~~--~----~~l~~~~~~l~~l---------------------~~~~v~VGi~~~--~~~dG~~va~iA~~~E~ 51 (153) |+.+.... . ++.+.+.+.++.. ....+.+|+... ....+.+.+.++.+.|| T Consensus 19 l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~ 98 (140) T protein:vir:10 19 LAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDSPGIATAGVRVRTKGKADSPNNAFYWRFVEL 98 (140) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceecccccccccceeEEeeccccccccCCCCcccccceecc Confidence 11110000 0 0001111111111 112344554322 12234556889999999 Q ss_pred CCCCCCCCchhhHHHHHHHHHHHHHHHHHH----HHHHcCCC Q lcl|NC_020201. 52 GWNNLPERNFMFSTSMHFQEGLRKHIKRMH----NGIIQGRG 89 (153) Q Consensus 52 G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~----~~~~~G~~ 89 (153) ||.+.||+|||+|+++.+++++.+.+.+.+ +.++.|.= T Consensus 99 GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 99 GTQFMKAEPFMRPAFDASIAQAEGAIRTEIARAIDQVVGGGL 140 (140) T ss_pred CcCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 999999999999999999888766666544 34443322 No 47 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=97.83 E-value=1.4e-08 Score=63.65 Aligned_cols=73 Identities=12% Similarity=0.032 Sum_probs=48.2 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRM 80 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~ 80 (153) |+-.|+.... +.-..-....|.||+..+. +.+|-+.||||.+.||+||++++++++++++.+.+.+. T Consensus 53 l~d~I~vs~~------k~~~~~g~~~v~VG~~k~~-------~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~ 119 (125) T protein:vir:94 53 ARDHIAVSNV------KTDRHTSEKIVTIGYAKGV-------SHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDT 119 (125) T ss_pred hhhheeeccc------ccccccceEEEEeccCCCC-------ceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHH Confidence 3322221100 0000011234666653321 36777899999999999999999999999999999988 Q ss_pred HHHHHc Q lcl|NC_020201. 81 HNGIIQ 86 (153) Q Consensus 81 ~~~~~~ 86 (153) ++.+.. T Consensus 120 lrk~~k 125 (125) T protein:vir:94 120 AKRLQK 125 (125) T ss_pred HHHHhC Confidence 888776 No 48 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=97.83 E-value=1.4e-08 Score=63.65 Aligned_cols=73 Identities=12% Similarity=0.032 Sum_probs=48.2 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRM 80 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~ 80 (153) |+-.|+.... +.-..-....|.||+..+. +.+|-+.||||.+.||+||++++++++++++.+.+.+. T Consensus 53 l~d~I~vs~~------k~~~~~g~~~v~VG~~k~~-------~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~ 119 (125) T protein:vir:98 53 ARDHIAVSNV------KTDRHTSEKIVTIGYAKGV-------SHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDT 119 (125) T ss_pred hhhheeeccc------ccccccceEEEEeccCCCC-------ceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHH Confidence 3322221100 0000011234666653321 36777899999999999999999999999999999988 Q ss_pred HHHHHc Q lcl|NC_020201. 81 HNGIIQ 86 (153) Q Consensus 81 ~~~~~~ 86 (153) ++.+.. T Consensus 120 lrk~~k 125 (125) T protein:vir:98 120 AKRLQK 125 (125) T ss_pred HHHHhC Confidence 888776 No 49 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=97.83 E-value=1.4e-08 Score=63.65 Aligned_cols=73 Identities=12% Similarity=0.032 Sum_probs=48.2 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRM 80 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~ 80 (153) |+-.|+.... +.-..-....|.||+..+. +.+|-+.||||.+.||+||++++++++++++.+.+.+. T Consensus 53 l~d~I~vs~~------k~~~~~g~~~v~VG~~k~~-------~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~ 119 (125) T protein:vir:81 53 ARDHIAVSNV------KTDRHTSEKIVTIGYAKGV-------SHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDT 119 (125) T ss_pred hhhheeeccc------ccccccceEEEEeccCCCC-------ceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHH Confidence 3322221100 0000011234666653321 36777899999999999999999999999999999988 Q ss_pred HHHHHc Q lcl|NC_020201. 81 HNGIIQ 86 (153) Q Consensus 81 ~~~~~~ 86 (153) ++.+.. T Consensus 120 lrk~~k 125 (125) T protein:vir:81 120 AKRLQK 125 (125) T ss_pred HHHHhC Confidence 888776 No 50 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=97.83 E-value=1.4e-08 Score=63.65 Aligned_cols=73 Identities=12% Similarity=0.032 Sum_probs=48.2 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRM 80 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~ 80 (153) |+-.|+.... +.-..-....|.||+..+. +.+|-+.||||.+.||+||++++++++++++.+.+.+. T Consensus 53 l~d~I~vs~~------k~~~~~g~~~v~VG~~k~~-------~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~ 119 (125) T protein:vir:47 53 ARDHIAVSNV------KTDRHTSEKIVTIGYAKGV-------SHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDT 119 (125) T ss_pred hhhheeeccc------ccccccceEEEEeccCCCC-------ceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHH Confidence 3322221100 0000011234666653321 36777899999999999999999999999999999988 Q ss_pred HHHHHc Q lcl|NC_020201. 81 HNGIIQ 86 (153) Q Consensus 81 ~~~~~~ 86 (153) ++.+.. T Consensus 120 lrk~~k 125 (125) T protein:vir:47 120 AKRLQK 125 (125) T ss_pred HHHHhC Confidence 888776 No 51 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=97.83 E-value=1.4e-08 Score=63.65 Aligned_cols=73 Identities=12% Similarity=0.032 Sum_probs=48.2 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRM 80 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~ 80 (153) |+-.|+.... +.-..-....|.||+..+. +.+|-+.||||.+.||+||++++++++++++.+.+.+. T Consensus 53 l~d~I~vs~~------k~~~~~g~~~v~VG~~k~~-------~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~ 119 (125) T protein:vir:79 53 ARDHIAVSNV------KTDRHTSEKIVTIGYAKGV-------SHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDT 119 (125) T ss_pred hhhheeeccc------ccccccceEEEEeccCCCC-------ceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHH Confidence 3322221100 0000011234666653321 36777899999999999999999999999999999988 Q ss_pred HHHHHc Q lcl|NC_020201. 81 HNGIIQ 86 (153) Q Consensus 81 ~~~~~~ 86 (153) ++.+.. T Consensus 120 lrk~~k 125 (125) T protein:vir:79 120 AKRLQK 125 (125) T ss_pred HHHHhC Confidence 888776 No 52 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=97.80 E-value=2e-08 Score=62.85 Aligned_cols=82 Identities=10% Similarity=0.052 Sum_probs=46.1 Q ss_pred CCccccccHHHH----HHHHHHHHHhhCC--EEEEeecCc-----CCCCC---CCHHHHHHHHhcCCCCCCCCchhhHHH Q lcl|NC_020201. 1 MAKKSSTDISEL----KRYFSQLSDLAEK--EVEYGFYDE-----KHYSG---LNMATLAAIHEEGWNNLPERNFMFSTS 66 (153) Q Consensus 1 M~~~i~~~~~~l----~~~~~~l~~l~~~--~v~VGi~~~-----~~~dG---~~va~iA~~~E~G~~~IP~RpFlr~~~ 66 (153) ++..-.. .+-+ ..+.+.+...... -+.-|-+.. ...+| .+.+.+|.++||||...||||||||++ T Consensus 19 ~~~~~~v-~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~Ya~~vEfGT~km~a~Pfl~PA~ 97 (114) T protein:vir:27 19 NASPEKR-SKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDKATVEALTSYSGYLEVGTRKMEAQPFMKPAL 97 (114) T ss_pred hcCHHHH-HHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCeeEecCCCCccceecccccccCCCCchhhhH Confidence 3221111 1112 2222222222110 111121111 01122 234689999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHH Q lcl|NC_020201. 67 MHFQEGLRKHIKRMHNG 83 (153) Q Consensus 67 ~~~~~~~~~~~~~~~~~ 83 (153) +.++..+.+.++++++- T Consensus 98 ~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 98 DEVAPKMVEELAKWDET 114 (114) T ss_pred HHHHHHHHHHHHHHhcC Confidence 99999998888887764 No 53 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=97.80 E-value=2e-08 Score=62.85 Aligned_cols=82 Identities=10% Similarity=0.052 Sum_probs=46.1 Q ss_pred CCccccccHHHH----HHHHHHHHHhhCC--EEEEeecCc-----CCCCC---CCHHHHHHHHhcCCCCCCCCchhhHHH Q lcl|NC_020201. 1 MAKKSSTDISEL----KRYFSQLSDLAEK--EVEYGFYDE-----KHYSG---LNMATLAAIHEEGWNNLPERNFMFSTS 66 (153) Q Consensus 1 M~~~i~~~~~~l----~~~~~~l~~l~~~--~v~VGi~~~-----~~~dG---~~va~iA~~~E~G~~~IP~RpFlr~~~ 66 (153) ++..-.. .+-+ ..+.+.+...... -+.-|-+.. ...+| .+.+.+|.++||||...||||||||++ T Consensus 19 ~~~~~~v-~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~Ya~~vEfGT~km~a~Pfl~PA~ 97 (114) T protein:vir:49 19 NASPEKR-SKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDKATVEALTSYSGYLEVGTRKMEAQPFMKPAL 97 (114) T ss_pred hcCHHHH-HHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCeeEecCCCCccceecccccccCCCCchhhhH Confidence 3221111 1112 2222222222110 111121111 01122 234689999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHH Q lcl|NC_020201. 67 MHFQEGLRKHIKRMHNG 83 (153) Q Consensus 67 ~~~~~~~~~~~~~~~~~ 83 (153) +.++..+.+.++++++- T Consensus 98 ~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 98 DEVAPKMVEELAKWDET 114 (114) T ss_pred HHHHHHHHHHHHHHhcC Confidence 99999998888887764 No 54 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=97.80 E-value=3.4e-08 Score=61.51 Aligned_cols=89 Identities=18% Similarity=0.081 Sum_probs=48.1 Q ss_pred CCcccccc--HH----HHHHHHHHHHHh---------------------hCCEEEEeecCcC--CCCCCCHHHHHHHHhc Q lcl|NC_020201. 1 MAKKSSTD--IS----ELKRYFSQLSDL---------------------AEKEVEYGFYDEK--HYSGLNMATLAAIHEE 51 (153) Q Consensus 1 M~~~i~~~--~~----~l~~~~~~l~~l---------------------~~~~v~VGi~~~~--~~dG~~va~iA~~~E~ 51 (153) |...+... .+ ..+.+.+.++.+ ....+.||+.... .-++.+.+.++.+.|| T Consensus 19 L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~y~~f~E~ 98 (140) T protein:vir:10 19 LAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRTKGKADSPNNAFYWRFDEF 98 (140) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccccccceEEeeeeeccccccCCCCccceeeeecc Confidence 21111000 00 001111111111 1234555553221 1123345789999999 Q ss_pred CCCCCCCCchhhHHHHHHHHHHHHHHHH----HHHHHHcCCC Q lcl|NC_020201. 52 GWNNLPERNFMFSTSMHFQEGLRKHIKR----MHNGIIQGRG 89 (153) Q Consensus 52 G~~~IP~RpFlr~~~~~~~~~~~~~~~~----~~~~~~~G~~ 89 (153) ||.+.||+|||+|+++.+++++.+.+++ .++.++.|.- T Consensus 99 GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 99 GTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred CCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 9999999999999999998877665554 4455666633 No 55 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=97.76 E-value=5.5e-08 Score=60.36 Aligned_cols=89 Identities=17% Similarity=0.074 Sum_probs=48.6 Q ss_pred CCcccccc--HH----HHHHHHHHHHHhh---------------------CCEEEEeecCcC--CCCCCCHHHHHHHHhc Q lcl|NC_020201. 1 MAKKSSTD--IS----ELKRYFSQLSDLA---------------------EKEVEYGFYDEK--HYSGLNMATLAAIHEE 51 (153) Q Consensus 1 M~~~i~~~--~~----~l~~~~~~l~~l~---------------------~~~v~VGi~~~~--~~dG~~va~iA~~~E~ 51 (153) |+.....+ .+ ..+.+.+.++... ...+.||+.... ..++-+.+.++.+.|| T Consensus 19 l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~~~vg~~~~~~~~~~~~~~~~y~~f~E~ 98 (140) T protein:vir:14 19 LAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAGVRVRTKGKADSPNNAFYWRFDEF 98 (140) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccceeEEeeeeeccccccCCCCccceeeeecc Confidence 21111000 00 0111111222211 124556654321 2233445789999999 Q ss_pred CCCCCCCCchhhHHHHHHHHHHHHHHHH----HHHHHHcCCC Q lcl|NC_020201. 52 GWNNLPERNFMFSTSMHFQEGLRKHIKR----MHNGIIQGRG 89 (153) Q Consensus 52 G~~~IP~RpFlr~~~~~~~~~~~~~~~~----~~~~~~~G~~ 89 (153) ||.+.||||||+|+++.+++++.+.+.+ .++.++.|.- T Consensus 99 GT~~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:14 99 GTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred ccCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 9999999999999999998876655554 4455666633 No 56 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=97.68 E-value=3.6e-08 Score=61.39 Aligned_cols=80 Identities=11% Similarity=0.096 Sum_probs=44.8 Q ss_pred CCccccccHHHHHHHHHH----HHHhhCC--EEEEeecCcC---CCCCC-----CHHHHHHHHhcCCCCCCCCchhhHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQ----LSDLAEK--EVEYGFYDEK---HYSGL-----NMATLAAIHEEGWNNLPERNFMFSTS 66 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~----l~~l~~~--~v~VGi~~~~---~~dG~-----~va~iA~~~E~G~~~IP~RpFlr~~~ 66 (153) ++... .-.+-+++.... +...... -|.-|-+... ..+|. +.+.+|.+.||||...|+||||+|++ T Consensus 19 ~~~~~-~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~~~~~~~~v~~~~~Ya~~vE~GTr~m~AqPF~~PA~ 97 (112) T protein:vir:96 19 NASSE-RRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLEAGSDRAVVEALTNYSGYLEVGTRKMEAQPFMRPAL 97 (112) T ss_pred hcCHH-HHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeeecCceEEEecCCCCccceeccCccccCCCCchhhhH Confidence 33211 111222222222 2221111 1222222110 11222 33689999999999999999999999 Q ss_pred HHHHHHHHHHHHHHH Q lcl|NC_020201. 67 MHFQEGLRKHIKRMH 81 (153) Q Consensus 67 ~~~~~~~~~~~~~~~ 81 (153) +.++..+.+.++++- T Consensus 98 ~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 98 DQVVPEMVEEMAKWE 112 (112) T ss_pred HHHHHHHHHHHHhcC Confidence 999999888888765 No 57 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=97.63 E-value=7.6e-08 Score=59.62 Aligned_cols=85 Identities=8% Similarity=-0.033 Sum_probs=40.5 Q ss_pred CCcccc--ccHH----HHHHHHHHHHHhh-------------------------CCEEEEeecCcCCCCCCCHHHHHHHH Q lcl|NC_020201. 1 MAKKSS--TDIS----ELKRYFSQLSDLA-------------------------EKEVEYGFYDEKHYSGLNMATLAAIH 49 (153) Q Consensus 1 M~~~i~--~~~~----~l~~~~~~l~~l~-------------------------~~~v~VGi~~~~~~dG~~va~iA~~~ 49 (153) |...+. .... +.+-+.+.++... ...|.|++-..+ +...++-+. T Consensus 20 L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~~~~~~~v~v~vg~~~-----~~~~~~~f~ 94 (135) T protein:vir:57 20 VGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRGKAGSTVVVLRVGPTR-----SHYMKALAQ 94 (135) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhcccccccccccceeEEEEecCCC-----CcceeEeec Confidence 110000 0000 0011111111111 112333331111 112334455 Q ss_pred hcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHH Q lcl|NC_020201. 50 EEGWNNLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGK 99 (153) Q Consensus 50 E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~ 99 (153) ||||.+.||||||+++++.+++++.+.+.+.+. ..|++++. T Consensus 95 E~GT~~~~a~PF~~pa~~~~~~~~~~~~~~~~~---------~~l~ka~r 135 (135) T protein:vir:57 95 EFGTIKQVAKPFIRPALDYNKMQVLRILTVEIR---------DGLSTLSR 135 (135) T ss_pred ccCCCCCCCCcchhHhHHHhHHHHHHHHHHHHH---------HHHHHhcC Confidence 999999999999999999999988777776554 23333333 No 58 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=97.62 E-value=1.2e-07 Score=58.53 Aligned_cols=89 Identities=10% Similarity=0.012 Sum_probs=59.2 Q ss_pred hhHHH--HHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhcc-CCCCCCccHHHHHh------------ Q lcl|NC_020201. 62 MFSTS--MHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTG-SFSNPKVSKDWASY------------ 126 (153) Q Consensus 62 lr~~~--~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~-~~~~p~ns~~Ti~~------------ 126 (153) |-..+ .-..+++.+.+.++... +.+...+|..||..++..+++.|.+. .-...|.+|.|+++ T Consensus 1 Ms~~i~i~~~~~~l~~~L~~l~~~---~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~ 77 (175) T protein:vir:10 1 MSDFVNFQIDDSALRTRLLQLEQA---GHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNG 77 (175) T ss_pred CceeEEEEecHHHHHHHHHHHHHH---hccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhh Confidence 33221 11123455555544432 24677899999999999999999754 11345789998753 Q ss_pred ---------cCCCCcchhHHHHHhhcceeeeeCCCC Q lcl|NC_020201. 127 ---------KGFDDAMIHYGDLSSAATYKIVKYQGK 153 (153) Q Consensus 127 ---------KG~~~PLidTG~L~~Sity~V~~~~gk 153 (153) ++..++|+|||.|++||+|.+.+..=- T Consensus 78 ~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~ 113 (175) T protein:vir:10 78 ELTAAASRRKAGLMILQDSGQMAASVSTDHDDNSAV 113 (175) T ss_pred hhhhhhhhhccCCCcceechhhhhhhheeecCCEEE Confidence 346789999999999999987433222 No 59 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=97.59 E-value=1e-07 Score=58.89 Aligned_cols=80 Identities=16% Similarity=0.188 Sum_probs=48.5 Q ss_pred CCccccccHHHHHHHHHHHHHhhCC------------------------EEEEeecCcC-----CCCC-----CCHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEK------------------------EVEYGFYDEK-----HYSG-----LNMATLA 46 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~------------------------~v~VGi~~~~-----~~dG-----~~va~iA 46 (153) |+++|. .++++++.+.|+.+.+. -|.-|-+... ..+| .+.+.+| T Consensus 4 ms~~i~--~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~~~g~~~~V~~~~~YA 81 (144) T protein:vir:59 4 MSVRID--PSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYKNNGLTAEITVGAEYA 81 (144) T ss_pred ceeeeh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEeecCcEEEEEecCCCcc Confidence 777664 33333433322222110 1222321110 1123 2347899 Q ss_pred HHHhcCC---------------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 47 AIHEEGW---------------------------NNLPERNFMFSTSMHFQEGLRKHIKRMHN 82 (153) Q Consensus 47 ~~~E~G~---------------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~ 82 (153) .+.|||| .++||||||+++++.+++.+.+.+++++- T Consensus 82 ~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~~~~~i~~~~g 144 (144) T protein:vir:59 82 IYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGGEYFEREMRRLRG 144 (144) T ss_pred chhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHHHHHHHHHHHHhcC Confidence 9999997 25899999999999999999998888655 No 60 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=97.58 E-value=4.8e-08 Score=60.72 Aligned_cols=80 Identities=13% Similarity=0.187 Sum_probs=43.5 Q ss_pred CCccccccHHHHHHHHH----HHHHhhCC--EEEEeecCcC-----CCCC-----CCHHHHHHHHhcCCCCCCCCchhhH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFS----QLSDLAEK--EVEYGFYDEK-----HYSG-----LNMATLAAIHEEGWNNLPERNFMFS 64 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~----~l~~l~~~--~v~VGi~~~~-----~~dG-----~~va~iA~~~E~G~~~IP~RpFlr~ 64 (153) +.... ....+++.++ .+.+..+. -|.-|-+... ..+| .+.+.+|.+.||||...|+||||+| T Consensus 13 l~~~~--~~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~GT~~m~aqPFl~p 90 (108) T protein:vir:98 13 LRKNA--TLNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFTDGGLTGTTIPHTDYAGYVEYGTRFQAAQPFVKP 90 (108) T ss_pred HHHhh--hHHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeeecCceEEEeecCCCccceeeccccccCCCcchhh Confidence 11000 0111111111 11111111 1222211110 0122 1236789999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 65 TSMHFQEGLRKHIKRMHN 82 (153) Q Consensus 65 ~~~~~~~~~~~~~~~~~~ 82 (153) +++..+..+.+.++++++ T Consensus 91 a~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 91 AFDVQKKIFTNDLERLTK 108 (108) T ss_pred HHHHHHHHHHHHHHHHcC Confidence 999999999888888777 No 61 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=97.57 E-value=7.8e-08 Score=59.55 Aligned_cols=79 Identities=15% Similarity=0.114 Sum_probs=44.7 Q ss_pred CCccccccHHHHHH----HHHHHHHhhCCEEEEeecCcC-----CCCC-----CCHHHHHHHHhcCCCCCCCCchhhHHH Q lcl|NC_020201. 1 MAKKSSTDISELKR----YFSQLSDLAEKEVEYGFYDEK-----HYSG-----LNMATLAAIHEEGWNNLPERNFMFSTS 66 (153) Q Consensus 1 M~~~i~~~~~~l~~----~~~~l~~l~~~~v~VGi~~~~-----~~dG-----~~va~iA~~~E~G~~~IP~RpFlr~~~ 66 (153) |...-.. .+.+++ +.+..+.+.. |.-|.+... ..+| .+.+.+|.+.||||...|+||||+|++ T Consensus 16 ~~~~~~~-~~al~~~a~~i~~~ak~~aP--v~TG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~GT~km~aqpf~~pa~ 92 (108) T protein:vir:74 16 NATLDDV-KHVVKSNTASMNKNMQNLAP--VDTGNMKRSITSEFTDGGLSGTTGPHTDYAGYVEYGTRFQSAQPFVKPAF 92 (108) T ss_pred hhhHHHH-HHHHHHHHHHHHHHHHHhCC--CCchhhhccceeeeecCceEEEeecCCCcccceeccccccCCCcchhhHH Confidence 3211100 011111 1112222221 222221110 1122 133678999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_020201. 67 MHFQEGLRKHIKRMHN 82 (153) Q Consensus 67 ~~~~~~~~~~~~~~~~ 82 (153) +.++.++.+.++++++ T Consensus 93 ~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 93 NIQKKVFTNDLERLTK 108 (108) T ss_pred HHHHHHHHHHHHHHcC Confidence 9999999888888777 No 62 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=97.53 E-value=8.8e-08 Score=59.27 Aligned_cols=78 Identities=17% Similarity=0.162 Sum_probs=45.0 Q ss_pred CC---------ccccccHHHHHHHHHHHHHh-----hCCEEEEeecCcCCCCCCCHHHHHHHHhcCCC--CCCCCchhhH Q lcl|NC_020201. 1 MA---------KKSSTDISELKRYFSQLSDL-----AEKEVEYGFYDEKHYSGLNMATLAAIHEEGWN--NLPERNFMFS 64 (153) Q Consensus 1 M~---------~~i~~~~~~l~~~~~~l~~l-----~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~--~IP~RpFlr~ 64 (153) .+ .++-.+.- +|...|..- ++..+.||- +..||++|+||+. +||+||||-. T Consensus 52 Ls~st~a~k~~~~~L~~tG---~L~~Si~~~~~~~~~~~~a~vGt----------n~~YA~~hqfG~~~~~IPaRPfLG~ 118 (145) T protein:vir:31 52 LKESTIRAKGSDTPLIDNS---RLLTDINAASMMDRANRMAVIGT----------NLDYAEHHEFGAPEAGIPARPIFGP 118 (145) T ss_pred cChHHHHHhcCCCCCccCH---HHHHHHHHHhhhcccCceeEecC----------CchhhhhhccCCcccccCCCCccCC Confidence 11 11111111 233333321 233455553 2479999999985 6999999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCHH Q lcl|NC_020201. 65 TSMHFQEGLRKHIKRMHNGIIQGRGFS 91 (153) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~G~~~~ 91 (153) +....++++.+.+...+..-+.|--++ T Consensus 119 ~~~~~~~~~~~ii~~~i~~~L~~~~~~ 145 (145) T protein:vir:31 119 AGAYASQQAPDVIGDEIDTNLEGAVID 145 (145) T ss_pred CccchHHHHHHHHHHHHHHHhhhhccC Confidence 766666677777776666656664333 No 63 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=97.35 E-value=9.7e-07 Score=53.55 Aligned_cols=87 Identities=24% Similarity=0.211 Sum_probs=44.5 Q ss_pred CCc--cccccHHHHHHHHHHHHHhhC-CE---EEEe----------ecCcC----------------------------- Q lcl|NC_020201. 1 MAK--KSSTDISELKRYFSQLSDLAE-KE---VEYG----------FYDEK----------------------------- 35 (153) Q Consensus 1 M~~--~i~~~~~~l~~~~~~l~~l~~-~~---v~VG----------i~~~~----------------------------- 35 (153) |+. .|+.|...+.+.+++|..... .. -.|| |..+. T Consensus 1 Ms~~i~i~~d~~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~~~~ 80 (175) T protein:vir:79 1 MSDFVNFQIDDSALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNGELT 80 (175) T ss_pred CceEEEEEechHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccccch Confidence 874 566665554444444332211 10 0111 00000 Q ss_pred ---------------------------CCC----CCCHHHHHHHHhcCCC-------CCCCCchhhHHHHHH-----HHH Q lcl|NC_020201. 36 ---------------------------HYS----GLNMATLAAIHEEGWN-------NLPERNFMFSTSMHF-----QEG 72 (153) Q Consensus 36 ---------------------------~~d----G~~va~iA~~~E~G~~-------~IP~RpFlr~~~~~~-----~~~ 72 (153) ..+ |++ ..||++|+||.. +||+||||--+-++. .+. T Consensus 81 ~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGtn-~~YAaiHqfGg~~~~~~~v~IPARPfLG~s~~de~~~~~~~~ 159 (175) T protein:vir:79 81 AAASRRKAGLMILQDSGQMAASTATDSGEDYSVIGSN-KEYAAIQHFGGQAGRGLKVTIPGRAWLPVTADGELQPEAVEP 159 (175) T ss_pred hhHhhhccCCCcceechhhhhhhhheecCCEEEEecC-cchhhHhhcccccCCCcccccCcccccCCCcccchhHHHHHH Confidence 011 222 468999999974 799999995433221 344 Q ss_pred HHHHHHHHHHHHHcCC Q lcl|NC_020201. 73 LRKHIKRMHNGIIQGR 88 (153) Q Consensus 73 ~~~~~~~~~~~~~~G~ 88 (153) +.+.+...+..++.++ T Consensus 160 I~~~i~~~l~~a~~~~ 175 (175) T protein:vir:79 160 VLNTILRHLMDAANRR 175 (175) T ss_pred HHHHHHHHHHHHhccC Confidence 5555555566566666 No 64 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=97.28 E-value=1.3e-06 Score=52.79 Aligned_cols=73 Identities=15% Similarity=0.127 Sum_probs=42.3 Q ss_pred CCccccccHHHHHHHHHHHHH-hhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCC--------CCCCCchhhHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSD-LAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWN--------NLPERNFMFSTSMHFQE 71 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~-l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~--------~IP~RpFlr~~~~~~~~ 71 (153) ...++-.+.- +|...|.. .+...|.||.. ..||++|+||.. +||+||||--+ ++..+ T Consensus 75 ~~~~~L~~tg---~L~~Si~~~~~~~~v~vGt~----------~~yA~vHqfG~~~~~~~~~~~iPaRpfLG~s-~~d~~ 140 (156) T protein:vir:19 75 VPGSILTLHG---DLARSITTDYGQDYALIGSP----------KIYAAIHQWGGTPDMAPRPAGVPARPYMGLD-KTGEQ 140 (156) T ss_pred CCCcchhhhH---HHHHHhhheecCCEEEEecc----------hhhhHHhhcCcccccCCCccccCCccccCCC-HHHHH Confidence 1112222211 23333332 24567888752 578999999963 69999999533 34456 Q ss_pred HHHHHHHHHHHHHHcC Q lcl|NC_020201. 72 GLRKHIKRMHNGIIQG 87 (153) Q Consensus 72 ~~~~~~~~~~~~~~~G 87 (153) ++.+.+...+..++.- T Consensus 141 ~I~~~i~~~l~~~~~~ 156 (156) T protein:vir:19 141 EIFDAIRKRVSAALRQ 156 (156) T ss_pred HHHHHHHHHHHHHhhC Confidence 6666666666655443 No 65 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=97.18 E-value=3.3e-06 Score=50.66 Aligned_cols=86 Identities=9% Similarity=-0.028 Sum_probs=61.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCC----CCCCccHHHHHhcCC--CCcchhHHHH Q lcl|NC_020201. 66 SMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSF----SNPKVSKDWASYKGF--DDAMIHYGDL 139 (153) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~----~~p~ns~~Ti~~KG~--~~PLidTG~L 139 (153) +++. .++.+.+..++..+ ........|..||..+....++.|.+..- +++|+++.|+++|+. .+||+++|.| T Consensus 1 m~d~-~~l~~~L~~ll~~L-~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~g~l 78 (149) T protein:vir:98 1 MSEL-TALQERLTGLIASL-SPAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGRIRREMFARLRT 78 (149) T ss_pred CchH-HHHHHHHHHHHHhc-CchhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCCCCcccchhhhh Confidence 3332 23444444444433 22345678999999999999999987411 357899999998763 5899999999 Q ss_pred HhhcceeeeeCCCC Q lcl|NC_020201. 140 SSAATYKIVKYQGK 153 (153) Q Consensus 140 ~~Sity~V~~~~gk 153 (153) .+||++..-...-. T Consensus 79 ~~sl~~~~~~~~~~ 92 (149) T protein:vir:98 79 NRFMKAKGSDSAAV 92 (149) T ss_pred hhhhhheecCCeeE Confidence 99999987766533 No 66 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=97.11 E-value=9.1e-07 Score=53.70 Aligned_cols=86 Identities=12% Similarity=0.101 Sum_probs=42.1 Q ss_pred CCccccccH-----HHHHHHHHHHHHhhCCEEEEeecCc-----C--CCC-----CCCHHHHHHHHhcCCC--------- Q lcl|NC_020201. 1 MAKKSSTDI-----SELKRYFSQLSDLAEKEVEYGFYDE-----K--HYS-----GLNMATLAAIHEEGWN--------- 54 (153) Q Consensus 1 M~~~i~~~~-----~~l~~~~~~l~~l~~~~v~VGi~~~-----~--~~d-----G~~va~iA~~~E~G~~--------- 54 (153) |...+.... +..+.+.+..+.+.. |.-|-+.. . ..+ ..+.+.+|.+.||||. T Consensus 16 l~~~~~~~~~~a~~~~a~~i~~~ak~~aP--v~TG~Lr~sI~~~~~~~~~~~~~~v~~~~~Ya~fvEfGT~~m~a~P~~~ 93 (173) T protein:vir:10 16 IGKDIDKNINATTEEAANFIEDRAKTLAP--KNFGKLAQSISTSDLKAKDLISKKITVNELYGAYMEFGTGAKVSVPKEF 93 (173) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCchhhhhcceeeeeccCceeEEeeCCCcccchhhhcccccccCCCchh Confidence 221111000 001111112222211 11111100 0 001 1234678888999963 Q ss_pred ----------------------------------------------CCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_020201. 55 ----------------------------------------------NLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGR 88 (153) Q Consensus 55 ----------------------------------------------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~ 88 (153) +.||||||+|+++.+++.+.+.+++.+...+..- T Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqPFl~PA~~~~~~~~~~~i~~~i~~~lrk~ 173 (173) T protein:vir:10 94 ADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQPFLYPAWIEGKKQYLKDLENLLKTYNKKI 173 (173) T ss_pred hhhhcccccccccccccccccccccccccccchhcccceeeEeecCCCCCCccchhHHHHhHHHHHHHHHHHHHHHhhcC Confidence 3899999999999999988887777665433322 No 67 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=97.06 E-value=1.7e-06 Score=52.15 Aligned_cols=75 Identities=16% Similarity=0.195 Sum_probs=39.1 Q ss_pred CCccccccHHHHHHHHHH--HH-HhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCC-----------CCCCCCchhhHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQ--LS-DLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGW-----------NNLPERNFMFSTS 66 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~--l~-~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~-----------~~IP~RpFlr~~~ 66 (153) -+..++ +...+.+|... |. ......+.|||.. ++..||++|.||- .+||+||||--+- T Consensus 64 ~k~~~~-~~~m~~~L~~a~~l~~~a~~~~~~Vg~~G-------t~~~yAaiHQfG~~~r~~~~~~~~v~iPaRp~LG~s~ 135 (152) T protein:vir:10 64 VKSKIK-SGKMFDKITQPRFMRLRLESEGVSLGYEG-------GDAVIARIHQQGLIGRVRKDWDLKVKYASRELLGFTD 135 (152) T ss_pred hccccc-chhHHHhhhhcceeeeeecCcEEEEEecC-------CchhhhhhhccCccccccCCCCcceeccccccCCCCH Confidence 111111 11223333221 11 1345678899864 2479999999993 3699999995542 Q ss_pred HHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 67 MHFQEGLRKHIKRMHNGI 84 (153) Q Consensus 67 ~~~~~~~~~~~~~~~~~~ 84 (153) + ...++.+.+.+.+... T Consensus 136 ~-d~~~I~~~i~~~l~~a 152 (152) T protein:vir:10 136 D-DLQMIEDYMINILAGS 152 (152) T ss_pred H-HHHHHHHHHHHHHhcC Confidence 2 2333444444333322 No 68 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=97.03 E-value=6.7e-07 Score=54.43 Aligned_cols=74 Identities=18% Similarity=0.079 Sum_probs=45.8 Q ss_pred CCccccccHHHHHHHHHHHHHhhCC------------------------EEEEeecCcC-----CCCC-----CCHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEK------------------------EVEYGFYDEK-----HYSG-----LNMATLA 46 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~------------------------~v~VGi~~~~-----~~dG-----~~va~iA 46 (153) |+..++. +++|.+.|+.+... .|.-|-+... ..+| .+.+.+| T Consensus 1 Ma~~~~g----~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA 76 (137) T protein:vir:97 1 MAKVKYG----NWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYA 76 (137) T ss_pred CchhHHh----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCcc Confidence 9888752 33333333322111 1222322111 1122 2347899 Q ss_pred HHHhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_020201. 47 AIHEEGW-----------------------------NNLPERNFMFSTSMHFQEGLRKHIK 78 (153) Q Consensus 47 ~~~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 78 (153) .+.|||| ...|+||||+++++.++..+.+.+. T Consensus 77 ~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:97 77 IYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9999998 2589999999999999999988877 No 69 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=97.03 E-value=6.7e-07 Score=54.43 Aligned_cols=74 Identities=18% Similarity=0.079 Sum_probs=45.8 Q ss_pred CCccccccHHHHHHHHHHHHHhhCC------------------------EEEEeecCcC-----CCCC-----CCHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEK------------------------EVEYGFYDEK-----HYSG-----LNMATLA 46 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~------------------------~v~VGi~~~~-----~~dG-----~~va~iA 46 (153) |+..++. +++|.+.|+.+... .|.-|-+... ..+| .+.+.+| T Consensus 1 Ma~~~~g----~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA 76 (137) T protein:vir:93 1 MAKVKYG----NWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYA 76 (137) T ss_pred CchhHHh----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCcc Confidence 9888752 33333333322111 1222322111 1122 2347899 Q ss_pred HHHhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_020201. 47 AIHEEGW-----------------------------NNLPERNFMFSTSMHFQEGLRKHIK 78 (153) Q Consensus 47 ~~~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 78 (153) .+.|||| ...|+||||+++++.++..+.+.+. T Consensus 77 ~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:93 77 IYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9999998 2589999999999999999988877 No 70 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=97.03 E-value=6.7e-07 Score=54.43 Aligned_cols=74 Identities=18% Similarity=0.079 Sum_probs=45.8 Q ss_pred CCccccccHHHHHHHHHHHHHhhCC------------------------EEEEeecCcC-----CCCC-----CCHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEK------------------------EVEYGFYDEK-----HYSG-----LNMATLA 46 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~------------------------~v~VGi~~~~-----~~dG-----~~va~iA 46 (153) |+..++. +++|.+.|+.+... .|.-|-+... ..+| .+.+.+| T Consensus 1 Ma~~~~g----~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA 76 (137) T protein:vir:94 1 MAKVKYG----NWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYA 76 (137) T ss_pred CchhHHh----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCcc Confidence 9888752 33333333322111 1222322111 1122 2347899 Q ss_pred HHHhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_020201. 47 AIHEEGW-----------------------------NNLPERNFMFSTSMHFQEGLRKHIK 78 (153) Q Consensus 47 ~~~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 78 (153) .+.|||| ...|+||||+++++.++..+.+.+. T Consensus 77 ~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 77 IYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9999998 2589999999999999999988877 No 71 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=97.00 E-value=7.8e-07 Score=54.07 Aligned_cols=74 Identities=18% Similarity=0.113 Sum_probs=45.0 Q ss_pred CCccccccHHHHHHHHHHHHHhhCC------------------------EEEEeecCcC-----CCCC-----CCHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEK------------------------EVEYGFYDEK-----HYSG-----LNMATLA 46 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~------------------------~v~VGi~~~~-----~~dG-----~~va~iA 46 (153) |+..+. ++++|.+.|+.+... .|.-|-+... ..+| .+.+.+| T Consensus 1 Ma~~~~----Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~Ya 76 (137) T protein:vir:10 1 MAKVKY----GNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEYA 76 (137) T ss_pred CchhHh----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeCCcEEEEEecCCCcc Confidence 888753 334444444433221 1222221110 1122 1336799 Q ss_pred HHHhcCCC-----------------------------CCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_020201. 47 AIHEEGWN-----------------------------NLPERNFMFSTSMHFQEGLRKHIK 78 (153) Q Consensus 47 ~~~E~G~~-----------------------------~IP~RpFlr~~~~~~~~~~~~~~~ 78 (153) .+.||||. ++|+||||+++++++++.+.+.+. T Consensus 77 ~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 77 VYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred cccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHHHHhcC Confidence 99999962 479999999999999999888776 No 72 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=96.93 E-value=4.1e-06 Score=50.10 Aligned_cols=75 Identities=20% Similarity=0.207 Sum_probs=40.5 Q ss_pred CCccccccHHHHHHHHHHHHH-hhCCEEEEeecCcCCCCCCCHHHHHHHHhcCC-------------------------- Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSD-LAEKEVEYGFYDEKHYSGLNMATLAAIHEEGW-------------------------- 53 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~-l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~-------------------------- 53 (153) ...++-.+.- .|.+.|.. .....|.||.. ..+|.+|+||. T Consensus 71 ~~~~~L~~tg---~L~~Si~~~~~~~~v~vGtn----------~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~ 137 (190) T protein:vir:99 71 NRDKILTLDG---HLRNLLRYQLDGSELLFGSD----------RPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREF 137 (190) T ss_pred CCCccceecH---HHHHHHhheecCcEEEEecC----------cchhhhhhcCCcccccccchhhhhhhhhhhhhhhccc Confidence 2222222221 33343432 34566777742 56789999993 Q ss_pred ------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCCC Q lcl|NC_020201. 54 ------------------NNLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGRG 89 (153) Q Consensus 54 ------------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~~ 89 (153) .+||+||||--+ ++..+++.+.+.+.+..++.... T Consensus 138 ~~~~~~~~~~~~~~~~~~v~IPaRpfLG~s-~~d~~~I~~~i~~~l~~~~~~~~ 190 (190) T protein:vir:99 138 VPRRRSNFAQDVQIGPYTIQMPARPWLGTS-SQDDDTILQRVERYLQRALRERA 190 (190) T ss_pred ccccccccchhcccccceeeecCcccCCCC-HHHHHHHHHHHHHHHHHHHhhcC Confidence 258999999443 33345555555555554444332 No 73 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=96.83 E-value=2.5e-06 Score=51.28 Aligned_cols=73 Identities=12% Similarity=0.079 Sum_probs=38.5 Q ss_pred CCccccccHHHHH--HHHHHHH-HhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCC----------CCCCCCchhhHHHH Q lcl|NC_020201. 1 MAKKSSTDISELK--RYFSQLS-DLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGW----------NNLPERNFMFSTSM 67 (153) Q Consensus 1 M~~~i~~~~~~l~--~~~~~l~-~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~----------~~IP~RpFlr~~~~ 67 (153) -....+ ...+. ++.+.|. ......+.|||+. ++..||++|+||. .+||+||||--+ + T Consensus 64 k~~~~~--~~l~~~g~l~~sl~~~~~~~~~~V~~~G-------s~~~yAa~HQfG~~~r~~~~~~~~~iPaRp~LG~s-~ 133 (149) T protein:vir:98 64 KKGRIR--REMFARLRTNRFMKAKGSDSAAVVEFTG-------RVQRMARVHQYGLKDRPNRHSRDVQYAARPLLGFT-R 133 (149) T ss_pred ccCCCC--cccchhhhhhhhhhheecCCeeEEEecC-------cchHHhhHhhccccccccCCCcceeccccccCCCC-H Confidence 110110 01111 1122222 2355678888864 3479999999995 269999999433 2 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_020201. 68 HFQEGLRKHIKRMHNG 83 (153) Q Consensus 68 ~~~~~~~~~~~~~~~~ 83 (153) +..+++.+.+.+.+.. T Consensus 134 ~d~~~i~~~i~~~l~~ 149 (149) T protein:vir:98 134 DDEQMIEDIIIRHLGK 149 (149) T ss_pred HHHHHHHHHHHHHhhC Confidence 3344555554444432 No 74 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=96.82 E-value=2.6e-06 Score=51.20 Aligned_cols=75 Identities=13% Similarity=0.104 Sum_probs=39.8 Q ss_pred CCccccccHHHHH--HHHHHHH-HhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCC----------CCCCCCchhhHHHH Q lcl|NC_020201. 1 MAKKSSTDISELK--RYFSQLS-DLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGW----------NNLPERNFMFSTSM 67 (153) Q Consensus 1 M~~~i~~~~~~l~--~~~~~l~-~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~----------~~IP~RpFlr~~~~ 67 (153) .+.+-. ....+. .+...|. ..+...+.|||..+ ++..||++|.||- .+||+||||--+-+ T Consensus 63 ~k~g~~-~~~l~~~~~l~~sl~~~~~~~~~~vg~~~G------s~~~yAa~HQfG~~~~~~~~~~~~~iPaRp~LG~s~~ 135 (150) T protein:vir:20 63 KKTGRV-KRKMFAKLITSRFLHIRASPEQASMEFYGG------KSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTGE 135 (150) T ss_pred HhccCC-CccccchhhhhhhhheeecCcEEEEEeeCC------cchhhhhhhhcccccccccCCCceeccccccCCCCHH Confidence 111100 000001 1222232 23456788998644 3578999999993 37999999954432 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_020201. 68 HFQEGLRKHIKRMHNG 83 (153) Q Consensus 68 ~~~~~~~~~~~~~~~~ 83 (153) ..+++.+.+.+.+.. T Consensus 136 -d~~~i~~~i~~~l~k 150 (150) T protein:vir:20 136 -DVQMIEEIILAHLER 150 (150) T ss_pred -HHHHHHHHHHHHHhC Confidence 344455544444443 No 75 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=96.82 E-value=1.1e-06 Score=53.20 Aligned_cols=81 Identities=17% Similarity=0.128 Sum_probs=44.3 Q ss_pred CCcccccc--HHHHHHHHH---------------HHHHh--hCCEEEEeecCcC-----CCCC-------CCHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTD--ISELKRYFS---------------QLSDL--AEKEVEYGFYDEK-----HYSG-------LNMATLAAIH 49 (153) Q Consensus 1 M~~~i~~~--~~~l~~~~~---------------~l~~l--~~~~v~VGi~~~~-----~~dG-------~~va~iA~~~ 49 (153) |.+.+..+ .+.|+++.+ .++.. ...-|.-|-+... ..+| .+.+.+|.++ T Consensus 4 ~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~g~~~~~~v~~~~~YA~~v 83 (142) T protein:vir:94 4 LNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSGGRFSFSVTIGTNVTYAADV 83 (142) T ss_pred eEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeccCCceEEEEEecCcccchhh Confidence 55443321 122222221 11111 1111333322210 1112 1347899999 Q ss_pred hcCCC---------------------------CCCCCchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 50 EEGWN---------------------------NLPERNFMFSTSMHFQEGLRKHIKRMH 81 (153) Q Consensus 50 E~G~~---------------------------~IP~RpFlr~~~~~~~~~~~~~~~~~~ 81 (153) ||||. .+||||||+++++++++.+.+.++.+- T Consensus 84 E~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~~~~~~~ 142 (142) T protein:vir:94 84 EYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRNHAKGIR 142 (142) T ss_pred hccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHHHHHHHHHHhcC Confidence 99972 388999999999999998888888754 No 76 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=96.80 E-value=2.6e-06 Score=51.23 Aligned_cols=75 Identities=13% Similarity=0.162 Sum_probs=38.1 Q ss_pred CCccccccHHHHHHH--HHHHH-HhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCC----------CCCCCCchhhHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRY--FSQLS-DLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGW----------NNLPERNFMFSTSM 67 (153) Q Consensus 1 M~~~i~~~~~~l~~~--~~~l~-~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~----------~~IP~RpFlr~~~~ 67 (153) ...+-. ....+..+ ...|. ..+...+.|||..+ ++..||++|+||. .+||+||||--+-+ T Consensus 63 ~k~~~~-~~~l~~~~~l~~sl~~~~~~~~a~vg~~~G------t~~~yAaiHQfG~~~~~~~~~~~~~iPaRp~LG~s~~ 135 (150) T protein:vir:60 63 KKTGRV-KRKMFAKLITSRFLHIRASPEQASMEFYGG------KSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTGE 135 (150) T ss_pred HhhcCC-CccchhhhhhcceeeeeeeCcEEEEEeeCC------CchhhhhhhhccccccccCCCCceecCCcccCCCCHH Confidence 111110 00111111 11111 22455688887644 3579999999993 26999999965533 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_020201. 68 HFQEGLRKHIKRMHNG 83 (153) Q Consensus 68 ~~~~~~~~~~~~~~~~ 83 (153) + ..++.+.+...+.. T Consensus 136 d-~~~i~~~i~~~l~r 150 (150) T protein:vir:60 136 D-VQMIEEIILAHLDR 150 (150) T ss_pred H-HHHHHHHHHHHHhC Confidence 2 34444444444333 No 77 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=96.71 E-value=3.4e-06 Score=50.55 Aligned_cols=75 Identities=13% Similarity=0.141 Sum_probs=38.1 Q ss_pred CCccccccHHHHHHH--HHHHH-HhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCC----------CCCCCCchhhHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRY--FSQLS-DLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGW----------NNLPERNFMFSTSM 67 (153) Q Consensus 1 M~~~i~~~~~~l~~~--~~~l~-~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~----------~~IP~RpFlr~~~~ 67 (153) .+.+-. ....+..+ ...|. ..+...+.|||..+ ++..||++|+||. .+||+||||--+-+ T Consensus 63 ~k~~~~-~~~l~~~~~l~~sl~~~~~~~~a~vg~~~G------~~~~yAaiHQfG~~~r~~~~~~~~~iPaRp~LG~s~~ 135 (150) T protein:vir:57 63 KKTGRV-KRKMFAKLITSRFLHIRASPEQASMEFYGG------KSPKIASVHQFGLSEETRKDGKKIDYPARPLLGFTGE 135 (150) T ss_pred HhccCC-CcccchhhhhccceeeeeeCcEEEEEeecC------CchhhhhhhhccccccccCCCceeecCCcccCCCCHH Confidence 111110 00111111 11111 23455678887544 3579999999993 26999999955422 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_020201. 68 HFQEGLRKHIKRMHNG 83 (153) Q Consensus 68 ~~~~~~~~~~~~~~~~ 83 (153) ...++.+.+...+.. T Consensus 136 -d~~~i~~~i~~~l~r 150 (150) T protein:vir:57 136 -DVQMIEEIILAHLDR 150 (150) T ss_pred -HHHHHHHHHHHHHhC Confidence 244444444444433 No 78 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=96.65 E-value=6.2e-06 Score=49.13 Aligned_cols=75 Identities=23% Similarity=0.189 Sum_probs=36.1 Q ss_pred CC-----------------------ccccccHHHHHHHHHHHHH-hhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCC-- Q lcl|NC_020201. 1 MA-----------------------KKSSTDISELKRYFSQLSD-LAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWN-- 54 (153) Q Consensus 1 M~-----------------------~~i~~~~~~l~~~~~~l~~-l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~-- 54 (153) .+ .++-.+.- .|...|.. .+...|.||- + ..||++|+||.. T Consensus 65 ~r~~~g~~~~k~~~~~~~~~~~~~~~~~L~~tG---~L~~Si~~~~~~~~v~vGt---------n-~~YAaiHqfGg~~~ 131 (175) T protein:vir:10 65 MRVGGKKAYKKNGELTAAASRRKAGLMILQDSG---QMAASVSTDHDDNSAVIGS---------N-KEYAAIHQFGGQAG 131 (175) T ss_pred hhhcccccchhhhhhhhhhhhhccCCCcceech---hhhhhhheeecCCEEEEec---------C-hhhhhhhhcccccC Confidence 11 01101100 12222221 1233445543 2 468999999975 Q ss_pred -----CCCCCchhhHHHHHH-----HHHHHHHHHHHHHHHHcCC Q lcl|NC_020201. 55 -----NLPERNFMFSTSMHF-----QEGLRKHIKRMHNGIIQGR 88 (153) Q Consensus 55 -----~IP~RpFlr~~~~~~-----~~~~~~~~~~~~~~~~~G~ 88 (153) +||+||||--+-++. .+.|...+...+...+.++ T Consensus 132 ~~~~v~iPaRpfLG~s~~d~~~~e~~~~Il~~~~~~l~~~~~~~ 175 (175) T protein:vir:10 132 RGLKVTIPARPWLPVTADGELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred CCCccccCCccccCCCcccccchHHHHHHHHHHHHHHHHHhccC Confidence 899999995432221 2334444444444455555 No 79 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=96.64 E-value=7.7e-06 Score=48.62 Aligned_cols=74 Identities=11% Similarity=0.069 Sum_probs=38.1 Q ss_pred CCccccccHHHHHHH--HHHHHH-hhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCC----------CCCCCchhhHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRY--FSQLSD-LAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWN----------NLPERNFMFSTSM 67 (153) Q Consensus 1 M~~~i~~~~~~l~~~--~~~l~~-l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~----------~IP~RpFlr~~~~ 67 (153) ++...+. ...+..+ -+.|.. .....+.||+.. ++..||++|+||.. +||+||||--+ + T Consensus 63 ~~~g~~~-~~~~~~l~~~~~l~~~~~~~~~~v~~~G-------tn~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s-~ 133 (149) T protein:vir:18 63 SKKGRIK-REMFAKLRTSRFMKAKGSDSAAVVEFTG-------KVQRMARVHQYGLKDRPNRNSRDVQYEARPLLGFT-R 133 (149) T ss_pred hccCccc-chhhhhhhhhhhhheeecCceeEEEecc-------cchhhhhhhhccccccccCCCccccccccccCCCC-H Confidence 2222211 1122221 111211 234467777753 24789999999953 79999999544 2 Q ss_pred HHHHHHHHHHHHHHHH Q lcl|NC_020201. 68 HFQEGLRKHIKRMHNG 83 (153) Q Consensus 68 ~~~~~~~~~~~~~~~~ 83 (153) +...++.+.+.+.+.. T Consensus 134 ~d~~~I~~~i~~~l~~ 149 (149) T protein:vir:18 134 DDEQMIEDVIISHLGK 149 (149) T ss_pred HHHHHHHHHHHHHHhC Confidence 3344454444444332 No 80 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=96.64 E-value=2.4e-06 Score=51.41 Aligned_cols=85 Identities=19% Similarity=0.053 Sum_probs=46.0 Q ss_pred CCccccccHHHHHHHHH-----HHHHhhC--CEEEEeecCcC-----CCCCC-----CHHHHHHHHhcCC---------- Q lcl|NC_020201. 1 MAKKSSTDISELKRYFS-----QLSDLAE--KEVEYGFYDEK-----HYSGL-----NMATLAAIHEEGW---------- 53 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~-----~l~~l~~--~~v~VGi~~~~-----~~dG~-----~va~iA~~~E~G~---------- 53 (153) |..-.+.-...++++.. .++...+ ..|.-|-+... ..+|. +.+.+|.+.|||| T Consensus 14 ~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~~g~~~~V~~~~~YA~yVE~GTG~~~~~~~gr 93 (141) T protein:vir:78 14 RKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRKSSKEVIVGNSSDYAIYYEFGTGEKSERGGGK 93 (141) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeecCCcEEEEecCCCccceeecCCcccccCCCCC Confidence 11111111111222100 1111111 12344433221 01221 3478999999997 Q ss_pred ----------------CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 54 ----------------NNLPERNFMFSTSMHFQEGLRKHIKRMHNGII 85 (153) Q Consensus 54 ----------------~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~ 85 (153) ..-|+||||+++++++++++.+.+++.+..+- T Consensus 94 k~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i~~~i~~~~~~l~ 141 (141) T protein:vir:78 94 AGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKVRVFTERALRGIN 141 (141) T ss_pred cCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHHHHHHHHHhhccC Confidence 24799999999999999999999888777543 No 81 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=96.59 E-value=6.7e-06 Score=48.97 Aligned_cols=75 Identities=16% Similarity=0.204 Sum_probs=40.9 Q ss_pred CCccccccHHHHHHHHHHHHH-hhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCC-------CCCCCchhhHH-HHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSD-LAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWN-------NLPERNFMFST-SMHFQE 71 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~-l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~-------~IP~RpFlr~~-~~~~~~ 71 (153) ...++-.+.- .|...|.. .....|.||.. ..||++|+||.. +||+||||--. -++-+. T Consensus 71 ~~~~~L~~tG---~L~~Si~~~~~~~~v~vGtn----------~~YA~iHqfGg~~~~~~~~~iPARPfLG~s~~~e~~~ 137 (155) T protein:vir:10 71 GAHPILQVTN---ALARSITTRADRDQAQIGSN----------LSYAAIQQLGGQAGRGRKVTIPARPYLPVLRNGQLKP 137 (155) T ss_pred CCCCccccch---hhhhhhhceecCCEEEEecC----------cchhhhhhcccccCCCCccccCCccccCCCccccchH Confidence 2222222221 23333332 24556777642 468999999963 79999999422 122244 Q ss_pred HHHHHHHHHHH-HHHcCC Q lcl|NC_020201. 72 GLRKHIKRMHN-GIIQGR 88 (153) Q Consensus 72 ~~~~~~~~~~~-~~~~G~ 88 (153) ++.+.+.+.+. .+..|+ T Consensus 138 ei~~~I~~~i~~~l~~~r 155 (155) T protein:vir:10 138 SARDAVLDVLLAALSQGR 155 (155) T ss_pred HHHHHHHHHHHHHHhhcC Confidence 55555555554 455777 No 82 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=96.52 E-value=1.5e-06 Score=52.46 Aligned_cols=79 Identities=8% Similarity=-0.034 Sum_probs=46.8 Q ss_pred CCccccccHHHHHHH----HHHHHHhhC----------------CEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCC- Q lcl|NC_020201. 1 MAKKSSTDISELKRY----FSQLSDLAE----------------KEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPER- 59 (153) Q Consensus 1 M~~~i~~~~~~l~~~----~~~l~~l~~----------------~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~R- 59 (153) |...-+...+.|++. .++++.-.. --+.||+. .+-+-++-.+||||...|+| T Consensus 20 g~~~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkkik~~~kk~g~~~VG~~-------ks~~fy~kF~EFGTSkm~a~~ 92 (119) T protein:vir:10 20 MVLDESTKRKGIKAGITKIGKAIEKNSPIKSGRLSKVKIRVKNTGLATEGTA-------SSSEFYDIFQNFGTSEQKAHV 92 (119) T ss_pred hhhhHHHHHHHHHHHhHHHHHHHhhcCCcccCCcceeeeeeecCceeEeccC-------CcchhhhhhccccccccCCCC Confidence 322222222222221 112221111 12445542 24578999999999999999 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020201. 60 NFMFSTSMHFQEGLRKHIKRMHNGIIQ 86 (153) Q Consensus 60 pFlr~~~~~~~~~~~~~~~~~~~~~~~ 86 (153) |||.+++++..++..+.+...+..=+. T Consensus 93 pF~~~a~~~~~~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 93 GYFDRAVDETTNEAVEEVAEIIFRKMR 119 (119) T ss_pred CccccccccChHHHHHHHHHHHHHhcC Confidence 999999999999988887775543233 No 83 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=96.50 E-value=2.3e-06 Score=51.46 Aligned_cols=86 Identities=19% Similarity=0.260 Sum_probs=44.1 Q ss_pred CCccccccHHHHHHHHHH------------H----HHh---------hCCEEEEeecCcC-----CCCC-------CCHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQ------------L----SDL---------AEKEVEYGFYDEK-----HYSG-------LNMA 43 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~------------l----~~l---------~~~~v~VGi~~~~-----~~dG-------~~va 43 (153) |+++|+. .+.|.+-+++ + +.+ ....|.-|-+... ..+| .+.+ T Consensus 2 ~~v~i~G-ld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~~~~~~g~V~~~~ 80 (182) T protein:vir:10 2 IEVELKG-VNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVDGDEVIGRWWNSS 80 (182) T ss_pred eEEEEec-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeecCCeEEEEeecCC Confidence 7777653 2222221211 1 000 0001222321110 0011 1225 Q ss_pred HHHHHHhcCC------------------------------------------------------CCCCCCchhhHHHHHH Q lcl|NC_020201. 44 TLAAIHEEGW------------------------------------------------------NNLPERNFMFSTSMHF 69 (153) Q Consensus 44 ~iA~~~E~G~------------------------------------------------------~~IP~RpFlr~~~~~~ 69 (153) .+|.+.|||| .+.||||||+|+++++ T Consensus 81 ~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~pA~~~~ 160 (182) T protein:vir:10 81 MVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMTPAANKM 160 (182) T ss_pred CccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchHHHHHHh Confidence 6777777775 2469999999999999 Q ss_pred HHHHHHHHHHHHHHH----HcC Q lcl|NC_020201. 70 QEGLRKHIKRMHNGI----IQG 87 (153) Q Consensus 70 ~~~~~~~~~~~~~~~----~~G 87 (153) ++.+.+.+++.++.. ..| T Consensus 161 ~~~i~~~i~~~i~~~l~~~~g~ 182 (182) T protein:vir:10 161 AKEAPEIIKRSIDQELHDKLGG 182 (182) T ss_pred HHHHHHHHHHHHHHHHHHhhcC Confidence 998887777665543 333 No 84 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=96.46 E-value=3.2e-06 Score=50.68 Aligned_cols=76 Identities=16% Similarity=0.126 Sum_probs=40.1 Q ss_pred CCccccccH-HHHHH----HHHHHHHhhCCEEEEeecCcC-----CCCC-----CCHHHHHHHHhcCC------------ Q lcl|NC_020201. 1 MAKKSSTDI-SELKR----YFSQLSDLAEKEVEYGFYDEK-----HYSG-----LNMATLAAIHEEGW------------ 53 (153) Q Consensus 1 M~~~i~~~~-~~l~~----~~~~l~~l~~~~v~VGi~~~~-----~~dG-----~~va~iA~~~E~G~------------ 53 (153) |.-.+.... +.+.+ +.+..+.+.. |.-|-+... ..+| .+.+.+|.+.|||| T Consensus 18 ~~~~~~~~~~~~l~~~a~~~~~~ak~~~p--vdTG~L~~Si~~~~~~~g~~~~V~~~~~YA~yvE~GT~~~~~~~~~~~~ 95 (137) T protein:vir:96 18 YRDEMEEWVKKGILKTTLAIYNTAVALAP--VDLGFLKESIDFKVTDGGFSSVISVGAEYAIYVEFGTGIYATGPGGSRA 95 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCC--cCccchhcCceeEeecCceEEEEecCCCcccccccCccccccCCCcccc Confidence 221111000 11111 2222222222 222322110 1122 23368999999997 Q ss_pred -----------------CCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_020201. 54 -----------------NNLPERNFMFSTSMHFQEGLRKHIK 78 (153) Q Consensus 54 -----------------~~IP~RpFlr~~~~~~~~~~~~~~~ 78 (153) +.+|+||||++++++++..+.+.+. T Consensus 96 ~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:96 96 RKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRKVFNRYFS 137 (137) T ss_pred ccccceeeccCcceeecCCCCCCcchhHHHHHHHHHHHHhhC Confidence 3489999999999999999888777 No 85 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=96.39 E-value=3.7e-06 Score=50.36 Aligned_cols=76 Identities=13% Similarity=0.096 Sum_probs=40.1 Q ss_pred CCccc----cccHHH-HHHHHHHHHHhhCCEEEEeecCcC-----CCCC-----CCHHHHHHHHhcCC------------ Q lcl|NC_020201. 1 MAKKS----STDISE-LKRYFSQLSDLAEKEVEYGFYDEK-----HYSG-----LNMATLAAIHEEGW------------ 53 (153) Q Consensus 1 M~~~i----~~~~~~-l~~~~~~l~~l~~~~v~VGi~~~~-----~~dG-----~~va~iA~~~E~G~------------ 53 (153) +.-.+ ...... -..+.+..+.+.. |.-|-+... ..+| .+.+.+|.+.|||| T Consensus 18 ~~~~~~~~~~~~~~~~a~~v~~~ak~~aP--v~TG~L~~Si~~~~~~~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~ 95 (137) T protein:vir:95 18 YERDMERWVKRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRA 95 (137) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCC--ccchhhhcCeeeEeeCCceEEEEecCCCcccccccCccccccCCCcccc Confidence 22111 111100 1111122222222 223322110 0111 23467899999997 Q ss_pred -----------------CCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_020201. 54 -----------------NNLPERNFMFSTSMHFQEGLRKHIK 78 (153) Q Consensus 54 -----------------~~IP~RpFlr~~~~~~~~~~~~~~~ 78 (153) ...|+||||+++++.++..+.+.+. T Consensus 96 ~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 96 KKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cccccceeccCcceeecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 3589999999999999999988877 No 86 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=96.37 E-value=2.1e-06 Score=51.76 Aligned_cols=74 Identities=20% Similarity=0.180 Sum_probs=44.0 Q ss_pred CCccccccHHHHHHHHHHHHHhhCC------------------------EEEEeecCcC-----CCCC-----CCHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEK------------------------EVEYGFYDEK-----HYSG-----LNMATLA 46 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~------------------------~v~VGi~~~~-----~~dG-----~~va~iA 46 (153) |+. ++. ++++|.+.|+.+... .|.-|-+... ..+| .+.+.+| T Consensus 13 Ma~-~~~---Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA 88 (149) T protein:vir:94 13 MAK-VKY---GADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYA 88 (149) T ss_pred HHH-HHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEeeCCcEEEEEecCCCcc Confidence 744 322 444444444333211 1222322110 1122 2337899 Q ss_pred HHHhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_020201. 47 AIHEEGW-----------------------------NNLPERNFMFSTSMHFQEGLRKHIK 78 (153) Q Consensus 47 ~~~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 78 (153) .+.|||| ++.||||||++++++++..+.+.+. T Consensus 89 ~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i~~~i~ 149 (149) T protein:vir:94 89 IYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred cccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 9999997 2479999999999999999888877 No 87 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=96.37 E-value=3.3e-05 Score=45.12 Aligned_cols=86 Identities=12% Similarity=0.058 Sum_probs=58.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCC----CCCCccHHHHHhcC--CCCcchhHHHH Q lcl|NC_020201. 66 SMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSF----SNPKVSKDWASYKG--FDDAMIHYGDL 139 (153) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~----~~p~ns~~Ti~~KG--~~~PLidTG~L 139 (153) +++. +++...+..++..+ ...+....|..||..+....++.|.+..- +++|+++.|+++|. ..++|.++|.| T Consensus 1 ~~~~-~~l~~~L~~ll~~l-~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~~~~l~~~~~l 78 (150) T protein:vir:20 1 MNEF-KRFEDRLTGLIESL-SPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRVKRKMFAKLIT 78 (150) T ss_pred CchH-HHHHHHHHHHHHhc-CChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCCCccccchhhh Confidence 2222 33333444443332 22245678999999999999999987521 24688999987663 35799999999 Q ss_pred HhhcceeeeeCC-------CC Q lcl|NC_020201. 140 SSAATYKIVKYQ-------GK 153 (153) Q Consensus 140 ~~Sity~V~~~~-------gk 153 (153) ..||+|++-... |. T Consensus 79 ~~sl~~~~~~~~~~vg~~~Gs 99 (150) T protein:vir:20 79 SRFLHIRASPEQASMEFYGGK 99 (150) T ss_pred hhhhheeecCcEEEEEeeCCc Confidence 999999876544 33 No 88 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=96.27 E-value=1.2e-05 Score=47.61 Aligned_cols=75 Identities=13% Similarity=0.131 Sum_probs=37.3 Q ss_pred CCccccccHHHHHHHHHHHHH-hhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCC-------CCCCCchhhHHHHHH--H Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSD-LAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWN-------NLPERNFMFSTSMHF--Q 70 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~-l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~-------~IP~RpFlr~~~~~~--~ 70 (153) ...++-.+.- .|...|.. .+...|.||- ...||++|+||.. +||+||||--+-++. . T Consensus 71 ~~~~iL~~tG---~L~~Si~~~~~~~~v~vGt----------~~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~ 137 (155) T protein:vir:79 71 GPHPILQVTN---ALARSVTTWADRNEAGIGS----------NLVYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAA 137 (155) T ss_pred CCCCccccch---hhhhhhhceecCCEEEEec----------CchhhhhhhcccccCCCCccccCCccccCCCCccccch Confidence 1222222221 23333322 2344566653 2468999999964 799999994332211 2 Q ss_pred HHHHHHHHHHHHHHHcCC Q lcl|NC_020201. 71 EGLRKHIKRMHNGIIQGR 88 (153) Q Consensus 71 ~~~~~~~~~~~~~~~~G~ 88 (153) +.+...+.-+...+..|+ T Consensus 138 ~~~~~I~~~i~~~l~r~r 155 (155) T protein:vir:79 138 GARQSILEVVLTALSRNR 155 (155) T ss_pred HHHHHHHHHHHHHHHhcC Confidence 222333333334555677 No 89 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=96.20 E-value=8.6e-06 Score=48.36 Aligned_cols=82 Identities=11% Similarity=0.078 Sum_probs=45.3 Q ss_pred CCccccccHHHHHHHH-HHHHHh---h-CCEEEEeecCcCCCCCCCHHHHHHHHhcC----------------------- Q lcl|NC_020201. 1 MAKKSSTDISELKRYF-SQLSDL---A-EKEVEYGFYDEKHYSGLNMATLAAIHEEG----------------------- 52 (153) Q Consensus 1 M~~~i~~~~~~l~~~~-~~l~~l---~-~~~v~VGi~~~~~~dG~~va~iA~~~E~G----------------------- 52 (153) |+-.+....=.|.+-+ .....- + ...-.||+-.-+. --+.+.||| T Consensus 5 akarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkA-------PhghlvE~Ghw~~~~~~~~~dG~w~~~~~~l~ 77 (119) T protein:vir:81 5 AKAFVNDETGKLRSNLYVAYSPEESTNGVQTYAVSWRKKAA-------PHGHLLEFGHWQTHAAYKGKDGEWYSSSVKLV 77 (119) T ss_pred cccccCCCccchhhhheeeeccccCCCCeEEEEeeccCCcC-------CcccccccceeeeeeeeeccCceeeecCcccc Confidence 4444433322232222 211111 1 1234455544332 223345888 Q ss_pred -CCCCCCCchhhHHHHHHHHHHHHHHHHH----HHHHHcCCC Q lcl|NC_020201. 53 -WNNLPERNFMFSTSMHFQEGLRKHIKRM----HNGIIQGRG 89 (153) Q Consensus 53 -~~~IP~RpFlr~~~~~~~~~~~~~~~~~----~~~~~~G~~ 89 (153) ...+|+|||||++++.......+.+.+. +..++.|+. T Consensus 78 ~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:81 78 NPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred CceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 3469999999999998888887777765 455667754 No 90 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=96.19 E-value=2.1e-06 Score=51.70 Aligned_cols=78 Identities=18% Similarity=0.104 Sum_probs=42.4 Q ss_pred CCccccccH---HHHHHHHHHHH--------Hhh---------CCEEEEeecCcC-----CCCC-----CCHHHHHHHHh Q lcl|NC_020201. 1 MAKKSSTDI---SELKRYFSQLS--------DLA---------EKEVEYGFYDEK-----HYSG-----LNMATLAAIHE 50 (153) Q Consensus 1 M~~~i~~~~---~~l~~~~~~l~--------~l~---------~~~v~VGi~~~~-----~~dG-----~~va~iA~~~E 50 (153) |....+... +.|+++.+.++ +.. ...|.-|-+... ..+| .+.+.+|.+.| T Consensus 1 Ma~~~~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVN 80 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeecCcEEEEEecCCCcccccc Confidence 665432211 22222222111 100 011222322110 1122 23368999999 Q ss_pred cCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_020201. 51 EGW-----------------------------NNLPERNFMFSTSMHFQEGLRKHIK 78 (153) Q Consensus 51 ~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 78 (153) ||| ..+|+||||+++++.+++.+.+.+. T Consensus 81 ~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 81 YGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRVFFNKYFS 137 (137) T ss_pred cCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 995 3589999999999999999988877 No 91 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=96.19 E-value=3.1e-06 Score=50.80 Aligned_cols=74 Identities=20% Similarity=0.177 Sum_probs=44.1 Q ss_pred CCccccccHHHHHHHHHHHHHhhCC------------------------EEEEeecCcC-----CCCC-----CCHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEK------------------------EVEYGFYDEK-----HYSG-----LNMATLA 46 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~------------------------~v~VGi~~~~-----~~dG-----~~va~iA 46 (153) |+.. +. +++++.+.|+++... .|.-|.+... ..+| .+.+.+| T Consensus 13 Ma~v-~~---Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~~~~g~~~~V~~~~~YA 88 (149) T protein:vir:10 13 MAKV-KY---GADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYA 88 (149) T ss_pred hHHH-HH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEecCCcEEEEEecCCCcc Confidence 7543 22 344444444333111 1222322211 1122 2337899 Q ss_pred HHHhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_020201. 47 AIHEEGW-----------------------------NNLPERNFMFSTSMHFQEGLRKHIK 78 (153) Q Consensus 47 ~~~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 78 (153) .+.|||| ++.||||||++++++++..+.+.+. T Consensus 89 ~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i~~~i~ 149 (149) T protein:vir:10 89 IYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred cccccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 9999997 2479999999999999999988877 No 92 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=96.16 E-value=1.3e-05 Score=47.32 Aligned_cols=79 Identities=14% Similarity=0.158 Sum_probs=40.1 Q ss_pred CCccccccHHHHHHHHHH--HH-HhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCC----------CCCCCchhhHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQ--LS-DLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWN----------NLPERNFMFSTSM 67 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~--l~-~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~----------~IP~RpFlr~~~~ 67 (153) ....+......+..+... |. ..+...+.|||.. ++..||++|.||.. +||+||||--+-+ T Consensus 65 ~~~~~~~~~~m~~~l~~~~~l~~~~~~~~a~vg~~G-------s~~~yA~iHQfG~~~~~~~~~~~v~iPaRp~LG~s~~ 137 (156) T protein:vir:11 65 KQGRIRRKIKMFQKLRTVRYLRAKGDAQAITVSFAG-------RIARIARVHQYGLRDRAEPGAPEVSYAQRLLLGFDSS 137 (156) T ss_pred hccccccchhhhhhhhhhheeeeeecCcEEEEEecC-------CchhhhhhhcccccccccCCCCcccccccccCCCCHH Confidence 112222222222222211 11 1245678888753 34789999999953 7999999954422 Q ss_pred HHHHHHHHHHHHHHHHHHcCCCHH Q lcl|NC_020201. 68 HFQEGLRKHIKRMHNGIIQGRGFS 91 (153) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~G~~~~ 91 (153) + .+++.+. +...+.+.++- T Consensus 138 d-~~~i~~~----i~~~l~~~~~~ 156 (156) T protein:vir:11 138 D-METIQNG----ILAHIDANSPI 156 (156) T ss_pred H-HHHHHHH----HHHHHhhcCCC Confidence 2 2333333 33334455433 No 93 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=96.14 E-value=9.6e-06 Score=48.09 Aligned_cols=82 Identities=11% Similarity=0.081 Sum_probs=45.2 Q ss_pred CCccccccHHHHHHHH-HHHHHh---h-CCEEEEeecCcCCCCCCCHHHHHHHHhcC----------------------- Q lcl|NC_020201. 1 MAKKSSTDISELKRYF-SQLSDL---A-EKEVEYGFYDEKHYSGLNMATLAAIHEEG----------------------- 52 (153) Q Consensus 1 M~~~i~~~~~~l~~~~-~~l~~l---~-~~~v~VGi~~~~~~dG~~va~iA~~~E~G----------------------- 52 (153) |+-.+....=.|.+-+ .....- + ...-.||+-.-+. --+.+.||| T Consensus 5 akarv~~~~G~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~rkA-------PhghlvE~Ghw~~~~~~~~~dG~w~~~~~~l~ 77 (119) T protein:vir:10 5 AKAFVNDETGKLRSNLYVAYSTEESTNGVQTYAVSWRKKAA-------PHGHLLEFGHWQTHAAYKGKDGEWYSSSVKLV 77 (119) T ss_pred cccccCCCccchhhhheeeeccccCCCCEEEEEeecCCCcC-------CcccccccceeeeeeeeeccCceeeecCcccc Confidence 4444433322232222 221111 1 1234455544332 223445888 Q ss_pred -CCCCCCCchhhHHHHHHHHHHHHHHHHH----HHHHHcCCC Q lcl|NC_020201. 53 -WNNLPERNFMFSTSMHFQEGLRKHIKRM----HNGIIQGRG 89 (153) Q Consensus 53 -~~~IP~RpFlr~~~~~~~~~~~~~~~~~----~~~~~~G~~ 89 (153) ...+|+|||||++++.......+.+.+. +..++.|+. T Consensus 78 ~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:10 78 NPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred CceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 2369999999999999888887777765 455667754 No 94 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=96.08 E-value=3.9e-06 Score=50.21 Aligned_cols=74 Identities=16% Similarity=0.091 Sum_probs=42.5 Q ss_pred CCccccccHHHHHHHHHHHHHhhC------------------------CEEEEeecCcC-----CCCC-----CCHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAE------------------------KEVEYGFYDEK-----HYSG-----LNMATLA 46 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~------------------------~~v~VGi~~~~-----~~dG-----~~va~iA 46 (153) |+..... . ++|.+.|+.+.+ ..|.-|-+... ..+| .+.+.+| T Consensus 1 Ma~~~~G-~---~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA 76 (137) T protein:vir:10 1 MAKVKYG-N---WDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEYA 76 (137) T ss_pred CccchhC-H---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEecCCcEEEEEecCCccc Confidence 7765432 2 222222222111 01222322111 1123 2336899 Q ss_pred HHHhcCC-----------------------------CCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_020201. 47 AIHEEGW-----------------------------NNLPERNFMFSTSMHFQEGLRKHIK 78 (153) Q Consensus 47 ~~~E~G~-----------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 78 (153) .+.|||| .++||||||+++++++++.+.+.+. T Consensus 77 ~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 77 VYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred cccccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 9999996 1489999999999999999988777 No 95 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=96.04 E-value=1.8e-05 Score=46.61 Aligned_cols=75 Identities=19% Similarity=0.290 Sum_probs=37.2 Q ss_pred CC-------cccccc------HHHHHHHHHH--HH-HhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCC----------C Q lcl|NC_020201. 1 MA-------KKSSTD------ISELKRYFSQ--LS-DLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGW----------N 54 (153) Q Consensus 1 M~-------~~i~~~------~~~l~~~~~~--l~-~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~----------~ 54 (153) -+ .+.+.. ...+..+..+ |. ..+...+.|||.. ++..||++|.||. . T Consensus 55 ~prk~~~~~~~~~~~~g~~~~~~m~~~l~~a~~l~~~~~~d~a~Vg~~G-------s~~~yAaiHQfG~~~r~~~~~~~v 127 (155) T protein:vir:79 55 EPRKVKAGGKRLREKAGRVKREAMFRKLRTARYLRIDVDSTGLAIGFDE-------RLSRIARVHQEGQKAPVEPGGPLA 127 (155) T ss_pred cccchhhhhhhhhcccCcccchhhhhhhhhhheeeeeecCcEEEEEecC-------cchhhhhhhhcCCcccCCCCCccc Confidence 10 000000 0011111100 11 1344668888743 3588999999994 2 Q ss_pred CCCCCchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 55 NLPERNFMFSTSMHFQEGLRKHIKRMHNG 83 (153) Q Consensus 55 ~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 83 (153) +||+||||--+-+ ..+++...+...+.. T Consensus 128 ~iPaRp~LGls~~-d~~~I~~~i~~~l~r 155 (155) T protein:vir:79 128 QYPVRVVLGFSDA-DRELVRDRLLRELTR 155 (155) T ss_pred ccccccccCCCHH-HHHHHHHHHHHHhhC Confidence 6999999954433 344454444444333 No 96 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=95.99 E-value=7.4e-05 Score=43.24 Aligned_cols=86 Identities=10% Similarity=0.011 Sum_probs=58.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCC----CCCCccHHHHHhcCC--CCcchhHHHH Q lcl|NC_020201. 66 SMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSF----SNPKVSKDWASYKGF--DDAMIHYGDL 139 (153) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~----~~p~ns~~Ti~~KG~--~~PLidTG~L 139 (153) +++. +++...+..++..+ ...+....|..||..+....++.|.+..- +++|+++.|+++|+. .++|+++|.| T Consensus 1 ~~~~-~~l~~~L~~~l~~L-~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~~~l 78 (150) T protein:vir:60 1 MNEF-KRFEDRLTGLIESL-SPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLIT 78 (150) T ss_pred CchH-HHHHHHHHHHHHhc-CChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCCCccchhhhhh Confidence 2222 22333333333332 23345678999999999999999986411 356789999987753 5899999999 Q ss_pred HhhcceeeeeCCCC Q lcl|NC_020201. 140 SSAATYKIVKYQGK 153 (153) Q Consensus 140 ~~Sity~V~~~~gk 153 (153) ..||+|++.....- T Consensus 79 ~~sl~~~~~~~~a~ 92 (150) T protein:vir:60 79 SRFLHIRASPEQAS 92 (150) T ss_pred cceeeeeeeCcEEE Confidence 99999988765433 No 97 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=95.95 E-value=1.8e-05 Score=46.53 Aligned_cols=72 Identities=14% Similarity=0.168 Sum_probs=36.3 Q ss_pred CCccccccHHHHHHHHHHHHH-hhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCC-------CCCCCchhhHHHH----- Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSD-LAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWN-------NLPERNFMFSTSM----- 67 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~-l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~-------~IP~RpFlr~~~~----- 67 (153) ...++-.+.. .|...|.. .+...|.||.. ..||++|+||.. +||+||||--+-+ T Consensus 71 ~~~~iL~~tg---~L~~Si~~~~~~~~v~vGtn----------~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~ 137 (155) T protein:vir:99 71 GPHPILQVTN---ALARSVTTWADRNEAGIGSN----------LVYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAA 137 (155) T ss_pred CCCCcchhch---hhhhhhhceecCCEEEEecC----------ccchhhhhcccccCCCCccccCCccccCCCCccccch Confidence 2222322222 23333322 24456666632 467999999964 7999999943321 Q ss_pred HHHHHHHHHHHHHHHHHHcCC Q lcl|NC_020201. 68 HFQEGLRKHIKRMHNGIIQGR 88 (153) Q Consensus 68 ~~~~~~~~~~~~~~~~~~~G~ 88 (153) +-.+++.+.+. +.+..++ T Consensus 138 e~~~~I~~~i~---~~l~~~~ 155 (155) T protein:vir:99 138 GARQSILEIVL---TALSRNR 155 (155) T ss_pred HHHHHHHHHHH---HHHhccC Confidence 12223333333 3444555 No 98 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=95.91 E-value=9.4e-06 Score=48.13 Aligned_cols=77 Identities=10% Similarity=0.163 Sum_probs=51.9 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRM 80 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~ 80 (153) |+-.|+..... +.......+.|||... +.+|-+.||||.++||.||+..|..+.++++.+.+.+. T Consensus 61 laD~I~~s~~~-------~dg~~~g~~~VG~~k~--------~~~A~f~n~GT~k~~~~hFie~t~~e~~~evl~a~~~~ 125 (139) T protein:vir:10 61 LSEDIRSAAGD-------IDGDHNGSSTVGFHNK--------AHIARFLNDGTKYIRADHFVDNARDDAKDAVFAAEAEK 125 (139) T ss_pred hhhcceecCcc-------cccccceeeeeCCCCC--------cceEeecccCccccCCCchHHHHHHHHHHHHHHHHHHH Confidence 44333322111 1111123456888421 47899999999999999999999999999999988888 Q ss_pred HHHHHcCC--CHHH Q lcl|NC_020201. 81 HNGIIQGR--GFSS 92 (153) Q Consensus 81 ~~~~~~G~--~~~~ 92 (153) ++.+++.. +-+. T Consensus 126 ~k~~l~~~~~~~~~ 139 (139) T protein:vir:10 126 YQAMIAKANGGGDK 139 (139) T ss_pred HHHHHhhcCCCCCC Confidence 88877553 2222 No 99 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=95.89 E-value=2.1e-05 Score=46.27 Aligned_cols=75 Identities=12% Similarity=0.145 Sum_probs=34.5 Q ss_pred CC-----ccccccHHHHHHH--HHHHHH-hhCCEEEEeecCcCCCCCCCHHHHHHHHhcCC----------CCCCCCchh Q lcl|NC_020201. 1 MA-----KKSSTDISELKRY--FSQLSD-LAEKEVEYGFYDEKHYSGLNMATLAAIHEEGW----------NNLPERNFM 62 (153) Q Consensus 1 M~-----~~i~~~~~~l~~~--~~~l~~-l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~----------~~IP~RpFl 62 (153) -+ -+-+.....++.+ ...|.. .....+.|||. + ++..||++|+||- .+||+|||| T Consensus 56 ~s~~~~~~~g~~~~~~~~~l~~~~~l~~~~~~~~~~v~~~-G------t~~~yAaiHQfG~~~r~~~~~~~v~iPaRp~L 128 (148) T protein:vir:79 56 RKPQLRHRAGRIRRAMFMRLRLARYMKTQADANTAVVTFA-G------NAQRIATVHQFGLRDRVNKAGLTAQYPARELL 128 (148) T ss_pred cchHHHhhcccccccccchhhhhhheeeeeeCCeeeEEee-c------cchhhhhhhhcCccccccCCCCccccCccccc Confidence 00 0000000001111 111111 23345777763 2 3478999999992 369999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_020201. 63 FSTSMHFQEGLRKHIKRMHNGIIQG 87 (153) Q Consensus 63 r~~~~~~~~~~~~~~~~~~~~~~~G 87 (153) --+-+ ...++.+.+... +.| T Consensus 129 G~s~~-d~~~i~~~i~~~----l~~ 148 (148) T protein:vir:79 129 GMDGV-DMEHITNLLLLH----LGA 148 (148) T ss_pred CCCHH-HHHHHHHHHHHH----hcC Confidence 54422 233343333333 333 No 100 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=95.86 E-value=5.9e-06 Score=49.26 Aligned_cols=74 Identities=15% Similarity=0.105 Sum_probs=44.9 Q ss_pred CCccccccHHHHHHHHHHHHHhhCC------------------------EEEEeecCcC-----CCCC-----CCHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEK------------------------EVEYGFYDEK-----HYSG-----LNMATLA 46 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~------------------------~v~VGi~~~~-----~~dG-----~~va~iA 46 (153) |+.... +|++|.+.|+++... .|.-|-+... ..+| .+.+.+| T Consensus 1 Ma~~~~----Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA 76 (135) T protein:vir:96 1 MAKVKY----GADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFENGGFTGVVKIGSNYA 76 (135) T ss_pred Cchhhh----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEeecCcEEEEEecCCCcc Confidence 876421 333443333332211 1223322111 1123 2457899 Q ss_pred HHHhcCC---------------------------CCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_020201. 47 AIHEEGW---------------------------NNLPERNFMFSTSMHFQEGLRKHIK 78 (153) Q Consensus 47 ~~~E~G~---------------------------~~IP~RpFlr~~~~~~~~~~~~~~~ 78 (153) .+.|||| ..+|+||||++++++.+..+.+.+. T Consensus 77 ~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 77 VYVNYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQTFEQYFS 135 (135) T ss_pred chhhcccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHHHHHHHhcC Confidence 9999997 3489999999999999999888877 No 101 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=95.75 E-value=0.00011 Score=42.27 Aligned_cols=86 Identities=12% Similarity=0.023 Sum_probs=59.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCC----CCCCccHHHHHhcCC--CCcchhHHHH Q lcl|NC_020201. 66 SMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSF----SNPKVSKDWASYKGF--DDAMIHYGDL 139 (153) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~----~~p~ns~~Ti~~KG~--~~PLidTG~L 139 (153) +++. +++...+..++..+ ...+....|..||..+....++.|.+..- +++|+++.|+++|+. .++|+.+|.| T Consensus 1 m~~~-~~l~~~L~~~l~~L-~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~~~~~l~~~~~l 78 (150) T protein:vir:57 1 MNEF-KRFEDRLTGLIESL-SPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLIT 78 (150) T ss_pred CchH-HHHHHHHHHHHHhc-CChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccCCCcccchhhhh Confidence 2222 33333333333332 22345678999999999999999987511 346899999987753 5899999999 Q ss_pred HhhcceeeeeCCC-------C Q lcl|NC_020201. 140 SSAATYKIVKYQG-------K 153 (153) Q Consensus 140 ~~Sity~V~~~~g-------k 153 (153) ..||+|++..... . T Consensus 79 ~~sl~~~~~~~~a~vg~~~G~ 99 (150) T protein:vir:57 79 SRFLHIRASPEQASMEFYGGK 99 (150) T ss_pred ccceeeeeeCcEEEEEeecCC Confidence 9999998876643 3 No 102 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=95.66 E-value=7.3e-06 Score=48.74 Aligned_cols=76 Identities=14% Similarity=0.122 Sum_probs=42.9 Q ss_pred CCccccccHHHH-HHHHHHHHHhhCCEEEEeecCcC-----CCCCC-----CHHHHHHHHhcC----------------- Q lcl|NC_020201. 1 MAKKSSTDISEL-KRYFSQLSDLAEKEVEYGFYDEK-----HYSGL-----NMATLAAIHEEG----------------- 52 (153) Q Consensus 1 M~~~i~~~~~~l-~~~~~~l~~l~~~~v~VGi~~~~-----~~dG~-----~va~iA~~~E~G----------------- 52 (153) |.--++...... ..+.+.++.+.. |.-|-+... ..+|+ +.+.+|.+.||| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aP--v~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~~ 78 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIP 78 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCC--cCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCcccccccc Confidence 444443332221 122222333222 223322211 11221 347899999999 Q ss_pred ------------CCCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_020201. 53 ------------WNNLPERNFMFSTSMHFQEGLRKHIK 78 (153) Q Consensus 53 ------------~~~IP~RpFlr~~~~~~~~~~~~~~~ 78 (153) +..+|+||||++++++++..+.+.+. T Consensus 79 ~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 79 WSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 34699999999999999998877666 No 103 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=95.66 E-value=7.3e-06 Score=48.74 Aligned_cols=76 Identities=14% Similarity=0.122 Sum_probs=42.9 Q ss_pred CCccccccHHHH-HHHHHHHHHhhCCEEEEeecCcC-----CCCCC-----CHHHHHHHHhcC----------------- Q lcl|NC_020201. 1 MAKKSSTDISEL-KRYFSQLSDLAEKEVEYGFYDEK-----HYSGL-----NMATLAAIHEEG----------------- 52 (153) Q Consensus 1 M~~~i~~~~~~l-~~~~~~l~~l~~~~v~VGi~~~~-----~~dG~-----~va~iA~~~E~G----------------- 52 (153) |.--++...... ..+.+.++.+.. |.-|-+... ..+|+ +.+.+|.+.||| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aP--v~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~~~~ 78 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKKIP 78 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCC--cCcccccccceEEeecCcEEEEEecCCCcccccccCCcccccCCCcccccccc Confidence 444443332221 122222333222 223322211 11221 347899999999 Q ss_pred ------------CCCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_020201. 53 ------------WNNLPERNFMFSTSMHFQEGLRKHIK 78 (153) Q Consensus 53 ------------~~~IP~RpFlr~~~~~~~~~~~~~~~ 78 (153) +..+|+||||++++++++..+.+.+. T Consensus 79 ~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 79 WSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 34699999999999999998877666 No 104 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=95.66 E-value=6.3e-06 Score=49.10 Aligned_cols=76 Identities=14% Similarity=0.122 Sum_probs=42.4 Q ss_pred CCccccccHHHH-HHHHHHHHHhhCCEEEEeecCcC-----CCCCC-----CHHHHHHHHhcCC---------------- Q lcl|NC_020201. 1 MAKKSSTDISEL-KRYFSQLSDLAEKEVEYGFYDEK-----HYSGL-----NMATLAAIHEEGW---------------- 53 (153) Q Consensus 1 M~~~i~~~~~~l-~~~~~~l~~l~~~~v~VGi~~~~-----~~dG~-----~va~iA~~~E~G~---------------- 53 (153) |.--++...... ..+.+..+.+.. |.-|-+... ..+|+ +.+.+|.+.|||| T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~ap--v~TG~Lr~SI~~~~~~~~~~~~V~~~~~Ya~yvE~GTg~~~~~~~~~~~~~~~ 78 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLMP--VDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVNYGTGIYATGAGGSRAKNIP 78 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhCC--ccccccccceeEEeecCcEEEEEecCCCccceeecCccccccCCCcccccccc Confidence 444443332222 122222222222 333322211 11221 3478999999993 Q ss_pred -------------CCCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_020201. 54 -------------NNLPERNFMFSTSMHFQEGLRKHIK 78 (153) Q Consensus 54 -------------~~IP~RpFlr~~~~~~~~~~~~~~~ 78 (153) ...||||||++++++++..+.+.+. T Consensus 79 ~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 79 WSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ceeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 3589999999999999998877666 No 105 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=95.27 E-value=1.9e-05 Score=46.43 Aligned_cols=79 Identities=13% Similarity=0.085 Sum_probs=48.5 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHH--HHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHF--QEGLRKHIK 78 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~--~~~~~~~~~ 78 (153) |+=.|+......+ ....-.+.|||... .-+++|-+.||||.++|+-||+..+..+. ++++-+.+. T Consensus 61 laD~I~~~~~~~D-------G~~dg~s~VG~~~~------~~~~~A~f~n~GT~k~~~~hFve~~~~~a~~k~~Vl~A~~ 127 (141) T protein:vir:50 61 MADGLAIQSTNAD-------GRKNGVSTVGWKNN------YHAQNARRLNDGTKKYRADHFVTNVQNDSTVQKKVLLEKK 127 (141) T ss_pred cccceeeccCccc-------cccCCeeeeccCCC------ccceeeeccccCccccCCCchhHHHHHhhhhHHHHHHHHH Confidence 4444432211111 11122456777533 23799999999999999999999999765 566777766 Q ss_pred HHHHHHHcCCCHHH Q lcl|NC_020201. 79 RMHNGIIQGRGFSS 92 (153) Q Consensus 79 ~~~~~~~~G~~~~~ 92 (153) ..++.+++-...+. T Consensus 128 ~~~k~~l~~~~~~~ 141 (141) T protein:vir:50 128 RNTKNSLEEKEGCD 141 (141) T ss_pred HHHHHHHHhccCCC Confidence 66766664331111 No 106 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=95.19 E-value=2.5e-05 Score=45.83 Aligned_cols=78 Identities=15% Similarity=0.111 Sum_probs=49.5 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHH--HHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHF--QEGLRKHIK 78 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~--~~~~~~~~~ 78 (153) |+-.|+......+. ...-.+.|||... ..+++|-+.++||.++|+-||+..+.++. +.++-+.+. T Consensus 61 laD~I~~~~~~iDg-------~~~g~s~VG~~kk------~~a~~A~f~n~GT~k~~~~hFve~~~~e~~~k~~vl~A~~ 127 (140) T protein:vir:48 61 MADGLSVQSTNVDG-------RKNGVSTVGWVNR------YHAQNARRLNDGTKKYRADHFVTNVQNDSAVQTKVLLAEK 127 (140) T ss_pred chhceeeccccccc-------ccCceeeeccCCC------cceeeeeccccCccccCCCchhHHHHHhhhhHHHHHHHHH Confidence 43333321111111 1122456777432 23799999999999999999999999876 566777777 Q ss_pred HHHHHHHcCCCHH Q lcl|NC_020201. 79 RMHNGIIQGRGFS 91 (153) Q Consensus 79 ~~~~~~~~G~~~~ 91 (153) ..++.+++-...+ T Consensus 128 ~~~~~~l~~~~~~ 140 (140) T protein:vir:48 128 EEYEKLIRKKGGE 140 (140) T ss_pred HHHHHHHHhhcCC Confidence 7777776544333 No 107 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=94.77 E-value=0.00042 Score=39.10 Aligned_cols=86 Identities=8% Similarity=-0.039 Sum_probs=58.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCC----CCCCccHHHHHhcCC-CCcchhHHHHH Q lcl|NC_020201. 66 SMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSF----SNPKVSKDWASYKGF-DDAMIHYGDLS 140 (153) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~----~~p~ns~~Ti~~KG~-~~PLidTG~L~ 140 (153) +++ -+++.+.+..++..+ ...+-...|..||..+....++.|.+..- +++|+|+.|.++||. .++|.+++.+. T Consensus 1 m~~-~~~l~~~L~~ll~~l-~~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~~~~~~~~l~~~ 78 (148) T protein:vir:79 1 MSE-SRELEAWLAGMLTKL-DAPARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRIRRAMFMRLRLA 78 (148) T ss_pred Ccc-HHHHHHHHHHHHHhc-CChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccccccccchhhhh Confidence 222 234444444444432 22234578999999999999999986322 346789999888886 47899999999 Q ss_pred hhcceeeeeCCCC Q lcl|NC_020201. 141 SAATYKIVKYQGK 153 (153) Q Consensus 141 ~Sity~V~~~~gk 153 (153) .++++.+...... T Consensus 79 ~~l~~~~~~~~~~ 91 (148) T protein:vir:79 79 RYMKTQADANTAV 91 (148) T ss_pred hheeeeeeCCeee Confidence 9998876554433 No 108 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=94.66 E-value=3.8e-05 Score=44.84 Aligned_cols=77 Identities=14% Similarity=0.206 Sum_probs=52.4 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRM 80 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~ 80 (153) |+-.|+..... +.....-.+.|||... +.+|-+-|+||.++|+.+|+..|..+.++++.+.+.+. T Consensus 61 laD~I~~~~~~-------idg~~~g~~~VG~~~~--------~~~Ahf~n~GT~~~~~~hFie~t~~e~~~ev~~a~~~~ 125 (139) T protein:vir:10 61 LSEDISSAAGD-------IDGDHNGSSTVGFHNK--------AHIARFLNDGTKNIRADHFVDNARDDAKDAVFAAEAEK 125 (139) T ss_pred ccccceecCcc-------ccccccccceeCCCCC--------ceeeeeeccCccccCCCchHHHHHHHHHHHHHHHHHHH Confidence 44444432111 1111123477888421 46788999999999999999999999999999998888 Q ss_pred HHHHHcCC--CHHH Q lcl|NC_020201. 81 HNGIIQGR--GFSS 92 (153) Q Consensus 81 ~~~~~~G~--~~~~ 92 (153) ++.+++.. +-+. T Consensus 126 ~ke~l~~~~~~~~~ 139 (139) T protein:vir:10 126 YQAMIAKANGGDSK 139 (139) T ss_pred HHHHHhhcCCCCCC Confidence 88877553 1112 No 109 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=94.61 E-value=3e-05 Score=45.34 Aligned_cols=96 Identities=16% Similarity=0.178 Sum_probs=46.8 Q ss_pred CCccccccHHHHHHHHHHHHHh-------------------------------hCCEEEEeecCcCCCCCCCHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDL-------------------------------AEKEVEYGFYDEKHYSGLNMATLAAIH 49 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l-------------------------------~~~~v~VGi~~~~~~dG~~va~iA~~~ 49 (153) .+.+++.. +-+-+.+.|+.- ..-.+.|||.... .+.+|-+. T Consensus 25 ~~~katkA--GA~v~~e~L~~~tp~~h~~~~kt~~~~HlaD~I~~s~~~idG~~dG~s~VG~~~~~------~a~~a~f~ 96 (153) T protein:vir:49 25 EQAKITTA--GAKVFKEELAEVTREKHYSKKKDLKYGHMADGLAVQSTNADGRKNGVSTVGWKNNY------HAQNARRL 96 (153) T ss_pred HHHHHHHH--HHHHHHHHHHHhccccCCCCCCCCCCCcccccceeccccccccccceeeecccCCc------cceeeeec Confidence 11222111 111111111111 1124567775332 37899999 Q ss_pred hcCCCCCCCCchhhHHHHHH--HHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHH Q lcl|NC_020201. 50 EEGWNNLPERNFMFSTSMHF--QEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDW 123 (153) Q Consensus 50 E~G~~~IP~RpFlr~~~~~~--~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~T 123 (153) |+||.++|+.||++.+.++. +.++-+.+...++.+++-.. |. ++....|... -+ | T Consensus 97 n~GT~km~~~hFie~tr~e~~~k~~vl~A~~~~~~~il~~~~--------~~--------~~~~~~~~~~-~~--~ 153 (153) T protein:vir:49 97 NDGTKKYRADHFITNVQNDSTVKNKVLLAEKEEYEKLIRRKG--------GV--------YLSASNFKTK-RA--T 153 (153) T ss_pred ccCcccCCCChhhHHHHHHhhHHHHHHHHHHHHHHHHHHhcC--------Ce--------eeeccccccc-cC--C Confidence 99999999999999999876 45666655555555553220 00 0111111110 00 0 No 110 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=94.26 E-value=5.4e-05 Score=43.97 Aligned_cols=83 Identities=14% Similarity=0.149 Sum_probs=49.4 Q ss_pred CCccccccHHHHHHHHHHHHHhh-------------------------------CCEEEEeecCcCCCCCCCHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLA-------------------------------EKEVEYGFYDEKHYSGLNMATLAAIH 49 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~-------------------------------~~~v~VGi~~~~~~dG~~va~iA~~~ 49 (153) .+.+|+.. +-+-+.+.|+.-. .-.+.|||... ..|++|-+. T Consensus 25 ~~~katkA--GAkv~~~~L~~~tp~~h~~~r~t~~~~HlaD~I~~~~~~idg~~dG~s~VG~~k~------~~a~~a~f~ 96 (140) T protein:vir:48 25 EQAKITTA--GAKVFKKELAEVTREKHYSKKKDLKYGHMADGLAVQSTNVDGRKNGVATVGWKNN------YHAQNARRL 96 (140) T ss_pred HHHHHHHH--hHHHHHHHHHHhcccCCCCCCCCCCCCcccccceecccccccccccceeecccCC------CceeEEeec Confidence 23333222 1122222222221 11344666432 237999999 Q ss_pred hcCCCCCCCCchhhHHHHHH--HHHHHHHHHHHHHHHHcCCCHH Q lcl|NC_020201. 50 EEGWNNLPERNFMFSTSMHF--QEGLRKHIKRMHNGIIQGRGFS 91 (153) Q Consensus 50 E~G~~~IP~RpFlr~~~~~~--~~~~~~~~~~~~~~~~~G~~~~ 91 (153) ++||..+|+.||+..+.++. ++++.+.+...++.+++-...+ T Consensus 97 NdGT~k~~~~hFve~t~~e~~~~~~vl~A~~~~y~~~l~kk~~~ 140 (140) T protein:vir:48 97 NDGTKKYRADHFVTNVQNDSAVRDKVLLAEKEEYEKLIRKKGGE 140 (140) T ss_pred ccCccccCCCchHHHHHHhhhhHHHHHHHHHHHHHHHHHhhcCC Confidence 99999999999999999865 6677777777777766332222 No 111 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=93.97 E-value=9e-05 Score=42.77 Aligned_cols=77 Identities=17% Similarity=0.093 Sum_probs=35.5 Q ss_pred CCccccccHHH-HHHHHHHHHHhhCCEEEEeecCcC---------CC---C-C-CCHHHHHHHHhcCCC----------- Q lcl|NC_020201. 1 MAKKSSTDISE-LKRYFSQLSDLAEKEVEYGFYDEK---------HY---S-G-LNMATLAAIHEEGWN----------- 54 (153) Q Consensus 1 M~~~i~~~~~~-l~~~~~~l~~l~~~~v~VGi~~~~---------~~---d-G-~~va~iA~~~E~G~~----------- 54 (153) +...+....+. ...+....+.+.. |.-|-+... .. . | .+.+.+|.++||||. T Consensus 22 ~~~~~~~~i~~~a~~v~~~Ak~~aP--v~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA~~ve~GT~ph~i~pk~~~a 99 (142) T protein:vir:99 22 VGPILRRTHSSLTRQIANETRARVP--VLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYAAAVHEGTRPHVIRAKHAQA 99 (142) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCC--ccchhhhcceeeeeccccccceEEEEeccCccccceeccCCccceeccccCce Confidence 22111111110 1111112222211 222222110 00 0 1 144789999999973 Q ss_pred ------------------CCCCCchhhHHHHHHHHHHHHHHHH Q lcl|NC_020201. 55 ------------------NLPERNFMFSTSMHFQEGLRKHIKR 79 (153) Q Consensus 55 ------------------~IP~RpFlr~~~~~~~~~~~~~~~~ 79 (153) +.||||||+++++.+.++.....-. T Consensus 100 l~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:99 100 LHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred eeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 3669999999999887664333222 No 112 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=93.97 E-value=9e-05 Score=42.77 Aligned_cols=77 Identities=17% Similarity=0.093 Sum_probs=35.5 Q ss_pred CCccccccHHH-HHHHHHHHHHhhCCEEEEeecCcC---------CC---C-C-CCHHHHHHHHhcCCC----------- Q lcl|NC_020201. 1 MAKKSSTDISE-LKRYFSQLSDLAEKEVEYGFYDEK---------HY---S-G-LNMATLAAIHEEGWN----------- 54 (153) Q Consensus 1 M~~~i~~~~~~-l~~~~~~l~~l~~~~v~VGi~~~~---------~~---d-G-~~va~iA~~~E~G~~----------- 54 (153) +...+....+. ...+....+.+.. |.-|-+... .. . | .+.+.+|.++||||. T Consensus 22 ~~~~~~~~i~~~a~~v~~~Ak~~aP--v~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA~~ve~GT~ph~i~pk~~~a 99 (142) T protein:vir:86 22 VGPILRRTHSSLTRQIANETRARVP--VLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYAAAVHEGTRPHVIRAKHAQA 99 (142) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCC--ccchhhhcceeeeeccccccceEEEEeccCccccceeccCCccceeccccCce Confidence 22111111110 1111112222211 222222110 00 0 1 144789999999973 Q ss_pred ------------------CCCCCchhhHHHHHHHHHHHHHHHH Q lcl|NC_020201. 55 ------------------NLPERNFMFSTSMHFQEGLRKHIKR 79 (153) Q Consensus 55 ------------------~IP~RpFlr~~~~~~~~~~~~~~~~ 79 (153) +.||||||+++++.+.++.....-. T Consensus 100 l~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~r 142 (142) T protein:vir:86 100 LHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRDRRIRVR 142 (142) T ss_pred eeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhhhhhccC Confidence 3669999999999887664333222 No 113 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=93.74 E-value=0.0013 Score=36.50 Aligned_cols=86 Identities=10% Similarity=-0.046 Sum_probs=57.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCC----CCCCccHHHHHhcCC--CCcchhHHHH Q lcl|NC_020201. 66 SMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSF----SNPKVSKDWASYKGF--DDAMIHYGDL 139 (153) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~----~~p~ns~~Ti~~KG~--~~PLidTG~L 139 (153) +++ -+++.+.+..++..+. ......+|..||..+....++.|.+..- +++|+++.|++.|.. .++|..++.+ T Consensus 1 m~~-~~~~~~~l~~ll~~L~-~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~~~~~~~~~l~~ 78 (149) T protein:vir:18 1 MSE-LTALQERLAGLIASLS-PAARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGRIKREMFAKLRT 78 (149) T ss_pred Cch-HHHHHHHHHHHHHhcC-CchHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCcccchhhhhhhh Confidence 332 2333333444333332 2234678999999999999999986411 356889999887643 4799999999 Q ss_pred HhhcceeeeeCCCC Q lcl|NC_020201. 140 SSAATYKIVKYQGK 153 (153) Q Consensus 140 ~~Sity~V~~~~gk 153 (153) .+++.+.+...+.. T Consensus 79 ~~~l~~~~~~~~~~ 92 (149) T protein:vir:18 79 SRFMKAKGSDSAAV 92 (149) T ss_pred hhhhheeecCceeE Confidence 99998866655433 No 114 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=93.60 E-value=0.00017 Score=41.23 Aligned_cols=90 Identities=21% Similarity=0.299 Sum_probs=55.0 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCE-------------------------EEEeecCcC------------CC--CC-- Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKE-------------------------VEYGFYDEK------------HY--SG-- 39 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~-------------------------v~VGi~~~~------------~~--dG-- 39 (153) |+.-...|.++|+++.+.|+++.... |.-|-+-.. .. ++ T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~g~~~~ 80 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQGNNYI 80 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccceeecCCeeE Confidence 99988888889999988887664421 222322110 01 11 Q ss_pred ---CCHHHHHHHHhcCCCCCCCCchh------hHHHHHHHHHHHHHHHHHHHHHHcCC-CH Q lcl|NC_020201. 40 ---LNMATLAAIHEEGWNNLPERNFM------FSTSMHFQEGLRKHIKRMHNGIIQGR-GF 90 (153) Q Consensus 40 ---~~va~iA~~~E~G~~~IP~RpFl------r~~~~~~~~~~~~~~~~~~~~~~~G~-~~ 90 (153) .+.+.+|-+-|||+...|+|||. +.++.+.+..+.+.+++.+..++.+- ++ T Consensus 81 v~v~n~~~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~~~~~~ 141 (141) T protein:vir:79 81 IEVVNPTEYASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLKGVFDA 141 (141) T ss_pred EEEecCCcchhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 14478999999999877666665 45556666666666665555544431 22 No 115 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=93.56 E-value=0.00015 Score=41.55 Aligned_cols=83 Identities=17% Similarity=0.186 Sum_probs=43.0 Q ss_pred CCccccccHH----------------HHHHHHHHHHHhhCCEEEEeecCc-----CCC-CC--------CCHHHHHHHHh Q lcl|NC_020201. 1 MAKKSSTDIS----------------ELKRYFSQLSDLAEKEVEYGFYDE-----KHY-SG--------LNMATLAAIHE 50 (153) Q Consensus 1 M~~~i~~~~~----------------~l~~~~~~l~~l~~~~v~VGi~~~-----~~~-dG--------~~va~iA~~~E 50 (153) |+..|...+. .-+.+.+.+++... ++-|=+.. ... .| -+-..++-+.| T Consensus 9 la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~aP--~rTG~y~ksw~vk~~~~~g~~~~vv~~~~~~~l~HLLE 86 (126) T protein:vir:81 9 LADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQALAP--KRTGEYARTFTITKEDGYGTTKRIIWNKKHYRRVHLLE 86 (126) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCC--cccchhhccccccccccCCcceEEEeccCCCCceeeee Confidence 1111111111 11223333333332 22332111 111 11 11135677899 Q ss_pred cCCC-----CCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_020201. 51 EGWN-----NLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGR 88 (153) Q Consensus 51 ~G~~-----~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~ 88 (153) ||+. .+|+||||+|+++...+++.+.++..++ .|. T Consensus 87 fGha~r~gGrV~a~Phi~Pa~e~~~~~~~~~i~~~l~---~gg 126 (126) T protein:vir:81 87 FGHAKVNGGRVKEYPHLRPAYDKHGARLPDELKRVIE---NGG 126 (126) T ss_pred cceecCCCCccCCCcchHHHHHHHHHHHHHHHHHHhh---cCC Confidence 9985 4899999999999988888888777554 444 No 116 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=92.14 E-value=0.0013 Score=36.36 Aligned_cols=79 Identities=11% Similarity=0.170 Sum_probs=42.7 Q ss_pred CCccccccHHHHHHHHHHHH--HhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCC------------------------ Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLS--DLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWN------------------------ 54 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~--~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~------------------------ 54 (153) -+-+++ ....|.++.+.++ ...+....++++.+ .++.||++|.||-. T Consensus 65 ~~~k~k-~~rm~~kL~~~~~~~~~~~~~~~~~~~~g------~~~~IA~vHQ~G~~~rv~~~~~~~~~~~~~~~pATr~Q 137 (231) T protein:vir:37 65 VDGEIK-NKRLLKKVLRYASILAEERGKGRIYYKNP------LTGEIAQKQQDGFTEHFRVFATDKNKNGSGNDRATIRQ 137 (231) T ss_pred cccchh-hHHHHHHhHHhhccccccCCceEEeeecc------hHHHHHHHhhcCcccccchhhhhhccCCCCCCCCCHHH Confidence 333333 2245666655433 22333355555444 36899999999910 Q ss_pred --------------------------------------------------------------CCCCCchhhHHHHHHHHH Q lcl|NC_020201. 55 --------------------------------------------------------------NLPERNFMFSTSMHFQEG 72 (153) Q Consensus 55 --------------------------------------------------------------~IP~RpFlr~~~~~~~~~ 72 (153) .+|+||||-..- ++ T Consensus 138 Ak~Lr~lGy~v~~~k~k~~k~~~rkps~kwI~~~ls~~qAgliIR~L~~k~~~~~~k~~W~I~~paR~FLG~~~----~e 213 (231) T protein:vir:37 138 AQKLRSLGYRKRNGKNRQGKTKYRLYTIKEIRERLTRTWASMEIRRLENKVNAGNGKTNWEIHVPARPFLDTRE----KE 213 (231) T ss_pred HHHHHHhcccccCCCCCCCCCCcCcCCHHHHHHhhhhHHHHHHHHHHhcccccccCcceeeeecCcccccCCCH----HH Confidence 177888876553 34 Q ss_pred HHHHHHHHHHHHHcCCCH Q lcl|NC_020201. 73 LRKHIKRMHNGIIQGRGF 90 (153) Q Consensus 73 ~~~~~~~~~~~~~~G~~~ 90 (153) ..+++...+.++++|.-- T Consensus 214 ~~~~l~~~l~~i~~~~~~ 231 (231) T protein:vir:37 214 NVDILREITLKFLSGEYK 231 (231) T ss_pred HHHHHHHHHHHHhcccCC Confidence 444455555556666311 No 117 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=91.21 E-value=0.0011 Score=36.81 Aligned_cols=81 Identities=20% Similarity=0.217 Sum_probs=46.4 Q ss_pred CCccccccHHHH-HHHHHHHHHh-----------------------hCC-EEEEeecCc-----CCCCC-------CCHH Q lcl|NC_020201. 1 MAKKSSTDISEL-KRYFSQLSDL-----------------------AEK-EVEYGFYDE-----KHYSG-------LNMA 43 (153) Q Consensus 1 M~~~i~~~~~~l-~~~~~~l~~l-----------------------~~~-~v~VGi~~~-----~~~dG-------~~va 43 (153) |+.+|+.|. | +.|.+.|++. ... -+.-|-+.. ...+| .+-- T Consensus 1 m~~~v~id~--L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~~~~~~~~v~~~~~~y 78 (123) T protein:vir:96 1 MANKISIDD--LAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQKLKNGDQVIYQKAPTY 78 (123) T ss_pred CCcccchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccccceeeeecCCeeEEEEEecCCc Confidence 998887552 2 1112212111 100 012221111 01112 1113 Q ss_pred HHHHHHhcCC-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 44 TLAAIHEEGW-----NNLPERNFMFSTSMHFQEGLRKHIKRMHNG 83 (153) Q Consensus 44 ~iA~~~E~G~-----~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 83 (153) .|+-+.|||. ..+|+||||+|+.+...+.+.+.++..++. T Consensus 79 ~l~HLLE~GHa~r~GGrV~a~phI~paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 79 RLTHLLENGHAKRNGGRVSPKVHIAPVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred ceEEeeecceeecCCceeCcchhhhHHHHHHHHHHHHHHHHHhcC Confidence 5777889995 469999999999999999998888887775 No 118 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=90.39 E-value=0.0022 Score=35.21 Aligned_cols=88 Identities=20% Similarity=0.259 Sum_probs=60.0 Q ss_pred CCccccccHHHHHHHHHHHHHhhCC--------------------------EEEEeecC-----cC-CC--CC-----CC Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEK--------------------------EVEYGFYD-----EK-HY--SG-----LN 41 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~--------------------------~v~VGi~~-----~~-~~--dG-----~~ 41 (153) |+.. ..|.++|+++.+.|++.... .|.-|.+. +. .+ +| .+ T Consensus 1 Ms~~-~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVdTG~Lr~S~~~~~~~~~~~~~~~~V~n 79 (144) T protein:vir:10 1 MSLG-HVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVKQGNLRRSWTAEGPTYGCGGWTIKLIN 79 (144) T ss_pred CCCC-CccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCCcchhccceeecceeeecCeeEEEEec Confidence 8754 34667888888877765321 12233221 11 11 22 25 Q ss_pred HHHHHHHHhcCCC-----------------CCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCCC Q lcl|NC_020201. 42 MATLAAIHEEGWN-----------------NLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGRG 89 (153) Q Consensus 42 va~iA~~~E~G~~-----------------~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~~ 89 (153) .+.+|-+-|||+. -+|-++||+.++...+..+.+.+++.+..+.+=.. T Consensus 80 ~~~YA~~VE~Ghr~~~G~~v~~~~~~~~~g~V~G~~~~~~a~~~~~~~~~~~l~k~l~~l~d~~~ 144 (144) T protein:vir:10 80 NAEYASYVESGHRQTPGRYVPVLKKRLVRDWVPGQFYMKKSIPQIQRQLPQLVTEGLWGLKDLFE 144 (144) T ss_pred CCCcccccccceeecCCcccccCCCccccceecCccchHHHHHHHHHHHHHHHHHHHHHHhhhcC Confidence 5789999999973 37889999999999999999999998887765433 No 119 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=90.20 E-value=0.00024 Score=40.48 Aligned_cols=76 Identities=18% Similarity=0.171 Sum_probs=36.9 Q ss_pred CCccccccHHHHHHHH------------HHHHHhhC--CEEEEeecCcC-----CCCC--------CCHHHHHHHHhcCC Q lcl|NC_020201. 1 MAKKSSTDISELKRYF------------SQLSDLAE--KEVEYGFYDEK-----HYSG--------LNMATLAAIHEEGW 53 (153) Q Consensus 1 M~~~i~~~~~~l~~~~------------~~l~~l~~--~~v~VGi~~~~-----~~dG--------~~va~iA~~~E~G~ 53 (153) =.+++..|...+++.. ..++...+ -.|.-|-+... ..+| .+.+.+|.++|||| T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~Ve~GT 85 (140) T protein:vir:10 6 ARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAPVHEGS 85 (140) T ss_pred eeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhhhccCC Confidence 1233333333332221 11222111 12344432221 0111 24589999999997 Q ss_pred C-----------------------------CCCCCchhhHHHHHH---HHHHHHH Q lcl|NC_020201. 54 N-----------------------------NLPERNFMFSTSMHF---QEGLRKH 76 (153) Q Consensus 54 ~-----------------------------~IP~RpFlr~~~~~~---~~~~~~~ 76 (153) . +.+|||||++++++. ...+... T Consensus 86 ~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:10 86 RPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred CCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 2 366999999999874 3334332 No 120 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=90.20 E-value=0.00024 Score=40.48 Aligned_cols=76 Identities=18% Similarity=0.171 Sum_probs=36.9 Q ss_pred CCccccccHHHHHHHH------------HHHHHhhC--CEEEEeecCcC-----CCCC--------CCHHHHHHHHhcCC Q lcl|NC_020201. 1 MAKKSSTDISELKRYF------------SQLSDLAE--KEVEYGFYDEK-----HYSG--------LNMATLAAIHEEGW 53 (153) Q Consensus 1 M~~~i~~~~~~l~~~~------------~~l~~l~~--~~v~VGi~~~~-----~~dG--------~~va~iA~~~E~G~ 53 (153) =.+++..|...+++.. ..++...+ -.|.-|-+... ..+| .+.+.+|.++|||| T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~Ve~GT 85 (140) T protein:vir:97 6 ARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAPVHEGS 85 (140) T ss_pred eeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhhhccCC Confidence 1233333333332221 11222111 12344432221 0111 24589999999997 Q ss_pred C-----------------------------CCCCCchhhHHHHHH---HHHHHHH Q lcl|NC_020201. 54 N-----------------------------NLPERNFMFSTSMHF---QEGLRKH 76 (153) Q Consensus 54 ~-----------------------------~IP~RpFlr~~~~~~---~~~~~~~ 76 (153) . +.+|||||++++++. ...+... T Consensus 86 ~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:97 86 RPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred CCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 2 366999999999874 3334332 No 121 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=89.97 E-value=0.0018 Score=35.65 Aligned_cols=71 Identities=10% Similarity=0.083 Sum_probs=30.1 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 62 MFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 62 lr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) |-.---...+++.+.++.+-..+.+ .++.++..+...++..|+..... .-| +|||.|++ T Consensus 1 m~~v~i~Gld~L~~kl~~~~~~~~~--~v~~a~~~~~~~~a~~v~~~ak~------------------~~P-vdtG~Lr~ 59 (182) T protein:vir:10 1 MIEVELKGVNELRAKLKKLPDIMAK--ATANAQENAIEQAEAYAVDELQS------------------SIK-YSTGELTR 59 (182) T ss_pred CeEEEEecHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHh------------------hCC-CCchhhhh Confidence 1110001112222222221111110 11223333333333333332221 134 69999999 Q ss_pred hcceeeeeCCCC Q lcl|NC_020201. 142 AATYKIVKYQGK 153 (153) Q Consensus 142 Sity~V~~~~gk 153 (153) ||+++|..++|. T Consensus 60 SI~~~~~~~~~~ 71 (182) T protein:vir:10 60 SFKHEVKVDGDE 71 (182) T ss_pred ceeeeeeecCCe Confidence 999999988876 No 122 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=89.77 E-value=0.0048 Score=33.32 Aligned_cols=87 Identities=8% Similarity=0.004 Sum_probs=58.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCC----CCCCccHHHHHhc-----C--CCCcch Q lcl|NC_020201. 66 SMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSF----SNPKVSKDWASYK-----G--FDDAMI 134 (153) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~----~~p~ns~~Ti~~K-----G--~~~PLi 134 (153) +.++-+++.+.+..++..+. ..+....|..||..+....++.|.+..- +++|.++.|..++ | ...+|. T Consensus 1 m~~~~~~l~~~l~~ll~~l~-~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~~~~~~m~ 79 (155) T protein:vir:79 1 MTDDLQALERWAGGLLAKLS-PAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGRVKREAMF 79 (155) T ss_pred CchHHHHHHHHHHHHHHhcC-ChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCcccchhhh Confidence 44556666666666655443 2234568999999999999999986421 2345677765432 3 346789 Q ss_pred hHHHHHhhcceeeeeCCCC Q lcl|NC_020201. 135 HYGDLSSAATYKIVKYQGK 153 (153) Q Consensus 135 dTG~L~~Sity~V~~~~gk 153 (153) +++.+-.+|+|+.-.+.-- T Consensus 80 ~~l~~a~~l~~~~~~d~a~ 98 (155) T protein:vir:79 80 RKLRTARYLRIDVDSTGLA 98 (155) T ss_pred hhhhhhheeeeeecCcEEE Confidence 9999999999877544332 No 123 >protein:vir:3750 Length: 227 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043491;genbank:gi:9628626;genbank:GeneID:1261131 Probab=88.25 E-value=0.005 Score=33.20 Aligned_cols=79 Identities=13% Similarity=0.135 Sum_probs=33.6 Q ss_pred CCccccccHHHHHHHHHHHHH-hhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCC------------------------- Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSD-LAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWN------------------------- 54 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~-l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~------------------------- 54 (153) +.-.-....+.|.+|-+-|.- .......|+|..+ -++.||++|+||-. T Consensus 59 ~~pRKr~k~KM~~kL~k~l~~~~~~~~a~v~f~~g------~~~~IA~vHq~G~~~~v~~~~~~~~~~~~~~~~paTr~Q 132 (227) T protein:vir:37 59 WKKRKNGTAKMLRRIAKLANSKAEKAQGTLFYKQK------RTGEIAQEHQEGIPHLFKKTEFTGKNKGGIGADPCTLRQ 132 (227) T ss_pred CchhcchhHHHHhhhHHHcceeecccceEEEecCc------chHHHHHHhhcCcccccchhhhhhhhcCCccccCCCHHH Confidence 222222222333333222111 2334566777544 25799999999921 Q ss_pred ----------------------------------------------------------------CCCCCchhhHHHHHHH Q lcl|NC_020201. 55 ----------------------------------------------------------------NLPERNFMFSTSMHFQ 70 (153) Q Consensus 55 ----------------------------------------------------------------~IP~RpFlr~~~~~~~ 70 (153) .+|+||||-.+-+++. T Consensus 133 Ak~Lr~lGy~v~~~k~k~~k~~~rkps~kwI~~nls~~qAgliIR~L~~k~~~~~~~~k~~W~I~~PaR~FLG~~~~e~~ 212 (227) T protein:vir:37 133 AKKLKDLGYTVANGKTKNGKAKRRKPTLSEIRSTLSRAKASLIIRKLEEKNGMNPSRHLTQWIIPTEKRSFLDTREEENA 212 (227) T ss_pred HHHHHHhcccccCCCCCCcCCccccCCHHHHHHhhhHHHHHHHHHHHhcccccccccCccceeeecCcccccCCCHHHHH Confidence 1455555544433333 Q ss_pred HHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 71 EGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTI 109 (153) Q Consensus 71 ~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I 109 (153) ..+...+.+.- |+.= T Consensus 213 ~~l~r~l~~~~------------------------~~~~ 227 (227) T protein:vir:37 213 KIILAEIQKYT------------------------QKQQ 227 (227) T ss_pred HHHHHHHHHHh------------------------hhcC Confidence 33332222211 1110 No 124 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=88.18 E-value=0.00026 Score=40.22 Aligned_cols=77 Identities=12% Similarity=0.063 Sum_probs=37.6 Q ss_pred CCccccccH--H----HHHHHHH--------HHHHhhCC--EEEEeecCcC-----------CC-CC--CCHHHHHHHHh Q lcl|NC_020201. 1 MAKKSSTDI--S----ELKRYFS--------QLSDLAEK--EVEYGFYDEK-----------HY-SG--LNMATLAAIHE 50 (153) Q Consensus 1 M~~~i~~~~--~----~l~~~~~--------~l~~l~~~--~v~VGi~~~~-----------~~-dG--~~va~iA~~~E 50 (153) |.+...... . ++..+++ .++...+. .|.-|-+... .. ++ .+.+.+|.++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~V~~~~~YA~~ve 80 (137) T protein:vir:10 1 MTVTARYERNPVGEARQFQVIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIVVAGPLRLDSGVTAHADYARYVH 80 (137) T ss_pred CeeEEEeccCchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeeeccccceEEEEecCCCccceeee Confidence 444333221 1 1111221 12222111 1222321110 00 11 13478999999 Q ss_pred cCCC------------------------------CCCCCchhhHHHHHHHHHHHHHH Q lcl|NC_020201. 51 EGWN------------------------------NLPERNFMFSTSMHFQEGLRKHI 77 (153) Q Consensus 51 ~G~~------------------------------~IP~RpFlr~~~~~~~~~~~~~~ 77 (153) |||. ++|+||||+++++++..+-.... T Consensus 81 ~GT~ph~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~~~~~~~~~ 137 (137) T protein:vir:10 81 DGTRAHVIRPRRPGGVLRFTVGGRVVYARRVNHPGTRARPFLRNAAERVVARETATS 137 (137) T ss_pred cCCCCceeeccccceeeeEeeCCeeEecceeecCCCCCCchHHHHHHHhhhhhcccC Confidence 9962 37799999999999887665443 No 125 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=88.13 E-value=0.011 Score=31.23 Aligned_cols=87 Identities=13% Similarity=0.045 Sum_probs=48.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCC----CCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 66 SMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSF----SNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~----~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) +++.-.++.+.+..++..+. ..+....|..||..+....++.|.+..- +++|.++.+..+|+..+-......|+. T Consensus 1 M~~~~~~~~~~L~~ll~~L~-~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~k~~~~~~~m~~~L~~ 79 (152) T protein:vir:10 1 MSEPIEQVKTAFDSLLNNIS-KPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGVKSKIKSGKMFDKITQ 79 (152) T ss_pred CchHHHHHHHHHHHHHHhcC-cchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhhcccccchhHHHhhhh Confidence 44444444455554444432 1244678999999999999999987421 234556666555554443333444444 Q ss_pred h--cceeeeeCCCC Q lcl|NC_020201. 142 A--ATYKIVKYQGK 153 (153) Q Consensus 142 S--ity~V~~~~gk 153 (153) | ++|+...+.-. T Consensus 80 a~~l~~~a~~~~~~ 93 (152) T protein:vir:10 80 PRFMRLRLESEGVS 93 (152) T ss_pred cceeeeeecCcEEE Confidence 4 45543333222 No 126 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=87.92 E-value=0.0015 Score=36.08 Aligned_cols=66 Identities=14% Similarity=0.030 Sum_probs=30.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 62 MFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 62 lr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) |-..+ ...+++.+.++.+-..+. ..++++|+..+..+++.++.. .| +|||.|++ T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~~--~~~~~~~~~~a~~i~~~ak~~----------------------aP-vdTG~Lr~ 54 (137) T protein:vir:94 1 MAKVK-YGNWDLVKELENYERDME--RWVKRGIAKTTAKIHNTIISL----------------------MP-VDTGYLRE 54 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHh----------------------CC-ccccchhc Confidence 33322 123333333333322221 123344444444443333321 22 59999999 Q ss_pred hcceeeeeCCCC Q lcl|NC_020201. 142 AATYKIVKYQGK 153 (153) Q Consensus 142 Sity~V~~~~gk 153 (153) ||++++...+-. T Consensus 55 SI~~~~~~~~~~ 66 (137) T protein:vir:94 55 SVTMDFKDSGFT 66 (137) T ss_pred cceeEeecCceE Confidence 999988554323 No 127 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=87.92 E-value=0.0015 Score=36.08 Aligned_cols=66 Identities=14% Similarity=0.030 Sum_probs=30.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 62 MFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 62 lr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) |-..+ ...+++.+.++.+-..+. ..++++|+..+..+++.++.. .| +|||.|++ T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~~--~~~~~~~~~~a~~i~~~ak~~----------------------aP-vdTG~Lr~ 54 (137) T protein:vir:93 1 MAKVK-YGNWDLVKELENYERDME--RWVKRGIAKTTAKIHNTIISL----------------------MP-VDTGYLRE 54 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHh----------------------CC-ccccchhc Confidence 33322 123333333333322221 123344444444443333321 22 59999999 Q ss_pred hcceeeeeCCCC Q lcl|NC_020201. 142 AATYKIVKYQGK 153 (153) Q Consensus 142 Sity~V~~~~gk 153 (153) ||++++...+-. T Consensus 55 SI~~~~~~~~~~ 66 (137) T protein:vir:93 55 SVTMDFKDSGFT 66 (137) T ss_pred cceeEeecCceE Confidence 999988554323 No 128 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=87.92 E-value=0.0015 Score=36.08 Aligned_cols=66 Identities=14% Similarity=0.030 Sum_probs=30.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 62 MFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 62 lr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) |-..+ ...+++.+.++.+-..+. ..++++|+..+..+++.++.. .| +|||.|++ T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~~--~~~~~~~~~~a~~i~~~ak~~----------------------aP-vdTG~Lr~ 54 (137) T protein:vir:97 1 MAKVK-YGNWDLVKELENYERDME--RWVKRGIAKTTAKIHNTIISL----------------------MP-VDTGYLRE 54 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHh----------------------CC-ccccchhc Confidence 33322 123333333333322221 123344444444443333321 22 59999999 Q ss_pred hcceeeeeCCCC Q lcl|NC_020201. 142 AATYKIVKYQGK 153 (153) Q Consensus 142 Sity~V~~~~gk 153 (153) ||++++...+-. T Consensus 55 SI~~~~~~~~~~ 66 (137) T protein:vir:97 55 SVTMDFKDSGFT 66 (137) T ss_pred cceeEeecCceE Confidence 999988554323 No 129 >protein:vir:78755 Length: 228 # NCBI annotation: putative tail completion protein # Family: family:all:743 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285651;genbank:gi:148727157;genbank:GeneID:5220102 Probab=87.53 E-value=0.0079 Score=32.11 Aligned_cols=92 Identities=18% Similarity=0.174 Sum_probs=44.4 Q ss_pred CCccccccHHHHHHHHHHHHH--hhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCC------------------------ Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSD--LAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWN------------------------ 54 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~--l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~------------------------ 54 (153) ..-.-....+.|.+|.+-|.- .....+.|||..+... ..++.||++|.||-. T Consensus 55 ~~pRKr~krKMl~~L~k~Lk~~~~~~~~a~v~f~~~~~~--~~~~rIA~vHq~G~~~~v~~~~~~~~~~~r~~~~~paTr 132 (228) T protein:vir:78 55 WAPRKRGKRKMLRGLPKLLQIREPRQDMAELGFTKGTMS--AHAGVIANTHQKGHTYKVTAASRRRIAPSDVGKNKQASK 132 (228) T ss_pred ChhhhhhHHHHHhhhHHhhhhhcccccceEEEeecCccc--chHHHHHHHHhcCcccccccchhhhhhcccCCCCCCCCH Confidence 222222223344444444432 3345789998654321 247899999999921 Q ss_pred --------------------------------------------------------CCCCCchhhHHHHHHHHHHHHHHH Q lcl|NC_020201. 55 --------------------------------------------------------NLPERNFMFSTSMHFQEGLRKHIK 78 (153) Q Consensus 55 --------------------------------------------------------~IP~RpFlr~~~~~~~~~~~~~~~ 78 (153) .+|+||||-.+-+ ++.+.+. T Consensus 133 ~QAk~Lr~lGy~~~~~~~k~~rkps~kwI~~nls~gqAgliir~L~~k~~k~~W~I~~PaR~FLG~s~~----e~~~~l~ 208 (228) T protein:vir:78 133 AQARKLRELGFKRPGKRKRAYRSASLGWITANLNYAQAGLLIKKLKDEPVKESWEIQLPARPFLGANAR----QRQQAFA 208 (228) T ss_pred HHHHHHHHhhccccCCcCCCcccCCHHHHHHHhhHHHHHHHHHHHhCCCCccceeeecCcccccCCCHH----HHHHHHH Confidence 1677777744322 3333344 Q ss_pred HHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhc Q lcl|NC_020201. 79 RMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYK 127 (153) Q Consensus 79 ~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~K 127 (153) ..++.+--|.++ -+.-|+.| T Consensus 209 ~~l~~i~~g~~~-----------------------------~~qd~~~~ 228 (228) T protein:vir:78 209 LRPESIDYGWDV-----------------------------NKQDMKGK 228 (228) T ss_pred HHHHhcccCCCc-----------------------------chhhccCC Confidence 444444444321 11122222 No 130 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=87.32 E-value=0.0011 Score=36.79 Aligned_cols=83 Identities=11% Similarity=0.071 Sum_probs=49.5 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeec---CcC---------CCCCC--------CHHHHHHHHhcCCC------ Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFY---DEK---------HYSGL--------NMATLAAIHEEGWN------ 54 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~---~~~---------~~dG~--------~va~iA~~~E~G~~------ 54 (153) |.---+.-.+...++..+.+...+..|.+=+. .+. ..+|+ ..+++|-+.|||++ T Consensus 16 ~~dvk~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g~~~~vgp~g~t~dYapyvEyGTR~m~~~~ 95 (127) T protein:vir:98 16 EKRWDRVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNSSKDVITGNFGYIKDYAPHVEYGHRIVRNGK 95 (127) T ss_pred HHHHHHHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEecCCceEEeccCcccccccceeecceeeeeccc Confidence 22111111223345566666554443311111 110 01233 24789999999996 Q ss_pred ---CCCCCchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 55 ---NLPERNFMFSTSMHFQEGLRKHIKRMHNG 83 (153) Q Consensus 55 ---~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 83 (153) -.|+-|||.|+|+..+..+.+-++.+++. T Consensus 96 ~~gf~~aqp~l~paf~~Qk~iF~~DL~~l~k~ 127 (127) T protein:vir:98 96 QVGYANGTKYLFNNVKKQREIYRQDMLNELRR 127 (127) T ss_pred ccccccCccccccchHHHhHHHHHHHHHHhcC Confidence 38899999999999999988888877765 No 131 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=87.32 E-value=0.0017 Score=35.70 Aligned_cols=66 Identities=9% Similarity=-0.007 Sum_probs=30.6 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 62 MFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 62 lr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) |-.-. ...+++.+.++++-..+. ..++++|...+..+++.+|.. .| +|||.|++ T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~--~~~~~~l~~~a~~~~~~ak~~----------------------~p-vdTG~L~~ 54 (137) T protein:vir:96 1 MAKVK-YGNWDLVAELEDYRDEME--EWVKKGILKTTLAIYNTAVAL----------------------AP-VDLGFLKE 54 (137) T ss_pred CchhH-hhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHh----------------------CC-cCccchhc Confidence 22111 122223222332222211 123344444444444443322 12 58999999 Q ss_pred hcceeeeeCCCC Q lcl|NC_020201. 142 AATYKIVKYQGK 153 (153) Q Consensus 142 Sity~V~~~~gk 153 (153) ||+++|..++.. T Consensus 55 Si~~~~~~~g~~ 66 (137) T protein:vir:96 55 SIDFKVTDGGFS 66 (137) T ss_pred CceeEeecCceE Confidence 999988765544 No 132 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=86.56 E-value=0.0021 Score=35.27 Aligned_cols=66 Identities=14% Similarity=0.019 Sum_probs=29.5 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 62 MFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 62 lr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) |-..+ ...+++.+.++++-..+. ..++++|+..+..+++.++.. .| +|||.|++ T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~--~~~~~~~~~~a~~v~~~ak~~----------------------aP-v~TG~L~~ 54 (137) T protein:vir:95 1 MAKVK-YGNWDLVKELENYERDME--RWVKRGIAKTTAKIHNTIISL----------------------MP-VDTGYLRE 54 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHh----------------------CC-ccchhhhc Confidence 33222 122333333333222111 123344444444333333321 12 48999999 Q ss_pred hcceeeeeCCCC Q lcl|NC_020201. 142 AATYKIVKYQGK 153 (153) Q Consensus 142 Sity~V~~~~gk 153 (153) ||++++..++.. T Consensus 55 Si~~~~~~~~~~ 66 (137) T protein:vir:95 55 SVTMDFKDGGFT 66 (137) T ss_pred CeeeEeeCCceE Confidence 999988544322 No 133 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=86.07 E-value=0.0023 Score=35.07 Aligned_cols=79 Identities=10% Similarity=0.159 Sum_probs=55.2 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCC-----chhhHHHHHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPER-----NFMFSTSMHFQEGLRK 75 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~R-----pFlr~~~~~~~~~~~~ 75 (153) |+=.|+.... ..+.....-.+.|||.... -+.||.+.+.||...|+. +|+..+..+.+.++.+ T Consensus 76 laD~I~~~~~------~~iDg~~dG~s~VGw~~~~------~a~~a~f~NdGT~~m~~k~~~gdHFvekt~~~~k~~Vl~ 143 (159) T protein:vir:38 76 LQDSITYKPG------YTADKLHTGDTDVGFEGKY------YDFLAKIVNNGQHHMSPKRYKNMHFLDKAQQEAKKSVAE 143 (159) T ss_pred cccceeeecC------ccccccccceeeecccCCc------cceEeeecccCccccCCCCccCChhHHHHHHHHHHHHHH Confidence 4444432210 0111122336889996432 369999999999999987 7999999999999988 Q ss_pred HHHHHHHHHHcCCCHH Q lcl|NC_020201. 76 HIKRMHNGIIQGRGFS 91 (153) Q Consensus 76 ~~~~~~~~~~~G~~~~ 91 (153) .+...++.+++-.+-. T Consensus 144 A~~~~~~~il~~~~~~ 159 (159) T protein:vir:38 144 AELKAYKEVMNHDSDK 159 (159) T ss_pred HHHHHHHHHhhcccCC Confidence 8888888888776322 No 134 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=85.90 E-value=0.0034 Score=34.10 Aligned_cols=61 Identities=10% Similarity=0.029 Sum_probs=27.3 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHhh Q lcl|NC_020201. 63 FSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSSA 142 (153) Q Consensus 63 r~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~S 142 (153) -.++++-...+.++.+.+-. .++.+|...+..++..+|. ..| +|||.|++| T Consensus 1 i~Gld~l~~~l~~~~~~~~~------~v~~al~~~a~~i~~~ak~----------------------~aP-v~TG~Lr~s 51 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRI------AVDKELSKSAARIERQAKI----------------------LAP-VDTGWLRAQ 51 (108) T ss_pred CchHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHh----------------------cCC-cCchhhhcc Confidence 33444444333332221111 1234444444333333222 122 699999999 Q ss_pred cceeeeeCCCC Q lcl|NC_020201. 143 ATYKIVKYQGK 153 (153) Q Consensus 143 ity~V~~~~gk 153 (153) |++.+. .+++ T Consensus 52 I~~~~~-~~~~ 61 (108) T protein:vir:99 52 IYSEQQ-RLLH 61 (108) T ss_pred eeeeec-CcEE Confidence 987653 2223 No 135 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=85.83 E-value=0.0034 Score=34.12 Aligned_cols=75 Identities=15% Similarity=0.124 Sum_probs=44.1 Q ss_pred CCccccccHHHHHHHHHHHHHh---------hC------------------------------------------CEEEE Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDL---------AE------------------------------------------KEVEY 29 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l---------~~------------------------------------------~~v~V 29 (153) ||+.+++.. ++++.|++. .. ++|+| T Consensus 1 MsvevkGv~----eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~~G~r~V~v 76 (134) T protein:vir:10 1 MSVKVTGDK----ALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWIKGKRTVTI 76 (134) T ss_pred CeEEeecHH----HHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeecCCceEEEE Confidence 998888653 333333322 11 22333 Q ss_pred eecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhH--------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 30 GFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFS--------TSMHFQEGLRKHIKRMHNGI 84 (153) Q Consensus 30 Gi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~--------~~~~~~~~~~~~~~~~~~~~ 84 (153) ||-... +=--|--+||||.-.-...+|++| +++..+..+.+.++.-++.+ T Consensus 77 gW~G~~-----~R~~ivHLnE~Gyt~~r~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 77 RWRGPF-----ERFRIVHLIENGHVEKKSGKFVKPKAMGGINRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred EEEcCC-----ceeeEEEeeecceeecCCCCeeccchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 432111 001334468999876677888888 77777888877777777666 No 136 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=85.52 E-value=0.0035 Score=34.02 Aligned_cols=45 Identities=16% Similarity=0.102 Sum_probs=24.8 Q ss_pred HHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHhhcceeeeeCCCC Q lcl|NC_020201. 77 IKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSSAATYKIVKYQGK 153 (153) Q Consensus 77 ~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~Sity~V~~~~gk 153 (153) +++ -++++|+..+..+++.+|.. .| +|||.|++||++++...+-. T Consensus 1 v~~---------~v~~~~~~~~~~i~~~ak~~-------aP----------------v~TG~Lr~SI~~~~~~~~~~ 45 (116) T protein:vir:97 1 MER---------WVKRGIAKTTAKIHNTIISL-------MP----------------VDTGYLRESVTMDFKDGGFT 45 (116) T ss_pred ChH---------HHHHHHHHHHHHHHHHHHHh-------CC----------------cCcccccccceEEeecCcEE Confidence 111 12344444444444444331 12 58999999999988654423 No 137 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=85.52 E-value=0.0035 Score=34.02 Aligned_cols=45 Identities=16% Similarity=0.102 Sum_probs=24.8 Q ss_pred HHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHhhcceeeeeCCCC Q lcl|NC_020201. 77 IKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSSAATYKIVKYQGK 153 (153) Q Consensus 77 ~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~Sity~V~~~~gk 153 (153) +++ -++++|+..+..+++.+|.. .| +|||.|++||++++...+-. T Consensus 1 v~~---------~v~~~~~~~~~~i~~~ak~~-------aP----------------v~TG~Lr~SI~~~~~~~~~~ 45 (116) T protein:vir:12 1 MER---------WVKRGIAKTTAKIHNTIISL-------MP----------------VDTGYLRESVTMDFKDGGFT 45 (116) T ss_pred ChH---------HHHHHHHHHHHHHHHHHHHh-------CC----------------cCcccccccceEEeecCcEE Confidence 111 12344444444444444331 12 58999999999988654423 No 138 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=85.39 E-value=0.014 Score=30.80 Aligned_cols=64 Identities=13% Similarity=0.188 Sum_probs=32.0 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 62 MFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 62 lr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) |. -+.-+++.+.++++-.. ++.++...-..+...|+..+.. . + | +|||.|++ T Consensus 1 i~---i~Gld~L~~~L~~l~~~------~~~~~~~a~~~~a~~i~~~ak~--~-----a-----------P-v~TG~Lr~ 52 (173) T protein:vir:10 1 MA---VKGVAEVIAELRKIGKD------IDKNINATTEEAANFIEDRAKT--L-----A-----------P-KNFGKLAQ 52 (173) T ss_pred Cc---chhHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHH--h-----C-----------C-cCchhhhh Confidence 11 12233343333333221 2233333333334444444432 1 1 1 78999999 Q ss_pred hcceeeeeCCCC Q lcl|NC_020201. 142 AATYKIVKYQGK 153 (153) Q Consensus 142 Sity~V~~~~gk 153 (153) ||.+.+..++|. T Consensus 53 sI~~~~~~~~~~ 64 (173) T protein:vir:10 53 SISTSDLKAKDL 64 (173) T ss_pred cceeeeeccCce Confidence 999998887776 No 139 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=85.14 E-value=0.0027 Score=34.67 Aligned_cols=66 Identities=15% Similarity=0.013 Sum_probs=30.0 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 62 MFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 62 lr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) |-.- ....+++.+.++++...+. ..++++|+..+..+++.+|.. .| +|||.|++ T Consensus 1 Ma~~-~~G~~~l~~~L~~~~~~~~--~~~~~al~~~a~~v~~~ak~~----------------------aP-vdTG~Lr~ 54 (137) T protein:vir:94 1 MAKV-KYGNWDLVKELENYERDIE--RWVKRGIAKTTVKIHNTIISL----------------------MP-VDTGYLRE 54 (137) T ss_pred Cchh-HHhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHh----------------------CC-cCcchhhc Confidence 2211 0122333333333322221 123344444444444433322 22 48999999 Q ss_pred hcceeeeeCCCC Q lcl|NC_020201. 142 AATYKIVKYQGK 153 (153) Q Consensus 142 Sity~V~~~~gk 153 (153) ||++++...+.. T Consensus 55 SI~~~~~~~~~~ 66 (137) T protein:vir:94 55 SVTMDFKDGGFT 66 (137) T ss_pred CceeEeecCcEE Confidence 999987544322 No 140 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=84.84 E-value=0.0077 Score=32.18 Aligned_cols=61 Identities=7% Similarity=0.003 Sum_probs=30.7 Q ss_pred CCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcch Q lcl|NC_020201. 55 NLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMI 134 (153) Q Consensus 55 ~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLi 134 (153) -|-+|+-| +...+.+++ |..++++|+.++...+...|. +.| + T Consensus 1 ~~~~~~~l------~~~~l~~~~---------~~~~~~~~~~~a~~ve~~ak~----------------------~aP-v 42 (137) T protein:vir:10 1 MVAHTLRI------ERAQLHGLG---------MDEARKAVNRVVRRTFTRSQI----------------------LAP-V 42 (137) T ss_pred Cccccccc------ChhhHhhHH---------HHHHHHHHHHHHHHHHHHHHh----------------------cCC-c Confidence 11111111 111111111 123445555555555554432 112 7 Q ss_pred hHHHHHhhcceeeeeCCCC Q lcl|NC_020201. 135 HYGDLSSAATYKIVKYQGK 153 (153) Q Consensus 135 dTG~L~~Sity~V~~~~gk 153 (153) |||.|++||++.+.+.+|. T Consensus 43 ~TG~Lr~SI~~~~~~~~g~ 61 (137) T protein:vir:10 43 DTGYLRASGRLVLGRERGA 61 (137) T ss_pred Cchhhhccceeeeeecccc Confidence 9999999999999877776 No 141 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=84.53 E-value=0.0021 Score=35.21 Aligned_cols=68 Identities=13% Similarity=0.125 Sum_probs=27.7 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHH Q lcl|NC_020201. 61 FMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLS 140 (153) Q Consensus 61 Flr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~ 140 (153) .=-.++++-... ++++-..+. +.+..++..-|..++..++..-. ..++.| +|||.|+ T Consensus 1 i~~~Gld~l~~~----l~~~~~~~~--~~v~~a~~~~~~~i~~~a~~~a~----------------~~~~~p-~~TG~Lr 57 (115) T protein:vir:96 1 MNIDGLDALLNQ----FHDMKTNID--DDVDDILQENAKEYVVRAKLKAR----------------EVMNKG-YWTGNLS 57 (115) T ss_pred CcchhHHHHHHH----HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhcc----------------ccCCCC-CCchhhh Confidence 001122222222 221111110 11334444444444444433211 122333 7999999 Q ss_pred hhcceeeeeCCCC Q lcl|NC_020201. 141 SAATYKIVKYQGK 153 (153) Q Consensus 141 ~Sity~V~~~~gk 153 (153) +||++.. .+|- T Consensus 58 ~sI~~~~--~g~~ 68 (115) T protein:vir:96 58 RNIRYKK--TGDL 68 (115) T ss_pred hcceeee--cCce Confidence 9999864 2333 No 142 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=84.53 E-value=0.0021 Score=35.21 Aligned_cols=68 Identities=13% Similarity=0.125 Sum_probs=27.7 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHH Q lcl|NC_020201. 61 FMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLS 140 (153) Q Consensus 61 Flr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~ 140 (153) .=-.++++-... ++++-..+. +.+..++..-|..++..++..-. ..++.| +|||.|+ T Consensus 1 i~~~Gld~l~~~----l~~~~~~~~--~~v~~a~~~~~~~i~~~a~~~a~----------------~~~~~p-~~TG~Lr 57 (115) T protein:vir:78 1 MNIDGLDALLNQ----FHDMKTNID--DDVDDILQENAKEYVVRAKLKAR----------------EVMNKG-YWTGNLS 57 (115) T ss_pred CcchhHHHHHHH----HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhcc----------------ccCCCC-CCchhhh Confidence 001122222222 221111110 11334444444444444433211 122333 7999999 Q ss_pred hhcceeeeeCCCC Q lcl|NC_020201. 141 SAATYKIVKYQGK 153 (153) Q Consensus 141 ~Sity~V~~~~gk 153 (153) +||++.. .+|- T Consensus 58 ~sI~~~~--~g~~ 68 (115) T protein:vir:78 58 RNIRYKK--TGDL 68 (115) T ss_pred hcceeee--cCce Confidence 9999864 2333 No 143 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=84.53 E-value=0.0021 Score=35.21 Aligned_cols=68 Identities=13% Similarity=0.125 Sum_probs=27.7 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHH Q lcl|NC_020201. 61 FMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLS 140 (153) Q Consensus 61 Flr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~ 140 (153) .=-.++++-... ++++-..+. +.+..++..-|..++..++..-. ..++.| +|||.|+ T Consensus 1 i~~~Gld~l~~~----l~~~~~~~~--~~v~~a~~~~~~~i~~~a~~~a~----------------~~~~~p-~~TG~Lr 57 (115) T protein:vir:96 1 MNIDGLDALLNQ----FHDMKTNID--DDVDDILQENAKEYVVRAKLKAR----------------EVMNKG-YWTGNLS 57 (115) T ss_pred CcchhHHHHHHH----HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhcc----------------ccCCCC-CCchhhh Confidence 001122222222 221111110 11334444444444444433211 122333 7999999 Q ss_pred hhcceeeeeCCCC Q lcl|NC_020201. 141 SAATYKIVKYQGK 153 (153) Q Consensus 141 ~Sity~V~~~~gk 153 (153) +||++.. .+|- T Consensus 58 ~sI~~~~--~g~~ 68 (115) T protein:vir:96 58 RNIRYKK--TGDL 68 (115) T ss_pred hcceeee--cCce Confidence 9999864 2333 No 144 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=84.53 E-value=0.0021 Score=35.21 Aligned_cols=68 Identities=13% Similarity=0.125 Sum_probs=27.7 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHH Q lcl|NC_020201. 61 FMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLS 140 (153) Q Consensus 61 Flr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~ 140 (153) .=-.++++-... ++++-..+. +.+..++..-|..++..++..-. ..++.| +|||.|+ T Consensus 1 i~~~Gld~l~~~----l~~~~~~~~--~~v~~a~~~~~~~i~~~a~~~a~----------------~~~~~p-~~TG~Lr 57 (115) T protein:vir:10 1 MNIDGLDALLNQ----FHDMKTNID--DDVDDILQENAKEYVVRAKLKAR----------------EVMNKG-YWTGNLS 57 (115) T ss_pred CcchhHHHHHHH----HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhcc----------------ccCCCC-CCchhhh Confidence 001122222222 221111110 11334444444444444433211 122333 7999999 Q ss_pred hhcceeeeeCCCC Q lcl|NC_020201. 141 SAATYKIVKYQGK 153 (153) Q Consensus 141 ~Sity~V~~~~gk 153 (153) +||++.. .+|- T Consensus 58 ~sI~~~~--~g~~ 68 (115) T protein:vir:10 58 RNIRYKK--TGDL 68 (115) T ss_pred hcceeee--cCce Confidence 9999864 2333 No 145 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=84.53 E-value=0.0021 Score=35.21 Aligned_cols=68 Identities=13% Similarity=0.125 Sum_probs=27.7 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHH Q lcl|NC_020201. 61 FMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLS 140 (153) Q Consensus 61 Flr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~ 140 (153) .=-.++++-... ++++-..+. +.+..++..-|..++..++..-. ..++.| +|||.|+ T Consensus 1 i~~~Gld~l~~~----l~~~~~~~~--~~v~~a~~~~~~~i~~~a~~~a~----------------~~~~~p-~~TG~Lr 57 (115) T protein:vir:93 1 MNIDGLDALLNQ----FHDMKTNID--DDVDDILQENAKEYVVRAKLKAR----------------EVMNKG-YWTGNLS 57 (115) T ss_pred CcchhHHHHHHH----HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhcc----------------ccCCCC-CCchhhh Confidence 001122222222 221111110 11334444444444444433211 122333 7999999 Q ss_pred hhcceeeeeCCCC Q lcl|NC_020201. 141 SAATYKIVKYQGK 153 (153) Q Consensus 141 ~Sity~V~~~~gk 153 (153) +||++.. .+|- T Consensus 58 ~sI~~~~--~g~~ 68 (115) T protein:vir:93 58 RNIRYKK--TGDL 68 (115) T ss_pred hcceeee--cCce Confidence 9999864 2333 No 146 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=84.53 E-value=0.0021 Score=35.21 Aligned_cols=68 Identities=13% Similarity=0.125 Sum_probs=27.7 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHH Q lcl|NC_020201. 61 FMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLS 140 (153) Q Consensus 61 Flr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~ 140 (153) .=-.++++-... ++++-..+. +.+..++..-|..++..++..-. ..++.| +|||.|+ T Consensus 1 i~~~Gld~l~~~----l~~~~~~~~--~~v~~a~~~~~~~i~~~a~~~a~----------------~~~~~p-~~TG~Lr 57 (115) T protein:vir:97 1 MNIDGLDALLNQ----FHDMKTNID--DDVDDILQENAKEYVVRAKLKAR----------------EVMNKG-YWTGNLS 57 (115) T ss_pred CcchhHHHHHHH----HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhcc----------------ccCCCC-CCchhhh Confidence 001122222222 221111110 11334444444444444433211 122333 7999999 Q ss_pred hhcceeeeeCCCC Q lcl|NC_020201. 141 SAATYKIVKYQGK 153 (153) Q Consensus 141 ~Sity~V~~~~gk 153 (153) +||++.. .+|- T Consensus 58 ~sI~~~~--~g~~ 68 (115) T protein:vir:97 58 RNIRYKK--TGDL 68 (115) T ss_pred hcceeee--cCce Confidence 9999864 2333 No 147 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=84.43 E-value=0.0069 Score=32.45 Aligned_cols=88 Identities=16% Similarity=0.258 Sum_probs=53.7 Q ss_pred CCccccccHHHHHHHHHHHHHhhCC------------------------------------------------------- Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEK------------------------------------------------------- 25 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~------------------------------------------------------- 25 (153) |++.+ |.+.|+++.++|..+... T Consensus 1 m~~~~--d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k 78 (163) T protein:vir:10 1 MSGGF--DYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGK 78 (163) T ss_pred CCCcc--CHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcccccc Confidence 77776 555677766666543211 Q ss_pred ---EEEEeecCcCC-C--CC-----CCHHHHHHHHhcCCC-----CCCCCchhhHHHHHHHHHHHHHHHHHH----HHHH Q lcl|NC_020201. 26 ---EVEYGFYDEKH-Y--SG-----LNMATLAAIHEEGWN-----NLPERNFMFSTSMHFQEGLRKHIKRMH----NGII 85 (153) Q Consensus 26 ---~v~VGi~~~~~-~--dG-----~~va~iA~~~E~G~~-----~IP~RpFlr~~~~~~~~~~~~~~~~~~----~~~~ 85 (153) .++=||-.++. + ++ .+.+.+|-+-|||+. -+|-+.+|+.+.++.+.++.+.+++.+ +.++ T Consensus 79 ~tG~lr~swk~~~~~k~~~~~~v~v~N~~~YA~~VE~GHR~~~gGfV~G~fml~~s~~~~~~~~~~~~e~~l~~~l~k~~ 158 (163) T protein:vir:10 79 QGGTLQKGWSKSRIEVSGRTYKQKVYNKVYYAPHVEYGHKTVNGGFVPGQFFLHKTVEDTKSDMEKRVRDKYDGFMRKVV 158 (163) T ss_pred ccchhhccceecceeecCCceEEEEEecCCccchhhcceeecCCceeccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 11111111111 1 22 245788999999974 489999999999999888776666554 4555 Q ss_pred cCCCH Q lcl|NC_020201. 86 QGRGF 90 (153) Q Consensus 86 ~G~~~ 90 (153) +|+.- T Consensus 159 ~~~~~ 163 (163) T protein:vir:10 159 LGNGK 163 (163) T ss_pred cCCCC Confidence 66432 No 148 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=84.10 E-value=0.004 Score=33.70 Aligned_cols=67 Identities=7% Similarity=-0.056 Sum_probs=34.3 Q ss_pred hhHH-HHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHH Q lcl|NC_020201. 62 MFST-SMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLS 140 (153) Q Consensus 62 lr~~-~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~ 140 (153) |-.. +.-..+++.+.++.+.+.+. ..++.+|+..+..+++.++.. .| +|||.|+ T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~--~~~~~~l~~~a~~i~~~ak~~----------------------aP-v~TG~Lr 55 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLT--GAAREATEAAANDMVNMAKGL----------------------CP-VDTGRLR 55 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHh----------------------CC-ccchhhh Confidence 2111 11123344444444333221 124455555554444443222 22 6899999 Q ss_pred hhcceeeeeCCCC Q lcl|NC_020201. 141 SAATYKIVKYQGK 153 (153) Q Consensus 141 ~Sity~V~~~~gk 153 (153) +||++++...++. T Consensus 56 ~SI~~~~~~~g~~ 68 (142) T protein:vir:94 56 SSIQAVPSGGRFS 68 (142) T ss_pred ccceeeeccCCce Confidence 9999998777655 No 149 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=83.66 E-value=0.0038 Score=33.86 Aligned_cols=66 Identities=18% Similarity=0.099 Sum_probs=33.0 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 62 MFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 62 lr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) |-..+ ..-+++.+.++.+-..+. ..++.+|+..+..+++.+|... | +|||.|++ T Consensus 1 Ma~~~-~Gl~~l~~~l~~~~~~~~--~~~~~al~~~a~~i~~~ak~~a-------P----------------vdTG~Lr~ 54 (137) T protein:vir:10 1 MAKVK-YGNWELVKELEDFEKETI--RWAKKGIAKTTTIIHNSIVSNM-------P----------------VDTGYLRE 54 (137) T ss_pred CchhH-hhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhC-------C----------------cCcchhhc Confidence 22211 112233333332222221 1345666666666666555431 1 59999999 Q ss_pred hcceeeeeCCCC Q lcl|NC_020201. 142 AATYKIVKYQGK 153 (153) Q Consensus 142 Sity~V~~~~gk 153 (153) ||++++...+.. T Consensus 55 SI~~~~~~~~~~ 66 (137) T protein:vir:10 55 SVSMDFKKGGLT 66 (137) T ss_pred CeeEEeeCCcEE Confidence 999987544333 No 150 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=83.43 E-value=0.03 Score=28.95 Aligned_cols=87 Identities=9% Similarity=-0.128 Sum_probs=50.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccC----CCCCCccHHHHHhcCC----CCcchhHH Q lcl|NC_020201. 66 SMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGS----FSNPKVSKDWASYKGF----DDAMIHYG 137 (153) Q Consensus 66 ~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~----~~~p~ns~~Ti~~KG~----~~PLidTG 137 (153) +++.-.++.+.+..++..+. .......|..||..+....++.|.+.. -+++|.++.|++.|.. ..+|.... T Consensus 1 m~~~~~~l~~~L~~ll~~L~-~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~~~~~m~~~l 79 (156) T protein:vir:11 1 MADSLEALEDWAGPILRALE-PGPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQGRIRRKIKMFQKL 79 (156) T ss_pred CchhHHHHHHHHHHHHHhcC-CcchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhccccccchhhhhhh Confidence 55555566666666555443 224567899999999999999998631 1345788888876532 23333333 Q ss_pred HHHhhcceeeeeCCC------C Q lcl|NC_020201. 138 DLSSAATYKIVKYQG------K 153 (153) Q Consensus 138 ~L~~Sity~V~~~~g------k 153 (153) .+..+|.+......- . T Consensus 80 ~~~~~l~~~~~~~~a~vg~~Gs 101 (156) T protein:vir:11 80 RTVRYLRAKGDAQAITVSFAGR 101 (156) T ss_pred hhhheeeeeecCcEEEEEecCC Confidence 333335554432221 1 No 151 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=83.25 E-value=0.0037 Score=33.90 Aligned_cols=66 Identities=18% Similarity=0.107 Sum_probs=33.8 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 62 MFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 62 lr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) |-... ..-+++.+.++.+-..+. ..++.+|+..+..+++.+|... | +|||.|++ T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~--~~~~~al~~~a~~i~~~ak~~a-------P----------------v~TG~Lr~ 54 (137) T protein:vir:10 1 MAKVK-YGNWDLVKELEEFEKETI--RWAKKGIAKTTTIIHNSIVSNM-------P----------------VDTGYLRE 54 (137) T ss_pred Cccch-hCHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhC-------C----------------cCcchhhc Confidence 32211 122333333333333221 1345666666665555555431 1 58999999 Q ss_pred hcceeeeeCCCC Q lcl|NC_020201. 142 AATYKIVKYQGK 153 (153) Q Consensus 142 Sity~V~~~~gk 153 (153) ||++++..++.. T Consensus 55 SI~~~~~~~~~~ 66 (137) T protein:vir:10 55 SVSMDFKKGGLT 66 (137) T ss_pred CeeeEecCCcEE Confidence 999987554323 No 152 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=82.68 E-value=0.012 Score=31.12 Aligned_cols=71 Identities=7% Similarity=0.094 Sum_probs=32.0 Q ss_pred CCCchhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhH Q lcl|NC_020201. 57 PERNFMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHY 136 (153) Q Consensus 57 P~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidT 136 (153) =+|...+-.+ +...++.+.+++.-..+. +.++++|...+..+++.++.. .| +|| T Consensus 1 m~~ms~~i~~-~g~~~l~~~l~~~~~~~~--~~v~~~l~~~a~~i~~~ak~~-------ap----------------v~T 54 (144) T protein:vir:59 1 MALMSVRIDP-SWRRIMSRNVRTFSGHVL--TQVEQVIIKTAEKIAGLAASL-------AP----------------VDE 54 (144) T ss_pred CCcceeeehh-HHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHh-------CC----------------ccc Confidence 2233221110 112233333333222221 124455555555444444321 11 589 Q ss_pred HHHHhhcceeeeeCCCC Q lcl|NC_020201. 137 GDLSSAATYKIVKYQGK 153 (153) Q Consensus 137 G~L~~Sity~V~~~~gk 153 (153) |.|++||++++...+.. T Consensus 55 G~Lr~SI~~~~~~~g~~ 71 (144) T protein:vir:59 55 GNLKNSIQIDYKNNGLT 71 (144) T ss_pred hhhhcCeeEEeecCcEE Confidence 99999999988444322 No 153 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=82.01 E-value=0.0066 Score=32.54 Aligned_cols=64 Identities=17% Similarity=0.148 Sum_probs=26.7 Q ss_pred hhH-HHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHH-HHHHhccCCCCCCccHHHHHhcCCCCcchhHHHH Q lcl|NC_020201. 62 MFS-TSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSI-RFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDL 139 (153) Q Consensus 62 lr~-~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i-~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L 139 (153) |-. =|+.+...+...+.. .+.+++..++......+ +.... ...| +|||.| T Consensus 1 ~~~~~f~~~~~~~~~~~~k---------~~~~~~~~~a~~~~~~~ie~~ak------------------~~~p-vdtG~L 52 (141) T protein:vir:78 1 MNEFEFDSNIPKARKLIEK---------KVLQALEDIGEHMTTELAEGGHG------------------VTSN-NDTGEY 52 (141) T ss_pred CcchhHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHhhh------------------hccc-cccchh Confidence 111 111222222211111 12233333333222211 11111 1123 799999 Q ss_pred HhhcceeeeeCCCC Q lcl|NC_020201. 140 SSAATYKIVKYQGK 153 (153) Q Consensus 140 ~~Sity~V~~~~gk 153 (153) ++||+|+|...+.+ T Consensus 53 ~~SI~~~v~~~g~~ 66 (141) T protein:vir:78 53 AQKSGYKVRKSSKE 66 (141) T ss_pred hcceeeeeecCCcE Confidence 99999998766555 No 154 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=81.99 E-value=0.0048 Score=33.31 Aligned_cols=66 Identities=9% Similarity=-0.011 Sum_probs=29.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHh Q lcl|NC_020201. 62 MFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSS 141 (153) Q Consensus 62 lr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~ 141 (153) |-.. ...-+++.+.++++...+. ..++++|...+..+++.++.. .| +|||.|++ T Consensus 1 Ma~~-~~Gl~~l~~~l~~~~~~~~--~~~~~al~~~a~~v~~~ak~~----------------------ap-vdTG~Lr~ 54 (135) T protein:vir:96 1 MAKV-KYGADSIVVDLEKYSKDME--KWVKKGITKTTLKIYNTAIHL----------------------MP-VDTGFLRQ 54 (135) T ss_pred Cchh-hhhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHh----------------------CC-ccchhhhc Confidence 2111 0112223333333222221 123444555444444443321 12 79999999 Q ss_pred hcceeeeeCCCC Q lcl|NC_020201. 142 AATYKIVKYQGK 153 (153) Q Consensus 142 Sity~V~~~~gk 153 (153) ||+++|.+.+-. T Consensus 55 SI~~~~~~~g~~ 66 (135) T protein:vir:96 55 STTVDFENGGFT 66 (135) T ss_pred ceeEEeecCcEE Confidence 999987544322 No 155 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=80.05 E-value=0.0095 Score=31.68 Aligned_cols=45 Identities=13% Similarity=0.095 Sum_probs=24.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHhhcceeeeeCCCC Q lcl|NC_020201. 90 FSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSSAATYKIVKYQGK 153 (153) Q Consensus 90 ~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~Sity~V~~~~gk 153 (153) ++.++.+.-..+...|+..+.. . .| +|||.|++||++++...+-. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~--~----------------ap-v~TG~Lr~SI~~~~~~~~~~ 45 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIIS--L----------------MP-VDTGYLRESVTMDFKDGGFT 45 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHh--h----------------CC-ccccccccceeEEeecCcEE Confidence 3333333333333344444432 1 12 48999999999988554322 No 156 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=78.81 E-value=0.01 Score=31.54 Aligned_cols=75 Identities=15% Similarity=0.124 Sum_probs=40.4 Q ss_pred CCccccccHHHHHHHHHHHHHh---------hC------------------------------------------CEEEE Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDL---------AE------------------------------------------KEVEY 29 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l---------~~------------------------------------------~~v~V 29 (153) |++.+++.. ++++.|++. .. ++|+| T Consensus 1 msvevkGv~----eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~v 76 (134) T protein:vir:95 1 MSVKVIGDK----ALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITV 76 (134) T ss_pred CeEEEecHH----HHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEE Confidence 888887543 333333222 00 12444 Q ss_pred eecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhH--------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 30 GFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFS--------TSMHFQEGLRKHIKRMHNGI 84 (153) Q Consensus 30 Gi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~--------~~~~~~~~~~~~~~~~~~~~ 84 (153) ||-... +=--|--+||||.-.--..+|++| +++..+..+.+.++.-++.+ T Consensus 77 gW~G~~-----~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:95 77 HWRGSK-----DRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred EEEcCC-----ceeEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 442211 002344578999643323444444 77778888877777776666 No 157 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=78.81 E-value=0.01 Score=31.54 Aligned_cols=75 Identities=15% Similarity=0.124 Sum_probs=40.4 Q ss_pred CCccccccHHHHHHHHHHHHHh---------hC------------------------------------------CEEEE Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDL---------AE------------------------------------------KEVEY 29 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l---------~~------------------------------------------~~v~V 29 (153) |++.+++.. ++++.|++. .. ++|+| T Consensus 1 msvevkGv~----eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~v 76 (134) T protein:vir:10 1 MSVKVIGDK----ALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITV 76 (134) T ss_pred CeEEEecHH----HHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEE Confidence 888887543 333333222 00 12444 Q ss_pred eecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhH--------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 30 GFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFS--------TSMHFQEGLRKHIKRMHNGI 84 (153) Q Consensus 30 Gi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~--------~~~~~~~~~~~~~~~~~~~~ 84 (153) ||-... +=--|--+||||.-.--..+|++| +++..+..+.+.++.-++.+ T Consensus 77 gW~G~~-----~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 77 HWRGSK-----DRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred EEEcCC-----ceeEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 442211 002344578999643323444444 77778888877777776666 No 158 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=67.07 E-value=0.021 Score=29.84 Aligned_cols=65 Identities=8% Similarity=-0.063 Sum_probs=26.9 Q ss_pred HHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHhhcceeeeeCCCC Q lcl|NC_020201. 77 IKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSSAATYKIVKYQGK 153 (153) Q Consensus 77 ~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~Sity~V~~~~gk 153 (153) |.+.-..+.-+-+.+.+-..++..+...++..-.. .....| .+.| +|||.|++||++++.+.++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------v~~~ak-~~aP-vdtG~Lr~SI~~~~~~~~~~ 65 (140) T protein:vir:97 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRR----------IANQSR-VAVP-VRTGNLGRTIGELPQVYTPF 65 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHH----------HHHHHH-hcCC-ccchhhhccceeeeeeCCCc Confidence 11111111111122222222333333222221110 001111 1123 69999999999998887766 No 159 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=67.07 E-value=0.021 Score=29.84 Aligned_cols=65 Identities=8% Similarity=-0.063 Sum_probs=26.9 Q ss_pred HHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHHHhhcceeeeeCCCC Q lcl|NC_020201. 77 IKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDLSSAATYKIVKYQGK 153 (153) Q Consensus 77 ~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L~~Sity~V~~~~gk 153 (153) |.+.-..+.-+-+.+.+-..++..+...++..-.. .....| .+.| +|||.|++||++++.+.++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------v~~~ak-~~aP-vdtG~Lr~SI~~~~~~~~~~ 65 (140) T protein:vir:10 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRR----------IANQSR-VAVP-VRTGNLGRTIGELPQVYTPF 65 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHH----------HHHHHH-hcCC-ccchhhhccceeeeeeCCCc Confidence 11111111111122222222333333222221110 001111 1123 69999999999998887766 No 160 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=66.10 E-value=0.035 Score=28.57 Aligned_cols=93 Identities=19% Similarity=0.215 Sum_probs=48.1 Q ss_pred CCc--cccccHHHHHHHHHHHHHh-------------------------------hCCEEEEeecCcCCCCCCC-HHHHH Q lcl|NC_020201. 1 MAK--KSSTDISELKRYFSQLSDL-------------------------------AEKEVEYGFYDEKHYSGLN-MATLA 46 (153) Q Consensus 1 M~~--~i~~~~~~l~~~~~~l~~l-------------------------------~~~~v~VGi~~~~~~dG~~-va~iA 46 (153) |++ +.+....+.+.+.+.|..- ..-...|||. ...++|+. =|+|| T Consensus 22 lt~eqkakITkAGAkv~~~~L~~~t~~kHy~~k~t~~~~HLaDsI~~~~~niDg~~dG~s~VGf~-~k~~~~~~~kA~iA 100 (168) T protein:vir:74 22 MTVEDKAEVTKAGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVKDGQSVVGWE-RSTEKGTHTKGYIA 100 (168) T ss_pred CCHHHHHHHHHhhhHHHHHHHHHHhHHhhcccCCCcccchhhhheeecccccCcccCCceeeccc-ccccccccchhhhh Confidence 422 1111122222222222221 2234567773 23344543 48999 Q ss_pred HHHhcCCC------------------CCCCCchhhHHHHH--HHHHHHHHHHHHHHHHHcCCCHHHHH Q lcl|NC_020201. 47 AIHEEGWN------------------NLPERNFMFSTSMH--FQEGLRKHIKRMHNGIIQGRGFSSYL 94 (153) Q Consensus 47 ~~~E~G~~------------------~IP~RpFlr~~~~~--~~~~~~~~~~~~~~~~~~G~~~~~~L 94 (153) .+.+-|+. .||.=.|+..+-.+ .++++.+.....++.+++-...+.-| T Consensus 101 r~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~~~~~k~~V~~Ae~~~y~eIl~~k~~~~~~ 168 (168) T protein:vir:74 101 NIINNGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRMNLIVQQGILKAEAEAMRKIINRKKKENNL 168 (168) T ss_pred hhhcccccccccccccccccccccccccccchhHHHHHhhhhhHHHHHHHHHHHHHHHHHhhcCCCCC Confidence 99999983 59999999998776 34665555555555555332111111 No 161 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=65.66 E-value=0.043 Score=28.07 Aligned_cols=60 Identities=10% Similarity=0.133 Sum_probs=34.3 Q ss_pred HcCC-CHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHH-hcCCCCcc------hhHHHHHhhcceeeeeCCCC Q lcl|NC_020201. 85 IQGR-GFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWAS-YKGFDDAM------IHYGDLSSAATYKIVKYQGK 153 (153) Q Consensus 85 ~~G~-~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~-~KG~~~PL------idTG~L~~Sity~V~~~~gk 153 (153) +.|. ..+..|... ...+++...... -+..|.+ .+--+.|. .|||.|+.||++++++.+.. T Consensus 1 i~G~~~L~~~Lk~~---s~~dvk~VVkkN------~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g~~ 68 (127) T protein:vir:98 1 MTGMPALEVKLRSM---SEKRWDRVANKN------LTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNSSKD 68 (127) T ss_pred CcChHHHHHHHHHh---hHHHHHHHHhhh------hHHHHHHHHhccCCceeccccccCcccceeeeEEEEecCCce Confidence 4453 344444433 446666655431 1333333 12113333 48999999999999998876 No 162 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=64.72 E-value=0.099 Score=26.09 Aligned_cols=78 Identities=13% Similarity=0.214 Sum_probs=47.5 Q ss_pred CCccccccHHHHHHHHHHHHH-hhC--------------------------------------------------CEEEE Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSD-LAE--------------------------------------------------KEVEY 29 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~-l~~--------------------------------------------------~~v~V 29 (153) |+.-. +..++++++++|++ |+. ++|+| T Consensus 1 ~~~~a--evkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s~~~~~~G~r~V~V 78 (132) T protein:vir:96 1 MSGFA--NLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKL 78 (132) T ss_pred CCccc--cccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeecCeeecCCceEEEe Confidence 55433 23355666666655 332 12333 Q ss_pred eecCcCCCCCCCHHHHHHHHhcCCC-CCCCCc--hhhHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_020201. 30 GFYDEKHYSGLNMATLAAIHEEGWN-NLPERN--FMFSTSMHFQEGLRKHIKRMHNGIIQG 87 (153) Q Consensus 30 Gi~~~~~~dG~~va~iA~~~E~G~~-~IP~Rp--Flr~~~~~~~~~~~~~~~~~~~~~~~G 87 (153) ||. +++ -.|--+||||.. .|-||- +++.+++..+..+.+.++.-++..++| T Consensus 79 gW~-GpR------~~ivHLNE~GyGk~~~PrG~G~I~~a~~~se~~~~~~~~~elkk~l~~ 132 (132) T protein:vir:96 79 GFT-TPR------WNIVHLQELEYGWKHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFDG 132 (132) T ss_pred ccc-CCc------eeEEeeecccccCCcCCCcchHHHHHHHhhhhHHHHHHHHHHHHHhcC Confidence 332 111 134457899985 355554 688899988888888888888888888 No 163 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=63.72 E-value=0.098 Score=26.11 Aligned_cols=78 Identities=13% Similarity=0.247 Sum_probs=48.0 Q ss_pred CCccccccHHHHHHHHHHHHH-hhC--------------------------------------------------CEEEE Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSD-LAE--------------------------------------------------KEVEY 29 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~-l~~--------------------------------------------------~~v~V 29 (153) |+.-. +.+++++++++|++ |+. ++|+| T Consensus 7 ~~~~a--evkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~~~~G~r~V~i 84 (138) T protein:vir:98 7 MSGFA--NLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRREDGIPKVKL 84 (138) T ss_pred ccccc--cccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCeeecCCceEEEE Confidence 43322 23355555555554 111 23444 Q ss_pred eecCcCCCCCCCHHHHHHHHhcCCC-CCCCCc--hhhHHHHHHHHHHHHHHHHHHHHHHcC Q lcl|NC_020201. 30 GFYDEKHYSGLNMATLAAIHEEGWN-NLPERN--FMFSTSMHFQEGLRKHIKRMHNGIIQG 87 (153) Q Consensus 30 Gi~~~~~~dG~~va~iA~~~E~G~~-~IP~Rp--Flr~~~~~~~~~~~~~~~~~~~~~~~G 87 (153) ||.. ++| .|--+||||.. .|-||- +++.+++..+..+.+.++.-++..++| T Consensus 85 gW~G-pR~------~ivHLNE~GyGk~i~PrG~G~I~ka~~~se~~y~~~vk~el~k~l~~ 138 (138) T protein:vir:98 85 GFTT-PRW------NIVHLQELEYGWKHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFDG 138 (138) T ss_pred eeec-Cee------eEEeeecccccCCcCCCcchHHHHHHHhhhHHHHHHHHHHHHHHhcC Confidence 4421 121 34457899985 355554 688899999999999999888888888 No 164 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=58.26 E-value=0.022 Score=29.65 Aligned_cols=83 Identities=17% Similarity=0.219 Sum_probs=44.1 Q ss_pred CCccccccHH-----HH-----------HHHH----H-HHHHhh-----CCEEEEeecCc-----CCCCCCC-----HHH Q lcl|NC_020201. 1 MAKKSSTDIS-----EL-----------KRYF----S-QLSDLA-----EKEVEYGFYDE-----KHYSGLN-----MAT 44 (153) Q Consensus 1 M~~~i~~~~~-----~l-----------~~~~----~-~l~~l~-----~~~v~VGi~~~-----~~~dG~~-----va~ 44 (153) |+.-.-.++. .| ++.+ + .++.|. .-..+-|-+.. ...+|.. --+ T Consensus 1 M~~i~id~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~e~~~V~nk~~yq 80 (124) T protein:vir:95 1 MAKIKIGRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVPNGWVIHNKTEYR 80 (124) T ss_pred CccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeecCceeEEEcCCCc Confidence 6552212221 11 1111 0 011111 11123332211 1223321 146 Q ss_pred HHHHHhcCC-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 45 LAAIHEEGW-----NNLPERNFMFSTSMHFQEGLRKHIKRMHNG 83 (153) Q Consensus 45 iA~~~E~G~-----~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~ 83 (153) ++-+.|||. .+.++|||++|..+.....+.+.++..++. T Consensus 81 LtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~ 124 (124) T protein:vir:95 81 LAHLLEYGHATVDGGRVPGTPHIRPIEDWLEKEFEDRVEKAIKQ 124 (124) T ss_pred eeeeeecceeccCCcccCCccchhHHHHHHHHHHHHHHHHHhcC Confidence 788999996 479999999999999888888888877665 No 165 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=56.11 E-value=0.065 Score=27.09 Aligned_cols=60 Identities=17% Similarity=0.102 Sum_probs=26.7 Q ss_pred hhHHHH--HHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhccCCCCCCccHHHHHhcCCCCcchhHHHH Q lcl|NC_020201. 62 MFSTSM--HFQEGLRKHIKRMHNGIIQGRGFSSYLTKIGKDAADSIRFTISTGSFSNPKVSKDWASYKGFDDAMIHYGDL 139 (153) Q Consensus 62 lr~~~~--~~~~~~~~~~~~~~~~~~~G~~~~~~L~~iG~~~~~~i~~~I~~~~~~~p~ns~~Ti~~KG~~~PLidTG~L 139 (153) |-.++. .+...+.+.+.+ .++..|+.++...++..|.. .| +|||.| T Consensus 1 m~~s~~i~i~~~~l~~~v~~---------~~k~~l~~~a~~i~~~ak~~----------------------aP-v~tG~L 48 (137) T protein:vir:10 1 MPVTARIHINEPELERQTGA---------IFRGKHRSITRRIATQARAD----------------------VP-VRTGNL 48 (137) T ss_pred CCeeEEEeeCHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHh----------------------CC-cccchh Confidence 222111 011111111111 12233444443333332211 12 699999 Q ss_pred HhhcceeeeeCCCC Q lcl|NC_020201. 140 SSAATYKIVKYQGK 153 (153) Q Consensus 140 ~~Sity~V~~~~gk 153 (153) ++||++++...++- T Consensus 49 r~SI~~~~~~~~~~ 62 (137) T protein:vir:10 49 GRGIQEMPQTYRPF 62 (137) T ss_pred hcCceeeeeccccc Confidence 99999998776654 No 166 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=53.94 E-value=0.06 Score=27.29 Aligned_cols=57 Identities=21% Similarity=0.221 Sum_probs=33.4 Q ss_pred CCccccccHHHHHHHHHHHHHhhCC----------------------EEEEeecCcC-----CCCCC--------CHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEK----------------------EVEYGFYDEK-----HYSGL--------NMATL 45 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~----------------------~v~VGi~~~~-----~~dG~--------~va~i 45 (153) |+- +..+.+++++|++.|++.... -|.-|..... ..+|+ +-+.+ T Consensus 1 Ma~-~~i~~~Gld~L~~~L~~~~~~~~v~~vv~~~~~~l~~~ak~~ap~dTG~lrrSI~~~~~~~g~~~~v~~~gp~a~Y 79 (92) T protein:vir:99 1 MAD-YSISWDGLDALDEALANQQNMNTVKKVVKKHTANLMTATQQAVPVDTGHLKQSAQIQISRDGFTGSVTYGGGLVNY 79 (92) T ss_pred CCc-eeeEeehHHHHHHHHHhhccHHHHHHHHHHHHHHHHHHHHHhCCCCccccceeeeEEeecCCeeEEEEeccCcccc Confidence 776 333445778888877654221 1222221110 11232 45899 Q ss_pred HHHHhcCCCCCCC Q lcl|NC_020201. 46 AAIHEEGWNNLPE 58 (153) Q Consensus 46 A~~~E~G~~~IP~ 58 (153) |-+.||||+..++ T Consensus 80 a~YvE~GTR~M~A 92 (92) T protein:vir:99 80 AAYVEFGTRFMDS 92 (92) T ss_pred ccccccceeecCC Confidence 9999999999998 No 167 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=53.12 E-value=0.54 Score=22.07 Aligned_cols=74 Identities=14% Similarity=0.054 Sum_probs=41.0 Q ss_pred CCcccccc-----HHHHHHHHHHHHHhh----CCEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTD-----ISELKRYFSQLSDLA----EKEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQE 71 (153) Q Consensus 1 M~~~i~~~-----~~~l~~~~~~l~~l~----~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~ 71 (153) |....... ...+...-..+..+. +..+-++ +.+-||.-.|||+..-.|+.|.|.++.+.+. T Consensus 60 ~~~~~~~~~d~~G~~t~~~~~~~~~~i~~~~~g~~iyi~----------Nn~pYA~~LEyG~S~QAP~G~v~~a~q~~~~ 129 (142) T protein:vir:10 60 PAAQSLNNYDPDGNETRNSLRRQIYALARDANTNVIYIS----------NRLDYAQGLEFGSSNQAPSGVLGVVQKRLGR 129 (142) T ss_pred cccccccCcCCCCccchhhHHHHHHHhhhccccceEEEe----------eCcchhhhhhccccCCCcchHHHHHHHHHHH Confidence 44433221 001111111122221 1222222 2367888899999999999999999887776 Q ss_pred HHHHHHHHHHHHH Q lcl|NC_020201. 72 GLRKHIKRMHNGI 84 (153) Q Consensus 72 ~~~~~~~~~~~~~ 84 (153) -+.+..+++-..+ T Consensus 130 ~v~~a~~e~~~~~ 142 (142) T protein:vir:10 130 YFAEAVQEAKRAL 142 (142) T ss_pred HHHHHHHHhhccC Confidence 6666665554444 No 168 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=53.03 E-value=0.093 Score=26.23 Aligned_cols=83 Identities=22% Similarity=0.290 Sum_probs=41.7 Q ss_pred CCcccccc-HHHHHHHHHHH-----------------------HHhh-----CCEEEEeecCc-----CCCCCC-----C Q lcl|NC_020201. 1 MAKKSSTD-ISELKRYFSQL-----------------------SDLA-----EKEVEYGFYDE-----KHYSGL-----N 41 (153) Q Consensus 1 M~~~i~~~-~~~l~~~~~~l-----------------------~~l~-----~~~v~VGi~~~-----~~~dG~-----~ 41 (153) |+. |+.| +. +.|.+.| +.|. .--++-|-+.. ...++. . T Consensus 1 M~~-i~id~La--~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~~~~~v~nk~ 77 (127) T protein:vir:80 1 MAN-IKIDRLG--DEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTPGGWVIHNKT 77 (127) T ss_pred Ccc-ccHhhHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeeccCceeEeecC Confidence 665 3322 21 1111111 1111 01112332211 111221 1 Q ss_pred HHHHHHHHhcCC-----CCCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCCC Q lcl|NC_020201. 42 MATLAAIHEEGW-----NNLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGRG 89 (153) Q Consensus 42 va~iA~~~E~G~-----~~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~~ 89 (153) .-+++-+.|||. .+.++|||++|..+....++.+.++..++ +|.- T Consensus 78 ~yqLtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~---~~~~ 127 (127) T protein:vir:80 78 EYRLAHLLEYGHATVDGGRVPETPHIRPVEDWLEKEFEDRVERAIK---NESR 127 (127) T ss_pred CcceeehhhcceeccCCcccCCccchhhHHHHHHHHHHHHHHHHhc---CCCC Confidence 237889999996 47999999999988877777666665433 3322 No 169 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=48.33 E-value=0.071 Score=26.90 Aligned_cols=86 Identities=17% Similarity=0.197 Sum_probs=47.2 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCCCCCC-HHHHHHHHhcCCC------------------CCCCCch Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHYSGLN-MATLAAIHEEGWN------------------NLPERNF 61 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~dG~~-va~iA~~~E~G~~------------------~IP~RpF 61 (153) |+=.|+.....++ ....-...|||... ..+|+- -|+||.|.+-|+. .||.=.| T Consensus 62 LaDsI~~~~~niD-------g~~dG~s~VGf~~k-~~~~~~~ka~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHF 133 (168) T protein:vir:10 62 LADSIVMKNKNID-------GVKDGQSVVGWERS-TEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHF 133 (168) T ss_pred hhhhheecccccc-------cccCCceeecccCc-cccccccchheeeeccccccccccccccccccccccccccccchh Confidence 4444432211111 12233567777432 223433 5999999999983 5999999 Q ss_pred hhHHHHHH--HHHHHHHHHHHHHHHHcCCCHHHHH Q lcl|NC_020201. 62 MFSTSMHF--QEGLRKHIKRMHNGIIQGRGFSSYL 94 (153) Q Consensus 62 lr~~~~~~--~~~~~~~~~~~~~~~~~G~~~~~~L 94 (153) +..+-.+. ++++.+.....++.+++-...+.-| T Consensus 134 vd~~r~d~a~k~~V~~Ae~~~y~eIl~~k~~~~~~ 168 (168) T protein:vir:10 134 IEETRKNPIVQQGILKAEAEAMRKIINRKKKESNL 168 (168) T ss_pred HHHhhhchhhhHHHHHHHHHHHHHHHHhhcCCCCC Confidence 99988763 5555555555555555332111111 No 170 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=47.88 E-value=0.091 Score=26.30 Aligned_cols=91 Identities=21% Similarity=0.327 Sum_probs=52.7 Q ss_pred CCcccc--ccHHHHHHHHHHHHHhhCCE------------EEEeecC-------cC-------C-CC------------- Q lcl|NC_020201. 1 MAKKSS--TDISELKRYFSQLSDLAEKE------------VEYGFYD-------EK-------H-YS------------- 38 (153) Q Consensus 1 M~~~i~--~~~~~l~~~~~~l~~l~~~~------------v~VGi~~-------~~-------~-~d------------- 38 (153) |+.+.- ..-.++.++...|+.+-+.+ .+|+++. ++ . .. T Consensus 1 ma~~~~~~vrV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s~~~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:62 1 MAQRSAYTIRVDGLREFQRNVRTLRDKELNKAVREANKASGEVLIPQAKHESPDGKRDAKSSKKYRPGKLDKSIKVTASA 80 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccCcchhhccccccccc Confidence 654421 11124455555555542222 2333321 11 0 01 Q ss_pred -------CC-CHHHHHHHHhcCCC--CCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH Q lcl|NC_020201. 39 -------GL-NMATLAAIHEEGWN--NLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTK 96 (153) Q Consensus 39 -------G~-~va~iA~~~E~G~~--~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~ 96 (153) |- .-.-+|.+-+||++ +|-++-|+..++....++|.+..++-+.++++- -|+. T Consensus 81 raa~VrAG~~krVPYA~~I~~G~r~r~Isp~rFl~~a~a~te~~~~r~Ye~~i~~vl~k-----~l~s 143 (143) T protein:vir:62 81 KGAVIKAGSASRVPYAAAIHFGYRARNISPNRFLFRAMARKSDVVAATYERRIAAVVEK-----YLES 143 (143) T ss_pred cceeeeeCCcCCCCcccccccCcccccccchhhhhhhhhccCHHHHHHHHHHHHHHHHH-----HhcC Confidence 11 22346778899985 788999999999999999999998888776543 2222 No 171 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=47.70 E-value=0.38 Score=22.90 Aligned_cols=77 Identities=14% Similarity=0.084 Sum_probs=39.9 Q ss_pred CCcccccc--H---HHHHHHHHHHHHhhCCEE-EEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTD--I---SELKRYFSQLSDLAEKEV-EYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLR 74 (153) Q Consensus 1 M~~~i~~~--~---~~l~~~~~~l~~l~~~~v-~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~ 74 (153) |....... . .....+.+....+.+.++ .+=++. +++-||.-.|||+.+-+|..|.|.++.+.+.-+. T Consensus 63 ~~~~~~~~~d~~G~~t~~~~~~~~~~i~~~k~g~~iyi~-------Nn~pYA~~LEyG~S~QAP~G~v~~~~~~~~~~v~ 135 (145) T protein:vir:10 63 PAQQSLNEYDQTGGQTKTYLARQARAVANSKATSVIYIT-------NRLDYAADLEYGASNQAPAGVLGVVQARLGRYFQ 135 (145) T ss_pred cccccccccCCCCccchhhHHHHHHHhhcccccceEEEe-------eCchhhhHhhccccCCCcchHHHHHHHHHHHHHH Confidence 33322110 0 011111111112221110 000111 2367888889999999999999999988876666 Q ss_pred HHHHHHHHHH Q lcl|NC_020201. 75 KHIKRMHNGI 84 (153) Q Consensus 75 ~~~~~~~~~~ 84 (153) +..+++-+++ T Consensus 136 ~~~~e~k~~~ 145 (145) T protein:vir:10 136 EAVEEARRAI 145 (145) T ss_pred HHHHHhhccC Confidence 6665555544 No 172 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=47.00 E-value=0.19 Score=24.56 Aligned_cols=73 Identities=15% Similarity=0.226 Sum_probs=38.5 Q ss_pred CCccccccHHHHHHHHHHHHHh---------h-----------------------------------------C---CEE Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDL---------A-----------------------------------------E---KEV 27 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l---------~-----------------------------------------~---~~v 27 (153) |++.+|+.. ++++.|++. . + ++| T Consensus 1 msvevkGv~----eilk~le~k~G~~~~~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV 76 (133) T protein:vir:93 1 MSVEIKGIP----EVLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAV 76 (133) T ss_pred CeEEEecHH----HHHHHHHHhhCHhhhHhhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceEE Confidence 888887653 333333221 0 1 223 Q ss_pred EEeecCc-CCCCCCCHHHHHHHHhcCCC----CCCCCch--hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 28 EYGFYDE-KHYSGLNMATLAAIHEEGWN----NLPERNF--MFSTSMHFQEGLRKHIKRMHNG 83 (153) Q Consensus 28 ~VGi~~~-~~~dG~~va~iA~~~E~G~~----~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 83 (153) +|||... .+ --|--+||||.- .|-||-| ++.+++..+..+.+.++.-++. T Consensus 77 ~i~W~gp~~R------~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 77 LIEWVGPMNR------KNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAANERKYREIIKKELAR 133 (133) T ss_pred EEEeecCCCc------eeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 4444322 11 134457899963 3567775 6666776676666666654443 No 173 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=44.27 E-value=0.11 Score=25.92 Aligned_cols=91 Identities=21% Similarity=0.366 Sum_probs=52.3 Q ss_pred CCcccc--ccHHHHHHHHHHHHHhhCCE------------EEEeecCcC--------------CC-C------------- Q lcl|NC_020201. 1 MAKKSS--TDISELKRYFSQLSDLAEKE------------VEYGFYDEK--------------HY-S------------- 38 (153) Q Consensus 1 M~~~i~--~~~~~l~~~~~~l~~l~~~~------------v~VGi~~~~--------------~~-d------------- 38 (153) |+.+.- ..-.++.++...|+++-+.+ .+|+++.-. +| . T Consensus 1 ma~~~~~~vkV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~srr~r~G~L~~Sir~aaT~ 80 (143) T protein:vir:13 1 MAQRSAYTIQVDGLRQFQRNVRALRDKELNKAVREANKASGEVLIPQAKHESPDGHRDPKSSKRYRPGKLDKSIKVTASA 80 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCcchHHHHHHHHHHHHHHHHHHHhhcCCcccccccccccccchhhccccccccc Confidence 665422 11124455555555552222 233332110 00 0 Q ss_pred -------C-CCHHHHHHHHhcCCC--CCCCCchhhHHHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH Q lcl|NC_020201. 39 -------G-LNMATLAAIHEEGWN--NLPERNFMFSTSMHFQEGLRKHIKRMHNGIIQGRGFSSYLTK 96 (153) Q Consensus 39 -------G-~~va~iA~~~E~G~~--~IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~~~~~~L~~ 96 (153) | -.-.-+|.+-+||++ +|-++-|+..++...+++|.+..++-+.++++- -|+. T Consensus 81 raa~VrAGr~arVPYA~~I~~G~r~r~Is~~rFl~~a~a~te~~~~r~Ye~~i~~vl~k-----~l~s 143 (143) T protein:vir:13 81 KGAVIKAGSAARVPYAAAIHFGYRKRNISANRFLYRAMARKSDVVAATYERRIAAVVEK-----YLES 143 (143) T ss_pred cceeeeecCcCCCCcccccccCCcccccchhhhhhhhhhccCHHHHHHHHHHHHHHHHH-----HhcC Confidence 1 001245677899985 788999999999999999999999888776543 2222 No 174 >protein:vir:6216 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:10886 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852596;genbank:gi:31415856;genbank:GeneID:1489214 Probab=42.81 E-value=0.33 Score=23.22 Aligned_cols=85 Identities=12% Similarity=0.084 Sum_probs=41.3 Q ss_pred CCccccccHHHHHHHHHH-HHHh---------hCC-----EEEEeecCcCCC-CCCCHHHHHHHHhcCCCC------CCC Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQ-LSDL---------AEK-----EVEYGFYDEKHY-SGLNMATLAAIHEEGWNN------LPE 58 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~-l~~l---------~~~-----~v~VGi~~~~~~-dG~~va~iA~~~E~G~~~------IP~ 58 (153) -+|.-+.....|++...- ++.| .++ +++|=+-++.-. .=.+-|-+=++-|.|+.+ |-+ T Consensus 19 ~kVd~kvs~e~L~eAA~~f~~KL~P~Ip~Sl~kkk~HlrD~lkVvvk~d~V~V~Fed~a~yW~f~EnGt~~~~~~g~vka 98 (125) T protein:vir:62 19 LRVNKKVSLDALDEAAKYFASKLKPKINVSNKNKRTHLRDSLKVVVKDDRVSVEFKDEAWYWYLVEHGHKKAKGKGRVKG 98 (125) T ss_pred hhhhhhhhHHHHHHHHHHHHHhhccccChhhhhhhhhcceeeeEEeeCCeEEEEEcchhhhhhhhhccccccccccccch Confidence 111111112222222111 1111 111 233333221000 000126667788999854 899 Q ss_pred CchhhHHHHHHHHHHHHHHHHHHHHHHcCC Q lcl|NC_020201. 59 RNFMFSTSMHFQEGLRKHIKRMHNGIIQGR 88 (153) Q Consensus 59 RpFlr~~~~~~~~~~~~~~~~~~~~~~~G~ 88 (153) |-|...||+.+++.|.+.|.+-+ ++-. T Consensus 99 qhf~~~Tf~~nk~kI~~iM~kki---~d~m 125 (125) T protein:vir:62 99 KHFVQNTFDAEGDKIADIMAQKI---INRM 125 (125) T ss_pred hhhhhccHHhhHHHHHHHHHHHH---HhhC Confidence 99999999999999998877632 2222 No 175 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=37.94 E-value=0.6 Score=21.79 Aligned_cols=83 Identities=10% Similarity=0.090 Sum_probs=49.4 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEE----------eecCcC--CCCC--CCHHHHHHHHhcCCCC----------- Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEY----------GFYDEK--HYSG--LNMATLAAIHEEGWNN----------- 55 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~V----------Gi~~~~--~~dG--~~va~iA~~~E~G~~~----------- 55 (153) |+.-+..- ++++..++........-| +|.-+. .++| .+.+++|-+-|||... T Consensus 1 l~~~~~~~---~~~~a~~l~~~vk~rTPv~~~d~G~LR~sW~~g~v~k~~~~v~N~~eYA~~VE~GHRq~~g~g~~~~~~ 77 (116) T protein:vir:10 1 MSKNLRRA---KNNIGNKLLRKVKPKTPVAKIDGGTARKSWKYKELNLFDGVVSNNVEYIHHLEYGHRTRQGTGTSENYR 77 (116) T ss_pred CchHHHHH---HHHHHHHHHHHHHhhCCCCcCCCcccccCceeeeeeccCceeecCCcccccccCCceeeCCcceecccc Confidence 44444322 233333333322222223 222111 1233 3558999999999743 Q ss_pred --------CCCCchhhHHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_020201. 56 --------LPERNFMFSTSMHFQEGLRKHIKRMHNGIIQ 86 (153) Q Consensus 56 --------IP~RpFlr~~~~~~~~~~~~~~~~~~~~~~~ 86 (153) +|-+=||+.++.+.+..+.+.+++.+..+++ T Consensus 78 gkrlk~~~V~G~fml~~s~~e~~~~~~~~~~~~~~~~l~ 116 (116) T protein:vir:10 78 PKPNGISFVPGVFMLARSVDEMSSIIDDELNQIIIDFWN 116 (116) T ss_pred cccccCCccCceehHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 4556688999999999999999998888877 No 176 >protein:vir:4096 Length: 140 # NCBI annotation: Gp9 protein # Family: family:all:28682 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510990;swissprot:trembl:q8w600;genbank:gi:17488512;uniprot:Q8W600;genbank:GeneID:1260318 Probab=37.46 E-value=0.22 Score=24.21 Aligned_cols=86 Identities=15% Similarity=0.213 Sum_probs=48.4 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCCCCCCHHHHH-----------------------------HH--- Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHYSGLNMATLA-----------------------------AI--- 48 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~dG~~va~iA-----------------------------~~--- 48 (153) |+.+-+.|.+.++++.+.++.+-++.=++ |...-+.+|.++|.-. -. T Consensus 1 m~~~~sld~s~~e~L~~~i~r~P~ksE~~-IN~~L~tkg~~~~~~~I~~~iPvS~~~k~~~RnK~HAK~s~pl~~~~~NL 79 (140) T protein:vir:40 1 MCAKWSLEFSDVERLSNLISQIPNKSEAI-INKTLETKAVPLVKLNIEKRINLSKNWKGQLLNKNHAQSSGPFNVKMGNL 79 (140) T ss_pred CCcceecchhhHHHHHHHHHhccchHHHH-HHHHHHhhhhHHHHhhhhhccCcCccchhhhccccchhhhhhhhhhhhhc Confidence 99999999999999999998876642000 0000000111111000 00 Q ss_pred -------HhcC-----------CCCCCCCchhhHHHHHHHHHHHHHHHHHH----HHHHcCC Q lcl|NC_020201. 49 -------HEEG-----------WNNLPERNFMFSTSMHFQEGLRKHIKRMH----NGIIQGR 88 (153) Q Consensus 49 -------~E~G-----------~~~IP~RpFlr~~~~~~~~~~~~~~~~~~----~~~~~G~ 88 (153) --|| +..+|.| ||+.+++...+.+.+.+.+.+ ...+.|. T Consensus 80 gf~i~~k~kf~YLvfPD~G~G~sn~~~q~-FmerGl~~~t~~i~E~L~~~l~k~in~~Lgg~ 140 (140) T protein:vir:40 80 GFELLTKPKFNYLIFPDQGIGKHNKTKQD-FMQLGVEESSQEIVEMLEQAVFKEINDTLGGK 140 (140) T ss_pred ceeEeecCcccccccccccCCCCCcchHH-HHHhccccchhHHHHHHHHHHHHHHHHhhcCC Confidence 0122 2346666 999999988777766555544 4444455 No 177 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=34.82 E-value=0.41 Score=22.73 Aligned_cols=73 Identities=15% Similarity=0.236 Sum_probs=37.6 Q ss_pred CCccccccHHHHHHHHHHHHH-hh--------C--------------------------------------------CEE Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSD-LA--------E--------------------------------------------KEV 27 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~-l~--------~--------------------------------------------~~v 27 (153) |++.+|+.. ++++.|++ ++ . ++| T Consensus 1 msvevkGv~----eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV 76 (133) T protein:vir:93 1 MSVEIKGIP----EVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAV 76 (133) T ss_pred CeEEEecHH----HHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeE Confidence 888887643 33333332 10 0 223 Q ss_pred EEeecCc-CCCCCCCHHHHHHHHhcCCC----CCCCCch--hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 28 EYGFYDE-KHYSGLNMATLAAIHEEGWN----NLPERNF--MFSTSMHFQEGLRKHIKRMHNG 83 (153) Q Consensus 28 ~VGi~~~-~~~dG~~va~iA~~~E~G~~----~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 83 (153) +|||... .+ --|--+||||.- .|-||-| ++.+++..+..+.+.++.-++. T Consensus 77 ~i~W~gp~~R------~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 77 LIEWVGPMNR------KNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred EEEeecCCCc------eeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 3444221 11 134457899963 3557765 5666666666666665554433 No 178 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=34.82 E-value=0.41 Score=22.73 Aligned_cols=73 Identities=15% Similarity=0.236 Sum_probs=37.6 Q ss_pred CCccccccHHHHHHHHHHHHH-hh--------C--------------------------------------------CEE Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSD-LA--------E--------------------------------------------KEV 27 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~-l~--------~--------------------------------------------~~v 27 (153) |++.+|+.. ++++.|++ ++ . ++| T Consensus 1 msvevkGv~----eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV 76 (133) T protein:vir:94 1 MSVEIKGIP----EVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAV 76 (133) T ss_pred CeEEEecHH----HHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeE Confidence 888887643 33333332 10 0 223 Q ss_pred EEeecCc-CCCCCCCHHHHHHHHhcCCC----CCCCCch--hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 28 EYGFYDE-KHYSGLNMATLAAIHEEGWN----NLPERNF--MFSTSMHFQEGLRKHIKRMHNG 83 (153) Q Consensus 28 ~VGi~~~-~~~dG~~va~iA~~~E~G~~----~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 83 (153) +|||... .+ --|--+||||.- .|-||-| ++.+++..+..+.+.++.-++. T Consensus 77 ~i~W~gp~~R------~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:94 77 LIEWVGPMNR------KNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred EEEeecCCCc------eeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 3444221 11 134457899963 3557765 5666666666666665554433 No 179 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=34.82 E-value=0.41 Score=22.73 Aligned_cols=73 Identities=15% Similarity=0.236 Sum_probs=37.6 Q ss_pred CCccccccHHHHHHHHHHHHH-hh--------C--------------------------------------------CEE Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSD-LA--------E--------------------------------------------KEV 27 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~-l~--------~--------------------------------------------~~v 27 (153) |++.+|+.. ++++.|++ ++ . ++| T Consensus 1 msvevkGv~----eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV 76 (133) T protein:vir:78 1 MSVEIKGIP----EVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAV 76 (133) T ss_pred CeEEEecHH----HHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeE Confidence 888887643 33333332 10 0 223 Q ss_pred EEeecCc-CCCCCCCHHHHHHHHhcCCC----CCCCCch--hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 28 EYGFYDE-KHYSGLNMATLAAIHEEGWN----NLPERNF--MFSTSMHFQEGLRKHIKRMHNG 83 (153) Q Consensus 28 ~VGi~~~-~~~dG~~va~iA~~~E~G~~----~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 83 (153) +|||... .+ --|--+||||.- .|-||-| ++.+++..+..+.+.++.-++. T Consensus 77 ~i~W~gp~~R------~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:78 77 LIEWVGPMNR------KNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred EEEeecCCCc------eeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 3444221 11 134457899963 3557765 5666666666666665554433 No 180 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=34.82 E-value=0.41 Score=22.73 Aligned_cols=73 Identities=15% Similarity=0.236 Sum_probs=37.6 Q ss_pred CCccccccHHHHHHHHHHHHH-hh--------C--------------------------------------------CEE Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSD-LA--------E--------------------------------------------KEV 27 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~-l~--------~--------------------------------------------~~v 27 (153) |++.+|+.. ++++.|++ ++ . ++| T Consensus 1 msvevkGv~----eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV 76 (133) T protein:vir:96 1 MSVEIKGIP----EVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAV 76 (133) T ss_pred CeEEEecHH----HHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeE Confidence 888887643 33333332 10 0 223 Q ss_pred EEeecCc-CCCCCCCHHHHHHHHhcCCC----CCCCCch--hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 28 EYGFYDE-KHYSGLNMATLAAIHEEGWN----NLPERNF--MFSTSMHFQEGLRKHIKRMHNG 83 (153) Q Consensus 28 ~VGi~~~-~~~dG~~va~iA~~~E~G~~----~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~ 83 (153) +|||... .+ --|--+||||.- .|-||-| ++.+++..+..+.+.++.-++. T Consensus 77 ~i~W~gp~~R------~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:96 77 LIEWVGPMNR------KNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred EEEeecCCCc------eeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 3444221 11 134457899963 3557765 5666666666666665554433 No 181 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=34.35 E-value=0.78 Score=21.16 Aligned_cols=71 Identities=18% Similarity=0.157 Sum_probs=36.0 Q ss_pred CCccccc--cHHHHHHHHH---HHHHhh-CCEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHH Q lcl|NC_020201. 1 MAKKSST--DISELKRYFS---QLSDLA-EKEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLR 74 (153) Q Consensus 1 M~~~i~~--~~~~l~~~~~---~l~~l~-~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~ 74 (153) |....+. |..+-..+-+ -+..+. +..+-++ +.+-+|.-.|||+.+-+|+.|.|.++.+.+.-+. T Consensus 55 ~~~~~~~~~d~~g~~t~~~~~~~i~~~~~g~~iyi~----------Nn~pYA~~LEyG~S~QAP~g~v~~~~~~~~~~v~ 124 (131) T protein:vir:94 55 PADGTTDATDKSGNTATGNATSFVLNAADWHTFTLT----------NNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLN 124 (131) T ss_pred ccccccCCCCCCchhhHHHHHHHHhhccccceEEEe----------eCchhhhhhhccccCCCcchHHHHHHHHHHHHHH Confidence 3332221 1111111111 111111 1111111 2367888999999999999999999776555544 Q ss_pred HHHHHHHHHHH Q lcl|NC_020201. 75 KHIKRMHNGII 85 (153) Q Consensus 75 ~~~~~~~~~~~ 85 (153) +..++ +. T Consensus 125 ~~~~e----~k 131 (131) T protein:vir:94 125 EEASK----VK 131 (131) T ss_pred HHHHh----cC Confidence 44443 33 No 182 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=32.58 E-value=0.87 Score=20.91 Aligned_cols=71 Identities=15% Similarity=0.173 Sum_probs=35.1 Q ss_pred CCccccc--cHHHHHHHHHHHHHhhC----CEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHH Q lcl|NC_020201. 1 MAKKSST--DISELKRYFSQLSDLAE----KEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLR 74 (153) Q Consensus 1 M~~~i~~--~~~~l~~~~~~l~~l~~----~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~ 74 (153) |...... |.++-..+.+....+.+ ..+-++ +.+-+|.-.|||+.+-+|+.|.|.++.+.+.-+. T Consensus 55 ~~~~~~~~~d~~g~~t~~~~~~~i~~~~~g~~iyi~----------Nn~pYA~~LEyG~S~QAP~G~v~~~~~~~~~~v~ 124 (131) T protein:vir:78 55 PADGTTDATDKAGTTATSNAANFVLNAADWHTFTLT----------NNLPYAQRLEYGWSQQAPQGFVRVNVSRFQQLLN 124 (131) T ss_pred ccccccCCCCCCchhhHHHHHHHHhhccCCceEEEe----------eCchhhhHhhccccCCCcchHHHHHHHHHHHHHH Confidence 2222211 11111111111111111 111111 3367888999999999999999999766555444 Q ss_pred HHHHHHHHHHH Q lcl|NC_020201. 75 KHIKRMHNGII 85 (153) Q Consensus 75 ~~~~~~~~~~~ 85 (153) +..++ +. T Consensus 125 ~~~~e----~k 131 (131) T protein:vir:78 125 EEASK----VK 131 (131) T ss_pred HHHHh----cC Confidence 44443 33 No 183 >protein:vir:3994 Length: 168 # NCBI annotation: unknown # Family: family:all:1029 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116502;genbank:gi:14251135;genbank:GeneID:921309 Probab=27.19 E-value=0.26 Score=23.77 Aligned_cols=91 Identities=18% Similarity=0.174 Sum_probs=47.3 Q ss_pred CCc----cccccHHHHHHHHHHHHHhhC-------------------------------CEEEEeecCcCCCCCCC-HHH Q lcl|NC_020201. 1 MAK----KSSTDISELKRYFSQLSDLAE-------------------------------KEVEYGFYDEKHYSGLN-MAT 44 (153) Q Consensus 1 M~~----~i~~~~~~l~~~~~~l~~l~~-------------------------------~~v~VGi~~~~~~dG~~-va~ 44 (153) |++ +||. .+-+.+.+.|..... -.-.|||... ...|+. -|+ T Consensus 22 lt~e~kakIT~--AGAkv~a~~L~~~T~~kHy~~rktg~~~HLADsI~~~~~niDg~~dG~StVGw~~k-~~~~~~~~a~ 98 (168) T protein:vir:39 22 MTVEDKAEVTK--AGAKVFEQALAYEVRNRHYRHRDTGEDPHLADSIVMKNKNIDGVKDGQSVVGWERS-TEKGTHTKGY 98 (168) T ss_pred CCHHHHHHHHH--HhHHHHHHHHHHHhHHhcccCCCCCCCccchhheeecccccCcccCCceeccccCc-cccccccchh Confidence 533 3332 233333333433221 1233444221 112333 589 Q ss_pred HHHHHhcCCC------------------CCCCCchhhHHHHHH--HHHHHHHHHHHHHHHHcCCCHHHHH Q lcl|NC_020201. 45 LAAIHEEGWN------------------NLPERNFMFSTSMHF--QEGLRKHIKRMHNGIIQGRGFSSYL 94 (153) Q Consensus 45 iA~~~E~G~~------------------~IP~RpFlr~~~~~~--~~~~~~~~~~~~~~~~~G~~~~~~L 94 (153) ||.+-+-|+. .||.=+|+..+-.+. ++++-+.....++.+++-...+.-| T Consensus 99 iAr~lNDGTrf~~~~~~~~~~y~~~g~v~i~gDHFvd~~r~~~a~k~aV~~Ae~e~~~eil~~k~~~~~~ 168 (168) T protein:vir:39 99 IANIINNGSRFPQFTTRSGRKYKNPGEVAVHADHFIEETRKNPIVQQGILKAEAEAMRKIINRKKKENNL 168 (168) T ss_pred heehhccccccchhhhhcccccccccceeecccchhHHHhhhhhhhHHHHHHHHHHHHHHHHhcCCCCCC Confidence 9999999983 589999999888764 5565555555555555433222222 No 184 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=26.00 E-value=0.95 Score=20.71 Aligned_cols=76 Identities=9% Similarity=0.173 Sum_probs=40.6 Q ss_pred CCccccccHHHHHHHHHHHHH-hh--------C------------------------------------------CEEEE Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSD-LA--------E------------------------------------------KEVEY 29 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~-l~--------~------------------------------------------~~v~V 29 (153) |++.+++. +++++.|++ ++ . ++|+| T Consensus 1 msvevkGv----~eilk~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~G~r~V~i 76 (133) T protein:vir:78 1 MSVEVTGV----EELERQLVSLFGRENLPQLVDPALIAGATLVAKTLKSEFVQFKDTGASIDEINIEKPSYDKGVRSIKI 76 (133) T ss_pred CeEEEecH----HHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHHhhcchhcccceeeeEEecCeeeeCCceEEEE Confidence 99888764 333333332 11 0 12334 Q ss_pred eecCcCCCCCCCHHHHHHHHhcCCC----CCCCCch--hhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 30 GFYDEKHYSGLNMATLAAIHEEGWN----NLPERNF--MFSTSMHFQEGLRKHIKRMHNGII 85 (153) Q Consensus 30 Gi~~~~~~dG~~va~iA~~~E~G~~----~IP~RpF--lr~~~~~~~~~~~~~~~~~~~~~~ 85 (153) ||.+.. +=--|--+||||.- .|-||-| ++.+++..+..+.+.++.-+...+ T Consensus 77 ~W~gp~-----~R~~iVHLNE~GYtr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~el~k~l 133 (133) T protein:vir:78 77 DWKGPK-----DRYKIIHLNEYGYTRNGKKITPAGTGSVARSLRISERAYRAIVQKKIGDKL 133 (133) T ss_pred EEecCC-----CceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHHHhhC Confidence 442211 00134457899962 3667775 666777777776666666555444 No 185 >protein:vir:96774 Length: 152 # NCBI annotation: hypothetical phage protein # Family: family:all:448 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224253;genbank:gi:62362388;genbank:GeneID:3345713 Probab=22.46 E-value=1.2 Score=20.16 Aligned_cols=74 Identities=15% Similarity=0.077 Sum_probs=33.7 Q ss_pred CCccccccHHHHHHHHHHHHHhhCCEEEEeecCcCCCCCCCHHHHHHHHhcCCCCCCCCchhhHHHHHHHHHHHHHHHHH Q lcl|NC_020201. 1 MAKKSSTDISELKRYFSQLSDLAEKEVEYGFYDEKHYSGLNMATLAAIHEEGWNNLPERNFMFSTSMHFQEGLRKHIKRM 80 (153) Q Consensus 1 M~~~i~~~~~~l~~~~~~l~~l~~~~v~VGi~~~~~~dG~~va~iA~~~E~G~~~IP~RpFlr~~~~~~~~~~~~~~~~~ 80 (153) |....+.+..+-..+.+.-..+.+.+ +| +.-|= ++++-+|.-.|||+.+-+|.-|.|.++.+.+.-+ +++ T Consensus 79 p~~~~~~~~~~~~t~~~~~~~i~~~~--~g---~~iyi-~NnlPYA~~LEyG~S~QAP~G~vr~t~~~~~~~v----~ea 148 (152) T protein:vir:96 79 ITSFEKGISSQSSIMMDLQSDIAKFK--IG---ETLFM-TNPLPYATSIEYGHSSQAPNGVYRPAVRRLVKFL----NTE 148 (152) T ss_pred CCcccccCCCCCchHHHHHHHHhhcc--cc---ceEEE-eeCchhhhHhhccccCCCCchHHHHHHHHHHHHH----HHH Confidence 33222222111111111111111111 11 00000 1235677888999999999999999976655444 443 Q ss_pred HHHHHcCC Q lcl|NC_020201. 81 HNGIIQGR 88 (153) Q Consensus 81 ~~~~~~G~ 88 (153) +++ + T Consensus 149 ~~~----~ 152 (152) T protein:vir:96 149 LKA----K 152 (152) T ss_pred hcc----C Confidence 332 2 Done!