Query         lcl|NC_021342.1_cdsid_YP_008060426.1 [gene=M171_gp23] [protein=hypothetical protein] [protein_id=YP_008060426.1] [location=13296..13889]
Match_columns 197
No_of_seqs    128 out of 201
Neff          6.5
Searched_HMMs 1612
Date          Thu Nov 7 16:42:48 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_23 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_23_vs_rec_db.hhr

 No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
  1 protein:vir:99546 Length: 200 100.0 1.7E-62 1.1E-65 359.2 17.7 185 1-188 8-200 (200)
  2 protein:vir:96105 Length: 193 100.0 3.9E-61 2.4E-64 351.8 18.2 185 2-188 1-193 (193)
  3 protein:vir:80037 Length: 199 100.0 4.3E-54 2.7E-57 313.2 17.9 180 2-190 1-199 (199)
  4 protein:vir:107757 Length: 189 100.0 6.7E-51 4.1E-54 295.7 16.8 149 1-192 1-189 (189)
  5 protein:vir:5257 Length: 148 # 100.0 1.9E-50 1.2E-53 293.2 16.0 144 2-188 1-148 (148)
  6 protein:vir:78607 Length: 155 100.0 5.1E-50 3.2E-53 290.8 15.5 139 2-189 1-155 (155)
  7 protein:vir:106728 Length: 155 100.0 5.6E-50 3.5E-53 290.6 15.5 139 2-189 1-155 (155)
  8 protein:vir:101563 Length: 155 100.0 1.9E-49 1.2E-52 287.6 15.4 139 2-189 1-155 (155)
  9 protein:vir:77650 Length: 155 100.0 1.5E-48 9E-52 282.8 14.7 139 2-189 1-155 (155)
 10 protein:vir:94069 Length: 168 100.0 4.1E-48 2.6E-51 280.3 16.0 151 2-197 1-166 (168)
 11 protein:vir:95260 Length: 160 100.0 1.4E-43 8.9E-47 255.5 14.5 150 1-195 1-160 (160)
 12 protein:vir:99833 Length: 190 98.8 9.1E-12 5.6E-15 81.1 5.7 114 1-134 75-190 (190)
 13 protein:vir:3163 Length: 145 # 98.7 2.6E-11 1.6E-14 78.6 5.5 84 111-197 1-90 (145)
 14 protein:vir:99833 Length: 190 98.6 4.6E-11 2.8E-14 77.3 4.4 91 93-197 1-99 (190)
 15 protein:vir:79091 Length: 175 98.5 3.9E-10 2.4E-13 72.1 5.1 92 89-197 1-116 (175)
 16 protein:vir:103841 Length: 155 98.5 3E-10 1.9E-13 72.8 4.3 92 89-197 1-99 (155)
 17 protein:vir:79225 Length: 155 98.4 4.5E-10 2.8E-13 71.8 4.9 92 89-197 1-99 (155)
 18 protein:vir:99196 Length: 155 98.4 1E-09 6.4E-13 69.8 5.1 92 89-197 1-99 (155)
 19 protein:vir:1988 Length: 156 # 98.3 1.5E-09 9.2E-13 69.0 4.6 89 107-197 1-103 (156)
 20 protein:vir:1838 Length: 149 # 98.1 4.8E-09 3E-12 66.2 4.5 85 1-128 63-149 (149)
 21 protein:vir:101594 Length: 173 98.1 1.7E-08 1E-11 63.2 7.0 131 2-139 1-173 (173)
 22 protein:vir:2026 Length: 150 # 98.1 5.2E-09 3.2E-12 66.0 4.0 85 1-128 53-150 (150)
 23 protein:vir:78858 Length: 115 98.1 9.9E-09 6.1E-12 64.5 5.3 112 2-127 1-115 (115)
 24 protein:vir:9312 Length: 115 # 98.1 9.9E-09 6.1E-12 64.5 5.3 112 2-127 1-115 (115)
 25 protein:vir:103917 Length: 115 98.1 9.9E-09 6.1E-12 64.5 5.3 112 2-127 1-115 (115)
 26 protein:vir:96358 Length: 115 98.1 9.9E-09 6.1E-12 64.5 5.3 112 2-127 1-115 (115)
 27 protein:vir:96225 Length: 115 98.1 9.9E-09 6.1E-12 64.5 5.3 112 2-127 1-115 (115)
 28 protein:vir:97144 Length: 115 98.1 9.9E-09 6.1E-12 64.5 5.3 112 2-127 1-115 (115)
 29 protein:vir:6071 Length: 150 # 98.1 3.9E-09 2.4E-12 66.7 2.9 85 1-128 53-150 (150)
 30 protein:vir:4347 Length: 164 # 98.1 1.7E-08 1.1E-11 63.2 6.3 141 1-146 1-164 (164)
 31 protein:vir:107851 Length: 175 98.1 9.3E-09 5.8E-12 64.6 4.7 92 89-197 1-116 (175)
 32 protein:vir:100312 Length: 152 98.1 6.7E-09 4.2E-12 65.4 3.9 87 1-129 54-152 (152)
 33 protein:vir:5703 Length: 150 # 98.1 4.6E-09 2.8E-12 66.3 3.0 85 1-128 53-150 (150)
 34 protein:vir:106623 Length: 115 98.1 8.3E-09 5.1E-12 64.9 4.3 84 2-127 1-115 (115)
 35 protein:vir:93617 Length: 148 98.0 2.2E-08 1.4E-11 62.6 5.5 126 1-138 1-148 (148)
 36 protein:vir:100243 Length: 140 98.0 4.2E-08 2.6E-11 61.0 6.8 122 1-130 1-140 (140)
 37 protein:vir:1988 Length: 156 # 98.0 1.3E-08 8.3E-12 63.8 4.0 80 1-138 76-156 (156)
 38 protein:vir:79179 Length: 155 98.0 1.1E-08 6.6E-12 64.3 3.3 85 1-128 54-155 (155)
 39 protein:vir:4906 Length: 114 # 98.0 1.6E-08 1E-11 63.3 4.1 86 1-128 1-114 (114)
 40 protein:vir:2740 Length: 114 # 98.0 1.6E-08 1E-11 63.3 4.1 86 1-128 1-114 (114)
 41 protein:vir:79115 Length: 148 97.9 1.2E-08 7.7E-12 63.9 3.3 84 1-128 53-148 (148)
 42 protein:vir:98557 Length: 149 97.9 2.5E-08 1.6E-11 62.2 4.6 84 1-128 53-149 (149)
 43 protein:vir:99744 Length: 115 97.9 2.7E-08 1.7E-11 62.1 4.7 84 2-127 1-115 (115)
 44 protein:vir:1273 Length: 127 # 97.9 5.9E-08 3.7E-11 60.2 6.3 112 1-131 1-127 (127)
 45 protein:vir:1891 Length: 179 # 97.9 8.5E-08 5.3E-11 59.3 6.7 136 1-161 1-179 (179)
 46 protein:vir:1164 Length: 156 # 97.9 2.8E-08 1.7E-11 62.0 4.0 89 1-132 54-156 (156)
 47 protein:vir:80362 Length: 140 97.9 5.5E-08 3.4E-11 60.4 5.6 123 1-155 1-140 (140)
 48 protein:vir:103841 Length: 155 97.9 4.1E-08 2.6E-11 61.1 4.8 76 1-130 75-155 (155)
 49 protein:vir:100075 Length: 140 97.9 8E-08 5E-11 59.5 6.3 120 1-130 1-140 (140)
 50 protein:vir:1437 Length: 140 # 97.8 1.1E-07 6.9E-11 58.7 6.7 121 1-130 1-140 (140)
 51 protein:vir:102875 Length: 146 97.8 8.5E-08 5.3E-11 59.3 6.0 122 1-136 1-146 (146)
 52 protein:vir:105007 Length: 146 97.8 8.5E-08 5.3E-11 59.3 6.0 122 1-136 1-146 (146)
 53 protein:vir:107568 Length: 146 97.8 8.5E-08 5.3E-11 59.3 6.0 122 1-136 1-146 (146)
 54 protein:vir:102085 Length: 146 97.8 8.5E-08 5.3E-11 59.3 6.0 122 1-136 1-146 (146)
 55 protein:vir:3163 Length: 145 # 97.8 4.2E-08 2.6E-11 61.0 4.2 75 1-133 65-145 (145)
 56 protein:vir:95789 Length: 114 97.8 4.7E-08 2.9E-11 60.7 4.2 89 1-131 1-114 (114)
 57 protein:vir:106570 Length: 182 97.8 1.9E-07 1.2E-10 57.5 7.4 134 1-141 1-182 (182)
 58 protein:vir:107851 Length: 175 97.8 2.9E-08 1.8E-11 61.9 2.9 81 1-129 76-175 (175)
 59 protein:vir:1386 Length: 149 # 97.8 1E-07 6.5E-11 58.8 5.8 126 1-153 1-149 (149)
 60 protein:vir:99196 Length: 155 97.8 4.8E-08 3E-11 60.7 3.7 76 1-130 75-155 (155)
 61 protein:vir:79091 Length: 175 97.7 1.4E-08 8.9E-12 63.6 0.4 81 1-129 83-175 (175)
 62 protein:vir:79225 Length: 155 97.7 9.1E-08 5.7E-11 59.2 4.5 76 1-130 75-155 (155)
 63 protein:vir:9930 Length: 108 # 97.7 1.5E-07 9.1E-11 58.1 5.6 83 4-128 1-108 (108)
 64 protein:vir:94654 Length: 142 97.7 2.4E-07 1.5E-10 56.9 6.5 109 1-126 1-142 (142)
 65 protein:vir:94796 Length: 137 97.7 1.4E-07 8.9E-11 58.1 5.1 110 1-123 1-137 (137)
 66 protein:vir:105330 Length: 137 97.7 1.4E-07 8.6E-11 58.2 4.9 109 1-123 1-137 (137)
 67 protein:vir:194 Length: 149 # 97.6 1.6E-07 1E-10 57.8 4.6 128 1-138 1-149 (149)
 68 protein:vir:98557 Length: 149 97.6 4.6E-07 2.9E-10 55.3 7.1 86 111-197 1-98 (149)
 69 protein:vir:96486 Length: 112 97.6 1.4E-07 8.6E-11 58.2 3.6 84 1-126 1-112 (112)
 70 protein:vir:94538 Length: 125 97.5 1.5E-07 9.1E-11 58.0 3.2 91 1-133 1-125 (125)
 71 protein:vir:94108 Length: 149 97.5 3E-07 1.9E-10 56.4 4.5 110 1-123 13-149 (149)
 72 protein:vir:93738 Length: 137 97.5 4E-07 2.5E-10 55.7 5.1 110 1-123 1-137 (137)
 73 protein:vir:97427 Length: 137 97.5 4E-07 2.5E-10 55.7 5.1 110 1-123 1-137 (137)
 74 protein:vir:94490 Length: 137 97.5 4E-07 2.5E-10 55.7 5.1 110 1-123 1-137 (137)
 75 protein:vir:96829 Length: 135 97.5 4.8E-07 3E-10 55.2 5.3 108 1-123 1-135 (135)
 76 protein:vir:105916 Length: 149 97.4 3.5E-07 2.2E-10 56.0 4.4 109 1-123 13-149 (149)
 77 protein:vir:743 Length: 108 # 97.4 1.4E-07 8.8E-11 58.1 2.2 84 2-127 1-108 (108)
 78 protein:vir:98409 Length: 108 97.4 1.3E-07 8.2E-11 58.3 2.0 82 2-127 1-108 (108)
 79 protein:vir:107099 Length: 137 97.4 6.5E-07 4E-10 54.5 5.7 108 1-123 1-137 (137)
 80 protein:vir:2026 Length: 150 # 97.4 1.3E-06 8.2E-10 52.8 7.2 86 111-197 1-99 (150)
 81 protein:vir:9708 Length: 125 # 97.4 2.9E-07 1.8E-10 56.4 3.5 79 1-132 15-125 (125)
 82 protein:vir:96121 Length: 137 97.4 3.8E-07 2.3E-10 55.8 4.0 109 1-123 1-137 (137)
 83 protein:vir:95894 Length: 137 97.4 7.1E-07 4.4E-10 54.3 5.1 110 1-123 1-137 (137)
 84 protein:vir:105089 Length: 133 97.3 1.2E-06 7.2E-10 53.1 5.6 115 1-133 1-133 (133)
 85 protein:vir:6071 Length: 150 # 97.3 2.6E-06 1.6E-09 51.2 7.4 86 111-197 1-99 (150)
 86 protein:vir:5703 Length: 150 # 97.2 3E-06 1.9E-09 50.8 7.4 86 111-197 1-99 (150)
 87 protein:vir:5978 Length: 144 # 97.1 3E-06 1.9E-09 50.8 6.4 110 1-127 1-144 (144)
 88 protein:vir:3873 Length: 128 # 97.1 7.3E-07 4.5E-10 54.2 2.9 79 1-131 22-128 (128)
 89 protein:vir:5745 Length: 135 # 96.9 2.7E-06 1.7E-09 51.1 4.6 111 1-141 20-135 (135)
 90 protein:vir:3617 Length: 112 # 96.9 2.1E-06 1.3E-09 51.7 3.9 85 1-127 4-112 (112)
 91 protein:vir:97088 Length: 157 96.9 1.9E-06 1.1E-09 52.0 3.6 106 1-134 7-157 (157)
 92 protein:vir:79988 Length: 125 96.7 2.2E-06 1.4E-09 51.6 2.3 79 1-131 17-125 (125)
 93 protein:vir:81106 Length: 125 96.7 2.2E-06 1.4E-09 51.6 2.3 79 1-131 17-125 (125)
 94 protein:vir:4704 Length: 125 # 96.7 2.2E-06 1.4E-09 51.6 2.3 79 1-131 17-125 (125)
 95 protein:vir:98342 Length: 125 96.7 2.2E-06 1.4E-09 51.6 2.3 79 1-131 17-125 (125)
 96 protein:vir:9414 Length: 125 # 96.7 2.2E-06 1.4E-09 51.6 2.3 79 1-131 17-125 (125)
 97 protein:vir:99101 Length: 142 96.6 3E-06 1.9E-09 50.8 2.9 105 1-124 1-142 (142)
 98 protein:vir:8669 Length: 142 # 96.6 3E-06 1.9E-09 50.8 2.9 105 1-124 1-142 (142)
 99 protein:vir:78077 Length: 141 96.4 1.2E-05 7.3E-09 47.6 4.7 114 1-130 10-141 (141)
100 protein:vir:1838 Length: 149 # 96.1 6.4E-05 3.9E-08 43.6 7.4 86 111-197 1-98 (149)
101 protein:vir:79115 Length: 148 96.0 8E-05 5E-08 43.0 7.3 86 111-197 1-97 (148)
102 protein:vir:95062 Length: 116 96.0 1.7E-05 1E-08 46.8 3.5 102 1-123 1-116 (116)
103 protein:vir:97327 Length: 116 95.9 2.2E-05 1.4E-08 46.1 4.1 108 1-123 1-116 (116)
104 protein:vir:1243 Length: 116 # 95.9 2.2E-05 1.4E-08 46.1 4.1 108 1-123 1-116 (116)
105 protein:vir:102441 Length: 137 95.8 1.4E-05 8.7E-09 47.2 2.6 103 1-122 14-137 (137)
106 protein:vir:102154 Length: 119 95.5 4.9E-05 3.1E-08 44.2 4.5 89 1-131 1-119 (119)
107 protein:vir:81147 Length: 126 95.5 7.3E-05 4.5E-08 43.3 5.4 93 1-130 1-126 (126)
108 protein:vir:3787 Length: 231 # 95.2 0.00016 9.8E-08 41.4 6.3 122 1-131 59-231 (231)
109 protein:vir:78755 Length: 228 95.1 0.00017 1.1E-07 41.2 6.0 132 1-138 56-228 (228)
110 protein:vir:3750 Length: 227 # 94.6 0.00019 1.2E-07 41.0 5.0 121 1-130 59-227 (227)
111 protein:vir:100312 Length: 152 94.4 0.00067 4.2E-07 38.0 7.6 87 111-197 1-99 (152)
112 protein:vir:79179 Length: 155 94.3 0.00058 3.6E-07 38.3 7.0 87 111-197 1-104 (155)
113 protein:vir:97982 Length: 140 93.7 0.00014 8.8E-08 41.7 2.6 102 1-121 1-140 (140)
114 protein:vir:107545 Length: 140 93.7 0.00014 8.8E-08 41.7 2.6 102 1-121 1-140 (140)
115 protein:vir:1164 Length: 156 # 93.6 0.0014 8.7E-07 36.2 7.7 87 111-197 1-101 (156)
116 protein:vir:106506 Length: 137 92.7 0.00017 1.1E-07 41.2 1.5 106 1-131 5-137 (137)
117 protein:vir:106041 Length: 137 92.3 0.00016 9.9E-08 41.4 0.7 102 1-122 4-137 (137)
118 protein:vir:94490 Length: 137 92.1 0.00041 2.5E-07 39.1 2.7 67 107-197 1-68 (137)
119 protein:vir:97427 Length: 137 92.1 0.00041 2.5E-07 39.1 2.7 67 107-197 1-68 (137)
120 protein:vir:93738 Length: 137 92.1 0.00041 2.5E-07 39.1 2.7 67 107-197 1-68 (137)
121 protein:vir:98860 Length: 230 91.6 0.002 1.2E-06 35.4 5.9 121 1-130 61-230 (230)
122 protein:vir:9879 Length: 127 # 89.4 0.0026 1.6E-06 34.8 4.5 91 4-128 1-127 (127)
123 protein:vir:96829 Length: 135 87.7 0.002 1.3E-06 35.3 2.8 68 89-197 1-68 (135)
124 protein:vir:106570 Length: 182 85.6 0.00074 4.6E-07 37.8 -0.8 72 89-197 1-75 (182)
125 protein:vir:95894 Length: 137 85.2 0.0034 2.1E-06 34.1 2.7 67 107-197 1-68 (137)
126 protein:vir:94654 Length: 142 84.8 0.0029 1.8E-06 34.5 2.1 71 89-197 1-72 (142)
127 protein:vir:98636 Length: 138 84.8 0.013 8.1E-06 30.9 5.7 118 1-132 10-138 (138)
128 protein:vir:106506 Length: 137 84.1 0.0052 3.2E-06 33.1 3.2 65 92-197 1-66 (137)
129 protein:vir:9647 Length: 132 # 83.6 0.013 8.1E-06 30.9 5.2 121 1-132 4-132 (132)
130 protein:vir:1243 Length: 116 # 83.5 0.0048 3E-06 33.3 2.7 49 132-197 1-55 (116)
131 protein:vir:97327 Length: 116 83.5 0.0048 3E-06 33.3 2.7 49 132-197 1-55 (116)
132 protein:vir:96121 Length: 137 83.0 0.0033 2.1E-06 34.2 1.7 69 107-197 1-70 (137)
133 protein:vir:105916 Length: 149 82.7 0.017 1.1E-05 30.2 5.5 79 89-197 1-80 (149)
134 protein:vir:81067 Length: 119 81.2 0.007 4.3E-06 32.4 2.7 104 1-134 8-119 (119)
135 protein:vir:966 Length: 123 # 81.2 0.038 2.4E-05 28.4 6.8 91 1-128 1-123 (123)
136 protein:vir:10367 Length: 119 80.5 0.0079 4.9E-06 32.1 2.8 104 1-134 8-119 (119)
137 protein:vir:94796 Length: 137 80.5 0.007 4.3E-06 32.4 2.5 67 107-197 1-68 (137)
138 protein:vir:5978 Length: 144 # 80.1 0.0091 5.7E-06 31.8 3.0 70 102-197 1-73 (144)
139 protein:vir:101594 Length: 173 79.9 0.017 1E-05 30.4 4.3 67 107-197 1-68 (173)
140 protein:vir:100887 Length: 139 78.7 0.0084 5.2E-06 32.0 2.3 108 1-134 25-139 (139)
141 protein:vir:9930 Length: 108 # 77.4 0.015 9.2E-06 30.6 3.3 66 108-197 1-71 (108)
142 protein:vir:95062 Length: 116 77.1 0.012 7.6E-06 31.1 2.7 47 132-197 1-47 (116)
143 protein:vir:107099 Length: 137 76.3 0.0089 5.5E-06 31.8 1.7 65 107-197 1-68 (137)
144 protein:vir:105467 Length: 144 75.4 0.022 1.4E-05 29.6 3.7 108 1-134 1-144 (144)
145 protein:vir:105330 Length: 137 75.2 0.0081 5E-06 32.0 1.2 67 107-197 1-71 (137)
146 protein:vir:80116 Length: 127 72.0 0.043 2.7E-05 28.1 4.4 93 1-131 1-127 (127)
147 protein:vir:95372 Length: 124 71.3 0.061 3.8E-05 27.2 5.0 90 1-128 1-124 (124)
148 protein:vir:96358 Length: 115 69.4 0.02 1.2E-05 29.9 1.9 70 107-197 1-71 (115)
149 protein:vir:103917 Length: 115 69.4 0.02 1.2E-05 29.9 1.9 70 107-197 1-71 (115)
150 protein:vir:96225 Length: 115 69.4 0.02 1.2E-05 29.9 1.9 70 107-197 1-71 (115)
151 protein:vir:9312 Length: 115 # 69.4 0.02 1.2E-05 29.9 1.9 70 107-197 1-71 (115)
152 protein:vir:78858 Length: 115 69.4 0.02 1.2E-05 29.9 1.9 70 107-197 1-71 (115)
153 protein:vir:97144 Length: 115 69.4 0.02 1.2E-05 29.9 1.9 70 107-197 1-71 (115)
154 protein:vir:4859 Length: 140 # 69.2 0.016 9.7E-06 30.5 1.3 84 1-142 29-140 (140)
155 protein:vir:5000 Length: 141 # 69.1 0.023 1.4E-05 29.5 2.2 80 1-131 26-141 (141)
156 protein:vir:100223 Length: 139 67.0 0.02 1.2E-05 29.9 1.4 100 1-134 25-139 (139)
157 protein:vir:100652 Length: 134 61.8 0.062 3.9E-05 27.2 3.1 85 1-129 1-134 (134)
158 protein:vir:102963 Length: 163 52.1 0.42 0.00026 22.7 5.9 94 1-131 20-163 (163)
159 protein:vir:4460 Length: 170 # 51.9 0.029 1.8E-05 29.0 -0.4 79 101-197 1-80 (170)
160 protein:vir:99528 Length: 92 # 51.3 0.099 6.1E-05 26.1 2.4 70 102-197 1-80 (92)
161 protein:vir:9513 Length: 134 # 49.0 0.42 0.00026 22.7 5.4 85 1-129 2-134 (134)
162 protein:vir:101302 Length: 134 49.0 0.42 0.00026 22.7 5.4 85 1-129 2-134 (134)
163 protein:vir:96012 Length: 133 40.0 0.9 0.00056 20.8 5.8 85 1-130 1-133 (133)
164 protein:vir:487 Length: 187 # 36.8 0.57 0.00035 21.9 4.1 81 107-197 1-93 (187)
165 protein:vir:79034 Length: 141 36.5 0.92 0.00057 20.8 5.2 96 1-132 1-141 (141)
166 protein:vir:6246 Length: 143 # 32.1 0.13 8.1E-05 25.4 -0.1 124 1-138 8-143 (143)
167 protein:vir:1332 Length: 143 # 30.2 0.14 8.8E-05 25.2 -0.3 120 1-138 8-143 (143)
168 protein:vir:78335 Length: 133 30.1 1.1 0.0007 20.3 4.6 84 1-130 24-133 (133)
169 protein:vir:2688 Length: 123 # 27.4 1.7 0.001 19.3 5.1 82 1-128 14-123 (123)
170 protein:vir:93898 Length: 133 24.9 1.9 0.0012 19.1 4.8 82 1-128 24-133 (133)
171 protein:vir:94994 Length: 131 22.1 2.5 0.0015 18.4 6.7 71 1-126 36-131 (131)
172 protein:vir:96973 Length: 133 21.7 2.4 0.0015 18.5 4.8 82 1-128 24-133 (133)
173 protein:vir:94419 Length: 133 21.7 2.4 0.0015 18.5 4.8 82 1-128 24-133 (133)
174 protein:vir:78644 Length: 133 21.7 2.4 0.0015 18.5 4.8 82 1-128 24-133 (133)
175 protein:vir:9363 Length: 133 # 21.7 2.4 0.0015 18.5 4.8 82 1-128 24-133 (133)
176 protein:vir:4514 Length: 168 # 21.1 0.32 0.0002 23.3 -0.2 78 89-197 1-79 (168)
177 protein:vir:104347 Length: 145 20.3 2.8 0.0017 18.1 5.0 72 1-126 44-145 (145)

No 1
>protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241
Probab=100.00 E-value=1.7e-62 Score=359.18 Aligned_cols=185 Identities=25% Similarity=0.291 Sum_probs=169.4

Q ss_pred CcccccHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCC----CccchHHHHhhHhHcCceeeeCCCceeeecccccccc
Q
lcl|NC_021342. 1 MMKVVGLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVE----SGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESR 76 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~----~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~ 76 (197) =.|+++++++.+.|++|.+ +++++|+|||+++++|++ ++++++|+||+|||||++|+||++++|++.....+.. T Consensus 8 ~~k~~~~~~~~~~~~~l~~--l~~~~v~vGi~~~~~y~~~~~~~dG~~va~IA~~~EfG~~i~~p~~~~~~~~~~~~g~~ 85 (200) T protein:vir:99 8 SNSVAAPLKHFQMLKQFDA--LKGKTVQAGWFETDRYPAKEGETIGPLVAKIARQLEFGGVINHPGGTKYIKDAIVDGRY 85 (200) T ss_pred eeeeecchHHHHHHHHHHH--hhCCeEEEEEcCCCCcCCcccccccchHHHHHhHHHcCCeeccCCCccccccccccccc Confidence 2466788888888777733 478999999999999873 4579999999999999999999999999999999999 Q ss_pred cCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHH----HHHHccCcHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 77 KEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIE----RGASRDESTTSILEKVGVTAQAAVRMFMT 152 (197) Q Consensus 77 ~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~----~~~~~~~~~~~~l~~iG~~~~~~i~~~I~ 152 (197) .+++|+++ ++|.+.+++++|+++||||||||+|+++++++|.++++ +++.|++|++++|+++|+.++++||++|+ T Consensus 86 ~g~rfv~k-~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~l~g~~~~~~~L~~~G~~~~~~ik~~I~ 164 (200) T protein:vir:99 86 VGTRFVHK-SFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQAQIARQLLDGTINPEQALAQIGLALEGCIVRSIK 164 (200) T ss_pred cccccccc-cccceeeeeccccccCCCcchhhHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHh Confidence 99999875 68999999999999999999999999999999998765 45678999999999999999999999999 Q ss_pred hCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeee Q lcl|NC_021342. 
153 ELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTYVVH 188 (197) Q Consensus 153 ~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty~V~ 188 (197) ++.||||||+||++||||+||||||+|++||||+|+ T Consensus 165 ~~~~ppna~sTi~~Kg~~~PLidTG~l~~SIty~Ve 200 (200) T protein:vir:99 165 SGPWAANSPATIRAKGFDKPLIDTAHMWQTVSSKVS 200 (200) T ss_pred cCCCCCChHHHHHHhCCCCchHHHHHHHhHhccccC Confidence 999999999999999999999999999999999999 No 2 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=100.00 E-value=3.9e-61 Score=351.77 Aligned_cols=185 Identities=28% Similarity=0.363 Sum_probs=163.3 Q ss_pred cccccHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCC----ccchHHHHhhHhHcCceeeeCCCceeeeccccccccc Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVES----GEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRK 77 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~----~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~ 77 (197) |+|++..+-.++|.+.+++| ++++|+|||++++.|++. +++++|+||+|||||++|+||+++++++.....+... T Consensus 1 m~~~~~~~~~~~~~~~l~~l-~~~~v~vGi~~~~~~~~~~~~~~G~~va~iAai~EfG~~I~~~~~~~~~~~~~~~g~~~ 79 (193) T protein:vir:96 1 MSLRRDSELIAAHLQMLRAM-RGRSVSAGWYSTARYPDKAGGSVGIQVARIARLNEYGGTIDHPGGTRYIRDAIVRGRFV 79 (193) T ss_pred CeeccchHHHHHHHHHHHHh-cCCeEEEEEcCCCCCCCcccccccchHHHHHhHHHcCCccccCccceeeeecccccccc Confidence 78886654333333333444 689999999999988763 3689999999999999999999999999999999999 Q ss_pred CCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHH----HHHccCcHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021342. 
78 EVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIER----GASRDESTTSILEKVGVTAQAAVRMFMTE 153 (197) Q Consensus 78 ~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~----~~~~~~~~~~~l~~iG~~~~~~i~~~I~~ 153 (197) +.+|++++.+ .+.+++++|+++||||||||+|+++++++|.+.+++ ++.|+.|++++|+++|+.++++||++|++ T Consensus 80 ~~~~~k~~~~-~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~~G~~~~~~ik~~I~~ 158 (193) T protein:vir:96 80 GVRFVRNDFP-GETEVTKPHRITIPARPFMRYAWNLFSADRAAIQNRIAMRLARGQITPDQALAQIGLALEGYIARSIRT 158 (193) T ss_pred ccceeccCcc-eeeEeecceeccCCCcchhhhhHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhc Confidence 9999976543 567899999999999999999999999999987654 56688999999999999999999999999 Q ss_pred CCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeee Q lcl|NC_021342. 154 LQDPPNAKSTIRKKGSSNPLIDTGALRQSVTYVVH 188 (197) Q Consensus 154 ~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty~V~ 188 (197) +.||||||+||++||||+||||||+|++||||+|+ T Consensus 159 ~~~ppna~~Ti~~KG~~~PLidTG~l~~SIty~Vv 193 (193) T protein:vir:96 159 GPWVANSASTVRRKGFNRPLVDTAHMLQSISSRVT 193 (193) T ss_pred CCCCCCcHHHHHHhCCCCchhHHHHHHhhhcceeC Confidence 99999999999999999999999999999999999 No 3 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=100.00 E-value=4.3e-54 Score=313.15 Aligned_cols=180 Identities=24% Similarity=0.357 Sum_probs=142.3 Q ss_pred cccccHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeeccccccc-----c Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAES-----R 76 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~-----~ 76 (197) |+|+..+....+|.+-++.| ++++|+|||+.++ +..+|+||.+|||||+|+||++ +++++...+. . 
T Consensus 1 m~vt~~~~~~~~~~~~l~~L-~~k~v~vGi~~~d------~~~~~~Ia~~~E~Ga~I~~~~~--~l~Ip~~~a~~~k~~~ 71 (199) T protein:vir:80 1 MKVTTDKSTMNKAIRELDQL-DRYSLQIGLFGED------DSFIQMIAGVHEFGLTIRPKGK--YLTIPTPEAGDRRARD 71 (199) T ss_pred CcccccHHHHHHHHHHHHHh-cCCEEEEEEecCC------CcchhheeehhhcCCeeecCCc--eeeecchhhhcccccc Confidence 88875544333333333333 7899999999543 3469999999999999999975 4444332221 1 Q ss_pred cCCcccccc---------ccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHH----HHHccCcHHHHHHHHHHHH Q lcl|NC_021342. 77 KEVRFLKTG---------TGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIER----GASRDESTTSILEKVGVTA 143 (197) Q Consensus 77 ~~~~f~k~~---------~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~----~~~~~~~~~~~l~~iG~~~ 143 (197) .+..|..++ .+...+++++.++++||||||||+|+++++++|.+++++ ++.|+.+++++|+++|+.+ T Consensus 72 ~~~~~~p~g~~~~~~~~~~~~~~~~e~g~~~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~vl~g~~~a~~~L~~~G~~~ 151 (199) T protein:vir:80 72 IPGLFKPKGKNILAVAGPDGKLTVMFYLKTEVNIPERSFLRSTFDEKSNKWGELFEGWIDDVIHGKLSAEQVYNRLGAKI 151 (199) T ss_pred cCcccccCCcceeeeeccccceeeeeeccccccCCCCchhHHHHHHHHHHHHHHHHHHHHHHHhCCCcHHHHHHHHHHHH Confidence 122222222 233456789999999999999999999999999998765 4567999999999999999 Q ss_pred HHHHHHHHHhCCCCCCcHHHHH-hcCCCCchhHHHHHHhhceeeeecc Q lcl|NC_021342. 
144 QAAVRMFMTELQDPPNAKSTIR-KKGSSNPLIDTGALRQSVTYVVHSG 190 (197) Q Consensus 144 ~~~i~~~I~~~~~ppna~~Ti~-~KG~~~PLidTG~L~~SIty~V~~k 190 (197) +++||++|+++.||||||+||+ +||||+||||||+|++||+|+|++- T Consensus 152 ~~~Ik~~I~~~~~ppna~~Tia~rKg~~kPLidTG~l~~SIty~V~~~ 199 (199) T protein:vir:80 152 VDDIQMKIVEIQTPAKSAATLARNPRKNNPLIVTGKMKNSVTWKVMKS 199 (199) T ss_pred HHHHHHHHhccCCCCCCHHHHHHhcCCCCchHHHHHHHhhcceeeeeC Confidence 9999999999999999999997 8999999999999999999999998 No 4 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=100.00 E-value=6.7e-51 Score=295.67 Aligned_cols=149 Identities=23% Similarity=0.362 Sum_probs=132.7 Q ss_pred Ccccc-cHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccCC Q lcl|NC_021342. 1 MMKVV-GLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEV 79 (197) Q Consensus 1 M~ki~-~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~ 79 (197) |-.+. +.++..++|++.++.| ++++|+|||+++++|+| ++++|+||+|||||. T Consensus 1 M~~~i~~~~~~~~~L~~~lk~l-~~k~V~VGi~~~~~y~d--G~~vA~Ia~~~E~G~----------------------- 54 (189) T protein:vir:10 1 MGRVIRKQGPARVKLNAFIKGM-NDYSVRIGWFSTAKYPD--GTPTAYVASIHEFGA----------------------- 54 (189) T ss_pred CcceeccCcHHHHHHHHHHHHh-hCCeEEEEecCCCCCCC--cccHHHHHHHHHhcC----------------------- Confidence 55443 5567778888888877 67899999999988875 489999999999994 Q ss_pred ccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHH----HHHccCcHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_021342. 80 RFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIER----GASRDESTTSILEKVGVTAQAAVRMFMTELQ 155 (197) Q Consensus 80 ~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~----~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~ 155 (197) +..+||||||||+|+++++++|.+++++ ++.|+++++++|+++|+.++++||.+|+++. 
T Consensus 55 -----------------p~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~vl~G~~~~~~~L~~~G~~a~~~Ik~~I~~~~ 117 (189) T protein:vir:10 55 -----------------PSRGIPARSFIRPTIAAQQAAWSQQMRFYAKQIVVGQMNVEQALEGLAIVARGDVDATLARLK 117 (189) T ss_pred -----------------cCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHhcCC Confidence 3568999999999999999999997654 5567999999999999999999999999999 Q ss_pred CCCCcHHHHHhcC-----------------------------------CCCchhHHHHHHhhceeeeecccc Q lcl|NC_021342. 156 DPPNAKSTIRKKG-----------------------------------SSNPLIDTGALRQSVTYVVHSGKL 192 (197) Q Consensus 156 ~ppna~~Ti~~KG-----------------------------------~~~PLidTG~L~~SIty~V~~k~~ 192 (197) ||||||+||++|| |++||||||+|++||||+|+++++ T Consensus 118 ~ppna~sTi~~Kg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~kPLidTG~l~~SIty~V~~k~~ 189 (189) T protein:vir:10 118 DPPLSPLTIYIRKFIKDGGVIHGYKDIMRLRSEMQQEQAKGTLNLSGVSTDPLDFTGYMRATLSYTVTKEKS 189 (189) T ss_pred CCCCcHHHHHHhcccCcccchhhhhhhhhhhhhhhhhhhhccccccccCCCchhhHHHHHhhcceeeeecCC Confidence 9999999999999 479999999999999999999998 No 5 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=100.00 E-value=1.9e-50 Score=293.23 Aligned_cols=144 Identities=35% Similarity=0.495 Sum_probs=122.2 Q ss_pred cccccHHHHHHHHHHHHHHHh--cCCeEEEEeecCC--CCCCCccchHHHHhhHhHcCceeeeCCCceeeeccccccccc Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQIR--DDQYVTVGIHEAA--GDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRK 77 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~--~~~~V~VGi~~~~--~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~ 77 (197) |.++-.. -...|++|.+++. ++++|+|||+++. 
...+++++++|+||+||||| T Consensus 1 M~~~~k~-~~~~~~~l~~~l~~l~~~~v~VGi~~~~~~~~~~~~g~~vA~ia~~~E~G---------------------- 57 (148) T protein:vir:52 1 MAVTVTA-NFSAAKQLIEQMKSLKEKAVYVGFPAEFDEKVKGSENFNLASLAAVLEFG---------------------- 57 (148) T ss_pred Ccccccc-ccHHHHHHHHHHHHhhCCeEEEEeecCcCCCCCCCCCCCHHHHHHHHhcC---------------------- Confidence 4333111 1122333333332 5789999999642 33456679999999999999 Q ss_pred CCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_021342. 78 EVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTELQDP 157 (197) Q Consensus 78 ~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~p 157 (197) +.+||+|||||+|+++++++|.+++.++++++++++++|+++|+.++++||++|+++.+| T Consensus 58 --------------------~~~IP~Rpflr~t~~~~~~~~~~~~~~~~~~~~~~~~~L~~~G~~~~~~ik~~I~~~~~p 117 (148) T protein:vir:52 58 --------------------NEHIPARPFLRQTLEENQEKYTALFIQWFDQGVPAAQIYERLSVMAQGDVQMNIVKGEWV 117 (148) T ss_pred --------------------CCCCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHhcCCCC Confidence 468999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcHHHHHhcCCCCchhHHHHHHhhceeeee Q lcl|NC_021342. 158 PNAKSTIRKKGSSNPLIDTGALRQSVTYVVH 188 (197) Q Consensus 158 pna~~Ti~~KG~~~PLidTG~L~~SIty~V~ 188 (197) ||||+||++||||+||||||+|++||||+|+ T Consensus 118 pna~sTi~~Kg~~~PLidTG~l~~SIty~V~ 148 (148) T protein:vir:52 118 ANAKSTIRRKKSSKPLIDTGKMRQSVRGIVK 148 (148) T ss_pred CCcHHHHHhcCCCCchhHHHHHHHHhhhhcC Confidence 9999999999999999999999999999999 No 6 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=100.00 E-value=5.1e-50 Score=290.82 Aligned_cols=139 Identities=21% Similarity=0.294 Sum_probs=127.9 Q ss_pred cccccHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCC----------------ccchHHHHhhHhHcCceeeeCCCce Q lcl|NC_021342. 
2 MKVVGLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVES----------------GEINMATLGAVLNFGAEIDHPGGTS 65 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~----------------~~~~~A~iA~~~EfGa~I~~p~~~~ 65 (197) |+|. .+.|+++.+++ ++++|+|||+++++|+|+ +++++|+||+||||| T Consensus 1 m~v~-----~k~L~~~~~~l-~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G---------- 64 (155) T protein:vir:78 1 MSVT-----RRGLTLPKDRY-RSMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYG---------- 64 (155) T ss_pred Ccch-----HHHHHHHHHHH-hCCeeEEeecCCCCCCcccchhhhhhhhcccccccCCcHHHHHHhhhcC---------- Confidence 6666 34477777776 468999999999999874 269999999999999 Q ss_pred eeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHH Q lcl|NC_021342. 66 YGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQA 145 (197) Q Consensus 66 ~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~ 145 (197) +++||||||||+|+++++++|.+.+.+++.++++++++|+++|+.+++ T Consensus 65 --------------------------------~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~~G~~~~~ 112 (155) T protein:vir:78 65 --------------------------------TSKLPARPFMEKTITDRSAEWIKGLTVMMTMGYDAEVAMGQIGQAMKD 112 (155) T ss_pred --------------------------------CCCCCCcchhhHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHH Confidence 579999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeeec Q lcl|NC_021342. 146 AVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTYVVHS 189 (197) Q Consensus 146 ~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty~V~~ 189 (197) +||++|+++. |||||+||++||||+||||||+|++||||+|+. 
T Consensus 113 ~Ik~~I~~~~-~pna~~Ti~~Kg~~kPLidTG~l~~SIty~V~~ 155 (155) T protein:vir:78 113 DIKTTISEWP-ADNSADWAGKKGFNHGLIWTSHLLNSVEQEIVK 155 (155) T ss_pred HHHHHHhcCC-CCCcHHHHHhcCCCCchhHHHHHHHhhhhhccC Confidence 9999999986 999999999999999999999999999999998 No 7 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=100.00 E-value=5.6e-50 Score=290.61 Aligned_cols=139 Identities=21% Similarity=0.293 Sum_probs=127.9 Q ss_pred cccccHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCC----------------ccchHHHHhhHhHcCceeeeCCCce Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVES----------------GEINMATLGAVLNFGAEIDHPGGTS 65 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~----------------~~~~~A~iA~~~EfGa~I~~p~~~~ 65 (197) |+|. .+.|+++.+++ ++++|+|||+++++|+|+ +++++|+||+||||| T Consensus 1 m~v~-----~k~L~~~~~~l-~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~g~~va~ia~~~E~G---------- 64 (155) T protein:vir:10 1 MSVT-----RRGLTLPKDRY-RSMSVKAGVLAGATYPDESGKKLADGTILTKDPRAGLPVAMIAMALNYG---------- 64 (155) T ss_pred Ccch-----HHHHHHHHHHH-hCCeeEEeecCCCCCccccchhhhhhhhcccccccCCcHHHHHHHHhcC---------- Confidence 6666 34477777776 468999999999999873 369999999999999 Q ss_pred eeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHH Q lcl|NC_021342. 
66 YGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQA 145 (197) Q Consensus 66 ~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~ 145 (197) +++||||||||+|+++++++|.+.+.+++.++++++++|+++|+.+++ T Consensus 65 --------------------------------~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~lG~~~~~ 112 (155) T protein:vir:10 65 --------------------------------TSKLPARPFMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQIGQAMKD 112 (155) T ss_pred --------------------------------CCCCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHH Confidence 579999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeeec Q lcl|NC_021342. 146 AVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTYVVHS 189 (197) Q Consensus 146 ~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty~V~~ 189 (197) +||++|+++. |||||+||++||||+||||||+|++||||+|+. T Consensus 113 ~Ik~~I~~~~-~pna~~Ti~~KG~~kPLidTG~l~~SIty~Vv~ 155 (155) T protein:vir:10 113 DIKTTISEWP-ADNSADWAGKKGFNHGLIWTSHLLNSVEQEIVK 155 (155) T ss_pred HHHHHHhcCC-CCCcHHHHHhcCCCCchhHHHHHHHhhhhhccC Confidence 9999999986 999999999999999999999999999999998 No 8 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=100.00 E-value=1.9e-49 Score=287.63 Aligned_cols=139 Identities=20% Similarity=0.295 Sum_probs=128.0 Q ss_pred cccccHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCCc----------------cchHHHHhhHhHcCceeeeCCCce Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVESG----------------EINMATLGAVLNFGAEIDHPGGTS 65 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~~----------------~~~~A~iA~~~EfGa~I~~p~~~~ 65 (197) |+|. .+.|++++++|. 
+++|+|||+++++|+|++ ++++|+||+||||| T Consensus 1 m~v~-----r~~L~~~~~~l~-~~~V~VGi~~~a~y~d~~g~~~~~g~~~~~~~~~G~pva~ia~~~e~G---------- 64 (155) T protein:vir:10 1 MSVT-----RRGLTLPKDRYK-SMSVKAGVLAGATYPDESGKKLADGTILKKDPRAGLPVAMIAMALNYG---------- 64 (155) T ss_pred Ccch-----HHHHHHHHHHhh-CCeeEEeecCCCCCCccccchhhhhhhhccccccCcchhhhhhhhhcC---------- Confidence 6666 356888888875 578999999999998744 68899999999999 Q ss_pred eeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHH Q lcl|NC_021342. 66 YGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQA 145 (197) Q Consensus 66 ~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~ 145 (197) +++||||||||+|+++++++|.+.+.+++.++++++++|+++|+.+++ T Consensus 65 --------------------------------~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~~G~~~~~ 112 (155) T protein:vir:10 65 --------------------------------TSKLPARPFMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQIGQAMKD 112 (155) T ss_pred --------------------------------CCCCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHH Confidence 579999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeeec Q lcl|NC_021342. 
146 AVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTYVVHS 189 (197) Q Consensus 146 ~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty~V~~ 189 (197) +||++|+++.+| |+|+||++||||+||||||+|++||||+|++ T Consensus 113 ~Ik~~I~~~~~p-~~~~Ti~~KG~~~PLidTG~l~~Sity~Vv~ 155 (155) T protein:vir:10 113 DIKTTISEWPAD-NNADWAGKKGFNHGLIWTSHLLNSIEQEIVK 155 (155) T ss_pred HHHHHHhcCCCC-CChHHHHhcCCCCchHHHHHHHHhhhhhccC Confidence 999999999986 6789999999999999999999999999998 No 9 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=100.00 E-value=1.5e-48 Score=282.85 Aligned_cols=139 Identities=20% Similarity=0.288 Sum_probs=126.0 Q ss_pred cccccHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCC----------------ccchHHHHhhHhHcCceeeeCCCce Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVES----------------GEINMATLGAVLNFGAEIDHPGGTS 65 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~----------------~~~~~A~iA~~~EfGa~I~~p~~~~ 65 (197) |++. ...|+.+++++ ++++|+|||+++++|+|+ +++++|+||+||||| T Consensus 1 m~~~-----r~~l~~~~~~l-~~~~v~VGi~~~a~y~d~~~~~~~~~~~~~~~~~~G~pva~ia~~~e~G---------- 64 (155) T protein:vir:77 1 MSVT-----RRGLTLPKDRY-RSMSVKAGVLAGATYPDESGKKLADGSILKKDPRAGLPVAMIAMALNYG---------- 64 (155) T ss_pred Ccch-----HHHHHHHHHHH-hcCceEEeecCCCCCccccchhhhhhhhccccccccccHhhhhhhhhcC---------- Confidence 4444 23477777766 467899999999999873 359999999999999 Q ss_pred eeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHH Q lcl|NC_021342. 
66 YGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQA 145 (197) Q Consensus 66 ~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~ 145 (197) +++||||||||+|+++++++|.+.+.+++.++++++++|+++|+.+++ T Consensus 65 --------------------------------~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~lG~~~~~ 112 (155) T protein:vir:77 65 --------------------------------TSKLPARPFMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQIGQAMKD 112 (155) T ss_pred --------------------------------CCCCCCCchhhHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHH Confidence 579999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeeec Q lcl|NC_021342. 146 AVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTYVVHS 189 (197) Q Consensus 146 ~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty~V~~ 189 (197) +||++|+++.+| |+|+||++||||+||||||+|++||||+|+. T Consensus 113 ~Iq~~I~~~~~p-~~~~Ti~~KG~d~PLidTG~l~~SIty~Vv~ 155 (155) T protein:vir:77 113 DIKTTISEWPAD-NNADWAGKKGFNHGLIWTSHLLNSIEQEIVK 155 (155) T ss_pred HHHHHHhcCCCC-CChHHHHhcCCCCchhHHHHHHHhhhhhccC Confidence 999999999986 6789999999999999999999999999998 No 10 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=100.00 E-value=4.1e-48 Score=280.35 Aligned_cols=151 Identities=15% Similarity=0.188 Sum_probs=127.8 Q ss_pred cccccHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCC---------------ccchHHHHhhHhHcCceeeeCCCcee Q lcl|NC_021342. 
2 MKVVGLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVES---------------GEINMATLGAVLNFGAEIDHPGGTSY 66 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~---------------~~~~~A~iA~~~EfGa~I~~p~~~~~ 66 (197) |+....+- ...+.++.+.+ +++.|+|||+.+..|+++ +++++|+||+||||| T Consensus 1 ~~~~~~~g-~~~~~~~~~~l-~~~~v~vG~l~~a~yp~G~~~~~~~~~~~~~~~~g~~va~Ia~~~E~G----------- 67 (168) T protein:vir:94 1 MTTIARKG-VKMPPHLEAQF-QSGEVKAGVLSGSTYPQMTYTDQRTGKQIEDARGGMPVAVIAQALEYG----------- 67 (168) T ss_pred Cccccchh-hhhhHHHHHhh-hccceeeeccccCcccccccchhhcccccccccccccHHHHHHHHhcC----------- Confidence 44443222 33444555555 467899999999888753 568999999999999 Q ss_pred eecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHH Q lcl|NC_021342. 67 GYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAA 146 (197) Q Consensus 67 ~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~ 146 (197) +++||||||||+|+++++++|.+.+++++++++|++++|+++|+.++++ T Consensus 68 -------------------------------~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~L~~lG~~~~~~ 116 (168) T protein:vir:94 68 -------------------------------HGQNHPRPFMQQTYAAQYRAWSRDLTLTLKAGAAADTALRTVGQRMAED 116 (168) T ss_pred -------------------------------CCCCCCchhhHHHHHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHH Confidence 5789999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeeeccccccccC Q lcl|NC_021342. 147 VRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 147 i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty~V~~k~~~~~~~ 197 (197) ||++|+++. 
|||||+||++||||+||||||+|++||||+|+..+..+|-- T Consensus 117 Ik~~I~~~~-ppna~sTi~~KG~~~PLiDTG~l~~SIty~Vv~d~~~~~~~ 166 (168) T protein:vir:94 117 IQDTIRNWP-ADNSPEWAAIKGFNAGLRQTGVLLNAIDSAVIIDGEHGEAP 166 (168) T ss_pred HHHHhhcCC-CCccHHHHHhcCCCCchhHHHHHHhhcceeeeecCCCCCCC Confidence 999999985 99999999999999999999999999999887433333333 No 11 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=100.00 E-value=1.4e-43 Score=255.46 Aligned_cols=150 Identities=20% Similarity=0.238 Sum_probs=112.5 Q ss_pred CcccccHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccCCc Q lcl|NC_021342. 1 MMKVVGLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEVR 80 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~ 80 (197) |||..... =.++|.+.+++| +++.|+|||+++.++.+ ++.++++||+||||| T Consensus 1 ~~~~~~~~-G~~~L~~~~k~l-~~~~V~VGi~~d~g~~~-dG~sv~~vA~~~EfG------------------------- 52 (160) T protein:vir:95 1 MVKRVIHP-ARAKLVGAMKNL-QTANAQVGYFQEQGQHS-SGFSYPALMYLQEVI------------------------- 52 (160) T ss_pred CceeechH-hHHHHHHHHHHH-hCCeeEEeeccccccCC-CCccHHHHHhhhhcC------------------------- Confidence 88877332 233344444444 56789999999986654 568999999999999 Q ss_pred cccccccccccccccccccCCCcchhHHHHHHH----HHHHHHHHHHHHH-HccCcH-HHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021342. 81 FLKTGTGFKPLGVTKPHKINIPARPWLEPGVQS----KSNEYVTIIERGA-SRDEST-TSILEKVGVTAQAAVRMFMTEL 154 (197) Q Consensus 81 f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~----~~~~~~~~~~~~~-~~~~~~-~~~l~~iG~~~~~~i~~~I~~~ 154 (197) +++||+|||||+||+. +...+.+++...+ ..-.+. ..+.+.||+.++++|+.+|++. 
T Consensus 53 -----------------~~~iPaRPf~R~tfe~~~~~~~~~~~~~~~~~i~~~~~~g~~~~~~~LG~~~~~~ik~~I~~~ 115 (160) T protein:vir:95 53 -----------------GVPSASGKVYRRLFEITMMLNKQTLLEQTKKNLYKQLSSLNTDPSNTLEAFAKNAQKAIKRGF 115 (160) T ss_pred -----------------cccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhHHHHHHHHHHHHHHHHHhhc Confidence 4789999999999975 3333334433322 211111 2244569999999999999884 Q ss_pred ----CCCCCcHHHHHhcCCCCchhHHHHHHhhceeeeeccccccc Q lcl|NC_021342. 155 ----QDPPNAKSTIRKKGSSNPLIDTGALRQSVTYVVHSGKLPDE 195 (197) Q Consensus 155 ----~~ppna~~Ti~~KG~~~PLidTG~L~~SIty~V~~k~~~~~ 195 (197) .||||||+||++||||+||||||+|++||+|+|.++.+=.. T Consensus 116 ~~p~~w~pNap~Ti~~Kgs~~PLiDTg~l~~Si~y~v~~~~~~~~ 160 (160) T protein:vir:95 116 GNSAILPPNAPSTVKKKGFNAPLVETGDLRDNLAYKISTKKGIKK 160 (160) T ss_pred CCccCCCCCcHHHHHhcCCCCcchhhHHHhhhhhheeecccccCC Confidence 47899999999999999999999999999999999884444 No 12 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.82 E-value=9.1e-12 Score=81.12 Aligned_cols=114 Identities=21% Similarity=0.326 Sum_probs=68.0 Q ss_pred CcccccHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccc-cccccCC Q lcl|NC_021342. 1 MMKVVGLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEE-AESRKEV 79 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~-~~~~~~~ 79 (197) .+.-+| .+.+-|.. +. ....|.||- +..||++|+||++|+++.++.+.+.... 
..+..+- T Consensus 75 ~L~~tg--~L~~Si~~---~~-~~~~v~vGt-------------n~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~ 135 (190) T protein:vir:99 75 ILTLDG--HLRNLLRY---QL-DGSELLFGS-------------DRPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGR 135 (190) T ss_pred cceecH--HHHHHHhh---ee-cCcEEEEec-------------CcchhhhhhcCCcccccccchhhhhhhhhhhhhhhc Confidence 222222 22222221 11 455788873 3679999999999999999887665433 3334444 Q ss_pred ccccc-cccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHH Q lcl|NC_021342. 80 RFLKT-GTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTS 134 (197) Q Consensus 80 ~f~k~-~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~ 134 (197) +|++. .++|........++++||+||||--+ ++..+++.+.+.+.+.+.+.-.. T Consensus 136 ~~~~~~~~~~~~~~~~~~~~v~IPaRpfLG~s-~~d~~~I~~~i~~~l~~~~~~~~ 190 (190) T protein:vir:99 136 EFVPRRRSNFAQDVQIGPYTIQMPARPWLGTS-SQDDDTILQRVERYLQRALRERA 190 (190) T ss_pred ccccccccccchhcccccceeeecCcccCCCC-HHHHHHHHHHHHHHHHHHHhhcC Confidence 56653 45677777788899999999999543 44555665555544443211111 No 13 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=98.73 E-value=2.6e-11 Score=78.58 Aligned_cols=84 Identities=23% Similarity=0.206 Sum_probs=58.9 Q ss_pred HHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCCcHHHHHhcCCCCchhHHHHHHhhce Q lcl|NC_021342. 111 VQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTEL------QDPPNAKSTIRKKGSSNPLIDTGALRQSVT 184 (197) Q Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppna~~Ti~~KG~~~PLidTG~L~~SIt 184 (197) +-+....+.+.+.+... +....|..+|..++..+++.+.+. 
.|+|+||+|+++|++++||+|||.|++||+ T Consensus 1 ~i~~~~~i~~~l~~l~~---~~~~~l~~i~~~~~~~~~~rf~~~~~p~G~~W~pLs~st~a~k~~~~~L~~tG~L~~Si~ 77 (145) T protein:vir:31 1 MVEDENNIPEAREAIQD---GLTDGLERLHTITLRELITNMSDGQDALGNPWEPLKESTIRAKGSDTPLIDNSRLLTDIN 77 (145) T ss_pred CcccHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCcccChHHHHHhcCCCCCccCHHHHHHHH Confidence 22333334444443322 234468889999999999998863 489999999999999999999999999999 Q ss_pred eeeeccccccccC Q lcl|NC_021342. 185 YVVHSGKLPDEGL 197 (197) Q Consensus 185 y~V~~k~~~~~~~ 197 (197) |.+..-....+.. T Consensus 78 ~~~~~~~~~~~a~ 90 (145) T protein:vir:31 78 AASMMDRANRMAV 90 (145) T ss_pred HHhhhcccCceeE Confidence 9874322111111 No 14 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.64 E-value=4.6e-11 Score=77.28 Aligned_cols=91 Identities=15% Similarity=0.221 Sum_probs=64.3 Q ss_pred cccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCCcHHHHHh Q lcl|NC_021342. 93 VTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTEL------QDPPNAKSTIRK 166 (197) Q Consensus 93 ~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppna~~Ti~~ 166 (197) ++ --++.|- -.++.+.+...+....+...+|..||..+...+++.|++. .|+|++|+|+++ T Consensus 1 M~-~i~i~~d------------~~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rf~~~~~PdG~~W~p~~~~t~~r 67 (190) T protein:vir:99 1 MA-GITLEWD------------GRRALDVLNAGSAALGDPSGLLQDIGELLLNIHRRRFQAQVSPDGTPWQPLSPAYLRR 67 (190) T ss_pred Cc-eeEEEec------------HHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCccccHHHHHH Confidence 01 0122222 2234444555555445778999999999999999999875 488999999976 Q ss_pred c--CCCCchhHHHHHHhhceeeeeccccccccC Q lcl|NC_021342. 167 K--GSSNPLIDTGALRQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 167 K--G~~~PLidTG~L~~SIty~V~~k~~~~~~~ 197 (197) | +..++|.|||.|++||+|++.+.... 
=|- T Consensus 68 k~~~~~~~L~~tg~L~~Si~~~~~~~~v~-vGt 99 (190) T protein:vir:99 68 KRKNRDKILTLDGHLRNLLRYQLDGSELL-FGS 99 (190) T ss_pred hhcCCCccceecHHHHHHHhheecCcEEE-Eec Confidence 6 57899999999999999998544322 232 No 15 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=98.46 E-value=3.9e-10 Score=72.14 Aligned_cols=92 Identities=11% Similarity=-0.007 Sum_probs=66.1 Q ss_pred cccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhC---CCCCCcHHHHH Q lcl|NC_021342. 89 KPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTEL---QDPPNAKSTIR 165 (197) Q Consensus 89 ~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~---~~ppna~~Ti~ 165 (197) |.. .-+|.|- ..++.+.+.+......+...+|..||..+...+++.|.+. .|+|+||+|++ T Consensus 1 Ms~----~i~i~~d------------~~~~~~~L~~l~~~~~d~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~ 64 (175) T protein:vir:79 1 MSD----FVNFQID------------DSALRTRLLQLEQAGHQKADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIH 64 (175) T ss_pred Cce----EEEEEec------------hHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHH Confidence 111 1122222 2345566777666666889999999999999999999975 58899999986 Q ss_pred hc---------------------CCCCchhHHHHHHhhceeeeeccccccccC Q lcl|NC_021342. 
166 KK---------------------GSSNPLIDTGALRQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 166 ~K---------------------G~~~PLidTG~L~~SIty~V~~k~~~~~~~ 197 (197) +| ++.++|+|||.|++||+|.+.+..+ .-|- T Consensus 65 ~r~~~~~~~~~~~~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v-~vGt 116 (175) T protein:vir:79 65 MRVGGKKAYKKNGELTAAASRRKAGLMILQDSGQMAASTATDSGEDYS-VIGS 116 (175) T ss_pred hhccccccccccccchhhHhhhccCCCcceechhhhhhhhheecCCEE-EEec Confidence 43 4678999999999999999854432 2222 No 16 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=98.46 E-value=3e-10 Score=72.80 Aligned_cols=92 Identities=15% Similarity=0.118 Sum_probs=63.5 Q ss_pred cccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhC--CCCCCcHHHHHh Q lcl|NC_021342. 89 KPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTEL--QDPPNAKSTIRK 166 (197) Q Consensus 89 ~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~--~~ppna~~Ti~~ 166 (197) |. . ++. +.-+...+.+.+.++.....+...+|+.||..+...+++.|... .|+|+||+|+++ T Consensus 1 Ms------------~--~i~--i~~~~~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~ 64 (155) T protein:vir:10 1 MA------------N--RIE--LELVDREVQERLAALYAAVTDTLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAA 64 (155) T ss_pred CC------------c--eEE--EEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHH Confidence 00 0 011 11122344455555555555788999999999999999999753 799999999864 Q ss_pred -----cCCCCchhHHHHHHhhceeeeeccccccccC Q lcl|NC_021342. 
167 -----KGSSNPLIDTGALRQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 167 -----KG~~~PLidTG~L~~SIty~V~~k~~~~~~~ 197 (197) ++..++|+|||.|++||+|.+....+ .=|- T Consensus 65 r~k~g~~~~~~L~~tG~L~~Si~~~~~~~~v-~vGt 99 (155) T protein:vir:10 65 RAAKGRGAHPILQVTNALARSITTRADRDQA-QIGS 99 (155) T ss_pred HHhccCCCCCccccchhhhhhhhceecCCEE-EEec Confidence 35678999999999999999844332 2233 No 17 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=98.44 E-value=4.5e-10 Score=71.83 Aligned_cols=92 Identities=15% Similarity=0.109 Sum_probs=64.0 Q ss_pred cccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhC--CCCCCcHHHHHh Q lcl|NC_021342. 89 KPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTEL--QDPPNAKSTIRK 166 (197) Q Consensus 89 ~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~--~~ppna~~Ti~~ 166 (197) |..+ -+|.+- ...+.+.+.+......+...+|..||..+...+++.|... .|+|+||+|+++ T Consensus 1 M~~~----i~i~~d------------~~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~~eG~~W~pls~~t~~~ 64 (155) T protein:vir:79 1 MTTR----IDVELD------------DQEVRQRLAVLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPATVAA 64 (155) T ss_pred CceE----EEEEec------------hHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCHHHHHH Confidence 1111 122222 2334455555555555788999999999999999999753 689999999987 Q ss_pred c-----CCCCchhHHHHHHhhceeeeeccccccccC Q lcl|NC_021342. 
167 K-----GSSNPLIDTGALRQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 167 K-----G~~~PLidTG~L~~SIty~V~~k~~~~~~~ 197 (197) | +..++|+|||.|++||+|++.+..+ .-|- T Consensus 65 r~~~g~~~~~iL~~tG~L~~Si~~~~~~~~v-~vGt 99 (155) T protein:vir:79 65 REAKGRGPHPILQVTNALARSVTTWADRNEA-GIGS 99 (155) T ss_pred HhccCCCCCCccccchhhhhhhhceecCCEE-EEec Confidence 6 3568999999999999999754332 2232 No 18 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=98.36 E-value=1e-09 Score=69.84 Aligned_cols=92 Identities=15% Similarity=0.105 Sum_probs=63.7 Q ss_pred cccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHh-C-CCCCCcHHHHHh Q lcl|NC_021342. 89 KPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTE-L-QDPPNAKSTIRK 166 (197) Q Consensus 89 ~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~-~-~~ppna~~Ti~~ 166 (197) |..+ -+|.+ +...+.+.+.+......+...+|..||..+...+++.|.. | .|+|+||+|+++ T Consensus 1 Ms~~----i~i~~------------d~~~~~~~L~~l~~~~~d~~~l~~~ig~~l~~~~~~rF~pdG~~W~pls~~t~~~ 64 (155) T protein:vir:99 1 MTTR----IDVEL------------DDQEVRQRLALLMRSVTDTLPVMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAA 64 (155) T ss_pred CceE----EEEEe------------chHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccCCCCCCCChHHHHH Confidence 1111 11111 2234445555555555578899999999999999999974 3 689999999987 Q ss_pred c-----CCCCchhHHHHHHhhceeeeeccccccccC Q lcl|NC_021342. 167 K-----GSSNPLIDTGALRQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 167 K-----G~~~PLidTG~L~~SIty~V~~k~~~~~~~ 197 (197) | +..++|+|||.|++||+|.+.+.. 
-.-|- T Consensus 65 r~~~g~~~~~iL~~tg~L~~Si~~~~~~~~-v~vGt 99 (155) T protein:vir:99 65 REAKGRGPHPILQVTNALARSVTTWADRNE-AGIGS 99 (155) T ss_pred HhccCCCCCCcchhchhhhhhhhceecCCE-EEEec Confidence 6 346899999999999999985433 22232 No 19 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=98.30 E-value=1.5e-09 Score=68.99 Aligned_cols=89 Identities=10% Similarity=-0.018 Sum_probs=60.8 Q ss_pred HHH--HHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhC-------CCCCCcHHHHHhcC-----CCCc Q lcl|NC_021342. 107 LEP--GVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTEL-------QDPPNAKSTIRKKG-----SSNP 172 (197) Q Consensus 107 lr~--t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~-------~~ppna~~Ti~~KG-----~~~P 172 (197) |.. .+..+.+.+.+.+.+. ....+...+|..||..+...+++.|.+. .|+|++|+|+++|. ..+| T Consensus 1 ms~~i~~~~d~~~l~~~L~~l-~~~~~~~~l~~~Ig~~l~~~~~~rf~~~~~Pd~G~~W~pls~~t~~~r~~~~~~~~~~ 79 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDEL-GTVTRDRAIPRVMAAALLSSTEQAFERQADPDTGKGWEAWSDSWLAWRQDHGFVPGSI 79 (156) T ss_pred CeEEEEEeecHHHHHHHHHHH-HhhhccHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCcccChHHHHHhhccCCCCCcc Confidence 111 1111223444445443 2333456799999999999999999863 48899999999873 3689 Q ss_pred hhHHHHHHhhceeeeeccccccccC Q lcl|NC_021342. 173 LIDTGALRQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 173 LidTG~L~~SIty~V~~k~~~~~~~ 197 (197) |+|||.|++||+|.+....+.. 
|- T Consensus 80 L~~tg~L~~Si~~~~~~~~v~v-Gt 103 (156) T protein:vir:19 80 LTLHGDLARSITTDYGQDYALI-GS 103 (156) T ss_pred hhhhHHHHHHhhheecCCEEEE-ec Confidence 9999999999999884433221 33 No 20 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=98.15 E-value=4.8e-09 Score=66.17 Aligned_cols=85 Identities=15% Similarity=0.166 Sum_probs=47.3 Q ss_pred CcccccHHHHHHHH--HHHHHHHhcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccC Q lcl|NC_021342. 1 MMKVVGLQETLAEL--DKVLGQIRDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKE 78 (197) Q Consensus 1 M~ki~~~~~~~~~L--~~l~~~~~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~ 78 (197) +-+-.....+...| ...+...-....+.||+.. +++.||++|+||++|++..+ T Consensus 63 ~~~g~~~~~~~~~l~~~~~l~~~~~~~~~~v~~~G----------tn~~yAaiHQfG~~~r~~~~--------------- 117 (149) T protein:vir:18 63 SKKGRIKREMFAKLRTSRFMKAKGSDSAAVVEFTG----------KVQRMARVHQYGLKDRPNRN--------------- 117 (149) T ss_pred hccCcccchhhhhhhhhhhhheeecCceeEEEecc----------cchhhhhhhhccccccccCC--------------- Confidence 11100001111111 1111111134567777641 36789999999998875321 Q ss_pred CccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 79 VRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 79 ~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~ 128 (197) .+.++||+||||--+ ++.+.++.+.+.+.+.. 
T Consensus 118 -----------------~~~v~iPaRp~LG~s-~~d~~~I~~~i~~~l~~ 149 (149) T protein:vir:18 118 -----------------SRDVQYEARPLLGFT-RDDEQMIEDVIISHLGK 149 (149) T ss_pred -----------------CccccccccccCCCC-HHHHHHHHHHHHHHHhC Confidence 246899999999644 55567777777766666 No 21 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=98.12 E-value=1.7e-08 Score=63.24 Aligned_cols=131 Identities=19% Similarity=0.146 Sum_probs=67.6 Q ss_pred cccccHHHHHHHHHHHHHHHh-------------------c---------CCeEEEEeecCCCCCCCccchHHHHhhHhH Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQIR-------------------D---------DQYVTVGIHEAAGDVESGEINMATLGAVLN 53 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~-------------------~---------~~~V~VGi~~~~~~~~~~~~~~A~iA~~~E 53 (197) |++.|.+++.+.|+++.+.+. . ..++.+-...+.+.-.....+.+.||.+.| T Consensus 1 i~i~Gld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~~~~v~~~~~Ya~fvE 80 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDIDKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKAKDLISKKITVNELYGAYME 80 (173) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeeccCceeEEeeCCCcccchhhh Confidence 999999999888888754331 0 111222211111100111235688999999 Q ss_pred cCceeeeCCCceeeec-ccccccccCCccc-------------cccccccccccccccccCCCcchhHHHHHHHHHHHHH Q lcl|NC_021342. 54 FGAEIDHPGGTSYGYA-TEEAESRKEVRFL-------------KTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYV 119 (197) Q Consensus 54 fGa~I~~p~~~~~~~~-~~~~~~~~~~~f~-------------k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~ 119 (197) ||. -.++++..-... ...........|. +...-+....+....+-..||||||+|+++++++++. 
T Consensus 81 fGT-~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~G~~aqPFl~PA~~~~~~~~~ 159 (173) T protein:vir:10 81 FGT-GAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKILGAGINPQPFLYPAWIEGKKQYL 159 (173) T ss_pred ccc-ccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEeecCCCCCCccchhHHHHhHHHHH Confidence 995 333332211100 0001111111111 1111111222233345578999999999999999998 Q ss_pred HHHHHHHHccCcHHHHHHHH Q lcl|NC_021342. 120 TIIERGASRDESTTSILEKV 139 (197) Q Consensus 120 ~~~~~~~~~~~~~~~~l~~i 139 (197) +.+.+.+.. .|..| T Consensus 160 ~~i~~~i~~------~lrk~ 173 (173) T protein:vir:10 160 KDLENLLKT------YNKKI 173 (173) T ss_pred HHHHHHHHH------HhhcC Confidence 888887665 33333 No 22 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=98.11 E-value=5.2e-09 Score=66.00 Aligned_cols=85 Identities=18% Similarity=0.176 Sum_probs=48.6 Q ss_pred Cc----------ccccHHHHHHHHHHHHHHHh---cCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceee Q lcl|NC_021342. 1 MM----------KVVGLQETLAELDKVLGQIR---DDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYG 67 (197) Q Consensus 1 M~----------ki~~~~~~~~~L~~l~~~~~---~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~ 67 (197) .. +-.+.+.+...+ .+...|. +...+.|||..+ +++.||++|+||++|++.. T Consensus 53 W~p~k~~~~~~k~g~~~~~l~~~~-~l~~sl~~~~~~~~~~vg~~~G---------s~~~yAa~HQfG~~~~~~~----- 117 (150) T protein:vir:20 53 YAPRQQQSVRKKTGRVKRKMFAKL-ITSRFLHIRASPEQASMEFYGG---------KSPKIASVHQFGLSEENRK----- 117 (150) T ss_pred CcccchHHHHHhccCCCccccchh-hhhhhhheeecCcEEEEEeeCC---------cchhhhhhhhccccccccc----- Confidence 00 000111111111 1111121 456788888643 3678999999999886422 Q ss_pred ecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 
68 YATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 68 ~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~ 128 (197) ..+.++||+||||--+ ++.+.++.+.+.+.+.. T Consensus 118 ---------------------------~~~~~~iPaRp~LG~s-~~d~~~i~~~i~~~l~k 150 (150) T protein:vir:20 118 ---------------------------DGKKIDYPARPLLGFT-GEDVQMIEEIILAHLER 150 (150) T ss_pred ---------------------------CCCceeccccccCCCC-HHHHHHHHHHHHHHHhC Confidence 1246899999999644 44566677777666666 No 23 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=98.10 E-value=9.9e-09 Score=64.47 Aligned_cols=112 Identities=12% Similarity=0.119 Sum_probs=52.3 Q ss_pred cccccHHHHHHHHHHHHHHHhc--CCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccCC Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQIRD--DQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEV 79 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~~--~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~ 79 (197) |++.|.+++.+.|+++.+.+.. ...|.-+ + ..++.-|. ..-+....+|-++..+.........++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~-----~------~~i~~~a~-~~a~~~~~~p~~TG~Lr~sI~~~~~g~~ 68 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQEN-----A------KEYVVRAK-LKAREVMNKGYWTGNLSRNIRYKKTGDL 68 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHH-----H------HHHHHHHH-HhccccCCCCCCchhhhhcceeeecCce Confidence 9999999999999877554321 0000000 0 00000000 0000000111111111111111111111 Q ss_pred cc-ccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 
80 RF-LKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGAS 127 (197) Q Consensus 80 ~f-~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~ 127 (197) .+ +..+..|....++ .|..+|+||||+|+++.++.++.+.++++++ T Consensus 69 ~~~v~~~~~Ya~~vE~--GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 69 QYTITSHAAYSGFLEF--GTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred EEEeecCccchhhhcc--cccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 10 0111111111111 2567999999999999999999999999998 No 24 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=98.10 E-value=9.9e-09 Score=64.47 Aligned_cols=112 Identities=12% Similarity=0.119 Sum_probs=52.3 Q ss_pred cccccHHHHHHHHHHHHHHHhc--CCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccCC Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQIRD--DQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEV 79 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~~--~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~ 79 (197) |++.|.+++.+.|+++.+.+.. ...|.-+ + ..++.-|. ..-+....+|-++..+.........++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~-----~------~~i~~~a~-~~a~~~~~~p~~TG~Lr~sI~~~~~g~~ 68 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQEN-----A------KEYVVRAK-LKAREVMNKGYWTGNLSRNIRYKKTGDL 68 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHH-----H------HHHHHHHH-HhccccCCCCCCchhhhhcceeeecCce Confidence 9999999999999877554321 0000000 0 00000000 0000000111111111111111111111 Q ss_pred cc-ccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 
80 RF-LKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGAS 127 (197) Q Consensus 80 ~f-~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~ 127 (197) .+ +..+..|....++ .|..+|+||||+|+++.++.++.+.++++++ T Consensus 69 ~~~v~~~~~Ya~~vE~--GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 69 QYTITSHAAYSGFLEF--GTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred EEEeecCccchhhhcc--cccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 10 0111111111111 2567999999999999999999999999998 No 25 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=98.10 E-value=9.9e-09 Score=64.47 Aligned_cols=112 Identities=12% Similarity=0.119 Sum_probs=52.3 Q ss_pred cccccHHHHHHHHHHHHHHHhc--CCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccCC Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQIRD--DQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEV 79 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~~--~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~ 79 (197) |++.|.+++.+.|+++.+.+.. ...|.-+ + ..++.-|. ..-+....+|-++..+.........++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~-----~------~~i~~~a~-~~a~~~~~~p~~TG~Lr~sI~~~~~g~~ 68 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQEN-----A------KEYVVRAK-LKAREVMNKGYWTGNLSRNIRYKKTGDL 68 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHH-----H------HHHHHHHH-HhccccCCCCCCchhhhhcceeeecCce Confidence 9999999999999877554321 0000000 0 00000000 0000000111111111111111111111 Q ss_pred cc-ccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 
80 RF-LKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGAS 127 (197) Q Consensus 80 ~f-~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~ 127 (197) .+ +..+..|....++ .|..+|+||||+|+++.++.++.+.++++++ T Consensus 69 ~~~v~~~~~Ya~~vE~--GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 69 QYTITSHAAYSGFLEF--GTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred EEEeecCccchhhhcc--cccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 10 0111111111111 2567999999999999999999999999998 No 26 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=98.10 E-value=9.9e-09 Score=64.47 Aligned_cols=112 Identities=12% Similarity=0.119 Sum_probs=52.3 Q ss_pred cccccHHHHHHHHHHHHHHHhc--CCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccCC Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQIRD--DQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEV 79 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~~--~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~ 79 (197) |++.|.+++.+.|+++.+.+.. ...|.-+ + ..++.-|. ..-+....+|-++..+.........++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~-----~------~~i~~~a~-~~a~~~~~~p~~TG~Lr~sI~~~~~g~~ 68 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQEN-----A------KEYVVRAK-LKAREVMNKGYWTGNLSRNIRYKKTGDL 68 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHH-----H------HHHHHHHH-HhccccCCCCCCchhhhhcceeeecCce Confidence 9999999999999877554321 0000000 0 00000000 0000000111111111111111111111 Q ss_pred cc-ccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 
80 RF-LKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGAS 127 (197) Q Consensus 80 ~f-~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~ 127 (197) .+ +..+..|....++ .|..+|+||||+|+++.++.++.+.++++++ T Consensus 69 ~~~v~~~~~Ya~~vE~--GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 69 QYTITSHAAYSGFLEF--GTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred EEEeecCccchhhhcc--cccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 10 0111111111111 2567999999999999999999999999998 No 27 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=98.10 E-value=9.9e-09 Score=64.47 Aligned_cols=112 Identities=12% Similarity=0.119 Sum_probs=52.3 Q ss_pred cccccHHHHHHHHHHHHHHHhc--CCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccCC Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQIRD--DQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEV 79 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~~--~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~ 79 (197) |++.|.+++.+.|+++.+.+.. ...|.-+ + ..++.-|. ..-+....+|-++..+.........++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~-----~------~~i~~~a~-~~a~~~~~~p~~TG~Lr~sI~~~~~g~~ 68 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQEN-----A------KEYVVRAK-LKAREVMNKGYWTGNLSRNIRYKKTGDL 68 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHH-----H------HHHHHHHH-HhccccCCCCCCchhhhhcceeeecCce Confidence 9999999999999877554321 0000000 0 00000000 0000000111111111111111111111 Q ss_pred cc-ccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 
80 RF-LKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGAS 127 (197) Q Consensus 80 ~f-~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~ 127 (197) .+ +..+..|....++ .|..+|+||||+|+++.++.++.+.++++++ T Consensus 69 ~~~v~~~~~Ya~~vE~--GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 69 QYTITSHAAYSGFLEF--GTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred EEEeecCccchhhhcc--cccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 10 0111111111111 2567999999999999999999999999998 No 28 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=98.10 E-value=9.9e-09 Score=64.47 Aligned_cols=112 Identities=12% Similarity=0.119 Sum_probs=52.3 Q ss_pred cccccHHHHHHHHHHHHHHHhc--CCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccCC Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQIRD--DQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEV 79 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~~--~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~ 79 (197) |++.|.+++.+.|+++.+.+.. ...|.-+ + ..++.-|. ..-+....+|-++..+.........++. T Consensus 1 i~~~Gld~l~~~l~~~~~~~~~~v~~a~~~~-----~------~~i~~~a~-~~a~~~~~~p~~TG~Lr~sI~~~~~g~~ 68 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQEN-----A------KEYVVRAK-LKAREVMNKGYWTGNLSRNIRYKKTGDL 68 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHH-----H------HHHHHHHH-HhccccCCCCCCchhhhhcceeeecCce Confidence 9999999999999877554321 0000000 0 00000000 0000000111111111111111111111 Q ss_pred cc-ccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 
80 RF-LKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGAS 127 (197) Q Consensus 80 ~f-~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~ 127 (197) .+ +..+..|....++ .|..+|+||||+|+++.++.++.+.++++++ T Consensus 69 ~~~v~~~~~Ya~~vE~--GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 69 QYTITSHAAYSGFLEF--GTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred EEEeecCccchhhhcc--cccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 10 0111111111111 2567999999999999999999999999998 No 29 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=98.09 E-value=3.9e-09 Score=66.67 Aligned_cols=85 Identities=18% Similarity=0.180 Sum_probs=47.4 Q ss_pred Cccc----------ccHHHHHHHHHHHHHHHh---cCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceee Q lcl|NC_021342. 1 MMKV----------VGLQETLAELDKVLGQIR---DDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYG 67 (197) Q Consensus 1 M~ki----------~~~~~~~~~L~~l~~~~~---~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~ 67 (197) ...- .+...+...+ .+...+. +...+.|||..+ +++.||++|+||+++++.. T Consensus 53 W~p~~~~~~~~k~~~~~~~l~~~~-~l~~sl~~~~~~~~a~vg~~~G---------t~~~yAaiHQfG~~~~~~~----- 117 (150) T protein:vir:60 53 YAPRQQQSARKKTGRVKRKMFAKL-ITSRFLHIRASPEQASMEFYGG---------KSPKIASVHQFGLSEENRK----- 117 (150) T ss_pred CcccChHHHHHhhcCCCccchhhh-hhcceeeeeeeCcEEEEEeeCC---------CchhhhhhhhccccccccC----- Confidence 1110 0111111111 0111111 345677887633 3678999999999876422 Q ss_pred ecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 68 YATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 68 ~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~ 128 (197) ..+.++||+||||--+ ++.+.++.+.+.+.+.. 
T Consensus 118 ---------------------------~~~~~~iPaRp~LG~s-~~d~~~i~~~i~~~l~r 150 (150) T protein:vir:60 118 ---------------------------DGKKIDYPARPLLGFT-GEDVQMIEEIILAHLDR 150 (150) T ss_pred ---------------------------CCCceecCCcccCCCC-HHHHHHHHHHHHHHHhC Confidence 1246899999999754 45566666766666666 No 30 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=98.08 E-value=1.7e-08 Score=63.18 Aligned_cols=141 Identities=12% Similarity=0.162 Sum_probs=60.4 Q ss_pred Cc-----ccccHHHHHHHHHHHHHHHhcC---CeEEE-----------Eeec-CCCCCCCccchHHHHhhHhHcCceeee Q lcl|NC_021342. 1 MM-----KVVGLQETLAELDKVLGQIRDD---QYVTV-----------GIHE-AAGDVESGEINMATLGAVLNFGAEIDH 60 (197) Q Consensus 1 M~-----ki~~~~~~~~~L~~l~~~~~~~---~~V~V-----------Gi~~-~~~~~~~~~~~~A~iA~~~EfGa~I~~ 60 (197) |+ +|.|.+++.+.|++|..++.+. ..+.. -.+. +..+..+. +..--.+.+=+..-.. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~---l~~~i~~~~~~~~~~~ 77 (164) T protein:vir:43 1 MADTVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRS---ISDNIALRWNGRLFKR 77 (164) T ss_pred CCcceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccch---hhhhhhhhcccCcccc Confidence 55 5669999999999886554211 11111 1110 00010000 0000011110100001 Q ss_pred CCCceeeeccccccc--ccCCcc-ccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHH Q lcl|NC_021342. 61 PGGTSYGYATEEAES--RKEVRF-LKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILE 137 (197) Q Consensus 61 p~~~~~~~~~~~~~~--~~~~~f-~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~ 137 (197) ++...+......... ....+. ...++..+.++|.--.|.++|||||||+++++++++..+.+.+.+...+ +.+|. 
T Consensus 78 ~~~~~~~vg~~~~~~~~~~~~~~~~~~~~~~~y~~f~EfGT~km~a~PFlrPA~~~~k~~~~~~~~~~l~~~i--~ka~~ 155 (164) T protein:vir:43 78 TGDLGFRIGVLHGAVLPKKGERSDKTANAPTPHWRLLEFGTEDMRAQPFMRSALADNIAEVTSTFVSEYEKGI--DRAIK 155 (164) T ss_pred ccceeEEecccccccccccccccccCCCCCcceEEEeecCCCCCCCCcchhhhHHHhHHHHHHHHHHHHHHHH--HHHHH Confidence 111111000000000 000111 1122233456666667889999999999999999998877765554322 12333 Q ss_pred HHHHHHHHH Q lcl|NC_021342. 138 KVGVTAQAA 146 (197) Q Consensus 138 ~iG~~~~~~ 146 (197) +.+..++.- T Consensus 156 k~~~~~~~~ 164 (164) T protein:vir:43 156 RAAKKAAQG 164 (164) T ss_pred HHHhhhccC Confidence 333322222 No 31 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=98.08 E-value=9.3e-09 Score=64.61 Aligned_cols=92 Identities=13% Similarity=0.032 Sum_probs=66.6 Q ss_pred cccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhC---CCCCCcHHHHH Q lcl|NC_021342. 89 KPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTEL---QDPPNAKSTIR 165 (197) Q Consensus 89 ~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~---~~ppna~~Ti~ 165 (197) |.. .-++.+. ..++.+.+.++.....+...+|..||..++...++.|.+. +|.|.+|+|++ T Consensus 1 Ms~----~i~i~~~------------~~~l~~~L~~l~~~~~d~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~ 64 (175) T protein:vir:10 1 MSD----FVNFQID------------DSALRTRLLQLEQAGHQKAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIH 64 (175) T ss_pred Cce----eEEEEec------------HHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhh Confidence 111 1122222 2456667777777767888999999999999999999875 57899999986 Q ss_pred h---------------------cCCCCchhHHHHHHhhceeeeeccccccccC Q lcl|NC_021342. 166 K---------------------KGSSNPLIDTGALRQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 166 ~---------------------KG~~~PLidTG~L~~SIty~V~~k~~~~~~~ 197 (197) + ++..++|+|||.|++||+|.+.+..+.. 
|- T Consensus 65 ~r~~~g~~~~k~~~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~v-Gt 116 (175) T protein:vir:10 65 MRVGGKKAYKKNGELTAAASRRKAGLMILQDSGQMAASVSTDHDDNSAVI-GS 116 (175) T ss_pred hhhcccccchhhhhhhhhhhhhccCCCcceechhhhhhhheeecCCEEEE-ec Confidence 3 3467899999999999999985544322 22 No 32 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=98.07 E-value=6.7e-09 Score=65.38 Aligned_cols=87 Identities=11% Similarity=0.026 Sum_probs=49.3 Q ss_pred Cccc-------ccHHHHHHHHHHHHHH--H--h-cCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeee Q lcl|NC_021342. 1 MMKV-------VGLQETLAELDKVLGQ--I--R-DDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGY 68 (197) Q Consensus 1 M~ki-------~~~~~~~~~L~~l~~~--~--~-~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~ 68 (197) ...- .+..+....+.+|... + . +...+.|||.. +++.||++|+||+++++..+ T Consensus 54 W~p~k~~~~~~k~~~~~~~m~~~L~~a~~l~~~a~~~~~~Vg~~G----------t~~~yAaiHQfG~~~r~~~~----- 118 (152) T protein:vir:10 54 YEPRKKPKKGVKSKIKSGKMFDKITQPRFMRLRLESEGVSLGYEG----------GDAVIARIHQQGLIGRVRKD----- 118 (152) T ss_pred CchhhhhhhhhcccccchhHHHhhhhcceeeeeecCcEEEEEecC----------CchhhhhhhccCccccccCC----- Confidence 1100 0000011112222110 1 1 34568888862 35789999999998764321 Q ss_pred cccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHcc Q lcl|NC_021342. 69 ATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRD 129 (197) Q Consensus 69 ~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~ 129 (197) ++.+++||+||||--+ ++...++.+.+.+.+.+. 
T Consensus 119 --------------------------~~~~v~iPaRp~LG~s-~~d~~~I~~~i~~~l~~a 152 (152) T protein:vir:10 119 --------------------------WDLKVKYASRELLGFT-DDDLQMIEDYMINILAGS 152 (152) T ss_pred --------------------------CCcceeccccccCCCC-HHHHHHHHHHHHHHHhcC Confidence 2346899999999654 455667777777777665 No 33 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=98.07 E-value=4.6e-09 Score=66.30 Aligned_cols=85 Identities=18% Similarity=0.167 Sum_probs=47.4 Q ss_pred Ccc----------cccHHHHHHHHHHHHHHHh---cCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceee Q lcl|NC_021342. 1 MMK----------VVGLQETLAELDKVLGQIR---DDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYG 67 (197) Q Consensus 1 M~k----------i~~~~~~~~~L~~l~~~~~---~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~ 67 (197) ... -.+...+...+ .+...|. +...+.|||..+ +++.||++|+||+++++..+ T Consensus 53 W~p~k~~~~~~k~~~~~~~l~~~~-~l~~sl~~~~~~~~a~vg~~~G---------~~~~yAaiHQfG~~~r~~~~---- 118 (150) T protein:vir:57 53 YAPRQQQSARKKTGRVKRKMFAKL-ITSRFLHIRASPEQASMEFYGG---------KSPKIASVHQFGLSEETRKD---- 118 (150) T ss_pred CcccChHHHHHhccCCCcccchhh-hhccceeeeeeCcEEEEEeecC---------CchhhhhhhhccccccccCC---- Confidence 110 01111111111 0111111 345677887533 36789999999998764221 Q ss_pred ecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 68 YATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 68 ~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~ 128 (197) .+.++||+||||--+ ++...++.+.+.+.+.. 
T Consensus 119 ----------------------------~~~~~iPaRp~LG~s-~~d~~~i~~~i~~~l~r 150 (150) T protein:vir:57 119 ----------------------------GKKIDYPARPLLGFT-GEDVQMIEEIILAHLDR 150 (150) T ss_pred ----------------------------CceeecCCcccCCCC-HHHHHHHHHHHHHHHhC Confidence 246899999999654 45566777777666666 No 34 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=98.07 E-value=8.3e-09 Score=64.90 Aligned_cols=84 Identities=21% Similarity=0.281 Sum_probs=52.7 Q ss_pred cccccHHHHHHHHHHHHHHHhc--CCeE-----------------------EEEeecCCCC--CC----CccchHHHHhh Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQIRD--DQYV-----------------------TVGIHEAAGD--VE----SGEINMATLGA 50 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~~--~~~V-----------------------~VGi~~~~~~--~~----~~~~~~A~iA~ 50 (197) |++.|.+++.+.|+++.+.+.. .+.| .-|-+.++-. .+ ....+.+.||. T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~~g~~~~~v~~~~~Ya~ 80 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIEDDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKKIGDLHYRVISTAHYSG 80 (115) T ss_pred CeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEeeCCCccch Confidence 9999999999999887654311 0000 0011111000 00 00123344555 Q ss_pred HhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 51 VLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGAS 127 (197) Q Consensus 51 ~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~ 127 (197) +.||| |...|+||||||+++.++.++.+.+++++. 
T Consensus 81 ~vEfG------------------------------------------T~km~a~PFl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 81 FLEFG------------------------------------------TRYMEPAPFMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred heecc------------------------------------------cccCCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 55555 567999999999999999999999999998 No 35 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=98.00 E-value=2.2e-08 Score=62.56 Aligned_cols=126 Identities=13% Similarity=0.205 Sum_probs=55.7 Q ss_pred Ccccc----cHHHHHHHHHHHHHHHhcC---CeEEE-----------EeecCCCCCCCccchHHHHhhHhHcCceeeeCC Q lcl|NC_021342. 1 MMKVV----GLQETLAELDKVLGQIRDD---QYVTV-----------GIHEAAGDVESGEINMATLGAVLNFGAEIDHPG 62 (197) Q Consensus 1 M~ki~----~~~~~~~~L~~l~~~~~~~---~~V~V-----------Gi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~ 62 (197) ||+++ |.+++.+.|++|...+.+. ..+.. -.|.+.+.-.. ++..-.....-| T Consensus 1 mm~~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~---~i~~~~~~~~~g------- 70 (148) T protein:vir:93 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRR---NVVVLSRRSRDG------- 70 (148) T ss_pred CcceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhh---hceeccccccCC------- Confidence 87765 8899999998885443211 11111 11111110000 000000000000 Q ss_pred Cce-eeecccccccccCC-ccc--cccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHH Q lcl|NC_021342. 63 GTS-YGYATEEAESRKEV-RFL--KTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEK 138 (197) Q Consensus 63 ~~~-~~~~~~~~~~~~~~-~f~--k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 138 (197) +.. .............. 
..+ ..+.....+++.--.|.++|||||||++++++++++.+.+.+.+...+ +.+|.+ T Consensus 71 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~pa~PFl~pA~~~~k~~~~~~~~~~~~~~i--~k~~~k 148 (148) T protein:vir:93 71 GMESGVHIRGVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAI--DEVLRR 148 (148) T ss_pred ceeeeeeecccccccccccceeecCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH--HHHhcC Confidence 000 00000000000000 011 112223345555667899999999999999999988887766554321 112222 No 36 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=97.99 E-value=4.2e-08 Score=61.04 Aligned_cols=122 Identities=11% Similarity=0.035 Sum_probs=49.0 Q ss_pred Ccc--cccHHHHHHHHHHHHHHHhc---CCeEEEEee--cC---CCCCCCccchHHHHhhHhHcCceeeeCCCceeeecc Q lcl|NC_021342. 1 MMK--VVGLQETLAELDKVLGQIRD---DQYVTVGIH--EA---AGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYAT 70 (197) Q Consensus 1 M~k--i~~~~~~~~~L~~l~~~~~~---~~~V~VGi~--~~---~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~ 70 (197) |++ |.|.+++.+.|+.|.....+ .+.+..|-. .+ ..-|.+.+.--..|... .+ +.+.+..+.... T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~--~~---~~~~~~~~~~~~ 75 (140) T protein:vir:10 1 MSSVQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTA--AL---KQKDSPGIATAG 75 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceec--cc---ccccccceeEEe Confidence 764 45888888888877654321 111111100 00 00000000000000000 00 000000000000 Q ss_pred cccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHH--------HHHHccC Q lcl|NC_021342. 71 EEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIE--------RGASRDE 130 (197) Q Consensus 71 ~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~--------~~~~~~~ 130 (197) .. ....+... ..+.-+.+++.--.+...||+||||++++++++++.+.+. 
+++.+++ T Consensus 76 ~~-~~~~~~~~--~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 76 VR-VRTKGKAD--SPNNAFYWRFVELGTQFMKAEPFMRPAFDASIAQAEGAIRTEIARAIDQVVGGGL 140 (140) T ss_pred ec-cccccccC--CCCcccccceeccCcCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 00 00000000 0011112233333477899999999999999988776654 4555666 No 37 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=97.98 E-value=1.3e-08 Score=63.76 Aligned_cols=80 Identities=18% Similarity=0.319 Sum_probs=43.2 Q ss_pred Ccccc-cHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccCC Q lcl|NC_021342. 1 MMKVV-GLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEV 79 (197) Q Consensus 1 M~ki~-~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~ 79 (197) .-++- ....+.+-|. -+ .+...|.||. +..||++|+||+++++ T Consensus 76 ~~~~L~~tg~L~~Si~---~~-~~~~~v~vGt-------------~~~yA~vHqfG~~~~~------------------- 119 (156) T protein:vir:19 76 PGSILTLHGDLARSIT---TD-YGQDYALIGS-------------PKIYAAIHQWGGTPDM------------------- 119 (156) T ss_pred CCcchhhhHHHHHHhh---he-ecCCEEEEec-------------chhhhHHhhcCccccc------------------- Confidence 11111 1111221111 11 1456788874 3579999999998753 Q ss_pred ccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHH Q lcl|NC_021342. 80 RFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEK 138 (197) Q Consensus 80 ~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 138 (197) .++.++||+||||--+ ++.++++.+.+.+.+.+. 
|.+ T Consensus 120 ---------------~~~~~~iPaRpfLG~s-~~d~~~I~~~i~~~l~~~------~~~ 156 (156) T protein:vir:19 120 ---------------APRPAGVPARPYMGLD-KTGEQEIFDAIRKRVSAA------LRQ 156 (156) T ss_pred ---------------CCCccccCCccccCCC-HHHHHHHHHHHHHHHHHH------hhC Confidence 1346789999999533 455566666555544441 111 No 38 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=97.97 E-value=1.1e-08 Score=64.29 Aligned_cols=85 Identities=14% Similarity=0.099 Sum_probs=46.9 Q ss_pred Cc---ccccHHHHH---------HHHHHHH--HHH---hcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCC Q lcl|NC_021342. 1 MM---KVVGLQETL---------AELDKVL--GQI---RDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGG 63 (197) Q Consensus 1 M~---ki~~~~~~~---------~~L~~l~--~~~---~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~ 63 (197) .. .-++..+.. ..+..+. ..+ .+...+.|||. + +++.||++|+||+++++.. T Consensus 54 W~prk~~~~~~~~~~~~g~~~~~~m~~~l~~a~~l~~~~~~d~a~Vg~~-G---------s~~~yAaiHQfG~~~r~~~- 122 (155) T protein:vir:79 54 YEPRKVKAGGKRLREKAGRVKREAMFRKLRTARYLRIDVDSTGLAIGFD-E---------RLSRIARVHQEGQKAPVEP- 122 (155) T ss_pred CcccchhhhhhhhhcccCcccchhhhhhhhhhheeeeeecCcEEEEEec-C---------cchhhhhhhhcCCcccCCC- Confidence 11 111111100 0011110 001 13456777763 1 3678999999998875422 Q ss_pred ceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 64 TSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 64 ~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~ 128 (197) ..+.++||+||||--+ ++...++.+.+.+.+.. 
T Consensus 123 -------------------------------~~~~v~iPaRp~LGls-~~d~~~I~~~i~~~l~r 155 (155) T protein:vir:79 123 -------------------------------GGPLAQYPVRVVLGFS-DADRELVRDRLLRELTR 155 (155) T ss_pred -------------------------------CCcccccccccccCCC-HHHHHHHHHHHHHHhhC Confidence 1357899999999644 45567777777776666 No 39 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=97.96 E-value=1.6e-08 Score=63.27 Aligned_cols=86 Identities=15% Similarity=0.238 Sum_probs=49.1 Q ss_pred Ccc--cccHHHHHHHHHHHHH--HHh---------------cCCeEEEEe----ecCC-----CCCCCccchHHHHhhHh Q lcl|NC_021342. 1 MMK--VVGLQETLAELDKVLG--QIR---------------DDQYVTVGI----HEAA-----GDVESGEINMATLGAVL 52 (197) Q Consensus 1 M~k--i~~~~~~~~~L~~l~~--~~~---------------~~~~V~VGi----~~~~-----~~~~~~~~~~A~iA~~~ 52 (197) |+. +.|.+++.+.|+++.. .+. ......+++ +.++ ........+.+.||.++ T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~Ya~~v 80 (114) T protein:vir:49 1 MATIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDKATVEALTSYSGYL 80 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCeeEecCCCCcccee Confidence 874 5588888888876521 110 000000010 0000 00000011234444455 Q ss_pred HcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 
53 NFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 53 EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~ 128 (197) ||| +..+|+||||||+++.++.++.+.+++.++- T Consensus 81 EfG------------------------------------------T~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 81 EVG------------------------------------------TRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred ccc------------------------------------------ccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 554 5679999999999999999999999988876 No 40 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=97.96 E-value=1.6e-08 Score=63.27 Aligned_cols=86 Identities=15% Similarity=0.238 Sum_probs=49.1 Q ss_pred Ccc--cccHHHHHHHHHHHHH--HHh---------------cCCeEEEEe----ecCC-----CCCCCccchHHHHhhHh Q lcl|NC_021342. 1 MMK--VVGLQETLAELDKVLG--QIR---------------DDQYVTVGI----HEAA-----GDVESGEINMATLGAVL 52 (197) Q Consensus 1 M~k--i~~~~~~~~~L~~l~~--~~~---------------~~~~V~VGi----~~~~-----~~~~~~~~~~A~iA~~~ 52 (197) |+. +.|.+++.+.|+++.. .+. ......+++ +.++ ........+.+.||.++ T Consensus 1 Ma~i~~~Gld~l~~~L~~~~~~~~v~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~~~~~V~~~~~Ya~~v 80 (114) T protein:vir:27 1 MATIEFEGLDEMAQSLLKNASPEKRSKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVESDKATVEALTSYSGYL 80 (114) T ss_pred CeeeeeehHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecCCeeEecCCCCcccee Confidence 874 5588888888876521 110 000000010 0000 00000011234444455 Q ss_pred HcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 
53 NFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 53 EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~ 128 (197) ||| +..+|+||||||+++.++.++.+.+++.++- T Consensus 81 EfG------------------------------------------T~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 81 EVG------------------------------------------TRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred ccc------------------------------------------ccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 554 5679999999999999999999999988876 No 41 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=97.95 E-value=1.2e-08 Score=63.92 Aligned_cols=84 Identities=14% Similarity=0.166 Sum_probs=46.9 Q ss_pred Cccc-------ccH--HHHHHHHHHHHHHHh---cCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeee Q lcl|NC_021342. 1 MMKV-------VGL--QETLAELDKVLGQIR---DDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGY 68 (197) Q Consensus 1 M~ki-------~~~--~~~~~~L~~l~~~~~---~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~ 68 (197) .... .|. +.+...| .+...|. ....+.|||. + +++.||++|+||+++++.. T Consensus 53 W~p~s~~~~~~~g~~~~~~~~~l-~~~~~l~~~~~~~~~~v~~~-G---------t~~~yAaiHQfG~~~r~~~------ 115 (148) T protein:vir:79 53 YVPRKPQLRHRAGRIRRAMFMRL-RLARYMKTQADANTAVVTFA-G---------NAQRIATVHQFGLRDRVNK------ 115 (148) T ss_pred CcccchHHHhhcccccccccchh-hhhhheeeeeeCCeeeEEee-c---------cchhhhhhhhcCccccccC------ Confidence 1100 000 0000001 0111111 2345667763 1 3678999999999876421 Q ss_pred cccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 
69 ATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 69 ~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~ 128 (197) ..+.++||+||||--+ ++...++.+.+.+.+.+ T Consensus 116 --------------------------~~~~v~iPaRp~LG~s-~~d~~~i~~~i~~~l~~ 148 (148) T protein:vir:79 116 --------------------------AGLTAQYPARELLGMD-GVDMEHITNLLLLHLGA 148 (148) T ss_pred --------------------------CCCccccCcccccCCC-HHHHHHHHHHHHHHhcC Confidence 1347899999999754 55667777888887777 No 42 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=97.93 E-value=2.5e-08 Score=62.24 Aligned_cols=84 Identities=17% Similarity=0.158 Sum_probs=47.6 Q ss_pred Cccc-------c---cHHHHHHHH--H-HHHHHHhcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceee Q lcl|NC_021342. 1 MMKV-------V---GLQETLAEL--D-KVLGQIRDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYG 67 (197) Q Consensus 1 M~ki-------~---~~~~~~~~L--~-~l~~~~~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~ 67 (197) .... . ....+...+ . .|..+. ....|.||+.. +++.||++|+||+++++..+ T Consensus 53 W~p~~~~~~~~k~~~~~~~l~~~g~l~~sl~~~~-~~~~~~V~~~G----------s~~~yAa~HQfG~~~r~~~~---- 117 (149) T protein:vir:98 53 YAARKRQSVRSKKGRIRREMFARLRTNRFMKAKG-SDSAAVVEFTG----------RVQRMARVHQYGLKDRPNRH---- 117 (149) T ss_pred CcccchHHHHhccCCCCcccchhhhhhhhhhhee-cCCeeEEEecC----------cchHHhhHhhccccccccCC---- Confidence 1000 0 000011111 1 111111 45578888752 36789999999998865321 Q ss_pred ecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 68 YATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 68 ~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~ 128 (197) .+.++||+||||--+ ++.+.++.+.+.+.+.. 
T Consensus 118 ----------------------------~~~~~iPaRp~LG~s-~~d~~~i~~~i~~~l~~ 149 (149) T protein:vir:98 118 ----------------------------SRDVQYAARPLLGFT-RDDEQMIEDIIIRHLGK 149 (149) T ss_pred ----------------------------CcceeccccccCCCC-HHHHHHHHHHHHHHhhC Confidence 246899999999533 55567777777776666 No 43 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=97.92 E-value=2.7e-08 Score=62.05 Aligned_cols=84 Identities=19% Similarity=0.226 Sum_probs=52.0 Q ss_pred cccccHHHHHHHHHHHHHHHhc--CCeE-----------------------EEEeecCCCC--CCCc----cchHHHHhh Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQIRD--DQYV-----------------------TVGIHEAAGD--VESG----EINMATLGA 50 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~~--~~~V-----------------------~VGi~~~~~~--~~~~----~~~~A~iA~ 50 (197) |++.|.+++.+.|+++.+++.. .+.| .-|-+..+-. .+++ ..+.+.||. T Consensus 1 i~i~Gld~L~~~l~~~~~~~~~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~~g~~~~~V~~~~~Ya~ 80 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNIDDDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKKTVDLQYTITSHAAYSG 80 (115) T ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeeecCcEEEEecCCccccc Confidence 9999999999999877654321 0000 0011100000 0000 112344444 Q ss_pred HhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 
51 VLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGAS 127 (197) Q Consensus 51 ~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~ 127 (197) +.||| |...|+||||+|+++.++.++.+.++++++ T Consensus 81 ~vE~G------------------------------------------T~~m~a~PFl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 81 FLEFG------------------------------------------TRYMEAEPFMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred ccccc------------------------------------------ccccCCCCcchhhHHHHHHHHHHHHHHHhC Confidence 44444 567999999999999999999999999998 No 44 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=97.90 E-value=5.9e-08 Score=60.21 Aligned_cols=112 Identities=14% Similarity=0.178 Sum_probs=52.7 Q ss_pred Ccccc--cHHHHHHHHHHHHHHHhc--CCeEEEE-----------eecCCCCCCCccchHHHHhhHhHcCceeeeCCCce Q lcl|NC_021342. 1 MMKVV--GLQETLAELDKVLGQIRD--DQYVTVG-----------IHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTS 65 (197) Q Consensus 1 M~ki~--~~~~~~~~L~~l~~~~~~--~~~V~VG-----------i~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~ 65 (197) |+++. |.+++.+.|++|...+.. ++.++.| .+.+ .+..+.--..-.+..+...+ .+.. T Consensus 1 M~~~~i~Gl~el~~~l~~l~~~~~~~~~~al~~~a~~v~~~~k~~ap~~-~~~tg~l~~~I~~~~~k~~~------~g~~ 73 (127) T protein:vir:12 1 MADMSFDGIDDLTQYFEKIGGDIEKVEPVALKAGGEIIAERQRSHVNRS-DKKQPHMQDNITVSNVRESK------DGVR 73 (127) T ss_pred CeeeeehhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHhCCCC-CCChhHHHHhhhcccccccc------Ccee Confidence 76655 888888888776554310 1111111 0100 00000000000000000000 0000 Q ss_pred eeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCc Q lcl|NC_021342. 66 YGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDES 131 (197) Q Consensus 66 ~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~ 131 (197) +. 
.....++....+++.--.|.++||||||+++++++++++.+.+.+.+...+- T Consensus 74 ~v------------~Vg~~~~~~~y~~f~E~GT~~~~a~Pf~~pa~~~~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 74 FV------------AVGPNKKVAYRGRFLEWGTSKMPPQPFIEKGGKEGEGPAVELMERILTAPIK 127 (127) T ss_pred EE------------EEeeCCCCcceeeeeccCccCCCCCccchHhHHHHHHHHHHHHHHHHHHhcC Confidence 00 0000111122344444457889999999999999999999988887776655 No 45 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=97.88 E-value=8.5e-08 Score=59.34 Aligned_cols=136 Identities=13% Similarity=0.123 Sum_probs=53.2 Q ss_pred Cc-----ccccHHHHHHHHHHHHHHHhcC---Ce-----------EEEEeecCCCCCCC----------------ccchH Q lcl|NC_021342. 1 MM-----KVVGLQETLAELDKVLGQIRDD---QY-----------VTVGIHEAAGDVES----------------GEINM 45 (197) Q Consensus 1 M~-----ki~~~~~~~~~L~~l~~~~~~~---~~-----------V~VGi~~~~~~~~~----------------~~~~~ 45 (197) |+ +|.|.+++.+.|++|.+++.+. +. ++--.+........ ..... T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~ 80 (179) T protein:vir:18 1 MADSVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGD 80 (179) T ss_pred CCceEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccccc Confidence 76 3449999999998886554210 00 00000000000000 00000 Q ss_pred HHHhhHhHcCceeeeCCCceeeeccc--ccccccCCcccc------ccccccccccccccccCCCcchhHHHHHHHHHHH Q lcl|NC_021342. 46 ATLGAVLNFGAEIDHPGGTSYGYATE--EAESRKEVRFLK------TGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNE 117 (197) Q Consensus 46 A~iA~~~EfGa~I~~p~~~~~~~~~~--~~~~~~~~~f~k------~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~ 117 (197) +.+....++|.. ...... ......+..+.. 
.+...+.++|.--.|.++||||||||++++++++ T Consensus 81 ~~~~vgv~~~~~--------~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~~~ 152 (179) T protein:vir:18 81 LAFRVGVMGGAR--------QYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVDND 152 (179) T ss_pred eeEeeecccccc--------cccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccchhhHHhhHHH Confidence 011111111110 000000 000011111111 1122344555555688999999999999999987 Q ss_pred HHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcH Q lcl|NC_021342. 118 YVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTELQDPPNAK 161 (197) Q Consensus 118 ~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~ 161 (197) ..+.+.+.+. ..|+..+..+..---+- T Consensus 153 a~~~i~~~l~-----------------~~i~k~lk~~~~~~~~~ 179 (179) T protein:vir:18 153 VINVFSTEMG-----------------KAIDRAIRLAMKKGTTA 179 (179) T ss_pred HHHHHHHHHH-----------------HHHHHHHHhhcccCCCC Confidence 7665543322 12222221111000000 No 46 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=97.87 E-value=2.8e-08 Score=62.00 Aligned_cols=89 Identities=16% Similarity=0.162 Sum_probs=49.8 Q ss_pred Ccc---------cccHHHHHHHHHHHHH--HHh---cCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCcee Q lcl|NC_021342. 1 MMK---------VVGLQETLAELDKVLG--QIR---DDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSY 66 (197) Q Consensus 1 M~k---------i~~~~~~~~~L~~l~~--~~~---~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~ 66 (197) ... -...++....+..+.. .+. +...+.|||.. 
+++.||++|+||+++++..+ T Consensus 54 W~p~~~~~~~~~~~~~~~~~~m~~~l~~~~~l~~~~~~~~a~vg~~G----------s~~~yA~iHQfG~~~~~~~~--- 120 (156) T protein:vir:11 54 YEPRKKRELRGKQGRIRRKIKMFQKLRTVRYLRAKGDAQAITVSFAG----------RIARIARVHQYGLRDRAEPG--- 120 (156) T ss_pred CcccchHHHhhhccccccchhhhhhhhhhheeeeeecCcEEEEEecC----------CchhhhhhhcccccccccCC--- Confidence 000 0000111111111110 011 34567777741 35789999999998764321 Q ss_pred eecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcH Q lcl|NC_021342. 67 GYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDEST 132 (197) Q Consensus 67 ~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~ 132 (197) .+.++||+||||--+ ++.+.++.+.+.+.+.+.... T Consensus 121 -----------------------------~~~v~iPaRp~LG~s-~~d~~~i~~~i~~~l~~~~~~ 156 (156) T protein:vir:11 121 -----------------------------APEVSYAQRLLLGFD-SSDMETIQNGILAHIDANSPI 156 (156) T ss_pred -----------------------------CCcccccccccCCCC-HHHHHHHHHHHHHHHhhcCCC Confidence 246899999999654 556677777777777765444 No 47 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=97.87 E-value=5.5e-08 Score=60.39 Aligned_cols=123 Identities=15% Similarity=0.171 Sum_probs=47.9 Q ss_pred Ccc--cccHHHHHHHHHHHHHHHhc---CCeEEEE-----------eecCCCCCCCccchHHHHhhHhHcCceeeeCCCc Q lcl|NC_021342. 1 MMK--VVGLQETLAELDKVLGQIRD---DQYVTVG-----------IHEAAGDVESGEINMATLGAVLNFGAEIDHPGGT 64 (197) Q Consensus 1 M~k--i~~~~~~~~~L~~l~~~~~~---~~~V~VG-----------i~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~ 64 (197) |+. +.|.+++.+.|+.|...... .+.|..| .|.+.+.--++ +..... .-.+++..+. 
T Consensus 1 Ma~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~-i~~~~~-~~~~~~~~~~----- 73 (140) T protein:vir:80 1 MSSIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRN-IVSAAL-RQKDAPGLAT----- 73 (140) T ss_pred CceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhc-eeeecc-ccccccceee----- Confidence 864 45888888888877544321 0001110 01100000000 000000 0000000000 Q ss_pred eeeecccccccccCCc-cccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHH Q lcl|NC_021342. 65 SYGYATEEAESRKEVR-FLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTA 143 (197) Q Consensus 65 ~~~~~~~~~~~~~~~~-f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~ 143 (197) ... ..+.. .....+....+++.--.+..+||||||+++++++++++.+.+.+.+...+ T Consensus 74 --~~~------~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l------------- 132 (140) T protein:vir:80 74 --AGV------RVRTKGKADSPSNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAI------------- 132 (140) T ss_pred --eee------ecccccccCCCCCcceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHH------------- Confidence 000 00000 00011112223333334788999999999999999888777654433211 Q ss_pred HHHHHHHHHhCC Q lcl|NC_021342. 144 QAAVRMFMTELQ 155 (197) Q Consensus 144 ~~~i~~~I~~~~ 155 (197) ++.+.... T Consensus 133 ----~k~~~~~~ 140 (140) T protein:vir:80 133 ----DQALGGRR 140 (140) T ss_pred ----HHHhhccC Confidence 00000000 No 48 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=97.86 E-value=4.1e-08 Score=61.07 Aligned_cols=76 Identities=17% Similarity=0.323 Sum_probs=40.3 Q ss_pred CcccccHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccCCc Q lcl|NC_021342. 
1 MMKVVGLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEVR 80 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~ 80 (197) +..-+| .+.+- +.-+. ....|.||- +..||++|+||+++.+ T Consensus 75 ~L~~tG--~L~~S---i~~~~-~~~~v~vGt-------------n~~YA~iHqfGg~~~~-------------------- 115 (155) T protein:vir:10 75 ILQVTN--ALARS---ITTRA-DRDQAQIGS-------------NLSYAAIQQLGGQAGR-------------------- 115 (155) T ss_pred ccccch--hhhhh---hhcee-cCCEEEEec-------------CcchhhhhhcccccCC-------------------- Confidence 222221 11111 11111 355677773 3579999999987642 Q ss_pred cccccccccccccccccccCCCcchhHHHH-HHH----HHHHHHHHHHHHHHccC Q lcl|NC_021342. 81 FLKTGTGFKPLGVTKPHKINIPARPWLEPG-VQS----KSNEYVTIIERGASRDE 130 (197) Q Consensus 81 f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t-~~~----~~~~~~~~~~~~~~~~~ 130 (197) .++++||+||||--. -++ -.+.+.+.+.+.+..+- T Consensus 116 ---------------~~~~~iPARPfLG~s~~~e~~~ei~~~I~~~i~~~l~~~r 155 (155) T protein:vir:10 116 ---------------GRKVTIPARPYLPVLRNGQLKPSARDAVLDVLLAALSQGR 155 (155) T ss_pred ---------------CCccccCCccccCCCccccchHHHHHHHHHHHHHHHhhcC Confidence 236899999999522 122 22444445555554444 No 49 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=97.86 E-value=8e-08 Score=59.48 Aligned_cols=120 Identities=12% Similarity=0.118 Sum_probs=48.5 Q ss_pred Ccc--cccHHHHHHHHHHHHHHHhc---CCeEEEEee--cC---CCCCCCccchHHHHhhHhHcCc-eeeeCCCceeeec Q lcl|NC_021342. 1 MMK--VVGLQETLAELDKVLGQIRD---DQYVTVGIH--EA---AGDVESGEINMATLGAVLNFGA-EIDHPGGTSYGYA 69 (197) Q Consensus 1 M~k--i~~~~~~~~~L~~l~~~~~~---~~~V~VGi~--~~---~~~~~~~~~~~A~iA~~~EfGa-~I~~p~~~~~~~~ 69 (197) |++ +.|.+++.+.|++|...... .+.+..|-. .+ ..-|.+++- ....... ..+......+... 
T Consensus 1 Ma~~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~------l~~sI~~~~~~~~~~~~~~~~ 74 (140) T protein:vir:10 1 MSSIQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGK------LRRNIVSAALRQKDAPGLATA 74 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhh------HHHhccccccccccccceEEe Confidence 764 45888888888877544321 001111100 00 000000000 0000000 0000000000000 Q ss_pred ccccccccCC-ccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHH--------HccC Q lcl|NC_021342. 70 TEEAESRKEV-RFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGA--------SRDE 130 (197) Q Consensus 70 ~~~~~~~~~~-~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~--------~~~~ 130 (197) . ...+. .....++....+++.--.+..+||+|||+++++++++++.+.+.+.+ .|.- T Consensus 75 g----~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:10 75 G----VRVRTKGKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred e----eeeccccccCCCCccceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 0 00000 00111122223334444578899999999999999988877655433 3333 No 50 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=97.83 E-value=1.1e-07 Score=58.72 Aligned_cols=121 Identities=12% Similarity=0.106 Sum_probs=49.5 Q ss_pred Ccc--cccHHHHHHHHHHHHHHHhc---CCeEEEEe--ecC---CCCCCCccchHHHHhhHhHcCce-eeeCCCceeeec Q lcl|NC_021342. 1 MMK--VVGLQETLAELDKVLGQIRD---DQYVTVGI--HEA---AGDVESGEINMATLGAVLNFGAE-IDHPGGTSYGYA 69 (197) Q Consensus 1 M~k--i~~~~~~~~~L~~l~~~~~~---~~~V~VGi--~~~---~~~~~~~~~~~A~iA~~~EfGa~-I~~p~~~~~~~~ 69 (197) |++ +.|.+++.+.|+.|...... .+.+.-|- ..+ ..-|.+++. + ....... .+-+.+..+... 
T Consensus 1 M~~~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~-l-----~~sI~~~~~~~~~~~~~~~v 74 (140) T protein:vir:14 1 MSSIQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGK-L-----RRNIVSAALRQKDAPGLATA 74 (140) T ss_pred CceeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhh-H-----HhhcccccccccccceeEEe Confidence 865 45889998888887544321 01111100 000 000001100 0 0000000 000000000000 Q ss_pred ccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHH--------HccC Q lcl|NC_021342. 70 TEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGA--------SRDE 130 (197) Q Consensus 70 ~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~--------~~~~ 130 (197) ....... .....+.....+++.--.+.++||||||++++++++.++.+.+.+.+ .+.- T Consensus 75 -g~~~~~~--~~~~~~~~~~y~~f~E~GT~~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~k~~~~~~ 140 (140) T protein:vir:14 75 -GVRVRTK--GKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGGRR 140 (140) T ss_pred -eeeeccc--cccCCCCccceeeeeccccCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 0000000 00111122233344444578899999999999999988877665433 3333 No 51 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=97.83 E-value=8.5e-08 Score=59.33 Aligned_cols=122 Identities=15% Similarity=0.160 Sum_probs=53.7 Q ss_pred Ccc-----cccHHHHHHHHHHHHHHHhc--CCeEEEE-----------eecCCCCCCCccchHHHHhhHhHcCceeeeCC Q lcl|NC_021342. 1 MMK-----VVGLQETLAELDKVLGQIRD--DQYVTVG-----------IHEAAGDVESGEINMATLGAVLNFGAEIDHPG 62 (197) Q Consensus 1 M~k-----i~~~~~~~~~L~~l~~~~~~--~~~V~VG-----------i~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~ 62 (197) |+. |.|.+++.+.|.+|...... .+.+..| +|.+.+... +..+...+... 
T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~------------~~~~~~~~~~~ 68 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKK------------RSKSEPWRTGQ 68 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccc------------ccccccccccc Confidence 664 77999998888877554210 0000000 111100000 00000000000 Q ss_pred Cceeee---cccccccc--cCCcccc-ccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHH Q lcl|NC_021342. 63 GTSYGY---ATEEAESR--KEVRFLK-TGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSIL 136 (197) Q Consensus 63 ~~~~~~---~~~~~~~~--~~~~f~k-~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l 136 (197) ....-. ......+. ..+.+-+ .+...+.+++.--.+.+.||+|||+++++++++++.+.+.+.+...+ ..+| T Consensus 69 ~~~~~i~~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l--~ka~ 146 (146) T protein:vir:10 69 HGADQIKVTKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEM--RLDL 146 (146) T ss_pred cccccceeccccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH--hhcC Confidence 000000 00000000 0111111 12233445555556789999999999999999998887776655432 1122 No 52 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=97.83 E-value=8.5e-08 Score=59.33 Aligned_cols=122 Identities=15% Similarity=0.160 Sum_probs=53.7 Q ss_pred Ccc-----cccHHHHHHHHHHHHHHHhc--CCeEEEE-----------eecCCCCCCCccchHHHHhhHhHcCceeeeCC Q lcl|NC_021342. 1 MMK-----VVGLQETLAELDKVLGQIRD--DQYVTVG-----------IHEAAGDVESGEINMATLGAVLNFGAEIDHPG 62 (197) Q Consensus 1 M~k-----i~~~~~~~~~L~~l~~~~~~--~~~V~VG-----------i~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~ 62 (197) |+. |.|.+++.+.|.+|...... .+.+..| +|.+.+... +..+...+... 
T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~------------~~~~~~~~~~~ 68 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKK------------RSKSEPWRTGQ 68 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccc------------ccccccccccc Confidence 664 77999998888877554210 0000000 111100000 00000000000 Q ss_pred Cceeee---cccccccc--cCCcccc-ccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHH Q lcl|NC_021342. 63 GTSYGY---ATEEAESR--KEVRFLK-TGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSIL 136 (197) Q Consensus 63 ~~~~~~---~~~~~~~~--~~~~f~k-~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l 136 (197) ....-. ......+. ..+.+-+ .+...+.+++.--.+.+.||+|||+++++++++++.+.+.+.+...+ ..+| T Consensus 69 ~~~~~i~~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l--~ka~ 146 (146) T protein:vir:10 69 HGADQIKVTKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEM--RLDL 146 (146) T ss_pred cccccceeccccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH--hhcC Confidence 000000 00000000 0111111 12233445555556789999999999999999998887776655432 1122 No 53 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=97.83 E-value=8.5e-08 Score=59.33 Aligned_cols=122 Identities=15% Similarity=0.160 Sum_probs=53.7 Q ss_pred Ccc-----cccHHHHHHHHHHHHHHHhc--CCeEEEE-----------eecCCCCCCCccchHHHHhhHhHcCceeeeCC Q lcl|NC_021342. 1 MMK-----VVGLQETLAELDKVLGQIRD--DQYVTVG-----------IHEAAGDVESGEINMATLGAVLNFGAEIDHPG 62 (197) Q Consensus 1 M~k-----i~~~~~~~~~L~~l~~~~~~--~~~V~VG-----------i~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~ 62 (197) |+. |.|.+++.+.|.+|...... .+.+..| +|.+.+... +..+...+... 
T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~------------~~~~~~~~~~~ 68 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKK------------RSKSEPWRTGQ 68 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccc------------ccccccccccc Confidence 664 77999998888877554210 0000000 111100000 00000000000 Q ss_pred Cceeee---cccccccc--cCCcccc-ccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHH Q lcl|NC_021342. 63 GTSYGY---ATEEAESR--KEVRFLK-TGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSIL 136 (197) Q Consensus 63 ~~~~~~---~~~~~~~~--~~~~f~k-~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l 136 (197) ....-. ......+. ..+.+-+ .+...+.+++.--.+.+.||+|||+++++++++++.+.+.+.+...+ ..+| T Consensus 69 ~~~~~i~~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l--~ka~ 146 (146) T protein:vir:10 69 HGADQIKVTKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEM--RLDL 146 (146) T ss_pred cccccceeccccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH--hhcC Confidence 000000 00000000 0111111 12233445555556789999999999999999998887776655432 1122 No 54 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=97.83 E-value=8.5e-08 Score=59.33 Aligned_cols=122 Identities=15% Similarity=0.160 Sum_probs=53.7 Q ss_pred Ccc-----cccHHHHHHHHHHHHHHHhc--CCeEEEE-----------eecCCCCCCCccchHHHHhhHhHcCceeeeCC Q lcl|NC_021342. 1 MMK-----VVGLQETLAELDKVLGQIRD--DQYVTVG-----------IHEAAGDVESGEINMATLGAVLNFGAEIDHPG 62 (197) Q Consensus 1 M~k-----i~~~~~~~~~L~~l~~~~~~--~~~V~VG-----------i~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~ 62 (197) |+. |.|.+++.+.|.+|...... .+.+..| +|.+.+... +..+...+... 
T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~------------~~~~~~~~~~~ 68 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRGEKIEDKALAAGGEPIRKAIAERAPRSPSPKK------------RSKSEPWRTGQ 68 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccc------------ccccccccccc Confidence 664 77999998888877554210 0000000 111100000 00000000000 Q ss_pred Cceeee---cccccccc--cCCcccc-ccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHH Q lcl|NC_021342. 63 GTSYGY---ATEEAESR--KEVRFLK-TGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSIL 136 (197) Q Consensus 63 ~~~~~~---~~~~~~~~--~~~~f~k-~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l 136 (197) ....-. ......+. ..+.+-+ .+...+.+++.--.+.+.||+|||+++++++++++.+.+.+.+...+ ..+| T Consensus 69 ~~~~~i~~~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l--~ka~ 146 (146) T protein:vir:10 69 HGADQIKVTKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEM--RLDL 146 (146) T ss_pred cccccceeccccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHH--hhcC Confidence 000000 00000000 0111111 12233445555556789999999999999999998887776655432 1122 No 55 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=97.82 E-value=4.2e-08 Score=61.01 Aligned_cols=75 Identities=17% Similarity=0.223 Sum_probs=39.5 Q ss_pred CcccccHHHHHHHHHHHHHHHh---cCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeeccccccccc Q lcl|NC_021342. 1 MMKVVGLQETLAELDKVLGQIR---DDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRK 77 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~~~---~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~ 77 (197) ...-+| .+.+ .+..++. 
+...|.|| ++..||++|+||+ T Consensus 65 ~L~~tG--~L~~---Si~~~~~~~~~~~~a~vG-------------tn~~YA~~hqfG~--------------------- 105 (145) T protein:vir:31 65 PLIDNS--RLLT---DINAASMMDRANRMAVIG-------------TNLDYAEHHEFGA--------------------- 105 (145) T ss_pred CCccCH--HHHH---HHHHHhhhcccCceeEec-------------CCchhhhhhccCC--------------------- Confidence 222221 1221 2222221 23345555 2457999999995 Q ss_pred CCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc---cCcHH Q lcl|NC_021342. 78 EVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR---DESTT 133 (197) Q Consensus 78 ~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~---~~~~~ 133 (197) ..++||+||||-.+....++++.+.+.+.+.. +.-.+ T Consensus 106 -------------------~~~~IPaRPfLG~~~~~~~~~~~~ii~~~i~~~L~~~~~~ 145 (145) T protein:vir:31 106 -------------------PEAGIPARPIFGPAGAYASQQAPDVIGDEIDTNLEGAVID 145 (145) T ss_pred -------------------cccccCCCCccCCCccchHHHHHHHHHHHHHHHhhhhccC Confidence 24789999999876655555665555544332 11111 No 56 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=97.80 E-value=4.7e-08 Score=60.74 Aligned_cols=89 Identities=11% Similarity=0.124 Sum_probs=55.8 Q ss_pred Cc-ccccHHHHHHHHHHHHHHHhcC-------------------CeEEEEeecCC-----CCCCCccchHHHHhhHhHcC Q lcl|NC_021342. 1 MM-KVVGLQETLAELDKVLGQIRDD-------------------QYVTVGIHEAA-----GDVESGEINMATLGAVLNFG 55 (197) Q Consensus 1 M~-ki~~~~~~~~~L~~l~~~~~~~-------------------~~V~VGi~~~~-----~~~~~~~~~~A~iA~~~EfG 55 (197) |- ++.|.+++.+.|+++.+.+... 
.-|.-|-+.++ +..+....+.+.||.+.||| T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~g~~~~V~~~~~Ya~yvE~G 80 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAVEQSLQVLKNNGEKGKRIAKQLAPKDTEFLKDHITTSYPGMEAHIHGEAGYDGYQEYG 80 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhceeeecCceEEEeecCCCccceeecC Confidence 54 7779999998888877643210 00111111110 00011112344555555555 Q ss_pred ceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCc Q lcl|NC_021342. 56 AEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDES 131 (197) Q Consensus 56 a~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~ 131 (197) +...|+|||||++++.++.++.+.+.+.+..++- T Consensus 81 ------------------------------------------T~~~~aqPfl~pa~~~~~~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 81 ------------------------------------------TRFQPGTPHFRPMMEQIQPQFQKDMTDVMKGAFK 114 (114) T ss_pred ------------------------------------------ccccCCCccchhhHHHHHHHHHHHHHHHHHhhcC Confidence 5678999999999999999999999988888766 No 57 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=97.80 E-value=1.9e-07 Score=57.48 Aligned_cols=134 Identities=19% Similarity=0.218 Sum_probs=59.3 Q ss_pred Ccccc--cHHHHHHHHHHHHHHHhc-----------------------CCeEEEEeecCCC----CCCC-----ccchHH Q lcl|NC_021342. 1 MMKVV--GLQETLAELDKVLGQIRD-----------------------DQYVTVGIHEAAG----DVES-----GEINMA 46 (197) Q Consensus 1 M~ki~--~~~~~~~~L~~l~~~~~~-----------------------~~~V~VGi~~~~~----~~~~-----~~~~~A 46 (197) ||++. 
|.+++.+.|+++-+.+.+ ..-|.-|-+..+= ..++ ...+.+ T Consensus 1 m~~v~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~~~~~~g~V~~~~ 80 (182) T protein:vir:10 1 MIEVELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVDGDEVIGRWWNSS 80 (182) T ss_pred CeEEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeecCCeEEEEeecCC Confidence 88776 889888888665322110 0011122111110 0011 123557 Q ss_pred HHhhHhHcCceeeeCCC-------ceeeeccccc-ccccCCccccc-ccccccccccc-----ccccCCCcchhHHHHHH Q lcl|NC_021342. 47 TLGAVLNFGAEIDHPGG-------TSYGYATEEA-ESRKEVRFLKT-GTGFKPLGVTK-----PHKINIPARPWLEPGVQ 112 (197) Q Consensus 47 ~iA~~~EfGa~I~~p~~-------~~~~~~~~~~-~~~~~~~f~k~-~~g~~~~~~~~-----~~~v~IP~RpFlr~t~~ 112 (197) .||.+.|||.-+.-... ....+....+ -....+.|--. ..++. .+.. ..+-..||||||+++++ T Consensus 81 ~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~--~~~~~~~~~~~t~G~~aqPFl~pA~~ 158 (182) T protein:vir:10 81 MVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIP--KIKINGKYFYRTTGQPARQFMTPAAN 158 (182) T ss_pred CccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccc--eeeecCceEeecCCCCCCcchHHHHH Confidence 89999999964321000 0000000000 00000000000 00000 0000 12457899999999999 Q ss_pred HHHHHHHHHHHHHHHccCcHHHHHHHHHH Q lcl|NC_021342. 113 SKSNEYVTIIERGASRDESTTSILEKVGV 141 (197) Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~l~~iG~ 141 (197) ++++++.+.+.+.+...+. +.+|- T Consensus 159 ~~~~~i~~~i~~~i~~~l~-----~~~g~ 182 (182) T protein:vir:10 159 KMAKEAPEIIKRSIDQELH-----DKLGG 182 (182) T ss_pred HhHHHHHHHHHHHHHHHHH-----HhhcC Confidence 9999998887765543110 01111 No 58 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=97.80 E-value=2.9e-08 Score=61.93 Aligned_cols=81 Identities=22% Similarity=0.368 Sum_probs=36.7 Q ss_pred Ccccc-------cHHHHHHHHHHHHHHHh---cCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecc Q lcl|NC_021342. 
1 MMKVV-------GLQETLAELDKVLGQIR---DDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYAT 70 (197) Q Consensus 1 M~ki~-------~~~~~~~~L~~l~~~~~---~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~ 70 (197) ..++. +..+++..=-.|...+. ....|.|| ++..||++|+||+.+.. T Consensus 76 ~~~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vG-------------tn~~YAaiHqfGg~~~~---------- 132 (175) T protein:vir:10 76 NGELTAAASRRKAGLMILQDSGQMAASVSTDHDDNSAVIG-------------SNKEYAAIHQFGGQAGR---------- 132 (175) T ss_pred hhhhhhhhhhhccCCCcceechhhhhhhheeecCCEEEEe-------------cChhhhhhhhcccccCC---------- Confidence 10000 00011000001111111 23455555 34679999999975421 Q ss_pred cccccccCCccccccccccccccccccccCCCcchhHHHHHH---------HHHHHHHHHHHHHHHcc Q lcl|NC_021342. 71 EEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQ---------SKSNEYVTIIERGASRD 129 (197) Q Consensus 71 ~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~---------~~~~~~~~~~~~~~~~~ 129 (197) .+.++||+||||--+-+ +-.+.|.+.+.+++.+. T Consensus 133 -------------------------~~~v~iPaRpfLG~s~~d~~~~e~~~~Il~~~~~~l~~~~~~~ 175 (175) T protein:vir:10 133 -------------------------GLKVTIPARPWLPVTADGELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred -------------------------CCccccCCccccCCCcccccchHHHHHHHHHHHHHHHHHhccC Confidence 24689999999964321 12233333344444444 No 59 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=97.78 E-value=1e-07 Score=58.85 Aligned_cols=126 Identities=14% Similarity=0.121 Sum_probs=53.1 Q ss_pred Cc-----ccccHHHHHHHHHHHH--HH---Hh-----cC-----CeEEEEeecCCCCCCCccchHHHHhhHhHcCceeee Q lcl|NC_021342. 1 MM-----KVVGLQETLAELDKVL--GQ---IR-----DD-----QYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDH 60 (197) Q Consensus 1 M~-----ki~~~~~~~~~L~~l~--~~---~~-----~~-----~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~ 60 (197) |+ ++.|.+++.+.|++|. +. +. .. ..++--+|.+....... 
...+...| +.+ T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~------~~~~~~~~-~~~- 72 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSG------RKGSRPPG-HAA- 72 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCccccc------cccccccc-hhh- Confidence 76 5669899888888772 11 10 00 01111122111100000 00000000 000 Q ss_pred CCCceeeecccccccc--cCCcccc-ccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHH Q lcl|NC_021342. 61 PGGTSYGYATEEAESR--KEVRFLK-TGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILE 137 (197) Q Consensus 61 p~~~~~~~~~~~~~~~--~~~~f~k-~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~ 137 (197) .+-....... ..+. ..+-+-+ .++..+.++|.--.|.+.||+||||++++++++++.+.+.+.+.. T Consensus 73 -d~i~~~~~~~-~~g~~~~~VG~~~~~~~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k--------- 141 (149) T protein:vir:13 73 -NNIPEPKIRK-KKGNLQCVVGWEKSDNTPFYYMKMEEWGTSERPPHHAFGKTNKILKRVYDNIAQKKYDN--------- 141 (149) T ss_pred -hcceeccccc-ccceeEEEeeccCCCCCccceeeeeccCccCCCCCccchHHHHHHHHHHHHHHHHHHHH--------- Confidence 0000000000 0000 0011111 123345666666779999999999999999999888776543322 Q ss_pred HHHHHHHHHHHHHHHh Q lcl|NC_021342. 138 KVGVTAQAAVRMFMTE 153 (197) Q Consensus 138 ~iG~~~~~~i~~~I~~ 153 (197) .|++.+.+ T Consensus 142 --------~i~~~lG~ 149 (149) T protein:vir:13 142 --------FVKEKLGD 149 (149) T ss_pred --------HHHHHhcC Confidence 11111111 No 60 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=97.76 E-value=4.8e-08 Score=60.70 Aligned_cols=76 Identities=20% Similarity=0.337 Sum_probs=44.0 Q ss_pred CcccccHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccCCc Q lcl|NC_021342. 
1 MMKVVGLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEVR 80 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~ 80 (197) +..-+| .+.+-|. -+ .+...|.||- +..||++|+||+.+.. T Consensus 75 iL~~tg--~L~~Si~---~~-~~~~~v~vGt-------------n~~YA~iHqfGg~~~~-------------------- 115 (155) T protein:vir:99 75 ILQVTN--ALARSVT---TW-ADRNEAGIGS-------------NLVYAAIHQFGGDAGR-------------------- 115 (155) T ss_pred cchhch--hhhhhhh---ce-ecCCEEEEec-------------CccchhhhhcccccCC-------------------- Confidence 222221 1111111 11 1345677763 3569999999987631 Q ss_pred cccccccccccccccccccCCCcchhHHHHH-----HHHHHHHHHHHHHHHHccC Q lcl|NC_021342. 81 FLKTGTGFKPLGVTKPHKINIPARPWLEPGV-----QSKSNEYVTIIERGASRDE 130 (197) Q Consensus 81 f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~-----~~~~~~~~~~~~~~~~~~~ 130 (197) .+.++||+||||--+- .+..+++.+.+.+.++.+- T Consensus 116 ---------------~~~v~iPaRpfLG~s~~~~l~~e~~~~I~~~i~~~l~~~~ 155 (155) T protein:vir:99 116 ---------------GHQVEIPARRYLPFDENGQLAAGARQSILEIVLTALSRNR 155 (155) T ss_pred ---------------CCccccCCccccCCCCccccchHHHHHHHHHHHHHHhccC Confidence 2468999999995332 2445667777777776655 No 61 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=97.74 E-value=1.4e-08 Score=63.60 Aligned_cols=81 Identities=17% Similarity=0.323 Sum_probs=36.9 Q ss_pred CcccccHHHHHHHHHHHHHHHh---cCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeeccccccccc Q lcl|NC_021342. 1 MMKVVGLQETLAELDKVLGQIR---DDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRK 77 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~~~---~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~ 77 (197) +.+..+..+++..=-.|...+. ....|.||- +..||++|+||+.+.. 
T Consensus 83 ~~~~~~~~~~L~~tG~L~~Si~~~~~~~~v~vGt-------------n~~YAaiHqfGg~~~~----------------- 132 (175) T protein:vir:79 83 ASRRKAGLMILQDSGQMAASTATDSGEDYSVIGS-------------NKEYAAIQHFGGQAGR----------------- 132 (175) T ss_pred HhhhccCCCcceechhhhhhhhheecCCEEEEec-------------CcchhhHhhcccccCC----------------- Confidence 1111111111110011211111 345666763 4579999999975310 Q ss_pred CCccccccccccccccccccccCCCcchhHHHHHHHH-----HHHHHHH----HHHHHHcc Q lcl|NC_021342. 78 EVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSK-----SNEYVTI----IERGASRD 129 (197) Q Consensus 78 ~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~-----~~~~~~~----~~~~~~~~ 129 (197) .+.++||+||||--+-+.. .+++.+. +++++.+. T Consensus 133 ------------------~~~v~IPARPfLG~s~~de~~~~~~~~I~~~i~~~l~~a~~~~ 175 (175) T protein:vir:79 133 ------------------GLKVTIPGRAWLPVTADGELQPEAVEPVLNTILRHLMDAANRR 175 (175) T ss_pred ------------------CcccccCcccccCCCcccchhHHHHHHHHHHHHHHHHHHhccC Confidence 1357999999995332111 2333333 33444444 No 62 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=97.71 E-value=9.1e-08 Score=59.17 Aligned_cols=76 Identities=20% Similarity=0.346 Sum_probs=42.7 Q ss_pred CcccccHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccCCc Q lcl|NC_021342. 1 MMKVVGLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEVR 80 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~ 80 (197) +..-+| .+.+- +.-+. ....|.|| ++..||++|+||+.+.. 
T Consensus 75 iL~~tG--~L~~S---i~~~~-~~~~v~vG-------------t~~~YA~iHqfGg~~~~-------------------- 115 (155) T protein:vir:79 75 ILQVTN--ALARS---VTTWA-DRNEAGIG-------------SNLVYAAIHQFGGDAGR-------------------- 115 (155) T ss_pred ccccch--hhhhh---hhcee-cCCEEEEe-------------cCchhhhhhhcccccCC-------------------- Confidence 222121 11111 11111 34556666 24579999999987642 Q ss_pred cccccccccccccccccccCCCcchhHHHHH-----HHHHHHHHHHHHHHHHccC Q lcl|NC_021342. 81 FLKTGTGFKPLGVTKPHKINIPARPWLEPGV-----QSKSNEYVTIIERGASRDE 130 (197) Q Consensus 81 f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~-----~~~~~~~~~~~~~~~~~~~ 130 (197) .+.++||+||||--.- .+..+++.+.+.+.+..+- T Consensus 116 ---------------~~~v~iPaRpfLG~s~~~~l~~~~~~~I~~~i~~~l~r~r 155 (155) T protein:vir:79 116 ---------------GHQVEIPARRYLPFDENGQLAAGARQSILEVVLTALSRNR 155 (155) T ss_pred ---------------CCccccCCccccCCCCccccchHHHHHHHHHHHHHHHhcC Confidence 2468999999995332 2334567777777775544 No 63 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=97.71 E-value=1.5e-07 Score=58.06 Aligned_cols=83 Identities=18% Similarity=0.229 Sum_probs=52.0 Q ss_pred cccHHHHHHHHHHHHHHHhc-------------------CCeEEEEeecCC------CCCCCccchHHHHhhHhHcCcee Q lcl|NC_021342. 4 VVGLQETLAELDKVLGQIRD-------------------DQYVTVGIHEAA------GDVESGEINMATLGAVLNFGAEI 58 (197) Q Consensus 4 i~~~~~~~~~L~~l~~~~~~-------------------~~~V~VGi~~~~------~~~~~~~~~~A~iA~~~EfGa~I 58 (197) +.|.+++.+.|+++.+.+.. 
..-|.-|-+.++ +.-...-.+.+.||.+.||| T Consensus 1 i~Gld~l~~~l~~~~~~~~~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~~~~~v~~~~~Ya~~vE~G--- 77 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVRIAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQQRLLHYRVVSPALYSIYLELG--- 77 (108) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeeecCcEEEEeecCcccchhcccC--- Confidence 88888888777777554320 000111111110 00001112446677777777 Q ss_pred eeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 59 DHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 59 ~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~ 128 (197) +...|+||||+++++.++.++.+.+++.++. T Consensus 78 ---------------------------------------T~~m~a~Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 78 ---------------------------------------TRKMEAQSFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred ---------------------------------------ccccCCCcchhhhHHHHHHHHHHHHHHHhcC Confidence 4568999999999999999999999988888 No 64 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=97.69 E-value=2.4e-07 Score=56.87 Aligned_cols=109 Identities=14% Similarity=0.082 Sum_probs=55.2 Q ss_pred Ccccc---cHHHHHHHHHHHHHHHh-------------------cCCeEEEEeecCC--------CC-CCCccchHHHHh Q lcl|NC_021342. 1 MMKVV---GLQETLAELDKVLGQIR-------------------DDQYVTVGIHEAA--------GD-VESGEINMATLG 49 (197) Q Consensus 1 M~ki~---~~~~~~~~L~~l~~~~~-------------------~~~~V~VGi~~~~--------~~-~~~~~~~~A~iA 49 (197) |++++ +.+++.+.|+.+.+++. ...-|.-|-+..+ +. 
-.....+.+.|| T Consensus 1 Ma~~~~~~~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~g~~~~~~v~~~~~YA 80 (142) T protein:vir:94 1 MAGLNYRVNSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSGGRFSFSVTIGTNVTYA 80 (142) T ss_pred CceeEEEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeccCCceEEEEEecCcccc Confidence 87775 77766666655443221 0111122222111 00 011123568999 Q ss_pred hHhHcCceee--eCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 50 AVLNFGAEID--HPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGA 126 (197) Q Consensus 50 ~~~EfGa~I~--~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~ 126 (197) .++|||.-.. .|.+...+.... +..|.+. ...-.+||||||++++++++.++.+.++++= T Consensus 81 ~~vE~Gt~~~~i~pk~~k~l~~~~------~~~~~~~-----------v~~pG~~~~pfl~~A~~~~~~~i~~~~~~~~ 142 (142) T protein:vir:94 81 ADVEYGTAPHVIVPKDKKALYWPG------AAHPVAK-----------VNHPGTRAQPFMRPAIAAASTFLRNHAKGIR 142 (142) T ss_pred hhhhccCCCceeccCCCccceecc------cceeeee-----------eeecCCCCCcchhHHHHHHHHHHHHHHHhcC Confidence 9999996321 122222211110 0011100 0112489999999999999988887776654 No 65 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=97.67 E-value=1.4e-07 Score=58.09 Aligned_cols=110 Identities=18% Similarity=0.157 Sum_probs=59.6 Q ss_pred Ccccc-cHHHHHHHHHHHHHHHh-------------------cCCeEEEEeecCCC-------CCCCccchHHHHhhHhH Q lcl|NC_021342. 1 MMKVV-GLQETLAELDKVLGQIR-------------------DDQYVTVGIHEAAG-------DVESGEINMATLGAVLN 53 (197) Q Consensus 1 M~ki~-~~~~~~~~L~~l~~~~~-------------------~~~~V~VGi~~~~~-------~~~~~~~~~A~iA~~~E 53 (197) |+++. |.+++.+.|+++.+++. 
...-|.-|-+..+- .......+.+.||.+.| T Consensus 1 Ma~~~~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVN 80 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeecCcEEEEEecCCCcccccc Confidence 99997 99998888877665541 00011112211110 01111235688999999 Q ss_pred cCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 54 FGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIE 123 (197) Q Consensus 54 fGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~ 123 (197) ||..+....+.... ......+.....+.. ..+..+|+||||+++++++++++.+.+. T Consensus 81 ~GT~~~~~~~~~~~-------~~~~~~~~~~~~~~~------~~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 81 YGTGIYATGAGGSR-------AKKIPWSYKDANGKW------HTTKGQHAQPFWEPAIDAGRVFFNKYFS 137 (137) T ss_pred cCccccccCCCccc-------ccccccceeccCCce------eecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 99654321111000 000000000011111 1255799999999999999999998887 No 66 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=97.66 E-value=1.4e-07 Score=58.18 Aligned_cols=109 Identities=17% Similarity=0.182 Sum_probs=59.3 Q ss_pred Ccccc-cHHHHHHHHHHHHHHHh-------------------cCCeEEEEeecCCCC---C----CCccchHHHHhhHhH Q lcl|NC_021342. 1 MMKVV-GLQETLAELDKVLGQIR-------------------DDQYVTVGIHEAAGD---V----ESGEINMATLGAVLN 53 (197) Q Consensus 1 M~ki~-~~~~~~~~L~~l~~~~~-------------------~~~~V~VGi~~~~~~---~----~~~~~~~A~iA~~~E 53 (197) |+++. |.+++.+.|+++.+++. ...-|.-|-+..+-. . 
.+...+.+.||.+.| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:10 1 MAKVKYGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINIGSEYAVYVN 80 (137) T ss_pred CccchhCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEecCCcEEEEEecCCccccccc Confidence 99987 88888888877665431 011112232222110 0 111235688999999 Q ss_pred cCceeeeCCCceeeecccccccccCCc-cccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 54 FGAEIDHPGGTSYGYATEEAESRKEVR-FLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIE 123 (197) Q Consensus 54 fGa~I~~p~~~~~~~~~~~~~~~~~~~-f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~ 123 (197) ||..+....... ......+ +.....+.. ..+..+|+||||++++++++.++.+.+. T Consensus 81 ~GT~~~~~~~~~--------~~~~~~~~~~~~~~~~~------~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 81 YGTGIYAVGPGG--------SRAKNIPWRYKDADGHW------HTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCc--------ccccccceeeecccccc------ccCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 996443211000 0000000 000011110 1256799999999999999999988887 No 67 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=97.62 E-value=1.6e-07 Score=57.81 Aligned_cols=128 Identities=15% Similarity=0.187 Sum_probs=54.1 Q ss_pred Ccccc----cHHHHHHHHHHHHHHHhc---CCe-----------EEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCC Q lcl|NC_021342. 1 MMKVV----GLQETLAELDKVLGQIRD---DQY-----------VTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPG 62 (197) Q Consensus 1 M~ki~----~~~~~~~~L~~l~~~~~~---~~~-----------V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~ 62 (197) ||.+. |.+++.+.|+.|...+.. ... 
++--.|.+.+.-..+ +....--.-+ .| .+ T Consensus 1 mm~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~s-i~~~~~~~~~-~~-~~---- 73 (149) T protein:vir:19 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKN-VVVVTQKSRR-RG-EI---- 73 (149) T ss_pred CcceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhh-cccccccccc-cc-ce---- Confidence 87665 888998888887544321 011 111112111110000 0000000000 00 00 Q ss_pred Cceeeecccccccc--cCCcc-ccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHH Q lcl|NC_021342. 63 GTSYGYATEEAESR--KEVRF-LKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEK 138 (197) Q Consensus 63 ~~~~~~~~~~~~~~--~~~~f-~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 138 (197) ............. ..... ...+...+.++|.--.+.++|||||||++++++++++.+.+.+.+...+ +.+|.+ T Consensus 74 -~~~v~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PF~~pA~~~~k~~~~~~~~~~l~~~l--~k~~~k 149 (149) T protein:vir:19 74 -SSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAI--DEVLSK 149 (149) T ss_pred -eecccccccccccccccceeecCCCCccceeeeeccCCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHH--HHHhcC Confidence 0000000000000 00000 0112223344555556889999999999999999988877766554311 111111 No 68 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=97.61 E-value=4.6e-07 Score=55.33 Aligned_cols=86 Identities=6% Similarity=0.015 Sum_probs=63.9 Q ss_pred HHHHHHHHHHHHHHHHHc--cCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCCcHHHHHhcCC--CCchhHHHHHH Q lcl|NC_021342. 111 VQSKSNEYVTIIERGASR--DESTTSILEKVGVTAQAAVRMFMTEL------QDPPNAKSTIRKKGS--SNPLIDTGALR 180 (197) Q Consensus 111 ~~~~~~~~~~~~~~~~~~--~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppna~~Ti~~KG~--~~PLidTG~L~ 180 (197) +++ -.++...+...+.. ..+...+|..||..+....++.|++. .|+|+++.|+++|+. .+||+++|.|. 
T Consensus 1 m~d-~~~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~g~l~ 79 (149) T protein:vir:98 1 MSE-LTALQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAPDGTPYAARKRQSVRSKKGRIRREMFARLRTN 79 (149) T ss_pred Cch-HHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHhccCCCCcccchhhhhh Confidence 332 23344444444433 12456789999999999999999974 589999999998874 58999999999 Q ss_pred hhceeeeeccccccc--cC Q lcl|NC_021342. 181 QSVTYVVHSGKLPDE--GL 197 (197) Q Consensus 181 ~SIty~V~~k~~~~~--~~ 197 (197) +||++.+....+... |- T Consensus 80 ~sl~~~~~~~~~~V~~~Gs 98 (149) T protein:vir:98 80 RFMKAKGSDSAAVVEFTGR 98 (149) T ss_pred hhhhheecCCeeEEEecCc Confidence 999999887766553 33 No 69 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=97.57 E-value=1.4e-07 Score=58.19 Aligned_cols=84 Identities=19% Similarity=0.268 Sum_probs=47.3 Q ss_pred Ccc--cccHHHHHHHHHHHH--HHHh---------------cCC----eEEEEeecCCCCCCCc-----cchHHHHhhHh Q lcl|NC_021342. 1 MMK--VVGLQETLAELDKVL--GQIR---------------DDQ----YVTVGIHEAAGDVESG-----EINMATLGAVL 52 (197) Q Consensus 1 M~k--i~~~~~~~~~L~~l~--~~~~---------------~~~----~V~VGi~~~~~~~~~~-----~~~~A~iA~~~ 52 (197) |++ +.|.+++.+.|+++. +++. ... -|.-|-+.++-.-..+ ..+.+.||.+. T Consensus 1 Ma~i~i~Gld~L~~~l~~~~~~~~v~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~~~~~~~~v~~~~~Ya~~v 80 (112) T protein:vir:96 1 MATIEFEGLDEMAQSLLKNASSERRSKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLEAGSDRAVVEALTNYSGYL 80 (112) T ss_pred CceeeehHHHHHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeeecCceEEEecCCCCcccee Confidence 874 558888888776652 1110 000 0111111110000000 12334555566 Q ss_pred HcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 
53 NFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGA 126 (197) Q Consensus 53 EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~ 126 (197) ||| |..+|+||||+|+++.++.++.+.+++.- T Consensus 81 E~G------------------------------------------Tr~m~AqPF~~PA~~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 81 EVG------------------------------------------TRKMEAQPFMRPALDQVVPEMVEEMAKWE 112 (112) T ss_pred ccC------------------------------------------ccccCCCCchhhhHHHHHHHHHHHHHhcC Confidence 665 56799999999999999999888877665 No 70 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=97.52 E-value=1.5e-07 Score=58.05 Aligned_cols=91 Identities=16% Similarity=0.263 Sum_probs=51.5 Q ss_pred Cc-----ccccHHHHHHHHHHHHHHHhcC-------------------CeEEEEeecCCC-----CCC-----CccchHH Q lcl|NC_021342. 1 MM-----KVVGLQETLAELDKVLGQIRDD-------------------QYVTVGIHEAAG-----DVE-----SGEINMA 46 (197) Q Consensus 1 M~-----ki~~~~~~~~~L~~l~~~~~~~-------------------~~V~VGi~~~~~-----~~~-----~~~~~~A 46 (197) |+ ++.|.+++.+.|+++.+.+... .-+.-|-+.++= ... ....+.+ T Consensus 1 Ma~~~~i~~~Gld~l~~~L~~~~~~~~~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~~~~~~~~v~~~~ 80 (125) T protein:vir:94 1 MANDFNIKFKGVDKLLDEFDISRKELVPYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVKEEHGVVTGRYVARA 80 (125) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecceeccCCcEEEEeeCCC Confidence 54 5679999988888876543210 000111111100 000 0011334 Q ss_pred HHhhHhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 
47 TLGAVLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGA 126 (197) Q Consensus 47 ~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~ 126 (197) .||.+.||| +...|+||||++++++++.++.+.+++.+ T Consensus 81 ~Ya~~vEfG------------------------------------------T~~~~a~Pfl~pa~~~~~~~~~~~l~~~l 118 (125) T protein:vir:94 81 DYSSYNEYG------------------------------------------TYRMSAQPFMAPSVAAMTPFFYKAVRDAL 118 (125) T ss_pred Cccceeecc------------------------------------------cccCCCCcccchhHHHHHHHHHHHHHHHH Confidence 555555555 56789999999999999999888887766 Q ss_pred HccCcHH Q lcl|NC_021342. 127 SRDESTT 133 (197) Q Consensus 127 ~~~~~~~ 133 (197) ...+.-. T Consensus 119 ~~a~k~~ 125 (125) T protein:vir:94 119 NKAAKFS 125 (125) T ss_pred HHHhccC Confidence 5422111 No 71 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=97.48 E-value=3e-07 Score=56.36 Aligned_cols=110 Identities=21% Similarity=0.207 Sum_probs=58.5 Q ss_pred Ccccc-cHHHHHHHHHHHHHHHh-------------------cCCeEEEEeecCCCC--CC-----CccchHHHHhhHhH Q lcl|NC_021342. 1 MMKVV-GLQETLAELDKVLGQIR-------------------DDQYVTVGIHEAAGD--VE-----SGEINMATLGAVLN 53 (197) Q Consensus 1 M~ki~-~~~~~~~~L~~l~~~~~-------------------~~~~V~VGi~~~~~~--~~-----~~~~~~A~iA~~~E 53 (197) |+++. |.+++.+.|+++.+++. ...-|.-|-+..+=. .. ....+.+.||.+.| T Consensus 13 Ma~~~~Gld~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~VE 92 (149) T protein:vir:94 13 MAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYAIYVE 92 (149) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEeeCCcEEEEEecCCCcccccc Confidence 99987 88888888877655431 001111222222110 01 11235588999999 Q ss_pred cCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 
54 FGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIE 123 (197) Q Consensus 54 fGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~ 123 (197) ||.-+....... .... ....|.....+.. ..+..+||||||++++++++.++.+.+. T Consensus 93 ~GT~~~~~~~~~------~~~~-~~~~~~~~~~~~~------~~~~g~~a~PFl~pA~~~~~~~i~~~i~ 149 (149) T protein:vir:94 93 YGTGIYATGPGG------SRAT-KIPWSFKGDDGEW------YTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred cCccccccCCCc------cccc-cccceeecCccce------ecCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 996432111000 0000 0000111111111 1245789999999999999998888887 No 72 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=97.48 E-value=4e-07 Score=55.66 Aligned_cols=110 Identities=17% Similarity=0.158 Sum_probs=59.8 Q ss_pred Ccccc-cHHHHHHHHHHHHHHHh-------------------cCCeEEEEeecCCCC-------CCCccchHHHHhhHhH Q lcl|NC_021342. 1 MMKVV-GLQETLAELDKVLGQIR-------------------DDQYVTVGIHEAAGD-------VESGEINMATLGAVLN 53 (197) Q Consensus 1 M~ki~-~~~~~~~~L~~l~~~~~-------------------~~~~V~VGi~~~~~~-------~~~~~~~~A~iA~~~E 53 (197) |+++. |.+++.+.|+++.+++. ...-|.-|-+..+-. ..+...+.+.||.+.| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:93 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVN 80 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCcccccc Confidence 99988 88888888877655431 011122222222110 0111245789999999 Q ss_pred cCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 54 FGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIE 123 (197) Q Consensus 54 fGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~ 123 (197) ||..+....+.... ......+.....+... .+..+|+||||++++++++.++.+.+. 
T Consensus 81 ~GT~~~~~~~~~~~-------~~~~~~~~~~~~~~~~------~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:93 81 YGTGIYATGAGGSR-------AKKIPWSYKDANGKWH------TTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCccc-------ccccccceeccCccee------ecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 99644321110000 0000001011111111 245789999999999999999998888 No 73 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=97.48 E-value=4e-07 Score=55.66 Aligned_cols=110 Identities=17% Similarity=0.158 Sum_probs=59.8 Q ss_pred Ccccc-cHHHHHHHHHHHHHHHh-------------------cCCeEEEEeecCCCC-------CCCccchHHHHhhHhH Q lcl|NC_021342. 1 MMKVV-GLQETLAELDKVLGQIR-------------------DDQYVTVGIHEAAGD-------VESGEINMATLGAVLN 53 (197) Q Consensus 1 M~ki~-~~~~~~~~L~~l~~~~~-------------------~~~~V~VGi~~~~~~-------~~~~~~~~A~iA~~~E 53 (197) |+++. |.+++.+.|+++.+++. ...-|.-|-+..+-. ..+...+.+.||.+.| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:97 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVN 80 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCcccccc Confidence 99988 88888888877655431 011122222222110 0111245789999999 Q ss_pred cCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 54 FGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIE 123 (197) Q Consensus 54 fGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~ 123 (197) ||..+....+.... ......+.....+... .+..+|+||||++++++++.++.+.+. 
T Consensus 81 ~GT~~~~~~~~~~~-------~~~~~~~~~~~~~~~~------~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:97 81 YGTGIYATGAGGSR-------AKKIPWSYKDANGKWH------TTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCccc-------ccccccceeccCccee------ecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 99644321110000 0000001011111111 245789999999999999999998888 No 74 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=97.48 E-value=4e-07 Score=55.66 Aligned_cols=110 Identities=17% Similarity=0.158 Sum_probs=59.8 Q ss_pred Ccccc-cHHHHHHHHHHHHHHHh-------------------cCCeEEEEeecCCCC-------CCCccchHHHHhhHhH Q lcl|NC_021342. 1 MMKVV-GLQETLAELDKVLGQIR-------------------DDQYVTVGIHEAAGD-------VESGEINMATLGAVLN 53 (197) Q Consensus 1 M~ki~-~~~~~~~~L~~l~~~~~-------------------~~~~V~VGi~~~~~~-------~~~~~~~~A~iA~~~E 53 (197) |+++. |.+++.+.|+++.+++. ...-|.-|-+..+-. ..+...+.+.||.+.| T Consensus 1 Ma~~~~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:94 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGVINIGSEYAIYVN 80 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEEEecCCCcccccc Confidence 99988 88888888877655431 011122222222110 0111245789999999 Q ss_pred cCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 54 FGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIE 123 (197) Q Consensus 54 fGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~ 123 (197) ||..+....+.... ......+.....+... .+..+|+||||++++++++.++.+.+. 
T Consensus 81 ~GT~~~~~~~~~~~-------~~~~~~~~~~~~~~~~------~t~g~~a~PFl~pA~~~~~~~~~~~l~ 137 (137) T protein:vir:94 81 YGTGIYATGAGGSR-------AKKIPWSYKDANGKWH------TTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCccc-------ccccccceeccCccee------ecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 99644321110000 0000001011111111 245789999999999999999998888 No 75 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=97.46 E-value=4.8e-07 Score=55.22 Aligned_cols=108 Identities=18% Similarity=0.222 Sum_probs=60.9 Q ss_pred Ccccc-cHHHHHHHHHHHHHHHhc-------------------CCeEEEEeecCCCC--CC-----CccchHHHHhhHhH Q lcl|NC_021342. 1 MMKVV-GLQETLAELDKVLGQIRD-------------------DQYVTVGIHEAAGD--VE-----SGEINMATLGAVLN 53 (197) Q Consensus 1 M~ki~-~~~~~~~~L~~l~~~~~~-------------------~~~V~VGi~~~~~~--~~-----~~~~~~A~iA~~~E 53 (197) |+++. |.+++.+.|+++.+++.. ..-|.-|-+..+-. .. ....+.+.||.+.| T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~~~g~~~~V~~~~~YA~~ve 80 (135) T protein:vir:96 1 MAKVKYGADSIVVDLEKYSKDMEKWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFENGGFTGVVKIGSNYAVYVN 80 (135) T ss_pred CchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEeecCcEEEEEecCCCccchhh Confidence 99886 999988888877665320 11122233322210 01 11235788999999 Q ss_pred cCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 54 FGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIE 123 (197) Q Consensus 54 fGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~ 123 (197) ||.-+..+... ......-|.....+... .+..+|+||||++++++.+.++.+.+. 
T Consensus 81 ~GT~~~~~~~~---------~~~~~~~~~~~~~g~~~------~~~~~~a~pfl~~A~~~~~~~~~~~i~ 135 (135) T protein:vir:96 81 YGTGIYATKGS---------RAHKIPWTYKDPNGKWH------TTYGQMPQPFWEPAIDAGRQTFEQYFS 135 (135) T ss_pred cccccccCCCc---------cccccccccccCCccee------ecCCcCCCcchhHHHHHHHHHHHHhcC Confidence 99643321110 00011111111122211 245799999999999999999888877 No 76 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=97.44 E-value=3.5e-07 Score=55.99 Aligned_cols=109 Identities=20% Similarity=0.221 Sum_probs=59.6 Q ss_pred Ccccc-cHHHHHHHHHHHHHHHh-------------------cCCeEEEEeecCCCC--CC-----CccchHHHHhhHhH Q lcl|NC_021342. 1 MMKVV-GLQETLAELDKVLGQIR-------------------DDQYVTVGIHEAAGD--VE-----SGEINMATLGAVLN 53 (197) Q Consensus 1 M~ki~-~~~~~~~~L~~l~~~~~-------------------~~~~V~VGi~~~~~~--~~-----~~~~~~A~iA~~~E 53 (197) |+++. |.+++.+.|+++.+++. ...-|.-|-+..+-. .. +...+.+.||.+.| T Consensus 13 Ma~v~~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~~~~g~~~~V~~~~~YA~~vE 92 (149) T protein:vir:10 13 MAKVKYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYAIYVE 92 (149) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEecCCcEEEEEecCCCcccccc Confidence 99986 88888888877755431 011112232222110 11 11235688999999 Q ss_pred cCceeeeCCCceeeecccccccccCC-ccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 54 FGAEIDHPGGTSYGYATEEAESRKEV-RFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIE 123 (197) Q Consensus 54 fGa~I~~p~~~~~~~~~~~~~~~~~~-~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~ 123 (197) ||.-+....... ...... .+.+...+.. ..+..+|||||||+++++++.++.+.+. 
T Consensus 93 ~GT~~~~~~~~~--------~~~~~~~~~~~~~~~~~------~~t~g~~a~PFl~pA~~~~k~~i~~~i~ 149 (149) T protein:vir:10 93 YGTGIYATGPGG--------SRATKIPWSFKGDDGEW------YTTYGQAPQPFWNPAIDAGRKTFEQYFS 149 (149) T ss_pred cCccccccCCcc--------cccccccceeeccccce------ecCCCCCCCcchhHHHHHHHHHHHHhhC Confidence 996442211100 000000 0111111111 1246799999999999999999988887 No 77 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=97.44 E-value=1.4e-07 Score=58.11 Aligned_cols=84 Identities=15% Similarity=0.204 Sum_probs=48.7 Q ss_pred cccccHHHHHHHHHHHHHHHh-----------------cCCeEEEEeecCCCC-------CCCccchHHHHhhHhHcCce Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQIR-----------------DDQYVTVGIHEAAGD-------VESGEINMATLGAVLNFGAE 57 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~~-----------------~~~~V~VGi~~~~~~-------~~~~~~~~A~iA~~~EfGa~ 57 (197) |++.|.+++.+.|++....-. ...-|.-|-+.++-. ......+.+.||.+-||| T Consensus 1 i~i~Gld~l~~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~G-- 78 (108) T protein:vir:74 1 MKITGIDALQKKLRKNATLDDVKHVVKSNTASMNKNMQNLAPVDTGNMKRSITSEFTDGGLSGTTGPHTDYAGYVEYG-- 78 (108) T ss_pred CcchhHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhccceeeeecCceEEEeecCCCcccceecc-- Confidence 999999888888876532100 000000011000000 000011233444444444 Q ss_pred eeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 
58 IDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGAS 127 (197) Q Consensus 58 I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~ 127 (197) +...|+|||||++++.++.++.+.+++.++ T Consensus 79 ----------------------------------------T~km~aqpf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 79 ----------------------------------------TRFQSAQPFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred ----------------------------------------ccccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 456899999999999999999999999888 No 78 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=97.44 E-value=1.3e-07 Score=58.29 Aligned_cols=82 Identities=16% Similarity=0.293 Sum_probs=49.9 Q ss_pred cccccHHHHHHHHHHHHHHH-------------hc-------------CCeEEEEeecCCCCCCCccchHHHHhhHhHcC Q lcl|NC_021342. 2 MKVVGLQETLAELDKVLGQI-------------RD-------------DQYVTVGIHEAAGDVESGEINMATLGAVLNFG 55 (197) Q Consensus 2 ~ki~~~~~~~~~L~~l~~~~-------------~~-------------~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfG 55 (197) |++.|.+++.+.|++....- .+ ..++.+-+-.+. -.....+.+.||.+.||| T Consensus 1 i~i~Gld~l~~~l~~~~~~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~~~~--~~~~V~~~~~Ya~~vE~G 78 (108) T protein:vir:98 1 MKITGIDALQKKLRKNATLNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFTDGG--LTGTTIPHTDYAGYVEYG 78 (108) T ss_pred CcchhHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeeecCc--eEEEeecCCCccceeecc Confidence 89999988888887653211 00 001111110000 000112334455555555 Q ss_pred ceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 
56 AEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGAS 127 (197) Q Consensus 56 a~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~ 127 (197) +...|+||||+++++..+.++.+.+++.++ T Consensus 79 ------------------------------------------T~~m~aqPFl~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 79 ------------------------------------------TRFQAAQPFVKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred ------------------------------------------ccccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 457899999999999999999999999888 No 79 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=97.43 E-value=6.5e-07 Score=54.52 Aligned_cols=108 Identities=18% Similarity=0.204 Sum_probs=57.9 Q ss_pred Ccccc-cHHHHHHHHHHHHHHHh-------------------c---------CCeEEEEeecCCCCCCCccchHHHHhhH Q lcl|NC_021342. 1 MMKVV-GLQETLAELDKVLGQIR-------------------D---------DQYVTVGIHEAAGDVESGEINMATLGAV 51 (197) Q Consensus 1 M~ki~-~~~~~~~~L~~l~~~~~-------------------~---------~~~V~VGi~~~~~~~~~~~~~~A~iA~~ 51 (197) |+++. |.+++.+.|+++.+++. . ..++.+-+..+ + -.....+.+.||.+ T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~-~-~~~~V~~~~~Ya~~ 78 (137) T protein:vir:10 1 MAKVKYGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKG-G-LTGVINIGSEYAVY 78 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeCC-c-EEEEEecCCCcccc Confidence 99986 88888888877655421 0 11122211111 0 01112356789999 Q ss_pred hHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 52 LNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIE 123 (197) Q Consensus 52 ~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~ 123 (197) .|||..+..+...... . .....+.....+.. ..+..+|+|||||+++++++.++.+.+. 
T Consensus 79 vE~GT~~~~~~~~~~~------~-~~~~~~~~~~~~~~------~~t~g~~a~PFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:10 79 VNYGTGIYAVGPGGSR------A-KNIPWCYKDADGHW------HTTKGQHAQPFWEPAIDEGRAFFNKYFS 137 (137) T ss_pred cccCccccccCCCccc------c-ccccceeeccccce------eccCCCCCCcchhHHHHHHHHHHHHhcC Confidence 9999644321110000 0 00000000001110 1245789999999999999999988887 No 80 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=97.41 E-value=1.3e-06 Score=52.80 Aligned_cols=86 Identities=7% Similarity=0.062 Sum_probs=62.4 Q ss_pred HHHHHHHHHHHHHHHHHc--cCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCCcHHHHHhcC--CCCchhHHHHHH Q lcl|NC_021342. 111 VQSKSNEYVTIIERGASR--DESTTSILEKVGVTAQAAVRMFMTEL------QDPPNAKSTIRKKG--SSNPLIDTGALR 180 (197) Q Consensus 111 ~~~~~~~~~~~~~~~~~~--~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppna~~Ti~~KG--~~~PLidTG~L~ 180 (197) +++ -.++...+...+.. ..+-..+|..||..+....++.|.+. .|+|+++.|+++|. ..++|+++|.|. T Consensus 1 ~~~-~~~l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~g~~~~~l~~~~~l~ 79 (150) T protein:vir:20 1 MNE-FKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSVRKKTGRVKRKMFAKLITS 79 (150) T ss_pred Cch-HHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHHHhccCCCccccchhhhh Confidence 222 23333444444443 12346689999999999999999985 58999999998764 468999999999 Q ss_pred hhceeeeeccccccc---cC Q lcl|NC_021342. 
181 QSVTYVVHSGKLPDE---GL 197 (197) Q Consensus 181 ~SIty~V~~k~~~~~---~~ 197 (197) .||+|++....+..+ |- T Consensus 80 ~sl~~~~~~~~~~vg~~~Gs 99 (150) T protein:vir:20 80 RFLHIRASPEQASMEFYGGK 99 (150) T ss_pred hhhheeecCcEEEEEeeCCc Confidence 999999876665542 33 No 81 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=97.40 E-value=2.9e-07 Score=56.43 Aligned_cols=79 Identities=9% Similarity=0.028 Sum_probs=42.7 Q ss_pred CcccccHHHHHHHHHHHH----HHHh----------------------------cCCeEEEEeecCCCCCCCccchHHHH Q lcl|NC_021342. 1 MMKVVGLQETLAELDKVL----GQIR----------------------------DDQYVTVGIHEAAGDVESGEINMATL 48 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~----~~~~----------------------------~~~~V~VGi~~~~~~~~~~~~~~A~i 48 (197) |.+.. .+...+.|.+-. +.+. +...+.||+.. ..+.| T Consensus 15 l~~~~-~~~~~~al~~ga~~~~~~~k~~ap~~~~~~~~hl~d~I~~~~~k~~~~g~~~~~VG~~k----------~~~~y 83 (125) T protein:vir:97 15 LEVKA-PKTAKAAVTEVAKEFEKALKANTPVYEVETDERLQEDTVISGFKGANVGIVSKEIGYGK----------ATGWR 83 (125) T ss_pred hhHHH-HHHHHHHHHHHHHHHHHHHHHhCCcCCCCchhhHHhhhhcccccccccCceEEEEeecC----------CCcee Confidence 11100 000111111111 1111 11123444321 12345 Q ss_pred hhHhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 49 GAVLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 49 A~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~ 128 (197) +.+.||| +..+||+|||++++++.++++.+.+.+.+.. 
T Consensus 84 ~~f~E~G------------------------------------------T~k~~~~pF~~pa~~~~k~~~~~~~~~~~~~ 121 (125) T protein:vir:97 84 AHYPNDG------------------------------------------TIYQRGQDFKERTINQMTPKAKQLYAEKVKE 121 (125) T ss_pred EeeeccC------------------------------------------ccCCCcCccchHhHHHhHHHHHHHHHHHHHH Confidence 5555555 7889999999999999999999999988877 Q ss_pred cCcH Q lcl|NC_021342. 129 DEST 132 (197) Q Consensus 129 ~~~~ 132 (197) .+.. T Consensus 122 ~L~l 125 (125) T protein:vir:97 122 GLGL 125 (125) T ss_pred HhcC Confidence 5555 No 82 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=97.39 E-value=3.8e-07 Score=55.81 Aligned_cols=109 Identities=18% Similarity=0.217 Sum_probs=59.1 Q ss_pred Ccccc-cHHHHHHHHHHHHHHHhc-------------------CCeEEEEeecCCC-------CCCCccchHHHHhhHhH Q lcl|NC_021342. 1 MMKVV-GLQETLAELDKVLGQIRD-------------------DQYVTVGIHEAAG-------DVESGEINMATLGAVLN 53 (197) Q Consensus 1 M~ki~-~~~~~~~~L~~l~~~~~~-------------------~~~V~VGi~~~~~-------~~~~~~~~~A~iA~~~E 53 (197) |+++. |.+++.+.|+++.+++.. ..-|.-|-+..+- .......+.+.||.+.| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~pvdTG~L~~Si~~~~~~~g~~~~V~~~~~YA~yvE 80 (137) T protein:vir:96 1 MAKVKYGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALAPVDLGFLKESIDFKVTDGGFSSVISVGAEYAIYVE 80 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcCccchhcCceeEeecCceEEEEecCCCcccccc Confidence 99998 888888888775554310 0011112221111 01112235688999999 Q ss_pred cCceeeeCCCceeeecccccccccCCc-cccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 54 FGAEIDHPGGTSYGYATEEAESRKEVR-FLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIE 123 (197) Q Consensus 54 fGa~I~~p~~~~~~~~~~~~~~~~~~~-f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~ 123 (197) ||..+..+.... ......+ +.....+.. ..+..+|+||||++++++++..+.+.+. 
T Consensus 81 ~GT~~~~~~~~~--------~~~~~~~~~~~~~~~~~------~~t~g~~a~pFl~pA~~~~~~~i~k~i~ 137 (137) T protein:vir:96 81 FGTGIYATGPGG--------SRARKLPWTYKGDDGEW------HTTYGQQAQPFWNPAIDEGRKVFNRYFS 137 (137) T ss_pred cCccccccCCCc--------cccccccceeeccCcce------eecCCCCCCcchhHHHHHHHHHHHHhhC Confidence 996433211110 0000000 001111111 1245799999999999999999988887 No 83 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=97.35 E-value=7.1e-07 Score=54.29 Aligned_cols=110 Identities=17% Similarity=0.158 Sum_probs=59.4 Q ss_pred Ccccc-cHHHHHHHHHHHHHHHh-------------------cCCeEEEEeecCCCC-------CCCccchHHHHhhHhH Q lcl|NC_021342. 1 MMKVV-GLQETLAELDKVLGQIR-------------------DDQYVTVGIHEAAGD-------VESGEINMATLGAVLN 53 (197) Q Consensus 1 M~ki~-~~~~~~~~L~~l~~~~~-------------------~~~~V~VGi~~~~~~-------~~~~~~~~A~iA~~~E 53 (197) |+++. |.+++.+.|+++.+++. ...-|.-|-+..+-. ..+...+.+.||.+.| T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~~~~~~~~V~~~~~YA~~vE 80 (137) T protein:vir:95 1 MAKVKYGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVINIGSEYAIYVN 80 (137) T ss_pred CchhHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEeeCCceEEEEecCCCcccccc Confidence 99977 88888888876655431 011122222222110 0111245688999999 Q ss_pred cCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 54 FGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIE 123 (197) Q Consensus 54 fGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~ 123 (197) ||.-+..+.+.... ......+.....+... .+..+|+||||++++++++.++.+.+. 
T Consensus 81 ~GT~~~~~~~~~~~-------~~~~~~~~~~~~~~~~------~t~g~~a~PFl~pA~~~~~~~i~k~l~ 137 (137) T protein:vir:95 81 YGTGIYATGAGGSR-------AKKIPWSYKDANGKWH------TTKGQHAQPFWEPAIDAGRAFFNKYFS 137 (137) T ss_pred cCccccccCCCccc-------ccccccceeccCccee------ecCCCCCCcchHHHHHHHHHHHHHhhC Confidence 99644322111000 0000001111111111 245789999999999999999998887 No 84 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=97.28 E-value=1.2e-06 Score=53.11 Aligned_cols=115 Identities=8% Similarity=0.060 Sum_probs=56.7 Q ss_pred Ccccc--cHHHHHHHHHHHHHHHhcC---CeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCcee----eecc- Q lcl|NC_021342. 1 MMKVV--GLQETLAELDKVLGQIRDD---QYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSY----GYAT- 70 (197) Q Consensus 1 M~ki~--~~~~~~~~L~~l~~~~~~~---~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~----~~~~- 70 (197) ||.+. |.+++.+.|.+|..++... ..+.-|- . .++.-|.- +.|-.+.. +... T Consensus 1 M~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a-----~------~i~~~ak~-------~ap~~~~~~~~~~~~~I 62 (133) T protein:vir:10 1 MIRMEVKGLDELERQLTALGEKVATKVLRDAGREAL-----K------VVEEDMKQ-------HAGFDETSTGQHMRDSI 62 (133) T ss_pred CeeEeeehHHHHHHHHHHhHHHHHHHHHHHHHHHHH-----H------HHHHHHHH-------hCCCCCCcchhhhhhcc Confidence 87765 8899999998886554210 0011000 0 00000000 01111000 0000 Q ss_pred -----ccccccc---CCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHH Q lcl|NC_021342. 71 -----EEAESRK---EVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTT 133 (197) Q Consensus 71 -----~~~~~~~---~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~ 133 (197) ....... .++.-..+..+..+++.--.|.+.|||||||++++++++++.+.+.+.+...++-. 
T Consensus 63 ~v~~~~~~~~~~~~~~v~vg~~~~~~~y~~f~E~GT~k~~a~PF~~pA~~~~~~~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 63 KIRSSTRKAQGNAVVTLRVGPSKQHHMKVLAQEFGTVKQVADPFIRPALDYNVQTVLRVLTVEIRNGIQNR 133 (133) T ss_pred cccccccccCccceEEEEecCCCCccceEeeeccCCCCCCCCccchHHHHHhHHHHHHHHHHHHHHHhhcC Confidence 0000000 01111122223345555557889999999999999999999888877666554444 No 85 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=97.27 E-value=2.6e-06 Score=51.20 Aligned_cols=86 Identities=7% Similarity=0.052 Sum_probs=62.8 Q ss_pred HHHHHHHHHHHHHHHHHc--cCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCCcHHHHHhcCC--CCchhHHHHHH Q lcl|NC_021342. 111 VQSKSNEYVTIIERGASR--DESTTSILEKVGVTAQAAVRMFMTEL------QDPPNAKSTIRKKGS--SNPLIDTGALR 180 (197) Q Consensus 111 ~~~~~~~~~~~~~~~~~~--~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppna~~Ti~~KG~--~~PLidTG~L~ 180 (197) +++ -.++...+..++.. ..+...+|..||..+....++.|.+. .|+|+++.|+++|+. .++|+++|.|. T Consensus 1 ~~~-~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~PdG~~W~p~~~~~~~~k~~~~~~~l~~~~~l~ 79 (150) T protein:vir:60 1 MNE-FKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITS 79 (150) T ss_pred Cch-HHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhhcCCCccchhhhhhc Confidence 222 22233333333333 22346689999999999999999975 689999999998754 58999999999 Q ss_pred hhceeeeeccccccc---cC Q lcl|NC_021342. 181 QSVTYVVHSGKLPDE---GL 197 (197) Q Consensus 181 ~SIty~V~~k~~~~~---~~ 197 (197) .||++++....+... 
|- T Consensus 80 ~sl~~~~~~~~a~vg~~~Gt 99 (150) T protein:vir:60 80 RFLHIRASPEQASMEFYGGK 99 (150) T ss_pred ceeeeeeeCcEEEEEeeCCC Confidence 999999987766553 33 No 86 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=97.23 E-value=3e-06 Score=50.84 Aligned_cols=86 Identities=7% Similarity=0.050 Sum_probs=63.2 Q ss_pred HHHHHHHHHHHHHHHHHc--cCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCCcHHHHHhcCC--CCchhHHHHHH Q lcl|NC_021342. 111 VQSKSNEYVTIIERGASR--DESTTSILEKVGVTAQAAVRMFMTEL------QDPPNAKSTIRKKGS--SNPLIDTGALR 180 (197) Q Consensus 111 ~~~~~~~~~~~~~~~~~~--~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppna~~Ti~~KG~--~~PLidTG~L~ 180 (197) +++. +++...+...+.. ..+...+|..||..+....++.|.+. .|+|+++.|+++|+. .++|+++|.|. T Consensus 1 m~~~-~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~PdG~~W~p~k~~~~~~k~~~~~~~l~~~~~l~ 79 (150) T protein:vir:57 1 MNEF-KRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPDGTPYAPRQQQSARKKTGRVKRKMFAKLITS 79 (150) T ss_pred CchH-HHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccChHHHHHhccCCCcccchhhhhc Confidence 2222 3333334444433 23346699999999999999999975 699999999987753 58999999999 Q ss_pred hhceeeeeccccccc---cC Q lcl|NC_021342. 
181 QSVTYVVHSGKLPDE---GL 197 (197) Q Consensus 181 ~SIty~V~~k~~~~~---~~ 197 (197) .||+|++....+..+ |- T Consensus 80 ~sl~~~~~~~~a~vg~~~G~ 99 (150) T protein:vir:57 80 RFLHIRASPEQASMEFYGGK 99 (150) T ss_pred cceeeeeeCcEEEEEeecCC Confidence 999999987766543 33 No 87 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=97.12 E-value=3e-06 Score=50.84 Aligned_cols=110 Identities=14% Similarity=0.200 Sum_probs=53.2 Q ss_pred Ccccc------cHHHHHHHHHHHHHH---------------Hh----c---------CCeEEEEeecCCCCCCCccchHH Q lcl|NC_021342. 1 MMKVV------GLQETLAELDKVLGQ---------------IR----D---------DQYVTVGIHEAAGDVESGEINMA 46 (197) Q Consensus 1 M~ki~------~~~~~~~~L~~l~~~---------------~~----~---------~~~V~VGi~~~~~~~~~~~~~~A 46 (197) |++++ |.+++.+.|+.+.++ +. . ..++.+-+..+ + ......+.+ T Consensus 1 m~~ms~~i~~~g~~~l~~~l~~~~~~~~~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~~~-g-~~~~V~~~~ 78 (144) T protein:vir:59 1 MALMSVRIDPSWRRIMSRNVRTFSGHVLTQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYKNN-G-LTAEITVGA 78 (144) T ss_pred CCcceeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEeecC-c-EEEEEecCC Confidence 44433 334443333332221 11 0 11222222111 1 111224568 Q ss_pred HHhhHhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 47 TLGAVLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGA 126 (197) Q Consensus 47 ~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~ 126 (197) .||.+.|||..+....+.. ...+..+.....+... 
.+..+|+||||+++++.+++.+.+.+++++ T Consensus 79 ~YA~~vE~GT~~~~~~~~~---------~~~~~~~~~~~~g~~~------~t~g~~a~Pfl~pA~~~~~~~~~~~i~~~~ 143 (144) T protein:vir:59 79 EYAIYVEYGTGIYAVDGNG---------RKTPWTYYSPKLGRYV------RTQGAPAQPFFWPAVEEGGEYFEREMRRLR 143 (144) T ss_pred CccchhhcCccccccCCCc---------ccccccccccccccee------cCCCCCCCcchhHHHHHHHHHHHHHHHHhc Confidence 9999999996442211100 0000001111111111 245799999999999999999999888876 Q ss_pred H Q lcl|NC_021342. 127 S 127 (197) Q Consensus 127 ~ 127 (197) - T Consensus 144 g 144 (144) T protein:vir:59 144 G 144 (144) T ss_pred C Confidence 5 No 88 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=97.11 E-value=7.3e-07 Score=54.23 Aligned_cols=79 Identities=13% Similarity=0.060 Sum_probs=44.3 Q ss_pred Ccccc------cHHHHHHHHHHHH--------------HH-----H---hcCCeEEEEeecCCCCCCCccchHHHHhhHh Q lcl|NC_021342. 1 MMKVV------GLQETLAELDKVL--------------GQ-----I---RDDQYVTVGIHEAAGDVESGEINMATLGAVL 52 (197) Q Consensus 1 M~ki~------~~~~~~~~L~~l~--------------~~-----~---~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~ 52 (197) +-++. |.+.+.+.+++.. +. . .....+.||+.. ..+.|+.+. T Consensus 22 ~~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~~~h~~d~I~~~~~k~~~g~~~~~VG~~k----------~~~~y~~f~ 91 (128) T protein:vir:38 22 VAKEARAAVRDGAQKFADKLKSNTPEWDGETDMSGHLRDDIKLSSVRETSGLTEVDVGYGK----------DTGWRAHFP 91 (128) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCcCCCCcccchhhhhhccccccccCceeEEEeeecC----------CCceEEeee Confidence 11100 1111111111111 00 0 012335666521 124577777 Q ss_pred HcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCc Q lcl|NC_021342. 
53 NFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDES 131 (197) Q Consensus 53 EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~ 131 (197) ||| |.++||+||||++++++++++.+.+.+.+..++= T Consensus 92 E~G------------------------------------------T~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 92 NSG------------------------------------------TSMQDPQHFIEETQEIMRPVVIAAFLSHLKEGGM 128 (128) T ss_pred ccC------------------------------------------ccCCCCCcchhHHHHHhHHHHHHHHHHHHHhhcC Confidence 777 6789999999999999999999988877765443 No 89 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=96.93 E-value=2.7e-06 Score=51.15 Aligned_cols=111 Identities=11% Similarity=0.100 Sum_probs=41.3 Q ss_pred CcccccHHHH-HHHHHH----HHHHHhcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeeccccccc Q lcl|NC_021342. 1 MMKVVGLQET-LAELDK----VLGQIRDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAES 75 (197) Q Consensus 1 M~ki~~~~~~-~~~L~~----l~~~~~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~ 75 (197) |.+-.. +++ ...|.+ +.+.+... + |.+.....+.--....+. ..+...+..+... T Consensus 20 L~~~~~-~k~~~~Al~~~a~~v~~~~k~~----a--p~~~~~~~g~l~~~I~i~-------~~k~~~~~~~v~v------ 79 (135) T protein:vir:57 20 VGEEVG-TKILRDAGRAAMAVVEADMKQN----A--GYDNSSTNAHMRDSIKIR-------SSRGKAGSTVVVL------ 79 (135) T ss_pred hHHHHH-HHHHHHHHHHHHHHHHHHHHHh----C--CCCCCCchhhHHhhcccc-------cccccccceeEEE------ Confidence 211111 111 122322 22222111 1 111100000000000000 0000000000000 Q ss_pred ccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHH Q lcl|NC_021342. 
76 RKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGV 141 (197) Q Consensus 76 ~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~ 141 (197) ..-+.+..+...++.--.|.+.||||||+++++++++++.+.+.+.+.. .|++++. T Consensus 80 ----~vg~~~~~~~~~~f~E~GT~~~~a~PF~~pa~~~~~~~~~~~~~~~~~~------~l~ka~r 135 (135) T protein:vir:57 80 ----RVGPTRSHYMKALAQEFGTIKQVAKPFIRPALDYNKMQVLRILTVEIRD------GLSTLSR 135 (135) T ss_pred ----EecCCCCcceeEeecccCCCCCCCCcchhHhHHHhHHHHHHHHHHHHHH------HHHHhcC Confidence 0000111122223233347889999999999999999998888777655 3444444 No 90 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=96.92 E-value=2.1e-06 Score=51.74 Aligned_cols=85 Identities=15% Similarity=0.247 Sum_probs=45.6 Q ss_pred CcccccHHHHHHHHHHHHHH-------------H----hcCCeEEEEeecCCCC---C----CCccchHHHHhhHhHcCc Q lcl|NC_021342. 1 MMKVVGLQETLAELDKVLGQ-------------I----RDDQYVTVGIHEAAGD---V----ESGEINMATLGAVLNFGA 56 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~-------------~----~~~~~V~VGi~~~~~~---~----~~~~~~~A~iA~~~EfGa 56 (197) =+++.|.+++.+.|+++... + ....-|.-|-+.++-. . .....+.+.||.+.||| T Consensus 4 ~i~i~Gld~l~~~L~~~~~~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~~~~~~~~~V~~~~~Ya~~vE~G- 82 (112) T protein:vir:36 4 SLSFKGIDQLVKHLDKAASLKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMELTEGGFSGQAGPHTDYSAYVEYG- 82 (112) T ss_pred eeeehhHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeeecCCceEEEeecCCCccceeecc- Confidence 13344666666665543221 0 0000011111110000 0 00012334555555555 Q ss_pred eeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 
57 EIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGAS 127 (197) Q Consensus 57 ~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~ 127 (197) +...|+|||||++++.++.++.+.++++++ T Consensus 83 -----------------------------------------T~k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 83 -----------------------------------------TRFQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred -----------------------------------------ccccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 457899999999999999999999999888 No 91 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=96.92 E-value=1.9e-06 Score=52.01 Aligned_cols=106 Identities=19% Similarity=0.293 Sum_probs=40.0 Q ss_pred CcccccHHHHHHHHHHH----------------HHHHh-----c----CCeE----------------EEEeecCCCCCC Q lcl|NC_021342. 1 MMKVVGLQETLAELDKV----------------LGQIR-----D----DQYV----------------TVGIHEAAGDVE 39 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l----------------~~~~~-----~----~~~V----------------~VGi~~~~~~~~ 39 (197) =+.+++.....+.|.+. .+... . .+.| .||+-.. T Consensus 7 ~~d~s~l~~~l~~l~~~~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s~~g~~~~~Vg~~~~----- 81 (157) T protein:vir:97 7 SVDITGILAGLETVVEHSSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEESVEGIQTYAVSWRKK----- 81 (157) T ss_pred cccHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccCCCceEEEEEeecCC----- Confidence 00111000000000000 00000 0 1112 1333211 Q ss_pred CccchHHHHhhHhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHH Q lcl|NC_021342. 40 SGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYV 119 (197) Q Consensus 40 ~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~ 119 (197) -+-++.+.|||.....- ........|+.... ..+ -++.+|||||||++|+..+++.. 
T Consensus 82 -----~a~~g~~vEfG~~~~~~-----------~~~~~~~~~~~~~~------~~~-t~~~~Pa~PFlRPA~d~~k~~a~ 138 (157) T protein:vir:97 82 -----AAPHGHLLEFGHWQTHA-----------AYRDKDGQWYSSKV------KLV-NPKWIPAKPFLRPGYDSVAMQIP 138 (157) T ss_pred -----ccceeeeeecCcccccc-----------cccCCccccccccc------ccC-CCCcCCCCcccchHHHHhHHHHH Confidence 24567788999422100 00000111111110 011 14669999999999999998888 Q ss_pred HHHHHHHHc----cCcHHH Q lcl|NC_021342. 120 TIIERGASR----DESTTS 134 (197) Q Consensus 120 ~~~~~~~~~----~~~~~~ 134 (197) +.+.+.+.. -+..++ T Consensus 139 ~~~~~~l~k~I~e~l~g~~ 157 (157) T protein:vir:97 139 DIARAAGAKKYAELQRGDT 157 (157) T ss_pred HHHHHHHHHHHHHHhcCCC Confidence 775443221 111111 No 92 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=96.67 E-value=2.2e-06 Score=51.61 Aligned_cols=79 Identities=11% Similarity=0.148 Sum_probs=37.8 Q ss_pred Ccccc--cHHHH-HHHHHHHHHHHhc---------------------------CCeEEEEeecCCCCCCCccchHHHHhh Q lcl|NC_021342. 1 MMKVV--GLQET-LAELDKVLGQIRD---------------------------DQYVTVGIHEAAGDVESGEINMATLGA 50 (197) Q Consensus 1 M~ki~--~~~~~-~~~L~~l~~~~~~---------------------------~~~V~VGi~~~~~~~~~~~~~~A~iA~ 50 (197) |.... ..++. .+.-+-+.+.+.. ...|.||+.. + .+.+|. T Consensus 17 l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k------~----~~~~a~ 86 (125) T protein:vir:79 17 AVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAK------G----VSHRIH 86 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCC------C----CceEEE Confidence 11111 00111 1111111111110 0112222210 0 123444 Q ss_pred HhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_021342. 
51 VLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDE 130 (197) Q Consensus 51 ~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~ 130 (197) ..||| |+++||+||+|++++++++++.+.+.+.+..-. T Consensus 87 F~E~G------------------------------------------T~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~ 124 (125) T protein:vir:79 87 ATEFG------------------------------------------TMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQ 124 (125) T ss_pred eccCC------------------------------------------ccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHh Confidence 44444 889999999999999999999988877665422 Q ss_pred c Q lcl|NC_021342. 131 S 131 (197) Q Consensus 131 ~ 131 (197) . T Consensus 125 k 125 (125) T protein:vir:79 125 K 125 (125) T ss_pred C Confidence 2 No 93 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=96.67 E-value=2.2e-06 Score=51.61 Aligned_cols=79 Identities=11% Similarity=0.148 Sum_probs=37.8 Q ss_pred Ccccc--cHHHH-HHHHHHHHHHHhc---------------------------CCeEEEEeecCCCCCCCccchHHHHhh Q lcl|NC_021342. 1 MMKVV--GLQET-LAELDKVLGQIRD---------------------------DQYVTVGIHEAAGDVESGEINMATLGA 50 (197) Q Consensus 1 M~ki~--~~~~~-~~~L~~l~~~~~~---------------------------~~~V~VGi~~~~~~~~~~~~~~A~iA~ 50 (197) |.... ..++. .+.-+-+.+.+.. ...|.||+.. + .+.+|. T Consensus 17 l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k------~----~~~~a~ 86 (125) T protein:vir:81 17 AVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAK------G----VSHRIH 86 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCC------C----CceEEE Confidence 11111 00111 1111111111110 0112222210 0 123444 Q ss_pred HhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_021342. 
51 VLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDE 130 (197) Q Consensus 51 ~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~ 130 (197) ..||| |+++||+||+|++++++++++.+.+.+.+..-. T Consensus 87 F~E~G------------------------------------------T~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~ 124 (125) T protein:vir:81 87 ATEFG------------------------------------------TMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQ 124 (125) T ss_pred eccCC------------------------------------------ccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHh Confidence 44444 889999999999999999999988877665422 Q ss_pred c Q lcl|NC_021342. 131 S 131 (197) Q Consensus 131 ~ 131 (197) . T Consensus 125 k 125 (125) T protein:vir:81 125 K 125 (125) T ss_pred C Confidence 2 No 94 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=96.67 E-value=2.2e-06 Score=51.61 Aligned_cols=79 Identities=11% Similarity=0.148 Sum_probs=37.8 Q ss_pred Ccccc--cHHHH-HHHHHHHHHHHhc---------------------------CCeEEEEeecCCCCCCCccchHHHHhh Q lcl|NC_021342. 1 MMKVV--GLQET-LAELDKVLGQIRD---------------------------DQYVTVGIHEAAGDVESGEINMATLGA 50 (197) Q Consensus 1 M~ki~--~~~~~-~~~L~~l~~~~~~---------------------------~~~V~VGi~~~~~~~~~~~~~~A~iA~ 50 (197) |.... ..++. .+.-+-+.+.+.. ...|.||+.. + .+.+|. T Consensus 17 l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k------~----~~~~a~ 86 (125) T protein:vir:47 17 AVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAK------G----VSHRIH 86 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCC------C----CceEEE Confidence 11111 00111 1111111111110 0112222210 0 123444 Q ss_pred HhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_021342. 
51 VLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDE 130 (197) Q Consensus 51 ~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~ 130 (197) ..||| |+++||+||+|++++++++++.+.+.+.+..-. T Consensus 87 F~E~G------------------------------------------T~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~ 124 (125) T protein:vir:47 87 ATEFG------------------------------------------TMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQ 124 (125) T ss_pred eccCC------------------------------------------ccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHh Confidence 44444 889999999999999999999988877665422 Q ss_pred c Q lcl|NC_021342. 131 S 131 (197) Q Consensus 131 ~ 131 (197) . T Consensus 125 k 125 (125) T protein:vir:47 125 K 125 (125) T ss_pred C Confidence 2 No 95 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=96.67 E-value=2.2e-06 Score=51.61 Aligned_cols=79 Identities=11% Similarity=0.148 Sum_probs=37.8 Q ss_pred Ccccc--cHHHH-HHHHHHHHHHHhc---------------------------CCeEEEEeecCCCCCCCccchHHHHhh Q lcl|NC_021342. 1 MMKVV--GLQET-LAELDKVLGQIRD---------------------------DQYVTVGIHEAAGDVESGEINMATLGA 50 (197) Q Consensus 1 M~ki~--~~~~~-~~~L~~l~~~~~~---------------------------~~~V~VGi~~~~~~~~~~~~~~A~iA~ 50 (197) |.... ..++. .+.-+-+.+.+.. ...|.||+.. + .+.+|. T Consensus 17 l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k------~----~~~~a~ 86 (125) T protein:vir:98 17 AVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAK------G----VSHRIH 86 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCC------C----CceEEE Confidence 11111 00111 1111111111110 0112222210 0 123444 Q ss_pred HhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_021342. 
51 VLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDE 130 (197) Q Consensus 51 ~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~ 130 (197) ..||| |+++||+||+|++++++++++.+.+.+.+..-. T Consensus 87 F~E~G------------------------------------------T~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~ 124 (125) T protein:vir:98 87 ATEFG------------------------------------------TMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQ 124 (125) T ss_pred eccCC------------------------------------------ccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHh Confidence 44444 889999999999999999999988877665422 Q ss_pred c Q lcl|NC_021342. 131 S 131 (197) Q Consensus 131 ~ 131 (197) . T Consensus 125 k 125 (125) T protein:vir:98 125 K 125 (125) T ss_pred C Confidence 2 No 96 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=96.67 E-value=2.2e-06 Score=51.61 Aligned_cols=79 Identities=11% Similarity=0.148 Sum_probs=37.8 Q ss_pred Ccccc--cHHHH-HHHHHHHHHHHhc---------------------------CCeEEEEeecCCCCCCCccchHHHHhh Q lcl|NC_021342. 1 MMKVV--GLQET-LAELDKVLGQIRD---------------------------DQYVTVGIHEAAGDVESGEINMATLGA 50 (197) Q Consensus 1 M~ki~--~~~~~-~~~L~~l~~~~~~---------------------------~~~V~VGi~~~~~~~~~~~~~~A~iA~ 50 (197) |.... ..++. .+.-+-+.+.+.. ...|.||+.. + .+.+|. T Consensus 17 l~~~~~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~~~g~~~v~VG~~k------~----~~~~a~ 86 (125) T protein:vir:94 17 AVLKMNLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDRHTSEKIVTIGYAK------G----VSHRIH 86 (125) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeecccccccccceEEEEeccCC------C----CceEEE Confidence 11111 00111 1111111111110 0112222210 0 123444 Q ss_pred HhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_021342. 
51 VLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDE 130 (197) Q Consensus 51 ~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~ 130 (197) ..||| |+++||+||+|++++++++++.+.+.+.+..-. T Consensus 87 F~E~G------------------------------------------T~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~ 124 (125) T protein:vir:94 87 ATEFG------------------------------------------TMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQ 124 (125) T ss_pred eccCC------------------------------------------ccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHh Confidence 44444 889999999999999999999988877665422 Q ss_pred c Q lcl|NC_021342. 131 S 131 (197) Q Consensus 131 ~ 131 (197) . T Consensus 125 k 125 (125) T protein:vir:94 125 K 125 (125) T ss_pred C Confidence 2 No 97 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=96.64 E-value=3e-06 Score=50.85 Aligned_cols=105 Identities=21% Similarity=0.176 Sum_probs=46.5 Q ss_pred Ccccc----cHHHHH------------HHHHHHHHHHh----cCCeEEEEeecCCC-----------CCCCccchHHHHh Q lcl|NC_021342. 1 MMKVV----GLQETL------------AELDKVLGQIR----DDQYVTVGIHEAAG-----------DVESGEINMATLG 49 (197) Q Consensus 1 M~ki~----~~~~~~------------~~L~~l~~~~~----~~~~V~VGi~~~~~-----------~~~~~~~~~A~iA 49 (197) ||.++ |.++.. +.|++...++. ...-|.-|-+..+- +-.....++|.|| T Consensus 1 m~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA 80 (142) T protein:vir:99 1 MVQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYA 80 (142) T ss_pred CceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCcccc Confidence 76665 332221 12222222111 11112222221110 0011123678899 Q ss_pred hHhHcCc---eeeeCCCceeeecccccccccCCccccccccccccccccccccC---CCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 
50 AVLNFGA---EIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKIN---IPARPWLEPGVQSKSNEYVTIIE 123 (197) Q Consensus 50 ~~~EfGa---~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~---IP~RpFlr~t~~~~~~~~~~~~~ 123 (197) .++|||. .|+ |.+...+. |...+.. ++ ..+++ +||||||+++++.+.++-.+... T Consensus 81 ~~ve~GT~ph~i~-pk~~~al~------------f~~~g~~----~~--~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~ 141 (142) T protein:vir:99 81 AAVHEGTRPHVIR-AKHAQALH------------FWWRGRE----VF--VRQVNHPGTRARPYLRNAGEAVVRRDRRIRV 141 (142) T ss_pred ceeccCCccceec-cccCceee------------EecCCce----ee--eeeeecCCCCCCchhHHHHHHHHhhhhhhcc Confidence 9999997 243 22222221 1111111 11 12344 45999999999998876554433 Q ss_pred H Q lcl|NC_021342. 124 R 124 (197) Q Consensus 124 ~ 124 (197) + T Consensus 142 r 142 (142) T protein:vir:99 142 R 142 (142) T ss_pred C Confidence 3 No 98 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=96.64 E-value=3e-06 Score=50.85 Aligned_cols=105 Identities=21% Similarity=0.176 Sum_probs=46.5 Q ss_pred Ccccc----cHHHHH------------HHHHHHHHHHh----cCCeEEEEeecCCC-----------CCCCccchHHHHh Q lcl|NC_021342. 1 MMKVV----GLQETL------------AELDKVLGQIR----DDQYVTVGIHEAAG-----------DVESGEINMATLG 49 (197) Q Consensus 1 M~ki~----~~~~~~------------~~L~~l~~~~~----~~~~V~VGi~~~~~-----------~~~~~~~~~A~iA 49 (197) ||.++ |.++.. +.|++...++. ...-|.-|-+..+- +-.....++|.|| T Consensus 1 m~~~~~~~~gl~~~l~~~~~~~~~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~v~~~a~YA 80 (142) T protein:vir:86 1 MVQVSVRYEGFDYNPVGAAAQVGPILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSGGVTAHAKYA 80 (142) T ss_pred CceeEEEeeecchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEEEeccCcccc Confidence 76665 332221 12222222111 11112222221110 0011123678899 Q ss_pred hHhHcCc---eeeeCCCceeeecccccccccCCccccccccccccccccccccC---CCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 
50 AVLNFGA---EIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKIN---IPARPWLEPGVQSKSNEYVTIIE 123 (197) Q Consensus 50 ~~~EfGa---~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~---IP~RpFlr~t~~~~~~~~~~~~~ 123 (197) .++|||. .|+ |.+...+. |...+.. ++ ..+++ +||||||+++++.+.++-.+... T Consensus 81 ~~ve~GT~ph~i~-pk~~~al~------------f~~~g~~----~~--~k~v~hpG~~a~Pfl~~A~~~~~~~~~~~~~ 141 (142) T protein:vir:86 81 AAVHEGTRPHVIR-AKHAQALH------------FWWRGRE----VF--VRQVNHPGTRARPYLRNAGEAVVRRDRRIRV 141 (142) T ss_pred ceeccCCccceec-cccCceee------------EecCCce----ee--eeeeecCCCCCCchhHHHHHHHHhhhhhhcc Confidence 9999997 243 22222221 1111111 11 12344 45999999999998876554433 Q ss_pred H Q lcl|NC_021342. 124 R 124 (197) Q Consensus 124 ~ 124 (197) + T Consensus 142 r 142 (142) T protein:vir:86 142 R 142 (142) T ss_pred C Confidence 3 No 99 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=96.39 E-value=1.2e-05 Score=47.61 Aligned_cols=114 Identities=12% Similarity=0.059 Sum_probs=53.8 Q ss_pred Ccccc--cHHHHHHHHHHHHH---------HHhcCCeEEEEeecCCCCC---CCc----cchHHHHhhHhHcCceeeeCC Q lcl|NC_021342. 1 MMKVV--GLQETLAELDKVLG---------QIRDDQYVTVGIHEAAGDV---ESG----EINMATLGAVLNFGAEIDHPG 62 (197) Q Consensus 1 M~ki~--~~~~~~~~L~~l~~---------~~~~~~~V~VGi~~~~~~~---~~~----~~~~A~iA~~~EfGa~I~~p~ 62 (197) |.++. -.+++.+.++.+.. +.....-|.-|-+..+-.. .++ ..+.+.||.+.|||.-|.... T Consensus 10 ~~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~~g~~~~V~~~~~YA~yVE~GTG~~~~~ 89 (141) T protein:vir:78 10 IPKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRKSSKEVIVGNSSDYAIYYEFGTGEKSER 89 (141) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeecCCcEEEEecCCCccceeecCCcccccC Confidence 33321 01222222222211 1000111222322221100 111 135688999999997553322 Q ss_pred CceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_021342. 
63 GTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDE 130 (197) Q Consensus 63 ~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~ 130 (197) +. ++...-|...++|...+ +...|+||||+++++++++++.+.+.+.+.+-- T Consensus 90 ~~----------grk~~w~y~~~~g~~~~------t~G~~aqpFl~~A~~~~~~~i~~~i~~~~~~l~ 141 (141) T protein:vir:78 90 GG----------GKAGGWFYMDKKGHWHF------TRGSQASKRMRYTFRDEQDKVRVFTERALRGIN 141 (141) T ss_pred CC----------CCcCcceeecCCCeeEe------ccCCCCchhhhhhHHhhHHHHHHHHHHHhhccC Confidence 10 01111111222222111 345899999999999999999999998887622 No 100 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=96.13 E-value=6.4e-05 Score=43.59 Aligned_cols=86 Identities=5% Similarity=0.008 Sum_probs=59.7 Q ss_pred HHHHHHHHHHHHHHHHHc--cCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCCcHHHHHhcCC--CCchhHHHHHH Q lcl|NC_021342. 111 VQSKSNEYVTIIERGASR--DESTTSILEKVGVTAQAAVRMFMTEL------QDPPNAKSTIRKKGS--SNPLIDTGALR 180 (197) Q Consensus 111 ~~~~~~~~~~~~~~~~~~--~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppna~~Ti~~KG~--~~PLidTG~L~ 180 (197) +++ -+++.+.+...+.. ..+.+.+|..||..+....++.|.+. .|+|+++.|++.|.. .++|..++.+. T Consensus 1 m~~-~~~~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG~~W~p~~~~~~~~~~g~~~~~~~~~l~~~ 79 (149) T protein:vir:18 1 MSE-LTALQERLAGLIASLSPAARRKMAAEIAKKLRTSQQQRIKRQQAPDGTPYAARKRQPVRSKKGRIKREMFAKLRTS 79 (149) T ss_pred Cch-HHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhccCcccchhhhhhhhh Confidence 222 22233333333332 12346699999999999999999974 589999999987653 57999999999 Q ss_pred hhceeeeeccccccc--cC Q lcl|NC_021342. 181 QSVTYVVHSGKLPDE--GL 197 (197) Q Consensus 181 ~SIty~V~~k~~~~~--~~ 197 (197) .++++.+........ 
|- T Consensus 80 ~~l~~~~~~~~~~v~~~Gt 98 (149) T protein:vir:18 80 RFMKAKGSDSAAVVEFTGK 98 (149) T ss_pred hhhheeecCceeEEEeccc Confidence 999988776654442 22 No 101 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=95.98 E-value=8e-05 Score=43.04 Aligned_cols=86 Identities=9% Similarity=0.040 Sum_probs=61.9 Q ss_pred HHHHHHHHHHHHHHHHHcc--CcHHHHHHHHHHHHHHHHHHHHHhC------CCCCCcHHHHHhcCC-CCchhHHHHHHh Q lcl|NC_021342. 111 VQSKSNEYVTIIERGASRD--ESTTSILEKVGVTAQAAVRMFMTEL------QDPPNAKSTIRKKGS-SNPLIDTGALRQ 181 (197) Q Consensus 111 ~~~~~~~~~~~~~~~~~~~--~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppna~~Ti~~KG~-~~PLidTG~L~~ 181 (197) +++ -+++.+.+...+..- .+-..+|..||..+....++.|++. .|+|+++.|.++||. .++|.+++.+.. T Consensus 1 m~~-~~~l~~~L~~ll~~l~~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~~~~~~~~l~~~~ 79 (148) T protein:vir:79 1 MSE-SRELEAWLAGMLTKLDAPARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRIRRAMFMRLRLAR 79 (148) T ss_pred Ccc-HHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccccccccchhhhhh Confidence 222 234444444444432 2235689999999999999999973 478999999999986 479999999999 Q ss_pred hceeeeeccccccc--cC Q lcl|NC_021342. 182 SVTYVVHSGKLPDE--GL 197 (197) Q Consensus 182 SIty~V~~k~~~~~--~~ 197 (197) ++++.+....+... |. T Consensus 80 ~l~~~~~~~~~~v~~~Gt 97 (148) T protein:vir:79 80 YMKTQADANTAVVTFAGN 97 (148) T ss_pred heeeeeeCCeeeEEeecc Confidence 99998866654443 33 No 102 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=95.97 E-value=1.7e-05 Score=46.78 Aligned_cols=102 Identities=16% Similarity=0.195 Sum_probs=47.3 Q ss_pred CcccccHHHHHHHHHHHHHHHh----c---------CCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceee Q lcl|NC_021342. 
1 MMKVVGLQETLAELDKVLGQIR----D---------DQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYG 67 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~~~----~---------~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~ 67 (197) |= +.+.+.|.+....+. . ..++.+-+..+ .......+.+.||.+.|||.-+..+..... T Consensus 1 v~-----~~v~~~~~~~~~~i~~~ak~~apv~TG~Lr~SI~~~~~~~--~~~~~V~~~~~Ya~yvE~GTg~~~~~~~~~- 72 (116) T protein:vir:95 1 ME-----RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDG--GFTGVINIGSEYAIYVNYGTGIYATGAGGS- 72 (116) T ss_pred Ch-----HHHHHHHHHHHHHHHHHHHhhCCccccccccceeEEeecC--cEEEEEecCCCccceeecCccccccCCCcc- Confidence 11 112222222222221 1 12233322211 011112356889999999976543221100 Q ss_pred ecccccccccCCc-cccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 68 YATEEAESRKEVR-FLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIE 123 (197) Q Consensus 68 ~~~~~~~~~~~~~-f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~ 123 (197) .....+ +....+|. + ..+..+|+||||++++++++..+.+.+. T Consensus 73 -------~~~~~~~~~~~~~g~--~----~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 73 -------RAKNIPWSYKDANGK--W----HTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred -------ccccccceeecCccc--e----eeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 000000 00111111 1 1256799999999999999998888777 No 103 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=95.94 E-value=2.2e-05 Score=46.08 Aligned_cols=108 Identities=18% Similarity=0.236 Sum_probs=47.5 Q ss_pred Ccccc--cHHHHHHHHHHHHHHHh--c----CCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccc Q lcl|NC_021342. 1 MMKVV--GLQETLAELDKVLGQIR--D----DQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEE 72 (197) Q Consensus 1 M~ki~--~~~~~~~~L~~l~~~~~--~----~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~ 72 (197) |=++. ...+....+.+.++.+. + ..++.+-+..+ + -.....+.+.||.+.|||.-+..+.+...... 
T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~-~-~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~--- 75 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDG-G-FTGVINIGSEYAIYVNYGTGIYATGAGGSRAK--- 75 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeecC-c-EEEEEecCCCcccccccCCcccccCCCccccc--- Confidence 11111 11111112222222111 0 12222222111 0 11112356889999999976643332111000 Q ss_pred cccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 73 AESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIE 123 (197) Q Consensus 73 ~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~ 123 (197) +... +.....|. + ..+..+|+||||++++++++..+.+.+. T Consensus 76 ---~~~~-~~~~~~g~--~----~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 76 ---KIPW-SYKDANGK--W----HTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ---ccce-eeecCCce--e----eecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 0000 00001111 1 1255799999999999999998887777 No 104 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=95.94 E-value=2.2e-05 Score=46.08 Aligned_cols=108 Identities=18% Similarity=0.236 Sum_probs=47.5 Q ss_pred Ccccc--cHHHHHHHHHHHHHHHh--c----CCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccc Q lcl|NC_021342. 1 MMKVV--GLQETLAELDKVLGQIR--D----DQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEE 72 (197) Q Consensus 1 M~ki~--~~~~~~~~L~~l~~~~~--~----~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~ 72 (197) |=++. ...+....+.+.++.+. + ..++.+-+..+ + -.....+.+.||.+.|||.-+..+.+...... 
T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~-~-~~~~V~~~~~YA~yvE~GTg~~~~~~~~~~~~--- 75 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDG-G-FTGVINIGSEYAIYVNYGTGIYATGAGGSRAK--- 75 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeecC-c-EEEEEecCCCcccccccCCcccccCCCccccc--- Confidence 11111 11111112222222111 0 12222222111 0 11112356889999999976643332111000 Q ss_pred cccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 73 AESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIE 123 (197) Q Consensus 73 ~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~ 123 (197) +... +.....|. + ..+..+|+||||++++++++..+.+.+. T Consensus 76 ---~~~~-~~~~~~g~--~----~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 76 ---KIPW-SYKDANGK--W----HTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ---ccce-eeecCCce--e----eecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 0000 00001111 1 1255799999999999999998887777 No 105 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=95.84 E-value=1.4e-05 Score=47.19 Aligned_cols=103 Identities=12% Similarity=0.045 Sum_probs=45.3 Q ss_pred CcccccHHHHHHHHHHHHHHHh----c---------CCeEEEEeecC--CCCCCCccchHHHHhhHhHcCc---eeeeCC Q lcl|NC_021342. 1 MMKVVGLQETLAELDKVLGQIR----D---------DQYVTVGIHEA--AGDVESGEINMATLGAVLNFGA---EIDHPG 62 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~~~----~---------~~~V~VGi~~~--~~~~~~~~~~~A~iA~~~EfGa---~I~~p~ 62 (197) |+.-.+ ..+.+.|++....+. . ..++...+..+ ..+.+....+.+.||.++|||. .|++.. 
T Consensus 14 ~~~~~~-~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~V~~~~~YA~~ve~GT~ph~I~Pk~ 92 (137) T protein:vir:10 14 EARQFQ-VIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIVVAGPLRLDSGVTAHADYARYVHDGTRAHVIRPRR 92 (137) T ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeeeccccceEEEEecCCCccceeeecCCCCceeeccc Confidence 222211 122223333322221 1 11122111111 1111112235689999999996 255433 Q ss_pred CceeeecccccccccCCcccccccccccccccccccc---CCCcchhHHHHHHHHHHHHHHHH Q lcl|NC_021342. 63 GTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKI---NIPARPWLEPGVQSKSNEYVTII 122 (197) Q Consensus 63 ~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v---~IP~RpFlr~t~~~~~~~~~~~~ 122 (197) +..+++.... ..++ -.++| .+|+||||+++++++..+-.... T Consensus 93 ~k~~l~~~~~------g~~v------------f~k~V~hPG~~a~PfL~~A~~~~~~~~~~~~ 137 (137) T protein:vir:10 93 PGGVLRFTVG------GRVV------------YARRVNHPGTRARPFLRNAAERVVARETATS 137 (137) T ss_pred cceeeeEeeC------CeeE------------ecceeecCCCCCCchHHHHHHHhhhhhcccC Confidence 3333332211 1111 01223 36699999999999887665444 No 106 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=95.55 E-value=4.9e-05 Score=44.19 Aligned_cols=89 Identities=18% Similarity=0.175 Sum_probs=50.4 Q ss_pred Ccccc--cHHHHHHHHHHHHHHHh--cCCeEEEEee-----------cC-------------CCCC-CCccchHHHHhhH Q lcl|NC_021342. 1 MMKVV--GLQETLAELDKVLGQIR--DDQYVTVGIH-----------EA-------------AGDV-ESGEINMATLGAV 51 (197) Q Consensus 1 M~ki~--~~~~~~~~L~~l~~~~~--~~~~V~VGi~-----------~~-------------~~~~-~~~~~~~A~iA~~ 51 (197) |+.+. |.+++.+.|+++..... .++.++.|.- .. .++- -+..-+-+-|+-. 
T Consensus 1 Ma~iel~G~del~~~l~~~g~~~~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkkik~~~kk~g~~~VG~~ks~~fy~kF 80 (119) T protein:vir:10 1 MASLEIEGFEEFEKFISEDMVLDESTKRKGIKAGITKIGKAIEKNSPIKSGRLSKVKIRVKNTGLATEGTASSSEFYDIF 80 (119) T ss_pred CceeehhhHHHHHHHHHhhhhhhHHHHHHHHHHHhHHHHHHHhhcCCcccCCcceeeeeeecCceeEeccCCcchhhhhh Confidence 77665 77777777755543211 1122222210 00 0000 0000112344444 Q ss_pred hHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcc-hhHHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_021342. 52 LNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPAR-PWLEPGVQSKSNEYVTIIERGASRDE 130 (197) Q Consensus 52 ~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~R-pFlr~t~~~~~~~~~~~~~~~~~~~~ 130 (197) +||| |...|+| |||.+++++..++....+.+.+...+ T Consensus 81 ~EFG------------------------------------------TSkm~a~~pF~~~a~~~~~~eA~~~~~~el~~~~ 118 (119) T protein:vir:10 81 QNFG------------------------------------------TSEQKAHVGYFDRAVDETTNEAVEEVAEIIFRKM 118 (119) T ss_pred cccc------------------------------------------ccccCCCCCccccccccChHHHHHHHHHHHHHhc Confidence 4444 6789999 99999999999999999988876655 Q ss_pred c Q lcl|NC_021342. 131 S 131 (197) Q Consensus 131 ~ 131 (197) - T Consensus 119 r 119 (119) T protein:vir:10 119 R 119 (119) T ss_pred C Confidence 4 No 107 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=95.54 E-value=7.3e-05 Score=43.27 Aligned_cols=93 Identities=15% Similarity=0.148 Sum_probs=45.8 Q ss_pred Cccccc--H-HHHHHHHHHHHHHHh-------------------cCCeEEE-----EeecCCCCCCCcc------chHHH Q lcl|NC_021342. 1 MMKVVG--L-QETLAELDKVLGQIR-------------------DDQYVTV-----GIHEAAGDVESGE------INMAT 47 (197) Q Consensus 1 M~ki~~--~-~~~~~~L~~l~~~~~-------------------~~~~V~V-----Gi~~~~~~~~~~~------~~~A~ 47 (197) |++|+- . +.+.+.|+.+.+.+. ..--+.- ||-.......+.. -+-.. 
T Consensus 1 Ma~i~id~la~~I~~~L~~y~~~v~~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~~~g~~~~vv~~~~~~~ 80 (126) T protein:vir:81 1 MANITIDRLADELLQAVKEYTDDVAEGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKEDGYGTTKRIIWNKKHYR 80 (126) T ss_pred CcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccchhhccccccccccCCcceEEEeccCCCC Confidence 887762 1 224444554433221 0000111 1111000000000 00012 Q ss_pred HhhHhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 48 LGAVLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGAS 127 (197) Q Consensus 48 iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~ 127 (197) ++.+.|||. .+++.| -+|+||||+|+++...+++.+.++++|. T Consensus 81 l~HLLEfGh-------------------------a~r~gG------------rV~a~Phi~Pa~e~~~~~~~~~i~~~l~ 123 (126) T protein:vir:81 81 RVHLLEFGH-------------------------AKVNGG------------RVKEYPHLRPAYDKHGARLPDELKRVIE 123 (126) T ss_pred ceeeeecce-------------------------ecCCCC------------ccCCCcchHHHHHHHHHHHHHHHHHHhh Confidence 233445551 011111 2799999999999999999999999999 Q ss_pred ccC Q lcl|NC_021342. 128 RDE 130 (197) Q Consensus 128 ~~~ 130 (197) ++. T Consensus 124 ~gg 126 (126) T protein:vir:81 124 NGG 126 (126) T ss_pred cCC Confidence 877 No 108 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=95.24 E-value=0.00016 Score=41.42 Aligned_cols=122 Identities=15% Similarity=-0.007 Sum_probs=57.9 Q ss_pred Cccc---ccHHHHHHHHHHHHHHH----hcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceee-ecccc Q lcl|NC_021342. 1 MMKV---VGLQETLAELDKVLGQI----RDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYG-YATEE 72 (197) Q Consensus 1 M~ki---~~~~~~~~~L~~l~~~~----~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~-~~~~~ 72 (197) ...= .+..+....|.+|...+ .+.....++++.+ ..+.||++|+||.+.++....... ..+.. 
T Consensus 59 w~pRK~~~~k~k~~rm~~kL~~~~~~~~~~~~~~~~~~~~g---------~~~~IA~vHQ~G~~~rv~~~~~~~~~~~~~ 129 (231) T protein:vir:37 59 WEKRKPVDGEIKNKRLLKKVLRYASILAEERGKGRIYYKNP---------LTGEIAQKQQDGFTEHFRVFATDKNKNGSG 129 (231) T ss_pred CchhcccccchhhHHHHHHhHHhhccccccCCceEEeeecc---------hHHHHHHHhhcCcccccchhhhhhccCCCC Confidence 1110 11111222333332211 1222344444322 368899999999988775431110 00100 Q ss_pred --cccc--------cC--Cc-----ccc----ccc--------cccccc------c--cc------ccccCCCcchhHHH Q lcl|NC_021342. 73 --AESR--------KE--VR-----FLK----TGT--------GFKPLG------V--TK------PHKINIPARPWLEP 109 (197) Q Consensus 73 --~~~~--------~~--~~-----f~k----~~~--------g~~~~~------~--~~------~~~v~IP~RpFlr~ 109 (197) .+-. -| ++ ..+ +.+ .+.+.. . .. .-+|.+|+||||-. T Consensus 130 ~~pATr~QAk~Lr~lGy~v~~~k~k~~k~~~rkps~kwI~~~ls~~qAgliIR~L~~k~~~~~~k~~W~I~~paR~FLG~ 209 (231) T protein:vir:37 130 NDRATIRQAQKLRSLGYRKRNGKNRQGKTKYRLYTIKEIRERLTRTWASMEIRRLENKVNAGNGKTNWEIHVPARPFLDT 209 (231) T ss_pred CCCCCHHHHHHHHHhcccccCCCCCCCCCCcCcCCHHHHHHhhhhHHHHHHHHHHhcccccccCcceeeeecCcccccCC Confidence 0000 00 11 100 000 000000 0 10 13589999999998 Q ss_pred HHHHHHHHHHHHHHHHHHccCc Q lcl|NC_021342. 110 GVQSKSNEYVTIIERGASRDES 131 (197) Q Consensus 110 t~~~~~~~~~~~~~~~~~~~~~ 131 (197) .-++...-+..++.+++.+... T Consensus 210 ~~~e~~~~l~~~l~~i~~~~~~ 231 (231) T protein:vir:37 210 REKENVDILREITLKFLSGEYK 231 (231) T ss_pred CHHHHHHHHHHHHHHHhcccCC Confidence 8888888888889999888766 No 109 >protein:vir:78755 Length: 228 # NCBI annotation: putative tail completion protein # Family: family:all:743 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285651;genbank:gi:148727157;genbank:GeneID:5220102 Probab=95.08 E-value=0.00017 Score=41.24 Aligned_cols=132 Identities=13% Similarity=0.201 Sum_probs=55.7 Q ss_pred CcccccHHHHHHHHHHHHHHH-hcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCce---eeecccccccc Q lcl|NC_021342. 
1 MMKVVGLQETLAELDKVLGQI-RDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTS---YGYATEEAESR 76 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~~-~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~---~~~~~~~~~~~ 76 (197) =-+-.+-.++...|.+.++-. .....+.|||..+... ..++.||++|+||.+.++..+.. |.-.+.....- T Consensus 56 ~pRKr~krKMl~~L~k~Lk~~~~~~~~a~v~f~~~~~~-----~~~~rIA~vHq~G~~~~v~~~~~~~~~~~r~~~~~pa 130 (228) T protein:vir:78 56 APRKRGKRKMLRGLPKLLQIREPRQDMAELGFTKGTMS-----AHAGVIANTHQKGHTYKVTAASRRRIAPSDVGKNKQA 130 (228) T ss_pred hhhhhhHHHHHhhhHHhhhhhcccccceEEEeecCccc-----chHHHHHHHHhcCcccccccchhhhhhcccCCCCCCC Confidence 111112222233333333211 1334688988643211 24789999999999887765422 21111111000 Q ss_pred --c--------CCccc-cccccccc--------------ccc----------ccccccCCCcchhHHHHHHHHHHHHHHH Q lcl|NC_021342. 77 --K--------EVRFL-KTGTGFKP--------------LGV----------TKPHKINIPARPWLEPGVQSKSNEYVTI 121 (197) Q Consensus 77 --~--------~~~f~-k~~~g~~~--------------~~~----------~~~~~v~IP~RpFlr~t~~~~~~~~~~~ 121 (197) + |-+.. +.++++.. ... --.-+|.+|+||||-..-++...-+..+ T Consensus 131 Tr~QAk~Lr~lGy~~~~~~~k~~rkps~kwI~~nls~gqAgliir~L~~k~~k~~W~I~~PaR~FLG~s~~e~~~~l~~~ 210 (228) T protein:vir:78 131 SKAQARKLRELGFKRPGKRKRAYRSASLGWITANLNYAQAGLLIKKLKDEPVKESWEIQLPARPFLGANARQRQQAFALR 210 (228) T ss_pred CHHHHHHHHHhhccccCCcCCCcccCCHHHHHHHhhHHHHHHHHHHHhCCCCccceeeecCcccccCCCHHHHHHHHHHH Confidence 0 00110 11122110 000 0114889999999976655554444444 Q ss_pred HHHHHHccCc--HHHHHHH Q lcl|NC_021342. 122 IERGASRDES--TTSILEK 138 (197) Q Consensus 122 ~~~~~~~~~~--~~~~l~~ 138 (197) ++. 
+.-+.+ +.+.=.+ T Consensus 211 l~~-i~~g~~~~~qd~~~~ 228 (228) T protein:vir:78 211 PES-IDYGWDVNKQDMKGK 228 (228) T ss_pred HHh-cccCCCcchhhccCC Confidence 433 221111 1111111 No 110 >protein:vir:3750 Length: 227 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043491;genbank:gi:9628626;genbank:GeneID:1261131 Probab=94.56 E-value=0.00019 Score=40.96 Aligned_cols=121 Identities=9% Similarity=-0.014 Sum_probs=54.2 Q ss_pred Cc-ccccHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCce-eeeccccc---cc Q lcl|NC_021342. 1 MM-KVVGLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTS-YGYATEEA---ES 75 (197) Q Consensus 1 M~-ki~~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~-~~~~~~~~---~~ 75 (197) .. +-.+-.++...|.+++.--.......|+|..+ ..+.||++|+||.+.++..... ..+-.... +. T Consensus 59 ~~pRKr~k~KM~~kL~k~l~~~~~~~~a~v~f~~g---------~~~~IA~vHq~G~~~~v~~~~~~~~~~~~~~~~paT 129 (227) T protein:vir:37 59 WKKRKNGTAKMLRRIAKLANSKAEKAQGTLFYKQK---------RTGEIAQEHQEGIPHLFKKTEFTGKNKGGIGADPCT 129 (227) T ss_pred CchhcchhHHHHhhhHHHcceeecccceEEEecCc---------chHHHHHHhhcCcccccchhhhhhhhcCCccccCCC Confidence 11 11122222333333221112334456887533 2578999999999998744311 11100000 00 Q ss_pred c--------cC--Cc----------cccccc-------cccccc------c----c------cccccCCCcchhHHHHHH Q lcl|NC_021342. 76 R--------KE--VR----------FLKTGT-------GFKPLG------V----T------KPHKINIPARPWLEPGVQ 112 (197) Q Consensus 76 ~--------~~--~~----------f~k~~~-------g~~~~~------~----~------~~~~v~IP~RpFlr~t~~ 112 (197) . -| ++ +.+..- .+.+.. . 
+ -.-+|.+|+||||-.+-+ T Consensus 130 r~QAk~Lr~lGy~v~~~k~k~~k~~~rkps~kwI~~nls~~qAgliIR~L~~k~~~~~~~~k~~W~I~~PaR~FLG~~~~ 209 (227) T protein:vir:37 130 LRQAKKLKDLGYTVANGKTKNGKAKRRKPTLSEIRSTLSRAKASLIIRKLEEKNGMNPSRHLTQWIIPTEKRSFLDTREE 209 (227) T ss_pred HHHHHHHHHhcccccCCCCCCcCCccccCCHHHHHHhhhHHHHHHHHHHHhcccccccccCccceeeecCcccccCCCHH Confidence 0 00 01 000000 000000 0 0 113788999999988766 Q ss_pred HHHHHHHHHHHHHHHccC Q lcl|NC_021342. 113 SKSNEYVTIIERGASRDE 130 (197) Q Consensus 113 ~~~~~~~~~~~~~~~~~~ 130 (197) +...-+..++.++-...- T Consensus 210 e~~~~l~r~l~~~~~~~~ 227 (227) T protein:vir:37 210 ENAKIILAEIQKYTQKQQ 227 (227) T ss_pred HHHHHHHHHHHHHhhhcC Confidence 666666666665544333 No 111 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=94.42 E-value=0.00067 Score=37.98 Aligned_cols=87 Identities=10% Similarity=0.042 Sum_probs=56.6 Q ss_pred HHHHHHHHHHHHHHHHHc--cCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCCcHHHHHhcCCCCchhHHHHHHhh Q lcl|NC_021342. 111 VQSKSNEYVTIIERGASR--DESTTSILEKVGVTAQAAVRMFMTEL------QDPPNAKSTIRKKGSSNPLIDTGALRQS 182 (197) Q Consensus 111 ~~~~~~~~~~~~~~~~~~--~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppna~~Ti~~KG~~~PLidTG~L~~S 182 (197) ++++-.++...+...+.. ..+...+|..||..+....++.|.+. .|+|+++.+..+|+..+-......|+.| T Consensus 1 M~~~~~~~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~~~k~~~~~~~m~~~L~~a 80 (152) T protein:vir:10 1 MSEPIEQVKTAFDSLLNNISKPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKKGVKSKIKSGKMFDKITQP 80 (152) T ss_pred CchHHHHHHHHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhhcccccchhHHHhhhhc Confidence 455555555555555554 23446699999999999999999986 5788888887777665555555555554 Q ss_pred --ceeeeeccccccc--cC Q lcl|NC_021342. 183 --VTYVVHSGKLPDE--GL 197 (197) Q Consensus 183 --Ity~V~~k~~~~~--~~ 197 (197) ++|+.....+... 
|- T Consensus 81 ~~l~~~a~~~~~~Vg~~Gt 99 (152) T protein:vir:10 81 RFMRLRLESEGVSLGYEGG 99 (152) T ss_pred ceeeeeecCcEEEEEecCC Confidence 5666544443322 22 No 112 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=94.29 E-value=0.00058 Score=38.34 Aligned_cols=87 Identities=10% Similarity=0.048 Sum_probs=62.6 Q ss_pred HHHHHHHHHHHHHHHHHc--cCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCCcHHHHHhc-----C--CCCchhH Q lcl|NC_021342. 111 VQSKSNEYVTIIERGASR--DESTTSILEKVGVTAQAAVRMFMTEL------QDPPNAKSTIRKK-----G--SSNPLID 175 (197) Q Consensus 111 ~~~~~~~~~~~~~~~~~~--~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppna~~Ti~~K-----G--~~~PLid 175 (197) ++++-.++.+.+..++.. ..+...+|..||..+....++.|... .|+|+++.|..++ | ...+|.+ T Consensus 1 m~~~~~~l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~~W~prk~~~~~~~~~~~~g~~~~~~m~~ 80 (155) T protein:vir:79 1 MTDDLQALERWAGGLLAKLSPAARRQLLRELGRDLRRAQQSRVAAQRNPDGSAYEPRKVKAGGKRLREKAGRVKREAMFR 80 (155) T ss_pred CchHHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchhhhhhhhhcccCcccchhhhh Confidence 445555666666555543 23446699999999999999999974 4788998886543 3 2467999 Q ss_pred HHHHHhhceeeeeccccccc--cC Q lcl|NC_021342. 176 TGALRQSVTYVVHSGKLPDE--GL 197 (197) Q Consensus 176 TG~L~~SIty~V~~k~~~~~--~~ 197 (197) .+.+-.+|+|++....+... 
|- T Consensus 81 ~l~~a~~l~~~~~~d~a~Vg~~Gs 104 (155) T protein:vir:79 81 KLRTARYLRIDVDSTGLAIGFDER 104 (155) T ss_pred hhhhhheeeeeecCcEEEEEecCc Confidence 99999999999876665442 22 No 113 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=93.73 E-value=0.00014 Score=41.68 Aligned_cols=102 Identities=17% Similarity=0.113 Sum_probs=43.7 Q ss_pred Ccccc---------------cHHHHHHHHHHHHHHHhc----CCeEEEEeecCCCC---C-C------CccchHHHHhhH Q lcl|NC_021342. 1 MMKVV---------------GLQETLAELDKVLGQIRD----DQYVTVGIHEAAGD---V-E------SGEINMATLGAV 51 (197) Q Consensus 1 M~ki~---------------~~~~~~~~L~~l~~~~~~----~~~V~VGi~~~~~~---~-~------~~~~~~A~iA~~ 51 (197) |+++. ..+.+++.+++....+.+ ..-|.-|-+..+-. . + ....++|.||.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~ 80 (140) T protein:vir:97 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAP 80 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhh Confidence 44443 112223333333332211 11122233322111 0 1 111356899999 Q ss_pred hHcCc---eeeeCCCceeeecccccccccCCccccccccccccccccccccC---CCcchhHHHHHHHH---HHHHHHH Q lcl|NC_021342. 52 LNFGA---EIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKIN---IPARPWLEPGVQSK---SNEYVTI 121 (197) Q Consensus 52 ~EfGa---~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~---IP~RpFlr~t~~~~---~~~~~~~ 121 (197) +|||. .|++..+ ..+ .|...++.++ ...|+ +++||||++++++. +..+... 
T Consensus 81 Ve~GT~ph~I~pk~~-k~L------------~~~~~G~~~~------~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:97 81 VHEGSRPHAIRARNA-QYL------------HFWWHGREMF------RKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred hccCCCCceeecCCC-ccc------------eeecCCCEEE------eeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 99997 3332222 222 2222222111 12344 45999999999874 3333222 No 114 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=93.73 E-value=0.00014 Score=41.68 Aligned_cols=102 Identities=17% Similarity=0.113 Sum_probs=43.7 Q ss_pred Ccccc---------------cHHHHHHHHHHHHHHHhc----CCeEEEEeecCCCC---C-C------CccchHHHHhhH Q lcl|NC_021342. 1 MMKVV---------------GLQETLAELDKVLGQIRD----DQYVTVGIHEAAGD---V-E------SGEINMATLGAV 51 (197) Q Consensus 1 M~ki~---------------~~~~~~~~L~~l~~~~~~----~~~V~VGi~~~~~~---~-~------~~~~~~A~iA~~ 51 (197) |+++. ..+.+++.+++....+.+ ..-|.-|-+..+-. . + ....++|.||.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~~~~~v~~~a~YA~~ 80 (140) T protein:vir:10 1 MATIRARARIEIDEAALERESGEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFRVRGGVEATADYAAP 80 (140) T ss_pred CeeeeeeeeeeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCceEEEEecCCccchhh Confidence 44443 112223333333332211 11122233322111 0 1 111356899999 Q ss_pred hHcCc---eeeeCCCceeeecccccccccCCccccccccccccccccccccC---CCcchhHHHHHHHH---HHHHHHH Q lcl|NC_021342. 52 LNFGA---EIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKIN---IPARPWLEPGVQSK---SNEYVTI 121 (197) Q Consensus 52 ~EfGa---~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~---IP~RpFlr~t~~~~---~~~~~~~ 121 (197) +|||. .|++..+ ..+ .|...++.++ ...|+ +++||||++++++. +..+... 
T Consensus 81 Ve~GT~ph~I~pk~~-k~L------------~~~~~G~~~~------~k~V~hpG~~a~Pfl~~A~~~~~~~~~~i~~~ 140 (140) T protein:vir:10 81 VHEGSRPHAIRARNA-QYL------------HFWWHGREMF------RKSVWHPGTRARPFMRNSAQRVVTNDPRVRMT 140 (140) T ss_pred hccCCCCceeecCCC-ccc------------eeecCCCEEE------eeeeecCCCCCChhHHHHHHHHhhhhhhccCC Confidence 99997 3332222 222 2222222111 12344 45999999999874 3333222 No 115 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=93.56 E-value=0.0014 Score=36.21 Aligned_cols=87 Identities=6% Similarity=-0.019 Sum_probs=57.8 Q ss_pred HHHHHHHHHHHHHHHHHc--cCcHHHHHHHHHHHHHHHHHHHHHhC------CCCCCcHHHHHhcCC----CCchhHHHH Q lcl|NC_021342. 111 VQSKSNEYVTIIERGASR--DESTTSILEKVGVTAQAAVRMFMTEL------QDPPNAKSTIRKKGS----SNPLIDTGA 178 (197) Q Consensus 111 ~~~~~~~~~~~~~~~~~~--~~~~~~~l~~iG~~~~~~i~~~I~~~------~~ppna~~Ti~~KG~----~~PLidTG~ 178 (197) +++.-.++.+.+...+.. ..+...+|..||..+....++.|... .|+|+++.|++.|.. ..+|..... T Consensus 1 m~~~~~~l~~~L~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PdG~~W~p~~~~~~~~~~~~~~~~~~m~~~l~ 80 (156) T protein:vir:11 1 MADSLEALEDWAGPILRALEPGPRAALARSLARDLRRSQQKRVMAQRNPDGSAYEPRKKRELRGKQGRIRRKIKMFQKLR 80 (156) T ss_pred CchhHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcccchHHHhhhccccccchhhhhhhh Confidence 556666666666665543 23456799999999999999999974 588999999987642 234544444 Q ss_pred HHhhceeeeeccccccc--cC Q lcl|NC_021342. 179 LRQSVTYVVHSGKLPDE--GL 197 (197) Q Consensus 179 L~~SIty~V~~k~~~~~--~~ 197 (197) +..+|++.+....+... 
|- T Consensus 81 ~~~~l~~~~~~~~a~vg~~Gs 101 (156) T protein:vir:11 81 TVRYLRAKGDAQAITVSFAGR 101 (156) T ss_pred hhheeeeeecCcEEEEEecCC Confidence 45557777655444331 22 No 116 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=92.74 E-value=0.00017 Score=41.19 Aligned_cols=106 Identities=16% Similarity=0.245 Sum_probs=43.3 Q ss_pred Ccccc-------cHHHHHHHHHHHHHHHhc-------------CCeEEEEeecCCCC-CCCccchHHHHhhHhHcCc--- Q lcl|NC_021342. 1 MMKVV-------GLQETLAELDKVLGQIRD-------------DQYVTVGIHEAAGD-VESGEINMATLGAVLNFGA--- 56 (197) Q Consensus 1 M~ki~-------~~~~~~~~L~~l~~~~~~-------------~~~V~VGi~~~~~~-~~~~~~~~A~iA~~~EfGa--- 56 (197) |.++. .++-+.+.+++...++.+ ..++.+.+..+.+. ......+++.||.++|||. T Consensus 5 ~~~l~~~~l~~~~~~~~~~~~~~~a~~ve~~ak~~aPv~TG~Lr~SI~~~~~~~~g~~v~~~V~~~~~YA~~ve~GT~ph 84 (137) T protein:vir:10 5 TLRIERAQLHGLGMDEARKAVNRVVRRTFTRSQILAPVDTGYLRASGRLVLGRERGAVVIGSVEYTARYAAAVHNGRRAL 84 (137) T ss_pred ccccChhhHhhHHHHHHHHHHHHHHHHHHHHHHhcCCcCchhhhccceeeeeeccccEEEEEecCCcccceeeecCCCCc Confidence 22221 112223333333332211 11222222111000 0111235789999999997 Q ss_pred eeeeCCCceeeecccccccccCCccccccccccccccccccccCCC---cchhHHHHHHHHHHHHHHHHHHHHHccCc Q lcl|NC_021342. 57 EIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIP---ARPWLEPGVQSKSNEYVTIIERGASRDES 131 (197) Q Consensus 57 ~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP---~RpFlr~t~~~~~~~~~~~~~~~~~~~~~ 131 (197) +|++..+. .+ .|...++ + +-...|+.| +||||++++++....- ++...+. 
T Consensus 85 ~I~pk~~k-aL------------~f~~~G~----~--vf~k~V~hPG~k~~PfL~~Al~~~~~~~------~~~~~~~ 137 (137) T protein:vir:10 85 TIRAKGNG-RL------------KFTVEGR----T--VYARSVHQPARAGRPYLSQALREVAPQE------GFRVTIG 137 (137) T ss_pred eeecCCCc-cc------------eeecCCe----e--EeccceecCCCCCChhhHHHHHHhhccc------ceeEeeC Confidence 45543332 22 2211111 0 112244444 9999999988655322 2222222 No 117 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=92.32 E-value=0.00016 Score=41.41 Aligned_cols=102 Identities=15% Similarity=0.134 Sum_probs=41.7 Q ss_pred Ccccc-cHH--------HHHHHHHHHHHHHh----cCCeEEEEeecCCCC----------CCCccchHHHHhhHhHcCc- Q lcl|NC_021342. 1 MMKVV-GLQ--------ETLAELDKVLGQIR----DDQYVTVGIHEAAGD----------VESGEINMATLGAVLNFGA- 56 (197) Q Consensus 1 M~ki~-~~~--------~~~~~L~~l~~~~~----~~~~V~VGi~~~~~~----------~~~~~~~~A~iA~~~EfGa- 56 (197) ++++. ... .+++.|++...++. ...-|.-|-+..+-. -.....+++.||.++|||. T Consensus 4 s~~i~i~~~~l~~~v~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~v~~~~~YA~~ve~GT~ 83 (137) T protein:vir:10 4 TARIHINEPELERQTGAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTYRPFHVGGGVEDNVDYAAPVHEGSR 83 (137) T ss_pred eEEEeeCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeeccccceEEEEEecCCCceeeeeecCC Confidence 22222 111 12233333333221 111122233222110 0111246789999999996 Q ss_pred --eeeeCCCceeeecccccccccCCccccccccccccccccccccCCC---cchhHHHHHHHHH---HHHHHHH Q lcl|NC_021342. 57 --EIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIP---ARPWLEPGVQSKS---NEYVTII 122 (197) Q Consensus 57 --~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP---~RpFlr~t~~~~~---~~~~~~~ 122 (197) +|++..+ .++ +|...++. +-..++++| |||||++++++.. ..+. .. 
T Consensus 84 ph~I~pk~~-k~l------------~f~~~G~~------v~~k~v~hpG~~a~Pfl~~A~~~~~~~~~ri~-~~ 137 (137) T protein:vir:10 84 PHRITARHA-NAL------------HFFWHGRE------VFRKSVWHPGVRPRPFLRNAARRVVAADPDIH-MT 137 (137) T ss_pred CceeecccC-cee------------eeeeCCce------EEeeeeecCCCCCCchHHHHHHHHhhcccccc-CC Confidence 4543322 222 12111111 112245555 9999999998742 2221 11 No 118 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=92.08 E-value=0.00041 Score=39.15 Aligned_cols=67 Identities=19% Similarity=0.205 Sum_probs=30.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHc-cCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_021342. 107 LEPGVQSKSNEYVTIIERGASR-DESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTY 185 (197) Q Consensus 107 lr~t~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty 185 (197) |-.++ ...+++.+.++++-+. .-...++|+..+..+++.++.. + | +|||.|++||++ T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~---------a-----------P-vdTG~Lr~SI~~ 58 (137) T protein:vir:94 1 MAKVK-YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISL---------M-----------P-VDTGYLRESVTM 58 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---------C-----------C-ccccchhcccee Confidence 32221 1223333333222110 1123344555555444444431 2 2 599999999999 Q ss_pred eeeccccccccC Q lcl|NC_021342. 186 VVHSGKLPDEGL 197 (197) Q Consensus 186 ~V~~k~~~~~~~ 197 (197) ++..+. -+|. 
T Consensus 59 ~~~~~~--~~~~ 68 (137) T protein:vir:94 59 DFKDSG--FTGV 68 (137) T ss_pred EeecCc--eEEE Confidence 985443 2222 No 119 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=92.08 E-value=0.00041 Score=39.15 Aligned_cols=67 Identities=19% Similarity=0.205 Sum_probs=30.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHc-cCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_021342. 107 LEPGVQSKSNEYVTIIERGASR-DESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTY 185 (197) Q Consensus 107 lr~t~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty 185 (197) |-.++ ...+++.+.++++-+. .-...++|+..+..+++.++.. + | +|||.|++||++ T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~---------a-----------P-vdTG~Lr~SI~~ 58 (137) T protein:vir:97 1 MAKVK-YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISL---------M-----------P-VDTGYLRESVTM 58 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---------C-----------C-ccccchhcccee Confidence 32221 1223333333222110 1123344555555444444431 2 2 599999999999 Q ss_pred eeeccccccccC Q lcl|NC_021342. 186 VVHSGKLPDEGL 197 (197) Q Consensus 186 ~V~~k~~~~~~~ 197 (197) ++..+. -+|. T Consensus 59 ~~~~~~--~~~~ 68 (137) T protein:vir:97 59 DFKDSG--FTGV 68 (137) T ss_pred EeecCc--eEEE Confidence 985443 2222 No 120 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=92.08 E-value=0.00041 Score=39.15 Aligned_cols=67 Identities=19% Similarity=0.205 Sum_probs=30.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHc-cCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_021342. 
107 LEPGVQSKSNEYVTIIERGASR-DESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTY 185 (197) Q Consensus 107 lr~t~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty 185 (197) |-.++ ...+++.+.++++-+. .-...++|+..+..+++.++.. + | +|||.|++||++ T Consensus 1 Ma~~~-~g~~~l~~~l~~~~~~~~~~~~~~~~~~a~~i~~~ak~~---------a-----------P-vdTG~Lr~SI~~ 58 (137) T protein:vir:93 1 MAKVK-YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISL---------M-----------P-VDTGYLRESVTM 58 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---------C-----------C-ccccchhcccee Confidence 32221 1223333333222110 1123344555555444444431 2 2 599999999999 Q ss_pred eeeccccccccC Q lcl|NC_021342. 186 VVHSGKLPDEGL 197 (197) Q Consensus 186 ~V~~k~~~~~~~ 197 (197) ++..+. -+|. T Consensus 59 ~~~~~~--~~~~ 68 (137) T protein:vir:93 59 DFKDSG--FTGV 68 (137) T ss_pred EeecCc--eEEE Confidence 985443 2222 No 121 >protein:vir:98860 Length: 230 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654736;genbank:gi:109302921;genbank:GeneID:4156065 Probab=91.59 E-value=0.002 Score=35.41 Aligned_cols=121 Identities=12% Similarity=0.074 Sum_probs=47.2 Q ss_pred Cc-ccccHHHHHHHHHHHHHHH--hcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCcee-eeccc-ccc- Q lcl|NC_021342. 1 MM-KVVGLQETLAELDKVLGQI--RDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSY-GYATE-EAE- 74 (197) Q Consensus 1 M~-ki~~~~~~~~~L~~l~~~~--~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~-~~~~~-~~~- 74 (197) +. +-.+-.++...|.+++.-. .++....++++.+ ..+.||++|+||.+.++...... .+... 
..+ T Consensus 61 w~pRKr~k~KMl~~L~k~l~~~~~~~~~~~v~~~~~~---------~~~rIA~vHq~G~~~~~~~~~~~~r~~~~~~~~p 131 (230) T protein:vir:98 61 WKPRKNGNAKMLRRIAKTLKFTSADREIKRVCTISRN---------AQRRSQKEHQRGAKITNLKSVILRKSRAGTAKDP 131 (230) T ss_pred ChhhhhhhHHHHhhhHHHHHHhhcccccceeeeeccc---------chhhhhhhhhccchhhhhhhhhhhhhcCCCCccc Confidence 11 1111122222333332211 1222333444432 14679999999998877442111 00000 000 Q ss_pred -cc----------cCCccccc---ccccc--------------cccc---------c------cccccCCCcchhHHHHH Q lcl|NC_021342. 75 -SR----------KEVRFLKT---GTGFK--------------PLGV---------T------KPHKINIPARPWLEPGV 111 (197) Q Consensus 75 -~~----------~~~~f~k~---~~g~~--------------~~~~---------~------~~~~v~IP~RpFlr~t~ 111 (197) .. ..++-.+. ++++. +... . -+-+|.+|+||||-..- T Consensus 132 aTr~QAk~Lr~lGy~v~~g~~~~~~k~~kkps~kwI~~nls~~qAgliIR~L~~k~~k~~~~~t~W~I~~PaR~FLG~~~ 211 (230) T protein:vir:98 132 ATMRQAKKLRDLGYTVPNGTTKSGKKRYRRPSAREIVATLSRAKASLLIRYFQEKEERQGKRLTKWIIPTEKRPFLDERD 211 (230) T ss_pred ccHHHHHHHHHcCCccCCCCCCcCCCCCCCCCHHHHHHhhhHHHHHHHHHHHhccccccccCccceeeecCcccccCCCh Confidence 00 00111000 11111 0000 0 12478899999998765 Q ss_pred HHHHHHHHHHHHHHHHccC Q lcl|NC_021342. 112 QSKSNEYVTIIERGASRDE 130 (197) Q Consensus 112 ~~~~~~~~~~~~~~~~~~~ 130 (197) ++...-+..++.+.-...- T Consensus 212 ~e~~~~l~~~l~~i~~~~~ 230 (230) T protein:vir:98 212 KENAEILKEFILKFSGIEK 230 (230) T ss_pred HHHHHHHHHHHHHhccccC Confidence 5544444444433221111 No 122 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=89.38 E-value=0.0026 Score=34.79 Aligned_cols=91 Identities=11% Similarity=0.207 Sum_probs=48.9 Q ss_pred cccHHHHHHHHHHH----------------HHHHhcCCeEEEE-eec----------------CCCCC--CCccchHHHH Q lcl|NC_021342. 
4 VVGLQETLAELDKV----------------LGQIRDDQYVTVG-IHE----------------AAGDV--ESGEINMATL 48 (197) Q Consensus 4 i~~~~~~~~~L~~l----------------~~~~~~~~~V~VG-i~~----------------~~~~~--~~~~~~~A~i 48 (197) |+|.+.+.++|++. ..+.+.....-|| ... +.+.. -+.+.-.++| T Consensus 1 i~G~~~L~~~Lk~~s~~dvk~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g~~~~vgp~g~t~dY 80 (127) T protein:vir:98 1 MTGMPALEVKLRSMSEKRWDRVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNSSKDVITGNFGYIKDY 80 (127) T ss_pred CcChHHHHHHHHHhhHHHHHHHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEecCCceEEeccCcccccc Confidence 77777766666543 2222111001111 110 00000 0011123668 Q ss_pred hhHhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccC-CCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 49 GAVLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKIN-IPARPWLEPGVQSKSNEYVTIIERGAS 127 (197) Q Consensus 49 A~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~-IP~RpFlr~t~~~~~~~~~~~~~~~~~ 127 (197) |-.-|||. ||+.+++ .|. .|+-|||.++|+.++..+.+-+.+.++ T Consensus 81 apyvEyGT-----------------------R~m~~~~-----------~~gf~~aqp~l~paf~~Qk~iF~~DL~~l~k 126 (127) T protein:vir:98 81 APHVEYGH-----------------------RIVRNGK-----------QVGYANGTKYLFNNVKKQREIYRQDMLNELR 126 (127) T ss_pred cceeecce-----------------------eeeeccc-----------ccccccCccccccchHHHhHHHHHHHHHHhc Confidence 88888873 4443322 122 689999999999999988888877776 Q ss_pred c Q lcl|NC_021342. 128 R 128 (197) Q Consensus 128 ~ 128 (197) . T Consensus 127 ~ 127 (127) T protein:vir:98 127 R 127 (127) T ss_pred C Confidence 6 No 123 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=87.74 E-value=0.002 Score=35.35 Aligned_cols=68 Identities=22% Similarity=0.178 Sum_probs=32.3 Q ss_pred cccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcC Q lcl|NC_021342. 
89 KPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKG 168 (197) Q Consensus 89 ~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG 168 (197) |. . +.+ ++++-..++.++-.++ . -..+++|...+..+++.++.. + T Consensus 1 Ma-------~-------~~~-Gl~~l~~~l~~~~~~~-~--~~~~~al~~~a~~v~~~ak~~---------a-------- 45 (135) T protein:vir:96 1 MA-------K-------VKY-GADSIVVDLEKYSKDM-E--KWVKKGITKTTLKIYNTAIHL---------M-------- 45 (135) T ss_pred Cc-------h-------hhh-hHHHHHHHHHHHHHHH-H--HHHHHHHHHHHHHHHHHHHHh---------C-------- Confidence 00 0 001 3444333333322111 1 123445555555555554432 2 Q ss_pred CCCchhHHHHHHhhceeeeeccccccccC Q lcl|NC_021342. 169 SSNPLIDTGALRQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 169 ~~~PLidTG~L~~SIty~V~~k~~~~~~~ 197 (197) | +|||.|++||+++|.+.. .+|. T Consensus 46 ---p-vdTG~Lr~SI~~~~~~~g--~~~~ 68 (135) T protein:vir:96 46 ---P-VDTGFLRQSTTVDFENGG--FTGV 68 (135) T ss_pred ---C-ccchhhhcceeEEeecCc--EEEE Confidence 2 799999999999885443 2222 No 124 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=85.56 E-value=0.00074 Score=37.75 Aligned_cols=72 Identities=17% Similarity=0.088 Sum_probs=30.6 Q ss_pred cccccccccccCCCcchhHHHHHHHHHHHHHHH---HHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHH Q lcl|NC_021342. 89 KPLGVTKPHKINIPARPWLEPGVQSKSNEYVTI---IERGASRDESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIR 165 (197) Q Consensus 89 ~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~---~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~ 165 (197) |. .-+++ ++++-..++.++ +.+++ +.++.++...++..|+..... 
T Consensus 1 m~-----~v~i~---------Gld~L~~kl~~~~~~~~~~v------~~a~~~~~~~~a~~v~~~ak~------------ 48 (182) T protein:vir:10 1 MI-----EVELK---------GVNELRAKLKKLPDIMAKAT------ANAQENAIEQAEAYAVDELQS------------ 48 (182) T ss_pred Ce-----EEEEe---------cHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHh------------ Confidence 00 00000 222222222111 11111 223333333333333333321 Q ss_pred hcCCCCchhHHHHHHhhceeeeeccccccccC Q lcl|NC_021342. 166 KKGSSNPLIDTGALRQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 166 ~KG~~~PLidTG~L~~SIty~V~~k~~~~~~~ 197 (197) .-| +|||.|++||+++|..+.+...|- T Consensus 49 ----~~P-vdtG~Lr~SI~~~~~~~~~~~~g~ 75 (182) T protein:vir:10 49 ----SIK-YSTGELTRSFKHEVKVDGDEVIGR 75 (182) T ss_pred ----hCC-CCchhhhhceeeeeeecCCeEEEE Confidence 224 699999999999987766544433 No 125 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=85.20 E-value=0.0034 Score=34.08 Aligned_cols=67 Identities=21% Similarity=0.260 Sum_probs=29.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHc-cCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_021342. 107 LEPGVQSKSNEYVTIIERGASR-DESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTY 185 (197) Q Consensus 107 lr~t~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty 185 (197) |-..+ ...+++.+.++++-.. .-..+++|+..+..+++.++.. .| +|||.|++||++ T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~~~~~~~~~~~a~~v~~~ak~~--------------------aP-v~TG~L~~Si~~ 58 (137) T protein:vir:95 1 MAKVK-YGNWDLVKELENYERDMERWVKRGIAKTTAKIHNTIISL--------------------MP-VDTGYLRESVTM 58 (137) T ss_pred CchhH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--------------------CC-ccchhhhcCeee Confidence 22221 1222222222221110 1123334444444444444332 12 489999999999 Q ss_pred eeeccccccccC Q lcl|NC_021342. 186 VVHSGKLPDEGL 197 (197) Q Consensus 186 ~V~~k~~~~~~~ 197 (197) ++..+.. 
+|- T Consensus 59 ~~~~~~~--~~~ 68 (137) T protein:vir:95 59 DFKDGGF--TGV 68 (137) T ss_pred EeeCCce--EEE Confidence 9865431 121 No 126 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=84.82 E-value=0.0029 Score=34.52 Aligned_cols=71 Identities=17% Similarity=0.218 Sum_probs=33.9 Q ss_pred cccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc-cCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhc Q lcl|NC_021342. 89 KPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR-DESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKK 167 (197) Q Consensus 89 ~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~K 167 (197) |. ..++.| +.+++.+.+++..+. .-..+.+|+..+..+++.++.. +| T Consensus 1 Ma-----~~~~~~------------~~~~l~~~l~~~~~~~~~~~~~~l~~~a~~i~~~ak~~---------aP------ 48 (142) T protein:vir:94 1 MA-----GLNYRV------------NSTEFQGALRAALDRLTGAAREATEAAANDMVNMAKGL---------CP------ 48 (142) T ss_pred Cc-----eeEEEe------------cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---------CC------ Confidence 11 111111 112222222222221 1124556666666665555332 22 Q ss_pred CCCCchhHHHHHHhhceeeeeccccccccC Q lcl|NC_021342. 168 GSSNPLIDTGALRQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 168 G~~~PLidTG~L~~SIty~V~~k~~~~~~~ 197 (197) +|||.|++||+++|........+- T Consensus 49 ------v~TG~Lr~SI~~~~~~~g~~~~~~ 72 (142) T protein:vir:94 49 ------VDTGRLRSSIQAVPSGGRFSFSVT 72 (142) T ss_pred ------ccchhhhccceeeeccCCceEEEE Confidence 689999999999987665433332 No 127 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=84.80 E-value=0.013 Score=30.91 Aligned_cols=118 Identities=13% Similarity=0.080 Sum_probs=42.8 Q ss_pred CcccccHHHHHHHHHHHHHHH-h---cCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccc Q lcl|NC_021342. 
1 MMKVVGLQETLAELDKVLGQI-R---DDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESR 76 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~~-~---~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~ 76 (197) |+.++|.+.+.+.|++-+.+- + .++.+..|- ..-.. .+. ..++..---|++|.---.+...+ . T Consensus 10 ~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~g----e~v~~-~lK-~~~~~fkDTGat~dev~~s~p~~-------~ 76 (138) T protein:vir:98 10 FANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIG----KELEP-SFK-SAISIYKRTGETTESAVVSGVRR-------E 76 (138) T ss_pred cccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHH----HHHHH-HHH-hhhhhhhhccceeeeeeecCeee-------c Confidence 899999988888877632211 0 000000000 00000 000 00111112222221000000000 0 Q ss_pred cCCcccc---ccccccccc--cccccccCCCcchh--HHHHHHHHHHHHHHHHHHHHHccCcH Q lcl|NC_021342. 77 KEVRFLK---TGTGFKPLG--VTKPHKINIPARPW--LEPGVQSKSNEYVTIIERGASRDEST 132 (197) Q Consensus 77 ~~~~f~k---~~~g~~~~~--~~~~~~v~IP~RpF--lr~t~~~~~~~~~~~~~~~~~~~~~~ 132 (197) .++|-++ .++.|...| +++ +...|=||-| ++.+++..+..+...++.-++..++. T Consensus 77 ~G~r~V~igW~GpR~~ivHLNE~G-yGk~i~PrG~G~I~ka~~~se~~y~~~vk~el~k~l~~ 138 (138) T protein:vir:98 77 DGIPKVKLGFTTPRWNIVHLQELE-YGWKHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFDG 138 (138) T ss_pred CCceEEEEeeecCeeeEEeeeccc-ccCCcCCCcchHHHHHHHhhhHHHHHHHHHHHHHHhcC Confidence 0000000 001111111 111 1223445544 78888888888877776555554444 No 128 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=84.14 E-value=0.0052 Score=33.10 Aligned_cols=65 Identities=18% Similarity=0.133 Sum_probs=30.9 Q ss_pred ccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCC Q lcl|NC_021342. 92 GVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSN 171 (197) Q Consensus 92 ~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~ 171 (197) |++....+ +...+.+++.++ .+++|+.++..+++..|. 
|+| T Consensus 1 ~~~~~~~l--------------~~~~l~~~~~~~------~~~~~~~~a~~ve~~ak~---------~aP---------- 41 (137) T protein:vir:10 1 MVAHTLRI--------------ERAQLHGLGMDE------ARKAVNRVVRRTFTRSQI---------LAP---------- 41 (137) T ss_pred Cccccccc--------------ChhhHhhHHHHH------HHHHHHHHHHHHHHHHHh---------cCC---------- Confidence 11111111 112222222222 233555555555555433 222 Q ss_pred chhHHHHHHhhceeeeeccccc-cccC Q lcl|NC_021342. 172 PLIDTGALRQSVTYVVHSGKLP-DEGL 197 (197) Q Consensus 172 PLidTG~L~~SIty~V~~k~~~-~~~~ 197 (197) +|||+|++||++.+.+.++. .++. T Consensus 42 --v~TG~Lr~SI~~~~~~~~g~~v~~~ 66 (137) T protein:vir:10 42 --VDTGYLRASGRLVLGRERGAVVIGS 66 (137) T ss_pred --cCchhhhccceeeeeeccccEEEEE Confidence 79999999999998754322 2222 No 129 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=83.63 E-value=0.013 Score=30.92 Aligned_cols=121 Identities=12% Similarity=0.089 Sum_probs=43.7 Q ss_pred CcccccHHHHHHHHHHHHHH--Hh--cCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccc Q lcl|NC_021342. 1 MMKVVGLQETLAELDKVLGQ--IR--DDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESR 76 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~--~~--~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~ 76 (197) |+.++|.+.+.+.|++.+.+ +. .++.+..|= ..-.. .+. ..++..--.|++|.---.+.-.+ ..+. T Consensus 4 ~aevkGv~Eilk~lE~klG~~~v~ri~nkAL~~~g----e~v~~-~lK-~~~~~f~DTG~t~dev~~s~~~~----~~G~ 73 (132) T protein:vir:96 4 FANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIG----KELEP-SFK-SAISIYKRTGETTESAVVSGVRR----EDGI 73 (132) T ss_pred cccccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHH----HHHHH-HHH-HhhhhhhhcchhhcceeecCeee----cCCc Confidence 99999999998888873332 10 011111110 00000 000 00111112222221000000000 0000 Q ss_pred cCCccccccccccccc--cccccccCCCcchh--HHHHHHHHHHHHHHHHHHHHHccCcH Q lcl|NC_021342. 
77 KEVRFLKTGTGFKPLG--VTKPHKINIPARPW--LEPGVQSKSNEYVTIIERGASRDEST 132 (197) Q Consensus 77 ~~~~f~k~~~g~~~~~--~~~~~~v~IP~RpF--lr~t~~~~~~~~~~~~~~~~~~~~~~ 132 (197) +-+...-++..|...| +++ +-..|=||-| ++.+++..+..+.+.++.-++..++. T Consensus 74 r~V~VgW~GpR~~ivHLNE~G-yGk~~~PrG~G~I~~a~~~se~~~~~~~~~elkk~l~~ 132 (132) T protein:vir:96 74 PKVKLGFTTPRWNIVHLQELE-YGWKHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFDG 132 (132) T ss_pred eEEEecccCCceeEEeeeccc-ccCCcCCCcchHHHHHHHhhhhHHHHHHHHHHHHHhcC Confidence 0000000111222221 111 1333445554 78888888866666555544444443 No 130 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=83.50 E-value=0.0048 Score=33.31 Aligned_cols=49 Identities=22% Similarity=0.226 Sum_probs=25.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeeeccccc------cccC Q lcl|NC_021342. 132 TTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTYVVHSGKLP------DEGL 197 (197) Q Consensus 132 ~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty~V~~k~~~------~~~~ 197 (197) .+.++.+.-..+...|++.+.. ++| +|||.|++||++++...... .++- T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~-----~aP------------v~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA 55 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIIS-----LMP------------VDTGYLRESVTMDFKDGGFTGVINIGSEYA 55 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHH-----hCC------------cCcccccccceEEeecCcEEEEEecCCCcc Confidence 3434333333444444444433 222 58999999999998654311 0110 No 131 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=83.50 E-value=0.0048 Score=33.31 Aligned_cols=49 Identities=22% Similarity=0.226 Sum_probs=25.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeeeccccc------cccC Q lcl|NC_021342. 
132 TTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTYVVHSGKLP------DEGL 197 (197) Q Consensus 132 ~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty~V~~k~~~------~~~~ 197 (197) .+.++.+.-..+...|++.+.. ++| +|||.|++||++++...... .++- T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~-----~aP------------v~TG~Lr~SI~~~~~~~~~~~~V~~~~~YA 55 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIIS-----LMP------------VDTGYLRESVTMDFKDGGFTGVINIGSEYA 55 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHH-----hCC------------cCcccccccceEEeecCcEEEEEecCCCcc Confidence 3434333333444444444433 222 58999999999998654311 0110 No 132 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=83.01 E-value=0.0033 Score=34.17 Aligned_cols=69 Identities=16% Similarity=0.179 Sum_probs=31.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHc-cCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_021342. 107 LEPGVQSKSNEYVTIIERGASR-DESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTY 185 (197) Q Consensus 107 lr~t~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty 185 (197) |-..+ ...+++.+.++++-+. .-..+++|...+..+++.+|... | +|||.|++||++ T Consensus 1 Ma~~~-~G~~~l~~~l~~~~~~~~~~~~~~l~~~a~~~~~~ak~~~--------------------p-vdTG~L~~Si~~ 58 (137) T protein:vir:96 1 MAKVK-YGNWDLVAELEDYRDEMEEWVKKGILKTTLAIYNTAVALA--------------------P-VDLGFLKESIDF 58 (137) T ss_pred CchhH-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC--------------------C-cCccchhcCcee Confidence 22111 1222222222221111 11234456666665555555331 2 589999999999 Q ss_pred eeeccccccccC Q lcl|NC_021342. 
186 VVHSGKLPDEGL 197 (197) Q Consensus 186 ~V~~k~~~~~~~ 197 (197) +|..+....+.- T Consensus 59 ~~~~~g~~~~V~ 70 (137) T protein:vir:96 59 KVTDGGFSSVIS 70 (137) T ss_pred EeecCceEEEEe Confidence 986543211100 No 133 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=82.70 E-value=0.017 Score=30.25 Aligned_cols=79 Identities=13% Similarity=0.149 Sum_probs=35.9 Q ss_pred cccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc-cCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhc Q lcl|NC_021342. 89 KPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR-DESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKK 167 (197) Q Consensus 89 ~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~K 167 (197) ++++.+-. .|--|-.. ...-+++.+.+++.-.. .-..+++|+..+..+++.++... | T Consensus 1 ~~~~~~~~------~~~~Ma~v-~~Gld~l~~~l~~~~~~~~~~~~~~l~~~a~~v~~~ak~~a---------P------ 58 (149) T protein:vir:10 1 MKLNYYDL------SRCHMAKV-KYGADSMVVELDKFDKKIEEWVKKGIAKTTTKIYNTAVALA---------P------ 58 (149) T ss_pred Ceeeeecc------chhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC---------C------ Confidence 23322221 24333221 11222333333222111 11344455666665555554321 1 Q ss_pred CCCCchhHHHHHHhhceeeeeccccccccC Q lcl|NC_021342. 168 GSSNPLIDTGALRQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 168 G~~~PLidTG~L~~SIty~V~~k~~~~~~~ 197 (197) +|||.|++||+++|..+. 
.+|- T Consensus 59 ------vdTG~L~~SI~~~~~~~g--~~~~ 80 (149) T protein:vir:10 59 ------VDLGFLEESIDFKYFDGG--LSSV 80 (149) T ss_pred ------cccchhhccceEEecCCc--EEEE Confidence 699999999999885432 2222 No 134 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=81.24 E-value=0.007 Score=32.41 Aligned_cols=104 Identities=21% Similarity=0.209 Sum_probs=38.7 Q ss_pred CcccccHHHHHHHHHHHHHHH-h-c-CCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeeccccccccc Q lcl|NC_021342. 1 MMKVVGLQETLAELDKVLGQI-R-D-DQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRK 77 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~~-~-~-~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~ 77 (197) =.....+ ++.+.|-....+= + + ...-.||+-.- -|-.+.+-|||.--.+.. T Consensus 8 rv~~~~G-~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~r----------kAPhghlvE~Ghw~~~~~--------------- 61 (119) T protein:vir:81 8 FVNDETG-KLRSNLYVAYSPEESTNGVQTYAVSWRKK----------AAPHGHLLEFGHWQTHAA--------------- 61 (119) T ss_pred ccCCCcc-chhhhheeeeccccCCCCeEEEEeeccCC----------cCCcccccccceeeeeee--------------- Confidence 0000000 1111110000000 0 0 11123333221 133445567773211100 Q ss_pred CCcccccccccc-ccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc----cCcHHH Q lcl|NC_021342. 78 EVRFLKTGTGFK-PLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR----DESTTS 134 (197) Q Consensus 78 ~~~f~k~~~g~~-~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~----~~~~~~ 134 (197) +....|.. ....-+.....||+|||||++||....+..+.+.+.+.. 
=+.+++ T Consensus 62 ----~~~~dG~w~~~~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:81 62 ----YKGKDGEWYSSSVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred ----eeccCceeeecCccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 00001100 011112335679999999999999888877766544221 111111 No 135 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=81.15 E-value=0.038 Score=28.35 Aligned_cols=91 Identities=13% Similarity=0.203 Sum_probs=40.6 Q ss_pred CcccccHHHHHHH----HHHHHHHHhc-------------CCeEEEEeecCCCC--------CCCcc--c-----hHHHH Q lcl|NC_021342. 1 MMKVVGLQETLAE----LDKVLGQIRD-------------DQYVTVGIHEAAGD--------VESGE--I-----NMATL 48 (197) Q Consensus 1 M~ki~~~~~~~~~----L~~l~~~~~~-------------~~~V~VGi~~~~~~--------~~~~~--~-----~~A~i 48 (197) |.+-...+.+.+. |++..+.+.+ -+.++-+-|...|. .++++ + +--+| T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~~~~~~~~v~~~~~~y~l 80 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVVDDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQKLKNGDQVIYQKAPTYRL 80 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccccceeeeecCCeeEEEEEecCCcce Confidence 7765555554332 2222221100 01111111111000 00000 0 00112 Q ss_pred hhHhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 49 GAVLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 49 A~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~ 128 (197) +.+.|||- . +++. --.|+||||+++.+...+++.+.+++.+.. 
T Consensus 81 ~HLLE~GH-----------------a--------~r~G------------GrV~a~phI~paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 81 THLLENGH-----------------A--------KRNG------------GRVSPKVHIAPVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred EEeeecce-----------------e--------ecCC------------ceeCcchhhhHHHHHHHHHHHHHHHHHhcC Confidence 23333431 1 1111 126999999999999999999998888877 No 136 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=80.52 E-value=0.0079 Score=32.12 Aligned_cols=104 Identities=21% Similarity=0.185 Sum_probs=38.5 Q ss_pred CcccccHHHHHHHHHHHHHHH-h-c-CCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeeccccccccc Q lcl|NC_021342. 1 MMKVVGLQETLAELDKVLGQI-R-D-DQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRK 77 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~~-~-~-~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~ 77 (197) =.....+ ++.+.|-....+= + + ...-.||+-.- -|-.+.+-|||.--.+..+ T Consensus 8 rv~~~~G-~Lr~sIY~ay~~~~S~dG~~~Y~Vswn~r----------kAPhghlvE~Ghw~~~~~~-------------- 62 (119) T protein:vir:10 8 FVNDETG-KLRSNLYVAYSTEESTNGVQTYAVSWRKK----------AAPHGHLLEFGHWQTHAAY-------------- 62 (119) T ss_pred ccCCCcc-chhhhheeeeccccCCCCEEEEEeecCCC----------cCCcccccccceeeeeeee-------------- Confidence 0000010 1111110000000 0 1 11123343221 1334455677732111000 Q ss_pred CCccccccccccc-cccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHc----cCcHHH Q lcl|NC_021342. 78 EVRFLKTGTGFKP-LGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASR----DESTTS 134 (197) Q Consensus 78 ~~~f~k~~~g~~~-~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~----~~~~~~ 134 (197) ....|... ...-+.-...||+|||||++||....+..+.+.+.+.. 
=+.+++ T Consensus 63 -----~~~dG~w~~~~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg~~ 119 (119) T protein:vir:10 63 -----KGKDGEWYSSSVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRGEQ 119 (119) T ss_pred -----eccCceeeecCccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 00001000 00112335679999999999999888877766544221 111111 No 137 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=80.50 E-value=0.007 Score=32.41 Aligned_cols=67 Identities=22% Similarity=0.293 Sum_probs=30.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHc-cCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_021342. 107 LEPGVQSKSNEYVTIIERGASR-DESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTY 185 (197) Q Consensus 107 lr~t~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty 185 (197) |-.. ....+++.+.+++.... .-..+++|+..+..+++.++.. .| +|||.|++||++ T Consensus 1 Ma~~-~~G~~~l~~~L~~~~~~~~~~~~~al~~~a~~v~~~ak~~--------------------aP-vdTG~Lr~SI~~ 58 (137) T protein:vir:94 1 MAKV-KYGNWDLVKELENYERDIERWVKRGIAKTTVKIHNTIISL--------------------MP-VDTGYLRESVTM 58 (137) T ss_pred Cchh-HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--------------------CC-cCcchhhcCcee Confidence 2111 01222222222222111 1123345555555555555432 12 489999999999 Q ss_pred eeeccccccccC Q lcl|NC_021342. 186 VVHSGKLPDEGL 197 (197) Q Consensus 186 ~V~~k~~~~~~~ 197 (197) ++..+.. +|. 
T Consensus 59 ~~~~~~~--~~~ 68 (137) T protein:vir:94 59 DFKDGGF--TGV 68 (137) T ss_pred EeecCcE--EEE Confidence 9865431 122 No 138 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=80.08 E-value=0.0091 Score=31.77 Aligned_cols=70 Identities=13% Similarity=0.043 Sum_probs=31.4 Q ss_pred CcchhHH---HHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHH Q lcl|NC_021342. 102 PARPWLE---PGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGA 178 (197) Q Consensus 102 P~RpFlr---~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~ 178 (197) =+|..++ .++++-..++.+.-++ +.. ..+++|...+..+++.++.. +| +|||. T Consensus 1 m~~ms~~i~~~g~~~l~~~l~~~~~~-~~~--~v~~~l~~~a~~i~~~ak~~---------ap------------v~TG~ 56 (144) T protein:vir:59 1 MALMSVRIDPSWRRIMSRNVRTFSGH-VLT--QVEQVIIKTAEKIAGLAASL---------AP------------VDEGN 56 (144) T ss_pred CCcceeeehhHHHHHHHHHHHHHHHH-HHH--HHHHHHHHHHHHHHHHHHHh---------CC------------ccchh Confidence 1122221 1222222222111111 111 13446666666655555532 22 58999 Q ss_pred HHhhceeeeeccccccccC Q lcl|NC_021342. 179 LRQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 179 L~~SIty~V~~k~~~~~~~ 197 (197) |++||++++.... .+|. 
T Consensus 57 Lr~SI~~~~~~~g--~~~~ 73 (144) T protein:vir:59 57 LKNSIQIDYKNNG--LTAE 73 (144) T ss_pred hhcCeeEEeecCc--EEEE Confidence 9999999884432 2222 No 139 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=79.85 E-value=0.017 Score=30.35 Aligned_cols=67 Identities=13% Similarity=0.063 Sum_probs=32.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHc-cCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_021342. 107 LEPGVQSKSNEYVTIIERGASR-DESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTY 185 (197) Q Consensus 107 lr~t~~~~~~~~~~~~~~~~~~-~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty 185 (197) |. -+.-+++.+.+++.-+. .--.+.+|...+..+++.++. |+| +|||.|++||.+ T Consensus 1 i~---i~Gld~L~~~L~~l~~~~~~~~~~a~~~~a~~i~~~ak~---------~aP------------v~TG~Lr~sI~~ 56 (173) T protein:vir:10 1 MA---VKGVAEVIAELRKIGKDIDKNINATTEEAANFIEDRAKT---------LAP------------KNFGKLAQSIST 56 (173) T ss_pred Cc---chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------hCC------------cCchhhhhccee Confidence 11 11223333333332111 112344555555555554444 222 799999999999 Q ss_pred eeeccccccccC Q lcl|NC_021342. 186 VVHSGKLPDEGL 197 (197) Q Consensus 186 ~V~~k~~~~~~~ 197 (197) .+.+++++..+- T Consensus 57 ~~~~~~~~~~~~ 68 (173) T protein:vir:10 57 SDLKAKDLISKK 68 (173) T ss_pred eeeccCceeEEe Confidence 876655432221 No 140 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=78.72 E-value=0.0084 Score=31.96 Aligned_cols=108 Identities=8% Similarity=-0.022 Sum_probs=41.4 Q ss_pred Ccccc--cHHHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccC Q lcl|NC_021342. 
1 MMKVV--GLQETLAELDKVLGQIRDDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKE 78 (197) Q Consensus 1 M~ki~--~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~ 78 (197) =.+++ |.+.+...|++...+ ....-+= ....+++- -....|...+.=|. ..+... T Consensus 25 ~~ki~kaGA~v~~e~L~~~tp~----~~~~~~~-~~~~~~Hl--aD~I~~s~~~~dg~----------------~~g~~~ 81 (139) T protein:vir:10 25 QEKITKAGADVYAKKLAETTKE----KHPNTKG-DGGKYGHL--SEDIRSAAGDIDGD----------------HNGSST 81 (139) T ss_pred HHHHHHHHHHHHHHHHHHhccc----ccCcCCC-CCCCCcch--hhcceecCcccccc----------------cceeee Confidence 22222 444445555444321 0000000 00000000 00000100110000 000000 Q ss_pred CccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHH----HHc-cCcHHH Q lcl|NC_021342. 79 VRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERG----ASR-DESTTS 134 (197) Q Consensus 79 ~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~----~~~-~~~~~~ 134 (197) +-| ++.+...+|..-.|+.+|+.||+..|.++.++++.+.+.+. |.. ..+.+. T Consensus 82 VG~---~k~~~~A~f~n~GT~k~~~~hFie~t~~e~~~evl~a~~~~~k~~l~~~~~~~~~ 139 (139) T protein:vir:10 82 VGF---HNKAHIARFLNDGTKYIRADHFVDNARDDAKDAVFAAEAEKYQAMIAKANGGGDK 139 (139) T ss_pred eCC---CCCcceEeecccCccccCCCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCC Confidence 000 01122334444558999999999999999999888776544 432 122222 No 141 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=77.40 E-value=0.015 Score=30.62 Aligned_cols=66 Identities=14% Similarity=-0.015 Sum_probs=31.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeee Q lcl|NC_021342. 108 EPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTYVV 187 (197) Q Consensus 108 r~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty~V 187 (197) -.++++-.+++.++..++ . -..+++|...+..++.++|. 
++| +|||.|++||++.+ T Consensus 1 i~Gld~l~~~l~~~~~~~-~--~~v~~al~~~a~~i~~~ak~---------~aP------------v~TG~Lr~sI~~~~ 56 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSV-R--IAVDKELSKSAARIERQAKI---------LAP------------VDTGWLRAQIYSEQ 56 (108) T ss_pred CchHHHHHHHHHHHHHHH-H--HHHHHHHHHHHHHHHHHHHh---------cCC------------cCchhhhcceeeee Confidence 224555444443332221 1 11344566555555555443 222 69999999999876 Q ss_pred ecccccc-----ccC Q lcl|NC_021342. 188 HSGKLPD-----EGL 197 (197) Q Consensus 188 ~~k~~~~-----~~~ 197 (197) .+..... +|- T Consensus 57 ~~~~~~~v~~~~~Ya 71 (108) T protein:vir:99 57 QRLLHYRVVSPALYS 71 (108) T ss_pred cCcEEEEeecCcccc Confidence 4321110 000 No 142 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=77.13 E-value=0.012 Score=31.08 Aligned_cols=47 Identities=26% Similarity=0.302 Sum_probs=25.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhceeeeeccccccccC Q lcl|NC_021342. 132 TTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 132 ~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty~V~~k~~~~~~~ 197 (197) .++++.+.-..+...|+..+.. + .| +|||.|++||++++.... 
..|- T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~-----~-----------ap-v~TG~Lr~SI~~~~~~~~--~~~~ 47 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIIS-----L-----------MP-VDTGYLRESVTMDFKDGG--FTGV 47 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHh-----h-----------CC-ccccccccceeEEeecCc--EEEE Confidence 4444444434444444444432 1 23 489999999999986543 1111 No 143 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=76.26 E-value=0.0089 Score=31.83 Aligned_cols=65 Identities=22% Similarity=0.255 Sum_probs=34.3 Q ss_pred HHH---HHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhc Q lcl|NC_021342. 107 LEP---GVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSV 183 (197) Q Consensus 107 lr~---t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SI 183 (197) |-. ++++-.+++.++- +.+. -..+.+|+..+..+++.+|... | +|||.|++|| T Consensus 1 Ma~~~~Gl~~l~~~l~~~~-~~~~--~~~~~al~~~a~~i~~~ak~~a---------P------------vdTG~Lr~SI 56 (137) T protein:vir:10 1 MAKVKYGNWELVKELEDFE-KETI--RWAKKGIAKTTTIIHNSIVSNM---------P------------VDTGYLRESV 56 (137) T ss_pred CchhHhhHHHHHHHHHHHH-HHHH--HHHHHHHHHHHHHHHHHHHHhC---------C------------cCcchhhcCe Confidence 211 3333333332211 1111 2345577777777777766642 2 5999999999 Q ss_pred eeeeeccccccccC Q lcl|NC_021342. 184 TYVVHSGKLPDEGL 197 (197) Q Consensus 184 ty~V~~k~~~~~~~ 197 (197) ++++..+.. ++. 
T Consensus 57 ~~~~~~~~~--~~~ 68 (137) T protein:vir:10 57 SMDFKKGGL--TGV 68 (137) T ss_pred eEEeeCCcE--EEE Confidence 998754332 111 No 144 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=75.38 E-value=0.022 Score=29.62 Aligned_cols=108 Identities=14% Similarity=0.197 Sum_probs=55.7 Q ss_pred Cc--c--cccHHHHHHHHHHHHH-----------------H----HhcCCeEEEEee-----cCCCCCCCc-----cchH Q lcl|NC_021342. 1 MM--K--VVGLQETLAELDKVLG-----------------Q----IRDDQYVTVGIH-----EAAGDVESG-----EINM 45 (197) Q Consensus 1 M~--k--i~~~~~~~~~L~~l~~-----------------~----~~~~~~V~VGi~-----~~~~~~~~~-----~~~~ 45 (197) |. . ++|.+++.+.|.+... + +....-|.-|-+ .+..+.+++ -.++ T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVdTG~Lr~S~~~~~~~~~~~~~~~~V~n~ 80 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVKQGNLRRSWTAEGPTYGCGGWTIKLINN 80 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCCcchhccceeecceeeecCeeEEEEecC Confidence 33 2 2355555555544321 1 111111222222 111111222 2478 Q ss_pred HHHhhHhHcCceeeeCCCceeeecccccccccCCcccc-ccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 46 ATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEVRFLK-TGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIER 124 (197) Q Consensus 46 A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k-~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~ 124 (197) +.||.+-|||...+ ++ +|+. .+.+ ...--+|-++||+.++++.+..+.+.+++ T Consensus 81 ~~YA~~VE~Ghr~~-~G-----------------~~v~~~~~~--------~~~g~V~G~~~~~~a~~~~~~~~~~~l~k 134 (144) T protein:vir:10 81 AEYASYVESGHRQT-PG-----------------RYVPVLKKR--------LVRDWVPGQFYMKKSIPQIQRQLPQLVTE 134 (144) T ss_pred CCcccccccceeec-CC-----------------cccccCCCc--------cccceecCccchHHHHHHHHHHHHHHHHH Confidence 89999999995432 11 1111 1110 11223688999999999999999999988 Q ss_pred HHHccCcHHH Q lcl|NC_021342. 
125 GASRDESTTS 134 (197) Q Consensus 125 ~~~~~~~~~~ 134 (197) .+..-.|.-. T Consensus 135 ~l~~l~d~~~ 144 (144) T protein:vir:10 135 GLWGLKDLFE 144 (144) T ss_pred HHHHHhhhcC Confidence 8876444333 No 145 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=75.16 E-value=0.0081 Score=32.04 Aligned_cols=67 Identities=19% Similarity=0.215 Sum_probs=34.1 Q ss_pred HHH---HHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhc Q lcl|NC_021342. 107 LEP---GVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSV 183 (197) Q Consensus 107 lr~---t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SI 183 (197) |-. ++++-.+++.++- +.+. -..+++|+..+..+++.+|... | +|||.|++|| T Consensus 1 Ma~~~~G~~~l~~~l~~~~-~~~~--~~~~~al~~~a~~i~~~ak~~a---------P------------v~TG~Lr~SI 56 (137) T protein:vir:10 1 MAKVKYGNWDLVKELEEFE-KETI--RWAKKGIAKTTTIIHNSIVSNM---------P------------VDTGYLRESV 56 (137) T ss_pred CccchhCHHHHHHHHHHHH-HHHH--HHHHHHHHHHHHHHHHHHHHhC---------C------------cCcchhhcCe Confidence 211 2232222222211 1111 1345677777777777666542 2 5899999999 Q ss_pred eeeeeccccccc-cC Q lcl|NC_021342. 184 TYVVHSGKLPDE-GL 197 (197) Q Consensus 184 ty~V~~k~~~~~-~~ 197 (197) ++++..+....+ |- T Consensus 57 ~~~~~~~~~~~~V~~ 71 (137) T protein:vir:10 57 SMDFKKGGLTGVINI 71 (137) T ss_pred eeEecCCcEEEEEec Confidence 998854431111 00 No 146 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=72.03 E-value=0.043 Score=28.06 Aligned_cols=93 Identities=16% Similarity=0.231 Sum_probs=44.8 Q ss_pred CcccccHHHH----HHHHHHHHHHHhc-------------CCeEEEEeecCC-----CCC---------CCccc---hHH Q lcl|NC_021342. 
1 MMKVVGLQET----LAELDKVLGQIRD-------------DQYVTVGIHEAA-----GDV---------ESGEI---NMA 46 (197) Q Consensus 1 M~ki~~~~~~----~~~L~~l~~~~~~-------------~~~V~VGi~~~~-----~~~---------~~~~~---~~A 46 (197) |.+|+ .+.+ .+.|++..+.+.. ...|+-.|.+.+ .|. ++..+ +-- T Consensus 1 M~~i~-id~La~~I~~~L~~y~~~v~~~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~~~~~v~nk~~y 79 (127) T protein:vir:80 1 MANIK-IDRLGDEITRQLKRYSQVIAGDLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTPGGWVIHNKTEY 79 (127) T ss_pred Ccccc-HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeeccCceeEeecCCc Confidence 77765 2332 3333222221100 001111111111 000 00000 111 Q ss_pred HHhhHhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 47 TLGAVLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGA 126 (197) Q Consensus 47 ~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~ 126 (197) +++.+.||| ++.+.|. ..++||||+|+.+....++.+.+++++ T Consensus 80 qLtHLLE~G-----------------HAkr~GG--------------------RV~a~pHI~paee~~~~~l~~~i~~~l 122 (127) T protein:vir:80 80 RLAHLLEYG-----------------HATVDGG--------------------RVPETPHIRPVEDWLEKEFEDRVERAI 122 (127) T ss_pred ceeehhhcc-----------------eeccCCc--------------------ccCCccchhhHHHHHHHHHHHHHHHHh Confidence 234444554 2222222 268999999999999999999999999 Q ss_pred HccCc Q lcl|NC_021342. 127 SRDES 131 (197) Q Consensus 127 ~~~~~ 131 (197) .++.- T Consensus 123 ~~~~~ 127 (127) T protein:vir:80 123 KNESR 127 (127) T ss_pred cCCCC Confidence 88666 No 147 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=71.28 E-value=0.061 Score=27.24 Aligned_cols=90 Identities=19% Similarity=0.231 Sum_probs=41.5 Q ss_pred CcccccHHHH----HHHHHHHHHHHhc-------------CCeEEEEeecCC-----CCC---------CCccc---hHH Q lcl|NC_021342. 
1 MMKVVGLQET----LAELDKVLGQIRD-------------DQYVTVGIHEAA-----GDV---------ESGEI---NMA 46 (197) Q Consensus 1 M~ki~~~~~~----~~~L~~l~~~~~~-------------~~~V~VGi~~~~-----~~~---------~~~~~---~~A 46 (197) |.+|+- +.+ .+.|++..+.+.+ -..|+-.|.+.+ .|. ++..+ +-- T Consensus 1 M~~i~i-d~La~~I~~~L~~Ys~~v~~~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~e~~~V~nk~~y 79 (124) T protein:vir:95 1 MAKIKI-GRLADEITSQLRKYSQVIADDVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVPNGWVIHNKTEY 79 (124) T ss_pred CccccH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeecCceeEEEcCCC Confidence 888763 333 3333222221100 000111111100 000 00000 001 Q ss_pred HHhhHhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 47 TLGAVLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGA 126 (197) Q Consensus 47 ~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~ 126 (197) +++.+.||| ++.+.|. ..++||||+|+.+....++.+.+++++ T Consensus 80 qLtHLLE~G-----------------HAkr~GG--------------------RV~a~pHI~paee~~~~~l~~~i~~~l 122 (124) T protein:vir:95 80 RLAHLLEYG-----------------HATVDGG--------------------RVPGTPHIRPIEDWLEKEFEDRVEKAI 122 (124) T ss_pred ceeeeeecc-----------------eeccCCc--------------------ccCCccchhHHHHHHHHHHHHHHHHHh Confidence 223333333 2222222 268999999999999999999999988 Q ss_pred Hc Q lcl|NC_021342. 127 SR 128 (197) Q Consensus 127 ~~ 128 (197) .+ T Consensus 123 ~~ 124 (124) T protein:vir:95 123 KQ 124 (124) T ss_pred cC Confidence 88 No 148 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=69.43 E-value=0.02 Score=29.91 Aligned_cols=70 Identities=11% Similarity=0.155 Sum_probs=32.0 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_021342. 
107 LE-PGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTY 185 (197) Q Consensus 107 lr-~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty 185 (197) |. .++++-.+.+.++-. .+. -..+.++..-|..++..++..-. ..++.| +|||.|++||++ T Consensus 1 i~~~Gld~l~~~l~~~~~-~~~--~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~~TG~Lr~sI~~ 62 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKT-NID--DDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-YWTGNLSRNIRY 62 (115) T ss_pred CcchhHHHHHHHHHHHHH-HHH--HHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-CCchhhhhccee Confidence 11 122332222222111 111 11345666666666665554321 123333 799999999998 Q ss_pred eeeccccccccC Q lcl|NC_021342. 186 VVHSGKLPDEGL 197 (197) Q Consensus 186 ~V~~k~~~~~~~ 197 (197) +..++ -++. T Consensus 63 ~~~g~---~~~~ 71 (115) T protein:vir:96 63 KKTGD---LQYT 71 (115) T ss_pred eecCc---eEEE Confidence 75332 1221 No 149 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=69.43 E-value=0.02 Score=29.91 Aligned_cols=70 Identities=11% Similarity=0.155 Sum_probs=32.0 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_021342. 107 LE-PGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTY 185 (197) Q Consensus 107 lr-~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty 185 (197) |. .++++-.+.+.++-. .+. -..+.++..-|..++..++..-. 
..++.| +|||.|++||++ T Consensus 1 i~~~Gld~l~~~l~~~~~-~~~--~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~~TG~Lr~sI~~ 62 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKT-NID--DDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-YWTGNLSRNIRY 62 (115) T ss_pred CcchhHHHHHHHHHHHHH-HHH--HHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-CCchhhhhccee Confidence 11 122332222222111 111 11345666666666665554321 123333 799999999998 Q ss_pred eeeccccccccC Q lcl|NC_021342. 186 VVHSGKLPDEGL 197 (197) Q Consensus 186 ~V~~k~~~~~~~ 197 (197) +..++ -++. T Consensus 63 ~~~g~---~~~~ 71 (115) T protein:vir:10 63 KKTGD---LQYT 71 (115) T ss_pred eecCc---eEEE Confidence 75332 1221 No 150 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=69.43 E-value=0.02 Score=29.91 Aligned_cols=70 Identities=11% Similarity=0.155 Sum_probs=32.0 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_021342. 107 LE-PGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTY 185 (197) Q Consensus 107 lr-~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty 185 (197) |. .++++-.+.+.++-. .+. -..+.++..-|..++..++..-. ..++.| +|||.|++||++ T Consensus 1 i~~~Gld~l~~~l~~~~~-~~~--~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~~TG~Lr~sI~~ 62 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKT-NID--DDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-YWTGNLSRNIRY 62 (115) T ss_pred CcchhHHHHHHHHHHHHH-HHH--HHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-CCchhhhhccee Confidence 11 122332222222111 111 11345666666666665554321 123333 799999999998 Q ss_pred eeeccccccccC Q lcl|NC_021342. 186 VVHSGKLPDEGL 197 (197) Q Consensus 186 ~V~~k~~~~~~~ 197 (197) +..++ -++. 
T Consensus 63 ~~~g~---~~~~ 71 (115) T protein:vir:96 63 KKTGD---LQYT 71 (115) T ss_pred eecCc---eEEE Confidence 75332 1221 No 151 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=69.43 E-value=0.02 Score=29.91 Aligned_cols=70 Identities=11% Similarity=0.155 Sum_probs=32.0 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_021342. 107 LE-PGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTY 185 (197) Q Consensus 107 lr-~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty 185 (197) |. .++++-.+.+.++-. .+. -..+.++..-|..++..++..-. ..++.| +|||.|++||++ T Consensus 1 i~~~Gld~l~~~l~~~~~-~~~--~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~~TG~Lr~sI~~ 62 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKT-NID--DDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-YWTGNLSRNIRY 62 (115) T ss_pred CcchhHHHHHHHHHHHHH-HHH--HHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-CCchhhhhccee Confidence 11 122332222222111 111 11345666666666665554321 123333 799999999998 Q ss_pred eeeccccccccC Q lcl|NC_021342. 186 VVHSGKLPDEGL 197 (197) Q Consensus 186 ~V~~k~~~~~~~ 197 (197) +..++ -++. T Consensus 63 ~~~g~---~~~~ 71 (115) T protein:vir:93 63 KKTGD---LQYT 71 (115) T ss_pred eecCc---eEEE Confidence 75332 1221 No 152 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=69.43 E-value=0.02 Score=29.91 Aligned_cols=70 Identities=11% Similarity=0.155 Sum_probs=32.0 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_021342. 
107 LE-PGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTY 185 (197) Q Consensus 107 lr-~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty 185 (197) |. .++++-.+.+.++-. .+. -..+.++..-|..++..++..-. ..++.| +|||.|++||++ T Consensus 1 i~~~Gld~l~~~l~~~~~-~~~--~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~~TG~Lr~sI~~ 62 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKT-NID--DDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-YWTGNLSRNIRY 62 (115) T ss_pred CcchhHHHHHHHHHHHHH-HHH--HHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-CCchhhhhccee Confidence 11 122332222222111 111 11345666666666665554321 123333 799999999998 Q ss_pred eeeccccccccC Q lcl|NC_021342. 186 VVHSGKLPDEGL 197 (197) Q Consensus 186 ~V~~k~~~~~~~ 197 (197) +..++ -++. T Consensus 63 ~~~g~---~~~~ 71 (115) T protein:vir:78 63 KKTGD---LQYT 71 (115) T ss_pred eecCc---eEEE Confidence 75332 1221 No 153 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=69.43 E-value=0.02 Score=29.91 Aligned_cols=70 Identities=11% Similarity=0.155 Sum_probs=32.0 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHHhhcee Q lcl|NC_021342. 107 LE-PGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALRQSVTY 185 (197) Q Consensus 107 lr-~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~~SIty 185 (197) |. .++++-.+.+.++-. .+. -..+.++..-|..++..++..-. ..++.| +|||.|++||++ T Consensus 1 i~~~Gld~l~~~l~~~~~-~~~--~~v~~a~~~~~~~i~~~a~~~a~--------------~~~~~p-~~TG~Lr~sI~~ 62 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKT-NID--DDVDDILQENAKEYVVRAKLKAR--------------EVMNKG-YWTGNLSRNIRY 62 (115) T ss_pred CcchhHHHHHHHHHHHHH-HHH--HHHHHHHHHHHHHHHHHHHHhcc--------------ccCCCC-CCchhhhhccee Confidence 11 122332222222111 111 11345666666666665554321 123333 799999999998 Q ss_pred eeeccccccccC Q lcl|NC_021342. 
186 VVHSGKLPDEGL 197 (197) Q Consensus 186 ~V~~k~~~~~~~ 197 (197) +..++ -++. T Consensus 63 ~~~g~---~~~~ 71 (115) T protein:vir:97 63 KKTGD---LQYT 71 (115) T ss_pred eecCc---eEEE Confidence 75332 1221 No 154 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=69.19 E-value=0.016 Score=30.48 Aligned_cols=84 Identities=18% Similarity=0.141 Sum_probs=36.5 Q ss_pred CcccccHHHHHHHHHHHHHHH--hcC------------------------CeEEEEeecCCCCCCCccchHHHHhhHhHc Q lcl|NC_021342. 1 MMKVVGLQETLAELDKVLGQI--RDD------------------------QYVTVGIHEAAGDVESGEINMATLGAVLNF 54 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~~--~~~------------------------~~V~VGi~~~~~~~~~~~~~~A~iA~~~Ef 54 (197) +.+ -|.+-+.+.|++...+- ..+ ..+.|||... +.+.+|.+.++ T Consensus 29 atk-AGA~v~~~~L~~~tp~~h~~~~~t~~~~HlaD~I~~~~~~iDg~~~g~s~VG~~kk---------~~a~~A~f~n~ 98 (140) T protein:vir:48 29 ITT-AGAKVFKEELAEVTRQKHYSNKKHLKYGHMADGLSVQSTNVDGRKNGVSTVGWVNR---------YHAQNARRLND 98 (140) T ss_pred HHH-HHHHHHHHHHHHhccccCCCCCCCCCCCcchhceeecccccccccCceeeeccCCC---------cceeeeecccc Confidence 222 24444455554442210 000 0122232100 12334444444 Q ss_pred CceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHH--HHHHHHHHHHHHHccCcH Q lcl|NC_021342. 55 GAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSK--SNEYVTIIERGASRDEST 132 (197) Q Consensus 55 Ga~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~--~~~~~~~~~~~~~~~~~~ 132 (197) | |+.+|+-||+..+.++. +.++.+.....+.. T Consensus 99 G------------------------------------------T~k~~~~hFve~~~~e~~~k~~vl~A~~~~~~~---- 132 (140) T protein:vir:48 99 G------------------------------------------TKKYRADHFVTNVQNDSAVQTKVLLAEKEEYEK---- 132 (140) T ss_pred C------------------------------------------ccccCCCchhHHHHHhhhhHHHHHHHHHHHHHH---- Confidence 4 78899999999999876 45565544333221 Q ss_pred HHHHHHHHHH Q lcl|NC_021342. 
133 TSILEKVGVT 142 (197) Q Consensus 133 ~~~l~~iG~~ 142 (197) .|++-|.. T Consensus 133 --~l~~~~~~ 140 (140) T protein:vir:48 133 --LIRKKGGE 140 (140) T ss_pred --HHHhhcCC Confidence 11111111 No 155 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=69.14 E-value=0.023 Score=29.54 Aligned_cols=80 Identities=15% Similarity=0.116 Sum_probs=37.0 Q ss_pred Cccc--ccHHHHHHHHHHHHHHH--hc------------------------CCeEEEEeecCCCCCCCccchHHHHhhHh Q lcl|NC_021342. 1 MMKV--VGLQETLAELDKVLGQI--RD------------------------DQYVTVGIHEAAGDVESGEINMATLGAVL 52 (197) Q Consensus 1 M~ki--~~~~~~~~~L~~l~~~~--~~------------------------~~~V~VGi~~~~~~~~~~~~~~A~iA~~~ 52 (197) =.++ -|.+-+.+.|++...+- .. ...+.|||... .-+.+|.+. T Consensus 26 ~~katkAGA~v~~~~L~~~tp~~hy~~~~~~~~~HlaD~I~~~~~~~DG~~dg~s~VG~~~~---------~~~~~A~f~ 96 (141) T protein:vir:50 26 QVEITTAGAKVFKKELEEVTREKHYSRKKNPKFGHMADGLAIQSTNADGRKNGVSTVGWKNN---------YHAQNARRL 96 (141) T ss_pred HHHHHHHHHHHHHHHHHHhcccCCCCCCCCCCCCccccceeeccCccccccCCeeeeccCCC---------ccceeeecc Confidence 1111 24444455554443210 00 01122333100 013344444 Q ss_pred HcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHH--HHHHHHHH----HHHH Q lcl|NC_021342. 53 NFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSK--SNEYVTII----ERGA 126 (197) Q Consensus 53 EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~--~~~~~~~~----~~~~ 126 (197) +|| |+.+|+-||+..+.++. +.++.+.+ +++| T Consensus 97 n~G------------------------------------------T~k~~~~hFve~~~~~a~~k~~Vl~A~~~~~k~~l 134 (141) T protein:vir:50 97 NDG------------------------------------------TKKYRADHFVTNVQNDSTVQKKVLLEKKRNTKNSL 134 (141) T ss_pred ccC------------------------------------------ccccCCCchhHHHHHhhhhHHHHHHHHHHHHHHHH Confidence 444 78899999999999764 45555543 3444 Q ss_pred H--ccCc Q lcl|NC_021342. 
127 S--RDES 131 (197) Q Consensus 127 ~--~~~~ 131 (197) . ++-| T Consensus 135 ~~~~~~~ 141 (141) T protein:vir:50 135 EEKEGCD 141 (141) T ss_pred HhccCCC Confidence 3 3334 No 156 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=66.96 E-value=0.02 Score=29.92 Aligned_cols=100 Identities=14% Similarity=0.179 Sum_probs=42.6 Q ss_pred Ccccc--cHHHHHHHHHHHHHHH--hcCC------eEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecc Q lcl|NC_021342. 1 MMKVV--GLQETLAELDKVLGQI--RDDQ------YVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYAT 70 (197) Q Consensus 1 M~ki~--~~~~~~~~L~~l~~~~--~~~~------~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~ 70 (197) =.+++ |-+.+.+.|++...+- ..++ .+.=.|....+.-++. T Consensus 25 ~~k~tkaGA~v~~~~L~~~tp~~~~~~~~~~~~~~HlaD~I~~~~~~idg~----------------------------- 75 (139) T protein:vir:10 25 QEKITKAGADVYAKELAETTKEKHPNTKGDGGKYGHLSEDISSAAGDIDGD----------------------------- 75 (139) T ss_pred HHHHHHHHHHHHHHHHHHhcccccccCCCCCCCCCcccccceecCcccccc----------------------------- Confidence 22332 4455555565543320 0000 0000000000000000 Q ss_pred cccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHH----HHc-cCcHHH Q lcl|NC_021342. 71 EEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERG----ASR-DESTTS 134 (197) Q Consensus 71 ~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~----~~~-~~~~~~ 134 (197) ..+...+-| ..+ +...+|..-.|+++|+.+|+..|.++.++++.+.+.+. |.. +.+-+. 
T Consensus 76 --~~g~~~VG~--~~~-~~~Ahf~n~GT~~~~~~hFie~t~~e~~~ev~~a~~~~~ke~l~~~~~~~~~ 139 (139) T protein:vir:10 76 --HNGSSTVGF--HNK-AHIARFLNDGTKNIRADHFVDNARDDAKDAVFAAEAEKYQAMIAKANGGDSK 139 (139) T ss_pred --ccccceeCC--CCC-ceeeeeeccCccccCCCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCC Confidence 000000111 111 22344555568999999999999999998887766544 432 111111 No 157 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=61.80 E-value=0.062 Score=27.19 Aligned_cols=85 Identities=18% Similarity=0.291 Sum_probs=35.9 Q ss_pred Cc-ccccHHHHHHHHHHHH--HHH-------------------h-------------------------cCCeEEEEeec Q lcl|NC_021342. 1 MM-KVVGLQETLAELDKVL--GQI-------------------R-------------------------DDQYVTVGIHE 33 (197) Q Consensus 1 M~-ki~~~~~~~~~L~~l~--~~~-------------------~-------------------------~~~~V~VGi~~ 33 (197) |- +++|.+.+.+.|++.+ +++ . +.+.|+|||-. T Consensus 1 MsvevkGv~eil~~LE~k~g~~~~~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~~G~r~V~vgW~G 80 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFGIKEMVKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWIKGKRTVTIRWRG 80 (134) T ss_pred CeEEeecHHHHHHHHHHhhchhhhhhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeecCCceEEEEEEEc Confidence 43 5556666665555441 111 0 01445555532 Q ss_pred CCCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchh--HHHHH Q lcl|NC_021342. 
34 AAGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPW--LEPGV 111 (197) Q Consensus 34 ~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpF--lr~t~ 111 (197) + .++ --|-.+||||-+ ..+..+|+ =||-| ++.++ T Consensus 81 ~-~~R-------~~ivHLnE~Gyt-----------------~~r~Gk~i-------------------~PrG~G~i~~a~ 116 (134) T protein:vir:10 81 P-FER-------FRIVHLIENGHV-----------------EKKSGKFV-------------------KPKAMGGINRAI 116 (134) T ss_pred C-Cce-------eeEEEeeeccee-----------------ecCCCCee-------------------ccchhhHHHHHH Confidence 2 111 124455666621 00111111 12222 34467 Q ss_pred HHHHHHHHHHHHHHHHcc Q lcl|NC_021342. 112 QSKSNEYVTIIERGASRD 129 (197) Q Consensus 112 ~~~~~~~~~~~~~~~~~~ 129 (197) ++.+..+.+.++.-+..= T Consensus 117 ~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 117 RQGQNKYFETLKRELKKL 134 (134) T ss_pred HhhhHHHHHHHHHHHhcC Confidence 777777766665544332 No 158 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=52.09 E-value=0.42 Score=22.65 Aligned_cols=94 Identities=11% Similarity=0.174 Sum_probs=44.4 Q ss_pred CcccccHHH-HHHHHHHHHHHHh----c--------------------------------CCeEEEEeecCCCCCCCcc- Q lcl|NC_021342. 1 MMKVVGLQE-TLAELDKVLGQIR----D--------------------------------DQYVTVGIHEAAGDVESGE- 42 (197) Q Consensus 1 M~ki~~~~~-~~~~L~~l~~~~~----~--------------------------------~~~V~VGi~~~~~~~~~~~- 42 (197) |+...-.++ +.+.++++..++. . +..+.=||-.+..+.+++. 
T Consensus 20 ~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~tG~lr~swk~~~~~k~~~~~ 99 (163) T protein:vir:10 20 NANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQGGTLQKGWSKSRIEVSGRTY 99 (163) T ss_pred HhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhccccccccchhhccceecceeecCCce Confidence 221111111 1222333222221 0 0112222222212222222 Q ss_pred ----chHHHHhhHhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHH Q lcl|NC_021342. 43 ----INMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEY 118 (197) Q Consensus 43 ----~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~ 118 (197) .+.+.||.+-|||..+. + .|| +|-+.+|+.+.++.+.++ T Consensus 100 ~v~v~N~~~YA~~VE~GHR~~-~------------------------gGf------------V~G~fml~~s~~~~~~~~ 142 (163) T protein:vir:10 100 KQKVYNKVYYAPHVEYGHKTV-N------------------------GGF------------VPGQFFLHKTVEDTKSDM 142 (163) T ss_pred EEEEEecCCccchhhcceeec-C------------------------Cce------------eccchhhHHHHHHHHHHH Confidence 36788999999995432 0 122 688999999999988777 Q ss_pred HHHHHHH--------HHccCc Q lcl|NC_021342. 119 VTIIERG--------ASRDES 131 (197) Q Consensus 119 ~~~~~~~--------~~~~~~ 131 (197) .+.+++. +.|... T Consensus 143 ~~~~e~~l~~~l~k~~~~~~~ 163 (163) T protein:vir:10 143 EKRVRDKYDGFMRKVVLGNGK 163 (163) T ss_pred HHHHHHHHHHHHHHhhcCCCC Confidence 6665543 333333 No 159 >protein:vir:4460 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700383;genbank:gi:23505455;genbank:GeneID:955662 Probab=51.94 E-value=0.029 Score=29.01 Aligned_cols=79 Identities=22% Similarity=0.304 Sum_probs=46.1 Q ss_pred CCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHH-HHhCCCCCCcHHHHHhcCCCCchhHHHHH Q lcl|NC_021342. 
101 IPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMF-MTELQDPPNAKSTIRKKGSSNPLIDTGAL 179 (197) Q Consensus 101 IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~-I~~~~~ppna~~Ti~~KG~~~PLidTG~L 179 (197) ++--+||.--|++.+... =+..-...++..+|+....+.|+- +..|.+ +-..+|-..||.| T Consensus 1 M~~~~~lHvdF~qp~~~~--------Fnr~r~RraF~~iGq~h~r~Arrlvm~RGrs----------~pGe~P~~~TGrL 62 (170) T protein:vir:44 1 MPQKAYLHVDFVQPEELV--------FNRARMRRAFVKIGQVHMRDARRLVMKRGRS----------KPGENPSYRTGQL 62 (170) T ss_pred CCCCceeEEeeecCCcee--------ecHHHHHHHHHHHhHHHHHHHHHHHHHhcCC----------CCCCCCcchhhhh Confidence 111122221122111111 111123447788999888888853 444443 1245899999999 Q ss_pred HhhceeeeeccccccccC Q lcl|NC_021342. 180 RQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 180 ~~SIty~V~~k~~~~~~~ 197 (197) ..||.|.|-...--.-|| T Consensus 63 a~SIgy~Vpras~~rpG~ 80 (170) T protein:vir:44 63 ARSIGYYVPRASKKRPGL 80 (170) T ss_pred hhhhhhccccccCCCCce Confidence 999999998776667788 No 160 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=51.29 E-value=0.099 Score=26.09 Aligned_cols=70 Identities=14% Similarity=0.149 Sum_probs=35.2 Q ss_pred Ccc-hhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHhcCCCCchhHHHHHH Q lcl|NC_021342. 102 PAR-PWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFMTELQDPPNAKSTIRKKGSSNPLIDTGALR 180 (197) Q Consensus 102 P~R-pFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I~~~~~ppna~~Ti~~KG~~~PLidTG~L~ 180 (197) =+| .|=-.++ +++.+.+++... 
.-++++++...|..++..+|.- +| +|||.|+ T Consensus 1 Ma~~~i~~~Gl----d~L~~~L~~~~~-~~~v~~vv~~~~~~l~~~ak~~---------ap------------~dTG~lr 54 (92) T protein:vir:99 1 MADYSISWDGL----DALDEALANQQN-MNTVKKVVKKHTANLMTATQQA---------VP------------VDTGHLK 54 (92) T ss_pred CCceeeEeehH----HHHHHHHHhhcc-HHHHHHHHHHHHHHHHHHHHHh---------CC------------CCccccc Confidence 001 0000022 222232322111 2345667777777777766663 22 6999999 Q ss_pred hhceeeeecccccc---------ccC Q lcl|NC_021342. 181 QSVTYVVHSGKLPD---------EGL 197 (197) Q Consensus 181 ~SIty~V~~k~~~~---------~~~ 197 (197) +||+..+.++..+. +|- T Consensus 55 rSI~~~~~~~g~~~~v~~~gp~a~Ya 80 (92) T protein:vir:99 55 QSAQIQISRDGFTGSVTYGGGLVNYA 80 (92) T ss_pred eeeeEEeecCCeeEEEEeccCccccc Confidence 99998876654211 111 No 161 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=49.05 E-value=0.42 Score=22.67 Aligned_cols=85 Identities=19% Similarity=0.307 Sum_probs=35.5 Q ss_pred CcccccHHHHHHHH----------------------------HHHHHHHh------------------cCCeEEEEeecC Q lcl|NC_021342. 1 MMKVVGLQETLAEL----------------------------DKVLGQIR------------------DDQYVTVGIHEA 34 (197) Q Consensus 1 M~ki~~~~~~~~~L----------------------------~~l~~~~~------------------~~~~V~VGi~~~ 34 (197) =++++|.+.+.+.| +....-.. ..+.|+|||-.+ T Consensus 2 svevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G~ 81 (134) T protein:vir:95 2 SVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRGS 81 (134) T ss_pred eEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEcC Confidence 12233444433333 32222110 124466776432 Q ss_pred CCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchh--HHHHHH Q lcl|NC_021342. 
35 AGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPW--LEPGVQ 112 (197) Q Consensus 35 ~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpF--lr~t~~ 112 (197) .++ --|-.+||||-+- .+..+| |=||-| ++.+++ T Consensus 82 -~~R-------~~iiHLNE~Gytr-----------------~~~Gk~-------------------i~PrG~G~i~~a~~ 117 (134) T protein:vir:95 82 -KDR-------YKIVHLIEYGHVQ-----------------KGTGKF-------------------IKPKAMGGVNRAIR 117 (134) T ss_pred -Cce-------eEEEEeeccccee-----------------cccCCc-------------------cCcchhhHHHHHHH Confidence 111 2255677777321 001111 222332 455677 Q ss_pred HHHHHHHHHHHHHHHcc Q lcl|NC_021342. 113 SKSNEYVTIIERGASRD 129 (197) Q Consensus 113 ~~~~~~~~~~~~~~~~~ 129 (197) +.+..+.+.++.-+..= T Consensus 118 ~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:95 118 QGQNKYFETLKRELKKL 134 (134) T ss_pred hhhHHHHHHHHHHHhcC Confidence 77776666665544332 No 162 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=49.05 E-value=0.42 Score=22.67 Aligned_cols=85 Identities=19% Similarity=0.307 Sum_probs=35.5 Q ss_pred CcccccHHHHHHHH----------------------------HHHHHHHh------------------cCCeEEEEeecC Q lcl|NC_021342. 1 MMKVVGLQETLAEL----------------------------DKVLGQIR------------------DDQYVTVGIHEA 34 (197) Q Consensus 1 M~ki~~~~~~~~~L----------------------------~~l~~~~~------------------~~~~V~VGi~~~ 34 (197) =++++|.+.+.+.| +....-.. 
..+.|+|||-.+ T Consensus 2 svevkGv~eil~~le~k~g~~~~~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~~G~r~V~vgW~G~ 81 (134) T protein:vir:10 2 SVKVIGDKALERELEKRFGIKEMVKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWINGKRTITVHWRGS 81 (134) T ss_pred eEEEecHHHHHHHHHHhhchhhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeecCCceEEEEEEEcC Confidence 12233444433333 32222110 124466776432 Q ss_pred CCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchh--HHHHHH Q lcl|NC_021342. 35 AGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPW--LEPGVQ 112 (197) Q Consensus 35 ~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpF--lr~t~~ 112 (197) .++ --|-.+||||-+- .+..+| |=||-| ++.+++ T Consensus 82 -~~R-------~~iiHLNE~Gytr-----------------~~~Gk~-------------------i~PrG~G~i~~a~~ 117 (134) T protein:vir:10 82 -KDR-------YKIVHLIEYGHVQ-----------------KGTGKF-------------------IKPKAMGGVNRAIR 117 (134) T ss_pred -Cce-------eEEEEeeccccee-----------------cccCCc-------------------cCcchhhHHHHHHH Confidence 111 2255677777321 001111 222332 455677 Q ss_pred HHHHHHHHHHHHHHHcc Q lcl|NC_021342. 113 SKSNEYVTIIERGASRD 129 (197) Q Consensus 113 ~~~~~~~~~~~~~~~~~ 129 (197) +.+..+.+.++.-+..= T Consensus 118 ~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 118 QGQNKYFETLKRELKKL 134 (134) T ss_pred hhhHHHHHHHHHHHhcC Confidence 77776666665544332 No 163 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=40.00 E-value=0.9 Score=20.84 Aligned_cols=85 Identities=13% Similarity=0.142 Sum_probs=36.8 Q ss_pred CcccccHHHHHHHHHHHHHH----------------------------Hh------------------cCCeEEEEeecC Q lcl|NC_021342. 1 MMKVVGLQETLAELDKVLGQ----------------------------IR------------------DDQYVTVGIHEA 34 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~----------------------------~~------------------~~~~V~VGi~~~ 34 (197) |-.+.|.+++.+.|++-+.. .. 
..+.|+|||... T Consensus 1 m~evkGv~eilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~rtV~i~W~gp 80 (133) T protein:vir:96 1 MRLIYDTKKLERELEKRLSKRALMRITDRALTEAGEVVLEAIRTNLKYFRDTGAEYGEVKLSKPTWENGKRTIRVYWEGE 80 (133) T ss_pred CccccCHHHHHHHHHHhcCHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHhhccceeeeEEecCceecCCceEEEEEeecC Confidence 77777776666555443221 10 012344444322 Q ss_pred CCCCCCccchHHHHhhHhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchh--HHHHHH Q lcl|NC_021342. 35 AGDVESGEINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPW--LEPGVQ 112 (197) Q Consensus 35 ~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpF--lr~t~~ 112 (197) . .+ -.|-.+||||-- .+.. ..|-||-| ++.+++ T Consensus 81 ~-~R-------~~iVHLNE~G~y------------------tr~G-------------------k~i~PrG~G~I~~al~ 115 (133) T protein:vir:96 81 K-HR-------YSIVHLNEKGFY------------------AKDG-------------------KFIRPKGMGAIDKALR 115 (133) T ss_pred C-Cc-------eeeEeeecccce------------------ecCC-------------------ceeccchhhHHHHHHH Confidence 0 00 113345555510 0011 12344554 556666 Q ss_pred HHHHHHHHHHHHHHHccC Q lcl|NC_021342. 113 SKSNEYVTIIERGASRDE 130 (197) Q Consensus 113 ~~~~~~~~~~~~~~~~~~ 130 (197) ..+..+.+.+++-+..-+ T Consensus 116 ~se~~y~~~vk~el~kll 133 (133) T protein:vir:96 116 ASRDKFFKVYAEEVSKLL 133 (133) T ss_pred hhhHHHHHHHHHHHHHhC Confidence 666666555554443322 No 164 >protein:vir:487 Length: 187 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543094;swissprot:trembl:q8w625;genbank:gi:18249906;uniprot:Q8W625;genbank:GeneID:929690 Probab=36.75 E-value=0.57 Score=21.93 Aligned_cols=81 Identities=19% Similarity=0.186 Sum_probs=46.9 Q ss_pred HHHHHHHH--HHHHHHH---------HHHHHHccCcHHHHHHHHHHHHHHHHHHHH-HhCCCCCCcHHHHHhcCCCCchh Q lcl|NC_021342. 
107 LEPGVQSK--SNEYVTI---------IERGASRDESTTSILEKVGVTAQAAVRMFM-TELQDPPNAKSTIRKKGSSNPLI 174 (197) Q Consensus 107 lr~t~~~~--~~~~~~~---------~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I-~~~~~ppna~~Ti~~KG~~~PLi 174 (197) |.+.+... .+...+. .+...=+..-...++..+|+....+.|+-+ ..|.+ +-.++|-. T Consensus 1 ~~~~~~~~~~~nam~~~~~lHvdF~qp~~~~Fnr~riRraF~~iGq~h~r~ArrLvm~RGrs----------~pge~P~~ 70 (187) T protein:vir:48 1 MKNCVQRDDGVNAMNQTAFLHVDFKQPKELEFNRARLRRAFVQIGRVYMRDARRLVIKRGRS----------GPGENPGY 70 (187) T ss_pred CccccccccchhhhhhccceeEeeecCCceeecHHHHHHHHHHHhHHHHHHHHHHHHhcccC----------CCCCCCcc Confidence 22221110 0111000 000111122345688999999888888653 23332 11368999 Q ss_pred HHHHHHhhceeeeeccccccccC Q lcl|NC_021342. 175 DTGALRQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 175 dTG~L~~SIty~V~~k~~~~~~~ 197 (197) .||.|..||.|.|-+...-.-|| T Consensus 71 qTGrLa~SIgy~Vpkat~~RpG~ 93 (187) T protein:vir:48 71 QTGRLARSIGYYVPKKTTRRPGL 93 (187) T ss_pred hhhhhhhhhhhccccccCCCCcc Confidence 99999999999998777778888 No 165 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=36.54 E-value=0.92 Score=20.79 Aligned_cols=96 Identities=14% Similarity=0.164 Sum_probs=47.9 Q ss_pred Ccccc-----cHHHHHHHHHHHHHH-H---------------h----cCCeEEEEeecCC-----------CCCCCc--- Q lcl|NC_021342. 1 MMKVV-----GLQETLAELDKVLGQ-I---------------R----DDQYVTVGIHEAA-----------GDVESG--- 41 (197) Q Consensus 1 M~ki~-----~~~~~~~~L~~l~~~-~---------------~----~~~~V~VGi~~~~-----------~~~~~~--- 41 (197) |.++. |.+++.+.|.++... + . 
...-|.-|-+..+ .+..++ T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~g~~~~ 80 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQGNNYI 80 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccceeecCCeeE Confidence 65554 555555555444331 1 1 0111222222111 011122 Q ss_pred --cchHHHHhhHhHcCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHH Q lcl|NC_021342. 42 --EINMATLGAVLNFGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYV 119 (197) Q Consensus 42 --~~~~A~iA~~~EfGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~ 119 (197) -.+++.||.+-|||..++.+ .+| +|.+.+|+.++++.+..+. T Consensus 81 v~v~n~~~YA~~VE~Ghr~~~~------------------------~gf------------V~G~fml~~s~~~~~~~~~ 124 (141) T protein:vir:79 81 IEVVNPTEYASYVNFGHRTKDG------------------------KGW------------VKGQHFLTISEMELQSQVD 124 (141) T ss_pred EEEecCCcchhhhhcceeecCC------------------------cce------------eCCchhHHHHHHHHHHHHH Confidence 13678899999999533210 111 4778888888888887776 Q ss_pred HHHHH----HHHccCcH Q lcl|NC_021342. 120 TIIER----GASRDEST 132 (197) Q Consensus 120 ~~~~~----~~~~~~~~ 132 (197) +.+++ .+.+-+++ T Consensus 125 ~~~~~~l~~~l~~~~~~ 141 (141) T protein:vir:79 125 KIIEKKLLILLKGVFDA 141 (141) T ss_pred HHHHHHHHHHHHHhhcC Confidence 66554 34444444 No 166 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=32.10 E-value=0.13 Score=25.42 Aligned_cols=124 Identities=17% Similarity=0.124 Sum_probs=49.9 Q ss_pred CcccccHHHHHHHHHHH-----HHHHh--cCCeEEEEeecCCCC-CCCccchHHHHhhHhHcC---ceeeeCCCceeeec Q lcl|NC_021342. 
1 MMKVVGLQETLAELDKV-----LGQIR--DDQYVTVGIHEAAGD-VESGEINMATLGAVLNFG---AEIDHPGGTSYGYA 69 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l-----~~~~~--~~~~V~VGi~~~~~~-~~~~~~~~A~iA~~~EfG---a~I~~p~~~~~~~~ 69 (197) -++|.|..++...++++ .+.|. +...-.|+++.-... |.+. --|.=.--|.-| +.|++.+-.....+ T Consensus 8 ~vrV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar~~tP~g~--r~~~~s~~~r~G~L~~Sir~aaT~raa~V 85 (143) T protein:vir:62 8 TIRVDGLREFQRNVRTLRDKELNKAVREANKASGEVLIPQAKHESPDGK--RDAKSSKKYRPGKLDKSIKVTASAKGAVI 85 (143) T ss_pred heehHHHHHHHHHHHHhhCCchhHHHHHHHHHHHHHHHHHHHhhcCCcc--cccccccccCcchhhccccccccccceee Confidence 45666777777766666 22221 111223333321100 0000 000000000000 01111111000000 Q ss_pred ccccccc-cCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHH Q lcl|NC_021342. 70 TEEAESR-KEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEK 138 (197) Q Consensus 70 ~~~~~~~-~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 138 (197) .+++ ..+| |.-+--++-+.-+|-++-||..++...+++|.+..++-++.- .++.|+. T Consensus 86 ---rAG~~krVP-------YA~~I~~G~r~r~Isp~rFl~~a~a~te~~~~r~Ye~~i~~v--l~k~l~s 143 (143) T protein:vir:62 86 ---KAGSASRVP-------YAAAIHFGYRARNISPNRFLFRAMARKSDVVAATYERRIAAV--VEKYLES 143 (143) T ss_pred ---eeCCcCCCC-------cccccccCcccccccchhhhhhhhhccCHHHHHHHHHHHHHH--HHHHhcC Confidence 0011 1111 222223445667888999999999999999998876554431 1112222 No 167 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=30.17 E-value=0.14 Score=25.23 Aligned_cols=120 Identities=19% Similarity=0.189 Sum_probs=51.2 Q ss_pred CcccccHHHHHHHHHHH-----HHHHh--cCCeEEEEeecCCCCCCCccchHHHHhhHhHcCceeeeCCCce-eeecccc Q lcl|NC_021342. 
1 MMKVVGLQETLAELDKV-----LGQIR--DDQYVTVGIHEAAGDVESGEINMATLGAVLNFGAEIDHPGGTS-YGYATEE 72 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l-----~~~~~--~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa~I~~p~~~~-~~~~~~~ 72 (197) -++|.|..++...++++ .+.|. +...-.|+++.-. .+---|++ -|..++ |..-... T Consensus 8 ~vkV~Glr~f~~~mrK~~g~dl~k~lk~a~~~aa~v~~~~ar--------------~~tP~g~~--~p~~srr~r~G~L~ 71 (143) T protein:vir:13 8 TIQVDGLRQFQRNVRALRDKELNKAVREANKASGEVLIPQAK--------------HESPDGHR--DPKSSKRYRPGKLD 71 (143) T ss_pred heehHHHHHHHHHHHHhhCCcchHHHHHHHHHHHHHHHHHHH--------------hhcCCccc--ccccccccccchhh Confidence 45666777777777666 22221 1111223332111 11111111 011110 0000011 Q ss_pred cccccCC----ccccccc----cccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHH Q lcl|NC_021342. 73 AESRKEV----RFLKTGT----GFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEK 138 (197) Q Consensus 73 ~~~~~~~----~f~k~~~----g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 138 (197) ++.+... -.++.++ .|.-+--++-+.-+|-++-||..++...+++|.+..++-++.- .++.|+. T Consensus 72 ~Sir~aaT~raa~VrAGr~arVPYA~~I~~G~r~r~Is~~rFl~~a~a~te~~~~r~Ye~~i~~v--l~k~l~s 143 (143) T protein:vir:13 72 KSIKVTASAKGAVIKAGSAARVPYAAAIHFGYRKRNISANRFLYRAMARKSDVVAATYERRIAAV--VEKYLES 143 (143) T ss_pred ccccccccccceeeeecCcCCCCcccccccCCcccccchhhhhhhhhhccCHHHHHHHHHHHHHH--HHHHhcC Confidence 1111100 0111111 1222333445677888999999999999999998876554431 1112222 No 168 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=30.09 E-value=1.1 Score=20.31 Aligned_cols=84 Identities=11% Similarity=0.160 Sum_probs=34.5 Q ss_pred Ccccc------cHHHHHHHHHHHHHHHh------------------cCCeEEEEeecCCCCCCCccchHHHHhhHhHcCc Q lcl|NC_021342. 1 MMKVV------GLQETLAELDKVLGQIR------------------DDQYVTVGIHEAAGDVESGEINMATLGAVLNFGA 56 (197) Q Consensus 1 M~ki~------~~~~~~~~L~~l~~~~~------------------~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfGa 56 (197) |.+|+ +++.+...|++-++-.. 
+.+.|+|||.... .+ -.|-.+||||- T Consensus 24 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~G~r~V~i~W~gp~-~R-------~~iVHLNE~GY 95 (133) T protein:vir:78 24 LPQLVDPALIAGATLVAKTLKSEFVQFKDTGASIDEINIEKPSYDKGVRSIKIDWKGPK-DR-------YKIIHLNEYGY 95 (133) T ss_pred HHHhhhHHHHHHHHHHHHHHHHhhcchhcccceeeeEEecCeeeeCCceEEEEEEecCC-Cc-------eeEEEeeccce Confidence 44444 23333333333322211 1245666665321 11 12446777772 Q ss_pred eeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchh--HHHHHHHHHHHHHHHHHHHHHccC Q lcl|NC_021342. 57 EIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPW--LEPGVQSKSNEYVTIIERGASRDE 130 (197) Q Consensus 57 ~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpF--lr~t~~~~~~~~~~~~~~~~~~~~ 130 (197) + ++.+| |-||-| ++.+++..+..+.+.+++=+...+ T Consensus 96 t-------------------r~Gk~-------------------i~PrG~G~i~~a~~~se~~y~~~vk~el~k~l 133 (133) T protein:vir:78 96 T-------------------RNGKK-------------------ITPAGTGSVARSLRISERAYRAIVQKKIGDKL 133 (133) T ss_pred e-------------------cCCCe-------------------EccchhhHHHHHHHhhhHHHHHHHHHHHHhhC Confidence 1 11111 223433 455555555555555444433333 No 169 >protein:vir:2688 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075507;genbank:gi:12719436;genbank:GeneID:920156 Probab=27.37 E-value=1.7 Score=19.34 Aligned_cols=82 Identities=13% Similarity=0.187 Sum_probs=41.5 Q ss_pred Ccccc------cHHHHHHHHHHHHHHHhc-----------------C---CeEEEEeecCCCCCCCccchHHHHhhHhHc Q lcl|NC_021342. 1 MMKVV------GLQETLAELDKVLGQIRD-----------------D---QYVTVGIHEAAGDVESGEINMATLGAVLNF 54 (197) Q Consensus 1 M~ki~------~~~~~~~~L~~l~~~~~~-----------------~---~~V~VGi~~~~~~~~~~~~~~A~iA~~~Ef 54 (197) |.+|+ +++.+...|++-++...+ + +.|+|||.... . 
=-.|-.+||| T Consensus 14 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~~g~~~rtV~i~W~gp~-~-------R~~iVHLNE~ 85 (123) T protein:vir:26 14 MQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPM-N-------RKNIIHLNEH 85 (123) T ss_pred HHHhhhHHHHHHHHHHHHHHHHhhHHhhhccceeeeEEecCeeeccCCccceEEEEeecCC-C-------ceeeEeeecc Confidence 77766 556666666666554321 1 55777775431 1 1235578888 Q ss_pred CceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchh--HHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 55 GAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPW--LEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 55 Ga~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpF--lr~t~~~~~~~~~~~~~~~~~~ 128 (197) |-+ + +..+ |=||=| ++.+++..+..+.+.+++=++. T Consensus 86 GYt-r------------------~Gk~-------------------i~PRG~G~i~~a~~~se~~y~~~vk~eL~k 123 (123) T protein:vir:26 86 GYT-R------------------DGKK-------------------YTPRGFGVIAKTLAANERKYREIIKKELAR 123 (123) T ss_pred cee-c------------------CCCe-------------------EccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 832 1 1111 223433 4555556665555555544433 No 170 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=24.86 E-value=1.9 Score=19.08 Aligned_cols=82 Identities=12% Similarity=0.167 Sum_probs=34.3 Q ss_pred Ccccc------cHHHHHHHHHHHHHHHh-----------------cC---CeEEEEeecCCCCCCCccchHHHHhhHhHc Q lcl|NC_021342. 1 MMKVV------GLQETLAELDKVLGQIR-----------------DD---QYVTVGIHEAAGDVESGEINMATLGAVLNF 54 (197) Q Consensus 1 M~ki~------~~~~~~~~L~~l~~~~~-----------------~~---~~V~VGi~~~~~~~~~~~~~~A~iA~~~Ef 54 (197) |.+|. +++.+...|++-++-.. ++ +.|+|||.... 
.+ -.|-.+||| T Consensus 24 ~~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W~gp~-~R-------~~iVHLNE~ 95 (133) T protein:vir:93 24 MQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPM-NR-------KNIIHLNEH 95 (133) T ss_pred hHhhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceEEEEEeecCC-Cc-------eeEEEeecc Confidence 44443 23333333443333211 11 34566664321 11 124466777 Q ss_pred CceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchh--HHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 55 GAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPW--LEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 55 Ga~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpF--lr~t~~~~~~~~~~~~~~~~~~ 128 (197) |-+ ++..| |-||-| ++.+++..+..+.+.+++=++. T Consensus 96 Gyt-------------------r~Gk~-------------------i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 96 GYT-------------------RDGKK-------------------YTPRGFGVIAKTLAANERKYREIIKKELAR 133 (133) T ss_pred cee-------------------cCCCe-------------------EccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 721 11111 223444 4555666666665555544443 No 171 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=22.10 E-value=2.5 Score=18.40 Aligned_cols=71 Identities=10% Similarity=-0.011 Sum_probs=30.6 Q ss_pred CcccccH-------------------------HHHHHHHHHHHHHHhcCCeEEEEeecCCCCCCCccchHHHHhhHhHcC Q lcl|NC_021342. 
1 MMKVVGL-------------------------QETLAELDKVLGQIRDDQYVTVGIHEAAGDVESGEINMATLGAVLNFG 55 (197) Q Consensus 1 M~ki~~~-------------------------~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~~~~~~~~~A~iA~~~EfG 55 (197) ...+.++ ......+...++.+.....+-+ .+++.||.-.||| T Consensus 36 ~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~t~~~~~~~i~~~~~g~~iyi-------------~Nn~pYA~~LEyG 102 (131) T protein:vir:94 36 ASPVDTGRFRMNWMASGSTPADGTTDATDKSGNTATGNATSFVLNAADWHTFTL-------------TNNLPYAQRLEYG 102 (131) T ss_pred hCCCchhhhhccchhccccccccccCCCCCCchhhHHHHHHHHhhccccceEEE-------------eeCchhhhhhhcc Confidence 2222222 1112222222222211111111 2456677777777 Q ss_pred ceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021342. 56 AEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGA 126 (197) Q Consensus 56 a~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~ 126 (197) + .+-+|+.|.|.++.+-..-+.+...++= T Consensus 103 ~------------------------------------------S~QAP~g~v~~~~~~~~~~v~~~~~e~k 131 (131) T protein:vir:94 103 W------------------------------------------SQQAPQGFVRVNVSRFQQLLNEEASKVK 131 (131) T ss_pred c------------------------------------------cCCCcchHHHHHHHHHHHHHHHHHHhcC Confidence 3 3568899999887655433333222221 No 172 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=21.69 E-value=2.4 Score=18.49 Aligned_cols=82 Identities=12% Similarity=0.165 Sum_probs=33.1 Q ss_pred Ccccc------cHHHHHHHHHHHHHHHh-----------------cC---CeEEEEeecCCCCCCCccchHHHHhhHhHc Q lcl|NC_021342. 1 MMKVV------GLQETLAELDKVLGQIR-----------------DD---QYVTVGIHEAAGDVESGEINMATLGAVLNF 54 (197) Q Consensus 1 M~ki~------~~~~~~~~L~~l~~~~~-----------------~~---~~V~VGi~~~~~~~~~~~~~~A~iA~~~Ef 54 (197) |.+|. +++.+...|++-++-.. ++ +.|+|||.... 
.+ -.|-.+||| T Consensus 24 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W~gp~-~R-------~~iVHLNE~ 95 (133) T protein:vir:96 24 MQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPM-NR-------KNIIHLNEH 95 (133) T ss_pred HHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEeecCC-Cc-------eeEEEeecc Confidence 44443 23333333333333211 11 34566664321 11 124466777 Q ss_pred CceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchh--HHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 55 GAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPW--LEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 55 Ga~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpF--lr~t~~~~~~~~~~~~~~~~~~ 128 (197) |-+ ++..| |-||-| ++.+++..+..+.+.+++=++. T Consensus 96 Gyt-------------------r~Gk~-------------------i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:96 96 GYT-------------------RDGKK-------------------YTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred cee-------------------cCCCe-------------------EccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 721 11111 223433 4555555555555555444433 No 173 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=21.69 E-value=2.4 Score=18.49 Aligned_cols=82 Identities=12% Similarity=0.165 Sum_probs=33.1 Q ss_pred Ccccc------cHHHHHHHHHHHHHHHh-----------------cC---CeEEEEeecCCCCCCCccchHHHHhhHhHc Q lcl|NC_021342. 1 MMKVV------GLQETLAELDKVLGQIR-----------------DD---QYVTVGIHEAAGDVESGEINMATLGAVLNF 54 (197) Q Consensus 1 M~ki~------~~~~~~~~L~~l~~~~~-----------------~~---~~V~VGi~~~~~~~~~~~~~~A~iA~~~Ef 54 (197) |.+|. +++.+...|++-++-.. ++ +.|+|||.... 
.+ -.|-.+||| T Consensus 24 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W~gp~-~R-------~~iVHLNE~ 95 (133) T protein:vir:94 24 MQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPM-NR-------KNIIHLNEH 95 (133) T ss_pred HHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEeecCC-Cc-------eeEEEeecc Confidence 44443 23333333333333211 11 34566664321 11 124466777 Q ss_pred CceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchh--HHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 55 GAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPW--LEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 55 Ga~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpF--lr~t~~~~~~~~~~~~~~~~~~ 128 (197) |-+ ++..| |-||-| ++.+++..+..+.+.+++=++. T Consensus 96 Gyt-------------------r~Gk~-------------------i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:94 96 GYT-------------------RDGKK-------------------YTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred cee-------------------cCCCe-------------------EccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 721 11111 223433 4555555555555555444433 No 174 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=21.69 E-value=2.4 Score=18.49 Aligned_cols=82 Identities=12% Similarity=0.165 Sum_probs=33.1 Q ss_pred Ccccc------cHHHHHHHHHHHHHHHh-----------------cC---CeEEEEeecCCCCCCCccchHHHHhhHhHc Q lcl|NC_021342. 1 MMKVV------GLQETLAELDKVLGQIR-----------------DD---QYVTVGIHEAAGDVESGEINMATLGAVLNF 54 (197) Q Consensus 1 M~ki~------~~~~~~~~L~~l~~~~~-----------------~~---~~V~VGi~~~~~~~~~~~~~~A~iA~~~Ef 54 (197) |.+|. +++.+...|++-++-.. ++ +.|+|||.... 
.+ -.|-.+||| T Consensus 24 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W~gp~-~R-------~~iVHLNE~ 95 (133) T protein:vir:78 24 MQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPM-NR-------KNIIHLNEH 95 (133) T ss_pred HHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEeecCC-Cc-------eeEEEeecc Confidence 44443 23333333333333211 11 34566664321 11 124466777 Q ss_pred CceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchh--HHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 55 GAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPW--LEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 55 Ga~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpF--lr~t~~~~~~~~~~~~~~~~~~ 128 (197) |-+ ++..| |-||-| ++.+++..+..+.+.+++=++. T Consensus 96 Gyt-------------------r~Gk~-------------------i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:78 96 GYT-------------------RDGKK-------------------YTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred cee-------------------cCCCe-------------------EccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 721 11111 223433 4555555555555555444433 No 175 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=21.69 E-value=2.4 Score=18.49 Aligned_cols=82 Identities=12% Similarity=0.165 Sum_probs=33.1 Q ss_pred Ccccc------cHHHHHHHHHHHHHHHh-----------------cC---CeEEEEeecCCCCCCCccchHHHHhhHhHc Q lcl|NC_021342. 1 MMKVV------GLQETLAELDKVLGQIR-----------------DD---QYVTVGIHEAAGDVESGEINMATLGAVLNF 54 (197) Q Consensus 1 M~ki~------~~~~~~~~L~~l~~~~~-----------------~~---~~V~VGi~~~~~~~~~~~~~~A~iA~~~Ef 54 (197) |.+|. +++.+...|++-++-.. ++ +.|+|||.... 
.+ -.|-.+||| T Consensus 24 m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~~~rtV~i~W~gp~-~R-------~~iVHLNE~ 95 (133) T protein:vir:93 24 MQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGSQERAVLIEWVGPM-NR-------KNIIHLNEH 95 (133) T ss_pred HHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCCcceeEEEEeecCC-Cc-------eeEEEeecc Confidence 44443 23333333333333211 11 34566664321 11 124466777 Q ss_pred CceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchh--HHHHHHHHHHHHHHHHHHHHHc Q lcl|NC_021342. 55 GAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPW--LEPGVQSKSNEYVTIIERGASR 128 (197) Q Consensus 55 Ga~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpF--lr~t~~~~~~~~~~~~~~~~~~ 128 (197) |-+ ++..| |-||-| ++.+++..+..+.+.+++=++. T Consensus 96 Gyt-------------------r~Gk~-------------------i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 96 GYT-------------------RDGKK-------------------YTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred cee-------------------cCCCe-------------------EccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 721 11111 223433 4555555555555555444433 No 176 >protein:vir:4514 Length: 168 # NCBI annotation: unknown # Family: family:all:2152 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599040;genbank:gi:19548998;genbank:GeneID:935228 Probab=21.07 E-value=0.32 Score=23.31 Aligned_cols=78 Identities=17% Similarity=0.225 Sum_probs=42.8 Q ss_pred cccccccccccCCCcchhHHHHHHHHHHHHHHHHHHHHHccCcHHHHHHHHHHHHHHHHHHHH-HhCCCCCCcHHHHHhc Q lcl|NC_021342. 89 KPLGVTKPHKINIPARPWLEPGVQSKSNEYVTIIERGASRDESTTSILEKVGVTAQAAVRMFM-TELQDPPNAKSTIRKK 167 (197) Q Consensus 89 ~~~~~~~~~~v~IP~RpFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~l~~iG~~~~~~i~~~I-~~~~~ppna~~Ti~~K 167 (197) +...++.. 
..+-|.+= .=+..-...++..||+.-..+.+.-+ ..+.+ + T Consensus 1 m~~~~lHv-dF~qp~~~--------------------~Fnr~riRraFv~igq~hmr~ArrlV~rrgrs----------~ 49 (168) T protein:vir:45 1 MTTSFLHV-DFQQPAEM--------------------RFNRARVRRAFVTIGQRHMRDARRLVMRHARS----------A 49 (168) T ss_pred CCccceee-eeecCCce--------------------eecHHHHHHHHHHHhHHHHHHHHHHHhhcccc----------c Confidence 11000000 01112110 00111233466778877666666543 33443 2 Q ss_pred CCCCchhHHHHHHhhceeeeeccccccccC Q lcl|NC_021342. 168 GSSNPLIDTGALRQSVTYVVHSGKLPDEGL 197 (197) Q Consensus 168 G~~~PLidTG~L~~SIty~V~~k~~~~~~~ 197 (197) -..+|-..||.|..||.|.|-...--.-|| T Consensus 50 pGe~P~~qTGrLa~SIgy~Vpras~~rpG~ 79 (168) T protein:vir:45 50 PGENPGYQTGRLARSIGYMVPRASKHRPGF 79 (168) T ss_pred CCCCCcchhhhhhhhhhhccccccCCCCce Confidence 345999999999999999998776677788 No 177 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=20.29 E-value=2.8 Score=18.13 Aligned_cols=72 Identities=17% Similarity=0.094 Sum_probs=32.7 Q ss_pred CcccccHHHHHHHHHHHHHHHhcCCeEEEEeecCCCC----CCCcc-----------------------chHHHHhhHhH Q lcl|NC_021342. 1 MMKVVGLQETLAELDKVLGQIRDDQYVTVGIHEAAGD----VESGE-----------------------INMATLGAVLN 53 (197) Q Consensus 1 M~ki~~~~~~~~~L~~l~~~~~~~~~V~VGi~~~~~~----~~~~~-----------------------~~~A~iA~~~E 53 (197) ...+..+ ++..+..|.++-+..... +++.. .+++.||.-.| T Consensus 44 ~sPVdTG------------r~Ranw~vs~~~~~~~~~~~~d~~G~~t~~~~~~~~~~i~~~k~g~~iyi~Nn~pYA~~LE 111 (145) T protein:vir:10 44 LSPVDTG------------RFKANWQISANSPAQQSLNEYDQTGGQTKTYLARQARAVANSKATSVIYITNRLDYAADLE 111 (145) T ss_pred hCCccch------------hhccccceeecccccccccccCCCCccchhhHHHHHHHhhcccccceEEEeeCchhhhHhh Confidence 3333322 122334444444332100 11111 13455555555 Q ss_pred cCceeeeCCCceeeecccccccccCCccccccccccccccccccccCCCcchhHHHHHHHHHH---HHHHHHHHHH Q lcl|NC_021342. 
54 FGAEIDHPGGTSYGYATEEAESRKEVRFLKTGTGFKPLGVTKPHKINIPARPWLEPGVQSKSN---EYVTIIERGA 126 (197) Q Consensus 54 fGa~I~~p~~~~~~~~~~~~~~~~~~~f~k~~~g~~~~~~~~~~~v~IP~RpFlr~t~~~~~~---~~~~~~~~~~ 126 (197) ||+ -+-.|..|.|.++.+-.. +..+.+++++ T Consensus 112 yG~------------------------------------------S~QAP~G~v~~~~~~~~~~v~~~~~e~k~~~ 145 (145) T protein:vir:10 112 YGA------------------------------------------SNQAPAGVLGVVQARLGRYFQEAVEEARRAI 145 (145) T ss_pred ccc------------------------------------------cCCCcchHHHHHHHHHHHHHHHHHHHhhccC Confidence 553 345789999998877653 2333334444 Done!