Query lcl|NC_019767.1_cdsid_YP_007151615.1 [gene=F865_gp08] [protein=hypothetical protein] [protein_id=YP_007151615.1] [location=5831..6280]
Match_columns 149
No_of_seqs 131 out of 320
Neff 8.8
Searched_HMMs 1612
Date Thu Nov 7 16:45:01 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_8 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_8_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:194 Length: 149 # 100.0 6.4E-43 4E-46 251.9 14.4 149 1-149 1-149 (149)
2 protein:vir:93617 Length: 148 100.0 1E-42 6.3E-46 250.8 14.7 148 1-149 1-148 (148)
3 protein:vir:4347 Length: 164 # 100.0 2.1E-39 1.3E-42 232.7 13.3 148 1-149 1-156 (164)
4 protein:vir:1891 Length: 179 # 100.0 3.4E-39 2.1E-42 231.5 13.5 149 1-149 1-171 (179)
5 protein:vir:80362 Length: 140 100.0 2.7E-36 1.6E-39 215.6 14.1 138 1-149 1-138 (140)
6 protein:vir:100075 Length: 140 100.0 2.4E-36 1.5E-39 215.9 13.5 138 1-149 1-138 (140)
7 protein:vir:107568 Length: 146 100.0 2.5E-36 1.6E-39 215.7 13.5 143 1-147 1-146 (146)
8 protein:vir:102875 Length: 146 100.0 2.5E-36 1.6E-39 215.7 13.5 143 1-147 1-146 (146)
9 protein:vir:102085 Length: 146 100.0 2.5E-36 1.6E-39 215.7 13.5 143 1-147 1-146 (146)
10 protein:vir:105007 Length: 146 100.0 2.5E-36 1.6E-39 215.7 13.5 143 1-147 1-146 (146)
11 protein:vir:100243 Length: 140 100.0 5E-36 3.1E-39 214.1 13.4 137 1-149 1-138 (140)
12 protein:vir:1437 Length: 140 # 100.0 9.6E-36 6E-39 212.6 13.6 138 1-149 1-138 (140)
13 protein:vir:1386 Length: 149 # 100.0 3.2E-35 2E-38 209.7 13.4 145 1-149 1-149 (149)
14 protein:vir:5745 Length: 135 # 100.0 5E-35 3.1E-38 208.6 13.0 130 2-148 1-135 (135)
15 protein:vir:105089 Length: 133 100.0 2.5E-34 1.6E-37 204.8 12.9 129 1-146 1-133 (133)
16 protein:vir:3873 Length: 128 # 100.0 4.5E-33 2.8E-36 197.9 11.4 128 4-144 1-128 (128)
17 protein:vir:1273 Length: 127 # 100.0 2.1E-32 1.3E-35 194.3 10.8 124 1-144 1-127 (127)
18 protein:vir:94538 Length: 125 99.9 2.5E-31 1.6E-34 188.3 11.3 124 1-146 1-125 (125)
19 protein:vir:101594 Length: 173 99.9 5.4E-31 3.4E-34 186.5 11.1 118 6-146 1-173 (173)
20 protein:vir:97088 Length: 157 99.9 5.8E-30 3.6E-33 180.9 14.0 147 2-149 1-155 (157)
21 protein:vir:9708 Length: 125 # 99.9 8.2E-30 5.1E-33 180.0 11.2 121 7-145 1-125 (125)
22 protein:vir:3617 Length: 112 # 99.9 7E-30 4.3E-33 180.4 9.8 112 2-140 1-112 (112)
23 protein:vir:95789 Length: 114 99.9 1.4E-29 8.8E-33 178.8 10.2 114 4-144 1-114 (114)
24 protein:vir:106623 Length: 115 99.9 8.1E-29 5E-32 174.6 10.5 109 6-140 1-115 (115)
25 protein:vir:96358 Length: 115 99.9 8.5E-29 5.3E-32 174.5 10.5 109 6-140 1-115 (115)
26 protein:vir:9312 Length: 115 # 99.9 8.5E-29 5.3E-32 174.5 10.5 109 6-140 1-115 (115)
27 protein:vir:78858 Length: 115 99.9 8.5E-29 5.3E-32 174.5 10.5 109 6-140 1-115 (115)
28 protein:vir:103917 Length: 115 99.9 8.5E-29 5.3E-32 174.5 10.5 109 6-140 1-115 (115)
29 protein:vir:96225 Length: 115 99.9 8.5E-29 5.3E-32 174.5 10.5 109 6-140 1-115 (115)
30 protein:vir:97144 Length: 115 99.9 8.5E-29 5.3E-32 174.5 10.5 109 6-140 1-115 (115)
31 protein:vir:99744 Length: 115 99.9 1.9E-28 1.2E-31 172.6 10.3 109 6-140 1-115 (115)
32 protein:vir:9930 Length: 108 # 99.9 3.2E-28 2E-31 171.3 10.4 108 8-141 1-108 (108)
33 protein:vir:81106 Length: 125 99.9 8.2E-28 5.1E-31 169.1 12.2 122 2-144 1-125 (125)
34 protein:vir:98342 Length: 125 99.9 8.2E-28 5.1E-31 169.1 12.2 122 2-144 1-125 (125)
35 protein:vir:79988 Length: 125 99.9 8.2E-28 5.1E-31 169.1 12.2 122 2-144 1-125 (125)
36 protein:vir:4704 Length: 125 # 99.9 8.2E-28 5.1E-31 169.1 12.2 122 2-144 1-125 (125)
37 protein:vir:9414 Length: 125 # 99.9 8.2E-28 5.1E-31 169.1 12.2 122 2-144 1-125 (125)
38 protein:vir:743 Length: 108 # 99.9 3E-28 1.9E-31 171.5 9.7 108 6-140 1-108 (108)
39 protein:vir:98409 Length: 108 99.9 7.9E-28 4.9E-31 169.2 9.2 108 6-140 1-108 (108)
40 protein:vir:102154 Length: 119 99.9 1.3E-27 7.8E-31 168.1 8.6 118 1-144 1-119 (119)
41 protein:vir:4906 Length: 114 # 99.9 7.9E-27 4.9E-30 163.7 9.0 112 1-141 1-114 (114)
42 protein:vir:2740 Length: 114 # 99.9 7.9E-27 4.9E-30 163.7 9.0 112 1-141 1-114 (114)
43 protein:vir:96486 Length: 112 99.9 1.7E-26 1.1E-29 161.8 10.0 110 1-139 1-112 (112)
44 protein:vir:106570 Length: 182 99.9 5.4E-26 3.4E-29 159.1 12.1 147 1-149 1-182 (182)
45 protein:vir:5978 Length: 144 # 99.8 4.1E-24 2.6E-27 148.8 10.6 115 1-140 3-144 (144)
46 protein:vir:93738 Length: 137 99.8 7E-24 4.3E-27 147.5 10.0 108 1-136 1-137 (137)
47 protein:vir:94490 Length: 137 99.8 7E-24 4.3E-27 147.5 10.0 108 1-136 1-137 (137)
48 protein:vir:97427 Length: 137 99.8 7E-24 4.3E-27 147.5 10.0 108 1-136 1-137 (137)
49 protein:vir:95894 Length: 137 99.8 1.2E-23 7.3E-27 146.3 9.8 108 1-136 1-137 (137)
50 protein:vir:94796 Length: 137 99.8 1E-23 6.5E-27 146.6 9.4 108 1-136 1-137 (137)
51 protein:vir:94108 Length: 149 99.8 8.9E-24 5.5E-27 147.0 8.8 108 1-136 13-149 (149)
52 protein:vir:105916 Length: 149 99.8 1.2E-23 7.7E-27 146.2 8.7 108 1-136 13-149 (149)
53 protein:vir:4956 Length: 153 # 99.8 7.6E-23 4.7E-26 141.9 11.4 136 1-149 1-140 (153)
54 protein:vir:96829 Length: 135 99.8 7.4E-23 4.6E-26 141.9 9.3 108 1-136 1-135 (135)
55 protein:vir:96121 Length: 137 99.8 8.1E-23 5E-26 141.7 9.5 108 1-136 1-137 (137)
56 protein:vir:107099 Length: 137 99.8 2.3E-22 1.4E-25 139.3 10.5 108 1-136 1-137 (137)
57 protein:vir:100887 Length: 139 99.8 6.5E-22 4.1E-25 136.7 11.2 136 2-149 1-136 (139)
58 protein:vir:94654 Length: 142 99.8 9.3E-22 5.8E-25 135.9 10.3 115 1-140 1-142 (142)
59 protein:vir:105330 Length: 137 99.8 1.5E-21 9.6E-25 134.7 10.5 108 1-136 1-137 (137)
60 protein:vir:5000 Length: 141 # 99.8 2.5E-21 1.5E-24 133.6 11.4 136 1-149 1-140 (141)
61 protein:vir:99101 Length: 142 99.8 8.2E-22 5.1E-25 136.2 8.2 113 1-137 1-142 (142)
62 protein:vir:8669 Length: 142 # 99.8 8.2E-22 5.1E-25 136.2 8.2 113 1-137 1-142 (142)
63 protein:vir:4859 Length: 140 # 99.8 4.1E-21 2.5E-24 132.4 11.6 137 1-149 1-140 (140)
64 protein:vir:79034 Length: 141 99.7 6E-21 3.7E-24 131.4 11.7 134 1-149 1-137 (141)
65 protein:vir:100223 Length: 139 99.7 1E-20 6.3E-24 130.2 11.2 136 4-149 1-136 (139)
66 protein:vir:4833 Length: 140 # 99.7 2.4E-20 1.5E-23 128.1 11.4 137 1-149 1-140 (140)
67 protein:vir:81147 Length: 126 99.7 1.2E-19 7.7E-23 124.2 10.1 120 1-143 1-126 (126)
68 protein:vir:97327 Length: 116 99.6 2.5E-18 1.6E-21 117.1 6.7 87 26-136 1-116 (116)
69 protein:vir:1243 Length: 116 # 99.6 2.5E-18 1.6E-21 117.1 6.7 87 26-136 1-116 (116)
70 protein:vir:95062 Length: 116 99.6 2.6E-18 1.6E-21 117.0 6.6 87 26-136 1-116 (116)
71 protein:vir:78077 Length: 141 99.6 1.1E-17 6.7E-21 113.6 9.4 115 1-144 1-141 (141)
72 protein:vir:105467 Length: 144 99.5 3.9E-17 2.4E-20 110.6 11.3 124 1-149 1-142 (144)
73 protein:vir:99528 Length: 92 # 99.5 1.5E-17 9.5E-21 112.8 6.8 92 1-116 1-92 (92)
74 protein:vir:3848 Length: 159 # 99.5 1E-16 6.3E-20 108.3 11.3 146 1-149 1-159 (159)
75 protein:vir:81067 Length: 119 99.5 1.8E-17 1.1E-20 112.3 5.1 93 41-149 1-117 (119)
76 protein:vir:100652 Length: 134 99.5 9.7E-17 6E-20 108.4 8.8 121 4-142 1-134 (134)
77 protein:vir:10367 Length: 119 99.5 2.3E-17 1.4E-20 111.8 5.1 93 41-149 1-117 (119)
78 protein:vir:9513 Length: 134 # 99.4 3E-16 1.9E-19 105.7 8.7 121 4-142 1-134 (134)
79 protein:vir:101302 Length: 134 99.4 3E-16 1.9E-19 105.7 8.7 121 4-142 1-134 (134)
80 protein:vir:102441 Length: 137 99.4 5.5E-16 3.4E-19 104.2 6.3 107 2-135 1-137 (137)
81 protein:vir:9879 Length: 127 # 99.4 1.5E-15 9.5E-19 101.8 7.3 109 8-141 1-127 (127)
82 protein:vir:106041 Length: 137 99.3 1.1E-15 6.6E-19 102.7 5.8 104 2-134 1-137 (137)
83 protein:vir:966 Length: 123 # 99.3 1.4E-14 8.8E-18 96.5 10.2 117 2-141 1-123 (123)
84 protein:vir:107545 Length: 140 99.3 6.6E-15 4.1E-18 98.3 5.6 108 1-134 1-140 (140)
85 protein:vir:97982 Length: 140 99.3 6.6E-15 4.1E-18 98.3 5.6 108 1-134 1-140 (140)
86 protein:vir:9647 Length: 132 # 99.2 1.6E-13 9.7E-17 90.8 9.8 126 2-145 1-132 (132)
87 protein:vir:6246 Length: 143 # 99.2 2.1E-13 1.3E-16 90.1 9.5 137 1-149 1-143 (143)
88 protein:vir:1332 Length: 143 # 99.1 3.6E-13 2.3E-16 88.8 9.6 137 1-149 1-143 (143)
89 protein:vir:6216 Length: 125 # 99.1 3.2E-13 2E-16 89.1 6.7 115 2-143 1-125 (125)
90 protein:vir:102963 Length: 163 99.1 2.8E-12 1.7E-15 84.0 11.7 144 2-149 1-160 (163)
91 protein:vir:98636 Length: 138 98.9 8.2E-12 5.1E-15 81.4 9.5 127 1-145 6-138 (138)
92 protein:vir:78335 Length: 133 98.9 2.7E-11 1.7E-14 78.5 9.5 123 4-143 1-133 (133)
93 protein:vir:106506 Length: 137 98.9 3.9E-12 2.4E-15 83.2 4.8 108 1-148 1-137 (137)
94 protein:vir:78644 Length: 133 98.9 3.5E-11 2.2E-14 77.9 9.9 123 4-141 1-133 (133)
95 protein:vir:94419 Length: 133 98.9 3.5E-11 2.2E-14 77.9 9.9 123 4-141 1-133 (133)
96 protein:vir:96973 Length: 133 98.9 3.5E-11 2.2E-14 77.9 9.9 123 4-141 1-133 (133)
97 protein:vir:9363 Length: 133 # 98.9 3.5E-11 2.2E-14 77.9 9.9 123 4-141 1-133 (133)
98 protein:vir:95372 Length: 124 98.8 6.3E-11 3.9E-14 76.5 9.9 114 1-141 1-124 (124)
99 protein:vir:93898 Length: 133 98.7 2.6E-10 1.6E-13 73.1 9.9 123 4-141 1-133 (133)
100 protein:vir:80116 Length: 127 98.7 1.9E-10 1.2E-13 73.8 8.8 117 1-144 1-127 (127)
101 protein:vir:104347 Length: 145 98.5 4E-10 2.5E-13 72.1 6.6 137 1-143 1-145 (145)
102 protein:vir:1087 Length: 161 # 98.5 1.2E-09 7.7E-13 69.4 9.2 138 1-149 1-161 (161)
103 protein:vir:79638 Length: 146 98.5 8.9E-10 5.6E-13 70.2 8.4 137 1-147 1-146 (146)
104 protein:vir:7412 Length: 168 # 98.5 1.8E-09 1.1E-12 68.6 9.3 137 1-149 1-165 (168)
105 protein:vir:107703 Length: 147 98.5 2.1E-09 1.3E-12 68.2 9.0 136 1-147 1-147 (147)
106 protein:vir:99833 Length: 190 98.4 1.6E-09 9.8E-13 68.8 7.4 132 1-147 1-190 (190)
107 protein:vir:103280 Length: 142 98.4 2.1E-09 1.3E-12 68.1 7.5 134 1-143 1-142 (142)
108 protein:vir:1028 Length: 168 # 98.3 5.6E-09 3.5E-12 65.8 8.3 137 1-149 1-165 (168)
109 protein:vir:79091 Length: 175 98.3 7.1E-09 4.4E-12 65.3 8.3 137 1-149 1-174 (175)
110 protein:vir:94994 Length: 131 98.3 5.5E-09 3.4E-12 65.9 6.9 123 2-140 1-131 (131)
111 protein:vir:94944 Length: 121 98.2 3E-09 1.8E-12 67.3 5.1 113 1-128 1-121 (121)
112 protein:vir:78380 Length: 131 98.2 8.7E-09 5.4E-12 64.8 7.1 123 2-140 1-131 (131)
113 protein:vir:101563 Length: 155 98.2 3.6E-09 2.2E-12 66.9 4.5 103 4-149 1-105 (155)
114 protein:vir:3994 Length: 168 # 98.2 2.2E-08 1.4E-11 62.6 8.6 135 1-149 1-165 (168)
115 protein:vir:3163 Length: 145 # 98.1 1.6E-08 9.7E-12 63.4 7.2 131 1-149 1-145 (145)
116 protein:vir:96012 Length: 133 98.1 4.6E-08 2.9E-11 60.8 9.3 122 1-143 1-133 (133)
117 protein:vir:1988 Length: 156 # 98.1 2.1E-08 1.3E-11 62.7 7.4 130 2-145 1-156 (156)
118 protein:vir:4096 Length: 140 # 98.1 5.3E-08 3.3E-11 60.5 9.1 130 1-149 1-139 (140)
119 protein:vir:77650 Length: 155 98.0 5.9E-09 3.7E-12 65.7 2.8 103 4-149 1-105 (155)
120 protein:vir:5257 Length: 148 # 98.0 1.4E-08 8.5E-12 63.7 4.0 94 2-149 1-98 (148)
121 protein:vir:95157 Length: 144 98.0 2.7E-08 1.7E-11 62.1 5.5 130 1-144 1-144 (144)
122 protein:vir:103841 Length: 155 97.9 1.2E-07 7.4E-11 58.6 7.9 133 2-147 1-155 (155)
123 protein:vir:79225 Length: 155 97.8 1.2E-07 7.6E-11 58.5 7.0 133 1-147 1-155 (155)
124 protein:vir:106728 Length: 155 97.8 2.2E-08 1.4E-11 62.6 2.5 103 4-149 1-105 (155)
125 protein:vir:78607 Length: 155 97.8 2.2E-08 1.4E-11 62.6 2.5 103 4-149 1-105 (155)
126 protein:vir:107851 Length: 175 97.7 2.9E-07 1.8E-10 56.4 7.8 137 1-149 1-174 (175)
127 protein:vir:97190 Length: 148 97.7 1.2E-07 7.2E-11 58.6 5.0 135 1-149 1-148 (148)
128 protein:vir:80425 Length: 134 97.7 7.4E-08 4.6E-11 59.7 3.8 125 2-141 1-134 (134)
129 protein:vir:96105 Length: 193 97.6 1.5E-07 9.2E-11 58.0 4.4 135 2-149 1-143 (193)
130 protein:vir:99196 Length: 155 97.6 6.3E-07 3.9E-10 54.6 7.7 132 1-149 1-153 (155)
131 protein:vir:96774 Length: 152 97.6 2.6E-07 1.6E-10 56.7 5.5 126 1-138 10-152 (152)
132 protein:vir:107757 Length: 189 97.6 5.5E-08 3.4E-11 60.4 1.6 92 21-149 1-100 (189)
133 protein:vir:102338 Length: 116 97.5 1.1E-06 6.6E-10 53.3 7.8 94 26-144 1-116 (116)
134 protein:vir:96288 Length: 100 97.4 4.8E-07 3E-10 55.2 4.4 88 1-94 13-100 (100)
135 protein:vir:7449 Length: 123 # 97.4 5.2E-06 3.2E-09 49.5 10.0 121 1-147 1-123 (123)
136 protein:vir:94069 Length: 168 97.3 5.7E-07 3.5E-10 54.8 4.1 104 25-149 1-108 (168)
137 protein:vir:95260 Length: 160 97.3 1.2E-06 7.4E-10 53.1 5.7 91 1-149 1-102 (160)
138 protein:vir:105773 Length: 131 97.1 4E-06 2.5E-09 50.2 6.5 111 6-141 1-131 (131)
139 protein:vir:97088 Length: 157 97.0 8.8E-06 5.5E-09 48.3 8.1 130 6-149 1-151 (157)
140 protein:vir:101508 Length: 120 97.0 2.5E-05 1.5E-08 45.8 10.0 114 1-149 1-118 (120)
141 protein:vir:6071 Length: 150 # 96.9 7.9E-06 4.9E-09 48.5 6.6 126 8-141 1-150 (150)
142 protein:vir:98892 Length: 108 96.9 1.4E-05 8.8E-09 47.2 7.9 103 1-141 1-108 (108)
143 protein:vir:99546 Length: 200 96.8 7.8E-06 4.8E-09 48.6 6.4 120 1-149 4-150 (200)
144 protein:vir:5703 Length: 150 # 96.8 8.7E-06 5.4E-09 48.3 6.6 131 8-141 1-150 (150)
145 protein:vir:2688 Length: 123 # 96.8 1.9E-05 1.2E-08 46.5 8.3 111 14-141 1-123 (123)
146 protein:vir:80970 Length: 112 96.8 2.5E-05 1.6E-08 45.8 8.7 104 2-143 1-112 (112)
147 protein:vir:1838 Length: 149 # 96.7 1.1E-05 6.7E-09 47.8 6.1 130 1-141 1-149 (149)
148 protein:vir:2026 Length: 150 # 96.6 1.4E-05 8.7E-09 47.2 6.5 131 8-141 1-150 (150)
149 protein:vir:98557 Length: 149 96.6 1.4E-05 8.7E-09 47.2 6.4 125 8-141 1-149 (149)
150 protein:vir:79179 Length: 155 96.6 1.8E-05 1.1E-08 46.6 6.9 131 1-141 1-155 (155)
151 protein:vir:79115 Length: 148 96.6 1.4E-05 8.4E-09 47.3 6.2 127 8-141 1-148 (148)
152 protein:vir:396 Length: 184 # 96.6 9.2E-05 5.7E-08 42.7 10.5 140 6-149 1-184 (184)
153 protein:vir:80037 Length: 199 96.3 3.2E-06 2E-09 50.7 1.3 130 2-149 1-146 (199)
154 protein:vir:45 Length: 112 # N 96.1 8.5E-05 5.3E-08 42.9 8.1 104 2-143 1-112 (112)
155 protein:vir:100312 Length: 152 96.1 4.8E-05 3E-08 44.3 6.6 133 2-142 1-152 (152)
156 protein:vir:107099 Length: 137 95.7 0.00014 8.6E-08 41.7 7.5 112 11-148 1-137 (137)
157 protein:vir:3427 Length: 192 # 95.6 0.0004 2.5E-07 39.2 9.7 140 6-149 1-192 (192)
158 protein:vir:96763 Length: 177 95.4 0.00064 3.9E-07 38.1 10.2 148 1-149 4-175 (177)
159 protein:vir:4790 Length: 114 # 95.4 0.00024 1.5E-07 40.4 7.7 104 2-149 1-114 (114)
160 protein:vir:6375 Length: 205 # 95.3 0.00073 4.5E-07 37.8 10.2 146 2-149 1-202 (205)
161 protein:vir:105330 Length: 137 95.0 0.00016 9.8E-08 41.4 5.5 111 11-148 1-137 (137)
162 protein:vir:9823 Length: 118 # 94.9 0.00026 1.6E-07 40.3 6.5 103 1-149 1-117 (118)
163 protein:vir:3036 Length: 118 # 94.9 0.00026 1.6E-07 40.3 6.5 103 1-149 1-117 (118)
164 protein:vir:7993 Length: 108 # 94.9 2.2E-05 1.4E-08 46.1 0.5 90 1-126 1-108 (108)
165 protein:vir:1581 Length: 116 # 94.7 0.00051 3.2E-07 38.6 7.6 105 2-140 1-116 (116)
166 protein:vir:79687 Length: 113 93.1 0.00098 6.1E-07 37.1 6.2 103 11-147 1-113 (113)
167 protein:vir:8106 Length: 150 # 92.9 0.00019 1.2E-07 41.0 1.9 117 1-149 5-144 (150)
168 protein:vir:1164 Length: 156 # 92.4 0.0011 6.7E-07 36.8 5.5 135 1-145 1-156 (156)
169 protein:vir:79555 Length: 192 90.9 0.015 9.1E-06 30.6 10.0 142 8-149 1-184 (192)
170 protein:vir:102190 Length: 93 85.3 0.011 7E-06 31.3 5.6 91 30-144 1-93 (93)
171 protein:vir:105825 Length: 108 68.5 0.02 1.3E-05 29.9 1.8 90 1-126 11-108 (108)
172 protein:vir:102608 Length: 108 68.5 0.02 1.3E-05 29.9 1.8 90 1-126 11-108 (108)
173 protein:vir:4200 Length: 133 # 66.4 0.13 8E-05 25.5 5.7 127 1-141 1-133 (133)
174 protein:vir:4460 Length: 170 # 59.5 0.17 0.00011 24.7 5.1 131 1-149 1-168 (170)
175 protein:vir:487 Length: 187 # 57.1 0.17 0.0001 24.9 4.6 134 1-149 14-185 (187)
176 protein:vir:4162 Length: 133 # 51.3 0.37 0.00023 22.9 5.5 127 1-141 1-133 (133)
177 protein:vir:5745 Length: 135 # 43.2 0.85 0.00053 21.0 8.7 119 6-149 1-132 (135)
178 protein:vir:7859 Length: 126 # 40.3 0.19 0.00012 24.6 2.1 117 8-134 1-126 (126)
179 protein:vir:101654 Length: 126 40.3 0.19 0.00012 24.6 2.1 117 8-134 1-126
(126) 180 protein:vir:78894 Length: 105 40.0 0.16 9.8E-05 25.0 1.6 101 2-141 1-105 (105) 181 protein:vir:3787 Length: 231 # 40.0 0.47 0.00029 22.4 4.2 141 2-148 1-231 (231) 182 protein:vir:79034 Length: 141 39.6 1 0.00063 20.6 7.8 128 1-149 9-141 (141) 183 protein:vir:4514 Length: 168 # 31.2 1.5 0.0009 19.7 5.4 139 1-149 1-167 (168) 184 protein:vir:3750 Length: 227 # 28.0 1.8 0.0011 19.2 5.4 135 2-149 1-225 (227) 185 protein:vir:1891 Length: 179 # 21.7 2.6 0.0016 18.3 12.1 145 1-149 5-175 (179) 186 protein:vir:78163 Length: 92 # 21.5 0.93 0.00057 20.8 2.5 91 1-129 1-92 (92) No 1 >protein:vir:194 Length: 149 # NCBI annotation: Gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037704;genbank:gi:9634169;genbank:GeneID:1262536 Probab=100.00 E-value=6.4e-43 Score=251.92 Aligned_cols=149 Identities=99% Similarity=1.349 Sum_probs=141.4 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) ||+++|+|+|||+|++.|++|+.++.++++++||.+||++|+++|+++||+++|++++||.++..+....+++...+++. T Consensus 1 mm~~~~~i~Gl~~l~~~l~~l~~~~~~~~~~~al~~~a~~i~~~ak~~aP~~~g~l~~si~~~~~~~~~~~~~~~~v~~~ 80 (149) T protein:vir:19 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIDRAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) T ss_pred CcceeeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhhccccccccccccceeeccccc Confidence 99999999999999999999999998889999999999999999999999999999999999999888899999999888 Q ss_pred cccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 
81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) ......+.........+.+++|||+|+||||++|||||||+||+++++++++++|.++|+++|+|+++| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PF~~pA~~~~k~~~~~~~~~~l~~~l~k~~~k 149 (149) T protein:vir:19 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) T ss_pred ccccccccccceeecCCCCccceeeeeccCCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 887777777777777778889999999999999999999999999999999999999999999999999 No 2 >protein:vir:93617 Length: 148 # NCBI annotation: putative structural component # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449299;genbank:gi:157166047;interpro:IPR010064;interpro:IPR011693;uniprot:Q6H9U2;genbank:GeneID:5580439 Probab=100.00 E-value=1e-42 Score=250.82 Aligned_cols=148 Identities=75% Similarity=1.174 Sum_probs=137.3 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) ||+++|+|+|||+|++.|++|+.++.++++++||++||++|+++|+.+||+++|.+.+||.++... ...|++...+++. T Consensus 1 mm~~~~~i~Gldel~~~l~~L~~~~~~~~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~-~~~g~~~~~v~~~ 79 (148) T protein:vir:93 1 MIETLLDFSGLEDISRDLQLLSGAENNRVLREATRAGANVLKEEVVSRAPVRRGKLRRNVVVLSRR-SRDGGMESGVHIR 79 (148) T ss_pred CcceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcchhhhhceecccc-ccCCceeeeeeec Confidence 999999999999999999999999888899999999999999999999999999999999877644 4577888888888 Q ss_pred cccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 
81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) .+....+.....+..++..++|||||+||||++|||||||+|||+++++++++.|.++++++|+++++| T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~pa~PFl~pA~~~~k~~~~~~~~~~~~~~i~k~~~k 148 (148) T protein:vir:93 80 GVNPDTGNSDNTMKADNPRNAFYWRFVEMGTVNMPPHPFVRPAFDVRSEQAAQVAIARMNRAIDEVLRR 148 (148) T ss_pred ccccccccccceeecCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHHHHhcC Confidence 877777777666777788899999999999999999999999999999999999999999999999999 No 3 >protein:vir:4347 Length: 164 # NCBI annotation: Orf14 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061510;genbank:gi:9635606;genbank:GeneID:1262873 Probab=100.00 E-value=2.1e-39 Score=232.68 Aligned_cols=148 Identities=18% Similarity=0.314 Sum_probs=121.4 Q ss_pred Ccc-ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCc-----CCCcccccceecc--cccccCCc Q lcl|NC_019767. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPV-----RTGKLKKNVVVVT--QKSRRRGE 72 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~-----~~g~l~~~i~~~~--~~~~~~g~ 72 (149) |++ ++|+|.|||+|.++|++|+.++.++++++||++||++|+++|+.+||+ .++++.++|.+.. ......+. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~~~ 80 (164) T protein:vir:43 1 MADTVEFSITGLDSLLGKLDSVTDDVKRRGGRAALRKAAMIVVQAAKQGAEKVDDPGTGRSISDNIALRWNGRLFKRTGD 80 (164) T ss_pred CCcceEEeeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCcccCCCccchhhhhhhhhcccCccccccc Confidence 887 899999999999999999999988999999999999999999999997 4578888887643 33444555 Q ss_pred cccceeeecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 
73 ISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) +...+++...... ...........++++|||||+||||++|||||||+|||++++++++++|.++|+++|+++++| T Consensus 81 ~~~~vg~~~~~~~-~~~~~~~~~~~~~~~~y~~f~EfGT~km~a~PFlrPA~~~~k~~~~~~~~~~l~~~i~ka~~k 156 (164) T protein:vir:43 81 LGFRIGVLHGAVL-PKKGERSDKTANAPTPHWRLLEFGTEDMRAQPFMRSALADNIAEVTSTFVSEYEKGIDRAIKR 156 (164) T ss_pred eeEEecccccccc-cccccccccCCCCCcceEEEeecCCCCCCCCcchhhhHHHhHHHHHHHHHHHHHHHHHHHHHH Confidence 5544444332221 112223334566789999999999999999999999999999999999999999999999999 No 4 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=100.00 E-value=3.4e-39 Score=231.51 Aligned_cols=149 Identities=28% Similarity=0.485 Sum_probs=119.3 Q ss_pred Ccc-ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcC-----CCcccccceecccc--cccCCc Q lcl|NC_019767. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVR-----TGKLKKNVVVVTQK--SRRRGE 72 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~-----~g~l~~~i~~~~~~--~~~~g~ 72 (149) |++ |+|+|.||+||.++|++|+.++.++++++||++||++|+++|+.+||+. +|.+.++|.+.... ....|. T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~ 80 (179) T protein:vir:18 1 MADSVEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGD 80 (179) T ss_pred CCceEEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccccc Confidence 997 8999999999999999999999889999999999999999999999764 56778888766543 344454 Q ss_pred cccceeeeccccc------------cccc--ccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 
73 ISSGVHIRGVNPR------------TGNS--DNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIAR 138 (149) Q Consensus 73 ~~~~~~~~~~~~~------------~~~~--~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~ 138 (149) +...+++...... .+.. ..........++|||||+||||++|||||||+|||+++++++++.|.++ T Consensus 81 ~~~~vgv~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~~~a~~~i~~~ 160 (179) T protein:vir:18 81 LAFRVGVMGGARQYANTKANVRKGRAGKTYKTSGDKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVDNDVINVFSTE 160 (179) T ss_pred eeEeeecccccccccccccccccCcccccccccccccCCCCccceeEEeccCCCCCCCCccchhhHHhhHHHHHHHHHHH Confidence 4444443221110 0000 1112234456899999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcC Q lcl|NC_019767. 139 MNQAIDEVLSK 149 (149) Q Consensus 139 l~~~i~k~~~k 149 (149) |+++|+|+++| T Consensus 161 l~~~i~k~lk~ 171 (179) T protein:vir:18 161 MGKAIDRAIRL 171 (179) T ss_pred HHHHHHHHHHh Confidence 99999999999 No 5 >protein:vir:80362 Length: 140 # NCBI annotation: gp10, phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111089;genbank:gi:134288660;genbank:GeneID:4960609 Probab=100.00 E-value=2.7e-36 Score=215.62 Aligned_cols=138 Identities=40% Similarity=0.676 Sum_probs=114.3 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |. +|+|+|||+|++.|+.|+.++.++++++||+++|++|+++|+.+||+++|++++||.+...+....+...... 
T Consensus 1 Ma--~~~i~Gld~l~~~l~~l~~~~~~k~~~~a~~~~a~~v~~~ak~~aP~~tG~l~~~i~~~~~~~~~~~~~~~~~--- 75 (140) T protein:vir:80 1 MS--SIQIVGLADLLADFERLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGLATAG--- 75 (140) T ss_pred Cc--eeeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhceeeeccccccccceeeee--- Confidence 55 5678899999999999999988889999999999999999999999999999999987765544332221111 Q ss_pred cccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) ... +.. ......+++|||+|+||||++|||||||+||+++++++++++|.+++.++|++++++ T Consensus 76 -~~~--~~~---~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:80 76 -VRV--RTK---GKADSPSNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDQALGG 138 (140) T ss_pred -eec--ccc---cccCCCCCcceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 110 000 112345678999999999999999999999999999999999999999999999999 No 6 >protein:vir:100075 Length: 140 # NCBI annotation: gp9 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945039;genbank:gi:38707899;genbank:GeneID:2744122 Probab=100.00 E-value=2.4e-36 Score=215.89 Aligned_cols=138 Identities=41% Similarity=0.695 Sum_probs=114.6 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |.+ |+|+|||+|++.|++|+.++.++++++||+++|++|+++|+++||+++|+|++||.++..+....+.. ..+.+. 
T Consensus 1 Ma~--~~i~Gld~l~~~l~~L~~~~~~k~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~~-~~~g~~ 77 (140) T protein:vir:10 1 MSS--IQIIGLADLRADFEKLAKSQSTKALRRATVAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPGL-ATAGVR 77 (140) T ss_pred Cce--eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhccccccccccccce-EEeeee Confidence 664 56889999999999999998888999999999999999999999999999999998776554332222 111111 Q ss_pred cccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) . . ........+++|||+|+||||++|||||||+||+++++++++++|.+++.++|+|++++ T Consensus 78 ~---~-----~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:10 78 V---R-----TKGKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGG 138 (140) T ss_pred e---c-----cccccCCCCccceeeeeccCCCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 1 0 01123345778999999999999999999999999999999999999999999999999 No 7 >protein:vir:107568 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338191;genbank:gi:77020147;genbank:GeneID:3703699 Probab=100.00 E-value=2.5e-36 Score=215.74 Aligned_cols=143 Identities=22% Similarity=0.400 Sum_probs=121.2 Q ss_pred Ccc-ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |++ |+|+|+||++|+++|++|+.++ ++++++||++||++|+++++.++|+++|.+++++.... 
...++..+.+.+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~-~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~---~~~~~~~~~i~~ 76 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRG-EKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPW---RTGQHGADQIKV 76 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccc---ccccccccccee Confidence 887 8999999999999999999985 67999999999999999999999999999998864332 234566666666 Q ss_pred eccccccccccccc--ccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019767. 80 RGVNPRTGNSDNTM--KANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVL 147 (149) Q Consensus 80 ~~~~~~~~~~~~~~--~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~ 147 (149) ..++...+.....+ .....+++|||||+||||++|||||||+||+++++++++++|.++|.++|++++ T Consensus 77 ~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 77 TKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred ccccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 66555554444333 233456789999999999999999999999999999999999999999999999 No 8 >protein:vir:102875 Length: 146 # NCBI annotation: conserved phage protein, HK97 gp10 family # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338140;genbank:gi:77020200;genbank:GeneID:3703784 Probab=100.00 E-value=2.5e-36 Score=215.74 Aligned_cols=143 Identities=22% Similarity=0.400 Sum_probs=121.2 Q ss_pred Ccc-ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |++ |+|+|+||++|+++|++|+.++ ++++++||++||++|+++++.++|+++|.+++++.... 
...++..+.+.+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~-~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~---~~~~~~~~~i~~ 76 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRG-EKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPW---RTGQHGADQIKV 76 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccc---ccccccccccee Confidence 887 8999999999999999999985 67999999999999999999999999999998864332 234566666666 Q ss_pred eccccccccccccc--ccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019767. 80 RGVNPRTGNSDNTM--KANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVL 147 (149) Q Consensus 80 ~~~~~~~~~~~~~~--~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~ 147 (149) ..++...+.....+ .....+++|||||+||||++|||||||+||+++++++++++|.++|.++|++++ T Consensus 77 ~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 77 TKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred ccccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 66555554444333 233456789999999999999999999999999999999999999999999999 No 9 >protein:vir:102085 Length: 146 # NCBI annotation: head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512318;genbank:gi:89152487;genbank:GeneID:3953078 Probab=100.00 E-value=2.5e-36 Score=215.74 Aligned_cols=143 Identities=22% Similarity=0.400 Sum_probs=121.2 Q ss_pred Ccc-ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |++ |+|+|+||++|+++|++|+.++ ++++++||++||++|+++++.++|+++|.+++++.... 
...++..+.+.+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~-~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~---~~~~~~~~~i~~ 76 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRG-EKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPW---RTGQHGADQIKV 76 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccc---ccccccccccee Confidence 887 8999999999999999999985 67999999999999999999999999999998864332 234566666666 Q ss_pred eccccccccccccc--ccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019767. 80 RGVNPRTGNSDNTM--KANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVL 147 (149) Q Consensus 80 ~~~~~~~~~~~~~~--~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~ 147 (149) ..++...+.....+ .....+++|||||+||||++|||||||+||+++++++++++|.++|.++|++++ T Consensus 77 ~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 77 TKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred ccccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 66555554444333 233456789999999999999999999999999999999999999999999999 No 10 >protein:vir:105007 Length: 146 # NCBI annotation: conserved phage protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459972;genbank:gi:85701387;genbank:GeneID:3882148 Probab=100.00 E-value=2.5e-36 Score=215.74 Aligned_cols=143 Identities=22% Similarity=0.400 Sum_probs=121.2 Q ss_pred Ccc-ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |++ |+|+|+||++|+++|++|+.++ ++++++||++||++|+++++.++|+++|.+++++.... 
...++..+.+.+ T Consensus 1 Ma~~~~~~i~Gl~el~~~l~~L~~~~-~~~~~~al~~ga~~i~~~ak~~ap~~~~~~~~~~~~~~---~~~~~~~~~i~~ 76 (146) T protein:vir:10 1 MADGIDLDLLGFDRLVTELDQMGLRG-EKIEDKALAAGGEPIRKAIAERAPRSPSPKKRSKSEPW---RTGQHGADQIKV 76 (146) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCccccccccccccc---ccccccccccee Confidence 887 8999999999999999999985 67999999999999999999999999999998864332 234566666666 Q ss_pred eccccccccccccc--ccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019767. 80 RGVNPRTGNSDNTM--KANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVL 147 (149) Q Consensus 80 ~~~~~~~~~~~~~~--~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~ 147 (149) ..++...+.....+ .....+++|||||+||||++|||||||+||+++++++++++|.++|.++|++++ T Consensus 77 ~~~~~~~g~~~~~vg~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pa~~~~k~~~~~~~~~~l~~~l~ka~ 146 (146) T protein:vir:10 77 TKAKLEGGIKTVKIGLNKADRSPWFYLKFHEWGTSKMPAHPFIEPGFNASKAEAVRAMTDILKNEMRLDL 146 (146) T ss_pred ccccccccceeEEeeeccCCCCCcceeeeeccCCCCCCCCcchhHHHHHhHHHHHHHHHHHHHHHHhhcC Confidence 66555554444333 233456789999999999999999999999999999999999999999999999 No 11 >protein:vir:100243 Length: 140 # NCBI annotation: gp72 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355408;genbank:gi:77864698;genbank:GeneID:3725965 Probab=100.00 E-value=5e-36 Score=214.12 Aligned_cols=137 Identities=44% Similarity=0.710 Sum_probs=113.8 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCc-cccceee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGE-ISSGVHI 79 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~-~~~~~~~ 79 (149) |.+ |+|+|+|+|+++|++|+.++.++++++||++||++|+++|+.+||+++|+|++||.++..+...... 
...++ T Consensus 1 Ma~--~~i~Gld~l~~~l~~l~~~~~~k~~~~al~~~a~~v~~~ak~~ap~~tG~l~~sI~~~~~~~~~~~~~~~~~~-- 76 (140) T protein:vir:10 1 MSS--VQILGLADLQADFLKLAKAQSTKALRRATVAGANVIRDEARARAPKKTGKLKRNIVTAALKQKDSPGIATAGV-- 76 (140) T ss_pred Cce--eeehhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHHhceecccccccccceeEEee-- Confidence 664 6688999999999999999888899999999999999999999999999999999887654433221 11111 Q ss_pred ecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) ... ........+++|||+|+||||++|||||||+||++++++++++.|.++++++|+|++++ T Consensus 77 ---~~~-----~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:10 77 ---RVR-----TKGKADSPNNAFYWRFVELGTQFMKAEPFMRPAFDASIAQAEGAIRTEIARAIDQVVGG 138 (140) T ss_pred ---ccc-----cccccCCCCcccccceeccCcCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 100 00112345678999999999999999999999999999999999999999999999999 No 12 >protein:vir:1437 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536366;genbank:gi:17975171;genbank:GeneID:929147 Probab=100.00 E-value=9.6e-36 Score=212.55 Aligned_cols=138 Identities=40% Similarity=0.667 Sum_probs=114.1 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |.+ |+|+|||+|++.|+.|+.++.++++++||.+||++|+++++.+||+++|+|++||.++..+...... ...+.+. 
T Consensus 1 M~~--~~i~Gld~l~~~l~~l~~~~~~~~~~~al~~~a~~v~~~ak~~aP~~tG~l~~sI~~~~~~~~~~~~-~~~vg~~ 77 (140) T protein:vir:14 1 MSS--IQIIGLADLRADFEKLAKSQSAKALRRATLAGAKVIRDEARKRAPKKTGKLRRNIVSAALRQKDAPG-LATAGVR 77 (140) T ss_pred Cce--eeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCChhhHHhhcccccccccccce-eEEeeee Confidence 654 6688999999999999999888899999999999999999999999999999999876554432222 1111111 Q ss_pred cccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) . +.. ......+++|||+|+||||++|||||||+||++++++++++.|.+++.++|++++++ T Consensus 78 ~-----~~~---~~~~~~~~~~y~~f~E~GT~~~~a~pFl~pa~~~~~~~~~~~~~~~~~~~l~k~~~~ 138 (140) T protein:vir:14 78 V-----RTK---GKADSPNNAFYWRFDEFGTQHMKAQPFMRPAFDASIGEAEGAIRTELARAIDRVLGG 138 (140) T ss_pred e-----ccc---cccCCCCccceeeeeccccCCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 1 111 122345679999999999999999999999999999999999999999999999999 No 13 >protein:vir:1386 Length: 149 # NCBI annotation: Gp9 protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612838;genbank:gi:20065972;genbank:GeneID:935787 Probab=100.00 E-value=3.2e-35 Score=209.66 Aligned_cols=145 Identities=19% Similarity=0.315 Sum_probs=116.4 Q ss_pred Ccc-ceeehhhHHHHHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCcccccee Q lcl|NC_019767. 1 MIE-TSLDFSGLNDIAKDLEALSR-AENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~-~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~ 78 (149) |++ ++|+|.||+||+++|++|+. ...++++++||++||++|+++++.+||++.+...... ......+++.+++. 
T Consensus 1 Ma~~~~~~i~Gl~eL~~~l~~L~~~~~~~k~~~~Al~~ga~~v~~~~k~~aP~~~~~~~~~~----~~~~~~~~~~d~i~ 76 (149) T protein:vir:13 1 MSDGWEIKFEGLDDLIKTFEQLGTEKENEDVEKSILKECGDLAKKTVAPLIHISDDNSKSGR----KGSRPPGHAANNIP 76 (149) T ss_pred CCceeEEEeecHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCCcccccc----ccccccchhhhcce Confidence 886 89999999999999999963 4457899999999999999999999999765432221 11223556666776 Q ss_pred eecccccccccccccc--cCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 79 IRGVNPRTGNSDNTMK--ANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~--~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) +..+....+.....+. ...++++|||||+||||++|||||||+||+++++++++++|.++|++.|+++++- T Consensus 77 ~~~~~~~~g~~~~~VG~~~~~~~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~~~lG~ 149 (149) T protein:vir:13 77 EPKIRKKKGNLQCVVGWEKSDNTPFYYMKMEEWGTSERPPHHAFGKTNKILKRVYDNIAQKKYDNFVKEKLGD 149 (149) T ss_pred ecccccccceeEEEeeccCCCCCccceeeeeccCccCCCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 6665555444433332 2234578999999999999999999999999999999999999999999999999 No 14 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=100.00 E-value=5e-35 Score=208.65 Aligned_cols=130 Identities=19% Similarity=0.368 Sum_probs=109.1 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCC----CcccccceecccccccCCccccce Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRT----GKLKKNVVVVTQKSRRRGEISSGV 77 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~----g~l~~~i~~~~~~~~~~g~~~~~~ 77 (149) |.++|+|+||++|+++|++|+.++.++++++||++||++|+++++.++|+++ |++++||.++..+... 
T Consensus 1 M~~~~~i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~v~~~~k~~ap~~~~~~~g~l~~~I~i~~~k~~~-------- 72 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIAVGEEVGTKILRDAGRAAMAVVEADMKQNAGYDNSSTNAHMRDSIKIRSSRGKA-------- 72 (135) T ss_pred CceeeeehhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhHHhhcccccccccc-------- Confidence 8899999999999999999999988889999999999999999999999975 7888888765433221 Q ss_pred eeecccccccccccccc-cCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019767. 78 HIRGVNPRTGNSDNTMK-ANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLS 148 (149) Q Consensus 78 ~~~~~~~~~~~~~~~~~-~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~ 148 (149) +.....+. +......||++|+||||++|||||||+|||++++++++++|.++|+++|+|+.+ T Consensus 73 ---------~~~~v~v~vg~~~~~~~~~~f~E~GT~~~~a~PF~~pa~~~~~~~~~~~~~~~~~~~l~ka~r 135 (135) T protein:vir:57 73 ---------GSTVVVLRVGPTRSHYMKALAQEFGTIKQVAKPFIRPALDYNKMQVLRILTVEIRDGLSTLSR 135 (135) T ss_pred ---------cceeEEEEecCCCCcceeEeecccCCCCCCCCcchhHhHHHhHHHHHHHHHHHHHHHHHHhcC Confidence 11111111 122344688999999999999999999999999999999999999999999999 No 15 >protein:vir:105089 Length: 133 # NCBI annotation: Gp11 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006591;genbank:gi:46402097;genbank:GeneID:2777955 Probab=100.00 E-value=2.5e-34 Score=204.78 Aligned_cols=129 Identities=24% Similarity=0.385 Sum_probs=102.2 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCc----ccccceecccccccCCccccc Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGK----LKKNVVVVTQKSRRRGEISSG 76 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~----l~~~i~~~~~~~~~~g~~~~~ 76 (149) ||+++ |+||++|+++|++|+.++.++++++||.+||++|+++|+.+||+++|. ++++|.++.......+ ... 
T Consensus 1 M~~~~--i~Gl~el~~~l~~L~~~~~~k~~~~Al~~~a~~i~~~ak~~ap~~~~~~~~~~~~~I~v~~~~~~~~~--~~~ 76 (133) T protein:vir:10 1 MIRME--VKGLDELERQLTALGEKVATKVLRDAGREALKVVEEDMKQHAGFDETSTGQHMRDSIKIRSSTRKAQG--NAV 76 (133) T ss_pred CeeEe--eehHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCcchhhhhhcccccccccccCc--cce Confidence 88776 779999999999999998888999999999999999999999999886 4555543221111110 000 Q ss_pred eeeecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 77 VHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEV 146 (149) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~ 146 (149) +. ..++.....+|||+|+||||++|||||||+|||++++++++++|.++|+++|+|- T Consensus 77 ~~-------------v~vg~~~~~~~y~~f~E~GT~k~~a~PF~~pA~~~~~~~~~~~~~~~~~~~l~K~ 133 (133) T protein:vir:10 77 VT-------------LRVGPSKQHHMKVLAQEFGTVKQVADPFIRPALDYNVQTVLRVLTVEIRNGIQNR 133 (133) T ss_pred EE-------------EEecCCCCccceEeeeccCCCCCCCCccchHHHHHhHHHHHHHHHHHHHHHhhcC Confidence 00 0112234567999999999999999999999999999999999999999999887 No 16 >protein:vir:3873 Length: 128 # NCBI annotation: putative head-tail joining protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680490;swissprot:trembl:p94214;genbank:gi:22296530;interpro:IPR010064;uniprot:P94214;genbank:GeneID:951688 Probab=99.96 E-value=4.5e-33 Score=197.90 Aligned_cols=128 Identities=14% Similarity=0.254 Sum_probs=103.8 Q ss_pred ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeeccc Q lcl|NC_019767. 
4 TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVN 83 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~ 83 (149) |+++|+||+||+++|++|+.++ +++.++||++||++++++++.++|+++|.++.+ +++.+.|.+..++ T Consensus 1 m~v~i~Gl~el~~~l~~l~~~~-~k~~~~al~~ga~~~~~~~k~~ap~~~~~~~~~-----------~h~~d~I~~~~~k 68 (128) T protein:vir:38 1 MGVKVTGDAELLANLNKLQFGV-AKEARAAVRDGAQKFADKLKSNTPEWDGETDMS-----------GHLRDDIKLSSVR 68 (128) T ss_pred CccchhhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCcCCCCccc-----------chhhhhhcccccc Confidence 8888999999999999999986 689999999999999999999999987754332 3344444443333 Q ss_pred ccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 84 PRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAID 144 (149) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~ 144 (149) ...+..... ++++..++|||||+||||++|||||||+||+++++++++++|.++|+++|. T Consensus 69 ~~~g~~~~~-VG~~k~~~~y~~f~E~GT~k~~a~pF~~pa~~~~~~~~~~~~~~~l~k~i~ 128 (128) T protein:vir:38 69 ETSGLTEVD-VGYGKDTGWRAHFPNSGTSMQDPQHFIEETQEIMRPVVIAAFLSHLKEGGM 128 (128) T ss_pred ccCceeEEE-eeecCCCceEEeeeccCccCCCCCcchhHHHHHhHHHHHHHHHHHHHhhcC Confidence 333332222 223456789999999999999999999999999999999999999999999 No 17 >protein:vir:1273 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690765;genbank:gi:22855005;genbank:GeneID:955232 Probab=99.95 E-value=2.1e-32 Score=194.25 Aligned_cols=124 Identities=23% Similarity=0.388 Sum_probs=102.9 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcC---CCcccccceecccccccCCccccce Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVR---TGKLKKNVVVVTQKSRRRGEISSGV 77 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~---~g~l~~~i~~~~~~~~~~g~~~~~~ 77 (149) |.+ |+|+||++|++.|++|+.++ ++++++||++||++|.++++.++|++ +|+++++|.++..+....|..... T Consensus 1 M~~--~~i~Gl~el~~~l~~l~~~~-~~~~~~al~~~a~~v~~~~k~~ap~~~~~tg~l~~~I~~~~~k~~~~g~~~v~- 76 (127) T protein:vir:12 1 MAD--MSFDGIDDLTQYFEKIGGDI-EKVEPVALKAGGEIIAERQRSHVNRSDKKQPHMQDNITVSNVRESKDGVRFVA- 76 (127) T ss_pred Cee--eeehhHHHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCChhHHHHhhhccccccccCceeEEE- Confidence 666 56789999999999999987 57899999999999999999999975 689999987654433222221111 Q ss_pred eeecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 78 HIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAID 144 (149) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~ 144 (149) ++.+.+.+|||||+||||++|||||||+||+++++++++++|.++|+++|+ T Consensus 77 ----------------Vg~~~~~~~y~~f~E~GT~~~~a~Pf~~pa~~~~~~~~~~~~~~~~~~~lk 127 (127) T protein:vir:12 77 ----------------VGPNKKVAYRGRFLEWGTSKMPPQPFIEKGGKEGEGPAVELMERILTAPIK 127 (127) T ss_pred ----------------EeeCCCCcceeeeeccCccCCCCCccchHhHHHHHHHHHHHHHHHHHHhcC Confidence 122345689999999999999999999999999999999999999999999 No 18 >protein:vir:94538 Length: 125 # NCBI annotation: putative head to tail joining # Family: family:all:180 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223893;genbank:gi:62327105;genbank:GeneID:5075554 Probab=99.95 E-value=2.5e-31 Score=188.33 Aligned_cols=124 Identities=19% Similarity=0.247 Sum_probs=104.0 Q ss_pred Ccc-ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |++ |+|+|+|+|+|.+.|+++++++. 
+.+..||..+++.|.++++.+||++||+|++||.++..+.. .|.+... T Consensus 1 Ma~~~~i~~~Gld~l~~~L~~~~~~~~-~~v~~al~~~a~~i~~~ak~~ap~~tG~L~~sI~~~~~~~~-~~~~~~~--- 75 (125) T protein:vir:94 1 MANDFNIKFKGVDKLLDEFDISRKELV-PYSVEAMKTSLSRAVEKSKGLARVDTGYMRNNIQQDEVKEE-HGVVTGR--- 75 (125) T ss_pred CCCceeeeehhHHHHHHHHHHhHHHHH-HHHHHHHHHHHHHHHHHHHhhCCCCChhhhhhceecceecc-CCcEEEE--- Confidence 776 89999999999999999999986 45688999999999999999999999999999976543221 1211111 Q ss_pred ecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEV 146 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~ 146 (149) -.++.+||+|+||||++|||||||+||++++++.+++.|.++|+++|++. T Consensus 76 -----------------v~~~~~Ya~~vEfGT~~~~a~Pfl~pa~~~~~~~~~~~l~~~l~~a~k~~ 125 (125) T protein:vir:94 76 -----------------YVARADYSSYNEYGTYRMSAQPFMAPSVAAMTPFFYKAVRDALNKAAKFS 125 (125) T ss_pred -----------------eeCCCCccceeecccccCCCCcccchhHHHHHHHHHHHHHHHHHHHhccC Confidence 11346799999999999999999999999999999999988888888888 No 19 >protein:vir:101594 Length: 173 # NCBI annotation: hypothetical protein # Family: family:all:26502 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112510;genbank:gi:53793610;interpro:IPR010064;uniprot:Q5ZGE3;genbank:GeneID:3101702 Probab=99.94 E-value=5.4e-31 Score=186.52 Aligned_cols=118 Identities=17% Similarity=0.336 Sum_probs=101.7 Q ss_pred eehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeeccccc Q lcl|NC_019767. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPR 85 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~ 85 (149) |+|+|||+|+++|++|++.+ +++++.|+.++|++|+++|+.+||++||+|++||.++.... .+. 
T Consensus 1 i~i~Gld~L~~~L~~l~~~~-~~~~~~a~~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~~~--~~~------------- 64 (173) T protein:vir:10 1 MAVKGVAEVIAELRKIGKDI-DKNINATTEEAANFIEDRAKTLAPKNFGKLAQSISTSDLKA--KDL------------- 64 (173) T ss_pred CcchhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCcCchhhhhcceeeeecc--Cce------------- Confidence 99999999999999999887 57899999999999999999999999999999997653221 111 Q ss_pred ccccccccccCCCCCcceehhcccCCcC---------------------------------------------------- Q lcl|NC_019767. 86 TGNSDNTMKANNPRNAFYWRFVELGTAN---------------------------------------------------- 113 (149) Q Consensus 86 ~~~~~~~~~~~~~~~~~y~~f~E~GT~~---------------------------------------------------- 113 (149) +.+....+.+||.|+||||++ T Consensus 65 -------~~~~v~~~~~Ya~fvEfGT~~m~a~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 137 (173) T protein:vir:10 65 -------ISKKITVNELYGAYMEFGTGAKVSVPKEFADMAASFKGQKTGSFKDGLESIKAWCRAKGIDEKAAYPIFAKIL 137 (173) T ss_pred -------eEEeeCCCcccchhhhcccccccCCCchhhhhhcccccccccccccccccccccccccccchhcccceeeEee Confidence 112234567899999999974 Q ss_pred ---CCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 114 ---MPAHPFVRPAYDTREEEAASVAIARMNQAIDEV 146 (149) Q Consensus 114 ---~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~ 146 (149) |||||||+|||+++++++.+.|++.|+++|+|+ T Consensus 138 ~~G~~aqPFl~PA~~~~~~~~~~~i~~~i~~~lrk~ 173 (173) T protein:vir:10 138 GAGINPQPFLYPAWIEGKKQYLKDLENLLKTYNKKI 173 (173) T ss_pred cCCCCCCccchhHHHHhHHHHHHHHHHHHHHHhhcC Confidence 789999999999999999999999999999999 No 20 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=99.94 E-value=5.8e-30 Score=180.86 Aligned_cols=147 Identities=23% Similarity=0.306 Sum_probs=112.6 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceeccccccc-CCccccceeee Q lcl|NC_019767. 
2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRR-RGEISSGVHIR 80 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~-~g~~~~~~~~~ 80 (149) |+++|.-..|++|.+.|+.|++. .++++++|+.+||++|+++|+.+||+++|.|+++|.+...+... .|...+.|++. T Consensus 1 m~~~~~~~d~s~l~~~l~~l~~~-~~~v~R~A~~~ga~vv~dear~~aP~~tG~LkksI~~~~~~~~s~~g~~~~~Vg~~ 79 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVVEH-SSDVVRTMTYESAVAVRESAKAFVNDETGKLRNNLYVAYSPEESVEGIQTYAVSWR 79 (157) T ss_pred CeeEeecccHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHhCCCCcchhhhheeeeeccccCCCceEEEEEeec Confidence 66665444477999999999865 67899999999999999999999999999999999887655443 24334434333 Q ss_pred ccccc------ccccccccccCCCCCcceehhcccCCcC-CCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 81 GVNPR------TGNSDNTMKANNPRNAFYWRFVELGTAN-MPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 81 ~~~~~------~~~~~~~~~~~~~~~~~y~~f~E~GT~~-~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) ..... -|.........+....|||+|+|+||.. |||||||+|||++.++++.+.+.+++.++|+++++= T Consensus 80 ~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~k~I~e~l~g 155 (157) T protein:vir:97 80 KKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGAKKYAELQRG 155 (157) T ss_pred CCccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHHHHHHHHHHHHHHHHHHHhcC Confidence 22110 0111122223344567889999999865 999999999999999999999999999999999998 No 21 >protein:vir:9708 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795470;genbank:gi:28876221;genbank:GeneID:1257765 Probab=99.93 E-value=8.2e-30 Score=180.04 Aligned_cols=121 Identities=17% Similarity=0.209 Sum_probs=99.5 Q ss_pred ehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCc----ccccceecccccccCCccccceeeecc Q lcl|NC_019767. 
7 DFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGK----LKKNVVVVTQKSRRRGEISSGVHIRGV 82 (149) Q Consensus 7 ~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~----l~~~i~~~~~~~~~~g~~~~~~~~~~~ 82 (149) =|+||+||+++|++|+.++ +++.++||++||++++++++.++|++++. ++++|.++..+....|... T Consensus 1 mv~Gl~el~~~l~~l~~~~-~~~~~~al~~ga~~~~~~~k~~ap~~~~~~~~hl~d~I~~~~~k~~~~g~~~-------- 71 (125) T protein:vir:97 1 MTKGLDEILANLTKLEVKA-PKTAKAAVTEVAKEFEKALKANTPVYEVETDERLQEDTVISGFKGANVGIVS-------- 71 (125) T ss_pred CchhHHHHHHHHHHhhHHH-HHHHHHHHHHHHHHHHHHHHHhCCcCCCCchhhHHhhhhcccccccccCceE-------- Confidence 4899999999999999886 67899999999999999999999998875 6677665443322222111 Q ss_pred cccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 83 NPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDE 145 (149) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k 145 (149) . .++++..++|||||+||||++|||||||+||+++++++++++|.++|+++|.= T Consensus 72 --------~-~VG~~k~~~~y~~f~E~GT~k~~~~pF~~pa~~~~k~~~~~~~~~~~~~~L~l 125 (125) T protein:vir:97 72 --------K-EIGYGKATGWRAHYPNDGTIYQRGQDFKERTINQMTPKAKQLYAEKVKEGLGL 125 (125) T ss_pred --------E-EEeecCCCceeEeeeccCccCCCcCccchHhHHHhHHHHHHHHHHHHHHHhcC Confidence 1 12234456899999999999999999999999999999999999999999876 No 22 >protein:vir:3617 Length: 112 # NCBI annotation: ORF40 # Family: family:all:180 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112703;genbank:gi:13786571;genbank:GeneID:921069 Probab=99.93 E-value=7e-30 Score=180.43 Aligned_cols=112 Identities=22% Similarity=0.389 Sum_probs=94.0 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeec Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~ 81 (149) |+++|+|+|+|+|++.|+++.. .+.++.+|++++.+|+++++.++|++||+|++||.+.....+ .. 
T Consensus 1 M~~~i~i~Gld~l~~~L~~~~~---~~~~~~al~~~~~~i~~~ak~~aPvdTG~Lr~si~~~~~~~~----~~------- 66 (112) T protein:vir:36 1 MKSSLSFKGIDQLVKHLDKAAS---LKGVQQVVKSNTSNMTANMQKLVPVDTGYMKRSIKMELTEGG----FS------- 66 (112) T ss_pred CceeeeehhHHHHHHHHHhhhh---HHHHHHHHHHHHHHHHHHHHHhCCCCchhhhhceeeeecCCc----eE------- Confidence 9999999999999999998754 356789999999999999999999999999999976432211 11 Q ss_pred ccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 82 VNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMN 140 (149) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~ 140 (149) ..+ .++.+||+|+||||++|||||||+||++.+++++.+.|.+.|+ T Consensus 67 ---------~~V----~~~~~Ya~~vE~GT~k~~a~Pfl~pa~~~~~~~~~~~i~~~lr 112 (112) T protein:vir:36 67 ---------GQA----GPHTDYSAYVEYGTRFQSAQPFVKPAYNEQKGVFIKDLERLLK 112 (112) T ss_pred ---------EEe----ecCCCccceeeccccccCCCcchhhhHHHHHHHHHHHHHHHcC Confidence 011 1346799999999999999999999999999998888887777 No 23 >protein:vir:95789 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950593;genbank:gi:119953788;genbank:GeneID:5076859 Probab=99.93 E-value=1.4e-29 Score=178.75 Aligned_cols=114 Identities=17% Similarity=0.211 Sum_probs=98.2 Q ss_pred ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeeccc Q lcl|NC_019767. 4 TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVN 83 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~ 83 (149) |+|+|+|+|+|.+.|+.|++.+.+. ++.||+++|..++++|+.+||++||.|++||.++.. +.. 
T Consensus 1 msi~i~Gld~l~~~l~~~~~~~~~~-v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~------g~~--------- 64 (114) T protein:vir:95 1 MAIKWQGIEKLVATISNAQPKAVEQ-SLQVLKNNGEKGKRIAKQLAPKDTEFLKDHITTSYP------GME--------- 64 (114) T ss_pred CeeeeehHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhCCcCchhhhhceeeecC------ceE--------- Confidence 7778889999999999999998654 588999999999999999999999999999875321 110 Q ss_pred ccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 84 PRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAID 144 (149) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~ 144 (149) ..+ ..+.+||+|+||||++|||||||+||++++++++.+.|.+.++++|+ T Consensus 65 -------~~V----~~~~~Ya~yvE~GT~~~~aqPfl~pa~~~~~~~~~~~l~~~l~~~~k 114 (114) T protein:vir:95 65 -------AHI----HGEAGYDGYQEYGTRFQPGTPHFRPMMEQIQPQFQKDMTDVMKGAFK 114 (114) T ss_pred -------EEe----ecCCCccceeecCccccCCCccchhhHHHHHHHHHHHHHHHHHhhcC Confidence 001 13357999999999999999999999999999999999999999998 No 24 >protein:vir:106623 Length: 115 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239497;genbank:gi:66395260;genbank:GeneID:4555777 Probab=99.92 E-value=8.1e-29 Score=174.60 Aligned_cols=109 Identities=21% Similarity=0.294 Sum_probs=92.2 Q ss_pred eehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC------CcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA------PVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |+|+|||+|++.|+.+++.+. +.++++|++++..++++++.+| |++||.|++||.+.. .|.+... 
T Consensus 1 i~i~Gld~L~~~l~~~~~~~~-~~~~~al~~~~~~i~~~a~~~a~~~~~~pv~TG~Lr~sI~~~~-----~g~~~~~--- 71 (115) T protein:vir:10 1 MQSKGLKKLMNHLKVMHDDIE-DDVDDILKNNAKEGVGIAVSNAKEVMNKGYWTGNLASLIEVKK-----IGDLHYR--- 71 (115) T ss_pred CeehhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeee-----cCcEEEE--- Confidence 999999999999999999875 5668999999999999999998 789999999987531 1211111 Q ss_pred ecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMN 140 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~ 140 (149) -..+++||+|+||||++|||||||+|||+++++.+++.|.+.+. T Consensus 72 -----------------v~~~~~Ya~~vEfGT~km~a~PFl~PA~~~~k~~~~~~i~~~i~ 115 (115) T protein:vir:10 72 -----------------VISTAHYSGFLEFGTRYMEPAPFMFPTYQTLKKSTINDLKRLLS 115 (115) T ss_pred -----------------eeCCCccchheecccccCCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 11346899999999999999999999999999999988888888 No 25 >protein:vir:96358 Length: 115 # NCBI annotation: ORF045 # Family: family:all:180 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239651;genbank:gi:66395408;genbank:GeneID:5132834 Probab=99.92 E-value=8.5e-29 Score=174.48 Aligned_cols=109 Identities=22% Similarity=0.379 Sum_probs=91.9 Q ss_pred eehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC------CcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA------PVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |+|+|||+|++.|+.|++.+. +.++.||.+++..+.++++++| |++||+|++||.++. .|.+.. 
T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-----~g~~~~---- 70 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-----TGDLQY---- 70 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-----cCceEE---- Confidence 999999999999999999985 4568999999999999999998 899999999997641 111111 Q ss_pred ecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMN 140 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~ 140 (149) .+ .++.+||+|+||||++|||||||+|||+++++.+++.|.+.+. T Consensus 71 ------------~v----~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 71 ------------TI----TSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ------------Ee----ecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 01 1335799999999999999999999999999999988887777 No 26 >protein:vir:9312 Length: 115 # NCBI annotation: phi Mu50B-like protein # Family: family:all:180 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803290;genbank:gi:29028600;genbank:GeneID:1258048 Probab=99.92 E-value=8.5e-29 Score=174.48 Aligned_cols=109 Identities=22% Similarity=0.379 Sum_probs=91.9 Q ss_pred eehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC------CcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA------PVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |+|+|||+|++.|+.|++.+. +.++.||.+++..+.++++++| |++||+|++||.++. .|.+.. 
T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-----~g~~~~---- 70 (115) T protein:vir:93 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-----TGDLQY---- 70 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-----cCceEE---- Confidence 999999999999999999985 4568999999999999999998 899999999997641 111111 Q ss_pred ecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMN 140 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~ 140 (149) .+ .++.+||+|+||||++|||||||+|||+++++.+++.|.+.+. T Consensus 71 ------------~v----~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:93 71 ------------TI----TSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ------------Ee----ecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 01 1335799999999999999999999999999999988887777 No 27 >protein:vir:78858 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285365;genbank:gi:148717893;genbank:GeneID:5246989 Probab=99.92 E-value=8.5e-29 Score=174.48 Aligned_cols=109 Identities=22% Similarity=0.379 Sum_probs=91.9 Q ss_pred eehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC------CcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA------PVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |+|+|||+|++.|+.|++.+. +.++.||.+++..+.++++++| |++||+|++||.++. .|.+.. 
T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-----~g~~~~---- 70 (115) T protein:vir:78 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-----TGDLQY---- 70 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-----cCceEE---- Confidence 999999999999999999985 4568999999999999999998 899999999997641 111111 Q ss_pred ecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMN 140 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~ 140 (149) .+ .++.+||+|+||||++|||||||+|||+++++.+++.|.+.+. T Consensus 71 ------------~v----~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:78 71 ------------TI----TSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ------------Ee----ecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 01 1335799999999999999999999999999999988887777 No 28 >protein:vir:103917 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873996;genbank:gi:118430771;genbank:GeneID:4525409 Probab=99.92 E-value=8.5e-29 Score=174.48 Aligned_cols=109 Identities=22% Similarity=0.379 Sum_probs=91.9 Q ss_pred eehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC------CcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA------PVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |+|+|||+|++.|+.|++.+. +.++.||.+++..+.++++++| |++||+|++||.++. .|.+.. 
T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-----~g~~~~---- 70 (115) T protein:vir:10 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-----TGDLQY---- 70 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-----cCceEE---- Confidence 999999999999999999985 4568999999999999999998 899999999997641 111111 Q ss_pred ecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMN 140 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~ 140 (149) .+ .++.+||+|+||||++|||||||+|||+++++.+++.|.+.+. T Consensus 71 ------------~v----~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:10 71 ------------TI----TSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ------------Ee----ecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 01 1335799999999999999999999999999999988887777 No 29 >protein:vir:96225 Length: 115 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239574;genbank:gi:66395330;genbank:GeneID:5132773 Probab=99.92 E-value=8.5e-29 Score=174.48 Aligned_cols=109 Identities=22% Similarity=0.379 Sum_probs=91.9 Q ss_pred eehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC------CcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA------PVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |+|+|||+|++.|+.|++.+. +.++.||.+++..+.++++++| |++||+|++||.++. .|.+.. 
T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-----~g~~~~---- 70 (115) T protein:vir:96 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-----TGDLQY---- 70 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-----cCceEE---- Confidence 999999999999999999985 4568999999999999999998 899999999997641 111111 Q ss_pred ecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMN 140 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~ 140 (149) .+ .++.+||+|+||||++|||||||+|||+++++.+++.|.+.+. T Consensus 71 ------------~v----~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:96 71 ------------TI----TSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ------------Ee----ecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 01 1335799999999999999999999999999999988887777 No 30 >protein:vir:97144 Length: 115 # NCBI annotation: ORF047 # Family: family:all:180 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239729;genbank:gi:66394911;genbank:GeneID:5130877 Probab=99.92 E-value=8.5e-29 Score=174.48 Aligned_cols=109 Identities=22% Similarity=0.379 Sum_probs=91.9 Q ss_pred eehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC------CcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA------PVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |+|+|||+|++.|+.|++.+. +.++.||.+++..+.++++++| |++||+|++||.++. .|.+.. 
T Consensus 1 i~~~Gld~l~~~l~~~~~~~~-~~v~~a~~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~sI~~~~-----~g~~~~---- 70 (115) T protein:vir:97 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-----TGDLQY---- 70 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccccCCCCCCchhhhhcceeee-----cCceEE---- Confidence 999999999999999999985 4568999999999999999998 899999999997641 111111 Q ss_pred ecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMN 140 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~ 140 (149) .+ .++.+||+|+||||++|||||||+|||+++++.+++.|.+.+. T Consensus 71 ------------~v----~~~~~Ya~~vE~GT~km~a~Pfl~PA~~~~~~~~~~~i~~~~k 115 (115) T protein:vir:97 71 ------------TI----TSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKALFE 115 (115) T ss_pred ------------Ee----ecCccchhhhcccccccCCCCchhhhHHHHHHHHHHHHHHHhC Confidence 01 1335799999999999999999999999999999988887777 No 31 >protein:vir:99744 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004311;genbank:gi:122891765;genbank:GeneID:4712299 Probab=99.91 E-value=1.9e-28 Score=172.56 Aligned_cols=109 Identities=20% Similarity=0.327 Sum_probs=92.8 Q ss_pred eehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC------CcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA------PVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~a------P~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |+|+|||+|.+.|+.|++.+. +.+++||++++..++++|+.+| |++||.|++||.+.. .|++.. 
T Consensus 1 i~i~Gld~L~~~l~~~~~~~~-~~v~~av~~~~~~i~~~a~~~a~~~~~~p~~TG~Lr~SI~~~~-----~g~~~~---- 70 (115) T protein:vir:99 1 MNIDGLDALLNQFHDMKTNID-DDVDDILQENAKEYVVRAKLKAREVMNKGYWTGNLSRNIRYKK-----TVDLQY---- 70 (115) T ss_pred CcchhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccccCCCCcchhhhhceeeee-----cCcEEE---- Confidence 999999999999999999875 6679999999999999999998 999999999997642 111111 Q ss_pred ecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMN 140 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~ 140 (149) . -.++++||+|+||||++|+|||||+|||+++++.+++.|.+.++ T Consensus 71 ------------~----V~~~~~Ya~~vE~GT~~m~a~PFl~PA~~~~k~~~~~~l~~~~k 115 (115) T protein:vir:99 71 ------------T----ITSHAAYSGFLEFGTRYMEAEPFMWPVYEVIRKSTVEELKTLFE 115 (115) T ss_pred ------------E----ecCCccccccccccccccCCCCcchhhHHHHHHHHHHHHHHHhC Confidence 1 11346899999999999999999999999999999998887777 No 32 >protein:vir:9930 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795692;genbank:gi:28876456;genbank:GeneID:1257995 Probab=99.91 E-value=3.2e-28 Score=171.29 Aligned_cols=108 Identities=20% Similarity=0.284 Sum_probs=91.7 Q ss_pred hhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeeccccccc Q lcl|NC_019767. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTG 87 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~ 87 (149) |+|+|+|++.|++++..+. +.++.||.++|..++++|+.++|++||.|++||.+... +... 
T Consensus 1 i~Gld~l~~~l~~~~~~~~-~~v~~al~~~a~~i~~~ak~~aPv~TG~Lr~sI~~~~~-----~~~~------------- 61 (108) T protein:vir:99 1 MRGLDRFLRSVERKQKSVR-IAVDKELSKSAARIERQAKILAPVDTGWLRAQIYSEQQ-----RLLH------------- 61 (108) T ss_pred CchHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhcCCcCchhhhcceeeeec-----CcEE------------- Confidence 9999999999999999975 56699999999999999999999999999999975431 1111 Q ss_pred ccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 88 NSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 88 ~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~ 141 (149) ..+ .++.+||+|+||||++|||||||+||++++++++.+.|.+.|++ T Consensus 62 ---~~v----~~~~~Ya~~vE~GT~~m~a~Pf~~pa~~~~~~~~~~~i~~~lrk 108 (108) T protein:vir:99 62 ---YRV----VSPALYSIYLELGTRKMEAQSFLDPALRKEWPVLMANIKKMFKR 108 (108) T ss_pred ---EEe----ecCcccchhcccCccccCCCcchhhhHHHHHHHHHHHHHHHhcC Confidence 001 13467999999999999999999999999999888888877777 No 33 >protein:vir:81106 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429878;genbank:gi:156603931;genbank:GeneID:5525326 Probab=99.91 E-value=8.2e-28 Score=169.09 Aligned_cols=122 Identities=14% Similarity=0.156 Sum_probs=92.1 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCc--ccccceecccccccCCccccceee Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGK--LKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~--l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |+++++++| |...|++|+.+. +++.+.||++||+++++.++.++|++++. ++++|.++..+... 
T Consensus 1 M~v~v~~~~---L~~~l~~l~~~~-~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~---------- 66 (125) T protein:vir:81 1 MGARIESNN---IEQGLKNAVLKM-NLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDR---------- 66 (125) T ss_pred CeeEeeHHH---HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeeccccccc---------- Confidence 888877765 555556666664 56778999999999999999999998664 78888765433221 Q ss_pred eccccccccccccc-ccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 80 RGVNPRTGNSDNTM-KANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAID 144 (149) Q Consensus 80 ~~~~~~~~~~~~~~-~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~ 144 (149) +.....+ ++++...+|||||+||||++||||||++||+++++++++++|.++|++-.+ T Consensus 67 -------~~g~~~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:81 67 -------HTSEKIVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred -------ccceEEEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 1111111 123345579999999999999999999999999999999999999965554 No 34 >protein:vir:98342 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918934;genbank:gi:119443696;genbank:GeneID:4594504 Probab=99.91 E-value=8.2e-28 Score=169.09 Aligned_cols=122 Identities=14% Similarity=0.156 Sum_probs=92.1 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCc--ccccceecccccccCCccccceee Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGK--LKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~--l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |+++++++| |...|++|+.+. +++.+.||++||+++++.++.++|++++. ++++|.++..+... 
T Consensus 1 M~v~v~~~~---L~~~l~~l~~~~-~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~---------- 66 (125) T protein:vir:98 1 MGARIESNN---IEQGLKNAVLKM-NLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDR---------- 66 (125) T ss_pred CeeEeeHHH---HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeeccccccc---------- Confidence 888877765 555556666664 56778999999999999999999998664 78888765433221 Q ss_pred eccccccccccccc-ccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 80 RGVNPRTGNSDNTM-KANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAID 144 (149) Q Consensus 80 ~~~~~~~~~~~~~~-~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~ 144 (149) +.....+ ++++...+|||||+||||++||||||++||+++++++++++|.++|++-.+ T Consensus 67 -------~~g~~~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:98 67 -------HTSEKIVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred -------ccceEEEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 1111111 123345579999999999999999999999999999999999999965554 No 35 >protein:vir:79988 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430006;genbank:gi:156604061;genbank:GeneID:5525448 Probab=99.91 E-value=8.2e-28 Score=169.09 Aligned_cols=122 Identities=14% Similarity=0.156 Sum_probs=92.1 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCc--ccccceecccccccCCccccceee Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGK--LKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~--l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |+++++++| |...|++|+.+. +++.+.||++||+++++.++.++|++++. ++++|.++..+... 
T Consensus 1 M~v~v~~~~---L~~~l~~l~~~~-~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~---------- 66 (125) T protein:vir:79 1 MGARIESNN---IEQGLKNAVLKM-NLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDR---------- 66 (125) T ss_pred CeeEeeHHH---HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeeccccccc---------- Confidence 888877765 555556666664 56778999999999999999999998664 78888765433221 Q ss_pred eccccccccccccc-ccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 80 RGVNPRTGNSDNTM-KANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAID 144 (149) Q Consensus 80 ~~~~~~~~~~~~~~-~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~ 144 (149) +.....+ ++++...+|||||+||||++||||||++||+++++++++++|.++|++-.+ T Consensus 67 -------~~g~~~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:79 67 -------HTSEKIVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred -------ccceEEEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 1111111 123345579999999999999999999999999999999999999965554 No 36 >protein:vir:4704 Length: 125 # NCBI annotation: phi PVL ORF 11 homologue # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061636;genbank:gi:9635723;genbank:GeneID:1262995 Probab=99.91 E-value=8.2e-28 Score=169.09 Aligned_cols=122 Identities=14% Similarity=0.156 Sum_probs=92.1 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCc--ccccceecccccccCCccccceee Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGK--LKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~--l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |+++++++| |...|++|+.+. +++.+.||++||+++++.++.++|++++. ++++|.++..+... 
T Consensus 1 M~v~v~~~~---L~~~l~~l~~~~-~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~---------- 66 (125) T protein:vir:47 1 MGARIESNN---IEQGLKNAVLKM-NLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDR---------- 66 (125) T ss_pred CeeEeeHHH---HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeeccccccc---------- Confidence 888877765 555556666664 56778999999999999999999998664 78888765433221 Q ss_pred eccccccccccccc-ccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 80 RGVNPRTGNSDNTM-KANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAID 144 (149) Q Consensus 80 ~~~~~~~~~~~~~~-~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~ 144 (149) +.....+ ++++...+|||||+||||++||||||++||+++++++++++|.++|++-.+ T Consensus 67 -------~~g~~~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:47 67 -------HTSEKIVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred -------ccceEEEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 1111111 123345579999999999999999999999999999999999999965554 No 37 >protein:vir:9414 Length: 125 # NCBI annotation: phi PVL orf 11-like protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803392;genbank:gi:29028704;genbank:GeneID:1258141 Probab=99.91 E-value=8.2e-28 Score=169.09 Aligned_cols=122 Identities=14% Similarity=0.156 Sum_probs=92.1 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCc--ccccceecccccccCCccccceee Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGK--LKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~--l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |+++++++| |...|++|+.+. +++.+.||++||+++++.++.++|++++. ++++|.++..+... 
T Consensus 1 M~v~v~~~~---L~~~l~~l~~~~-~k~~~~Al~aga~~~~e~l~~~aP~~~~~~hl~d~I~vs~~k~~~---------- 66 (125) T protein:vir:94 1 MGARIESNN---IEQGLKNAVLKM-NLNSNVIVKAGAMSLVPLLKSNTPFANTKKHARDHIAVSNVKTDR---------- 66 (125) T ss_pred CeeEeeHHH---HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhCCCCCCCchhhhheeeccccccc---------- Confidence 888877765 555556666664 56778999999999999999999998664 78888765433221 Q ss_pred eccccccccccccc-ccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 80 RGVNPRTGNSDNTM-KANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAID 144 (149) Q Consensus 80 ~~~~~~~~~~~~~~-~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~ 144 (149) +.....+ ++++...+|||||+||||++||||||++||+++++++++++|.++|++-.+ T Consensus 67 -------~~g~~~v~VG~~k~~~~~a~F~E~GT~k~~a~pF~~~a~~~~~~ev~~~~~~~lrk~~k 125 (125) T protein:vir:94 67 -------HTSEKIVTIGYAKGVSHRIHATEFGTMYQKPQLFITKTEKQGKNKVLKTMLDTAKRLQK 125 (125) T ss_pred -------ccceEEEEeccCCCCceEEEeccCCccCCCCCchhhHHHHHhHHHHHHHHHHHHHHHhC Confidence 1111111 123345579999999999999999999999999999999999999965554 No 38 >protein:vir:743 Length: 108 # NCBI annotation: unknown # Family: family:all:180 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108720;genbank:gi:13487842;genbank:GeneID:920877 Probab=99.91 E-value=3e-28 Score=171.47 Aligned_cols=108 Identities=20% Similarity=0.359 Sum_probs=89.8 Q ss_pred eehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeeccccc Q lcl|NC_019767. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPR 85 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~ 85 (149) |+|+|||+|.+.|+++.. 
...+++||+++|..|+++|+.+||++||+|++||.+.....+..+ T Consensus 1 i~i~Gld~l~~~l~~~~~---~~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~si~~~~~~~~~~~-------------- 63 (108) T protein:vir:74 1 MKITGIDALQKKLRKNAT---LDDVKHVVKSNTASMNKNMQNLAPVDTGNMKRSITSEFTDGGLSG-------------- 63 (108) T ss_pred CcchhHHHHHHHHHHhhh---HHHHHHHHHHHHHHHHHHHHHhCCCCchhhhccceeeeecCceEE-------------- Confidence 999999999999998753 356789999999999999999999999999999976543221110 Q ss_pred ccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 86 TGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMN 140 (149) Q Consensus 86 ~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~ 140 (149) .+ .+..+||+|+||||++|||||||+||++++++++.+.|.+.++ T Consensus 64 ------~V----~~~~~Ya~~vE~GT~km~aqpf~~pa~~~~~~~~~~~i~~~~k 108 (108) T protein:vir:74 64 ------TT----GPHTDYAGYVEYGTRFQSAQPFVKPAFNIQKKVFTNDLERLTK 108 (108) T ss_pred ------Ee----ecCCCcccceeccccccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 01 1345699999999999999999999999999998888877777 No 39 >protein:vir:98409 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:YP_001210363;genbank:gi:146334932;genbank:GeneID:5114801 Probab=99.90 E-value=7.9e-28 Score=169.17 Aligned_cols=108 Identities=20% Similarity=0.354 Sum_probs=89.3 Q ss_pred eehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeeccccc Q lcl|NC_019767. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPR 85 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~ 85 (149) |+|+|||+|.+.|+.+.. ...++++|+++|.+++++|+.+||++||+|++||.+....+. ... 
T Consensus 1 i~i~Gld~l~~~l~~~~~---~~~~~~al~~~a~~i~~~ak~~apvdTG~Lr~si~~~~~~~~----~~~---------- 63 (108) T protein:vir:98 1 MKITGIDALQKKLRKNAT---LNDVKHVVKRNTVSMNKNMQNLAPVDTGNMKRSITSEFTDGG----LTG---------- 63 (108) T ss_pred CcchhHHHHHHHHHHhhh---HHHHHHHHHHHHHHHHHHHHHhCCCCchhhHhhceeeeecCc----eEE---------- Confidence 999999999999998753 345789999999999999999999999999999975432211 110 Q ss_pred ccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 86 TGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMN 140 (149) Q Consensus 86 ~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~ 140 (149) .+ .+..+||+|+||||++|||||||+||++.+++++.+.|.+.++ T Consensus 64 ------~V----~~~~~Ya~~vE~GT~~m~aqPFl~pa~~~~~~~~~~~i~~~lr 108 (108) T protein:vir:98 64 ------TT----IPHTDYAGYVEYGTRFQAAQPFVKPAFDVQKKIFTNDLERLTK 108 (108) T ss_pred ------Ee----ecCCCccceeeccccccCCCcchhhHHHHHHHHHHHHHHHHcC Confidence 01 1335799999999999999999999999999998888887777 No 40 >protein:vir:102154 Length: 119 # NCBI annotation: phage protein, HK97 gp10 family # Family: family:all:10671 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699937;genbank:gi:110804042;genbank:GeneID:4206698 Probab=99.90 E-value=1.3e-27 Score=168.07 Aligned_cols=118 Identities=21% Similarity=0.372 Sum_probs=98.4 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |+ ++++.|+|+|...|++|+... +++.++||++|+++|++++..++|++||.++. |..+-.+. 
| T Consensus 1 Ma--~iel~G~del~~~l~~~g~~~-~~ie~kAlk~g~e~I~~~~~~n~P~~tg~lkk-ik~~~kk~---g--------- 64 (119) T protein:vir:10 1 MA--SLEIEGFEEFEKFISEDMVLD-ESTKRKGIKAGITKIGKAIEKNSPIKSGRLSK-VKIRVKNT---G--------- 64 (119) T ss_pred Cc--eeehhhHHHHHHHHHhhhhhh-HHHHHHHHHHHhHHHHHHHhhcCCcccCCcce-eeeeeecC---c--------- Confidence 55 556889999999999999774 78999999999999999999999999999885 32211111 1 Q ss_pred cccccccccccccccCCCCCcceehhcccCCcCCCCC-cchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAH-PFVRPAYDTREEEAASVAIARMNQAID 144 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~-PFl~pA~~~~~~~~~~~~~~~l~~~i~ 144 (149) ...++.+.+..||..|+|||||+|||| |||.||+++++++|++.|.++|.+.++ T Consensus 65 ----------~~~VG~~ks~~fy~kF~EFGTSkm~a~~pF~~~a~~~~~~eA~~~~~~el~~~~r 119 (119) T protein:vir:10 65 ----------LATEGTASSSEFYDIFQNFGTSEQKAHVGYFDRAVDETTNEAVEEVAEIIFRKMR 119 (119) T ss_pred ----------eeEeccCCcchhhhhhccccccccCCCCCccccccccChHHHHHHHHHHHHHhcC Confidence 112223446789999999999999999 999999999999999999999999999 No 41 >protein:vir:4906 Length: 114 # NCBI annotation: gp114 # Family: family:all:180 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056684;genbank:gi:9635019;genbank:GeneID:1262668 Probab=99.89 E-value=7.9e-27 Score=163.69 Aligned_cols=112 Identities=22% Similarity=0.346 Sum_probs=88.7 Q ss_pred CccceeehhhHHHHHHHHHHhH--HHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCcccccee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALS--RAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~--~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~ 78 (149) |. +|+|+|||+|++.|+++. .++ ++++++++...++.+++.|+.++|++||+|++||.+.... ++. 
T Consensus 1 Ma--~i~~~Gld~l~~~L~~~~~~~~v-~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~----~~~----- 68 (114) T protein:vir:49 1 MA--TIEFEGLDEMAQSLLKNASPEKR-SKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVES----DKA----- 68 (114) T ss_pred Ce--eeeeehHHHHHHHHHHhcCHHHH-HHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecC----Cee----- Confidence 55 477889999999999983 333 5677777777777777777778899999999999764321 110 Q ss_pred eecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~ 141 (149) .+ ..+.+||+|+||||++|||||||+||++.+++.+++.|.+.+.. T Consensus 69 -------------~V----~~~~~Ya~~vEfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:49 69 -------------TV----EALTSYSGYLEVGTRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred -------------Ee----cCCCCccceecccccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 00 13357999999999999999999999999999999998888887 No 42 >protein:vir:2740 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695113;genbank:gi:23455882;genbank:GeneID:955595 Probab=99.89 E-value=7.9e-27 Score=163.69 Aligned_cols=112 Identities=22% Similarity=0.346 Sum_probs=88.7 Q ss_pred CccceeehhhHHHHHHHHHHhH--HHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCcccccee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALS--RAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~--~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~ 78 (149) |. +|+|+|||+|++.|+++. .++ ++++++++...++.+++.|+.++|++||+|++||.+.... ++. 
T Consensus 1 Ma--~i~~~Gld~l~~~L~~~~~~~~v-~~~~~~~~~~~~~~~~~~a~~~~p~~TG~Lr~sI~~~~~~----~~~----- 68 (114) T protein:vir:27 1 MA--TIEFEGLDEMAQSLLKNASPEKR-SKVLRKYGSKLKEAAVNRAQFNKGYSTGATRRSITLQVES----DKA----- 68 (114) T ss_pred Ce--eeeeehHHHHHHHHHHhcCHHHH-HHHHHHHHHHHHHHHHHhcccCCCCCchhhhhceeeeecC----Cee----- Confidence 55 477889999999999983 333 5677777777777777777778899999999999764321 110 Q ss_pred eecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~ 141 (149) .+ ..+.+||+|+||||++|||||||+||++.+++.+++.|.+.+.. T Consensus 69 -------------~V----~~~~~Ya~~vEfGT~km~a~Pfl~PA~~~~~~~~~~~l~~l~k~ 114 (114) T protein:vir:27 69 -------------TV----EALTSYSGYLEVGTRKMEAQPFMKPALDEVAPKMVEELAKWDET 114 (114) T ss_pred -------------Ee----cCCCCccceecccccccCCCCchhhhHHHHHHHHHHHHHHHhcC Confidence 00 13357999999999999999999999999999999998888887 No 43 >protein:vir:96486 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238496;genbank:gi:66391772;genbank:GeneID:5176908 Probab=99.88 E-value=1.7e-26 Score=161.80 Aligned_cols=110 Identities=25% Similarity=0.359 Sum_probs=89.1 Q ss_pred CccceeehhhHHHHHHHHHHh--HHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCcccccee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEAL--SRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l--~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~ 78 (149) |.+ |+|.|||+|++.|+.+ +.++ ++++++++.+.+..+++.|+.++|++||+|++||.+.. |+.. 
T Consensus 1 Ma~--i~i~Gld~L~~~l~~~~~~~~v-~~~v~~~~~~~~~~~~~~a~~~apvdTG~Lr~sI~~~~------~~~~---- 67 (112) T protein:vir:96 1 MAT--IEFEGLDEMAQSLLKNASSERR-SKVLRKYGAKLKEAAVSKAQFKKGYSTGATRRSITLEA------GSDR---- 67 (112) T ss_pred Cce--eeehHHHHHHHHHHhhcCHHHH-HHHHHHHHHHHHHHHHHHhhhcCCCCchhhhhceeeec------CceE---- Confidence 555 5688999999999998 4454 57889999999999999999999999999999997532 1111 Q ss_pred eecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARM 139 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l 139 (149) ..+ .++.+||+|+||||++|||||||+|||+++++.+++.+++-- T Consensus 68 ------------~~v----~~~~~Ya~~vE~GTr~m~AqPF~~PA~~~~~~~~~~~l~~L~ 112 (112) T protein:vir:96 68 ------------AVV----EALTNYSGYLEVGTRKMEAQPFMRPALDQVVPEMVEEMAKWE 112 (112) T ss_pred ------------EEe----cCCCCccceeccCccccCCCCchhhhHHHHHHHHHHHHHhcC Confidence 001 133579999999999999999999999999998877776543 No 44 >protein:vir:106570 Length: 182 # NCBI annotation: putative protein # Family: family:all:6475 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958588;genbank:gi:41179258;genbank:GeneID:2717106 Probab=99.88 E-value=5.4e-26 Score=159.09 Aligned_cols=147 Identities=18% Similarity=0.234 Sum_probs=97.7 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHH---HHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccc--cCCc--- Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENN---KVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSR--RRGE--- 72 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~---k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~--~~g~--- 72 (149) ||.+ +|.|+|+|.++|++|++.+.+ +++..++.+++..++++|+.++|++||+|++||........ ..+. 
T Consensus 1 m~~v--~i~Gld~L~~kl~~~~~~~~~~v~~a~~~~~~~~a~~v~~~ak~~~PvdtG~Lr~SI~~~~~~~~~~~~g~V~~ 78 (182) T protein:vir:10 1 MIEV--ELKGVNELRAKLKKLPDIMAKATANAQENAIEQAEAYAVDELQSSIKYSTGELTRSFKHEVKVDGDEVIGRWWN 78 (182) T ss_pred CeEE--EEecHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCchhhhhceeeeeeecCCeEEEEeec Confidence 8776 577999999999999987754 23455556677788899999999999999999975433221 1111 Q ss_pred -cccceeeeccccccccccc-------ccccCCCCCcceehhcc-------------------cCCcCCCCCcchhHHHH Q lcl|NC_019767. 73 -ISSGVHIRGVNPRTGNSDN-------TMKANNPRNAFYWRFVE-------------------LGTANMPAHPFVRPAYD 125 (149) Q Consensus 73 -~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~y~~f~E-------------------~GT~~~~a~PFl~pA~~ 125 (149) ..+.+++..+.+..+.... .......+..+|++.++ |+|+.|||||||+|||+ T Consensus 79 ~~~ya~yvE~GTG~~~~~~~~~~~p~~~~~~~~~~w~~~~~~v~~~~a~~~~~~~~~~~~~~~~~t~G~~aqPFl~pA~~ 158 (182) T protein:vir:10 79 SSMVAVFREFGTGLVGERSHKQLPKNVAIIYRQTPWFFPVDSVDLDLTKIYGIPKIKINGKYFYRTTGQPARQFMTPAAN 158 (182) T ss_pred CCCccceeecCcccccccCccccCccceeeeecCCceeeccccccccccccccceeeecCceEeecCCCCCCcchHHHHH Confidence 1222333222111110000 00111111112233333 55789999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 126 TREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 126 ~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) ++++++.+.|.+.++++|++.++= T Consensus 159 ~~~~~i~~~i~~~i~~~l~~~~g~ 182 (182) T protein:vir:10 159 KMAKEAPEIIKRSIDQELHDKLGG 182 (182) T ss_pred HhHHHHHHHHHHHHHHHHHHhhcC Confidence 999999999999999999999888 No 45 >protein:vir:5978 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690678;genbank:geneid:6329146;genbank:gi:22855072;interpro:IPR011693;uniprot:O48447;genbank:GeneID:955318 Probab=99.84 E-value=4.1e-24 Score=148.78 Aligned_cols=115 Identities=23% Similarity=0.259 Sum_probs=91.5 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) +|+++++++|+++|.+.|+.+++.+. +.++.+|..+|+.++++++.++|++||+|++||.+.....+..+.+ T Consensus 3 ~ms~~i~~~g~~~l~~~l~~~~~~~~-~~v~~~l~~~a~~i~~~ak~~apv~TG~Lr~SI~~~~~~~g~~~~V------- 74 (144) T protein:vir:59 3 LMSVRIDPSWRRIMSRNVRTFSGHVL-TQVEQVIIKTAEKIAGLAASLAPVDEGNLKNSIQIDYKNNGLTAEI------- 74 (144) T ss_pred cceeeehhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeEEeecCcEEEEE------- Confidence 56667788999999999999999985 5678999999999999999999999999999997653222111111 Q ss_pred cccccccccccccccCCCCCcceehhcccCC---------------------------cCCCCCcchhHHHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGT---------------------------ANMPAHPFVRPAYDTREEEAAS 133 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT---------------------------~~~~a~PFl~pA~~~~~~~~~~ 133 (149) ..+..|+.|+|||| ..|||||||+||++.+++.+.+ T Consensus 75 -----------------~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~~~~ 137 (144) T protein:vir:59 75 -----------------TVGAEYAIYVEYGTGIYAVDGNGRKTPWTYYSPKLGRYVRTQGAPAQPFFWPAVEEGGEYFER 137 (144) T ss_pred -----------------ecCCCccchhhcCccccccCCCccccccccccccccceecCCCCCCCcchhHHHHHHHHHHHH Confidence 11234777777776 5699999999999999998888 Q ss_pred HHHHHHH Q lcl|NC_019767. 134 VAIARMN 140 (149) Q Consensus 134 ~~~~~l~ 140 (149) .|++.+- T Consensus 138 ~i~~~~g 144 (144) T protein:vir:59 138 EMRRLRG 144 (144) T ss_pred HHHHhcC Confidence 7777666 No 46 >protein:vir:93738 Length: 137 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240463;genbank:gi:66396153;genbank:GeneID:5133507 Probab=99.83 E-value=7e-24 Score=147.54 Aligned_cols=108 Identities=24% Similarity=0.292 Sum_probs=88.1 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |.++ +.|+++|.+.|+++++++. +.++.++..+|..++++|+.++|++||.|++||.+........+. T Consensus 1 Ma~~---~~g~~~l~~~l~~~~~~~~-~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~-------- 68 (137) T protein:vir:93 1 MAKV---KYGNWDLVKELENYERDME-RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGV-------- 68 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEE-------- Confidence 5444 5699999999999999985 678999999999999999999999999999999754322211110 Q ss_pred cccccccccccccccCCCCCcceehhcccCC-----------------------------cCCCCCcchhHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGT-----------------------------ANMPAHPFVRPAYDTREEEA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT-----------------------------~~~~a~PFl~pA~~~~~~~~ 131 (149) + ..+..|++|+|||| +.|||||||+||++.+++.+ T Consensus 69 -------------V---~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~ 132 (137) T protein:vir:93 69 -------------I---NIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFF 132 (137) T ss_pred -------------E---ecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHH Confidence 0 12345888888888 67999999999999999999 Q ss_pred HHHHH Q lcl|NC_019767. 132 ASVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 133 ~~~l~ 137 (137) T protein:vir:93 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 99999 No 47 >protein:vir:94490 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240680;genbank:gi:66396374;genbank:GeneID:5133754 Probab=99.83 E-value=7e-24 Score=147.54 Aligned_cols=108 Identities=24% Similarity=0.292 Sum_probs=88.1 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |.++ +.|+++|.+.|+++++++. +.++.++..+|..++++|+.++|++||.|++||.+........+. T Consensus 1 Ma~~---~~g~~~l~~~l~~~~~~~~-~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~-------- 68 (137) T protein:vir:94 1 MAKV---KYGNWDLVKELENYERDME-RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGV-------- 68 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEE-------- Confidence 5444 5699999999999999985 678999999999999999999999999999999754322211110 Q ss_pred cccccccccccccccCCCCCcceehhcccCC-----------------------------cCCCCCcchhHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGT-----------------------------ANMPAHPFVRPAYDTREEEA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT-----------------------------~~~~a~PFl~pA~~~~~~~~ 131 (149) + ..+..|++|+|||| +.|||||||+||++.+++.+ T Consensus 69 -------------V---~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~ 132 (137) T protein:vir:94 69 -------------I---NIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFF 132 (137) T ss_pred -------------E---ecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHH Confidence 0 12345888888888 67999999999999999999 Q ss_pred HHHHH Q lcl|NC_019767. 132 ASVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 133 ~~~l~ 137 (137) T protein:vir:94 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 99999 No 48 >protein:vir:97427 Length: 137 # NCBI annotation: ORF043 # Family: family:all:180 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240753;genbank:gi:66396447;genbank:GeneID:5133783 Probab=99.83 E-value=7e-24 Score=147.54 Aligned_cols=108 Identities=24% Similarity=0.292 Sum_probs=88.1 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |.++ +.|+++|.+.|+++++++. +.++.++..+|..++++|+.++|++||.|++||.+........+. T Consensus 1 Ma~~---~~g~~~l~~~l~~~~~~~~-~~~~~~~~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~-------- 68 (137) T protein:vir:97 1 MAKV---KYGNWDLVKELENYERDME-RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDSGFTGV-------- 68 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccccchhccceeEeecCceEEE-------- Confidence 5444 5699999999999999985 678999999999999999999999999999999754322211110 Q ss_pred cccccccccccccccCCCCCcceehhcccCC-----------------------------cCCCCCcchhHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGT-----------------------------ANMPAHPFVRPAYDTREEEA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT-----------------------------~~~~a~PFl~pA~~~~~~~~ 131 (149) + ..+..|++|+|||| +.|||||||+||++.+++.+ T Consensus 69 -------------V---~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~ 132 (137) T protein:vir:97 69 -------------I---NIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFF 132 (137) T ss_pred -------------E---ecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHH Confidence 0 12345888888888 67999999999999999999 Q ss_pred HHHHH Q lcl|NC_019767. 132 ASVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 133 ~~~l~ 137 (137) T protein:vir:97 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 99999 No 49 >protein:vir:95894 Length: 137 # NCBI annotation: ORF046 # Family: family:all:180 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240389;genbank:gi:66396083;genbank:GeneID:5133405 Probab=99.82 E-value=1.2e-23 Score=146.29 Aligned_cols=108 Identities=23% Similarity=0.286 Sum_probs=87.6 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |+++ +.|+++|.+.|+++++++. ++++.++..++..++++|+.++|++||.|++||.......+..+. T Consensus 1 Ma~~---~~G~~~l~~~l~~~~~~~~-~~~~~~~~~~a~~v~~~ak~~aPv~TG~L~~Si~~~~~~~~~~~~-------- 68 (137) T protein:vir:95 1 MAKV---KYGNWDLVKELENYERDME-RWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGV-------- 68 (137) T ss_pred Cchh---HHhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcCeeeEeeCCceEEE-------- Confidence 5554 4799999999999999974 788999999999999999999999999999999754322111111 Q ss_pred cccccccccccccccCCCCCcceehhcccCC-----------------------------cCCCCCcchhHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGT-----------------------------ANMPAHPFVRPAYDTREEEA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT-----------------------------~~~~a~PFl~pA~~~~~~~~ 131 (149) + .++..|+.|+|||| +.|||||||+||++.+++++ T Consensus 69 -------------V---~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i 132 (137) T protein:vir:95 69 -------------I---NIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFF 132 (137) T ss_pred -------------E---ecCCCcccccccCccccccCCCcccccccccceeccCcceeecCCCCCCcchHHHHHHHHHHH Confidence 0 12345788888887 67999999999999999999 Q ss_pred HHHHH Q lcl|NC_019767. 132 ASVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 133 ~k~l~ 137 (137) T protein:vir:95 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 99998 No 50 >protein:vir:94796 Length: 137 # NCBI annotation: ORF050 # Family: family:all:180 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240540;genbank:gi:66396237;genbank:GeneID:5133576 Probab=99.82 E-value=1e-23 Score=146.56 Aligned_cols=108 Identities=24% Similarity=0.302 Sum_probs=88.4 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |.+++ .|+|+|.+.|+++++++. +.++.+|..+|..++++|+.++|++||+|++||.+.....+..+.+ T Consensus 1 Ma~~~---~G~~~l~~~L~~~~~~~~-~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V------- 69 (137) T protein:vir:94 1 MAKVK---YGNWDLVKELENYERDIE-RWVKRGIAKTTVKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVI------- 69 (137) T ss_pred CchhH---HhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCceeEeecCcEEEEE------- Confidence 77775 499999999999999985 6789999999999999999999999999999997643222111110 Q ss_pred cccccccccccccccCCCCCcceehhcccC-----------------------------CcCCCCCcchhHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELG-----------------------------TANMPAHPFVRPAYDTREEEA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~G-----------------------------T~~~~a~PFl~pA~~~~~~~~ 131 (149) ..+..|+.|+||| |+.|||||||+||++.+++++ T Consensus 70 -----------------~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~~ 132 (137) T protein:vir:94 70 -----------------NIGSEYAIYVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRVFF 132 (137) T ss_pred -----------------ecCCCcccccccCccccccCCCcccccccccceeccCCceeecCCcCCCcchHHHHHHHHHHH Confidence 1234588888888 567999999999999999999 Q ss_pred HHHHH Q lcl|NC_019767. 132 ASVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 133 ~~~l~ 137 (137) T protein:vir:94 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 99999 No 51 >protein:vir:94108 Length: 149 # NCBI annotation: ORF029 # Family: family:all:180 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240238;genbank:gi:66395914;genbank:GeneID:5133277 Probab=99.82 E-value=8.9e-24 Score=146.96 Aligned_cols=108 Identities=19% Similarity=0.243 Sum_probs=86.9 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |.++. .|+|+|.+.|+++++++. ++++.++..+|..|+++|+.++|++||.|++||.+.....+..+. T Consensus 13 Ma~~~---~Gld~l~~~L~~~~~~~~-~~~~~al~~~a~~v~~~ak~~aPvdTG~Lr~SI~~~~~~~g~~~~-------- 80 (149) T protein:vir:94 13 MAKVK---YGADSMVVELDKFDKKIE-EWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSV-------- 80 (149) T ss_pred HHHHH---HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcccchhhcCeeEEeeCCcEEEE-------- Confidence 87754 499999999999999985 688999999999999999999999999999999764322111110 Q ss_pred cccccccccccccccCCCCCcceehhcccCC-----------------------------cCCCCCcchhHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGT-----------------------------ANMPAHPFVRPAYDTREEEA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT-----------------------------~~~~a~PFl~pA~~~~~~~~ 131 (149) + ..+..|+.|+|||| ..|||||||+||++.+++++ T Consensus 81 -------------V---~~~~~YA~~VE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~a~PFl~pA~~~~~~~i 144 (149) T protein:vir:94 81 -------------I---SVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTF 144 (149) T ss_pred -------------E---ecCCCcccccccCccccccCCCccccccccceeecCccceecCCCCCCCcchHHHHHHHHHHH Confidence 0 12245888888887 44789999999999999999 Q ss_pred HHHHH Q lcl|NC_019767. 132 ASVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 145 ~~~i~ 149 (149) T protein:vir:94 145 EQYFS 149 (149) T ss_pred HHhhC Confidence 88888 No 52 >protein:vir:105916 Length: 149 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004379;genbank:gi:122891834;genbank:GeneID:4712387 Probab=99.82 E-value=1.2e-23 Score=146.15 Aligned_cols=108 Identities=19% Similarity=0.262 Sum_probs=86.6 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |.++. .|+|+|.+.|+++++++. ++++.++.+++..|+++|+.++|++||.|++||.+.....+..+.+ T Consensus 13 Ma~v~---~Gld~l~~~l~~~~~~~~-~~~~~~l~~~a~~v~~~ak~~aPvdTG~L~~SI~~~~~~~g~~~~V------- 81 (149) T protein:vir:10 13 MAKVK---YGADSMVVELDKFDKKIE-EWVKKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVI------- 81 (149) T ss_pred hHHHH---HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcccchhhccceEEecCCcEEEEE------- Confidence 87753 399999999999999985 6889999999999999999999999999999997653222111111 Q ss_pred cccccccccccccccCCCCCcceehhcccCC-----------------------------cCCCCCcchhHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGT-----------------------------ANMPAHPFVRPAYDTREEEA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT-----------------------------~~~~a~PFl~pA~~~~~~~~ 131 (149) ..+..|+.|+|||| ..|||||||+||++.+++++ T Consensus 82 -----------------~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~k~~i 144 (149) T protein:vir:10 82 -----------------SVGADYAIYVEYGTGIYATGPGGSRATKIPWSFKGDDGEWYTTYGQAPQPFWNPAIDAGRKTF 144 (149) T ss_pred -----------------ecCCCcccccccCccccccCCcccccccccceeeccccceecCCCCCCCcchhHHHHHHHHHH Confidence 12234777777776 55789999999999999999 Q ss_pred HHHHH Q lcl|NC_019767. 132 ASVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 145 ~~~i~ 149 (149) T protein:vir:10 145 EQYFS 149 (149) T ss_pred HHhhC Confidence 99888 No 53 >protein:vir:4956 Length: 153 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049932;genbank:gi:9632903;genbank:GeneID:1262079 Probab=99.81 E-value=7.6e-23 Score=141.85 Aligned_cols=136 Identities=18% Similarity=0.202 Sum_probs=101.3 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |.+|. .||++++++|++|.... .++.++|+++||+++++..+..+|+..- +..+....+++.++|.+. T Consensus 1 M~~~~---~glee~~~~lekL~~~~-~~~~~katkAGA~v~~e~L~~~tp~~h~--------~~~kt~~~~HlaD~I~~s 68 (153) T protein:vir:49 1 MTGLD---EALEGWLKTVASIGDLT-PAEQAKITTAGAKVFKEELAEVTREKHY--------SKKKDLKYGHMADGLAVQ 68 (153) T ss_pred CccHH---HHHHHHHHHHHHhccCC-HHHHHHHHHHHHHHHHHHHHHhccccCC--------CCCCCCCCCcccccceec Confidence 87766 79999999999999875 4677899999999999999999998531 111233345666666654 Q ss_pred ccccccccc--ccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHH--HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 81 GVNPRTGNS--DNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTR--EEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 81 ~~~~~~~~~--~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~--~~~~~~~~~~~l~~~i~k~~~k 149 (149) .... .|.. ...+++.+...+||+||+||||++|||+||+.++.+++ ++++++++.+++++.|++..+= T Consensus 69 ~~~i-dG~~dG~s~VG~~~~~~a~~a~f~n~GT~km~~~hFie~tr~e~~~k~~vl~A~~~~~~~il~~~~~~ 140 (153) T protein:vir:49 69 STNA-DGRKNGVSTVGWKNNYHAQNARRLNDGTKKYRADHFITNVQNDSTVKNKVLLAEKEEYEKLIRRKGGV 140 (153) T ss_pred cccc-cccccceeeecccCCccceeeeecccCcccCCCChhhHHHHHHhhHHHHHHHHHHHHHHHHHHhcCCe Confidence 3221 1222 22233334556899999999999999999999999876 6788888888888888776554 No 54 >protein:vir:96829 Length: 135 # NCBI annotation: ORF033 # Family: family:all:180 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240161;genbank:gi:66395838;genbank:GeneID:5133170 Probab=99.80 E-value=7.4e-23 Score=141.92 Aligned_cols=108 Identities=23% Similarity=0.291 Sum_probs=86.4 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |.+.+ .|+|+|.+.|+++++.+. ++++.++..+|+.++++|+.++|++||.|++||.+.....+..+.+ T Consensus 1 Ma~~~---~Gl~~l~~~l~~~~~~~~-~~~~~al~~~a~~v~~~ak~~apvdTG~Lr~SI~~~~~~~g~~~~V------- 69 (135) T protein:vir:96 1 MAKVK---YGADSIVVDLEKYSKDME-KWVKKGITKTTLKIYNTAIHLMPVDTGFLRQSTTVDFENGGFTGVV------- 69 (135) T ss_pred Cchhh---hhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeEEeecCcEEEEE------- Confidence 66543 399999999999999984 6789999999999999999999999999999997643222111111 Q ss_pred cccccccccccccccCCCCCcceehhcccCC---------------------------cCCCCCcchhHHHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGT---------------------------ANMPAHPFVRPAYDTREEEAAS 133 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT---------------------------~~~~a~PFl~pA~~~~~~~~~~ 133 (149) .++..|+.|+|||| ..|||||||+||++.+++++.+ T Consensus 70 -----------------~~~~~YA~~ve~GT~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~a~pfl~~A~~~~~~~~~~ 132 (135) T protein:vir:96 70 -----------------KIGSNYAVYVNYGTGIYATKGSRAHKIPWTYKDPNGKWHTTYGQMPQPFWEPAIDAGRQTFEQ 132 (135) T ss_pred -----------------ecCCCccchhhcccccccCCCccccccccccccCCcceeecCCcCCCcchhHHHHHHHHHHHH Confidence 12345777777777 6689999999999999999888 Q ss_pred HHH Q lcl|NC_019767. 134 VAI 136 (149) Q Consensus 134 ~~~ 136 (149) .|. T Consensus 133 ~i~ 135 (135) T protein:vir:96 133 YFS 135 (135) T ss_pred hcC Confidence 888 No 55 >protein:vir:96121 Length: 137 # NCBI annotation: ORF040 # Family: family:all:180 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240082;genbank:gi:66395767;genbank:GeneID:5133101 Probab=99.80 E-value=8.1e-23 Score=141.70 Aligned_cols=108 Identities=22% Similarity=0.250 Sum_probs=87.2 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |+++. .|+|+|++.|+.+++.+. ++++.+|.++|..++++|+.++|++||.|++||.+....++..+ T Consensus 1 Ma~~~---~G~~~l~~~l~~~~~~~~-~~~~~~l~~~a~~~~~~ak~~~pvdTG~L~~Si~~~~~~~g~~~--------- 67 (137) T protein:vir:96 1 MAKVK---YGNWDLVAELEDYRDEME-EWVKKGILKTTLAIYNTAVALAPVDLGFLKESIDFKVTDGGFSS--------- 67 (137) T ss_pred CchhH---hhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcCccchhcCceeEeecCceEE--------- Confidence 66663 599999999999999985 67789999999999999999999999999999975432211100 Q ss_pred cccccccccccccccCCCCCcceehhcccCC-----------------------------cCCCCCcchhHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGT-----------------------------ANMPAHPFVRPAYDTREEEA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT-----------------------------~~~~a~PFl~pA~~~~~~~~ 131 (149) .+ ..+..|+.|+|||| ..|||||||+||++.+++.+ T Consensus 68 ------------~V---~~~~~YA~yvE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~pFl~pA~~~~~~~i 132 (137) T protein:vir:96 68 ------------VI---SVGAEYAIYVEFGTGIYATGPGGSRARKLPWTYKGDDGEWHTTYGQQAQPFWNPAIDEGRKVF 132 (137) T ss_pred ------------EE---ecCCCcccccccCccccccCCCccccccccceeeccCcceeecCCCCCCcchhHHHHHHHHHH Confidence 01 12245888999998 45889999999999999999 Q ss_pred HHHHH Q lcl|NC_019767. 132 ASVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 133 ~k~i~ 137 (137) T protein:vir:96 133 NRYFS 137 (137) T ss_pred HHhhC Confidence 99888 No 56 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=99.79 E-value=2.3e-22 Score=139.25 Aligned_cols=108 Identities=22% Similarity=0.321 Sum_probs=84.7 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |... +.|+|+|.+.|+++++.+. +.++.+|.++|..|+++|+.+||++||.|++||.+.....+..+.+. T Consensus 1 Ma~~---~~Gl~~l~~~l~~~~~~~~-~~~~~al~~~a~~i~~~ak~~aPvdTG~Lr~SI~~~~~~~~~~~~V~------ 70 (137) T protein:vir:10 1 MAKV---KYGNWELVKELEDFEKETI-RWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVIN------ 70 (137) T ss_pred Cchh---HhhHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeEEeeCCcEEEEEe------ Confidence 5555 4699999999999999985 56799999999999999999999999999999976433222111111 Q ss_pred cccccccccccccccCCCCCcceehhcccC-----------------------------CcCCCCCcchhHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELG-----------------------------TANMPAHPFVRPAYDTREEEA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~G-----------------------------T~~~~a~PFl~pA~~~~~~~~ 131 (149) ++..|+.|+||| |+.|||||||+||++++++++ T Consensus 71 ------------------~~~~Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i 132 (137) T protein:vir:10 71 ------------------IGSEYAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFF 132 (137) T ss_pred ------------------cCCCcccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHH Confidence 112355555555 456899999999999999999 Q ss_pred HHHHH Q lcl|NC_019767. 132 ASVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 133 ~k~i~ 137 (137) T protein:vir:10 133 NKYFS 137 (137) T ss_pred HHhcC Confidence 99998 No 57 >protein:vir:100887 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358767;genbank:gi:77999993;genbank:GeneID:3726158 Probab=99.78 E-value=6.5e-22 Score=136.73 Aligned_cols=136 Identities=15% Similarity=0.216 Sum_probs=98.4 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeec Q lcl|NC_019767. 
2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~ 81 (149) |+|. +||++++++|++|.... .+.-.+|+++||+++++.++.++|+..-.. ....+..+++.++|.+.. T Consensus 1 v~~~---~~lee~l~~i~kl~~~~-~~~~~ki~kaGA~v~~e~L~~~tp~~~~~~-------~~~~~~~~HlaD~I~~s~ 69 (139) T protein:vir:10 1 MDMD---EALGQWLKQVSKAAELS-ISDQEKITKAGADVYAKKLAETTKEKHPNT-------KGDGGKYGHLSEDIRSAA 69 (139) T ss_pred CCHH---HHHHHHHHHHHHhhccC-HHHHHHHHHHHHHHHHHHHHHhcccccCcC-------CCCCCCCcchhhcceecC Confidence 3333 69999999999997643 345578999999999999999999742100 011112235555555544 Q ss_pred ccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 82 VNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) .+.. +...+...+++...+|++||+||||++|||+||+.++.+++++++++++.++|++.|++..+= T Consensus 70 ~~~d-g~~~g~~~VG~~k~~~~A~f~n~GT~k~~~~hFie~t~~e~~~evl~a~~~~~k~~l~~~~~~ 136 (139) T protein:vir:10 70 GDID-GDHNGSSTVGFHNKAHIARFLNDGTKYIRADHFVDNARDDAKDAVFAAEAEKYQAMIAKANGG 136 (139) T ss_pred cccc-cccceeeeeCCCCCcceEeecccCccccCCCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 3222 222222233444568999999999999999999999999999999999999999999887766 No 58 >protein:vir:94654 Length: 142 # NCBI annotation: tail component protein # Family: family:all:1084 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579211;genbank:gi:93007447;genbank:GeneID:5076773 Probab=99.77 E-value=9.3e-22 Score=135.88 Aligned_cols=115 Identities=27% Similarity=0.296 Sum_probs=89.4 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |..++++| |+++|.+.|+.+.+.+. .+++.+|..+|+.++++|+.++|++||+|++||.+.....+. .... T Consensus 1 Ma~~~~~~-~~~~l~~~l~~~~~~~~-~~~~~~l~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~g~--~~~~----- 71 (142) T protein:vir:94 1 MAGLNYRV-NSTEFQGALRAALDRLT-GAAREATEAAANDMVNMAKGLCPVDTGRLRSSIQAVPSGGRF--SFSV----- 71 (142) T ss_pred CceeEEEe-cHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhccceeeeccCCc--eEEE----- Confidence 88899888 58999999999999975 578999999999999999999999999999999754322211 1000 Q ss_pred cccccccccccccccCCCCCcceehhcccCCcC---------------------------CCCCcchhHHHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTAN---------------------------MPAHPFVRPAYDTREEEAAS 133 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~---------------------------~~a~PFl~pA~~~~~~~~~~ 133 (149) .+ ..+..|+.|+||||.. +||||||+||++.+++.+.+ T Consensus 72 -----------~v----~~~~~YA~~vE~Gt~~~~i~pk~~k~l~~~~~~~~~~~v~~pG~~~~pfl~~A~~~~~~~i~~ 136 (142) T protein:vir:94 72 -----------TI----GTNVTYAADVEYGTAPHVIVPKDKKALYWPGAAHPVAKVNHPGTRAQPFMRPAIAAASTFLRN 136 (142) T ss_pred -----------EE----ecCcccchhhhccCCCceeccCCCccceecccceeeeeeeecCCCCCcchhHHHHHHHHHHHH Confidence 00 1345799999999853 67999999999999887755 Q ss_pred HHHHHHH Q lcl|NC_019767. 
134 VAIARMN 140 (149) Q Consensus 134 ~~~~~l~ 140 (149) .++ +|+ T Consensus 137 ~~~-~~~ 142 (142) T protein:vir:94 137 HAK-GIR 142 (142) T ss_pred HHH-hcC Confidence 543 344 No 59 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=99.76 E-value=1.5e-21 Score=134.68 Aligned_cols=108 Identities=23% Similarity=0.340 Sum_probs=84.7 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |..+. .|+|+|.+.|+++++.+. +.++.+|..+|..|+++|+.+||++||.|++||.+.....+..+.+.. T Consensus 1 Ma~~~---~G~~~l~~~l~~~~~~~~-~~~~~al~~~a~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~~----- 71 (137) T protein:vir:10 1 MAKVK---YGNWDLVKELEEFEKETI-RWAKKGIAKTTTIIHNSIVSNMPVDTGYLRESVSMDFKKGGLTGVINI----- 71 (137) T ss_pred Cccch---hCHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCcCcchhhcCeeeEecCCcEEEEEec----- Confidence 66664 499999999999999985 567899999999999999999999999999999765432222221111 Q ss_pred cccccccccccccccCCCCCcceehhcccC-----------------------------CcCCCCCcchhHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELG-----------------------------TANMPAHPFVRPAYDTREEEA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~G-----------------------------T~~~~a~PFl~pA~~~~~~~~ 131 (149) +..|+.|+||| |+.|||||||+||++++++++ T Consensus 72 -------------------~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~~~~~~~i 132 (137) T protein:vir:10 72 -------------------GSEYAVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAIDEGRAFF 132 (137) T ss_pred -------------------CCccccccccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHHHHHHHHH Confidence 12345555555 456999999999999999999 Q ss_pred HHHHH Q lcl|NC_019767. 
132 ASVAI 136 (149) Q Consensus 132 ~~~~~ 136 (149) .+.|. T Consensus 133 ~k~i~ 137 (137) T protein:vir:10 133 NKYFS 137 (137) T ss_pred HHhhC Confidence 99998 No 60 >protein:vir:5000 Length: 141 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049974;genbank:gi:9632946;genbank:GeneID:1262109 Probab=99.76 E-value=2.5e-21 Score=133.55 Aligned_cols=136 Identities=19% Similarity=0.231 Sum_probs=98.0 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |.+|. +||++|+++|++|.... .+.-.+|+++||+++++..+..+|++.-. ..+.+..+++.++|.+. T Consensus 1 M~~~~---~gl~e~~~~lekl~~~~-~~~~~katkAGA~v~~~~L~~~tp~~hy~--------~~~~~~~~HlaD~I~~~ 68 (141) T protein:vir:50 1 MVGLA---EALDEWLKTVASIGNLT-PAEQVEITTAGAKVFKKELEEVTREKHYS--------RKKNPKFGHMADGLAIQ 68 (141) T ss_pred CccHH---HHHHHHHHHHHHhcCCC-HHHHHHHHHHHHHHHHHHHHHhcccCCCC--------CCCCCCCCccccceeec Confidence 77765 89999999999998665 45668999999999999999999975210 11223344555555554 Q ss_pred ccccccccc--ccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHH--HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 81 GVNPRTGNS--DNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTR--EEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 81 ~~~~~~~~~--~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~--~~~~~~~~~~~l~~~i~k~~~k 149 (149) .... .|.. 
...+++.+...+|++||+||||++|||+||+.+|.+.+ +++|++++.++|++.|++--+- T Consensus 69 ~~~~-DG~~dg~s~VG~~~~~~~~~A~f~n~GT~k~~~~hFve~~~~~a~~k~~Vl~A~~~~~k~~l~~~~~~ 140 (141) T protein:vir:50 69 STNA-DGRKNGVSTVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSTVQKKVLLEKKRNTKNSLEEKEGC 140 (141) T ss_pred cCcc-ccccCCeeeeccCCCccceeeeccccCccccCCCchhHHHHHhhhhHHHHHHHHHHHHHHHHHhccCC Confidence 3221 1211 22233334556899999999999999999999999864 7888888888888777764444 No 61 >protein:vir:99101 Length: 142 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1608 # MgeName: Qyrzula # Cross-refs: genbank:acc:YP_655705;genbank:gi:109521783;genbank:GeneID:4157823 Probab=99.75 E-value=8.2e-22 Score=136.17 Aligned_cols=113 Identities=22% Similarity=0.272 Sum_probs=84.6 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) ||++++.++||+. .|+.+...+. ++++++|...+..++++|+.++|++||+|++||............+.. T Consensus 1 m~~~~~~~~gl~~---~l~~~~~~~~-~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~----- 71 (142) T protein:vir:99 1 MVQVSVRYEGFDY---NPVGAAAQVG-PILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSG----- 71 (142) T ss_pred CceeEEEeeecch---hHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEE----- Confidence 9999999999875 5566666664 678999999999999999999999999999999754432221111111 Q ss_pred cccccccccccccccCCCCCcceehhcccCCc-----------------------------CCCCCcchhHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTA-----------------------------NMPAHPFVRPAYDTREEEA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~-----------------------------~~~a~PFl~pA~~~~~~~~ 131 (149) ...+++.|+.|+||||. .++|||||+||++.+.++. 
T Consensus 72 ---------------~v~~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~ 136 (142) T protein:vir:99 72 ---------------GVTAHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRD 136 (142) T ss_pred ---------------EeccCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhh Confidence 11234568888888885 2459999999999998876 Q ss_pred HHHHHH Q lcl|NC_019767. 132 ASVAIA 137 (149) Q Consensus 132 ~~~~~~ 137 (149) .....+ T Consensus 137 ~~~~~r 142 (142) T protein:vir:99 137 RRIRVR 142 (142) T ss_pred hhhccC Confidence 666555 No 62 >protein:vir:8669 Length: 142 # NCBI annotation: gp27 # Family: family:all:1084 # MgeID: mge:156 # MgeName: Rosebush # Cross-refs: genbank:acc:NP_817788;genbank:gi:29566220;genbank:GeneID:1259476 Probab=99.75 E-value=8.2e-22 Score=136.17 Aligned_cols=113 Identities=22% Similarity=0.272 Sum_probs=84.6 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) ||++++.++||+. .|+.+...+. ++++++|...+..++++|+.++|++||+|++||............+.. T Consensus 1 m~~~~~~~~gl~~---~l~~~~~~~~-~~~~~~i~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~----- 71 (142) T protein:vir:86 1 MVQVSVRYEGFDY---NPVGAAAQVG-PILRRTHSSLTRQIANETRARVPVLTGHLGRSVREDPQVMVTPFHVSG----- 71 (142) T ss_pred CceeEEEeeecch---hHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhhcceeeeeccccccceEEE----- Confidence 9999999999875 5566666664 678999999999999999999999999999999754432221111111 Q ss_pred cccccccccccccccCCCCCcceehhcccCCc-----------------------------CCCCCcchhHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTA-----------------------------NMPAHPFVRPAYDTREEEA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~-----------------------------~~~a~PFl~pA~~~~~~~~ 131 (149) ...+++.|+.|+||||. .++|||||+||++.+.++. 
T Consensus 72 ---------------~v~~~a~YA~~ve~GT~ph~i~pk~~~al~f~~~g~~~~~k~v~hpG~~a~Pfl~~A~~~~~~~~ 136 (142) T protein:vir:86 72 ---------------GVTAHAKYAAAVHEGTRPHVIRAKHAQALHFWWRGREVFVRQVNHPGTRARPYLRNAGEAVVRRD 136 (142) T ss_pred ---------------EeccCccccceeccCCccceeccccCceeeEecCCceeeeeeeecCCCCCCchhHHHHHHHHhhh Confidence 11234568888888885 2459999999999998876 Q ss_pred HHHHHH Q lcl|NC_019767. 132 ASVAIA 137 (149) Q Consensus 132 ~~~~~~ 137 (149) .....+ T Consensus 137 ~~~~~r 142 (142) T protein:vir:86 137 RRIRVR 142 (142) T ss_pred hhhccC Confidence 666555 No 63 >protein:vir:4859 Length: 140 # NCBI annotation: putative tail component protein # Family: family:all:1029 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049399;genbank:gi:9632427;genbank:GeneID:1258496 Probab=99.75 E-value=4.1e-21 Score=132.37 Aligned_cols=137 Identities=19% Similarity=0.186 Sum_probs=99.3 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |.+|. +||++|+++|++|.... .+.-.+|+++||+++++..+.++|+..- +....+..+++.++|.+. T Consensus 1 M~~~~---d~l~e~~~~lekl~~~~-~~~~~katkAGA~v~~~~L~~~tp~~h~--------~~~~t~~~~HlaD~I~~~ 68 (140) T protein:vir:48 1 MTGLD---EALEGWLKTVASIGDLT-PAEQAKITTAGAKVFKEELAEVTRQKHY--------SNKKHLKYGHMADGLSVQ 68 (140) T ss_pred CccHH---HHHHHHHHHHHHhccCC-HHHHHHHHHHHHHHHHHHHHHhccccCC--------CCCCCCCCCcchhceeec Confidence 88766 69999999999999765 3566899999999999999999996421 011122334555555554 Q ss_pred cccccc-cccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHH--HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 81 GVNPRT-GNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTR--EEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 81 ~~~~~~-~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~--~~~~~~~~~~~l~~~i~k~~~k 149 (149) ...... 
..+...+++.+...+|++||+||||++|||+||+.+|.+++ +.++++++.+++++.|++--+- T Consensus 69 ~~~iDg~~~g~s~VG~~kk~~a~~A~f~n~GT~k~~~~hFve~~~~e~~~k~~vl~A~~~~~~~~l~~~~~~ 140 (140) T protein:vir:48 69 STNVDGRKNGVSTVGWVNRYHAQNARRLNDGTKKYRADHFVTNVQNDSAVQTKVLLAEKEEYEKLIRKKGGE 140 (140) T ss_pred ccccccccCceeeeccCCCcceeeeeccccCccccCCCchhHHHHHhhhhHHHHHHHHHHHHHHHHHhhcCC Confidence 322111 11122233334456899999999999999999999999966 7789999999888888775555 No 64 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=99.75 E-value=6e-21 Score=131.45 Aligned_cols=134 Identities=23% Similarity=0.261 Sum_probs=103.3 Q ss_pred Cccc-eeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 1 MIET-SLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~~-~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |..+ +|+++||++|.+.|+++......+.+++++++.|..+..+++.++|++||.|++|+........ +.+ T Consensus 1 M~~~~~~d~~gl~~~~~~l~~~~~~~~~~~~~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~--~~~------ 72 (141) T protein:vir:79 1 MARWGSVDFREFKRVCKKMEKLTKIDLDKFCKDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARS--LPV------ 72 (141) T ss_pred CCCCccCcHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccc--cce------ Confidence 7776 8999999999999998866434678899999999999999999999999999998754321110 000 Q ss_pred ecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHH--HHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAY--DTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~--~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) ....+... +. 
-..+..||+|+||||+.++++||+.+++ +.+.+++.+.+.+.+.+.|++.+++ T Consensus 73 ---~~~g~~~~--v~--v~n~~~YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~~ 137 (141) T protein:vir:79 73 ---YKQGNNYI--IE--VVNPTEYASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLKG 137 (141) T ss_pred ---eecCCeeE--EE--EecCCcchhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00000000 11 1234679999999999999999999998 7777888888989999999999998 No 65 >protein:vir:100223 Length: 139 # NCBI annotation: putative head-tail joining protein # Family: family:all:1029 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025034;genbank:gi:48697267;genbank:GeneID:2948321 Probab=99.73 E-value=1e-20 Score=130.21 Aligned_cols=136 Identities=18% Similarity=0.245 Sum_probs=98.4 Q ss_pred ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeeccc Q lcl|NC_019767. 4 TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVN 83 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~ 83 (149) |+|+ .||++++++|++|.... .+.-.+|+++||+++++..+.++|+..-. ........+++.+.|.+.... T Consensus 1 ~~~~-~~l~e~l~~lekl~~~~-~~~~~k~tkaGA~v~~~~L~~~tp~~~~~-------~~~~~~~~~HlaD~I~~~~~~ 71 (139) T protein:vir:10 1 MDMD-EALGQWLKQVSKAAQLS-VSDQEKITKAGADVYAKELAETTKEKHPN-------TKGDGGKYGHLSEDISSAAGD 71 (139) T ss_pred CCHH-HHHHHHHHHHHHhccCC-HHHHHHHHHHHHHHHHHHHHHhccccccc-------CCCCCCCCCcccccceecCcc Confidence 3333 69999999999998654 35557899999999999999999964200 001112234555555554422 Q ss_pred ccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 84 PRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) .. 
+...+.+.+++...+|.+||+||||++|||+||+..+.+++++++++++.+++++.|.+..+- T Consensus 72 id-g~~~g~~~VG~~~~~~~Ahf~n~GT~~~~~~hFie~t~~e~~~ev~~a~~~~~ke~l~~~~~~ 136 (139) T protein:vir:10 72 ID-GDHNGSSTVGFHNKAHIARFLNDGTKNIRADHFVDNARDDAKDAVFAAEAEKYQAMIAKANGG 136 (139) T ss_pred cc-ccccccceeCCCCCceeeeeeccCccccCCCchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 21 222233334444567889999999999999999999999999999999999999988887665 No 66 >protein:vir:4833 Length: 140 # NCBI annotation: ORF29 # Family: family:all:1029 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038330;genbank:gi:9634656;genbank:GeneID:1262624 Probab=99.72 E-value=2.4e-20 Score=128.12 Aligned_cols=137 Identities=18% Similarity=0.189 Sum_probs=101.5 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |.+|. .||++|+++|++|...+. +.-.+|+++||+++++..+.++|+..-. ..+.+..+++.++|.+. T Consensus 1 M~~~~---d~l~e~~~~v~kl~~~~~-~~~~katkAGAkv~~~~L~~~tp~~h~~--------~r~t~~~~HlaD~I~~~ 68 (140) T protein:vir:48 1 MTGLD---EALEGWLKTVASIGDLTP-AEQAKITTAGAKVFKKELAEVTREKHYS--------KKKDLKYGHMADGLAVQ 68 (140) T ss_pred CccHH---HHHHHHHHHHHHhccCCH-HHHHHHHHHhHHHHHHHHHHhcccCCCC--------CCCCCCCCcccccceec Confidence 88766 599999999999998653 5568899999999999999999986311 12233445666666655 Q ss_pred cccccc-cccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHH--HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 81 GVNPRT-GNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTR--EEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 81 ~~~~~~-~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~--~~~~~~~~~~~l~~~i~k~~~k 149 (149) ...... 
..+...+++.....+|+++|+++||++|||+||+..+.+++ +++++.++.+++++.|.+--+- T Consensus 69 ~~~idg~~dG~s~VG~~k~~~a~~a~f~NdGT~k~~~~hFve~t~~e~~~~~~vl~A~~~~y~~~l~kk~~~ 140 (140) T protein:vir:48 69 STNVDGRKNGVATVGWKNNYHAQNARRLNDGTKKYRADHFVTNVQNDSAVRDKVLLAEKEEYEKLIRKKGGE 140 (140) T ss_pred ccccccccccceeecccCCCceeEEeecccCccccCCCchHHHHHHhhhhHHHHHHHHHHHHHHHHHhhcCC Confidence 322111 11122233333446899999999999999999999999855 8899999999998888776666 No 67 >protein:vir:81147 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285816;genbank:gi:148747737;genbank:GeneID:5247190 Probab=99.68 E-value=1.2e-19 Score=124.23 Aligned_cols=120 Identities=18% Similarity=0.284 Sum_probs=92.0 Q ss_pred CccceeehhhH-HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 1 MIETSLDFSGL-NDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~~~~~i~Gl-~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |.. |+|++| +++.+.|+.++.++. +.+++++.++|+.+++++|.++|++||.+++++.++.... .|. T Consensus 1 Ma~--i~id~la~~I~~~L~~y~~~v~-~~v~~~v~~~a~~~~~~ik~~aP~rTG~y~ksw~vk~~~~--~g~------- 68 (126) T protein:vir:81 1 MAN--ITIDRLADELLQAVKEYTDDVA-EGVRKKVDETARKVLKEAQALAPKRTGEYARTFTITKEDG--YGT------- 68 (126) T ss_pred Ccc--cchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhCCcccchhhcccccccccc--CCc------- Confidence 665 567787 568888999999975 6779999999999999999999999999999976553211 111 Q ss_pred ecccccccccccccccCCCCCcceehhcccCCcC-----CCCCcchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 
80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTAN-----MPAHPFVRPAYDTREEEAASVAIARMNQAI 143 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~-----~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i 143 (149) ...+..+.....+.|++|||+.+ +||+|||+||++...+++.+.|++.|+..= T Consensus 69 -----------~~~vv~~~~~~~l~HLLEfGha~r~gGrV~a~Phi~Pa~e~~~~~~~~~i~~~l~~gg 126 (126) T protein:vir:81 69 -----------TKRIIWNKKHYRRVHLLEFGHAKVNGGRVKEYPHLRPAYDKHGARLPDELKRVIENGG 126 (126) T ss_pred -----------ceEEEeccCCCCceeeeecceecCCCCccCCCcchHHHHHHHHHHHHHHHHHHhhcCC Confidence 11233344556679999999997 799999999999888877766666665333 No 68 >protein:vir:97327 Length: 116 # NCBI annotation: ORF041 # Family: family:all:180 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240615;genbank:gi:66396305;genbank:GeneID:5133683 Probab=99.57 E-value=2.5e-18 Score=117.08 Aligned_cols=87 Identities=22% Similarity=0.289 Sum_probs=68.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeecccccccccccccccCCCCCcceeh Q lcl|NC_019767. 26 NNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWR 105 (149) Q Consensus 26 ~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~ 105 (149) .+++++++|.+++..++++|+.++|++||+|++||.......+..+.+. .+..|+. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~------------------------~~~~YA~ 56 (116) T protein:vir:97 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVIN------------------------IGSEYAI 56 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeecCcEEEEEe------------------------cCCCccc Confidence 3678899999999999999999999999999999975432222111111 1234666 Q ss_pred hcccC-----------------------------CcCCCCCcchhHHHHHHHHHHHHHHH Q lcl|NC_019767. 106 FVELG-----------------------------TANMPAHPFVRPAYDTREEEAASVAI 136 (149) Q Consensus 106 f~E~G-----------------------------T~~~~a~PFl~pA~~~~~~~~~~~~~ 136 (149) |+||| |..|+|||||+||++++++.+.+.|. 
T Consensus 57 yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:97 57 YVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ccccCCcccccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 66666 88899999999999999999988888 No 69 >protein:vir:1243 Length: 116 # NCBI annotation: similar to phage Spp1 gp16.1 # Family: family:all:180 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510942;genbank:gi:17426276;genbank:GeneID:927389 Probab=99.57 E-value=2.5e-18 Score=117.08 Aligned_cols=87 Identities=22% Similarity=0.289 Sum_probs=68.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeecccccccccccccccCCCCCcceeh Q lcl|NC_019767. 26 NNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWR 105 (149) Q Consensus 26 ~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~ 105 (149) .+++++++|.+++..++++|+.++|++||+|++||.......+..+.+. .+..|+. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~aPv~TG~Lr~SI~~~~~~~~~~~~V~------------------------~~~~YA~ 56 (116) T protein:vir:12 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVIN------------------------IGSEYAI 56 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHHhCCcCcccccccceEEeecCcEEEEEe------------------------cCCCccc Confidence 3678899999999999999999999999999999975432222111111 1234666 Q ss_pred hcccC-----------------------------CcCCCCCcchhHHHHHHHHHHHHHHH Q lcl|NC_019767. 106 FVELG-----------------------------TANMPAHPFVRPAYDTREEEAASVAI 136 (149) Q Consensus 106 f~E~G-----------------------------T~~~~a~PFl~pA~~~~~~~~~~~~~ 136 (149) |+||| |..|+|||||+||++++++.+.+.|. 
T Consensus 57 yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~i~ 116 (116) T protein:vir:12 57 YVNYGTGIYATGAGGSRAKKIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred ccccCCcccccCCCcccccccceeeecCCceeeecCCcCCCcchHHHHHHHHHHHHHhhC Confidence 66666 88899999999999999999988888 No 70 >protein:vir:95062 Length: 116 # NCBI annotation: ORF044 # Family: family:all:180 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240827;genbank:gi:66394711;genbank:GeneID:5133856 Probab=99.57 E-value=2.6e-18 Score=117.03 Aligned_cols=87 Identities=22% Similarity=0.289 Sum_probs=67.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeecccccccccccccccCCCCCcceeh Q lcl|NC_019767. 26 NNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWR 105 (149) Q Consensus 26 ~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~ 105 (149) .+++++++|..++..|+.+|+.+||++||+|++||..........+.+. .+..|+. T Consensus 1 v~~~v~~~~~~~~~~i~~~ak~~apv~TG~Lr~SI~~~~~~~~~~~~V~------------------------~~~~Ya~ 56 (116) T protein:vir:95 1 MERWVKRGIAKTTAKIHNTIISLMPVDTGYLRESVTMDFKDGGFTGVIN------------------------IGSEYAI 56 (116) T ss_pred ChHHHHHHHHHHHHHHHHHHHhhCCccccccccceeEEeecCcEEEEEe------------------------cCCCccc Confidence 3678899999999999999999999999999999976443222111111 1123555 Q ss_pred hcccC-----------------------------CcCCCCCcchhHHHHHHHHHHHHHHH Q lcl|NC_019767. 106 FVELG-----------------------------TANMPAHPFVRPAYDTREEEAASVAI 136 (149) Q Consensus 106 f~E~G-----------------------------T~~~~a~PFl~pA~~~~~~~~~~~~~ 136 (149) |+||| |..|+|||||+||++.+++.+++.|. 
T Consensus 57 yvE~GTg~~~~~~~~~~~~~~~~~~~~~~g~~~~t~g~~a~Pfl~pA~~~~~~~i~k~is 116 (116) T protein:vir:95 57 YVNYGTGIYATGAGGSRAKNIPWSYKDANGKWHTTKGQHAQPFWEPAIDAGRAFFNKYFS 116 (116) T ss_pred eeecCccccccCCCccccccccceeecCccceeeCCCCCCCcchHHHHHHHHHHHHHhhC Confidence 55555 77899999999999999999988888 No 71 >protein:vir:78077 Length: 141 # NCBI annotation: gp9 # Family: family:all:180 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468793;genbank:gi:157325374;genbank:GeneID:5601839 Probab=99.56 E-value=1.1e-17 Score=113.61 Aligned_cols=115 Identities=15% Similarity=0.142 Sum_probs=78.3 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |-+++|+ +.+.+.++.+.+.+.+.+...|+..++++++..|+.++|+++|.|++||..........+. T Consensus 1 ~~~~~f~----~~~~~~~~~~~k~~~~~~~~~a~~~~~~~ie~~ak~~~pvdtG~L~~SI~~~v~~~g~~~~-------- 68 (141) T protein:vir:78 1 MNEFEFD----SNIPKARKLIEKKVLQALEDIGEHMTTELAEGGHGVTSNNDTGEYAQKSGYKVRKSSKEVI-------- 68 (141) T ss_pred CcchhHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccchhhcceeeeeecCCcEEE-------- Confidence 7778776 3344445555555544443446888889999999999999999999999654322211110 Q ss_pred cccccccccccccccCCCCCcceehhcccCC--------------------------cCCCCCcchhHHHHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGT--------------------------ANMPAHPFVRPAYDTREEEAASV 134 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT--------------------------~~~~a~PFl~pA~~~~~~~~~~~ 134 (149) + ..+..|+-|+|||| +.|||||||+||++.+++++.+. 
T Consensus 69 -------------V---~~~~~YA~yVE~GTG~~~~~~~grk~~w~y~~~~g~~~~t~G~~aqpFl~~A~~~~~~~i~~~ 132 (141) T protein:vir:78 69 -------------V---GNSSDYAIYYEFGTGEKSERGGGKAGGWFYMDKKGHWHFTRGSQASKRMRYTFRDEQDKVRVF 132 (141) T ss_pred -------------E---ecCCCccceeecCCcccccCCCCCcCcceeecCCCeeEeccCCCCchhhhhhHHhhHHHHHHH Confidence 0 01234666666666 56999999999999999988776 Q ss_pred HHHHHHHHHH Q lcl|NC_019767. 135 AIARMNQAID 144 (149) Q Consensus 135 ~~~~l~~~i~ 144 (149) |.+.|+ .|+ T Consensus 133 i~~~~~-~l~ 141 (141) T protein:vir:78 133 TERALR-GIN 141 (141) T ss_pred HHHHhh-ccC Confidence 666554 344 No 72 >protein:vir:105467 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529877;genbank:gi:90592617;genbank:GeneID:3974531 Probab=99.55 E-value=3.9e-17 Score=110.56 Aligned_cols=124 Identities=15% Similarity=0.156 Sum_probs=89.2 Q ss_pred CccceeehhhHHHHHHHHHHhHHHH-HHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAE-NNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~-~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |...+|+++||++|.+.|+++.... ..+.+.++|+..|..+.++++.++|++||.|++|+....... ..+.+ T Consensus 1 Ms~~~id~~gl~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~~~vk~~tPVdTG~Lr~S~~~~~~~~-~~~~~------ 73 (144) T protein:vir:10 1 MSLGHVDDAQFQQFASRVRQKIDSGYVKQELGKSSRRIGTQSLRILEANTPVKQGNLRRSWTAEGPTY-GCGGW------ 73 (144) T ss_pred CCCCCccHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHhCCCCcchhccceeecceee-ecCee------ Confidence 5557999999999999999986531 246778999999999999999999999999999987543211 01111 Q ss_pred ecccccccccccccccCCCCCcceehhcccCCcCC-----------------CCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 
80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANM-----------------PAHPFVRPAYDTREEEAASVAIARMNQA 142 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~-----------------~a~PFl~pA~~~~~~~~~~~~~~~l~~~ 142 (149) ...+ ..+..|++|+||||+.+ +.+|||++|.+..+.. |.+.+.+. T Consensus 74 ----------~~~V----~n~~~YA~~VE~Ghr~~~G~~v~~~~~~~~~g~V~G~~~~~~a~~~~~~~----~~~~l~k~ 135 (144) T protein:vir:10 74 ----------TIKL----INNAEYASYVESGHRQTPGRYVPVLKKRLVRDWVPGQFYMKKSIPQIQRQ----LPQLVTEG 135 (144) T ss_pred ----------EEEE----ecCCCcccccccceeecCCcccccCCCccccceecCccchHHHHHHHHHH----HHHHHHHH Confidence 0111 24567999999999754 5678999988866664 44445555 Q ss_pred HHHHhcC Q lcl|NC_019767. 143 IDEVLSK 149 (149) Q Consensus 143 i~k~~~k 149 (149) |++.... T Consensus 136 l~~l~d~ 142 (144) T protein:vir:10 136 LWGLKDL 142 (144) T ss_pred HHHHhhh Confidence 5555555 No 73 >protein:vir:99528 Length: 92 # NCBI annotation: putative major tail protein # Family: family:all:180 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958541;genbank:gi:41179323;genbank:GeneID:2717166 Probab=99.52 E-value=1.5e-17 Score=112.77 Aligned_cols=92 Identities=21% Similarity=0.342 Sum_probs=73.3 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |..++++|.|+|+|++.|++.... ..++++++..+..++.+|+++||++||.|++||.......+..+.+ T Consensus 1 Ma~~~i~~~Gld~L~~~L~~~~~~---~~v~~vv~~~~~~l~~~ak~~ap~dTG~lrrSI~~~~~~~g~~~~v------- 70 (92) T protein:vir:99 1 MADYSISWDGLDALDEALANQQNM---NTVKKVVKKHTANLMTATQQAVPVDTGHLKQSAQIQISRDGFTGSV------- 70 (92) T ss_pred CCceeeEeehHHHHHHHHHhhccH---HHHHHHHHHHHHHHHHHHHHhCCCCccccceeeeEEeecCCeeEEE------- Confidence 888999999999999999987643 3457899999999999999999999999999998665433222111 Q ss_pred cccccccccccccccCCCCCcceehhcccCCcCCCC Q lcl|NC_019767. 
81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPA 116 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a 116 (149) ....+.+.|+-|+||||++|+| T Consensus 71 --------------~~~gp~a~Ya~YvE~GTR~M~A 92 (92) T protein:vir:99 71 --------------TYGGGLVNYAAYVEFGTRFMDS 92 (92) T ss_pred --------------EeccCccccccccccceeecCC Confidence 1112345699999999999999 No 74 >protein:vir:3848 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050154;swissprot:trembl:q9t1f3;genbank:gi:9633046;uniprot:Q9T1F3;genbank:GeneID:1262148 Probab=99.52 E-value=1e-16 Score=108.25 Aligned_cols=146 Identities=12% Similarity=0.152 Sum_probs=105.7 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceec------ccccccCCccc Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVV------TQKSRRRGEIS 74 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~------~~~~~~~g~~~ 74 (149) ||. +++ .+|++++++|+++.... .+.-.++..+||+++++..+..+|+..-..+...... .......+++. T Consensus 1 mm~-~~~-~~l~~~l~~v~k~~~~~-~~~k~kiTkAGAkv~~e~L~~~Tp~~h~~~~k~~~~~~~~~k~~~~~~~~~Hla 77 (159) T protein:vir:38 1 MAN-DMG-EFYNNWVNEVEKGMKLS-VEDKAKITGEGAEAFSTVLHDHTPRSNEIYRRGRSAGHANAKHHNRNRKTKHLQ 77 (159) T ss_pred Ccc-hHH-HHHHHHHHHHHHhcCCC-HHHHHHHHHHhHHHHHHHHHHhcccCCCccccccccccccccccCcCcCCCccc Confidence 444 344 67999999998854432 2334678999999999999999999644333221111 12345567888 Q ss_pred cceeeeccccccccccccc--ccCCCCCcceehhcccCCcCCCCC-----cchhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019767. 75 SGVHIRGVNPRTGNSDNTM--KANNPRNAFYWRFVELGTANMPAH-----PFVRPAYDTREEEAASVAIARMNQAIDEVL 147 (149) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~y~~f~E~GT~~~~a~-----PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~ 147 (149) ++|.+..+....|...+.. 
+..+...+|+++|+++||++|||+ ||+..+.++.+++|+.++.+++++-|+..- T Consensus 78 D~I~~~~~~~iDg~~dG~s~VGw~~~~~a~~a~f~NdGT~~m~~k~~~gdHFvekt~~~~k~~Vl~A~~~~~~~il~~~~ 157 (159) T protein:vir:38 78 DSITYKPGYTADKLHTGDTDVGFEGKYYDFLAKIVNNGQHHMSPKRYKNMHFLDKAQQEAKKSVAEAELKAYKEVMNHDS 157 (159) T ss_pred cceeeecCccccccccceeeecccCCccceEeeecccCccccCCCCccCChhHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 8887765533333332222 222445579999999999999998 899999999999999999999999998888 Q ss_pred cC Q lcl|NC_019767. 148 SK 149 (149) Q Consensus 148 ~k 149 (149) -| T Consensus 158 ~~ 159 (159) T protein:vir:38 158 DK 159 (159) T ss_pred CC Confidence 88 No 75 >protein:vir:81067 Length: 119 # NCBI annotation: p12 # Family: family:all:2714 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285682;genbank:gi:156535145;genbank:GeneID:5247112 Probab=99.49 E-value=1.8e-17 Score=112.33 Aligned_cols=93 Identities=24% Similarity=0.312 Sum_probs=79.1 Q ss_pred HHHHHHhhCCcCCCcccccceecccccccCCccccceeeecccccccccccccccCCCCCcceehhcccC---------- Q lcl|NC_019767. 41 LKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELG---------- 110 (149) Q Consensus 41 v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~G---------- 110 (149) |+|+++..+|+.+|.|+++|........+.-+ .....++++...++|||++||| T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG----------------~~~Y~Vswn~rkAPhghlvE~Ghw~~~~~~~~ 64 (119) T protein:vir:81 1 MRESAKAFVNDETGKLRSNLYVAYSPEESTNG----------------VQTYAVSWRKKAAPHGHLLEFGHWQTHAAYKG 64 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCC----------------eEEEEeeccCCcCCcccccccceeeeeeeeec Confidence 99999999999999999999776544333211 1223456677889999999999 Q ss_pred --------------CcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 
111 --------------TANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 111 --------------T~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) |+++||+|||+|||++..+++.++|.+.+++.+.++++= T Consensus 65 ~dG~w~~~~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg 117 (119) T protein:vir:81 65 KDGEWYSSSVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRG 117 (119) T ss_pred cCceeeecCccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 999999999999999999999999999999999999987 No 76 >protein:vir:100652 Length: 134 # NCBI annotation: 77ORF029 # Family: family:all:589 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958610;genbank:gi:41189542;genbank:GeneID:2743798 Probab=99.48 E-value=9.7e-17 Score=108.37 Aligned_cols=121 Identities=19% Similarity=0.215 Sum_probs=88.4 Q ss_pred ceeehhhHHHHHHHHHHh--HHHHHHHHHHHHHHHHHHHHHHHHHhhCCc--CCCcccccceecccccccCCccccceee Q lcl|NC_019767. 4 TSLDFSGLNDIAKDLEAL--SRAENNKVLRDATRAGAEVLKEEVIARAPV--RTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l--~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |++++.|++||+++|++. +..+ .++.++||.+|+++|.++++.+.++ |||.+.+++.++.-... T Consensus 1 MsvevkGv~eil~~LE~k~g~~~~-~ri~dkAL~~age~v~~~~K~~~~~fkDTGati~ev~~s~p~~~----------- 68 (134) T protein:vir:10 1 MSVKVTGDKALERELEKHFGIKEM-VKVQDKALIAGAKVIVEEIKKQLKPSEDSGALISEIGRTEPEWI----------- 68 (134) T ss_pred CeEEeecHHHHHHHHHHhhchhhh-hhhhhHHHHHHhHHHHHHHHhhcCccccccceeccEeecCeeec----------- Confidence 889999999999999998 5555 6899999999999999999998776 89998888776543211 Q ss_pred eccccccccccccccc-CCCCCcceehhcccCCcCCCCCcchhH--------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 
80 RGVNPRTGNSDNTMKA-NNPRNAFYWRFVELGTANMPAHPFVRP--------AYDTREEEAASVAIARMNQA 142 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~-~~~~~~~y~~f~E~GT~~~~a~PFl~p--------A~~~~~~~~~~~~~~~l~~~ 142 (149) .|...+.+.+ ++.+..++.|+.|||+.++..-+|+.| |+++.+..+.+.++++|++- T Consensus 69 ------~G~r~V~vgW~G~~~R~~ivHLnE~Gyt~~r~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 69 ------KGKRTVTIRWRGPFERFRIVHLIENGHVEKKSGKFVKPKAMGGINRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred ------CCceEEEEEEEcCCceeeEEEeeecceeecCCCCeeccchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 1222222222 234567889999999999999999999 55555544444444443333 No 77 >protein:vir:10367 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:2714 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858959;genbank:gi:32128424;genbank:GeneID:2648366 Probab=99.48 E-value=2.3e-17 Score=111.85 Aligned_cols=93 Identities=24% Similarity=0.314 Sum_probs=79.0 Q ss_pred HHHHHHhhCCcCCCcccccceecccccccCCccccceeeecccccccccccccccCCCCCcceehhcccC---------- Q lcl|NC_019767. 41 LKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELG---------- 110 (149) Q Consensus 41 v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~G---------- 110 (149) |+|+++..+|+.+|.|+++|........+.-+ .....++++...++|||++||| T Consensus 1 ~rDeakarv~~~~G~Lr~sIY~ay~~~~S~dG----------------~~~Y~Vswn~rkAPhghlvE~Ghw~~~~~~~~ 64 (119) T protein:vir:10 1 MRESAKAFVNDETGKLRSNLYVAYSTEESTNG----------------VQTYAVSWRKKAAPHGHLLEFGHWQTHAAYKG 64 (119) T ss_pred CCcccccccCCCccchhhhheeeeccccCCCC----------------EEEEEeecCCCcCCcccccccceeeeeeeeec Confidence 99999999999999999999776544333211 1123456677889999999999 Q ss_pred --------------CcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 
111 --------------TANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 111 --------------T~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) ++++||+|||+|||++..+++.++|.+.+++.+.++++= T Consensus 65 ~dG~w~~~~~~l~~~~~vPa~pFlRpA~da~~~~a~~~~~~r~~~rv~Ev~rg 117 (119) T protein:vir:10 65 KDGEWYSSSVKLVNPKWIPARPFLRPGYDSVAMQIPDIAKAAGAKKYAELQRG 117 (119) T ss_pred cCceeeecCccccCceecCCCCccchhHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 889999999999999999999999999999999999987 No 78 >protein:vir:9513 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835560;genbank:gi:30043947;genbank:GeneID:1260542 Probab=99.44 E-value=3e-16 Score=105.70 Aligned_cols=121 Identities=21% Similarity=0.227 Sum_probs=88.7 Q ss_pred ceeehhhHHHHHHHHHHh--HHHHHHHHHHHHHHHHHHHHHHHHHhhCCc--CCCcccccceecccccccCCccccceee Q lcl|NC_019767. 4 TSLDFSGLNDIAKDLEAL--SRAENNKVLRDATRAGAEVLKEEVIARAPV--RTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l--~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |+++++|++||+++|++. +..+ +++.++||.+|++.|.++++.+.++ |||.+.+.+.++.-... T Consensus 1 msvevkGv~eil~~le~k~g~~~~-~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~----------- 68 (134) T protein:vir:95 1 MSVKVIGDKALERELEKRFGIKEM-VKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWI----------- 68 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhh-hhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeec----------- Confidence 889999999999999988 5555 6899999999999999999999986 89998888776543211 Q ss_pred eccccccccccccccc-CCCCCcceehhcccCCcCCCCCcchhH--------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 
80 RGVNPRTGNSDNTMKA-NNPRNAFYWRFVELGTANMPAHPFVRP--------AYDTREEEAASVAIARMNQA 142 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~-~~~~~~~y~~f~E~GT~~~~a~PFl~p--------A~~~~~~~~~~~~~~~l~~~ 142 (149) .|...+.+.+ ++.+..++.|+.|||+.++...+|+.| |+++.+..+.+.++++|++- T Consensus 69 ------~G~r~V~vgW~G~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:95 69 ------NGKRTITVHWRGSKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred ------CCceEEEEEEEcCCceeEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 1222222222 234567889999999999999999999 55555554444444444333 No 79 >protein:vir:101302 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908835;genbank:gi:118725099;genbank:GeneID:4555873 Probab=99.44 E-value=3e-16 Score=105.70 Aligned_cols=121 Identities=21% Similarity=0.227 Sum_probs=88.7 Q ss_pred ceeehhhHHHHHHHHHHh--HHHHHHHHHHHHHHHHHHHHHHHHHhhCCc--CCCcccccceecccccccCCccccceee Q lcl|NC_019767. 4 TSLDFSGLNDIAKDLEAL--SRAENNKVLRDATRAGAEVLKEEVIARAPV--RTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l--~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |+++++|++||+++|++. +..+ +++.++||.+|++.|.++++.+.++ |||.+.+.+.++.-... T Consensus 1 msvevkGv~eil~~le~k~g~~~~-~ri~nkAL~~age~v~~~~K~~~~~fkDTG~t~~ev~~s~p~~~----------- 68 (134) T protein:vir:10 1 MSVKVIGDKALERELEKRFGIKEM-VKVQDKALIAGAKVIVEEVKKQLKPSKDTGALINEVSFSKPEWI----------- 68 (134) T ss_pred CeEEEecHHHHHHHHHHhhchhhh-hhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeccEEecCeeec----------- Confidence 889999999999999988 5555 6899999999999999999999986 89998888776543211 Q ss_pred eccccccccccccccc-CCCCCcceehhcccCCcCCCCCcchhH--------HHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 
80 RGVNPRTGNSDNTMKA-NNPRNAFYWRFVELGTANMPAHPFVRP--------AYDTREEEAASVAIARMNQA 142 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~-~~~~~~~y~~f~E~GT~~~~a~PFl~p--------A~~~~~~~~~~~~~~~l~~~ 142 (149) .|...+.+.+ ++.+..++.|+.|||+.++...+|+.| |+++.+..+.+.++++|++- T Consensus 69 ------~G~r~V~vgW~G~~~R~~iiHLNE~Gytr~~~Gk~i~PrG~G~i~~a~~~~e~~~~~~ik~eL~kl 134 (134) T protein:vir:10 69 ------NGKRTITVHWRGSKDRYKIVHLIEYGHVQKGTGKFIKPKAMGGVNRAIRQGQNKYFETLKRELKKL 134 (134) T ss_pred ------CCceEEEEEEEcCCceeEEEEeecccceecccCCccCcchhhHHHHHHHhhhHHHHHHHHHHHhcC Confidence 1222222222 234567889999999999999999999 55555554444444444333 No 80 >protein:vir:102441 Length: 137 # NCBI annotation: gp26 # Family: family:all:1084 # MgeID: mge:1618 # MgeName: Pipefish # Cross-refs: genbank:acc:YP_655303;genbank:gi:109521866;genbank:GeneID:4157756 Probab=99.38 E-value=5.5e-16 Score=104.23 Aligned_cols=107 Identities=20% Similarity=0.234 Sum_probs=68.8 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeec Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~ 81 (149) |..++.++ -....|.+.+ ..+++.+|+..+..++++|+.++|+++|+|++||..........+.+... T Consensus 1 ~~~~~~~~------~~~~~~~~~~-~~v~r~~l~~~a~~v~~~Ak~~aPv~tG~Lr~SI~~~~~~~~~~~~~~~~----- 68 (137) T protein:vir:10 1 MTVTARYE------RNPVGEARQF-QVIARRRLSRITRGTANQARADVPVKTGNLGRSIREDPIVVAGPLRLDSG----- 68 (137) T ss_pred CeeEEEec------cCchhHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCccchhhhcCceeeeeeccccceEEEE----- Confidence 66665554 1122233344 35667888999999999999999999999999998654322221111111 Q ss_pred ccccccccccccccCCCCCcceehhcccCCc------------------------------CCCCCcchhHHHHHHHHHH Q lcl|NC_019767. 82 VNPRTGNSDNTMKANNPRNAFYWRFVELGTA------------------------------NMPAHPFVRPAYDTREEEA 131 (149) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~------------------------------~~~a~PFl~pA~~~~~~~~ 131 (149) ...+..|+.|+||||. 
.++|+|||+||++.++.+. T Consensus 69 ---------------V~~~~~YA~~ve~GT~ph~I~Pk~~k~~l~~~~~g~~vf~k~V~hPG~~a~PfL~~A~~~~~~~~ 133 (137) T protein:vir:10 69 ---------------VTAHADYARYVHDGTRAHVIRPRRPGGVLRFTVGGRVVYARRVNHPGTRARPFLRNAAERVVARE 133 (137) T ss_pred ---------------ecCCCccceeeecCCCCceeeccccceeeeEeeCCeeEecceeecCCCCCCchHHHHHHHhhhhh Confidence 1122334444444442 2459999999999999988 Q ss_pred HHHH Q lcl|NC_019767. 132 ASVA 135 (149) Q Consensus 132 ~~~~ 135 (149) ...- T Consensus 134 ~~~~ 137 (137) T protein:vir:10 134 TATS 137 (137) T ss_pred cccC Confidence 7655 No 81 >protein:vir:9879 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:2718 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795641;genbank:gi:28876400;genbank:GeneID:1257931 Probab=99.36 E-value=1.5e-15 Score=101.81 Aligned_cols=109 Identities=12% Similarity=0.156 Sum_probs=79.5 Q ss_pred hhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh--CCc-------CCCcccccceecccccccCCcccccee Q lcl|NC_019767. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR--APV-------RTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~--aP~-------~~g~l~~~i~~~~~~~~~~g~~~~~~~ 78 (149) |.|+|+|.++|++...+ . ++..++.-...+...|++. +|+ +||.++.||+......+..+... T Consensus 1 i~G~~~L~~~Lk~~s~~---d-vk~VVkkN~ael~~r~q~~~~~pv~~~~k~~dTG~lkRSi~l~~~~~g~~~~vg---- 72 (127) T protein:vir:98 1 MTGMPALEVKLRSMSEK---R-WDRVANKNLTEMFNRAARPPGTPIGKNTKRHKSGELLRSRRLKKVNSSKDVITG---- 72 (127) T ss_pred CcChHHHHHHHHHhhHH---H-HHHHHhhhhHHHHHHHHhccCCceeccccccCcccceeeeEEEEecCCceEEec---- Confidence 99999999999988433 2 3555666666677777775 888 99999999987655444322211 Q ss_pred eecccccccccccccccCCCCCcceehhcccCCcC---------CCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTAN---------MPAHPFVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~---------~~a~PFl~pA~~~~~~~~~~~~~~~l~~ 141 (149) ......-|+-++||||+. 
+|+||||.|||+.-++..++.+.+.+++ T Consensus 73 -----------------p~g~t~dYapyvEyGTR~m~~~~~~gf~~aqp~l~paf~~Qk~iF~~DL~~l~k~ 127 (127) T protein:vir:98 73 -----------------NFGYIKDYAPHVEYGHRIVRNGKQVGYANGTKYLFNNVKKQREIYRQDMLNELRR 127 (127) T ss_pred -----------------cCcccccccceeecceeeeecccccccccCccccccchHHHhHHHHHHHHHHhcC Confidence 111234599999999995 4599999999999999777777777766 No 82 >protein:vir:106041 Length: 137 # NCBI annotation: gp23 # Family: family:all:1084 # MgeID: mge:1505 # MgeName: Cooper # Cross-refs: genbank:acc:YP_654920;genbank:gi:109392376;genbank:GeneID:4157069 Probab=99.35 E-value=1.1e-15 Score=102.68 Aligned_cols=104 Identities=19% Similarity=0.259 Sum_probs=68.0 Q ss_pred ccceeehh-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 2 IETSLDFS-GLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 2 m~~~~~i~-Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |.++++|+ ..++|. +.+ ..+++.+|...+..++.+++.++|+++|+|++||.......... .+ T Consensus 1 m~~s~~i~i~~~~l~-------~~v-~~~~k~~l~~~a~~i~~~ak~~aPv~tG~Lr~SI~~~~~~~~~~-~~------- 64 (137) T protein:vir:10 1 MPVTARIHINEPELE-------RQT-GAIFRGKHRSITRRIATQARADVPVRTGNLGRGIQEMPQTYRPF-HV------- 64 (137) T ss_pred CCeeEEEeeCHHHHH-------HHH-HHHHHHHHHHHHHHHHHHHHHhCCcccchhhcCceeeeeccccc-eE------- Confidence 88887776 334443 333 34567788889999999999999999999999997543221100 00 Q ss_pred cccccccccccccccCCCCCcceehhcccCCc--------------------------CC---CCCcchhHHHHHH---H Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTA--------------------------NM---PAHPFVRPAYDTR---E 128 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~--------------------------~~---~a~PFl~pA~~~~---~ 128 (149) ......+..|+.|+||||. ++ +|||||+||++.. 
+ T Consensus 65 -------------~~~v~~~~~YA~~ve~GT~ph~I~pk~~k~l~f~~~G~~v~~k~v~hpG~~a~Pfl~~A~~~~~~~~ 131 (137) T protein:vir:10 65 -------------GGGVEDNVDYAAPVHEGSRPHRITARHANALHFFWHGREVFRKSVWHPGVRPRPFLRNAARRVVAAD 131 (137) T ss_pred -------------EEEEecCCCceeeeeecCCCceeecccCceeeeeeCCceEEeeeeecCCCCCCchHHHHHHHHhhcc Confidence 0001133457778888773 23 4999999999974 4 Q ss_pred HHHHHH Q lcl|NC_019767. 129 EEAASV 134 (149) Q Consensus 129 ~~~~~~ 134 (149) ++|.-. T Consensus 132 ~ri~~~ 137 (137) T protein:vir:10 132 PDIHMT 137 (137) T ss_pred ccccCC Confidence 444333 No 83 >protein:vir:966 Length: 123 # NCBI annotation: Orf48 # Family: family:all:970 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076620;genbank:gi:13095728;genbank:GeneID:920248 Probab=99.31 E-value=1.4e-14 Score=96.51 Aligned_cols=117 Identities=14% Similarity=0.171 Sum_probs=89.4 Q ss_pred ccceeehhhH-HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 2 IETSLDFSGL-NDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 2 m~~~~~i~Gl-~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |+.+|+|+.| +++.+.|+.+..++. ..+..++.+.|..+.+++++.+|++||.+.++..+.....+ T Consensus 1 m~~~v~id~L~~~i~~~L~~y~~~v~-~~v~~~v~~~a~~~~~~lk~~sP~~TG~yaksW~~k~~~~~------------ 67 (123) T protein:vir:96 1 MANKISIDDLAKTIESEVRNWTKDVV-DDIDDIKKDITKNGVKQLRESSPKRTGDYAKNWTSQKLKNG------------ 67 (123) T ss_pred CCcccchhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhCCccccccccceeeeecCCe------------ Confidence 8889999887 556999999999986 56789999999999999999999999999998654321100 Q ss_pred cccccccccccccccCCCCCcceehhcccC-----CcCCCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 
81 GVNPRTGNSDNTMKANNPRNAFYWRFVELG-----TANMPAHPFVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~G-----T~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~ 141 (149) ..+...+.......|++||| .-+-+|+|||+||++...+.+.+.+.+.|++ T Consensus 68 ----------~~~v~~~~~~y~l~HLLE~GHa~r~GGrV~a~phI~paee~~~~~l~~~i~r~l~~ 123 (123) T protein:vir:96 68 ----------DQVIYQKAPTYRLTHLLENGHAKRNGGRVSPKVHIAPVEEELVSNYISRVEKRLSQ 123 (123) T ss_pred ----------eEEEEEecCCcceEEeeecceeecCCceeCcchhhhHHHHHHHHHHHHHHHHHhcC Confidence 01112222333468999999 3456999999999998888777777766666 No 84 >protein:vir:107545 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1481 # MgeName: PG1 # Cross-refs: genbank:acc:NP_943803;genbank:gi:38638428;genbank:GeneID:2657225 Probab=99.26 E-value=6.6e-15 Score=98.32 Aligned_cols=108 Identities=22% Similarity=0.240 Sum_probs=65.8 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |..+..+++ |......+.+.+ ..+++.+++..+..++.+|+.+||+++|+|++||.......+..+ T Consensus 1 ~~~~~~~~~----~~~~~~~~~~~~-~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~--------- 66 (140) T protein:vir:10 1 MATIRARAR----IEIDEAALERES-GEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFR--------- 66 (140) T ss_pred Ceeeeeeee----eeeCHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCce--------- Confidence 665544433 222233343343 346677788899999999999999999999999975432221110 Q ss_pred cccccccccccccccCCCCCcceehhcccCCc-----------------------------CCCCCcchhHHHHH---HH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTA-----------------------------NMPAHPFVRPAYDT---RE 128 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~-----------------------------~~~a~PFl~pA~~~---~~ 128 (149) +......+..|+.|+||||. .++|||||+||++. 
++ T Consensus 67 ------------~~~~v~~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~ 134 (140) T protein:vir:10 67 ------------VRGGVEATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTND 134 (140) T ss_pred ------------EEEEecCCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhh Confidence 00001122345555555552 35699999999997 45 Q ss_pred HHHHHH Q lcl|NC_019767. 129 EEAASV 134 (149) Q Consensus 129 ~~~~~~ 134 (149) ++|... T Consensus 135 ~~i~~~ 140 (140) T protein:vir:10 135 PRVRMT 140 (140) T ss_pred hhccCC Confidence 666555 No 85 >protein:vir:97982 Length: 140 # NCBI annotation: gp25 # Family: family:all:1084 # MgeID: mge:1482 # MgeName: Orion # Cross-refs: genbank:acc:YP_655121;genbank:gi:109391871;genbank:GeneID:4157345 Probab=99.26 E-value=6.6e-15 Score=98.32 Aligned_cols=108 Identities=22% Similarity=0.240 Sum_probs=65.8 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |..+..+++ |......+.+.+ ..+++.+++..+..++.+|+.+||+++|+|++||.......+..+ T Consensus 1 ~~~~~~~~~----~~~~~~~~~~~~-~~~~~~~~~~~~~~v~~~ak~~aPvdtG~Lr~SI~~~~~~~~~~~--------- 66 (140) T protein:vir:97 1 MATIRARAR----IEIDEAALERES-GEHLRAFHRSLTRRIANQSRVAVPVRTGNLGRTIGELPQVYTPFR--------- 66 (140) T ss_pred Ceeeeeeee----eeeCHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcCCccchhhhccceeeeeeCCCce--------- Confidence 665544433 222233343343 346677788899999999999999999999999975432221110 Q ss_pred cccccccccccccccCCCCCcceehhcccCCc-----------------------------CCCCCcchhHHHHH---HH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTA-----------------------------NMPAHPFVRPAYDT---RE 128 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~-----------------------------~~~a~PFl~pA~~~---~~ 128 (149) +......+..|+.|+||||. .++|||||+||++. 
++ T Consensus 67 ------------~~~~v~~~a~YA~~Ve~GT~ph~I~pk~~k~L~~~~~G~~~~~k~V~hpG~~a~Pfl~~A~~~~~~~~ 134 (140) T protein:vir:97 67 ------------VRGGVEATADYAAPVHEGSRPHAIRARNAQYLHFWWHGREMFRKSVWHPGTRARPFMRNSAQRVVTND 134 (140) T ss_pred ------------EEEEecCCccchhhhccCCCCceeecCCCccceeecCCCEEEeeeeecCCCCCChhHHHHHHHHhhhh Confidence 00001122345555555552 35699999999997 45 Q ss_pred HHHHHH Q lcl|NC_019767. 129 EEAASV 134 (149) Q Consensus 129 ~~~~~~ 134 (149) ++|... T Consensus 135 ~~i~~~ 140 (140) T protein:vir:97 135 PRVRMT 140 (140) T ss_pred hhccCC Confidence 666555 No 86 >protein:vir:9647 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795409;genbank:gi:28876182;genbank:GeneID:1257731 Probab=99.19 E-value=1.6e-13 Score=90.78 Aligned_cols=126 Identities=15% Similarity=0.178 Sum_probs=96.7 Q ss_pred ccceeehhhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHhhCCc--CCCcccccceecccccccCCcccccee Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARAPV--RTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~-l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~g~~~~~~~ 78 (149) |+.--+|.|++||+++|++ |++.-.+++.++||.+||+.+.+.++.+.|+ |||...+.+.++.-... T Consensus 1 ~~~~aevkGv~Eilk~lE~klG~~~v~ri~nkAL~~~ge~v~~~lK~~~~~f~DTG~t~dev~~s~~~~~---------- 70 (132) T protein:vir:96 1 MSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRRE---------- 70 (132) T ss_pred CCccccccCHHHHHHHHHHhhCHHHHHHHhHHHHHHHHHHHHHHHHHhhhhhhhcchhhcceeecCeeec---------- Confidence 5555678899999999999 9986557899999999999999999999996 89998888776543321 Q ss_pred eecccccccccccccccCCCCCcceehhcccCCcCCC-CC--cchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 
79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMP-AH--PFVRPAYDTREEEAASVAIARMNQAIDE 145 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~-a~--PFl~pA~~~~~~~~~~~~~~~l~~~i~k 145 (149) .|...+.+.+ ..+....-|..|||+.+++ |+ -+|..|++.++..+.+.++++|++.|+- T Consensus 71 -------~G~r~V~VgW-~GpR~~ivHLNE~GyGk~~~PrG~G~I~~a~~~se~~~~~~~~~elkk~l~~ 132 (132) T protein:vir:96 71 -------DGIPKVKLGF-TTPRWNIVHLQELEYGWKHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFDG 132 (132) T ss_pred -------CCceEEEecc-cCCceeEEeeecccccCCcCCCcchHHHHHHHhhhhHHHHHHHHHHHHHhcC Confidence 1222222222 2233344788899986653 22 4899999999999999999999999988 No 87 >protein:vir:6246 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813700;swissprot:trembl:q859b7;genbank:gi:29366760;uniprot:Q859B7;genbank:GeneID:1258903 Probab=99.17 E-value=2.1e-13 Score=90.06 Aligned_cols=137 Identities=19% Similarity=0.339 Sum_probs=100.8 Q ss_pred Ccc---ceeehhhHHHHHHHHHHh-HHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccc Q lcl|NC_019767. 1 MIE---TSLDFSGLNDIAKDLEAL-SRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSG 76 (149) Q Consensus 1 Mm~---~~~~i~Gl~~l~~~l~~l-~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~ 76 (149) |.+ ..|.|+|+.++.+.|+++ +.++ .+.++.+.+.+|+++...+++.+|+..-+.+.| +..++|.+... T Consensus 1 ma~~~~~~vrV~Glr~f~~~mrK~~g~dl-~k~lk~a~~~aa~v~~~~ar~~tP~g~r~~~~s------~~~r~G~L~~S 73 (143) T protein:vir:62 1 MAQRSAYTIRVDGLREFQRNVRTLRDKEL-NKAVREANKASGEVLIPQAKHESPDGKRDAKSS------KKYRPGKLDKS 73 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCch-hHHHHHHHHHHHHHHHHHHHhhcCCcccccccc------cccCcchhhcc Confidence 666 689999999999999999 8776 578899999999999999999999953222211 11122332222 Q ss_pred eeeecccccccccccccccCCCCCcceehhcccCCcCCC--CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 
77 VHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMP--AHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~--a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) +.+.. +..-..+..+..+..+|+-|++||+..+. |+-||+.++-++++++.....+.+.+.|++.+.- T Consensus 74 ir~aa-----T~raa~VrAG~~krVPYA~~I~~G~r~r~Isp~rFl~~a~a~te~~~~r~Ye~~i~~vl~k~l~s 143 (143) T protein:vir:62 74 IKVTA-----SAKGAVIKAGSASRVPYAAAIHFGYRARNISPNRFLFRAMARKSDVVAATYERRIAAVVEKYLES 143 (143) T ss_pred ccccc-----cccceeeeeCCcCCCCcccccccCcccccccchhhhhhhhhccCHHHHHHHHHHHHHHHHHHhcC Confidence 22111 11111222333356789999999998766 9999999999999999999999999998888888 No 88 >protein:vir:1332 Length: 143 # NCBI annotation: gp40 # Family: family:all:11660 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047931;swissprot:trembl:q9zxa7;genbank:gi:9631149;uniprot:Q9ZXA7;genbank:GeneID:2715891 Probab=99.14 E-value=3.6e-13 Score=88.78 Aligned_cols=137 Identities=20% Similarity=0.347 Sum_probs=101.8 Q ss_pred Ccc---ceeehhhHHHHHHHHHHh-HHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccc Q lcl|NC_019767. 1 MIE---TSLDFSGLNDIAKDLEAL-SRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSG 76 (149) Q Consensus 1 Mm~---~~~~i~Gl~~l~~~l~~l-~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~ 76 (149) |.+ ..|+|+|+..+.+.|+++ +.++ .+.++.+.+.+|+++...+++.+|+..-+...| +..++|.+... T Consensus 1 ma~~~~~~vkV~Glr~f~~~mrK~~g~dl-~k~lk~a~~~aa~v~~~~ar~~tP~g~~~p~~s------rr~r~G~L~~S 73 (143) T protein:vir:13 1 MAQRSAYTIQVDGLRQFQRNVRALRDKEL-NKAVREANKASGEVLIPQAKHESPDGHRDPKSS------KRYRPGKLDKS 73 (143) T ss_pred CCcccchheehHHHHHHHHHHHHhhCCcc-hHHHHHHHHHHHHHHHHHHHhhcCCcccccccc------cccccchhhcc Confidence 666 689999999999999999 8776 578899999999999999999999983222221 11123333222 Q ss_pred eeeecccccccccccccccCCCCCcceehhcccCCcCCC--CCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 
77 VHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMP--AHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~--a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) +.+.. +..-..+..+.....+|+-|++||+.++. ++-||..++-++++++.....+.+.+.|++.+.- T Consensus 74 ir~aa-----T~raa~VrAGr~arVPYA~~I~~G~r~r~Is~~rFl~~a~a~te~~~~r~Ye~~i~~vl~k~l~s 143 (143) T protein:vir:13 74 IKVTA-----SAKGAVIKAGSAARVPYAAAIHFGYRKRNISANRFLYRAMARKSDVVAATYERRIAAVVEKYLES 143 (143) T ss_pred ccccc-----cccceeeeecCcCCCCcccccccCCcccccchhhhhhhhhhccCHHHHHHHHHHHHHHHHHHhcC Confidence 22211 11111222333345789999999998876 9999999999999999999999999999888888 No 89 >protein:vir:6216 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:10886 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852596;genbank:gi:31415856;genbank:GeneID:1489214 Probab=99.07 E-value=3.2e-13 Score=89.08 Aligned_cols=115 Identities=21% Similarity=0.398 Sum_probs=91.1 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcC----CCcccccceecccccccCCccccce Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVR----TGKLKKNVVVVTQKSRRRGEISSGV 77 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~----~g~l~~~i~~~~~~~~~~g~~~~~~ 77 (149) |..+ =.|+.|+...|..|.+- .+++....|.+||+...+..+.+.|.+ .|++++++.+. T Consensus 1 m~sN--NNGFae~~~~~~tl~kV-d~kvs~e~L~eAA~~f~~KL~P~Ip~Sl~kkk~HlrD~lkVv-------------- 63 (125) T protein:vir:62 1 MASN--NNGFAEALEDINTLLRV-NKKVSLDALDEAAKYFASKLKPKINVSNKNKRTHLRDSLKVV-------------- 63 (125) T ss_pred CCCC--chhHHHHHHHhhhhhhh-hhhhhHHHHHHHHHHHHHhhccccChhhhhhhhhcceeeeEE-------------- Confidence 4444 45999999999999765 588999999999999999999999965 34555554432 Q ss_pred eeecccccccccccccccCCCCCcceehhcccCCcCC------CCCcchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 78 HIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANM------PAHPFVRPAYDTREEEAASVAIARMNQAI 143 (149) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~------~a~PFl~pA~~~~~~~~~~~~~~~l~~~i 143 (149) .-. 
-.+.+.....+|||+|+|.||+++ .+|+|...+|+++++.+.++|.+.+-.++ T Consensus 64 ---vk~-------d~V~V~Fed~a~yW~f~EnGt~~~~~~g~vkaqhf~~~Tf~~nk~kI~~iM~kki~d~m 125 (125) T protein:vir:62 64 ---VKD-------DRVSVEFKDEAWYWYLVEHGHKKAKGKGRVKGKHFVQNTFDAEGDKIADIMAQKIINRM 125 (125) T ss_pred ---eeC-------CeEEEEEcchhhhhhhhhccccccccccccchhhhhhccHHhhHHHHHHHHHHHHHhhC Confidence 111 112233456799999999999997 89999999999999999999998887777 No 90 >protein:vir:102963 Length: 163 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945289;genbank:gi:39653724;uniprot:Q708M3;genbank:GeneID:2672877 Probab=99.07 E-value=2.8e-12 Score=83.95 Aligned_cols=144 Identities=13% Similarity=0.240 Sum_probs=93.0 Q ss_pred ccceeehhhHHHHHHHHHHhHHH-HHHHHHHHHHHHHHHHHHHHHHhhCCcCC-Ccccccce---------ecccccccC Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRA-ENNKVLRDATRAGAEVLKEEVIARAPVRT-GKLKKNVV---------VVTQKSRRR 70 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~-~~~k~~~~Al~~~a~~v~~~ak~~aP~~~-g~l~~~i~---------~~~~~~~~~ 70 (149) |+..|+.++|+++.++|..+... ...+.+...+.+.|..+..+++.+.|+.. ..+..... ........+ T Consensus 1 m~~~~d~~~l~~f~k~l~~~~~~~~~~~~~~~~~~e~a~~ll~~vk~rtPv~~~~~~~~~~~~~~~k~~k~~~~~~~k~t 80 (163) T protein:vir:10 1 MSGGFDYRSFAKFANNFNRNANHAKVDRFMRQTLNYEGTELKSKVKERTPVGVYTDHWVEFTTKDGKHVKFWASAHGKQG 80 (163) T ss_pred CCCccCHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCcccchhhhhhhhhcccchhhhhcccccccc Confidence 99999999999999999987532 12456788999999999999999999832 11100000 001111224 Q ss_pred CccccceeeecccccccccccccccCCCCCcceehhcccCCc-----CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 71 GEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTA-----NMPAHPFVRPAYDTREEEAASVAIARMNQAIDE 145 (149) Q Consensus 71 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~-----~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k 145 (149) |.+..+..+..+....+...+.+ ..+..|+||+|||.. 
+-|.+++|..|.+....++-+.+.+.|.+-|++ T Consensus 81 G~lr~swk~~~~~k~~~~~~v~v----~N~~~YA~~VE~GHR~~~gGfV~G~fml~~s~~~~~~~~~~~~e~~l~~~l~k 156 (163) T protein:vir:10 81 GTLQKGWSKSRIEVSGRTYKQKV----YNKVYYAPHVEYGHKTVNGGFVPGQFFLHKTVEDTKSDMEKRVRDKYDGFMRK 156 (163) T ss_pred chhhccceecceeecCCceEEEE----EecCCccchhhcceeecCCceeccchhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44444444433332222222211 134579999999954 358999999999988887777777777666666 Q ss_pred HhcC Q lcl|NC_019767. 146 VLSK 149 (149) Q Consensus 146 ~~~k 149 (149) .+.= T Consensus 157 ~~~~ 160 (163) T protein:vir:10 157 VVLG 160 (163) T ss_pred hhcC Confidence 6544 No 91 >protein:vir:98636 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:5009 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039927;genbank:gi:126011102;genbank:GeneID:4818472 Probab=98.94 E-value=8.2e-12 Score=81.37 Aligned_cols=127 Identities=15% Similarity=0.161 Sum_probs=94.5 Q ss_pred CccceeehhhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHhhCC--cCCCcccccceecccccccCCccccce Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGV 77 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~-l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~g~~~~~~ 77 (149) -|+.--+|+|++||+++|++ |++.-.+++.++||.++++.|.+..+.+.+ .|||...+.+.++.-... T Consensus 6 ~~~~~aevkGv~Eilk~lE~klG~~~~~ri~nkAL~~~ge~v~~~lK~~~~~fkDTGat~dev~~s~p~~~--------- 76 (138) T protein:vir:98 6 SMSGFANLKGVEELLANMEKKLGPAKVNRVVNRSLKEIGKELEPSFKSAISIYKRTGETTESAVVSGVRRE--------- 76 (138) T ss_pred cccccccccCHHHHHHHHHHhhCHHhhhhhhhHHHHHHHHHHHHHHHhhhhhhhhccceeeeeeecCeeec--------- Confidence 23333467799999999999 888866889999999999999999999998 588888777765443211 Q ss_pred eeecccccccccccccccCCCCCcceehhcccCCcCCC-CC--cchhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 
78 HIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMP-AH--PFVRPAYDTREEEAASVAIARMNQAIDE 145 (149) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~-a~--PFl~pA~~~~~~~~~~~~~~~l~~~i~k 145 (149) .|...+.+.+ ..+....-|..|||+.+++ |+ -+|..|++.++..+...++++|.+.|+- T Consensus 77 --------~G~r~V~igW-~GpR~~ivHLNE~GyGk~i~PrG~G~I~ka~~~se~~y~~~vk~el~k~l~~ 138 (138) T protein:vir:98 77 --------DGIPKVKLGF-TTPRWNIVHLQELEYGWKHNRRGVGVIRRYSDILETIYPRGIRDKLKRGFDG 138 (138) T ss_pred --------CCceEEEEee-ecCeeeEEeeecccccCCcCCCcchHHHHHHHhhhHHHHHHHHHHHHHHhcC Confidence 1222222222 2233444788999987653 22 3899999999999999999999999988 No 92 >protein:vir:78335 Length: 133 # NCBI annotation: gp9 # Family: family:all:589 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468648;genbank:gi:157325225;genbank:GeneID:5601681 Probab=98.86 E-value=2.7e-11 Score=78.52 Aligned_cols=123 Identities=12% Similarity=0.181 Sum_probs=89.5 Q ss_pred ceeehhhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHhhC--CcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 4 TSLDFSGLNDIAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARA--PVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~-l~~~~~~k~~~~Al~~~a~~v~~~ak~~a--P~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |++++.|++||+++|++ |++.-.+++.++||.++++.+.+..+.+. ..|||...+.+..+.-.. T Consensus 1 msvevkGv~eilk~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~------------- 67 (133) T protein:vir:78 1 MSVEVTGVEELERQLVSLFGRENLPQLVDPALIAGATLVAKTLKSEFVQFKDTGASIDEINIEKPSY------------- 67 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHHhhcchhcccceeeeEEecCeee------------- Confidence 89999999999999998 88887789999999999999999999954 568998888776554321 Q ss_pred cccccccccccccccCC-CCCcceehhcccCCcC----CCCCc--chhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANN-PRNAFYWRFVELGTAN----MPAHP--FVRPAYDTREEEAASVAIARMNQAI 143 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~-~~~~~y~~f~E~GT~~----~~a~P--Fl~pA~~~~~~~~~~~~~~~l~~~i 143 (149) ..|.....+.+.. 
......-|+.|||..+ ..|+- -|..|+++++..+.+.++++|.+.| T Consensus 68 ----~~G~r~V~i~W~gp~~R~~iVHLNE~GYtr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~el~k~l 133 (133) T protein:vir:78 68 ----DKGVRSIKIDWKGPKDRYKIIHLNEYGYTRNGKKITPAGTGSVARSLRISERAYRAIVQKKIGDKL 133 (133) T ss_pred ----eCCceEEEEEEecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHHHhhC Confidence 1122222222212 2234457999999532 23443 5888888888888888887777777 No 93 >protein:vir:106506 Length: 137 # NCBI annotation: Pas21 # Family: family:all:1084 # MgeID: mge:1680 # MgeName: phiAsp2 # Cross-refs: genbank:acc:YP_024807;genbank:gi:48697422;genbank:GeneID:2846163 Probab=98.86 E-value=3.9e-12 Score=83.15 Aligned_cols=108 Identities=19% Similarity=0.178 Sum_probs=62.2 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |---+++|+ .. .|+.+ . ..+++.++...+..++.+++.++|++||+|++||.......... .+ T Consensus 1 ~~~~~~~l~-~~----~l~~~---~-~~~~~~~~~~~a~~ve~~ak~~aPv~TG~Lr~SI~~~~~~~~g~-~v------- 63 (137) T protein:vir:10 1 MVAHTLRIE-RA----QLHGL---G-MDEARKAVNRVVRRTFTRSQILAPVDTGYLRASGRLVLGRERGA-VV------- 63 (137) T ss_pred CcccccccC-hh----hHhhH---H-HHHHHHHHHHHHHHHHHHHHhcCCcCchhhhccceeeeeecccc-EE------- Confidence 555555554 11 22222 2 35667888999999999999999999999999997654321100 00 Q ss_pred cccccccccccccccCCCCCcceehhcccCCc--------------------------C---CCCCcchhHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTA--------------------------N---MPAHPFVRPAYDTREEEA 131 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~--------------------------~---~~a~PFl~pA~~~~~~~~ 131 (149) .....++..|+.|+||||. + ++|+|||+||++...++. 
T Consensus 64 -------------~~~V~~~~~YA~~ve~GT~ph~I~pk~~kaL~f~~~G~~vf~k~V~hPG~k~~PfL~~Al~~~~~~~ 130 (137) T protein:vir:10 64 -------------IGSVEYTARYAAAVHNGRRALTIRAKGNGRLKFTVEGRTVYARSVHQPARAGRPYLSQALREVAPQE 130 (137) T ss_pred -------------EEEecCCcccceeeecCCCCceeecCCCccceeecCCeeEeccceecCCCCCChhhHHHHHHhhccc Confidence 0000112234444444442 2 349999999999776642 Q ss_pred HHHHHHHHHHHHHHHhc Q lcl|NC_019767. 132 ASVAIARMNQAIDEVLS 148 (149) Q Consensus 132 ~~~~~~~l~~~i~k~~~ 148 (149) --.+ .++ T Consensus 131 ~~~~----------~~~ 137 (137) T protein:vir:10 131 GFRV----------TIG 137 (137) T ss_pred ceeE----------eeC Confidence 1111 111 No 94 >protein:vir:78644 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429946;genbank:gi:156604000;genbank:GeneID:5525390 Probab=98.85 E-value=3.5e-11 Score=77.90 Aligned_cols=123 Identities=9% Similarity=0.098 Sum_probs=82.7 Q ss_pred ceeehhhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHhhCC--cCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 4 TSLDFSGLNDIAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~-l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |+++++|++||+++|++ |++...+++.++||.++++.|.+..+.+.. .|||...+.+..+.-...... T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~--------- 71 (133) T protein:vir:78 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGS--------- 71 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCC--------- Confidence 89999999999999998 888877899999999999999999999988 688988887765442211100 Q ss_pred cccccccccccccccCC-CCCcceehhcccCCcCC----CCCc--chhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 
81 GVNPRTGNSDNTMKANN-PRNAFYWRFVELGTANM----PAHP--FVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~-~~~~~y~~f~E~GT~~~----~a~P--Fl~pA~~~~~~~~~~~~~~~l~~ 141 (149) +...+.+.+.. ......-|+.|||..+- .|+- -|..|+++.+..+.+.++++|.+ T Consensus 72 ------~~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:78 72 ------QERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ------cceeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 00111222211 22344579999995332 2332 36666666666665555555554 No 95 >protein:vir:94419 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240010;genbank:gi:66395683;genbank:GeneID:5133079 Probab=98.85 E-value=3.5e-11 Score=77.90 Aligned_cols=123 Identities=9% Similarity=0.098 Sum_probs=82.7 Q ss_pred ceeehhhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHhhCC--cCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 4 TSLDFSGLNDIAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~-l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |+++++|++||+++|++ |++...+++.++||.++++.|.+..+.+.. .|||...+.+..+.-...... T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~--------- 71 (133) T protein:vir:94 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGS--------- 71 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCC--------- Confidence 89999999999999998 888877899999999999999999999988 688988887765442211100 Q ss_pred cccccccccccccccCC-CCCcceehhcccCCcCC----CCCc--chhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANN-PRNAFYWRFVELGTANM----PAHP--FVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~-~~~~~y~~f~E~GT~~~----~a~P--Fl~pA~~~~~~~~~~~~~~~l~~ 141 (149) +...+.+.+.. 
......-|+.|||..+- .|+- -|..|+++.+..+.+.++++|.+ T Consensus 72 ------~~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:94 72 ------QERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ------cceeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 00111222211 22344579999995332 2332 36666666666665555555554 No 96 >protein:vir:96973 Length: 133 # NCBI annotation: ORF034 # Family: family:all:589 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239864;genbank:gi:66395542;genbank:GeneID:5133006 Probab=98.85 E-value=3.5e-11 Score=77.90 Aligned_cols=123 Identities=9% Similarity=0.098 Sum_probs=82.7 Q ss_pred ceeehhhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHhhCC--cCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 4 TSLDFSGLNDIAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~-l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |+++++|++||+++|++ |++...+++.++||.++++.|.+..+.+.. .|||...+.+..+.-...... T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~--------- 71 (133) T protein:vir:96 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGS--------- 71 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCC--------- Confidence 89999999999999998 888877899999999999999999999988 688988887765442211100 Q ss_pred cccccccccccccccCC-CCCcceehhcccCCcCC----CCCc--chhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANN-PRNAFYWRFVELGTANM----PAHP--FVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~-~~~~~y~~f~E~GT~~~----~a~P--Fl~pA~~~~~~~~~~~~~~~l~~ 141 (149) +...+.+.+.. 
......-|+.|||..+- .|+- -|..|+++.+..+.+.++++|.+ T Consensus 72 ------~~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:96 72 ------QERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ------cceeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 00111222211 22344579999995332 2332 36666666666665555555554 No 97 >protein:vir:9363 Length: 133 # NCBI annotation: SLT orf 123-like protein # Family: family:all:589 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803341;genbank:gi:29028652;genbank:GeneID:1258087 Probab=98.85 E-value=3.5e-11 Score=77.90 Aligned_cols=123 Identities=9% Similarity=0.098 Sum_probs=82.7 Q ss_pred ceeehhhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHhhCC--cCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 4 TSLDFSGLNDIAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~-l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |+++++|++||+++|++ |++...+++.++||.++++.|.+..+.+.. .|||...+.+..+.-...... T Consensus 1 msvevkGv~eilr~le~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~--------- 71 (133) T protein:vir:93 1 MSVEIKGIPEVLNKLESVYGKQAMQAKSDKALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGS--------- 71 (133) T ss_pred CeEEEecHHHHHHHHHHhcCHhhHHHhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCC--------- Confidence 89999999999999998 888877899999999999999999999988 688988887765442211100 Q ss_pred cccccccccccccccCC-CCCcceehhcccCCcCC----CCCc--chhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANN-PRNAFYWRFVELGTANM----PAHP--FVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~-~~~~~y~~f~E~GT~~~----~a~P--Fl~pA~~~~~~~~~~~~~~~l~~ 141 (149) +...+.+.+.. 
......-|+.|||..+- .|+- -|..|+++.+..+.+.++++|.+ T Consensus 72 ------~~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 72 ------QERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAASERKYREIIKKELAR 133 (133) T ss_pred ------cceeEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 00111222211 22344579999995332 2332 36666666666665555555554 No 98 >protein:vir:95372 Length: 124 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764480;genbank:gi:115334634;genbank:GeneID:5179259 Probab=98.81 E-value=6.3e-11 Score=76.52 Aligned_cols=114 Identities=19% Similarity=0.239 Sum_probs=78.8 Q ss_pred CccceeehhhH-HHHHHHHHHhHHHHHHHHHHHHH----HHHHHHHHHHHHhhCCcCCCcccccceecccccccCCcccc Q lcl|NC_019767. 1 MIETSLDFSGL-NDIAKDLEALSRAENNKVLRDAT----RAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISS 75 (149) Q Consensus 1 Mm~~~~~i~Gl-~~l~~~l~~l~~~~~~k~~~~Al----~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~ 75 (149) |..++ |..| +++.+.|+...+++.+ .+++++ +.+++.+..++++.+|+.+|.+.++..+...... T Consensus 1 M~~i~--id~La~~I~~~L~~Ys~~v~~-~v~~~v~~vak~a~~~lkk~i~~tspkrTG~YaK~W~~kk~~e~------- 70 (124) T protein:vir:95 1 MAKIK--IGRLADEITSQLRKYSQVIAD-DVEQIMDDVTKEAVGRLKSKIQEVGLVQTGDYMRGWTRKRVPNG------- 70 (124) T ss_pred Ccccc--HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhHhcCcccccchhccceeeeecCc------- Confidence 66655 5564 7788899888888754 446666 5555555666667999999999887655433211 Q ss_pred ceeeecccccccccccccccCCCCCcceehhcccCCcC-----CCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 76 GVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTAN-----MPAHPFVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~-----~~a~PFl~pA~~~~~~~~~~~~~~~l~~ 141 (149) . ...+.....-.|++|||..+ -+|+|+|+|+.+...+..++.|.+.|+. 
T Consensus 71 ----------------~-~V~nk~~yqLtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~ 124 (124) T protein:vir:95 71 ----------------W-VIHNKTEYRLAHLLEYGHATVDGGRVPGTPHIRPIEDWLEKEFEDRVEKAIKQ 124 (124) T ss_pred ----------------e-eEEEcCCCceeeeeecceeccCCcccCCccchhHHHHHHHHHHHHHHHHHhcC Confidence 0 11122223359999999664 4799999999998888777777777766 No 99 >protein:vir:93898 Length: 133 # NCBI annotation: ORF028 # Family: family:all:589 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239942;genbank:gi:66395616;genbank:GeneID:5130964 Probab=98.69 E-value=2.6e-10 Score=73.15 Aligned_cols=123 Identities=10% Similarity=0.109 Sum_probs=81.2 Q ss_pred ceeehhhHHHHHHHHHHh-HHHHHHHHHHHHHHHHHHHHHHHHHhhCC--cCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 4 TSLDFSGLNDIAKDLEAL-SRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l-~~~~~~k~~~~Al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |+++++|++||+++|++. ++.-.+++.++||.++++.|.+..+.+.. .|||...+.+..+.-...... T Consensus 1 msvevkGv~eilk~le~k~G~~~~~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGati~ev~~s~p~~~~g~--------- 71 (133) T protein:vir:93 1 MSVEIKGIPEVLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTKVGS--------- 71 (133) T ss_pred CeEEEecHHHHHHHHHHhhCHhhhHhhhhHHHHHHHHHHHHHHHhhhhhhhcccceeeeEEecCeeeccCC--------- Confidence 899999999999999976 55545789999999999999999999988 688988887766532211100 Q ss_pred cccccccccccccccCC-CCCcceehhcccCCcCC----CCCc--chhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANN-PRNAFYWRFVELGTANM----PAHP--FVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~-~~~~~y~~f~E~GT~~~----~a~P--Fl~pA~~~~~~~~~~~~~~~l~~ 141 (149) +...+.+.+.. 
......-|+.|||..+- .|+- -|..|+++++..+.+.++++|.+ T Consensus 72 ------~~rtV~i~W~gp~~R~~iVHLNE~Gytr~Gk~i~PrG~G~i~~a~~~se~~y~~~vk~eL~k 133 (133) T protein:vir:93 72 ------QERAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAANERKYREIIKKELAR 133 (133) T ss_pred ------cceEEEEEeecCCCceeEEEeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 00111222211 22344579999995332 2332 46666776666666555555555 No 100 >protein:vir:80116 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:970 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425608;genbank:gi:155042941;genbank:GeneID:5469542 Probab=98.68 E-value=1.9e-10 Score=73.84 Aligned_cols=117 Identities=20% Similarity=0.269 Sum_probs=78.0 Q ss_pred CccceeehhhH-HHHHHHHHHhHHHHHHHHHHHHHHH----HHHHHHHHHHhhCCcCCCcccccceecccccccCCcccc Q lcl|NC_019767. 1 MIETSLDFSGL-NDIAKDLEALSRAENNKVLRDATRA----GAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISS 75 (149) Q Consensus 1 Mm~~~~~i~Gl-~~l~~~l~~l~~~~~~k~~~~Al~~----~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~ 75 (149) |.. |+|..| +++.+.|+....++.+ .+.+++.+ +++.++.+++..+|+.+|.+.++..+...... T Consensus 1 M~~--i~id~La~~I~~~L~~y~~~v~~-~v~~~v~evak~a~~~lkk~i~~tsPkrTG~YaK~W~~k~~~~~------- 70 (127) T protein:vir:80 1 MAN--IKIDRLGDEITRQLKRYSQVIAG-DLEQIMDDVSKEAVDRLKAKIEEEGLVQTGDYKRGWTRKRTPGG------- 70 (127) T ss_pred Ccc--ccHhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhhhcCccccccccccceeeeccCc------- Confidence 666 456564 7788899999888764 45677744 44455555557999999999887654332110 Q ss_pred ceeeecccccccccccccccCCCCCcceehhcccCCcC-----CCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 
76 GVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTAN-----MPAHPFVRPAYDTREEEAASVAIARMNQAID 144 (149) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~-----~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~ 144 (149) ....+.....-.|++|||..+ -+|+|+|+|+.+...+.+.+.+.+.|+-.=+ T Consensus 71 -----------------~~v~nk~~yqLtHLLE~GHAkr~GGRV~a~pHI~paee~~~~~l~~~i~~~l~~~~~ 127 (127) T protein:vir:80 71 -----------------WVIHNKTEYRLAHLLEYGHATVDGGRVPETPHIRPVEDWLEKEFEDRVERAIKNESR 127 (127) T ss_pred -----------------eeEeecCCcceeehhhcceeccCCcccCCccchhhHHHHHHHHHHHHHHHHhcCCCC Confidence 011122222349999999664 4799999999988777766666666654444 No 101 >protein:vir:104347 Length: 145 # NCBI annotation: conserved phage-related protein # Family: family:all:448 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398975;genbank:gi:81343959;genbank:GeneID:3778879 Probab=98.53 E-value=4e-10 Score=72.09 Aligned_cols=137 Identities=13% Similarity=0.101 Sum_probs=73.9 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccc--------cCCc Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSR--------RRGE 72 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~--------~~g~ 72 (149) |..--- ++-.+...+..+.+.+.+. +...+++.+..+..++...+|+|||.++.|-.++...-. ..|. T Consensus 1 ~~~~m~---~~~sF~~~i~~~~~~ve~~-~~~v~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~G~ 76 (145) T protein:vir:10 1 MARNIG---SVVTFEKSIADWIDRAEDG-FGIVVSNTVIKTANAIVDLSPVDTGRFKANWQISANSPAQQSLNEYDQTGG 76 (145) T ss_pred CCCccc---chhccccCHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhCCccchhhccccceeecccccccccccCCCCc Confidence 222100 0111222333333333222 234567777888888899999999999988766532111 1121 Q ss_pred cccceeeecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 73 ISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAI 143 (149) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i 143 (149) ............-.+.. 
......-..+..|+.++|||+|.|+|..|++.++..- .+++.....+++++| T Consensus 77 ~t~~~~~~~~~~i~~~k-~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~~~~~~-~~~v~~~~~e~k~~~ 145 (145) T protein:vir:10 77 QTKTYLARQARAVANSK-ATSVIYITNRLDYAADLEYGASNQAPAGVLGVVQARL-GRYFQEAVEEARRAI 145 (145) T ss_pred cchhhHHHHHHHhhccc-ccceEEEeeCchhhhHhhccccCCCcchHHHHHHHHH-HHHHHHHHHHhhccC Confidence 11100000000000000 0011112245789999999999999999999999766 555556666777776 No 102 >protein:vir:1087 Length: 161 # NCBI annotation: Orf46 # Family: family:all:1029 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076741;genbank:gi:13095851;genbank:GeneID:920400 Probab=98.52 E-value=1.2e-09 Score=69.42 Aligned_cols=138 Identities=17% Similarity=0.212 Sum_probs=89.2 Q ss_pred Cccceeeh-hhHHHHHHHHHHhHHHH--HHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccce Q lcl|NC_019767. 1 MIETSLDF-SGLNDIAKDLEALSRAE--NNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGV 77 (149) Q Consensus 1 Mm~~~~~i-~Gl~~l~~~l~~l~~~~--~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~ 77 (149) ||.-+--+ ..|+++++++++|..++ .++ .+...+||+++++.....+|...=. ..+.+..+++.++| T Consensus 1 ~~~~~~~fdd~L~~~~~~v~klv~~lt~e~k--akIT~AGAkv~a~~L~~~T~~kHy~--------~~kt~k~~HLADsI 70 (161) T protein:vir:10 1 MMEEKQLFEDIMNGIIFQAESVSTSLTVEDK--AKITKAGANAFAIGLEKVTKDKHYR--------IRKTGENPHLADSI 70 (161) T ss_pred CcchhHHHHHHHHHHHHHHHhhcCCCCHHHH--HHHHHHhHHHHHHHHHHHhhhhcCc--------CCCCCCcchhhhhe Confidence 88865444 56888899998887553 334 4678899999999999999874211 12233445565555 Q ss_pred eeecccccccccccc-cccCCCCCcceehhcccCC-----------------cCCCCCcchhHHHH--HHHHHHHHHHHH Q lcl|NC_019767. 78 HIRGVNPRTGNSDNT-MKANNPRNAFYWRFVELGT-----------------ANMPAHPFVRPAYD--TREEEAASVAIA 137 (149) Q Consensus 78 ~~~~~~~~~~~~~~~-~~~~~~~~~~y~~f~E~GT-----------------~~~~a~PFl~pA~~--~~~~~~~~~~~~ 137 (149) .+...... |...+. 
.+++.+..++.+||++-|| .+|++-||+..+-+ +.+++|+++..+ T Consensus 71 ~~~~~niD-g~~dG~StVGw~~kka~ia~~indGtr~~~~~~~~~~~~n~Gt~~i~gDHFvd~~r~~~~~k~aV~~Ae~~ 149 (161) T protein:vir:10 71 LVQNTNID-GIKDGNSTVGWDYTKSRVGHLIENGTRFPMYSKKGTKYRKGGQVAITSDPFVSTYRDSMEAQVAMFSAEAE 149 (161) T ss_pred eecccccC-cccCCceeccccCchhhhhhhhcccchhhhhhcccccccCCcceeecCcchhHHHHhhhhhHHHHHHHHHH Confidence 55433222 222222 2233233466666666665 66999999999999 577999998888 Q ss_pred HHHHHHHHHhcC Q lcl|NC_019767. 138 RMNQAIDEVLSK 149 (149) Q Consensus 138 ~l~~~i~k~~~k 149 (149) ++++-|++-=+- T Consensus 150 ~y~eil~~k~~~ 161 (161) T protein:vir:10 150 VFSEILKKKGAE 161 (161) T ss_pred HHHHHHHhhcCC Confidence 887776654333 No 103 >protein:vir:79638 Length: 146 # NCBI annotation: gp40 # Family: family:all:448 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285529;genbank:gi:148734512;genbank:GeneID:5219996 Probab=98.52 E-value=8.9e-10 Score=70.19 Aligned_cols=137 Identities=17% Similarity=0.169 Sum_probs=80.1 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceeccccccc--------CCc Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRR--------RGE 72 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~--------~g~ 72 (149) |++-+ +.++.+.+.++.+.+.. .+..++++.+..+..++..++|||||.++.|-.++...-.. .|. T Consensus 1 ma~~~-----~~sFa~~i~~~~~~ve~-~~~~~~r~~a~~i~~~vv~~sPVDTGr~Ranw~vs~~~~~~~~~~~~dp~G~ 74 (146) T protein:vir:79 1 MADYS-----IREFHGNVDKWIEQVES-GLNDVIQIFGEKVHGALVDIAPVDTGRFKANMQITANKPPLYALNQYDPDGE 74 (146) T ss_pred CCcch-----hHHHHHhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCcccccccCCCCCCc Confidence 66644 33566777777777643 34567888888999999999999999999987765432111 111 Q ss_pred cccceeeeccc-ccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019767. 
73 ISSGVHIRGVN-PRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVL 147 (149) Q Consensus 73 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~ 147 (149) .+......... ...+. .......-..+..|+.++|||+|.|+|..|++.++.+- .+++.....++++. ..+ T Consensus 75 ~t~~~~~~~i~~~~~g~-~~~~~iyi~NnlpYA~~LEyG~S~QAP~G~v~~~~~~~-~~~v~~a~~e~k~~--~~l 146 (146) T protein:vir:79 75 KIKAEGRRTLYALLHGG-GAIKSIYFSNMLIYANALEYGHSKQAPAGVFGIVAIRL-RSYMAEAIREARKK--NAL 146 (146) T ss_pred ccHHHHHHHHHHHHhcc-cccceeEEeeCchhhhhhhccccCCCcchHHHHHHHHH-HHHHHHHHHHHHhh--ccC Confidence 00000000000 00000 00011111245789999999999999999999999754 44444444444442 222 No 104 >protein:vir:7412 Length: 168 # NCBI annotation: hypothetical protein # Family: family:all:1029 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839929;genbank:gi:30089899;genbank:GeneID:1260686 Probab=98.49 E-value=1.8e-09 Score=68.57 Aligned_cols=137 Identities=14% Similarity=0.190 Sum_probs=90.5 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |.++. ..|++.+.++++|..++..+--.+...+||+++++.....+|...-. ..+.+..+++.+.|.+. T Consensus 1 M~~~~---~~l~~~~~~vekl~~~lt~eqkakITkAGAkv~~~~L~~~t~~kHy~--------~k~t~~~~HLaDsI~~~ 69 (168) T protein:vir:74 1 MATFE---EAMQLIINQAESLSTKMTVEDKAEVTKAGAKVFEQALAYEVRNRHYR--------HRDTGEDPHLADSIVMK 69 (168) T ss_pred CccHH---HHHHHHHHHHHhhccCCCHHHHHHHHHhhhHHHHHHHHHHhHHhhcc--------cCCCcccchhhhheeec Confidence 77766 55778888998887664222224678899999999999988853211 11223334555555444 Q ss_pred ccccccccccccc-ccCC-------CCCcceehhcccCCc------------------CCCCCcchhHHHHH--HHHHHH Q lcl|NC_019767. 
81 GVNPRTGNSDNTM-KANN-------PRNAFYWRFVELGTA------------------NMPAHPFVRPAYDT--REEEAA 132 (149) Q Consensus 81 ~~~~~~~~~~~~~-~~~~-------~~~~~y~~f~E~GT~------------------~~~a~PFl~pA~~~--~~~~~~ 132 (149) ..... +...+.. +++. ...++.++|++.||. +|++-||+..+-+. .++.|+ T Consensus 70 ~~niD-g~~dG~s~VGf~~k~~~~~~~kA~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~~~~~k~~V~ 148 (168) T protein:vir:74 70 NKNID-GVKDGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRMNLIVQQGIL 148 (168) T ss_pred ccccC-cccCCceeecccccccccccchhhhhhhhcccccccccccccccccccccccccccchhHHHHHhhhhhHHHHH Confidence 43221 1111111 2222 225788999999995 68999999999998 679999 Q ss_pred HHHHHHHHHHHHHHhcC Q lcl|NC_019767. 133 SVAIARMNQAIDEVLSK 149 (149) Q Consensus 133 ~~~~~~l~~~i~k~~~k 149 (149) ++..+++++-|++--+- T Consensus 149 ~Ae~~~y~eIl~~k~~~ 165 (168) T protein:vir:74 149 KAEAEAMRKIINRKKKE 165 (168) T ss_pred HHHHHHHHHHHHhhcCC Confidence 99998888888776555 No 105 >protein:vir:107703 Length: 147 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003902;genbank:gi:45686318;genbank:GeneID:2773043 Probab=98.46 E-value=2.1e-09 Score=68.17 Aligned_cols=136 Identities=17% Similarity=0.217 Sum_probs=77.8 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccc--------cCCc Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSR--------RRGE 72 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~--------~~g~ 72 (149) |++-++ .++...+.++.+.+... +...+++.+..+..++..++|+|+|.++.|-.++...-. ..|. 
T Consensus 1 ma~~~~-----~~F~~~i~~~~~~ve~~-~~~~~r~~a~~i~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~dp~g~ 74 (147) T protein:vir:10 1 MANYQI-----RRFQGEIDAWINAAEST-LEHAIEIFVRDVHDALVSRSPVDTGRFKGNWQITFNEIPNHALNRYDKTGG 74 (147) T ss_pred CCCcch-----hhhhhhHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhCCCcchhhccccceeecCccccccCCcCCCcc Confidence 776553 35666677777776443 456788888899999999999999999998776533211 1111 Q ss_pred cccceeeec---ccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019767. 73 ISSGVHIRG---VNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVL 147 (149) Q Consensus 73 ~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~ 147 (149) .+....... +..+.+... ...-..+..|+.++|||+|.|+|..|++-++..-.. ++.....++++.= .++ T Consensus 75 ~t~a~~~~~~~~~~~~~~~~~---~iyi~Nn~pYA~~LEyG~S~QAP~G~V~~t~q~~~~-~v~~~~~e~k~~~-~~~ 147 (147) T protein:vir:10 75 VVRGEEQAKTYGMFSRGGAIT---SVHFSNMLIYANALEYGHSQQAPSGVVGLVALRLRS-YMADAIKQARRQQ-NAL 147 (147) T ss_pred chhhhhhHHHHHHhhhccCcc---eEEEeeCcchhhhhhccccCCCCchHHHHHHHHHHH-HHHHHHHHHHhhh-ccC Confidence 111110000 000001111 111124568999999999999999999988864432 2222222222211 111 No 106 >protein:vir:99833 Length: 190 # NCBI annotation: hypothetical protein # Family: family:all:274 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164071;genbank:gi:56692603;genbank:GeneID:3192561 Probab=98.42 E-value=1.6e-09 Score=68.84 Aligned_cols=132 Identities=16% Similarity=0.142 Sum_probs=80.5 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC-----CcCCCcccccceec---------ccc Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA-----PVRTGKLKKNVVVV---------TQK 66 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~a-----P~~~g~l~~~i~~~---------~~~ 66 (149) ||.++|+|. +++|.+.|+.|...+.+ .+..++.-|+.++...+++- |.. ......... ... 
T Consensus 1 M~~i~i~~d-~~~~~~~L~~l~~~~~~--~~~l~~~ig~~l~~~~~~rf~~~~~PdG--~~W~p~~~~t~~rk~~~~~~~ 75 (190) T protein:vir:99 1 MAGITLEWD-GRRALDVLNAGSAALGD--PSGLLQDIGELLLNIHRRRFQAQVSPDG--TPWQPLSPAYLRRKRKNRDKI 75 (190) T ss_pred CceeEEEec-HHHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHhcCCCCC--CCCccccHHHHHHhhcCCCcc Confidence 999999997 57899999999877653 35677778888777776653 432 111111000 000 Q ss_pred cccCCccccceeeecccccccccccccccCCCCCcceehhcccC------------------------------------ Q lcl|NC_019767. 67 SRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELG------------------------------------ 110 (149) Q Consensus 67 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~G------------------------------------ 110 (149) -..+|.+...+... .+.....+ .++..|+..++|| T Consensus 76 L~~tg~L~~Si~~~-----~~~~~v~v----Gtn~~yA~iHq~Gg~i~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 146 (190) T protein:vir:99 76 LTLDGHLRNLLRYQ-----LDGSELLF----GSDRPYAAIHHFGGTIQRQARSSTVYFRQNERTGEVGREFVPRRRSNFA 146 (190) T ss_pred ceecHHHHHHHhhe-----ecCcEEEE----ecCcchhhhhhcCCcccccccchhhhhhhhhhhhhhhcccccccccccc Confidence 11123333322211 11111111 2345677777777 Q ss_pred --------CcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019767. 111 --------TANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVL 147 (149) Q Consensus 111 --------T~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~ 147 (149) |.++|++|||.-. 
++.++++.+.+.+.|.+.|.+++ T Consensus 147 ~~~~~~~~~v~IPaRpfLG~s-~~d~~~I~~~i~~~l~~~~~~~~ 190 (190) T protein:vir:99 147 QDVQIGPYTIQMPARPWLGTS-SQDDDTILQRVERYLQRALRERA 190 (190) T ss_pred hhcccccceeeecCcccCCCC-HHHHHHHHHHHHHHHHHHHhhcC Confidence 3457999999766 56778888888888888888888 No 107 >protein:vir:103280 Length: 142 # NCBI annotation: phage-related hypothetical protein # Family: family:all:448 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277459;genbank:gi:71834102;genbank:GeneID:3562391 Probab=98.39 E-value=2.1e-09 Score=68.11 Aligned_cols=134 Identities=13% Similarity=0.134 Sum_probs=77.4 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceeccccccc--------CCc Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRR--------RGE 72 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~--------~g~ 72 (149) |.+-.+ .+...+..+.+.+.. .....+++.+..+..++..++|+|||.++.|-.++...-.. .|. T Consensus 1 Ma~~~~------sf~~~i~~~~~~ve~-~~~~v~r~~a~~i~~~vv~~sPVdTGr~R~nw~vs~~~~~~~~~~~~d~~G~ 73 (142) T protein:vir:10 1 MANDVV------SFRNSINAWIDGVTE-GVELIVEGTLTKATKDIVKLSPVDTGRFRGNWQATGNSPAAQSLNNYDPDGN 73 (142) T ss_pred Cccchh------hhhccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhCcccchhhcccceeeecCcccccccCcCCCCc Confidence 554212 233444555555432 33456777788888888999999999999988765432211 111 Q ss_pred cccceeeecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 
73 ISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAI 143 (149) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i 143 (149) ............- ..........-..+..|+.++|||.|.|+|..|++.++.+- .++++....++++.| T Consensus 74 ~t~~~~~~~~~~i-~~~~~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~a~q~~-~~~v~~a~~e~~~~~ 142 (142) T protein:vir:10 74 ETRNSLRRQIYAL-ARDANTNVIYISNRLDYAQGLEFGSSNQAPSGVLGVVQKRL-GRYFAEAVQEAKRAL 142 (142) T ss_pred cchhhHHHHHHHh-hhccccceEEEeeCcchhhhhhccccCCCcchHHHHHHHHH-HHHHHHHHHHhhccC Confidence 1111000000000 00000011112245789999999999999999999999644 556666666666666 No 108 >protein:vir:1028 Length: 168 # NCBI annotation: Orf48 # Family: family:all:1029 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076682;genbank:gi:13095791;genbank:GeneID:920342 Probab=98.32 E-value=5.6e-09 Score=65.83 Aligned_cols=137 Identities=14% Similarity=0.196 Sum_probs=86.6 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |.++.-. |++++.++++|+-....+.-.+...+||+++++.....+|...-. ..+.+.-+++.+.|.+. T Consensus 1 M~~~~d~---l~~~~~~vekl~~~ls~eqkakITkAGAkv~~~~L~~~tk~kHy~--------~k~t~~~~HLaDsI~~~ 69 (168) T protein:vir:10 1 MVSFYDA---MQLIVDRAEELSTKMSVEDKAEVTKAGAKVFEQALAYEVRNRHYR--------HRDTGEDPHLADSIVMK 69 (168) T ss_pred CCcHHHH---HHHHHHHHHHhhcCCCHHHHHHHhHhhhHHHHHHHHHHhhHhhhc--------cCCCCccchhhhhheec Confidence 7666544 556777777763222222224678899999999999999853211 11223334555555444 Q ss_pred cccccccccccc-cccCC-------CCCcceehhcccCCc------------------CCCCCcchhHHHHH--HHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNT-MKANN-------PRNAFYWRFVELGTA------------------NMPAHPFVRPAYDT--REEEAA 132 (149) Q Consensus 81 ~~~~~~~~~~~~-~~~~~-------~~~~~y~~f~E~GT~------------------~~~a~PFl~pA~~~--~~~~~~ 132 (149) ..... 
+...+. .+++. ...++.++|++.||. +|++-||+..+-+. .++.|+ T Consensus 70 ~~niD-g~~dG~s~VGf~~k~~~~~~~ka~iAr~lNDGTk~~~~~~~~~~~~~~~g~v~i~gDHFvd~~r~d~a~k~~V~ 148 (168) T protein:vir:10 70 NKNID-GVKDGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKKPGEVAVHADHFIEETRKNPIVQQGIL 148 (168) T ss_pred ccccc-cccCCceeecccCccccccccchheeeeccccccccccccccccccccccccccccchhHHHhhhchhhhHHHH Confidence 33221 111111 12222 236788999999995 58999999999996 578899 Q ss_pred HHHHHHHHHHHHHHhcC Q lcl|NC_019767. 133 SVAIARMNQAIDEVLSK 149 (149) Q Consensus 133 ~~~~~~l~~~i~k~~~k 149 (149) ++..+++++-|++--+- T Consensus 149 ~Ae~~~y~eIl~~k~~~ 165 (168) T protein:vir:10 149 KAEAEAMRKIINRKKKE 165 (168) T ss_pred HHHHHHHHHHHHhhcCC Confidence 99988888888776555 No 109 >protein:vir:79091 Length: 175 # NCBI annotation: gp5, phage virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111205;genbank:gi:134288802;genbank:GeneID:4960765 Probab=98.30 E-value=7.1e-09 Score=65.26 Aligned_cols=137 Identities=12% Similarity=0.186 Sum_probs=80.8 Q ss_pred Ccc-ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh-----CCcCCCcccccceec----------- Q lcl|NC_019767. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVVV----------- 63 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~~----------- 63 (149) |.. ++|+|++ +++.+.|++|...+.+ .+.+++.-|+.++.....+ .|..+.-....+... T Consensus 1 Ms~~i~i~~d~-~~~~~~L~~l~~~~~d--~~~lm~~Ig~~l~~~t~~rF~~~~~PdW~pls~~t~~~r~~~~~~~~~~~ 77 (175) T protein:vir:79 1 MSDFVNFQIDD-SALRTRLLQLEQAGHQ--KADAMRKITQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNG 77 (175) T ss_pred CceEEEEEech-HHHHHHHHHHHHHhcC--HHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCChHHHHhhccccccccccc Confidence 444 5777776 7899999999877643 3567777788888777764 343211110000000 Q ss_pred ---c---------cccccCCccccceeeecccccccccccccccCCCCCcceehhcccCCc-------CCCCCcchhHHH Q lcl|NC_019767. 
64 ---T---------QKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTA-------NMPAHPFVRPAY 124 (149) Q Consensus 64 ---~---------~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~-------~~~a~PFl~pA~ 124 (149) . ..-..+|.+...+.... +.....+ +++..|+.++.||+. .+||+|||.=.- T Consensus 78 ~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~-----~~~~v~v----Gtn~~YAaiHqfGg~~~~~~~v~IPARPfLG~s~ 148 (175) T protein:vir:79 78 ELTAAASRRKAGLMILQDSGQMAASTATDS-----GEDYSVI----GSNKEYAAIQHFGGQAGRGLKVTIPGRAWLPVTA 148 (175) T ss_pred cchhhHhhhccCCCcceechhhhhhhhhee-----cCCEEEE----ecCcchhhHhhcccccCCCcccccCcccccCCCc Confidence 0 00001222222222111 1111111 234579999999986 699999998543 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 125 -DTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 125 -~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) ++-..++.+.|.+.+.+.|++++++ T Consensus 149 ~de~~~~~~~~I~~~i~~~l~~a~~~ 174 (175) T protein:vir:79 149 DGELQPEAVEPVLNTILRHLMDAANR 174 (175) T ss_pred ccchhHHHHHHHHHHHHHHHHHHhcc Confidence 3345677788888888888888888 No 110 >protein:vir:94994 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224022;genbank:gi:62327309;genbank:GeneID:5176822 Probab=98.26 E-value=5.5e-09 Score=65.85 Aligned_cols=123 Identities=16% Similarity=0.201 Sum_probs=69.3 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccc--------cCCcc Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSR--------RRGEI 73 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~--------~~g~~ 73 (149) |++..+|. ++..+.+.- +..++++.+..+..++...+|+|+|.++.|-.++..... ..|.. 
T Consensus 1 msF~~~i~---~~~~~ve~~--------~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~ 69 (131) T protein:vir:94 1 MSFALDVT---RFVEKAKKN--------PEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGSTPADGTTDATDKSGNT 69 (131) T ss_pred CCcccCHH---HHHHHHHHH--------HHHHHHHHHHHHHHHHHHhCCCchhhhhccchhccccccccccCCCCCCchh Confidence 66655533 444433322 234555666777777888999999999998766542111 11111 Q ss_pred ccceeeecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 74 SSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMN 140 (149) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~ 140 (149) ........+. ...... ...-..+..|+.++|||+|.|+|..|++.++..- .+++.....+++ T Consensus 70 t~~~~~~~i~-~~~~g~---~iyi~Nn~pYA~~LEyG~S~QAP~g~v~~~~~~~-~~~v~~~~~e~k 131 (131) T protein:vir:94 70 ATGNATSFVL-NAADWH---TFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRF-QQLLNEEASKVK 131 (131) T ss_pred hHHHHHHHHh-hccccc---eEEEeeCchhhhhhhccccCCCcchHHHHHHHHH-HHHHHHHHHhcC Confidence 0000000000 000111 1112345789999999999999999999999754 444444444554 No 111 >protein:vir:94944 Length: 121 # NCBI annotation: hypothetical protein phage protein # Family: family:all:448 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239282;genbank:gi:66392064;genbank:GeneID:5076589 Probab=98.24 E-value=3e-09 Score=67.33 Aligned_cols=113 Identities=18% Similarity=0.180 Sum_probs=69.9 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccc--------cCCc Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSR--------RRGE 72 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~--------~~g~ 72 (149) ||.|++..+ ++++..+++.- +...++..+..+.+.+...+|+|+|.++.|-.++..... ..|. 
T Consensus 1 ~~~~sf~~~-i~~~~~~ve~~--------~~~~~r~~~~~~~~~vv~~sPVdtGrfRanw~vs~~~p~~~~~~~~dp~g~ 71 (121) T protein:vir:94 1 MISMKFNVN-LSRLRSNLREE--------AKKKAIRIAQEIVNGVIARSPVLAGDYRSSWNVSEGSMEFKFNNGGNPANP 71 (121) T ss_pred Cccchhhcc-HHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHhcCCchhhhhccccccccCcccccCCCCCCCcc Confidence 999998766 55555544433 233455566677778889999999999988765432111 1111 Q ss_pred cccceeeecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHH Q lcl|NC_019767. 73 ISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTRE 128 (149) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~ 128 (149) .+....+.... +.... ..-..+..|+..+|||+|.|+|..|++.++.+-+ T Consensus 72 ~t~~~~~~~~~---~~~~~---iyi~NnlpYA~~LE~G~S~QAP~G~v~~t~~~~q 121 (121) T protein:vir:94 72 TPAPAIVVSSN---VALPH---FYITNGAPYAQQLEKGSSTQAPLGIVRVTLASLR 121 (121) T ss_pred hhHHHHHHHHh---hccce---EEEeeCcchhhhhhcccCCCCcchHHHHHHHhhC Confidence 11111111100 11111 1122456899999999999999999999998776 No 112 >protein:vir:78380 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:448 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110844;genbank:gi:134288605;genbank:GeneID:5179643 Probab=98.21 E-value=8.7e-09 Score=64.77 Aligned_cols=123 Identities=16% Similarity=0.168 Sum_probs=69.6 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceeccccccc--------CCcc Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRR--------RGEI 73 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~--------~g~~ 73 (149) |++..+|. ++..+.+.- +...+++.+..+..++...+|+|+|.++.|-.++...-.. .|.. 
T Consensus 1 msf~~~i~---~~~~~ve~~--------~~~~~r~~a~~~~~~iv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~ 69 (131) T protein:vir:78 1 MSFALDVS---KFVEKAKKN--------PEKVIRQVSIKLFSAIIKASPVDTGRFRMNWMASGGTPADGTTDATDKAGTT 69 (131) T ss_pred CCcCcCHH---HHHHHHHHH--------HHHHHHHHHHHHHHHHHHhCCCchhhhccccceecccccccccCCCCCCchh Confidence 66655543 444333322 2345566677777778889999999999988765432111 1111 Q ss_pred ccceeeecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 74 SSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMN 140 (149) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~ 140 (149) ........+. + ........-..+..|+.++|||.|.|+|..|++.++..- .++++....+++ T Consensus 70 t~~~~~~~i~---~-~~~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~~~~~~-~~~v~~~~~e~k 131 (131) T protein:vir:78 70 ATSNAANFVL---N-AADWHTFTLTNNLPYAQRLEYGWSQQAPQGFVRVNVSRF-QQLLNEEASKVK 131 (131) T ss_pred hHHHHHHHHh---h-ccCCceEEEeeCchhhhHhhccccCCCcchHHHHHHHHH-HHHHHHHHHhcC Confidence 0000000000 0 000011112345789999999999999999999999754 444444444554 No 113 >protein:vir:101563 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958111;genbank:gi:41057657;genbank:GeneID:2716820 Probab=98.19 E-value=3.6e-09 Score=66.86 Aligned_cols=103 Identities=17% Similarity=0.153 Sum_probs=54.0 Q ss_pred ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeeccc Q lcl|NC_019767. 4 TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVN 83 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~ 83 (149) |++.-+||++|+++|+... ++-..|.+.+.. +. .|.. 
..+ T Consensus 1 m~v~r~~L~~~~~~l~~~~----------------------V~VGi~~~a~y~-d~----------~g~~-~~~------ 40 (155) T protein:vir:10 1 MSVTRRGLTLPKDRYKSMS----------------------VKAGVLAGATYP-DE----------SGKK-LAD------ 40 (155) T ss_pred CcchHHHHHHHHHHhhCCe----------------------eEEeecCCCCCC-cc----------ccch-hhh------ Confidence 7777788887776665420 111111111000 00 0000 000 Q ss_pred ccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHH--HHHhcC Q lcl|NC_019767. 84 PRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAI--DEVLSK 149 (149) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i--~k~~~k 149 (149) +.........+.+.+.++.++||||.+.||||||+|++++++++..+.+...+...+ ++++.. T Consensus 41 ---g~~~~~~~~~G~pva~ia~~~e~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~ 105 (155) T protein:vir:10 41 ---GTILKKDPRAGLPVAMIAMALNYGTSKLPARPFMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQ 105 (155) T ss_pred ---hhhhccccccCcchhhhhhhhhcCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH Confidence 000000011122346688899999999999999999999999988877766554432 122222 No 114 >protein:vir:3994 Length: 168 # NCBI annotation: unknown # Family: family:all:1029 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116502;genbank:gi:14251135;genbank:GeneID:921309 Probab=98.18 E-value=2.2e-08 Score=62.56 Aligned_cols=135 Identities=15% Similarity=0.233 Sum_probs=87.5 Q ss_pred CccceeehhhHHHHHHHHHHhHHHH--HHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCcccccee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAE--NNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~--~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~ 78 (149) |.++.- .|++++.++++|..++ .++ .+...+||+++++.....+|...-. ..+.+..+++.+.|. 
T Consensus 1 M~~~~d---~l~~~~~~v~kl~~~lt~e~k--akIT~AGAkv~a~~L~~~T~~kHy~--------~rktg~~~HLADsI~ 67 (168) T protein:vir:39 1 MVSFYD---AMQLIINQAESLSTKMTVEDK--AEVTKAGAKVFEQALAYEVRNRHYR--------HRDTGEDPHLADSIV 67 (168) T ss_pred CccHHH---HHHHHHHHHHhccCCCCHHHH--HHHHHHhHHHHHHHHHHHhHHhccc--------CCCCCCCccchhhee Confidence 666653 4667888888876443 333 4678899999999998888753211 112334456666665 Q ss_pred eecccccccccccc-cccCCC-------CCcceehhcccCCc------------------CCCCCcchhHHHHH--HHHH Q lcl|NC_019767. 79 IRGVNPRTGNSDNT-MKANNP-------RNAFYWRFVELGTA------------------NMPAHPFVRPAYDT--REEE 130 (149) Q Consensus 79 ~~~~~~~~~~~~~~-~~~~~~-------~~~~y~~f~E~GT~------------------~~~a~PFl~pA~~~--~~~~ 130 (149) +...... +...+. .+++.+ ..++.++|++-||. +|++-||+..+-+. .+++ T Consensus 68 ~~~~niD-g~~dG~StVGw~~k~~~~~~~~a~iAr~lNDGTrf~~~~~~~~~~y~~~g~v~i~gDHFvd~~r~~~a~k~a 146 (168) T protein:vir:39 68 MKNKNID-GVKDGQSVVGWERSTEKGTHTKGYIANIINNGSRFPQFTTRSGRKYKNPGEVAVHADHFIEETRKNPIVQQG 146 (168) T ss_pred ecccccC-cccCCceeccccCccccccccchhheehhccccccchhhhhcccccccccceeecccchhHHHhhhhhhhHH Confidence 5444322 222222 222222 36788999999994 68999999999996 4788 Q ss_pred HHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 131 AASVAIARMNQAIDEVLSK 149 (149) Q Consensus 131 ~~~~~~~~l~~~i~k~~~k 149 (149) |+++..+++++-|++--+- T Consensus 147 V~~Ae~e~~~eil~~k~~~ 165 (168) T protein:vir:39 147 ILKAEAEAMRKIINRKKKE 165 (168) T ss_pred HHHHHHHHHHHHHHhcCCC Confidence 8888888887777654443 No 115 >protein:vir:3163 Length: 145 # NCBI annotation: unknown # Family: family:all:28417 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665934;genbank:gi:22091120;genbank:GeneID:951270 Probab=98.14 E-value=1.6e-08 Score=63.38 Aligned_cols=131 Identities=18% Similarity=0.196 Sum_probs=64.4 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh-----CCcCCCcccccceeccc-------ccc Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVVVTQ-------KSR 68 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~~~~-------~~~ 68 (149) |... .+++.+.|++|...+. .+|...+..+.+++..+ .|. |..+.....++. .-. T Consensus 1 ~i~~------~~~i~~~l~~l~~~~~-----~~l~~i~~~~~~~~~~rf~~~~~p~--G~~W~pLs~st~a~k~~~~~L~ 67 (145) T protein:vir:31 1 MVED------ENNIPEAREAIQDGLT-----DGLERLHTITLRELITNMSDGQDAL--GNPWEPLKESTIRAKGSDTPLI 67 (145) T ss_pred Cccc------HHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHhcCCCC--CCCCcccChHHHHHhcCCCCCc Confidence 4332 3345555555544432 33444555555554443 333 222221111110 011 Q ss_pred cCCccccceeeecccccccccccccccCCCCCcceehhcccCCcC--CCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 69 RRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTAN--MPAHPFVRPAYDTREEEAASVAIARMNQAIDEV 146 (149) Q Consensus 69 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~--~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~ 146 (149) .+|.+...+.... ...........+++..|+.+++||+.+ +||||||.++.....+++.+.+.+.+...|+-+ T Consensus 68 ~tG~L~~Si~~~~-----~~~~~~~~a~vGtn~~YA~~hqfG~~~~~IPaRPfLG~~~~~~~~~~~~ii~~~i~~~L~~~ 142 (145) T protein:vir:31 68 DNSRLLTDINAAS-----MMDRANRMAVIGTNLDYAEHHEFGAPEAGIPARPIFGPAGAYASQQAPDVIGDEIDTNLEGA 142 (145) T ss_pred cCHHHHHHHHHHh-----hhcccCceeEecCCchhhhhhccCCcccccCCCCccCCCccchHHHHHHHHHHHHHHHhhhh Confidence 1222222111100 000000111123556899999999986 999999999987777777777776666666655 Q ss_pred hcC Q lcl|NC_019767. 
147 LSK 149 (149) Q Consensus 147 ~~k 149 (149) +-- T Consensus 143 ~~~ 145 (145) T protein:vir:31 143 VID 145 (145) T ss_pred ccC Confidence 555 No 116 >protein:vir:96012 Length: 133 # NCBI annotation: ORF023 # Family: family:all:589 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239805;genbank:gi:66395471;genbank:GeneID:5132929 Probab=98.12 E-value=4.6e-08 Score=60.81 Aligned_cols=122 Identities=14% Similarity=0.103 Sum_probs=84.4 Q ss_pred CccceeehhhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHhhCC--cCCCcccccceecccccccCCccccce Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGV 77 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~-l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~g~~~~~~ 77 (149) |- +|.|++||+++|++ |++.-.+++.++||.++++.+.+..|.+.- .|||...+.+..+.... T Consensus 1 m~----evkGv~eilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~---------- 66 (133) T protein:vir:96 1 MR----LIYDTKKLERELEKRLSKRALMRITDRALTEAGEVVLEAIRTNLKYFRDTGAEYGEVKLSKPTW---------- 66 (133) T ss_pred Cc----cccCHHHHHHHHHHhcCHHHHHHHhhHHHHHHHHHHHHHHHHhhHHHhhccceeeeEEecCcee---------- Confidence 33 56899999999986 576666889999999999999999999865 47887777665543211 Q ss_pred eeecccccccccccccccCC-CCCcceehhcccCCc-----CCCCCc--chhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 78 HIRGVNPRTGNSDNTMKANN-PRNAFYWRFVELGTA-----NMPAHP--FVRPAYDTREEEAASVAIARMNQAI 143 (149) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~-~~~~~y~~f~E~GT~-----~~~a~P--Fl~pA~~~~~~~~~~~~~~~l~~~i 143 (149) ..|...+.+.+.. ......-|+.|||+. 
+..|+- -|..|+++++..+.+.++++|.+.| T Consensus 67 -------~~g~rtV~i~W~gp~~R~~iVHLNE~G~ytr~Gk~i~PrG~G~I~~al~~se~~y~~~vk~el~kll 133 (133) T protein:vir:96 67 -------ENGKRTIRVYWEGEKHRYSIVHLNEKGFYAKDGKFIRPKGMGAIDKALRASRDKFFKVYAEEVSKLL 133 (133) T ss_pred -------cCCceEEEEEeecCCCceeeEeeecccceecCCceeccchhhHHHHHHHhhhHHHHHHHHHHHHHhC Confidence 1111122222212 223445788899843 233443 5889999999988888888888777 No 117 >protein:vir:1988 Length: 156 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050635;genbank:gi:9633522;genbank:GeneID:2636282 Probab=98.11 E-value=2.1e-08 Score=62.67 Aligned_cols=130 Identities=18% Similarity=0.233 Sum_probs=72.8 Q ss_pred ccceeehh-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC-----CcCCCcccccceeccc----c----- Q lcl|NC_019767. 2 IETSLDFS-GLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA-----PVRTGKLKKNVVVVTQ----K----- 66 (149) Q Consensus 2 m~~~~~i~-Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~a-----P~~~g~l~~~i~~~~~----~----- 66 (149) |++.|+|+ .+++|.+.|.+|...... +..++.-++.++.....+- |. .|........++. + T Consensus 1 ms~~i~~~~d~~~l~~~L~~l~~~~~~---~~l~~~Ig~~l~~~~~~rf~~~~~Pd-~G~~W~pls~~t~~~r~~~~~~~ 76 (156) T protein:vir:19 1 MSLDMNVAVDVRRIQLALDELGTVTRD---RAIPRVMAAALLSSTEQAFERQADPD-TGKGWEAWSDSWLAWRQDHGFVP 76 (156) T ss_pred CeEEEEEeecHHHHHHHHHHHHhhhcc---HHHHHHHHHHHHHHHHHHHHhcCCCC-CCCCCcccChHHHHHhhccCCCC Confidence 88888887 678899999988654322 3456666666666666543 43 1322211111000 0 Q ss_pred ---cccCCccccceeeecccccccccccccccCCCCCcceehhcccCCc--------CCCCCcchhHHHHHHHHHHHHHH Q lcl|NC_019767. 67 ---SRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTA--------NMPAHPFVRPAYDTREEEAASVA 135 (149) Q Consensus 67 ---~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~--------~~~a~PFl~pA~~~~~~~~~~~~ 135 (149) -..+|.+...+... .+.....+ .++..|+..++||+. .+||+|||. 
-=+..++.+.+.+ T Consensus 77 ~~~L~~tg~L~~Si~~~-----~~~~~v~v----Gt~~~yA~vHqfG~~~~~~~~~~~iPaRpfLG-~s~~d~~~I~~~i 146 (156) T protein:vir:19 77 GSILTLHGDLARSITTD-----YGQDYALI----GSPKIYAAIHQWGGTPDMAPRPAGVPARPYMG-LDKTGEQEIFDAI 146 (156) T ss_pred CcchhhhHHHHHHhhhe-----ecCCEEEE----ecchhhhHHhhcCcccccCCCccccCCccccC-CCHHHHHHHHHHH Confidence 01123333222211 11222222 245689999999976 599999995 4455666666666 Q ss_pred HHHHHHHHHH Q lcl|NC_019767. 136 IARMNQAIDE 145 (149) Q Consensus 136 ~~~l~~~i~k 145 (149) .+.|.+.+++ T Consensus 147 ~~~l~~~~~~ 156 (156) T protein:vir:19 147 RKRVSAALRQ 156 (156) T ss_pred HHHHHHHhhC Confidence 6666666666 No 118 >protein:vir:4096 Length: 140 # NCBI annotation: Gp9 protein # Family: family:all:28682 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510990;swissprot:trembl:q8w600;genbank:gi:17488512;uniprot:Q8W600;genbank:GeneID:1260318 Probab=98.09 E-value=5.3e-08 Score=60.48 Aligned_cols=130 Identities=13% Similarity=0.206 Sum_probs=91.2 Q ss_pred Ccc-ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHH-HHHHHHHHHHhhCCcCCCc---ccccceeccccc--ccCCcc Q lcl|NC_019767. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRA-GAEVLKEEVIARAPVRTGK---LKKNVVVVTQKS--RRRGEI 73 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~-~a~~v~~~ak~~aP~~~g~---l~~~i~~~~~~~--~~~g~~ 73 (149) |.. .++++.++++|.++++++|.+. ++++.++|.. |+..+.+.+-...|++.+. +++.......+. ..-..+ T Consensus 1 m~~~~sld~s~~e~L~~~i~r~P~ks-E~~IN~~L~tkg~~~~~~~I~~~iPvS~~~k~~~RnK~HAK~s~pl~~~~~NL 79 (140) T protein:vir:40 1 MCAKWSLEFSDVERLSNLISQIPNKS-EAIINKTLETKAVPLVKLNIEKRINLSKNWKGQLLNKNHAQSSGPFNVKMGNL 79 (140) T ss_pred CCcceecchhhHHHHHHHHHhccchH-HHHHHHHHHhhhhHHHHhhhhhccCcCccchhhhccccchhhhhhhhhhhhhc Confidence 655 7999999999999999999996 6888888875 7778888899999998532 222211111111 011111 Q ss_pred ccceeeecccccccccccccccCCCCCcceehhccc--CCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 
74 SSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVEL--GTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~--GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) .-. -...+.+-|-.|..- ||++-.+|.||+..++...+.+++.+.+++.++|.+.++= T Consensus 80 gf~------------------i~~k~kf~YLvfPD~G~G~sn~~~q~FmerGl~~~t~~i~E~L~~~l~k~in~~Lgg 139 (140) T protein:vir:40 80 GFE------------------LLTKPKFNYLIFPDQGIGKHNKTKQDFMQLGVEESSQEIVEMLEQAVFKEINDTLGG 139 (140) T ss_pred cee------------------EeecCcccccccccccCCCCCcchHHHHHhccccchhHHHHHHHHHHHHHHHHhhcC Confidence 110 011233457777764 5777778899999999999999998888888888888877 No 119 >protein:vir:77650 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:YP_022741;genbank:gi:47835022;genbank:GeneID:2821447 Probab=98.03 E-value=5.9e-09 Score=65.68 Aligned_cols=103 Identities=17% Similarity=0.126 Sum_probs=52.8 Q ss_pred ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeeccc Q lcl|NC_019767. 4 TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVN 83 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~ 83 (149) |++.-.||+++.+.|+... .+-..|.+++.... .+ ....+... . T Consensus 1 m~~~r~~l~~~~~~l~~~~----------------------v~VGi~~~a~y~d~-----------~~-~~~~~~~~--~ 44 (155) T protein:vir:77 1 MSVTRRGLTLPKDRYRSMS----------------------VKAGVLAGATYPDE-----------SG-KKLADGSI--L 44 (155) T ss_pred CcchHHHHHHHHHHHhcCc----------------------eEEeecCCCCCccc-----------cc-hhhhhhhh--c Confidence 6666677777766555311 01111111110000 00 00000000 0 Q ss_pred ccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH--HHhcC Q lcl|NC_019767. 84 PRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAID--EVLSK 149 (149) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~--k~~~k 149 (149) .. ....+-+.+.++.++||||.+.||||||+|++++++++..+.+...+...++ +++.. 
T Consensus 45 ~~-------~~~~G~pva~ia~~~e~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~ 105 (155) T protein:vir:77 45 KK-------DPRAGLPVAMIAMALNYGTSKLPARPFMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQ 105 (155) T ss_pred cc-------cccccccHhhhhhhhhcCCCCCCCCchhhHHHHHHHHHHHHHHHHHHHccCcHHHHHHH Confidence 00 0111223466888999999999999999999999999888777765543321 11111 No 120 >protein:vir:5257 Length: 148 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852762;genbank:gi:31544037;uniprot:Q7Y5T8;genbank:GeneID:2753554 Probab=97.98 E-value=1.4e-08 Score=63.70 Aligned_cols=94 Identities=18% Similarity=0.381 Sum_probs=54.5 Q ss_pred ccceee--hhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 2 IETSLD--FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 2 m~~~~~--i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |+++++ ..|+++|++.|++|.... + +-..|.+... T Consensus 1 M~~~~k~~~~~~~~l~~~l~~l~~~~---v----------------~VGi~~~~~~------------------------ 37 (148) T protein:vir:52 1 MAVTVTANFSAAKQLIEQMKSLKEKA---V----------------YVGFPAEFDE------------------------ 37 (148) T ss_pred CccccccccHHHHHHHHHHHHhhCCe---E----------------EEEeecCcCC------------------------ Confidence 665444 457888888887774210 0 0001100000 Q ss_pred ecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH--HHhcC Q lcl|NC_019767. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAID--EVLSK 149 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~--k~~~k 149 (149) ......+-+.+.++.++|||+.+.||||||+|+++.++++..+.+...+...++ +++.. 
T Consensus 38 -----------~~~~~~g~~vA~ia~~~E~G~~~IP~Rpflr~t~~~~~~~~~~~~~~~~~~~~~~~~~L~~ 98 (148) T protein:vir:52 38 -----------KVKGSENFNLASLAAVLEFGNEHIPARPFLRQTLEENQEKYTALFIQWFDQGVPAAQIYER 98 (148) T ss_pred -----------CCCCCCCCCHHHHHHHHhcCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH Confidence 000001124467899999999999999999999999999887777655543221 11111 No 121 >protein:vir:95157 Length: 144 # NCBI annotation: hypothetical protein ORF019 # Family: family:all:448 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293426;genbank:gi:148912847;genbank:GeneID:5228232 Probab=97.97 E-value=2.7e-08 Score=62.06 Aligned_cols=130 Identities=18% Similarity=0.152 Sum_probs=72.8 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCcccccee-- Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVH-- 78 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~-- 78 (149) |++..+ ++...++.+.+.+. ..+...+++.|..+...+...+|+|||.++.|-.++..... .+....... T Consensus 1 MA~~~~------~f~~~i~~~~~~ve-~~~~~~~r~~a~~v~~~vv~~sPVDTGrfRanw~vs~~~p~-~~~~~~~~~~~ 72 (144) T protein:vir:95 1 MAKSLL------DLADRLEKKAKAID-EAASQNAVDTALAIVGDLAYKTPVDTSQALSNWIVTLESPS-GQQIKPHFPGS 72 (144) T ss_pred Cchhhh------hhhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhCCccchhhccccceeccccc-ccccccccccc Confidence 655222 23455566666653 44567788888888899999999999999988776644211 000000000 Q ss_pred eecccccc------------cccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 79 IRGVNPRT------------GNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAID 144 (149) Q Consensus 79 ~~~~~~~~------------~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~ 144 (149) ........ ..........-..+..|+..+|||.|.|+|..|++.++.+-..-+ +.++-. 
+ T Consensus 73 ~~~t~d~sg~~tl~~~~~vi~~~~~g~~iyi~NnlpYA~~LEyG~S~QAP~G~vr~~~q~~~~~v-~~~~~~-----~ 144 (144) T protein:vir:95 73 QGSTQRASAAETLNSAKLVLRNKKPGQAIFITNNLPYIRRLNDGYSAQAPAGFVERAVLIGRKMR-KKFKIK-----D 144 (144) T ss_pred ccccCCCchhHHHHHHHHHHhhcCccceEEEeeCchhhhhhhccccCCCcchHHHHHHHHHHHHH-HhhccC-----C Confidence 00000000 000000111122456899999999999999999999997554433 222111 0 No 122 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=97.90 E-value=1.2e-07 Score=58.55 Aligned_cols=133 Identities=16% Similarity=0.159 Sum_probs=71.7 Q ss_pred ccceeehhh-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC-CcCCCcccccceecc----c--------cc Q lcl|NC_019767. 2 IETSLDFSG-LNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA-PVRTGKLKKNVVVVT----Q--------KS 67 (149) Q Consensus 2 m~~~~~i~G-l~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~a-P~~~g~l~~~i~~~~----~--------~~ 67 (149) |+..|+|++ .++|.+.|++|.....+ ....++.-++.+......+- |. |.........+ . .- T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~~~~--~~~l~~~ig~~l~~~~~~rF~p~--G~~W~plsp~t~~~r~k~g~~~~~~L 76 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAAVTD--TLPLMRGIAAELLAETEFAFMDE--GPGWPQLSPVTVAARAAKGRGAHPIL 76 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHHhhc--CCCCCCCCccchHHHHhccCCCCCcc Confidence 554444442 46799999999877643 35678888888888777765 32 22211111000 0 01 Q ss_pred ccCCccccceeeecccccccccccccccCCCCCcceehhcccCCc-------CCCCCcchh-HHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 68 RRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTA-------NMPAHPFVR-PAYDTREEEAASVAIARM 139 (149) Q Consensus 68 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~-------~~~a~PFl~-pA~~~~~~~~~~~~~~~l 139 (149) ..+|.+...+... .+.....+ .++..|+.+++||+. .+||+|||. 
..-++-.+++.+.|.+.+ T Consensus 77 ~~tG~L~~Si~~~-----~~~~~v~v----Gtn~~YA~iHqfGg~~~~~~~~~iPARPfLG~s~~~e~~~ei~~~I~~~i 147 (155) T protein:vir:10 77 QVTNALARSITTR-----ADRDQAQI----GSNLSYAAIQQLGGQAGRGRKVTIPARPYLPVLRNGQLKPSARDAVLDVL 147 (155) T ss_pred ccchhhhhhhhce-----ecCCEEEE----ecCcchhhhhhcccccCCCCccccCCccccCCCccccchHHHHHHHHHHH Confidence 1133343333222 11112222 245679999999974 699999997 222333445555555555 Q ss_pred HHHHHHHh Q lcl|NC_019767. 140 NQAIDEVL 147 (149) Q Consensus 140 ~~~i~k~~ 147 (149) .+.|.+-- T Consensus 148 ~~~l~~~r 155 (155) T protein:vir:10 148 LAALSQGR 155 (155) T ss_pred HHHHhhcC Confidence 55444433 No 123 >protein:vir:79225 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469157;genbank:gi:157835000;genbank:GeneID:5648806 Probab=97.84 E-value=1.2e-07 Score=58.48 Aligned_cols=133 Identities=14% Similarity=0.141 Sum_probs=69.9 Q ss_pred Ccc-ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC-CcCCCcccccceecc----c--------c Q lcl|NC_019767. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA-PVRTGKLKKNVVVVT----Q--------K 66 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~a-P~~~g~l~~~i~~~~----~--------~ 66 (149) |.. ++|++.+ +++.+.|.+|...+.+ .+..++.-++.++...+.+- |. |.........+ . . T Consensus 1 M~~~i~i~~d~-~~~~~~L~~l~~~~~d--~~~l~~~ig~~l~~~~~~rF~~e--G~~W~pls~~t~~~r~~~g~~~~~i 75 (155) T protein:vir:79 1 MTTRIDVELDD-QEVRQRLAVLMRSVTD--TLPVMRGIAAELLAETEFAFMDE--GPGWPQLSPATVAAREAKGRGPHPI 75 (155) T ss_pred CceEEEEEech-HHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHhhcc--CCCCCCCCHHHHHHHhccCCCCCCc Confidence 333 4555554 6899999999877653 35678888888888888775 32 32221111000 0 0 Q ss_pred cccCCccccceeeecccccccccccccccCCCCCcceehhcccCCc-------CCCCCcchhHHHH-HHHHHHHHHHHHH Q lcl|NC_019767. 
67 SRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTA-------NMPAHPFVRPAYD-TREEEAASVAIAR 138 (149) Q Consensus 67 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~-------~~~a~PFl~pA~~-~~~~~~~~~~~~~ 138 (149) -..+|.+...+... .+.....+ .++..|+..++||+. .+||+|||.=.-+ ....++.+.|.+. T Consensus 76 L~~tG~L~~Si~~~-----~~~~~v~v----Gt~~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~~~l~~~~~~~I~~~ 146 (155) T protein:vir:79 76 LQVTNALARSVTTW-----ADRNEAGI----GSNLVYAAIHQFGGDAGRGHQVEIPARRYLPFDENGQLAAGARQSILEV 146 (155) T ss_pred cccchhhhhhhhce-----ecCCEEEE----ecCchhhhhhhcccccCCCCccccCCccccCCCCccccchHHHHHHHHH Confidence 11123333332221 11111111 345679999999976 7999999964332 1223333333333 Q ss_pred HHHHHHHHh Q lcl|NC_019767. 139 MNQAIDEVL 147 (149) Q Consensus 139 l~~~i~k~~ 147 (149) +.+.|.+.- T Consensus 147 i~~~l~r~r 155 (155) T protein:vir:79 147 VLTALSRNR 155 (155) T ss_pred HHHHHHhcC Confidence 333333322 No 124 >protein:vir:106728 Length: 155 # NCBI annotation: gp07 # Family: family:all:503 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944315;genbank:gi:38638614;genbank:GeneID:2657357 Probab=97.81 E-value=2.2e-08 Score=62.59 Aligned_cols=103 Identities=17% Similarity=0.129 Sum_probs=49.4 Q ss_pred ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeeccc Q lcl|NC_019767. 4 TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVN 83 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~ 83 (149) |++.-+||+.+.+.|+... .+-..|.+.+.. .+. ...... T Consensus 1 m~v~~k~L~~~~~~l~~~~----------------------v~VGi~~~a~y~-------------d~~-----~~~~~~ 40 (155) T protein:vir:10 1 MSVTRRGLTLPKDRYRSMS----------------------VKAGVLAGATYP-------------DES-----GKKLAD 40 (155) T ss_pred CcchHHHHHHHHHHHhCCe----------------------eEEeecCCCCCc-------------ccc-----chhhhh Confidence 5666666555544332110 000111111000 000 000000 Q ss_pred ccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHH--HHHhcC Q lcl|NC_019767. 
84 PRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAI--DEVLSK 149 (149) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i--~k~~~k 149 (149) +.........+-+.+.++.+.||||.+.||||||+|++++++++..+.+...+...+ ++++.. T Consensus 41 ---~~~~~~~~~~g~~va~ia~~~E~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~ 105 (155) T protein:vir:10 41 ---GTILTKDPRAGLPVAMIAMALNYGTSKLPARPFMEKTIADRSAEWIKGLTVMMTMGYDAEVAMGQ 105 (155) T ss_pred ---hhhcccccccCCcHHHHHHHHhcCCCCCCCcchhHHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH Confidence 000000001122346678899999999999999999999999988777665554322 111111 No 125 >protein:vir:78607 Length: 155 # NCBI annotation: BcepNY3gp06 # Family: family:all:503 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294843;genbank:gi:149882906;genbank:GeneID:5291078 Probab=97.81 E-value=2.2e-08 Score=62.55 Aligned_cols=103 Identities=17% Similarity=0.130 Sum_probs=49.1 Q ss_pred ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeeccc Q lcl|NC_019767. 4 TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVN 83 (149) Q Consensus 4 ~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~ 83 (149) |++.-+||+.+.+.|+... .+-..|.+.+.. .+. ...... T Consensus 1 m~v~~k~L~~~~~~l~~~~----------------------v~VGi~~~a~y~-------------d~~-----~~~~~~ 40 (155) T protein:vir:78 1 MSVTRRGLTLPKDRYRSMS----------------------VKAGVLAGATYP-------------DES-----GKKLAD 40 (155) T ss_pred CcchHHHHHHHHHHHhCCe----------------------eEEeecCCCCCC-------------ccc-----chhhhh Confidence 5666666555544332110 000111111000 000 000000 Q ss_pred ccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHH--HHHhcC Q lcl|NC_019767. 84 PRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAI--DEVLSK 149 (149) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i--~k~~~k 149 (149) +.........+-+.+.++.+.||||.+.||||||+|++++++++..+.+...+...+ ++++.. 
T Consensus 41 ---~~~~~~~~~~g~~va~ia~~~E~G~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~~~~~~~~L~~ 105 (155) T protein:vir:78 41 ---GTILTKDPRAGLPVAMIAMALNYGTSKLPARPFMEKTITDRSAEWIKGLTVMMTMGYDAEVAMGQ 105 (155) T ss_pred ---hhhcccccccCCcHHHHHHhhhcCCCCCCCcchhhHHHHHHHHHHHHHHHHHHHcCCCHHHHHHH Confidence 000000001122346678899999999999999999999999988777665554321 111111 No 126 >protein:vir:107851 Length: 175 # NCBI annotation: gp31 # Family: family:all:274 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024704;genbank:gi:48696941;genbank:GeneID:2845939 Probab=97.75 E-value=2.9e-07 Score=56.39 Aligned_cols=137 Identities=15% Similarity=0.197 Sum_probs=75.6 Q ss_pred Ccc-ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh-----CCcCCCcccccceec----------- Q lcl|NC_019767. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVVV----------- 63 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~~----------- 63 (149) |.. ++|+|. .++|.+.|++|.....+ .+..++.-++.++.....+ .|..+.-....+..+ T Consensus 1 Ms~~i~i~~~-~~~l~~~L~~l~~~~~d--~~~l~~~Ig~~l~~~t~~rF~~e~~Pdw~p~~p~t~~~r~~~g~~~~k~~ 77 (175) T protein:vir:10 1 MSDFVNFQID-DSALRTRLLQLEQAGHQ--KAGAMRKIAQALVLVTEDNFAAQGRPRWQALSEATIHMRVGGKKAYKKNG 77 (175) T ss_pred CceeEEEEec-HHHHHHHHHHHHHHhcc--HHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhhhcccccchhhh Confidence 433 355556 37889999988777643 2456666677777666654 353211110000000 Q ss_pred ------------ccccccCCccccceeeecccccccccccccccCCCCCcceehhcccCCc-------CCCCCcchhHHH Q lcl|NC_019767. 64 ------------TQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTA-------NMPAHPFVRPAY 124 (149) Q Consensus 64 ------------~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~-------~~~a~PFl~pA~ 124 (149) ...-..+|.+...+.... +.....+ .++..|+.++.||+. 
++||+|||.=.- T Consensus 78 ~~~~~~~~~~~~~~~L~~tG~L~~Si~~~~-----~~~~v~v----Gtn~~YAaiHqfGg~~~~~~~v~iPaRpfLG~s~ 148 (175) T protein:vir:10 78 ELTAAASRRKAGLMILQDSGQMAASVSTDH-----DDNSAVI----GSNKEYAAIHQFGGQAGRGLKVTIPARPWLPVTA 148 (175) T ss_pred hhhhhhhhhccCCCcceechhhhhhhheee-----cCCEEEE----ecChhhhhhhhcccccCCCCccccCCccccCCCc Confidence 000001122222222111 1111111 245679999999987 899999998644 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 125 DT-REEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 125 ~~-~~~~~~~~~~~~l~~~i~k~~~k 149 (149) +. ...++++.|.+.+.+.|.+++++ T Consensus 149 ~d~~~~e~~~~Il~~~~~~l~~~~~~ 174 (175) T protein:vir:10 149 DGELQPEAVEPVLNTILRHLMDAANR 174 (175) T ss_pred ccccchHHHHHHHHHHHHHHHHHhcc Confidence 32 23466777888888888888888 No 127 >protein:vir:97190 Length: 148 # NCBI annotation: hypothetical protein ORF030 # Family: family:all:448 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294538;genbank:gi:149408259;genbank:GeneID:5237055 Probab=97.71 E-value=1.2e-07 Score=58.62 Aligned_cols=135 Identities=16% Similarity=0.145 Sum_probs=68.0 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccC---------- Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRR---------- 70 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~---------- 70 (149) |.++. ++...+..+.+.+.. .+...++..+..+..++....|+|+|.++.|-.++...-... T Consensus 1 m~~~~-------sFa~~i~~~~~~ve~-~~~~~~r~~a~~i~~~vv~~sPVdTGrfRanw~vs~~~p~~~~~~~~dp~~~ 72 (148) T protein:vir:97 1 MPSLS-------EFSRRITLRGRKVAE-GADALTRKVALAADQAVVSGTPVDTGRARSNWIAAIGSAPSSVIDAYSPGEA 72 (148) T ss_pred CCccc-------hhcccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhCCCcchhhhhhhheeecccccccccccCCCCC Confidence 55541 223333444444422 234456666777777888899999999998877653321110 Q ss_pred Ccccccee---eecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019767. 
71 GEISSGVH---IRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVL 147 (149) Q Consensus 71 g~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~ 147 (149) |.....-. +.................-..+..|+.++|||.|.|+|..|++.++..-..-+.+ .+.+++.= T Consensus 73 G~~~~~~~~~~i~~~~~vi~~~k~g~~iyi~NnlpYA~~LEyG~S~QAP~G~v~~t~~~~~~~v~~------~~~~~~~~ 146 (148) T protein:vir:97 73 GSTEAANTQAAIDQAESVIRGYNYGEEIHITNNLPYIQRLNDGYSAQAPANFVEQAVLEAVQVVQF------GRVVDGDP 146 (148) T ss_pred CcccccchhHHHHHHHHHhhccCCCceEEEeecchhhhHhhccccCCCcchHHHHHHHHHHHHHHh------hhhhcCCC Confidence 10000000 0000000000000011122345789999999999999999999999654432211 11222222 Q ss_pred cC Q lcl|NC_019767. 148 SK 149 (149) Q Consensus 148 ~k 149 (149) +- T Consensus 147 ~~ 148 (148) T protein:vir:97 147 GS 148 (148) T ss_pred CC Confidence 22 No 128 >protein:vir:80425 Length: 134 # NCBI annotation: BcepGomrgp15 # Family: family:all:448 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210235;genbank:gi:146329927;genbank:GeneID:5123534 Probab=97.70 E-value=7.4e-08 Score=59.68 Aligned_cols=125 Identities=18% Similarity=0.192 Sum_probs=65.8 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccC--------Ccc Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRR--------GEI 73 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~--------g~~ 73 (149) |++..+|. ++..+++.- +...++..+..+..++...+|+|+|.++.|-.++...-... |.. 
T Consensus 1 msF~~~i~---~~~~~ve~~--------~~~~~r~~a~~~~~~vv~~sPVdTGr~Ranw~vs~~~~~~~~~~~~d~~g~~ 69 (134) T protein:vir:80 1 MSYTDRFN---VIAKGIEDN--------VDNLVKNVALAIGSNVIADTPILTGQARRNWQTELNQMPESVLDIPESPSEG 69 (134) T ss_pred CCcccCHH---HHHHHHHHH--------HHHHHHHHHHHHHHHHHHhCCCcchhhhcccceeecCcccccccCcCCCCcc Confidence 66655533 444433322 23355556677777788899999999998876653321110 000 Q ss_pred -ccceeeecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 74 -SSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 74 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~ 141 (149) ...+. ................-..+..|+.++|||.|.|+|..|++-+...-.. +++.++ ++-+ T Consensus 70 ~~~~~~--~~~~vi~~~k~g~~iyi~Nn~pYA~~LEyG~S~QAP~G~v~~t~~~~~~-~v~~~~-~~~~ 134 (134) T protein:vir:80 70 MDEALQ--VLQQTVGQYKAGDTVHITNNAPYIKELNSGSSQQAPANFVETSIMRATR-LIRNVK-VVPQ 134 (134) T ss_pred chhhHH--HHHHHHhhccCcceEEEeeCchhhhhhhccccCCCcchHHHHHHHHHHH-HHHhhc-cCCC Confidence 00000 0000000000001111224578999999999999999999988865433 222222 1111 No 129 >protein:vir:96105 Length: 193 # NCBI annotation: hypothetical protein ORF028 # Family: family:all:503 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294445;genbank:gi:149408342;genbank:GeneID:5237224 Probab=97.61 E-value=1.5e-07 Score=58.02 Aligned_cols=135 Identities=12% Similarity=-0.015 Sum_probs=59.9 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH-HHHHHhhCCcCCCcccccceecccccccCCccccceee- Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVL-KEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI- 79 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v-~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~- 79 (149) |+++.+..++++|++.|++|.... + .-.+-..+..- .+......|+..- ..+. ..+...........+.. 
T Consensus 1 m~~~~~~~~~~~~~~~l~~l~~~~---v-~vGi~~~~~~~~~~~~~~G~~va~i---Aai~-EfG~~I~~~~~~~~~~~~ 72 (193) T protein:vir:96 1 MSLRRDSELIAAHLQMLRAMRGRS---V-SAGWYSTARYPDKAGGSVGIQVARI---ARLN-EYGGTIDHPGGTRYIRDA 72 (193) T ss_pred CeeccchHHHHHHHHHHHHhcCCe---E-EEEEcCCCCCCCcccccccchHHHH---HhHH-HcCCccccCccceeeeec Confidence 999999999999999999986431 1 11111111000 0000000110000 0000 00000000000000000 Q ss_pred ecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH------HHhcC Q lcl|NC_019767. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAID------EVLSK 149 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~------k~~~k 149 (149) .......+.... ..+ -..-+.|.-..|.++||||||+++++.++++..+.+.+.+...+. +++.+ T Consensus 73 ~~~g~~~~~~~~--k~~---~~~~~~~~~~~~v~IPaRPFlr~t~~~~~~~~~~~~~~~~~~~~~g~~~~~~~l~~ 143 (193) T protein:vir:96 73 IVRGRFVGVRFV--RND---FPGETEVTKPHRITIPARPFMRYAWNLFSADRAAIQNRIAMRLARGQITPDQALAQ 143 (193) T ss_pred ccccccccccee--ccC---cceeeEeecceeccCCCcchhhhhHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHH Confidence 000000111110 100 012244555568899999999999999999888877776665443 22222 No 130 >protein:vir:99196 Length: 155 # NCBI annotation: putative virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950453;genbank:gi:119953654;genbank:GeneID:4643056 Probab=97.60 E-value=6.3e-07 Score=54.57 Aligned_cols=132 Identities=14% Similarity=0.151 Sum_probs=67.5 Q ss_pred Ccc-ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC-CcCCCcccccce-----eccc-------c Q lcl|NC_019767. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA-PVRTGKLKKNVV-----VVTQ-------K 66 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~a-P~~~g~l~~~i~-----~~~~-------~ 66 (149) |.. ++|++.. ++|.+.|.+|...+.+ .+..++.-++.++.....+- |. |....... .... . 
T Consensus 1 Ms~~i~i~~d~-~~~~~~L~~l~~~~~d--~~~l~~~ig~~l~~~~~~rF~pd--G~~W~pls~~t~~~r~~~g~~~~~i 75 (155) T protein:vir:99 1 MTTRIDVELDD-QEVRQRLALLMRSVTD--TLPVMRGIAAELLAETEFAFMDE--GPGWPQLSPVTVAAREAKGRGPHPI 75 (155) T ss_pred CceEEEEEech-HHHHHHHHHHHHHhhh--HHHHHHHHHHHHHHHHHHHhhcc--CCCCCCCChHHHHHHhccCCCCCCc Confidence 333 4555554 6889999999877653 36777878888888777764 33 22211110 0000 0 Q ss_pred cccCCccccceeeecccccccccccccccCCCCCcceehhcccCCc-------CCCCCcchhHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 67 SRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTA-------NMPAHPFVRPAYDTREEEAASVAIARM 139 (149) Q Consensus 67 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~-------~~~a~PFl~pA~~~~~~~~~~~~~~~l 139 (149) -..+|.+...+.+. .+.....+ .++..|+..++||+. .+|++|||.=.-+ +++.....+++ T Consensus 76 L~~tg~L~~Si~~~-----~~~~~v~v----Gtn~~YA~iHqfGg~~~~~~~v~iPaRpfLG~s~~---~~l~~e~~~~I 143 (155) T protein:vir:99 76 LQVTNALARSVTTW-----ADRNEAGI----GSNLVYAAIHQFGGDAGRGHQVEIPARRYLPFDEN---GQLAAGARQSI 143 (155) T ss_pred chhchhhhhhhhce-----ecCCEEEE----ecCccchhhhhcccccCCCCccccCCccccCCCCc---cccchHHHHHH Confidence 11223333333221 11111212 245679999999976 6899999964322 11111222223 Q ss_pred HHHHHHHhcC Q lcl|NC_019767. 140 NQAIDEVLSK 149 (149) Q Consensus 140 ~~~i~k~~~k 149 (149) ...|.+.++| T Consensus 144 ~~~i~~~l~~ 153 (155) T protein:vir:99 144 LEIVLTALSR 153 (155) T ss_pred HHHHHHHHhc Confidence 3333333333 No 131 >protein:vir:96774 Length: 152 # NCBI annotation: hypothetical phage protein # Family: family:all:448 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224253;genbank:gi:62362388;genbank:GeneID:3345713 Probab=97.59 E-value=2.6e-07 Score=56.69 Aligned_cols=126 Identities=17% Similarity=0.100 Sum_probs=69.5 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCc--------------CCCcccccceecccc Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPV--------------RTGKLKKNVVVVTQK 66 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~--------------~~g~l~~~i~~~~~~ 66 (149) =|++..+| ..+.+.+.. -+...+++.+..+...+...+|+ |+|.++.|-.++... T Consensus 10 ~msFaa~i----------~~~~~~~e~-~~~~~~R~~~~~i~~~vv~~sPVg~~~~~~~~a~~~ydtGrfRanw~vS~~~ 78 (152) T protein:vir:96 10 PMSWSKSL----------KNIIVKNEN-LTEKQLRAGLFDAANTVILGSPVGAPELWQQPAPNYYRAGSYRSNHRVSISK 78 (152) T ss_pred cccccccH----------HHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhccccccccccccccccchhhhhhhheeeecC Confidence 33344333 233333322 23445666777788888889999 999999887776432 Q ss_pred cccCCccccceeeecc---cccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 67 SRRRGEISSGVHIRGV---NPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIAR 138 (149) Q Consensus 67 ~~~~g~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~ 138 (149) .. .+...+...-... ..............-..+..|+..+|||+|.|+|..|++.++..-.+-+.++++.+ T Consensus 79 p~-~~~~~~~~~~~t~~~~~~~i~~~~~g~~iyi~NnlPYA~~LEyG~S~QAP~G~vr~t~~~~~~~v~ea~~~~ 152 (152) T protein:vir:96 79 IT-SFEKGISSQSSIMMDLQSDIAKFKIGETLFMTNPLPYATSIEYGHSSQAPNGVYRPAVRRLVKFLNTELKAK 152 (152) T ss_pred CC-cccccCCCCCchHHHHHHHHhhccccceEEEeeCchhhhHhhccccCCCCchHHHHHHHHHHHHHHHHhccC Confidence 21 1111110000000 00000000001111234578999999999999999999999977666555544444 No 132 >protein:vir:107757 Length: 189 # NCBI annotation: gp20 # Family: family:all:503 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024868;genbank:gi:48697510;genbank:GeneID:2948378 Probab=97.58 E-value=5.5e-08 Score=60.39 Aligned_cols=92 Identities=14% Similarity=0.185 Sum_probs=48.4 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeecccccccccccccccCCCCC Q lcl|NC_019767. 21 LSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRN 100 (149) Q Consensus 21 l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 100 (149) |+..+. 
..-. +...+.+.++.+. ...+.+. + ..... ...+.+. T Consensus 1 M~~~i~-----~~~~-~~~~L~~~lk~l~-------~k~V~VG-------------i--~~~~~---------y~dG~~v 43 (189) T protein:vir:10 1 MGRVIR-----KQGP-ARVKLNAFIKGMN-------DYSVRIG-------------W--FSTAK---------YPDGTPT 43 (189) T ss_pred Ccceec-----cCcH-HHHHHHHHHHHhh-------CCeEEEE-------------e--cCCCC---------CCCcccH Confidence 333321 1111 1122333333321 0111111 0 00000 0012235 Q ss_pred cceehhcccCC--cCCCCCcchhHHHHHHHHHHHHHHHHHHHHHH------HHHhcC Q lcl|NC_019767. 101 AFYWRFVELGT--ANMPAHPFVRPAYDTREEEAASVAIARMNQAI------DEVLSK 149 (149) Q Consensus 101 ~~y~~f~E~GT--~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i------~k~~~k 149 (149) +.++.++|||+ ..+||||||+|++++++++..+.+...+...| ++++.. T Consensus 44 A~Ia~~~E~G~p~~~IP~RPFlr~t~~~~~~~~~~~l~~~~~~vl~G~~~~~~~L~~ 100 (189) T protein:vir:10 44 AYVASIHEFGAPSRGIPARSFIRPTIAAQQAAWSQQMRFYAKQIVVGQMNVEQALEG 100 (189) T ss_pred HHHHHHHHhcCcCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHHHHH Confidence 78899999998 45899999999999999998888877777633 344444 No 133 >protein:vir:102338 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:26573 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529563;genbank:gi:90592648;genbank:GeneID:3974470 Probab=97.50 E-value=1.1e-06 Score=53.34 Aligned_cols=94 Identities=18% Similarity=0.158 Sum_probs=62.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhCCcC---CCcccccceecccccccCCccccceeeecccccccccccccccCCCCCcc Q lcl|NC_019767. 26 NNKVLRDATRAGAEVLKEEVIARAPVR---TGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAF 102 (149) Q Consensus 26 ~~k~~~~Al~~~a~~v~~~ak~~aP~~---~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 102 (149) ..+.+.+++++.|..+...++.++|+. +|+|++|..+..... .+ + .. ..+.. 
T Consensus 1 l~~~~~~~~~~~a~~l~~~vk~rTPv~~~d~G~LR~sW~~g~v~k--~~---------------~-----~v---~N~~e 55 (116) T protein:vir:10 1 MSKNLRRAKNNIGNKLLRKVKPKTPVAKIDGGTARKSWKYKELNL--FD---------------G-----VV---SNNVE 55 (116) T ss_pred CchHHHHHHHHHHHHHHHHHHhhCCCCcCCCcccccCceeeeeec--cC---------------c-----ee---ecCCc Confidence 245667788889999999999999984 599988865432110 00 0 01 14467 Q ss_pred eehhcccCCcC-------------------CCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 103 YWRFVELGTAN-------------------MPAHPFVRPAYDTREEEAASVAIARMNQAID 144 (149) Q Consensus 103 y~~f~E~GT~~-------------------~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~ 144 (149) |++|+|||..- -|.+.||..+..+.+.++-..+.+.|.+-++ T Consensus 56 YA~~VE~GHRq~~g~g~~~~~~gkrlk~~~V~G~fml~~s~~e~~~~~~~~~~~~~~~~l~ 116 (116) T protein:vir:10 56 YIHHLEYGHRTRQGTGTSENYRPKPNGISFVPGVFMLARSVDEMSSIIDDELNQIIIDFWN 116 (116) T ss_pred ccccccCCceeeCCcceecccccccccCCccCceehHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 99999999543 3456678887777666555555555544444 No 134 >protein:vir:96288 Length: 100 # NCBI annotation: ORF049 # Family: family:all:180 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240315;genbank:gi:66396010;genbank:GeneID:5133365 Probab=97.37 E-value=4.8e-07 Score=55.20 Aligned_cols=88 Identities=17% Similarity=0.186 Sum_probs=62.9 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |++++. |.++|.+.|+..++++.+.+ ++.+.+.|+.|...|..+||+|+|.|++||.......+.++.+..+-.+. 
T Consensus 13 makvky---G~~dmvk~~~~f~~~i~~~v-k~~IakTa~~I~~~Avs~APVD~G~Lk~SI~~dyk~GGltavI~vGAeYA 88 (100) T protein:vir:96 13 MAKVKY---GADSMVVELDKFDKKIEEWV-KKGIAKTTTKIYNTAVALAPVDLGFLEESIDFKYFDGGLSSVISVGADYA 88 (100) T ss_pred hhhhee---chHHHHHHHhcchHHHHHHH-HHHHHHHHHHHHhhHHhhccccccccceeeeeeeecCCeeEEEecchhHH Confidence 776654 89999999999999997665 77899999999999999999999999999998877666665554332221 Q ss_pred cccccccccccccc Q lcl|NC_019767. 81 GVNPRTGNSDNTMK 94 (149) Q Consensus 81 ~~~~~~~~~~~~~~ 94 (149) - .+-..-...+. T Consensus 89 I--krmsqllvtvi 100 (100) T protein:vir:96 89 I--KRMSQLLVTVI 100 (100) T ss_pred H--HHHHHHHhhcC Confidence 1 00000000011 No 135 >protein:vir:7449 Length: 123 # NCBI annotation: gp26 # Family: family:all:2713 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818564;genbank:gi:29567001;genbank:GeneID:1260238 Probab=97.36 E-value=5.2e-06 Score=49.54 Aligned_cols=121 Identities=15% Similarity=0.241 Sum_probs=78.6 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCC--cCCCcccccceecccccccCCcccccee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~g~~~~~~~ 78 (149) |+.++|++. +++|+++++.+..+.. ....--...+|..+..+||.+|| ..||+.+..|...... .|.-...|+ T Consensus 1 ~~~~~f~~d-~~~l~~~i~~~~~k~~-~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARqgl~~~~~~---~g~~~~~Iy 75 (123) T protein:vir:74 1 MAKVTFEYD-AQELRTNIRNLDRRME-SAVDALMDYEAAYATGQLKMRAPWTDRTGAARSGLLAVANK---LGPGSHELI 75 (123) T ss_pred CceeEEEec-HHHHHHHHHhhHHHHH-HHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhcccccc---CCCceEEEE Confidence 988999998 8899999998877753 33333344578899999999999 3567666555321110 111111111 Q ss_pred eecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019767. 
79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVL 147 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~ 147 (149) +. ....|+-|+|.++...++ -|.|+++.-.+++++-+...+. +|+++- T Consensus 76 ls------------------h~veYG~~LEla~~~kya--Ii~Ptv~~~~~~im~g~~~ll~-~l~~~~ 123 (123) T protein:vir:74 76 MS------------------YSVHYGIWLEIANSGQYA--VIGPFLPVMGRKLMHDLEHLID-RLERAQ 123 (123) T ss_pred Ee------------------cCeeecceeeecCCCCce--eecchHHHHhHHHHHHHHHHHH-HhhccC Confidence 11 234699999988775554 6888888888888877766554 344444 No 136 >protein:vir:94069 Length: 168 # NCBI annotation: putative RNA polymerase # Family: family:all:503 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453622;genbank:gi:84662658;genbank:GeneID:5142579 Probab=97.30 E-value=5.7e-07 Score=54.83 Aligned_cols=104 Identities=13% Similarity=0.122 Sum_probs=44.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhCCcC--CCcccccceecccccccCCccccceeeecccccccccccccccCCCCCcc Q lcl|NC_019767. 25 ENNKVLRDATRAGAEVLKEEVIARAPVR--TGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAF 102 (149) Q Consensus 25 ~~~k~~~~Al~~~a~~v~~~ak~~aP~~--~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 102 (149) +. -+-++ |.+.....++.+...+ =|.+.. .....|....... +. .......+-+.+. T Consensus 1 ~~-~~~~~----g~~~~~~~~~~l~~~~v~vG~l~~-------a~yp~G~~~~~~~--------~~-~~~~~~~g~~va~ 59 (168) T protein:vir:94 1 MT-TIARK----GVKMPPHLEAQFQSGEVKAGVLSG-------STYPQMTYTDQRT--------GK-QIEDARGGMPVAV 59 (168) T ss_pred Cc-cccch----hhhhhHHHHHhhhccceeeecccc-------Ccccccccchhhc--------cc-ccccccccccHHH Confidence 11 01111 2222222222221110 011110 0000000000000 00 0000011123467 Q ss_pred eehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHH--HHHhcC Q lcl|NC_019767. 103 YWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAI--DEVLSK 149 (149) Q Consensus 103 y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i--~k~~~k 149 (149) ++.++|||+.+.||||||+|++++++++..+.+...++..+ +.++.. 
T Consensus 60 Ia~~~E~G~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~~~~~~~~L~~ 108 (168) T protein:vir:94 60 IAQALEYGHGQNHPRPFMQQTYAAQYRAWSRDLTLTLKAGAAADTALRT 108 (168) T ss_pred HHHHHhcCCCCCCCchhhHHHHHHHHHHHHHHHHHHHhcCCCHHHHHHH Confidence 89999999999999999999999999888776665554221 111111 No 137 >protein:vir:95260 Length: 160 # NCBI annotation: Phage conserved protein # Family: family:all:31735 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944893;genbank:gi:38707833;genbank:GeneID:2744046 Probab=97.29 E-value=1.2e-06 Score=53.06 Aligned_cols=91 Identities=8% Similarity=0.049 Sum_probs=46.1 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) ||+ ++...|++.|...|++| ++.. +.+. +.-. T Consensus 1 ~~~-~~~~~G~~~L~~~~k~l-----------------------~~~~-----------V~VG-------------i~~d 32 (160) T protein:vir:95 1 MVK-RVIHPARAKLVGAMKNL-----------------------QTAN-----------AQVG-------------YFQE 32 (160) T ss_pred Cce-eechHhHHHHHHHHHHH-----------------------hCCe-----------eEEe-------------eccc Confidence 443 44456666666555543 0100 1110 0000 Q ss_pred cccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHH----HHHHHHHHHHHHHHHHHHH-------HhcC Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDT----REEEAASVAIARMNQAIDE-------VLSK 149 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~----~~~~~~~~~~~~l~~~i~k-------~~~k 149 (149) .... ..+.+-+..+.+.||||.+.|++||||++|+. .+...+..+...+.+.+.. .++. 
T Consensus 33 ~g~~----------~dG~sv~~vA~~~EfG~~~iPaRPf~R~tfe~~~~~~~~~~~~~~~~~i~~~~~~g~~~~~~~LG~ 102 (160) T protein:vir:95 33 QGQH----------SSGFSYPALMYLQEVIGVPSASGKVYRRLFEITMMLNKQTLLEQTKKNLYKQLSSLNTDPSNTLEA 102 (160) T ss_pred cccC----------CCCccHHHHHhhhhcCcccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhHHHHHHH Confidence 0000 01112345778999999999999999999973 4555555544444444442 2222 No 138 >protein:vir:105773 Length: 131 # NCBI annotation: gp14 # Family: family:all:10996 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224152;genbank:gi:62362227;genbank:GeneID:3342526 Probab=97.06 E-value=4e-06 Score=50.20 Aligned_cols=111 Identities=13% Similarity=0.120 Sum_probs=70.3 Q ss_pred eehhhHHHHHHHHHHhHHHHHH-HHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccc--cccCCccccceeeecc Q lcl|NC_019767. 6 LDFSGLNDIAKDLEALSRAENN-KVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQK--SRRRGEISSGVHIRGV 82 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~-k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~--~~~~g~~~~~~~~~~~ 82 (149) |+|+|+.+...+|+++-++++. +++ .||..+..+....|-...|+||..|-+|--..... .+.+|.+.+. T Consensus 1 ikV~Gi~~~~~nl~~~i~~I~~~K~~-Ral~~al~~~~~~AA~~TPIDTSTLiNSQfrei~~ngtritGRVGYS------ 73 (131) T protein:vir:10 1 MPVKGIKRIQMNTRRVLSDIAGIRTE-KVLYLVMNAGANHAAVITPVKSSTLINSQYKKLEPIPSGMIGRVGYT------ 73 (131) T ss_pred CCcchHHHHHHHHHHHHHhhccchHH-HHHHHHHHHHHhhhhhccccchhhhccccceeeeccCceeEEeeccc------ Confidence 8899999999999999999884 554 58888888999999999999999998764322111 1112222111 Q ss_pred cccccccccccccCCCCCcceehhccc--CCcCCC--------------CCcchhHHHHHHHH-HHHHHHHHHHHH Q lcl|NC_019767. 83 NPRTGNSDNTMKANNPRNAFYWRFVEL--GTANMP--------------AHPFVRPAYDTREE-EAASVAIARMNQ 141 (149) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~y~~f~E~--GT~~~~--------------a~PFl~pA~~~~~~-~~~~~~~~~l~~ 141 (149) +-|+-++.- |+-+.. -.-||..+|+.++. 
.+...+++++.- T Consensus 74 ------------------AnYA~yVHda~Gklkgqprp~gkgn~w~p~ae~eFL~kgfe~~~~d~i~avik~e~k~ 131 (131) T protein:vir:10 74 ------------------ANYAAAVNAAKGKLKGKPRPDGSGNYWDPNGEPDFLRKGFERDGLNEIKAIIRQGYKV 131 (131) T ss_pred ------------------eeeeeeeecCccccCCCcCCCCCcceecCCCChhhhhhhhhccchHHHHHHHhhhcCC Confidence 223333321 222222 23499999987644 455555555544 No 139 >protein:vir:97088 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2714 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453568;genbank:gi:84662603;genbank:GeneID:5142503 Probab=97.02 E-value=8.8e-06 Score=48.29 Aligned_cols=130 Identities=13% Similarity=0.018 Sum_probs=84.5 Q ss_pred eehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeeccccc Q lcl|NC_019767. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPR 85 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~ 85 (149) |+++=...=+..|.+.-+++. +..+++++.|+..-....+..|-- .....+|.+..+|++.....+ T Consensus 1 m~~~~~~~d~s~l~~~l~~l~-~~~~~v~R~A~~~ga~vv~dear~-------------~aP~~tG~LkksI~~~~~~~~ 66 (157) T protein:vir:97 1 MKFSIRSVDITGILAGLETVV-EHSSDVVRTMTYESAVAVRESAKA-------------FVNDETGKLRNNLYVAYSPEE 66 (157) T ss_pred CeeEeecccHHHHHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHH-------------hCCCCcchhhhheeeeecccc Confidence 888732322444555556665 456777787777777766666521 112357999999998877766 Q ss_pred ccccccc-cccCCCCCcceehhcccCCcCCCC------C-----------c-c--hhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 86 TGNSDNT-MKANNPRNAFYWRFVELGTANMPA------H-----------P-F--VRPAYDTREEEAASVAIARMNQAID 144 (149) Q Consensus 86 ~~~~~~~-~~~~~~~~~~y~~f~E~GT~~~~a------~-----------P-F--l~pA~~~~~~~~~~~~~~~l~~~i~ 144 (149) .+..... .++++...++||||+|||++..-. . + + =+|=+.-.-+...+.+.+.+...|. 
T Consensus 67 s~~g~~~~~Vg~~~~~a~~g~~vEfG~~~~~~~~~~~~~~~~~~~~~~~t~~~~Pa~PFlRPA~d~~k~~a~~~~~~~l~ 146 (157) T protein:vir:97 67 SVEGIQTYAVSWRKKAAPHGHLLEFGHWQTHAAYRDKDGQWYSSKVKLVNPKWIPAKPFLRPGYDSVAMQIPDIARAAGA 146 (157) T ss_pred CCCceEEEEEeecCCccceeeeeecCcccccccccCCcccccccccccCCCCcCCCCcccchHHHHhHHHHHHHHHHHHH Confidence 5544333 345666789999999999754211 1 1 1 2555666677777888888888888 Q ss_pred HHhcC Q lcl|NC_019767. 145 EVLSK 149 (149) Q Consensus 145 k~~~k 149 (149) +.+.. T Consensus 147 k~I~e 151 (157) T protein:vir:97 147 KKYAE 151 (157) T ss_pred HHHHH Confidence 88877 No 140 >protein:vir:101508 Length: 120 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655400;genbank:gi:109522588;genbank:GeneID:4157580 Probab=96.95 E-value=2.5e-05 Score=45.82 Aligned_cols=114 Identities=13% Similarity=0.165 Sum_probs=68.5 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCC--cCCCcccccceecccccccCCcccccee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGVH 78 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~g~~~~~~~ 78 (149) |+.++|++. .++|+++++.+..+.. ....--+..+|..+..+||.+|| ..||+.+..|...... .+.-...++ T Consensus 1 ~~~~~f~~~-~~~l~~~i~~~~~k~~-~~~~~~~d~~a~~le~~aK~nApW~DRTg~ARq~i~~~~~~---~~~~~~~Iy 75 (120) T protein:vir:10 1 MAKIEFKFK-DIELRRGVEDMEAKVD-RAMKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVAST---PQPDRYEIV 75 (120) T ss_pred CceEEEEec-HHHHHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHhcCCCCcccchhhhhhhcccccc---CCCceEEEE Confidence 999999998 6899999998876653 33344455678889999999999 3567666555321110 111111111 Q ss_pred eecccccccccccccccCCCCCcceehhcc--cCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVE--LGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E--~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) +. ....|+-|+| .|..++. |+|+++.-.+++++-++. 
.++| T Consensus 76 ls------------------h~veYG~~LEla~~~kyaI----l~PTi~~~~~~il~g~~~--------ll~~ 118 (120) T protein:vir:10 76 FA------------------HTVHYGIWLEIANSGRYEI----IMPTVHHEGKLMAQRLRG--------LLGR 118 (120) T ss_pred Ee------------------cCeeecceEEeeCCCCccc----ccchHHHHhHHHHHHHHH--------Hhhh Confidence 11 2345888899 5544443 555555555555544443 4444 No 141 >protein:vir:6071 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878212;genbank:gi:33438911;genbank:GeneID:1457746 Probab=96.85 E-value=7.9e-06 Score=48.55 Aligned_cols=126 Identities=15% Similarity=0.153 Sum_probs=60.5 Q ss_pred hhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh-----CCcCCCcccccce-----eccccccc----CCcc Q lcl|NC_019767. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVV-----VVTQKSRR----RGEI 73 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~-----~~~~~~~~----~g~~ 73 (149) +..+++|...|..+-..+...--+..++.-|+.++...+.+ .|. |....... ...++... .+.+ T Consensus 1 ~~~~~~l~~~L~~~l~~L~~~~~~~l~r~Ig~~l~~~~~~Rf~~q~~Pd--G~~W~p~~~~~~~~k~~~~~~~l~~~~~l 78 (150) T protein:vir:60 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPD--GTPYAPRQQQSARKKTGRVKRKMFAKLIT 78 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCC--CCCCcccChHHHHHhhcCCCccchhhhhh Confidence 44555666555555443321112333445555555555543 453 22221111 00000000 1111 Q ss_pred ccceeeecccccccccccccccCCCCCcceehhcccCC----------cCCCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 
74 SSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGT----------ANMPAHPFVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT----------~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~ 141 (149) ...+ ....+.....+....+++..|+..+-||- ..+|++|||.=.- +.++++++.+.+.|.+ T Consensus 79 ~~sl-----~~~~~~~~a~vg~~~Gt~~~yAaiHQfG~~~~~~~~~~~~~iPaRp~LG~s~-~d~~~i~~~i~~~l~r 150 (150) T protein:vir:60 79 SRFL-----HIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTG-EDVQMIEEIILAHLDR 150 (150) T ss_pred ccee-----eeeeeCcEEEEEeeCCCchhhhhhhhccccccccCCCCceecCCcccCCCCH-HHHHHHHHHHHHHHhC Confidence 1111 11222222222223356788999999993 3589999998774 4566666666666666 No 142 >protein:vir:98892 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164422;genbank:gi:56694912;genbank:GeneID:3197282 Probab=96.85 E-value=1.4e-05 Score=47.17 Aligned_cols=103 Identities=17% Similarity=0.091 Sum_probs=60.7 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) ||.++|++.++...+ .. +.+..|...-++.+..++..-+|.++|.|++|..+... .|.+ T Consensus 1 mmkvkv~~~~~~~~~---~~-------~~~~~aq~~~~~ev~~~~~~yVP~~~G~L~~s~~~~s~----~g~I------- 59 (108) T protein:vir:98 1 MPKIRVELSGAKDKL---SP-------QTQRRGQYAMANQMLQDMNQFVPMEEGILRLTGNISSD----AEEI------- 59 (108) T ss_pred CceeEeeehHHHHHH---HH-------HHHHHHHHHHHHHHHHhhcccCcCcCCccccceeeccC----CceE------- Confidence 999999988754432 11 11223444455677778899999999999998543321 1111 Q ss_pred cccccccccccccccCCCCCcceehhcccCCcCC-----CCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANM-----PAHPFVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~-----~a~PFl~pA~~~~~~~~~~~~~~~l~~ 141 (149) ..+++|++++=||+.+. 
....|+..|.....+++++.+.+.++= T Consensus 60 -----------------~y~tPYAr~qYYg~~~n~~~p~ag~~W~eraka~~~~~~~~~~~k~~k~ 108 (108) T protein:vir:98 60 -----------------YYNTPYAKRRFYEPAYNYTTPGTGPRWDMKAKRLFISDWERAYMKGANW 108 (108) T ss_pred -----------------EecChhhHHhhhccccCCCCCCCcchhHHHHHhhhhHHHHHHHHHhhcC Confidence 12345777777665443 233477767666666555554444333 No 143 >protein:vir:99546 Length: 200 # NCBI annotation: hypothetical protein # Family: family:all:503 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039796;genbank:gi:126011046;genbank:GeneID:4818241 Probab=96.84 E-value=7.8e-06 Score=48.59 Aligned_cols=120 Identities=16% Similarity=0.162 Sum_probs=56.5 Q ss_pred Cccceeehhh---HHHHHHHHHHhHHHHHHHHHHHHHHHHH-----------HHHHHHHHhh-------CCcCCCccccc Q lcl|NC_019767. 1 MIETSLDFSG---LNDIAKDLEALSRAENNKVLRDATRAGA-----------EVLKEEVIAR-------APVRTGKLKKN 59 (149) Q Consensus 1 Mm~~~~~i~G---l~~l~~~l~~l~~~~~~k~~~~Al~~~a-----------~~v~~~ak~~-------aP~~~g~l~~~ 59 (149) =|.|+++|.| ++++++.|++|.... +.-.+-+.+ ..++.-|.-+ .|-.+..++.. T Consensus 4 ~~~~~~k~~~~~~~~~~~~~l~~l~~~~----v~vGi~~~~~y~~~~~~~dG~~va~IA~~~EfG~~i~~p~~~~~~~~~ 79 (200) T protein:vir:99 4 GFSKSNSVAAPLKHFQMLKQFDALKGKT----VQAGWFETDRYPAKEGETIGPLVAKIARQLEFGGVINHPGGTKYIKDA 79 (200) T ss_pred CcceeeeeecchHHHHHHHHHHHhhCCe----EEEEEcCCCCcCCcccccccchHHHHHhHHHcCCeeccCCCccccccc Confidence 1335556665 777777777664321 111111111 1111111110 11111111110 Q ss_pred ceecccccccCCccccceeeecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 60 VVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARM 139 (149) Q Consensus 60 i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l 139 (149) +. . ....+... . 
.+...-|+.|+--.|.++||||||+|+++.++++..+.+...+ T Consensus 80 ~~--------~------------g~~~g~rf---v--~k~~~~~~~~~~~~~v~IP~RPFlr~t~~~~~~~~~~~~~~~~ 134 (200) T protein:vir:99 80 IV--------D------------GRYVGTRF---V--HKSFQGEHEVTKAHQIVIPARPFMRLAWATFNKDKVKIQAQIA 134 (200) T ss_pred cc--------c------------cccccccc---c--cccccceeeeeccccccCCCcchhhHHHHHHHHHHHHHHHHHH Confidence 00 0 00011111 1 1223445556555688999999999999999998888877777 Q ss_pred HHHHH------HHhcC Q lcl|NC_019767. 140 NQAID------EVLSK 149 (149) Q Consensus 140 ~~~i~------k~~~k 149 (149) .+.|. +++.+ T Consensus 135 ~~~l~g~~~~~~~L~~ 150 (200) T protein:vir:99 135 RQLLDGTINPEQALAQ 150 (200) T ss_pred HHHHhCCCCHHHHHHH Confidence 65442 33333 No 144 >protein:vir:5703 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839862;genbank:gi:30065717;genbank:GeneID:1260611 Probab=96.83 E-value=8.7e-06 Score=48.32 Aligned_cols=131 Identities=15% Similarity=0.171 Sum_probs=62.0 Q ss_pred hhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh-----CCcCCCcccccceecc--cccccCC-cc-cccee Q lcl|NC_019767. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVVVT--QKSRRRG-EI-SSGVH 78 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~~~--~~~~~~g-~~-~~~~~ 78 (149) +..+++|...|..+-..+...-.+..++.-|+.++...+.+ .|. |.........+ .+.+..+ .+ ..... T Consensus 1 m~~~~~l~~~L~~~l~~L~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~Pd--G~~W~p~k~~~~~~k~~~~~~~l~~~~~l 78 (150) T protein:vir:57 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPD--GTPYAPRQQQSARKKTGRVKRKMFAKLIT 78 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCC--CCCCcccChHHHHHhccCCCcccchhhhh Confidence 44566666666665444322222334555555666655554 453 22221111000 0000000 00 00000 Q ss_pred eecccccccccccccccCCCCCcceehhcccCCc----------CCCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 
79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTA----------NMPAHPFVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~----------~~~a~PFl~pA~~~~~~~~~~~~~~~l~~ 141 (149) ........+.....+....+++..|+..+-||-. .+||+|||.=.- ..++++.+.+.+.|.+ T Consensus 79 ~~sl~~~~~~~~a~vg~~~G~~~~yAaiHQfG~~~r~~~~~~~~~iPaRp~LG~s~-~d~~~i~~~i~~~l~r 150 (150) T protein:vir:57 79 SRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEETRKDGKKIDYPARPLLGFTG-EDVQMIEEIILAHLDR 150 (150) T ss_pred ccceeeeeeCcEEEEEeecCCchhhhhhhhccccccccCCCceeecCCcccCCCCH-HHHHHHHHHHHHHHhC Confidence 1111122222222222233567889999999943 589999999774 4566666666666666 No 145 >protein:vir:2688 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:589 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075507;genbank:gi:12719436;genbank:GeneID:920156 Probab=96.80 E-value=1.9e-05 Score=46.46 Aligned_cols=111 Identities=10% Similarity=0.089 Sum_probs=68.7 Q ss_pred HHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHHHhhCC--cCCCcccccceecccccccCCccccceeeecccccccc-- Q lcl|NC_019767. 14 IAKDLEA-LSRAENNKVLRDATRAGAEVLKEEVIARAP--VRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGN-- 88 (149) Q Consensus 14 l~~~l~~-l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP--~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~-- 88 (149) |+++|++ |++.-.+++.++||.++++.|.+..+.+.- .|||...+.+..+.-... .+. T Consensus 1 ilk~lE~k~G~~~m~ri~dkAL~~~g~~v~~~lK~~~~~fkDTGatidev~~s~p~~~-----------------~g~~~ 63 (123) T protein:vir:26 1 MLKKLESVYGKQSMQAKSDRALNEASEFFIKALKKEFESFKDTGASIEEMTKSKPYTK-----------------VGSQE 63 (123) T ss_pred ChhhHHHhcCHHHHHHhhhHHHHHHHHHHHHHHHHhhHHhhhccceeeeEEecCeeec-----------------cCCcc Confidence 6777764 566666789999999999999999999865 478877776655432111 111 Q ss_pred cccccccCC-CCCcceehhcccCCcCC----CCCc--chhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 89 SDNTMKANN-PRNAFYWRFVELGTANM----PAHP--FVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 89 ~~~~~~~~~-~~~~~y~~f~E~GT~~~----~a~P--Fl~pA~~~~~~~~~~~~~~~l~~ 141 (149) ..+.+.+.. 
......-|+.|||..+- .|+- -|..|+++++..+.+.++++|.+ T Consensus 64 rtV~i~W~gp~~R~~iVHLNE~GYtr~Gk~i~PRG~G~i~~a~~~se~~y~~~vk~eL~k 123 (123) T protein:vir:26 64 RAVLIEWVGPMNRKNIIHLNEHGYTRDGKKYTPRGFGVIAKTLAANERKYREIIKKELAR 123 (123) T ss_pred ceEEEEeecCCCceeeEeeeccceecCCCeEccchhhHHHHHHHhhhHHHHHHHHHHhcC Confidence 112222211 22344579999995332 2332 46667777766666665555555 No 146 >protein:vir:80970 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468396;genbank:gi:157324970;genbank:GeneID:5601405 Probab=96.76 E-value=2.5e-05 Score=45.77 Aligned_cols=104 Identities=14% Similarity=0.159 Sum_probs=67.0 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeec Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~ 81 (149) |+++|+|. +..+.++|.+ ....|...-++.+..++..-+|.++|.|++|..+. ..|.+ T Consensus 1 M~vkV~id-~~~~~~~l~~--------a~~~aq~~~~~ev~~~~~~yVP~~tG~L~~s~~~~-----~~g~I-------- 58 (112) T protein:vir:80 1 MPIKVRVD-LSKAKGSVKK--------AKERGQFALINQAAADIALYVPFLSGDLSNQYVIM-----NDKEI-------- 58 (112) T ss_pred CceeEEee-hHHHHHHHHH--------HHHHHHHHHHHHHHHHhhcCCCcccCccccceeec-----cCceE-------- Confidence 89888887 3444443332 22345555667777788999999999998873210 00111 Q ss_pred ccccccccccccccCCCCCcceehhcccCCcC--------CCCCcchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 82 VNPRTGNSDNTMKANNPRNAFYWRFVELGTAN--------MPAHPFVRPAYDTREEEAASVAIARMNQAI 143 (149) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~--------~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i 143 (149) ..+++|++++-||... 
..-..|+..+.....+++++.+.+.+.+.| T Consensus 59 ----------------~y~tPYAr~qYY~~~~~~~~~~~p~ag~~W~erak~~~~~~~~~~~~k~~~~~l 112 (112) T protein:vir:80 59 ----------------MWTSIYARRLYNGINFNFTLTHHPLAGPKWDQRAKVDKLESWIEVAQKAVEEGL 112 (112) T ss_pred ----------------EecCchhhHhhhcccCCCCcCCCCCcchhhHHHHHhhhhHHHHHHHHHHHhhcC Confidence 1235677777665432 223468888888888888888888888888 No 147 >protein:vir:1838 Length: 149 # NCBI annotation: O protein # Family: family:all:370 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052262;genbank:gi:9634069;genbank:GeneID:1262457 Probab=96.68 E-value=1.1e-05 Score=47.81 Aligned_cols=130 Identities=17% Similarity=0.152 Sum_probs=56.7 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh-----CCcCCCcccccceecc--ccccc-CCc Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVVVT--QKSRR-RGE 72 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~~~--~~~~~-~g~ 72 (149) |- .|+++...|+.|-..+....-+..++.-|+.+....+.+ .|.. .......... .+.+. ... T Consensus 1 m~-------~~~~~~~~l~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~rf~~q~~PdG--~~W~p~~~~~~~~~~g~~~~~ 71 (149) T protein:vir:18 1 MS-------ELTALQERLAGLIASLSPAARRKMAAEIAKKLRTSQQQRIKRQQAPDG--TPYAARKRQPVRSKKGRIKRE 71 (149) T ss_pred Cc-------hHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhhcCCCC--CCCcccchhhhhhccCcccch Confidence 22 344455444444333211111234455555555555543 4532 2111111000 00000 000 Q ss_pred cccce-eeecccccccccccccccCCCCCcceehhcccCCc----------CCCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 73 ISSGV-HIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTA----------NMPAHPFVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 73 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~----------~~~a~PFl~pA~~~~~~~~~~~~~~~l~~ 141 (149) +.... ............. ..++..+++..|+..+-||.. 
.+|++|||.=.- +.++++++.+.+.|.+ T Consensus 72 ~~~~l~~~~~l~~~~~~~~-~~v~~~Gtn~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~i~~~l~~ 149 (149) T protein:vir:18 72 MFAKLRTSRFMKAKGSDSA-AVVEFTGKVQRMARVHQYGLKDRPNRNSRDVQYEARPLLGFTR-DDEQMIEDVIISHLGK 149 (149) T ss_pred hhhhhhhhhhhheeecCce-eEEEecccchhhhhhhhccccccccCCCccccccccccCCCCH-HHHHHHHHHHHHHHhC Confidence 00000 0000111111111 122223566789999999965 689999998663 4566676666666666 No 148 >protein:vir:2026 Length: 150 # NCBI annotation: gpS # Family: family:all:370 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046769;genbank:gi:9630340;genbank:GeneID:1261511 Probab=96.64 E-value=1.4e-05 Score=47.19 Aligned_cols=131 Identities=15% Similarity=0.162 Sum_probs=62.3 Q ss_pred hhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh-----CCcCCCcccccceecc--cccccCCc-cccc-ee Q lcl|NC_019767. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVVVT--QKSRRRGE-ISSG-VH 78 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~~~--~~~~~~g~-~~~~-~~ 78 (149) +.-+++|...|..|-..+...-.+..++.-|+.++...+.+ .|. |.......... .+.+..+. +... .. T Consensus 1 ~~~~~~l~~~L~~ll~~l~~~~~~~l~~~Ig~~l~~~~~~rf~~q~~Pd--G~~W~p~k~~~~~~k~g~~~~~l~~~~~l 78 (150) T protein:vir:20 1 MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKAPD--GTPYAPRQQQSVRKKTGRVKRKMFAKLIT 78 (150) T ss_pred CchHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCC--CCCCcccchHHHHHhccCCCccccchhhh Confidence 44556666666665444321112334555555565555554 353 22221111000 00111110 1110 01 Q ss_pred eecccccccccccccccCCCCCcceehhcccCC----------cCCCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 
79 IRGVNPRTGNSDNTMKANNPRNAFYWRFVELGT----------ANMPAHPFVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT----------~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~ 141 (149) ........+.....+....+++..|+..+-||- ..+||+|||.=.- ..++++++.+.+.|.+ T Consensus 79 ~~sl~~~~~~~~~~vg~~~Gs~~~yAa~HQfG~~~~~~~~~~~~~iPaRp~LG~s~-~d~~~i~~~i~~~l~k 150 (150) T protein:vir:20 79 SRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTG-EDVQMIEEIILAHLER 150 (150) T ss_pred hhhhheeecCcEEEEEeeCCcchhhhhhhhcccccccccCCCceeccccccCCCCH-HHHHHHHHHHHHHHhC Confidence 111122222222233323456778999999993 3689999998764 4566666666666666 No 149 >protein:vir:98557 Length: 149 # NCBI annotation: gp14 # Family: family:all:370 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958069;genbank:gi:41057366;genbank:GeneID:2744228 Probab=96.62 E-value=1.4e-05 Score=47.19 Aligned_cols=125 Identities=16% Similarity=0.133 Sum_probs=59.6 Q ss_pred hhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh-----CCcCCCccccccee-----cccccc----cCCcc Q lcl|NC_019767. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVV-----VTQKSR----RRGEI 73 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~-----~~~~~~----~~g~~ 73 (149) ++.+++|...|+.|-..+...--+..++.-|+.+....+.+ .|.. ........ ..++.. ..+.+ T Consensus 1 m~d~~~l~~~L~~ll~~L~~~~~~~ll~~Ig~~l~~~t~~rf~~q~~PdG--~~W~p~~~~~~~~k~~~~~~~l~~~g~l 78 (149) T protein:vir:98 1 MSELTALQERLTGLIASLSPAARRQMAADIAKKLRASQQQRIRRQQAPDG--TPYAARKRQSVRSKKGRIRREMFARLRT 78 (149) T ss_pred CchHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHhhcCCCC--CCCcccchHHHHhccCCCCcccchhhhh Confidence 22345666666555443321112334555555666555554 4532 21111110 000000 01111 Q ss_pred ccceeeecccccccccccccccCCCCCcceehhcccCCc----------CCCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 74 SSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTA----------NMPAHPFVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~----------~~~a~PFl~pA~~~~~~~~~~~~~~~l~~ 141 (149) ...+ ....+.... 
.++..+++..|+..+.||.. .+|++|||.=. +..++++++.+.+.|.+ T Consensus 79 ~~sl-----~~~~~~~~~-~V~~~Gs~~~yAa~HQfG~~~r~~~~~~~~~iPaRp~LG~s-~~d~~~i~~~i~~~l~~ 149 (149) T protein:vir:98 79 NRFM-----KAKGSDSAA-VVEFTGRVQRMARVHQYGLKDRPNRHSRDVQYAARPLLGFT-RDDEQMIEDIIIRHLGK 149 (149) T ss_pred hhhh-----hheecCCee-EEEecCcchHHhhHhhccccccccCCCcceeccccccCCCC-HHHHHHHHHHHHHHhhC Confidence 1111 111111111 22223566789999999954 58999999854 34566666666666666 No 150 >protein:vir:79179 Length: 155 # NCBI annotation: gp39, phage virion morphogenesis protein # Family: family:all:370 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111070;genbank:gi:134288746;genbank:GeneID:4960698 Probab=96.61 E-value=1.8e-05 Score=46.60 Aligned_cols=131 Identities=13% Similarity=0.098 Sum_probs=58.4 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh-----CCcCCCcccccceecc---cccccCCc Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVVVVT---QKSRRRGE 72 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~~~~---~~~~~~g~ 72 (149) ||+ + +++|...|+.|-..+....-+..++.-|+.++...+.+ .|..+ ......... ......|. T Consensus 1 m~~---~---~~~l~~~l~~ll~~l~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~PDG~--~W~prk~~~~~~~~~~~~g~ 72 (155) T protein:vir:79 1 MTD---D---LQALERWAGGLLAKLSPAARRQLLRELGRDLRRAQQSRVAAQRNPDGS--AYEPRKVKAGGKRLREKAGR 72 (155) T ss_pred Cch---H---HHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCC--CCcccchhhhhhhhhcccCc Confidence 775 2 33444444444333211111223444444444444433 45322 111110000 00111122 Q ss_pred ccccee------eecccccccccccccccCCCCCcceehhcccCCc----------CCCCCcchhHHHHHHHHHHHHHHH Q lcl|NC_019767. 73 ISSGVH------IRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTA----------NMPAHPFVRPAYDTREEEAASVAI 136 (149) Q Consensus 73 ~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~----------~~~a~PFl~pA~~~~~~~~~~~~~ 136 (149) +..... ........+..... ++..+++..|+..+-||.. .+||+|||.=.-+ .++++++.+. 
T Consensus 73 ~~~~~m~~~l~~a~~l~~~~~~d~a~-Vg~~Gs~~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LGls~~-d~~~I~~~i~ 150 (155) T protein:vir:79 73 VKREAMFRKLRTARYLRIDVDSTGLA-IGFDERLSRIARVHQEGQKAPVEPGGPLAQYPVRVVLGFSDA-DRELVRDRLL 150 (155) T ss_pred ccchhhhhhhhhhheeeeeecCcEEE-EEecCcchhhhhhhhcCCcccCCCCCcccccccccccCCCHH-HHHHHHHHHH Confidence 111100 00111112222222 2223566789999999954 5899999977743 5677777777 Q ss_pred HHHHH Q lcl|NC_019767. 137 ARMNQ 141 (149) Q Consensus 137 ~~l~~ 141 (149) +.|.+ T Consensus 151 ~~l~r 155 (155) T protein:vir:79 151 RELTR 155 (155) T ss_pred HHhhC Confidence 77777 No 151 >protein:vir:79115 Length: 148 # NCBI annotation: tail completion protein gpS # Family: family:all:370 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165266;genbank:gi:145708091;genbank:GeneID:5247126 Probab=96.61 E-value=1.4e-05 Score=47.27 Aligned_cols=127 Identities=15% Similarity=0.118 Sum_probs=62.2 Q ss_pred hhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh-----CCcCCCc--ccccceecccccc----cCCccccc Q lcl|NC_019767. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGK--LKKNVVVVTQKSR----RRGEISSG 76 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~-----aP~~~g~--l~~~i~~~~~~~~----~~g~~~~~ 76 (149) +..+++|...|..|-..+....-+..++.-|+.++...+.+ .|..+.. +............ ..+.+... T Consensus 1 m~~~~~l~~~L~~ll~~l~~~~~~~l~r~Ig~~l~~st~~Rf~~q~~PDG~~W~p~s~~~~~~~g~~~~~~~~~l~~~~~ 80 (148) T protein:vir:79 1 MSESRELEAWLAGMLTKLDAPARRMLARAVAAELRRRQAARIAEQRNPDGSPYVPRKPQLRHRAGRIRRAMFMRLRLARY 80 (148) T ss_pred CccHHHHHHHHHHHHHhcCChhHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCcccchHHHhhcccccccccchhhhhhh Confidence 34456666666666544432222334455555555555543 4532211 1111100000000 00111111 Q ss_pred eeeecccccccccccccccCCCCCcceehhcccC----------CcCCCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 77 VHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELG----------TANMPAHPFVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~G----------T~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~ 141 (149) + ....+.. 
...++..+++..|+..+-|| +..+|++|||.=.- +.++++++.+.+.|.. T Consensus 81 l-----~~~~~~~-~~~v~~~Gt~~~yAaiHQfG~~~r~~~~~~~v~iPaRp~LG~s~-~d~~~i~~~i~~~l~~ 148 (148) T protein:vir:79 81 M-----KTQADAN-TAVVTFAGNAQRIATVHQFGLRDRVNKAGLTAQYPARELLGMDG-VDMEHITNLLLLHLGA 148 (148) T ss_pred e-----eeeeeCC-eeeEEeeccchhhhhhhhcCccccccCCCCccccCcccccCCCH-HHHHHHHHHHHHHhcC Confidence 1 1111111 12222235667899999999 44689999998664 4677788888877777 No 152 >protein:vir:396 Length: 184 # NCBI annotation: gp11 # Family: family:all:869 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046906;genbank:gi:9630476;genbank:GeneID:1261650 Probab=96.56 E-value=9.2e-05 Score=42.72 Aligned_cols=140 Identities=14% Similarity=0.177 Sum_probs=64.6 Q ss_pred eehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCc----ccccceecccccccCCccccceeeec Q lcl|NC_019767. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGK----LKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~----l~~~i~~~~~~~~~~g~~~~~~~~~~ 81 (149) |+|+||+++.+.|..|++....+++..||...+.-+..++.+.+....+- +++.+.+.. ...+.+...+.+.. T Consensus 1 ~~v~~l~~~~~~L~~l~~~~v~kA~~rAiNrt~~~~rt~~~r~v~~~~~i~~~~ir~r~~~~k---as~~~l~a~I~~~~ 77 (184) T protein:vir:39 1 MSLKGLEQAIENLNSISKTAVPRASAQAVNRVANRAVSRSVAVVSKDTRVPRKLVKQRARVKR---ATVNKPRALIRVNR 77 (184) T ss_pred CchHHHHHHHHHHhccCHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHhhheecc---cCCCCeEEEEEEec Confidence 99999999999999997775578888999999888888887776544432 222222211 11122222221111 Q ss_pred c------------------cccccccccccccCC-CCCccee--------hhcccCCcCCC--------CCcchhHHHHH Q lcl|NC_019767. 82 V------------------NPRTGNSDNTMKANN-PRNAFYW--------RFVELGTANMP--------AHPFVRPAYDT 126 (149) Q Consensus 82 ~------------------~~~~~~~~~~~~~~~-~~~~~y~--------~f~E~GT~~~~--------a~PFl~pA~~~ 126 (149) . ....+.......+.. -..+|.+ -|.--|....| +.| +..+++. 
T Consensus 78 ~~i~l~~~g~~~~k~~~~~~~~~~~~~~~~~g~~~~~gaFia~~~~G~~~Vf~R~gk~R~PI~~~~~~i~~~-~~e~~~~ 156 (184) T protein:vir:39 78 GNLPAIKLGTASVRLSRRKRDKKGANSVLRIGPFRFPGGFIQQLKNGRWHVMRRTSKPRYPIEVVSIPLAAP-LTTAFKE 156 (184) T ss_pred cceeeeeccccccccCccccccccccceeeecceecCcceeeecCCCceEEEEEecCcccceeEEEcCchHH-HHHHHHH Confidence 0 000010000000000 0112221 12233333332 222 1233332 Q ss_pred H-----HHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 127 R-----EEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 127 ~-----~~~~~~~~~~~l~~~i~k~~~k 149 (149) . .+.+.+.|..+|..+|++.++| T Consensus 157 ~~~~~~~~~~~~el~~~l~~~L~~~l~r 184 (184) T protein:vir:39 157 ELPKLMESDMPKELRASLTNQLRLILTR 184 (184) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhcCC Confidence 2 2333444444444444555555 No 153 >protein:vir:80037 Length: 199 # NCBI annotation: gp11 # Family: family:all:503 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468715;genbank:gi:157325295;genbank:GeneID:5601728 Probab=96.35 E-value=3.2e-06 Score=50.74 Aligned_cols=130 Identities=12% Similarity=0.143 Sum_probs=55.8 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccce---- Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGV---- 77 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~---- 77 (149) |+++-+-.=+++|++.|+.|..... ++ .-+......++--|.-+ ..-..|... .+.+.... T Consensus 1 m~vt~~~~~~~~~~~~l~~L~~k~v-~v--Gi~~~d~~~~~~Ia~~~------E~Ga~I~~~------~~~l~Ip~~~a~ 65 (199) T protein:vir:80 1 MKVTTDKSTMNKAIRELDQLDRYSL-QI--GLFGEDDSFIQMIAGVH------EFGLTIRPK------GKYLTIPTPEAG 65 (199) T ss_pred CcccccHHHHHHHHHHHHHhcCCEE-EE--EEecCCCcchhheeehh------hcCCeeecC------Cceeeecchhhh Confidence 7766555557788888877753211 11 00100000000000000 000011110 00000000 Q ss_pred eeecc----cccccccccccccCCCCCcceehhcccCCc--CCCCCcchhHHHHHHHHHHHHHHHHHHHHHH------HH Q lcl|NC_019767. 
78 HIRGV----NPRTGNSDNTMKANNPRNAFYWRFVELGTA--NMPAHPFVRPAYDTREEEAASVAIARMNQAI------DE 145 (149) Q Consensus 78 ~~~~~----~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~--~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i------~k 145 (149) ..... ............. ...--...+|||+. +.||||||+|++++++++..+.+...+...| ++ T Consensus 66 ~~k~~~~~~~~~p~g~~~~~~~---~~~~~~~~~e~g~~~~~IP~RPFlr~t~~~~~~~~~~~~~~~~~~vl~g~~~a~~ 142 (199) T protein:vir:80 66 DRRARDIPGLFKPKGKNILAVA---GPDGKLTVMFYLKTEVNIPERSFLRSTFDEKSNKWGELFEGWIDDVIHGKLSAEQ 142 (199) T ss_pred cccccccCcccccCCcceeeee---ccccceeeeeeccccccCCCCchhHHHHHHHHHHHHHHHHHHHHHHHhCCCcHHH Confidence 00000 0000000000001 11123456899974 7899999999999999998888877776643 22 Q ss_pred HhcC Q lcl|NC_019767. 146 VLSK 149 (149) Q Consensus 146 ~~~k 149 (149) ++.. T Consensus 143 ~L~~ 146 (199) T protein:vir:80 143 VYNR 146 (199) T ss_pred HHHH Confidence 3333 No 154 >protein:vir:45 Length: 112 # NCBI annotation: gp10 # Family: family:all:899 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463471;swissprot:trembl:q9t1b3;genbank:gi:16798793;uniprot:Q9T1B3;genbank:GeneID:922369 Probab=96.14 E-value=8.5e-05 Score=42.90 Aligned_cols=104 Identities=13% Similarity=0.135 Sum_probs=66.8 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeec Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~ 81 (149) |+++|+|.. ..+.++|. +++.+|...-++.+..++..-+|.++|.|++|..+- ..|. 
T Consensus 1 M~vkv~vn~-~~~~~~l~--------~a~~r~q~~~~~ev~~~~~~yVP~~~G~L~~S~~~~-----~~g~--------- 57 (112) T protein:vir:45 1 MPIKVRVDL-SKAKGSVK--------KAKERGQFALINQAAADIALYVPFLSGDLSNQYVIM-----NDKE--------- 57 (112) T ss_pred CceeEEeeh-HHHHHHHH--------HHHHHHHHHHHHHHHHHhhcCCccccCccccceeec-----cCCe--------- Confidence 998888873 33333221 223345666677788888999999999999863210 0011 Q ss_pred ccccccccccccccCCCCCcceehhcccCCcC--------CCCCcchhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 82 VNPRTGNSDNTMKANNPRNAFYWRFVELGTAN--------MPAHPFVRPAYDTREEEAASVAIARMNQAI 143 (149) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~--------~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i 143 (149) + ..+++|++++=||... ..-..|+..|.....+++++.+.+.+.+.| T Consensus 58 -----------I----~y~tPYAr~qYY~~~~~~~~~~~p~ag~~W~erak~~~~~~~~~~~~k~~~~gl 112 (112) T protein:vir:45 58 -----------I----MWTSIYARRLYKGINFNFTLTHHPLAGPEWDQRAKIDKMDVWEKVAQKAVEEGL 112 (112) T ss_pred -----------E----EecChhhHHhhhccccCCCCCCCCCCchhhHHHHHHhhHHHHHHHHHHHHhhcC Confidence 1 1234577777665433 233468888888888888888888888887 No 155 >protein:vir:100312 Length: 152 # NCBI annotation: tail synthesis protein S # Family: family:all:370 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655481;genbank:gi:109289949;genbank:GeneID:4157355 Probab=96.11 E-value=4.8e-05 Score=44.25 Aligned_cols=133 Identities=11% Similarity=0.062 Sum_probs=59.8 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh-----CCcCCCcc--cccceecccccccCCccc Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKL--KKNVVVVTQKSRRRGEIS 74 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~-----aP~~~g~l--~~~i~~~~~~~~~~g~~~ 74 (149) |+-. +.+|...|..|-..+...--+..++.-|+.+....+.+ +|..+.-- +.... ........+.+. 
T Consensus 1 M~~~-----~~~~~~~L~~ll~~L~~~~r~~l~~~Ig~~l~~~t~~Rf~~q~~PDG~pW~p~k~~~~-~~k~~~~~~~m~ 74 (152) T protein:vir:10 1 MSEP-----IEQVKTAFDSLLNNISKPRRRLMYQQIGRELARSQRRRIKAQQNPDGSAYEPRKKPKK-GVKSKIKSGKMF 74 (152) T ss_pred CchH-----HHHHHHHHHHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHhccCCCCCCCchhhhhhh-hhcccccchhHH Confidence 3332 44555555544433321111234445555555555443 45432211 11110 000011111111 Q ss_pred ccee-eecccccccccccccccCCCCCcceehhcccC-----------CcCCCCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 75 SGVH-IRGVNPRTGNSDNTMKANNPRNAFYWRFVELG-----------TANMPAHPFVRPAYDTREEEAASVAIARMNQA 142 (149) Q Consensus 75 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~G-----------T~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~ 142 (149) ..+. ........+... ..++..+++..|+..+-|| +..+|++|||.=+- ..+.++.+.+.+.|..+ T Consensus 75 ~~L~~a~~l~~~a~~~~-~~Vg~~Gt~~~yAaiHQfG~~~r~~~~~~~~v~iPaRp~LG~s~-~d~~~I~~~i~~~l~~a 152 (152) T protein:vir:10 75 DKITQPRFMRLRLESEG-VSLGYEGGDAVIARIHQQGLIGRVRKDWDLKVKYASRELLGFTD-DDLQMIEDYMINILAGS 152 (152) T ss_pred HhhhhcceeeeeecCcE-EEEEecCCchhhhhhhccCccccccCCCCcceeccccccCCCCH-HHHHHHHHHHHHHHhcC Confidence 1110 000111111111 2222235667899999999 55699999998774 45666777777666666 No 156 >protein:vir:107099 Length: 137 # NCBI annotation: conserved phage protein # Family: family:all:180 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950610;genbank:gi:119953690;genbank:GeneID:4643108 Probab=95.71 E-value=0.00014 Score=41.75 Aligned_cols=112 Identities=11% Similarity=-0.024 Sum_probs=34.4 Q ss_pred HHHHHHHHHHhHHHHHH--HHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeecccccccc Q lcl|NC_019767. 11 LNDIAKDLEALSRAENN--KVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGN 88 (149) Q Consensus 11 l~~l~~~l~~l~~~~~~--k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 88 (149) +..+...|++|-+.+.. +.+..++..+.+.....+...|- ... ...+|.+..+|.+.... 
T Consensus 1 Ma~~~~Gl~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~ak-------~~a------PvdTG~Lr~SI~~~~~~----- 62 (137) T protein:vir:10 1 MAKVKYGNWELVKELEDFEKETIRWAKKGIAKTTTIIHNSIV-------SNM------PVDTGYLRESVSMDFKK----- 62 (137) T ss_pred CchhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HhC------CcCcchhhcCeeEEeeC----- Confidence 44444455566555432 22233444444444444443331 111 12356666665432100 Q ss_pred cccccccCCCCCcceehhcccCCcC------CCCCcchhHHHHHH-----------------HHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 89 SDNTMKANNPRNAFYWRFVELGTAN------MPAHPFVRPAYDTR-----------------EEEAASVAIARMNQAIDE 145 (149) Q Consensus 89 ~~~~~~~~~~~~~~y~~f~E~GT~~------~~a~PFl~pA~~~~-----------------~~~~~~~~~~~l~~~i~k 145 (149) +......+.=.+|+-.- |..+|+.+++.... -...+.---++-+..|.+ T Consensus 63 --------~~~~~~V~~~~~Ya~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~PFl~pA~~~~~~~i~k 134 (137) T protein:vir:10 63 --------GGLTGVINIGSEYAVYVNYGTGIYAVGPGGSRAKNIPWCYKDADGHWHTTKGQHAQPFWEPAIDEGRAFFNK 134 (137) T ss_pred --------CcEEEEEecCCCcccccccCccccccCCCccccccccceeeccccceeccCCCCCCcchhHHHHHHHHHHHH Confidence 00001111112222110 12334333222100 000111111112222222 Q ss_pred Hhc Q lcl|NC_019767. 146 VLS 148 (149) Q Consensus 146 ~~~ 148 (149) .+. T Consensus 135 ~i~ 137 (137) T protein:vir:10 135 YFS 137 (137) T ss_pred hcC Confidence 222 No 157 >protein:vir:3427 Length: 192 # NCBI annotation: tail component # Family: family:all:869 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040590;genbank:gi:9626254;genbank:GeneID:2703485 Probab=95.63 E-value=0.0004 Score=39.20 Aligned_cols=140 Identities=14% Similarity=0.182 Sum_probs=61.7 Q ss_pred eehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCccc----ccceecccccccCCccccceeeec Q lcl|NC_019767. 6 LDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLK----KNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 6 ~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~----~~i~~~~~~~~~~g~~~~~~~~~~ 81 (149) ++|+||+++++.|+.|++....++...|+...|.-+...+...+...+|-.. +.+.... ...+.+...|.+.. 
T Consensus 1 ~~ik~l~~~~~~L~~i~~~~vp~A~~rAiNrta~~a~t~~~r~v~~e~~I~~k~Ir~r~r~~k---As~~~l~a~I~~~~ 77 (192) T protein:vir:34 1 MAIKGLEQAVENLSRISKTAVPGAAAMAINRVASSAISQSASQVARETKVRRKLVKERARLKR---ATVKNPQARIKVNR 77 (192) T ss_pred CcchhHHHHHHHHhhcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCHHHHHhhheecc---ccCCCceEEEEEec Confidence 7788999999999999888667888889998888888777776655444222 2222211 11111111111110 Q ss_pred cc-------c-c-----------------ccccccc-cccCCCCCcceeh--------hcc-cCCcCCC--------CCc Q lcl|NC_019767. 82 VN-------P-R-----------------TGNSDNT-MKANNPRNAFYWR--------FVE-LGTANMP--------AHP 118 (149) Q Consensus 82 ~~-------~-~-----------------~~~~~~~-~~~~~~~~~~y~~--------f~E-~GT~~~~--------a~P 118 (149) .. . . ....... ++...-..+|+++ |.- .|...-| ..| T Consensus 78 ~~l~~~~l~~~~~~~~rr~~~~~~~~~~~~~~g~~~k~Gk~~f~gaFia~m~ng~~~Vf~R~~gk~R~PIe~vkIpis~~ 157 (192) T protein:vir:34 78 GDLPVIKLGNARVVLSRRRRRKKGQRSSLKGGGSVLVVGNRRIPGAFIQQLKNGRWHVMQRVAGKNRYPIDVVKIPMAVP 157 (192) T ss_pred cceeeeeecccccccccccccccccccccccccceeeecceecCCcccccCCCCCceeEEEccCCCccceeEEEechhHH Confidence 00 0 0 0000000 0000011123221 222 2332222 122 Q ss_pred chhHHHHHHHHHHH-HHHHHHHHHHHHH----HhcC Q lcl|NC_019767. 119 FVRPAYDTREEEAA-SVAIARMNQAIDE----VLSK 149 (149) Q Consensus 119 Fl~pA~~~~~~~~~-~~~~~~l~~~i~k----~~~k 149 (149) . ..||+...++++ +.|..+|.++|.. .++| T Consensus 158 l-~~af~~~~~~~~~~~~~~El~~~L~~~lr~~~k~ 192 (192) T protein:vir:34 158 L-TTAFKQNIERIRRERLPKELGYALQHQLRMVIKR 192 (192) T ss_pred H-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccC Confidence 2 555554443322 2233333333322 2333 No 158 >protein:vir:96763 Length: 177 # NCBI annotation: putative phage-related protein # Family: family:all:1091 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039824;genbank:gi:126010915;genbank:GeneID:5076273 Probab=95.43 E-value=0.00064 Score=38.11 Aligned_cols=148 Identities=16% Similarity=0.184 Sum_probs=72.9 Q ss_pred Cccceeehhh-HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCC----cccccceecccccccCCcccc Q lcl|NC_019767. 
1 MIETSLDFSG-LNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTG----KLKKNVVVVTQKSRRRGEISS 75 (149) Q Consensus 1 Mm~~~~~i~G-l~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g----~l~~~i~~~~~~~~~~g~~~~ 75 (149) =|+++|++++ ++.+...|..++..+ .+++..||...|.-+...+.+.+...++ .+++.+.+..........+.. T Consensus 4 ~~~l~idv~~~l~~i~~~l~~~~~~~-~~A~~rAlNrta~~~rt~~~r~v~~~~~i~~k~ir~r~~~~~a~~~~~~~i~~ 82 (177) T protein:vir:96 4 GFEMKIDVSREAEDIAAMVAATTKQL-ELAAQRAMTKAGQWLRTHSVRELGQQLGIKQEPLKKRFRVYPQRQKGEVRFWV 82 (177) T ss_pred CceeEEehhHHHHHHHHHHhhcHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHhhheeeccCCCcEEEEEE Confidence 4667888877 555555566666665 6788899999888888877776654333 233333222111111111100 Q ss_pred c---eeeecc-cccccccccccccCCCCCcceeh--------hcccCCcCC-------CCCcchhHHHHHHHHHHHHHHH Q lcl|NC_019767. 76 G---VHIRGV-NPRTGNSDNTMKANNPRNAFYWR--------FVELGTANM-------PAHPFVRPAYDTREEEAASVAI 136 (149) Q Consensus 76 ~---~~~~~~-~~~~~~~~~~~~~~~~~~~~y~~--------f~E~GT~~~-------~a~PFl~pA~~~~~~~~~~~~~ 136 (149) + +.+... ....+..-..+....-..+|... |.--|.... |--|=+..+++...+++.+.|. T Consensus 83 ~~~~i~l~~~~~~r~t~~Gv~~g~~~~~gaFia~~~~g~~~Vf~R~gk~R~PI~~~~~pi~~~~~~~~e~~~~~~~~~~~ 162 (177) T protein:vir:96 83 GLDPIGVYRLGTPKVTQKGVKVNRNEYDGAFISPMKSNYPLVFKRRGKERLPIDLVDEDIDEPAMEVVERWERRVFQRFK 162 (177) T ss_pred eccceehhhcccCCCCccceEEeeEEcCCceeccCCCCCceEEEEecCCccceEEEEcCchHHHHHHHHHHHHHHHHHHH Confidence 0 000000 00000000000000000112211 222232222 3233356677777788888888 Q ss_pred HHHHHHHHHHhcC Q lcl|NC_019767. 
137 ARMNQAIDEVLSK 149 (149) Q Consensus 137 ~~l~~~i~k~~~k 149 (149) ..|.++|+.+++- T Consensus 163 ~~l~~Ei~~~L~g 175 (177) T protein:vir:96 163 ELFEQEARAIING 175 (177) T ss_pred HHHHHHHHHHhcc Confidence 8999999999988 No 159 >protein:vir:4790 Length: 114 # NCBI annotation: putative minor capsid protein 3 # Family: family:all:899 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150170;swissprot:trembl:q94m41;genbank:gi:15088781;uniprot:Q94M41;genbank:GeneID:955992 Probab=95.40 E-value=0.00024 Score=40.44 Aligned_cols=104 Identities=16% Similarity=0.172 Sum_probs=56.9 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeec Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~ 81 (149) |+++|+|. ++.+.+.|.. +.+.++...-++.+...+..-+|.++|.|++|..+... .|.+ T Consensus 1 M~~kVkv~-l~~~~~~l~~-------~~l~r~Q~~~~~ev~~~~~~YVP~~~G~L~~S~~~~~~----~~~I-------- 60 (114) T protein:vir:47 1 MNIAIKVD-LQKAKQKLSN-------ESMTRGKIAVASKILLDNEQYIPLRGGELRASGRIVGQ----GDAV-------- 60 (114) T ss_pred CceeEEee-hhHHHHHHHH-------HHHHHHHHHHHHHHHHhhccCCcCccCccccceeeeeC----CcEE-------- Confidence 88888777 5555554421 12223344445667777899999999999987643210 0111 Q ss_pred ccccccccccccccCCCCCcceehhcccCCc----------CCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 82 VNPRTGNSDNTMKANNPRNAFYWRFVELGTA----------NMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~----------~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) ..+++|++++=||.. 
.+.-..|+..|.....+++++.+.+ +++= T Consensus 61 ----------------~y~tPYAr~qyYg~~~~~~~~~~~~p~~g~~W~eraka~~~~~~~~~~~k--------~~g~ 114 (114) T protein:vir:47 61 ----------------VYGTVYARAQFYGSNGIVTFRRYTTPGTGKRWDQVATSKHAEEWARAFVK--------GMGL 114 (114) T ss_pred ----------------EecCchhhHhhhcccCCCCCCccCCCCCcchhHHHHHhhhhHHHHHHHHH--------hhCC Confidence 122456666555421 2233457776666655555444443 3333 No 160 >protein:vir:6375 Length: 205 # NCBI annotation: hypothetical protein # Family: family:all:10491 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918988;genbank:gi:34610163;genbank:gi:91214209;genbank:GeneID:2559587 Probab=95.34 E-value=0.00073 Score=37.79 Aligned_cols=146 Identities=14% Similarity=0.138 Sum_probs=57.7 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH-H----HHHhhCCcCCCcccccceecccc----ccc--- Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLK-E----EVIARAPVRTGKLKKNVVVVTQK----SRR--- 69 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~-~----~ak~~aP~~~g~l~~~i~~~~~~----~~~--- 69 (149) |.+++.++|++++.+.|+.|++... ++...|+...|..-. . ++....-...+.+..+...+..+ ... T Consensus 1 m~i~v~~~G~~~~~~~l~~l~~~~~-~a~~~AIN~ta~~~~~~~A~~~i~~~vn~k~~yv~~~~Rlti~k~As~~~L~A~ 79 (205) T protein:vir:63 1 MSIEIVAEGLGEFRDYVDRLPDISQ-QAAMIAINQTAQRTALPLARTEIGEQVNFPDNYLKDDSRLGVTKKATRNDLEAV 79 (205) T ss_pred CeeeeehhhHHHHHHHHHhcchhhh-HHHHHHHHHHHHHhhHHHHHHhhhhccccchhhhccceeeEEEeecCCCCeeEE Confidence 9999999999999999999998865 555666666554443 2 23333333344444332211111 110 Q ss_pred ---------------CCcccc-----ceeeeccccccccccccc---ccCCC----CCcceehhccc----------CCc Q lcl|NC_019767. 70 ---------------RGEISS-----GVHIRGVNPRTGNSDNTM---KANNP----RNAFYWRFVEL----------GTA 112 (149) Q Consensus 70 ---------------~g~~~~-----~~~~~~~~~~~~~~~~~~---~~~~~----~~~~y~~f~E~----------GT~ 112 (149) ++.... ++.+....+....-.... ...+. .+...+-|.-- |-. 
T Consensus 80 I~ar~rpt~LsRF~~p~~~~~~~r~~GVsV~Vk~G~ak~l~gaF~~~lk~g~~l~e~~~~vgva~R~~~g~~~~~~~g~~ 159 (205) T protein:vir:63 80 IGARQRPTSLARFAEPGQTTKSTRKGGVSVVVKPGRTKQFKRGFLVRLRAGKTLTEDKYNLGLAVRLSPGETLHATDGAT 159 (205) T ss_pred EecCCCcceeeeccCCCccccccccCCeEEEEEcCCCeeccCceEEEeeccccccccccceEEEeeecCccccccccCce Confidence 111111 111111110000000000 00000 00001111111 111 Q ss_pred CCCCCc---chhHHHHHHHHHHHH----HHHHHHHHHHHHHhcC Q lcl|NC_019767. 113 NMPAHP---FVRPAYDTREEEAAS----VAIARMNQAIDEVLSK 149 (149) Q Consensus 113 ~~~a~P---Fl~pA~~~~~~~~~~----~~~~~l~~~i~k~~~k 149 (149) +. +.+ |..|++++.-..+.+ .|.+.+.+++++--.+ T Consensus 160 k~-~~~~k~LYGPSV~Qvf~~~~e~I~~~i~~~l~~~f~r~~~~ 202 (205) T protein:vir:63 160 KL-SNNVYLLYGPSVDQVFRTVADDITTEVLDALADEFLRQFTR 202 (205) T ss_pred ec-CCceEEEEcCcHHHHHhhhhhhhhHHHHHHHHHHHHHhhhh Confidence 11 112 566666554443333 3333333333332222 No 161 >protein:vir:105330 Length: 137 # NCBI annotation: hypothetical protein # Family: family:all:180 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950673;genbank:gi:119967843;genbank:GeneID:4643209 Probab=94.97 E-value=0.00016 Score=41.43 Aligned_cols=111 Identities=9% Similarity=-0.048 Sum_probs=29.3 Q ss_pred HHHHHHHHHHhHHHHHH--HHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeecccccccc Q lcl|NC_019767. 11 LNDIAKDLEALSRAENN--KVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGN 88 (149) Q Consensus 11 l~~l~~~l~~l~~~~~~--k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 88 (149) +-.+...|+.|.+.+.. +.++.++..+.+....++...| +... ...+|.+..+|.+.... 
T Consensus 1 Ma~~~~G~~~l~~~l~~~~~~~~~~~~~al~~~a~~i~~~a-------k~~a------Pv~TG~Lr~SI~~~~~~----- 62 (137) T protein:vir:10 1 MAKVKYGNWDLVKELEEFEKETIRWAKKGIAKTTTIIHNSI-------VSNM------PVDTGYLRESVSMDFKK----- 62 (137) T ss_pred CccchhCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHhC------CcCcchhhcCeeeEecC----- Confidence 33333445555444421 1122223332222222222222 1111 11355555555322100 Q ss_pred cccccccCCCCCcceehhcccCCcC------CCCCcchhHHHHH------------------HHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 89 SDNTMKANNPRNAFYWRFVELGTAN------MPAHPFVRPAYDT------------------REEEAASVAIARMNQAID 144 (149) Q Consensus 89 ~~~~~~~~~~~~~~y~~f~E~GT~~------~~a~PFl~pA~~~------------------~~~~~~~~~~~~l~~~i~ 144 (149) .......+-=.+|+-.- +..+|..+++..- .++-...++ ++-+..|. T Consensus 63 --------~~~~~~V~~~~~YA~~vE~GT~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~g~~a~Pfl~pA~-~~~~~~i~ 133 (137) T protein:vir:10 63 --------GGLTGVINIGSEYAVYVNYGTGIYAVGPGGSRAKNIPWRYKDADGHWHTTKGQHAQPFWEPAI-DEGRAFFN 133 (137) T ss_pred --------CcEEEEEecCCccccccccCccccccCCCcccccccceeeeccccccccCCCCCCCcchhHHH-HHHHHHHH Confidence 00001111112222111 1223333332210 001011111 11222222 Q ss_pred HHhc Q lcl|NC_019767. 145 EVLS 148 (149) Q Consensus 145 k~~~ 148 (149) +.+. T Consensus 134 k~i~ 137 (137) T protein:vir:10 134 KYFS 137 (137) T ss_pred HhhC Confidence 2222 No 162 >protein:vir:9823 Length: 118 # NCBI annotation: putative minor capsid protein # Family: family:all:899 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795585;genbank:gi:28876336;genbank:GeneID:1257873 Probab=94.90 E-value=0.00026 Score=40.26 Aligned_cols=103 Identities=11% Similarity=0.118 Sum_probs=51.7 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) ||.++|++.+++..+. .+.+.++...-++.+..++..-+|.++|.|++|..+... .+ . 
T Consensus 1 m~kV~vdl~~~~~~ls----------~~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i~~~------~I------~ 58 (118) T protein:vir:98 1 MAKVVVELGGIKRKVS----------PQALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRANSV------GV------T 58 (118) T ss_pred CceeeechhHHhhhhh----------HHHHHHHHHHHHHHHHHHhhcCCCCccCccccceeecCC------ee------E Confidence 9999999988765541 111223344445677778899999999999998543211 11 0 Q ss_pred cccccccccccccccCCCCCcceehhcccCC------------cCCC--CCcchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGT------------ANMP--AHPFVRPAYDTREEEAASVAIARMNQAIDEV 146 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT------------~~~~--a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~ 146 (149) .+++|++.+=||. ...| ...|..++.-..+ ......+...|. T Consensus 59 ------------------Y~tPYAr~qYY~~~~~~~~g~~~~~~~~p~~g~~Wd~R~ka~~~------~~~~w~~~~~k~ 114 (118) T protein:vir:98 59 ------------------WSGPHARAQFYGGAYNKYKSFKFKKYTTPGTGKRWDKRALANAT------IVKDWEKSLLRG 114 (118) T ss_pred ------------------ECCchhhHhhhccccCCCCccccccccCCCCCCcccchhhcchh------hhHHHHHHHHHh Confidence 1234444444432 2222 3345444443221 111122222222 Q ss_pred hcC Q lcl|NC_019767. 147 LSK 149 (149) Q Consensus 147 ~~k 149 (149) ++= T Consensus 115 ~g~ 117 (118) T protein:vir:98 115 MGF 117 (118) T ss_pred cCC Confidence 222 No 163 >protein:vir:3036 Length: 118 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438149;genbank:gi:16271812;genbank:GeneID:929237 Probab=94.90 E-value=0.00026 Score=40.26 Aligned_cols=103 Identities=11% Similarity=0.118 Sum_probs=51.7 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) ||.++|++.+++..+. .+.+.++...-++.+..++..-+|.++|.|++|..+... .+ . 
T Consensus 1 m~kV~vdl~~~~~~ls----------~~~~~k~Q~~~~~ev~~~~~~YVP~~tG~Lk~S~~i~~~------~I------~ 58 (118) T protein:vir:30 1 MAKVVVELGGIKRKVS----------PQALAKGKLIMNNQVMMSMNPYVPYRDGALRGSSRANSV------GV------T 58 (118) T ss_pred CceeeechhHHhhhhh----------HHHHHHHHHHHHHHHHHHhhcCCCCccCccccceeecCC------ee------E Confidence 9999999988765541 111223344445677778899999999999998543211 11 0 Q ss_pred cccccccccccccccCCCCCcceehhcccCC------------cCCC--CCcchhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGT------------ANMP--AHPFVRPAYDTREEEAASVAIARMNQAIDEV 146 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT------------~~~~--a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~ 146 (149) .+++|++.+=||. ...| ...|..++.-..+ ......+...|. T Consensus 59 ------------------Y~tPYAr~qYY~~~~~~~~g~~~~~~~~p~~g~~Wd~R~ka~~~------~~~~w~~~~~k~ 114 (118) T protein:vir:30 59 ------------------WSGPHARAQFYGGAYNKYKSFKFKKYTTPGTGKRWDKRALANAT------IVKDWEKSLLRG 114 (118) T ss_pred ------------------ECCchhhHhhhccccCCCCccccccccCCCCCCcccchhhcchh------hhHHHHHHHHHh Confidence 1234444444432 2222 3345444443221 111122222222 Q ss_pred hcC Q lcl|NC_019767. 147 LSK 149 (149) Q Consensus 147 ~~k 149 (149) ++= T Consensus 115 ~g~ 117 (118) T protein:vir:30 115 MGF 117 (118) T ss_pred cCC Confidence 222 No 164 >protein:vir:7993 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817347;genbank:gi:29565775;genbank:GeneID:1259013 Probab=94.87 E-value=2.2e-05 Score=46.12 Aligned_cols=90 Identities=23% Similarity=0.341 Sum_probs=52.4 Q ss_pred Ccc----------ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccC Q lcl|NC_019767. 1 MIE----------TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRR 70 (149) Q Consensus 1 Mm~----------~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~ 70 (149) |.. +.+. +++|.+ |+ ++ ...+..=.+.+...++.+.|+++|.+++|..+....... 
T Consensus 1 ma~gpt~kNP~~KFGvs---~~d~~K----~~-EV-----n~GvNeFMdE~~~~~K~~SPV~~G~Y~~S~~V~ers~Nk- 66 (108) T protein:vir:79 1 MANGPTRKNPLAKFGVR---LDDFDK----LP-EV-----NQGVNEFMDEVVDAWKNNSPVGTGAYRDSVQVTERSTNK- 66 (108) T ss_pred CCCCcccccchhhhcCC---hhhhhh----ch-hh-----hhhHHHHHHHHHHHHhhcCCCCchhhHHHHHHHHhhhcc- Confidence 433 3333 333332 32 11 234444445677889999999999999988764322111 Q ss_pred CccccceeeecccccccccccccccCCCCCcceehhcccCCcCC----CCC----cchhHHHHH Q lcl|NC_019767. 71 GEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANM----PAH----PFVRPAYDT 126 (149) Q Consensus 71 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~----~a~----PFl~pA~~~ 126 (149) |. +......||+||+||||.+- |+| -|=..||+. T Consensus 67 ----------------GR------G~~G~~~~~AH~VEFGs~hndeyapaqktakqfggtay~d 108 (108) T protein:vir:79 67 ----------------GR------GKVGATDPQAHLVEFGSAHNDEYAPAQKTAKQFGGTAYGD 108 (108) T ss_pred ----------------Cc------cccCCcchhhhhhhhhccccccccchhhHHHhhcccccCC Confidence 00 11224579999999999874 433 466666665 No 165 >protein:vir:1581 Length: 116 # NCBI annotation: minor capsid protein # Family: family:all:899 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695163;swissprot:trembl:o03933;genbank:gi:23455806;uniprot:O03933;genbank:GeneID:955512 Probab=94.69 E-value=0.00051 Score=38.62 Aligned_cols=105 Identities=10% Similarity=-0.005 Sum_probs=55.2 Q ss_pred ccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeec Q lcl|NC_019767. 2 IETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 2 m~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~ 81 (149) |+++|+|. ++.+.++|. .+.+.+|-..-+..+..++..-+|.++|.+..+........ .|.+. 
T Consensus 1 M~ikVkv~-l~~~~~~~~-------~~~~~r~Q~~l~~qv~~~m~~YVP~~tg~~~ls~~~~~~~~--~~~I~------- 63 (116) T protein:vir:15 1 MAFRINVD-LDGFMDQTS-------LDNVKRGQYALVNQAMYDMEQFVPKDRPEEPLRQSVHATSD--GSEIT------- 63 (116) T ss_pred CCceEEee-hhHhhhhhh-------HHHHHHHHHHHHHHHHHhhhccCCcccCCcccccceeeecC--CceEE------- Confidence 88888776 444444432 22233344445566777889999999987554322111100 01111 Q ss_pred ccccccccccccccCCCCCcceehhcccCC-----------cCCCCCcchhHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 82 VNPRTGNSDNTMKANNPRNAFYWRFVELGT-----------ANMPAHPFVRPAYDTREEEAASVAIARMN 140 (149) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT-----------~~~~a~PFl~pA~~~~~~~~~~~~~~~l~ 140 (149) .+++|++.+=||- ....-..|+..|.....+..++.+.+.+. T Consensus 64 -----------------y~tPYAr~qyYg~~~~~~~~~~~t~p~ag~~W~eraK~~h~~~w~~~~~k~~~ 116 (116) T protein:vir:15 64 -----------------YSTPYAKAQFYGIINDKYPVHNYTTPGTTKRWDLKAKSMFMSSWIDTFTKGMK 116 (116) T ss_pred -----------------ecCchhHHHhcccccCCCCcccccCCCCCcchhHHHHhhhHHHHHHHHHHhcC Confidence 2234554443332 12234457777777776666665555555 No 166 >protein:vir:79687 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:899 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285886;genbank:gi:148750843;genbank:GeneID:5220386 Probab=93.13 E-value=0.00098 Score=37.07 Aligned_cols=103 Identities=18% Similarity=0.163 Sum_probs=53.0 Q ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeecccccccccc Q lcl|NC_019767. 11 LNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSD 90 (149) Q Consensus 11 l~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 90 (149) +.+| ..+.+.++.+.+.+|-..-++.|..++..-+|.++|.|++|..+.. 
+.+ T Consensus 1 ~~dL----~~~~~~~~~~~~~raQ~~l~~ev~~~~~pYVP~~~G~Lk~S~~i~s------~~I----------------- 53 (113) T protein:vir:79 1 MSDL----SVFSRMAQSTGSRSVRLQVLNQMHQDMEQYVPKRAGFLRSQSFVND------TGI----------------- 53 (113) T ss_pred CchH----HHHHHhhchhHHHHHHHHHHHHHHHhhcccCcccccchhccccccC------Cee----------------- Confidence 2233 3333333333445556666678888899999999999998753211 000 Q ss_pred cccccCCCCCcceehhcccCCcC----------CCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019767. 91 NTMKANNPRNAFYWRFVELGTAN----------MPAHPFVRPAYDTREEEAASVAIARMNQAIDEVL 147 (149) Q Consensus 91 ~~~~~~~~~~~~y~~f~E~GT~~----------~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~ 147 (149) ..+++|++++=||... ..-..|+..|.....++.++.+.+.+.+.-+=.- T Consensus 54 -------~y~tPYAr~qyYg~~~~~~~~~~t~p~ag~~W~eraKa~h~~~w~~~~~~a~~~G~~~~~ 113 (113) T protein:vir:79 54 -------HYTAKYARAQFYGFVNGHRVRNYSTPGTGRRWDLKAKAVYKADWQKVAVAAFLKEAKGEY 113 (113) T ss_pred -------EecChhhhHhhccccCCCCccccCCCCCCchhhHHHHHHhHHHHHHHHHHHhhccccccC Confidence 1224466666555332 2234566666655555544444443332211111 No 167 >protein:vir:8106 Length: 150 # NCBI annotation: gp10 # Family: family:all:3937 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817687;genbank:gi:29566118;genbank:GeneID:1259312 Probab=92.93 E-value=0.00019 Score=41.04 Aligned_cols=117 Identities=19% Similarity=0.239 Sum_probs=55.1 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) ...+-+. +++|.+-++.-+ ++. .-++.=+.. 
+...-|+++.|+++|.++.|..+......-+ T Consensus 5 ~~KFGvS---~~e~~K~irns~-EV~-~GiNdFMe~---~A~~~aK~~SPV~~GeY~~S~~V~~ka~NGR---------- 66 (150) T protein:vir:81 5 FEKFGVS---DSELAKHIRNSA-EVD-AGINDFMEN---EAIPYAKSISPVDDGEYAASWAVMKKAKNGR---------- 66 (150) T ss_pred hhhhcCC---HHHHHHhhccch-hhh-hhHHHHHHh---hhhhhhhccCCcccchhHHHHHHHhhcccCc---------- Confidence 2334444 344444433322 222 222322322 2233468999999999998876543211100 Q ss_pred cccccccccccccccCCCCCcceehhcccCCcCCC---------------------CCcchh--HHHHHHHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMP---------------------AHPFVR--PAYDTREEEAASVAIA 137 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~---------------------a~PFl~--pA~~~~~~~~~~~~~~ 137 (149) +......||+||+||||---. ---|-+ |----..+-+.+.+.. T Consensus 67 --------------G~~G~~~~~AH~VEFGtgadkkqgrgkkgkrgkdgkrtveiddgefrrvgpdtptkaqgiaqkvas 132 (150) T protein:vir:81 67 --------------GVFGPKAWYAHFVEFGTGADKKQGRGKKGKRGKDGKRTVEIDDGEFRRVGPDTPTKAQGIAQKVAS 132 (150) T ss_pred --------------cccCccchhhhhhhhccccccccccccccccCcccceeeeecCccceecCCCCchhhhhHHHHHHH Confidence 112355799999999985321 011211 1111222334444444 Q ss_pred HHHHHHHHHhcC Q lcl|NC_019767. 138 RMNQAIDEVLSK 149 (149) Q Consensus 138 ~l~~~i~k~~~k 149 (149) .+.-.|+--+.| T Consensus 133 hfggslkggisk 144 (150) T protein:vir:81 133 HFGGSLKGGISK 144 (150) T ss_pred hccccccccccc Confidence 444444444444 No 168 >protein:vir:1164 Length: 156 # NCBI annotation: predicted tail completion # Family: family:all:370 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490613;genbank:gi:17313233;genbank:GeneID:927308 Probab=92.44 E-value=0.0011 Score=36.84 Aligned_cols=135 Identities=13% Similarity=0.082 Sum_probs=54.4 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhh-----CCcCCCcccccce---eccccc--ccC Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIAR-----APVRTGKLKKNVV---VVTQKS--RRR 70 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~-----aP~~~g~l~~~i~---~~~~~~--~~~ 70 (149) |-+ +++ +|...|..|-..+....-+..++.-|+.+....+.+ .|. |....... ...... ... T Consensus 1 m~~---~~~---~l~~~L~~ll~~L~~~~~~~l~r~Ig~~l~~~t~~Rf~~q~~Pd--G~~W~p~~~~~~~~~~~~~~~~ 72 (156) T protein:vir:11 1 MAD---SLE---ALEDWAGPILRALEPGPRAALARSLARDLRRSQQKRVMAQRNPD--GSAYEPRKKRELRGKQGRIRRK 72 (156) T ss_pred Cch---hHH---HHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHhhcCCC--CCCCcccchHHHhhhccccccc Confidence 333 233 333333333222211111223444445555554443 453 22221111 000000 000 Q ss_pred Cccccce-eeecccccccccccccccCCCCCcceehhcccCCc----------CCCCCcchhHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 71 GEISSGV-HIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTA----------NMPAHPFVRPAYDTREEEAASVAIARM 139 (149) Q Consensus 71 g~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~----------~~~a~PFl~pA~~~~~~~~~~~~~~~l 139 (149) ..+-... .........+.... .++..+++..|++.+-||.. .+||+|||.=.- +.++++++.+.+.| T Consensus 73 ~~m~~~l~~~~~l~~~~~~~~a-~vg~~Gs~~~yA~iHQfG~~~~~~~~~~~v~iPaRp~LG~s~-~d~~~i~~~i~~~l 150 (156) T protein:vir:11 73 IKMFQKLRTVRYLRAKGDAQAI-TVSFAGRIARIARVHQYGLRDRAEPGAPEVSYAQRLLLGFDS-SDMETIQNGILAHI 150 (156) T ss_pred hhhhhhhhhhheeeeeecCcEE-EEEecCCchhhhhhhcccccccccCCCCcccccccccCCCCH-HHHHHHHHHHHHHH Confidence 0000000 00001111111112 22223577789999999965 689999997764 35555555555555 Q ss_pred HHHHHH Q lcl|NC_019767. 
140 NQAIDE 145 (149) Q Consensus 140 ~~~i~k 145 (149) ....-- T Consensus 151 ~~~~~~ 156 (156) T protein:vir:11 151 DANSPI 156 (156) T ss_pred hhcCCC Confidence 443222 No 169 >protein:vir:79555 Length: 192 # NCBI annotation: putative tail component # Family: family:all:869 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272521;genbank:gi:148609390;genbank:GeneID:5204391 Probab=90.93 E-value=0.015 Score=30.63 Aligned_cols=142 Identities=16% Similarity=0.211 Sum_probs=50.4 Q ss_pred hhhHHHHHHHHHHhHHHHHHHHHHHH--------HHHHHHHHHHHHHhhCCcCCCcccccceeccc--ccccCCccccc- Q lcl|NC_019767. 8 FSGLNDIAKDLEALSRAENNKVLRDA--------TRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQ--KSRRRGEISSG- 76 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~A--------l~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~--~~~~~g~~~~~- 76 (149) |+||+++++.|+.|......++...| +..++.-|..+.....+...+-..+-|..... +....+..... T Consensus 1 ~kgl~~a~~nl~~l~~~~vp~A~~~ainrva~ra~~~t~~~v~~~~~~~~~~~~~I~~k~iR~R~r~~ka~~~~~~~~~I 80 (192) T protein:vir:79 1 MKGLENAIRNLNSLDTRMVPQASAWAINRVAQKAVSVATRQVAGNTVAGDNQVKGIPLKLVRQRVRVFKASPSGKMTARI 80 (192) T ss_pred CchHHHHHHHHHhcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcCcHHHHHhhhhcccccCCCceEEEE Confidence 99999999999998877544444444 44455666555422211111100000000000 00000000000 Q ss_pred ------eeeeccc----------ccc-cccccccccCCC-CCcc-------eeh-hcc-cCCcCCC----CCcchhHHHH Q lcl|NC_019767. 77 ------VHIRGVN----------PRT-GNSDNTMKANNP-RNAF-------YWR-FVE-LGTANMP----AHPFVRPAYD 125 (149) Q Consensus 77 ------~~~~~~~----------~~~-~~~~~~~~~~~~-~~~~-------y~~-f~E-~GT~~~~----a~PFl~pA~~ 125 (149) +.+.... ... ....+..++... 
..+| +|| |.- -|...-| -=|.-.|.-+ T Consensus 81 ~v~~~~l~ai~lg~~r~r~~rr~~~~~~~~s~~~vGk~~f~gaFia~m~ngr~~V~~R~~gk~R~PIevvkIpis~~l~~ 160 (192) T protein:vir:79 81 RVNRGNLPAIKLGTARVRLARRGGKLQYRGSVLKVGKYLFRDAFIQQLANGRWHVMRRIDGKNRYPIDVVKIPLSGPLTQ 160 (192) T ss_pred EEecCceeeeeecccccccccccccccccccceEEcceecCchhccccCCCCccceEecCCCccCCeeeEeechHHHHHH Confidence 0000000 000 000000000000 0011 122 222 2433333 1133444433 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 126 TREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 126 ~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) .-..++.+.+.+++.++|..+++. T Consensus 161 af~~e~~r~~~~~~~~el~~~L~~ 184 (192) T protein:vir:79 161 AFEDARDRIIAAEMPKQLGYALKQ 184 (192) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Confidence 334445444555555555544444 No 170 >protein:vir:102190 Length: 93 # NCBI annotation: gp21 # Family: family:all:2713 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655217;genbank:gi:109522797;genbank:GeneID:4157429 Probab=85.26 E-value=0.011 Score=31.25 Aligned_cols=91 Identities=12% Similarity=0.134 Sum_probs=54.0 Q ss_pred HHHHHHHHHHHHHHHHHhhCCc--CCCcccccceecccccccCCccccceeeecccccccccccccccCCCCCcceehhc Q lcl|NC_019767. 30 LRDATRAGAEVLKEEVIARAPV--RTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFV 107 (149) Q Consensus 30 ~~~Al~~~a~~v~~~ak~~aP~--~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~ 107 (149) +..-...+|..+..+||.+||= .||+.+..|...... .|.-...|++ .....|+-|+ T Consensus 1 ~~~~~d~aa~~le~~aK~nApW~DRTg~AR~~l~~~~~~---~g~~~~~i~l------------------sh~v~Yg~~L 59 (93) T protein:vir:10 1 MKATSNYHAVEGTAHMKEHAPWTDRTGAARAGLHAVAST---PQPDRYEIVF------------------AHTVHYGIWL 59 (93) T ss_pred CchhhhHHHHHHHHHHhcCCCccccchhhhhhhcccccc---cCCceEEEEE------------------ecCeeccceE Confidence 3445556788999999999993 467666655322111 0101111111 1234688899 Q ss_pred ccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 
108 ELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAID 144 (149) Q Consensus 108 E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~ 144 (149) |.++...++ -|+|+++.-.+++++-++..+.+- + T Consensus 60 E~a~~~kya--Il~Ptv~~~~~~i~~g~~~ll~~l-~ 93 (93) T protein:vir:10 60 EIANSGRYE--IIMPTVHHEGKLMAQRLRGLLGRL-R 93 (93) T ss_pred EeecCCCcc--chhhhHHHHHHHHHHHHHHHHHhc-C Confidence 998877664 688888877776666665544332 2 No 171 >protein:vir:105825 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655770;genbank:gi:109522093;genbank:GeneID:4157633 Probab=68.45 E-value=0.02 Score=29.86 Aligned_cols=90 Identities=19% Similarity=0.317 Sum_probs=49.6 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) +..+.+.+.- |.+|+.- ...+.+=...+...=+.+.|+++|.+++|+.+....... | T Consensus 11 lakfgi~ldd-------fdklpev------nqgvnef~dev~aawk~nspv~~g~yrdsvqvterstnk-g--------- 67 (108) T protein:vir:10 11 LAKFGVRLDD-------FDKLPEV------NQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVTERSTNK-G--------- 67 (108) T ss_pred hhhhccchhh-------hhccchh------hhhHHHHHHHHHHhhhcCCCccccccccceeeccccccc-c--------- Confidence 3344444332 2344322 223344444566667899999999999998775432211 0 Q ss_pred cccccccccccccccCCCCCcceehhcccCCcCC----CCC----cchhHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANM----PAH----PFVRPAYDT 126 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~----~a~----PFl~pA~~~ 126 (149) . +.+ ...-.-+|++|||..+- |+| -|=..||+. 
T Consensus 68 -------r--gkv----gatdpqahlvefgs~hndeyapaqktakqfggtay~d 108 (108) T protein:vir:10 68 -------R--GKV----GATDPQAHLVEFGSAHNDEYAPAQKTAKQFGGTAYGD 108 (108) T ss_pred -------c--ccc----cCcchhhhhhhhhccccccccchhhhHHhhcccccCC Confidence 0 001 11234689999998763 443 466666665 No 172 >protein:vir:102608 Length: 108 # NCBI annotation: gp9 # Family: family:all:3937 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655005;genbank:gi:109392195;genbank:GeneID:4157230 Probab=68.45 E-value=0.02 Score=29.86 Aligned_cols=90 Identities=19% Similarity=0.317 Sum_probs=49.6 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) +..+.+.+.- |.+|+.- ...+.+=...+...=+.+.|+++|.+++|+.+....... | T Consensus 11 lakfgi~ldd-------fdklpev------nqgvnef~dev~aawk~nspv~~g~yrdsvqvterstnk-g--------- 67 (108) T protein:vir:10 11 LAKFGVRLDD-------FDKLPEV------NQGVNEFIDEVVAAWKNNSPVGTGAYRDSVQVTERSTNK-G--------- 67 (108) T ss_pred hhhhccchhh-------hhccchh------hhhHHHHHHHHHHhhhcCCCccccccccceeeccccccc-c--------- Confidence 3344444332 2344322 223344444566667899999999999998775432211 0 Q ss_pred cccccccccccccccCCCCCcceehhcccCCcCC----CCC----cchhHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWRFVELGTANM----PAH----PFVRPAYDT 126 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~----~a~----PFl~pA~~~ 126 (149) . +.+ ...-.-+|++|||..+- |+| -|=..||+. 
T Consensus 68 -------r--gkv----gatdpqahlvefgs~hndeyapaqktakqfggtay~d 108 (108) T protein:vir:10 68 -------R--GKV----GATDPQAHLVEFGSAHNDEYAPAQKTAKQFGGTAYGD 108 (108) T ss_pred -------c--ccc----cCcchhhhhhhhhccccccccchhhhHHhhcccccCC Confidence 0 001 11234689999998763 443 466666665 No 173 >protein:vir:4200 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071825;genbank:gi:11863108;genbank:GeneID:1257610 Probab=66.39 E-value=0.13 Score=25.47 Aligned_cols=127 Identities=16% Similarity=0.226 Sum_probs=62.3 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |. .+.|+--|.|+.+-....+.+. ..+..-...++.-+...||+.+|.|+.|-.++ ..+.+|.++.-+++. T Consensus 1 mi--~i~idkp~almek~~ev~~~ie-----~t~~~~~~~l~~i~~ntapiktg~lr~sh~~s--iegstgelsn~~~yl 71 (133) T protein:vir:42 1 MI--EIRIDKPDALMEKPHEVQGKIE-----ETLEKILNQLQGIAENTAPVKTGNLRDSHIIS--IEGSTGELSNLAYYL 71 (133) T ss_pred Ce--eeecCCchhhhcchhhhhhHHH-----HHHHHHHHHHHHHhhhccccccccceeeeeEE--eecCccchhhhhHHh Confidence 54 4455566777766555544432 23444455677778888999999999875443 334566665443321 Q ss_pred cccccccccccccccCCCCCcceeh----hcccCCcCCCCCcchhHHH--HHHHHHHHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWR----FVELGTANMPAHPFVRPAY--DTREEEAASVAIARMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~----f~E~GT~~~~a~PFl~pA~--~~~~~~~~~~~~~~l~~ 141 (149) .---. |.+ ++..-...+.||- -+-| .+.-||+.||.-+. -..+.-+.+.+.+-++. 
T Consensus 72 ~~vl~-grg---wvfpv~~kal~wpelphpvay-arpappndyfsa~vay~~~~give~s~iewlre 133 (133) T protein:vir:42 72 PFVLH-GRG---WVFPVRRKALWWPELPHPVAY-ARPAPPNDYFSAVVAYSAPEGVVEETLIEWLRE 133 (133) T ss_pred hHhhh-ccc---ceeeccccccccCCCCCcccc-cCCCCCchhhhhhhhhhcccchhHHHHHHHHhC Confidence 10000 000 1111112233432 2222 13345666776544 33344444444444443 No 174 >protein:vir:4460 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700383;genbank:gi:23505455;genbank:GeneID:955662 Probab=59.47 E-value=0.17 Score=24.74 Aligned_cols=131 Identities=21% Similarity=0.332 Sum_probs=75.3 Q ss_pred Ccc---ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC-----------Cc-CCCcccccceeccc Q lcl|NC_019767. 1 MIE---TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA-----------PV-RTGKLKKNVVVVTQ 65 (149) Q Consensus 1 Mm~---~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~a-----------P~-~~g~l~~~i~~~~~ 65 (149) |.+ +-|++.-.+++. +...-++.|..+.+++...+|+.++ |. .||.|..||..... T Consensus 1 M~~~~~lHvdF~qp~~~~---------Fnr~r~RraF~~iGq~h~r~Arrlvm~RGrs~pGe~P~~~TGrLa~SIgy~Vp 71 (170) T protein:vir:44 1 MPQKAYLHVDFVQPEELV---------FNRARMRRAFVKIGQVHMRDARRLVMKRGRSKPGENPSYRTGQLARSIGYYVP 71 (170) T ss_pred CCCCceeEEeeecCCcee---------ecHHHHHHHHHHHhHHHHHHHHHHHHHhcCCCCCCCCcchhhhhhhhhhhccc Confidence 666 344443333332 3344568888888888888888654 21 35666666654322 Q ss_pred cc--ccCCccccceeeecccccccccccccccCCCCCcceehhcccCCcC-------------------C-CCCcchhHH Q lcl|NC_019767. 66 KS--RRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTAN-------------------M-PAHPFVRPA 123 (149) Q Consensus 66 ~~--~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~-------------------~-~a~PFl~pA 123 (149) +. .++|-+ . .+.. +.+.|.....+ ...||=.|+-||... . 
|-.-||.-+ T Consensus 72 ras~~rpG~m-V--kIaP-Nqk~G~g~r~i-----~g~fYPafL~YGVr~gakr~k~hhr~a~ggsgwriaPR~Nym~~~ 142 (170) T protein:vir:44 72 RASKKRPGLM-V--KIAP-NQKNGEGNRHI-----NGAFYPAFLFYGVRRGAKRKKGHHRGASGGSGWRVEPRNNYMTEV 142 (170) T ss_pred cccCCCCcee-E--EecC-CCCCCCCcccc-----ccccchhhhhhhhhcccccchhhcccccCCCcceeccchhHHHHH Confidence 21 112221 1 1111 11111111111 235898999998432 1 334699999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 124 YDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 124 ~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) ++..++.....+..+|++.|+-.-.| T Consensus 143 l~~~~~wt~~~L~r~L~~sLrp~~r~ 168 (170) T protein:vir:44 143 LDKRRSWTRYVLSRELRKSLRPQRRK 168 (170) T ss_pred HHhhHHHHHHHHHHHHHHhcCccccc Confidence 99999999999988888888655444 No 175 >protein:vir:487 Length: 187 # NCBI annotation: hypothetical protein # Family: family:all:2152 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543094;swissprot:trembl:q8w625;genbank:gi:18249906;uniprot:Q8W625;genbank:GeneID:929690 Probab=57.08 E-value=0.17 Score=24.86 Aligned_cols=134 Identities=19% Similarity=0.283 Sum_probs=73.7 Q ss_pred Ccc---ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCC-----------c-CCCcccccceeccc Q lcl|NC_019767. 1 MIE---TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAP-----------V-RTGKLKKNVVVVTQ 65 (149) Q Consensus 1 Mm~---~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP-----------~-~~g~l~~~i~~~~~ 65 (149) |.+ +-|++.-.+ .+- +...-++.|..+.+++...+|+.++- . .||.|..||..... T Consensus 14 m~~~~~lHvdF~qp~-------~~~--Fnr~riRraF~~iGq~h~r~ArrLvm~RGrs~pge~P~~qTGrLa~SIgy~Vp 84 (187) T protein:vir:48 14 MNQTAFLHVDFKQPK-------ELE--FNRARLRRAFVQIGRVYMRDARRLVIKRGRSGPGENPGYQTGRLARSIGYYVP 84 (187) T ss_pred hhhccceeEeeecCC-------cee--ecHHHHHHHHHHHhHHHHHHHHHHHHhcccCCCCCCCcchhhhhhhhhhhccc Confidence 443 222322222 221 33455688888888888888887662 1 35666666653322 Q ss_pred -ccccCCccccceeeecccccccccccccccCCCCCcceehhcccCCcC---------------------C-CCCcchhH Q lcl|NC_019767. 
66 -KSRRRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTAN---------------------M-PAHPFVRP 122 (149) Q Consensus 66 -~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~---------------------~-~a~PFl~p 122 (149) .+..+.++-. .+.. +.+.|...... .-...||=.|+-||... . |-+-||.- T Consensus 85 kat~~RpG~mV--kIaP-Nqk~G~g~r~~---Pi~gdfYPafL~YGVr~ga~~~~~~~k~~~~~~~sgwriaPR~Nym~~ 158 (187) T protein:vir:48 85 KKTTRRPGLMV--KISP-NQKNGQGNRRF---PEGAPYYPAFLYYGVRHSAYGMDKKDKRQKKHHSSTFRLAPRNNFMAD 158 (187) T ss_pred cccCCCCcceE--EecC-CcccCcccccc---cccccchhHHHHhhhhhhhhccchhhhhhhcccCCcceeccchhHHHH Confidence 1111111111 1111 11111111110 12235898999998432 1 33469999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 123 AYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 123 A~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) +++..+......+..+|++.|+-.-.| T Consensus 159 ~L~~~~~wt~~~L~raL~~sLrp~~r~ 185 (187) T protein:vir:48 159 VIERRRHWTQELLSRELQRSLRPVKRK 185 (187) T ss_pred HHHhhHHHHHHHHHHHHHHhcCccccc Confidence 999999999999999888888755544 No 176 >protein:vir:4162 Length: 133 # NCBI annotation: unknown # Family: family:all:11764 # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046971;genbank:gi:9630541;genbank:GeneID:1261715 Probab=51.29 E-value=0.37 Score=22.94 Aligned_cols=127 Identities=16% Similarity=0.197 Sum_probs=61.1 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeee Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIR 80 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~ 80 (149) |. .+.|+--|.|+.+-....+.+. ..+..-...++.-+...||+.+|.|+.|-.++ ..+.+|.++.-+++. 
T Consensus 1 mi--~i~idkp~almek~~ev~~~ie-----~t~~~~~~~l~~i~~ntapiktg~lr~sh~~s--iegstgelsn~~~yl 71 (133) T protein:vir:41 1 MI--RINIDKPEALMEKASEVEDRVE-----QTVTLLMIELEEILMNTAPIKTGELRISHTWS--VEGSTGELTNTVPYL 71 (133) T ss_pred Ce--eeecCCchhhhcchhhhhhHHH-----HHHHHHHHHHHHHhhhccccccccceeeeeEE--eecCccchhhhhHHh Confidence 54 4455566777766555544432 23444455677778888999999999875443 334566665443321 Q ss_pred cccccccccccccccCCCCCcceeh----hcccCCcCCCCCcchhHHH--HHHHHHHHHHHHHHHHH Q lcl|NC_019767. 81 GVNPRTGNSDNTMKANNPRNAFYWR----FVELGTANMPAHPFVRPAY--DTREEEAASVAIARMNQ 141 (149) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~y~~----f~E~GT~~~~a~PFl~pA~--~~~~~~~~~~~~~~l~~ 141 (149) .---. |.+ ++..-...+.||- -+-| .+.-||+.||.-+. -..+.-+.+.+.+-+-. T Consensus 72 ~~vl~-grg---wvfpv~~kal~wpelphpvay-arpappndyfsa~vay~~~~give~s~iewlis 133 (133) T protein:vir:41 72 QWVLF-GRG---WVFPVEKKALYWPELPHPVAY-ARPAPPNDYFSAAVAYIDAKGIVEDSFIEWLIS 133 (133) T ss_pred hHhhh-ccc---ceeeecccccccCCCCCcccc-cCCCCCchhhhhhhhhhcccchhHHHHHHHhcC Confidence 10000 000 1111112233332 2222 13345666776544 33333344444333333 No 177 >protein:vir:5745 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892056;genbank:gi:33770519;interpro:IPR010064;interpro:IPR011693;uniprot:Q7Y404;genbank:GeneID:2637451 Probab=43.24 E-value=0.85 Score=20.96 Aligned_cols=119 Identities=13% Similarity=0.089 Sum_probs=52.2 Q ss_pred eehh-hHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceeeecccc Q lcl|NC_019767. 6 LDFS-GLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHIRGVNP 84 (149) Q Consensus 6 ~~i~-Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~~~~~~ 84 (149) +.++ -++-|..-++.|.. +..++.+++++.+...-.+.+...+ +.+..+.. 
...+|++.++|.+..++ T Consensus 1 M~~~~~i~Gl~el~~~l~~-L~~~~~~k~~~~Al~~~a~~v~~~~-------k~~ap~~~--~~~~g~l~~~I~i~~~k- 69 (135) T protein:vir:57 1 MIPEIEISGLQELERRLIA-VGEEVGTKILRDAGRAAMAVVEADM-------KQNAGYDN--SSTNAHMRDSIKIRSSR- 69 (135) T ss_pred CceeeeehhHHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHH-------HHhCCCCC--CCchhhHHhhccccccc- Confidence 5544 12233333333321 1223334444444444444444443 22221111 12235555555432210 Q ss_pred cccccccccccCCCCCcceehhcccCCcCC------------CCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 85 RTGNSDNTMKANNPRNAFYWRFVELGTANM------------PAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 85 ~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~------------~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) ..... ....+..|..+- ..+| =+|=+.-.-++..+.+.+.+.++|++.+.| T Consensus 70 -----------~~~~~--~~v~v~vg~~~~~~~~~~f~E~GT~~~~-a~PF~~pa~~~~~~~~~~~~~~~~~~~l~k 132 (135) T protein:vir:57 70 -----------GKAGS--TVVVLRVGPTRSHYMKALAQEFGTIKQV-AKPFIRPALDYNKMQVLRILTVEIRDGLST 132 (135) T ss_pred -----------ccccc--eeEEEEecCCCCcceeEeecccCCCCCC-CCcchhHhHHHhHHHHHHHHHHHHHHHHHH Confidence 00011 111222332211 1111 256677777888888888888888888888 No 178 >protein:vir:7859 Length: 126 # NCBI annotation: gp16 # Family: family:all:11115 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817466;genbank:gi:29565895;genbank:GeneID:1259088 Probab=40.35 E-value=0.19 Score=24.55 Aligned_cols=117 Identities=14% Similarity=0.265 Sum_probs=46.6 Q ss_pred hhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHh---hCCc--C-CCcccccceecccccccCCccccceeeec Q lcl|NC_019767. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIA---RAPV--R-TGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~---~aP~--~-~g~l~~~i~~~~~~~~~~g~~~~~~~~~~ 81 (149) ++||-.|+.+-+-...-....++-..|-+-|+.+++-=.. ..|. + +-.|+.. -...+|.....+.+.. 
T Consensus 1 mkglgnliskadiaaaiatspavhagliakakevqeywveywnsiphphsrthtlksg------yvenpgdyaksirvsf 74 (126) T protein:vir:78 1 MKGLGNLISKADIAAAIATSPAVHAGLIAKAKEVQEYWVEYWNSIPHPHSRTHTLKSG------YVENPGDYAKSIRVSF 74 (126) T ss_pred CcchhhhhhhhhhhhhhhcccchhhhhhhhhHHHHHHHHHHhhcCCCccccccccccc------cccCchhhhhhhheee Confidence 6677777765442221111223333444445555543332 1221 1 1111111 1112344444444444 Q ss_pred ccccccccccccccCCCCCcceehhcccCCcCCCC---CcchhHHHHHHHHHHHHH Q lcl|NC_019767. 82 VNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPA---HPFVRPAYDTREEEAASV 134 (149) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a---~PFl~pA~~~~~~~~~~~ 134 (149) .+.+.|-....+- ..-|..+|+|||..+||. +..--.-|+-.-.--+.+ T Consensus 75 iksksglpkarvm----atdykswwieygakhmpefaprahtlahfegggattvsa 126 (126) T protein:vir:78 75 IKSKSGLPKARVM----ATDYKSWWIEYGAKHMPEFAPRAHTLAHFEGGGATTVSA 126 (126) T ss_pred eecccCCccccee----hhhhhHHHHhhhhhhcccccccchhhhhccCCccccccC Confidence 4444333333222 223557899999999973 211111111111111111 No 179 >protein:vir:101654 Length: 126 # NCBI annotation: gp17 # Family: family:all:11115 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654772;genbank:gi:109302770;genbank:GeneID:4156088 Probab=40.35 E-value=0.19 Score=24.55 Aligned_cols=117 Identities=14% Similarity=0.265 Sum_probs=46.6 Q ss_pred hhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHh---hCCc--C-CCcccccceecccccccCCccccceeeec Q lcl|NC_019767. 8 FSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIA---RAPV--R-TGKLKKNVVVVTQKSRRRGEISSGVHIRG 81 (149) Q Consensus 8 i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~---~aP~--~-~g~l~~~i~~~~~~~~~~g~~~~~~~~~~ 81 (149) ++||-.|+.+-+-...-....++-..|-+-|+.+++-=.. ..|. + +-.|+.. -...+|.....+.+.. 
T Consensus 1 mkglgnliskadiaaaiatspavhagliakakevqeywveywnsiphphsrthtlksg------yvenpgdyaksirvsf 74 (126) T protein:vir:10 1 MKGLGNLISKADIAAAIATSPAVHAGLIAKAKEVQEYWVEYWNSIPHPHSRTHTLKSG------YVENPGDYAKSIRVSF 74 (126) T ss_pred CcchhhhhhhhhhhhhhhcccchhhhhhhhhHHHHHHHHHHhhcCCCccccccccccc------cccCchhhhhhhheee Confidence 6677777765442221111223333444445555543332 1221 1 1111111 1112344444444444 Q ss_pred ccccccccccccccCCCCCcceehhcccCCcCCCC---CcchhHHHHHHHHHHHHH Q lcl|NC_019767. 82 VNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPA---HPFVRPAYDTREEEAASV 134 (149) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a---~PFl~pA~~~~~~~~~~~ 134 (149) .+.+.|-....+- ..-|..+|+|||..+||. +..--.-|+-.-.--+.+ T Consensus 75 iksksglpkarvm----atdykswwieygakhmpefaprahtlahfegggattvsa 126 (126) T protein:vir:10 75 IKSKSGLPKARVM----ATDYKSWWIEYGAKHMPEFAPRAHTLAHFEGGGATTVSA 126 (126) T ss_pred eecccCCccccee----hhhhhHHHHhhhhhhcccccccchhhhhccCCccccccC Confidence 4444333333222 223557899999999973 211111111111111111 No 180 >protein:vir:78894 Length: 105 # NCBI annotation: gp10 # Family: family:all:29989 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468850;genbank:gi:157325424;genbank:GeneID:5601891 Probab=40.02 E-value=0.16 Score=24.97 Aligned_cols=101 Identities=16% Similarity=0.087 Sum_probs=46.9 Q ss_pred ccc-eeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHH---HHHHHHHhhCCcCCCcccccceecccccccCCccccce Q lcl|NC_019767. 2 IET-SLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAE---VLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGV 77 (149) Q Consensus 2 m~~-~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~---~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~ 77 (149) |++ +|+..=.+.|. +.+|..+++ .+.+.+...+|.++|.|++|-...... 
T Consensus 1 ~~f~~f~~~~~k~l~---------------kr~L~~~g~vq~EvlR~~~PyvP~~tG~Lk~S~~l~tvI----------- 54 (105) T protein:vir:78 1 MSFSSFKDAVIDDIH---------------NKALSTAAKAGGELVELAQPVTPILYGDLRRSSYFKIII----------- 54 (105) T ss_pred CCcccccchHHHHHH---------------HhcCCCCchhhHHHHHHhCCCCcccccccccccccceee----------- Confidence 332 33322222222 222222221 344556677899999999874321110 Q ss_pred eeecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019767. 78 HIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQ 141 (149) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~ 141 (149) | ...+......-++|++.+=|... ...-|++.+...+++.+.+.+..-++- T Consensus 55 ---------g--sg~I~y~~~~~aPYAr~qYYe~~--Rg~~WfErm~a~hk~~I~~~vegg~~~ 105 (105) T protein:vir:78 55 ---------Q--KNSIVARVFSLTPYARRQYYENR--RNPRWYEMAVSYGIQSINQIVEGGMRL 105 (105) T ss_pred ---------c--CCeeEeeccccCchhhhhhhccc--CCCchhHHhhhcchhHHHHHHhcccCC Confidence 0 01111111122567777777533 333488888888877644443322211 No 181 >protein:vir:3787 Length: 231 # NCBI annotation: orf22 # Family: family:all:743 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536827;genbank:gi:17981836;genbank:GeneID:929215 Probab=40.02 E-value=0.47 Score=22.38 Aligned_cols=141 Identities=15% Similarity=0.215 Sum_probs=52.3 Q ss_pred ccc--eeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC-----CcCCCcc-cc----cce-------- Q lcl|NC_019767. 2 IET--SLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA-----PVRTGKL-KK----NVV-------- 61 (149) Q Consensus 2 m~~--~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~a-----P~~~g~l-~~----~i~-------- 61 (149) |.+ +++.+.+..|...|..|. +.-+.=+.-|..-|..+...+++++ |..+..- ++ .+. 
T Consensus 1 m~~~~~~n~~dl~~l~~~L~ll~--L~p~kRrrLl~~iak~lr~~~~~rI~~Q~~PDGs~w~pRK~~~~k~k~~rm~~kL 78 (231) T protein:vir:37 1 MQIRLGLKQEDLDAFVRDLRTLN--LTGKQKKKILTWTLGAIKRKSQKNIREQHSPDGTAWEKRKPVDGEIKNKRLLKKV 78 (231) T ss_pred CCccCCcCHHHHHHHHHHHHHhc--CCHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcCchhcccccchhhHHHHHHh Confidence 664 555568888888887652 2222223445556667777776653 4432211 10 000 Q ss_pred -----ecc---------cccccCCcc--------ccceeeecccc-cccccc---------cc----cccC--------- Q lcl|NC_019767. 62 -----VVT---------QKSRRRGEI--------SSGVHIRGVNP-RTGNSD---------NT----MKAN--------- 96 (149) Q Consensus 62 -----~~~---------~~~~~~g~~--------~~~~~~~~~~~-~~~~~~---------~~----~~~~--------- 96 (149) ... ...+..+.+ .+.+....... ...... .. +... T Consensus 79 ~~~~~~~~~~~~~~~~~~~~g~~~~IA~vHQ~G~~~rv~~~~~~~~~~~~~~~pATr~QAk~Lr~lGy~v~~~k~k~~k~ 158 (231) T protein:vir:37 79 LRYASILAEERGKGRIYYKNPLTGEIAQKQQDGFTEHFRVFATDKNKNGSGNDRATIRQAQKLRSLGYRKRNGKNRQGKT 158 (231) T ss_pred HHhhccccccCCceEEeeecchHHHHHHHhhcCcccccchhhhhhccCCCCCCCCCHHHHHHHHHhcccccCCCCCCCCC Confidence 000 000000000 00000000000 000000 00 0000 Q ss_pred -CCCC--cce------------ehhcccC----------CcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_019767. 97 -NPRN--AFY------------WRFVELG----------TANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLS 148 (149) Q Consensus 97 -~~~~--~~y------------~~f~E~G----------T~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~ 148 (149) +... .|. -+.++-= +...|++|||...-++..+.+...|...+... 
.+ T Consensus 159 ~~rkps~kwI~~~ls~~qAgliIR~L~~k~~~~~~k~~W~I~~paR~FLG~~~~e~~~~l~~~l~~i~~~~----~~ 231 (231) T protein:vir:37 159 KYRLYTIKEIRERLTRTWASMEIRRLENKVNAGNGKTNWEIHVPARPFLDTREKENVDILREITLKFLSGE----YK 231 (231) T ss_pred CcCcCCHHHHHHhhhhHHHHHHHHHHhcccccccCcceeeeecCcccccCCCHHHHHHHHHHHHHHHhccc----CC Confidence 0000 000 0222310 34579999999876655444333333333322 22 No 182 >protein:vir:79034 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:1892 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110729;genbank:gi:134287346;genbank:GeneID:4955208 Probab=39.65 E-value=1 Score=20.57 Aligned_cols=128 Identities=7% Similarity=0.005 Sum_probs=55.3 Q ss_pred CccceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCC--cCC---CcccccceecccccccCCcccc Q lcl|NC_019767. 1 MIETSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAP--VRT---GKLKKNVVVVTQKSRRRGEISS 75 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP--~~~---g~l~~~i~~~~~~~~~~g~~~~ 75 (149) +.++.=-.+-|.++.. ..++..+ +++++.........+.+.+--..- ..+ |...+...+... .+.+.. T Consensus 9 ~~gl~~~~~~l~~~~~--~~~~~~~-~~~~~~~a~~l~~~vk~~tPVdTG~Lr~sw~~~~~~~~~~~~~~----g~~~~v 81 (141) T protein:vir:79 9 FREFKRVCKKMEKLTK--IDLDKFC-KDAARELAARLLGKVIRRTPVDTGFLRQGWNGVAYARSLPVYKQ----GNNYII 81 (141) T ss_pred HHHHHHHHHHHHHHhH--HHHHHHH-HHHHHHHHHHHHHHHHHhCCCcchhhcccccccccccccceeec----CCeeEE Confidence 2222111111222211 0122221 122222222222223222211111 111 111111111111 111111 Q ss_pred ceeeecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 76 GVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) . +... .. 
+..---|+|..--|..+.|++++|+.+.+..+.++.+.+.+.|.+.|++...- T Consensus 82 ~--v~n~-----~~-------YA~~VE~Ghr~~~~~gfV~G~fml~~s~~~~~~~~~~~~~~~l~~~l~~~~~~ 141 (141) T protein:vir:79 82 E--VVNP-----TE-------YASYVNFGHRTKDGKGWVKGQHFLTISEMELQSQVDKIIEKKLLILLKGVFDA 141 (141) T ss_pred E--EecC-----Cc-------chhhhhcceeecCCcceeCCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 0 0000 00 01112244555555567899999999999999999999999999999999988 No 183 >protein:vir:4514 Length: 168 # NCBI annotation: unknown # Family: family:all:2152 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599040;genbank:gi:19548998;genbank:GeneID:935228 Probab=31.16 E-value=1.5 Score=19.69 Aligned_cols=139 Identities=16% Similarity=0.309 Sum_probs=67.6 Q ss_pred Cccc--eeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhC---C-cCCCcccccceeccccc--ccCCc Q lcl|NC_019767. 1 MIET--SLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARA---P-VRTGKLKKNVVVVTQKS--RRRGE 72 (149) Q Consensus 1 Mm~~--~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~a---P-~~~g~l~~~i~~~~~~~--~~~g~ 72 (149) |-.. -|+++-.+++.=.=..|-..+. ++-+.=++.|...|..-++... | ..||.|..||.....+. .++|- T Consensus 1 m~~~~lHvdF~qp~~~~Fnr~riRraFv-~igq~hmr~ArrlV~rrgrs~pGe~P~~qTGrLa~SIgy~Vpras~~rpG~ 79 (168) T protein:vir:45 1 MTTSFLHVDFQQPAEMRFNRARVRRAFV-TIGQRHMRDARRLVMRHARSAPGENPGYQTGRLARSIGYMVPRASKHRPGF 79 (168) T ss_pred CCccceeeeeecCCceeecHHHHHHHHH-HHhHHHHHHHHHHHhhcccccCCCCCcchhhhhhhhhhhccccccCCCCce Confidence 4443 3343333333211111111111 1222224445555554443311 2 24677777776433221 11222 Q ss_pred cccceeeecccccccccccccccCCCCCcceehhcccCCcC-------------------C-CCCcchhHHHHHHHHHHH Q lcl|NC_019767. 73 ISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTAN-------------------M-PAHPFVRPAYDTREEEAA 132 (149) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~-------------------~-~a~PFl~pA~~~~~~~~~ 132 (149) + . .+.. +.+.|..+..+ ...||=.|+.||... . |-+-||.-+++..+.... 
T Consensus 80 m-v--kIaP-Nqk~G~g~r~i-----~gdfYPafL~YGVr~gakr~r~h~rga~ggsgwriaPR~Nym~~~l~~~~~wt~ 150 (168) T protein:vir:45 80 M-A--RIAP-NQRNGEGNRRI-----TGDFYPAFLFYGVRGGAKRRRSHHRGASGGSGWRLAPRNNFMVETLEKNRSWTR 150 (168) T ss_pred E-E--EecC-CCCCCCCCCcc-----ccccchhhhhhhhhcchhhhhhhhccccCCCcceeccchhhHHHHHHhhHHHHH Confidence 1 1 1111 11112111111 245888999998432 1 334699999999999999 Q ss_pred HHHHHHHHHHHHHHhcC Q lcl|NC_019767. 133 SVAIARMNQAIDEVLSK 149 (149) Q Consensus 133 ~~~~~~l~~~i~k~~~k 149 (149) ..+..+|++.|+-.-.+ T Consensus 151 ~~L~r~L~~sLrp~rr~ 167 (168) T protein:vir:45 151 YFLARELRKSLKPERRR 167 (168) T ss_pred HHHHHHHHHhcCccccc Confidence 98888888888755545 No 184 >protein:vir:3750 Length: 227 # NCBI annotation: hypothetical protein # Family: family:all:743 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043491;genbank:gi:9628626;genbank:GeneID:1261131 Probab=28.02 E-value=1.8 Score=19.20 Aligned_cols=135 Identities=17% Similarity=0.196 Sum_probs=52.3 Q ss_pred cc--ceeehhhHHHHHHHHHHh--HHHHHHHHHHHHHHHHHHHHHHHHHhhC-----CcCCCc-------------cccc Q lcl|NC_019767. 2 IE--TSLDFSGLNDIAKDLEAL--SRAENNKVLRDATRAGAEVLKEEVIARA-----PVRTGK-------------LKKN 59 (149) Q Consensus 2 m~--~~~~i~Gl~~l~~~l~~l--~~~~~~k~~~~Al~~~a~~v~~~ak~~a-----P~~~g~-------------l~~~ 59 (149) |. ++++..+++.|...|.-| +..-. +.-|..-|..+...+++++ |..+.. |.+. T Consensus 1 M~i~~~~n~~~~~~l~~~L~ll~L~p~~R----r~ll~~iak~lr~~~k~rIr~Q~~PDGs~~~pRKr~k~KM~~kL~k~ 76 (227) T protein:vir:37 1 MNIRMGIDKEDLKKFLKDLEIISLPDKKK----REILIRSLQMIKRQAVKSAANQRNPMGGSWKKRKNGTAKMLRRIAKL 76 (227) T ss_pred CcccccCCHHHHHHHHHHHHHhcCCHHHH----HHHHHHHHHHHHHHHHHHHHhhcCCCCCCCchhcchhHHHHhhhHHH Confidence 55 566668999999888655 33332 3445556666777777654 433211 1111 Q ss_pred ceecccc--------cccCCccccceeeeccccccc-----------ccc--c---------------ccccCC------ Q lcl|NC_019767. 
60 VVVVTQK--------SRRRGEISSGVHIRGVNPRTG-----------NSD--N---------------TMKANN------ 97 (149) Q Consensus 60 i~~~~~~--------~~~~g~~~~~~~~~~~~~~~~-----------~~~--~---------------~~~~~~------ 97 (149) ..+.... .+..|.+. .++..+.....+ ... . ....+. T Consensus 77 l~~~~~~~~a~v~f~~g~~~~IA-~vHq~G~~~~v~~~~~~~~~~~~~~~~paTr~QAk~Lr~lGy~v~~~k~k~~k~~~ 155 (227) T protein:vir:37 77 ANSKAEKAQGTLFYKQKRTGEIA-QEHQEGIPHLFKKTEFTGKNKGGIGADPCTLRQAKKLKDLGYTVANGKTKNGKAKR 155 (227) T ss_pred cceeecccceEEEecCcchHHHH-HHhhcCcccccchhhhhhhhcCCccccCCCHHHHHHHHHhcccccCCCCCCcCCcc Confidence 1111000 00000000 000000000000 000 0 000000 Q ss_pred CCC--cce------------ehhccc------------CCcCCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_019767. 98 PRN--AFY------------WRFVEL------------GTANMPAHPFVRPAYDTREEEAASVAIARMNQAIDEVLSK 149 (149) Q Consensus 98 ~~~--~~y------------~~f~E~------------GT~~~~a~PFl~pA~~~~~~~~~~~~~~~l~~~i~k~~~k 149 (149) ... .|. -+.++- =+...|++|||...-+ .+.+.|...|.++..+ T Consensus 156 rkps~kwI~~nls~~qAgliIR~L~~k~~~~~~~~k~~W~I~~PaR~FLG~~~~--------e~~~~l~r~l~~~~~~ 225 (227) T protein:vir:37 156 RKPTLSEIRSTLSRAKASLIIRKLEEKNGMNPSRHLTQWIIPTEKRSFLDTREE--------ENAKIILAEIQKYTQK 225 (227) T ss_pred ccCCHHHHHHhhhHHHHHHHHHHHhcccccccccCccceeeecCcccccCCCHH--------HHHHHHHHHHHHHhhh Confidence 000 011 122221 0334699999998543 2333344444444444 No 185 >protein:vir:1891 Length: 179 # NCBI annotation: gp10 # Family: family:all:135 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037671;genbank:gi:9634129;genbank:GeneID:1262520 Probab=21.69 E-value=2.6 Score=18.34 Aligned_cols=145 Identities=13% Similarity=0.071 Sum_probs=57.2 Q ss_pred CccceeehhhHHHHHHHHH-HhHHHHHHHHHHHHHHHHHHHHHHHH------------HhhCCc--------CCCccccc Q lcl|NC_019767. 
1 MIETSLDFSGLNDIAKDLE-ALSRAENNKVLRDATRAGAEVLKEEV------------IARAPV--------RTGKLKKN 59 (149) Q Consensus 1 Mm~~~~~i~Gl~~l~~~l~-~l~~~~~~k~~~~Al~~~a~~v~~~a------------k~~aP~--------~~g~l~~~ 59 (149) |-=---=++-|..-+..|. .+.+++.+++++.|..--...++..| +.+..+ .+|.+.-. T Consensus 5 ~~~~i~Gl~eL~~~l~~L~~~~~~k~~r~Al~~aa~~v~~~ak~~ap~~~~~~~~~~l~~~i~~~~~~~~~~~~g~~~~~ 84 (179) T protein:vir:18 5 VEVSLTGLESLLGKMEAVSEVTRNKAGRFALRKAANIIRDRARSNASRVDDPLTKEAIHKNIVASFSSKQFRRTGDLAFR 84 (179) T ss_pred EEEEeecHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccccccchhhhhhheeecccccccccccceeEe Confidence 2210001122222333332 33334444555555444444443333 222221 12222211 Q ss_pred ceecccccc----cCCccccceeeecccccccccccccccCCCCCcceehhcccCCcCCC-CCcchhHHHHHHHHHHHHH Q lcl|NC_019767. 60 VVVVTQKSR----RRGEISSGVHIRGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMP-AHPFVRPAYDTREEEAASV 134 (149) Q Consensus 60 i~~~~~~~~----~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~-a~PFl~pA~~~~~~~~~~~ 134 (149) +.+...... ................ + .......+...+++--|=..+...+| -+|=++-.-++..+.+.+. T Consensus 85 vgv~~~~~~~~~~~~~~~~~~~~~~~~~~--g--~~~~~~~~~~y~~fvEfGT~kmpa~PFlrPA~~~~~~~a~~~i~~~ 160 (179) T protein:vir:18 85 VGVMGGARQYANTKANVRKGRAGKTYKTS--G--DKGNPGGDTWYWRFLEFGTEHTSARPILRPAMNGVDNDVINVFSTE 160 (179) T ss_pred eecccccccccccccccccCccccccccc--c--cccCCCCccceeEEeccCCCCCCCCccchhhHHhhHHHHHHHHHHH Confidence 111110000 0000011111111000 0 00000000011111111112333444 4678888899999999999 Q ss_pred HHHHHHHHHHHHhcC Q lcl|NC_019767. 
135 AIARMNQAIDEVLSK 149 (149) Q Consensus 135 ~~~~l~~~i~k~~~k 149 (149) +.++|.++|++..+| T Consensus 161 l~~~i~k~lk~~~~~ 175 (179) T protein:vir:18 161 MGKAIDRAIRLAMKK 175 (179) T ss_pred HHHHHHHHHHhhccc Confidence 999999999999999 No 186 >protein:vir:78163 Length: 92 # NCBI annotation: hypothetical protein # Family: family:all:29889 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294805;genbank:gi:149882826;genbank:GeneID:5309191 Probab=21.52 E-value=0.93 Score=20.77 Aligned_cols=91 Identities=20% Similarity=0.276 Sum_probs=40.1 Q ss_pred Ccc-ceeehhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcCCCcccccceecccccccCCccccceee Q lcl|NC_019767. 1 MIE-TSLDFSGLNDIAKDLEALSRAENNKVLRDATRAGAEVLKEEVIARAPVRTGKLKKNVVVVTQKSRRRGEISSGVHI 79 (149) Q Consensus 1 Mm~-~~~~i~Gl~~l~~~l~~l~~~~~~k~~~~Al~~~a~~v~~~ak~~aP~~~g~l~~~i~~~~~~~~~~g~~~~~~~~ 79 (149) |.+ ++-+-.-+|.+++. . -++.-..-+|+.-...||.+||+|+|.+++...+....... ...+.+ T Consensus 1 madaftpNp~~FDqIl~s-------~---~VrALt~gaAe~aLa~AKAsAPVDTGAYRDGL~iE~~q~~~----RtT~MV 66 (92) T protein:vir:78 1 MADAFTPNPTWFDQIMRT-------P---KVRALVDGVAEETLADAKASAPVDTGAYRDGLHIEHRQGRS----RETAMV 66 (92) T ss_pred CCCccCCChhHHHHhhcc-------c---chhhhhhhhhhhhhhhhcccCcccccccccccchhhhhccc----cceeEE Confidence 665 45554545554421 0 11222334556667789999999999999887655432221 111222 Q ss_pred ecccccccccccccccCCCCCcceehhcccCCcCCCCCcchhHHHHHHHH Q lcl|NC_019767. 80 RGVNPRTGNSDNTMKANNPRNAFYWRFVELGTANMPAHPFVRPAYDTREE 129 (149) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~y~~f~E~GT~~~~a~PFl~pA~~~~~~ 129 (149) .+...+. -++|.-|-. |..|+...+. T Consensus 67 VG~D~KT------------------lLvESrTGN------Lakalk~~rs 92 (92) T protein:vir:78 67 VGSDEKT------------------LLIESRTGN------LARSVKRRRS 92 (92) T ss_pred eecCcce------------------eeeecccch------HHHHHhhhcC Confidence 2211110 112211111 1122221111 Done!